The rotation problem and Hamilton's discovery of quaternions (II) | Famous Math Problems 13b

Channel: Insights into Mathematics Published: 2013-05-01 8,150 words Source: manual_caption

Advanced Mathematics & Geometric Physics

Transcript

[Music] Hello everyone, I'm Norman Wildberger. This is our second lecture in this presentation of Hamilton's discovery of quaternions, essentially solving the rotation problem—the problem of how to effectively describe rotations of three-dimensional space.

In our first video on this topic, we looked at the planar situation—rotations in the plane—and the close connection with the algebra of complex numbers. Today, we move up to three-dimensional space and consider rotations about a fixed point in three-dimensional space.

So, we're going to talk about rotations, we're going to get to Euler's theorem on products of rotations, and we're also going to introduce dot products and cross products. Now, there are sort of two different ways we can approach this topic, and they're complementary, but it's interesting to keep them separate.

One is a purely geometrical approach where we think about the physics, the geometry of rotations about a fixed point, often by working with a model of a sphere which is centered at the point in question. This way, rotation of three-dimensional space can be represented by movements of

this sphere, and typically, we have an axis, say something like this—a line that passes through the center of the sphere and two opposite or antipodal points. One of the main problems in the subject is to understand how we can compose or multiply such rotations. Alright, so

we have this geometrical side of things, but we also have an analytic side where we introduce coordinates—typically x, y, and z—and where we use the linear algebra of matrices, vectors, and then associated dot products and other algebraic constructions to help us with the geometry. So, although these are very closely linked, one should be aware that there are subtle differences and perhaps even a little bit of tension between the two of them.

There are many advantages to working with the analytic approach, but there is also a possibility of a little bit of confusion because when we introduce coordinates, we are placing distinguished roles on three particular directions in our space which are not really inherent in the geometrical picture. And this introduction of these three distinguished lines has sort of number-theoretical issues that

can play a role in connecting these two topics. Alright, so let's start by thinking about rotations from a geometrical, intuitive, physical point of view. Alright, so we're going to be considering these rotations of three-dimensional space,

and we're going to consider these as rigid motions of the entire space which fix a distinguished fixed point, usually called O, the origin. And to illustrate this kind of motion, we're going to often employ the sphere whose center is the fixed point O. So instead of

worrying about how the entire space moves, we're just going to worry about how the points on this sphere move, because that's easier to represent. Now, the distinguishing property of a rotation is that it preserves the separation of points on the sphere. So, for example, if we have two points

separated by that amount, then if we rotate, the two points that we're going to get are still going to be separated by the same amount, and that's independent of how we actually decide to measure the separation. So in fact, there are lots of different possible ways of talking about how far two points are separated on the surface of the sphere, but no matter

which way you choose, that's going to be one of the properties of rotation—that it preserves separation. Another important property is that it preserves orientation. So that means that, for example, if you had a little circle on the sphere, and say you had a positive orientation of that

circle, maybe a counterclockwise orientation, then if you're going to rotate the sphere, the image is still going to have that same counterclockwise rotation. That's as opposed to a reflection—a reflection in the plane going through the origin, which has the property that it reverses orientation.

So reflections are also an important part of the story, and we'll come to them a little bit later. Alright, so a rotation is a particular kind of transformation which has a fixed axis. So there are two opposite points, or antipodal points. A stumbling block for

understanding many aspects of three-dimensional geometry, and I know it's very commonly assumed that rotations and angles are intimately linked, but in fact, theoretically, it's not so, and there are many advantages in avoiding angles and their essentially transcendental nature. So to make that point another way, it's important to realize that a rotation

is really a transformation that starts in a certain position and ends in a certain position, and the actual motion from the starting position to the final position is not part of the rotation. So, for example, let's put the axis up and down like this, and let's consider the rotation that—in fact, maybe I'll position things so that this blue line here

is the equator, and there's an axis something like that. Alright, so a rotation is something that takes the sphere in this position and sends it to this position. Alright, I've deliberately moved the sphere so you didn't see how I went from this position to this position.

So perhaps I went like this, or perhaps I went like this, something like that. But the point is that the rotation itself is really determined by the initial position and the final position, and not the actual motion from the initial to the final position.

So on our diagrams, we're going to have pictures like this: our sphere represented by this little picture here, and here maybe is the axis of the rotation with fixed points like the North Pole and the South Pole, say A and A-bar, and we have some rotation which sends, say, that point B to that point B-prime, that point C to that point C-prime, that point D on the equator to that point D-prime.

But we'll try to avoid drawing arrows necessarily indicating that we have to actually move in this direction as opposed to going the other way around. The key point is that this point is sent to this point, and how it actually gets there doesn't concern us.

It's useful to mention that angles come from astronomy, and in astronomy, there is a very good reason why angles are important, because the Earth is rotating uniformly on its axis once a day. So everything that we observe in terms of the night sky is moving around us at a fixed rate because of this uniform motion.

So in astronomy, angles are completely unavoidable and play a very important role. Surprisingly though, when we're studying the geometry of three-dimensional space, and in particular rotations and reflections and other motions, we don't need to use angles, and often using angles makes life more

difficult for us. That's going to be an important idea that will even inform our discussion of quaternions in our next lecture. So one of the main problems in the subject is to describe what happens when we combine rotations.

So in other words, we have one rotation about one axis, and then perhaps followed by another rotation about another axis, and we want to consider the composition of those two transformations—one followed by the second. That's a very important practical and theoretical problem.

Now there's sort of a special case of this. The special case is when the axes of the subsequent rotations are pretty close together. So, for example, suppose that one axis was like this—we

had some rotation—and then another axis just a little bit moved from the first one, and then another rotation about that one, and maybe a third axis also, again, close to the original ones. So in this case, we have perhaps three or more axes which are all more or less in the same region of space.

In that case, the various rotations that we have can be viewed just on a part of the sphere's surface, a little map of the region around the three points, say the North Poles of the axes, and then the rotations that we're doing here are approximated by planar rotations in this two-dimensional planar approximation of the spherical surface near those points.

So what we're saying here is that if we want to understand the composition of rotations on a sphere or in three-dimensional space, probably we should understand how planar rotations work when we have centers of rotation that are not necessarily the same point. So we have a rotation about this point, and then maybe

a rotation about this point, and a rotation about this one. How do we effectively work with products of rotations in the plane about different points? Okay, so there's a nice way of dealing with that, and it's to recall that a planar rotation can be specified by a pair of reflections.

We saw this in our last video, that a rotation corresponding to multiplication by a complex number of unit quadrants can be thought of in terms of a product of reflections of two lines through the origin. So, for example, here—this is all in the plane now—there's, say, a center O, and here are two lines, let's say L1 here and L2 here, and let's consider

the reflections in those two lines. Let's call them Sigma 1, the reflection in L1, and Sigma 2, the reflection in L2, and let's have a look at the composition Sigma 1 followed by Sigma 2. In other words, that sends a point X to Sigma 1 of X, and then applies Sigma 2 to that.

What happens to various points? Well, for example, this point A—when we reflect it about L1, we're going to get first this point right here, and then when we reflect that about L2, we're going to get this point over here. So the net effect of this composition is to send A to A-prime over

here. Suppose we choose some different points, say U, this point B—when we reflect it in L1, we're going to get some point over here, for example, and then when we reflect that in L2, we're going to get this point, point B-prime, over here. And you can try some other points

and convince yourself that effectively the composition of these two reflections is a rotation. It's a rotation about the point O, essentially of twice this angle, where you can think of the angle as not a measurement but actually of that geometrical segment. Imagine

putting that wedge together with another wedge beside it, and you're getting the size of the rotation formed by the composition of those two reflections. So that's a very interesting and important tool in thinking about rotations—that we can break them up as products of reflections, which are, in some sense, more simple and more fundamental kinds of transformations.

Let's start now then by considering the product of two rotations in the plane. We have a first rotation about the point A1, called Row 1—that's rotation number one, rotation around A1—and another rotation, Row 2, around the point A2, and we're interested in describing what happens when we perform Row 1 and

then Row 2. And let's suppose that we understand Row 1 in terms of a pair of reflections—the reflection in the line L1, and then the reflection in the line L2. We can symbolize that by this little arrow here.

So this little arrow from the line L1 to the line L2 somehow represents this rotation—the combined effect of reflecting in L1 and then following by L2. That's a rotation, effectively, of twice that angle about A1. Similarly, this reflection Row 2—we're going

to think about that as being the composition of reflection in the line M1 and the line M2, represented by this arrow, this red arrow. Alright, so how are we going to compose this rotation followed by this rotation? Well, the key is to remember that this pair of lines that encodes the rotation is not unique.

If we rotate both pairs of lines about A1, say we take the initial one here and we rotate both of them to get two lines like this, then the rotation represented by this pair of lines is the same as the rotation defined by this pair of lines because this separation between these two lines here is the same as the separation between these two lines here.

So the rotation of essentially twice that angle is going to be the same as this rotation of twice the angle there. Alright, that's the key kind of flexibility that we have in this kind of description of a rotation as a product of reflections—that we're allowed to, in fact, rotate the pair of reflections

that we're using to define the rotation. So how are we going to use that to help us compose these two? Well, what we're going to do is we're going to rotate this pair, and we're going to rotate this pair in a very special way. We're going to rotate this pair so that the second

line, L2, happens to go through not just A1 but also A2. Alright, so we're going to take this pair here, and we're going to rotate it around—so think about this L2, we're going to move that around so that it's now facing in that direction. There it is there.

Then the L1 line will have moved down here, so it will be in that position, so we can call this L2-prime, and this one L1-prime. These are the same lines here after we've rotated them. And similarly with

this representation of Row 2, we're going to move that one around so that the first line, M1, also has this property that it's this line right here—that it joins this center to this center. So we're going to take M1, and we're going to move it around so that it's in this position, giving us M1-prime, and then M2 correspondingly will be in this position.

So this pair of lines will have moved so that they're now in that position there. So we've arranged it so that the new L2 and the new M1, L2-prime, and M1-prime are the same. Why is that good for us? Well, because it's

going to induce a cancellation of reflections, for the simple reason that if you have a reflection in a line and you do it twice, then the effect cancels out—you reflect in a line, and then you reflect in the same line—that's the same as doing nothing. So, in other words, algebraically, the rotation R1 followed by the rotation R2 is going to be the composition of the

reflection in L1-prime followed by the reflection in L2-prime—that's what Row 1 is. And then Row 2 is the reflection in M1-prime followed by reflection in M2-prime. So with this composition business, we sort of read from right to left, so the composition is a reflection first in this

one, and then in this one, and then in this one, and then in this one. And because these two lines are the same, we've arranged that it means these two reflections in here cancel, and the whole thing just simplifies to the reflection in L1-prime followed by the reflection in M2-prime.

L1-prime was this one, and M2-prime was this one. So the product of the two reflections is over here—same picture, just over here is the reflection in this line, followed by the reflection in this line. These two lines meet at a new point—let's call it A3, which is

this point here in this diagram—and the original points A1 and A2 now don't really play any more of an important role. The two reflections in L1-prime and M2-prime now generate a rotation about A3, and we know exactly what that rotation is—it's the rotation of twice the angle between these two lines, L1-prime and M2-prime.

So this is a calculus or way of computing with planar rotations purely geometrically, in terms of pairs of lines that encode the rotations via reflections. You move the pairs of lines around so that the middle two agree, and then they disappear, and then just the outer two that determine the new rotation.

So we've described what happens when you compose rotation Row 1 with rotation Row 2—the first one was a rotation around A1, the second one was a rotation around A2, and now we find that the product is Row 3, the rotation around A3. Now it's proper for me to mention that

sometimes something a little bit different can happen. So these two lines, L1-prime and M2-prime, that we obtained here through A1 and A2, in this case, they meet at this point A3. It's also possible for those two lines to be parallel.

Now if those two lines had been parallel, then there would be no point A3 where they meet, and in fact, in that case, we would not get a rotation as the product. The product of the reflection in a line with the reflection in a parallel line is rather a translation in their common normal direction. Please make sure that you convince yourself of

that—in fact, it's a translation by essentially twice the separation between the two parallel lines. In some sense, the center of rotation is at infinity. Alright, so that can also happen, so we don't necessarily get a rotation—it could be that the product of two rotations is a translation in

the planar setting. All this is a very geometrical approach—it's a kind of ruler and compass approach that if you actually had two rotations, you could make diagrams and geometrically with constructions find out where the center of the product of two rotations is, and what the angle or the amount of rotation about that new point would be by computing these two new lines.

This is essentially a Euclidean geometry construction of how to multiply rotations in the plane. This is going to motivate and give us some direction for understanding the more complicated problem of how to deal with three-dimensional rotations.

So let's apply what we've learned in the planar case to try to understand rotations of three-dimensional space—rotations of the sphere. So the key point is that a rotation of three-dimensional space can also be described as a product of two reflections about two planes through the origin.

So let me illustrate that here with this example sphere—I've taken an equator, this one here now, and I've taken the North and South Poles—there they are. Okay, and we're considering a rotation around that axis, and in particular, I'm going to consider a rotation which is related to two planes that go through this axis, and one of the planes is

cutting the sphere along this orange great circle, which goes through our North and South Poles, and the other one cutting the sphere along this blue great circle. And I want to consider what happens if we reflect first in the orange plane, followed by reflection in the blue plane. Let's

have a look first at reflection in this orange plane. So a reflection in a plane, like a mirror image on the surface of the sphere—this fellow here, who's sort of currently facing in that direction—if we reflect him in the orange line, we're going to get that fellow there—that's his reflection.

Notice that the orientation has been changed—if he was left-handed over here, he would be right-handed on this side. Reflecting in a mirror changes the orientation. This point here, this little circle, for example, would be reflected to this circle, and we could compute the image of any point—of course, points lying on the orange great circle themselves are fixed under that reflection.

Alright, now let's follow that by reflection in this second plane, the plane of the blue line that also goes through the center of the sphere. We're going to reflect in that blue line—then this point over here, this little figure, will get reflected to this one over here. Right, so we're going from there to there—that's the

reflection in the blue line. And similarly, this circle here got first reflected here, and then actually its image was here, not the one over here—that's a mistake. So the reflection in the blue one here is there, and then the net effect is to send this little circle rotated to here.

So overall, the combined effect of reflection in the orange plane followed by reflection in the blue plane is to rotate the sphere—rotate the sphere by essentially an angle which is twice this angle between these two lines. When I say angle here, I probably should use a different word, but twice

the separation contained by these two lines. Okay, so a vector like this is not going to be sent to there, but it's going to be sent to there—the North and South Poles remain fixed. So this is a way of representing a rotation of the three-dimensional space as a product of two reflections in planes.

And just as in the two-dimensional situation, these two planes are not really canonical—they can be moved, they can themselves be rotated, so if we rotated both of them around this axis, we'd get a new set of pairs of planes which would generate the same rotation that we started with. So in terms of our picture, here's

a two-dimensional representation of that. So here's our sphere, here's a North Pole and a South Pole, and there's an axis of rotation through those two poles, and we've shown the equator of that axis as well. So the plane of that equator is

perpendicular to the axis that we're rotating around. Okay, and we're going to suppose that our rotation that we're considering is generated by two reflections about planes, and the first plane, say, is given by this great circle here. Okay, let's call that C1—that's a great circle on the

sphere. It means that it's the place where the sphere meets a plane that passes through the center of the sphere—not just any old plane, but a plane that goes right through the middle of the sphere—that cuts out a great circle. These are the biggest circles on

the sphere. And we have a second such plane, represented by this great circle here, going behind the sphere and coming out like that, and that's C2. So those are our two great circles. So what we're doing is we're considering the rotation that's the product of reflection Sigma

1 in the first circle, C1, followed by Sigma 2, the reflection in the circle, C2—that's our rotation, we're going to call it Row. And let's make sure that we see what it's doing. So, for example, this point A—when we reflect it in the first great circle, we maybe get

a point like this, and then when we reflect that in the second great circle, we get a point like that. So the cumulative effect is to send this point A to this point A-prime. But again, there's no requirement that the point is actually sent along this curve—it could also, you can also think

of it as having walked around the backside of the sphere, or in fact traversed any old path to get from here to here. The crucial thing is only that we're starting here and ending here. What about, say, the point B over here on the side of the sphere facing us? If we reflect first in C1,

we're going to get this point over here, and then when we reflect that in C2, we're going to get a point B-prime over there. So the cumulative effect is to send this B to this B-prime, and that's a rotation of the sphere, again by essentially twice the separation of those two lines. So

how are we going to describe geometrically that rotation? Well, one way would be to describe it by specifying the two great circles on the sphere, but there's a little bit more efficient way of doing that, and that's to use this equator, which is another great circle perpendicular to both of the great circles that we've just been considering. So the first circle, C1, has

itself a perpendicular axis direction. So if we think of the plane, C1, it has a perpendicular axis which meets the equator at some point—let's call it P1. Okay, so this point, P1, is the point on the equator which is like the North Pole or South Pole if we think of this C1 as being a new equator.

Similarly, if we think of C2 as being some kind of new equator, then it has a North and South Pole, and these are going to be, say, this point and this point over here. That line would be perpendicular to the plane that we used to cut out C2, and those two points are going to

also lie on the original equator, which is the equator corresponding to the North and South Poles where the two planes C1 and C2 meet. So how are we going to represent this combination of reflections? We're going to do something different than we did in the planar case.

We're not going to use the two lines or the two great circles passing through our North Pole—we're going to rather use these two points, P1 and P2, on the equator of the axis of rotation. So there's P1, there's P2. You might

say, well, which P1 are we going to use? There's actually two of them. And which P2 are we going to use? There's also two of them. That's right, so we're going to choose one of these P1s—it doesn't really matter—and we're going to choose the P2 which is closest to the P1.

One of them will be typically a little bit closer on the equator than the other, and so the shorter arc, we're going to use that as the arc to represent this rotation, and we're going to actually draw a little arc in red on this equator, joining P1 to P2, and that little arc will contain the information of this rotation.

In fact, what the rotation ends up being, it ends up being a rotation by essentially twice that arc, with this as its equator. Alright, so there's quite a lot in this diagram. It's much easier to visualize if you get yourself a ball—it could be

a soccer ball, basketball, a globe, beach ball—get some kind of ball and try to essentially recreate this picture and make sure you understand where all the various points are. Okay, it's a bit complicated, but a very rich picture, and it's sort of an interesting and non-intuitive idea that we're going to represent this rotation about this axis

here by these two points. So this little vector, joining these two points on the equator of that rotation, and we have some flexibility about doing that because we can rotate these two points along the equator—we can put them wherever we want along the equator as long as they're separated by the same amount, they're going to represent the same rotation.

So to come back to our original picture here, I guess we had this story like this where we were rotating about the North and South Pole here—we're rotating that to send this vector over to some place like that, to go from here to over there. And so we're thinking of that as being the composition of the reflection in this

plane followed by the reflection in this plane. So how would we represent that? Well, we could take our first plane, which is the orange one, and find a normal to it, so the North Pole essentially of that great circle—maybe that one—I'll make it big—that'd be the first one there. Alright, and then we're going to look at the second plane, which is the

blue plane—it also has a North Pole and a South Pole—maybe that one there, I'll choose. Okay, that's the North Pole for the blue line, and that's the North Pole for the original orange line, and then the representation of the rotation that we're considering is this point and this point, followed by being connected with a vector. Okay, so that vector joining that point

to that point on this equator represents the following rotation—it represents the rotation by essentially twice this amount, so something like that. Right, and it doesn't have to be this vector—we could move this vector over, so I could move it over so it's, say, between here and roughly an equal amount, say over there.

Right, so this vector on the sphere would represent the same rotation as this vector—they would both represent the rotation by essentially twice that separation, with this as the axis. That's a geometrical way of representing rotations of three-dimensional space, by spherical vectors on the surface of the

sphere—vectors which lie on great circles, essentially on the equatorial great circles of the rotations we're considering. So now we can use these rotation vectors for rotations to actually describe the composition or the algebraic structure of rotations of three-dimensional space.

And we can do so in a very geometric and direct way, which is completely analogous to what we were doing in the planar situation, even though the object that we are using are a little bit different. Alright, so here's how it works: we have two rotations of our sphere, or space, and they're represented by these two rotation vectors, Row 1 and Row

2. Alright, so Row 1 is the vector which is lying on this equatorial great circle—that's an equatorial circle for the rotation Row 1, which actually rotates by twice that separation. So it would send this point to that point there—a rotation with this as the equatorial direction, and the axis would be somewhere like this.

And then we have a second rotation represented by this little vector, which is also lying on a great circle between this point and this point, and that represents a rotation with an axis in that direction somewhere, essentially through twice the separation here. So it would send this point to a point around the other side.

Alright, so these are the objects that represent the two rotations, and to combine these two rotations, we're going to rotate both pairs so that the final point of Row 1, this one here, and the initial point of Row 2, this one here, match up. Alright, so we're able to do that because

this Row 1 vector represents the same rotation as the one that we would get by moving that, sliding it down so it would be in that position there. Alright, so I'm going to slide this vector down until it's in that position there—it still represents exactly the same rotation as before, it's just that it's now in a different position along its equator.

And I'm going to slide this vector, Row 2, along its equator so that its initial point is at the same place where the final point of Row 1 is, where the two equators meet. And why does this help us? Well, it's because the rotation Row 1 that we're talking about can be thought of in this way: the rotation is a product

of two reflections, and the reflections are the reflection in the plane which is perpendicular to this point. So this point here has an axis coming out from the center of the sphere, and there's a perpendicular plane to that direction from the center of the sphere, which determines a great circle—that's the first reflection that's involved in Row 1.

And the second reflection is the one in the plane perpendicular to this point. So the combination of Row 1 and Row 2 is a reflection in the plane perpendicular to this point, followed by a reflection in the plane perpendicular to this point, and then Row 2—reflection in the plane perpendicular to

this point, followed by reflection in the plane perpendicular to this point. And just as in the planar case, those two intermediate reflections cancel because they're the same reflections—we've cooked it up so that they are the same. So the product reduces to just a product of two

reflections in planes perpendicular to points, namely to this point and to this point. So the composition of these two rotations ends up being the rotation determined by reflecting in this point—actually, reflecting in the plane perpendicular to this point, followed by reflecting in the plane perpendicular to that point.

And that's given by the vector—the vector on the great circle joining these two points. So that's the algebra—you take a spherical vector like this, you move it down, and you move this one across so that they're lined up just like an addition of vectors in the plane.

And we're kind of adding these vectors just as we do vectors in the plane, except it's all happening on the surface of the sphere. So we get this vector plus this vector equals this vector here, which I haven't shown here, but here it is.

And that corresponds to this rotation times this—this rotation is equal to this rotation. So we see that geometrically, rotations can be thought of in a very, very simple way if we are happy to use these spherical vectors lying on the sphere, and if we're happy to agree that rotating a spherical vector along its own direction doesn't essentially change

it. Alright, that's an interesting kind of algebra. So we end up expressing the product of rotation Row 1, Row 2 as the rotation Row And notice that special kind of thing that happened in the plane where we ended up getting two lines which don't meet—it's

not going to happen here because on the sphere, any two great circles are guaranteed to have an intersection point. Get a sphere, play around, convince yourself this works. So great, we've basically given a demonstration

of this important theorem going back to Euler in 1776, which is that the product of two rotations is a rotation. In other words, if you take your sphere and you take any axis and you perform one rotation, and then you take some other axis at random and you perform another rotation, then that composite transformation—going from some initial position to some other initial

position—is actually a rotation. Somewhere, there's an axis so that you can essentially get back to where you started just by rotating around that axis. It's not entirely obvious that that's the case, and Euler realized that it had to be proven, and he proved it.

And so basically, we've given a geometrical demonstration of that fact using this idea of spherical addition of vectors on the surface of the sphere. Okay, so that's all very geometrical with pictures and models and so on, which is great.

But there's also a need for having an analytic approach where we use coordinates. And this is a more familiar approach, I suppose, because people are very familiar with linear algebra. If you've done any mathematics at the university, you've probably

taken a linear algebra course. If you haven't, and you'd like to learn some linear algebra, may I recommend my series Wild Lin Alg, which lays out linear algebra in a more geometrical way than most courses, and it's ongoing. So in fact, the second half of this course, which is going to

be starting soonish, will be talking a lot about the kinds of things that we're talking about today and will give a lot more detail and derivations. So if you're interested in this kind of thing, you should check out this series—especially the second half. Of course, you should have already watched the first half before you watch the second half.

Okay, so we're going to introduce some basic things from three-dimensional space, in particular, first coordinates. So we have an X direction, a Y direction, and a Z direction—these are three mutually perpendicular lines.

You can think of the Y direction being in the plane of the board, the Z direction is being in the plane of the board, and the X direction coming straight out—it's just that you're sort of over there somewhere, and you're looking at it in that direction, so that the X direction appears something like this to you. Then we use these coordinates

to specify the position of any point in space, say the point P with coordinates A, B, C—is that point whose X coordinate is A, whose Y coordinate is B, whose Z coordinate is C. And it's best thought of in terms of a box whose sides are parallel to these axes. And here's

the box here, kind of going A in the X direction, B in the Y direction, C in the Z direction—that's the point A, B, C. Alright, so what we want to do is we want to apply some metrical considerations to this three-dimensional space. So we want to take

planar notions that are metrical and extend them to three-dimensional space. So the fundamental planar notion is the notion of quadrants. Okay, so if you have, say, consider the vector or the segment from zero to this point R here, which just in the plane has coordinate A and B, then

its quadrants, the quadrants between O and R, is A squared plus B squared. Notice that we're using this quadrant and not its square root—the square root would usually be called the distance or the length, and we want to avoid that because taking a square root is a transcendental operation. It's

usually called algebraic, but it's not algebraic at all—it's actually a transcendental infinite process that never stops. You can't actually complete a calculation of a square root—that's just the reality of things. We want to avoid that—that's too complicated.

Quadrants, A squared plus B squared—very nice and simple. How do we extend that to three dimensions? Well, once we have the quadrants here, there's another sort of right triangle going up C.

So we have a quadrant of A squared plus B squared here, a quadrant of C squared here, because the coordinate is C, so the quadrant is C squared. And so Pythagoras's Theorem applied to this triangle here tells us that the quadrants between O and P should be the quadrants between O and R, which is A squared plus B squared, plus the quadrants between R and P, which is C squared, for

a combination of A squared plus B squared plus C squared. So we are extending the definition of quadrants to three dimensions. We can define the quadrants of a vector A, B, C to be A squared plus B squared plus C squared.

So motivated by Pythagoras's Theorem, we can define this to be the quadrants of a vector, and that's the fundamental metrical notion in three-dimensional space. The next important notion is perpendicularity. How do we tell or how do we define what it means for two directions to be perpendicular? Alright,

we'll work in the language of vectors. So suppose we have one vector, V1, and so the vector is now actually something with a start and an end, so it's a directed line segment. I denote it with round brackets because it's subtly different from the point with square

brackets. So when you see round brackets, I'm usually talking about a vector. So we have vector V1, X1, Y1, Z1, and another vector, V2, X2, Y2, Z2, and we want to ask, well, when are these perpendicular vectors like these ones? Well, if we look at this triangle here in

vector form, this one's V1, and this one's V2, then that vector will be V2 minus V1. And if we have perpendicularity here, then we expect that Pythagoras's Theorem should apply to this triangle, which means that the quadrants of V1 plus the quadrants of V2 ought to be the quadrants of V1 minus V2, or V2 minus V1—it's the same. So what does this equation relating the three

quadrants amount to in the coordinates? Well, quadrants of V1, we've defined it to be this—quadrants of V2, we've defined it to be this—and the quadrants of the difference? Well, we have to take the vector, which is the difference of the coordinates, and we have to take the sum of the squares of those coefficients. So this equation represents perpendicularity in space

if we want Pythagoras's Theorem to be true. And we want Pythagoras's Theorem to be true in three dimensions, just as it is in two dimensions. So fortunately, this equation appears rather long here, but there's a lot of cancellation.

Once you expand the right-hand side, because the X1 squared will cancel with that, and the X2 squared will cancel with that, the only thing that will be left on this side after you divide by two and maybe take out a minus sign is the expression X1 times X2 plus Y1 times Y2 plus Z1 times Z2, and that's got to be equal to zero because that's all that's going to be left on the left-hand side.

So I'm showing you now that if you want Pythagoras's Theorem to be true, then you want this relation to hold between the coefficients of V1 and the coefficients of V2. This is a very important expression in the coefficients of V1 and V2 for exactly this reason.

Okay, this is the reason why this is important. Now, of course, in linear algebra, they just sometimes introduce this and say, well, here's the dot product—we're just going to define the dot product without telling you why you want to know about the dot product.

Okay, I'm telling you why this is important—it's important because it's going to guarantee that Pythagoras works in this situation. So here's the definition: V1 dot V2 is X1 times X2 plus Y1 times Y2 plus Z1 times Z2. And this is a number.

So given two vectors, we get a number, and that number has different names—sometimes it's called a dot product, sometimes it's called an inner product, sometimes it's referred to as a bilinear form—all valid names for it. Very important object, and basically, we've established that two vectors are perpendicular precisely when

V1 dot V2 equals zero, where perpendicularity is essentially Pythagoras's Theorem. Of course, if you don't want to introduce Pythagoras, you could say, well, we're just going to introduce this and then we'll define this to be perpendicularity, and then you could prove Pythagoras's Theorem—that's also a reasonable approach, maybe actually cleaner algebraically.

But it's important to realize the motivation for this is because of this computation linking Pythagoras and the bilinear form. Now, there's another important construction that we can do with vectors in three-dimensional space that we can't really do with vectors

in two-dimensional space or, in fact, very easily with vectors in any other dimensional space—something special about three dimensions, the world in which we live in—and that's the cross product of two vectors. Here's the definition: so we again have vector V1 and V2 with coordinates just as before, and we're going to define this particular combination, and this

time I am just going to pull it out of the blue, but I'm going to explain why in a second. Alright, so it's a new vector. So the cross product of two vectors is a vector, not a number like the dot product, a vector, and it has three components, and they are: the first component is Y1 times

Z2 minus Y2 times Z1, and then the next one is sort of obtained cyclically from this by replacing Y's with Z's, Z's with X's, and X's with Y's. Alright, so we get Z1 times X2 minus Z2 times X1, and then replace Z's with X's—X1 times Y2 minus X2 times Y1. Alright, so

for those of you who know some determinants, you can write this in terms of a determinant, and it's another vector, and it geometrically has the following properties: that if, say, there's the vector V1 and there's the vector V2, that V1 cross V2 is a vector perpendicular to both V1 and V2. And furthermore, its quadrants are intimately connected with the area of the parallelogram

formed by V1 and V2, so the bigger that area is, the wider V1 and V2 are apart, in some sense, then the bigger the cross product is going to be. So I'm going to just state a few properties linking the dot product and the cross product—these are pretty familiar formulas in a linear algebra course, and in the Wild Lin Alg course, we are going

to talk more extensively about these and other properties of the dot product and cross product. So the first is that V1 cross V2 is really perpendicular to both V1 and V2, and it's going to be an exercise for you to check that. So what you have to check is that when you take the dot product between this one and this one, you get zero, and when you take the dot

product between this one and this one, you get zero. That's what perpendicularity means in the linear algebra context. The second property to check is that if we change the order of the vectors, then the cross product changes by a sign.

The next thing is that it's bilinear, so that V cross U plus W, for two different vectors U and W, is going to be V cross U plus V cross W. And then there are some more fancy, interesting properties of the cross product—the first is Jacobi's identity, which is that V cross U cross W, now that's

going to be another vector—U cross W is a vector, and you're allowed to take the cross product with a vector and that one. So that combination plus U cross W cross V plus W cross V cross U—they all amount to zero. Maybe I should say the zero vector.

Another is that the quadrants of V cross U, so you take the quadrants of this vector, you're getting the quadrants of V times the quadrants of U minus the dot product V dot U squared. It's a very important formula, something that's called Euler's identity or closely associated with Euler's identity.

And then there's a generalization of this, which is that U cross V dot W cross Z, when we have four vectors, can be written as U dot W times V dot Z minus U dot Z times V dot W. You might like to think why this is a generalization of that.

Alright, so these are exercises for you to do to practice your algebraic skills. And many of you will have learned about dot products and cross products and connected them with angles, formulas involving sines and cosines—may I humbly suggest that you not think in that direction.

Okay, that is not really going to help your ultimate understanding of this subject. I'm sorry to say, but I must say that. Okay, the cosine and sine business is a way of introducing transcendental aspects where

they don't properly belong. Okay, so this is a purely algebraic story—that's the way God meant it to be. Okay, it's purely algebraic—high school algebra just rolls out if you do it in the right way.

Okay, of course, there's a lot more to be said about the dot product and cross product, but this is enough of an orientation to prepare us for our understanding and appreciating Hamilton's discovery of quaternions. And there's a lot of interesting twists to this story—in Hamilton's time, this notion of dot product and cross product was not around.

It's very familiar to modern students, especially if you've taken some linear algebra at the university level, or some physics because these notions are very important in physics. But in Hamilton's day, they were not around, and his discovery of quaternions, in fact, ultimately motivated people to think about the dot

product and cross product, which are intimately connected with the algebra of quaternions and ended up replacing quaternions in people's minds—but not before a long battle ensued between the proponents of quaternions and the advocates of the vector calculus that I'm describing here. So the history is quite interesting.

So in our next lecture, we're going to talk about Hamilton's discovery of quaternions—we're going up to four dimensions now. Two dimensions first, now three dimensions, next time four dimensions—this beautiful algebraic structure that sheds a lot of light on what's happening here in three dimensions.

I hope you join me for that. I'm Norman Wildberger, thanks for [Music] listening.

The rotation problem and Hamilton&#39;s discovery of quaternions (II) | Famous Math Problems 13b

Transcript

The rotation problem and Hamilton's discovery of quaternions (II) | Famous Math Problems 13b