Copyright © 2003 jsd

Electromagnetism using Geometric Algebra versus Components

1  Introduction

The task for today is to compare some more-sophisticated and less-sophisticated ways of expressing the laws of electromagnetism. In particular we compare Geometric Algebra, ordinary vectors, and vector components.

We do this in the spirit of the correspondence principle: whenever you learn a new formalism, you should check that it is consistent with what you already know.


2  Preview

As we shall see in section 5, Maxwell’s equations for the electromagnetic field can be written in the remarkably compact and elegant form:

∇ F = (1/cє0) J              (1)

where J is a vector in spacetime, representing the charge and current, and F is a bivector, representing the electromagnetic field. It is worth learning the geometric algebra (aka Clifford algebra) formalism just to see this result.

It is also interesting to apply the correspondence principle, to see how this equation reproduces results that may be more familiar in other forms. Therefore let’s take a step back and review the prosaic non-spacetime non-geometric version of Maxwell’s equation.

3  Vectors

We start by writing the Maxwell equations in terms of vector fields in three dimensions, namely:

∇ · E = ρ/є0
∇ × E = − ∂B/∂t
c² ∇ × B = ∂E/∂t + j/є0
∇ · B = 0
             (2)

These equations have several deep symmetries. We can make some of the symmetries more apparent by making a few superficial changes. The reasons for this will be explained in a moment.

∇ · E = (1/cє0) cρ
∇ × cB − (∂/∂ct) E = (1/cє0) j
∇ · cB = 0
∇ × E + (∂/∂ct) cB = 0
             (3)

These equations are invariant with respect to rotations in three dimensions. They are manifestly invariant, because they have been written in vector notation. We have not yet specified a basis for three-dimensional space, so if Alice uses a reference frame that is rotated relative to Bob’s reference frame, equation 3 not only means the same thing to both of them, it looks the same, verbatim.

These equations have an additional invariance, an invariance that is important but not manifest, namely relativistic invariance. The invariance is hidden because, among other things, the t coordinate appears explicitly. If Alice uses a reference frame that is moving relative to Bob’s reference frame, they won’t be able to agree on the value of t. For that matter, they won’t be able to agree on the values of the E-field and B-field.

Of course the non-agreement about the coordinates and the non-agreement about the fields cancel in the end, so Alice and Bob eventually agree about what the equations predict will happen physically.

Therefore equation 3 represents an intermediate level of sophistication: manifest invariance with respect to rotations, but non-manifest invariance with respect to boosts.

In passing from equation 2 to equation 3, we added factors of c in strategic places. This helps make the equations more manifestly symmetric. Specifically:

  1. In every place where t appears, we have arranged things so that ct appears, rather than t alone. The rationale is that ct has the same dimensions as x, y, and z. To say the same thing another way, in spacetime, the partner to x, y, and z is not t but rather ct.
  2. Similarly, the partner to j is not ρ but rather cρ. In spacetime, cρ represents a certain amount of charge that sits at one spatial location and flows toward the future, whereas j represents charge flowing from one spatial location to another.
  3. Last but not least, the proper partner for E is not B but rather cB. In every place where B appears, we have arranged things so the combination cB appears, rather than B alone. This is just an exercise in algebraic re-arrangement, and does not change the meaning of the equations. The rationale is that cB has the same dimensions as E, and arranging things this way makes the equations more manifestly symmetric. (There have been proposals from Gauss and others to consider cB to be “the” magnetic field, but we decline to do so, since that would depart from the conventional meaning of the terms.)
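As a quick sanity check on the last point, here is a small Python snippet (my own illustration, with an arbitrary sample field value, not taken from the text) showing that multiplying a magnetic field in tesla by c yields a quantity in volts per meter, the same unit as an E field.

```python
# Dimensional sanity check for item 3: c times a magnetic field in tesla comes
# out in volts per meter, i.e. cB is commensurable with an E field.
c = 299_792_458          # m/s, exact by definition of the meter

B = 50e-6                # tesla, roughly the strength of Earth's field (example value)
print(c * B, "V/m")      # about 1.5e4 V/m
```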


4  Components

We can construct an even-less-sophisticated expression by choosing a basis and writing out the components:

  ∇i Ei = (1/є0) ρ
  єijk ∇j cBk − (∂/∂ct) Ei = (1/cє0) ji
  ∇i cBi = 0
  єijk ∇j Ek + (∂/∂ct) cBi = 0
             (4)

See section 10.1 for information about the notation used here.

Expressing things in components like this is sometimes convenient for calculations, but it conceals rotation-invariance. If Alice uses a reference frame that is rotated relative to Bob’s, they won’t be able to agree on what xi means or what Ei means. Of course the rotation-invariance is still there; it has just become non-manifest.

5  Electromagnetism using Geometric Algebra

Geometric Algebra (also known as Clifford Algebra) has many advantages, as discussed in section 8. It turns out we can write the Maxwell equations in the easy-to-remember form

∇ F = (1/cє0) J              (5)

which contains the entire meaning of the less-sophisticated version, equation 3, as we shall demonstrate in a moment.

This expression has the advantage of being manifestly Lorentz invariant (including boosts as well as rotations). Contrast this with equation 3 in which the Lorentz invariance is not manifest.

Overall, the best approach would be to solve practical problems by direct appeal to equation 1. Some examples can be found in section 11 and reference 3.

However, that’s not the main purpose of this document. Instead, we want to derive the less-sophisticated Maxwell equations (equation 3) starting from equation 1. This can be considered a test or an application of the correspondence principle.

For starters, we need to establish the correspondence between the 3-dimensional electric current j and the corresponding four-vector current J. That is,

J = c ρ γ0 +  jk γk                             (6)

where we have chosen a reference frame in which γ0, γ1, γ2, and γ3 are the orthonormal basis vectors. In particular, γ0 is the timelike basis vector. We see that ρ has to do with continuity of flow of charge in the time direction, just as the ordinary three-dimensional current j represents flow in the spacelike directions. See reference 4 for more about the idea of conservation and continuity of flow.

We also need to know how F is related to the old-fashioned fields E and B. In any particular frame,

F = (E + i cB) γ0    
             (7)

where i is the unit pseudoscalar (equation 45). We can expand this as:

F = E γ0  −  cB γ1γ2γ3    
  = Ek γk γ0  −  cBk γk γ1γ2γ3
             (8)

where Ek and Bk are the components of the usual electric field and magnetic field as measured in our chosen frame.

This equation has quite an interesting structure. It tells us we ought to view the electromagnetic field as a bivector. In any particular frame this bivector F has two contributions: one contribution is a bivector having one edge in the timelike direction, associated with E, while the other contribution is a bivector having both edges in spacelike directions, associated with cB.

We are making heavy use of the central feature of the Clifford Algebra, namely the ability to multiply vectors. This multiplication obeys the usual associative and distributive laws, but is not in general commutative.1 In particular because our basis vectors γµ are orthogonal, each of them anticommutes with the others:

γµ γν = − γν γµ    for all µ ≠ ν              (9)

and the normalization condition2 in D=1+3 requires a minus sign in the timelike component:

γ0 γ0 = −1,     γ1 γ1 = +1,     γ2 γ2 = +1,     γ3 γ3 = +1              (10)

Now all we have to do is plug equation 7 into equation 1 and turn the crank.

There will be 12 terms involving E, because E has three components Ek and the derivative operator has four components ∇µ. Similarly there will be 12 terms involving B.

∇ F = +∇0 E1 γ1   +∇1 E1 γ0  −∇2 E1 γ0 γ1 γ2  +∇3 E1 γ0 γ3 γ1
 +∇0 E2 γ2   +∇1 E2 γ0 γ1 γ2   +∇2 E2 γ0   −∇3 E2 γ0 γ2 γ3
 +∇0 E3 γ3   −∇1 E3 γ0 γ3 γ1   +∇2 E3 γ0 γ2 γ3   +∇3 E3 γ0
   
 −∇0 cB1 γ0 γ2 γ3  −∇1 cB1 γ1γ2γ3  −∇2 cB1 γ3   +∇3 cB1 γ2
 −∇0 cB2 γ0 γ3 γ1  +∇1 cB2 γ3   −∇2 cB2 γ1 γ2 γ3  −∇3 cB2 γ1
 −∇0 cB3 γ0 γ1 γ2  −∇1 cB3 γ2   +∇2 cB3 γ1   −∇3 cB3 γ1 γ2 γ3
             (11)

Let’s discuss what this means. We start with the nine terms highlighted in blue. The six terms involving cB are the components of ∇ × cB. Similarly, the three terms involving E are the components of +∇0 E, which is the same as −(∂/∂ct) E. These terms each involve exactly one of the spacelike basis vectors (γ1, γ2, and γ3), so we are dealing with a plain old vector in D=3 space. The RHS of equation 1 has a vector that matches this, namely the D=3 current density. So the blue terms are telling us that ∇ × cB − (∂/∂ct) E = (1/cє0) j, which agrees nicely with equation 3.

Next, we consider the nine terms highlighted in red. The six terms involving E are the components of ∇ × E. Similarly, the three terms involving cB are the components of −∇0 cB, which is the same as +(∂/∂ct) cB. These nine terms are all the trivectors with a projection in the timelike direction (γ0). Since the RHS of equation 1 doesn’t have any trivector terms, we must conclude that these red terms add up to zero, that is, ∇ × E + (∂/∂ct) cB = 0, which also agrees with equation 3.

The three black terms involving E match up with the timelike piece of J and tell us that ∇ · E = (1/є0) ρ. The three black terms involving cB tell us that ∇ · cB = 0.3

Let me say a few words about how this was calculated. It really was quite mechanical, just following the formalism. Consider the term +∇2 cB3 γ1 in the last row. We started from the expression ∇ F, which has two factors, so the term in question will have two factors, ∇2γ2 and −cB3γ3γ1γ2γ3, which combine to make −∇2γ2 cB3 γ3γ1γ2γ3. All we have to do is permute the γ vectors to get this into standard form. Pull the scalars to the front and permute the first two vectors using equation 9 to get +∇2cB3 γ3γ2γ1γ2γ3. Permute again to get −∇2cB3 γ3γ1γ2γ2γ3, which reduces using equation 10 to −∇2cB3 γ3γ1γ3. Then one more permutation and one more reduction and the job is done.
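To make the crank-turning concrete, here is a minimal Python sketch (my own illustration; the function names are not from the original text) that encodes equation 9 and equation 10 as a product on basis blades and reproduces the reduction just described.

```python
# Minimal basis-blade bookkeeping for D=1+3, encoding equation 9 and
# equation 10.  A blade is a tuple of gamma indices, e.g. (0, 3, 1) stands for
# gamma0 gamma3 gamma1.  Illustrative sketch, not a full geometric-algebra library.

SQUARE = {0: -1, 1: +1, 2: +1, 3: +1}        # gamma_mu gamma_mu, equation 10

def blade_mul(a, b):
    """Return (sign, blade) for the geometric product of two basis blades."""
    idx, sign = list(a) + list(b), 1
    # Sort the indices; each swap of two distinct gammas flips the sign (equation 9).
    for _ in range(len(idx)):
        for j in range(len(idx) - 1):
            if idx[j] > idx[j + 1]:
                idx[j], idx[j + 1] = idx[j + 1], idx[j]
                sign = -sign
    # Cancel adjacent equal gammas using the metric (equation 10).
    out = []
    for k in idx:
        if out and out[-1] == k:
            out.pop(); sign *= SQUARE[k]
        else:
            out.append(k)
    return sign, tuple(out)

# The worked example from the text: gamma2 times gamma3 gamma1 gamma2 gamma3
# reduces to -gamma1, so the term -del2 cB3 gamma2 gamma3 gamma1 gamma2 gamma3
# becomes +del2 cB3 gamma1, as claimed.
assert blade_mul((2,), (3, 1, 2, 3)) == (-1, (1,))

# A couple of sanity checks on the axioms themselves:
assert blade_mul((0,), (0,)) == (-1, ())                   # gamma0^2 = -1
assert blade_mul((1,), (2,)) == (1, (1, 2))                # stays a bivector
assert blade_mul((0, 1, 2, 3), (0, 1, 2, 3)) == (-1, ())   # i^2 = -1, cf. equation 45
print("blade products agree with the hand calculation")
```

The later sketches in this document reuse the same routine to multiply full multivectors.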

The only part that required making a decision was writing γ0γ3γ1 in places where I could have written −γ0γ1γ3. This is just cosmetic; it makes the signs fall into a nice pattern so it is easier to see the correspondence with the old-fashioned cross product. We can make this seem more elegant and less arbitrary if we say the rule is to write all pseudovectors using the basis {i γµ for µ=0,1,2,3}, where i is the unit pseudoscalar (equation 45).

After the calculation was done, deciding how to color the terms took some judgment, but not much, because the terms naturally segregate as vectors and trivectors, spacelike and timelike.

6  Charge, Force, and Energy

6.1  Conservation of Charge

Preview: Our goal is to prove that charge is conserved, i.e. that ∇·J=0. We are not going to assume conservation; we are going to prove that conservation is already guaranteed as a consequence of equation 1, the Maxwell equation. We will do that by taking the divergence of both sides of the equation.

Background: We are going to need a mathematical lemma that says the divergence of the divergence of a bivector is always zero. To derive this, consider an arbitrary bivector W. We temporarily assume W is a simple blade, i.e. W = a γ5 γ6, where γ5 and γ6 stand for two of the (mutually orthogonal) basis vectors and a is a scalar field. Then the divergence is

∇ · W = ∇ · (a γ5 γ6)
  = ⟨∇ a γ5 γ6⟩1
  = ∇5 γ5 a γ5 γ6 + ∇6 γ6 a γ5 γ6
  = ∇5 a γ6 − ∇6 a γ5
             (12)

where on the second line we have used the general rule that the dot product is the low-grade piece of the full geometric product. On the last line we have temporarily assumed that γ5 and γ6 are spacelike, but we shall see that this assumption is unnecessary.

Let us now take the divergence of the divergence.

∇ · (∇ · W) = ∇ · (∇5 a γ6 − ∇6 a γ5)
  = ⟨∇ (∇5 a γ6 − ∇6 a γ5)⟩0
  = ∇6 γ6 ∇5 a γ6 − ∇5 γ5 ∇6 a γ5
  = ∇6 ∇5 a − ∇5 ∇6 a
  = 0
             (13)

On the last line we have used the fact that the various components of the gradient operator commute with each other.

We now lift the assumption that our basis vectors are spacelike. You should verify that it doesn’t really matter whether γ5 and γ6 are spacelike or timelike. Hint: a fuller calculation would give us:

∇ · (∇ · W) = ∇6 ∇5 a γ5² γ6² − ∇5 ∇6 a γ5² γ6²
  = 0
             (14)

We now lift the assumption that W is a blade. By the distributive law, if ∇·(∇·W) is zero for any grade=2 blade, it is zero for any sum of such blades, i.e. for any bivector whatsoever. We conclude in all generality:

∇ · (∇ · W) = 0    (for any bivector W)
             (15)
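For readers who like to check such lemmas by brute force, here is a short symbolic sketch (my own, using sympy; the translation into component language is mine) of the content of equation 15: contracting the symmetric second-derivative operator against an antisymmetric array of components gives zero.

```python
# Component-language check of the lemma (equation 15): for antisymmetric
# components W[mu][nu] built from arbitrary smooth functions, the double
# contraction of second derivatives vanishes identically, because the mixed
# partials are symmetric in (mu, nu) while W is antisymmetric.
import sympy as sp

coords = sp.symbols('x0 x1 x2 x3')
eta = [-1, +1, +1, +1]          # squares of the basis vectors, equation 10

W = [[0] * 4 for _ in range(4)]
for mu in range(4):
    for nu in range(mu + 1, 4):
        f = sp.Function(f'W{mu}{nu}')(*coords)
        W[mu][nu] = f
        W[nu][mu] = -f

total = 0
for mu in range(4):
    for nu in range(4):
        # the constant metric factors mirror the gamma-squared factors in equation 14
        total += eta[mu] * eta[nu] * sp.diff(W[mu][nu], coords[mu], coords[nu])

print(sp.simplify(total))       # prints 0
```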

As another lemma, for any bivector we can always write

∇W = ⟨∇W⟩1 + ⟨∇W⟩3
  = ∇·W + ∇∧W
             (16)

This allows us to pick apart ∇F as follows:

∇F = ∇·F + ∇∧F       (17a)
∇·F = (1/cє0) J       (17b)
∇∧F = something    (in all generality)      (17c)
     = 0    (assuming no monopoles)      (17d)

For the purposes of this section, all we need is equation 17b. That is the grade=1 piece of the Maxwell equation. We do not need to assume the non-existence of monopoles. We do not need to know anything about the trivector piece of the Maxwell equation. We do not need equation 17d or even equation 17c.

Using our lemma (equation 15), we can write

∇·J = cє0 ∇·(∇·F)  
  = 0
             (18)

We are of course using the four-dimensional divergence. Zero divergence expresses the continuity of world-lines in spacetime. For an explanation of why this is the right way to express the idea of conservation in terms of continuity of flow, see reference 4.

6.2  Lorentz Force Law

As remarked above, our theory of electromagnetism would be incomplete without the Lorentz force law.

The old-fashioned way of writing the Lorentz force law is:

(∂/∂t) p = q (E + (v/c) × cB)              (19)

where p is the momentum, q is the charge, and v is the ordinary 3-dimensional velocity.

As with practically any equation involving cross products, equation 19 can be improved by rewriting it using Geometric Algebra instead:

(∂/∂τ) p = q u · F              (20)

where τ is the proper time, u = dx/dτ is the 4-dimensional proper velocity,4 p = m u is the momentum, and m is the invariant mass. Here p and u are vectors in D=1+3 spacetime. This is the relativistically-correct generalization of equation 19.

Equation 20, unlike previous equations, involves a dot product. In particular, it involves the dot product of a vector with a bivector. Such things are not quite as easy to compute as the dot product between two vectors, but they are still reasonably easy to compute in terms of the geometric product. In general, the dot product is the lowest-grade part of the full geometric product, as discussed in reference 5. In the case of a vector dotted with a bivector, we have:

A·(B∧C) = ⟨A(B∧C)⟩1
  = ½ ⟨ABC − ACB⟩1
             (21)

That means we just form the geometric product and throw away everything but the grade=1 part. Another way of dealing with “vector dot bivector” is:

A·(B∧C) = (A·B)C − (A·C)B
             (22)

which can be considered a sort of “distributive law” for distributing the dot-operator over the wedge-operator. Equation 22 tells us that the product A·(B∧C) is a vector that lies in the plane spanned by B and C.

The following examples are useful for checking the validity of the foregoing equations:

γ1·(γ1∧γ1) = 0
γ1·(γ1∧γ2) = γ2
γ2·(γ1∧γ2) = −γ1
γ3·(γ1∧γ2) = 0
             (23)
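These identities are easy to check mechanically. The following sketch (my own illustration, using the same style of blade bookkeeping as the sketch in section 5) computes the dot product of a vector with a bivector as the grade-1 part of the geometric product, per equation 21, and confirms the four examples in equation 23.

```python
# Checking equation 21 via the examples in equation 23.
# Multivectors are dicts {blade-tuple: coefficient}; sketch only.

SQUARE = {0: -1, 1: +1, 2: +1, 3: +1}            # equation 10

def blade_mul(a, b):
    idx, sign = list(a) + list(b), 1
    for _ in range(len(idx)):
        for j in range(len(idx) - 1):
            if idx[j] > idx[j + 1]:
                idx[j], idx[j + 1] = idx[j + 1], idx[j]
                sign = -sign                     # equation 9
    out = []
    for k in idx:
        if out and out[-1] == k:
            out.pop(); sign *= SQUARE[k]         # equation 10
        else:
            out.append(k)
    return sign, tuple(out)

def gp(A, B):                                    # geometric product
    out = {}
    for ba, ca in A.items():
        for bb, cb in B.items():
            s, bl = blade_mul(ba, bb)
            out[bl] = out.get(bl, 0) + s * ca * cb
    return {k: v for k, v in out.items() if v}

def grade(A, g):                                 # grade projection <A>_g
    return {k: v for k, v in A.items() if len(k) == g}

def add(A, B, s=1):
    out = dict(A)
    for k, v in B.items():
        out[k] = out.get(k, 0) + s * v
    return {k: v for k, v in out.items() if v}

def wedge(u, v):                                 # u ^ v = (uv - vu)/2 for vectors
    return {k: c / 2 for k, c in add(gp(u, v), gp(v, u), -1).items()}

def dot_vec_bivec(u, W):                         # u . W = <u W>_1, equation 21
    return grade(gp(u, W), 1)

g = [{(i,): 1} for i in range(4)]                # the basis vectors gamma0..gamma3

assert dot_vec_bivec(g[1], wedge(g[1], g[1])) == {}
assert dot_vec_bivec(g[1], wedge(g[1], g[2])) == {(2,): 1}
assert dot_vec_bivec(g[2], wedge(g[1], g[2])) == {(1,): -1}
assert dot_vec_bivec(g[3], wedge(g[1], g[2])) == {}
print("equation 23 checks out")
```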

To say the same thing in geometric (rather than algebraic) terms, you can visualize the product of a vector with a bivector as follows: project the vector onto the plane of the bivector, rotate the projection by 90 degrees within that plane (in the sense given by the orientation of the bivector), and scale it by the magnitude of the bivector.

An example of the Lorentz law in action is shown in figure 1, for the case of an electromagnetic field bivector (F) that is uniform in space, oriented purely in the plane of the paper. The cyclotron orbit shown in the figure corresponds to the motion of a positive test charge with some initial velocity, free of forces other than the indicated electromagnetic field.

Figure 1: Lorentz Law

It is straightforward to understand this result. If the particle is moving in the direction of the red vector, it will experience a force in the blue direction. If the particle is moving in the blue direction, it will experience a force opposite to the red direction.

To summarize: The magnetic part of the Lorentz force law is super-easy to remember:

A field bivector in the plane of the paper
leads to a cyclotron orbit in the plane of the paper.
     

Motion perpendicular to the field bivector is unaffected by the field.

The foregoing applies if the field F is already expressed in modern terms, as a bivector. Now, in the spirit of this document, we re-examine the situation to exhibit the correspondence between the bivector idea and old-fashioned ideas such as the electric field vector and the magnetic field pseudovector.

The bivector shown in figure 1 is purely spatial, so it must correspond to a magnetic field, with no electric field in our frame of reference. The magnetic field pseudovector is perpendicular to the paper, directed out of the paper. You can check using the right-hand force rule that the cyclotron orbit shown in figure 1 is correct for a positive test charge moving in such a magnetic field.

It is amusing to check the general case, for any F that is known in terms of the old-fashioned electric field vector and magnetic field pseudovector, as in equation 7 or equation 8. As suggested by equation 20, we should take the dot product of u with both sides of our expression for F. The correspondence principle suggests we should recover the old-fashioned 3-vector version of the force law, i.e. equation 19. To carry out the dot product, we could just turn the crank ... but in fact we hardly need to do any work at all. The dot product in u · F uses a subset of the full geometric product u F, namely the plain vector (grade=1) terms. See equation 18 in reference 6. We can avoid some work, because u F has the same structure as ∇ F – it’s just the geometric product of some vector with F – so we can just re-use equation 11, replacing ∇ by u everywhere. Then we throw away all the trivector terms, and what remains is the dot product.

In the nonrelativistic limit, the timelike component of the velocity equals unity, plus negligible higher-order terms. So the blue terms in equation 11 give us the usual Lorentz equation for the spacelike components of the momentum-change: 1 E + v × B.

The black terms involving E give us a bonus: They tell us the power (i.e. the rate of work, i.e. the time-derivative of the kinetic energy), namely v · E.
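As a numerical illustration of this nonrelativistic limit, the following sketch (my own; charge, mass, and field strength are arbitrarily set to 1) integrates equation 19 for a uniform, purely magnetic field and confirms the closed cyclotron orbit of figure 1, with the speed essentially constant because the magnetic force does no work.

```python
# Numerical illustration of equation 19 in the nonrelativistic limit, for a
# uniform, purely magnetic field.  Charge, mass, and |B| are set to 1 here
# (my arbitrary choices).  One cyclotron period is then T = 2*pi.
import numpy as np

q, m = 1.0, 1.0
B = np.array([0.0, 0.0, 1.0])      # field pseudovector perpendicular to the page
v = np.array([1.0, 0.0, 0.0])      # initial velocity in the page
x = np.zeros(3)
x0 = x.copy()

dt = 1e-3
steps = int(2 * np.pi / dt)        # integrate for roughly one period
speeds = []
for _ in range(steps):
    a = (q / m) * np.cross(v, B)   # equation 19 with E = 0
    v = v + a * dt                 # simple Euler step, fine for illustration
    x = x + v * dt
    speeds.append(np.linalg.norm(v))

# The magnetic force does no work, so the speed stays nearly constant, and
# after one period the particle returns close to its starting point.
print("speed drift   :", max(speeds) - min(speeds))
print("closure error :", np.linalg.norm(x - x0))
```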

6.3  Lagrangian Density

Let us consider the gorm of the electromagnetic field, namely gorm(F) ≡ ⟨F F~⟩0. You can readily verify that:

⟨F F~⟩0 = (cB)² − E²              (24)

This is a scalar, a Lorentz-invariant scalar. It is useful in a number of ways, not least of which is the fact that −½ є0 ((cB)² − E²) is the Lagrangian density for the electromagnetic field.

6.4  Poynting Vector

Let’s continue looking for energy-related expressions involving F. Section 6.3 gives us a hint as to where to look; the Lagrangian density is not “the” energy density, but it at least has dimensions of energy density.

We know from old-fashioned electromagnetism that there should be an energy density that goes like the square of the field strength. This tells us the amount of energy per unit volume. In old-fashioned terms, the energy density is ½ є0 (E² + c² B²).

There is also a Poynting vector, which tells us the amount of energy flowing per unit surface area, per unit time. In old-fashioned terms, it is c є0 E×cB.

So, without further motivation, we use 20/20 hindsight to assert that F γ0 F will be interesting. Following the spirit of this document, let’s check that assertion by working out F γ0 F in terms of the old-fashioned E and B fields, and seeing what we get. We substitute for F using equation 7 and turn the crank:

F γ0 F = (Ek γk γ0 − cBk γk γ1γ2γ3) γ0 (Ej γj γ0 − cBj γj γ1γ2γ3)
  = Ek γk γ0 γ0 Ej γj γ0
    − Ek γk γ0 γ0 cBj γj γ1γ2γ3  −  cBk γk γ1γ2γ3 γ0 Ej γj γ0
    + cBk γk γ1γ2γ3 γ0 cBj γj γ1γ2γ3
  = − Ek γk Ej γj γ0
    + Ek γk cBj γj γ1γ2γ3  −  cBk γk γ1γ2γ3 Ej γj
    + cBk γk γ1γ2γ3 cBj γj γ1γ2γ3 γ0
  = − Ek γk Ej γj γ0
    + Ek γk cBj γj γ1γ2γ3  −  cBj γj γ1γ2γ3 Ek γk
    − cBk γk cBj γj γ0
  = − E·E γ0
    + Ek cBj (γk γj − γj γk) γ1γ2γ3
    − c² B·B γ0
  = − (E·E + c² B·B) γ0
    − 2 (E×cB)k γk
             (25)

In going from the second line to the third line, we used the fact that (γ0)² = −1. We also used the fact that γ0γk = − γkγ0 for all k ∈ {1,2,3}. On the other hand, (γ1γ2γ3)γk = +γk(γ1γ2γ3). That is, when we commute γk across the three factors in (γ1γ2γ3), we pick up only two factors of −1, not three, since for one of the factors the subscript on that factor will match the subscript k, and γk obviously commutes with itself.

In the next step, we used the fact that (γ1γ2γ3)² = −1. We also changed some dummy indices.

So we see that we should be particularly interested in the quantity

T(γ0) := −½ є0 F γ0 F
  = є0 [ (E² + c² B²)/2,  (E×cB)1,  (E×cB)2,  (E×cB)3 ]
             (26)

The spacelike part of T(γ0) is the old-fashioned three-dimensional Poynting vector (apart from a missing factor of c), while the timelike component represents the corresponding energy density.
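If you want to check equation 24 and equation 25 (and hence the components displayed in equation 26) without grinding through the algebra by hand, the following sketch (my own, reusing the dict-of-blades representation from the earlier sketches) builds F from random E and cB components per equation 8 and verifies both results numerically.

```python
# Numeric check of equation 24 and equation 25 for random E and cB components.
# Multivectors are dicts {blade-tuple: coefficient}; sketch only.
import random

SQUARE = {0: -1, 1: +1, 2: +1, 3: +1}                 # equation 10

def blade_mul(a, b):
    idx, sign = list(a) + list(b), 1
    for _ in range(len(idx)):
        for j in range(len(idx) - 1):
            if idx[j] > idx[j + 1]:
                idx[j], idx[j + 1] = idx[j + 1], idx[j]
                sign = -sign                          # equation 9
    out = []
    for k in idx:
        if out and out[-1] == k:
            out.pop(); sign *= SQUARE[k]              # equation 10
        else:
            out.append(k)
    return sign, tuple(out)

def gp(A, B):                                         # geometric product
    out = {}
    for ba, ca in A.items():
        for bb, cb in B.items():
            s, bl = blade_mul(ba, bb)
            out[bl] = out.get(bl, 0) + s * ca * cb
    return out

def smul(c, A):
    return {k: c * v for k, v in A.items()}

def madd(A, B):
    out = dict(A)
    for k, v in B.items():
        out[k] = out.get(k, 0) + v
    return out

def reverse(A):                                       # reverse each blade
    return {k: v * (-1) ** (len(k) * (len(k) - 1) // 2) for k, v in A.items()}

gamma = [{(i,): 1} for i in range(4)]
I3 = gp(gp(gamma[1], gamma[2]), gamma[3])             # gamma1 gamma2 gamma3

E  = [random.uniform(-1, 1) for _ in range(3)]
cB = [random.uniform(-1, 1) for _ in range(3)]

F = {}                                                # equation 8
for k in range(3):
    F = madd(F, smul(E[k], gp(gamma[k + 1], gamma[0])))
    F = madd(F, smul(-cB[k], gp(gamma[k + 1], I3)))

# gorm(F) = <F F~>_0 = (cB)^2 - E^2                    (equation 24)
gorm = gp(F, reverse(F)).get((), 0)
assert abs(gorm - (sum(b * b for b in cB) - sum(e * e for e in E))) < 1e-9

# F gamma0 F = -(E.E + cB.cB) gamma0 - 2 (E x cB)_k gamma_k    (equation 25)
P = gp(gp(F, gamma[0]), F)
cross = [E[1] * cB[2] - E[2] * cB[1],
         E[2] * cB[0] - E[0] * cB[2],
         E[0] * cB[1] - E[1] * cB[0]]
assert abs(P.get((0,), 0) + sum(e * e for e in E) + sum(b * b for b in cB)) < 1e-9
for k in range(3):
    assert abs(P.get((k + 1,), 0) + 2 * cross[k]) < 1e-9
print("equation 24 and equation 25 verified for random E, cB")
```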

Although this T(γ0)-vector has four components, it is not a well-behaved Lorentz-covariant four-vector. It is actually just one column of a 4×4 object, namely the stress-energy tensor, T. Writing T(γ0) in terms of E and B (as in the second line of equation 26) only makes sense in the particular frame where E and B are defined. Also, if you want to connect T(γ0) to the Poynting vector in a given frame, γ0 cannot be just any basis vector, but must be the 4-velocity of the frame itself, i.e. the unit vector in the time direction in the given frame.

More generally, the quantity

T(a) := −½ є0 F a F              (27)

represents the flow of [energy, momentum] across the hypersurface perpendicular to the vector a. A more general way of looking at this is presented in section 6.5.

6.5  Stress-Energy Tensor

The stress-energy tensor T for the electromagnetic field (in a vacuum) has the following matrix elements:

Tµν = ⟨F γµ F γν⟩             (28)

for any set of basis vectors {γµ}. Equation 26 and equation 27 can be understood as special cases of equation 28.

7  Vector-ish Potential

7.1  The Basic Idea

In four dimensions, the electromagnetic field bivector F can always be written as the exterior derivative of a quasi-vector-ish potential A. Conversely, we can integrate the electromagnetic field to find the potential difference between point P and point Q.

F = dA           
  = ∇∧A
             (29)

This implicitly defines what we mean by A. However, A is not uniquely defined, as discussed in section 7.3. Furthermore, even though A looks like it might be a four-vector, it’s not.

7.2  D=3 versus D=4

At this point you should be asking yourself, how can ∇∧(field) be nonzero in three dimensions but zero in four dimensions? How does that not violate the correspondence principle? How does that not contradict the claim made in reference 7 that Minkowski spacetime is very very similar to Euclidean space?

The answer is that when we switch from three dimensions to four, we redefine what we mean by “the” field, “the” potential, and “the” wedge product. In four dimensions, the exterior derivative of a vector has more terms. Invoking the correspondence principle, we can explain this in terms of the old-style E and B fields as follows: when we compute ∇∧F, the time derivative of the B-component cancels the spatial derivatives of the E-component.

This is a trap for the unwary. Don’t let your experience with D=3 poison your intuition about D=4. Consider the contrast:

In D=3 it is important to remember that “the field” (E) is not generally the derivative of any potential, whereas in D=4 we can always write “the field” (F) as F = dA.

For some problems, there is a natural reference frame that has immense practical significance; for some problems, the frame-independent spacetime approach is simple, convenient, powerful, and elegant.

For example, if you are dealing with transformers or ground loops, you care a lot about the electric field in the frame of the device. The fact that this field cannot be written as the gradient of any potential is important. See reference 8 for suggestions on how to visualize what’s going on.

7.3  Gauge Invariance

The vector-ish potential is implicitly defined by equation 29. However, for any given field F, you don’t know whether the vector-ish potential is A or A + λ′, since we can write:

F = ∇∧(A + λ′)
             (34)

for any vector field λ′ such that

∇∧λ′ = 0
             (35)

In particular, we can use the gradient of any scalar field λ:

λ′ = ∇λ
             (36)

which is guaranteed to work since ∇∧∇(anything) is automatically zero. Beware of inconsistent terminology: Sometimes λ is called «the» gauge field, and sometimes λ′ is called «the» gauge field.

7.4  The Maxwell Equation in terms of the Vector-ish Potential

The fact that we can write the electromagnetic field bivector as the derivative of a vector field is related to the fact that there are no trivector terms on the RHS of the Maxwell equation (equation 1). In particular, because ∇ is a vector, we can always write:

∇F =   ∇·F    (vector piece)
     + ∇∧F    (trivector piece)
             (37)

Equation 37 is a mathematical identity, valid for any F you can think of. Applying it to the electromagnetic field in particular and plugging in equation 29 we obtain:

∇F =   ∇·∇∧A    (vector piece)
     + ∇∧∇∧A    (trivector piece)
             (38)

So we could not write F = ∇∧A unless we already knew that ∇∧F was zero, since ∇∧∇∧A is automatically zero. Indeed ∇∧∇∧(anything) is automatically zero; see equation 17.

Combining these ideas, we see that another way of writing the Maxwell equation is:

∇·∇∧ A = (1/cє0) J              (39)

or equivalently (choosing the Lorenz gauge, in which ∇·A = 0):

∇² A = (1/cє0) J              (40)

where ∇² is called the d’Alembertian, or (equivalently) the four-dimensional Laplacian. It’s the dot product of the derivative operator with itself.

Some references express the same idea using a different symbol:

□² A = (1/cє0) J              (41)

Beware that yet other references use plain unsquared □ to represent the d’Alembertian. The idea is that they reserve ∇2 to represent the three-dimensional Laplacian, and use □2 to represent the four-dimensional generalization. However, in this document, we assume that all vectors are four-dimensional unless otherwise specified; for example, p is the four-momentum, A is the four-vector-ish potential, ∇ is the four-dimensional gradient, et cetera.

8  Geometric Algebra – General Remarks

8.1  Overview

Geometric Algebra has some tremendous advantages. It provides a unified view of inner products, outer products, D=2 flatland, D=3 space, D=1+3 spacetime, vectors, tensors, complex numbers, quaternions, spinors, rotations, reflections, boosts, and more. This may sound too good to be true, but it actually works.

If you need an introduction to Geometric Algebra, please see reference 9, reference 10, and other references in section 12. Just as I did not include an introductory discussion of the divergence and curl operators in equation 3, I will not include an introductory discussion of Geometric Algebra here. There’s no point in duplicating what’s in the references. In particular, reference 10 discusses electromagnetism using D=3 Clifford Algebra, which is easier to follow than the D=4 discussion here, but the results are not as simple and elegant as equation 1. The calculation here, while not particularly difficult, does not pretend to be entirely elementary.

8.2  No Decorations

In Geometric Algebra, it is traditional not to distinguish vectors using boldface or other decorations. This is appropriate, since the Clifford Algebra operates on multivectors and treats all multivectors on pretty much the same footing. Multivectors can be scalars, vectors, bivectors, pseudovectors, pseudoscalars — or linear combinations of the above.

8.3  No Cross Product

Observe that there is no cross-product operator in equation 1 or equation 20. That is good. Cross products are trouble. They don’t exist in two dimensions, they are worse than useless in four dimensions, and aren’t even 100% trustworthy in three dimensions. For example, consider a rotating object and its angular-momentum vector r × p. If you look at the object in a mirror, the angular-momentum vector is reversed. You can’t draw a picture of the rotating object and its angular-momentum vector and expect the picture to be invariant under reflections.

As far as I can tell, every physics formula involving a cross product can be improved by rewriting it using a wedge product instead.

For a rotating object, the cross product r × p is a vector oriented according to the axis of rotation, while the wedge product r ∧ p is an area oriented according to the plane of rotation. The concept of “axis of rotation” is not portable to D=2 or D=4, but the concept of “plane of rotation” works fine in all dimensions.

If you think cross products are trouble, wait till you see Euler angles. They are only defined with respect to a particular basis. It’s pathetic to represent rotations in a way that is not rotationally invariant. Geometric Algebra fixes this.

8.4  No Chirality

Note that Clifford Algebra does not require any right-hand rule. In equation 10, the timelike vector is distinguished from the spacelike vector, but otherwise that equation and equation 9 treat all the basis vectors on an equal footing; renaming or re-ordering them doesn’t matter.

In D=3 or D=1+3 the unit pseudoscalar (equation 45) is chiral; that is, constructing it requires the right-hand rule. The axioms of Clifford Algebra sometimes permit but never require the construction of such a critter. The laws of electromagnetism are completely left/right symmetric. The magnetic term in equation 7 contains B, which is chiral because it was defined via the old-fashioned cross product ... but the same term contains a factor of i which makes the overall expression left/right symmetric. It would be better to write the magnetic field as a bivector to begin with (as in reference 3), so the equations would make manifest the intrinsic left/right symmetry of the physical laws.

8.5  Multiple Approaches

There are at least three different approaches to defining an F-like quantity as part of a geometric-algebra formulation of electromagnetism.

Each approach is self-consistent, and most of the equations, such as equation 1, are the same across all systems.

The advantage of the bivector + bivector approach is that it is “at home in spacetime”, i.e. it treats x and t on the same footing, and treats B and E on the same footing (to the extent possible). It makes it easy and intuitive to draw bivector diagrams of the sort used in reference 3.

9  Pitfalls to Avoid

9.1  Definition of Dot Product

You may be accustomed to expanding the dot product as

A·B  ?=?  A1B1 + A2B2 + A3B3              (42)

as if that were the definition of dot product ... but that is not the definition, and you’ll get the wrong answer if you try the corresponding thing in a non-Euclidean space, such as spacetime. So what you should do instead is to expand

A = Aµγµ = A0γ0 + A1γ1 + A2γ2 + A3γ3              (43)

where the γµ are the basis vectors. Such an expansion is always legal. That is what defines the components Aµ. The superscripts on A label the components of A; they are not exponents. The subscripts on γ do not indicate components; they simply label which of the basis vectors we are talking about. It is possible but not particularly helpful to think of γ0 as the zeroth component of some “vector of vectors”; in any case remember that γ0 is a vector unto itself.

When you take the dot product A·B, the expansion in equation 43 (and a similar expansion for B) gives you sixteen terms, since the dot product distributes over addition in the usual way. The twelve off-diagonal terms vanish, since they involve things like γ1·γ2 and the basis vectors are mutually orthogonal. So we are left with

A·B  = A0B0 γ0·γ0 + A1B1 γ1·γ1 + A2B2 γ2·γ2 + A3B3 γ3·γ3
  = − A0B0 + A1B1 + A2B2 + A3B3
             (44)

where the term A0B0 has picked up a minus sign, because γ0² is −1.
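In code, the content of equation 44 is just the statement that the metric is diag(−1, +1, +1, +1). Here is a minimal sketch (my own; the list ordering [A0, A1, A2, A3] is an assumption of the sketch):

```python
# The dot product in D=1+3, expanded per equation 43 and equation 44:
# the off-diagonal terms drop out and the timelike term picks up a minus sign.

def dot(A, B):
    """Minkowski dot product with signature (-, +, +, +), matching equation 10."""
    return -A[0] * B[0] + A[1] * B[1] + A[2] * B[2] + A[3] * B[3]

u = [1.0, 0.0, 0.0, 0.0]         # a unit vector along gamma0
print(dot(u, u))                 # -1.0, consistent with gamma0 . gamma0 = -1

# The naive Euclidean rule of equation 42 would give +1 here, which is wrong
# for spacetime.
```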

9.2  Unit Pseudoscalar

Another thing to watch out for when reading the Geometric Algebra literature concerns the use of the symbol i for the unit pseudoscalar:

i := γ0γ1γ2γ3              (45)

It’s nice to have a symbol for the unit pseudoscalar, and choosing i has some intriguing properties stemming from the fact that i2 = −1, but there’s a pitfall: you may be tempted to treat i as a scalar, but it’s not. Scalars commute with everything, whereas this i anticommutes with vectors (and all odd-grade multivectors). This is insidious because in D=3 the unit pseudoscalar commutes with everything. For these reasons we have mostly avoided using i in the main part of this note.

9.3  Exponents; Squared versus Norm Squared

Logical consistency requires that when using superscripts as exponents, they should denote simple powers:

 M² := MM
 M³ := MMM
 etc.
             (46)

for any multivector M. However, there is an unfortunate tendency for some authors to write M² when they mean M M~, where M~ denotes the reverse of M, formed by writing in reverse order all the vectors that make up M; for example the reverse of equation 7 tells us that F~ = γ0 (E + cB i).

This is insidious because for scalars and vectors M M~ = M M; the distinction is only important for grade-2 objects and higher.

I recommend writing out M M~ whenever you mean M M~. Many authors are tempted to come up with a shorthand for this – perhaps M², |M|², or ||M||² – but in my experience such things are much more trouble than they are worth. You need to be especially careful in the case where there are timelike vectors involved, since M M~ might well be negative. In such a case, any notation that suggests that M M~ is the square of anything is just asking for trouble.

A related and very important idea is the gorm of an object M, defined to be the scalar part of M M~, i.e. ⟨M M~⟩0. (We saw a good physical example, namely the gorm of the electromagnetic field, in section 6.3.)

9.4  Dot Product Not Necessarily Commutative

The dot product of a vector with a bivector is anticommutative, so be careful how you write the Lorentz force law:

u · F = − F · u              (47)

This is insidious because the dot product is commutative when acting on two vectors, or on “almost” any combination of multivectors. It is anticommutative only in cases where one of them has odd grade, and the other has a larger even grade. That is, in general,

A · B = (−1)^(min(r,s) |r−s|) B · A              (48)

where r is the grade of A and s is the grade of B. This result may seem somewhat counterintuitive, but it is easy to prove; compare equation 22 in reference 6.

10  Additional Remarks

10.1  More about the Notation

  1. You can think of x1 as the X-direction, x2 as the Y-direction, and x3 as the Z-direction, but for most purposes we prefer the {x1 x2 x3} notation to the {X Y Z} notation. We will use x0 and t almost interchangeably. There are unsettled issues about t versus ct, as discussed in section 10.2.
  2. We are using the Einstein summation convention, which calls for implied summation over repeated dummy indices, so that e.g.

    ∇k Ek := ∇1 E1 + ∇2 E2 + ∇3 E3              (49)

    Roman-letter indices run over the values 1,2,3 while Greek-letter indices run over the values 0,1,2,3.

  3. By definition of what we mean by component, we can expand the ∇ operator in terms of its components

    ∇ = ∇µ γµ              (50)

    Naturally ∇1 = (∂/∂x1) and similarly for x2 and x3, but you have to be careful of the minus sign in

    ∇0 = −(∂/∂x0)              (51)

    Note that equation 50 expresses a vector in terms of components times basis vectors, in contrast to equation 51 which expresses only one component.

    Here’s how I like to remember where the minus sign goes. Imagine a scalar field f(x), that is, some dimensionless scalar as a function of position. Positions are measured in inches. The length of the gradient vector ∇f is not measured in the same units as the length of position vectors. In fact it will have dimensions of reciprocal inches. So in this spirit we can write

    ∇ = (1/γ0) ∂/∂x0 + (1/γ1) ∂/∂x1 + (1/γ2) ∂/∂x2 + (1/γ3) ∂/∂x3
                 (52)

    We can easily evaluate the reciprocals of the γµ vectors according to equation 10, resulting in:

    ∇ = − γ0 ∂/∂x0 + γ1 ∂/∂x1 + γ2 ∂/∂x2 + γ3 ∂/∂x3
                 (53)

    which has the crucial minus sign in front of the first term, and has the basis vectors in the numerators where they normally belong.

  4. We make use of єijk which is the Levi-Civita completely antisymmetric symbol: it equals +1 when ijk is a cyclic permutation of 123, equals -1 when ijk is an odd permutation of 123, and zero otherwise (namely when one of the indices is equal to one of the others).
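As a small illustration of items 2 and 4, the following sketch (my own; the helper names are hypothetical) writes out the implied sums of equation 49 for one component of a curl using єijk.

```python
# The Levi-Civita symbol of item 4, plus the implied-summation pattern of
# equation 49, used to rebuild a curl component the old-fashioned way.
import itertools

def eps(i, j, k):
    """epsilon_ijk: +1 for cyclic 123, -1 for odd permutations, else 0."""
    if (i, j, k) in [(1, 2, 3), (2, 3, 1), (3, 1, 2)]:
        return +1
    if (i, j, k) in [(1, 3, 2), (3, 2, 1), (2, 1, 3)]:
        return -1
    return 0

def curl_component(i, dV):
    """(curl V)_i = eps_ijk d_j V_k; dV[j][k] holds the value of d_j V_k."""
    return sum(eps(i, j, k) * dV[j][k]
               for j, k in itertools.product((1, 2, 3), repeat=2))

# Sanity check: if only d_1 V_2 is nonzero (equal to 1), the only surviving
# curl component is (curl V)_3 = +1.
dV = {j: {k: (1 if (j, k) == (1, 2) else 0) for k in (1, 2, 3)} for j in (1, 2, 3)}
print([curl_component(i, dV) for i in (1, 2, 3)])     # [0, 0, 1]
```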

10.2  Factors of c

In the field of electromagnetism, when we move beyond the introductory level to the intermediate level or the professional level, it is traditional to measure time in units of length, so that the speed of light is c=1 in the chosen units.

This is a reasonable choice. However, it should remain a choice, not an obligation. We should be allowed to choose old-fashioned units of time if we wish. There are sometimes non-perverse reasons for choosing c≠1 – such as when checking the correspondence principle, as we do in this document.

This causes difficulties, because in the literature, some of the key formulas blithely assume c=1, and if you want to go back and generalize the formulas so that they work even when c≠1, it is not always obvious how to do it. It’s “usually” obvious, but not always.

In particular, consider the gorm of a vector (i.e. 4-vector) R that specifies position in spacetime. For any grade=1 vector R, the gorm is equal to the dot product, R·R. For a position vector, we can write the gorm in terms of components, namely −c²t² + x² + y² + z². Leaving out the factor of c² would make this expression incorrect, indeed dimensionally unsound ... unless c=1. Working backwards from the usual definition of dot product, that tells us that the position vector is R = [c t, x, y, z] not simply [t, x, y, z].

A similar argument tells us that the [energy, momentum] 4-vector is [E, c px, c py, c pz] not simply [E, px, py, pz].

The terminology in this area is a trap for the unwary. You need to be careful to distinguish between “the time” (namely t) and “the timelike component of the position vector” (namely ct).

It is sometimes suggested that the dot product (i.e. the metric) be redefined to include explicit factors of c, which would permit the position vector be written as simply [t, x, y, z]. I do not recommend this, because although it is helpful for position 4-vectors, it is quite unhelpful for [energy, momentum] 4-vectors.

11  An Application: Plane Waves

11.1  Running Waves

As a modest application of equation 1, let’s try to find some solutions for it. In keeping with the spirit of this document, we will emphasize simplicity rather than elegance. We will formulate the problem in modern 4-dimensional terms, but in a way that maintains contact with old-style 3-dimensional frame-dependent concepts such as E and B. Also we will restrict attention to plane waves in free space.

In free space, there is no charge or current, so equation 1 simplifies to:

∇ F = 0              (54)

We will write down a simple Ansatz (equation 55), and then show that it does in fact solve equation 54.

F = E(ky−vt) γ1γ0  +  D(ky−vt) γ2γ0  −  cB(ky−vt) γ1γ2
  = E(Φ) γ1γ0  +  D(Φ) γ2γ0  −  cB(Φ) γ1γ2
             (55)

where F is the electromagnetic field bivector, E, D, and B are simple scalar functions of one scalar argument with as-yet undetermined physical significance, and Φ is the scalar phase:

Φ := ky − vt             
k = +1   for propagation in the +y direction
k = −1   for propagation in the −y direction
             (56)

Here is some motivation that may make this Ansatz less mysterious:

If we take a snapshot at any given time, we find that every plane parallel to the xz plane is a wavefront. That is to say, every such plane is a contour of constant phase. That’s because it is, by construction, a contour of constant t and constant y. The phase depends on t and y, but not on x or z. This is what we would expect for a plane wave traveling in the y direction.

Using the chain rule we have:

∂E/∂(ct) = (dE/dΦ) ∂Φ/∂(ct)
         = (−v/c) E′
∂E/∂y = (dE/dΦ) ∂Φ/∂y
      = k E′
             (57)

Corresponding statements can be made about B and D ... just apply the chain rule in the corresponding way. Here E′ is pronounced “E prime” and denotes the total derivative of E with respect to the scalar phase Φ.

Since there are three terms in equation 55, taking the derivative gives us six terms; three for the timelike part of the gradient and three for the spacelike part. Plugging in and simplifying a bit gives us:

∇F = (−∂/∂ct)E γ0γ1γ0 + (−∂/∂ct)D γ0γ2γ0 − (−∂/∂ct)cB γ0γ1γ2
     + (∂/∂y)E γ2γ1γ0 + (∂/∂y)D γ2γ2γ0 − (∂/∂y)cB γ2γ1γ2
  = (v/c)E′ γ0γ1γ0 + (v/c)D′ γ0γ2γ0 − (v/c)cB′ γ0γ1γ2
     + kE′ γ2γ1γ0 + kD′ γ2γ2γ0 − kcB′ γ2γ1γ2
  = (v/c)E′ γ1 + (v/c)D′ γ2 − (v/c)cB′ γ0γ1γ2
     − kE′ γ0γ1γ2 + kD′ γ0 + kcB′ γ1
             (58)

By equation 54 we know this must equal zero. Each vector component must separately equal zero. Therefore:

E′ = −(v/kc) cB′   from the trivector part
D′ = 0   from the γ0 (timelike) part
cB′ = −(v/kc) E′   from the γ1 (spacelike) part
             (59)

For additional follow-up on these results, see section 11.2. For now, let’s combine these results so as to obtain a consistency requirement for E:

E′′ = (v²/c²) E′′
             (60)

where we have used the fact that k² = 1.

The first thing that we learn from equation 60 is that the electromagnetic plane wave in free space must propagate at speed |v| = c. This is an unavoidable consequence of the Maxwell equation in free space, equation 54.

The second thing that we learn is that for any wave propagating at the required speed, the wavefunction can have any shape whatsoever, so long as it is a differentiable function of its argument, i.e. a differentiable function of the phase Φ. It must be emphasized that we have not assumed that E is sinusoidal or even periodic. Any function E(Φ) you can think of, so long as it is differentiable, is an acceptable wavefunction for a plane wave in free space. Even an isolated blip, such as shown in figure 2, can be a solution to equation 54. The blip is moving left-to-right at the speed of light; the figure shows only a snapshot taken at time t=0.

Figure 2: Snapshot of an Isolated Blip
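The claim that any differentiable blip moving at speed c is a solution can be checked numerically. By equation 57, ∂²E/∂(ct)² = (v²/c²)E′′ and ∂²E/∂y² = E′′, so equation 60 is equivalent to ∂²E/∂(ct)² = ∂²E/∂y². The sketch below (my own; the Gaussian shape and grid spacing are arbitrary choices) verifies by finite differences that this holds for a blip moving at |v| = c and fails at any other speed.

```python
# Finite-difference check that a Gaussian blip E(y, t) = f(y - v t) behaves as
# equation 60 demands: it satisfies d^2E/d(ct)^2 = d^2E/dy^2 only when |v| = c.
import numpy as np

c = 1.0                                    # work in units where ct and y share a scale
f = lambda phi: np.exp(-phi ** 2)          # any differentiable shape will do

def residual(v, y0=0.3, t0=0.2, h=1e-3):
    """Wave-equation residual at one spacetime point, by central differences."""
    E = lambda y, t: f(y - v * t)
    d2_ct = (E(y0, t0 + h / c) - 2 * E(y0, t0) + E(y0, t0 - h / c)) / h ** 2
    d2_y  = (E(y0 + h, t0) - 2 * E(y0, t0) + E(y0 - h, t0)) / h ** 2
    return d2_ct - d2_y

print("residual at v =  c :", residual(c))        # ~ 0 (truncation error only)
print("residual at v = c/2:", residual(c / 2))    # clearly nonzero
```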

The third thing we learn from equation 60 in conjunction with equation 59 is that once we have chosen E, then cB is constrained by equation 59. That is, at every point in spacetime, E = −kcB + g, where g is some constant of integration. This g is not very interesting. It is constant across all of space and time, and represents some uniform, non-propagating background field. It has no effect on the propagating wave; the wave just propagates past it.

This completes the task of finding some solution.

Let’s see if we can find a few more solutions.

First of all, we know the Maxwell equations are invariant under spacelike rotations, so we know there must exist plane waves propagating in any direction, not just the y direction. Any rotated version of our solution is another solution.

Secondly, you can easily verify that the factor of γ1 in equation 55 did not play any important role in the calculation; mostly it just went along for the ride. We could easily replace it with γ3 and thereby obtain another solution, propagating in the same direction as the previous solution, but linearly independent of it. This phenomenon is called polarization. The Ansatz in equation 55 is polarized in the γ1 direction. You can verify that the polarization vector must be transverse to the direction of propagation; otherwise equation 55 does not work as a solution to equation 54.

We won’t prove it, but we assert that we now have all the ingredients needed to construct the most general solution for plane waves in free space: first, pick a direction of propagation. Then choose a basis for the polarization vector, i.e. two unit vectors in the plane perpendicular to the direction of propagation. Then think of two arbitrary differentiable functions of phase, one for each component of the polarization vector. Finally, take arbitrary superpositions of all the above.

Tangential remark: Even though the Ansatz in equation 55 contains three terms, the fact that E = −kcB and D = 0 means it can be written as a single blade, i.e. a bivector that is simply the product of two vectors. Specifically:

F = E(Φ) γ1 (γ0 + kγ2)  
             (61)

The structure here, and for any running plane wave, is simple. There are three factors: a scalar function E(Φ) that specifies the shape of the wave, times a spacelike vector that represents the polarization, times a null vector that represents the direction of propagation.

The general electromagnetic plane wave is not a single blade, but it can be written as a sum of blades of this form. Even more generally, there are lots of waves that are not plane waves.

11.2  Running Wave Phase Relationships

As noted in section 11.1, there is a strict correspondence between the electric part and the magnetic part in an electromagnetic running plane wave. For a blip (or anything else) running left to right

E = −cB   pointwise everywhere in space and time
             (62)

This is sometimes expressed by saying the E field and the cB field are “in phase”. (Such an expression makes more sense for sinusoidal waves than for blips.)

Meanwhile, for a blip (or anything else) running right to left,

E = +cB   pointwise everywhere in space and time
             (63)

That is, once again there is a strict relationship between E and cB ... but the relationship in equation 63 is diametrically opposite to the relationship in equation 62. One of them is 180 degrees out of phase with the other.

If you consider the superposition of a left-running blip and a right-running blip, the whole notion of “phase relationship” goes out the window. You can have places where E is zero but cB is not, or vice versa, or anything you like, and the local relationship between E and cB will be wildly changing as a function of space and time. A particular type of superposition is considered in section 11.3.

11.3  Standing Wave Phase Relationships

A standing wave can be viewed as the superposition of equal-and-opposite running waves. In particular, let’s start with the sinusoidal waves

E1 = cos(ct − y)          
E2 = cos(ct + y)          
E = E1 + E2             
cB1 = −E1                 
cB2 = +E2                 
cB = cB1 + cB2          
  = −E1 + E2          
             (64)

At any particular location y, the wave is a sinusoidal function of time. Choosing a different location just changes the phase. Let’s apply the trigonometric sum-of-angles identity:

E1 = cos(ct) cos(y) + sin(ct) sin(y)         
E2 = cos(ct) cos(y) − sin(ct) sin(y)         
E = E1 + E2             
  = 2 cos(ct) cos(y)            
cB = −E1 + E2          
  = −2 sin(ct) sin(y)         
             (65)

So, as advertised above, we see that at most locations – i.e. any location where cos(y) and sin(y) are both nonzero – the E-field and the B-field are 90 degrees out of phase for a standing wave. (They are in phase for a running wave, as discussed in section 11.2.)
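The bookkeeping in equation 64 and equation 65 is easy to confirm numerically. A short sketch (my own; the sampling grid is an arbitrary choice):

```python
# Numeric check of equation 64 and equation 65: superposing the two running
# waves gives E = 2 cos(ct) cos(y) and cB = -2 sin(ct) sin(y), i.e. E and cB
# are 90 degrees out of phase in the standing wave.
import numpy as np

ct = np.linspace(0, 2 * np.pi, 101)
y = 1.234                                    # any fixed location will do

E1, E2 = np.cos(ct - y), np.cos(ct + y)      # the two running waves
E = E1 + E2
cB = -E1 + E2                                # equation 64

assert np.allclose(E, 2 * np.cos(ct) * np.cos(y))         # equation 65
assert np.allclose(cB, -2 * np.sin(ct) * np.sin(y))       # equation 65
print("standing-wave phase relationship confirmed")
```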

11.4  Spacetime Picture

This section is restricted to the case where k=+1; that is, the wave is propagating in the +y direction. Also we assume the constant of integration g is zero. Therefore E = −cB everywhere.

The blip we saw in figure 2 is portrayed again in figure 3. The former portrayed two variables, namely E versus y (at constant t). The latter portrays three variables, namely t, y, and E. The value of E is represented by the closeness of the flux lines. You can see that in the front half of the blip (larger y values) the E field is twice as large as in the back half of the blip.

Figure 3: Radiation : Flux Lines in Spacetime

The fact that E = −cB corresponds to the fact that, at each and every point in spacetime, the number of flux lines per unit distance in the timelike direction is equal to the number of flux lines per unit distance in the spacelike direction. An example of this is portrayed by the two small blue arrows in the figure. Not only does each arrow cross the same number of flux lines, it crosses the same flux lines.

You can see that this is a direct consequence of the geometry of spacetime, and the fact that the wave is propagating with velocity v=c.

As shown by the purple lines, contours of constant phase run from southwest to northeast. Phase increases toward the south and east. Phase increasing to the south corresponds to temporal period, and phase increasing to the east corresponds to spatial period i.e. wavelength. Note that any attempt to measure period or wavelength is utterly frame-dependent. Some properties of the wave (such as the total number of cycles) are frame-independent, but other properties (such as period, frequency, wavelength, and wavenumber) are necessarily frame-dependent.

In figure 3, the x and z directions are not visible. If we made a more complicated diagram, from a different perspective, the electromagnetic field bivector F would be represented by tubes. The magnitude of F corresponds to the number of tubes per unit area.

12  References

1.
Wikipedia article: “Centimeter gram second system of units”
http://en.wikipedia.org/wiki/CGS
2.
W. E. Baylis and G. W. F. Drake,
“Units and Constants”
in Atomic Molecular & Optical Physics Handbook AIP 1996
http://www.atomwave.org/rmparticle/ao%20refs/aifm%20refs%20sorted%20by%20topic/other%20atom%20optics%20reviews/drakepdf/DRAKE01.PDF
3.
John Denker,
“The Magnetic Field Bivector of a Long Straight Wire”
www.av8n.com/physics/straight-wire.htm
4.
John Denker
“Conservation as related to Continuity and Constancy”
www.av8n.com/physics/conservation-continuity.htm
5.
John Denker,
“Introduction to Clifford Algebra”
www.av8n.com/physics/clifford-intro.htm
6.
Richard E. Harke,
“An Introduction to the Mathematics of the Space-Time Algebra”
http://www.harke.org/ps/intro.ps.gz
7.
John Denker,
“Welcome to Spacetime”
www.av8n.com/physics/spacetime-welcome.htm
8.
John Denker,
“Visualizing A Field that is Not the Gradient of Any Potential”
www.av8n.com/physics/non-grady.htm
9.
Stephen Gull, Anthony Lasenby, and Chris Doran,
“The Geometric Algebra of Spacetime”
http://www.mrao.cam.ac.uk/~clifford/introduction/intro/intro.html
10.
David Hestenes,
“Oersted Medal Lecture 2002: Reforming the Mathematical Language of Physics”
Abstract: http://geocalc.clas.asu.edu/html/Overview.html Full paper: http://geocalc.clas.asu.edu/pdf/OerstedMedalLecture.pdf
11.
W. E. Baylis,
Electrodynamics, A Modern Geometric Approach
Birkhäuser, Boston (1999).

1
You may be familiar with matrix multiplication, which has many of the same axioms as Geometric Algebra, including the associative law, the distributive law, and non-commutative multiplication. But the analogy is not perfect: the product of two matrices is another matrix, whereas the geometric product of two vectors isn’t another vector: it could be a scalar (force times distance = work) or a bivector (force times distance = torque) or perhaps a combination of the two – but it won’t be a proper vector.
2
Some other authors use the opposite convention, in which γ0 γ0 = +1 and all the others are −1. It doesn’t make much difference; the physics works out the same using either convention. But the convention used here makes it slightly easier to see the correspondence with plain old D=3 vectors.
3
Magnetic monopoles would be described by a trivector (i.e. pseudovector) term on the RHS of equation 1.
4
The proper velocity u = dx/dτ is not to be confused with the coordinate velocity v = dx/dt. They’re the same when they’re small, but in general they differ by a factor of gamma. You could rewrite equation 20 in terms of t-derivatives rather than τ-derivatives, but it would be less elegant and less useful.