ASF

Maxwell's Equations and Gauge Theory: Electromagnetism as a Principal Bundle

March 22, 2026|
A
A

A compass needle aligns with a magnetic field threading through empty air. A lightning bolt drives a current through the atmosphere via the same electric force that pulls charged particles in a wire. Radio waves cross a continent at the speed of light, which turns out to be the same constant that emerges from the equations governing both electricity and magnetism. All of this follows from four equations that James Clerk Maxwell assembled in 1865.

The classical formulation in vector calculus is powerful and concrete, but it hides something. The equations have a symmetry—gauge invariance—that suggests the fields E and B are not the most fundamental objects. Behind them sits the vector potential A, and behind A sits the geometry of a principal fibre bundle over spacetime. The same geometric structure that appears in electromagnetism reappears, with a non-abelian gauge group, in the quantum chromodynamics of quarks and the electroweak theory of the Standard Model. In 2000, the Clay Mathematics Institute designated the question of whether the non-abelian (Yang–Mills) version of this structure always has a positive mass gap as one of the seven Millennium Prize Problems, with a million-dollar prize still unclaimed.

This post develops four parallel languages for Maxwell's equations: the classical vector calculus form, the coordinate-free language of differential forms on Minkowski spacetime, the spacetime algebra (geometric algebra) encoding all four equations in a single line, and the principal fibre bundle construction that reveals gauge theory as differential geometry. At each level the physical content is identical; what changes is how much of the underlying structure becomes visible.

1. The Classical Laws

The four Maxwell equations rest on four independent experimental observations, each discovered in a 46-year window between 1785 and 1831. Coulomb measured the force between static charges. Biot and Savart quantified the force between current-carrying wires. Faraday discovered that a moving magnet induces a voltage in a nearby coil. Ampere related closed line integrals of B\mathbf{B} to enclosed currents. In 1865, Maxwell added one term to Ampere's law to repair an inconsistency with charge conservation. As a consequence, he found that light is an electromagnetic wave.

Before stating the laws, two pieces of notation need grounding.

Charge is an intrinsic scalar property of matter, analogous to mass but with one fundamental difference: it can be positive, negative, or zero. Two particles with the same sign of charge repel each other; two with opposite signs attract. Coulomb quantified this force in 1785, and that measurement is the foundational observation that justifies introducing charge as a concept at all. Classical theory offers no deeper explanation for why matter carries charge; it is a label that organizes the forces we observe. Field theory adds structural content to that label: charge is the source term that couples matter to the electromagnetic field. Without charge, Gauss's law reduces to the homogeneous equation E=0\nabla \cdot \mathbf{E} = 0, and the field has no reason to be nonzero anywhere. Charge is what forces the field to respond to the presence of matter.

Charge density ρ(x,t)\rho(\mathbf{x}, t) extends this to continuous distributions. When charge is spread through a volume (as in a plasma or the conduction band of a metal), we cannot assign a charge to each geometric point; instead we ask how much charge occupies an infinitesimally small region near x\mathbf{x}. Let Bε(x)B_\varepsilon(\mathbf{x}) be a ball of radius ε\varepsilon centered at x\mathbf{x}. Then

ρ(x,t)  =  limε0Q ⁣(Bε(x),t)Vol(Bε)\rho(\mathbf{x},t) \;=\; \lim_{\varepsilon \to 0} \frac{Q\!\left(B_\varepsilon(\mathbf{x}),\,t\right)}{\mathrm{Vol}(B_\varepsilon)}

where Q(Bε,t)Q(B_\varepsilon, t) is the total charge inside that ball at time tt and Vol(Bε)=43πε3\mathrm{Vol}(B_\varepsilon) = \tfrac{4}{3}\pi\varepsilon^3. This is the same construction as mass density in continuum mechanics: charge per unit volume, measured in coulombs per cubic meter. The total charge in any region VV is recovered by integration:

QV(t)=Vρ(x,t)d3xQ_V(t) = \iiint_V \rho(\mathbf{x},t)\,\mathrm{d}^3x

Point charges and the Dirac measure. An electron carries all of its charge at a single spatial point, to every precision ever measured. A density function for this situation must be zero everywhere except at the electron's location, yet integrate over all of space to the total charge ee. No function in the ordinary (Lebesgue) sense can achieve this: a function that is zero almost everywhere has integral zero, regardless of the value at any isolated point.

The kk-dimensional Dirac delta δ(k)(x)\delta^{(k)}(\mathbf{x}) resolves this by working in the broader class of distributions: continuous linear functionals on smooth test functions, rather than pointwise-valued maps. The three-dimensional Dirac delta centered at the origin is defined by its action on any smooth compactly supported test function φ\varphi:

R3δ(3)(x)φ(x)d3x  =  φ(0)\int_{\mathbb{R}^3} \delta^{(3)}(\mathbf{x})\,\varphi(\mathbf{x})\,\mathrm{d}^3x \;=\; \varphi(\mathbf{0})

It is the evaluation functional at the origin. A concrete way to approach it: δ(3)(x)\delta^{(3)}(\mathbf{x}) is the distributional limit of the family of normalized Gaussians

gσ(x)=1(2πσ2)3/2exp ⁣(x22σ2)g_\sigma(\mathbf{x}) = \frac{1}{(2\pi\sigma^2)^{3/2}}\exp\!\left(-\frac{|\mathbf{x}|^2}{2\sigma^2}\right)

Drag the slider right to push σ0\sigma \to 0. The Gaussian concentrates to an infinitely tall spike while its area stays fixed at 1. The Dirac delta δ(x)\delta(x) is what the family converges to in the distributional sense.

as σ0\sigma \to 0. For any fixed smooth φ\varphi, one has R3gσφφ(0)\int_{\mathbb{R}^3} g_\sigma\,\varphi \to \varphi(\mathbf{0}) as σ0\sigma \to 0. The Dirac delta is what that Gaussian family converges to in the distributional sense: a spike of unit mass that has collapsed to a single point. The kk-dimensional version δ(k)(x)\delta^{(k)}(\mathbf{x}) is the same construction in Rk\mathbb{R}^k, satisfying Rkδ(k)(x)φ(x)dkx=φ(0)\int_{\mathbb{R}^k} \delta^{(k)}(\mathbf{x})\,\varphi(\mathbf{x})\,\mathrm{d}^k x = \varphi(\mathbf{0}).

The charge density of a point charge QQ at the origin is then ρ=Qδ(3)(x)\rho = Q\,\delta^{(3)}(\mathbf{x}), a distributional statement with full physical weight: substituting into Gauss's law and integrating over any sphere enclosing the origin recovers Coulomb's inverse-square field exactly. A point particle moving along a worldline γ(t)\boldsymbol{\gamma}(t) generalizes this to ρ=Qδ(3)(xγ(t))\rho = Q\,\delta^{(3)}(\mathbf{x} - \boldsymbol{\gamma}(t)), with current density J=Qγ˙(t)δ(3)(xγ(t))\mathbf{J} = Q\,\dot{\boldsymbol{\gamma}}(t)\,\delta^{(3)}(\mathbf{x} - \boldsymbol{\gamma}(t)). In field theory, the interaction between a charged particle and the electromagnetic potential is the spacetime integral JμAμd4x\int J^\mu A_\mu\,\mathrm{d}^4x. For a point particle, the delta function collapses this volume integral to a line integral γA\oint_{\boldsymbol{\gamma}} A along the worldline. That line integral is precisely the geometric object whose ambiguity encodes gauge freedom and whose measurability produces the Aharonov–Bohm phase in Section 8.

In a conductor or plasma, ρ\rho is instead a smooth function spread continuously through space. Both cases, the point-supported delta and the smooth continuous distribution, are valid instances of the same distributional framework; only the degree of spatial concentration differs.

On quantum mechanics. The treatment above assigns charge density via Dirac deltas for point particles or smooth functions for continuous distributions, and in both cases the electron is treated as if it occupies a definite location or a definite extended region. Quantum mechanics alters this. An electron is not at a definite point; it occupies a quantum state described by a wavefunction ψ(x,t)\psi(\mathbf{x},t), and ψ(x,t)2|\psi(\mathbf{x},t)|^2 is the probability density for finding the electron near x\mathbf{x}. The charge density it contributes is accordingly ρ=eψ2\rho = -e|\psi|^2 (negative because the electron carries charge e-e). This is not charge spread spatially in the classical sense; it is the expectation value of the charge density operator in the state ψ\psi. In an atom, ψ\psi is an orbital, and eψ2-e|\psi|^2 is a smooth function spread over the orbital volume that sources the electric field through Gauss's law exactly as a classical continuous distribution does. Section 10 develops this connection: how quantum charge densities enter Maxwell's equations, how the resulting electrostatic potential feeds back into the Schrödinger equation, and how the coupled system is solved computationally.

The closed surface integral SFdS\oiint_S \mathbf{F} \cdot \mathrm{d}\mathbf{S} adds up the outward flux of a vector field F\mathbf{F} through every infinitesimal patch dS=n^dA\mathrm{d}\mathbf{S} = \hat{\mathbf{n}}\,\mathrm{d}A of a closed surface SS (one with no boundary, like a sphere or a box). It measures how much of F\mathbf{F} is leaving versus entering the enclosed region. The divergence theorem relates this to a volume integral: SFdS=VFdV\oiint_S \mathbf{F}\cdot\mathrm{d}\mathbf{S} = \iiint_V \nabla\cdot\mathbf{F}\,\mathrm{d}V, which is what converts the integral form of each law into the differential (pointwise) form below.

Each law is stated as a proposition and derived from the corresponding experiment below.

Proposition (Gauss's law for electricity).

Let ρ(x,t)\rho(\mathbf{x},t) be the charge density. The electric field satisfies

E=ρε0\nabla \cdot \mathbf{E} = \frac{\rho}{\varepsilon_0}

Derivation. Coulomb's law gives the field of a point charge QQ at the origin as E=Q4πε0r^r2\mathbf{E} = \frac{Q}{4\pi\varepsilon_0}\frac{\hat{\mathbf{r}}}{r^2}. The flux through a sphere SrS_r of radius rr is

SrEdS=Q4πε04πr21r2=Qε0\oiint_{S_r} \mathbf{E} \cdot \mathrm{d}\mathbf{S} = \frac{Q}{4\pi\varepsilon_0} \cdot 4\pi r^2 \cdot \frac{1}{r^2} = \frac{Q}{\varepsilon_0}

By the divergence theorem, SrEdS=VEdV\oiint_{S_r} \mathbf{E}\cdot\mathrm{d}\mathbf{S} = \iiint_V \nabla\cdot\mathbf{E}\,\mathrm{d}V. Replacing QQ by a continuous distribution and equating the integrands gives E=ρ/ε0\nabla\cdot\mathbf{E} = \rho/\varepsilon_0 pointwise. \square

Proposition (Gauss's law for magnetism).

For any magnetic field B\mathbf{B} produced by currents,

B=0\nabla \cdot \mathbf{B} = 0

Derivation. The Biot–Savart law gives B(x)=μ04πJ(x)×x ⁣(1xx)d3x\mathbf{B}(\mathbf{x}) = \frac{\mu_0}{4\pi}\int \mathbf{J}(\mathbf{x}')\times\nabla_x\!\left(\frac{-1}{|\mathbf{x}-\mathbf{x}'|}\right)\mathrm{d}^3x', which can be written as [eq:b-potential] with A=μ04πJ(x)xxd3x\mathbf{A} = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{x}')}{|\mathbf{x}-\mathbf{x}'|}\,\mathrm{d}^3x'. Then B=(×A)=0\nabla\cdot\mathbf{B} = \nabla\cdot(\nabla\times\mathbf{A}) = 0 by the identity (×A)=0\nabla \cdot (\nabla \times \mathbf{A}) = 0, which holds identically for any smooth vector field. \square

Proposition (Faraday's law of induction).

For any stationary closed curve CC bounding surface Σ\Sigma,

×E=Bt\nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}

Derivation. Faraday observed in 1831 that a changing magnetic flux ΦB=ΣBdS\Phi_B = \iint_\Sigma \mathbf{B}\cdot\mathrm{d}\mathbf{S} through a loop drives an EMF:

E=CEd=dΦBdt\mathcal{E} = \oint_C \mathbf{E}\cdot\mathrm{d}\boldsymbol{\ell} = -\frac{\mathrm{d}\Phi_B}{\mathrm{d}t}

For a stationary loop, dΦBdt=ΣBtdS-\frac{\mathrm{d}\Phi_B}{\mathrm{d}t} = -\iint_\Sigma \frac{\partial\mathbf{B}}{\partial t}\cdot\mathrm{d}\mathbf{S}. By Stokes' theorem, CEd=Σ(×E)dS\oint_C \mathbf{E}\cdot\mathrm{d}\boldsymbol{\ell} = \iint_\Sigma(\nabla\times\mathbf{E})\cdot\mathrm{d}\mathbf{S}. Since this holds for every Σ\Sigma, the integrands must match: ×E=tB\nabla\times\mathbf{E} = -\partial_t\mathbf{B}. \square

Proposition (Ampere–Maxwell law).

For current density J\mathbf{J} and electric field E\mathbf{E},

×B=μ0J+μ0ε0Et\nabla \times \mathbf{B} = \mu_0\mathbf{J} + \mu_0\varepsilon_0\frac{\partial\mathbf{E}}{\partial t}

Derivation. Ampere's part. The Biot–Savart law for a long straight wire gives B=μ0I2πsϕ^\mathbf{B} = \frac{\mu_0 I}{2\pi s}\hat{\boldsymbol{\phi}} at distance ss. Integrating around a circular Amperian loop: Bd=μ0Ienc\oint \mathbf{B}\cdot\mathrm{d}\boldsymbol{\ell} = \mu_0 I_{\mathrm{enc}}. By Stokes, this becomes ×B=μ0J\nabla\times\mathbf{B} = \mu_0\mathbf{J} in the static case.

Maxwell's correction. Taking the divergence: 0=(×B)=μ0J0 = \nabla\cdot(\nabla\times\mathbf{B}) = \mu_0\nabla\cdot\mathbf{J}, but charge conservation requires J=tρ\nabla\cdot\mathbf{J} = -\partial_t\rho, which is nonzero during transients. By Gauss's law, tρ=ε0t(E)=(ε0tE)\partial_t\rho = \varepsilon_0\partial_t(\nabla\cdot\mathbf{E}) = \nabla\cdot(\varepsilon_0\partial_t\mathbf{E}). Adding the displacement current ε0tE\varepsilon_0\partial_t\mathbf{E} to the right side restores consistency: (×B)=μ0(J+tρ)=0\nabla\cdot(\nabla\times\mathbf{B}) = \mu_0(\nabla\cdot\mathbf{J} + \partial_t\rho) = 0 by continuity. \square

Corollary (Electromagnetic waves propagate at speed c).

In vacuum (ρ=0\rho = 0, J=0\mathbf{J} = 0), the fields E\mathbf{E} and B\mathbf{B} satisfy the wave equation with propagation speed

c=1μ0ε03.0×108  m/sc = \frac{1}{\sqrt{\mu_0\varepsilon_0}} \approx 3.0\times 10^8\;\text{m/s}

equal to the measured speed of light.

Proof. Apply ×\nabla\times to [eq:faraday-eq] (Faraday's law): ×(×E)=t(×B)=μ0ε0t2E\nabla\times(\nabla\times\mathbf{E}) = -\partial_t(\nabla\times\mathbf{B}) = -\mu_0\varepsilon_0\partial_t^2\mathbf{E} using [eq:ampere-eq] (vacuum Ampere–Maxwell law). The identity ×(×E)=(E)2E=2E\nabla\times(\nabla\times\mathbf{E}) = \nabla(\nabla\cdot\mathbf{E}) - \nabla^2\mathbf{E} = -\nabla^2\mathbf{E} (with E=0\nabla\cdot\mathbf{E} = 0 in vacuum) gives 2E=μ0ε0t2E\nabla^2\mathbf{E} = \mu_0\varepsilon_0\partial_t^2\mathbf{E}. Maxwell identified c=1/μ0ε03×108c = 1/\sqrt{\mu_0\varepsilon_0} \approx 3\times10^8 m/s with the known speed of light and concluded that light is an electromagnetic wave. The same derivation for B\mathbf{B} gives an identical equation. \square

Remark (Natural units convention).

For the rest of this post, when it reduces clutter without ambiguity, we set c=1c = 1 and ε0=1\varepsilon_0 = 1 (natural units). The speed-of-light factors can be reinstated by dimensional analysis: every time t\partial_t appears where a spatial derivative would sit, supply a factor of 1/c1/c.

2. The Vector Potential and Gauge Freedom

The equation B=0\nabla \cdot \mathbf{B} = 0 says that the magnetic field has no divergence. A standard result from vector calculus (the Poincaré lemma in disguise) says that any divergence-free vector field on a simply connected domain can be written as the curl of another vector field. This defines the magnetic vector potential.

Definition (Electromagnetic potentials).

Because B=0\nabla \cdot \mathbf{B} = 0, we write

B=×A\mathbf{B} = \nabla \times \mathbf{A}

for a vector field A(x,t)\mathbf{A}(\mathbf{x},t) called the magnetic vector potential. Substituting into Faraday's law gives ×(E+tA)=0\nabla \times (\mathbf{E} + \partial_t \mathbf{A}) = 0, so the combination E+tA\mathbf{E} + \partial_t \mathbf{A} is curl-free and can be written as the gradient of a scalar:

E=φAt\mathbf{E} = -\nabla \varphi - \frac{\partial \mathbf{A}}{\partial t}

The scalar φ(x,t)\varphi(\mathbf{x},t) is the electric scalar potential.

The fields E and B are determined by the pair (φ,A)(\varphi, \mathbf{A}), but not uniquely: different potentials can produce the same fields.

Theorem (Gauge invariance).

For any smooth scalar function χ(x,t)\chi(\mathbf{x},t), the transformation

A    A=A+χφ    φ=φχt\mathbf{A} \;\longrightarrow\; \mathbf{A}' = \mathbf{A} + \nabla\chi \qquad \varphi \;\longrightarrow\; \varphi' = \varphi - \frac{\partial \chi}{\partial t}

leaves the physical fields E and B unchanged: E=E\mathbf{E}' = \mathbf{E} and B=B\mathbf{B}' = \mathbf{B}.

Proof. Direct substitution: B=×(A+χ)=×A+×χ=B+0=B\mathbf{B}' = \nabla \times (\mathbf{A} + \nabla\chi) = \nabla \times \mathbf{A} + \nabla \times \nabla\chi = \mathbf{B} + 0 = \mathbf{B} since any gradient is curl-free. For E: E=(φtχ)t(A+χ)=φ+tχtAtχ=E\mathbf{E}' = -\nabla(\varphi - \partial_t\chi) - \partial_t(\mathbf{A} + \nabla\chi) = -\nabla\varphi + \nabla\partial_t\chi - \partial_t\mathbf{A} - \partial_t\nabla\chi = \mathbf{E} since partial derivatives commute. \square

The freedom in choosing χ\chi is called gauge freedom. Two gauge choices that often appear in practice:

Definition (Lorenz and Coulomb gauges).

The Lorenz gauge imposes the covariant condition A+tφ=0\nabla \cdot \mathbf{A} + \partial_t \varphi = 0, which reduces the potential equations to decoupled wave equations: φ=ρ/ε0\Box\varphi = \rho/\varepsilon_0 and A=μ0J\Box\mathbf{A} = \mu_0\mathbf{J}, where =t22\Box = \partial_t^2 - \nabla^2 is the d'Alembertian.

The Coulomb gauge imposes A=0\nabla \cdot \mathbf{A} = 0, which makes the scalar potential satisfy Poisson's equation 2φ=ρ/ε0\nabla^2 \varphi = -\rho/\varepsilon_0 instantaneously, without retardation. This gauge is often used in quantum mechanics and in the canonical quantization of the electromagnetic field.

The potentials (φ,A)(\varphi, \mathbf{A}) were originally treated as mathematical auxiliaries with no independent physical meaning. Section 8 shows this view is wrong: the Aharonov–Bohm effect demonstrates that the vector potential has measurable physical consequences in regions where B=0\mathbf{B} = 0.

3. Differential Forms and the Faraday Tensor

Vector calculus works beautifully in flat three-dimensional space, but it cannot be easily extended to curved spacetime or to general dimensions. Differential forms provide the coordinate-free language that makes this extension natural. The key payoff: all four Maxwell equations reduce to two compact statements.

A kk-form on a smooth manifold MM is a field of antisymmetric multilinear maps on the tangent space, eating kk tangent vectors and returning a number. These spaces are organized into the de Rham complex: a sequence of vector spaces connected by the exterior derivative d\mathrm{d}, which maps kk-forms to (k+1)(k+1)-forms, subject to the central identity d2=0\mathrm{d}^2 = 0. This single algebraic fact encodes conservation laws, Bianchi identities, and the existence of gauge potentials all at once.

Definition (The de Rham complex).

On a smooth nn-manifold MM, the de Rham complex is the sequence

0    Ω0(M)    d    Ω1(M)    d    Ω2(M)    d        d    Ωn(M)    00 \;\longrightarrow\; \Omega^0(M) \;\xrightarrow{\;\mathrm{d}\;}\; \Omega^1(M) \;\xrightarrow{\;\mathrm{d}\;}\; \Omega^2(M) \;\xrightarrow{\;\mathrm{d}\;}\; \cdots \;\xrightarrow{\;\mathrm{d}\;}\; \Omega^n(M) \;\longrightarrow\; 0

where Ωk(M)\Omega^k(M) is the space of smooth kk-forms on MM and each arrow is the exterior derivative. The defining property d2=0\mathrm{d}^2 = 0 says: applying the exterior derivative twice gives zero, so every exact form (one of the form dη\mathrm{d}\eta) is automatically closed (satisfies dω=0\mathrm{d}\omega = 0).

In three-dimensional Euclidean space R3\mathbb{R}^3, the sequence specializes to

0    Ω0(R3)        Ω1(R3)    ×    Ω2(R3)        Ω3(R3)    00 \;\longrightarrow\; \Omega^0(\mathbb{R}^3) \;\xrightarrow{\;\nabla\;}\; \Omega^1(\mathbb{R}^3) \;\xrightarrow{\;\nabla\times\;}\; \Omega^2(\mathbb{R}^3) \;\xrightarrow{\;\nabla\cdot\;}\; \Omega^3(\mathbb{R}^3) \;\longrightarrow\; 0

and the identity d2=0\mathrm{d}^2 = 0 recovers two classical identities simultaneously: ×(f)=0\nabla \times (\nabla f) = \mathbf{0} (the curl of any gradient is zero) and (×A)=0\nabla \cdot (\nabla \times \mathbf{A}) = 0 (the divergence of any curl is zero). These are not independent facts; they are both instances of d2=0\mathrm{d}^2 = 0 read at different grades of the complex.

A kk-form ω\omega is closed if dω=0\mathrm{d}\omega = 0, and exact if ω=dη\omega = \mathrm{d}\eta for some (k1)(k{-}1)-form η\eta. Every exact form is closed, but whether every closed form is exact depends on the global topology of MM. The kk-th de Rham cohomology group measures the gap:

Hk(M)  =  ker ⁣(d:Ωk(M)Ωk+1(M))im ⁣(d:Ωk1(M)Ωk(M))H^k(M) \;=\; \frac{\ker\!\bigl(\mathrm{d} : \Omega^k(M) \to \Omega^{k+1}(M)\bigr)}{\operatorname{im}\!\bigl(\mathrm{d} : \Omega^{k-1}(M) \to \Omega^k(M)\bigr)}

A nonzero class in Hk(M)H^k(M) corresponds to a closed form that cannot globally be written as an exact form, signaling a kk-dimensional topological hole in MM.

The Poincaré lemma provides the converse on contractible domains: on any contractible open set URnU \subset \mathbb{R}^n, every closed form is exact. In three dimensions, this says that any curl-free vector field on a simply connected region is a gradient, and any divergence-free field is a curl. On non-simply-connected domains (such as R3{a line}\mathbb{R}^3 \setminus \{\text{a line}\}), closed forms need not be exact, and this failure is precisely what allows the Aharonov–Bohm effect (Section 8).

The de Rham complex is the continuum structure that discrete exterior calculus (DEC) faithfully discretizes: smooth manifolds become simplicial complexes, smooth kk-forms become kk-cochains, and the exterior derivative d\mathrm{d} becomes a combinatorial coboundary operator that inherits d2=0\mathrm{d}^2 = 0 exactly from the boundary identity 2=0\partial^2 = 0. Section 10 develops this construction and uses it to formulate Maxwell's equations on a discrete mesh, where B=0\nabla \cdot \mathbf{B} = 0 and gauge invariance hold exactly at the discrete level rather than approximately. For Maxwell's equations, the relevant complex runs over four-dimensional Minkowski spacetime R1,3\mathbb{R}^{1,3} up to Ω4\Omega^4.

Definition (Minkowski spacetime and conventions).

Minkowski spacetime R1,3\mathbb{R}^{1,3} carries coordinates (x0,x1,x2,x3)=(t,x,y,z)(x^0, x^1, x^2, x^3) = (t, x, y, z) with metric signature (+,,,)(+,-,-,-). The metric tensor is η=diag(+1,1,1,1)\eta = \operatorname{diag}(+1,-1,-1,-1). Lowering and raising indices uses ημν\eta_{\mu\nu} and its inverse ημν\eta^{\mu\nu}.

The Levi-Civita symbol εμνρσ\varepsilon^{\mu\nu\rho\sigma} with ε0123=+1\varepsilon^{0123} = +1 defines the Hodge star: for a kk-form ω\omega with components ωμ1μk\omega_{\mu_1\cdots\mu_k}, the Hodge dual ω\star\omega is the (4k)(4-k)-form with components proportional to εμ1μkν1ν4kωμ1μk\varepsilon^{\mu_1\cdots\mu_k}{}_{\nu_1\cdots\nu_{4-k}}\omega_{\mu_1\cdots\mu_k}.

Definition (Faraday 2-form).

The electromagnetic field is encoded in the Faraday 2-form FΩ2(R1,3)F \in \Omega^2(\mathbb{R}^{1,3}), with components

F=12FμνdxμdxνF = \frac{1}{2} F_{\mu\nu}\, \mathrm{d}x^\mu \wedge \mathrm{d}x^\nu

In terms of the electric and magnetic field components:

F0i=EiFij=εijkBkF_{0i} = E_i \qquad F_{ij} = \varepsilon_{ijk} B^k

Explicitly in coordinates:

F=Exdtdx+Eydtdy+Ezdtdz+Bxdydz+Bydzdx+BzdxdyF = E_x\, \mathrm{d}t\wedge \mathrm{d}x + E_y\, \mathrm{d}t\wedge \mathrm{d}y + E_z\, \mathrm{d}t\wedge \mathrm{d}z + B_x\, \mathrm{d}y\wedge \mathrm{d}z + B_y\, \mathrm{d}z\wedge \mathrm{d}x + B_z\, \mathrm{d}x\wedge \mathrm{d}y

The current density defines a 1-form J=ρdt+Jxdx+Jydy+JzdzJ = -\rho\, \mathrm{d}t + J_x\, \mathrm{d}x + J_y\, \mathrm{d}y + J_z\, \mathrm{d}z (or its Hodge dual 3-form, depending on convention).

Theorem (Maxwell's equations as two differential form equations).

The four Maxwell equations are equivalent to

dF=0dF=J\mathrm{d}F = 0 \qquad\qquad \mathrm{d}{\star}F = {\star}J

The first equation, dF=0\mathrm{d}F = 0, encodes Faraday's law and the no-monopole condition B=0\nabla \cdot \mathbf{B} = 0. The second, dF=J\mathrm{d}{\star}F = {\star}J, encodes Gauss's law and Ampere's law.

Proof. We work in coordinates (t,x,y,z)(t,x,y,z) with η=diag(+1,1,1,1)\eta = \mathrm{diag}(+1,-1,-1,-1). We use the convention in which the component form of FF is F0i=EiF_{0i} = -E_i, Fij=εijkBkF_{ij} = \varepsilon_{ijk}B_k (consistent with the STA bivector in Definition 4.2). In this convention the field 2-form reads

F  =  Exdtdx    Eydtdy    Ezdtdz  +  Bxdydz  +  Bydzdx  +  Bzdxdy.F \;=\; -E_x\,\mathrm{d}t\wedge\mathrm{d}x \;-\; E_y\,\mathrm{d}t\wedge\mathrm{d}y \;-\; E_z\,\mathrm{d}t\wedge\mathrm{d}z \;+\; B_x\,\mathrm{d}y\wedge\mathrm{d}z \;+\; B_y\,\mathrm{d}z\wedge\mathrm{d}x \;+\; B_z\,\mathrm{d}x\wedge\mathrm{d}y.

Part 1: dF=0\mathrm{d}F = 0. Because dF\mathrm{d}F is a 3-form in four dimensions it has exactly four independent components. The general formula for the antisymmetrised derivative of a 2-form is

(dF)λμν  =  λFμν+μFνλ+νFλμ.(\mathrm{d}F)_{\lambda\mu\nu} \;=\; \partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu}.

The (x,y,z)(x,y,z) component — the only purely spatial triple. Only the magnetic 2-forms contribute (the electric terms involve dt\mathrm{d}t and vanish on a purely spatial triple):

(dF)xyz  =  xFyz+yFzx+zFxy  =  xBx+yBy+zBz  =  B.(\mathrm{d}F)_{xyz} \;=\; \partial_x F_{yz} + \partial_y F_{zx} + \partial_z F_{xy} \;=\; \partial_x B_x + \partial_y B_y + \partial_z B_z \;=\; \nabla\cdot\mathbf{B}.

Setting this to zero gives B=0\nabla\cdot\mathbf{B} = 0, the no-monopole condition.

The (t,x,y)(t,x,y) component. Only ExE_x, EyE_y, and BzB_z contribute, since these are the only field components whose basis 2-forms share two of the three indices {t,x,y}\{t,x,y\}:

(dF)txy  =  tFxy+xFyt+yFtx  =  tBz+xEyyEx.(\mathrm{d}F)_{txy} \;=\; \partial_t F_{xy} + \partial_x F_{yt} + \partial_y F_{tx} \;=\; \partial_t B_z + \partial_x E_y - \partial_y E_x.

Setting this to zero gives tBz=yExxEy=(×E)z\partial_t B_z = \partial_y E_x - \partial_x E_y = -(\nabla\times\mathbf{E})_z, the zz-component of Faraday's law tB+×E=0\partial_t\mathbf{B} + \nabla\times\mathbf{E} = 0. The remaining two components follow by cycling (x,y,z)(x,y,z):

(dF)tyz=tBx+zEyyEz=0,(dF)txz=tBy+xEzzEx=0,(\mathrm{d}F)_{tyz} = \partial_t B_x + \partial_z E_y - \partial_y E_z = 0, \qquad (\mathrm{d}F)_{txz} = -\partial_t B_y + \partial_x E_z - \partial_z E_x = 0,

which are the xx- and yy-components of tB+×E=0\partial_t\mathbf{B} + \nabla\times\mathbf{E} = 0.

Part 2: dF=J\mathrm{d}{\star}F = {\star}J. The Hodge star in (R1,3,η)(\mathbb{R}^{1,3}, \eta) acts on basis 2-forms by (dxμdxν)=12ημαηνβεαβρσdxρdxσ{\star}(\mathrm{d}x^\mu\wedge \mathrm{d}x^\nu) = \tfrac{1}{2}\eta^{\mu\alpha}\eta^{\nu\beta}\varepsilon_{\alpha\beta\rho\sigma}\,\mathrm{d}x^\rho\wedge\mathrm{d}x^\sigma with ε0123=+1\varepsilon_{0123}=+1. Applying this to FF, using F=F\star\star F = -F to verify signs:

F  =  Bxdtdx    Bydtdy    Bzdtdz    Ezdxdy  +  Eydxdz    Exdydz.{\star}F \;=\; -B_x\,\mathrm{d}t\wedge\mathrm{d}x \;-\; B_y\,\mathrm{d}t\wedge\mathrm{d}y \;-\; B_z\,\mathrm{d}t\wedge\mathrm{d}z \;-\; E_z\,\mathrm{d}x\wedge\mathrm{d}y \;+\; E_y\,\mathrm{d}x\wedge\mathrm{d}z \;-\; E_x\,\mathrm{d}y\wedge\mathrm{d}z.

The Hodge star has swapped the electric and magnetic sectors. Now applying d\mathrm{d} to F{\star}F:

The (x,y,z)(x,y,z) component. Only the spatial 2-forms in F{\star}F contribute a purely spatial 3-form:

(dF)xyz  =  x(Ex)+y(Ey)+z(Ez)  =  E.(\mathrm{d}{\star}F)_{xyz} \;=\; \partial_x(-E_x) + \partial_y(-E_y) + \partial_z(-E_z) \;=\; -\nabla\cdot\mathbf{E}.

The source 3-form J{\star}J (the Hodge dual of the current 1-form J=ρdtJxdxJydyJzdzJ = \rho\,\mathrm{d}t - J_x\,\mathrm{d}x - J_y\,\mathrm{d}y - J_z\,\mathrm{d}z) has (J)xyz=ρ({\star}J)_{xyz} = -\rho in this convention. Setting (dF)xyz=(J)xyz(\mathrm{d}{\star}F)_{xyz} = ({\star}J)_{xyz} gives E=ρ\nabla\cdot\mathbf{E} = \rho, Gauss's law.

The (t,x,y)(t,x,y) component. The magnetic 2-forms Bxdtdx-B_x\,\mathrm{d}t\wedge\mathrm{d}x, Bydtdy-B_y\,\mathrm{d}t\wedge\mathrm{d}y and the spatial form Ezdxdy-E_z\,\mathrm{d}x\wedge\mathrm{d}y contribute:

(dF)txy  =  tEzxBy+yBx  =  tEz+(×B)z.(\mathrm{d}{\star}F)_{txy} \;=\; -\partial_t E_z - \partial_x B_y + \partial_y B_x \;=\; -\partial_t E_z + (\nabla\times\mathbf{B})_z.

Setting equal to (J)txy=Jz({\star}J)_{txy} = -J_z gives (×B)ztEz=Jz(\nabla\times\mathbf{B})_z - \partial_t E_z = J_z, the zz-component of Ampere's law ×BtE=J\nabla\times\mathbf{B} - \partial_t\mathbf{E} = \mathbf{J}. The (t,y,z)(t,y,z) and (t,x,z)(t,x,z) components give the remaining components by the same calculation.

All eight equations — four from [eq:maxwell-forms-eq] — are precisely Maxwell's four equations in classical form. \square

Remark (Gauge potential from the Poincaré lemma).

Since d2=0\mathrm{d}^2 = 0, the first equation in [eq:maxwell-forms-eq] says F is a closed 2-form. On a contractible domain, the Poincaré lemma guarantees a 1-form A such that F=dAF = \mathrm{d}A. This is the gauge potential in differential form language. The components of A are exactly (φ,Ax,Ay,Az)(\varphi, A_x, A_y, A_z) from Section 2. The gauge transformation AA+dχA \mapsto A + \mathrm{d}\chi leaves F=dAF = \mathrm{d}A unchanged since d(dχ)=0\mathrm{d}(\mathrm{d}\chi) = 0.

The forms language also makes the charge conservation law automatic: applying d\mathrm{d} to dF=J\mathrm{d}{\star}F = {\star}J and using d2=0\mathrm{d}^2 = 0 gives dJ=0\mathrm{d}{\star}J = 0, which in coordinates is tρ+J=0\partial_t \rho + \nabla \cdot \mathbf{J} = 0, the continuity equation.

4. Spacetime Algebra: F=J\nabla F = J

The differential forms approach breaks Maxwell's equations into two statements. Geometric algebra (Clifford algebra applied to spacetime) collapses them into one. The gain is more than notational: the algebraic structure makes the spinor formulation of relativistic quantum mechanics and the Yang–Mills generalization (Section 7) more transparent.

Definition (Spacetime algebra Cl(1,3)).

The spacetime algebra (STA) is the real Clifford algebra Cl(1,3)\mathrm{Cl}(1,3) generated by four basis vectors {γ0,γ1,γ2,γ3}\{\gamma_0, \gamma_1, \gamma_2, \gamma_3\} satisfying

γμγν+γνγμ=2ημν\gamma_\mu \gamma_\nu + \gamma_\nu \gamma_\mu = 2\eta_{\mu\nu}

so that γ02=+1\gamma_0^2 = +1 and γi2=1\gamma_i^2 = -1 for i=1,2,3i=1,2,3. The 16-dimensional algebra has a basis organized by grade: grade 0 (scalars, 1 element), grade 1 (vectors, 4 elements), grade 2 (bivectors, 6 elements), grade 3 (trivectors, 4 elements), grade 4 (pseudoscalar, 1 element). The pseudoscalar is I=γ0γ1γ2γ3I = \gamma_0\gamma_1\gamma_2\gamma_3 with I2=1I^2 = -1.

The spacetime gradient is =γμμ\nabla = \gamma^\mu \partial_\mu, where γμ=ημνγν\gamma^\mu = \eta^{\mu\nu}\gamma_\nu.

Definition (Electromagnetic field as a bivector).

In the STA, the electromagnetic field is a grade-2 multivector (bivector):

F=E+IcBF = \mathbf{E} + Ic\mathbf{B}

where E=Ekγkγ0\mathbf{E} = E_k \gamma_k \gamma_0 and B=Bkγkγ0\mathbf{B} = B_k \gamma_k \gamma_0 are relative bivectors (bivectors with one timelike leg), and II is the pseudoscalar. In natural units c=1c = 1:

F=Exγ1γ0+Eyγ2γ0+Ezγ3γ0+Bxγ2γ3+Byγ3γ1+Bzγ1γ2F = E_x \gamma_1\gamma_0 + E_y \gamma_2\gamma_0 + E_z \gamma_3\gamma_0 + B_x \gamma_2\gamma_3 + B_y \gamma_3\gamma_1 + B_z \gamma_1\gamma_2
Theorem (Maxwell's equation in the STA).

All four Maxwell equations are equivalent to the single multivector equation

F=J\nabla F = J

where J=JμγμJ = J^\mu \gamma_\mu is the 4-current. The vector part F1=J\langle \nabla F \rangle_1 = J encodes Gauss and Ampere. The trivector part F3=0\langle \nabla F \rangle_3 = 0 (when F\nabla F is equated to the grade-1 object J) encodes Faraday and the no-monopole condition.

Proof. Write =γμμ\nabla = \gamma^\mu\partial_\mu with γμ=ημνγν\gamma^\mu = \eta^{\mu\nu}\gamma_\nu, so γ0=γ0\gamma^0 = \gamma_0 and γi=γi\gamma^i = -\gamma_i. The Clifford product of a grade-1 vector vv with a grade-2 bivector FF decomposes by grade:

γμμF  =  γμ(μF)  +  γμ(μF),\gamma^\mu \partial_\mu F \;=\; \gamma^\mu \cdot (\partial_\mu F) \;+\; \gamma^\mu \wedge (\partial_\mu F),

where 1\langle\cdot\rangle_1 (the dot part) decreases grade by 1 and 3\langle\cdot\rangle_3 (the wedge part) increases grade by 1. Since FF is a bivector, F\nabla F is the sum of a grade-1 vector and a grade-3 trivector.

Grade-1 part: inhomogeneous equations. Expanding γμ(μF)\gamma^\mu \cdot (\partial_\mu F) in the canonical blade basis (computed explicitly via the Clifford multiplication rules γμγν=γνγμ\gamma_\mu\gamma_\nu = -\gamma_\nu\gamma_\mu for μν\mu\neq\nu and γμ2=ημμ\gamma_\mu^2 = \eta_{\mu\mu}):

F1  =  (xEx+yEy+zEz)γ0  +  (zByyBztEx)γ1  +  \langle\nabla F\rangle_1 \;=\; (\partial_x E_x + \partial_y E_y + \partial_z E_z)\,\gamma_0 \;+\; (\partial_z B_y - \partial_y B_z - \partial_t E_x)\,\gamma_1 \;+\; \cdots

The γ0\gamma_0 coefficient is E\nabla\cdot\mathbf{E}; equating to J0=ρJ^0 = \rho gives Gauss's law. The γk\gamma_k coefficient is (×B)ktEk(\nabla\times\mathbf{B})_k - \partial_t E_k; equating to JkJ^k gives Ampere's law ×BtE=J\nabla\times\mathbf{B} - \partial_t\mathbf{E} = \mathbf{J}.

Grade-3 part: homogeneous equations. Expanding γμ(μF)\gamma^\mu\wedge(\partial_\mu F) in the trivector basis:

F3  =  (tBz+yExxEy)γ0γ1γ2  +  (tBx+zEyyEz)γ0γ2γ3  +  \langle\nabla F\rangle_3 \;=\; (\partial_t B_z + \partial_y E_x - \partial_x E_y)\,\gamma_0\gamma_1\gamma_2 \;+\; (\partial_t B_x + \partial_z E_y - \partial_y E_z)\,\gamma_0\gamma_2\gamma_3 \;+\; \cdots

The γ0γkγl\gamma_0\gamma_k\gamma_l coefficients are the components of tB+×E\partial_t\mathbf{B} + \nabla\times\mathbf{E}; each set to zero gives a component of Faraday's law. The purely spatial trivector coefficient is

(xBx+yBy+zBz)γ1γ2γ3  =  (B)Iγ0,(\partial_x B_x + \partial_y B_y + \partial_z B_z)\,\gamma_1\gamma_2\gamma_3 \;=\; (\nabla\cdot\mathbf{B})\,I\gamma_0,

which set to zero gives B=0\nabla\cdot\mathbf{B}=0. Since J=JμγμJ = J^\mu\gamma_\mu is a grade-1 object, equating F=J\nabla F = J forces the grade-3 part to vanish identically, recovering all four homogeneous equations simultaneously. The grade-1 equation then yields the four inhomogeneous equations of [eq:maxwell-forms-eq]. \square

Remark (Yang–Mills in geometric algebra).

For a non-abelian gauge field (Section 7), the STA equation becomes

F+[A,F]=J\nabla F + [A, F] = J

where the bracket is the commutator in the Lie algebra, and F=A+A2F = \nabla A + A^2 is the non-abelian curvature. The additional commutator term reflects the self-interaction of non-abelian gauge bosons: gluons in QCD, for instance, carry color charge and interact with each other, unlike photons. The STA formulation makes visible that this self-interaction is a purely algebraic consequence of non-commutativity.

5. Principal Fibre Bundles

The gauge potential A of electromagnetism is a 1-form on spacetime with values in a Lie algebra. This structure has a precise geometric home: it is a connection on a principal fibre bundle. Understanding that home requires building up the definition carefully, but the physical picture is always a family of gauge-equivalent copies of the structure group, one sitting over each point of spacetime.

Think of a helix wrapping around a cylinder. The cylinder is the base space (the spacetime you navigate), the circle at each height is the fibre (the gauge group), and the helix tells you how to move consistently from one fibre to the next. That consistent prescription is a connection.

Definition (Principal G-bundle).

A principal GG-bundle over a smooth manifold MM is a smooth manifold PP together with:

  1. a smooth surjection π:PM\pi: P \to M (the bundle projection),
  2. a smooth free right action P×GPP \times G \to P, written ppgp \mapsto p \cdot g, that preserves fibres: π(pg)=π(p)\pi(p \cdot g) = \pi(p) for all pPp \in P, gGg \in G,
  3. local triviality: each xMx \in M has an open neighborhood UU and a diffeomorphism ϕ:π1(U)U×G\phi: \pi^{-1}(U) \to U \times G intertwining the right action with right multiplication in G.

The fibre over xx is π1(x)G\pi^{-1}(x) \cong G. A local section is a smooth map σ:UP\sigma: U \to P with πσ=idU\pi \circ \sigma = \mathrm{id}_U.

Example (The Hopf bundle S³ → S²).

The 3-sphere S3C2S^3 \subset \mathbb{C}^2 is the set of pairs (z0,z1)(z_0, z_1) with z02+z12=1|z_0|^2 + |z_1|^2 = 1. The unit circle U(1)={eiθ}U(1) = \{e^{i\theta}\} acts on S3S^3 by (z0,z1)eiθ=(eiθz0,eiθz1)(z_0, z_1) \cdot e^{i\theta} = (e^{i\theta}z_0, e^{i\theta}z_1). The quotient S3/U(1)S^3 / U(1) is the 2-sphere S2CP1S^2 \cong \mathbb{CP}^1, and the map π:S3S2\pi: S^3 \to S^2 given by the Hopf map

π(z0,z1)=[z0:z1]\pi(z_0, z_1) = [z_0 : z_1]

is a principal U(1)U(1)-bundle. This bundle is nontrivial: there is no global section, meaning no smooth way to pick one representative from each fibre over all of S2S^2. The obstruction is topological and is measured by the first Chern class c1=1H2(S2,Z)Zc_1 = 1 \in H^2(S^2, \mathbb{Z}) \cong \mathbb{Z}.

Definition (Electromagnetic U(1) bundle).

The gauge group of electromagnetism is U(1)={eiα:αR}U(1) = \{e^{i\alpha} : \alpha \in \mathbb{R}\}. Over Minkowski spacetime R1,3\mathbb{R}^{1,3}, the electromagnetic field lives on a principal U(1)U(1)-bundle PR1,3P \to \mathbb{R}^{1,3}. Since R1,3\mathbb{R}^{1,3} is contractible, this bundle is trivial and a global section exists: this is why we can define A as a globally defined 1-form. Over topologically nontrivial spacetimes (for instance, outside an infinitely long solenoid, where the effective base space is R2{0}\mathbb{R}^2 \setminus \{0\} with fundamental group π1=Z\pi_1 = \mathbb{Z}), the bundle can be genuinely nontrivial. That nontriviality is the Aharonov–Bohm effect.

6. Connections and Curvature

A connection on a principal bundle answers the question: given a path in the base manifold, how do we lift it to a path in the total space in a consistent, GG-equivariant way? The answer defines parallel transport, and the failure of parallel transport to close around loops is curvature.

Concretely, a connection splits the tangent space of PP at each point into a vertical part (tangent to the fibre, "staying at the same base point") and a horizontal part (transverse to the fibre, "moving in the base direction"). The choice of horizontal subspace, required to be G-equivariant and smooth, is the connection.

Definition (Connection 1-form).

A connection on a principal GG-bundle π:PM\pi: P \to M is a g\mathfrak{g}-valued 1-form ωΩ1(P,g)\omega \in \Omega^1(P, \mathfrak{g}) satisfying:

  1. ω(X)=X\omega(X^\dagger) = X for all XgX \in \mathfrak{g}, where XX^\dagger is the fundamental vector field generated by X,
  2. Rgω=Adg1ωR_g^* \omega = \mathrm{Ad}_{g^{-1}} \omega (G-equivariance under right multiplication).

Given a local section σ:UP\sigma: U \to P, the local gauge potential is the pullback

A=σωΩ1(U,g)A = \sigma^* \omega \in \Omega^1(U, \mathfrak{g})

This is exactly the gauge field A of physics.

Theorem (Gauge transformation of the local potential).

If σ=σg\sigma' = \sigma \cdot g is a different local section (a gauge transformation by a smooth map g:UGg: U \to G), then the local potentials are related by

A=g1Ag+g1dgA' = g^{-1} A g + g^{-1} \mathrm{d}g

For the abelian case G=U(1)G = U(1) with g=eiχg = e^{i\chi}, this reduces to

A=A+idχA' = A + i\, \mathrm{d}\chi

which, after identifying Aie(φdtAdx)A \leftrightarrow -ie(\varphi\, \mathrm{d}t - \mathbf{A}\cdot \mathrm{d}\mathbf{x}), recovers [eq:gauge-transform-eq] from Section 2.

Definition (Curvature 2-form).

The curvature of a connection ω\omega is the g\mathfrak{g}-valued 2-form

Ω=dω+12[ω,ω]Ω2(P,g)\Omega = \mathrm{d}\omega + \frac{1}{2}[\omega, \omega] \in \Omega^2(P, \mathfrak{g})

where the bracket is the Lie bracket in g\mathfrak{g}. Locally, via a section σ\sigma, the curvature pulls back to

F=dA+12[A,A]=dA+AAF = \mathrm{d}A + \frac{1}{2}[A, A] = \mathrm{d}A + A \wedge A

For the abelian case G=U(1)G = U(1), the Lie algebra u(1)=iRu(1) = i\mathbb{R} is commutative, so [A,A]=0[A,A] = 0 and

F=dAF = \mathrm{d}A

This is [eq:maxwell-forms-eq], the Faraday 2-form of Section 3. The four Maxwell equations follow by imposing the Yang–Mills equations (Section 7) on this abelian bundle.

Remark (Bianchi identity from geometry).

The Bianchi identity dF=0\mathrm{d}F = 0 (for the abelian case) is not a physical law but a geometric identity. It says d(dA)=0\mathrm{d}(\mathrm{d}A) = 0, which follows from d2=0\mathrm{d}^2 = 0. In the non-abelian case the identity becomes the covariant Bianchi identity DF=dF+[A,F]=0DF = \mathrm{d}F + [A, F] = 0, reflecting the structure equations of the Lie algebra.

7. Yang–Mills Theory

In 1954, Chen-Ning Yang and Robert Mills asked: what happens if the gauge group is non-abelian? The same geometric structure (principal bundle, connection, curvature) applies, but the non-commutativity of the group means the gauge bosons themselves carry charge and interact. In electromagnetism, photons are electrically neutral and do not interact with each other. In a non-abelian gauge theory, the gauge bosons have charges under their own gauge group and self-interact.

Definition (Yang–Mills action).

For a principal GG-bundle with connection A and curvature F=dA+AAF = \mathrm{d}A + A \wedge A, the Yang–Mills action is

SYM[A]=12g2MTr(FF)S_{\mathrm{YM}}[A] = -\frac{1}{2g^2}\int_{M} \mathrm{Tr}(F \wedge {\star}F)

where Tr\mathrm{Tr} is the Killing form on g\mathfrak{g} and {\star} is the Hodge star of the spacetime metric. For G=U(1)G = U(1) with F=dAF = \mathrm{d}A, this is proportional to (E2+B2)d4x\int (|\mathbf{E}|^2 + |\mathbf{B}|^2)\, \mathrm{d}^4x, the electromagnetic field energy.

Theorem (Yang–Mills equations).

The Euler-Lagrange equations of SYMS_{\mathrm{YM}} under variation of A are

DF=JD {\star} F = {\star} J

where D=d+[A,]D = \mathrm{d} + [A, \cdot] is the covariant exterior derivative. Together with the Bianchi identity DF=0DF = 0, these are the Yang–Mills equations. For G=U(1)G = U(1) they reduce to [eq:maxwell-forms-eq] (Maxwell's equations).

Example (Standard Model gauge group).

The Standard Model of particle physics uses the gauge group G=U(1)×SU(2)×SU(3)G = U(1) \times SU(2) \times SU(3):

  • The U(1)U(1) factor gives the hypercharge gauge field, which mixes with the SU(2)SU(2) gauge boson to produce the photon (electromagnetism) and the Z boson after spontaneous symmetry breaking.
  • The SU(2)SU(2) factor governs the weak nuclear force, with gauge bosons W±W^\pm and Z0Z^0.
  • The SU(3)SU(3) factor governs quantum chromodynamics (QCD), with 8 gluon gauge bosons carrying color charge. Because SU(3)SU(3) is non-abelian, gluons interact with each other, producing confinement: quarks cannot be extracted individually from hadrons.

Each of these three factors is a Yang–Mills theory on the same principal bundle framework.

Remark (Millennium Prize: Yang–Mills mass gap).

Pure SU(2)SU(2) Yang–Mills theory on R4\mathbb{R}^4 is expected on physical and numerical grounds to have a mass gap: the lowest energy excitation above the vacuum has energy at least Δ>0\Delta > 0. This would explain why the weak and strong forces are short-range (massive mediators) despite the gauge bosons being classically massless. A rigorous mathematical proof (constructing a quantum Yang–Mills theory on R4\mathbb{R}^4 and proving a positive mass gap) has not been found. The Clay Mathematics Institute lists this as one of the seven Millennium Prize Problems, each carrying a $1,000,000 prize.[3]

8. The Aharonov–Bohm Effect

Consider a solenoid: a tightly wound coil of wire carrying current. Inside the solenoid there is a strong magnetic field. Outside, by symmetry and Ampere's law, the field is nearly zero. Classical electromagnetism predicts that electrons traveling only in the field-free region outside the solenoid should be unaffected by whatever happens inside.

In 1959, Yakir Aharonov and David Bohm predicted that this classical intuition is wrong. Even though B=0B = 0 outside, the vector potential AA is nonzero (it must be, since B=×AB = \nabla\times A is nonzero inside). And the vector potential has a directly measurable effect on quantum mechanics: it shifts the phase of the electron wavefunction along any path. If two electron paths encircle the solenoid from opposite sides and recombine, the phase difference between the paths depends on the total enclosed magnetic flux.

Theorem (Aharonov–Bohm phase shift).

An electron wavefunction acquires a phase

ψ    ei(e/)γAψ\psi \;\longrightarrow\; e^{i(e/\hbar)\int_\gamma A}\,\psi

along a path γ\gamma in spacetime, where ee is the electron charge and A is the electromagnetic gauge potential. For two paths γ1\gamma_1 (upper) and γ2\gamma_2 (lower) that together encircle the solenoid, the relative phase difference is

Δφ=eγ1γ2A=eΣBdS=eΦ=2πΦΦ0\Delta\varphi = \frac{e}{\hbar} \oint_{\gamma_1 - \gamma_2} A = \frac{e}{\hbar} \iint_\Sigma B \cdot \mathrm{d}\mathbf{S} = \frac{e\Phi}{\hbar} = \frac{2\pi\Phi}{\Phi_0}

where Φ=BdS\Phi = \iint B \cdot \mathrm{d}\mathbf{S} is the total magnetic flux through the solenoid, Φ0=h/e\Phi_0 = h/e is the magnetic flux quantum, and Σ\Sigma is any surface bounded by γ1γ2\gamma_1 - \gamma_2. The second equality uses Stokes' theorem. The effect is nonzero whenever Φ0\Phi \neq 0, regardless of whether B is zero on both paths.

The prediction was confirmed experimentally by Chambers (1960) and definitively by Tonomura and collaborators (1986) using electron holography with a toroidal magnet fully enclosed in a superconducting shield, ensuring B=0\mathbf{B} = 0 everywhere on the electron paths to very high precision.

Drag the flux slider to shift the interference fringe pattern. Even though B=0\mathbf{B} = 0 outside the solenoid, the phase difference [eq:ab-phase-eq] grows linearly with flux.

In the language of fibre bundles, the AB phase is the holonomy of the connection A around the closed loop formed by the two paths. Holonomy measures how much parallel transport around a loop fails to return to the starting point in the fibre. For an abelian bundle, holonomy is a phase in U(1)U(1).

Remark (AB effect as holonomy).

The AB phase around a loop γ\gamma is

Holγ(A)=exp ⁣(ieγA)U(1)\mathrm{Hol}_\gamma(A) = \exp\!\left(\frac{ie}{\hbar}\oint_\gamma A\right) \in U(1)

This can be nonzero even when the curvature F=dA=0F = \mathrm{d}A = 0 along the entire path (since B=0\mathbf{B} = 0 outside the solenoid). The discrepancy is possible because the loop is not contractible in the base space: R2{0}\mathbb{R}^2 \setminus \{0\} has fundamental group π1=Z\pi_1 = \mathbb{Z}. The winding number of the loop around the excluded origin counts how many times the path encircles the solenoid. A flat bundle (zero curvature) can still have nontrivial holonomy on a topologically nontrivial base.

This shows that A is not just a gauge artifact. The physical content of electromagnetism is not fully captured by the local fields E and B; the full content is the connection on the principal U(1)U(1) bundle, including its holonomy properties over topologically nontrivial paths.

9. Dirac Monopoles and Chern Classes

No magnetic monopole has ever been observed. But Dirac showed in 1931 that if a single monopole exists anywhere in the universe, electric charge must be quantized in discrete units: every particle's charge is an integer multiple of the electron charge. This is a striking example of topology constraining physics.

The argument uses the same two-patch construction that classifies line bundles over spheres, now called the Wu–Yang construction.

Definition (Dirac monopole setup).

A magnetic monopole of strength gg at the origin creates a radial magnetic field

B=g4πr2r^\mathbf{B} = \frac{g}{4\pi r^2}\hat{\mathbf{r}}

satisfying B=gδ(3)(x)\nabla \cdot \mathbf{B} = g\,\delta^{(3)}(\mathbf{x}). On any sphere SR2S^2_R of radius RR surrounding the origin,

SR2BdS=g\iint_{S^2_R} \mathbf{B} \cdot \mathrm{d}\mathbf{S} = g

The problem: we need to write B=×A\mathbf{B} = \nabla \times \mathbf{A} for a potential A defined on the sphere, but Stokes' theorem would then give S2BdS=S2A=0\iint_{S^2} \mathbf{B} \cdot \mathrm{d}\mathbf{S} = \oint_{\partial S^2} A = 0 (a sphere has no boundary), contradicting the nonzero flux. A globally defined A on S2S^2 cannot exist when the flux is nonzero.

Theorem (Wu–Yang construction and Dirac quantization).

Split S2S^2 into two overlapping patches: a northern hemisphere UNU_N (including the north pole) and a southern hemisphere USU_S (including the south pole), overlapping on an equatorial strip. Define gauge potentials ANA_N on UNU_N and ASA_S on USU_S that correctly reproduce the monopole field on each patch. On the equatorial overlap, they must differ by a gauge transformation:

AS=AN+dχA_S = A_N + \mathrm{d}\chi

For the wavefunction of an electron (charge ee) to be single-valued as it traverses the equator once, the gauge transformation parameter χ\chi must satisfy

eequatordχ=e[χ]equator2πZ\frac{e}{\hbar} \oint_{\text{equator}} \mathrm{d}\chi = \frac{e}{\hbar}[\chi]_{\text{equator}} \in 2\pi\mathbb{Z}

Combined with equator(ANAS)=S2BdS=g\oint_{\text{equator}} (A_N - A_S) = \iint_{S^2} B\cdot \mathrm{d}S = g, this gives the Dirac quantization condition:

eg=2πnnZ\frac{eg}{\hbar} = 2\pi n \qquad n \in \mathbb{Z}

If any magnetic charge gg exists, electric charge ee must be a rational multiple of /g\hbar/g, explaining charge quantization universally.

Definition (First Chern class).

The two-patch construction defines a principal U(1)U(1)-bundle over S2S^2, classified by the transition function on the equatorial overlap. Since the equator is a circle S1S^1, the transition function is a map S1U(1)S^1 \to U(1) classified by its winding number nπ1(U(1))=Zn \in \pi_1(U(1)) = \mathbb{Z}.

This integer nn is the first Chern class c1(P)H2(S2,Z)Zc_1(P) \in H^2(S^2, \mathbb{Z}) \cong \mathbb{Z} of the bundle. In terms of the curvature:

c1=12πS2FZc_1 = \frac{1}{2\pi}\int_{S^2} F \in \mathbb{Z}

For the trivial bundle (no monopole), c1=0c_1 = 0. For the Hopf bundle of Section 5, c1=1c_1 = 1. The bundle of a Dirac monopole with Dirac charge nn has c1=nc_1 = n.

Remark (Topological quantization).

The integrality of c1c_1 is a theorem, not an assumption. For any principal U(1)U(1)-bundle over any compact 2-manifold, the integral of the curvature form over the manifold is an integer multiple of 2π2\pi. This is a special case of the Chern-Weil theorem, which relates the topology of a bundle (characteristic classes, computed from curvature forms) to the topology of the base manifold.

The physical content: charges are quantized because the topology of gauge bundles is quantized. The same integer that counts the winding of the transition function also counts the units of magnetic charge. Topology constrains physics.

10. Orbitals, Maxwell's Equations, and Computational Electrostatics

The four languages developed in the preceding sections describe the electromagnetic field given its sources. In classical physics, those sources are point charges and continuous charge distributions, handled via Dirac deltas and smooth densities as in Section 1. In quantum mechanics, even a single particle has an extended charge distribution: the source term in Maxwell's equations is determined by the particle's quantum state. This section develops the semiclassical coupling, in which the charge distribution is quantum-mechanical but the electromagnetic field is treated classically.

Definition (Quantum charge and current density).

For an electron in a normalized quantum state ψ(x,t)\psi(\mathbf{x},t), the semiclassical charge density and current density sourcing Maxwell's equations are

ρ(x,t)=eψ(x,t)2\rho(\mathbf{x},t) = -e\,|\psi(\mathbf{x},t)|^2J(x,t)=e2mei ⁣(ψψψψ)e2meAψ2\mathbf{J}(\mathbf{x},t) = -\frac{e\hbar}{2m_e i}\!\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) - \frac{e^2}{m_e}\mathbf{A}\,|\psi|^2

where e-e is the electron charge, mem_e is the electron mass, and A\mathbf{A} is the magnetic vector potential. The two terms in J\mathbf{J} are the paramagnetic current (driven by the phase gradient of ψ\psi) and the diamagnetic current (driven by the vector potential itself). The continuity equation tρ+J=0\partial_t\rho + \nabla\cdot\mathbf{J} = 0 is satisfied identically as a consequence of the time-dependent Schrödinger equation, confirming that quantum mechanics is internally consistent with charge conservation.

Example (Charge densities of hydrogen orbitals).

For hydrogen, the Schrödinger equation with Coulomb potential V=e2/(4πε0r)V = -e^2/(4\pi\varepsilon_0 r) has normalized eigenstates ψnlm(r,θ,ϕ)=Rnl(r)Ylm(θ,ϕ)\psi_{nlm}(r,\theta,\phi) = R_{nl}(r)\,Y_l^m(\theta,\phi), where a0=4πε02/(mee2)0.529A˚a_0 = 4\pi\varepsilon_0\hbar^2/(m_e e^2) \approx 0.529\,\text{Å} is the Bohr radius. The charge densities of the lowest orbitals are:

ρ1s(r)=eπa03e2r/a0\rho_{1s}(r) = -\frac{e}{\pi a_0^3}\,e^{-2r/a_0}ρ2pz(r,θ)=e32πa05r2er/a0cos2 ⁣θ\rho_{2p_z}(r,\theta) = -\frac{e}{32\pi a_0^5}\,r^2 e^{-r/a_0}\cos^2\!\theta

Both are smooth, extended functions. The 1s density is spherically symmetric with characteristic scale a0a_0; the 2pz2p_z density has the dumbbell profile of the orbital's angular part. From far away (ra0r \gg a_0), both densities integrate to e-e over all space, so the electron looks like a point charge e-e at the nucleus, consistent with Coulomb's law.

Proposition (Hartree potential from the Poisson equation).

The electrostatic potential sourced by an electron in state ψ\psi satisfies Gauss's law in the form of Poisson's equation:

2φH(x)=eε0ψ(x)2\nabla^2\varphi_H(\mathbf{x}) = \frac{e}{\varepsilon_0}\,|\psi(\mathbf{x})|^2

The solution is the Hartree potential,

φH(x)=e4πε0R3ψ(x)2xxd3x\varphi_H(\mathbf{x}) = -\frac{e}{4\pi\varepsilon_0}\int_{\mathbb{R}^3}\frac{|\psi(\mathbf{x}')|^2}{|\mathbf{x}-\mathbf{x}'|}\,\mathrm{d}^3x'

This is Coulomb's law with the point charge replaced by the quantum charge density. For the 1s orbital, the integral evaluates in closed form via the expansion of 1/xx1/|\mathbf{x}-\mathbf{x}'| in Legendre polynomials, giving a potential that behaves as e/(4πε0r)-e/(4\pi\varepsilon_0 r) for ra0r \gg a_0 and smoothly interpolates to a finite value at the nucleus.

Derivation. The charge density ρ=eψ2\rho = -e|\psi|^2 is the source in Gauss's law [eq:gauss-e]. In the electrostatic case, E=φH\mathbf{E} = -\nabla\varphi_H, so (φH)=ρ/ε0\nabla\cdot(-\nabla\varphi_H) = \rho/\varepsilon_0, giving 2φH=ρ/ε0=(e/ε0)ψ2\nabla^2\varphi_H = -\rho/\varepsilon_0 = (e/\varepsilon_0)|\psi|^2. The integral formula is the Green's function solution of Poisson's equation on R3\mathbb{R}^3 with source eψ2-e|\psi|^2. \square

Self-consistent fields. In a many-electron atom or molecule with NN electrons occupying orbitals {ψi}\{\psi_i\}, the total charge density is ρ=eiψi2\rho = -e\sum_i |\psi_i|^2 and the Hartree potential sourced by all electrons is

VH(x)=e24πε0jψj(x)2xxd3xV_H(\mathbf{x}) = \frac{e^2}{4\pi\varepsilon_0}\sum_{j}\int\frac{|\psi_j(\mathbf{x}')|^2}{|\mathbf{x}-\mathbf{x}'|}\,\mathrm{d}^3x'

Each orbital satisfies a Schrödinger equation with this potential:

[22me2+Vion(x)+VH(x)]ψi(x)=εiψi(x)\left[-\frac{\hbar^2}{2m_e}\nabla^2 + V_{\mathrm{ion}}(\mathbf{x}) + V_H(\mathbf{x})\right]\psi_i(\mathbf{x}) = \varepsilon_i\,\psi_i(\mathbf{x})

The orbitals determine ρ\rho, which sources VHV_H through Poisson's equation, which in turn enters the Schrödinger equation for the orbitals. This circularity is the self-consistent field (SCF) problem: the solution must be found iteratively. Adding exchange corrections (the antisymmetry of the many-electron wavefunction) gives the Hartree–Fock equations; replacing the many-body wavefunction with the electron density ρ(x)\rho(\mathbf{x}) as the primary variable gives density functional theory (DFT), which is exact in principle via the Hohenberg–Kohn theorem and tractable in practice via approximate exchange-correlation functionals.

Minimal coupling and the vector potential. So far only the scalar potential φ\varphi has appeared. The full coupling between an electron and the electromagnetic field is captured by minimal coupling: replace the canonical momentum p\mathbf{p} with the kinetic momentum peA\mathbf{p} - e\mathbf{A} everywhere in the Hamiltonian:

H=(peA)22me+eφ+VionH = \frac{(\mathbf{p} - e\mathbf{A})^2}{2m_e} + e\varphi + V_{\mathrm{ion}}

Expanding and using the Coulomb gauge A=0\nabla\cdot\mathbf{A} = 0 (so that p\mathbf{p} and A\mathbf{A} commute):

H=p22meemeAp+e2A22me+eφ+VionH = \frac{\mathbf{p}^2}{2m_e} - \frac{e}{m_e}\mathbf{A}\cdot\mathbf{p} + \frac{e^2\mathbf{A}^2}{2m_e} + e\varphi + V_{\mathrm{ion}}

The term eAp/me-e\mathbf{A}\cdot\mathbf{p}/m_e is the paramagnetic coupling, responsible for orbital magnetic moments and NMR chemical shifts; the term e2A2/(2me)e^2\mathbf{A}^2/(2m_e) is the diamagnetic coupling. The Aharonov–Bohm phase ei(e/)γAe^{i(e/\hbar)\oint_\gamma A} of Section 8 is the unitary transformation implementing minimal coupling for a wavepacket traveling along γ\gamma: substituting ψei(e/)γAψ\psi \to e^{i(e/\hbar)\int_\gamma A}\psi into the free Schrödinger equation produces exactly the minimally coupled Hamiltonian above. Gauge invariance of the Schrödinger equation under (A,φ)(A+χ,φtχ)(\mathbf{A},\varphi) \to (\mathbf{A}+\nabla\chi,\, \varphi - \partial_t\chi) together with ψei(e/)χψ\psi \to e^{i(e/\hbar)\chi}\psi matches the gauge transformation of Section 2 exactly.

Computational pipeline. Solving the coupled Schrödinger–Poisson system for a molecule or solid proceeds iteratively:

  1. Initialize trial orbitals {ψi(0)}\{\psi_i^{(0)}\} (typically superposed atomic orbitals).
  2. Build the electron density ρ(k)(x)=eiψi(k)(x)2\rho^{(k)}(\mathbf{x}) = -e\sum_i|\psi_i^{(k)}(\mathbf{x})|^2.
  3. Solve Poisson's equation 2φ(k)=ρ(k)/ε0\nabla^2\varphi^{(k)} = -\rho^{(k)}/\varepsilon_0 for the Hartree potential. In plane-wave codes, this is a pointwise division in Fourier space: k2φ~(k)=ρ~(k)/ε0-|\mathbf{k}|^2\,\tilde\varphi(\mathbf{k}) = -\tilde\rho(\mathbf{k})/\varepsilon_0, so φ~(k)=ρ~(k)/(ε0k2)\tilde\varphi(\mathbf{k}) = \tilde\rho(\mathbf{k})/(\varepsilon_0|\mathbf{k}|^2). In real-space codes it is solved by multigrid or finite element methods on a numerical grid.
  4. Build the Kohn–Sham or Hartree–Fock Hamiltonian H(k)H^{(k)} with updated VH(k)=eφ(k)V_H^{(k)} = e\varphi^{(k)}.
  5. Diagonalize: solve H(k)ψi(k+1)=εiψi(k+1)H^{(k)}\psi_i^{(k+1)} = \varepsilon_i\psi_i^{(k+1)}.
  6. Check convergence: if ρ(k+1)ρ(k)<tol\|\rho^{(k+1)} - \rho^{(k)}\| < \text{tol}, accept; otherwise return to step 2.

Step 3 is the point where Maxwell's equations enter the quantum calculation: the charge density generated by quantum mechanics is fed into the classical electrostatic equation, and the resulting potential modifies the quantum dynamics. The loop between step 2 and step 5 is a discrete version of the self-consistent field condition, and its convergence is guaranteed (under mild conditions) by the Banach fixed-point theorem applied to the density mixing map.

Discrete exterior calculus and cochain complexes. The computational methods above (finite element grids, plane-wave Fourier solves) are instances of a common algebraic framework: discrete exterior calculus (DEC). The central idea is to replace the smooth de Rham complex of Section 3 with an exact combinatorial analogue defined on a mesh.

Definition (Cochain complex on a simplicial mesh).

Let KK be a simplicial complex (a mesh of nodes, edges, triangles, and tetrahedra) triangulating a domain in R3\mathbb{R}^3. A kk-cochain is a linear map from the kk-cells of KK to R\mathbb{R}: it assigns a real number to each edge (k=1k=1), each face (k=2k=2), each volume (k=3k=3), or each node (k=0k=0). The space of kk-cochains is Ck(K)C^k(K).

The coboundary operator d:Ck(K)Ck+1(K)\mathrm{d}: C^k(K) \to C^{k+1}(K) is defined by duality with the boundary operator \partial:

(dω)(σ)=ω(σ)(\mathrm{d}\,\omega)(\sigma) = \omega(\partial\sigma)

for any (k+1)(k+1)-cell σ\sigma, where σ\partial\sigma is its oriented boundary (a signed sum of kk-cells). Since 2=0\partial^2 = 0 (the boundary of a boundary is empty), we get d2=0\mathrm{d}^2 = 0 automatically. This gives the discrete cochain complex:

0    C0(K)    d    C1(K)    d    C2(K)    d    C3(K)    00 \;\longrightarrow\; C^0(K) \;\xrightarrow{\;\mathrm{d}\;}\; C^1(K) \;\xrightarrow{\;\mathrm{d}\;}\; C^2(K) \;\xrightarrow{\;\mathrm{d}\;}\; C^3(K) \;\longrightarrow\; 0

In coordinates, d:C0C1\mathrm{d}: C^0 \to C^1 is the node-to-edge incidence matrix, d:C1C2\mathrm{d}: C^1 \to C^2 is the edge-to-face incidence matrix, and d:C2C3\mathrm{d}: C^2 \to C^3 is the face-to-volume incidence matrix. These are sparse integer matrices encoding the combinatorial topology of the mesh.

The de Rham complex of Section 3 is the continuum limit of this construction: as the mesh is refined, smooth kk-forms integrate over kk-cells to give cochains, and the coboundary operator d\mathrm{d} converges to the exterior derivative. Whitney forms (piecewise linear kk-forms associated to each kk-simplex) are the interpolating basis functions that carry this correspondence to the finite element setting; their span over a simplicial complex forms a subcomplex of the de Rham complex, which is the foundation of finite element exterior calculus (FEEC).

Definition (Discrete Hodge star and the primal–dual mesh).

The Hodge star of Section 3 maps kk-forms to (nk)(n-k)-forms using the metric. Discretely, it maps primal kk-cochains to dual (nk)(n-k)-cochains, where the dual mesh KK^* is constructed by placing a dual node at the circumcenter of each primal simplex and connecting dual nodes across shared faces.

The discrete Hodge star :Ck(K)Cnk(K)\star: C^k(K) \to C^{n-k}(K^*) is a diagonal matrix k\star_k whose entries are ratios of dual and primal volumes:

(k)σσ=σσ(\star_k)_{\sigma\sigma} = \frac{|\sigma^*|}{|\sigma|}

where σ|\sigma| is the volume of a primal kk-cell and σ|\sigma^*| is the volume of its dual (nk)(n-k)-cell. The discrete Laplacian on kk-cochains is then

Lk=k11dkd+dk+11dkL_k = \star_{k-1}^{-1}\,\mathrm{d}^\top\,\star_k\,\mathrm{d} + \mathrm{d}\,\star_{k+1}^{-1}\,\mathrm{d}^\top\,\star_k

a sparse symmetric positive semi-definite matrix. On 0-cochains, L0=01d1dL_0 = \star_0^{-1} \mathrm{d}^\top \star_1 \mathrm{d} is the discrete scalar Laplacian: the equation L0φh=ρh/ε0L_0\,\varphi_h = -\rho_h/\varepsilon_0 is the cochain form of Poisson's equation from step 3 of the SCF pipeline.

Theorem (Maxwell's equations as cochain equations).

Place the electromagnetic degrees of freedom on the primal and dual meshes as follows. The scalar potential φ\varphi lives on primal 0-cochains; the vector potential A\mathbf{A} on primal 1-cochains; the magnetic field B\mathbf{B} on primal 2-cochains; and the electric field E\mathbf{E} on dual 1-cochains. With this placement, [eq:maxwell-forms-eq] from Section 3 become exact algebraic identities:

dBh=0(no magnetic monopoles: B=0)\mathrm{d}\,\mathbf{B}_h = 0 \qquad (\text{no magnetic monopoles: } \nabla\cdot\mathbf{B} = 0)dAh=Bh(potential: B=×A)\mathrm{d}\,\mathbf{A}_h = \mathbf{B}_h \qquad (\text{potential: } \mathbf{B} = \nabla\times\mathbf{A})

Both hold at the level of integer arithmetic on the mesh because d2=0\mathrm{d}^2 = 0. The source equations (Gauss and Ampere–Maxwell) become

11d2Bh=μ0Jh+μ0ε0tEh\star_1^{-1}\,\mathrm{d}^\top\,\star_2\,\mathbf{B}_h = \mu_0\,\mathbf{J}_h + \mu_0\varepsilon_0\,\partial_t\mathbf{E}_h01d1Eh=ρh/ε0\star_0^{-1}\,\mathrm{d}^\top\,\star_1\,\mathbf{E}_h = \rho_h/\varepsilon_0

These are the DEC discretization of dF=J\mathrm{d}{\star}F = {\star}J from Section 3.

The Yee finite-difference time-domain (FDTD) grid, the standard computational tool for solving Maxwell's equations in 3D, is a special case of this structure: it is the DEC complex on a Cartesian hexahedral mesh, where E-field components sit on primal edges, B-field components sit on primal faces, and the staggering in space and time ensures that d2=0\mathrm{d}^2 = 0 is satisfied exactly at every grid point and every time step.

Three structural properties follow from the cochain complex that are difficult to achieve by ad hoc discretizations. First, B=0\nabla\cdot\mathbf{B} = 0 is preserved exactly by the time-stepping scheme, not just approximately, because it is an algebraic consequence of d2=0\mathrm{d}^2 = 0. Second, the discrete gauge freedom of Section 2 is preserved: replacing AhAh+dχh\mathbf{A}_h \to \mathbf{A}_h + \mathrm{d}\chi_h leaves Bh=dAh\mathbf{B}_h = \mathrm{d}\mathbf{A}_h unchanged because d2χh=0\mathrm{d}^2\chi_h = 0. Third, the discrete Hodge decomposition gives an exact splitting of any 1-cochain into a gradient part, a curl part, and a harmonic part, enabling gauge-fixed solvers (discrete Lorenz gauge or Coulomb gauge) that are guaranteed to be consistent.

For the SCF pipeline, DEC replaces the ad hoc Poisson solve in step 3 with a principled algebraic system: the charge density ρh\rho_h is a 3-cochain (a number per mesh volume), the potential φh\varphi_h is a 0-cochain (a number per node), and the discrete Poisson operator L0L_0 is the sparse matrix assembled once from the mesh geometry. Solving L0φh=ρh/ε0L_0\varphi_h = -\rho_h/\varepsilon_0 by conjugate gradient (exploiting the symmetry and sparsity of L0L_0) is the core linear algebra step shared by real-space DFT codes such as GPAW and Octopus.

11. Synthesis

We have presented the same physical theory in four languages. Here is what each one shows.

LanguageEquationReveals
Vector calculusE=ρ/ε0\nabla \cdot \mathbf{E} = \rho/\varepsilon_0 etc.Forces on charges, wave propagation, practical computation
Differential formsdF=0,  dF=J\mathrm{d}F = 0,\; \mathrm{d}{\star}F = {\star}JCoordinate freedom, relativistic covariance, Stokes' theorem, charge conservation
Spacetime algebraF=J\nabla F = JSingle compact equation, spinor structure, algebraic unification with gravity
Principal bundlesF=dA+AA,  DF=JF = \mathrm{d}A + A\wedge A,\; D{\star}F = {\star}JGauge invariance as geometry, topological invariants, non-abelian extension

The fibre bundle picture is the most fundamental. The electric and magnetic fields E and B are not the primary objects; they are components of the curvature 2-form FF of a connection AA on a principal U(1)U(1) bundle. The connection AA is itself not quite primary: it depends on a choice of local section (gauge choice), and only gauge-invariant quantities (such as the curvature FF and holonomies along loops) are physical. The primary object is the bundle itself together with the gauge-equivalence class of connections.

Three consequences follow immediately from this geometric viewpoint:

Gauge invariance is geometry. The freedom to choose a gauge is the freedom to choose a local section of the bundle. Changing the section transforms A by Ag1Ag+g1dgA \mapsto g^{-1}Ag + g^{-1}\mathrm{d}g and leaves the curvature FF invariant. This is the geometric origin of gauge invariance, which from the vector calculus viewpoint appears as an arbitrary-seeming symmetry of the potentials.

The Aharonov–Bohm effect is holonomy. The phase acquired by a charged particle traveling around a loop is the holonomy of the connection around that loop. This is nonzero even when the curvature is zero (F=0F = 0 on the path), because the bundle can be topologically nontrivial over the base space with the solenoid excluded. The vector potential A is not redundant: it carries the holonomy information that F does not.

Charge quantization is topology. The first Chern class of a U(1)U(1) bundle is an integer. If magnetic monopoles exist, the Dirac quantization condition forces electric charge to be an integer multiple of a fundamental unit. The integrality is not a dynamical result but a consequence of the fact that bundles are classified by integers.

The same framework, extended to non-abelian gauge groups SU(2)SU(2) and SU(3)SU(3), gives the full Standard Model of particle physics. The weak force is a Yang–Mills theory for SU(2)SU(2); the strong force is Yang–Mills for SU(3)SU(3); both share the principal bundle geometry developed here. The unsolved mass gap problem for SU(2)SU(2) Yang–Mills is a question about the quantum theory of connections: when the classical theory is quantized, do its energy excitations have a positive lower bound? The classical geometry is understood. What remains is the quantum.

References

  1. Y. Aharonov and D. Bohm, "Significance of electromagnetic potentials in the quantum theory," Phys. Rev. 115 (1959), 485–491. DOI
  2. A. Tonomura et al., "Evidence for Aharonov–Bohm effect with magnetic field completely shielded from electron wave," Phys. Rev. Lett. 56 (1986), 792–795. DOI
  3. A. Jaffe and E. Witten, "Quantum Yang–Mills Theory," in The Millennium Prize Problems, Clay Mathematics Institute, 2006. PDF
  4. P. A. M. Dirac, "Quantised singularities in the electromagnetic field," Proc. R. Soc. London A 133 (1931), 60–72. DOI
  5. T. T. Wu and C. N. Yang, "Concept of nonintegrable phase factors and global formulation of gauge fields," Phys. Rev. D 12 (1975), 3845–3857. DOI
  6. J. C. Maxwell, "A dynamical theory of the electromagnetic field," Phil. Trans. R. Soc. London 155 (1865), 459–512. DOI

Comments

Loading…

Leave a comment

or comment as guest

$...$ inline · $$...$$ display

0 / 2000