The animation above shows a sealed, rigid container whose walls curve inward and outward; inside, a swarm of particles begins concentrated near a single point and diffuses outward. The container is a closed 3-manifold (a deformed sphere), and its curvature is not decorative. It determines how the swarm spreads. Near the poles of the container, where the wall curves sharply, the particles are reflected at steeper angles than on the flatter equatorial band. The gold-to-blue colour encodes the heat kernel value Kt(x,x0)=exp(−∣x−x0∣2/4t) evaluated at each particle's position: gold is high concentration, deep blue is low.
That formula is the heat kernel in flat space R3. On a curved manifold it is only an approximation; the true kernel acquires corrections at every order in t whose coefficients are curvature invariants. The first such correction, which appears at order t, is the scalar curvature R(x).
This post derives that result from first principles. We begin with the classical heat equation on Rn and its Gaussian fundamental solution (§1). We then build the Riemannian machinery (metric tensor, covariant gradient, divergence, and Laplace–Beltrami operator) from the ground up (§2). The heat equation on a Riemannian manifold (M,g) has its own fundamental solution, derived through the parametrix construction (§3). We close with the spectral representation of the heat kernel and Weyl's law (§4).
Sign convention. The Laplace–Beltrami operator is defined as ΔM=divg∘gradg, which is negative semi-definite on functions. The heat equation is ∂tu=ΔMu. The competing probabilistic convention replaces this by ∂tu=21ΔMu, matching the generator of Brownian motion; the two conventions differ only by the substitution t↦2t throughout and produce kernels related by Ktprob(x,y)=Kt/2(x,y).
1. The Heat Equation on Rn
1.1. Derivation from conservation
Let u:Rn×[0,∞)→R be the concentration of a diffusing substance. Fick's first law [1] states that diffusive flux opposes the concentration gradient:
J(x,t)=−∇u(x,t).
Conservation of mass over any smooth bounded region Ω⊂Rn requires
dtd∫Ωudx=−∮∂ΩJ⋅n^dσ,
where n^ is the outward unit normal to ∂Ω. Applying the divergence theorem to the right side of [eq:conservation] and substituting [eq:fick]:
dtd∫Ωudx=∫ΩΔudx.
Since Ω is arbitrary and all integrands are continuous, we conclude:
∂tu=Δu,(x,t)∈Rn×(0,∞).
1.2. Solution via the Fourier transform
For f∈L1(Rn) the Fourier transform and its inverse are
where the last step uses the standard Gaussian integral ∫Rne−∣z∣2dz=πn/2, which factors as a product of n one-dimensional integrals ∫−∞∞e−zi2dzi=π [2].
(3) is immediate from ∣x−y∣2=∣y−x∣2.
Proof of (4). Write I=∫RnKs(x,z)Kt(z,y)dz. The prefactors contribute (4πs)−n/2(4πt)−n/2. The combined exponent is
(5) follows from (2) and the standard theory of approximate identities [2, Ch. 5]. □
The semigroup property has a direct visual interpretation. Hold the total time T=s+t fixed and vary the split s↦T−s. The blue kernel Ks and the gold kernel Kt=KT−s change shape continuously, yet their convolution—the muted silver curve KT—is constant throughout.
2. The Riemannian Setup
2.1. Metric tensor and volume form
Definition 0 (Riemannian metric).
A Riemannian metric on a smooth n-manifold M is a smooth, symmetric, positive-definite covariant 2-tensor field g. In local coordinates (x1,…,xn) it reads
g=gijdxi⊗dxj,gij=g(∂xi∂,∂xj∂),
with Einstein summation throughout. The matrix [gij] is symmetric positive definite at every point [3, Ch. 2].
Write g=det[gij] for the determinant of the metric matrix. The Riemannian volume form, the natural measure on (M,g), is
dVg=gdx1∧⋯∧dxn.
This is independent of the chart: under a coordinate change the Jacobian determinant from the transformation of dx1∧⋯∧dxn exactly cancels the factor from g. The L2 inner product on functions is
⟨f,h⟩L2(M,g)=∫MfhdVg.
2.2. Gradient and divergence
Definition 0 (Riemannian gradient).
The gradient of f∈C∞(M) is the unique vector field gradgf characterised by
g(gradgf,X)=df(X)=X(f)for all X∈X(M).
In local coordinates: (gradgf)i=gij∂jf, where [gij]=[gij]−1.
Proof of the coordinate formula. We need components Xi satisfying gijXiVj=(∂jf)Vj for all vectors V. Setting V=∂k gives gikXi=∂kf, whence Xi=gij∂jf. □
Definition 0 (Riemannian divergence).
The divergence of X∈X(M) is the smooth function divgX characterised by
LX(dVg)=(divgX)dVg,
where LX is the Lie derivative. In local coordinates:
divgX=g1∂i(gXi).
Proof of the coordinate formula. By Cartan's formula LXω=ιXdω+d(ιXω) applied to the closed top-degree form dVg (for which d(dVg)=0), we get LX(dVg)=d(ιXdVg). In coordinates, expanding ιX(dVg) and taking the exterior derivative yields d(ιXdVg)=g1∂i(gXi)dVg. Comparing with [eq:div-def-eq] gives [eq:div-coords] [4, Ch. 16]. □
2.3. The Laplace–Beltrami operator
Definition 0 (Laplace–Beltrami operator).
The Laplace–Beltrami operator on (M,g) is
ΔMf=divg(gradgf).
Substituting [eq:grad-def-eq] into [eq:div-coords] gives the coordinate formula:
ΔMf=g1∂i(ggij∂jf).
On Rn with the Euclidean metric, gij=δij, g=1, and [eq:lbo-coords] reduces to the standard Laplacian Δf=∑i∂i2f.
Proposition 0 (Variational characterisation).
ΔM is the negative L2(M,g)-gradient of the Dirichlet energy
E[f]=21∫Mgij(∂if)(∂jf)gdnx.
Explicitly, for any φ∈Cc∞(M):
dεdε=0E[f+εφ]=−∫M(ΔMf)φdVg.
Proof. Differentiating under the integral and using the symmetry of gij:
The eigenfunctions of −ΔS2 are the spherical harmonics Yℓm with eigenvalues λℓ=ℓ(ℓ+1), multiplicity 2ℓ+1 [5, App. B].
The flat torus. On Tn=Rn/Zn with gij=δij, the eigenfunctions are the Fourier modes e2πik⋅x for k∈Zn, eigenvalues −4π2∣k∣2, and the heat kernel is
KtTn(x,y)=k∈Zn∑e−4π2∣k∣2te2πik⋅(x−y),
which converges to the flat Gaussian [eq:flat-kernel] as t→0+ (lattice images are exponentially suppressed).
3. Diffusion on a Curved Manifold
3.1. The heat equation on (M,g)
Definition 0 (Heat equation on a Riemannian manifold).
Let (M,g) be a smooth Riemannian manifold. The heat equation on M is
∂tu=ΔMu,u(⋅,0)=u0∈L2(M,dVg).
For compact M without boundary, a smooth solution for t>0 follows from the spectral theory of §4 via the heat semigroup u=etΔMu0. For complete non-compact M, existence and uniqueness were established by Dodziuk [6] and Li–Yau [7].
3.2. The heat kernel
Definition 0 (Heat kernel).
The heat kernel of (M,g) is a smooth function K:(0,∞)×M×M→R such that
u(x,t)=∫MKt(x,y)u0(y)dVg(y)
solves [eq:heat-curved] for every u0∈L2(M,dVg). It satisfies:
(∂t−ΔMx)Kt(x,y)=0 for all t>0, x=y.
Kt(x,y)=Kt(y,x) (symmetry from self-adjointness).
On (Rn,δ) the kernel [eq:flat-kernel] depends on x and y only through the Euclidean distance ∣x−y∣. On a Riemannian manifold the canonical substitute is the geodesic distance dg(x,y). The heat kernel is not simply (4πt)−n/2e−dg(x,y)2/4t, since curvature introduces corrections at every order in t, but this is the leading term. The parametrix ansatz [8] is
Ht(x,y)=(4πt)n/2e−dg(x,y)2/4tk=0∑Nuk(x,y)tk,
where the transport coefficients uk are smooth functions determined by substituting [eq:parametrix] into the heat equation and matching powers of t.
3.4. The transport equations and the short-time expansion
We derive the coefficients by inserting [eq:parametrix] into (∂t−ΔMx)Ht=0 and reading off each power of t.
Setup. Fix y∈M and work in normal coordinates centred at y (provided by the exponential map expy). In these coordinates [3, Ch. 5]:
gij(y)=δij and ∂kgij(y)=0,
dg(x,y)2=∣x∣2 to second order,
the volume density Θ(x,y):=g(x) satisfies Θ(x,y)=1−61Rij(y)xixj+O(∣x∣3).
Set ϕ(x,y)=dg(x,y)2/4, so the exponential factor is e−ϕ/t. Write the ansatz as Ht=(4πt)−n/2e−ϕ/tΦ where Φ=∑kuktk.
Applying the heat operator. Using the product formula for ΔM(eψf) with ψ=−ϕ/t:
We evaluate two key quantities. The eikonal identity for geodesic distance:
∣gradgϕ∣g2=ϕ,
which holds because gradgdg=r^ is the unit radial vector and ϕ=dg2/4 gives ∣gradgϕ∣2=dg2∣gradgdg∣2/4=dg2/4=ϕ. The Laplacian ofϕ: writing gradgϕ=21dgr^ and applying the Leibniz rule,
Collecting terms at order tk−1 and using g(gradgϕ,gradg⋅)=2dg∂dg:
kuk+2dg∂dguk+2dg(∂dglogΘ)uk=ΔMuk−1
(with u−1≡0). Multiplying through by 2dg2k−1Θ shows that the left side equals ∂dg(dg2kΘuk)/dgk−1Θ1/2, yielding the k-th transport ODE:
∂dg(dgkΘ1/2uk)=dgk−1Θ1/2ΔMuk−1.
Coefficient u0. Set k=0 in [eq:transport-ODE]:
∂dg(Θ1/2u0)=0,
so Θ1/2u0=c(y). The normalisation condition Ht(x,y)→δy(x) as t→0+ requires the leading coefficient to equal 1 at x=y; since Θ(y,y)=1, we get c(y)=1 and
u0(x,y)=Θ(x,y)−1/2.
In particular, u0(y,y)=1, recovering the flat-space leading term.
Coefficient u1 on the diagonal. Integrate [eq:transport-ODE] for k=1 from 0 to dg:
It remains to compute ΔM(Θ−1/2) at x=y. In normal coordinates, Θ=g has the expansion [8, Ch. 4]:
Θ−1/2(x,y)=1+121Rij(y)xixj+O(∣x∣3).
Since ΔM(xixj)∣x=0=2δij, applying ΔM to [eq:theta-expand] at x=y and using Rii=R(y) (contraction of the Ricci tensor):
ΔM(Θ−1/2)(y,y)=121⋅2Rij(y)δij=6R(y).
Substituting [eq:delta-theta] into [eq:u1-lhopital]:
u1(y,y)=6R(y).
The short-time diagonal expansion. Combining [eq:u0] and [eq:u1-diag] in [eq:parametrix] evaluated at x=y [9]:
Kt(x,x)∼(4πt)n/21(1+6R(x)t+O(t2))as t→0+.
Remark 0 (Curvature and the animation).
Return to the opening animation. Particles are coloured by exp(−∣x−x0∣2/4t), the flat-space approximation. Equation [eq:heat-expansion] quantifies the correction: at a point x where R(x)>0 (positive curvature, near the poles of the deformed container), the diagonal kernel Kt(x,x) exceeds the flat baseline; there is less room for the particle density to spread, so it stays more concentrated. Where R(x)<0 (saddle regions), the kernel is below the flat value and particles disperse faster.
The higher coefficients are universal polynomials in the curvature tensor and its covariant derivatives. The next term is [10]:
The parametrix [eq:parametrix] is not yet the true heat kernel; it satisfies (∂t−ΔMx)Ht=O(tN−n/2) but is not exactly zero on the right. To construct the true kernel, one writes Kt=Ht+correction where the correction solves a Volterra integral equation involving Ht. For compact M, the correction is smooth and O(tN−n/2+1), confirming that [eq:heat-expansion] is asymptotically exact [11, Ch. 4].
4. The Spectral Perspective
4.1. Spectral theorem for compact manifolds
Theorem 0 (Spectral theorem for the Laplace–Beltrami operator).
Let (M,g) be compact and connected without boundary. The operator −ΔM extends to a self-adjoint unbounded operator on L2(M,dVg) with domain H2(M). Its spectrum is a discrete sequence
0=λ0<λ1≤λ2≤⋯↗∞,
and the corresponding eigenfunctions {ϕk}k≥0 are smooth and form a complete orthonormal basis for L2(M,dVg):
−ΔMϕk=λkϕk,⟨ϕj,ϕk⟩L2=δjk.
The proof uses compactness of the Sobolev embedding H1(M)↪L2(M) (Rellich–Kondrachov), which forces the resolvent of −ΔM to be compact. The spectral theorem for compact self-adjoint operators then gives the discrete spectrum; elliptic regularity implies smoothness of eigenfunctions [12, Ch. 8].
4.2. Heat kernel as a spectral series
The heat semigroupetΔM acts on L2(M) by
etΔMf=k=0∑∞e−λkt⟨f,ϕk⟩ϕk,
which converges in L2 for all t≥0 and in C∞ for t>0 (exponential decay of e−λkt dominates polynomial growth of λk). The integral kernel of etΔM is
Kt(x,y)=k=0∑∞e−λktϕk(x)ϕk(y),
confirming that [eq:kernel-rep] with this kernel reproduces [eq:semigroup-def].
On the interval [0,π] with Dirichlet boundary conditions, ϕn(x)=sin(nx) and λn=n2. The figure below animates the solution u(x,t)=∑n∈{1,3,5,7}ansin(nx)e−n2t: faded traces are individual modes, bright gold is the sum. Mode n=7 (eigenvalue 49) decays 49 times faster than mode n=1.
4.3. Weyl's law
Theorem 0 (Weyl's law [<a href='#ref-weyl'>13</a>]).
Let N(λ)=#{k:λk≤λ}. Then
N(λ)∼(2π)nωnVol(M)λn/2as λ→∞,
where ωn=πn/2/Γ(n/2+1) is the volume of the unit ball in Rn. Equivalently, λk∼(2π)2ωn−2/n(k/Vol(M))2/n as k→∞.
Proof via the heat trace. Define the heat trace
Z(t)=Tr(etΔM)=k=0∑∞e−λkt=∫MKt(x,x)dVg(x).
The two expressions for Z(t) are equal: substituting the spectral series [eq:spectral-kernel] into the integral gives ∑ke−λkt∫Mϕk2dVg=∑ke−λkt by orthonormality.
Step 1: heat trace asymptotics. Integrating the diagonal expansion [eq:heat-expansion] over M:
Z(t)∼(4πt)n/2Vol(M)(1+6Rt+O(t2))as t→0+,
where R=Vol(M)−1∫MRdVg is the average scalar curvature. The leading term is
Z(t)∼(4πt)n/2Vol(M)as t→0+.
Step 2: rewriting as a Laplace–Stieltjes transform. Regard N(λ)=#{k:λk≤λ} as a right-continuous step function on [0,∞). Since λk↗∞, the Lebesgue–Stieltjes measure dN(λ) is the sum of unit point masses at each λk, and
Z(t)=k=0∑∞e−λkt=∫0∞e−λtdN(λ).
This is the Laplace–Stieltjes transform of N.
Step 3: Karamata's Tauberian theorem. The general statement is: if μ is a non-negative measure on [0,∞) and its Laplace transform satisfies ∫0∞e−λtdμ(λ)∼Ct−α as t→0+, then μ([0,λ])∼Cλα/Γ(α+1) as λ→∞ [2, App. B].
Apply this with μ=dN, C=Vol(M)/(4π)n/2, and α=n/2. The conclusion is
N(λ)∼(4π)n/2Vol(M)⋅Γ(n/2+1)λn/2as λ→∞.
Recognising ωn=πn/2/Γ(n/2+1) and (4π)n/2=2nπn/2:
(4π)n/2Γ(n/2+1)1=2nπn/2Γ(n/2+1)1=(2π)nωn,
so N(λ)∼(2π)nωnVol(M)λn/2, which is exactly [eq:weyl].
Step 4: equivalent eigenvalue asymptotics. Inverting: if N(λk)≈k for large k, then
The figure below plots the exact trace Z(t)=∑k=160e−k2t (gold) against the Weyl leading term π/(2t) (blue) for the interval [0,π]. The animated cursor sweeps t from large to small: agreement is excellent near t=0 but breaks down at large t, where only the k=1 mode survives.
4.4. Geometry encoded in the spectrum
The heat trace expansion [eq:trace-expansion] shows that the first two coefficients determine the volume and total scalar curvature∫MRdVg. For a compact surface (n=2), the Gauss–Bonnet theorem gives ∫MKdVg=2πχ(M) where K=R/2, so the second heat invariant encodes the Euler characteristic—a topological invariant.
The higher heat invariants are integrals of local curvature polynomials. Their computation underlies the heat-kernel proof of the Atiyah–Singer index theorem [14, 15]: for an elliptic operator D on a compact manifold, the alternating heat trace Tr(e−tD∗D)−Tr(e−tDD∗) is constant in t and equals the Fredholm index ind(D); expanding via the parametrix and integrating expresses the index as an integral of characteristic classes. The heat kernel is thus a bridge between the analytic data of a differential operator, the geometric data of a Riemannian metric, and the topology of the underlying manifold.
I. Chavel, Eigenvalues in Riemannian Geometry, Academic Press, 1984. ScienceDirect
J. Dodziuk, "Maximum principle for parabolic inequalities and the heat flow on open manifolds," Indiana Univ. Math. J. 32(5) (1983), 703–716. JSTOR
P. Li and S.-T. Yau, "On the parabolic kernel of the Schrödinger operator," Acta Math. 156 (1986), 153–201. doi:10.1007/BF02399203
M. Berger, P. Gauduchon, and E. Mazet, Le spectre d'une variété riemannienne, Springer LNM 194, 1971. doi:10.1007/BFb0064643
S. Minakshisundaram and Å. Pleijel, "Some properties of the eigenfunctions of the Laplace operator on Riemannian manifolds," Canad. J. Math. 1 (1949), 242–256. doi:10.4153/CJM-1949-021-5
P. B. Gilkey, Invariance Theory, the Heat Equation, and the Atiyah–Singer Index Theorem, 2nd ed., CRC Press, 1995. doi:10.1201/9780849378744
S. Rosenberg, The Laplacian on a Riemannian Manifold, Cambridge University Press, 1997. doi:10.1017/CBO9780511623783
M. E. Taylor, Partial Differential Equations II, Springer Applied Math. Sciences 116, 1996. doi:10.1007/978-1-4757-4187-2
H. Weyl, "Über die asymptotische Verteilung der Eigenwerte," Nachrichten der Königlichen Gesellschaft der Wissenschaften zu Göttingen (1912), 110–117. GDZ Göttingen
M. F. Atiyah, R. Bott, and V. K. Patodi, "On the heat equation and the index theorem," Inventiones Mathematicae 19 (1973), 279–330. doi:10.1007/BF01425417
E. Getzler, "A short proof of the local Atiyah–Singer index theorem," Topology 25(1) (1986), 111–117. doi:10.1016/0040-9383(86)90008-X