05.09.02 · symplectic / integrable

Adiabatic invariants

shipped · 3 tiers · Lean: none

Anchor (Master): Ehrenfest 1916 *Adiabatische Invarianten und Quantentheorie* (originator); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 6; Born *The Mechanics of the Atom* (1925)

Intuition [Beginner]

Imagine a child on a swing whose chains are slowly being shortened. Each individual swing happens fast — back and forth in a second or two — while the shortening happens over many minutes. A natural question: as the chains contract, does the swing get wilder, calmer, or stay the same?

The answer is one of the most useful approximate conservation laws in physics. A particular geometric quantity — the area enclosed by the swing's trajectory in the position-velocity plane over one full oscillation — barely changes. The swing's energy goes up as the chains shorten, the frequency goes up, but the ratio of energy to frequency stays nearly fixed. Physicists call this ratio an adiabatic invariant — a quantity preserved when a parameter of the system varies slowly compared with the system's own internal motion.

This is why a vibrating string that is slowly tightened gains energy in step with its rising pitch, why charged particles stay trapped in slowly-varying magnetic bottles, and why early quantum theorists guessed correctly which quantities to quantize. The slow change does not destroy the bookkeeping; it carries it along.

Visual [Beginner]

Two phase portraits of a harmonic oscillator with frequency $ω$ that slowly increases over time. On the left the oscillator at frequency $ω_{0}$ : a circular orbit in the position-momentum plane with area $A_{0}$ . On the right the same oscillator at frequency $ω_{1} > ω_{0}$ after slow tuning: a narrower, taller ellipse with the same enclosed area $A_{1} = A_{0}$ . The orbit shape changes; the enclosed area survives.

Figure: Two side-by-side phase portraits of a harmonic oscillator under slow frequency change. Left: a circular orbit at low frequency. Right: a taller, narrower ellipse at higher frequency. The enclosed area is the same in both pictures, illustrating the adiabatic invariant $I = E/\omega$.

The picture captures the headline result: the area swept out by one period of the orbit is the conserved bookkeeping quantity, and it survives slow changes to the system even when energy and frequency individually drift.

Worked example [Beginner]

Take a one-dimensional harmonic oscillator with mass $1$ and time-varying spring stiffness $k (t)$ , giving the equation of motion $\ddot{q} + k (t) q = 0$ . Start with $k (0) = 1$ (frequency $ω_{0} = 1$ ) and an initial state with energy $E_{0} = 0.5$ . The action is $I = E / ω = 0.5$ .

Slowly increase the stiffness over a long time so that at the end $k (T) = 4$ , frequency $ω_{1} = 2$ . The slow-parameter rate is $ϵ = 1/ T$ for the total tuning time $T$ , with $T$ much larger than the oscillator period $2 π$ .

Claim: at the end the action $I$ is still approximately $0.5$ . Therefore the new energy is $E_{1} = I \cdot ω_{1} = 0.5 \cdot 2 = 1.0$ . The energy doubled even though no impulsive work was done — the slow tuning fed energy in at exactly the rate that keeps the ratio $E / ω$ fixed.

Numerical check: integrate $\ddot{q} + k (t) q = 0$ with $k (t)$ rising linearly from $1$ to $4$ over $T = 1000$ . Compute $I = E / ω$ at the start and the end. The two values agree to about $0.001$ , which is first order in $ϵ = 0.001$ , as the adiabatic theorem predicts.
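The numerical check can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the kick-drift-kick leapfrog integrator, the step size `dt = 0.005`, the initial state $(q, p) = (1, 0)$ and the helper names `stiffness`, `action` and `run_ramp` are all assumptions of the sketch, not part of the text above.

```python
import math

def stiffness(t, total_time):
    """Linear ramp k(t) from 1 at t = 0 to 4 at t = total_time."""
    return 1.0 + 3.0 * t / total_time

def action(q, p, k):
    """I = E / omega for H = p^2/2 + k q^2/2 at frozen stiffness k."""
    energy = 0.5 * p * p + 0.5 * k * q * q
    return energy / math.sqrt(k)

def run_ramp(total_time=1000.0, dt=0.005):
    """Integrate q'' + k(t) q = 0 with a kick-drift-kick leapfrog scheme."""
    q, p = 1.0, 0.0                      # E0 = 0.5 at k(0) = 1
    steps = int(round(total_time / dt))
    for n in range(steps):
        t = n * dt
        p -= 0.5 * dt * stiffness(t, total_time) * q         # half kick at t
        q += dt * p                                           # full drift
        p -= 0.5 * dt * stiffness(t + dt, total_time) * q     # half kick at t + dt
    return q, p

q_end, p_end = run_ramp()
I_start = action(1.0, 0.0, 1.0)
I_end = action(q_end, p_end, 4.0)
E_end = 0.5 * p_end ** 2 + 0.5 * 4.0 * q_end ** 2
print(f"I(0) = {I_start:.5f}, I(T) = {I_end:.5f}, E(T) = {E_end:.5f}")
```

A symplectic integrator is chosen here so that the integrator's own energy error stays bounded and does not masquerade as action drift; any accurate ODE solver should show the same agreement between `I_start` and `I_end`.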

Takeaway: the action $I = E / ω$ is the right currency for tracking a slowly-tuned oscillator. Energy alone is not.

Check your understanding [Beginner]

Exercise (easy, multiple choice).

Which of these qualifies as an adiabatic process for a one-dimensional oscillator?

A. The frequency is changed instantaneously by a sudden impulse
B. The frequency oscillates randomly on a timescale comparable to the system's period
C. The frequency drifts smoothly over a timescale very long compared with one oscillation
D. The frequency is held constant forever

Hint

The word adiabatic refers to a separation of timescales: slow parameter, fast motion.

Answer

C. The frequency drifts smoothly over a timescale very long compared with one oscillation.

Feedback-correct: correct; the slow-vs-fast timescale split is what makes the action approximately conserved. Feedback-wrong: a sudden change does not give the system time to follow the parameter, and a frozen system has no parameter variation at all.

Exercise (easy, true/false).

True or false: in a slowly-varying magnetic field, the magnetic moment of a charged particle gyrating around field lines is an adiabatic invariant. This is the principle behind magnetic-mirror confinement.

Hint

The gyration is the fast motion; the slow change of $B$ along the particle's path is the slow parameter.

Answer

True. The magnetic moment $μ = m v_{⊥}^{2} / (2 B)$ has the same structure as $E / ω$ for the gyration motion: numerator is a kinetic energy, denominator is a frequency (the cyclotron frequency $ω_{c} = q B / m$ rescaled). Magnetic-mirror confinement and tokamak design rely on this adiabatic invariant: as a particle drifts into a region of stronger $B$ , its perpendicular kinetic energy rises and its parallel kinetic energy drops, eventually reflecting the particle if $B$ rises high enough.
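The reflection condition follows from two conserved quantities, and a short Python sketch makes the arithmetic explicit. It assumes a static field (so the total kinetic energy is conserved alongside $μ$ ) and a particle launched at field strength `b0` with pitch angle `pitch_angle` between its velocity and the field line; the function names are hypothetical.

```python
import math

def mirror_field(b0, pitch_angle):
    """Field strength at which the particle is reflected.

    mu conservation keeps v_perp^2 / B constant, and energy conservation
    keeps v^2 constant, so at the mirror point (v_perp = v) one gets
    B_mirror = b0 / sin^2(pitch_angle)."""
    return b0 / math.sin(pitch_angle) ** 2

def is_trapped(b0, b_max, pitch_angle):
    """True if the particle is reflected before reaching the peak field b_max."""
    return mirror_field(b0, pitch_angle) <= b_max

# A particle launched at 30 degrees to the field line.
b_reflect = mirror_field(1.0, math.pi / 6)
print(b_reflect, is_trapped(1.0, 5.0, math.pi / 6), is_trapped(1.0, 3.0, math.pi / 6))
```

Particles with small pitch angles need a very strong field to be reflected, which is why a mirror with finite peak field always loses the particles moving nearly parallel to the field lines.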

Formal definition [Intermediate+]

Let $(M^{2 n}, ω)$ be a symplectic manifold and let $H : M \times \mathbb{R} \to \mathbb{R}$ be a smooth Hamiltonian depending on time through a slowly-varying parameter $λ$ , $$ H(q, p; \lambda(\tau)), \qquad \tau := \epsilon t, $$ with $ϵ > 0$ a small positive parameter and $λ : \mathbb{R} \to Λ$ a smooth path in a parameter manifold $Λ$ . The variable $t$ is the fast time; $τ$ is the slow time.

Assume that for each fixed $λ \in Λ$ , the Hamiltonian $H_{λ} (q, p) := H (q, p; λ)$ is integrable 05.02.03 in a neighbourhood $U_{λ} \subset M$ of an orbit family of interest, with $λ$ -dependent action-angle coordinates 05.02.04 $$ (I, \theta) \in U_\lambda^{\mathrm{aa}} \times \mathbb{T}^n \subset \mathbb{R}^n \times \mathbb{T}^n $$ in which $ω = \sum_{i} d I_{i} \land d θ_{i}$ and $H_{λ} = h_{0} (I; λ)$ depends only on the actions. The frequency map is $ω (I; λ) = \partial_{I} h_{0} (I; λ)$ .

The action variable at a fixed $λ$ is the cycle integral $$ I_i(\lambda) = \frac{1}{2\pi} \oint_{\gamma_i(\lambda)} p \cdot dq, $$ where $γ_{i} (λ)$ is the $i$ -th independent loop on the $n$ -torus level set; for $n = 1$ this is the area enclosed by one period of the orbit, divided by $2 π$ .
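For $n = 1$ the cycle integral can be evaluated by direct quadrature. The sketch below assumes a Hamiltonian of the form $H = p^{2} /2 + V (q)$ with an even potential, so that the turning points sit at $\pm q_{turn}$ ; the function name `action_1d`, the midpoint rule and the substitution $q = q_{turn} \sin u$ are choices made for this illustration only.

```python
import math

def action_1d(potential, energy, q_turn, n=2000):
    """I = (1/2pi) * closed integral of p dq for H = p^2/2 + V(q),
    assuming V is even with turning points at -q_turn and +q_turn."""
    # The substitution q = q_turn * sin(u) removes the square-root
    # singularity of p(q) at the turning points.
    h = math.pi / n
    total = 0.0
    for j in range(n):
        u = -0.5 * math.pi + (j + 0.5) * h            # midpoint rule on [-pi/2, pi/2]
        q = q_turn * math.sin(u)
        p = math.sqrt(max(0.0, 2.0 * (energy - potential(q))))
        total += p * q_turn * math.cos(u) * h
    # The upper and lower branches of the orbit contribute equally.
    return 2.0 * total / (2.0 * math.pi)

omega, energy = 2.0, 1.0
I_num = action_1d(lambda q: 0.5 * omega ** 2 * q ** 2, energy,
                  math.sqrt(2.0 * energy) / omega)
print(I_num, energy / omega)
```

For the harmonic oscillator the quadrature should reproduce $I = E / ω$ ; the same routine applies unchanged to any even single-well potential once the turning point is supplied.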

Along a trajectory of the slowly-varying Hamiltonian $H (q, p; λ (ϵ t))$ , the action $I_{i}$ is evaluated at the instantaneous parameter value $λ (ϵ t)$ . The question is the size of $| I_{i} (t) - I_{i} (0) |$ as the slow time $τ$ ranges over an interval of length $O (1)$ , that is, as the fast time ranges over an interval of length $O (ϵ^{- 1})$ .

Counterexamples to common slips

The unperturbed orbit is not preserved. The instantaneous trajectory in $(q, p)$ space changes, sometimes dramatically: the orbit at $τ = 1$ can be a shape utterly different from the orbit at $τ = 0$ . What survives is the action, not the orbit.
Adiabatic invariance is not exact. The action drift is $O (ϵ)$ , not zero. Stronger statements — exponentially small drift in $ϵ$ — hold under analyticity hypotheses (Neishtadt) but fail under generic smoothness.
Resonant tori fail. In dimension $n \geq 2$ , the bound $\dot{\widehat{I}} = O (ϵ^{2})$ on the averaged action requires the unperturbed frequencies to be non-resonant; on resonance hypersurfaces the action can drift by an $O (1)$ amount. This is where adiabatic invariance and the KAM regime begin to interact.
Action versus energy. $I$ is conserved adiabatically; energy generally is not. An oscillator whose frequency rises will gain energy at the rate that keeps $I$ fixed.

Key theorem with proof [Intermediate+]

Theorem (classical adiabatic invariant, one degree of freedom; Burgers / Ehrenfest 1916). Let $H (q, p; λ)$ be a smooth Hamiltonian on $\mathbb{R}^{2}$ with parameter $λ \in Λ$ . Suppose for each $λ$ in a compact set the level set $\{H_{λ} = E\}$ is a smooth closed curve $γ (E; λ)$ enclosing area $2 π I (E; λ)$ , with frequency $ω (I; λ) = \partial_{I} h_{0} (I; λ) > 0$ uniformly bounded above and below. Let $λ (τ)$ be a smooth path with $τ = ϵ t$ , and let $(q (t), p (t))$ solve Hamilton's equations for $H (q, p; λ (ϵ t))$ . Then the action $I (t) := I (H (q (t), p (t); λ (ϵ t)); λ (ϵ t))$ satisfies $$ |I(t) - I(0)| \leq C \epsilon $$ uniformly for $t \in [0, T / ϵ]$ , where $T > 0$ is fixed and $C$ depends only on $T$ , on $λ$ and its derivatives, and on uniform bounds for $H$ and $\partial_{I} H$ on the trajectory ^{[Ehrenfest 1916; ref: TODO_REF Arnold]}.

Proof (averaging method). Pass to action-angle coordinates $(I, θ)$ for the frozen Hamiltonian $H_{λ}$ . In these coordinates the slowly-varying Hamiltonian becomes $$ \widetilde H(I, \theta; \lambda) = h_0(I; \lambda), $$ a function only of $I$ at each fixed $λ$ , since the action-angle coordinates are tailored to each $λ$ separately. The remaining subtlety is that the action-angle map itself depends on $λ$ , so the symplectic transformation $(q, p) \mapsto (I, θ)$ is time-dependent through $λ (ϵ t)$ .

Time-dependent canonical transformations contribute a generating-function correction. Writing the transformation as $Φ_{λ}$ with generator $S (q, I; λ)$ in the Type-II convention, the new Hamiltonian in $(I, θ)$ coordinates is $$ K(I, \theta; \tau) = h_0(I; \lambda(\tau)) + \epsilon \frac{\partial S}{\partial \lambda}(q(I, \theta; \lambda), I; \lambda) \cdot \dot\lambda(\tau), $$ where $q (I, θ; λ)$ is the inverse coordinate map and $\dot{λ} = d λ / d τ$ . The first term depends only on the action; the second is a slow perturbation of size $ϵ$ , and it depends on $θ$ through the inverse coordinate map.

Hamilton's equations in the new variables become $$ \dot I = -\frac{\partial K}{\partial \theta} = -\epsilon \frac{\partial}{\partial \theta} \left( \frac{\partial S}{\partial \lambda} \right) \cdot \dot\lambda(\tau), \qquad \dot\theta = \frac{\partial K}{\partial I} = \omega(I; \lambda(\tau)) + O(\epsilon). $$ The first equation says $\dot{I}$ is itself $O (ϵ)$ , so on a time interval $T / ϵ$ the action could drift by $O (1)$ — not yet a useful estimate. The improvement comes from averaging.

Averaging step. Decompose the perturbation into its angle-average and angle-fluctuation: $$ F(I, \theta; \lambda) := \frac{\partial S}{\partial \lambda}(q(I, \theta; \lambda), I; \lambda) = \langle F \rangle(I; \lambda) + \widetilde F(I, \theta; \lambda), $$ with $⟨ F ⟩ (I; λ) := (2 π)^{- 1} \int_{0}^{2 π} F (I, θ; λ) d θ$ and $\widetilde{F}$ of zero average.

The angle-average part contributes $- ϵ \partial_{θ} ⟨ F ⟩ \cdot \dot{λ} = 0$ to $\dot{I}$ , since $⟨ F ⟩$ is independent of $θ$ . So only the zero-average part $\widetilde{F}$ drives the action.

Solve the cohomological equation. On the circle $\mathbb{T}^{1}$ with frequency $ω^{*} := ω (I; λ)$ frozen at the current values, look for a function $G (I, θ; λ)$ solving $$ \omega^* \frac{\partial G}{\partial \theta} = \widetilde F. $$ Fourier-expand $\widetilde{F} = \sum_{k \neq 0} F_{k} e^{ik θ}$ ; the solution is $G = \sum_{k \neq 0} \frac{F_{k}}{ik ω^{*}} e^{ik θ}$ . The denominators satisfy $|ik ω^{*}| \geq ω^{*} > 0$ (no small-divisor problem in one frequency), so $G$ is smooth and bounded uniformly for $(I, λ)$ in the compact regime.
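The Fourier division can be tried out on a trigonometric polynomial. The following sketch stores the non-zero Fourier coefficients in a dictionary and checks the equation by a central finite difference; the function names `solve_cohomological` and `evaluate`, the sample function $\cos θ + \tfrac{1}{2} \sin 2 θ$ and the value $ω^{*} = 2$ are illustrative assumptions.

```python
import cmath

def solve_cohomological(f_coeffs, omega):
    """Given Fourier coefficients {k: F_k} (k != 0) of a zero-average function,
    return those of G with omega * dG/dtheta = F, i.e. G_k = F_k / (i k omega)."""
    if omega <= 0:
        raise ValueError("omega must be positive")
    if 0 in f_coeffs:
        raise ValueError("the k = 0 mode must be absent (zero average)")
    return {k: fk / (1j * k * omega) for k, fk in f_coeffs.items()}

def evaluate(coeffs, theta):
    """Evaluate sum_k c_k exp(i k theta)."""
    return sum(c * cmath.exp(1j * k * theta) for k, c in coeffs.items())

omega_star = 2.0
# cos(theta) + 0.5 * sin(2 theta), written in complex exponentials
f_tilde = {1: 0.5, -1: 0.5, 2: -0.25j, -2: 0.25j}
g = solve_cohomological(f_tilde, omega_star)

theta, h = 0.7, 1e-5
derivative = (evaluate(g, theta + h) - evaluate(g, theta - h)) / (2 * h)
residual = abs(omega_star * derivative - evaluate(f_tilde, theta))
print(f"residual of the cohomological equation: {residual:.2e}")
```

The guard against a `k = 0` entry mirrors the zero-average requirement: a non-zero mean has no preimage under $\partial_{θ}$ on the circle.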

Change of variables. Pass to the near-identity coordinate change $\widehat{I} := I + ϵ \, \partial_{θ} G (I, θ; λ) \cdot \dot{λ}$ . To leading order in $ϵ$ , the equation of motion for $\widehat{I}$ is $$ \dot{\widehat I} = \dot I + \epsilon \, \omega^* \, \partial_\theta^2 G \cdot \dot\lambda + O(\epsilon^2) = -\epsilon \, \partial_\theta \widetilde F \cdot \dot\lambda + \epsilon \, \partial_\theta \widetilde F \cdot \dot\lambda + O(\epsilon^2) = O(\epsilon^2), $$ where the second term uses $\dot{θ} = ω^{*} + O (ϵ)$ together with the $θ$ -derivative of the cohomological equation, $ω^{*} \partial_{θ}^{2} G = \partial_{θ} \widetilde{F}$ . The first-order driving term cancels by construction. Integrating $\dot{\widehat{I}} = O (ϵ^{2})$ over $t \in [0, T / ϵ]$ gives $| \widehat{I} (t) - \widehat{I} (0) | \leq C ϵ$ . Since $| \widehat{I} - I | = O (ϵ)$ at every time, the same bound holds for the original action $I$ . $□$
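The cancellation can be checked numerically on the harmonic oscillator $H = (p^{2} + ω (t)^{2} q^{2}) /2$ with $ω (t) = 1 + ϵ t$ . The sketch below is hedged throughout: the closed-form corrected action $I + (ω' /2 ω^{2}) \, q p$ , with $ω' = d ω / d t = ϵ$ , is worked out for this example from $d I / d t = (ω' /2 ω^{2}) (ω^{2} q^{2} - p^{2})$ and $d (q p) / d t = p^{2} - ω^{2} q^{2}$ and is not taken from the text; the leapfrog integrator, the step size and the name `drift_comparison` are likewise assumptions.

```python
def drift_comparison(eps=0.01, slow_time=1.0, dt=1e-3):
    """Integrate H = (p^2 + w(t)^2 q^2)/2 with w(t) = 1 + eps*t and return
    (max |I - I(0)|, max |I_hat - I_hat(0)|) over 0 <= t <= slow_time/eps."""
    def w(t):
        return 1.0 + eps * t

    def invariants(q, p, t):
        raw = 0.5 * (p * p + w(t) ** 2 * q * q) / w(t)       # I = E / w
        corrected = raw + eps * q * p / (2.0 * w(t) ** 2)    # I + (w'/(2 w^2)) q p
        return raw, corrected

    q, p = 1.0, 0.0
    raw0, hat0 = invariants(q, p, 0.0)
    max_raw = max_hat = 0.0
    steps = int(round(slow_time / (eps * dt)))
    for n in range(steps):
        t = n * dt
        p -= 0.5 * dt * w(t) ** 2 * q         # half kick with the frequency at t
        q += dt * p                           # full drift
        p -= 0.5 * dt * w(t + dt) ** 2 * q    # half kick with the frequency at t + dt
        raw, hat = invariants(q, p, t + dt)
        max_raw = max(max_raw, abs(raw - raw0))
        max_hat = max(max_hat, abs(hat - hat0))
    return max_raw, max_hat

max_raw, max_hat = drift_comparison()
print(f"raw action drift: {max_raw:.2e}, corrected action drift: {max_hat:.2e}")
```

Under these assumptions the raw action should wobble at order $ϵ$ while the corrected action wobbles only at order $ϵ^{2}$ , which is the content of the near-identity change of variables.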

Bridge. The averaging argument builds toward the multi-frequency theory of perturbations, where the same step-by-step cancellation of angle-dependent terms appears again in the KAM theorem 05.09.01 and in the Birkhoff normal form near elliptic fixed points. Both apparatuses begin from the same cohomological equation $ω^{*} \partial_{θ} G = \widetilde{F}$ and run into the same dichotomy: in one frequency the equation is harmless because the Fourier denominator $ω^{*}$ is a single positive number, while in $n \geq 2$ frequencies the denominators $⟨ k, ω^{*} ⟩$ become arbitrarily small near resonance hyperplanes. Putting these together, adiabatic invariance is so much easier than KAM precisely because one frequency carries no small-divisor problem; KAM appears again in the next chapter as the perturbative theory of a slowly varying action that nonetheless has many frequencies. The same elimination scheme recurs throughout perturbation theory, accelerated to a Newton-type iteration in KAM: it is the bridge between the analytic input and the geometric output.

Exercises [Intermediate+]

Exercise 4 (medium, short-answer).

State the adiabatic theorem for a one-degree-of-freedom Hamiltonian and identify the role of the assumption $ω (I; λ) > c > 0$ uniformly. What goes wrong if the frequency is allowed to vanish at some $λ^{*}$ ?

Hint

The cohomological equation is $ω^{*} \partial_{θ} G = \widetilde{F}$ . What happens at $ω^{*} = 0$ ?

Answer

Theorem: $| I (t) - I (0) | \leq C ϵ$ uniformly on $t \in [0, T / ϵ]$ provided $ω = \partial_{I} h_{0}$ stays bounded below by $c > 0$ . If $ω \to 0$ at some $λ^{*}$ (separatrix crossing, where the orbit period diverges), the cohomological equation $ω^{*} \partial_{θ} G = \widetilde{F}$ has unbounded inverse, $G$ blows up, and the cancellation of the first-order driving term fails. A trajectory passing through $λ = λ^{*}$ generically gains an $O (1)$ jump in its action; this is the separatrix-crossing phenomenon of Hannay and Cary-Skodje. Rubric: full credit for naming the cohomological-equation breakdown and identifying the orbit-period divergence as the geometric reason.

Exercise 5 (medium, symbolic).

For a slowly-varying harmonic oscillator $H = (p^{2} + ω (t)^{2} q^{2}) /2$ with $ω (t) = 1 + ϵ t$ for $0 \leq t \leq T / ϵ$ and $T$ a fixed positive number, write down the exact frequency-time relation along the orbit and verify $I = E / ω$ to leading order in $ϵ$ . Show that the orbit's area in the $(q, p)$ plane at time $t$ equals its area at time $0$ to first order in $ϵ$ .

Hint

WKB ansatz $q (t) = A (t) \cos (ϕ (t))$ with $ϕ' (t) = ω (t) + O (ϵ)$ and $A^{2} ω \approx \mathrm{const}$ .

Answer

Make the WKB ansatz $q (t) = A (t) \cos (ϕ (t))$ with $ϕ (t) = \int_{0}^{t} ω (s) d s + O (ϵ)$ . Substituting into $\ddot{q} + ω^{2} q = 0$ and matching gives $A^{2} ω = \mathrm{const} + O (ϵ)$ . Since $E = A^{2} ω^{2} /2$ to leading order, this is $I = E / ω = A^{2} ω /2 = \mathrm{const} + O (ϵ)$ . The orbit at time $t$ is an ellipse with semi-axes $A$ in $q$ and $A ω$ in $p$ , so its area is $π A^{2} ω = 2 π I$ (the factor $2 π$ is the action's normalisation), conserved to first order. Direct numerical integration of $\ddot{q} + (1 + ϵ t)^{2} q = 0$ with $ϵ = 10^{- 3}$ over $T / ϵ$ confirms $| I (t) - I (0) | \leq C ϵ$ with $C \approx 1$ .

Exercise 6 (medium, short-answer).

Why is the angle-average step central to the proof? Explain in two or three sentences.

Hint

Compare the size of $\dot{I}$ before and after the angle-average has been removed.

Answer

Without averaging, $\dot{I} = - ϵ \partial_{θ} F \cdot \dot{λ}$ is $O (ϵ)$ but the integrand contains both an angle-average part (which would drive secular drift over a long time interval) and a zero-average oscillating part. The angle-average of $\partial_{θ} F$ vanishes identically — the integral around the circle of any total derivative is zero — so the secular-drift term is absent automatically, and only the oscillating part remains. The oscillating part is in turn the gradient of a bounded function $G$ via the cohomological equation, so it integrates to a bounded near-identity correction rather than a secular drift. Rubric: full credit for identifying the secular-versus-oscillating split and the role of $\partial_{θ}$ killing the average.

Exercise 7 (hard, short-answer).

Sketch why the one-frequency adiabatic invariance argument can fail in dimension $n \geq 2$ on a resonant torus. Identify the role of the cohomological equation $⟨ k, ω^{*} ⟩ G_{k} = F_{k}$ .

Hint

Resonance means $⟨ k, ω^{*} ⟩ = 0$ for some non-zero $k \in \mathbb{Z}^{n}$ .

Answer

For $n \geq 2$ frequencies, the cohomological equation reads $⟨ k, ω^{*} ⟩ G_{k} = F_{k}$ for each Fourier mode $k$ . On a resonant torus, $⟨ k, ω^{*} ⟩ = 0$ for some $k \neq 0$ , so the corresponding mode $G_{k}$ is undetermined and the equation has no solution unless $F_{k} = 0$ . The angle-fluctuation $\widetilde{F}$ is generically non-zero in the resonant Fourier mode, so the cancellation of the first-order driving term fails and the averaged drift rate remains $O (ϵ)$ rather than $O (ϵ^{2})$ , producing $O (1)$ drift over a time of length $ϵ^{- 1}$ . Off resonance the equation is solvable but the inverses $1/ ⟨ k, ω^{*} ⟩$ become large near resonance hyperplanes, leading to the small-divisor problem that KAM addresses by restricting to Diophantine frequencies. Rubric: full credit for naming the resonance hyperplane, the failure of the cohomological equation on it, and the connection to small divisors.
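The contrast between resonant and non-resonant frequency vectors can be seen by brute force. The sketch below scans integer vectors $k$ with bounded entries and reports the smallest divisor $| ⟨ k, ω ⟩ |$ ; the function name `smallest_divisor`, the cutoff `max_order = 5` and the two sample frequency vectors are assumptions chosen for illustration.

```python
import math
from itertools import product

def smallest_divisor(omega, max_order):
    """Minimum of |<k, omega>| over integer vectors k != 0 with |k_i| <= max_order.
    Returns (value, k) for the minimising k found first."""
    best, best_k = math.inf, None
    ranges = [range(-max_order, max_order + 1)] * len(omega)
    for k in product(*ranges):
        if not any(k):
            continue                      # skip k = 0
        value = abs(sum(ki * wi for ki, wi in zip(k, omega)))
        if value < best:
            best, best_k = value, k
    return best, best_k

golden = (1.0, (1.0 + math.sqrt(5.0)) / 2.0)   # badly approximable frequency ratio
resonant = (1.0, 1.5)                           # 3 * 1 - 2 * 1.5 = 0
print(smallest_divisor(golden, 5))
print(smallest_divisor(resonant, 5))
```

For the resonant vector the divisor vanishes exactly at $k = \pm (3, - 2)$ , so the corresponding Fourier mode of $G$ cannot be solved for; for the golden-ratio vector the divisor stays bounded away from zero at this cutoff but shrinks as the cutoff grows.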

Exercise 8 (hard, short-answer).

State and motivate the Neishtadt exponential-precision result: under analyticity of $H$ in $λ$ , the action drift over $[0, T / ϵ]$ is bounded by $C \exp (- c / ϵ)$ rather than $C ϵ$ . Why does smoothness alone fail to give exponential precision?

Hint

Analyticity gives Cauchy bounds on Fourier coefficients; smoothness only gives polynomial decay.

Answer

Under real-analyticity in $λ$ , the slow time-derivatives $d^{k} λ / d τ^{k}$ extend holomorphically and are bounded uniformly on a strip. Iterating the averaging step $N$ times reduces the drift driver from $O (ϵ)$ to $O (ϵ^{N})$ at the cost of an $N!$ growth in the constant. Optimising over $N ≍ 1/ ϵ$ , using analyticity to control the growth of derivatives via Cauchy bounds, gives an exponentially-small bound $C \exp (- c / ϵ)$ . Without analyticity, smoothness gives only polynomial decay of the iterated constants and the optimisation halts at any finite order, leaving a polynomial-in- $ϵ$ residual. Rubric: full credit for naming the iterative averaging, the role of analyticity in providing Cauchy bounds, and the optimal-truncation strategy. Neishtadt 1981 is the canonical reference.
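The optimal-truncation bookkeeping can be illustrated with a toy remainder $N! \, ϵ^{N}$ . This is only a caricature of the real estimates, with the constants set to one and the name `optimal_truncation` invented for the sketch; it shows how a factorially growing constant and a geometrically shrinking factor combine into an exponentially small minimum near $N \approx 1/ ϵ$ .

```python
import math

def optimal_truncation(eps, max_order=200):
    """Minimise the toy remainder N! * eps^N over 1 <= N <= max_order.
    Returns (N_best, remainder)."""
    best_n, best = 0, 1.0
    term = 1.0
    for n in range(1, max_order + 1):
        term *= n * eps                  # term now equals n! * eps^n
        if term < best:
            best_n, best = n, term
        elif n * eps > 1.0:
            break                        # terms only grow from here on
    return best_n, best

for eps in (0.1, 0.05):
    n_best, remainder = optimal_truncation(eps)
    print(eps, n_best, remainder, math.exp(-1.0 / eps))
```

By Stirling's formula the minimum is comparable to $\sqrt{2 π / ϵ} \, e^{- 1/ ϵ}$ , so stopping the iteration at the right order trades a divergent series for an exponentially small remainder.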

Exercise 9 (hard, short-answer).

Outline the Born-Fock 1928 quantum adiabatic theorem and connect it with the classical statement. What is the role of Berry's 1984 phase?

Hint

The slow parameter is $λ (ϵ t)$ ; the fast oscillation is the energy eigenphase $\exp (- i E_{n} (λ) t /ℏ)$ .

Answer

Born-Fock theorem: a quantum system whose Hamiltonian $H (λ (ϵ t))$ depends slowly on a parameter, with discrete spectrum and no level crossings, has its state stay in the $n$ -th instantaneous eigenstate of $H (λ)$ up to a corrective amplitude of size $ϵ$ . The classical analogue is the conservation of $I = E / ω$ , where $ω$ plays the role of energy spacing in the semiclassical limit; Bohr-Sommerfeld quantisation $I = n ℏ$ then reads as the discreteness of the quantum spectrum. Berry 1984: the dynamical phase $\exp (- i \int E_{n} (λ (ϵ t)) /ℏ \, d t)$ acquires an additional geometric phase $\exp (i γ_{n})$ depending only on the path in $Λ$ and not on its parameterisation, computed as $γ_{n} = \oint ⟨ n | i d_{λ} | n ⟩$ , the holonomy of a connection on the eigenstate bundle. This Berry holonomy is the modern lift of the adiabatic theorem to differential-geometric language. Rubric: full credit for stating the Born-Fock conclusion, the connection to classical $I = E / ω$ via Bohr-Sommerfeld, and the geometric-phase reformulation.
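The holonomy formula can be evaluated numerically in the standard textbook example of a spin-1/2 whose quantisation axis circles the $z$ axis at fixed polar angle. That example, the discretisation into `n_steps` overlaps and the function name are assumptions of this sketch, not part of the answer above; the state used is $(\cos (ϑ /2), e^{i φ} \sin (ϑ /2))$ with polar angle $ϑ$ and azimuth $φ$ .

```python
import cmath
import math

def berry_phase_spin_half(polar_angle, n_steps=2000):
    """Discrete Berry phase of the spin-up-along-n state as the direction n
    circles the z axis once at fixed polar angle.

    Uses gamma = -arg( prod_j <n_j | n_{j+1}> ), a lattice version of
    gamma = closed integral of <n| i d |n>. The result is reduced to (-pi, pi]."""
    c = complex(math.cos(polar_angle / 2))
    s = math.sin(polar_angle / 2)

    def state(phi):
        return (c, s * cmath.exp(1j * phi))

    prod = 1.0 + 0.0j
    for j in range(n_steps):
        a = state(2 * math.pi * j / n_steps)
        b = state(2 * math.pi * (j + 1) / n_steps)
        prod *= a[0].conjugate() * b[0] + a[1].conjugate() * b[1]
    return -cmath.phase(prod)

gamma = berry_phase_spin_half(math.pi / 3)
print(gamma, -math.pi * (1 - math.cos(math.pi / 3)))
```

For polar angles below $π /2$ the result should match $- π (1 - \cos ϑ)$ , minus half the solid angle enclosed by the loop, and it does not depend on how fast the loop is traversed because only overlaps of neighbouring states enter.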

Exercise 10 (hard, short-answer).

A pendulum has its support point oscillated vertically at frequency $Ω$ much higher than the pendulum's natural frequency $ω_{0}$ . Argue using adiabatic / averaging methods that the effective Hamiltonian for the slow pendulum motion has an extra effective potential $V_{eff}$ proportional to the squared amplitude of the support oscillation. (Kapitza pendulum.)

Hint

Now the slow variable is the pendulum angle and the fast variable is the support phase, with the roles reversed compared with the standard adiabatic setup.

Answer

For a pendulum with support oscillating vertically at $y (t) = a \cos (Ω t)$ with $Ω ≫ ω_{0}$ and $a$ small, the equation of motion for the pendulum angle is $\ddot{θ} + (ω_{0}^{2} - (Ω^{2} a / ℓ) \cos (Ω t)) \sin θ = 0$ . Decompose $θ = Θ (t) + ξ (t)$ with $Θ$ slow and $ξ$ a fast small perturbation. Solving for the fast oscillation at frozen $Θ$ gives $ξ \approx - (a / ℓ) \cos (Ω t) \sin Θ$ , and averaging the rapidly-varying terms over one period of $Ω$ yields an effective slow Hamiltonian for $Θ$ with potential $V_{\mathrm{eff}} (Θ) = - ω_{0}^{2} \cos Θ + (Ω^{2} a^{2} / (4 ℓ^{2})) \sin^{2} Θ$ . The new term stabilises the inverted equilibrium $Θ = π$ provided $a^{2} Ω^{2} / (2 ℓ) > g$ (Kapitza's stabilisation). The adiabatic / averaging principle is the same as the action-conservation argument, run with fast and slow roles exchanged. Rubric: full credit for naming the averaging step, deriving the effective potential, and stating the inverted-pendulum-stability criterion.
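The stabilisation criterion can be tested by integrating the full equation of motion directly. The parameter values below ( $g = 9.81$ , $ℓ = 1$ , $a = 0.05$ , $Ω = 200$ , so that $a^{2} Ω^{2} / (2 ℓ) = 50 > g$ ), the classical RK4 integrator and the function name are assumptions chosen for this sketch.

```python
import math

def max_deviation_from_inverted(a, drive_freq, g=9.81, length=1.0,
                                theta0=math.pi - 0.1, t_end=5.0, dt=2e-4):
    """RK4 integration of theta'' = -(g - a*Omega^2*cos(Omega*t))/l * sin(theta),
    starting at rest at theta0; returns max |theta - pi| over the run."""
    def accel(t, theta):
        drive = a * drive_freq ** 2 * math.cos(drive_freq * t)
        return -(g - drive) / length * math.sin(theta)

    theta, vel, worst = theta0, 0.0, 0.0
    steps = int(round(t_end / dt))
    for n in range(steps):
        t = n * dt
        k1t, k1v = vel, accel(t, theta)
        k2t, k2v = vel + 0.5 * dt * k1v, accel(t + 0.5 * dt, theta + 0.5 * dt * k1t)
        k3t, k3v = vel + 0.5 * dt * k2v, accel(t + 0.5 * dt, theta + 0.5 * dt * k2t)
        k4t, k4v = vel + dt * k3v, accel(t + dt, theta + dt * k3t)
        theta += dt * (k1t + 2 * k2t + 2 * k3t + k4t) / 6.0
        vel += dt * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        worst = max(worst, abs(theta - math.pi))
    return worst

driven = max_deviation_from_inverted(a=0.05, drive_freq=200.0)
undriven = max_deviation_from_inverted(a=0.0, drive_freq=200.0)
print(f"driven: {driven:.3f} rad, undriven: {undriven:.3f} rad")
```

With the drive on, the pendulum should stay within a small neighbourhood of the inverted position; with the drive off, the same initial condition falls away from it, which is the qualitative content of the effective potential.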

Lean formalization [Intermediate+]

lean_status: none. Mathlib lacks the action-angle, slow-time, and averaging-theorem infrastructure needed for the adiabatic invariant theorem. A formal statement would, for now, have to be written as pseudocode over axiomatised prerequisites, with each axiom replaced by a real definition once the prerequisites are in Mathlib.

A formal route would assemble: smooth function spaces with parameter-dependent action-angle charts; the cohomological equation $ω^{*} \partial_{θ} G = \widetilde{F}$ on $\mathbb{T}^{1}$ solved by Fourier division; a near-identity symplectic change of variables eliminating the first-order driving term; a Gronwall-type integration over the slow time interval. The quantum adiabatic theorem (Born-Fock 1928) requires additional spectral-theoretic infrastructure: parameter-dependent self-adjoint operators with isolated discrete eigenvalues, Kato's theorem on smoothness of eigenprojectors, and the standard interaction-picture derivation. Both classical and quantum statements remain Mathlib-roadmap items.

Advanced results [Master]

The classical adiabatic theorem is the headline of a much larger structural circle of ideas concerning slow-fast Hamiltonian systems. Five refinements deepen, generalise, or sit beside the basic statement.

Higher-dimensional adiabatic invariance. In dimension $n \geq 2$ the situation splits sharply between resonant and non-resonant tori. Off the resonance hypersurfaces $\{⟨ k, ω^{*} ⟩ = 0\}$ for $k \in \mathbb{Z}^{n} \setminus \{0\}$ , the cohomological equation $⟨ k, ω^{*} ⟩ G_{k} = F_{k}$ has bounded solutions and the averaging argument runs as in the one-dimensional case, giving $\dot{\widehat{I}} = O (ϵ^{2})$ for the transformed action and $| I (t) - I (0) | \leq C ϵ$ on $t \in [0, T / ϵ]$ . On resonance, the equation has no bounded solution: $F_{k}$ at the resonant $k$ drives a secular $O (1)$ drift in the corresponding action component. The set of bad initial conditions has small measure for non-degenerate $H_{0}$ , so adiabatic invariance survives in a measure-typical sense, and this sets up the bridge into the perturbative regime of KAM theory 05.09.01. The Arnold-Kasuga theorem makes this quantitative: under non-degeneracy and an averaging hypothesis, action drift on time scales $ϵ^{- 1}$ is $O (ϵ)$ for measure-typical initial conditions, and the bad set shrinks polynomially with $ϵ$ .

Neishtadt exponential precision. Under real-analyticity of $H$ in the slow parameter $λ$ , the action drift improves dramatically: $| I (t) - I (0) | \leq C \exp (- c / ϵ)$ on $t \in [0, T / ϵ]$ . The proof iterates the averaging step $N$ times and optimises $N ≍ 1/ ϵ$ , using Cauchy bounds to control the growing constants at each step. Neishtadt's 1981 paper is the canonical reference ^{[Arnold-Kozlov-Neishtadt]}. The exponential bound does not extend beyond these hypotheses: separatrix crossings introduce $O (ϵ | \ln ϵ |)$ corrections that prevent uniform exponential precision, and Hannay-Berry phase contributions are themselves of size $ϵ$ rather than exponentially small.

Separatrix crossings. When the slow parameter $λ (τ)$ steers the system across a separatrix of the frozen Hamiltonian (for instance, a slowly-tuned pendulum whose energy crosses the unstable equilibrium), the classical adiabatic theorem fails. The orbit period diverges as the trajectory approaches the separatrix, the cohomological equation has unbounded inverse, and the action $I$ acquires a discontinuity of size $O (ϵ | \ln ϵ |)$ rather than $O (ϵ)$ . Cary-Escande-Tennyson 1986 and Hannay 1986 quantified this loss: the post-separatrix action equals the pre-separatrix action plus a probabilistically-distributed jump determined by the phase at which the trajectory entered the separatrix neighbourhood. The phenomenon is foundational for plasma transport in tokamaks, where field-line resonances act as separatrices and the quantitative jump theory specifies the leakage rate.

Quantum adiabatic theorem. Born and Fock proved in 1928 ^{[Born-Fock 1928]} that a slowly-varying quantum Hamiltonian $H (λ (ϵ t))$ with isolated discrete spectrum and no level crossings keeps its state in the $n$ -th instantaneous eigenstate up to corrections of size $ϵ$ . The proof constructs a near-identity unitary transformation eliminating the leading $θ$ -dependence, exactly as in the classical averaging argument, with the role of the angle variable played by the dynamical phase $\exp (- i E_{n} (λ) t /ℏ)$ . Berry 1984 ^{[Berry 1984]} sharpened the result by extracting the geometric phase $γ_{n} = \oint ⟨ n | i d_{λ} | n ⟩$ , the holonomy of the natural connection on the eigenstate bundle over $Λ$ . Berry phase identifies the adiabatic theorem with parallel transport in a complex line bundle and thereby connects classical and quantum perturbation theory through a single differential-geometric apparatus. The Aharonov-Anandan generalisation removes the eigenstate-tracking hypothesis and produces a purely geometric phase associated with any closed loop in projective Hilbert space.

Slow-fast Hamiltonian systems. The averaging principle generalises far beyond the adiabatic setup to systems of the form $$ \dot I = \epsilon f(I, \theta, y), \quad \dot\theta = \omega(I) + \epsilon g(I, \theta, y), \quad \dot y = h(I, \theta, y), $$ with $y \in \mathbb{R}^{m}$ a fast variable obeying its own dynamics. The averaged system $\dot{\bar{I}} = ϵ ⟨ f ⟩ (\bar{I})$ , where $⟨ f ⟩$ is the time-average over the fast trajectory, approximates the true action drift to within $O (ϵ)$ on time intervals of length $ϵ^{- 1}$ . Anosov 1960 and Neishtadt 1976 systematised this; the modern reference is the Sanders-Verhulst-Murdock textbook. Adiabatic invariants are the special case where the fast variable is the angle of an integrable subsystem; the broader averaging principle handles dissipative slow drifts, weakly coupled multi-frequency systems, and homogenisation problems. The slow drift is generically governed by an equation that is itself Hamiltonian (under symplectic structure), a feature that fails in dissipative averaging.
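The $O (ϵ)$ accuracy of the averaged system can be observed on a toy example with a single fast angle and no extra variable $y$ . The system $\dot{I} = - ϵ I (1 + \cos θ)$ , $\dot{θ} = 1 + I$ is an assumption invented for this sketch (it is dissipative, not Hamiltonian); its averaged equation $\dot{\bar{I}} = - ϵ \bar{I}$ has the exact solution $\bar{I} (t) = e^{- ϵ t}$ for $\bar{I} (0) = 1$ .

```python
import math

def averaging_error(eps=0.01, dt=1e-2):
    """Integrate I' = -eps*I*(1 + cos(theta)), theta' = 1 + I with RK4 and
    return max |I(t) - exp(-eps*t)| over 0 <= t <= 1/eps."""
    def rhs(i, th):
        return -eps * i * (1.0 + math.cos(th)), 1.0 + i

    i, th, worst = 1.0, 0.0, 0.0
    steps = int(round(1.0 / (eps * dt)))
    for n in range(steps):
        k1 = rhs(i, th)
        k2 = rhs(i + 0.5 * dt * k1[0], th + 0.5 * dt * k1[1])
        k3 = rhs(i + 0.5 * dt * k2[0], th + 0.5 * dt * k2[1])
        k4 = rhs(i + dt * k3[0], th + dt * k3[1])
        i += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        th += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
        i_avg = math.exp(-eps * (n + 1) * dt)    # exact averaged solution
        worst = max(worst, abs(i - i_avg))
    return worst

error = averaging_error()
print(f"max |I - I_avg| over t <= 1/eps: {error:.2e}")
```

Over the same time span the slow variable itself changes by an amount of order one, so an error of order $ϵ$ means the averaged equation captures the entire secular drift and misses only the small oscillation around it.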

Synthesis. The adiabatic theorem is one prototype of a broader paradigm: when a Hamiltonian system has fast oscillation and a slow drift of the parameters, the only secular dynamics on the slow timescale is what the angle-average of the perturbation generates. Putting this together, the same averaging step recurs in the Birkhoff normal form near elliptic fixed points, where iterated averaging produces a formal first integral 05.09.01, in the WKB approximation of geometric optics, where the action becomes the eikonal phase, in the Hamilton-Jacobi inversion that produces action-angle coordinates 05.05.03, and in the modern theory of adiabatic perturbations of Hamiltonian PDE. The foundational reason these scattered results all use the same step is that the linearised problem is in each case a cohomological equation $ω^{*} \partial_{θ} G = \widetilde{F}$ on a torus: in one frequency it has a bounded solution; in many frequencies the small-divisor structure decides whether the solution exists and, with KAM, what its support looks like. The bridge from the analytic input (the angle-average) to the geometric output (a near-identity coordinate change) is the foundational thread connecting one-dimensional adiabatic invariance to the full perturbative apparatus of Hamiltonian mechanics.

Full proof set [Master]

Lemma (cohomological equation on $\mathbb{T}^{1}$ ). Let $\omega^* > 0$ and let $\widetilde F : \mathbb{T}^1 \to \mathbb{R}$ be smooth with zero average $\int_0^{2\pi} \widetilde F \, d\theta = 0$ . Then the equation $\omega^* \partial_\theta G = \widetilde F$ has a unique smooth zero-mean solution $G$ , with $\|G\|_{C^k} \leq \pi (\omega^*)^{-1} \|\widetilde F\|_{C^k}$ for every $k \geq 0$ .

Proof. Write $\widetilde{F} (θ) = \sum_{k \neq 0} F_{k} e^{ik θ}$ . Set $G (θ) := \sum_{k \neq 0} \frac{F_{k}}{ik ω^{*}} e^{ik θ}$ . Then $\partial_{θ} G = \sum_{k \neq 0} (F_{k} / ω^{*}) e^{ik θ} = \widetilde{F} / ω^{*}$ , and any other solution differs from $G$ by a constant, which the zero-mean condition fixes. For the $C^{k}$ bound, every derivative of $G$ of order $j \geq 1$ equals $\partial_{θ}^{j - 1} \widetilde{F} / ω^{*}$ . For $G$ itself, zero mean forces $G (θ_{0}) = 0$ at some $θ_{0}$ , and every $θ$ lies within arc length $π$ of $θ_{0}$ , so $|G (θ)| \leq π \|\widetilde{F}\|_{C^{0}} / ω^{*}$ . $□$

Lemma (one-step near-identity transformation). In the setup of the adiabatic theorem with one frequency, let $F (I, θ; λ)$ be the perturbation Hamiltonian and $\widetilde{F} = F - ⟨ F ⟩$ its zero-mean part. Let $G$ solve $\omega^* \partial_\theta G = \widetilde F$ . Then the near-identity change of variables $\widehat I := I + \epsilon \, \partial_\theta G(I, \theta; \lambda) \dot\lambda$ is symplectic to leading order in $\epsilon$ and satisfies $\dot{\widehat I} = -\epsilon \, \partial_\theta \langle F \rangle \dot\lambda + O(\epsilon^2)$ , where $\partial_\theta \langle F\rangle = 0$ . Hence $\dot{\widehat I} = O(\epsilon^2)$ .

Proof. Compute $\dot{\widehat{I}} = \dot{I} + ϵ \frac{d}{dt} (\partial_{θ} G) \dot{λ} + O (ϵ^{2})$ , where the last term collects the time derivative of $\dot{λ} (ϵ t)$ . Using $\dot{I} = - ϵ \partial_{θ} F \dot{λ} + O (ϵ^{2})$ and $\frac{d}{dt} (\partial_{θ} G) = \partial_{θ}^{2} G \cdot \dot{θ} + \partial_{I} \partial_{θ} G \cdot \dot{I} + ϵ \, \partial_{λ} \partial_{θ} G \cdot \dot{λ}$ , the leading-order computation gives $\frac{d}{dt} (\partial_{θ} G) = ω^{*} \partial_{θ}^{2} G + O (ϵ) = \partial_{θ} \widetilde{F} + O (ϵ)$ , by differentiating the cohomological equation in $θ$ . Substituting, $$ \dot{\widehat I} = -\epsilon \, \partial_\theta (\langle F\rangle + \widetilde F) \dot\lambda + \epsilon \, \partial_\theta \widetilde F \dot\lambda + O(\epsilon^2) = -\epsilon \, \partial_\theta \langle F \rangle \dot\lambda + O(\epsilon^2) = O(\epsilon^2), $$ using $\partial_\theta \langle F\rangle = 0$ , so the cancellation is exact at first order. The symplecticity to leading order follows because, together with the angle shift $\widehat{θ} = θ - ϵ \, \partial_{I} G \dot{λ}$ , the map is the first-order part of the canonical transformation generated by $- ϵ G \dot{λ}$ . $\square$

Theorem (one-frequency adiabatic invariance). Under the hypotheses stated, $| I (t) - I (0) | \leq C ϵ$ uniformly on $t \in [0, T / ϵ]$ .

Proof. Apply the previous lemma to obtain a near-identity coordinate change $\widehat{I} = I + ϵ \, \partial_{θ} G \dot{λ}$ with $| \partial_{θ} G \dot{λ} |$ bounded uniformly by $\| \partial_{θ} G \|_{C^{0}} \| \dot{λ} \|_{C^{0}} \leq ω_{*}^{- 1} \| \widetilde{F} \|_{C^{0}} \| \dot{λ} \|_{C^{0}}$ , where $ω_{*} := \min ω^{*}$ . Hence $| \widehat{I} - I | \leq C_{1} ϵ$ at every time, with $C_{1}$ depending on the uniform bounds. From the lemma, $| \dot{\widehat{I}} | \leq C_{2} ϵ^{2}$ uniformly, so integrating over $t \in [0, T / ϵ]$ gives $$ |\widehat I(t) - \widehat I(0)| \leq T/\epsilon \cdot C_2 \epsilon^2 = C_2 T \epsilon. $$ Combining, $| I (t) - I (0) | \leq | I (t) - \widehat{I} (t) | + | \widehat{I} (t) - \widehat{I} (0) | + | \widehat{I} (0) - I (0) | \leq 2 C_{1} ϵ + C_{2} T ϵ = C ϵ$ . $□$

Theorem (Neishtadt exponential precision, statement only). If $H(q, p; \lambda)$ and $\lambda(\tau)$ extend holomorphically to a complex strip $|\operatorname{Im} \tau| < r$ in slow time, with bounded analytic norms, then $|I(t) - I(0)| \leq C \exp(-c/\epsilon)$ uniformly on $t \in [0, T/\epsilon]$, with $c, C$ depending only on $r$, on the analytic norms, and on $T$. The proof iterates the one-step lemma $N$ times and optimises $N \asymp 1/\epsilon$: at each step the residual is reduced by a factor $\epsilon$, at the cost of a derivative loss controlled by Cauchy bounds. The proof is omitted here; Neishtadt 1981 is the canonical reference ^{[Arnold-Kozlov-Neishtadt]}.
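
The trade-off behind the choice $N \asymp 1/\epsilon$ can be illustrated with a standard heuristic. The remainder model below, with a hypothetical constant $a > 0$ standing in for the Cauchy-bound constants, is an assumption made for illustration and is not taken from Neishtadt's proof. Suppose the residual after $N$ steps is bounded by $$ R_N \leq (a N \epsilon)^N. $$ Minimising $\log R_N = N \log(a N \epsilon)$ over $N$ gives $\log(a N \epsilon) + 1 = 0$, so $$ N_{\mathrm{opt}} = \frac{1}{e a \epsilon}, \qquad R_{N_{\mathrm{opt}}} \leq e^{-N_{\mathrm{opt}}} = \exp\!\left(-\frac{1}{e a \epsilon}\right), $$ which has the form $C \exp(-c/\epsilon)$ with $c = 1/(e a)$ in this model.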

Connections [Master]

Action-angle coordinates 05.02.04 — the canonical chart in which adiabatic invariance is set up; the action $I = (2 π)^{- 1} \oint p d q$ is the conserved quantity, and the angle variable $θ$ is the integration variable for the averaging step.
Integrable system 05.02.03 — the frozen-parameter system at each $λ$ is integrable; adiabatic invariance is the statement that the integrability structure deforms smoothly under slow parameter variation, with the action as the persistent label.
Symplectic manifold 05.01.02 — the ambient category in which the adiabatic theorem lives; the time-dependent canonical transformation that introduces action-angle coordinates is itself a symplectomorphism, and the cancellation of the first-order driving term relies on this symplectic structure.
KAM theorem 05.09.01 — the modern refinement of the perturbative picture for time-independent perturbations of integrable systems. Adiabatic invariance handles slow time-variation; KAM handles small static perturbations. Both rest on the same cohomological equation; the Diophantine analysis of small divisors enters once more than one frequency is present.
Generating functions 05.05.03 — the time-dependent action-angle map is described through a Type-II generating function $S (q, I; λ)$ , and the slow-time perturbation enters as $ϵ \partial_{λ} S \cdot \dot{λ}$ in the new Hamiltonian. The same generating-function machinery drives the KAM iteration.
Hamiltonian vector field 05.02.01 — Hamilton's equations for the slowly-varying Hamiltonian are the dynamics under study; the perturbation in action-angle coordinates is itself a Hamiltonian vector field with a small parameter, and the averaging principle is a statement about its long-time behaviour.
Poisson bracket 05.02.02 — the cohomological equation $ω^{*} \partial_{θ} G = \widetilde{F}$ is the linearisation of $\{H_{0}, G\} = \widetilde{F}$ at frozen action; this is the Poisson-bracket structure that recurs throughout perturbation theory.
Berry phase / quantum adiabatic theorem — the differential-geometric refinement of the classical adiabatic invariant: the Born-Fock theorem is the quantum analogue of $I = E / ω$ conservation, and Berry's geometric phase is the holonomy of the natural connection on the eigenstate bundle, an additional invariant beyond the classical action.
Magnetic-mirror confinement / tokamak physics — the practical application: the magnetic moment $μ = m v_{⊥}^{2} / (2 B)$ of a charged particle in a slowly-varying magnetic field is the adiabatic invariant of the gyration motion, and its conservation underlies plasma confinement in fusion devices.
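
The generating-function and Poisson-bracket items above can be made concrete for the harmonic oscillator $H = \tfrac12 p^2 + \tfrac12 \omega^2 q^2$ with $\lambda = \omega$. The following is a sketch under an assumed angle convention, $q = \sqrt{2I/\omega}\, \sin\theta$ and $p = \sqrt{2I\omega}\, \cos\theta$, which the text does not fix. The Type-II generating function $S(q, I; \omega) = \int_0^q \sqrt{2 I \omega - \omega^2 x^2}\, dx$ gives $$ F = \partial_\omega S = \frac{I}{2\omega} \sin 2\theta, \qquad \langle F \rangle = 0, \qquad \dot I = -\epsilon\, \partial_\theta F\, \dot\lambda = -\epsilon\, \frac{I}{\omega} \cos 2\theta\, \dot\lambda. $$ Since the mean vanishes, $\widetilde F = F$ here, and the cohomological equation $\omega\, \partial_\theta G = F$ is solved by $$ G = -\frac{I}{4 \omega^2} \cos 2\theta. $$ The right-hand side of $\dot I$ has zero mean over $\theta$, which is the averaging statement in this example.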

The cohomological equation with one frequency is the analytic input; a near-identity symplectic correction with bounded slow drift is the geometric output. This pairing is what ties adiabatic invariance to the perturbative theory of the rest of the chapter. The same averaging principle recurs in normal-form theory, in WKB asymptotics, and in the modern theory of Hamiltonian PDE. Adiabatic invariance is a special case of the general averaging principle for slow-fast Hamiltonian systems, while KAM treats the complementary case in which the perturbation of the integrable Hamiltonian is small and static rather than slowly varying.

Historical & philosophical context [Master]

Hendrik Lorentz raised the adiabatic-invariant problem at the 1911 Solvay Congress in Brussels: a pendulum whose length is slowly altered — does its energy stay quantised in Planck-style units? The question was concrete because Bohr's model of the hydrogen atom was in development, and Lorentz wanted to know which mechanical quantities were the right candidates for quantisation. Albert Einstein answered at the meeting: the action $\oint p d q$ , not the energy, is what survives slow tuning. Einstein's remark was the seed of what became the adiabatic-invariant programme of old quantum theory.

Paul Ehrenfest formalised the principle in his 1916 paper Adiabatische Invarianten und Quantentheorie in Annalen der Physik 51 ^{[Ehrenfest 1916]}. Ehrenfest defined adiabatic invariants as the mechanical quantities preserved under arbitrary slow continuous deformation of the parameters of a conditionally periodic system, and proposed them as the natural candidates for Bohr-Sommerfeld quantisation: $\oint p_{i} d q_{i} = n_{i} h$ . Independently, J. M. Burgers gave a parallel rigorous mechanics derivation the same year in the Versl. Akad. Wet. Amsterdam ^{[Burgers 1916]}. The Ehrenfest principle became the standard framework for old quantum theory between 1916 and 1925 — Sommerfeld's relativistic hydrogen, the Stern-Gerlach experiment, Born and Heisenberg's matrix-mechanics precursors all ran on adiabatic-invariant arguments. Born's 1925 The Mechanics of the Atom ^{[Born 1925]} is the canonical synthesis of the programme.

The arrival of the new quantum mechanics in 1925-1927 displaced adiabatic invariants from their primary role: Heisenberg's matrix mechanics and Schrödinger's wave mechanics gave direct dynamical equations for quantum amplitudes without the detour through classical actions. Born and Fock's 1928 paper Beweis des Adiabatensatzes in Z. Physik 51 ^{[Born-Fock 1928]} established the quantum-mechanical adiabatic theorem in its modern form: a system that starts in an energy eigenstate of a slowly varying Hamiltonian stays in the corresponding instantaneous eigenstate up to small corrections. The Born-Fock theorem is the direct quantum analogue of Ehrenfest's classical statement, with the eigenstate index $n$ playing the role of the classical action.

The rigorous classical theory was completed after the war, largely by the Russian school. T. Kasuga gave the first rigorous proof of one-frequency adiabatic invariance in 1961 in the Proceedings of the Japan Academy ^{[Kasuga 1961]}, and Vladimir Arnold systematised the multi-frequency theory in his 1963 papers and in §52 of Mathematical Methods of Classical Mechanics (1974/1989) ^{[Arnold]}. The Arnold-Kozlov-Neishtadt encyclopaedic treatment ^{[Arnold-Kozlov-Neishtadt]} consolidated the classical theory together with Neishtadt's exponential-precision results from the 1980s. Michael Berry's 1984 paper Quantal phase factors accompanying adiabatic changes ^{[Berry 1984]} reopened the quantum side by identifying the geometric-phase holonomy that Born-Fock had quietly absorbed into a phase convention, and the resulting Berry-phase apparatus became foundational for topological insulators and the modern theory of quantum geometry.

Bibliography [Master]

Prerequisites

05.02.04
05.02.03
05.01.02

Used in

05.09.01

Tier anchors

beginner: Goldstein *Classical Mechanics* §12.5 (informal slowly-varying-parameter picture); Arnold *Mathematical Methods of Classical Mechanics* §52 informal opening
intermediate: Arnold *Mathematical Methods of Classical Mechanics* §52; Goldstein §12.5
master: Ehrenfest 1916 *Adiabatische Invarianten und Quantentheorie* (originator); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 6; Born *The Mechanics of the Atom* (1925)

References

Ehrenfest 1916 — Adiabatische Invarianten und Quantentheorie · Annalen der Physik 51, originator paper coining the term
Burgers 1916 — Adiabatic invariants of mechanical systems · Versl. Akad. Wet. Amsterdam 25, parallel rigorous mechanics statement
Born 1925 — The Mechanics of the Atom · Ch. III, the adiabatic-invariant programme in old quantum theory
Born and Fock 1928 — Beweis des Adiabatensatzes · Z. Physik 51, quantum adiabatic theorem
Arnold — Mathematical Methods of Classical Mechanics · §52, classical adiabatic-invariant theory
Arnold, Kozlov, Neishtadt — Mathematical Aspects of Classical and Celestial Mechanics · Ch. 6, modern averaging-theoretic treatment
Goldstein, Poole, Safko — Classical Mechanics · §12.5, harmonic-oscillator and accelerator examples
Kasuga 1961 — On the adiabatic theorem for the Hamiltonian system of differential equations in classical mechanics · Proc. Japan Acad. 37, rigorous Russian-style averaging proof
Berry 1984 — Quantal phase factors accompanying adiabatic changes · Proc. Roy. Soc. London A 392, Berry-phase holonomy

Reviewer

TBD

Estimated time

beginner: 18m
intermediate: 45m
master: 80m