Adiabatic invariants
Anchor (Master): Ehrenfest 1916 *Adiabatische Invarianten und Quantentheorie* (originator); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 6; Born *The Mechanics of the Atom* (1925)
Intuition [Beginner]
Imagine a child on a swing whose chains are slowly being shortened. Each individual swing happens fast — back and forth in a second or two — while the shortening happens over many minutes. A natural question: as the chains contract, does the swing get wilder, calmer, or stay the same?
The answer is one of the most useful approximate conservation laws in physics. A particular geometric quantity — the area enclosed by the swing's trajectory in the position-velocity plane over one full oscillation — barely changes. The swing's energy goes up as the chains shorten, the frequency goes up, but the ratio of energy to frequency stays nearly fixed. Physicists call this ratio an adiabatic invariant — a quantity preserved when a parameter of the system varies slowly compared with the system's own internal motion.
This is why a slowly tuned guitar string keeps its loudness as its pitch rises, why charged particles stay trapped in slowly-varying magnetic bottles, and why early quantum theorists guessed correctly which quantities to quantize. The slow change does not destroy the bookkeeping; it carries it along.
Visual [Beginner]
Two phase portraits of a harmonic oscillator with frequency that slowly increases over time. On the left the oscillator at frequency : a circular orbit in the position-momentum plane with area . On the right the same oscillator at frequency after slow tuning: a narrower, taller ellipse with the same enclosed area . The orbit shape changes; the enclosed area survives.
The picture captures the headline result: the area swept out by one period of the orbit is the conserved bookkeeping quantity, and it survives slow changes to the system even when energy and frequency individually drift.
Worked example [Beginner]
Take a one-dimensional harmonic oscillator with mass and time-varying spring stiffness , giving the equation of motion . Start with — frequency — and an initial state with energy . The action is .
Slowly increase the stiffness over a long time so that at the end , frequency . The slow-parameter rate is for the total tuning time , with much larger than the oscillator period .
Claim: at the end the action is still approximately . Therefore the new energy is . The energy doubled even though no impulsive work was done — the slow tuning fed energy in at exactly the rate that keeps the ratio fixed.
Numerical check: integrate with rising linearly from to over . Compute at the start and the end. The two values agree to about — first-order in , exactly as the adiabatic theorem predicts.
Takeaway: the action is the right currency for tracking a slowly-tuned oscillator. Energy alone is not.
Check your understanding [Beginner]
Formal definition [Intermediate+]
Let be a symplectic manifold and let be a smooth Hamiltonian depending on time through a slowly-varying parameter , $$ H(q, p; \lambda(\tau)), \qquad \tau := \epsilon t, $$ with a small positive parameter and a smooth path in a parameter manifold . The variable is the fast time; is the slow time.
Assume that for each fixed , the Hamiltonian is integrable 05.02.03 in a neighbourhood of an orbit family of interest, with -dependent action-angle coordinates 05.02.04
$$
(I, \theta) \in U_\lambda^{\mathrm{aa}} \times \mathbb{T}^n \subset \mathbb{R}^n \times \mathbb{T}^n
$$
in which and depends only on the actions. The frequency map is .
The action variable at a fixed is the cycle integral $$ I_i(\lambda) = \frac{1}{2\pi} \oint_{\gamma_i(\lambda)} p \cdot dq, $$ where is the -th independent loop on the -torus level set; for this is the area enclosed by one period of the orbit, divided by .
A trajectory of the slowly-varying Hamiltonian has its action evaluated along the trajectory. The question is the size of as the slow time ranges over an interval of length — that is, the fast time ranges over an interval of length .
Counterexamples to common slips
- The unperturbed orbit is not preserved. The instantaneous trajectory in space changes, sometimes dramatically: the orbit at can be a shape utterly different from the orbit at . What survives is the action, not the orbit.
- Adiabatic invariance is not exact. The action drift is , not zero. Stronger statements — exponentially small drift in — hold under analyticity hypotheses (Neishtadt) but fail under generic smoothness.
- Resonant tori fail. In dimension , the bound requires the unperturbed frequencies to be non-resonant; on resonance hypersurfaces the action can drift by an amount. This is where adiabatic invariance and the KAM regime begin to interact.
- Action versus energy. is conserved adiabatically; energy generally is not. An oscillator whose frequency rises will gain energy at the rate that keeps fixed.
Key theorem with proof [Intermediate+]
Theorem (classical adiabatic invariant, one degree of freedom — Burgers / Ehrenfest 1916). Let be a smooth Hamiltonian on with parameter . Suppose for each in a compact set the level set is a smooth closed curve enclosing area , with frequency uniformly bounded above and below. Let be a smooth path with , and let solve Hamilton's equations for . Then the action satisfies $$ |I(t) - I(0)| \leq C \epsilon $$ uniformly for , where is fixed and depends only on , on and its derivatives, and on uniform bounds for and on the trajectory [Ehrenfest 1916; ref: TODO_REF Arnold].
Proof (averaging method). Pass to action-angle coordinates for the frozen Hamiltonian . In these coordinates the slowly-varying Hamiltonian becomes $$ \widetilde H(I, \theta; \lambda) = h_0(I; \lambda), $$ a function only of at each fixed , since the action-angle coordinates are tailored to each separately. The remaining subtlety is that the action-angle map itself depends on , so the symplectic transformation is time-dependent through .
Time-dependent canonical transformations contribute a generating-function correction. Writing the transformation as with generator in the Type-II convention, the new Hamiltonian in coordinates is $$ K(I, \theta; \tau) = h_0(I; \lambda(\tau)) + \epsilon \frac{\partial S}{\partial \lambda}(q(I, \theta; \lambda), I; \lambda) \cdot \dot\lambda(\tau), $$ where is the inverse coordinate map and . The first term depends only on the action; the second is a slow perturbation of size , and it depends on through the inverse coordinate map.
Hamilton's equations in the new variables become $$ \dot I = -\frac{\partial K}{\partial \theta} = -\epsilon \frac{\partial}{\partial \theta} \left( \frac{\partial S}{\partial \lambda} \right) \cdot \dot\lambda(\tau), \qquad \dot\theta = \frac{\partial K}{\partial I} = \omega(I; \lambda(\tau)) + O(\epsilon). $$ The first equation says is itself , so on a time interval the action could drift by — not yet a useful estimate. The improvement comes from averaging.
Averaging step. Decompose the perturbation into its angle-average and angle-fluctuation: $$ F(I, \theta; \lambda) := \frac{\partial S}{\partial \lambda}(q(I, \theta; \lambda), I; \lambda) = \langle F \rangle(I; \lambda) + \widetilde F(I, \theta; \lambda), $$ with and of zero average.
The angle-average part contributes to , since is independent of . So only the zero-average part drives the action.
Solve the cohomological equation. On the circle with frequency frozen at the current values, look for a function solving $$ \omega^* \frac{\partial G}{\partial \theta} = \widetilde F. $$ Fourier-expand ; the solution is . The denominators are bounded below by (no small-divisor problem in one frequency), so is smooth and bounded uniformly for in the compact regime.
Change of variables. Pass to a near-identity coordinate change . To leading order in , the equation of motion for is $$ \dot{\widehat I} = \dot I + \epsilon , \omega^* , \partial_\theta G \cdot \dot\lambda + O(\epsilon^2) = -\epsilon , \widetilde F \cdot \dot\lambda + \epsilon , \widetilde F \cdot \dot\lambda + O(\epsilon^2) = O(\epsilon^2). $$ The first-order driving term cancels by construction. Integrating over gives . Since at every time, the same bound holds for the original action .
Bridge. The averaging argument builds toward the multi-frequency theory of perturbations, where the same step-by-step cancellation of angle-dependent terms appears again in the KAM theorem 05.09.01 and in the Birkhoff normal form near elliptic fixed points. Both apparatuses begin from the same cohomological equation and run into the same dichotomy: in one frequency the equation is harmless because the Fourier denominator is a single positive number, while in frequencies the denominators become arbitrarily small near resonance hyperplanes. Putting these together, the foundational reason adiabatic invariance is so much easier than KAM is exactly that one frequency carries no small-divisor problem; KAM appears again in the next chapter as the perturbative theory of a slowly varying action that nonetheless has many frequencies. This is the same Newton-iteration scheme that recurs throughout perturbation theory — the bridge between the analytic input and the geometric output.
Exercises [Intermediate+]
Lean formalization [Intermediate+]
lean_status: none — Mathlib lacks the action-angle, slow-time, and averaging-theorem infrastructure needed for the adiabatic invariant theorem. A formal statement would look like the following pseudocode, with each axiom replaced by a real definition once the prerequisites are in Mathlib.
A formal route would assemble: smooth function spaces with parameter-dependent action-angle charts; the cohomological equation on solved by Fourier division; a near-identity symplectic change of variables eliminating the first-order driving term; a Gronwall-type integration over the slow time interval. The quantum adiabatic theorem (Born-Fock 1928) requires additional spectral-theoretic infrastructure: parameter-dependent self-adjoint operators with isolated discrete eigenvalues, Kato's theorem on smoothness of eigenprojectors, and the standard interaction-picture derivation. Both classical and quantum statements remain Mathlib-roadmap items.
Advanced results [Master]
The classical adiabatic theorem is the headline of a much larger structural circle of ideas concerning slow-fast Hamiltonian systems. Five refinements deepen, generalise, or sit beside the basic statement.
Higher-dimensional adiabatic invariance. In dimension the situation splits sharply between resonant and non-resonant tori. Off the resonance hypersurfaces for , the cohomological equation has bounded solutions and the averaging argument runs as in the one-dimensional case, giving and on . On resonance, the equation has no bounded solution: at the resonant drives a secular drift in the corresponding action component. The set of bad initial conditions has small measure for non-degenerate , so adiabatic invariance survives in measure-typical sense, and this sets up the bridge into the perturbative regime of KAM theory 05.09.01. The Arnold-Kasuga theorem makes this quantitative: under non-degeneracy and an averaging hypothesis, action drift on time scales is for measure-typical initial conditions, and the bad set shrinks polynomially with .
Neishtadt exponential precision. Under real-analyticity of in the slow parameter , the action drift improves dramatically: on . The proof iterates the averaging step times and optimises , using Cauchy bounds to control the growing constants at each step. Neishtadt's 1981 paper is the canonical reference [Arnold-Kozlov-Neishtadt]. The exponential bound is sharp: separatrix crossings introduce corrections that prevent uniform exponential precision, and Hannay-Berry phase contributions are themselves of size rather than exponentially small.
Separatrix crossings. When the slow parameter steers the system across a separatrix of the frozen Hamiltonian — for instance, a slowly-tuned pendulum whose energy crosses the unstable equilibrium — the classical adiabatic theorem fails. The orbit period diverges as the trajectory approaches the separatrix, the cohomological equation has unbounded inverse, and the action acquires a discontinuity of size rather than . Cary-Escande-Tennyson 1986 and Hannay 1986 quantified this loss: the post-separatrix action equals the pre-separatrix action plus a probabilistically-distributed jump determined by the phase at which the trajectory entered the separatrix neighbourhood. The phenomenon is foundational for plasma transport in tokamaks, where field-line resonances act as tomographic separatrices and the quantitative jump theory specifies the leakage rate.
Quantum adiabatic theorem. Born and Fock proved in 1928 [Born-Fock 1928] that a slowly-varying quantum Hamiltonian with isolated discrete spectrum and no level crossings keeps its state in the -th instantaneous eigenstate up to corrections of size . The proof constructs a near-identity unitary transformation eliminating the leading -dependence, exactly as in the classical averaging argument, with the role of the angle variable played by the dynamical phase . Berry 1984 [Berry 1984] sharpened the result by extracting the geometric phase , the holonomy of the natural connection on the eigenstate bundle over . Berry phase identifies the adiabatic theorem with parallel transport in a complex line bundle and thereby connects classical and quantum perturbation theory through a single differential-geometric apparatus. The Aharonov-Anandan generalisation removes the eigenstate-tracking hypothesis and produces a purely geometric phase associated with any closed loop in projective Hilbert space.
Slow-fast Hamiltonian systems. The averaging principle generalises far beyond the adiabatic setup to systems of the form $$ \dot I = \epsilon f(I, \theta, y), \quad \dot\theta = \omega(I) + \epsilon g(I, \theta, y), \quad \dot y = h(I, \theta, y), $$ with a fast variable obeying its own dynamics. The averaged system — where is the time-average over the fast trajectory — approximates the true action drift to within on time intervals of length . Anosov 1960 and Neishtadt 1976 systematised this; the modern reference is the Sanders-Verhulst-Murdock textbook. Adiabatic invariants are the special case where the fast variable is the angle of an integrable subsystem; the broader averaging principle handles dissipative slow drifts, weakly coupled multi-frequency systems, and homogenisation problems. The slow drift is generically governed by an equation that is itself Hamiltonian (under symplectic structure) — a feature that fails in dissipative averaging.
Synthesis. The adiabatic theorem is one prototype of a broader paradigm: when a Hamiltonian system has fast oscillation and a slow drift of the parameters, the only secular dynamics on the slow timescale is what the angle-average of the perturbation generates. Putting this together, the same averaging step recurs in the Birkhoff normal form near elliptic fixed points, where iterated averaging produces a formal first integral 05.09.01, in the WKB approximation of geometric optics, where the action becomes the eikonal phase, in the Hamilton-Jacobi inversion that produces action-angle coordinates 05.05.03, and in the modern theory of adiabatic perturbations of Hamiltonian PDE. The foundational reason these scattered results all use the same step is that the linearised problem is in each case a cohomological equation on a torus: in one frequency it has a bounded solution; in many frequencies the small-divisor structure decides whether the solution exists and, with KAM, what its support looks like. The bridge from the analytic input (the angle-average) to the geometric output (a near-identity coordinate change) is the foundational thread connecting one-dimensional adiabatic invariance to the full perturbative apparatus of Hamiltonian mechanics.
Full proof set [Master]
Lemma (cohomological equation on ). Let $\omega^ > 0\widetilde F : \mathbb{T}^1 \to \mathbb{R}\int_0^{2\pi} \widetilde F, d\theta = 0\omega^* \partial_\theta G = \widetilde FG|G|{C^k} \leq (\omega^*)^{-1} |\widetilde F|{C^k}k \geq 0$.*
Proof. Write . Set . Then . The bound follows from for , summed against the derivative-mode weight.
Lemma (one-step near-identity transformation). In the setup of the adiabatic theorem with one frequency, let be the perturbation Hamiltonian and its zero-mean part. Let solve $\omega^ \partial_\theta G = \widetilde F\widehat I := I + \epsilon G(I, \theta; \lambda) \dot\lambda\epsilon\dot{\widehat I} = -\epsilon \langle F \rangle_\theta \dot\lambda + O(\epsilon^2)\langle F\rangle_\theta := \partial_\theta \langle F\rangle = 0\dot{\widehat I} = O(\epsilon^2)$.*
Proof. Compute . Using and , the leading-order computation gives . Substituting, $$ \dot{\widehat I} = -\epsilon (\langle F\rangle + \widetilde F)\theta \dot\lambda + \epsilon \widetilde F \dot\lambda + O(\epsilon^2) = -\epsilon \widetilde F \dot\lambda + \epsilon \widetilde F \dot\lambda + O(\epsilon^2) = O(\epsilon^2), $$ using $\partial\theta \langle F\rangle = 0\partial_\theta \widetilde F = \widetilde F_\theta = \widetilde FG\partial_\theta G = \widetilde F/\omega^\widetilde F = \omega^ \partial_\theta GG\square$
Theorem (one-frequency adiabatic invariance). Under the hypotheses stated, uniformly on .
Proof. Apply the previous lemma to obtain a near-identity coordinate change with bounded uniformly by , where . Hence at every time, with depending on the uniform bounds. From the lemma, uniformly, so integrating over gives $$ |\widehat I(t) - \widehat I(0)| \leq T/\epsilon \cdot C_2 \epsilon^2 = C_2 T \epsilon. $$ Combining, .
Theorem (Neishtadt exponential precision, statement only). If and extend holomorphically to a complex strip in slow time, with bounded analytic norms, then uniformly on , with depending only on , on the analytic norms, and on . The proof iterates the one-step lemma times — at each step the residual is reduced by a factor , at the cost of derivative loss controlled by Cauchy bounds — and optimises . Stated without proof; Neishtadt 1981 is the canonical reference [Arnold-Kozlov-Neishtadt].
Connections [Master]
Action-angle coordinates
05.02.04— the canonical chart in which adiabatic invariance is set up; the action is the conserved quantity, and the angle variable is the integration variable for the averaging step.Integrable system
05.02.03— the frozen-parameter system at each is integrable; adiabatic invariance is the statement that the integrability structure deforms smoothly under slow parameter variation, with the action as the persistent label.Symplectic manifold
05.01.02— the ambient category in which the adiabatic theorem lives; the time-dependent canonical transformation that introduces action-angle coordinates is itself a symplectomorphism, and the cancellation of the first-order driving term relies on this symplectic structure.KAM theorem
05.09.01— the modern refinement of the perturbative picture for non-time-dependent perturbations of integrable systems. Adiabatic invariance handles slow time-variation; KAM handles small static perturbations. Both rest on the same cohomological equation and the same Diophantine analysis of the small-divisor problem.Generating functions
05.05.03— the time-dependent action-angle map is described through a Type-II generating function , and the slow-time perturbation enters as in the new Hamiltonian. The same generating-function machinery drives the KAM iteration.Hamiltonian vector field
05.02.01— Hamilton's equations for the slowly-varying Hamiltonian are the dynamics under study; the perturbation in action-angle coordinates is itself a Hamiltonian vector field with a small parameter, and the averaging principle is a statement about its long-time behaviour.Poisson bracket
05.02.02— the cohomological equation is the linearisation of at frozen action; this is the Poisson-bracket structure that recurs throughout perturbation theory.Berry phase / quantum adiabatic theorem — the differential-geometric refinement of the classical adiabatic invariant: the Born-Fock theorem is the quantum analogue of conservation, and Berry's geometric phase is the holonomy of the natural connection on the eigenstate bundle, an additional invariant beyond the classical action.
Magnetic-mirror confinement / tokamak physics — the practical application: the magnetic moment of a charged particle in a slowly-varying magnetic field is the adiabatic invariant of the gyration motion, and its conservation underlies plasma confinement in fusion devices.
The bridge between the analytic input — the cohomological equation with one frequency — and the geometric output — a near-identity symplectic correction with bounded slow drift — is the foundational reason adiabatic invariance unifies the perturbative theory of the rest of the chapter. Putting these together, the same averaging principle recurs in normal-form theory, in WKB asymptotics, and in the modern theory of Hamiltonian PDE; adiabatic invariance is a special case of the general averaging principle for slow-fast Hamiltonian systems, while KAM is the limit in which the slow parameter is taken to be a static perturbation of the integrable Hamiltonian.
Historical & philosophical context [Master]
Hendrik Lorentz raised the adiabatic-invariant problem at the 1911 Solvay Congress in Brussels: a pendulum whose length is slowly altered — does its energy stay quantised in Planck-style units? The question was concrete because Bohr's model of the hydrogen atom was in development, and Lorentz wanted to know which mechanical quantities were the right candidates for quantisation. Albert Einstein answered at the meeting: the action , not the energy, is what survives slow tuning. Einstein's remark was the seed of what became the adiabatic-invariant programme of old quantum theory.
Paul Ehrenfest formalised the principle in his 1916 paper Adiabatische Invarianten und Quantentheorie in Annalen der Physik 51 [Ehrenfest 1916]. Ehrenfest defined adiabatic invariants as the mechanical quantities preserved under arbitrary slow continuous deformation of the parameters of a conditionally periodic system, and proposed them as the natural candidates for Bohr-Sommerfeld quantisation: . Independently, J. M. Burgers gave a parallel rigorous mechanics derivation the same year in the Versl. Akad. Wet. Amsterdam [Burgers 1916]. The Ehrenfest principle became the standard framework for old quantum theory between 1916 and 1925 — Sommerfeld's relativistic hydrogen, the Stern-Gerlach experiment, Born and Heisenberg's matrix-mechanics precursors all ran on adiabatic-invariant arguments. Born's 1925 The Mechanics of the Atom [Born 1925] is the canonical synthesis of the programme.
The arrival of the new quantum mechanics in 1925-1927 displaced adiabatic invariants from their primary role: Heisenberg's matrix mechanics and Schrödinger's wave mechanics gave direct dynamical equations for quantum amplitudes without the detour through classical actions. Born and Fock's 1928 paper Beweis des Adiabatensatzes in Z. Physik 51 [Born-Fock 1928] established the quantum-mechanical adiabatic theorem in its modern form: a state slowly varying in time stays in the corresponding instantaneous energy eigenstate up to small corrections. The Born-Fock theorem is the direct quantum analogue of Ehrenfest's classical statement, with the eigenstate index playing the role of the classical action.
The rigorous classical theory was completed in the post-war Russian school. T. Kasuga gave the first rigorous proof of one-frequency adiabatic invariance in 1961 in the Proceedings of the Japan Academy [Kasuga 1961], and Vladimir Arnold systematised the multi-frequency theory in his 1963 papers and in §52 of Mathematical Methods of Classical Mechanics (1974/1989) [Arnold]. The Arnold-Kozlov-Neishtadt encyclopaedic treatment [Arnold-Kozlov-Neishtadt] consolidated the classical theory together with Neishtadt's exponential-precision results from the 1980s. Michael Berry's 1984 paper Quantal phase factors accompanying adiabatic changes [Berry 1984] reopened the quantum side by identifying the geometric-phase holonomy that Born-Fock had quietly absorbed into a phase convention, and the resulting Berry-phase apparatus became foundational for topological insulators and the modern theory of quantum geometry.