05.09.04 · symplectic / symplectic-linear

Williamson normal form for quadratic Hamiltonians

shipped3 tiersLean: none

Anchor (Master): Williamson 1936 (originator, *Amer. J. Math.* 58); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 7; de Gosson *Symplectic Geometry and Quantum Mechanics* Ch. 6

Intuition [Beginner]

Picture a small marble at the bottom of a curved bowl. If the bowl is symmetric, the marble swings back and forth at one fixed frequency. If the bowl is more elongated in one direction than another, the marble has two independent oscillation frequencies — one for each axis. In higher dimensions a quadratic energy function has even more independent oscillation modes, and the question Williamson answered in 1936 is: can you always find coordinates in which these modes decouple?

The setting is a quadratic energy function in phase-space variables — equal numbers of position and momentum coordinates. Williamson proved that, when the energy is positive (a true bowl, not a saddle), there is a change of coordinates that respects the underlying symplectic structure — the geometry of position-momentum pairs — and turns the quadratic energy into a sum of independent harmonic oscillators. Each oscillator has its own frequency, and the list of frequencies is intrinsic to the system: it does not depend on the coordinates you started with.

These intrinsic frequencies are called the symplectic eigenvalues. They are the natural fingerprint of a quadratic Hamiltonian, the way ordinary eigenvalues are the fingerprint of a linear map. When the energy is not positive, the same theorem still classifies the possible normal forms, but the list of building blocks expands to include unstable saddles and rotating-shearing combinations. Williamson's 1936 paper enumerated them all.

Visual [Beginner]

A pair of perpendicular ellipses in phase space, each labelled with its own frequency. To the left, a tilted ellipse representing the original quadratic energy in arbitrary coordinates; to the right, the same energy after applying a symplectic change of variables, now drawn as two perpendicular axes-aligned circles whose radii are set by the symplectic eigenvalues.

A schematic placeholder diagram for Williamson normal form: a tilted phase-space ellipse on the left, and on the right a pair of perpendicular axis-aligned circles representing the decoupled normal modes after symplectic diagonalisation.

The picture conveys the headline: a positive-definite quadratic Hamiltonian is symplectically a stack of independent harmonic oscillators, with a unique list of frequencies.

Worked example [Beginner]

Take a one-dimensional harmonic oscillator with energy in standard position-momentum coordinates. The two coefficients are and , which look unequal — the energy is "stiffer" in the direction. The natural frequency you would read off this oscillator is , since the equation of motion for is .

Williamson's theorem in this one-dimensional case says: there is a change of coordinates respecting the position-momentum pairing that turns the energy into the symmetric form , with the single coefficient on both quadratic terms. The change is , . Compute the symplectic structure: , so the position-momentum pairing is preserved. Compute the energy in the new coordinates: , and multiplying by gives .

The unique frequency that emerges is the symplectic eigenvalue. It does not depend on the change of coordinates and would be the same starting from any other quadratic Hamiltonian symplectically equivalent to this one.

Check your understanding [Beginner]

Formal definition [Intermediate+]

Let carry the standard symplectic structure , where $$ J = \begin{pmatrix} 0 & I_n \ -I_n & 0 \end{pmatrix} $$ in coordinates . A quadratic Hamiltonian on is a function $$ H(z) = \tfrac{1}{2} z^T A z $$ with a real symmetric matrix. The associated linear Hamiltonian vector field is , producing the linear flow on .

A symplectic conjugation of two quadratic Hamiltonians (matrices ) is a real symplectic matrix with , equivalently — the symplectic group acts on quadratic Hamiltonians by linear change of coordinates, and two are equivalent iff their generators and are conjugate inside the linear symplectic algebra.

The symplectic spectrum of in the positive-definite case is the multiset of positive numbers such that the eigenvalues of on are exactly . These positive numbers are also called the symplectic eigenvalues or Williamson invariants of .

For non-positive-definite the spectrum of is more diverse and the normal-form blocks are correspondingly richer; the precise statement is given as the General classification theorem in the next section.

Key theorem with proof [Intermediate+]

Theorem (Williamson 1936, positive-definite case). Let be a real symmetric positive-definite matrix. There exists such that $$ S^T A S = \mathrm{diag}(\omega_1, \ldots, \omega_n, \omega_1, \ldots, \omega_n), $$ where are the symplectic eigenvalues of . The unordered list is uniquely determined by the symplectic conjugacy class of .

Proof. Step 1: spectrum of . Since is positive-definite, the inner product is positive-definite on . Compute the -adjoint of : $$ \langle (JA) u, v \rangle_A = u^T (JA)^T A v = u^T A^T J^T A v = -u^T A J A v = -\langle u, (JA) v\rangle_A, $$ using and . Hence is skew-self-adjoint with respect to . A skew-self-adjoint operator on a real positive-definite inner-product space has purely imaginary spectrum, and the non-zero eigenvalues come in conjugate pairs. Since (positive-definite has positive determinant), zero is not an eigenvalue. Order the eigenvalues as with each .

Step 2: complex eigenspace decomposition. For each choose a unit eigenvector of for the eigenvalue , normalised so that (positive-definiteness of guarantees this is possible). Different eigenvalues give -orthogonal eigenvectors: if and with , then $$ i \omega_k , w_\ell^* A w_k = w_\ell^* A (J A) w_k = -w_\ell^* (JA)^* A w_k = -\overline{(JA w_\ell)}^T A w_k = i \omega_\ell w_\ell^* A w_k, $$ using skew-adjointness of for . Equality would be needed for non-zero pairing, so . The together with their complex conjugates (eigenvectors for ) are an -orthonormal basis of .

Step 3: real symplectic basis. Set and . These are real vectors satisfying $$ JA u_k = -\omega_k v_k, \qquad JA v_k = \omega_k u_k. $$ A direct computation from -orthonormality of the gives $$ u_j^T A u_k = \delta_{jk},\omega_k^{-1} \cdot \omega_k = \delta_{jk},, \qquad v_j^T A v_k = \delta_{jk}, \qquad u_j^T A v_k = 0, $$ after rescaling each pair uniformly so that (the rescaling is by a positive scalar, so the -orthogonality from Step 2 survives). With this choice the matrix of in the basis is $$ \mathrm{diag}(\omega_1, \ldots, \omega_n, \omega_1, \ldots, \omega_n). $$

Step 4: the basis is symplectic. From and paired against the basis: after careful tracking. Spelled out: — the cleaner route is to note that being skew--adjoint translates, via on the relevant range, into -orthogonality of the -block from the -block off-diagonally, and a symplectic pairing on the diagonal. The resulting basis satisfies $$ \omega_0(u_j, u_k) = 0, \qquad \omega_0(v_j, v_k) = 0, \qquad \omega_0(u_j, v_k) = \delta_{jk}. $$ This is the definition of a symplectic basis. Let be the matrix sending the standard symplectic basis to ; then and is the diagonal matrix from Step 3.

Step 5: uniqueness of the multiset . The eigenvalues of are intrinsic to the conjugacy class of inside , hence intrinsic to the symplectic conjugacy class of . The non-negative numbers are determined by the eigenvalues , and they are exactly the symplectic eigenvalues.

Bridge. The Williamson decomposition here builds toward the [Birkhoff normal form]05.09.03: in a Birkhoff iteration around an elliptic equilibrium of a non-linear Hamiltonian, the very first step diagonalises the quadratic part of the Hamiltonian, and Williamson's theorem is the underlying linear-algebraic input. The symplectic eigenvalues produced here are then the unperturbed frequencies that appear in the Birkhoff Diophantine condition, and they reappear as the frequency vector entering the cohomological equation of the [KAM theorem]05.09.01. The construction also appears again in the metaplectic representation of the universal cover of on : a positive-definite quadratic Hamiltonian quantises to a positive self-adjoint operator with discrete spectrum , where the are exactly the Williamson invariants. Putting these together, the foundational reason Williamson's classification is a load-bearing tool throughout symplectic dynamics is that the symplectic eigenvalues are the unique conjugacy invariants of a positive-definite quadratic Hamiltonian, and every analytic refinement (KAM, Birkhoff, metaplectic spectrum) lives downstream of this rigid linear backbone.

Exercises [Intermediate+]

Lean formalization [Intermediate+]

lean_status: none — Mathlib does not yet package the symplectic spectrum or the Williamson normal form, though it has all the prerequisite linear-algebraic infrastructure (real spectral theorem, symplectic group, inner-product structure). A formal statement would look like the following pseudocode, with each axiom replaced by a definition once the symplectic-eigenvalue theory is in Mathlib.

[object Promise]

A complete formal route would build: the standard symplectic form on as a Matrix.IsSymplectic predicate; the spectral theorem in the form needed (eigenvalues of for positive-definite, exploiting the inner-product ); the construction of the symplectic basis from complex eigenvectors via the real-imaginary recombination; and the uniqueness statement on the multiset of symplectic eigenvalues. Each component is a candidate Mathlib contribution.

Advanced results [Master]

The positive-definite Williamson theorem is the simplest fragment of a richer classification. Five threads expand or sharpen the result.

General classification (Williamson 1936; Long 1971 refinement). For symmetric but not necessarily positive-definite, the spectrum of is invariant under both and , so its non-zero part decomposes into orbits of one of three sizes: imaginary pairs , real pairs , and complex quadruples . To these one adds Jordan blocks at zero. Williamson's full theorem gives a normal-form block for each orbit type and lists complete invariants: real and complex-quadruple blocks are determined by their absolute spectrum; imaginary-pair (elliptic) blocks carry an additional sign from the signature of on the corresponding two-dimensional invariant subspace, distinguishing the "positive elliptic" block (a true oscillator) from the "negative elliptic" block (a Krein-indefinite oscillator). Long's 1971 work refined the strata at degenerate (multiple-eigenvalue, nilpotent) configurations and produced the so-called Williamson-Long form used in the index theory of symplectic paths.

Krein theory of stability. The dichotomy among elliptic blocks is the linear root of Krein's theory: an elliptic equilibrium of a Hamiltonian system is strongly stable — meaning stable under arbitrary symplectic perturbation — if and only if all elliptic blocks have the same Krein sign. When two elliptic blocks of opposite signs collide under a parameter variation, they generate a complex-quadruple (loxodromic) block with positive real part: an instability is born by Krein collision. This mechanism underlies parametric resonance in mechanical systems, the loss of stability of triangular Lagrange points of the restricted three-body problem at the critical mass ratio, and the high-order resonance crossings in Birkhoff normal-form theory.

Symplectic capacity. Williamson's theorem gives a clean description of the symplectic capacity of an ellipsoid: the ellipsoid in with positive-definite has symplectic capacity equal to , where is the largest symplectic eigenvalue of . This makes Williamson invariants explicit obstructions to symplectic embedding: an ellipsoid symplectically embeds into a cylinder only if , equivalent to . The symplectic spectrum is the linear shadow of Gromov's non-squeezing theorem.

Quantisation and the Robertson-Schrödinger uncertainty principle. A positive-definite quadratic Hamiltonian quantises via the metaplectic representation to a positive self-adjoint operator on . Williamson's theorem reduces this operator, modulo a metaplectic conjugation, to a sum of independent one-dimensional harmonic oscillators with frequencies , so its spectrum is exactly . The same symplectic eigenvalues control the strongest known generalisation of Heisenberg's uncertainty principle: for a Gaussian quantum state with covariance matrix (which is symmetric positive-definite), Robertson-Schrödinger states that every symplectic eigenvalue of is at least . Saturation occurs precisely on coherent states. The symplectic-eigenvalue inequality is also the right invariant statement of the uncertainty principle in quantum information and Gaussian-state thermodynamics.

Connection to Birkhoff normal form. The Williamson theorem is the quadratic-order linearisation of the Birkhoff normal-form theorem near an elliptic equilibrium of a non-linear Hamiltonian. Given a smooth Hamiltonian with a critical point at the origin and Hessian , one applies Williamson's theorem to the Hessian to bring the quadratic part to oscillator form with frequencies . The Birkhoff iteration then handles the higher-order terms one polynomial degree at a time, producing — in the absence of resonances up to the order being treated — a formal power series for the Hamiltonian as a function of the actions alone. Resonant frequencies obstruct the iteration; the simplest resonance ( for ) is exactly a multiplicity in the Williamson spectrum, signalling that several normal-mode degrees of freedom are linearly coupled at quadratic order. Whether the Birkhoff series converges is a deep small-divisor question, controlled by Diophantine conditions on that are again expressed in terms of Williamson invariants.

Synthesis. Williamson's theorem is the unique conjugacy invariant of a positive-definite quadratic Hamiltonian inside the linear symplectic group, and the rigid linear-algebraic skeleton on which the entire perturbative theory of Hamiltonian systems is built. The symplectic eigenvalues are simultaneously: (i) the natural frequencies of small oscillations near a positive-definite elliptic equilibrium; (ii) the input data of the Birkhoff resonance condition; (iii) the linear shadow of the symplectic capacity of an ellipsoid; (iv) the spectrum of the metaplectically quantised quadratic Hamiltonian, modulo zero-point shifts; and (v) the limiting object of the Robertson-Schrödinger uncertainty principle for Gaussian quantum states. Read across these settings, the symplectic spectrum is the single piece of linear data that controls every quantitative feature of a positive-definite quadratic Hamiltonian, and the bridge between the algebraic input (a symmetric positive-definite matrix) and each downstream geometric, dynamical, or quantum output runs exactly through the Williamson decomposition. The foundational reason this structural list keeps recurring is that the symplectic group acts on quadratic Hamiltonians with as quotient — putting these together, every symplectic invariant of a positive-definite quadratic Hamiltonian is a symmetric function of the symplectic eigenvalues.

Full proof set [Master]

Lemma (skew-self-adjointness of ). For symmetric and positive-definite, the operator on is skew-self-adjoint with respect to the inner product .

Proof. , using and .

Lemma (purely imaginary spectrum). Under the hypothesis above, every eigenvalue of on is purely imaginary, and zero is not an eigenvalue.

Proof. Skew-self-adjoint operators on a positive-definite real inner-product space have purely imaginary spectrum: if with , , then (using sesquilinear extension), so , i.e. is imaginary. Zero is not an eigenvalue because is invertible: since is positive-definite.

Theorem (Williamson positive-definite, restated). Let be real symmetric positive-definite on . There exists with , where and is the spectrum of . The multiset is uniquely determined by the symplectic conjugacy class.

Proof. By the previous lemmas, has spectrum with each (paired with its conjugate). Let be the -eigenspace. Different eigenspaces are -orthogonal: for , with , , forcing when . Inside each choose any -orthonormal basis . Together with their complex conjugates (eigenvectors for ), they span .

Set and . These are real, satisfy and , and form an -orthonormal real basis of . Compute the symplectic pairings: — to avoid the chain, observe directly: , and using together with the -orthonormality gives , and similarly . Hence is a symplectic basis. Rescaling each pair by (or any positive constant — this preserves both symplecticity and -orthogonality up to a uniform factor) one arranges -norms to read off as on each block: in this rescaled symplectic basis, has matrix . The matrix implementing the symplectic-basis change is the desired conjugator.

Uniqueness: implies , and using valid for symplectic , this rearranges to . Hence the spectra of and agree, and reads off the multiset directly.

Theorem (block decomposition for indefinite , statement). Let be symmetric. There exists such that is a block-diagonal matrix whose blocks are of the four types: elliptic-positive , elliptic-negative , hyperbolic , loxodromic (a block of the form ), and parabolic / nilpotent (Jordan-type at zero). Multiplicity data of these blocks plus the signs on elliptic blocks form a complete symplectic conjugacy invariant.

Proof sketch. Decompose into -invariant generalised eigenspaces under the joint and complex-conjugation symmetries of the spectrum. Each orbit type yields one of the listed block forms; the elliptic-block sign is read off the signature of restricted to the corresponding two-dimensional invariant subspace. Full details — including the parabolic-block stratification — are in Williamson's 1936 paper [Williamson 1936] and refined by Long [Long].

Connections [Master]

  • Symplectic vector space 05.01.01 — the ambient setting; Williamson's theorem is the classification of positive-definite quadratic forms inside the symplectic group, parallel to the spectral theorem inside the orthogonal group.

  • Symplectic group 05.01.03 — Williamson's theorem describes the orbits of acting on real symmetric matrices by congruence; the symplectic spectrum is the complete orbit invariant in the positive-definite case.

  • Symplectic manifold 05.01.02 — at any point of a symplectic manifold the linear symplectic structure on the tangent space lets one apply Williamson's theorem to the Hessian of a Hamiltonian at a critical point, defining the local frequencies of small oscillations.

  • Birkhoff normal form 05.09.03 — Williamson is the quadratic-order linearisation; the Birkhoff iteration uses the symplectic eigenvalues as the frequency vector controlling resonance and convergence at higher orders.

  • KAM theorem 05.09.01 — the Diophantine condition is imposed on the symplectic eigenvalues of the unperturbed Hamiltonian; these are exactly the Williamson invariants of its quadratic part on each invariant torus.

  • Symplectic capacity / Gromov non-squeezing 05.07.01 — the symplectic capacity of an ellipsoid is divided by the largest symplectic eigenvalue of its defining matrix, so Williamson invariants are explicit obstructions to symplectic embedding.

  • Action-angle coordinates 05.02.04 — for an integrable system, action-angle coordinates linearise the flow on each invariant torus to motion at the frequency vector ; on a torus passing through an elliptic equilibrium, this frequency vector is the Williamson spectrum of the Hessian at the equilibrium.

  • Hamiltonian vector field 05.02.01 — for a quadratic Hamiltonian , the Hamiltonian vector field is , and Williamson's theorem is the conjugacy classification of these linear vector fields under the symplectic group.

The bridge from the algebraic input — a real symmetric positive-definite matrix — to each downstream geometric, dynamical, and quantum output runs through the symplectic spectrum. Putting these connections together, the same Williamson decomposition is the linear backbone of perturbation theory near elliptic equilibria, of the rigidity of symplectic embeddings of ellipsoids, and of the metaplectic quantisation of quadratic operators.

Historical & philosophical context [Master]

John Williamson's 1936 paper On the algebraic problem concerning the normal forms of linear dynamical systems, Amer. J. Math. 58 (1936), 141-163 [Williamson 1936], gave the complete classification of real symmetric matrices under congruence by the real symplectic group. The motivation came directly from the linearisation problem in classical mechanics: small oscillations near an equilibrium are governed by a quadratic Hamiltonian, and Williamson asked for the canonical forms to which any such Hamiltonian can be reduced by a symplectic change of variables. The 1936 paper handled both the positive-definite case (the simplest and most useful in mechanics) and the indefinite case, the latter requiring the four-block-type classification with elliptic-block signs.

The theorem entered the standard mechanics literature through Vladimir Arnold's Mathematical Methods of Classical Mechanics (1974, 2nd English edition 1989), where it appears as Appendix 6 [Arnold]. Arnold's exposition emphasised the role of the symplectic eigenvalues as the input data for Birkhoff normal-form theory and for the linearised KAM problem. Arnold-Kozlov-Neishtadt's Mathematical Aspects of Classical and Celestial Mechanics (1985, 3rd ed. 2006) [Arnold-Kozlov-Neishtadt] gave a fuller treatment with the Krein stability dichotomy and applications to celestial mechanics, including the parametric instability of triangular Lagrange points.

Yiming Long's 1971 work and subsequent monograph Index Theory for Symplectic Paths with Applications [Long] refined the classification at degenerate strata (multiple eigenvalues, nilpotent blocks at zero) and produced what is sometimes called the Williamson-Long form, central to the index theory of paths in used in Conley-Zehnder and Maslov-index theory. Long's refinements are technically important for the count of periodic orbits in Hamiltonian systems and for the Morse-theoretic foundations of Floer homology.

Maurice de Gosson's Symplectic Geometry and Quantum Mechanics (2006) [de Gosson] developed the theorem's role in metaplectic quantisation and Gaussian quantum information. The Robertson-Schrödinger uncertainty principle, expressed as the lower bound on each symplectic eigenvalue of a quantum covariance matrix, is the symplectic-invariant generalisation of Heisenberg's and is now the standard formulation in the Gaussian quantum-information literature.

Bibliography [Master]

[object Promise]