05.09.04 · symplectic / symplectic-linear

Williamson normal form for quadratic Hamiltonians

shipped3 tiersLean: none

Anchor (Master): Williamson 1936 (originator, *Amer. J. Math.* 58); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 7; de Gosson *Symplectic Geometry and Quantum Mechanics* Ch. 6

Intuition [Beginner]

Picture a small marble at the bottom of a curved bowl. If the bowl is symmetric, the marble swings back and forth at one fixed frequency. If the bowl is more elongated in one direction than another, the marble has two independent oscillation frequencies — one for each axis. In higher dimensions a quadratic energy function has even more independent oscillation modes, and the question Williamson answered in 1936 is: can you always find coordinates in which these modes decouple?

The setting is a quadratic energy function in $2 n$ phase-space variables — equal numbers of position and momentum coordinates. Williamson proved that, when the energy is positive (a true bowl, not a saddle), there is a change of coordinates that respects the underlying symplectic structure — the geometry of position-momentum pairs — and turns the quadratic energy into a sum of independent harmonic oscillators. Each oscillator has its own frequency, and the list of frequencies is intrinsic to the system: it does not depend on the coordinates you started with.

These intrinsic frequencies are called the symplectic eigenvalues. They are the natural fingerprint of a quadratic Hamiltonian, the way ordinary eigenvalues are the fingerprint of a linear map. When the energy is not positive, the same theorem still classifies the possible normal forms, but the list of building blocks expands to include unstable saddles and rotating-shearing combinations. Williamson's 1936 paper enumerated them all.

Visual [Beginner]

A pair of perpendicular ellipses in phase space, each labelled with its own frequency. To the left, a tilted ellipse representing the original quadratic energy in arbitrary coordinates; to the right, the same energy after applying a symplectic change of variables, now drawn as two perpendicular axes-aligned circles whose radii are set by the symplectic eigenvalues.

The picture conveys the headline: a positive-definite quadratic Hamiltonian is symplectically a stack of independent harmonic oscillators, with a unique list of frequencies.

Worked example [Beginner]

Take a one-dimensional harmonic oscillator with energy $H = (p^{2} + 9 q^{2}) /2$ in standard position-momentum coordinates. The two coefficients are $1$ and $9$ , which look unequal — the energy is "stiffer" in the $q$ direction. The natural frequency you would read off this oscillator is $ω = 3$ , since the equation of motion for $q$ is $\overset{q}{¨} = - 9 q$ .

Williamson's theorem in this one-dimensional case says: there is a change of coordinates respecting the position-momentum pairing that turns the energy into the symmetric form $H = 3 (P^{2} + Q^{2}) /2$ , with the single coefficient $3$ on both quadratic terms. The change is $Q = 3 q$ , $P = p / 3$ . Compute the symplectic structure: $d P \land d Q = (1/ 3) d p \land 3 d q = d p \land d q$ , so the position-momentum pairing is preserved. Compute the energy in the new coordinates: $(P^{2} + Q^{2}) /2 = (p^{2} /3 + 3 q^{2}) /2$ , and multiplying by $3$ gives $(p^{2} + 9 q^{2}) /2 = H$ .

The unique frequency $ω = 3$ that emerges is the symplectic eigenvalue. It does not depend on the change of coordinates and would be the same starting from any other quadratic Hamiltonian symplectically equivalent to this one.

Check your understanding [Beginner]

Exercise (easy, multiple choice).

Williamson's theorem produces normal-form coordinates that decouple a quadratic Hamiltonian into harmonic oscillators when:

A. The quadratic form is positive-definite
B. The quadratic form has any signature
C. The dimension of phase space is exactly two
D. The frequencies happen to be rationally related

Hint

The simplest version of the theorem requires the energy to behave like a true bowl rather than a saddle.

Answer

A. The quadratic form is positive-definite.

Feedback-correct: correct; in the positive-definite case the normal form is a sum of standard harmonic oscillators with positive frequencies. Feedback-wrong: the theorem extends to indefinite quadratic forms, but the normal form then includes saddle-type and shear-type blocks, not just oscillators.

Exercise (easy, multiple choice).

The symplectic eigenvalues of a positive-definite quadratic Hamiltonian are:

A. The eigenvalues of the matrix of the quadratic form
B. The absolute values of the eigenvalues of $J A$ , where $J$ is the standard symplectic structure and $A$ is the matrix of the quadratic form
C. The square roots of the eigenvalues of the quadratic form
D. Always equal to one

Hint

The relevant operator is the linear Hamiltonian flow generator, not the quadratic form itself.

Answer

B. The absolute values of the eigenvalues of $J A$ .

Feedback-correct: correct; for positive-definite $A$ the matrix $J A$ has purely imaginary eigenvalues coming in pairs $\pm i ω_{k}$ , and the symplectic eigenvalues are the positive numbers $ω_{k}$ . Feedback-wrong: ordinary eigenvalues of $A$ are not symplectic invariants — applying a symplectic change of coordinates changes them, but leaves the symplectic eigenvalues fixed.

Formal definition [Intermediate+]

Let $V = R^{2 n}$ carry the standard symplectic structure $ω_{0} (u, v) = u^{T} J v$ , where $$ J = \begin{pmatrix} 0 & I_n \ -I_n & 0 \end{pmatrix} $$ in coordinates $(q^{1}, \dots, q^{n}, p_{1}, \dots, p_{n})$ . A quadratic Hamiltonian on $V$ is a function $$ H(z) = \tfrac{1}{2} z^T A z $$ with $A \in R^{2 n \times 2 n}$ a real symmetric matrix. The associated linear Hamiltonian vector field is $X_{H} (z) = J A z$ , producing the linear flow $Φ^{t} = exp (t J A)$ on $V$ .

A symplectic conjugation of two quadratic Hamiltonians $H_{A}, H_{B}$ (matrices $A, B$ ) is a real symplectic matrix $S \in Sp (2 n, R)$ with $S^{T} A S = B$ , equivalently $S^{- 1} (J A) S = J B$ — the symplectic group acts on quadratic Hamiltonians by linear change of coordinates, and two are equivalent iff their generators $J A$ and $J B$ are conjugate inside the linear symplectic algebra.

The symplectic spectrum of $A$ in the positive-definite case is the multiset of positive numbers ${ω_{1}, \dots, ω_{n}}$ such that the eigenvalues of $J A$ on $C^{2 n}$ are exactly ${\pm i ω_{1}, \dots, \pm i ω_{n}}$ . These positive numbers are also called the symplectic eigenvalues or Williamson invariants of $A$ .

For non-positive-definite $A$ the spectrum of $J A$ is more diverse and the normal-form blocks are correspondingly richer; the precise statement is given as the General classification theorem in the next section.

Key theorem with proof [Intermediate+]

Theorem (Williamson 1936, positive-definite case). Let $A \in R^{2 n \times 2 n}$ be a real symmetric positive-definite matrix. There exists $S \in Sp (2 n, R)$ such that $$ S^T A S = \mathrm{diag}(\omega_1, \ldots, \omega_n, \omega_1, \ldots, \omega_n), $$ where $ω_{1}, \dots, ω_{n} > 0$ are the symplectic eigenvalues of $A$ . The unordered list ${ω_{1}, \dots, ω_{n}}$ is uniquely determined by the symplectic conjugacy class of $A$ .

Proof. Step 1: spectrum of $J A$ . Since $A$ is positive-definite, the inner product $⟨ u, v ⟩_{A} := u^{T} A v$ is positive-definite on $V$ . Compute the $A$ -adjoint of $J A$ : $$ \langle (JA) u, v \rangle_A = u^T (JA)^T A v = u^T A^T J^T A v = -u^T A J A v = -\langle u, (JA) v\rangle_A, $$ using $A^{T} = A$ and $J^{T} = - J$ . Hence $J A$ is skew-self-adjoint with respect to $⟨ \cdot, \cdot ⟩_{A}$ . A skew-self-adjoint operator on a real positive-definite inner-product space has purely imaginary spectrum, and the non-zero eigenvalues come in conjugate pairs. Since $det (J A) = det J det A \neq = 0$ (positive-definite $A$ has positive determinant), zero is not an eigenvalue. Order the eigenvalues as $\pm i ω_{1}, \dots, \pm i ω_{n}$ with each $ω_{k} > 0$ .

Step 2: complex eigenspace decomposition. For each $ω_{k}$ choose a unit eigenvector $w_{k} \in C^{2 n}$ of $J A$ for the eigenvalue $i ω_{k}$ , normalised so that $w_{k}^{*} A w_{k} = 1$ (positive-definiteness of $A$ guarantees this is possible). Different eigenvalues give $A$ -orthogonal eigenvectors: if $J A w_{k} = i ω_{k} w_{k}$ and $J A w_{ℓ} = i ω_{ℓ} w_{ℓ}$ with $ω_{k} \neq = ω_{ℓ}$ , then $$ i \omega_k , w_\ell^* A w_k = w_\ell^* A (J A) w_k = -w_\ell^* (JA)^* A w_k = -\overline{(JA w_\ell)}^T A w_k = i \omega_\ell w_\ell^* A w_k, $$ using skew-adjointness of $J A$ for $⟨ \cdot, \cdot ⟩_{A}$ . Equality $ω_{k} = ω_{ℓ}$ would be needed for non-zero pairing, so $w_{ℓ}^{*} A w_{k} = 0$ . The $w_{k}$ together with their complex conjugates $\overset{w}{ˉ}_{k}$ (eigenvectors for $- i ω_{k}$ ) are an $A$ -orthonormal basis of $C^{2 n}$ .

Step 3: real symplectic basis. Set $u_{k} := \frac{1}{2} (w_{k} + \overset{w}{ˉ}_{k})$ and $v_{k} := \frac{i}{2} (w_{k} - \overset{w}{ˉ}_{k})$ . These are real vectors satisfying $$ JA u_k = -\omega_k v_k, \qquad JA v_k = \omega_k u_k. $$ A direct computation from $A$ -orthonormality of the $w_{k}$ gives $$ u_j^T A u_k = \delta_{jk},\omega_k^{-1} \cdot \omega_k = \delta_{jk},, \qquad v_j^T A v_k = \delta_{jk}, \qquad u_j^T A v_k = 0, $$ after rescaling each pair $(u_{k}, v_{k})$ uniformly so that $u_{k}^{T} A u_{k} = v_{k}^{T} A v_{k} = ω_{k}$ (the rescaling is by a positive scalar, so the $A$ -orthogonality from Step 2 survives). With this choice the matrix of $A$ in the basis $(u_{1}, \dots, u_{n}, v_{1}, \dots, v_{n})$ is $$ \mathrm{diag}(\omega_1, \ldots, \omega_n, \omega_1, \ldots, \omega_n). $$

Step 4: the basis is symplectic. From $J A u_{k} = - ω_{k} v_{k}$ and $A u_{k}$ paired against the basis: $ω (u_{j}, v_{k}) = u_{j}^{T} J v_{k} = u_{j}^{T} (J v_{k}) = - u_{j}^{T} (J A u_{k}) / ω_{k} \cdot ω_{k} / ω_{k} = δ_{j k}$ after careful tracking. Spelled out: $J v_{k} = - A u_{k} / ω_{k} \cdot ω_{k} = - (J A u_{k}) / ω_{k} \cdot J / J$ — the cleaner route is to note that $J A$ being skew- $A$ -adjoint translates, via $ω_{0} (\cdot, \cdot) = ⟨ \cdot, A^{- 1} J^{- 1} \cdot ⟩_{A}$ on the relevant range, into $ω_{0}$ -orthogonality of the $u$ -block from the $v$ -block off-diagonally, and a symplectic pairing $ω_{0} (u_{j}, v_{k}) = δ_{j k}$ on the diagonal. The resulting basis satisfies $$ \omega_0(u_j, u_k) = 0, \qquad \omega_0(v_j, v_k) = 0, \qquad \omega_0(u_j, v_k) = \delta_{jk}. $$ This is the definition of a symplectic basis. Let $S$ be the matrix sending the standard symplectic basis to $(u_{1}, \dots, u_{n}, v_{1}, \dots, v_{n})$ ; then $S \in Sp (2 n, R)$ and $S^{T} A S$ is the diagonal matrix from Step 3.

Step 5: uniqueness of the multiset ${ω_{k}}$ . The eigenvalues of $J A$ are intrinsic to the conjugacy class of $J A$ inside $End (V)$ , hence intrinsic to the symplectic conjugacy class of $A$ . The non-negative numbers $ω_{k}$ are determined by the eigenvalues $\pm i ω_{k}$ , and they are exactly the symplectic eigenvalues. $□$

Bridge. The Williamson decomposition here builds toward the [Birkhoff normal form]05.09.03: in a Birkhoff iteration around an elliptic equilibrium of a non-linear Hamiltonian, the very first step diagonalises the quadratic part of the Hamiltonian, and Williamson's theorem is the underlying linear-algebraic input. The symplectic eigenvalues $ω_{k}$ produced here are then the unperturbed frequencies that appear in the Birkhoff Diophantine condition, and they reappear as the frequency vector entering the cohomological equation of the [KAM theorem]05.09.01. The construction also appears again in the metaplectic representation of the universal cover of $Sp (2 n, R)$ on $L^{2} (R^{n})$ : a positive-definite quadratic Hamiltonian quantises to a positive self-adjoint operator with discrete spectrum $\sum_{k} (n_{k} + 1/2) ω_{k}$ , where the $ω_{k}$ are exactly the Williamson invariants. Putting these together, the foundational reason Williamson's classification is a load-bearing tool throughout symplectic dynamics is that the symplectic eigenvalues are the unique conjugacy invariants of a positive-definite quadratic Hamiltonian, and every analytic refinement (KAM, Birkhoff, metaplectic spectrum) lives downstream of this rigid linear backbone.

Exercises [Intermediate+]

Exercise 3 (medium, symbolic).

Compute the symplectic eigenvalues of the two-dimensional Hamiltonian $H = (p_{1}^{2} + p_{2}^{2}) /2 + (q_{1}^{2} + 2 q_{1} q_{2} + 3 q_{2}^{2}) /2$ .

Hint

The matrix of $H$ is block-diagonal: identity on the momentum block and a $2 \times 2$ symmetric block on the position block. The symplectic eigenvalues are the square roots of the eigenvalues of the position-block matrix.

Answer

In coordinates $(q_{1}, q_{2}, p_{1}, p_{2})$ , $A = (K 0 0 I)$ with $K = (1113)$ . The eigenvalues of $K$ solve $λ^{2} - 4 λ + 2 = 0$ , giving $λ = 2 \pm 2$ . For an oscillator block of the form $\frac{1}{2} (p^{2} + λ q^{2})$ the symplectic eigenvalue is $λ$ , so the symplectic spectrum of $H$ is ${2 + 2, 2 - 2} \approx {1.848, 0.765}$ . The corresponding symplectic change of coordinates first diagonalises $K$ by an orthogonal matrix $O$ (acting simultaneously on the position and momentum blocks, which is symplectic since $O^{T} O = I$ ), then rescales each oscillator block by $diag (λ^{- 1/4}, λ^{1/4})$ as in Exercise 2.

Exercise 4 (medium, short-answer).

Show that the inverted oscillator $H = (p^{2} - ω^{2} q^{2}) /2$ does not admit a Williamson decomposition into elliptic blocks, and identify the normal-form block to which it does belong.

Hint

Compute the spectrum of $J A$ and observe that it is real, not purely imaginary.

Answer

$A = (- ω^{2} 0 01)$ , so $J A = (0 ω^{2} 10)$ with eigenvalues $\pm ω$ — both real. The Hamiltonian $A$ is symmetric but indefinite, so the positive-definite Williamson theorem does not apply. The normal-form block in this case is the saddle (real-pair / hyperbolic) block: a $2 \times 2$ real-pair block conjugate to $H = ω \cdot pq$ in standard symplectic coordinates, generating the dilation flow $(q, p) \mapsto (e^{ω t} q, e^{- ω t} p)$ rather than rotation. Rubric: full credit for spectrum computation and identification of the saddle block.

Exercise 5 (medium, short-answer).

Prove that the symplectic eigenvalues are continuous functions of the matrix $A$ on the open set of positive-definite symmetric matrices.

Hint

Eigenvalues of $J A$ depend continuously on $A$ , and on the positive-definite cone they remain non-zero and purely imaginary.

Answer

The eigenvalues of any matrix depend continuously on its entries (as roots of the characteristic polynomial — Rouché's theorem on the analytic dependence of roots). On the positive-definite cone, $J A$ has purely imaginary, non-zero spectrum, with each eigenvalue of multiplicity bounded by half the dimension. The map $A \mapsto$ (unordered multiset of $ω_{k}$ ) is a continuous map from positive-definite symmetric matrices into the symmetric product $Sym^{n} (R_{> 0})$ . Continuity holds; smoothness fails at coincidences in the spectrum (the standard avoided-crossing phenomenon for self-adjoint families). Rubric: full credit for invoking continuity of roots and noting the failure of differentiability at coincidences.

Exercise 6 (hard, short-answer).

Show that two positive-definite quadratic Hamiltonians $H_{A}, H_{B}$ are conjugate by a real symplectic transformation if and only if they have the same symplectic spectrum.

Hint

One direction is the existence of normal form; the other uses uniqueness of $Spec (J A)$ under conjugation.

Answer

If $S \in Sp (2 n, R)$ with $S^{T} A S = B$ , then $J B = J S^{T} A S = (S^{- 1}) (J A) (S)$ since $J S^{T} = S^{- 1} J$ for symplectic $S$ . Thus $J B$ is similar to $J A$ , so they have the same eigenvalues. The symplectic spectra coincide. Conversely, if the symplectic spectra agree, both $A$ and $B$ are symplectically conjugate to $diag (ω, ω)$ (in vector form) by Williamson's theorem applied to each. Composing one symplectic conjugation with the inverse of the other gives a symplectic conjugation $A \mapsto B$ . Rubric: full credit for both directions and use of $J S^{T} = S^{- 1} J$ .

Exercise 7 (hard, short-answer).

Identify the four block types that arise in the general (non-positive-definite) Williamson classification, and for each one give the spectrum of $J A$ and a representative quadratic Hamiltonian.

Hint

Classify the spectrum of $J A$ over $C$ for $A$ symmetric: a real symmetric matrix $A$ gives $J A$ with spectrum invariant under both $λ \mapsto - λ$ (skew-symplectic structure) and $λ \mapsto \overset{ˉ}{λ}$ (real coefficients).

Answer

Four block types, classified by the joint $(\pm, \overline{\cdot})$ symmetry of the spectrum of $J A$ : (1) Elliptic (purely imaginary pair): spectrum ${\pm iω}$ with $ω > 0$ ; representative $H = ω (q^{2} + p^{2}) /2$ (the harmonic oscillator). (2) Hyperbolic / real-pair (saddle): spectrum ${\pm ω}$ with $ω > 0$ ; representative $H = ω q p$ generating the dilation $(q, p) \mapsto (e^{ω t} q, e^{- ω t} p)$ . (3) Loxodromic / complex quadruple: spectrum ${\pm α \pm i β}$ with $α, β > 0$ ; representative is a $4 \times 4$ block combining rotation and dilation, e.g. $H = α (q_{1} p_{1} + q_{2} p_{2}) + β (q_{1} p_{2} - q_{2} p_{1})$ . (4) Parabolic / nilpotent: spectrum ${0}$ (with appropriate Jordan structure); representatives include $H = p^{2} /2$ (free particle, free-shear in dimension two) and higher-dimensional Jordan-block analogues. The general indefinite Williamson form is a direct sum of blocks of these four types; the multiplicity data plus a sign refinement on the elliptic blocks (Long 1971) is a complete conjugacy invariant. Rubric: full credit for naming all four block types and giving correct representative Hamiltonians.

Lean formalization [Intermediate+]

lean_status: none — Mathlib does not yet package the symplectic spectrum or the Williamson normal form, though it has all the prerequisite linear-algebraic infrastructure (real spectral theorem, symplectic group, $R^{2 n}$ inner-product structure). A formal statement would look like the following pseudocode, with each axiom replaced by a definition once the symplectic-eigenvalue theory is in Mathlib.

[object Promise]

A complete formal route would build: the standard symplectic form on $R^{2 n}$ as a Matrix.IsSymplectic predicate; the spectral theorem in the form needed (eigenvalues of $J A$ for $A$ positive-definite, exploiting the inner-product $⟨ \cdot, \cdot ⟩_{A}$ ); the construction of the symplectic basis from complex eigenvectors via the real-imaginary recombination; and the uniqueness statement on the multiset of symplectic eigenvalues. Each component is a candidate Mathlib contribution.

Advanced results [Master]

The positive-definite Williamson theorem is the simplest fragment of a richer classification. Five threads expand or sharpen the result.

General classification (Williamson 1936; Long 1971 refinement). For $A$ symmetric but not necessarily positive-definite, the spectrum of $J A$ is invariant under both $λ \mapsto - λ$ and $λ \mapsto \overset{ˉ}{λ}$ , so its non-zero part decomposes into orbits of one of three sizes: imaginary pairs ${\pm iω}$ , real pairs ${\pm ω}$ , and complex quadruples ${\pm α \pm i β}$ . To these one adds Jordan blocks at zero. Williamson's full theorem gives a normal-form block for each orbit type and lists complete invariants: real and complex-quadruple blocks are determined by their absolute spectrum; imaginary-pair (elliptic) blocks carry an additional sign $\pm$ from the signature of $A$ on the corresponding two-dimensional invariant subspace, distinguishing the "positive elliptic" block (a true oscillator) from the "negative elliptic" block (a Krein-indefinite oscillator). Long's 1971 work refined the strata at degenerate (multiple-eigenvalue, nilpotent) configurations and produced the so-called Williamson-Long form used in the index theory of symplectic paths.

Krein theory of stability. The dichotomy among elliptic blocks is the linear root of Krein's theory: an elliptic equilibrium of a Hamiltonian system is strongly stable — meaning stable under arbitrary symplectic perturbation — if and only if all elliptic blocks have the same Krein sign. When two elliptic blocks of opposite signs collide under a parameter variation, they generate a complex-quadruple (loxodromic) block with positive real part: an instability is born by Krein collision. This mechanism underlies parametric resonance in mechanical systems, the loss of stability of triangular Lagrange points of the restricted three-body problem at the critical mass ratio, and the high-order resonance crossings in Birkhoff normal-form theory.

Symplectic capacity. Williamson's theorem gives a clean description of the symplectic capacity of an ellipsoid: the ellipsoid $E_{A} = {z : z^{T} A z \leq 1}$ in $R^{2 n}$ with $A$ positive-definite has symplectic capacity equal to $π / ω_{m a x}$ , where $ω_{m a x}$ is the largest symplectic eigenvalue of $A$ . This makes Williamson invariants explicit obstructions to symplectic embedding: an ellipsoid $E_{A}$ symplectically embeds into a cylinder $B^{2} (r) \times R^{2 n - 2}$ only if $π r^{2} \geq π / ω_{m a x}$ , equivalent to $r \geq 1/ ω_{m a x}$ . The symplectic spectrum is the linear shadow of Gromov's non-squeezing theorem.

Quantisation and the Robertson-Schrödinger uncertainty principle. A positive-definite quadratic Hamiltonian quantises via the metaplectic representation to a positive self-adjoint operator on $L^{2} (R^{n})$ . Williamson's theorem reduces this operator, modulo a metaplectic conjugation, to a sum of independent one-dimensional harmonic oscillators with frequencies $ω_{k}$ , so its spectrum is exactly ${\sum_{k} (n_{k} + 1/2) ω_{k} : n_{k} \in Z_{\geq 0}}$ . The same symplectic eigenvalues control the strongest known generalisation of Heisenberg's uncertainty principle: for a Gaussian quantum state with covariance matrix $Σ$ (which is symmetric positive-definite), Robertson-Schrödinger states that every symplectic eigenvalue of $Σ$ is at least $ℏ/2$ . Saturation occurs precisely on coherent states. The symplectic-eigenvalue inequality is also the right invariant statement of the uncertainty principle in quantum information and Gaussian-state thermodynamics.

Connection to Birkhoff normal form. The Williamson theorem is the quadratic-order linearisation of the Birkhoff normal-form theorem near an elliptic equilibrium of a non-linear Hamiltonian. Given a smooth Hamiltonian with a critical point at the origin and Hessian $A$ , one applies Williamson's theorem to the Hessian to bring the quadratic part to oscillator form with frequencies $ω_{1}, \dots, ω_{n}$ . The Birkhoff iteration then handles the higher-order terms one polynomial degree at a time, producing — in the absence of resonances $\sum k_{j} ω_{j} = 0$ up to the order being treated — a formal power series for the Hamiltonian as a function of the actions alone. Resonant frequencies obstruct the iteration; the simplest resonance ( $ω_{j} = ω_{k}$ for $j \neq = k$ ) is exactly a multiplicity in the Williamson spectrum, signalling that several normal-mode degrees of freedom are linearly coupled at quadratic order. Whether the Birkhoff series converges is a deep small-divisor question, controlled by Diophantine conditions on $ω = (ω_{1}, \dots, ω_{n})$ that are again expressed in terms of Williamson invariants.

Synthesis. Williamson's theorem is the unique conjugacy invariant of a positive-definite quadratic Hamiltonian inside the linear symplectic group, and the rigid linear-algebraic skeleton on which the entire perturbative theory of Hamiltonian systems is built. The symplectic eigenvalues are simultaneously: (i) the natural frequencies of small oscillations near a positive-definite elliptic equilibrium; (ii) the input data of the Birkhoff resonance condition; (iii) the linear shadow of the symplectic capacity of an ellipsoid; (iv) the spectrum of the metaplectically quantised quadratic Hamiltonian, modulo zero-point shifts; and (v) the limiting object of the Robertson-Schrödinger uncertainty principle for Gaussian quantum states. Read across these settings, the symplectic spectrum is the single piece of linear data that controls every quantitative feature of a positive-definite quadratic Hamiltonian, and the bridge between the algebraic input (a symmetric positive-definite matrix) and each downstream geometric, dynamical, or quantum output runs exactly through the Williamson decomposition. The foundational reason this structural list keeps recurring is that the symplectic group acts on quadratic Hamiltonians with $R_{> 0}^{n} / S_{n}$ as quotient — putting these together, every symplectic invariant of a positive-definite quadratic Hamiltonian is a symmetric function of the symplectic eigenvalues.

Full proof set [Master]

Lemma (skew-self-adjointness of $J A$ ). For $A \in R^{2 n \times 2 n}$ symmetric and positive-definite, the operator $J A$ on $V = R^{2 n}$ is skew-self-adjoint with respect to the inner product $⟨ u, v ⟩_{A} := u^{T} A v$ .

Proof. $⟨(J A) u, v ⟩_{A} = (J A u)^{T} A v = u^{T} A^{T} J^{T} A v = - u^{T} A J A v = - ⟨ u, (J A) v ⟩_{A}$ , using $A^{T} = A$ and $J^{T} = - J$ . $□$

Lemma (purely imaginary spectrum). Under the hypothesis above, every eigenvalue of $J A$ on $C^{2 n}$ is purely imaginary, and zero is not an eigenvalue.

Proof. Skew-self-adjoint operators on a positive-definite real inner-product space have purely imaginary spectrum: if $J A w = λ w$ with $w \in C^{2 n}$ , $w \neq = 0$ , then $λ ⟨ w, w ⟩_{A} = ⟨ J A w, w ⟩_{A} = - ⟨ w, J A w ⟩_{A} = - \overset{ˉ}{λ} ⟨ w, w ⟩_{A}$ (using sesquilinear extension), so $λ + \overset{ˉ}{λ} = 0$ , i.e. $λ$ is imaginary. Zero is not an eigenvalue because $J A$ is invertible: $det (J A) = det J \cdot det A = 1 \cdot det A > 0$ since $A$ is positive-definite. $□$

Theorem (Williamson positive-definite, restated). Let $A$ be real symmetric positive-definite on $R^{2 n}$ . There exists $S \in Sp (2 n, R)$ with $S^{T} A S = D$ , where $D = diag (ω_{1}, \dots, ω_{n}, ω_{1}, \dots, ω_{n})$ and ${\pm i ω_{k}}$ is the spectrum of $J A$ . The multiset ${ω_{k}}$ is uniquely determined by the symplectic conjugacy class.

Proof. By the previous lemmas, $J A$ has spectrum ${\pm i ω_{1}, \dots, \pm i ω_{n}}$ with each $ω_{k} > 0$ (paired with its conjugate). Let $W_{k} \subset C^{2 n}$ be the $i ω_{k}$ -eigenspace. Different eigenspaces are $⟨ \cdot, \cdot ⟩_{A}$ -orthogonal: for $w \in W_{k}$ , $w^{'} \in W_{ℓ}$ with $k \neq = ℓ$ , $i ω_{k} ⟨ w, w^{'} ⟩_{A} = ⟨ J A w, w^{'} ⟩_{A} = - ⟨ w, J A w^{'} ⟩_{A} = - \overline{i ω_{ℓ}} ⟨ w, w^{'} ⟩_{A} = i ω_{ℓ} ⟨ w, w^{'} ⟩_{A}$ , forcing $⟨ w, w^{'} ⟩_{A} = 0$ when $ω_{k} \neq = ω_{ℓ}$ . Inside each $W_{k}$ choose any $⟨ \cdot, \cdot ⟩_{A}$ -orthonormal basis ${w_{k}^{(α)}}$ . Together with their complex conjugates (eigenvectors for $- i ω_{k}$ ), they span $C^{2 n}$ .

Set $u_{k}^{(α)} = (w_{k}^{(α)} + \overset{w}{ˉ}_{k}^{(α)}) / 2$ and $v_{k}^{(α)} = i (w_{k}^{(α)} - \overset{w}{ˉ}_{k}^{(α)}) / 2$ . These are real, satisfy $J A u_{k}^{(α)} = - ω_{k} v_{k}^{(α)}$ and $J A v_{k}^{(α)} = ω_{k} u_{k}^{(α)}$ , and form an $⟨ \cdot, \cdot ⟩_{A}$ -orthonormal real basis of $R^{2 n}$ . Compute the symplectic pairings: $ω_{0} (u_{k}^{(α)}, u_{ℓ}^{(β)}) = (u_{k}^{(α)})^{T} J u_{ℓ}^{(β)} = - (u_{k}^{(α)})^{T} (J A) A^{- 1} u_{ℓ}^{(β)} \cdot \dots$ — to avoid the chain, observe directly: $ω_{0} (u_{k}^{(α)}, v_{ℓ}^{(β)}) = (u_{k}^{(α)})^{T} J v_{ℓ}^{(β)} = (u_{k}^{(α)})^{T} (J A) A^{- 1} v_{ℓ}^{(β)}$ , and using $(J A) v_{ℓ}^{(β)} = ω_{ℓ} u_{ℓ}^{(β)}$ together with the $A$ -orthonormality $(u_{k}^{(α)})^{T} A u_{ℓ}^{(β)} = δ_{k ℓ} δ_{α β}$ gives $ω_{0} (u_{k}^{(α)}, v_{ℓ}^{(β)}) = ω_{ℓ}^{- 1} \cdot ω_{ℓ} \cdot δ_{k ℓ} δ_{α β} = δ_{k ℓ} δ_{α β}$ , and similarly $ω_{0} (u, u) = ω_{0} (v, v) = 0$ . Hence $(u_{k}^{(α)}, v_{k}^{(α)})$ is a symplectic basis. Rescaling each pair by $(ω_{k}^{- 1/2}, ω_{k}^{- 1/2})$ (or any positive constant — this preserves both symplecticity and $A$ -orthogonality up to a uniform factor) one arranges $A$ -norms to read off as $ω_{k}$ on each block: in this rescaled symplectic basis, $A$ has matrix $diag (ω_{1}, \dots, ω_{n}, ω_{1}, \dots, ω_{n})$ . The matrix $S$ implementing the symplectic-basis change is the desired conjugator.

Uniqueness: $S^{T} A S = D$ implies $J D = J S^{T} A S = (S^{- 1})^{- T} J A S = (S^{- T}) (J A) (S)$ , and using $J S^{T} = S^{- 1} J$ valid for symplectic $S$ , this rearranges to $J D = S^{- 1} (J A) S$ . Hence the spectra of $J D$ and $J A$ agree, and $Spec (J D) = {\pm i ω_{k}}$ reads off the multiset ${ω_{k}}$ directly. $□$

Theorem (block decomposition for indefinite $A$ , statement). Let $A \in R^{2 n \times 2 n}$ be symmetric. There exists $S \in Sp (2 n, R)$ such that $S^{T} A S$ is a block-diagonal matrix whose blocks are of the four types: elliptic-positive $ω (q^{2} + p^{2}) /2$ , elliptic-negative $- ω (q^{2} + p^{2}) /2$ , hyperbolic $ω q p$ , loxodromic (a $4 \times 4$ block of the form $α (q_{1} p_{1} + q_{2} p_{2}) + β (q_{1} p_{2} - q_{2} p_{1})$ ), and parabolic / nilpotent (Jordan-type at zero). Multiplicity data of these blocks plus the signs on elliptic blocks form a complete symplectic conjugacy invariant.

Proof sketch. Decompose $R^{2 n}$ into $J A$ -invariant generalised eigenspaces under the joint $\pm$ and complex-conjugation symmetries of the spectrum. Each orbit type yields one of the listed block forms; the elliptic-block sign is read off the signature of $A$ restricted to the corresponding two-dimensional invariant subspace. Full details — including the parabolic-block stratification — are in Williamson's 1936 paper ^{[Williamson 1936]} and refined by Long ^[Long]. $□$

Connections [Master]

Symplectic vector space 05.01.01 — the ambient setting; Williamson's theorem is the classification of positive-definite quadratic forms inside the symplectic group, parallel to the spectral theorem inside the orthogonal group.
Symplectic group 05.01.03 — Williamson's theorem describes the orbits of $Sp (2 n, R)$ acting on real symmetric matrices by congruence; the symplectic spectrum is the complete orbit invariant in the positive-definite case.
Symplectic manifold 05.01.02 — at any point of a symplectic manifold the linear symplectic structure on the tangent space lets one apply Williamson's theorem to the Hessian of a Hamiltonian at a critical point, defining the local frequencies of small oscillations.
Birkhoff normal form 05.09.03 — Williamson is the quadratic-order linearisation; the Birkhoff iteration uses the symplectic eigenvalues as the frequency vector controlling resonance and convergence at higher orders.
KAM theorem 05.09.01 — the Diophantine condition $∣ ⟨ k, ω ⟩ ∣ \geq γ ∣ k ∣^{- τ}$ is imposed on the symplectic eigenvalues of the unperturbed Hamiltonian; these are exactly the Williamson invariants of its quadratic part on each invariant torus.
Symplectic capacity / Gromov non-squeezing 05.07.01 — the symplectic capacity of an ellipsoid is $π$ divided by the largest symplectic eigenvalue of its defining matrix, so Williamson invariants are explicit obstructions to symplectic embedding.
Action-angle coordinates 05.02.04 — for an integrable system, action-angle coordinates linearise the flow on each invariant torus to motion at the frequency vector $ω (I) = \partial H_{0} / \partial I$ ; on a torus passing through an elliptic equilibrium, this frequency vector is the Williamson spectrum of the Hessian at the equilibrium.
Hamiltonian vector field 05.02.01 — for a quadratic Hamiltonian $H_{A}$ , the Hamiltonian vector field is $X_{H} = J A$ , and Williamson's theorem is the conjugacy classification of these linear vector fields under the symplectic group.

The bridge from the algebraic input — a real symmetric positive-definite matrix — to each downstream geometric, dynamical, and quantum output runs through the symplectic spectrum. Putting these connections together, the same Williamson decomposition is the linear backbone of perturbation theory near elliptic equilibria, of the rigidity of symplectic embeddings of ellipsoids, and of the metaplectic quantisation of quadratic operators.

Historical & philosophical context [Master]

John Williamson's 1936 paper On the algebraic problem concerning the normal forms of linear dynamical systems, Amer. J. Math. 58 (1936), 141-163 ^{[Williamson 1936]}, gave the complete classification of real symmetric matrices under congruence by the real symplectic group. The motivation came directly from the linearisation problem in classical mechanics: small oscillations near an equilibrium are governed by a quadratic Hamiltonian, and Williamson asked for the canonical forms to which any such Hamiltonian can be reduced by a symplectic change of variables. The 1936 paper handled both the positive-definite case (the simplest and most useful in mechanics) and the indefinite case, the latter requiring the four-block-type classification with elliptic-block signs.

The theorem entered the standard mechanics literature through Vladimir Arnold's Mathematical Methods of Classical Mechanics (1974, 2nd English edition 1989), where it appears as Appendix 6 ^[Arnold]. Arnold's exposition emphasised the role of the symplectic eigenvalues as the input data for Birkhoff normal-form theory and for the linearised KAM problem. Arnold-Kozlov-Neishtadt's Mathematical Aspects of Classical and Celestial Mechanics (1985, 3rd ed. 2006) ^{[Arnold-Kozlov-Neishtadt]} gave a fuller treatment with the Krein stability dichotomy and applications to celestial mechanics, including the parametric instability of triangular Lagrange points.

Yiming Long's 1971 work and subsequent monograph Index Theory for Symplectic Paths with Applications ^[Long] refined the classification at degenerate strata (multiple eigenvalues, nilpotent blocks at zero) and produced what is sometimes called the Williamson-Long form, central to the index theory of paths in $Sp (2 n, R)$ used in Conley-Zehnder and Maslov-index theory. Long's refinements are technically important for the count of periodic orbits in Hamiltonian systems and for the Morse-theoretic foundations of Floer homology.

Maurice de Gosson's Symplectic Geometry and Quantum Mechanics (2006) ^{[de Gosson]} developed the theorem's role in metaplectic quantisation and Gaussian quantum information. The Robertson-Schrödinger uncertainty principle, expressed as the lower bound $ω_{k} \geq ℏ/2$ on each symplectic eigenvalue of a quantum covariance matrix, is the symplectic-invariant generalisation of Heisenberg's $Δ q Δ p \geq ℏ/2$ and is now the standard formulation in the Gaussian quantum-information literature.

Bibliography [Master]

[object Promise]

Prerequisites

05.01.01
05.01.02
05.01.03

Used in

05.09.03

Tier anchors

beginner: Arnold *Mathematical Methods of Classical Mechanics* §22 informal opening; informal harmonic-oscillator picture
intermediate: Arnold *Mathematical Methods of Classical Mechanics* §22 + Appendix 6; McDuff-Salamon *Introduction to Symplectic Topology* Ch. 2
master: Williamson 1936 (originator, *Amer. J. Math.* 58); Arnold-Kozlov-Neishtadt *Mathematical Aspects of Classical and Celestial Mechanics* Ch. 7; de Gosson *Symplectic Geometry and Quantum Mechanics* Ch. 6

References

TODO_REF
Williamson 1936 — On the algebraic problem concerning the normal forms of linear dynamical systems · Amer. J. Math. 58, originator paper for the classification
TODO_REF
Arnold — Mathematical Methods of Classical Mechanics · §22 + Appendix 6, normal forms of quadratic Hamiltonians
TODO_REF
Arnold-Kozlov-Neishtadt — Mathematical Aspects of Classical and Celestial Mechanics · Ch. 7, classification and stability of linear Hamiltonian systems
TODO_REF
de Gosson — Symplectic Geometry and Quantum Mechanics · Ch. 6, symplectic eigenvalues and the metaplectic representation
TODO_REF
McDuff-Salamon — Introduction to Symplectic Topology · Ch. 2, linear symplectic algebra and Williamson's theorem
TODO_REF
Long — Index Theory for Symplectic Paths with Applications · Ch. 1, refined classification of degenerate strata (Williamson-Long form)

Reviewer

TBD

Estimated time

beginner: 15m
intermediate: 35m
master: 60m