05.04.03 · symplectic / moment-reduction

Atiyah-Guillemin-Sternberg convexity theorem

shipped3 tiersLean: none

Anchor (Master): Atiyah 1982 (originator); Guillemin-Sternberg 1982 (independent originator); Cannas da Silva §27; Audin §IV; Kirwan *Cohomology of Quotients* §3

Intuition [Beginner]

When a torus acts on a closed symplectic manifold in a way that respects the symplectic form, every choice of one-parameter subgroup of gives a function on whose flow is the corresponding rotation. Bundling these functions together gives the moment map . The Atiyah-Guillemin-Sternberg theorem describes the image as a remarkably rigid object: it is always a convex polytope, and that polytope is the convex hull of the images of the points fixed by the whole torus.

The simplest case is the rotation of the round sphere about a vertical axis. The height function is the moment map; its image is the interval , and the two endpoints are the two poles — the only fixed points. Three-dimensional examples like produce filled triangles, with the three vertices coming from three fixed points of the torus action.

The theorem is a bridge between symplectic geometry and the combinatorics of polytopes. Once you know the polytope, you know a great deal about the manifold; the Delzant classification later turns this into a full equivalence for toric manifolds.

Visual [Beginner]

The sphere rotating about a vertical axis, with horizontal arrows showing the orbits as latitude circles, and the moment-map image drawn beside it as the segment with the north and south pole marked.

A schematic placeholder diagram for the Atiyah-Guillemin-Sternberg convexity theorem.

The picture you should keep is: the orbits of the torus collapse onto a flat polytope, and the corners of the polytope are exactly the fixed points.

Worked example [Beginner]

Take with homogeneous coordinates and the action of the two-torus by . The moment map is

The fixed points of the -action are the three coordinate vertices: , , and . Their moment-map images are , , and . Every other point of has positive entries summing to at most , so the image is the closed triangle with these three corners.

What this tells us: a complicated four-real-dimensional manifold collapses, under the moment map, to a flat triangle whose corners are the only fixed points. The convexity theorem promises this picture for every torus action of this type.

Check your understanding [Beginner]

Formal definition [Intermediate+]

Let be a closed connected symplectic manifold of dimension , and let be a torus acting on in a Hamiltonian fashion, with moment map , where and is its linear dual. For each the component of the moment map in direction is the smooth function

whose Hamiltonian flow is the one-parameter subgroup acting on . The element is generic when the closure of the one-parameter subgroup it generates is all of ; equivalently, has rationally independent coordinates with respect to a basis of the integer lattice .

The fixed-point set of the -action is denoted . Each connected component is a closed symplectic submanifold of . The image is a finite set when has finitely many components.

A subset is a convex polytope when it is the convex hull of a finite set of points. Equivalently, is a bounded intersection of finitely many closed half-spaces.

Key theorem with proof [Intermediate+]

Theorem (Atiyah 1982; Guillemin-Sternberg 1982). Let be a closed connected symplectic manifold with a Hamiltonian action of a torus and moment map $\mu : M \to \mathfrak{t}^$. Then:*

  1. For every $c \in \mathfrak{t}^\mu^{-1}(c)$ is connected (or empty).*
  2. The image is a convex polytope.
  3. That polytope is the convex hull of the images of the fixed-point components: .

The proof rests on a Morse-theoretic study of the components . The structure of the argument is the same in Atiyah's and in Guillemin-Sternberg's papers; the form below follows Atiyah's exposition.

Proof. The argument proceeds in five stages.

Stage 1 (Morse-Bott property). For any , the function is Morse-Bott, with critical set equal to the fixed-point set of the closed one-parameter subgroup . When is generic, , so the critical set of is exactly . The Hamiltonian condition identifies critical points of with zeros of the fundamental vector field , and zeros of are exactly the fixed points of the flow of . The Hessian of along the normal directions to is non-degenerate because is non-degenerate and the linearised action is a faithful representation of on the normal bundle.

Stage 2 (index parity). Every Morse-Bott index and coindex of is even. At a fixed point , the tangent space decomposes under the linearised -action into a direct sum of weight spaces, where each is a real two-dimensional subspace on which acts by a non-zero weight . The Hessian of on is a multiple of the standard rotation-invariant symplectic form, with sign equal to . Each weight space contributes a two-dimensional summand to either the negative or the positive eigenspace; both index and coindex are sums of these even-dimensional contributions.

Stage 3 (connectedness of level sets). Index parity implies that for every and every , the sublevel set is connected, and so is the level set . The argument is the standard Morse-Bott handle decomposition: as increases through a critical value of , the sublevel set changes by attaching handles along a sphere bundle of dimension . When the index is , the change adds a new connected component, but in our setting any such component is attached to the existing manifold along a coindex-direction sphere bundle of dimension as soon as the coindex is at least — which is forced by index-plus-coindex equalling the codimension of the critical submanifold and both being even and the codimension being positive away from a global minimum. When the index is , the attaching sphere is connected, so a connected sublevel set remains connected. A symmetric argument applies to superlevel sets, so level sets are connected. Connectedness of is the base case of the induction on .

Stage 4 (convexity, by induction on rank). The image is convex. Induct on . The case is empty. The case : for the generator is a connected real-valued function on connected , so its image is an interval — a convex polytope in . For the inductive step assume the statement for tori of rank . Pick any two points ; one needs to produce a path from to in that follows the straight line in . Choose a generic direction and use connectedness of the level sets of at every height between and : each such level set is connected by Stage 3. Restrict to ; this restricted moment map takes values in the affine hyperplane , and after quotienting by the rank- subtorus — which acts as the identity on by construction — one obtains a Hamiltonian action of the rank- quotient torus on the symplectic-reduced quotient with moment map taking values in this hyperplane. The inductive hypothesis applied to the reduced action gives convexity of the image inside the hyperplane. Sweeping from to produces the desired straight-line path in , establishing convexity.

Stage 5 (vertices and the polytope description). The image is a polytope, and its vertices are images of fixed-point components. A point is an extreme point of the convex set iff every attains its minimum or maximum on for some ; the critical-set characterisation of Stage 1 gives at every extreme point. Conversely, the moment-map image of any fixed-point component is a single point in , and these finitely many points generate by convexity. Hence , a convex polytope with vertex set contained in . Connectedness of fibres in (1) follows from Stage 3 applied to a generic and the fact that two points in the same generic--level set lying in are connected through the reduced-space convexity of Stage 4.

Bridge. The convexity theorem builds toward 05.04.04 (Delzant's theorem), where the polytope produced here appears again in the role of a complete invariant: when and the action is effective, the Delzant polytope determines up to equivariant symplectomorphism. The Morse-Bott index-parity argument is exactly the mechanism that lets reduced-space symplectic geometry 05.04.02 inherit a smooth structure from the ambient , because connectedness of is what makes the regular reduction a connected symplectic manifold rather than a disjoint union. Putting these together, the foundational reason convexity holds is exactly that the symplectic structure forces every Morse-Bott index to come in two-dimensional weight-space contributions — index parity is the bridge between the analytic non-degeneracy of and the combinatorial flatness of . The theorem identifies the Hamiltonian-action structure on with a piece of polyhedral data on .

Exercises [Intermediate+]

Lean formalization [Intermediate+]

Mathlib does not currently support Hamiltonian torus actions or moment maps in a form that admits the Atiyah-Guillemin-Sternberg theorem. The skeleton below sketches the statement at the type level; the proof would require the Morse-Bott apparatus described in lean_mathlib_gap.

[object Promise]

Advanced results [Master]

The convexity theorem belongs to a family of results that combine Hamiltonian symmetry with convex-geometric output. Several distinct generalisations exist, each retaining the Morse-Bott / level-set-connectedness skeleton.

Non-abelian convexity (Kirwan 1984). When the symmetry group is a compact connected Lie group rather than a torus, the moment map no longer has convex image directly; the image is -invariant. Kirwan's theorem states that the intersection with a closed positive Weyl chamber is a convex polytope. The proof reduces to the abelian case applied to the maximal torus, plus the fact that every -orbit in meets in exactly one point.

Symplectic implosion (Guillemin-Jeffrey-Sjamaar 2002). A construction that turns a Hamiltonian -manifold into a Hamiltonian -manifold whose moment polytope is exactly the Kirwan polytope. The implosion machinery realises Kirwan's theorem as an instance of Atiyah-Guillemin-Sternberg applied to a derived -manifold.

Singular convexity. Convexity persists when one drops compactness and replaces it by properness of the moment map: Lerman-Meinrenken-Tolman-Woodward 1998 and earlier Hilgert-Neeb-Plank 1994 established convexity of the image under properness assumptions.

Local convexity at fixed points. Atiyah's argument also yields a local statement: the image of near any fixed point is the cone on the polytope of the linearised torus action on , intersected with a translate of the fixed-point image . This is the seed of the local description used in Delzant's classification.

Morse theory of . Kirwan's Cohomology of Quotients (1984) develops a Morse theory for the function , whose critical sets stratify in a way compatible with the moment-polytope facets. This stratification is the key technical input for computing cohomology of symplectic quotients .

Vertex-edge structure. When acts effectively with — the toric case — the moment polytope is a Delzant polytope: simple, rational, smooth at each vertex (in the sense that the edge directions at each vertex form a -basis of the integer lattice). The edges of the polytope correspond to one-dimensional fixed-component spheres in , and their lengths are determined by the symplectic areas of those spheres.

Synthesis. The Atiyah-Guillemin-Sternberg theorem identifies a piece of symplectic geometry with a piece of polyhedral combinatorics, and the bridge is a single analytic fact: components of the moment map are Morse-Bott functions whose indices are constrained to be even by the weight-space decomposition of the symplectic structure. Read as a structural statement, the theorem generalises the elementary fact that a connected real-valued Morse function on a connected manifold has an interval as its image — index parity is what allows iterating to higher rank without breaking connectedness. Read in the opposite direction, the theorem is dual to a representation-theoretic fact: the multiplicity of a weight in a quantisation of is supported on the lattice points of the polytope , and the polytope is therefore the support of the spectrum of a quantum operator.

Putting these together, the foundational reason that fixed points control the global image is exactly Stage 2 of the proof: the symplectic non-degeneracy forces the second-order behaviour of at any fixed point to be a sum of two-dimensional rotation Hessians, and this single linear-algebra ingredient is the bridge between the analytic moment map and the combinatorial polytope. The theorem appears again in Delzant's classification, where the polytope is upgraded from invariant to complete invariant — putting these together one sees that every compact symplectic toric manifold is exactly its polytope, in the precise categorical sense that the polytope-to-manifold construction is an equivalence of groupoids.

Full proof set [Master]

Lemma (weight-space decomposition). Let be a closed one-parameter subgroup of acting on a symplectic vector space by symplectomorphisms, and let be the fixed subspace. Then the symplectic complement admits an orthogonal direct-sum decomposition into real two-dimensional symplectic subspaces, indexed by a finite set of non-zero weights , such that acts on by the rotation of weight .

Proof. The action of on commutes with , so the complexification decomposes into character spaces on which acts by the character . Pairs for assemble into real two-dimensional summands ; the symplectic form pairs each with itself non-degenerately because preserves . The fixed subspace is the summand and is automatically symplectic.

Lemma (Hessian sign on weight spaces). In the setting of the previous lemma, the Hessian of at the fixed point, restricted to the weight space , equals , where is the positive-definite rotation-invariant inner product on induced by the symplectic structure and a choice of -invariant complex structure.

Proof. On choose a -invariant complex structure with . Then acts on by , the moment map for this circle action is (the sign convention from ). The Hessian at is , with sign relative to the positive-definite . Either sign of gives a definite quadratic form on , contributing to either index or coindex.

Lemma (connectedness of level sets). Let be Morse-Bott on a closed connected manifold with all indices and coindices even. Then for every the level set is empty or connected, and so is every sublevel set.

Proof. Connectedness of sublevel sets is by induction on critical values. Below the minimum, the sublevel set is empty. Just above the minimum, the sublevel set retracts onto the connected critical submanifold (assuming connected has a connected global-minimum critical set; if there are multiple minima, repeat the argument component by component, but on connected the global minimum must be a single connected submanifold by Morse-Bott + connectedness of , since otherwise a path between two minima would have to cross a critical level of higher index whose attaching coindex sphere is connected, making the two pieces equal in ). At a critical value the sublevel set changes by attaching the disk bundle of the negative normal bundle of the critical submanifold , glued along the sphere bundle of dimension . Even index makes this attaching sphere bundle have connected fibres, hence the gluing keeps connectedness. Index would create a new component, but on connected and Morse-Bott, this only happens at the global minimum (already handled). Symmetric argument for superlevel sets via . The level set is a deformation retract of a thin shell between sublevel and superlevel, both of which are connected; so is connected.

Theorem (full statement, restated and proved). Let be closed connected symplectic with a Hamiltonian -action and moment map . Then is connected (or empty) for all $c \in \mathfrak{t}^\mu(M) = \mathrm{conv}\big(\mu(M^T)\big)$ is a convex polytope.*

Proof. Stages 1-5 of the proof in the Intermediate section, made rigorous by the three lemmas above. Stage 1 uses the Hamiltonian condition and non-degeneracy of to identify critical points of with zeros of the fundamental vector field, hence with fixed points of the closure of . The Morse-Bott property along comes from the weight-space lemma applied to the linearised action on the normal bundle. Stage 2 is the Hessian-sign lemma: every weight-space contribution is even-dimensional, so total index and coindex are even. Stage 3 is the connectedness-of-level-sets lemma applied to for any .

Stage 4 (convexity) inducts on . The base case : image is an interval. Inductive step: pick generic , restrict to a level set , observe that inherits a Hamiltonian action of the rank- quotient torus via the symplectic reduction (regular reduction at the level of gives a smooth symplectic manifold when is a regular value of , which it is for almost every in the image; the restricted moment map is the projection of to the hyperplane). The inductive hypothesis gives convexity of the image inside the hyperplane. Sweeping from to produces the straight-line path. Stage 5 (vertex characterisation) follows because every extreme point of the image is the unique minimum or maximum of some , which is a fixed-point image. Connectedness of fibres uses Stage 3 plus the inductive convexity.

Connections [Master]

  • Moment map 05.04.01. Atiyah-Guillemin-Sternberg is the central global theorem about the moment map; the unit's three-condition definition is what enables Stage 1 of the proof, and the Hamiltonian sign convention used here is the same as that unit's.

  • Symplectic reduction 05.04.02. The inductive step in Stage 4 of the proof uses regular Marsden-Weinstein-Meyer reduction at non-critical levels of to descend to a torus action of one rank lower. Index parity in Stage 2 also explains why regular reduction levels have connected fibres — a fact used implicitly in the smooth-quotient construction.

  • Delzant's theorem 05.04.04. The downstream classification result: in the toric case (), the moment polytope produced by Atiyah-Guillemin-Sternberg is upgraded to a complete invariant, classifying compact symplectic toric manifolds up to equivariant symplectomorphism. The vertices and edge structure of the polytope encode the global manifold.

  • Coadjoint orbit 05.03.01. The Schur-Horn theorem is the convexity theorem applied to a coadjoint orbit of with the maximal-torus action — giving the permutohedron as moment polytope and the diagonal-of-Hermitian-matrix as moment map. This is the historical bridge between the convexity theorem and 1920s linear algebra.

  • Symplectic manifold 05.01.02. The convexity statement crucially uses non-degeneracy of — both in the Morse-Bott property of at fixed points and in the weight-space sign argument. A degenerate two-form would not force index parity, and the conclusion would fail.

  • Hamiltonian vector field 05.02.01. The components are the Hamiltonians whose flow is the corresponding one-parameter subgroup of ; the proof's analytic backbone is precisely the unit's identity .

Historical & philosophical context [Master]

Michael Atiyah's 1982 paper Convexity and commuting Hamiltonians (Bull. London Math. Soc. 14, 1-15) [Atiyah 1982] and Victor Guillemin and Shlomo Sternberg's Convexity properties of the moment mapping (Invent. Math. 67, 491-513) [Guillemin-Sternberg 1982] appeared independently and in the same year. Both proofs use the Morse-theoretic structure of moment-map components, with Atiyah's exposition emphasising the index-parity step and Guillemin-Sternberg's developing the level-set-connectedness machinery in greater generality. The two papers are joint originators of the theorem.

The result has a substantial linear-algebra prehistory. Issai Schur's 1923 paper Über eine Klasse von Mittelbildungen mit Anwendungen auf die Determinantentheorie (Sitzungsber. Berlin. Math. Ges. 22) [Schur 1923] established that the diagonal entries of a Hermitian matrix with prescribed eigenvalues lie in the convex hull of the permutations of the eigenvalues — the "" direction of what is now called the Schur-Horn theorem. Alfred Horn's 1954 paper Doubly stochastic matrices and the diagonal of a rotation matrix (Amer. J. Math. 76, 620-630) [Horn 1954] proved the converse direction, completing the equivalence. Atiyah and Guillemin-Sternberg's theorem reveals the Schur-Horn statement as the special case of moment-map convexity for the maximal-torus action on a unitary coadjoint orbit.

Frances Kirwan's 1984 monograph Cohomology of Quotients of Symplectic and Algebraic Varieties [Kirwan 1984] generalised the convexity theorem to non-abelian compact group actions, establishing that the intersection of the moment image with a closed positive Weyl chamber is convex polyhedral. Kirwan's argument also yields a Morse theory for that has become the standard tool for computing cohomology of symplectic quotients. Michèle Audin's Topology of Torus Actions on Symplectic Manifolds (1991) [Audin 1991] gave the canonical textbook treatment of the abelian convexity result and its local-cone refinements. Thomas Delzant's 1988 thesis [Delzant 1988] carried the convexity theorem to its sharpest form in the toric case, classifying compact symplectic toric manifolds by their moment polytopes.

Bibliography [Master]

[object Promise]