05.04.03 · symplectic / moment-reduction

Atiyah-Guillemin-Sternberg convexity theorem

shipped3 tiersLean: none

Anchor (Master): Atiyah 1982 (originator); Guillemin-Sternberg 1982 (independent originator); Cannas da Silva §27; Audin §IV; Kirwan *Cohomology of Quotients* §3

Intuition [Beginner]

When a torus $T = T^{k}$ acts on a closed symplectic manifold $M$ in a way that respects the symplectic form, every choice of one-parameter subgroup of $T$ gives a function on $M$ whose flow is the corresponding rotation. Bundling these functions together gives the moment map $μ : M \to R^{k}$ . The Atiyah-Guillemin-Sternberg theorem describes the image $μ (M)$ as a remarkably rigid object: it is always a convex polytope, and that polytope is the convex hull of the images of the points fixed by the whole torus.

The simplest case is the rotation of the round sphere $S^{2}$ about a vertical axis. The height function is the moment map; its image is the interval $[- 1, 1]$ , and the two endpoints are the two poles — the only fixed points. Three-dimensional examples like $CP^{2}$ produce filled triangles, with the three vertices coming from three fixed points of the torus action.

The theorem is a bridge between symplectic geometry and the combinatorics of polytopes. Once you know the polytope, you know a great deal about the manifold; the Delzant classification later turns this into a full equivalence for toric manifolds.

Visual [Beginner]

The sphere $S^{2}$ rotating about a vertical axis, with horizontal arrows showing the orbits as latitude circles, and the moment-map image drawn beside it as the segment $[- 1, 1]$ with the north and south pole marked.

The picture you should keep is: the orbits of the torus collapse onto a flat polytope, and the corners of the polytope are exactly the fixed points.

Worked example [Beginner]

Take $M = CP^{2}$ with homogeneous coordinates $[z_{0} : z_{1} : z_{2}]$ and the action of the two-torus $T^{2}$ by $(θ_{1}, θ_{2}) \cdot [z_{0} : z_{1} : z_{2}] = [z_{0} : e^{i θ_{1}} z_{1} : e^{i θ_{2}} z_{2}]$ . The moment map is

μ ([z_{0} : z_{1} : z_{2}]) = \frac{1}{∣ z _{0} ∣ ^{2} + ∣ z _{1} ∣ ^{2} + ∣ z _{2} ∣ ^{2}} (∣ z_{1} ∣^{2}, ∣ z_{2} ∣^{2}) .

The fixed points of the $T^{2}$ -action are the three coordinate vertices: $[1 : 0 : 0]$ , $[0 : 1 : 0]$ , and $[0 : 0 : 1]$ . Their moment-map images are $(0, 0)$ , $(1, 0)$ , and $(0, 1)$ . Every other point of $CP^{2}$ has positive entries summing to at most $1$ , so the image $μ (CP^{2})$ is the closed triangle with these three corners.

What this tells us: a complicated four-real-dimensional manifold collapses, under the moment map, to a flat triangle whose corners are the only fixed points. The convexity theorem promises this picture for every torus action of this type.

Check your understanding [Beginner]

Formal definition [Intermediate+]

Let $(M, ω)$ be a closed connected symplectic manifold of dimension $2 n$ , and let $T = T^{k}$ be a torus acting on $M$ in a Hamiltonian fashion, with moment map $μ : M \to t^{*}$ , where $t = Lie (T) ≅ R^{k}$ and $t^{*} ≅ R^{k}$ is its linear dual. For each $ξ \in t$ the component of the moment map in direction $ξ$ is the smooth function

μ^{ξ} := ⟨ μ, ξ ⟩ : M \to R,

whose Hamiltonian flow is the one-parameter subgroup $exp (t ξ) \subset T$ acting on $M$ . The element $ξ$ is generic when the closure of the one-parameter subgroup it generates is all of $T$ ; equivalently, $ξ$ has rationally independent coordinates with respect to a basis of the integer lattice $Λ \subset t$ .

The fixed-point set of the $T$ -action is denoted $M^{T} \subset M$ . Each connected component is a closed symplectic submanifold of $M$ . The image $μ (M^{T}) \subset t^{*}$ is a finite set when $M^{T}$ has finitely many components.

A subset $P \subset t^{*}$ is a convex polytope when it is the convex hull of a finite set of points. Equivalently, $P$ is a bounded intersection of finitely many closed half-spaces.

Key theorem with proof [Intermediate+]

Theorem (Atiyah 1982; Guillemin-Sternberg 1982). Let $(M, ω)$ be a closed connected symplectic manifold with a Hamiltonian action of a torus $T = T^{k}$ and moment map $\mu : M \to \mathfrak{t}^$. Then:*

For every $c \in \mathfrak{t}^ $, t h e f ib r e$ \mu^{-1}(c)$ is connected (or empty).*
The image $μ (M)$ is a convex polytope.
That polytope is the convex hull of the images of the fixed-point components: $μ (M) = conv (μ (M^{T}))$ .

The proof rests on a Morse-theoretic study of the components $μ^{ξ}$ . The structure of the argument is the same in Atiyah's and in Guillemin-Sternberg's papers; the form below follows Atiyah's exposition.

Proof. The argument proceeds in five stages.

Stage 1 (Morse-Bott property). For any $ξ \in t$ , the function $μ^{ξ}$ is Morse-Bott, with critical set equal to the fixed-point set of the closed one-parameter subgroup $T_{ξ} = \overline{exp (R ξ)} \subseteq T$ . When $ξ$ is generic, $T_{ξ} = T$ , so the critical set of $μ^{ξ}$ is exactly $M^{T}$ . The Hamiltonian condition $d μ^{ξ} = ι_{X_{ξ}} ω$ identifies critical points of $μ^{ξ}$ with zeros of the fundamental vector field $X_{ξ}$ , and zeros of $X_{ξ}$ are exactly the fixed points of the flow of $ξ$ . The Hessian of $μ^{ξ}$ along the normal directions to $M^{T_{ξ}}$ is non-degenerate because $ω$ is non-degenerate and the linearised action is a faithful representation of $T_{ξ}$ on the normal bundle.

Stage 2 (index parity). Every Morse-Bott index and coindex of $μ^{ξ}$ is even. At a fixed point $p \in M^{T_{ξ}}$ , the tangent space $T_{p} M$ decomposes under the linearised $T_{ξ}$ -action into a direct sum $T_{p} M = T_{p} M^{T_{ξ}} \oplus ⨁_{α} V_{α}$ of weight spaces, where each $V_{α}$ is a real two-dimensional subspace on which $T_{ξ}$ acts by a non-zero weight $α : t_{ξ} \to R$ . The Hessian of $μ^{ξ}$ on $V_{α}$ is a multiple of the standard rotation-invariant symplectic form, with sign equal to $sign α (ξ)$ . Each weight space contributes a two-dimensional summand to either the negative or the positive eigenspace; both index and coindex are sums of these even-dimensional contributions.

Stage 3 (connectedness of level sets). Index parity implies that for every $ξ \in t$ and every $a \in R$ , the sublevel set $(μ^{ξ})^{- 1} ((- \infty, a])$ is connected, and so is the level set $(μ^{ξ})^{- 1} (a)$ . The argument is the standard Morse-Bott handle decomposition: as $a$ increases through a critical value $c$ of $μ^{ξ}$ , the sublevel set changes by attaching handles along a sphere bundle of dimension $index - 1$ . When the index is $0$ , the change adds a new connected component, but in our setting any such component is attached to the existing manifold along a coindex-direction sphere bundle of dimension $coindex - 1 \geq 1$ as soon as the coindex is at least $2$ — which is forced by index-plus-coindex equalling the codimension of the critical submanifold and both being even and the codimension being positive away from a global minimum. When the index is $\geq 2$ , the attaching sphere is connected, so a connected sublevel set remains connected. A symmetric argument applies to superlevel sets, so level sets are connected. Connectedness of $M$ is the base case of the induction on $a$ .

Stage 4 (convexity, by induction on rank). The image $μ (M)$ is convex. Induct on $k = dim T$ . The case $k = 0$ is empty. The case $k = 1$ : $μ^{ξ}$ for the generator $ξ$ is a connected real-valued function on connected $M$ , so its image is an interval — a convex polytope in $R$ . For the inductive step assume the statement for tori of rank $< k$ . Pick any two points $p_{0}, p_{1} \in M$ ; one needs to produce a path from $μ (p_{0})$ to $μ (p_{1})$ in $μ (M)$ that follows the straight line in $t^{*}$ . Choose a generic direction $ξ \in t$ and use connectedness of the level sets of $μ^{ξ}$ at every height between $μ^{ξ} (p_{0})$ and $μ^{ξ} (p_{1})$ : each such level set $L_{a} := (μ^{ξ})^{- 1} (a)$ is connected by Stage 3. Restrict $μ$ to $L_{a}$ ; this restricted moment map takes values in the affine hyperplane ${η \in t^{*} : ⟨ η, ξ ⟩ = a}$ , and after quotienting by the rank- $1$ subtorus $T_{ξ}$ — which acts as the identity on $L_{a} / T_{ξ}$ by construction — one obtains a Hamiltonian action of the rank- $(k - 1)$ quotient torus $T / T_{ξ}$ on the symplectic-reduced quotient $L_{a} / T_{ξ}$ with moment map taking values in this hyperplane. The inductive hypothesis applied to the reduced action gives convexity of the image inside the hyperplane. Sweeping $a$ from $μ^{ξ} (p_{0})$ to $μ^{ξ} (p_{1})$ produces the desired straight-line path in $μ (M)$ , establishing convexity.

Stage 5 (vertices and the polytope description). The image is a polytope, and its vertices are images of fixed-point components. A point $v \in μ (M)$ is an extreme point of the convex set $μ (M)$ iff every $μ^{ξ}$ attains its minimum or maximum on $μ^{- 1} (v)$ for some $ξ$ ; the critical-set characterisation of Stage 1 gives $μ^{- 1} (v) \cap M^{T} \neq = \emptyset$ at every extreme point. Conversely, the moment-map image of any fixed-point component is a single point in $t^{*}$ , and these finitely many points generate $μ (M)$ by convexity. Hence $μ (M) = conv (μ (M^{T}))$ , a convex polytope with vertex set contained in $μ (M^{T})$ . Connectedness of fibres in (1) follows from Stage 3 applied to a generic $ξ$ and the fact that two points in the same generic- $μ^{ξ}$ -level set lying in $μ^{- 1} (c)$ are connected through the reduced-space convexity of Stage 4. $□$

Bridge. The convexity theorem builds toward 05.04.04 (Delzant's theorem), where the polytope produced here appears again in the role of a complete invariant: when $dim T = \frac{1}{2} dim M$ and the action is effective, the Delzant polytope $μ (M) \subset t^{*}$ determines $(M, ω, T)$ up to equivariant symplectomorphism. The Morse-Bott index-parity argument is exactly the mechanism that lets reduced-space symplectic geometry 05.04.02 inherit a smooth structure from the ambient $M$ , because connectedness of $μ^{- 1} (c)$ is what makes the regular reduction $μ^{- 1} (c) / T$ a connected symplectic manifold rather than a disjoint union. Putting these together, the foundational reason convexity holds is exactly that the symplectic structure forces every Morse-Bott index to come in two-dimensional weight-space contributions — index parity is the bridge between the analytic non-degeneracy of $ω$ and the combinatorial flatness of $μ (M)$ . The theorem identifies the Hamiltonian-action structure on $M$ with a piece of polyhedral data on $t^{*}$ .

Exercises [Intermediate+]

Exercise 3 (medium, short-answer).

Show that the Schur-Horn theorem is a special case of Atiyah-Guillemin-Sternberg. Schur-Horn states: the diagonal entries of an $n \times n$ Hermitian matrix with prescribed eigenvalues $λ_{1} \geq \dots \geq λ_{n}$ form the convex hull of the orbit of $λ$ under the symmetric group $S_{n}$ acting by permutation.

Hint

The relevant manifold is the coadjoint orbit $O_{λ} \subset u (n)^{*}$ of the diagonal matrix with entries $λ_{i}$ , with the action of the maximal torus.

Answer

Identify $u (n)^{*}$ with the space of Hermitian matrices via the trace pairing. The coadjoint orbit $O_{λ}$ of $diag (λ_{1}, \dots, λ_{n})$ is the set of all Hermitian matrices with eigenvalue spectrum ${λ_{i}}$ , and it carries the Kirillov-Kostant-Souriau symplectic form. The maximal torus $T \subset U (n)$ of diagonal unitaries acts on $O_{λ}$ by conjugation; this action is Hamiltonian, with moment map $μ : O_{λ} \to t^{*} ≅ R^{n}$ given by reading off the diagonal entries of the Hermitian matrix. Fixed points of $T$ on $O_{λ}$ are exactly the $n!$ permutation matrices $diag (λ_{σ (1)}, \dots, λ_{σ (n)})$ , $σ \in S_{n}$ . Convexity gives $μ (O_{λ}) = conv ({(λ_{σ (1)}, \dots, λ_{σ (n)}) : σ \in S_{n}})$ , the permutohedron of $λ$ — exactly the Schur-Horn statement. Rubric: full credit for naming the coadjoint orbit, the diagonal moment map, and the permutation fixed points.

Exercise 4 (medium, short-answer).

Why is the index of $μ^{ξ}$ at a critical submanifold even? State the linear-algebra fact that forces this.

Hint

Use the weight-space decomposition of the linearised torus action.

Answer

At a fixed point $p$ , the linearised $T_{ξ}$ -action splits the normal bundle into real two-dimensional weight spaces $V_{α}$ where $T_{ξ}$ acts by rotation with weight $α$ . The Hessian of $μ^{ξ}$ on each $V_{α}$ is a multiple of the rotation-invariant symplectic form on $V_{α}$ , with sign $sign α (ξ)$ . Each weight space contributes its full two real dimensions to either the negative-definite or positive-definite part of the Hessian. The index — total negative-definite dimension — is therefore a sum of $2$ 's, hence even. Rubric: full credit for naming the weight-space decomposition and the rotation-form Hessian.

Exercise 5 (medium, short-answer).

Show that a Morse-Bott function with all even indices on a closed connected manifold has connected level sets.

Hint

Track the change in $π_{0}$ of sublevel sets across a critical value.

Answer

Let $f$ be the function. Sublevel set $f^{- 1} ((- \infty, a])$ changes by attaching, across each critical value $c$ , a disk-bundle neighbourhood of the corresponding critical submanifold $C_{c}$ of $f$ along a sphere bundle of dimension $coindex - 1$ . When the coindex is $\geq 2$ , the attaching sphere is connected and the new piece glues onto the existing sublevel without introducing a new component. When the coindex is $0$ , $C_{c}$ is a local minimum and we are creating a new component, but with codimension $0$ this can happen only at the global minimum, where a connected manifold contributes a connected starting set. Even-index forces coindex to be even; combined with the manifold being connected (so the global minimum is unique up to a connected critical submanifold) the sublevel set stays connected. Apply the same argument to superlevel sets via $- f$ . Level sets, intersections of sublevel and superlevel, inherit connectedness — this last step uses connectedness of the level-set fibration over a Morse-Bott chart, a standard tubular-neighbourhood argument. Rubric: full credit for the index-vs-coindex parity argument and the explicit handle-attaching step.

Exercise 7 (hard, short-answer).

Let $T^{2}$ act on $CP^{1} \times CP^{1}$ via $(θ_{1}, θ_{2}) \cdot ([z_{0} : z_{1}], [w_{0} : w_{1}]) = ([z_{0} : e^{i θ_{1}} z_{1}], [w_{0} : e^{i θ_{2}} w_{1}])$ . Compare the moment polytope of this action with the polytope of the previous exercise.

Hint

A standard normalisation of moment map for the $S^{1}$ -action on $CP^{1}$ has image $[0, 1]$ , not $[- 1, 1]$ .

Answer

With the Fubini-Study symplectic form normalised so the moment map of the standard $S^{1}$ -action on $CP^{1}$ has image $[0, 1]$ (with fixed points at $[1 : 0] \mapsto 0$ , $[0 : 1] \mapsto 1$ ), the polytope of the diagonal $T^{2}$ -action on $CP^{1} \times CP^{1}$ is the unit square $[0, 1] \times [0, 1]$ . The polytope of Exercise 6 is the centred square $[- 1, 1]^{2}$ — these differ by an affine reparameterisation that reflects the choice of round vs Fubini-Study area form on each factor. Both polytopes have the same combinatorial type — a four-vertex square — and the underlying symplectic toric structures are equivariantly symplectomorphic up to rescaling. Rubric: full credit for identifying both polytopes and explaining that they are affinely equivalent.

Lean formalization [Intermediate+]

Mathlib does not currently support Hamiltonian torus actions or moment maps in a form that admits the Atiyah-Guillemin-Sternberg theorem. The skeleton below sketches the statement at the type level; the proof would require the Morse-Bott apparatus described in lean_mathlib_gap.

[object Promise]

Advanced results [Master]

The convexity theorem belongs to a family of results that combine Hamiltonian symmetry with convex-geometric output. Several distinct generalisations exist, each retaining the Morse-Bott / level-set-connectedness skeleton.

Non-abelian convexity (Kirwan 1984). When the symmetry group is a compact connected Lie group $G$ rather than a torus, the moment map $μ : M \to g^{*}$ no longer has convex image directly; the image is $G$ -invariant. Kirwan's theorem states that the intersection $μ (M) \cap t_{+}^{*}$ with a closed positive Weyl chamber is a convex polytope. The proof reduces to the abelian case applied to the maximal torus, plus the fact that every $G$ -orbit in $g^{*}$ meets $t_{+}^{*}$ in exactly one point.

Symplectic implosion (Guillemin-Jeffrey-Sjamaar 2002). A construction that turns a Hamiltonian $G$ -manifold into a Hamiltonian $T$ -manifold whose moment polytope is exactly the Kirwan polytope. The implosion machinery realises Kirwan's theorem as an instance of Atiyah-Guillemin-Sternberg applied to a derived $T$ -manifold.

Singular convexity. Convexity persists when one drops compactness and replaces it by properness of the moment map: Lerman-Meinrenken-Tolman-Woodward 1998 and earlier Hilgert-Neeb-Plank 1994 established convexity of the image under properness assumptions.

Local convexity at fixed points. Atiyah's argument also yields a local statement: the image of $μ$ near any fixed point $p$ is the cone on the polytope of the linearised torus action on $T_{p} M$ , intersected with a translate of the fixed-point image $μ (p)$ . This is the seed of the local description used in Delzant's classification.

Morse theory of $∣ μ ∣^{2}$ . Kirwan's Cohomology of Quotients (1984) develops a Morse theory for the function $∣ μ ∣^{2} : M \to R$ , whose critical sets stratify $M$ in a way compatible with the moment-polytope facets. This stratification is the key technical input for computing cohomology of symplectic quotients $M / / G$ .

Vertex-edge structure. When $T$ acts effectively with $dim T = \frac{1}{2} dim M$ — the toric case — the moment polytope is a Delzant polytope: simple, rational, smooth at each vertex (in the sense that the edge directions at each vertex form a $Z$ -basis of the integer lattice). The edges of the polytope correspond to one-dimensional fixed-component spheres in $M$ , and their lengths are determined by the symplectic areas of those spheres.

Synthesis. The Atiyah-Guillemin-Sternberg theorem identifies a piece of symplectic geometry with a piece of polyhedral combinatorics, and the bridge is a single analytic fact: components of the moment map are Morse-Bott functions whose indices are constrained to be even by the weight-space decomposition of the symplectic structure. Read as a structural statement, the theorem generalises the elementary fact that a connected real-valued Morse function on a connected manifold has an interval as its image — index parity is what allows iterating to higher rank without breaking connectedness. Read in the opposite direction, the theorem is dual to a representation-theoretic fact: the multiplicity of a weight $α \in Λ^{*}$ in a quantisation of $(M, ω, μ)$ is supported on the lattice points of the polytope $μ (M)$ , and the polytope is therefore the support of the spectrum of a quantum operator.

Putting these together, the foundational reason that fixed points control the global image is exactly Stage 2 of the proof: the symplectic non-degeneracy forces the second-order behaviour of $μ^{ξ}$ at any fixed point to be a sum of two-dimensional rotation Hessians, and this single linear-algebra ingredient is the bridge between the analytic moment map and the combinatorial polytope. The theorem appears again in Delzant's classification, where the polytope is upgraded from invariant to complete invariant — putting these together one sees that every compact symplectic toric manifold is exactly its polytope, in the precise categorical sense that the polytope-to-manifold construction is an equivalence of groupoids.

Full proof set [Master]

Lemma (weight-space decomposition). Let $T_{ξ}$ be a closed one-parameter subgroup of $T$ acting on a symplectic vector space $(V, ω_{V})$ by symplectomorphisms, and let $V^{T_{ξ}} \subseteq V$ be the fixed subspace. Then the symplectic complement $W = (V^{T_{ξ}})^{ω_{V}}$ admits an orthogonal direct-sum decomposition $W = ⨁_{α} W_{α}$ into real two-dimensional symplectic subspaces, indexed by a finite set of non-zero weights $α : t_{ξ} \to R$ , such that $T_{ξ}$ acts on $W_{α}$ by the rotation of weight $α$ .

Proof. The action of $T_{ξ} ≅ S^{1}$ on $V$ commutes with $ω_{V}$ , so the complexification $V \otimes_{R} C$ decomposes into character spaces $V_{α}^{C}$ on which $T_{ξ}$ acts by the character $θ \mapsto e^{i α θ}$ . Pairs $(V_{α}^{C}, V_{- α}^{C})$ for $α \neq = 0$ assemble into real two-dimensional summands $W_{α} \subset V$ ; the symplectic form pairs each $W_{α}$ with itself non-degenerately because $T_{ξ}$ preserves $ω_{V}$ . The fixed subspace $V^{T_{ξ}}$ is the $α = 0$ summand and is automatically symplectic. $□$

Lemma (Hessian sign on weight spaces). In the setting of the previous lemma, the Hessian of $μ^{ξ}$ at the fixed point, restricted to the weight space $W_{α}$ , equals $sign α (ξ) \cdot Q_{α}$ , where $Q_{α}$ is the positive-definite rotation-invariant inner product on $W_{α}$ induced by the symplectic structure and a choice of $T_{ξ}$ -invariant complex structure.

Proof. On $W_{α}$ choose a $T_{ξ}$ -invariant complex structure $J$ with $ω_{V} (\cdot, J \cdot) > 0$ . Then $T_{ξ}$ acts on $W_{α} ≅ C$ by $θ \cdot z = e^{i α θ} z$ , the moment map for this circle action is $μ^{ξ} = - \frac{α ( ξ )}{2} ∣ z ∣^{2}$ (the sign convention from $d μ^{ξ} = ι_{X_{ξ}} ω$ ). The Hessian at $z = 0$ is $- α (ξ) \cdot ∣ \cdot ∣^{2}$ , with sign $- sign α (ξ)$ relative to the positive-definite $∣ \cdot ∣^{2}$ . Either sign of $α (ξ)$ gives a definite quadratic form on $W_{α}$ , contributing $2$ to either index or coindex. $□$

Lemma (connectedness of level sets). Let $f : M \to R$ be Morse-Bott on a closed connected manifold $M$ with all indices and coindices even. Then for every $a \in R$ the level set $f^{- 1} (a)$ is empty or connected, and so is every sublevel set.

Proof. Connectedness of sublevel sets is by induction on critical values. Below the minimum, the sublevel set is empty. Just above the minimum, the sublevel set retracts onto the connected critical submanifold (assuming connected $M$ has a connected global-minimum critical set; if there are multiple minima, repeat the argument component by component, but on $M$ connected the global minimum must be a single connected submanifold by Morse-Bott + connectedness of $M$ , since otherwise a path between two minima would have to cross a critical level of higher index whose attaching coindex sphere is connected, making the two pieces equal in $π_{0}$ ). At a critical value $c$ the sublevel set changes by attaching the disk bundle of the negative normal bundle of the critical submanifold $C_{c}$ , glued along the sphere bundle of dimension $index (C_{c}) - 1$ . Even index $\geq 2$ makes this attaching sphere bundle have connected fibres, hence the gluing keeps connectedness. Index $0$ would create a new component, but on $M$ connected and Morse-Bott, this only happens at the global minimum (already handled). Symmetric argument for superlevel sets via $- f$ . The level set $f^{- 1} (a)$ is a deformation retract of a thin shell between sublevel and superlevel, both of which are connected; so $f^{- 1} (a)$ is connected. $□$

Theorem (full statement, restated and proved). Let $(M, ω)$ be closed connected symplectic with a Hamiltonian $T = T^{k}$ -action and moment map $μ$ . Then $μ^{- 1} (c)$ is connected (or empty) for all $c \in \mathfrak{t}^ $, an d$ \mu(M) = \mathrm{conv}\big(\mu(M^T)\big)$ is a convex polytope.*

Proof. Stages 1-5 of the proof in the Intermediate section, made rigorous by the three lemmas above. Stage 1 uses the Hamiltonian condition $d μ^{ξ} = ι_{X_{ξ}} ω$ and non-degeneracy of $ω$ to identify critical points of $μ^{ξ}$ with zeros of the fundamental vector field, hence with fixed points of the closure $T_{ξ}$ of $exp (R ξ)$ . The Morse-Bott property along $M^{T_{ξ}}$ comes from the weight-space lemma applied to the linearised action on the normal bundle. Stage 2 is the Hessian-sign lemma: every weight-space contribution is even-dimensional, so total index and coindex are even. Stage 3 is the connectedness-of-level-sets lemma applied to $μ^{ξ}$ for any $ξ$ .

Stage 4 (convexity) inducts on $k$ . The base case $k = 1$ : image is an interval. Inductive step: pick generic $ξ$ , restrict to a level set $L_{a} = (μ^{ξ})^{- 1} (a)$ , observe that $L_{a}$ inherits a Hamiltonian action of the rank- $(k - 1)$ quotient torus $T / T_{ξ}$ via the symplectic reduction $L_{a} / T_{ξ}$ (regular reduction at the level $a$ of $μ^{ξ}$ gives a smooth symplectic manifold $L_{a} / T_{ξ}$ when $a$ is a regular value of $μ^{ξ}$ , which it is for almost every $a$ in the image; the restricted moment map is the projection of $μ ∣_{L_{a}}$ to the hyperplane). The inductive hypothesis gives convexity of the image inside the hyperplane. Sweeping $a$ from $μ^{ξ} (p_{0})$ to $μ^{ξ} (p_{1})$ produces the straight-line path. Stage 5 (vertex characterisation) follows because every extreme point of the image is the unique minimum or maximum of some $μ^{ξ}$ , which is a fixed-point image. Connectedness of fibres uses Stage 3 plus the inductive convexity. $□$

Connections [Master]

Moment map 05.04.01. Atiyah-Guillemin-Sternberg is the central global theorem about the moment map; the unit's three-condition definition is what enables Stage 1 of the proof, and the Hamiltonian sign convention used here is the same as that unit's.
Symplectic reduction 05.04.02. The inductive step in Stage 4 of the proof uses regular Marsden-Weinstein-Meyer reduction at non-critical levels of $μ^{ξ}$ to descend to a torus action of one rank lower. Index parity in Stage 2 also explains why regular reduction levels have connected fibres — a fact used implicitly in the smooth-quotient construction.
Delzant's theorem 05.04.04. The downstream classification result: in the toric case ( $dim T = n = \frac{1}{2} dim M$ ), the moment polytope produced by Atiyah-Guillemin-Sternberg is upgraded to a complete invariant, classifying compact symplectic toric manifolds up to equivariant symplectomorphism. The vertices and edge structure of the polytope encode the global manifold.
Coadjoint orbit 05.03.01. The Schur-Horn theorem is the convexity theorem applied to a coadjoint orbit of $U (n)$ with the maximal-torus action — giving the permutohedron as moment polytope and the diagonal-of-Hermitian-matrix as moment map. This is the historical bridge between the convexity theorem and 1920s linear algebra.
Symplectic manifold 05.01.02. The convexity statement crucially uses non-degeneracy of $ω$ — both in the Morse-Bott property of $μ^{ξ}$ at fixed points and in the weight-space sign argument. A degenerate two-form would not force index parity, and the conclusion would fail.
Hamiltonian vector field 05.02.01. The components $μ^{ξ}$ are the Hamiltonians whose flow is the corresponding one-parameter subgroup of $T$ ; the proof's analytic backbone is precisely the unit's identity $d μ^{ξ} = ι_{X_{ξ}} ω$ .

Historical & philosophical context [Master]

Michael Atiyah's 1982 paper Convexity and commuting Hamiltonians (Bull. London Math. Soc. 14, 1-15) ^{[Atiyah 1982]} and Victor Guillemin and Shlomo Sternberg's Convexity properties of the moment mapping (Invent. Math. 67, 491-513) ^{[Guillemin-Sternberg 1982]} appeared independently and in the same year. Both proofs use the Morse-theoretic structure of moment-map components, with Atiyah's exposition emphasising the index-parity step and Guillemin-Sternberg's developing the level-set-connectedness machinery in greater generality. The two papers are joint originators of the theorem.

The result has a substantial linear-algebra prehistory. Issai Schur's 1923 paper Über eine Klasse von Mittelbildungen mit Anwendungen auf die Determinantentheorie (Sitzungsber. Berlin. Math. Ges. 22) ^{[Schur 1923]} established that the diagonal entries of a Hermitian matrix with prescribed eigenvalues lie in the convex hull of the permutations of the eigenvalues — the " $\subseteq$ " direction of what is now called the Schur-Horn theorem. Alfred Horn's 1954 paper Doubly stochastic matrices and the diagonal of a rotation matrix (Amer. J. Math. 76, 620-630) ^{[Horn 1954]} proved the converse direction, completing the equivalence. Atiyah and Guillemin-Sternberg's theorem reveals the Schur-Horn statement as the special case of moment-map convexity for the maximal-torus action on a unitary coadjoint orbit.

Frances Kirwan's 1984 monograph Cohomology of Quotients of Symplectic and Algebraic Varieties ^{[Kirwan 1984]} generalised the convexity theorem to non-abelian compact group actions, establishing that the intersection of the moment image with a closed positive Weyl chamber is convex polyhedral. Kirwan's argument also yields a Morse theory for $∣ μ ∣^{2}$ that has become the standard tool for computing cohomology of symplectic quotients. Michèle Audin's Topology of Torus Actions on Symplectic Manifolds (1991) ^{[Audin 1991]} gave the canonical textbook treatment of the abelian convexity result and its local-cone refinements. Thomas Delzant's 1988 thesis ^{[Delzant 1988]} carried the convexity theorem to its sharpest form in the toric case, classifying compact symplectic toric manifolds by their moment polytopes.

Bibliography [Master]

[object Promise]

Prerequisites

05.04.01
05.01.02

Used in

05.04.04

Tier anchors

beginner: Cannas da Silva *Lectures on Symplectic Geometry* §27 (informal)
intermediate: Cannas da Silva §27; Audin *Topology of Torus Actions on Symplectic Manifolds* §IV
master: Atiyah 1982 (originator); Guillemin-Sternberg 1982 (independent originator); Cannas da Silva §27; Audin §IV; Kirwan *Cohomology of Quotients* §3

References

TODO_REF
Atiyah 1982 — Convexity and commuting Hamiltonians · Bull. London Math. Soc. 14, originator paper for the convexity theorem
TODO_REF
Guillemin-Sternberg 1982 — Convexity properties of the moment mapping · Invent. Math. 67, independent originator paper
TODO_REF
Cannas da Silva — Lectures on Symplectic Geometry · §27 the convexity theorem
TODO_REF
Audin — Topology of Torus Actions on Symplectic Manifolds · §IV moment map and convexity
TODO_REF
Kirwan — Cohomology of Quotients · §3 stratifications and Morse theory of the moment map

Reviewer

TBD

Estimated time

beginner: 16m
intermediate: 45m
master: 80m