Marsden J.E. - Lectures On Mechanics (1992) PDF
Marsden J.E. - Lectures On Mechanics (1992) PDF
Marsden J.E. - Lectures On Mechanics (1992) PDF
Second Edition
Jerrold E. Marsden
Preface iv
1 Introduction 1
1.1 The Classical Water Molecule and the Ozone Molecule . . . . . . . . 1
1.2 Lagrangian and Hamiltonian Formulation . . . . . . . . . . . . . . . 3
1.3 The Rigid Body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Geometry, Symmetry and Reduction . . . . . . . . . . . . . . . . . . 11
1.5 Stability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.6 Geometric Phases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.7 The Rotation Group and the Poincaré Sphere . . . . . . . . . . . . . 23
i
ii
4 Relative Equilibria 76
4.1 Relative Equilibria on Symplectic Manifolds . . . . . . . . . . . . . . 76
4.2 Cotangent Relative Equilibria . . . . . . . . . . . . . . . . . . . . . . 78
4.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
4.4 The Rigid Body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
References 204
Index 223
Preface
I venture to hope that my lectures may interest engineers, physicists, and as-
tronomers as well as mathematicians. If one may accuse mathematicians as a
class of ignoring the mathematical problems of the modern physics and astron-
omy, one may, with no less justice perhaps, accuse physicists and astronomers
of ignoring departments of the pure mathematics which have reached a high
degree of development and are fitted to render valuable service to physics and
astronomy. It is the great need of the present in mathematical science that
the pure science and those departments of physical science in which it finds
its most important applications should again be brought into the intimate
association which proved so fruitful in the work of Lagrange and Gauss. Felix
Klein, 1896
These lectures cover a selection of topics from recent developments in the ge-
ometric approach to mechanics and its applications. In particular, we emphasize
methods based on symmetry, especially the action of Lie groups, both continuous
and discrete, and their associated Noether conserved quantities veiwed in the geo-
metric context of momentum maps. In this setting, relative equilibria, the analogue
of fixed points for systems without symmetry are especially interesting. In general,
relative equilibria are dynamic orbits that are also group orbits. For the rotation
group SO(3), these are uniformly rotating states or, in other words, dynamical
motions in steady rotation.
Some of the main points to be treated are as follows:
• The stability of relative equilibria analyzed using the method of separation of
internal and rotational modes, also referred to as the block diagonalization or
normal form technique.
• Geometric phases, including the phases of Berry and Hannay, are studied
using the technique of reduction and reconstruction.
• Mechanical integrators, such as numerical schemes that exactly preserve the
symplectic structure, energy, or the momentum map.
iv
Preface v
Shmuel Weissman, Steve Wiggins, and Brett Zombro. The work of others is cited
at appropriate points in the text.
I would like to especially thank David Chillingworth for organizing the LMS
lecture series in Southampton, April 15–19, 1991 that acted as a major stimulus for
preparing the written version of these notes. I would like to also thank the Mathe-
matical Sciences Research Institute and especially Alan Weinstein and Tudor Ratiu
at Berkeley for arranging a preliminary set of lectures along these lines in April,
1989, and Francis Clarke at the Centre de Recherches Mathématique in Montréal
for his hospitality during the Aisenstadt lectures in the fall of 1989. Thanks are
also due to Phil Holmes and John Guckenheimer at Cornell, the Mathematical
Sciences Institute, and to David Sattinger and Peter Olver at the University of
Minnesota, and the Institute for Mathematics and its Applications, where several
of these talks were given in various forms. I also thank the Humboldt Stiftung of
Germany, Jürgen Scheurle and Klaus Kirchgässner who provided the opportunity
and resources needed to put the lectures to paper during a pleasant and fruitful
stay in Hamburg and Blankenese during the first half of 1991. I also acknowledge a
variety of research support from NSF and DOE that helped make the work possible.
I thank several participants of the lecture series and other colleagues for their useful
comments and corrections. I especially thank Hans Peter Kruse, Oliver O’Reilly,
Rick Wicklin, Brett Zombro and Florence Lin in this respect.
Very special thanks go to Barbara for typesetting the lectures and for her sup-
port in so many ways. Thomas the Cat also deserves thanks for his help with our
understanding of 180◦ cat manouvers. This work was not responsible for his unfor-
tunate fall from the roof (resulting in a broken paw), but his feat did prove that
cats can execute 90◦ attitude control as well.
Chapter 1
Introduction
This chapter gives an overview of some of the topics that will be covered so the reader
can get a coherent picture of the types of problems and associated mathematical
structures that will be developed.1
1
1. Introduction 2
m
m
z
r1 r2 M
• How do vibrations affect overall rotations? Can one use them to control overall
rotations? To stabilize otherwise unstable motions?
• Can one separate symmetric (the two hydrogen atoms moving as mirror im-
ages) and non-symmetric vibrations using a discrete symmetry?
• Does a deeper understanding of the classical mechanics of the water molecule
help with the corresponding quantum problem?
It is interesting that despite the old age of classical mechanics, new and deep
insights are coming to light by combining the rich heritage of knowledge already well
founded by masters like Newton, Euler, Lagrange, Jacobi, Laplace, Riemann and
Poincaré, with the newer techniques of geometry and qualitative analysis of people
like Arnold and Smale. I hope that already the classical water molecule and related
systems will convey some of the spirit of modern research in geometric mechanics.
The water molecule is in fact too hard an example to carry out in as much detail
as one would like, although it illustrates some of the general theory quite nicely. A
simpler example for which one can get more detailed information (about relative
equilibria and their bifurcations, for example) is the double spherical pendulum.
Here, instead of the symmetry group being the full (non-abelian) rotation group
SO(3), it is the (abelian) group S 1 of rotations about the axis of gravity. The
double pendulum will also be used as a thread through the lectures. The results for
this example are drawn from Marsden and Scheurle [1993]. To make similar progress
with the water molecule, one would have to deal with the already complex issue of
finding a reasonable model for the interatomic potential. There is a large literature
on this going back to Darling and Dennison [1940] and Sorbie and Murrell [1975].
For some of the recent work that might be important for the present approach, and
for more references, see Xiao and Kellman [1989] and Li, Xiao and Kellman [1990].
1. Introduction 3
The special case of the ozone molecule with its three equal masses is also of
great interest, not only for environmental reasons, but because this molecule has
more symmetry than the water molecule. In fact, what we learn about the water
molecule can be used to study the ozone molecule by putting m = M . A big change
that has very interesting consequences is the fact that the discrete symmetry group
is enlarged from “reflections” Z2 to the “symmetry group of a triangle” D3 . This
situation is also of interest in chemistry for things like molecular control by using
laser beams to control the potential in which the molecule finds itself. Some believe
that, together with ideas from semiclassical quantum mechanics, the study of this
system as a classical system provides useful information. We refer to Pierce, Dahleh
and Rabitz [1988], Tannor [1989] and Tannor and Jin [1991] for more information
and literature leads.
called the principle of critical action, in which the variation is over all curves
with two fixed endpoints and with a fixed time interval [a, b]. Curiously, Lagrange
knew the more sophisticated principle of least action, but not the proof of the
equivalence of (1.2.1) and (1.2.2), which is simple and is as follows. Let q(t, ²) be a
family of curves with q(t) = q(t, 0) and let the variation be defined by
¯
d ¯
δq(t) = q(t, ²)¯¯ . (1.2.3)
d² ²=0
where we have integrated the second term by parts and have used δq i (a) = δq i (b) =
0. Since δq i (t) is arbitrary except for the boundary conditions, the equivalence of
(1.2.1) and (1.2.2) becomes evident.
The collection of pairs (q, q̇) may be thought of as elements of the tangent
bundle T Q of configuration space Q. We also call T Q the velocity phase space.
One of the great achievements of Lagrange was to realize that (1.2.1) and (1.2.2)
make intrinsic (coordinate independent) sense; today we would say that Lagrangian
mechanics can be formulated on manifolds. For mechanical systems like the rigid
body, coupled structures etc., it is essential that Q be taken to be a manifold and
not just Euclidean space.
If we perform the Legendre transform, that is, change variables to the cotangent
bundle T ∗ Q by
∂L
pi = i
∂ q̇
(assuming this is an invertible change of variables) and let the Hamiltonian be
defined by
H(q i , pi ) = pi q̇ i − L(q i , q̇ i ) (1.2.4)
(summation on repeated indices understood), then the Euler-Lagrange equations
become Hamilton’s equations
∂H ∂H
q̇ i = ; ṗi = − ; i = 1, . . . , n. (1.2.5)
∂pi ∂qi
The symmetry in these equations leads to a rich geometric structure.
Ω = A−1 ω, (1.3.3)
so that Ω is the angular velocity as seen in a body fixed frame. The kinetic energy
1. Introduction 6
Theorem 1.3.1 The curve A(t) ∈ SO(3) satisfies the Euler-Lagrange equations
for Z
1
L(A, Ȧ) = ρ(X)kȦXk2 d3 X (1.3.6)
2 B
if and only if Ω(t) defined by A−1 Ȧv = Ω × v for all v ∈ R3 satisfies Euler’s
equations:
IΩ̇ = IΩ × Ω. (1.3.7)
Moreover, this equation is equivalent to conservation of the spatial angular momen-
tum:
d
π=0 (1.3.8)
dt
where π = AIΩ.
Σ̂ = A−1 δA (1.3.11)
1. Introduction 7
Σ̂v = Σ × v. (1.3.12)
i.e.,
c = Σ̂˙ + [Ω̂, Σ̂].
δΩ (1.3.14)
Now one checks the identity
[Ω̂, Σ̂] = (Ω × Σ)ˆ (1.3.15)
by using Jacobi’s identity for the cross product. Thus, (1.3.13) gives
δΩ = Σ̇ + Ω × Σ. (1.3.16)
on R3 where the variations δΩ are of the form (1.3.15) with Σ(a) = Σ(b) = 0.
To complete the proof of Theorem 1.3.1, it suffices to work out the equations
equivalent to the reduced variational principle (1.3.17). Since l(Ω) = 12 hIΩ, Ωi, and
I is symmetric, we get
Z b Z b
δ l dt = hIΩ, δΩidt
a a
Z b
= hIΩ, Σ̇ + Ω × Σidt
a
·¿
Z b À ¸
d
= − IΩ, Σ + hIΩ, Ω × Σi
a dt
Z b¿ À
d
= − IΩ + IΩ × Ω, Σ dt
a dt
1. Introduction 8
where we have integrated by parts and used the boundary conditions Σ(b) = Σ(a) =
0. Since Σ is otherwise arbitrary, (1.3.18) is equivalent to
d
− (IΩ) + IΩ × Ω = 0,
dt
which are Euler’s equations.
As we shall see in Chapter 2, this calculation is a special case of a procedure valid
for any Lie group and, as such, leads to the Euler-Poincaré equations; (Poincaré
[1901a]).
The body angular momentum is defined, analogous to linear momentum
p = mv, as
Π = IΩ
so that in principal axes,
Π = (Π1 , Π2 , Π3 ) = (I1 Ω1 , I2 Ω2 , I3 Ω3 ).
As we have seen, the equations of motion for the rigid body are the Euler-
Lagrange equations for the Lagrangian L equal to the kinetic energy, but regarded
as a function on T SO(3) or equivalently, Hamilton’s equations with the Hamiltonian
equal to the kinetic energy, but regarded as a function on the cotangent bundle of
SO(3). In terms of the Euler angles and their conjugate momenta, these are the
canonical Hamilton equations, but as such they are a rather complicated set of six
ordinary differential equations.
Assuming that no external moments act on the body, the spatial angular mo-
mentum vector π = AΠ is conserved in time. As we shall recall in Chapter 2, this
follows by general considerations of symmetry, but it can also be checked directly
from Euler’s equations:
dπ
= ȦIΩ + A(IΩ × Ω) = A(A−1 ȦIΩ + IΩ × Ω)
dt
= A(Ω × IΩ + IΩ × Ω) = 0.
of motion) for the system is given by the magnitude of the total angular momentum
vector: kΠk2 = Π21 + Π21 + Π21 . This follows from conservation of π and the fact that
kπk = kΠk or can be verified directly from the Euler equations by computing the
time derivative of kΠk2 and observing that the sum of the coefficients in (1.3.19) is
zero.
Because of conservation of kΠk, the evolution in time of any initial point Π(0)
is constrained to the sphere kΠk2 = kΠ(0)k2 = constant. Thus we may view the
Euler equations as describing a two dimensional dynamical system on an invariant
sphere. This sphere is the reduced phase space for the rigid body equations. In
fact, this defines a two dimensional system as a Hamiltonian dynamical system on
the two-sphere S 2 . The Hamiltonian structure is not obvious from Euler’s equations
because the description in terms of the body angular momentum is inherently non-
canonical. As we shall see in §1.4 and in more detail in Chapter 4, the theory
of Hamiltonian systems may be generalized to include Euler’s formulation. The
Hamiltonian for the reduced system is
µ ¶
1 −1 1 Π21 Π22 Π23
H = hΠ, I Πi = + + (1.3.20)
2 2 I1 I2 I3
and we shall show how this function allows us to recover Euler’s equations (1.3.19).
Since solutions curves are confined to the level sets of H (which are in general ellip-
soids) as well as to the invariant spheres kΠk = constant, the intersection of these
surfaces are precisely the trajectories of the rigid body, as shown in Figure 1.3.1.
On the reduced phase space, dynamical fixed points are called relative equilib-
ria. These equilibria correspond to periodic orbits in the unreduced phase space,
specifically to steady rotations about a principal inertial axis. The locations and sta-
bility types of the relative equilibria for the rigid body are clear from Figure 1.3.1.
The four points located at the intersections of the invariant sphere with the Π1
and Π2 axes correspond to pure rotational motions of the body about its major
and minor principal axes. These motions are stable, whereas the other two rela-
tive equilibria corresponding to rotations about the intermediate principal axis are
unstable.
In Chapters 4 and 5 we shall see how the stability analysis for a large class of more
complicated systems can be simplified through a careful choice of non-canonical co-
ordinates. We managed to visualize the trajectories of the rigid body without doing
any calculations, but this is because the rigid body is an especially simple system.
Problems like the rotating water molecule will prove to be more challenging. Not
only is the rigid body problem integrable (one can write down the solution in terms
of integrals), but the problem reduces in some sense to a two dimensional manifold
and allows questions about trajectories to be phrased in terms of level sets of in-
tegrals. Many Hamiltonian systems are not integrable and trajectories are chaotic
and are often studied numerically. The fact that we were able to reduce the number
of dimensions in the problem (from twelve to two) and the fact that this reduction
was accomplished by appealing to the non-canonical coordinates Ω or Π turns out
to be a general feature for Hamiltonian systems with symmetry. The reduction
procedure may be applied to non-integrable or chaotic systems, just as well as to
1. Introduction 10
Π3
Π2
Π1
Figure 1.3.1: Phase portrait for the rigid body. The magnitude of the angular
momentum vector determines a sphere. The intersection of the sphere with the
ellipsoids of constant Hamiltonian gives the trajectories of the rigid body.
interval, and the computational cost for such a procedure could be prohibitive for
long time integrations. This situation is worse still for bodies with internal degrees
of freedom like our water molecule, robots, and large-scale space structures. Such
examples point out the need to go beyond canonical formulations.
Once H is specified, the chain rule shows that the statement “Ḟ = {F, H} for all
smooth functions F ” is equivalent to Hamilton’s equations. In fact, it tells how any
function F evolves along the flow.
This representation of the canonical equations of motion suggests a generaliza-
tion of the bracket notation to cover non-canonical formulations. As an example,
consider Euler’s equations. Define the following non-canonical rigid body bracket
of two smooth functions F and K on the angular momentum space:
where {F, K} and the gradients of F and K are evaluated at the point Π =
(Π1 , Π2 , Π3 ). The notation in (1.4.3) is that of the standard scalar triple prod-
uct operation in R3 . If H is the rigid body Hamiltonian (see (1.3.18)) and F is,
in turn, allowed to be each of the three coordinate functions Πi , then the formula
Ḟ = {F, H} yields the three Euler equations.
The non-canonical bracket corresponding to the reduced free rigid body problem
is an example of what is known as a Lie-Poisson bracket. In Chapter 2 we
shall see how to generalize this to any Lie algebra. Other bracket operations have
been developed to handle a wide variety of Hamiltonian problems in non-canonical
1. Introduction 12
These two conditions are generalized to arbitrary mechanical systems with sym-
metry in the general reduction theory of Meyer [1973] and Marsden and Weinstein
[1974], which was inspired by the seminal works of Arnold [1966] and Smale [1970].
In this theory, one begins with a given phase space that we denote by P . We assume
there is a group G of symmetry transformations of P that transform P to itself by
canonical transformation. Generalizing 2, we use the symmetry group to generate
a vector-valued conserved quantity denoted J and called the momentum map.
Analogous to the set where the total angular momentum has a given value, we
consider the set of all phase space points where J has a given value µ; i.e., the
µ-level set for J. The analogue of the two dimensional body angular momentum
sphere in Figure 1.3.1 is the reduced phase space, denoted Pµ that is constructed
as follows:
1. Introduction 13
Pµ is the µ-level set for J on which any two points that can be trans-
formed one to the other by a group transformation are identified.
1.5 Stability
There is a standard procedure for determining the stability of equilibria of an ordi-
nary differential equation
ẋ = f (x) (1.5.1)
where x = (x1 , . . . , xn ) and f is smooth. Equilibria are points xe such that f (xe ) =
0; i.e., points that are fixed in time under the dynamics. By stability of the fixed
1. Introduction 14
point xe we mean that any solution to ẋ = f (x) that starts near xe remains close
to xe for all future time. A traditional method of ascertaining the stability of xe is
to examine the first variation equation
ξ˙ = Df (xe )ξ (1.5.2)
∂H ∂H
(qe , pe ) = (qe , pe ) = 0.
∂q ∂p
Hence the fixed point occurs at a critical point of the Hamiltonian.
Lagrange-Dirichlet Criterion If the 2n × 2n matrix δ 2 H of second
partial derivatives, (the second variation) is either positive or negative
definite at (qe , pe ) then it is a stable fixed point.
The proof is very simple. Consider the positive definite case. Since H has a non-
degenerate minimum at ze = (qe , pe ), Taylor’s theorem with remainder shows that
its level sets near ze are bounded inside and outside by spheres of arbitrarily small
radius. Since energy is conserved, solutions stay on level surfaces of H, so a solution
starting near the minimum has to stay near the minimum.
For a Hamiltonian of the form kinetic plus potential V , critical points occur
when pe = 0 and qe is a critical point of the potential of V . The Lagrange-Dirichlet
Criterion then reduces to asking for a non-degenerate minimum of V .
In fact, this criterion was used in one of the classical problems of the 19th
century: the problem of rotating gravitating fluid masses. This problem was studied
by Newton, MacLaurin, Jacobi, Riemann, Poincaré and others. The motivation for
its study was in the conjectured birth of two planets by the splitting of a large mass
of solidifying rotating fluid. Riemann [1860], Routh [1877] and Poincaré [1885, 1892,
1901] were major contributors to the study of this type of phenomenon and used the
potential energy and angular momentum to deduce the stability and bifurcation.
1. Introduction 15
The Lagrange-Dirichlet method was adapted by Arnold [1966, 1969] into what
has become known as the energy-Casimir method . Arnold analyzed the stability
of stationary flows of perfect fluids and arrived at an explicit stability criterion when
the configuration space Q for the Hamiltonian of this system is the symmetry group
G of the mechanical system.
A Casimir function C is one that Poisson commutes with any function F
defined on the phase space of the Hamiltonian system, i.e.,
{C, F } = 0. (1.5.4)
Large classes of Casimirs can occur when the reduction procedure is performed,
resulting in systems with non-canonical Poisson brackets. For example, in the case
of the rigid body discussed previously, if Φ is a function of one variable and µ is the
angular momentum vector in the inertial coordinate system, then
C(µ) = Φ(kµk2 ) (1.5.5)
is readily checked to be a Casimir for the rigid body bracket (1.3.3).
Energy-Casimir method Choose C such that H + C has a critical
point at an equilibrium ze and compute the second variation δ 2 (H +
C)(ze ). If this matrix is positive or negative definite, then the equilib-
rium ze is stable.
When the phase space is obtained by reduction, the equilibrium ze is called a
relative equilibrium of the original Hamiltonian system.
The energy-Casimir method has been applied to a variety of problems including
problems in fluids and plasmas (Holm et al. [1985]) and rigid bodies with flexible
attachments (Krishnaprasad and Marsden [1987]). If applicable, the energy-Casimir
method may permit an explicit determination of the stability of the relative equi-
libria. It is important to remember, however, that these techniques give stability
information only. As such one cannot use them to infer instability without further
investigation.
The energy-Casimir method is restricted to certain types of systems, since its
implementation relies on an abundant supply of Casimir functions. In some impor-
tant examples, such as the dynamics of geometrically exact flexible rods, Casimirs
have not been found and may not even exist. A method developed to overcome this
difficulty is known as the energy momentum method , which is closely linked
to the method of reduction. It uses conserved quantities, namely the energy and
momentum map, that are readily available, rather than Casimirs.
The energy momentum method (Marsden, Simo, Lewis and Posbergh [1989],
Simo, Posbergh and Marsden [1990, 1991], Simo, Lewis and Marsden [1991], and
Lewis and Simo [1990]) involves the augmented Hamiltonian defined by
Hξ (q, p) = H(q, p) − ξ · J(q, p) (1.5.6)
where J is the momentum map described in the previous section and ξ may be
thought of as a Lagrange multiplier. For the water molecule, J is the angular mo-
mentum and ξ is the angular velocity of the relative equilibrium. One sets the first
1. Introduction 16
The space VRIG (rigid variations) is generated by the symmetry group, and
VINT are the internal or shape variations. In addition, the whole matrix δ 2 Hξ
block diagonalizes in a very efficient manner as we will see in Chapter 5. This often
allows the stability conditions associated with δ 2 Vµ | V × V to be recast in terms of
a standard eigenvalue problem for the second variation of the amended potential.
1. Introduction 17
This splitting i.e., block diagonalization, has more miracles associated with it.
In fact,
This result has several interesting implications. In the case of pseudo-rigid bodies
(Lewis and Simo [1990]), it reduces the stability problem from an unwieldy 14 × 14
matrix to a relatively simple 3 × 3 subblock on the diagonal. The block diagonaliza-
tion procedure enabled Lewis and Simo to solve their problem analytically, whereas
without it, a substantial numerical computation would have been necessary.
As we shall see in Chapter 8, the presense of discrete symmetries (as for the
water molecule and the pseudo-rigid bodies) gives further, or refined, subblocking
properties in the second variation of δ 2 Hξ and δ 2 Vµ and the symplectic form.
In general, this diagonalization explicitly separates the rotational and internal
modes, a result which is important not only in rotating and elastic fluid systems,
but also in molecular dynamics and robotics. Similar simplifications are expected in
the analysis of other problems to be tackled using the energy momentum method.
cut and
unroll cone
parallel translate
frame along a
line of latitude
Figure 1.6.1: The parallel transport of a coordinate frame along a curved surface.
of wire on which is placed a small bead. The bead is set in motion and allowed
to slide along the wire at a constant speed (see Figure 1.6.2). (We will need the
notation in this figure only later in Chapter 6.) Clearly the bead will return to its
initial position after, say, τ seconds, and will continue to return every τ seconds
after that. Allow the bead to make many revolutions along the circuit, but for a
fixed amount of total time, say T .
q'(s)
α
q(s)
s
Figure 1.6.2: A bead sliding on a planar, non-circular hoop of area A and length L.
The bead slides around the hoop at constant speed with period τ and is allowed to
revolve for time T .
Suppose that the wire hoop is slowly rotated in its plane by 360 degrees while
the bead is in motion for exactly the same total length of time T . At the end of the
rotation, the bead is not in the location where we might expect it, but instead will
be found at a shifted position that is determined by the shape of the hoop. In fact,
the shift in position depends only on the length of the hoop, L, and on the area it
encloses, A. The shift is given by 8π 2 A/L2 as an angle, or by 4πA/L as length. (See
§6.6 for a derivation of these formulas.) To be completely concrete, if the bead’s
1. Introduction 19
initial position is marked with a tick and if the time of rotation is a multiple of the
bead’s period, then at the end of rotation the bead is found approximately 4πA/L
units from its initial position. This is shown in Figure 1.6.3. Note that if the hoop
is circular then the angular shift is 2π.
k
Rθ
Rθq(s)
Rθq'(s)
Figure 1.6.3: The hoop is slowly rotated in the plane through 360 degrees. After
one rotation, the bead is located 4πA/L units behind where it would have been,
had the rotation not occurred.
2ET
∆θ = −Λ + . (1.6.1)
kµk
Here Λ is the solid angle subtended by the closed curve Π(t) on the sphere S 2 and
is oriented according to the right hand rule. The approximate phase formula ∆θ ∼ =
8π 2 A/L2 for the ball in the hoop is derived by the classical techniques of averaging
and the variation of constants formula. However, Formula (1.6.1) is exact. (In
Whittaker [1959], (1.6.1) is expressed as a complicated quotient of theta functions!)
An interesting feature of (1.6.1) is the manner in which ∆θ is split into two
parts. The term Λ is purely geometric and so is called the geometric phase. It
does not depend on the energy of the system or the period of motion, but rather on
the fraction of the surface area of the sphere Pµ that is enclosed by the trajectory
1. Introduction 20
Π(t). The second term in (1.6.1) is known as the dynamic phase and depends
explicitly on the system’s energy and the period of the reduced trajectory.
Geometrically we can picture the rigid body as tracing out a path in its phase
space. More precisely, conservation of angular momentum implies that the path lies
in the submanifold consisting of all points that are mapped onto µ by the reduction
process. As Figure 1.2.1 shows, almost every trajectory on the reduced space is
periodic, but this does not imply that the original path was periodic, as is shown in
Figure 1.6.4. The difference between the true trajectory and a periodic trajectory
is given by the holonomy plus the dynamic phase.
true trajectory
πµ
reduced trajectory
D
PPµµ
Figure 1.6.4: Holonomy for the rigid body. As the body completes one period in
the reduced phase space Pµ , the body’s true configuration does not return to its
original value. The phase difference is equal to the influence of a dynamic phase
which takes into account the body’s energy, and a geometric phase which depends
only on the area of Pµ enclosed by the reduced trajectory.
analysis from another point of view, see Ashbaugh, Chicone and Cushman [1991].
Figure 1.6.5: A book tossed in the air about an axis that is close to the middle
(unstable) axis experiences a holonomy of approximately 180 degrees about its long
axis when caught after one revolution.
There are other everyday occurrences that demonstrate holonomy. For example,
a falling cat usually manages to land upright if released upside down from complete
rest, that is, with total angular momentum zero. This ability has motivated several
investigations in physiology as well as dynamics and more recently has been analyzed
by Montgomery [1990] with an emphasis on how the cat (or, more generally, a
deformable body) can efficiently readjust its orientation by changing its shape.
By “efficiently”, we mean that the reorientation minimizes some function — for
example the total energy expended. In other words, one has a problem in optimal
control . Montgomery’s results characterize the deformations that allow a cat to
reorient itself without violating conservation of angular momentum. In his analysis,
Montgomery casts the falling cat problem into the language of principal bundles.
Let the shape of a cat refer to the location of the cat’s body parts relative to each
other, but without regard to the cat’s orientation in space. Let the configuration of
a cat refer both to the cat’s shape and to its orientation with respect to some fixed
reference frame. More precisely, if Q is the configuration space and G is the group
of rigid motions, then Q/G is the shape space.
If the cat is completely rigid then it will always have the same shape, but we
can give it a different configuration by rotating it through, say 180 degrees about
some axis. If we require that the cat has the same shape at the end of its fall as it
had at the beginning, then the cat problem may be formulated as follows: Given
an initial configuration, what is the most efficient way for a cat to achieve a desired
final configuration if the final shape is required to be the same as the initial shape?
1. Introduction 22
If we think of the cat as tracing out some path in configuration space during its
fall, the projection of this path onto the shape space results in a trajectory in the
shape space, and the requirement that the cat’s initial and final shapes are the same
means that the trajectory is a closed loop. Furthermore, if we want to know the
most efficient configuration path that satisfies the initial and final conditions, then
we want to find the shortest path with respect to a metric induced by the function
we wish to minimize. It turns out that the solution of a falling cat problem is
closely related to Wong’s equations that describe the motion of a colored particle in
a Yang-Mills field (Montgomery [1990], Shapere and Wilczek [1989]). We will come
back to these points in Chapter 7.
The examples above indicate that holonomic occurrences are not rare. In fact,
Shapere and Wilczek showed that aquatic microorganisms use holonomy as a form
of propulsion. Because these organisms are so small, the environment in which they
live is extremely viscous to them. The apparent viscosity is so great, in fact, that
they are unable to swim by conventional stroking motions, just as a person trapped
in a tar pit would be unable to swim to safety. These microorganisms surmount their
locomotion difficulties, however, by moving their “tails” or changing their shapes
in a topologically nontrivial way that induces a holonomy and allows them to move
forward through their environment. There are probably many consequences and
applications of “holonomy drive” that remain to be discovered.
Yang and Krishnaprasad [1990] have provided an example of holonomy drive for
coupled rigid bodies linked together with pivot joints as shown in Figure 1.6.5. (For
simplicity, the bodies are represented as rigid rods.) This form of linkage permits
the rods to freely rotate with respect of each other, and we assume that the system
is not subjected to external forces or torques although torques will exist in the joints
as the assemblage rotates. By our assumptions, angular momentum is conserved in
this system. Yet, even if the total angular momentum is zero, a turn of the crank
(as indicated in Figure 1.6.6) returns the system to its initial shape but creates a
holonomy that rotates the system’s configuration. See Thurston and Weeks [1984]
for some relationships between linkages and the theory of 3-manifolds (they do not
study dynamics, however). Brockett [1987, 1989a] studies the use of holonomy in
micromotors.
Holonomy also comes up in the field of magnetic resonance imaging (MRI) and
spectroscopy. Berry’s work shows that if a quantum system experiences a slow (adi-
abatic) cyclic change, then there will be a shift in the phase of the system’s wave
function. This is a quantum analogue to the bead on a hoop problem discussed
above. This work has been verified by several independent experiments; the im-
plications of this result to MRI and spectroscopy are still being investigated. For
a review of the applications of the geometric phase to the fields of spectroscopy,
field theory, and solid-state physics, see Zwanziger, Koenig and Pines [1990] and
the bibliography therein.
Another possible application of holonomy drive is to the somersaulting robot.
Due to the finite precision response of motors and actuators, a slight error in the
robot’s initial angular momentum can result in an unsatisfactory landing as the
robot attempts a flip. Yet, in spite of the challenges, Hodgings and Raibert [1990]
report that the robot can execute 90 percent of the flips successfully. Montgomery,
1. Introduction 23
crank
Figure 1.6.6: Rigid rods linked by pivot joints. As the “crank” traces out the path
shown, the assemblage experiences a holonomy resulting in a clockwise shift in its
configuration.
Raibert and Li [1990] are asking whether a robot can use holonomy to improve this
rate of success. To do this, they reformulate the falling cat problem as a problem
in feedback control: the cat must use information gained by its senses in order to
determine how to twist and turn its body so that it successfully lands on its feet.
It is possible that the same technique used by cats can be implemented in a robot
that also wants to complete a flip in mid-air. Imagine a robot installed with sensors
so that as it begins its somersault it measures its momenta (linear and angular) and
quickly calculates its final landing position. If the calculated final configuration
is different from the intended final configuration, then the robot waves mechanical
arms and legs while entirely in the air to create a holonomy that equals the difference
between the two configurations.
If “holonomy drive” can be used to control a mechanical structure, then there
may be implications for future satellites like a space telescope. Suppose the tele-
scope initially has zero angular momentum (with respect to its orbital frame), and
suppose it needs to be turned 180 degrees. One way to do this is to fire a small jet
that would give it angular momentum, then, when the turn is nearly complete, to
fire a second jet that acts as a brake to exactly cancel the aquired angular momen-
tum. As in the somersaulting robot, however, errors are bound to occur, and the
process of returning the telescope to (approximately) zero angular momentum may
be a long process. It would seem to be more desirable to turn it while constantly
preserving zero angular momentum. The falling cat performs this very trick. A
telescope can mimic this with internal momentum wheels or with flexible joints.
This brings us to the area of control theory and its relation to the present ideas.
Bloch, Krishnaprasad, Marsden and Sánchez de Alvarez [1991] represents one step
in this direction. We shall also discuss their result in Chapter 7.
should try to understand it a little more deeply. As a first try, one can contemplate
Euler’s theorem, which states that every rotation in R3 is a rotation through some
angle about some axis. While true, this can be misleading if not used with care. For
example, it suggests that we can identify the set of rotations with the set consisting
of all unit vectors in R3 (the axes) and numbers between 0 and 2π (the angles);
that is, with the set S 2 × S 1 . However, this is false for reasons that involve some
basic topology. Thus, a better approach is needed.
One method for gaining deeper insight is to realize SO(3) as SU (2)/Z2 (where
SU (2) is the group of 2 × 2 complex unitary matrices of determinant 1), using
quaternions and Pauli spin matrices; see Abraham and Marsden [1978], p. 273–
4, for the precise statement. This approach also shows that the group SU (2) is
diffeomorphic to the set of unit quaternions, the three sphere S 3 in R4 .
,
The Hopf fibration is the map of S 3 to S 2 defined, using the above approach
to the rotation group, as follows. First map a point w ∈ S 3 to its equivalence class
A = [w] ∈ SO(3) and then map this point to Ak, where k is the standard unit
vector in R3 (or any other fixed unit vector in R3 ).
Closely related to the above description of the rotation group, but in fact a little
,
more straightforward, is Poincaré’s representation of SO(3) as the unit circle bundle
T1 S 2 of the two sphere S 2 . This comes about as follows. Elements of SO(3) can be
identified with oriented orthonormal frames in R3 ; i.e., with triples of orthonormal
vectors (n, m, n × m). Such triples are in one to one correspondence with the points
,
in T1 S 2 by (n, m, n×m) ↔ (n, m) where n is the base point in S 2 and m is regarded
as a vector tangent to S 2 at the point n (see Figure 1.7.1).
m
m
n×m
Figure 1.7.1: The rotation group is diffeomorphic to the unit tangent bundle to the
two sphere.
an axis and an angle, despite Euler’s theorem). This is because of the (topological)
fact that every vector field on S 2 must vanish somewhere. Thus, we see that this is
closely related to the nontriviality of the Hopf fibration.
Not only does Poincaré’s representation show that SO(3) is topologically non-
trivial, but this representation is useful in mechanics. In fact, Poincare’s represen-
tation of SO(3) as the unit circle bundle T1 S 2 is the stepping stone in the formu-
lation of optimal singularity free parameterizations for geometrically exact plates
and shells. These ideas have been exploited within a numerical analysis context in
Simo, Fox and Rifai [1990] for statics and [1991] for dynamics. Also, this represen-
tation is helpful in studying the problem of reorienting a rigid body using internal
momentum wheels. We shall treat some aspects of these topics in the course of the
lectures.
Chapter 2
A Crash Course in
Geometric Mechanics
We now set out some of the notation and terminology used in subsequent chapters.
The reader is referred to one of the standard books, such as Abraham and Marsden
[1978], Arnold [1989], Guillemin and Sternberg [1984] and Marsden and Ratiu [1992]
for proofs omitted here.1
26
2. A Crash Course in Geometric Mechanics 27
Proposition 2.2.1 The following hold for Hamiltonian systems on Poisson man-
ifolds:
The first part of this proposition is true even if H is a time dependent Hamilto-
nian, while the second part is true only when H is independent of time.
Θ = pi dq i ; (2.3.1)
(See Marsden [1981] and Olver [1988] for a discussion of the infinite dimensional
case.)
Hamilton’s equations in these canonical coordinates have the classical form
∂H
q̇ i =
∂pi
∂H
ṗi = − i
∂q
as one can readily check.
The local structure of Poisson manifolds is more complex than the symplectic
case. However, Kirillov [1976a] has shown that every Poisson manifold is the union
of symplectic leaves; to compute the bracket of two functions in P , one does it
leaf-wise. In other words, to calculate the bracket of f and g at z ∈ P , select the
symplectic leaf Sz through z, and evaluate the bracket of f |Sz and g|Sz at z. We
shall see a specific case of this picture shortly.
Definition 2.4.1 Hamilton’s principle singles out particular curves q(t) by the
condition Z a
δ L(q(t), q̇(t))dt = 0, (2.4.1)
b
where the variation is over smooth curves in Q with fixed endpoints.
canonical one form, pi dq i . This turns out to be a useful remark in more complex
field theories.
If one prefers, the action principle states that the map I defined by I(q(·)) =
Rb
a
L(q(t), q̇(t))dt from the space of curves with prescribed endpoints in Q to R has
a critical point at the curve in question. In any case, a basic and elementary result
of the calculus of variations, whose proof was sketched in §1.2, is:
Proposition 2.4.2 The principle of critical action for a curve q(t) is equivalent to
the condition that q(t) satisfies the Euler-Lagrange equations
d ∂L ∂L
− i = 0. (2.4.2)
dt ∂ q̇ i ∂q
Definition 2.4.3 Let L be a Lagrangian on T Q and let FL : T Q → T ∗ Q be defined
(in coordinates) by
(q i , q̇ j ) 7→ (q i , pj )
where pj = ∂L/∂ q̇ j . We call FL the fiber derivative. (Intrinsically, FL differen-
tiates L in the fiber direction.)
A Lagrangian L is called hyperregular if FL is a diffeomorphism. If L is a
hyperregular Lagrangian, we define the corresponding Hamiltonian by
H(q i , pj ) = pi q̇ i − L.
One checks that the Euler-Lagrange equations for L are equivalent to Hamilton’s
equations for H.
In a relativistic context one finds that the two conditions pj = ∂L/∂ q̇ j and H =
pi q̇ − L, defining the Legendre transform, fit together as the spatial and temporal
i
components of a single object. Suffice it to say that the formalism developed here
is useful in the context of relativistic fields.
Proposition 2.5.1 The dual space g∗ is a Poisson manifold with either of the two
brackets ¿ · ¸À
δf δk
{f, k}± (µ) = ± µ, , . (2.5.1)
δµ δµ
Here g is identified with g∗∗ in the sense that δf /δµ ∈ g is defined by hν, δf /δµi =
Df (µ) · ν for ν ∈ g∗ , where D denotes the derivative. (In the infinite dimensional
case one needs to worry about the existence of δf /δµ; in this context, methods
2. A Crash Course in Geometric Mechanics 31
like the Hahn-Banach theorem are not always appropriate!) The notation δf /δµ is
used to conform to the functional derivative notation in classical field theory. In
coordinates, (ξ 1 , . . . , ξ m ) on g and corresponding dual coordinates (µ1 , . . . , µm ) on
g∗ , the Lie-Poisson bracket (2.5.1) is
∂f ∂k
{f, k}± (µ) = ±µa Cbc
a
; (2.5.2)
∂µb ∂µc
a c
here Cbc are the structure constants of g defined by [ea , eb ] = Cab ec , where
(e1 , . . . , em ) is the coordinate basis of g and where, for ξ ∈ g, we write ξ = ξ a ea ,
and for µ ∈ g∗ , µ = µa ea , where (e1 , . . . , em ) is the dual basis. Formula (2.5.2)
appears explicitly in Lie [1890], §75.
Which sign to take in (2.5.2) is determined by understanding Lie-Poisson
reduction, which can be summarized as follows. Let
and
For the rigid body one chooses the minus sign in the Lie-Poisson bracket. This is
because the rigid body Lagrangian (and hence Hamiltonian) is left invariant and so
its dynamics pushes to g∗ by the map λ in (2.5.3).
Starting with the kinetic energy Hamiltonian derived in Chapter 1, we directly
obtain the formula H(Π) = 12 Π · (I−1 Π), the kinetic energy of the rigid body. One
verifies from the chain rule and properties of the triple product that:
2. A Crash Course in Geometric Mechanics 32
Proposition 2.5.2 Euler’s equations are equivalent to the following equation for
all f ∈ F(R3 ):
f˙ = {f, H}. (2.5.7)
dµ
= ad∗δh/δµ µ (2.5.10)
dt
where adξ : g → g is defined by adξ η = [ξ, η] and ad∗ξ is its dual, i.e.,
d δh
µ̇a = Cba µd (2.5.11)
δµb
2. A Crash Course in Geometric Mechanics 33
We now make some remarks about the proof. First of all, the equivalence of
i and ii is general for any cotangent bundle, as we have already noted. Next, the
equivalence of ii and iv follows directly from the fact that λ is a Poisson map (as
we have mentioned, this follows from the fact that λ is a momentum map; see
Proposition 2.7.6 below) and H = h ◦ λ. Finally, we establish the equivalence of iii
and iv. Indeed, f˙ = {f, h}− means
¿ À ¿ · ¸À
δf δf δh
µ̇, = − µ, ,
δµ δµ δµ
¿ À
δf
= µ, adδh/δµ
δµ
¿ À
∗ δf
= adδh/δµ µ, .
δµ
Let us discuss the main ideas of the proof. First of all, the equivalence of i and
ii holds on the tangent bundle of any configuration manifold Q, as we have seen,
secondly, ii and iv are equivalent. To see this, one needs to compute the variations
δξ induced on ξ = g −1 ġ = T Lg−1 ġ by a variation of g. To calculate this, we need to
differentiate g −1 ġ in the direction of a variation δg. If δg = dg/d² at ² = 0, where
g is extended to a curve g² , then,
µ ¶¯
d −1 d
¯
δξ = g g ¯¯
d² dt ²=0
The difference δξ − η̇ is the commutator, [ξ, η]. This argument is fine for matrix
groups, but takes a little more work to make precise for general Lie groups. See
Bloch, Krishnaprasad, Ratiu and Marsden [1994b] for the general case. Thus, ii
and iv are equivalent.
To complete the proof, we show the equivalence of iii and iv. Indeed, using the
definitions and integrating by parts,
Z Z
δl
δ l(ξ)dt = δξ dt
δξ
Z
δl
= (η̇ + adξ η)dt
δξ
Z · µ ¶ ¸
d δl ∗ δl
= − + adξ η dt
dt δξ δξ
Generalizing what we saw directly in the rigid body, one can check directly from
the Euler- Poincaré equations that conservation of spatial angular momentum holds:
d
π=0 (2.6.5)
dt
where π is defined by
∂l
π = Ad∗g . (2.6.6)
∂ξ
2. A Crash Course in Geometric Mechanics 35
and therefore
Ja = pi K ia (q). (2.7.3)
Recall that by differentiating the conjugation operation h 7→ ghg −1 at the iden-
tity, one gets the adjoint action of G on g. Taking its dual produces the coadjoint
action of G on g∗ .
Proposition 2.7.4 The momentum map for cotangent lifted actions is equivari-
ant, i.e., the diagram in Figure 2.7.1 commutes.
J -
T ∗Q g∗
G-action coadjoint
on T ∗ Q action
? ?
T ∗Q - g∗
J
The above development concerns momentum maps using the Hamiltonian point
of view. However, one can also consider them from the Lagrangian point of view.
In this context, we consider a Lie group G acting on a configuration manifold Q and
lift this action to the tangent bundle T Q using the tangent operation. Given a G-
invariant Lagrangian L : T Q → R, the corresponding momentum map is obtained
by replacing the momentum pq in (2.7.1) with the fiber derivative FL(vq ). Thus,
J : T Q → g∗ is given by
or, in coordinates,
∂L i
Ja = K , (2.7.5)
∂ q̇ i a
where the action coefficients Kai are defined as before by writing ξQ (q i ) = Kai ξ a ∂/∂q i .
2. A Crash Course in Geometric Mechanics 37
Now we consider the variation q(t, s) = exp(φ(t, s)ξ) · q(t). The corresponding
infinitesimal variation is given by
Since φ0 is arbitrary, except for endpoint conditions, it follows that the integrand
vanishes, and so the time derivative of the momentum map is zero and so the
proposition is proved. ¥
For Poisson reduction we start with a Poisson manifold P and let the Lie
group G act on P by Poisson maps. Assuming P/G is a smooth manifold, endow
it with the unique Poisson structure on such that the canonical projection π : P →
P/G is a Poisson map. We can specify the Poisson structure on P/G explicitly as
follows. For f and k : P/G → R, let F = f ◦ π and K = k ◦ π, so F and K are f
and k thought of as G-invariant functions on P . Then {f, k}P/G is defined by
To show that {f, k}P/G is well defined, one has to prove that {F, K}P is G-invariant.
This follows from the fact that F and K are G-invariant and the group action of G
on P consists of Poisson maps.
For P = T ∗ G we get a very important special case.
For symplectic reduction we begin with a symplectic manifold (P, Ω). Let G
be a Lie group acting by symplectic maps on P ; in this case the action is called a
symplectic action. Let J be an equivariant momentum map for this action and
H a G-invariant Hamiltonian on P . Let Gµ = {g ∈ G | g · µ = µ} be the isotropy
subgroup (symmetry subgroup) at µ ∈ g∗ . As a consequence of equivariance, Gµ
leaves J−1 (µ) invariant. Assume for simplicity that µ is a regular value of J, so that
J−1 (µ) is a smooth manifold (see §2.8 below) and that Gµ acts freely and properly
on J−1 (µ), so that J−1 (µ)/Gµ =: Pµ is a smooth manifold. Let iµ : J−1 (µ) → P
denote the inclusion map and let πµ : J−1 (µ) → Pµ denote the projection. Note
that
dim Pµ = dim P − dim G − dim Gµ . (2.8.2)
Building on classical work of Jacobi, Liouville, Arnold and Smale, we have the
following basic result of Marsden and Weinstein [1974] (see also (Meyer [1973]).
Theorem 2.8.2 (Reduction Theorem) There is a unique symplectic structure
Ωµ on Pµ satisfying
i∗µ Ω = πµ∗ Ωµ . (2.8.3)
P
6 6
iµ
reduction J−1 (µ) reconstruction
πµ
? ?
Pµ
Figure 2.8.1: Reduction to Pµ and reconstruction back to P .
of G. The set of orbits consists of spheres and a single point. The reduction process
confirms that all orbits are symplectic manifolds. One calculates that the symplectic
structure on the spheres is a multiple of the area element. ¨
The orbit reduction theorem of Marle [1976] and Kazhdan, Kostant and
Sternberg [1978] states that Pµ may be alternatively constructed as
/Gµ /G /G
? ? ?
Pµ ∼
= PO ⊂ P/G
Kirillov has shown that every Poisson manifold P is the union of symplec-
tic leaves, although the preceding construction explicitly realizes these symplectic
leaves in this case by the reduction construction. A special case is the foliation of
the dual g∗ of any Lie algebra g into its symplectic leaves, namely the coadjoint
orbits. For example SO(3) is the union of spheres plus the origin, each of which is
a symplectic manifold. Notice that the drop in dimension from T ∗ SO(3) to O is
from 6 to 2, a drop of 4, as in general SO(3) reduction. An exception is the singular
point, the origin, where the drop in dimension is larger. We turn to these singular
points next.
Proof The point z is regular when the range of the linear map DJ(z) is all of g∗ .
However, ζ ∈ g is orthogonal to the range (in the sense of the g, g∗ pairing) if and
only if for all v ∈ Tz P ,
hζ, DJ(z) · vi = 0
2. A Crash Course in Geometric Mechanics 41
i.e.,
dhJ, ζi(z) · v = 0
or
Ω(XhJ,ζi (z), v) = 0
or
Ω(ζP (z), v) = 0.
As Ω is nondegenerate, ζ is orthogonal to the range iff ζP (z) = 0. ¥
The above proposition is due to Smale [1970]. It is the starting point of a large
literature on singularities in the momentum map and singular reduction. Arms,
Marsden and Moncrief [1981] show, under some reasonable hypotheses, that the
level sets J−1 (0) have quadratic singularities. As we shall see in the next chapter,
there is a general shifting construction that enables one to effectively reduce J−1 (µ)
to the case J−1 (0). In the finite dimensional case, this result can be deduced from
the equivariant Darboux theorem, but in the infinite dimensional case, things are
much more subtle. In fact, the infinite dimensional results were motivated by,
and apply to, the singularities in the solution space of relativistic field theories
such as gravity and the Yang-Mills equations (see Fischer, Marsden and Moncrief
[1980], Arms, Marsden and Moncrief [1981, 1982] and Arms [1981]). The convexity
theorem states that the image of the momentum map of a torus action is a convex
polyhedron in g∗ ; the boundary of the polyhedron is the image of the singular
(symmetric) points in P ; the more symmetric the point, the more singular the
boundary point. These results are due to Atiyah [1982] and Guillemin and Sternberg
[1984] based on earlier convexity results of Kostant and the Shur-Horne theorem on
eigenvalues of symmetric matrices. The literature on these topics and its relation to
other areas of mathematics is vast. See, for example, Goldman and Millison [1990],
Sjamaar [1990], Bloch, Flaschka and Ratiu [1990], Sjamaar and Lerman [1991] and
Lu and Ratiu [1991].
B = Bx dy ∧ dz − By dx ∧ dz + Bz dx ∧ dy.
Thinking of B as a magnetic field, the equations of motion for a particle with charge
e and mass m are given by the Lorentz force law :
dv e
m = v×B (2.10.1)
dt c
2. A Crash Course in Geometric Mechanics 42
where v = (ẋ, ẏ, ż). On R3 × R3 i.e., on (x, v)-space, consider the symplectic form
e
ΩB = m(dx ∧ dẋ + dy ∧ dẏ + dz ∧ dż) − B. (2.10.2)
c
For the Hamiltonian, take the kinetic energy:
m 2
H= (ẋ + ẏ 2 + ż 2 ) (2.10.3)
2
writing XH (u, v, w) = (u, v, w, (u̇, v̇, ẇ)), the condition defining XH , namely iXH ΩB =
dH is
which is equivalent to u = ẋ, v = ẏ, w = ż, mu̇ = e(Bz v − By w)/c, mv̇ = e(Bx w −
Bz u)/c, and mẇ = e(By u − Bx v)/c, i.e., to
e
mẍ = (Bz ẏ − By ż)
c
e
mÿ = (Bx ż − Bz ẋ) (2.10.5)
c
e
mz̈ = (By ẋ − Bx ẏ)
c
which is the same as (2.10.1). Thus the equations of motion for a particle in a
magnetic field are Hamiltonian, with energy equal to the kinetic energy and with the
symplectic form ΩB .
If B = dA; i.e., B = −∇ × A, where A is a one-form and A is the associ-
ated vector field, then the map (x, v) 7→ (x, p) where p = mv + eA/c pulls back
the canonical form to ΩB , as is easily checked. Thus, Equations (2.10.1) are also
Hamiltonian relative to the canonical bracket on (x, p)-space with the Hamiltonian
1 e
HA = kp − Ak2 . (2.10.6)
2m c
Even in Euclidean space, not every magnetic field can be written as B = ∇ × A.
For example, the field of a magnetic monopole of strength g 6= 0, namely
r
B(r) = g (2.10.7)
krk3
cannot be written this way since the flux of B through the unit sphere is 4πg, yet
Stokes’ theorem applied to the two hemispheres would give zero. Thus, one might
think that the Hamiltonian formulation involving only B (i.e., using ΩB and H) is
preferable. However, one can recover the magnetic potential A by regarding A as
a connection on a nontrivial bundle over R3 \{0}. The bundle over the sphere S 2 is
in fact the same Hopf fibration S 3 → S 2 that we encountered in §1.6. This same
2. A Crash Course in Geometric Mechanics 43
construction can be carried out using reduction. For a readable account of some
aspects of this situation, see Yang [1980]. For an interesting example of Weinstein
in which this monopole comes up, see Marsden [1981], p. 34.
When one studies the motion of a colored (rather than a charged) particle in a
Yang-Mills field, one finds a beautiful generalization of this construction and related
ideas using the theory of principal bundles; see Sternberg [1977], Weinstein [1978]
and Montgomery [1985]. In the study of centrifugal and Coriolis forces one discovers
some structures analogous to those here (see Marsden and Ratiu [1994] for more
information). We shall return to particles in Yang-Mills fields in our discussion of
optimal control in Chapter 7.
Chapter 3
44
3 Tangent and Cotangent Bundle Reduction 45
I ∂H ∂H
XH = B IJ or ż I = B IJ (z) (z). (3.1.5)
∂z J ∂z J
Equation (3.1.1) reads
1 ij
H(q, p) = g pi pj + V (q) (3.1.6)
2
and (3.1.2) reads
hp, vi = pi v i , (3.1.7)
while (3.1.3) reads
Ja (q, p) = pi K ia (q) (3.1.8)
where the action tensor K ia is defined, as in Chapter 2, by
Example 1 The spherical pendulum Here Q = S 2 , the sphere on which the bob
moves, the metric is the standard one, the potential is the gravitational potential
and G = S 1 acts on S 2 by rotations about the vertical axis. The momentum map
is simply the angular momentum about the z-axis. We will work this out in more
detail below. ¨
Example 2 The double spherical pendulum Here the configuration space is
Q = S 2 × S 2 , the group is S 1 acting by simultaneous rotation about the z-axis and
again the momentum map is the total angular momentum about the z-axis. Again,
we will see more detail below. ¨
Example 3 Coupled rigid bodies Here we have two rigid bodies in R3 coupled
by a ball in socket joint. We choose Q = R3 × SO(3) × SO(3) describing the
joint position and the attitude of each of the rigid bodies relative to a reference
configuration B. The Hamiltonian is the kinetic energy, which defines a metric on
Q. Here G = SE(3) which acts on the left in the obvious way by transforming
the positions of the particles xi = Ai X + w, i = 1, 2 by Euclidean motions. The
momentum map is the total linear and angular momentum. If the bodies have
additional material symmetry, G is correspondingly enlarged; cf . Patrick [1989].
See Figure 3.1.1. ¨
z
w
y
X A2X + w
x
B
2
Figure 3.1.1: The configuration space for the dynamics of two coupled rigid bodies.
refer to Marsden and Weinstein [1982, 1983] for more information and corresponding
ideas for plasma physics, and to Marsden and Hughes [1983] and Simo, Marsden
and Krishnaprasad [1988] for elasticity. ¨
The examples we will focus on in these lectures are the spherical pendula and
the classical water molecule. Let us pause to give some details for the later example.
a · (R, r1 , r2 , P, p1 , p2 ) = (R + a, r1 + a, r2 + a, P, p1 , p2 )
j(R, r1 , r2 , P, p1 , p2 ) = P + p1 + p2 . (3.2.3)
m
r
m
s
and
j = m(ṙ1 + ṙ2 ) + M Ṙ,
which give
1
ṙ1 = Ṙ − ṡ − ṙ
2
1
ṙ2 = Ṙ − ṡ + ṙ
2
1
Ṙ = (j + 2mṡ)
M
where M = M + 2m is the total mass. Substituting into (3.2.1) and simplifying
gives
kjk2 m
L= + kṙk2 + M mkṡk2 − V (3.2.5)
2M 4
where m = m/M and where one expresses the potential V in terms of r and s (if
rotationally invariant, it depends on krk, ksk and r · s alone).
A crucial feature of the above choice is that the Riemannian metric (mass ma-
trix) associated to L is diagonal as (3.2.5) shows. The conjugate momenta are
∂L m 1
π= = ṙ = (p2 − p1 ) (3.2.6)
∂ ṙ 2 2
and, with M = M/M,
∂L
σ = = 2M mṡ = 2mP − M (p1 + p2 )
∂ ṡ
= P − M j = 2mj − p1 − p2 . (3.2.7)
3 Tangent and Cotangent Bundle Reduction 49
We next check that this is consistent with Hamiltonian reduction. First, note
that the symplectic form before reduction,
Ω = dr ∧ dπ + ds ∧ dσ + dr̄ ∧ dj (3.2.8)
Remark There are other choices of canonical coordinates for the system after
reduction by translations, but they might not be as convenient as the above Jacobi-
Bertrand-Haretu coordinates. For example, if one uses the apparently more “demo-
cratic” choice
s1 = r1 − R and s2 = r2 − R,
then their conjugate momenta are
π1 = p1 − mj and π2 = p2 − mj.
Next we turn to the action of the rotation group, SO(3) and the role of the
discrete group that “swaps” the two masses m.
Our phase space is Z = T ∗ (R3 × R3 ) parametrized by (r, s, π, σ) with the canon-
ical symplectic structure. The group G = SO(3) acts on Z by the cotangent lift of
rotations on R3 × R3 . The corresponding infinitesimal action on Q is
ξQ (r, s) = (ξ × r, ξ × s)
J = r×π+s×σ (3.2.10)
µ ¶
1 1
= (r2 − r1 ) × (p2 − p1 ) + R − (r1 + r2 ) × (P − M j)
2 2
which becomes, after simplification,
Remark This expression for J also equals s1 ×π1 +s2 ×π2 in terms of the alternative
variables discussed above. ¨
In coordinates,
Aa = Iab gij K ib v j . (3.3.4)
3 Tangent and Cotangent Bundle Reduction 51
J
T ∗Q - g∗
6 6
FLq I(q)
TQ - g
Aq
Using the formula (3.1.3) for J, we see that horq is the space orthogonal to the
G-orbits, or, equivalently, the space of states with zero total angular momentum in
the case G = SO(3) (see Figure 3.3.2). This may be used to define the mechani-
cal connection—in fact, principal connections are characterized by their horizontal
spaces. The vertical space consists of vectors that are mapped to zero under the
projection Q → S = Q/G; i.e.,
G .q
verq
q Q
horq
[q] Q/G
i.e.,
(Aµ )i = gij K jb µa Iab . (3.3.8)
This coordinate formula shows that Aµ is given by (I−1 µ)Q . We prove this intrin-
sically in the next Proposition.
Proposition 3.3.2 The one form Aµ takes values in J−1 (µ). Moreover, identify-
ing vectors and one-forms,
Aµ = (I−1 µ)Q .
Proof The first part holds for any connection on Q → Q/G and follows from the
property A(ξQ )=ξ. Indeed,
hJ(Aµ (q)), ξi = hAµ (q), ξQ (q)i = hµ, A(ξQ (q))i = hµ, ξi.
Notice that this proposition holds for any connection on Q → Q/G, not just the
mechanical connection. It is straightforward to see that Aµ is characterized by
where Jq = J|Tq∗ Q and K(q, p) = 12 kpk2q is the kinetic energy function (see Fig-
ure 3.3.3). Indeed (3.3.9) means that Aµ (q) lies in and is orthogonal to the affine
space J−1 −1
q (µ), i.e., is orthogonal to the linear space Jq (µ) − Aµ (q) = Jq (0). The
latter means Aµ (q) is in the vertical space, so is of the form ηq and the former
means η is given by I−1 (q)µ.
−1
J (µ) ∩ Tq*Q
Aµ(q)
q
Q
Tq*Q
Figure 3.3.3: The mechanical connection as the orthogonal vector to the level set
of J in the cotangent fiber.
i.e.,
ω a = Iab K ib pi , (3.3.12)
∗ −1
and, using a slight abuse of notation, a projection hor : T Q → J (0) by
i.e.,
(hor(q, p))i = pi − gij K jb pk K ka Iab . (3.3.14)
This map hor in (3.3.13) will play a fundamental role in what follows. We also refer
to hor as the shifting map. This map will be used in the proof of the cotangent
bundle reduction theorem in §3.4; it is also an essential ingredient in the description
of a particle in a Yang-Mills field via the Kaluza-Klein construction, generalizing
the electromagnetic case in which p 7→ p − ec A. This aspect will be brought in later.
3 Tangent and Cotangent Bundle Reduction 54
Example Let Q = G be a Lie group with symmetry group G itself, acting say on
the left. In this case, Aµ is independent of the Hamiltonian. Since Aµ (q) ∈ J −1 (µ),
it follows that Aµ is the right invariant one form whose value at the identity e is µ.
(This same one form Aµ was used in Marsden and Weinstein [1974].) ¨
where [A, A]µ (v, w) = hµ, [A(v), A(w)]i, the bracket being the Lie algebra bracket.
The amended potential Vµ is defined by
Vµ = H ◦ A µ ; (3.3.18)
Reduction at zero is simple: J−1 (0)/G is isomorphic with T ∗ (Q/G) by the following
identification: βq ∈ J−1 (0) satisfies hβq , ξQ (q)i = 0 for all ξ ∈ g, so we can regard
βq as a one form on T (Q/G).
As a set, the fiber of the map horO is identified with O. Therefore, we have real-
ized (T ∗ Q)O as a coadjoint orbit bundle over T ∗ (Q/G). The spaces are summarized
in Figure 3.4.1.
T ∗Q ⊂ J−1 (µ) ⊂ J−1 (O) ⊂ T ∗ Q
/Gµ /G πO
? ?
∗
(T Q)µ ∼
= ∗
(T Q)O
injection horµ horO surjection
? ?
T ∗ (Q/Gµ ) T ∗ (Q/G)
? ? ? ?
Q −→ Q/Gµ −→ Q/G ←− Q
The Poisson bracket structure of the bundle horO as a synthesis of the Lie-
Poisson structure, the cotangent structure, the magnetic and interaction terms is
investigated in Montgomery, Marsden and Ratiu [1984] and Montgomery [1986].
For the symplectic structure, it is a little easier to use Pµ . Here, we restrict
the map hor to J−1 (µ) and quotient by Gµ to get a map of Pµ to J−1 (0)/Gµ . If
Jµ denotes the momentum map for Gµ , then J−1 (0)/Gµ embeds in J−1 ∼
µ (0)/Gµ =
3 Tangent and Cotangent Bundle Reduction 56
T ∗ (Q/Gµ ). The resulting map horµ embeds Pµ into T ∗ (Q/Gµ ). This map is induced
by the shifting map:
pq 7→ pq − Aµ (q). (3.4.3)
The symplectic form on Pµ is obtained by restricting the form on T ∗ (Q/Gµ ) given
by
Ωcanonical − dAµ . (3.4.4)
The two form dAµ drops to a two form βµ on the quotient, so (3.4.4) defines the
symplectic structure of Pµ . We call βµ the magnetic term following the ideas
from §2.10 and the terminology of Abraham and Marsden [1978]. We also note that
on J−1 (µ) (and identifying vectors and covectors via FL), [A(v), A(w)] = [µ, µ] = 0,
so that the form βµ may also be regarded as the form induced by the µ-component
of the curvature. In the PO context, the distinction between dAµ and (curv α)µ
becomes important. We shall see this aspect in Chapter 4.
Two limiting cases are noteworthy. The first (that one can associate with Arnold
[1966]) is when Q = G in which case PO ∼ = O and the base is trivial in the PO →
T ∗ (Q/G) picture, while in the Pµ → T ∗ (Q/Gµ ) picture, the fiber is trivial and the
base space is Q/Gµ ∼ = O. Here the description of the orbit symplectic structure
induced by dαµ coincides with that given by Kirillov [1976].
The other limiting case (that one can associate with Smale [1970]) is when
G = Gµ ; for instance, this holds in the abelian case. Then
Pµ = PO = T ∗ (Q/G)
Note that the construction of Pµ requires only that the Gµ action be free;
then one gets Pµ embedded in T ∗ (Q/Gµ ). However, the bundle picture of PO →
T ∗ (Q/G) with fiber O requires G to act freely on Q, so Q/G is defined. If the
G-action is not free, one can either deal with singularities or use Montgomery’s
method above, replacing shape space Q/G with the modified shape space Qµ /G
(and drop (3.4.5)). For example, already in the Kepler problem in which G = SO(3)
acts on Q = R3 \{0}, the action of G on Q is not free; for µ 6= 0, Qµ is the plane
orthogonal to µ (minus {0}). On the other hand, Gµ = S 1 , the rotations about the
3 Tangent and Cotangent Bundle Reduction 57
axis µ 6= 0, acts freely on J−1 (µ), so Pµ is defined. But Gµ does not act freely on
Q! Instead, Pµ is actually T ∗ (Qµ /Gµ ) = T ∗ (0, ∞), with the canonical cotangent
structure. The case µ = 0 is singular and is more interesting! (See Marsden [1981].)
Another context in which one can get interesting reduction results, both math-
ematically and physically, is that of semi-direct products (Marsden, Ratiu and We-
instein [1984a,b]). Here we consider a linear representation of G on a vector space
V . We form the semi-direct product S = G s V and look at its coadjoint orbit
Oµ,a through (µ, a) ∈ g∗ × V ∗ . The semi-direct product reduction theorem
says that Oµ,a is symplectomorphic with the reduced space obtained by reducing
T ∗ G at µa = µ|ga by the sub-group Ga (the isotropy for the action of G on V ∗ at
a ∈ V ∗ ). If Ga is abelian (the generic case) then abelian reduction gives
Oµ,a ∼
= T ∗ (G/Ga )
with the canonical plus magnetic structure. For example, the generic orbits in
SE(3) = SO(3) s R3 are cotangent bundles of spheres; but the orbit symplectic
structure has a non-trivial magnetic term — see Marsden and Ratiu [1994] for their
computation.
Semi-direct products come up in a variety of interesting physical situations,
such as the heavy rigid body, compressible fluids and MHD; we refer to Marsden,
Ratiu, and Weinstein [1984a,b] and Holm, Marsden, Ratiu, and Weinstein [1985]
for details.
For semidirect products, one has a useful result called reduction by stages.
Namely, one can reduce by S in two successive stages, first by V and then by G.
For example, for SE(3) one can reduce first by translations, and then by rotations,
and the result is the same as reducing by SE(3). For a general reduction by stages
result, see Marsden and Ratiu [1995].
Indeed,
1
HAµ (q, p) = hhp + Aµ , p + Aµ iiq + V (q)
2
1 1
= kpk2q + hhp, Aµ iiq + kAµ k2q + V (q). (3.4.7)
2 2
3 Tangent and Cotangent Bundle Reduction 58
If p = FL · v, then hhp, Aµ iiq = hAµ , vi = hµ, A(q, v)i = hµ, I(q)J(p)i = 0 since
J(p) = 0. Thus, on J−1 (0),
1
HAµ (q, p) = kpk2q + Vµ (q). (3.4.8)
2
In T ∗ (Q/Gµ ), we obtain Hµ by selecting any representative (q, p) of T ∗ (Q/Gµ )
in J−1 (0) ⊂ T ∗ Q, shifting it to J−1 (µ) by p 7→ p + Aµ (q) and then calculating H.
Thus, the above calculation (3.4.8) proves:
1
Hµ (q, p) = kpk2 + Vµ (q) (3.4.9)
2
defined on T ∗ (Q/Gµ ) with the symplectic structure
Ωµ = Ωcan − βµ (3.4.10)
where βµ is the two form on Q/Gµ obtained from dAµ on Q by passing to the
quotient. Here we use the quotient metric on Q/Gµ and identify Vµ with a function
on Q/Gµ .
Example If Q = G and the symmetry group is G itself, then the reduction con-
struction embeds Pµ ⊂ T ∗ (Q/Gµ ) as the zero section. In fact, Pµ is identified with
Q/Gµ ∼ = G/Gµ ∼= Oµ . ¨
The symplectic structure in this second picture was described abstractly above.
To describe it concretely in terms of T ∗ (Q/G) and O is more difficult; this problem
was solved by Montgomery, Marsden and Ratiu [1984]. We shall see some special
aspects of this structure when we separate internal and rotational modes in the next
chapters.
3 Tangent and Cotangent Bundle Reduction 59
3.5 Examples
We next give some basic examples of the cotangent bundle reduction construction.
The examples will be the spherical pendulum, the double spherical pendulum, and
the water molecule. Of course many more examples can be given and we refer the
reader to the literature cited for a wealth of information. The ones we have chosen
are, we hope, simple enough to be of pedagogical value, yet interesting.
y
ϕ
eθ
θ
eϕ
x
er
Figure 3.5.1: The configuration space of the spherical pendulum is the two sphere.
Note that I(θ, ϕ) = m(R sin θ)2 is the (instantaneous) moment of inertia of the
mass m about the z-axis.
In this example, we identify Q/S 1 with the interval [0, π]; i.e., the θ-variable.
In fact, it is convenient to regard Q/S 1 as S 1 mod Z2 , where Z2 acts by reflection
θ 7→ −θ. This helps to understand the singularity in the quotient space.
For µ ∈ g∗ ∼ = R, the one form Aµ is given by (3.3.3) and (3.3.7) as
From (3.5.4), one sees directly that Aµ takes values in J−1 (µ). The shifting map is
given by (3.3.13):
hor(θ, ϕ, pθ , pϕ ) = (θ, ϕ, pθ , 0).
The curvature of the connection A is zero in this example. The amended potential
is
1 1 µ2
Vµ (θ) = V (θ, ϕ) + hµ, I(θ, ϕ)−1 µi = −mgR cos θ +
2 2 mR2 sin2 θ
and so the reduced Hamiltonian on T ∗ (S 1 /Z2 ) is
1 p2θ
Hµ (θ, pθ ) = + Vµ (θ). (3.5.8)
2 mR2
The reduced Hamiltonian equations are therefore
pθ
θ̇ = ,
mR2
and
cos θ µ2
ṗθ = −mgR sin θ + . (3.5.9)
sin3 θ mR2
Note that the extra term is singular at θ = 0. ¨
q1
x
gravity m1
q2
m2
Figure 3.5.2: The configuration space for the double spherical pendulum consists of
two copies of the two sphere.
Note that (3.5.10) has the standard form of kinetic minus potential energy. We
identify the velocity vectors q̇1 and q̇2 with vectors perpendicular to q1 and q2 ,
respectively.
The conjugate momenta are
∂L
p1 = = m1 q̇1 + m2 (q̇1 + q̇2 ) (3.5.11)
∂ q̇1
and
∂L
p2 = = m2 (q̇1 + q̇2 ) (3.5.12)
∂ q̇2
regarded as vectors in R3 that are paired with vectors orthogonal to q1 and q2
respectively.
The Hamiltonian is therefore
1 1
H(q1 , q2 , p1 , p2 ) = kp1 − p2 k2 + kp2 k2
2m1 2m2
+ m1 gq1 · k + m2 g(q1 + q2 ) · k. (3.5.13)
The equations of motion are the Euler-Lagrange equations for L or, equivalently,
Hamilton’s equations for H. To write them out explicitly, it is easiest to coordinatize
Q. We will describe this process in §5.5.
Now let G = S 1 act on Q by simultaneous rotation about the z-axis. If Rθ is
the rotation by an angle θ, the action is
(q1 , q2 ) 7→ (Rθ q1 , Rθ q2 ).
3 Tangent and Cotangent Bundle Reduction 62
i.e.,
J = k · [q1 × p1 + q2 × p2 ]. (3.5.14)
From (3.5.11) and (3.5.12),
The locked inertia tensor is read off from the metric defining L in (3.5.10):
I(q1 , q2 ) = m1 kq⊥ ⊥ 2
1 k + m2 k(q1 + q2 ) k ,
2
(3.5.15)
where kq⊥1 k = kq1 k − kq1 · kk is the square length of the projection of q1 onto
2 2 2
the xy-plane. Note that I is the moment of inertia of the system about the k-axis.
The mechanical connection is given by (3.3.3):
Example 3 The water (or ozone) molecule We now compute the locked inertia
tensor, the mechanical connection and the amended potential for this example at
symmetric states. Since I(q) : g → g∗ satisfies
where hh , ii is the kinetic energy metric coming from the Lagrangian, I(r, s) is the
3 × 3 matrix determined by
From our expression for the Lagrangian, we get the kinetic energy metric
m
hh(u1 , u2 ), (v1 , v2 )ii = (u1 · v1 ) + 2M m(u2 · v2 ).
2
Thus,
where
m
γ = γ(r, s) =krk2 + 2M mksk2 .
2
With r = krk and s = ksk, we arrive at
1 m 4M m
I(r, s)−1 = Id + r⊗r+ s⊗s
γ 4M ms2 γ mr2 γ
· ¸
1 1 4M
= Id + r ⊗ r + s ⊗ s (3.5.22)
γ 4M s2 r2
We state the general case. Let s⊥ be the projection of s along the line perpendicular
s·r
to r, so s = s⊥ + 2 r. Then
r
I(r, s)−1 u = η, (3.5.23)
where
η = ar + bs⊥ + cr × s⊥
and
δu · r + βu · s⊥ βu · r + αu · s⊥ u · (r × s⊥ )
a= , b = , c =
αδ − β 2 αδ − β 2 γrs⊥
and
1
α = γr2 − mr4 − 2M m(s · r)2 , β = 2M m(s⊥ )2 s · r, δ = γr2 − 2M m(s⊥ )4 .
2
The mechanical connection, defined in general by
gives the “overall” angular velocity of the system. For us, let (r, s, ṙ, ṡ) be a tangent
vector and (r, s, π, σ) the corresponding momenta given by the Legendre transform.
Then, A is determined by
I·A=r×π+s×σ
or,
hm i hm i
krk2 + 2M mksk2 A − r(r · A) + 2M ms(s · A) = r × π + s × σ.
2 2
The formula for A is especially simple for symmetric molecules, when r ⊥ s. Then
one finds:
ṡ · n ṙ · n 1 nm o
A= r̂ − ŝ + krk2 s · ṙ − 2M mksk2 r · ṡ n (3.5.24)
ksk krk γkrkksk 2
Fix a total angular momentum vector µ and let hAµ (q), vq i = hµ, A(vq )i, so
Aµ (q) ∈ T ∗ Q. For example, using A above (for symmetric molecules), we get
· ¸
1 m 2M m
Aµ = (µ × r) − (s · µ)(r × s) dr
γ 2 r2
1h m i
+ 2M m(µ × s) + 2 (r · µ)(r × s) ds.
γ 2s
The momentum Aµ has the property that J(Aµ ) = µ, as may be checked. The
amended potential is then
Notice that for the water molecule, the reduced bundle (T ∗ Q)µ → T ∗ S is an S 2 bun-
dle over shape space. The sphere represents the effective “body angular momentum
variable”. ¨
Iwai [1987c, 1990a] has given nice formulas for the curvature of the mechanical
connection using a better choice of variables than Jacobi coordinates and Hsiang and
Montgomery [1995] have given a related phase formula for the three body problem.
where A is the mechanical connection. Notice that the Routhian has the form of a
Lagrangian with a gyroscopic term; see Bloch, Krishnaprasad, Marsden and Sánchez
[1992] and Krishnaprasad and Wang [1992] for information on the use of gyroscopic
systems in control theory. One can regard Rµ as a partial Legendre transform of
L, changing the angular velocity A(q, v) to the angular momentum µ. We will see
this explicitly in coordinate calculations below.
A basic observation about the Routhian is that solutions of the Euler-Lagrange
equations for L can be regarded as solutions of the Euler-Lagrange equations for the
3 Tangent and Cotangent Bundle Reduction 66
β = dAµ , (3.6.2)
∂Aj ∂Ai
βij = i
− ,
∂q ∂q j
We say that q(t) satisfies the Euler-Lagrange equations for a Lagrangian L with the
magnetic term β provided that the associated variational principle in the sense of
Lagrange and d’Alembert is satisfied:
Z b Z b
δ L(q(t), q̇)dt = iq̇ β, (3.6.4)
a a
where the variations are over curves in Q with fixed endpoints and where iq̇ denotes
the interior product by q̇. This condition is equivalent to the coordinate condition
stating that the Euler-Lagrange equations with gyroscopic forcing are satisfied:
d ∂L ∂L
− i = q̇ j βij . (3.6.5)
dt ∂ q̇ i ∂q
Proposition 3.6.1 A curve q(t) in Q whose tangent vector has momentum J(q, q̇) =
µ is a solution of the Euler-Lagrange equations for the Lagrangian L iff it is a so-
lution of the Euler-Lagrange equations for the Routhian Rµ with gyroscopic forcing
given by β.
Proof Let p denote the momentum conjugate to q for the Lagrangian L (so that
in coordinates, pi = gij q̇ j ) and let p be the corresponding conjugate momentum for
the Routhian. Clearly, p and p are related by the momentum shift p = p − Aµ .
Thus by the chain rule,
d d
p = p − T Aµ · q̇,
dt dt
or in coordinates,
d d ∂Ai j
p i = pi − q̇ . (3.6.6)
dt dt ∂q j
Likewise, Dq Rµ = Dq L − Dq hA(q, v), µi or in coordinates,
∂Rµ ∂L ∂Aj j
i
= i− q̇ . (3.6.7)
∂q ∂q ∂q i
3 Tangent and Cotangent Bundle Reduction 67
Proof Use the definition hor = v − ξQ (q, v) and expand the square using the
definition of J. ¥
where
θ̇a (q, q̇) = [µc − gαc (q)q̇ α ]Ica (q) (3.6.12)
is the unique solution of Ja (q, q̇, θ̇) = µa with respect to θ̇a .
µ
Proposition 3.6.3 Rclass = Rµ + µc gαa q̇ α Ica .
and
Aµ = µa dθa + gbα µa Iab dq α . (3.6.14)
By (3.3.14), and the preceding equation, we get
khor(q,θ) (q̇, θ̇)k2 = gαβ (q)q̇ α q̇ β − gαa (q)gbγ (q)q̇ α q̇ γ Iab (q). (3.6.15)
Using this and the identity (Iab ) = (gab )−1 , the proposition follows from the defini-
µ
tions of Rµ and Rclass by a straightforward algebraic computation. ¥
For the rigid body, we get a variational principle for curves on the momentum
sphere — here Q/Gµ ∼ = S 2 . For it to be well defined, it is essential that one uses
the variational principle in the sense of Lagrange and d’Alembert, and not in the
naive sense of the Lagrange-Hamilton principle. In this case, one checks that the
dropped Routhian is just (up to a sign) the kinetic energy of the body in body
coordinates. The principle then says that the variation of the kinetic energy over
curves with fixed points on the two sphere equals the integral of the magnetic term
(in this case a factor times the area element) contracted with the tangent to the
curve. One can check that this is correct by a direct verification. (If one wants
a variational principle in the usual sense, then one can do this by introduction of
“Clebsch variables”, as in Marsden and Weinstein [1983] and Cendra and Marsden
[1987].)
The rigid body also shows that the reduced variational principle given by Propo-
sition 3.6.4 in general is degenerate. This can be seen in two essentially equivalent
ways; first, the projection of the constraint J = µ can produce a nontrivial condi-
tion in T (Q/Gµ ) — corresponding to the embedding as a symplectic subbundle of
Pµ in T ∗ (Q/Gµ ). For the case of the rigid body, the subbundle is the zero section,
and the symplectic form is all magnetic (i.e., all coadjoint orbit structure). The
second way to view it is that the kinetic part of the induced Lagrangian is degen-
erate in the sense of Dirac, and so one has to cut it down to a smaller space to
get well defined dynamics. In this case, one cuts down the metric corresponding
to its degeneracy, and this is, coincidentally, the same cutting down as one gets by
imposing the constraint coming from the image of J = µ in the set T (Q/Gµ ). Of
course, this reduced variational principle in the Routhian context is consistent with
the variational principle for the Euler-Poincaré equations discussed in §2.6.
For the rigid body, and more generally, for T ∗ G, the one form αµ is independent
of the Lagrangian, or Hamiltonian. It is in fact, the right invariant one form on G
equaling µ at the identity, the same form used by Marsden and Weinstein [1974]
in the identification of the reduced space. Moreover, the system obtained by the
Lagrangian reduction procedure above is “already Hamiltonian”; in this case, the
reduced symplectic structure is “all magnetic”.
There is a well defined reconstruction procedure for these systems. One can
horizontally lift a curve in Q/Gµ to a curve d(t) in Q (which therefore has zero
angular momentum) and then one acts on it by a time dependent group element
solving the equation
ġ(t) = g(t)ξ(t)
where ξ(t) = α(d(t)), as in the theory of geometric phases — see Chapter 6 and
Marsden, Montgomery and Ratiu [1990].
In general, one arrives at the reduced Hamiltonian description on Pµ ⊂ T ∗ (Q/Gµ )
with the amended potential by performing a Legendre transform in the non-degenerate
variables; i.e., the fiber variables corresponding to the fibers of Pµ ⊂ T ∗ (Q/Gµ ).
For example, for abelian groups, one would perform a Legendre transformation in
all the variables.
Remarks
3 Tangent and Cotangent Bundle Reduction 70
d ∂l ∂l a d
b
− a Cdb η = 0. (3.7.2)
dt ∂η ∂η
However, this representation of the equations does not make global intrinsic sense
(unless Q → S admits a global flat connection) and the variables may not be
the best ones for other issues in mechanics, such as stability. The introduction of
a connection allows one to intrinsically and globally split the original variational
principle relative to horizontal and vertical variations. One gets from one form to
the other by means of the velocity shift given by replacing η by the vertical part
relative to the connection:
ξ a = Aaα ẋα + η a .
Here, Adα are the local coordinates of the connection A. This change of coordinates
is well motivated from the mechanical point of view. Indeed, the variables ξ have
the interpretation of the locked angular velocity and they often complete the square
in the kinetic energy expression, thus, helping to bring it to diagonal form. The
resulting reduced Euler-Lagrange equations are:
d ∂l ∂l ∂l ¡ a β ¢
− α = Bαβ ẋ + Bαda d
ξ (3.7.3)
dt ∂ ẋα ∂x ∂ξ a
d ∂l ∂l
= (B a ẋα + Cdb
a d
ξ ). (3.7.4)
dt ∂ξ b ∂ξ a αb
Bαd
a a
Cbd
Proposition 3.8.1 The bracket (3.8.2) makes g∗ × P into a Poisson manifold and
φ : T ∗ G × P → g∗ × P is a Poisson map, where the Poisson structure on T ∗ G × P is
3 Tangent and Cotangent Bundle Reduction 73
given by the sum of the canonical bracket on T ∗ G and the bracket on P . Moreover,
φ is G invariant and induces a Poisson diffeomorphism of (T ∗ G×P )/G with g∗ ×P .
δL
dg S ·
δαg
Thus { , }0 is (3.8.2) with the coupling or interaction terms dropped. We claim that
the map α eliminates the coupling:
Here, {dx F, hδK/δν, dx Ji}P means the pairing of dx F with the Hamiltonian vector
field associated with the one form hδK/δν, dx Ji, which is (δK/δν)P , by definition
of the momentum map. Thus the corresponding four terms in (3.8.5) cancel. Let
us consider the remaining terms. First of all, we consider
½¿ À ¿ À¾
δF δK
, dx J , , dx J . (3.8.6)
δν δν P
Since J is equivariant, it is a Poisson map to g∗+ . Thus, (3.8.6) becomes hJ, [δF/δν, δK/δν]i.
Similarly each of the terms
¿ µ ¶ À ¿ µ ¶ À
δF δK δK δF
− , dx J · and , dx J ·
δν δν P δν δν P
equal − hJ, [δF/δν, δK/δν]i, and therefore these three terms collapse to − hJ, [δF/δν, δK/δν]i
which combines with − hµ, [δF/δν, δK/δν]i to produce the expression − hν, [δF/δν, δK/δν]i =
{F, K}− . Thus, (3.8.5) collapses to (3.8.4). ¥
Remark This result is analogous to the isomorphism between the “Sternberg” and
“Weinstein” representations of a reduced principal bundle. See Sternberg [1977],
Weinstein [1978], Montgomery, Marsden and Ratiu [1984] and Montgomery [1984].
We conclude this section with some consequences. The first is a connection with
semi-direct products. Namely, we notice that if h is another Lie algebra and G acts
on h, we can reduce T ∗ G × h∗ by G.
3 Tangent and Cotangent Bundle Reduction 75
Corollary 3.8.4 Giving T ∗ G × h∗ the sum of the canonical and the minus Lie-
Poisson structure on h∗ , the reduced space (T ∗ G × h∗ )/G is g∗ × h∗ with the bracket
µ ¶ µ ¶
δK δF
{F, K} = {F, K}g∗ + {F, K}h∗ − dν F · + dν K · (3.8.7)
δµ h∗ δµ h∗
where (µ, ν) ∈ g∗ × h∗ , which is the Lie-Poisson bracket for the semidirect product
g s h.
where δ F̄ /δp means the fiber derivative of F̄ regarded as a vector field and dg K̄
means the derivative holding µ fixed. Using the chain rule, one gets (3.8.8). ¥
In the same spirit, one gets the next corollary by using the previous corollary
twice.
δK δF
− dg F · T Rg · + dg K · T Rg ·
δµ1 δµ1
δK δF
+ dg F · T Lg · − dg K · T Lg · (3.8.9)
δµ2 δµ2
where {F, K}− µ1 is the minus Lie-Poisson bracket with respect to the first variable
µ1 , and similarly for {F, K}−
µ2 .
Chapter 4
Relative Equilibria
XH (ze ) ∈ Tze (G · ze )
i.e., if the Hamiltonian vector field at ze points in the direction of the group orbit
through ze .
76
4 Relative Equilibria 77
Remarks
5. We note that the criteria here for relative equilibria are related to the principle
of symmetric criticality. See Palais [1979, 1985] for details. ¨
Proof The logic will go as follows:
i ⇒ iv ⇒ iii ⇒ ii ⇒ i
and iv ⇒ v ⇒ vi ⇒ vii ⇒ viii ⇒ ii.
First assume i, so XH (ze ) = ξP (ze ) for some ξ ∈ g. By definition of momentum
map, this gives XH (ze ) = XhJ,ξi (ze ) or XH−hJ,ξi (ze ) = 0. Since P is symplectic,
this implies H − hJ, ξi has a critical point at ze , i.e., that Hξ has a critical point at
ze , which is iv.
Next, assume iv. Let ϕt denote the flow of XH and ψtξ that of XhJ,ξi , so ψtξ (z) =
exp(tξ) · z. Since H is G-invariant, ϕt and ψtξ commute, so the flow of XH−hJ,ξi
ξ ξ
is ϕt ◦ ψ−t . Since H − hJ, ξi has a critical point at ze , it is fixed by ϕt ◦ ψ−t , so
ϕt (exp(−tξ) · ze ) = ze for all t ∈ R. Thus, ϕt (ze ) = exp(tξ) · ze , which is iii.
Condition iii shows that ze (t) ∈ G · ze ; but ze (t) ∈ J−1 (µ) and G · ze ∩ J−1 (µ) =
Gµ ·ze by equivariance, so iii implies ii and by taking tangents, we see that ii implies
i.
Assume iv again and notice that Hξ |J−1 (µ) = H|J−1 (µ), so v clearly holds.
That v is equivalent to vi is one version of the Lagrange multiplier theorem.
4 Relative Equilibria 78
and
ii ξ(ze ) ∈ gµ i.e., Ad∗exp tξ µ = µ.
Proof Since
ze (t) = exp(tξ) · ze ∈ J−1 (µ) ∩ G · ze = Gµ · ze ,
exp(tξ) ∈ Gµ , which is ii. Property i follows from the identity HAdg ξ (gz) = Hξ (z).
¥
1
+ V (q) − hhξQ (q), ξQ (q)iiq + hµ, ξi
2
1 1
= kpk − hp, ξQ (q)i + hhξQ (q), ξQ (q)iiq
2
2 2
1
+ V (q) − hhξQ (q), ξQ (q)iiq + hµ, ξi
2
1
= kpk − hJ(q, p), ξi + V (q) + hµ, ξi
2
2
= H(q, p) − hJ(q, p) − µ, ξi = Hξ (q, p).
This calculation together with parts i and iv of the relative equilibrium theorem
proves the following.
The functions Kξ and Vξ are called the augmented kinetic and potential en-
ergies respectively. The main point of this proposition is that it reduces the job
of finding relative equilibria to finding critical points of Vξ . We also note that
Proposition 4.2.1 follows directly from the method of Lagrangian reduction given
in §3.6.
There is another interesting way to rearrange the terms in Hξ , using the me-
chanical connection A and the amended potential Vµ . In carrying this out, it will
be useful to note these two identities:
which gives (4.2.4) and (4.2.5) was proved in the last chapter.
At a relative equilibrium, the relation pe = FL(ξQ (qe )) and (4.2.4) give
i.e.,
µ = I(qe )ξ. (4.2.6)
−1
We now show that if ξ = I(q) µ, then
where
1
Kµ (z) = kp − Aµ (q)k2 (4.2.8)
2
and Vµ (q) is the amended potential as before. Indeed,
Kµ (z) + Vµ (q)
1 1
= kp − Aµ (q)k2 + V (q) + hµ, I(q)−1 µi
2 2
1 1 1
= kpk2 − hhp, Aµ (q)iiq + kAµ (q)k2q + V (q) + hµ, ξi
2 2 2
1 1 1
= kpk2 − hµ, A(FL−1 (q, p))i + hµ, A(FL−1 (Aµ (q))i + V (q) + hµ, ξi
2 2 2
1 1 1
= kpk2 − hµ, I−1 J(q, p)i + hµ, I(q)−1 J(Aµ (q)i + V (q) + hµ, ξi
2 2 2
1 1 1
= kpk2 − hI−1 µ, J(q, p)i + hµ, ξi + V (q) + hµ, ξi
2 2 2
1
= kpk2 + V (q) − hJ(q, p), ξi + hµ, ξi = Hξ (q, p)
2
using J(Aµ (q)) = µ and J(z) = µ. Next we observe that
pe = Aµ (qe ) (4.2.9)
since
hAµ (qe ), vi = hµ, I−1 (qe )J(FL(v)i = hJ(qe , pe ), I−1 (qe )J(FL(v))i
= hpe , [I−1 (qe )J(FL(v))]Q i
= hFL(ξQ (q)), [I−1 (qe )J(FL(v))]Q i
= hξ, J(FL(v))i = hξQ (qe ), FL(v)i
= hhξQ (qe ), viiq = hFL(ξQ (qe )), vi = hpe , vi.
One can also derive (4.2.9) from (4.2.7) and also one can get ii directly from
ii of Proposition 4.2.1. As we shall see later, it will be Vµ that gives the sharpest
stability results, as opposed to Vξ .
In Simo, Lewis and Marsden [1991] it is shown how, in an appropriate sense, Vξ
and Vµ are related by a Legendre transformation.
However, I(qe )−1 µ = ξ by (4.2.6) and ad∗ξ µ = 0 by Proposition 4.1.3, part ii. Hence
Xh (µ) = 0. ¥
4.3 Examples
Example 1 The spherical pendulum Recall from Chapter 3 that Vξ and Vµ are
given by
ξ2
Vξ (θ) = −mgR cos θ − mR2 sin2 θ
2
and
1 µ2
Vµ (θ) = −mgR cos θ + .
2 mR sin2 θ
2
The relation (4.2.6) is µ = mR2 sin2 θe ξ, the usual relation between angular mo-
mentum µ and angular velocity ξ. The conditions of Proposition 4.2.1 are:
1 µ2
Vµ (q1 , q2 ) = m1 gq1 · k + m2 g(q1 + q2 ) · k + , (4.3.1)
2 I
where
I(q1 , q2 ) = m1 kq⊥ ⊥ ⊥ 2
1 k + m2 kq1 + q2 k .
2
The relative equilibria are computed by finding the critical points of Vµ . There
are four obvious relative equilibria — the ones with q⊥ ⊥
1 = 0 and q2 = 0, in which
the individual pendula are pointing vertically upwards or vertically downwards.
We now search for solutions with each pendulum pointing downwards, and with
4 Relative Equilibria 82
q⊥ ⊥
1 6= 0 and q2 6= 0. There are some other relative equilibria with one of the
pendula pointing upwards, as we shall discuss below.
We next express Vµ as a function of q⊥ ⊥
1 and q2 by using the constraints, which
gives the third components:
q q
q13 = − l12 − kq⊥
1k
2 and q23 = − l22 − kq⊥2k .
2
Thus,
q q
1 µ2
Vµ (q⊥ ⊥
1 , q2 )= −(m1 + m2 )g l1 − kq1 k − m2 g l22 − kq⊥
2 ⊥ 2
2k +
2 . (4.3.2)
2 I
Setting the derivatives of Vµ equal to zero gives
q⊥ µ2
(m1 + m2 )g p 1
= [(m1 + m2 )q⊥ ⊥
1 + m2 q2 ]
l1 − kq⊥
2
1k
2 I2
q⊥ µ2
m2 g p 2
= [m2 (q⊥ ⊥
1 + q2 )]. (4.3.3)
l22 − kq⊥
2k
2 I2
We note that the equations for critical points of Vξ give the same equations with
µ = Iξ.
From (4.3.3) we see that the vectors q⊥ ⊥
1 and q2 are parallel. Therefore, define
a parameter α by
q⊥2 = αq1
⊥
(4.3.4)
and let λ be defined by
kq⊥
1 k = λl1 . (4.3.5)
Notice that α and λ determine the shape of the relative equilibrium. Define the
system parameters r and m by
l2 m1 + m2
r= , m= , (4.3.6)
l1 m2
so that conditions (4.3.3) are equivalent to
mg 1 µ2
√ = 2 (m + α)
l1 1 − λ2 I . (4.3.7)
g α µ2
√ = 2 (1 + α)
l1 r 2 − α 2 λ 2 I
The restrictions on the parameters are as follows: First, from kq⊥
1 k ≤ l1 and
kq⊥
2k ≤ l2 , we get
0 ≤ λ ≤ min{r/α, 1} (4.3.8)
and next, from Equations (4.3.7), we get the restriction that either
α>0 or − m < α < −1. (4.3.9)
The intervals (−∞, m) and (−1, 0) are also possible and correspond to pendulum
configurations with the first and second pendulum inverted, respectively. Dividing
the Equations (4.3.7) to eliminate µ and using a little algebra proves the following
result.
4 Relative Equilibria 83
Theorem 4.3.1 All of the relative equilibria of the double spherical pendulum apart
the four equilibria with the two pendula vertical are given by the points on the graph
of
L2 − r2
λ2 = 2 (4.3.10)
L − α2
where ³ µ ¶
α´ α
L(α) = 1 + ,
m 1+α
subject to the restriction 0 ≤ λ2 ≤ r2 /α2 .
From (4.3.7) we can express either µ or ξ in terms of α. In Figures 4.3.1 and 4.3.2
we show the relative equilibria for two sample values of the system parameters.
Note that there is a bifurcation of relative equilibria for fixed m and increasing r,
and that it occurs within the range of restricted values of α and λ. Also note that
there can be two or three relative equilibria for a given set of system parameters.
λ2
1.5
λ2 = r2/α2
1.25
1
0.75
0.5
0.25
α
−3 −2 −1 1 2
−0.25
−0.5
The bifurcation of relative equilibria that happens between Figures 4.3.1 and
4.3.2 does so along the curve in the (r, m) plane given by
2m
r=
1+m
as is readily seen. For instance, for m = 2 one gets r = 4/3, in agreement with the
figures. The above results are in agreement with those of Baillieul [1987].
λ2
1.5 λ2 = r2/α2
1.25
0.75
0.5
0.25
α
−3 −2 −1 1 2
−0.25
−0.5
Figure 4.3.2: The graph of λ2 verses α for r = 1.35 and m = 2 and of λ2 = r2 /α2 .
Hξ = H − hJ, ξi
1 kπk2 1
= kσk2 + + kjk2 + V
4mM m 2M
− π · (ξ × r) − σ · (ξ × s). (4.3.11)
Thus, the conditions for a critical point associated with δHξ = 0 are:
2
δπ : π =ξ×r (4.3.12)
m
1
δσ : σ =ξ×s (4.3.13)
2mM
∂V
δr : =π×ξ (4.3.14)
∂r
∂V
δs : = σ × ξ. (4.3.15)
∂s
The first two equations express π and σ in terms of r and s. When they are
substituted in the second equations, we get conditions on (r, s) alone:
∂V m m£ ¤
= (ξ × r) × ξ = ξ(ξ · r) − rkξk2 (4.3.16)
∂r 2 2
4 Relative Equilibria 85
and
∂V £ ¤
= 2mM (ξ × s) × ξ = 2mM ξ(ξ · s) − skξk2 . (4.3.17)
∂s
These are the conditions for a relative equilibria we sought. Any collection (ξ, r, s)
satisfying (4.3.16) and (4.3.17) gives a relative equilibrium.
Now consider Vξ for the water molecule:
1 h 2 ³m ´i
Vξ (r, s) = V (r, s) − kξk krk2 + 2M mksk2
2 2
1 hm i
− (ξ · r)2 + 2M m(ξ · s)2 . (4.3.18)
2 2
The conditions for a critical point are:
∂V mr m
δr : − kξk2 − (ξ · r)ξ = 0 (4.3.19)
∂r 2 2
∂V
δs : − kξk · 2M ms − 2M m(ξ · s)ξ = 0,
2
(4.3.20)
∂s
which coincide with (4.3.16) and (4.3.17). The extra condition from Proposi-
tion 4.2.1, namely pe = FL(ξQ (qe )), gives (4.3.12) and (4.3.13).
Equations (4.4.1) and (4.4.2) identify the Lie algebra so(3) with R3 and the Lie
algebra bracket with the cross product of vectors. If Λ ∈ SO(3) and Θ̂ ∈ so(3), the
adjoint action is given by
AdΛ Θ̂ = [ΛΘ]ˆ. (4.4.3)
The fact that the adjoint action is a Lie algebra homomorphism, corresponds to the
identity
Λ(r × s) = Λr × Λs, (4.4.4)
for all r, s ∈ R3 .
4 Relative Equilibria 86
Given Λ ∈ SO(3), let v̂Λ denote an element of the tangent space to SO(3) at Λ.
Since SO(3) is a submanifold of GL(3), the general linear group, we can identify v̂Λ
with a 3 × 3 matrix, which we denote with the same letter. Linearizing the defining
(submersive) condition ΛΛT = 1 gives
T
Λv̂Λ + v̂Λ ΛT = 0, (4.4.5)
which defines TΛ SO(3). We can identify TΛ SO(3) with so(3) by two isomorphisms:
First, given Θ̂ ∈ so(3) and Λ ∈ SO(3), define (Λ, Θ̂) 7→ Θ̂Λ ∈ TΛ SO(3) by letting
Θ̂Λ be the left invariant extension of Θ̂:
Θ̂Λ := Te LΛ · Θ̂ ∼
= (Λ, ΛΘ̂). (4.4.6)
Second, given θ̂ ∈ so(3) and Λ ∈ SO(3), define (Λ, θ̂) 7→ θ̂Λ ∈ TΛ SO(3) through
right translations by setting
θ̂Λ := Te RΛ · θ̂ ∼
= (Λ, θ̂Λ). (4.4.7)
1 1
hΠ̂Λ , Θ̂Λ i = tr[Π̂TΛ Θ̂Λ ] = tr[Π̂T Θ̂] = Π · Θ. (4.4.8)
2 2
Again, explicit indication of the base point will often be omitted and we shall simply
write ΛΠ̂ and π̂Λ for Π̂Λ and π̂Λ , respectively. If (4.4.9) and (4.4.10) represent the
same covector, then
π̂ = ΛΠ̂ΛT , (4.4.11)
which coincides with the co-adjoint action. Equivalently, using the isomorphism (4.4.2)
we have
π = ΛΠ. (4.4.12)
The mechanical set-up for rigid body dynamics is as follows: the configuration
manifold Q and the phase space P are
or on R by 3
Z
hha, bii = ρref (X)(a × X) · (b × X) d3 X = a · Ib (4.4.15)
B
where Z
I= ρref (X)[kXk2 Id − X ⊗ X] d3 X (4.4.16)
B
is the inertia tensor .
The corresponding Hamiltonian is
1 1
H= Π · I−1 Π = π · I−1 π (4.4.17)
2 2
where I = ΛIΛ−1 is the time dependent inertia tensor .
The expression H = 12 Π · I−1 Π reflects the left invariance of H under the action
of SO(3). Thus left reduction by SO(3) to body coordinates induces a function on the
quotient space T ∗ SO(3)/SO(3) ∼ = so(3)∗ . The symplectic leaves are spheres, kΠk =
constant. The induced function on these spheres is given by (4.4.17) regarded as a
function of Π. The dynamics on this sphere is obtained by intersection of the sphere
kΠk2 = constant and the ellipsoid H = constant that we discussed in Chapter 1.
Consistent with the preceding discussion, let G = SO(3) act from the left on
Q = SO(3) i.e.,
Q · Λ = LQ Λ = QΛ, (4.4.18)
for all Λ ∈ SO(3) and Q ∈ G. Hence the action of G = SO(3) on P = T ∗ SO(3)
is by cotangent lift of left translations. Since the infinitesimal generator associated
with ξˆ ∈ so(3) is obtained as
¯
d ¯
ξˆSO(3) (Λ) = ˆ ¯
exp[tξ]Λ ˆ
= ξΛ, (4.4.19)
dt ¯
t=0
b e ∈ SO(3),
² 7→ Λ² = exp[²δθ]Λ (4.4.23)
which starts at πe . These constructions induce a curve ² 7→ π̂Λe ∈ T ∗ SO(3) via the
isomorphism (4.4.10); that is, π̂Λe := (Λ² , π̂² Λ² ). With this notation at hand we
compute the first variation using the chain rule. Let
¯
d ¯
δHξ |e := Hξ,² ¯¯ = 0, (4.4.26)
d² ²=0
where
1
Hξ,² := π² · I−1
² π² − ξ · π² and I−1
² := Λ² I
−1 T
Λ² .
2
In addition, at equilibrium, J(ze ) = µ reads
π = πe . (4.4.27)
δHξ |e = δπ · [I−1 −1
e πe − ξ] + δθ · [Ie πe × πe ] = 0. (4.4.29)
I−1
e πe × πe = 0, and I−1
e πe = ξ. (4.4.30)
4 Relative Equilibria 89
Equivalently,
ξ × πe = 0, and I−1
e ξ = λξ, (4.4.31)
where λ > 0 by positive definiteness of Ie = Λe IΛTe . These conditions state that πe
is aligned with a principal axis, and that the rotation is about this axis. Note that
πe = Ie ξ, so that ξ does indeed correspond to the angular velocity.
Equivalently, we can locate the relative equilibria using Proposition 4.2.1. The
locked inertia tensor (as the notation suggests) is just I and here pe = FL(ξQ (qe ))
reads πe = Ie ξ, while qe = Λe is required to be a critical point of Vξ or Vµ , where
µ = πe = Iξ. Thus
1 1
Vξ (Λ) = ξ · (Iξ) and Vµ (Λ) = − µ · (I−1 µ). (4.4.32)
2 2
The calculation giving (4.4.28) shows that Λe is a critical point of Vξ , or of Vµ iff
ξ × πe = 0.
These calculations show that rigid body relative equilibria correspond to steady
rotational motions about their principal axes. The “global” perspective taken above
is perhaps a bit long winded, but it is a point of view that is useful in the long run.
Chapter 5
The Energy-Momentum
Method
This chapter develops the energy-momentum method of Simo, Posbergh and Mars-
den [1990, 1991] and Simo, Lewis and Marsden [1991]. This is a technique for
determining the stability of relative equilibria and for putting the equations of mo-
tion linearized at a relative equilibrium into normal form. This normal form is
based on a special decomposition of variations into rigid and internal components
that gives a block structure to the Hamiltonian and symplectic structure. There has
been considerable development of stability and bifurcation techniques over the last
decade, and some properties like block diagonalization have been seen in a variety
of problems; for example, this appears to be what is happening in Morrison and
Pfirsch [1990].
90
5 The Energy-Momentum Method 91
J−1(µe )
S
ze
Gµe . ze
Figure 5.1.1: The energy-momentum method tests the second variation of the aug-
mented Hamiltonian for definiteness on the space S.
Proof First, one has Gµ -orbital stability within J−1 (µ) because the reduced dy-
namics induces a well defined dynamics on the orbit space Pµ ; this dynamics induces
a dynamical system on S for which Hξ is an invariant function. Since it has a non-
degenerate extremum at ze , invariance of its level sets gives the required stability.
Second, one gets Gν orbital stability within J−1 (ν) for ν close to µ since the
form of Figure 5.1.1 changes in a regular way for nearby level sets, as µ is both a
regular value and a generic point in g∗ . ¥
For results for non-generic µ or non-regular µ, see Patrick [1990] and Lewis
[1991].
A well known example that may be treated as an instance of the energy-
momentum method, where the necessity of considering orbital stability is especially
clear, is the stability of solitons in the KdV equation. (A result of Benjamin [1972]
and Bona [1974].) Here one gets stability of solitons modulo translations; one cannot
expect literal stability because a slight change in the amplitude cases a translational
drift, but the soliton shape remains dynamically stable.
This example is actually infinite dimensional, and here one must employ addi-
tional hypotheses for Theorem 5.1.1 to be valid. Two possible infinite dimensional
versions are as follows: one uses convexity hypotheses going back to Arnold [1969]
5 The Energy-Momentum Method 92
(see Holm et al. [1985] for a concise summary) or, what is appropriate for the KdV
equation, employ Sobolev spaces on which the calculus argument of Theorem 5.1.1
is correct — one needs the energy norm defined by δ 2 Hξ (ze ) to be equivalent to
a Sobolev norm in which one has global existence theorems and on which Hξ is a
smooth function. Sometimes, such as in three dimensional elasticity, this can be a
serious difficulty (see, for instance, Ball and Marsden [1984]). In other situations,
a special analysis is needed, as in Wan and Pulvirente [1984] and Batt [1990].
The energy-momentum method can be compared to Arnold’s energy-Casimir
method that takes place on the Poisson manifold P/G rather than on P itself.
Consider the diagram in Figure 5.1.2.
P
@
π @ J
@
R
@
P/G g∗
@
@
C @ Φ
R
@
R
δΦ
(µ) = −ξ,
δµ
then one can define a Casimir C by C ◦ π = Φ ◦ J and H + C will have a critical
point at ue .
Remark This diagram gives another way of viewing symplectic reduction, namely,
as the level sets of Casimir functions in P/G. In general, the symplectic reduced
spaces may be viewed as the symplectic leaves in P/G. These may or may not be
realizable as level sets of Casimir functions. ¨
ideal flow. On the other hand, the energy-Casimir method is able to treat some
singular cases rather easily since P/G can still be a smooth manifold in the singular
case. However, there is still the problem of interpretation of the results in P ; see
Patrick [1990].
For simple mechanical systems, one way to choose S is as follows. Let
i.e., the metric orthogonal of the tangent space to the Gµ -orbit in Q. Let
Taking the variation of (5.1.4) with respect to z, evaluating at ze and using the fact
that dHξ (ze ) = 0, one gets the expression
which vanishes if Tze J(ze ) · δz = 0, i.e., if δz ∈ ker [Tze J(ze )] = Tze J−1 (µe ). ¥
Proposition 5.1.3 δ 2 Hξ (ze ) vanishes identically on ker [Tze J(ze )] along the direc-
tions tangent to the orbit Gµ · ze ; that is
Proof A general fact about reduction is that Tze (Gµ ·ze ) = Tze (G·ze )∩ker[Tze J(ze )].
Since Tze (Gµ · ze ) ⊂ Tze (G · ze ) the result follows from (5.1.2) by taking δz = ξP (ze )
with ξ ∈ gµ . ¥
These propositions confirm the general geometric picture set out in Figure 5.1.1.
They show explicitly that the orbit directions are neutral directions of δ 2 Hξ (ze ), so
one has no chance of proving definiteness except on a transverse to the Gµ -orbit.
and
I1 > I2 , I1 > I3 . (5.2.8)
Consequently,
µ ¶ µ ¶2
1 1
Φ(x) = − x+ x−
I1 2
satisfies (5.2.5) and makes the second derivative of HCΦ at (1, 0, 0) positive definite,
so stationary rotation around the longest axis is stable. The quadratic form is
negative definite provided µ ¶
00 1
Φ <0 (5.2.9)
2
and
I1 < I2 , I1 < I3 . (5.2.10)
A specific function Φ satisfying the requirements (5.2.5) and (5.2.9) is
µ ¶ µ ¶2
1 1
Φ(x) = − x− x− .
I1 2
This proves that the rigid body in steady rotation around the short axis is (Lia-
punov) stable. Finally, the quadratic form (5.2.6) is indefinite if
or the other way around. One needs an additional argument to show that rota-
tion around the middle axis is unstable. Perhaps the simplest way is as follows:
5 The Energy-Momentum Method 96
Both eigenvalues are real by (5.2.11) and one is strictly positive. Thus Πe is spec-
trally unstable and thus is (nonlinearly) unstable. Thus, in the motion of a free
rigid body, rotation around the long and short axes is (Liapunov) stable and around
the middle axis is unstable.
The energy-Casimir method deals with stability for the dynamics on Π-space.
However, one also wants (corresponding to what one “sees”) to show orbital stabil-
ity for the motion in T ∗ SO(3). This follows from stability in the variable Π and
conservation of spatial angular momentum. The energy-momentum method gives
this directly.
While it appears to be more complicated for the rigid body, the power of the
energy-momentum method is revealed when one does more complex examples, as
in Simo, Posbergh and Marsden [1990] and Lewis and Simo [1990].
In §4.4 we set up the rigid body as a mechanical system on T ∗ SO(3) and we
located the relative equilibria. By differentiating as in (4.4.29) we find the following
formula:
Note that this matrix is 6×6. Corresponding to the two conditions on S, we restrict
the admissible variations (δ, π, δθ) ∈ R3∗ × R3 . Here, J(π̂∧ ) = π̂Λ ; hence µ = π̂e
and Tze (Gµ · ze ) = infinitesimal rotations about the axis πe ; i.e., multiples of πe ,
or equivalently ξ. Variations that are orthogonal to this space and also lie in the
space δπ = 0 (which is the condition δJ = δπ = 0) are of the form δθ with δθ ⊥ πe .
Thus, we choose
S = {(δπ, δθ) | δπ = 0, δθ ⊥ πe }. (5.2.14)
5 The Energy-Momentum Method 97
which is the co-adjoint orbit through πe . The second variation (5.2.13) restricted
to the subspace S is given by
If λ is the largest or smallest eigenvalue of I, (5.2.16) will be definite; note that the
null space of I−1 − λ1 in (5.2.16) consists of vectors parallel to πe , which have been
excluded. Also note that in this example, S is a 2-dimensional space and (5.2.16)
in fact represents a 2 × 2 matrix. As we shall see below, this 2 × 2 block can also
be viewed as δ 2 Vµ (qe ) on the space of rigid variations.
where χ(q,µ) : g⊥
µ → g is defined by
The Arnold form appears in Arnold’s [1966] stability analysis of relative equilibria
in the special case Q = G. At a relative equilibrium, the form Aµ is symmetric,
as is verified either directly or by recognizing it as the second variation of Vµ on
VRIG × VRIG (see (5.3.15) below for this calculation).
At a relative equilibrium, the form Aµ is degenerate as a symmetric bilinear
form on g⊥ ⊥
µ when there is a non-zero ζ ∈ gµ such that
Indeed, non-degeneracy of the Arnold form implies VRIG ∩ VINT = {0} and, at
least in the finite dimensional case, a dimension count gives (5.3.5). In the infinite
dimensional case, the relevant ellipticity conditions are needed.
The split (5.3.5) can now be used to induce a split of the phase space
Using a more mechanical viewpoint, Simo, Lewis and Marsden [1991] show how
SRIG can be defined by extending VRIG from positions to momenta using superposed
rigid motions. For our purposes, the important characterization of SRIG is via the
mechanical connection:
SRIG = Tq αµ · VRIG , (5.3.7)
so SRIG is isomorphic to VRIG . Since αµ maps Q to J−1 (µ) and VRIG ⊂ V, we get
SRIG ⊂ S. Define
SINT = {δz ∈ S | δq ∈ VINT }; (5.3.8)
then (5.3.6) holds if the Arnold Form is non-degenerate. Next, we write
∗
SINT = WINT ⊕ WINT , (5.3.9)
∗
where WINT and WINT are defined as follows:
∗
WINT = Tq αµ · VINT and WINT = {vert(γ) | γ ∈ [g · q]0 } (5.3.10)
where g·q = {ζQ (q) | ζ ∈ g}, [g·q]0 ⊂ Tq∗ Q is its annihilator, and vert(γ) ∈ Tz (T ∗ Q)
is the vertical lift of γ ∈ Tq∗ Q; in coordinates, vert(q i , γj ) = (q i , pj , 0, γj ). The
vertical lift is given intrinsically by taking the tangent to the curve σ(s) = z + sγ
at s = 0.
and
· ¸ · ¸
coadjoint orbit internal rigid
0
symplectic form coupling
· ¸
Ωze =
− internal rigid
S 1
coupling
0 −1 0
∗
where the columns represent elements of SRIG , WINT and WINT , respectively.
5 The Energy-Momentum Method 100
Remarks
1. For the viewpoint of these splittings being those of a connection, see Lewis,
Marsden, Ratiu and Simo [1990].
2. Arms, Fischer and Marsden [1975] and Marsden [1981] describe an important
phase space splitting of Tz P at a point z ∈ J−1 (0) into three pieces, namely
where each of the functions Kµ and Vµ has a critical point at the relative equilib-
rium. It suffices to show that each term separately satisfies the lemma. The second
variation of Kµ is
and so
D2 I−1 (q) · (∆q, δq)] · µ = adη [(DI(q)−1 · δq)µ] + (DI(q)−1 · δq) · ad∗η µ.
hµ, D2 I−1 (q) · (∆q, δq) · µi = 2hµ, adη [DI(q)−1 · δq)µ]i. (5.3.15)
These calculations and similar ones given below establish the block diagonal
structure of δ 2 Hξ (ze ). We shall see in Chapter 8 that discrete symmetries can pro-
duce interesting subblocking within δ 2 Vµ on VINT . Lewis [1991] has shown how
to perform these same constructions for general Lagrangian systems. This is im-
portant because not all systems are of the form kinetic plus potential energy. For
example, gyroscopic control systems are of this sort. We shall see one example in
Chapter 7 and refer to Wang and Krishnaprasad [1992] for others, some of the gen-
eral theory of these systems and some control theoretic applications. The hallmark
of gyroscopic systems is in fact the presense of magnetic terms and we shall discuss
this next in the block diagonalization context.
As far as stability is concerned, we have the following consequence of block
diagonalization.
This follows from the fact that δ 2 Kµ is positive definite and δ 2 Hξ has the above
block diagonal structure.
In examples, it is this form of the energy-momentum method that is often the
easiest to use.
5 The Energy-Momentum Method 102
Finally in this section we should note the relation between δ 2 Vξ (qe ) and δ 2 Vµ (qe ).
A straightforward calculation shows that
and the correction term is positive. Thus, if δ 2 Vξ (qe ) is positive definite, then so is
δ 2 Vµ (qe ), but not necessarily conversely. Thus, δ 2 Vµ (qe ) gives sharp conditions for
stability (in the sense of predicting definiteness of δ 2 Hξ (ze )), while δ 2 Vξ gives only
sufficient conditions.
Using the notation ζ = [DI−1 (qe ) · δq]µ ∈ gµ (see (5.3.3)), observe that the
“correcting term” in (5.3.16) is given by hI(qe )ζ, ζi = hhζQ (qe ), ζQ (qe )ii. For-
mula (5.3.16) is often the easiest to compute with since, as we saw with the water
molecule, Vµ can be complicated compared to Vξ .
Example The water molecule For the water molecule, VRIG is two dimensional
(as it is for any system with G = SO(3)). The definition gives, at a configuration
(r, s), and angular momentum µ,
³m ´
VRIG = {(η × r, η × s) | η ∈ R3 satisfies krk2 + 2M mksk2 η · µ
2
m
− (r · η)(r · µ) + 2M m(s · µ)(s · η) = 0}.
2
The condition on η is just the condition η ∈ g⊥ µ . The internal space is three
dimensional, the dimension of shape space. The definition gives
M q̈ + S q̇ + Λq = 0
5 The Energy-Momentum Method 103
where ζ = I(qe )−1 ad∗η µ and vert denotes the vertical lift.
Proof We shall give a coordinate proof and leave it to the reader to supply an
intrinsic one. In coordinates, Aµ is given by (3.3.8) as (Aµ )i = gij K jb µa Iab . Differ-
entiating,
∂gij k j ∂K jb k ∂Iab
[T Aµ · ν]i = k
ν K µ
b a Iab
+ gij k
ν µa Iab + gij K jb µa k ν k . (5.4.2)
∂q ∂q ∂q
The fact that the action consists of isometries gives an identity allowing us to
eliminate derivatives of gij :
∂gij k a ∂K ka a ∂K ka a
k
K a η + gkj i
η + gik η = 0. (5.4.3)
∂q ∂q ∂q j
For group actions, one has the general identity [ηQ , ζQ ] = −[η, ζ]Q which gives
∂K ka m ∂K kb m
K − K = K kc Cab
c
. (5.4.5)
∂q m b ∂q m a
This allows one to simplify the first two terms in (5.4.4) giving
∂K ka j a bc ∂Iab
(∆z)i = −gik K kd Cab
d a
η µc Ibc − gkj i
K b η µc I + gij K jb K kc k µa η c . (5.4.6)
∂q ∂q
∂Iab k c
K η = Ced η I + Iae Cde
a e db b d
η . (5.4.7)
∂q k c
5 The Energy-Momentum Method 104
∂K ka j a bc
(∆z)i = gik K kd Cab
c a
η µc Ibd − gkj K b η µc I (5.4.8)
∂q i
which is exactly (5.4.1) since at a relative equilibrium, p = Aµ (q); i.e., pk =
gkj K jb Ibc µc . ¥
Ω(ze )(∆z, δz) = (δp)i (∆q)i − (∆p)i (δq)i = δpi K ia η a − (∆p)i (δq i ).
∂K ia k a
h(DJ · δz), ηi = δpi K ia η a + pi δq η
∂q k
gives (5.4.9). ¥
If we define the one form Aξ by Aξ (q) = FL(ξQ (q)), then the definition of VINT
shows that on this space dAµ = dAξ . This is a useful remark since dAξ is somewhat
easier to compute in examples.
If we had made the “naive” choice of VINT as the orthogonal complement of the
G-orbit, then we could also replace dAµ by hµ, curv Ai. However, with our choice
of VINT , one must be careful of the distinction.
∗
We leave it for the reader to check that the rest of the WINT , WINT block is as
stated.
1 µ2
+ . (5.5.3)
2 m1 r12 + m2 (r12 + r22 + 2r1 r2 cos ϕ)
Notice that Vµ depends on the angles θ1 and θ2 only through ϕ = θ2 − θ1 , as it
should by S 1 -invariance. Next one calculates the second variation at one of the
relative equilibria found in §4.3. If we calculate it as a 3 × 3 matrix in the variables
r1 , r2 , ϕ, then one checks that we will automatically be in a space orthogonal to the
Gµ -orbits. One finds, after some computation, that
a b 0
δ 2 Vµ = b d 0 (5.5.4)
0 0 e
where
µ2 (3(m + α)2 − α2 (m − 1)) gm2 m
a = +
λ4 l14 m2 (m + α2 + 2α)3 l1 (1 − λ2 )3/2
µ2 3(m + α2 + 2α) + 4α(m − 1)
b = (sign α) 4 4
λ l1 m2 (m + α2 + 2α)3
µ 2
3(α + 1) + 1 − m m2 g
2
r2
d = +
λ4 l14 m2 (m + α2 + 2α)3 l1 (r2 − λ2 α2 )3/2
2
µ α
e = 2 .
λ l1 m2 (m + α2 + 2α)2
2
Notice the zeros in (5.5.4); they are in fact a result of discrete symmetry, as we
shall see in Chapter 8. Without the help of these zeros (for example, if the calcula-
tion is done in arbitrary coordinates), the expression for δ 2 Vµ might be intractible.
Based on this calculation one finds:
Proposition 5.5.1 The signature of δ 2 Vµ along the “straight out” branch of the
double spherical pendulum (with α > 0) is (+, +, +) and so is stable. The signature
along the “cowboy branch” with α < 0 and emanating from the straight down state
(λ = 0) is (−, −, +) and along the remaining branches is (−, +, +).
5 The Energy-Momentum Method 107
The stability along the cowboy branch requires further analysis that we shall
outline in Chapter 10. The remaining branches are linearly unstable since the index
is odd (this is for reasons we shall go into in Chapter 10).
To get this instability and bifurcation information, one needs to linearize the
reduced equations and compute the corresponding eigenvalues. There are (at least)
three methodologies that can be used for computing the reduced linearized equa-
tions:
i Compute the Euler-Lagrange equations from (5.5.2), drop them to J−1 (µ)/Gµ
and linearize the resulting equations.
ii Read off the linearized reduced equations from the block diagonal form of
δ 2 Hξ and the symplectic structure.
iii Perform Lagrangian reduction as in §3.6 to obtain the Lagrangian structure
of the reduced system and linearize it at a relative equilibrium.
For the double spherical pendulum, perhaps the first method is the quickest to get
the answer, but of course the other methods provide insight and information about
the structure of the system obtained.
The linearized system obtained has the following standard form expected for
abelian reduction:
M q̈ + S q̇ + Λq = 0. (5.5.5)
In our case q = (r1 , r2 , ϕ) and Λ is the matrix (5.5.4) given above. The mass matrix
M is
m11 m12 0
M = m12 m22 0
0 0 m33
where
µ ¶
m1 + m2 αλ2
m11 = , m 12 = (sign α)m 2 1 + √ √
1 − λ2 1 − λ2 r 2 − α 2 λ2
2
r α2
m22 = m2 2 , m33 = m2 l12 λ2 (m − 1)
r −λ α 2 2 m + α2 + 2α
and the gyroscopic matrix S (the magnetic term) is
0 0 s13
S= 0 0 s23
−s13 −s23 0
where
µ 2α2 (m − 1)
s13 = and
λl1 (m + α2 + 2α)2
µ 2α(m − 1)
s23 = −(sign α) .
λl1 (m + α2 + 2α)2
We will pick up this discussion again in Chapter 10.
Chapter 6
Geometric Phases
In this chapter we give the basic ideas for geometric phases in terms of reconstruc-
tion, prove Montgomery’s formula for rigid body phases, and give some other basic
examples. We refer to Marsden, Montgomery and Ratiu [1990] for more informa-
tion.
108
6 Geometric Phases 109
inertial frame
body #1
body #2
θ1
θ2
This is the amount by which body #1 rotates, each time body #2 goes around
once. This result is independent of the detailed dynamics and only depends on the
fact that angular momentum is conserved and that body #2 goes around once. In
particular, we get the same answer even if there is a “hinge potential” hindering
the motion or if there is a control present in the joint. Also note that if we want to
rotate body #1 by −2πkI2 /(I1 + I2 ) radians, where k is an integer, all one needs
to do is spin body #2 around k times, then stop it. By conservation of angular
momentum, body #1 will stay in that orientation after stopping body #2.
In particular, if we think of body #1 as a spacecraft and body #2 as an internal
rotor, this shows that by manipulating the rotor, we have control over the attitude
(orientation) of the spacecraft.
Here is a geometric interpretation of this calculation. Define the one form
I2
A = dθ + dψ. (6.1.3)
I1 + I2
6.2 Reconstruction
This section presents a reconstruction method for the dynamics of a given Hamil-
tonian system from that of the reduced system. Let P be a symplectic manifold
on which a Lie group acts in a Hamiltonian manner and has a momentum map
J : P → g∗ . Assume that an integral curve cµ (t) of the reduced Hamiltonian vector
field XHµ on the reduced space Pµ is known. For z0 ∈ J−1 (µ), we search for the
corresponding integral curve c(t) = Ft (z0 ) of XH such that πµ (c(t)) = cµ (t), where
πµ : J−1 (µ) → Pµ is the projection.
To do this, choose a smooth curve d(t) in J−1 (µ) such that d(0) = z0 and
πµ (d(t)) = cµ (t). Write c(t) = Φg(t) (d(t)) for some curve g(t) in Gµ to be deter-
mined, where the group action is denoted g · z = Φg (z). First note that
XH (c(t)) = c0 (t)
= Td(t) Φg(t) (d0 (t))
¡ ¢
+ Td(t) Φg(t) · Tg(t) Lg(t)−1 (g 0 (t)) P (d(t)). (6.2.1)
This is an equation for g(t) written in terms of d(t) only. We solve it in two steps:
Step 1 Find ξ(t) ∈ gµ such that
Step 2 With ξ(t) determined, solve the following non-autonomous ordinary differ-
ential equation on Gµ :
Step 1 is typically of an algebraic nature; in coordinates, for matrix Lie groups, (6.2.3)
is a matrix equation. We show later how ξ(t) can be explicitly computed if a con-
nection is given on J−1 (µ) → Pµ . With g(t) determined, the desired integral curve
c(t) is given by c(t) = Φg(t) (d(t)). A similar construction works on P/G, even if the
G-action does not admit a momentum map.
Step 2 can be carried out explicitly when G is abelian. Here the connected
component of the identity of G is a cylinder Rp × Tk−p and the exponential map
exp(ξ1 , . . . , ξk ) = (ξ1 , . . . , ξp , ξp+1 (mod2π), . . . , ξk (mod2π)) is onto, so we can write
g(t) = exp η(t), where η(0) = 0. Therefore ξ(t) = Tg(t) Lg(t)−1 (g 0 (t)) = η 0 (t) since η 0
Rt
and η commute, i.e., η(t) = 0 ξ(s)ds. Thus the solution of (6.2.4) is
µZ t ¶
g(t) = exp ξ(s)ds . (6.2.5)
0
6 Geometric Phases 111
Suppose cµ is a closed curve with period T ; thus, both c and d reintersect the
same fiber. Write
d(T ) = ĝ · d(0) and c(T ) = h · c(0)
for ĝ, h ∈ Gµ . Note that
h = g(T )ĝ. (6.2.6)
The Lie group element ĝ (or the Lie algebra element log ĝ) is called the geometric
phase. It is the holonomy of the path cµ with respect to the connection A and
has the important property of being parametrization independent. The Lie group
element g(T ) (or log g(T )) is called the dynamic phase.
For compact or semi-simple G, the group Gµ is generically abelian. The com-
putation of g(T ) and ĝ are then relatively easy, as was indicated above.
Remarks
1. The result of Proposition 6.3.1 holds for any exact symplectic manifold. We
shall use this for the rigid body.
2. In the next section we shall show how to construct a connection on J−1 (µ) →
Pµ in general. For the rigid body it is easy to check that these two construc-
tions coincide! If G = Q and Gµ = S 1 , then the construction in Proposition
6.3.1 agrees with the pullback of the mechanical connection for G semisimple
and the metric defined by the Cartan-Killing form.
3. Choose P to be a complex Hilbert space with symplectic form
Ω(ϕ, ψ) = −Imhϕ, ψi
tµ i π
J−1 (µ) - J−1 (µ0 ) - J−1 (0) - T ∗Q - Q
µ µ
πµ /Gµ ρµ
? ? ?
(T ∗ Q)µ - T ∗ (Q/Gµ ) - Q/Gµ
¯
where FH : T ∗ Q → T Q is the fiber derivative of H, i.e., FH(αq )·βq = d
dt H(αq + tβq )¯t=0 .
Proof γ̃ · XH = γ · T π · XH = γ · FH. ¥
Proof Apply Proposition 6.4.1 and use the fact that FH(αq ) = a]q . (In coordinates,
∂H/∂pµ = g µν pν .) ¥
To see that the connection of Proposition 6.4.1 coincides with the one in §6.3
for the rigid body, use the fact that Q = G and αµ must lie in J−1 (µ), so αµ is the
right invariant one form equalling µ at g = e and that Gµ = S 1 .
and so
g(T )−1 µ = g(0)−1 µ i.e., g(T )g(0)−1
6 Geometric Phases 115
is a rotation about the axis µ. We want to compute the angle of this rotation.
To answer this question, let c(t) be the corresponding trajectory in J−1 (µ) ⊂ P .
Identify T ∗ SO(3) with SO(3) × R3 by left trivialization, so c(t) gets identified with
(g(t), Π(t)). Since the reduced trajectory Π(t) closes after time T , we recover the
fact that c(T ) = gc(0) where g = g(T )g(0)−1 ∈ Gµ . Thus, we can write
g = exp[(∆θ)ζ] (6.5.1)
where θµ is the pull back to J−1 (µ) of the canonical one-form θ on T ∗ SO(3). The
curvature of A as a two-form on the base Pµ , the sphere of radius kµk in R3 , is
given by
1 1
− ωµ = dS. (6.5.4)
kµk kµk2
The first terms in (6.5.2) represent the geometric phase, i.e., the holonomy of the
reduced trajectory with respect to this connection. The logarithm of the holonomy
(modulo 2π) is given as minus the integral over D of the curvature, i.e., it equals
Z
1 1
ωµ = − (area D) = −Λ(mod 2π). (6.5.5)
kµk D kµk2
The second terms in (6.5.2) represent the dynamic phase. By Theorem 6.3.1,
it is calculated in the following way. First one horizontally lifts the reduced closed
trajectory Π(t) to J−1 (µ) relative to the connection (6.5.3). This horizontal lift is
easily seen to be (identity, Π(t)) in the left trivalization of T ∗ SO(3) as SO(3) × R3 .
Second, we need to compute
Since in coordinates
∂ ∂
θµ = pi dq i and XH = pi + terms
∂q i ∂p
6 Geometric Phases 116
P
for pi = j g ij pj , where g ij is the inverse of the Riemannian metric gij on SO(3),
we get
(θµ · XH )(Π(t)) = pi pi = 2H(identity, Π(t)) = 2Hµ , (6.5.7)
where Hµ is the value of the energy on S 2 along the integral curve Π(t). Conse-
quently,
2Hµ
ξ(t) = ζ. (6.5.8)
kµk
Third, since ξ(t) is independent of t, the solution of the equation
µ ¶
2Hµ 2Hµ t
ġ = gξ = gζ is g(t) = exp ζ
kµk kµk
Formulas (6.5.5) and (6.5.9) prove (6.5.2). Note that (6.5.2) is independent of
which spherical cap one chooses amongst the two bounded by Π(t). Indeed, the
solid angles on the unit sphere defined by the two caps add to 4π, which does not
change Formula (6.5.2).
For other examples of the use of (6.5.2) see Chapter 7 and Levi [1993]. We also
note that Goodman and Robinson [1958] and Levi [1993] give an interesting link
between this result, the Poinsot description of rigid body motion and the Gauss-
Bonnet theorem.
P(Y )(q, p) = p · Y (q) for Y ∈ X(Q), and where Zt⊥ denotes the orthogonal pro-
jection of Zt to mt (Q). Even though the Lagrangian and Hamiltonian are time
dependent, we recall that the Euler-Lagrange equations for Lmt are still equivalent
to Hamilton’s equations for Hmt . These give the correct equations of motion for this
moving system. (An interesting example of this is fluid flow on the rotating earth,
where it is important to consider the fluid with the motion of the earth superposed,
rather than the motion relative to an observer. This point of view is developed in
Chern [1991].)
Let G be a Lie group that acts on Q. (For the bead in the hoop, this will be the
dynamics of H0 itself.) We assume for the general theory that H0 is G-invariant.
Assuming the “averaging principle” (cf. Arnold [1978], for example) we replace
Hmt by its G-average,
1 1
hHmt i(q, p) = kpk2 − hP(Zt )i − hkZt⊥ k2 i + V (q) + hU (q(t))i (6.6.5)
2 2
6 Geometric Phases 118
where h . i denotes the G-average. This principle can be hard to rigorously justify
in general. We will use it in a particularly simple example where we will see how
to check it directly. Furthermore, we shall discard the term 12 hkZt⊥ k2 i; we assume
it is small compared to the rest of the terms. Thus, define
H(q, p, t) = (6.6.6)
1
kpk2 − hP(Zt )i + V (q) + hU (q(t))i = H0 (q, p) − hP(Zt )i + hU (q(t))i.
2
The dynamics of H on the extended space T ∗ Q × M is given by the vector field
¡ ¢
(XH , Zt ) = XH0 − XhP(Zt )i + XhU ◦mt i , Zt . (6.6.7)
d
Rθ(t) q(s(t)) = Rθ(t) q0 (s(t))ṡ(t) + Rθ(t) [ω(t) × q(s(t))],
dt
we set
1
L(s, ṡ, t) = mkq0 (s)ṡ + ω × q(s)k2 . (6.7.1)
2
The Euler-Lagrange equations
d ∂L ∂L
=
dt ∂ ṡ ∂s
become
d
m[ṡ + q0 · (ω × q)] = m[ṡq00 · (ω × q) + ṡq0 · (ω × q0 ) + (ω × q) · (ω × q0 )]
dt
6 Geometric Phases 119
i.e.,
s̈ − (ω × q) · (ω × q0 ) + q0 · (ω̇ × q) = 0. (6.7.2)
The second and third terms in (6.7.2) are the centrifugal and Euler forces re-
spectively. We rewrite (6.7.2) as
where α is as in Figure 1.5.2 and q = kqk. From (6.7.3), Taylor’s formula with
remainder gives
Z t
© ª
s(t) = s0 +ṡ0 t+ (t−t0 ) ω(t0 )2 q · q0 (s(t0 )) − ω̇(t0 )q(s(t0 )) sin α(s(t0 )) dt0 . (6.7.4)
0
Now ω and ω̇ are assumed small with respect to the particle’s velocity, so by the
averaging theorem (see, e.g. Hale [1969]), the s-dependent quantities in (6.7.4) can
be replaced by their averages around the hoop:
Aside The essence of the averaging can be seen as follows. Suppose g(t) is a rapidly
varying function and f (t) is slowly varying on an interval [a, b]. Over one period of
g, say [α, β], we have
Z β Z β
f (t)g(t)dt ≈ f (t)ḡdt (6.7.6)
α α
where Z β
1
ḡ = g(t)dt
β−α α
If this is added up over [a, b] one still gets something small as the period of g tends
to zero. ¨
6 Geometric Phases 120
The first integral in (6.7.5) over s vanishes and the second is 2A where A is the
area enclosed by the hoop. Now integrate by parts:
Z T Z T
(T − t0 )ω̇(t0 )dt0 = −T ω(0) + ω(t0 )dt0 = −T ω(0) + 2π, (6.7.7)
0 0
assuming the hoop makes one complete revolution in time T . Substituting (6.7.7)
in (6.7.5) gives
2A 4πA
s(T ) ≈ s0 + ṡ0 T + ω0 T − . (6.7.8)
L L
The initial velocity of the bead relative to the hoop is ṡ0 , while that relative to the
inertial frame is (see (6.7.1)),
Now average (6.7.8) and (6.7.9) over the initial conditions to get
4πA
hs(T ) − s0 − v0 T i ≈ − (6.7.10)
L
which means that on average, the shift in position is by 4πA/L between the rotated
and nonrotated hoop. This extra length 4πA/L, (or in angular measure, 8π 2 A/L2 )
is the Hannay-Berry phase. Note that if ω0 = 0 (the situation assumed by Berry
[1985]) then averaging over initial conditions is not necessary. This process of av-
eraging over the initial conditions used in this example is related to work of Golin
and Marmi [1990] on procedures to measure the phase shift.
Chapter 7
121
7 Stabilization and Control 122
and
∂L
l= = Irotor (Ω + Ωr ) (7.1.3)
∂Ωr
and the equations of motion including internal torques (controls) u in the rotors are
ṁ = m × Ω = m × (Ilock − Irotor )−1 (m − l)
(7.1.4)
l˙ = u.
where k is a constant real matrix such that k does not have 1 as an eigenvalue and
such that the matrix J = (1 − k)−1 (Ilock − Irotor ) is symmetric, the system (7.1.4)
reduces to a Hamiltonian system on so(3)∗ with respect to the rigid body bracket
{F, G}(m) = −m · (∇F × ∇G).
Proof We have
l˙ = u = kṁ = k((Ilock Ω + Irotor Ωr ) × Ω). (7.2.2)
Therefore, the vector
km − l = p, (7.2.3)
is a constant of motion. Hence our feedback control system becomes
where ξ = −(1 − k)−1 p and 1 is the identity. Define the k-dependent “inertia
tensor”
J = (1 − k)−1 (Ilock − Irotor ). (7.2.5)
Then the equations become
ṁ = ∇C × ∇H (7.2.6)
where
1
C= kmk2 (7.2.7)
2
7 Stabilization and Control 123
and
1
H= (m − ξ) · J−1 (m − ξ). (7.2.8)
2
Clearly (7.2.6) are Hamiltonian on so(3)∗ with respect to the standard Lie-Poisson
structure (see §2.6). ¥
The conservation law (7.2.3), which is key to our methods, may be regarded as
a way to choose the control (7.2.1). This conservation law is equivalent to
i.e.,
(Irotor − kIrotor )Ωr = (kIlock − Irotor )Ω − p.
Therefore, if
kIlock = Irotor (7.2.9)
then one obtains as a special case, the dual spin case in which the acceleration
feedback is such that each rotor rotates at constant angular velocity relative to the
carrier (see Krishnaprasad [1985] and Sánchez de Alvarez [1986]). Also note that
the Hamiltonian in (7.2.8) can be indefinite.
If we set mb = m − ξ, (7.2.4) become
be the (body) locked inertia tensor. Then from (7.1.2) and (7.1.3), the natural
momenta are
mi = (Ji + Ii )Ωi = λi Ωi , i = 1, 2
m3 = λ3 Ω3 + J3 α̇
l3 = J3 (Ω3 + α̇). (7.3.2)
Choosing µ ¶
1 1
u = ka3 m1 m2 where a3 = − ,
λ2 λ1
and noting that l3 − km3 = p is a constant, we get:
Theorem 7.3.1 With this choice of u and p, the Equations (7.3.3) reduce to
µ ¶
(1 − k)m3 − p m3 m2
ṁ1 = m2 −
I3 λ2
µ ¶
(1 − k)m3 − p m3 m1
ṁ2 = −m1 +
I3 λ1
ṁ3 = a3 m1 m2 (7.3.4)
which are Hamiltonian on so(3)∗ with respect to the standard rigid body Lie-Poisson
bracket, with Hamiltonian
µ ¶
1 m21 m22 ((1 − k)m3 − p)2 1 p2
H= + + + (7.3.5)
2 λ1 λ2 (1 − k)I3 2 J3 (1 − k)
where p is a constant.
When k = 0, we get the equations for the rigid body carrying a free spinning
rotor — note that this case is not trivial! The rotor interacts in a nontrivial way
with the dynamics of the carrier body. We get the dual spin case for which J3 α̈ = 0,
7 Stabilization and Control 125
when kIlock = Irotor from (7.2.9), or, in this case, when k = J3 /λ3 . Notice that for
this k, p = (1 − k)α̇, a multiple of α̇.
Theorem 7.3.2 For p = 0 and k > 1 − (J3 /λ2 ), the system (7.3.4) is stabilized
about the middle axis, i.e., about the relative equilibrium (0, M, 0).
For p = 0, i.e., l3 = km3 , (0, M, 0) is a relative equilibrium and (7.3.7) are satisfied
if ϕ0 = −1/λ2 at equilibrium. In that case,
µ ¶ µ ¶
1 1 1−k 1
δ 2 (H + C) = (δm1 )2 − + (δm3 )2 − + ϕ00 (δm2 )2 .
λ1 λ2 I3 λ2
Now
1 1 I2 − I1
− = <0 for I1 > I2 > I3 .
λ1 λ2 λ1 λ2
For k satisfying the condition in the theorem,
1−k 1
− < 0,
I3 λ2
so if one chooses ϕ such that ϕ00 < 0 at equilibrium, then the second variation is
negative definite and hence stability holds. ¥
7 Stabilization and Control 126
For a geometric interpretation of the stabilization found here, see Holm and
Marsden [1991], along with other interesting facts about rigid rotors and pendula,
for example, a proof that the rigid body phase space is a union of simple pendulum
phase spaces.
Corresponding to the Hamiltonian (7.3.5) there is a Lagrangian found using the
inverse Legendre transformation
m1
Ω̃1 =
λ1
m2
Ω̃2 =
λ2
(1 − k)m3 − p (7.3.9)
Ω̃3 =
I3
(1 − k)m3 − p
p
α̃˙ = − + .
(1 − k)I3 (1 − k)I3
Note that Ω̃1 , Ω̃2 and Ω̃3 equal the angular velocities Ω1 , Ω2 , and Ω3 for the free
system, but that α̃˙ is not equal to α̇. In fact, we have the interesting velocity shift
α̇ km3
α̃˙ = − . (7.3.10)
(1 − k) (1 − k)J3
calculate the amount of this rotation following the method of Montgomery we used
in Chapter 6 to calculate the phase shift for the single rigid body.
As we have seen, the equations of motion for the rigid body-rotor system with
feedback law (7.2.1) are
Ȧ = AΩ̂,
where ˆ denotes the isomorphism between R3 and so(3) (see §4.4) and in the
presence of the feedback law (7.2.1),
is a periodic orbit of period T on the momentum sphere, kmb +ξk2 = kµk2 , enclosing
a solid angle Φsolid . Let Ωav denote the average value of the body angular velocity
over this period, E denote the constant value of the Hamiltonian, and kµk denote
the magnitude of the angular momentum vector. Then the body undergoes a net
rotation ∆θ about the spatial angular momentum vector µ given by
2ET T
∆θ = + (ξ · Ωav ) − Φsolid . (7.4.7)
kµk kµk
7 Stabilization and Control 128
Hence
A(T + t0 )A(t0 )−1 µ = µ.
Thus A(T + t0 )A(t0 )−1 ∈ Gµ , so
µ ¶
µ
A(T + t0 )A(t0 )−1 = exp ∆θ (7.4.11)
kµk
C1 = {z(t) | t0 ≤ t ≤ t0 + T }
can use general facts about holonomy (as in Marsden, Montgomery and Ratiu [1992]) that require
only a bounding surface in the base space, which is obvious in this case.
7 Stabilization and Control 129
Hence
Z Z Z T Z T
dq
pdq = p· dt = JΩ · Ωdt + ξ · Ωdt = 2ET + (ξ · Ωav )T (7.4.15)
C1 C1 dt 0 0
Finally we note that the map πµ from the set of points in phase space with angular
momentum µ to Pµ satisfies
ZZ ZZ
d(pdq) = dA = kµkΦsolid (7.4.17)
Σ πµ (Σ)
where dA is the area form on the two-sphere and πµ (Σ) is the spherical cap bounded
by the periodic orbit {mb (t) | t0 ≤ t ≤ t0 + T } ⊂ Pµ . Combining (7.4.15) – (7.4.17)
we get the result. ¥
Remarks
1. When ξ = 0, (7.4.7) reduces to the Goodman-Robinson-Montgomery formula
in Chapter 6.
2. This theorem may be viewed as a special case of a scenario that is useful for
other systems, such as rigid bodies with flexible appendages. As we saw in
Chapter 6, phases may be viewed as occurring in the reconstruction process,
which lifts the dynamics from Pµ to J−1 (µ). By the cotangent bundle re-
duction theorem, Pµ is a bundle over T ∗ S, where S = Q/G is shape space.
The fiber of this bundle is Oµ , the coadjoint orbit through µ. For a rigid
body with three internal rotors, S is the three torus T3 parametrized by the
rotor angles. Controlling them by a feedback or other control and using other
conserved quantities associated with the rotors as we have done, leaves one
with dynamics on the “rigid variables” Oµ , the momentum sphere in our case.
Then the problem reduces to that of lifting the dynamics on Oµ to J−1 (µ)
with the T ∗ S dynamics given. For G = SO(3) this “reduces” the problem to
that for geometric phases for the rigid body.
3. Some interesting control manouvers for “satellite parking” using these ideas
may be found in Sastry and Walsh [1993].
4. See Montgomery [1990], p. 569 for comments on the Chow-Ambrose-Singer
theorem in this context. ¨
Finally, following a suggestion of Krishnaprasad, we show that in the zero total
angular momentum case one can compensate for this drift using two rotors. The
total spatial angular momentum if one has only two rotors is of the form
where the scalars α̇1 and α̇2 represent the rotor velocities relative to the body frame.
The attitude matrix A satisfies
Ȧ = AΩ̂, (7.4.19)
as above. If µ = 0, then from (7.4.18) and (7.4.19) we get,
Ȧ = −A((I−1 −1
lock b1 )ˆα̇1 + (Ilock b2 )ˆα̇2 ). (7.4.20)
It is well known (see for instance Brockett [1973] or Crouch [1986]) that if we treat
the α̇i , i = 1, 2 as controls, then attitude controllability holds iff
(I−1
lock b1 )ˆ and (I−1
lock b2 )ˆ generate so(3),
I−1
lock b1 and I−1
lock b2 are linearly independent. (7.4.21)
Moreover, one can write the attitude matrix as a reverse path-ordered exponential
· Z t ¸
© −1 −1
ª
A(t) = A(0) · P̄ exp − (Ilock b1 )ˆα̇1 (σ) + (Ilock b2 )ˆα̇2 (σ) dσ . (7.4.22)
0
The right hand side of (7.4.22) depends only on the path traversed in the space
T2 of rotor angles (α1 , α2 ) and not on the history of velocities α̇i . Hence the
Formula (7.4.22) should be interpreted as a “geometric phase”. Furthermore, the
controllability condition can be interpreted as a curvature condition on the principal
connection on the bundle T2 × SO(3) → T2 defined by the so(3)-valued differential
1-form,
θ(α1 , α2 ) = −((I−1 −1
lock b1 )ˆdα1 + (Ilock b2 )ˆdα2 ). (7.4.23)
For further details on this geometric picture of multibody interaction see Krish-
naprasad [1989] and Wang and Krishnaprasad [1992]. In fact, our discussion of the
two coupled bodies in §6.1 can be viewed as an especially simple planar version of
what is required.
cf. Marsden, Ratiu and Weinstein [1984a,b]. For a charged particle, the resultant
system is in fact geodesic motion!
Recall that if B = −∇ × A is a given magnetic field on R3 , then with respect
to canonical variables (q, p), the Hamiltonian is
1 e
H(q, p) = kp − Ak2 . (7.5.1)
2m c
We can obtain (7.5.1) via the Legendre transform if we choose
1 e
L(q, q̇) = mkq̇k2 + A · q̇ (7.5.2)
2 c
for then
∂L e
p= = mq̇ + A (7.5.3)
∂ q̇ c
and
e 1 e
p · q̇ − L(q, q̇) = (mq̇ + A) · q̇ − mkq̇k2 − A · q̇
c 2 c
1 2
= mkq̇k
2
1 e
= kp − Ak2 = H(q, p). (7.5.4)
2m c
Thus, the Euler-Lagrange equations for (7.5.2) reproduce the equations for a particle
in a magnetic field. (If an electric field E = −∇ϕ is present as well, subtract eϕ
from L, treating eϕ as a potential energy.) Let the Kaluza-Klein configuration
space be
QK = R3 × S 1 (7.5.5)
with variables (q, θ) and consider the one-form
ω = A + dθ (7.5.6)
Since (7.5.7) is quadratic and positive definite in q̇ and θ̇, the Euler-Lagrange equa-
tions are the geodesic equations on R3 × S 1 for the metric for which LK is the
7 Stabilization and Control 132
kinetic energy. Since pθ is constant in time as can be seen from the Euler-Lagrange
equation for (θ, θ̇), we can define the charge e by setting
pθ = e/c; (7.5.9)
This space of (suitably differentiable) paths may be regarded as the space of hori-
zontal paths with a given holonomy g. Recall from §3.3 that horizontal means that
7 Stabilization and Control 133
this path is one along which the total angular momentum (i.e., the momentum map
of the curve in T ∗ Q corresponding to q̇) is zero.
The projection of the curves in hor (q0 , gq0 ) to shape space S = Q/G are closed,
as in Figure 7.6.1.
G-orbit
gq0
q0
The optimal control problem we wish to discuss is: Find a path c in hor(q0 , gq0 )
whose base curve s has minimal (or extremal ) length. The minimum (or extremum)
is taken over all paths s obtained from projections of paths in hor(q0 , gq0 ).
One may wish to use functions other than the length of c to extremize. For
example, the cat may wish to turn itself over by minimizing the amount of work
done.
The Wong Hamiltonian HW : T ∗ Q → R is defined by
1
HW (q, p) = khor(q, p)k2 (7.6.2)
2
where hor is the horizontal projection defined in §3.3. The function HW may also
be regarded as a function on T ∗ S — it is the kinetic energy associated to the metric
induced on S and as such, its integral curves are solutions of Wong’s equations;
note that (7.6.2) corresponds to (7.5.1) and A is replaced by the mechanical con-
nection. The corresponding Kaluza-Klein metric on Q is the given metric and its
Hamiltonian is the Wong Hamiltonian plus 12 kJk2 , assuming that g carries an Ad-
7 Stabilization and Control 134
invariant metric. The situation of the preceding section then becomes a special case
of this one.
Theorem 7.6.1 Consider a path c ∈ hor(q0 , gq0 ) with given holonomy g. Then c
is extremal iff the curve s satisfies Wong’s equations.
This result is closely related to, and may be deduced from, our work on La-
grangian reduction in §3.6. The point is that c ∈ hor(q0 , gq0 ) means ċ ∈ J−1 (0),
and what we want to do is relate a variational problem on Q but within J−1 (0) and
with Hamiltonian the Kaluza-Klein kinetic energy on T Q with a reduced variational
principle on S. This is exactly the set up to which the discussion in §3.6 applies.
We also note that s is the projection of a geodesic in Q in hor(q0 , gq0 ) obtained
from c by adjusting the phase, as in Chapter 6.
What is less obvious is how to use the methods of the calculus of variations
to show the existence of minimizing loops with a given holonomy and what their
smoothness properties are. For this purpose, the subject of sub-Riemannian geom-
etry (dealing with degenerate metrics) is relevant. Note that HW in (7.6.2) when
regarded as a Hamiltonian on T ∗ Q (rather that T ∗ S) is associated with a degener-
ate co-metric (a degenerate bilinear form in the momentum). These are sometimes
called Carnot-Caratheodory metrics. We refer to Montgomery [1990, 1991] and
Enos [1993] for further discussion.
Chapter 8
Discrete reduction
135
8 Discrete Reduction 136
symplectic involution of phase space. The invariant subsystem defined by the fixed
point set of this involution is just the planar pendulum.
The double spherical pendulum has the same continuous and discrete symmetry
group as the spherical pendulum. However, the double spherical pendulum has
another nontrivial symmetry involving a spacetime symmetry, much as the rotating
liquid drop and the water molecule. Namely, we are at a fixed point of a symmetry
when the two pendula are rotating in a steadily rotating vertical plane — the
symmetry is reflection in this plane. This symmetry is, in fact, the source of the
subblocking property of the second variation of the amended potential that we
observed in Chapter 5. In the S 1 reduced space, a steadily rotating plane is a
stationary plane and the discrete symmetry just becomes reflection in that plane.
A related example is the double whirling mass system. This consists of two
masses connected to each other and to two fixed supports by springs, but with
no gravity. Assume the masses and springs are identical. Then there are two Z2
symmetries now, corresponding to reflection in a (steadily rotating) vertical plane as
with the double spherical pendulum, and to swapping the two masses in a horizontal
plane.
The classical water molecule has a discrete symplectic symmetry group Z2 as
well as the continuous symmetry group SO(3). The discrete symmetry is closely
related to the symmetry of exchanging the two hydrogen atoms.
For these examples, there are some basic links to be made with the block diago-
nalization work from Chapter 5. In particular, we show how the discrete symmetry
can be used to refine the block structure of the second variation of the augmented
Hamiltonian and of the symplectic form. Recall that the block diagonalization
method provides coordinates in which the second variation of the amended poten-
tial on the reduced configuration space is a block diagonal matrix, with the group
variables separated from the internal variables; the group part corresponds to a bi-
linear form computed first by Arnold for purposes of examples whose configuration
space is a group. The internal part corresponds to the shape space variables, that
are on Q/G the quotient of configuration space by the continuous symmetry group.
For the water molecule, we find that the discrete symmetry provides a further block-
ing of this second variation, by splitting the internal tangent space naturally into
symmetric modes and nonsymmetric ones.
In the dynamics of coupled rigid bodies one has interesting symmetry breaking
bifurcations of relative equilibria and of relative periodic orbits. For the latter,
discrete spacetime symmetries are important. We refer to Montaldi, Stewart and
Roberts [1988], Oh, Sreenath, Krishnaprasad and Marsden [1989] and to Patrick
[1989, 1990] for further details. In the optimal control problem of the falling cat
(Montgomery [1990] and references therein), the problem is modeled as the dynam-
ics of two identical coupled rigid bodies and the fixed point set of the involution
that swaps the bodies describes the “no-twist” condition of Kane and Shur [1969];
this plays an essential role in the problem and the dynamics is integrable on this
set.
The second class of examples motivating the study of discrete symmetries are in-
tegrable systems, including those of Bobenko, Reyman and Semenov-Tian-Shansky
[1989]. This (spectacular) reference shows in particular how reduction and dual
8 Discrete Reduction 137
pairs, together with the theory of R-matrices, can be used to understand the inte-
grability of a rich class of systems, including the celebrated Kowalewski top. They
also obtain all of the attendant algebraic geometry in this context. It is clear from
their work that discrete symmetries, and specifically those obtained from Cartan in-
volutions, play a crucial role. We refer the reader to the paper of Harnad, Hurtubise
and Marsden [1991] for the details of this topic.
gP - P
P
σP σP
? ?
P - P
[σG (g)]P
i.e.,
J ◦ σP = σg∗ ◦ J + (cocycle). (8.1.4)
We shall assume, in addition, that the cocycle is zero. In other words, we make:
J ◦ σP = σg∗ ◦ J. (8.1.5)
J
P - g∗
σP σ g∗
? ?
P - g∗
J
Remarks
1. If G is connected, then (8.1.5) (or (8.1.4)) implies (8.1.1).
Proof Put on P a metric that is Σ-invariant and exponentiate the linear fixed
point set (Tz P )Σ of the tangent map Tz σ : Tz σ : Tz P → Tz P for each z ∈ PΣ to
give a local chart for PΣ . Since (Tz P )Σ is invariant under the associated complex
structure, it is symplectic, so PΣ is symplectic. ¥
We are not necessarily advocating that one should always perform this discrete
reduction procedure, or that it contains in some sense equivalent information to the
original system, as is the case with continuous reduction. We do claim that the
procedure is an interesting way to identify invariant subsystems, and to generate
new ones that are important in their own right. For example, reducing the spherical
pendulum by discrete reflection in a plane gives the half-dimensional simple planar
8 Discrete Reduction 140
g = gΣ ⊕ (g∗Σ )0 (8.1.10)
where g∗Σ = Fix(Σ, g∗ ) is the fixed point set for the action of Σ on g∗ and where the
superscript zero denotes the annihilator in g. Similarly, the dual splits as
hµ, ξi = hµ̄, ξi = 0.
It will be useful later to note that this argument also proves a more general
result as follows:
W = WΣ ⊕ (WΣ∗ )0 (8.1.13)
where WΣ is the fixed point set for the action of Σ on W and where (WΣ∗ )0 is the
annihilator of the fixed point set for the dual action of Σ on W ∗ . Similarly, the dual
splits as
W ∗ = WΣ∗ ⊕ (WΣ )0 . (8.1.14)
8 Discrete Reduction 141
which is the splitting of the symplectic vector space W × W ∗ into the symplectic
subspace WΣ × WΣ∗ and its symplectic orthogonal complement.
The splitting (8.1.10) gives a natural identification
(gΣ )∗ ∼
= g∗Σ .
Note that the splittings (8.1.10) and (8.1.11) do not involve the choice of a
metric; that is, the splittings are natural.
The following block diagonalization into isotypic components will be used
to prove the subblocking theorem in the energy-momentum method.
if w ∈ WΣ and u ∈ (WΣ∗ ) .
0
Proof Consider the element α ∈ W ∗ defined by α(v) = B(w, v). It suffices to show
that α is fixed by the action of Σ on W ∗ since it annihilates WΣ∗ . To see this, note
that invariance of B means B(σw, σv) = B(w, v), i.e., B(w, σv) = B(σ −1 w, v).
Then
(σ ∗ α)(v) = α(σv) = B(w, σv) = B(σ −1 w, v) = B(w, v) = α(v)
since w ∈ WΣ so w is fixed by σ. This shows that α ∈ WΣ∗ and so α(u) = B(w, u) = 0
for u in the annihilator. The proof that B(u, w) = 0 is similar. ¥
Proposition 8.1.6 The inclusion J(PΣ ) ⊂ g∗Σ holds and the momentum map for
the GΣ action on PΣ equals J restricted to PΣ , and takes values in g∗Σ ∼
= (gΣ )∗ .
so J(z) ∈ Fix(Σ, g∗ ). The second result follows since the momentum map for GΣ
acting on PΣ is J composed with the projection of g∗ to g∗Σ . ¥
WΣ = J−1 (g∗Σ ) ⊂ P
See Sjamaar [1990], Lemma 3.2.3 for a related result. In particular, this result
implies that
Corollary 8.1.8 The set of ν ∈ g∗Σ for which there is a Σ-fixed point in J−1 (ν), is
open.
gQ - Q
Q
σQ σQ
? ?
Q - Q
[σG (g)]Q
Proof Equation (8.1.1) follows from (8.2.1) and the fact that cotangent lift pre-
serves compositions. Differentiation of (8.2.1) with respect to g at the identity in
the direction ξ ∈ g gives
Evaluating this at q, pairing the result with a covector p ∈ Tσ∗Q (q) Q, and using the
formula for the momentum map of a cotangent lift gives
i.e.,
∗
hT σQ · p, ξQ (q)i = hp, (σg · ξ)Q (σQ (q))i
i.e.,
∗
hJ(T σQ · p), ξi = hJ(p), σg · ξi = hσg−1
∗ J(p), ξi
i.e.,
J ◦ σP−1 = σg−1
∗ ◦ J
so (8.1.5) holds. ¥
PΣ = T ∗ (QΣ ) (8.2.3)
with the canonical cotangent structure. Moreover, the action of GΣ is the cotangent
lift of its action on QΣ and its momentum map is the standard one for cotangent
lifts.
For (8.2.3) to make sense, we need to know how Tq∗ QΣ is identified with a
subspace of Tq∗ Q. To see this, consider the action of Σ on Tq∗ Q by
¡ ¢∗
σ · αq = [T σQ (q)]−1 · αq
Lemma 8.2.3
Tq Q = Tq QΣ ⊕ [Fix(Σ, Tq∗ Q)]0 . (8.2.4)
hσ, vi = hᾱ, vi = 0
∼ Fix(Σ, T ∗ Q).
identifying Tq∗ QΣ = q
Returning to the proof of Proposition 8.2.2, by definition of PΣ , αq ∈ PΣ iff
σP (αq ) = αq for all σ ∈ Σ; i.e.,
¡ ¢∗
[T σQ (q)]−1 αq = αq
i.e., αq ∈ Fix(Σ, Tq∗ Q) = T ∗ QΣ . Thus, PΣ = T ∗ (QΣ ). The symplectic structure
on PΣ is obtained from T ∗ Q by restriction. We need to show that the inclusion
map iΣ : T ∗ QΣ → T ∗ Q defined by (8.2.6) is a symplectic embedding. In fact, it
is readily checked that iΣ pulls the canonical one form on T ∗ Q back to that on
T ∗ QΣ because the projection πΣ : T ∗ Q → P satisfies π ◦ iΣ = πΣ . The rest of the
proposition now follows. ¥
8.3 Examples
Example 1 Let the phase space be P = C2 with the symplectic structure given by
Ω((z1 , w1 ), (z2 , w2 )) = 2Im(z1 z̄2 + w1 w̄2 ).
Let G = U (2) act simply by matrix multiplication and let Σ = Z2 act on P by
σ(z, w) = (w,· z), where
¸ σ denotes the nontrivial element of Σ. Identifying σ with
0 1
the matrix , the actions are compatible (Assumption 1) if we let Σ act on
1 0
G by conjugation: σ · A = σAσ −1 .
Note that the actions of G and Σ on P are symplectic. The Lie algebra u(2) of
U (2) is identified with the skew hermetian matrices, which we write as
· ¸
iu β
ξ=
−β̄ iv
where u and v are real and β ∈ C. The equivariant momentum map for the U (2)
action is given by
µ · ¸ · ¸¶
1 z z
hJ(z, w), ξi = Ω ξ , = u|z|2 + v|w|2 + 2Im(β z̄w).
2 w w
Assumption 2 reads
hJ(σ(z, w)), ξi = hJ(z, w), σξσ −1 i,
which is easily checked.
In this example, GΣ consists of elements of G that commute · with
¸ σ; that is,
a b
GΣ is the subgroup of U (2) consisting of matrices of the form . Note that
b a
PΣ = {(z, z) ∈ P | z ∈ C} and that the GΣ action on PΣ is (z, z) 7→ ((a + b)z, (a +
¸ the Lie algebra of GΣ with gΣ consisting of matrices of the form
b)z). ·We identify
u y
ξ=i where u and y are real. Note that
y u
consistent with Proposition 8.1.6. In §8.5, we shall see an alternative way of viewing
this example in terms of dual pairs using SU (2) and S 1 separately, rather than as
U (2). This will also bring out links with the one-to-one resonance more clearly. ¨
Notice that interchanging the two pendula does not produce a discrete symmetry
because the presence of gravity leads to a noninvariance of the kinetic and potential
energies under particle interchange. ¨
An example with a slightly richer discrete symmetry group is the system of two
whirling masses.
q2
m1
m2
q1
Let G = S 1 act by rotations of both masses about the (vertical) z-axis. The
momentum map is the angular momentum about the vertical axis. We again let
Σ = Z2 , but now there are two cases:
Case 1 Here let σ act by reflection in a vertical plane, say the yz-plane, acting
simultaneously on both factors in Q. As above, this action produces an associated
invariant subsystem on its fixed point space, the problem of two planar masses
connected by springs. This is the case even if the masses of the two particles are
different and if the springs are different. As in Example 2, the group Σ s G is O(2).
As with the double spherical pendulum we can consider the discrete symmetry of
reflection in a moving vertical plane. This is important again in the study of relative
8 Discrete Reduction 147
equilibria.
Case 2 Here we let the masses of the two particles be the same, the two outside
springs be identical and all three springs be reflection invariant. Now let Σ act on
Q by simultaneous reflection in the horizontal plane (and translation of the vector
base points to the opposite support). In this case the actions of G and S on Q
(and hence on P ) commute, so we can let Σ act trivially on G and Assumption 1Q
holds. Here, Σ s G = Z2 × SO(2), a direct product. Now GΣ = S 1 and PΣ = T ∗ R3
corresponds to identical motions of the two masses. Thus, in this case discrete
reduction gives the motion of a symmetric two mass system on which S 1 still acts.
This richer collection of discrete symmetries then naturally leads to a rich block
structure in the second variation δ 2 Vµ and a simpler normal form for the linearized
equations at a relative equilibrium. See Zombro and Holmes [1993] for more infor-
mation. ¨
Example 5 The rigid body has three copies of Z2 as symmetry group. Here
Q = SO(3) and we choose a plane P corresponding to one of the principle moments
of inertia and let σ be the reflection in this plane. Let Σ = Z2 be generated by σ.
Let P = T ∗ Q, and G = SO(3). Let G act on Q by left multiplication (as is usual
for the rigid body), so the momentum map is the spatial angular momentum of the
body. The discrete symmetries in this example yield the well known symmetry of
the phase portrait of the rigid body as viewed on the two sphere. Let Σ act on Q
by conjugation; for A ∈ Q, σ · A = σAσ −1 . If we let Σ act on G the same way, then
Remark This example illustrates two cautions that are needed when dealing with
discrete symplectic symmetries. First, while there is a well defined action of Σ on
P/G, this action need not be Poisson. For example the induced action on so(3)∗ in
Example 5 is anti-Poisson. Also, for σ ∈ Σ, σ maps J−1 (µ) to J−1 (σg∗ µ), so there
need not be a well defined action on Pµ , let alone a symplectic one. (For the rigid
body one gets an anti-symplectic map of S 2 to itself.) ¨
product is the direct product Z2 × SO(3), so the full rotation group still acts on
PΣ =. Now, QΣ represents configurations that have a plane of symmetry, so in this
case, discrete reduction of this system correponds to restriction to symmetric bodies.
In view of the block diagonalization results outlined in the next section, it is
of interest to not to actually carry out the discrete reduction, but rather to use
the discrete symmetry to study stability. This seems to explain, in part, why, in
their calculations, further subblocking, in some cases, even to diagonal matrices,
was found. ¨
where the Arnold form means δ 2 Vµ computed on the coadjoint orbit tangent space,
and the Smale form means δ 2 Vµ computed on Q/G. Perhaps even more interesting
is the structure of the linearized dynamics near a relative equilibrium. That is, both
the augmented Hamiltonian Hξ = H − hJ, ξi and the symplectic structure can be
simultaneously brought into the following normal form:
Arnold form 0 0
δ 2 Hξ =
0 Smale form 0
0 0 Kinetic Energy > 0
and
coadjoint orbit form C 0
Symplectic Form =
−C magnetic (coriolis) I
0 −I 0
8 Discrete Reduction 149
where the columns represent the coadjoint orbit variables (G/Gµ ), the shape
variables (Q/G) and the shape momenta respectively. The term C is an inter-
action term between the group variables and the shape variables. The magnetic
term is the curvature of the µ-component of the mechanical connection, as we de-
scribed earlier.
Suppose that we have a compact discrete group Σ acting by isometries on Q,
and preserving the potential, so we are in the setting of the preceeding section.
This action lifts to the cotangent bundle, as we have seen. The resulting fixed point
space is the cotangent bundle of the fixed point space QΣ . This fixed point space
represents the Σ-symmetric configurations.
Of more concern here is the fact that the action also gives an action on the
quotient space, or shape space Q/G. We can split the tangent space to Q/G at a
configuration corresponding to the relative equilibrium according to Lemma 8.1.4
and can apply Lemma 8.1.5 to the Smale form. Here, one must check that the
amended potential is actually invariant under Σ. In general, this need not be the
case, since the discrete group need not leave the value of the momentum µ invariant.
However, there are two important cases for which this is verified. The first is for
SO(3) with Z2 acting by conjugation, as in the rigid body example, it maps µ to
its negative, so in this case, from the formula Vµ (q) = V (q) − µI(q)−1 µ we see that
indeed Vµ is invarint. The second case, which is relevant for the water molecule, is
when Σ acts trivially on G. Then Σ leaves µ invariant, and so Vµ is again invariant.
The blocks in the Smale form are given by Lemma 8.1.4: they are the Σ-
symmetric variations, and their complement chosen according to that lemma as
the annihilator of the symmetric dual variations. Lemma 8.1.5 can also be applied
to the symplectic form, showing that it subblocks as well.
These remarks apply to the classical rotating water molecule as follows. (We let
the reader work out the case of the two whirling masses in a similar, but simpler
vein.) The discrete symmetry Σ is Z2 acting, roughly speaking, by interchanging
the two hydrogen atoms. We shall describe the action more precisely in a moment.
The action of Z2 on SO(3) will be the trivial one, so the semi-direct product is the
direct product and so GΣ is again SO(3). The discrete reduction procedure then
yields the system consisting of symmetric molecules (with the two hydrogen atoms
moving in a symmetric way), but still with symmetry group the rotation group,
which makes good sense physically. The subblocking property mentioned above
shows that the Smale form, which in this example, is a 3 × 3 symmetric matrix,
becomes block diagonal, with a 2 × 2 subblock corresponding to the symmetric
variations, and a singleton block corresponding to the nonsymmetric variations.
Explicitly, the nontrivial element σ of Σ acts on the configuration space as
8 Discrete Reduction 150
follows:
µ ¶
r
σ(r, s) = r, s − 2(r · s) .
k r k2
This symmetry interchanges the role of the two masses as in Figure 8.4.1.
m m
M
s'
r r
s
m σ
m
M
If one prefers, σ is the transformation that rotates the system by 180 degrees
about a line perpendicular to r in the plane of the system, followed by an interchange
of the particles. One now checks the claims made earlier; for example, that the
Assumption 1Q is satisfied, and that the fixed point set of the discrete symmetry
group does correspond to the symmetric configurations, etc. Notice that for the
ozone molecule, one can apply this operation to any two pairs of atoms, and this
leads to interesting consequences (such as, for the planar molecule, the “breathing
mode” decouples dynamically from the other modes, and in space, it couples very
simply via the internal rigid coupling discussed in Chapter 5.
We conclude that δ 2 Vµ takes the following form:
a b 0
δ 2 Vµ = b d 0 (8.4.1)
0 0 g
· ¸
a b
where the block corresponds to the symmetric variations and [g] to the
b d
symmetry breaking variations.
a drop in space. In either case, the corresponding momentum map is the angular
momentum of the drop. However, there is a discrete symmetry as well, that can be
set up in a way somewhat similar to that of the rigid body, described in §8.3. For
the planar case, one chooses a line say L in the reference configuration of the fluid
and calls the reflection in L by σL . The configuration space for the drop is Q =
all volume preserving embeddings of a reference configuration to the plane. Then
σL acts on an element η ∈ Q by conjugation, on the right by σL and on the left
by the reflection in the line that is the line through the images η(P1 ), η(P2 ), where
P1 , P2 are the intersection points of the line with the boundary of the reference
configuration. This action is then lifted to the cotangent space in the standard way.
Here the fixed point set is the set of symmetric drops, and the action of the discrete
group on the group S 1 is trivial, so the fixed point group GΣ is again S 1 and it still
acts on the symmetric drops, as it should. One can view the planar water molecule
as a finite dimensional analogue of this model.
These discrete symmetries are clearly available for other problems as well, such
as the classical problem of rotating gravitational fluid masses. In this case, one has
a richer symmetry structure coming from finite subgroups of the orthogonal group
O(3). From the general discussion above, the subblocking property that the discrete
symmetry gives, should be useful for the study of stability and bifurcation of these
systems.
gP or hP -P
P
σP σP
? ?
P -P
[σG (g)]P or [σH (h)]P
JG JH - h∗
g∗ P
σ g∗ σP σ h∗
? ? ?
g∗ P - h∗
JG JH
is a Poisson embedding, where g∗ has the Lie-Poisson structure (the minus structure
if the actions are left, and the plus structure if the actions are on the right) and
where P/H has the quotient Poisson structure. Thus, we have the commutative
diagram in Figure 8.5.3.
JGΣ JHΣ - h∗
g∗Σ PΣ Σ
I
@ @
@ @
J̄GΣ @ proj proj @ J̄HΣ
@ @
R
PΣ /HΣ PΣ /GΣ
Figure 8.5.3: Reduction of dual pairs.
For the following proposition, note that OzGΣ ⊂ OzG ∩ PΣ and similarly for H.
Proposition 8.5.1 Let (P, G, H) be a dual pair, acted on, as above, by Σ. Suppose
that for each z ∈ PΣ ,
Let us consider briefly the condition (8.5.3). As we noted, we always have the
inclusion OzHΣ ⊂ OzH ∩ PΣ . Let z 0 = hz belong to OzH ∩ PΣ . As σP (z 0 ) = z 0 , and
σP (z) = z, for all σ ∈ Σ, one obtains:
for some s ∈ StabH (z). This is frequently possible: one simple case is when the
action of H is free. When Σ is generated by an involution σ, (i.e., Σ = Z2 ), the
preceding equation will be solvable if the map
is surjective.
JG : J−1 ∗ −1 ∗ ∗
G (gΣ ) → JG (gΣ )/H ⊂ gΣ (8.5.4)
and an HΣ bundle in PΣ
JGΣ : J−1 ∗ −1 ∗ ∗
GΣ : (gΣ ) → JGΣ (gΣ )/HΣ ⊂ gΣ (8.5.5)
giving an inclusion:
J−1 ∗ −1 ∗
GΣ (gΣ )/HΣ ⊂ JG (gΣ )/H.
Over J−1 ∗
GΣ (gΣ )/HΣ , (8.5.5) is a subbundle of (8.5.4). This, in fact, occurs over an
open set:
Proposition 8.5.2 Under the hypotheses given above, J−1 ∗
GΣ (gΣ )/HΣ is an open sub-
set of J−1 ∗
G (gΣ )/H.
components of the Hopf map; see Cushman and Rod [1982] and Marsden [1987].
Again, we choose the discrete group Σ to be Z2 given by the map σ(z, w) = (w, z).
Here the hypotheses above are directly verified, and one finds, using the sort of
computations in Example 1 of §8.3, that the discrete reduction of this dual pair is the
trivial dual pair given by PΣ = C2 with GΣ = S 1 acting by complex multiplication
and with HΣ = S 1 acting the same way. In other words, the discrete reduction of the
(completely integrable) system defined by the one-to-one resonance consisting of two
identical harmonic oscillators (integrable via the group SU (2)), is the (completely
integrable) system consisting of a single harmonic oscillator, trivially integrable
using the group S 1 . ¨
Mechanical Integrators
where ϕt is the flow of X, and what are the error estimates? There are some
general theorems guaranteeing this, with an important hypothesis being stability ;
155
9 Mechanical Integrators 156
i.e., (Ft/n )n (z) must remain close to z for small t and all n = 1, 2, . . .. We refer to
Chorin, Hughes, Marsden and McCracken [1978] and Abraham, Marsden and Ratiu
[1988] for details. For example, the Lie-Trotter formula
is an instance of this.
An algorithm Fτ is
1. a symplectic-integrator if each Fτ is symplectic,
2. an energy-integrator if H ◦ Fτ = H (where X = XH ),
3. a momentum-integrator if J ◦ Fτ = J (where J is the momentum map for
a G-action).
If Fτ has one or more of these properties, we call it a mechanical integrator .
Notice that if an integrator has one of these three properties, then so does any
iterate of it.
There are two ways that have been employed to find mechanical integrators.
For example, one can search amongst existing algorithms and find ones with special
algebraic properties that make them symplectic or energy-preserving. Second, one
can attempt to design mechanical integrators from scratch. Here are some simple
examples:
Example 1 A first order explicit symplectic scheme in the plane is given by the
map (q0 , p0 ) 7→ (q, p) defined by
q = q0 + (∆t)p0
p = p0 − (∆t)V 0 (q0 + (∆t)p0 ). (9.1.4)
This map is a first order approximation to the flow of Hamilton’s equations for the
Hamiltonian H = (p2 /2) + V (q). Here, one can verify by direct calculation that
this scheme is in fact a symplectic map. ¨
Example 2 An implicit symplectic scheme in the plane for the same Hamiltonian
as in Example 1 is
q = q0 + (∆t)(p + p0 )/2
p = p0 − (∆t)V 0 ((q + q0 )/2). ¨ (9.1.5)
9 Mechanical Integrators 157
Other examples are sometimes based on special observations. The next example
shows that the second order accurate mid-point rule is symplectic (Feng [1987]).
This algorithm is also useful in developing almost Poisson integrators (Austin, Kr-
ishnaprasad and Wang [1991]).
Example 3 In a symplectic vector space the following mid point rule is symplec-
tic:
µ k ¶
z k+1 − z k z + z k+1
= XH . (9.1.6)
∆t 2
Notice that for small ∆t the map defined implicitly by this equation is well
defined by the implicit function theorem. To show it is symplectic, we use the
fact that the Cayley transform S of an infinitesimally symplectic linear map A,
namely
−1
S = (1 − λA) (1 + λA) (9.1.7)
is symplectic if 1 − λA is invertible for some real λ. To apply this to our situation,
rewrite the algorithm (9.1.6) as
µ ¶
z + Fτ (z)
Fτ (z) − z − τ XH = 0. (9.1.8)
2
³ ´
Letting S = DFτ (z) and A = DXH z+F2τ (z) we get, by differentiation, S − 1 −
1
2 τ A(1 + S)= 0; i.e., (9.1.7) holds with λ = τ /2. Thus, (9.1.6) defines a symplectic
scheme. ¨
H(qn+1 , pn ) − H(qn , pn )
pn+1 = pn − ∆t µ, (9.1.11)
µT (qn+1 − qn )
where
∂H
λ= (αqn+1 + (1 − α)qn , βpn+1 + (1 − β)pn ), (9.1.12)
∂p
∂H
µ= (γqn+1 + (1 − γ)qn , δpn+1 + (1 − δ)pn ), (9.1.13)
∂q
and where α, β, γ, δ are arbitrarily chosen constants in [0, 1].
9 Mechanical Integrators 158
Example 5 Let us apply the Lie-Trotter or time splitting idea to the simple pen-
dulum. The equations are
µ ¶ µ ¶ µ ¶
d ϕ p 0
= + .
dt p 0 − sin ϕ
Gτ (ϕ, p) = (ϕ + τ p, p)
and
Hτ (ϕ, p) = (ϕ, p − τ sin ϕ)
each of which is symplectic. Thus, the composition Fτ = Gτ ◦ Hτ , namely,
is a first order symplectic scheme for the simple pendulum. It is closely related to
the standard map. The orbits of Fτ need not preserve energy and they may be
chaotic, whereas the trajectories of the simple pendulum are of course not chaotic.
¨
We refer to the cited references, and to Ruth [1983], Feng [1986], Sanz-Serna
[1988] and references therein for more examples of this type, including symplectic
Runge-Kutta schemes.
makes them unsuitable for problems in satellite dynamics for example, where the
exact conservation of a momentum integral is essential to the control mechanism.
One can get angular momentum drift in energy-conservative simulations of, for
example, rods that are free to vibrate and rotate, and presumably in the water
molecule. To control such drifts and attain the high levels of computational accu-
racy demanded by automated control mechanisms, one would be forced to reduce
computational step sizes to such an extent that the numerical simulation would
be prohibitively inefficient. Similarly, if one attempts to use a standard energy-
conservative algorithm to simulate both the rotational and vibrational modes of a
freely moving flexible rod, the algorithm may predict that the rotational motion
will come to a virtual halt after only a few cycles! For a documented simulation
of a problem with momentum conservation, see Simo and Wong [1989]. One can
readily imagine that in the process of enforcing energy conservation one could upset
conservation of angular momentum.
What seems rather surprising is that all of the implicit members of the New-
mark family, perhaps the most widely used time-stepping algorithms in nonlinear
structural dynamics, are not designed to conserve energy and also fail to conserve
momentum. Among the explicit members, only the central difference method pre-
serves momentum. The analytical proof of these results is in Simo, Tarnow and
Wong [1991].
As we shall demonstrate in §9.4, the problem of how to generate a numerical
algorithm that exactly conserves momentum (or, more generally, all momentum-
like integrals of motion) is fairly easy to resolve. Since momentum integrals in
Hamiltonian systems are associated with invariance of the system under the ac-
tion of symmetry groups, one might guess that to derive momentum-conservative
algorithms, one constrains the algorithm to obey, in some sense, the same group
invariance as the actual dynamics.
In traditional integrators, much attention has been paid to energy conservation
properties, some, as we have noted to momentum conservation, and even less to
conserving the symplectic or Poisson structure. However, one can imagine that it
is also quite important.
The three notions of symplectic, energy, and momentum integrators are con-
nected in interesting ways. For example, as we shall show below, a result of Ge
(see Ge and Marsden [1988]) is that under fairly weak additional assumptions, a
G-equivariant symplectic integrator is also momentum preserving. For example, a
symplectic integrator of this type applied to a free rigid body motion would exactly
preserve the angular momentum vector in space.
Given the importance of conserving integrals of motion and the important role
played by the Hamiltonian structure in the reduction procedure for a system with
symmetry, one might hope to find an algorithm that combines all of the desirable
properties: conservation of energy, conservation of momenta (and other independent
integrals), and conservation of the symplectic structure. However, one cannot do
all three of these things at once unless one relaxes one or more of the conditions in
the following sense:
tum preserving and if the dynamics is nonintegrable on the reduced space (in the
sense spelled out in the proof ) then the algorithm already gives the exact solution
of the original dynamics problem up to a time reparametrization.
Proof Suppose F∆t is our symplectic algorithm of the type discussed above, and
consider the application of the algorithm to the reduced phase space. We assume
that the Hamiltonian H is the only integral of motion of the reduced dynamics
(i.e., all other integrals of the system have been found and taken out in the re-
duction process in the sense that any other conserved quantity (in a suitable class)
is functionally dependent on H. Since F∆t is symplectic it is the ∆t-time map of
some time-dependent Hamiltonian function K. Now assume that the symplectic
map F∆t also conserves H for all values of ∆t. Thus {H, K} = 0 = {K, H}. The
latter equation implies that K is functionally dependent on H since the flow of H
(the “true dynamics”) had no other integrals of motion. The functional dependence
of K on H in turn implies that their Hamiltonian vector fields are parallel, so the
flow of K (the approximate solution) and the flow of H (the exact solution) must
lie along identical curves in the reduced phases space; thus the flows are equivalent
up to time reparametrization. ¥
Thus, it is unlikely one can find an algorithm that simultaneously conserves the
symplectic structure, the momentum map, and the Hamiltonian. It is tempting
(but probably wrong) to guess from this that one can monitor accuracy by keeping
track of all three. Non-symplectic algorithms that conserve both momentum and
energy have been studied by Simo and Wong [1989] and Austin, Krishnaprasad and
Wang [1991]. We study the basic method in §9.5.
Recall also one of the basic facts about Hamilton-Jacobi theory, namely that
the flow of Hamilton’s equations is the canonical transformation generated by the
solution of the Hamilton-Jacobi equation
µ ¶
∂S ∂S
+ H q, =0 (9.3.2)
∂t ∂q
where S(q0 , q, t)|q=q0 generates the identity. (This may require singular behavior in
t=0
t; for example, consider S = 2t 1
(q − q0 )2 .) The strategy is to find an approximate
solution of the Hamilton-Jacobi equation for small time ∆t and to use this to obtain
the algorithm using (9.3.1).
There are several other versions of the algorithm that one can also treat. For
example, if specific coordinates are chosen on the phase space, one can use a generat-
ing function of the form S(q i , p0i , t). In this case one can get the simple formula for
a first order algorithm given in Example 1 in §9.1 by using S = p0i q i − ∆tH(q i , p0i ),
which is easy to implement, and for Hamiltonians of the form kinetic plus poten-
tial, leads to the stated explicit symplectic algorithm. As explained in Ge [1991a],
one can use other types of generating functions. For example, using the Poincaré
generating function, one recovers the mid-point scheme.
ϕ◦g =g◦ϕ
with respect to the group element in the direction of ξ at the identity of the group.
This results in ϕ∗ ξP = ξP or, by definition of the momentum map,
ϕ∗ XhJ,ξi = XhJ,ξi .
9 Mechanical Integrators 162
XhJ,ξi◦ϕ = XhJ,ξi .
Since two Hamiltonian vector fields are equal if and only if the Hamiltonians differ
by a constant,
hJ, ξi ◦ ϕ − hJ, ξi = constant.
We need hJ, ξi ◦ ϕ = hJ, ξi for the value of J to be preserved by the map ϕ, so we
need to establish sufficient conditions under which the constant will vanish.
An example of such conditions are:
iv P is a symplectic manifold endowed with an exact symplectic
form ω = −dθ;
v ϕ∗ θ − θ, which is closed, is also G− exact; that is, if
ϕ∗ θ − θ = dS, then S : P → R is a G-invariant generating
function for the map ϕ;
vi hJ, ξi = iξp θ.
Then,
The first term in this last expression is hJ, ξi again, and the final term vanishes
by equivariance of S. Thus, the desired conservation condition, hJ, ξi ◦ ϕ = hJ, ξi,
follows from these assumptions.
Assuming that the original system is given in terms of canonical coordinates on
a cotangent bundle P = T ∗ Q, we have ω = −dθ0 , where θ0 = pi dq i is the canonical
one-form on the cotangent bundle. If the symmetry group G acts by cotangent lifts,
then vi follows automatically. Condition v is equivalent to (9.1.1) if we regard S as
a function of q0 and q.
This argument applies to all “types” of generating functions, but when applied
to ones of the form S(q0 , q, t) we get:
This may also be seen directly by differentiating the invariance condition as-
sumed on S with respect to g ∈ G in the direction of ξ ∈ g and utilizing the
definitions of ϕS , J and ξQ . The following is also true: If G acts on Q freely, and
a given canonical transformation ϕ conserves J, then its generating function S can
be defined on an open set of Q × Q which is invariant under the action of G, and
S is invariant under the action of G. This is proved in Ge [1991a].
Note that if H is invariant under the action of G, then the corresponding solution
of the Hamilton-Jacobi equation is G invariant as well. This follows from the short
time uniqueness of the generating function of the type assumed for the flow of the
9 Mechanical Integrators 163
J−1(µ)
Gµ-orbits
constant energy
surfaces
projection ∆t
reduced space
Figure 9.5.1: An energy preserving algorithm is designed on the reduced space and
is then lifted to the level set of the momentum map by specifying phase information.
9 Mechanical Integrators 164
iii To uniquely determine a map from within the above class, we must determine
how points in one Gµ -orbit are mapped to points in another orbit. There is still
an ambiguity about how phase space points drift in the Gµ -orbit directions.
This drift is closely connected with geometric phases (Chapter 6)! In fact by
discretizing the geometric phase formula for the system under consideration
we can specify the shift along each Gµ -orbit associated with each iteration of
the map.
The papers of Simo and Wong [1989] and Krishnaprasad and Austin [1990]
provide systematic methods for making the choices required in Steps ii and iii.
The general construction given above is, in fact, precisely the approach advocated
in Simo, Tarnow and Wong [1991]. There it is shown that projection from the
level set of constant angular momentum onto the surface of constant energy can be
performed implicitly or explicitly leading to predictor/corrector type of algorithms.
From a numerical analysis standpoint, the nice thing is that the cost involved in the
actual construction of the projection reduces to that of a line search (i.e., basically
for free). The algorithm advocated in Simo and Wong [1989] is special in the sense
that the projection is not needed for Q = SO(3): the discrete flow is shown to lie in
the intersection of the level set of angular momentum and the surface of constant
energy. This algorithm is singularity–free and integrates the dynamics exactly up to
a time reparametrization, consistent with the restrictions on mechanical integrators
given in §9.2. Extensions of these schemes to elasticity, rods and shells suitable for
large-scale calculation and amenable to parallelization are given in Simo, Fox and
Rifai [1991], and Simo and Doblare [1991].
We shall give the special case of the rigid body for illustration, taking G = SO(3).
Since the momentum map is preserved, one also gets an induced algorithm on the
coadjoint orbits, or in the more general cases, on the symplectic reduced spaces.
The proofs can be routinely provided by tracing through the definitions.
We begin with the reduced Hamilton-Jacobi equation itself. Thus, let H be a
G-invariant function on T ∗ G and let HL be the corresponding left reduced Hamil-
tonian on g∗ . (To be specific, we deal with left actions — of course there are similar
statements for right reduced Hamiltonians.) If S is invariant, there is a unique func-
tion SL such that S(g, g0 ) = SL (g −1 g0 ). (One gets a slightly different representation
for S by writing g0−1 g in place of g −1 g0 .)
Proposition 9.6.1 The left reduced Hamilton-Jacobi equation is the following equa-
tion for a function SL : G → R:
∂SL
+ HL (−T Rg∗ · dSL (g)) = 0 (9.6.1)
∂t
which we call the Lie-Poisson Hamilton-Jacobi equation. The Lie-Poisson
flow of the Hamiltonian HL is generated by the solution SL of (9.6.1) in the sense
that the flow is given by the Poisson transformation of g∗ : Π0 7→ Π defined as
follows: Define g ∈ G by solving the equation
Π0 = −T L∗g · dg SL (9.6.2)
Here Ad denotes the adjoint action and so the action in (9.6.3) is the coad-
joint action. Note that (9.6.3) and (9.6.2) give Π = −T Rg∗ · dSL (g). Note also
that (9.6.2) and (9.6.3) are the analogues of Equation (9.1.1) and that (9.6.1) is the
analogue of (9.1.2). Thus, one can obtain a Lie-Poisson integrator by approximately
solving (9.6.1) and then using (9.6.2) and (9.6.3) to generate the algorithm. This
algorithm (9.6.3) manifestly preserves the coadjoint orbits (the symplectic leaves
in this case). As in the canonical case, one can generate algorithms of arbitrary
accuracy this way.
There may be conditions necessary on Π0 for the solvability of Equation (9.6.2).
This is noted in the example of the rigid body below.
For the case of the rigid body, these equations read as follows. First, Equa-
tion (9.6.1) reads
∂SL
+ HL (−∇SL (A) · AT ) = 0 (9.6.4)
∂t
i.e., Ã !
∂SL ∂SL k
+ HL − i A j (9.6.5)
∂t ∂A j
metric on the rotation group. This metric corresponds to the standard Euclidean
metric on the Lie algebra, when identified with Euclidean 3-space. This identifica-
tion maps the Lie algebra bracket to the cross product. The expression ∇SL (A)·AT
is a skew symmetric matrix, i.e., it lies in the Lie algebra so(3), so it makes sense
for HL to be evaluated on it. As usual, one has to be careful how the gradient
(derivative) ∇SL is computed, since there is a constraint AAT = I involved. If
it is computed naively in the coordinates of the ambient space of 3 × 3 matrices,
then one interprets the expression ∇SL (A) · AT using naive partial derivatives and
skew symmetrizing the result; this projects the gradient to the constraint space, so
produces the gradient of the constrained function.
Equation (9.6.5) thus is the Hamilton-Jacobi equation for the dynamics of a
rigid body written directly in body representation. The flow of the Hamiltonian is
generated by SL in the following way: It is the transformation of initial conditions
at time t = 0 to a general t determined by first solving the equation
for the matrix A and then setting Π = AΠ0 , where Π̂ = [Πij ] is the skew matrix
associated to the vector Π in the usual way: Π̂ · v = Π × v. (Again, the right
hand side of (9.6.6) is to be skew symmetrized if the derivative was taken in the
naive way with the constraint ignored.) We have written the result in terms of the
body angular momentum vector Π; one can rewrite it in terms of the body angular
velocity vector by using the relation Π = Iω, where I is the moment of inertia
tensor. In coordinates, Equation (9.6.6) reads as follows:
∂SL
(Π0 )ki = −Aji . (9.6.7)
∂Ajk
Finally, we note that similar equations also apply for fluids and plasmas, since
they are also Lie-Poisson systems (but with right reduction). Also, the methods here
clearly will generalize to the situation for reduction of any cotangent bundle; this
generality is needed for example, for the case of free boundary fluids — see Lewis,
Marsden, Montgomery and Ratiu [1986]. One of the ideas of current interest is to
use mechanical integrators on vortex algorithms. For example, one can use it on
point vortices in the plane (Pullen and Saffman [1991]) or on vortex dipoles in three
space (cf. Rouhi [1988] and Buttke [1991]) both of which live on finite dimensional
coadjoint orbits for the Euler equations (Marsden and Weinstein [1983]).
A general way to construct first order algorithms valid in the Lie-Poisson setting
(as well as its analogues in the symplectic and Poisson context) is as follows. Let
H : g∗ → R be a given Hamiltonian function and let S0 be a function that generates
a Poisson transformation ϕ0 : g∗ → g∗ and let
For small ∆t, (9.6.8) generates a Poisson transformation, say ϕ∆t : g∗ → g∗ . Then
we have:
9 Mechanical Integrators 167
and so Proposition 9.6.2 gives the following specific Lie-Poisson algorithm for rigid
body dynamics: It is the scheme Πk 7→ Πk+1 defined by
· ¸
1 1 ˆ ˆ
k
Π̂ = {AI(A − A ) + I(A − A )A }∆t + (A − A )
T T T T
(9.7.2)
2 4
· ¸
1 1 ˆ ˆ − AT )}∆t + (A − AT )
Π̂k+1 = {I(A − AT )A + AT I(A (9.7.3)
2 4
where, as before, the first equation is to be solved for the rotation matrix A and
the result substituted into the second. Letting AS = 12 [A − AT ] denote the skew
part of the matrix A, we can rewrite the scheme as
Π̂k ˆ S )S ∆t
= AS + (AIA (9.7.4)
Π̂ k+1 ˆ S )S ∆t.
= AS + (AT IA (9.7.5)
Of course, one can write A = exp(ξ) and solve for ξ and express the whole
algorithm in terms of g and g∗ alone. This type of algorithm does not keep track
(except implicitly) of the rigid body phase. One can imagine combining the ideas
here with those in Chapter 6 to do that.
We know from the general theory that this scheme will automatically be Poisson
and will, in particular, preserve the coadjoint orbits, i.e., the total angular momen-
tum surfaces kΠk2 = constant. Of course, using other choices of S0 , it is possible
to generate other algorithms for the rigid body, but the choice S0 (A) = trace(A) is
particularly simple. We point out the interesting feature that the function (9.6.10)
for the case of ideal Euler fluid flow is the function that assigns to a fluid placement
field ϕ (an element of the diffeomorphism group of the containing region) the trace
of the linear operator ω 7→ ϕ∗ ω, on vorticity fields ω, which measures the vortex
distortion due to the nonrigidity of the flow. (See Marsden and Weinstein [1983]
for further information.)
9 Mechanical Integrators 169
where the endpoints are free; here S is a function of q, q0 , and t. Condition (9.8.2)
gives (suppressing indices and assuming S is constant when q = q0 , so it does not
contribute to the lower limit),
Z µ ¶
∂L ∂L
δq + δ q̇ dt − δ(S(q, q0 , t)) = 0
∂q ∂ q̇
i.e.,
Z µ ¶
∂L d ∂L ∂S ∂S
− δqdt + pδq − p0 δq0 − δq − δq0 = 0. (9.8.3)
∂q dt ∂ q̇ ∂q ∂q0
Proposition 9.8.1 The variational principle (9.8.2) with free endpoints is equiva-
lent to the Euler-Lagrange equations plus the conditions (9.1.1) defining a symplectic
map.
This principle works the same way starting using the phase space variational
principle Z ½ ¾
d
δ pi q̇ − H(q, p) − S(q, q0 , t) dt.
i
(9.8.4)
dt
Thus we see that the symplectic nature of the flow can be built into the vari-
ational principle. We now imagine S is determined by requiring it be an (approx-
imate) solution of the Hamilton-Jacobi equation, which will make the symplectic
map (9.1.1) compatible with the Euler-Lagrange equations.
It is reasonable to suggest that numerical methods be based on these variational
principles using finite element ideas.
An interesting way to introduce the Hamilton-Jacobi equation is to observe that
the equations of motion are consistent with (9.3.1) iff S satisfies the Hamilton-Jacobi
equation (with H possibly modified by a constant).
Chapter 10
Hamiltonian Bifurcation
170
10 Hamiltonian Bifurcation 171
try groups, the energy momentum method can be used to help put the linearized
system into normal form and from this one can calculate the stability transitions,
and the eigenvalue evolution.
To begin with a simple example, consider a ball moving with no friction in a
hoop constrained to rotate with angular velocity ω (Figure 10.1.1).
R
g = acceleration
y due to gravity
θ
x
.
θ
increasing ω .
θ
already complicated, even when only discrete symmetry of the underlying system is
considered. A similar bifurcation occurs in the dynamics of rotating planar coupled
rigid bodies, as was analyzed by Oh, Sreenath, Krishnaprasad and Marsden [1989]
and in the bifurcations of two coupled rigid bodies, as in Patrick [1991]. It also
occurs in our double spherical pendulum, as we shall see.
Another basic bifurcation in the Hamiltonian context is the one-to-one reso-
nance, or the Hamiltonian Hopf bifurcation. In this case, two eigenvalues of the
system linearized at a given equilibrium (or relative equilibrium) come together on
the imaginary axis — that is, two frequencies in the system become equal. This
is the Hamiltonian analogue of the well known Poincaré-Hopf bifurcation for non-
hamiltonian vector fields.
A classical example of the Hamiltonian Hopf bifurcation is the fast slow transi-
tion in an upright spinning heavy Lagrange top. When the top makes the transition
from a fast top to a slow top √
2 M g`I1
ω↓
I3
an instability sets in of this sort. For a study of the possible bifurcations in the
dynamics of a heavy top see Lewis, Ratiu, Simo and Marsden [1992].
10 Hamiltonian Bifurcation 173
R
eθ
ϕ y
θ
x
eϕ
er
Figure 10.1.3
The position of the ball in space is specified by the angles θ and ϕ, as shown.
We can take ϕ = ωt, so the position of the ball becomes determined by θ alone. Let
the orthonormal frame along the coordinate directions eθ , eϕ and er be as shown.
The forces acting on the particle are:
1. Friction, proportional to the velocity of the ball relative to the hoop: −νRθ̇eθ ,
where ν ≥ 0 is a constant.
2. Gravity: −mgk.
3. Constraint forces in the directions er and eϕ to keep the ball in the hoop.
10 Hamiltonian Bifurcation 174
The equations of motion are derived from Newton’s second law F = ma. To get
them, we calculate the acceleration a. Relative to the xyz coordinate system, we
have
x = R sin θ cos ϕ
y = R sin θ sin ϕ . (10.1.1)
z = −R cos θ
Calculating the second derivatives using ϕ = ωt and the chain rule in (10.1.1) gives
The eϕ and er components of F = ma tell us what the constraint forces must be;
the equation of motion comes from the eθ component:
F · eθ = ma · eθ . (10.1.5)
while from (10.1.1) – (10.1.4), the right side of (10.1.5) is, after some algebra,
In fact our system can be viewed just as well as a whirling planar pendu-
lum.
We notice that (10.1.8), when expressed in terms of the angular momentum is
formally the same as the reduction of the spherical pendulum. See Equation
(3.5.9). It is interesting that (10.1.8) is nonsingular near θ = 0, whereas (3.5.9)
has a singularity.
10 Hamiltonian Bifurcation 175
so we choose
1 mR2 ω 2
L=T −V = mR2 θ̇2 + sin2 θ + mgR cos θ. (10.1.13)
2 2
The Euler-Lagrange equations
d ∂L ∂L
− =T
dt ∂ θ̇ ∂θ
then give (10.1.8). The Legendre transform gives p = mR2 θ̇ and the Hamilto-
nian (10.1.10).
Consider equilibrium solutions; i.e., solutions satisfying θ̇ = 0, and θ̈ =
0; (10.1.8) gives
Rω 2 sin θ cos θ = g sin θ. (10.1.14)
Certainly θ = 0, and θ = π solve (10.1.14) corresponding to the particle at the
bottom or top of the hoop. If θ 6= 0 or π, (10.1.14) becomes
Rω 2 cos θ = g (10.1.15)
is the critical rotation rate. (Notice that ωc is the frequency of linearized oscil-
lations for the simple pendulum i.e., for Rθ̈ + gθ = 0.) For ω < ωc there are only
two solutions θ = 0, π, while for ω > ωc there are four solutions,
³ g ´
θ = 0, π, ± cos−1 . (10.1.17)
Rω 2
10 Hamiltonian Bifurcation 176
figure 10.1.4
Figure 10.1.5: Phase portrait for the ball in the off-center hoop.
Near θ = 0, the potential function has changed from the symmetric bifurca-
tion in Figure 10.1.2 to the unsymmetric one in Figure 10.1.6. This is the cusp
catastrophe.
Aside For the symmetric ball in the hoop, imagine that the hoop is subject to
small periodic pulses; say ω = ω0 + ² cos(ηt), or perhaps the hoop is coupled to
another oscillator. Using the Melnikov method described below, it is reasonable to
expect that the resulting time periodic system has horseshoe chaos if ² and ν are
both small, but ²/ν is large enough.
10 Hamiltonian Bifurcation 177
(a) ε = 0
(b) ε > 0
Figure 10.1.6: Potential for the centered and off-center ball in the hoop as the
angular velocity increases.
(a)
i i i
(b)
−i −i −i
Figure 10.2.1: The splitting case; (a) for the steady state bifurcation, (b) for the
one-to-one resonance.
Symmetry can influence the above generic behavior (see Golubitsky, Sheaffer
and Stewart [1988]). Indeed, for certain symmetry groups (such as the circle group
S 1 ), the passing of eigenvalues may be generic in a one parameter family. Galin
[1982] shows that without symmetry, the generic situation is splitting and one would
require three parameters to see passing.
In the steady state case, the dichotomy in eigenvalue movements can be un-
derstood using definiteness properties of the Hamiltonian, i.e., by energetics, or
group theoretically . The group theoretic approach was discussed in Golubitsky
and Stewart [1987]. The energetics method has a complex history, going back to
at least Cartan [1922] — a modern reference is Oh [1987]. For example, one can
sometimes say that a S 1 symmetry forces the eigenvalues to stay on the imaginary
axis, or one can say that eigenvalues must split because the second variation changes
from positive definite to indefinite with just one eigenvalue crossing through zero.
These methods have also been used extensively in the study of solitons; see Pego
and Weinstein [1992] and references therein.
Here are some simple observations about the method of energetics. Start with a
linear (or linearized) Hamiltonian system and write it in the form XH = JB where
B is a symmetric
· matrix
¸ whose associated quadratic form is the Hamiltonian H and
0 1
where J = is the Poisson tensor. If H is positive (or negative) definite,
−1 0
then the spectrum of XH is on the imaginary axis — this follows because the
system is necessarily stable, and the spectrum of XH is symmetric under reflection
in the two axis of the complex plane, so the spectrum must be confined to the
10 Hamiltonian Bifurcation 179
(a)
i i i
(b)
−i −i −i
Figure 10.2.2: The passing case; (a) for the steady state bifurcation, (b) for the
one-to-one resonance.
imaginary axis. If H has an odd number of eigenvalues that are negative, then
taking the determinant, we see that since J has determinant one, XH has a negative
determinant. Thus in this case, it must have at least one pair of real eigenvalues, and
therefore be linearly unstable. If H has the standard form of kinetic plus potential
energy, and the kinetic term is positive definite, and the potential energy has at
least one negative eigenvalue, then again XH has real eigenvalues, and so is linearly
unstable. However, for gyroscopic systems, such as those that arise in reduction,
the situation is not so simple, and deeper insight is needed.
An example relevant to the above remarks concerns bifurcations of relative equi-
libria of a rotating liquid drop: the system consists of the two dimensional Euler
equations for an ideal fluid with a free boundary. A rigidly rotating circular drop
is an equilibrium solution (in the spatially reduced equations). The energy-Casimir
method shows stability, provided
Ω2
< 1. (10.2.1)
12R3 τ
Here Ω is the angular velocity of the circular drop, R is its radius and τ is the
surface tension, a constant. As Ω increases and (10.2.1) is violated, the stability
of the circular solution is lost and is picked up by elliptical-like solutions with
Z2 × Z2 symmetry. (The bifurcation is actually subcritical relative to Ω and is
supercritical relative to the angular momentum.) This is proved in Lewis, Marsden
and Ratiu [1987] and Lewis [1989], where other references may also be found (see
Figure 10.2.3).
10 Hamiltonian Bifurcation 180
increasing
R angular
momentum
During this transition, the eigenvalues stay on the imaginary axis — they are
forced to because of the symmetry. This is consistent with the energetics approach
since an even number of eigenvalues of the second variation cross through zero,
namely two. The situation for the ball in the hoop and the liquid drop examples is
presented in Figure 10.2.4.
y y
C C
x x
Figure 10.2.4: Eigenvalue evolution for the liquid drop and the ball in the hoop.
2. If U1 and U2 are complex of the same type, then generically the eigenvalues
pass and H is indefinite.
10 Hamiltonian Bifurcation 182
3. In the case of complex duals the eigenvalues can generically pass or split and
these possibilities correspond precisely to definiteness and indefiniteness of the
quadratic form induced on U1 ⊕ U2 by the linearization.
then H0 = (|z1 |2 + |z2 |2 )/2 and so H0 is invariant under the action of SU (2), the
complex 2 × 2 unitary matrices of determinant one. The corresponding conserved
quantities are
W1 = 2(q1 q2 + p1 p2 )
W2 = 2(q2 p1 − q1 p2 )
W3 = q12 + p21 − q22 − p22 (10.3.3)
which comprise the components of a (momentum) map for the action of SU (2) on
C2 :
J : C2 ∼
= R4 → su(2)∗ ∼
= R3 . (10.3.4)
From the relation 4H02 = W12 + W22 + W32 , one finds that J restricted to S 3 gives a
map
j : S3 → S2. (10.3.5)
The fibers j −1 (point) are circles and the dynamics of H0 moves along these circles.
The map j is the Hopf fibration which describes S 3 as a topologically nontrivial
circle bundle over S 2 . (The reduction of R4 by the action by the flow of H0 is S 2 .)
10 Hamiltonian Bifurcation 183
Apparently the role of the Hopf fibration in mechanics was already known to Reeb
around 1950.
With P = C2 , we have a basic example of a dual pair determined by the
above momentum maps (see Example 1 of §8.3 and the Example in §8.5). See
Figure 10.3.1.
C2
@
J @ H0
@
R
@
su(2)∗ R
Figure 10.3.1: The dual pair of the harmonic oscillator.
Normal form theory allows one (up to finite order) to change coordinates by
averaging over the S 1 action determined by the flow of H0 . In this way one gets a
new Hamiltonian H from H that is S 1 invariant. Since we have a dual pair, such
an H can be written as
H = h ◦ J. (10.3.6)
In other words, a function invariant on one side collectivizes on the other . In
particular, since J is a Poisson map, the dynamics of H can be reduced to dynamics
on su(2)∗ ∼= R3 with the rigid body Lie-Poisson structure. This proves one of the
results of Cushman and Rod [1982]. This procedure can be of help in locating
interesting bifurcations, as in David, Holm and Tratnik [1990].
The Hopf fibration occurs in a number of other interesting mechanical systems.
One of these is the free rigid body. When doing reduction for the rigid body, we
construct the reduced space J−1 (µ)/Gµ = J−1 (µ)/S 1 , which is the sphere S 2 . Also,
J−1 (µ) is topologically the same as the rotation group SO(3), which in turn is the
same as S 3 /Z2 . Thus, the reduction map is a map of SO(3) to S 2 . Such a map is
given explicitly by taking an orthogonal matrix A and mapping it to the vector on
the sphere given by Ak, where k is the unit vector along the z-axis. This map that
does the projection is in fact a restriction of a momentum map and, when composed
with the map of S 3 ∼ = SU (2) to SO(3), is just the Hopf fibration again. Thus, not
only does the Hopf fibration occur in the one-to-one resonance, it occurs in the rigid
body in a natural way as the reduction map from material to body representation!
M q̈ + S q̇ + Λq = 0 (10.4.1)
10 Hamiltonian Bifurcation 184
for certain 3 × 3 matrices M, S and Λ. These equations have the Hamiltonian form
Ḟ = {F, H} where p = M q̇,
1 1
H= pM −1 P + qΛq (10.4.2)
2 2
and
∂F ∂K ∂K ∂F ∂F ∂K
{F, K} = − i − Sij (10.4.3)
∂q i ∂pi ∂q ∂pi ∂pi ∂pj
i.e., ¾
q̇ = M −1 p
(10.4.4)
ṗ = −S q̇ − Λq = −SM −1 p − Λq.
The following is a standard useful observation:
Proposition 10.4.1 The eigenvalues λ of the linear system (10.4.4) are given by
the roots of
det[λ2 M + λS + Λ] = 0. (10.4.5)
Proof Let (u, v) be an eigenvector of (10.4.4) with eigenvalue λ; then
M −1 v = λu and − SM −1 v − Λu = λv
For the double spherical pendulum, we call the eigenvalue γ (since λ is already
used for something else in this example) and note that the polynomial
We shall give a proof of this result below and in the course of the proof, we shall
clarify what it means to have transverse separatrices.
To apply this method to Equation (10.6.1), let x = (φ, φ̇) so (10.6.1) becomes
· ¸ · ¸ · ¸
d φ φ̇ 0
= +² . (10.6.4)
dt φ̇ − sin φ cos ωt
10 Hamiltonian Bifurcation 187
Changing variables and using the fact that sech is even and sin is odd, we get
µZ ∞ ¶
M (t0 ) = ∓2 sech t cos ωtdt cos(ωt0 ).
−∞
γ0 (t) = (x0 , t)
is a periodic orbit for (10.6.2), whose stable and unstable manifolds W0s (γ0 ) and
W0u (γ0 ) are coincident. For ² > 0 the hyperbolic closed orbit γ0 perturbs to a
10 Hamiltonian Bifurcation 188
nearby hyperbolic closed orbit which has stable and unstable manifolds W²s (γ² ) and
W²u (γ² ). If W²s (γ² ) and W²u (γ² ) intersect transversally, we again say that (10.6.2)
admits horseshoes. These two definitions of admitting horseshoes are equivalent.
We use the energy function H0 to measure the first order movement of W²s (γ² ) at
x̄(0) at time t0 as ² is varied. Note that points of x̄(t) are regular points for H0 since
H0 is constant on x̄(t) and x̄(0) is not a fixed point. Thus, the values of H0 can be
used to measure the distance from the homoclinic orbit. If (xs² (t, t0 ), t) is the curve
on W²s (γ² ) that is an integral curve of the suspended system in xt-space, and has an
initial condition xs (t0 , t0 ) which is the perturbation of W0s (γ0 ) ∩ { the plane t = t0 }
in the normal direction to the homoclinic orbit, then H0 (xs² (t0 , t0 )) measures the
normal distance. But
Z T
d
H0 (x² (T, t0 )) − H0 (x² (t0 , t0 )) =
s s
H0 (xs² (t, t0 ))dt, (10.6.7)
t0 dt
and so
Z T
H0 (xs² (T, t0 )) − H0 (xs² (t0 , t0 )) = {H0 , H0 + ²H1 }(xs² (t, t0 ), t)dt. (10.6.8)
t0
Similarly,
Z t0
H0 (xu² (t0 , t0 )) − H0 (xu² (−S, t0 )) = ² {H0 , H1 }(x̄(t − t0 , t))dt + O(²2 ). (10.6.10)
−S
Since xs² (T, t0 ) → γ² , a periodic orbit for the perturbed system as T → +∞, we can
choose T and S such that H0 (xs² (T, t0 )) − H0 (xu² (−S, t0 )) → 0 as T, S → ∞. Thus,
adding (10.6.9) and (10.6.10), and letting T, S → ∞, we get
Z ∞
H0 (xu² (t0 , t0 )) − H0 (xs² (t0 , t0 )) = ² {H0 , H1 }(x̄(t − t0 , t))dt + O(²2 ). (10.6.11)
−∞
It follows that if M (t0 ) has a simple zero at time t0 , then xu² (t0 , t0 ) and xs² (t0 , t0 )
intersect transversally near the point x̄(0) at time t0 . (Since dH0 → 0 exponen-
tially at the saddle points, the integrals involved in this criterion are automatically
convergent.) ¥
comprehensive (the reader can track down many additional references by consulting
Wiggins [1988] and the references cited below).
If in (10.6.2), X0 only is a Hamiltonian vector field, the same conclusion holds
if (10.6.3) is replaced by
Z ∞
M (t0 ) = (X0 × X1 )(x̄(t − t0 ), t)dt, (10.6.12)
−∞
where X0 × X1 is the (scalar) cross product for planar vector fields. In fact, X0
need not even be Hamiltonian if an area expansion factor is inserted. For exam-
ple, (10.6.12) applies to the forced damped Duffing equation
and if ² is small.
A version of the Poincaré-Melnikov theorem applicable to PDE’s (due to Holmes
and Marsden [1981]). One basically still uses formula (10.6.12) where X0 × X1 is
replaced by the symplectic pairing between X0 and X1 . However, there are two
new difficulties in addition to standard technical analytic problems that arise with
PDE’s. The first is that there is a serious problem with resonances. These can be
dealt with using the aid of damping — the undamped case would need an infinite
dimensional version of Arnold diffusion. Secondly, the problem is not reducible to
two dimensions; the horseshoe involves all the modes. Indeed, the higher modes do
seem to be involved in the physical buckling processes for the beam model discussed
next.
A PDE model for a buckled forced beam is
µZ 1 ¶
0000 0
ẅ + w + Γw − K [w ] dz w00 = ²(f cos ωt − δ ẇ)
0 2
(10.6.21)
0
where w(z, t), 0 ≤ z ≤ 1 describes the deflection of the beam, ˙ = ∂/∂t,0 = ∂/∂z and
Γ, K, . . . are physical constants. For this case, the theory shows that if
1. π 2 < Γ < 4ρ3 (first mode is buckled)
2. j 2 π 2 (j 2 π 2 − Γ) 6= ω 2 , j = 2, 3, . . . (resonance condition)
µ ¶
f π(Γ − π 2 ) ω
3. > √ cosh √ (transversal zeros for M (t0 ))
δ 2ω K 2 Γ − ω2
4. δ > 0
and ² is small, then (10.6.21) has horseshoes. Experiments (see Moon and Holmes
[1979]) showing chaos in a forced buckled beam provided the motivation which lead
to the study of (10.6.21).
This kind of result can also be used for a study of chaos in a van der Waal fluid
(Slemrod and Marsden [1983]) and there is a growing literature using these methods
for forced and damped soliton equations. For example, in the damped, forced Sine-
Gordon equation one has chaotic transitions between breathers and kink-antikink
pairs and in the Benjamin-Ono equation one can have chaotic transitions between
solutions with different numbers of poles.
For Hamiltonian systems with two degrees of freedom, Holmes and Marsden
[1982a] show how the Melnikov method may be used to prove the existence of
horseshoes on energy surfaces in nearly integrable systems. The class of systems
studied have a Hamiltonian of the form
where (θ, I) are action-angle coordinates for the oscillator G; G(0) = 0, G0 > 0. It
is assumed that F has a homoclinic orbit x̄(t) = (q̄(t), p̄(t)) and that
Z ∞
M (t0 ) = {F, H1 }dt, (10.6.23)
−∞
10 Hamiltonian Bifurcation 191
the integral taken along (x̄(t − t0 ), Ωt, I), has simple zeros. Then (10.6.22) has
horseshoes on energy surfaces near the surface corresponding to the homoclinic
orbit and small I; the horseshoes are taken relative to a Poincaré map strobed to
the oscillator G. The paper Holmes and Marsden [1982a] also studies the effect of
positive and negative damping. These results are related to those for forced one
degree of freedom systems since one can often reduce a two degrees of freedom
Hamiltonian system to a one degree of freedom forced system.
For some systems in which the variables do not split as in (10.6.22), such as
a nearly symmetric heavy top, one can exploit symmetry of the system and make
use of reduction ideas. The general theory for this is given in Holmes and Marsden
[1983] and was applied to show the existence of horseshoes in the nearly symmetric
heavy top; see also some closely related results of Ziglin [1981].
The Poincaré-Melnikov theory has been used by Ziglin [1980b] in vortex dynam-
ics, for example to give a proof of the non-integrability of the restricted four vortex
problem. There have also been recent applications to the dynamics of general rel-
ativity showing the existence of horseshoes in Bianchi IX models. See Oh et al.
[1989] for applications to the dynamics of coupled planar rigid bodies and to David,
Holm and Tratnik [1990] to the study of polarization laser dynamics.
Arnold [1964] extended the Poincaré-Melnikov theory to systems with several
degrees of freedom. In this case the transverse homoclinic manifolds are based
on KAM tori and allow the possibility of chaotic drift from one torus to another.
This drift, now known as Arnold diffusion is a basic ingredient in the study of
chaos in Hamiltonian systems (see for instance, Chirikov [1979] and Lichtenberg and
Lieberman [1983] and references therein). Instead of a single Melnikov function, one
now has a Melnikov vector given schematically by
R∞
→ −∞
{H0 , H1 }dt
M= R (10.6.24)
∞
−∞ k
{I , H1 }dt
where Ik are integrals for the unperturbed (completely integrable) system and where
→ →
M depends on t0 and on angles conjugate to I1 , . . . , In . One requires M to have
transversal zeros in the vector sense. This result was given by Arnold for forced
systems and was extended to the autonomous case by Holmes and Marsden [1982b,
1983]; see also Robinson [1988]. These results apply to systems such as pendulum
coupled to several oscillators and the many vortex problems. It has also been used
in power systems by Salam, Marsden and Varaiya [1983], building on the horseshoe
case treated by Kopell and Washburn [1982]. There have been a number of other
directions of research on these techniques. For example, Grundler developed a
multidimensional version applicable to the spherical pendulum and Greenspan and
Holmes showed how it can be used to study subharmonic bifurcations. See Wiggins
[1988] references and for more information.
In Poincaré’s celebrated memoir [1890] on the three-body problem, he intro-
duced the mechanism of transversal intersection of separatrices which obstructs the
integrability of the equations and the attendant convergence of series expansions for
the solutions. This idea was subsequently developed by Birkhoff and Smale using
10 Hamiltonian Bifurcation 192
which is exponentially small (or beyond all orders) in µ. Poincaré was well aware
of the difficulties that this exponentially small behavior causes; on p. 224 of his
article, he states: “En d’autres termes, si on regarde µ comme un infiniment petit du
premier ordre, la distance BB 0 sans être nulle, est un infiniment petit d’ordre infini.
C’est ainsi que la fonction e−1/µ est un infiniment petit d’ordre infini sans ètre nulle
0
√ BB est
. . . Dans l’example particulier que nous avons traité plus haut, la distance
du mème ordre de grandeur que l’integral J, c’est à dire que exp(−π/ 2µ).”
This is a serious difficulty that arises when one uses the Melnikov method near
an elliptic fixed point in a Hamiltonian system or in bifurcation problems giving
birth to homoclinic orbits. The difficulty is related to those described by Poincaré
(see Sanders [1982]). Near elliptic points, one sees homoclinic orbits in normal forms
and after a temporal rescaling, this leads to a rapidly oscillatory perturbation that
is modelled by the following variation of (10.6.1):
µ ¶
ωt
φ̈ + sin φ = ² cos . (10.6.25)
²
ẍ + µx + x2 = 0 (10.6.27)
.
x
.
x
x x
−µ −µ
µ<0 µ>0
ẍ + µx + x2 = δf (t). (10.6.30)
The exponentially small estimates of Holmes, Marsden and Scheurle [1989] apply
to (10.6.32). One gets upper and lower estimates in certain algebraic sectors of the
(δ, µ) plane.
Now we consider
ẍ + µx + x2 + x3 = δf (t). (10.6.33)
With δ = 0, there are equilibria at
µ
x = 0, −r, or − and ẋ = 0, (10.6.34)
r
where √
1 − 4µ
1+
r= , (10.6.35)
2
which is approximately 1 when µ ≈ 0. The phase portrait of (10.6.33) with δ = 0
and µ = − 12 is shown in Figure 10.6.2. As µ passes through 0, the small lobe
undergoes the same bifurcation as in Figure 10.6.1, with the large lobe changing
only slightly.
.
x
Notice that for δ = 0, the phase portrait is µ-dependent. The homoclinic orbit
surrounding the small lobe for µ < 0 is given explicitly in terms of ξ by
4eτ
ξ(τ ) = ¡ ¢ , (10.6.37)
2 2
eτ + 3 − 2µ
10 Hamiltonian Bifurcation 195
remains intact.
Remarks
10 Hamiltonian Bifurcation 196
2. Spectral stability need not imply stability, even linear stability. This is shown
by the unstable linear system q̇ = p, ṗ = 0 with a pair of eigenvalues at zero.
3. If Q has odd index (an odd number of negative eigenvalues), then L has
a real positive eigenvalue; see Oh [1987]. Indeed, in canonical coordinates,
and identifying Q with its corresponding matrix, we have L = JQ. Thus,
det L = det Q is negative. Since det L is the product of the eigenvalues of
L and they come in conjugate pairs, there must be at least one real pair of
eigenvalues, and in fact an odd number of positive real eigenvalues.
4. If P = T ∗ Q with the standard symplectic structure (no magnetic terms)
and if H is of the form kinetic plus potential so that an equilibrium has the
form (qe , 0), and if δ 2 V (qe ) has negative index, then again L must have real
eigenvalues. This is because one can diagonalize δ 2 V (qe ) with respect to the
kinetic energy inner product, in which case the eigenvalues are evident. ¨
To get more interesting effects than covered by the above remarks, we consider
gyroscopic systems; i.e., linear systems of the form
M q̈ + S q̇ + Λq = 0 (10.7.1)
If the index of V is even (see Remark 3) one can get situations where δ 2 H is
indefinite and yet spectrally stable. Roughly, this is a situation that is capable of
undergoing a Hamiltonian Hopf bifurcation, so includes examples like the “cowboy”
solution for the double spherical pendulum and certain regimes of the heavy top.
This result, due to Bloch, Krishnaprasad, Marsden and Ratiu [1994] builds on
basic work of Chetaev [1961] and Hahn [1967]. The argument proceeds in two steps.
for small β.
This function has the beautiful property that for β small enough, W has the
same index as H, yet Ẇ is negative definite, where the overdot is taken in the
dynamics of (10.7.4). This alone is enough to prove Liapunov instability, as is seen
by studying the equation
Z T
W (q(T ), p(T )) = W (q0 , p0 ) + Ẇ (q(t), p(t))dt (10.7.6)
0
and choosing (q0 , p0 ) in the sector where W is negative, but arbitrarily close to the
origin.
states that
¯
ξ(JB) anti ξ
Reλ² = ² + O(²2 ) (10.7.7)
ξ¯T Jξ
where we write the linearized equations in the form
ẍ − g ẏ + γ ẋ + αx = 0
(10.7.9)
ÿ + g ẋ + δ ẏ + βy = 0,
and Hamiltonian
1 2 1
H= (ẋ + ẏ 2 ) + (αx2 + βy 2 ). (10.7.11)
2 2
(Note that for α = β, angular momentum is conserved.)
The characteristic polynomial is computed to be
Let
p0 (λ) = λ4 + (g 2 + α + β)λ2 + αβ. (10.7.13)
Since p0 is a quadratic in λ2 , its roots are easily found. One gets:
i If α > 0, β > 0, then H is positive definite and the eigenvalues are on the
imaginary axis; they are coincident in a one-to-one resonance for α = β.
ii If α and β have opposite signs, then H has index 1 and there is one eigenvalue
pair on the real axis and one pair on the imaginary axis.
iii If α < 0 and β < 0 then H has index 2. Here the eigenvalues may or may not
be on the imaginary axis.
10 Hamiltonian Bifurcation 199
c If D > 0 and g 2 + α + β > 0, the roots are on the imaginary axis and if
g 2 + α + β < 0, they are on the real axis.
√
Thus the case in which D ≥ 0 and g 2 + α + β > 0 (i.e., if g 2 + α + β ≥ 2 αβ), is
one to which the dissipation induced instabilities theorem applies.
Note that for g 2 + α + β > 0, if D decreases through zero, a Hamiltonian Hopf
bifurcation occurs. For example, as g increases and the eigenvalues move onto the
imaginary axis, one speaks of the process as gyroscopic stabilization.
Now we add damping and get
Proof We use the Routh-Hurwitz criterion (see Gantmacher [1959, vol. 2]), which
states that the number of strict right half plane roots of the polynomial
λ4 + ρ1 λ3 + ρ2 λ2 + ρ3 λ + ρ4
{+, +, +, −, +}.
where γ is the magneto-mechanical ratio (so that γkBk is the Larmour frequency)
and λ is the damping coefficient due to domain walls.
In each case, it is well known that the equations without damping can be written
in either Euler-Poincaré form or in Lie-Poisson (Hamiltonian) form. The equations
are Hamiltonian with the rigid body Poisson bracket:
can be derived from a skew symmetric Poisson bracket. For the case of the rigid
body, this bracket is
As we have already indicated, the same formalism can be applied to other sys-
tems as well. In fact, later in the paper we develop an abstract construction for
dissipative terms with the same general properties as the above examples. When
this method is applied to fluids one gets a dissipative mechanism related to that of
Vallis, Carnevale, and Young [1989] and Shepherd [1992] as follows. One modifies
the Euler equations for a perfect fluid, namely
∂v
+ v · ∇v = −∇p (10.8.9)
∂t
where v is the velocity field, assumed divergence free and parallel to the boundary
of the fluid container, and where p is the pressure. With dissipation, the equations
become: µ³ ´] ¶
∂v
+ v · ∇v = −∇p + αP £u(v) v [
(10.8.10)
∂t
where α is a positive constant, P is the Hodge projection onto the divergence free
part, and where µ³ ´] ¶
[
u(v) = P £v v .
The flat and sharp symbols denote the index lowering and raising operators induced
by the metric; that is, the operators that convert vectors to one forms and vice versa.
Written in terms of the vorticity, these equations become
d
ω + £v ω = α£u(v) ω.
dt
This dissipative term preserves the coadjoint orbits, that is, the isovortical surfaces
(in either two or three dimensions, or in fact, on any Riemannian manifold), and
with it, the time derivative of the energy is strictly negative (except at equilibria,
where it is zero). As we shall see, there is a similar dissipative term in the case of
the Vlasov-Poisson equation for plasma physics.
References
Abarbanel, H.D.I. and D.D. Holm [1987] Nonlinear stability analysis of inviscid flows
in three dimensions: incompressible fluids and barotropic fluids, Phys. Fluids 30,
3369–3382.
Abarbanel, H.D.I., D.D. Holm, J.E. Marsden, and T.S. Ratiu [1986] Nonlinear stability
analysis of stratified fluid equilibria, Phil. Trans. R. Soc. Lond. A 318, 349–409;
also Phys. Rev. Lett. 52 [1984] 2352–2355.
Abed, E.H. and J-H. Fu [1986] Local feedback stabilization and bifurcation control. Syst.
and Cont. Lett. 7, 11-17, 8, 467-473.
Abraham, R. and J.E. Marsden [1978] Foundations of Mechanics. Second Edition,
Addison-Wesley Publishing Co., Reading, Mass..
Abraham, R., J.E. Marsden, and T.S. Ratiu [1988] Manifolds, Tensor Analysis, and
Applications. Second Edition, Springer-Verlag, New York.
Adams, M.R., J. Harnad, and E. Previato [1988] Isospectral Hamiltonian flows in finite
and infinite dimensions I. Generalized Moser systems and moment maps into loop
algebras, Comm. Math. Phys. 117, 451–500.
Aeyels, D. and M. Szafranski [1988] Comments on the stabilizability of the angular ve-
locity of a rigid body, Syst. Cont. Lett. 10, 35–39.
Aharonov, Y. and J. Anandan [1987] Phase change during acyclic quantum evolution,
Phys. Rev. Lett. 58, 1593–1596.
Alber, M. and J.E. Marsden [1992] On geometric phases for soliton equations, Comm.
Math. Phys. 149, 217–240.
Anandan, J. [1988] Geometric angles in quantum and classical physics, Phys. Lett. A
129, 201–207.
Andrews, D.G. [1984] On the existence of nonzonal flows satisfying sufficient conditions
for stability, Geo. Astr. Fluid Dyn. 28, 243–256.
Armbruster, D., J. Guckenheimer, and P. Holmes [1988] Heteroclinic cycles and modu-
lated traveling waves in systems with O(2)-symmetry, Physica D 29, 257–282.
Arms, J.M. [1981] The structure of the solution set for the Yang-Mills equations, Proc.
Camb. Phil. Soc. 90, 361–372.
Arms, J.M., A. Fischer, and J.E. Marsden [1975] Une approche symplectique pour des
théorémes de décomposition en géométrie ou relativité générale, C. R. Acad. Sci.
Paris 281, 517–520.
204
References 205
Arms, J.M., J.E. Marsden, and V. Moncrief [1981] Symmetry and bifurcations of mo-
mentum mappings, Comm. Math. Phys. 78, 455–478.
Arms, J.M., J.E. Marsden, and V. Moncrief [1982] The structure of the space solutions of
Einstein’s equations: II Several Killing fields and the Einstein-Yang-Mills equations,
Ann. Phys. 144, 81–106.
Arnold, V.I. [1964] Instability of dynamical systems with several degrees of freedom, Dokl.
Akad. Nauk. SSSR 156, 9–12.
Arnold, V.I. [1966] Sur la géometrie differentielle des groupes de Lie de dimenson infinie
et ses applications à l’hydrodynamique des fluids parfaits, Ann. Inst. Fourier,
Grenoble 16, 319–361.
Arnold, V.I. [1969] On an a priori estimate in the theory of hydrodynamical stability,
Am. Math. Soc. Transl. 79, 267–269.
Arnold, V.I. [1988] Dynamical Systems III. Encyclopedia of Mathemaics 3, Springer-
Verlag.
Arnold, V.I. [1989] Mathematical Methods of Classical Mechanics. Second Edition, Grad-
uate Texts in Math. 60, Springer-Verlag.
Ashbaugh, M.S., C.C. Chicone, and R.H. Cushman [1991] The twisting tennis racket, J.
Dyn. Diff. Eqns. 3, 67–85.
Atiyah, M. [1982] Convexity and commuting Hamiltonians, Bull. Lon. Math. Soc. 14,
1–15.
Austin, M., P.S. Krishnaprasad, and L.S. Wang [1993] Almost Poisson integration of rigid
body systems. J. Comput. Phys. 107, 105–117.
Baider, A., R.C. Churchill, and D.L. Rod [1990] Monodromy and nonintegrability in
complex Hamiltonian systems, J. Dyn. Diff. Eqns. 2, 451–481.
Baillieul, J. [1987] Equilibrium mechanics of rotating systems, Proc. CDC 26, 1429–1434.
Baillieul, J. and M. Levi [1987] Rotational elastic dynamics, Physica D 27, 43–62.
Baillieul, J. and M. Levi [1991] Constrained relative motions in rotational mechanics,
Arch. Rat. Mech. An. 115, 101–135.
Ball, J.M. and J.E. Marsden [1984] Quasiconvexity at the boundary, positivity of the
second variation and elastic stability, Arch. Rat. Mech. An. 86, 251–277.
Bao, D., J.E. Marsden, and R. Walton [1984] The Hamiltonian structure of general
relativistic perfect fluids, Comm. Math. Phys. 99, 319–345.
Batt, J. and G. Rein [1991] A rigorous stability result for the Vlasov-Poisson system in
three dimensions, Ann. Math. Pura Appl. (to appear).
Benjamin, T.B. [1972] The stability of solitary waves. Proc. Roy. Soc. London 328A,153–
183.
Benjamin, T.B. [1984] Impulse, flow force and variational principles, IMA J. of Appl.
Math. 32, 3–68.
Benjamin, T.B. and P.J. Olver [1982] Hamiltonian structure, symmetries and conserva-
tion laws for water waves, J. Fluid Mech. 125, 137–185.
Berry, M. [1984] Quantal phase factors accompanying adiabatic changes, Proc. Roy. Soc.
London A 392, 45–57.
References 206
Berry, M. [1985] Classical adiabatic angles and quantal adiabatic phase, J. Phys. A.
Math. Gen. 18, 15–27.
Berry, M. and J. Hannay [1988] Classical non-adiabatic angles, J. Phys. A. Math. Gen.
21, 325–333.
Bertrand, J. [1852] J. de Math. 27, 393.
Bloch, A.M.[1989] Steepest descent, linear programming and Hamiltonian flows, Cont.
Math. AMS 114, 77–88.
Bloch, A.M., R.W. Brockett, and T.S. Ratiu [1990] A new formulation of the generalized
Toda Lattice equations and their fixed point analysis via the momentum map, Bull.
Amer. Math. Soc. 23, 477–485.
Bloch, A.M., R.W. Brockett, and T.S. Ratiu [1992] Completely integrable gradient flows.
Comm. Math. Phys. 147, 57–74.
Bloch, A.M. and P.E. Crouch [1994] Reduction of Euler Lagrange problems for con-
strained variational problems with optimal control problems. Proc. CDC 33 (to
appear).
Bloch, A.M., H. Flaschka, and T.S. Ratiu [1990] A convexity theorem for isospectral
manifolds of Jacobi matrices in a compact Lie algebra. Duke Math. J. 61, 41–65.
Bloch, A.M., P.S. Krishnaprasad, J.E. Marsden, and R. Murray [1994] Nonholonomic
mechanical systems with symmetry. (preprint.)
Bloch, A.M., P.S. Krishnaprasad, J.E. Marsden, and T.S. Ratiu [1994] Dissipation In-
duced Instabilities, Ann. Inst. H. Poincaré, Analyse Nonlineare 11, 37–90.
Bloch, A.M., P.S. Krishnaprasad, J.E. Marsden, and T.S. Ratiu [1995] The Euler-Poincaré
equations and double bracket dissipation, Comm. Math. Phys. (to appear).
Bloch, A.M., P.S. Krishnaprasad, J.E. Marsden, and G. Sánchez de Alvarez [1992] Sta-
bilization of rigid body dynamics by internal and external torques, Automatica 28,
745–756.
Bloch, A.M. and J.E. Marsden [1989] Controlling homoclinic orbits, Theor. and Comp.
Fluid Mech. 1, 179–190.
Bloch, A.M. and J.E. Marsden [1990] Stabilization of rigid body dynamics by the energy-
Casimir method, Syst. and Cont. Lett. 14, 341–346.
Bloch, A.M., N.H. McClamroch, and M. Reyhanoglu [1990] Controllability and stabiliz-
ability properties of a nonholonomic control system, Proc. CDC, Hawaii, 1312–1314.
Bobenko, A.I., A.G. Reyman, and M.A. Semenov-Tian-Shansky [1989] The Ko-walewski
Top 99 years later: A Lax pair, generalizations and explicit solutions, Comm. Math.
Phys. 122, 321–354.
Bona, J. [1975] On the stability theory of solitary waves. Proc. Roy. Soc. London 344A,
363–374.
Bonnard, B. [1986] Controllabilité des systèmes méchanique sur les groups de Lie , SIAM
J. Cont. Optim. 22, 711–722.
Born, M. and L. Infeld [1935] On the quantization of the new field theory, Proc. Roy.
Soc. A 150, 141.
Brockett, R.W. [1973] Lie algebras and Lie groups in control theory, in Geometric Methods
in Systems Theory, Proc. NATO Advanced Study Institute, R.W. Brockett and
D.Q. Mayne (eds.), Reidel, 43–82.
References 207
Brockett, R.W. [1976] Nonlinear systems and differential geometry, Proc. IEEE 64, No
1, 61–72.
Brockett, R.W. [1981] Control theory and singular Riemannian geometry, in New Direc-
tions in Applied Mathematics, P.J. Hilton and G.S. Young (eds.), Springer.
Brockett, R.W. [1983] Asymptotic stability and feedback stabilization, in Differential
Geometric Control Theory, R.W. Brockett, R.S. Millman and H. Sussman (eds.),
Birkhauser.
Brockett, R.W. [1987] On the control of vibratory actuators, Proc. 1987 IEEE Conf.
Decision and Control , 1418–1422.
Brockett, R.W. [1988] Dynamical systems that sort lists and solve linear programming
problems. Proc. IEEE 27, 799–803 and Linear Algebra and its Appl. 146, (1991),
79–91.
Brockett, R.W. [1989] On the rectification of vibratory motion, Sensors and Actuators
20, 91–96.
Brockett, R.W. [1993] Differential geometry and the design of gradient algorithms. Proc.
Symp. Pure Math., AMS 54, Part I, 69–92.
Byrnes, C.I. and A. Isidori [1989] Attitude stabilization of rigid spacecraft, Automatica,
to appear.
Carnevale, G.F. and T.G. Shepherd [1990] On the interpretation of Andrew’s theorem,
Geo. Astro. Fluid. Dyn. 51, 4–7.
Cartan, E. [1923] Sur les varietes a connexion affine et theorie de relativité généralizée,
Ann. Ecole Norm. Sup. 40, 325–412, 41, 1–25.
Cartan, E. [1928] Sur la stabilité ordinaire des ellipsoides de Jacobi, Proc. Int. Math.
Cong. Toronto 2, 9–17.
Cendra, H. and J.E. Marsden [1987] Lin constraints, Clebsch potentials and variational
principles, Physica D 27, 63–89.
Cendra, H., A. Ibort, and J.E. Marsden [1987] Variational principal fiber bundles: a
geometric theory of Clebsch potentials and Lin constraints, J. Geom. Phys. 4,
183–206.
Chandrasekhar, S. [1977] Ellipsoidal Figures of Equilibrium. Dover.
Chandrasekhar, S. [1961] Hydrodynamic and Hydromagnetic Instabilities. Oxford Uni-
versity Press.
Channell, P. and C. Scovel [1990] Symplectic integration of Hamiltonian systems, Non-
linearity 3, 231–259.
Chern, S.J. [1991] Mathematical Theory of the Barotropic Model in Geophysical Fluid
Dynamics. Thesis, Cornell University.
Chern, S.J. and J.E. Marsden [1990] A note on symmetry and stability for fluid flows,
Geo. Astro. Fluid. Dyn. 51, 1–4.
Chernoff, P.R. and J.E. Marsden [1974] Properties of infinite dimensional Hamiltonian
systems. Lecture Notes in Math. 425, Springer-Verlag, New York.
Chetayev, N.G. [1961] The Stability of Motion. Trans. by M. Nadler Pergamon Press,
New York.
References 208
Chillingworth, D.R.J., J.E. Marsden, and Y.H. Wan [1982] Symmetry and bifurcation in
three dimensional elasticity, Arch. Rat. Mech. An. 80, 295–331, 83 [1983] 363–395,
84 [1984] 203–233.
Chirikov, B.V. [1979] A universal instability of many dimensional oscillator systems,
Phys. Rep. 52, 263–379.
Chorin, A.J, T.J.R. Hughes, J.E. Marsden, and M. McCracken [1978] Product Formulas
and Numerical Algorithms, Comm. Pure Appl. Math. 31, 205–256.
Chorin, A.J. and J.E. Marsden [1989] A Mathematical Introduction to Fluid Mechanics.
Texts in Applied Mathematics, 4, Springer-Verlag, New York.
Chow, S.N. and J.K. Hale [1982] Methods of Bifurcation Theory. Springer, New York.
Clebsch, A. [1857] Über eine allgemeine Transformation der hydrodynamischen Gleichun-
gen, Zeitschrift für Reine Angew. Math. 54, 293–312.
Clebsch, A. [1859] Über die Integration der hydrodynamischen Gleichungen, Zeitschrift
für Reine Angew. Math. 56, 1–10.
Cordani, B. [1986] Kepler problem with a magnetic monopole, J. Math. Phys. 27,
2920–2921.
Crouch, P.E. [1986] Spacecraft attitude control and stabilization: Application of geomet-
ric control to rigid body models, IEEE Trans. Aut. Cont. AC-29, 321–331.
Cushman, R. and D. Rod [1982] Reduction of the semi-simple 1:1 resonance, Physica D
6, 105–112.
Darling, B.T. and D.M. Dennison [1940] The water vapor molecule, Phys. Rev. 57,
128–139.
David, D. and D.D. Holm [1990] Multiple Lie-Poisson structures, reductions, and geo-
metric phases for the Maxwell-Bloch travelling-wave equations, preprint.
David, D., D.D. Holm, and M. Tratnik [1990] Hamiltonian chaos in nonlinear optical
polarization dynamics, Physics reports 187, 281–370.
Dellnitz, M., J.E. Marsden, I. Melbourne, and J. Scheurle [1992] (in preparation).
Dellnitz, M., I. Melbourne, and J.E. Marsden [1992] Generic bifurcation of Hamiltonian
vector fields with symmetry, (to appear).
Deprit, A. [1983] Elimination of the nodes in problems of N bodies, Celestial Mech. 30,
181–195.
deVogelaére, R. [1956] Methods of integration which preserve the contact transformation
property of the Hamiltonian equations, Department of Mathematics, University of
Notre Dame Report 4.
Dirac, P.A.M. [1964] Lectures on quantum mechanics, Belfer Graduate School of Science,
Monograph Series 2, Yeshiva University.
Dritchel, D. [1989] The stability of elliptical vortices in an external straining flow, J.
Fluid Mech. 210, 223–261.
Duistermaat, J.J. [1980] On global action angle coordinates, Comm. Pure Appl. Math.
33, 687–706.
Dzyaloshinskii, I.E. and G.E. Volovick [1980] Poisson brackets in condensed matter
physics, Ann. Phys. 125, 67–97.
References 209
Ebin, D.G. and J.E. Marsden [1970] Groups of diffeomorphisms and the motion of an
incompressible fluid, Ann. Math. 92, 102–163.
Eckart, C. [1935] Some studies concerning rotating axes and polyatomic molecules, Phys.
Rev. 47, 552–558.
Emmrich, C. and H. Römer [1990] Orbifolds as configuration spaces of systems with
gauge symmetries, Comm. Math. Phys. 129, 69–94.
Enos, M.J. [1993] On an optimal control problem on SO(3) × SO(3) and the falling cat.
Fields Inst. Comm. 1, 75–112.
Ercolani, N., M.G. Forest, and D.W. McLaughlin [1990] Geometry of the modulational
instability, III. Homoclinic orbits for the periodic sine-Gordon equation, Physica D
43, 349.
Faybusovich, L. [1991] Hamiltonian structure of dynamical systems which solve linear
programing problems, preprint.
Feng, K. [1986] Difference Schemes for Hamiltonian Formalism and Symplectic Geometry,
J. Comp. Math. 4, 279–289.
Feng, K. and Z. Ge [1988] On approximations of Hamiltonian systems, J. Comp. Math.
6, 88–97.
Feng, K. and M.Z. Qin [1987] The symplectic methods for the computation of Hamiltonian
equations, Springer Lecture Notes in Math. 1297, 1–37.
Fischer, A.E. and J.E. Marsden [1972] The Einstein equations of evolution — a geometric
approach, J. Math. Phys. 13, 546–68.
Fischer, A.E. and J.E. Marsden [1979] Topics in the dynamics of general relativity, in
Isolated Gravitating systems in General Relativity, J. Ehlers (ed.), Italian Physical
Society, 322–395.
Fischer, A.E., J.E. Marsden, and V. Moncrief [1980] The structure of the space of so-
lutions of Einstein’s equations, I: One Killing field, Ann. Inst. H. Poincaré 33,
147–194.
Fontich, E. and J.C. Simo [1990] The splitting of separatrices for analytic diffeomor-
phisms, Erg. Th. Dyn. Sys. 10, 295–318.
Galin, D.M. [1982] Versal deformations of linear Hamiltonian systems, AMS Transl. 118,
1–12, (1975 Trudy Sem. Petrovsk. 1, 63–74).
Ge, Z. [1990] Generating functions, Hamilton-Jacobi equation and symplectic groupoids
over Poisson manifolds, Ind. Univ. Math. J. 39, 859–876.
Ge, Z. [1991a] Equivariant symplectic difference schemes and generating functions, Phys-
ica D 49, 376–386.
Ge, Z. [1991b] A constrained variational problem and the space of horizontal paths, Pac.
J. Math. 149, 61–94.
Ge, Z. and J.E. Marsden [1988] Lie-Poisson integrators and Lie-Poisson Ham-iltonian-
Jacobi theory, Phys. Lett. A 133, 134–139.
Goldman, W.M. and J.J. Millson [1990] Differential graded Lie algebras and singularities
of level sets of momentum mappings, Comm. Math. Phys. 131, 495–515.
Golin, S., A. Knauf, and S. Marmi [1989] The Hannay angles: geometry, adiabaticity,
and an example, Comm. Math. Phys. 123, 95–122.
References 210
Golin, S. and S. Marmi [1990] A class of systems with measurable Hannay angles, Non-
linearity 3, 507–518.
Golubitsky, M. and I. Stewart [1987] Generic bifurcation of Hamiltonian systems with
symmetry, Physica D 24, 391–405.
Golubitsky, M., I. Stewart, and D. Schaeffer [1988] Singularities and Groups in Bifurca-
tion Theory, Vol. 2. Springer-Verlag, AMS Series 69.
Goodman, L.E. and A.R. Robinson [1958] Effects of finite rotations on gyroscopic sensing
devices. J. Appl. Mech 28, 210–213. (See also Trans. ASME 80, 210–213.)
Gotay, M., J. Isenberg, J.E. Marsden, and R. Montgomery [1992] Momentum Maps and
the Hamiltonian Structure of Classical Relativistic Field Theories. Springer-Verlag,
(to appear).
Gotay, M., R. Lashof, J. Sniatycki, and A. Weinstein [1980] Closed forms on symplectic
fiber bundles, Comm. Math. Helv. 58, 617–621.
Greenspan, D. [1974] Discrete Numerical Methods in Physics and Engineering. Academic
Press.
Greenspan, D. [1984] Conservative numerical methods for ẍ = f (x), J. Comp. Phys. 56,
21–48.
Grossman, R., P.S. Krishnaprasad, and J.E. Marsden [1988] The dynamics of two coupled
rigid bodies, in Dynamical Systems Approaches to Nonlinear Problems in Systems
and Circuits, Salam and Levi (eds.), SIAM , 373–378.
Guckenheimer, J. and P. Holmes [1983] Nonlinear Oscillations, Dynamical Systems and
Bifurcations of vector Fields. Springer Applied Math. Sciences 43.
Guckenheimer, J. and P. Holmes [1988] Structurally stable heteroclinic cycles, Math.
Proc. Cam. Phil. Soc. 103, 189–192.
Guichardet, A. [1984] On rotation and vibration motions of molecules, Ann. Inst. H.
Poincaré 40, 329–342.
Guillemin, V. and E. Prato [1990] Heckman, Kostant, and Steinberg formulas for sym-
plectic manifolds, Adv. Math. 82, 160–179.
Guillemin, V. and S. Sternberg [1980] The moment map and collective motion, Ann.
Phys. 1278, 220–253.
Guillemin, V. and S. Sternberg [1984] Symplectic Techniques in Physics. Cambridge
University Press.
Hahn, W. [1967] Stability of Motion. Springer-Verlag, New York.
Hamilton, W.R. [1834] On a general method in dynmaics Phil. Trans. Roy. Soc. Lon.
Part II, 247–308, Part I for 1835, 95–144.
Hannay, J. [1985] Angle variable holonomy in adiabatic excursion of an itegrable Hamil-
tonian, J. Phys. A 18, 221–230.
Harnad, J. and J. Hurtubise [1990] Generalized tops and moment maps into loop algebras,
preprint, CRM .
Harnad, J., J. Hurtubise, and J.E. Marsden [1991] Reduction of Hamiltonian systems
with discrete symmetry, preprint.
Hirschfelder, J.O. and J.S. Dahler [1956] The kinetic energy of relative motion, Proc.
Nat. Acad. Sci. 42, 363–365.
References 211
Hodgins, J. and M. Raibert [1990] Biped gymnastics, Int. J. Robotics Research, (in
press).
Holm, D.D., B.A. Kupershmidt, and C.D. Levermore [1985] Hamiltonian differencing of
fluid dynamics, Adv. in Appl. Math. 6, 52–84.
Holm, D.D. and J.E. Marsden [1991] The rotor and the pendulum, Proc. Souriaufest,
Birkhauser.
Holm, D.D., J.E. Marsden, and T.S. Ratiu [1986] The Hamiltonian structure of contin-
uum mechanics in material, spatial and convective representations, Séminaire de
Mathématiques supérieurs, Les Presses de L’Univ. de Montréal 100, 11–122.
Holm, D.D., J.E. Marsden, T.S. Ratiu, and A. Weinstein [1985] Nonlinear stability of
fluid and plasma equilibria, Phys. Rep. 123, 1–116.
Holmes, P.J. and J.E. Marsden [1981] A partial differential equation with infinitely many
periodic orbits: chaotic oscillations of a forced beam, Arch. Rat. Mech. Anal. 76,
135-166.
Holmes, P.J. and J.E. Marsden [1982a] Horseshoes in perturbations of Hamiltonian sys-
tems with two degrees of freedom, Comm. Math. Phys. 82, 523–544.
Holmes, P.J. and J.E. Marsden [1982b] Melnikov’s method and Arnold diffusion for per-
turbations of integrable Hamiltonian systems, J. Math. Phys. 23, 669–675.
Holmes, P.J. and J.E. Marsden [1983] Horseshoes and Arnold diffusion for Hamiltonian
systems on Lie groups, Indiana Univ. Math. J. 32, 273–310.
Holmes, P.J., J.E. Marsden, and J. Scheurle [1988] Exponentially small splittings of
separatrices with applications to KAM theory and degenerate bifurcations, Cont.
Math. 81, 213–244.
Howard, J.E. and R.S. Mackay [1987] Linear stability of symplectic maps, J. Math. Phys.
28, 1036–1051.
Howard, J.E. and R.S. Mackay [1987] Calculation of linear stability boundaries for equi-
libria of Hamiltonian systems, Phys. Lett. A. 122, 331–334.
Hughes, T.J.R., W.K. Liu, and P. Caughy [1978] Transient finite element formulations
that preserve energy, J. Appl. Mech. 45, 366–370.
Hsiang, W.Y. and R. Montgomery [1995] The geometric phase for the three body problem.
Ishlinskii, A. [1952] Mechanics of Special Gyroscopic Systems (in Russian). National
Acad. Ukrainian SSR, Kiev.
Iwai, T. [1982] On a “conformal” Kepler problem and its reduction, J. Math. Phys. 22,
1633–1639.
Iwai, T. [1982] The symmetry group of the harmonic oscillator and its reduction, J. Math.
Phys. 23, 1088–1092.
Iwai, T. [1987a] A gauge theory for the quantum planar three-body system, J. Math.
Phys. 28, 1315–1326.
Iwai, T. [1987b] A geometric setting for internal motions of the quantum three-body
system, J. Math. Phys. 28, 1315–1326.
Iwai, T. [1987c] A geometric setting for classical molecular dynamics, Ann. Inst. H.
Poincaré, Phys. Th. 47, 199–219.
Iwai, T. [1990a] On the Guichardet/Berry connection, Phys. Lett. A 149, 341–344.
References 212
Iwai, T. [1990b] The geometry of the SU (2) Kepler problem, J. Geom. Phys. 7, 507–535.
Iwai, T. and Y. Uwano [1991] The MIC-Kepler problem and its symmetry group for
positive energies both in classical and quantum mechanics, Il Nuovo Cimento, 106,
849–878.
Iwiínski, Z.R. and L.A. Turski [1976] Canonical theories of systems interacting electro-
magnetically, Lett. Appl. Engin. Sci. 4, 179–191.
Jacobi, C.G.K. [1843] J. für Math. 26, 115.
Jellinek, J. and D.H. Li [1989] Separation of the energy of overall rotations in an N -body
system, Phys. Rev. Lett. 62, 241–244.
Jepson, D.W. and J.O. Hirschfelder [1958] Set of coordinate systems which diagonalize
the kinetic energy of relative motion, Proc. Nat. Acad. Sci. 45, 249–256.
Kammer, D.C. and G.L. Gray [1993] A nonlinear control design for energy sink simulation
in the Euler-Poinsot problem. J. Astro. Sci. 41, 53–72.
Kane, T.R. and M. Scher [1969] A dynamical explanation of the falling cat phenomenon,
Int. J. Solids Struct. 5, 663–670.
Kazhdan, D., B. Kostant, and S. Sternberg [1978] Hamiltonian group actions and dy-
namical systems of Calogero type, Comm. Pure Appl. Math. 31, 481–508.
Kirillov, A.A. [1962] Unitary representations of nilpotent Lie groups, Russian Math. Sur-
veys 17, 53–104.
Kirillov, A.A. [1976a] Local Lie Algebras. Russian Math. Surveys 31, 55–75.
Kirillov, A.A. [1976b] Elements of the Theory of Representations. Grundlehren Math.
Wiss. Springer-Verlag.
Klein, F. [1897] The Mathematical Theory of the Top. Scribner’s, New York.
Koiller, J. and M.J.H. Dantes [1990] Motion of impurities in ideal planar fluids, preprint.
Kopell, N. and R.B. Washburn, Jr. [1982] Chaotic motions in the two degree-of-freedom
swing equations, IEEE Trans. Circuits Syst., 738–746.
Kosmann-Schwarzbach, Y. and F. Magri [1990] Poisson-Nijenhuis structures, Ann. Inst.
H. Poincaré 53, 35–81.
Krein, M.G. [1950] A generalization of several investigations of A.M. Liapunov on linear
differential equations with periodic coefficients, Dokl. Akad. Nauk. SSSR 73, 445–
448.
Krishnaprasad, P.S. [1985] Lie-Poisson structures, dual-spin spacecraft and asymptotic
stability, Nonl. An. Th. Meth. Appl. 9, 1011–1035.
Krishnaprasad, P.S. [1989] Eulerian many-body problems, Cont. Math. AMS 97, 187–
208.
Krishnaprasad, P.S. and J.E. Marsden [1987] Hamiltonian structure and stability for rigid
bodies with flexible attachments, Arch. Rat. Mech. An. 98, 137–158.
Krishnaprasad, P.S. and L.S. Wang [1992] Gyroscopic control and stabilization, J. Non-
linear Sci. (to appear).
Krishnaprasad, P.S. and R. Yang [1991] Geometric phases, anholonomy and optimal
movement, Proc. IEEE Conf. Rob. Aut., 1–5.
Krishnaprasad, P.S., R. Yang, and W.P. Dayawansa [1991] Control problems on principle
bundles and nonholonomic mechanics, preprint.
References 213
Krupa, M. [1990] Bifurcations of relative equilibria, SIAM J. Math. An. 21, 1453–1486.
Kruskal, M. and H. Segur [1991] Asymptotics beyond all orgers in a model of crystal
growth. Stud. Appl. Math. 85, 129–181.
Kummer, M. [1981] On the construction of the reduced phase space of a Hamiltonian
system with symmetry, Indiana Univ. Math. J. 30, 281–291.
Labudde, R.A. and D. Greenspan [1976] Energy and momentum conserving methods of
arbitrary order for the numerical integration of equations of motion, Num. Meth.
25, 323–346, 26, 1–16.
Lagrange, J.L. [1788] Mëcanique Analitique. Chez la Veuve Desaint
Leimkuhler, B. and S. Reich [1994] Symplectic integration of constrained Hamiltonian
systems. Math. Comp. to appear .
Levi, M. [1989] Morse theory for a model space structure, Cont. Math. AMS 97, 209–216.
Levi, M. [1993] Geometric phases in the motion of rigid bodies, Arch. Rat. Mech. Anal.
122, 213–229.
Lewis, D.R. [1989] Nonlinear stability of a rotating planar liquid drop, Arch. Rat. Mech.
Anal. 106, 287–333.
Lewis, D.R. [1991] Lagrangian block diagonalization, Dyn. Diff. Eqn’s. (to appear).
Lewis, D.R., J.E. Marsden, R. Montgomery, and T.S. Ratiu [1986] The Hamiltonian
structure for dynamic free boundary problems, Physica D 18, 391–404.
Lewis, D.R., J.E. Marsden, and T.S. Ratiu [1987] Stability and bifurcation of a rotating
liquid drop, J. Math. Phys. 28, 2508–2515.
Lewis, D.R., J.E. Marsden, T.S. Ratiu, and J.C. Simo [1990] Normalizing connections and
the energy-momentum method, Proceedings of the CRM conference on Hamiltonian
systems, Transformation Groups, and Spectral Transform Methods, CRM Press,
Harnad and Marsden (eds.), 207–227.
Lewis, D.R., T.S. Ratiu, J.C. Simo, and J.E. Marsden [1992] The heavy top, a geometric
treatment, Nonlinearity (to appear).
Lewis, D.R. and J.C. Simo [1990] Nonlinear stability of rotating pseudo-rigid bodies,
Proc. Roy. Soc. Lon. A 427, 281–319.
Li, C.W. and Qin, M.Z. [1988] A symplectic difference scheme for the infinite dimensional
Hamilton system, J. Comp. Math. 6, 164–174.
Li, Z., L. Xiao and M.E. Kellman [1990] Phase space bifurcation structure and the gen-
eralized local-to-normal transition in resonantly coupled vibrations, J. Chem. Phys.
92, 2251–2268.
Lichtenberg, A.J. and M.A. Liebermann [1983] Regular and stochastic motion, Applied
Math. Science 38, Springer-Verlag.
Lie, S. [1890] Theorie der Transformationsgruppen, Zweiter Abschnitt. Teubner, Leipzig.
Littlejohn, R.G. [1988] Cyclic evolution in quantum mechanics and the phases of Bohr-
Sommerfeld and Maslov, Phys. Rev. Lett. 61, 2159–2162.
Low, F.E. [1958] A Lagrangian formulation of the Boltzmann-Vlasov equation for plas-
mas, Proc. Roy. Soc. Lond. A 248, 282–287.
Lu, J.H. and T.S. Ratiu [1991] On Kostant’s convexity theorem, J. AMS (to appear).
References 214
Marsden, J.E., J.C. Simo, D.R. Lewis, and T.A. Posbergh [1989] A block diagonalization
theorem in the energy momentum method, Cont. Math. AMS 97, 297–313.
Marsden, J.E. and A. Weinstein [1974] Reduction of symplectic manifolds with symmetry,
Rep. Math. Phys. 5, 121–130.
Marsden, J.E. and A. Weinstein [1982] The Hamiltonian structure of the Maxwell-Vlasov
equations, Physica D 4, 394–406.
Marsden, J.E. and A. Weinstein [1983] Coadjoint orbits, vortices and Clebsch variables
for incompressible fluids, Physica D 7, 305–323.
Marsden, J.E., A. Weinstein, T.S. Ratiu, R. Schmid, and R.G. Spencer [1983] Hamilto-
nian systems with symmetry, coadjoint orbits and plasma physics, in Proc. IUTAM-
IS1MM Symposium on Modern Developments in Analytical Mechanics, Torino 1982,
Atti della Acad. della Sc. di Torino 117, 289–340.
Martin, J.L. [1959] Generalized classical dynamics and the “classical analogue” of a Fermi
oscillation, Proc. Roy. Soc. A 251, 536.
Melnikov, V.K. [1963] On the stability of the center for time periodic perturbations,
Trans. Moscow Math. Soc. 12, 1–57.
Meyer, K.R. [1973] Symmetries and integrals in mechanics, in Dynamical Systems, M.
Peixoto (ed.), Academic Press, 259–273.
Meyer, K.R. [1981] Hamiltonian systems with a discrete symmetry, J. Diff. Eqn’s. 41,
228–238.
Meyer, K.R. and R. Hall [1991] Hamiltonian Mechanics and the n-body Problem. Springer-
Verlag, Applied Mathematical Sciences.
Misner, C., K. Thorne, and J.A. Wheeler [1973] Gravitation. W.H. Freeman, San Fran-
cisco.
Montaldi, J.A., R.M. Roberts, and I.N. Stewart [1988] Periodic solutions near equilibria
of symmetric Hamiltonian systems, Phil. Trans. R. Soc. Lond. A 325, 237–293.
Montgomery, R. [1984] Canonical formulations of a particle in a Yang-Mills field, Lett.
Math. Phys. 8, 59–67.
Montgomery, R [1986] The bundle picture in mechanics Thesis, UC Berkeley.
Montgomery, R. [1988] The connection whose holonomy is the classical adiabatic angles of
Hannay and Berry and its generalization to the non-integrable case, Comm. Math.
Phys. 120, 269–294.
Montgomery, R. [1990] Isoholonomic problems and some applications, Comm. Math
Phys. 128, 565–592.
Montgomery, R. [1991] How much does a rigid body rotate? A Berry’s phase from the
18th century, Am. J. Phys. 59, 394–398.
Montgomery, R., J.E. Marsden, and T.S. Ratiu [1984] Gauged Lie-Poisson structures,
Cont. Math. AMS 28, 101–114.
Montgomery, R., M. Raibert, and Zexing Li [1990] Dynamics and optimal control of
legged locomotion systems, preprint.
Moon, F.C. and P.J. Holmes [1979] A magnetoelastic strange attractor, J. Sound Vib.
65, 275–296, 69, 339.
References 216
Morozov, V.M., V.N. Rubanovskii, V.V. Rumiantsev, and V.A. Samsonov [1973] On the
bifurcation and stability of the steady state motions of complex mechanical systems,
PMM 37, 387–399.
Morrison, P.J. [1980] The Maxwell-Vlasov equations as a continuous Hamiltonian system,
Phys. Lett. A 80, 383–386.
Morrison, P.J. [1982] Poisson Brackets for fluids and plasmas, in Mathematical Methods
in Hydrodynamics and Integrability in Related Dynamical Systems, AIP Conf. Proc.
88, M. Tabor and Y.M. Treve (eds.), La Jolla, Calif.
Morrison, P.J. [1986] A paradigm for joined Hamiltonian and dissipative systems, Physica
D 18, 410–419.
Morrison, P.J. [1987] Variational principle and stability of nonmonotone Vlasov-Poisson
equilibria, Z. Naturforsch. 42a, 1115–1123.
Morrison, P.J. and S. Eliezer [1986] Spontaneous symmetry breaking and neutral stability
on the noncanonical Hamiltonian formalism, Phys. Rev. A 33, 4205.
Morrison, P.J. and J.M. Greene [1980] Noncanonical Hamiltonian density formulation of
hydrodynamics and ideal magnetohydrodynamics, Phys. Rev. Lett. 45, 790–794,
errata 48 (1982), 569.
Morrison, P.J. and R.D. Hazeltine [1984] Hamiltonian formulation of reduced magneto-
hydrodynamics, Phys. Fluids 27, 886–897.
Morrison, P.J. and D. Pfirsch [1990] The free energy of Maxwell-Vlasov equilibria, Phys.
Fluids B 2, 1105–1113.
Moser, J. [1958] New aspects in the theory of stability of Hamiltonian systems, Comm.
Pure Appl. Math. XI, 81–114.
Moser, J. [1965] On the volume elements on a manifold, Trans. Am. Soc. 120, 286–294.
Moser, J. [1976] Periodic orbits near equilibrium and a theorem by Alan Weinstein,
Comm. Pure Appl. Math. 29, 727–747.
Nambu, Y. [1973] Generalized Hamiltonian dynamics, Phys. Rev. D 7, 2405–2412.
Neishtadt, A. [1984] The separation of motions in systems with rapidly rotating phase,
P.M.M. USSR 48, 133–139.
Noether, E. [1918] Invariante Variationsprobleme, Kgl. Ges. Wiss. Nachr. Göttingen.
Math. Physik. 2, 235–257.
Oh, Y.G. [1987] A stability criterion for Hamiltonian systems with symmetry, J. Geom.
Phys. 4, 163–182.
Oh, Y.G., N. Sreenath, P.S. Krishnaprasad, and J.E. Marsden [1989] The Dynamics of
Coupled Planar Rigid Bodies Part 2: Bifurcations, Periodic Solutions, and Chaos,
Dyn. Diff. Eq’ns. 1, 269–298.
Olver, P.J. [1980] On the Hamiltonian structure of evolution equations, Math. Proc.
Camb. Phil. Soc. 88, 71–88.
Olver, P.J. [1986] Applications of Lie Groups to Differential Equations. Graduate Texts
in Mathematics 107, Springer, Berlin.
Olver, P.J. [1988] Darboux’ theorem for Hamiltonian differential operators, J. Diff. Eqn’s
71, 10–33.
O’Reilly, O.M., N.K. Malhotra, and N.S. Namamchchivaya [1994] Destabilization of the
equilibria of reversible dynamical systems. to appear .
References 217
Otto, M. [1987] A reduction scheme for phase spaces with almost Kähler symmetry
regularity results for momentum level sets, J. Geom. Phys. 4, 101–118.
Palais, R. [1979] The principle of symmetric criticality, Comm. Math. Phys. 69, 19-30.
Palais, R. [1985] Applications of the symmetric criticality principle in mathemaitcal
physics and differential geometry, Proc. of 1981 Symp. on Diff. Geom. Diff.
Eqn’s, 247-302.
Patrick, G. [1989] The dynamics of two coupled rigid bodies in three space, Cont. Math.
AMS 97, 315–336.
Patrick, G. [1990] Nonlinear Stability of Coupled Rigid Bodies. PhD Thesis, University
of California, Berkeley.
Pauli, W. [1953] On the Hamiltonian structure of non-local field theories, Il Nuovo Ci-
mento X, 648–667.
Pego, R.L. and M.I. Weinstein [1992] Eigenvalues and instabilities of solitary waves. Phil.
Trans. Roy. Soc. Lon. 340, 47–94.
Penrose, O. [1960] Electrostatic instabilities of a uniform non-Maxwellian plasma, Phys.
Fluids 3, 258–265.
Pierce, A.P., M.A. Dahleh, and H. Rabitz [1988] Phys. Rev. A. 37, 4950.
Pfaffelmoser, K. [1989] Global Classical Solutions of the Vlasov-Poisson System in Three
Dimensions for General Initial Data. Thesis, University of Munich.
Poincaré, H. [1885] Sur l’équilibre d’une masse fluide animée d’un mouvement de rotation,
Acta. Math. 7, 259.
Poincaré, H. [1890] Sur la probléme des trois corps et les équations de la dynamique, Acta
Math. 13, 1–271.
Poincaré, H. [1892] Les formes d’équilibre d’une masse fluide en rotation, Revue Générale
des Sciences 3, 809–815.
Poincaré, H. [1901] Sur la stabilité de l’équilibre des figures piriformes affectées par une
masse fluide en rotation, Philosophical Transactions A 198, 333–373.
Pullin, D.I., and P.G. Saffman [1991] Long time symplectic integration: the example of
four-vortex motion, Proc. Roy. Soc. Lon. A 432, 481-494.
Ratiu, T.S. [1982] Euler-Poisson equations on Lie algebras and the N-dimensional heavy
rigid body, Am. J. Math. 104, 409–448, 1337.
Reyhanoglu, M. and N.H. McClamroch [1991] Reorientation of space multibody systems
maintaining zero angular momentum, AIAA Conf. Nav. Control .
Rayleigh, J.W.S. [1880] On the stability or instability of certain fluid motions, Proc.
Lond. Math. Soc. 11, 57–70.
Lord Rayleigh [1916] On the dynamics of revolving fluids, Proc. R. Soc. Lond. A 93,
148–154.
Reinhall, P.G., T.K. Caughy, and D.Q. Storti [1989] Order and chaos in a discrete Duffing
oscillator, implications for numerical integration, Trans. ASME , 56, 162–176.
Reyman, A.G. and M.A. Semenov-Tian-Shansky [1990] Group theoretical methods in
the theory of integrable systems, Encyclopedia of Mathematical Sciences, Springer-
Verlag 16.
References 218
Simo, J.C., J.E. Marsden, and P.S. Krishnaprasad [1988] The Hamiltonian structure of
nonlinear elasticity: The material, spatial, and convective representations of solids,
rods, and plates, Arch. Rat. Mech. An. 104, 125–183.
Simo, J.C., T.A. Posbergh, and J.E. Marsden [1990] Stability of coupled rigid body and
geometrically exact rods: block diagonalization and the energy-momentum method,
Physics Reports 193, 280–360.
Simo, J.C., T.A. Posbergh, and J.E. Marsden [1991] Stability of relative equilibria II:
Three dimensional elasticity, Arch. Rat. Mech. Anal. 115, 61–100.
Simo, J.C., M.S. Rifai, and D.D Fox [1992], On a stress resultant geometrically exact
shell models. Part VI: Conserving algorithms for nonlinear dynamics, Comp. Meth.
Appl. Mech. Eng. 34, 117–164.
Simo, J.C. and N. Tarnow [1992] The discrete energy momentum method. Conserving
algorithms for nonlinear elastodynamics, ZAMP 43, 757–792.
Simo, J.C., N. Tarnow, and K.K. Wong [1991], Exact energy-momentum conserving
algorithms and symplectic schemes for nonlinear dynamics, Computer Methods in
Applied Mechanics and Engineering, in press
Simo, J.C. and K.K. Wong [1989] Unconditionally stable algorithms for the orthogonal
group that exactly preserve energy and momentum, Int. J. Num. Meth. Eng. 31,
19–52.
Sjamaar, R. [1990] Singular Orbit Spaces in Riemannian and Symplectic Geometry. The-
sis, Utrecht.
Sjamaar, R. and E. Lerman [1991] Stratified symplectic spaces and reduction, Ann. of
Math. (to appear).
Slemrod, M. and J.E. Marsden [1985] Temporal and spatial chaos in a van der Waals
fluid due to periodic thermal fluctuations, Adv. Appl. Math. 6, 135–158.
Smale, S. [1967] Differentiable dynamical systems, Bull. Am. Math. Soc. 73, 747–817.
Smale, S. [1970] Topology and mechanics, Inv. Math. 10, 305–331, 11, 45–64.
Smale, S. [1980] The Mathematics of Time. Springer-Verlag.
Sorbie, K.S. and J.N. Murrell [1975] Analytical properties for triatomic molecules from
spectroscopic data, Mol. Phys. 29, 1387–1407.
Souriau, J.M. [1970] Structure des Systemes Dynamiques. Dunod, Paris.
Sreenath, N., Y.G. Oh, P.S. Krishnaprasad, and J.E. Marsden [1988] The dynamics of
coupled planar rigid bodies. Part 1: Reduction, equilibria and stability, Dyn. Stab.
Sys. 3, 25–49.
Sternberg, S. [1977] Minimal coupling and the symplectic mechanics of a classical particle
in the presence of a Yang–Mills field, Proc. Nat. Acad. Sci. 74, 5253–5254.
Stiefel, E. and C. Scheifele [1971] Linear and Regular Celestial Mechanics, Springer-
Verlag.
Stofer, D.M. [1987] Some Geometric and Numerical Methods for Perturbed Integrable
Systems. Thesis, Zurich.
Sudarshan, E.C.G. and N. Mukunda [1974] Classical Mechanics: A Modern Perspective.
Wiley, New York, 1974; Second Edition, Krieber, Melbourne–Florida, 1983.
Symes, W.W. [1980] Hamiltonian group actions and integrable systems, Physica D 1,
339–374.
References 220
Szeri, A.J. and P.J. Holmes [1988] Nonlinear stability of axisymmetric swirling flow, Phil.
Trans. R. Soc. Lon. A 326, 327–354.
Tannor, D.J. [1989] Jahn-Teller effects in the photodissociation of ozone. J. Am. Chem.
Assoc. 111, 2772–2776.
Tannor, D.J. and Y. Jin[1991] Design of femtosecond pulse sequences to control photo-
chemical products. Mode Selective Chemistry (ed. by P. Pullman, J. Tortner, and
R.D. Levine), Reidel.
Thomson, W (Lord Kelvin) and P.G. Tait [1879] Treatise on Natural Philosophy. Cam-
bridge University Press.
Thurston, W.P. and J.R. Weeks [1984] The mathematics of three-dimensional manifolds,
Scientific American 251, 108–120.
Turkington, B. [1989] Vortex rings with swirl: axisymmetric solutions of the Euler equa-
tions with nonzero helicity, SIAM J. Math. An. 20.
van der Meer, J.C. [1985] The Hamiltonian Hopf Bifurcation. Springer Lecture Notes in
Mathematics 1160.
van der Meer, J.C. [1990] Hamiltonian Hopf bifurcation with symmetry, Nonlinearity 3,
1041–1056.
van der Schaft, A. and D.E. Crouch [1987] Hamiltonian and self-adjoint control systems,
Sys. Cont. Lett. 8, 289–295.
van Gils, S.A., M. Krupa, and W.F. Langford [1990] Hopf bifurcation with non-semisimple
1:1 resonance, Nonlinearity 3, 825–830.
Walsh, G. [1991] Holonomy and satellite parking, preprint.
Wan, Y.H. [1986] The stability of rotating vortex patches, Comm. Math. Phys. 107,
1–20.
Wan, Y.H. [1988] Instability of vortex streets with small cores, Phys. Lett. A 127, 27–32.
Wan, Y.H. [1988] Desingularizations of systems of point vortices, Physica D 32, 277–295.
Wan, Y.H. [1988] Variational principles for Hill’s spherical vortex and nearly spherical
vortices, Trans. Am. Math. Soc. 308, 299–312.
Wan, Y.H. [1990] Nonlinear stability of stationary spherically symmetric models in stellar
dynamics, Arch. Rat. Mech. An. 112, 83–95.
Wan, Y.H. [1991] Codimension two bifurcations of symmeteic sycles in Hamiltonian sys-
tems with antisymplectic involutions, Dyn. Diff. Eqn’s. (to appear).
Wan, Y.H. and M. Pulvirente [1984] Nonlinear stability of circular vortex patches, Comm.
Math. Phys. 99, 435–450.
Wang, L.S., P.S. Krishnaprasad, and J.H. Maddocks [1991] Hamiltonian dynamics of a
rigid body in a central gravitaional field, Cel. Mech. Dyn. Astron. 50, 349–386.
Weinstein, A. [1977] Lectures on symplectic manifolds. CBMS Regional Conf. Ser. in
Math. 29, AMS, Providence, Rhode Island.
Weinstein, A. [1978a] A universal phase space for particles in Yang-Mills fields, Lett.
Math. Phys. 2, 417–420.
Weinstein, A. [1978b] Bifurcations and Hamilton’s principle, Math. Zeit. 159, 235–248.
Weinstein, A. [1983] Sophus Lie and symplectic geometry, Expo. Math. 1, 95–96.
References 221
Zwanziger, J.W., M. Koenig and A. Pines [1990] Berry’s phase, Ann. Rev. Phys. Chem.
41, 601–646.
Index
223
Index 224