Structure and Interpretation of Classical Mechanics

Our motivation for the development of Hamilton's equations was to focus attention on the quantities that can be conserved -- the momenta and the energy. In the Hamiltonian formulation the generalized configuration coordinates and the conjugate momenta comprise the state of the system at a given time. We know from the Lagrangian formulation that if the Lagrangian does not depend on some coordinate then the conjugate momentum is conserved. This is also true in the Hamiltonian formulation, but there is a distinct advantage to the Hamiltonian formulation. In the Lagrangian formulation the knowledge of the conserved momentum does not lead immediately to any simplification of the problem, but in the Hamiltonian formulation the fact that momenta are conserved gives an immediate reduction in the dimension of the system to be solved. In fact, if a coordinate does not appear in the Hamiltonian then the dimension of the system of coupled equations that remain to be solved is reduced by two -- the coordinate does not appear and the conjugate momentum is constant.

Let H(t, q, p) be a Hamiltonian for some problem with an n-dimensional configuration space and 2n-dimensional phase space. Suppose the Hamiltonian does not depend upon the ith coordinate qⁱ: (

₁ H)_i = 0.¹⁹ According to Hamilton's equations, the conjugate momentum p_i is conserved. Hamilton's equations of motion for the remaining 2n - 2 phase-space variables do not involve qⁱ (because it does not appear in the Hamiltonian), and p_i is a constant. Thus the dimension of the difficult part of the problem, the part that involves the solution of coupled ordinary differential equations, is reduced by two. The remaining equation governing the evolution of qⁱ in general depends on all the other variables, but once the reduced problem has been solved, the equation of motion for qⁱ can be written so as to give Dqⁱ explicitly as a function of time. We can then find qⁱ as a definite integral of this function.²⁰

Contrast this result with analogous results for more general systems of differential equations. There are two independent situations. One situation is that we know a constant of the motion. In general, constants of the motion can be used to reduce by one the dimension of the unsolved part of the problem. To see this, let the system of equations be

where m is the dimension of the system. Assume we know some constant of the motion

At least locally, we expect that we can use this equation to solve for z^m-1(t) in terms of all the other variables, and use this solution to eliminate the dependence on z^m-1(t). The first m - 1 equations then depend only upon the first m - 1 variables. The dimension of the system of equations to be solved is reduced by one. After the solution for the other variables has been found, z^m-1(t) can be found using the constant of the motion.

The second situation is that one of the variables, say zⁱ, does not appear in the equations of motion (but there is an equation for Dzⁱ). In this case the equations for the other variables form an independent set of equations of one dimension less than the original system. After these are solved, then the remaining equation for zⁱ can be solved by definite integration.

In both situations the dimension of the system of coupled equations is reduced by one. Hamilton's equations are different in that these two situations often come together. If a Hamiltonian for a system does not depend on a particular coordinate, then the equations of motion for the other coordinates and momenta do not depend on that coordinate. Furthermore, the momentum conjugate to that coordinate is a constant of the motion. An added benefit is that the use of this constant of the motion to reduce the dimension of the remaining equations is automatic in the Hamiltonian formulation. The conserved momentum is a state variable and just a parameter in the remaining equations.

So if there is a continuous symmetry it will probably be to our advantage to choose a coordinate system that explicitly incorporates the symmetry, making the Hamiltonian independent of a coordinate. Then the dimension of the phase space of the coupled system will be reduced by two for every coordinate that does not appear in the Hamiltonian.²¹

Motion in a central potential

Consider the motion of a particle of mass m in a central potential. A natural choice for generalized coordinates that reflects the symmetry is polar coordinates. A Lagrangian is (equation 1.67):

The momenta are p_r = m

and p = m r²

. The kinetic energy is a homogeneous quadratic form in the velocities, so the Hamiltonian is T + V with the velocities rewritten in terms of the momenta:

The potential energy depends on the distance from the origin, r, as does the kinetic energy in polar coordinates, but neither the potential energy nor the kinetic energy depends on the polar angle

. The angle

does not appear in the Lagrangian so we know that p, the momentum conjugate to

, is conserved along realizable trajectories. The fact that p is constant along realizable paths is expressed by one of Hamilton's equations. That p has a constant value is immediately made use of in the other Hamilton's equations: the remaining equations are a self-contained subsystem with constant p. To make a lower-dimensional subsystem in the Lagrangian formulation we have to use each conserved momentum to eliminate one of the other state variables, as we did for the axisymmetric top (see section 2.10).

We can check our derivations with the computer. A procedure implementing the Lagrangian has already been introduced (below equation 1.67). We can use this to get the Hamiltonian:

(show-expression ((Lagrangian->Hamiltonian (L-central-polar 'm (literal-function 'V))) (up 't (up 'r 'phi) (down 'p_r 'p_phi))))

(show-expression (((Hamilton-equations (Lagrangian->Hamiltonian (L-central-polar 'm (literal-function 'V)))) (up (literal-function 'r) (literal-function 'phi)) (down (literal-function 'p_r) (literal-function 'p_phi))) 't))

Axisymmetric top

We reconsider the axisymmetric top (see section 2.10) from the Hamiltonian point of view. Recall that a top is a rotating rigid body, one point of which is fixed in space. The center of mass is not at the fixed point, and there is a uniform gravitational field. An axisymmetric top is a top with an axis of symmetry. We consider here an axisymmetric top with the fixed point on the symmetry axis.

The axisymmetric top has two continuous symmetries we would like to exploit. It has the symmetry that neither the kinetic nor potential energy is sensitive to the orientation of the top about the symmetry axis. The kinetic and potential energy are also insensitive to a rotation of the physical system about the vertical axis, because the gravitational field is uniform. We take advantage of these symmetries by choosing coordinates that naturally express them. We already have an appropriate coordinate system that does the job -- the Euler angles. We choose the reference orientation of the top so that the symmetry axis is vertical. The first Euler angle

expresses a rotation about the symmetry axis. The next Euler angle

is the tilt of the symmetry axis of the top from the vertical. The third Euler angle

expresses a rotation of the top about the fixed z axis. The symmetries of the problem imply that the first and third Euler angles do not appear in the Hamiltonian. As a consequence the momenta conjugate to these angles are conserved quantities. The problem of determining the motion of the axisymmetric top is reduced to the problem of determining the evolution of

and p. Let's work out the details.

In terms of Euler angles, a Lagrangian for the axisymmetric top is (see section 2.10):

(define ((L-axisymmetric-top A C gMR) local) (let ((q (coordinate local)) (qdot (velocity local))) (let ((theta (ref q 0)) (thetadot (ref qdot 0)) (phidot (ref qdot 1)) (psidot (ref qdot 2))) (+ (* 1/2 A (+ (square thetadot) (square (* phidot (sin theta))))) (* 1/2 C (square (+ psidot (* phidot (cos theta))))) (* -1 gMR (cos theta))))))

where gMR is the product of the gravitational acceleration, the mass of the top, and the distance from the point of support to the center of mass. The Hamiltonian is nicer than we have a right to expect:

(show-expression ((Lagrangian->Hamiltonian (L-axisymmetric-top 'A 'C 'gMR)) (up 't (up 'theta 'phi 'psi) (down 'p_theta 'p_phi 'p_psi))))

Note that the angles

and

do not appear in the Hamiltonian, as expected. Thus the momenta p and p are constants of the motion.

For given values of p and p we must determine the evolution of

and p. The effective Hamiltonian for

and p has one degree of freedom, and does not involve the time. Thus the value of the Hamiltonian is conserved along realizable trajectories. So the trajectories of

and p trace contours of the effective Hamiltonian. This gives us a big picture of the possible types of motion and their relationship, for given values of p and p.

If the top is standing vertically then p = p. Let's concentrate on the case that p = p, and define p = p = p. The effective Hamiltonian becomes (after a little trigonometric simplification)

If p is large, U_p has a single minimum at

= 0, as seen in figure 3.5. For small p there is a minimum for finite positive

and a symmetrical minimum for negative

; there is a local maximum at

= 0. There is a critical value of p at which

= 0 changes from a minimum to a local maximum. Denote the critical value by p_c. A simple calculation shows p_c = (4 g M R A)^1/2. For

= 0 we have p = C

, where

is the rotation rate. Thus to p_c there corresponds a critical rotation rate

For

_c the top can stand vertically; for

_c the top falls if slightly displaced from the vertical. The top that stands vertically is called the ``sleeping'' top. For a more realistic top, friction gradually slows the rotation; the rotation rate eventually falls below the critical rotation rate and the top ``wakes up.''

We get additional insight into the sleeping top and the awake top by looking at the trajectories in the

, p phase plane. The trajectories in this plane are simply contours of the Hamiltonian, because the Hamiltonian is conserved. Figure 3.6 shows a phase portrait for

_c. All of the trajectories are loops around the vertical (

= 0). Displacing the top slightly from the vertical simply places the top on a nearby loop, so the top stays nearly vertical. Figure 3.7 shows the phase portrait for

_c. Here the vertical position is an unstable equilibrium. The trajectories that approach the vertical are asymptotic -- they take an infinite amount of time to reach it, just as a pendulum with just the right initial conditions can approach the vertical but never reach it. If the top is displaced slightly from the vertical then the trajectories loop around another center with nonzero

. A top started at the center point of the loop stays there, and one started near this equilibrium point loops stably around it. Thus we see that when the top ``wakes up'' the vertical is unstable, but the top does not fall to the ground. Rather, it oscillates around a new equilibrium.

It is also interesting to consider the axisymmetric top when p ne p. Consider the case p > p. Some trajectories in the

, p plane are shown in figure 3.8. Note that in this case trajectories do not go through

= 0. The phase portrait for p < p is similar and is not shown.

We have reduced the motion of the axisymmetric top to quadratures by choosing coordinates that express the symmetries. It turns out that the resulting integrals can be expressed in terms of elliptic functions. Thus, the axisymmetric top can be solved analytically. We do not dwell on this solution because it is not very illuminating: since most problems cannot be solved analytically, there is little profit in dwelling on the analytic solution of one of the rare problems that is analytically solvable. Rather, we have focused on the geometry of the solutions in the phase space and the use of integrals to reduce the dimension of the problem. With the phase-space portrait we have found some interesting qualitative features of the motion of the top.

Exercise 3.8. Sleeping top
Verify that the critical angular velocity above which an axisymmetric top can sleep is given by equation (3.104).

3.4.1 Lagrangian Reduction

Suppose there are cyclic coordinates. In the Hamiltonian formulation, the equations of motion for the coordinates and momenta for the other degrees of freedom form a self-contained subsystem in which the momenta conjugate to the cyclic coordinates are parameters. We can form a Lagrangian for this subsystem by performing a Legendre transform of the reduced Hamiltonian. Alternatively, we can start with the full Lagrangian and perform a Legendre transform for only those coordinates that are cyclic. The equations of motion are Hamilton's equations for those variables that are transformed and Lagrange's equations for the others. The momenta conjugate to the cyclic coordinates are conserved and can be treated as parameters in the Lagrangian for the remaining coordinates.

Divide the tuple q of coordinates into two subtuples q = (x, y). Assume L(t; x, y; v_x, v_y) is a Lagrangian for the system. Define the Routhian R as the Legendre transform of L with respect to the v_y slot:

To define the function R we must solve equation (3.105) for v_y in terms of the other variables, and substitute this into equation (3.105).

Now suppose that the Lagrangian is cyclic in y. Then

_1,1 L =

_1,1 R = 0, and p_y(t) is a constant c on any realizable path. Equation (3.113) does not depend on y, by assumption, and we can replace p_y by its constant value c. So equation (3.113) forms a closed subsystem for the path x. The Lagrangian L_c

describes the motion of the subsystem (the minus sign is introduced for convenience, and ¤ indicates that the function's value is independent of this argument). The path y can be found by integrating equation (3.114) using the independently determined path x.

The realizable paths x satisfy the Lagrange equations with the Lagrangian L_c, so the action S'_c is stationary with respect to variations

of x that are zero at the end times:

For realizable paths q the action S[q](t₁, t₂) is stationary with respect to variations

of q that are zero at the end times. Along these paths the momentum p_y(t) has the constant value c. For these same paths the action S'_c[x](t₁, t₂) is stationary with respect to variations

of x that are zero at the end times. The dimension of

is smaller than the dimension of

Exercise 3.9. Routhian equations of motion
Verify that the equations of motion are given by equations (3.113-3.115).

¹⁹ If a Lagrangian does not depend on a particular coordinate then neither does the corresponding Hamiltonian, because the coordinate is a passive variable in the Legendre transform. Such a Hamiltonian is said to be cyclic in that coordinate.

²⁰ Traditionally, when a problem has been reduced to the evaluation of a definite integral it is said to be reduced to a ``quadrature.'' Thus, the determination of the evolution of a cyclic coordinate qⁱ is reduced to a problem of quadrature.

²¹ It is not always possible to choose a set of generalized coordinates in which all symmetries are simultaneously manifest. For these systems, the reduction of the phase space is more complicated. We have already encountered such a problem: the motion of a free rigid body. The system is invariant under rotation about any axis, yet no single coordinate system can reflect this symmetry. Nevertheless, we have already found that the dynamics is described by a system of lower dimension that the full phase space: the Euler equations.

3.4 Phase Space Reduction

Motion in a central potential

Axisymmetric top

3.4.1 Lagrangian Reduction