fix(HJ): A.08 — Move Binet comparison up, cross-refs U3/U6, centrifugal barrier, conic geometry

This commit is contained in:
2026-05-02 12:43:24 -05:00
parent 4f8205da23
commit db7585d12c

View File

@@ -1,30 +1,36 @@
\subsection{The Kepler Problem} \subsection{The Kepler Problem}
This subsection treats the inverse-square central potential $V(r) = -k/r$ through the Hamilton-- Jacobi formalism, derives conic-section orbits from Jacobi's theorem, and uses action-- angle variables to recover Kepler's third law and the degeneracy that makes bound orbits close. This subsection treats the inverse-square central potential $V(r) = -k/r$ through the Hamilton--Jacobi formalism. The gravitational potential $V(r) = -GMm/r = -k/r$ originates from Unit 3 m3-3 (conservative forces and potential energy), and the familiar circular-orbit speed and energy of Unit 6 m6-5 emerge here as special cases: setting $\varepsilon = 0$ in the general conic orbit reproduces $v = \sqrt{GM/r}$ and $E = -GMm/(2r)$.
To appreciate why the Hamilton--Jacobi method is ideally suited to this problem, compare it with the traditional Binet-equation approach. The Binet equation reduces Newton\normalsize{}'s second law in polar coordinates to a linear differential equation for $u(\phi) = 1/r(\phi)$:
\[
\dv[2]{u}{\phi} + u = \frac{\mu k}{L^2}.
\]
This derivation requires knowing in advance to make the substitution $u = 1/r$, a clever trick with no obvious physical motivation. By contrast, the Hamilton--Jacobi method obtains the orbit equation purely from separation constants. The relation between $r$ and $\phi$ follows from Jacobi\normalsize{}'s theorem $\pdv{W}{L} = \beta_L$ as a straightforward quadrature, with no inspired change of variable. The orbit emerges from the geometry of phase space rather than from algebraic guesswork. Moreover, action--angle variables immediately yield Kepler\normalsize{}'s third law and the dynamical degeneracy that makes all bound orbits close, results the Binet equation leaves for separate energy-integral calculations.
\dfn{Kepler Hamiltonian}{ \dfn{Kepler Hamiltonian}{
Consider the central potential $V(r) = -k/r$ where $k = GM\mu$ with $G$ the gravitational constant, $M$ the mass of the central body, and $\mu$ the reduced mass of the two-- body system. In spherical coordinates $(r,\theta,\phi)$ the kinetic energy is $T = \tfrac{1}{2}\mu(\dot{r}^2 + r^2\dot{\theta}^2 + r^2\sin^2\theta\,\dot{\phi}^2)$, and the canonical momenta are $p_r = \mu\dot{r}$, $p_\theta = \mu r^2\dot{\theta}$, $p_\phi = \mu r^2\sin^2\theta\,\dot{\phi}$. The Legendre transform yields the Hamiltonian: Consider the central potential $V(r) = -k/r$ where $k = GM\mu$ with $G$ the gravitational constant, $M$ the mass of the central body, and $\mu$ the reduced mass of the two--body system. From A.02, the scale factors for spherical coordinates $(r,\theta,\phi)$ are $h_r = 1$, $h_\theta = r$, $h_\phi = r\sin\theta$, giving the kinetic energy $T = \tfrac{1}{2}\mu(\dot{r}^2 + r^2\dot{\theta}^2 + r^2\sin^2\theta\,\dot{\phi}^2)$. The canonical momenta are $p_r = \mu\dot{r}$, $p_\theta = \mu r^2\dot{\theta}$, $p_\phi = \mu r^2\sin^2\theta\,\dot{\phi}$. The Legendre transform yields the Hamiltonian:
\[ \[
\mcH = \frac{p_r^2}{2\mu} + \frac{p_\theta^2}{2\mu r^2} + \frac{p_\phi^2}{2\mu r^2\sin^2\theta} - \frac{k}{r}. \mcH = \frac{p_r^2}{2\mu} + \frac{p_\theta^2}{2\mu r^2} + \frac{p_\phi^2}{2\mu r^2\sin^2\theta} - \frac{k}{r}.
\] \]
The Hamilton-- Jacobi equation for the principal function $\mcS(r,\theta,\phi,t)$ is The Hamilton--Jacobi equation for the principal function $\mcS(r,\theta,\phi,t)$ is
\[ \[
\frac{1}{2\mu}\left(\pdv{\mcS}{r}\right)^2 \frac{1}{2\mu}\left(\pdv{\mcS}{r}\right)^2
+ \frac{1}{2\mu r^2}\left(\pdv{\mcS}{\theta}\right)^2 + \frac{1}{2\mu r^2}\left(\pdv{\mcS}{\theta}\right)^2
+ \frac{1}{2\mu r^2\sin^2\theta}\left(\pdv{\mcS}{\phi}\right)^2 + \frac{1}{2\mu r^2\sin^2\theta}\left(\pdv{\mcS}{\phi}\right)^2
- \frac{k}{r} + \pdv{\mcS}{t} = 0. - \frac{k}{r} + \pdv{\mcS}{t} = 0.
\] \]
Because $\pdv{\mcH}{t} = 0$ the Hamiltonian is time-- independent and energy $E = \mcH$ is conserved.} Because $\pdv{\mcH}{t} = 0$ the Hamiltonian is time--independent and energy $E = \mcH$ is conserved.}
\nt{Two-- body reduction and reduced mass}{ \nt{Two--body reduction and reduced mass}{
A system of two bodies with masses $M$ and $m$ interacting through a central potential depends only on the distance between them. Introducing the relative coordinate $\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$ and the center of mass $\mathbf{R} = (M\mathbf{r}_1 + m\mathbf{r}_2)/(M+m)$, the Lagrangian splits into the free motion of the center of mass and the relative motion with the reduced mass $\mu = Mm/(M+m)$. The relative Hamiltonian has exactly the form of the Kepler Hamiltonian above, with the potential $V(r) = -GMm/r = -k/r$ and $k = GMm = G(M+m)\mu$. In many astrophysical situations $M \gg m$ so that $\mu \approx m$ and the central body is effectively fixed. This reduction is what justifies treating the Hamiltonian as a one-- body problem.} A system of two bodies with masses $M$ and $m$ interacting through a central potential depends only on the distance between them. Introducing the relative coordinate $\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$ and the center of mass $\mathbf{R} = (M\mathbf{r}_1 + m\mathbf{r}_2)/(M+m)$, the Lagrangian splits into the free motion of the center of mass and the relative motion with the reduced mass $\mu = Mm/(M+m)$. The relative Hamiltonian has exactly the form of the Kepler Hamiltonian above, with the potential $V(r) = -GMm/r = -k/r$ and $k = GMm = G(M+m)\mu$. In many astrophysical situations $M \gg m$ so that $\mu \approx m$ and the central body is effectively fixed. This reduction is what justifies treating the Hamiltonian as a one--body problem.}
\thm{Separated Hamilton-- Jacobi equations for the Kepler problem}{ \thm{Separated Hamilton--Jacobi equations for the Kepler problem}{
With the separation ansatz With the separation ansatz
\[ \[
\mcS(r,\theta,\phi,t) = W_r(r) + W_\theta(\theta) + L_z\phi - Et, \mcS(r,\theta,\phi,t) = W_r(r) + W_\theta(\theta) + L_z\phi - Et,
\] \]
the Hamilton-- Jacobi equation breaks into three ordinary differential equations. The azimuthal equation is the Hamilton--Jacobi equation breaks into three ordinary differential equations. The azimuthal equation is
\[ \[
\pdv{\mcS}{\phi} = L_z, \pdv{\mcS}{\phi} = L_z,
\] \]
@@ -32,21 +38,21 @@ a constant. The polar angular equation is
\[ \[
\left(\der{W_\theta}{\theta}\right)^2 + \frac{L_z^2}{\sin^2\theta} = L^2, \left(\der{W_\theta}{\theta}\right)^2 + \frac{L_z^2}{\sin^2\theta} = L^2,
\] \]
where $L$ is the total-- angular-- momentum separation constant. The radial equation is where $L$ is the total--angular--momentum separation constant. The radial equation is
\[ \[
\left(\der{W_r}{r}\right)^2 = 2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}. \left(\der{W_r}{r}\right)^2 = 2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}.
\] \]
The three constants of motion $E$, $L$, and $L_z$ provide the complete integral required by Jacobi's theorem.} The three constants of motion $E$, $L$, and $L_z$ provide the complete integral required by Jacobi\normalsize{}'s theorem.}
\pf{Derivation of the separated equations from the full HJ PDE}{ \pf{Derivation of the separated equations from the full HJ PDE}{
Begin by eliminating the time dependence. Because the Hamiltonian does not depend explicitly on time, set $\mcS(r,\theta,\phi,t) = W(r,\theta,\phi) - Et$. The time derivative contributes $-E$ and the Hamilton-- Jacobi equation becomes Begin by eliminating the time dependence. Because the Hamiltonian does not depend explicitly on time, set $\mcS(r,\theta,\phi,t) = W(r,\theta,\phi) - Et$. The time derivative contributes $-E$ and the Hamilton--Jacobi equation becomes
\[ \[
\frac{1}{2\mu}\left(\pdv{W}{r}\right)^2 \frac{1}{2\mu}\left(\pdv{W}{r}\right)^2
+ \frac{1}{2\mu r^2}\left(\pdv{W}{\theta}\right)^2 + \frac{1}{2\mu r^2}\left(\pdv{W}{\theta}\right)^2
+ \frac{1}{2\mu r^2\sin^2\theta}\left(\pdv{W}{\phi}\right)^2 + \frac{1}{2\mu r^2\sin^2\theta}\left(\pdv{W}{\phi}\right)^2
- \frac{k}{r} = E. - \frac{k}{r} = E.
\] \]
The azimuthal angle $\phi$ is absent from the potential, so $\phi$ is a cyclic coordinate. Write $W = W_{r\theta}(r,\theta) + W_\phi(\phi)$, and because the $\phi$-- term appears only through $\pdv{W}{\phi}$ it must be a constant: The azimuthal angle $\phi$ is absent from the potential, so $\phi$ is a cyclic coordinate. Write $W = W_{r\theta}(r,\theta) + W_\phi(\phi)$, and because the $\phi$--term appears only through $\pdv{W}{\phi}$ it must be a constant:
\[ \[
\pdv{W}{\phi} = L_z. \pdv{W}{\phi} = L_z.
\] \]
@@ -57,7 +63,7 @@ This is the canonical momentum conjugate to $\phi$ and equals the $z$\-component
\] \]
Multiply by $2\mu r^2$: Multiply by $2\mu r^2$:
\[ \[
r^2\left(\pdv{W_{r\theta}}{r}\right)^2 - \frac{2\mu k r}{1} - 2\mu E r^2 r^2\left(\pdv{W_{r\theta}}{r}\right)^2 - 2\mu k r - 2\mu E r^2
= -\left(\pdv{W_{r\theta}}{\theta}\right)^2 - \frac{L_z^2}{\sin^2\theta}. = -\left(\pdv{W_{r\theta}}{\theta}\right)^2 - \frac{L_z^2}{\sin^2\theta}.
\] \]
The left side depends only on $r$ and the right side depends only on $\theta$. Each must therefore equal a constant, which we call the separation constant $L^2$ because it will be identified with the square of the total angular momentum: The left side depends only on $r$ and the right side depends only on $\theta$. Each must therefore equal a constant, which we call the separation constant $L^2$ because it will be identified with the square of the total angular momentum:
@@ -67,7 +73,7 @@ The left side depends only on $r$ and the right side depends only on $\theta$. E
\[ \[
r^2\left(\pdv{W_{r\theta}}{r}\right)^2 - 2\mu k r - 2\mu E r^2 = -L^2. r^2\left(\pdv{W_{r\theta}}{r}\right)^2 - 2\mu k r - 2\mu E r^2 = -L^2.
\] \]
Assume additive separation $W_{r\theta}(r,\theta) = W_r(r) + W_\theta(\theta)$. The $\theta$-- equation becomes Assume additive separation $W_{r\theta}(r,\theta) = W_r(r) + W_\theta(\theta)$. The $\theta$--equation becomes
\[ \[
\left(\der{W_\theta}{\theta}\right)^2 + \frac{L_z^2}{\sin^2\theta} = L^2, \left(\der{W_\theta}{\theta}\right)^2 + \frac{L_z^2}{\sin^2\theta} = L^2,
\] \]
@@ -79,14 +85,21 @@ which rearranges to
\[ \[
\left(\der{W_r}{r}\right)^2 = 2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}. \left(\der{W_r}{r}\right)^2 = 2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}.
\] \]
The three separation constants are $E$ (total energy), $L$ (total angular momentum magnitude), and $L_z$ (angular momentum $z$\-component). Together with Jacobi's theorem, these equations determine the trajectory without solving any second-- order differential equation.} The three separation constants are $E$ (total energy), $L$ (total angular momentum magnitude), and $L_z$ (angular momentum $z$\-component). Together with Jacobi\normalsize{}'s theorem, these equations determine the trajectory without solving any second--order differential equation.}
\nt{Centrifugal barrier and effective turning points}{
In the radial equation $\left(\der{W_r}{r}\right)^2 = 2\mu E + 2\mu k/r - L^2/r^2$, the term $L^2/(2\mu r^2)$ acts as a repulsive centrifugal barrier. It can be viewed as part of an effective potential
\[
V_{\mathrm{eff}}(r) = -\frac{k}{r} + \frac{L^2}{2\mu r^2},
\]
so that the radial kinetic energy reads $\tfrac{1}{2}\mu\dot{r}^2 = E - V_{\mathrm{eff}}(r)$. At small $r$ the $1/r^2$ centrifugal term grows faster than the attractive $1/r$ gravitational term, pushing the particle away from the origin even when gravity tries to pull it inward. The turning points of the motion occur where $p_r = 0$, i.e., where $E = V_{\mathrm{eff}}(r)$. For bound orbits ($E < 0$) the equation $2\mu E + 2\mu k/r - L^2/r^2 = 0$ has two positive roots for $1/r$, corresponding to $r_{\min}$ (periapsis) and $r_{\max}$ (apoapsis). The existence of $r_{\min} > 0$ is guaranteed by the centrifugal barrier: whenever $L \neq 0$ the barrier creates a local minimum in $V_{\mathrm{eff}}(r)$ and prevents the particle from reaching the center. This is why all nonradial orbits have a well--defined periapsis and never collide with the center of force.}
\thm{Orbit equation for the Kepler problem}{ \thm{Orbit equation for the Kepler problem}{
The trajectory $r(\phi)$ of a particle moving in the potential $V(r) = -k/r$ is a conic section: The trajectory $r(\phi)$ of a particle moving in the potential $V(r) = -k/r$ is a conic section:
\[ \[
r(\phi) = \frac{\ell}{1 + \varepsilon\cos(\phi - \phi_0)}, r(\phi) = \frac{\ell}{1 + \varepsilon\cos(\phi - \phi_0)},
\] \]
where the semi\-latus rectum $\ell$ and the eccentricity $\varepsilon$ are determined by the constants of motion: where the semi--latus rectum $\ell$ and the eccentricity $\varepsilon$ are determined by the constants of motion:
\[ \[
\ell = \frac{L^2}{\mu k}, \ell = \frac{L^2}{\mu k},
\qquad \qquad
@@ -94,10 +107,10 @@ where the semi\-latus rectum $\ell$ and the eccentricity $\varepsilon$ are deter
\qquad \qquad
\phi_0 = \text{constant}. \phi_0 = \text{constant}.
\] \]
The angle $\phi_0$ fixes the orientation of the conic in the orbital plane.} Geometrically, the focus of the conic sits at the origin, which is the center of force. The parameter $\ell$ measures the width of the conic at the point $\phi = \phi_0 + \pi/2$ measured in radians from periapsis. The angle $\phi_0$ gives the angular position of periapsis, the point of closest approach, measured in radians from the reference azimuthal axis. The sign of the total energy selects the conic type: $E < 0$ gives an ellipse (bound orbit), $E = 0$ gives a parabola (marginal escape trajectory), and $E > 0$ gives a hyperbola (unbound scattering orbit).}
\pf{Derivation of the orbit from Jacobi's theorem}{ \pf{Derivation of the orbit from Jacobi\normalsize{}'s theorem}{
Because the motion takes place in a fixed plane (the plane normal to the angular momentum vector), we may choose the orbital plane as $\theta = \pi/2$. In this plane $\sin\theta = 1$ and the radial momentum equals $p_r = \der{W_r}{r}$. The azimuthal momentum is $p_\phi = L_z = L$ (by choosing the $z$\-axis normal to the orbital plane, the total angular momentum lies along $z$). Jacobi's theorem states that the derivatives of the characteristic function $W$ with respect to the separation constants are themselves constants determined by the initial conditions: Because the motion takes place in a fixed plane (the plane normal to the angular momentum vector), we may choose the orbital plane as $\theta = \pi/2$. In this plane $\sin\theta = 1$ and the radial momentum equals $p_r = \der{W_r}{r}$. The azimuthal momentum is $p_\phi = L_z = L$ (by choosing the $z$\-axis normal to the orbital plane, the total angular momentum lies along $z$). Jacobi\normalsize{}'s theorem states that the derivatives of the characteristic function $W$ with respect to the separation constants are themselves constants determined by the initial conditions:
\[ \[
\pdv{W}{E} = \beta_E, \pdv{W}{E} = \beta_E,
\qquad \qquad
@@ -105,13 +118,20 @@ Because the motion takes place in a fixed plane (the plane normal to the angular
\qquad \qquad
\pdv{W}{L_z} = \beta_{L_z}. \pdv{W}{L_z} = \beta_{L_z}.
\] \]
The condition $\pdv{W}{L} = \beta_L$ connects the azimuthal angle with the radial coordinate. We have $W = W_r(r) + W_\theta(\theta) + L_z\phi$. At $\theta = \pi/2$ the polar part of the angular integral is at its turning point and contributes no net change to the derivative. The dependence of $W$ on $L$ enters through $W_r$, where $L$ appears in the effective-potential term $-L^2/r^2$, and through the azimuthal term via $L_z = L$ (since for planar motion all angular momentum lies in the $z$-direction). Differentiating $W_r$ with respect to $L$: The condition $\pdv{W}{L} = \beta_L$ connects the azimuthal angle with the radial coordinate. We have $W = W_r(r) + W_\theta(\theta) + L_z\phi$. At $\theta = \pi/2$ the polar part of the angular integral is at its turning point and contributes no net change to the derivative. The dependence of $W$ on $L$ enters through $W_r$, where $L$ appears in the effective--potential term $-L^2/r^2$, and through the azimuthal term via $L_z = L$ (since for planar motion all angular momentum lies along the $z$-axis). We evaluate $\pdv{W_r}{L}$ explicitly. Write
\[ \[
\pdv{W_r}{L} = \int\frac{1}{2\der{W_r}{r}}\cdot\pdv{(\der{W_r}{r})^2}{L}\dd r W_r(r) = \int \sqrt{2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}}\,\dd r.
= \int\frac{1}{2p_r}\cdot\left(-\frac{2L}{r^2}\right)\dd r \]
The integrand depends on the parameter $L$ through the term $-L^2/r^2$. By the Leibniz integral rule, differentiating with respect to a parameter inside the integral gives
\[
\pdv{W_r}{L} = \int \pdv{}{L}\left[\sqrt{2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2}}\right]\dd r.
\]
Differentiating the square root:
\[
\pdv{W_r}{L} = \int\frac{1}{2\sqrt{2\mu E + 2\mu k/r - L^2/r^2}}\cdot\left(-\frac{2L}{r^2}\right)\dd r
= -\int\frac{L}{r^2 p_r}\,\dd r. = -\int\frac{L}{r^2 p_r}\,\dd r.
\] \]
The key observation is that the same integral appears in the relation between $\phi$ and $r$. From Hamilton's equations or from the $\phi$-- part of Jacobi's theorem, the azimuthal advance per radial step is The key observation is that the same integral appears in the relation between $\phi$ and $r$. From Hamilton\normalsize{}'s equations or from the $\phi$--part of Jacobi\normalsize{}'s theorem, the azimuthal advance per radial step is
\[ \[
\mathrm{d}\phi = \frac{p_\phi}{\mu r^2}\frac{\dd t}{1} \mathrm{d}\phi = \frac{p_\phi}{\mu r^2}\frac{\dd t}{1}
= \frac{L}{\mu r^2}\frac{\dd r}{p_r/\mu} = \frac{L}{\mu r^2}\frac{\dd r}{p_r/\mu}
@@ -161,7 +181,7 @@ r\bigl(1 + \varepsilon\cos(\phi - \phi_0)\bigr) = \ell,
\[ \[
r(\phi) = \frac{\ell}{1 + \varepsilon\cos(\phi - \phi_0)}. r(\phi) = \frac{\ell}{1 + \varepsilon\cos(\phi - \phi_0)}.
\] \]
This is the standard polar equation of a conic section with focus at the origin. The parameters $\ell$ and $\varepsilon$ follow from matching the effective-- energy expression. The radial turning points occur when $p_r = 0$: This is the standard polar equation of a conic section with focus at the origin. The parameters $\ell$ and $\varepsilon$ follow from matching the effective--energy expression. The radial turning points occur when $p_r = 0$:
\[ \[
2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2} = 0, 2\mu E + \frac{2\mu k}{r} - \frac{L^2}{r^2} = 0,
\qquad \qquad
@@ -169,28 +189,32 @@ r^2 + \frac{\mu k}{\mu E}\,r - \frac{L^2}{2\mu E} = 0.
\] \]
Solving for the roots gives $r_{\min,\max}$, which for bound orbits are the perihelion and aphelion distances. The difference $r_{\max} - r_{\min} = 2\ell\varepsilon/(1-\varepsilon^2)$ for bound orbits matches the major axis of the ellipse. Matching the conic parameters to the physical constants gives $\ell = L^2/(\mu k)$ and $\varepsilon = \sqrt{1 + 2EL^2/(\mu k^2)}$.} Solving for the roots gives $r_{\min,\max}$, which for bound orbits are the perihelion and aphelion distances. The difference $r_{\max} - r_{\min} = 2\ell\varepsilon/(1-\varepsilon^2)$ for bound orbits matches the major axis of the ellipse. Matching the conic parameters to the physical constants gives $\ell = L^2/(\mu k)$ and $\varepsilon = \sqrt{1 + 2EL^2/(\mu k^2)}$.}
\mprop{Classification of conic-- section orbits by eccentricity}{ \mprop{Classification of conic--section orbits by eccentricity}{
The eccentricity $\varepsilon = \sqrt{1 + 2EL^2/(\mu k^2)}$ determines the shape of the orbit $r(\phi) = \ell/(1 + \varepsilon\cos(\phi - \phi_0))$. The orbit is The eccentricity $\varepsilon = \sqrt{1 + 2EL^2/(\mu k^2)}$ determines the shape of the orbit $r(\phi) = \ell/(1 + \varepsilon\cos(\phi - \phi_0))$. The orbit is
\begin{enumerate}[label=\bfseries\tiny\protect\circled{\small\arabic*}] \begin{enumerate}[label=\bfseries\tiny\protect\circled{\small\arabic*}]
\item An ellipse when $\varepsilon < 1$. This corresponds to $-k^2\mu/(2L^2) < E < 0$ and $L \neq 0$. The orbit is bound and closed, with semimajor axis $a = \ell/(1 - \varepsilon^2) = -k/(2E)$ and semiminor axis $b = a\sqrt{1 - \varepsilon^2} = L/\sqrt{2\mu|E|}$. The period of one complete revolution is $T = 2\pi\sqrt{\mu a^3/k}$. \item An ellipse when $\varepsilon < 1$. This corresponds to $-k^2\mu/(2L^2) < E < 0$ and $L \neq 0$. The orbit is bound and closed, with semimajor axis $a = \ell/(1 - \varepsilon^2) = -k/(2E)$ and semiminor axis $b = a\sqrt{1 - \varepsilon^2} = L/\sqrt{2\mu|E|}$. The period of one complete revolution is $T = 2\pi\sqrt{\mu a^3/k}$. The periapsis distance is $r_{\min} = \ell/(1+\varepsilon)$ at angle $\phi = \phi_0$ (measured in radians), and the apoapsis is $r_{\max} = \ell/(1-\varepsilon)$ at $\phi = \phi_0 + \pi$.
\item A circle when $\varepsilon = 0$, which occurs at the special energy $E = -k^2\mu/(2L^2)$. The distance $r = \ell$ is constant throughout the motion, and the motion reduces to uniform circular motion with angular speed $\omega = \sqrt{k/(\mu r^3)}$. \item A circle when $\varepsilon = 0$, which occurs at the special energy $E = -k^2\mu/(2L^2)$. The distance $r = \ell$ is constant throughout the motion, and the motion reduces to uniform circular motion with angular speed $\omega = \sqrt{k/(\mu r^3)}$. This matches the circular-orbit results of Unit 6 m6-5.
\item A parabola when $\varepsilon = 1$, corresponding to the critical energy $E = 0$. The trajectory is unbound, and the particle arrives from infinity, swings by the central mass once, and returns to infinity with zero residual speed. \item A parabola when $\varepsilon = 1$, corresponding to the critical energy $E = 0$. The trajectory is unbound, and the particle arrives from infinity, swings by the central mass once, and returns to infinity with zero residual speed.
\item A hyperbola when $\varepsilon > 1$, corresponding to $E > 0$. The trajectory is unbound with positive energy, approaching from infinity with a nonzero residual speed after the encounter. The angle between the two asymptotes of the hyperbola is $2\arccos(-1/\varepsilon)$. \item A hyperbola when $\varepsilon > 1$, corresponding to $E > 0$. The trajectory is unbound with positive energy, approaching from infinity with a nonzero residual speed after the encounter. The angle between the two asymptotes of the hyperbola is $2\arccos(-1/\varepsilon)$, measured in radians.
\end{enumerate}} \end{enumerate}}
\nt{Action-- angle variables and Kepler's third law}{ \nt{Action--angle variables and Kepler\normalsize{}'s third law}{
The three independent action variables for the Kepler problem are computed by integrating the appropriate momenta over one complete cycle of each coordinate. For the azimuthal coordinate, $J_\phi = \oint p_\phi\,\mathrm{d}\phi = 2\pi L_z$. For the polar coordinate, $J_\theta = \oint p_\theta\,\mathrm{d}\theta = 2\pi(L - |L_z|)$. For the radial coordinate, the integral $J_r = \oint p_r\,\dd r$ requires careful evaluation between the two radial turning points for bound orbits ($E < 0$). The result is $J_r = 2\pi(-L + k\sqrt{\mu/(2|E|)})$. Adding all three actions eliminates the angular-- momentum dependence: The three independent action variables for the Kepler problem are computed by integrating the appropriate momenta over one complete cycle of each coordinate. For the azimuthal coordinate, $J_\phi = \oint p_\phi\,\mathrm{d}\phi = 2\pi L_z$. For the polar coordinate, $J_\theta = \oint p_\theta\,\mathrm{d}\theta = 2\pi(L - |L_z|)$. For the radial coordinate, the integral $J_r = \oint p_r\,\dd r$ requires careful evaluation between the two radial turning points for bound orbits ($E < 0$). The result is $J_r = 2\pi(-L + k\sqrt{\mu/(2|E|)})$.
Each action variable carries a distinct physical meaning. The azimuthal action $J_\phi = 2\pi L_z$ measures the conserved rotation about the vertical axis: it is the circulation of angular momentum along the $\phi$ direction and sets the rate of azimuthal precession. The polar action $J_\theta = 2\pi(L - |L_z|)$ measures the inclination of the orbital plane: when the orbit is equatorial ($L = |L_z|$) we have $J_\theta = 0$, and larger values of $L - |L_z|$ correspond to a more tilted orbit with greater polar oscillation between $\theta_{\min}$ and $\pi - \theta_{\min}$. The radial action $J_r = 2\pi(-L + k\sqrt{\mu/(2|E|)})$ measures the extent of the radial excursion between periapsis and apoapsis: for a circular orbit $J_r = 0$ (no radial oscillation), and for highly eccentric orbits $J_r$ grows as the particle swings farther from the center.
Adding all three actions eliminates the angular--momentum dependence:
\[ \[
J_{\mathrm{tot}} = J_r + J_\theta + J_\phi J_{\mathrm{tot}} = J_r + J_\theta + J_\phi
= 2\pi k\sqrt{\frac{\mu}{2|E|}}. = 2\pi k\sqrt{\frac{\mu}{2|E|}}.
\] \]
Inverting this expression gives $|E| = 2\pi^2\mu k^2/J_{\mathrm{tot}}^2$, which expresses the energy in terms of the single total action $J_{\mathrm{tot}}$. Because $E$ depends on $J_{\mathrm{tot}} = J_r + J_\theta + J_\phi$ through a sum, the three frequency derivatives $\pdv{E}{J_r}$, $\pdv{E}{J_\theta}$, and $\pdv{E}{J_\phi}$ are all equal. Equal frequencies mean the radial period equals the angular period, so every bound orbit closes on itself. This degeneracy is the deep mathematical origin of Kepler's third law.} Inverting this expression gives $|E| = 2\pi^2\mu k^2/J_{\mathrm{tot}}^2$, which expresses the energy in terms of the single total action $J_{\mathrm{tot}}$. Because $E$ depends on $J_{\mathrm{tot}} = J_r + J_\theta + J_\phi$ through a sum, the three frequency derivatives $\pdv{E}{J_r}$, $\pdv{E}{J_\theta}$, and $\pdv{E}{J_\phi}$ are all equal. Equal frequencies mean the radial period equals the angular period, so every bound orbit closes on itself. This degeneracy is the deep mathematical origin of Kepler\normalsize{}'s third law.}
\ex{Action-- angle derivation of $E(J_{\mathrm{tot}})$ and the degenerate frequencies}{ \ex{Action--angle derivation of $E(J_{\mathrm{tot}})$ and the degenerate frequencies}{
We evaluate the three action variables for the Kepler problem explicitly. We evaluate the three action variables for the Kepler problem explicitly.
\textbf{Azimuthal action.} The momentum conjugate to $\phi$ is $p_\phi = L_z$, a constant. Integrating over one full revolution: \textbf{Azimuthal action.} The momentum conjugate to $\phi$ is $p_\phi = L_z$, a constant. Integrating over one full revolution:
@@ -215,7 +239,7 @@ The radial action integral is
\[ \[
J_r = 2\int_{r_-}^{r_+}\sqrt{2\mu|E| + \frac{2\mu k}{r} - \frac{L^2}{r^2}}\;\frac{\dd r}{r^2/r^2}. J_r = 2\int_{r_-}^{r_+}\sqrt{2\mu|E| + \frac{2\mu k}{r} - \frac{L^2}{r^2}}\;\frac{\dd r}{r^2/r^2}.
\] \]
The standard contour-- integration or-- elliptic-- integral evaluation gives The standard contour--integration or elliptic--integral evaluation gives
\[ \[
J_r = 2\pi\!\left(-L + k\sqrt{\frac{\mu}{2|E|}}\right). J_r = 2\pi\!\left(-L + k\sqrt{\frac{\mu}{2|E|}}\right).
\] \]
@@ -226,7 +250,7 @@ J_{\mathrm{tot}} = J_r + J_\theta + J_\phi
= 2\pi\!\left(-L + k\sqrt{\frac{\mu}{2|E|}}\right) + 2\pi(L - |L_z|) + 2\pi L_z = 2\pi\!\left(-L + k\sqrt{\frac{\mu}{2|E|}}\right) + 2\pi(L - |L_z|) + 2\pi L_z
= 2\pi k\sqrt{\frac{\mu}{2|E|}}. = 2\pi k\sqrt{\frac{\mu}{2|E|}}.
\] \]
The angular-- momentum terms $L$ and $|L_z|$ cancel exactly. Invert the total-- action relation to obtain the energy: The angular--momentum terms $L$ and $|L_z|$ cancel exactly. Invert the total--action relation to obtain the energy:
\[ \[
\sqrt{\frac{\mu}{2|E|}} = \frac{J_{\mathrm{tot}}}{2\pi k}, \sqrt{\frac{\mu}{2|E|}} = \frac{J_{\mathrm{tot}}}{2\pi k},
\qquad \qquad
@@ -236,7 +260,7 @@ The angular-- momentum terms $L$ and $|L_z|$ cancel exactly. Invert the total--
E(J_{\mathrm{tot}}) = -\frac{2\pi^2\mu k^2}{J_{\mathrm{tot}}^2}. E(J_{\mathrm{tot}}) = -\frac{2\pi^2\mu k^2}{J_{\mathrm{tot}}^2}.
\] \]
\textbf{Degenerate frequencies.} The three action-- angle frequencies are \textbf{Degenerate frequencies.} The three action--angle frequencies are
\[ \[
\omega_r = \pdv{E}{J_r} = \pdv{E}{J_{\mathrm{tot}}}\pdv{J_{\mathrm{tot}}}{J_r} \omega_r = \pdv{E}{J_r} = \pdv{E}{J_{\mathrm{tot}}}\pdv{J_{\mathrm{tot}}}{J_r}
= \frac{4\pi^2\mu k^2}{J_{\mathrm{tot}}^3}\cdot 1, = \frac{4\pi^2\mu k^2}{J_{\mathrm{tot}}^3}\cdot 1,
@@ -249,52 +273,38 @@ E(J_{\mathrm{tot}}) = -\frac{2\pi^2\mu k^2}{J_{\mathrm{tot}}^2}.
\omega_\phi = \pdv{E}{J_\phi} = \pdv{E}{J_{\mathrm{tot}}}\pdv{J_{\mathrm{tot}}}{J_\phi} \omega_\phi = \pdv{E}{J_\phi} = \pdv{E}{J_{\mathrm{tot}}}\pdv{J_{\mathrm{tot}}}{J_\phi}
= \frac{4\pi^2\mu k^2}{J_{\mathrm{tot}}^3}\cdot 1. = \frac{4\pi^2\mu k^2}{J_{\mathrm{tot}}^3}\cdot 1.
\] \]
All three frequencies are equal: $\omega_r = \omega_\theta = \omega_\phi$. The period of radial oscillation equals the period of angular advance. In a single radial period the azimuthal angle advances by $2\pi$, so the orbit closes after exactly one revolution. This is Kepler's third law: the orbital period is determined solely by the energy and is independent of the angular momentum.} All three frequencies are equal: $\omega_r = \omega_\theta = \omega_\phi$. The period of radial oscillation equals the period of angular advance. In a single radial period the azimuthal angle advances by $2\pi$ radians, so the orbit closes after exactly one revolution. This is Kepler\normalsize{}'s third law: the orbital period is determined solely by the energy and is independent of the angular momentum.}
\nt{Comparison with the Binet equation}{ \nt{Binet equation: alternative derivation}{
The Binet equation is an alternative derivation of Kepler orbits that begins with Newton's second law and the substitution $u(\phi) = 1/r(\phi)$. The radial equation of motion in polar coordinates is The Binet approach starts from Newton\normalsize{}'s second law in polar coordinates and the substitution $u(\phi) = 1/r(\phi)$. The radial equation of motion is
\[ \[
\mu(\ddot{r} - r\dot{\phi}^2) = -\frac{k}{r^2}. \mu(\ddot{r} - r\dot{\phi}^2) = -\frac{k}{r^2}.
\] \]
With the angular momentum $L = \mu r^2\dot{\phi}$, eliminate $\dot{\phi}$ in favor of $L$: With $L = \mu r^2\dot{\phi}$, eliminate $\dot{\phi}$:
\[ \[
\ddot{r} - \frac{L^2}{\mu^2 r^3} = -\frac{k}{\mu r^2}. \ddot{r} - \frac{L^2}{\mu^2 r^3} = -\frac{k}{\mu r^2}.
\] \]
Write $r = 1/u(\phi)$ and use the chain rule to convert time derivatives into $\phi$ derivatives. Since $\dot{\phi} = L/(\mu r^2) = L u^2/\mu$: Setting $r = 1/u$ and using $\dot{\phi} = L u^2/\mu$:
\[ \[
\dot{r} = \dv{r}{\phi}\dot{\phi} = -\dv{u}{\phi}\cdot\frac{1}{u^2}\cdot\frac{L u^2}{\mu} \dot{r} = -\frac{L}{\mu}\dv{u}{\phi},
= -\frac{L}{\mu}\dv{u}{\phi}. \qquad
\ddot{r} = -\frac{L^2}{\mu^2}\dv[2]{u}{\phi}.
\] \]
Differentiate once more: The radial equation becomes the linear ODE
\[ \[
\ddot{r} = \dv{}{t}\!\left(-\frac{L}{\mu}\dv{u}{\phi}\right) \dv[2]{u}{\phi} + u = \frac{\mu k}{L^2},
= -\frac{L}{\mu}\dv[2]{u}{\phi}\dot{\phi}
= -\frac{L^2}{\mu^2}\dv[2]{u}{\phi}.
\] \]
Substitute into the radial equation: with general solution
\[ \[
-\frac{L^2}{\mu^2}\dv[2]{u}{\phi} - \frac{L^2}{\mu^2}\,u = -\frac{k}{\mu}, u(\phi) = \frac{\mu k}{L^2} + A\cos(\phi - \phi_0).
\] \]
which rearranges to the Binet equation: Inverting $r = 1/u$ gives the conic $r(\phi) = \ell/(1 + \varepsilon\cos(\phi - \phi_0))$ with $\ell = L^2/(\mu k)$ and $\varepsilon = \mu k A/L^2$. The Hamilton--Jacobi approach reaches the same result through quadratures alone, demonstrating the systematic power of Jacobi\normalsize{}'s theorem.}
\[
\dv[2]{u}{\phi} + u = \frac{\mu k}{L^2}.
\]
The general solution is
\[
u(\phi) = \frac{\mu k}{L^2} + A\cos(\phi - \phi_0),
\]
where $A$ is determined by the energy. Inverting $r = 1/u$ gives
\[
r(\phi) = \frac{1}{\mu k/L^2 + A\cos(\phi - \phi_0)}
= \frac{L^2/(\mu k)}{1 + \mu k A/L^2\cos(\phi - \phi_0)},
\]
which matches the form $\ell/(1 + \varepsilon\cos(\phi - \phi_0))$ with $\ell = L^2/(\mu k)$ and $\varepsilon = \mu k A/L^2$. This derivation is shorter but requires knowing the $u = 1/r$ substitution. The Hamilton-- Jacobi approach reaches the same result through quadratures without any clever change of variable, demonstrating the power of Jacobi's theorem as a unifying principle. Moreover, the action-- angle formalism provides immediate access to the energy-- period-- semimajor axis relations that the Binet equation leaves as an afterthought. The Binet method also cannot be extended to noncentral potentials or to higher dimensions without substantial modification, while the Hamilton-- Jacobi approach generalizes naturally to any separable system.}
\qs{Earth-- sun system parameters from the HJ formulation}{ \qs{Earth--sun system parameters from the HJ formulation}{
For the Earth orbiting the Sun, take the semimajor axis $a = 1.50\times 10^{11}\,\mathrm{m}$, the solar mass $M_{\text{sun}} = 1.99\times 10^{30}\,\mathrm{kg}$, the gravitational constant $G = 6.674\times 10^{-11}\,\mathrm{N\!\cdot\!m^2/kg^2}$, and the Earth mass $m_{\text{earth}} = 5.97\times 10^{24}\,\mathrm{kg}$. The gravitational coupling constant is $k = GM_{\text{sun}}\,m_{\text{earth}}$. For the Earth orbiting the Sun, take the semimajor axis $a = 1.50\times 10^{11}\,\mathrm{m}$, the solar mass $M_{\text{sun}} = 1.99\times 10^{30}\,\mathrm{kg}$, the gravitational constant $G = 6.674\times 10^{-11}\,\mathrm{N\!\cdot\!m^2/kg^2}$, and the Earth mass $m_{\text{earth}} = 5.97\times 10^{24}\,\mathrm{kg}$. The gravitational coupling constant is $k = GM_{\text{sun}}\,m_{\text{earth}}$.
\begin{enumerate}[label=(\alph*)] \begin{enumerate}[label=(\alph*)]
\item Compute $k$ and the binding energy $E = -k/(2a)$ for the circular-- orbit limit. Show that $E \approx -2.65\times 10^{33}\,\mathrm{J}$. \item Compute $k$ and the binding energy $E = -k/(2a)$ for the circular--orbit limit. Show that $E \approx -2.65\times 10^{33}\,\mathrm{J}$.
\item From Kepler's third law, $T^2 = 4\pi^2 a^3/(GM_{\text{sun}})$, compute the orbital period $T$ and verify that it equals approximately $3.16\times 10^7\,\mathrm{s}$, or one year. \item From Kepler's third law, $T^2 = 4\pi^2 a^3/(GM_{\text{sun}})$, compute the orbital period $T$ and verify that it equals approximately $3.16\times 10^7\,\mathrm{s}$, or one year.
\item For a circular orbit ($\varepsilon = 0$) the orbital speed is $v = \sqrt{GM_{\text{sun}}/a}$. Show that $v \approx 29.8\times 10^3\,\mathrm{m/s} = 29.8\,\mathrm{km/s}$. \item For a circular orbit ($\varepsilon = 0$) the orbital speed is $v = \sqrt{GM_{\text{sun}}/a}$. Show that $v \approx 29.8\times 10^3\,\mathrm{m/s} = 29.8\,\mathrm{km/s}$.
\end{enumerate}} \end{enumerate}}
@@ -325,9 +335,9 @@ Rounding the coupling constant to $k = 7.94\times 10^{44}\,\mathrm{J\!\cdot\!m}$
E = -\frac{7.94\times 10^{44}}{3.00\times 10^{11}}\,\mathrm{J} E = -\frac{7.94\times 10^{44}}{3.00\times 10^{11}}\,\mathrm{J}
= -2.65\times 10^{33}\,\mathrm{J}. = -2.65\times 10^{33}\,\mathrm{J}.
\] \]
The large negative value confirms that the Earth is deeply bound to the Sun's gravitational potential. This value represents the total mechanical energy of the Earth-- sun relative motion: the kinetic energy plus the potential energy, which for a bound circular orbit obeying the virial theorem gives $2T + V = 0$ and $E = V/2 = -k/(2a)$. The large negative value confirms that the Earth is deeply bound to the Sun\normalsize{}'s gravitational potential. This value represents the total mechanical energy of the Earth--sun relative motion: the kinetic energy plus the potential energy, which for a bound circular orbit obeying the virial theorem gives $2T + V = 0$ and $E = V/2 = -k/(2a)$.
\textbf{Part (b).} Kepler's third law follows from the action-- angle energy relation. The gravitational parameter is \textbf{Part (b).} Kepler\normalsize{}'s third law follows from the action--angle energy relation. The gravitational parameter is
\[ \[
GM_{\text{sun}} = (6.674\times 10^{-11})(1.99\times 10^{30})\,\mathrm{m^3/s^2} GM_{\text{sun}} = (6.674\times 10^{-11})(1.99\times 10^{30})\,\mathrm{m^3/s^2}
= 13.28\times 10^{19}\,\mathrm{m^3/s^2} = 13.28\times 10^{19}\,\mathrm{m^3/s^2}
@@ -386,7 +396,9 @@ v = \sqrt{8.87\times 10^8}\,\mathrm{m/s}
= 29.8\times 10^3\,\mathrm{m/s} = 29.8\times 10^3\,\mathrm{m/s}
= 29.8\,\mathrm{km/s}. = 29.8\,\mathrm{km/s}.
\] \]
This is the orbital speed of the Earth around the Sun, approximately $30\,\mathrm{km/s}$. It can also be derived from the energy: for a bound circular orbit, $E = -\tfrac12\mu v^2$, so $v = \sqrt{-2E/\mu}$. Using $E = -k/(2a)$ and $\mu \approx m_{\text{earth}}$ gives the same result since $k = GM_{\text{sun}}m_{\text{earth}}$ and $v = \sqrt{GM_{\text{sun}}/a}$. Therefore, This is the orbital speed of the Earth around the Sun, approximately $30\,\mathrm{km/s}$. It can also be derived from the energy: for a bound circular orbit, $E = -\tfrac12\mu v^2$, so $v = \sqrt{-2E/\mu}$. Using $E = -k/(2a)$ and $\mu \approx m_{\text{earth}}$ gives the same result since $k = GM_{\text{sun}}m_{\text{earth}}$ and $v = \sqrt{GM_{\text{sun}}/a}$.
Therefore,
\[ \[
k = 7.94\times 10^{44}\,\mathrm{J\!\cdot\!m}, k = 7.94\times 10^{44}\,\mathrm{J\!\cdot\!m},
\qquad \qquad