Deriving Kepler's Laws
Overview
In antiquity, the great mathematician and astronomer Ptolemy had laid out a system for computing where planets would be found in the sky. It was based on the theory that the sun and the planets moved around the earth in epicycles, a pattern of circles which are themselves spinning in circles. It should be noted that his computations actually worked quite well to predict the night sky. However, the computations were quite messy. In the 16th century, Copernicus pointed out that the calculations could be simplified if instead it was assumed that the earth moved around the sun. This theory had been considered and rejected by Ptolemy. While Copernicus was correct that it simplified the calculations, it still required the use of epicycles. Johannes Kepler discovered that epicycles could be entirely avoided if instead of the planets were assumed to move in elliptical orbits instead of circular orbits. Kepler's three laws of planetary motion are:- Planetary orbits are ellipses with the sun at one focus.
- Planets do not move with a constant speed, but the line segment joining the sun and a planet will sweep out equal areas in equal times.
- The square of the period of the orbit of a planet is proportional to the cube of the semi-major axis of the orbit.
Acceleration in polar coordinates
It will be most convenient to work in polar coordinates, with the sun at the origin and the axes oriented so aphelion, the point where the orbit is farthest from the sun, is along $\theta=0$. We write the position of the orbiting planet is the standard form $$\text{position} = \langle r\cos(\theta),r\sin(\theta) \rangle.$$ I use $\langle a,b \rangle$ for the vector because we will soon have enough $()$'s that we it will be helpful to have different separators for vectors. Here $r=r(t)$ and $\theta=\theta(t)$ are assumed to be functions changing with time, since the planet is moving. To find the velocity vector, we differentiate position with respect to time. We use the product rule to get $$\text{velocity} = \langle r'\cos(\theta)-r\theta'\sin(\theta), r'\sin(\theta)+r\theta'\cos(\theta) \rangle.$$ Differentiating again we get acceleration. $$ \begin{align} \text{acceleration} &= \langle r''\cos(\theta) - 2r'\theta'\sin(\theta) - r(\theta')^2\cos(\theta) - r\theta''\sin(\theta), r''\sin(\theta) + 2r'\theta'\cos(\theta) - r(\theta')^2\sin(\theta) + r\theta''\cos(\theta) \rangle \\ &= \langle (r'' - r(\theta')^2\cos(\theta) + (2r'\theta'+r\theta'')(-\sin(\theta)), (r'' - r(\theta')^2\sin(\theta) + (2r'\theta'+r\theta'')(\cos(\theta)) \rangle \\ &=(r''-r(\theta')^2)\langle\cos(\theta),\sin(\theta)\rangle\quad + \quad(2r'\theta' + r\theta'')\langle-\sin(\theta),\cos(\theta)\rangle \end{align} $$ Note that our acceleration vector naturally splits into two pieces. Since the vector $\langle\cos(\theta),\sin(\theta)\rangle$ lies along the radius from the origin, we call $$a_r = r'' - r(\theta')^2$$ the radial component of accelartion. The vector $\langle-\sin(\theta),\cos(\theta)\rangle$ is normal to the radial vector and we call $$a_{\theta} = 2r'\theta' + r\theta''$$ the transverse component of acceleration.Newton's Law of Gravity
Newton's law of gravity states that the attractive force between two masses is $$\frac{GMm}{r^2}$$ where $M$ is the mass of one object, $m$ the mass of the other object, $r$ the distance between the two objects, and $G$ is the gravitational constant, $G\approx 6.67 \times 10^{-11} m^3/(kg\cdot sec^2).$ We let $M$ be the mass of the sun and $m$ be the mass of the planet. Since $F=ma$, the $m$ cancels out and we find the planet undergoes an acceleration of magnitude $GM/r^2$ toward the sun. Since the acceleration is toward the sun, that means we have $a_r = -GM/r^2$. On the other hand, $a_{\theta}=0$ since the acceleration is purely toward the sun and so there is no transverse component. Using our formulas for $a_r$ and $a_{\theta}$ from the previous section gives us a system of two non-linear differential equations for $r$ and $\theta$, $$ \begin{align} r''-r(\theta')^2 &= -\frac{GM}{r^2} \\ 2r'\theta'+r\theta'' &= 0 \end{align} $$Conservation of Angular Momentum
From the second equation we can derive Kepler's second law. The key trick is to observe $$ \begin{align} \frac{d}{dt}(r^2\theta') &= 2rr'\theta' + r^2\theta'' \\ &= r(2r'+r\theta'') \\ &=ra_{\theta} \\ &=0 \end{align} $$ So $r^2\theta' = c$ for some constant $c$. Physicists refer to this as the law of conservation of angular momentum. In a little bit, we will find it very useful to have $$ \theta' = \frac{c}{r^2} $$ to be able to replace $\theta'$ terms by $r$ terms. But first, we will use this result to establish Kepler's second law.Kepler's Second Law
In calculus you (should have) learned that the area swept out by a polar curve is $$ \text{Area} = \int \frac12 r^2 d\theta. $$ Treating $r$ and $\theta$ as functions of $t$, so $d\theta = \theta'dt$, and using the result from the previous section that $r^2\theta' = c$, we get that the area swept out by the line joining the sun (at the origin) to the planet between times $t_0$ and $t_1$ is $$ \begin{align} \text{Area} &= \int_{t_0}^{t_1} \frac12 r^2\theta' dt \\ &= \int_{t_0}^{t_1} \frac{c}{2} dt \\ &=\frac{c(t_1-t_0)}{2}. \end{align} $$ Hence the area swept out depends only on the length of time, or in Kepler's terms, the planet sweeps out equal areas in equal times.Writing An Equation For The Orbit
We now turn our attention to the key equation, $$r'' - r(\theta')^2 = -\frac{GM}{r^2}.$$ This equation involves $r$ and $\theta$ as functions of time $t$. If we want to plot the orbit as a whole, instead of determining the coordinates of the planet at time $t$, it would be better to have an equation for $r$ as a function of $\theta$. We have our standard trick to write $$ \frac{dr}{d\theta} = \frac{r'}{\theta'}. $$ But this doesn't actually help much since our equations aren't just simple formulas for $r'$ and $\theta'$. However, there is an inspired change of variables that makes everything work out neatly. Instead of working with $r$, we will use $u = 1/r$. Then $\displaystyle u' = -r'/r^2$. And then $$ \begin{align} \frac{du}{d\theta} &= \frac{u'}{\theta'} \\ &= \frac{-r'/r^2}{c/r^2} \\ &= -\frac{r'}{c} \end{align} $$ where we used the formula for $\theta'$ we found in the section on "Conservation of Angular Momentum." Now since we have a second order equation, we will need to compute the second derivative of $u$ with respect to $\theta$. $$ \begin{align} \frac{d^2u}{d\theta^2} &= \frac{d}{d\theta}\left(\frac{du}{d\theta}\right) \\ &= \frac{d/dt(du/d\theta)}{d\theta/dt} \\ &= \frac{d/dt(-r'/c)}{\theta'} \\ &= \frac{-r''/c}{c/r^2} \\ &= -\frac{r^2r''}{c^2}. \end{align} $$ Now return to our equation $$ r'' - r(\theta')^2 = -\frac{GM}{r^2}. $$ Making our standard substitution for $\theta'$ we get $$ r'' - r\left(\frac{c}{r^2}\right)^2 = -\frac{GM}{r^2}. $$ Multiply through by $\displaystyle -\frac{r^2}{c^2}$ to obtain $$ -\frac{r^2r''}{c^2} + \frac{1}{r} = \frac{GM}{c^2}. $$ Now substitute in our formulas for $u$ and $\displaystyle \frac{d^2u}{d\theta^2}$ to get $$ \frac{d^2u}{d\theta^2} + u = \frac{GM}{c^2}. $$ So we have found a nice constant coefficient linear differential equation. And we can solve this using the techniques of chapter 2 to obtain $$ u = \frac{GM}{c^2} - A\cos(\theta - \theta_0), $$ where $A$ is an arbitrary constant. Changing back to $r = 1/u$ we get $$ r = \frac{1}{GM/c^2 - A\cos(\theta - \theta_0)}. $$ Remember that we defined our coordinate system so aphelion occurs at $\theta = 0$. Since $r$ will be largest when the denominator is smallest, and $\cos$ takes its maximum value at 0, we must have $\theta_0=0$ so the orbit of the planet in polar coordinates is $$ r = \frac{1}{GM/c^2 - A\cos(\theta)}. $$Kepler's First Law
Isaac Newton would have recognized the equation above as the general equation for a conic section in polar coordinates with a focus at the origin and major axis along the
UPDATE
I wrote that the odds against seeing an extra-solar object on a hyperbolic
orbit were astronomical in 2014.
In 2017, asteroid 1I/2017 U1, of interstellar origin was observed on exactly
such a hyperbolic orbit. The object was named Oumuamua (after the Hawaiian
word for scout). In fact, astronomers believe that such objects are more
common than I realized, but we had never spotted one before. You can read
more online if you are interested. I recommend the NASA site or the
Wikipedia article that I’ve listed below.
Area Of An Orbit
Kepler's third law relates the period, the time it takes the planet to complete an orbit, to the semimajor axis of the ellipse. Kepler's second law relates time to the area swept out, and we also know how to find the area of an ellipse given the major and minor axes. We will use this to find two formulas for the area of an orbit, and then use those to deduce Kepler's third law.Kepler's Third Law
From the above we can compute $$ \begin{align} \text{semi-major axis} &= \frac{\text{perihelion} + \text{aphelion}}2 \\ &= \frac{1}{2}\left(\frac{1}{GM/c^2 + A}+\frac{1}{GM/c^2 - A}\right) \\ &=\frac{GM/c^2}{(GM/c^2)^2 - A^2}. \end{align} $$ We want to relate this to the orbital period $T$, the time it takes the planet to complete an orbit. In the section "Kepler's First Law," we deduced that the area swept out in a time $t$ was $ct/2$. So in time $T$, we sweep out area $cT/2$. But this must be the area of the entire orbit, which we computed in previous section. So $$ \begin{align} \frac{cT}{2} &= \pi \frac{GM/c^2}{\left((GM/c^2)^2 - A^2 \right)^{3/2}} \\ T &= 2\pi \frac{GM/c^3}{\left( (GM/c^2)^2 - A^2 \right)^{3/2}} \\ T^2 &= 4\pi^2 \frac{(GM)^2/c^6}{\left( (GM/c^2)^2 - A^2 \right)^3} \\ T^2 &= \frac{4\pi^2}{GM}\frac{(GM)^3/c^6}{\left( (GM/c^2)^2 - A^2 \right)^3} \\ T^2 &= \frac{4\pi^2}{GM}\left(\frac{(GM)/c^2}{(GM/c^2)^2 - A^2}\right)^3 \\ T^2 &= \frac{4\pi^2}{GM}(\text{semi-major axis})^3. \end{align} $$ So the orbital period squared is proportional to the semi-major axis cubed, with the constant of proportionality $\displaystyle \frac{4\pi^2}{GM}$. Note that we have not only established Kepler's third law, we've also worked out the specific constant of proportionality in terms of physical constants. This enables us to "weigh" the sun. An accurate determination of the semi-major axis of the earth's orbit was made based on measurements taken during the transits of Venus in 1761 and 1769. An accurate measurement for $G$ was carried out by Henry Cavendish in 1798 using a torsion balance. Since the orbital period of the earth is 1 year, from this information we are able to measure the mass of the Sun.If you have any problems with this page, please contact bennett@math.ksu.edu.
©2010, 2014 Andrew G. Bennett