American Journal of Physics, Vol. 72, No. 4, pp. 510–513, April 2004
© 2004 American Association of Physics Teachers. All rights reserved.

Deriving Lagrange's equations using elementary calculus

Jozef Hanc^a)

Technical University, Vysokoskolska 4, 042 00 Kosice, Slovakia

Edwin F. Taylor^b)

Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

Slavomir Tuleja^c)

Gymnazium arm. gen. L. Svobodu, Komenskeho 4, 066 51 Humenne, Slovakia

Received: 30 December 2002; accepted: 20 June 2003

We derive Lagrange's equations of motion from the principle of least action using elementary calculus rather than the calculus of variations. We also demonstrate the conditions under which energy and momentum are constants of the motion. © 2004 American Association of Physics Teachers.

I. INTRODUCTION

The equations of motion¹ of a mechanical system can be derived by two different mathematical methods: vectorial and analytical. Traditionally, introductory mechanics begins with Newton's laws of motion, which relate the force, momentum, and acceleration vectors. But we frequently need to describe systems, for example, systems subject to constraints without friction, for which the use of vector forces is cumbersome. Analytical mechanics in the form of the Lagrange equations provides an alternative and very powerful tool for obtaining the equations of motion. Lagrange's equations employ a single scalar function, and there are no annoying vector components or associated trigonometric manipulations. Moreover, the analytical approach using Lagrange's equations provides other capabilities² that allow us to analyze a wider range of systems than Newton's second law.

The derivation of Lagrange's equations in advanced mechanics texts³ typically applies the calculus of variations to the principle of least action. The calculus of variations is an important branch of mathematics, but is not widely taught or used at the college level. Students often encounter the variational calculus first in an advanced mechanics class, where they struggle to apply a new mathematical procedure to a new physical concept. This paper provides a derivation of Lagrange's equations from the principle of least action using elementary calculus,⁴ which may be employed as an alternative to (or a preview of) the more advanced variational calculus derivation.

In Sec. II we develop the mathematical background for deriving Lagrange's equations from elementary calculus. Section III gives the derivation of the equations of motion for a single particle. Section IV extends our approach to demonstrate that the energy and momentum are constants of the motion. The Appendix expands Lagrange's equations to multiparticle systems and adds angular momentum as an example of generalized momentum.

II. DIFFERENTIAL APPROXIMATION TO THE PRINCIPLE OF LEAST ACTION

A particle moves along the x axis with potential energy V(x), which is time independent. For this special case the Lagrange function or Lagrangian L has the form:⁵

$$L(x, v) = T - V = \tfrac{1}{2} m v^2 - V(x). \tag{1}$$

The action S along a world line is defined as

$$S = \int_{\text{along the world line}} L(x, v)\, dt. \tag{2}$$

The principle of least action requires that between a fixed initial event and a fixed final event the particle follow a world line such that the action S is a minimum.

The action S is an additive scalar quantity, and is the sum of contributions $L\,\Delta t$ from each segment along the entire world line between two events fixed in space and time. Because S is additive, it follows that the principle of least action must hold for each individual infinitesimal segment of the world line.⁶ This property allows us to pass from the integral equation for the principle of least action, Eq. (2), to Lagrange's differential equation, valid anywhere along the world line. It also allows us to use elementary calculus in this derivation.

We approximate a small section of the world line by two straight-line segments connected in the middle (Fig. 1) and make the following approximations: The average position coordinate in the Lagrangian along a segment is at the midpoint of the segment.⁷ The average velocity of the particle is equal to its displacement across the segment divided by the time interval of the segment. These approximations applied to segment A in Fig. 1, which runs from event 1 at position $x_1$ to event 2 at position $x_2$ in the time interval $\Delta t$, yield the average Lagrangian $L_A$ and action $S_A$ contributed by this segment:

$$L_A \equiv L\!\left(\frac{x_1 + x_2}{2},\ \frac{x_2 - x_1}{\Delta t}\right), \qquad S_A \approx L_A\,\Delta t, \tag{3}$$

with similar expressions for $L_B$ and $S_B$ along segment B.
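The segment approximation can be made concrete numerically. The following minimal Python sketch is an illustration and not part of the original derivation; the function names, the uniform-gravity potential V(x) = mgx, and the numerical values are assumptions chosen for the example. It evaluates the action of a piecewise-straight world line as a sum of segment contributions, each using the midpoint position and the average velocity, and compares sampled free fall with a path that has the same end events but a displaced interior event.

```python
def lagrangian(x, v, m=1.0, g=9.81):
    """L = T - V with the illustrative (assumed) potential V(x) = m*g*x."""
    return 0.5 * m * v**2 - m * g * x


def segment_action(x_start, x_end, dt, lagr=lagrangian):
    """Action of one straight segment: L evaluated at the midpoint position
    and the average velocity, multiplied by the segment's time interval."""
    x_mid = 0.5 * (x_start + x_end)
    v_avg = (x_end - x_start) / dt
    return lagr(x_mid, v_avg) * dt


def path_action(xs, dt, lagr=lagrangian):
    """Action of a world line sampled at equal time steps dt,
    summed segment by segment (the action is additive)."""
    return sum(segment_action(a, b, dt, lagr) for a, b in zip(xs, xs[1:]))


# Samples of free fall x(t) = -g t^2 / 2, and a path with the same
# end events but one displaced interior event.
dt = 0.1
true_path = [-0.5 * 9.81 * (k * dt) ** 2 for k in range(5)]
perturbed = list(true_path)
perturbed[2] += 0.05

print(path_action(true_path, dt), path_action(perturbed, dt))
```

For this choice of potential the displaced path has the larger action, as the principle of least action leads us to expect.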

Figure 1.

III. DERIVATION OF LAGRANGE'S EQUATION

We employ the approximations of Sec. II to derive Lagrange's equations for the special case introduced there. As shown in Fig. 2, we fix events 1 and 3 and vary the x coordinate of the intermediate event to minimize the action between the outer two events.

Figure 2.

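For simplicity, but without loss of generality, we choose the time increment $\Delta t$ to be the same for each segment, which also equals the time between the midpoints of the two segments. The average positions and velocities along segments A and B are

$$x_A = \frac{x_1 + x}{2}, \quad v_A = \frac{x - x_1}{\Delta t}, \quad x_B = \frac{x + x_3}{2}, \quad v_B = \frac{x_3 - x}{\Delta t}. \tag{4}$$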

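The expressions in Eq. (4) are all functions of the single variable x. For later use we take the derivatives of Eq. (4) with respect to x:

$$\frac{dx_A}{dx} = \frac{1}{2}, \quad \frac{dv_A}{dx} = +\frac{1}{\Delta t}, \quad \frac{dx_B}{dx} = \frac{1}{2}, \quad \frac{dv_B}{dx} = -\frac{1}{\Delta t}. \tag{5}$$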

Let $L_A$ and $L_B$ be the values of the Lagrangian on segments A and B, respectively, using Eq. (4), and label the summed action across these two segments as $S_{AB}$:

$$S_{AB} = L_A\,\Delta t + L_B\,\Delta t. \tag{6}$$

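The principle of least action requires that the coordinate x of the middle event be chosen to yield the smallest value of the action between the fixed events 1 and 3. If we set the derivative of $S_{AB}$ with respect to x equal to zero⁸ and use the chain rule, we obtain

$$\frac{dS_{AB}}{dx} = \frac{\partial L_A}{\partial x_A}\frac{dx_A}{dx}\,\Delta t + \frac{\partial L_A}{\partial v_A}\frac{dv_A}{dx}\,\Delta t + \frac{\partial L_B}{\partial x_B}\frac{dx_B}{dx}\,\Delta t + \frac{\partial L_B}{\partial v_B}\frac{dv_B}{dx}\,\Delta t = 0. \tag{7}$$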

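We substitute Eq. (5) into Eq. (7), divide through by $\Delta t$, and regroup the terms to obtain

$$\frac{1}{2}\left[\frac{\partial L_A}{\partial x_A} + \frac{\partial L_B}{\partial x_B}\right] - \frac{1}{\Delta t}\left[\frac{\partial L_B}{\partial v_B} - \frac{\partial L_A}{\partial v_A}\right] = 0. \tag{8}$$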

To first order, the first term in Eq. (8) is the average value of $\partial L/\partial x$ on the two segments A and B. In the limit $\Delta t \to 0$, this term approaches the value of the partial derivative at x. In the same limit, the second term in Eq. (8) becomes the time derivative of the partial derivative of the Lagrangian with respect to velocity, $d(\partial L/\partial v)/dt$. Therefore in the limit $\Delta t \to 0$, Eq. (8) becomes the Lagrange equation in x:

$$\frac{\partial L}{\partial x} - \frac{d}{dt}\left(\frac{\partial L}{\partial v}\right) = 0. \tag{9}$$

We did not specify the location of segments A and B along the world line. The additive property of the action implies that Eq. (9) is valid for every adjacent pair of segments.
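The two-segment argument can also be checked numerically. The sketch below is an illustration under stated assumptions, not the authors' procedure: it fixes events 1 and 3, minimizes the summed action over the middle coordinate x with a simple ternary search (a swapped-in numerical technique; the helper names, the potential V(x) = mgx, and all numbers are hypothetical choices), and then forms the discrete acceleration from the three positions. For this potential the discrete acceleration should come out close to -g, which is what the Lagrange equation in x predicts.

```python
def minimize_unimodal(f, lo, hi, iters=200):
    """Ternary search for the minimizer of a unimodal function on [lo, hi]."""
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if f(a) < f(b):
            hi = b
        else:
            lo = a
    return 0.5 * (lo + hi)


def two_segment_action(x, x1, x3, dt, m, potential):
    """Summed action for segment A (event 1 to middle event) and
    segment B (middle event to event 3), each lasting dt."""
    x_a, v_a = 0.5 * (x1 + x), (x - x1) / dt
    x_b, v_b = 0.5 * (x + x3), (x3 - x) / dt
    lagr_a = 0.5 * m * v_a**2 - potential(x_a)
    lagr_b = 0.5 * m * v_b**2 - potential(x_b)
    return (lagr_a + lagr_b) * dt


# Assumed illustrative values: uniform gravity, fixed outer events.
m, g, dt = 1.0, 9.81, 0.1
x1, x3 = 0.0, 0.3

x_best = minimize_unimodal(
    lambda x: two_segment_action(x, x1, x3, dt, m, lambda y: m * g * y),
    x1 - 1.0,
    x3 + 1.0,
)

# Discrete acceleration from the three event positions.
accel = (x1 - 2.0 * x_best + x3) / dt**2
print(x_best, accel)
```

The search bracket is wide on purpose; the summed action is a convex function of x for this potential, so any bracket containing the minimum would do.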

An essentially identical derivation applies to any particle with one degree of freedom in any potential. For example, the single angle $\phi$ tracks the motion of a simple pendulum, so its equation of motion follows from Eq. (9) by replacing x with $\phi$ without the need to take vector components.
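As a hedged illustration of the pendulum remark, the following Python sketch builds the angular acceleration directly from a scalar Lagrangian, with no vector components. It is a sketch under assumptions: the pendulum Lagrangian L = m l² (dφ/dt)²/2 + m g l cos φ, the function names, the step size, and the sample state are all choices made for this example, and the partial derivatives are approximated by central finite differences rather than computed analytically.

```python
import math


def pendulum_lagrangian(phi, phi_dot, m=1.0, length=1.0, g=9.81):
    """L = T - V for a simple pendulum, with V = -m*g*length*cos(phi)
    (assumed form; phi is measured from the downward vertical)."""
    return 0.5 * m * length**2 * phi_dot**2 + m * g * length * math.cos(phi)


def angular_acceleration(lagr, phi, phi_dot, h=1e-4):
    """Solve dL/dphi - d/dt(dL/dphi_dot) = 0 for the second time derivative
    of phi, using central differences for the partial derivatives.

    d/dt(dL/dphi_dot) = (d2L/dphi_dot2) * phi_ddot + (d2L/dphi dphi_dot) * phi_dot
    """
    dl_dphi = (lagr(phi + h, phi_dot) - lagr(phi - h, phi_dot)) / (2.0 * h)
    d2l_dv2 = (
        lagr(phi, phi_dot + h) - 2.0 * lagr(phi, phi_dot) + lagr(phi, phi_dot - h)
    ) / h**2
    d2l_mixed = (
        lagr(phi + h, phi_dot + h)
        - lagr(phi + h, phi_dot - h)
        - lagr(phi - h, phi_dot + h)
        + lagr(phi - h, phi_dot - h)
    ) / (4.0 * h**2)
    return (dl_dphi - d2l_mixed * phi_dot) / d2l_dv2


# Sample state (assumed values): the result should be close to
# -(g / length) * sin(phi), the familiar pendulum equation.
acc = angular_acceleration(pendulum_lagrangian, 0.3, 0.5)
print(acc, -9.81 * math.sin(0.3))
```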

IV. MOMENTUM AND ENERGY AS CONSTANTS OF THE MOTION

A. Momentum

We consider the case in which the Lagrangian does not depend explicitly on the x coordinate of the particle (for example, the potential is zero or independent of position). Because it does not appear in the Lagrangian, the x coordinate is "ignorable" or "cyclic." In this case a simple and well-known conclusion from Lagrange's equation leads to the momentum as a conserved quantity, that is, a constant of motion. Here we provide an outline of the derivation.

For a Lagrangian that is only a function of the velocity, $L = L(v)$, Lagrange's equation (9) tells us that the time derivative of $\partial L/\partial v$ is zero. From Eq. (1), we find that $\partial L/\partial v = mv$, which implies that the x momentum, $p = mv$, is a constant of the motion.

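This usual consideration can be supplemented or replaced by our approach. If we repeat the derivation in Sec. III with $L = L(v)$ (perhaps as a student exercise to reinforce understanding of the previous derivation), we obtain from the principle of least action

$$\frac{dS_{AB}}{dx} = \frac{\partial L_A}{\partial v_A}\frac{dv_A}{dx}\,\Delta t + \frac{\partial L_B}{\partial v_B}\frac{dv_B}{dx}\,\Delta t = 0. \tag{10}$$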

We substitute Eq. (5) into Eq. (10) and rearrange the terms to find:

$$\frac{\partial L_A}{\partial v_A} = \frac{\partial L_B}{\partial v_B}. \tag{11}$$

Again we can use the arbitrary location of segments A and B along the world line to conclude that the momentum p is a constant of the motion everywhere on the world line.
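A short numerical check may help here. The Python sketch below is illustrative only; it assumes a free particle with L = mv²/2 and, as a small extension beyond the equal-interval case treated above, lets the two segments have different durations (the names dt_a and dt_b and all numbers are hypothetical). It minimizes the summed action over the middle position by ternary search and compares the momenta on the two segments, which should agree even though the middle event is no longer at the spatial midpoint.

```python
def minimize_unimodal(f, lo, hi, iters=200):
    """Ternary search for the minimizer of a unimodal function on [lo, hi]."""
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if f(a) < f(b):
            hi = b
        else:
            lo = a
    return 0.5 * (lo + hi)


def free_two_segment_action(x, x1, x3, dt_a, dt_b, m):
    """Summed action for a free particle, L = m v^2 / 2, on two
    straight segments of durations dt_a and dt_b."""
    v_a = (x - x1) / dt_a
    v_b = (x3 - x) / dt_b
    return 0.5 * m * v_a**2 * dt_a + 0.5 * m * v_b**2 * dt_b


# Assumed illustrative values with unequal segment durations.
m = 2.0
x1, x3 = 0.0, 1.0
dt_a, dt_b = 0.1, 0.3

x_best = minimize_unimodal(
    lambda x: free_two_segment_action(x, x1, x3, dt_a, dt_b, m), x1, x3
)

# Momentum dL/dv = m v on each segment.
p_a = m * (x_best - x1) / dt_a
p_b = m * (x3 - x_best) / dt_b
print(x_best, p_a, p_b)
```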

B. Energy

Standard texts⁹ obtain conservation of energy by examining the time derivative of a Lagrangian that does not depend explicitly on time. As pointed out in Ref. 9, this lack of dependence of the Lagrangian implies the homogeneity of time: temporal translation has no influence on the form of the Lagrangian. Thus conservation of energy is closely connected to the symmetry properties of nature.¹⁰ As we will see, our elementary calculus approach offers an alternative way¹¹ to derive energy conservation.

Consider a particle in a time-independent potential V(x). Now we vary the time of the middle event (Fig. 3), rather than its position, requiring that this time be chosen to minimize the action.

Figure 3.

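For simplicity, we choose the x increments to be equal, with the value $\Delta x$. We keep the spatial coordinates of all three events fixed while varying the time coordinate of the middle event and obtain

$$x_A = \frac{x_1 + x_2}{2}, \quad v_A = \frac{\Delta x}{t - t_1}, \quad x_B = \frac{x_2 + x_3}{2}, \quad v_B = \frac{\Delta x}{t_3 - t}. \tag{12}$$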

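These expressions are functions of the single variable t, with respect to which we take the derivatives

$$\frac{dv_A}{dt} = -\frac{\Delta x}{(t - t_1)^2} = -\frac{v_A}{t - t_1}, \tag{13a}$$

and

$$\frac{dv_B}{dt} = \frac{\Delta x}{(t_3 - t)^2} = \frac{v_B}{t_3 - t}. \tag{13b}$$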

Despite the form of Eq. (13), the derivatives of velocities are not accelerations, because the x separations are held constant while the time is varied.

As before [see Eq. (6)],

$$S_{AB} = L_A\,(t - t_1) + L_B\,(t_3 - t). \tag{14}$$

Note that students sometimes misinterpret the time differences in parentheses in Eq. (14) as arguments of L.

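We find the value of the time t for the action to be a minimum by setting the derivative of $S_{AB}$ equal to zero:

$$\frac{dS_{AB}}{dt} = \frac{\partial L_A}{\partial v_A}\frac{dv_A}{dt}\,(t - t_1) + L_A + \frac{\partial L_B}{\partial v_B}\frac{dv_B}{dt}\,(t_3 - t) - L_B = 0. \tag{15}$$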

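If we substitute Eq. (13) into Eq. (15) and rearrange the result, we find

$$\frac{\partial L_A}{\partial v_A}\,v_A - L_A = \frac{\partial L_B}{\partial v_B}\,v_B - L_B. \tag{16}$$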

Because the action is additive, Eq. (16) is valid for every segment of the world line and identifies the function $v\,\partial L/\partial v - L$ as a constant of the motion. By substituting Eq. (1) for the Lagrangian into $v\,\partial L/\partial v - L$ and carrying out the partial derivatives, we can show that the constant of the motion corresponds to the total energy $E = T + V$.
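The time-variation argument can be illustrated in the same spirit. The Python sketch below is a minimal example under assumptions: the potential V(x) = mgx, the event coordinates, and the helper names are hypothetical, and ternary search stands in for setting the derivative to zero. It fixes the three positions, minimizes the summed action over the time t of the middle event, and then evaluates mv²/2 + V on each segment. The two values should agree, while the velocities differ.

```python
def minimize_unimodal(f, lo, hi, iters=200):
    """Ternary search for the minimizer of a unimodal function on [lo, hi]."""
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if f(a) < f(b):
            hi = b
        else:
            lo = a
    return 0.5 * (lo + hi)


def action_vs_time(t, t1, t3, x1, dx, m, potential):
    """Summed action with fixed positions x1, x1 + dx, x1 + 2*dx and a
    variable time t for the middle event."""
    x_a, x_b = x1 + 0.5 * dx, x1 + 1.5 * dx
    v_a, v_b = dx / (t - t1), dx / (t3 - t)
    lagr_a = 0.5 * m * v_a**2 - potential(x_a)
    lagr_b = 0.5 * m * v_b**2 - potential(x_b)
    return lagr_a * (t - t1) + lagr_b * (t3 - t)


# Assumed illustrative values: a particle rising against uniform gravity.
m, g = 1.0, 9.81
t1, t3, x1, dx = 0.0, 0.3, 0.0, 0.1


def potential(x):
    return m * g * x


eps = 1e-6  # keep the search strictly inside (t1, t3)
t_best = minimize_unimodal(
    lambda t: action_vs_time(t, t1, t3, x1, dx, m, potential), t1 + eps, t3 - eps
)

v_a = dx / (t_best - t1)
v_b = dx / (t3 - t_best)
energy_a = 0.5 * m * v_a**2 + potential(x1 + 0.5 * dx)
energy_b = 0.5 * m * v_b**2 + potential(x1 + 1.5 * dx)
print(t_best, energy_a, energy_b)
```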

V. SUMMARY

Our derivation and the extension to multiple degrees of freedom in the Appendix allow the introduction of Lagrange's equations and their connection to the principle of least action without the apparatus of the calculus of variations. The derivations also may be employed as a preview of Lagrangian mechanics before its more formal derivation using variational calculus.

One of us (ST) has successfully employed these derivations and the resulting Lagrange equations with a small group of talented high school students. They used the equations to solve problems presented in the Physics Olympiad. The excitement and enthusiasm of these students lead us to hope that others will undertake trials with larger numbers and a greater variety of students.

ACKNOWLEDGMENT

The authors would like to express thanks to an anonymous referee for his or her valuable criticisms and suggestions, which improved this paper.

APPENDIX: EXTENSION TO MULTIPLE DEGREES OF FREEDOM

We discuss Lagrange's equations for a system with multiple degrees of freedom, without pausing to discuss the usual conditions assumed in the derivations, because these can be found in standard advanced mechanics texts.³

Consider a mechanical system described by the following Lagrangian:

$$L = L(q_1, q_2, \ldots, q_s, \dot q_1, \dot q_2, \ldots, \dot q_s, t), \tag{17}$$

where the $q_i$ are independent generalized coordinates and the dot over q indicates a derivative with respect to time. The subscript s indicates the number of degrees of freedom of the system. Note that we have generalized to a Lagrangian that is an explicit function of time t. The specification of all the values of all the generalized coordinates $q_i$ in Eq. (17) defines a configuration of the system. The action S summarizes the evolution of the system as a whole from an initial configuration to a final configuration, along what might be called a world line through multidimensional space-time. Symbolically we write:

$$S = \int_{\text{initial configuration}}^{\text{final configuration}} L(q_1, \ldots, q_s, \dot q_1, \ldots, \dot q_s, t)\, dt. \tag{18}$$

The generalized principle of least action requires that the value of S be a minimum for the actual evolution of the system symbolized in Eq. (18). We make an argument similar to that in Sec. III for the one-dimensional motion of a particle in a potential. If the principle of least action holds for the entire world line through the intermediate configurations of L in Eq. (18), it also holds for an infinitesimal change in configuration anywhere on this world line.

Let the system pass through three infinitesimally close configurations in the ordered sequence 1, 2, 3 such that all generalized coordinates remain fixed except for a single coordinate q at configuration 2. Then the increment of the action from configuration 1 to configuration 3 can be considered to be a function of the single variable q. As a consequence, for each of the s degrees of freedom, we can make an argument formally identical to that carried out from Eq. (3) through Eq. (9). Repeated s times, once for each generalized coordinate $q_i$, this derivation leads to s scalar Lagrange equations that describe the motion of the system:

$$\frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right) = 0 \qquad (i = 1, 2, \ldots, s). \tag{19}$$

The inclusion of time explicitly in the Lagrangian (17) does not affect these derivations, because the time coordinate is held fixed in each equation.

Suppose that the Lagrangian (17) is not a function of a given coordinate $q_k$. An argument similar to that in Sec. IV A tells us that the corresponding generalized momentum $\partial L/\partial \dot q_k$ is a constant of the motion. As a simple example of such a generalized momentum, we consider the angular momentum of a particle in a central potential. If we use polar coordinates $r$, $\theta$ to describe the motion of a single particle in the plane, then the Lagrangian has the form $L = T - V = m(\dot r^2 + r^2 \dot\theta^2)/2 - V(r)$, and the angular momentum of the system is represented by $\partial L/\partial \dot\theta$.
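The central-potential example can be checked with a few lines of Python. This is a sketch under assumptions: the attractive potential V(r) = -k/r, the sample state, and the helper names are choices made for illustration, and the partial derivatives are estimated by central differences. It confirms that the Lagrangian does not depend on the angle and that the generalized momentum conjugate to the angle equals m r² times the angular velocity.

```python
def central_lagrangian(r, theta, r_dot, theta_dot, m=1.0, k=1.0):
    """L = T - V in plane polar coordinates, with the assumed
    potential V(r) = -k / r. The angle theta does not appear."""
    kinetic = 0.5 * m * (r_dot**2 + r**2 * theta_dot**2)
    return kinetic + k / r


def partial(f, args, index, h=1e-5):
    """Central-difference estimate of the partial derivative of f
    with respect to its argument number `index`."""
    up, down = list(args), list(args)
    up[index] += h
    down[index] -= h
    return (f(*up) - f(*down)) / (2.0 * h)


# Sample state (r, theta, r_dot, theta_dot), values assumed for illustration.
state = (2.0, 0.7, 0.1, 0.3)

dl_dtheta = partial(central_lagrangian, state, 1)  # theta is ignorable
p_theta = partial(central_lagrangian, state, 3)    # generalized momentum

# Compare with m * r^2 * theta_dot for m = 1.
print(dl_dtheta, p_theta, 1.0 * state[0] ** 2 * state[3])
```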

If the Lagrangian (17) is not an explicit function of time, then a derivation formally equivalent to that in Sec. IV B (with time as the single variable) shows that the function $\left(\sum_i \dot q_i\,\partial L/\partial \dot q_i\right) - L$, sometimes called¹² the energy function h, is a constant of the motion of the system, which in the simple cases we cover¹³ can be interpreted as the total energy E of the system.

If the Lagrangian (17) depends explicitly on time, then this derivation yields the equation $dh/dt = -\partial L/\partial t$.

REFERENCES

¹ We take "equations of motion" to mean relations between the accelerations, velocities, and coordinates of a mechanical system. See L. D. Landau and E. M. Lifshitz, Mechanics (Butterworth-Heinemann, Oxford, 1976), Chap. 1, Sec. 1.
² Besides its expression in scalar quantities (such as kinetic and potential energy), Lagrangian quantities lead to the reduction of dimensionality of a problem, employ the invariance of the equations under point transformations, and lead directly to constants of the motion using Noether's theorem. More detailed explanation of these features, with a comparison of analytical mechanics to vectorial mechanics, can be found in Cornelius Lanczos, The Variational Principles of Mechanics (Dover, New York, 1986), pp. xxi–xxix.
³ Chapter 1 in Ref. 1 and Chap. V in Ref. 2; Gerald J. Sussman and Jack Wisdom, Structure and Interpretation of Classical Mechanics (MIT, Cambridge, 2001), Chap. 1; Herbert Goldstein, Charles Poole, and John Safko, Classical Mechanics (Addison–Wesley, Reading, MA, 2002), 3rd ed., Chap. 2. An alternative method derives Lagrange's equations from d'Alembert's principle; see Goldstein, Sec. 1.4.
⁴ Our derivation is a modification of the finite difference technique employed by Euler in his path-breaking 1744 work, "The method of finding plane curves that show some property of maximum and minimum." Complete references and a description of Euler's original treatment can be found in Herman H. Goldstine, A History of the Calculus of Variations from the 17th Through the 19th Century (Springer-Verlag, New York, 1980), Chap. 2. Cornelius Lanczos (Ref. 2, pp. 49–54) presents an abbreviated version of Euler's original derivation using contemporary mathematical notation.
⁵ R. P. Feynman, R. B. Leighton, and M. Sands, The Feynman Lectures on Physics (Addison–Wesley, Reading, MA, 1964), Vol. 2, Chap. 19.
⁶ See Ref. 5, p. 19-8 or, in more detail, J. Hanc, S. Tuleja, and M. Hancova, "Simple derivation of Newtonian mechanics from the principle of least action," Am. J. Phys. 71 (4), 386–391 (2003).
⁷ There is no particular reason to use the midpoint of the segment in the Lagrangian of Eq. (2). In Riemann integrals we can use any point on the given segment. For example, all our results would be the same if we used the coordinates of either end of each segment instead of the coordinates of the midpoint. The repositioning of this point can be the basis of an exercise to test student understanding of the derivations given here.
⁸ A zero value of the derivative most often leads to the world line of minimum action. It is possible also to have a zero derivative at an inflection point or saddle point in the action (or the multidimensional equivalent in configuration space). So the most general term for our basic law is the principle of stationary action. The conditions that guarantee the existence of a minimum can be found in I. M. Gelfand and S. V. Fomin, Calculus of Variations (Prentice–Hall, Englewood Cliffs, NJ, 1963).
⁹ Reference 1, Chap. 2 and Ref. 3, Goldstein et al., Sec. 2.7.
¹⁰ The most fundamental justification of conservation laws comes from symmetry properties of nature as described by Noether's theorem. Hence energy conservation can be derived from the invariance of the action by temporal translation and conservation of momentum from invariance under space translation. See N. C. Bobillo-Ares, "Noether's theorem in discrete classical mechanics," Am. J. Phys. 56 (2), 174–177 (1988) or C. M. Giordano and A. R. Plastino, "Noether's theorem, rotating potentials, and Jacobi's integral of motion," ibid. 66 (11), 989–995 (1998).
¹¹ Our approach also can be related to symmetries and Noether's theorem, which is the main subject of J. Hanc, S. Tuleja, and M. Hancova, "Symmetries and conservation laws: Consequences of Noether's theorem," Am. J. Phys. (to be published).
¹² Reference 3, Goldstein et al., Sec. 2.7.
¹³ For the case of generalized coordinates, the energy function h is generally not the same as the total energy. The conditions for conservation of the energy function h are distinct from those that identify h as the total energy. For a detailed discussion see Ref. 12. Pedagogically useful comments on a particular example can be found in A. S. de Castro, "Exploring a rheonomic system," Eur. J. Phys. 21, 23–26 (2000) and C. Ferrario and A. Passerini, "Comment on Exploring a rheonomic system," ibid. 22, L11–L14 (2001).

FIGURES

Fig. 1. An infinitesimal section of the world line approximated by two straight-line segments.

Fig. 2. Derivation of Lagrange's equations from the principle of least action. Points 1 and 3 are on the true world line. The world line between them is approximated by two straight-line segments (as in Fig. 1). The arrows show that the x coordinate of the middle event is varied. All other coordinates are fixed.

Fig. 3. A derivation showing that the energy is a constant of the motion. Points 1 and 3 are on the true world line, which is approximated by two straight-line segments (as in Figs. 1 and 2). The arrows show that the t coordinate of the middle event is varied. All other coordinates are fixed.

FOOTNOTES

^a)Electronic mail: jozef.hanc@tuke.sk

^b)Electronic mail: eftaylor@mit.edu; http://www.eftaylor.com

^c)Electronic mail: tuleja@stonline.sk
