American Journal of Physics, Vol. 72, No. 4, pp. 522–527, April 2004
©2004 American Association of Physics Teachers. All rights reserved.

Getting the most action out of least action: A proposal

Thomas A. Moore^a)

Department of Physics and Astronomy, Pomona College, 610 N. College Avenue, Claremont, California 91711

Received: 8 September 2003; accepted: 12 December 2003

Lagrangian methods lie at the foundation of contemporary theoretical physics. Several recent articles have explored the possibility of making the principle of least action and Lagrangian methods a part of the first-year physics curriculum. I examine some of this proposal's implications for subsequent courses in the undergraduate physics major, and focus on the influence that this proposal might have on the selection of topics and the opportunities this proposal presents for teaching these courses in a more contemporary way. Many of these ideas are relevant even if students first learn Lagrangian methods in a sophomore mechanics course. © 2004 American Association of Physics Teachers.

I. INTRODUCTION

Hamilton's principle,¹ more generally known as the principle of least action (particularly since the publication of Feynman's lectures²), has played a seminal role in the development of theoretical physics in the latter part of the 20th century. Lagrangian methods that extend this principle lie at the heart of general relativity, quantum field theory, and the standard model of particle physics, and such methods play a crucial role in conceptually framing and expressing these theories.

Edwin Taylor has recently argued that this principle provides a simple but powerful framework for unifying Newtonian mechanics, relativity, and quantum mechanics,³ and he and his collaborators have begun to lay the foundations for teaching the principle in the introductory course.⁴⁻⁸ If we presume that this proposal is possible and desirable, it has implications for subsequent courses in the physics major. In this article, I will examine some of these implications, focusing on new opportunities that teaching least action in the introductory course makes possible, as well as on what changes in upper-level courses might best support these opportunities.

My purpose is not to describe a new upper-level curriculum in detail. Instead, I hope that by presenting an overview of the issues and providing references to some available resources, I will provide some guidance to those who might develop such curricula. This article also might be interesting to those seeking to modernize the upper-level courses that follow an intermediate mechanics course that discusses Lagrangian methods.

II. THE MODERN PHYSICS COURSE

Most upper-level physics curricula open with a course in "modern physics," which, for the sake of argument in what follows, I will assume to be a sophomore-level class that at least discusses special relativity, some basic quantum theory, atomic and nuclear physics, and perhaps some particle physics. In a curriculum where the classical principle of least action is taught in introductory physics, the modern physics course might be reworked somewhat to address two important goals: connect the classical principle of least action with quantum mechanics and relativity, and build a solid foundation for using the principle in subsequent courses. I will discuss the link to quantum mechanics first (for reasons that will become clearer as we go).

Taylor, Vokos, O'Meara, and Thornber have recently published a curricular plan that connects quantum mechanics with the principle of least action at a level that seems appropriate for sophomores.⁹ This plan starts with the students working through the first half of Feynman's popular book QED.¹⁰ Feynman's book demonstrates that it is possible to explain the results of classical optics in a variety of practical situations using the following simple model: a photon explores all possible paths between emission and detection; we imagine the photon traveling along each possible path to carry an arrow that rotates a number of times proportional to the action along that path; and the probability that the photon will be observed at the detection event is proportional to the squared length of the vector sum of the final arrows for all the paths that the photon explores. Sophomore-level majors (unlike Feynman's intended audience) should be able to understand that the arrows are visual representations of complex numbers, but this visualization is powerful and useful even when students can do the calculations with complex numbers.

The fundamental problem with the "explore all paths" model is that actually summing the arrows over all possible paths is a daunting task. Taylor and his collaborators make this task simpler by providing computer programs that compute the sums for various simple paths so that students can explore the implications of the model. Building on this foundation, Taylor and his collaborators (aided by more programs) then extend Feynman's description to help students discover methods for handling free electrons and then electrons with potential energy, the concept of a wave function, the concept of the free-particle propagator, and ultimately the concept of a bound-state wave function, all with very little mathematics.

The method Taylor and his collaborators use to develop the free-particle propagator illustrates their general approach to making difficult ideas more accessible. The key to making the "explore many paths" approach practical is to get rid of the summation over all possible paths. The world line through spacetime between a given starting event a and a given ending event b that has the least action is by definition the world line along which the particle's arrow undergoes the fewest turns from start to finish. With the help of the computer programs, a student can find that the only paths that contribute significantly to the arrow representing the final sum at event b are those contributing final arrows that make an angle of less than π with the arrow contributed by the world line of least action; neither the length nor the direction of the sum is much affected if one ignores all other paths. Indeed, one finds that for a free particle, the direction (in the complex plane) of the arrow representing the sum at b is always rotated by 45° relative to the direction of the arrow contributed by the least-action world line at b (which in turn is simply a rotated version of the arrow at the initial event a), and the sum's magnitude depends on how far a path must deviate from the least-action world line to yield a contributed arrow that makes an angle of π with the least-action arrow.
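The 45° rotation described above can be illustrated with a short numerical experiment. The sketch below is not one of the programs of Ref. 9; it is a minimal stand-in that assumes units with m = ħ = 1 and considers only a restricted family of paths, namely free-particle world lines from (t = 0, x = 0) to (t = T, x = 0) with a single kink at the midpoint time. The function name `kinked_path_action` and all parameter values are illustrative assumptions.

```python
import numpy as np

# Units chosen so that m = hbar = 1 (an assumption made for illustration).
m, hbar, T = 1.0, 1.0, 1.0

def kinked_path_action(x_mid):
    """Free-particle action for a path 0 -> x_mid -> 0 with the kink at T/2."""
    speed = x_mid / (T / 2.0)                    # same speed on both straight legs
    return 2.0 * (0.5 * m * speed**2) * (T / 2.0)

# Sample the kink position finely enough to resolve the fastest arrow rotation.
x_mid = np.linspace(-50.0, 50.0, 200_001)
dx = x_mid[1] - x_mid[0]

arrows = np.exp(1j * kinked_path_action(x_mid) / hbar)    # one arrow per path
total = arrows.sum() * dx                                 # vector sum of the arrows

direct_arrow = np.exp(1j * kinked_path_action(0.0) / hbar)  # least-action path
relative_angle_deg = np.degrees(np.angle(total / direct_arrow))
print(f"sum is rotated by {relative_angle_deg:.1f} degrees relative to the direct arrow")
```

For this family the summed arrow comes out rotated by very nearly 45° relative to the arrow of the straight (least-action) world line. The sense of the rotation depends on sign conventions and on how the sum is normalized, so only its size should be read from this sketch.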

Therefore it should be possible in principle to forgo the sum entirely and calculate the arrow representing the sum over all paths by rotating the direction of the arrow contributed by the single least-action path by 45° and multiplying by a factor that specifies the degree to which small deviations from this path affect the angle of the path's contributed arrow. For a free particle, this factor can only be a function of the particle's mass m, Planck's constant h, the time interval between the initial and final events, and the spatial separation of those events. Taylor and his collaborators⁹ argue that we can determine the correct expression for this factor by assuming that a free-particle wave function that is uniform over space at a certain time must remain uniform as time passes (a result required by symmetry). We can consider any wave function at a given time to be a set of arrows (that is, complex numbers) distributed over space. Assume that we know the wave function arrows ψ(x_i,t₀) at various positions x_i at some initial time t₀. The arrow ψ(x,t) at a different position x and later time t is found by determining the sum of the arrows contributed by all paths starting from the arrow ψ(x_i,t₀) at a given x_i at time t₀ and arriving at position x and time t, and then summing over all x_i (see Fig. 1). For the free particle, we can do the sum over all paths by calculating the arrow contributed by the least-action path from x_i, t₀ to x, t (which is a straight world line for a free particle) and using a formula (rule) involving h, m, t–t₀, and x–x_i to convert this arrow to an arrow representing the sum over all paths. By using a program constructed for this purpose, students can experiment with different rules until they find one that preserves the uniform wave function. The process is quite intuitive and requires very little mathematics.
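The kind of experiment described above can be sketched in a few lines. The code below is not the program of Ref. 9; it is a hypothetical stand-in that assumes units with m = ħ = 1, truncates the sum over initial positions at a finite half-width, and tries three made-up candidate rules that differ only in how they depend on the time step. The names `propagate_uniform` and `candidate_rules` are illustrative.

```python
import numpy as np

m, hbar = 1.0, 1.0                 # illustrative units (assumption)
h = 2.0 * np.pi * hbar

def propagate_uniform(rule, dt, half_width=200.0, n=400_001):
    """Sum rule(dt) * (least-action arrow) * psi over initial positions x_i."""
    u = np.linspace(-half_width, half_width, n)        # u = x_i - x
    du = u[1] - u[0]
    psi_initial = np.ones_like(u, dtype=complex)       # uniform wave function
    direct_arrow = np.exp(1j * m * u**2 / (2.0 * hbar * dt))
    return np.sum(rule(dt) * direct_arrow * psi_initial) * du

# Hypothetical candidate rules a student might try; only the dt dependence differs.
candidate_rules = {
    "(m / i h dt)^(1/2)": lambda dt: (m / (1j * h * dt)) ** 0.5,
    "(m / i h dt)^1":     lambda dt: (m / (1j * h * dt)) ** 1.0,
    "no dt dependence":   lambda dt: (m / (1j * h)) ** 0.5,
}

results = {}
for name, rule in candidate_rules.items():
    results[name] = [propagate_uniform(rule, dt) for dt in (1.0, 2.0, 4.0)]
    print(name, np.round(results[name], 3))
```

Of these three candidates, only the first returns an amplitude close to 1 for every time step, so only it keeps a uniform wave function uniform. The small residual deviations come from cutting the sum off at a finite half-width.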

Figure 1.

Once we know how to generate a future wave function from a past one, we can generalize to particles that are not free and begin to explore both stationary and dynamic states of bound particles.⁹ After developing the general concept of a stationary state, we might introduce the Schrödinger equation and explore bound states of other systems in a more conventional manner.

The approach in Ref. 9 is plausibly accessible to sophomore-level physics majors, and has the advantage of giving these students a deeper, more intuitive, and perhaps more engaging understanding of quantum mechanics than one typically gets in a modern physics course. Moreover, this approach is the only way that I know in which we might plausibly link the classical principle of least action and quantum mechanics at this level. This approach, however, will take a fair amount of class time, and thus will probably displace some other topics usually covered in such a course.¹¹

Next I would like to discuss the treatment of special relativity in the modern physics course. The argument about the propagator assumes that the reader understands what events, world lines, and spacetime diagrams are (Fig. 1 is essentially a spacetime diagram). Therefore, a careful treatment of these concepts in the relativity portion of the course is essential for the success of the quantum section. My experience is that taking the time to teach students to use spacetime diagrams and the geometric analogy to relativity before teaching the Lorentz transformation equations greatly improves their understanding. Students understand much better the meaning of the Lorentz transformation equations after they have seen a spacetime diagram that shows the axes for two different reference frames, and after they have understood the crucial differences between coordinate measurements and the invariant spacetime interval.

The other topic that needs to be explored is the concept of a four-vector. This concept not only makes the relationship between energy, momentum, and mass much easier to understand, but it provides an essential foundation for any future application of Lagrangian methods to special relativity, general relativity, or electricity and magnetism. This course is not where we should introduce index notation and the Einstein summation convention, but most students at this level understand column vectors and matrix multiplication, and we can go a long way with these tools and explore the most crucial characteristics of four-vectors (such as their transformation properties, the invariance of a four-vector's magnitude, the invariance of the dot product of four-vectors, and the frame-independence of four-vector equations).

This part of the course also should link the classical principle of least action with the principle that a straight world line is the world line of longest proper time between two given events (the latter is easily proved using an elementary argument¹²,¹³ and should be a part of the development of the concept of proper time). The action S for a relativistic free particle for a given world line can be written as

where c is the speed of light, and the integral yields the total proper time measured along the path. The minus sign ensures that the action is a minimum for whatever path has maximal proper time, and the factor mc² gives the action the appropriate units and the correct linear dependence on the particle's mass.
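The statement that a straight world line has the longest proper time between two given events is easy to check numerically. The sketch below assumes units with c = 1 and compares the straight world line between two events with world lines that detour through an intermediate event; the function name `proper_time` and the chosen events are illustrative assumptions.

```python
import math

def proper_time(events, c=1.0):
    """Total proper time along a piecewise-straight world line.

    events is a list of (t, x) pairs in one frame; each leg must be timelike.
    """
    total = 0.0
    for (t0, x0), (t1, x1) in zip(events, events[1:]):
        dt, dx = t1 - t0, x1 - x0
        total += math.sqrt(dt**2 - (dx / c) ** 2)   # invariant interval of this leg
    return total

start, end = (0.0, 0.0), (10.0, 0.0)
straight = proper_time([start, end])
for x_mid in (1.0, 2.0, 4.0):
    kinked = proper_time([start, (5.0, x_mid), end])
    print(f"detour to x = {x_mid}: proper time {kinked:.3f} (straight: {straight:.3f})")
```

Every detour yields a shorter proper time than the straight world line, so an action equal to a negative constant times the proper time is smallest along the straight world line.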

We can write Eq. (1) in the form of a coordinate-time integration over a Lagrangian as follows:

which implies that

A simple application of the Euler–Lagrange equations and some basic calculus establishes that the particle's velocity components must be constant. We see, therefore, that we can develop a relativistic principle of least action for a free particle and obtain the constant-velocity result that we know must be true from other arguments. This result supports the idea (used in the quantum section) that the world line of least action for even a relativistic free particle is indeed a straight world line, and Eq. (2b) is an essential first step in developing an electromagnetic Lagrangian.
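For reference, forms of Eqs. (1), (2a), and (2b) consistent with the description in the text, together with the Euler–Lagrange step, can be written out as follows. This is a reconstruction from the surrounding wording rather than a reproduction of the displayed equations; τ denotes the proper time along the world line and v the particle's coordinate velocity, and the equation tags are inferred from the cross-references in the text.

```latex
\begin{align}
S &= -mc^{2}\int d\tau , \tag{1}\\
S &= -mc^{2}\int \sqrt{1 - v^{2}/c^{2}}\; dt , \tag{2a}\\
L &= -mc^{2}\sqrt{1 - v^{2}/c^{2}} . \tag{2b}
\end{align}
% Euler-Lagrange equation for the x component (y and z are analogous):
\begin{equation}
\frac{\partial L}{\partial v_x} = \frac{m v_x}{\sqrt{1 - v^{2}/c^{2}}},
\qquad
\frac{\partial L}{\partial x} = 0
\quad\Longrightarrow\quad
\frac{d}{dt}\!\left[\frac{m v_x}{\sqrt{1 - v^{2}/c^{2}}}\right] = 0 .
\end{equation}
```

Because all three components of the relativistic momentum are constant, its magnitude is constant, which fixes the speed and therefore each velocity component.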

Such a discussion would imply a relativity section that is three to four weeks long, which is more time than is usually spent on the topic. In what follows, however, I will show that this discussion would open up significant opportunities for subsequent courses. Because applications of relativity are increasingly important in modern technology, a solid understanding of relativity is more important to physicists and engineers now than it was even two decades ago.¹⁴

III. THE INTERMEDIATE MECHANICS COURSE

The next course a typical physics major might encounter would be one in intermediate classical mechanics, which typically discusses subjects such as orbital motion, damped and driven harmonic oscillators, rotation of rigid bodies, and perhaps even some chaos and non-linear dynamics. Texts for this course commonly include a discussion of the principle of least action and Lagrangian methods.¹⁵ If these ideas are thoroughly discussed in the introductory course, then some time would become available in this course. My recommendation is that at least some of this extra time be spent exploring the application of Lagrangian methods to continuous media. This application is important because the same methods apply to fields, so this discussion of continuous media would provide essential background for any subsequent application of Lagrangian methods to the electromagnetic field. Reference 16 presents a very nice discussion of continuous media.

IV. QUANTUM MECHANICS

Most undergraduate major programs include a quantum mechanics course in the junior or senior year. I will assume that students in this course are familiar with partial derivatives, complex numbers, looking up integrals, and Taylor-series expansions.

A crucial first step in this course would be to firmly and formally connect the "explore all paths" model presented in the sophomore course with the time-dependent Schrödinger equation. Once this connection has been made, the rest of the course can be taught in the standard way. In what follows, I will briefly sketch the logic of the argument; more details can be found in Ref. 17.

In the sophomore-level course, students should have discovered that for a free particle, the propagator function that specifies the contribution to the quantum amplitude (arrow) ψ(x,t) made by arrows of the particle's wave function within a sufficiently small range Δx_i around the position x_i at an earlier time t₀ is given by

where S_direct is the action measured along the straight world line from x_i, t₀ to x, t. For a free particle moving in one dimension with a constant potential energy V, the value of S_direct is simply

where Δt ≡ t–t₀ is the (coordinate) time difference between the events and u ≡ x_i–x. So in this case, we have

To find the complete wave function amplitude ψ(x,t), we must sum Kψ(x_i,t₀)Δx_i over all possible initial positions x_i, as schematically shown in Fig. 1. Note that the middle factor in Eq. (5) is the only one that changes as x_i varies (because u then varies), and this factor rotates the phase angle of the resulting complex amplitude. As discussed, arrows rotated by an angle greater than π relative to the arrow for u = 0 do not contribute significantly to the result, and we really only need to be concerned about the contributions from the initial positions x_i close enough to the final position x so that

Equation (6) proves to be the key to using the "explore all paths" approach to derive the Schrödinger equation. Note that if we choose the time step Δt = t–t₀ between the initial and final wave functions to be infinitesimal, then u also must be infinitesimal, which means that the positions of points along all the paths in Fig. 1 that contribute significantly will not be much different from x. Therefore, even if the particle's potential energy varies with position, its value over the range of interest for calculating ψ(x,t) will be essentially equal to V(x), its value at x, so Eqs. (3)–(5) apply even to the case of nonuniform V(x) in the limit Δt → 0. The sum over all x_i in this limit therefore becomes

because x_i = x + u and dx_i = du. If we expand the exponential involving V to order Δt and ψ(x + u,t₀) to order u², and do some integrals of the form ∫ uⁿ e^(–au²) du from –∞ to ∞,¹⁸ we find that

If we subtract ψ(x,t₀) from both sides, multiply through by iħ/Δt, and take the limit Δt → 0, we find the time-dependent Schrödinger equation for one dimension. (It is not very difficult to generalize this derivation to three dimensions, but it does not yield any deeper understanding.)
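The steps sketched in this section can be written out explicitly. What follows is a reconstruction from the verbal description, along the lines of the argument in Ref. 17, and not a reproduction of the displayed equations. It assumes the standard free-particle normalization for the propagator, and the tags (3)–(5) are inferred from the cross-references in the text.

```latex
\begin{align}
K &= \left(\frac{m}{2\pi i\hbar\,\Delta t}\right)^{1/2} e^{iS_{\rm direct}/\hbar}, \tag{3}\\
S_{\rm direct} &= \frac{m u^{2}}{2\,\Delta t} - V\,\Delta t, \tag{4}\\
K &= \left(\frac{m}{2\pi i\hbar\,\Delta t}\right)^{1/2}
     e^{i m u^{2}/2\hbar\Delta t}\; e^{-iV\Delta t/\hbar}. \tag{5}
\end{align}
% Sum over initial positions, using x_i = x + u and dx_i = du:
\begin{equation}
\psi(x,t) = \left(\frac{m}{2\pi i\hbar\,\Delta t}\right)^{1/2}
\int_{-\infty}^{\infty} e^{i m u^{2}/2\hbar\Delta t}\; e^{-iV(x)\Delta t/\hbar}\;
\psi(x+u,t_0)\, du .
\end{equation}
% Expansions to the orders named in the text:
\begin{align}
e^{-iV\Delta t/\hbar} &\approx 1 - \frac{iV\Delta t}{\hbar}, &
\psi(x+u,t_0) &\approx \psi + u\,\frac{\partial\psi}{\partial x}
  + \frac{u^{2}}{2}\,\frac{\partial^{2}\psi}{\partial x^{2}} .
\end{align}
% Gaussian integrals with a = m/(2 i \hbar \Delta t):
\begin{align}
\int_{-\infty}^{\infty} e^{-au^{2}}du &= \sqrt{\frac{\pi}{a}}, &
\int_{-\infty}^{\infty} u\,e^{-au^{2}}du &= 0, &
\int_{-\infty}^{\infty} u^{2}e^{-au^{2}}du &= \frac{1}{2a}\sqrt{\frac{\pi}{a}} .
\end{align}
% Result to first order in \Delta t, and the limit \Delta t \to 0:
\begin{align}
\psi(x,t) &= \psi(x,t_0) - \frac{i\,\Delta t}{\hbar}\,V(x)\,\psi(x,t_0)
  + \frac{i\hbar\,\Delta t}{2m}\,\frac{\partial^{2}\psi}{\partial x^{2}}
  + O(\Delta t^{2}), \\
i\hbar\,\frac{\partial\psi}{\partial t} &=
  -\frac{\hbar^{2}}{2m}\,\frac{\partial^{2}\psi}{\partial x^{2}} + V(x)\,\psi .
\end{align}
```

The factor (π/a)^(1/2) = (2πiħΔt/m)^(1/2) from the Gaussian integrals cancels the propagator's prefactor, which is why the zeroth-order term reproduces ψ(x,t₀) exactly.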

V. ELECTRICITY AND MAGNETISM

The undergraduate curriculum also typically includes a course in electricity and magnetism offered at the sophomore, junior, or senior level. I will assume that this course is offered for juniors and/or seniors and that students have taken a modern physics course and intermediate mechanics course of the type already described.

The first task in this course would be to discuss index notation and the Einstein summation convention, the Lorentz transformation properties of scalars, vectors, and covectors, and the four-gradient. My experience is that juniors and seniors can become comfortable with this material within four to five class sessions if the material is taught carefully.¹⁹ The relativistic Lorentz force law provides a good physical context for practicing the notation. In appropriate units,²⁰ this law can be written as

where

and u is the charged particle's four-velocity with components u^t = [1–v²/c²]^(–1/2) ≡ γ, uⁱ = γvⁱ/c; p^μ = mcu^μ is the particle's four-momentum; q is its charge; τ is the proper time measured along its world line; and I am using a metric with a timelike signature (+–––). Equation (9) involves scalars, vectors, covectors, and tensors, and yet when the sums are written out explicitly, the three spatial components reduce to the Lorentz law taught in introductory physics and the time component reduces to conservation of energy. By examining the transformation properties of all the pieces, students can demonstrate that Eq. (9) must have the same form in all reference frames. It also is a good exercise for students to show that the antisymmetric nature of F^μν ensures that d(p^μp_μ)/dτ = 0, meaning that the invariant p^μp_μ, and hence the particle's rest mass m, is fixed.

To fully connect electricity and magnetism with the principle of least action, we also must develop the concept of the magnetic potential A. Textbooks at this level avoid or marginalize the magnetic potential, partly because when it is presented in the usual way, it can be a tricky and abstract concept. However, there are ways to make the magnetic potential more accessible,²¹ and there are some good reasons to discuss it fully even if we ignore the principle of least action.²²

One possible story line for introducing the four-potential is made possible by the principle of least action. The action for a non-relativistic particle moving in a static electric field is

Our goal is to see if we can guess the appropriate relativistic action for this case. We already know how to generalize the kinetic energy part; the action for a free particle is given in Eq. (2a). Like this part, whatever we add to the action to account for the field must be a relativistic scalar. But is the electric potential φ a relativistic scalar or something else? By considering the field between the plates of a parallel-plate capacitor when viewed in a frame moving parallel to the plates, it can be quickly argued that φ must transform like the time component of a four-vector. So in a fully relativistic expression for the action, the electromagnetic field must appear in the form of a four-vector that we will call A^μ. However, the term we add to the Lagrangian must be a relativistic scalar, so the term must be the dot product of A^μ and some other four-vector. The only available four-vector in the case of a point particle is the particle's own four-velocity u^μ. So we propose a relativistic action of the form

where the components of A are the spatial components of A^μ. We can easily show that S in Eq. (11b) reduces to Eq. (10) in the non-relativistic limit (except for an extra rest energy term that does not affect the motion).

What kind of motion does this principle imply? Although we can quickly give the result in index notation, let me demonstrate the argument in a form that might be more accessible to a junior physics major. Consider the x component of the Euler–Lagrange equation. The partial derivatives of the Lagrangian in this case are

where p^x is the relativistic momentum. The Euler–Lagrange equations in this case therefore imply that

which implies that

The usual definition of the electric field is the force per unit charge on a test charge at rest, so we have

If we identify

we can easily see that Eq. (13b) is equivalent to the x component of the Lorentz force law given by Eq. (9a). We also can see quite generally that

and that Faraday's law and div B = 0 are identities implied by Eq. (15).
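The chain of reasoning in this section, from the proposed action to the Lorentz force law, can be written out as follows. This is a sketch reconstructed from the verbal description, not a reproduction of the displayed equations. It assumes A^μ = (φ, A) with the metric signature (+–––) stated in the text, and the unit convention of note 20, in which the magnetic field and potential carry a factor of c and the combination p + (q/c)A appears (as in note 21). Equation tags are omitted because the original numbering cannot be recovered with certainty.

```latex
% Non-relativistic action in a static electric field, and the proposed relativistic form:
\begin{align}
S &= \int \left[\tfrac{1}{2}mv^{2} - q\phi\right] dt , \\
S &= \int \left[-mc^{2} - q\,u_{\mu}A^{\mu}\right] d\tau
   = \int \left[-mc^{2}\sqrt{1 - v^{2}/c^{2}} - q\phi
     + \frac{q}{c}\,\mathbf{v}\cdot\mathbf{A}\right] dt .
\end{align}
% Partial derivatives of the Lagrangian for the x component:
\begin{align}
\frac{\partial L}{\partial v_x} &= p_x + \frac{q}{c}A_x , &
\frac{\partial L}{\partial x} &= -q\,\frac{\partial\phi}{\partial x}
  + \frac{q}{c}\left[v_x\frac{\partial A_x}{\partial x}
  + v_y\frac{\partial A_y}{\partial x} + v_z\frac{\partial A_z}{\partial x}\right].
\end{align}
% Euler-Lagrange equation, using
% dA_x/dt = \partial_t A_x + v_x \partial_x A_x + v_y \partial_y A_x + v_z \partial_z A_x :
\begin{equation}
\frac{dp_x}{dt} = -q\,\frac{\partial\phi}{\partial x}
  - \frac{q}{c}\frac{\partial A_x}{\partial t}
  + \frac{q}{c}\left[v_y\!\left(\frac{\partial A_y}{\partial x}
      - \frac{\partial A_x}{\partial y}\right)
  - v_z\!\left(\frac{\partial A_x}{\partial z}
      - \frac{\partial A_z}{\partial x}\right)\right].
\end{equation}
% Identifications:
\begin{align}
E_x &= -\frac{\partial\phi}{\partial x} - \frac{1}{c}\frac{\partial A_x}{\partial t}, &
B_z &= \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y}, &
B_y &= \frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x},
\end{align}
% which give the x component of the Lorentz force law and, in general,
\begin{align}
\frac{dp_x}{dt} &= q\left[E_x + \frac{1}{c}\left(v_yB_z - v_zB_y\right)\right], &
\mathbf{E} &= -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t}, &
\mathbf{B} &= \nabla\times\mathbf{A} .
\end{align}
```

Taking the curl of the expression for E and the divergence of the expression for B shows why Faraday's law and div B = 0 hold identically once the fields are defined through the potentials.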

Once we have gone this far, we can derive the source-dependent Maxwell equations from a plausible principle of least action.²³ Students should know from the treatment of continuous media in the intermediate mechanics course that a least-action principle for the electromagnetic field will involve integrating a Lagrangian density over all space and time. This Lagrangian density must be a relativistic scalar and must involve a term that is quadratic in the field quantities. These requirements imply that the resulting Euler–Lagrange equations will produce linear differential equations in the field, which is required for the field to obey the superposition principle. The only plausible candidates for such terms are A_μA^μ and F_μνF^μν. The first of these leads to absurd results; for example, the resulting field equations in the electrostatic case involve φ directly, not the derivatives of φ, which does not match Gauss' law. For the second case, we can argue that the sign of the integral has to be negative for the quantity to have a plausible minimum,²⁴ and that we must have a factor of 1/k (where k is Coulomb's constant) to make the units come out right. The Lagrangian density also must involve a term that is linear in the four-current J^μ = [ρ, j/c], where j is the ordinary current density, so that the sources will appear linearly in the field equation. The only plausible term with the right units in this case is A_μJ^μ. Therefore, the least-action principle for the electromagnetic field must be something like

where g^μν is the inverse flat-space metric and b is some unitless constant that specifies the relative magnitude and sign of the two terms. The field quantities A_μ play the role of coordinates and the gradients ∂_μA_ν play the role of "velocities." With only a bit of work,²⁵ the Euler–Lagrange equations yield

If we choose b = –16π, the time component of Eq. (17) matches Gauss' law. By writing them out, students can discover that the other components spell out the Ampere–Maxwell relation.

VI. OTHER INTERESTING APPLICATIONS OF LEAST ACTION

Because many electromagnetic circuits have direct mechanical analogs, we often can use Lagrangian methods to find equations of motion for such circuits, even for very complicated electromechanical circuits. It turns out that we can even handle realistic resistors by treating them as generalized external forces. These issues (along with many other applications of Lagrangian techniques) are beautifully discussed by Wells.²⁶

Another interesting source of applications of the principle of least action to fields at a fairly advanced level is a book written some time ago by Soper.²⁷ This book even includes a discussion of dissipative effects that might be appropriate in an upper-level course.

Once students are used to the principle of least action, other variational calculations become conceptually simpler. Several years ago, Van Baak discussed a variational technique that enables one to solve complicated steady-state circuits without invoking Kirchhoff's loop rule.²⁸ Because applying the loop rule requires careful attention to signs, it is a common source of student errors. Van Baak's approach avoids this problem.

Finally, I point out that if students have studied special relativity in some depth and have seen index notation and know about four-vectors, covectors, and tensors, they have a background that provides a great springboard for studying general relativity. The geodesic equations of motion can be treated as a least-action principle. One can even use a Lagrangian to find equations of motion for the gravitational field,²⁹ a method widely used by researchers in the field (particularly those doing numerical simulations).

VII. CONCLUSIONS

My goal has been to reflect on what kinds of changes to the upper-level curriculum might help students take full advantage of an introductory-level exploration of the principle of least action. I have only provided a broad sketch; there is much work to be done before these suggestions can become anything approaching a practical curriculum. The proposed changes would in some cases mean shifting priorities to allow sufficient time for the development of some of the techniques, and I have no doubt that some of the changes would present problems that would have to be worked out.

However, the proposed changes could create a very exciting upper-level curriculum that could more clearly display the deep underlying connections between mechanics, relativity, electrodynamics, and quantum mechanics. These changes would give us a thoroughly 21st-century physics curriculum that teaches viewpoints and techniques currently used by researchers. The principle of least action is among the most beautiful and powerful physical principles ever envisioned. With some vision and effort, the least action principle could become a greater part of the common background of physics undergraduates.

ACKNOWLEDGMENTS

I would like to thank E. F. Taylor, the editors of this special issue, and the reviewers for making valuable suggestions about how to improve this article.

REFERENCES


¹ Herbert Goldstein, Charles P. Poole, Jr., and John L. Safko, Classical Mechanics (Addison–Wesley, San Francisco, 2002), 3rd ed., Vol. 1, Chap. 2, pp. 34ff.
² Richard P. Feynman, Robert B. Leighton, and Matthew Sands, The Feynman Lectures on Physics (Addison–Wesley, Reading, MA, 1964), Vol. 2, Chap. 19, pp. 19–1ff.
³ Edwin F. Taylor, "A call to action," Am. J. Phys. 71, 423–425 (2003).
⁴ Jozef Hanc, Slavomir Tuleja, and Martina Hancova, "Simple derivation of Newtonian mechanics from the principle of least action," Am. J. Phys. 71, 386–391 (2003).
⁵ Jozef Hanc, Edwin F. Taylor, and Slavomir Tuleja, "Deriving Lagrange's equations using elementary calculus," Am. J. Phys. (submitted). See www.eftaylor.com/leastaction.html.
⁶ Edwin F. Taylor and Jozef Hanc, "From conservation of energy to the principle of least action: A story line," Am. J. Phys. (submitted). See www.eftaylor.com/leastaction.html.
⁷ Jozef Hanc, Slavomir Tuleja, and Martina Hancova, "Symmetries and conservation laws: Consequences of Noether's theorem," Am. J. Phys. (submitted). See www.eftaylor.com/leastaction.html.
⁸ Jozef Hanc, "The original Euler's calculus-of-variations method: Key to Lagrangian mechanics for beginners," Am. J. Phys. (submitted). See www.eftaylor.com/leastaction.html.
⁹ Edwin F. Taylor, Stamatis Vokos, John M. O'Meara, and Nora S. Thornber, "Teaching Feynman's sum-over-paths quantum theory," Comput. Phys. 12, 190–199 (1998). Current versions of the draft teaching materials and computer programs discussed in this article are available online at www.eftaylor.com/download.html#quantum.
¹⁰ Richard P. Feynman, QED: The Strange Theory of Light and Matter (Princeton U.P., Princeton, 1985).
¹¹ Such modern physics courses often include a discussion of the historical development of quantum mechanics that would be less relevant to this approach. Cutting much of this material will help make some room.
¹² Edwin F. Taylor and John Archibald Wheeler, Spacetime Physics (Freeman, New York, 1992), 2nd ed., p. 149ff.
¹³ Thomas A. Moore, A Traveler's Guide to Spacetime (McGraw–Hill, New York, 1995), pp. 86–87. The same argument also appears on pp. 83–84 of Moore's introductory textbook, Six Ideas That Shaped Physics, Unit R: The Laws of Physics are Frame-Independent (McGraw–Hill, New York, 2003), 2nd ed.
¹⁴ The relativity of simultaneity has become a very practical engineering problem for the designers of the global positioning system. Students can see the delay imposed by light travel time when satellite communications are used on television. Experimental general relativity has mushroomed in recent years, and gravitational waves will likely be discovered in the coming decade. Moreover, aspects of relativistic cosmology previously considered esoteric are likely to have a large impact on physics in the next couple of decades.
¹⁵ Examples include Jerry B. Marion and Stephen T. Thornton, Classical Dynamics of Particles and Systems (Saunders, Fort Worth, 1995), 4th ed.; Ralph Baierlein, Newtonian Dynamics (McGraw–Hill, New York, 1983); and Grant R. Fowles, Analytical Mechanics (Saunders, Philadelphia, 1986), 4th ed.
¹⁶ Herbert Goldstein, Charles P. Poole, Jr., and John L. Safko, Classical Mechanics (Addison–Wesley, San Francisco, 2002), 3rd ed. Secs. 13.1 and 13.2 (up to the middle of p. 563) are at a level suitable for sophomores or juniors. One would probably not need to derive the Euler–Lagrange equations the way that they do, but rather state the equations (appealing to analogy) and show that they work for a simple case (as the authors do at the top of p. 563).
¹⁷ Ramamurti Shankar, Principles of Quantum Mechanics (Plenum, New York, 1980), Sec. 8.5, pp. 240–241.
¹⁸ The results for these definite integrals given in standard integral tables assume (usually implicitly) that a is real. However, the same results apply even if a is complex, as long as the real part of a>0. See, for example, Milton Abramowitz and Irene A. Stegun, Handbook of Mathematical Functions (Dover, New York, 1964), p. 302, where no such assumption is made. I am not sure that it is necessary to have students worry about this issue unless they ask.
¹⁹ I regularly teach a junior-level course in general relativity where students are required to master this material. I have found that there are some tricks for teaching index notation at this level that are beyond the scope of this article to discuss in detail, but it helps greatly if students are explicitly taught to recognize the difference between free and summed indices, and if they write out expanded versions of the equations when necessary. Students also should be required to calculate the time derivative of a product involving an implied sum and do other exercises where the correct answer depends on correctly recognizing the implied sums. J. B. Hartle's Gravity (Addison–Wesley, San Francisco, 2003) is better than most general relativity books in teaching the notation (and in presenting the entire subject of relativity to undergraduates).
We can conveniently combine the advantages of Gaussian and SI units by defining B ≡ cB_conv, where B_conv is the conventional magnetic field measured in teslas. The redefined B has units of N/C, just like the electric field (with 300 MN/C corresponding to 1 T). All electromagnetic equations then take the same mathematical form as they would in Gaussian units, except that factors of 4π become 4πk, where k is the Coulomb constant. However, the units for all quantities other than the magnetic field are in SI. This system makes the symmetries between the electric and magnetic fields apparent (and the equations much more beautiful) without having to deal with Gaussian units. This unit system also has the advantage of making it easy to show the connections between electromagnetic field theory and gravitational field theory (where the gravitational constant G is not typically suppressed as is the corresponding Coulomb constant k in Gaussian units).
For example, A can be given a more physical meaning than often is supposed. In a static situation where φ = 0 and a particle moves perpendicular to A, the Euler–Lagrange equations that follow from Eq. (11) imply that the quantity p + (q/c)A is constant in time. Just as the scalar potential φ at a point in space near a static charge distribution is the total work per unit charge that one would have to do on a charged test particle to move it from infinity to that point, the quantity A/c at a point in space near a static (and neutral) current distribution is the total momentum per unit charge that one would have to supply to a charged test particle to keep it moving from infinity to that position along a path that is always perpendicular to A. Therefore, if φ represents potential energy per unit charge, A represents "potential momentum" per unit charge.
For example, the Aharonov–Bohm effect suggests that the magnetic potential is more fundamental than E and B, and is certainly more directly connected to quantum mechanics. See J. J. Sakurai, Modern Quantum Mechanics, edited by San Fu Tuan (Addison–Wesley, Redwood City, CA, 1985), pp. 136–139, or John S. Townsend, A Modern Approach to Quantum Mechanics (McGraw–Hill, New York, 1992), pp. 399–404 for good discussions of this effect. The four-potential also provides significant advantages for calculating electromagnetic fields: indeed, R. L. Coren of Drexel University once told me that computer programs used by electrical engineers almost always calculate the scalar and magnetic potentials instead of calculating E and B directly.
The general argument for the least-action derivation of the field equations comes from L. D. Landau and E. M. Lifshitz, The Classical Theory of Fields (Pergamon, Oxford, 1975), 4th ed., pp. 67–74, and from John David Jackson, Classical Electrodynamics (Wiley, New York, 1999), 3rd ed., Sec. 12.7.
Reference 23, Landau and Lifshitz, p. 68.
With students who are still becoming familiar with the index notation, the easiest way to have them work out the implications of the electromagnetic Lagrangian is for them to write out the implied sums in the two terms (because the metric is diagonal, there are not that many terms to write) and then calculate the Euler–Lagrange equation for a specific field coordinate (say A^x) to see how the calculation goes.
Dare A. Wells, Schaum's Outline of Theory and Problems of Lagrangian Dynamics (McGraw–Hill, New York, 1967). The section on electrical and electromechanical systems is Chap. 15.
Davison E. Soper, Classical Field Theory (Wiley, New York, 1976).
D. A. Van Baak, "Variational alternatives to Kirchhoff's loop theorem," Am. J. Phys. 67, 36–44 (1999).
Charles W. Misner, Kip S. Thorne, and John Archibald Wheeler, Gravitation (Freeman, San Francisco, 1973), Chap. 21.

FIGURES

Fig. 1. We can calculate the wave function amplitude ψ(x,t) at position x at time t by using Eq. (1) to calculate the contribution of the wave function amplitude ψ(x_i,t₀) at a position x_i at an earlier time t₀ and then summing over all x_i. The diagonal lines show the direct paths that connect the various points x_i with the final point x.

FOOTNOTES

^a)Electronic mail: tmoore@pomona.edu

