Tuesday, June 17, 2014

thermodynamics - Why exactly do we say L=L(q,dotq) and H=H(q,p)?


In classical mechanics, we perform a Legendre transform to switch from L(q,˙q) to H(q,p). This has always been confusing to me, because we can always write L in terms of q and p by just taking the expression for ˙q(q,p) and stuffing it in.


In thermodynamics, we say U is a function of S, V, and N because dU=TdS+pdV+μdN,

which is exceptionally simple. But for the Lagrangian, we instead generally have dL=(horrible expression)dq+(horrible expression)d˙q
In this case, I see no loss in 'naturalness' to switch to q and p, so what's the real difference between considering L(q,˙q) and L(q,p)?



Answer



We should abandon the "naive" langauge of functions depending on coordinates and consider functions as maps between mathematical spaces, which are only expressed in local coordinates after their domains have been defined.


The starting point for both the Lagrangian and the Hamiltonian formalism is a configuration space Q, whose coordinates are called qi. It should be thought of as the space of positions of the system under considerations. The two formalisms now immediately take different paths: Lagrangian mechanics takes place on the tangent bundle TQ, Hamiltonian mechanics on the cotangent bundle TQ. The local coordinates on TQ are denoted (qi,˙qi), the local coordinates on TQ are (qi,pi). Note that, since there is no metric on Q, you do not have a canonical identification of tangents and cotangents and therefore cannot switch between the description freely as one might be used to from Riemannian geometry. Note furthermore that ˙q is not the derivative of anything - it's simply a notation for a new coordinate.


The Lagrangian is a function L:TQR. Given it, we may define a function f:TQTQ in local coordinates by f(q,˙q)=(q,L˙q(q,˙q))

and the associated Hamiltonian H:TQR in local coordinates as the Legendre transform H(q,p)=sup˙q(pi˙qiL(q,˙q)).
It should be clear here that neither H(q,˙q) nor L(q,p) are meaningful objects in this context - H and L act on different spaces, you cannot feed a p into L at all. Observe now that f does permit us to do this in some sense, only rigorously: If f is invertible, one may define a "co-Lagrangian" or "Hamiltonian Lagrangian" LH:TQR by LH(q,p)=L(f1(q,p)). Crucially, L and LH are different functions and should, for clarity's sake, never be denoted by the same symbol.


The expression in the definition of the Legendre transform obtains its extremum at pi=L˙qi(q,˙q),

which means that H(q,p)=pi˙qiL(q,˙q)
holds exactly for a triple (q,˙q,p) such that f(q,˙q)=(q,p).
Note that the fact that H does not depend on ˙q means that ˙q in eq. (0) is implicitly a function ˙q(q,p) as defined implicitly by eq. (1).



Only when we impose the relation eq. (1) there is a functional relation between the q,˙q,p, otherwise there is not. This is why, as abstract functions, the Lagrangian is not a function of p and the Hamiltonian is not a function of ˙q - these are coordinates on different spaces with no relation to each other. It is only when we impose eq. (1) in order to express the Hamiltonian without the extremisation procedure prescribed in the Legendre transform that they become related, and not necessarily uniquely so. If f is not invertible, then the Lagrangian system is a gauge theory and the Hamiltonian system is constrained - both terms which essentially mean that the relation between the p and the ˙q is not uniquely defined.


Finally, let me address a closely related confusion which nevertheless crops up because of the same reason, i.e. not respecting the actual domains functions are defined on. The q,˙q arguments of the Lagrangian are independent, and become dependent only when we consider a path γ:IQ, which induces a path ˜γ:ITQ,t(γ(t),˙γ(t)) on the tangent bundle, where ˙γ now denotes the actual time derivative, i.e. the tangent vector field to γ. The action is a function S:[I,Q]R, where [I,Q] denotes the space of all maps IQ, and is defined as S[γ]=IL(˜γ).

When now considering this action, the physicist often writes the coordinates of ˜γ as (q(t),˙q(t)), and it is only in this context that ˙q(t) truly is a time-dependent function and the derivative of q(t).


No comments:

Post a Comment

classical mechanics - Moment of a force about a given axis (Torque) - Scalar or vectorial?

I am studying Statics and saw that: The moment of a force about a given axis (or Torque) is defined by the equation: $M_X = (\vec r \times \...