This question is about a mathematical object (the tensor product) but thinking about the motivation that comes from Physics. Algebraists motivate the tensor product like that: "given k vector spaces V1,…,Vk over the same field K we want to find a new space S and a universal multilinear map T such that for every vector space W and multilinear mapping g:V1×⋯×Vk→W we have a linear map f:S→W such that g=f∘T".
Then, they prove this thing exists by constructing it. They take the free vector space M=F(V1×⋯×Vk) and consider the subspace M0 spanned by all elements of the form
(v1,…,v′i+av″i,…,vk)−(v1,…,v′i,…,vk)−a(v1,…,v″i,…,vk),
and define S=M/M0 denoting S=V1⊗⋯⊗Vk and define T(v1,…,vk)=(v1,…,vk)+M0 and denote this by T(v1,…,vk)=v1⊗⋯⊗vk.
That's fine, but tensors appear a lot in Physics. In General Relativity, in Electrodynamics, in Classical Mechanics, in Quantum Mechanics, etc. So, if someone asked me: "what's the motivation for that definition of tensor prodct" and I wished to motivate it through Physics, what should be the motivation?
How would I convince myself that the tensor product as defined like that is useful in Physics?
I know that one can defined tensors as multilinear maps, and that is far mor intuitive, however I'm interest to see how one would motivate this definition.
Answer
It is essentially impossible to answer the general question of "how does multilinearity come up naturally in physics?" because of the myriad of possible examples that make up the total answer. Instead, let me describe a situation that very loudly cries out for the use of tensor products of two vectors.
Consider the problem of conservation of momentum for a continuous distribution of electric charge and current, which interacts with an electromagnetic field, under the action of no other external force. I will describe it more or less along the lines of Jackson (Classical Electrodynamics, 3rd edition, §6.7) but depart from it towards the end. This will get very electromagneticky for a while, so if you want to skip to the tensors, you can go straight to equation (1).
The rate of change of the total mechanical momentum of the system is the total Lorentz force, given by dPmechdt=∫V(ρE+J×B)dx.
One can then put these expressions back, to a nice vector calculus work-out, and come up with the following relation: dPmechdt+ddt∫Vϵ0E×Bdx=ϵ0∫V[E(∇⋅E)−E×(∇×E)+c2B(∇⋅B)−c2B×(∇×B)]dx.
The integral on the left-hand side can be identified as the total electromagnetic momentum, and differs from the integral of the Poynting vector by a factor of 1/c2. To get this in the proper form for a conservation law, though, such as the one for energy in this setting, dEmechdt+ddtϵ02∫V(E2+c2B2)dx=−∮SS⋅da,
The way to do this, is, of course, the divergence theorem. However, that theorem is for scalars, and what we have so far is a vector equation. To work further then, we need to (at least temporarily) work in some specific basis {e1,e2,e3}, and write E=∑iEiei. Let's work with the electric field term first; after that the results also apply to the magnetic term. Thus, to start with, ∫V[E(∇⋅E)−E×(∇×E)]dx=∑iei∫V[Ei(∇⋅E)−ei⋅(E×(∇×E))]dx.
In terms of conservation law structure, we're essentially done, as we've reduced the rate of change of momentum to a surface term. However, it is crying out for some simplification. In particular, this expression is basis-dependent, but it is so close to being basis independent that it's worth a closer look.
The first term, for instance, is simply crying out for a simplification that would look something like ∑iei∮SEiE⋅da=∮SEE⋅da
So what, then, should this new vector multiplication be? One key to realizing what we really need is noticing the fact that we haven't yet assigned any real physical meaning to the combination EE; instead, we're only ever interacting with it by dotting "one of the vectors of the product" with the surface area element da, and that leaves a vector EE⋅da which we can integrate to get a vector, and that requires no new structure.
Let's then write a list of how we want this new product to behave. To keep things clear, let's give it some fancy new symbol like ⊗, mostly to avoid unseemly combinations like uv. We want then,
- a function ⊗:V×V→W, which takes euclidean vectors in V=R3 into some vector space W in which we'll keep our fancy new objects.
- Combinations of the form u⊗v should be linear in both u and v.
- For all vectors w in V, and all combinations (u,v)∈V×V, we want the combination (u⊗v)⋅w to be a vector in V. Even more, we want that to be the vector (v⋅w)u∈V.
That last one looks actually pretty strong, but there's evidently room for improvement. For one, it depends on the euclidean structure, which is not actually necessary: we can make an equivalent statement that uses the vector space's dual.
- For all (u,v)∈V×V and all f∈V∗, we want f→(u⊗v)=f(v)u∈V to hold, where f→ simply means that f acts on the factor on the right.
Finally, if we're doing stuff with the dual, we can reformulate that in a slightly prettier way. Since two vectors u,v∈V are equal if and only if f(u)=f(v) for all f∈V∗, we can give another equivalent statement of the same statement:
- For all (u,v)∈V×V and all f,g∈V∗, we want g←f→(u⊗v)=g(u)f(v)∈V.
[Note, here, that this last rephrasing isn't really that fancy. Essentially, it is saying that the vector equation (1) is really to be interpreted as a component-by-component equality, and that's not really off the mark of how we actually do things.]
I could keep going, but it's clear that this requirement can be rephrased into the universal property of the tensor product, and that rephrasing is a job for the mathematicians. Thus, you can see the story like this: Upon hitting equation (1), we give to the mathematicians this list of requirements. They go off, think for a bit, and come back telling us that such a structure does exist (i.e. there exist rigorous constructions that obey those requirements) and that it is essentially unique, in the sense that multiple such constructions are possible, but they are canonically isomorphic. For a physicist, what that means is that it's OK to write down objects like u⊗v as long as one does keep within the rules of the game.
As far as electromagnetism goes, this means that we can write our conservation law in the form dPmechdt+ddt∫Vϵ0E×Bdx=∮AT⋅da
I could go on and on about this, but I think this really captures the essence of how and where it happens in physics that a situation is really begging the use of a tensor product. There are other such situations, of course, but this is the clearest one I know.
No comments:
Post a Comment