quantum mechanics - What's the physical meaning of the statement that "photons don't have positions"?

It's been mentioned elsewhere on this site that one cannot define a position operator for the one-photon sector of the quantized electromagnetic field, if one requires the position operator have certain formal properties. This is a theorem that holds only for massless particles of helicity $|\lambda| \geq 1$, in particular it does not apply to massless scalars.

A lot of people, particularly mathematical physicists or older quantum field theory textbooks, seem to interpret this to mean that we should never speak of the position of anything in relativistic quantum field theory. But it still seems possible to say something about where a photon is. For example, if I have an ideal cavity and excite the lowest mode with one photon, I know that the photon is in that cavity. Furthermore, I can localize the photon arbitrarily well using smaller and smaller cavities.

When an optics experiment is done using a laser beam, it is perfectly meaningful to talk about photons being in the beam. We can also speak of a photon being emitted by an atom, in which case it is obviously localized near the atom when the emission occurs. Furthermore, in the usual analysis of the double slit experiment one has, at least implicitly, a wavefunction for the photon, which successfully recovers the high school result.

When one talks about scattering experiments, such as in photon-photon scattering, one has to talk about localized wavepackets in order to describe a real beam. Furthermore, unlike the massive case, where the Compton wavelength provides a characteristic length, there is no characteristic length for photons, suggesting that beams can be made arbitrarily narrow in principle: the complaint that you would start causing pair production below the Compton wavelength doesn't apply.

In other words, while the theorem is airtight, it doesn't seem to impose any practical limitations on things we would actually like to do experimentally. But you can find very strange-sounding descriptions of what this theorem is telling us online. For example, on PhysicsForums you can read many obviously wrong statements (e.g. here and here and here) such as:

The photon has no rest frame. Computing an expectation of position for such an object is nonsense.

One good reason is that photons are massless and move at the speed of light and have no rest frame! Then also they are bosons, so you can't tell which are which.

These are wrong because they also apply to massless scalars, for which there does exist a (Newton-Wigner) position operator. It also just doesn't make sense -- if you can't measure the position of something if you're not in its rest frame, then how can I catch a ball?

In relativistic quantum (field) theory there is no concept of single photons.

You cannot define "position" for an electromagnetic field or of photons, which are certain states of this field (namely single-photon Fock states). Nobody thinking about classical electromagnetic waves would ever come to the idea to ask, what the position of a field might be.

This is wrong because the one-particle sector of a quantum field theory is perfectly well-defined, and it is perfectly valid to define operators acting on it alone.

It can be shown that in the context of relativistic quantum theory the position operator leads to violations of causality.

This is rather vague because quantum field theory is causal, so it's unclear how "the position operator" overturns that.

It could just be that PhysicsForums is an exceptionally low-quality site, but I think the real problem is that interpreting this theorem is actually quite tricky. What nontrivial physical consequences does the nonexistence of a formal photon position operator have?

Answer

We could spend forever playing whac-a-mole with all of the confusing/confused statements that continue popping up on this subject, on PhysicsForums and elsewhere. Instead of doing that, I'll offer a general perspective that, for me at least, has been refreshingly clarifying.

I'll start by reviewing a general no-go result, which applies to all relativistic QFTs, not just to photons. Then I'll explain how the analogous question for electrons would be answered, and finally I'll extend the answer to photons. The reason for doing this in that order will probably be clear in hindsight.

A general no-go result

First, here's a review of the fundamental no-go result for relativistic QFT in flat spacetime:

In QFT, observables are associated with regions of spacetime (or just space, in the Schrödinger picture). This association is part of the definition of any given QFT.

In relativistic QFT, the Reeh-Schlieder theorem implies that an observable localized in a bounded region of spacetime cannot annihilate the vacuum state. Intuitively, this is because the vacuum state is entangled with respect to location.

Particles are defined relative to the vacuum state. By definition, the vacuum state has zero particles, so the Reeh-Schlieder theorem implies that an observable representing the number of particles in a given bounded region of spacetime cannot exist: if an observable is localized in a bounded region of spacetime, then it can't always register zero particles in the vacuum state.

That's the no-go result, and it's very general. It's not restricted to massless particles or to particles of helicity $\geq 1$. For example, it also applies to electrons. The no-go result says that we can't satisfy both requirements: in relativistic QFT, we can't have a detector that is both

perfectly reliable,

localized in a strictly bounded region.

But here's the important question: how close can we get to satisfying both of these requirements?

Warm-up: electrons

First consider the QFT of non-interacting electrons, with Lagrangian $L\sim \overline\psi(i\gamma\partial+m)\psi$. The question is about photons, and I'll get to that, but let's start with electrons because then we can use the electron mass $m$ to define a length scale $\hbar/mc$ to which other quantities can be compared.

To construct observables that count electrons, we can use the creation/annihilation operators. We know from QFT $101$ how to construct creation/annihilation operators from the Dirac field operators $\psi(x)$, and we know that this relationship is non-local (and non-localizable) because of the function $\omega(\vec p) = (\vec p^2+m^2)^{1/2}$ in the integrand, as promised by Reeh-Schlieder.

However, for electrons with sufficiently low momentum, this function might as well be $\omega\approx m$. If we replace $\omega\to m$ in the integrand, then the relationship between the creation/annihilation operators becomes local. Making this replacement changes the model from relativistic to non-relativistic, so the Reeh-Schlieder theorem no longer applies. That's why we can have electron-counting observables that satisfy both of the above requirements in the non-relativistic approximation.

Said another way: Observables associated with mutually spacelike regions are required to commute with each other (the microcausality requirement). The length scale $\hbar/mc$ is the scale over which commutators of our quasi-local detector-observables fall off with increasing spacelike separation. Since the non-zero tails of those commutators fall off exponentially with characteristic length $\hbar/mc$, we won't notice them in experiments that have low energy/low resolution compared to $\hbar/mc$.

Instead of compromising strict localization, we can compromise strict reliability instead: we can construct observables that are localized in a strictly bounded region and that almost annihilate the vacuum state. Such an observable represents a detector that is slightly noisy. The noise is again negligible for low-resolution detectors — that is, for detector-observables whose localization region is much larger than the scale $\hbar/mc$.

This is why non-relativistic few-particle quantum mechanics works — for electrons.

Photons

Now consider the QFT of the elelctromagnetic field by itself, which I'll call QEM. All of the observables in this model can be expressed in terms of the electric and magnetic field operators, and again we know from QFT $101$ how to construct creation/annihilation operators that define what "photon" means in this model: they are the positive/negative frequency parts of the field operators. This relationship is manifestly non-local. We can see this from the explicit expression, but we can also anticipate it more generally: the definition of positive/negative frequency involves the infinite past/future, and thanks to the time-slice principle, this implies access to arbitrarily large spacelike regions.

In QEM, there is no characteristic scale analogous to $\hbar/mc$, because $m=0$. The ideas used above for electrons still work, except that the deviations from localization and/or reliability don't fall off exponentially with any characteristic scale. They fall of like a power of the distance instead.

As far as this question is concerned, that's really the only difference between the electron case and the photon case. That's enough of a difference to prevent us from constructing a model for photons that is analogous to non-relativistic quantum mechanics for electrons, but it's not enough of a difference to prevent photon-detection observables from being both localized and reliable for most practical purposes. The larger we allow its localization region to be, the more reliable (less noisy) a photon detector can be. Our definition of how-good-is-good-enough needs to be based on something else besides QEM itself, because QEM doesn't have any characteristic length-scale of its own. That's not an obstacle to having relatively well-localized photon-observables in practice, because there's more to the real world than QEM.

Position operators

What is a position operator? Nothing that I said above refers to such a thing. Instead, everything I said above was expressed in terms of observables that represent particle detectors (or counters). I did that because the starting point was relativistic QFT, and QFT is expressed in terms of observables that are localized in bounded regions.

Actually, non-relativistic QM can also be expressed that way. Start with the traditional formulation in terms of the position operator $X$. (I'll consider only one dimension for simplicity.) This single operator $X$ is really just a convenient way of packaging-and-labeling a bunch of mutually-commuting projection operators, namely the operators $P(R)$ that project a wavefunction $\Psi(x)$ onto the part with $x\in R$, cutting off the parts with $x\notin R$. In fancy language, the commutative von Neumann algebra generated by $X$ is the same as the commutative von Neumann algebra generated by all of the $P(R)$s, so aside from how things are labeled with "eigenvalues," they both represent the same observable as far as Born's rule is concerned. If we look at how non-relativistic QM is derived from its relativistic roots, we see that the $P(R)$s are localized within the region $R$ by QFT's definition of "localized" — at least insofar as the non-relativistic approximation is valid. In this sense, non-relativistic single-particle QM is, like QFT, expressed in terms of observables associated with bounded regions of space. The traditional formulation of single-particle QM obscures this.

Here's the point: when we talk about a position operator for an electron in a non-relativistic model, we're implicitly talking about the projection operators $P(R)$, which are associated with bounded regions of space. The position operator $X$ is a neat way of packaging all of those projection operators and labeling them with a convenient spatial coordinate, so that we can use concise statistics like means and standard deviations, but you can't have $X$ without also having the projection operators $P(R)$, because the existence of the former implies the existence of the latter (through the spectral theorem or, through the von-Neumann-algebra fanciness that I mentioned above).

So... can a photon have a position operator? If by position operator we mean something like the projection operators $P(R)$, which are both (1) localized in a strictly bounded region and (2) strictly reliable as "detectors" of things in that region, then the answer is no. A photon can't have a position operator for the same reason that a photon can't have a non-relativistic approximation: for a photon, there is no characteristic length scale analogous to $\hbar/mc$ to which the size of a localization region can be compared, without referring to something other than the electromagnetic field itself. What we can do is use the usual photon creation/annihilation operators to construct photon-detecting/counting observables that are not strictly localized in any bounded region but whose "tails" are negligible compared to anything else that we care about (outside of QEM), if the quasi-localization region is large enough.

What is a physical consequence?

What is a physical consequence of the non-existence of a strict position operator? Real localized detectors are necessarily noisy. The more localized they are, the noisier they must be. Reeh-Schlieder guarantees this, both for electrons and for photons, the main difference being that for electrons, the effect decreases exponentially as the size of the localization region is increased. For photons, it decreases only like a power of the size.

Blog

Wednesday, March 9, 2016