10 Commutators and Uncertainty Principle

Discussing two effects of non-commuting operators: on the measurement process, and on the limitation of the wave function description of particles.

This lecture is motivated by the question:

To answer this question, we introduce the commutator [A,B] of two Hermitian operators and explore its physical interpretation. We will prove a generalisation of Heisenberg’s uncertainty principle, which is a fundamental limitation on the precision that observables A and B can be determined simultaneously.

10.1 The Commutator

Two commuting operators share an (orthonormal) basis of eigenfunctions. This is extremely important, because eigenfunctions and eigenvalues are related to possible measurement outcomes.

Suppose we have two linear operators A and B, such as position x, momentum p or the Hamiltonian H. The commutator is defined by (10.1)[A,B]:=ABBA. It has the following properties:

The commutator plays an important role in quantum mechanics due to the following theorem:

Theorem: “Two commuting matrices are simultaneously diagonalizable." If A and B are Hermitian operators with [A,B]=0, it is possible to find an orthonormal basis of wave functions that are simultaneous eigenfunctions of A and B.

Proof: To keep things simple, we will assume the spectrum of eigenvalues {aj} of A is discrete and non-degenerate. This means that up to normalisation there is a unique solution to Aϕj(x)=ajϕj(x) for each eigenvalue aj. The normalisation can be chosen to make the basis orthonormal, ϕi,ϕj=δij.

We want to prove that ϕj(x) is simultaneously an eigenfunction of B. The commutator [A,B]=0 is equivalent to AB=BA. This means (10.2)A(Bϕj)=B(Aϕj)=B(ajϕj)=aj(Bϕj) so Bϕj is also an eigenfunction A with eigenvalue aj. But such wave function is unique up to normalisation and therefore (10.3)Bϕj(x)=bjϕj(x) for some bjR. So ϕj(x) is simultaneously an eigenfunction of B.

10.2 Compatibility

Quantum operators which originate from classical observables which have a vanishing Poisson bracket are called ‘‘compatible''.

This theorem motivates the following definition.

Let us first determine whether the observables we have encountered so far are compatible. Recall that the position and momentum operators are (10.4)x^=xp^=ix while (10.5)H^=p^22m+V(x)=22m2x2+V(x).

  1. Position and Momentum. Let us compute the commutator of position and momentum acting on a generic wave function, (10.6)[x^,p^]ψ=x(iψx)+ix(xψ)=iψ. This is the “canonical commutation relation" (10.7)[x^,p^]=i, which shows position and momentum are not compatible.

  2. Momentum and Energy. A similar computation shows that (10.8)[H,p]=[V(x),p]=iV(x)x. So momentum p and energy H are compatible only if V(x) is constant. We will return to this result later in the course.

  3. Position and Energy. From the commutator of position and momentum, (10.9)[H,x]=12m[p2,x]=12m(p[p,x]+[p,x]p)=imp, so position and energy are never compatible.

10.3 Compatibility and Measurement

To understand the physical significance of compatibility, suppose we measure A and find the eigenvalue aj. Then the wave function immediately after the measurement is the eigenfunction ϕj(x).

If A and B do not commute, and we measure A, immediately after that, the uncertainty of B has to be nonzero. This is a statement about subsequent measurements.

It is important to emphasise that the subsequent measurements discussed in this section are measurements without time between them. A generic eigenstate of an operator will not stay an eigenstate under time evolution, so a measurement of A, followed by a delay and then another measurement of A, will typically still lead to a non-zero spread. Only eigenstates of the Hamiltonian operator, or operators which commute with it, remain eigenstates under time evolution.

10.4 The Generalised Uncertainty Principle

If [A,B]0, we cannot necessarily find simultaneous eigenfunctions of A and B with both ΔA=0 and ΔB=0. In fact, there is a fundamental limitation in quantum mechanics on the how small we can simultaneously make the uncertainties ΔA and ΔB. This is quantified by the “Generalised Uncertainty Principle”:

The generalised uncertainty principle relates the product of uncertainties of two operators to their commutator. It says something about the spread of two observables in the same state, not about subsequent measurements.

Theorem: For any square-normalisable wave function, (10.11)ΔAΔB12|[A,B]|.

Proof: We will assume here that A=B=0 for the wave function in question. This will simplify the argument without losing any of its essence. The translation to A0, B0 as an exercise for the interested reader.

With our assumption, the uncertainty in A can be expressed, (10.12)(ΔA)2=A2=ψ,A2ψ=Aψ,Aψ=ψA,ψA, where ψA:=Aψ. There is an identical statement for B and therefore we can write (10.13)(ΔA)2(ΔB)2=ψA,ψAψB,ψB. We can now use the Cauchy-Schwarz inequality, (10.14)ψA,ψAψB,ψB|ψA,ψB|2. This result holds for any Hermitian inner product. It is analogous to the standard result |a|2|b|2|ab|2 from real euclidean geometry, which follows from the formula ab=|a||b|cos(θ) for the dot product.

The right-hand side of this inequality can be expressed as (10.15)ψA,ψB=AB=12(ABBA)+12(AB+BA)=12[A,B]+12{A,B} where [A,B]:=ABBA is the commutator and {A,B}:=AB+BA is the “anti-commutator". It is straightforward to check that,

  1. [A,B] is anti-Hermitian [A,B]iR.

  2. {A,B} is Hermitian {A,B}R.

so the commutator and anti-commutator provide the imaginary and real parts of ψA,ψB. Recalling the formula |z|2=x2+y2 for the modulus squared of a complex number z=x+iy, we have (10.16)|ψA,ψB|2=14|[A,B]|2+14|{A,B}|214|[A,B]|2. This concludes the proof.

  1. Position and Momentum. For position and momentum, (10.17)ΔxΔp2, which is Heisenberg’s uncertainty principle.

  2. Momentum and Energy. For momentum and energy, (10.18)ΔpΔH2V(x), which vanishes automatically when V(x) is constant.

  3. Position and Energy. For position and energy, (10.19)ΔxΔH2mp. This implies that square-normalisable Hamiltonian eigenfunctions must have p=0. In fact, square-normalisable Hamiltonian eigenfunctions may always be chosen real, compatible with this statement.

Note that what is derived here is a statement which is different from the one in the previous section: the generalised uncertainty principle as derived above says nothing about subsequent measurements (see also Jacques Distler and Sonia Paban. Uncertainties in successive measurements. Physical Review A, Jun 2013. URL: http://dx.doi.org/10.1103/PhysRevA.87.062112, doi:10.1103/physreva.87.062112.).

The uncertainty relation always contains on the right hand side, and is thus fundamentally ‘‘quantum''.

It should be emphasised that this is a fundamental feature of quantum mechanics. Only in the classical limit, 0, can we simultaneously determine exactly the values of non-compatible observables such as position and momentum.

10.5 Problems

  1. Generalised uncertainty principle:

    Derive the generalised uncertainty principle, (10.20)(ΔA)2(ΔB)2(12i[A^,B^])2, where (ΔA^)2=A^2A^2 and similar for B^.

  2. Energy-position uncertainty relation:

    Show that measurements of position and measurements of energy of a particle in one dimension satisfy the uncertainty relation (10.21)ΔxΔE2m|p|.

  3. TUTORIAL 3
    Time-energy uncertainty relation (sort of) For an operator Q^ which does not depend on time explicitly, we have (10.22)ddtQ^=i[H^,Q^]. For small Δt and small standard-deviation ΔQ we can write (10.23)ΔQ|dQ^dt|Δt. What is the meaning of Δt here? Use (10.23) to derive the “time-energy uncertainty relation” ΔHΔt/2. What does this ‘uncertainty relation’ express?

    Solution

    1. Writing (10.24)a(x):=(A^ψA^ψ)ψ(x) and ditto for B^ we can write (10.25)(ΔA)2(ΔB)2=a,ab,b|a,b|2. Setting z=a,b we then have (10.26)|z|2Im(z)2=(12i(zz))2, For the particular z we have (10.27)a,b=A^B^A^B^ and similar for the conjugate. Using that for the difference zz then gives the requested result.

    2. This requires computing (10.28)=12m[x^,p^2]=imp^. and thus (10.29)(Δx)2(ΔE)2(2mp^)2, from which the relation follows.

    3. Insertion of the Schrödinger equation into the uncertainty relation gives (10.30)ΔHΔQ2|dQ^dt|. Then inserting the definition of Δt and rearranging factors produces the Et uncertainty relation.

      The symbol Δt is the time it takes the expectation value of the (arbitrary) operator Q^ to change by one standard deviation.

      The uncertainty relation expresses the fact that if all observables change rapidly (Δt small), then the uncertainty in the energy must be large, and if all observables change slowly, the uncertainty in the energy is small.

      See Griffiths for more detail and discussion.