# Uncertainty principle

In quantum mechanics, the uncertainty principle (also known as Heisenberg's uncertainty principle) is any of a variety of mathematical inequalities asserting a fundamental limit to the accuracy with which the values for certain pairs of physical quantities of a particle, such as position, x, and momentum, p, can be predicted from initial conditions.

Such variable pairs are known as complementary variables or canonically conjugate variables; and, depending on interpretation, the uncertainty principle limits to what extent such conjugate properties maintain their approximate meaning, as the mathematical framework of quantum physics does not support the notion of simultaneously well-defined conjugate properties expressed by a single value. The uncertainty principle implies that it is in general not possible to predict the value of a quantity with arbitrary certainty, even if all initial conditions are specified.

Introduced first in 1927 by the German physicist Werner Heisenberg, the uncertainty principle states that the more precisely the position of some particle is determined, the less precisely its momentum can be predicted from initial conditions, and vice versa. In the published 1927 paper, Heisenberg concludes that the uncertainty principle was originally $\Delta$ p$\Delta$ q ~ h using the full Planck constant. The formal inequality relating the standard deviation of position σx and the standard deviation of momentum σp was derived by Earle Hesse Kennard later that year and by Hermann Weyl in 1928:

$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{2}}~~$ where ħ is the reduced Planck constant, h/(2π).

Historically, the uncertainty principle has been confused with a related effect in physics, called the observer effect, which notes that measurements of certain systems cannot be made without affecting the system, that is, without changing something in a system. Heisenberg utilized such an observer effect at the quantum level (see below) as a physical "explanation" of quantum uncertainty. It has since become clearer, however, that the uncertainty principle is inherent in the properties of all wave-like systems, and that it arises in quantum mechanics simply due to the matter wave nature of all quantum objects. Thus, the uncertainty principle actually states a fundamental property of quantum systems and is not a statement about the observational success of current technology. It must be emphasized that measurement does not mean only a process in which a physicist-observer takes part, but rather any interaction between classical and quantum objects regardless of any observer.[note 1] [note 2]

Since the uncertainty principle is such a basic result in quantum mechanics, typical experiments in quantum mechanics routinely observe aspects of it. Certain experiments, however, may deliberately test a particular form of the uncertainty principle as part of their main research program. These include, for example, tests of number–phase uncertainty relations in superconducting or quantum optics systems. Applications dependent on the uncertainty principle for their operation include extremely low-noise technology such as that required in gravitational wave interferometers.

## Introduction

The superposition of several plane waves to form a wave packet. This wave packet becomes increasingly localized with the addition of many waves. The Fourier transform is a mathematical operation that separates a wave packet into its individual plane waves. The waves shown here are real for illustrative purposes only, whereas in quantum mechanics the wave function is generally complex.

It is vital to illustrate how the principle applies to relatively intelligible physical situations since it is indiscernible on the macroscopic  scales that humans experience. Two alternative frameworks for quantum physics offer different explanations for the uncertainty principle. The wave mechanics picture of the uncertainty principle is more visually intuitive, but the more abstract matrix mechanics picture formulates it in a way that generalizes more easily.

Mathematically, in wave mechanics, the uncertainty relation between position and momentum arises because the expressions of the wavefunction in the two corresponding orthonormal bases in Hilbert space are Fourier transforms of one another (i.e., position and momentum are conjugate variables). A nonzero function and its Fourier transform cannot both be sharply localized at the same time. A similar tradeoff between the variances of Fourier conjugates arises in all systems underlain by Fourier analysis, for example in sound waves: A pure tone is a sharp spike at a single frequency, while its Fourier transform gives the shape of the sound wave in the time domain, which is a completely delocalized sine wave. In quantum mechanics, the two key points are that the position of the particle takes the form of a matter wave, and momentum is its Fourier conjugate, assured by the de Broglie relation p = ħk, where k is the wavenumber.

In matrix mechanics, the mathematical formulation of quantum mechanics, any pair of non-commuting self-adjoint operators representing observables are subject to similar uncertainty limits. An eigenstate of an observable represents the state of the wavefunction for a certain measurement value (the eigenvalue). For example, if a measurement of an observable A is performed, then the system is in a particular eigenstate Ψ of that observable. However, the particular eigenstate of the observable A need not be an eigenstate of another observable B: If so, then it does not have a unique associated measurement for it, as the system is not in an eigenstate of that observable.

### Wave mechanics interpretation

(Ref )

Propagation of de Broglie waves in 1d—real part of the complex amplitude is blue, imaginary part is green. The probability (shown as the colour opacity) of finding the particle at a given point x is spread out like a waveform, there is no definite position of the particle. As the amplitude increases above zero the curvature reverses sign, so the amplitude begins to decrease again, and vice versa—the result is an alternating amplitude: a wave.

According to the de Broglie hypothesis, every object in the universe is a wave, i.e., a situation which gives rise to this phenomenon. The position of the particle is described by a wave function $\Psi (x,t)$ . The time-independent wave function of a single-moded plane wave of wavenumber k0 or momentum p0 is

$\psi (x)\propto e^{ik_{0}x}=e^{ip_{0}x/\hbar }~.$

The Born rule states that this should be interpreted as a probability density amplitude function in the sense that the probability of finding the particle between a and b is

$\operatorname {P} [a\leq X\leq b]=\int _{a}^{b}|\psi (x)|^{2}\,\mathrm {d} x~.$

In the case of the single-moded plane wave, $|\psi (x)|^{2}$  is a uniform distribution. In other words, the particle position is extremely uncertain in the sense that it could be essentially anywhere along the wave packet.

On the other hand, consider a wave function that is a sum of many waves, which we may write as

$\psi (x)\propto \sum _{n}A_{n}e^{ip_{n}x/\hbar }~,$

where An represents the relative contribution of the mode pn to the overall total. The figures to the right show how with the addition of many plane waves, the wave packet can become more localized. We may take this a step further to the continuum limit, where the wave function is an integral over all possible modes
$\psi (x)={\frac {1}{\sqrt {2\pi \hbar }}}\int _{-\infty }^{\infty }\varphi (p)\cdot e^{ipx/\hbar }\,dp~,$

with $\varphi (p)$  representing the amplitude of these modes and is called the wave function in momentum space. In mathematical terms, we say that $\varphi (p)$  is the Fourier transform of $\psi (x)$  and that x and p are conjugate variables. Adding together all of these plane waves comes at a cost, namely the momentum has become less precise, having become a mixture of waves of many different momenta.

One way to quantify the precision of the position and momentum is the standard deviation σ. Since $|\psi (x)|^{2}$  is a probability density function for position, we calculate its standard deviation.

The precision of the position is improved, i.e. reduced σx, by using many plane waves, thereby weakening the precision of the momentum, i.e. increased σp. Another way of stating this is that σx and σp have an inverse relationship or are at least bounded from below. This is the uncertainty principle, the exact limit of which is the Kennard bound. Click the show button below to see a semi-formal derivation of the Kennard inequality using wave mechanics.

Proof of the Kennard inequality using wave mechanics —

We are interested in the variances of position and momentum, defined as

$\sigma _{x}^{2}=\int _{-\infty }^{\infty }x^{2}\cdot |\psi (x)|^{2}\,dx-\left(\int _{-\infty }^{\infty }x\cdot |\psi (x)|^{2}\,dx\right)^{2}$

$\sigma _{p}^{2}=\int _{-\infty }^{\infty }p^{2}\cdot |\varphi (p)|^{2}\,dp-\left(\int _{-\infty }^{\infty }p\cdot |\varphi (p)|^{2}\,dp\right)^{2}~.$

Without loss of generality, we will assume that the means vanish, which just amounts to a shift of the origin of our coordinates. (A more general proof that does not make this assumption is given below.) This gives us the simpler form

$\sigma _{x}^{2}=\int _{-\infty }^{\infty }x^{2}\cdot |\psi (x)|^{2}\,dx$

$\sigma _{p}^{2}=\int _{-\infty }^{\infty }p^{2}\cdot |\varphi (p)|^{2}\,dp~.$

The function $f(x)=x\cdot \psi (x)$  can be interpreted as a vector in a function space. We can define an inner product for a pair of functions u(x) and v(x) in this vector space:

$\langle u\mid v\rangle =\int _{-\infty }^{\infty }u^{*}(x)\cdot v(x)\,dx,$

where the asterisk denotes the complex conjugate.

With this inner product defined, we note that the variance for position can be written as

$\sigma _{x}^{2}=\int _{-\infty }^{\infty }|f(x)|^{2}\,dx=\langle f\mid f\rangle ~.$

We can repeat this for momentum by interpreting the function ${\tilde {g}}(p)=p\cdot \varphi (p)$  as a vector, but we can also take advantage of the fact that $\psi (x)$  and $\varphi (p)$  are Fourier transforms of each other. We evaluate the inverse Fourier transform through integration by parts:

{\begin{aligned}g(x)&={\frac {1}{\sqrt {2\pi \hbar }}}\cdot \int _{-\infty }^{\infty }{\tilde {g}}(p)\cdot e^{ipx/\hbar }\,dp\\&={\frac {1}{\sqrt {2\pi \hbar }}}\int _{-\infty }^{\infty }p\cdot \varphi (p)\cdot e^{ipx/\hbar }\,dp\\&={\frac {1}{2\pi \hbar }}\int _{-\infty }^{\infty }\left[p\cdot \int _{-\infty }^{\infty }\psi (\chi )e^{-ip\chi /\hbar }\,d\chi \right]\cdot e^{ipx/\hbar }\,dp\\&={\frac {i}{2\pi }}\int _{-\infty }^{\infty }\left[{\cancel {\left.\psi (\chi )e^{-ip\chi /\hbar }\right|_{-\infty }^{\infty }}}-\int _{-\infty }^{\infty }{\frac {d\psi (\chi )}{d\chi }}e^{-ip\chi /\hbar }\,d\chi \right]\cdot e^{ipx/\hbar }\,dp\\&={\frac {-i}{2\pi }}\int _{-\infty }^{\infty }\int _{-\infty }^{\infty }{\frac {d\psi (\chi )}{d\chi }}e^{-ip\chi /\hbar }\,d\chi \,e^{ipx/\hbar }\,dp\\&=\left(-i\hbar {\frac {d}{dx}}\right)\cdot \psi (x),\end{aligned}}

where the canceled term vanishes because the wave function vanishes at infinity. Often the term ${\textstyle -i\hbar {\frac {d}{dx}}}$  is called the momentum operator in position space. Applying Parseval's theorem, we see that the variance for momentum can be written as
$\sigma _{p}^{2}=\int _{-\infty }^{\infty }|{\tilde {g}}(p)|^{2}\,dp=\int _{-\infty }^{\infty }|g(x)|^{2}\,dx=\langle g\mid g\rangle .$

The Cauchy–Schwarz inequality asserts that

$\sigma _{x}^{2}\sigma _{p}^{2}=\langle f\mid f\rangle \cdot \langle g\mid g\rangle \geq |\langle f\mid g\rangle |^{2}~.$

The modulus squared of any complex number z can be expressed as

$|z|^{2}={\Big (}{\text{Re}}(z){\Big )}^{2}+{\Big (}{\text{Im}}(z){\Big )}^{2}\geq {\Big (}{\text{Im}}(z){\Big )}^{2}=\left({\frac {z-z^{\ast }}{2i}}\right)^{2}.$

we let $z=\langle f|g\rangle$  and $z^{*}=\langle g\mid f\rangle$  and substitute these into the equation above to get
$|\langle f\mid g\rangle |^{2}\geq \left({\frac {\langle f\mid g\rangle -\langle g\mid f\rangle }{2i}}\right)^{2}~.$

All that remains is to evaluate these inner products.

{\begin{aligned}\langle f\mid g\rangle -\langle g\mid f\rangle ={}&\int _{-\infty }^{\infty }\psi ^{*}(x)\,x\cdot \left(-i\hbar {\frac {d}{dx}}\right)\,\psi (x)\,dx\\&{}-\int _{-\infty }^{\infty }\psi ^{*}(x)\,\left(-i\hbar {\frac {d}{dx}}\right)\cdot x\,\psi (x)\,dx\\={}&i\hbar \cdot \int _{-\infty }^{\infty }\psi ^{*}(x)\left[\left(-x\cdot {\frac {d\psi (x)}{dx}}\right)+{\frac {d(x\psi (x))}{dx}}\right]\,dx\\={}&i\hbar \cdot \int _{-\infty }^{\infty }\psi ^{*}(x)\left[\left(-x\cdot {\frac {d\psi (x)}{dx}}\right)+\psi (x)+\left(x\cdot {\frac {d\psi (x)}{dx}}\right)\right]\,dx\\={}&i\hbar \cdot \int _{-\infty }^{\infty }\psi ^{*}(x)\psi (x)\,dx\\={}&i\hbar \cdot \int _{-\infty }^{\infty }|\psi (x)|^{2}\,dx\\={}&i\hbar \end{aligned}}

Plugging this into the above inequalities, we get

$\sigma _{x}^{2}\sigma _{p}^{2}\geq |\langle f\mid g\rangle |^{2}\geq \left({\frac {\langle f\mid g\rangle -\langle g\mid f\rangle }{2i}}\right)^{2}=\left({\frac {i\hbar }{2i}}\right)^{2}={\frac {\hbar ^{2}}{4}}$

or taking the square root
$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{2}}~.$

Note that the only physics involved in this proof was that $\psi (x)$  and $\varphi (p)$  are wave functions for position and momentum, which are Fourier transforms of each other. A similar result would hold for any pair of conjugate variables.

### Matrix mechanics interpretation

(Ref )

In matrix mechanics, observables such as position and momentum are represented by self-adjoint operators. When considering pairs of observables, an important quantity is the commutator. For a pair of operators Â and , one defines their commutator as

$[{\hat {A}},{\hat {B}}]={\hat {A}}{\hat {B}}-{\hat {B}}{\hat {A}}.$

In the case of position and momentum, the commutator is the canonical commutation relation
$[{\hat {x}},{\hat {p}}]=i\hbar .$

The physical meaning of the non-commutativity can be understood by considering the effect of the commutator on position and momentum eigenstates. Let $|\psi \rangle$  be a right eigenstate of position with a constant eigenvalue x0. By definition, this means that ${\hat {x}}|\psi \rangle =x_{0}|\psi \rangle .$  Applying the commutator to $|\psi \rangle$  yields

$[{\hat {x}},{\hat {p}}]|\psi \rangle =({\hat {x}}{\hat {p}}-{\hat {p}}{\hat {x}})|\psi \rangle =({\hat {x}}-x_{0}{\hat {I}}){\hat {p}}\,|\psi \rangle =i\hbar |\psi \rangle ,$

where Î is the identity operator.

Suppose, for the sake of proof by contradiction, that $|\psi \rangle$  is also a right eigenstate of momentum, with constant eigenvalue p0. If this were true, then one could write

$({\hat {x}}-x_{0}{\hat {I}}){\hat {p}}\,|\psi \rangle =({\hat {x}}-x_{0}{\hat {I}})p_{0}\,|\psi \rangle =(x_{0}{\hat {I}}-x_{0}{\hat {I}})p_{0}\,|\psi \rangle =0.$

On the other hand, the above canonical commutation relation requires that
$[{\hat {x}},{\hat {p}}]|\psi \rangle =i\hbar |\psi \rangle \neq 0.$

This implies that no quantum state can simultaneously be both a position and a momentum eigenstate.

When a state is measured, it is projected onto an eigenstate in the basis of the relevant observable. For example, if a particle's position is measured, then the state amounts to a position eigenstate. This means that the state is not a momentum eigenstate, however, but rather it can be represented as a sum of multiple momentum basis eigenstates. In other words, the momentum must be less precise. This precision may be quantified by the standard deviations,

$\sigma _{x}={\sqrt {\langle {\hat {x}}^{2}\rangle -\langle {\hat {x}}\rangle ^{2}}}$

$\sigma _{p}={\sqrt {\langle {\hat {p}}^{2}\rangle -\langle {\hat {p}}\rangle ^{2}}}.$

As in the wave mechanics interpretation above, one sees a tradeoff between the respective precisions of the two, quantified by the uncertainty principle.

## Heisenberg limit

In quantum metrology, and especially interferometry, the Heisenberg limit is the optimal rate at which the accuracy of a measurement can scale with the energy used in the measurement. Typically, this is the measurement of a phase (applied to one arm of a beam-splitter) and the energy is given by the number of photons used in an interferometer. Although some claim to have broken the Heisenberg limit, this reflects disagreement on the definition of the scaling resource. Suitably defined, the Heisenberg limit is a consequence of the basic principles of quantum mechanics and cannot be beaten, although the weak Heisenberg limit can be beaten.

## Robertson–Schrödinger uncertainty relations

The most common general form of the uncertainty principle is the Robertson uncertainty relation.

For an arbitrary Hermitian operator ${\hat {\mathcal {O}}}$  we can associate a standard deviation

$\sigma _{\mathcal {O}}={\sqrt {\langle {\hat {\mathcal {O}}}^{2}\rangle -\langle {\hat {\mathcal {O}}}\rangle ^{2}}},$

where the brackets $\langle {\mathcal {O}}\rangle$  indicate an expectation value. For a pair of operators ${\hat {A}}$  and ${\hat {B}}$ , we may define their commutator as
$[{\hat {A}},{\hat {B}}]={\hat {A}}{\hat {B}}-{\hat {B}}{\hat {A}},$

In this notation, the Robertson uncertainty relation is given by

$\sigma _{A}\sigma _{B}\geq \left|{\frac {1}{2i}}\langle [{\hat {A}},{\hat {B}}]\rangle \right|={\frac {1}{2}}\left|\langle [{\hat {A}},{\hat {B}}]\rangle \right|,$

The Robertson uncertainty relation immediately follows from a slightly stronger inequality, the Schrödinger uncertainty relation,

$\sigma _{A}^{2}\sigma _{B}^{2}\geq \left|{\frac {1}{2}}\langle \{{\hat {A}},{\hat {B}}\}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle \right|^{2}+\left|{\frac {1}{2i}}\langle [{\hat {A}},{\hat {B}}]\rangle \right|^{2},$

where we have introduced the anticommutator,

$\{{\hat {A}},{\hat {B}}\}={\hat {A}}{\hat {B}}+{\hat {B}}{\hat {A}}.$

Proof of the Schrödinger uncertainty relation —

The derivation shown here incorporates and builds off of those shown in Robertson, Schrödinger and standard textbooks such as Griffiths. For any Hermitian operator ${\hat {A}}$ , based upon the definition of variance, we have

$\sigma _{A}^{2}=\langle ({\hat {A}}-\langle {\hat {A}}\rangle )\Psi |({\hat {A}}-\langle {\hat {A}}\rangle )\Psi \rangle .$

we let $|f\rangle =|({\hat {A}}-\langle {\hat {A}}\rangle )\Psi \rangle$  and thus
$\sigma _{A}^{2}=\langle f\mid f\rangle \,.$

Similarly, for any other Hermitian operator ${\hat {B}}$  in the same state

$\sigma _{B}^{2}=\langle ({\hat {B}}-\langle {\hat {B}}\rangle )\Psi |({\hat {B}}-\langle {\hat {B}}\rangle )\Psi \rangle =\langle g\mid g\rangle$

for $|g\rangle =|({\hat {B}}-\langle {\hat {B}}\rangle )\Psi \rangle .$

The product of the two deviations can thus be expressed as

$\sigma _{A}^{2}\sigma _{B}^{2}=\langle f\mid f\rangle \langle g\mid g\rangle .$

(1)

In order to relate the two vectors $|f\rangle$  and $|g\rangle$ , we use the Cauchy–Schwarz inequality which is defined as

$\langle f\mid f\rangle \langle g\mid g\rangle \geq |\langle f\mid g\rangle |^{2},$

and thus Equation (1) can be written as
$\sigma _{A}^{2}\sigma _{B}^{2}\geq |\langle f\mid g\rangle |^{2}.$

(2)

Since $\langle f\mid g\rangle$  is in general a complex number, we use the fact that the modulus squared of any complex number $z$  is defined as $|z|^{2}=zz^{*}$ , where $z^{*}$  is the complex conjugate of $z$ . The modulus squared can also be expressed as

$|z|^{2}={\Big (}\operatorname {Re} (z){\Big )}^{2}+{\Big (}\operatorname {Im} (z){\Big )}^{2}={\Big (}{\frac {z+z^{\ast }}{2}}{\Big )}^{2}+{\Big (}{\frac {z-z^{\ast }}{2i}}{\Big )}^{2}.$

(3)

we let $z=\langle f\mid g\rangle$  and $z^{*}=\langle g\mid f\rangle$  and substitute these into the equation above to get

$|\langle f\mid g\rangle |^{2}={\bigg (}{\frac {\langle f\mid g\rangle +\langle g\mid f\rangle }{2}}{\bigg )}^{2}+{\bigg (}{\frac {\langle f\mid g\rangle -\langle g\mid f\rangle }{2i}}{\bigg )}^{2}$

(4)

The inner product $\langle f\mid g\rangle$  is written out explicitly as

$\langle f\mid g\rangle =\langle ({\hat {A}}-\langle {\hat {A}}\rangle )\Psi |({\hat {B}}-\langle {\hat {B}}\rangle )\Psi \rangle ,$

and using the fact that ${\hat {A}}$  and ${\hat {B}}$  are Hermitian operators, we find
{\begin{aligned}\langle f\mid g\rangle &=\langle \Psi |({\hat {A}}-\langle {\hat {A}}\rangle )({\hat {B}}-\langle {\hat {B}}\rangle )\Psi \rangle \\[4pt]&=\langle \Psi \mid ({\hat {A}}{\hat {B}}-{\hat {A}}\langle {\hat {B}}\rangle -{\hat {B}}\langle {\hat {A}}\rangle +\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle )\Psi \rangle \\[4pt]&=\langle \Psi \mid {\hat {A}}{\hat {B}}\Psi \rangle -\langle \Psi \mid {\hat {A}}\langle {\hat {B}}\rangle \Psi \rangle -\langle \Psi \mid {\hat {B}}\langle {\hat {A}}\rangle \Psi \rangle +\langle \Psi \mid \langle {\hat {A}}\rangle \langle {\hat {B}}\rangle \Psi \rangle \\[4pt]&=\langle {\hat {A}}{\hat {B}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle +\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle \\[4pt]&=\langle {\hat {A}}{\hat {B}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle .\end{aligned}}

Similarly it can be shown that $\langle g\mid f\rangle =\langle {\hat {B}}{\hat {A}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle .$

Thus we have

$\langle f\mid g\rangle -\langle g\mid f\rangle =\langle {\hat {A}}{\hat {B}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle -\langle {\hat {B}}{\hat {A}}\rangle +\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle =\langle [{\hat {A}},{\hat {B}}]\rangle$

and
$\langle f\mid g\rangle +\langle g\mid f\rangle =\langle {\hat {A}}{\hat {B}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle +\langle {\hat {B}}{\hat {A}}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle =\langle \{{\hat {A}},{\hat {B}}\}\rangle -2\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle .$

We now substitute the above two equations above back into Eq. (4) and get

$|\langle f\mid g\rangle |^{2}={\Big (}{\frac {1}{2}}\langle \{{\hat {A}},{\hat {B}}\}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle {\Big )}^{2}+{\Big (}{\frac {1}{2i}}\langle [{\hat {A}},{\hat {B}}]\rangle {\Big )}^{2}\,.$

Substituting the above into Equation (2) we get the Schrödinger uncertainty relation

$\sigma _{A}\sigma _{B}\geq {\sqrt {{\Big (}{\frac {1}{2}}\langle \{{\hat {A}},{\hat {B}}\}\rangle -\langle {\hat {A}}\rangle \langle {\hat {B}}\rangle {\Big )}^{2}+{\Big (}{\frac {1}{2i}}\langle [{\hat {A}},{\hat {B}}]\rangle {\Big )}^{2}}}.$

This proof has an issue related to the domains of the operators involved. For the proof to make sense, the vector ${\hat {B}}|\Psi \rangle$  has to be in the domain of the unbounded operator ${\hat {A}}$ , which is not always the case. In fact, the Robertson uncertainty relation is false if ${\hat {A}}$  is an angle variable and ${\hat {B}}$  is the derivative with respect to this variable. In this example, the commutator is a nonzero constant—just as in the Heisenberg uncertainty relation—and yet there are states where the product of the uncertainties is zero. (See the counterexample section below.) This issue can be overcome by using a variational method for the proof., or by working with an exponentiated version of the canonical commutation relations.

Note that in the general form of the Robertson–Schrödinger uncertainty relation, there is no need to assume that the operators ${\hat {A}}$  and ${\hat {B}}$  are self-adjoint operators. It suffices to assume that they are merely symmetric operators. (The distinction between these two notions is generally glossed over in the physics literature, where the term Hermitian is used for either or both classes of operators. See Chapter 9 of Hall's book for a detailed discussion of this important but technical distinction.)

### Mixed states

The Robertson–Schrödinger uncertainty relation may be generalized in a straightforward way to describe mixed states.

$\sigma _{A}^{2}\sigma _{B}^{2}\geq \left|{\frac {1}{2}}\operatorname {tr} (\rho \{A,B\})-\operatorname {tr} (\rho A)\operatorname {tr} (\rho B)\right|^{2}+\left|{\frac {1}{2i}}\operatorname {tr} (\rho [A,B])\right|^{2}.$

### The Maccone–Pati uncertainty relations

The Robertson–Schrödinger uncertainty relation can be trivial if the state of the system is chosen to be eigenstate of one of the observable. The stronger uncertainty relations proved by Maccone and Pati give non-trivial bounds on the sum of the variances for two incompatible observables. (Earlier works on uncertainty relations formulated as the sum of variances include, e.g., Ref. due to Huang.) For two non-commuting observables $A$  and $B$  the first stronger uncertainty relation is given by

$\sigma _{A}^{2}+\sigma _{B}^{2}\geq \pm i\langle \Psi \mid [A,B]|\Psi \rangle +\mid \langle \Psi \mid (A\pm iB)\mid {\bar {\Psi }}\rangle |^{2},$

where $\sigma _{A}^{2}=\langle \Psi |A^{2}|\Psi \rangle -\langle \Psi \mid A\mid \Psi \rangle ^{2}$ , $\sigma _{B}^{2}=\langle \Psi |B^{2}|\Psi \rangle -\langle \Psi \mid B\mid \Psi \rangle ^{2}$ , $|{\bar {\Psi }}\rangle$  is a normalized vector that is orthogonal to the state of the system $|\Psi \rangle$  and one should choose the sign of $\pm i\langle \Psi \mid [A,B]\mid \Psi \rangle$  to make this real quantity a positive number.

The second stronger uncertainty relation is given by

$\sigma _{A}^{2}+\sigma _{B}^{2}\geq {\frac {1}{2}}|\langle {\bar {\Psi }}_{A+B}\mid (A+B)\mid \Psi \rangle |^{2}$

where $|{\bar {\Psi }}_{A+B}\rangle$  is a state orthogonal to $|\Psi \rangle$ . The form of $|{\bar {\Psi }}_{A+B}\rangle$  implies that the right-hand side of the new uncertainty relation is nonzero unless $|\Psi \rangle$  is an eigenstate of $(A+B)$ . One may note that $|\Psi \rangle$  can be an eigenstate of $(A+B)$  without being an eigenstate of either $A$  or $B$ . However, when $|\Psi \rangle$  is an eigenstate of one of the two observables the Heisenberg–Schrödinger uncertainty relation becomes trivial. But the lower bound in the new relation is nonzero unless $|\Psi \rangle$  is an eigenstate of both.

### Phase space

In the phase space formulation of quantum mechanics, the Robertson–Schrödinger relation follows from a positivity condition on a real star-square function. Given a Wigner function $W(x,p)$  with star product ★ and a function f, the following is generally true:

$\langle f^{*}\star f\rangle =\int (f^{*}\star f)\,W(x,p)\,dx\,dp\geq 0~.$

Choosing $f=a+bx+cp$ , we arrive at

$\langle f^{*}\star f\rangle ={\begin{bmatrix}a^{*}&b^{*}&c^{*}\end{bmatrix}}{\begin{bmatrix}1&\langle x\rangle &\langle p\rangle \\\langle x\rangle &\langle x\star x\rangle &\langle x\star p\rangle \\\langle p\rangle &\langle p\star x\rangle &\langle p\star p\rangle \end{bmatrix}}{\begin{bmatrix}a\\b\\c\end{bmatrix}}\geq 0~.$

Since this positivity condition is true for all a, b, and c, it follows that all the eigenvalues of the matrix are non-negative.

The non-negative eigenvalues then imply a corresponding non-negativity condition on the determinant,

$\det {\begin{bmatrix}1&\langle x\rangle &\langle p\rangle \\\langle x\rangle &\langle x\star x\rangle &\langle x\star p\rangle \\\langle p\rangle &\langle p\star x\rangle &\langle p\star p\rangle \end{bmatrix}}=\det {\begin{bmatrix}1&\langle x\rangle &\langle p\rangle \\\langle x\rangle &\langle x^{2}\rangle &\left\langle xp+{\frac {i\hbar }{2}}\right\rangle \\\langle p\rangle &\left\langle xp-{\frac {i\hbar }{2}}\right\rangle &\langle p^{2}\rangle \end{bmatrix}}\geq 0~,$

or, explicitly, after algebraic manipulation,
$\sigma _{x}^{2}\sigma _{p}^{2}=\left(\langle x^{2}\rangle -\langle x\rangle ^{2}\right)\left(\langle p^{2}\rangle -\langle p\rangle ^{2}\right)\geq \left(\langle xp\rangle -\langle x\rangle \langle p\rangle \right)^{2}+{\frac {\hbar ^{2}}{4}}~.$

### Examples

Since the Robertson and Schrödinger relations are for general operators, the relations can be applied to any two observables to obtain specific uncertainty relations. A few of the most common relations found in the literature are given below.

• For position and linear momentum, the canonical commutation relation $[{\hat {x}},{\hat {p}}]=i\hbar$  implies the Kennard inequality from above:
$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{2}}.$

• For two orthogonal components of the total angular momentum operator of an object:
$\sigma _{J_{i}}\sigma _{J_{j}}\geq {\frac {\hbar }{2}}{\big |}\langle J_{k}\rangle {\big |},$

where i, j, k are distinct, and Ji denotes angular momentum along the xi axis. This relation implies that unless all three components vanish together, only a single component of a system's angular momentum can be defined with arbitrary precision, normally the component parallel to an external (magnetic or electric) field. Moreover, for $[J_{x},J_{y}]=i\hbar \varepsilon _{xyz}J_{z}$ , a choice ${\hat {A}}=J_{x}$ , ${\hat {B}}=J_{y}$ , in angular momentum multiplets, ψ = |j, m⟩, bounds the Casimir invariant (angular momentum squared, $\langle J_{x}^{2}+J_{y}^{2}+J_{z}^{2}\rangle$ ) from below and thus yields useful constraints such as j(j + 1) ≥ m(m + 1), and hence jm, among others.
• In non-relativistic mechanics, time is privileged as an independent variable. Nevertheless, in 1945, L. I. Mandelshtam and I. E. Tamm derived a non-relativistic time–energy uncertainty relation, as follows. For a quantum system in a non-stationary state ψ and an observable B represented by a self-adjoint operator ${\hat {B}}$ , the following formula holds:
$\sigma _{E}{\frac {\sigma _{B}}{\left|{\frac {d\langle {\hat {B}}\rangle }{dt}}\right|}}\geq {\frac {\hbar }{2}},$

where σE is the standard deviation of the energy operator (Hamiltonian) in the state ψ, σB stands for the standard deviation of B. Although the second factor in the left-hand side has dimension of time, it is different from the time parameter that enters the Schrödinger equation. It is a lifetime of the state ψ with respect to the observable B: In other words, this is the time intervalt) after which the expectation value $\langle {\hat {B}}\rangle$  changes appreciably.
An informal, heuristic meaning of the principle is the following: A state that only exists for a short time cannot have a definite energy. To have a definite energy, the frequency of the state must be defined accurately, and this requires the state to hang around for many cycles, the reciprocal of the required accuracy. For example, in spectroscopy, excited states have a finite lifetime. By the time–energy uncertainty principle, they do not have a definite energy, and, each time they decay, the energy they release is slightly different. The average energy of the outgoing photon has a peak at the theoretical energy of the state, but the distribution has a finite width called the natural linewidth. Fast-decaying states have a broad linewidth, while slow-decaying states have a narrow linewidth.
The same linewidth effect also makes it difficult to specify the rest mass of unstable, fast-decaying particles in particle physics. The faster the particle decays (the shorter its lifetime), the less certain is its mass (the larger the particle's width).
• For the number of electrons in a superconductor and the phase of its Ginzburg–Landau order parameter
$\Delta N\,\Delta \varphi \geq 1.$

### A counterexample

Suppose we consider a quantum particle on a ring, where the wave function depends on an angular variable $\theta$ , which we may take to lie in the interval $[0,2\pi ]$ . Define "position" and "momentum" operators ${\hat {A}}$  and ${\hat {B}}$  by

${\hat {A}}\psi (\theta )=\theta \psi (\theta ),\quad \theta \in [0,2\pi ],$

and
${\hat {B}}\psi =-i\hbar {\frac {d\psi }{d\theta }},$

where we impose periodic boundary conditions on ${\hat {B}}$ . The definition of ${\hat {A}}$  depends on our choice to have $\theta$  range from 0 to $2\pi$ . These operators satisfy the usual commutation relations for position and momentum operators, $[{\hat {A}},{\hat {B}}]=i\hbar$ .

Now let $\psi$  be any of the eigenstates of ${\hat {B}}$ , which are given by $\psi (\theta )=e^{2\pi in\theta }$ . These states are normalizable, unlike the eigenstates of the momentum operator on the line. Also the operator ${\hat {A}}$  is bounded, since $\theta$  ranges over a bounded interval. Thus, in the state $\psi$ , the uncertainty of $B$  is zero and the uncertainty of $A$  is finite, so that

$\sigma _{A}\sigma _{B}=0.$

Although this result appears to violate the Robertson uncertainty principle, the paradox is resolved when we note that $\psi$  is not in the domain of the operator ${\hat {B}}{\hat {A}}$ , since multiplication by $\theta$  disrupts the periodic boundary conditions imposed on ${\hat {B}}$ . Thus, the derivation of the Robertson relation, which requires ${\hat {A}}{\hat {B}}\psi$  and ${\hat {B}}{\hat {A}}\psi$  to be defined, does not apply. (These also furnish an example of operators satisfying the canonical commutation relations but not the Weyl relations.)

For the usual position and momentum operators ${\hat {X}}$  and ${\hat {P}}$  on the real line, no such counterexamples can occur. As long as $\sigma _{x}$  and $\sigma _{p}$  are defined in the state $\psi$ , the Heisenberg uncertainty principle holds, even if $\psi$  fails to be in the domain of ${\hat {X}}{\hat {P}}$  or of ${\hat {P}}{\hat {X}}$ .

## Examples

(Refs )

### Quantum harmonic oscillator stationary states

Consider a one-dimensional quantum harmonic oscillator. It is possible to express the position and momentum operators in terms of the creation and annihilation operators:

${\hat {x}}={\sqrt {\frac {\hbar }{2m\omega }}}(a+a^{\dagger })$

${\hat {p}}=i{\sqrt {\frac {m\omega \hbar }{2}}}(a^{\dagger }-a).$

Using the standard rules for creation and annihilation operators on the energy eigenstates,

$a^{\dagger }|n\rangle ={\sqrt {n+1}}|n+1\rangle$

$a|n\rangle ={\sqrt {n}}|n-1\rangle ,$

the variances may be computed directly,
$\sigma _{x}^{2}={\frac {\hbar }{m\omega }}\left(n+{\frac {1}{2}}\right)$

$\sigma _{p}^{2}=\hbar m\omega \left(n+{\frac {1}{2}}\right)\,.$

The product of these standard deviations is then
$\sigma _{x}\sigma _{p}=\hbar \left(n+{\frac {1}{2}}\right)\geq {\frac {\hbar }{2}}.~$

In particular, the above Kennard bound is saturated for the ground state n=0, for which the probability density is just the normal distribution.

### Quantum harmonic oscillators with Gaussian initial condition

In a quantum harmonic oscillator of characteristic angular frequency ω, place a state that is offset from the bottom of the potential by some displacement x0 as

$\psi (x)=\left({\frac {m\Omega }{\pi \hbar }}\right)^{1/4}\exp {\left(-{\frac {m\Omega (x-x_{0})^{2}}{2\hbar }}\right)},$

where Ω describes the width of the initial state but need not be the same as ω. Through integration over the propagator, we can solve for the full time-dependent solution. After many cancelations, the probability densities reduce to
$|\Psi (x,t)|^{2}\sim {\mathcal {N}}\left(x_{0}\cos {(\omega t)},{\frac {\hbar }{2m\Omega }}\left(\cos ^{2}(\omega t)+{\frac {\Omega ^{2}}{\omega ^{2}}}\sin ^{2}{(\omega t)}\right)\right)$

$|\Phi (p,t)|^{2}\sim {\mathcal {N}}\left(-mx_{0}\omega \sin(\omega t),{\frac {\hbar m\Omega }{2}}\left(\cos ^{2}{(\omega t)}+{\frac {\omega ^{2}}{\Omega ^{2}}}\sin ^{2}{(\omega t)}\right)\right),$

where we have used the notation ${\mathcal {N}}(\mu ,\sigma ^{2})$  to denote a normal distribution of mean μ and variance σ2. Copying the variances above and applying trigonometric identities, we can write the product of the standard deviations as
{\begin{aligned}\sigma _{x}\sigma _{p}&={\frac {\hbar }{2}}{\sqrt {\left(\cos ^{2}{(\omega t)}+{\frac {\Omega ^{2}}{\omega ^{2}}}\sin ^{2}{(\omega t)}\right)\left(\cos ^{2}{(\omega t)}+{\frac {\omega ^{2}}{\Omega ^{2}}}\sin ^{2}{(\omega t)}\right)}}\\&={\frac {\hbar }{4}}{\sqrt {3+{\frac {1}{2}}\left({\frac {\Omega ^{2}}{\omega ^{2}}}+{\frac {\omega ^{2}}{\Omega ^{2}}}\right)-\left({\frac {1}{2}}\left({\frac {\Omega ^{2}}{\omega ^{2}}}+{\frac {\omega ^{2}}{\Omega ^{2}}}\right)-1\right)\cos {(4\omega t)}}}\end{aligned}}

From the relations

${\frac {\Omega ^{2}}{\omega ^{2}}}+{\frac {\omega ^{2}}{\Omega ^{2}}}\geq 2,\quad |\cos(4\omega t)|\leq 1,$

we can conclude the following: (the right most equality holds only when Ω = ω) .
$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{4}}{\sqrt {3+{\frac {1}{2}}\left({\frac {\Omega ^{2}}{\omega ^{2}}}+{\frac {\omega ^{2}}{\Omega ^{2}}}\right)-\left({\frac {1}{2}}\left({\frac {\Omega ^{2}}{\omega ^{2}}}+{\frac {\omega ^{2}}{\Omega ^{2}}}\right)-1\right)}}={\frac {\hbar }{2}}.$

### Coherent states

A coherent state is a right eigenstate of the annihilation operator,

${\hat {a}}|\alpha \rangle =\alpha |\alpha \rangle ,$

which may be represented in terms of Fock states as
$|\alpha \rangle =e^{-{|\alpha |^{2} \over 2}}\sum _{n=0}^{\infty }{\alpha ^{n} \over {\sqrt {n!}}}|n\rangle$

In the picture where the coherent state is a massive particle in a quantum harmonic oscillator, the position and momentum operators may be expressed in terms of the annihilation operators in the same formulas above and used to calculate the variances,

$\sigma _{x}^{2}={\frac {\hbar }{2m\omega }},$

$\sigma _{p}^{2}={\frac {\hbar m\omega }{2}}.$

Therefore, every coherent state saturates the Kennard bound
$\sigma _{x}\sigma _{p}={\sqrt {\frac {\hbar }{2m\omega }}}\,{\sqrt {\frac {\hbar m\omega }{2}}}={\frac {\hbar }{2}}.$

with position and momentum each contributing an amount ${\textstyle {\sqrt {\hbar /2}}}$  in a "balanced" way. Moreover, every squeezed coherent state also saturates the Kennard bound although the individual contributions of position and momentum need not be balanced in general.

### Particle in a box

Consider a particle in a one-dimensional box of length $L$ . The eigenfunctions in position and momentum space are

$\psi _{n}(x,t)={\begin{cases}A\sin(k_{n}x)\mathrm {e} ^{-\mathrm {i} \omega _{n}t},&0

and
$\varphi _{n}(p,t)={\sqrt {\frac {\pi L}{\hbar }}}\,\,{\frac {n\left(1-(-1)^{n}e^{-ikL}\right)e^{-i\omega _{n}t}}{\pi ^{2}n^{2}-k^{2}L^{2}}},$

where ${\textstyle \omega _{n}={\frac {\pi ^{2}\hbar n^{2}}{8L^{2}m}}}$  and we have used the de Broglie relation $p=\hbar k$ . The variances of $x$  and $p$  can be calculated explicitly:
$\sigma _{x}^{2}={\frac {L^{2}}{12}}\left(1-{\frac {6}{n^{2}\pi ^{2}}}\right)$

$\sigma _{p}^{2}=\left({\frac {\hbar n\pi }{L}}\right)^{2}.$

The product of the standard deviations is therefore

$\sigma _{x}\sigma _{p}={\frac {\hbar }{2}}{\sqrt {{\frac {n^{2}\pi ^{2}}{3}}-2}}.$

For all $n=1,\,2,\,3,\,\ldots$ , the quantity ${\textstyle {\sqrt {{\frac {n^{2}\pi ^{2}}{3}}-2}}}$  is greater than 1, so the uncertainty principle is never violated. For numerical concreteness, the smallest value occurs when $n=1$ , in which case
$\sigma _{x}\sigma _{p}={\frac {\hbar }{2}}{\sqrt {{\frac {\pi ^{2}}{3}}-2}}\approx 0.568\hbar >{\frac {\hbar }{2}}.$

### Constant momentum

Position space probability density of an initially Gaussian state moving at minimally uncertain, constant momentum in free space

Assume a particle initially has a momentum space wave function described by a normal distribution around some constant momentum p0 according to

$\varphi (p)=\left({\frac {x_{0}}{\hbar {\sqrt {\pi }}}}\right)^{1/2}\exp \left({\frac {-x_{0}^{2}(p-p_{0})^{2}}{2\hbar ^{2}}}\right),$

where we have introduced a reference scale ${\textstyle x_{0}={\sqrt {\hbar /m\omega _{0}}}}$ , with $\omega _{0}>0$  describing the width of the distribution—cf. nondimensionalization. If the state is allowed to evolve in free space, then the time-dependent momentum and position space wave functions are
$\Phi (p,t)=\left({\frac {x_{0}}{\hbar {\sqrt {\pi }}}}\right)^{1/2}\exp \left({\frac {-x_{0}^{2}(p-p_{0})^{2}}{2\hbar ^{2}}}-{\frac {ip^{2}t}{2m\hbar }}\right),$

$\Psi (x,t)=\left({\frac {1}{x_{0}{\sqrt {\pi }}}}\right)^{1/2}{\frac {e^{-x_{0}^{2}p_{0}^{2}/2\hbar ^{2}}}{\sqrt {1+i\omega _{0}t}}}\,\exp \left(-{\frac {(x-ix_{0}^{2}p_{0}/\hbar )^{2}}{2x_{0}^{2}(1+i\omega _{0}t)}}\right).$

Since $\langle p(t)\rangle =p_{0}$  and $\sigma _{p}(t)=\hbar /({\sqrt {2}}x_{0})$ , this can be interpreted as a particle moving along with constant momentum at arbitrarily high precision. On the other hand, the standard deviation of the position is

$\sigma _{x}={\frac {x_{0}}{\sqrt {2}}}{\sqrt {1+\omega _{0}^{2}t^{2}}}$

such that the uncertainty product can only increase with time as
$\sigma _{x}(t)\sigma _{p}(t)={\frac {\hbar }{2}}{\sqrt {1+\omega _{0}^{2}t^{2}}}$

### Systematic and statistical errors

The inequalities above focus on the statistical imprecision of observables as quantified by the standard deviation $\sigma$ . Heisenberg's original version, however, was dealing with the systematic error, a disturbance of the quantum system produced by the measuring apparatus, i.e., an observer effect.

If we let $\varepsilon _{A}$  represent the error (i.e., inaccuracy) of a measurement of an observable A and $\eta _{B}$  the disturbance produced on a subsequent measurement of the conjugate variable B by the former measurement of A, then the inequality proposed by Ozawa — encompassing both systematic and statistical errors — holds:

$\varepsilon _{A}\,\eta _{B}+\varepsilon _{A}\,\sigma _{B}+\sigma _{A}\,\eta _{B}\,\geq \,{\frac {1}{2}}\,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|$

Heisenberg's uncertainty principle, as originally described in the 1927 formulation, mentions only the first term of Ozawa inequality, regarding the systematic error. Using the notation above to describe the error/disturbance effect of sequential measurements (first A, then B), it could be written as

$\varepsilon _{A}\,\eta _{B}\,\geq \,{\frac {1}{2}}\,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|$

The formal derivation of the Heisenberg relation is possible but far from intuitive. It was not proposed by Heisenberg, but formulated in a mathematically consistent way only in recent years. Also, it must be stressed that the Heisenberg formulation is not taking into account the intrinsic statistical errors $\sigma _{A}$  and $\sigma _{B}$ . There is increasing experimental evidence that the total quantum uncertainty cannot be described by the Heisenberg term alone, but requires the presence of all the three terms of the Ozawa inequality.

Using the same formalism, it is also possible to introduce the other kind of physical situation, often confused with the previous one, namely the case of simultaneous measurements (A and B at the same time):

$\varepsilon _{A}\,\varepsilon _{B}\,\geq \,{\frac {1}{2}}\,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|$

The two simultaneous measurements on A and B are necessarily unsharp or weak.

It is also possible to derive an uncertainty relation that, as the Ozawa's one, combines both the statistical and systematic error components, but keeps a form very close to the Heisenberg original inequality. By adding Robertson

$\sigma _{A}\,\sigma _{B}\,\geq \,{\frac {1}{2}}\,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|$

and Ozawa relations we obtain

$\varepsilon _{A}\eta _{B}+\varepsilon _{A}\,\sigma _{B}+\sigma _{A}\,\eta _{B}+\sigma _{A}\sigma _{B}\geq \left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|.$

The four terms can be written as:
$(\varepsilon _{A}+\sigma _{A})\,(\eta _{B}+\sigma _{B})\,\geq \,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|.$

Defining:
${\bar {\varepsilon }}_{A}\,\equiv \,(\varepsilon _{A}+\sigma _{A})$

as the inaccuracy in the measured values of the variable A and
${\bar {\eta }}_{B}\,\equiv \,(\eta _{B}+\sigma _{B})$

as the resulting fluctuation in the conjugate variable B, Fujikawa established an uncertainty relation similar to the Heisenberg original one, but valid both for systematic and statistical errors:

${\bar {\varepsilon }}_{A}\,{\bar {\eta }}_{B}\,\geq \,\left|{\Bigl \langle }{\bigl [}{\hat {A}},{\hat {B}}{\bigr ]}{\Bigr \rangle }\right|$

### Quantum entropic uncertainty principle

For many distributions, the standard deviation is not a particularly natural way of quantifying the structure. For example, uncertainty relations in which one of the observables is an angle has little physical meaning for fluctuations larger than one period. Other examples include highly bimodal distributions, or unimodal distributions with divergent variance.

A solution that overcomes these issues is an uncertainty based on entropic uncertainty instead of the product of variances. While formulating the many-worlds interpretation of quantum mechanics in 1957, Hugh Everett III conjectured a stronger extension of the uncertainty principle based on entropic certainty. This conjecture, also studied by Hirschman and proven in 1975 by Beckner and by Iwo Bialynicki-Birula and Jerzy Mycielski is that, for two normalized, dimensionless Fourier transform pairs f (a) and g(b) where

$f(a)=\int _{-\infty }^{\infty }g(b)\ e^{2\pi iab}\,db$     and     $\,\,\,g(b)=\int _{-\infty }^{\infty }f(a)\ e^{-2\pi iab}\,da$

the Shannon information entropies

$H_{a}=\int _{-\infty }^{\infty }f(a)\log(f(a))\,da,$

and
$H_{b}=\int _{-\infty }^{\infty }g(b)\log(g(b))\,db$

are subject to the following constraint,

$H_{a}+H_{b}\geq \log(e/2)$

where the logarithms may be in any base.

The probability distribution functions associated with the position wave function ψ(x) and the momentum wave function φ(x) have dimensions of inverse length and momentum respectively, but the entropies may be rendered dimensionless by

$H_{x}=-\int |\psi (x)|^{2}\ln \left(x_{0}\,|\psi (x)|^{2}\right)dx=-\left\langle \ln \left(x_{0}\,\left|\psi (x)\right|^{2}\right)\right\rangle$

$H_{p}=-\int |\varphi (p)|^{2}\ln(p_{0}\,|\varphi (p)|^{2})\,dp=-\left\langle \ln(p_{0}\left|\varphi (p)\right|^{2})\right\rangle$

where x0 and p0 are some arbitrarily chosen length and momentum respectively, which render the arguments of the logarithms dimensionless. Note that the entropies will be functions of these chosen parameters. Due to the Fourier transform relation between the position wave function ψ(x) and the momentum wavefunction φ(p), the above constraint can be written for the corresponding entropies as

$H_{x}+H_{p}\geq \log \left({\frac {e\,h}{2\,x_{0}\,p_{0}}}\right)$

where h is Planck's constant.

Depending on one's choice of the x0 p0 product, the expression may be written in many ways. If x0 p0 is chosen to be h, then

$H_{x}+H_{p}\geq \log \left({\frac {e}{2}}\right)$

If, instead, x0 p0 is chosen to be ħ, then

$H_{x}+H_{p}\geq \log(e\,\pi )$

If x0 and p0 are chosen to be unity in whatever system of units are being used, then

$H_{x}+H_{p}\geq \log \left({\frac {e\,h}{2}}\right)$

where h is interpreted as a dimensionless number equal to the value of Planck's constant in the chosen system of units. Note that these inequalities can be extended to multimode quantum states, or wavefunctions in more than one spatial dimension.

The quantum entropic uncertainty principle is more restrictive than the Heisenberg uncertainty principle. From the inverse logarithmic Sobolev inequalities

$H_{x}\leq {\frac {1}{2}}\log(2e\pi \sigma _{x}^{2}/x_{0}^{2})~,$

$H_{p}\leq {\frac {1}{2}}\log(2e\pi \sigma _{p}^{2}/p_{0}^{2})~,$

(equivalently, from the fact that normal distributions maximize the entropy of all such with a given variance), it readily follows that this entropic uncertainty principle is stronger than the one based on standard deviations, because
$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{2}}\exp \left(H_{x}+H_{p}-\log \left({\frac {e\,h}{2\,x_{0}\,p_{0}}}\right)\right)\geq {\frac {\hbar }{2}}~.$

In other words, the Heisenberg uncertainty principle, is a consequence of the quantum entropic uncertainty principle, but not vice versa. A few remarks on these inequalities. First, the choice of base e is a matter of popular convention in physics. The logarithm can alternatively be in any base, provided that it be consistent on both sides of the inequality. Second, recall the Shannon entropy has been used, not the quantum von Neumann entropy. Finally, the normal distribution saturates the inequality, and it is the only distribution with this property, because it is the maximum entropy probability distribution among those with fixed variance (cf. here for proof).

A measurement apparatus will have a finite resolution set by the discretization of its possible outputs into bins, with the probability of lying within one of the bins given by the Born rule. We will consider the most common experimental situation, in which the bins are of uniform size. Let δx be a measure of the spatial resolution. We take the zeroth bin to be centered near the origin, with possibly some small constant offset c. The probability of lying within the jth interval of width δx is

$\operatorname {P} [x_{j}]=\int _{(j-1/2)\delta x-c}^{(j+1/2)\delta x-c}|\psi (x)|^{2}\,dx$

To account for this discretization, we can define the Shannon entropy of the wave function for a given measurement apparatus as

$H_{x}=-\sum _{j=-\infty }^{\infty }\operatorname {P} [x_{j}]\ln \operatorname {P} [x_{j}].$

Under the above definition, the entropic uncertainty relation is

$H_{x}+H_{p}>\ln \left({\frac {e}{2}}\right)-\ln \left({\frac {\delta x\delta p}{h}}\right).$

Here we note that δx δp/h is a typical infinitesimal phase space volume used in the calculation of a partition function. The inequality is also strict and not saturated. Efforts to improve this bound are an active area of research.

### The Efimov inequality by Pauli matrices

In 1976, Sergei P. Efimov deduced an inequality that refines the Robertson relation by applying high-order commutators. His approach is based on the Pauli matrices. Later V.V. Dodonov used the method to derive relations for several observables by using Clifford algebra.

According to Jackiw, the Robertson uncertainty is valid only when the commutator is C-number. The Efimov method is effective for variables that have commutators of high-order - for example for the kinetic energy operator and for coordinate one. Consider two operators ${\hat {A}}$  and ${\hat {B}}$  that have commutator ${\hat {C}}$ :

$\left[{\hat {A}},{\hat {B}}\right]={\hat {C}}.$

To shorten formulas we use the operator deviations:

$\delta {\hat {A}}={\hat {A}}-\left\langle {\hat {A}}\right\rangle ,$

when new operators have the zero mean deviation. To use the Pauli matrices we can consider the operator:
${\hat {F}}=\gamma _{1}\,\delta {\hat {A}}\,\sigma _{1}+\gamma _{2}\,\delta {\hat {B}}\,\sigma _{2}+\gamma _{3}\,\delta {\hat {C}}\,\sigma _{3},$

where 2×2 spin matrices $\sigma _{i}$  have commutators:
$\left[\sigma _{i},\sigma _{k}\right]=\,i\,e_{ikl}\sigma _{l},$

where $e_{ikl}$  antisymmetric symbol. They act in the spin space independently from $\delta {\hat {A}}{,}\,\delta {\hat {B}}{,}\,\delta {\hat {C}}$ . Pauli matrices define the Clifford algebra. We take arbitrary numbers $\gamma _{i}$  in operator ${\hat {F}}$  to be real.

Physical square of the operator is equal to:

${\hat {F}}{\hat {F}}^{+}=\gamma _{1}^{2}(\,\delta {\hat {A}}\delta {\hat {A}}^{+})+\gamma _{2}^{2}(\,\delta {\hat {B}}\delta {\hat {B}}^{+})+\gamma _{3}^{2}(\,\delta {\hat {C}}\delta {\hat {C}}^{+})+\gamma _{1}\gamma _{2}\,{\hat {C}}\,\sigma _{3}+\gamma _{2}\gamma _{3}\,{\hat {C}}_{2}\,\sigma _{1}-\gamma _{1}\gamma _{3}\,{\hat {C}}_{3}\,\sigma _{2},$

where ${\hat {F}}^{+}$  is adjoint operator and commutators ${\hat {C}}_{2}$  and ${\hat {C}}_{3}$  are following:

${\hat {C}}_{2}=i\left[\delta {\hat {B}},{\hat {C}}\right],\qquad {\hat {C}}_{3}=i\left[\delta {\hat {A}},{\hat {C}}\right].$

Operator ${\hat {F}}{\hat {F}}^{+}$  is positive-definite, what is essential to get an inequality below . Taking average value of it over state $\left|\psi \right\rangle$ , we get positive-definite matrix 2×2:

{\begin{aligned}\left\langle \psi \right|{\hat {F}}{\hat {F}}^{+}\left|\psi \right\rangle &=\gamma _{1}^{2}\,\left\langle (\delta {\hat {A}})^{2}\right\rangle +\gamma _{2}^{2}\,\left\langle (\delta {\hat {B}})^{2}\right\rangle +\gamma _{3}^{2}\,\left\langle (\delta {\hat {C}})^{2}\right\rangle +\\&+\gamma _{1}\gamma _{2}\,\left\langle {\hat {C}}\right\rangle \,\sigma _{3}+\gamma _{2}\gamma _{3}\,\left\langle {\hat {C}}_{2}\right\rangle \,\sigma _{1}-\gamma _{1}\gamma _{3}\,\left\langle {\hat {C}}_{3}\right\rangle \,\sigma _{2},\end{aligned}}

where used the notion:
$\left\langle (\delta {\hat {A}})^{2}\right\rangle =\left\langle (\delta {\hat {A}}\delta A^{+})\right\rangle$

and analogous one for operators $B,\,C$ . Regarding that coefficients $\gamma _{i}$  are arbitrary in the equation, we get the positive-definite matrix 6×6. Sylvester's criterion says that its leading principal minors are non-negative. The Robertson uncertainty follows from minor of forth degree. To strengthen result we calculate determinant of sixth order:

$\left\langle {(\delta {\hat {A}})^{2}}\right\rangle \left\langle {(\delta {\hat {B}})^{2}}\right\rangle \left\langle {(\delta {\hat {C}})^{2}}\right\rangle \geq {\frac {1}{4}}\left\langle {\hat {C}}\right\rangle ^{2}\left\langle {(\delta {\hat {C}})^{2}}\right\rangle +{\frac {1}{4}}\left\langle (\delta {\hat {A}})^{2}\right\rangle \left\langle {\hat {C}}_{2}\right\rangle ^{2}+{\frac {1}{4}}\left\langle (\delta {\hat {B}})^{2}\right\rangle \left\langle {\hat {C}}_{3}\right\rangle ^{2}$

The equality is observed only when the state is an eigenstate for the operator ${\hat {F}}$  and likewise for the spin variables:

${\hat {F}}\,{\left|\psi \right\rangle }{\left|{\hat {s}}\right\rangle }=0.$

Found relation we may apply to the kinetic energy operator ${\textstyle {\hat {E}}_{\text{kin}}={\frac {\mathbf {\hat {p}} ^{2}}{2}}}$  and for operator of the coordinate $\mathbf {\hat {x}}$ :

$\left\langle (\delta {\hat {E}})^{2}\right\rangle \left\langle (\delta {\hat {\mathbf {x} }})^{2}\right\rangle \geq {\frac {\hbar ^{2}}{4}}\left\langle \,\mathbf {\hat {p}} \,\right\rangle ^{2}+{\frac {\hbar ^{2}}{2}}\left\langle (\delta {\hat {E}})^{2}\right\rangle \left\langle (\delta \mathbf {\hat {p}} )^{2}\right\rangle ^{-1}$

In particular, equality in the formula is observed for the ground state of the oscillator, whereas the right-hand item of the Robertson uncertainty vanishes:

$\left\langle \,\mathbf {\hat {p}} \,\right\rangle =0.$

Physical meaning of the relation is more clear if to divide it by the squared nonzero average impulse what yields:

$\left\langle (\delta {\hat {E}})^{2}\right\rangle (\delta t)^{2}\geq {\frac {\hbar ^{2}}{4}}+{\frac {\hbar ^{2}}{2}}\left\langle (\delta {\hat {E}})^{2}\right\rangle \left\langle (\delta \mathbf {\hat {p}} )^{2}\right\rangle ^{-1}\left\langle \,\mathbf {\hat {p}} \,\right\rangle ^{-2},$

where $(\delta t)^{2}=\left\langle (\delta \mathbf {\hat {x}} )^{2}\right\rangle \left\langle \mathbf {\,} {\hat {p}}\,\right\rangle ^{-2}$  is squared effective time within which a particle moves near the mean trajectory (Mass of the particle is equal to 1).

The method can be applied for three noncommuting operators of angular momentum $\mathbf {\hat {L}}$ . We compile the operator:

${\hat {F}}=\gamma _{1}\,\delta {\hat {L}}_{x}\,\sigma _{1}+\gamma _{2}\,\delta {\hat {L}}_{y}\,\sigma _{2}+\gamma _{3}\,\delta {\hat {L}}_{z}\,\sigma _{3}.$

We recall that the operators $\sigma _{i}$  are auxiliary and there is no relation between the spin variables of the particle. In such way, their commutative properties are of importance only. Squared and averaged operator ${\hat {F}}$  gives positive-definite matrix where we get following inequality from:

$\left\langle {(\delta {\hat {L}}_{x})}^{2}\right\rangle \left\langle {(\delta {\hat {L}}_{y})}^{2}\right\rangle \left\langle {(\delta {\hat {L}}_{z})}^{2})\right\rangle \geq {\frac {\hbar ^{2}}{4}}\sum _{i=1}^{3}\left\langle (\delta {\hat {L}}_{i})^{2}\right\rangle \left\langle {\hat {L}}_{i}\right\rangle ^{2}$

To develop method for a group of operators one may use the Clifford algebra instead of the Pauli matrices.

## Harmonic analysis

In the context of harmonic analysis, a branch of mathematics, the uncertainty principle implies that one cannot at the same time localize the value of a function and its Fourier transform. To wit, the following inequality holds,

$\left(\int _{-\infty }^{\infty }x^{2}|f(x)|^{2}\,dx\right)\left(\int _{-\infty }^{\infty }\xi ^{2}|{\hat {f}}(\xi )|^{2}\,d\xi \right)\geq {\frac {\|f\|_{2}^{4}}{16\pi ^{2}}}.$

Further mathematical uncertainty inequalities, including the above entropic uncertainty, hold between a function f and its Fourier transform ƒ̂:

$H_{x}+H_{\xi }\geq \log(e/2)$

### Signal processing

In the context of signal processing, and in particular time–frequency analysis, uncertainty principles are referred to as the Gabor limit, after Dennis Gabor, or sometimes the Heisenberg–Gabor limit. The basic result, which follows from "Benedicks's theorem", below, is that a function cannot be both time limited and band limited (a function and its Fourier transform cannot both have bounded domain)—see bandlimited versus timelimited. Thus

$\sigma _{t}\cdot \sigma _{f}\geq {\frac {1}{4\pi }}\approx 0.08{\text{ cycles}},$

where $\sigma _{t}$  and $\sigma _{f}$  are the standard deviations of the time and frequency estimates respectively.

Stated alternatively, "One cannot simultaneously sharply localize a signal (function f) in both the time domain and frequency domain (ƒ̂, its Fourier transform)".

When applied to filters, the result implies that one cannot achieve high temporal resolution and frequency resolution at the same time; a concrete example are the resolution issues of the short-time Fourier transform—if one uses a wide window, one achieves good frequency resolution at the cost of temporal resolution, while a narrow window has the opposite trade-off.

Alternate theorems give more precise quantitative results, and, in time–frequency analysis, rather than interpreting the (1-dimensional) time and frequency domains separately, one instead interprets the limit as a lower limit on the support of a function in the (2-dimensional) time–frequency plane. In practice, the Gabor limit limits the simultaneous time–frequency resolution one can achieve without interference; it is possible to achieve higher resolution, but at the cost of different components of the signal interfering with each other.

As a result, in order to analyze signals where the transients are important, the wavelet transform is often used instead of the Fourier.

### Discrete Fourier transform

Let $\left\{\mathbf {x_{n}} \right\}:=x_{0},x_{1},\ldots ,x_{N-1}$  be a sequence of N complex numbers and $\left\{\mathbf {X_{k}} \right\}:=X_{0},X_{1},\ldots ,X_{N-1},$  its discrete Fourier transform.

Denote by $\|x\|_{0}$  the number of non-zero elements in the time sequence $x_{0},x_{1},\ldots ,x_{N-1}$  and by $\|X\|_{0}$  the number of non-zero elements in the frequency sequence $X_{0},X_{1},\ldots ,X_{N-1}$ . Then,

$\|x\|_{0}\cdot \|X\|_{0}\geq N.$

This inequality is sharp, with equality achieved when x or X is a Dirac mass, or more generally when x is a nonzero multiple of a Dirac comb supported on a subgroup of the integers modulo N (in which case X is also a Dirac comb supported on a complementary subgroup, and vice versa).

More generally, if T and W are subsets of the integers modulo N, let $L_{T},R_{W}:\ell ^{2}(\mathbb {Z} /N\mathbb {N} )\to \ell ^{2}(\mathbb {Z} /N\mathbb {N} )$  denote the time-limiting operator and band-limiting operators, respectively. Then

$\|L_{T}R_{W}\|^{2}\leq {\frac {|T||W|}{|G|}}$

where the norm is the operator norm of operators on the Hilbert space $\ell ^{2}(\mathbb {Z} /N\mathbb {Z} )$  of functions on the integers modulo N. This inequality has implications for signal reconstruction.

When N is a prime number, a stronger inequality holds:

$\|x\|_{0}+\|X\|_{0}\geq N+1.$

Discovered by Terence Tao, this inequality is also sharp.

### Benedicks's theorem

Amrein–Berthier and Benedicks's theorem intuitively says that the set of points where f is non-zero and the set of points where ƒ̂ is non-zero cannot both be small.

Specifically, it is impossible for a function f in L2(R) and its Fourier transform ƒ̂ to both be supported on sets of finite Lebesgue measure. A more quantitative version is

$\|f\|_{L^{2}(\mathbf {R} ^{d})}\leq Ce^{C|S||\Sigma |}{\bigl (}\|f\|_{L^{2}(S^{c})}+\|{\hat {f}}\|_{L^{2}(\Sigma ^{c})}{\bigr )}~.$

One expects that the factor CeC|S||Σ| may be replaced by CeC(|S||Σ|)1/d, which is only known if either S or Σ is convex.

### Hardy's uncertainty principle

The mathematician G. H. Hardy formulated the following uncertainty principle: it is not possible for f and ƒ̂ to both be "very rapidly decreasing". Specifically, if f in $L^{2}(\mathbb {R} )$  is such that

$|f(x)|\leq C(1+|x|)^{N}e^{-a\pi x^{2}}$

and
$|{\hat {f}}(\xi )|\leq C(1+|\xi |)^{N}e^{-b\pi \xi ^{2}}$

($C>0,N$  an integer),

then, if ab > 1, f = 0, while if ab = 1, then there is a polynomial P of degree N such that

$f(x)=P(x)e^{-a\pi x^{2}}.$

This was later improved as follows: if $f\in L^{2}(\mathbb {R} ^{d})$  is such that

$\int _{\mathbb {R} ^{d}}\int _{\mathbb {R} ^{d}}|f(x)||{\hat {f}}(\xi )|{\frac {e^{\pi |\langle x,\xi \rangle |}}{(1+|x|+|\xi |)^{N}}}\,dx\,d\xi <+\infty ~,$

then
$f(x)=P(x)e^{-\pi \langle Ax,x\rangle }~,$

where P is a polynomial of degree (Nd)/2 and A is a real d × d positive definite matrix.

This result was stated in Beurling's complete works without proof and proved in Hörmander (the case $d=1,N=0$ ) and Bonami, Demange, and Jaming for the general case. Note that Hörmander–Beurling's version implies the case ab > 1 in Hardy's Theorem while the version by Bonami–Demange–Jaming covers the full strength of Hardy's Theorem. A different proof of Beurling's theorem based on Liouville's theorem appeared in ref.

A full description of the case ab < 1 as well as the following extension to Schwartz class distributions appears in ref.

Theorem —  If a tempered distribution $f\in {\mathcal {S}}'(\mathbb {R} ^{d})$  is such that

$e^{\pi |x|^{2}}f\in {\mathcal {S}}'(\mathbb {R} ^{d})$

and
$e^{\pi |\xi |^{2}}{\hat {f}}\in {\mathcal {S}}'(\mathbb {R} ^{d})~,$

then
$f(x)=P(x)e^{-\pi \langle Ax,x\rangle }~,$

for some convenient polynomial P and real positive definite matrix A of type d × d.

## History

Werner Heisenberg formulated the uncertainty principle at Niels Bohr's institute in Copenhagen, while working on the mathematical foundations of quantum mechanics.

In 1925, following pioneering work with Hendrik Kramers, Heisenberg developed matrix mechanics, which replaced the ad hoc old quantum theory with modern quantum mechanics. The central premise was that the classical concept of motion does not fit at the quantum level, as electrons in an atom do not travel on sharply defined orbits. Rather, their motion is smeared out in a strange way: the Fourier transform of its time dependence only involves those frequencies that could be observed in the quantum jumps of their radiation.

Heisenberg's paper did not admit any unobservable quantities like the exact position of the electron in an orbit at any time; he only allowed the theorist to talk about the Fourier components of the motion. Since the Fourier components were not defined at the classical frequencies, they could not be used to construct an exact trajectory, so that the formalism could not answer certain overly precise questions about where the electron was or how fast it was going.

According to one account: “Heisenberg's paper marked a radical departure from previous attempts to solve atomic problems by making use of observable quantities only. ‘My entire meagre efforts go toward killing off and suitably replacing the concept of the orbital paths that one cannot observe,’ he wrote in a letter dated 9 July 1925.”

It was actually Einstein who first raised the problem to Heisenberg in 1926 upon their first real discussion.  Einstein had invited Heisenberg to his home for a discussion of matrix mechanics upon its introduction.  As Heisenberg describes the discussion: “On the way home, he questioned me about my background, my studies with Sommerfeld.  But on arrival he at once began with a central question about the philosophical foundation of the new quantum mechanics.  He pointed out to me that in my mathematical description the notion of ‘electron path’ did not occur at all, but that in a cloud-chamber the track of the electron can of course be observed directly.  It seemed to him absurd to claim that there was indeed an electron path in the cloud-chamber, but none in the interior of the atom.”  In this situation, of course, we [Heisenberg and Bohr] had many discussions, difficult discussions, because we all felt that the mathematical scheme of quantum or wave mechanics was already final.  It could not be changed, and we would have to do all our calculations from this scheme.  On the other hand, nobody knew how to represent in this scheme such a simple case as the path of an electron through a cloud chamber.” 

In March 1926, working in Bohr's institute, Heisenberg realized that the non-commutativity implies the uncertainty principle. This implication provided a clear physical interpretation for the non-commutativity, and it laid the foundation for what became known as the Copenhagen interpretation of quantum mechanics. Heisenberg showed that the commutation relation implies an uncertainty, or in Bohr's language a complementarity. Any two variables that do not commute cannot be measured simultaneously—the more precisely one is known, the less precisely the other can be known. Heisenberg wrote:

It can be expressed in its simplest form as follows: One can never know with perfect accuracy both of those two important factors which determine the movement of one of the smallest particles—its position and its velocity. It is impossible to determine accurately both the position and the direction and speed of a particle at the same instant.

In his celebrated 1927 paper, "Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik" ("On the Perceptual Content of Quantum Theoretical Kinematics and Mechanics"), Heisenberg established this expression as the minimum amount of unavoidable momentum disturbance caused by any position measurement, but he did not give a precise definition for the uncertainties Δx and Δp. Instead, he gave some plausible estimates in each case separately. In his Chicago lecture he refined his principle:

$\Delta x\,\Delta p\gtrsim h$

(A1)

Kennard in 1927 first proved the modern inequality:

$\sigma _{x}\sigma _{p}\geq {\frac {\hbar }{2}}$

(A2)

where ħ = h/2π, and σx, σp are the standard deviations of position and momentum. Heisenberg only proved relation (A2) for the special case of Gaussian states.

### Terminology and translation

Throughout the main body of his original 1927 paper, written in German, Heisenberg used the word "Ungenauigkeit" ("indeterminacy"), to describe the basic theoretical principle. Only in the endnote did he switch to the word "Unsicherheit" ("uncertainty"). When the English-language version of Heisenberg's textbook, The Physical Principles of the Quantum Theory, was published in 1930, however, the translation "uncertainty" was used, and it became the more commonly used term in the English language thereafter.

### Heisenberg's microscope

Heisenberg's gamma-ray microscope for locating an electron (shown in blue). The incoming gamma ray (shown in green) is scattered by the electron up into the microscope's aperture angle θ. The scattered gamma-ray is shown in red. Classical optics shows that the electron position can be resolved only up to an uncertainty Δx that depends on θ and the wavelength λ of the incoming light.

The principle is quite counter-intuitive, so the early students of quantum theory had to be reassured that naive measurements to violate it were bound always to be unworkable. One way in which Heisenberg originally illustrated the intrinsic impossibility of violating the uncertainty principle is by utilizing the observer effect of an imaginary microscope as a measuring device.

He imagines an experimenter trying to measure the position and momentum of an electron by shooting a photon at it.: 49–50

• Problem 1 – If the photon has a short wavelength, and therefore, a large momentum, the position can be measured accurately. But the photon scatters in a random direction, transferring a large and uncertain amount of momentum to the electron. If the photon has a long wavelength and low momentum, the collision does not disturb the electron's momentum very much, but the scattering will reveal its position only vaguely.
• Problem 2 – If a large aperture is used for the microscope, the electron's location can be well resolved (see Rayleigh criterion); but by the principle of conservation of momentum, the transverse momentum of the incoming photon affects the electron's beamline momentum and hence, the new momentum of the electron resolves poorly. If a small aperture is used, the accuracy of both resolutions is the other way around.

The combination of these trade-offs implies that no matter what photon wavelength and aperture size are used, the product of the uncertainty in measured position and measured momentum is greater than or equal to a lower limit, which is (up to a small numerical factor) equal to Planck's constant. Heisenberg did not care to formulate the uncertainty principle as an exact limit, and preferred to use it instead, as a heuristic quantitative statement, correct up to small numerical factors, which makes the radically new noncommutativity of quantum mechanics inevitable.

## Critical reactions

The Copenhagen interpretation of quantum mechanics and Heisenberg's Uncertainty Principle were, in fact, seen as twin targets by detractors who believed in an underlying determinism and realism. According to the Copenhagen interpretation of quantum mechanics, there is no fundamental reality that the quantum state describes, just a prescription for calculating experimental results. There is no way to say what the state of a system fundamentally is, only what the result of observations might be.

Albert Einstein believed that randomness is a reflection of our ignorance of some fundamental property of reality, while Niels Bohr believed that the probability distributions are fundamental and irreducible, and depend on which measurements we choose to perform. Einstein and Bohr debated the uncertainty principle for many years.

### The ideal of the detached observer

Wolfgang Pauli called Einstein's fundamental objection to the uncertainty principle "the ideal of the detached observer" (phrase translated from the German):

"Like the moon has a definite position" Einstein said to me last winter, "whether or not we look at the moon, the same must also hold for the atomic objects, as there is no sharp distinction possible between these and macroscopic objects. Observation cannot create an element of reality like a position, there must be something contained in the complete description of physical reality which corresponds to the possibility of observing a position, already before the observation has been actually made." I hope, that I quoted Einstein correctly; it is always difficult to quote somebody out of memory with whom one does not agree. It is precisely this kind of postulate which I call the ideal of the detached observer.

• Letter from Pauli to Niels Bohr, February 15, 1955

### Einstein's slit

The first of Einstein's thought experiments challenging the uncertainty principle went as follows:

Consider a particle passing through a slit of width d. The slit introduces an uncertainty in momentum of approximately h/d because the particle passes through the wall. But let us determine the momentum of the particle by measuring the recoil of the wall. In doing so, we find the momentum of the particle to arbitrary accuracy by conservation of momentum.

Bohr's response was that the wall is quantum mechanical as well, and that to measure the recoil to accuracy Δp, the momentum of the wall must be known to this accuracy before the particle passes through. This introduces an uncertainty in the position of the wall and therefore the position of the slit equal to h/Δp, and if the wall's momentum is known precisely enough to measure the recoil, the slit's position is uncertain enough to disallow a position measurement.

A similar analysis with particles diffracting through multiple slits is given by Richard Feynman.

### Einstein's box

Bohr was present when Einstein proposed the thought experiment which has become known as Einstein's box. Einstein argued that "Heisenberg's uncertainty equation implied that the uncertainty in time was related to the uncertainty in energy, the product of the two being related to Planck's constant." Consider, he said, an ideal box, lined with mirrors so that it can contain light indefinitely. The box could be weighed before a clockwork mechanism opened an ideal shutter at a chosen instant to allow one single photon to escape. "We now know, explained Einstein, precisely the time at which the photon left the box." "Now, weigh the box again. The change of mass tells the energy of the emitted light. In this manner, said Einstein, one could measure the energy emitted and the time it was released with any desired precision, in contradiction to the uncertainty principle."

Bohr spent a sleepless night considering this argument, and eventually realized that it was flawed. He pointed out that if the box were to be weighed, say by a spring and a pointer on a scale, "since the box must move vertically with a change in its weight, there will be uncertainty in its vertical velocity and therefore an uncertainty in its height above the table. ... Furthermore, the uncertainty about the elevation above the Earth's surface will result in an uncertainty in the rate of the clock," because of Einstein's own theory of gravity's effect on time. "Through this chain of uncertainties, Bohr showed that Einstein's light box experiment could not simultaneously measure exactly both the energy of the photon and the time of its escape."