Quantum Mechanics - Foundations



Back to Index

THIS PAGE IS WORK IN PROGRESS - It is becoming an elemntary text on quantum mechanics! WHICH IS DEFINITELY NOT WHAT WAS INTENDED.

1.0      Mathematical Prelimenaries

This section gives a very informal overview of the some of the mathematics required to understand Quantum Mechanics. If you do not feel comfortable with any of the topics on this page, it probably makes sense to go swat up on those topics elsewhere and come back at a later date.

1.1 Vector Space

Intuitively a vector space is a space of "arrows". Vectors can be added and scaled.

More formally... for all vectors x, y, z and complex numbers a, b:

x + y = y + x
(x + y) + z = (x + y) + z
x + 0 = x = 0 + x
x + (-x) = 0
a(bx) = (ab)x
(a + b)x = ax + bx
a(x + y) = ax + ay
1.x = x

Definition: A set of vectors {e1, ... ,en} forms a basis for the vector space if:

  • For every vector x, there are numbers x1, ... ,xn such that x = x1e1 + ... + xnen
  • There is no subset of {e1, ... ,en} which satisfies the above condition.

The number of vectors required to form a basis for a vector space is the called the dimension of the vector space.

1.2 Linear Operators

A linear operator is a mapping A: V ® V such that A(ax + by) = a.Ax + b.Ay

1.3 Inner Products

The formal definition is significantly different from intuitive concept. Inner products allows us to think about "the angle between vectors" in the space. In particular

<x|y> = ||x|| ||y|| cos q
Þ cos q = <x|y>/||x|| ||y||

where ||.|| is the norm (length) of a vector. If the "angle" between vectors is 90º, they are said to be orthogonal. I.e. vectors are orthogonal if and only if <x|y> = 0 (cos q = 0, intuitively q = 90 degrees)

1.4 Norm

The norm of a vector corresponds to it's length. A inner product defines a norm by ||x|| = sqrt( <x|x> ). Formally, a norm has the following properties

||x|| >= 0
||x|| = 0 iff x = 0
||a.x|| = |a|.||x||
||x + y|| < = ||x|| + ||y||

1.5 Completeness

This is a technical requirement - which allows us to use calculus within the space. The space should not have "holes" in it, created by "missing" points. Formally, every cauchy sequence must converge. The requirement can be understood by looking at number sequences. Irrational numbers "plug" the holes in the real number line "between" fractions. If the distance between the numbers in a sequence becomes less and less, then the sequence must converges to a number in the space. E.g.

a0 = 1
a1 = 1 + 1/2!
a2 = 1 + 1/2! + 1/3!
a3 = 1 + 1/2! + 1/3! + 1/4!

The sequence an consists of rational numbers and converges to e = 2.71828... but e is NOT a rational number. The space of rational numbers is therefore not complete. e is however a real number - the space of real numbers is complete.

1.6 Metric spaces

A metric space is equipped with a distance function, or metric, that gives the distance between any 2 points in the space. Formally, there exists a function, dist(.,.), such that

dist(x,y) = dist(y,x)
dist(x,y) >=0
dist(x,y) = 0 iff x=y
dist(x,y) + dist(y,z) >= dist(x,z)

1.7      Hilbert Spaces

A Hilbert Space H is a vector space equipped with an inner product (denoted <.|.>), resulting norm (denoted ||.||) such that it is a complete metric space.

1.8      Dual Spaces

The dual space V* of a vector space V is the vector space created by the linear functionals defined on V.
A linear functional is a mapping f: V ® F where F is the set of scalars (real or complex numbers) and f(ax + by) = a.f(x) + b.f(y).

V* is isomorphic (looks like) V. If {e1, ... ,en} forms a basis for V, then {e1, ... ,en} forms a basis for V* where ei(ek) = dik

  • V* has the same dimension as V.
  • Each vector x = S xiei is associated with a (co)vector x* = S xiei in V*.
  • If {x1, ... ,xN} is a basis for V, then {x*1, ... ,x*N} is a basis for V*
  • Vectors in V are called contravariant vectors, and vectors in V* are called covariant vectors.

1.9      Tensor Spaces

Quantum Mechanics doesn't require a full formal treatment of Tensor Spaces; the definition given here is somewhat informal. The tensor product of 2 vector spaces V and W is a second vector space, denoted by V Ä W, constructed from V x W with addition and multiplication defined so

x Ä (y + z) = x Ä y + x Ä z
(x + y) Ä z = x Ä x + y Ä z
a.(x Ä y) = (ax) Ä y = x Ä (ay)

The tensor product is associative (the separate spaces are actually isomorphic).

(x Ä y) Ä z = x Ä ( y Ä z) = x Ä y Ä z
  • If V has dimension N and W has dimension M, then V Ä W has dimension N*M.
  • If {x1, ... ,xN} is a basis for V, and {y1, ... ,yM} is a basis for W, then {xi Ä yk | i=1..N, k=1..M} forms a basis for V Ä W.

If x Ä y is an element of V Ä V*, and the contraction of x Ä y is y(x).

Note for physicists

Traditionally physicists have defined tensors in terms of how the components of a tensor transform with a change of basis; the indices of components of (contravariant) vectors in V are denoted as superscripts, while indices of components of (covariant) vectors in V* are denoted as subscripts. For example, if x = S xiei Î V, and y = S ykek Î V*, the tensor product x Ä y = S xiei Ä S ykek = S S xi yk eiÄ ek would be denoted xi yk.

If x = S xiei Î V, and y = S ykek Î V*, then the contraction of x Ä y = S S xi yk ek(ei) = S xi yi would be denoted xi yi.

The mathematical notation is much more powerful, and we will use the coordinate approach sparingly

1.10    Further theorems

Definition: A hermitian operator A is a linear operator such that A = A* where A* is the transpose of the complex conjugate.

  • If A is a hermitian operator then <Y|Aj>. = <j|AY>*
  • The eigenvalues of a hermitian operator are real (not complex).
  • The eigenvectors of a hermitian operator are orthogonal.

Definition: A unitary operator U is a linear operator with the property UU* = 1 = U*U

  • If U is a unitary operator then <UY|UY> = <Y|Y>.

if A&B are linear operators, then [A, B] = 0 if and only if A&B have the same eignevectors.

 

 

 

2.0     Standard Postulates of Quantum Mechanics

2.1   Postulates

The standard postulates of QM originate primarily from John Von Neumann's book Mathematical Foundations of Quantum Mechanics. It's still in print, and well worth the read.

Interpretations of QM have to be able to "explain" these postulates, or derive them if an alternative set of postulates are used.

(1) The possible states of a system are described by a Hilbert space; at any instant the state of the system will be described by a point in that Hilbert Space.

Why a Hilbert Space? A Hilbert Space H is a vector space equipped with an inner product (denoted <.|.>), resulting norm (denoted ||.||) such that it is a complete metric space. That raises the questions:

  1. Why a vector space? A very, very good question.
  2. Why require an inner product?
  3. Why require a norm?
  4. Why require completeness? - Allows us to use calculus within the space.
  5. Why a metric space? - Allows us to use calculus within the space.

(2) An observable (physically measurable quantity A) corresponds to a linear hermitian operator (typically denoted A)on H.

  1. Why should operators be linear? Another good question
  2. Why hermitian? The requirement that the operator is hermitian ensures that measurements always return real numbers (See below).

(3) If a system is in a state Y, then the expected value of an observable A is given by

E(A|Y) = <Y|AY> / <Y|Y>

Y is an eignevector of A if

AY = a.Y

Why are measured states eignevectors and measured values eigenvalues? There is no mystery here: In general, a measurement of an observable A on a system in a state Y results in 2 things: (1) the system is described by a new state F, where (2) F is associated with the measured value A=a. More formally,

MeasurementOf(A, Y) ® F where AssociatedWith(F, A=a)

Furthermore, if the initial state Y is associated with a measured value A=a, then MeasurementOf(A, Y) ® Y The use of eignvalues and eigenvectors is simply a elegant way of associated a value with a vector (provided we assume the previous algebraic description).

2.2    Consequences

  1. AY = aY Þ a measurement of quantity A will yield the result a with probabaility = 1 (not quite the same as certainty).
  2. .. all of quantum mechanics...

2.3    Notation

Bra-ket notation

Dirac Bra-ket notation. Vectors in H looks lilke |x>. Vectors in H * look like by <y| .

The tensor product of a H x H * looks like |x><y|. The tensor product of a H * x H looks like <y|x>, which of course becomes notationally indistinguishable from the inner product <y|x> created by the natural contraction.

Bracket notation is particularly useful when describing states which are not particularly mathematical in nature. E.g |cat is dead >.

 

3.0      Heisenberg Uncertainty Principle

In 1929, Howard Robertson showed that if A, B and C are operators, [A,B] = iC, sA is the standard deviation of expected distribution of measurements of A, then

sA.s B ³ 1/2 < C >

In the case where A = x (a position operator) and B = p (a momentum operator), [A,B] = [x,p] = ih/2 p. Robertson's theorem implies

sx.sp ³ h/4 p

which is Heisenbegr's Uncertainty Principle. The proof is sufficently illuminating that it is reproduced.

Cauchy-Schwartz Inequality: For any vector Y, <AY|BY>2 <= ||AY||2 ||BY||2

|<AY|BY>|  ³  | im(<AY|BY>) |2 = | 1/2(<AY|BY> - <AY|BY>*) =  1/2 |<AY|BY> - <BY|AY>| = 1/2 |<ABY|Y> - <BAY|Y>|  = 1/2 |<(AB -  BA)Y|Y>| = 1/2 EY([A,B])

Þ ||AY||2 ||BY||2 ³ <AY|BY> ³ 1/4 EY([A,B])2

Make the substitution A -> A - <A> and B -> B - <B>, then ||AY||2 =sA,Y

ÞsA,YsB,Y ³ 1/2 | EY([A,B]) |

 

4.0      Pure and Mixed States

A point Y in this space represents the best description possible of the system, and is sometimes called a "pure" state.

If the system could be in a number of possible (pure) states Y1, Y2, ... then there is added uncertainty and the system is described as being in a "mixed" state.

TODO: Origin of mixed states - e.g. particle sources, interaction of systems.

4.1      Mixed States

A Mixed state generally cannot be represented by a point in the state space, i.e. as a pure state.

Proof: Demonstrate by example. A electron source produces electrons with completely random spins. If a measurement is made, it is equally likely that the electron will be found in the spin up or down states, or spin right or spin left states. I.e. P(Yu) = P(Yd) = P(Yr) = P(Yl) = 0.5

Yr = (1/Ö2) Yu + (1/Ö2) Yd
Yr = (1/Ö2) Yu - (1/Ö2) Yd

Is the system in a well defined state? Suppose the system is in a pure state Y = a Yu + b Yd , then

P(Yu|Y)
= | <Yu|Y> |²
= | <Yu|aYu - bYd> |²
= | a<Yu|Yu> + b<Yu|Yu> |²
= a² | <Yu|Yu> |²
= a² = 0.5
Þ a = ±(1/Ö2)

Similarly P(Yd) = 0.5 so b = ±(1/Ö2) and therefore Y = (1/Ö2) Yu ± (1/Ö2) Yd. I.e. Y = ±Yr or Y = ±Yl.
But P(Yr|Y) = 0.5 so either P(Yr|Y) = 0.5 = P(Yr|±Yr) = 1.0 which is nonsense, or P(Yr|Y) = 0.5 = P(Yr|±Yl) = 0 which is also nonsense. The electron therefore cannot be described by a state vector Y.

Furthermore, once probability has been assigned to a basis, it has been assigned for each point in the space. I.e. It is no longer possible to independently assign probability to other points in the state space. In our example, once probability is assign to Yu and Yd, then it is also assigned to Yr and Yl, and any other point in the space.

P(Yr|mixed state)
= P(Yr|Yu)P(Yu|mixed state) + P(Yr|Yd)P(Yd|mixed state)
= P(Yr|Yu)0.5 + P(Yr|Yd)0.5
= (0.5 * 0.5) + (0.5 * 0.5)
= 0.5

4.2      The Density Operator

The density operator is the quantum version of a probability density function defined on the state space. Define the density operator r to be a member of H Ä H * such that

  • For a pure state Y,   r = Y Ä Y*. (It is often convenient to write r(x) = Y <Y|x>)
  • For a mixed state, where the system has probability pi of being in state |Yi>,

    r = å piYi Ä Yi*

Theorem: The probability that a system in a mixed state with be found to be in state Y is then given by

P(Y|mixed state) = <Y|rY>

Theorem:

E(A|mixed state) = tr(A.r)

Proof: TODO: ?

How do we tell if a density operator represents a pure state or not?
Theorem: r.r = r if and only if r = Y Ä Y* for some Y.

Proof: Let r = Y Ä Y*, then r.r = Y Ä Y*( Y Ä Y*) = Y Ä Y* = r
Let r = å piYi Ä Yi*, then r.r = å piYi Ä Yi* ( å pkYk Ä Yk* ) = å å pipk Yi Ä Yi* ( Yk Ä Yk*) = å å pipi Yi Ä Yi*
r.r = r Þ pipi = pi. Hence pi = 0 or pi = 1. Since å pi = 1, there can be at most one pi = 1. Therefore r = Y Äi Yi* for at most one i. QED.

4.3      Entanglement

Example 1: Independent particles

Y = Y1 Ä Y2 = 1/Ö2 (|up > + |down >) Ä 1/Ö2 (|up > + |down >)

r = Y Ä Y*

Þ r = (1/Ö2 (|up > + |down >) Ä 1/Ö2 (|up > + |down >) ) Ä (1/Ö2 (<up | + <down |) Ä 1/Ö2 (<up | + <down |) )

Example 2: Entangled particles (EPR)

r = 0.5 Yud Ä Y*ud + 0.5 Ydu Ä Y*du

r = 0.5 (|up > Ä |down >) Ä (<down | Ä <up |) + 0.5 (|down > Ä |up>) Ä (<down | Ä <up |)

The density operator r is invariant under the transformation d®u, u®d.

 

5.0      Information

Definition (Classical): The amount of information conveyed by a symbol is log2(p) bits where p is the probability that he symbol will occur. The entropy of random variable is -åpilog2(pi)

TODO: This page needs to be completely restructured.

Quantum Interpretations

 

6.0      Spin

S-G apparatus (follow Feynman).

Physics Background: Special Relativity, Maxwells Equations - links only.

 

7.0      Quantum Field Theory

TODO: Lagrangian, U(1), SO(3), Poncaire, SU(2).

Copyright (c) Shaun O'Kane, 2003, 2004. You are free to redistribute this work provided you give due credit to it's author.