![]() |
Qunatum theory has its foundation in the uncertainty principle, which states in mathematical form : x px > , where denotes the uncertainty, x is the position of the point mass m along the x-axis, px = m vx is the momentum along the x-axis, vx is the velocity along the x-axis, and = h/2 = 1.054x10-27 erg-sec (Figure 01). A similar relation exists for the y and z components, and for the time t and energy E (including the rest mass energy mc2), e.g., t E > . The uncertainty
|
Figure 01 Uncertainty Principle |
principle seems to be applicable up to a size of about 10-7 cm - the size of the C60 buckyball. In case of macroscopic objects with larger size and heavier mass, the uncertainties in position and time becomes very small, classical physics is applicable once more. |
---------- (1),
---------- (2),
---------- (3a),
---------- (3b).
(x,t), e.g.,
[- x
+
(x)]
= i
---------- (4)
is called wave function. It is interpreted as the probability of finding the particle at (x,t) and thus a wave property to a particle. Dynamic of the wave function is to be determined from the energy equation for a free particle :
2/2m)
= i

---------- (6)
(x,t)=A
(x)e-iEt/
, where the solution for t has been evaluated as an oscillating exponential function by virtue of the imaginary index, otherwise it will be an exponential function not in agreement with observation. Eq.(6) can then be reduced to an ordinary differential equation of the form :
= -(2mE/
2)
---------- (7)
taking the role of X, x replacing t, and the force constant K = (2mE/
2). Thus the solution of Eq.(7) is another oscillating function :
= eikx ---------- (9)
. The final solution can be written as :
(x,t) = A e-i(
t-kx) ---------- (10)
=E/
. The integration constant A is determined by demanding that the probability of finding the particle within the extent of length L is 1, i.e., 
*
dx = A2L = 1 ---------- (11)
*
= 1/L, which indicates that the chance of finding the particle at any point within becomes smaller as the extension getting larger.
1,
2, ... are the solutions with k equals to k1, k2, ... respectively, the combined solution
= A1
1 + A2
2 + ... is again a solution of the same differential equation, where the Ai's are interpreted in quantum theory as the probability amplitude of finding the particle in state ki's and (A1)*A1+(A2)*A2+...=1. This is known as superposition, which forms the base in explaining interference and the production of wave packets, .... in wave theory. For example, superposition in the following form will generate a wave packet:![]() | ---------- (12) |
to +
, the sum becomes integration over the k-space. In particular, when the weighting function is in the Gaussian form:![]() | ---------- (13) |
![]() |
where k is the spread of bell curve in k-space (it is related to the spread in x-space by the uncertainty relation k x 1), the wave packet can be evaluated explicitly :
| ||
Figure 02a Wave Packet |
equation (e.g., by Eqs.(6) and (7)) anymore, and thus the break down (or collapse) of the superposition. |
![]() |
![]() |
the nature of wave appears as soon as the electrons are allowed to pass through both slits (even down to only one electron at a time). The experiment also shows that the properties of particle and wave cannot be displayed simultaneously. Mathematically, there are four terms in the double slits experiment - two "separate-slit" terms from slit 1 and 2, and two "interference" term between the waves from slit 1 and 2, i.e., P12 = 1* 1 + 2* 2 + 1* 2 + 2* 1.
|
Figure 02b Two Slits Exp. |
Figure 02c Three Slits Exp. [view large image] |
It is the latter two terms that produce the interference pattern, This is the Born's rule - a key law in quantum mechanics proposed in 1926. In plain language, it states that interference occurs in pairs of possibilities, higher order terms are ruled out. (see more in footnote) |
1*
2
3*
1 should be included in the three slits experiment (Figure 02c). Accordingly, an experiment was run in the late 2000's to check out the claim by subtracting the three slits patterns with the prediction from the Born's rule. The pattern is nullified within the 1% experimental error. Thus, the Born's rule has been vindicated for now. But the verdict is not final, there is room to improve the experimental uncertainty and the theoretical calculation should be reviewed for magnitude of the correction.![]() | ---------- (16) |
![]() | ---------- (17) |
2[c2
-
]
= m2c4
---------- (18).
= 1 by changing the unit of length to [1/(3x1010)]sec, and the unit of time to 1/(1.054x10-27erg), then the unit of the 4-momentum, energy and mass becomes the inverse of length, while the charge, velocity, and angular momentum are dimensionless. The original value of c and
can always be re-introduced by dimensional consideration (see more in Natural Units). Thus, Eq.(18) can be re-written as :
-
]
= m2
, ---------- (19)
= A e-i(
t-kx) ---------- (20)
=E, and k2=E2-m2. Thus the 4-momentum can be written in the form : (kx, ky, kz, iE). By re-arranging the energy equation into the form : E2=m2+k2 and taking the square root :
(m2+k2)1/2 ---------- (21).
and
* fields respectively.
i) :![]() | ---------- (25) |
+ ax
)][i(
- ax
)] v = m2 v ---------- (26)| v = | ![]() | ---------- (27) |
![]() |
Eq.(26) can be split into two : [i( - ax )] v = m u ---------- (28a)[i( + ax )] u = m v ---------- (28b)where u is another two components column matrix similar to v. It is a mathematical property of the differential equations that more components will be created when transforms a second-order equation to two first-order equations, the components would be mixed between the first-order equations. Eqs.(28a) and (28b) are equivalent to the original equation written down by Dirac. The |
Figure 02d Dirac Equation [view large image] |
original form of the Dirac equation is obtained by adding and subtracting Eqs.(28a) and (28b) and then combining them into one first order differential equation in terms of the 4x4 gamma matrices operating on a 4-components single column matrix : |
![]() | ---------- (29) |
i and the 4-components field (usually known as Dirac spinor)
can be expressed in the forms :![]() | ---------- (30) |
![]() |
To get around the problem of negative energy, Dirac proposed an energy spectrum containing all electrons in the universe (see Figure 03). In addition to the normal positive-energy spectrum, it also contains the negative-energy variety, which spans the spectrum from -m0c2 down to negative infinity. All the negative-energy levels are filled, thus the positive-energy particle is inhibited from transition into these lower energy states bypassing the unobserved phenomena of falling into the negative energy levels. Thus, the existence of negative energy has no observable effect in the real world. Only when there is enough energy available, e.g., when E 2moc2, a real particle and anti-particle pair with positive-energy can be created from this unseen sea of negative-energy particles. The particle is the electron originally resided in the negative energy region, while the anti-particle (positron) can be interpreted as the hole in the vacated energy level acquiring a mass mo. The law of charge conservation demands that this anti-particle carries a positive charge.
|
Figure 03 Negative Energy [view large image] |
Many interesting things occur when the electron meets the hole (see Figure 03). Over the past three decades, the use of e+e- collisions to probe the vacuum have yielded a great deal of information about the nature of the strong, weak, and electromagnetic interactions, and have played a major role in establishing the Standard Model of elementary particles (see more in "Quantum Vacuum"). Positron was first detected in 1932 by C. Anderson. |
![]() | ---------- (31) |
![]() | ---------- (32) |
C = ei
C
* ---------- (33)
(1) to
(4), and
(2) to
(3) (in Eq.(31)) as expected for the
C's to be the negative energy solutions. The spinors here exclude the exponential ei(px-Et), and
is a phase factor (0 and
respectively for the examples above).![]() |
orientations in the magnetic field. The fine structure of the hydrogen atom (1 energy level splits into 2) further determined that the spin of the electron is 1/2. In the presence of a magnetic field it is the z component which aligns with the field direction having a quantized value of ![]() /2 while the entire magnetic moment precesses around the field with a magnitude S = [(s+1)s]1/2 (Figure 04). The expression for the magnetic moment derived in quantum theory has the form: M = (e/m)s ---------- (32) |
Figure 04 Electron Spin [view large image] |
where s = ( /2)(ax + ay + az), indicating that the ai's are related to the spin of the electron. |
![]() |
Eq.(35a) describes a massless particle with direction of spin opposite to the direction of motion (left-handed helicity), while Eq.(35b) describes a massless particle with spin pointing to the same direction of motion (right-handed helicity). This kind of 2-components spinor is often referred to as Weyl spinor. They used to be associated with neutrino (only left-handed neutrino and right-handed anti-neutrino exist in nature) when it was considered as massless. The Weyl spinor has also been applied to the formulation of the Standard Model with uL participated in both the electromagnetic and weak interactions, while uR involves with only the electromagnetic interaction. Both the electron and neutrino are considered to be massless originally, only the electron acquires mass via interaction with the Higgs field in that model. The Weyl spinor also found its way into the theory of supersymmetry, which insists that every boson have a fermion partner (in the form of Weyl spinor). |
Figure 05 Neutrino Mass |
When neutrino has mass, its speed would always be lower than the speed of light, theoretically an observer can move in a speed faster than the left-handed neutrino, overtakes this neutrino and sees a right-handed anti-neutrino (Figure 05). That's why the two Weyl spinors mix in case of particle with mass as shown in Eqs.(28a) and (28b). |
[(r1 - ir2)/2]1/2 ---------- (38a)
[(-r1 - ir2)/2]1/2 ---------- (38b)![]() |
For example, let's take R=(1, i, 0), then S=(1, 0) or S=(-1, 0). If R is rotated by an angle such that R => e-i R around r3, then S => e-i /2S around s1. That is, a rotation of R by an angle translates to a rotation of S by an angle /2. It is this peculiar prpperty of spinor that makes it so different from the other kinds of field. Particles associated with spinor are called fermion to distinguish themselves from the other types called boson (Figure 06).
|
Figure 06 Different Kinds of Spin [view large image] |
This special property of rotation for spinor is also applicable to the internal (invisible) rotation in gauge transformation. |
E term in Eq.(40), and thus put the equations into a consistent set which implied new physical phenomena, at that time unknown but subsequently verified in all details by experiments. In terms of the electric field E and magnetic field B the Maxwell's equations are four first order differential equations :
, the i, j, k, are unit vectors along the x, y, z axis respectively, J is the current density,
is the charge density, and c is the velocity of light. For regions of space where charge and current are absent, the right-hand sides vanish in all the equations. These first order differential equations can be translated into second order equations more suitable for describing the electromagnetic wave. By virtue of the mathematical identity 
(
A)=0, Eq.(41) can be defined by the vector potential A:
A ---------- (43)
(
)=0, we can define E in terms of A and the scalar potential
as :
- (1/c)
A ---------- (44)
2
+ (1/c)
(
A) = 0 ---------- (45)
(
) A = -
2A +
(
A), we get another second order differential equation :
2A - (1/c2)
A -
(
A + (1/c)
) = 0 ---------- (46)
can be added without changing anything by virtue of the identity
(
) = 0. It means that the Maxwell's equations are unchanged by the transformation :
A' = A + 
---------- (47)
' =
- (1/c)
---------- (48)
such that
2
= -
A leaving 
A' = 0. This condition is called "transverse gauge", which helps to formulate the Maxwell's equations for electromagnetic wave in free space. It follows that :

A' = 0.
' = 0 in free space without the presence of any source, the Maxwell's equations for electromagnetic wave become :
2A' - (1/c2)
A' = 0 ---------- (49)
A' ---------- (50)
A' ---------- (51)
2E - (1/c2)
E = 0 ---------- (50a)
2B - (1/c2)
B = 0 ---------- (51a)
/L (n =
1,
2, ...) with k
x = k
x -
t = k
x - |k|ct. The ck coefficients is linked to the spectrum associated with the electromagnetic radiation. It will assume the role of q-number in second quantization, which endows particle attributes to the wave. While the vector potential A has dimension (erg/cm)1/2, the dimension for the ck is erg1/2-cm.
![]() |
![]() |
For coherent EM wave propagating along the +z axis (Figure 07a), the two transversal electric fields can be expressed as : Ex = Ex0 cos( t - kz + x) ---------- (53a)Ey = Ey0 cos( t - kz + y) ---------- (53b)where Ex0, Ey0 are amplitudes, and x, y are the phase angles of the x, y components respectively (Figure 07b). These equations can be simplified somewhat if the view is facing the x-y plane fixed at z=0 (such as the view in Figure 07c), and by using the relative phase angle = x - y : |
Figure 07a Polari-zation |
Figure 07b Phase Angles |
Ex = Ex0 cos( t) ---------- (53c)Ey = Ey0 cos( t - ) ---------- (53d) |
= tan-1(Ey/Ex) = tan-1{(Ey0/Ex0)[cos(
) + tan(
t)sin(
)]} ---------- (53e)
as shown in Table 01 below.Phase Angle ![]() |
tan( ) |
![]() |
Polarization |
|---|---|---|---|
| 0o | 0 | 0o | Linear along x-axis |
| 0o | ![]() |
90o | Linear along y-axis |
| 0o | 1 | 45o | Linear at 45o |
| 45o | (1/2)1/2(Ey0/Ex0)[1 + tan( t)] |
Right-handed rotation | Elliptical or circular |
| 90o | (Ey0/Ex0)[tan( t)] |
Right-handed rotation | Elliptical |
| 90o | tan( t) |
t |
Right-handed Circular for Ey0=Ex0 |
| 180o | 0 | 0o | Linear along x-axis |
| 180o | -![]() |
-90o | Linear along y-axis |
| 180o | -1 | -45o | Linear at -45o |
| -90o | -(Ey0/Ex0)[tan( t)] |
Left-handed rotation | Elliptical |
| -90o | -tan( t) |
- t |
Left-handed Circular for Ey0=Ex0 |
![]() |
Table 01 shows that for linear polarization the angle is constant with Ex and Ey varying in such a way that its ratio is still a constant. For circular polarization the length of the field |E| is constant, but changes with a frequency determined by . For elliptical polarization even |E| is not a constant. Another way to check out the polarization is to view the animation provided by Amanogawa in Figure 07c. Table 02 shows the types of polarization associated with different sets of parameters in the application. |
Figure 07c Polarization of EM Wave [see animation] |
| Amplitude x | Phase x | Amplitude y | Phase y | Polarization |
|---|---|---|---|---|
| 1.0 | 0o | 1.0 | 0o | Linear at 45o |
| 1.0 | 0o | 0.0 | 0o | Linear along x-axis |
| 0.0 | 0o | 1.0 | 0o | Linear along y-axis |
| 1.0 | 45o | 0.0 | 0o | Linear along x-axis |
| 1.0 | 45o | 1.0 | 0o | Right-handed elliptical |
| 0.5 | 0o | 1.0 | 0o | Linear at 63.4o |
| 0.5 | 30o | 1.0 | 30o | Linear at 63.4o |
| 1.0 | 0o | 1.0 | 90o | Left-handed circular |
| 0.2 | 0o | 1.0 | 90o | Left-handed elliptical |
= 0, the direction of oscillation
depends on the ratio Ey0/Ex0. Let's take Ey0=Ex0 to simplify the formulas, then the polarization vector can be expressed as :
|ex> + sin
|ey> ---------- (53f)
and |ey> =
are the unit vector in the x (upper component) and y (lower component) directions respectively. It is similar to the Jones vector in classical electrodynamics and the polarization vectors in Eq.(52). Since the unit vectors satisfies the orthogonality relation : < ei|ej> =
(i-j) for (i, j) = (x or y), < e|e> = 1, if it is interpreted as the total probability, then cos2
and sin2
can be interpreted as the probability in polarization state |ex> or |ey> respectively.
= 90o or -90o,
= +
t (right-handed circular polarization) or -
t (left-handed circular polarization). The polarization vector in Eq.(53f) can be expressed in the form :
t/
) |eL> + (ei
t/
) |eR> ---------- (53g)
=
/
---------- (53h)
=
/
---------- (53i)
---------- (53j)
---------- (53k)
(i-j) for (i, j) = (L or R), < e|e> = 1. If it is interpreted as the total probability, then
there are 50% probability for each of the circular polarization state in Eq.(53g). In order to obtain a definite circular polarization, we have to pick either +
t or -
t but not both. A linear polarization will be produced if we keep both +
t and -
t in Eq.(53g). For example, the purely right-handed rotating polarization vector is described by :
t |eR> ---------- (53l)
c) ---------- (53m)
U/|
| ---------- (53n)
) is energy of the wave per unit volume, the + sign is for right-handed circular polarization, the - sign for the left-handed one. An experiment back in 1936 had demonstrated conclusively that circular electromagnetic wave does carry angular momentum.
/V, where N is the number of photons in volume V, and the energy has been quantized to 
. Thus, the angular momentum of a photon can be parallel or anti-parallel to the z-axis :
---------- (53o)![]() |
This formula shows that angular momentum of electromagnetic wave is related to the right- or left-handed polarization, which becomes photon spin of + or - respectively when goes over to quantum theory, i.e., the photon is spin 1 particle. It is said to have right-handed helicity if the spin aligns with the direction of motion (+ ), or left-handed helicity in opposite direction (- ) as shown in Figure 07d. As usual in quantum theory, the photon would be in a superposition of both the right- and left-handed helicity states, it is only when a measurement is performed that it assumes a definite right- or left-handed helicity state (see Copenhagen Interpretation). |
Figure 07d Photon Helicity [view large image] |
![]() | ----------(54) |
, while the "canonical momentum" is just
= 
in this case. Then the rule for second quantization becomes :
- 
= i
---------- (55)![]() | ---------- (56) |
to generate:
= nk|nk
= (nk+1)1/2|nk+1
= nk1/2|nk-1
![]() | ---------- (57) |
![]() | ---------- (58) |
= |1(k),1(k')
= ck'*ck*|0
= |1(k'),1(k)
---------- (59)
= |2(k)
---------- (60)![]() | ---------- (61) |
![]() | ---------- (62) |
)ij = (A*)ji, the b's assume the same role as the c's except that the state vector can have only 0 or 1 particle for a given state (p, r). For example, exchange of states for two particles produces a minus sign in the state vector :
= |1(p),1(p')
= -bp'*bp*|0
= -|1(p'),1(p)
---------- (63)
= |2(p)
= -bp*bp*|0
= -|2(p)
---------- (64)
= 0. Therefore, two spin 1/2 particles cannot occupy the same state - a characteristics of Fermi-Dirac statistics.![]() | ---------- (65) |
, where g is the coupling constant.
![]() |
Unlike non-relativistic quantum theory in which the main concern is about energy levels, the quantum field theory works mostly with transition probability from an initial state i to the final state f, e.g., S = f
| HI | i (known as the S matrix). Since the equation cannot be solved analytically with the addition of interaction, perturbation theory is employed to do the job
|
Figure 08 Nucleon Scattering [view large image] |
approximately. Each term in the perturbation expansions can be represented by a Feynman diagram, which has a mathematical correspondence to compute the S matrix. Figure 08 shows a simple Feynman diagram for nucleon-nucleon scattering via the exchange of virtual pion. |
![]() |
![]() |
e(2 i r/ )/r, where r is the distance from the point in question to the element of surface on the wave front, and is the wavelength. This spread of wave is somewhat similar to the pattern (in the wall behind) generated by firing a machine gun at an iron plate with two slits on it. Most of the bullets through the slits would align with each, some may be scattered sideways by hitting the edges although the chance becomes rarer as deviation getting further. One big difference is that bullets don't interfere with each other (story ascribed to Feynman). Anyway, assuming that the slit is very narrow, the two waves from slit 1 and 2 can be represented as: 1 = Ae(2 i r1/ )/r1 ---------- (66a) 2 = Ae(2 i r2/ )/r2 ---------- (66b) |
Figure 09 Double-slit Exp. |
Figure 10 Probabilities [view large image] |
where A is the normalization constant, r1, and r2 are the distance from the slits to the screen as shown in Figure 09, which lays out the x-y plane only, while the z direction is perpendicular to the |
r2 and they are lumped together as r (see Figure 09). However the tiny difference d sin
(in the interference term) is crucial in producing the pattern (see Figure 09 again). Thus, the probabilities can be expressed as :
1*
1
2*
2
(A2/r2) ---------- (67a)
1*
2 +
2*
1
(2A2/r2) cos[2
(dsin
/
)] ---------- (67b)
) =
1*
1 +
2*
2 +
1*
2 +
2*
1 = (4A2/r2) cos2[
(dsin
/
)] ---------- (68)
from -
/2 to +
/2 :
cos2
cos2[
(dsin
/
)] d
---------- (68a)
quoted below, the integral is equal to about 0.390, which gives A = 0.566D. For rapid oscillation of the interference term, i.e., when d >>
, the integral approaches a value of
/8, giving A2 = D2/
or A = D/
1/2 = 0.564D. Then it shows clearly that P12(
) can be interpreted as the probability per unit radian.
= 0.001 cm, d = 0.005 cm, and D = 10 cm. The "separate terms" together is plotted in green, the "interference terms" in blue, and the total probability is in red. The only running variable in the numerical computation is sin
, which varies from -0.8 to +0.8 in steps of 0.001. Since cos2
= 1 - sin2
, tan
= sin
/cos
, from which we can compute r = D/cos
, x = D tan
. It is rather obvious from Eq.(68) that maxima occur at :
= n
, where n = 0, 1, 2, 3, ...
= (n/2)
, where n = 1, 2, 3, ...