Spin-orbit coupling

Spin-orbit coupling is the interaction between a particle’s spin angular momentum and orbital angular momentum. An electron orbiting around the nucleus ‘sees’ the nucleus circling it, just as a person on earth perceives the sun circling the earth while the latter orbits around the sun.

This apparent nuclear orbit creates a magnetic field B that exerts a torque on the electron’s spin magnetic dipole moment \mu_s, resulting in an additional term of \hat{H}_{so}=\sum_{i=1}^{n}A_i\hat{\boldsymbol{\mathit{S}}}_i\cdot\hat{\boldsymbol{\mathit{L}}}_i (where A_i=\frac{1}{2m_e^{\;2}c^{2}}\frac{1}{r_i}\frac{dV_i}{dr_i}) in the multi-electron Hamiltonian. To derive this term, we consider a 1-electron atom.

Let L=m_erv_{\perp} be the orbital angular momentum of the electron and I be the proton’s current loop, which generates a magnetic field of magnitude B=\frac{\mu_0I}{2r} given by the Biot-Savart law. Since \frac{1}{t}=\frac{v_{\perp}}{2\pi r}, we have

I=\frac{e}{t}=\frac{eL}{2\pi m_er^{2}}\; \; \; \; \; \; \; \; 259

Substitute eq259 in B, we have, B=\frac{\mu_0e}{4\pi m_er^{3}}L or \frac{\boldsymbol{\mathit{B}}}{\hat{\boldsymbol{\mathit{B}}}}=\frac{\mu_0e}{4\pi m_er^{3}}\frac{\boldsymbol{\mathit{L}}}{\hat{\boldsymbol{\mathit{L}}}}, where \hat{\boldsymbol{\mathit{B}}} and \hat{\boldsymbol{\mathit{L}}} are unit vectors. Since \boldsymbol{\mathit{B}} and \boldsymbol{\mathit{L}} point in the same direction, \hat{\boldsymbol{\mathit{B}}}=\hat{\boldsymbol{\mathit{L}}}=\hat{\boldsymbol{\mathit{M}}}. Multiplying both sides of \frac{\boldsymbol{\mathit{B}}}{\hat{\boldsymbol{\mathit{M}}}}=\frac{\mu_0e}{4\pi m_er^{3}}\frac{\boldsymbol{\mathit{L}}}{\hat{\boldsymbol{\mathit{M}}}} by \hat{\boldsymbol{\mathit{M}}}, we have \boldsymbol{\mathit{B}}=\frac{\mu_0e}{4\pi m_er^{3}}\boldsymbol{\mathit{L}}, which we substitute in eq65 (where \boldsymbol{\mathit{\mu}} is the spin analogue of eq61) to give U=-\frac{\gamma_e\mu_0e}{4\pi m_er^{3}}\boldsymbol{\mathit{S}}\cdot\boldsymbol{\mathit{L}}.

Substituting eq164 and \frac{1}{c^{2}}=\mu_0\varepsilon_0 in U yields

U=\frac{e^{2}}{4\pi\varepsilon_0 m_e^{\;2}c^{2}r^{3}}\boldsymbol{\mathit{S}}\cdot\boldsymbol{\mathit{L}}

For a 1-electron atom, V=-\frac{e^2}{4\pi\varepsilon_0r} and so

U=\frac{1}{ m_e^{\;2}c^{2}r}\frac{dV}{dr}\boldsymbol{\mathit{S}}\cdot\boldsymbol{\mathit{L}}\; \; \; \; \; \; \; \; 260

Eq260 can be written in terms of the Larmor frequency of the electron. From eq149, \omega_L=\frac{eB}{m_e}=\frac{e}{m_e}\frac{\mu_0e}{4\pi m_er^{3}}L=\frac{1}{m_e^{\;2}c^{2}r}\frac{dV}{dr}L. So, U=\frac{\omega_L}{L}\boldsymbol{\mathit{S}}\cdot\boldsymbol{\mathit{L}}. Swapping \omega_L with the Thomas precession rate \omega_T, we obtain the correction term of U_{tp}=-\frac{1}{2 m_e^{\;2}c^{2}r}\frac{dV}{dr}\boldsymbol{\mathit{S}}\cdot\boldsymbol{\mathit{L}}.

The total spin-orbit Hamiltonian is

The spin-orbit energy corresponds to the expectation value , where is the hydrogenic wavefunction. Since , we have . Substituting this and into gives:

For a given , we usually express the above equation as:

where is regarded as a constant.

For a multi-electron system,

The spin-orbit energy expression for a multi-electron system is very similar to eq261a if we assume Russell-Saunders coupling, where spin-orbit interactions are weak compared to the Coulomb interaction (). Here, the electrons’ orbital angular momenta couple to form the total orbital angular momentum separately from their spin angular momenta, which couple to form . Each electron in the system experiences a force due to its Coulomb interaction with the other electrons .

Consider electron at along the laboratory -axis and electron at on the -axis. The Coulomb force between electrons and acts in the direction , pointing downwards and to the right. The component of perpendicular to the -axis produces a torque on electron , where . Because the electrons are not symmetrically arranged at every instant, the resultant torque on electron due to all other electrons is generally non-zero. Since (see eq71), the individual orbital angular momenta are not conserved. By Newton’s third law, the torque exerted by electron on electron is equal and opposite to the torque exerted by on . When summed over all electrons, these internal torques cancel, giving

Thus, while the total orbital angular momentum is conserved, the individual are not. To satisfy , and at all times, the individual orbital angular momenta can change only in direction, leading to their precession about the fixed total orbital angular momentum . By the same reasoning, the individual spin angular momenta precess around . Since , these precessions occur on a much shorter timescale than the subsequent precessions of and around the total angular momentum .

To continue the derivation of the spin-orbit energy expression for a multi-electron system, we decompose the individual orbital and spin angular momenta into components parallel and perpendicular to and :

Under rapid precession, the time-averaged value of and vanish. So, the average value of is

Since and correspondingly , where and are unit vectors,

where .

Substituting eq261c into eq261b gives:

It follows that

If spin-orbit interactions become strong compared to the Coulomb interaction (), -coupling replaces Russell-Saunders coupling, and the spin-orbit Hamiltonian must be treated in the form of eq261b.

 

Next article: Perturbation theory
Previous article: Thomas precession
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Thomas precession

Thomas precession is a relativistic kinematic correction to the spin-orbit coupling Hamiltonian.

Consider an electron orbiting counter-clockwise around a nucleus at O along the path of a regular n-sided polygon in the xy-plane (see diagram above). At t=0, the nucleus sees the electron making a turn at (x_0,y_0) at an angle \theta=tan^{-1}\frac{y_1}{x_1}. At subsequent vertices, the nucleus sees the electron turning at the same angle before arriving back to (x_0,y_0).

In the rest frame of the electron (denoted by blue axes), (x_1,y_1) becomes \left ( \frac{x_1}{\gamma},y_1 \right ) due to length contraction along the direction of travel, and the electron sees itself turning at angle of \theta'=tan^{-1}\left ( \gamma\frac{y_1}{x_1} \right ) at (x_0,y_0), where \gamma=\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}. At subsequent vertices, the electron sees the same turning angle before arriving back to (x_0,y_0). Substituting tan\theta=\frac{y_1}{x_1} in tan\theta'=\gamma\frac{y_1}{x_1}, we have tan\theta'=\gamma tan\theta. In the limit of large n, the polygon approaches a circle with each turning angle being very small. Using the small-angle approximation, \theta'=\gamma\theta. According to the nucleus and the electron, each turn corresponds to \theta=\frac{2\pi}{n} and \theta'=\frac{2\pi\gamma}{n} respectively. So, after completing a full orbit, the electron sees itself turning an angle of \theta'_T=2\pi\gamma.

In the rest frame of the nucleus (denoted by red axes), the orientation of the spin axis of the electron S remains unchanged (just like the spin axis of the earth being constant from the sun’s perspective). However, in the rest frame of the electron, the spin axis of the electron appears to rotate clockwise. After a completing a full orbit, the change in the orientation of S is \Delta\theta'=2\pi\gamma-2\pi. Since the rate of precession is defined as the rate of change in orientation of the rotational axis of a rotating body, the rate of \Delta\theta', called the Thomas precession, is

\omega_T=\omega(\gamma-1)\; \; \; \; \; \; \; \; 255

where \omega_T=\frac{\Delta\theta'}{t}, \omega=\frac{2\pi}{t} and t is the period of the orbit.

Substitute a=\frac{v_{\perp}^{\;2}}{r} and eq58, where \omega=\frac{v_{\perp}}{r}, in eq255 (note that the radius is perpendicular to the path of travel and is not contracted, and therefore common to both nucleus and electron), \omega_T=\frac{v_{\perp}}{r}(\gamma-1)=\frac{v_{\perp}a}{v_{\perp}^{\;2}}(\gamma-1). Since v\ll c, we have \left | -\frac{v_{\perp}^{\;2}}{c^{2}} \right |<1. Let x=-\frac{v_{\perp}^{\;2}}{c^{2}} in the binomial series (1+x)^{-\frac{1}{2}}=1-\frac{x}{2}+\frac{3x^{2}}{8}-\cdots. We have \omega_T\approx \frac{v_{\perp}a}{v_{\perp}^{\;2}}\left ( 1+\frac{v_{\perp}^{\;2}}{2c^{2}}-1 \right ) or

\omega_T= \frac{v_{\perp}a}{2c^{2}}\; \; \; \; \; \; \; \; 256

Substitute F=m_ea=-\frac{dV}{dr} and eq59, where L=rm_ev_{\perp}, in eq256

\omega_T= -\frac{1}{2m_e^{\;2}c^{2}r}\frac{dV}{dr}L\; \; \; \; \; \; \; \; 257

 

Next article: Spin-orbit coupling
Previous article: Why does a current flowing through a wire generate a magnetic field?
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Length contraction

Length contraction is the shortening of the length of an object moving relative to an observer, compared to its proper length, which is the length measured by an observer at rest relative to the object. Consider a moving train with a clock, which consists of a photon bouncing horizontally between two mirrors A and B (see diagram below).

For observer X, who is at rest on a platform and seeing the passing train, the total distance travelled by the photon from A to B in t_1 (indicated by the top purple arrow) is

ct_1=L+ut_1\; \; \; \; \; \; \; \; 250

where c is the speed of light, L is length of the clock according to X, and u is the speed of the train.

When the photon returns from B to A in t_2, A would have travelled a distance of ut_2. Thus, the distance travelled by the returning photon from B to A (indicated by the bottom purple arrow) is

ct_2=L-ut_2\; \; \; \; \; \; \; \; 251

The total time taken is t=t_1+t_2. Substituting eq250, where t_1=\frac{L}{c-u}, and eq251, where t_2=\frac{L}{c+u}, in t, yields

t=\frac{2L}{c}\frac{1}{1-\frac{u^{2}}{c^2}}\; \; \; \; \; \; \; \; 252

For observer Y, who is on the train, the total time t_0 (or proper time) measured by him is t_0=\frac{2L_0}{c}, where L_0 is called the proper length, which is the length measured by an observer who is at rest with respect to the object being measured. From eq249,

t=t_0\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}=\frac{2L_0}{c}\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}\; \; \; \; \; \; \; \; 253

Equating eq252 with eq253 gives

L=\frac{L_0}{\gamma}\; \; \; \; \; \; \; \; 254

where \gamma=\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}}.

To illustrate eq254, consider a person on Earth observing a spaceship moving from Earth to the Moon. The proper length between the Earth and the Moon is the length measured by an observer at rest with respect to the two (i.e., the Earth frame), while the proper length of the spaceship is the length measured by an astronaut on the spaceship. The astronaut measures the proper length of the spaceship, which remains the same regardless of the spaceship’s motion. However, due to length contraction, the observer on Earth sees the spaceship as contracted in the direction of its motion. Meanwhile, the astronaut, traveling on the spaceship, observes the distance between the Earth and the moon as contracted, since, from his perspective, the Earth and the moon are moving.

 

Next article: Why does a current flowing through a wire generate a magnetic field?
Previous article: Time dilation
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Time dilation

Time dilation is the difference in time measured by an observer who is moving relative to another observer.

Consider two frames of reference, each with an observer and a clock, which consists of a photon bouncing between two mirrors A and B. If the frame of observer X is at rest, while the frame of observer Y is moving away from it, observer Y sees the photon of his clock travelling vertically between the mirrors (diagram I) and measures t_0=\frac{2L}{c}, where t_0 is known as the proper time, L is the distance between the mirrors and c is the speed of light.

Observer X, however, sees the same photon moving in a triangular path (as observer Y’s clock moves away from him) and measures t=\frac{2D}{c} (diagram II), where t is the time recorded using his own clock.

Substituting D=\sqrt{\left ( \frac{vt}{2}\right )^{2}+L^{2}}, where v is the speed of the moving frame relative to the rest frame, and L=\frac{ct_0}{2} in t=\frac{2D}{c}, we have

t=\gamma t_0\; \; \; \; \; \; \; \; 249

where \gamma=\frac{1}{\sqrt{1-\frac{v^{2}}{c^{2}}}} is called the Lorentz factor, which is useful in deriving the Thomas half.

Eq249 states that the time interval measured by an observer who is stationary relative to the moving clock for events occurring on the moving clock will always be longer than the time interval measured by someone at rest with respect to the clock. This longer time interval means that, from the perspective of the stationary observer, the moving clock is slowing down relative to the observer’s own clock. In other words, the stationary observer will see the moving clock as ticking more slowly, and the time measured by the moving clock will seem dilated or stretched from the viewpoint of the stationary observer.

 

Next article: Length contraction
Previous article: Many-electron systems
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Many-electron systems

An atom with z>1 is an example of a many-electron system.

The electron configuration of a multi-electron atom, e.g. carbon: 1s22s22p2, specifies a general distribution of electrons in the atom’s orbitals. It is slightly vague in terms of energy levels, as different states may arise from a given electron configuration. In the case of carbon, the two 2p electrons could reside in any of the three 2p orbitals, with any spin orientation combinations that do not violate the Pauli Exclusion Principle. For example, the electrons in 2p can be arranged as [\uparrow\; \; \uparrow\; \times], [\uparrow\downarrow\; \times\; \: \times], [\uparrow\; \; \downarrow\; \times], etc.

To evaluate the relative non-relativistic energy levels of these different states for carbon, let’s assume the following:

  1. The core electrons of 1s22s2 do not affect the relative energy levels. This is because there is only one way to arrange these electrons, with the same arrangement common to all the different states. Consequently, the electronic configuration of carbon is effectively reduced to p2.
  2. The coupling of orbital angular momenta of the p2 electrons occurs separately from the coupling of spin angular momenta of the electrons, where \boldsymbol{\mathit{L}}=\sum_{i=1}^{n}\boldsymbol{l}_i and \boldsymbol{\mathit{S}}=\sum_{i=1}^{n}\boldsymbol{s}_i.
  3. The states \vert L,M_L,S,M_S\rangle are represented by total wavefunctions that are antisymmetric with respect to electron label exchange.

The non-relativistic multi-electron Hamiltonian \hat{H}_T of eq240 is constructed without considering orbital angular momentum coupling and spin angular momentum coupling. To satisfy our second assumption, we modify \hat{H}_T as follows:

\hat{H}_T^{'}=-\frac{\hbar^{2}}{2m_e}\sum_{i=1}^{n}\nabla_i^{\; 2}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j-\sum_{i=1}^{n}\frac{Ze^2}{4\pi\varepsilon_0r_i}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}

The extra term is a correction to the energy levels of states due to angular momentum coupling. The correction for spin-spin interactions, however, does not require an additional term, as it is accounted for in the last term (will be explained shortly).

 

Question

How is the new term derived?

Answer

Orbital angular momentum coupling is the interaction of the orbital magnetic moments of a pair of electrons. The interaction creates a potential in the form of eq67, where

U_{oo}=A\boldsymbol{\mathit{l}}_1\cdot\boldsymbol{\mathit{B}}

where A=\frac{e}{2m_e}, \boldsymbol{\mathit{l}}_1 is the angular momentum of one electron and \boldsymbol{\mathit{B}} is the magnetic field due to the orbital motion of the other electron.

Since \boldsymbol{\mathit{B}}\propto\boldsymbol{\mathit{l}}_2 (see article on Biot-Savart law), U_{oo}=C_{12}\boldsymbol{\mathit{l}}_1\cdot\boldsymbol{\mathit{l}}_2, where C_{12} is the constant of proportionality. For a system of more than two electrons, U_{oo}=\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j.

 

Rewriting \hat{H}_T^{'} in terms of eq50 and eq51,

\hat{H}_T^{'}=\hat{H}_{radial}+\sum_{i=1}^{n}\frac{\hat{L}_i^{\;2}}{2mr_i^{\;2}}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}

For the p2 configuration, which involves equivalent electrons, r_1=r_2=r. So,

\hat{H}_T^{'}=\hat{H}_{radial}+D\sum_{i=1}^{n}\hat{L}_i^{\;2}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}

where the constant D=\frac{1}{2mr^{2}}.

Comparing the second and third terms of \hat{H}_T^{'} with eq181, C_{ij}=\frac{1}{mr^{2}}. Therefore,

\hat{H}_T^{'}=\hat{H}_{radial}+D\hat{{L}^{2}}^{(T)}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}\; \; \; \; \; \; \; \; 248

For a given electron configuration like p2, the values of r_i do not change. Therefore, the relative energies of states arising from an electron configuration are determined by the second and third terms of \hat{H}_T^{'}. This reduces \hat{H}_T^{'} to

\hat{H}_{relative}=\hat{H}_{angular}+\hat{H}_{ee\: repulsion}

We know from eq244 that \left [\hat{{L}^{2}}^{(T)}, \hat{H}_T \right]=0 and hence \left [\hat{{L}^{2}}^{(T)}, \hat{H}_{relative} \right]=0, which results in our ability to choose a common set of eigenstates for \hat{{L}^{2}}^{(T)} and \hat{H}_{relative}. Since the eigenvalue of \hat{{L}^{2}}^{(T)} is a function of the total orbital angular momentum quantum number L, E_{angular} is a function of L.

 

Question

Show that \left [\hat{{L}^{2}}^{(T)}, \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j\right ]=0 for a two-electron system.

Answer

Using eq181, eq99, eq100, eq101, the identities [\hat{A}+\hat{B}+\hat{C},\hat{D}]=[\hat{A},\hat{D}]+[\hat{B},\hat{D}]+[\hat{C},\hat{D}] and [\hat{A}\hat{B},\hat{C}]=[\hat{A},\hat{C}]\hat{B}+\hat{A}[\hat{B},\hat{C}], and that the component operators of \hat{L}_1 and \hat{L}_2 commute because they act on different vector spaces, we have \left [\hat{{L}^{2}}^{(T)}, \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}C_{ij}\boldsymbol{\mathit{l}}_i\cdot\boldsymbol{\mathit{l}}_j\right ]=0.

 

Lastly, two factors contribute to r_{ij}=\vert\boldsymbol{\mathit{r}}_i-\boldsymbol{\mathit{r}}_j\vert of the term \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}.  The first, called Coulomb interaction, can be explained by classical mechanics, where two electrons orbiting in the same direction meet less often than when they are orbiting in opposite directions. This implies that \vert\boldsymbol{\mathit{r}}_i-\boldsymbol{\mathit{r}}_j\vert_{ave}(same)>\vert\boldsymbol{\mathit{r}}_i-\boldsymbol{\mathit{r}}_j\vert_{ave}(opposite). Furthermore, the individual angular momentum of each electron orbiting in the same direction adds vectorially to give a higher total angular momentum (and hence a higher value of the quantum number L) than electrons orbiting in opposite directions. Hence, the Coulomb contribution to \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}} is dependent on the quantum number L. The second factor affecting r_{ij}=\vert\boldsymbol{\mathit{r}}_i-\boldsymbol{\mathit{r}}_j\vert is the exchange interaction. As mentioned in an earlier article, the exchange interaction is due to the symmetry of the total wavefunction of the state \vert L,M_L,S,M_S\rangle. We can infer from eq247 that E_{ee\: repulsion} for total wavefunctions with anti-symmetric spin wavefunctions are greater than E_{ee\: repulsion} for total wavefunctions with symmetric spin wavefunctions. Hence, E_{ee\: repulsion} is dependent on the total spin angular momentum quantum number S.

This above analysis of energy levels of different states (with respect to \hat{H}_T) can be extended to any electron configuration of equivalent electrons. In short, for a given electron configuration with equivalent electrons, states with the same value of L and the same value of S have the same energy.

 

Question

How about a particular electron configuration of non-equivalent electrons, where r_i\neq r_j?

Answer

The concept of states with the same value of L and the same value of S having the same energy also applies, to reasonable estimation, to a given electron configuration of non-equivalent electrons because the eigenvalue U_{oo} of the orbit-orbit coupling Hamiltonian 2(\hat{\boldsymbol{\mathit{L}}}_1\cdot\hat{\boldsymbol{\mathit{L}}}_2) in \hat{{L}^{2}}^{(T)}=\hat{L}_1^{\;2}+\hat{L}_2^{\;2}+2(\hat{\boldsymbol{\mathit{L}}}_1\cdot\hat{\boldsymbol{\mathit{L}}}_2) is relatively small compared to the eigenvalues of the other terms in \hat{H}_T^{'}. In fact, 2(\hat{\boldsymbol{\mathit{L}}}_1\cdot\hat{\boldsymbol{\mathit{L}}}_2) is usually omitted in the Hamiltonian, whether we are dealing with equivalent or non-equivalent electrons. Therefore, \hat{{L}^{2}}^{(T)}\approx \hat{L}_1^{\;2}+\hat{L}_2^{\;2} and

\hat{H}_T^{'}\approx -\frac{\hbar^{2}}{2m_e}\sum_{i=1}^{n}\nabla_i^{\; 2}-\sum_{i=1}^{n}\frac{Ze^2}{4\pi\varepsilon_0r_i}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}

 

In summary, we have shown that different states that arise from a particular electron configuration have the same energy if they have the same value of L and the same value of S. However, we have not discussed the complete solution to \hat{H}_T , which involves deriving the exact formula of total wavefunctions of \hat{H}_T and the exact eigenvalues of \hat{H}_T. The solution to \hat{H}_T can be found using the Hartree-Fock method, which will be explained in another article.

 

Next article: Time dilation
Previous article: Exchange force
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Exchange force (quantum mechanics)

Exchange force is the interaction between particles of a system, as a result of the symmetry of the wavefunction describing the system. This interaction results in a change in the expectation values of inter-particle distances, and hence, a change in energy eigenvalues.

Consider a system of two electrons. The exchange force between electrons is also known as spin correlation. Since the electrons are indistinguishable, the normalised spatial wavefunction can be expressed generally as:

\psi_{\pm}(r_1,r_2)=\frac{1}{\sqrt{2}}[\phi_a(r_1)\phi_b(r_2)\pm\phi_b(r_1)\phi_a(r_2)]

where \psi_+ and \psi_- are symmetric spatial wavefunction and anti-symmetric spatial wavefunction respectively of the system of a pair of identical electrons, \phi_a and \phi_b are orthonormal one-electron wavefunctions, and the notation of \phi_a(r_1)\phi_b(r_2) denotes electron 1 in the state \phi_a and electron 2 in the state \phi_b.

To evaluate the effect of wavefunction symmetry on the expectation values of inter-particle distances, we analyse, for convenience, the average value of the square of the inter-particle distance:

\langle(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)^{2}\rangle=\frac{1}{2}\int \psi_{\pm}^{\;*}(r_1^{\;2}+r_2^{\;2}-2\boldsymbol{\mathit{r}}_1\cdot\boldsymbol{\mathit{r}}_2)\psi_{\pm} dr_1dr_2

Substitute \psi_{\pm} in the above equation and expanding,

where \langle r_n^{\;2}\rangle_k=\int \phi_k^{\;*}(r_n) r_n^{\;2}\phi_k(r_n)dr_n and \langle \boldsymbol{\mathit{r}}_n \rangle_k=\int \phi_k^{\;*}(r_n)\boldsymbol{\mathit{r}}_n\phi_k(r_n)dr_n, with n=1,2 and k=a,b.

Applying the Hermitian property of \boldsymbol{\mathit{r}}_1 and \boldsymbol{\mathit{r}}_2 for the last two terms of the above equation,

where \langle\boldsymbol{\mathit{r}}_n\rangle_{ab}=\int \phi_a^{\;*}(r_n)\boldsymbol{\mathit{r}}_n\phi_b(r_n)dr_n.

Since the two particles are indistinguishable, \langle r_1^{\;2}\rangle_k=\langle r_2^{\;2}\rangle_k=\langle r^{2}\rangle_k. Similarly, \langle \boldsymbol{\mathit{r}}_1\rangle_k=\langle \boldsymbol{\mathit{r}}_2\rangle_k=\langle \boldsymbol{\mathit{r}}\rangle_k and \langle \boldsymbol{\mathit{r}}_1\rangle_{ab}=\langle \boldsymbol{\mathit{r}}_2\rangle_{ab}=\langle \boldsymbol{\mathit{r}}\rangle_{ab}. So,

\langle(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)^{2}\rangle=\langle r^{2}\rangle_a+\langle r^{2}\rangle_b-2\langle\boldsymbol{\mathit{r}}\rangle_a\langle\boldsymbol{\mathit{r}}\rangle_b\mp2\vert\langle\boldsymbol{\mathit{r}}\rangle_{ab}\vert^{2}\; \; \; \; \; \; \; \; 246

where we have used the logic that \frac{1}{2}\langle r^{2}\rangle_k\pm\frac{1}{2}\langle r^{2}\rangle_k=\langle r^{2}\rangle_k because the difference of the terms is zero.

The consequence of eq246 is that \langle(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)^{2}\rangle_{\psi_+}<\langle(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)^{2}\rangle_{\psi_-}, which implies that

\frac{1}{\; \; \vert(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)\vert_{\psi_+}}>\frac{1}{\; \; \vert(\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2)\vert_{\psi_-}}\; \; \; \; \; \; \; \; 247

The eigenvalues of the Hamiltonian of eq239 are therefore expected to be different for a singlet state and a triplet state because the total wavefunction of the system, according to Pauli’s exclusion principle, must be anti-symmetric with respect to particle exchange.

 

 

Next article: Many-electron systems
Previous article: Non-relativistic multi-electron Hamiltonian
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Non-relativistic multi-electron Hamiltonian

The three dimensional Hamiltonian operator \hat{H}_T for an n-electron atom (excluding spin-orbit and other interactions) is:

\hat{H}_T=\frac{1}{2m_e}\sum_{i=1}^{n}\hat{p}_i^{\;2}-\sum_{i=1}^{n}\frac{Ze^{2}}{4\pi\varepsilon_0r_i}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}\; \; \; \; \; \; \; \; 239

where \hat{p}_i^{\;2}=\hat{p}_{ix}^{\;2}+\hat{p}_{iy}^{\;2}+\hat{p}_{iz}^{\;2}=\left ( \frac{\hbar}{i}\frac{\partial}{\partial x} \right )^{2}+\left ( \frac{\hbar}{i}\frac{\partial}{\partial y} \right )^{2}+\left ( \frac{\hbar}{i}\frac{\partial}{\partial z} \right )^{2}, r_i=\sqrt{x_i^{\;2}+y_i^{\;2}+z_i^{\;2}} and r_{ij}=\vert\boldsymbol{\mathit{r}}_i-\boldsymbol{\mathit{r}}_j\vert=\sqrt{(x_i-x_j)^{2}+(y_i-y_j)^{2}+(z_i-z_j)^{2}}

or

\hat{H}_T=-\frac{\hbar^{2}}{2m_e}\sum_{i=1}^{n}\nabla_i^{\;2}-\sum_{i=1}^{n}\frac{Ze^{2}}{4\pi\varepsilon_0r_i}+\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{e^{2}}{4\pi\varepsilon_0r_{ij}}\; \; \; \; \; \; \; \; 240

where \nabla_i^{\;2}=\frac{\partial^{2}}{\partial x_i^{\;2}}+\frac{\partial^{2}}{\partial y_i^{\;2}}+\frac{\partial^{2}}{\partial z_i^{\;2}}.

It is the total energy operator of the atom. The first term consists of the kinetic energy operators of the atom’s electrons and acts on a function of r, \theta and \phi. The second term, which is the electrostatic potential energy for the attractions between the electrons and the protons, acts on a function of just r because we have assumed an infinitely heavy nucleus that is reduced to a point at the origin. The third term, which acts on a function of r, \theta and \phi, is the electrostatic potential energy of electron repulsions. To illustrate the double summation for the last term, we consider a four-electron system with electrons e1, e2, e3 and e4. The possible interacting potentials, in the form of ejek, are:

which can be represented by the double summation {\color{Red} \sum_{i=1}^{n-1}}{\color{Blue} \sum_{j=i+1}^{n}}. Note that this double summation can also be represented as \frac{1}{2}{\color{Blue} \sum_{j\neq i}}{\color{red} \sum_{i=1}^{n}}, which is illustrated as:

or {\color{Blue} \sum_{j> i}}{\color{red} \sum_{i=1}^{n}}, which is

To show that eq239 commutes with the components of the total orbital angular momentum operator \hat{\boldsymbol{\mathit{L}}}^{(T)}, we consider a two electron system. As we know, \hat{\boldsymbol{\mathit{L}}}^{(T)}=\hat{\boldsymbol{\mathit{L}}}_1+\hat{\boldsymbol{\mathit{L}}}_2, where \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 are the individual orbital angular momentum of the system. Using eq76, we have

\hat{\boldsymbol{\mathit{L}}}^{(T)}=\hat{\boldsymbol{\mathit{i}}}\hat{L}_{1x}+\hat{\boldsymbol{\mathit{j}}}\hat{L}_{1y}+\hat{\boldsymbol{\mathit{k}}}\hat{L}_{1z}+\hat{\boldsymbol{\mathit{i}}}\hat{L}_{2x}+\hat{\boldsymbol{\mathit{j}}}\hat{L}_{2y}+\hat{\boldsymbol{\mathit{k}}}\hat{L}_{2z}

\hat{\boldsymbol{\mathit{L}}}^{(T)}=\hat{\boldsymbol{\mathit{i}}}(\hat{L}_{1x}+\hat{L}_{2x}) +\hat{\boldsymbol{\mathit{j}}}(\hat{L}_{1y}+\hat{L}_{2y}) +\hat{\boldsymbol{\mathit{k}}}(\hat{L}_{1z}+\hat{L}_{2z})

where (\hat{L}_{1x}+\hat{L}_{2x}), (\hat{L}_{1y}+\hat{L}_{2y}) and (\hat{L}_{1z}+\hat{L}_{2z}) are components of \hat{\boldsymbol{\mathit{L}}}^{(T)}.

For the uncoupled case, where the orbital angular momenta of the two electrons do not interact, eq239 becomes \hat{H}_{T,uc}=\frac{1}{2m_e}\sum_{i=1}^{2}\hat{p}_i^{\;2}-\sum_{i=1}^{2}\frac{Ze^{2}}{4\pi\varepsilon_0r_i}. From eq103, eq104 and eq105, we have [\hat{L}_{mk},\frac{1}{r}]=0 and [\hat{L}_{mk},\hat{p}^{2}]=0, where m=1,2 and k=x,y,z. This implies that the components of \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 commutes with \hat{H}_{T,uc}, and that \left [ \hat{L}_k^{\;(T)}=\hat{L}_{1k}+\hat{L}_{2k},\frac{1}{r} \right ]=0 and \left [ \hat{L}_k^{\;(T)},\hat{p}^{2} \right ]=0, and hence \left [ \hat{L}_k^{\;(T)},\sum_{i=1}^{2}\frac{1}{r}_i \right ]=0 and \left [ \hat{L}_k^{\;(T)},\sum_{i=1}^{2}\hat{p}_i^{\;2} \right ]=0. Therefore, when we analyse the commutation relations of the components of \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 with \hat{H}_T, we are left with the commutation relations of the components of \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 with \frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert}.

 

Question

Show that the components of \hat{\boldsymbol{\mathit{L}}}^{(T)}, but not those of \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2, commute with \frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert}.

Answer

Using eq74,

\left [ \hat{L}_{1z},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ]=\frac{\hbar}{i\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert^{3/2} }\left [ -x_1(y_1-y_2)+y_1(x_1-x_2) \right ]\neq0\; \; \; \; \; \; \; \; 241

\left [ \hat{L}_{2z},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ]=\frac{\hbar}{i\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert^{3/2} }\left [ x_2(y_1-y_2)-y_2(x_1-x_2) \right ]\neq0\; \; \; \; \; \; \; \; 242

Similarly,  the x,y-components of the uncoupled operators \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 do not commute with with \frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert}. Therefore, all components of \hat{\boldsymbol{\mathit{L}}}_1 and \hat{\boldsymbol{\mathit{L}}}_2 do not commute with the Hamiltonian \hat{H}_T.

Using the identity [\hat{A}+\hat{B},\hat{C}]=[\hat{A},\hat{C}]+[\hat{B},\hat{C}] and substituting eq241 and eq242 into \left [ \hat{L}_{1z}+\hat{L}_{2z},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ], we have \left [ \hat{L}_{1z}+\hat{L}_{2z},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ]=0. Similarly, we find that \left [ \hat{L}_{1x}+\hat{L}_{2x},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ]=0 and \left [ \hat{L}_{1y}+\hat{L}_{2y},\frac{1}{\vert\boldsymbol{\mathit{r}}_1-\boldsymbol{\mathit{r}}_2\vert} \right ]=0. So,

\left [ \hat{L}_x^{\;(T)},\hat{H}_T \right ]=\left [ \hat{L}_y^{\;(T)},\hat{H}_T \right ]=\left [ \hat{L}_z^{\;(T)},\hat{H}_T \right ]=0 \; \; \; \; \; \; \; \; 243

 

Therefore, the components of the coupled total orbital angular momentum operator commutes with the Hamiltonian \hat{H}_T. If so, we can evaluate \left [ \hat{{L}^{2}}^{(T)},\hat{H}_T \right ] using the identities [\hat{A}+\hat{B}+\hat{C},\hat{D}]=[\hat{A},\hat{D}]+[\hat{B},\hat{D}]+[\hat{C},\hat{D}] and [\hat{A}\hat{B},\hat{C}]=[\hat{A},\hat{C}]\hat{B}+\hat{A}[\hat{B},\hat{C}] and eq243, which gives us

\left [ \hat{{L}^{2}}^{(T)},\hat{H}_T \right ]=0\; \; \; \; \; \; \; \; 244

Finally, \hat{H}_T also commutes with \hat{{S}^{2}}^{(T)} and \hat{S}_z^{\;(T)} because \hat{H}_T and spin angular momentum operators act on different vector spaces.

 

Question

Show that \hat{\boldsymbol{\mathit{L}}}^{(T)}\cdot\hat{\boldsymbol{\mathit{S}}}^{(T)} commutes with \hat{H}_T, where \hat{\boldsymbol{\mathit{L}}}^{(T)}=\sum_{i=1}^{n}\hat{\boldsymbol{\mathit{L}}}_i and .\hat{\boldsymbol{\mathit{S}}}^{(T)}=\sum_{i=1}^{n}\hat{\boldsymbol{\mathit{S}}}_i.

Answer

\left [ \hat{\boldsymbol{\mathit{L}}}^{T}\cdot\hat{\boldsymbol{\mathit{S}}}^{T},\hat{H}_T\right ]=\left [\hat{L_x^{\;(T)}}\hat{S_x^{\;(T)}}+\hat{L_y^{\;(T)}}\hat{S_y^{\;(T)}}+\hat{L_z^{\;(T)}}\hat{S_z^{\;(T)}},\hat{H}_T \right ]

Using the identities [\hat{A}+\hat{B}+\hat{C},\hat{D}]=[\hat{A},\hat{D}]+[\hat{B},\hat{D}]+[\hat{C},\hat{D}] and [\hat{A}\hat{B},\hat{C}]=[\hat{A},\hat{C}]\hat{B}+\hat{A}[\hat{B},\hat{C}], and noting that every component of \hat{\boldsymbol{\mathit{L}}} commutes with every component of \hat{\boldsymbol{\mathit{S}}} because they act on different vector spaces, and that the components of \hat{\boldsymbol{\mathit{S}}} commutes with \hat{H}_T because they act on different vector spaces, we have

\left [ \hat{\boldsymbol{\mathit{L}}}^{T}\cdot\hat{\boldsymbol{\mathit{S}}}^{T},\hat{H}_T\right ]=0\; \; \; \; \; \; \; \; 245

 

 

Next article: Exchange force
Previous article: Pauli exclusion principle
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Pauli exclusion principle

The Pauli exclusion principle states that no two identical fermions (particles with spin s=\frac{1}{2},\frac{3}{2},\frac{5}{2},\cdots) can occupy the same state.

The principle is a consequence of the postulate that the wavefunction of a system of fermions is antisymmetric with respect to label exchange, i.e. the wavefunction changes sign when the labels are exchanged.

The identical property of particles is fulfilled by the linear combinations:

\psi_{\pm}(r_1,r_2)=c\left [ \psi_{r_1}(1)\psi_{r_2}(2)\pm\psi_{r_1}(2)\psi_{r_2}(1)\right ]

where the labels 1 and 2 denote the first particle and second particle respectively, \psi_{r_j}(i) denotes particle i in the state \psi_{r_j} and there is equal probability \vert c\vert^{2} of the wavefunction \psi_{\pm} collapsing to \psi_{r_1}(1)\psi_{r_2}(2) and \psi_{r_1}(2)\psi_{r_2}(1) upon measurement.

 

Question

Show that \psi_+ is symmetric with respect to label exchange, while \psi_- is antisymmetric with respect to label exchange.

Answer

Exchanging the labels of \psi_+=c\left [ \psi_{r_1}(1)\psi_{r_2}(2)+\psi_{r_1}(2)\psi_{r_2}(1)\right ] and \psi_-=c\left [ \psi_{r_1}(1)\psi_{r_2}(2)-\psi_{r_1}(2)\psi_{r_2}(1)\right ],

c\left [ \psi_{r_1}(2)\psi_{r_2}(1)+\psi_{r_1}(1)\psi_{r_2}(2)\right ]=\psi_+

c\left [ \psi_{r_1}(2)\psi_{r_2}(1)-\psi_{r_1}(1)\psi_{r_2}(2)\right ]=-\psi_-

 

Since the wavefunction of a system of fermions is antisymmetric with respect to label exchange, it must be expressed by

\psi_-(r_1,r_2)=c\left [ \psi_{r_1}(1)\psi_{r_2}(2)-\psi_{r_1}(2)\psi_{r_2}(1)\right ]

When \psi_{r_1}=\psi_{r_2}, the wavefunction \psi_-=0, which contradicts the proposition that an eigenfunction is a non-zero function. Therefore, no two identical fermions can occupy the same state.

If the system is an atom, the total wavefunction (spatial + spin) describing two identical electrons (s=\frac{1}{2}) is

\Psi(n,l,m_l,m_s)=\psi(n,l,m_l)\sigma(m_s)

\psi(n,l,m_l)=\psi_{\pm}=\phi_{n_1,l_1,m_{l1}}(1)\phi_{n_2,l_2,m_{l2}}(2)\pm\phi_{n_1,l_1,m_{l1}}(2)\phi_{n_2,l_2,m_{l2}}(1) is the composite spatial wavefunction of the system, where \phi is a one-electron spatial wavefunction (also known as hydrogenic wavefunction). \sigma(m_s) is the composite spin wavefunction of the system and is explicitly given by eq222 through eq225. Since \Psi must be antisymmetric with respect to label exchange

\Psi=\left\{\begin{matrix} \psi_+\sigma_d\\\psi_-\sigma_{a\: or\: b\: or\: c} \end{matrix}\right.

When n_1=n_2, l_1=l_2 and m_{l1}=m_{l2} only \psi_+\sigma_d is non-zero. Since \sigma_d refers to the singlet state (i.e. a state with anti-parallel spins, where m_{s1}\neq m_{s2}), no two identical electrons in an atom can have the same set of quantum numbers. This is consistent with the general statement that no two identical fermions can occupy the same state because a distinct quantum state of an atom is characterised by a particular set of quantum numbers, e.g. n=1, l=1, m_l=0 and m_s=\frac{1}{2}.

 

Next article: Non-relativistic multi-electron Hamiltonian
Previous article: Exchange operator
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Exchange operator (quantum mechanics)

The exchange operator \hat{P}_{ik} acts on a function resulting in the swapping of labels of any two identical particles, i.e.

\hat{P}_{ik}\psi(q_1,\cdots,q_i,\cdots,q_k,\cdots,q_n)=\psi(q_1,\cdots,q_k,\cdots,q_i,\cdots,q_n)

where the label q_n refers to the n-th particle.

If we apply \hat{P}_{ik} twice on the function,

\hat{P}_{ik}^{\; \; 2}f(\cdots,q_i,\cdots,q_k,\cdots)=\hat{P}_{ik}f(\cdots,q_k,\cdots,q_i,\cdots)=f(\cdots,q_i,\cdots,q_k,\cdots)

Therefore, \hat{P}_{ik}^{\; \; 2}=\hat{I}, where \hat{I} is the identity operator.

 

Question

Show that the eigenvalues of \hat{P}_{ik} are \lambda=\pm1.

Answer

If is an eigenfunction of , then \hat{P}_{ik}^{\; \; 2}\psi=\lambda\hat{P}_{ik}\psi=\lambda^{2}\psi. Since \hat{P}_{ik}^{\; \; 2}=\hat{I}, we have \psi=\lambda^{2}\psi. As an eigenfunction must be non-zero\lambda=\pm1.

 

Experiment data reveals that the wavefunction of a system of two identical fermions (particles with spin s=\frac{1}{2},\frac{3}{2},\frac{5}{2},\cdots)  is antisymmetric with respect to label exchange (i.e. the eigenvalue is -1 when the exchange operator acts on the wavefunction), while the wavefunction of a system of two identical bosons (s=0,1,2,\cdots) is symmetric with respect to label exchange (eigenvalue of +1). The antisymmetric property of fermion wavefunctions and the symmetric property of boson wavefunctions can be regarded as postulates of quantum mechanics.

For identical fermions, \hat{P}_{ik}\psi(\cdots,q_i,\cdots,q_k,\cdots)=-\psi(\cdots,q_k,\cdots,q_i,\cdots). Since the way we label identical particles cannot affect the state of the system, the commutation relation between \hat{P}_{ik} and the Hamiltonian is

\left [\hat{P}_{ik},\hat{H}\right ]\psi=\hat{P}_{ik}\hat{H}\psi-\hat{H}\hat{P}_{ik}\psi=E\hat{P}_{ik}\psi+\hat{H}\psi=-E\psi+E\psi=0

This implies that, for a system of identical fermions, we can select a common complete set of eigenfunctions for \hat{P}_{ik} and \hat{H}, with the eigenfunctions being antisymmetric under label exchange.

 

Question

Show that \hat{P}_{ik} commutes with \hat{{S}^{2}}^{(T)}, \hat{S}_z^{\; (T)}, \hat{S}_1^{\; 2}, \hat{S}_2^{\; 2}, \hat{\boldsymbol{\mathit{S}}}_1\cdot\hat{\boldsymbol{\mathit{S}}}_2, \hat{{L}^{2}}^{(T)}, \hat{L}_z^{\; (T)}, \hat{L}_{1z}, \hat{S}_{1z}, \hat{\boldsymbol{\mathit{S}}}^{(T)}\cdot\hat{\boldsymbol{\mathit{L}}}^{(T)}, \hat{J}^{2} and \hat{J}_z.

Answer

\left [ \hat{P}_{12},\hat{S}_1^{\; 2} \right ]\psi=s_1(s_1+1)\hbar^{2}\hat{P}_{12}\psi+\hat{S}_1^{\;2}\psi=-s_1(s_1+1)\hbar^{2}\psi+s_1(s_1+1)\hbar^{2}\psi=0

Similarly, \left [ \hat{P}_{12},\hat{S}_2^{\; 2} \right ]=0. For \hat{{S}^{2}}^{(T)} and \hat{S}_z^{\;(T)}, we have \hat{{S}^{2}}^{(T)}\psi=S(S+1)\hbar^{2}\psi and \hat{S}_z^{\;(T)}\psi=m_S\hbar\psi respectively. So \left [\hat{P}_{12} ,\hat{{S}^{2}}^{(T)}\right ]=0 and \left [\hat{P}_{12} ,\hat{S}_z^{\;(T)}\right ]=0. Since \hat{{S}^{2}}^{(T)}=\hat{S}_1^{\;2}+2\hat{\boldsymbol{\mathit{S}}}_1\cdot\hat{\boldsymbol{\mathit{S}}}_2+\hat{S}_2^{\;2}, we have \left [\hat{P}_{12},\hat{\boldsymbol{\mathit{S}}}_1\cdot\hat{\boldsymbol{\mathit{S}}}_2\right ]=0. Similarly, \hat{P}_{12} commutes with \hat{{L}^{2}}^{(T)}, \hat{L}_z^{\;(T)}, \hat{L}_{1z} and \hat{S}_{1z}.

Next, \left [\hat{P}_{12},\hat{\boldsymbol{\mathit{S}}}^{(T)}\cdot\hat{\boldsymbol{\mathit{L}}}^{(T)}\right ]=\left [\hat{P}_{12},\left ( \hat{\boldsymbol{\mathit{S}}}_1+\hat{\boldsymbol{\mathit{S}}}_2\right )\cdot\left ( \hat{\boldsymbol{\mathit{L}}}_1+\hat{\boldsymbol{\mathit{L}}}_2\right )\right ]. Expanding this equation using eq76 and eq179, and noting that \left [\hat{P}_{12},\hat{L}_{mk}\right ]=0 and \left [\hat{P}_{12},\hat{S}_{mk}\right ]=0, where m=1,2 and k=x,y,z, we have \left [\hat{P}_{12},\hat{\boldsymbol{\mathit{S}}}^{(T)}\cdot\hat{\boldsymbol{\mathit{L}}}^{(T)}\right ]=0. It follows that \left [\hat{P}_{12},\hat{J}^{2}\right ]=\left [\hat{P}_{12},\hat{{L}^{2}}^{(T)}\right ]+\left [\hat{P}_{12},\hat{{S}^{2}}^{(T)}\right ]+\left [\hat{P}_{12},2\hat{\boldsymbol{\mathit{S}}}^{(T)}\cdot\hat{\boldsymbol{\mathit{L}}}^{(T)}\right ]=0 and that \left [\hat{P}_{12},\hat{J}_z\right ]=\left [\hat{P}_{12},\hat{L}_z^{\; (T)}\right ]+\left [\hat{P}_{12},\hat{S}_z^{\; (T)}\right ]=0.

 

 

Next article: Pauli exclusion principle
Previous article: BB84 protocol
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

BB84 protocol

BB84 is a quantum key distribution protocol that is based on photon polarisation states. It was invented by Charles Bennett and Gilles Brassard in 1984. The protocol can be broken down into the following steps:

1) The sender transmits a raw key in the form of a string of photons via an optical fibre to the receiver. Each photon is randomly chosen by the sender to be polarised along one of four directions: i) z-direction or \updownarrow  (\theta=0^{\circ}), ii) x-direction or \leftrightarrow (\theta=90^{\circ}), iii) ⤢ (\theta=45^{\circ}), iv) ⤡ (\theta=-45^{\circ}). This can be accomplished by passing photons generated by a laser L through an attenuator A (a material that reduces the intensity of the laser beam through absorption or reflection to produce single photons), followed by a birefringent crystal B_1 to separate photons into two orthogonal polarisations, one of which is passed through a Pockels cell P_1 for the appropriate rotations (see diagram below).

2) The polarised photons are assigned the following qubit values: \updownarrow=0, \leftrightarrow=1, ⤢ = 0, ⤡ = 1, which can be grouped into the following bases:

Basis 0 1
+ \updownarrow \leftrightarrow
\times

3) The receiver, using Pockels cell B_2, randomly selects one of two rotations (90^{\circ} representing the basis +, or 45^{\circ} representing the basis \times) to analyse each photon sent by the sender. The outputs of the detectors D_1 and D_2 are set to 1 and 0 respectively.

Assuming a raw key of twelve bits are sent, we may have the following case:

Sender Polarisation \small \updownarrow \small \leftrightarrow \small \leftrightarrow \small \leftrightarrow \small \updownarrow \small \updownarrow \small \leftrightarrow
Qubit value 0 1 1 1 0 1 0 0 1 1 0 1
Receiver Basis \small + \small + \small \times \small + \small + \small \times \small \times \small \times \small \times \small + \small \times \small +
Qubit value 0 1 1 0 0 0 0 0 1 1 1 1
Retained qubits 0 1 0 1 1

1) The sender and receiver share their bases with each other publicly (e.g. over the internet).

2) Only the qubits corresponding to polarisations with the same basis are retained. To complete the protocol, all other qubits can be converted to 0. The final key in the above example is 010000001001, which is used to encrypt a message (assuming of the same qubit-length) using modular addition.

Let’s suppose a third party intercepts the sender’s qubits and measures them using the same method as the receiver. The interceptor then has to guess the sender’s polarisation for each measured qubit and resend them to the receiver (see table below). Since the interceptor does not know the bases selected by the sender, the bits he measured are only useful if he is lucky enough to choose the same bases for the qubits that the sender and receiver eventually retained.

Sender Polarisation \small \updownarrow \small \leftrightarrow \small \leftrightarrow \small \leftrightarrow \small \updownarrow \small \updownarrow \small \leftrightarrow
Qubit value 0 1 1 1 0 1 0 0 1 1 0 1
Interceptor Basis \small \times \small + \small \times \small \times \small + \small + \small + \small \times \small + \small + \small \times \small \times
Qubit value 0 1 0 1 1 1 0 1 0 0 0 1
Resends \small \leftrightarrow \small \leftrightarrow \small \leftrightarrow \small \updownarrow \small \updownarrow \small \updownarrow
Receiver Basis \small + \small + \small \times \small + \small + \small \times \small \times \small \times \small \times \small + \small \times \small +
Qubit value 1 1 0 0 1 0 0 1 0 0 0 1
Retained qubits 1 1 0 0 1
Discrepancies Y N N Y N

To detect the presence of an interceptor, the sender and receiver share pre-determined segments of qubits with each other. The probability that the sender and receiver select the same bases but the interceptor chooses different bases is \small \frac{1}{4}, of which half the time the interceptor resend polarisations of the ‘wrong’ basis to the receiver. Therefore, the probability of the sender and the receiver detecting discrepancies in the retained qubits when they share segments of their qubit values is \small \frac{1}{8} (assuming the shared qubits are representative of a randomly selected segment), which is very significant if the qubit-length of the raw key is relatively long. For example, if the raw key has 72 qubits, of which 40 common qubits are eventually shared between the sender and the receiver, 5 retained qubits will be erroneous. When discrepancies are detected, the sender and receiver will discard the key and start over.

 

 

Next article: exchange operator
Previous article: Photon polarisation
Content page of quantum mechanics
Content page of advanced chemistry
Main content page
Mono Quiz