Rotational Raman selection rules for linear rotors

Rotational Raman selection rules describe the allowed changes in a molecule’s rotational angular momentum states during inelastic light scattering.

The explicit rotational Raman selection rules for a linear rotor can be derived by considering the molecule with a molecular frame interacting with an external electric field applied along the laboratory -axis ().

The angle is the polar angle between the molecular -axis and the laboratory -axis, while is the azimuthal angle in the molecular -plane, measured from the -axis towards the -axis (see diagram above).

The first step of the derivation involves expressing the induced dipole moment in terms of the molecular -axis before projecting it onto the direction of the electric field (laboratory -axis).

Simple geometry shows that the direction cosines (the projections of the unit vector of the laboratory -axis onto the molecular axes) are:

Similarly, the electric field applied along the laboratory -axis has the following components in the molecular frame:

In the molecular frame, the induced dipole moment is a vector with three components :

where are the unit vectors along the molecular axes.

Likewise, the laboratory unit vector can be written in terms of the molecular frame as . Since the projection of any vector onto a unit vector is given by the dot product between the two vectors (see diagram below), the induced dipole moment along the laboratory -axis can be expressed by projecting onto as follows:

Substituting eq4 into eq6 yields:

For a linear molecule, the molecular axes coincide with the principal axes of the polarisability tensor, so that off-diagonal components vanish. Combining ,  and with eq5 and eq7 results in:

Replacing and with , and with (since the molecular -axis is the parallel axis of rotation, while the molecular – and -axes are the perpendicular axes of rotation), and using , gives:

As explained in the previous article, the transition probability associated with scattered radiation is evaluated using , which in this case, is:

where and are the un-normalised spherical harmonics.

The integral is equal to zero and when and  respectively. Thus, we require for a probable transition to occur (), resulting in:

where



is non-zero when either or is non-zero, or when both are non-zero. For the first integral, let and , giving:

which is non-zero only if , or equivalently,

This corresponds to Rayleigh scattering.

The second integral, with and , is:

Substituting the recurrence relation into eq10 yields:

where


Letting in the same recurrence relation and substituting it into eq11 results in:

where




for a linear rotor because . and are non-zero only if , which gives the same Rayleigh scattering selection rule as eq9. For to be non-zero, we require or equivalently, . The selection rule corresponding to is . Therefore, the selection rules for rotational Raman transition of a linear rotor are:

 

Question

Explain why is consistent with for a linear rotor.

Answer

The polarisability of a linear rotor, as shown in the previous article, changes with the rotational angle : . This mirrors the fact that . If ,the molecule is spherically symmetric and there is no change in as the molecule rotates, i.e. .

 

The selection rules can also be explained semiclassically, with the induced dipole moment given by:

where is the angular frequency of the incident radiation.

If the linear molecule is rotating an angular frequency , the value of its polarisability repeats twice per revolution cycle (see diagram above):

where is the constant baseline (average) polarisability of the molecule.

Substituting eq13 into eq12 gives:

Using the identity yields:

Eq14 shows that the induced dipole moment consists of three components: one oscillating at the incident frequency, and two at . Each time-dependent component of the induced dipole moment corresponds to an oscillating dipole, which radiates electromagnetic energy at the frequency at which it oscillates. It follows that the components oscillating at and give rise to Rayleigh-scattered radiation and Raman-scattered radiation respectively.

 

Question

Why does each component on the RHS of eq14 correspond to an oscillating, rather than the entire sum corresponding to a single oscillating dipole?

Answer

An oscillating electric dipole is defined by an electric dipole moment that varies sinusoidally in time at a single frequency, for example . Therefore, eq14 represents a superposition of three independent sinusoidal components, which can be viewed as a Fourier decomposition of the induced dipole moment. Each term oscillates at a distinct frequency and therefore corresponds to an independent oscillating dipole that radiates at that frequency.

 

Raman selection rules are often discussed for thermally populated rotational levels, where is typically large (the classical limit). For large , the magnitude of the quantum angular momentum of the molecule is . Substituting this approximation into gives:

is the angular frequency of a Stokes-scattered photon, which is associated with the resultant quantum rotational transitional frequency of . This corresponds to the selection rule , because at large ,

Similarly, the angular frequency (anti-Stokes scattering) leads to the selection rule .

 

Next article: rotational raman selection rules for symmetric rotors
Previous article: General selection rules for scattered radiation
Content page of Raman spectroscopy
Content page of advanced chemistry

General selection rules for scattered radiation

Selection rules for scattered radiation describe the allowed changes in a molecule’s state during light scattering.

According to the time-dependent perturbation theory, the transition probability between the orthogonal states and is proportional to the square of the corresponding transition matrix element . For example, in conventional rotational spectroscopy, this involves the electric dipole moment operator , and transitions occur only for molecules with a permanent dipole moment.

Transitions associated with scattering radiation, however, arise from an induced dipole moment rather than a permanent one. It follows, with reference to eq1, that the transition probability is given by:

Thus, a necessary condition for scattered-radiation transitions is that the matrix element be non-zero. To examine this condition further, we consider how the molecular polarisability depends on the molecular coordinate , which may represent a rotational angle or a vibrational displacement. This dependence can be expressed through a Taylor series about the equilibrium configuration:

Substituting this series (and neglecting higher-order terms) into gives:

Because the wavefunctions are orthogonal, the first term vanishes unless , which corresponds to Rayleigh scattering. Consequently, a Raman transition can occur only if , i.e. the polarisability of the molecule must change as it undergoes rotational or vibrational motion.

When , we have , which is the transition probability for Rayleigh scattering. When , we have the transition probability for Raman scattering:

Since represents a small perturbation of the electron cloud of the molecule, . Furthermore, the magnitude of the transition matrix element is never large: for vibrational Raman scattering it is proportional to the small zero-point vibrational amplitude, while for rotational Raman scattering it is a dimensionless angular matrix element bounded by unity. Consequently,

 

Next article: rotational raman selection rules for linear rotors
Previous article: Raman spectroscopy (instrumentation)
Content page of Raman spectroscopy
Content page of advanced chemistry

Raman spectroscopy (instrumentation)

Raman spectroscopy relies on a compact yet sophisticated set of instruments designed to probe molecular rovibrational states through photon-matter interactions.

The laser provides monochromatic, coherent and intense light required to induce Raman scattering. Common laser types include diode lasers, Nd:YAG lasers and argon-ion lasers, operating at wavelengths ranging from the ultraviolet to the near-infrared region. The sample compartment is designed to accommodate solids, liquids or gases with minimal preparation. Raman instruments may use backscattering (180°), right-angle (90°) or forward-scattering geometries, depending on the application. In the above diagram, scattered radiation perpendicular to the laser beam is focused onto a pinhole.

The pinhole acts as a spatial filter that controls the amount of light arriving at the detector and helps define the resolution of the final spectrum. After passing through the slit, the scattered radiation still contains a mixture of Rayleigh, Stokes and anti-Stokes frequencies and is therefore dispersed into its individual wavelength components using a reflective diffraction grating. Each groove on the grating acts as a secondary source of light, generating spherical wavefronts in accordance with Huygen’s principle. Neighbouring reflected wavefronts interfere with one another to produce constructive diffraction patterns as defined by:

where

is the order of the diffraction (with the detector typically positioned to record ).
is the wavelength of the light.
is the spacing between the grooves.
is the angle between the incident ray and grating’s normal vector.
is the angle between the diffracted ray and grating’s normal vector.

 

Question

Derive eq3.

Answer

With respect to the above diagram, the difference in path length between two adjacent incident rays is , and that between two adjacent diffracted rays is . Since the incident ray and the diffracted ray are on opposite sides of the normal, the total path difference is , assuming that both angles are positive. This difference must be an integer multiple of the light’s wavelength for constructive interference to occur. Therefore, .

 

Since the groove spacing is fixed, each wavelength is diffracted at a unique angle, causing light reflected from different grooves to travel slightly different path lengths. Constructive interference occurs when these wavefronts arrive in phase, producing intensity maxima at specific angles, while destructive interference reduces or cancels the signal at other angles. This process results in the spatial separation of wavelengths at the focal plane of the instrument.

A charge-coupled device (CCD) is positioned at this focal plane to detect the dispersed light. Its location is precisely calculated to receive the first-order () diffracted rays. The CCD consists of a two-dimensional array of light-sensitive pixels fabricated on a silicon substrate and is widely used due to its high sensitivity, low-noise quality, and ability to simultaneously record a broad spectral range. When photons strike a pixel, they are absorbed by the silicon, generating electron–hole pairs via the photoelectric effect. The number of electrons produced is proportional to the intensity of the incident light. Because each wavelength is focused onto a distinct position on the detector, individual pixels record the intensity corresponding to a single Raman frequency, yielding the final spectrum of intensity versus frequency (or versus Raman shift ).

 

Next article: General selection rules for scattered radiation
Previous article: Raman scattering
Content page of Raman spectroscopy
Content page of advanced chemistry

Raman scattering

Raman scattering is the inelastic scattering of light by molecules in which the exchanged energy reveals information about the molecules’ vibrational or rotational states.

This effect arises from the interaction between incident light and the internal energy structure of molecules, and is best understood by contrasting it with ordinary absorption. When a photon collides with a molecule and its frequency corresponds exactly to the energy difference between two stationary molecular states, the molecule can absorb the photon and undergo a real electronic, vibrational or rotational transition. In this situation, the photon’s energy matches the gap between two eigenstates of the unperturbed molecular Hamiltonian, making true absorption possible.

However, most photons do not have energies that coincide with any allowed transition. In such cases, absorption cannot occur, yet the photon may still interact with the molecule. This is the domain of Raman scattering, which is an inelastic scattering process rather than a resonant absorption phenomenon. The mechanism begins when the electric field associated with the incident photon interacts with the molecule’s charge distribution. Although the photon is detected as a particle, it propagates as an electromagnetic wave, and its oscillating electric field transiently polarises the molecule. In other words, an applied electric field can distort the molecule and induce an electric dipole moment , with a stronger field resulting in a larger dipole moment for a given molecule:

where is the polarisability of the molecule, which is a measure of the degree to which the electrons in the molecule can be displaced relative to the nuclei.

Since and , eq1 can also be written in matrix form as:

Here, is the -component of , where denotes the direction of the applied electric field and denotes the direction of the induced dipole moment component.

 

Question

Why is expressed as a rank-two tensor, known as the polarisability tensor, in eq2?

Answer

In general, the spatial distribution of electrons in a molecule is constrained by the geometry of the nuclei and the bonding orbitals. Consequently, an electric field applied in one direction can induce a redistribution of electron density in any direction permitted by those constraints, as is the case for a non-spherically symmetric molecule such as glycerol (C3H8O3). As an example, if the electric field lies along the laboratory -axis, then the distortion of the electron cloud are measured by the polarisability components , and .

For spherically symmetric particles, such as an atom or a spherical rotor, the same distortion is induced regardless of the direction of the applied field, resulting in all off-diagonal elements of the tensor being zero and the diagonal elements being equal, (see above diagram). The polarisability tensor of a diatomic molecule, whether homonuclear or heteronuclear, also has zero off-diagonal elements. However, while , where the -axis is chosen to be the molecular axis. Typically, .

Furthermore, is symmetric (e.g. ) because the work done to polarise a molecule depends only on the final state of the electric field, not the path taken to reach it. If a field is applied first in the -direction and then the -direction, the net change in potential energy due to the molecule’s polarisability must be the same as if the fields were applied in the reverse order.

 

The redistributed electrons transform the molecule from its ground state into a short-lived virtual energy state. Unlike a real excited state, this state is an eigenstate of the perturbed Hamiltonian. Because it exists only during the presence of the perturbing field, the virtual state lasts on the order of femtoseconds, with its existence permitted by the time–energy uncertainty principle. Since the virtual state is not stable, the molecule quickly relaxes. If it relaxes back to the same stationary state it occupied before the interaction, the photon re-emitted has the same energy as the incident photon. This elastic process is known as Rayleigh scattering, and it accounts for the vast majority of scattered photons (see diagram below).

But if the molecule relaxes to a different stationary vibrational or rotational state than the one it started in, energy must be conserved by adjusting the energy of the scattered photon accordingly. If the molecule ends up in a state of higher internal energy, the scattered photon necessarily loses that amount of energy and emerges with a lower frequency (Stokes scattering). Conversely, if the molecule relaxes to a lower one, the scattered photon gains the excess internal energy and is emitted with a higher frequency (anti-Stokes scattering). These two types of scattering are collectively known as the Raman effect or Raman scattering.

Thus, Raman scattering provides a window into the vibrational and rotational structure of molecules: the energy shifts — Stokes and anti-Stokes — encode the quantised energy differences between molecular states. By measuring these shifts, Raman spectroscopy reveals the “fingerprint” of molecular vibrations, enabling powerful chemical identification and structural analysis without requiring the incident light to resonate with any real molecular transition.

However, due to the instantaneous orientation of the molecule, different photons colliding with it may lead to different extents of polarisation according to the polarisation tensor. The transition probability for Raman scattering is even lower, at about one in ten million. Therefore, Raman spectroscopy requires a high-power laser source, with the complete instrumentation described in the next article.

 

Next article: Raman spectroscopy (instrumentation)
Content page of Raman spectroscopy
Content page of advanced chemistry

Determining infrared and Raman activities of molecular vibrations using character tables

Character tables provide a systematic group-theoretical framework for determining whether molecular vibrational modes are infrared– or Raman-active by examining how those modes transform relative to the electric dipole moment and polarisability operators.

The first step is to work out the symmetries of the normal modes of vibration of a molecule. For example, H2O lying in the -plane belongs to the point group. Two of its three normal modes (symmetric stretch and bend), as explained in an earlier article, transform according to the irreducible representation, while the remaining one (asymmetric stretch) transforms according to .

The character table (see above table) shows that the linear functions (or Cartesian coordinates) and also transform according to and respectively. This correspondence means that the collective atomic displacements associated with each of the three H2O normal modes transform in the same way under the symmetry operations of the group as the corresponding Cartesian vector component.

The second step is to recognise that an IR-active vibrational transition requires a change in the molecular dipole moment during the vibration. The dipole moment is a vector quantity whose non-zero components transform as the linear functions , and . If a normal mode transforms according to the same irreducible representation as one of these components, then the corresponding component of the dipole moment changes as the atoms are displaced from their equilibrium positions.

Therefore, the modes that transform according to and are IR-active.

For vibrations to be Raman-active, we refer to the quadratic functions because the components of the polarisability tensor transform in the same way as these functions. In this case, both the and  modes are Raman-active.

 

Question

Why are IR and Raman activities mutually exclusive for vibrational modes of centrosymmetric molecules?

Answer

See this article for explanation.

 

 

Previous article: Fourier-transform infrared spectroscopy
Content page of vibrational spectroscopy
Content page of advanced chemistry

Determining the normal modes of CO2 using the descent of symmetry method

The descent of symmetry method offers an efficient approach to determining the normal modes of CO₂.

The conventional procedure for determining the normal modes of a molecule involves the following steps:

    1. Forming a basis set of unit displacement vectors representing the instantaneous motions of the atoms.
    2. Determining geometrically the transformed vectors for each symmetry operation associated with the point group of the molecule.
    3. Constructing the symmetry operation matrix such that .
    4. Evaluating the trace of each matrix to obtain the character of the reducible representation, and decomposing each representation to determine the irreducible representations of the normal modes of the molecule.

However, CO₂ is a linear molecule belonging to the ​ point group, whose infinitely many symmetry elements render the above procedure impractical. The descent of symmetry method circumvents this difficulty by replacing ​ with a suitable finite subgroup, while preserving the essential symmetry properties of all degrees of freedom. This is justified because

    • the symmetry operations of both the parent group and its subgroup return the molecule to physically indistinguishable configurations; and
    • both the parent group and its subgroup act on the same set of basis functions (e.g. the Cartesian displacement vectors), from which reducible representations of either group are generated and subsequently decomposed to obtain the irreducible representations of the respective group.

As a result, all the normal modes of a molecule that transform according to certain irreducible representations of a parent group must also transform according to irreducible representations of the subgroup. The final step is to match the irreducible representations of both groups through a process known as correlation.

To proceed, we select an appropriate subgroup of for constructing the transformation matrices. While not strictly required, it is generally advantageous for the chosen subgroup to contain symmetry elements belonging to the same classes as those of the parent group, in order to minimise ambiguity during the correlation step. A common and convenient choice is the point group, which includes the key symmetry operations of , , and  preserved from the parent group.

Next, we follow steps 1 to 3 by imposing the symmetry operations on the set of nine unit displacement vectors . For example, the transformation matrix for inversion is:

where .

The eight transformation matrices form a reducible representation of the point group with the following traces:

Using eq27a, is decomposed to . Subtracting the translational modes () and the rotational modes () gives the vibrational representation:

The correlation step involves comparing the characters of symmetry operations common to both and , using their respective character tables. In particular:

    • Since rotation about the -axis (intermolecular axis) is not a physical degree of freedom for a linear molecule, characters associated with in are not used in the correlation. Instead, the characters of in are compared with those of and in .
    • Only the reflection planes and in  correspond to in ( does not).

This leads to the following correlation between the irreducible representations of the two groups:

It is evident that the vibrational mode transforming according to the totally symmetric irreducible representation in , and correspondingly according to (also totally symmetric) in , represents the symmetric stretching motion of CO2.

Two of the four vibrational modes are degenerate and transform according to when the full symmetry of CO2 is considered. These two modes describe bending motions in mutually perpendicular planes and share the same vibrational frequency. Upon descending from to , this degeneracy is lifted, and the doubly degenerate bending mode splits into two symmetry-distinct components transforming as and .

The remaining vibrational mode transforms according to and is antisymmetric with respect to inversion, identifying it as the antisymmetric stretching mode.

 

Question

How do we determine the IR and Raman activities of CO2 vibrational modes?

Answer

See this article for details.

 

Next article: Correlation diagram
Previous article: Subgroup (descent of symmetry)
Content page of group theory
Content page of advanced chemistry
Main content page

Exclusion rule in vibrational spectroscopy

The exclusion rule in vibrational spectroscopy states that, for molecules with a centre of symmetry, a vibrational mode that is infrared-active is Raman-inactive, and a mode that is Raman-active is infrared-inactive.

From a group‐theoretical perspective, the exclusion rule follows from the symmetry properties of the operators that govern infrared and Raman transitions. A vibrational mode is IR-active if its irreducible representation transforms like one of the linear basis functions (Cartesian coordinates) , , or , because the electric dipole moment operator has the same symmetry. It is Raman-active if its irreducible representation appears in the symmetry of the quadratic basis functions , , , , , or , which correspond to components of the polarisability tensor.

A point group to which a centrosymmetric molecule belongs always includes the inversion symmetry operator. Each of the basis functions , , or  changes sign (e.g. ) when it is inverted through the origin. On the other hand, quadratic basis functions, which are products of two linear functions, do not change sign upon inversion because both linear functions constituting the quadratic function change sign (e.g. and ).

This implies that the character corresponding to the inversion operation of an irreducible representation associated with a linear basis function is always different from that associated with a quadratic basis function. It follows that irreducible representations that are ungerade (u) can be IR-active but cannot be Raman-active, while irreducible representations that are gerade (g) can be Raman-active but cannot be IR-active. In other words, no vibrational mode of centrosymmetric molecules can be both IR- and Raman-active, which is the essence of the mutual exclusion rule.

This principle is clearly illustrated by the vibrational spectrum of CO2​, which has four vibrational modes belonging to the point group. The symmetric stretching vibration transforms as the ​ irreducible representation and is therefore Raman-active but IR-inactive. In contrast, the asymmetric stretching vibration transforms as ​, which only has the same symmetry as the function, making it IR-active but Raman-inactive. The two bending vibrational modes are doubly degenerate and transform as the ​ irreducible representation, which corresponds to the and functions. Consequently, the bending modes are also IR-active and Raman-inactive. Thus, no mode is simultaneously IR- and Raman-active.

Accordingly, the IR spectrum of CO2​ contains absorption bands corresponding to the asymmetric stretch and the bending vibrations, whereas the Raman spectrum shows a strong band associated with the symmetric stretch (see diagram below).

 

Question

Is the vibrational mode of O2 IR-active or Raman-active?

Answer

O2 vibrates by simply moving its two atoms closer and farther apart. Since both atoms are identical, this stretching mode is perfectly symmetric and transforms according to the irreducible representation of the point group. Therefore, it is Raman-active but not IR-active.

 

Previous article: Why components of the polarisability tensor transform like quadratic functions
Content page of Raman spectroscopy
Content page of advanced chemistry

Tight-binding model

The tight-binding model incorporates the Hückel approximation to describe the electronic structure of solids by treating electrons as localised around atoms and hopping between neighbouring sites.

Consider a linear chain of identical atoms located at position , with representing the s-orbital of atom . According to the Hückel approximation, the Hamiltonian is defined by two parameters:

1) is the energy of an electron localised on a single atom.

2) is the energy associated with an electron hopping between adjacent atoms.

3) All other matrix elements are zero.

To find the eigenvalues and eigenstates satisfying the Schrödinger equation , we adopt the linear combination of atomic orbitals (LCAO) approach, in which

Multiplying the Schrödinger equation on the left by the bra gives:

Substituting eq1 into eq2, and noting that the states  are orthonormal, yields:

Expanding the first summation in eq3 and applying the Hückel approximation results in:

Substituting the trial solution into eq4, and imposing the boundary conditions of , gives:

This rearranges to:

Using the trigonometric identity , with  and yields:

which simplifies to:

Since , we have . So, , which when substituted into eq5 gives:

where .

Although is an integer due to the condition , its specific range in eq6 follows the fact that each eigenvalue is associated with one of linearly independent eigenstates described by eq1.

 

Question

Show that and are trivial solutions.

Answer

If , then and everywhere, which is a trivial solution. If , then and is again zero everywhere. Therefore, the non-trivial solutions correspond to .

 

An electron localised on an atom resides in a bound state with energy measured relative to the vacuum level (). Therefore, the conventional value of is negative. For s-orbitals, is also negative because corresponds to an integral of the form (positive wavefunction) (negative potential) (positive wavefunction). It follows that the lower-energy states occurs when is small in eq6 (so that the cosine term is close to +1 for large ). In other words, and represent the lowest and highest energy states of the system respectively (see diagram below).

One important application of the tight-binding model is its role in explaining band theory, which will be explored in the next article.

 

Next article: Band theory
Content page of solid-state chemistry
Content page of advanced chemistry
Main content page

Band theory

Band theory describes how atomic orbitals in a solid combine to form extended molecular orbitals, whose closely spaced energies effectively create continuous bands.

Consider a linear chain of identical atoms located at position , with representing the s-orbital of atom . When two atoms overlap, molecular orbital (MO) theory states that two MOs are formed: a bonding orbital and an antibonding orbital. For three atoms, three MOs are produced — bonding, non-bonding and antibonding — while the orbital overlap of four atoms results in four MOs: two bonding and two antibonding orbitals. In general, a chain of atoms generates  molecular orbitals. If is large, the energy levels of these MOs merge to form an energy band (see diagram above).

Mathematically, the energies are derived using the tight-binding model:

where , and .

To show that the energies merge to form an energy band, we analyse the energy separation:

Using the trigonometric identity ,

As and in eq7.

For example, the 1s-orbitals of Na, which has the electronic configuration 1s22s22p63s1, overlap to form the 1s band. Similarly, the 2s orbitals form the 2s band, and so on (see diagram above). The various Na bands do not overlap because the energy differences between different Na atomic orbitals (AOs) are much greater than the separation within each type of AO. Another characteristic of the band-MO diagram of Na is that the 3s band is only partially filled. In contrast, the 3s and 3p bands of Mg overlap to form a continuous band. Nevertheless, this merged band is still only partially filled because each 3p AO is initially unoccupied.

In metals, the Fermi energy — the energy of the highest occupied electronic state at absolute zero — lies within a partially filled band. However, it is more meaningful to associate frontier energies with the Fermi level , defined as the energy level at which the probability of finding an electron is 50% at thermodynamic equilibrium according to the Fermi-Dirac distribution, for any temperature above 0 K.

Electronic states of metals near the Fermi energy at absolute zero arise from the overlap of many AOs and are extremely closely spaced in energy. Because the density of states varies only weakly in this region and the gaps between occupied and unoccupied states are extremely small, increasing the temperature from 0 K to room temperature excites only a tiny fraction of electrons into slightly higher-energy levels. This redistribution is so small that the energy at which the occupation probability is 50% at room temperature remains essentially the same as the Fermi energy at 0 K. Consequently, electrons can easily move into the empty states just above the Fermi level when an electric field is applied, giving metals their high electrical conductivity. This conductivity decreases at higher temperatures due to increased collisions between moving electrons and the vibrating lattice atoms (phonons), which disrupt the flow of charge.

On the other hand, solid Ne is a non-conductor. Each Ne atom has the electronic configuration 1s22s22p6, with the 2p band completely filled in the solid. The next available band (derived from 3s orbitals) lies far higher in energy, creating a large band gap. With no empty states near the top of the filled band, electrons cannot be thermally promoted into any empty states, and the material remains insulating. In such insulators, the Fermi level lies within the band gap.

The main types of semiconductors are intrinsic (undoped), p-type and n-type (see diagram above). They also possess a band gap between the valence band and the conduction band. For an intrinsic semiconductor, such as GaAs, the valence band is completely filled at absolute zero, while the conduction band is completely empty. Therefore, the Fermi level lies near the middle of the band gap. Although the gap (1.42 eV) is much larger than the thermal energy at room temperature (0.026 eV), it is still small enough that a small but statistically significant number of electrons can be thermally excited across it. Once promoted into the conduction band, these electrons can be driven by an applied electric field to produce a current. Unlike metals, the conductivity of semiconductors increases with temperature because the number of thermally generated charge carriers grow exponentially, easily outweighing the increased scattering from lattice vibrations.

When dopants are introduced into intrinsic semiconductors, they create additional electronic states within the band gap. In a p-type semiconductor, acceptor impurities introduce energy levels slightly above the valence band. At 0 K, these acceptor levels are empty, while the valence band remains fully occupied. It follows that the Fermi level lies between the valence band and the acceptor levels. Electrical conductivity arises when electrons are thermally promoted from the valence band into the acceptor levels, leaving behind mobile holes in the valence band. These holes serve as the majority carriers responsible for current flow.

n-type semiconductors contain donor levels positioned just below the conduction band. At 0 K, these donor levels are fully occupied by electrons supplied by the dopant atoms, with the Fermi level lying between the donor levels and the conduction band. When electrons are promoted from the donor levels into the conduction band, they become free to move, producing a current.

In conclusion, band theory provides a molecular-orbital perspective on how electronic structure governs conductivity across different materials and forms the foundation for devices such as transistors and solar cells.

 

Next article: Superconductor
Previous article: Tight-binding model
Content page of solid-state chemistry
Content page of advanced chemistry
Main content page

Superconductor

A superconductor is a material that, below a certain critical temperature, conducts electricity with zero electrical resistance and expels magnetic fields.

The first superconductor was discovered in 1911, when Dutch physicist Heike Kamerlingh Onnes observed that mercury’s electrical resistance suddenly vanished at about 4.2 K. In the years that followed, additional elements were identified as superconductors, with critical temperatures below 23 K. These superconducting elements are mostly transition metals that are bunched into certain parts of the periodic table (elements highlighted in yellow in the above diagram). In contrast, the noble metals (such as copper, silver and gold) and the alkali metals generally do not become superconducting at ambient pressure.

Metals normally resist the flow of electricity. Electrons, the carriers of current, constantly scatter off vibrating atoms (phonons), impurities and even each other. Yet, below a certain temperature, some metals suddenly lose all electrical resistance.

How can electrons stop scattering entirely? The answer lies in a profound insight developed by John Bardeen, Leon Cooper and Robert Schrieffer, known as the BCS theory.

According to this theory, electrons stop acting like isolated particles. Instead, they form highly coordinated pairs that move in perfect harmony, allowing current to flow without energy loss. At first glance, the idea that electrons can pair seems impossible. After all, they repel each other electrically. But the key insight is that electrons can interact indirectly through the crystal lattice. When an electron moves through a lattice, it slightly attracts the positively charged ions nearby, creating a tiny distortion. This distortion, in turn, can attract a second electron (see diagram below). This effective attraction is extremely weak, only acts for electrons near the Fermi surface, and operates within a narrow energy range. Such a phonon-mediated interaction is generally isotropic (the same in all directions) in momentum space, or at least does not have a strong directional dependence.

In 1956, Leon Cooper showed that if even the tiniest attraction exists between two electrons just above the Fermi level, they will inevitably form a bound pair — a Cooper pair. Essentially, the enormous density of available electronic states near the Fermi surface acts like a magnifying glass for even the tiniest attractions. Because so many states are packed closely together, two electrons with a weak attractive interaction have an overwhelming number of ways to pair up. This abundance of possibilities amplifies the effect of the weak attraction, making it sufficient to bind electrons into a Cooper pair. Even interactions that would be negligible elsewhere in the energy spectrum become decisive near the Fermi surface, ensuring that pairing always occurs when conditions are right.

A Cooper pair isn’t two electrons clinging together like magnets. It is a quantum superposition extending over thousands of lattice spacings, characterised by:

    • A total spin of 0 (singlet)
    • A total momentum of 0 (electrons at and )
    • A highly delocalised wavefunction that strongly overlaps with others

The wavefunction of the Cooper pair is given by:

where

is a function describing the amplitude
is an operator that creates an electron with momentum and spin
is an operator that creates an electron with momentum and spin
is the vacuum state where the system has zero particles

 

Question

Why does the Cooper pair have zero momentum?

Answer

In conventional superconductors, a Cooper pair with a total linear momentum of zero minimises kinetic energy, making this configuration energetically favourable. As the phonon-mediated interaction is approximately isotropic in momentum space, the Cooper pair’s spatial wavefunction is spherical with zero angular momentum .

Electrons are fermions and the total wavefunction must be antisymmetric under exchange of the two electrons. The total wavefunction of the pair is the product of a spatial part, which is symmetric (), and a spin part, which must then be antisymmetric, forming a singlet.

 

Because the wavefunctions overlap so extensively, all Cooper pairs in the metal lock into a single quantum state. This creates a kind of “quantum super-traffic-jam,” where every pair is aware of every other pair through a shared phase. When billions of Cooper pairs occupy the same quantum state, they form a macroscopic quantum condensate, a unified wavefunction stretching across the entire metal. This condensate is rigid and tiny disturbances cannot knock individual pairs out of step.

How then is the material able to conduct electricity without resistance? Resistance arises when electrons scatter off imperfections or vibrations. Cooper pairs, however:

    • Are spread out over huge distances
    • Share a common quantum phase
    • Cannot scatter individually

For a Cooper pair to scatter, all pairs must do so simultaneously, which requires a prohibitive amount of energy. Small impurities or low-energy phonons simply cannot disrupt this concerted motion. Once the condensate begins moving, it keeps moving forever unless something strong enough breaks the pairs. Furthermore, breaking a Cooper pair costs energy, creating the superconducting energy gap. This gap shields the condensate from thermal fluctuations, explaining why superconductors exhibit zero electrical resistance and persistent currents at low temperatures where phonon-mediated interactions are not disrupted by thermal motion of lattice ions. In other words, electrons in a superconductor below its critical temperature move together forever, without resistance.

This remarkable ability to carry persistent currents without energy loss is directly exploited in technologies that require stable, intense magnetic fields. For example, in magnetic resonance imaging (MRI) machines, niobium-titanium coils are cooled below their critical temperature (10 K) with liquid helium to form persistent currents. These currents generate strong and extremely stable magnetic fields, which are essential for producing high-resolution images of internal tissues. Because the currents circulate indefinitely without decay, MRI magnets do not require continuous power input to maintain their fields, making them highly efficient and reliable. Beyond medical imaging, the same principle is exploited in quantum computing, where persistent, lossless currents are critical.

The second defining property of a superconductor is its ability to expel magnetic fields from its interior, such that . This phenomenon is called the Meissner effect, and it occurs even if an external magnetic field was already present before the material was cooled.

In a normal conductor, magnetic fields penetrate the material almost uniformly (see diagram above), limited only by weak and short-lived effects such as induced eddy currents. If the external magnetic field changes with time, Faraday’s law generates eddy currents in the conductor, but these currents quickly decay because the material has finite electrical resistance. As they die out, they no longer oppose the applied field, allowing the magnetic field to fully penetrate the bulk.

In contrast, when a superconductor cools below , Cooper pairs form and create a coherent superconducting condensate. In the presence of an external magnetic field, the condensate develops spatial variations that raise its kinetic energy, which in turn induces circulating screening currents along the surface. These currents generate magnetic fields that exactly oppose the applied field inside the material. Because the resistance is zero, the screening currents persist indefinitely without any power source, maintaining complete magnetic field expulsion.

Consequently, when a magnet is brought near a superconducting material, the induced screening currents produce a magnetic field that repels the magnet. This repulsive force can counteract gravity, allowing the magnet to float or levitate above the superconductor. Conversely, if the superconductor is placed above a magnet, it can also hover, held in place by the interaction between its induced currents and the magnetic field.

This principle of magnetic levitation can be harnessed to create frictionless transportation. In 1986, high-temperature superconductors ( of up to 138 K) were discovered. These materials, typically containing copper oxide and other metals such as barium and yttrium, are exemplified by YBa2Cu3Ox (), where .

In maglev trains, YBa2Cu3Ox magnets are mounted on the train and cooled inside well-insulated chambers using liquid nitrogen. These magnets interact with permanent magnets or electromagnets embedded in the track, causing the train to levitate and eliminating friction with the track. By carefully controlling the magnetic fields along the track, the train can also be propelled forward. Changing the position or intensity of the track’s magnetic fields induces forces on the superconducting magnets via electromagnetic induction, pushing or pulling the train along its path. The combination of frictionless levitation and contactless propulsion allows maglev trains to achieve very high speeds with minimal energy loss, making them a highly efficient transportation technology.

 

Question

Why is Yba2Cu3Ox a superconductor when the individual elements are not superconducting?

Answer

YBa2Cu3Ox has a layered, perovskite-like crystal structure, with alternating layers of CuO₂ planes and other layers containing yttrium and barium (see above diagram for the unit cell of YBa2Cu3O7). The CuO₂ planes are where superconductivity primarily occurs, while the other layers act as charge reservoirs and provide structural support for the lattice. This arrangement creates an energetically favourable environment in which the electronic properties necessary for superconductivity can emerge.

Within the CuO₂ planes, strong hybridisation between copper orbitals and oxygen orbitals creates an extensive two-dimensional network of electronic states. The superconductivity itself is enabled by controlling the oxygen content through a process known as doping, which introduces holes (positively charged carriers) into the planes. These holes are the microscopic charge carriers that pair up to form Cooper pairs.

Furthermore, the two charge carriers that form a Cooper pairs in the CuO₂ planes are much more tightly bound than in conventional superconductors. This results in a very short coherence length (the characteristic size of a Cooper pair) of only a few nanometres, compared with the tens to hundreds of nanometres typical in phonon-mediated superconductors such as NbTi. The short coherence length reflects a much stronger effective attractive interaction, likely arising from magnetic (spin-fluctuation) mechanisms rather than phonons. This stronger pairing interaction is one of the key reasons YBa₂Cu₃Oₓ can sustain superconductivity at temperatures far higher than those of conventional superconductors.

In essence, superconductivity in YBa2Cu3Ox is not due to the individual elements, but emerges from the specific arrangement of copper and oxygen atoms in the planes, combined with proper doping and crystal structure.

 

Previous article: Band theory
Content page of solid-state chemistry
Content page of advanced chemistry
Main content page
Mono Quiz