Rotation operator

A rotation operator is a unitary operator that rotates quantum states by an angle about an axis in a Hilbert space.

Mathematically, it is given by:

where , the angular momentum operator, is the generator of rotations about axis .

To derive eq100, we begin by noting that the probability outcome of a quantum mechanical measurement is described by the Born interpretation:

where and are wavefunctions representing quantum states.

Since rotations are symmetry operations that do not change physical probabilities, we require:

Using the matrix identity (see property 13 of this article for proof), we have . Therefore, is unitary, where .

Let’s approximate as a power series of a small change in :

where and are first and second order rotation generator matrices respectively.

Since a rotation by zero angle must do nothing, , and eq101 (ignoring higher order terms) becomes

To preserve unitarity, . Expanding this equation and ignoring higher-order terms gives or . This implies that is not Hermitian, which contradicts the postulate that all physical observables in quantum mechanics are represented by Hermitian operators. It follows that eq102 must have the following form:

For a quantum mechanical operator to represent the same physical observable in the passively rotated frame, its expectation value must be the same whether calculated in the initial frame or with respect to the new frame:

Substituting into eq104 gives , which means that and are related by the similarity transformation or equivalently,

where because .

Substituting eq103 into eq105 yields:

where we have let the operator be the position operator .

Expanding eq106 and ignoring higher order terms results in , or equivalently,

where .

To determine the nature of , we refer to the active classical rotation of the vector by an infinitesimal angle about an arbitrary axis represented by the unit vector (see diagram above), with the infinitesimal change in given by:

where the cross product results in a vector with a magnitude proportional to and a direction tangential to the path of rotation.

Using the methodology of replacing classical variables with their quantum mechanical analogues to derive quantum mechanical expressions, eq108 becomes:

where we have added a minus sign because in eq108 is defined by an active rotation about the axis, while in eq107 describes a passive rotation about the axis.

 

Question

Why is an active rotation and a passive rotation related by a negative sign in eq109?

Answer

An active rotation refers an anticlockwise rotation of a position vector in a fixed coordinate system by the angle (see this article for details). In a passive rotation, the position vector remains stationary while the coordinate system rotates around it. To produce the same effect on the coordinates, a passive rotation corresponds to an anticlockwise rotation of the coordinate system by . In other words, a passive rotation of the coordinate system by  has the same result as an active rotation of the vector by . Therefore, .

 

Substituting into eq109 and expanding it gives:

Since , and , we have:

Substituting into 107 and setting to be the -axis for simplicity gives:

Comparing eq110 with eq111 yields:

Comparing these three commutation relations with the total angular commutation relations of , and , we find that . In other words, is the -th component of the total angular momentum operator, expressed in units of .

A finite rotation by an angle can be constructed by apply successive infinitesimal rotations, each by an angle . From eq103,

 

Question

Show that .

Answer

Let . So, . Using the Taylor series for small , we have

As , we find that , or equivalently, . So, . This makes eq112 the definition of the matrix exponential function.

 

Therefore, eq112 becomes:

Eq113 is usually written as:

where the eigenvalues of and hence are expressed in units of

Since is a finite-dimensional linear operator acting on a finite-dimensional vector space, can be represented by an matrix in both group theory and quantum mechanical calculations. In general, the rotation operator in an Euler angle system, which is defined by three successive rotations by the angles , and , is given by:

where , and .

The application of eq115 involves the rotation operator transforming a quantum state with total angular momentum projection along the molecular axis into a linear combination of states with the projection along the lab -axis:

where , the entries of the Wigner D-matrix, are the coefficients of .

 

Question

Why is the transformed state a linear combination of ?

If eq114 describes an operator that rotates quantum states by an angle about an axis in a Hilbert space, how do three consecutive rotations described by eq115 ensure the complete transformation from the molecular coordinate system to the lab coordinate system?

Answer

In quantum mechanics, the states form a complete orthonormal basis in the Hilbert space corresponding to a fixed . Therefore, any state with total angular momentum , including the rotated state , can be expressed as a linear combination of the basis states .

In the diagram above, represent the molecular axes, while represent the lab-frame axes. Without assuming any specific rotation convention, the diagram illustrates that any target orientation can be achieved through three consecutive rotations. The first rotation about brings into the -plane. The second rotation, about , aligns with . Finally, the last rotation about aligns the remaining axes of the two coordinate systems.

 

Next article: Wigner D-matrix
Previous article: Centrifugal distortion
Content page of rotational spectroscopy
Content page of advanced chemistry
Main content page

SO(3) Group

The SO(3) group, or special orthogonal group in three dimensions, is an infinite group of all 3D rotations.

Each element of the group corresponds to a rotation operator, characterised by a unit rotation axis and an angle . Since there are infinitely many possible rotation angles and axes, SO(3) contains infinitely many elements. These elements satisfy the properties of a group. For example, the binary operation of two rotation operators, each with a specific rotation angle about an axis, results in another rotation operator, demonstrating the closure property.

Consider the rotation of the spherical harmonics around the -axis. When the rotation operator acts on by an angle around the -axis, only the azimuthal angle is affected and each basis is transformed into . The transformation for all basis functions is summarised as:

 

Question

Prove that if , then .

Answer

Consider the Taylor series of the exponential function . Since

We have

 

In other words, , and according to the above Q&A,

where is the -component of the angular momentum operator, expressed in units.

Since an irreducible representation of a group is expressed by a set of matrices (or matrix-valued functions) that represent the group elements as linear operators on a vector space, an irreducible representation of the SO(3) group is dimensional (see eq120), with the set forming a basis for the irreducible representation for a fixed .

 

Question

Why does the matrix representation of about the -axis belong to an irreducible representation of SO(3) when it is diagonal?

Answer

An irreducible representation is a group representation whose matrices cannot be simultaneously transformed, via the same invertible similarity transformation, into block diagonal form. For SO(3), an irreducible representation consists of matrices representing rotation operators about different axes. Even though the matrix of about the -axis is diagonal, which implies reducibility on its own, other matrices in the same representation, such as about the -axis, contain non-zero off-diagonal elements. These non-diagonal matrices cannot all be simultaneously diagonalised by the same similarity transformation. Therefore, the matrix representation of about the -axis belongs to an irreducible representation of SO(3).

 

The character of the rotation matrix is , which is a geometric series , where , and . This implies that

Using Euler’s formula of ,

The SO(3) group does not have a standard character table in the same way that finite point groups do. Its irreducible representations are labelled by a non-negative integer , which form a discrete set. However, the group elements and their associated characters are continuous functions that depend on the rotation angle.

Nevertheless, for illustrative purposes, we can express the relationships between group elements, their irreducible representations and corresponding characters in the following way:

where each can be further expanded as follows:

 

Question

Why is for ?

Answer

From eq122, for . At , this expression is indeterminate, but we can evaluate the limit. Substituting into the trigonometric identity gives . So,

Substituting the half-angle formula into the above equation yields . Therefore, . Since  is represented by the identity matrix, the trace equals the dimension of the representation of . Using the same logic, for .

 

In general, a rotation operator of the SO(3) group is given by (see this article for derivation):

where , and are the Euler angles.

The question, then, is whether the table relating the general rotation group elements, their irreducible representations and corresponding characters will be the same as that of a single-axis, single-angle rotation operator such as ?

The character of an irreducible representation of SO(3) depends only on the total rotation angle, not on the specific rotation axis or the individual Euler angles. Since SO(3) is the infinite group of all possible 3D rotations, there exists a one-to-one correspondence in symmetry between three consecutive rotations described by and a single rotation described by . In other words, even though is expressed in terms of Euler angles, it represents the same group element as some , because we can always find a symmetry operation about an appropriate axis that produces the same group element and therefore the same character value. Hence,

Finally, the rotation of , the Wigner D-matrix elements, for fixed and fixed is given by:

This shows that, for fixed , the set forms a basis of the same -dimensional irreducible representation of SO(3) as the set . Since are eigenfunctions of the angular momentum operators ( and ), it follows that are also eigenfunctions of the corresponding operators and . In other words, are also rotational wavefunctions.

 

Question

Elaborate on why are also eigenfunctions of and  if are eigenfunctions of and .

Answer

As mentioned earlier, (or equivalently ) is a generator of an irreducible representation of SO(3). Its matrix representation is

However, the matrix itself, is not an element of an irreducible representation of SO(3). , or any component of the angular momentum operator, is related to the rotation operator , by (see eq121 for an example). Using the Taylor series definition of the exponential function yields:

Applying the commutation relation identities and gives:

Since , where is a constant, and commutes with , i.e. ,  we have:

According to Schur’s first lemma, any non-zero matrix that commutes with all matrices of an irreducible representation of a group is a multiple of the identity matrix. Therefore, (or ) must be a multiple of the identity operator within the -dimensional subspace:

When (or ) acts on any basis function within this irreducible subspace, we obtain:

This shows that the basis functions  are eigenfunctions of and . Similarly,, are eigenfunctions of and .

 

Previous article: Correlation diagram
Content page of group theory
Content page of advanced chemistry
Main content page

Coriolis effect and Coriolis coupling

The Coriolis effect is the apparent deflection of a moving object when observed from a rotating reference frame.

Consider a person holding a ball and standing on the edge of a large circular platform that is rotating anti-clockwise as viewed from above. At the twelve o’clock position, the person throws the ball directly towards the centre point O of the platform.

From the perspective of a stationary observer hovering above the platform (i.e. in an inertia frame of reference), the ball follows a straight-line path towards point X, slightly to the left of O. This trajectory occurs because, at the moment of release, the ball has two components of velocity:

    • A radial component directed towards O.
    • A tangential velocity due to the rotation of the platform at the point of release (in this case, towards the nine o’clock direction).

The resulting trajectory is the vector sum of these two components, which points towards X.

However, to the thrower, who is in the rotating frame of reference, the motion of the ball appears quite different. In this rotating frame, the ball seems to curve clockwise, as if being deflected to the right of its intended path. This illusion arises because, after release, the ball retains its original tangential speed, while the platform between the thrower and point O are rotating more slowly (closer to the center, where tangential speed is lower). In other words, the ball appears to be moving ahead of the rotating platform beneath it.

It follows that an apparent force, acting perpendicular to the direction of the ball’s motion, is influencing its path. This fictitious force, introduced to account for the apparent deflection experienced in rotating frames, is known as the Coriolis force.

This phenomenon isn’t just a curiosity of rotating platforms. On the molecular level, the Coriolis force introduces a crucial vibration-rotation interaction (also known as Coriolis coupling) in the dynamics of a rotating molecule — an interaction that would otherwise be neglected in a first-order approximation, where vibration and rotation are treated as independent motions.

When a molecule rotates, its internal motion (vibration) occurs simultaneously within the rotating molecular frame. To an observer in this rotating frame, the atoms undergoing vibration appear to be deflected perpendicular to their vibrational velocity. The resulting Coriolis force, associated with this apparent deflection, acts to couple the molecule’s vibrational angular momentum with its overall rotational angular momentum.

In quantum mechanics, this coupling appears as an additional term in the Hamiltonian that couples vibrational and rotational states:

where is proportional to the product of vibrational and rotational angular momentum operators.

Explicitly,

where

denotes the molecular principal axes ().
is the operator for the -component of the total angular momentum of the molecule (rotation).
is the operator for the -component of the vibrational angular momentum.

This leads to shifts in rotational energy levels depending on the vibrational state, as well as the splitting or mixing of rotational levels associated with degenerate vibrations. In the infrared spectrum, this manifests as an anomalous splitting of the P, Q and R branches.

On a much larger scale, the Coriolis effect plays a crucial role in shaping natural systems on Earth. Because Earth is a rotating sphere, this effect influences the motion of air masses, ocean currents and weather patterns. For instance, a cyclone’s spin results from two forces acting on the atmosphere simultaneously:

    • The pressure gradient force moves air from a high-pressure region to a low-pressure region.
    • The Coriolis effect deflects that moving air

A low-pressure system is essentially a partial vacuum that draws in air from all directions. Consider a low-pressure region in the Northern Hemisphere (the upper yellow region in the diagram below). To an observer in a rotating frame of reference (e.g. a satellite moving at the same angular velocity as Earth from west to east), winds moving from the equator towards the low-pressure region appear to be deflected to the right, while winds flowing towards the region from the north appear to be deflected to the left. The combined movements result in an anti-clockwise sprial. The same principle causes winds to sprial clockwise in the Southern Hemisphere.

In conclusion, the Coriolis effect demonstrates the profound influence of rotational motion across vastly different scales — from affecting molecular behaviour to shaping global weather systems. At the molecular level, the principle manifests as Coriolis coupling, where rotational motion interacts with vibrational motion, subtly altering energy levels and spectral properties. On Earth, it governs the deflection of winds and ocean currents, giving rise to the rotation of cyclones and other large-scale atmospheric patterns. Together, these phenomena highlight the unifying power of rotational dynamics in both macroscopic and microscopic systems.

 

Previous article: Vibration-rotation spectra of polyatomic molecules
Content page of rovibrational spectroscopy
Content page of advanced chemistry
Main content page

Vibration-rotation spectra of polyatomic molecules

Vibration-rotation spectra arise from the simultaneous vibrational and rotational transitions of molecules, typically observed in the infrared (IR) region of the electromagnetic spectrum. These spectra provide detailed information about molecular structure, bond strength, and moment of inertia.

For a molecule to be IR active, it must have a permanent dipole or a dipole moment that changes over time. Since polyatomic molecules have vibrational modes, some which do not result in a change in the molecules’ dipole moments, only certain vibration-rotation transitions are IR active. The transition selection rules are given by eq7 or eq8b, depending on whether the molecule is linear or non-linear.

 

Linear molecules

An example of a polyatomic linear molecule is CO2, which has the same expressions for , and as those for diatomic molecules. The symmetric stretch of CO2 is IR inactive because the net electric dipole moment is always zero (see diagram above).

 

Question

What are the components of the rotational quantum number ?

Answer

The quantum number represents the magnitude of the coupled total angular momentum vector , where

is the nuclear rotation angular momentum (end-over-end rotation).
is the electronic orbital angular momentum.
is the vibrational angular momentum.
is the electronic spin angular momentum.

 

In contrast, the antisymmetric stretch is IR active because it produces a dipole moment that varies with displacement. Since CO2 is a closed-shell molecule, . Furthermore, this vibrational mode does not produce vibrational angular momentum (). Therefore, the absorption of the photon’s angular momentum must be accounted for by , resulting in (P-branch or R-branch).

The degenerate bending modes are also IR active but differ from the asymmetric stretch in an important way: they can involve vibrational angular momentum. Because the two bending modes occur in perpendicular planes and are degenerate, their linear combination can produce a circular (or elliptical) motion of the O atoms about the C atom, effectively giving rise to vibrational angular momentum. This allows the molecule to absorb the photon’s angular momentum along an alternate pathway without requiring a change in the rotational quantum number. In other words, after photon absorption, but the vector addition of and can result in the magnitude of being unchanged (possible when ), enabling transitions to appear in the IR spectrum as the Q-branch.

 

Symmetric tops

For a symmetric top, its molecular rotational energy is given by eq51. Therefore, eq9 becomes:

Any allowed vibrational transition of a symmetric rotor like CH3I can involve a change in the dipole moment along the molecule’s symmetry axis (known as a parallel transition), as well as changes in and , which are perpendicular to (perpendicular transitions). The general selection rules governing parallel transitions for symmetric rotors are given by eq8:

while those for perpendicular transitions are given by eq8a:

It follows that the parallel transition expressions for , and  are the same as those for linear molecules. For perpendicular transitions, each branch has two sub-branches depending on :

 

Therefore, the IR spectrum for CH3I (see diagram above) is more complicated than that for CO2. Furthermore, there are vibrational modes, which are categorised into six types:

The first three modes are symmetric vibrations (see diagram below), with a change in dipole moment parallel to the C3 axis. The remaining three modes are doubly degenerate asymmetric vibrations, with a change in dipole moment perpendicular to the C3 axis.

 

Spherical tops

Although a spherical top is non-polar overall, some of its vibrational modes are IR active due to temporary dipole moments during vibration. The first-order approximation of the vibration-rotation energy levels of a spherical top is also given by eq9, with the selection rules governing IR-active transitions between these levels being the same as those for linear molecules. An example is CH4, which has vibrational modes (see table and diagram below): one symmetric stretch, one doubly degenerate bend, and two triply degenerate modes (one stretch and one bend).

Mode

Symmetry Description

Wavenumber /cm-1

Symmetric C-H stretch

IR inactive

Symmetric bend (scissoring)

IR inactive

Asymmetric C-H stretch

3020

Asymmetric bend (umbrella)

1300

However, only the modes are IR active due to their symmetry and ability to cause a dynamic dipole moment (see diagram below).

In conclusion, the derivation of the three branches using the first-order vibration-rotation energy levels (eq9 and eq17) of an IR-active polyatomic molecule allows us to analyse most IR spectra with ease. However, these energy levels are more complex due to changes in at higher , centrifugal distortion, anharmonicity and vibration-rotation interactions such as Coriolis coupling.

 

Next article: coriolis effect and Coriolis coupling
Previous article: Vibration-rotation spectra of diatomic molecules
Content page of rovibrational spectroscopy
Content page of advanced chemistry
Main content page

Vibration-rotation spectra of diatomic molecules

Vibration-rotation spectra arise from the simultaneous vibrational and rotational transitions of molecules, typically observed in the infrared (IR) region of the electromagnetic spectrum. These spectra provide detailed information about molecular structure, bond strength, and moment of inertia.

For a molecule to be IR active, it must have a permanent dipole or a dipole moment that changes over time. Homonuclear diatomic molecules, such as O2, lack these properties and are therefore IR inactive. In contrast, when a heteronuclear diatomic molecule absorbs IR radiation, it can undergo a change in its vibrational energy level along with a change in its rotational state, resulting in a distinctive pattern of spectral lines. These energy levels (expressed in wavenumbers), characterised by the rigid rotorharmonic oscillator approximation, are the sum of its allowed rotational and vibrational energies:

The selection rules governing transitions are given by eq7:

However, the condition applies only to molecules with a nonzero projection of the electronic orbital angular momentum onto the internuclear axis (i.e. ), such as NO. To elaborate, the rotational quantum number represents the magnitude of the coupled total angular momentum vector, given by

where:

is the nuclear rotation angular momentum (end-over-end rotation).
is the electronic orbital angular momentum.
is the vibrational angular momentum.
is the electronic spin angular momentum.

Note: Nuclear spin angular momentum is excluded because it couples only weakly to the other components.

 

Question

What is the relationship between the component vectors of and their corresponding quantum numbers?

Answer

Note: The spin quantum number should not be confused with the electronic state .

 

The selection rules for a pure vibrational transition are , while those for pure rotational transitions (ignoring the -related fine structure transitions, which are often unresolved in most experimental spectra) are . Additionally, the total angular momentum of the system must be conserved during a photon-mediated transition, as a photon carries one unit of angular momentum.

For a closed shell heteronuclear diatomic molecule like HCl ( ), , with . Therefore, the absorption of the IR photon’s angular momentum during must be accounted for by , resulting in .

Conversely, the ground state of NO () has an unpaired electron in a degenerate molecular orbital, giving , , . This allows the coupling (vector addition) of and to reorient internally in such a way that it can exactly compensate for the photon’s angular momentum (see diagram above). In other words,  can be preserved without changing , making the transitions , where , no longer forbidden.

In general, the absorption lines in a vibration-rotation spectrum can be grouped into three types, called branches (see diagram below). The R branch consists of all transitions, with energies given by:

where .

transitions form the P branch, with energies:

where .

The energy separation between adjacent lines in the P and R branches corresponds to . Therefore, the vibration-rotation spectrum, like the pure rotation spectrum, allows the moment of inertia and bond length of molecules to be calculated. The relative intensities of the lines in the P and R branches depend on the product of and , the same mechanism that governs the intensities in a pure rotational spectrum.

 

Question

According to the above diagram, the rotational constant is smaller for NO than for HCl. Why?

Answer

The rotational constant is given by , where . Since the reduced mass of HCl is smaller than that of NO, .

 

Finally, the Q branch, if allowed by selection rules, consists of  transitions, all of which have the same energy:

This means that transitions such as and appear as a single line in the spectrum under the rigid rotor-harmonic oscillator approximation. In reality, the Q branch appears as a series of closely spaced lines rather than a single line. This is because changes with the vibrational quantum number , where the bond length increases slightly in higher vibrational states, causing the moment of inertia to increase and to decrease. From eq9,

Eq14 shows that transitions such as and now appear as separate lines. Since , the spacing between these lines decreases with . At lower resolution, these closely spaced lines give the Q-branch the characteristic appearance of a single, strong, broad peak.

Using the same logic,

which rearranges to:

As the spacing between the lines of the R-branch decreases slightly as increases.

 

Question

Is the change in  at higher the same as centrifugal distortion?

Answer

No, it is different from centrifugal distortion. When a molecule vibrates at higher , the average bond length increases slightly. This is purely a vibrational effect. In contrast, centrifugal distortion is a rotational effect, which also causes the bond to stretch slightly as the rotational speed increases at higher . In other words, the former phenomenon depends on , while the latter depends on . If centrifugal distortion in considered, eq9 becomes and

 

For the P-branch,

Since the third term on RHS of eq16 is negative, the lines of the P-branch diverge slightly as increases. Note that eq14, eq15 and eq16 reduce to eq13, eq11 and eq12 respectively if .

 

Question

Why do many IR spectra have broad, continuous peaks rather than discrete lines?

Answer

Many common IR spectra are taken in the liquid phase (or solid), which leads to broadened, continuous-looking peaks. In the liquid or solid phase, molecules interact strongly with each other via van der Waals forces and hydrogen bonding. These interactions distort the the dipole moments of molecules and broaden energy levels. On the other hand, intermolecular forces are minimal in dilute samples of gaseous molecules. When these gas-phase samples are analysed with high-resolution instruments, discrete peaks — especially rotational fine structure — can be observed.

 

Next article: Vibration-rotation spectra of polyatomic molecules
Previous article: Selection rules for Vibration-rotation transitions
Content page of rovibrational spectroscopy
Content page of advanced chemistry
Main content page

Selection rules for vibration-rotation transitions

The selection rules for vibration-rotation transitions govern which simultaneous changes in vibrational and rotational energy levels are allowed when a molecule absorbs or emits infrared radiation.

These rules arise from quantum mechanical principles and the requirement that transitions involve a change in the molecule’s electric dipole moment , and they combine the selection rules for pure vibrational and pure rotational transitions.

In determining the selections rules for pure vibrational and pure rotational transitions, we evaluated conditions under which and  separately. However, the selection rules for vibration-rotation transitions require that . To show that this ultimately reduces to the selection rules for pure vibrational and pure rotational transitions, we begin with

where , and are the orthogonal lab-frame axes; , and are the orthogonal principal axes of inertia of the molecule (molecular frame); and the ’s are unit vectors along the axes.

The relationship between the two frames is shown in the diagram above, with the origin of both frames commonly placed at the centre of mass of the molecule. The lab-frame is stationary in space and does not rotate with the molecule, whereas the molecular frame moves with the molecule and rotates as it does. , known as the line of nodes, denotes the line passing through the intersections of the and planes. The orientation of the molecular frame with respect to the lab frame is described by the Euler angles , , and .

From eq1,

where , and .

Similarly, and . However, we can simplify the analysis to , which can always be arbitrarily chosen as the polarisation direction of the incident oscillating radiation.

To express the cosines in terms of the Euler angles, we need to derive the full rotation matrix according to the convention of a specific sequence of three consecutive elemental rotations to describe any 3D orientation.

The first rotation is about the lab-frame -axis by :

The second rotation is about the axis, also known as the line of nodes , by . It lies in the initial -plane but is fixed in its direction after the first rotation and is perpendicular to both the initial -axis and the final -axis:

The last rotation is about the axis, which is also the molecular -axis, by :

Therefore, the full rotation matrix is:

A rotation matrix is the transpose of the change of basis matrix , which in this case is

Therefore,

and eq2 becomes

We can also determine and in terms of the Euler angles by repeating the steps above.

Now, certain vibrational motions change a molecule’s electric dipole moment, such that , where represents the normal coordinates typically used to describe vibrational motions. Similarly, rotational motions also affect the dipole moment, with , where collectively denotes the Euler angles. This implies that the dipole moment is a function of both and , i.e. . Since the components , and in eq3 are projections of onto the molecular axes, which rotate with the molecule, they are independent of the Euler angles, and must therefore be functions of alone.

If we assume that the atoms in the molecule vibrate with a small amplitude about their equilibrium positions, we can expand , and  in a Taylor series, e.g.

where for non-linear polyatomic molecules and for linear molecules.

Substituting eq4 (by ignoring the higher terms) and the corresponding expansions for and  into gives:

where

.
.
.
.
.

 

Question

Why is ?

Answer

Consider an arrow in a 3D space. To fully describe its rotation, we must determine its complete orientation. This is achieved by pointing the arrow in a certain direction and spinning it around its shaft. In other words, we must first specify the direction of a chosen axis of a 3D object (like the arrow’s shaft), and then the rotation about that axis to describe its rotation. In the Euler angle framework, the first two angles and define a direction on the unit sphere (like latitude and longitude on Earth). Small changes to this direction are expressed as (see diagram below, where ). The third angle determines the extent of rotation around that direction. Therefore, the combined infinitesimal change in the object’s orientation in rotation space is .

 

Since the vibrational wavefunctions are orthonormal, the first integral on RHS of eq5 is non-zero only if for all , i.e. when there is no vibrational transition. Assuming , the selection rules associated with this term pertain to pure rotational transitions. The second term is non-zero only if both and the terms in the square bracket are non-zero. This implies that the selection rules associated with the 2nd term correspond to those of both pure vibrational and pure rotational transitions.

For a polar linear molecule with as the molecular axis, while , and the first term on RHS of eq5 reduces to if for all . Furthermore,  for non-degenerate normal modes like stretching, but may not be zero for degenerate modes like bending. Despite that, and therefore, the general selection rules for vibration-rotation transitions of polar linear molecules, which require for linearly polarised light, are:

In practice, eq6 is written more simply as:

The condition is omitted because the -related fine structure is often unresolved in most experimental spectra. Additionally, the inclusion of highlights that total angular momentum conservation must be satisfied during a photon-mediated vibrational transition, even though is forbidden for most linear molecules in electric dipole transitions.

Eq7 also corresponds to be the vibration-rotation selection rules for spherical tops.

For a symmetric rotor, any allowed vibrational transition can involve a change in along the symmetry axis (known as parallel transitions), as well as changes in and , which are perpendicular to (perpendicular transitions). Since the pure rotational selection rules for a symmetric rotor (neglecting -related fine structure), where the permanent dipole lies along the symmetry axis, are , the selection rules for parallel transitions are:

In the case of a perpendicular transition, and , while . This results in the following selection rules:

 

Question

Explain why for a perpendicular transition.

Answer

The rotational wavefunction can be approximated as the Wigner D-functions . Since ,

The integral is nonzero only if either exponent is zero, which happens when or , i.e. . The same logic when applied to also results in .

 

Combining eq8 and eq8a, the general vibration-rotation selection rules for a symmetric rotor are:

 

Next article: Vibration-rotation spectra of diatomic molecules
Content page of rovibrational spectroscopy
Content page of advanced chemistry
Main content page

Mean molecular energies of non-interacting molecules

The mean molecular energy of a system of non-interacting molecules is defined as the average internal energy per molecule, calculated over all possible quantum states accessible to the molecule at a given temperature.

This average is weighted by the Boltzmann probability of each state and reflects the contributions from translational, rotational, vibrational and electronic motions, depending on the system’s complexity.

Mathematically, the mean molecular energy of a molecule in thermal equilibrium is given by:

where is energy the -th state measured relative to the ground state energy of the molecule, and is the probability of the molecule being in that state.

Substituting eq251 into eq300 gives:

where

is the molecular partition function,
is the Boltzmann constant,
is the absolute temperature,
.

Since and , eq301 can be expressed as:

or

 

Question

Why does eq303 (or eq302) involve a partial derivative?

Answer

A partial derivative is used because  may be dependent on a few variables, such as and (see eq266).

 

Substituting eq257 into eq303 yields:

where , , and .

 

Mean translational energy

The mean translational energy of a non-interacting molecule is derived by substituting eq266 into to give:

 

Question

Show that each of the three components of is equal to .

Answer

Substituting eq261 into results in:

Let’s focus on the first derivative in eq306. Substituting eq265 into it yields:

Similarly, each of the other two components is equal to because and .

 

Thus, each translational degree of freedom contributes , consistent with the equipartition theorem.

 

Mean rotational energy

The mean rotational energy of a non-interacting heteronuclear linear molecule at low temperatures is derived by substituting eq270 into to give:

At very low temperatures, almost all molecules occupy the ground state with , so eq307 reduces to . As the temperature increases, is approximately given by expanding the summations in eq307 and differentiating to yield:

At higher temperatures, we substitute eq275, where and , into to give:

which is consistent with the equipartition theorem.

Since the symmetry number cancels out, the mean rotational energy of a non-interacting symmetrical linear molecule at high temperatures is also . Unlike linear rotors, which have two independent rotational degrees of freedom, spherical rotors, symmetric rotors and asymmetric rotors have three. Substituting eq280, eq286 and eq287 separately into and differentiating results in for each of the three types of rotors, which is again consistent with the equipartition theorem.

 

Mean vibrational energy

The mean vibrational energy of a non-interacting diatomic molecule oscillating harmonically at low temperatures is derived by substituting eq291 into , where , to give:

If we compute the derivative in eq310 and multiply the result by , we get:

At high temperatures, , which allows us to expand as a Taylor series () to give:

As mentioned in an earlier article, eq291 evaluates absolute energy levels, including the zero-point energy. If we derive using eq293 instead of eq291, we obtain:

Although the equipartition theorem states that the mean vibrational energy of a classical oscillator at high temperatures is equal to , both eq312 and eq313 are consistent with this result. This is because the theorem accounts only for the thermal contribution to vibrational energy; the zero-point energy term is a quantum mechanical artifact that is independent of temperature.

For polyatomic molecules, each normal mode of vibration behaves approximately like a separate harmonic oscillator, with the total vibrational partition function given by eq294:

where  for non-linear molecules, for linear molecules, and is the number of atoms.

So,

Since ,

In other words, the total mean vibrational energy of a non-interacting polyatomic molecule oscillating harmonically is the sum of the mean energy of each normal mode. This implies that

which are consistent with the equipartition theorem at high temperatures.

 

Mean electronic energy

The mean electronic energy of a non-interacting molecule is derived by substituting eq296, where ,  into to give:

At low temperatures, . Therefore, all the numerator terms in eq315 approach zero, giving:

This can be explained qualitatively: the energy gap between the ground electronic state and the first excited electronic state of a typical molecule is very large, which results in the molecule occupying only the electronic ground state, which has zero energy.

At higher temperatures, the mean electronic energy increases as excited electronic states become thermally accessible. However, this regime is often not reached before the molecule dissociates

 

Next article: Thermodynamic functions involving the Molecular partition function
Previous article: Electronic Molecular partition function
Content page of statistical thermodynamics
Content page of advanced chemistry
Main content page

The canonical ensemble

A canonical ensemble is a collection of identical, non-interacting copies of a physical system, each with a fixed number of particles , fixed volume , and constant temperature (maintained by thermal equilibrium with a heat bath), in which particles within each system may interact.

The word “canonical” means “according to a rule”, which, in this case, refers to fixed , and . As mentioned in the above definition, a canonical ensemble clearly states that even though the individual systems in the ensemble are identical from a thermodynamic perspective, they may not be identical at the molecular level. Such a theoretical construct serves to connect the microscopic behaviour of systems (governed by quantum mechanics or classical mechanics) to the macroscopic thermodynamic properties.

 

Question

Why do we only consider three parameters for each system?

Answer

While a physical system, such as a salt solution, may involve many microscopic variables (like individual particle positions and velocities), its macroscopic behaviour can often be fully characterised by a small set of thermodynamic parameters. In the canonical ensemble, fixing , and is sufficient to statistically describe the equilibrium properties of the system.

 

All thermodynamic properties are, in essence, measured as averages. Consider each system in the ensemble as a rigid container holding a gas. The pressure of the gas in each system is the time average of countless rapid collisions of gas particles with the walls of the container. The force on the wall fluctuates constantly, but a pressure gauge cannot register each individual fluctuation. Instead, the gauge averages the force over a short time to provide a stable reading. Similarly, while the kinetic energy of individual molecules varies moment to moment, the temperature reflects the average kinetic energy of all molecules at thermal equilibrium.

Due to fluctuations in properties such as pressure, kinetic energy or even quantum-mechanical variables, each system can occupy many possible energy states (microstates) at any given moment. In quantum mechanical systems, the energy corresponding to each microstate is determined by solving the Schrodinger equation , where is the wavefunction that describes the quantum state resulting from the positions and momenta of all particles in the system, and is the total energy of system for the -th microstate.

If the number of systems in the ensemble is taken to be infinitely large, we postulate that:

Postulate 1

 The macroscopic properties of a system are given by statistical averages over all possible microstates in the ensemble at any given moment (ensemble average).

For example, the thermodynamic internal energy is equal to the statistical average energy of the systems in the ensemble:

where is the probability that a system in the ensemble is in the -th microstate with energy .

To determine , we further postulate that:

Postulate 2

All microstates with equal energy have equal probability of occurring for any system of fixed volume, composition and temperature.

Therefore, if we consider any such system in thermal equilibrium with a constant temperature heat bath, and identify two of its microstates, 1 and 2, with energies and respectively, the corresponding probabilities are given by and , where is the number of systems with energy . It follows that the relative probability is:

In thermodynamics and quantum mechanics, energy is always referenced to a zero of energy , which can be defined in different ways. Since the function represents a physical quantity, it must yield the same result regardless of how is defined. This is only possible if the function depends on the difference between and , i.e. .

 

Question

Show that the function is invariant to .

Answer

Let and . We have , which does not depend on .

 

Multiplying by and using gives:

Letting and , we can rewrite eq122 as:

Taking the natural logarithm on both sides of eq123 and then differentiating with respect to  yields:

Using the chain rule,

Similarly, taking the natural logarithm on both sides of eq123 and then differentiating with respect to yields:

Equating eq124 and eq125 results in:

Each side of eq126 is a function of a different independent variable. The only way eq126 can be valid for all values of and is if both functions are equal to the same constant :

Integrating , i.e. , gives or equivalently,

where .

Accordingly, we have:

Substituting eq129 into yields or:

where .

can be evaluated by summing both sides of eq130 over :

Since ,

Substituting eq131 back into eq130 results in:

where .

Eq133 represents the general probability of finding a system, which is in thermal equilibrium with a constant temperature heat bath, in a specific microstate with energy . Its derivation relies on fundamental principles of probability and the nature of equilibrium, not on the specific nature of the particles themselves (e.g., whether they’re distinguishable or indistinguishable). In other words, eq133 is applicable to both classical and quantum mechanical systems. Therefore, eq120 becomes:

where

is known as the canonical partition function.

The fact that eq133 yields the same expression as the Boltzmann distribution is not a coincidence. It demonstrates that the Boltzmann distribution naturally emerges from the statistical treatment of systems in thermal equilibrium, and illustrates the deep consistency between thermodynamic principles and statistical reasoning.

 

Question

Can two different microstates have the same energy?

Answer

Yes. Consider a system with two particles. In one microstate, particle A has an energy of 1 unit, and Particle B has an energy of 2 units. In another microstate, particle A has an energy of 2 units, while Particle B has an energy of 1 unit. These two distinct microstates (or configurations) clearly have the same total energy of 3 units. Microstates that share the same energy are called degenerate states.

 

Since different microstates can be degenerate, the probability that a system has energy is:

where is the degeneracy of .

 

Next article: canonical partition function
Previous article: Statistical thermodynamics (overview)
Content page of statistical thermodynamics
Content page of advanced chemistry
Main content page

Canonical partition function for a system of non-interacting particles

The canonical partition function is a central quantity in statistical mechanics, encapsulating the thermodynamic behaviour of a system in equilibrium with a heat reservoir at fixed temperature. It allows us to derive key thermodynamic quantities of a system, such as pressure (see eq146) and internal energy (see eq148). These relationships, e.g. , applies to both systems with interacting and non-interacting particles.

For a system with interacting particles, the Schrödinger equation is incredibly complex due to the interaction terms between the particles, making it difficult to calculate the exact energy of a given microstate ​. On the other hand, in a system of non-interacting particles, intermolecular forces are absent. This simplifies the evaluation of significantly, since the Hamiltonian operator for a system of non-interacting particles can be expressed as . The total energy of the system then becomes the sum of the individual particle energies,

Here refers to the energy of particle 1 in quantum state when the system is in microstate . The energy of each particle is determined by solving the corresponding one-particle Schrödinger equation, e.g. .

If the non-interacting particles are distinguishable (e.g. atoms or molecules with fixed positions in a solid), then the microstate of the system is defined by the quantum state of each individual particle. For example, microstate 1 might have the configuration , while microstate 2 is . The sum over all microstates of the system, by substituting eq160 into eq155, is given by:

which is equivalent to:

where we have set and for illustrative purposes.

Each term on the RHS of eq161 corresponds to a distinct microstate, represented by a specific permutation of exponential factors. Eq161 can also be written as:

We can show that eq162 is equivalent to eq161 by expanding it:

If we multiply out all the terms in eq163, each resulting term corresponds to a unique microstate, as expressed in eq161. Therefore,

where , ,…,  are known as the molecular partition functions.

Since each  represents the total statistical weight of a single particle, the superscript in becomes redundant. Thus, we can express a molecular partition function more simply as . If molecules are of the same species, where , then

 

Question

Using the example of a system of 3 distinguishable, identical, non-interacting particles, show that the sum over all microstates of the system then becomes the product of the separate sums of the quantum states of each particle.

Answer

Suppose each particle can be in one of two quantum states with energy and . The total possible microstates is , comprising of:

Microstate Total energy

So,

Similarly,

 

It follows that for distinguishable, identical, non-interacting particles,

If the identical, non-interacting particles are indistinguishable, such as those of an ideal gas, and if each particle occupies a distinct quantum state, then the total number of microstates is no longer given by the number of permutations of the quantum states, but by the number of combinations. One way to approximate in this case is to divide eq166 by a factor :

This gives at high temperatures, where each particle is assumed to be in a distinct quantum state. It is important to note that the relation is not a strict mathematical identity, because is not a literal count of microstates but a weighted sum of exponential terms, each corresponding to a microstate. Nevertheless, this approximation works well in the classical limit and successfully resolves the Gibbs paradox.

 

Next article: Molecular partition function
Previous article: Thermodynamic functions involving the Canonical partition function
Content page of statistical thermodynamics
Content page of advanced chemistry
Main content page

Thermodynamic functions involving the canonical partition function

Thermodynamic functions involving the canonical partition function describe thermodynamic properties of a canonical ensemble.

In statistical mechanics, the canonical ensemble provides a powerful framework for connecting microscopic states to macroscopic thermodynamic properties. Central to this approach is the canonical partition function, , which encapsulates the statistical behaviour of a system in thermal equilibrium at fixed temperature, volume, and particle number. From , one can derive key thermodynamic functions such as pressure, internal energy and entropy through straightforward mathematical relations.

In the previous article, we derived the expressions for pressure and internal energy:

where and .

 

Question

How does connect quantum mechanics to thermodynamics?

Answer

, the pressure of a system, is a macroscopic thermodynamic property, while is the energy of a quantum mechanical microstate, which is an eigenvalue of the Hamiltonian. Therefore, the equation provides a bridge between the microscopic quantum description and macroscopic thermodynamic observables.

 

The corresponding expression for entropy can be derived by first dividing the fundamental equation of thermodynamics by to give:

Applying the chain rule to eq148 gives:

Substituting eq146 and eq157 into eq156 yields:

Integrating eq158 results in , where is a constant. As , only the ground state is populated. Assuming the ground state is non-degenerate, the partition function becomes , with . According to the third law of thermodynamics, entropy approaches zero as the temperature approaches absolute zero. Therefore, must be zero to satisfy this requirement, resulting in:

It follows that the Helmholtz energy expression is:

Using the pressure and internal energy equations, the expression for enthalpy is:

 

Question

Does eq159b contradict the application of enthalpy, which describes a system at constant pressure?

Answer

The definition of enthalpy, , holds for any thermodynamic state, regardless of whether the pressure is constant. The condition of constant pressure becomes relevant when evaluating the change in enthalpy, , which equals the heat exchanged in a constant-pressure process . Eq159b expresses the enthalpy of a system in terms of , and . It gives the enthalpy for a particular equilibrium state, not just under constant pressure. Therefore, we can calculate the enthalpy of a system at any given state using eq159b. By evaluating it at two different states (e.g. the initial and final states of a process), we obtain . If the process connecting those states occur at constant pressure, then . So, there is no contradiction.

 

The expression for constant-volume heat capacity is:

For Gibbs energy, we have . Hence,

If we define chemical potential as , and is an integer-valued discrete variable, then the derivative becomes a finite difference, so:

 

Next article: canonical partition function for a system of non-interacting particles
Previous article: Canonical partition function
Content page of statistical thermodynamics
Content page of advanced chemistry
Main content page
Mono Quiz