Mesomeric effect

The mesomeric effect is the electron-donating or electron-withdrawing effect of a substituent transmitted through a conjugated π-system by resonance.

Resonance, or the resonance effect, typically occurs when atoms are connected by alternating single and multiple bonds, where electrons can be shared across several atoms instead of being localised. For instance, the π-electrons in benzene are dislocalised over the entire ring rather than confined to three double bonds depicted in a Lewis structure.

In other words, the mesomeric effect is an electron-donating (+M) or electron-withdrawing (-M) effect via resonance. It requires a functional group attached to a conjugated system and focuses on how the group changes the electron distribution.

 

Question

Is the mesomeric effect different from the inductive effect?

Answer

Yes, the inductive effect is the displacement of electron density through -bonds along a chain of atoms due to the presence of an electronegative or electropositive atom or group. In contrast, the mesomeric effect involves the delocalisation of electrons through π-bonds. Although the mesomeric effect is governed mainly by resonance and orbital interactions, the electronegativities of atoms in substituents may be considered when analysing the mesomeric effect of a molecule (e.g. via the Hückel method). Nevertheless, the two effects are generally different. For example, lone pairs on atoms such as O or N often give rise to a +M effect. However, these same atoms may exert a –I effect at the same time. Therefore, the mesomeric and inductive effects can act in the same direction or in opposite directions in a molecule with a substituent.

 

 

-M effect

The Hückel method is the standard semi-empirical approach to quantify the mesomeric effect in a conjugated system. Consider the acrolein molecule CH2=CH-CHO, which contains two π-bonds, one of which belongs to an electron-withdrawing formyl group (-CHO). In general, we can express the one-electron molecular orbital (MO) wavefunction of acrolein as:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the oxygen atom, corresponds to the carbonyl carbon,  corresponds to the middle ethylene carbon and corresponds to the terminal ethylene carbon.

One quick way to analyse the MO energies of acrolein is to refer to the secular determinant of butadiene:

which represents a four- atom framework with two alternating π-bonds.

To account for the electronegativity of oxygen, we modify the Hückel parameters as follows:

where

is the Coulomb integral for oxygen, with being a factor that lowers the orbital energy.
is the Coulomb integral for the carbonyl carbon.
is the resonance integral for overlap between C and O.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

The next step involves estimating trial values for , and . Since oxygen is more electronegative than carbon, we set . This results in C=O being a stronger bond compared to C-C. Consequently, we set . The corresponding inductive effect by oxygen on the the carbonyl carbon causes it to be slightly electronegative relative to the adjacent ethylene carbon. We can therefore set . It is important to note that although the electronegativity of oxygen helps stabilise the system, it is not the primary cause of the mesomeric effect, which is fundamentally due to conjugation and orbital overlap.

Using the matrix identity of , and multiplying both sides of eq390 by , give

where .

Expanding the determinant gives the characteristic equation:

with solutions , which when substituted back into yields:

MO Type
1.52 Antibonding
0.31 LUMO
-1.16 HOMO
-2.86 Bonding

 

Question

Show that the isolated C=O Hückel bond energy is .

Answer

The secular determinant for the C=O bond is

Solving for gives . Therefore, the bonding MO energy is .

 

The total ground state π-electron energy, , is lower than the sum of the energies of isolated C=C () and C=O () bonds by , indicating the delocalisation energy provided by the mesomeric effect. To further understand the electron distribution of the molecule, we determine the coefficients of the wavefunctions as follows:

which implies:

The simultaneous equations for can be solved by substituting into them and expressing all coefficients in terms of :

Normalisation gives:

Therefore, . Repeating the calculations for and yields:

Coefficient  (bonding)  (HOMO)  (LUMO) Ground state electron density
c4 0.07 0.60 0.67 0.73
c3 0.20 0.70 -0.21 1.06
c2 0.50 0.21 -0.60 0.59
c1 0.82 -0.35 0.37 1.59

where the ground state electron density is given by for two electrons in each MO.

The value of 1.59 at oxygen indicates significant accumulation of π-electron density. This arises from the –M effect of the carbonyl group, where electron density is delocalised towards the oxygen atom. In contrast, the terminal carbon and the carbonyl carbon are electron deficient due to this delocalisation (see resonance structures below). While oxygen’s electronegativity contributes to the overall polarisation of the molecule, the calculated electron densities primarily reflect the mesomeric effect rather than the inductive effect. In this case, both effects act in the same direction.

The calculations also suggest regioselectivity of acrolein in reactions with nucleophiles. For example, “hard” (ionic) nucleophiles, such as LiAlH4 or Grignard reagents, are driven mainly by electrostatic attraction and attack the most electron-deficient carbonyl carbon (with ground state electron density of 0.59), resulting in 1,2-addition reactions (see diagram below). On the other hand, “soft” (covalent) nucleophiles, like enolates, thiols or organocuprates, are governed by orbital interactions. They attack the atom where the LUMO has the largest “lobe” (), which is the terminal ethylene carbon of acrolein, leading to 1,4-addition (Michael addition) reactions.

 

+M effect

The +M effect is typically attributed to a substituent with lone pair electrons. Consider vinylamine CH2=CH-NH2, with the general one-electron MO wavefunction:

where represents the p atomic orbital wavefunctions; 1 denotes the nitrogen atom, 2 denotes the middle ethylene carbon and 3 denotes the terminal ethylene carbon.

The secular determinant is given by:

where

is the Coulomb integral for nitrogen, with being a factor that lowers the orbital energy.
is the resonance integral for overlap between C and N.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

Let’s set due to the electronegativity of N, and because the C-N bond is slightly weaker than the C=C bond. This translates to:

with the corresponding characteristic equation:

Substituting the solutions back into yields:

MO Type
1.13 LUMO
-0.68 HOMO
-1.95 Bonding

Unlike acrolein, in which one electron from each of the four atoms populates the MOs, vinylamine has two nitrogen electrons (lone pair) and one electron from each carbon atom contributing to its bonding and highest occupied MOs. The total ground state π-electron energy, , is lower than the sum of the energies of isolated C=C bond () and the nitrogen lone-pair (), indicating a delocalisation energy of provided by the +M effect.

 

Question

Show that the isolated nitrogen lone pair energy is .

Answer

Since nitrogen’s lone pair is not yet delocalised into the π-system, its energy corresponds to twice the Coulomb integral , which is .

 

The coefficients of the wavefunctions are given by:

Substituting the three values of into the matrix equation and solving it gives:

Coefficient  (bonding)  (HOMO)  (LUMO) Ground state electron density
c3 0.24 0.72 -0.65 1.15
c2 0.48 0.49 0.73 0.94
c1 0.85 -0.48 -0.22 1.91

Since the ground state electron density of a neutral, non-polarised one-electron atom is 1.0, the ground state electron density of the nitrogen lone pair of 1.91 is lower than its unpolarised value of 2.0 (density = number of electrons per unit volume). In contrast, the electron density of the terminal carbon of 1.15 is higher than its non-polarised value of 1.0. This shift of electron density from N to the terminal C is a consequence of the +M effect of the amine group, even though N exerts a –I effect on its neighbours. In this case, mesomeric and inductive effects act in opposite directions. Furthermore, the relatively electron-rich terminal carbon and nitrogen atoms (ground state electron density > 1.0) make the molecule susceptible to electrophilic attack.

In terms of regioselectivity, a “hard” electrophile like H+ (small, concentrated positive charge) is attracted to N, which has the highest ground state electron density of 1.91. Protonation at N forms an ammonium ion. H+ can also attack the terminal carbon (electron density = 1.15) to yield an iminium ion. The relative proportion of ammonium and iminium ions formed depends on temperature and reaction time.

The ammonium ion forms quickly at relatively low temperatures through fast collisions and is considered the kinetic product. However, it is not energetically favourable because formation of the -bond between H+ and N disrupts the conjugated system, reducing the molecule’s delocalisation energy. Given enough time for equilibration at room temperature, the resonance-stabilised  iminium ion, in which the positive charge is delocalised between carbon and nitrogen, becomes the thermodynamic product and predominates. If water is present, this iminium ion quickly hydrolyses to acetaldehyde and an ammonium salt, which explains why vinylamines (and enamines) are often used as intermediates to functionalise carbonyls.

On the other hand, a “soft” electrophile such as CH3I, in which the the partially positive carbon is diffuse due to iodine’s polarisability, seeks the largest HOMO lobe for orbital overlap. This leads to formation of a new carbon-carbon bond at the terminal carbon of vinylamine.

Overall, the reactivity of vinylamine illustrates how electron density, substituent effects and the nature of the electrophile work together to determine both regioselectivity and product distribution. Understanding these effects allows chemists to predict and control reaction outcomes, making vinylamines valuable intermediates in organic synthesis.

 

Next article: Ortho/Para and meta directing groups
Previous article: Inductive effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Ortho/para and meta directing groups

Ortho/para-directing and meta-directing groups are substituents that guide incoming electrophiles to the ortho and para positions or to the meta position, respectively, on an aromatic ring.

This directing effect arises from how the substituent interacts with the ring’s electron density. Ortho/para-directing groups are typically electron-donating (e.g. –OH, –NH₂, –CH₃); they increase electron density at the ortho and para positions through resonance or inductive effects, stabilising the intermediate carbocation during electrophilic aromatic substitution. In contrast, meta-directing groups are usually electron-withdrawing (e.g., –NO₂, –COOH, –CN); they pull electron density away from the ring, making the ortho and para positions less stable for substitution and thus favouring attack at the meta position.

 

Ortho/para-directing groups

A classic example of an ortho/para-directing group is the hydroxyl group in phenol. Although it is inductively withdrawing (-I) due to the electronegativity of the oxygen atom, the lone pair of electrons on oxygen exerts a +M effect on the benzene ring, overwhelming the inductive effect and activating the ortho and para positions for electrophilic substitution.

To quantum-mechanically analyse the mesomeric effect of the hydroxyl group, we consider the general molecular orbital (MO) wavefunction of phenol:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the oxygen atom, corresponds to the substituted carbon, and through correspond to the remaining carbons in an anticlockwise direction.

Applying the Hückel method gives the following secular determinant:

where

is the Coulomb integral for oxygen, with being a factor that lowers the orbital energy.
is the resonance integral for overlap between C and O.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

Incidentally, the minor of the above secular determinant is the secular determinant of benzene.

The next step involves estimating trial values for  and . Since oxygen is more electronegative than carbon, we set , lowering the MO relative to carbon. Furthermore, we set , which is lower than the reference of for every resonance integral between carbon atoms, because the oxygen’s 2p orbitals are more contracted than carbon’s, leading to a less effective overlap between the p orbitals of oxygen and carbon versus that between two carbon atoms.

Using the matrix identity of , and multiplying both sides of the secular determinant equation by , give

where .

Expanding the determinant yields the characteristic equation:

with solutions

Substituting the solutions back into yields:

MO Type
1.994 Antibonding
1.192 Antibonding
0.617 LUMO
-0.213 HOMO
-0.744 Bonding
-1.419 Bonding
-2.427 Bonding

The reactivity of phenol is reflected in its HOMO energy level, , which is higher than that of benzene (). Because the HOMO is higher in energy, the electrons are less tightly held and more nucleophilic than those in benzene towards electrophiles. To further understand the directing effects of the hydroxyl group, we determine the coefficients of the HOMO wavefunction as follows:

which implies:

Using an optimising tool, such as Microsoft Excel Solver, the normalised HOMO coefficients are approximately:

 Coefficient  (HOMO) HOMO electron density
c0 0.623 0.777
c1 (ipso) -0.323 0.208
c2 (ortho) -0.344 0.236
c3 (meta) 0.121 0.030
c4 (para) 0.415 0.344
c5 (meta) 0.121 0.030
c6 (ortho) -0.344 0.236

where the HOMO electron density (see diagram below) is given by for two electrons.

The calculated values show that the ortho and para carbons have higher electron densities than the meta carbons, with the para carbon having the highest. This is due to the +M and –I effects of the hydroxyl group. When phenol reacts with an electrophile such as bromine at low temperatures in non-polar solvents (e.g. CS2 or CCl4), mono-brominated products are formed, giving a para-bromophenol to ortho-bromophenol ratio of about 80:20. At higher temperatures, phenol reacts with aqueous bromine to yield the fully substituted 2,4,6-tribromophenol. Therefore, the hydroxyl group is known as an ortho/para-directing group. Substituents with similar effects include the amino group (-NH2), substituted amino groups (-NR2), alkoxy groups (-OR), phosphino and thio groups (-PR2 and -SR) and the phenyl group (-C6H5).

 

Meta-directing groups

An example of a meta-directing group is the ammonium group -NH3+ in the anilinium ion C6H5-NH3+. This substituent is purely inductively withdrawing (-I) due to the positively charged nitrogen atom. To analyse the inductive effect of the ammonium group quantum mechanically, we consider the general molecular orbital (MO) wavefunction of the anilinium system:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the substituted carbon, and through correspond to the remaining carbons in an anticlockwise direction.

As the nitrogen atom lacks a lone pair of electrons to interact with the benzene π-system, the substituted carbon atom is treated as being perturbed by the electron-withdrawing group, leading to the following secular determinant:

Expanding the determinant yields the characteristic equation:

with solutions

Substituting the solutions back into yields:

MO Type
1.891 Antibonding
1.000 Antibonding
0.705 LUMO
-1.000 HOMO
-1.317 Bonding
-2.278 Bonding

To further understand the directing effects of the ammonium group, we determine the coefficients of the HOMO wavefunction as follows:

Solving the above six simultaneous equations, along with the normalisation equation, using algebra for the HOMO coefficients and Excel Solver for the remaining coefficients, give:

Coefficient  (HOMO) Electron density
c1 (ipso) 0.646 -0.517 0.000 1.370
c2 (ortho) 0.413 -0.082 0.500 0.855
c3 (meta) 0.295 0.409 0.500 1.008
c4 (para) 0.259 0.621 0.000 0.904
c5 (meta) 0.295 0.409 -0.500 1.008
c6 (ortho) 0.413 -0.082 -0.500 0.855

where the electron density is given by for two electrons in each MO.

The calculated values show that the meta carbons have higher electron densities than the ortho and para carbons. When the anilinium ion reacts with an electrophile such as the nitronium ion NO2+ at low temperatures, m-anilinium is the major product. Therefore, the ammonium group is known as a meta-directing group. Substituents with similar effects include the nitro group (-NO2), sulfonyl groups (-SO2R), the cyano group (-CN), formyl and acyl groups (-CHO and -COR) and the carboxyl group (-CO2H).

 

Question

Why doesn’t the electrophile attack the ipso (latin for “itself” or “that very one”) carbon and displace the ammonium group?

Answer

Although the Hückel method indicates that the ipso carbon has the highest electron density, it considers only the π-electrons and does not account for constraints imposed by the σ-framework. Electrophilic attack at the ipso carbon would require either displacement of the ammonium group or formation of a stable σ-complex. Both pathways are energetically very unfavourable because -NH3+ is a poor leaving group and the ipso carbon has already reached its maximum valency.

 

Previous article: Mesomeric effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

The twin paradox

The twin paradox is an apparent contradiction in special relativity where each of two twins in relative motion expects the other to age more slowly, yet the travelling twin ages less because their paths through spacetime are not equivalent.

Consider twin Jane and Jill. Suppose Jane makes a round trip from Earth to a star system 10 light-years away and back at 80% the speed of light (). From Jill’s perspective on Earth, Jane takes years to complete the entire trip. Using the time dilation equation , where , Jill calculates how much proper time elapses along Jane’s worldline and concludes that Jane ages years during those 25 Earth years.

From Jane’s point of view, she is stationary and Jill is moving. A direct application of time dilation might suggest the opposite result, that Jill ages less, leading to an apparent paradox. However, due to length contraction, the distance to the star is no longer 10 light-years in Jane’s frame; it is reduced to light-years, with the round trip taking  years to complete.

Therefore, the two perspectives do not contradict each other: both agree that Jane ages less and experiences about 15 years during the journey. Although each twin sees the other’s clock running slow locally, Jane covers a shorter contracted distance at high speed. Applying time dilation and length contraction consistently shows that she ages less.

 

Question

Wouldn’t Jane’s spaceship need to accelerate (speed up, slow down or stop) when leaving Earth and at the turnaround point?

Answer

Not necessarily. The scenario can be reformulated so that no single traveller undergoes acceleration. Instead of departing from rest on Earth, Jane can be imagined as already moving at a constant speed and passing Earth. As she passes, she synchronises her clock with Jill’s. At the distant point 10 light-years away, a second spaceship carrying a third observer (Judy) passes Jane while moving towards Earth at the same constant speed. At that event, Judy compares clocks with Jane and then continues towards Earth. When Judy reaches Earth, her clock is compared with Jill’s. In this way, the entire analysis can be carried out using only inertial observers. The result is that the travelling clocks (Jane’s and Judy’s) accumulate less proper time than Jill’s clock on Earth.

 

Next article: Why does a current flowing through a wire generate a magnetic field
Previous article: Length contraction
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Secular equations in the variational method

Secular equations in the variational method are linear equations obtained by minimising the energy with respect to trial-function coefficients, yielding a matrix eigenvalue problem whose solutions give approximate energy levels and corresponding wavefunctions.

Consider the eigenvalue equation , where and the coefficients are real. Multiplying the eigenvalue equation on the left by and integrating over all space gives:

where .

Using the orthonormality condition  results in:

Taking the partial derivative of with respect to each coefficient gives:

Applying the variational method by setting and computing the remaining derivatives yield:

Assuming that the expectation values are real and the Hamiltonian is Hermitian, , and

Relabelling the second summation index from to , we have  or equivalently,

which rearranges to the secular equations:

where we have relabelled the index to .

We can express the secular equations in the following matrix form:

This is a linear homogeneous equation with non-trivial solutions only if

or equivalently, if

where is the Hamiltonian matrix with elements and is the identity matrix.

Expanding the determinant gives the characteristic equation, which can then be solved for its roots.

Secular equations are used to determine the allowed energy levels and wavefunctions of electrons in molecules. They are fundamental in solving eigenvalue problems in many aspects of chemistry, including:

    1. degenerate perturbation theory
    2. the Hückel method
    3. the Stark effect and the Zeeman effect
    4. vibration of polyatomic molecules
    5. the Hartree-Fock-Roothaan procedure

 

Next article: Overlap integral
Previous article: The variational method
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Linear Stark effect

The linear Stark effect is the splitting or shifting of atomic or molecular energy levels in proportion to the strength of an applied electric field.

It is defined as the first-order energy correction in perturbation theory and is most clearly observed in hydrogenic atoms, where the high symmetry of the Coulomb potential leads to degeneracies that allow first-order shifts. In the strong-field regime ( V/m), where the static, uniform electric field strength  produces energy shifts that are large compared to those arising from spin-orbit interactions, the Hamiltonian  for a hydrogenic atom is:

where is the Hamiltonian of the unperturbed atom and is the perturbation due to the Stark effect.

The perturbing Hamiltonian is given by eq354:

where is the operator of the electric dipole moment of the atom.

This suggests that vanishes if the electric dipole operator is zero. Substituting eq350 into gives

where is the operator of the position vector of the electron, with the vector measured from the nucleus (taken as the origin).

Compared to the Zeeman Hamiltonian, which depends on both the orbital and spin angular momenta of the electron, the Stark Hamiltonian depends only on and therefore acts only on the spatial part of the hydrogenic wavefunction.

Assuming that the electric field is directed along the laboratory -axis, eq361 becomes

where is the angle between and the -axis.

To understand how the perturbation affects the energy levels, we analyse its effect on the and states of the atom. The uncoupled wavefunction , which is a good eigenstate of , is used as a basis to begin the perturbation analysis. For , the eigenstate is the non-degenerate ground state . Therefore, we may apply the first-order non-degenerate perturbation theory, which corresponds to the expectation value of :

or equivalently, in spherical coordinates,

Since , we have . Therefore, the ground state of a hydrogenic atom exhibits no first-order Stark effect, as its spherical symmetry implies that it possesses no permanent electric dipole moment. In fact, because is odd under spatial inversion (it changes sign about the origin), its expectation value vanishes for any eigenstate of even parity, since the integral of an odd function multiplied by an even function over all space is zero.

Unlike the ground state, the unperturbed level is fourfold degenerate described by the following basis wavefunctions (see this article and this article for derivation):

Due to this degeneracy, we must use degenerate perturbation theory, which requires constructing the Stark Hamiltonian matrix with elements . Because has even parity and is odd under spatial inversion, all diagonal matrix elements vanish:

Furthermore, the matrix elements , , , and , together with their complex conjugates, are all zero because their corresponding azimuthal integrals involve terms such as , or , all of which vanish. This leaves and its complex conjugate , each of which evaluates to .

 

Question

Show that

Answer

Letting , , we have , and

Using the identity  completes the maths.

 

Therefore, we need to solve the eigenvalue equation

where , , and are the coefficients in the basis and

To find the eigenvalues, we solve the secular equation or equivalently,

Evaluating the determinant gives the characteristic equation:

with solutions:

Therefore, the fourfold degenerate level is split by the Stark effect into three distinct levels (see diagram below): , and , where is the eigenvalue of the unperturbed Hamiltonian .

To determine the eigenstates corresponding to the three distinct levels, we refer to eq364, where

For , we require and . Since there are no conditions on or , any linear combination of and is an eigenstate with eigenvalue 0. However, a state , with both and , is not an eigenstate of (i.e. is not well-defined). Therefore, only the linear combinations in which either or , namely the unmixed states or , are good eigenstates of both and for .

For , eq366 gives the condition and . Hence, the normalised eigenstate is

Similarly, for , the same equation yields and  resulting in the normalised eigenstate

Since both and are eigenstates of , each with eigenvalue , and remain eigenstates of . However, and  are no longer eigenstates of , meaning is not a good quantum number once the electric field is applied.

The fact that the level exhibits a first-order Stark effect for and  implies that their electron distributions are no longer spherically symmetric and acquire a permanent electric dipole moment along the field direction (see diagram below). Furthermore, the three distinct energy levels, , and , classically suggest that the dipole moment has magnitude  and three possible orientations: parallel, antiparallel and perpendicular to the field.

Because the Stark energy shifts in hydrogenic atoms are proportional to the first power of the electric field, the phenomenon is known as the linear Stark effect. Although the ground state of hydrogen does not exhibit a linear Stark effect, it does exhibit a quadratic Stark effect, which we will explore in the next article.

 

Next article: Quadratic Stark effect
Previous article: Stark effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Quadratic Stark effect

The quadratic Stark effect is the shift of atomic or molecular energy levels proportional to the square of an applied electric field, arising when non-degenerate states produce only second-order energy corrections.

As explained in the previous article, the ground state () of hydrogen does not exhibit a linear Stark effect, which is defined as the first-order energy correction in perturbation theory, because its spherical symmetry implies that it possesses no permanent electric dipole moment. With reference to eq270a, the second-order energy correction for the ground state of hydrogen is

where
is the perturbing part of the Hamiltonian due to the Stark effect (see eq362)
is the external electric field directed along the -axis
is a linear combination (mixing) of excited eigenstates, with given by eq269a
and are the ground state eigenfunction and eigenvalue
and are the eigenfunctions and eigenvalues for .

Since corresponds to the 1s wavefunction, which is an even function under spatial inversion, and is odd under spatial inversion, is an odd function. Consequently, for ,  must be odd. The term in the summation with the smallest magnitude of the denominator contributes most significantly to . This occurs when , implying that the 2p wavefunctions (which are the only odd-parity states in the level) dominate the correction to the eigenvalue.

It follows that the ground state of hydrogen exhibits a Stark effect when the second-order energy correction is taken into account. To show that it is a quadratic effect and that the associated electric dipole moment is an induced dipole moment, we substitute eq362 into eq370 to obtain

Clearly, the second-order energy correction is proportional to , and hence represents a quadratic Stark effect.

In classical electromagnetism, the energy of an electric dipole moment in an external field is given by eq354, or in differential form:

Substituting the definition of an induced electric dipole moment into gives

Since the external electric field is directed along the -axis,

If in eq371 is an energy shift arising from an induced dipole moment, it must have the form of given in eq372. Comparing eq371 with eq372 yields

where is the polarisability of the atom, which is a measure of the degree to which the electron in hydrogen can be displaced relative to the nucleus.

Therefore, in eq371 is an energy shift arising from an induced dipole moment. Even though the ground state of hydrogen does not possess a permanent dipole moment, it exhibits a quadratic Stark effect due to the electric dipole moment induced by an external electric field.

 

NExt article: Inductive effect
Previous article: Linear Stark effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Stark effect

The Stark effect is the shifting and splitting of atomic or molecular spectral lines when an external electric field is applied.

First observed in 1913 by Johannes Stark, this phenomenon provided important early evidence for the interaction between electromagnetic fields and atomic structure. The effect arises because the electric field perturbs the energy levels of electrons, altering the frequencies of light emitted or absorbed during electronic transitions. As the electric analogue of the Zeeman effect, the Stark effect is analysed using perturbation theory and plays a significant role in spectroscopy and quantum mechanics.

The precise way in which these spectral lines shift, however, depends on both the internal structure of the atom or molecule and the strength of the applied electric field. In practice, two principal regimes are distinguished: the linear Stark effect and the quadratic Stark effect.

The linear Stark effect is defined as the first-order energy correction in perturbation theory and occurs when the energy shift is directly proportional to the strength of the applied electric field. This behaviour typically appears in systems with degenerate energy levels, such as in the hydrogen atom. In contrast, the quadratic Stark effect corresponds to the second-order energy correction and occurs when the energy shift is proportional to the square of the electric field strength. This is the more common situation for atoms and molecules with non-degenerate energy levels.

Details of these two effects will be discussed in the next two articles.

Next article: Linear Stark effect
Previous article: Electric dipole moment
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Electric dipole moment

An electric dipole is a pair of equal and opposite electric charges, and , separated by a small distance . The electric dipole moment (also denoted by ) is a measure of the strength and orientation of the electric dipole and is defined as:

where is the position vector of the corresponding charge measured from the origin, which is usually taken at the midpoint between the charges.

The SI unit of electric dipole moment is the Coulomb-meter (C·m), although the Debye (D) is more commonly used in practice (1D » 3.33 x 10-30 C·m)

 

Question

Is the electric dipole a vector?

Answer

The electric dipole is a physical configuration of two equal and opposite charges, not a vector. However, the associated electric dipole moment is a vector quantity. By physics convention, its direction points from the negative charge to the positive charge (note that in chemistry, the dipole direction is taken from the less electronegative atom to the more electronegative atom).

 

When an electric dipole is placed in an external electric field (see diagram above), it experiences a torque that tends to rotates the dipole so as to align it with the field, thereby lowering its potential energy. The torque is defined as the product of the component of a force normal to the axis of rotation and the distance from the origin to the point of application of the force ( or equivalently ). For a spatially uniform electric field,

and define a plane, and is perpendicular to this plane, with its direction given by the right-hand rule. Therefore, the magnitude of the torque is

where is the angle between the rotating axis and the force, which is equivalent to the angle between and .

For a rotating system (see diagram above), the work done is given by

Since and the change in potential energy is the negative of the work done by the field,

Substituting eq352 into eq353 yields:

Because the force exerted by the electric field rotates the dipole towards a lower potential energy, . To satisfy this condition, as changes from to during the rotation, we define when the dipole is perpendicular to the field (), giving

where and the negative sign arises naturally to ensure that ( is positive for ), with the corresponding potential energy in vector form being

The electric dipole moment is used to determine electronic transition probabilities in atoms and molecules.

 

Next article: Stark effect
Previous article: Intermediate-field case of the Zeeman effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Intermediate-field case of the Zeeman effect

The intermediate-field case of the Zeeman effect is the regime in which an atom or molecule in an external static magnetic field exhibits splitting of its spectral lines into multiple components, with the field strength comparable to the internal spin–orbit interaction.

To analyse the intermediate-field regime, we consider a hydrogen atom in an external magnetic field whose strength is comparable to that of the internal spin-orbit coupling (). In this case, neither the coupled basis nor the uncoupled basis is strictly a good quantum state. Nevertheless, the Hamiltonian remains invariant under rotations about the direction of the external magnetic field (taken as the lab -axis).

With reference to eq330 and eq331, where the spin-orbit Hamiltonian is , the perturbed part of the Hamiltonian becomes

if we take , where is a unit vector.

The spin-orbit term commutes with and but not with or , while the Zeeman term commutes with and , and hence with , but not with . Consequently,  and may still be regarded as a good quantum number.

 

Question

Show that .

Answer

The ladder operators of quantum orbital angular momentum are defined as and . Therefore, and . Similarly, the ladder operators of spin angular momentum are and , with and . It follows that and . Therefore, .

 

Substituting into gives:

To determine the energy levels, we can no longer rely on the simple first-order perturbation formulas derived in eq334 or eq336. Instead, the spin-orbit Hamiltonian and the Zeeman Hamiltonian must be treated on equal footing. One qualitative approach is to interpolate between the energy sublevels obtained in the weak-field and strong-field limits. However, a more rigorous method involves the following steps:

To illustrate this method, we consider the 2p1 configuration of the hydrogen atom, with and . In the absence of an external magnetic field, spin-orbit coupling (using LS coupling) splits the configuration into two energy levels, and , with degeneracies and  respectively. Even though only remains a good quantum number, we can choose as the uncoupled basis state and form linear combinations as needed to generate the eigenstates.

 

Question

Explain further why can be chosen as the basis state even though and are not good quantum numbers.

Answer

In Hilbert space, the set of states forms a complete orthonormal basis and therefore spans the entire space, regardless of whether and  are good quantum numbers. Quantum numbers only need to be good if we want the Hamiltonian to be diagonal in the chosen basis.

As mentioned earlier, the spin-orbit Hamiltonian and the Zeeman Hamiltonian must be treated on equal footing. The basis diagonalises the spin-orbit Hamiltonian but not the Zeeman Hamiltonian, while the basis diagonalises the Zeeman Hamiltonian but not the spin-orbit Hamiltonian. Since neither term dominates, no basis diagonalises the full Hamiltonian. Nevertheless, is a convenient choice because the Zeeman term is diagonal in this basis and the matrix elements of the spin–orbit operator can be calculated relatively easily.

 

Since is the only conserved quantity, we group the eigenstates according to their values. For and , the possible values are and .

The basis states and corresponding to and respectively are regarded as unmixed states because each value of corresponds to a unique pair of and . Consequently, these states, and , are eigenstates of the perturbed part of the Hamiltonian.

For , there are two possible basis states, and , and the corresponding eigenstates are linear combinations of these states: . Similarly, for  , the eigenstates are linear combinations of and , i.e. . We refer to these linear combinations as mixed states.

Having identified the eigenstates, the remaining task is to determine the energy levels. The expectation value for is , in which is given by eq340. From eq144, and noting that can be expressed as the Krönecker product ,

For , we have . Similarly, using the spin analogue of eq144,

Furthermore, and . Therefore,

where .

Using eq147 and its spin analogue, and repeating the above logic,

For the mixed state , where and , the Hamiltonian is:

Repeating the steps used for the unmixed states to compute the matrix elements gives:

To find the eigenvalues, we solve the secular equation or equivalently,

Expanding the determinant gives the characteristic equation:

Solving the quadratic equation yields two energy levels that depend on :

A similar analysis for , where and  leads to:

with

and

Therefore, the intermediate-field case results in six energy levels for the 2p1 configuration of the hydrogen atom, consistent with the weak-field and strong-field cases, with the perturbed portion of the Hamiltonian given by a matrix:

To show that the matrix elements of are consistent with the eigenvalues at the extreme limits of , we consider the cases and .

When  eq343 and eq345 reduce to the same equation: , with solutions and . These correspond to the spin-orbit energy levels and in the absence of an external magnetic field. Similarly, the eigenvalues of the unmixed states collapse onto the same energy as the level, namely . In other words, the degeneracies are restored, with being four-fold degenerate and being two-fold degenerate.

In the opposite limit , the unmixed-state eigenvalues become , which agrees with the strong-field regime for the states . For the mixed states, we refer to eq346, where the and terms dominate the discriminant, allowing it to be approximated by the perfect square . So,

Similarly, eq346 yields

The energies given by eq347 and eq348 are again consistent with those obtained in the strong-field case.

 

Question

Why is the intermediate-field case of the Zeeman effect analysed using degenerate perturbation theory, while the weak-field and strong-field cases are not?

Answer

Degenerate perturbation theory can be applied to all three cases. However, in the weak-field and strong-field regimes the eigenvalues are usually obtained more simply by computing expectation values, since the relevant Hamiltonians are already diagonal in the corresponding unmixed eigenstate bases.

 

NExt article: Electric dipole moment
Previous article: Strong-field limit of the Zeeman effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Strong-field limit of the Zeeman effect

The strong-field limit of the Zeeman effect is the regime in which an atom or molecule in an external static magnetic field exhibits splitting of its spectral lines into multiple components, with the field strength large compared to the internal spin–orbit interaction.

To analyse the strong-field regime, we consider a hydrogen atom in an external magnetic field much stronger than the internal spin-orbit field , so that the interaction of the electron with the external field dominates. In this regime, torques are generated separately on the orbital and spin magnetic moments, and , with the vectors and antiparallel to and respectively. Each torque, or , is perpendicular to the external magnetic field direction (taken as the lab -axis), so its component along the field axis is zero. Consequently, the projections and cannot change. The only way for and to evolve in time while maintaining constant projections onto the -axis is for them to precess independently about the direction of the magnetic field. Hence, and are good quantum numbers, with the state described by the uncoupled form .

Using eq331 and taking , where is a unit vector, the Zeeman Hamiltonian becomes

The corresponding expectation value is

Furthermore, . Because and precess rapidly around the direction of the external field (-axis), the time-average contributions from the and components vanish. Consequently, and , where is a constant. From eq330,

where for the ground state of hydrogen (see eq297).

Eq336 shows that in a strong external magnetic field the energy levels are split according to the quantum numbers and , revealing the fine-structure of the system and largely lifting the degeneracy of the energy levels.

 

Question

Describe the strong-field Zeeman effect on the hydrogen atom configuration of 2p1, with and .

Answer

The allowed and values for the 2p1 configuration are and , which according to eq335, results in six distinct energy sublevels. If we include the spin-orbit interactions, the energies corresponding to are given by the table below.

 

+1 +1/2
+1 -1/2
0 +1/2
0 -1/2
-1 +1/2
-1 -1/2

 

Next article: Intermediate-field case of the Zeeman effect
Previous article: Weak-field limit of the Zeeman effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page
Mono Quiz