Why lithium-ion batteries sometimes catch fire

Lithium-ion batteries power everything from smartphones and laptops to electric vehicles and energy storage systems. They are efficient, lightweight and rechargeable. However, under certain conditions, they can overheat, expand and even catch fire. Understanding why this happens comes down to how these batteries are built and what can go wrong inside them.

A lithium-ion battery consists of three key components: anode, cathode and the electrolyte (see diagram above). Separating the anode and cathode is a thin porous membrane called the separator, which plays the critical role of keeping the two electrodes apart while still allowing ions to pass through. As the battery charges and discharges, lithium ions shuttle back and forth between the electrodes, storing and releasing energy.

Under normal conditions, lithium-ion batteries are safe. Problems arise when internal or external factors disrupt their delicate balance, with the most serious outcome being thermal runaway — a chain reaction in which rising temperature accelerates further heat generation. The main trigger for a thermal runaway is an internal short circuit, which occurs when the anode and cathode come into direct contact. When this happens, charges no longer flow through the higher-resistance electrolyte, but instead travel via a very low-resistance path between the electrodes. This surge in internal current flow results in rapid heat generation that can lead to combustion.

So what causes the anode and cathode to come into direct contact? Physical damage, such as dropping, crushing or puncturing a battery can deform its internal structures. Manufacturing defects may leave contaminants, like metal particles or dust, inside the battery that can pierce the separator. Another cause is lithium plating, which can lead to dendrite growth.

Lithium plating occurs when lithium ions deposit as metallic lithium on the surface of the anode, Li+(aq) + e → Li(s), instead of moving into the anode’s internal structure (a process known as intercalation). Fast charging drives lithium ions rapidly towards the anode and can overload its surface. If the intercalation rate is slower than the incoming flux, the excess ions tend to accumulate and are reduced to metallic lithium on the anode surface. Low operating temperatures (< 273.15 K), which increase the viscosity of the electrolyte and decrease ionic mobility, can also promote lithium plating (the slower diffusion of ions into the graphite layers causes them to accumulate and plate onto the surface).

Initially, this plated lithium layer is often relatively uniform, so the anode and cathode remain physically separated by the separator, although the battery capacity may be affected. However, if the plated lithium grows unevenly over time, it can form needle-like structures called dendrites. These body-centred cubic structures can pierce the separator and reach the cathode, creating a direct, low-resistance path between the electrodes and leading to an internal short circuit.

Finally, gas buildup in a lithium-ion battery, caused by electrolyte decomposition over time, can also result in thermal runaway. This decomposition involves the battery’s electrolyte, which typically consists of the lithium salt LiPF₆ dissolved in an organic solvent such as ethylene carbonate or dimethyl carbonate. Under stress (e.g. overcharging or high temperature), LiPF₆ can decompose to PF₅ (LiPF₆ → LiF + PF₅), which then reacts with trace amounts of water to produce hydrogen fluoride gas (PF5 + 4H2O → H3PO4 + 5HF). The organic solvents can also oxidise or decompose, producing gases like CO, CO₂ or hydrocarbons, leading to battery swelling commonly observed as bulging laptop casings. The increased internal pressure may displace the electrodes, causing an internal short circuit, or even rupture the battery. This significantly increases the risk of explosion, especially if the gases are flammable.

To mitigate such risks, high-quality lithium-ion battery designs use a combination of materials, architecture and protective systems to prevent lithium plating, dendrite formation, gas buildup, and short circuits. For instance, high-quality graphite, silicon-graphite composites or lithium titanate anodes ensure even lithium intercalation, reducing the risk of plating. Additives are also included in the electrolyte to regulate lithium deposition and suppress dendrite growth. Furthermore, multi-layer microporous polymer separators can resist puncture by dendrites.

Additionally, devices powered by lithium-ion batteries incorporate electronic circuitry that monitor battery voltage, current and temperature (battery management systems) to prevent overcharging and over-discharging. Cooling systems, such as personal computer fans, also help reduce overheating and electrolyte decomposition.

 

Question

Do electric cars (EVs) also use lithium-ion batteries? If so, why do they rarely catch fire?

Answer

Yes, most EVs use lithium-ion batteries. While the chemistry is similar to that of batteries in phones or laptops, EV battery systems are much more advanced, with high-quality electrodes and robust casings that protect against physical damage. Sophisticated battery management and thermal management systems are also employed to prevent overheating (see diagram below).

 

 

In summary, lithium-ion batteries catch fire not because they are inherently unsafe, but because their high energy density makes them sensitive to internal failures and external stress. When key safeguards (separator, electrolyte stability or controlled ion movement) are compromised, a cascade of reactions can lead to thermal runaway. However, with continued advances in materials science, battery design and management systems, the risks are being steadily reduced. When properly designed, manufactured and used, lithium-ion batteries remain a safe and indispensable technology in modern life.

next article: Electrolysis of water
Previous article: lithium-ion battery
Content page of intermediate electrochemistry
Content page of intermediate chemistry
Main content page

Inductive effect

The inductive effect is the shift of electron density in -bonds caused by electron-withdrawing or electron-donating groups, resulting in polarised bonds.

This effect occurs because different atoms or groups influence electron density according to their electronegativity relative to the atoms to which they are bonded. Atoms or groups that are more electronegative tend to withdraw electron density, while those that are less electronegative can donate electron density through the -bond framework. In organic chemistry, the inductive effect is often discussed for substituents attached to carbon chains, where electron density is shifted along the bonds. As a result, atoms near an electron-withdrawing group may become slightly electron-deficient and develop a partial positive charge, whereas atoms near an electron-donating group may become slightly electron-rich and develop a partial negative charge. Whether a substituent is labelled electron-withdrawing (–I) or electron-donating (+I) depends entirely on how its electronegativity compares to a hydrogen atom bonded to that same position.

 

Electron-withdrawing groups (-I)

To quantum-mechanically analyse the inductive effect caused by electron-withdrawing substituents (classified as –I substituents), we consider an atom in which the energy of an electron can be expressed as two main contributions:

where is the Hamiltonian, is the kinetic energy of the electron and is the electrostatic potential energy arising from attraction to the nucleus.

The potential energy is approximately

where is the effective nuclear charge and is the electron-nucleus distance.

Consider the heteronuclear diatomic molecule HCl, in which the Cl atom is more electronegative than the H atom. Since , the potential energy of an electron on Cl is more negative than that on H, lowering the valence orbital energy of Cl relative to that of H. In other words, the more electronegative atom or group usually has a lower-energy valence orbital.

When the two atoms combine to form a -bond, the bonding molecular orbital (MO) wavefunction is

where and are the atomic orbital (AO) wavefunctions of H and Cl respectively, and and are real-valued coefficients that are key to explaining the inductive effect.

If is normalised and the AOs are orthogonal, the electron density of the -bond is given by

This implies that and determine the contribution of each AO to the electron density of the bond. The larger coefficient corresponds to a greater contribution and therefore greater electron density in the region of that atom. Because Cl is more electronegative, we expect . To justify this, we multiply the eigenvalue equation on the left by and integrate to give

where , , and we have

Applying the variational principle to eq380 by setting the partial derivative of with respect to each coefficient to zero, we obtain a set of simultaneous equations known as secular equations:

Expressing eq382 and eq383 in matrix form yields:

Eq384 is a linear homogeneous equation with non-trivial solutions only if

Expanding the determinant gives the characteristic equation:

with two solutions corresponding to the bonding MO () and antibonding MO ():

The energy separation between these two MOs is

If , then , which implies that the two AOs do not interact and no splitting occurs. It follows that increases the separation between the two MOs, with a larger resulting in a greater separation, and hence, a greater stabilisation of the bonding MO.

Rearranging eq383 and taking absolute values on both sides yield:

Substituting in eq385 into gives:

Multiplying and dividing the RHS by its conjugate yields

Since , then . is the expectation value of the hydrogen AO, i.e. , which is less negative than . So, and . Therefore, , where , or equivalently,

Substituting this into eq386 results in

This proves that the coefficient of the more electronegative atom corresponds to a greater contribution to the -bond, and therefore, greater electron density in the region of that atom.

In terms of polyatomic aliphatic molecules, the inductive effect is transmitted along covalent bonds in the chain, but its strength decreases rapidly with distance. To illustrate this, consider the C-C-Cl -framework in the molecule CH3CH2Cl, with the following bonding MO wavefunction:

where 1 denotes the terminal carbon, 2 denotes the carbon bonded to Cl and 3 denotes the chlorine atom.

The difference in electronegativity between chlorine and the adjacent carbon is significantly larger than that between the two carbon atoms. We would therefore expect , with the electron density skewed towards chlorine, leaving the middle carbon slightly electron-deficient and effectively more electronegative relative to the terminal carbon atom. Applying similar reasoning to the C-C bond, we obtain , and consequently,

This result implies that the inductive effect transmitted along covalent bonds in a chain decreases in strength with increasing distance from the substituent. It also means that the C-Cl moiety may be regarded as an electron withdrawing group. More generally, a group containing several atoms may behave collectively as an electron-withdrawing group if its internal bonding renders an atom electron-deficient relative to the surrounding network and capable of attracting electron density from neighbouring bonds.

For example, in the formyl group (–CHO), the carbonyl oxygen strongly stabilises the system of the C=O bond, lowering the energy of orbitals centred on the carbonyl carbon. As a result, the carbonyl carbon becomes relatively electron-deficient and can draw electron density from the neighbouring framework. Such redistribution of electron density can significantly influence the chemical behaviour of molecules. For instance, inductive effects can alter the acidity of different amino acids by stabilising or destabilising charged intermediates, and can also affect reaction mechanisms by increasing the electrophilicity of certain atoms, making them more susceptible to nucleophilic attack.

 

Electron-donating groups (+I)

A group that has the opposite effect on a chain of atoms compared to an electron-withdrawing group is known as an electron-donating group.

The same molecular orbital reasoning can be used to explain electron-donating groups. The key idea is that the direction of electron density shift depends on the relative energies of the interacting atomic orbitals, which are reflected in the Coulomb integrals ​. For an electron-donating group, the situation is reversed. The atom attached to the carbon chain has higher-energy valence orbitals, corresponding to a larger Coulomb integral than that of the neighbouring carbon atom. When the bonding molecular orbital is formed, the coefficient on this substituent becomes relatively smaller than that on the adjacent carbon atom. Consequently, electron density is pushed into the carbon skeleton. Thus, the substituent effectively donates electron density through the framework, producing a positive inductive effect (+I).

Alkyl substituents provide a simple example. Because carbon is slightly more electronegative than hydrogen, the electron density in C-H bonds of an alkyl group such as CH3– is shifted towards the carbon atom. This leaves the carbon atom relatively electron-rich and its valence orbitals at slightly higher energy than those of a methylene carbon (CH2). When these units combine to form the ethyl group CH3CH2-, the methyl fragment behaves as an electron-donating substituent. Comparing H3C-CH2– and H-CH2-, we expect the methylene carbon in the ethyl group to be more electron-rich than that in the methyl group, and hence the ethyl group to be a stronger electron-donating group than the methyl group. It follows that, in terms of increasing +I effect:

 

Next article: Mesomeric effect
Previous article: Linear Stark effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Mesomeric effect

The mesomeric effect is the electron-donating or electron-withdrawing effect of a substituent transmitted through a conjugated π-system by resonance.

Resonance, or the resonance effect, typically occurs when atoms are connected by alternating single and multiple bonds, where electrons can be shared across several atoms instead of being localised. For instance, the π-electrons in benzene are dislocalised over the entire ring rather than confined to three double bonds depicted in a Lewis structure.

In other words, the mesomeric effect is an electron-donating (+M) or electron-withdrawing (-M) effect via resonance. It requires a functional group attached to a conjugated system and focuses on how the group changes the electron distribution.

 

Question

Is the mesomeric effect different from the inductive effect?

Answer

Yes, the inductive effect is the displacement of electron density through -bonds along a chain of atoms due to the presence of an electronegative or electropositive atom or group. In contrast, the mesomeric effect involves the delocalisation of electrons through π-bonds. Although the mesomeric effect is governed mainly by resonance and orbital interactions, the electronegativities of atoms in substituents may be considered when analysing the mesomeric effect of a molecule (e.g. via the Hückel method). Nevertheless, the two effects are generally different. For example, lone pairs on atoms such as O or N often give rise to a +M effect. However, these same atoms may exert a –I effect at the same time. Therefore, the mesomeric and inductive effects can act in the same direction or in opposite directions in a molecule with a substituent.

 

 

-M effect

The Hückel method is the standard semi-empirical approach to quantify the mesomeric effect in a conjugated system. Consider the acrolein molecule CH2=CH-CHO, which contains two π-bonds, one of which belongs to an electron-withdrawing formyl group (-CHO). In general, we can express the one-electron molecular orbital (MO) wavefunction of acrolein as:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the oxygen atom, corresponds to the carbonyl carbon,  corresponds to the middle ethylene carbon and corresponds to the terminal ethylene carbon.

One quick way to analyse the MO energies of acrolein is to refer to the secular determinant of butadiene:

which represents a four- atom framework with two alternating π-bonds.

To account for the electronegativity of oxygen, we modify the Hückel parameters as follows:

where

is the Coulomb integral for oxygen, with being a factor that lowers the orbital energy.
is the Coulomb integral for the carbonyl carbon.
is the resonance integral for overlap between C and O.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

The next step involves estimating trial values for , and . Since oxygen is more electronegative than carbon, we set . This results in C=O being a stronger bond compared to C-C. Consequently, we set . The corresponding inductive effect by oxygen on the the carbonyl carbon causes it to be slightly electronegative relative to the adjacent ethylene carbon. We can therefore set . It is important to note that although the electronegativity of oxygen helps stabilise the system, it is not the primary cause of the mesomeric effect, which is fundamentally due to conjugation and orbital overlap.

Using the matrix identity of , and multiplying both sides of eq390 by , give

where .

Expanding the determinant gives the characteristic equation:

with solutions , which when substituted back into yields:

MO Type
1.52 Antibonding
0.31 LUMO
-1.16 HOMO
-2.86 Bonding

 

Question

Show that the isolated C=O Hückel bond energy is .

Answer

The secular determinant for the C=O bond is

Solving for gives . Therefore, the bonding MO energy is .

 

The total ground state π-electron energy, , is lower than the sum of the energies of isolated C=C () and C=O () bonds by , indicating the delocalisation energy provided by the mesomeric effect. To further understand the electron distribution of the molecule, we determine the coefficients of the wavefunctions as follows:

which implies:

The simultaneous equations for can be solved by substituting into them and expressing all coefficients in terms of :

Normalisation gives:

Therefore, . Repeating the calculations for and yields:

Coefficient  (bonding)  (HOMO)  (LUMO) Ground state electron density
c4 0.07 0.60 0.67 0.73
c3 0.20 0.70 -0.21 1.06
c2 0.50 0.21 -0.60 0.59
c1 0.82 -0.35 0.37 1.59

where the ground state electron density is given by for two electrons in each MO.

The value of 1.59 at oxygen indicates significant accumulation of π-electron density. This arises from the –M effect of the carbonyl group, where electron density is delocalised towards the oxygen atom. In contrast, the terminal carbon and the carbonyl carbon are electron deficient due to this delocalisation (see resonance structures below). While oxygen’s electronegativity contributes to the overall polarisation of the molecule, the calculated electron densities primarily reflect the mesomeric effect rather than the inductive effect. In this case, both effects act in the same direction.

The calculations also suggest regioselectivity of acrolein in reactions with nucleophiles. For example, “hard” (ionic) nucleophiles, such as LiAlH4 or Grignard reagents, are driven mainly by electrostatic attraction and attack the most electron-deficient carbonyl carbon (with ground state electron density of 0.59), resulting in 1,2-addition reactions (see diagram below). On the other hand, “soft” (covalent) nucleophiles, like enolates, thiols or organocuprates, are governed by orbital interactions. They attack the atom where the LUMO has the largest “lobe” (), which is the terminal ethylene carbon of acrolein, leading to 1,4-addition (Michael addition) reactions.

 

+M effect

The +M effect is typically attributed to a substituent with lone pair electrons. Consider vinylamine CH2=CH-NH2, with the general one-electron MO wavefunction:

where represents the p atomic orbital wavefunctions; 1 denotes the nitrogen atom, 2 denotes the middle ethylene carbon and 3 denotes the terminal ethylene carbon.

The secular determinant is given by:

where

is the Coulomb integral for nitrogen, with being a factor that lowers the orbital energy.
is the resonance integral for overlap between C and N.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

Let’s set due to the electronegativity of N, and because the C-N bond is slightly weaker than the C=C bond. This translates to:

with the corresponding characteristic equation:

Substituting the solutions back into yields:

MO Type
1.13 LUMO
-0.68 HOMO
-1.95 Bonding

Unlike acrolein, in which one electron from each of the four atoms populates the MOs, vinylamine has two nitrogen electrons (lone pair) and one electron from each carbon atom contributing to its bonding and highest occupied MOs. The total ground state π-electron energy, , is lower than the sum of the energies of isolated C=C bond () and the nitrogen lone-pair (), indicating a delocalisation energy of provided by the +M effect.

 

Question

Show that the isolated nitrogen lone pair energy is .

Answer

Since nitrogen’s lone pair is not yet delocalised into the π-system, its energy corresponds to twice the Coulomb integral , which is .

 

The coefficients of the wavefunctions are given by:

Substituting the three values of into the matrix equation and solving it gives:

Coefficient  (bonding)  (HOMO)  (LUMO) Ground state electron density
c3 0.24 0.72 -0.65 1.15
c2 0.48 0.49 0.73 0.94
c1 0.85 -0.48 -0.22 1.91

Since the ground state electron density of a neutral, non-polarised one-electron atom is 1.0, the ground state electron density of the nitrogen lone pair of 1.91 is lower than its unpolarised value of 2.0 (density = number of electrons per unit volume). In contrast, the electron density of the terminal carbon of 1.15 is higher than its non-polarised value of 1.0. This shift of electron density from N to the terminal C is a consequence of the +M effect of the amine group, even though N exerts a –I effect on its neighbours. In this case, mesomeric and inductive effects act in opposite directions. Furthermore, the relatively electron-rich terminal carbon and nitrogen atoms (ground state electron density > 1.0) make the molecule susceptible to electrophilic attack.

In terms of regioselectivity, a “hard” electrophile like H+ (small, concentrated positive charge) is attracted to N, which has the highest ground state electron density of 1.91. Protonation at N forms an ammonium ion. H+ can also attack the terminal carbon (electron density = 1.15) to yield an iminium ion. The relative proportion of ammonium and iminium ions formed depends on temperature and reaction time.

The ammonium ion forms quickly at relatively low temperatures through fast collisions and is considered the kinetic product. However, it is not energetically favourable because formation of the -bond between H+ and N disrupts the conjugated system, reducing the molecule’s delocalisation energy. Given enough time for equilibration at room temperature, the resonance-stabilised  iminium ion, in which the positive charge is delocalised between carbon and nitrogen, becomes the thermodynamic product and predominates. If water is present, this iminium ion quickly hydrolyses to acetaldehyde and an ammonium salt, which explains why vinylamines (and enamines) are often used as intermediates to functionalise carbonyls.

On the other hand, a “soft” electrophile such as CH3I, in which the the partially positive carbon is diffuse due to iodine’s polarisability, seeks the largest HOMO lobe for orbital overlap. This leads to formation of a new carbon-carbon bond at the terminal carbon of vinylamine.

Overall, the reactivity of vinylamine illustrates how electron density, substituent effects and the nature of the electrophile work together to determine both regioselectivity and product distribution. Understanding these effects allows chemists to predict and control reaction outcomes, making vinylamines valuable intermediates in organic synthesis.

 

Next article: Ortho/Para and meta directing groups
Previous article: Inductive effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Ortho/para and meta directing groups

Ortho/para-directing and meta-directing groups are substituents that guide incoming electrophiles to the ortho and para positions or to the meta position, respectively, on an aromatic ring.

This directing effect arises from how the substituent interacts with the ring’s electron density. Ortho/para-directing groups are typically electron-donating (e.g. –OH, –NH₂, –CH₃); they increase electron density at the ortho and para positions through resonance or inductive effects, stabilising the intermediate carbocation during electrophilic aromatic substitution. In contrast, meta-directing groups are usually electron-withdrawing (e.g., –NO₂, –COOH, –CN); they pull electron density away from the ring, making the ortho and para positions less stable for substitution and thus favouring attack at the meta position.

 

Ortho/para-directing groups

A classic example of an ortho/para-directing group is the hydroxyl group in phenol. Although it is inductively withdrawing (-I) due to the electronegativity of the oxygen atom, the lone pair of electrons on oxygen exerts a +M effect on the benzene ring, overwhelming the inductive effect and activating the ortho and para positions for electrophilic substitution.

To quantum-mechanically analyse the mesomeric effect of the hydroxyl group, we consider the general molecular orbital (MO) wavefunction of phenol:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the oxygen atom, corresponds to the substituted carbon, and through correspond to the remaining carbons in an anticlockwise direction.

Applying the Hückel method gives the following secular determinant:

where

is the Coulomb integral for oxygen, with being a factor that lowers the orbital energy.
is the resonance integral for overlap between C and O.
is the Coulomb integral for carbon atoms.
is the resonance integral between carbon atoms.
is the energy of an MO.

Incidentally, the minor of the above secular determinant is the secular determinant of benzene.

The next step involves estimating trial values for  and . Since oxygen is more electronegative than carbon, we set , lowering the MO relative to carbon. Furthermore, we set , which is lower than the reference of for every resonance integral between carbon atoms, because the oxygen’s 2p orbitals are more contracted than carbon’s, leading to a less effective overlap between the p orbitals of oxygen and carbon versus that between two carbon atoms.

Using the matrix identity of , and multiplying both sides of the secular determinant equation by , give

where .

Expanding the determinant yields the characteristic equation:

with solutions

Substituting the solutions back into yields:

MO Type
1.994 Antibonding
1.192 Antibonding
0.617 LUMO
-0.213 HOMO
-0.744 Bonding
-1.419 Bonding
-2.427 Bonding

The reactivity of phenol is reflected in its HOMO energy level, , which is higher than that of benzene (). Because the HOMO is higher in energy, the electrons are less tightly held and more nucleophilic than those in benzene towards electrophiles. To further understand the directing effects of the hydroxyl group, we determine the coefficients of the HOMO wavefunction as follows:

which implies:

Using an optimising tool, such as Microsoft Excel Solver, the normalised HOMO coefficients are approximately:

 Coefficient  (HOMO) HOMO electron density
c0 0.623 0.777
c1 (ipso) -0.323 0.208
c2 (ortho) -0.344 0.236
c3 (meta) 0.121 0.030
c4 (para) 0.415 0.344
c5 (meta) 0.121 0.030
c6 (ortho) -0.344 0.236

where the HOMO electron density (see diagram below) is given by for two electrons.

The calculated values show that the ortho and para carbons have higher electron densities than the meta carbons, with the para carbon having the highest. This is due to the +M and –I effects of the hydroxyl group. When phenol reacts with an electrophile such as bromine at low temperatures in non-polar solvents (e.g. CS2 or CCl4), mono-brominated products are formed, giving a para-bromophenol to ortho-bromophenol ratio of about 80:20. At higher temperatures, phenol reacts with aqueous bromine to yield the fully substituted 2,4,6-tribromophenol. Therefore, the hydroxyl group is known as an ortho/para-directing group. Substituents with similar effects include the amino group (-NH2), substituted amino groups (-NR2), alkoxy groups (-OR), phosphino and thio groups (-PR2 and -SR) and the phenyl group (-C6H5).

 

Meta-directing groups

An example of a meta-directing group is the ammonium group -NH3+ in the anilinium ion C6H5-NH3+. This substituent is purely inductively withdrawing (-I) due to the positively charged nitrogen atom. To analyse the inductive effect of the ammonium group quantum mechanically, we consider the general molecular orbital (MO) wavefunction of the anilinium system:

where and represent the coefficients and the p atomic orbital wavefunctions respectively; corresponds to the substituted carbon, and through correspond to the remaining carbons in an anticlockwise direction.

As the nitrogen atom lacks a lone pair of electrons to interact with the benzene π-system, the substituted carbon atom is treated as being perturbed by the electron-withdrawing group, leading to the following secular determinant:

Expanding the determinant yields the characteristic equation:

with solutions

Substituting the solutions back into yields:

MO Type
1.891 Antibonding
1.000 Antibonding
0.705 LUMO
-1.000 HOMO
-1.317 Bonding
-2.278 Bonding

To further understand the directing effects of the ammonium group, we determine the coefficients of the HOMO wavefunction as follows:

Solving the above six simultaneous equations, along with the normalisation equation, using algebra for the HOMO coefficients and Excel Solver for the remaining coefficients, give:

Coefficient  (HOMO) Electron density
c1 (ipso) 0.646 -0.517 0.000 1.370
c2 (ortho) 0.413 -0.082 0.500 0.855
c3 (meta) 0.295 0.409 0.500 1.008
c4 (para) 0.259 0.621 0.000 0.904
c5 (meta) 0.295 0.409 -0.500 1.008
c6 (ortho) 0.413 -0.082 -0.500 0.855

where the electron density is given by for two electrons in each MO.

The calculated values show that the meta carbons have higher electron densities than the ortho and para carbons. When the anilinium ion reacts with an electrophile such as the nitronium ion NO2+ at low temperatures, m-anilinium is the major product. Therefore, the ammonium group is known as a meta-directing group. Substituents with similar effects include the nitro group (-NO2), sulfonyl groups (-SO2R), the cyano group (-CN), formyl and acyl groups (-CHO and -COR) and the carboxyl group (-CO2H).

 

Question

Why doesn’t the electrophile attack the ipso (latin for “itself” or “that very one”) carbon and displace the ammonium group?

Answer

Although the Hückel method indicates that the ipso carbon has the highest electron density, it considers only the π-electrons and does not account for constraints imposed by the σ-framework. Electrophilic attack at the ipso carbon would require either displacement of the ammonium group or formation of a stable σ-complex. Both pathways are energetically very unfavourable because -NH3+ is a poor leaving group and the ipso carbon has already reached its maximum valency.

 

Previous article: Mesomeric effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

The twin paradox

The twin paradox is an apparent contradiction in special relativity where each of two twins in relative motion expects the other to age more slowly, yet the travelling twin ages less because their paths through spacetime are not equivalent.

Consider twin Jane and Jill. Suppose Jane makes a round trip from Earth to a star system 10 light-years away and back at 80% the speed of light (). From Jill’s perspective on Earth, Jane takes years to complete the entire trip. Using the time dilation equation , where , Jill calculates how much proper time elapses along Jane’s worldline and concludes that Jane ages years during those 25 Earth years.

From Jane’s point of view, she is stationary and Jill is moving. A direct application of time dilation might suggest the opposite result, that Jill ages less, leading to an apparent paradox. However, due to length contraction, the distance to the star is no longer 10 light-years in Jane’s frame; it is reduced to light-years, with the round trip taking  years to complete.

Therefore, the two perspectives do not contradict each other: both agree that Jane ages less and experiences about 15 years during the journey. Although each twin sees the other’s clock running slow locally, Jane covers a shorter contracted distance at high speed. Applying time dilation and length contraction consistently shows that she ages less.

 

Question

Wouldn’t Jane’s spaceship need to accelerate (speed up, slow down or stop) when leaving Earth and at the turnaround point?

Answer

Not necessarily. The scenario can be reformulated so that no single traveller undergoes acceleration. Instead of departing from rest on Earth, Jane can be imagined as already moving at a constant speed and passing Earth. As she passes, she synchronises her clock with Jill’s. At the distant point 10 light-years away, a second spaceship carrying a third observer (Judy) passes Jane while moving towards Earth at the same constant speed. At that event, Judy compares clocks with Jane and then continues towards Earth. When Judy reaches Earth, her clock is compared with Jill’s. In this way, the entire analysis can be carried out using only inertial observers. The result is that the travelling clocks (Jane’s and Judy’s) accumulate less proper time than Jill’s clock on Earth.

 

Next article: Why does a current flowing through a wire generate a magnetic field
Previous article: Length contraction
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Secular equations in the variational method

Secular equations in the variational method are linear equations obtained by minimising the energy with respect to trial-function coefficients, yielding a matrix eigenvalue problem whose solutions give approximate energy levels and corresponding wavefunctions.

Consider the eigenvalue equation , where and the coefficients are real. Multiplying the eigenvalue equation on the left by and integrating over all space gives:

where .

Using the orthonormality condition  results in:

Taking the partial derivative of with respect to each coefficient gives:

Applying the variational method by setting and computing the remaining derivatives yield:

Assuming that the expectation values are real and the Hamiltonian is Hermitian, , and

Relabelling the second summation index from to , we have  or equivalently,

which rearranges to the secular equations:

where we have relabelled the index to .

We can express the secular equations in the following matrix form:

This is a linear homogeneous equation with non-trivial solutions only if

or equivalently, if

where is the Hamiltonian matrix with elements and is the identity matrix.

Expanding the determinant gives the characteristic equation, which can then be solved for its roots.

Secular equations are used to determine the allowed energy levels and wavefunctions of electrons in molecules. They are fundamental in solving eigenvalue problems in many aspects of chemistry, including:

    1. degenerate perturbation theory
    2. the Hückel method
    3. the Stark effect and the Zeeman effect
    4. vibration of polyatomic molecules
    5. the Hartree-Fock-Roothaan procedure

 

Next article: Overlap integral
Previous article: The variational method
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Linear Stark effect

The linear Stark effect is the splitting or shifting of atomic or molecular energy levels in proportion to the strength of an applied electric field.

It is defined as the first-order energy correction in perturbation theory and is most clearly observed in hydrogenic atoms, where the high symmetry of the Coulomb potential leads to degeneracies that allow first-order shifts. In the strong-field regime ( V/m), where the static, uniform electric field strength  produces energy shifts that are large compared to those arising from spin-orbit interactions, the Hamiltonian  for a hydrogenic atom is:

where is the Hamiltonian of the unperturbed atom and is the perturbation due to the Stark effect.

The perturbing Hamiltonian is given by eq354:

where is the operator of the electric dipole moment of the atom.

This suggests that vanishes if the electric dipole operator is zero. Substituting eq350 into gives

where is the operator of the position vector of the electron, with the vector measured from the nucleus (taken as the origin).

Compared to the Zeeman Hamiltonian, which depends on both the orbital and spin angular momenta of the electron, the Stark Hamiltonian depends only on and therefore acts only on the spatial part of the hydrogenic wavefunction.

Assuming that the electric field is directed along the laboratory -axis, eq361 becomes

where is the angle between and the -axis.

To understand how the perturbation affects the energy levels, we analyse its effect on the and states of the atom. The uncoupled wavefunction , which is a good eigenstate of , is used as a basis to begin the perturbation analysis. For , the eigenstate is the non-degenerate ground state . Therefore, we may apply the first-order non-degenerate perturbation theory, which corresponds to the expectation value of :

or equivalently, in spherical coordinates,

Since , we have . Therefore, the ground state of a hydrogenic atom exhibits no first-order Stark effect, as its spherical symmetry implies that it possesses no permanent electric dipole moment. In fact, because is odd under spatial inversion (it changes sign about the origin), its expectation value vanishes for any eigenstate of even parity, since the integral of an odd function multiplied by an even function over all space is zero.

Unlike the ground state, the unperturbed level is fourfold degenerate described by the following basis wavefunctions (see this article and this article for derivation):

Due to this degeneracy, we must use degenerate perturbation theory, which requires constructing the Stark Hamiltonian matrix with elements . Because has even parity and is odd under spatial inversion, all diagonal matrix elements vanish:

Furthermore, the matrix elements , , , and , together with their complex conjugates, are all zero because their corresponding azimuthal integrals involve terms such as , or , all of which vanish. This leaves and its complex conjugate , each of which evaluates to .

 

Question

Show that

Answer

Letting , , we have , and

Using the identity  completes the maths.

 

Therefore, we need to solve the eigenvalue equation

where , , and are the coefficients in the basis and

To find the eigenvalues, we solve the secular equation or equivalently,

Evaluating the determinant gives the characteristic equation:

with solutions:

Therefore, the fourfold degenerate level is split by the Stark effect into three distinct levels (see diagram below): , and , where is the eigenvalue of the unperturbed Hamiltonian .

To determine the eigenstates corresponding to the three distinct levels, we refer to eq364, where

For , we require and . Since there are no conditions on or , any linear combination of and is an eigenstate with eigenvalue 0. However, a state , with both and , is not an eigenstate of (i.e. is not well-defined). Therefore, only the linear combinations in which either or , namely the unmixed states or , are good eigenstates of both and for .

For , eq366 gives the condition and . Hence, the normalised eigenstate is

Similarly, for , the same equation yields and  resulting in the normalised eigenstate

Since both and are eigenstates of , each with eigenvalue , and remain eigenstates of . However, and  are no longer eigenstates of , meaning is not a good quantum number once the electric field is applied.

The fact that the level exhibits a first-order Stark effect for and  implies that their electron distributions are no longer spherically symmetric and acquire a permanent electric dipole moment along the field direction (see diagram below). Furthermore, the three distinct energy levels, , and , classically suggest that the dipole moment has magnitude  and three possible orientations: parallel, antiparallel and perpendicular to the field.

Because the Stark energy shifts in hydrogenic atoms are proportional to the first power of the electric field, the phenomenon is known as the linear Stark effect. Although the ground state of hydrogen does not exhibit a linear Stark effect, it does exhibit a quadratic Stark effect, which we will explore in the next article.

 

Next article: Quadratic Stark effect
Previous article: Stark effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Quadratic Stark effect

The quadratic Stark effect is the shift of atomic or molecular energy levels proportional to the square of an applied electric field, arising when non-degenerate states produce only second-order energy corrections.

As explained in the previous article, the ground state () of hydrogen does not exhibit a linear Stark effect, which is defined as the first-order energy correction in perturbation theory, because its spherical symmetry implies that it possesses no permanent electric dipole moment. With reference to eq270a, the second-order energy correction for the ground state of hydrogen is

where
is the perturbing part of the Hamiltonian due to the Stark effect (see eq362)
is the external electric field directed along the -axis
is a linear combination (mixing) of excited eigenstates, with given by eq269a
and are the ground state eigenfunction and eigenvalue
and are the eigenfunctions and eigenvalues for .

Since corresponds to the 1s wavefunction, which is an even function under spatial inversion, and is odd under spatial inversion, is an odd function. Consequently, for ,  must be odd. The term in the summation with the smallest magnitude of the denominator contributes most significantly to . This occurs when , implying that the 2p wavefunctions (which are the only odd-parity states in the level) dominate the correction to the eigenvalue.

It follows that the ground state of hydrogen exhibits a Stark effect when the second-order energy correction is taken into account. To show that it is a quadratic effect and that the associated electric dipole moment is an induced dipole moment, we substitute eq362 into eq370 to obtain

Clearly, the second-order energy correction is proportional to , and hence represents a quadratic Stark effect.

In classical electromagnetism, the energy of an electric dipole moment in an external field is given by eq354, or in differential form:

Substituting the definition of an induced electric dipole moment into gives

Since the external electric field is directed along the -axis,

If in eq371 is an energy shift arising from an induced dipole moment, it must have the form of given in eq372. Comparing eq371 with eq372 yields

where is the polarisability of the atom, which is a measure of the degree to which the electron in hydrogen can be displaced relative to the nucleus.

Therefore, in eq371 is an energy shift arising from an induced dipole moment. Even though the ground state of hydrogen does not possess a permanent dipole moment, it exhibits a quadratic Stark effect due to the electric dipole moment induced by an external electric field.

 

NExt article: Inductive effect
Previous article: Linear Stark effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Stark effect

The Stark effect is the shifting and splitting of atomic or molecular spectral lines when an external electric field is applied.

First observed in 1913 by Johannes Stark, this phenomenon provided important early evidence for the interaction between electromagnetic fields and atomic structure. The effect arises because the electric field perturbs the energy levels of electrons, altering the frequencies of light emitted or absorbed during electronic transitions. As the electric analogue of the Zeeman effect, the Stark effect is analysed using perturbation theory and plays a significant role in spectroscopy and quantum mechanics.

The precise way in which these spectral lines shift, however, depends on both the internal structure of the atom or molecule and the strength of the applied electric field. In practice, two principal regimes are distinguished: the linear Stark effect and the quadratic Stark effect.

The linear Stark effect is defined as the first-order energy correction in perturbation theory and occurs when the energy shift is directly proportional to the strength of the applied electric field. This behaviour typically appears in systems with degenerate energy levels, such as in the hydrogen atom. In contrast, the quadratic Stark effect corresponds to the second-order energy correction and occurs when the energy shift is proportional to the square of the electric field strength. This is the more common situation for atoms and molecules with non-degenerate energy levels.

Details of these two effects will be discussed in the next two articles.

Next article: Linear Stark effect
Previous article: Electric dipole moment
Content page of quantum mechanics
Content page of advanced chemistry
Main content page

Electric dipole moment

An electric dipole is a pair of equal and opposite electric charges, and , separated by a small distance . The electric dipole moment (also denoted by ) is a measure of the strength and orientation of the electric dipole and is defined as:

where is the position vector of the corresponding charge measured from the origin, which is usually taken at the midpoint between the charges.

The SI unit of electric dipole moment is the Coulomb-meter (C·m), although the Debye (D) is more commonly used in practice (1D » 3.33 x 10-30 C·m)

 

Question

Is the electric dipole a vector?

Answer

The electric dipole is a physical configuration of two equal and opposite charges, not a vector. However, the associated electric dipole moment is a vector quantity. By physics convention, its direction points from the negative charge to the positive charge (note that in chemistry, the dipole direction is taken from the less electronegative atom to the more electronegative atom).

 

When an electric dipole is placed in an external electric field (see diagram above), it experiences a torque that tends to rotates the dipole so as to align it with the field, thereby lowering its potential energy. The torque is defined as the product of the component of a force normal to the axis of rotation and the distance from the origin to the point of application of the force ( or equivalently ). For a spatially uniform electric field,

and define a plane, and is perpendicular to this plane, with its direction given by the right-hand rule. Therefore, the magnitude of the torque is

where is the angle between the rotating axis and the force, which is equivalent to the angle between and .

For a rotating system (see diagram above), the work done is given by

Since and the change in potential energy is the negative of the work done by the field,

Substituting eq352 into eq353 yields:

Because the force exerted by the electric field rotates the dipole towards a lower potential energy, . To satisfy this condition, as changes from to during the rotation, we define when the dipole is perpendicular to the field (), giving

where and the negative sign arises naturally to ensure that ( is positive for ), with the corresponding potential energy in vector form being

The electric dipole moment is used to determine electronic transition probabilities in atoms and molecules.

 

Next article: Stark effect
Previous article: Intermediate-field case of the Zeeman effect
Content page of quantum mechanics
Content page of advanced chemistry
Main content page
Mono Quiz