A scientific journey through density functional theory, surface thermodynamics, statistical simulation, and machine learning — applied to the grand challenge of green hydrogen production.
The transition to renewable energy requires efficient storage technologies. Water electrolysis — splitting H₂O into hydrogen and oxygen using electricity — is a cornerstone of this transition. Understanding and optimizing the underlying catalytic reactions is a central scientific challenge.
Despite scientific consensus on anthropogenic climate change, global energy production in 2019 still relied overwhelmingly on fossil fuels. Atmospheric CO₂ concentrations are at historic highs — a trend confirmed by Antarctic ice-core records stretching back 800,000 years.
Renewable sources (solar, wind, hydro) are growing rapidly, but their intermittent nature creates a fundamental challenge: energy must be stored when supply exceeds demand and released when it does not.
Relative capacity/applicability of current storage technologies
Relative suitability of energy storage technologies by scale and deployment flexibility
First demonstrated in 1789 by van Troostwijk and Deiman, water electrolysis decomposes liquid water into gaseous hydrogen and oxygen using electrical energy. The thermodynamic requirement is 2.46 eV per water molecule (237.2 kJ/mol), corresponding to 1.23 V per electron transfer.
The overall reaction proceeds through two half-cell reactions at the electrodes:
4H⁺ + 4e⁻ → 2H₂
Low overpotential on most metals
2H₂O → O₂ + 4H⁺ + 4e⁻
Rate-limiting step, high overpotential
Schematic CV showing anodic (blue) and cathodic (orange) sweeps with characteristic peaks
The oxygen evolution reaction (OER) involves four sequential proton-coupled electron transfers, each ideally requiring 1.23 V. Any additional voltage needed — the overpotential η — represents an energy loss.
The four-step OER mechanism on a metal oxide surface:
H₂O + * → *OH + H⁺ + e⁻Water dissociation and OH adsorption
*OH → *O + H⁺ + e⁻Dehydrogenation to adsorbed oxygen
*O + H₂O → *OOH + H⁺ + e⁻Often the rate-limiting step on most catalysts
*OOH → O₂ + H⁺ + e⁻ + *Desorption regenerates surface site
The hydrogen evolution reaction (HER) is simpler — a two-step Volmer–Heyrovsky or Volmer–Tafel mechanism — and proceeds with much lower overpotential on most metallic catalysts.
The Brønsted–Evans–Polanyi (BEP) relation states that kinetic barriers for surface reactions scale linearly with adsorption energies. This allows us to compute theoretical overpotentials from DFT adsorption energies alone — orders of magnitude faster than computing full transition states.
Assuming BEP scaling holds, the descriptor ΔG*O − ΔG*OH captures the competition between oxygen binding strength (too strong → *OOH formation limited) and too-weak binding (→ *OH dehydrogenation limited).
Plotting theoretical overpotential against the OER descriptor (ΔG*O − ΔG*OH) for a range of transition metal oxides produces the characteristic volcano plot. Materials at the peak — IrO₂ and RuO₂ — bind oxygen intermediates optimally.
Industrial gold standard for acidic OER. Binds oxygen intermediates optimally — close to the volcano peak. Stable in acidic conditions but based on scarce Ir.
Slightly more active than IrO₂ but dissolves ~10× faster at 1.2 V in acidic media. Investigates as comparison material in surface phase diagram studies.
Mixing Ti into the IrO₂ host can modify adsorption energies and improve OER activity. Computationally designed — DFT identifies optimal Ti motifs and surface configurations.
Recent work explores multi-component oxides (5+ elements) that access vast composition spaces. ML-driven screening and in-situ XAS characterize active sites.
Density Functional Theory provides a quantum-mechanical framework to compute the electronic structure of materials from first principles — without empirical parameters. It is the workhorse of modern computational materials science.
Artist's rendition of a rutile-type crystal lattice with electron density clouds (blue) around metal atoms and oxygen (red)
The first Hohenberg-Kohn theorem (1964) establishes that the ground-state electron density n(r) uniquely determines all properties of a many-electron system. The second theorem provides a variational principle: the true ground state minimizes the total energy functional E[n].
Kohn and Sham (1965) reformulated the problem using a fictitious system of non-interacting electrons moving in an effective potential, making DFT computationally tractable for real materials.
The exchange-correlation (XC) functional Exc[n] accounts for quantum-mechanical exchange and electron correlation effects. Its exact form is unknown; approximations form the "Jacob's Ladder" of DFT:
Local Density Approximation — uses only local electron density. Simple, systematic underbinding.
Generalized Gradient Approximation. The Perdew-Burke-Ernzerhof functional is standard for surface catalysis.
Van der Waals corrections (Tkatchenko-Scheffler, Dion-DF) capture long-range dispersion — critical for water and adsorbates.
Mixes exact exchange — improves band gaps and electronic structure but at higher computational cost.
The Vienna Ab initio Simulation Package (VASP) solves the Kohn-Sham equations in a plane-wave basis set with periodic boundary conditions. Core electrons are treated by the Projector Augmented Wave (PAW) method, substantially reducing computational effort while maintaining accuracy.
Electrochemical reactions occur at the interface between a solid electrode and liquid electrolyte. The atomic-scale structure of the catalyst surface — its adsorbate coverage, oxidation state, and composition — directly controls catalytic activity and stability.
Under electrochemical conditions, the surface is in thermodynamic equilibrium with the electrolyte and applied potential. The grand canonical surface phase diagram maps out which surface configuration is thermodynamically stable as a function of the chemical potential of oxygen and hydrogen.
The chemical potential μ_H is linked to the electrode potential via the computational hydrogen electrode (CHE) — a key bridge between DFT energies and electrochemical observables.
The rutile IrO₂(110) surface exposes two types of surface sites: coordinatively unsaturated (cus) Ir atoms and bridging oxygen (Obr). Under OER conditions, these sites are sequentially covered by OH and O adsorbates.
A central question is the hydrogen coverage of the surface before and during the CV measurement. Hydrogen atoms are hard to detect experimentally due to their small scattering cross-section. DFT-based phase diagrams reveal that the surface transitions through distinct hydrogen-covered states as potential is swept.
IrO₂ crystallizes in the rutile structure (tetragonal, space group P4₂/mnm). The (110) surface is the lowest-energy facet and the orientation grown by MBE for experimental studies. Each surface Ir atom has 5-fold coordination (vs 6-fold bulk).
RuO₂ shares the rutile structure and exhibits similar surface chemistry to IrO₂, with slightly stronger oxygen binding. It is catalytically more active but far less stable — dissolving ~10× faster than IrO₂ at operating potentials in acidic media. The phase diagram for RuO₂ also captures the effect of explicit water adsorption and solvation corrections.
Implicit solvent models (VASPsol) shift adsorption energies of polar adsorbates like *OH and *OOH by up to 0.1–0.2 eV.
Explicit H₂O molecules on cus sites form hydrogen-bond networks that stabilize surface *OH configurations at low potentials.
RuO₂ dissolution becomes thermodynamically favorable at potentials >1.2 V vs RHE, limiting practical use. IrO₂ remains stable.
Experimental evidence shows that incorporating Ti into IrO₂ enhances OER activity compared to pure IrO₂. The mechanism for this enhancement is not immediately obvious — TiO₂ alone is a poor OER catalyst.
DFT calculations of multiple Ti substitution motifs (cus-Ir → Ti, subsurface Ti, mixed layers) reveal that specific configurations shift the oxygen adsorption energy of neighboring Ir sites toward the volcano peak.
More recent literature (Mints et al. 2024, Ospina-Acevedo et al. 2024) extends alloying strategies to high-entropy oxides (Ru₁₋ₓMₓO₂, M = Zr, Nb, Ta) and cooperative multi-site mechanisms. In-situ XAS and SEIRAS measurements now directly probe active site structures under reaction conditions.
Bridging the gap between DFT calculations of individual configurations and thermodynamic observables requires statistical mechanics. The cluster expansion and Monte Carlo simulation together enable prediction of equilibrium surface properties at finite temperature and electrochemical potential.
A surface in contact with an electrolyte at fixed potential is described by the grand canonical ensemble. The grand partition function sums over all possible surface configurations weighted by their Boltzmann factor e−βΩ, where Ω = E − μN is the grand potential.
Computing this sum exactly for a surface with N adsorption sites and k possible states per site requires kN evaluations — infeasible for large N. The cluster expansion makes this tractable.
The cluster expansion maps the DFT formation energy of any surface configuration onto a compact, rapidly-evaluable polynomial of site occupation variables σᵢ ∈ {0,1}. It is the key bridge that makes grand canonical Monte Carlo with DFT accuracy feasible.
Any configuration-dependent property can be expanded as:
where α indexes clusters (single sites, pairs, triplets, ...), J_α are the effective cluster interactions (ECIs), and Φ_α are cluster correlation functions — products of site occupation variables for all sites in the cluster.
For the hydrogen coverage problem, each surface site is either occupied (σ=1, H present) or empty (σ=0). The CE is trained on DFT energies of ~50–200 distinct configurations.
ECIs are fitted to a database of DFT formation energies by minimizing the cross-validated prediction error (LOOCV or k-fold CV). Feature selection (which clusters to include) is guided by genetic algorithms or compressive sensing.
CASM (Crystal Alloy Structure Maker) is an open-source framework implementing the complete cluster expansion workflow: structure enumeration, symmetry analysis, CE fitting, and grand canonical Monte Carlo sampling.
Randomly select a site and flip σᵢ: 0↔1
Compute ΔΩ = ΔE − μ_H·Δσ using the CE
If ΔΩ < 0: always accept. Else: accept with P = e−βΔΩ
After equilibration, measure ⟨N_H⟩, correlations, phase
The Metropolis Monte Carlo algorithm — introduced by Metropolis and Ulam (1949) — samples the grand canonical ensemble without computing the full partition function. Combined with the cluster expansion, it enables thermodynamic averages with near-DFT accuracy at negligible cost per step.
For the IrO₂ hydrogen coverage problem, MC runs use supercells of ~1000 surface sites over 105–106 steps per temperature/potential point, enabling converged phase diagrams and CV simulation.
Machine learning has transformed computational materials science by making it possible to learn accurate interatomic potentials, predict properties, and navigate vast chemical spaces — tasks that are intractable with DFT alone.
Abstract visualization of a neural network (input → hidden layers → output) and a fitted Gaussian kernel below, representing ML potential training
Machine-learned interatomic potentials (MLIPs) — such as the Gaussian Approximation Potential (GAP) — learn the DFT potential energy surface from training data, enabling molecular dynamics and Monte Carlo at near-DFT accuracy but many orders of magnitude faster.
GAP uses Gaussian process regression with structural descriptors (SOAP — Smooth Overlap of Atomic Positions) as input features. The covariance (kernel) function measures similarity between atomic environments.
The combination of high-throughput DFT, ML, and experimental screening has accelerated catalyst discovery dramatically. Recent highlights include ML-predicted high-entropy oxide catalysts for OER and CO₂ reduction.
ML models identify which configurations to compute with DFT next, minimizing the number of expensive calculations needed to achieve a target accuracy. Used in generative catalyst discovery workflows.
5+ component oxides span enormous composition spaces (~10⁶ + configurations). ML potentials and surrogate models screen these spaces in hours, identifying promising compositions for experimental synthesis.
SOAP, ACSF, and symmetry-invariant descriptors encode atomic structure in ML-friendly representations. Recursive feature elimination (RFE) selects the most predictive subset.
A key outcome of the thesis is the computation of a theoretical cyclic voltammogram for IrO₂(110) using the cluster expansion + Monte Carlo pipeline. By sweeping the chemical potential μ_H (equivalent to sweeping electrode potential), the simulation yields <N_H>(V) — the H coverage as a function of potential.
The derivative d<N_H>/dV corresponds directly to the capacitive current measured in a CV. Peaks in the simulated CV correspond to potential-driven adsorption/desorption events on specific surface sites.
Schematic CV illustrating the forward (blue) and return (orange) sweeps; peaks indicate surface phase transitions
Computational predictions are validated against and guided by experimental characterization techniques that probe catalyst structure and electrochemical behavior under realistic conditions.
Sweeping electrode potential while recording current reveals surface adsorption/desorption events as peaks. MBE-grown single-crystal surfaces produce well-defined CVs ideal for comparison with theory.
MBE grows atomically flat, well-oriented oxide films on single-crystal substrates. Essential for producing model catalyst surfaces with defined composition and crystallography that mirror the DFT slab model.
Operando X-ray absorption spectroscopy (XAS) and surface-enhanced infrared spectroscopy (SEIRAS) probe oxidation states and adsorbate species on the catalyst surface in real time during electrochemical reactions.
Online ICP-MS measurements quantify dissolution of Ir and Ru under OER conditions, directly measuring catalyst stability and corroborating thermodynamic stability predictions from surface phase diagrams.