Bayesian learning of chemisorption for bridging the complexity of electronic descriptors

Wang, Siwen; Pillai, Hemanth Somarajan; Xin, Hongliang

doi:10.1038/s41467-020-19524-z


Article
Open access
Published: 30 November 2020

Siwen Wang¹, Hemanth Somarajan Pillai¹ & Hongliang Xin¹ (ORCID: orcid.org/0000-0001-9344-1697)

Nature Communications volume 11, Article number: 6132 (2020) Cite this article

Abstract

Building upon the d-band reactivity theory in surface chemistry and catalysis, we develop a Bayesian learning approach to probing chemisorption processes at atomically tailored metal sites. With representative species, e.g., *O and *OH, Bayesian models trained with ab initio adsorption properties of transition metals predict site reactivity at a diverse range of intermetallics and near-surface alloys while naturally providing uncertainty quantification from posterior sampling. More importantly, this conceptual framework sheds light on the orbitalwise nature of chemical bonding at adsorption sites with d-states characteristics ranging from bulk-like semi-elliptic bands to free-atom-like discrete energy levels, bridging the complexity of electronic descriptors for the prediction of novel catalytic materials.

Introduction

Adsorption of molecules or their fragments at transition-metal surfaces is a fundamental process for many technological applications, such as chemical sensing, molecular self-assembly, and heterogeneous catalysis. Because of the convoluted interplay between electron transfer and orbital coupling, chemical bonding can be formidably complex. Recent decades have brought major advances both in spectroscopic tools^1,2, which reveal orbitalwise information of chemisorbed systems, and in predicting chemical reactivity at sites of interest via electronic factors, e.g., the number of valence d-electrons³, the density of d-states at the Fermi level⁴, the d-band center⁵, and the d-band upper edge^6,7. Compared with a full quantum-mechanical treatment of many-body systems, the simplicity of physics-inspired descriptors comes at the cost of limited generalization, particularly for high-throughput materials screening. Incorporation of multifidelity site features into reactivity models with machine learning (ML) algorithms has shown early promise for the prediction of adsorption energies, with an accuracy comparable to the typical error (~0.1–0.2 eV) of density functional theory (DFT) calculations^8,9,10,11,12,13,14,15,16. However, the approach is largely black-box in nature, which hinders its physical interpretation. Developing a theory-based, generalizable model of chemisorption that bridges the complexity of electronic descriptors and predicts the binding affinity of active sites to key reaction intermediates with uncertainty quantification represents one of the biggest challenges in fundamental catalysis.

Here, we present a Bayesian inference approach to probing chemisorption processes at metal sites by learning from ab initio datasets. The model is built upon the basic framework of the d-band reactivity theory⁵, while employing a Newns–Anderson-type Hamiltonian^17,18 to capture essential physics of adsorbate-substrate interactions. Such types of simplified Hamiltonians were originally used for describing magnetic properties of impurities in a bulk metallic host¹⁷, and later extended with success by Newns and Grimley to chemisorption at surfaces^18,19. A basis set of orbitals consisting of the adsorbate and substrate states was used for solving the hybridization problem within a self-consistent Hartree–Fock scheme¹⁸. Despite a remarkable success in advancing the basic understanding of adsorption phenomena at surfaces, particularly for d-block metals⁶, its application in materials design remains limited due to the lack of accurate model parameters and meaningful error estimates. Bayesian inference produces the posterior probability distribution of model parameters under the influence of observations and prior knowledge²⁰. With representative species, e.g., *O and *OH, we demonstrate the predictive performance and physical interpretability of Bayesian models for chemical bonding at a diverse range of intermetallics and near-surface alloys, bridging the complexity of electronic descriptors in search of novel catalytic materials.

Results

The d-band reactivity theory

Within the basic framework of the d-band reactivity theory for transition-metal surfaces, the formation of the adsorbate-metal bond conceptually takes place in two consecutive steps⁵, as illustrated in Fig. 1. First, the adsorbate frontier orbital (or orbitals) $\left|a\right\rangle$ at ${\epsilon }_{{\mathrm{a}}}^{0}$ couples to the delocalized, free-electron-like sp-states of the metal substrate, leading to a Lorentzian-shaped resonance state at ϵ_a. Second, the adsorbate resonance state interacts with the localized, narrowly distributed metal d-states, shifting up in energy due to the orthogonalization penalty for satisfying the Pauli principle, and then splitting into bonding and antibonding states. The first-step interaction contributes a constant ΔE₀, albeit often the largest part of chemical bonding. The variation in adsorption energies from one metal to another is determined by the metal d-states. This part of the interaction energy, ΔE_d, can be further partitioned into orbital orthogonalization and orbital hybridization contributions²¹. To a first approximation, the orbital hybridization energy can be evaluated from the changes in integrated one-electron energies. The orbital orthogonalization cost is taken as proportional to the product of the interatomic coupling matrix and the overlap matrix, VS, or equivalently αV², where α is the orbital overlap coefficient. The absolute value of V² can be written as $\beta {V}_{{\mathrm{ad}}}^{2}$, in which the standard values of ${V}_{{\mathrm{ad}}}^{2}$ relative to Cu are readily available in the Solid State Table²². The overall adsorption energy ΔE can then be written as the sum of the energy contributions from the sp-states, ΔE₀, and the d-states, ΔE_d, with the latter depending on the symmetry and degeneracy of adsorbate frontier orbitals. Another important piece of information from this framework is the evolving density of states projected onto the adsorbate orbital(s) upon adsorption, ρ_a. A full account of the theoretical framework is presented in the “Methods” section.
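For orientation, the energy partition described in this paragraph can be collected in one place. This is only a compact restatement of the relations given in words above (for a single adsorbate orbital), not an additional result:

$$\begin{aligned}\Delta E &= \Delta E_{0}+\Delta E_{\mathrm{d}},\\ \Delta E_{\mathrm{d}} &= \Delta E_{\mathrm{d}}^{\mathrm{hyb}}+\Delta E_{\mathrm{d}}^{\mathrm{orth}},\\ \Delta E_{\mathrm{d}}^{\mathrm{orth}} &\propto VS\approx \alpha V^{2},\qquad V^{2}=\beta V_{\mathrm{ad}}^{2}.\end{aligned}$$

Here ΔE₀ is the surface-independent sp-state contribution, and all metal-to-metal variation enters through ΔE_d.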

There are a number of unknown parameters within the basic framework of the d-band reactivity theory as discussed above and detailed in the “Methods” section, including the energy contribution from the sp-band ΔE₀, the adsorbate resonance energy ϵ_a relative to the Fermi level, the sp-band chemisorption function Δ₀, the orbital overlap coefficient α, and the orbital coupling coefficient β. By least-squares fitting of the adsorbate density of states and the integrated one-electron energy changes to those from DFT calculations^23,24, the Schmickler model of electron transfer has been developed to understand H₂ evolution/oxidation and OH⁻ adsorption at metal–electrolyte interfaces. However, deterministic fitting of adsorption properties from a single surface is prone to overfitting or to becoming trapped in a locally optimal region, limiting its application in catalysis.

Bayesian learning

We instead employ Bayesian learning to infer the vector of model parameters $\overrightarrow{\theta }={(\Delta {E}_{0},{\epsilon }_{{\mathrm{a}}},{\Delta }_{0},\alpha ,\beta )}^{\prime}$ from the evidence, i.e., ab initio adsorption properties, along with prior knowledge if available²⁰. In the Bayesian view, those parameters are not deterministic point values, but rather probability distributions reflecting the uncertainty of physical variables. The use of parameter distributions as opposed to computationally derived point values has obvious advantages for uncertainty quantification. In the chemical sciences, Bayesian learning has been used for calibration and validation of thermodynamic models for the uptake of CO₂ in mesoporous silica-supported amines²⁵, designing the Bayesian error estimation functional with van der Waals correlations²⁶, and identifying potentially active sites and mechanisms of catalytic reactions²⁷, to name just a few. The Bayesian approach allows one to infer the posterior probability distribution $P(\overrightarrow{\theta }| {\mathcal{D}})$ for latent variables based on the prior $P(\overrightarrow{\theta })$ as well as the likelihood function $P({\mathcal{D}}| \overrightarrow{\theta })$ subject to the observation ${\mathcal{D}}$. The mathematical relationship between the prior, observation, and posterior is given by Bayes’ theorem²⁰, $P(\overrightarrow{\theta }| {\mathcal{D}})=P({\mathcal{D}}| \overrightarrow{\theta })P(\overrightarrow{\theta })/P({\mathcal{D}})$. Our initial belief about likely parameter values is provided by weakly informative priors to minimize potential bias. For example, ΔE₀ and ϵ_a can be estimated from DFT calculations of the adsorbate on a simple metal, e.g., sodium (Na) in the face-centered cubic (fcc) phase. Specifically, we used Normal distributions for real-valued variables unrestricted in sign, LogNormal for non-negative parameters, and Uniform for others (see the details of Bayesian learning and parameter choices in the “Methods” section). Computing the normalizing constant $P({\mathcal{D}})$, the denominator of the posterior distribution, is intractable in most practical scenarios. To avoid this complication, we use the Markov chain Monte Carlo (MCMC) method²⁸, whose sampling criterion depends only on the relative posterior density of the newly explored point and its preceding point. To compute the transition probability of each MCMC step, we define the sum of the (negative) logarithm of the likelihood functions corresponding to binding energies and projected density of states onto each adsorbate orbital, with a hyperparameter λ adjusting the weight of the two contributing metrics (see details in the “Methods” section). After a large number of MCMC samples, burn-in (discarding the first half of the trajectory) and then thinning (keeping 1 out of every 5 samples) were performed before extracting converged values from the joint posterior distributions. The convergence of the MCMC sampling is checked by using parallel chains with different starting parameter sets, such that the variance of interchain samples is close to, or within 1.2–1.5 times, that of intrachain samples²⁸. The complete code, named Bayeschem, is available at the GitHub repository https://github.com/hlxin/bayeschem for public access.
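The sampling workflow described in this section (a Metropolis-type acceptance rule that needs only posterior ratios, burn-in of the first half, thinning by 5, and a parallel-chain convergence check) can be illustrated on a toy one-parameter problem. The sketch below is not the Bayeschem implementation: the Gaussian toy posterior, the synthetic `data`, the step size, and the helper names are all assumptions made for illustration, and the Gelman–Rubin potential scale reduction factor is used here as one common way to compare interchain and intrachain variance.

```python
import math
import random


def log_posterior(theta, data, sigma=1.0):
    """Unnormalized log posterior for a toy problem: Normal(0, 10) prior, Gaussian likelihood."""
    log_prior = -0.5 * (theta / 10.0) ** 2
    log_like = -0.5 * sum((y - theta) ** 2 for y in data) / sigma ** 2
    return log_prior + log_like


def metropolis(log_post, start, n_steps, step_size, rng):
    """Random-walk Metropolis sampler with a symmetric Gaussian proposal."""
    chain, current, current_lp = [], start, log_post(start)
    for _ in range(n_steps):
        proposal = current + rng.gauss(0.0, step_size)
        proposal_lp = log_post(proposal)
        # Only the posterior ratio enters, so the normalizing constant P(D) cancels.
        if rng.random() < math.exp(min(0.0, proposal_lp - current_lp)):
            current, current_lp = proposal, proposal_lp
        chain.append(current)
    return chain


def burn_and_thin(chain, thin=5):
    """Discard the first half of the chain, then keep every `thin`-th sample."""
    return chain[len(chain) // 2 :: thin]


def gelman_rubin(chains):
    """Potential scale reduction factor from inter- and intra-chain variances."""
    m, n = len(chains), len(chains[0])
    means = [sum(c) / n for c in chains]
    grand = sum(means) / m
    between = n / (m - 1) * sum((mu - grand) ** 2 for mu in means)
    within = sum(sum((x - mu) ** 2 for x in c) / (n - 1) for c, mu in zip(chains, means)) / m
    return math.sqrt(((n - 1) / n * within + between / n) / within)


rng = random.Random(0)
data = [1.8, 2.1, 2.4, 1.9, 2.3]  # synthetic observations, not taken from the paper
chains = [
    burn_and_thin(metropolis(lambda t: log_posterior(t, data), start, 20000, 0.8, rng))
    for start in (-5.0, 0.0, 5.0)  # parallel chains with different starting points
]
r_hat = gelman_rubin(chains)
posterior_mean = sum(map(sum, chains)) / sum(map(len, chains))
print(f"R-hat = {r_hat:.3f}, posterior mean = {posterior_mean:.3f}")
```

For this conjugate toy problem the exact posterior mean is 10.5/5.01, which gives a simple check that the retained samples are representative; in the actual model the same bookkeeping would be applied to each of the five parameters.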

Model development

In Fig. 2a, we show the covariance of the joint posterior distribution for each parameter pair and the 1D histograms of model parameters (ΔE₀, ϵ_a, Δ₀, α, and β) from MCMC simulations for *O adsorption at the fcc-hollow site of the {111}-terminated transition-metal surfaces (Cu, Ag, Au, Ni, Pd, Pt, Co, Rh, Ir, and Ru). We assume three degenerate O_2p orbitals, as used before²⁹, for demonstration of the approach, and later extend it to multiorbital models. To attain converged posterior distributions, 200k MCMC sampling steps with the Metropolis–Hastings algorithm were performed in a multidimensional parameter space illustrated in Fig. 2b. In Fig. 2, the approximate contours for the 68, 95, and 99% confidence regions are shown in the lower triangle, indicating little to no correlation between latent-variable pairs.

With the converged Bayesian sampling, Fig. 3a shows the model-predicted adsorption energies of *O at the fcc-hollow site of transition-metal surfaces, with a mean absolute error (MAE) of ~0.17 eV compared to DFT calculations. The standard deviation of model prediction using the posterior distribution of model parameters ($\overrightarrow{\theta },\,\overrightarrow{\sigma }$) is overlaid, providing for the first time uncertainty quantification of adsorption energies within the d-band reactivity theory. Figure 3b shows the DFT-calculated and model-constructed projected density of states onto the O_2p orbital using the posterior means of model parameters, taking Pt(111) as an example (see all the surfaces in Supplementary Fig. 1). The chemisorption function Δ(ϵ) and its Hilbert transform Λ(ϵ), along with the straight adsorbate line (ϵ − ϵ_a), are shown for the graphical solution of the Newns–Anderson model¹⁸. The intersections indicated by solid circles in Fig. 3b represent the O_2p–Pt_5d bonding and antibonding states, with the latter above the Fermi level, suggesting a strong covalent interaction of *O at Pt(111). Given the simplicity of the model, the clearly captured electronic structure of the adsorbate–substrate system and the reactivity trend are satisfying.

**Fig. 3: Model-predicted adsorption properties.**

To extend the approach to adsorbates with multiple valence orbitals that possibly contribute to bonding, we have explicitly treated O_2p states with the doubly degenerate p_xy orbitals and the single p_z orbital in Bayesian learning. We infer model parameters (ϵ_a, Δ₀, and β) corresponding to each non-equivalent adsorbate orbital together with an orbital-independent α²⁹ and a global parameter ΔE₀. The posterior parameter distributions are shown in Supplementary Fig. 2. From the posterior means of model parameters, we can see that the orbital coupling coefficient β of p_xy (1.67 eV⁻¹) is smaller than that of p_z (1.77 eV⁻¹), consistent with the symmetry analysis: the p_xy orbitals, which are parallel to the surface, form π bonds with the d-states, while the p_z orbital can interact through a stronger σ bond. A weaker coupling manifests itself in a narrower orbital splitting of π/π^* than that of σ/σ^*, which has been previously observed using angle-resolved photoemission spectroscopy on Cu and Ni³⁰. Supplementary Figs. 3 and 4 show that the model-constructed projected density of states onto symmetry-resolved orbitals closely resembles the DFT-calculated distributions and that the predicted values of *O adsorption energies have an MAE of ~0.17 eV. To demonstrate the robustness and generalizability of the approach, we have also optimized the Bayeschem model of *O at the atop configuration (see Supplementary Figs. 5–7). In this model scheme, an individual set of parameters is obtained for the adsorbate at a given site. Compared to the linear adsorption-energy scaling relations³¹ that link adsorption energies of different adsorbates, Bayeschem creates the connection between the electronic structure of a surface site and the adsorption energy.

To test the prediction capability of the Bayeschem model for unseen systems, we took the *OH species at the atop adsorption configuration as a case study because of its fundamental importance in understanding the nature of chemical bonding³² and its practical interest as a key reactivity descriptor in transition-metal catalysis^33,34,35. Three frontier molecular orbitals, i.e., 3σ, 1π, and 4σ*, are assumed to be involved in chemical bonding³². Symmetry-resolved, molecular orbital density of states projected onto OH along with adsorption energies are used as the DFT ground truth Y in Eq. (6). With the Bayeschem model developed here (see Supplementary Figs. 8–10 for posterior parameter distributions, model-predicted adsorption energies, and projected density of states on training samples), we predict *OH binding energies at a diverse range of intermetallics and near-surface alloys. Specifically, we included A₃B, A′@A_ML, A-B@A_ML, A₃B@A_ML, A@A₃B, and A@AB₃, where A (A′) represents the ten fcc/hcp metals used in the model development and B covers d-metals across the periodic table (see ref. ³⁶ for structural details and tabulated data). The coupling matrix element V_ad for alloys is assumed to be constant and taken from the Solid State Table²². Its dependence on the local chemical environment can be incorporated into the model using the tight-binding approximation³³. The A sites of the above-mentioned surfaces exhibit diverse characteristics of the metal d-states, ranging from bulk-like semi-elliptic bands to free-atom-like discrete energy levels³⁷, as illustrated in Fig. 4a using Pt and Ag₃Pt as examples. Similar to previous observations of single-atom alloys with coinage metal hosts^37,38, a reactive guest metal often exhibits peaky signatures within the d-band due to the energy misalignment of coupling d–d orbitals⁷. A direct consequence of such diverse electronic properties of adsorption sites is that no single electronic descriptor can capture the local chemical reactivity accurately. Encouragingly, the Bayeschem model, parameterized using data from ten pristine transition metals, predicts *OH adsorption energies on 512 alloy surfaces with an MAE of 0.16 eV (see Fig. 4b). The standard deviation of predicted *OH adsorption energies from the posterior distribution of model parameters is marked for uncertainty quantification. The model shows a similar performance to data-driven ML models^8,9,10,11 while outperforming the state-of-the-art electronic descriptors, e.g., the d-band center ϵ_d (MAE: 0.20 eV) and upper edge ϵ_u (MAE: 0.23 eV). The approach can be easily extended to adsorbates more complex than *O and *OH, e.g., *OOH, without losing its generalizability in the development workflow.
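The two quantities reported here, the MAE against DFT and the standard deviation of each prediction under the posterior, follow from simple bookkeeping once posterior samples are available. The sketch below shows that bookkeeping only; `toy_model`, its two parameters, and all numbers are hypothetical placeholders and not the chemisorption model or data of this work.

```python
import statistics


def predictive_summary(model, posterior_samples, x):
    """Push every posterior draw through the model; return mean and std of the prediction."""
    predictions = [model(theta, x) for theta in posterior_samples]
    return statistics.fmean(predictions), statistics.stdev(predictions)


def mean_absolute_error(predicted, reference):
    """Average absolute deviation between predictions and reference values."""
    return statistics.fmean([abs(p - r) for p, r in zip(predicted, reference)])


def toy_model(theta, x):
    """Hypothetical stand-in for the physics model: a line with offset e0 and slope."""
    e0, slope = theta
    return e0 + slope * x


# Three made-up posterior draws of (e0, slope); real chains would hold thousands.
samples = [(-1.0, 0.5), (-1.1, 0.5), (-0.9, 0.5)]
mean_pred, std_pred = predictive_summary(toy_model, samples, x=2.0)
mae = mean_absolute_error([0.0, 1.0], [0.2, 0.9])
print(f"prediction = {mean_pred:.2f} +/- {std_pred:.2f} eV, MAE = {mae:.2f} eV")
```

The error bar obtained this way reflects parameter uncertainty only; it does not by itself account for model-form error.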

**Fig. 4: Model test and interpretation.**

Orbitalwise interpretation of chemical bonding

More importantly, the Bayesian framework with built-in physics allows us to quantitatively interrogate the underlying mechanism of chemical bonding, which is difficult to obtain from purely data-driven regression models. Taking *OH adsorption at the M (10 fcc/hcp metals) site of {111}-terminated Ag₃M intermetallics as examples, Fig. 4c shows the partition of *OH adsorption energies resulting from the second-step interaction (ΔE_d) into orbital orthogonalization and hybridization. For the 3d, 4d, and 5d series of the guest metal M, the orthogonalization and hybridization contributions decrease in magnitude from left to right across the periodic table, while the hybridization dominates the reactivity trends. The changes in $\Delta {E}_{{\mathrm{d}}}^{{\mathrm{hyb}}}$ can be understood from the simplified d-band model, with the position and occupancy of adsorbate–substrate antibonding states tracking with the d-band center or upper edge. The orthogonalization energy is proportional to the filling f and ${V}_{{\mathrm{ad}}}^{2}$ (see Eq. (4)), which offset each other to a certain extent (${V}_{{\mathrm{ad}}}^{2}$ decreases while f increases across the 3d, 4d, and 5d series), leading to a less dominant role than the hybridization. The orbitalwise contributions shown in Fig. 4c with different fill patterns suggest that essentially the sole contribution to *OH adsorption at d-metal surfaces is from the 1π orbital, while those from 3σ and 4σ^* are too small to be visible. This is supported by the projected molecular orbital density of states in Supplementary Fig. 7, which shows that 3σ and 4σ^* form resonance states after their interactions with the sp-states of the metal site without noticeable splitting due to d-states. Thus, they do not contribute to the observed trend of *OH adsorption. The Bayesian-optimized orbital coupling coefficients of 3σ and 4σ^* are rather small (0.12 and 0.001, respectively, as shown in Supplementary Fig. 5), supporting unfavorable orbital overlaps with the d-states. This rationalizes the observation that *OH prefers the nearly parallel adsorption geometry on most of the d-metals to maximize the interaction of the 1π orbital with metal d-states, while *OH on Na(111) adsorbs more strongly in an upright orientation because of a lack of such directional interactions. This orbitalwise insight into chemical bonding could provide guidance in tailoring orbital-specific characteristics of the metal d-band for desired catalytic properties through site engineering. Although the discussion here focuses exclusively on d-metals, it is possible to extend the Bayeschem framework to p-block metals and alloys (see Supplementary Fig. 11), unifying the reactivity theory of metal surfaces.

To conclude, we present the first Bayesian model of chemisorption by learning from ab initio adsorption properties. The model leverages the well-established d-band reactivity theory and a Newns–Anderson-type Hamiltonian for capturing essential physics of chemisorption processes. We demonstrated that the Bayeschem models of descriptor species, e.g., *O and *OH, optimized with pristine transition-metal data predict adsorption energies at a diverse range of atomically tailored metal sites with an MAE of ~0.1–0.2 eV while providing uncertainty quantification. Incorporation of physics-based models into data-driven ML algorithms, e.g., deep learning, might hold promise for developing highly accurate yet interpretable reactivity models. Furthermore, this conceptual framework can be broadly applied to unravel orbital-specific factors governing adsorbate–substrate interactions, paving the path toward design strategies that go beyond adsorption-energy scaling limitations in catalysis.

Methods

DFT calculations

Spin-polarized DFT calculations were performed with Quantum ESPRESSO³⁹ using ultrasoft pseudopotentials. The exchange-correlation functional was approximated within the generalized gradient approximation (GGA) using the Perdew–Burke–Ernzerhof (PBE) functional⁴⁰. {111}-terminated metal surfaces were modeled using (2 × 2) supercells with four layers and a vacuum of 15 Å between two images. The bottom two layers were fixed while the top two layers and adsorbates were allowed to relax until a force criterion of 0.1 eV/Å was met. A plane-wave energy cutoff of 500 eV was used. A Monkhorst–Pack mesh of 6 × 6 × 1 was used to sample the Brillouin zone, while for molecules and radicals only the Gamma point was used. Gas-phase species of O and OH were used as the reference for adsorption energies of *O and *OH, respectively. The projected atomic and molecular density of states were obtained by projecting the eigenvectors of the full system, at a denser k-point sampling (12 × 12 × 1) with an energy spacing of 0.01 eV, onto those of the isolated subsystem, as determined by gas-phase calculations. The convergence of DFT calculations was thoroughly tested to be within 0.05 eV. Further details and tabulated data can be found in ref. ⁹.

The d-band reactivity theory

To revisit the d-band theory of chemisorption along with new developments, let’s consider a metal substrate M in which electrons occupy a set of continuous states with one-electron wavefunctions $\left|k\right\rangle$ and eigenenergies ϵ_k, and an isolated adsorbate species A with a valence electron described by an atomic wavefunction $\left|a\right\rangle$ at ${\epsilon }_{{\mathrm{a}}}^{0}$, see Fig. 1. When the adsorbate is brought close to the substrate, the two sets of states will overlap and hybridize with each other. The strength of such interactions is determined by the coupling integral ${V}_{{\mathrm{ak}}}\,=\,\langle a| \hat{{\mathcal{H}}}| k\rangle$, where $\hat{{\mathcal{H}}}$ is the system Hamiltonian. Within the Newns–Anderson model of chemisorption^17,18,19, $\hat{{\mathcal{H}}}$ is defined as,

$$\hat{{\mathcal{H}}}=\mathop{\sum }\limits_{\sigma }\left\{{\epsilon }_{{\mathrm{a}}\sigma }{n}_{{\mathrm{a}}\sigma }+\mathop{\sum }\limits_{{\mathrm{k}}}{\epsilon }_{{\mathrm{k}}}{n}_{{\mathrm{k}}\sigma }+\mathop{\sum }\limits_{{\mathrm{k}}}({V}_{{\mathrm{ak}}}{c}_{{\mathrm{k}}\sigma }^{\dagger }{c}_{{\mathrm{a}}\sigma }+H.c.)\right\},$$

(1)

where σ denotes the electron spin, n is the orbital occupancy operator, and c^† and c represent the creation and annihilation operators, respectively. The first two terms in Eq. (1) are the one-electron energies of the adsorbate and the substrate when they are infinitely separated in space. The last term captures the coupling, or intuitively electron hopping, between the adsorbate orbital $\left|a\right\rangle$ and a continuum of substrate states $\left|k\right\rangle$. If the one-electron states of the whole system can be described as a linear combination of the unperturbed adsorbate and substrate states, the one-electron Schrödinger equation can be solved using the Green’s function approach¹⁸. In Fig. 1, we illustrate the chemisorption process of a simple adsorbate onto a d-block metal site characterized by delocalized sp-states and localized d-states²¹. The interaction of the adsorbate state at ${\epsilon }_{{\mathrm{a}}}^{0}$ with the structureless sp-states, typically accompanied by electron transfer from/to the Fermi sea, results in a broadened resonance (the so-called renormalized adsorbate state) at an effective energy level ϵ_a. Conceptually viewing chemical bonding as consecutive steps in Fig. 1, the renormalized adsorbate state then couples with the narrowly distributed d-states, shifting up in energy due to orbital orthogonalization, which increases the kinetic energy of electrons, and splitting into bonding and antibonding states. One important piece of information from this framework is the evolving density of states projected onto the adsorbate orbital $\left|a\right\rangle$ upon adsorption

$${\rho }_{{\mathrm{a}}}(\epsilon )\,=\,\frac{1}{\pi }\frac{\Delta (\epsilon )}{{\left[\epsilon -({\epsilon }_{{\mathrm{a}}}+\Lambda (\epsilon ))\right]}^{2}\,+\,\Delta {(\epsilon )}^{2}},$$

(2)

in which spin is neglected for simplicity. The effective adsorbate energy level, ϵ_a, is determined by the image potential of a charged particle in front of conducting surfaces and the Coulomb repulsion between electrons in the same orbital¹⁸. The chemisorption function Δ(ϵ) includes contributions from the sp-states and the d-states

$$\Delta (\epsilon )\,=\,\pi \mathop{\sum }\limits_{{\mathrm{k}}}{V}_{{\mathrm{ak}}}^{2}\delta (\epsilon -{\epsilon }_{{\mathrm{k}}})\,=\,{\Delta }_{0}\,+\,{\Delta }_{{\mathrm{d}}}.$$

(3)
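Eqs. (2) and (3) can be evaluated numerically on a uniform energy grid. The sketch below is a plain-Python illustration rather than the Cython routines of Bayeschem. It assumes the narrow-band form Δ_d ≃ πV²ρ_d(ϵ) used in this work, a normalized semi-elliptic band as a hypothetical stand-in for the DFT-projected ρ_d, a simple direct-sum principal-value rule for the Hilbert transform, and made-up parameter values (`eps_a`, `delta_0`, `v2`).

```python
import math


def semi_elliptic_dos(energies, center, half_width):
    """Normalized semi-elliptic d-band (hypothetical stand-in for a DFT-projected rho_d)."""
    dos = []
    for e in energies:
        x = (e - center) / half_width
        inside = abs(x) < 1.0
        dos.append(2.0 / (math.pi * half_width) * math.sqrt(1.0 - x * x) if inside else 0.0)
    return dos


def hilbert_transform(values, energies):
    """Lambda(e) = (1/pi) P-integral of values(e') / (e - e') de' on a uniform grid."""
    de = energies[1] - energies[0]
    result = []
    for i, e in enumerate(energies):
        # Skipping the point j == i is a crude principal-value rule, adequate for a sketch.
        total = sum(v / (e - ej) for j, (v, ej) in enumerate(zip(values, energies)) if j != i)
        result.append(total * de / math.pi)
    return result


def adsorbate_dos(energies, rho_d, eps_a, delta_0, v2):
    """Projected density of states on the adsorbate orbital, Eq. (2)."""
    delta_d = [math.pi * v2 * r for r in rho_d]      # Delta_d ~ pi * V^2 * rho_d
    lam = hilbert_transform(delta_d, energies)       # a constant Delta_0 adds nothing to Lambda
    delta = [delta_0 + d for d in delta_d]           # Eq. (3)
    return [
        dl / math.pi / ((e - eps_a - lm) ** 2 + dl ** 2)
        for e, dl, lm in zip(energies, delta, lam)
    ]


step = 0.05
energies = [-15.0 + step * i for i in range(601)]    # energy grid on [-15, 15] eV
rho_d = semi_elliptic_dos(energies, center=-2.0, half_width=3.0)
rho_a = adsorbate_dos(energies, rho_d, eps_a=-5.0, delta_0=1.0, v2=2.0)
norm = sum(rho_a) * step                             # slightly below 1: Lorentzian tails are cut off
print(f"integrated rho_a on the grid = {norm:.3f}")
```

Because the Hilbert transform of an energy-independent constant vanishes, only the d-state part of Δ(ϵ) shifts the adsorbate level, while Δ₀ only broadens it.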

To simplify the matter, only the second-step interaction, i.e., the coupling of the renormalized adsorbate state with the substrate d-states, is explicitly considered in Eq. (2). As a new development in our approach, we include an energy-independent constant Δ₀ along with Δ_d in the chemisorption function Δ(ϵ). The inclusion of Δ₀ provides a lifetime broadening of the adsorbate state, serving as a mathematical device to avoid burdensome sampling of the resonance, i.e., the Lorentzian distribution ${\tilde{\rho }}_{{\mathrm{a}}}$ from the first-step interaction in Fig. 1. Accordingly, ϵ_a represents the renormalized adsorbate state. Owing to the narrowness of a typical metal d-band, Δ_d can be simplified as the projected density of d-states onto the metal site ρ_d(ϵ) modulated by an effective coupling integral squared V², i.e., Δ_d ≃ πV²ρ_d(ϵ). Λ(ϵ) is the Hilbert transform of Δ(ϵ). In this framework, the interaction energy between the adsorbate and the substrate can be partitioned into two contributions, i.e., ΔE₀ and ΔE_d. ΔE₀ is the energy change due to the interaction of the unperturbed adsorbate orbital(s) with the delocalized sp-states, while ΔE_d is the energy contribution from further interactions with the localized d-states of the substrate. Since all d-block metals have a similar, free-electron-like sp-band, ΔE₀ can be approximated as a surface-independent constant, albeit the largest contribution to bonding²¹. To calculate ΔE_d, we include both the attractive orbital hybridization $\Delta {E}_{{\mathrm{d}}}^{{\mathrm{hyb}}}$ and repulsive orbital orthogonalization $\Delta {E}_{{\mathrm{d}}}^{{\mathrm{orth}}}$^29,41:

$$\begin{array}{l}\Delta {E}_{{\mathrm{d}}}^{{\mathrm{hyb}}}\,=\,\frac{2}{\pi }\mathop{\int}\nolimits_{-\infty }^{{\epsilon }_{{\rm{F}}}}{\tan }^{-1}\left[\frac{\Delta (\epsilon )}{\epsilon -{{\epsilon }}_{{\mathrm{a}}}-\Lambda (\epsilon )}\right]d\epsilon -\frac{2}{\pi }\mathop{\int}\nolimits_{-\infty }^{{\epsilon }_{{\rm{F}}}}{\tan }^{-1}\left[\frac{{\Delta }_{0}(\epsilon )}{\epsilon -{{\epsilon }}_{{\mathrm{a}}}}\right]d\epsilon \\ \Delta {E}_{{\mathrm{d}}}^{{\mathrm{orth}}}\,=\,2(\langle {\tilde{n}}_{{\mathrm{a}}}\rangle +f)\alpha \beta {V}_{{\mathrm{ad}}}^{2}.\hfill\end{array}$$

(4)

The constant 2 accounts for the spin degeneracy of the orbital, $\langle {\tilde{n}}_{{\mathrm{a}}}\rangle$ is the occupancy of the renormalized adsorbate state obtained by integrating the Lorentzian distribution ${\tilde{\rho }}_{{\mathrm{a}}}$ up to the Fermi level ϵ_F (taken as 0), and f is the idealized d-band filling of the metal atom. The ${\tan }^{-1}$ is defined to lie between −π and 0 since Δ₀ is a nonzero constant across the energy scale [−15, 15] eV. Thus there is no need to explicitly include localized states even if present below or above the d-band. In Eq. (4), α is termed the orbital overlap coefficient, i.e., S ≈ α∣V∣, in which the overlap integral S is linearly proportional to the coupling integral V for a given orbital. Similarly, the effective coupling integral squared V² can be written as $\beta {V}_{{\mathrm{ad}}}^{2}$, where β denotes the orbital coupling coefficient and ${V}_{{\mathrm{ad}}}^{2}$ characterizes the interorbital coupling strength when the bonding atoms are aligned along the z-axis at a given distance⁴². Its values for d-block metals relative to that of Cu are readily available in the Solid State Table²². It is important to note that β enters the chemisorption function, which determines both the adsorption energy and the adsorbate density of states, whereas α only affects the orbital orthogonalization energy since overlap was not explicitly considered.
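Two ingredients of Eq. (4) have simple closed forms and can be sketched directly: the branch of ${\tan }^{-1}$ restricted to (−π, 0), and the orthogonalization term with $\langle {\tilde{n}}_{{\mathrm{a}}}\rangle$ obtained by integrating a Lorentzian of half-width Δ₀ centered at ϵ_a up to ϵ_F. The function names and the numerical inputs below are illustrative assumptions, not values from this work.

```python
import math


def branch_arctan(delta, x):
    """arctan(delta / x) evaluated on the branch (-pi, 0); valid for delta > 0."""
    return math.atan2(delta, x) - math.pi


def renormalized_occupancy(eps_a, delta_0, eps_fermi=0.0):
    """Per-spin occupancy: Lorentzian resonance integrated from -infinity to the Fermi level."""
    return 0.5 + math.atan((eps_fermi - eps_a) / delta_0) / math.pi


def orthogonalization_energy(eps_a, delta_0, filling, alpha, beta, v_ad2):
    """Repulsive term of Eq. (4): 2 (<n_a> + f) alpha beta V_ad^2."""
    n_a = renormalized_occupancy(eps_a, delta_0)
    return 2.0 * (n_a + filling) * alpha * beta * v_ad2


# Illustrative inputs only (hypothetical, not fitted parameters).
e_orth = orthogonalization_energy(eps_a=0.0, delta_0=1.0, filling=0.9,
                                  alpha=0.1, beta=2.0, v_ad2=1.0)
print(f"orthogonalization energy = {e_orth:.2f} eV")
```

With Δ > 0, `math.atan2(delta, x)` lies in (0, π), so subtracting π places the phase in (−π, 0) for either sign of the denominator, which is the convention stated above.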

Bayesian learning

Due to the computationally intensive nature of the MCMC algorithm, there is a need for a more efficient implementation of the Newns–Anderson model than what can be obtained with Python and standard libraries like SciPy and NumPy. We make extensive use of Cython, a C-extension language for Python, to speed up (by 10–1000 times) some CPU-intensive functions in the model, e.g., the Hilbert transform. To perform MCMC sampling, we use PyMC, a flexible and extensible Python package which includes a wide selection of built-in statistical distributions and sampling algorithms⁴³, e.g., Metropolis–Hastings. A “burn-in” of the first half of the samples and then thinning (1 out of every 5 samples) were performed to ensure that the retained samples are representative of the posterior distribution. Convergence of our MCMC-based sampling was verified using parallel chains²⁸. The MCMC sampling results can be directly visualized using corner, an open-source Python module. We used Normal distributions for real-valued variables unrestricted in sign, LogNormal for non-negative parameters, and Uniform for others. ΔE₀ and ϵ_a can be estimated from DFT calculations of the adsorbate on a simple metal, e.g., sodium (Na) in the face-centered cubic (fcc) phase. Specifically, for *O, we used ΔE₀ ~ N(−5.0, 1), ϵ_a ~ N(−5, 1), Δ₀ ~ LN(1, 0.25), β ~ LN(2, 1), and α ~ U(0, 1). For *OH, we used ΔE₀ ~ N(−3.0, 1), ${\epsilon }_{{\mathrm{a}}}^{3\sigma } \sim N(-6,1)$, ${\epsilon }_{{\mathrm{a}}}^{1\pi } \sim N(-2,1)$, and ${\epsilon }_{{\mathrm{a}}}^{4{\sigma }^{* }} \sim N(4,1)$. We assume that the predicted adsorption properties from Eqs. (2) and (4) are subject to independent normal errors. Specifically, for the property Y and the surface i we have

$${Y}_{{\mathrm{i}}}={\hat{Y}}_{{\mathrm{i}}}(\overrightarrow{\theta })+\sigma {\epsilon }_{{\mathrm{i}}},\,i=1,\,2,\,\ldots ,\,n,$$

(5)

where ϵ_i is an independent and standard normal random variable and σ is the standard deviation, allowing for a mismatch between the model prediction ${\hat{Y}}_{{\mathrm{i}}}(\overrightarrow{\theta })$ and the DFT ground truth Y_i. In this approach, we define the likelihood function of the property Y from n observations⁴⁴

$$P(Y| \overrightarrow{\theta },\,\sigma )\propto {\sigma }^{-n}\exp \left[-\frac{1}{2{\sigma }^{2}}\mathop{\sum }\limits_{i = 1}^{n}{\left\{{Y}_{{\mathrm{i}}}-{\hat{Y}}_{{\mathrm{i}}}(\overrightarrow{\theta })\right\}}^{2}\right],$$

(6)

where the sum runs over n training samples for the property Y, which is either the projected density of states onto an adsorbate orbital or the adsorption energy. For adsorption energies, Y_i and ${\hat{Y}}_{{\mathrm{i}}}$ are unambiguous scalar values. The projected density of states, in contrast, is a vector of paired values, i.e., the one-electron energy of a state and its probability density, and thus requires clarification: the mean squared residual of the model prediction from Eq. (2) for the surface i is used as ${\{{Y}_{{\mathrm{i}}}-{\hat{Y}}_{{\mathrm{i}}}(\overrightarrow{\theta })\}}^{2}$ in Eq. (6). To compute the transition probability of each MCMC step, we define the objective as the sum of the negative logarithms of the likelihood functions for the projected density of states onto each adsorbate orbital and for the binding energies, with a hyperparameter λ adjusting the relative weight of the two contributions, i.e., $-{\mathrm{ln}}\,({P}_{\Delta {\mathrm{E}}})-\lambda \sum {\mathrm{ln}}\,({P}_{{\rho }_{{\mathrm{a}}}})$. To optimize this parameter, we varied it on a grid of 1.0e−3, 1.0e−2, 1.0e−1, and 1, and found that 1.0e−2 gives the best performance in adsorption energy prediction.
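As an illustration of the sampling procedure described above, the following is a minimal sketch and not the Bayeschem implementation. It uses a hand-written random-walk Metropolis sampler in NumPy instead of PyMC, a flat prior, a fixed σ, and a toy one-parameter linear model in place of the Newns–Anderson model; all of these are assumptions made for brevity. The function names (`neg_log_likelihood`, `total_objective`, `metropolis`) are hypothetical and do not come from the repository.

```python
import numpy as np


def neg_log_likelihood(sq_residuals, sigma):
    """Negative log of Eq. (6), up to an additive constant.

    sq_residuals holds {Y_i - Yhat_i}^2 for each surface i. For a projected
    density of states, each entry would be the mean squared residual over
    the energy grid of that surface.
    """
    sq = np.asarray(sq_residuals, dtype=float)
    return sq.size * np.log(sigma) + sq.sum() / (2.0 * sigma**2)


def total_objective(nll_energy, nll_dos_per_orbital, lam=1.0e-2):
    """-ln(P_dE) - lam * sum(ln(P_rho_a)), given negative log-likelihoods."""
    return nll_energy + lam * sum(nll_dos_per_orbital)


def metropolis(neg_log_post, theta0, n_steps, step, rng):
    """Random-walk Metropolis sampler for a single scalar parameter."""
    theta = float(theta0)
    u = neg_log_post(theta)
    chain = np.empty(n_steps)
    for k in range(n_steps):
        proposal = theta + step * rng.normal()
        u_new = neg_log_post(proposal)
        # Accept with probability min(1, exp(u - u_new)).
        if np.log(rng.uniform()) < u - u_new:
            theta, u = proposal, u_new
        chain[k] = theta
    return chain


# Toy data (assumption): y = theta * x with theta = 2 and Gaussian noise.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
sigma = 0.1
y = 2.0 * x + sigma * rng.normal(size=x.size)


def toy_neg_log_post(theta):
    # Flat prior (assumption), so the posterior is proportional to Eq. (6).
    return neg_log_likelihood((y - theta * x) ** 2, sigma)


chain = metropolis(toy_neg_log_post, theta0=0.0, n_steps=4000, step=0.1, rng=rng)

# Discard the first half as burn-in, then keep 1 out of every 5 samples.
samples = chain[chain.size // 2:][::5]
print(samples.mean(), samples.std())
```

In this sketch the retained samples should scatter around the slope used to generate the toy data. In a real application, `toy_neg_log_post` would be replaced by the weighted objective from `total_objective` plus the negative log-priors of the model parameters.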

Data availability

The training data of metal surfaces used for model development are available at the GitHub repository https://github.com/hlxin/bayeschem, while the test data are from the article https://doi.org/10.1039/C7TA01812F.

Code availability

The complete code of Bayeschem is publicly available at the GitHub repository https://github.com/hlxin/bayeschem.

References

1. Nilsson, A., Pettersson, L. & Nørskov, J. K. Chemical Bonding at Surfaces and Interfaces (Elsevier, Amsterdam, Oxford, 2008).
2. Somorjai, G. A. & Li, Y. Introduction to Surface Chemistry and Catalysis (Wiley, Hoboken, 2010).
3. Calle-Vallejo, F. et al. Number of outer electrons as descriptor for adsorption processes on transition metals and their oxides. Chem. Sci. 4, 1245–1249 (2013).
4. Tong, Y. Y., Renouprez, A. J., Martin, G. A. & van der Klink, J. J. In Studies in Surface Science and Catalysis (eds Hightower, J. W. et al.) Vol. 101, 901–910 (Elsevier, Amsterdam, 1996).
5. Hammer, B. & Nørskov, J. K. Electronic factors determining the reactivity of metal surfaces. Surf. Sci. 343, 211–220 (1995).
6. Vojvodic, A., Nørskov, J. K. & Abild-Pedersen, F. Electronic structure effects in transition metal surface chemistry. Top. Catal. 57, 25–32 (2014).
7. Xin, H., Vojvodic, A., Voss, J., Nørskov, J. K. & Abild-Pedersen, F. Effects of d-band shape on the surface reactivity of transition-metal alloys. Phys. Rev. B Condens. Matter 89, 115114 (2014).
8. Ma, X., Li, Z., Achenie, L. E. K. & Xin, H. Machine-learning-augmented chemisorption model for CO₂ electroreduction catalyst screening. J. Phys. Chem. Lett. 6, 3528–3533 (2015).
9. Li, Z., Wang, S., Chin, W. S., Achenie, L. E. & Xin, H. High-throughput screening of bimetallic catalysts enabled by machine learning. J. Mater. Chem. A 5, 24131–24138 (2017).
10. Tran, K. & Ulissi, Z. W. Active learning across intermetallics to guide discovery of electrocatalysts for CO₂ reduction and H₂ evolution. Nat. Catal. 1, 696–703 (2018).
11. Palizhati, A., Zhong, W., Tran, K., Back, S. & Ulissi, Z. W. Towards predicting intermetallics surface properties with high-throughput DFT and convolutional neural networks. J. Chem. Inf. Model. 59, 4742–4749 (2019).
12. Back, S. et al. Convolutional neural network of atomic surface structures to predict binding energies for high-throughput screening of catalysts. J. Phys. Chem. Lett. 10, 4401–4408 (2019).
13. Andersen, M., Levchenko, S. V., Scheffler, M. & Reuter, K. Beyond scaling relations for the description of catalytic materials. ACS Catal. 9, 2752–2759 (2019).
14. Gu, G. H. et al. Practical deep-learning representation for fast heterogeneous catalyst screening. J. Phys. Chem. Lett. 11, 3185–3191 (2020).
15. Montemore, M. M., Nwaokorie, C. F. & Kayode, G. O. General screening of surface alloys for catalysis. Catal. Sci. Technol. 10, 4467–4476 (2020).
16. Esterhuizen, J. A., Goldsmith, B. R. & Linic, S. Theory-guided machine learning finds geometric structure-property relationships for chemisorption on subsurface alloys. Chem 6, 3100–3117 (2020).
17. Anderson, P. W. Localized magnetic states in metals. Phys. Rev. 124, 41 (1961).
18. Edwards, D. M. & Newns, D. M. Electron interaction in the band theory of chemisorption. Phys. Lett. A 24, 236–237 (1967).
19. Grimley, T. B. The indirect interaction between atoms or molecules adsorbed on metals. Proc. Phys. Soc. Lond. 90, 751 (1967).
20. Bayes, T. & Price, N. LII. An essay towards solving a problem in the doctrine of chances. By the late Rev. Mr. Bayes, F. R. S. Communicated by Mr. Price, in a letter to John Canton, A. M. F. R. S. Philos. Trans. R. Soc. Lond. 53, 370–418 (1763).
21. Hammer, B., Morikawa, Y. & Nørskov, J. K. CO chemisorption at metal surfaces and overlayers. Phys. Rev. Lett. 76, 2141 (1996).
22. Harrison, W. A. Electronic Structure and the Properties of Solids: The Physics of the Chemical Bond (Dover Publications, New York, 1989).
23. Santos, E., Quaino, P. & Schmickler, W. Theory of electrocatalysis: hydrogen evolution and more. Phys. Chem. Chem. Phys. 14, 11224–11233 (2012).
24. Román, A. M., Dudoff, J., Baz, A. & Holewinski, A. Identifying “optimal” electrocatalysts: impact of operating potential and charge transfer model. ACS Catal. 7, 8641–8652 (2017).
25. Mebane, D. S. et al. Bayesian calibration of thermodynamic models for the uptake of CO₂ in supported amine sorbents using ab initio priors. Phys. Chem. Chem. Phys. 15, 4355–4366 (2013).
26. Wellendorff, J. et al. Density functionals for surface science: exchange-correlation model development with Bayesian error estimation. Phys. Rev. B Condens. Matter 85, 235149 (2012).
27. Walker, E. A., Mitchell, D., Terejanu, G. A. & Heyden, A. Identifying active sites of the water–gas shift reaction over titania supported platinum catalysts under uncertainty. ACS Catal. 8, 3990–3998 (2018).
28. Gamerman, D. & Lopes, H. F. Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference 2nd Edn (Chapman and Hall/CRC, London, 2006).
29. Hammer, B. & Nørskov, J. K. In Chemisorption and Reactivity on Supported Clusters and Thin Films: Towards an Understanding of Microscopic Processes in Catalysis (eds Lambert, R. M. & Pacchioni, G.), 285–351 (Springer, Dordrecht, 1997).
30. Wandelt, K. Photoemission studies of adsorbed oxygen and oxide layers. Surf. Sci. Rep. 2, 1–121 (1982).
31. Abild-Pedersen, F. et al. Scaling properties of adsorption energies for hydrogen-containing molecules on transition-metal surfaces. Phys. Rev. Lett. 99, 016105 (2007).
32. Xin, H. & Linic, S. Communications: exceptions to the d-band model of chemisorption on metal surfaces: the dominant role of repulsion between adsorbate states and metal d-states. J. Chem. Phys. 132, 221101–221104 (2010).
33. Xin, H., Holewinski, A. & Linic, S. Predictive structure–reactivity models for rapid screening of Pt-based multimetallic electrocatalysts for the oxygen reduction reaction. ACS Catal. 2, 12–16 (2012).
34. Tang, M. T., Peng, H., Lamoureux, P. S., Bajdich, M. & Abild-Pedersen, F. From electricity to fuels: descriptors for C1 selectivity in electrochemical CO₂ reduction. Appl. Catal. B 279, 119384 (2020).
35. Strmcnik, D. et al. Improving the hydrogen oxidation reaction rate by promotion of hydroxyl adsorption. Nat. Chem. 5, 300–306 (2013).
36. Li, Z., Ma, X. & Xin, H. Feature engineering of machine-learning chemisorption models for catalyst design. Catal. Today 280, 232–238 (2017).
37. Thirumalai, H. & Kitchin, J. R. Investigating the reactivity of single atom alloys using density functional theory. Top. Catal. 61, 462–474 (2018).
38. Greiner, M. T. et al. Free-atom-like d states in single-atom alloy catalysts. Nat. Chem. 10, 1008–1015 (2018).
39. Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J. Phys. Condens. Matter 21, 395502 (2009).
40. Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
41. Vojvodic, A., Nørskov, J. K. & Abild-Pedersen, F. Electronic structure effects in transition metal surface chemistry. Top. Catal. 57, 25–32 (2014).
42. Ma, X. & Xin, H. Orbitalwise coordination number for predicting adsorption properties of metal nanocatalysts. Phys. Rev. Lett. 118, 036101 (2017).
43. Patil, A., Huard, D. & Fonnesbeck, C. J. PyMC: Bayesian stochastic modelling in Python. J. Stat. Softw. 35, 1–81 (2010).
44. Baggaley, A. W., Sarson, G. R., Shukurov, A., Boys, R. J. & Golightly, A. Bayesian inference for a wave-front model of the neolithization of Europe. Phys. Rev. E 86, 016105 (2012).

Acknowledgements

S.W., H.S.P., and H.X. acknowledge financial support from the NSF CAREER program (CBET-1845531). The computational resources used in this work were provided by Advanced Research Computing at Virginia Polytechnic Institute and State University. H.X. acknowledges the insightful discussion with Prof. John Kitchin from Carnegie Mellon University that inspired the work.

Author information

These authors contributed equally: Siwen Wang, Hemanth Somarajan Pillai.

Authors and Affiliations

Department of Chemical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Siwen Wang, Hemanth Somarajan Pillai & Hongliang Xin

Authors

Siwen Wang
Hemanth Somarajan Pillai
Hongliang Xin

Contributions

S.W. and H.S.P. contributed equally to the work. H.X. supervised the research. S.W. and H.X. conceived the idea and designed the general approach. S.W. and H.S.P. conducted the DFT calculations and coding and performed the detailed analysis. All authors revised the manuscript.

Corresponding author

Correspondence to Hongliang Xin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Christopher Bartel and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, S., Pillai, H.S. & Xin, H. Bayesian learning of chemisorption for bridging the complexity of electronic descriptors. Nat Commun 11, 6132 (2020). https://doi.org/10.1038/s41467-020-19524-z

Received: 24 July 2020
Accepted: 12 October 2020
Published: 30 November 2020
DOI: https://doi.org/10.1038/s41467-020-19524-z
