Deducing subnanometer cluster size and shape distributions of heterogeneous supported catalysts

Liao, Vinson; Cohen, Maximilian; Wang, Yifan; Vlachos, Dionisios G.

doi:10.1038/s41467-023-37664-w

Article
Open access
Published: 08 April 2023

Nature Communications volume 14, Article number: 1965 (2023)

Abstract

Infrared (IR) spectra of adsorbate vibrational modes are sensitive to adsorbate/metal interactions, accurate, and easily obtainable in-situ or operando. While they are the gold standards for characterizing single-crystals and large nanoparticles, analogous spectra for highly dispersed heterogeneous catalysts consisting of single-atoms and ultra-small clusters are lacking. Here, we combine data-based approaches with physics-driven surrogate models to generate synthetic IR spectra from first-principles. We bypass the vast combinatorial space of clusters by determining viable, low-energy structures using machine-learned Hamiltonians, genetic algorithm optimization, and grand canonical Monte Carlo calculations. We obtain first-principles vibrations on this tractable ensemble and generate single-cluster primary spectra analogous to pure component gas-phase IR spectra. With such spectra as standards, we predict cluster size distributions from computational and experimental data, demonstrated in the case of CO adsorption on Pd/CeO₂(111) catalysts, and quantify uncertainty using Bayesian Inference. We discuss extensions for characterizing complex materials towards closing the materials gap.

Introduction

Actual catalytic materials are inherently heterogeneous and consist of a distribution of sites, sizes, and shapes. Supported single-atom (SA) and subnanometer cluster catalysts have been of great interest due to their reduction in cost coupled with their notable catalytic activity and selectivity in many relevant chemistries, including, but not limited to, hydrogenation, oxidation, hydroformylation, reforming, and C-C coupling reactions^1,2,3. Advances in microscopy applied to single-atom catalysts^4,5 co-existing with small clusters have revealed the complexity of these materials and their dynamic nature, especially under working conditions. Characterization, i.e., elucidating the distributions and structure-dependent catalytic performances⁶, is challenging due to many factors such as low metal loadings⁷, poor instrumental signal-to-noise ratios (SNR), limitations of characterization techniques, the inapplicability of certain operando measurements⁸, and the inherent heterogeneity of the materials. Advances in addressing these challenges are imperative to improving catalyst characterization and eventually catalyst performance^9,10.

Excitations, probed via infrared (IR) spectroscopy¹¹, are sensitive to interactions between adsorbates and metals, and have been extensively used to study the structure of metal oxides, supported metal particles and metal oxides, as well as single-atom catalysts^12,13,14. IR measurements can accurately probe adsorbate normal vibrational modes, account for coverage effects, and be performed operando. Most IR-based peaks, however, are typically assigned heuristically for relatively simple spectra following the gold standard of well-defined single crystals. Inorganic complexes in the form of homogeneous catalysts have also served as molecular analogs to mononuclear metal active sites of SA catalysts to aid in peak identification^15,16,17. However, IR-deduced detailed characterization of real-world catalysts is lacking¹⁸ due to strong interactions of the highly undercoordinated metal atoms with the support^19,20,21, resulting in each cluster size and shape giving a different signal that is difficult to distinguish in the sampled spectra.

First-principles calculations can help with peak interpretation, but models are limited and often consider a single active site on a well-defined crystallographic plane. The disparity between simple models and real-world working materials is reminiscent of the well-known materials gap^22,23. Current IR quantification methodologies to bridge this gap have found limited applicability to real-world catalysts, as they have mainly been restricted to spectra obtained from large nanoparticles (NPs). A framework introduced by Lansford et al. is restricted to spectra obtained from unsupported NPs¹⁸, and predicts the fraction of planes and adsorbate site-types, but is unable to distinguish the heterogeneity in the distributions of clusters. Kale et al. utilized site-specific extinction coefficients with peak deconvolution, interaction, and a priori assumptions about nanoparticle size and coverage to determine the catalyst active sites²⁴, but this approach is again limited to NPs on the order of tens of nanometers in diameter.

Here, we develop a two-step framework to interpret and deconvolute complex IR spectra of supported single-atoms and subnanometer cluster catalysts exposed to adsorbates using first-principles spectroscopies and data-based methods. We introduce a methodology to mitigate the computational cost of isomeric combinatorial search by predicting an ensemble of low-energy (CO)_m/Pd_n structures under working conditions that contributes maximally to the spectroscopic signature. We utilize first-principles density-functional theory (DFT) calculations coupled with signal processing techniques to generate realistic, single-cluster primary spectra analogous to pure component spectra in gas-phase IR spectroscopy^25,26 for this ensemble. These primary spectra serve as calibration standards. We utilize a physics-driven surrogate model to construct realistic synthetic spectra that account for coverage effects to benchmark spectra deconvolution. Finally, we perform spectra deconvolution of synthetic and experimental spectra within the Bayesian Inference framework to predict cluster size distributions and quantify uncertainty stemming from DFT errors and noise. We derive a criterion for matching modeled and observed spectra using the signal-to-noise ratio (SNR). We discuss the applications to characterize complex materials under working conditions to close the materials gap. We benchmark our methodology on Pd_n/CeO₂(111) (n = 1–20) exposed to carbon monoxide (CO). Our framework can accurately predict cluster size and shape distributions for both synthetic and experimental spectra and is robust to overfitting spectral peaks to noise. Our results obtained directly from the deconvolution of IR spectra with little to no a priori assumptions are consistent with those made from other characterization techniques. The methodology is an important tool in catalyst characterization toward closing the materials gap.

Results and discussion

Modeling overview

Here, we provide an overview of our framework for determining the sizes and shapes of supported subnanometer clusters exposed to adsorbates directly from IR spectra. Our methodology is inspired by the deconvolution of gas and liquid-phase IR spectra composed of a linear combination of pure component spectra, a consequence of the Beer-Lambert Law. The linear contribution of each component is traditionally solved through a system of linear equations via least-squares fitting. Pure component calibration spectra can be easily obtained for gas and liquid phase species (from an appropriate vendor, for example) but are almost impossible to obtain for heterogeneous catalysts due to the difficulty in synthesizing samples with atomic uniformity.
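
As a minimal sketch of the classical least-squares deconvolution described above, the following Python snippet fits a mixture spectrum as a linear combination of two pure-component spectra by solving the normal equations. The spectra, the function names, and the 0.7/0.3 mixture are hypothetical toy values chosen for illustration; they are not taken from this work.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def least_squares_fractions(pure_spectra, mixture):
    """Fit mixture ~ sum_i c_i * pure_spectra[i] via the normal equations."""
    n = len(pure_spectra)
    gram = [
        [sum(p * q for p, q in zip(pure_spectra[i], pure_spectra[j]))
         for j in range(n)]
        for i in range(n)
    ]
    rhs = [sum(p * y for p, y in zip(pure_spectra[i], mixture))
           for i in range(n)]
    return solve_linear_system(gram, rhs)


# Hypothetical pure-component spectra on a four-point wavenumber grid.
pure_a = [0.0, 1.0, 0.2, 0.0]
pure_b = [0.0, 0.1, 1.0, 0.3]
mixture = [0.7 * a + 0.3 * b for a, b in zip(pure_a, pure_b)]

fractions = least_squares_fractions([pure_a, pure_b], mixture)
print(fractions)
```

Plain least squares of this kind returns point estimates only and does not constrain the fractions to be non-negative.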

Our framework is composed of two major steps: (1) generation of calibration spectra from first principles (rather than experimentally) and (2) deconvolution of spectra. Given the lack of calibration standards for heterogeneous materials, our framework utilizes computational IR frequencies and intensities to generate calibration spectra. Each of these spectra, deemed primary spectra, reflects a catalyst sample composed of a single supported cluster isomer exposed to adsorbates. However, the number of cluster/adsorbate configurations even for a single size can be huge. For instance, we estimate that computing the primary spectra for every possible isomer of Pd₂₀/CeO₂ saturated with CO would take years. We bypass this combinatorial search by computing a low-energy ensemble of metal/adsorbate structures at working conditions for each cluster size using various machine learning and optimization techniques. This ensemble consists of low-energy structures that are thermodynamically favorable and is the subject of first principles primary spectra calculations. This step reduces the number of first principles calculations by many orders of magnitude. Experimental spectra of real materials are then deconvoluted by solving the system of linear equations associated with the Beer-Lambert Law within the Bayesian inference framework to predict cluster size and shape distributions and their associated uncertainties. The Bayesian approach, rather than the commonly used frequentist approach, propagates errors and uncertainties associated with first principles computed spectra. Figure 1 shows a schematic of the overall Bayesian spectra deconvolution framework. We benchmark our framework using a model system of Pd_n/CeO₂(111) (n = 1–20) exposed to saturated CO at 323 K.

**Fig. 1: Schematic of the Bayesian infrared spectra deconvolution procedure.**

Low-energy ensemble generation

The catalyst heterogeneity is evidenced by a distribution of cluster sizes and shapes for each respective size (hereafter, also called isomers or structures). The number of isomers grows exponentially with size, and each isomer exposes a distribution of sites for adsorption and reaction²⁷. The existence of multiple support facets and defects further enhances the heterogeneity of the material. Accounting for the combinatorics of all cluster structures and adsorbate configurations is challenging for any supported metal and adsorbate system. Determining structures directly from spectra requires solving an optimization problem to minimize the distance of computed and experimental spectra. For each trial structure generated during the optimization, adsorbate frequencies and intensities must be computed using DFT. This task is incredibly costly, and the direct structure-to-spectra matching approach is impractical. The heterogeneity of the catalyst implies that distributions rather than a single size and structure need to be accounted for, making optimization much harder. Furthermore, experimental spectrometers have limited resolution in the frequency domain, preventing the existence of an observable unique spectroscopic signature for each structure and rendering the deconvolution problem ill-posed (theoretically, with an infinite spectroscopic resolution, each potential adsorbate has a unique detectable spectroscopic signature).

To tackle these barriers, we determine the ensemble of low-energy metal/adsorbate configurations for each cluster size at a given temperature and CO partial pressure using a cluster genetic algorithm coupled with a Grand Canonical Monte Carlo (GCMC) algorithm²⁸. To achieve this, one needs to develop Hamiltonians describing the metal-support, metal-metal, metal-adsorbate, and adsorbate-adsorbate (lateral) interactions using DFT and machine learning. Machine learned Hamiltonians allow for the prediction of electronic energies of arbitrary CO-Pd/CeO₂ structures with a minimal amount of expensive first principles calculations. The GCMC algorithm effectively minimizes the Gibbs free energy to determine the structure of the metal cluster and the distribution of surface adsorbates simultaneously at a specified temperature and CO partial pressure. This simultaneous optimization is necessary as adsorbates significantly alter the cluster structures to create preferred low-energy sites. This optimization scheme is repeated for each cluster size up to 20 Pd atoms. The low Gibbs free energy structures of each size form the low-energy ensemble that contains the most abundant structures contributing maximally to the spectral intensity.

Figure 2a shows the most energetically stable cluster/adsorbate configurations at 323 K saturated with CO for Pd_n/CeO₂(111) for n = 5–20. We do not show Pd clusters smaller than 5 atoms as the number of possible isomers is minimal. Overall, the metal clusters have a flat or truncated pyramidal shape to maximize contact with the support especially as cluster size increases. The ratio of surface adsorbate coverage to the number of exposed surface metal atoms approaches 1:1. In addition, strong metal-support interactions also play a significant role in CO adsorption that is not captured in traditionally modeled extended surfaces. Our machine learned Hamiltonians, as well as Monte Carlo simulations, show that CO prefers to adsorb on (1) bridge and threefold sites to maximize metal coordination and (2) sites that are closer to the support for electronic stabilization. On average, our simulations show that clusters flatten under a CO environment, suggesting that the stabilization gained via the adsorption energy of CO serves as a thermodynamic driving force to offset the stability loss by overwetting of the cluster to the support.

**Fig. 2: Low energy structures versus Pd_n/CeO₂ cluster size for n = 5–20 at 323 K and saturated CO.**

Figure 2b shows the distributions of the Gibbs free energy normalized by the number of Pd atoms as a function of the cluster size. The free energies are referenced to a CO reservoir and calculated according to Eq. (2) of the Methods. The entropic contributions to the free energies can be decomposed into the respective configurational and vibrational contributions. We ignored vibrational entropy contributions to the free energy differences, as the change in vibrational entropy of adsorbed CO on different sites is typically less than 0.03 eV at 323 K on metals^29,30,31,32. Configurational entropy is explicitly accounted for by the Metropolis sampling scheme. The Gibbs free energies vary widely (from −3.0 to −1.0 eV/atom) for the same size clusters and with varying sizes due to the differences in the number of available surface sites and site-types for different isomers. Notably, the structures of the most stable Pd clusters with adsorbed CO differ from those of the bare clusters. For example, the most energetically stable isomer of bare Pd₂₀/CeO₂ becomes the 5th most stable isomer once CO is introduced. Literature supports the observed phenomenon; upon CO adsorption, Pd atoms diffuse and reconfigure, changing the observed structure^33,34,35,36.

To approximate the relative abundance of each cluster/adsorbate for a given size, we utilize a Boltzmann equilibrium. Figure 2c shows the ensemble probability density and Boltzmann probability density at 323 K for Pd₂₀/CeO₂ as a function of the normalized free energy (for Pd₅-Pd₁₉/CeO₂, refer to Fig. S1). Each point along the probability density curves represents a discrete minimum (CO)_m/Pd₂₀/CeO₂ configuration sampled in the GCMC algorithm. The former treats each discrete state as equally probable, and the latter weights each state by Boltzmann statistics. The two probability density curves coincide at the limit of infinite temperature. The shaded region represents the 95% integrated probability density of the Boltzmann curve, chosen because modern FTIR spectrometers with a resolution of 2 cm⁻¹ typically have a signal-to-noise ratio (SNR) on the order of 400 at the frequencies of the highest observed intensity peaks (i.e., C-O stretch region of 1600–2000 cm⁻¹). This corresponds to a signal to perceived noise amplitude ratio of 20:1 (refer to Supplementary Information for more information)^37,38. Thus, we expect 95% of the observed signal to be from the system and 5% from noise. As a result, clusters with predicted Boltzmann probabilities outside the 95% integrated probability density region contribute IR intensities indistinguishable from noise. The structures for each cluster size within this 95% cutoff form the low-energy ensemble. For our dataset, 40 unique structures of (CO)_m/Pd₁-Pd₂₀/CeO₂ meet the 95% cutoff Boltzmann criterion, a remarkably small number.
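
The Boltzmann cutoff can be illustrated with a short Python sketch. It assumes that each isomer of a given size is described by a single total free energy in eV relative to the most stable isomer, and that the ensemble is built by adding states in order of decreasing probability until the cumulative probability reaches the cutoff. The function names and the toy free energies are hypothetical; the exact expressions used in this work are given in the Methods.

```python
import math

K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def cutoff_from_snr(snr):
    """Map a power SNR to a probability cutoff via the amplitude ratio.

    An SNR of 400 gives an amplitude ratio of 20:1, i.e. roughly 5% of the
    observed amplitude is attributed to noise and 95% to signal.
    """
    return 1.0 - 1.0 / math.sqrt(snr)


def boltzmann_probabilities(free_energies_ev, temperature_k):
    """Normalized Boltzmann weights for a list of free energies (eV)."""
    beta = 1.0 / (K_B_EV_PER_K * temperature_k)
    g_min = min(free_energies_ev)  # shift for numerical stability
    weights = [math.exp(-beta * (g - g_min)) for g in free_energies_ev]
    total = sum(weights)
    return [w / total for w in weights]


def low_energy_ensemble(free_energies_ev, temperature_k, cutoff=0.95):
    """Indices of the most probable states covering `cutoff` probability."""
    probs = boltzmann_probabilities(free_energies_ev, temperature_k)
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    selected, cumulative = [], 0.0
    for i in order:
        selected.append(i)
        cumulative += probs[i]
        if cumulative >= cutoff:
            break
    return selected, probs


# Hypothetical relative free energies (eV) of four isomers of one size.
toy_free_energies = [0.00, 0.03, 0.10, 0.25]
selected, probs = low_energy_ensemble(
    toy_free_energies, 323.0, cutoff=cutoff_from_snr(400.0)
)
print(selected, [round(p, 4) for p in probs])
```

In this toy case, the two lowest-energy isomers already carry more than 95% of the Boltzmann probability at 323 K, so the remaining two would be excluded from the ensemble.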

We also perform an analogous Boltzmann equilibrium analysis on the bare Pd_n/CeO₂ clusters at an identical 323 K to determine the effect that CO has on the number of thermodynamically accessible states. Figure 3 shows the ensemble and Boltzmann probability densities for bare Pd₂₀/CeO₂ (for Pd₅-Pd₁₉/CeO₂, refer to Fig. S2). We find that the number of discrete states that meet the 95% cutoff Boltzmann criterion doubles, from 4 to 8 states, between the saturated CO/Pd₂₀/CeO₂ system as seen in Fig. 2c and the bare system, respectively. For the entire dataset, we find that 262 unique structures of Pd₁-Pd₂₀/CeO₂ meet the 95% cutoff Boltzmann criterion, almost an order of magnitude more than for (CO)_m/Pd₁-Pd₂₀/CeO₂. This suggests that the introduction of CO to the system leads to a thermodynamic confinement effect, limiting the number of thermodynamically accessible states at low temperatures.

**Fig. 3: Ensemble and Boltzmann probabilities of bare Pd₂₀/CeO₂ at 323 K.**

Primary spectra generation

We perform first-principles computations for the 40 configurations of (CO)_m/Pd_n/CeO₂ that make up the low-energy ensemble directly using DFT to construct the primary spectra. We describe the details of generating primary spectra from DFT-computed IR frequencies and intensities in the Methods section. Primary spectra are analogous to pure component spectra in gas-phase IR and are the spectroscopic signature of a catalyst sample composed of a single supported cluster isomer exposed to CO. The primary spectra cannot easily be obtained experimentally due to the difficulty of synthesizing homogeneous supported clusters with atomic precision. We note that DFT-computed frequencies are often systematically underestimated, and as a result, it is customary to fit linear scaling factors to experimental data to account for these errors. Linear frequency scaling factors are used for our computed primary spectra, which are optimized during the fitting procedure. Each cluster can be thought of as having a distribution of spectroscopic signatures stemming from the uncertainty of DFT, from which the best spectrum is chosen during the fitting procedure. Scaling factors computed for adsorbates on well-defined single crystals are used as informative priors to regularize and prevent overfitting. These calculated factors serve as reasonable estimates for the error in DFT frequencies. More information on the construction of linear scaling factors can be found in the Supplementary Information.
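
To make the idea of a primary spectrum concrete, the sketch below builds a broadened spectrum from a list of computed frequencies and intensities after applying a linear frequency correction. Several choices here are assumptions for illustration only and are not confirmed by the text: the Gaussian line shape, the fixed width of 10 cm⁻¹, the slope-plus-intercept form of the scaling, and all function and parameter names. The procedure actually used is described in the Methods.

```python
import math


def primary_spectrum(frequencies, intensities, grid,
                     scale=1.0, offset=0.0, width=10.0):
    """Broadened, peak-normalized spectrum on a wavenumber grid.

    frequencies, intensities: computed normal-mode data (cm^-1, arbitrary units)
    scale, offset: assumed linear frequency correction, nu -> scale*nu + offset
    width: assumed Gaussian standard deviation in cm^-1
    """
    spectrum = [0.0] * len(grid)
    for freq, inten in zip(frequencies, intensities):
        center = scale * freq + offset
        for k, nu in enumerate(grid):
            spectrum[k] += inten * math.exp(-0.5 * ((nu - center) / width) ** 2)
    peak = max(spectrum)
    # Normalize so that spectra of different clusters are comparable.
    return [s / peak for s in spectrum] if peak > 0 else spectrum


# Hypothetical single C-O stretch mode at 2000 cm^-1, scaled up by 2%.
grid = list(range(1800, 2201, 2))  # 2 cm^-1 discretization
spectrum = primary_spectrum([2000.0], [1.0], grid, scale=1.02)
peak_position = grid[spectrum.index(max(spectrum))]
print(peak_position)
```

Treating `scale` and `offset` as fitted parameters with informative priors, rather than fixed constants, is what lets the deconvolution absorb systematic frequency errors.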

Figure 4 shows primary spectra at differential CO coverage (corresponding to 1 CO per cluster) and saturated CO coverage for various Pd cluster sizes. Note that the intensities of the metal-carbon stretch region (<1000 cm⁻¹) are magnified tenfold for visibility. At differential coverages (Fig. 4a, b), it is difficult to distinguish the spectroscopic signatures of Pd₁ and Pd₁₀ as there are relatively few peaks observed. Discerning cluster sizes at low coverages carries high uncertainty, as many combinations of single high-intensity peak spectra can form an observed IR spectrum. However, at saturated CO coverage (Fig. 4c, d), multiple high-intensity peaks couple as the surface contains more adsorbates, leading to a more discernible spectroscopic signature. It is interesting that the dominant peak in the spectra of Fig. 4d (corresponding to Pd₂₀/CeO₂), centered in the ~1650 cm⁻¹ regime, is red shifted (i.e., shifted to lower wavenumbers) when compared to the spectra in Fig. 4c (corresponding to Pd₁₀). This can be rationalized by CO preferentially adsorbing on lower wavenumber bridge and threefold sites on the Pd₂₀/CeO₂ cluster, while predominantly occupying higher wavenumber atop and bridge sites on Pd₁₀/CeO₂. The preferential adsorption on threefold and bridge sites on larger supported Pd clusters has also been observed in the literature²⁸. Thus, we choose to operate in the saturated CO coverage regime for the remainder of our work due to the increase in the number of spectroscopic peaks compared to differential coverages.

**Fig. 4: Primary spectra of CO on various sizes of Pd/CeO₂ and CO coverages from DFT-computed frequencies and intensities.**

In the Supplementary Information, we elaborate further on the effects of isomer configuration for identical sizes and CO adsorption site-types on the generated primary spectra, at both differential and saturated coverages. At differential coverages, the frequency of the highest intensity peaks (i.e., C-O stretch frequencies) is almost entirely determined by the adsorption site type (i.e., atop, bridge, hollow), as shown in Fig. S3. This trend is observed at all cluster size regimes studied, and even extends to CO frequencies at the palladium nanoparticle and single-crystal regime³⁹. At saturated coverages, the spectroscopic signature of isomers of the same size exhibit large differences as the surface contains many more adsorbates than at differential coverages, and the adsorbate configurations vary greatly (Fig. S4). The ability to distinguish between different isomers further supports our decision to operate at the saturated CO coverage regime.

Synthetic spectra generation

To benchmark our deconvolution methodology, we construct synthetic spectra representative of heterogeneous systems composed of many different cluster sizes and isomers using our primary spectra. We take advantage of the fact that IR spectral intensities obey the Beer-Lambert Law and are linear with respect to the number of entities⁴⁰. We construct synthetic spectra by taking a direct vector sum of the desired primary spectra weighted with their respective fractional contributions. Figure 5 shows an example of complex spectra of equal fractions of supported monomeric, dimeric, and trimeric Pd clusters and their individual primary cluster spectra. Intensities are normalized to ignore the effects of metal loading (and, consequently, adsorbate loading). One can see differences in the spectra with varying nuclearity; such differences allow discriminating sizes and potentially isomers. Peak broadening is also noticeable where the spectra of different clusters overlap. The applicability of this surrogate model (vs. direct DFT-computation of arbitrary heterogeneous systems) depends on the following two assumptions: (1) adsorbates on different clusters are non-interacting and (2) interacting adsorbates on the same cluster are accounted for in the primary spectra. Assumption (1) is often fulfilled for supported single atoms and clusters as metal loadings are low (i.e., high dispersion). Assumption (2) is accounted for with direct DFT computations of clusters exposed to high coverages of adsorbates.

**Fig. 5: Synthetic spectra of a system containing equal fractions of supported Pd/CeO₂ monomers, dimers, and trimers saturated with CO.**

Spectra deconvolution via Bayesian inference (BI)

IR spectra deconvolution is traditionally difficult due to the linearly overlapping peaks of many potential candidates, each with a unique spectroscopic signature. Our Bayesian model leverages prior information of the characteristic spectral pattern and uncertainty of viable candidates for regularization to recognize overlapped signals. Expert knowledge is used to specify tighter and more informative prior distributions, which lead to narrower predicted distributions⁴¹ (refer to Methods section and Supplementary Information for more information on the specification of prior distributions). We model the IR spectrum, $\vec{y}$, as a vector sum of wavenumber discretized primary spectra, $\vec{x}_{i}$, weighted by their relative fraction, $c_{i}$, plus some noise, ɛ:

$$\vec{y}=\sum_{i=1}^{N} c_{i}\,\vec{x}_{i}+\varepsilon,\quad \varepsilon \sim \mathrm{N}\left(0,\ \sigma^{2}\left[\sum_{j=1}^{N} E_{j}\sum_{i=1}^{N} c_{i}\,\vec{x}_{i}\,e_{j}\right]^{2}\right) \tag{1}$$

The error term, ɛ, is entirely random and is intended to account for (1) background noise absent from the computational spectra, (2) DFT error in computed frequencies, and (3) spectral intensities for clusters/adsorbates not accounted for in the low-energy ensemble. We note that DFT-computed frequencies are usually systematically underestimated due to the infinite mass approximation, so this systematic error may not be entirely represented in the proposed mathematical form. Here, $E_{j}$ is the $(N \times N)$ matrix (where N is the number of primary spectra considered) with 1 in position $(j, j)$ and zeroes everywhere else, $e_{j}$ is the $(1 \times N)$ row vector with 1 in position $(1, j)$ and zeroes everywhere else, and σ is a scalar controlling the amount of noise in the spectra. The term $\sum_{j=1}^{N} E_{j}\sum_{i=1}^{N} c_{i}\,\vec{x}_{i}\,e_{j}$ leads to a diagonal matrix with the nonzero elements being the intensities of the reconstructed spectra, $\sum_{i=1}^{N} c_{i}\,\vec{x}_{i}$, at each observed frequency, without noise. This allows for a Gaussian error with standard deviation proportional (by a factor of σ) to the observed amplitude signal at each frequency to be accounted for. The scalar, σ, can assess the fit quality and is mathematically equivalent to the reciprocal of the amplitude ratio (refer to Supplementary Information for derivation). Ideally, σ should approach 0.05 as it mimics the 20:1 amplitude ratio for an observed SNR of 400 we utilize to construct our low-energy ensemble. Thus, σ allows us to infer the signal-to-noise ratio where the reconstructed spectra, $\sum_{i=1}^{N} c_{i}\,\vec{x}_{i}$, and observed spectra, $\vec{y}$, match. Note that this equation can also be used to compare any two arbitrary spectra, $\vec{y}_{1}$ and $\vec{y}_{2}$, and their equivalent SNRs. This is useful in analyzing spectra obtained from time-resolved FTIR, for example, to determine statistically significant differences over the temporal domain. The main objective of the Bayesian Inference methodology is the estimation of the posterior distributions of each $c_{i}$ by iterative sampling while accounting for uncertainty in the computed primary spectra and noise (σ) in the given experimental or computational spectra. The theory and sampling methodology behind Bayesian Inference are given in Methods and Supplementary Information.
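
The noise model of Eq. (1) can be sketched in a few lines of Python: the reconstructed spectrum is a weighted sum of primary spectra, and the Gaussian noise at each frequency has a standard deviation equal to σ times the noise-free intensity there. The function names, the toy spectra, and the small `floor` that keeps the variance positive where the model intensity is zero are all assumptions added for illustration. The conversion SNR = 1/σ² follows from the statement that σ is the reciprocal of the amplitude ratio (σ = 0.05 for an SNR of 400).

```python
import math
import random


def reconstruct(primary_spectra, fractions):
    """Weighted vector sum of primary spectra (noise-free model)."""
    n_points = len(primary_spectra[0])
    return [sum(c * x[k] for c, x in zip(fractions, primary_spectra))
            for k in range(n_points)]


def add_proportional_noise(signal, sigma, rng):
    """Gaussian noise whose std is sigma times the local signal amplitude."""
    return [s + rng.gauss(0.0, sigma * abs(s)) for s in signal]


def log_likelihood(observed, primary_spectra, fractions, sigma, floor=1e-6):
    """Gaussian log-likelihood with intensity-proportional standard deviation."""
    model = reconstruct(primary_spectra, fractions)
    total = 0.0
    for y, m in zip(observed, model):
        std = sigma * max(abs(m), floor)  # floor is an illustrative safeguard
        total += -0.5 * math.log(2.0 * math.pi * std ** 2)
        total += -0.5 * ((y - m) / std) ** 2
    return total


def sigma_to_snr(sigma):
    """SNR implied by sigma, taking sigma as 1 / (amplitude ratio)."""
    return 1.0 / sigma ** 2


# Hypothetical primary spectra on a three-point grid.
x1 = [1.0, 0.2, 0.1]
x2 = [0.1, 0.3, 1.0]
clean = reconstruct([x1, x2], [0.5, 0.5])
noisy = add_proportional_noise(clean, 0.05, random.Random(0))

ll_true = log_likelihood(clean, [x1, x2], [0.5, 0.5], 0.05)
ll_wrong = log_likelihood(clean, [x1, x2], [0.9, 0.1], 0.05)
print(sigma_to_snr(0.05), ll_true > ll_wrong)
```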

For visual simplicity, we demonstrate the deconvolution process on synthetic spectra containing equifractions of supported Pd₁, Pd₂, and Pd₃/CeO₂ saturated with CO as constructed using the surrogate model, like the one previously shown in Fig. 5. The only difference is that we introduce random Gaussian noise corresponding to an SNR of 400 (σ = 0.05) to mimic experimental spectra. Figure 6a shows the synthetic spectra, the reconstructed and deconvoluted spectra (where the means of the sampled posterior distributions are used as point estimates for the cluster fractions), and the predicted spectral noise. Note that the model does not a priori assume that Pd₄-Pd₂₀/CeO₂ is not present in the system. The intensities of the metal-carbon stretch region (<1000 cm⁻¹) are magnified tenfold for visibility.

**Fig. 6: Synthetic spectra deconvolution of a system containing equifractions of supported Pd₁, Pd₂, and Pd₃/CeO₂ saturated with CO.**

The most stable adsorption configurations for Pd₁/CeO₂ and Pd₂/CeO₂ each contain a single adsorbate on an atop site, so both primary spectra contain a single distinct peak. However, the C-O stretch frequencies are close together, and as a result, the broadened peaks overlap (Fig. 6a, blue). Without the simulated noise, a slight shoulder in the spectra can be observed to potentially distinguish the peaks (Fig. 5a), but with the conservative amount of noise introduced, heuristic assignment by the naked eye would be unable to discern them. Our framework also utilizes the information in the metal-carbon stretch region of 300–500 cm⁻¹ that is otherwise lost to further distinguish these overlapping peaks. The primary spectrum of Pd₃/CeO₂ contains a doublet, with only one peak in the vicinity of the Pd₁ and Pd₂/CeO₂ peaks, that is easily distinguished from the other peaks. The predicted spectral noise is uncorrelated as a function of frequency and exhibits random Gaussian-like behavior, which suggests that the deconvolution procedure has not overfit spectral peaks to noise.

Figure 6b, c show examples of trace plots and sampled posterior distributions for the noise term, σ, and the relative concentration of Pd₁, respectively. A trace plot shows the sampled values of a particular parameter as a function of the number of iterations and is a visual way to determine how well the sampling algorithm has converged to the true posterior distribution. In general, random scatter around the median value suggests that the sampling algorithm has converged. Note that the Bayesian inference sampling methodology is inherently stochastic, so a trace plot is useful for diagnostic purposes. Also shown in the figure are the sampled posterior distributions, and the corresponding means, medians, and 95% credible intervals (CI). The mean and median of the distribution coincide and are often used as point estimates when needed. The maximum a posteriori estimation (MAP), equivalent to the distribution mode, is also often used as a point estimate but may not be appropriate for distributions that are not unimodal⁴². In this example, the mean, median, and MAP coincide and can be used as point estimates for spectra deconvolution and reconstruction. The mean value of σ = 0.054 corresponds to an equivalent SNR of 350, which is in good agreement with the specified SNR of 400 of the original synthetic spectra.
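
The following toy example shows how a trace and the posterior summaries discussed above (mean, median, 95% credible interval) can be produced. It is only a sketch under several assumptions: a random-walk Metropolis sampler is used as a simple stand-in and is not claimed to be the sampler used in this work, only one fraction `c` is inferred (the second component has fraction 1 − c), σ is held fixed at 0.05 rather than sampled, the prior on `c` is flat on [0, 1], and the spectra are hypothetical noise-free values.

```python
import math
import random
import statistics


def log_posterior(c, observed, x1, x2, sigma):
    """Log posterior for y = c*x1 + (1-c)*x2 with noise std = sigma * model."""
    if not 0.0 <= c <= 1.0:
        return -math.inf  # flat prior restricted to [0, 1]
    total = 0.0
    for y, a, b in zip(observed, x1, x2):
        model = c * a + (1.0 - c) * b
        std = sigma * max(abs(model), 1e-6)
        total += -math.log(std) - 0.5 * ((y - model) / std) ** 2
    return total


def metropolis(observed, x1, x2, sigma, n_steps=5000, step=0.05, seed=0):
    """Random-walk Metropolis; returns the trace of sampled c values."""
    rng = random.Random(seed)
    current = 0.5
    current_lp = log_posterior(current, observed, x1, x2, sigma)
    trace = []
    for _ in range(n_steps):
        proposal = current + rng.gauss(0.0, step)
        proposal_lp = log_posterior(proposal, observed, x1, x2, sigma)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, proposal_lp - current_lp)):
            current, current_lp = proposal, proposal_lp
        trace.append(current)
    return trace


def summarize(trace, burn_in):
    """Mean, median, and equal-tailed 95% credible interval after burn-in."""
    kept = sorted(trace[burn_in:])
    n = len(kept)
    lo = kept[int(math.floor(0.025 * (n - 1)))]
    hi = kept[int(math.ceil(0.975 * (n - 1)))]
    return {"mean": statistics.fmean(kept),
            "median": statistics.median(kept),
            "ci95": (lo, hi)}


x1 = [1.0, 0.2, 0.1]
x2 = [0.1, 0.3, 1.0]
true_c = 0.3
observed = [true_c * a + (1.0 - true_c) * b for a, b in zip(x1, x2)]

trace = metropolis(observed, x1, x2, sigma=0.05)
summary = summarize(trace, burn_in=1000)
print(summary)
```

Plotting `trace` against the iteration index gives the trace plot; after the burn-in period, the samples should scatter randomly around the posterior median.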

Finally, Fig. 6d shows the means and 95% CIs for each species fraction. The true values of 0.33 for Pd₁, Pd₂, and Pd₃ all lie within the 95% CIs of each distribution. The model predicts almost no clusters that are larger than Pd₃ without having evidence of this a priori. The true value of 0 is statistically difficult to sample as that value is identically the prescribed lower bound of the sampled values of $c_{i}$, so it does not fall within the predicted 95% CI. Our framework can estimate a distribution of the predicted metal cluster sizes on the support, but lacks detailed structural information such as local metal dispersion (i.e., heterogeneity in the distribution of the metal on the support), preferred metal adsorption sites (e.g., formation of adsorbate islands), and support defects (e.g., existence of oxygen vacancies). These local interactions that deviate from our proposed linear model are accounted for by the error term in our model and cannot be directly interpreted. Due to the limitations of our model and experimental equipment resolution, determining local spatial information directly from IR spectra is outside our current capabilities and is the subject of future work.

We also demonstrate the efficacy and robustness of our deconvolution method in the Supplementary Information over many synthetic spectra with randomly generated cluster fractions and varying amounts of simulated noise. Noise is simulated with signal-to-noise ratios ranging from infinity (i.e., infinitesimally small noise, the limit as σ approaches 0) to 25 (i.e., the lowest SNR of FTIR receivers reported in literature, the limit as σ approaches 0.20^37,38) by uniformly sampling values of 0 < σ < 0.20. Note that both SNR bounds are unrealistic for experimental spectra with modern-day FTIR receivers and purely serve as benchmarks. A parity plot comparing MAPs of the predicted cluster fraction distribution versus true values of 100 synthetic spectra is shown in Fig. S5. We obtain a mean absolute error (MAE) of 0.049, but more importantly, the true cluster fractions lie within the 95% CI for all 100 spectra. Surprisingly, the prediction error is not correlated with σ, the amount of noise in the system, for the range of values studied. This is a good indication that the model is robust enough to avoid overfitting spectra to noise.

Experimental spectra deconvolution

Detailed experimental surface and nanocluster characterization is difficult to achieve for working materials and is often limited to simpler ordered adsorbate overlayers on single crystals⁴³. We test our spectra deconvolution methodology on literature-reported IR spectra of 1 wt% Pd on CeO₂ nanorods saturated with CO at 323 K in which a tandem of experimental characterization techniques was used⁴⁴. The nanorods are composed predominantly of the (111) facet of our primary spectra dataset. The published spectra provided enough detail in the C-O stretch region to be digitized, so we only utilize the frequencies and corresponding intensities in the 1825–2400 cm⁻¹ range, with a discretization of 2 cm⁻¹. Spectroscopic information in the metal-carbon region can be helpful for overlapping peak discrimination, as shown in the previous synthetic spectra example, but is difficult to obtain in practice.
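As a minimal sketch of the preprocessing described above, the following Python function resamples digitized (wavenumber, intensity) points onto a uniform 2 cm⁻¹ grid restricted to the 1825–2400 cm⁻¹ window. The function name, the use of linear interpolation, and the optional scaling to unit maximum are assumptions made for illustration; the source does not specify how the digitized points were interpolated or normalized.

```python
import numpy as np

def resample_spectrum(wavenumbers, intensities, lo=1825.0, hi=2400.0,
                      step=2.0, normalize=True):
    """Linearly interpolate digitized points onto a uniform wavenumber grid.

    Assumption: the digitized points cover [lo, hi]; np.interp holds the end
    values constant outside the digitized range.
    """
    w = np.asarray(wavenumbers, dtype=float)
    a = np.asarray(intensities, dtype=float)
    order = np.argsort(w)            # np.interp needs increasing x values
    w, a = w[order], a[order]
    grid = np.arange(lo, hi + step / 2.0, step)
    resampled = np.interp(grid, w, a)
    if normalize:
        peak = resampled.max()
        if peak > 0.0:
            resampled = resampled / peak   # scale to unit maximum (assumption)
    return grid, resampled

if __name__ == "__main__":
    # Hypothetical digitized points, deliberately unsorted.
    w = [2450.0, 2150.0, 1800.0, 2100.0]
    a = [0.02, 0.60, 0.01, 1.30]
    grid, y = resample_spectrum(w, a)
    print(len(grid), grid[0], grid[-1])
```

With the default arguments the grid runs from 1825 to 2399 cm⁻¹ because the window width is not an integer multiple of the step.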

Figure 7a shows the experimental spectra and the reconstructed spectra using the means of the posterior distribution as the point estimates for the relative species concentrations. There are no spectroscopic signatures in our dataset that exceed 2200 cm⁻¹, so we cannot account for the broad peak centered around 2350 cm⁻¹. Spezzati et al. assigned this peak to CO₂ rather than CO/Pd/CeO₂, which agrees with our procedure. Our reconstructed spectra account for the major peaks at approximately 2100 and 2150 cm⁻¹. Figure 7b shows the trace and posterior distribution for σ, the error parameter that accounts for noise. Our reconstructed spectra have a mean σ value of 0.14 compared to the ideal value of 0.05. This suggests that the reconstructed and experimental spectra are in good agreement for an SNR of 60. Figure 7c shows the trace and posterior distribution for the Pd₁ fractions and suggests the presence of single atoms, with a mean of 0.198. Finally, Fig. 7d shows the means and 95% credible intervals (CI) of the predicted fractions of Pd₁, as well as two aggregated bins of Pd₂-Pd₅ and Pd₆-Pd₂₀. These bins were chosen to demarcate monolayer from bilayer (or larger) clusters in our dataset. Our results agree with Spezzati et al.’s TEM imaging, suggesting that Pd is highly dispersed (either as single atoms or monolayer clusters) on the support. However, we suggest that close to half of the clusters may reconfigure to larger 3-dimensional clusters (Pd₆-Pd₂₀) upon exposure to CO.
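The aggregation into size bins can be illustrated with a short Python sketch. It assumes the posterior samples are available as an array with one column per species and a known cluster size for each column. It also uses an equal-tailed percentile interval, which may differ from the interval definition used in this work. The names `BINS` and `summarize_bins` are hypothetical.

```python
import numpy as np

# Size bins from the text: single atoms, monolayer clusters, larger clusters.
BINS = {"Pd1": (1, 1), "Pd2-Pd5": (2, 5), "Pd6-Pd20": (6, 20)}

def summarize_bins(samples, sizes, bins=None, level=0.95):
    """Sum per-species fractions within each size bin, sample by sample,
    then report the mean and an equal-tailed credible interval per bin.

    samples: array of shape (n_samples, n_species) of posterior draws
    sizes:   length n_species, number of Pd atoms for each column
    """
    bins = BINS if bins is None else bins
    samples = np.asarray(samples, dtype=float)
    sizes = np.asarray(sizes)
    tail = 100.0 * (1.0 - level) / 2.0
    summary = {}
    for name, (lo, hi) in bins.items():
        mask = (sizes >= lo) & (sizes <= hi)
        binned = samples[:, mask].sum(axis=1)   # one binned fraction per draw
        low, high = np.percentile(binned, [tail, 100.0 - tail])
        summary[name] = (float(binned.mean()), float(low), float(high))
    return summary

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sizes = np.arange(1, 21)
    fake_posterior = rng.dirichlet(np.ones(20), size=2000)  # placeholder draws
    for name, (mean, low, high) in summarize_bins(fake_posterior, sizes).items():
        print(f"{name}: mean={mean:.3f}, 95% CI=({low:.3f}, {high:.3f})")
```

Summing within each draw before taking percentiles keeps the correlation between species in the binned interval, which would be lost if per-species intervals were added together.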

**Fig. 7: Experimental spectra deconvolution of 1 wt% Pd/CeO₂ system saturated with CO at 323 K.**

We note that the oxidation state of Pd is uncertain and, as a result, the clusters may not be entirely metallic. However, there is significant evidence (by the authors and in literature) that small PdO clusters as well as single atoms can be reduced by CO at low temperatures²¹, so we assume that Pd is metallic. Comparison to the experimental spectra provides further evidence for this. We also note that we model a defect-free CeO₂, while the extent of reduction of the support of the sample is unknown due to limited characterization. The effect of oxygen vacancies on IR spectra is undoubtedly an important topic for future research.

We also benchmarked our methodology on a Pd/CeO₂ system with higher loadings (5 wt%) reported by Binet et al.⁴⁵. At high loadings, we do not expect Pd to exist as single atoms or dimers/trimers due to the high probability of sintering. The catalyst is predominantly composed of (100) and (111) facets of CeO₂, so part of the spectra may not be accounted for in our model. We note that the sample was reduced at 423 K in H₂, but the authors were able to deduce, via methanol and TCNE adsorption, that there was no observable reduction of the support. Figure 8a shows the experimental and reconstructed spectra. The reconstructed spectra account for the major peak at ~1975 cm⁻¹ and general spectral intensities between 1300–1900 cm⁻¹. Figure 8b shows the trace and posterior distribution for σ with a mean of 0.11 compared to the ideal value of 0.05. This suggests that the reconstructed and experimental spectra are in good agreement for an SNR of 80. The reconstructed spectra account for a large portion of the experimental spectra, suggesting that the support may be composed mainly of CeO₂(111), that the (111) facet may stabilize more Pd, or that the spectroscopic signatures on both facets are similar. We did not pursue this point further, but it is worth exploring in future work. The trace and posterior distribution of Pd₁ (Fig. 8c) show little to no evidence for single atoms. Despite the spectral intensities near 2050 cm⁻¹ (the calculated frequency of the C-O stretch of CO/Pd₁; see Fig. 4a for primary spectra) in the experimental spectra, the deconvolution process does not support the existence of single atoms. Figure 8d shows the mean and 95% credible intervals for Pd₁, Pd₂-Pd₅, and Pd₆-Pd₂₀. Once again, the deconvolution procedure finds little evidence for monolayer-supported clusters of less than 6 atoms. The concentrations predicted directly from the spectra indicate that most of the Pd atoms at high loadings exist as large multilayer nanoparticles.

**Fig. 8: Experimental spectra deconvolution of 5 wt% Pd/CeO₂ system saturated with CO at 323 K.**

Deducing the structure of heterogeneous single-atoms and subnanometer cluster catalysts has been a challenge. Surface spectroscopy, like IR, is sensitive to the sites exposed, but the interpretation of experimental spectra is challenging due to the inhomogeneity of real-world materials. The combinatorial nature of cluster shapes and sites, the DFT computational cost, and the lack of experimental methods with atomic resolution impede detailed characterization. In this work, we introduce a first principles-driven computational framework to characterize supported single-atoms and subnanometer clusters exposed to adsorbates directly from IR spectroscopic data, inspired by the deconvolution of IR spectra in the gas phase. We predict a low-energy ensemble of viable structures to reduce the combinatorial complexity of spectra deconvolution. We utilize calculations of high-coverage adsorbate, low-energy structures to generate single-cluster primary spectra. We use state-of-the-art UHV single-crystal experiments as ground truths to correct for errors associated with DFT-computed frequencies. Finally, we perform peak deconvolution of synthetic and experimental spectra using Bayesian Inference to characterize and interpret IR spectra and derive a criterion for determining the equivalence of modeled and observed spectra using the signal-to-noise ratio. We determine cluster size distributions from computational and experimental spectra while accounting for spectral noise and uncertainties. The deconvolution procedure discriminates overlapping peaks and discerns single atoms from small clusters and large nanoparticles with results consistent with other experimental characterization techniques. Our methodology allows deduction of cluster sizes and shapes from experimental spectra without performing an unrealistic number of expensive quantum calculations. Applications in real-world materials will require an extension to many different supported facets. The general methodology presented will only improve as more accurate computational data becomes available.

Methods

Adsorbate probe molecule selection

IR spectroscopy requires the selection of an appropriate probe molecule. Carbon monoxide is extensively used due to its well-defined experimental peaks⁴⁶. Its distinctive C-O stretch frequencies depend highly on the adsorbate site-type and local metal coordination environment and can be accurately calculated⁴⁷,⁴⁸,⁴⁹. Carbon monoxide also does not strongly adsorb on CeO₂(111); computed adsorption energies are on the order of −0.2 eV, while adsorption energies on supported Pd clusters are on the order of −2.0 eV⁵⁰. This makes CO an ideal probe for discriminating clusters based on their corresponding spectroscopic signature.