Heterogeneous catalyst design by generative adversarial network and first-principles based microkinetics

Ishikawa, Atsushi

doi:10.1038/s41598-022-15586-9

Download PDF

Article
Open access
Published: 08 July 2022

Heterogeneous catalyst design by generative adversarial network and first-principles based microkinetics

Atsushi Ishikawa¹

Scientific Reports volume 12, Article number: 11657 (2022) Cite this article

2769 Accesses
3 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Microkinetic analysis based on density functional theory (DFT) was combined with a generative adversarial network (GAN) to enable the artificial proposal of heterogeneous catalysts based on the DFT-calculated dataset. The approach was applied to the NH₃ formation reaction on Rh−Ru alloy surfaces as an example. The NH₃ formation turnover frequency (TOF) was calculated by DFT-based microkinetics. Six elementary reactions, namely, N₂ dissociation, H₂ dissociation, NH_x (x = 1–3) formation, and NH₃ desorption, were explicitly considered, and their reaction energies were evaluated by DFT calculations. Based on the TOF values and atomic compositions, new alloy surfaces were generated using the GAN. This approach successfully generated the surfaces that were not included in the initial dataset but exhibited higher TOF values. The N₂ dissociation reaction was more exothermic for the generated surfaces, leading to higher TOF. The present study demonstrates that the automatic improvement of catalyst materials is possible using DFT calculations and GAN sample generation.

Decoding reactive structures in dilute alloy catalysts

Article Open access 11 February 2022

Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials

Article Open access 26 June 2020

Constrained crystals deep convolutional generative adversarial network for the inverse design of crystal structures

Article Open access 10 May 2021

Introduction

Catalysts play a crucial role in energy and environmental science, and its performance is often evaluated by the reaction rate or turnover frequency (TOF). Researchers have devoted great efforts to find new and more active catalysts. It is well known that the reaction rate is governed by several factors: the activation energies, number of active sites (or the catalyst surface area), sticking coefficients on the surface, etc. Unfortunately, accurate measurement of such quantities requires both special expertise and considerable effort; therefore, detailed kinetic profiles have only been clarified for limited cases.

In the last few decades, theoretical simulation and computational methods have become feasible alternatives for the evaluation of reaction kinetics. For example, ab initio or first-principles calculation are popular because they provide atomic-scale information without requiring experimental data. For example, these approaches can be helpful in identifying the active site on the catalyst surface, which is a fundamental issue in catalysis. Owing to recent developments in algorithms and computational resources, such atomic-scale simulations, especially those using density functional theory (DFT), are being widely employed.

While computational methods are useful for studying given real or proposed materials, they cannot automatically suggest new ones. Recently, the application of machine learning to computational chemistry was found to afford a computation-based material proposal¹. Several promising examples have been reported in catalysis^2,3,4,5. Among the several possibilities of combining computational chemistry with machine learning, the present author considers the so-called generative model a particularly important machine learning algorithm because it enables “extrapolative” proposal in the material or configuration space; hence, the search is not confined within a given dataset. As an example of a generative model, the generative adversarial network (GAN) is widely used, especially in artificial image generation^6,7. Several groups have reported the application of a GAN to material science; Kim et al. used it to discover new zeolite systems⁸, and several groups also used it to artificially generate crystal structures with desirable properties^9,10.

A catalytic reaction often involves several species and elementary reactions. Microkinetic analysis explicitly treats a set of elementary reactions. Therefore, it is often more accurate than the kinetic analysis based on the global rate expression¹¹. Currently, DFT-based microkinetics is widely used in catalysis research because it is a powerful tool that allows the calculation of kinetic information, such as the reaction energy and activation barrier, can be calculated by DFT^{11,12,13,14,15}. Considering this, a combination of DFT calculations, microkinetics, and catalytic material generation from generative models could be a promising approach for the rational design of catalysts.

The present paper describes a new approach based on DFT calculations and sample generation by a GAN for heterogeneous catalyst searching. The generation procedure is extrapolative, because the proposed catalytic material need not be included in the initially prepared dataset. Here, the GAN part aims to generate materials with a high TOF for the target catalytic reaction, where the TOF is calculated using DFT-based microkinetics. The ammonia (NH₃) synthesis reaction, which is also known as the Haber−Bosch process, is considered in this study as a representative heterogeneous catalytic reaction^16,17,18. Below, details of the DFT-GAN procedure are described in the Methods section, and its performance is discussed in the Results and Discussion section.

Methods

Models and details of DFT calculation

For the target model, the catalytic synthesis of NH₃ was assumed to occur on a Rh−Ru bimetallic alloy surface. The Ru stepped surface was constructed first, and the bimetallic alloys were constructed by replacing Ru atoms with Rh atoms. Stepped metal surfaces were considered because NH₃ formation is known to occur on these types of surfaces^17,19. The positions of the Ru atoms replaced by Rh atoms were randomly selected in the original dataset, while during the DFT-GAN iterations the positions of the Rh atoms were determined by the GAN; the details will be discussed later. The original dataset included 100 metal surfaces. The metal surfaces were modeled by repeated slabs, and the stepped surface was modeled by removing half of the atoms in the top layer. Each slab consisted of a (6 × 4) supercell in the lateral direction, with four atomic layers in the z-direction. Consequently, 84 atoms were included in the model. The typical structure of the surface model is shown in Fig. 1. The adsorption positions of N, H, NH, and NH₂ were assumed to be the fcc three-fold hollow sites, and atop adsorption was assumed for NH₂ and NH₃, as these positions are the most stable adsorption sites on the Ru-stepped surface²⁰. These adsorption sites are also shown in Fig. 1.

The BEEF-vdW exchange–correlation functional was used in the DFT calculations because it provides an accurate description of the van der Waals interaction²¹. The core electrons were represented by the projector-augmented wave (PAW) potentials²², and the valence electrons were expanded with plane waves up to a cutoff energy of 400 eV. Spin polarization was included throughout, and no symmetry constraint was imposed on the geometries. A Gaussian scheme was used in the smearing of the electron occupation close to the Fermi level. The convergence thresholds for the electronic and geometry optimizations were set to 1.0 × 10⁻⁴ eV and 1.0 × 10⁻¹ eV/Å in energy and force, respectively.

The geometries of the surfaces generated by the GAN were optimized using DFT, with the same computational condition as the initial dataset. A vacuum layer of ~ 12 Å was placed between the slabs, and dipole correction was applied in the z-direction to cancel the artificial interactions between slabs. As the substitution of Rh for Ru causes some distortion in the unit cell, the unit cell was optimized for all surfaces; the BEEF-vdW functional was also used for this purpose, and the convergence threshold for the unit cell optimization was set to 1.0 × 10⁻⁴ eV in energy change. An orthorhombic unit cell was used in all cases. All DFT calculations were performed using the Vienna ab initio simulation package (VASP), version 5.4.4^23,24.

Elementary reactions and rate of NH₃ formation

The overall reaction for the synthesis of NH₃ is represented by:

$${\text{N}}_{{2}} \,\, + \,\,3{\text{H}}_{{2}} \to 2{\text{NH}}_{{3}}$$

(1)

which is generally considered to include the following six elementary reactions²⁵.

$${\text{N}}_{{2}} \,\, + \,\,2{*} \to 2{\text{N*}}$$

(2)

$${\text{H}}_{{2}} \,\, + \,\,2{*} \rightleftharpoons 2{\text{H*}}$$

(3)

$${\text{N*}}\,\, + \,\,{\text{H*}} \rightleftarrows {\text{NH*}}\,\,{ + }\,\,{*}$$

(4)

$${\text{NH*}}\,\, + \,\,{\text{H*}} \rightleftarrows {\text{NH}}_{{2}} {*}\,\,{ + }\,\,{*}$$

(5)

$${\text{NH}}_{{2}} {*}\,\, + \,\,{\text{H*}} \rightleftarrows {\text{NH}}_{{3}} {*}\,\,{ + }\,\,{*}$$

(6)

$${\text{NH}}_{{3}} {*} \rightleftarrows {\text{NH}}_{{3}} \,\,{ + }\,\,{*}$$

(7)

where an asterisk (*) denotes a vacant active site on the metal surface, and the species with asterisks are the adsorbed species. The reaction energies in Eqs. (2)–(7) were determined from the total energy calculated by DFT, i.e., the sum of the electronic and nuclear repulsion energies.

Previous experimental and theoretical research suggests that the rate-determining step (RDS) is Eq. (2), namely, the dissociative adsorption of N₂^17,26. Based on this assumption, the present study employed the simple microkinetic model, called the Langmuir−Hinshelwood−Hougen−Watson kinetic model, wherein the RDS was assumed to be Eq. (2) irrespective of temperature and pressure changes. In this case, the fractional surface coverage of the adsorbed species i (θ_i) is written as

$$\begin{gathered} \theta_{{\text{N}}} = \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{3}^{3/2} p_{{{\text{H}}_{2} }}^{3/2} K_{4} K_{5} K_{6} K_{7} }}\theta_{{{\text{vac}}}} \hfill \\ \theta_{{\text{H}}} = \sqrt {K_{3} p_{{{\text{H}}_{{2}} }} } \theta_{{{\text{vac}}}} \hfill \\ \theta_{{{\text{NH}}}} = \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{3} p_{{{\text{H}}_{2} }} K_{5} K_{6} K_{7} }}\theta_{{{\text{vac}}}} \hfill \\ \theta_{{{\text{NH}}_{{2}} }} = \frac{{p_{{{\text{NH}}_{{3}} }} }}{{\sqrt {K_{3} p_{{{\text{H}}_{2} }} } K_{6} K_{7} }}\theta_{{{\text{vac}}}} \hfill \\ \theta_{{{\text{NH}}_{3} }} = \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{7} }}\theta_{{{\text{vac}}}} \hfill \\ \theta_{{{\text{vac}}}} = \frac{1}{{1 + \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{3}^{3/2} p_{{{\text{H}}_{2} }}^{3/2} K_{4} K_{5} K_{6} K_{7} }} + \sqrt {K_{3} p_{{{\text{H}}_{{2}} }} } + \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{3} p_{{{\text{H}}_{2} }} K_{5} K_{6} K_{7} }} + \frac{{p_{{{\text{NH}}_{{3}} }} }}{{\sqrt {K_{3} p_{{{\text{H}}_{2} }} } K_{6} K_{7} }} + \frac{{p_{{{\text{NH}}_{{3}} }} }}{{K_{7} }}}} \hfill \\ \end{gathered}$$

(8)

where p_i is the partial pressure of NH₃ or H₂, and K_i is the equilibrium constant of Eqs. (2)–(7)²⁵. Thus, the total reaction rate is written as

$$R = k \cdot p_{{{\text{N}}_{{2}} }} \theta_{{{\text{vac}}}} \left( {1 - \frac{1}{{K_{2} K_{3}^{3} K_{4}^{2} K_{5}^{2} K_{6}^{2} K_{7}^{2} }}\frac{{p_{{{\text{NH}}_{{3}} }}^{2} }}{{p_{{{\text{N}}_{{2}} }} p_{{{\text{H}}_{{2}} }}^{3} }}} \right)$$

(9)

where k is the rate constant of the RDS (Eq. (2)) calculated using the Arrhenius equation:

$$k = A\exp \left( { - \frac{{E_{a} }}{{R_{{{\text{gas}}}} T}}} \right).$$

(10)

E_a is the activation energy of Eq. (2), A is the pre-exponential factor with the value of 0.241 s⁻¹ given by Logadottir et al.²⁰, R_gas is the universal gas constant, and T is the temperature. For the zero-point energies and thermal correction terms, experimental values from the NIST webbook were used²⁷.

Although it is possible to evaluate E_a with DFT by locating the transition state, this process requires considerable computational effort. Instead, this study evaluated E_a using the linear free energy relationship (or the Brønsted−Evans−Polanyi principle), in which E_a is expressed as a linear function of ΔE as

$$E_{a} = \alpha \Delta E + \beta .$$

(11)

The values of α = 0.87 and β = 1.34 for the stepped metal surface were taken from the literature²⁸. The calculation was carried out at T = 700 K and a total pressure of 100 bar. Stoichiometric quantities of N₂ and H₂ were used in the inlet gas, i.e., p_N2 : p_H2 = 1 : 3 and the conversion of N₂ was set to 10%. It has been previously demonstrated by Honkala et al. that the adsorbate−adsorbate interaction has some impact on the kinetics on the NH₃ formation¹⁷. However, these interactions are not considered in the present system, and hence the TOF values should be considered as representing the semi-quantitative level.

Details of the GAN

Similar to the original and several extended versions of the GAN, the entire system here consists of the generator (G) and discriminator (D) networks, the structures of which are shown in Fig. 2. In the present case, fake samples with a high NH₃ formation rate are desirable, and they can be generated using G. Therefore, the conditional GAN (CGAN) was applied because it enables the generation of fake samples corresponding to a given label⁷. The metal surface was encoded by a one-dimensional string array consisting of Rh or Ru. Then the string is converted to the one-dimensional vector of either 0 or 1 value. This vector and the DFT-calculated TOF value are used as the descriptor and target values, respectively. These alloys and their DFT-calculated TOF values were used together for learning. The mean-squared error was used for the loss functions of the D and G parts²⁹. Because the fake samples are expressed as a one-dimensional vector of the transition metal elements and the initial atomic positions, DFT calculations can be carried out for these samples. This also implies that their NH₃ formation rates could be evaluated with the same accuracy as the original dataset. The DFT calculation results for the generated samples were added to the original dataset, and the augmented dataset was used for the iterative training of the GAN. The specific steps are as follows:

(1)
DFT calculations are performed to obtain the E_a and ΔE values for the elementary reactions of Eqs. (2)–(7). This is done for all samples in the dataset.
(2)
The TOF for NH₃ formation is calculated according to Eq. (9) using the DFT-calculated ΔE and E_a values.
(3)
The calculated samples are sorted according to the DFT values and are grouped into several classes (n) according to the NH₃ formation rate. Here, the number of groups is set to five, and the group with the highest TOF is labeled as n = 1.
(4)
Networks D and G are trained with the dataset using the backpropagation scheme.
(5)
G generates fake samples for n = 1. Any generated surface that overlaps with the existing sample set is removed.
(6)
DFT calculations are performed for the newly generated samples. The results are added to the dataset for use in the subsequent iteration.

It should be noted that the size of the dataset increases with increasing number of iterations. This feature is favorable in terms of training neural networks, as a larger number of samples can be used in the training. Several studies have shown that such iterative training of the GAN is effective^30,31.

The present form of the target function only considered the reactivity. The stability of the surface is not taken into account, although it could be important for practical applications. The surface stability may be evaluated using the DFT method from the surface energy or bulk energy. Because there is no limitation in the target function for the present approach, it is possible to employ the surface stability or a mixed function of the stability and the TOF as the target function. The detailed examination of such a target function and its effect on the generated samples would be a suitable future study, however, have not been considered in the current study.

When training D and G, the loss function was set to the mean-squared error in both cases, and the ADAM optimizer was used. The learning rate was set to 1.0 × 10⁻³, and the parameters β₁ and β₂ were set to 0.5 and 0.999, respectively. The dropout rate was set to 0.3, and the minibatch size was set to 20% of the sample size for each iteration. The maximum number of the training processes (i.e., the epoch) was set to 2000.

The Python library atomic simulation environment (ASE) was used to construct the model and perform the DFT calculations³². The GAN part was calculated using PyTorch version 1.8. These Python codes are freely available on the author’s GitHub page³³.

Results and discussion

Initially, 100 bimetallic alloy surfaces were generated by randomly replacing Ru atoms with Rh atoms on a surface. DFT calculations were performed on these samples to obtain the TOF values. In the following, this dataset is referred to as the “original dataset.” Then, the iterative GAN procedure described above was applied. Five iterations were performed, meaning that four generations of new samples were created in addition to the original dataset.

Figure 3a plots the TOFs of the metal surfaces on a logarithmic scale, and the surfaces are sorted in descending order of TOF. In the original dataset (iter = 0), the metal surfaces have a wide range of TOF values ranging from 1.0 × 10⁻⁴ (Rh₈Ru₇₆) to 2.3 × 10⁻¹⁹ site⁻¹·s⁻¹ (Rh₅₀Ru₃₄). This indicates a strong dependence of the TOF on the metal surface composition. The new metal surfaces generated by the first to fifth iterations of DFT-GAN (iter = 1–5) are also depicted in the figure. It can be seen that the generated surfaces tend to have relatively higher TOF values than those of the original dataset. At iter = 3, Rh₄Ru₈₀ is generated, and it has a TOF value of 3.1 × 10⁻⁴ site⁻¹·s⁻¹; this value is higher than the maximum TOF in the original dataset. At iter = 5, the optimal TOF value (1.1 × 10⁻³ site⁻¹·s⁻¹) is obtained with Rh₈Ru₇₆; this TOF value is more than ten times larger than the highest value in the original dataset. These findings show that the GAN successfully generated a metal surface with high catalytic performance in an extrapolative manner. The TOF values at each iteration are summarized in the so-called violin plot in Fig. 3b. The violin-shaped curves show the probability density of the TOF values, and the boxes inside the individual curves indicate the quartiles. The plot shows that the original dataset has widely distributed TOF values. The TOF distributions of the generated surfaces (iter = 1–5) are more skewed toward the high-TOF region. This trend clearly shows that the GAN is much more efficient than the random sampling, to obtain the metal surface with a high NH₃ formation rate.

The property of the generated surface agrees with the existing chemical or physical insights. It is widely known that NH₃ formation is much faster on a Ru surface than on a Rh surface³⁴. In other words, Ru occupies a higher position in the activity volcano plot than Rh. In this study, the surfaces generated by the GAN tend to have a high proportion of Ru; the full dataset can be found in the author's GitHub repository. These results show that the neural network captures the experimental tendency of transition metal elements by learning from the TOF of NH₃ formation calculated by the DFT, since the alloying more Ru into the Rh surface leads a higher position in the volcano plot.

Further analysis was carried out to understand the atomic configuration of the GAN-generated surfaces. Figure 4 shows the distributions of the Ru and Rh atoms for all the metal surfaces. In the initial dataset (iter = 0), there is no clear tendency in the Ru and Rh distributions, as the position of Ru atoms was determined randomly. However, the proportion of Ru atoms is larger for the GAN-generated surfaces (iter = 1–5). Another significant trend is that the Ru atoms tend to occupy the step sites (marked in the figure), especially at higher iterations. These atomic positions are neighboring to the sites of the adsorbate atoms or molecules, and these neighboring atoms should have a larger impact on their adsorption energy the other atomic positions. As a result, these sites strongly affect the kinetics and thermodynamics of NH₃ formation. The present result also indicates that the GAN successfully learned this feature and placed the Ru atoms close to the adsorption sites.

The generator loss (G-loss) and discriminator loss (D-loss) during the training process are plotted in Fig. 5. Since each iteration has 2000 epochs, there are a total of 10,000 epochs over the five iterations. In the earlier stage of training, the G-loss is smaller than the D-loss. However, after ~ 1000 epochs, the D-loss becomes lower than the G-loss, meaning that the D part of the GAN is well-trained.

To understand why the generated surfaces have higher TOFs, the energetics of the NH₃ formation reaction were analyzed. Figure 6 summarizes the potential energy curves, the reaction energies of the elementary reactions, and surface fractional coverages of the adsorbates. In this case, the Rh−Ru surfaces with the highest TOF values at iter = 0, 3, 4, and 5 are compared, and the results for all the iterations are shown in the Figures S1−S3 (Supplementary Information). The compositions of these surfaces are Rh₈Ru₇₆, Rh₄Ru₈₀, Rh₁₂Ru₇₂, and Rh₈Ru₇₆, respectively. The optimal TOF value (1.1 × 10⁻³ site⁻¹·s⁻¹) is that of Rh₈Ru₇₆, which is generated at iter = 5. This value is much higher than that of Rh₈Ru₇₆ at iter = 0 (1.0 × 10⁻⁴ site⁻¹·s⁻¹), which is the highest TOF in the original dataset (for brevity, these surfaces are denoted as Rh₈Ru₇₆-iter5 and Rh₈Ru₇₆-iter0 in the following discussion). It should also be noted that these two surfaces have the same composition, but the positions of the Rh atoms are different.

Figure 6a plots the potential energy curves for NH₃ formation. During this process, two NH₃ molecules are formed because the dissociation of one N₂ molecule generates two N* (surface-adsorbed N) atoms. The plots show a deep potential energy sink at the NH₂* + N* + 4H* state caused by the exothermic formation of NH and NH₂ and endothermic formation of NH₃. This energy sink is unfavorable from a thermodynamic viewpoint because the high stability of NH* and NH₂* leads to surface poisoning. Furthermore, the lack of vacancies in the active site prohibits the next catalytic reaction; this issue will be discussed later. The potential energy curves for iter = 2–4 have lower potential energy at NH₂* + N* + 4H*, similar to that for iter = 0. However, at iter = 5, the endothermicity of the reaction is significantly improved. This is beneficial in terms of the accessibility of the active sites to gaseous N₂ molecules. Thus, the potential energy curves show that the GAN-generated surfaces successively improve the thermodynamic character of NH₃ formation.

The potential energy curve of the Rh₈Ru₇₆ surface (i.e., the GAN-generated surface with the highest TOF value) is also compared with those of the Ru and Rh surfaces (Figure S4, Supplementary Information). It should be noted that the pure Ru and Rh surfaces are not included in the dataset, hence they are approximated by the RhRu₈₃ and Rh₈₃Ru surfaces, respectively. Interestingly, the Rh₈Ru₇₆ surface is found to be superior to the other two surfaces owing to its relatively lower E_a for the N₂ dissociation step; as its E_a is much lower than that of the Rh surface and close to the that of the Ru surface. In addition, the hydrogen poisoning around the NH_x (x = 1−3) species on Rh₈Ru₇₆ is significantly reduced compared to that observed over the pure-Ru surface.

The reaction energies (ΔEs) of the elementary reactions (Eqs. (2)–(7)) on several surfaces are shown in Fig. 6b. The reaction energy of N₂ dissociation (Eq. (2)) is more exothermic at iter = 2–5. For example, the ΔE on Rh₈Ru₇₆-iter5 is −0.79 eV, which is lower than that on Rh₈Ru₇₆-iter0 (−0.72 eV). Consequently, the activation barrier E_a becomes lower in the latter owing to the linear free energy relationship (Eq. (11)). Significantly, the GAN-generated surfaces have lower E_a at earlier iterations, while at latter iterations, it focused more on the energy sink in the potential energy profile. This feature can be understood in terms of the volcano curves: the over-stabilization of the N atom leads to a lower TOF even though the corresponding E_a is also low. The present results suggest that the GAN learned such a tendency from the data. Another notable feature is the ΔE for NH₃ formation. As stated above, the facile formation of NH₃ alleviates surface poisoning by NH₂. The Fig. 6b shows that the value of ΔE for the NH₃ formation progressively becomes less endothermic as the iteration proceeds. This suggests that the GAN improves not only the kinetics but also the thermodynamics of the NH₃ formation.

Figure 6c compares the fractional coverages (θ) of the different adsorbates (N, H, NH, NH₂, and NH₃) and the vacant sites (i.e., the active sites) over several Rh−Ru surfaces. A notable difference is seen in the coverage of the vacant site (θ_vac); for example, θ_vac is 7.5 × 10⁻³ for Rh₈Ru₇₆-iter5, but 3.9 × 10⁻³ for Rh₈Ru₇₆-iter0. The higher θ_vac for Rh₈Ru₇₆-iter5 facilitates NH₃ formation by leaving accessible active sites for the N₂ dissociation reaction. In accordance with its higher θ_vac, Rh₈Ru₇₆-iter5 has a lower θ_NH2 (0.73) than Rh₈Ru₇₆-iter0 (0.89). Furthermore, since NH₂ is the most abundant adsorbate during NH₃ formation, a high θ_NH2 indicates NH₂ poisoning on the surface to reduce the NH₃ formation rate, which is known to be a serious disadvantage of Ru surfaces³⁵. Thus, lowering the θ_NH2 and θ_H values is desirable for the catalytic performance. The present data indicate that the GAN-generated surface has a lower θ_NH2 value than those included in the original dataset.

All these results show that the proposed DFT-GAN improves the TOF of NH₃ formation by adjusting the detailed energetics of the elementary reactions. Although the GAN was not explicitly provided with such information, training the neural networks with DFT data successfully captured these details.

Conclusions

The present paper proposes a new approach combining computational chemistry and machine learning to generate new catalytic surfaces in an extrapolative manner. Density functional theory (DFT) is used to calculate the energies of elementary reactions on a provided set of catalytic materials. The results are fed into a generative adversarial network (GAN) to propose additional materials. Here, the approach was used to enhance the turnover frequency (TOF) of NH₃ synthesis in the Rh−Ru bimetallic alloy surface system. The DFT-GAN iterations consist of the following six key steps. (i) DFT is used to obtain the reaction energies (ΔE) of the elementary reactions; this is performed for all surfaces in the initially prepared dataset. (ii) The TOF for NH₃ formation is obtained from the ΔE values assuming N₂ dissociation to be the rate-determining step, and the metal surfaces are labeled according to the TOF values. (iii) The GAN consisting of the discriminator and the generator is trained using the above DFT dataset that has metal surface information and TOF values. (v) The generator part of the GAN produces samples that are not contained in the present dataset; the conditional GAN is used here, and the generator part aims to produce surfaces with higher TOF values. (vi) DFT calculations are performed for the newly generated samples, and the results are added to the dataset.

The iterative process was started with 100 stepped alloy surfaces generated by random atomic replacement. After five iterations, Rh₈Ru₇₆ was successfully obtained as a surface not present in the original dataset. The TOF of the generated surface was more than ten times higher than the optimal TOF value in the original dataset. Overall, the samples generated in later iterations tend to have higher TOFs, indicating that the iterative DFT-GAN scheme helps train the neural networks in the GAN. Furthermore, the generated surfaces generally have a higher proportion of Ru atoms, which agrees with the experimental fact that the Ru surface is a far better catalyst than the Rh surface. The generated surfaces have higher TOF value because of (a) a lower N₂ dissociation reaction energy (which reduces the activation energy for the rate-determining step) and (b) a lower energy of NH₃ formation (which reduces NH₂ coverage on the surface and alleviates NH₂ poisoning). The present study shows that the combination of the DFT and the GAN is a promising strategy for the automatic and continuous improvement of catalyst performances.

Data availability

The datasets generated and analyzed during the current study are available in the authors' GitHub repository at https://github.com/atsushi-ishikawa.

References

Keith, J. A. et al. Combining machine learning and computational chemistry for predictive insights into chemical systems. Chem. Rev. 121(16), 9816–9872 (2021).
Article CAS Google Scholar
Ma, X., Li, Z., Achenie, L. E. K. & Xin, H. Machine-learning-augmented chemisorption model for CO₂ electroreduction catalyst screening. J. Phys. Chem. Lett. 6(18), 3528–3533 (2015).
Article CAS Google Scholar
Schlexer Lamoureux, P. et al. Machine learning for computational heterogeneous catalysis. ChemCatChem 11(16), 3581–3601 (2019).
Article CAS Google Scholar
Zhong, M. et al. Accelerated discovery of CO₂ electrocatalysts using active machine learning. Nature 581(7807), 178–183 (2020).
Article ADS CAS Google Scholar
Goldsmith, B. R., Esterhuizen, J., Liu, J.-X., Bartel, C. J. & Sutton, C. Machine learning for heterogeneous catalyst design and discovery. AlChE J. 64(7), 2311–2323 (2018).
Article CAS Google Scholar
Goodfellow, I. J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. http://arxiv.org/abs/1406.2661.
Mirza, M.; Osindero, S. Conditional generative adversarial nets. https://arxiv.org/abs/1411.1784.
Kim, B., Lee, S. & Kim, J. Inverse design of porous materials using artificial neural networks. Sci. Adv. 6(1), eaax9324 (2020).
Article ADS CAS Google Scholar
Long, T. et al. Constrained crystals deep convolutional generative adversarial network for the inverse design of crystal structures. npj Comput. Mater. 7(1), 66 (2021).
Article ADS CAS Google Scholar
Kim, S., Noh, J., Gu, G. H., Aspuru-Guzik, A. & Jung, Y. Generative adversarial networks for crystal structure prediction. ACS Cent. Sci. 6(8), 1412–1420 (2020).
Article CAS Google Scholar
Dumesic, J., Rudd, D. F., Aparicio, L. M., Rekoske, J. E. & Trevino, A. A. The Microkinetics of Heterogeneous Catalysis 24 (ACS Professional Reference Book, 1993).
Google Scholar
Filot, I. A. W. et al. First-principles-based microkinetics simulations of synthesis gas conversion on a stepped rhodium surface. ACS Catal. 5(9), 5453–5467 (2015).
Article CAS Google Scholar
Reuter, K. Ab initio thermodynamics and first-principles microkinetics for surface catalysis. Catal. Lett. 146(3), 541–563 (2016).
Article CAS Google Scholar
Ishikawa, A. & Tateyama, Y. First-principles microkinetic analysis of NO + CO reactions on Rh(111) surface toward understanding NO_x reduction pathways. J. Phys. Chem. C 122(30), 17378–17388 (2018).
Article CAS Google Scholar
Ishikawa, A. & Tateyama, Y. A first-principles microkinetics for homogeneous-heterogeneous reactions: application to oxidative coupling of methane catalyzed by magnesium oxide. ACS Catal. 11(5), 2691–2700 (2021).
Article CAS Google Scholar
Ertl, G. Surface science and catalysis—studies on the mechanism of ammonia synthesis: the P H. Emmett award address. Catal. Rev. 21(2), 201–223 (1980).
Article CAS Google Scholar
Honkala, K. et al. Ammonia synthesis from first-principles calculations. Science 307(5709), 555–558 (2005).
Article ADS CAS Google Scholar
Liu, H. Ammonia Synthesis Catalysts: Innovation and Practice 1–6 (World Scientific/Chemical Industry Press, 2013).
Book Google Scholar
Dahl, S. et al. Role of steps in N₂ activation on Ru(0001). Phys. Rev. Lett. 83(9), 1814–1817 (1999).
Article ADS Google Scholar
Logadottir, A. & Nørskov, J. K. Ammonia synthesis over a Ru(0001) surface studied by density functional calculations. J. Catal. 220(2), 273–279 (2003).
Article CAS Google Scholar
Wellendorff, J. et al. Density functionals for surface science: Exchange-correlation model development with Bayesian error estimation. Phys. Rev. B 85(23), 235149 (2012).
Article ADS Google Scholar
Blochl, P. E. Projector augmented-wave method. Phys. Rev. B 50(24), 17953–17979 (1994).
Article ADS CAS Google Scholar
Kresse, G. & Furthmuller, J. Efficient iterative schemes for Ab Initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54(16), 11169–11186 (1996).
Article ADS CAS Google Scholar
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59(3), 1758–1775 (1999).
Article ADS CAS Google Scholar
Nørskov, J. K.; Studt, F.; Abild-Pedersen, F.; Bligaard, T., Fundamental Concepts in Heterogeneous Catalysis. John Wiley & Sons, Inc.: New Jersey, 2014, 79–84
Dumesic, J. A. & Trevino, A. A. Kinetic simulation of ammonia synthesis catalysis. J. Catal. 116(1), 119–129 (1989).
Article CAS Google Scholar
NIST Chemistry WebBook, NIST Standard Reference Database Number 69. https://doi.org/10.18434/T4D303.
Nørskov, J. K. et al. Universality in heterogeneous catalysis. J. Catal. 209(2), 275–278 (2002).
Article Google Scholar
Mao, X.; Li, Q.; Xie, H.; Lau, R. Y. K.; Wang, Z.; Smolley, S. P. Least squares generative adversarial networks. https://arxiv.org/abs/1611.04076v3.
Gupta, A. & Zou, J. Feedback GAN for DNA optimizes protein functions. Nat. Mach. Intell. 1(2), 105–111 (2019).
Article Google Scholar
Dong, Y. et al. Inverse design of two-dimensional graphene/h-BN hybrids by a regressional and conditional GAN. Carbon 169, 9–16 (2020).
Article CAS Google Scholar
Ask Hjorth, L. et al. The atomic simulation environment—a python library for working with atoms. J. Phys. Condens. Matter 29(27), 273002 (2017).
Article Google Scholar
Ishikawa, A. Github. https://github.com/atsushi-ishikawa.
Ozaki, A., Aika, K.-I. & Hori, H. A new catalyst system for ammonia synthesis. Bull. Chem. Soc. Jpn. 44(11), 3216–3216 (1971).
Article CAS Google Scholar
Ishikawa, A., Doi, T. & Nakai, H. Catalytic performance of Ru, Os, and Rh nanoparticles for ammonia synthesis: A density functional theory analysis. J. Catal. 357, 213–222 (2018).
Article Google Scholar

Download references

Acknowledgements

The author thanks Dr. Yoshitaka Tateyama (National Institute for Materials Science, NIMS) for scientific discussions and assistance with the computational environment. This study was supported by MEXT as an elementary strategy initiative (Grant Number JPMXP0112101003). The calculations were carried out at the supercomputer center of NIMS, Kyushu University (ITO), and Hokkaido University (Grand Chariot).

Author information

Authors and Affiliations

Center for Green Research on Energy and Environmental Materials (GREEN), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
Atsushi Ishikawa

Authors

Atsushi Ishikawa
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.I. wrote the whole manuscript text and figures.

Corresponding author

Correspondence to Atsushi Ishikawa.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ishikawa, A. Heterogeneous catalyst design by generative adversarial network and first-principles based microkinetics. Sci Rep 12, 11657 (2022). https://doi.org/10.1038/s41598-022-15586-9

Download citation

Received: 01 March 2022
Accepted: 27 June 2022
Published: 08 July 2022
DOI: https://doi.org/10.1038/s41598-022-15586-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.