High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions

Spiekermann, Kevin; Pattanaik, Lagnajit; Green, William H.

doi:10.1038/s41597-022-01529-6

Download PDF

Data Descriptor
Open access
Published: 18 July 2022

High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions

Scientific Data volume 9, Article number: 417 (2022) Cite this article

5314 Accesses
22 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Quantitative chemical reaction data, including activation energies and reaction rates, are crucial for developing detailed kinetic mechanisms and accurately predicting reaction outcomes. However, such data are often difficult to find, and high-quality datasets are especially rare. Here, we use CCSD(T)-F12a/cc-pVDZ-F12//ωB97X-D3/def2-TZVP to obtain high-quality single point calculations for nearly 22,000 unique stable species and transition states. We report the results from these quantum chemistry calculations and extract the barrier heights and reaction enthalpies to create a kinetics dataset of nearly 12,000 gas-phase reactions. These reactions involve H, C, N, and O, contain up to seven heavy atoms, and have cleaned atom-mapped SMILES. Our higher-accuracy coupled-cluster barrier heights differ significantly (RMSE of ∼5 kcal mol⁻¹) relative to those calculated at ωB97X-D3/def2-TZVP. We also report accurate transition state theory rate coefficients \({k}_{\infty }(T)\) between 300 K and 2000 K and the corresponding Arrhenius parameters for a subset of rigid reactions. We believe this data will accelerate development of automated and reliable methods for quantitative reaction prediction.

Measurement(s)	Barrier Heights • Enthalpies • Rate Coefficients
Technology Type(s)	ab initio quantum chemistry computational method

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

Scientific discovery in the age of artificial intelligence

Article 02 August 2023

Selection of a promiscuous minimalist cAMP phosphodiesterase from a library of de novo designed proteins

Article Open access 03 May 2024

Background & Summary

Detailed reaction mechanisms are valuable tools for analyzing and predicting physical phenomena driven by chemical kinetics. Historically, kinetic model parameters were fit to a specific set of experimental results, which limited their generalizability to systems with different temperatures, pressures, or initial compositions. In recent decades, the field of chemical kinetics has transitioned from postdictive to predictive modeling approaches¹. This shift has been motivated by advances in compute power, which make it possible to predict many kinetic parameters using ab initio calculations rather than relying on scarce experimental data^{2,3,4,5,6,7,8,9,10,11,12,13}. Our research group has long been interested in the automated generation of kinetic models, which can simulate and predict the concentrations of all relevant species^14,15. Reliable datasets are essential for constructing such models with predictive power. A small error of a few kcal mol⁻¹ in the activation energy will lead to significant errors in the final rate estimate, particularly at lower temperatures. Unfortunately, accurate barriers are often known for fewer than 10% of the reactions in kinetic models^{8,9,10,11,12,13,16,17}. To help address this paucity of data, here we present relatively accurate barriers for nearly 12,000 reactions.

Kinetic parameters are currently estimated using functional group and linear-free-energy (LFER) methods,^18,19, but machine learning models are much more flexible and have broader scope. Indeed, machine learning has sparked an explosion of progress in physical and organic chemistry, especially in the areas of automated synthesis planning^20,21, targeted molecular optimization^22,23, and general property prediction^24,25 from thermodynamic^26,27 and solvation parameters^28,29 to full infrared spectra³⁰. In situations where data are plentiful, machine learning-based algorithms often provide excellent predictions and some have successfully been applied to experiments³¹. When such data is lacking, researchers generate their own datasets–both experimentally and computationally–to regress desired properties from them^32,33,34. In machine learning applied to chemistry, the community has largely taken a model-driven approach, where significant effort has been devoted to refining models on a few benchmark datasets^35,36. As a result, the community has delivered strong architectures from advanced graph convolutional networks^37,38,39 to atomistic networks^40,41. Today, progress is limited primarily by the scarcity of large, diverse, and high-quality datasets.

Here, we report a cleaned, high-quality dataset of reaction barriers, enthalpies, and transition state theory (TST) rate coefficients. We build upon the prior work from Grambow et al^42,43. Briefly, their work used the single-ended growing string method⁴⁴ to automatically identify thousands of transition states (TSs) and products from a given set of reactants. Reactants were chosen by using all molecules with six or fewer heavy atoms from GDB-7⁴⁵ as well as randomly selecting some (∼430) molecules with seven heavy atoms; the molecules contain H, C, N, and O atoms. Conformer searches were performed for the reactants by embedding several hundred conformers for each molecule using RDKit⁴⁶ with the ETKDG distance geometry method⁴⁷ and relaxing their geometries using the MMFF94 force field implemented in RDKit. The lowest energy conformer was then optimized using Q-Chem⁴⁸ at both the B97-D3/def2-mSVP level of theory with Becke-Johnson damping⁴⁹ and the ωB97X-D3/def2-TZVP⁵⁰ level of theory. The reactant conformer was the starting point for the growing string search. The highest energy point in the string was used as the initial guess for a conventional saddle point search. Additional details can be found in the original publication.

Our work brings the following advances. First, we clean the SMILES⁵¹ reported in Grambow et al⁴². The original publication treated all reactions with multiple products as containing one product complex, which does not conform well with traditional TST calculations that expect partition functions for each species. Thus, we separate the products from any product complex, recalculate the geometry optimization and frequency at either B97-D3/def2-mSVP or ωB97X-D3/def2-TZVP. Finally, we refine the single point energies for each species using explicitly correlated coupled-cluster calculations, which are expected to be much more accurate than the density functional theory methods^52,53,54,55. We provide the updated barrier heights and reaction enthalpies from our CCSD(T)-F12a/cc-pVDZ-F12//ωB97X-D3/def2-TZVP calculations. Our higher-accuracy calculations improve the RMSE of the barrier heights by approximately 5 kcal mol⁻¹ relative to those calculated at ωB97X-D3/def2-TZVP. We believe that the high-quality values in this dataset will accelerate development of automated and reliable methods for quantitative reaction prediction. Finally, we also identify a subset of reactions with rigid species that do not require a conformer search nor hindered-rotor treatment. The rigid-rotor harmonic oscillator (RRHO) TST rate coefficients \({k}_{\infty }(T)\) and fitted Arrhenius parameters for this subset are reported since these values should be accurate. We do not report \({k}_{\infty }(T)\) or Arrhenius parameters for reactions involving flexible reactants or transition states since RRHO TST is not accurate for these reactions.

Methods

Overview

Dataset refinement started by cleaning the SMILES from the original dataset⁴³ and filtering reactions to those containing one reactant and at most three products. Next, product complexes are separated into individual species, each of which is reoptimized at the respective level of theory i.e. either B97-D3/def2-mSVP or ωB97X-D3/def2-TZVP. The single point energy of all species optimized at ωB97X-D3/def2-TZVP is computed at CCSD(T)-F12a/cc-pVDZ-F12. These energies are used to calculate updated barrier heights by adding the zero-point energies (ZPEs) from the harmonic vibrational analysis to the reactant, product, and TS energies and then computing the difference between the resulting TS and reactant energies. Similarly, enthalpies of reaction are calculated based on the difference of the ZPE-corrected product and reactant energies; bond additivity corrections (BACs) are added to each species. Finally, we identify a subset of reactions that contain rigid species, calculate high-pressure limit TST rate coefficients, and report the fitted Arrhenius parameters.

Cleaning SMILES

The work from Grambow et al.⁴² used the single-ended growing string method⁴⁴ to generate a list of possible products from a given reactant. The input and output for the growing string method are a set of three-dimensional coordinates to describe the molecule or multi-molecule complex. Grambow et al. used Open Babel⁵⁶ to perceive connectivity and generate a SMILES for the reactant and product from each set of three-dimensional coordinates. However, in some cases, the bond-order and formal charges did not correspond to the most representative resonance structure. Here, we update the SMILES by using RDKit⁴⁶ to look for neighboring atoms with opposite formal charges, which often occurred between nitrogen and carbon atoms. Some representative examples of the updated SMILES and their impact on molecular structure are shown in Fig. 1. Additionally, there were a handful of reactions whose reactant was neutral, but whose product was positively charged. This charge imbalance was likely due to Open Babel occasionally generating an incorrect SMILES from the molecular coordinates. Here, we update the corresponding product SMILES to conserve charge for the reaction i.e. added an electron to create a correct Lewis structure. Representative examples are shown in Fig. 2. Atom-mapping is preserved when updating the SMILES.

Reoptimizing products

Of the reactions in the previously published dataset from Grambow et al.⁴², approximately 30% contain two products and 2% contain three products. These were previously treated as one product complex when using the growing string method as well as during subsequent geometry optimization and frequency calculation. However, to obtain rate coefficients, conventional canonical TST calculations expect partition functions for each individual species. Here, we separate the complexes into individual products, reoptimize the geometries, and recalculate the frequencies using Q-Chem 5.3.0⁴⁸ for both the B97-D3/def2-mSVP and ωB97X-D3/def2-TZVP datasets. The previously optimized geometries from the complex are used as the initial guess for the new optimization. The exact same settings are used in the input files as were used by Grambow et al.⁴² during the original Q-Chem calculations. This ensures that the separated products are run with the same method and basis set as well as with identical convergence criteria as those used by their corresponding reactant and TS.

Consistent with the previous work, nearly all molecules are run in the singlet state and use a spin-unrestricted ansatz. For example, the ground electronic state for methylene (CH₂) is a triplet, but because the TS for all reactions was computed at the singlet state, any CH₂ products were also recalculated in the singlet state. However, upon splitting some products, there are 49 reactions unique to the larger B97-D3 dataset whose product pairs were radicals; these individual species were calculated in the doublet state since it was assumed that the lone electron on each product had opposite spins to conserve the overall multiplicity for the reaction. We verified that spin contamination is not a problem by confirming that the average value of the total spin operator is between 0.75 and 0.77 for these species. The value of <S²> is 0 for all other species.

Note that reoptimizing the separated products and then summing their energy resulted in a different product energy than that from the original product complex. In a few cases, this changed the reaction enthalpy enough such that ΔH > ΔE₀, which would cause the reverse reaction to have a negative barrier height. Although submerged barriers are possible, often a large negative barrier height is reason to be suspicious. Thus, we remove any reaction in which the explicitly correlated coupled-cluster reaction enthalpy was more than 10 kcal mol⁻¹ larger than the barrier height.

Refining single point energies

A major accomplishment of this work is providing highly accurate kinetic parameters computed at RCCSD(T)-F12a/cc-pVDZ-F12//ωB97X-D3/def2-TZVP for a large and diverse set of atom-mapped gas phase reactions. All species were calculated in the singlet state. Although the ωB97X-D3 method is more accurate for predicting barrier heights than many other functionals⁵⁷, coupled-cluster CCSD(T) calculations are commonly considered the gold-standard in quantum chemistry^58,59. Here, we refine the single point energies of each species from the ωB97X-D3/def2-TZVP dataset using the explicitly correlated CCSD(T)-F12 method since previous literature has shown that CCSD(T)-F12 can achieve similar accuracy to the standard CCSD(T) calculation while using a much smaller basis set^{3,52,53,54,55,60}. This is notable because both coupled-cluster methods scale as \(O({N}^{7})\), such that N is the number of orbitals⁶¹, so using a smaller basis set offers substantial computational savings.

We next consider which basis set to use. Although triple-\(\zeta \) and quadruple-\(\zeta \) basis sets have shown reaction energies within 1 kcal mol⁻¹, many studies comparing basis sets use about 100 molecules or fewer. Further, such studies often focus on small molecules containing primarily three or four heavy atoms due to the steep scaling of coupled-cluster calculations. The main exception to this generalization is the recent calculation of the 133,000 molecules from QM9³⁵ with the G4MP2 level of theory^62,63; however, that dataset only contains stable species, while our dataset has approximately 12,000 TSs. Considering that our dataset contains nearly 22,000 unique stable species and transition states, each containing up to seven heavy atoms, we chose the cc-pVDZ-F12 basis set to accommodate the large number of calculations while maintaining high accuracy. 15 reactions were also run with cc-pVTZ-F12 to validate the double-\(\zeta \) accuracy. For all coupled-cluster single point calculations, we use the energy from CCSD(T)-F12a since both published literature⁶⁴ and the MOLPRO documentation⁶⁵ conclude this method offers a better approximation to the complete basis set limit for the double-\(\zeta \) and triple-\(\zeta \) basis sets. All calculations were run in parallel using MOLPRO 2015.1⁶⁵ on the National Energy Research Scientific Computing Center (NERSC).

Calculating reaction barrier heights and enthalpies

ZPEs from the harmonic vibrational analysis are added to the electronic energy for the reactant, product, and TS. For all species, a scaling factor is applied to the computed harmonic frequencies used to compute the ZPE; the scaling factor for B97-D3/def2-mSVP and for ωB97X-D3/def2-TZVP are calculated as described by Alecu et al.⁶⁶ and found to be 1.014 and 0.984 respectively. Reaction barrier heights are computed by taking the difference of the resulting TS and reactant energies. Similarly, enthalpies of reaction at 298 K are computed by taking the difference of the resulting product and reactant energies. When calculating the values for the coupled-cluster dataset, the CCSD(T)-F12a energies are used while the ZPEs are taken from the ωB97X-D3 calculation since this was the level of theory used for the geometry optimization and vibrational analysis. Note that atom energy corrections (AECs) and bond additivity corrections (BACs) are added to the enthalpy values for each species. Although the AECs cancel out during the subtraction to obtain the reaction enthalpy since all reactions are balanced, these corrections are important when comparing the \({\Delta }_{{\rm{f}}}\)H(298) to experimental values as described in the technical validation. Corrections are not used when computing reaction barriers. AECs are calculated by fitting the atomization energies of 14 small molecules. The atomization energies come from CCCBDB⁶⁷ and all have uncertainty values less than 0.2 kcal mol⁻¹. Petersson type BACs⁶⁸ were fit using a set of about 400 reference species with well-known heats of formation, primarily drawn from ATcT⁶⁹ and CCCBDB⁶⁷. The experimental uncertainty is at most 0.55 kcal mol⁻¹, though most values are much lower with the median being just 0.14 kcal mol⁻¹. For more details on the fitting procedure, see the Reaction Mechanism Generator (RMG) documentation at https://reactionmechanismgenerator.github.io/RMG-Py/users/arkane/input.html#atom-energy-fitting.

Calculating rates

Automated Reaction Kinetics and Network Exploration (Arkane) is a software package for computing thermodynamic properties and high-pressure limit rate coefficients using the results from quantum chemistry calculations. Thermodynamic properties are computed using the RRHO approximation, while kinetic parameters are computed using conventional canonical TST, also with RRHO. Arkane is developed and distributed as part of RMG-Py^14,15. All software is written in Python and provided as free, open source code under the terms of the MIT License.

We use Arkane to convert the single point energy from the quantum chemistry calculation to the gas-phase reference state; by default atom and spin-orbit coupling energy corrections are applied, but will cancel during the TST calculation. As before, the corresponding scaling factor is applied to the ZPE for each species. Arkane uses RRHO TST with Eckart tunneling correction to calculate the forward rate coefficient for a set of user-defined temperatures. BACs are omitted when calculating the forward rate coefficient since BACs are not present for the partial bonds in the TS. Arkane then uses a linear least-squares fitting to fit the list of reciprocal temperatures and logarithm of the rate coefficients to an Arrhenius expression, yielding the best approximation for the pre-exponential A-factor and the activation energy. We use 50 linearly spaced points in the reciprocal temperature space between 300 K and 2000 K when obtaining the Arrhenius parameters.

Data Records

All data is free and publicly accessible on Zenodo⁷⁰. Q-Chem output files are provided for the 16,302 reactions at B97-D3/def2-mSVP and for the 11,926 reactions at ωB97X-D3/def2-TZVP level of theory. For convenience, these also include the original log files for the reactant, TS, and non-reoptimized products from Grambow et al.⁴² since they are used to calculate barrier heights, enthalpies, and rate coefficients in this work. MOLPRO output files from the single point calculations are provided for the 11,926 reactions at the CCSD(T)-F12a/cc-pVDZ-F12 level of theory as well as for the 15 reactions calculated with the triple-\(\zeta \) basis. Information for each reaction is organized by the level of theory and stored in a separate folder labeled as rxn######, such that ###### denotes the reaction number padded with zeros. The numbering matches that from the originally published dataset⁴³ to facilitate easy comparison. For the quantum chemistry calculations, each folder contains the log files for the reactant, TS, and product as r######.log, ts######.log, and p######.log respectively. An additional number is appended to the file names from the separated products. For example, the log files for any reaction containing two products are labeled as p######_0.log and p######_1.log.

The cleaned atom-mapped SMILES, as well as all values calculated in this work, are provided in the comma-separated values (csv) files b97d3.csv, wb97xd3.csv, ccsdtf12_dz.csv, and ccsdtf12_tz.csv. The columns for the csv files are described in Table 1. The calculated TST rate coefficients and fitted Arrhenius parameters for the rigid species are provided in ccsdtf12_dz_rigid.csv, whose columns are described in Table 2. The Arkane output files from TST calculations and Arrhenius parameter fitting are also provided. Each reaction is again stored in a separate folder labeled as rxn######, which contains a rxn folder with all information from the Arrhenius fitting. The kinetic information is stored in Chemkin⁷¹ file format. The list of 50 temperatures (K) used during Arrhenius fitting is provided in arkane_temperatures.csv.

Table 1 Description of the columns in the main comma-separated value files for each level of theory.

Full size table

Table 2 Description of the columns in the comma-separated value file for rigid reactions.

Full size table

The improvement from fitting BACs at B97-D3/def2-mSVP, ωB97X-D3/def2-TZVP, CCSD(T)-F12a/cc-pVDZ-F12//ωB97X-D3/def2-TZVP, and CCSD(T)-F12a/cc-pVTZ-F12//ωB97X-D3/def2-TZVP is contained in b97d3_def2msvp_BAC.csv, wb97xd3_def2tzvp_BAC.csv, ccsdtf12_ccpvdzf12__wb97xd3_def2tzvp_BAC.csv, and ccsdtf12_ccpvtzf12__wb97xd3_def2tzvp_BAC.csv respectively. The files contain the experimental and calculated enthalpies for the reference species from RMG-database used for fitting. The atom and bond correction values are publicly stored on the RMG-database GitHub, though they are also provided in fitted_corrections.pkl for convenience. Further validation of the BACs at the double-\(\zeta \) basis set is done by comparing to experimental values from the Pedley⁷² set since over half of these molecules are not in the RMG-database training set used for fitting. The comparison is shown in ccsdtf12_dz_vs_Pedley_experimental.csv.

Technical Validation

The published work from Grambow et al.⁴² already performed several integrity checks, such as ensuring that all TSs have exactly one imaginary frequency, whose atomic displacements matched the bond changes occurring between the reactant and product. The authors also removed any TS with an imaginary frequency less than 100 cm⁻¹ in magnitude as that typically corresponds to conformational changes. In this work, we ensure that multiplicity and charge are conserved for all reactions. As described in the methods section, this is important when separating product complexes into individual product geometries for reoptimization. We next identify whether each reaction matches a reaction template from the RMG-database. As shown in Fig. 3, keto-enol is the most represented RMG template. However, due to the diversity of these reactions, the majority do not match any RMG template. This is consistent with the previously published work⁴², which chose to characterize the reaction diversity by extracting general templates that do not necessarily match a template from RMG-database. RMG-database is frequently updated, which includes occasionally updating the reaction templates to be more broad or more specific. Thus, using reaction templates from a different version of RMG-database may capture more or less reactions for a given family (or even include different reaction families). To generate Fig. 3 and Table 3, we used the AEC_BAC branch of RMG-database.

Table 3 RMG reaction templates present in the CCSD(T)-F12a/cc-pVDZ-F12 dataset.

Full size table

To evaluate the improvement from applying the fitted atom and bond corrections to the enthalpy values, we calculate the error relative to the high-quality reference set of about 400 molecules used for fitting. For the coupled-cluster data, the training mean absolute error (MAE) and root mean squared error (RMSE) are 0.5 and 0.8 kcal mol⁻¹ respectively. We also compare the corrected enthalpy values to an external test set originally published by Pedley⁷² and compiled and verified by Narayanan et al.⁶² to validate our enthalpy calculation approach (CCSD(T)-F12a + AEC + BAC). This set, named the Pedley test set, contains 459 species that have experimental uncertainty, defined as 95% confidence intervals^73,74, of less than 1 kcal mol⁻¹. After removing 76 species common to both our reference set and our coupled-cluster dataset (so as to exclude training species from our test set), we measure the error of our corrected enthalpies against the Pedley test set. This evaluation returns an MAE of 0.8 kcal mol⁻¹ and RMSE of 1.2 kcal mol⁻¹, indicating strong agreement of our approach with high-quality experimental data.

To compare accuracy improvements from the coupled-cluster calculations in this work, Table 4 shows the MAE and RMSE of the barrier height and reaction enthalpy relative to the lower levels of theory. The summary statistics are calculated using the list of about 10,400 reactions that are common to all three level of theory datasets. As expected, values from the ωB97X-D3 dataset show less deviation than those from the B97-D3 dataset, yet they are still several kcals away from the explicitly correlated coupled-cluster values. The RMSE of 5 kcal mol⁻¹ is significant since it implies that rate coefficients calculated at ωB97X-D3 differ on average by a factor of 12 at 1,000 K relative to those calculated at CCSD(T)-F12a; the difference increases substantially to a factor of 4,000 at 300 K. It is interesting to note that the RMSE for barrier heights reported here is more than twice as large as the RMSE reported in Ref. ⁵⁷. However, this previous analysis was done with 206 reactions from just a few reaction families, whereas the data presented in Table 4 represent more than 10,000 reactions and a much more diverse array of chemistry. Taking the barrier height RMSE of only RMG reaction families gives 8.0 and 3.8 kcal mol⁻¹ for the B97-D3 and ωB97X-D3 dataset respectively, both of which are smaller deviations than that for the entire dataset. As seen in Fig. 4, the barrier heights calculated at DFT tended to be an overestimate relative to those calculated at CCSD(T)-F12a/cc-pVTZ-F12//ωB97X-D3/def2-TZVP. On average, the difference is a few kcal mol⁻¹, though there are a minority of reactions in which the errors are larger. Further exploration as to why this DFT functional gives such different values could be an area of future research.

Table 4 Errors in \({\text{kcal mol}}^{-1}\) for each level of theory relative to CCSD(T)-F12a/cc-pVDZ-F12.

Full size table

We next compare some of the coupled cluster values in our dataset to those from other published works. For example, Dontgen et al.⁷⁵ reported ten keto-enol reactions calculated at DLPNO-CCSD(T)/CBS//B3LYP-D3BJ/def2-TZVP. Two of their reactions (reactions 1 and 7) are also present in our dataset. For both reactions, the barrier heights agree within 0.3 kcal mol⁻¹. Balabin⁷⁶ calculated tautomerization reactions of triazoles at CCSD(T)/CBS//MP2/aug-cc-pVT. For the reaction of 1H-1,2,3-triazole producing 2H-1,2,3-triazole, they report a reaction enthalpy of −3.98 kcal mol⁻¹ compared to our calculated value of −3.96 kcal mol⁻¹. Their reported barrier height for the reverse direction is 53.6 kcal mol⁻¹ compared to our value of 49.8 kcal mol⁻¹, calculated by subtracting our reaction enthalpy from our barrier height for the forward direction.

To further evaluate the improvement from the double-\(\zeta \) single point calculations, we also calculate some reactions at CCSD(T)-F12a/cc-pVTZ-F12//ωB97X-D3/def2-TZVP. The triple-\(\zeta \) calculations required substantially more computational time and scratch space when compared to the double-\(\zeta \) calculations. Three reactions are sampled from each of the top five most common RMG reaction families, shown in Fig. 3. When looking at the distribution of barrier heights within each family for the double-\(\zeta \) basis set, the three reactions are chosen to represent approximately the 25^th, 50^th, and 75^th percentile. Table 5 and Table 6 show the MAE and RMSE of the barrier height and reaction enthalpy respectively from the different levels of theory with respect to the triple-\(\zeta \) basis set. These trends are consistent with other previous literature that emphasizes the high fidelity of explicitly correlated coupled-cluster calculations, even with a double-\(\zeta \) basis set^55,60.

Table 5 Barrier height errors \(({\text{kcal mol}}^{-1})\) for each level of theory relative to CCSD(T)-F12a/cc-pVTZ-F12 for sample reactions.

Full size table

Table 6 Reaction enthalpy errors \(({\text{kcal mol}}^{-1})\) for each level of theory relative to CCSD(T)-F12a/cc-pVTZ-F12 for sample reactions.

Full size table

Finally, Grambow et al.⁴² already performed a conformer search for the reactants. No additional conformer searching is done in this work. Instead, we identify rigid reactions that do not require a conformer search and report the RRHO TST rate coefficients and fitted Arrhenius parameters for this subset since the explicitly correlated coupled-cluster values should be quite reliable. Fig. 5 summarizes the filtering workflow to identify rigid reactions. We start by using RDKit’s Lipinski rotatable bond SMARTS to find reactions whose reactant and product(s) do not have any rotatable bonds. Further, if any rings are present, we filter molecules with only planar rings (either aromatic or 3-membered). With these criteria, we identify a subset of reactions from the CCSD(T)-F12a/cc-pVDZ-F12 dataset that contain rigid stable species. We next omit any reaction whose TS has a positive frequency smaller than 100 cm⁻¹ since this is a common threshold for distinguishing conformational motions from vibrational modes that will be used in the rigid rotor harmonic oscillator approximation^77,78,79,80. As the last filtering step, we visually inspect the remaining reactions to verify that RDKit correctly identified rigid species and also confirm that the TS would not require a conformer search either; this left 105 rigid reactions. Two examples are shown in Fig. 6.

To determine whether the fitted parameters give a good estimate of the rate coefficient for these rigid reactions, we examine the average absolute percentage error between the rate coefficient calculated from TST and the predicted rate coefficients using the fitted parameters. The largest value is 233% i.e. about a factor of 3 error in the rate coefficient which is often quite acceptable, though most reactions have a much lower error. For instance, 62 of these rigid reactions have average fitting errors below 20%; thus, the fitted A-factor and activation energy should be very reliable for these reactions. Residuals from the least-squares fit for reactions with the 25^th and 75^th percentile for average percentage error are shown in Fig. 7. For nearly all data points, the residuals are very close to zero. Lastly, we searched for published experimental data to compare with our calculated values. Saito et al.⁸¹ studied isomerization of acetonitrile to methyl isocyanide at 1600–2100 K behind reflected shock waves. They report \({k}_{\infty }(T)=1{0}^{13.5}\) exp(−260 kJ mol⁻¹/RT) s⁻¹, which agrees quite well with the A-factor of 1.14 \(\times 1{0}^{14}\) s⁻¹ and Ea of 262 kJ mol⁻¹ from our fitted Arrhenius expression.

Usage Notes

Except for the commercial Q-Chem and MOLPRO quantum chemistry softwares, all code necessary to reproduce the generated data is available on GitHub⁸². The repository contains scripts, which should be run in the following order:

Jupyter notebooks were used to identify potentially erroneous SMILES for the reactants and products of both the B97-D3/def2-mSVP and ωB97X-D3/def2-TZVP level of theory. The suggested SMILES were manually inspected, utilizing the interactive nature of Jupyter notebooks, to confirm that the change was chemically reasonable and preserved the atom-mapping.
create_qchem_input_files.py: Parses the original Q-Chem log files from Grambow et al.⁴² to separate product complexes into the individual Q-Chem input files for both the B97-D3/def2-mSVP and ωB97X-D3/def2-TZVP levels of theory.
create_molpro_input_files.py: Creates MOLPRO input files for the single point calculations at CCSD(T)-F12a/cc-pVDZ-F12 using the reoptimized ωB97X-D3/def2-TZVP geometries.
parse_barriers_enthalpies.py: Compiles the reactant and product SMILES into comma-separated values files and parses the reaction barrier heights and enthalpies.
get_enthalpies_corrected.py: Uses the atom and bond corrections from RMG-database to obtain more accurate reaction enthalpies.
identify_rmg_reactions.py: Identifies which reactions correspond to RMG reaction templates.
identify_rigid_species.py: Uses RDKit to identify reactions with rigid reactant and product.
run_arkane.py: Runs Arkane to obtain Arrhenius rate parameters for the rigid reactions.
parse_tst_rates.py: Parses the calculated rate coefficients from the Arkane output files.
parse_arrhenius_parameters.py: Parses the fitted A-factors and activation energies from Arkane output files.
calculate_percent_error.py: Calculates the average percent error between the rate coefficients calculated from TST and those predicted using the fitted Arrhenius parameters.

Code availability

The code used to generate this data is freely available on GitHub under the MIT license⁸². Details on how to use the scripts to generate the data are provided in the Usage Notes. Some of the scripts utilize helpful components of the Reaction Mechanism Generator, such as RMG-Py, RMG-database, and the Automatic Rate Calculator (ARC)⁸³. All related software is open-source under the MIT license and freely accessible on GitHub. For RMG-Py, checkout the qchem_parser branch, and for RMG-database, checkout AEC_BAC. The GitHub version commit string was ea2eb625fb1dcc6892ef6ddd5d7fdc96abf477e1 for ARC on the main branch.

References

Green, W. H. Moving from postdictive to predictive kinetics in reaction engineering. AIChE Journal 66, e17059 (2020).
Article CAS Google Scholar
Wang, K. & Dean, A. M. Rate rules and reaction classes. Computer Aided Chemical Engineering 45, 203–257 (2019).
Zheng, J., Zhao, Y. & Truhlar, D. G. The DBH24/08 database and its use to assess electronic structure model chemistries for chemical reaction barrier heights. Journal of Chemical Theory and Computation 5, 808–821 (2009).
Article CAS PubMed Google Scholar
Krasnoukhov, V. S., Zagidullin, M. V., Zavershinskiy, I. P. & Mebel, A. M. Formation of phenanthrene via recombination of indenyl and cyclopentadienyl radicals: a theoretical study. Journal of Physical Chemistry A 124, 9933–9941 (2020).
Article ADS CAS PubMed Google Scholar
Grinberg Dana, A., Moore, K. B. III, Jasper, A. W. & Green, W. H. Large intermediates in hydrazine decomposition: A theoretical study of the N3H5 and N4H6 potential energy surfaces. Journal of Physical Chemistry A 123, 4679–4692 (2019).
Article ADS CAS PubMed Google Scholar
Keçeli, M. et al. Automated computational thermochemistry for butane oxidation: A prelude to predictive automated combustion kinetics. Proceedings of the Combustion Institute 37, 363–371 (2019).
Article CAS Google Scholar
Gillis, R. J. & Green, W. H. Thermochemistry prediction and automatic reaction mechanism generation for oxygenated sulfur systems: A case study of dimethyl sulfide oxidation. ChemSystemsChem 2, e1900051 (2020).
Article CAS Google Scholar
Johnson, M. S. et al. Oxidation and pyrolysis of methyl propyl ether. International Journal of Chemical Kinetics 53, 915–938 (2021).
Article CAS Google Scholar
Dong, X. et al. Revealing the critical role of radical-involved pathways in high temperature cyclopentanone pyrolysis. Combustion and Flame 216, 280–292 (2020).
Article CAS Google Scholar
Pio, G., Dong, X., Salzano, E. & Green, W. H. Automatically generated model for light alkene combustion. Combustion and Flame 241, 112080 (2022).
Article CAS Google Scholar
Class, C. A., Vasiliou, A. K., Kida, Y., Timko, M. T. & Green, W. H. Detailed kinetic model for hexyl sulfide pyrolysis and its desulfurization by supercritical water. Physical Chemistry Chemical Physics 21, 10311–10324 (2019).
Article CAS PubMed Google Scholar
Khanniche, S., Lai, L. & Green, W. H. Kinetics of intramolecular phenyl migration and fused ring formation in hexylbenzene radicals. Journal of Physical Chemistry A 122, 9778–9791 (2018).
Article ADS CAS PubMed Google Scholar
Payne, A. M., Spiekermann, K. A. & Green, W. H. Detailed reaction mechanism for 350–400C pyrolysis of an alkane, aromatic, and long-chain alkylaromatic mixture. Energy & Fuels 36, 1635–1646 (2022).
Article CAS Google Scholar
Gao, C. W., Allen, J. W., Green, W. H. & West, R. H. Reaction mechanism generator: Automatic construction of chemical kinetic mechanisms. Computer Physics Communications 203, 212–225 (2016).
Article ADS CAS Google Scholar
Liu, M. et al. Reaction mechanism generator v3.0: Advances in automatic mechanism generation. Journal of Chemical Information and Modeling 61, 2686–2696 (2021).
Article CAS PubMed Google Scholar
Lai, L., Khanniche, S. & Green, W. H. Thermochemistry and group additivity values for fused two-ring species and radicals. Journal of Physical Chemistry A 123, 3418–3428 (2019).
Article ADS CAS PubMed Google Scholar
Lai, L., Pang, H.-W. & Green, W. H. Formation of two-ring aromatics in hexylbenzene pyrolysis. Energy & Fuels 34, 1365–1377 (2020).
Article CAS Google Scholar
Benson, S. W. Thermochemistry and kinetics of sulfur-containing molecules and radicals. Chemical Reviews 78, 23–35 (1978).
Article CAS Google Scholar
Carstensen, H.-H. & Dean, A. M. Rate constant rules for the automated generation of gas-phase reaction mechanisms. Journal of Physical Chemistry A 113, 367–380 (2009).
Article ADS CAS PubMed Google Scholar
Schwaller, P. et al. Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy. Chemical Science 11, 3316–3325 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mo, Y. et al. Evaluating and clustering retrosynthesis pathways with learned strategy. Chemical Science 12, 1469–1478 (2021).
Article CAS Google Scholar
Bradshaw, J., Paige, B., Kusner, M. J., Segler, M. H. & Hernández-Lobato, J. M. Barking up the right tree: an approach to search over molecule synthesis DAGs. arXiv preprint arXiv:2012.11522 (2020).
Coley, C. W. Defining and exploring chemical spaces. Trends in Chemistry 3, 133–145 (2021).
Article CAS Google Scholar
Yang, K. et al. Analyzing learned molecular representations for property prediction. Journal of Chemical Information and Modeling 59, 3370–3388 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stokes, J. M. et al. A deep learning approach to antibiotic discovery. Cell 180, 688–702 (2020).
Article CAS PubMed PubMed Central Google Scholar
Li, Y.-P., Han, K., Grambow, C. A. & Green, W. H. Self-evolving machine: A continuously improving model for molecular thermochemistry. Journal of Physical Chemistry A 123, 2142–2152 (2019).
Article ADS CAS PubMed Google Scholar
Grambow, C. A., Li, Y.-P. & Green, W. H. Accurate thermochemistry with small data sets: A bond additivity correction and transfer learning approach. Journal of Physical Chemistry A 123, 5826–5835 (2019).
Article ADS CAS PubMed Google Scholar
Vermeire, F. H. & Green, W. H. Transfer learning for solvation free energies: From quantum chemistry to experiments. Chemical Engineering Journal 418, 129307 (2021).
Article CAS Google Scholar
Chung, Y. et al. Group contribution and machine learning approaches to predict Abraham solute parameters, solvation free energy, and solvation enthalpy. Journal of Chemical Information and Modeling 62, 433–446 (2022).
Article CAS PubMed Google Scholar
McGill, C., Forsuelo, M., Guan, Y. & Green, W. H. Predicting infrared spectra with message passing neural networks. Journal of Chemical Information and Modeling 61, 2594–2609 (2021).
Article CAS PubMed Google Scholar
Coley, C. W. et al. A robotic platform for flow synthesis of organic compounds informed by AI planning. Science 365 (2019).
Ahneman, D. T., Estrada, J. G., Lin, S., Dreher, S. D. & Doyle, A. G. Predicting reaction performance in C–N cross-coupling using machine learning. Science 360, 186–190 (2018).
Article ADS CAS PubMed Google Scholar
Grambow, C. A., Pattanaik, L. & Green, W. H. Deep learning of activation energies. Journal of Physical Chemistry Letters 11, 2992–2997 (2020).
Article CAS PubMed Google Scholar
Pattanaik, L., Ingraham, J. B., Grambow, C. A. & Green, W. H. Generating transition states of isomerization reactions with deep learning. Physical Chemistry Chemical Physics 22, 23618–23626 (2020).
Article CAS PubMed Google Scholar
Ramakrishnan, R., Dral, P. O., Rupp, M. & Von Lilienfeld, O. A. Quantum chemistry structures and properties of 134 kilo molecules. Scientific Data 1, 1–7 (2014).
Article CAS Google Scholar
Smith, J. S., Isayev, O. & Roitberg, A. E. ANI-1, A data set of 20 million calculated off-equilibrium conformations for organic molecules. Scientific Data 4, 1–8 (2017).
Article CAS Google Scholar
Kipf, T. N. & Welling, M. Semi-supervised classification with graph convolutional networks. arXiv 1609.02907 (2016).
Pattanaik, L. et al. Message passing networks for molecules with tetrahedral chirality. arXiv 2012.00094 (2020).
Mercado, R. et al. Graph networks for molecular design. Machine Learning: Science and Technology 2, 025023 (2021).
Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. SchNet–A deep learning architecture for molecules and materials. Journal of Chemical Physics 148, 241722 (2018).
Article ADS PubMed CAS Google Scholar
Klicpera, J., Groß, J. & Günnemann, S. Directional message passing for molecular graphs. arXiv 2003.03123 (2020).
Grambow, C. A., Pattanaik, L. & Green, W. H. Reactants, products, and transition states of elementary chemical reactions based on quantum chemistry. Scientific Data 7, 1–8 (2020).
Article CAS Google Scholar
Grambow, C. A., Pattanaik, L. & Green, W. H. Reactants, products, and transition states of elementary chemical reactions based on quantum chemistry, Zenodo https://doi.org/10.5281/zenodo.3715478 (2020).
Zimmerman, P. M. Single-ended transition state finding with the growing string method. J. Comput. Chem. 36, 601–611 (2015).
Article CAS PubMed Google Scholar
Ruddigkeit, L., Van Deursen, R., Blum, L. C. & Reymond, J.-L. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. Journal of Chemical Information and Modeling 52, 2864–2875 (2012).
Article CAS PubMed Google Scholar
Landrum, G. et al. RDKit: Open-Source Cheminformatics Software. https://www.rdkit.org (2006).
Riniker, S. & Landrum, G. A. Better informed distance geometry: using what we know to improve conformation generation. Journal of Chemical Information and Modeling 55, 2562–2574 (2015).
Article CAS PubMed Google Scholar
Shao, Y. et al. Advances in molecular quantum chemistry contained in the Q-Chem 4 program package. Molecular Physics 113, 184–215 (2015).
Article ADS CAS Google Scholar
Grimme, S., Ehrlich, S. & Goerigk, L. Effect of the damping function in dispersion corrected density functional theory. Journal of Computational Chemistry 32, 1456–1465 (2011).
Article CAS PubMed Google Scholar
Lin, Y.-S., Li, G.-D., Mao, S.-P. & Chai, J.-D. Long-range corrected hybrid density functionals with improved dispersion corrections. Journal of Chemical Theory and Computation 9, 263–272 (2013).
Article CAS PubMed Google Scholar
Weininger, D. SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences 28, 31–36 (1988).
Article CAS Google Scholar
Bischoff, F. A., Wolfsegger, S., Tew, D. P. & Klopper, W. Assessment of basis sets for F12 explicitly-correlated molecular electronic-structure methodfs. Molecular Physics 107, 963–975 (2009).
Article ADS CAS Google Scholar
Knizia, G., Adler, T. B. & Werner, H.-J. Simplified CCSD(T)-F12 methods: Theory and benchmarks. Journal of Chemical Physics 130, 054104 (2009).
Article ADS PubMed CAS Google Scholar
Adler, T. B., Knizia, G. & Werner, H.-J. A simple and efficient CCSD(T)-F12 approximation. Journal of Chemical Physics 127, 221106 (2007).
Article ADS PubMed CAS Google Scholar
Pfeiffer, F., Rauhut, G., Feller, D. & Peterson, K. A. Anharmonic zero point vibrational energies: Tipping the scales in accurate thermochemistry calculations? Journal of Chemical Physics 138, 044311 (2013).
Article ADS PubMed CAS Google Scholar
O’Boyle, N. M. et al. Open Babel: An open chemical toolbox. Journal of Cheminformatics 3, 1–14 (2011).
CAS Google Scholar
Mardirossian, N. & Head-Gordon, M. Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals. Molecular Physics 115, 2315–2372 (2017).
Article ADS CAS Google Scholar
Tajti, A. et al. Heat: High accuracy extrapolated ab initio thermochemistry. Journal of Chemical Physics 121, 11599–11613 (2004).
Article ADS CAS PubMed Google Scholar
Karton, A., Rabinovich, E., Martin, J. M. & Ruscic, B. W4 theory for computational thermochemistry: In pursuit of confident sub-kj/mol predictions. Journal of Chemical Physics 125, 144108 (2006).
Article ADS PubMed CAS Google Scholar
Shang, Y., Ning, H., Shi, J., Wang, H. & Luo, S.-N. Chemical kinetics of H-abstractions from dimethyl amine by H, CH3, OH, and HO2 radicals with multi-structural torsional anharmonicity. Physical Chemistry Chemical Physics 21, 12685–12696 (2019).
Article CAS PubMed Google Scholar
Feller, D., Peterson, K. A. & Dixon, D. A. A survey of factors contributing to accurate theoretical predictions of atomization energies and molecular structures. Journal of Chemical Physics 129, 204105 (2008).
Article ADS PubMed CAS Google Scholar
Narayanan, B., Redfern, P. C., Assary, R. S. & Curtiss, L. A. Accurate quantum chemical energies for 133000 organic molecules. Chemical Science 10, 7449–7455 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kim, H., Park, J. Y. & Choi, S. Energy refinement and analysis of structures in the QM9 database via a highly accurate quantum chemical method. Scientific Data 6, 1–8 (2019).
Article Google Scholar
Feller, D., Peterson, K. A. & Hill, J. G. Calibration study of the CCSD(T)-F12a/b methods for C2 and small hydrocarbons. Journal of Chemical Physics 133, 184102 (2010).
Article ADS PubMed CAS Google Scholar
Werner, H. et al. MOLPRO, version 2015.1, a package of ab initio programs. https://www.molpro.net (2015).
Alecu, I., Zheng, J., Zhao, Y. & Truhlar, D. G. Computational thermochemistry: scale factor databases and scale factors for vibrational frequencies obtained from electronic model chemistries. Journal of Chemical Theory and Computation 6, 2872–2887 (2010).
Article CAS PubMed Google Scholar
Johnson III, R. D. NIST computational chemistry comparison and benchmark database, NIST standard reference database number 101. http://cccbdb.nist.gov/ (2020).
Petersson, G. A. et al. Calibration and comparison of the Gaussian-2, complete basis set, and density functional methods for computational thermochemistry. Journal of Chemical Physics 109, 10570–10579 (1998).
Article ADS CAS Google Scholar
Ruscic, B. & Bross, D. Active thermochemical tables (ATcT) values based on ver. 1.122d of the thermochemical. https://atct.anl.gov/Thermochemical%20Data/version%201.122d/index.php (2018).
Spiekermann, KA., Pattanaik, L. & Green, WH. High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions version 1.0.1, Zenodo, https://doi.org/10.5281/zenodo.6618262 (2022).
ANSYS Inc. Chemkin-Pro. http://www.ansys.com/products/fluids/ansys-chemkin-pro, San Diego, CA (2017).
Pedley, J. Thermochemical data and structures of organic compounds, vol. 1 (CRC Press, 1994).
Ruscic, B. Uncertainty quantification in thermochemistry, benchmarking electronic structure computations, and active thermochemical tables. International Journal of Quantum Chemistry 114, 1097–1101 (2014).
Article CAS Google Scholar
Ruscic, B. & Bross, D. H. Thermochemistry. Computer Aided Chemical Engineering 45, 3–114 (2019).
Döntgen, M., Fenard, Y. & Heufer, K. A. Atomic partial charges as descriptors for barrier heights. Journal of Chemical Information and Modeling 60, 5928–5931 (2020).
Article PubMed CAS Google Scholar
Balabin, R. M. Tautomeric equilibrium and hydrogen shifts in tetrazole and triazoles: Focal-point analysis and ab initio limit. Journal of Chemical Physics 131, 154307 (2009).
Article ADS PubMed CAS Google Scholar
Pratt, L. M. et al. Aggregation of alkyllithiums in tetrahydrofuran. Journal of Organic Chemistry 72, 2962–2966 (2007).
Article CAS PubMed Google Scholar
Zhao, Y. & Truhlar, D. G. Computational characterization and modeling of buckyball tweezers: density functional study of concave–convex π π interactions. Physical Chemistry Chemical Physics 10, 2813–2818 (2008).
Article CAS PubMed Google Scholar
Ribeiro, R. F., Marenich, A. V., Cramer, C. J. & Truhlar, D. G. Use of solution-phase vibrational frequencies in continuum models for the free energy of solvation. Journal of Physical Chemistry B 115, 14556–14562 (2011).
Article CAS PubMed Google Scholar
Bao, J. L., Xing, L. & Truhlar, D. G. Dual-level method for estimating multistructural partition functions with torsional anharmonicity. Journal of Chemical Theory and Computation 13, 2511–2522 (2017).
Article CAS PubMed Google Scholar
Saito, K., Kakumoto, T. & Murakami, I. A study of the isomerization of acetonitrile at high temperatures. Chemical Physics Letters 110, 478–481 (1984).
Article ADS CAS Google Scholar
Spiekermann, K. A. & Pattanaik, L. reactants_products_ts_refined release version 1.0.0. Zenodo https://doi.org/10.5281/zenodo.5652085 (2022).
Grinberg Dana, A. et al. ARC - automated rate calculator, version 1.1.0. Zenodo https://doi.org/10.5281/zenodo.3356849 (2019).

Download references

Acknowledgements

We acknowledge financial support from the Gas Phase Chemical Physics Program of the U.S. Department of Energy (DOE), Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences, and Biosciences under Award number DE-SC0014901. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. DOE under Contract No. DE-AC02–05CH11231. We also acknowledge the MIT SuperCloud and Lincoln Laboratory Supercomputing Center for providing computing resources that have contributed to the research results reported within this paper. The authors thank Dr. Duminda Ranasinghe, Dr. Colin Grambow, and members of the Green research group at MIT for useful discussions.

Author information

Authors and Affiliations

Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA, 02139, USA
Kevin Spiekermann, Lagnajit Pattanaik & William H. Green

Authors

Kevin Spiekermann
View author publications
You can also search for this author in PubMed Google Scholar
Lagnajit Pattanaik
View author publications
You can also search for this author in PubMed Google Scholar
William H. Green
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors conceived the project. K.A.S. performed all quantum chemistry and Arkane calculations. K.A.S. and L.P. cleaned the SMILES. All authors contributed to the manuscript.

Corresponding author

Correspondence to William H. Green.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Spiekermann, K., Pattanaik, L. & Green, W.H. High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions. Sci Data 9, 417 (2022). https://doi.org/10.1038/s41597-022-01529-6

Download citation

Received: 18 April 2022
Accepted: 30 June 2022
Published: 18 July 2022
DOI: https://doi.org/10.1038/s41597-022-01529-6

This article is cited by

A computational framework for neural network-based variational Monte Carlo with Forward Laplacian
- Ruichen Li
- Haotian Ye
- Liwei Wang
Nature Machine Intelligence (2024)