Complex reaction processes in combustion unraveled by neural network-based molecular dynamics simulation

Zeng, Jinzhe; Cao, Liqun; Xu, Mingyuan; Zhu, Tong; Zhang, John Z. H.

doi:10.1038/s41467-020-19497-z

Download PDF

Article
Open access
Published: 11 November 2020

Complex reaction processes in combustion unraveled by neural network-based molecular dynamics simulation

Nature Communications volume 11, Article number: 5713 (2020) Cite this article

28k Accesses
114 Citations
15 Altmetric
Metrics details

Subjects

Abstract

Combustion is a complex chemical system which involves thousands of chemical reactions and generates hundreds of molecular species and radicals during the process. In this work, a neural network-based molecular dynamics (MD) simulation is carried out to simulate the benchmark combustion of methane. During MD simulation, detailed reaction processes leading to the creation of specific molecular species including various intermediate radicals and the products are intimately revealed and characterized. Overall, a total of 798 different chemical reactions were recorded and some new chemical reaction pathways were discovered. We believe that the present work heralds the dawn of a new era in which neural network-based reactive MD simulation can be practically applied to simulating important complex reaction systems at ab initio level, which provides atomic-level understanding of chemical reaction processes as well as discovery of new reaction pathways at an unprecedented level of detail beyond what laboratory experiments could accomplish.

A benchmark dataset for Hydrogen Combustion

Article Open access 17 May 2022

Molecular-scale modeling of light emission by combustion: An ab initio study

Article Open access 03 September 2019

Accurate evaluation of combustion enthalpy by ab-initio computations

Article Open access 06 April 2022

Introduction

Ever since learning to use fire, human beings have never stopped studying combustion. With increasingly serious concern on environmental pollution from combustion, understanding and mastering the combustion mechanisms is of great importance. Gaining fundamental insights into combustion processes can help us design more efficient engines and minimize the production of pollutants. A typical combustion may contain hundreds of chemical species and thousands of fundamental chemical reactions. In particular, combustion occurs at extreme physical conditions with high pressures and high temperatures up to several thousand degrees. Also, many elementary reactions in a combustion typically occur on sub picosecond time scale. These extreme physical conditions make it very difficult, if not impossible, to carry out real-time experimental study of combustion. Thus, most experimental investigations of chemical reaction mechanisms focus on individual reactions instead of the complex reaction processes occurring in a combustion. In the past decades, in slico experiments such as reactive molecular dynamics (MD) simulations have shown their values in providing molecular (atomic)-level insights into the mechanism of combustions. In a reactive MD simulation, the reaction condition can be easily controlled in the simulation and some supercritical conditions that are difficult to achieve in the experiment can also be handled. Compared with the traditional theoretical approaches such as transition sate theory and quantum collision theory that focuses on studying a single reaction, reactive MD simulation can construct the entire interwoven reaction network of a combustion system¹. The heart of the reactive MD simulation is the potential energy surface (PES), which describes the inter- and intra-molecular interactions for molecules. Currently, there are mainly two classes of methods that can be used to construct the PES of a given molecular system: the quantum mechanics (QM)-based methods and the empirical force fields. Quantum mechanics is undoubtedly more rigorous and accurate, and MD simulations based on it are known as ab initio MD simulation (AIMD)^2,3. Although the AIMD method in principle can simulate complex chemical reactions in real time, it is limited to relatively small systems and short simulation time (typically, dozens of picoseconds) due to exorbitant computational costs of on-the-fly ab initio calculation. With the rapid development of computer hardware and algorithms, especially the employment of graphic processing units (GPUs), some AIMD methods have recently begun to handle larger chemical systems⁴. But so far, it is still impractical to use AIMD to simulate large-scale complex reaction systems such as combustions. Over the past decades, many reactive force fields (or PESs) have been developed and successfully used for various reactive molecular systems^{5,6,7,8,9,10,11,12}. A comprehensive discussion of these reactive force fields can be found in refs. ^13,14. Among these force fields, the empirical ReaxFF was widely used in MD simulation of combustion systems due to its computational efficiency¹⁵, but its accuracy and reliability are of significant concern^16,17,18. The key points of developing a reaction force field are the choice of the functional form and the parameterization process, which are complicated and depend on human intervention.

Recently, more researchers are switching to seek the help of machine-learning (ML) methods. ML method, especially artificial neural networks (NN), provides the possibility to construct PESs with the accuracy of the QM method but with an efficiency comparable to that of force fields. Neural networks constitute a very flexible and unbiased class of mathematical functions, which in principle is able to approximate any real-valued function to arbitrary accuracy. Since Behler and Parrinello proposed the high-dimensional neural network approach^19,20, several methods have been developed to implement this approach and many different kind of NN PESs have been proposed for water, small organic molecules, and metalloid materials^{21,22,23,24,25}. For example, the sGDML^26,27,28, SchNet²⁹, PhysNet³⁰, and FCHL³¹ methods. NN potentials have also been employed to study the reaction mechanisms of chemical systems. By combining high-precision NN PESs and quantum collision theory, Zhang and Jiang’s group have studied a series of elementary reactions in the gas phase and on the surface^32,33,34,35. Liu and co-workers developed the LASP program to study the heterogeneous catalysis with NN PESs³⁶ and built stochastic surface walking (SSW)-NN to explore reaction pathways from glucose to 5-hydroxymethylfurfural³⁷. Brickel et al. also studied the nucleophilic substitution reaction [Cl–CH₃–Br]⁻ in water with NN potential³⁸.

In this report, we present an in silico simulation of methane combustion based on an NN potential derived by training a high-dimensional NN model from ab initio computed energies. To achieve high efficiency and accuracy, the DeePMD model was used^39,40,41. This NN PES can accurately predict the energy and atomic forces of reactants, products and reaction intermediates. Based on this model, a 1-ns reactive MD simulation was performed for a combustion system initially containing 100 methane and 200 oxygen molecules with a sub-femtosecond time resolution (Fig. 1). A complete reaction network of the methane combustion can be constructed from the MD trajectory. The simulation not only produced the main reaction pathways that are consistent with the experiment but also provided much more detailed insights about the combustion processes as will be described in the following.

**Fig. 1: Real-time dynamics of methane combustion.**

Results

Accuracy of the NN PES

The performance of the NN potential highly depends on the quality of the reference datasets. Although several databases, such as QM7⁴², QM9⁴³, ANI-1⁴⁴, and ANI-1x⁴⁵, are accessible, they mainly include organic molecules and are therefore not suitable for this work. Combustion of methane will generate many molecular fragments and a lot of them are free radicals⁴⁶. Therefore, we followed a workflow (details are listed in the “Methods” section) to construct the reference datasets for the combustion. Then the DeepPot-SE model⁴⁷ was used to train the NN PES based on the reference. The predictive power of the NN model is shown in Supplementary Table 1 and Supplementary Fig. 1. It is clear that the DFT energies can be accurately reproduced by the NN model. The mean absolute errors are only 0.04 and 0.14 eV/atom in the training set and the test set, respectively. As for the atomic forces, the predicted values of the NN model are also highly consistent with the calculated results of the DFT (Supplementary Fig. 1). The correlation coefficient is 0.999 and the MAE is 0.12 eV/Å. Considering that there are a large number of atomic and molecular collisions during the combustion process, and some atomic forces can be as high as dozens of eV/Å, the accuracy of the NN model is encouraging. To verify the energy conservation of the NN PES, we performed a reactive MD simulation under the NVE ensemble. The system is a periodic box containing 100 CH₄ molecules and 200 O₂ molecules (a total of 900 atoms) with a density of 0.25 g/cm³. As shown in Supplementary Fig. 2, the total energy is conserved in MD simulation.

The initial stage of combustion

A 1 ns reactive MD simulation was performed for methane combustion with the NN PES under the NVT ensemble. The system is also a periodic box containing 100 CH₄ molecules and 200 O₂ molecules (a total of 900 atoms) with a density of 0.25 g/cm³. The MD simulations were run with a time-step of 0.1 fs and the temperature was kept at 3000 K by using the Berendsen thermostat. We chose a relatively high density (and thus high pressure) and high temperature to enhance the collision probability and sampling efficiency, which is a widely used strategy in reactive MD simulations because the time scale of the simulation is much shorter than that of experiments. In fact, experiments usually do not use pure fuel for combustion, but rather mix the fuel into a relatively inert gas for safety. In future work, we will try to combine the NN potential and enhanced sampling algorithms to bring simulated conditions more realistic.

Figure 1b and Supplementary Fig. 3 show the time-dependent progression of the main molecular species during the MD simulation. After 1 ns, about 90 CH₄ and 150 O₂ are consumed and about 160 H₂O, 30 CO, and 50 CO₂ are produced. The potential energy of the system during the simulation is shown in Supplementary Fig. 4. Although the system has not reached equilibrium, the important ignition process has already done, which includes much richer reaction information. In order to describe the complicated reaction network in more detail, we divided the combustion process into three stages, namely the initial stage of the combustion, the production of intermediate species of formaldehyde and formyl radical, and the production of CO and CO₂.

The reaction network in the initial stage of the combustion is shown in Fig. 2a. The combustion of methane started with the abstraction of its hydrogen atom by O₂ to generate two radicals ·CH₃ and HOO· (R3). As is seen from Fig. 2b, this process started at about 32 ps and took about 0.2 ps to finish. During the simulation, other radicals such as ·OH, ·H, and HOO· also abstracted hydrogen atom from CH₄ to generate ·CH₃ radical. Among them, the ·OH radical is the main species who complete this work and generates water molecules (R1). The atomization of methane into ·H and ·CH₃ was also observed.

**Fig. 2: The initial stage of combustion.**

Many ·CH₃ radicals interact with the ·OH radicals to form methanol (R6) molecules. According to Fig. 2c, this process was also very quick. Some ·CH₃ interacted with O₂ and HOO· to form methyldioxidanyl (CH₃OO·, R4) and methyl-hydroperoxide (CH₃OOH, R5). Radicals such as ·OH can also abstract H atoms from ·CH₃ and produce :CH₂. Methanol can further react with ·OH and ·H to generate methoxy radicals (CH₃O·, R10, R11), H₂O and H₂. It can also react with ·H to generate ·CH₂OH and H₂ (R12). The CH₃O· can also be produced by the interaction between CH₃OO· or CH₃OOH with ·H (R8 and R9).

Production of formaldehyde and formyl radicals

Most methoxy radicals generated from the last step were converted to formaldehyde mainly through two reaction pathways (Fig. 3a). The first one is for methoxy radical to interact with ·OH to form formaldehyde and H₂O (R16). As shown in Fig. 3b, this process took about 0.3 ps. The other pathway is for methoxy radical to interact with ·H and generate formaldehyde and H₂ (R17). The ·CH₂OH radicals can also convert to formaldehyde by losing the hydrogen atom on its hydroxyl group (R14 and R15). If it loses one H atom on the methylene group, it can generate :CHOH radicals (R13). In addition, the :CH₂ radicals can interact with ·OH and form formaldehyde and the methylidyne radical (R18 and R19).

The formaldehydes were further converted into the formyl (·CHO) radicals. The main reaction pathways are hydrogen abstraction by ·O and ·OH. Figure 3c shows the trajectory of the reaction CH₂O + ·OH → ·CHO + H₂O. An ·OH radical approaches the rotating formaldehyde molecule and snatches an H atom to form a water molecule; the whole process takes about 0.4 ps. In addition, other species such as ·H, O₂, HOO·, and ·CH₃ also abstracted the hydrogen atom from formaldehyde to form formyl radicals. The R20 and R23 are two reactions that form formyl radicals without the participation of formaldehyde.

Production of CO and CO₂

Formyl radicals can convert to CO by losing hydrogen in two ways (Fig. 4a). Firstly, it can lose an H atom directly (R25). Figure 4b shows a real-time trajectory of this process. A formyl radical lost its H atom at about 405.79 ps, but this reaction was quickly reversed and the formyl radical was re-formed. After another 0.4 ps the reaction took place again to form CO. Secondly, ·OH can also abstract the H atom from the formyl radical and generate H₂O and CO (R26).

The formyl radical can combine with the ·OH radical to form formic acid (R24), which can further lose its H atom to form ·COOH (R27) or HCOO· (R30). These two species can convert to CO₂ through the reaction with ·OH or ·H (R29 and R31). The ·COOH radical can also interact with ·H and generate CO and H₂O (R28). Figure 4c shows the trajectory of reaction CO + ·OH → CO₂ + ·H (R32). At 815.32 ps, an ·OH radical started to approach a CO molecule, and at 815.38 ps, an intermediate COOH was formed. The COOH should be relatively inactive, it stably existed for about 0.1 ps, and finally lost an H atom and became CO₂.

Further analysis found that the above-mentioned 32 reactions have all been found by experiments, and the reaction networks constructed by them are also highly consistent with the main reaction networks found experimentally^48,49. We totally detected 505 molecular species and 798 reactions from the trajectory. Species such as ethane, ethylene, and acetylene can also be found in the experimental database. In all, 130 of the 798 reactions extracted from the MD trajectory were included in the widely accepted GRI_Mech experimental mechanism library⁴⁸. Some experimentally observed reactions were not observed in our simulation, mostly likely because the present simulation was performed at relatively high temperature.

In fact, discovering new reactions is an important advantage of the present approach. For methane oxidation, a system that has been extensively studied by experiments, NN-based reactive MD can still discover hundreds of chemical reactions that have not been experimentally reported. This demonstrates that reactive MD can be a powerful tool to study combustion reactions. Interestingly, we found a cyclopropene molecule in the trajectory, which has not been reported to our knowledge. As shown in Supplementary Fig. 5, at 634.09 ps, a CO molecule collided with a ·CH₃ radical and joined together. Then a CH₂CO molecule was formed through hydrogen loss. The CH₂CO was stable for about 200 ps and then combined with another ·CH₃ radical. Subsequent hydrogen loss led to the formation of a cycloprop-2-en-1-one molecule at 828.65 ps. After another 60 ps, the third ·CH₃ attacked the cycloprop-2-en-1-one molecule and kicked out the CO group to form the CH₃CCH₂ molecule at 889.50 ps. Through further internal reaction and hydrogen loss, it finally formed a cyclopropene molecule at 891.16 ps and remained stable throughout the rest of the simulation. The entire process took about 260 ps to complete. While it might be possible that finding cyclopropene in our simulation is a coincidence or driven by the relatively high temperature, it still illustrates the ability of reactive MD simulation to discover new molecules and new reactions.

Discussion

Accurate in silico MD simulation of combustion or other complex chemical reactions is one of the ultimate goals of computational chemistry. In this work, an artificial neural network potential model trained to ab initio data describes complex chemical reactions in methane combustion. This NN potential model is orders of magnitude faster than the conventional DFT calculation. Benefit from the high efficiency of the NN model and GPU acceleration, nanosecond-sale MD simulations for a chemical system containing 900 atoms was achieved in about 4 days or so on an NVIDIA Tesla P100 card. Detailed reaction mechanisms were extracted from the MD trajectory and the detected molecular species and reaction networks are in excellent agreement with experimental observation. In addition, many new reactions were found that were not included in the experimental database. Compared to laboratory experiments, in silico simulations can be performed under more extreme conditions, and any specific reaction of interest can be easily detected and tracked. In addition, MD simulation can achieve ultra-high time resolution. The time-step used in this work is 0.1 fs. With the improvement of algorithms and hardware, even resolutions in smaller time scale can be achieved.

Compared with the traditional prior knowledge-based theoretical approach, reactive MD simulation can explore complex reaction networks and discover new reactions and species without any prior knowledge of reactions. Actually, complex reactions cannot be well understood without considering the kinetics of the reaction network it belongs to. Since reactive MD simulation tracks all chemical reactions in real time, one can even deduce the rate constants for individual reactions from a single MD trajectory by statistical analysis. We extracted the ten most statistically significant reactions from the trajectory and calculated their rate constants based on the algorithms developed in previous studies^50,51. As shown in Supplementary Table 2, most of the rate constants agree well with the GRI_Mech data⁴⁸. The main source of error might come from the uncertainties of parameters in the Arrhenius formula and the completeness of sampling. Ideally, one should run many trajectories with different initial conditions to obtain truly statistically accurate results. However, although these rates may not be accurate enough to be used directly in kinetic modeling, they can be effective in contributing to a comprehensive understanding of the combustion reaction.

A practical issue to be pointed out is that although some algorithms were used in this study to minimize the size of the reference dataset, there are still 578,731 structures in the reference set. Although the DFT calculation is very efficient, such a large reference set is difficult to perform high-level post-Hartree−Fock calculations. In order to further minimize the size of the reference set while ensuring its completeness, new algorithms need to be developed to further enhance the efficiency of this approach. Recently, Zhang et al. developed the DP-GEN⁵² (Deep potential Generator) software platform, which can automatically construct the reference dataset and train the NN model. The concurrent learning algorithm employed by this platform can make the redundancy of the reference set as small as possible. We are trying to integrate the algorithms developed in this work into the DP-GEN platform.

In addition, it is worth to point that while combustion is usually thought to be dominated by free radical reactions, recent studies have begun to examine the role of electronically excited state species in combustion. For example, the additional introduction of plasma was found to be effective in promoting combustion in experiments⁵³. However, MD simulations involving excited states are highly nontrivial, and there are large uncertainties in ab initio quantum chemistry computation for treating excited states of large systems. Based on sophisticated empirical or machine-learning PESs, several recent works have achieved the excited-state MD simulation for model systems^{54,55,56,57,58,59,60,61,62}. For example, the O+O recombination reaction to form the ground and excited-state singlet O₂ molecules on amorphous solid water⁶⁰. Such strategy will be considered in our future studies.

Despite further improvement is needed, the current report heralds the dawn of a new era in which neural network-based reactive MD simulation can be practically applied to simulating complex reaction systems at the ab initio level, which provides atomic-level understanding of every reaction process at unprecedented level of details beyond what laboratory experiment can accomplish.

Methods

Reference dataset

In this study, a workflow was developed for making reference datasets (Fig. 5). The details of each module in the workflow are given below.

To increase the efficiency of dataset construction, reactive MD simulation with ReaxFF was used to sample an initial dataset. A model combustion system containing a lot of CH₄ and H₂ molecules was built by using the Amorphous Cell module in the Material Studio⁶³ software package. Then the LAMMPS⁶⁴ program was used to perform the MD simulation. The NVT ensemble was used and the temperature was set to 3000 K with the Berendsen thermostat. The ReaxFF parameter of Chenoweth et al. (CHO-2008 parameter set)⁶⁵ was employed. The Open Babel software⁶⁶ and the Depth-First Search algorithm⁶⁷ were used to detect species in every snapshot of the trajectory. Then, for each atom in each snapshot, we build a molecular cluster that contains this atom and species that within a specified cutoff centered on it. In this work, the cutoff was set to 5 Å.

The initial dataset contains about 22.5 million structures, which is too large to perform QM calculations for every molecular cluster it contains. Therefore, it is necessary to resample it to remove redundant structures while ensuring its completeness. To this end, we first classified the initial dataset into sub-datasets based on the chemical bond information of the central atom. For example, the central H atom can be classified into two different types: a single H atom (H0) and an H atom formed a single chemical bond with another atom (H1).

Further treatment is still needed for large sub-datasets. For a given large sub-dataset, we first expressed each molecular cluster it contains as a Coulomb matrix⁶⁸:

$${\mathbf{C}}_{{{ij}}} = \left\{ {\begin{array}{*{20}{c}} {\frac{1}{2}Z_i^{2.4},i = j} \\ {\frac{{Z_iZ_j}}{{\left| {{\mathbf{R}}_i {\,}-{\,} {\mathbf{R}}_j} \right|}},i {\,\,}\ne{\,\,} j} \end{array}}, \right.$$

(1)

where $Z_i$ and $Z_j$ are nuclear charges of atom $i$ and $j$, ${\mathbf{R}}_{{i}}$ and ${\mathbf{R}}_{{j}}$ are their Cartesian coordinates. The minimum image convention⁶⁹ was used to consider the periodic boundary condition. “Invisible atoms” were introduced to fix the dimension of the Coulomb matrix. These invisible atoms do not influence the physics of the molecule of interest and make the total number of atoms in the molecule sum to a constant. To lower the dimension of the dataset and keep as much structural information as possible, the Coulomb matrix was further represented by the eigen-spectrum, which is obtained by solving the eigenvalue problem ${\mathbf{Cv}} = {\lambda}{\mathbf{v}}$ under the constraint $\lambda _i \ge \lambda _{i + 1}$. The clustering algorithm Mini Batch KMeans⁷⁰ was then used to cluster the given sub-datasets into smaller clusters according to the eigen-spectrum. Then we randomly selected 10,000 structures from each cluster (If the cluster contains no more than 10,000 structures, then all of them were selected).

Large amplitude collisions and reactions in the combustion can produce a lot of unpredictable species and intermediates. To ensure the completeness of the reference dataset, an active learning approach⁷¹ was used. Four different NN PES models were trained based on the dataset from the last step. Then several short MD simulations were performed based on these NN models. During the simulation, the atomic forces are evaluated by these four NN PES models simultaneously. For a specific atom, if the predicted forces by these four models are consistent with each other, then the molecular cluster that centered on this atom should be found in the dataset. On the contrary, if the results of these four models are inconsistent with each other and the error between them is in a specific range (0.5 eV/Å < error < 1.0 eV/Å in this work), the corresponding molecular cluster will be added into the dataset. The update of the dataset will be continued until the predictions of the four models are always consistent.

QM calculation

The potential energy and atomic forces for every structure in the final dataset were calculated by Gaussian 16⁷² software at the MN15/6-31G** level. The MN15 functional was employed because it has broad accuracy for multi-reference and single-reference systems⁷³. To consider the spin polarization effect, the initial wave function of a given structure is obtained by the combination of the wave functions of individual molecular species forming the structure, while the wave function of each molecular species was calculated based on its own charge and spin.

Training of the NN PES

The scheme of the NN model is shown in Fig. 6. The total energy E of a given structure is decomposed into a sum of atomic energy contributions^19,74, i.e., $E = \mathop {\sum }\nolimits_i E_i$, where i is the index of the atom. Each atomic energy is fully determined by the position of the ith atom and its near neighbors. To guarantee the translational, rotational, and permutational symmetries lying in the PES, the Cartesian coordinates of atomics are mapped to specific mathematical formulas called “descriptors” of the atomic chemical environment.

The DeepPot-SE (Deep Potential-Smooth Edition) model⁴⁷ was used to train the NN potential by the DeePMD-kit program⁷⁴. Details of this method can be found in ref. ⁶⁷. The model includes two networks: the embedding network and the fitting network. Both networks use the ResNet architecture⁷⁵. The size of the embedding network was set to (25, 50, 100) and the size of the embedding matrix was set to 12. The size of the fitting network is set to (240, 240, 240). The cutoff radius was set to 6.0 Å and the descriptors decay smoothly from 1.0 to 6.0 Å. The initial learning rate was set to 0.0005 and it will decay every 20,000 steps. The loss is defined by

$${\cal{L}} = \frac{{p_e}}{N}{\Delta}E^2 + \frac{{p_f}}{{3N}}\mathop {\sum }\limits_i |{\Delta}{\mathbf{F}}_{{i}}|^2,$$

(2)

where ${\Delta}E$ and ${\Delta}{\mathbf{F}}_{{i}}$ are root mean square errors in energy and force. The prefactor $p_e$ is set to 0.2 eV⁻² and the $p_f$ decays from 1000 Å² eV⁻² to 1 Å² eV⁻².

Data availability

The datasets (structures, potential energies and atomic forces of molecular species) generated during the current study are available at https://github.com/tongzhugroup/NNREAX, https://doi.org/10.6084/m9.figshare.12973055. Source data are provided with this paper.

Code availability

The codes used to generate the datasets in the current study are available at https://github.com/tongzhugroup/mddatasetbuilder, https://doi.org/10.5281/zenodo.4035925.

References

Martinez, T. J. Ab initio reactive computer aided molecular design. Acc. Chem. Res. 50, 652–656 (2017).
Article CAS Google Scholar
Car, R. & Parrinello, M. Unified approach for molecular-dynamics and density-functional theory. Phys. Rev. Lett. 55, 2471–2474 (1985).
Article ADS CAS Google Scholar
Tuckerman, M. E. Ab initiomolecular dynamics: basic concepts, current trends and novel applications. J. Phys. Condens. Matter 14, R1297–R1355 (2002).
Article ADS CAS Google Scholar
Wang, L.-P. et al. Discovering chemistry with an ab initio nanoreactor. Nat. Chem. 6, 1044 (2014).
Article CAS Google Scholar
Van Duin, A. C., Dasgupta, S., Lorant, F. & Goddard, W. A. ReaxFF: a reactive force field for hydrocarbons. J. Phys. Chem. A 105, 9396–9409 (2001).
Article CAS Google Scholar
Brenner, D. W. et al. A second-generation reactive empirical bond order (REBO) potential energy expression for hydrocarbons. J. Phys. Condens. Matter 14, 783 (2002).
Article ADS CAS Google Scholar
Nouranian, S., Tschopp, M. A., Gwaltney, S. R., Baskes, M. I. & Horstemeyer, M. F. An interatomic potential for saturated hydrocarbons based on the modified embedded-atom method. Phys. Chem. Chem. Phys. 16, 6233–6249 (2014).
Article CAS Google Scholar
Qu, C., Yu, Q. & Bowman, J. M. Permutationally invariant potential energy surfaces. Annu. Rev. Phys. Chem. 69, 151–175 (2018).
Article ADS CAS Google Scholar
Li, J. & Guo, H. Permutationally invariant fitting of intermolecular potential energy surfaces: a case study of the Ne-C2H2 system. J. Chem. Phys. 143, 214304 (2015).
Article ADS CAS Google Scholar
Braams, B. J. & Bowman, J. M. Permutationally invariant potential energy surfaces in high dimensionality. Int. Rev. Phys. Chem. 28, 577–606 (2009).
Article CAS Google Scholar
Nagy, T., Yosa Reyes, J. & Meuwly, M. Multisurface adiabatic reactive molecular dynamics. J. Chem. Theory Comput. 10, 1366–1375 (2014).
Article CAS Google Scholar
Warshel, A. & Florián, J. in Encyclopedia of Computational Chemistry (John Wiley and Sons, 2002).
Meuwly, M. Reactive molecular dynamics: from small molecules to proteins. Wires Comput. Mol. Sci. 9, e1386 (2019).
Article CAS Google Scholar
Koner, D., Salehi, S. M., Mondal, P. & Meuwly, M. Non-conventional force fields for applications in spectroscopy and chemical reaction dynamics. J. Chem. Phys. 153, 010901 (2020).
Article ADS CAS Google Scholar
Zheng, M. et al. Pyrolysis of liulin coal simulated by GPU-based ReaxFF MD with cheminformatics analysis. Energy Fuels 28, 522–534 (2014).
Article ADS CAS Google Scholar
Wang, E., Ding, J., Qu, Z. & Han, K. Development of a reactive force field for hydrocarbons and application to iso-octane thermal decomposition. Energy Fuels 32, 901–907 (2017).
Article CAS Google Scholar
Cheng, T., Jaramillo-Botero, A., Goddard, W. A. & Sun, H. Adaptive accelerated ReaxFF reactive dynamics with validation from simulating hydrogen combustion. J. Am. Chem. Soc. 136, 9434–9442 (2014).
Article CAS Google Scholar
Bertels, L. W., Newcomb, L. B., Alaghemandi, M., Green, J. R. & Head-Gordon, M. Benchmarking the performance of the ReaxFF reactive force field on hydrogen combustion systems. J. Phys. Chem. A 124, 5631–5645 (2020).
Article CAS Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article ADS CAS Google Scholar
Behler, J. First principles neural network potentials for reactive simulations of large molecular and condensed systems. Angew. Chem. Int. 56, 12828–12840 (2017).
Article CAS Google Scholar
Smith, J. S., Isayev, O. & Roitberg, A. E. ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chem. Sci. 8, 3192–3203 (2017).
Article CAS Google Scholar
Yao, K., Herr, J. E., Toth, D. W., Mckintyre, R. & Parkhill, J. The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics. Chem. Sci. 9, 2261–2269 (2018).
Article CAS Google Scholar
Lee, K., Yoo, D., Jeong, W. & Han, S. SIMPLE-NN: an efficient package for training and executing neural-network interatomic potentials. Comput. Phys. Commun. 242, 95–103 (2019).
Article ADS CAS Google Scholar
Chen, X., Jørgensen, M. S., Li, J. & Hammer, B. Atomic energies from a convolutional neural network. J. Chem. Theory Comput. 14, 3933–3942 (2018).
Article CAS Google Scholar
Zhang, Y., Hu, C. & Jiang, B. Embedded atom neural network potentials: efficient and accurate machine learning with a physically inspired representation. J. Phys. Chem. Lett. 10, 4962–4967 (2019).
Article CAS Google Scholar
Chmiela, S. et al. Machine learning of accurate energy-conserving molecular force fields. Sci. Adv. 3, e1603015 (2017).
Article ADS CAS Google Scholar
Schutt, K. T., Arbabzadah, F., Chmiela, S., Muller, K. R. & Tkatchenko, A. Quantum-chemical insights from deep tensor neural networks. Nat. Commun. 8, 13890 (2017).
Article ADS CAS Google Scholar
Sauceda, H. E., Chmiela, S., Poltavsky, I., Muller, K. R. & Tkatchenko, A. Molecular force fields with gradient-domain machine learning: construction and application to dynamics of small molecules with coupled cluster forces. J. Chem. Phys. 150, 114102 (2019).
Article ADS CAS Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. SchNet—a deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
Article ADS CAS Google Scholar
Unke, O. T. & Meuwly, M. PhysNet: a neural network for predicting energies, forces, dipole moments, and partial charges. J. Chem. Theory Comput. 15, 3678–3693 (2019).
Article CAS Google Scholar
Christensen, A. S., Bratholm, L. A., Faber, F. A. & Anatole von Lilienfeld, O. FCHL revisited: faster and more accurate quantum machine learning. J. Chem. Phys. 152, 044107 (2020).
Article ADS CAS Google Scholar
Lu, X., Meng, Q., Wang, X., Fu, B. & Zhang, D. H. Rate coefficients of the H+ H2O2→ H2+ HO₂ reaction on an accurate fundamental invariant-neural network potential energy surface. J. Chem. Phys. 149, 174303 (2018).
Article ADS CAS Google Scholar
Yin, Z., Guan, Y., Fu, B. & Zhang, D. H. Two-state diabatic potential energy surfaces of ClH 2 based on nonadiabatic couplings with neural networks. Phys. Chem. Chem. Phys. 21, 20372–20383 (2019).
Article CAS Google Scholar
Zhang, Y., Zhou, X. & Jiang, B. Bridging the gap between direct dynamics and globally accurate reactive potential energy surfaces using neural networks. J. Phys. Chem. Lett. 10, 1185–1191 (2019).
Article CAS Google Scholar
Chen, J., Xu, X., Xu, X. & Zhang, D. H. Communication: An accurate global potential energy surface for the OH plus CO -> H + CO₂ reaction using neural networks. J. Chem. Phys. 138, 221104 (2013).
Article ADS CAS Google Scholar
Huang, S. D., Shang, C., Kang, P. L., Zhang, X. J. & Liu, Z. P. LASP: fast global potential energy surface exploration. Wiley Interdisci. Rev. Comput. Mol 9, e1415 (2019).
CAS Google Scholar
Kang, P. L., Shang, C. & Liuo, Z. P. Glucose to 5-hydroxymethylfurfural: origin of site-selectivity resolved by machine learning based reaction sampling. J. Am. Chem. Soc. 141, 20525–20536 (2019).
Article CAS Google Scholar
Brickel, S., Das, A. K., Unke, O. T., Turan, H. T. & Meuwly, M. Reactive molecular dynamics for the [Cl–CH3–Br]− reaction in the gas phase and in solution: a comparative study using empirical and neural network force fields. Electron. Struct. 1, 024002 (2019).
Article ADS CAS Google Scholar
Zhang, L., Han, J., Wang, H., Car, R. & Weinan, E. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
Article ADS CAS Google Scholar
Han, J. Q., Zhang, L. F., Car, R. & Weinan, E. Deep potential: a general representation of a many-body potential energy surface. Commun. Comput. Phys. 23, 629–639 (2018).
Article MathSciNet Google Scholar
Jia, W. et al. Pushing the limit of molecular dynamics with ab initio accuracy to 100 million atoms with machine learning. Preprint at https://arxiv.org/abs/2005.00223 (2020).
Blum, L. C. & Reymond, J.-L. 970 million druglike small molecules for virtual screening in the chemical universe database GDB-13. J. Am. Chem. Soc. 131, 8732–8733 (2009).
Article CAS Google Scholar
Ruddigkeit, L., Van Deursen, R., Blum, L. C. & Reymond, J.-L. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. J. Chem. Inf. Model. 52, 2864–2875 (2012).
Article CAS Google Scholar
Smith, J. S., Isayev, O. & Roitberg, A. E. ANI-1, a data set of 20 million calculated off-equilibrium conformations for organic molecules. Sci. Data 4, 170193 (2017).
Article CAS Google Scholar
Smith, J. S. et al. The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules. Sci. Data 7, 134 (2020).
Article CAS Google Scholar
He, Z., Li, X.-B., Liu, L.-M. & Zhu, W. The intrinsic mechanism of methane oxidation under explosion condition: a combined ReaxFF and DFT study. Fuel 124, 85–90 (2014).
Article CAS Google Scholar
Zhang, L. et al. End-to-end symmetry preserving inter-atomic potential energy model for finite and extended systems. In: Bengio, S. et al. (eds) Advances in Neural Information Processing Systems 31, 4436–4446 (Curran Associates Inc, 2018).
Smithy, G. P. et al. GRI_Mech 30. http://combustion.berkeley.edu/gri-mech/ (1999).
Reid, I. A. B., Robinson, C. & Smith, D. B. Spontaneous ignition of methane: Measurement and chemical model. Symp. Int. Combust. Proc. 20, 1833–1843 (1985).
Article Google Scholar
Wu, Y. Z., Sun, H., Wu, L. & Deetz, J. D. Extracting the mechanisms and kinetic models of complex reactions from atomistic simulation data. J. Comput. Chem. 40, 1586–1592 (2019).
Article CAS Google Scholar
Dontgen, M. et al. Automated discovery of reaction pathways, rate constants, and transition states using reactive molecular dynamics simulations. J. Chem. Theory Comput. 11, 2517–2524 (2015).
Article CAS Google Scholar
Zhang, Y. et al. DP-GEN: a concurrent learning platform for the generation of reliable deep learning based potential energy models. Comput. Phys. Commun. 253, 107206 (2020).
Article MathSciNet CAS Google Scholar
Ju, Y. & Sun, W. Plasma assisted combustion: dynamics and chemistry. Prog. Energy Combust. Sci. 48, 21–83 (2015).
Article Google Scholar
Chen, W.-K., Liu, X.-Y., Fang, W.-H., Dral, P. O. & Cui, G. Deep learning for nonadiabatic excited-state dynamics. J. Phys. Chem. Lett. 9, 6702–6708 (2018).
Article CAS Google Scholar
Hu, D., Xie, Y., Li, X., Li, L. & Lan, Z. Inclusion of machine learning kernel ridge regression potential energy surfaces in on-the-fly nonadiabatic molecular dynamics simulation. J. Phys. Chem. Lett. 9, 2725–2732 (2018).
Article CAS Google Scholar
Westermayr, J. et al. Machine learning enables long time scale molecular photodynamics simulations. Chem. Sci. 10, 8100–8107 (2019).
Article CAS Google Scholar
Westermayr, J., Faber, F. A., Christensen, A. S., von Lilienfeld, O. A. & Marquetand, P. Neural networks and kernel ridge regression for excited states dynamics of CH2NH2+: from single-state to multi-state representations and multi-property machine learning models. Mach. Learn.: Sci. Technol. 1, 025009 (2020).
Article Google Scholar
Borges, Y. G., Galvão, B. R. L., Mota, V. C. & Varandas, A. J. C. A trajectory surface hopping study of N2A3Σu+ quenching by H atoms. Chem. Phys. Lett. 729, 61–64 (2019).
Article ADS CAS Google Scholar
Schinke, R., Grebenshchikov, S. Y., Ivanov, M. V. & Fleurat-Lessard, P. Dynamical studies of the ozone isotope effect: a status report. Annu. Rev. Phys. Chem. 57, 625–661 (2006).
Article ADS CAS Google Scholar
Pezzella, M., Koner, D. & Meuwly, M. Formation and stabilization of ground and excited-state singlet O2 upon recombination of (3)P oxygen on amorphous solid water. J. Phys. Chem. Lett. 11, 2171–2176 (2020).
Article CAS Google Scholar
Koner, D., Bemish, R. J. & Meuwly, M. The C((3)P) + NO(X(2)Pi)–> O((3)P) + CN(X(2)Sigma(+)), N((2)D)/N((4)S) + CO(X(1)Sigma(+)) reaction: rates, branching ratios, and final states from 15 K to 20 000 K. J. Chem. Phys. 149, 094305 (2018).
Article ADS CAS Google Scholar
Koner, D., Unke, O. T., Boe, K., Bemish, R. J. & Meuwly, M. Exhaustive state-to-state cross sections for reactive molecular collisions from importance sampling simulation and a neural network representation. J. Chem. Phys. 150, 211101 (2019).
Article ADS CAS Google Scholar
BOVIA, Materials Studio 2017 https://www.3ds.com/products-services/biovia/resource-center/citations-and-references/ (Dassault Systèmes, San Diego, 2017).
Aktulga, H. M., Fogarty, J. C., Pandit, S. A. & Grama, A. Y. Parallel reactive molecular dynamics: numerical methods and algorithmic techniques. Parallel Comput. 38, 245–259 (2012).
Article Google Scholar
Chenoweth, K., Van Duin, A. C. & Goddard, W. A. ReaxFF reactive force field for molecular dynamics simulations of hydrocarbon oxidation. J. Phys. Chem. A 112, 1040–1053 (2008).
Article CAS Google Scholar
O’Boyle, N. M. et al. Open Babel: an open chemical toolbox. J. Cheminformatics 3, 33 (2011).
Article CAS Google Scholar
Tarjan, R. Depth-first search and linear graph algorithms. SIAM J. Comput. 1, 146–160 (1972).
Article MathSciNet MATH Google Scholar
Rupp, M., Tkatchenko, A., Müller, K.-R. & Von Lilienfeld, O. A. Fast and accurate modeling of molecular atomization energies with machine learning. Phys. Rev. Lett. 108, 058301 (2012).
Article ADS CAS Google Scholar
Hloucha, M. & Deiters, U. Fast coding of the minimum image convention. MoSim 20, 239–244 (1998).
CAS Google Scholar
Sculley, D. Web-scale k-means clustering. In: Rappa, M. et al. (eds) Proc. 19th International Conference on World Wide Web (ACM, 2010).
Zhang, L., Lin, D.-Y., Wang, H., Car, R. & Weinan, E. Active learning of uniformly accurate interatomic potentials for materials simulation. Phys. Rev. Mat. 3, 023804 (2019).
CAS Google Scholar
Frisch, M. et al. Gaussian 16, revision A. 03 (Gaussian Inc, Wallingford CT, 2016).
Haoyu, S. Y., He, X., Li, S. L. & Truhlar, D. G. MN15: A Kohn–Sham global-hybrid exchange–correlation density functional with broad accuracy for multi-reference and single-reference systems and noncovalent interactions. Chem. Sci. 7, 5032–5051 (2016).
Article CAS Google Scholar
Wang, H., Zhang, L., Han, J. & Weinan, E. DeePMD-kit: a deep learning package for many-body potential energy representation and molecular dynamics. Comput. Phys. Commun. 228, 178–184 (2018).
Article ADS CAS Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In: Tuytelaars, T. et al. (eds) Proc. IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2016).

Download references

Acknowledgements

The authors thank Dr. Linfeng Zhang and Dr. Han Wang for their discussion and help in using DeepPot-SE and DeePMD-kit. T.Z. would also like to thank Prof. Donghui Zhang for his valuable suggestions in this project. This work was supported by the National Key R&D Program of China (grant no. 2016YFA0501700), the National Natural Science Foundation of China (grant nos. 91641116, 91753103, and 21933010), and the Innovation Program of Shanghai Municipal Education Commission (201701070005E00020). J. Zeng was partially supported by the National Innovation and Entrepreneurship Training Program for Undergraduate (201910269080). We also thank the ECNU Multifunctional Platform for Innovation (No. 001) for providing supercomputer time.

Author information

Authors and Affiliations

Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China
Jinzhe Zeng, Liqun Cao, Mingyuan Xu, Tong Zhu & John Z. H. Zhang
NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China
Tong Zhu & John Z. H. Zhang
Department of Chemistry, New York University, New York, NY, 10003, USA
John Z. H. Zhang
Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi, 030006, China
John Z. H. Zhang

Authors

Jinzhe Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Liqun Cao
View author publications
You can also search for this author in PubMed Google Scholar
Mingyuan Xu
View author publications
You can also search for this author in PubMed Google Scholar
Tong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
John Z. H. Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Z. trained the neural network potential and performed most of the QM calculations. L.C. and M.X. analyzed the trajectory and performed part of the QM calculation. T.Z. and J.Z.H.Z. conceived the project and wrote the manuscript with input from all authors.

Corresponding authors

Correspondence to Tong Zhu or John Z. H. Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zeng, J., Cao, L., Xu, M. et al. Complex reaction processes in combustion unraveled by neural network-based molecular dynamics simulation. Nat Commun 11, 5713 (2020). https://doi.org/10.1038/s41467-020-19497-z

Download citation

Received: 10 March 2020
Accepted: 06 October 2020
Published: 11 November 2020
DOI: https://doi.org/10.1038/s41467-020-19497-z

This article is cited by

Deep-potential enabled multiscale simulation of gallium nitride devices on boron arsenide cooling substrates
- Jing Wu
- E Zhou
- Guangzhao Qin
Nature Communications (2024)
Active learning graph neural networks for partial charge prediction of metal-organic frameworks via dropout Monte Carlo
- Stephan Thaler
- Felix Mayr
- Julija Zavadlav
npj Computational Materials (2024)
Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential
- Shuhao Zhang
- Małgorzata Z. Makoś
- Justin S. Smith
Nature Chemistry (2024)
Machine learning-based prediction of mechanical properties of N-doped γ-graphdiyne
- Cun Zhang
- Bolin Yang
- Shaohua Chen
Science China Materials (2024)
First principles molecular dynamics simulation and thermal decomposition kinetics study of CL-20
- Jia Wu
- Jianbo Hu
- Zhirong Suo
Journal of Molecular Modeling (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.