Introduction

Machine learning force fields have recently emerged as powerful tools for modeling interatomic interactions. They can reach near-quantum accuracy while being orders of magnitude faster than ab initio methods1,2,3,4,5,6,7,8,9.

Recently, active learning schemes have been demonstrated for highly efficient data collection, where molecular dynamics (MD) is driven by the machine learning force field, and only configurations satisfying certain acquisition criteria6,10,11,12,13,14,15 are computed with accurate but expensive DFT calculations and added to the training set. Among these, the FLARE6 framework uses principled Gaussian process (GP) uncertainties to construct a Bayesian force field (BFF), i.e. a force field equipped with internal uncertainty quantification from Bayesian inference, enabling a fully autonomous active learning workflow.

The cost of prediction of conventional GP models scales linearly with the training set size, making them computationally expensive for large data sets. The sparse Gaussian process (SGP) approach selects a set of representative atomic environments from the full training set to build an approximate model, which can scale to larger training sets but still suffers from the linear scaling of the inference cost with the sparse set size. To address this issue, it was noticed that, for the particular structure of the squared exponential 2+3-body kernel, the mean prediction of a trained model can be mapped onto an equivalent low-dimensional parametric model6,16,17 without any loss of accuracy. It was subsequently shown that the evaluation of the variance can also be mapped onto a low-dimensional model, yielding a dramatically accelerated uncertainty-aware BFF18. While the 2-body and 3-body descriptions used in previous work6,18 approach the computational speed of classical empirical potentials, they are limited in accuracy. To systematically increase the descriptive power of that approach, the formalism can be extended to include higher body-order interactions. However, the inclusion of, e.g., 4-body interactions requires summing contributions from all quadruplets of atoms in each neighborhood, which adds significant computational expense. Therefore, in more recent work19 we used the atomic cluster expansion20 (ACE) to construct local structure descriptors that scale linearly with the number of neighbors, enabling efficient inclusion of higher body-order correlations. An inner product kernel is then constructed from the rotationally invariant ACE descriptors, forming the basis of a sparse GP regression model for energies, forces and stresses. Crucially, it was shown that, for inner-product kernels with high-dimensional many-body descriptors, the mean prediction can also be mapped exactly onto a constant-cost model via reorganization of the summation in the SGP mean calculation19. This allows for efficient evaluation of the model forces, energies and stresses, but does not address the cost of evaluating uncertainties. In this work, we present a method to map the variance of SGP models with inner product kernels. This advance enables large-scale uncertainty-aware MD and overcomes the linear-scaling issue of SGPs. Building on this approach, we achieve a significant acceleration of the Bayesian active learning (BAL) workflow and integrate it with the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS)21. Bayesian force fields are implemented within the LAMMPS MD engine, such that both forces and uncertainties for each atomic configuration are computed at a cost independent of the training set size.

As a demonstration of the accelerated autonomous workflow, we train an uncertainty-aware many-body BFF for silicon carbide (SiC) on several of its polymorphs and phases. SiC is a wide-gap semiconductor with diverse applications ranging from power electronics to nuclear physics and astronomy. The discovery of a large number of extrasolar planets22 has spurred wide-ranging studies of material compositions and processes under extreme conditions. In particular, SiC has been identified from the absorption spectroscopy of carbon-rich extrasolar planets23,24, which has motivated numerous experimental and computational studies of its high-temperature, high-pressure behavior. The phase transition of SiC from the zinc blende (3C) to the rock salt (RS) phase is observed at high pressure in experiments25,26,27,28,29,30,31 and ab initio calculations32,33,34,35,36,37,38,39. Empirical potentials such as Tersoff40,41, Vashishta42,43, MEAM44, and Gao-Weber45 have been developed and applied in large-scale simulations for different purposes. However, empirical analytical potentials are limited in descriptive complexity, and hence accuracy, and require intensive human effort to select training configurations and to fit parameters. Machine learning approaches have allowed highly over-parameterized or non-parametric models to be trained on a wide range of structures and phases7,46,47,48,49,50. Recently, neural network potentials were trained for SiC to study dielectric spectra51 and thermal transport properties52, but they do not capture high-pressure phase transitions.

In the present work, we deploy the accelerated autonomous BAL workflow to study the high-pressure phase transition of SiC, and demonstrate that the transition process can be captured by the uncertainty quantification of the BFF. The BFF is then used to perform large-scale MD simulations and to compute the vibrational and thermal transport properties of different phases. The FLARE BFF shows good agreement with ab initio calculations53 and experimental measurements, and significantly outperforms available empirical potentials in accuracy, while retaining comparable computational efficiency.

Results

Accelerated Bayesian active learning workflow

Directly using ab initio MD to generate a sufficiently diverse training set for machine learning force fields is expensive and time consuming, and may still miss higher-energy configurations important for rare transformation phenomena. Here, we develop an active learning workflow where MD is instead driven by the much faster surrogate FLARE many-body BFF. During the MD simulation, the model relies on its internal uncertainty quantification, calling DFT only when it encounters atomic configurations with uncertainty above a chosen threshold. Within this framework, a much smaller number of DFT calls is needed, which greatly reduces the training time and increases the efficiency of phase-space exploration.

In this work, we extend the formalism of efficient lossless mapping to include the uncertainty of ACE-based many-body SGP models. Specifically, we implement the mapped SGP variance to enable efficient uncertainty quantification, and achieve large-scale MD simulations by interfacing the Bayesian active learning algorithm with LAMMPS. The mappings of the forces and uncertainties remove the scaling of the SGP regression cost with the training set size, significantly accelerating the training process in comparison with the full SGP19.

We illustrate our active learning workflow in Fig. 1a. Starting from an SGP model with a small initial training set, we map both the predictive mean and variance onto quadratic models to obtain an efficient BFF (details are discussed in Methods). With the mapped SGP force field, the MD simulation runs in LAMMPS, and at each time step an uncertainty on the local energy is assigned to each atom in the configuration. The MD simulation is interrupted once any atom's uncertainty exceeds the threshold. DFT is then used to compute the energy, forces and stress of the high-uncertainty configuration. The training set is augmented with the newly acquired DFT data, the SGP model is retrained and mapped, and the MD simulation continues with the updated model.
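
The control flow of this loop is summarized in the Python sketch below. The three callables are hypothetical stand-ins for the corresponding FLARE, DFT, and LAMMPS calls, not the actual FLARE API:

```python
# Minimal sketch of the BAL control flow; all callables are hypothetical
# placeholders, not the FLARE API.

def bal_loop(md_until_uncertain, dft, retrain, model, max_updates):
    """Run uncertainty-driven active learning until MD finishes confidently.

    md_until_uncertain(model) -> high-uncertainty frame, or None if the
                                 trajectory completed below the threshold
    dft(frame)                -> reference energy, forces and stress
    retrain(model, frame, labels) -> updated, re-mapped Bayesian force field
    """
    for _ in range(max_updates):
        frame = md_until_uncertain(model)   # MD with on-the-fly uncertainty
        if frame is None:                   # no atom exceeded the threshold
            break
        model = retrain(model, frame, dft(frame))  # augment data, re-map
    return model
```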

Fig. 1: Workflow and performance.

a Bayesian active learning (BAL) workflow with LAMMPS. It closely follows our previous work19, with the key addition that the SGP uncertainties are now mapped and therefore much cheaper, and that the MD portion of the training can be run within LAMMPS. b Time profiling of the BAL workflow for a system of 72 atoms over 100,000 MD time steps. The SGP BAL workflow collected 14 frames of 72 atoms each, with 10 representative sparse environments selected per frame. The MD is greatly accelerated by the mapped forces and uncertainties compared to the sparse GP. "Others" includes time consumed outside of MD and DFT, e.g. adding training data to the SGP and optimizing hyperparameters.

To illustrate the acceleration of the active learning workflow with mapped forces and variances relative to the full SGP model, we deploy a single active learning run on the bulk 4H-SiC system and profile each part of the workflow in Fig. 1b. If the model is not mapped, the computational cost of MD with the SGP dominates the BAL procedure. When the mapped force field and variances are used, the computational cost is significantly reduced. The FLARE BFF achieves 0.76 CPU ms/step/atom in LAMMPS MD, including uncertainty quantification, which is comparable in speed to empirical interatomic potentials such as ReaxFF54. With mapped uncertainties, Fig. 1b shows that the DFT calculations become the dominant part, i.e. the computational time for BAL is determined by the number of DFT calls. We also note that the SGP model used for timing here has a small training set of 14 frames, and the cost of SGP prediction grows linearly with the sparse set size, whereas after mapping the prediction of model forces, energies, and stresses, and of their variances, is independent of the sparse set size. Therefore, the speedup over the SGP model becomes more pronounced as more training data are collected and more sparse representatives are added.

Bayesian force field for SiC high pressure phase transition

In this work, we demonstrate the accelerated BAL procedure by training a mapped uncertainty-aware Bayesian force field to describe the phase transition of SiC at high pressure. We set up compressive and decompressive MD simulations at temperatures of 300 K and 2000 K for simultaneous on-the-fly training, where each SGP model is initialized with an empty training set. The parameters of the on-the-fly training workflow are listed in Supplementary Table 1, and the complete training set information is in Supplementary Table 2.

The compressive MD starts with the 2H, 4H, 6H, and 3C polytypes that are stable at low pressure, and the pressure is increased by 30 GPa every 50 ps (a sketch of this protocol follows below). The decompressive MD starts with the RS phase at 200 GPa, and the pressure is decreased by 20 GPa every 50 ps. The training data are collected by the BAL workflow shown in Fig. 1. Fig. 2a shows the system volume and the relative uncertainty, i.e. the ratio between the uncertainty of the current frame and the average uncertainty of the training data set (see details in Supplementary Method 155). In the compressive (decompressive) MD, when the transition happens, the volume decreases (increases) rapidly and the uncertainty spikes, since the model has never seen the transition state or the RS (3C) phase before. The post-transition structure of the compressive (decompressive) run has 6-fold (4-fold) coordination corresponding to the RS (3C) phase, and the transition is observed at 300 GPa (0 GPa) at room temperature. The difference in transition pressure between the compressive and decompressive simulations is caused by nucleation-driven hysteresis. After the on-the-fly training is done, the training data from the compressive and decompressive MD are combined to train a master force field for SiC, such that the different phases at different pressures are all covered by our force field. In Supplementary Fig. 2 and Supplementary Table 355, we demonstrate the accuracy of our force field for cohesive energies and elastic constants in comparison with DFT.
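
For concreteness, the stepwise compression protocol can be driven through the LAMMPS Python module. This is a rough sketch: the pair_style/pair_coeff lines and file names for the mapped FLARE BFF are assumptions standing in for the actual FLARE-LAMMPS syntax, and the damping constants are illustrative:

```python
# Sketch of the compressive NPT ramp (+30 GPa every 50 ps) via the LAMMPS
# Python module. FLARE pair style and file names below are placeholders.
from lammps import lammps

lmp = lammps()
lmp.commands_list([
    "units metal",
    "read_data sic_3c.data",            # hypothetical 3C-SiC structure file
    "pair_style flare",                 # assumed mapped-BFF pair style
    "pair_coeff * * SiC_mapped.flare",  # assumed mapped coefficient file
    "timestep 0.001",                   # 1 fs in metal units
    "velocity all create 300.0 12345",
])

pressure_gpa = 0.0
for _ in range(10):
    p_bar = pressure_gpa * 1.0e4        # metal units: pressure in bars
    lmp.commands_list([
        f"fix ramp all npt temp 300.0 300.0 0.1 iso {p_bar} {p_bar} 1.0",
        "run 50000",                    # 50 ps per pressure step
        "unfix ramp",
    ])
    pressure_gpa += 30.0
```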

Fig. 2: Phase transition simulation with FLARE BFF and enthalpy calculations.

a A 5 ps segment of the full training trajectory in which the 3C-RS phase transition is captured during the compressive (decompressive) on-the-fly active learning: the volume decreases (increases), and the model uncertainty spikes at the transition state. The uncertainty threshold is shown as the blue dashed line. DFT is called, and new training data are added to the model, whenever the relative uncertainty exceeds the threshold. b Enthalpy difference predictions from DFT (PBE, PBEsol34), the FLARE BFF, and existing empirical potentials40,42,44,54,56 at pressures from 0 to 150 GPa. The crossing with the dotted zero line gives the transition pressure predicted by the enthalpy.

Using the mapped master force field, the phase transition pressure (at zero temperature) can be obtained from the enthalpy H = Etotal + PV of the different phases. At low pressure, the RS phase has higher enthalpy than 3C. As the pressure increases above 65 GPa, the enthalpy of the RS phase drops below that of 3C. As shown in Fig. 2b, empirical potentials such as ReaxFF54, MEAM44, Tersoff40 and EDIP56 produce qualitatively incorrect enthalpy curves, likely because they are trained only at low pressures. The Vashishta potential is the only one trained on the high-pressure 3C-RS phase transition42 and qualitatively reproduces the scaling of enthalpy with pressure, but it significantly overestimates the transition pressure (90 GPa) compared to DFT. The FLARE BFF achieves good agreement with the ab initio (DFT-PBE) enthalpy predictions, with both methods yielding 65 GPa. We note that our PBE value is consistent with previous first-principles results of 66.6 GPa (LDA)32 and 58 GPa (PBEsol)34. A sketch of this enthalpy-crossing analysis is given below.
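
The crossing can be located numerically from the enthalpy difference ΔH(P) = H_RS − H_3C evaluated on a common pressure grid. A minimal sketch, assuming the per-phase enthalpies have already been computed with the mapped force field:

```python
# Locate the zero crossing of the enthalpy difference H_RS - H_3C.
import numpy as np

def transition_pressure(pressures, h_3c, h_rs):
    """Linear interpolation of the first sign change of dH on the grid."""
    p = np.asarray(pressures, dtype=float)
    dh = np.asarray(h_rs, dtype=float) - np.asarray(h_3c, dtype=float)
    (idx,) = np.nonzero(np.diff(np.sign(dh)) != 0)   # bracketing intervals
    i = idx[0]
    # interpolate between the two points bracketing the sign change
    return p[i] - dh[i] * (p[i + 1] - p[i]) / (dh[i + 1] - dh[i])
```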

Next, we run a large-scale MD simulation with 1000 atoms for 500 ps at 300 K. The NPT ensemble is used, and the pressure is increased by 30 GPa every 50 ps. At 300 GPa, the phase transition from 4-fold to 6-fold coordination is observed, as shown in Fig. 3a. The radial distribution function of the final 6-fold coordinated structure is shown in Fig. 3b. The highest C-C, C-Si, and Si-Si peaks match the perfect RS structure, confirming that the final structure is in the rock salt phase.

Fig. 3: Large-scale simulation of phase transition and analysis.

a Large-scale MD with 1000 atoms for the 3C-RS phase transition after the training of FLARE BFF is finished. The transition is observed at 300 GPa at room temperature in MD. b The radial distribution function of the final structure after transition in the MD, compared with the perfect RS crystal lattice, indicating the final configuration is in the RS phase. c The fraction of simple cubic (RS) and cubic diamond (3C, zinc-blende) atomic environments from the phase boundary evolution at different pressures.

As in the smaller training simulations, nucleation-controlled hysteresis causes the transition pressure (300 GPa) to be much higher than in experimental measurements (50–150 GPa)25,26,28. To eliminate the nucleation barrier, we start our simulation from a configuration in which the 3C and RS phases coexist, separated by a phase boundary, and follow its evolution at different pressures. It is worth noting that such large-scale two-phase simulations are enabled by the efficient force field and would not be possible with DFT. To recognize simple cubic (RS) and cubic diamond (3C, zinc blende) environments, we use polyhedral template matching57 in OVITO58 (see the sketch after this paragraph). The time evolution of the fractions of simple cubic and cubic diamond atomic environments is shown in Fig. 3c. The phase boundary evolves quickly within 10 fs. At pressures of 160 GPa and above, the system evolves to the RS phase, while below 155 GPa it evolves to the 3C phase, indicating that the phase transition pressure lies between 155 and 160 GPa at 300 K.
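
The phase-fraction analysis can be scripted through OVITO's Python interface. A minimal sketch, assuming the two-phase trajectory is stored as a LAMMPS dump file (file name hypothetical); the SC and cubic-diamond structure types are disabled by default in PTM and must be enabled explicitly:

```python
# Per-frame fractions of simple cubic (RS) and cubic diamond (3C) atoms
# from polyhedral template matching in OVITO.
import numpy as np
from ovito.io import import_file
from ovito.modifiers import PolyhedralTemplateMatchingModifier as PTM

pipeline = import_file("dump.lammpstrj")               # hypothetical trajectory
ptm = PTM()
ptm.structures[PTM.Type.SC].enabled = True             # rock salt sublattice
ptm.structures[PTM.Type.CUBIC_DIAMOND].enabled = True  # zinc blende sublattice
pipeline.modifiers.append(ptm)

for frame in range(pipeline.source.num_frames):
    data = pipeline.compute(frame)
    types = np.asarray(data.particles["Structure Type"])
    n = len(types)
    frac_rs = np.count_nonzero(types == PTM.Type.SC) / n
    frac_zb = np.count_nonzero(types == PTM.Type.CUBIC_DIAMOND) / n
    print(frame, frac_rs, frac_zb)
```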

A number of experiments have measured the density-pressure relations of zinc blende (3C) and rock salt SiC at room temperature25,26,27,28,29. As shown in Fig. 4, the equation of state consists of two parallel density-pressure curves: the lower-density curve is associated with the zinc blende phase, and the other with the rock salt phase. The pressures and corresponding densities at room temperature are extracted from the MD trajectories, and the resulting equation of state is compared with the experimental measurements. The FLARE BFF agrees closely with the experimental measurements for both phases. Near the transition, few experimental data points are available and the measurements can be affected by the quality of the sample, but the FLARE BFF still shows good agreement with the measured data points.

Fig. 4: The density-pressure relation from experimental measurements25,26,27,28,29 and MD simulations from FLARE BFF.

The zinc blende and rock salt phases correspond to two distinct equation-of-state curves. Our FLARE BFF shows close agreement with experiments.

Vibrational and thermal transport properties

To validate that the FLARE BFF gives accurate predictions of thermal properties, we investigate the phonon dispersions and thermal conductivities of the different polytypes and phases of SiC. The phonon dispersions are computed using Phonopy59 (a sketch of the procedure is shown below). As shown in Fig. 5, the FLARE BFF produces phonon dispersions in close agreement with both DFT and experiments60,61,62,63 for the low-pressure polytypes as well as for the high-pressure rock salt phase at 200 GPa. While the FLARE BFF prediction shows minor discrepancies with DFT at the highest frequencies of the optical branches, the phonon density of states (DOS) agrees well. In particular, the FLARE BFF captures the peak at 23 THz corresponding to a number of degenerate optical branches. In Supplementary Fig. 3, we show that the optical branches can be improved by increasing the cutoff of the BFF. Comparison of the phonon DOS in Fig. 5 shows that existing empirical potentials are much less accurate than the FLARE BFF. We note that SiC crystals are polarized by atomic displacements, and the resulting macroscopic field induces an LO-TO splitting near the Γ point. The contribution from the polarization should be included through the non-analytical correction (NAC)59,64; first-principles calculations with NAC are discussed in ref. 65. However, since the FLARE BFF does not model charges or polarization, the NAC term is not considered, and our comparison is therefore made between the FLARE BFF and DFT without NAC. The lack of NAC accounts for the disagreement between the high-frequency optical branches of the 2H DFT phonons (Fig. 5) and experimental measurements near the Γ point.
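
The phonon calculation follows the standard finite-displacement procedure. A minimal Phonopy sketch, where calc_forces is a hypothetical user-supplied callable wrapping the mapped BFF (any calculator returning forces can be substituted), and the supercell size and displacement distance are illustrative:

```python
# Finite-displacement phonons with Phonopy: build displaced supercells,
# collect forces from the force field, produce 2nd-order force constants.
import numpy as np
from phonopy import Phonopy
from phonopy.structure.atoms import PhonopyAtoms

def phonon_frequencies(unitcell: PhonopyAtoms, calc_forces, q_point):
    """calc_forces(supercell) -> (n_atoms, 3) forces for a PhonopyAtoms cell."""
    phonon = Phonopy(unitcell, supercell_matrix=np.diag([4, 4, 4]))
    phonon.generate_displacements(distance=0.03)     # displaced supercells
    phonon.forces = [calc_forces(sc)
                     for sc in phonon.supercells_with_displacements]
    phonon.produce_force_constants()                 # 2nd-order force constants
    return phonon.get_frequencies(q_point)           # THz at the given q-point
```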

Fig. 5: Phonon dispersions and phonon density of states of different polytypes from experimental measurements60,61,62,63, FLARE BFF and DFT calculations.

Lower right: thermal conductivity of 3C-SiC at 0 GPa and of the RS phase at 200 GPa at temperatures from 100 to 1000 K, from DFT and FLARE BFF calculations; the experimental measurements69,70,71,72 are for 3C-SiC.

Having confirmed the accuracy of the 2nd-order force constants by the phonon dispersion calculations, we then compute the thermal conductivity within the Boltzmann transport equation (BTE) formalism. The 2nd- and 3rd-order force constants are computed using the Phono3py66 code, and then used in the Phoebe67 transport code to evaluate the thermal conductivity with the iterative BTE solver68. In Supplementary Fig. 455, we verify that the exclusion of the non-analytical correction does not have a significant effect on the thermal conductivity values. Fig. 5 presents the thermal conductivities of the zinc blende phase at 0 GPa and the rock salt phase at 200 GPa as functions of temperature. The FLARE BFF results are in good agreement with the DFT-derived thermal conductivity for both phases. The thermal conductivity of the zinc blende phase computed from DFT and the FLARE BFF also agrees well with experimental measurements69,70,71,72. For the high-pressure rock salt phase, to the best of our knowledge the thermal conductivity has not previously been computed or measured; our calculation therefore provides a prediction that awaits experimental verification.

Discussion

In this work we develop a BFF scheme that maps both the mean predictions and the uncertainties of SGP models for many-body interatomic force fields. The mapping procedure overcomes the linear-scaling issue of SGPs and retains near-quantum accuracy at a computational cost comparable to empirical interatomic potentials such as ReaxFF. The efficient uncertainty-aware BFF forms the basis of an accelerated autonomous BAL workflow that is coupled with the LAMMPS MD engine and enables large-scale parallel MD simulations. The key improvement over previous methods is the ability of the many-body model to calculate forces and uncertainties efficiently, at comparable computational cost.

As a demonstration of the ability of this method to capture and learn subtle interactions driving phase transformations on-the-fly, we use the BAL workflow to train a BFF for SiC on several common polytypes and phases. The zinc blende to rock salt transition is captured in both the active learning and the large-scale simulations, facilitated by the model uncertainty. The FLARE BFF is shown to agree well with DFT for the enthalpy prediction over a wide range of pressures. The BFF model is readily employed to perform large-scale MD of the phase boundary evolution, which allows the room-temperature transition pressure to be reliably identified as 155–160 GPa. The density-pressure relation predicted by the FLARE BFF agrees very well with experimental measurements. We also find close agreement for the phonon dispersions and thermal conductivities of several SiC phases, as compared with DFT calculations and experiments, outperforming existing empirical potentials.

The high-performance implementation of BFFs, combining accuracy with autonomous uncertainty-driven active learning, opens numerous possibilities to explicitly study dynamics and microscopic mechanisms of phase transformations and non-equilibrium properties such as thermal and ionic transport. The presented unified approach can be extended to a wide range of complex systems and phenomena, where interatomic interactions are difficult to capture with classical approaches while time- and length-scales are out of reach of first-principles computational methods. Uncertainty quantification in MD simulations allows for systematic monitoring of the model confidence and detection of rare and unanticipated phenomena, such as reactions or nucleation of phases. Such events are statistically unlikely to occur in smaller simulations and become increasingly likely and relevant as the simulation sizes increase, as may be needed to study complex and heterogeneous materials systems.

Methods

In this section, we use bold letters such as α and Λ to denote vectors and matrices, and indexed symbols such as αi and Λij to denote their components. In addition, we use F to represent all the collected configurations with all of their atomic environments (the "full" data set), and S for a subset of atomic environments selected from those configurations (the "sparse" data set).

Gaussian process regression

The local atomic environment ρi of an atom i consists of all the neighboring atoms within a cutoff radius, and is associated with a label yi that can be the force Fi on atom i or a local energy εi. Local energy labels are usually not available in practice, but total energies, stresses and atomic forces are. A kernel function k(ρi, ρj) quantifies the similarity between two atomic environments, which is also the covariance between their local energies. Gaussian process (GP) regression assumes a Gaussian joint distribution of all the training data {(ρi, yi)} and test data (ρ, y). The posterior distribution for the test data is also Gaussian, with mean and variance

$$\varepsilon(\rho) = \mathbf{k}_{\varepsilon F}\left(\mathbf{K}_{FF} + \Lambda\right)^{-1}\mathbf{y}$$
(1)
$$V(\rho) = k_{\varepsilon\varepsilon} - \mathbf{k}_{\varepsilon F}\left(\mathbf{K}_{FF} + \Lambda\right)^{-1}\mathbf{k}_{F\varepsilon}$$
(2)

Here, ε is the local energy of ρ; kεF is the kernel vector of covariances between the test data and all the training data, with elements k(ρ, ρi); kεε = k(ρ, ρ) is the kernel between the test data ρ and itself; KFF is the kernel matrix with elements k(ρi, ρj); Λ is a diagonal matrix describing the noise; and y is the vector of all training labels.
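
Eqs. (1) and (2) translate directly into dense linear algebra. A minimal NumPy sketch for a single test environment, with the kernel evaluations assumed precomputed:

```python
# Full GP posterior mean and variance for one test environment, Eqs. (1)-(2).
import numpy as np

def gp_posterior(k_eF, K_FF, Lam, y, k_ee):
    """k_eF: (N,) test-train covariances; K_FF: (N, N) train covariances;
    Lam: (N, N) diagonal noise matrix; y: (N,) labels; k_ee: prior variance."""
    # Solve (K_FF + Lam) x = [y, k_Fe] once for both predictions.
    A = np.linalg.solve(K_FF + Lam, np.stack([y, k_eF], axis=1))
    mean = k_eF @ A[:, 0]        # Eq. (1)
    var = k_ee - k_eF @ A[:, 1]  # Eq. (2)
    return mean, var
```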

Since the full GP evaluates the kernel between all configurations in the whole training set, it becomes computationally inefficient for large data sets. A sparse approximation is needed such that the computational cost is reduced while keeping the information as complete as possible. Following our recent work19, we consider the mean prediction from the Deterministic Training Conditional (DTC) approximation73, and the variance on local energies

$$\varepsilon(\rho) = \mathbf{k}_{\varepsilon S}\left(\mathbf{K}_{SF}\Lambda^{-1}\mathbf{K}_{FS} + \mathbf{K}_{SS}\right)^{-1}\mathbf{K}_{SF}\Lambda^{-1}\mathbf{y}$$
(3)
$$V(\rho) = k_{\varepsilon\varepsilon} - \mathbf{k}_{\varepsilon S}\mathbf{K}_{SS}^{-1}\mathbf{k}_{S\varepsilon}$$
(4)

where kεS is the kernel vector between the test data ρ and the sparse training set S, KSF is the kernel matrix between the sparse subset S and the complete training set F, and KSS is the kernel matrix between S and itself. The energy, forces and stress tensor of a test configuration are given by the mean prediction, and their corresponding uncertainties by the square root of the predictive variance.
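
The corresponding sketch for the DTC predictions of Eqs. (3) and (4), storing only the diagonal of Λ (shapes: M sparse environments, N training labels):

```python
# DTC sparse GP mean and variance for one test environment, Eqs. (3)-(4).
import numpy as np

def dtc_predict(k_eS, K_SF, K_SS, Lam_inv_diag, y, k_ee):
    """k_eS: (M,); K_SF: (M, N); K_SS: (M, M);
    Lam_inv_diag: (N,) inverse noise variances; y: (N,); k_ee: scalar."""
    Sigma = (K_SF * Lam_inv_diag) @ K_SF.T + K_SS            # (M, M)
    mean = k_eS @ np.linalg.solve(Sigma, K_SF @ (Lam_inv_diag * y))  # Eq. (3)
    var = k_ee - k_eS @ np.linalg.solve(K_SS, k_eS)                  # Eq. (4)
    return mean, var
```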

ACE descriptors with inner product kernel

In the FLARE SGP formalism19, we use the atomic cluster expansion20 (ACE) descriptors to represent the features of a local environment. The total energy is constructed from atomic clusters, represented by the expansion coefficients of an atomic density function in spherical harmonics. We refer the reader to refs. 19,20 for more details. To build the SGP model underlying the BFF, we use the inner product kernel defined as

$$k(\rho_1, \rho_2) = \sigma^2\left(\frac{\mathbf{d}_1\cdot\mathbf{d}_2}{d_1 d_2}\right)^{\xi}$$
(5)

where d1 and d2 are the ACE descriptors of environments ρ1 and ρ2, σ² is the signal variance, which is optimized by maximizing the log likelihood of the SGP, and ξ is the power of the inner product kernel, which is selected a priori. The kernel is normalized by the L2 norms of the descriptors. For simplicity of notation, and without loss of generality, we ignore the normalization below and only show the energy kernel without derivatives.
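
In code, Eq. (5) is simply a normalized dot product raised to the power ξ:

```python
# Normalized inner-product kernel of Eq. (5) between two descriptor vectors.
import numpy as np

def inner_product_kernel(d1, d2, sigma=1.0, xi=2):
    """k(rho1, rho2) = sigma^2 * (d1 . d2 / (|d1| |d2|))^xi."""
    cos = d1 @ d2 / (np.linalg.norm(d1) * np.linalg.norm(d2))
    return sigma**2 * cos**xi
```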

With inner product kernels, a highly efficient yet lossless evaluation is available via a reorganization of the summation in the mathematical expression19. Defining \({\boldsymbol{\alpha}} := \left(\mathbf{K}_{SF}\Lambda^{-1}\mathbf{K}_{FS}+\mathbf{K}_{SS}\right)^{-1}\mathbf{K}_{SF}\Lambda^{-1}\mathbf{y}\) and \(\tilde{\mathbf{d}}_i = \mathbf{d}_i/d_i\), the mean prediction (Eq. (3)) for test data ρi from the sparse training set S = {ρs} can be written as19

$$\begin{array}{ll}\varepsilon(\rho_i)&=\sigma^2\sum\limits_{t}\left(\tilde{\mathbf{d}}_i\cdot\tilde{\mathbf{d}}_t\right)^{\xi}\alpha_t\\ &=\sigma^2\sum\limits_{t,m_1,\ldots,m_\xi}\tilde{d}_{im_1}\tilde{d}_{tm_1}\cdots\tilde{d}_{im_\xi}\tilde{d}_{tm_\xi}\,\alpha_t\\ &=\sigma^2\sum\limits_{m_1,\ldots,m_\xi}\tilde{d}_{im_1}\cdots\tilde{d}_{im_\xi}\left(\sum\limits_{t}\tilde{d}_{tm_1}\cdots\tilde{d}_{tm_\xi}\,\alpha_t\right)\\ &=\sum\limits_{m_1,\ldots,m_\xi}\tilde{d}_{im_1}\cdots\tilde{d}_{im_\xi}\,\beta_{m_1,\ldots,m_\xi}.\end{array}$$
(6)

The β tensor, with components \(\beta_{m_1,\ldots,m_\xi} = \sigma^2\sum_t \tilde{d}_{tm_1}\cdots\tilde{d}_{tm_\xi}\,\alpha_t\), can be computed once from the training data descriptors. Thus, we store the β tensor and, during prediction, directly evaluate Eq. (6) without the need to sum over the sparse training points t.
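
For ξ = 2, β reduces to an n_d × n_d matrix and the mean prediction to a quadratic form. A minimal sketch, assuming the descriptor rows are already L2-normalized:

```python
# Exact mapping of the SGP mean for xi = 2 (Eq. (6)):
# beta = sigma^2 * sum_t alpha_t d_t d_t^T,  eps(rho_i) = d_i^T beta d_i.
import numpy as np

def build_beta(D_sparse, alpha, sigma):
    """D_sparse: (M, n_d) normalized sparse descriptors; alpha: (M,)."""
    return sigma**2 * np.einsum("t,tm,tn->mn", alpha, D_sparse, D_sparse)

def mapped_energy(d_tilde, beta):
    """Local energy from the quadratic model; cost independent of M."""
    return d_tilde @ beta @ d_tilde
```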

Crucially, the variance in Eq. (4) admits a reorganization similar to that of the mean prediction in Eq. (6):

$$V(\rho_i) = \sigma^2\left(\tilde{\mathbf{d}}_i\cdot\tilde{\mathbf{d}}_i\right)^{\xi} - \sigma^4\sum\limits_{s,t}\left(\tilde{\mathbf{d}}_i\cdot\tilde{\mathbf{d}}_s\right)^{\xi}\left(\mathbf{K}_{SS}^{-1}\right)_{st}\left(\tilde{\mathbf{d}}_t\cdot\tilde{\mathbf{d}}_i\right)^{\xi}$$
(7)
$$=\sum\limits_{\substack{m_1,\ldots,m_\xi\\ n_1,\ldots,n_\xi}}\left(\tilde{d}_{im_1}\cdots\tilde{d}_{im_\xi}\right)\gamma_{\substack{m_1,\ldots,m_\xi\\ n_1,\ldots,n_\xi}}\left(\tilde{d}_{in_1}\cdots\tilde{d}_{in_\xi}\right)$$
(8)

where γ is a tensor that can be computed and stored once the training data are collected, and then used during inference without any explicit summation over the sparse set.

The reorganization shows that SGP regression with an inner product kernel is essentially a polynomial model in the descriptors. Let \(n_d\) denote the descriptor dimension. For the reorganized mean prediction, both the size of β and the evaluation cost scale as \(O(n_d^{\xi})\); for the reorganized variance, both the size of γ and the evaluation cost scale as \(O(n_d^{2\xi})\). After the reorganization, both the mean and variance predictions become independent of the training set, eliminating the linear scaling with training set size. Here, we choose ξ = 2 for the mean prediction because (1) ξ = 2 yields a significant improvement in likelihood over ξ = 1, while the improvement for ξ > 2 is marginal19; and (2) higher powers require much more memory for the β tensor and make the evaluation of Eq. (6) considerably costlier than ξ = 2.

From Eq. (8), we see that the variance has twice the polynomial degree of the mean prediction. For example, when ξ = 2, the mean prediction is a quadratic model of the descriptors while the variance is a quartic model. For computational efficiency, in this work we use Vξ=1 for the variance prediction. Vξ=1 is not the exact variance of our mean prediction εξ=2, but an approximation using the same hyperparameters as Vξ=2. It is strongly correlated with the exact variance Vξ=2, as shown in Supplementary Fig. 155. In particular, the correlation coefficients between Vξ=1 and higher powers are close to 1.0, i.e. a nearly perfect linear relation, even with a training set of 200 frames. This indicates that uncertainty quantification with Vξ=1 distinguishes between configurations as well as Vξ=2 does: atomic environments with higher Vξ=2 uncertainties are also assigned higher Vξ=1 uncertainties. This justifies using Vξ=1 as a strongly correlated but much cheaper approximation of Vξ=2.
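
For ξ = 1, the mapped variance is likewise a quadratic form: since the descriptors are normalized, the first term of Eq. (7) is simply σ², and γ collapses to an n_d × n_d matrix. A minimal sketch, again assuming normalized descriptor rows:

```python
# Mapped variance for xi = 1 (Eqs. (7)-(8)): gamma is an n_d x n_d matrix,
# so evaluating V is a quadratic form independent of the sparse set size M.
import numpy as np

def build_gamma(D_sparse, sigma):
    """D_sparse: (M, n_d) normalized sparse descriptors."""
    K_SS = sigma**2 * D_sparse @ D_sparse.T         # xi = 1 sparse kernel
    # gamma = sigma^4 * D^T K_SS^{-1} D (a small jitter on K_SS may be
    # needed in practice if the sparse descriptors are nearly dependent)
    return sigma**4 * D_sparse.T @ np.linalg.solve(K_SS, D_sparse)

def mapped_variance(d_tilde, gamma, sigma):
    """V(rho) = sigma^2 - d^T gamma d, with |d_tilde| = 1."""
    return sigma**2 - d_tilde @ gamma @ d_tilde
```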

To summarize, we use εξ=2 and Vξ=1 for the mean and variance predictions, respectively, both of which are quadratic models of the descriptors. During on-the-fly active learning, every time the training set is updated with new DFT data, the β and γ tensors are recomputed from the training data descriptors and stored as coefficient files. In LAMMPS MD, the quadratic models are evaluated to predict the energy, forces, stress and uncertainties of each configuration.