Introduction

All-atom dynamics simulations enable the quantitative study of atomistic systems and their interactions in physics, chemistry, materials science, pharmaceutical sciences, and related areas. The simulations’ capabilities and limits depend on the potential used to calculate the forces acting on the atoms, with an inherent trade-off between the accuracy of the underlying physical model and the required computational effort: On the one hand, electronic-structure methods tend to be accurate and applicable to many systems with little human parameterization effort, but they are slow. On the other hand, traditional empirical potentials are fast but limited in accuracy and applicability, and often require substantial parameterization effort.

Machine-learning potentials (MLPs)1,2,3,4 are flexible functions fitted to reference energy and force data, e.g., from electronic structure methods. Their computational advantage does not primarily arise from simplified physical models but from avoiding redundant calculations through interpolation. Hence, they can stay close to the accuracy of the reference method while being orders of magnitude faster (see Fig. 1), with little human parameterization effort, but are often hard to interpret. However, current accurate MLPs are still orders of magnitude slower than traditional empirical potentials, limiting their use for dynamics simulations of large atomistic systems over long time scales.

Fig. 1: Trade-off between prediction error and computational cost of evaluating machine-learning potentials.

Prediction errors are relative to the underlying electronic-structure reference method (circle). Ultra-fast potentials (this work, crosses) with two-body (UF2) and three-body (UF2,3) terms are as fast as traditional empirical potentials (squares) but close in error to state-of-the-art machine-learning potentials (triangles, diamond). All potentials except EAM4 were refitted to the same tungsten data set. Computational costs were benchmarked with a 128-atom bcc-tungsten supercell. Trade-offs between accuracy and cost also arise from the choices of hyperparameters for each potential. See Sections I and III for abbreviations and details.

In this work, we develop an interpretable linear MLP based on effective two- and three-body potentials using a flexible cubic B-spline basis. Figure 1 demonstrates how this ultra-fast (UF) potential is close in error to state-of-the-art MLPs while being as fast as the fastest traditional empirical potentials, such as the Morse and Lennard-Jones potentials.

The many-body expansion5 of an atomistic system’s potential energy

$$E=E_{0}+\frac{1}{1!}\sum_{i}V_{1}(\mathbf{R}_{i},\sigma_{i})+\frac{1}{2!}\sum_{i,j}V_{2}(\mathbf{R}_{i},\sigma_{i},\mathbf{R}_{j},\sigma_{j})+\frac{1}{3!}\sum_{i,j,k}V_{3}(\mathbf{R}_{i},\sigma_{i},\mathbf{R}_{j},\sigma_{j},\mathbf{R}_{k},\sigma_{k})+\ldots$$
(1)

is a sum of N-body potentials \(V_N\) that depend on atom positions \(\mathbf{R}_i\) and element species \(\sigma_i\), where the prefactors account for double-counting and \(V_N=0\) if two indices are the same. Assuming the transferability of the \(V_N\) across configurations provides the basis for empirical and machine-learning potentials.

In this study, we use elemental tungsten as an example and hence omit the species dependency as well as the reference energies \(E_0\) and \(\frac{1}{1!}\sum_i V_1(\mathbf{R}_i,\sigma_i)\), and where appropriate subsume the factorial prefactors into the potentials \(V_N\) for simplicity. The extension to multi-component systems is straightforward and implemented in the accompanying program code.

Traditional empirical potentials often have rigid functional forms with a small number of tunable parameters. These are optimized to reproduce experimental quantities, such as lattice parameters and elastic coefficients, as well as calculated quantities from first principles, such as energies of crystal structures, defects, and surfaces6,7,8. Typically this requires global optimization, e.g., via simulated annealing, and substantial human effort.

Pair potentials truncate Equation (1) after two-body terms,

$$E=\frac{1}{2}\sum_{i,j}V_{2}(r_{ij}),$$
(2)

where \(r_{ij}\) is the distance between atoms i and j. These potentials are limited to systems where higher-order terms, such as angular and dihedral interactions, are negligible. The functional forms of the Lennard–Jones (LJ)9 potential

$$V^{\mathrm{LJ}}(r_{ij})=4\epsilon\left[\left(\frac{\sigma}{r_{ij}}\right)^{12}-\left(\frac{\sigma}{r_{ij}}\right)^{6}\right]$$
(3)

and the Morse potential10

$$V^{\mathrm{Morse}}(r_{ij})=D_{0}\left(e^{-2a(r_{ij}-r_{c})}-2e^{-a(r_{ij}-r_{c})}\right)$$
(4)

were originally developed for their numerical efficiency, where \(\epsilon\), \(\sigma\) and \(D_0\), a, \(r_c\) are model parameters.

Many-body potentials extend the pair formalism by including additional many-body interactions, either in the form of many-body functions as in Equation (1), or via environment-dependent functionals, such as in the embedded atom method (EAM)11,

$$E^{\mathrm{EAM}}=\frac{1}{2}\sum_{i,j}V(r_{ij})+\sum_{i}F\left(\sum_{j\ne i}\rho(r_{ij})\right).$$
(5)

Here, the embedding energy F is a non-linear function of the electron density ρ, which is approximated by a pairwise sum.

While traditional empirical potentials have seen success in applications across decades of research, their rigid functional forms limit their accuracy. More recently, MLPs with flexible functional forms and built-in physics domain knowledge, in the form of engineered features2,12 or deep neural-network architectures, have emerged as an alternative13,14,15,16. State-of-the-art MLPs can simulate the dynamics of large (“high-dimensional”) atomistic systems with an accuracy close to the underlying electronic-structure reference method but orders of magnitude faster17,18. However, they are still orders of magnitude slower than fast traditional empirical potentials, limiting the accessible system sizes and simulation lengths.

Recent efforts to improve the speed and accuracy of MLPs include using linear regression models, which can be faster to train and evaluate than non-linear models19,20. The spectral neighbor analysis potential (SNAP)21 and its quadratic variant (qSNAP)22 are linear models based on the bispectrum representation23. Moment tensor potentials (MTP)24, performant atomic cluster expansion (PACE) potentials19,25, atomic permutationally-invariant polynomials (aPIP) potentials26, and Chebyshev interaction model for efficient simulation (ChIMES) potentials27 are linear models based on polynomial basis sets.

A complementary approach to improve speed is to use basis functions that are fast to evaluate. In the context of MLPs, non-linear kernel-based MLPs have been trained and subsequently projected onto a spline basis, yielding a linear model28. Similar to this work, the general two- and three-body potential approach employs a quadratic spline basis set, exceeding MTPs in speed when trained on the same data29. Polynomial symmetry functions improve over Behler–Parrinello symmetry functions30 in speed and accuracy by introducing compact support31. Methods for fitting spline-based modified EAM potentials were benchmarked against MLPs, demonstrating comparable accuracy despite the lower complexity of their functional forms32.

Motivated by these observations, we developed a UF MLP that combines the speed of the fastest traditional empirical potentials with an accuracy close to state-of-the-art MLPs by employing regularized linear regression with compactly supported spline basis functions to learn effective two- and three-body interactions. Figure 1 showcases the relation between prediction errors and computational costs for three traditional empirical potentials (LJ, Morse, EAM) and several MLPs benchmarked on a dataset of elemental tungsten33. While the traditional empirical potentials are fast but limited in accuracy, the MLPs are accurate but limited in speed. UF MLPs improve on the Pareto frontier of predictive accuracy and computational cost. They are available as an open-source Python implementation (UF3, Ultra-Fast Force Fields)34 with interfaces to the VASP35 electronic-structure code and the LAMMPS36,37 molecular dynamics code.

Results and discussion

The central idea of UF MLPs is to learn an effective low-order many-body expansion of the potential energy surface, using basis functions that are efficient to evaluate. For this, we truncate the many-body expansion of Eq. (1) at two- or three-body terms and express each term as a function of pairwise distances (one distance for the two-body and three distances for the three-body term). This approach is general and can be extended to higher-order terms. To minimize the computational cost of predictions, we represent N-body terms in a set of basis functions with compact support and sufficiently many derivatives to describe energies, forces, and vibrational modes, i.e., cubic splines.

Expansion in B-splines

Splines are piecewise polynomial functions with locally simple forms joined together at knot positions38. They are globally flexible and smooth but do not suffer from some of the oscillatory problems of polynomial interpolators (e.g., Runge’s phenomenon39).

Spline interpolation is well-established in empirical potential development40,41. The LAMMPS package36, a leading framework for molecular dynamics simulations, implements cubic spline interpolation for pair potentials and selected many-body potentials, including EAM. The primary motivations for splines are the desire for computational efficiency and the opportunity to improve accuracy systematically42. Their compact support and simple form make spline-based potentials the fastest choice to evaluate, and adding more knots increases their resolution.

B-splines constitute a basis for splines of arbitrary order. They are recursively defined as38

$$\begin{aligned}B_{n,1}(r)&=\begin{cases}1,& t_{n}\le r<t_{n+1}\\ 0,&\text{otherwise}\end{cases}\\ B_{n,d+1}(r)&=\frac{r-t_{n}}{t_{n+d}-t_{n}}B_{n,d}(r)+\frac{t_{n+d+1}-r}{t_{n+d+1}-t_{n+1}}B_{n+1,d}(r),\end{aligned}$$
(6)

where \(t_n\) is the n-th knot position and d is the degree of the polynomial. The position and number of knots, a non-decreasing sequence of support points that uniquely determines the basis set, may be fixed or treated as free parameters. B-splines are well suited for interpolation due to their intrinsic smoothness and differentiability. Their derivatives are also defined recursively:

$$B'_{n,d+1}(r)=d\left(\frac{B_{n,d}(r)}{t_{n+d}-t_{n}}-\frac{B_{n+1,d}(r)}{t_{n+d+1}-t_{n+1}}\right).$$
(7)
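
For concreteness, the recursions in Eqs. (6) and (7) can be cross-checked against SciPy’s B-spline routines. The following sketch is illustrative only (the knot vector, indices, and evaluation point are arbitrary choices, not values from this work):

```python
import numpy as np
from scipy.interpolate import BSpline

def b_spline(n, d, r, t):
    """Evaluate B_{n,d+1}(r) by the recursion of Eq. (6);
    d is the polynomial degree, t the non-decreasing knot sequence."""
    if d == 0:
        return 1.0 if t[n] <= r < t[n + 1] else 0.0
    left = (r - t[n]) / (t[n + d] - t[n]) * b_spline(n, d - 1, r, t)
    right = ((t[n + d + 1] - r) / (t[n + d + 1] - t[n + 1])
             * b_spline(n + 1, d - 1, r, t))
    return left + right

t = np.linspace(0.0, 7.0, 8)  # uniform knots (arbitrary example)
n, d, r = 0, 3, 2.4           # cubic basis function B_{0,4} at r = 2.4

# SciPy reference: the basis element supported on knots t_n, ..., t_{n+d+1}.
ref = BSpline.basis_element(t[n:n + d + 2], extrapolate=False)
assert np.isclose(b_spline(n, d, r, t), ref(r))

# Eq. (7): the derivative follows from lower-degree basis functions.
deriv = d * (b_spline(n, d - 1, r, t) / (t[n + d] - t[n])
             - b_spline(n + 1, d - 1, r, t) / (t[n + d + 1] - t[n + 1]))
assert np.isclose(deriv, ref.derivative()(r))
```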

The UF potential describes the energy E of an atomistic system via two- and three-body interactions:

$$E=\sum_{i,j}V_{2}(r_{ij})+\sum_{i,j,k}V_{3}(r_{ij},r_{ik},r_{jk}).$$
(8)

For finite systems such as molecules or clusters, indices i, j, k run over all atoms. For infinite systems modeled via periodic boundary conditions, i runs over the atoms in the simulation cell, and j, k run over all neighboring atoms, including those in adjacent copies of the simulation cell. While these are infinitely many, the sums are truncated in practice by assuming locality, that is, finite support of V2 and V3.
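
To illustrate this truncation, the sketch below assembles the two-body sum of Eq. (8) for a periodic cell from a cutoff neighbor list. ASE and the placeholder pair function are illustrative choices, not part of the implementation described here; only the 5.5 Å cutoff matches the value used later in this work.

```python
import numpy as np
from ase.build import bulk
from ase.neighborlist import neighbor_list

atoms = bulk("W", "bcc", a=3.165, cubic=True).repeat((4, 4, 4))
cutoff = 5.5  # Angstrom

# Locality truncates the lattice sums: only pairs with r_ij < cutoff
# survive. neighbor_list includes periodic images and returns every
# ordered pair (i, j) once, matching the double sum in Eq. (8), whose
# double counting is absorbed into V2.
i, j, r = neighbor_list("ijd", atoms, cutoff)

def V2(r):
    # Placeholder pair term; in the UF potential this is the fitted
    # B-spline expansion of Eq. (9).
    return np.exp(-r)

E2 = V2(r).sum()
print(f"{len(r)} pairs within cutoff, E2 = {E2:.3f} (arbitrary units)")
```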

Modeling N-body interactions requires \(\binom{N}{2}\)-dimensional tensor-product splines. The UF potential therefore expresses \(V_2\) and \(V_3\) as linear combinations of cubic B-splines, \(B_n=B_{n,3+1}\), and tensor-product splines:

$$\begin{aligned}V_{2}(r_{ij})&=\sum_{n=0}^{K}c_{n}B_{n}(r_{ij})\\ V_{3}(r_{ij},r_{ik},r_{jk})&=\sum_{l=0}^{K_{l}}\sum_{m=0}^{K_{m}}\sum_{n=0}^{K_{n}}c_{lmn}B_{l}(r_{ij})B_{m}(r_{ik})B_{n}(r_{jk}),\end{aligned}$$
(9)

where K, \(K_l\), \(K_m\), and \(K_n\) denote the number of basis functions per spline or tensor-spline dimension, and \(c_n\) and \(c_{lmn}\) are the corresponding coefficients.
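
A minimal sketch of Eq. (9) with SciPy illustrates both terms; the knot vector is clamped and uniform for simplicity, and the coefficients are random placeholders rather than fitted values:

```python
import numpy as np
from scipy.interpolate import BSpline

k = 3                                      # cubic B-splines
rng = np.random.default_rng(0)
t = np.concatenate(([0.0] * k, np.linspace(0.0, 5.5, 25), [5.5] * k))
K = len(t) - k - 1                         # number of basis functions

# Two-body term: a single spline with coefficient vector c_n.
c2 = rng.normal(size=K)
V2 = BSpline(t, c2, k, extrapolate=False)

# Three-body term: tensor product of three one-dimensional bases with a
# coefficient tensor c_lmn.
c3 = rng.normal(size=(K, K, K))

def basis_row(r):
    """Values of all K basis functions at distance r (at most 4 nonzero)."""
    return BSpline.design_matrix(np.atleast_1d(r), t, k).toarray()[0]

def V3(r_ij, r_ik, r_jk):
    return np.einsum("l,m,n,lmn->",
                     basis_row(r_ij), basis_row(r_ik), basis_row(r_jk), c3)

print(V2(2.7), V3(2.7, 2.7, 4.4))
```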

The B-spline basis set spans a finite domain bounded by the end knots \([t_0, t_K]\). Figure 2A illustrates the compact support of the cubic B-spline basis functions. We constrain the coefficients such that at the upper limit \(t_K\) the potential smoothly goes to zero, and near the lower limit \(t_0\) it increases monotonically with decreasing distance to prevent atoms from getting unphysically close. By construction, each basis function is non-zero across four adjacent knot intervals. Therefore, evaluating the two- and three-body potentials involves evaluating at most 4 and \(4^3=64\) basis functions for any pair or triplet of distances, respectively, giving rise to the aforementioned computational efficiency.

Fig. 2: B-spline basis set for pair potentials.

A Ten weighted B-splines after fitting (\(B_n^{*}=c_n B_n(r)\), solid curves) and their ranges of support. Their sum is the fitted pair potential (V(r), blue curve). In this example, the knots \(t_n\) are selected with uniform spacing and illustrated as vertical lines. B Using cubic B-spline basis functions, the pair potential V(r) has a smooth and continuous first derivative \(V'(r)\) (dotted line), which is essential for reproducing accurate forces. Its second derivative \(V''(r)\) (dashed line) is continuous, which is essential for reproducing stresses and phonon frequencies. C The optimized UF2 potential exhibits more inflection points than the optimized LJ (dotted line) and Morse (dashed line) potentials, highlighting its increased flexibility.

The force \(\mathbf{F}_a\) acting on atom a is given as the negative gradient \(-\nabla_{\mathbf{R}_a}E\) of the energy E with respect to the atom’s Cartesian coordinates \(\mathbf{R}_a\) and is obtained analytically from the derivatives of the two- and three-body potentials,

$$\begin{aligned}\frac{\partial V_{2}(r_{ij})}{\partial R_{a,\ell}}&=\sum_{n=0}^{K}c_{n}B'_{n}(r_{ij})\frac{\partial r_{ij}}{\partial R_{a,\ell}}\\ \frac{\partial V_{3}(r_{ij},r_{ik},r_{jk})}{\partial R_{a,\ell}}&=\sum_{l=0}^{K_{l}}\sum_{m=0}^{K_{m}}\sum_{n=0}^{K_{n}}c_{lmn}\left(B'_{l}(r_{ij})B_{m}(r_{ik})B_{n}(r_{jk})\frac{\partial r_{ij}}{\partial R_{a,\ell}}+B_{l}(r_{ij})B'_{m}(r_{ik})B_{n}(r_{jk})\frac{\partial r_{ik}}{\partial R_{a,\ell}}+B_{l}(r_{ij})B_{m}(r_{ik})B'_{n}(r_{jk})\frac{\partial r_{jk}}{\partial R_{a,\ell}}\right)\end{aligned}$$
(10)

with

$$\frac{\partial r_{ij}}{\partial R_{a,\ell}}=\frac{(\delta_{aj}-\delta_{ai})(R_{j,\ell}-R_{i,\ell})}{r_{ij}},$$
(11)

where \(\ell\) indexes the Cartesian components and \(\delta\) is the Kronecker delta.
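
The chain rule of Eqs. (10) and (11) is straightforward to validate numerically. A sketch for the two-body contribution, using a random placeholder spline rather than a fitted potential:

```python
import numpy as np
from scipy.interpolate import BSpline

k, rng = 3, np.random.default_rng(1)
t = np.concatenate(([0.0] * k, np.linspace(0.0, 5.5, 25), [5.5] * k))
V2 = BSpline(t, rng.normal(size=len(t) - k - 1), k, extrapolate=False)
dV2 = V2.derivative()  # quadratic B-splines, cf. Eq. (7)

def force_on_i(R_i, R_j):
    """Two-body part of Eqs. (10)-(11): F_i = -dV2/dr * dr_ij/dR_i,
    where dr_ij/dR_i = -(R_j - R_i)/r_ij."""
    dR = R_j - R_i
    r = np.linalg.norm(dR)
    return float(dV2(r)) * dR / r

# Cross-check the x component against a central finite difference.
R_i, R_j, eps = np.zeros(3), np.array([2.1, 0.3, -0.4]), 1e-6
dx = np.array([eps, 0.0, 0.0])
E_plus = float(V2(np.linalg.norm(R_j - (R_i + dx))))
E_minus = float(V2(np.linalg.norm(R_j - (R_i - dx))))
assert np.isclose(force_on_i(R_i, R_j)[0],
                  -(E_plus - E_minus) / (2 * eps), atol=1e-5)
```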

Figure 2B illustrates, for the two-body potential, that the choice of the cubic B-spline basis results in a smooth, continuous first derivative composed of quadratic B-splines and a continuous second derivative composed of linear B-splines.

Regularized least-squares optimization with energies and forces

During the fitting procedure, we optimize all spline coefficients c simultaneously with the regularized linear least-squares method. Given atomic configurations \(\mathcal{S}\), energies \(\mathcal{E}\), and forces \(\mathcal{F}\), we fit the potential-energy function E of Equation (8) by minimizing the loss function

$$\begin{aligned}L&=\frac{\kappa}{\sigma_{\mathcal{E}}^{2}\,|\mathcal{E}|}\sum_{s\in\mathcal{S}}\left(E(s)-\mathcal{E}_{s}\right)^{2}+\frac{1-\kappa}{\sigma_{\mathcal{F}}^{2}\,|\mathcal{F}|}\sum_{s\in\mathcal{S}}\left(-\nabla E(s)-\mathcal{F}_{s}\right)^{2}\\&\quad+\lambda_{1}\sum_{n}^{K}c_{n}^{2}+\lambda_{2}\sum_{n}^{K}\left(c_{n}-2c_{n+1}+c_{n+2}\right)^{2},\end{aligned}$$
(12)

where the second sum is taken over force components. Here, \(\kappa\in[0,1]\) is a weighting parameter that controls the relative contributions of energy and force-component residuals, \(|\mathcal{E}|\) and \(|\mathcal{F}|\) are the numbers of energy and force-component observations in the training set, and \(\sigma_{\mathcal{E}}\) and \(\sigma_{\mathcal{F}}\) are the sample standard deviations of energies and force components across the training set. This normalization yields dimensionless residuals, allowing κ to balance the relative contributions from energies and forces independently of the size and variance of the training energies and forces.
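
In matrix form, this normalization amounts to scaling the energy and force rows of the design matrix before solving. A sketch (the array names and the default value of κ are illustrative):

```python
import numpy as np

def weighted_system(X_E, y_E, X_F, y_F, kappa=0.5):
    """Fold the weighting of Eq. (12) into row weights so the loss
    becomes an ordinary (regularized) least-squares objective.

    X_E, y_E: energy rows of the design matrix and energy targets;
    X_F, y_F: force-component rows and targets.
    """
    w_E = np.sqrt(kappa / (np.std(y_E, ddof=1) ** 2 * len(y_E)))
    w_F = np.sqrt((1.0 - kappa) / (np.std(y_F, ddof=1) ** 2 * len(y_F)))
    X = np.vstack([w_E * X_E, w_F * X_F])
    y = np.concatenate([w_E * y_E, w_F * y_F])
    return X, y
```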

The minimization of L with respect to the spline coefficients c is a linear least-squares problem with Tikhonov regularization and solution

$$\mathbf{c}=\left(\mathbf{X}^{T}\mathbf{X}+\lambda_{1}\mathbf{I}+\lambda_{2}\mathbf{D}_{2}^{T}\mathbf{D}_{2}\right)^{-1}\mathbf{X}^{T}\mathbf{y},$$
(13)

where I is the identity matrix, y contains energies and forces, and each element of X is the sum of B-spline values taken over all relevant pair distances in each configuration (rows) for each basis function (columns). Subsets of columns correspond to different body orders. Similarly, for multi-component systems, subsets of columns correspond to different chemical interactions. When forces are included in y, the corresponding rows of X are generated according to Eqs. (10) and (11). This optimization problem is strongly convex, allowing for an efficient and deterministic solution, e.g., with LU decomposition.

The Tikhonov regularization controls the magnitude of the spline coefficients c through the parameter λ1 via the ridge penalty, and the curvature and local smoothness across adjacent spline coefficients through λ2 via the difference penalty43,44. For the two-body case, D2 is given by

$$\mathbf{D}_{2}=\left(\begin{array}{ccccc}1&-2&1&&0\\ &\ddots&\ddots&\ddots&\\ 0&&1&-2&1\end{array}\right).$$
(14)

For higher-order potential terms, the difference penalty affects the spline coefficients that are adjacent in each dimension of a tensor product spline. This difference penalty is related to a penalty on the integral of the squared second derivative of the potential26,45,46. However, the difference penalty is less complex because the dimensionality of the corresponding regularization problem is simply the number of basis functions K44.
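
Both penalties lead to a small, dense, symmetric linear system. A sketch of the two-body solve of Eqs. (13) and (14), taking the weighted X and y from above as inputs (the regularization strengths are placeholders, not the hyperparameters used in this work):

```python
import numpy as np

def fit_spline_coefficients(X, y, lambda1=1e-8, lambda2=1e-6):
    """Solve c = (X^T X + l1*I + l2*D2^T D2)^{-1} X^T y, Eq. (13)."""
    K = X.shape[1]
    # Second-difference matrix D2 of Eq. (14), shape (K - 2, K).
    D2 = np.zeros((K - 2, K))
    for n in range(K - 2):
        D2[n, n:n + 3] = (1.0, -2.0, 1.0)
    A = X.T @ X + lambda1 * np.eye(K) + lambda2 * D2.T @ D2
    # A is symmetric positive definite (the problem is strongly convex),
    # so a direct factorization yields the unique minimizer.
    return np.linalg.solve(A, X.T @ y)
```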

Figure 2C compares optimized UF2, LJ, and Morse potentials for tungsten. Although the three curves have similar minima and similar behavior at larger pair distances r, the UF2 potential exhibits additional inflection points. We attribute the ability of the UF2 potential to reproduce the properties of a bcc metal, a traditionally difficult task for pair potentials, to this flexible functional form.

Fitting a UF potential maps energy and force data onto effective two- and three-body terms, as shown in Fig. 3 for the tungsten dataset. Both terms can be visualized directly, providing interpretability by disentangling contributions to the interatomic interactions. Inspecting the minima, the repulsive and attractive contributions, and the inflection points gives insight into the chemical-bonding characteristics of the material. This straightforward, visual analysis makes UF potentials more directly interpretable than most MLPs.

Fig. 3: Visualization of two-body and three-body terms of a tungsten UF potential.

A Distribution of pair interactions in tungsten training data and the learned two-body component of the fitted potential. B The learned three-body component V3(rij, rik, rjk) corresponds to the contribution to the total energy of a central atom i interacting with two neighbors j and k. It is fit simultaneously with the two-body potential. Here, θjik is substituted for rjk using the law of cosines for ease of visualization. C Volume slices of V3(rij, rik, θjik) reveal favorable (blue) and unfavorable (red) three-body interactions.

Convergence of error in energy and force predictions

By construction, UF potentials can exactly reproduce any two- and three-body reference potential, given sufficiently many basis functions and training data. To establish baseline functionality, we fit the UF2 potential to energies and forces from the LJ potential for elemental tungsten and the UF2,3 potential to energies and forces from the Stillinger–Weber potential for elemental silicon; both reference potentials are reproduced with negligible error (see Supplementary Figs. 2 and 3 for learning curves and details).

To assess the accuracy of UF potentials, we measure their ability to predict density-functional theory (DFT) energies and forces in tungsten as a function of the amount of training data. Figure 4 compares learning curves for the two-body UF2, the two- and three-body UF2,3, the SNAP, and the qSNAP potentials. To quantify prediction performance, we use the root-mean-squared error (RMSE) on randomly sampled hold-out test sets, ensuring that each test set contains all available configuration types (see Section “Data”). Each learning curve is fit with a softplus function \(\ln(1+e^{n})\), where n is the number of training examples. This function captures both the initial linear slope in log-log space and the observed saturation due to the models’ finite complexity.
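
One plausible parameterization of such a fit, in log-log coordinates, is sketched below with SciPy; the exact functional form and the data points are assumptions for illustration, not the values behind Fig. 4:

```python
import numpy as np
from scipy.optimize import curve_fit

def softplus_model(log_n, floor, slope, shift):
    """Assumed form: the log-error falls linearly in log(n) with the
    given slope and saturates at a finite floor."""
    return floor + np.log1p(np.exp(shift - slope * log_n))

# Hypothetical learning-curve samples (training-set size, test RMSE).
n_train = np.array([10.0, 30.0, 100.0, 300.0, 1000.0, 3000.0])
rmse = np.array([0.90, 0.50, 0.25, 0.15, 0.11, 0.10])

popt, _ = curve_fit(softplus_model, np.log(n_train), np.log(rmse),
                    p0=(np.log(rmse.min()), 1.0, 0.0))
print("saturation error ~", np.exp(popt[0]))
```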

Fig. 4: Learning curves for UF and SNAP potentials fit to bcc tungsten.

Shown are out-of-sample normalized prediction errors (dots) for training sets of increasing size, with five repetitions per size. Fitted curves (lines) are softplus functions that capture both the initial linear slope in log-log space and the observed saturation. The normalized error includes energy and force error contributions, weighted by the standard deviations of energies and forces in the training set, respectively. Simpler potentials saturate earlier but at higher error than more complex potentials (UF2 vs. SNAP; UF2 vs. UF2,3; SNAP vs. qSNAP), outperforming them when training data is limited.

Of the four models, the UF2 potential, using 28 basis functions, converges earliest. The SNAP potential, using 56 basis functions based on hyperspherical harmonics, converges slightly later with lower errors. The qSNAP potential, using 496 basis functions including quadratic terms, converges last, with modest improvements in error over SNAP. Finally, the UF2,3 potential, using 915 basis functions, is comparable to SNAP in convergence speed with lower errors in energies and similar errors in forces. Separate energy and force learning curves are included in Supplementary Fig. 5.

The convergence speed is related to both the number of basis functions and the complexity of the many-body interactions. This indicates that the simpler UF2 potential may be more suitable than other MLPs when data is scarce.

Validation with derived quantities

To benchmark performance for applications, we used each potential to compute several derived quantities that were not included in the fit, such as the phonon spectrum and the melting temperature. Figure 5 shows the relative errors in 12 quantities: energy, forces, phonon frequencies, lattice constant a0, elastic constants C11, C12, and C44, bulk modulus B, surface energies E100, E110, and E111, and vacancy formation energy EV. Energy, force, and phonon predictions are displayed as percent RMSE, normalized by the sample standard deviation of the reference values. The remaining scalar quantities are displayed as percentage errors and tabulated in Supplementary Table I. The training set includes surface, vacancy, and strained bcc configurations; hence C11, C12, C44, E100, E110, E111, and EV are not measures of extrapolation.

Fig. 5: Performance for derived quantities of seven potentials relative to the DFT reference for bcc tungsten.

The solid black line in each spider plot indicates zero error. Energy, force, and phonon-spectrum errors are percent RMSE, normalized by the sample standard deviation of the reference values. Other errors are percentage errors. The UF2 potential achieves an accuracy approaching that of SNAP and qSNAP, while the UF2,3 potential achieves an accuracy comparable to MTP and GAP.

Despite its low computational cost, the UF2 pair potential exhibits errors comparable to those of more complex potentials, such as SNAP and qSNAP. In contrast, the LJ and Morse potentials severely overpredict and underpredict most properties, respectively. One source of error in pair potentials, including the UF2 potential, is the deviation of real materials from the Cauchy relations. The Cauchy relations are constraints between elastic constants that hold if atoms interact only via central forces, e.g., pair potentials, and every atom is a center of inversion, such as in bcc tungsten47. For fcc and bcc lattices, the Cauchy relation is C12 = C44. Noble-gas crystals nearly fulfill this condition, but significant deviations occur for most other crystals. Hence, the errors in C12 and C44 are larger for pair potentials, where the two are constrained to be equal, than for models with many-body terms. This limitation in modeling the elastic response may hinder the prediction of mechanical properties and defect-related quantities48.

As an example of an empirical potential used in practice, we selected the EAM4 potential49 for all benchmarks. The EAM4 model, one of four tungsten models developed by Marinica et al., accurately reproduces the Peierls energy barrier and dislocation core energy49. The EAM4 potential was fitted to materials’ properties such as those in Fig. 5 and, hence, exhibits reasonably low errors.

All potentials in Figs. 1 and 5, except EAM4, were fitted using the same training set of 1,939 configurations. While this training set is realistic in both size and diversity, the more complex SNAP, qSNAP, MTP, and GAP models may achieve even lower errors with a more extensive training set and larger basis sets.

In this work, the cut-off radius for interatomic interactions was set to 5.5 Å for all potentials except EAM4. Additional hyperparameters for the UF2, UF2,3, SNAP, qSNAP, and GAP basis sets are tabulated in Supplementary Tables II–V. For UF2,3, a separate, smaller cut-off radius of 4.25 Å was used for three-body interactions. This choice was motivated in part by precedent in other two- and three-body potentials, such as the modified embedded-atom potentials42, and in part by speed: the smaller cut-off radius results in ten times fewer three-body interactions and a corresponding speed-up.

For the bcc tungsten system, the UF2,3 potential approaches the accuracy of the MTP and GAP potentials. Adding the three-body interactions eliminates the error associated with the Cauchy discrepancy, improving the elastic constant predictions. The UF2,3 potential also achieves lower errors for the surface and vacancy formation energies than the UF2 pair potential, underscoring the value of including three-body interactions.

Figure 6 compares the phonon spectra calculated with the various MLPs to the DFT reference values33. Perhaps surprisingly, the UF2 pair potential is comparable in error to the potentials with many-body terms. In contrast, the Morse pair potential is rather inaccurate, and the LJ pair potential, not shown, yields a phonon spectrum with imaginary frequencies and large frequency oscillations. Table 1 summarizes the RMSE of the phonon frequencies computed across the 26 DFT reference values. As for the other calculated properties, the addition of three-body interactions in the UF2,3 potential significantly improves the agreement with the DFT reference compared to the UF2 pair potential and even surpasses the other, computationally more expensive MLPs.

Fig. 6: Phonon dispersion curves for bcc tungsten.

A The UF2 potential outperforms empirical potentials (Morse, EAM4) in reproducing the reference phonon frequencies (pink squares). The LJ curve, omitted, exhibits large oscillations and negative phonon frequencies. B SNAP and qSNAP have similar phonon frequency errors to the UF2 and EAM4 potentials. C The addition of three-body interactions allows the UF2,3 potential to approach the performance of the MTP and GAP potentials.

Table 1 Melting temperature predictions alongside energy, force, and phonon frequency benchmarks for bcc tungsten.

As an example of a practical, large-scale calculation, we predict the melting temperature of tungsten (see Section “Calculations” for computational details). Since the training set of the MLPs does not include liquid configurations, the melting-point predictions measure the models’ extrapolative capacity. Table 1 compares the predicted melting temperatures of the MLPs to the ab-initio reference value of 3465 K50. We observe that the LJ and Morse pair potentials are limited in their predictive capabilities, while the UF2 potential is comparable in accuracy to the more complex potentials, which yield reasonable predictions. The qSNAP melting calculation failed due to numerical instability at high temperatures, which is known to occur in high-dimensional potentials51: bonds break at high temperatures, leading to many local atomic configurations that are underrepresented in the training set. We expect that training the qSNAP potential on a suitable, more extensive dataset would remove this instability. The extrapolative capacity of the UF2 and UF2,3 potentials in melting-temperature simulations illustrates that UF potentials can provide an accurate description with only a moderately sized training dataset, indicating their usefulness for materials simulations with limited reference data.

Summary

We developed and implemented an MLP that is fast to train and evaluate, provides an interpretable form, is extendable to higher-order interactions, and accurately describes materials even for comparatively sparse training sets. The approach is based on an effective many-body expansion and utilizes a flexible B-spline basis. The resulting regularized linear least-squares optimization problem is strongly convex, enabling efficient training.

For the example of elemental tungsten, the UF2 pair potential produces energy, force, and property predictions rivaling those of SNAP and qSNAP while matching the cost of the Morse potential, corresponding to a reduction in computational cost by two orders of magnitude. Despite the intrinsic limitations of the pair potential in capturing physics, we find that the UF2 pair potential yields reasonable predictions in property benchmarks, such as for elastic constants, phonons, surface energies, and melting temperature. The UF2,3 potential, which accounts for three-body interactions, approaches the accuracy of MTP and GAP while reducing computational cost by one to three orders of magnitude.

The rapidly increasing number of MLPs, from ultra-fast linear models to graph neural network potentials, highlights the trade-offs between computational efficiency, robustness, and model capacity. Complex, high-dimensional MLPs are expected to yield higher accuracy for complex systems at the price of greatly increased computational cost and data requirements. The UF approach yields potentials that are fast and robust at the price of reduced flexibility and possibly greater errors for complex systems. Future work on UF potentials will explore the use of active learning for increased robustness and data efficiency, as well as the addition of the four-body term, which is necessary for modeling dihedral angles that are critical to describing organic molecules and protein structures. The software for fitting UF potentials and running them in the LAMMPS36,37 molecular dynamics code is freely available in our GitHub repository34.

Methods

Data

To compare the UF potential against existing potentials, we use a dataset by Szlachta et al.33, which has been used before to benchmark the GAP33, SNAP52, and aPIP26 potentials. This dataset of 9,693 tungsten configurations includes body-centered cubic (bcc) primitive cells, bulk snapshots from molecular dynamics, surfaces, vacancies, gamma surfaces, gamma-surface vacancies, and dislocation quadrupoles. Energies, forces, and stresses in the dataset were computed using DFT with the Perdew–Burke–Ernzerhof53 functional.

B-spline basis

The UF potential uses natural cubic B-splines: their first derivative is continuous and smooth, while their second derivative is continuous. They are natural rather than clamped in that their second derivative is zero at the boundaries. These properties, which are critical for accurately reproducing forces and stresses, motivated our choice of basis set. Other B-spline schemes have been explored for interpolation in empirical-potential development; Wen et al.41 discuss clamped and Hermite splines as well as quartic and quintic splines. The natural cubic spline is sufficient except when computing properties that rely on the third and fourth derivatives of the potential, such as thermal expansion and finite-temperature elastic constants.

Uniform spacing of knots is a reasonable choice in many cases. However, one may adjust the density of knots to control resolution in regions of interest. Due to compact support in this basis, each pair-distance energy requires the evaluation of exactly four B-splines. Hence, the potential’s speed scales with neither the number of knots nor the number of basis functions.

On the other hand, the minimum distance between knots limits the maximum curvature of the function. The optimum density of knots thus depends on the training set. Underfitting or overfitting may arise from insufficient or excessive knot density, respectively. Based on convergence tests (see Supplementary Fig. 1), we fit UF potentials in this work using 25 uniformly spaced knots.

By construction, each spline coefficient influences the overall function across five adjacent knots. As a result, one can tune the shape of the potential further according to additional constraints. For instance, soft-core and smooth-cutoff requirements may be satisfied by adjusting spline coefficients at the ends. In this work, we ensure that the potential and its first derivative smoothly go to zero when approaching the cutoff at rij = tK by setting the last three coefficients to 0.
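
A quick numerical check of this cutoff constraint, with random placeholder coefficients and a clamped knot vector for simplicity (this work uses natural boundary conditions, but the argument at the cutoff is the same):

```python
import numpy as np
from scipy.interpolate import BSpline

k, r_cut = 3, 5.5
t = np.concatenate(([0.0] * k, np.linspace(0.0, r_cut, 25), [r_cut] * k))
c = np.random.default_rng(2).normal(size=len(t) - k - 1)
c[-3:] = 0.0  # smooth-cutoff constraint: zero the last three coefficients

V = BSpline(t, c, k, extrapolate=False)
r = r_cut - 1e-8  # just inside the cutoff
assert abs(float(V(r))) < 1e-9
assert abs(float(V.derivative()(r))) < 1e-6
```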

Models

In this work, we partitioned the data using a random split of 20% training and 80% testing data. The testing set was used to evaluate the RMSE in energies and forces, as reported in Table 1 and Fig. 1. The LJ and Morse potentials were optimized using the Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm as implemented in the SciPy library54. The SNAP and qSNAP potentials were retrained using the MAterials Machine Learning (MAML) package55. The GAP potential was retrained using the QUantum mechanics and Interatomic Potentials (QUIP) package56,57. We used the EAM4 potential, obtained from the NIST Interatomic Potential Repository58,59, without modification. The MTP potential was fit using the MLIP package60.

The size of the training set was selected to represent common, data-scarce scenarios. With a larger training set, the MTP, SNAP, qSNAP, and GAP models would likely produce better predictions. We refer the reader to the original works and existing benchmarks18,32 for details regarding convergence with training examples and model complexity.

Calculations

All potentials in Fig. 1 were benchmarked using one thread on an AMD EPYC 7702 Rome (2.0 GHz) CPU. Computational cost measurements for each potential are reported as the average over ten simulations.

We use two methods to estimate the computational cost of the UF2,3 potential and present both values in Fig. 1 and Table 1. The lower estimate, 0.76 ms/step, is computed by multiplying the computational cost of UF2 by the ratio of floating point operations used by UF2,3 and UF2. The higher estimate, 2.03 ms/step, is computed using the ratio of speeds in the Python implementation multiplied by the reported cost of UF2. We show performance bounds instead of the current UF2,3 implementation’s computational cost because it has not been fully optimized yet.

Elastic constants were evaluated using the Elastic package61. Phonon spectra were evaluated using the Phonopy package62. Melting temperatures were calculated in LAMMPS using the two-phase method and a time step of 1 fs. The initial system, a 16 × 8 × 8 bcc supercell (2048 atoms), was equilibrated at a selected temperature for 40,000 time steps. Next, the solid-phase atoms were fixed while the liquid-phase atoms were heated to 5000 K and cooled back to the initial temperature over 80,000 time steps. Finally, the system was equilibrated over 200,000 steps using the isoenthalpic-isobaric (NPH) ensemble. If the final configuration did not contain both phases, the procedure was repeated with a different initial temperature. Reported melting temperatures were computed by taking the average over the final 100,000 steps.