BIGDML—Towards accurate quantum machine learning force fields for materials

Sauceda, Huziel E.; Gálvez-González, Luis E.; Chmiela, Stefan; Paz-Borbón, Lauro Oliver; Müller, Klaus-Robert; Tkatchenko, Alexandre

doi:10.1038/s41467-022-31093-x

Download PDF

Article
Open access
Published: 29 June 2022

BIGDML—Towards accurate quantum machine learning force fields for materials

Nature Communications volume 13, Article number: 3733 (2022) Cite this article

10k Accesses
29 Citations
14 Altmetric
Metrics details

Subjects

Abstract

Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof. Currently, MLFFs often introduce tradeoffs that restrict their practical applicability to small subsets of chemical space or require exhaustive datasets for training. Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning (BIGDML) approach and demonstrate its ability to construct reliable force fields using a training set with just 10–200 geometries for materials including pristine and defect-containing 2D and 3D semiconductors and metals, as well as chemisorbed and physisorbed atomic and molecular adsorbates on surfaces. The BIGDML model employs the full relevant symmetry group for a given material, does not assume artificial atom types or localization of atomic interactions and exhibits high data efficiency and state-of-the-art energy accuracies (errors substantially below 1 meV per atom) for an extended set of materials. Extensive path-integral molecular dynamics carried out with BIGDML models demonstrate the counterintuitive localization of benzene–graphene dynamics induced by nuclear quantum effects and their strong contributions to the hydrogen diffusion coefficient in a Pd crystal for a wide range of temperatures.

Accelerating GW calculations through machine-learned dielectric matrices

Article Open access 07 October 2023

Machine-learning-accelerated simulations to enable automatic surface reconstruction

Article 07 December 2023

CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling

Article Open access 14 September 2023

Introduction

The development and implementation of accurate and efficient machine learning force fields (MLFF) is transforming atomistic simulations throughout the fields of physics^1,2,3,4,5, chemistry^{6,7,8,9,10,11,12,13,14}, biology^15,16, and materials science^{17,18,19,20,21,22}. The application of MLFFs have enabled a wealth of novel discoveries and quantum-mechanical insights into atomic-scale mechanisms in molecules^{3,6,22,23,24,25} and materials^2,4,26,27,28.

A major hurdle in the development of MLFFs is to optimize the conflicting requirements of ab initio accuracy, computational speed and data efficiency, as well as universal applicability to increasingly larger chemical spaces²⁹. In practice, all existing MLFFs introduce tradeoffs that restrict their accuracy, efficiency, or applicability. In the domain of materials modeling, all MLFFs known to the authors employ the so-called locality approximation, i.e. the global problem of predicting the total energy of a many-body condensed-matter system is approximated by its partitioning into localized atomic contributions. The locality approximation has been rather successful for capturing local chemical degrees of freedom, as demonstrated in a wide number of applications^{30,31,32,33,34}. However, we emphasize that the locality assumption disregards non-local interactions and its validity can only be truly assessed by comparison to experimental observables or explicit ab initio dynamics. This fact restricts truly predictive MLFF simulations of realistic materials, whose properties are often determined by a complex interplay between local chemical bonds and a multitude of non-local interactions.

The chemical space of materials is exceedingly diverse if we count all possible compositions and configurations of a given number of chemical elements. For example, an accurate MLFF reconstruction of the potential-energy surface (PES) of elemental bulk materials to meV/atom accuracy often requires many thousands of configurations for training^{21,30,35,36,37,38}. The MLFF errors also increase at least by an order of magnitude when including defects or surfaces^32,35.

Heteroatomic materials and interfaces between molecules and materials would require substantially more training data for creating predictive MLFFs and accuracies much better than 1 meV/atom, eventually making the modeling of such materials intractable. In addition, there is a strong desire to go beyond traditional density-functional theory (DFT) reference data in the field of atomistic materials modeling^39,40,41. Beyond-DFT methods can only be realistically applied to compute dozens or hundreds of geometries, making the construction of beyond-DFT MLFFs impractical.

To address these challenges, in this work we introduce a Bravais-Inspired Gradient Domain Machine Learning (BIGDML) model for periodic materials that is accurate, data efficient, and computationally inexpensive at the same time. The BIGDML model extends the applicability domain of the Symmetric Gradient-Domain Machine Learning (sGDML) framework^23,42,43 to include periodic systems with supercells containing up to roughly 200 atoms. The BIGDML model employs a global representation of the full system, i.e. treating the supercell as a whole instead of a collection of atoms. This avoids the uncontrollable locality approximation, but also restricts the maximum number of atoms in the unit cell. To extend the applicability of BIGDML to much larger unit cells will require the development of a global multiscale representation. An additional advantage of a global representation is that cross-correlations between forces on different atomic species are dealt with rigorously. Specifically, MLFFs based on the locality approximation construct separate models for each atom type. In contrast, the BIGDML model employs a global force covariance, allowing many-body correlations between atomic forces in a given supercell structure and capturing relevant interatomic interactions at different spatial scales. Similarly to the sGDML model, another key advantage of the BIGDML model is the usage of physical constraints (energy conservation) and all relevant physical symmetries of periodic systems, including the full translation and Bravais symmetry groups. As a consequence, BIGDML models achieve meV/atom accuracy already for 10–200 training points, surpassing state-of-the-art atom-based models by 1-2 orders of magnitude. This result underlines once again the importance of including prior knowledge, including physical laws and symmetries, into ML models. Clearly, what is known does not need to be learned from data—in effect the data manifold has been reduced in its complexity (see e.g.^{23,24,34,44,45,46,47}). It is important to mention that describing materials with several hundreds of atoms, having an exceedingly large number of symmetries, or transferring models between different systems still remain challenging tasks for the BIGDML model. Nevertheless, these technical issues could be addressed by utilising multiscale and composite approaches as well as iterative numerical kernel solvers (see the Discussion section for an extended discussion).

Altogether, the BIGDML framework opens the possibility to accurately reconstruct the PES of complex periodic materials with unprecedented accuracy at very low computational cost. In addition, the BIGDML model can be straightforwardly implemented as an ML engine in any periodic DFT code, and used as a molecular dynamics driver after being trained on just a handful of geometries.

Results

The BIGDML framework relies on two advances: (i) a global atomistic representation with periodic boundary conditions (PBC), (ii) the use of the full translation and Bravais symmetry group for a given material.

PBC-preserving representation

To avoid localization of interatomic interactions and artificial (from the electronic perspective) atom-type assignment, we use an efficient global representation with PBC. Following the sGDML approach for molecules^23,42, we take the atomistic Coulomb matrix (CM)⁴⁸ as a starting representation. When used with sGDML, the CM has been proven to be a robust, accurate, and efficient representation^23,42,49.

Here, we introduce a generalization of the molecular CM descriptor to represent periodic materials, ${{{{{{{{\mathcal{D}}}}}}}}}^{({{{{{{{\rm{PBC}}}}}}}})}$. In order to construct the Coulomb matrix for extended systems, we first enforce the PBC using the minimal-image convention (MIC)^50,51:

$${{{{{{{{\mathcal{D}}}}}}}}}_{ij}^{({{{{{{{\rm{PBC}}}}}}}})}=\left\{\begin{array}{ll}\frac{1}{| {{{{{{{{\bf{r}}}}}}}}}_{ij}-{{{{{{{\bf{A}}}}}}}}{\mathtt{mod}}({{{{{{{{\bf{A}}}}}}}}}^{-1}{{{{{{{{\bf{r}}}}}}}}}_{ij})| } &{{{{{{{\rm{if}}}}}}}}\,i\ne j\hfill\\ 0 \hfill&{{{{{{{\rm{if}}}}}}}}\,i=j\end{array}\right.$$

(1)

where r_ij = r_i − r_j is the difference between two atomic coordinates i and j, and A is the matrix defined by the supercell translation vectors as columns. Figure 1A left shows the Coulomb matrix descriptor when considering only the supercell structure with no PBC, which means that the ML model considers the system as a finite “molecule”. The right side of Fig. 1A shows the descriptor with the PBC enforced (Eq. (1)), having now the correct periodic structure.

Many widely used periodic global representations already exist, for example CM-inspired global descriptors such as the Ewald-sum, or extended Coulomb-like and sine matrices⁵². In the cases of the extended Coulomb-like and Ewald matrices, these representations account the contribution of the same atom iteratively by considering its multiple periodic images, which is computationally demanding and algebraically involved. From these global periodic representations⁵², only the sine matrix avoids using redundant information, since it just depends on the atomic positions in a single unit cell. Nevertheless, this is only a good representation for studying the crystal structure of materials at equilibrium, given that its main feature (i.e. projection of the full supercell to a single unit cell) cannot provide an accurate measure of two atoms in different cells. Additionally, in the case where the supercell of the system is taken as a “unit cell” in the sine matrix, the obtained descriptor is essentially local and unable to capture long-range interatomic correlations within the supercell. Our choice of CM with PBC enforced using MIC is one of the simplest and efficient choices, which also turns out to be exceptionally accurate and data-efficient, as will be shown below.

As an alternative to the global approach, many local materials’ representations have been developed. Among those representations, there are numerous descriptors based on atomic local environments, for example atom-density representations^53,54,55,56, partial radial-distribution functions⁵⁷, FCHL descriptor⁵⁸, rotationally-invariant internal representation⁵⁹, many-body vector interaction⁶⁰ and moment tensor potentials¹⁸. In all these cases, the PBC can be naturally incorporated by using the MIC, as it has been done for mechanistic force fields. These local representations in principle aim at the construction of transferable interatomic MLFFs, as done by GAP/SOAP framework⁵⁵ which is the basis of a series of high-quality chemical bonding potentials for phosphorus³², carbon³⁵, and silicon²¹. However, the intrinsic cutoff radius in these descriptors limits the extent of atomic environments, neglecting the ubiquitous long-range interactions and correlations between different atomic species. Here, by using a ${{{{{{{{\mathcal{D}}}}}}}}}^{({{{{{{{\rm{PBC}}}}}}}})}$ global descriptor, we avoid the need of fine-tuning representation hyperparameters while preserving high accuracy in the description of the many possible configuration states of a material.

Translation symmetries and the Bravais’ group

The full symmetry group ${{{{{{{\mathcal{F}}}}}}}}$ of a crystal is given by the semidirect product of translation symmetries ${{{{{{{\mathcal{T}}}}}}}}$ and the rotation and reflection symmetries of the Bravais lattice ${{{{{{{\mathcal{G}}}}}}}}$ (Bravais’ group): ${{{{{{{\mathcal{F}}}}}}}}={{{{{{{\mathcal{T}}}}}}}}\otimes {{{{{{{\mathcal{G}}}}}}}}$⁶¹ (See Fig. 1B). This is a general result, meaning that it applies to any periodic system of dimension d, ${{{{{{{{\mathcal{F}}}}}}}}}^{(d)}={{{{{{{{\mathcal{T}}}}}}}}}^{(d)}\otimes {{{{{{{{\mathcal{G}}}}}}}}}^{(d)}$. In practice, the translation group ${{{{{{{\mathcal{T}}}}}}}}$ is constructed by the set of translations of the Bravais cell that span the supercell using the primitive translation vectors as a basis, while the Bravais’ group ${{{{{{{\mathcal{G}}}}}}}}$ is the symmetry point group of the unit cell. In order to illustrate these concepts, as an example, let us consider a graphene (d = 2) supercell of size 5 × 5. Its full symmetry group is ${{{{{{{{\mathcal{T}}}}}}}}}_{5\times 5}^{(2)}\otimes {{{{{{{{\mathcal{G}}}}}}}}}^{(2)}={T}_{5\times 5}^{(2)}\otimes {D}_{6h}$ and contains 300 symmetry elements. Further important materials with ample symmetries are surfaces and interfaces. Analogous to molecules possessing internal rotors, molecules interacting with a surface are another case of a fluxional system. For example, benzene adsorbed on graphene has a full fluxional symmetry group defined by the direct product of graphene’s full symmetry group and benzene’s molecular point group, ${[{T}_{5\times 5}^{(2)}\otimes {D}_{6h}]}_{graphene}\otimes {[{D}_{6h}]}_{benzene}$, which contains 3600 symmetry elements. Such a large number of symmetries reduces considerably the region of configuration space needed to be sampled to reconstruct the full PES and consequently generate MLFF models with high data efficiency. The presented arguments generalize to other materials, such as molecular crystals, rigid bulk materials, porous materials, and hybrid organic-inorganic materials, e.g. perovskites.

The BIGDML model

The construction of a BIGDML model consists in combining a global PBC-descriptor and the full symmetry group of the system in the gradient-domain machine learning framework (See Fig. 1), which leads to a robust and highly data efficient MLFF, capable of reaching state-of-the-art accuracy using only a few dozens of training points. We would like to stress here that such unprecedented data efficiency opens up many opportunities to study advanced materials using high levels of electronic-structure theory, such as sophisticated DFT approximations or even coupled-cluster theory⁶².

In a nutshell, the periodic global supercell descriptor and symmetries presented in the previous sections are combined with the sGDML framework to create the BIGDML predictor displayed in Fig. 1C. To illustrate the effects of the symmetries in the PES reconstruction process for the atom–surface Pd₁/MgO system, Fig. 1D presents a diagram where the different core elements of the BIGDML model are systematically included and the resulting (learned) PES is displayed. In this figure, the shown PES corresponds to the energy surface experienced by a Pd atom. The panel (i) displays the reconstructed energy surface with no symmetries, where the training samples are the purple squares and represent the position of the Pd atom. In panel (ii) the PBC are enforced by the periodic descriptor (Eq. (1)), and then this is combined with the use of the point group of the unit cell in panel (iii) and with translation symmetries in panel (iv). From the last two panels, we can see the characteristic contribution of each symmetry group, ${{{{{{{\mathcal{G}}}}}}}}$ symmetrizes the local PES by adding effective training samples (shown as grey circles) while ${{{{{{{\mathcal{T}}}}}}}}$ delocalises the effective sampling over the whole supercell. Then, by considering the full symmetry group ${{{{{{{\mathcal{F}}}}}}}}$, in panel (v) we arrive to the PES reconstructed by the BIGDML model where the effective training data symmetrically span the whole supercell. The panels (i) to (v) show the increasing symmetrization of the PES, but also illustrate the accuracy gain at each stage. The prediction accuracy plot shown in panel (vi) clearly shows the important impact of each symmetry group in generating accurate and robust BIGDML models. It is important to highlight that the achieved accuracy is a combination of several complementary contributions. The Coulomb matrix analytical form provides the correct description of long-range interactions, the Matern kernel provides a basis function that correctly describes the tails of the contributions of each training point (See Methods section), and the full crystal symmetry group correctly enforces the symmetries in the predictor function. This can be seen in (Fig. 1-D-vi), where the accuracy of the BIGDML model increases consistently as each element is added.

Prediction performance of BIGDML for different materials

The BIGDML model can be applied to accurately reproduce atomic forces and total energy of bulk materials, surfaces, and interfaces. To illustrate the applicability of BIGDML, in this section we have selected representative systems that cover the broad spectrum of materials, and study the prediction accuracy of our MLFFs as judged by the learning curves (test error as a function of the number of data points used for training). The considered systems include bulk materials (graphene as a representative 2D material, 3D metallic and semiconducting solids), surfaces (Pd absorbed on MgO surface), and van der Waals bonded molecules on surfaces (benzene adsorbed on graphene), as well as a bulk material with interstitial defects (hydrogen in palladium). Additionally, we analyse the case of the CsPbBr₃ perovskite to test the performance of the model for larger supercells. For a detailed description of the database generation and the levels of theory, as well as the parameters of the simulations and software packages employed, we refer the reader to the Methods section.

Bulk materials: graphene as a representative 2D material

Graphene is a well-characterized layered material that continues to exhibit many remarkable properties despite being thoroughly studied^36,63,64. Hence, developing accurate and widely applicable force fields for graphene and its derivatives is an active research area. Recently, Rowe et al.³⁶ presented a comprehensive comparison of existing hand-crafted force fields and a Gaussian-process approximated potential (GAP) using the Smooth Overlap of Atomic Positions (SOAP) local descriptor. The GAP/SOAP approach was shown to generalize much better than mechanistic carbon FFs. In Fig. 2 we show the learning curves of the BIGDML model for 5 × 5 supercell of graphene, showing that only 10 geometries (data samples) are needed to match the best-performing method to date (≈ 25 meV Å⁻¹ in force RMSE)³⁶. The performance and data efficiency of BIGDML is remarkable, given that it uses less than 1% of the amount of data employed by atom-based local descriptors. More importantly, by increasing the number of data samples used for training to 100, we reach a generalization error of ≈1 meV (0.02 meV/atom) in energies and ≈6 meV Å⁻¹ for forces. To our knowledge, such accuracies have not been obtained in the field of MLFFs for extended materials. In order to put our results into context of state-of-the-art MLFFs, in Fig. 3 we show the learning curves comparing GAP/SOAP and BIGDML for graphene (See Supplementary Fig. 2 for an extended comparison using different materials). Given the same data for training, BIGDML achieves an improvement of a factor of 10 in accuracy, both for the total energy and atomic forces. The same conclusions hold for other systems studied in this work, as shown in the Supplementary Figs. 1 and 2.

**Fig. 2: Learning curves for different materials.**

**Fig. 3: BIGDML and GAP performance comparison for graphene.**

Bulk materials: the case of cubic crystals

In the case of 3D materials, we apply our model to monoatomic metallic materials covering common cubic crystal structures: Pd[FCC], Au[FCC] and Na[BCC]. Figure 2 shows the learning curve for these three structures with a supercell of 3 × 3 × 3 and symmetry groups ${T}_{3\times 3\times 3}^{(3)}\times {O}_{h}$. An accuracy of ≈ 10 meV Å⁻¹ for a monoatomic metal material can be achieved using approximately 70 samples in the case of Pd (only 10,000 atomic forces), which is only a fraction of the data (less than 1%) required by other models to obtain the same accuracy³⁸.

Bulk materials: large perovskite supercell

In order to test the usability of the model for larger supercells, now we consider the case of the CsPbBr₃ perovskite, which contains 160 atoms per supercell. This all-inorganic perovskite is a system of great interest in photovoltaics given its stability under highly humid environments, hence it is a potential candidate for reliable solar cells. These materials are known to have high fluxionality which drive the system to visit multiple local minima at finite temperature. Therefore, allowing long molecular dynamic trajectories via ML models helps to collect better sampling statistics, and hence more robust physical observables. In Fig. 4, energy and forces learning curves are shown. Despite the larger supecell, already at 100 training samples we reach the ≈ 1 meV/atom energy accuracy and a force error of 50.7 meV/Å. Furthermore, we found that when training on 1000 samples, BIGDML manages to achieve energy and force accuracy of ≈ 0.1 meV/atom and ≈ 2.6 meV/Å, respectively.

**Fig. 4: Model performance on large supercells.**

The obtained accuracy demonstrates that the BIGDML model can also achieve high fidelity in the reconstruction of the PES for large multi-element supercells with rich fluxional dynamics, as it is the case of perovskite materials.

Surfaces: atom chemisorbed at a surface –Pd₁/MgO

One of the main challenges of constructing MLFFs on local atomic environments is that such representations can fail to capture subtle local changes with global implications. For example, when describing a surface or an interface, atoms of the same element are described by the same atomic embedding function which in order to encode the many possible neighbourhoods (atoms in deeper layers, atoms close to the surface of the material) requires large amounts of training data. This eventually leads to degradation of MLFF performance, a problem that could become practically intractable for local MLFFs when dealing with molecule-surface interactions. These limitations can be addressed in local models but at the cost of higher complexity models and manual tuning of hyperparameters, hence losing the key advantages of MLFFs. In this section, we show that the BIGDML method does not have such limitations by studying two representative systems: chemisorbed Pd/MgO-surface and physisorbed benzene/graphene.

In recent years, it has been shown that single-atom catalysts (SACs) can offer superior catalytic performance compared to clusters and nanoparticles^65,66,67. These heterogeneous catalysts consist of isolated metal atoms supported on a range of substrates, such as metal oxides, metal surfaces or carbon-based materials. As a showcase, here we use a single Pd atom supported on a pristine MgO (100) surface. The considered supercell consists of a 2 × 2 slab of MgO(100) with 3 layers, where the lowest layer is kept fixed, and a single Pd atom is deposited on the surface.

The full symmetry group for this system is ${T}_{2\times 2}^{(2)}\otimes {C}_{4v}$ with 64 elements. The learning curve (see Fig. 2) shows that only 200 samples are needed to reach energy and force accuracy values of ≈ 34 meV ( ≈ 0.7 meV/atom) and ≈ 30 meV Å⁻¹, respectively. Similarly as in the case of learning force fields for molecules in the gas phase, the target error is always relative to the relevant dynamics of the system and its energetics^23,42,43. In this context, the Pd atom is chemisorbed at an oxygen site and the lowest energetic barrier that the Pd atom experiences is of 450 meV, thus our error is ≈ 6% of this value. In Fig. 5 we show the minimum-energy barrier (MEB) of Pd atom displacing from one minimum to another on the MgO surface computed by the nudged elastic band (NEB) method (See Methods section for details). It must be noted that the Pd atom never crossed this barrier during the MD simulation used to generate the reference dataset, as displayed by the purple lines in Fig. 5 indicating the distribution of the Pd atom location in the training dataset. Hence, even though the model did not have information regarding the saddle point, the energetic barrier was nevertheless correctly modeled by BIGDML by incorporating translational and Bravais symmetries.

**Fig. 5: Model performance on unseen energy barriers.**

Surfaces: molecule physisorbed at a surface –Benzene/graphene

A highly active field of research in materials science concerns the interaction between molecules and surfaces, due to its fundamental and technological relevance. From the modeling point of view, describing non-covalent interactions within the framework of DFT remains a competitive research area given its intricacies, which has led to very accurate dispersion interaction methods^{68,69,70,71,72}. Nevertheless, most of the studies about these systems focus on global optimizations or short MD simulations. Here, we demonstrate the applicability of BIGDML by learning the molecular force field of the benzene molecule interacting with graphene.

The full symmetry group of the benzene/graphene system is ${T}_{5\times 5}^{(2)}\otimes {C}_{6v}^{(Graphene)}\otimes {C}_{6v}^{(Benzene)}$, which has a total of 3600 elements. This large number of symmetries greatly reduces the configurational space sampling requirements to reconstruct its PES, as can be seen from the learning curve shown in Fig. 2 where the energy error quickly drops below ≈ 43 meV (1 kcal mol⁻¹) training only on 10 datapoints and ≈ 21 meV with 30 training datapoints. For this system, the energy generalisation accuracy starts to saturate at 0.18 meV/atom when training on 100 configurations. Achieving such high generalization accuracy using only a handful of training data points for such a complex system convincingly illustrates the high potential of the BIGDML model, since it suddenly opens the possibility of performing predictive simulations for a wide variety of systems where only static DFT calculations are available so far.

The systems discussed in this section offer a general picture of the broad diversity of extended materials that the BIGDML model can describe with high data efficiency and unprecedented accuracy. In particular, the applications here introduced provide a range of families of systems and materials that can be described by the model. For example, it is to be expected that, in general, given the performance of the trained models for Na, Au, and Pd, mono-atomic materials with cubic unit cells will be accurately described by the BIGDML. On the other hand, the accurate description of the CsPbBr₃ perovskite material shows that the model can handle and accurately learn large multi-element and fluxional materials. Then, a similar performance is expected when applied to a family of materials with similar structural characteristics. In the same order of ideas, the performance and applicability of the BIGDML model to molecules interacting with 2D materials is demonstrated with the benzene/graphene system, given that similarly complex dispersion interactions are to be expected.

Validation of BIGDML models for materials properties

In the previous section, we demonstrated the prediction capabilities of the BIGDML method using statistical accuracy measures. Now, we assess the predictive power of BIGDML models in terms of predicting physical properties of materials. In this section, we first perform a thorough test for ML models by assessing the phonon spectra of 2D graphene and 3D bulk materials. Then, we proceed to test the performance beyond the harmonic approximation by carrying out molecular dynamics simulations and comparing observables against explicit DFT calculations. All simulations performed in this section were done using the best trained models displayed in the learning curves (See Fig. 2 and “Methods” section).

Phonon spectra

A common challenging test to assess force fields (machine learned^20,35,36 as well as conventional FFs^73,74,75) is the phonon dispersion curves and phonon density of states, since they give a clear view of (i) the proper symmetrization of the FF and (ii) the correct description of the elastic properties of the material in the harmonic approximation. The main challenge for FFs is describing both collective low-frequency phonon modes and the local high-frequency ones with equal accuracy. In Fig. 6 we show the comparison of the BIGDML and DFT generated phonon bands displaying a perfect match, showing a RMSE phonon errors across the Brillouin zone of 0.85 meV for Graphene, 0.35 meV for Na, and 0.38 meV for Pd. These values are comparable to those reported in literature using MLFFs trained on thousands of configurations and hand-crafted datasets⁷⁶, while in our case we only require less than 100 randomly selected training points. Such accuracy originates from the use of a global representation for the supercell which captures local and non-local interactions with high fidelity, a feature that is crucial in describing vibrational properties.

**Fig. 6: Model performance on phonon bands reconstruction.**

Now, we proceed to a more challenging physical test, which is the prediction of properties at finite temperature, where also the anharmonic parts of the PES are important.

Molecular dynamics simulations: graphene

Simulations of graphene at finite temperature using an accurate description of the interatomic forces is a highly relevant topic given the plethora of applications of this material. In particular, a necessary contribution to its realistic description is the inclusion of nuclear quantum effects (NQE). For example, the experimental free energy barrier for the permeability of graphene-based membranes to thermal protons can only be correctly described by including the NQE of the carbon atoms^77,78. In order to corroborate that our graphene BIGDML model is giving the correct physical delocalization of the nuclei, we performed path-integral molecular dynamics (PIMD) simulations at 300 K for a 5 × 5 supercell. In Fig. 7 we compare the distribution of first neighbor interatomic distance r_CC between classical MD (blue) and PIMD (orange), results showing that the fluctuations in r_CC double its value when considering NQE. These findings are in excellent agreement with explicit first-principles PIMD simulations in the literature⁷⁸.

**Fig. 7: Nuclear quantum effects in graphene.**

As an additional robustness test, we have performed extended classical MD simulations at various temperatures using the EAM force field⁷⁹ and a BIGDML model trained on this level of theory, obtaining a perfect match between these two different methods. This further validates the predictive power of our methodology even at long time scales. These results are shown in the Supplementary Fig. 5.

Up to this point, we have performed simulations to validate our models under different conditions. In the next section, we perform predictive simulations, which highlight the potential of BIGDML for novel applications, including unexpected NQE-driven localization of benzene/graphene dynamics and the diffusion of interstitial hydrogen in bulk palladium.

Validation of BIGDML in dynamical simulations of materials

Benzene/graphene. The interaction between different molecules and graphene has been extensively studied, given the potential applications of molecule/graphene systems as electrical and optical materials and even as candidates for drug delivery systems^{80,81,82,83,84,85,86,87,88,89,90}. Of particular interest is the understanding of the effective binding strength and structural fluctuations of adsorbed molecules at finite temperature, which requires long time-scale molecular dynamics simulations, unaffordable when using explicit ab initio calculations. Here we will demonstrate that BIGDML models can be used for studying explicit long-time dynamics of realistic systems such as benzene (Bz) adsorbed on graphene with accurate and converged quantum treatment of both electrons and nuclei (See Fig. 8A). The Bz/graphene system has three minima that resemble those of the benzene dimer: the π − π stacking (parallel-displaced) structure as global minimum and two local minima corresponding to parallel and T-shaped configurations, as displayed in Fig. 8-B³ along with the corresponding structural parameters and adsorption energies computed at the PBE+MBD level of theory^69,70,91. The calculated adsorption energy for the global minimum is in a very good agreement with experimental measurements of 500 ± 80 meV⁹².

**Fig. 8: Dynamical strengthening of non-covalent interactions.**

An extensive amount of studies exist on the implications of NQE on properties of molecules and materials at finite temperature^93,94, however much less is known about the implications of NQE for non-covalent van der Waals (vdW) interactions^3,95. In the particular case of Bz/graphene, considering the translational symmetries of the PES experienced by the Bz molecule as well as thermal fluctuations and its many degrees of freedom, it is to be expected that the Bz dynamics will be highly delocalized. Nonetheless, it was recently reported that the inclusion of NQE in a molecular dimer can considerably enhance intermolecular vdW interactions³. However, the adsorption/binding energy ratio between Bz/graphene and Bz/Bz system is ${E}_{ads}^{Bz/graphene}/{E}_{int}^{Bz/Bz}\approx$4, therefore it is not clear how NQEs will affect such strongly interacting vdW systems.

In order to assess the role of temperature and NQE for Bz/graphene, in Fig. 8C we present the results obtained from classical MD and PIMD simulations at 300K using a BIGDML FF trained at the PBE+MBD level of theory. At this temperature, the benzene molecule tends to mostly populate configurations at an angle of ≈ 10^∘ relative to the graphene normal vector in both cases (see Fig. 8A). Nevertheless, classical MD simulations explore substantially wider regions of the PES, reaching angles of up to 80^∘, close to the T-shaped minimum. In contrast, PIMD simulations yield a localized sampling of θ with a maximum angle of ≈ 30^∘. To understand the origin of this localization, we have systematically increased the “quantumness” of the system by raising the number of beads in the PIMD simulations to converge towards the exact treatment of NQE. This approach provides concrete evidence of the progressive localization of the benzene normal orientation as the NQE increase (see Supplementary Fig. 3). The physical origin of this phenomenon is the NQE-induced interatomic bond dilation, where the zero-point energy generated by NQE drive the system beyond the harmonic oscillation regime. The intramolecular delocalization produces effective molecular volume dilation and increases the average polarizability of benzene and graphene rings, akin to a recent analysis of non-covalent interactions between molecular dimers upon constraining their center of mass³. In contrast, in this work no constraints were imposed on the Bz/graphene system, suggesting that the Bz molecule localization on graphene should be observable in experiments. In order to further rationalize the NQE-induced stabilization of vdW interactions, we have computed the vdW interaction energy as a function of compression/dilation of the Bz molecule on graphene and found a linear dependence between dilation and vdW interaction (see Supplementary Fig. 4). This analysis fully supports our hypothesis of NQE-induced stabilization and dynamical localization.

The rather fundamental nature of the underlying physical phenomenon of NQE-induced stabilization suggests that many polarizable molecules interacting with surfaces will exhibit a similar dynamical localization effect. It is worth mentioning that a thorough analysis of the Bz/graphene system demands extensive simulations, which are now made accessible due to the computational efficiency and accuracy of the BIGDML model. Our modeling could also be applied to larger molecules with peculiar behavior under applied external forces⁹⁶.

Hydrogen interstitial in bulk palladium. Hydrogen has become a promising alternative to fossil fuels as a cleaner energy source. Nevertheless, finding a safe, economical and high-energy-density hydrogen storage medium remains a challenge⁹⁷. One of the proposed methods is to store hydrogen in interstitial sites of the crystal lattices of bulk metals^97,98,99. Among these metals, palladium has been widely researched as a candidate, since it can absorb large quantities of hydrogen in a reversible manner⁹⁸.

Characterizing the diffusion of hydrogen in the crystal lattices at different temperatures is crucial to assess their performance as storage materials. Hence, in this section we study a system consisting of a hydrogen atom interstitial in bulk palladium with a cubic supercell containing 32 Pd atoms with full symmetry group ${T}_{2\times 2\times 2}^{(3)}\otimes {O}_{h}$, and described at the DFT-PBE level of theory (See Methods section for more details). The BIGDML learning curve for this system in presented in Fig. 2. Within the FCC lattice there are two possible cavities for hydrogen atoms storage: the octahedral (O-sites) and the tetrahedral (T-sites) cavities (See Fig. 9A-top), where the O-site is the global minimum⁹⁸ and it is separated from the T-site by an energetic barrier of ≈ 160 meV as shown in Fig. 9A-bottom. Additionally, from this figure, we can see the excellent agreement between BIGDML model and the reference DFT calculations.

**Fig. 9: Interstitial hydrogen diffusivity in bulk Pd.**

Kimizuka et al.⁹⁸ reported a study based on transition state theory (TST) suggesting that not only the inclusion of the NQE has indeed a strong effect on the H-atom diffusion, but also they reported that NQE hinder the migration from O-site to T-site. In order to elucidate realistic dynamics of the H atom in the metal lattice and the impact of the NQE without relying on approximations such as TST, we performed direct classical MD and PIMD simulations at different temperatures (from 100 K to 1000 K) (see “Methods” for more details). We first studied the NQE-induced statistical sampling of the hydrogen atom in each cavity as shown in Fig. 9B. In Supplementary Videos 1–4, we can see an animated version of this figure, where the shape of the sampled volume and how it changes as a function of the temperature and by the inclusion of the nuclear quantum effects is displayed. This helps us to visualize hydrogen dynamics in the temperature range from 100 to 300 K and to determine the shape of the cavity, which transforms from a cube to a much larger truncated octahedron as the temperature increases.

Then, from the generated (classical and quantum) trajectories we have estimated the diffusivity of the hydrogen atom as a function of the temperature, which are shown in Fig. 9C along with TST results and experimental data. Usually, this quantity is estimated by using transition rate theory, which only considers the energetics of the system (energy barrier and relative energy between adjacent states)⁹⁸. A more robust methodology is to directly compute the diffusivity from the molecular dynamics results using the mean-square displacement analysis¹⁰⁰, an option which MLFFs make feasible due to the long-time simulations required while keeping ab initio accuracy. From these results, we observe an Arrhenius temperature dependence for the diffusivity (D(T) = D₀e^−Q/kT) in both the classical MD and the PIMD cases, which is expected in the range of temperatures considered. While the TST-PIMD results by Kimizuka et al.⁹⁸ (D₀ = 9.90 × 10⁻⁷ m²/s, Q = 0.23 eV) accurately reproduce the experimental activation barrier Q for H in Pd, they considerably overestimate the value of the pre-exponential factor D₀. In our case, the diffusivity of H in Pd at lower temperatures is overestimated by classical MD (D₀ = 0.95 × 10⁻⁷ m²/s, Q = 0.151 eV), but it is close to the reported values at high temperatures. Meanwhile, both the activation barrier (Q = 0.231 eV) and the pre-exponential factor (D₀ = 2.70 × 10⁻⁷ m²/s) calculated using BIGDML@PIMD are in excellent agreement with the experimental data at all temperatures.

The results presented in this section demonstrate how BIGDML enables long PIMD simulations to obtain novel insights into dynamical behaviour of intricate materials containing vacancies or interstitial atoms.

Discussion

In this work, we introduced the BIGDML approach—a MLFF for materials that is accurate, straightforward to construct, efficient in terms of learning on reference ab initio data, and computationally inexpensive to evaluate. The accuracy and efficiency of the BIGDML method stems from extending the sGDML framework for finite systems^23,42 by employing a global periodic descriptor and making usage of translational and Bravais symmetry groups for materials. The BIGDML approach enables carrying out extended dynamical simulations of materials, while correctly describing all relevant chemical and physical (long-range) interactions in periodic systems contained within the reference data. In principle, once high-level electronic structure force calculations for periodic systems (with CCSD(T) or Quantum Monte Carlo methods) become a reality^39,40,41, the BIGDML method would be an ideal tool to execute highly accurate dynamics of materials. We remark that the molecular sGDML approach has already fulfilled this long-standing goal for molecules with up to a few dozen atoms^3,23.

We have demonstrated the applicability and robustness of the BIGDML method by studying a wide variety of relevant materials and their static and dynamical properties, for example successfully assessing the performance of BIGDML models for physical observables in the harmonic and anharmonic regimes in the form of phonon bands and molecular dynamics simulations. Furthermore, we carried out predictive simulations on interstitial hydrogen diffusion in bulk Pd, as well as accurately capturing intricate van der Waals forces and the dynamics of the interface formed by molecular benzene and 2D graphene layer.

From the practical perspective, the BIGDML approach represents an advantageous framework beyond its accuracy and data efficiency, given that the model generation is a straightforward process starting from the simplicity of database generation and its out-of-the-box training procedure⁴³. From the deployment point of view, to illustrate the gain in computational speed, we remark that for benzene/graphene we gain a factor of 50,000 for computing atomic forces with BIGDML when compared to the PBE + MBD level of electronic-structure theory. This gain would further increase when using a higher level of quantum-mechanical methods for generating reference data.

Many powerful MLFFs for materials have been proposed, and some are already widely used for materials modelling^54,101. In order to embed the BIGDML model into the current context of MLFFs for materials, it is convenient to address some of the limitations that current methodologies face, as well as to discuss goals to pursue with the next generation of MLFFs in materials science.

All current MLFFs for materials known to the authors employ the locality approximation, i.e. they build a model for an energy of an atom in a certain chemical environment, which is defined by a cutoff function. The typical employed cutoffs are of 3–8Å, being of a rather short range. Increasing the cutoff does not necessarily lead to a better model, because electronic interactions exhibit hard-to-learn multiscale structure⁹. From the practical point of view, trying to embed more information in the atomic representation by increasing the cutoff radius leads to other problems such as learning capacity limitations and, in the case of neural networks, their inherent difficulty to correctly represent and propagate multiscale interactions. In addition, different interaction scales are mutually coupled. An attractive feature of the locality approximation is that in principle, the short-range interactions are transferable to different systems. However, in practice this is not a general finding. For example, it was shown that a general-purpose GAP/SOAP MLFF for carbon³⁵ yields errors an order of magnitude higher in graphene compared to the same methodology trained specifically on graphene data³⁶. In addition, local MLFFs typically decouple interaction potentials of different atoms by assigning atom types. For example, carbon in benzene and carbon in graphene could be treated as different atom types. Obviously, such decoupling makes the learning problem harder because more data is necessary to “restore the coupling” between different atomic species.

BIGDML provides a robust solution for both problems of localization by using a global descriptor with periodic boundary conditions. This allows BIGDML to capture interactions at all relevant length scales by virtue of coupling all atomic coordinates. However, such a prominent feature comes with a limitation, since transferablity between different systems is not easy to achieve. The current approach to achieve transferable MLFFs is the localization of interactions. Thereby, transferability remains as one of the main challenges to be addressed for future generations of BIGDML. Nevertheless, the BIGDML framework is envisioned as a base model to further develop towards constructing more general and efficient force fields for materials modelling. Two possible avenues to address transferability are: (1) to use a construction approach where smaller models are combined to approximate larger supercells and (2) Localize the global descriptor to simplify interactions. In the first approach, models trained on small supercells, could be combined to span larger supercells in an approximate way by means of a convolution approach: ${\hat{f}}_{{{{{{{{\rm{large}}}}}}}}}\approx {\rho }_{{{{{{{{\rm{large}}}}}}}}}* {\hat{f}}_{{{{{{{{\rm{small}}}}}}}}}$, where ρ_large is the atomic distribution in the larger supercell, this being one approach towards larger and transferable composite models. In the second option, the mathematical form of the BIGDML predictor can be reformulated by, for example, reducing the many-body complexity of the ${{{{{{{{\mathcal{D}}}}}}}}}^{{{{{{{{\rm{PBC}}}}}}}}}$ descriptor to keep interactions only up to a certain body order. This approach has been proven to give good results in molecules^102,103. Despite current limitations of BIGDML, having access to a MLFF that can robustly represent global interactions in extended materials is a substantial achievement, as shown via extensive simulations in Section. In addition, we should stress that BIGDML has a superior learning capacity compared to local MLFFs, since it can reach generalization accuracies of up to two orders of magnitude better than localized MLFFs (see Fig. 3).

Another crucial aspect of MLFFs is their data efficiency and ability to correctly capture all relevant symmetries for a given system. Symmetries play a crucial role when studying nuclear displacements (phonons, thermal conductivities, etc). BIGDML addresses both of these challenges at the same time. The symmetries are obtained from the periodic cell and the reference geometries in a data-driven way. Symmetries are known to effectively reduce the complexity of the learning problem (cf. refs. ^46,104), as we have shown by introducing energy conservation (i.e. time homogeneity)⁴² and molecular point groups for finite molecular systems²³. Periodic systems have even more symmetries than molecules, making the force field reconstruction effectively a lower-dimensional task. While this qualitative outcome could have been expected prior to the formulation of BIGDML, the enormous practical advantage of incorporating crystalline symmetries is remarkable. Even a few dozen samples (atomic forces for a few unit cell geometries) already yield BIGDML models that can be used in practical applications of molecular dynamics.

We would like to remark further that while BIGDML is a kernel-based approach (see e.g. refs. ^{105,106,107,108}), able to include symmetries and prior physical information, it will be an interesting and important challenge to transfer the learning machinery established here also to deep learning approaches (such as convolution neural networks, graph neural networks or even generative adversarial models) ideally by incorporating symmetries, prior physical knowledge and equivariance constructions into their architecture (see refs. ^{24,34,109,110} for some first steps in this direction).

In this work, we have focused on materials with fixed supercell size and shape, nevertheless, a large number of physical phenomena in materials involve phase transitions and symmetry breaking. These systems represent a challenge to be addressed and requires developing further the ideas introduced in the BIGDML model. In this regard, the mathematical structure of the BIGDML framework has the foundations to allow the study of systems with flexible supercell (i.e. changing supercell volume and/or lattice vectors). This is because the defined metric (Euclidean distance between two structures) by the global representation does not depend on the particular selection of the lattice vectors. Meaning that, if we have two structure configurations X₁ and X₂ with lattice vectors a₁ and a₂, respectively, their distance in descriptor space $\parallel {{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\rm{lattice}}}}}}}}}_{1}}^{{{{{{{{\rm{(PBC)}}}}}}}}}({{{{{{{{\bf{X}}}}}}}}}_{1})-{{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\rm{lattice}}}}}}}}}_{2}}^{{{{{{{{\rm{(PBC)}}}}}}}}}({{{{{{{{\bf{X}}}}}}}}}_{2})| |$ is well-defined. Hence, exploiting such invariance in the metric could allow the description of materials with fluctuating lattice vectors, as it would be the case, for example, in simulations of materials described by the NpT ensemble.

Another challenge to be addressed is the need to describe even larger systems. In Section -D, we have already shown that it is possible to accurately describe highly fluxional structures with supercells containing 160 atoms and trained on up to 1000 structures. Given this evidence, the only limitation when moving to materials containing hundreds of atoms per supercell is memory requirements, a problem that is solved by using numerical solvers as it is done for training neural networks (see e.g. ref. ¹¹¹). Nevertheless, the BIGDML framework requires multiple extensions to scale up to much larger systems (with thousands of atoms per unit cell) or systems with a larger number of symmetries.

With the advent of new advanced materials such as high performance perovskite solar cells, topological insulators and van der Waals materials, it is crucial to construct reliable MLFFs capable of dynamical simulations at the highest level of accuracy given by electronic-structure theories while maintaining relatively low computational cost. While local MLFFs and BIGDML are complementary approaches, we would like to emphasize that global representations and symmetries could also be readily incorporated in other MLFF models. The challenge of developing accurate, efficient, scalable, and transferable MLFFs valid for molecules, materials, and interfaces thereof suggests the need for many further developments aiming towards universally applicable MLFF models.

Methods

Data generation and DFT calculations

Given the different types of calculations and materials in this work, we present the details of the data generation, model training and simulations organized per system. All the databases were generated using molecular dynamics simulations using the NVT thermostat.

Graphene. Here we used a 5 × 5 supercell at the DFT level of theory at the generalized gradient approximation (GGA) level of theory with the Perdew-Burke-Ernzerhof (PBE)⁹¹ exchange-correlation functional, We performed the calculation in the Quantum Espresso^112,113 software suite, using plane-waves with ultrasoft pseudopotentials and scalar-relativistic corrections. We used an energy cutoff of 40 Ry. A uniform 3 × 3 × 1 Monkhorst-Pack grid of k-points was used to integrate over the Brillouin zone. The ab initio MD (AIMD) used to generate the database was ran at 500 K during 10,000 time steps using an integration step of 0.5 fs. The results displayed in Fig. 7 were performed using PIMD simulations with 32 beads, and we ran the simulation for 300 ps using an integration step of 0.5 fs.

Pd₁/MgO. In this case, we used a 2 × 2 supercell with 3 atomic layers to model the MgO (100) surface. The calculations were performed in Quantum Espresso, using an energy cutoff of 50 Ry and integrating over the Brillouin zone at the Γ-point only. For this system, we ran an AIMD at 500 K with an integration step of 1.0 fs during 10,000 integration steps to generate the material’s database.

Benzene/graphene. For this particular example, we have used the same graphene supercell mentioned above and placed a benzene molecule on top. In order to include the correct non-covalent interactions between the benzene molecule and the graphene layer, we have used an all-electrons DFT/PBE level of theory with the many body dispersion (MBD)^69,70 treatment of the van der Waals interaction using the FHI-aims¹¹⁴ code. The AIMD simulation for the system’s database constructions was performed at 500 K using an integration step of 1.0 fs during 15,000 steps. The results displayed in Fig. 8 were performed using PIMD simulations using 1, 8, 16 and 32 beads (in order to guarantee that we have achieved converged NQE) and we ran the simulation for 200 ps using an integration step of 0.5 fs.

Bulk metals. In this case, we were interested in a variety of materials and their different interactions. Then, we have considered Pd[FCC] and Na[BCC] described at the DFT/PBE level of theory using the Quantum Espresso software. The databases were created by running AIMD simulations at 500 K and 1000 K for Pd, and 300 K for Na using a time steps of 1.0 fs for all the simulations. Monkhorst-Pack grids of 3 × 3 × 3 k-points were used to integrate over the Brillouin zone for all materials. All calculations for the bulk metals were spin-polarized.

H in Pd[FCC]. In this case we used a supercell of 3 × 3 × 3 with 32 Pd atoms and a single hydrogen atom described by DFT/PBE level of theory using the Quantum Espresso software. The database was generated by running AIMD at 1000 K. We used time steps of 1.0 fs and a total dynamics of 6 ps. Monkhorst-Pack grids of 3 × 3 × 3 k-points were used to integrate over the Brillouin zone for all materials. The results shown in Fig. 9 we obtained by running classical MD and PIMD simulations using an interface of the BIGDML FF with the i-PI simulation package¹¹⁵. We ran the simulations at various temperatures from 300 K to 1000 K. In each case, we employed a time step of 2.0 fs during 2,000,000 steps, for a total simulation time of 4 ns. For the PIMD simulations we used a different number of beads for each temperature: 32 for 100, 300, and 600 K; 24 for 400 K; 2 for 700 K; and 4 for 800 K. Using this data we were able to compute the H diffusivity as a function of the temperature.

The sGDML framework

A data efficient reconstruction of accurate FFs with ML hinges on including the right inductive biases in the model to compensate for finite reference dataset sizes. The Symmetric Gradient-Domain Machine Learning (sGDML) framework achieves this through constraints derived from exact physical laws^23,42,43. In additional to the basic roto-translational invariance of energy, sGDML implements energy conservation, a fundamental property of closed classical and quantum mechanical systems. The key idea behind sGDML is to define a Gaussian Process (GP) using a kernel ${{{{{{{\bf{k}}}}}}}}\left({{{{{{{\bf{x}}}}}}}},{{{{{{{{\bf{x}}}}}}}}}^{\prime}\right)={\nabla }_{{{{{{{{\bf{x}}}}}}}}}{k}_{E}\left({{{{{{{\bf{x}}}}}}}},{{{{{{{{\bf{x}}}}}}}}}^{\prime}\right){\nabla }_{{{{{{{{{\bf{x}}}}}}}}}^{\prime}}^{\top }$ that models any force field f_F as a transformation of some unknown PES f_E such that,

$${{{{{{{{\bf{f}}}}}}}}}_{{{{{{{{\bf{F}}}}}}}}}=-\nabla {f}_{E} \sim {{{{{{{\mathcal{G}}}}}}}}{{{{{{{\mathcal{P}}}}}}}}\left[-\nabla {\mu }_{E}({{{{{{{\bf{x}}}}}}}}),{\nabla }_{{{{{{{{\bf{x}}}}}}}}}{k}_{E}\left({{{{{{{\bf{x}}}}}}}},{{{{{{{{\bf{x}}}}}}}}}^{\prime}\right){\nabla }_{{{{{{{{{\bf{x}}}}}}}}}^{\prime}}^{\top }\right].$$

(2)

Here, ${\mu }_{E}:{{\mathbb{R}}}^{d}\to {\mathbb{R}}$ and ${k}_{E}:{{\mathbb{R}}}^{d}\times {{\mathbb{R}}}^{d}\to {\mathbb{R}}$ are the prior mean and prior covariance (kernel) functions of the latent energy GP-predictor, respectively. Regarding the functional form of the kernel, the sGDML framework uses the Matérn covariance function with restricted differentiability to second order: ${k}_{E}(d)={\sigma }^{2}(1+d+\frac{1}{3}{d}^{2}){e}^{-d}$, $d=\frac{\sqrt{5}| {{{{{{{\bf{x}}}}}}}}-{{{{{{{{\bf{x}}}}}}}}}^{\prime}| }{\rho }$, where σ and ρ are the normalization and scale parameters, respectively.

Each molecular geometry R represented in descriptor space by ${{{{{{{\bf{x}}}}}}}}={{{{{{{{\mathcal{D}}}}}}}}}^{({{{{{{{\rm{PBC}}}}}}}})}({{{{{{{\bf{R}}}}}}}})$ is encoded using the proposed descriptor (see Eq. (1)). A sGDML FF for a particular system is then obtained by solving a linear system for $\overrightarrow{\alpha }$,

$$({{{{{{{\bf{K}}}}}}}}+\lambda {\mathbb{1}})\overrightarrow{\alpha }=-{{{{{{{\bf{F}}}}}}}},$$

(3)

where the set of $\overrightarrow{\alpha }$s are the trainable parameters, ${{{{{{{{\bf{K}}}}}}}}}_{ij}={{{{{{{\bf{k}}}}}}}}\left({{{{{{{{\bf{x}}}}}}}}}_{i},{{{{{{{{\bf{x}}}}}}}}}_{j}\right)$ and F = − ∇ V_BO are the gradients of the PES as specified by the corresponding reference calculations. By construction of the kernel matrix, the resulting model is guaranteed to be integrable, such that the corresponding PES is recovered by

$$\int {{{{{{{{\bf{f}}}}}}}}}_{{{{{{{{\bf{F}}}}}}}}}\,{{{{{{{\rm{d}}}}}}}}{{{{{{{\bf{R}}}}}}}}=-{f}_{E}+c$$

(4)

up to an integration constant c.

The sGDML model additionally incorporates all relevant rigid space group symmetries, as well as dynamic non-rigid symmetries of the system at hand. This is achieved via marginalization of the kernel over the permutation set π ∈ S:

$${{{{{{{{\bf{f}}}}}}}}}_{{{{{{{{\bf{F}}}}}}}}}({{{{{{{\bf{x}}}}}}}})=\,\,\mathop{\sum }\limits_{i=1}^{M}{\overrightarrow{\alpha }}_{i}\mathop{\sum}\limits_{\pi \in S}{{{{{{{\bf{k}}}}}}}}\left({{{{{{{\bf{x}}}}}}}},{{{{{{{{\bf{P}}}}}}}}}_{\pi }{{{{{{{{\bf{x}}}}}}}}}_{i}\right).$$

(5)

Here, M is the number of training datapoints and P_π is the permutation operator in descriptor space. In the original model, these symmetries are automatically recovered as atom-permutations via multi-partite matching of all geometries in the training dataset²³. BIGDML supplements this set by adding permutational symmetries that are unique to periodic systems and were previously not considered.

Now, some aspects on training and deploying sGDML FFs. Solving the linear system Eq. (3) is the computationally most strenuous aspect of the training procedure, as it incurs a cost of ${{{{{{{\mathcal{O}}}}}}}}({(3NM)}^{3})$. Moreover, BIGDML is trained in closed-form (via matrix decomposition), which requires storing the kernel matrix at ${{{{{{{\mathcal{O}}}}}}}}({(3NM)}^{2})$ memory cost. The inclusion of symmetries only incurs extra linear cost during kernel matrix construction in this scenario, while the training cost remains the same. These economics are ideal for the application to periodic systems, where we can impose a strong inductive prior through the inclusion of large symmetry sets, which allows the number of training points M to remain small. Now, in cases where memory limitations appear, the model can be trained by a numerical solver as in the case of neural networks. This approach allows training much larger models and bigger systems.

Coulomb matrix PBC implementation

The periodic boundary conditions were implemented using the minimum image convention. Under this convention, we take the distance between two atoms to be the shortest distance between their periodic images. We start by expressing the distance vectors d_ij = r_i − r_j in the basis of the simulation supercell lattice vectors as

$${{{{{{{{\bf{d}}}}}}}}}_{ij}={{{{{{{{\bf{Ac}}}}}}}}}_{ij},$$

(6)

where A is a 3 × 3 matrix which contains the lattice (supercell) vectors as columns, and c_ij are the distance vectors in the new basis. We then confine the original distance vectors to the simulation cell,

$${{{{{{{{\bf{d}}}}}}}}}_{ij}^{{{{{{{{\rm{(PBC)}}}}}}}}}={{{{{{{{\bf{d}}}}}}}}}_{ij}-{{{{{{{\bf{A}}}}}}}}{{{{{{{\rm{nint}}}}}}}}({{{{{{{{\bf{c}}}}}}}}}_{ij}),$$

(7)

where nint(x) is the nearest integer function. By replacing the ordinary distance vectors d_ij with ${{{{{{{{\bf{d}}}}}}}}}_{ij}^{{{{{{{{\rm{(PBC)}}}}}}}}}$ in the Coulomb matrix descriptor, it becomes

$${{{{{{{{\mathcal{D}}}}}}}}}_{ij}^{({{{{{{{\rm{PBC}}}}}}}})}=\left\{\begin{array}{ll}\frac{1}{| {{{{{{{{\bf{d}}}}}}}}}_{ij}^{{{{{{{{\rm{(PBC)}}}}}}}}}| } &{{{{{{{\rm{if}}}}}}}}\,i\ne j\hfill\\ 0 \hfill&{{{{{{{\rm{if}}}}}}}}\,i=j\end{array}\right.$$

(8)

In practice, only the d^(PBC) upper triangular matrix is used.

Software: Interface with i-PI

For this work, a highly optimised interface of BIGDML has been implemented in the i-PI molecular dynamics package¹¹⁵. The main features of this implementation are: (1) it allows the use of periodic boundary conditions and stress tensor calculation, (2) parallel querying of all beads at once in PIMD simulations and (3) it uses the highly optimized sGDML GPU implementation in PyTorch to parallelise beads calculations, dramatically increasing the simulation efficiency.

Software: interface with phonopy for phonons

An ASE calculator is already provided by the sGDML package, this allows to use all its simulation options. In particular, the phonon analysis for materials is easily computed in this package using Phonopy¹¹⁶. An example of the scripts used to compute the phonons in this paper is provided in the Supplementary Software.

Training GAP/SOAP models

The GAP models for graphene were trained employing the QUIP software package available at http://www.libatoms.org. All potentials were constructed using the same training datasets prepared to train the BIGDML models in this work. A combination of a two-body (2b), a three-body (3b) and a many-body (SOAP) descriptor were used in the construction of each GAP model. The parameters for the 2b, 3b and SOAP descriptors were the same optimized values used in the work of Rowe et al. to fit their GAP model for graphene³⁶.

Data availability

All datasets used in this work are available at http://www.sgdml.org or http://quantum-machine.org/datasets/. Additional data related to this paper may be requested from the authors.

Code availability

The full documentation of the sGDML software can be found at, http://quantum-machine.org/gdml/doc/ and the code can be downloaded from https://github.com/stefanch/sGDML.

References

Veit, M. et al. Equation of state of fluid methane from first principles with machine learning potentials. J. Chem. Theory Comput. 15, 2574–2586 (2019).
Article CAS PubMed Google Scholar
Cheng, B., Mazzola, G., Pickard, C. J. & Ceriotti, M. Evidence for supercritical behaviour of high-pressure liquid hydrogen. Nature 585, 217–220 (2020).
Article ADS CAS PubMed Google Scholar
Sauceda, H. E., Vassilev-Galindo, V., Chmiela, S., Müller, K.-R. & Tkatchenko, A. Dynamical strengthening of covalent and non-covalent molecular interactions by nuclear quantum effects at finite temperature. Nat. Commun. 12, 442 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Deringer, V. L. et al. Origins of structural and electronic transitions in disordered silicon. Nature 589, 59–64 (2021).
Article ADS CAS PubMed Google Scholar
Ladygin, V., Korotaev, P., Yanilkin, A. & Shapeev, A. Lattice dynamics simulation using machine learning interatomic potentials. Comput. Mater. Sci. 172, 109333 (2020).
Article CAS Google Scholar
Smith, J. S., Isayev, O. & Roitberg, A. E. Ani-1: an extensible neural network potential with dft accuracy at force field computational cost. Chem. Sci. 8, 3192–3203 (2017).
Article CAS PubMed PubMed Central Google Scholar
Noé, F., Tkatchenko, A., Müller, K.-R. & Clementi, C. Machine learning for molecular simulation. Annu. Rev. Phys. Chem. 71, 361–390 (2020).
Article PubMed CAS Google Scholar
Tkatchenko, A. Machine learning for chemical discovery. Nat. Commun. 11, 4125 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Unke, O. T. et al. Machine learning force fields. Chem. Rev. 121, 10142–10186 (2021).
Article CAS PubMed PubMed Central Google Scholar
Von Lilienfeld, O. A. Quantum machine learning in chemical compound space. Angew. Chem. Int. Ed. 57, 4164–4169 (2018).
Article CAS Google Scholar
Schütt, K. T. et al. Machine Learning Meets Quantum Physics, vol. 968 (Springer Lecture Notes in Physics, 2020).
Musil, F. et al. Physics-inspired structural representations for molecules and materials. Chem. Rev. 121, 9759–9815 (2021).
Article CAS PubMed Google Scholar
von Lilienfeld, O. A. & Burke, K. Retrospective on a decade of machine learning for chemical discovery. Nat. Commun. 11, 4895 (2020).
Article ADS CAS Google Scholar
Keith, J. A. et al. Combining machine learning and computational chemistry for predictive insights into chemical systems. Chem. Rev. 121, 9816–9872 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gao, W., Mahajan, S. P., Sulam, J. & Gray, J. J. Deep learning in protein structural modeling and design. Patterns 1, 100142 (2020).
Article PubMed PubMed Central Google Scholar
Noé, F., De Fabritiis, G. & Clementi, C. Machine learning for protein folding and dynamics. Curr. Opin. Struc. Biol. 60, 77–84 (2020).
Article CAS Google Scholar
Ghasemi, S. A., Hofstetter, A., Saha, S. & Goedecker, S. Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network. Phys. Rev. B 92, 045131 (2015).
Article ADS CAS Google Scholar
Novikov, I. S., Gubaev, K., Podryabinkin, E. V. & Shapeev, A. V. The MLIP package: moment tensor potentials with MPI and active learning. Mach. Learn.: Sci. Technol. 2, 025002 (2021).
Google Scholar
Artrith, N., Urban, A. & Ceder, G. Constructing first-principles phase diagrams of amorphous lixsi using machine-learning-assisted sampling with an evolutionary algorithm. J. Chem. Phys. 148, 241711 (2018).
Article ADS PubMed CAS Google Scholar
Byggmästar, J., Nordlund, K. & Djurabekova, F. Gaussian approximation potentials for body-centered-cubic transition metals. Phys. Rev. Materials. 4, 093802 (2020).
Article ADS Google Scholar
Bartók, A. P., Kermode, J., Bernstein, N. & Csányi, G. Machine learning a general-purpose interatomic potential for silicon. Phys. Rev. X. 8, 041048 (2018).
Google Scholar
Bartók, A. P. et al. Machine learning unifies the modeling of materials and molecules. Sci. Adv. 3, e1701816 (2017).
Article ADS PubMed PubMed Central Google Scholar
Chmiela, S., Sauceda, H. E., Müller, K.-R. & Tkatchenko, A. Towards exact molecular dynamics simulations with machine-learned force fields. Nat. Commun. 9, 3887 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Unke, O. T. & Meuwly, M. Physnet: A neural network for predicting energies, forces, dipole moments, and partial charges. J. Chem. Theory Comput. 15, 3678–3693 (2019).
Article CAS PubMed Google Scholar
Devereux, C. et al. Extending the applicability of the ani deep learning molecular potential to sulfur and halogens. J. Chem. Theo. Comp. 16, 4192–4202 (2020).
Article CAS Google Scholar
Behler, J. Constructing high-dimensional neural network potentials: a tutorial review. Int. J. Quantum Chem. 115, 1032–1050 (2015).
Article CAS Google Scholar
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature 559, 547–555 (2018).
Article ADS CAS PubMed Google Scholar
Wallace, S. K. et al. Modeling the high-temperature phase coexistence region of mixed transition metal oxides from ab initio calculations. Phys. Rev. Res.3, 013139 (2021).
Article CAS Google Scholar
von Lilienfeld, O. A., Müller, K.-R. & Tkatchenko, A. Exploring chemical compound space with quantum-based machine learning. Nat. Rev. Chem. 4, 347–358 (2020).
Article Google Scholar
Seema, P., Behler, J. & Marx, D. Peeling by nanomechanical forces: a route to selective creation of surface structures. Phys. Rev. Lett. 115, 036102 (2015).
Article ADS PubMed CAS Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. Schnet– a deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
Article ADS PubMed CAS Google Scholar
Deringer, V. L., Caro, M. A. & Csányi, G. A general-purpose machine-learning force field for bulk and nanostructured phosphorus. Nat. Commun. 11, 5461 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Unke, O. T. et al. Spookynet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 12, 7273 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Rowe, P., Deringer, V. L., Gasparotto, P., Csányi, G. & Michaelides, A. An accurate and transferable machine learning potential for carbon. J. Chem. Phys. 153, 034702 (2020).
Article CAS PubMed Google Scholar
Rowe, P., Csányi, G., Alfè, D. & Michaelides, A. Development of a machine learning potential for graphene. Phys. Rev. B 97, 054303 (2018).
Article ADS Google Scholar
Behler, J. First principles neural network potentials for reactive simulations of large molecular and condensed systems. Angew. Chem. Int. Ed. 56, 12828–12840 (2017).
Article CAS Google Scholar
Artrith, N. & Behler, J. High-dimensional neural network potentials for metal surfaces: A prototype study for copper. Phys. Rev. B 85, 045439 (2012).
Article ADS CAS Google Scholar
Booth, G. H., Grüneis, A., Kresse, G. & Alavi, A. Towards an exact description of electronic wavefunctions in real solids. Nature 493, 365–370 (2013).
Article ADS CAS PubMed Google Scholar
Gruber, T., Liao, K., Tsatsoulis, T., Hummel, F. & Grüneis, A. Applying the coupled-cluster ansatz to solids and surfaces in the thermodynamic limit. Phys. Rev. X. 8, 021043 (2018).
CAS Google Scholar
Zen, A. et al. Fast and accurate quantum monte carlo for molecular crystals. Proc. Natl. Acad. Sci. 115, 1724–1729 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Chmiela, S. et al. Machine learning of accurate energy-conserving molecular force fields. Sci. Adv. 3, e1603015 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Chmiela, S., Sauceda, H. E., Poltavsky, I., Müller, K.-R. & Tkatchenko, A. sgdml: Constructing accurate and data efficient molecular force fields using machine learning. Comput. Phys. Commun. 240, 38–45 (2019).
Article CAS Google Scholar
Montavon, G. et al. Learning invariant representations of molecules for atomization energy prediction. Adv. Neural Inf. Process. Sys. 25, 440–448 (2012).
Google Scholar
Montavon, G. et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 15, 095003 (2013).
Article ADS CAS Google Scholar
Anselmi, F., Rosasco, L. & Poggio, T. On invariance and selectivity in representation learning. Inf. Inference: A J. IMA. 5, 134–158 (2016).
Article MathSciNet MATH Google Scholar
Poggio, T. & Anselmi, F.Visual cortex and deep networks: learning invariant representations (MIT Press 2016).
Rupp, M., Tkatchenko, A., Müller, K.-R. & von Lilienfeld, O. A. Fast and accurate modeling of molecular atomization energies with machine learning. Phys. Rev. Lett. 108, 58301 (2012).
Article ADS CAS Google Scholar
Sauceda, H. E., Chmiela, S., Poltavsky, I., Müller, K.-R. & Tkatchenko, A. Molecular force fields with gradient-domain machine learning: Construction and application to dynamics of small molecules with coupled cluster forces. J. Chem. Phys. 150, 114102 (2019).
Article ADS PubMed CAS Google Scholar
Hloucha, M. & Deiters, U. K. Fast coding of the minimum image convention. Mol. Simul. 20, 239–244 (1998).
Article CAS Google Scholar
Chmiela, S.Towards exact molecular dynamics simulations with invariant machine-learned models. Doctoral thesis, Technische Universität Berlin, Berlin (2019). https://doi.org/10.14279/depositonce-8635.
Faber, F., Lindmaa, A., von Lilienfeld, O. A. & Armiento, R. Crystal structure representations for machine learning models of formation energies. Int. J. Quantum Chem. 115, 1094–1101 (2015).
Article CAS Google Scholar
Willatt, M. J., Musil, F. & Ceriotti, M. Atom-density representations for machine learning. J.Chem. Phys. 150, 154110 (2019).
Article ADS PubMed CAS Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article ADS PubMed CAS Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Article ADS CAS Google Scholar
Huo, H. & Rupp, M. Unified representation of molecules and crystals for machine learning. arXiv:1704.06439 (2017).
Schütt, K. T. et al. How to represent crystal structures for machine learning: towards fast prediction of electronic properties. Phys. Rev. B 89, 205118 (2014).
Article ADS CAS Google Scholar
Faber, F. A., Christensen, A. S., Huang, B. & von Lilienfeld, O. A. Alchemical and structural distribution based representation for universal quantum machine learning. J. Chem. Phys. 148, 241717 (2018).
Article ADS PubMed CAS Google Scholar
Li, Z., Kermode, J. R. & De Vita, A. Molecular dynamics with on-the-fly machine learning of quantum-mechanical forces. Phys. Rev. Lett. 114, 096405 (2015).
Article ADS PubMed CAS Google Scholar
Pronobis, W., Tkatchenko, A. & Müller, K.-R. Many-body descriptors for predicting molecular properties with machine learning: Analysis of pairwise and three-body interactions in molecules. J. Chem. Theory Comput. 14, 2991–3003 (2018).
Article CAS PubMed Google Scholar
Sólyom, J. Fundamentals of the Physics of Solids: 1st edn,Vol. I: Structure and Dynamics (Springer, 2008).
Zhang, I. Y. & Grüneis, A. Coupled cluster theory in materials science. Front. Mater. 6, 123 (2019).
Article ADS Google Scholar
Yoon, D., Son, Y.-W. & Cheong, H. Negative thermal expansion coefficient of graphene measured by raman spectroscopy. Nano Lett. 11, 3227–3231 (2011).
Article ADS CAS PubMed Google Scholar
Fan, Y., Xiang, Y. & Shen, H. Temperature-dependent negative poisson’s ratio of monolayer graphene: Prediction from molecular dynamics simulations. Nanotechnol. Rev. 8, 415–421 (2019).
Article CAS Google Scholar
Yang, X.-F. et al. Single-atom catalysts: A new frontier in heterogeneous catalysis. Acc. Chem. Res. 46, 1740–1748 (2013).
Article CAS PubMed Google Scholar
Wang, A., Li, J. & Zhang, T. Heterogeneous single-atom catalysis. Nat. Rev. Chem. 2, 65–81 (2018).
Article ADS CAS Google Scholar
Doherty, F., Wang, H., Yang, M. & Goldsmith, B. R. Nanocluster and single-atom catalysts for thermocatalytic conversion of co and co2. Catal. Sci. Technol. 10, 5772–5791 (2020).
Article CAS Google Scholar
Tkatchenko, A. & Scheffler, M. Accurate molecular van der waals interactions from ground-state electron density and free-atom reference data. Phys. Rev. Lett. 102, 073005 (2009).
Article ADS PubMed CAS Google Scholar
Tkatchenko, A., DiStasio, R. A., Car, R. & Scheffler, M. Accurate and efficient method for many-body van der waals interactions. Phys. Rev. Lett. 108, 236402 (2012).
Article ADS PubMed CAS Google Scholar
Ambrosetti, A., Reilly, A. M., DiStasio, R. A. & Tkatchenko, A. Long-range correlation energy calculated from coupled atomic response functions. J. Chem. Phys. 140, 18A508 (2014).
Article PubMed CAS Google Scholar
Ruiz, V. G., Liu, W., Zojer, E., Scheffler, M. & Tkatchenko, A. Density-functional theory with screened van der waals interactions for the modeling of hybrid inorganic-organic systems. Phys. Rev. Lett. 108, 146103 (2012).
Article ADS PubMed CAS Google Scholar
Hermann, J. & Tkatchenko, A. Density functional model for van der waals interactions: Unifying many-body atomic approaches with nonlocal functionals. Phys. Rev. Lett. 124, 146401 (2020).
Article ADS MathSciNet CAS PubMed Google Scholar
Cleri, F. & Rosato, V. Tight-binding potentials for transition metals and alloys. Phys. Rev. B 48, 22–33 (1993).
Article ADS CAS Google Scholar
Daw, M. S., Foiles, S. M. & Baskes, M. I. The embedded-atom method: a review of theory and applications. Mat. Sci. Eng. Rep. 9, 251 – 310 (1993).
Google Scholar
Sauceda, H. E. & Garzón, I. L. Structural determination of metal nanoparticles from their vibrational (phonon) density of states. J. Phys. Chem. C 119, 10876–10880 (2015).
Article CAS Google Scholar
George, J., Hautier, G., Bartók, A. P., Csányi, G. & Deringer, V. L. Combining phonon accuracy with high transferability in gaussian approximation potential models. J. Chem. Phys. 153, 044104 (2020).
Article ADS CAS PubMed Google Scholar
Lozada-Hidalgo, M. et al. Sieving hydrogen isotopes through two-dimensional crystals. Science 351, 68–70 (2016).
Article ADS CAS PubMed Google Scholar
Poltavsky, I., Zheng, L., Mortazavi, M. & Tkatchenko, A. Quantum tunneling of thermal protons through pristine graphene. J. Chem. Phys. 148, 204707 (2018).
Article ADS PubMed CAS Google Scholar
Tadmor, E. EAM potential (LAMMPS cubic hermite tabulation) for Pd developed by Zhou, Johnson, and Wadley (2004); NIST retabulation v000. OpenKIM, https://doi.org/10.25950/9edc9c7c (2018).
Gowtham, S., Scheicher, R. H., Ahuja, R., Pandey, R. & Karna, S. P. Physisorption of nucleobases on graphene: Density-functional calculations. Phys. Rev. B 76, 033401 (2007).
Article ADS CAS Google Scholar
Varghese, N. et al. Binding of dna nucleobases and nucleosides with graphene. Chem. Phys. Chem. 10, 206–210 (2009).
Article CAS PubMed Google Scholar
AlZahrani, A. First-principles study on the structural and electronic properties of graphene upon benzene and naphthalene adsorption. Appl. Surf. Sci. 257, 807–810 (2010).
Article CAS Google Scholar
Gan, T. & Hu, S. Electrochemical sensors based on graphene materials. Microchim. Acta 175, 1 (2011).
Article CAS Google Scholar
Mohapatra, B. D. et al. Stimulation of electrocatalytic oxygen reduction activity on nitrogen doped graphene through noncovalent molecular functionalisation. Chem. Commun. 52, 10385–10388 (2016).
Article CAS Google Scholar
Chakradhar, A., Sivapragasam, N., Nayakasinghe, M. T. & Burghaus, U. Adsorption kinetics of benzene on graphene: An ultrahigh vacuum study. J. Vac. Sci. Technol. A 34, 021402 (2016).
Article CAS Google Scholar
Roychoudhury, S., Motta, C. & Sanvito, S. Charge transfer energies of benzene physisorbed on a graphene sheet from constrained density functional theory. Phys. Rev. B 93, 045130 (2016).
Article ADS CAS Google Scholar
Tonel, M. Z., Lara, I. V., Zanella, I. & Fagan, S. B. The influence of the concentration and adsorption sites of different chemical groups on graphene through first principles simulations. Phys. Chem. Chem. Phys. 19, 27374–27383 (2017).
Article CAS PubMed Google Scholar
Tonel, M. Z., Martins, M. O., Zanella, I., Pontes, R. B. & Fagan, S. B. A first-principles study of the interaction of doxorubicin with graphene. Comput. Theor. Chem. 1115, 270–275 (2017).
Article CAS Google Scholar
de Moraes, E. E., Tonel, M. Z., Fagan, S. B. & Barbosa, M. C. Density functional theory study of Π-aromatic interaction of benzene, phenol, catechol, dopamine isolated dimers and adsorbed on graphene surface. J. Mol. Model. 25, 302 (2019).
Article PubMed CAS Google Scholar
Ojaghlou, N., Bratko, D., Salanne, M., Shafiei, M. & Luzar, A. Solvent-solvent correlations across graphene: The effect of image charges. ACS Nano 14, 7987–7998 (2020).
Article CAS PubMed Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Article ADS CAS PubMed Google Scholar
Zacharia, R., Ulbricht, H. & Hertel, T. Interlayer cohesive energy of graphite from thermal desorption of polyaromatic hydrocarbons. Phys. Rev. B 69, 155406 (2004).
Article ADS CAS Google Scholar
Fang, W. et al. Inverse temperature dependence of nuclear quantum effects in dna base pairs. J. Phys. Chem. Lett. 7, 2125–2131 (2016).
Article CAS PubMed PubMed Central Google Scholar
Markland, T. E. & Ceriotti, M. Nuclear quantum effects enter the mainstream. Nat. Rev. Chem. 2, 0109 (2018).
Article CAS Google Scholar
Rossi, M., Fang, W. & Michaelides, A. Stability of complex biomolecular structures: van der waals, hydrogen bond cooperativity, and nuclear quantum effects. J. Phys. Chem. Lett. 6, 4233–4238 (2015).
Article CAS PubMed Google Scholar
Leinen, P. et al. Autonomous robotic nanofabrication with reinforcement learning. Sci. Adv. 6, eabb6987 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Züttel, A. Materials for hydrogen storage. Materials Today 6, 24–33 (2003).
Article Google Scholar
Kimizuka, H., Ogata, S. & Shiga, M. Mechanism of fast lattice diffusion of hydrogen in palladium: Interplay of quantum fluctuations and lattice strain. Phys. Rev. B 97, 014102 (2018).
Article ADS CAS Google Scholar
Jiang, D. E. & Carter, E. A. Diffusion of interstitial hydrogen into and through bcc fe from first principles. Phys. Rev. B 70, 064102 (2004).
Article ADS CAS Google Scholar
Zhou, X. W., Gabaly, F. E., Stavila, V. & Allendorf, M. D. Molecular dynamics simulations of hydrogen diffusion in aluminum. J. Phys. Chem. C 120, 7500–7509 (2016).
Article CAS Google Scholar
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Article ADS PubMed CAS Google Scholar
Pronobis, W.Towards more efficient and performant computations in quantum chemistry with machine learning. Doctoral thesis, Technische Universität Berlin, Berlin. https://doi.org/10.14279/depositonce-9866 (2020).
Kovács, D. P. et al. Linear atomic cluster expansion force fields for organic molecules: beyond rmse. J. Chem. Theory Comput. 17, 7696–7711 (2021).
Article PubMed PubMed Central CAS Google Scholar
Braun, M. L., Buhmann, J. M. & Müller, K.-R. On relevant dimensions in kernel feature spaces. J. Mach. Learn. Res. 9, 1875–1908 (2008).
MathSciNet MATH Google Scholar
Vapnik, V. N. The Nature of Statistical Learning Theory. (Springer, New York, NY, 1995).
Book MATH Google Scholar
Müller, K.-R., Mika, S., Ratsch, G., Tsuda, K. & Schölkopf, B. An introduction to kernel-based learning algorithms. IEEE Trans. Neural Netw. Learn. Syst. 12, 181–201 (2001).
Article Google Scholar
Schölkopf, B. & Smola, A. J.Learning with kernels: support vector machines, regularization, optimization, and beyond (MIT press, 2002).
Williams, C. K. & Rasmussen, C. E. Gaussian processes for machine learning. (MIT press Cambridge, MA, 2006).
MATH Google Scholar
Thomas, N. et al. Tensor field networks: Rotation-and translation-equivariant neural networks for 3d point clouds. arXiv:1802.08219 (2018)
Schütt, K., Unke, O. & Gastegger, M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In Int. Conf. Mach. Learn., 9377–9388 (PMLR, 2021). https://proceedings.mlr.press/v139/schutt21a.html.
LeCun, Y. A., Bottou, L., Orr, G. B. & Müller, K.-R. Efficient backprop. In Neural networks: Tricks of the trade, 9–48 (Springer, 2012).
Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J. Phys.: Condens. Matter 21, 395502 (2009).
Google Scholar
Giannozzi, P. et al. Advanced capabilities for materials modelling with Quantum ESPRESSO. J. Phys.: Condens. Matter 29, 465901 (2017).
CAS Google Scholar
Blum, V. et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 180, 2175 – 2196 (2009).
Article MATH CAS Google Scholar
Kapil, V. et al. i-pi 2.0: a universal force engine for advanced molecular simulations. Comput. Phys. Commun. 236, 214–223 (2019).
Article ADS CAS Google Scholar
Togo, A. & Tanaka, I. First principles phonon calculations in materials science. Scr. Mater. 108, 1–5 (2015).
Article CAS Google Scholar
Völkl, J., Wollenweber, G., Klatt, K.-H. & Alefeld, G. Notizen: Reversed isotope dependence for hydrogen diffusion in palladium. Z. Naturforsch. A 26, 922–923 (1971).
Article ADS Google Scholar
Heuser, B. J. et al. Direct measurement of hydrogen dislocation pipe diffusion in deformed polycrystalline pd using quasielastic neutron scattering. Phys. Rev. Lett. 113, 025504 (2014).
Article ADS CAS PubMed Google Scholar
Powell, G. L. & Kirkpatrick, J. R. Surface conductance and the diffusion of h and d in pd. Phys. Rev. B 43, 6968–6976 (1991).
Article ADS CAS Google Scholar

Download references

Acknowledgements

This paper is dedicated to the memory of Astro. We thank Oz Yosef Mendelsohn and Leeor Kronik for providing access to the perovskite dataset. AT was supported by the Luxembourg National Research Fund (DTU PRIDE MASSENA) and by the European Research Council (ERC-CoG BeStMo). KRM was partly supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grants funded by the Korea government(MSIT) (No. 2019-0-00079, Artificial Intelligence Graduate School Program, Korea University and No. 2022-0-00984, Development of Artificial Intelligence Technology for Personalized Plug-and-Play Explanation and Verification of Explanation), and by the German Ministry for Education and Research (BMBF) under Grants 01IS14013A-E, 01GQ1115, 01GQ0850, 01IS18025A and 01IS18037. LOPB thank financial support from DGAPA-UNAM (PAPIIT) under Projects IA102218 and IN116020, as well as cpu-time at Supercómputo UNAM (Miztli) through a DGTIC-UNAM grant LANCAD-UNAM-DGTIC-307. LOPB and LEGG are also grateful to CONACYT-Mexico for support through Project 285218 and a doctoral scholarship 493775, respectively. HES works at the BASLEARN - TU Berlin/BASF Joint Lab for Machine Learning, co-financed by TU Berlin and BASF SE. HES thanks the support from DGTIC-UNAM under Project LANCAD-UNAM-DGTIC-419. Correspondence should be addressed to HES, KRM and AT.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Departamento de Materia Condensada, Instituto de Física, Universidad Nacional Autónoma de México, Cd. de México C.P., 04510, Mexico
Huziel E. Sauceda
Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany
Huziel E. Sauceda, Stefan Chmiela & Klaus-Robert Müller
BASLEARN - TU Berlin/BASF Joint Lab for Machine Learning, Technische Universität Berlin, 10587, Berlin, Germany
Huziel E. Sauceda
Programa de Doctorado en Ciencias (Física), División de Ciencias Exactas y Naturales, Universidad de Sonora, Blvd. Luis Encinas & Rosales, Hermosillo, C.P., 83000, Mexico
Luis E. Gálvez-González
BIFOLD – Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
Stefan Chmiela & Klaus-Robert Müller
Departamento de Física Química, Instituto de Física, Universidad Nacional Autónoma de México, Cd. de México C.P., 04510, Mexico
Lauro Oliver Paz-Borbón
Google Research, Brain team, Berlin, Germany
Klaus-Robert Müller
Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, 02841, Seoul, Korea
Klaus-Robert Müller
Max Planck Institute for Informatics, Stuhlsatzenhausweg, 66123, Saarbrücken, Germany
Klaus-Robert Müller
Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
Alexandre Tkatchenko

Authors

Huziel E. Sauceda
View author publications
You can also search for this author in PubMed Google Scholar
Luis E. Gálvez-González
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Chmiela
View author publications
You can also search for this author in PubMed Google Scholar
Lauro Oliver Paz-Borbón
View author publications
You can also search for this author in PubMed Google Scholar
Klaus-Robert Müller
View author publications
You can also search for this author in PubMed Google Scholar
Alexandre Tkatchenko
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.E.S. and A.T. conceived the research and designed the analyses. H.E.S. and L.E.G.G. performed quantum chemical calculations and the molecular dynamics simulations. S.C. conceived and constructed the GDML framework. S.C., L.E.G.G., H.E.S., A.T., and K.-R.M. developed the machine learning methodology. H.E.S. and L.E.G.G. created the figures with help from other authors. H.E.S., K.-R.M. and A.T. wrote the paper. All the authors discussed results and commented on the manuscript.

Corresponding authors

Correspondence to Huziel E. Sauceda, Klaus-Robert Müller or Alexandre Tkatchenko.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Video 1

Supplementary Video 2

Supplementary Video 3

Supplementary Video 4

Supplementary Software

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sauceda, H.E., Gálvez-González, L.E., Chmiela, S. et al. BIGDML—Towards accurate quantum machine learning force fields for materials. Nat Commun 13, 3733 (2022). https://doi.org/10.1038/s41467-022-31093-x

Download citation

Received: 17 June 2021
Accepted: 01 June 2022
Published: 29 June 2022
DOI: https://doi.org/10.1038/s41467-022-31093-x

This article is cited by

Smart carbon-based sensors for the detection of non-coding RNAs associated with exposure to micro(nano)plastics: an artificial intelligence perspective
- Pooja Ratre
- Nazim Nazeer
- Pradyumna Kumar Mishra
Environmental Science and Pollution Research (2024)
Quantum confinement detection using a coupled Schrödinger system
- Chun Li
Nonlinear Dynamics (2024)
Efficient interatomic descriptors for accurate machine learning force fields of extended molecules
- Adil Kabylda
- Valentin Vassilev-Galindo
- Alexandre Tkatchenko
Nature Communications (2023)
Universal machine learning for the response of atomistic systems to external fields
- Yaolong Zhang
- Bin Jiang
Nature Communications (2023)
A transferable recommender approach for selecting the best density functional approximations in chemical discovery
- Chenru Duan
- Aditya Nandy
- Heather J. Kulik
Nature Computational Science (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.