Universal machine learning for the response of atomistic systems to external fields

Zhang, Yaolong; Jiang, Bin

doi:10.1038/s41467-023-42148-y

Download PDF

Article
Open access
Published: 12 October 2023

Universal machine learning for the response of atomistic systems to external fields

Nature Communications volume 14, Article number: 6424 (2023) Cite this article

5422 Accesses
6 Citations
44 Altmetric
Metrics details

Subjects

Abstract

Machine learned interatomic interaction potentials have enabled efficient and accurate molecular simulations of closed systems. However, external fields, which can greatly change the chemical structure and/or reactivity, have been seldom included in current machine learning models. This work proposes a universal field-induced recursively embedded atom neural network (FIREANN) model, which integrates a pseudo field vector-dependent feature into atomic descriptors to represent system-field interactions with rigorous rotational equivariance. This “all-in-one” approach correlates various response properties like dipole moment and polarizability with the field-dependent potential energy in a single model, very suitable for spectroscopic and dynamics simulations in molecular and periodic systems in the presence of electric fields. Especially for periodic systems, we find that FIREANN can overcome the intrinsic multiple-value issue of the polarization by training atomic forces only. These results validate the universality and capability of the FIREANN method for efficient first-principles modeling of complicated systems in strong external fields.

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

De novo design of protein structure and function with RFdiffusion

Article Open access 11 July 2023

Collective intelligence: A unifying concept for integrating biology across scales and substrates

Article Open access 28 March 2024

Introduction

The interplay between external fields and chemical systems is of fundamental importance in a range of physical, chemical, and biological processes^1,2. By interacting with atoms, molecules, or condensed matter, external (mainly electric) fields can induce electronic/spin polarization and spatial orientation of the system, which have offered a particular means to alter chemical structures³, promote electron transfer⁴, control phase transitions of materials⁵ or conformational transformations of biomolecules⁶, subtly manipulate chemical reactivity and selectivity in catalysis^7,8,9,10 and quantum dynamics in cold chemical reactions^11,12,13.

Exact field-dependent quantum scattering calculations were feasible only for very small systems¹⁴. Density functional theory (DFT) and ab initio molecular dynamics (AIMD) simulations based on the modern theory of polarization¹⁵ have been more commonly applied to study more complex aperiodic and periodic systems in the presence of external electric fields^{16,17,18,19,20}. However, the AIMD approach remains very demanding, especially when nuclear quantum effects (NQEs) are important¹⁹. Although empirical force fields can be instead highly efficient^21,22, their accuracy is limited by empirical functions and approximate expressions for the interaction Hamiltonian. For example, the commonly used dipole-field approximation truncates the perturbation of the system by an electric field to the first order (i.e. only the interaction with the permanent dipole is included) and omits higher-order interactions associated with polarizability, hyperpolarizability, and so on. Moreover, except these reactive force fields^23,24,25, most of them fall short of describing bond breakage/formation.

Recent years have witnessed revolutionary successes of machine learning (ML) methods in solving high-dimensional problems in chemistry^{26,27,28,29,30,31,32}. Various ML models for accurately representing potential energy surfaces (PESs) have been developed^{33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48}. Some of them have been extended to learn tensorial properties such as the dipole moment and polarizability tensor with correct rotational equivariance^{48,49,50,51,52,53,54,55,56,57,58,59,60}, enabling efficient field-free simulations of electronic and vibrational spectra. However, most ML models treat the potential energy and its response properties to electric fields separately, without capturing the field dependence. The influence of electric fields was first taken into account by Christensen et al. in a kernel-based regression method by constructing a fictitious dipole arising from fictitious partial charges and coupling it with the field vector to yield a scalar dipole-field interaction analog⁶¹. Müller and coworkers incorporated the dipole-field interaction similarly in their FieldSchNet neural network (NN) model to describe the solvent effect in the form of an effective field⁶². Gao and Remsing separated the long- and short-range interactions by introducing self-consistent effective electric fields, which are used as input for subsequent NNs along with local atomic coordinates to describe perturbations to the short-range system from the effective electric field. This strategy however does not rigorously fulfill the rotational equivariance with respect to the field direction⁶³.

In this work, by introducing a simple field-dependent feature into the description of the atomic environment, we develop a field-induced recursively embedded atom neural network (FIREANN) model with correct rotational equivariance of a system interacting with an external field. Without any truncation of field-induced interactions, FIREANN describes not only the energy variation with applied field strength and direction, but also the associated response properties simultaneously up to (in principle) any order. Path-integral-based molecular dynamics (MD) simulations with well-trained FIREANN models yield reliable ab initio spectroscopy of representative molecular and condensed phase systems in the presence of electric fields. A remarkable characteristic of this model is that it can detour the multivalued issue of polarization in periodic systems, a well-known but largely neglected fact in existing ML models, by learning atomic forces only.

Results

FIREANN framework

The FIREANN model is built on top of a physics-inspired recursively embedded atom neural network (REANN) framework⁶⁴ that uses embedded atom densities (EADs) as descriptors to atomic environment (Detailed in the Methods section). In the absence of electric fields, EADs are constructed in a quantum chemical spirit by the linear combination of Gaussian-type orbitals (GTOs) of surrounding atoms, preserving the overall rotational, translational, and permutational invariance of the system. However, an applied field can certainly redistribute the electron density and break down the rotational invariance of the system. The corresponding field-system interaction depends on the direction and strength of the electric field. To characterize this influence in a physically meaningful way, for an applied field ($\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}$), we include a virtual field vector-dependent function, namely,

$${\varphi }_{{l}_{x}{l}_{y}{l}_{z}}(\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}})={({\varepsilon }_{x})}^{{l}_{x}}{({\varepsilon }_{y})}^{{l}_{y}}{({\varepsilon }_{z})}^{{l}_{z}}.$$

(1)

Following the procedure of feature construction in Methods, the field-dependent orbital was combined into the GTO to form a field-induced EAD (FI-EAD) vector that comprises various density values determined by different sets of contracted coefficients,

$${\rho }_{i}^{n}=\mathop{\sum }\limits_{l=0}^{L}{\mathop{\sum }\limits_{{l}_{x},{l}_{y},{l}_{z}}^{{l}_{x}+{l}_{y}+{l}_{z}=l}\frac{l!}{{l}_{x}!{l}_{y}!{l}_{z}!}\left[\mathop{\sum }\limits_{m=1}^{{N}_{\varphi }}{d}_{m}^{n}\left(\mathop{\sum }\limits_{j\ne i}^{{N}_{c}}{c}_{j}{\varphi }_{{l}_{x}{l}_{y}{l}_{z}}^{m}\left({\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}\right)+{c}_{\varepsilon }{\varphi }_{{l}_{x}{l}_{y}{l}_{z}}\left({\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}_{i}\right)\right)\right]}^{2}.$$

(2)

Here, the applied field felt by each atom is represented by a position vector of a pseudo-atom relative to that atom (${\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}_{i}$, as illustrated in Fig. 1). The FI-EAD feature can be rewritten in terms of interatomic distances and enclosed angles³⁹,

$${\rho }_{i}^{n}= \mathop{\sum }\limits_{l=0}^{L}\mathop{\sum }\limits_{j,k\ne i}^{{N}_{c}}{c}_{j}{c}_{k}{\left[{r}_{ij}{r}_{ik}\right]}^{l}{\cos }^{l}({\theta }_{ijk})\mathop{\sum }\limits_{m,m{^{\prime}}=1}^{{N}_{\varphi }}{d}_{m}^{n}{d}_{m{^{\prime}}}^{n}{f}_{m}({r}_{ij}){f}_{m{^{\prime}}}({r}_{ik}) V\\ +\mathop{\sum }\limits_{l=0}^{L}\mathop{\sum }\limits_{j\ne i}^{{N}_{c}}{c}_{j}{c}_{{{{{{\rm{e}}}}}}}{\left[{r}_{ij}|{\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}_{i}|\right]}^{l}{\cos }^{l}({\theta }_{ij\varepsilon })\mathop{\sum }\limits_{m=1}^{{N}_{\varphi }}{d}_{m}^{n}{f}_{m}({r}_{ij}),$$

(3)

where we have combined the radial Gaussian and switching functions in f_m for simplicity. From Eq. (3), one immediately realizes that the FI-EAD feature depends not only on atomic coordinates, but also on the field strength $(|{\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}_{i}|)$ and the closed angle θ_ijε between ${\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}_{i}$ and each ${\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}$ vector. In practice, rotating the field or the system separately will lead to different FI-EAD values, while the synchronous rotation of the field and the system without altering the relative direction of $\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}$ with respect to each coordinate ${\hat{{{{{{\bf{r}}}}}}}}_{ij}$ will not. This FI-EAD feature captures the nature of the interaction between the system and the applied electric field without changing the physical form of the EAD feature. The resulting rotational equivariance is conserved in any subsequent message passing of the environment- and field-dependent orbital coefficients (c_j and c_ε) and thus in the final potential energy. When $\overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}$= 0, this model naturally reduces to the original REANN model. The only extra cost for evaluating FIREANN compared to the standard REANN is that of a field-induced orbital, which is almost negligible as evident from Eqs. (1) and (2).

**Fig. 1: Schematic of FIREANN framework.**

As an extra benefit, the FIREANN framework intrinsically describes the response of the potential energy to an external field up to an arbitrary order by taking the analytical gradients of the potential energy with respect to the field vector. For example, the electric dipole moment ($\overrightarrow{{{{{{\boldsymbol{\mu }}}}}}}$) is the first and the polarizability tensor (α) the second-order response to electric fields,

$$\overrightarrow{{{{{{\boldsymbol{\mu }}}}}}}=-\frac{\partial E}{\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}};\,{{{{{\boldsymbol{\alpha }}}}}}=\frac{\partial \overrightarrow{{{{{{\boldsymbol{\mu }}}}}}}}{\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}=-\frac{{\partial }^{2}E}{\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}.$$

(4)

These properties can be simultaneously learned in a FIREANN model by adopting the following loss function,

$${{{{{\rm{L}}}}}}({{{{{\bf{w}}}}}})= \mathop{\sum }\limits_{m=1}^{{N}_{b}}\left[{\lambda }_{V}\times {\left({E}_{m}^{NN}-{E}_{m}^{Ref}\right)}^{2}+{\lambda }_{F}\times {\left|{\left(-\frac{\partial E}{\partial {{{{{\bf{r}}}}}}}\right)}_{m}^{NN}-{{{{{{\bf{F}}}}}}}_{m}^{Ref}\right|}^{2}\right.\\ \left.\,+{\lambda }_{\mu }\times {\left|{\left(-\frac{\partial E}{\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}\right)}_{m}^{NN}-{\overrightarrow{{{{{{\boldsymbol{\mu }}}}}}}}_{m}^{Ref}\right|}^{2}+{\lambda }_{\alpha }\times {\left|{\left(-\frac{{\partial }^{2}E}{\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}\partial \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}}\right)}_{m}^{NN}-{{{{{{\boldsymbol{\alpha }}}}}}}_{m}^{Ref}\right|}^{2}\right]/{N}_{b},$$

(5)

where N_b is the size of the batch dataset, the superscripts NN and Ref refer to the NN-predicted and reference quantities, and λ_V, λ_F, λ_μ, and λ_α represent the weights of the energy, force, dipole moment, and polarizability, respectively, in the loss function. Note that these response properties by construction offer correlated information on the field-dependence of the PES rather than being simply accumulated in the loss function⁴⁰. As a result, they have a similar effect as that of forces and hessians, which can help improve the fitting quality. We also note that FIREANN not only applies to electric fields as demonstrated by numerical examples below, but also equally to magnetic fields in the same spirit, by which the magnetic dipole and/or magnetic polarizability can be obtained.

A toy system

We first take the H₂O molecule as a toy system to verify the symmetry adaption of the FIREANN method subject to an external electric field. A FIREANN model was constructed with just a single equilibrium geometry lying in the yz plane and the electric field being 0.1 V Å⁻¹ along the x direction. As displayed in Fig. 2a, when the molecule rotates about the x axis, its potential energy does not change at all as the field is always orthogonal to the molecular plane. This is exactly encoded in FIREANN. On the other hand, the potential energy varies with the molecular rotation about the y axis, as shown in Fig. 2b. The energy variation representing the interaction between the molecular dipole and the electric field is again well predicted by the FIREANN model. Importantly, FIREANN further exhibits excellent extrapolatability in Fig. 2c, where the field intensity along the x axis varies from −0.2 to 0.2 V Å⁻¹, resulting in a symmetrical energy dependence on the field direction. The FIREANN model trained with a single data successfully reproduces the energy profile generated by DFT. In comparison, FieldSchNet fails to predict the correct field-induced energy dependence in the same condition and give a constant energy as shown in Fig. 2c. More detailed comparisons between FIREANN and FieldSchNet will be discussed below.

**Fig. 2: Rotational and field intensity dependence of the FIREANN model.**

Molecular spectroscopy

A distinct feature of the proposed FIREANN model is its all-in-one predictions for energies (atomic forces) and response properties with and without an electric field. We first demonstrate this feature for the N-methylacetamide (NMA) molecule, which has been widely used as a model system of the amide group to construct spectroscopic maps and simulate the spectra of the peptide backbone^{52,65,66,67,68}. Specifically, we constructed an FIREANN model by learning a mix set of ab initio energies, forces, dipole moments, and polarizabilities for the NMA molecule in an electric field varying from 0.0 to 0.4 V Å⁻¹ along x direction. Figure 3 clearly shows that the universal FIREANN model achieves an excellent accuracy for energy, dipole moment, and polarizability, with corresponding root mean square errors (RMSEs) of 0.0053 eV, 0.028 Debye, and 0.51 a.u., respectively. Given the synchronous prediction of these quantities, the FIREANN model enables efficient MD simulations of IR and Raman spectra in comparison with experimental data.

**Fig. 3: Performance of the FIREANN model for NMA.**

Figure 4a compares the calculated and experimental field-free infrared (IR) spectra⁶⁹ for NMA at 300 K. In general, the classical MD-based result agrees reasonably well with the experimental spectrum, even reproducing double peaks for the C-O stretching vibration (~1710 cm⁻¹) corresponding to the well-known P/R rotational structure-induced splitting⁶⁹. However, the calculated bands of Amide II, Amide III, Amide A (the N-H stretch band, ~3507 cm⁻¹), and the band including C-H stretching mode and other bending overtones of the methyl group (~2950 cm⁻¹) are apparently blue-shifted compared to experiment. This discrepancy is likely due to the neglect of NQEs in the classical treatment of these vibrational bands relevant to hydrogen atoms. To solve this problem, path-integral based thermostated ring polymer molecular dynamics (TRPMD)⁷⁰ simulations were performed. The TRPMD result significantly improves the agreement with the experiment and reproduces most bands in not only their positions but also their shapes and intensities. Figure 4b compares the calculated and experimental resonance Raman spectra of NMA for zero field. Due to the lack of the experimental spectrum for a single NMA molecule, the measured liquid spectrum⁷¹ is taken form qualitative comparison. Encouragingly, while there is apparent mismatch regarding the relative peak intensities of low-frequency modes, the TRPMD spectrum reproduces most of the observed bands reasonably well.

**Fig. 4: Field-free vibrational spectra of NMA.**

FIREANN also predicts the in-field molecular spectra, as shown in Fig. 5, where the field strength increases from 0 to 0.4 V Å⁻¹ every 0.1 V Å⁻¹ along x direction. In these in-field IR spectra, the C-O stretching band seems most influenced by the applied field. As mentioned in the field-free spectrum, this band has an intrinsic P/R rotational double-peak structure. Interestingly, with increasing field strength, this P/R branch splitting gradually vanishes and the absorption peak gets narrower and higher. This phenomenon implies the interplay between the electric field and the molecular rotation. Indeed, the dipole moment of NMA, which is almost parallel to the C-O bond^69,72, tends to reorient to the opposite direction of the electric field to minimize the energy. Increasing the field intensity increases the dipole-field interaction and more strongly confines the NMA molecule in the preferable orientation. In addition, a significant red shift of the CO stretching vibration is found roughly proportional to the field intensity. This redshift is likely a natural consequence of the weakening of the chemical bond by the applied electric field. A similar but smaller redshift is also found for the N-H stretching, consistent with the fact that the electron cloud of the C-O group is more polarizable than the N-H group, due apparently to the higher electron density there. Furthermore, we decompose the molecular polarizability into isotropic (${\alpha }_{{{{{{\rm{iso}}}}}}}={{{{{\rm{tr}}}}}}({{{{{\boldsymbol{\alpha }}}}}})/3$) and anisotropic (${{{{{{\boldsymbol{\alpha }}}}}}}_{{{{{{\rm{aniso}}}}}}}={{{{{\boldsymbol{\alpha }}}}}}-{\alpha }_{{{{{{\rm{iso}}}}}}}{{{{{\bf{I}}}}}}$) terms and exhibit corresponding Raman spectra in Fig. 5, respectively. The anisotropic Raman spectra show a similar field-dependence of the C-O stretching band for the same reason. However, the rotational splitting is absent in isotropic Raman spectra as the isotropic polarizability is rotation invariant. As a result, the increasing field results in only a pure red shift of the C-O stretching vibration.

**Fig. 5: In-field vibrational spectra of NMA predicted by FIREANN.**

Liquid water

The FIREANN model is by its atom-wise form capable of describing the response of periodic systems to external electric fields. We test this capability in liquid water. However, unlike molecular systems, the polarization (dipole moment per unit volume) of a periodic system is a multivalued quantity according to the modern theory of polarization¹⁵, resulting in multiple parallel branches that differ by a polarization quantum represented by the product of any lattice vector and the electronic charge and divided by the volume of the lattice¹⁵. This ill-defined multiplicity may lead to sudden jumps in the dipole moment. Figure 6a shows clearly the abrupt discontinuities in the evolution of the x component of DFT calculated dipole moment along an AIMD trajectory without an electric field. This accidental change in the dipole moment poses challenges for conventional atomistic ML models that learn field-free dipoles, which typically decompose the global dipole moment vector into local atomic dipoles and represent atomic dipoles by the product of atomic charges and position vectors^49,50,52. Importantly, this discontinuity issue occurs more frequently under a high field strength, as seen in Fig. 6b. Likewise, the in-field total energy is supposed to be discontinuous at these configurations as the dipole-field interaction jumps. Schienbein also recognized the multiple-valued problem of the dipole moment and proposed to learn the atomic polar tensor which is the spatial derivative of dipole instead of learning the dipole itself. These smooth spatial derivatives can be transformed into time derivatives of dipole in MD to calculate autocorrelation functions, ultimately yielding the IR spectrum⁶⁰. Similarly, learning the offsets of the Wannier centers relative to the corresponding O atoms can overcome this problem⁷³, however, localizing Wannier centers themselves in more complex systems with strong non-local features, is not a trial task, which easily gets stuck in local minima and has severe convergence difficulties⁷⁴. Furthermore, unlike FIREANN, these previous ML models are designed for field-free systems only, which does not describe the general response of a system to external fields and the field-dependent potential energy surface.

In the FIREANN framework, alternatively, this issue can be easily bypassed by training atomic forces only in the presence of electric field, because the gradient of the energy is actually unaffected. Although the dipole moment and polarizability are not explicitly involved, the interaction between the electric field and the system can be learned implicitly in the force-only training. The dipole moment can then be retrieved by the first-order gradient of the energy output with respect to the field vector. It should be noted that training atomic forces only will introduce an undetermined field-dependent constant to the total energy in our model. This will lead to certain undetermined constants to absolute thermodynamic properties. However, atomic forces (or any properties’ gradients with respect to atomic coordinates) are well represented by our model so that the changes of thermodynamic properties under a given electric field can still be correctly described, which are physically more meaningful in practical calculations. In addition, by including the polarizability in the loss function during the training process, one can eliminate the field dependence of the undetermined constant on the total energy. This allows us to compare energies and the changes in thermodynamic properties under different field strengths, as will be discussed later in the comparison with FieldSchNet.

To validate this strategy, we have constructed a FIREANN model of bulk water including 64 water molecules under an electric field up to 0.6 V Å⁻¹ along x direction, using atomic forces as targets only (named FIREANN-wF hereafter). Our model yields accurate predictions for atomic forces, with an overall RMSE of 39.4 meV Å⁻¹. In addition, the TRPMD-calculated field-free radial distribution functions (RDFs) of liquid water agree very well with previous on-the-fly results at the same DFT level⁷⁵ and experimental data⁷⁶, as shown in Fig. 7, further validating the accuracy of the FIREANN-wF potential. It is also beneficial to compare the dipole moments predicted by FIREANN-wF with DFT data. Since dipole moments should smoothly change as the configuration evolves, it is reasonable to correct any abrupt changes in the dipole moment calculated by DFT along an AIMD trajectory. This correction involves shifting the dipole moment by an integer multiplied by the product of the corresponding lattice vector and electronic charge. By applying this correction, one ensures that the adjusted dipole moment remains closest to its value in the previous step, allowing for a continuous variation of dipole moments along the trajectory. Impressively, as shown in Fig. 6, the corrected DFT dipole moments perfectly match the FIREANN-wF predictions, without any prior knowledge of these positions of sudden jumps. Interestingly, the FIREANN-wF model captures the drastic increase in the total polarization of the system under an intensive electric field, as shown in Fig. 6b. This is because of additivity of the molecular dipole moment as each water molecule tends to align its dipole moment to the field vector. Figure 6 also shows the poor performance of a flawed model training with multi-branched dipole moments in a brute-force way (named FIREANN-wD hereafter), which obviously fails to follow the correct evolution of the dipole moment. Note that dipole moments in both two models are obtained by calculating the energy gradient with respect to the electric field. It is worth stating that although it is viable to correct the dipole moment along a single AIMD trajectory because the configuration change is minor in adjacent steps, it is difficult to do so in practice for uncorrelated trajectories or for independent single-point calculations. In latter cases, the training dataset will likely include dipole moments of unpredictable branches and give large noises to conventional ML dipole models relying on the multiplication of atomic charges and position vectors.

**Fig. 7: Radial distribution functions of liquid water.**

Misrepresenting the dipole moment surface could have a significant influence on the resultant IR spectrum. Figure 8a compares the calculated and experimental IR spectra of liquid water at room temperature without an electric field. Thanks to the inclusion of NQEs by TRPMD, the FIREANN-wF model that offers both the correct PES and dipole moment surface, does capture well all experimental vibrational features⁷⁷ including the O-H stretching (~3600 cm⁻¹), H-O-H bending (~1690 cm⁻¹), librational (~700 cm⁻¹)⁷⁸ and H-bonding stretching bands (~170 cm⁻¹)⁷⁸. Our results also agree well with previous theoretical ones obtained by on-the-fly TRPMD at the same DFT level⁷⁵, likely with their DFT dipole moments corrected. By contrast, the IR spectrum predicted by the FIREANN-wF model using the same trajectories deviates significantly from the experimental counterpart. This comparison clearly highlights the necessity of using an ML model fulfilling the physical requirement of the dipole moment in predicting IR spectra. Finally, we show in Fig. 8b the predictions of the FIREANN-wF model for the IR spectrum with an electric field up to 0.4 V Å⁻¹ along x direction. Interestingly, the electric field influences mostly the O-H stretching band, resulting in a progressive red shift upon with the increasing field intensity. Unlike the NMA molecule, the red shift here is not solely because of the softening of the O-H bond by the electric field, but also the more ordered structure as a result of the field-induced reorientation of water molecules to be parallel to the field vector²⁰. This effect will render the liquid water structure more ice-like²⁰, in which the O-H vibrational band is lower in frequency. In contrast, this ordering effect hinders the librational rotation of water molecules and naturally results in an increased frequency of the librational mode. In comparison, the H-O-H bending mode is barely affected by the electric field, since the bending motion leads to little change in the direction of the dipole moment.

**Fig. 8: Field-free and in-field IR spectra of liquid water.**

Comparison with previous models

Although similar force-only training can be done using previous models proposed in refs. ^61,62, their ways of incorporating the external field are completely different from the present FIREANN model, rendering the absence of important high-order field-system interactions. Specifically, these models consider the response of the system to the external field by adding the dot product of a virtual atomic dipole and the electric field vector (namely $\overrightarrow{{{{{{\boldsymbol{\mu }}}}}}}\cdot \overrightarrow{{{{{{\boldsymbol{\varepsilon }}}}}}}$, a scalar value) to the standard atomic descriptor. By construction, high-order field-system interactions are missing in their descriptors. In contrast, by introducing a virtual field-dependent atomic orbital in our field-induced EAD descriptor, we capture the response of the electron density to the external field through orbital-orbital interactions (Eq. (2) in this work). In such a way, all interactions between the field and the atomic environment are included in our FIREANN model. This is a fundamental improvement over all previous models, which can be easily adapted to other equivariant features based ML models^44,48,53 without altering their fundamental architectures. Indeed, Christensen et al.⁶¹ have clearly admitted that their kernel-based model cannot predict polarizability and other high-order response properties. Similar deficiencies arise in the FieldSchNet model as well. Although the nonlinear message passing NNs imposed in that model may learn part of high-order interactions, this incompleteness will cause qualitative failures of FieldSchNet in some cases.

To show this explicitly, we have applied the FieldSchNet package and compared its predictions with present ones using exactly the same dataset. As already presented in Fig. 2c, the FIREANN model perfectly captures the nonlinear energy variation of a water molecule as a function of the field strength. In contrast, the FieldSchNet model predicts no dependence of the energy on the field strength at all. This is because the water molecule lies in the yz plane so all atomic dipoles derived from FieldSchNet always lie in plane, resulting in zero coupling with the electric field applied along the x direction and a constant energy. In practice, all x relevant components in any response quantities (dipole moment, polarizability, etc.) predicted by FieldSchNet are zero.

This phenomenon is not limited to molecules but also applies to periodic systems. To show this, we construct an exemplary dataset consisting of 200 liquid water configurations exposed to an electric field along the x direction ranging from 0 to 0.6 V Å⁻¹. Water molecules in these configurations are aligned in four evenly spaced layers (16 molecules in each) perpendicular to the x axis with the first and third layer equal to the second and fourth layers respectively, as shown in the inset of Fig. 9. To enable the comparison of field-dependent energies, we trained a FIREANN model and a FieldSchNet model respectively with both forces and polarizabilities (to eliminate the field dependence of the undetermined constant of the total energy), and aligned their field-free energies to the same zero point. The difference between FIREANN and FieldSchNet models is amplified in this system, where the RMSEs predicted on the test set by FIREANN and FieldSchNet are 54.5 meV Å⁻¹ (2.1 a.u.) and 245.4 meV Å⁻¹ (165.1 a.u.) for forces (polarizability), respectively. Again, the FIREANN model precisely captures the large energy variance up to an applied electric field of ±2 V Å⁻¹, while the FieldSchNet energy remains constant and deviates from the correct DFT result by several eV, as displayed in Fig. 9. This result also validates the generalizability of the FIREANN model towards representing high-intensity external fields.

**Fig. 9: Field-dependent energy curves of FIREANN and FieldSchNet.**

We note that this deficiency will generally appear in any configuration if all atomic dipoles along the applied field direction are zero, which would lead to an unphysical behavior near the corresponding configuration space and inevitable large fitting errors. For example, using the same full dataset of liquid water in this work and training with atomic forces and polarizabilities, the FIREANN and FieldSchNet models yield test RMSEs of 45.5 meV Å⁻¹ (2.5 a.u.) and 184.7 meV Å⁻¹ (12.9 a.u.), respectively. The much worse performance of FieldSchNet represents an indicator of its incomplete description of the field-system interaction. It is worth noting that our FIREANN implementation is more efficient than FieldSchNet, with a training time of 2.4 versus 7.6 minutes per epoch, when running on a single A100 GPU with a memory capacity of 80 GB.

Discussion

In this work, we have proposed a simple, accurate, and universal FIREANN model to learn the external field-dependent PES and response properties with the proper rotational equivariance. This model allows us to obtain all ingredients from one single training for modeling spectroscopy and dynamics of chemical systems with and without external electric fields. The validity of this model is supported by the good agreement between the predicted vibrational spectra of the NMA molecule and liquid water and field-free experimental data. Moreover, the field-induced alignment of the dipole moment and the softening of the covalent bond are clearly predicted in the in-field IR or Raman spectra. For periodic systems like liquid water, in particular, the intrinsic multi-valued polarization of the system results in the discontinuous dipole moments in the training data and makes it difficult to being represented by conventional machine learning models based on atomic charges. This issue is nicely bypassed in the FIREANN model by learning atomic forces only, which can yield both field-dependent potentials and dipole moments, and thus IR spectra of liquid water. Our results not only clearly validate the high accuracy of the all-in-one FIREANN model, but also elucidate the interplay between chemical systems and electric fields.

In the current implementation built on the original PyTorch⁷⁹ framework, training a FIREANN model in the most complete scenario (including energy, forces, dipole, and polarizability tensor) will take 4 times longer than force-only training, as the former process requires sample-to-sample (high-order) gradients. This issue can be largely alleviated by an improved implementation based on the more recently released functorch⁸⁰ module in a new version of PyTorch, which allows efficient computation of sample-to-sample (high-order) gradients. Although all results presented in this work are relevant to the system exposed to an electric field, the FIREANN framework can be extended to describe the response of the system to a magnetic field or even to an electromagnetic field by introducing another field vector-dependent virtual function in Eq. (2). This will allow a more complete description of magnetic fields interacting with the system than in ref. ⁶². Note that the current version of the FIREANN model is limited to describing the influence of a homogeneous external field. In the case of a non-uniform external field, the response of the electron density to the field is spatially dependent and must be explicitly considered. A feasible way is to discretize the non-uniform field to each atomic center and introduce a nonequivalent field-dependent function to each FI-EAD feature (as implicitly implied in Fig. 1) to approximate the response of each atomic density to the local field experienced by the central atom. Note that this adjustment is intended to introduce an external inhomogeneous field interacting with the entire system. This differs from an inhomogeneous electric field approximately generated by solvent environments, which acts only onto the embedding molecular center as described in ref. ⁶². These desirable features make the FIREANN approach very promising to efficiently modeling strong field-induced phenomena such as electrochemistry^81,82, plasmonic chemistry⁸³, and tip-induced catalytic reactions⁸.

Methods

REANN

The regular REANN model was proposed for representing field-free PESs⁶⁴. Like all atomistic NN models, the total potential energy (E) is expressed in the sum of atom-wise contributions, and each atomic energy (E_i) is learned by feeding a vector of atomic features for describing the atom-centered environment to an atom-wise NN. In the REANN model, EAD atomic features are specified to include many-body correlations between the central and neighbor atoms, which are simply evaluated by the square of the linear combination of a set of contracted GTOs located at neighbor atoms,

$${\rho }_{i}^{n}=\mathop{\sum }\limits_{l=0}^{L}{\mathop{\sum }\limits_{{l}_{x},{l}_{y},{l}_{z}}^{l}\frac{l!}{{l}_{x}!{l}_{y}!{l}_{z}!}\left[\mathop{\sum }\limits_{j\ne i}^{{N}_{c}}{c}_{j}\mathop{\sum }\limits_{m=1}^{{N}_{\varphi }}{d}_{m}^{n}{\varphi }_{{l}_{x}{l}_{y}{l}_{z}}^{m}\left({\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}\right)\right]}^{2},$$

(6)

where the primitive GTO takes the following form,

$${\varphi }_{{l}_{x}{l}_{y}{l}_{z}}^{m}\left({\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}\right)={\left({x}_{ij}\right)}^{{l}_{x}}{\left({y}_{ij}\right)}^{{l}_{y}}{\left({z}_{ij}\right)}^{{l}_{z}}\exp \left[-{\alpha }_{m}{\left({r}_{ij}-{r}_{m}\right)}^{2}\right]\,{f}_{c}\left({r}_{ij}\right),$$

(7)

and the contraction combines different shapes of primitive GTOs together,

$${\chi }^{n}\left({\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}\right)=\mathop{\sum }\limits_{m=1}^{{N}_{\varphi }}{d}_{m}^{n}{\varphi }_{{l}_{x}{l}_{y}{l}_{z}}^{m}\left({\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}\right),$$

(8)

In practical implementation, we reorder the summation over c_j and ${d}_{m}^{n}$ in Eq. (6) to obtain better efficiency. It should be noted that an EAD feature vector (ρ_i) consists of a number of density values generated from different sets of contracted GTOs. Although GTOs in our model are expanded in Cartesian coordinates, they can also be expressed in terms of spherical harmonics, resembling in spirit those equivariant features based on spherical harmonics^44,48,53. Specifically, ${\overrightarrow{{{{{{\bf{r}}}}}}}}_{ij}={\overrightarrow{{{{{{\bf{r}}}}}}}}_{i}-{\overrightarrow{{{{{{\bf{r}}}}}}}}_{j}$ is the position vector (three components) of the central atom i relative to the jth neighbor atom with r_ij (x_ij,y_ij,z_ij) being its norm (Cartesian component), l = l_x+ l_y + l_z specifies the orbital angular momentum (e.g., l = 0 for the s orbital, l = 1 for the p orbital, etc.), α_m and r_m are hyperparameters that determine the center and the width of the radial Gaussian function. In the combination to form an EAD feature, L is the maximum orbital angular momentum of GTOs, N_φ is the number of primitive GTOs for each l and ${d}_{m}^{n}$ is the contraction coefficient of the mth primitive GTO for the nth component of the EAD vector, N_c is the number of neighbor atoms and c_j is the jth atomic orbital coefficient within a cutoff radius (r_c), ${f}_{c}({r}_{ij})$ is a cosine-type switching function continuously decaying interatomic interactions to zero at r_c up to second-order derivatives. In particular, realizing that c_j itself necessarily depends on its atomic environment, we express it as the output of an atomic NN based on EAD features centered at atom j. Apparently, REANN is essentially a message-passing NN by recursively expanding the environment-dependent orbital coefficients like this, which has proven an efficient way to incorporate high-order many-body correlations in the local environment⁶⁴.

Training details

All FIREANN models in this study utilize NN with two hidden layers, each containing 64 neurons in each iteration of message-passing. Eight radial functions and L up to 2 were used to construct EAD features with sufficient representability. The initial learning rate was set to 0.002 and decays by a factor of 0.5 whenever the validation loss does not decrease for 100 consecutive epochs. Training stops when the learning rate drops below 1 × 10⁻⁵. The number of message-passing iterations for H₂O, NMA, and liquid water were set to 0, 4, and 3, respectively. The cutoff distances for the three systems were 3.0 Å, 6.0 Å, and 5.0 Å. Other parameters were automatically optimized during the training process. These weights for individual properties in the loss function were dynamically adjusted during the training process. For the NMA molecule, λ_V, λ_F, λ_μ, and λ_α decay linearly from 0.1, 50, 10, and 10 to 0.1, 0.5, 0.5, and 0.5 as the learning rate decays. The same set of weights were used for the H₂O molecule, except that there is no force weight included. As for the liquid water, only atomic forces were trained and weighting was unnecessary.

Computational details and datasets

Three systems are used to validate the FIREANN model, including a toy system (H₂O monomer), NMA, and liquid water.

A toy system

The training set of the H₂O molecule contains merely a single equilibrium geometry lying in the yz plane with an electric field of 0.1 V Å⁻¹ along the x direction. The potential energy, dipole moment, and polarizability of H₂O were calculated by Gaussian 09⁸⁴ at the B3LYP/cc-pVDZ level⁸⁵ and used as targets in the loss function defined in Eq. (7).

NMA

Over 13000 configurations were sampled from canonical ensemble (NVT) classical & path-integral MD simulations at 300 K in the presence of an electric field ranging from 0.0 to 0.4 V Å⁻¹ along the x direction and calculated using Gaussian 09⁸⁴ at the B3LYP/aug-cc-pVDZ level⁸⁵ with D3 correction of disperson⁸⁶. The dataset was divided into training set, validation set, and test set with a ratio of 8:1:1. Again, the potential energy, atomic force, dipole moment, and polarizability were trained simultaneously.

Liquid water

A cubic box of 64 water molecules was used in the data sampling. The dataset consists of ~33000 configurations sampled from NVT classical and path-integral AIMD simulations at 300 K with the external electric field ranging from 0.0 to 0.6 V Å⁻¹ along x direction. Electronic structures and properties were calculated by CP2K⁸⁷ with a hybrid density functional revPBE0^88,89 including D3 correction of dispersion⁸⁶. Goedecker–Tetter–Hutter pseudopotentials⁹⁰ with a cutoff of 1200 Ry and a TZV2P basis set were used. Only atomic forces were used to construct the PES, and the dipole moment was excluded due to its discontinuity caused by the multiple-value nature of the periodic systems. To illustrate the deficiency of FieldSchNet, a special dataset was collected consisting of 200 liquid water configurations exposed to an electric field along the x direction ranging from 0 to 0.6 V Å⁻¹. Water molecules in these configurations were averagely placed in four evenly spaced layers perpendicular to the x axis (16 molecules in each). In addition, the first (second) layer was made identical to the third (fourth) layer. In this way, the sum of dipole moments in each layer was kept in plane and any interlayer dipole moment canceled out, leaving a zero x component of the total dipole moment. In the data sampling, water molecules were centered evenly spaced grids (4 × 4) in the plane with some random shifts within 0.3 Å and random in-plane orientation. The intramolecular O-H bond lengths and H-O-H angle of each molecule are randomly displaced from their equilibrium values within ~0.1 Å and ~2°.

MD simulations with FIREANN models

NMA

To compare the calculated IR and Raman spectra of NMA with experimental data, classical MD simulations were performed at 300 K in the absence of an electric field. The NMA molecule was first equilibrated with 20 ps using the Andersen thermostat⁹¹, after which two-hundred snapshots with corresponding momentum were randomly chosen for initializing subsequent NVE MD simulations of 25 ps. The time correlation functions (TCFs) were computed by the average of 200 such trajectories. IR and Raman spectra were obtained by the Fourier transform of the TCFs of the time derivatives of dipole and polarizability, respectively⁹². In addition, to include NQEs, path-integral based TRPMD simulations^70,93 were performed with Langevin thermostats attached to all non-centroid normal modes, with and without adding an electric field. Other computational details are similar to those in the classical MD simulations. The resulting field-dependent IR and Raman spectra were obtained by the Fourier transform of the centroid-based TCFs on an average of 200 TRPMD trajectories. In all simulations, the time step was kept at 0.1 fs.

Liquid water

The system consists of 64 H₂O molecules with a side length 12.4185 Å. The NVT classical MD simulations of liquid water were performed at 300 K, using Andersen thermostat⁹¹. To obtain convergent IR spectra, we extracted 128 positions and momentum from an equilibrium NVT trajectory as initial states for NVE MD simulations, with a total time of 20 ps per NVE simulation and a time step of 0.1 fs. The same setup was used for a 24-bead TRPMD simulations to include NQEs, which was found to converge the spectroscopic results.

Data availability

The dataset of NMA molecules and the exemplary dataset of liquid water (200 structures) generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data]. The initial and final structures of MD/TRPMD simulations generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data/md_stru]. The complete liquid water data are available upon reasonable request from the corresponding author. The data generated in this study are provided in the Source Data file. Source data are provided with this paper.

Code availability

The FIREANN package are available from https://github.com/zhangylch/FIREANN⁹⁴.

References

Shaik, S. S. & Stuyver T. Effects of electric fields on structure and reactivity: new horizons in chemistry. The Royal Society of Chemistry, xviii, 428 pages (2021).
Lemeshko, M., Krems, R. V., Doyle, J. M. & Kais, S. Manipulation of molecules with electromagnetic fields. Mol. Phys. 111, 1648–1682 (2013).
CAS ADS Google Scholar
Alemani, M. et al. Electric field-induced isomerization of azobenzene by STM. J. Am. Chem. Soc. 128, 14446–14447 (2006).
CAS PubMed Google Scholar
Murgida, D. H. & Hildebrandt, P. Electron-transfer processes of cytochrome c at interfaces. new insights by surface-enhanced resonance raman spectroscopy. Acc. Chem. Res. 37, 854–861 (2004).
CAS PubMed Google Scholar
Velpula, G., Teyssandier, J., De Feyter, S. & Mali, K. S. Nanoscale control over the mixing behavior of surface-confined bicomponent supramolecular networks using an oriented external electric field. ACS Nano 11, 10903–10913 (2017).
CAS PubMed PubMed Central Google Scholar
English, N. J. & Mooney, D. A. Denaturation of hen egg white lysozyme in electromagnetic fields: a molecular dynamics study. J. Chem. Phys. 126, 091105 (2007).
PubMed ADS Google Scholar
Shaik, S., Mandal, D. & Ramanan, R. Oriented electric fields as future smart reagents in chemistry. Nat. Chem. 8, 1091–1098 (2016).
CAS PubMed Google Scholar
Ciampi, S., Darwish, N., Aitken, H. M., Díez-Pérez, I. & Coote, M. L. Harnessing electrostatic catalysis in single molecule, electrochemical and chemical systems: a rapidly growing experimental tool box. Chem. Soc. Rev. 47, 5146–5164 (2018).
CAS PubMed Google Scholar
Shaik, S., Ramanan, R., Danovich, D. & Mandal, D. Structure and reactivity/selectivity control by oriented-external electric fields. Chem. Soc. Rev. 47, 5125–5145 (2018).
CAS PubMed Google Scholar
Aragonès, A. C. et al. Electrostatic catalysis of a Diels–Alder reaction. Nature 531, 88–91 (2016).
PubMed ADS Google Scholar
Friedrich, B. & Herschbach, D. R. Spatial orientation of molecules in strong electric fields and evidence of pendular states. Nature 353, 412 (1991).
CAS ADS Google Scholar
Sussman, B. J., Townsend, D., Ivanov, M. Y. & Stolow, A. Dynamic stark control of photochemical processes. Science 314, 278–281 (2006).
CAS PubMed ADS Google Scholar
de Miranda, M. H. G. et al. Controlling the quantum stereodynamics of ultracold bimolecular reactions. Nat. Phys. 7, 502–507 (2011).
Google Scholar
Tscherbul, T. V. & Krems, R. V. Tuning bimolecular chemical reactions by electric fields. Phys. Rev. Lett. 115, 023201 (2015).
PubMed ADS Google Scholar
King-Smith, R. D. & Vanderbilt, D. Theory of polarization of crystalline solids. Phys. Rev. B 47, 1651–1654 (1993).
CAS ADS Google Scholar
Meir, R., Chen, H., Lai, W. & Shaik, S. Oriented electric fields accelerate diels–alder reactions and control the endo/exo selectivity. ChemPhysChem 11, 301–310 (2010).
CAS PubMed Google Scholar
Zhang, C. & Sprik, M. Finite field methods for the supercell modeling of charged insulator/electrolyte interfaces. Phys. Rev. B 94, 245309 (2016).
ADS Google Scholar
Saitta, A. M., Saija, F. & Giaquinta, P. V. Ab initio molecular dynamics study of dissociation of water under an electric field. Phys. Rev. Lett. 108, 207801 (2012).
PubMed ADS Google Scholar
Cassone, G. Nuclear quantum effects largely influence molecular dissociation and proton transfer in liquid water under an electric field. J. Phys. Chem. Lett. 11, 8983–8988 (2020).
CAS PubMed Google Scholar
Cassone, G., Sponer, J., Trusso, S. & Saija, F. Ab initio spectroscopy of water under electric fields. Phys. Chem. Chem. Phys. 21, 21205–21212 (2019).
CAS PubMed Google Scholar
Vegiri, A. & Schevkunov, S. V. A molecular dynamics study of structural transitions in small water clusters in the presence of an external electric field. J. Chem. Phys. 115, 4175–4185 (2001).
CAS ADS Google Scholar
Acosta-Gutiérrez, S., Hernández-Rojas, J., Bretón, J., Llorente, J. M. G. & Wales, D. J. Physical properties of small water clusters in low and moderate electric fields. J. Chem. Phys. 135, 124303 (2011).
PubMed ADS Google Scholar
Senftle, T. P. et al. The ReaxFF reactive force-field: development, applications and future directions. NPJ Comput. Mat. 2, 15011 (2016).
CAS Google Scholar
Sobrino Fernández, M., Peeters, F. M. & Neek-Amal, M. Electric-field-induced structural changes in water confined between two graphene layers. Phys. Rev. B 94, 045436 (2016).
ADS Google Scholar
Tan, S. et al. Enhancing the oxidation of toluene with external electric fields: a reactive molecular dynamics study. Sci. Rep. 7, 1710 (2017).
PubMed ADS PubMed Central Google Scholar
Musil, F. et al. Physics-inspired structural representations for molecules and materials. Chem. Rev. 121, 9759–9815 (2021).
CAS PubMed Google Scholar
Unke, O. T. et al. Machine learning force fields. Chem. Rev. 121, 10142–10186 (2021).
CAS PubMed PubMed Central Google Scholar
Behler, J. Four generations of high-dimensional neural network potentials. Chem. Rev. 121, 10037–10072 (2021).
CAS PubMed ADS Google Scholar
Kang, P.-L., Shang, C. & Liu, Z.-P. Large-scale atomic simulation via machine learning potentials constructed by global potential energy surface exploration. Acc. Chem. Res. 53, 2119–2129 (2020).
CAS PubMed Google Scholar
Huang, B. & von Lilienfeld, O. A. Ab initio machine learning in chemical compound space. Chem. Rev. 121, 10001–10036 (2021).
CAS PubMed PubMed Central Google Scholar
Deringer, V. L. et al. Gaussian process regression for materials and molecules. Chem. Rev. 121, 10073–10141 (2021).
CAS PubMed PubMed Central Google Scholar
Zhang, Y., Lin, Q. & Jiang, B. Atomistic neural network representations for chemical dynamics simulations of molecular, condensed phase, and interfacial systems: efficiency, representability, and generalization. WIREs Comput. Mol. Sci. 13, e1645 (2023).
CAS Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
PubMed ADS Google Scholar
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
PubMed ADS Google Scholar
Jiang, B. & Guo, H. Permutation invariant polynomial neural network approach to fitting potential energy surfaces. J. Chem. Phys. 139, 054112 (2013).
PubMed ADS Google Scholar
Shao, K., Chen, J., Zhao, Z. & Zhang, D. H. Communication: fitting potential energy surfaces with fundamental invariant neural network. J. Chem. Phys. 145, 071101 (2016).
PubMed ADS Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. SchNet – a deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
PubMed ADS Google Scholar
Zhang, L., Han, J., Wang, H., Car, R. & E, W. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
CAS PubMed ADS Google Scholar
Zhang, Y., Hu, C. & Jiang, B. Embedded atom neural network potentials: efficient and accurate machine learning with a physically inspired representation. J. Phys. Chem. Lett. 10, 4962–4967 (2019).
CAS PubMed Google Scholar
Unke, O. T. & Meuwly, M. PhysNet: a neural network for predicting energies, forces, dipole moments, and partial charges. J. Chem. Theory Comput. 15, 3678–3693 (2019).
CAS PubMed Google Scholar
Zaverkin, V. & Kästner, J. Gaussian moments as physically inspired molecular descriptors for accurate and scalable machine learning potentials. J. Chem. Theory Comput. 16, 5410–5421 (2020).
CAS PubMed Google Scholar
Shapeev, A. V. Moment tensor potentials: a class of systematically improvable interatomic potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
MathSciNet MATH Google Scholar
Zubatyuk, R., Smith Justin, S., Leszczynski, J. & Isayev, O. Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network. Sci. Adv. 5, eaav6490 (2021).
ADS Google Scholar
Unke, O. T. et al. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 12, 7273 (2021).
CAS PubMed ADS PubMed Central Google Scholar
Sauceda, H. E. et al. BIGDML—towards accurate quantum machine learning force fields for materials. Nat. Commun. 13, 3733 (2022).
CAS PubMed ADS PubMed Central Google Scholar
Braams, B. J. & Bowman, J. M. Permutationally invariant potential energy surfaces in high dimensionality. Int. Rev. Phys. Chem. 28, 577–606 (2009).
CAS Google Scholar
Dral, P. O. et al. MLatom 2: an integrative platform for atomistic machine learning. Top. Curr. Chem. 379, 27 (2021).
CAS Google Scholar
Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
CAS PubMed ADS PubMed Central Google Scholar
Gastegger, M., Behler, J. & Marquetand, P. Machine learning molecular dynamics for the simulation of infrared spectra. Chem. Sci. 8, 6924–6935 (2017).
CAS PubMed PubMed Central Google Scholar
Grisafi, A., Wilkins, D. M., Csányi, G. & Ceriotti, M. Symmetry-adapted machine learning for tensorial properties of atomistic systems. Phys. Rev. Lett. 120, 036002 (2018).
CAS PubMed ADS Google Scholar
Zhang, Y., Maurer, R. J. & Jiang, B. Symmetry-adapted high dimensional neural network representation of electronic friction tensor of adsorbates on metals. J. Phys. Chem. C. 124, 186–195 (2020).
CAS Google Scholar
Zhang, Y. et al. Efficient and accurate simulations of vibrational and electronic spectra with symmetry-preserving neural network models for tensorial properties. J. Phys. Chem. B 124, 7284–7290 (2020).
CAS PubMed Google Scholar
Schütt, K. & Unke O. Gastegger M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In: Proceedings of the 38th International Conference on Machine Learning (eds. Marina M., Tong Z.). PMLR (2021).
Nigam, J., Willatt, M. J. & Ceriotti, M. Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties. J. Chem. Phys. 156, 014115 (2021).
ADS Google Scholar
Westermayr, J., Gastegger, M. & Marquetand, P. Combining SchNet and SHARC: the SchNarc machine learning approach for excited-state dynamics. J. Phys. Chem. Lett. 11, 3828–3834 (2020).
CAS PubMed PubMed Central Google Scholar
Sommers, G. M., Calegari Andrade, M. F., Zhang, L., Wang, H. & Car, R. Raman spectrum and polarizability of liquid water from deep neural networks. Phys. Chem. Chem. Phys. 22, 10592–10602 (2020).
CAS PubMed Google Scholar
Huang, X., Braams, B. J. & Bowman, J. M. Ab initio potential energy and dipole moment surfaces for H₅O₂⁺. J. Chem. Phys. 122, 044308 (2005).
ADS Google Scholar
Medders, G. R. & Paesani, F. Infrared and raman spectroscopy of liquid water through “first-principles” many-body molecular dynamics. J. Chem. Theory Comput. 11, 1145–1154 (2015).
CAS PubMed Google Scholar
Beckmann, R., Brieuc, F., Schran, C. & Marx, D. Infrared spectra at coupled cluster accuracy from neural network representations. J. Chem. Theory Comput. 18, 5492–5501 (2022).
CAS PubMed Google Scholar
Schienbein, P. Spectroscopy from machine learning by accurately representing the atomic polar tensor. J. Chem. Theory Comput. 19, 705–712 (2023).
CAS PubMed PubMed Central Google Scholar
Christensen, A. S., Faber, F. A. & Lilienfeld, O. A. V. Operators in quantum machine learning: Response properties in chemical space. J. Chem. Phys. 150, 064105 (2019).
PubMed ADS Google Scholar
Gastegger, M., Schütt, K. T. & Müller, K.-R. Machine learning of solvent effects on molecular spectra and reactions. Chem. Sci. 12, 11473–11483 (2021).
CAS PubMed PubMed Central Google Scholar
Gao, A. & Remsing, R. C. Self-consistent determination of long-range electrostatics in neural network potentials. Nat. Commun. 13, 1572 (2022).
CAS PubMed ADS PubMed Central Google Scholar
Zhang, Y., Xia, J. & Jiang, B. Physically motivated recursively embedded atom neural networks: incorporating local completeness and nonlocality. Phys. Rev. Lett. 127, 156002 (2021).
CAS PubMed ADS Google Scholar
Wang, L., Middleton, C. T., Zanni, M. T. & Skinner, J. L. Development and validation of transferable amide I vibrational frequency maps for peptides. J. Phys. Chem. B 115, 3713–3724 (2011).
CAS PubMed PubMed Central Google Scholar
Ye, S. et al. A neural network protocol for electronic excitations of N-methylacetamide. Proc. Natl Acad. Sci. USA 116, 11612–11617 (2019).
CAS PubMed ADS PubMed Central Google Scholar
Zhang, J. et al. A machine-learning protocol for ultraviolet protein-backbone absorption spectroscopy under environmental fluctuations. J. Phys. Chem. B 125, 6171–6178 (2021).
CAS PubMed Google Scholar
Zhao, L. et al. Accurate machine learning prediction of protein circular dichroism spectra with embedded density descriptors. JACS Au 1, 2377–2384 (2021).
CAS PubMed PubMed Central Google Scholar
Forsting, T., Gottschalk, H. C., Hartwig, B., Mons, M. & Suhm, M. A. Correcting the record: the dimers and trimers of trans-N-methylacetamide. Phys. Chem. Chem. Phys. 19, 10727–10737 (2017).
CAS PubMed Google Scholar
Rossi, M., Ceriotti, M. & Manolopoulos, D. E. How to remove the spurious resonances from ring polymer molecular dynamics. J. Chem. Phys. 140, 234116 (2014).
PubMed ADS Google Scholar
Herrebout, W. A., Clou, K. & Desseyn, H. O. Vibrational spectroscopy of N-methylacetamide revisited. J. Phys. Chem. A 105, 4865–4881 (2001).
CAS Google Scholar
Zhang, Y., Jiang, J. & Jiang B. Chapter 19 - Learning dipole moments and polarizabilities. In: Quantum Chemistry in the Age of Machine Learning (ed. Dral P. O.). Elsevier (2023).
Zhang, L. et al. Deep neural network for the dielectric response of insulators. Phys. Rev. B 102, 041121 (2020).
CAS ADS Google Scholar
Damle, A., Lin, L. & Ying, L. SCDM-k: localized orbitals for solids via selected columns of the density matrix. J. Comput. Phys. 334, 1–15 (2017).
MathSciNet CAS MATH ADS Google Scholar
Marsalek, O. & Markland, T. E. Quantum dynamics and spectroscopy of ab initio liquid water: the interplay of nuclear and electronic quantum effects. J. Phys. Chem. Lett. 8, 1545–1551 (2017).
CAS PubMed Google Scholar
Soper, A. K. The radial distribution functions of water and ice from 220 to 673 K and at pressures up to 400 MPa. Chem. Phys. 258, 121–137 (2000).
CAS Google Scholar
Bertie, J. E. & Lan, Z. Infrared intensities of liquids XX: The intensity of the OH stretching band of liquid water revisited, and the best current values of the optical constants of H2O(l) at 25 °C between 15,000 and 1 cm−1. Appl. Spectrosc. 50, 1047–1057 (1996).
CAS ADS Google Scholar
Silvestrelli, P. L., Bernasconi, M. & Parrinello, M. Ab initio infrared spectrum of liquid water. Chem. Phys. Lett. 277, 478–482 (1997).
CAS ADS Google Scholar
Paszke A. et al. PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems 32 (NeurIPS 2019) (eds. Wallach H., Larochelle H., Beygelzimer A., d’Alché-Buc F., Garnett EFaR). Curran Associates Inc. (2019).
Frostig, R., Johnson, M. J. & Leary C. Compiling machine learning programs via high-level tracing. Syst. Mach. Learn. 2–24 (2018).
Groß, A. Challenges in the modeling of elementary steps in electrocatalysis. Curr. Opin. Electrochem. 37, 101170 (2023).
Google Scholar
Rossmeisl, J., Nørskov, J. K., Taylor, C. D., Janik, M. J. & Neurock, M. Calculated phase diagrams for the electrochemical oxidation and reduction of water over Pt(111). J. Phys. Chem. B 110, 21833–21839 (2006).
CAS PubMed Google Scholar
Seemala, B. et al. Plasmon-mediated catalytic O2 dissociation on Ag nanostructures: hot electrons or near fields? ACS Energy Lett. 4, 1803–1809 (2019).
CAS Google Scholar
Frisch M. J. et al. Gaussian 09 Revision B.01. Gaussian Inc. (2009).
Becke, A. D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 38, 3098 (1988).
CAS ADS Google Scholar
Grimme, S., Antony, J., Ehrlich, S. & Krieg, H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J. Chem. Phys. 132, 154104 (2010).
PubMed ADS Google Scholar
Kühne, T. D. et al. CP2K: An electronic structure and molecular dynamics software package - Quickstep: eficient and accurate electronic structure calculations. J. Chem. Phys. 152, 194103 (2020).
PubMed ADS Google Scholar
Adamo, C. & Barone, V. Toward reliable density functional methods without adjustable parameters: the PBE0 model. J. Chem. Phys. 110, 6158–6170 (1999).
CAS ADS Google Scholar
Goerigk, L. & Grimme, S. A thorough benchmark of density functional methods for general main group thermochemistry, kinetics, and noncovalent interactions. Phys. Chem. Chem. Phys. 13, 6670–6688 (2011).
CAS PubMed Google Scholar
Goedecker, S., Teter, M. & Hutter, J. Separable dual-space Gaussian pseudopotentials. Phys. Rev. B 54, 1703–1710 (1996).
CAS ADS Google Scholar
Andersen, H. C. Molecular dynamics simulations at constant pressure and/or temperature. J. Chem. Phys. 72, 2384–2393 (1980).
CAS ADS Google Scholar
Thomas, M., Brehm, M., Fligg, R., Vohringer, P. & Kirchner, B. Computing vibrational spectra from ab initio molecular dynamics. Phys. Chem. Chem. Phys. 15, 6608–6622 (2013).
CAS PubMed Google Scholar
Craig, I. R. & Manolopoulos, D. E. Quantum statistics and classical mechanics: real time correlation frunction from ring polymer molecular dynamics. J. Chem. Phys. 121, 3368–3373 (2004).
CAS PubMed ADS Google Scholar
Zhang, Y. & Jiang B. Universal machine learning for the response of atomistic systems to external fields. FIREANN https://doi.org/10.5281/zenodo8363726 (2023).

Download references

Acknowledgements

This work is supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB0450101), Innovation Program for Quantum Science and Technology (2021ZD0303301), CAS Project for Young Scientists in Basic Research (YSBR-005), National Natural Science Foundation of China (22325304, 22221003 and 22033007), and the Fundamental Research Funds for Central Universities (WK2060000017). We acknowledge the Supercomputing Center of USTC, Hefei Advanced Computing Center, Beijing PARATERA Tech CO., Ltd for providing high-performance computing service. We also thank Dr. Wei Hu, Dr. Xinming Qin, Dr. Mouyi Weng, and Junfeng Qiao for the very helpful discussion.

Author information

Yaolong Zhang
Present address: École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland

Authors and Affiliations

Key Laboratory of Precision and Intelligent Chemistry, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui, 230026, China
Yaolong Zhang & Bin Jiang
Hefei National Laboratory, University of Science and Technology of China, Hefei, 230088, China
Bin Jiang

Authors

Yaolong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bin Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.J. designed the project, and Y.Z. and B.J. discussed the neural network architecture. Y.Z. wrote the code and performed all calculations. Y.Z. and B.J. wrote the manuscript.

Corresponding author

Correspondence to Bin Jiang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Rodrigo Alejandro Vargas-Hernández, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, Y., Jiang, B. Universal machine learning for the response of atomistic systems to external fields. Nat Commun 14, 6424 (2023). https://doi.org/10.1038/s41467-023-42148-y

Download citation

Received: 16 April 2023
Accepted: 01 October 2023
Published: 12 October 2023
DOI: https://doi.org/10.1038/s41467-023-42148-y

This article is cited by

Electrofreezing of liquid water at ambient conditions
- Giuseppe Cassone
- Fausto Martelli
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Highly accurate protein structure prediction with AlphaFold

De novo design of protein structure and function with RFdiffusion

Collective intelligence: A unifying concept for integrating biology across scales and substrates

Introduction

Results

FIREANN framework

A toy system

Molecular spectroscopy

Liquid water

Comparison with previous models

Discussion

Methods

REANN

Training details

Computational details and datasets

A toy system

NMA

Liquid water

MD simulations with FIREANN models

NMA

Liquid water

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Peer Review File

Source data

Source Data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Electrofreezing of liquid water at ambient conditions

Comments

Search

Quick links