Introduction

Computer simulations have transformed our understanding of molecular systems by providing atomic-level insights into phenomena of widespread importance. The earliest models used efficient empirical descriptions of interatomic interactions, and similar force field-based simulations form the foundation of molecular simulations today1. However, empirical force fields struggle to describe processes that involve bond breakage and formation, such as chemical reactions, as well as electronic polarization effects. The development of quantum mechanics-based ab initio simulations enabled the description of these complex processes, leading to profound insights across scientific disciplines2,3,4,5,6,7,8,9. The vast majority of these first-principles approaches rely on density functional theory (DFT), and the development of increasingly accurate density functionals has greatly improved the reliability of ab initio predictions10,11,12,13,14,15. However, electronic structure calculations are expensive, and first-principles simulations are therefore limited to small system sizes and short time scales.

The prohibitive expense of ab initio simulations can be overcome through machine learning. Armed with a set of ab initio data, machine learning can be used to train neural network (NN) potentials that describe interatomic interactions at the same level of accuracy as the ab initio methods, but at a fraction of the cost. Consequently, NN potentials enable ab initio quality simulations to reach the large system sizes and long time scales needed to model complex phenomena, such as phase diagrams16,17,18,19,20 and nucleation21,22.

Despite the significant advances made in this area, there are still practical and conceptual difficulties with NN potential development, especially with regard to long-range electrostatics. To make NN potential construction computationally feasible, most approaches learn only local arrangements of atoms around a central particle, where the meaning of “local” is defined by a distance cutoff usually <1 nm. Because of this locality, the resulting NN potentials are inherently short-ranged. The lack of long-range interactions in NN potentials can lead to both quantitative and qualitative errors, especially when describing polar and charged species23,24,25.

The need for incorporating long-range electrostatics into NN potentials has led to the development of several new approaches23,24,26,27,28,29. Many of these approaches exclude all or some of the electrostatic interactions from training and then assign effective partial charges to each atomic nucleus, which are used to calculate long-range electrostatic interactions with traditional methods23,25,26,27,28. The values of these effective charges can be determined using machine learning methods. For example, the fourth-generation high-dimensional neural network potential (4G-HDNNP)28 employs deep NNs to predict the electronegativities of each nucleus, which are subsequently used within a charge equilibration process to determine the effective charges. These approaches can predict binding energies and charge transfer between molecules, but they also introduce quantities that are not direct physical observables, such as the effective charges and electronegativities. Another approach explicitly incorporates nonlocal geometric information into the construction of local feature functions24,30. This approach, referred to as the long-distance equivariant representation, predicts the binding energy between molecules and the polarizability of molecules more accurately than purely local models. However, this model takes only the nuclear coordinates as input and cannot handle external fields.

The difficulties that current approaches to NN potentials have when treating long-range interactions can be resolved by a purely ab initio strategy that uses no effective quantities. Such a strategy can be informed by our understanding of the roles of short- and long-range interactions in condensed phases31,32,33,34. In uniform liquids, appropriately chosen, slowly varying components of the long-range forces (van der Waals attractions and long-range Coulomb interactions) cancel to a good approximation in every relevant configuration. As a result, the local structure is determined almost entirely by short-range interactions. In water, these short-range interactions correspond to hydrogen bonding and packing35,36,37,38. Therefore, short-range models, including current NN potentials, can describe the structure of uniform systems. This idea, that short-range forces determine the structure of uniform systems, forms the foundation for the modern theory of bulk liquids31,32,33, in which the averaged effects of long-range interactions can be treated as a small correction to the purely short-range system.

In contrast, the effects of long-range interactions are more subtle and play a role in collective effects that are important for dielectric screening. Moreover, long-range forces do not cancel at extended interfaces and instead play a key role in interfacial physics. As a result, short-range models cannot describe interfacial structure and thermodynamics, as they do in the bulk, and standard NN models fail to describe even the simplest liquid-vapor interfaces25. The local molecular field (LMF) theory of Weeks and coworkers provides a framework for capturing the average effects of long-range interactions at interfaces through an effective external field34,39,40,41,42. LMF theory also provides physically intuitive insights into the roles of short- and long-range forces at interfaces that can be leveraged to model nonuniform systems.

Here, we exploit the physical picture provided by liquid-state theory to develop a general approach for learning long-range interactions in NN potentials from ab initio calculations. We separate the atomic interactions into appropriate short-range and long-range components and construct a separate network to handle each part. Importantly, the short-range model is isolated from the long-range interactions. This separation also isolates the long-range response of the system, enabling it to be learned. Short-range interactions can be learned using established approaches. The short- and long-range components of the potential are then connected through a rapidly converging self-consistent loop. The resulting self-consistent field neural network (SCFNN) model is able to describe the effects of long-range interactions without the use of effective charges or similar artificial quantities. We illustrate this point through the development of a SCFNN model of liquid water. In addition to capturing the local structure of liquid water, as evidenced by the radial distribution function, the SCFNN model accurately describes long-range structural correlations connected to dielectric screening, the response of liquid water to electrostatic fields, and water’s dielectric constant. Because the SCFNN model learns the response to electrostatic fields, it can predict properties that depend on screening in environments for which it was not trained. We demonstrate this by using the SCFNN trained on bulk configurations to model the orientational ordering of water at the interface with its vapor. Finally, the SCFNN also captures the electronic fluctuations of water and can accurately predict its high-frequency dielectric constant.

Results

Workflow of the SCFNN model

The SCFNN model consists of two modules that each target a specific response of the system (Fig. 1). Module 1 predicts the electronic response via the position of the maximally localized Wannier function centers (MLWFCs). Module 2 predicts the forces on the nuclear sites. In turn, each module consists of two networks: one to describe the short-range interactions and one to describe perturbations to the short-range system from long-range electric fields. Together, these two modules (four networks) enable the model to predict the total electrostatic properties of the system.

Fig. 1: Schematic of the self-consistent field neural network (SCFNN).

The SCFNN consists of two modules, each with two networks. One network learns the short-range interactions (S) and the other learns the effects of long-range interactions (L). Module 1 learns the positions of maximally localized Wannier function centers, rw, and Module 2 learns the forces, F, on the atomic nuclei, the positions of which are indicated by R.

In the short-range system, the v(r) = 1/r portion of the Coulomb potential is replaced by the short-range potential \(v_{0}(r)=\mathrm{erfc}(r/\sigma)/r\). Physically, v0(r) corresponds to screening the charge distributions in the system through the addition of neutralizing Gaussian charge distributions of opposite sign—the interactions are truncated by Gaussians. Therefore, we refer to this system as the Gaussian-truncated (GT) system34,35,36,37,38. By making a physically meaningful choice for σ, the GT system can describe the structure of bulk liquids with high accuracy but with a fraction of the computational cost. Moreover, the GT system has served as a useful short-range component system when modeling the effects of long-range fields37,39,41,43,44. Here, we choose σ to be 4.2 Å (8 Bohr), which is large enough for the GT system to accurately describe hydrogen bonding and the local structure of liquid water34,35,36,37,38.

The remaining part of the Coulomb interaction, \(v_{1}(r)=v(r)-v_{0}(r)=\mathrm{erf}(r/\sigma)/r\), is long-ranged but varies slowly over the scale of σ. Because v1(r) is uniformly slowly varying, the effective field produced by v1(r) usually induces a linear response in the GT system, so the effects of v1(r) can be captured by linear models. In the context of NNs, we demonstrate below that a linear network is sufficient to learn the linear response induced by long-range interactions.
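As a minimal illustration of this splitting (a sketch in arbitrary charge units, not production code; the value of σ follows the choice above), the two components can be evaluated with standard error functions:

```python
# Sketch of the Coulomb splitting that defines the Gaussian-truncated (GT) system:
# v(r) = v0(r) + v1(r), with v0 short-ranged (erfc) and v1 slowly varying (erf).
import numpy as np
from scipy.special import erf, erfc

SIGMA = 4.2  # Angstrom, the smearing length chosen in the text

def v0(r, sigma=SIGMA):
    """Short-range, Gaussian-truncated part of the Coulomb potential."""
    return erfc(r / sigma) / r

def v1(r, sigma=SIGMA):
    """Long-range, slowly varying remainder, v(r) - v0(r)."""
    return erf(r / sigma) / r

r = np.linspace(0.5, 15.0, 50)
assert np.allclose(v0(r) + v1(r), 1.0 / r)  # the two pieces recombine to the full 1/r
```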

Module 1

The separation of interactions into short- and long-range components is crucial to the SCFNN model. In particular, the two networks of each module are used to handle this separation. Network 1S of Module 1 predicts the positions of the MLWFCs in the short-range GT system, while Network 1L predicts the perturbations to the MLWFC positions induced by the effective long-range field. Networks 1S and 1L leverage Kohn’s theory on the nearsightedness of electronic matter (NEM)45,46. The NEM states that46 “local electronic properties, such as the density n(r), depend significantly on the effective external potential only at nearby points.” Here the effective external potential includes the external potential and the self-consistently determined long-range electric fields. Therefore, the NEM suggests that the electronic density, and consequently the positions of the MLWFCs, are “nearsighted” with respect to the effective potential, but not to the atomic coordinates, contrary to what has been assumed in previous work that also uses local geometric information of atoms as input to NNs47,48. An atom located at \(\mathbf{r}'\) will affect the effective potential at r, even if \(\mathbf{r}'\) is far from r, through long-range electrostatic interactions. Consequently, current approaches to generating NN models can only predict the position of MLWFCs for a purely short-range system without long-range electrostatics, such as the GT system47,48. We exploit this fact and use established NNs to predict the locations of the MLWFCs in the GT system47. To do so, we create a local reference frame around each water molecule (Fig. 2) and use the coordinates of the surrounding atoms as inputs to the NN. The local reference system preserves the rotational and translational symmetry of the system. The network outputs the positions of the four MLWFCs around the central water, which are then transformed to the laboratory frame of reference.

Fig. 2: Local frame around a central water.

The y-axis is along one OH bond. The z-axis is perpendicular to the plane of the molecule. The x-axis is perpendicular to the y and z axes.
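A minimal sketch of such a frame construction is given below; the specific conventions (which OH bond defines the y-axis and the ordering of the axes) are illustrative assumptions rather than the exact choices made in the model:

```python
# Hypothetical sketch of the local molecular frame of Fig. 2, built from the
# oxygen and hydrogen positions of a single water molecule.
import numpy as np

def local_frame(r_O, r_H1, r_H2):
    """Return a 3x3 matrix whose rows are the local x, y, z unit vectors."""
    y = r_H1 - r_O                         # y-axis along one OH bond
    y /= np.linalg.norm(y)
    z = np.cross(r_H1 - r_O, r_H2 - r_O)   # z-axis normal to the molecular plane
    z /= np.linalg.norm(z)
    x = np.cross(y, z)                     # completes a right-handed, orthonormal frame
    return np.vstack([x, y, z])

def to_local(frame, r_O, r_atom):
    """Express a neighboring atom's position relative to the molecule in the local frame."""
    return frame @ (r_atom - r_O)
```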

Network 1L predicts the response of the MLWFC positions to the effective field E(r), defined as the sum of the external field, Eext(r), and the long-range field from v1(r):

$$\mathbf{E}(\mathbf{r})=\mathbf{E}_{\mathrm{ext}}(\mathbf{r})-\int d\mathbf{r}'\,\rho(\mathbf{r}')\,\nabla v_{1}(|\mathbf{r}-\mathbf{r}'|)\,,$$
(1)

where \(\rho(\mathbf{r}')\) is the instantaneous charge density of the system, including nuclear and electronic charges. Network 1L also introduces a local reference frame for each water molecule. However, Network 1L takes as input both the local coordinates and local effective electric fields. The NEM suggests that this local information is sufficient to determine the perturbation in the MLWFC positions. Network 1L outputs this change in the positions of the water molecule’s four MLWFCs, and this perturbation is added to the MLWFC position determined in the GT system to obtain the MLWFCs in the full system. We note that E(r) is a slowly-varying long-range field, such that the MLWFCs respond linearly to this field. Therefore, Network 1L is constructed to be linear in E(r). Table 1 demonstrates that the linear response embodied by Network 1L predicts the perturbation of the MLWFCs with reasonable accuracy.

Table 1 Mean absolute error (MAE), multiplied by 100, of Networks 1L and 2L in predicting the changes in the maximally localized Wannier function center (MLWFC) positions (Å) and in the forces (eV/Å) on the oxygen and hydrogen nuclei, FO and FH, respectively, along the z-direction when fields of strength 0.1 and 0.2 V/Å are applied along that direction.

We now need to determine the effective field E(r). This effective field depends on the electron density distribution, but evaluating and including the full three-dimensional electron density for every configuration in a training set requires a prohibitively large amount of storage space. Instead, we approximate the electron density by the charge density of the MLWFCs, assuming each MLWFC is a point charge of magnitude −2e0. This approximation is often used when computing molecular multipoles, as needed to predict vibrational spectra, for example14,48. Here it is important to note that the MLWFs of water are highly localized, so that the center gives a reasonable representation of the location of the MLWF. Moreover, the electron density is essentially smeared over the scale of σ through a convolution with v1(r), which makes the resulting fields relatively insensitive to short-wavelength variations in the charge density. As a result, the electron density can be accurately approximated by the MLWFC charge density within our approach.

The effective field is a functional of the set of MLWFC positions, \(\mathbf{E}[\{\mathbf{r}_{w}\}]\), and the positions of the MLWFCs themselves depend on the field, rw[E]. Therefore, we determine E and \(\{\mathbf{r}_{w}\}\) through self-consistent iteration. Our initial guess for E is obtained from the positions of the MLWFCs in the GT system. We then iterate this self-consistent loop until the MLWFC positions no longer change, within a tolerance of 2.6 × 10−4 Å. In practice, we find that self-consistency is achieved quickly.
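The structure of this iteration is sketched below; network_1S, network_1L, and effective_field are hypothetical placeholders for the trained networks and for the evaluation of Eq. (1), not the actual implementation:

```python
# Schematic of the self-consistent loop coupling E(r) and the MLWFC positions.
import numpy as np

TOL = 2.6e-4  # Angstrom, convergence tolerance on the MLWFC positions

def scf_mlwfc(nuclei, E_ext, network_1S, network_1L, effective_field, max_iter=50):
    r_w = network_1S(nuclei)                     # initial guess: MLWFCs of the GT system
    for _ in range(max_iter):
        E = effective_field(nuclei, r_w, E_ext)  # Eq. (1): external + long-range field
        r_w_new = network_1S(nuclei) + network_1L(nuclei, E)
        if np.max(np.abs(r_w_new - r_w)) < TOL:
            return r_w_new, E                    # converged MLWFCs and effective field
        r_w = r_w_new
    raise RuntimeError("self-consistent loop did not converge")
```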

Module 2

After Module 1 predicts the positions of the MLWFCs, Module 2 predicts the forces on the atomic sites. As with the first module, Module 2 consists of two networks: one that predicts the forces of the GT system and another that predicts the forces produced by E(r). To predict the forces in the GT system, we adopt the network used by Behler and coworkers49. This network, Network 2S, takes local geometric information of the atoms as inputs and, consequently, cannot capture long-range interactions. To describe long-range interactions, we introduce a second network (Network 2L in Fig. 1). This additional network predicts the forces on atomic sites due to the effective field E(r), which properly accounts for long-range interactions in the system. In practice, we again introduce a local reference frame for each water molecule and use local atomic coordinates and local electric fields as inputs. In this case, we also find that a network that is linear in E(r) accurately predicts the resulting long-range forces, consistent with the linear response of the system to a slowly-varying field.

In practice, separating the data obtained from standard DFT calculations into the GT system and the long-range effective field is not straightforward. To solve this problem, we apply homogeneous electric fields of varying strength while keeping the atomic coordinates fixed. The fields only perturb the positions of the MLWFCs and the forces on the atoms—these perturbations are not related to the GT system. The changes induced by these electric fields are directly obtained from DFT calculations and are used to train Networks 1L and 2L and learn the response to long-range effective fields. The remaining part of the DFT data, which has the long-range field E(r) removed, is used to train Networks 1S and 2S and learn the response of the short-ranged GT system. See the “Methods” section for a more detailed discussion of the networks and the training procedure.

We emphasize that our approach to partitioning the system into a short-range GT piece and a long-range perturbation piece is different from other machine learning approaches for handling long-range electrostatics. The standard approaches usually partition the total energy into two parts, a short-ranged energy and an Ewald energy that is used to evaluate the long-range interactions. However, this partitioning results in a coupling between the short- and long-range interactions. For example, the short-range part of the energy in the 4G-HDNNP model depends on the effective charges that are assigned to the atoms, but these effective charges depend on long-range electrostatic interactions through the global charge equilibration process used to determine their values28. In contrast, the SCFNN approach isolates the short-range interactions during the training process and connects the short-range model to long-range interactions through E(r) via self-consistency. The GT system embodied by Networks 1S and 2S does not depend on long-range electrostatics, even implicitly; it is completely uncoupled from the long-range interactions. The effects of long-range electrostatic interactions are isolated within the second network of each module, Network 1L and Network 2L in Fig. 1. This separation of short- and long-ranged effects is similar in spirit to the principles underlying LMF theory34,39,41 and related theories of uniform liquids32,33,35,36,38.

Water’s local structure is insensitive to long-range interactions

We demonstrate the success of the SCFNN approach by modeling liquid water. Water is the most important liquid on Earth, yet the importance of both short- and long-range interactions makes it difficult to model. Short-range interactions are responsible for water’s hydrogen bond network, which is essential to its structure and to its unusual but important thermodynamic properties36,50. Long-range interactions play key roles in water’s dielectric response and interfacial structure, and can even influence water-mediated interactions41,51. Because of this broad importance, liquid water has served as a prototypical test system for many machine learning-based models17,24,48,49,52. Here, we test our SCFNN model on a system of bulk liquid water by performing molecular dynamics (MD) simulations of 1000 molecules in the canonical ensemble under periodic boundary conditions.

One conventional test of the validity of an NN potential is to compare the radial distribution function, g(r), between atomic sites for the different models. The g(r) predicted by the SCFNN model is the same as that predicted by the Behler–Parrinello (BP) model49 for all three site–site correlations in water (Fig. 3). This level of agreement may be expected, based on previous work examining the structure of bulk water36,37,38,40,41. The radial distribution functions of water are determined mainly by short-range, nearest-neighbor interactions, which arise from packing and hydrogen bonding; long-range interactions have little effect on the main features of g(r). Consequently, purely short-range models, like the GT system, can quantitatively reproduce the g(r) of water36,37,38,40,41. Similarly, the short-range BP model accurately describes the radial distribution functions, as does the SCFNN model, which includes long-range interactions.

Fig. 3: Local structure of bulk water.

Comparison of the radial distribution functions for a O-O, b O-H, and c H-H correlations in liquid water, as predicted by molecular dynamics simulations of the self-consistent field neural network (SCFNN) and Behler–Parrinello (BP) models.

Long-range electrostatics and dielectric response

Though the short-range structure exemplified by the radial distribution function is insensitive to long-range interactions, long-range correlations are not. For example, the longitudinal component of the dipole density or polarization correlation function evaluated in reciprocal space, \(\chi_{zz}^{0}(\mathbf{k})\), was recently shown to be sensitive to long-range interactions44. This correlation function is defined according to

$$\chi_{zz}^{0}(\mathbf{k})=\frac{1}{V}\left\langle\sum_{l,j}\frac{(\mathbf{k}\cdot\mathbf{p}_{l})\,(\mathbf{k}\cdot\mathbf{p}_{j})}{k^{2}}\,e^{-i\mathbf{k}\cdot(\mathbf{r}_{l}-\mathbf{r}_{j})}\right\rangle\,,\quad\mathrm{with}\ \ \mathbf{k}=k\hat{\mathbf{z}}\,.$$
(2)

Here pj is the dipole moment of water molecule j and rj is the position of the oxygen atom of water molecule j.
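A sketch of how Eq. (2) can be evaluated for wavevectors along z is given below; p and r are hypothetical names for the arrays of molecular dipoles and oxygen positions of a single configuration, and in practice the result is averaged over many configurations:

```python
# Sketch of evaluating the longitudinal polarization correlation function, Eq. (2),
# for a set of wavevectors k = k z-hat, using one configuration's dipoles and positions.
import numpy as np

def chi_zz(p, r, k_values, volume):
    """p: (N, 3) molecular dipoles; r: (N, 3) oxygen positions; k_values: 1D array of k."""
    chi = np.zeros(len(k_values))
    for i, k in enumerate(k_values):
        kvec = np.array([0.0, 0.0, k])
        # longitudinal projection (k . p_j)/k weighted by the phase factor e^{-i k . r_j}
        a = (p @ kvec) / k * np.exp(-1j * (r @ kvec))
        chi[i] = np.real(np.sum(a) * np.conj(np.sum(a))) / volume
    return chi
```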

Here we compare the longitudinal polarization correlation function predicted by our SCFNN model and the BP model. The original BP model is not able to predict molecular charge distributions. Therefore, to predict the dipole moment of water, we couple the BP model with the short-range part of the SCFNN model that predicts MLWFCs (Network 1S). We note that a similar strategy was used in previous work47.

The longitudinal polarization correlation functions predicted by our SCFNN model and the BP model agree everywhere except at small k, indicating that long-range correlations are different in the two models (Fig. 4a). The long-wavelength behavior of the polarization correlation function is related to the dielectric constant via35,53,54

$$\lim_{k\to 0}\chi_{zz}^{0}(\mathbf{k})=\varepsilon_{0}k_{\mathrm{B}}T\,\frac{\varepsilon-1}{\varepsilon}\,,$$
(3)

where ε ≈ 100 is the value of the dielectric constant of water predicted by the SCFNN, as discussed below. The \(\chi_{zz}^{0}(\mathbf{k})\) predicted by our SCFNN model is consistent with the expected behavior at small k. In contrast, short-range models, like the GT system35,44,54 and the BP model, significantly deviate from the expected asymptotic value. Consequently, these short-range models are expected to have difficulties describing the dielectric screening that is important in nonuniform systems25,37,39,41,44, for example.

Fig. 4: Long-range polarization in bulk water.

a The longitudinal polarization correlation function in reciprocal space, \(\chi_{zz}^{0}(\mathbf{k})\), shows differences between the self-consistent field neural network (SCFNN) and Behler–Parrinello (BP) models at low k. In particular, the SCFNN model plateaus as k → 0 in a manner consistent with the theoretical prediction (green line), while the BP (short-range) model does not. b The polarization, P, induced by a homogeneous displacement field of magnitude D along the z-axis is accurately predicted by the SCFNN model, evidenced by the agreement with dielectric continuum theory (DCT) predictions.

To further examine the dielectric properties of the NN models, we can apply homogeneous fields of varying strength to the system and examine its response. To do so, we performed finite-field simulations at constant displacement field, D. These finite-D simulations55 can be naturally combined with our SCFNN model, unlike many other NN models that cannot handle external fields. Following previous work44, we use \(\mathbf{D}=D\hat{\mathbf{z}}\), vary the magnitude of the displacement field from D = 0 V/Å to D = 0.4 V/Å, and examine the polarization, P, induced in water. As shown in Fig. 4b, the polarization induced in water by the external field agrees with the predictions of dielectric continuum theory, as expected, further suggesting that the SCFNN model properly describes the dielectric response of water. To the best of our knowledge, this is the first NN model that can accurately describe the response of a system to external fields. We emphasize that this response is achieved by learning the long-range response via Networks 1L and 2L.
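For reference, the dielectric continuum theory (DCT) curve in Fig. 4b follows from the standard linear-dielectric relations (a textbook result, not specific to the SCFNN): combining D = ε0E + P with E = D/(ε0ε) gives

$$\mathbf{P}=\left(1-\frac{1}{\varepsilon}\right)\mathbf{D}\,,$$

so the polarization induced at constant D is determined entirely by the dielectric constant ε.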

Because the SCFNN can predict the response to electrostatic fields, we can use a highly efficient method to estimate the dielectric constant56. To do so, we compute the r-dependent Kirkwood g-factor, GK(r), with E = 0 and D = 0, where

$$G_{\mathrm{K}}(r)=\langle{\boldsymbol{\mu}}_{1}\cdot\mathbf{M}_{1}(r)\rangle/\mu^{2},$$
(4)

where μ1 is the dipole of a water molecule at the origin and M1(r) is the total dipole moment in a sphere of radius r, including the molecule at the origin. The composite Kirkwood g-factor,

$$G_{\mathrm{Kc}}(r)=\frac{1}{3}\left[2G_{\mathrm{K}}(r)_{\mathbf{E}=0}+G_{\mathrm{K}}(r)_{\mathbf{D}=0}\right],$$
(5)

converges rapidly with r to a constant gK, which is related to the dielectric constant through Kirkwood’s relation for polarizable molecules56

$$\frac{4\pi\beta N\mu^{2}g_{\mathrm{K}}}{V}=\frac{(\varepsilon-1)(2\varepsilon+1)}{\varepsilon}-\frac{(\varepsilon_{\infty}-1)(2\varepsilon_{\infty}+1)}{\varepsilon_{\infty}},$$
(6)

where N is the number of water molecules, V is the system volume, β = 1/(kBT), and ε∞ is the high-frequency dielectric constant that arises from electronic polarization; ε∞ ≈ 1.65 for the SCFNN model, as discussed below. As shown in Fig. 5a, the composite correlation function plateaus to a constant value near a distance of 6 Å, as expected56. By replacing gK in Eq. (6) with GKc(r) and inverting, we can compute the effective distance-dependent dielectric constant, shown in Fig. 5b. The dielectric constant rapidly converges to the bulk value of ε ≈ 100, which is close to estimates provided by van der Waals corrected functionals of similar accuracy14 and significantly less than that predicted by the PBE functional that overstructures water56.
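Since Eq. (6) is quadratic in ε, it can be inverted in closed form. The sketch below assumes the Gaussian-unit form of Eq. (6) as written, with N, V, μ, β, and the plateau value gK (or GKc(r)) taken from the simulation:

```python
# Sketch of inverting Kirkwood's relation, Eq. (6), for the dielectric constant.
import numpy as np

def kirkwood_side(eps):
    return (eps - 1.0) * (2.0 * eps + 1.0) / eps

def dielectric_constant(g_K, N, V, mu, beta, eps_inf=1.65):
    """Solve Eq. (6) for epsilon given the plateau value g_K (or G_Kc(r))."""
    A = 4.0 * np.pi * beta * N * mu**2 * g_K / V + kirkwood_side(eps_inf)
    # (eps - 1)(2 eps + 1)/eps = A  <=>  2 eps^2 - (1 + A) eps - 1 = 0
    return ((1.0 + A) + np.sqrt((1.0 + A)**2 + 8.0)) / 4.0
```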

Fig. 5: Estimating the dielectric constant.

a The Kirkwood g-factors for zero electric field, E = 0, zero displacement field, D = 0, and the composite correlation function, GKc(r), obtained from their superposition. b The effective distance-dependent dielectric constant obtained from GKc(r) for the SCFNN model of water.

To push the limits of the SCFNN model, we can ask if it can properly predict dielectric screening in nonuniform environments, for which it was not trained. To do so, we simulate a water-vapor interface by extending the simulation cell along the z-axis to create a slab of water surrounded by a large vacuum region on either side. Because we have only trained on bulk configurations and not on configurations in the nonuniform system, we cannot expect the BP or the SCFNN model to accurately reproduce all features of the interface. Yet, both models do produce a stable interface, as shown by the densities in Fig. 6a, although the SCFNN interfaces are narrower than those of the BP model. Both models predict densities that are lower than those predicted by models explicitly trained for the interface, which may be expected because the bulk models did not learn the unbalanced dispersion forces that exist at interfaces25,36,57. However, the bulk density predicted by the SCFNN model is larger than that of the BP model, in better agreement with experiments.

Fig. 6: The structure of the water-vapor interface.

a Water density and b average cosine of the angle formed by the water dipole moment and the surface normal for the Behler–Parrinello (BP) and SCFNN models without any additional training.

Dielectric screening manifests in the orientational structure of interfacial water, and we examine the orientational preferences of water by computing \(\langle\cos\theta(z)\rangle\), where θ(z) is the angle formed by the surface normal and the dipole moment vector of a water molecule located at z. At the water–vapor interface, water molecules tend to point their dipoles slightly toward the vapor phase, a consequence of breaking an average of one H-bond per molecule at water’s surface. This dipole layer is screened by subsequent layers of water, such that no net orientation and zero electric field exist in the bulk. In the absence of long-range electrostatics, this screening is not achieved, and short-range models result in extended ordering from the interface into the bulk25,36,37,39,44. Indeed, the short-ranged BP model results in long-ranged orientational ordering of water at the liquid–vapor interface because it lacks dielectric screening. In contrast, the SCFNN model displays the expected behavior. A single peak in \(\langle\cos\theta(z)\rangle\) appears near the interface and decays to zero in the bulk of the slab due to proper screening of the interfacial dipole layer. This successful prediction suggests that the SCFNN approach may lead to the creation of NN models that are at least partially transferable to different environments.
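A sketch of how this orientational profile can be computed from the molecular dipoles is given below (the binning scheme and array layout are illustrative assumptions):

```python
# Sketch of the orientational profile <cos theta(z)>: bin water molecules by the
# z-coordinate of their oxygen and average the cosine of the angle between the
# molecular dipole and the surface normal (taken along z).
import numpy as np

def cos_theta_profile(dipoles, z_positions, z_edges):
    """dipoles: (N, 3) molecular dipoles; z_positions: (N,); z_edges: bin edges along z."""
    cos_t = dipoles[:, 2] / np.linalg.norm(dipoles, axis=1)
    profile = np.zeros(len(z_edges) - 1)
    for i in range(len(z_edges) - 1):
        mask = (z_positions >= z_edges[i]) & (z_positions < z_edges[i + 1])
        profile[i] = cos_t[mask].mean() if mask.any() else 0.0
    return profile
```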

Electronic fluctuations

In addition to the screening encompassed by the static dielectric constant, the SCFNN model can also properly predict electronic fluctuations of water and the high-frequency dielectric constant. To quantify electronic fluctuations, we compute the probability distribution of the magnitude of the water dipole moment from our simulations of bulk water using the SCFNN model, Fig. 7a. This distribution is dominated by the electronic polarization of water molecules and has a width consistent with predictions from ab initio MD simulations58,59,60,61. Moreover, the mean of the distribution yields an average dipole moment (2.9 D) in agreement with that estimated from experiments (2.9 D)62, further supporting that the SCFNN produces an accurate description of the molecular charge distribution in liquid water.

Fig. 7: Electronic fluctuations in bulk water.

a Probability distribution of the water dipole moment in bulk simulations of the SCFNN water model. Vertical solid and dashed lines indicate the average value estimated from experiments and the simulations, respectively. Data points indicate dipole moments predicted using the short-range network (SCFNN-SR). b Induced polarization as a function of the applied electric field with fixed nuclear configurations, which characterizes the high-frequency, electronic response of water to external fields. The solid line indicates predictions from dielectric theory used to estimate the high-frequency dielectric constant of the SCFNN model. c Probability distribution of the water dipole moment obtained using SCFNN and 4G-HDNNP models for the same set of nuclear configurations. Results for the 4G-HDNNP model are shown for both Hirshfeld and Mulliken charges. The set of configurations is from a bulk simulation of the SPC/E water model. Vertical dashed lines indicate the average dipole moment of each distribution.

We also decomposed the dipole moment distribution into contributions from short- and long-range interactions. The short-range contribution to the electronic polarization is determined by Network 1S and the long-range part is determined by Network 1L. As shown in Fig. 7a, the molecular dipole moment distribution in bulk water is determined by short-range interactions, where the nuclear configurations of the bulk were determined using the full SCFNN model. This is consistent with the idea that local structure in a uniform bulk liquid, and fluctuations about that local structure, are determined by short-range interactions.

Long-range electronic effects on electrostatic screening are quantified by the high-frequency dielectric constant, ε∞. Physically, ε∞ can be thought of as the amount by which an electric field is screened without altering the positions of the nuclei; it quantifies the electronic response to applied fields. To estimate ε∞, we perform precisely this exercise: we compute the polarization of water in response to an external electric field of magnitude E and keep all positions of the nuclei fixed. The resulting polarization, shown in Fig. 7b, is consistent with a linear response to the field, as expected for dielectric screening. Fitting the induced electronic polarization to dielectric continuum theory expectations yields ε∞ ≈ 1.65, in good agreement with the experimental value of 1.77, demonstrating that the SCFNN model can accurately predict long-range electronic response to electrostatic fields.
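A minimal sketch of this estimate is given below, assuming units in which the linear electronic response takes the form P = ε0(ε∞ − 1)E and a least-squares fit constrained through the origin; both are illustrative assumptions rather than the exact fitting procedure used:

```python
# Sketch of estimating the high-frequency dielectric constant from the electronic
# polarization induced by applied fields at fixed nuclear positions.
import numpy as np

def fit_eps_inf(E_values, P_values, eps_0=1.0):
    """Fit P = eps_0 (eps_inf - 1) E through the origin and return eps_inf."""
    slope = np.sum(E_values * P_values) / np.sum(E_values**2)
    return 1.0 + slope / eps_0
```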

Finally, we compare the electronic fluctuations of the SCFNN model to predictions made by the 4G-HDNNP model. To do so, we perform an MD simulation of bulk water using the extended simple point charge (SPC/E) water model63 and use the resulting configurations to determine the dipole moment distribution using each NN model, Fig. 7c. Using the same set of configurations allows us to compare only the ability of each model to predict charge distributions.

The 4G-HDNNP relies on atomic partial charges obtained from electronic structure calculations during the training process. The original implementation of the 4G-HDNNP model used Hirshfeld charges28,64. We additionally train another version of the 4G-HDNNP model using Mulliken charges to examine the dependence of the results on the method of determining the atomic partial charges65. See the Methods section for a more detailed discussion of the training procedure.

The SCFNN model results in a dipole moment distribution centered near the experimentally-determined average dipole moment. Moreover, the width of the SCFNN distribution is in good agreement with ab initio predictions58,59,60,61, although slightly narrower than that obtained using SCFNN-generated configurations (Fig. 7a). The 4G-HDNNP models result in significantly narrower distributions than the SCFNN model, and the average molecular dipole moment is either too large (Hirshfeld) or too small (Mulliken). The prediction of distinctly different molecular dipole moments demonstrates a key disadvantage of relying on atomic partial charges during training—the definition of partial charges can be ambiguous and often artificial. Consequently, 4G-HDNNP models trained with different partial charges give different results. In contrast, the SCFNN model removes this ambiguity by representing the molecular charge distributions using MLWFCs.

Discussion

In this work, we have presented a general strategy to construct NN potentials that properly account for the long-range response of molecular systems, which is responsible for dielectric screening and related phenomena. We demonstrated that this model produces the correct long-range polarization correlations in liquid water, as well as the correct response of liquid water to external electrostatic fields. Both of these quantities are related to the dielectric constant and require a proper description of long-range interactions. In contrast, standard approaches to constructing NN potentials result in short-range models that cannot capture these effects.

We anticipate that this approach will be of broad use to the molecular machine learning and simulation community for modeling the electrostatic and dielectric properties of molecular systems. In contrast to short-range interactions that must be properly learned to describe the different local environments encountered at extended interfaces and at solute surfaces, the response of the system to long-range, slowly-varying fields is quite general. Learning the long-range response (through Networks 1L and 2L) is analogous to learning a linear response in most cases, and we expect the resulting model to be relatively transferable; we emphasize, however, that the SCFNN is not limited to the linear response regime. As such, our resulting SCFNN model can make predictions about conditions on which it was not trained. For example, we trained the model for electric fields of magnitude 0, 0.1, and 0.2 V/Å, and then used this model to successfully predict the response of the system to displacement fields with magnitudes between 0 and 0.4 V/Å. This suggests that our approach can be used to train NN models in more complex environments and then accurately predict the response of water to long-range fields in those environments. We also showed that the SCFNN model trained for bulk water can predict orientational structure at the water-vapor interface as a result of learning dipolar screening, further emphasizing the ability of the SCFNN to predict the response of the system to electrostatic fields. The ability to learn the response of condensed phases to applied fields should make the SCFNN appealing for modeling atomic systems in electrochemical environments66, where electrostatic potential differences drive chemical processes, as well as in the modeling of interfaces with polar surfaces where the application of displacement fields is used to properly model surface charge densities67,68.

Our SCFNN approach is complementary to many established methods for creating NN potentials. Learning the short-range, GT system interactions can be accomplished with any method that uses local geometric information, and recent advances in optimizing this training can be leveraged69,70. In this case, the precise form of Networks 1S and 2S can be replaced with an alternative NN. Then, Networks 1L and 2L can be used as defined here, within the general SCFNN workflow, resulting in a variant of the desired NN potential that can describe the effects of long-range interactions. Because of this, we expect our SCFNN approach to be transferable and readily interfaced with current and future machine learning methods for modeling short-range molecular interactions.

We close with a discussion of the limitations of the SCFNN model in its current form and possible strategies for improvement. We rely on defining a local molecular coordinate system on each water molecule, in order to make our model rotationally equivariant. Moreover, we assumed that a specific number of MLWFCs are associated with each molecule, four for each water molecule, and examined their coordinates within the local frame. These steps are complicated when bond breakage and formation occurs. Although the general procedure can be readily extended to many molecules, the set of possible molecules must be known in advance. Strategies for constructing rotationally equivariant NN potentials without a local reference frame have been developed and can be used in place of the strategy used here to develop further generations of the SCFNN model that improve upon these deficiencies30,48,71,72,73.

Methods

Training the SCFNN

Our training and test set consists of 1571 configurations of 64 water molecules52. Homogeneous electric fields were applied to the system, as described further in the next section. We used two-thirds of the configurations for training and one-third to test the training of the network.

To train the networks, we need to separate the DFT data into the GT system and the long-range effective field. However, that separation is not straightforward in practice. To achieve this, we use the differences in the MLWFC locations and forces induced by different fields to fit Networks 1L and 2L. We now describe this procedure in detail for Network 1L; Network 2L was fit following a similar approach.

To learn the effects of long-range interactions, we consider perturbations to the positions of the MLWFCs induced by external electric fields of different magnitudes. Consider applying two fields of strength \(|\mathbf{E}|\) and \(|\mathbf{E}'|\). These fields will alter the MLWFC positions by Δrw[R, E] and \(\Delta\mathbf{r}_{w}'[\mathbf{R},\mathbf{E}']\), respectively. However, neither Δrw nor \(\Delta\mathbf{r}_{w}'\) is directly obtainable from a single DFT calculation. Instead, we can readily compute the difference in perturbations, \(\Delta\mathbf{r}_{w}-\Delta\mathbf{r}_{w}'\), directly from the DFT data, because

$$\Delta\mathbf{r}_{w}-\Delta\mathbf{r}_{w}'=\mathbf{r}_{w}-\mathbf{r}_{w}'\,.$$
(7)

Here rw and \(\mathbf{r}_{w}'\) are the locations of the MLWFCs in the full system in the presence of the fields E and \(\mathbf{E}'\), respectively, and these positions can be readily computed in the simulations. These differences in the MLWFC positions are used to fit Network 1L. In addition, we also exploit the fact that Δrw = 0 when E = 0. This allows us to fix the zero point of Network 1L.

After fitting Networks 1L and 2L, we use them to predict the contribution of the effective field to the MLWFC locations and forces. We then subtract that part from the DFT data. What remains corresponds to the short-range GT system, and this is used to train Networks 1S and 2S.
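A minimal sketch of this bookkeeping is given below, using Eq. (7) with one of the two fields set to zero (so that Δrw = 0 serves as the reference); the function and argument names are hypothetical placeholders:

```python
# Sketch of building training targets for Networks 1L and 1S from DFT data
# collected at different applied field strengths.

def long_range_targets(r_w_field, r_w_zero_field):
    """Target for Network 1L: Delta r_w at finite field, using Delta r_w = 0 at E = 0."""
    return r_w_field - r_w_zero_field

def short_range_targets(r_w_full, network_1L, nuclei, E):
    """Target for Network 1S: full-system MLWFCs minus the fitted long-range shift."""
    return r_w_full - network_1L(nuclei, E)
```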

We now describe the detailed structure of the four networks used here.

Network 1S

In the local frame of water molecule i, we construct two types of symmetry functions as inputs to Network 1S. The first type is the type 2 BP symmetry function74,

$$G_{i}^{2}=\sum_{j\ne i}\exp\left(-\eta(r_{ij}-r_{s})^{2}\right)f_{\mathrm{c}}(r_{ij}).$$
(8)

Here η and rs are parameters that adjust the width and center of the Gaussian, and fc is a cutoff function whose value and slope go to zero at the radial cutoff rc. We adopted the same cutoff function as previous work49, and the cutoff rc is set equal to 12 Bohr.
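A sketch of this radial symmetry function is given below; the cosine form of the cutoff function fc is the standard Behler choice and is assumed here rather than taken from the reference:

```python
# Sketch of the type-2 radial symmetry function, Eq. (8), for one central atom.
import numpy as np

def f_cutoff(r, r_c):
    """Smooth cutoff whose value and slope vanish at r_c (standard cosine form, assumed)."""
    return np.where(r < r_c, 0.5 * (np.cos(np.pi * r / r_c) + 1.0), 0.0)

def G2(r_ij, eta, r_s, r_c):
    """r_ij: 1D array of distances from the central atom to its neighbors."""
    return np.sum(np.exp(-eta * (r_ij - r_s)**2) * f_cutoff(r_ij, r_c))
```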

The second type of symmetry function is similar to the type 4 BP symmetry function74. This symmetry function depends on the angle between rij and the axis of the local frame,

$$\mathbf{G}_{i}^{4}=\sum_{j\ne i}2^{1-\zeta}\left(1+\lambda\frac{\mathbf{r}_{ij}}{r_{ij}}\right)^{\zeta}\exp(-\eta r_{ij}^{2})\,f_{\mathrm{c}}(r_{ij}).$$
(9)

Here, ζ and λ are parameters that adjust the angular dependence.

We use 36 symmetry functions as input to Network 1S. Network 1S itself consists of two hidden layers that contain 24 and 16 nodes. The output layer consists of 12 nodes, corresponding to the three-dimensional coordinates of the four MLWFCs of a central water molecule. Network 1S is a fully connected feed-forward network, and we use \(\tanh (x)\) as its activation function.

Network 1L

In the local frame of water molecule i, we construct one type of symmetry function as input to Network 1L,

$$\mathbf{EG}_{i}^{2}=\sum_{j}\mathbf{E}_{j}\exp\left(-\eta(r_{ij}-r_{s})^{2}\right)f_{\mathrm{c}}(r_{ij}).$$
(10)

Here, Ej is the effective field exerted on atom j. We use 36 symmetry functions as inputs to Network 1L. Network 1L has no hidden layers. The output layer consists of 12 nodes, corresponding to the three-dimensional coordinates of the perturbations of a water molecule’s four MLWFCs induced by the external field.
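A minimal sketch of this architecture is shown below, assuming the 36 field-weighted symmetry functions enter as a flat feature vector and that the zero point is fixed by omitting the bias; both choices, and the use of PyTorch, are illustrative assumptions:

```python
# Hypothetical sketch of Network 1L: a purely linear map from the field-weighted
# symmetry functions to the 12 coordinates of the MLWFC perturbations.
# bias=False makes the predicted perturbation vanish when the effective field is zero.
import torch

network_1L = torch.nn.Linear(in_features=36, out_features=12, bias=False)

features = torch.zeros(1, 36)      # placeholder field-weighted symmetry functions
delta_r_w = network_1L(features)   # shape (1, 12): x, y, z shifts of the four MLWFCs
```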

Network 2S

Network 2S is exactly the same as the BP Network employed in the previous work49. In brief, the network contains 2 hidden layers, each containing 25 nodes. Type 2 and 4 BP symmetry functions are used as inputs to the network. The network for oxygen takes 30 symmetry functions as inputs, while the network for hydrogen takes 27 symmetry functions as inputs. A hyperbolic tangent is used as the activation function.

Network 2L

Network 2L uses the same type of symmetry function as Network 1L. The networks for the force on oxygen and for the force on hydrogen are trained independently. To predict the force on an oxygen atom, we center the local frame on that oxygen atom; when the force on a hydrogen atom is the target, we center the local frame on that hydrogen atom. We use 36 symmetry functions as inputs to Network 2L. Network 2L has no hidden layers. The inputs map linearly onto the forces on the atoms.

4G-HDNNP

The same configurations used to train and test the SCFNN model are used to train and test the 4G-HDNNP. Hirshfeld and Mulliken charges for these configurations are obtained with DFT. Two-thirds of these configurations are used to train the 4G-HDNNP and the remaining one-third is used to test the training. We trained two versions of the 4G-HDNNP, one with Hirshfeld charges and the other with Mulliken charges. The 4G-HDNNP-Hirshfeld model yields an average charge error of 0.012e0 on the test set, while the 4G-HDNNP-Mulliken yields an average charge error of 0.02e0 on the test set.

DFT calculations

The DFT calculations followed previous work52,75 and used published configurations of water as the training set52. In short, all calculations were performed with CP2K (version 7)76,77, using the revPBE0 hybrid functional with 25% exact exchange15,78,79, the D3 dispersion correction of Grimme80, Goedecker–Teter–Hutter pseudopotentials81, and TZV2P basis sets82, with a plane wave cutoff of 400 Ry. Maximally localized Wannier function centers83 were evaluated with CP2K, using the LOCALIZE option. The maximally localized Wannier function spreads were minimized according to previous work84. Hirshfeld and Mulliken charges were determined using the default implementations in CP2K. A homogeneous, external electric field was applied to the system using the Berry phase approach, with the PERIODIC_EFIELD option in CP2K56,85,86. Electric fields of magnitude 0, 0.1, and 0.2 V/Å were applied along the z-direction of the simulation cell. Sample input files are given at Zenodo87.

MD simulations

MD simulations were performed in the canonical (NVT) ensemble, with a constant temperature of 300 K maintained using a Berendsen thermostat88. The system consisted of 1000 water molecules in a cubic box 31.2 Å in length. The equations of motion were integrated with a timestep of 0.5 fs. Radial distribution functions and longitudinal polarization correlation functions were computed from 100 independent trajectories that were each 50 ps in length. Finite-D simulations were performed under the same simulation conditions, and each trajectory was 50 ps long at each magnitude of D.

The liquid–vapor simulation was performed at 300 K. The system consisted of 1000 water molecules. The dimensions of the simulation box were Lx = Ly = 30 Å and Lz = 90 Å. The density profiles and the orientational profiles of water were obtained from 59 independent trajectories that were each 50 ps in length. Each trajectory was equilibrated for at least 50 ps before data were collected.

The SPC/E water63 simulation was performed in the canonical (NVT) ensemble, with a constant temperature of 300 K maintained using a Berendsen thermostat88. The system consisted of 1000 water molecules in a cubic box of length 31.2 Å. One thousand configurations were sampled from a 50 ns long trajectory of the SPC/E water simulation, and the SCFNN and 4G-HDNNP models were applied to these configurations to predict the dipole moments of water molecules.