Elevating density functional theory to chemical accuracy for water simulations through a density-corrected many-body formalism

Dasgupta, Saswata; Lambros, Eleftherios; Perdew, John P.; Paesani, Francesco

doi:10.1038/s41467-021-26618-9

Published: 04 November 2021

Nature Communications volume 12, Article number: 6359 (2021)

Abstract

Density functional theory (DFT) has been extensively used to model the properties of water. Albeit maintaining a good balance between accuracy and efficiency, no density functional has so far achieved the degree of accuracy necessary to correctly predict the properties of water across the entire phase diagram. Here, we present density-corrected SCAN (DC-SCAN) calculations for water which, minimizing density-driven errors, elevate the accuracy of the SCAN functional to that of “gold standard” coupled-cluster theory. Building upon the accuracy of DC-SCAN within a many-body formalism, we introduce a data-driven many-body potential energy function, MB-SCAN(DC), that quantitatively reproduces coupled cluster reference values for interaction, binding, and individual many-body energies of water clusters. Importantly, molecular dynamics simulations carried out with MB-SCAN(DC) also reproduce the properties of liquid water, which thus demonstrates that MB-SCAN(DC) is effectively the first DFT-based model that correctly describes water from the gas to the liquid phase.

Introduction

Its anomalous behavior¹ and importance to life² make water one of the most studied chemical compounds. Among its many unique properties is the high value of the heat capacity that allows water to resist sudden temperature changes, thus permitting living organisms to survive without experiencing significant temperature fluctuations³. In addition, the dynamic nature of the water hydrogen-bond network plays a central role in several fundamental processes, including transport and diffusion in bulk solutions and at interfaces, and hydration of hydrophobic and hydrophilic solutes⁴. For example, protein folding is thought to be driven by the hydrophobic effect⁵. Finally, countless chemical reactions involving charged species take place efficiently in liquid water due to its high dielectric constant^{6,7,8,9,10,11}. It is thus not surprising that a myriad of simulation studies have been devoted to developing a fundamental understanding of both chemical and physical properties of water in different environments and under different thermodynamic conditions^12,13,14.

Density functional theory (DFT)^15,16 is one of the most important tools available to computational chemists and physicists for ab initio simulations of molecular systems in the condensed phase since it offers a good balance between accuracy and computational cost^17,18. However, as discussed in the “Results” section, the accuracy of a DFT calculation depends upon the accuracy of the underlying exchange-correlation (XC) functional, which allows for recasting the many-body electronic structure problem into a (self-consistent) single-particle problem formulated in terms of the Kohn−Sham equations¹⁹. The simplest XC functional, the local spin density approximation (LSDA)^19,20,21, was shown to correctly predict the structure of metallic crystals under pressure^22,23,24, but was unable to fulfill its promises for water simulations, overestimating the strength of the hydrogen bonds and, consequently, predicting an overly dense and overstructured liquid phase^25,26. These limitations hindered the ability of the LSDA functional to describe the properties of water, even qualitatively.

Climbing the Jacob’s ladder of DFT approximations²⁷, the next generation of XC functionals, which were developed within the generalized gradient approximation (GGA)^28,29,30, dominated the scene of ab initio simulations of water for a long time, owing to their higher accuracy compared to LDA and affordable computational cost. Initial successes of the GGA functionals included relatively accurate binding energies for various water clusters and a reasonable description of the structure of liquid water^25,31,32,33. However, it soon became evident that serendipitous error cancellation was the primary reason behind the apparent accuracy of GGA simulations of liquid water, making the predictive power more accidental than consistent^34,35,36,37. For example, it was found that GGA functionals generally underestimate the density of liquid water, while predicting denser ice phases³⁸.

The third rung of the Jacob’s ladder of DFT approximations includes meta-GGA functionals^39,40 that perform significantly better than both LDA and GGA functionals due to the inclusion of the kinetic energy density. Among them, the strongly constrained and appropriately normed (SCAN) functional has gained particular attention because it satisfies all 17 known exact constraints that can be satisfied by a meta-GGA functional⁴¹. Without being fitted to any bonded system, SCAN was shown to enable accurate predictions for various properties of molecules and solids⁴². In particular, for molecular dynamics (MD) simulations of liquid water, SCAN was found to outperform its predecessor GGA functionals^43,44. Importantly, accounting for intermediate-range dispersion interactions, the SCAN functional allows for a more accurate description of the energy differences among water clusters and ice phases^42,44, while, when used in MD simulations, it predicts a density of liquid water which is appreciably closer to the experimental value compared to values obtained with GGA functionals⁴³.

Despite its relatively higher accuracy, SCAN, as all GGA and meta-GGA functionals, is still prone to density-driven errors (defined in the following section), including self-interaction⁴⁵ and delocalization errors^46,47,48,49. It was shown that self-interaction errors in the SCAN functional primarily affect 2-body contributions (defined in the “Results” section) to the interaction energies of water clusters⁵⁰. On the other hand, inclusion of a fraction of Hartree−Fock exchange (also known as exact exchange) in a SCAN hybrid was found to partially reduce density-driven errors in the calculations for various water systems^51,52. However, it was found that increasing the fraction of Hartree−Fock exchange beyond 15% did not improve the accuracy of hybrid SCAN functionals, progressively shifting the structure of liquid water towards that of ice⁵¹. A systematic analysis of hybrid SCAN functionals with varying fractions of Hartree−Fock exchange demonstrated the inability of these functionals to accurately represent 2-body interactions between water molecules, with errors up to ~5 kcal/mol for the water hexamer relative to reference values calculated using coupled cluster theory with single, double, and perturbative triple excitations, i.e., CCSD(T), in the complete basis set (CBS) limit⁵¹, the “gold standard” method for molecular interactions⁵³. In this context, a neural-network potential, NNP-SCAN0, was recently trained on a modified SCAN0 functional that incorporates 10% Hartree−Fock exchange⁵². (It is worth noting that, in its original formulation, the SCAN0 functional mixes 25% Hartree−Fock exchange with 75% SCAN exchange⁵⁴.) Despite providing better agreement with experimental data than SCAN for several properties of liquid water measured at ambient conditions, this improved agreement was achieved by actually performing the NNP-SCAN0 simulations at 330 K⁵².

While all previous studies suggest that SCAN is overall one of the most accurate XC functionals, they also indicate that any further improvement of the accuracy of DFT models for water requires removing, at least partially, the associated density-driven errors. To this end, we introduce here a data-driven many-body potential energy function (PEF) for water, MB-SCAN(DC), which is rigorously derived within a many-body formalism applied to density-corrected SCAN (DC-SCAN) data for individual many-body contributions to the interaction energies between water molecules. Density-corrected DFT (DC-DFT)^{55,56,57,58,59,60,61,62,63}, where the Hartree−Fock density is used instead of the Kohn-Sham density, is known to mitigate density-driven errors in GGA and meta-GGA functionals, especially nonempirical ones. In this regard, density-driven errors associated with calculations carried out on water clusters using the GGA PBE functional were found to be significant⁶³. Here, we show that both binding and interaction energies calculated with the DC-SCAN functional for various water clusters are close to the CCSD(T)/CBS reference values, with DC-SCAN correctly reproducing each individual many-body contribution to the interaction energies. Importantly, we demonstrate that the MB-SCAN(DC) PEF preserves the accuracy of DC-SCAN and enables simulations of liquid water with significantly higher accuracy than all previous DFT-based models reported in the literature (including both ab initio and neural-network models), predicting structural, thermodynamic, and dynamical properties in quantitative agreement with experiment.

Results

Theoretical background

In ground-state Kohn−Sham DFT¹⁹, the energy is self-consistently minimized as:

$$E=\min_{n}\left\{F[n]+\int d^{3}r\, n(\mathbf{r})\,v(\mathbf{r})\right\}$$

(1)

where the minimizing n(r) is the ground-state density, v(r) is the external potential, and F[n] includes the exact non-interacting kinetic and Hartree electrostatic energy terms plus an exchange-correlation (XC) energy. Since the exact XC functional is unknown, different DFT approximations have been developed to solve Eq. (1). The total-energy error ΔE associated with different DFT approximations can be written as the sum of the functional-driven error, ΔE_F, and the density-driven error, ΔE_D⁵⁹:

$$\Delta E=\Delta E_{\mathrm{F}}+\Delta E_{\mathrm{D}}$$

(2)

The functional-driven error $\Delta E_{\mathrm{F}}=E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{exact}}]-E_{\mathrm{XC}}^{\mathrm{exact}}[n_{\mathrm{exact}}]$ arises from the difference between the approximate XC functional, F[n], and the (unknown) exact functional, while the density-driven error $\Delta E_{\mathrm{D}}=E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{approx}}]-E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{exact}}]$ arises from using an approximate density n(r) to solve Eq. (1). In most systems, the functional-driven error is the main contribution to the total error^56,59. By many measures, the best nonempirical functionals predict more accurate densities for neutral atoms than the heavily parameterized empirical functionals or even Hartree−Fock theory⁶⁴. But they still make density-driven delocalization errors^65,66 that can dominate the total error under special conditions^57,67.
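As a short consistency check, using only the two definitions given above, the functional-driven and density-driven terms telescope when added: the approximate functional evaluated on the exact density cancels, leaving the difference between the approximate functional on its own density and the exact functional on the exact density.

$$\Delta E_{\mathrm{F}}+\Delta E_{\mathrm{D}}=\left(E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{exact}}]-E_{\mathrm{XC}}^{\mathrm{exact}}[n_{\mathrm{exact}}]\right)+\left(E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{approx}}]-E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{exact}}]\right)=E_{\mathrm{XC}}^{\mathrm{approx}}[n_{\mathrm{approx}}]-E_{\mathrm{XC}}^{\mathrm{exact}}[n_{\mathrm{exact}}]$$

Read this way, replacing the approximate density with a better one leaves the functional-driven term untouched and acts only on the density-driven term.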

Independent of the specific form and parametrization, standard approximate XC functionals still deviate from the piecewise-linear behavior of the exact functional for fractional charges⁶⁵, causing excess charge delocalization and resulting in incorrect densities^65,66. For certain systems, the density-driven error thus becomes the dominant contributor to the total error^59,68. This error can be understood by considering that the classical electrostatic repulsion term that is part of F[n] in Eq. (1) contains a self-interaction contribution due to each electron interacting with itself^45,67. While this self-interaction contribution should, in theory, be compensated by the XC energy, approximate XC functionals contain substantial local components that prevent them from quantitatively removing electron self-interactions. As a result, the electron density tends to over-delocalize in order to minimize the many-electron self-interactions^48,69,70, leading to fractional charges that underestimate the energy predicted by the piecewise-linear behavior of the exact functional^65,71.

Using a more accurate density can mitigate errors due to the over-delocalization of the electron density^56,60,72. However, obtaining an accurate density from wavefunction theories, such as Møller−Plesset perturbation theory and coupled-cluster theory, is computationally significantly more expensive than the corresponding DFT calculations. An approximate, yet efficient, approach to reducing density-driven errors in DFT calculations consists in using the Hartree−Fock density, n^HF(r), because, by construction, it does not suffer from either electron over-delocalization or self-interaction errors^{56,59,60,61,63}. The resulting density-corrected DFT (DC-DFT) energy can then be written as:

$$E^{\text{DC-DFT}}\approx E^{\mathrm{HF}}+\left(E_{\mathrm{XC}}^{\mathrm{approx}}\left[n^{\mathrm{HF}}\right]-E_{\mathrm{X}}^{\mathrm{HF}}\right)$$

(3)

The occupied Hartree−Fock (HF) orbitals are used here in place of those from a self-consistent calculation with the approximated functional. If $E_{\mathrm{XC}}^{\mathrm{approx}}$ is not fitted to any bonded system, then neither is Eq. (3). Equation (3) takes advantage of the understood overall cancellation between semilocal approximations to the exchange and correlation energies; there is no corresponding overall cancellation in the potentials or functional derivatives. In meta-GGA functionals, the exchange-correlation energy depends explicitly not only on the electron density but also on the non-interacting kinetic energy density, and both these ingredients differ from HF to Kohn−Sham (KS) theory, but, importantly, for a meta-GGA functional like SCAN both the HF and the KS kinetic energy densities can be used to recognize iso-orbital and uniform density limits and to interpolate between them. In extensive molecular tests, SCAN evaluated on the Hartree−Fock density was found on average to be more accurate than self-consistent SCAN, and even more accurate than all but a few hybrid functionals⁷³. It is very possible for a density functional to yield accurate energies on physical densities and yet have an inaccurate functional derivative and thus an inaccurate self-consistent electron density, because the functional derivative yields the response of the functional to an arbitrary (and not necessarily physical) small density variation. For example, a self-consistent semilocal functional like SCAN cannot bind a full extra electron to an isolated neutral atom, but using the Hartree−Fock density for the negative ion yields an accurate electron affinity from such functionals⁷⁴. It should be noted that density correction also has some limitations: (1) It can only correct part of the error of an approximate density functional. (2) Because it is not self-consistent, it cannot provide Hellmann−Feynman forces on the nuclei. (3) Going beyond the level of the Hartree−Fock approximation can incur not only the cost of the higher-level density but also the cost of inverting it to find an effective one-electron potential⁷⁵.

Equation (3) can be used to calculate individual n-body energies, ϵ^nB, from one-body (1B) to N-body (NB), which enter the many-body expansion (MBE) of the energy for a system containing N (atomic or molecular) monomers⁷⁶:

$$\begin{aligned}E_{N}(1,\ldots ,N) ={}& \sum_{i=1}^{N}\epsilon^{\mathrm{1B}}(i)+\sum_{i<j}^{N}\epsilon^{\mathrm{2B}}(i,j)\\ &+\sum_{i<j<k}^{N}\epsilon^{\mathrm{3B}}(i,j,k)+\cdots +\epsilon^{\mathrm{NB}}(1,\ldots ,N),\end{aligned}$$

(4)

In the case of water, ϵ^1B(i) in Eq. (4) corresponds to the distortion energy of the ith water molecule in the system from the equilibrium geometry of the corresponding free molecule, and all higher-order n-body energies, ϵ^nB(2, …, n), can be calculated recursively from the lower-order terms⁷⁷. The data-driven many-body formalism originally introduced with the MB-pol^78,79,80,81 potential energy function (PEF) for water proceeds from monomer to dimer to trimer, successively calculating the 1B, 2B, and 3B energies over a wide distribution of molecular configurations and then fitting them to analytic potential energy functions. The 4B and higher-order terms of Eq. (4) are replaced by a classical many-body polarization model. Once the PEFs are known, fast but very accurate classical molecular dynamics calculations for finite temperature or fully relaxed geometry optimizations for zero temperature can be performed. It should be noted that for the equilibrium geometries of molecules and solids self-consistent SCAN already performs quite well^41,42. Here, we show that DC-SCAN provides an accurate but inexpensive alternative to the accurate but expensive CCSD(T) parametrization of a data-driven many-body PEF for water, suggesting many possible future applications of DC-SCAN that we are beginning to explore.
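The recursive construction of the n-body energies in Eq. (4) can be illustrated with a minimal Python sketch. It is not the workflow used for MB-pol or MB-SCAN(DC): the function names, the one-dimensional "monomers", and the toy energy model are all hypothetical and chosen only so that the result can be checked by hand. In particular, the toy 1-body term is a plain monomer energy rather than a distortion energy relative to an equilibrium geometry.

```python
from itertools import combinations


def many_body_energies(monomers, energy, max_order=None):
    """Total n-body energies of Eq. (4), obtained by recursive subtraction.

    monomers: list of monomer descriptors (any objects `energy` understands).
    energy:   callable returning the total energy of a list of monomers.
    Returns a dict {n: sum of all n-body terms} for n = 1..max_order.
    """
    n_mon = len(monomers)
    max_order = n_mon if max_order is None else max_order
    eps = {}     # (i, j, ...) -> n-body energy of that subcluster
    totals = {}
    for order in range(1, max_order + 1):
        totals[order] = 0.0
        for subset in combinations(range(n_mon), order):
            value = energy([monomers[i] for i in subset])
            # Remove every lower-order term contained in this subcluster.
            for k in range(1, order):
                for sub in combinations(subset, k):
                    value -= eps[sub]
            eps[subset] = value
            totals[order] += value
    return totals


def toy_energy(cluster):
    """Hypothetical energy model with explicit 1-, 2-, and 3-body terms only."""
    e = sum(0.1 * x * x for x in cluster)
    e += sum(-1.0 / abs(a - b) for a, b in combinations(cluster, 2))
    e += sum(0.01 * a * b * c for a, b, c in combinations(cluster, 3))
    return e


positions = [0.0, 1.0, 2.5, 4.0]   # four fictitious "monomers" on a line
totals = many_body_energies(positions, toy_energy)
```

Because the toy model contains no explicit 4-body term, `totals[4]` vanishes to within rounding and the sum of all entries recovers `toy_energy(positions)`. The number of subclusters grows combinatorially with the order, which is one reason a truncated expansion is attractive in practice.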

2-body interactions in water

Our analysis of the ability of the SCAN functional to represent the interactions between water molecules begins with the comparison in Fig. 1 between the total 2-body (2B) energies (second term on the right-hand side of Eq. (4)) calculated for the low-energy isomers of the water hexamer using the (self-consistent) SCAN and SCAN0 functionals, and the corresponding (density-corrected) DC-SCAN and DC-SCAN0 functionals. Also shown for reference are the CCSD(T)/CBS values reported in ref. ⁷⁸. It should be noted that the hexamer holds a special place along the path that connects individual water molecules in the gas phase to liquid water since it is the smallest water cluster for which the low-energy isomers are characterized by three-dimensional arrangements that are reminiscent of the three-dimensional structure of the hydrogen-bond network found in the liquid phase. In addition, the large number of low-energy isomers makes the hexamer cluster the prototypical system to assess the ability of different water models to correctly reproduce many-body interactions in water¹⁴. Figure 1 shows that the SCAN functional displays fairly large errors compared to the reference values, with a mean unsigned error (MUE) of 4.59 kcal/mol. In contrast, DC-SCAN predicts 2-body energies that are in quantitative agreement with the CCSD(T)/CBS values, resulting in a MUE of only 0.08 kcal/mol. By effectively eliminating the errors in the representation of 2-body interactions, the application of the density correction thus addresses the main shortcoming of the SCAN functional applied to water⁵⁰. Figure 1 also shows that SCAN0, the hybrid variant of SCAN with a 25% fraction of Hartree−Fock exchange, only provides a minor improvement in the representation of the 2-body energies, resulting in a MUE of 4.48 kcal/mol.
It should be noted that SCAN0 provides a slightly more accurate description of the three-dimensional isomers (i.e., prism and cage isomers) but a worse description of the planar isomers (i.e., cyclic isomers) compared to SCAN. Importantly, the density correction applied to SCAN0 does not result in a similarly dramatic improvement as found for SCAN, with DC-SCAN0 still displaying a relatively large MUE of 2.26 kcal/mol.
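Error statistics of the kind quoted above can be computed in a few lines. The following sketch returns both the mean and the largest unsigned deviation of a set of predicted energies from their reference values; the function name and the three energies in the example are made up for illustration and are not taken from Fig. 1.

```python
def unsigned_error_stats(predicted, reference):
    """Return (mean unsigned error, largest unsigned error) in the input units."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    errors = [abs(p - r) for p, r in zip(predicted, reference)]
    return sum(errors) / len(errors), max(errors)


# Illustrative numbers only (kcal/mol), not data from the article.
model_energies = [-45.0, -44.5, -46.2]
reference_energies = [-45.1, -44.9, -46.0]
mean_err, max_err = unsigned_error_stats(model_energies, reference_energies)
```

Reporting the two numbers side by side helps distinguish a uniformly small error from one that is small on average but large for a single isomer.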

**Fig. 1: 2B energies of the water hexamer isomers.**

The analysis of the effects associated with various dispersion corrections, which is reported in Supplementary Fig. 1, indicates that the addition of any form of dispersion energy worsens the accuracy of both SCAN and DC-SCAN. Specifically, all the dispersion-corrected SCAN functionals considered in our analysis, SCAN-D3(0), SCAN-D3(BJ), and SCAN-VV10, are found to overbind the hexamer isomers, which results in larger deviations from the CCSD(T)/CBS values compared to their dispersion-free counterparts. Similarly, inclusion of larger fractions (α) of Hartree−Fock exchange deteriorates the ability of hybrid SCANα and DC-SCANα functionals to reproduce the interaction energies of the hexamer isomers, resulting in significant overbinding (Supplementary Fig. 2). It is also worth noting that neither DC-SCAN + dispersion nor DC-SCANα (where α is the fraction of HF exchange) performs as well as DC-SCAN. This suggests that the addition of the dispersion correction and/or a fraction of Hartree−Fock exchange actually worsens the functional-driven error of SCAN for water.

To further investigate the impact of the density correction on the energetics of various water systems, in Fig. 2, we analyze the interaction energies of dimers extracted from a classical MD simulation of liquid water carried out in the isobaric-isothermal (NPT) ensemble at ambient conditions using the MB-pol PEF^79,80,81. (The interaction energy is the binding energy without its 1-body contribution.) For this analysis, we consider dimers with an oxygen−oxygen (O ⋯ O) distance shorter than 5.5 Å, which approximately corresponds to the radius of the first two solvation shells in liquid water^81,82,83. It should be noted that, by definition, the interaction energy of a water dimer exactly corresponds to the associated 2-body energy. Figure 2a shows the errors, ΔE, in 2-body energies calculated with SCAN and DC-SCAN relative to the corresponding reference values calculated at the CCSD(T)-F12b level of theory. As expected, DC-SCAN exhibits significantly smaller errors compared to SCAN for all dimers, independent of the O ⋯ O distance. Specifically, the maximum error associated with DC-SCAN is −0.47 kcal/mol, which must be compared with a maximum error of −1.38 kcal/mol calculated with SCAN. It is also important to analyze the errors as a function of the O ⋯ O distance since they directly affect the ability of the SCAN and DC-SCAN functionals to correctly predict the cohesive energy, and thus the structure, of liquid water. Figure 2a shows that the 2-body energies calculated with SCAN only start to approach the CCSD(T)-F12b values at ~4.5 Å, with a MUE of 0.25 kcal/mol associated with dimers with an O ⋯ O distance up to 4.5 Å, and a MUE of 0.16 kcal/mol for all dimers up to an O ⋯ O distance of 5.5 Å. 
In contrast, the 2-body energies calculated with DC-SCAN converge to the CCSD(T)-F12b values at 3.5 Å, with a MUE of 0.09 kcal/mol obtained for dimers with an O ⋯ O distance up to 3.5 Å, which decreases to 0.07 kcal/mol when all dimers up to an O ⋯ O distance of 5.5 Å are considered.
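Selecting dimers by O ⋯ O distance from a periodic simulation frame requires the minimum-image convention. A minimal sketch is given below; it assumes a cubic box, a single frame, coordinates in Å, and a cutoff no larger than half the box length. The function name and the coordinates are hypothetical, and this is not the script used to generate Fig. 2.

```python
import math
from itertools import combinations


def oxygen_pairs_within_cutoff(oxygens, box_length, cutoff=5.5):
    """Return (i, j, distance) for oxygen pairs closer than `cutoff` (in Å).

    oxygens:    list of (x, y, z) oxygen positions in a cubic periodic box.
    box_length: edge length of the box for this frame.
    """
    pairs = []
    for i, j in combinations(range(len(oxygens)), 2):
        d2 = 0.0
        for a, b in zip(oxygens[i], oxygens[j]):
            delta = a - b
            # Minimum image: fold the separation back into [-L/2, L/2].
            delta -= box_length * round(delta / box_length)
            d2 += delta * delta
        dist = math.sqrt(d2)
        if dist < cutoff:
            pairs.append((i, j, dist))
    return pairs


# Made-up oxygen positions in a 20 Å box; atoms 0 and 2 are neighbors
# only through the periodic boundary.
oxygens = [(0.5, 0.0, 0.0), (3.3, 0.0, 0.0), (19.5, 0.0, 0.0), (10.0, 10.0, 10.0)]
pairs = oxygen_pairs_within_cutoff(oxygens, box_length=20.0)
```

In an NPT trajectory the box length fluctuates, so it would have to be read per frame. The double loop scales quadratically with the number of molecules, which is acceptable for a few hundred waters but would call for a cell list in larger systems.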

Figure 2b shows a comparison between the interaction energies calculated at the CCSD(T)-F12b, SCAN, and DC-SCAN levels of theory for an unrelaxed scan of the water dimer along the O ⋯ H distance, starting from the dimer optimized geometry. This comparison provides further evidence for DC-SCAN predicting 2-body energies in close agreement with the CCSD(T)-F12b values. In contrast, SCAN systematically overbinds the water dimer, which is particularly evident in the minimum-energy region, i.e., r(O ⋯ H) ~ 1.9 Å. It is worth noting that SCAN gives slightly better agreement with CCSD(T)-F12b at long range, before asymptotically converging to DC-SCAN results. The reason behind SCAN giving slightly better results than DC-SCAN at long range is the cancellation in SCAN of the density-driven errors (which overbind) with the lack of dispersion.

**Fig. 2: Comparison of dimer interaction energies.**

Binding energies of water clusters

It is known that the binding energies of the low-energy isomers of the water hexamer lie within a few kcal/mol of each other⁸⁴ while the two most stable isomers (D₂d and S₄) of the water octamer are degenerate⁸⁵. Table 1 shows that the SCAN functional predicts significantly different binding energies relative to the CCSD(T)-F12b results of ref. ⁸⁶ for both sets of clusters, with an overall MUE of 6.54 kcal/mol. Interestingly, the error per water molecule is higher for the three-dimensional isomers (prism and cage isomers) than for the planar isomers of the water hexamer, and increases for the two isoenergetic isomers of the octamer.

Table 1 Errors (in kcal/mol) in binding energies relative to the CCSD(T)-F12b values of ref. ⁸⁶ calculated for representative isomers of the water hexamer and octamer using SCAN, FLOSIC-SCAN (from ref. ⁵⁰) and DC-SCAN. The values in parentheses correspond to the errors per molecule. The last row reports the corresponding MUEs and MUEs per molecule.


Table 1 also includes the binding energies calculated in ref. ⁵⁰ with SCAN corrected for the self-interaction energy using the Fermi−Löwdin orbital self-interaction correction (FLOSIC) scheme⁸⁷. Relative to SCAN, FLOSIC-SCAN is able to reduce the errors in the binding energies of all water clusters analyzed in Table 1, resulting in a MUE of 1.65 kcal/mol. The error per water molecule remains nearly constant for the prism, cage, and book-2 isomers of the hexamer but increases for the cyclic boat-2 isomer. The FLOSIC-SCAN error per molecule is smaller for the two isoenergetic isomers of the water octamer. The comparisons reported in Table 1 show that DC-SCAN performs better than FLOSIC-SCAN, with an overall MUE of 0.69 kcal/mol relative to CCSD(T)-F12b. As found with FLOSIC-SCAN, also in the case of DC-SCAN the error per molecule remains constant for the prism, cage, and book-2 isomers of the water hexamer but decreases for the cyclic boat-2 isomer. However, contrary to FLOSIC-SCAN, DC-SCAN predicts a slightly larger error per molecule for the two isoenergetic octamer isomers than for the prism, cage, and book-2 isomers of the hexamer. The overall MUE per molecule of 0.04 kcal/mol indicates that the binding energies predicted by DC-SCAN are in excellent agreement with the CCSD(T)-F12b reference values for all clusters analyzed in Table 1.

Many-body interactions in water

Although the results presented in Fig. 1 and Table 1 demonstrate that, by correcting density-driven errors, DC-SCAN is able to accurately reproduce the interaction energies of small water clusters, the analyses of the previous sections do not provide any direct information about the ability of DC-SCAN to correctly describe many-body effects in water. The competition and interplay of many-body effects have been shown to play a critical role in determining structural, thermodynamic, and dynamical properties of aqueous systems, from small clusters to bulk solutions and interfaces^{84,88,89,90,91}.

To investigate the impact of the density correction on individual n-body (nB) contributions to the interactions between water molecules, many-body decomposition analyses were carried out for the two isoenergetic isomers of the octamer. Errors relative to the CCSD(T)-F12b reference values are shown in Fig. 3 for each n-body energy calculated with the SCAN and DC-SCAN functionals. This analysis provides further evidence for the density-driven errors in the SCAN functional primarily affecting 2-body energies, with SCAN displaying large negative deviations from the CCSD(T)-F12b values, which confirms the tendency of the SCAN functional to overbind water clusters⁵⁰. After application of the density correction, the errors in the 2-body energies reduce to only ~0.3 kcal/mol for calculations carried out with the DC-SCAN functional. Importantly, Fig. 3 shows that the impact of the density correction is minimal for all nB energies with n > 2.

**Fig. 3: Errors in nB interaction energy of water octamers.**

After demonstrating that, by removing density-driven errors, the DC-SCAN functional effectively provides chemical accuracy for binding, interaction, and many-body energies of various water clusters, the 2B, 3B, and 4B energies, as well as the total interaction energies of the low-energy isomers of the water hexamer calculated using the SCAN and DC-SCAN functionals are compared in Fig. 4 with the analogous values calculated with the corresponding MB-SCAN and MB-SCAN(DC) potential energy functions (PEFs) described in the “Methods” section. Also shown for reference are the CCSD(T)/CBS values reported in ref. ⁷⁸. The errors relative to the reference CCSD(T)/CBS values for both many-body and interaction energies which are associated with the SCAN and DC-SCAN functionals and the corresponding MB-SCAN and MB-SCAN(DC) PEFs are shown in Supplementary Fig. 7. As already discussed in the case of the octamer isomers, density-driven errors are most pronounced at the 2B level, with MUEs of 4.59 and 0.08 kcal/mol associated with SCAN and DC-SCAN, respectively. The MUEs reduce to 0.59 and 0.38 kcal/mol at the 3-body level. The comparisons shown in Fig. 4a, b demonstrate that both the MB-SCAN and MB-SCAN(DC) PEFs are able to quantitatively reproduce the 2-body and 3-body energies calculated ab initio with the corresponding SCAN and DC-SCAN functionals. Since, by construction, nB energies with n > 3 in the MB-SCAN and MB-SCAN(DC) PEFs are entirely represented by a classical polarization term, the errors associated with these energies are not strictly related to those calculated ab initio with the corresponding SCAN and DC-SCAN functionals. In this regard, Fig. 4c shows that the 4-body energies predicted by the MB-SCAN and MB-SCAN(DC) PEFs tend to underbind the hexamer isomers relative to CCSD(T)/CBS, whereas the 4-body energies calculated with the SCAN and DC-SCAN functionals tend to overbind the same clusters. 
However, it should be noted that in both cases the 4-body errors are small for all eight isomers, with SCAN and MB-SCAN providing MUEs of 0.17 and 0.35 kcal/mol, respectively. The corresponding MUEs for DC-SCAN and MB-SCAN(DC) are 0.16 and 0.21 kcal/mol, respectively.

**Fig. 4: Many-body and interaction energies for the isomers of the water hexamer.**

The total interaction energies of the eight low-energy hexamer isomers calculated with the SCAN and DC-SCAN functionals, and the corresponding MB-SCAN and MB-SCAN(DC) PEFs are compared with the CCSD(T)/CBS reference values in Fig. 4d. Both DC-SCAN and MB-SCAN(DC) provide excellent agreement with the CCSD(T)/CBS reference values, displaying MUEs of 0.53 and 0.36 kcal/mol, respectively. In contrast, suffering from large density-driven errors at the 2-body level, SCAN and MB-SCAN systematically overbind all eight isomers.

Structural and dynamical properties of liquid water

The last question that remains to be addressed is whether the high accuracy displayed by the MB-SCAN(DC) PEF in reproducing the multidimensional energy landscape of water clusters is sufficient to correctly predict the properties of liquid water. To this end, classical MD simulations for a periodic box containing 256 molecules were carried out with the MB-SCAN(DC) PEF in the NPT ensemble at 1 atm and various temperatures between T = 250 K and T = 350 K. The lengths of the MD trajectories were 2.6 ns for T < 298 K and 2 ns for T ≥ 298 K. Figure 5 shows that the MB-SCAN(DC) PEF correctly reproduces the temperature-dependence of the density of liquid water at 1 atm, underestimating the experimental values by only ~0.01 g/cm³ at all temperatures. At 298 K, MB-SCAN(DC) predicts a density of 0.986 g/cm³, which is in close agreement with the experimental value of 0.997 g/cm³. The temperature of maximum density calculated by fitting a fifth-order polynomial to the MB-SCAN(DC) results is 280 K, in nearly quantitative agreement with the experimental value of 277 K. The MB-SCAN(DC) results are compared in Fig. 5 with those reported in the literature from MD simulations with SCAN⁴³ (SCAN-AIMD) as well as with NNPs trained on SCAN⁹² (SCAN-NNP) and SCAN0⁵² (SCAN0-NNP) data. These comparisons demonstrate that the MB-SCAN(DC) PEF predicts a liquid density at 330 K which is in significantly closer agreement with experiment than the value calculated in ref. ⁴³ from ab initio MD simulations with SCAN.
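Locating a temperature of maximum density from a polynomial fit, as described above, can be sketched with the standard library alone. The example below fits a fifth-order polynomial by least squares and takes the maximum of the fitted curve on a fine grid. All function names are hypothetical, the density values are a synthetic parabola peaked at 277 K rather than the MB-SCAN(DC) results, and the fitting details used in the article are not specified beyond the polynomial order.

```python
def polyfit_least_squares(xs, ys, degree):
    """Coefficients c[0] + c[1]*x + ... from the least-squares normal equations."""
    m = degree + 1
    a = [[sum(x ** (j + k) for x in xs) for k in range(m)] for j in range(m)]
    b = [sum(y * x ** j for x, y in zip(xs, ys)) for j in range(m)]
    # Gaussian elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, m):
            factor = a[row][col] / a[col][col]
            for k in range(col, m):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    coeffs = [0.0] * m
    for row in reversed(range(m)):
        tail = sum(a[row][k] * coeffs[k] for k in range(row + 1, m))
        coeffs[row] = (b[row] - tail) / a[row][row]
    return coeffs


def temperature_of_maximum_density(temps, densities, degree=5, grid_points=10001):
    """Temperature at which the fitted polynomial is largest within the data range."""
    t_mid = 0.5 * (min(temps) + max(temps))
    t_half = 0.5 * (max(temps) - min(temps))
    xs = [(t - t_mid) / t_half for t in temps]   # rescale to [-1, 1] for conditioning
    coeffs = polyfit_least_squares(xs, densities, degree)

    def fitted(x):
        return sum(c * x ** k for k, c in enumerate(coeffs))

    grid = (-1.0 + 2.0 * i / (grid_points - 1) for i in range(grid_points))
    return t_mid + t_half * max(grid, key=fitted)


# Synthetic data: a parabola with its maximum at 277 K (not simulation results).
temps = [250.0 + 10.0 * i for i in range(11)]
densities = [1.0 - 5.0e-6 * (t - 277.0) ** 2 for t in temps]
tmd = temperature_of_maximum_density(temps, densities)
```

The grid search only looks inside the sampled temperature range, so a maximum reported at either end of the range would indicate that the data do not bracket the true maximum. With noisy simulation data, the uncertainty of the fitted maximum would also need to be estimated, which this sketch does not attempt.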

Particularly interesting is the comparison of the MB-SCAN(DC) PEF with the two NNP models trained on SCAN⁹² and SCAN0⁵² data. Figure 5 shows that, despite being trained on SCAN data, the SCAN-NNP model is unable to correctly reproduce the density value calculated from ab initio MD simulations with SCAN at 330 K. A closer agreement between the density values calculated from the SCAN-AIMD and SCAN-NNP simulations was obtained after applying a reweighting procedure⁹². In addition, the SCAN-NNP model predicts a more pronounced temperature-dependence of the liquid density compared to experiment, overestimating both the value and the temperature of the density maximum⁹². A slightly more accurate prediction of the liquid density at 330 K is provided by the SCAN0-NNP model⁵², although no ab initio MD simulations with SCAN0 have been reported to compare with. Given the increased popularity of NNPs trained on DFT data, we believe that the differences between SCAN-AIMD and SCAN-NNP results deserve further investigation to assess the ability of NNPs to faithfully represent the target DFT models. In this context, it should be noted that in a previous study⁵¹ we found that MD simulations carried out with the MB-SCAN PEF predict a liquid density of 1.14 g/cm³ at 298 K, which is significantly different from the value of 1.05 g/cm³ obtained from ab initio simulations with SCAN⁴³. This difference is not due to the different size of the water systems studied in the two sets of simulations (256 molecules for MB-SCAN⁵¹ and 64 molecules for SCAN-AIMD⁴³). An explanation for this difference, proposed in ref. ⁵¹, considers that any PEF rigorously derived from the many-body expansion of the energy (MBE) is strictly faithful to its parent quantum-mechanical method only when the latter does not display spurious delocalization of the electron density which affects the convergence of the MBE in an unphysical manner. We believe that the present analysis of the SCAN and DC-SCAN functionals, along with the corresponding MB-SCAN and MB-SCAN(DC) PEFs, provides support for the interpretation presented in ref. ⁵¹ that density-driven errors are responsible for the differences between MD simulations carried out with the SCAN functional and the MB-SCAN PEF. The temperature-dependence of the enthalpy of vaporization and isothermal compressibility calculated from classical MD simulations with MB-SCAN(DC) are shown in Supplementary Figs. 11 and 12.

Figure 6 compares the oxygen−oxygen (g_OO) radial distribution function (RDF) calculated from MD simulations carried out with the MB-SCAN and MB-SCAN(DC) PEFs at 298 K and 1 atm with the corresponding experimental data^82,83. The MB-SCAN(DC) PEF provides excellent agreement with the experimental RDF, slightly overestimating the height of the first peak while underestimating the height of the “valley” between the first two peaks. As shown in Supplementary Fig. 10, these small differences can be attributed to the neglect of nuclear quantum effects in classical MD simulations. The inclusion of nuclear quantum effects in path-integral molecular dynamics (PIMD) simulations with MB-SCAN(DC) indeed slightly lowers the height of the first peak and raises the “valley” between 3.2 and 4.0 Å, similarly to what was previously observed in the g_OO calculated with the MB-pol PEF⁸¹. As expected, the inclusion of nuclear quantum effects also improves the agreement with the experimental oxygen−hydrogen and hydrogen−hydrogen RDFs (Supplementary Fig. 10). In contrast, as already discussed in ref. ⁵¹, the MB-SCAN PEF predicts a denser and more unstructured liquid. Based on the analyses discussed above, the differences between the MB-SCAN and MB-SCAN(DC) oxygen−oxygen RDFs can be unambiguously attributed to density-driven errors that affect SCAN many-body energies, particularly at the 2-body level, which are used to train the corresponding MB-SCAN PEF.
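For readers unfamiliar with how a g_OO curve is obtained from a trajectory, the following is a minimal single-frame sketch for identical particles in a cubic periodic box. It is not the analysis code used in this study: the function name, the cubic-box assumption, and the random test coordinates are all illustrative, and a production calculation would average over many frames.

```python
import numpy as np

def radial_distribution(positions, box_length, r_max, n_bins=100):
    """g(r) for identical particles in a cubic periodic box (single frame)."""
    positions = np.asarray(positions, dtype=float)
    n = len(positions)
    if r_max > box_length / 2:
        raise ValueError("r_max must not exceed half the box length")
    # All pair separation vectors, wrapped with the minimum-image convention.
    delta = positions[:, None, :] - positions[None, :, :]
    delta -= box_length * np.round(delta / box_length)
    dist = np.linalg.norm(delta, axis=-1)[np.triu_indices(n, k=1)]
    counts, edges = np.histogram(dist, bins=n_bins, range=(0.0, r_max))
    # Normalise by the pair count expected for an ideal gas at the same density.
    shell_volumes = 4.0 / 3.0 * np.pi * (edges[1:] ** 3 - edges[:-1] ** 3)
    ideal_pairs = 0.5 * n * (n - 1) / box_length ** 3 * shell_volumes
    r = 0.5 * (edges[1:] + edges[:-1])
    return r, counts / ideal_pairs

# Uniformly random points (hypothetical data) should give g(r) close to 1.
rng = np.random.default_rng(0)
positions = rng.uniform(0.0, 10.0, size=(500, 3))
r, g = radial_distribution(positions, box_length=10.0, r_max=5.0, n_bins=10)
```

For liquid water, `positions` would hold the oxygen coordinates of one frame, and the first peak and the following “valley” discussed above would appear as deviations of g(r) from 1.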

To provide further insights into the ability of the MB-SCAN(DC) PEF to describe the properties of liquid water, we also calculated the temperature-dependence of the self-diffusion coefficient, D, from a 500-ps-long MD simulation carried out in the microcanonical (NVE) ensemble for a periodic box containing 256 molecules using the equilibrium density determined from the corresponding NPT simulations. D was calculated from the velocity autocorrelation function of the center of mass of each water molecule according to

$$D=\frac{1}{3}\int_{0}^{\infty }\langle {v}_{i}(t){v}_{i}(0)\rangle \,{\rm d}t, \tag{5}$$

where v_i is the center-of-mass velocity of the ith water molecule. Figure 7 shows that the MB-SCAN(DC) PEF is able to correctly predict the diffusion coefficient between 250 and 350 K. In particular, at 298 K, the diffusion coefficient predicted by the MB-SCAN(DC) PEF is 0.212 Å²/ps, which is in excellent agreement with the experimental value of 0.229 Å²/ps. This is in contrast to the value of 0.106 Å²/ps obtained in ref. ⁹³ from MD simulations carried out with an adaptive neural-network model trained on SCAN data (SCAN-NNP in Fig. 7). Unlike the MB-SCAN(DC) PEF, the SCAN-NNP model severely underestimates the diffusion coefficient of liquid water over the entire temperature range, although the agreement with experiment apparently improves as the temperature decreases. It should be noted that the larger error bars associated with the values of the diffusion coefficient at higher temperatures are due to larger fluctuations in the molecular velocities.
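The Green−Kubo relation above can be evaluated numerically from stored center-of-mass velocities. The sketch below is one possible discretization, not the authors' implementation: the array layout, the truncation of the integral at `max_lag` frames, and the trapezoidal rule are assumptions made for illustration.

```python
import numpy as np

def diffusion_from_vacf(velocities, dt, max_lag):
    """Green-Kubo estimate D = (1/3) * integral of <v(t).v(0)> dt.

    velocities: array (n_frames, n_molecules, 3) of centre-of-mass velocities.
    dt: time between frames; max_lag: number of lags kept in the integral.
    """
    velocities = np.asarray(velocities, dtype=float)
    n_frames = velocities.shape[0]
    if not 0 < max_lag < n_frames:
        raise ValueError("max_lag must be between 1 and n_frames - 1")
    vacf = np.empty(max_lag + 1)
    for lag in range(max_lag + 1):
        # Average the dot product over all time origins and all molecules.
        dots = np.sum(velocities[: n_frames - lag] * velocities[lag:], axis=-1)
        vacf[lag] = dots.mean()
    # Trapezoidal rule on the truncated time integral.
    integral = dt * (vacf.sum() - 0.5 * (vacf[0] + vacf[-1]))
    return integral / 3.0, vacf

# Hypothetical check: constant unit velocity gives vacf = 1 at every lag.
velocities = np.zeros((10, 4, 3))
velocities[..., 0] = 1.0
d_coeff, vacf = diffusion_from_vacf(velocities, dt=0.5, max_lag=4)
```

With velocities in Å/ps and `dt` in ps, the result is in Å²/ps, the unit used for the values quoted above. In practice `max_lag` has to be long enough for the autocorrelation function to decay to zero.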

**Fig. 7: Self-diffusion of liquid water.**

Discussion

An ab initio representation of water across all the different phases has been an elusive goal since the early days of computer simulations^94,95,96,97. Although models based on correlated wavefunction theories (WFT) can, in principle, provide such a long-sought-after ab initio representation of water without resorting to ad hoc approximations or empirical parameterizations, the associated computational cost precludes the application of WFT models to systems containing more than a handful of water molecules. This effectively leaves DFT as the only viable approach to ab initio simulations of water¹³. However, it has been shown that existing XC functionals are not particularly accurate in their predictions of the properties of water^14,37, suffering from both functional-driven and density-driven errors.

In this study, we have demonstrated that the density-corrected SCAN (DC-SCAN) functional effectively removes density-driven errors from the water 2-body energies, which brings both binding and interaction energies of different water clusters very close to reference values calculated at the CCSD(T)/CBS level of theory. Although not as pronounced as for the 2-body energies, the density correction also reduces density-driven errors in all higher-body terms of the many-body expansion (MBE) of the energy calculated for water using the DC-SCAN functional, with each individual many-body term being in quantitative agreement with the corresponding CCSD(T)/CBS reference values. In this context, it should be noted that a previous study⁵⁰ found a significant but less complete improvement for water clusters (Table 1) via a self-consistent FLOSIC self-interaction correction to SCAN. However, ref. ⁵⁰ did not find evidence for a major improvement from density correction, probably because the FLOSIC density is less localized than the Hartree−Fock and exact densities are. Although it should be kept in mind that the DC-SCAN functional, like the parent SCAN functional, still suffers from functional-driven errors, which can be large for some chemical systems such as stretched H₂⁺, the analyses presented here demonstrate that these functional-driven errors are negligible for water. In the future, it would be important to test the performance of DC-SCAN for more general chemical applications. Importantly, our analyses suggest that, in principle, ab initio MD simulations with the DC-SCAN functional should be able to provide a consistently accurate description of the properties of water. However, the requirement of using the Hartree−Fock density in a non-self-consistent SCAN calculation at each MD step would make ab initio MD simulations with DC-SCAN not straightforward to implement and expensive to perform.
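The distinction between functional-driven and density-driven errors used in this discussion can be stated compactly. The following is the standard decomposition from the density-corrected DFT literature (see the Kim, Sim and Burke entries in the reference list); the notation is chosen here for illustration and may differ from that used elsewhere in the article. Writing $\tilde{E}$ for an approximate functional with self-consistent density $\tilde{n}$, and $E$ and $n$ for their exact counterparts,

$$\Delta E=\tilde{E}[\tilde{n}]-E[n]=\underbrace{\tilde{E}[n]-E[n]}_{\Delta {E}_{{\rm F}}}+\underbrace{\tilde{E}[\tilde{n}]-\tilde{E}[n]}_{\Delta {E}_{{\rm D}}},$$

where $\Delta E_{\rm F}$ is the functional-driven error and $\Delta E_{\rm D}$ is the density-driven error. In DC-SCAN the exact density is replaced by the Hartree−Fock density, which is the non-self-consistent evaluation described above:

$${E}^{{\rm DC{\text -}SCAN}}={E}^{{\rm SCAN}}[{n}^{{\rm HF}}],\qquad \Delta {E}_{{\rm D}}\approx {E}^{{\rm SCAN}}[{n}^{{\rm SCAN}}]-{E}^{{\rm SCAN}}[{n}^{{\rm HF}}].$$

Under this approximation, the error that remains in a DC-SCAN energy is essentially the functional-driven term, which the analyses above indicate is negligible for water.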

While ab initio MD simulations with DC-SCAN are currently not feasible, we have shown that the improved accuracy of the DC-SCAN functional can be exploited to develop a data-driven many-body potential energy function, the MB-SCAN(DC) PEF, which indeed provides a highly accurate representation of water, from small clusters in the gas phase to the liquid phase. MB-SCAN(DC) is rigorously derived from the DC-SCAN MBE and adopts a hybrid data-driven/physics-based scheme, where a data-driven model, which captures (short-range) quantum-mechanical interactions arising from the overlap of the electron densities of individual molecules at the 2-body and 3-body levels (e.g., Pauli repulsion, and charge transfer and penetration), is integrated with a physics-based model of many-body interactions, which is represented by classical many-body electrostatics. Importantly, we have demonstrated that the MB-SCAN(DC) PEF achieves high accuracy by quantitatively reproducing each individual term of the DC-SCAN MBE for water, providing a correct representation of both short- and long-range many-body contributions. Since the DC-SCAN functional exhibits chemical accuracy for each individual term of the MBE for water and the MB-SCAN(DC) PEF quantitatively reproduces the DC-SCAN many-body energies, the MB-SCAN(DC) PEF effectively provides the first demonstration of a DFT-based model that correctly describes the properties of water, at the computational cost of advanced polarizable force fields¹⁴. Future applications of the MB-SCAN(DC) PEF will focus on modeling the phase diagram of water, which was shown to be only qualitatively reproduced by NNPs trained on SCAN data^92,98. We expect MB-SCAN(DC) to be especially well suited to modeling the liquid/vapor equilibrium, which involves the making and breaking of hydrogen bonds.

Finally, we want to emphasize that the many-body formalism adopted by the MB-SCAN(DC) PEF for water is general and has already been used in the development of data-driven many-body PEFs for various aqueous systems^99,100 and molecular fluids^101,102 which were trained on (expensive) CCSD(T) data. It thus follows that the significantly lower computational cost associated with DC-SCAN calculations can enable the routine development of MB-SCAN(DC) PEFs for generic (small) molecules which are trained on DC-SCAN data but effectively display CCSD(T) accuracy. In this context, it should be noted that the MB-Fit software infrastructure¹⁰³ for many-body PEFs combined with the MBX many-body energy/force calculator¹⁰⁴ interfaced with i-PI¹⁰⁵ and LAMMPS¹⁰⁶ already provides a robust platform for MD simulations of generic molecules in the gas, liquid, and solid phases using MB-SCAN(DC) PEFs.

Methods

Many-body expansion

Building upon the demonstrated accuracy of the MB-pol PEF for water^78,79,80,81 and following the same theoretical/computational approach employed in the development of DFT-based many-body PEFs^51,90,107, we used Eq. (4) to develop a data-driven many-body PEF, MB-SCAN(DC), that consistently reproduces each term of the MBE for water calculated using the DC-SCAN functional. Briefly, MB-SCAN(DC) includes explicit representations of 1B, 2B, and 3B energies, and describes all higher-order nB energy terms (n > 3) through classical many-body polarization. Specifically, ϵ^1B in Eq. (4) is represented by the Partridge−Schwenke PEF¹⁰⁸, while ϵ^2B and ϵ^3B are represented by terms describing permanent electrostatics, dispersion energy, and induction, which are combined with short-range permutationally invariant polynomials (PIPs)¹⁰⁹ fitted to reproduce 2B and 3B energies calculated with DC-SCAN for the same training sets of water dimers and trimers used in the development of MB-pol^79,80. A detailed description of the theoretical and computational framework adopted in the development of data-driven many-body PEFs for water can be found in the original references^51,79,80,90,107. It should be noted that, since our many-body PEFs directly target the underlying molecular interactions, differences in the representation of the 1-body (1B) term of Eq. (4) have been found to be negligible for modeling the properties of liquid water¹⁰⁷ and the air/water interface¹¹⁰.
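The many-body terms referred to above follow from a recursive definition: the n-body energy of a group of n monomers is its total energy minus all lower-order contributions it contains. The sketch below illustrates that bookkeeping only. `many_body_terms` and `toy_energy` are hypothetical names, the toy energy is an arbitrary algebraic function with no physical meaning, and in a real application `energy` would be a quantum-chemical calculation on the sub-cluster.

```python
from itertools import combinations

def many_body_terms(monomers, energy, max_order=3):
    """Return {n: total n-body energy} for n = 1..max_order.

    `energy` maps a tuple of monomers to the total energy of that sub-cluster.
    """
    eps = {}     # n-body contribution of each subset of monomer indices
    totals = {}  # sum of the n-body contributions at each order
    indices = range(len(monomers))
    for n in range(1, max_order + 1):
        totals[n] = 0.0
        for subset in combinations(indices, n):
            value = energy(tuple(monomers[i] for i in subset))
            # Subtract every lower-order contribution contained in this subset.
            for k in range(1, n):
                for sub in combinations(subset, k):
                    value -= eps[sub]
            eps[subset] = value
            totals[n] += value
    return totals

def toy_energy(cluster):
    """Toy model with known 1-, 2- and 3-body parts (sums of products)."""
    one = sum(cluster)
    two = sum(a * b for a, b in combinations(cluster, 2))
    three = sum(a * b * c for a, b, c in combinations(cluster, 3))
    return one + two + three

totals = many_body_terms((1.0, 2.0, 3.0, 4.0), toy_energy)
print(totals)
```

Because the toy energy contains nothing beyond 3-body products, the three totals add up to the full energy of the four-monomer cluster. For water, the remainder beyond 3B is what MB-SCAN(DC) describes through classical many-body polarization.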

All DFT calculations were performed with the aug-cc-pVQZ basis set^111,112 using the Q-Chem¹¹³ quantum chemistry package. Since the SCAN functional is particularly sensitive to the real-space grid, all SCAN and DC-SCAN calculations were performed on the highly dense Euler−Maclaurin−Lebedev (99,590) grid^114,115 (58,410 points per atom). In this regard, the results of a sensitivity analysis reported in Supplementary Table S1 suggest that the SG2 grid¹¹⁶ (~8000 points per atom) should also be sufficient to converge SCAN calculations. If only smaller grids are available, we recommend using r²SCAN¹¹⁷, which often achieves an accuracy similar to SCAN. Single-point energy calculations using explicitly correlated coupled cluster, CCSD(T)-F12b, theory¹¹⁸ were performed in the CBS limit by extrapolating^119,120 the energy values obtained with the cc-pVTZ-F12 and cc-pVQZ-F12 basis sets along with associated auxiliary and complementary auxiliary (CABS) basis sets^121,122 using the ORCA quantum chemistry package¹²³.

Data availability

All data generated and analyzed for this study are publicly available in this repository on GitHub: https://github.com/paesanilab/Data_Repository/tree/main/MBSCANDC.

Code availability

The MB-SCAN and MB-SCAN(DC) PEFs are available in MBX¹⁰⁴, and can be used in MD simulations with LAMMPS¹⁰⁶ and i-PI¹⁰⁵. All computer codes used in the analysis presented in this study are available from the authors upon request.

References

Gallo, P. et al. Water: a tale of two liquids. Chem. Rev. 116, 7463–7500 (2016).
Ball, P. Water as an active constituent in cell biology. Chem. Rev. 108, 74–108 (2008).
Franks, F. Water: A Matrix of Life Vol. 21 (Royal Society of Chemistry, 2000).
Eisenberg, D. & Kauzmann, W. The Structure and Properties of Water (Oxford University Press, 2005).
Tanford, C. The hydrophobic effect and the organization of living matter. Science 200, 1012–1018 (1978).
Jencks, W. P. General acid-base catalysis of complex reactions in water. Chem. Rev. 72, 705–718 (1972).
Savage, P. E. Organic chemical reactions in supercritical water. Chem. Rev. 99, 603–622 (1999).
Lindström, U. M. Stereoselective organic reactions in water. Chem. Rev. 102, 2751–2772 (2002).
Akiya, N. & Savage, P. E. Roles of water for chemical reactions in high-temperature water. Chem. Rev. 102, 2725–2750 (2002).
Li, C.-J. & Chen, L. Organic chemistry in water. Chem. Soc. Rev. 35, 68–82 (2006).
Simon, M.-O. & Li, C.-J. Green chemistry oriented organic synthesis in water. Chem. Soc. Rev. 41, 1415–1427 (2012).
Vega, C. & Abascal, J. L. Simulating water with rigid non-polarizable models: a general perspective. Phys. Chem. Chem. Phys. 13, 19663–19688 (2011).
Hassanali, A. A., Cuny, J., Verdolino, V. & Parrinello, M. Aqueous solutions: state of the art in ab initio molecular dynamics. Philos. Trans. R. Soc. A 372, 20120482 (2014).
Cisneros, G. A. et al. Modeling molecular interactions in water: from pairwise to many-body potential energy functions. Chem. Rev. 116, 7501–7528 (2016).
Hohenberg, P. & Kohn, W. Inhomogeneous electron gas. Phys. Rev. 136, B864 (1964).
Kohn, W. Nobel lecture: electronic structure of matter–wave functions and density functionals. Rev. Mod. Phys. 71, 1253 (1999).
Car, R. & Parrinello, M. Unified approach for molecular dynamics and density-functional theory. Phys. Rev. Lett. 55, 2471 (1985).
Jones, R. O. Density functional theory: its origins, rise to prominence, and future. Rev. Mod. Phys. 87, 897 (2015).
Kohn, W. & Sham, L. J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133 (1965).
Ceperley, D. M. & Alder, B. J. Ground state of the electron gas by a stochastic method. Phys. Rev. Lett. 45, 566 (1980).
Perdew, J. P. & Wang, Y. Accurate and simple analytic representation of the electron-gas correlation energy. Phys. Rev. B 45, 13244 (1992).
Glötzel, D. & McMahan, A. Relativistic effects, phonons, and the isostructural transition in cesium. Phys. Rev. B 20, 3210 (1979).
Skriver, H. L. Crystal structure from one-electron theory. Phys. Rev. B 31, 1909 (1985).
Moriarty, J. A. & McMahan, A. High-pressure structural phase transitions in Na, Mg, and Al. Phys. Rev. Lett. 48, 809 (1982).
Laasonen, K., Csajka, F. & Parrinello, M. Water dimer properties in the gradient-corrected density functional theory. Chem. Phys. Lett. 194, 172–174 (1992).
Laasonen, K., Parrinello, M., Car, R., Lee, C. & Vanderbilt, D. Structures of small water clusters using gradient-corrected density functional theory. Chem. Phys. Lett. 207, 208–213 (1993).
Perdew, J. P. & Schmidt, K. Jacob’s ladder of density functional approximations for the exchange-correlation energy. In AIP Conf. Proc., Vol. 577, (eds Van Doren, V. E., Van Alsenoy, K. & Geerlings, P.) 1–20 (American Institute of Physics, 2001).
Becke, A. D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 38, 3098 (1988).
Lee, C., Yang, W. & Parr, R. G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 37, 785 (1988).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865 (1996).
Sim, F., St. Amant, A., Papai, I. & Salahub, D. R. Gaussian density functional calculations on hydrogen-bonded systems. J. Am. Chem. Soc. 114, 4391–4400 (1992).
Tuckerman, M. E. Ab initio molecular dynamics: basic concepts, current trends and novel applications. J. Phys.: Condens. Matter 14, R1297 (2002).
Santra, B., Michaelides, A. & Scheffler, M. On the accuracy of density-functional theory exchange-correlation functionals for H bonds in small water clusters: benchmarks approaching the complete basis set limit. J. Chem. Phys. 127, 184104 (2007).
Kuo, I.-F. W. et al. Liquid water from first principles: investigation of different sampling approaches. J. Phys. Chem. B 108, 12990–12998 (2004).
Grossman, J. C., Schwegler, E., Draeger, E. W., Gygi, F. & Galli, G. Towards an assessment of the accuracy of density functional theory for first principles simulations of water. J. Chem. Phys. 120, 300–311 (2004).
VandeVondele, J. et al. The influence of temperature and density functional models in ab initio molecular dynamics simulation of liquid water. J. Chem. Phys. 122, 014515 (2005).
Gillan, M. J., Alfe, D. & Michaelides, A. Perspective: how good is DFT for water? J. Chem. Phys. 144, 130901 (2016).
Wang, J., Román-Pérez, G., Soler, J. M., Artacho, E. & Fernández-Serra, M.-V. Density, structure, and dynamics of water: the effect of van der Waals interactions. J. Chem. Phys. 134, 024516 (2011).
Perdew, J. P., Kurth, S., Zupan, A. & Blaha, P. Accurate density functional with correct formal properties: a step beyond the generalized gradient approximation. Phys. Rev. Lett. 82, 2544 (1999).
Adamo, C., Ernzerhof, M. & Scuseria, G. E. The meta-GGA functional: thermochemistry with a kinetic energy density dependent exchange-correlation functional. J. Chem. Phys. 112, 2643–2649 (2000).
Sun, J., Ruzsinszky, A. & Perdew, J. P. Strongly constrained and appropriately normed semilocal density functional. Phys. Rev. Lett. 115, 036402 (2015).
Sun, J. et al. Accurate first-principles structures and energies of diversely bonded systems from an efficient density functional. Nat. Chem. 8, 831 (2016).
Chen, M. et al. Ab initio theory and modeling of water. Proc. Natl Acad. Sci. USA 114, 10846–10851 (2017).
Zheng, L. et al. Structural, electronic, and dynamical properties of liquid water by ab initio molecular dynamics based on SCAN functional within the canonical ensemble. J. Chem. Phys. 148, 164505 (2018).
Perdew, J. P. & Zunger, A. Self-interaction correction to density-functional approximations for many-electron systems. Phys. Rev. B 23, 5048 (1981).
Cohen, A. J., Mori-Sánchez, P. & Yang, W. Development of exchange-correlation functionals with minimal many-electron self-interaction error. J. Chem. Phys. 126, 191109 (2007).
Mori-Sánchez, P., Cohen, A. J. & Yang, W. Localization and delocalization errors in density functional theory and implications for band-gap prediction. Phys. Rev. Lett. 100, 146401 (2008).
Johnson, E. R., Mori-Sánchez, P., Cohen, A. J. & Yang, W. Delocalization errors in density functionals and implications for main-group thermochemistry. J. Chem. Phys. 129, 204112 (2008).
Li, C., Zheng, X., Cohen, A. J., Mori-Sánchez, P. & Yang, W. Local scaling correction for reducing delocalization error in density functional approximations. Phys. Rev. Lett. 114, 053001 (2015).
Sharkas, K. et al. Self-interaction error overbinds water clusters but cancels in structural energy differences. Proc. Natl Acad. Sci. USA 117, 11283–11288 (2020).
Lambros, E., Hu, J. & Paesani, F. Assessing the accuracy of the SCAN functional for water through a many-body analysis of the adiabatic connection formula. J. Chem. Theory Comput. 17, 3739–3749 (2021).
Zhang, C. et al. Modeling liquid water by climbing up Jacob’s ladder in density functional theory facilitated by using deep neural network potentials. J. Phys. Chem. B https://doi.org/10.1021/acs.jpcb.1c03884 (2021).
Rezac, J. & Hobza, P. Describing noncovalent interactions beyond the common approximations: how accurate is the “gold standard,” CCSD(T) at the complete basis set limit? J. Chem. Theory Comput. 9, 2151–2155 (2013).
Hui, K. & Chai, J.-D. SCAN-based hybrid and double-hybrid density functionals from models without fitted parameters. J. Chem. Phys. 144, 044114 (2016).
Gordon, R. G. & Kim, Y. S. Theory for the forces between closed-shell atoms and molecules. J. Chem. Phys. 56, 3122–3133 (1972).
Scuseria, G. E. Comparison of coupled-cluster results with a hybrid of Hartree−Fock and density functional theory. J. Chem. Phys. 97, 7528–7530 (1992).
Oliphant, N. & Bartlett, R. J. A systematic comparison of molecular properties obtained using Hartree-Fock, a hybrid Hartree-Fock density-functional-theory, and coupled-cluster methods. J. Chem. Phys. 100, 6550–6561 (1994).
Janesko, B. G. & Scuseria, G. E. Hartree-Fock orbitals significantly improve the reaction barrier heights predicted by semilocal density functionals. J. Chem. Phys. 128, 244112 (2008).
Kim, M.-C., Sim, E. & Burke, K. Understanding and reducing errors in density functional calculations. Phys. Rev. Lett. 111, 073003 (2013).
Kim, M.-C., Sim, E. & Burke, K. Ions in solution: density corrected density functional theory (DC-DFT). J. Chem. Phys. 140, 18A528 (2014).
Vuckovic, S., Song, S., Kozlowski, J., Sim, E. & Burke, K. Density functional analysis: the theory of density-corrected DFT. J. Chem. Theory Comput. 15, 6636–6646 (2019).
Jana, S., Patra, A., Śmiga, S., Constantin, L. A. & Samal, P. Insights from the density functional performance of water and water–solid interactions: SCAN in relation to other meta-GGAs. J. Chem. Phys. 153, 214116 (2020).
Song, S., Vuckovic, S., Sim, E. & Burke, K. Density sensitivity of empirical functionals. J. Phys. Chem. Lett. 12, 800–807 (2021).
Medvedev, M. G., Bushmarinov, I. S., Sun, J., Perdew, J. P. & Lyssenko, K. A. Density functional theory is straying from the path toward the exact functional. Science 355, 49–52 (2017).
Perdew, J. P., Parr, R. G., Levy, M. & Balduz Jr, J. L. Density-functional theory for fractional particle number: derivative discontinuities of the energy. Phys. Rev. Lett. 49, 1691 (1982).
Zhang, Y. & Yang, W. A challenge for density functionals: self-interaction error increases for systems with a noninteger number of electrons. J. Chem. Phys. 109, 2604–2608 (1998).
Engel, E. & Dreizler, R. M. In Density Functional Theory, 109–217 (Springer, 2011).
Goodpaster, J. D., Barnes, T. A., Manby, F. R. & Miller III, T. F. Density functional theory embedding for correlated wavefunctions: improved methods for open-shell systems and transition metal complexes. J. Chem. Phys. 137, 224113 (2012).
Cohen, A. J., Mori-Sánchez, P. & Yang, W. Challenges for density functional theory. Chem. Rev. 112, 289–320 (2012).
Ruzsinszky, A., Perdew, J. P. & Csonka, G. I. Binding energy curves from nonempirical density functionals. I. Covalent bonds in closed-shell and radical molecules. J. Phys. Chem. A 109, 11006–11014 (2005).
Hait, D. & Head-Gordon, M. Delocalization errors in density functional theory are essentially quadratic in fractional occupation number. J. Phys. Chem. Lett. 9, 6280–6288 (2018).
Lee, D., Furche, F. & Burke, K. Accuracy of electron affinities of atoms in approximate density functional theory. J. Phys. Chem. Lett. 1, 2124–2129 (2010).
Santra, G. & Martin, J. M. What types of chemical problems benefit from density-corrected DFT? A probe using an extensive and chemically diverse test suite. J. Chem. Theory Comput. 17, 1368–1379 (2021).
Lee, D. & Burke, K. Finding electron affinities with approximate density functionals. Mol. Phys. 108, 2687–2701 (2010).
Nam, S., Song, S., Sim, E. & Burke, K. Measuring density-driven errors using Kohn–Sham inversion. J. Chem. Theory Comput. 16, 5014–5023 (2020).
Hankins, D., Moskowitz, J. & Stillinger, F. Water molecule interactions. J. Chem. Phys. 53, 4544–4554 (1970).
Góra, U., Podeszwa, R., Cencek, W. & Szalewicz, K. Interaction energies of large clusters from many-body expansion. J. Chem. Phys. 135, 224102 (2011).
Reddy, S. K. et al. On the accuracy of the MB-pol many-body potential for water: interaction energies, vibrational frequencies, and classical thermodynamic and dynamical properties from clusters to liquid water and ice. J. Chem. Phys. 145, 194504 (2016).
Babin, V., Leforestier, C. & Paesani, F. Development of a “first principles” water potential with flexible monomers: dimer potential energy surface, VRT spectrum, and second virial coefficient. J. Chem. Theory Comput. 9, 5395–5403 (2013).
Babin, V., Medders, G. R. & Paesani, F. Development of a “first principles” water potential with flexible monomers. II: trimer potential energy surface, third virial coefficient, and small clusters. J. Chem. Theory Comput. 10, 1599–1607 (2014).
Medders, G. R., Babin, V. & Paesani, F. Development of a “first-principles” water potential with flexible monomers. III. Liquid phase properties. J. Chem. Theory Comput. 10, 2906–2910 (2014).
Skinner, L. B. et al. Benchmark oxygen-oxygen pair-distribution function of ambient water from X-ray diffraction measurements with a wide Q-range. J. Chem. Phys. 138, 074506 (2013).
Skinner, L. B., Benmore, C., Neuefeind, J. C. & Parise, J. B. The structure of water around the compressibility minimum. J. Chem. Phys. 141, 214507 (2014).
Brown, S. E. et al. Monitoring water clusters “melt” through vibrational spectroscopy. J. Am. Chem. Soc. 139, 7082–7088 (2017).
Xantheas, S. S. & Aprà, E. The binding energies of the D₂d and S₄ water octamer isomers: high-level electronic structure and empirical potential results. J. Chem. Phys. 120, 823–828 (2004).
Manna, D., Kesharwani, M. K., Sylvetsky, N. & Martin, J. M. Conventional and explicitly correlated ab initio benchmark study on water clusters: revision of the BEGDB and WATER27 data sets. J. Chem. Theory Comput. 13, 3136–3152 (2017).
Pederson, M. R., Ruzsinszky, A. & Perdew, J. P. Communication: Self-interaction correction with unitary invariance in density functional theory. J. Chem. Phys. 140, 121103 (2014).
Elrod, M. J. & Saykally, R. J. Many-body effects in intermolecular forces. Chem. Rev. 94, 1975–1997 (1994).
Pérez, C. et al. Structures of cage, prism, and book isomers of water hexamer from broadband rotational spectroscopy. Science 336, 897–901 (2012).
Riera, M., Lambros, E., Nguyen, T. T., Götz, A. W. & Paesani, F. Low-order many-body interactions determine the local structure of liquid water. Chem. Sci. 10, 8211–8218 (2019).
Zhuang, D., Riera, M., Schenter, G. K., Fulton, J. L. & Paesani, F. Many-body effects determine the local hydration structure of Cs⁺ in solution. J. Phys. Chem. Lett. 10, 406–412 (2019).
Piaggi, P. M., Panagiotopoulos, A. Z., Debenedetti, P. G. & Car, R. Phase equilibrium of water with hexagonal and cubic ice using the SCAN functional. J. Chem. Theory Comput. 17, 3065–3077 (2021).
Yao, Y. & Kanai, Y. Temperature dependence of nuclear quantum effects on liquid water via artificial neural network model based on SCAN meta-GGA functional. J. Chem. Phys. 153, 044114 (2020).
Matsuoka, O., Clementi, E. & Yoshimine, M. CI study of the water dimer potential surface. J. Chem. Phys. 64, 1351–1361 (1976).
Lie, G. & Clementi, E. Molecular-dynamics simulation of liquid water with an ab initio flexible water–water interaction potential. Phys. Rev. A 33, 2679 (1986).
Evans, M., Refson, K., Swamy, K., Lie, G. & Clementi, E. Molecular-dynamics simulation of liquid water with an ab initio flexible water–water interaction potential. II. The effect of internal vibrations on the time correlation functions. Phys. Rev. A 36, 3935 (1987).
Niesar, U., Corongiu, G., Clementi, E., Kneller, G. & Bhattacharya, D. Molecular dynamics simulations of liquid water using the NCC ab initio potential. J. Phys. Chem. 94, 7949–7956 (1990).
Zhang, L., Wang, H., Car, R. & E, W. Phase diagram of a deep potential water model. Phys. Rev. Lett. 126, 236001 (2021).
Bajaj, P., Götz, A. W. & Paesani, F. Toward chemical accuracy in the description of ion–water interactions through many-body representations. I. Halide–water dimer potential energy surfaces. J. Chem. Theory Comput. 12, 2698–2705 (2016).
Riera, M., Mardirossian, N., Bajaj, P., Götz, A. W. & Paesani, F. Toward chemical accuracy in the description of ion–water interactions through many-body representations. Alkali-water dimer potential energy surfaces. J. Chem. Phys. 147, 161715 (2017).
Riera, M., Yeh, E. P. & Paesani, F. Data-driven many-body models for molecular fluids: CO₂/H₂O mixtures as a case study. J. Chem. Theory Comput. 16, 2246–2257 (2020).
Riera, M., Hirales, A., Ghosh, R. & Paesani, F. Data-driven many-body models with chemical accuracy for CH₄/H₂O mixtures. J. Phys. Chem. B 124, 11207–11221 (2020).
Bull-Vulpe, E. F., Riera, M., Götz, A. W. & Paesani, F. MB-Fit: Software infrastructure for data-driven many-body potential energy functions. J. Chem. Phys. 155, 124801 (2021).
Riera, M. & Paesani, F. MBX: A many-body energy and force calculator. http://paesanigroup.ucsd.edu/software/mbx.html (2021).
Kapil, V. et al. i-PI 2.0: A universal force engine for advanced molecular simulations. Comput. Phys. Commun. 236, 214–223 (2019).
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Lambros, E. et al. General many-body framework for data-driven potentials with arbitrary quantum mechanical accuracy: water as a case study. J. Chem. Theory Comput. 17, 5635–5650 (2021).
Partridge, H. & Schwenke, D. W. The determination of an accurate isotope dependent potential energy surface for water from extensive ab initio calculations and experimental data. J. Chem. Phys. 106, 4618–4639 (1997).
Braams, B. J. & Bowman, J. M. Permutationally invariant potential energy surfaces in high dimensionality. Int. Rev. Phys. Chem. 28, 577–606 (2009).
Muniz, M. C. et al. Vapor–liquid equilibrium of water with the MB-pol many-body potential. J. Chem. Phys. 154, 211103 (2021).
Dunning Jr, T. H. Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen. J. Chem. Phys. 90, 1007–1023 (1989).
Kendall, R. A., Dunning Jr, T. H. & Harrison, R. J. Electron affinities of the first-row atoms revisited. Systematic basis sets and wave functions. J. Chem. Phys. 96, 6796–6806 (1992).
Epifanovsky, E. et al. Software for the frontiers of quantum chemistry: an overview of developments in the Q-Chem 5 package. J. Chem. Phys. 155, 084801 (2021).
Murray, C. W., Handy, N. C. & Laming, G. J. Quadrature schemes for integrals of density functional theory. Mol. Phys. 78, 997–1014 (1993).
Lebedev, V. I. Quadratures on a sphere. USSR Comput. Math. Math. Phys. 16, 10–24 (1976).
Dasgupta, S. & Herbert, J. M. Standard grids for high-precision integration of modern density functionals: SG-2 and SG-3. J. Comput. Chem. 38, 869–882 (2017).
Furness, J. W., Kaplan, A. D., Ning, J., Perdew, J. P. & Sun, J. Accurate and numerically efficient r²SCAN meta-generalized gradient approximation. J. Phys. Chem. Lett. 11, 8208–8215 (2020).
Adler, T. B., Knizia, G. & Werner, H.-J. A simple and efficient CCSD(T)-F12 approximation. J. Chem. Phys. 127, 221106 (2007).
Zhong, S., Barnes, E. C. & Petersson, G. A. Uniformly convergent n-tuple-ζ augmented polarized (nZaP) basis sets for complete basis set extrapolations. I. Self-consistent field energies. J. Chem. Phys. 129, 184116 (2008).
Helgaker, T., Klopper, W., Koch, H. & Noga, J. Basis-set convergence of correlated calculations on water. J. Chem. Phys. 106, 9639–9646 (1997).
Yousaf, K. E. & Peterson, K. A. Optimized auxiliary basis sets for explicitly correlated methods. J. Chem. Phys. 129, 184108 (2008).
Yousaf, K. E. & Peterson, K. A. Optimized complementary auxiliary basis sets for explicitly correlated methods: aug-cc-pVnZ orbital basis sets. Chem. Phys. Lett. 476, 303–307 (2009).
Neese, F. Software update: the ORCA program system, version 4.0. WIREs Comput. Mol. Sci. 8, e1327:1–6 (2017).
Lemmon, E. W., McLinden, M. O. & Friend, D. G. In NIST Chemistry WebBook (eds Linstrom, P. & Mallard, W.) (National Institute of Standards and Technology, Gaithersburg, MD, 2021).
Holz, M., Heil, S. R. & Sacco, A. Temperature-dependent self-diffusion coefficients of water and six selected molecular liquids for calibration in accurate ¹H NMR PFG measurements. Phys. Chem. Chem. Phys. 2, 4740–4742 (2000).
Easteal, A. J., Price, W. E. & Woolf, L. A. Diaphragm cell for high-temperature diffusion measurements. Tracer diffusion coefficients for water to 363 K. J. Chem. Soc., Faraday Trans. 1 85, 1091–1097 (1989).
Mills, R. Self-diffusion in normal and heavy water in the range 1–45°. J. Phys. Chem. 77, 685–688 (1973).


Acknowledgements

We thank Eunji Sim, Suhwan Song, and Kieron Burke for stimulating discussions. This research was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, through grants no. DE-SC0019490 (F.P.) and no. DE-SC0018331 (J.P.P.). This research used resources of the National Energy Research Scientific Computing Center (NERSC), which is supported by the Office of Science of the U.S. Department of Energy under contract DE-AC02-05CH11231, the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation through grant no. ACI-1548562, and the Triton Shared Computing Cluster (TSCC) at the San Diego Supercomputer Center (SDSC).

Author information

These authors contributed equally: Saswata Dasgupta, Eleftherios Lambros.

Authors and Affiliations

Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, 92093, USA
Saswata Dasgupta, Eleftherios Lambros & Francesco Paesani
Department of Physics, Temple University, Philadelphia, PA, 19122, USA
John P. Perdew
Department of Chemistry, Temple University, Philadelphia, PA, 19122, USA
John P. Perdew
Materials Science and Engineering, University of California, San Diego, La Jolla, CA, 92093, USA
Francesco Paesani
San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, 92093, USA
Francesco Paesani

Authors

Saswata Dasgupta
Eleftherios Lambros
John P. Perdew
Francesco Paesani

Contributions

S.D., E.L., J.P.P., and F.P. analyzed the data and wrote the paper. S.D. and E.L. contributed equally to this work. F.P. designed and supervised the research.

Corresponding author

Correspondence to Francesco Paesani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Edgar Engel and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Dasgupta, S., Lambros, E., Perdew, J.P. et al. Elevating density functional theory to chemical accuracy for water simulations through a density-corrected many-body formalism. Nat Commun 12, 6359 (2021). https://doi.org/10.1038/s41467-021-26618-9


Received: 22 July 2021
Accepted: 08 October 2021
Published: 04 November 2021
DOI: https://doi.org/10.1038/s41467-021-26618-9
