An electrostatic spectral neighbor analysis potential for lithium nitride

Deng, Zhi; Chen, Chi; Li, Xiang-Guo; Ong, Shyue Ping

doi:10.1038/s41524-019-0212-1

Download PDF

Article
Open access
Published: 16 July 2019

An electrostatic spectral neighbor analysis potential for lithium nitride

Zhi Deng¹,
Chi Chen¹,
Xiang-Guo Li¹ &
…
Shyue Ping Ong ORCID: orcid.org/0000-0001-5726-2587¹

npj Computational Materials volume 5, Article number: 75 (2019) Cite this article

5008 Accesses
68 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Machine-learned interatomic potentials based on local environment descriptors represent a transformative leap over traditional potentials based on rigid functional forms in terms of prediction accuracy. However, a challenge in their application to ionic systems is the treatment of long-ranged electrostatics. Here, we present a highly accurate electrostatic Spectral Neighbor Analysis Potential (eSNAP) for ionic α-Li₃N, a prototypical lithium superionic conductor of interest as a solid electrolyte or coating for rechargeable lithium-ion batteries. We show that the optimized eSNAP model substantially outperforms traditional Coulomb–Buckingham potential in the prediction of energies and forces, as well as various properties, such as lattice constants, elastic constants, and phonon dispersion curves. We also demonstrate the application of eSNAP in long-time, large-scale Li diffusion studies in Li₃N, providing atomistic insights into measures of concerted ionic motion (e.g., the Haven ratio) and grain boundary diffusion. This work aims at providing an approach to developing quantum-accurate force fields for multi-component ionic systems under the SNAP formalism, enabling large-scale atomistic simulations for such systems.

Highly efficient evaluation of diffusion networks in Li ionic conductors using a 3D-corrugation descriptor

Article Open access 22 October 2019

Arthur France-Lanord, Ryoji Asahi, … Erich Wimmer

Principal component analysis enables the design of deep learning potential precisely capturing LLZO phase transitions

Article Open access 18 March 2024

Yiwei You, Dexin Zhang, … Shunqing Wu

Prediction of stable Li-Sn compounds: boosting ab initio searches with neural network potentials

Article Open access 28 June 2022

Saba Kharabadze, Aidan Thorn, … Aleksey N. Kolmogorov

Introduction

A potential energy surface (PES) that yields potential energy of a system of atoms with given atomic coordinates is the fundamental enabler for atomistic simulation methods. In principle, ab initio or first principles methods that solve the Schrödinger equation, typically some approximation within the Kohn-Sham density functional theory (DFT) framework,^1,2 can be applied to directly calculate the PES. While such methods are highly accurate and transferable across diverse chemistries and bonding types, their high computational cost limit their application in molecular dynamics (MD) simulations to relatively small and simple systems containing up to a few hundreds of atoms and sub-nanosecond time scales. Empirical interatomic potentials, on the other hand, are a much cheaper alternative. The functional form of these potentials are drastically simplified with only a few fitting parameters to satisfy physical considerations.^3,4 However, the accuracy of the empirical potentials is necessarily limited by the approximations made in selecting the functional form, which are generally not transferable to another system with different bonding types.

In recent years, an alternative approach has gained popularity in constructing interatomic potentials with improved transferability.^5,6,7,8,9,10 In this approach, the atomic coordinates are featurized using local environment descriptors that are invariant to translations, rotations and permutations of homo-nuclear atoms, and are differentiable and unique.^8,11 A machine learning model is then trained to map the structural features to data (energies, forces, etc.) from first principles calculations. Such potentials have been demonstrated to achieve accuracy close to first principles methods at much lower computational costs.^5,7,9,10

The coefficients of the bispectrum of local atomic density were first applied in the Gaussian approximation potential by Bartók et al.⁷ Thompson et al. later showed that a linear model of bispectrum coefficients from the lowest order—the so-called spectral neighbor analysis potential (SNAP)—can accurately reproduce DFT energies and forces as well as a variety of calculated properties (e.g., elastic constants and migration barrier for screw dislocations) in bcc Ta and W.^9,12 More recently, the current authors have extended the SNAP formalism to bcc Mo, fcc Ni, and Cu, and the binary fcc Ni-bcc Mo alloy systems and showed that it outperforms traditional embedded atom method (EAM) and modified EAM potentials across a wide range of properties.^13,14 Thus far, SNAP models have mainly been developed for metallic systems.

For ionic systems, a common strategy in constructing interatomic potentials is to incorporate long-ranged electrostatic interactions (e.g., through the use of the Ewald summation) on top of energy model. This has been done for both traditional empirical models^15,16 as well as modern local atomic environment descriptor-based potentials (e.g., GAP for the mixed ionic-covalent GaN⁷ and neural network potential for ZnO⁶). In this work, we develop a highly accurate electrostatic SNAP (eSNAP) model for ionic α-Li₃N (see Fig. 1a). α-Li₃N is one of the earliest lithium superionic conductors ever reported,¹⁷ and remains a promising solid electrolyte/anode coating candidate today due to its stability against Li metal.^18,19 A highly accurate potential model for α-Li₃N would enable large-scale, long-time-scale diffusion studies of this highly important prototypical lithium conductor, as well as serve as a platform in which to develop similar potentials for more complex systems.

Results

Optimized model parameters

In our proposed electrostatic SNAP (eSNAP) model, we write the total potential energy E_p as the sum of the electrostatic contributions and the local energy (SNAP) due to the variations in atomic local environments, as follows:

$$E_{\mathrm{p}} = \gamma E_{{\mathrm{el}}} + E_{{\mathrm{SNAP}}}$$

(1)

$${\mathbf{F}}_j = - \nabla _jE_{\mathrm{p}} = - \gamma \nabla _jE_{{\mathrm{el}}} - F_{j,{\mathrm{SNAP}}}$$

(2)

where E_el and E_SNAP are the electrostatic energy computed using the Ewald summation approach²⁰ and the energy from SNAP, respectively, and γ is an effective screening prefactor for electrostatic interactions. An iterative procedure was developed to fit all model parameters using total energies and forces from DFT calculations until the training and test errors are converged (see Methods section for details).

For Li₃N, we calculated the electrostatic energy by assigning formal charges 1 and −3 to Li and N, respectively. For highly ionic α-Li₃N, we find that assigning formal charges, with screening accounted for via a fitted parameter (γ in Eq. (1)), results in a simpler, more stable potential model than variable charge models such as the charge equilibration (QEq)²¹ method. The narrow charge distribution of Li atoms from Bader analysis (see Fig. S1) also supports the usage of fixed charge. The final hyperparameters and coefficients for the optimized eSNAP model are given in Table 1. The optimized effective screening parameter γ is 0.057.

Table 1 Final hyperparameters and coefficients of SNAP

Full size table

Energy and force prediction

Figure 2a, b shows the comparison between DFT calculated and eSNAP predicted energies and forces on both training and test dataset in the final iteration. Both energy and force predictions agree well with those from DFT calculations, indicating the eSNAP model has successfully captured the fundamental relationship between atomic environment and potential energy/atomic forces. The mean absolute errors (MAEs) on energies and forces reached convergence after only two iterations, as shown in Fig. 2c, d. In comparison, the MAEs between DFT and the Coulomb–Buckingham potential by ref. ²² on the initial training configuration pool are substantially higher for both energies (22 meV/atom) and forces (0.48 eV/Å).

Structural properties

Table 2 compares the computed physical properties of α-Li₃N with different potential energy surfaces. The lattice constants calculated from eSNAP agree with those from DFT and experiments.²³ The calculated elastic constants from eSNAP also match reasonably well with DFT calculated and experimental values.²⁴ This excellent agreement on structural properties can be expected from the fact that the energies of unit cells with various distortions have been fed to the model with a large sample weight. In comparison, the lattice constants and elastic constants from the Coulomb–Buckingham potential match poorly with both DFT and experimental values, despite the fact that these physical properties were used to determine the potential parameters.²²

Table 2 Calculated structural properties from different potentials and lattice constants²³ and elastic constants²⁴ from experimental measurements

Full size table

We have also calculated the formation energy of Li Frenkel defects and the migration barrier of these defects. We considered two Frenkel configurations where a vacancy is introduced on a Li2 site and the interstitial Li is located at either Li2 site (intra-planar, Fig. 1b) or Li1 site (inter-planar, Fig. 1c), and all defect configurations are fully relaxed within each potential. The eSNAP model yields reasonably close formation energy of intra-planar defect to the DFT value, but slightly overestimates the value of inter-planar defect by 0.12 eV. On the other hand, the Coulomb–Buckingham potential underestimates the defect formation energies, likely due to the use of unsatisfactory lattice constants in building the defect configurations. Using the nudged elastic band (NEB) method,²⁵ we calculated the migration barrier of two types of hops, namely intra-planar (Li2 to Li2) vacancy migration and inter-planar (Li1 to Li2) Li interstitial migration. As shown in Table 2, the eSNAP barrier for intra-planar vacancy migration is in good agreement with the DFT barrier, while the eSNAP barrier for inter-planar interstitial migration underestimates the DFT barrier by 0.13 eV. We note that the eSNAP defect formation energies and migration barriers for the dominant intra-planar diffusion direction are reasonably close to the DFT values, while the overestimation of the inter-planar defect formation energy by eSNAP is compensated by the underestimation of the vacancy migration barrier. Similar errors and error compensations have been reported in prior non-electrostatic SNAP models on metals such as Mo and Ni.^13,14 On the other hand, we are unable to converge the NEB barriers using the Coulomb–Buckingham potential due to its inability to model the transition states.

Finally, Fig. 3 compares the calculated phonon dispersion curves of α-Li₃N from eSNAP with those from DFT calculations. The phonon dispersion curves were calculated using the finite displacement approach on a 3 × 3 × 3 supercell as implemented in the phonopy package.²⁶ We find that the phonon dispersion curves calculated from eSNAP are in good agreement with that from DFT. The only discrepancy is the imaginary phonon mode at Γ point observed in DFT phonon dispersion. According to Wu et al.,²⁷ this lattice instability is associated with the vibration of Li2 sites along the c axis, resulting in a more stable phase that is only 0.3 meV/atom lower in energy after displacing Li2 site by ~0.1 Å. This energy difference is well within the energy prediction error of the eSNAP model. We also note that the experimentally measured phonon dispersion curves at room temperature do not exhibit this lattice instability.²⁴ In contrast, the phonon dispersion curve calculated from the Coulomb–Buckingham potential show severely overestimated frequencies (Fig. S2) due to its unsatisfactory force prediction.

Bulk diffusion

MD simulations were performed using the optimized eSNAP to investigate Li diffusion in bulk α-Li₃N. Built from the unit cell with equilibrium volume, the simulation box is a 5 × 5 × 5 supercell of bulk α-Li₃N containing 500 atoms. MD simulations were carried out at elevated temperatures from 600 to 1200 K in an NVT ensemble for 1 ns long.

We first validated the eSNAP by comparing the mean square displacement (MSD) and diffusivities obtained from eSNAP MD simulations with those obtained from ab initio molecular dynamics (AIMD) simulations at high temperatures (1000 and 1200 K). Runs at lower temperatures were not chosen due to the poor convergence of diffusivity at limited simulation length (40 ps). It should be noted that even though 1200 K is above the melting point of Li₃N, the lattice did not melt in either AIMD or eSNAP MD during the short period of simulations. As shown in Fig. S3, the generally high Li mobility and anisotropic diffusion in α-Li₃N are successfully reproduced with eSNAP MD simulations. The tracer diffusivities (given by the slope of the MSD with respect to time) from eSNAP MD (1.48 × 10⁻⁴ cm²/s at 1000 K, 2.35 × 10⁻⁴ cm²/s at 1200 K) are in generally good agreement with those from AIMD (1.28 × 10⁻⁴ cm²/s at 1000 K, 2.16 × 10⁻⁴ cm²/s at 1200 K), showing a slight overestimation of about 15% and 8% at 1000 K and 1200 K, respectively.

Beyond tracer diffusivities, the orders of magnitude lower computational cost of the eSNAP relative to DFT affords us the capability to compute the charge diffusivity D_σ. For each temperature, 100 independent simulations were performed starting from different initial velocities. Diffusivities were obtained by averaging square displacements over all simulations at a particular temperature. Figure 4 plots the predicted Haven ratio and Arrhenius plot for Li₃N from eSNAP MD simulations. The activation energies, extrapolated room temperature conductivities and average Haven ratio across all temperatures are tabulated in Table 3. The anisotropic diffusion in α-Li₃N observed experimentally^28,29 is reproduced in many aspects, including the magnitude of diffusivity, activation energy, and Haven ratio. The higher diffusivities and lower activation energy in the direction perpendicular to c axis is consistent with the lower Haven ratio found. The activation energy perpendicular to c axis is close to the one in single crystal measurement, though the value parallel to c axis is much lower compared with experiments.²⁸ The lower activation energies lead to much higher extrapolated room temperature ionic conductivity for both directions. The Haven ratios obtained from eSNAP MD are reasonably close to the NMR measured values. We note that the activation energies obtained from MD simulations are lower than the sum of defect formation and migration energies. This is a result of the concerted motion of ions lowering the energy barriers, which is confirmed by the low Haven ratio. In comparison, we also performed a similar series of MD simulations with the Coulomb–Buckingham potential,²² and the results significantly underestimate the fast ionic conduction in α-Li₃N, and significantly overestimates the Haven ratio. In particular, the Haven ratio for the direction parallel to the c-axis is computed to be >1 using the Coulomb–Buckingham potential.

Table 3 Bulk diffusion results from MD simulations using eSNAP and Coulomb–Buckingham potential and single crystal dc conductivity²⁸ and NMR²⁹ measurements

Full size table

Grain boundary diffusion

To investigate grain boundary (GB) diffusion, we first computed the GB energies of two low Σ twist GB configurations—Σ4 [1000] and Σ7 [0001]. Both configurations are fully relaxed using DFT and eSNAP. The eSNAP-calculated GB energies for twist Σ4 [1000] and Σ7 [0001] GBs are 1.41 and 0.85 J m⁻², respectively, in good agreement with the DFT values of 1.64 and 0.86 J m⁻², respectively.

The lower-energy twist Σ7 [0001] GB is then used in large-scale diffusion studies, as shown in Fig. 5. The simulation box (Fig. 5a) contains 5040 atoms in total. Due to the periodic boundary conditions, two GBs are separated by 10× lattice vector c present in the box. NVT MD simulations were carried out at 300 K, with thermalization lasting for 30 ps followed by the production simulation of 1 ns. We find that the MSD of Li atoms within the GB plane is much higher than that in the bulk region, as shown in Fig. 5b, c, and there are few migration events occurring between the GB layer and the bulk layers. From the MSD, we estimate the 2D Li self-diffusivity within the twist GB to be 7.09 × 10⁻⁸ cm²/s, about three times of extrapolated total value (2.24 × 10⁻⁸ cm²/s in 3D) in the bulk at 300 K. These results indicate that grain boundaries may provide a rapid pathway for Li diffusion in α-Li₃N.

Discussion

In this work, we demonstrate that modern potentials based on local environment descriptors such as the SNAP can be adapted for ionic systems by incorporating long-range electrostatics.

The introduction of γ as a hyperparameter offers more flexibility to the potential model in order to achieve higher predictive power. Physically, γ can be interpreted as the inverse of dielectric constant. Indeed, the optimized value of γ is 0.057, which implies an effective dielectric constant of 17.5, reasonably close to the experimental dielectric of α-Li₃N of 14.³⁰ We note that while the experimentally measured dielectric constant could have been provided as an input to model development, the goal of this effort is to develop a general approach to training eSNAP models for materials, some of which may not have measured dielectric constants. We have also attempted to fit a regular SNAP model for Li₃N without the use of electrostatic interactions, but using a larger cutoff radius of 8 Å to allow the model to learn screened electrostatic interactions. The resulting SNAP model has significant higher MAEs in energies and forces of 2.3 meV/atom and 0.15 eV Å⁻¹, respectively.

Unlike earlier works where the sample weights are treated as hyperparameters optimized toward structural properties (lattice constants, elastic constants, etc.),^13,14 we used fixed sample weights in linear regression as the different scales between energies and forces are unified by using standardized z-scores as targets. Sample weight assignment then effectively becomes an exercise in assigning importance of matching various computed properties from DFT. Note that reproducing energetic calculations where atoms are relaxed remains a challenge, as eSNAP could not distinguish the difference of defect formation energies in different Frenkel defect configurations.

It should be noted that the focus of the current eSNAP model is on reproducing the energies and forces on solid-phase α-Li₃N for the purposes of scaling MD simulations beyond the limited simulation cells and time scales in AIMD for diffusion studies. As such, the training structures were selected mainly for this purpose and no attempt was made to include a broad diversity of training structures from different polymorphs of Li₃N, liquid configurations, etc. in the training pool.

Our choice of the SNAP approach is motivated by its simple linear form and its efficient implementation in the widely available open-source LAMMPS Molecular Dynamics Simulator.³¹ Though the MAEs of linear SNAP may not be as low as those achieved using other regression models and descriptors,^8,32 its efficiency and low training data requirements are the decisive factors for our choice. In terms of scaling performance, we tested the running time of a 1000-step MD simulation with various system sizes (500–500,000 atoms). Despite the O(N log N) time complexity of Ewald summation, eSNAP generally shows linear scaling performance (Fig. S4), presumably governed by the time-consuming bispectrum coefficient calculations.

Finally, we applied the eSNAP model to conduct long-time-scale (~1 ns) simulations of complex models (500–5000 atoms) of α-Li₃N. We report the Haven ratio of α-Li₃N by directly calculating charge diffusivity and show that grain boundaries may provide faster diffusion pathways (relative to bulk). The calculation of charge diffusivity, which is difficult to converge in AIMD simulations, enables us to compute much more reliable estimates of the anisotropic diffusivities of α-Li₃N. Interestingly, though we find that conductivity in the c-crystallographic direction is in general slower than the ab plane, the value is only one order of magnitude lower, contrary to single crystal measurements.²⁸ Li et al.¹⁹ have recently grown pinhole-free Li₃N nanofilms as a protective layer on Li metal anodes by flowing nitrogen gas. A critical design requirement is that the conductivity of Li in the [001] direction is sufficiently high. Li et al.¹⁹ measured conductivities of up to 0.5 mS/cm, which is in good agreement with our predictions and in disagreement with prior experiments and simulations with the Coulomb–Buckingham potential. It should be emphasized that the conductivity of ~0.01 mS/cm in the c direction reported in previous work²⁸ would lead to a highly resistive, low-performing coating. We hope that further careful experiments in the near future may shed further light on these discrepancies in anisotropic diffusivities between different experiments and computational simulations on this highly important lithium conductor.

Methods

Electrostatic SNAP (eSNAP) model

The atomic environment around atom i at coordinates r can be described by its atomic neighbor density ρ_i(r) with the following equation:^7,9

$$\rho _i({\mathbf{r}}) = \delta ({\mathbf{r}}) + \sum\limits_{r_{ii^{\prime}} < R_{ii^{\prime}}} {f_c} (r_{ii^{\prime}})w_{i^{\prime}}\delta ({\mathbf{r}} - {\mathbf{r}}_{{\mathbf{ii}}^{\prime}}),$$

(3)

where r_ii' is the vector joining the coordinates of central atom i and its neighbor atom i′, the cutoff function f_c ensures that the neighbor atomic density decays smoothly to zero at cutoff radius R_ii′, and the dimensionless neighbor weights w_i′ distinguish atoms of different types. This density function can be expanded as a generalized Fourier series in the 4D hyper-spherical harmonics $U_{m,m\prime }^j(\theta ,\phi ,\theta _0)$ as follows:

$$\rho _i({\mathbf{r}}) = \mathop {\sum}\limits_{j = 0,\frac{1}{2},...}^\infty {\mathop {\sum}\limits_{m = - j}^j {\mathop {\sum}\limits_{m^{\prime} = - j}^j {u_{m,m^{\prime}}^j} } } U_{m,m^{\prime}}^j(\theta ,\phi ,\theta _0),$$

(4)

where the coefficients $u_{m,m^{\prime}}^j$ are given by the inner product $\langle U_{m,m^{\prime}}^j|\rho \rangle$. The bispectrum coefficients are then given as:

$$B_{j_1,j_2,j} = \mathop {\sum }\limits_{m_1,m_1^\prime = - j_1}^{j_1} ,\mathop {\sum }\limits_{m_2,m_2^\prime = - j_2}^{j_2} \mathop {\sum }\limits_{m,m^\prime = - j}^j \left( {u_{m,m^{\prime}}^j} \right)^ \ast H\begin{array}{*{20}{c}} {jmm^{\prime}} \\ {j_1m_1m_1^{\prime}} \\ {j_2m_2m_2^{\prime}} \end{array}u_{m_1,m_1^\prime }^{j_1}u_{m_2,m_2^\prime }^{j_2},$$

(5)

where the constants $H\begin{array}{*{20}{c}} {jmm^{\prime}} \\ {j_1m_1m_1^\prime} \\ {j_2m_2m_2^\prime} \end{array}$ are coupling coefficients.

In the original formulation of the non-ionic SNAP model,⁹ the energy and forces are expressed as a linear function of the bispectrum coefficients, as follows:

$$E_{{\mathrm{SNAP}}} = \mathop {\sum}\limits_\alpha {\left( {\beta _{\alpha ,0}N_\alpha + \mathop {\sum}\limits_{k = \{ j_1,j_2,j\} } {\beta _{\alpha ,k}} \mathop {\sum}\limits_{i = 1}^{N_\alpha } {B_{k,i}} } \right)}$$

(6)

$${\mathbf{F}}_{j,{\mathrm{SNAP}}} = - \mathop {\sum}\limits_\alpha {\mathop {\sum}\limits_{k = \{ j_1,j_2,j\} } {\beta _{\alpha ,k}} } \mathop {\sum}\limits_{i = 1}^{N_\alpha } {\frac{{\partial B_{k,i}}}{{\partial {\mathbf{r}}_j}}} .$$

(7)

where α is the chemical identity of atoms, N_α is the total number of α atoms in the system, and β_α,k are the coefficients in the linear SNAP model for type α atoms.

For ionic systems, electrostatic interactions spanning in the entire range of interatomic distances are indispensable in the construction of energy model due to the long-range tail beyond the cutoff distance for local environment description (see Fig. 6). In our proposed electrostatic SNAP (eSNAP) model, we write the total potential energy as the sum of the electrostatic contributions and the local energy (SNAP) due to the variations in atomic local environments, as follows:

$$E_{\mathrm{p}} = \gamma E_{{\mathrm{el}}} + E_{{\mathrm{SNAP}}}$$

(8)

$${\mathbf{F}}_j = - \nabla _jE_{\mathrm{p}} = - \gamma \nabla _jE_{{\mathrm{el}}} - F_{j,{\mathrm{SNAP}}}$$

(9)

where E_el is the electrostatic energy computed using the Ewald summation approach²⁰ and γ is an effective screening prefactor for electrostatic interactions. The coefficients (γ and β) can be solved by fitting the linear model to total energies and forces from DFT calculations.

In addition, nuclei repulsions emerge at extremely short interatomic distances. In this work, the Ziegler-Biersack-Littmark (ZBL) potential is used to account for short-ranged nuclei repulsions.³³ To ensure that the fitting process captures the relevant relationship between the bispectrum coefficients and the DFT energies and forces, the cutoff distances of ZBL were chosen to be short enough (R_i = 1.0 Å, R_o = 1.5 Å) such that the ZBL potential has negligible contribution to energies or forces among the initial training configurations where extremely close interatomic distances were not sampled. More details about ZBL settings used in this work can be found in Supplementary Information.

Training data generation

Figure 1a shows the hexagonal P6/mmm unit cell of α-Li₃N, where Li2 sites form Li₂N layers with N sites in the ab plane and Li1 sites connect N sites in neighboring Li₂N layers along the c axis. To sample a diverse set of configurations, the initial training set includes two major components:

1.
Starting from the relaxed α-Li₃N unit cell, we first generated two series of unit cells with lattice distortions. One series samples different lattice constants a and c, and the other samples unit cells with different levels of strains (−1% to 1% at 0.2% intervals) applied in six different modes as described in de Jong et al.³⁴
2.
Snapshots were extracted from AIMD simulations at temperatures from 400 to 1200 K at 200 K intervals under an NVT ensemble. Starting from a 3 × 3 × 3 supercell with equilibrium volume, for each temperature, 200 snapshots were taken from a 40 ps AIMD simulation.

To ensure accurate energies and forces, static DFT calculations were performed on all configurations (including snapshots from AIMD).

DFT calculations

All DFT calculations were performed using the Vienna Ab initio Simulation Package (VASP)³⁵ within the projector augmented wave approach.³⁶ The Perdew-Burke-Ernzerhof (PBE) generalized gradient approximation was adopted as the exchange-correlation functional.³⁷ To ensure the convergence of energy and atomic force, a plane-wave energy cutoff of 520 eV and Γ-centered k-point meshes with a density of at least 30 Å were employed for all static DFT calculations. For AIMD simulations, a single Γ k-point and a much lower-energy cutoff of 300 eV were used for rapid propagation of trajectories.

Model training and test

Table 4 shows the weights applied on the different sets of training configurations during model training. As the initial training dataset contains many more configurations from AIMD snapshots with larger number of atoms, a much larger weight was applied on the energies of the distorted unit cells relative to those from the AIMD snapshots. A zero weight was applied on the negligibly small forces for the distorted unit cells.

Table 4 Data distribution and applied weights on different types of data points in the initial training dataset

Full size table

As shown in Fig. 7a, the energies and forces differ greatly in magnitude and distribution due to differences in the scales and units. In the original SNAP training approach, the effect of this difference in magnitude and distribution is partially accounted for by treating the data weights as hyperparameters to be optimized.^13,14 In this work, we use the standardized z-scores of energies and forces (plotted in Fig. 7b) as the targets in model training to avoid incorporating the effect of the distribution in the data weights, which are therefore fixed at the values in Table 4. The “standardized” eSNAP model in the fitting process is then given by the following:

$$\left[ {\begin{array}{*{20}{c}} {\frac{{e - \bar e}}{{\sigma _e}}} \\ \vdots \end{array}} \right] = \frac{1}{{N\sigma _e}}\left[ {\begin{array}{*{20}{c}} {E_{{\mathrm{el}}}} & {N_\alpha } & {\mathop {\sum}\limits_{i = 1}^{N_\alpha } {B_{1,i}} } & \ldots & {\mathop {\sum}\limits_{i = 1}^{N_\alpha } {B_{k,i}} } & \ldots \\ \vdots & \vdots & \vdots & \ldots & \vdots & \ldots \end{array}} \right]{\boldsymbol{\beta }}^{\mathrm{T}},$$

(10)

$$\left[ {\begin{array}{*{20}{l}} {\frac{{{\mathbf{F}}_j}}{{\sigma _F}}} \hfill \\ \vdots \hfill \end{array}} \right] = \frac{1}{{\sigma _F}}\left[ {\begin{array}{*{20}{c}} { - \frac{{\partial E_{{\mathrm{el}}}}}{{\partial {\mathbf{r}}_j}}} & 0 & { - \mathop {\sum}\limits_{i = 1}^{N_\alpha } {\frac{{\partial B_{1,i}}}{{\partial {\mathbf{r}}_j}}} } & \ldots & { - \mathop {\sum}\limits_{i = 1}^{N_\alpha } {\frac{{\partial B_{k,i}}}{{\partial {\mathbf{r}}_j}}} } & \ldots \\ \vdots & \vdots & \vdots & \ldots & \vdots & \ldots \end{array}} \right]{\boldsymbol{\beta }}^{\mathrm{T}},$$

(11)

where e is the energy per atom, $\bar e$ is the mean of e, and σ_e and σ_F are the standard deviations of e and F, respectively. The mean of forces is omitted since it is close to zero. The coefficient vector β^T to be solved can be written as:

$${\boldsymbol{\beta }}^{\mathrm{T}} = \left[ {\begin{array}{*{20}{l}} \gamma \hfill & {\beta _{\alpha ,0} - \bar e} \hfill & {\beta _{\alpha ,1}} \hfill & \ldots \hfill & {\beta _{\alpha ,k}} \hfill & \ldots \hfill \end{array}} \right]^{\mathrm{T}}.$$

(12)

For bispectrum coefficient calculations, we used the implementation available in LAMMPS.⁹ The two hyperparameters (cutoff distance R_α and atomic weight w_α) for each element (Li and N in the case of Li₃N) were determined using a two-step grid search scheme for the atomic weights and then followed by the cutoff distances. The MAE of forces from a linear model trained on the initial training set was chosen as the metric. For the atomic weights, it should be noted that the atomic density in ionic systems is generally higher than that in metallic systems; hence the search of atomic weights was performed in the range where |w_α| < 1. Similarly, the search space for cutoff radius was limited to the range where R_α < 4 Å. The results from grid search (Fig. S5) are available in Supplementary Information, and the final hyperparameters can be found in Table 1.

Figure 8 shows the flow chart of the iterative procedure used for training the eSNAP model in this work. A preliminary eSNAP model was first trained using the initial training set. Using this fitted eSNAP model, MD simulations were then carried out using a 3 × 3 × 3 supercell in equilibrium volume at temperatures ranging from 300 K to 1200 K at 100 K intervals under an NVT ensemble for 40 ps. Ten snapshots were sampled from each MD simulation to form a new set of test configurations. Static DFT calculations were performed on these test configurations. If the test MAEs for either energies or forces were significantly larger than the corresponding training MAEs, the test set was then merged into the training set to form a new extended training set. The entire eSNAP fitting, simulation and testing procedure was repeated until there is no significant over-fitting in both energies and forces. In this work, we use 150% of training MAE as the threshold to achieve a balance between the benefit gained by adding more training instances and the associated costs of performing more DFT calculations. It should be noted that this strategy is designed to bias the eSNAP model to improve the predictions on energy and force of MD simulations, which is the target application of interest in this work.

Diffusivity calculations

The tracer diffusivity of Li D* is calculated from the MSD of all diffusing Li ions as described by the Einstein relation:

$$D^ \ast = \frac{1}{{2dt}}\frac{1}{N}\mathop {\sum}\limits_{i = 1}^N {\left\langle {\left[ {{\mathrm{\Delta }}{\mathbf{r}}_i(t)} \right]^2} \right\rangle } ,$$

(13)

where d is the number of dimensions in which diffusion occurs, N is the total number of diffusing Li ions, Δr_i(t) is the displacement of the ith Li ion at time t.

The charge diffusivity of Li D_σ is calculated from the square net displacement of all diffusing Li ions, as described below:

$$D_\sigma = \frac{1}{{2dt}}\frac{1}{N}\left\langle {\left[ {\mathop {\sum}\limits_{i = 1}^N \Delta {\mathbf{r}}_i(t)} \right]^2} \right\rangle$$

(14)

The Li conductivity at temperature T (unit: K) can be calculated from the charge diffusivity D_σ using the Nernst-Einstein equation:

$$\sigma = \frac{{\rho z^2F^2}}{{RT}}D_\sigma ,$$

(15)

where ρ is the molar density of Li, z is the charge of Li (+1), F is the Faraday constant, and R is the gas constant.

In addition, the ratio between the tracer and charge diffusivities is referred to as the Haven ratio H_R = D*/D_σ.

All the simulations with the eSNAP were performed using LAMMPS.³¹ All the structure manipulations and interfacing with VASP and LAMMPS were handled by the Python Materials Genomics (pymatgen) library.³⁸

Data availability

The training configurations and their DFT computed total energy and atomic forces are available in the SNAP development repo on Github (https://github.com/materialsvirtuallab/snap).

References

Hohenberg, P. & Kohn, W. Inhomogeneous electron gas. Phys. Rev. 136, B864 (1964).
Article Google Scholar
Kohn, W. & Sham, L. J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133–A1138 (1965).
Article Google Scholar
Buckingham, R. A. The classical equation of state of gaseous helium, neon and argon. Proc. R. Soc. Lond. A 168, 264–283 (1938).
Article CAS Google Scholar
Daw, M. S. & Baskes, M. I. Embedded-atom method: derivation and application to impurities, surfaces, and other defects in metals. Phys. Rev. B 29, 6443–6453 (1984).
Article CAS Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article Google Scholar
Artrith, N., Morawietz, T. & Behler, J. High-dimensional neural-network potentials for multicomponent systems: applications to zinc oxide. Phys. Rev. B 83, 153101 (2011).
Article Google Scholar
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Article Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B - Condens. Matter Mater. Phys. 87, 1–16 (2013).
Google Scholar
Thompson, A., Swiler, L., Trott, C., Foiles, S. & Tucker, G. Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials. J. Comput. Phys. 285, 316–330 (2015).
Article CAS Google Scholar
Shapeev, A. V. Moment tensor potentials: a class of systematically improvable interatomic potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Article Google Scholar
Behler, J. Atom-centered symmetry functions for constructing high-dimensional neural network potentials. J. Chem. Phys. 134, 074106 (2011).
Article Google Scholar
Wood, M. A. & Thompson, A. P. Extending the accuracy of the SNAP interatomic potential form. J. Chem. Phys. 148, 241721 (2018).
Chen, C. et al. Accurate force field for molybdenum by machine learning large materials. Data. Phys. Rev. Mater. 1, 043603 (2017).
Article Google Scholar
Li, X. G. et al. Quantum-accurate spectral neighbor analysis potential models for Ni-Mo binary alloys and fcc metals. Phys. Rev. B 98, 1–10 (2018a).
Google Scholar
Lewis, G. V. & Catlow, C. R. A. Potential models for ionic oxides. J. Phys. C. Solid State Phys. 18, 1149 (1985).
Article CAS Google Scholar
Lee, E., Lee, K. R., Baskes, M. I. & Lee, B. J. A modified embedded-atom method interatomic potential for ionic systems: 2NNMEAM+Qeq. Phys. Rev. B 93, 144110 (2016).
Article Google Scholar
Boukamp, B. A. & Huggins, R. A. Fast ionic conductivity in lithium nitride. Mater. Res. Bull. 13, 23–32 (1978).
Article CAS Google Scholar
Zhu, Y., He, X. & Mo, Y. Strategies based on nitride materials chemistry to stabilize Li metal anode. Adv. Sci. 4, 1600517 (2017).
Article Google Scholar
Li, Y. et al. Robust Pinhole-free Li₃N solid electrolyte grown from molten lithium. ACS Cent. Sci. 4, 97–104 (2018b).
Article CAS Google Scholar
Ewald, P. P. Die Berechnung optischer und elektrostatischer Gitterpotentiale. Ann. Phys. 369, 253–287 (1921).
Article Google Scholar
Rappe, A. K. & Goddard, W. A. III Charge equilibration for molecular dynamics simulations. J. Phys. Chem. 95, 3358–3363 (1991).
Article CAS Google Scholar
Walker, J. & Catlow, C. Defect structure and ionic conductivity in lithium nitride. Philos. Mag. A 43, 265–272 (1981).
Article CAS Google Scholar
Rabenau, A. & Schulz, H. Re-evaluation of the lithium nitride structure. J. Less-Common Met. 50, 155–159 (1976).
Article CAS Google Scholar
Kress, W., Grimm, H., Press, W. & Lefebvre, J. Lattice vibrations in lithium nitride, Li₃N. Phys. Rev. B 22, 4620–4625 (1980).
Article CAS Google Scholar
Henkelman, G., Uberuaga, B. P. & Jónsson, H. A climbing image nudged elastic band method for finding saddle points and minimum energy paths. J. Chem. Phys. 113, 9901 (2000).
Article CAS Google Scholar
Togo, A. & Tanaka, I. First principles phonon calculations in materials science. Scr. Mater. 108, 1–5 (2015).
Article CAS Google Scholar
Wu, G., Wu, S. & Wu, P. Doping-enhanced lithium diffusion in lithium-ion batteries. Phys. Rev. Lett. 107, 118302 (2011).
Article Google Scholar
Alpen, U. V., Rabenau, A. & Talat, G. H. Ionic conductivity in Li₃N single crystals. Appl. Phys. Lett. 30, 621–623 (1977).
Article Google Scholar
Messer, R., Birli, H. & Differt, K. NMR study of diffusion in Li₃N. J. Phys. C. Solid State Phys. 14, 2731–2746 (1981).
Article CAS Google Scholar
Wahl, J. & Holland, U. Local ionic motion in the superionic conductor Li₃N. Solid State Commun. 27, 237–241 (1978).
Article CAS Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article CAS Google Scholar
Szlachta, W. J., Bartók, A. P. & Csányi, G. Accuracy and transferability of Gaussian approximation potential models for tungsten. Phys. Rev. B - Condens. Matter Mater. Phys. 90, 104108 (2014).
Article Google Scholar
Ziegler, J., Biersack, J. & Littmark, U. The Stopping and Range of Ions in Matter 1 (Pergamon Press, New York, 1985).
Book Google Scholar
de Jong, M. et al. Charting the complete elastic properties of inorganic crystalline compounds. Sci. Data 2, 150009 (2015).
Article Google Scholar
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
Article CAS Google Scholar
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
Article Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Article CAS Google Scholar
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the Office of Naval Research (ONR) Young Investigator Program (YIP) under Award No. N00014-16-1-2621. The authors acknowledge computational resources provided by Triton Shared Computing Cluster (TSCC) at the University of California, San Diego, the National Energy Research Scientific Computing Center (NERSC), and the Extreme Science and Engineering Discovery Environment (XSEDE) supported by the National Science Foundation under Grant No. ACI-1053575. The authors also thank Dr. Anton Van der Ven for the discussion on diffusivity calculations.

Author information

Authors and Affiliations

Department of NanoEngineering, University of California San Diego, 9500 Gilman Dr, Mail Code 0448, La Jolla, CA, 92093-0448, USA
Zhi Deng, Chi Chen, Xiang-Guo Li & Shyue Ping Ong

Authors

Zhi Deng
View author publications
You can also search for this author in PubMed Google Scholar
Chi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiang-Guo Li
View author publications
You can also search for this author in PubMed Google Scholar
Shyue Ping Ong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.D. performed potential model training, performance evaluation and diffusion studies. C.C. and X.-G.L. helped with the design of training procedure and analyses in diffusion studies. S.P.O. is the primary investigator and supervised the entire project. All authors contributed to the writing and editing of the paper.

Corresponding author

Correspondence to Shyue Ping Ong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Deng, Z., Chen, C., Li, XG. et al. An electrostatic spectral neighbor analysis potential for lithium nitride. npj Comput Mater 5, 75 (2019). https://doi.org/10.1038/s41524-019-0212-1

Download citation

Received: 25 January 2019
Accepted: 27 June 2019
Published: 16 July 2019
DOI: https://doi.org/10.1038/s41524-019-0212-1

This article is cited by

Machine learning molecular dynamics simulation identifying weakly negative effect of polyanion rotation on Li-ion migration
- Zhenming Xu
- Huiyu Duan
- Yongyao Xia
npj Computational Materials (2023)
Atomic-scale origin of the low grain-boundary resistance in perovskite solid electrolyte Li0.375Sr0.4375Ta0.75Zr0.25O3
- Tom Lee
- Ji Qi
- Xiaoqing Pan
Nature Communications (2023)
A review of the recent progress in battery informatics
- Chen Ling
npj Computational Materials (2022)
Specialising neural network potentials for accurate properties and application to the mechanical response of titanium
- Tongqi Wen
- Rui Wang
- Zhaoxuan Wu
npj Computational Materials (2021)
Data-driven magneto-elastic predictions with scalable classical spin-lattice dynamics
- Svetoslav Nikolov
- Mitchell A. Wood
- Julien Tranchida
npj Computational Materials (2021)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Optimized model parameters

Energy and force prediction

Structural properties

Bulk diffusion

Grain boundary diffusion

Discussion

Methods

Electrostatic SNAP (eSNAP) model

Training data generation

DFT calculations

Model training and test

Diffusivity calculations

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links