Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials

Xie, Tian; France-Lanord, Arthur; Wang, Yanming; Shao-Horn, Yang; Grossman, Jeffrey C.

doi:10.1038/s41467-019-10663-6

Download PDF

Article
Open access
Published: 17 June 2019

Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials

Nature Communications volume 10, Article number: 2667 (2019) Cite this article

21k Accesses
95 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Understanding the dynamical processes that govern the performance of functional materials is essential for the design of next generation materials to tackle global energy and environmental challenges. Many of these processes involve the dynamics of individual atoms or small molecules in condensed phases, e.g. lithium ions in electrolytes, water molecules in membranes, molten atoms at interfaces, etc., which are difficult to understand due to the complexity of local environments. In this work, we develop graph dynamical networks, an unsupervised learning approach for understanding atomic scale dynamics in arbitrary phases and environments from molecular dynamics simulations. We show that important dynamical information, which would be difficult to obtain otherwise, can be learned for various multi-component amorphous material systems. With the large amounts of molecular dynamics data generated every day in nearly every aspect of materials design, this approach provides a broadly applicable, automated tool to understand atomic scale dynamics in material systems.

Learning properties of ordered and disordered materials from multi-fidelity data

Article 14 January 2021

Quantifying disorder one atom at a time using an interpretable graph neural network paradigm

Article Open access 07 July 2023

Linking atomic structural defects to mesoscale properties in crystalline solids using graph neural networks

Article Open access 17 September 2022

Introduction

Understanding the atomic scale dynamics in condensed phases is essential for the design of functional materials to tackle global energy and environmental challenges^1,2,3. The performance of many materials depends on the dynamics of individual atoms or small molecules in complex local environments. Despite the rapid advances in experimental techniques^4,5,6, molecular dynamics (MD) simulations remain one of the few tools for probing these dynamical processes with both atomic scale time and spatial resolutions. However, due to the large amounts of data generated in each MD simulation, it is often challenging to extract statistically relevant dynamics for each atom especially in multi-component, amorphous material systems. At present, atomic scale dynamics are usually learned by designing system-specific descriptions of coordination environments or computing the average behavior of atoms^7,8,9,10. A general approach for understanding the dynamics in different types of condensed phases, including solid, liquid, and amorphous, is still lacking.

The advances in applying deep learning to scientific research open new opportunities for utilizing the full trajectory data from MD simulations in an automated fashion. Ideally, one would trace every atom or small molecule of interest in the MD trajectories, and summarize their dynamics into a linear, low dimensional model that describes how their local environments evolve over time. Recent studies show that combining Koopman analysis and deep neural networks provides a powerful tool to understand complex biological processes and fluid dynamics from data^11,12,13. In particular, VAMPnets¹³ develop a variational approach for Markov processes to learn an optimal latent space representation that encodes the long-time dynamics, which enables the end-to-end learning of a linear dynamical model directly from MD data. However, in order to learn the atomic dynamics in complex, multi-component material systems, sharing knowledge learned for similar local chemical environments is essential to reduce the amount of data needed. The recent development of graph convolutional neural networks (GCN) has led to a series of new representations of molecules^14,15,16,17 and materials^18,19 that are invariant to permutation and rotation operations. These representations provide a general approach to encode the chemical structures in neural networks which shares parameters between different local environments, and they have been used for predicting properties of molecules and materials^{14,15,16,17,18,19}, generating force fields^19,20, and visualizing structural similarities^21,22.

In this work, we develop a deep learning architecture, Graph Dynamical Networks (GDyNets), that combines Koopman analysis and graph convolutional neural networks to learn the dynamics of individual atoms in material systems. The graph convolutional neural networks allow for the sharing of knowledge learned for similar local environments across the system, and the variational loss developed in VAMPnets^13,23 is employed to learn a linear model for atomic dynamics. Thus, our method focuses on the modeling of local atomic dynamics instead of global dynamics. This significantly improves the sampling of the atomic dynamical processes, because a typical material system includes a large number of atoms or small molecules moving in structurally similar but distinct local environments. We demonstrate this distinction using a toy system that shows global dynamics can be exponentially more complex than local dynamics. Then, we apply this method to two realistic material systems—silicon dynamics at solid–liquid interfaces and lithium ion transport in amorphous polymer electrolytes—to demonstrate the new dynamical information one can extract for such complex materials and environments. Given the enormous amount of MD data generated in nearly every aspect of materials research, we believe the broad applicability of this method could help uncover important new physical insights from atomic scale dynamics that may have otherwise been overlooked.

Results

Koopman analysis of atomic scale dynamics

In materials design, the dynamics of target atoms, like the lithium ion in electrolytes and the water molecule in membranes, provide key information to material performance. We describe the dynamics of the target atoms and their surrounding atoms as a discrete process in MD simulations,

$${\boldsymbol{x}}_{t + \tau } = {\boldsymbol{F}}({\boldsymbol{x}}_t),$$

(1)

where x_t and x_t+τ denote the local configuration of the target atoms and their surrounding atoms at time steps t and t + τ, respectively. Note that Eq. (1) implies that the dynamics of x is Markovian, i.e. x_t+τ only depends on x_t not the configurations before it. This is exact when x includes all atoms in the system, but an approximation if only neighbor atoms are included. We also assume that each set of target atoms follow the same dynamics F. These are valid assumptions since (1) most interactions in materials are short-range, (2) most materials are either periodic or have similar local structures, and we could test them by validating the dynamical models using new MD data, which we will discuss later.

The Koopman theory²⁴ states that there exists a function χ(x) that maps the local configuration of target atoms x into a lower dimensional feature space, such that the non-linear dynamics F can be approximated by a linear transition matrix K,

$${\boldsymbol{\chi }}({\boldsymbol{x}}_{t + \tau }) \approx {\boldsymbol{K}}^T{\boldsymbol{\chi }}({\boldsymbol{x}}_t).$$

(2)

The approximation becomes exact when the feature space has infinite dimensions. However, for most dynamics in material systems, it is possible to approximate it with a low dimensional feature space if τ is sufficiently large due to the existence of characteristic slow processes. The goal is to identify such slow processes by finding the feature map function χ(x).

Learning feature map function with graph dynamical networks

In this work, we use GCN to learn the feature map function χ(x). GCN provides a general framework to encode the structure of materials that is invariant to permutation, rotation, and reflection^18,19. As shown in Fig. 1, for each time step in the MD trajectory, a graph ${\cal{G}}$ is constructed based on its current configuration with each node v_i representing an atom and each edge u_i,j representing a bond connecting nearby atoms. We connect M nearest neighbors considering periodic boundary conditions while constructing the graph, and a gated architecture¹⁸ is used in GCN to reweigh the strength of each connection (see Supplementary Note 1 for details). Note that the graphs are constructed separately for each step, so the topology of each graph may be different. Also, the 3-dimensional information is preserved in the graphs since the bond length is encoded in u_i,j. Then, each graph is input to the same GCN to learn an embedding for each atom through graph convolution (or neural message passing¹⁶) that incorporates the information of its surrounding environments.

$${\boldsymbol{v}}_i^\prime = {\mathrm{Conv}}({\boldsymbol{v}}_i,{\boldsymbol{v}}_j,{\boldsymbol{u}}_{(i,j)}),\quad (i,j) \in {\cal{G}}.$$

(3)

After K convolution operations, information from the Kth neighbors will be propagated to each atom, resulting in an embedding ${\boldsymbol{v}}_i^{(K)}$ that encodes its local environment.

To learn a feature map function for the target atoms whose dynamics we want to model, we focus on the embeddings learned for these atoms. Assume that there are n sets of target atoms each made up with k atoms in the material system. For instance, in a system of 10 water molecules, n = 10 and k = 3. We use the label v_[l,m] to denote the mth atom in the lth set of target atoms. With a pooling function¹⁸, we can get an overall embedding v_[l] for each set of target atoms to represent its local configuration,

$${\boldsymbol{v}}_{[l]} = {\mathrm{Pool}}({\boldsymbol{v}}_{[l,0]},{\boldsymbol{v}}_{[l,1]}, \ldots ,{\boldsymbol{v}}_{[l,k]}).$$

(4)

Finally, we build a shared two-layer fully connected neural network with an output layer using a Softmax activation function to map the embeddings v_[l] to a feature space $\widetilde {\boldsymbol{v}}_{[l]}$ with a pre-determined dimension. This is the feature space described in Eq. (2), and we can select an appropriate dimension to capture the important dynamics in the material system. The Softmax function used here allows us to interpret the feature space as a probability over several states¹³. Below, we will use the term “number of states” and “dimension of feature space” interchangeably.

To minimize the errors of the approximation in Eq. (2), we compute the loss of the system using a VAMP-2 score^13,24 that measures the consistency between the feature vectors learned at timesteps t and t + τ,

$${\mathrm{Loss}} = - {\mathrm{VAMP}}(\widetilde {\boldsymbol{v}}_{[l],t},\widetilde {\boldsymbol{v}}_{[l],t + \tau }),\quad t \in [0,T - \tau ],l \in [0,n].$$

(5)

This means that a single VAMP-2 score is computed over the whole trajectory and all sets of target atoms. The entire network is trained by minimizing the VAMP loss, i.e. maximizing the VAMP-2 score, with the trajectories from the MD simulations.

Hyperparameter optimization and model validation

There are several hyperparameters in the GDyNets that need to be optimized, including the architecture of GCN, the dimension of the feature space, and lag time τ. We divide the MD trajectory into training, validation, and testing sets. The models are trained with trajectories from the training set, and a VAMP-2 score is computed with trajectories from the validation set. The GCN architecture is optimized according to the VAMP-2 score similar to ref. ¹⁸.

The accuracy of Eq. (2) can be evaluated with a Chapman-Kolmogorov (CK) equation,

$${\boldsymbol{K}}(n\tau ) = {\boldsymbol{K}}^n(\tau ),\quad n = 1,2, \ldots .$$

(6)

This equation holds if the dynamic model learned is Markovian, and it can predict the long-time dynamics of the system. In general, increasing the dimension of feature space makes the dynamic model more accurate, but it may result in overfitting when the dimension is very large. Since a higher feature space dimension and a larger τ make the model harder to understand and contain less dynamical details, we select the smallest feature space dimension and τ that fulfills the CK equation within statistical uncertainty. Therefore, the resulting model is interpretable and contains more dynamical details. Further details regarding the effects of feature space dimension and τ can be found in refs. ^13,24.

Local and global dynamics in the toy system

To demonstrate the advantage of learning local dynamics in material systems, we compare the dynamics learned by the GDyNet with VAMP loss and a standard VAMPnet with fully connected neural networks that learns global dynamics for a simple model system using the same input data. As shown in Fig. 2a, we generated a 200 ns MD trajectory of a lithium atom moving in a face-centered cubic (FCC) lattice of sulfur atoms at a constant temperature, which describes an important lithium ion transport mechanism in solid-state electrolytes⁷. There are two different sites for the lithium atom to occupy in a FCC lattice, tetrahedral sites and octahedral sites, and the hopping between the two sites should be the only dynamics in this system. As shown in Fig. 2b–d, after training and validation with the first 100 ns trajectory, the GDyNet correctly identified the transition between the two sites with a relaxation timescale of 42.3 ps while testing on the second 100 ns trajectory, and it performs well in the CK test. In contrast, the standard VAMPnet, which inputs the same data as the GDyNet, learns a global transition with a much longer relaxation timescale at 236 ps, and it performs much worse in the CK test. This is because the model views the four octahedral sites as different sites due to their different spatial locations. As a result, the transitions between these identical sites are learned as the slowest global dynamics.

It is theoretically possible to identify the faster local dynamics from a global dynamical model when we increase the dimension of feature space (Supplementary Fig. 1). However, when the size of the system increases, the number of slower global transitions will increase exponentially, making it practically impossible to discover important atomic scale dynamics within a reasonable simulation time. In addition, it is possible in this simple system to design a symmetrically invariant coordinate to include the equivalence of the octahedral and tetrahedral sites. But in a more complicated multi-component or amorphous material system, it is difficult to design such coordinates that take into account the complex atomic local environments. Finally, it is also possible to reconstruct global dynamics from the local dynamics. Since we know how the four octahedral and eight tetrahedral sites are connected in a FCC lattice, we can construct the 12 dimensional global transition matrix from the 2 dimensional local transition matrix (see Supplementary Note 2 for details). We obtain the slowest global relaxation timescale to be 531 ps, which is close to the observed slowest timescale of 528 ps from the global dynamical model in Supplementary Fig. 1. Note that the timescale from the two-state global model in Fig. 2 is less accurate since it fails to learn the correct transition. In sum, the built-in invariances in GCN provide a general approach to reduce the complexity of learning atomic dynamics in material systems.

Silicon dynamics at a solid–liquid interface

To evaluate the performance of the GDyNets with VAMP loss for a more complicated system, we study the dynamics of silicon atoms at a binary solid–liquid interface. Understanding the dynamics at interfaces is notoriously difficult due to the complex local structures formed during phase transitions^25,26. As shown in Fig. 3a, an equilibrium system made of two crystalline Si {110} surfaces and a liquid Si–Au solution is constructed at the eutectic point (629 K, 23.4% Si²⁷) and simulated for 25 ns using MD. We train and validate a four-state model using the first 12.5 ns trajectory, and use it to identify the dynamics of Si atoms in the last 12.5 ns trajectory. Note that we only use the Si atoms in the liquid phase and the first two layers of the solid {110} surfaces as the target atoms (Fig. 3b). This is because the Koopman models are optimized for finding the slowest transition in the system, and including additional solid Si atoms will result in a model that learns the slower Si hopping in the solid phase which is not our focus.

In Fig. 3b, c, the model identified four states that are crucial for the Si dynamics at the solid–liquid interface – liquid Si at the interface (state 0), solid Si (state 1), solid Si at the interface (state 2), and liquid Si (state 3). These states provide a more detailed description of the solid–liquid interface structure than conventional methods. In Supplementary Fig. 2, we compare our results with the distribution of the q₃ order parameter of the Si atoms in the system, which measures how much a site deviates from a diamond-like structure and is often used for studying Si interfaces²⁸. We learn from the comparison that (1) our method successfully identifies the bulk liquid and solid states, and learns additional interface states that cannot be obtained from q₃; (2) the states learned by our method are more robust due to access to dynamical information, while q₃ can be affected by the accidental ordered structures in the liquid phase; (3) q₃ is system specific and only works for diamond-like structures, but the GDyNets can potentially be applied to any material given the MD data.

In addition, important dynamical processes at the solid–liquid interface can be learned with the model. Remarkably, the model identified the relaxation process of the solid–liquid transition with a timescale of 538 ns (Fig. 3d, e), which is one order of magnitude longer than the simulation time of 12.5 ns. This is because the large number of Si atoms in the material system provide an ensemble of independent trajectories that enable the identification of rare events^29,30,31. The other two relaxation processes correspond to the transitions of solid Si atoms into/out of the interface (73.2 ns) and liquid Si atoms into/out of the interface (2.26 ns), respectively. These processes are difficult to obtain with conventional methods due to the complex structures at solid–liquid interfaces, and the results are consistent with our understanding that the former solid relaxation is significantly slower than the latter liquid relaxation. Finally, the model performs excellently in the CK test on predicting the long-time dynamics.

Lithium ion dynamics in polymer electrolytes

Finally, we apply GDyNets with VAMP loss to study the dynamics of lithium ions (Li-ions) in solid polymer electrolytes (SPEs), an amorphous material system composed of multiple chemical species. SPEs are candidates for next-generation battery technology due to their safety, stability, and low manufacturing cost, but they suffer from low Li-ion conductivity compared with liquid electrolytes^32,33. Understanding the key dynamics that affect the transport of Li-ions is important to the improvement of Li-ion conductivity in SPEs.

We focus on the state-of-the-art³³ SPE system—a mixture of poly(ethylene oxide) (PEO) and lithium bis-trifluoromethyl sulfonimide (LiTFSI) with Li/EO = 0.05 and a degree of polymerization of 50, as shown in Fig. 4a. Five independent 80 ns trajectories are generated to model the Li-ion transport at 363 K, following the same approach as described in ref. ⁶⁷. We train a four-state GDyNet with one of the trajectories, and use the model to identify the dynamics of Li-ions in the remaining four trajectories. The model identified four different solvation environments, i.e. states, for the Li-ions in the SPE. In Fig. 4b, the state 0 Li-ion has a population of 50.6 ± 0.8%, and it is coordinated by a PEO chain on one side and a TFSI anion on the other side. The state 1 has a similar structure as state 0 with a population of 27.3 ± 0.4%, but the Li-ion is coordinated by a hydroxyl group on the PEO side rather than an oxygen. In state 2, the Li-ion is completely coordinated by TFSI anion ions, which has a population of 15.1 ± 0.4%. And the state 3 Li-ion is coordinated by PEO chains with a population of 7.0 ± 0.9%. Note that the structures in Fig. 4b only show a representative configuration for each state. We compute the element-wise radial distribution function (RDF) for each state in Supplementary Fig. 3 to demonstrate the average configurations, which is consistent with the above description. We also analyze the total charge carried by the Li-ions in each state considering their solvation environments in Fig. 4c (see Supplementary Note 3 and Supplementary Table 1 for details). Interestingly, both state 0 and state 1 carry almost zero total charge in their first solvation shell due to the one TFSI anion in their solvation environments.

We further study the transition between the four Li-ion states. Three relaxation processes are identified in the dynamical model as shown in Fig. 4d, e. By analyzing the eigenvectors, we learn that the slowest relaxation is a process involving the transport of a Li-ion into and out of a PEO coordinated environment. The second slowest relaxation happens mainly between state 0 and state 1, corresponding to a movement of the hydroxyl end group. The transitions from state 0 to states 2 and 3 constitute the last relaxation process, as state 0 can be thought of an intermediate state between state 2 and state 3. The model performs well in CK tests (Fig. 4f). Relaxation processes in the PEO/LiTFSI systems have been extensively studied experimentally^34,35, but it is difficult to pinpoint the exact atomic scale dynamics related to these relaxations. The dynamical model learned by GDyNet provides additional insights into the understanding of Li-ion transport in polymer electrolytes.

Implications to lithium ion conduction

The state configurations and dynamical model allow us to further quantify the transitions that are responsible for the Li-ion conduction. In Fig. 5, we compute the contribution from each state transition to the Li-ion conduction using the Koopman model at τ = 0.8 ns. First, we learn that the majority of conduction results from transitions within the same states (i → i). This is because the transport of Li-ions in PEO is strongly coupled with segmental motion of the polymer chains^8,36, in contrast to the hopping mechanism in inorganic solid electrolytes³⁷. In addition, due to the low charge carried by state 0 and state 1, the majority of charge conduction results from the diffusion of states 2 and 3, despite their relatively low populations. Interestingly, the diffusion of state 2, a negatively charged species, accounts for ~40% of the Li-ion conduction. This provides an atomic scale explanation to the recently observed negative transference number at high salt concentration PEO/LiTFSI systems³⁸.

Discussion

We have developed a general approach, GDyNets, to understand the atomic scale dynamics in material systems. Despite being widely used in biophysics³¹, fluid dynamics³⁹, and kinetic modeling of chemical reactions^40,41,42, Koopman models, (or Markov state models³¹, master equation methods^43,44) have not been used in learning atomic scale dynamics in materials from MD simulations except for a few examples in understanding solvent dynamics^45,46,47. Our approach also differs from several other unsupervised learning methods^48,49,50 by directly learning a linear Koopman model from MD data. Many crucial processes that affect the performance of materials involve the local dynamics of atoms or small molecules, like the dynamics of lithium ions in battery electrolytes^51,52, the transport of water and salt ions in water desalination membranes^53,54, the adsorption of gas molecules in metal organic frameworks^55,56, among many other examples. With the improvement of computational power and continued increase in the use of molecular dynamics to study materials, this work could have broad applicability as a general framework for understanding the atomic scale dynamics from MD trajectory data.

Compared with the Koopman models previously used in biophysics and fluid dynamics, the introduction of graph convolutional neural networks enables parameter sharing between the atoms and an encoding of local environments that is invariant to permutation, rotation, and reflection. This symmetry facilitates the identification of similar local environments throughout the materials, which allows the learning of local dynamics instead of exponentially more complicated global dynamics. In addition, it is easy to extend this method to learn global dynamics with a global pooling function¹⁸. However, a hierarchical pooling function is potentially needed to directly learn the global dynamics of large biological systems including thousands of atoms. It is also possible to represent the local environments using other symmetry functions like smooth overlap of atomic positions (SOAP)⁵⁷, social permutation invariant (SPRINT) coordinates⁵⁸, etc. By adding a few layers of neural networks, a similar architecture can be designed to learn the local dynamics of atoms. However, these built-in invariances may also cause the Koopman model to ignore dynamics between symmetrically equivalent structures which might be important to the material performance. One simple example is the flip of an ammonia molecule—the two states are mirror symmetric to each other so the GCN will not be able to differentiate them by design. This can potentially be resolved by partially breaking the symmetry of GCN based on the symmetry of the material systems.

The graph dynamical networks can be further improved by incorporating ideas from both the fields of Koopman models and graph neural networks. For instance, the auto-encoder architecture^12,59,60 and deep generative models⁶¹ start to enable the direct generation of future structures in the configuration space. Our method currently lacks a generative component, but this can potentially be achieved with a proper graph decoder^62,63. Furthermore, transfer learning on graph embeddings may reduce the number of MD trajectories needed for learning the dynamics^64,65.

In summary, graph dynamical networks present a general approach for understanding the atomic scale dynamics in materials. With a toy system of lithium ion transporting in a face-centered cubic lattice, we demonstrate that learning local dynamics of atoms can be exponentially easier than global dynamics in material systems with representative local structures. The dynamics learned from two more complicated systems, solid–liquid interfaces and solid polymer electrolytes, indicate the potential of applying the method to a wide range of material systems and understanding atomic dynamics that are crucial to their performances.

Methods

Construction of the graphs from trajectory

A separate graph is constructed using the configuration in each time step. Each atom in the simulation box is represented by a node i whose embedding v_i is initialized randomly according to the element type. The edges are determined by connecting M nearest neighbors whose embedding u_(i,j) is calculated by,

$${\boldsymbol{u}}_{(i,j)}[t] = \exp ( - (d_{(i,j)} - \mu _t)^2/\sigma ^2),$$

(7)

where μ_t = t · 0.2 Å for t = 0, 1, …, K, σ = 0.2 Å, and d_(i,j) denotes the distance between i and j considering the periodic boundary conditions. The number of nearest neighbors M is 12, 20, and 20 for the toy system, Si–Au binary system, and PEO/LiTFSI system, respectively.

Graph convolutional neural network architecture details

The convolution function we employed in this work is similar to those in refs. ^18,22 but features an attention layer⁶⁶. For each node i, we first concatenate neighbor vectors from the last iteration ${\boldsymbol{z}}_{(i,j)}^{(t - 1)} = {\boldsymbol{v}}_i^{(t - 1)} \oplus {\boldsymbol{v}}_j^{(t - 1)} \oplus {\boldsymbol{u}}_{(i,j)}$, then we compute the attention coefficient of each neighbor,

$$\alpha _{ij} = \frac{{\exp ({\boldsymbol{z}}_{(i,j)}^{(t - 1)}{\boldsymbol{W}}_{\mathrm{a}}^{(t - 1)} + b_{\mathrm{a}}^{(t - 1)})}}{{\mathop {\sum}\limits_j {\exp } ({\boldsymbol{z}}_{(i,j)}^{(t - 1)}{\boldsymbol{W}}_{\mathrm{a}}^{(t - 1)} + b_{\mathrm{a}}^{(t - 1)})}},$$

(8)

where ${\boldsymbol{W}}_{\mathrm{a}}^{(t - 1)}$ and $b_{\mathrm{a}}^{(t - 1)}$ denotes the weights and biases of the attention layers and the output α_ij is a scalar number between 0 and 1. Finally, we compute the embedding of node i by,

$${\boldsymbol{v}}_i^{(t)} = {\boldsymbol{v}}_i^{(t - 1)} + \mathop {\sum}\limits_j {\alpha _{ij}} \cdot g({\boldsymbol{z}}_{(i,j)}^{(t - 1)}{\boldsymbol{W}}_{\mathrm{n}}^{(t - 1)} + {\boldsymbol{b}}_{\mathrm{n}}^{(t - 1)}),$$

(9)

where g denotes a non-linear ReLU activation function, and ${\boldsymbol{W}}_{\mathrm{n}}^{(t - 1)}$ and ${\boldsymbol{b}}_{\mathrm{n}}^{(t - 1)}$ denotes weights and biases in the network.

The pooling function computes the average of the embeddings of each atom for the set of target atoms,

$${\boldsymbol{v}}_{[l]} = \frac{1}{k}\mathop {\sum}\limits_m {{\boldsymbol{v}}_{[l,m]}} .$$

(10)

Determination of the relaxation timescales

The relaxation timescales represent the characteristic timescales implied by the transition matrix K(τ), where τ denotes the lag time of the transition matrix. By conducting an eigenvalue decomposition for K(τ), we could compute the relaxation timescales as a function of lag time by,

$$t_i(\tau ) = - \frac{\tau }{{\ln |\lambda _i(\tau )|}},$$

(11)

where λ_i(τ) denotes the ith eigenvalue of the transition matrix K. Note that the largest eigenvalue is alway 1, corresponding to infinite relaxation timescale and the equilibrium distribution. The finite t_i(τ) are plotted in Figs. 2b, 3d, and 4d for each material system as a function of τ by performing this computation using the corresponding K(τ). If the dynamics of the system is Markovian, i.e. Eq. (6) holds, one can prove that the relaxation timescales t_i(τ) will be constant for any τ^13,24. Therefore, we select a smallest τ* from Figs. 2b, 3d, and 4d to obtain a dynamical model that is Markovian and contains most dynamical details. We then compute the relaxation timescales using this τ* for each material system, and these timescales remain constant for any τ > τ*.

State-weighted radial distribution function

The RDF describes how particle density varies as a function of distance from a reference particle. The RDF is usually determined by counting the neighbor atoms at different distances over MD trajectories. We calculate the RDF of each state by weighting the counting process according to the probability of the reference particle being in state i,

$$g_i(r_{\mathrm{A}}) = \frac{1}{{\rho _i}}\frac{{{\mathrm{d}}[n(r_{\mathrm{A}})\cdot p_i]}}{{4\pi r_{\mathrm{A}}^2{\mathrm{d}}r_{\mathrm{A}}}},$$

(12)

where r_A denotes the distance between atom A and the reference particle, p_i denotes the probability of the reference particle being in state i, and ρ_i denotes the average density of state i.

Analysis of Li-ion conduction

We first compute the expected mean-squared-displacement of each transition at different t using the Bayesian rule,

$${\Bbb E}[d^2(t)|i \to j] = \frac{{\mathop {\sum}\limits_{t{\prime}} {d^2} (t\prime ,t\prime + t)p_i(t\prime )p_j(t\prime + t)}}{{\mathop {\sum}\limits_{t{\prime}} {p_i} (t\prime )p_j(t\prime + t)}},$$

(13)

where p_i (t) is the probability of state i at time t, and d²(t′, t′ + t) is the mean-squared-displacement between t′ and t′ + t. Then, the diffusion coefficient of each transition D_i→j(τ) at the lag time τ can be calculated by,

$$D_{ij}(\tau ) = \frac{1}{6}\left. {\frac{{{\mathrm{d}}{\Bbb E}[d^2(t)|i \to j]}}{{{\mathrm{d}}t}}} \right|_{t = \tau },$$

(14)

which is shown in Supplementary Table 2.

Finally, we compute the contribution of each transition to Li-ion conduction with Koopman matrix K(τ) using the cluster Nernst-Einstein equation⁶⁷,

$$\sigma _{ij} = \frac{{e^2N_{{\mathrm{Li}}}}}{{Vk_{\mathrm{B}}T}}\pi _iz_{ij}K_{ij}(\tau )D_{ij}(\tau ),$$

(15)

where e is the elementary charge, k_B is the Boltzmann constant, V, T are the volume and temperature of the system, N_Li is the number of Li-ions, π_i is the stationary distribution population of state i, and z_ij is the averaged charge of state i and state j. The percentage contribution is computed by,

$$\frac{{\sigma _{ij}}}{{\mathop {\sum}\limits_{i,j} {\sigma _{ij}} }}.$$

(16)

Lithium diffusion in the FCC lattice toy system

The molecular dynamics simulations are performed using the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS)⁶⁸, as implemented in the MedeA®⁶⁹ simulation environment. A purely repulsive interatomic potential in the form of a Born–Mayer term was used to describe the interactions between Li particles and the S sublattice, while all other interactions (Li–Li and S–S) are ignored. The cubic unit cell includes one Li atom and four S atoms, with a lattice parameter of 6.5 Å, a large value allowing for a low energy barrier. 200 ns MD simulations are run in the canonical ensemble (nVT) at a temperature of 64 K, using a timestep of 1 fs, with the S particles frozen. The atomic positions, which constituted the only data provided to the GDyNet and VAMPnet models, are sampled every 0.1 ps. In addition, the energy following the Tet-Oct-Tet migration path was obtained from static simulations by inserting Li particles on a grid.

Silicon dynamics at solid–liquid interface

The molecular dynamics simulation for the Si–Au binary system was carried out in LAMMPS⁶⁸, using the modified embedded-atom method interatomic potential^27,28. A sandwich like initial configuration was created, where Si–Au liquid alloy was placed in the middle, contacting with two {110} orientated crystalline Si thin films. 25 ns MD simulations are run in the canonical ensemble (nVT) at the eutectic point (629 K, 23.4% Si²⁷), using a time step of 1 fs. The atomic positions, which constituted the only data provided to the GDyNet model, are sampled every 20 ps.

Scaling of the algorithm

The scaling of the GDyNet algorithm is ${\cal{O}}(NMK)$, where N is the number of atoms in the simulation box, M is the number of neighbors used in graph construction, and K is the depth of the neural network.

Data availability

The MD simulation trajectories of the toy system, the Si–Au binary system, and the PEO/LiTFSI system are available at https://doi.org/10.24435/materialscloud:2019.0017/v1.

Code availability

GDyNets is implemented using TensorFlow⁷⁰ and the code for the VAMP loss function is modified on top of ref. ¹³. The code is available from https://github.com/txie-93/gdynet.

References

Etacheri, V., Marom, R., Elazari, R., Salitra, G. & Aurbach, D. Challenges in the development of advanced li-ion batteries: a review. Energy Environ. Sci. 4, 3243–3262 (2011).
Article CAS Google Scholar
Imbrogno, J. & Belfort, G. Membrane desalination: where are we, and what can we learn from fundamentals? Annu. Rev. Chem. Biomol. Eng. 7, 29–64 (2016).
Article CAS PubMed Google Scholar
Peighambardoust, S. J., Rowshanzamir, S. & Amjadi, M. Review of the proton exchange membranes for fuel cell applications. Int. J. Hydrog. energy 35, 9349–9384 (2010).
Article CAS Google Scholar
Zheng, A., Li, S., Liu, S.-B. & Deng, F. Acidic properties and structure–activity correlations of solid acid catalysts revealed by solid-state nmr spectroscopy. Acc. Chem. Res. 49, 655–663 (2016).
Article CAS PubMed Google Scholar
Yu, C. et al. Unravelling li-ion transport from picoseconds to seconds: bulk versus interfaces in an argyrodite li6ps5cl–li2s all-solid-state li-ion battery. J. Am. Chem. Soc. 138, 11192–11201 (2016).
Article CAS PubMed Google Scholar
Perakis, F. et al. Vibrational spectroscopy and dynamics of water. Chem. Rev. 116, 7590–7607 (2016).
Article CAS PubMed Google Scholar
Wang, Y. et al. Design principles for solid-state lithium superionic conductors. Nat. Mater. 14, 1026 (2015).
Article ADS CAS PubMed Google Scholar
Borodin, O. & Smith, G. D. Mechanism of ion transport in amorphous poly (ethylene oxide)/litfsi from molecular dynamics simulations. Macromolecules 39, 1620–1629 (2006).
Article ADS CAS Google Scholar
Miller, T. F. III, Wang, Z.-G., Coates, G. W. & Balsara, N. P. Designing polymer electrolytes for safe and high capacity rechargeable lithium batteries. Acc. Chem. Res. 50, 590–593 (2017).
Article CAS PubMed Google Scholar
Getman, R. B., Bae, Y.-S., Wilmer, C. E. & Snurr, R. Q. Review and analysis of molecular simulations of methane, hydrogen, and acetylene storage in metal–organic frameworks. Chem. Rev. 112, 703–723 (2011).
Article PubMed CAS Google Scholar
Li, Q., Dietrich, F., Bollt, E. M. & Kevrekidis, I. G. Extended dynamic mode decomposition with dictionary learning: a data-driven adaptive spectral decomposition of the koopman operator. Chaos: Interdiscip. J. Nonlinear Sci. 27, 103111 (2017).
Article MathSciNet MATH Google Scholar
Lusch, B., Kutz, J. N. & Brunton, S. L. Deep learning for universal linear embeddings of nonlinear dynamics. Nat. Commun. 9, 4950 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Mardt, A., Pasquali, L., Wu, H. & Noé, F. Vampnets for deep learning of molecular kinetics. Nat. Commun. 9, 5 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Duvenaud, D. K. et al. Convolutional networks on graphs for learning molecular fingerprints. In Advances in neural information processing systems 2224–2232 (2015).
Kearnes, S., McCloskey, K., Berndl, M., Pande, V. & Riley, P. Molecular graph convolutions: moving beyond fingerprints. J. Comput.-aided Mol. Des. 30, 595–608 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry, arXiv preprint arXiv:1704.01212 (2017).
Schütt, K. T., Arbabzadah, F., Chmiela, S., Müller, K. R. & Tkatchenko, A. Quantum-chemical insights from deep tensor neural networks. Nat. Commun. 8, 13890 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Xie, T. & Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 120, 145301 (2018).
Article ADS CAS PubMed Google Scholar
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. Schnet—a deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
Article ADS PubMed CAS Google Scholar
Zhang, L., Han, J., Wang, H., Car, R. & Weinan, E. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 120, 143001 (2018).
Article ADS CAS PubMed Google Scholar
Zhou, Q. et al. Learning atoms for materials discovery. Proc. Natl Acad. Sci. USA 115, E6411–E6417 (2018).
ADS CAS PubMed PubMed Central Google Scholar
Xie, T. & Grossman, J. C. Hierarchical visualization of materials space with graph convolutional neural networks. J. Chem. Phys. 149, 174111 (2018).
Article ADS PubMed CAS Google Scholar
Wu, H. & Noé, F. Variational approach for learning markov processes from time series data, arXiv preprint arXiv:1707.04659 (2017).
Koopman, B. O. Hamiltonian systems and transformation in hilbert space. Proc. Natl Acad. Sci. USA 17, 315–318 (1931).
Article ADS CAS PubMed MATH PubMed Central Google Scholar
Sastry, S. & Angell, C. A. Liquid–liquid phase transition in supercooled silicon. Nat. Mater. 2, 739 (2003).
Article ADS CAS PubMed Google Scholar
Angell, C. A. Insights into phases of liquid water from study of its unusual glass-forming properties. Science 319, 582–587 (2008).
Article CAS PubMed Google Scholar
Ryu, S. & Cai, W. A gold–silicon potential fitted to the binary phase diagram. J. Phys.: Condens. Matter 22, 055401 (2010).
ADS Google Scholar
Wang, Y., Santana, A. & Cai, W. Atomistic mechanisms of orientation and temperature dependence in gold-catalyzed silicon growth. J. Appl. Phys. 122, 085106 (2017).
Article ADS CAS Google Scholar
Pande, V. S., Beauchamp, K. & Bowman, G. R. Everything you wanted to know about markov state models but were afraid to ask. Methods 52, 99–105 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chodera, J. D. & Noé, F. Markov state models of biomolecular conformational dynamics. Curr. Opin. Struct. Biol. 25, 135–144 (2014).
Article CAS PubMed PubMed Central Google Scholar
Husic, B. E. & Pande, V. S. Markov state models: From an art to a science. J. Am. Chem. Soc. 140, 2386–2396 (2018).
Article CAS PubMed Google Scholar
Meyer, W. H. Polymer electrolytes for lithium-ion batteries. Adv. Mater. 10, 439–448 (1998).
Article ADS CAS PubMed Google Scholar
Hallinan, D. T. Jr. & Balsara, N. P. Polymer electrolytes. Annu. Rev. Mater. Res. 43, 503–525 (2013).
Article ADS CAS Google Scholar
Mao, G., Perea, R. F., Howells, W. S., Price, D. L. & Saboungi, M.-L. Relaxation in polymer electrolytes on the nanosecond timescale. Nature 405, 163 (2000).
Article ADS CAS PubMed Google Scholar
Do, C. et al. Li+ transport in poly (ethylene oxide) based electrolytes: neutron scattering, dielectric spectroscopy, and molecular dynamics simulations. Phys. Rev. Lett. 111, 018301 (2013).
Article ADS PubMed CAS Google Scholar
Diddens, D., Heuer, A. & Borodin, O. Understanding the lithium transport within a rouse-based model for a peo/litfsi polymer electrolyte. Macromolecules 43, 2028–2036 (2010).
Article ADS CAS Google Scholar
Bachman, J. C. et al. Inorganic solid-state electrolytes for lithium batteries: mechanisms and properties governing ion conduction. Chem. Rev. 116, 140–162 (2015).
Article PubMed CAS Google Scholar
Pesko, D. M. et al. Negative transference numbers in poly (ethylene oxide)-based electrolytes. J. Electrochem. Soc. 164, E3569–E3575 (2017).
Article CAS Google Scholar
Mezić, I. Analysis of fluid flows via spectral properties of the koopman operator. Annu. Rev. Fluid Mech. 45, 357–378 (2013).
Article ADS MathSciNet MATH Google Scholar
Georgiev, G. S., Georgieva, V. T. & Plieth, W. Markov chain model of electrochemical alloy deposition. Electrochim. acta 51, 870–876 (2005).
Article CAS Google Scholar
Valor, A., Caleyo, F., Alfonso, L., Velázquez, J. C. & Hallen, J. M. Markov chain models for the stochastic modeling of pitting corrosion. Math. Prob. Eng. 2013 (2013).
Miller, J. A. & Klippenstein, S. J. Master equation methods in gas phase chemical kinetics. J. Phys. Chem. A 110, 10528–10544 (2006).
Article CAS PubMed Google Scholar
Buchete, N.-V. & Hummer, G. Coarse master equations for peptide folding dynamics. J. Phys. Chem. B 112, 6057–6069 (2008).
Article CAS PubMed Google Scholar
Sriraman, S., Kevrekidis, I. G. & Hummer, G. Coarse master equation from bayesian analysis of replica molecular dynamics simulations. J. Phys. Chem. B 109, 6479–6484 (2005).
Article CAS PubMed Google Scholar
Gu, C. et al. Building markov state models with solvent dynamics. In BMC bioinformatics, Vol. 14, S8 (BioMed Central, 2013). https://doi.org/10.1186/1471-2105-14-S2-S8
Hamm, P. Markov state model of the two-state behaviour of water. J. Chem. Phys. 145, 134501 (2016).
Article ADS PubMed CAS Google Scholar
Schulz, R. et al. Collective hydrogen-bond rearrangement dynamics in liquid water. J. Chem. Phys. 149, 244504 (2018).
Article ADS CAS PubMed Google Scholar
Cubuk, E. D., Schoenholz, S. S., Kaxiras, E. & Liu, A. J. Structural properties of defects in glassy liquids. J. Phys. Chem. B 120, 6139–6146 (2016).
Article CAS PubMed Google Scholar
Nussinov, Z. et al. Inference of hidden structures in complex physical systems by multi-scale clustering. In Information Science for Materials Discovery and Design, 115–138 (Springer International Publishing, Springer, 2016). https://doi.org/10.1007/978-3-319-23871-5_6
Kahle, L., Musaelian, A., Marzari, N. & Kozinsky, B. Unsupervised landmark analysis for jump detection in molecular dynamics simulations, Phys. Rev. Materials 3, 055404 (2019).
Funke, K. Jump relaxation in solid electrolytes. Prog. Solid State Chem. 22, 111–195 (1993).
Article CAS Google Scholar
Xu, K. Nonaqueous liquid electrolytes for lithium-based rechargeable batteries. Chem. Rev. 104, 4303–4418 (2004).
Article CAS PubMed Google Scholar
Corry, B. Designing carbon nanotube membranes for efficient water desalination. J. Phys. Chem. B 112, 1427–1434 (2008).
Article CAS PubMed Google Scholar
Cohen-Tanugi, D. & Grossman, J. C. Water desalination across nanoporous graphene. Nano Lett. 12, 3602–3608 (2012).
Article ADS CAS PubMed Google Scholar
Rowsell, J. L. C., Spencer, E. C., Eckert, J., Howard, J. A. K. & Yaghi, O. M. Gas adsorption sites in a large-pore metal-organic framework. Science 309, 1350–1354 (2005).
Article ADS CAS PubMed Google Scholar
Li, J.-R., Kuppler, R. J. & Zhou, H.-C. Selective gas adsorption and separation in metal–organic frameworks. Chem. Soc. Rev. 38, 1477–1504 (2009).
Article CAS PubMed Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Article ADS CAS Google Scholar
Pietrucci, F. & Andreoni, W. Graph theory meets ab initio molecular dynamics: atomic structures and transformations at the nanoscale. Phys. Rev. Lett. 107, 085504 (2011).
Article ADS PubMed CAS Google Scholar
Wehmeyer, C. & Noé, F. Time-lagged autoencoders: deep learning of slow collective variables for molecular kinetics. J. Chem. Phys. 148, 241703 (2018).
Article ADS PubMed CAS Google Scholar
Ribeiro, J. M. L., Bravo, P., Wang, Y. & Tiwary, P. Reweighted autoencoded variational bayes for enhanced sampling (rave). J. Chem. Phys. 149, 072301 (2018).
Article ADS PubMed CAS Google Scholar
Wu, H., Mardt, A., Pasquali, L. & Noe, F. Deep generative markov state models. In Proceedings of the 32Nd International Conference on Neural Information Processing Systems, 3979–3988 (Curran Associates Inc., USA 2018). http://dl.acm.org/citation.cfm?id=3327144.3327312
Jin, W., Barzilay, R. & Jaakkola, T. Junction tree variational autoencoder for molecular graph generation, arXiv preprint arXiv:1802.04364 (2018).
Simonovsky, M. & Komodakis, N. Graphvae: Towards generation of small graphs using variational autoencoders, arXiv preprint arXiv:1802.03480 (2018).
M. M. Sultan & V. S. Pande. Transfer learning from markov models leads to efficient sampling of related systems. J. Phys. Chem. B (2017). https://doi.org/10.1021/acs.jpcb.7b06896
Altae-Tran, H., Ramsundar, B., Pappu, A. S. & Pande, V. Low data drug discovery with one-shot learning. ACS Cent. Sci. 3, 283–293 (2017).
Article CAS PubMed PubMed Central Google Scholar
Velickovic, P. et al. Graph attention networks, arXiv preprint arXiv:1710.10903 1 (2017).
France-Lanord, A. & Grossman, J. C. Correlations from ion-pairing and the nernst-einstein equation, Phys. Rev. Lett. 122, 136001 (2019).
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article ADS CAS MATH Google Scholar
MedeA-2.22. Materials Design, Inc, San Diego, (2018).
Abadi, M. et al. Tensorflow: A system for large-scale machine learning. In 12th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 16), 265–283 (Savannah, GA, USA 2016). http://dl.acm.org/citation.cfm?id=3026877.3026899

Download references

Acknowledgements

This work was supported by Toyota Research Institute. Computational support was provided by Google Cloud, the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231, and the Extreme Science and Engineering Discovery Environment, supported by National Science Foundation grant number ACI-1053575.

Author information

Authors and Affiliations

Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Tian Xie, Arthur France-Lanord, Yanming Wang & Jeffrey C. Grossman
Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Yang Shao-Horn

Authors

Tian Xie
View author publications
You can also search for this author in PubMed Google Scholar
Arthur France-Lanord
View author publications
You can also search for this author in PubMed Google Scholar
Yanming Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Shao-Horn
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey C. Grossman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.X. developed the software and performed the analysis. A.F.-L. and Y.W. performed the molecular dynamics simulations. T.X., A.F.-L., Y.W., Y.S.H., and J.C.G. contributed to the interpretation of the results. T.X. and J.C.G. conceived the idea and approach presented in this work. All authors contributed to the writing of the paper.

Corresponding author

Correspondence to Jeffrey C. Grossman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Communications thanks Stefan Chmiela and other anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xie, T., France-Lanord, A., Wang, Y. et al. Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials. Nat Commun 10, 2667 (2019). https://doi.org/10.1038/s41467-019-10663-6

Download citation

Received: 18 February 2019
Accepted: 17 May 2019
Published: 17 June 2019
DOI: https://doi.org/10.1038/s41467-019-10663-6

This article is cited by

In-layer inhomogeneity of molecular dynamics in quasi-liquid layers of ice
- Ikki Yasuda
- Katsuhiro Endo
- Kenji Yasuoka
Communications Chemistry (2024)
Advances of machine learning in materials science: Ideas and techniques
- Sue Sin Chong
- Yi Sheng Ng
- Jin-Cheng Zheng
Frontiers of Physics (2024)
Graph deep learning accelerated efficient crystal structure search and feature extraction
- Chuan-Nan Li
- Han-Pu Liang
- Su-Huai Wei
npj Computational Materials (2023)
Zero-shot learning of aerosol optical properties with graph neural networks
- K. D. Lamb
- P. Gentine
Scientific Reports (2023)
Sparse identification of Lagrangian for nonlinear dynamical systems via proximal gradient method
- Adam Purnomo
- Mitsuhiro Hayashibe
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.