Introduction

Understanding how a material’s structure affects its properties is one of the most fundamental principles in materials science. At the center of this paradigm is the fact that a material’s macroscopic behavior begins at the atomic scale, with local atomic arrangements ultimately coming together to form the structural features observed at larger length scales1,2. Characterizing the nature and propagation of these local environments is therefore vital to understanding macroscale structure-property relationships and their evolution3. Complicating this endeavor is the fact that long-range features often depend on structurally disordered atomic environments, which tend to dictate materials functionality4. For instance, transport, chemical reactivity, and phase nucleation are all profoundly affected by the presence of interfaces, interphases, and grain boundaries5,6,7,8,9,10. These processes, in turn, are intricately connected to performance-durability trade-offs in both functional11 and structural12 materials. Examples include temperature-dependent microstructure evolution13,14, hotspot formation15,16, and the nucleation and growth of new material phases17,18.

However, quantifying local atomic disorder in a physically motivated way is, in practice, extraordinarily difficult19,20. Although a number of methods have been proposed to characterize local atomic environments, these methods are often not optimized to magnify the subtle differences present in disordered environments. Existing methods can typically be grouped into three general classes, each of which carries distinct trade-offs: (1) semi-empirical structure factors such as Adaptive Common Neighbor Analysis (CNA)21, Steinhardt order parameters22, Ackland-Jones order parameters (AJ)23, atomic excess volume24, Centrosymmetry parameters (CSP)25, the Scalar Graph Order Parameter26, and the local atomic environment metric27; (2) parameterized symmetry functions such as the Smooth Overlap of Atomic Positions28, Behler-Parrinello functions29, Moment Tensor Representations30, Polyhedral Template Matching (PTM)31, Distortion Factors32, and the Adaptive Generalizable Neighborhood Informed functions33; and (3) unsupervised machine learning methods, which include graph-based34,35,36,37, order parameter-based38,39, and image-based40,41 representations.

In general, it is highly desirable to develop a methodology that is, by construction, specifically designed to distinguish, quantify, and physically interpret regions with varying degrees of atomic disorder. Such a capability would enable more accurate predictions of how disordered atomic environments translate to higher-level features and functionality. For instance, mapping between discretized atomistic models and continuous field representations, such as phase-field42,43,44 and finite-element45 models, forces the use of ill-defined and arbitrary approximations46, particularly when disorder is present. Moreover, continuous field representations propagate via local gradients47, the evaluation of which amplifies inaccuracies associated with disordered regions. Addressing these shortcomings is, therefore, a critical priority.

To this end, we introduce a physics-aware workflow composed of two stages, illustrated in Fig. 1. First, we use graph neural networks (GNN) to explicitly encode local atomic structural information. Next, we apply this encoding to map the local atomic structure to an order parameter that characterizes the local disorder. This order parameter, henceforth referred to as the Structural Orderness Degree for Atomic Systems (SODAS), λi, quantifies an atom’s local structure in terms of its “closeness” to the likely environments encountered between two limiting cases: a perfect crystal (λi = 1) and a melt (λi = 0). Our approach offers three distinct advantages: (1) the graph representation accurately encodes the topology of the connected network of atoms; (2) our paradigm is universally tunable to specific material systems that exhibit temperature-dependent structural transitions; and (3) atomic-level predictions remain physically interpretable because the problem is bounded between physically identifiable endpoints. The power of this workflow is demonstrated by application to several examples of disordered aluminum systems, including solid-solid and solid-liquid interfaces, polycrystalline microstructures, and fracture evolution, and is compared to other methods from the literature such as CNA, AJ, PTM, and CSP.

Fig. 1: General workflow for calculating SODAS values.
figure 1

Atomic structures are converted into graph representations, which explicitly encode all necessary geometric information. These atomic graphs are then fed into a graph neural network, which has been trained to distinguish between the unique local geometries in different material phases. The graph neural network then assigns each atomic environment a SODAS value, which indicates where in the phase space between the two phases that local structure is most likely to occur.

Results

Definition of SODAS

In principle, for systems that exhibit temperature-dependent structural transformations between phases A and B, the level of configurational disorder can be mapped onto an equivalent level of thermal disorder in a finite-temperature ensemble. To this end, we introduce a fictitious temperature (\(T^{\prime}\)) that mathematically represents this configurational disorder. In practice, \(T^{\prime}\) can be parameterized for a given system using explicit MD simulations, as discussed in the Methods section. To physically bound \(T^{\prime}\), we introduce Td as the limit of full disorder (nominally the melting temperature). The value of \(T^{\prime}\) is then confined to the range between 0 and Td. We next define γ as a global structural order parameter:

$$\gamma (T^{\prime} ;{T}_{d},\,s)={\mathcal{N}}\,\frac{1}{1+\exp \left(-{({T}_{d}/T^{\prime})}^{s}\right)}$$
(1)

where \({\mathcal{N}}\) normalizes γ between 0 (absolute disorder) and 1 (absolute order) and is defined as \({\mathcal{N}}={\sigma }_{\gamma }({\max }_{\gamma }-{\min }_{\gamma })+{\min }_{\gamma }\). The parameter s is an empirical scaling factor that determines where the decay of γ from ordered to disordered begins. The introduction of s makes the definition of γ universal for systems that exhibit temperature-dependent structural transformations, as its value can simply be tailored to any given material system. One can think of s as controlling the steepness of the drop-off between order and disorder for a specific material system. For this work, s was set to 1.5. It is important to note that γ describes the global level of disorder of an entire material system and is not defined at the atomic scale. A further discussion of the relationship between γ and s can be found in the supplemental information.
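
As a concrete illustration of Eq. (1), the short Python sketch below evaluates γ over a range of fictitious temperatures. The exact form of the normalization \({\mathcal{N}}\) is our interpretation of the text (a min-max rescaling of the logistic term over the sampled temperature range), and the Al-like value of Td used in the example is only illustrative.

```python
import numpy as np

def gamma(T_prime, T_d=933.0, s=1.5, T_min=1.0):
    """Global order parameter of Eq. (1): 1 = absolute order, 0 = absolute disorder.

    Minimal sketch: the logistic term 1/(1 + exp(-(T_d/T')^s)) is rescaled to
    [0, 1] over the sampled range [T_min, T_d]; this rescaling is our reading
    of the normalization constant N. T_d = 933 K is only a nominal Al value.
    """
    logistic = lambda T: 1.0 / (1.0 + np.exp(-(T_d / np.asarray(T, dtype=float)) ** s))
    g = logistic(T_prime)
    g_min, g_max = logistic(T_d), logistic(T_min)  # disorder and order limits
    return (g - g_min) / (g_max - g_min)

# gamma stays near 1 at low temperature and decays toward 0 near T_d
print(gamma([50.0, 300.0, 600.0, 900.0]))
```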

While γ describes the level of disorder of a macroscopic, homogeneously disordered system, we are primarily interested in local atomic disorder within a heterogeneous system. To establish a connection between the global and local scales, we map the likelihood of finding a given local atomic environment within an ensemble of configurations to a local order parameter (SODAS), λ(n), where n indexes an atom. One can think of λ(n) as identifying the temperatures at which a given atomic environment is most likely to be found, mapped onto a point along a phase-space trajectory between two phases, A and B. In practice, due to ergodic constraints, we assume that this ensemble can be sampled discretely from MD simulations. We represent this mapping as:

$$f({\{\gamma \}}_{i})\,\mapsto \,\lambda (n)$$
(2)

where \({\{\gamma \}}_{i}=\{{\gamma }_{1},{\gamma }_{2},\ldots ,{\gamma }_{k}\}\) represents the set of γ values associated with a given local atomic structural motif i across a discrete set of k ensembles, and f is a function that maps {γ}i to λ(n). It is important to understand that we are arguing that a local atomic environment may be represented by the set of points it occupies along the order-to-disorder spectrum, rather than by its geometric symmetries (or lack thereof). This definition provides a more grounded representation of the local environment because it can be mapped back onto a physical system, rather than onto an unsupervised feature vector. While the function f is unknown, it can be approximated. In this work, we use a graph neural network scheme to facilitate this approximation while retaining physical interpretability. It is also important to understand that the GNN is not predicting the temperature of a local environment, but rather the point along the order-to-disorder spectrum at which the local environment is most likely to exist. One may think of this scheme as optimizing a high-dimensional non-linear function that maps the temperature of the system to a point along a path between two phases in a representative configuration space. We also note that s can be iteratively tuned for a given system by starting from an initial guess and observing the error between the predicted average λ for a structure and the theoretical γ at the known thermostat temperature during training, as sketched below. Figure 1 outlines the key steps in this process, which is discussed in further detail in the Methods section.
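
The tuning of s described above can be monitored with a simple error metric: the mean squared difference between the average predicted λ per training structure and γ evaluated at the known thermostat temperature. The sketch below, which reuses the hypothetical gamma() helper from the previous listing, is a minimal illustration of that check, not the actual training loop.

```python
import numpy as np

def s_tuning_error(thermostat_temps, mean_lambda, T_d, s):
    """Mean squared error between the average predicted lambda per structure
    and gamma at the known thermostat temperature, for a candidate s."""
    target = gamma(thermostat_temps, T_d=T_d, s=s)
    return float(np.mean((np.asarray(mean_lambda, dtype=float) - target) ** 2))

# Hypothetical usage: scan candidate s values and keep the one with lowest error
# (in practice each candidate would require retraining the GNN with new labels)
# errors = {s: s_tuning_error(temps, mean_lam, T_d=933.0, s=s) for s in (1.0, 1.5, 2.0)}
```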

Validation of SODAS

We first validate the SODAS model by observing how well the average SODAS value within several bulk configurations at different temperatures aligns with the theoretical values of γ, as seen in Fig. 2. Here, we see that all atoms in the structure at 0 K are uniformly predicted to have λ = 1, which is indicative of the perfect crystal. In contrast, at 1200 K, all atoms have λ close to 0 because the structure exists as a melt. On average, structures between these limits yield intermediate values of λ, as expected. In all cases, the average value of λ aligns well with the theoretical value of γ, providing evidence that our methodology, both in conception and implementation, is accurate. A red horizontal line is also drawn in Fig. 2 to provide the reader with a clear understanding of where absolute disorder lies along the y-axis.

Fig. 2: SODAS calculations on bulk structures taken during a superheating MD simulation.
figure 2

Values along the y-axis represent the average SODAS value for each shown structure, whose atoms are colored according to each atom’s SODAS value. The dashed line indicates the theoretical values of γ while the plotted SODAS values represent the accuracy of the GNN mapping. The red horizontal line at y = 0 indicates a point of absolute disorder. The error bars represent the standard deviation of λ values predicted at a given temperature.

At the same time, detailed visualization of intermediate-temperature configurations reveals a spectrum of atomic environments covering a range of λ(n), rather than homogeneously distributed disorder. For example, the second structure in Fig. 2, which represents a structure at roughly 200 K, has λ values ranging from 0.9 to 0.99. Accordingly, as described previously, similar atomic environments exist at a range of temperatures but with different degrees of expression according to the average overall level of disorder. Intuitively, this makes sense, as the goal of the SODAS metric is to judge the likelihood of an atomic environment existing at an arbitrary point along the abstract spectrum between the fully ordered and fully disordered variants. If a unique atomic environment occurs at multiple temperatures, one would expect its λ to be a weighted combination of the individual occurrences of that environment along the temperature spectrum.

Boundary identification in solid-solid interfaces

While the perturbed pristine bulk structures provide a case study for analyzing how SODAS performs, most interesting structures contain defects and varying levels of disorder. To this end, in this section we analyze how SODAS can be used to extract structural information from solid-solid interface regions. Since λ is continuously valued over the discrete atoms, it can be interpolated to a continuous field, which allows for its integration into continuum models. For instance, we note the similarity between such a continuous field representation and the phase order parameter used in phase-field models8,48. We showcase this concept for the example of two grain boundary regions with varying levels of interfacial complexity. Nevertheless, we note that this method can be applied to other classes of crystalline interfaces, such as symmetric tilt and twin boundaries, and edge/screw dislocations.

From Fig. 3, one can see the intuitive nature of SODAS, which cleanly characterizes the grain regions with λ close to 1, smoothly transitioning to higher degrees of disorder near the boundary. For boundaries that show higher degrees of crystallinity, such as those in Σ5(110)[120], the disorder present at the interface is minimal, as expected, though it is still clearly detected. Likewise, for more disordered boundaries, such as those in Σ9(110)[110], a greater degree of disorder is detected within the interface region. As in the previous section, these characterizations exemplify the ability of SODAS to determine where a grain begins and ends.

Fig. 3: SODAS predictions and gradients of grain boundaries.
figure 3

Continuous field (and its gradient norm) of the originally discrete, per-particle SODAS value λ. The discrete-to-continuum conversion is done by interpolating the discrete λ onto a uniform, fine grid. The gradient information can then be computed over the uniform grid.

Figure 3 shows the continuous fields derived from the originally discrete, per-particle SODAS value λ. Additionally, the gradient norm of λ was calculated and visualized. This discrete-to-continuum conversion was done by interpolating the discrete λ values onto a uniform grid using PyVista49. When calculating the gradient of this field, we observe areas of the structure where there are sharp changes in the SODAS values. Notably, the gradient is maximized not at the center of the grain boundary, but rather at the transition into the boundary region, because these are the locations within the structure where the level of disorder changes most abruptly.
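
A minimal sketch of this discrete-to-continuum step is shown below. The paper performs the interpolation with PyVista; here we substitute a NumPy/SciPy version over a 2D slice, with the grid spacing chosen only for illustration.

```python
import numpy as np
from scipy.interpolate import griddata

def lambda_field_and_gradient(positions, lam, grid_spacing=1.0):
    """Interpolate per-atom SODAS values onto a uniform 2D grid and take the gradient.

    positions: (N, 2) in-plane atomic coordinates; lam: (N,) per-atom SODAS values.
    """
    x = np.arange(positions[:, 0].min(), positions[:, 0].max(), grid_spacing)
    y = np.arange(positions[:, 1].min(), positions[:, 1].max(), grid_spacing)
    X, Y = np.meshgrid(x, y)

    # Continuous field lambda(x, y) from the discrete per-atom values
    field = griddata(positions, lam, (X, Y), method="linear")

    # Gradient norm |grad lambda|, largest where the level of order changes abruptly
    gy, gx = np.gradient(field, grid_spacing)
    grad_norm = np.sqrt(gx ** 2 + gy ** 2)
    return field, grad_norm
```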

The sensitivity of this detection can be seen in Fig. 3, where the gradient of the scalar field identifies two regions in which there is an abrupt change in the SODAS values. As we move from the crystalline region toward the interface, along the boundary normal, we first encounter a crystal-to-boundary transition region, followed by the boundary itself, and finally a boundary-to-crystal transition region as we move away from the interface. Therefore, the gradient predictions in Fig. 3 highlight the fact that a degree of homogeneity can exist in both the ordered interior of the grain and the disordered interior of the grain boundary.

Boundary identification in solid-liquid interfaces

Disordered interface boundaries are notoriously difficult to quantitatively characterize due to the inherent complexity and heterogeneity of their local atomic environments. Bond-angle methods such as CSP, CNA, AJ, and PTM sometimes struggle to accurately distinguish between perturbed crystalline and disordered atomic environments50,51. These difficulties ultimately make defining interface boundaries challenging. In contrast, the SODAS formalism provides a continuous metric that allows for a physically justifiable and mathematically rigorous definition of the interface boundary transition.

To this end, we have performed two-phase crystal/liquid CMD simulations at several temperatures (100, 500, and 1200 K) to observe how several methods classify the unique structural environments present in each scenario. Further details regarding the simulation setup can be found in the Methods section. Previous works have shown that the solid-liquid boundary in Al exhibits a soft transformation when moving from the solid to the liquid, with a gradual increase in the level of disorder as a function of distance from the solid phase52,53. These results indicate that a three- to four-atomic-layer boundary exists between the solid and liquid phases in which the level of disorder continuously increases as one approaches the liquid phase. This implies that one needs a characterization method that can continuously and smoothly map the topology of the interface boundary.

Figure 4 provides a comparison between SODAS, PTM, and CSP. Here, we examine both the total structure, which again represents the solid-to-liquid transition, as well as a zoomed-in view of the interface boundary itself. Figure 4a shows the SODAS characterization of the solid-to-liquid transition, along with a zoomed-in interface boundary region. Here, SODAS correctly identifies the crystalline region as corresponding to structure typically found at low temperatures, consistent with this region being held at 100 K in Fig. 4a. The interface boundary region in the zoomed-in portion also shows a natural, gradual progression from solid to liquid, which one would expect at equilibrium conditions. There are regions where the interface is more crystalline and regions that are more disordered, with a gradual transition between them. This highlights that SODAS can accurately quantify both the solid and liquid phases, as well as the interface boundary between them.

Fig. 4: Characterization of solid-liquid interfaces.
figure 4

Comparison between SODAS (a), CSP (b), AJ (c), CNA (d), and PTM (e) when quantifying the interphase interface region between a solid and a liquid. Each subplot provides a full view of the characterization of the solid-liquid interface with a zoomed and sliced view of the interface boundary transition when going from the solid phase to the liquid. SODAS and CSP have their own colorbars, shown within their subplots, while (c–e) all share the same labels, which are shown in (e).

Figure 4b references the predictions made by CSP. Here, while CSP does an excellent job of identifying the crystalline region, its characterization of the liquid phase is less reliable due to the misclassification of various sites throughout the liquid region. This misclassification stems from the fact that CSP works on the notion that values close to zero represent highly ordered crystal structures, while values away from zero represent deviations from those crystal symmetries. While CSP clearly indicates that all atoms in the liquid deviate from the corresponding crystal symmetry, it fails to distinguish liquid environments from one another: up to some threshold, a CSP value farther from zero does imply greater structural dissimilarity than a value closer to zero, but beyond that threshold the value carries no further distinction. From Fig. 4b, one can also see a more discrete characterization of the interface boundary region, where only a vague guess can be made regarding which regions of the boundary are more liquid-like versus solid-like. As these regions would differ greatly in energy and, therefore, properties, one can reason that CSP is not capable of providing an accurate description of this region.

For the cases shown in Fig. 4c–e, which cover AJ, CNA, and PTM, respectively, we examine how these binary classification schemes perform when characterizing the solid-liquid interface region. In all cases, the solid region is well defined, though, in both AJ and PTM, disordered atoms in the liquid region are often classified as a particular solid phase, indicating a breakdown in the classification algorithm. Within the interface region, both AJ and PTM give a seemingly random characterization of structure types. CNA performs better, with a clear mapping between the SODAS and CNA classifications within the interface, but it provides a coarser level of information, giving only the impression of solid-like and liquid-like regions. Overall, SODAS provides a significantly finer and more informative prediction of the solid-liquid interface region.

While Fig. 4 qualitatively compares the continuous nature of the solid-liquid interface, Fig. S5 provides a more quantitative picture. Here, we examine the average order parameter value, normalized between 0 and 1, for all methods as a function of distance along the y-axis, which spans the solid-to-liquid transformation. The highlights of Fig. S5 are as follows: (1) CSP provides a reasonable deviation in order parameter values when moving away from the solid, but fails to identify the end of the boundary interface and the liquid phase; (2) PTM and AJ both indicate the existence of a one- to two-atomic-layer-thick boundary interface, shorter than the experimentally observed length52, and also provide a jagged set of values towards the ends of the boundary interface region; and (3) while a-CNA and SODAS both predict a boundary interface region in the experimentally observed range, only SODAS provides a smooth gradient throughout the entire transition, confirming our qualitative analysis in the paragraphs above.
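
The layer-averaged profiles of Fig. S5 can be reproduced conceptually by binning the per-atom values along the y-axis. The sketch below assumes the order-parameter values have already been normalized to [0, 1]; the bin width is an arbitrary illustrative choice.

```python
import numpy as np

def layer_profile(y_coords, values, bin_width=2.0):
    """Average a per-atom order parameter in slabs along y to expose the
    solid-to-liquid transition (as in Fig. S5)."""
    edges = np.arange(y_coords.min(), y_coords.max() + bin_width, bin_width)
    centers = 0.5 * (edges[:-1] + edges[1:])
    idx = np.clip(np.digitize(y_coords, edges) - 1, 0, len(centers) - 1)

    # Mean value per slab; empty slabs are reported as NaN
    profile = np.array([values[idx == k].mean() if np.any(idx == k) else np.nan
                        for k in range(len(centers))])
    return centers, profile
```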

Figure 5 shows the SODAS characterization of the atomic environments present in the final MD configurations for the three temperatures described earlier. Figure 5a shows the SODAS predictions on the 100 K system, with inserted arrows indicating which regions of the structure were initially crystalline and which regions were initially liquid. One can observe, on the initial liquid side, the presence of several disordered regions. These regions are shown more clearly in Fig. 5b due to the specified SODAS range in the colorbar. Intuitively, it makes sense that the portion of the structure that was initially a liquid would have defects upon quenching to 100 K, while the region that was initially crystalline would not have such defects at 100 K.

Figure 5c shows the SODAS predictions on the 500 K system. The inserted arrows from (a) are not shown here but are implied, with the black dashed line from (a) still present. One can observe, on the initial liquid side, the presence of many disordered regions, with one large disordered patch in the middle. These regions are shown more clearly in Fig. 5d due to the specified SODAS range in the colorbar. Again, it makes sense that the portion of the structure that was initially a liquid would have defects upon quenching to 500 K. Importantly, SODAS identifies larger defects in the initial liquid region than in the initial crystal region, which also makes sense, as local environments that lead to larger defects are kinetically easier to access in the liquid phase than in the crystal phase. While there are perturbed regions in the initial crystal portion, there are no large-scale defects present.

Fig. 5: Characterization of equilibrated solid-liquid interfaces at different temperatures.
figure 5

a SODAS characterization of the final structure of the solid-liquid interface at 100 K. Inserted labels indicate which side of the structure was initially a solid and which side was initially a liquid; these labels apply to all sub-figures in this figure. b The same characterization as (a) but with a different SODAS colorbar scale. c SODAS characterization of the final structure of the solid-liquid interface at 500 K. d The same characterization as (c) but with a different SODAS colorbar scale. e SODAS characterization of the final structure of the solid-liquid interface at 1200 K. f The same characterization as (e) but with a different SODAS colorbar scale. The atoms in each system are color-coded based on their SODAS values, as referenced in their respective colorbars. Note that not all colorbars have the same scale.

Figure 5e shows the SODAS predictions on the 1200 K system. The inserted arrows from (a) are not shown here but are implied, with the black dashed line from (a) still present. As 1200 K is above this interatomic potential's melting temperature, SODAS correctly identifies the entire structure as being in the liquid phase. However, we observe patches in the structure that are less disordered than others. These regions are shown more clearly in Fig. 5f due to the specified SODAS range in the colorbar. Interestingly, from Fig. 5f, one can see that the regions exhibiting less disorder are more common in the region that was initially a solid, which makes sense, as some level of structural similarity with the solid phase could persist upon melting. It is also possible that the initial crystal region has not yet reached equilibrium with the initial liquid phase. In either case, SODAS captures this trend and provides a pathway for more complex analysis.

Autonomous microstructural feature extraction

The ability to define boundary transitions also enables the identification of larger-scale microstructural features. To demonstrate this capability, we showcase the performance of SODAS when compared to CSP, a-CNA, AJ, and PTM, for the classification of both grain boundary and grain regions in dynamic polycrystalline structures. As described in the Methods section, we performed an MD simulation at 600 K of a 1.6 million atom FCC aluminum system containing 250 initial grains.

Figure 6 provides a visual comparison of the final MD snapshot between the various characterization methods. Here, many grains have coalesced over time to form larger grains, as enough kinetic energy was present in the system to overcome large potential energy barriers. Figure 6a provides the SODAS visualization, where one can clearly identify the grain boundary regions (shown in blue), the grains (shown in red), and the transition region between grain and grain boundary (shown in white). Since we define grain boundary atoms as atoms having a certain level of disorder, this visualization permits a simple thresholding procedure, as one can clearly identify regions based on their level of disorder/order. This prescription will, however, result in some misclassified atoms as the temperature increases, due to the level of disorder present within the grain regions. The level of misclassification can be quantified, and is shown in Fig. 7.

Fig. 6: Final snapshot along the 600 K CMD polycrystalline trajectory for several methods.
figure 6

a SODAS, b CSP, c PTM, d CNA, and e AJ. The SODAS colorbar is shown below (a), the CSP colorbar above (b), and the labels for (c–e) below (e).

Fig. 7: Polycrystalline structure evolution during CMD at 600 K.
figure 7

a Number of grains as a function of time. b Number of atoms per grain as a function of time, where error bars represent the largest grain (top bar) and smallest grain (bottom bar). c Number of misclassified grain boundary atoms throughout the structure as a function of time. d Number of grain boundary atoms as a function of time. e–h SODAS characterizations of several polycrystalline structures at various times throughout the CMD simulation. The SODAS colorbar is shown in the middle of the four structures, with each structure labeled according to the time the snapshot was taken along the CMD trajectory.

Figure 6b–e provide the visualizations of CSP, PTM, CNA, and AJ, respectively. CSP (b) has difficulty identifying the transition region between grain and grain boundary due to the level of noise present within its characterization. Here, many atoms are misclassified as grain boundary atoms, and it is clear that there is no obvious threshold that would allow for precise identification of the two regions. PTM (c) performs better within the grain than CSP; however, it performs worse within the grain boundary region. This is due to the inherent level of disconnectedness found within the boundary regions, where there are significant levels of noise when attempting to discern which atoms belong to the grain boundary. This should be no surprise, given PTM’s struggles to classify the boundary region of the solid-liquid interface.

CNA, shown in Fig. 6d, provides a better depiction of the GB regions than PTM, though it has difficulty within the grains. CNA's identification of structure types deviates as atomic perturbations grow, with larger perturbations leading to larger errors in its characterization of structure types. Therefore, while the boundaries themselves are reasonable, the transition between boundary and grain is not smooth and continuous. Finally, AJ (e) provides extremely thin and disorganized boundary regions with chaotic misclassification of grains present throughout the structure.

Figure 7 aims to quantify how both the grain boundaries and the grains evolve over time. Figure 7a depicts how the number of grains changes as a function of time for all methods considered in this work. Here, we can see that SODAS was the only method to correctly identify all 250 grains in the initial configuration. As the simulation progresses, SODAS predicts a gradual decrease in the number of grains, which coincides with a gradual increase in the number of atoms within the grain regions, as shown in (b). The number of atoms within the grain boundary regions should decrease in a similar manner, as atoms leave the grain boundaries and move into the grain regions. This is evidenced in (d), where SODAS predicts a decrease in the number of grain boundary atoms over time that mirrors the growth of grain atoms. (c) shows the number of misclassified atoms, as described in the Methods section. From (c), we can see that SODAS has the smallest number of misclassified atoms in the system for all cases except the initial configuration. This is because we have chosen λ = 0.7 as our threshold, but at t = 0, all atoms in the grain region presumably have λ = 1. As a result, atoms in the transition region between grain and grain boundary are grouped with the grain boundary due to their smaller λ values, even though they actually belong to the grain. While this leads to a larger number of misclassified atoms in the initial structure, it is important to note that this number is consistent with the remaining structures as the system evolves.

For CSP, Fig. 7a indicates a smaller number of grains captured in the initial configuration, followed by an increase in the number of grains to more than the original count. This is due to CSP's noisy characterization, in which many small clusters, on the order of a few tens of atoms, are classified as their own grains instead of being assigned to a single larger grain. This leads to a smaller average grain size, as shown in (b). CSP does, however, capture the gentle decrease in the number of grains over time. Accordingly, CSP shows a much larger number of grain boundary atoms present in the system, as shown in (d), which we expect given the noisy characterization. CSP also yields a larger misclassification count (c), which aligns with and supports our noisy-characterization argument.

For both CNA and PTM, Fig. 7a shows a smaller number of initial grains followed by a drastic dropoff as time evolves. This is due to the lack of a true thresholding parameter in the binary classification scheme these methods employ, yielding a more rigid definition of grain and grain boundary atom. While the number of atoms per grain increases in (b), it increases much more sharply than either CSP or SODAS. One can also see a much larger number of misclassified atoms in (c), again due to issues characterizing perturbed local structures at non-zero temperatures. There is also an increase in the number of grain boundary atoms when compared to SODAS, though the reduction in the grain boundary atoms over time does follow a similar trend.

Finally, for the case of AJ, Fig. 7a shows only a single grain present throughout the entire simulation, including the initial structure. This is due to AJ suffering even worse thresholding issues than either CNA or PTM, leading to an unphysical blending of the grain boundary and grain regions. This is also seen in (b), with a large number of atoms per grain, as AJ only finds a single grain in the system. This trend aligns with the number of grain boundary atoms in (d), with AJ registering nearly an order of magnitude fewer grain boundary atoms than any other method. There is also a larger number of misclassified atoms shown in (c), leading to greater blending between the two domains.

Dynamic fracture evolution

Here we examine the performance of various methods at capturing the initiation of tensile fracture. We quantify performance as the ability of a given method to accurately predict the location of shear bands throughout the material using only the level of disorder captured by that method. We use D2, which has been used previously to gauge shear band locations54, as a way of judging the accuracy of the structural characterization methods used in this work. D2 effectively captures an instantaneous measure of an atom's local displacement, making it a reasonable proxy for local irreversible shear transformations. Figure 8 shows the results of this quantification on snapshots taken throughout the MD simulation. Figure 8a–d provide insight into the location of shear bands by identifying the locations in the structure, along the z-axis, that represent the highest levels of disorder for a given method. For all methods, kernel density estimation (KDE)55 is used to determine peaks in each method's disorder predictions. In the case of D2, CSP, and SODAS, the values fed into the KDE correspond to predictions at a given z-coordinate that fall within the top 5% of disorder. For a-CNA, PTM, and AJ, all values not characterized as a known crystal structure are used, as their characterization schemes are binary. Figure 8e–h visualize the snapshots in (a–d) using SODAS to color-code the atoms. Figure S4 provides a visualization of the other methods over the course of the MD simulation.
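
A minimal sketch of this peak-finding analysis is given below: the z-coordinates of the most disordered atoms are fed into a Gaussian KDE, and peaks in the resulting density are taken as candidate shear-band locations. The prominence threshold and grid resolution are illustrative assumptions, not values taken from this work.

```python
import numpy as np
from scipy.stats import gaussian_kde
from scipy.signal import find_peaks

def shear_band_locations(z_coords, disorder, top_fraction=0.05, n_grid=500):
    """Locate candidate shear bands as peaks in a KDE of the z-coordinates of
    the most disordered atoms. `disorder` is any per-atom disorder measure
    (e.g., 1 - lambda, CSP, or D2)."""
    # Keep only atoms in the top 5% of disorder
    cutoff = np.quantile(disorder, 1.0 - top_fraction)
    z_sel = z_coords[disorder >= cutoff]

    # Kernel density estimate of disordered-atom positions along z
    kde = gaussian_kde(z_sel)
    z_grid = np.linspace(z_coords.min(), z_coords.max(), n_grid)
    density = kde(z_grid)

    # Peaks in the density are the predicted band locations
    peaks, _ = find_peaks(density, prominence=0.1 * density.max())
    return z_grid[peaks], density
```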

Fig. 8: Characterization of dynamic fracture evolution simulations.
figure 8

ad provide insight into the location of the shear band by observing where each method predicts the highest levels of structural disorder in the system, compared to a ground-truth value of D2. Colors in (ad) represent the different methods, with inserted boxes showing the time during the simulation that the snapshot was taken from. eh provide visual depictions of the fracture process, with colors representing the SODAS values at each snapshot. Each image represents the atomic structure captured by the shear band location plots. The semi-transparent yellow region in (c) indicates the location of fracture initiation.

Figure 8a shows the approximate predicted shear band locations (peaks in the disorder distributions) for several methods at 1 ns into the MD simulation. Several interesting points can be made here: (1) CSP provides a flat distribution, implying that it detects a uniform level of local structural change; (2) PTM only predicts a single peak at small z-coordinates, while a-CNA and AJ predict three bands in the bottom half of the structure; and (3) neither SODAS nor D2 predicts the formation of any bands in this structure. These results seem to indicate that CSP, a-CNA, PTM, and AJ are extremely sensitive to local structural changes when compared to SODAS and D2. Figure 8b shows several major changes, including the appearance of shear bands according to D2, with SODAS peaks aligning reasonably well with the peaks predicted by D2. (b) also shows that all other methods produce a nearly flat distribution, again showing the difficulty of predicting shear band locations with these methods due to their seemingly random prediction of disorder along the z-axis. Interestingly, both AJ and a-CNA in (a) are in decent agreement with D2 in (b), though their own predictions in (b) fall out of agreement, indicating that these two methods could be used to predict future locations of band formation but not their instantaneous locations. It is also important to note that in both (a) and (b), the structure has not undergone tensile failure, and these panels represent the lead-up to the eventual failure of the material.

Figure 8c highlights an instance after the material has fractured, though not into two separate pieces yet. The semi-transparent yellow region in (c) indicates the location of fracture initiation. Here, we see a significant shift in the band locations and corresponding densities, which makes intuitive sense given the extreme structural changes in the material. (c) highlights the agreement between D2 and SODAS over a nearly 225 Å range of the z-axis. All other methods show a nearly uniform prediction of three band peaks, which disagrees with the D2 prediction of two peaks. The disagreement occurs between 75 and 150 Å, which represents the regions in the material where the fracture is occurring. This would indicate that all other methods deviate from D2 in regions where the level of disorder is extreme, perhaps due to the mischaracterization of such environments. We note for clarity that SODAS and D2 remain in excellent agreement throughout the entire z-axis.

Figure 8d shows the shear band locations after the material has fractured into two pieces. Here, we see reasonably good agreement between all methods, though we note that SODAS still provides the most accurate picture when compared to D2. We note again, however, that AJ, CSP, PTM, and a-CNA predicted, in the previous panel (c), the approximate band locations given by D2 in (d), indicating that these methods show promise in predicting future trends in potential band locations but do not necessarily serve as accurate instantaneous band predictors.

To summarize these findings, we note three main takeaways: (1) All methods outside of SODAS predict the approximate formation of shear bands before their actual formation, according to D2, indicating their potential use as future predictors of shear behavior but not instantaneous ones, (2) during the fracture process SODAS provides the most accurate depiction of potential shear band locations when compared to D2, and (3) due to the alignment of SODAS and D2 over the majority of the MD simulation, we conclude that the deviation from D2 for all methods outside of SODAS is likely due to misclassification of highly disordered regions in the material, implying that one must be capable of accurately capturing those regions to truly understand the mechanical properties during the failure process.

Discussion

In summary, characterizing the nature of local atomic disorder is necessary to understand how structure-property relationships evolve. SODAS is a new mathematical framework in which local atomic environments are transformed into graph representations, encoded via a graph neural network paradigm, and finally mapped onto a local order parameter. This order parameter, λ, is an informative, continuous, and mathematically bounded scalar that represents the level of disorder present within an atomic environment, and is analogous to an atomically resolved configurational entropy density. In addition to the examples shown throughout this work, these advantages allow for the universal quantification of a multitude of complex and heterogeneous materials properties and phenomena.

We also envision our proposed methodology as a tool for multiscale model integration. In particular, SODAS provides an atomistically derived, physically motivated continuous scalar field representation for phase field and continuum models. This mapping can be likewise leveraged to output field quantities such as phase order, grain distribution, concentration, stress/strain, and so on. Such an approach offers a new perspective and valuable technique for bridging scales in multiscale models, both between atomistic and microscale descriptions, as well as between discrete and continuous representations. We further emphasize that although this work focuses on single-element systems, our method is generally applicable to multi-component systems and their corresponding microstructural features.

The advantages of SODAS also become clear for extraction of physical properties that relate to materials’ performance or degradation. For instance, we showed that by interpolating the discrete representation to a continuum representation, we could analyze or differentiate λ to deduce spatially resolved changes in structural homogeneity. In practice, these structural changes often map to changes in key response properties, including diffusivity, dielectric response, electrical conductivity, and elastic compliance56. In cases where such properties can be computed locally or measured using local probes, SODAS offers a way to extract analytical relationships between structure and function. Moreover, sharp gradients from abrupt changes in response functions can concentrate electrical, chemical, or mechanical potential, creating hotspots that can initiate key electrochemomechanical failure modes. We, therefore, propose that gradients in the continuous representation of λ may provide a robust way to identify such hotspots, with a direct connection to early prediction of the propensity for deleterious outcomes such as fracture, corrosion, and thermal runaway.

Methods

Training data preparation

Classical molecular dynamics (CMD), using the LAMMPS software package57, was used to generate training data for the GNN model. Starting from bulk FCC aluminum (containing 1024 atoms), CMD was performed in the NVT ensemble using the Zhou et al. EAM potential58. The range of temperatures used for the training data was 50 to 1200 K. At each temperature, an NVT simulation was performed for 10 ns. Data used for training were taken after the 5 ns mark to ensure that only equilibrated configurations were used.

Graph neural network implementation

Conversion to graph

Prior to the GNN operation, we converted the atomic systems into graphs using a simple cutoff radius-based neighbor list search (implemented using the Atomic Simulation Environment59), with the cutoff Rc = 3.5 Å. Each node of the converted graph corresponds to an atom type z, and each edge to a bond distance d. In the end, our graph representations encode the atoms and their local neighbors, with atoms represented as nodes and neighbor connections represented as edges between nodes.
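
A minimal sketch of this conversion using the Atomic Simulation Environment's neighbor-list routine is shown below; the snapshot file name in the usage comment is hypothetical.

```python
import numpy as np
from ase.neighborlist import neighbor_list

R_C = 3.5  # cutoff radius in Angstrom, as used in this work

def atoms_to_graph(atoms, cutoff=R_C):
    """Convert an ASE Atoms object into a simple graph representation.

    Returns node features (atomic numbers z), an edge index of shape (2, E),
    and the corresponding bond distances d."""
    # 'i', 'j': indices of bonded atom pairs; 'd': pair distances (PBC-aware)
    i, j, d = neighbor_list("ijd", atoms, cutoff)
    z = atoms.get_atomic_numbers()
    edge_index = np.vstack([i, j])
    return z, edge_index, d

# Hypothetical usage:
# from ase.io import read
# atoms = read("al_snapshot.xyz")
# z, edge_index, d = atoms_to_graph(atoms)
```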

GNN operation

The GNN model used in this work consists of three components: the initial embedding, the atom-bond interactions, and the final output layers (Fig. 1). In the initial embedding, each atom type z is transformed into a feature vector by an Embedding layer (PyTorch60). Each bond distance d is expanded into a D-dimensional feature vector by the Radial Bessel basis functions (RBF)61,

$${{\rm{RBF}}}_{n}(d)=\sqrt{\frac{2}{{R}_{c}}}\,\frac{\sin \left(\frac{n\pi }{{R}_{c}}d\right)}{d},$$
(3)

where n ∈ [1, D] and Rc is the cutoff value. Both atom and bond feature vectors have the same length D = 100.
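
The edge featurization of Eq. (3) can be written compactly in NumPy, as sketched below for a batch of bond distances; the basis size and cutoff follow the values quoted in the text.

```python
import numpy as np

def radial_bessel_basis(d, num_basis=100, r_cut=3.5):
    """Expand bond distances d into D-dimensional RBF features, Eq. (3).

    d: array of bond distances, shape (E,); returns an array of shape (E, D)."""
    d = np.asarray(d, dtype=float)[:, None]       # (E, 1)
    n = np.arange(1, num_basis + 1)[None, :]      # (1, D)
    return np.sqrt(2.0 / r_cut) * np.sin(n * np.pi * d / r_cut) / d
```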

The atom-bond interactions are also known as GNN convolution, aggregation, or message-passing. There are many variants of GNN convolution operations that can be adopted from the literature. In this work, we choose the edge-gated graph convolution62,63. The term atom-bond interaction reflects the fact that the nodes and the edges exchange information during the convolution operation. Specifically, the node features \(\overrightarrow{h}_{i}^{\,l+1}\) of node i at the (l + 1)th layer are updated as

$$\overrightarrow{h}_{i}^{\,l+1}=\overrightarrow{h}_{i}^{\,l}+{\rm{SiLU}}\left({\rm{LayerNorm}}\left(\overrightarrow{W}_{s}^{\,l}\,\overrightarrow{h}_{i}^{\,l}+\sum_{j\in {\mathcal{N}}(i)}\hat{\overrightarrow{e}}_{ij}^{\,l}\odot \overrightarrow{W}_{d}^{\,l}\,\overrightarrow{h}_{j}^{\,l}\right)\right),$$
(4)

where SiLU is the sigmoid linear unit activation function64; LayerNorm is the layer normalization operation65; \(\overrightarrow{W}_{s}\) and \(\overrightarrow{W}_{d}\) are weight matrices; the index j denotes a neighbor node of node i; \(\hat{\overrightarrow{e}}_{ij}\) is the edge gate vector for the edge from node i to node j; and ⊙ denotes element-wise multiplication. The edge gate \(\hat{\overrightarrow{e}}_{ij}^{\,l}\) at the lth layer is defined as

$$\hat{\overrightarrow{e}}_{ij}^{\,l}=\frac{\sigma (\overrightarrow{e}_{ij}^{\,l})}{\sum_{j^{\prime}\in {\mathcal{N}}(i)}\sigma (\overrightarrow{e}_{ij^{\prime}}^{\,l})+\epsilon },$$
(5)

where σ is the sigmoid function, \(\overrightarrow{e}_{ij}^{\,l}\) is the original edge feature, and ϵ is a small constant for numerical stability. The edge features \(\overrightarrow{e}_{ij}^{\,l}\) are updated by

$$\overrightarrow{e}_{ij}^{\,l+1}=\overrightarrow{e}_{ij}^{\,l}+{\rm{SiLU}}\left({\rm{LayerNorm}}\left(\overrightarrow{W}_{g}^{\,l}\,\overrightarrow{z}_{ij}^{\,l}\right)\right),$$
(6)

where \(\overrightarrow{W}_{g}\) is a weight matrix, and \(\overrightarrow{z}_{ij}\) is the concatenation of the node features \(\overrightarrow{h}_{i}\), \(\overrightarrow{h}_{j}\), and the edge features \(\overrightarrow{e}_{ij}\):

$$\overrightarrow{z}_{ij}=\overrightarrow{h}_{i}\oplus \overrightarrow{h}_{j}\oplus \overrightarrow{e}_{ij}.$$
(7)

Lastly, via the final output layers, each node feature is eventually transformed into a scalar output y ranging from 0 to 1. In this work, these final output layers are a two-layer multilayer perceptron (MLP) with SiLU activation after the first layer (D = 100 neurons) and sigmoid output after the second layer (scalar output). Effectively, the GNN predicts the SODAS metric for every atom. Further details regarding model training are described in Supporting Information.
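
A compact PyTorch sketch of the edge-gated convolution defined by Eqs. (4)–(7) is shown below. It is a minimal re-implementation for illustration, not the trained model used in this work; the edge-index convention (messages flowing from neighbor j to node i) is our assumption.

```python
import torch
import torch.nn as nn

class EdgeGatedConv(nn.Module):
    """Edge-gated graph convolution, a sketch of Eqs. (4)-(7).

    Node and edge features share the same width D; both are updated residually."""
    def __init__(self, dim: int, eps: float = 1e-8):
        super().__init__()
        self.W_s = nn.Linear(dim, dim, bias=False)       # self weight, Eq. (4)
        self.W_d = nn.Linear(dim, dim, bias=False)       # neighbor weight, Eq. (4)
        self.W_g = nn.Linear(3 * dim, dim, bias=False)   # edge weight on z_ij, Eq. (6)
        self.norm_h = nn.LayerNorm(dim)
        self.norm_e = nn.LayerNorm(dim)
        self.act = nn.SiLU()
        self.eps = eps

    def forward(self, h, e, edge_index):
        # edge_index: LongTensor of shape (2, E) with rows (i, j); messages flow j -> i
        i, j = edge_index

        # Edge gates, Eq. (5): sigmoid(e_ij) normalized over the neighbors of i
        sig = torch.sigmoid(e)
        denom = torch.zeros_like(h).index_add_(0, i, sig)
        gate = sig / (denom[i] + self.eps)

        # Node update, Eq. (4): residual + gated sum over neighbors
        msg = torch.zeros_like(h).index_add_(0, i, gate * self.W_d(h)[j])
        h_new = h + self.act(self.norm_h(self.W_s(h) + msg))

        # Edge update, Eqs. (6)-(7): z_ij = h_i (+) h_j (+) e_ij
        z = torch.cat([h[i], h[j], e], dim=-1)
        e_new = e + self.act(self.norm_e(self.W_g(z)))
        return h_new, e_new
```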

Atomistic simulation details

All CMD simulations were performed using the LAMMPS software package57 and the Zhou et al. EAM potential58.

Two-phase simulations

Two-phase simulations were performed within the NVT ensemble by creating two initial, independent thermostats within a rectangular block of Al containing roughly 51,000 atoms. Both regions are initially crystalline. During the initial stages of the MD simulations, the independent thermostats are used to create a liquid region and a crystal region: the thermostat of the liquid region is set to 2000 K, while that of the crystal region is set to 100 K. After 5 ns, allowing for equilibration of the independent phase regions, the two thermostats are removed and replaced by a single new thermostat that acts on the entire system. This thermostat is set to different temperatures depending on the scenario considered: 100, 500, or 1200 K. The combined system is then equilibrated using the single thermostat for 5 ns.

Grain coarsening simulations

CMD simulations in the NVT ensemble were performed for four polycrystalline cases, each with a varying number of initial grains. An initial bulk aluminum system containing roughly 1.6 million atoms was used to construct a polycrystalline system, using the Atomsk software package66, containing 250 initial grains. CMD simulations were performed at 600 K. NVT simulations were run for ~1.5 ns for each combination of initial structure and temperature. Further details regarding the polycrystalline structures can be found in the Results section as well as the Supplemental Information.

Microstructure characterization

Microstructure characterization occurs in four stages: (1) calculation of SODAS for all atoms in the system; (2) thresholding of the atomic configuration based on each atom's SODAS value and subsequent removal of all atoms below the threshold value; (3) conversion of the remaining atoms to a graph representation and discovery of the subgraphs within that graph; and (4) analysis of these subgraphs, such as calculation of the number of atoms in each subgraph. Figure S3b depicts this workflow visually. While step (1) requires little-to-no input from the user, step (2) requires one to define the level of disorder that must be captured when defining the interface regions. The threshold value defines the structural properties of the interface region itself, with a near-zero threshold indicating grain boundaries that are extremely disordered and a value close to 1 representing highly crystalline boundaries. In principle, both classes of interfaces can exist within the same structure, which would require a more complex thresholding scheme; for this work, however, we assume a uniform local atomic environment amongst all grain boundaries.

For all microstructure characterization tests in this work, we employ a thresholding technique to differentiate between atoms belonging to a grain and atoms belonging to a grain boundary. For the case of SODAS, we define this threshold as 0.7, as this value of λ corresponds roughly to the level of disorder one would expect at 600 K. The threshold values at each temperature are defined in a way that minimizes the number of atoms within a grain that may be mistaken as grain boundary atoms. At 600 K, which is roughly half of the melting temperature, there is a significant amount of kinetic energy in the system, which causes non-trivial levels of atomic perturbation.

The same thresholding technique used for SODAS is also used for CSP. Like SODAS, CSP works on the assumption that non-zero values represent deviations from a known, symmetric crystal structure. However, it is not clear whether this trend holds the further one moves from a CSP of zero, implying that there is no obvious cutoff value to distinguish order from disorder. Similar to the SODAS case, we set the 0 K CSP threshold at 10−5, and all other CSP thresholds at 2, meaning any CSP value greater than 2 is considered a grain boundary atom. For PTM, CNA, and AJ, the atomic-level characterization differs from that of CSP and SODAS. Here, environments are encoded as a one-hot vector of known crystal phases; if an atomic environment does not match a known reference, it is classified as “unknown”. Any atom classified as “unknown” is determined to be within a grain boundary.

Once the atoms in the system have been thresholded and all grain boundary atoms have been removed, the remaining system is mapped onto a graph, G, shown in Fig. S3, where edges are defined by i-j pairwise interactions within a 4 Å cutoff radius. A recursive subgraph search algorithm is employed to discover all connected subgraphs, SG, within the complete graph G. This algorithm is extremely efficient, discovering all subgraphs within a 1.6 million atom system in 1.2 s. As all interface atoms were removed prior to the graph construction, the subgraphs of G represent the resulting grains contained within the structure. The total number of subgraphs in the system is equal to the number of grains, and the number of nodes in each subgraph is the number of atoms in the corresponding grain.
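
A minimal sketch of this grain-extraction step is shown below. In place of the recursive subgraph search used in this work, it substitutes SciPy's connected-components routine over the same thresholded 4 Å adjacency, which yields the same grouping of atoms into grains.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components
from ase.neighborlist import neighbor_list

def count_grains(atoms, lam, lam_threshold=0.7, cutoff=4.0):
    """Identify grains by thresholding SODAS values and grouping the remaining
    atoms into connected components of the 4 A bonding graph."""
    keep = np.where(lam >= lam_threshold)[0]     # remove grain-boundary atoms
    grain_atoms = atoms[keep]

    # Build adjacency among the remaining atoms within the cutoff
    i, j = neighbor_list("ij", grain_atoms, cutoff)
    n = len(grain_atoms)
    adj = csr_matrix((np.ones_like(i), (i, j)), shape=(n, n))

    # Each connected component is one grain
    n_grains, labels = connected_components(adj, directed=False)
    atoms_per_grain = np.bincount(labels)
    return n_grains, atoms_per_grain
```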

We also examine the grain boundary atoms and identify the number of such atoms that have been misclassified. We define this misclassification as atoms carrying the grain boundary designation that do not have a required number of neighbor connections within a cutoff of 3 Å. Recall that here we are only examining grain boundary atoms. Atoms within the true grain boundary should have a large number of grain boundary neighbors, whereas atoms that have been classified as grain boundary atoms but actually lie somewhere within a grain should have fewer grain boundary neighbors. We use the combination of the number of grains detected, the atoms-per-grain distributions, and the number of misclassified grain boundary atoms to judge a given method's accuracy.
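
The misclassification count can be sketched as follows; the required number of grain-boundary neighbors is a hypothetical choice, since the exact value is not stated in the text.

```python
import numpy as np
from ase.neighborlist import neighbor_list

def count_misclassified_gb_atoms(atoms, gb_mask, cutoff=3.0, min_gb_neighbors=4):
    """Count grain-boundary-labeled atoms with too few grain-boundary neighbors
    within the 3 A cutoff (min_gb_neighbors is an assumed value)."""
    i, j = neighbor_list("ij", atoms, cutoff)

    # For each GB-labeled atom, count neighbors that are also GB-labeled
    gb_neighbor_count = np.zeros(len(atoms), dtype=int)
    both_gb = gb_mask[i] & gb_mask[j]
    np.add.at(gb_neighbor_count, i[both_gb], 1)

    gb_idx = np.where(gb_mask)[0]
    return int(np.sum(gb_neighbor_count[gb_idx] < min_gb_neighbors))
```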

Tensile fracture simulations

A rectangular block of 64,000 atoms was subjected to tensile strain at a constant strain rate of 0.5%/ps at 100 K under NVT conditions. The tensile strain was applied in the z-direction, and the simulation box was allowed to change along that axis while being held constant along the remaining two axes. The simulation was run for 5 ns. Calculations of D2 were done using the OVITO software package67.

Visualization

All atomistic visualizations were created using the OVITO software package67. Atoms-to-continuum visualizations were done using PyVista49.

Characterization method details

For PTM, a root mean square deviation of 0.25 was used. For CNA, an adaptive cutoff radius was employed. For CSP, the number of neighbors was set to 12 since we examined only the FCC phase of Al. We employed the minimum-weight matching convention for CSP. A CSP cutoff of 2 was used as the boundary between order and disorder, due to the location of the first CSP distribution peak.