Predicting the propensity for thermally activated β events in metallic glasses via interpretable machine learning

Wang, Qi; Ding, Jun; Zhang, Longfei; Podryabinkin, Evgeny; Shapeev, Alexander; Ma, Evan

doi:10.1038/s41524-020-00467-4


Article
Open access
Published: 15 December 2020

npj Computational Materials volume 6, Article number: 194 (2020)


Abstract

The elementary excitations in metallic glasses (MGs), i.e., β processes that involve hopping between nearby sub-basins, underlie many unusual properties of the amorphous alloys. A high-efficacy prediction of the propensity for those activated processes from solely the atomic positions, however, has remained a daunting challenge. Recently, employing well-designed site environment descriptors and machine learning (ML), notable progress has been made in predicting the propensity for stress-activated β processes (i.e., shear transformations) from the static structure. However, the complex tensorial stress field and direction-dependent activation could induce non-trivial noise in the data, limiting the accuracy of the learned structure-property mapping. Here, we focus on the thermally activated elementary excitations and generate high-quality data in several Cu-Zr MGs, allowing quantitative mapping of the potential energy landscape. After fingerprinting the atomic environment with short- and medium-range interstice distribution, ML can identify the atoms with strong resistance or high compliance to thermal activation, with an accuracy higher than that of ML models for stress-driven activation events. Interestingly, a quantitative “between-task” transferring test reveals that our learnt model can also generalize to predict the propensity of shear transformation. Our dataset is potentially useful for benchmarking future ML models on structure-property relationships in MGs.

Introduction

Metallic glasses (MGs), as a unique class of amorphous materials, exhibit a high atomic packing density with pronounced topological and chemical short-to-medium range order^1,2,3,4. The complex local structures have been demonstrated to have a profound influence on the properties of MGs⁵. In essence, many properties of MGs can be depicted in terms of excursions in the potential energy landscape (PEL)^6,7,8, which is a multidimensional configurational space with local energy minima separated by barriers. In the PEL picture, elementary excitations upon external stimuli (e.g., thermal or mechanical) are associated with the β processes, which correspond to the hopping between nearby local minima, i.e., sub-basins inside a deep PEL megabasin⁹. Elementary excitations have been correlated with many properties^10,11,12, including local plastic deformation^13,14, diffusion mediated by atomic hopping¹⁵, as well as structural relaxation (local energy minimization in the direction towards the bottom of the basin) or rejuvenation (to a higher-energy local minimum)¹⁶.

It remains a long-standing challenge to unravel the role of static structure in controlling the elementary excitations in MGs: is there a structural indicator that can be tapped into to predict how resistant or compliant different local regions are to externally stimulated activation? Over the past several decades, many efforts have been devoted to addressing this critical question. Recently, the emerging machine learning (ML) technique, based on well-crafted representations of the atomic environment, has proven promising for establishing atomic-level structure-property relationships in liquids and glasses^{17,18,19,20,21,22,23,24}. For example, Schoenholz et al.¹⁷ studied L-J model liquids and utilized ML to derive a structural parameter called “softness”, which was found to correlate well with the particle’s propensity for hopping, reflecting its susceptibility to β relaxation of liquids¹⁰. Below the glass transition temperature, metallic liquids become frozen into glass solids and the timescale of the glass dynamics becomes very long, well beyond the capability of atomistic (e.g., molecular dynamics) simulations. We, therefore, have to resort to local perturbation methods, which activate a local group of atoms into excited states by a stress or thermal stimulus, as a probe into the susceptibility to elementary excitations. Several recent ML studies have focused on quantitatively gauging how the local environment influences the propensity for stress-activated β processes (i.e., shear transformations) in MGs^18,19,20. For example, the pioneering work of Cubuk et al.¹⁸ applied ML to disordered materials such as L-J glasses and granular systems and showed that radial and bond-angle distribution information can be used to identify atoms with a high propensity for shear transformation. Wang et al.¹⁹ developed interstice distribution as a new local structural representation for MGs, which has proven robust in predicting plastic sites of several MGs and has advantages in generalizing between compositions and even chemical systems. However, the accuracy achieved in these attempts is not yet sufficiently high, and the reported scoring metric, e.g., recall or area under the receiver operating characteristic curve (AUC-ROC), is typically below 80%. One reason for this is that the elementary excitations upon shear transformations are complicated by the non-uniform tensorial stress field in the solid under deformation, as well as the dependence of activation on loading conditions (e.g., loading mode and direction)^25,26. If not properly dealt with, these would introduce non-trivial noise in the accrued data and adversely influence the quality of the learnt structure-property relations.

This problem, however, subsides when dealing with the thermally induced elementary excitations in MGs. For instance, here we use the activation-relaxation technique (ART)^27,28 to probe the propensity for thermal activation of each atom in MGs (see schematic description in Fig. 1a, which will be discussed later). These activated processes are not subject to internal non-uniform stresses, and can be well converged by averaging over a considerable number of activation pathways, significantly reducing noise in the data. Meanwhile, the Gaussian-like distribution of thermal activation energetics (Fig. 1b, to be discussed later) can well identify atoms at both the hard and soft ends, corresponding to locally favored and unfavored motifs, respectively. This avoids the problem associated with common stress activation indicators (e.g., non-affine displacement or von Mises strain), which often exhibit a skewed “long-tail” distribution²⁹ whose resolution at the hard end is much lower than at the soft end. Moreover, thermally activated events are comparable in their energetics, at least for some MGs that are based on some common (or similar) elements, even when they are of different composition or processing history; as such, multiple “datasets” can be combined to facilitate the ML identification of the structural underpinning in more general terms.

**Fig. 1: Thermally activated events in Cu-Zr metallic glasses.**

In this work, we develop ML models to predict the propensity of thermally activated elementary excitation, from the atomic environment of the static MG structure. We systematically probe the activation energies in six MGs, including Cu₆₄Zr₃₆ prepared under different quenching rates, as well as Cu₅₀Zr₅₀ and Cu₈₀Zr₂₀, using ART^27,28. The activation energy around each atom is calculated, and ensemble-averaged over 50 activation trials, to indicate its susceptibility to excitation. We then combine the data from the six MGs into a wider activation energy spectrum (Fig. 1b) and use ML to identify those atoms with strong resistance or high compliance to activation. By fingerprinting the atomic site environment with a recently proposed interstice distribution representation¹⁹, we find that ML can reliably identify atoms with the highest 5% and lowest 5% activation energy, reaching an area under the receiver operating characteristic curve (AUC-ROC) of 0.942 and 0.888, respectively. Such accuracies are considerably better than those in previous ML predictions of the propensity for stress-driven shear transformations^18,19. We rigorously compare our ML results with those obtained using several other feature representations, and identify descriptors that are critical to our ML decision; interestingly, most of them turn out to be medium-range order features. Finally, we conduct quantitative “between-task” transferring tests and show that our learnt model can be used to predict the propensity for shear transformation as well. This ML work highlights the predictive power of local static structure to quantitatively connect with β processes in MGs.

Results

Energy barriers for thermally activated β processes

We employ molecular dynamics (MD) simulation to prepare six Cu-Zr model MG samples: (i) different compositions yet with the same cooling rate (Cu₅₀Zr₅₀, Cu₆₄Zr₃₆, and Cu₈₀Zr₂₀ quenched from liquid at 10¹⁰ K s⁻¹), and (ii) same composition but with different cooling rates (Cu₆₄Zr₃₆ MGs with the quenching rates of 10⁹ to 10¹² K s⁻¹) (see Methods for simulation details). We then apply ART to probe the energy barrier for thermally activated events^27,28. The physical picture behind ART is that the energy barrier is crossed in a way that mimics thermal activation, although the crossing is driven by the ART algorithm rather than by a real temperature (the process is carried out at 0 K).

Around each atom in those MGs, we initiate 50 independent activation events along random activation pathways (illustrated by the dashed red lines in Fig. 1a, see Methods for more details). The ensemble-averaged activation energy, E_act, can then be defined as the average energy difference between the saddle point and the initial state,

$$E_{\mathrm{act}} = \left\langle E_{\mathrm{saddle}} - E_{\mathrm{initial}} \right\rangle \tag{1}$$

The average over 50 independent activations around each atom is sufficient to achieve a converged E_act, which contains key statistical information on thermal excitations in each local region (including the center atom and its neighbors). We chose to average all the activation barriers into an “effective” barrier as the target variable for this ML study. Such an “effective” barrier can be considered a purely thermodynamic description that aims to provide relatively complete information on the local topology of the potential energy landscape. Other options for the target variable, such as the lowest energy barrier, are discussed in Supplementary Table 1.
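
As a minimal sketch of Eq. (1), the snippet below averages the barriers of several activation trials for a single atom and also computes the lowest barrier, the alternative target mentioned above. The function names and the energies are hypothetical and chosen only for illustration; they are not taken from the ART code used in this work.

```python
from statistics import mean


def ensemble_activation_energy(e_initial, saddle_energies):
    """Average barrier <E_saddle - E_initial> over all trials of one atom."""
    if not saddle_energies:
        raise ValueError("need at least one activation trial")
    return mean(e_s - e_initial for e_s in saddle_energies)


def lowest_activation_energy(e_initial, saddle_energies):
    """Alternative target variable: the lowest barrier among the trials."""
    return min(e_s - e_initial for e_s in saddle_energies)


# Hypothetical energies in eV; the study itself uses 50 trials per atom.
e_initial = -100.0
saddle_energies = [-98.0, -97.5, -99.0, -98.5]

e_act = ensemble_activation_energy(e_initial, saddle_energies)
e_min = lowest_activation_energy(e_initial, saddle_energies)
```

With real data, the same two functions would be applied atom by atom to produce the E_act spectrum.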

Figure 1b shows the distribution of E_act in the six MGs. The dashed vertical line denotes the 50th percentile (median) of E_act in each MG. The wide spread of E_act signifies a large degree of structural and property heterogeneity in each glass. As mentioned in the Introduction, the Gaussian-like distribution of E_act observed is very different from that for stress-activated events, where a “long-tail” distribution is often observed in the stress activation indicator (e.g., non-affine displacement or von Mises strain)²⁹. The E_act spectrum clearly depends on the MG composition and quenching rate. Next, we merge the E_act data of the six MGs into a more comprehensive E_act spectrum (Fig. 1b). The combined spectrum markedly increases the variety of local environments surveyed, far beyond what is present in a single MG. Later, we will feed this combined dataset to ML and test whether ML is capable of mapping out the characteristic atoms at both the high E_act (hard) and low E_act (soft) ends of these Cu-Zr MGs.

Connecting activation barriers with local atomic environment

We make use of a set of interstice distribution descriptors to represent the local atomic environment¹⁹. The basic fingerprinting procedure is to extract groups of bonds, facets and tetrahedra from the coordination polyhedron of an atom, and then featurize the distribution of interstitial spaces present in these bond, facet, and tetrahedron groups. A simple way to represent the distribution is to derive typical statistics (such as minimum, mean, maximum, and standard deviation) of the interstitial spaces present. The characterization of bond, facet, and tetrahedron interstices can include 2-body, 3-body, and 4-body correlations, respectively, in the nearest-neighbor, short-range-order (SRO) signatures. The SRO signatures are then “coarse-grained” by deriving statistics among the neighbors of each atom. Such “coarse-grained” signatures are a representation of medium-range order (MRO), with a length scale of ~4–6 Å, which is the next-level structural organization beyond the SRO. Upon implementation, the interstice representation contains 80 descriptors: 16 SRO and 64 MRO. The codes for this representation can be publicly accessed in amlearn¹⁹ (https://github.com/Qi-max/amlearn) and matminer³⁰ (https://github.com/hackingmaterials/matminer). This representation has been demonstrated to be highly predictive, interpretable, and generalizable in a range of MGs¹⁹.
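
The two-level construction described above (statistics within a coordination polyhedron, then statistics of those statistics over neighbors) can be sketched as follows. This is an illustrative toy and not the amlearn or matminer API: the function names, the dictionary layout, the feature-name format, and the use of the population standard deviation are all assumptions.

```python
from statistics import mean, pstdev


def summarize(values):
    """Minimum, mean, maximum and (population) standard deviation."""
    return {"min": min(values), "mean": mean(values),
            "max": max(values), "std": pstdev(values)}


def sro_features(interstices):
    """SRO signature of one atom from the interstices in its polyhedron."""
    return summarize(interstices)


def mro_features(atom, neighbors, sro, stat="std"):
    """Coarse-grain one SRO statistic over the nearest neighbors of `atom`."""
    values = [sro[j][stat] for j in neighbors[atom]]
    return {f"MRO_{name} {stat}": v for name, v in summarize(values).items()}


# Hypothetical interstice values and neighbor lists for three atoms.
interstices = {0: [0.1, 0.2, 0.3], 1: [0.2, 0.2, 0.2], 2: [0.0, 0.4, 0.2]}
neighbors = {0: [1, 2], 1: [0, 2], 2: [0, 1]}

sro = {atom: sro_features(vals) for atom, vals in interstices.items()}
mro_atom0 = mro_features(0, neighbors, sro)
```

In this toy, `mro_atom0["MRO_mean std"]` plays the role of a feature such as the mean over neighbors of the interstice standard deviation.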

After featurizing all atoms in the six MGs, we feed the data to a scalable tree boosting ML algorithm, XGBOOST³¹. XGBOOST implements a parallel tree boosting algorithm that is proven to be very efficient and robust in various cases. We train two sets of XGBOOST classifiers to identify the highest 5% and the lowest 5% E_act atoms, respectively, in the combined dataset merged from six MGs (Fig. 1b). Varying the threshold from 3% to 10% gives similar results, and in general, the smaller the fraction, the better the ML score (i.e., the easier for ML to identify). As we are dealing with an imbalanced dataset, we do random equal undersampling three times to create three data samples, each with 3000 positive class atoms (the highest or lowest E_act atoms) and 3000 negative class atoms. We then perform 5-fold cross-validation on each of the data samples, and average the predictions on the test sets (i.e., averaged over 5 × 3 = 15 test sets). The repeated undersampling procedure is very useful for reducing the variance introduced by data undersampling.
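
The sampling protocol above can be sketched with the standard library alone. The sketch assumes plain (non-stratified) folds and uses a toy label vector with far fewer atoms than the 3000 per class used in this work; the function names are hypothetical, and the model training step is omitted.

```python
import random


def equal_undersample(labels, n_per_class, rng):
    """Draw n_per_class indices from each class, without replacement."""
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    return rng.sample(pos, n_per_class) + rng.sample(neg, n_per_class)


def k_fold_splits(indices, k, rng):
    """Shuffle the indices and yield (train, test) index lists for k folds."""
    idx = list(indices)
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, folds[i]


rng = random.Random(0)
labels = [1] * 50 + [0] * 950      # toy data: 5% positive class

test_sets = []
for _ in range(3):                 # three undersampled data samples
    sample = equal_undersample(labels, 30, rng)
    for train, test in k_fold_splits(sample, 5, rng):
        test_sets.append(test)     # a classifier would be fit on `train` here
```

Averaging the scores over the resulting 15 test sets is what damps the variance caused by any single random draw of the negative class.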

We use the area under the receiver operating characteristic curve (AUC-ROC) as the scoring metric of the classifiers. The ROC curve characterizes the tradeoff between the true positive rate (TPR) and false positive rate (FPR)³²:

$$\mathrm{TPR} = \frac{\mathrm{TP}}{\mathrm{TP} + \mathrm{FN}}, \quad \mathrm{FPR} = \frac{\mathrm{FP}}{\mathrm{FP} + \mathrm{TN}} \tag{2}$$

TPR is also known as recall or sensitivity, where TP and FN are short for true positive and false negative, respectively. FPR is the false-alarm rate, where FP and TN stand for false positive and true negative, respectively. AUC-ROC, measuring the area underneath the ROC curve, is a widely used metric to evaluate a classifier³². By definition, an AUC-ROC of 0.5 indicates the classifier performs no better than random chance level, 1.0 signifies perfect classification, and the higher the AUC-ROC, the better the model is at distinguishing the classes. Figure 2a presents the ROC curve and its AUC in classifying the highest and lowest 5% E_act atoms, respectively. For simplicity, these two ML problems are referred to as “H-E_act” and “L-E_act” hereafter. We see that the XGBOOST model trained from interstice distribution can well distinguish the high E_act atoms from the rest of the glass, reaching a very high AUC-ROC of 0.942. These high E_act atoms are particularly resistant to thermal activation and “pin” the local rearrangement. While there is an increased ambiguity in classifying the lowest E_act atoms, the AUC-ROC is also high (0.888), indicating there is also significant structural contrast at the soft end. One can directly observe from the ROC curve the TPR and FPR values at various probability thresholds for designating the classes.
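
To make Eq. (2) and the AUC-ROC concrete, here is a small self-contained sketch that sweeps the decision threshold over the predicted scores, collects the (FPR, TPR) points, and integrates them with the trapezoidal rule. The labels and scores are made up for illustration, and the function names are hypothetical; a library routine would normally be used in practice.

```python
def roc_points(y_true, scores):
    """(FPR, TPR) points of the ROC curve, one per distinct score threshold."""
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    points = [(0.0, 0.0)]
    tp = fp = 0
    prev = None
    for i in order:
        # Emit a point only when the score changes, so ties share one threshold.
        if prev is not None and scores[i] != prev:
            points.append((fp / n_neg, tp / n_pos))
        if y_true[i] == 1:
            tp += 1
        else:
            fp += 1
        prev = scores[i]
    points.append((fp / n_neg, tp / n_pos))
    return points


def auc_roc(y_true, scores):
    """Trapezoidal area under the ROC curve."""
    pts = roc_points(y_true, scores)
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# Toy labels (1 = positive class) and predicted scores.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
toy_auc = auc_roc(y_true, scores)
```

In the toy data, eight of the nine positive-negative pairs are ranked correctly, which is exactly the fraction the area measures.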

**Fig. 2: Predicting the heterogeneity of thermally activated events.**

Besides outputting a “label” (0 or 1) to predict whether an atom belongs to a class or not, XGBOOST (and many other ML algorithms) can also give continuous probability estimates, in the range of [0, 1], to reveal the confidence level of predictions. The probabilities can reveal the uncertainty of prediction, allow some flexibility in using the model, and provide a more nuanced way to assess the model. However, raw class probabilities from nonlinear ML algorithms are often not well-calibrated and should be carefully checked before interpretation. Specifically, if the predicted probabilities match the “real” class probabilities, such probabilities are referred to as calibrated. For instance, when the positive class probability of some data points is 0.70, ideally these points should indeed have a probability of 0.70 to be positive. This ideal calibration occasion is illustrated by the diagonal line in Fig. 2b. In this work, we employ a post-training calibration method called isotonic regression³³ to improve the calibration performance of our probability estimates. As seen in Fig. 2b, the calibration curves of p_h and p_l, i.e., the probability estimates from models obtained in the “H-E_act” and “L-E_act” ML problems, respectively, are both demonstrated to be close to perfect calibration. The area between the calibration curve and the perfect calibration line, as a measure of miscalibration, is very low in both cases (Fig. 2b). Thus, our machine-learnt probability estimates can well reflect the real class probabilities and warrant further interpretation.
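
Isotonic regression fits a non-decreasing map from raw scores to observed outcomes. A minimal sketch of the classic pool-adjacent-violators procedure is given below; it is a simplified stand-in and not necessarily the implementation used in this work. It returns a step function rather than an interpolated one, treats tied scores as separate points, and uses hypothetical function names and toy data.

```python
def isotonic_fit(raw_scores, outcomes):
    """Pool-adjacent-violators fit; returns (upper score, value) steps."""
    blocks = []  # each block: [sum of outcomes, count, largest score]
    for s, y in sorted(zip(raw_scores, outcomes)):
        blocks.append([float(y), 1, s])
        # Merge backwards while the previous block mean exceeds the last one.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count, top = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
            blocks[-1][2] = top
    return [(top, total / count) for total, count, top in blocks]


def isotonic_predict(model, score):
    """Calibrated probability: value of the first step covering `score`."""
    for top, value in model:
        if score <= top:
            return value
    return model[-1][1]


# Toy raw classifier scores and true labels (1 = positive class).
raw_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
outcomes = [0, 1, 0, 1, 1, 1]

model = isotonic_fit(raw_scores, outcomes)
calibrated = isotonic_predict(model, 0.25)
```

Here the out-of-order pair at scores 0.2 and 0.3 is pooled into a single step, so any raw score in that range maps to the pooled positive fraction.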

We proceed to look more into the distributions of the ML-evaluated class probability estimates, that is, p_h from H-E_act and p_l from L-E_act. Figures 3a and 3b present the overall p_h and p_l distributions in the six MGs as well as the variation of E_act with p_h and p_l. A wide distribution of p_h and p_l is observed, revealing a large degree of heterogeneity inside the MGs. p_h has a larger proportion of atoms close to 0 and 1, again indicating that ML is more confident at distinguishing the high E_act atoms. A strong dependence of E_act on p_h and p_l is observed (Fig. 3b), that is, positively correlated with p_h and negatively correlated with p_l, demonstrating the feasibility of p_h and p_l serving as indicators of the thermal activation propensity. We further visualize the distribution of E_act, p_h and p_l in a model Cu₆₄Zr₃₆ glass to allow atomic-scale scrutinization (Fig. 3c). For simplicity, only atoms with p_h or p_l > 0.50 are highlighted in the p_h and p_l maps: ML predicts that the probability of these atoms belonging to the highest 5% E_act or lowest 5% E_act class is greater than 0.50; if setting a class threshold as 0.50, these p_h or p_l > 0.50 atoms would then be classified as the high or low E_act class, respectively. A good correspondence can be seen between the high E_act atoms and high p_h atoms, as well as between the low E_act atoms and high p_l atoms. As reflected by the relatively lower prediction score in the L-E_act task, there are more false-positive atoms (high p_l yet high E_act) and false-negative atoms (low p_l yet low E_act) in predicting the low E_act atoms, but still the prediction quality is sufficiently good. These results reveal that a solid relationship between local structure and thermal activation propensity can be established by combining interstice features and ML. 
We also perform direct regression of E_act using the interstice features; the Pearson correlation coefficients and parity plots are presented in Supplementary Table 3.

**Fig. 3: ML-evaluated class probability.**

Comparison with ML models employing other feature representations

Next, we compare our ML results based on interstice features with those fitted from several other representations. Here we consider a total of eight pure structural representations and three physical signatures for comparison (Table 1). To guarantee a fair comparison, training is performed on the same data samples and same cross-validation splits. We train XGBOOST³¹ and SVM³⁴ models with various hyperparameters and extract the best scores for each representation. Most of the presented scores are from XGBOOST, while the best scores of the radial symmetry functions, bispectrum coefficients and smooth-overlap of atomic positions (SOAP) are from linear SVM, and moment tensor potential (MTP) internally uses linear regression to build the potential model (Table 1). The detailed ROC curves can be found in Supplementary Table 2. Besides, an additional feature indicating whether the atom is Cu (0) or Zr (1) is added to each representation to help ML decisions. This is very helpful for representations that cannot well distinguish the atom types from the features themselves.

Table 1 Using various pure structural representations or physical signatures to classify the highest 5% (denoted as H-E_act problem) and lowest 5% E_act atoms (L-E_act problem) of the combined E_act spectrum merged from six MGs (Fig. 1b).


We start with two “baseline” models built with: (i) two one-hot-encoded (0 or 1) variables designating whether the nearest neighbors around an atom form a <0, 0, 12, 0, 0> (<0, 0, 12, 0> if omitting occasional facets with >6 edges) or <0, 0, 12, 4, 0> Voronoi polyhedron or not; (ii) five integer Voronoi indices (n₃, n₄, n₅, n₆, and n_>6), where n_x represents the number of x-edged facets in the Voronoi polyhedron³⁵. Many studies revealed that the Cu-centered <0, 0, 12, 0, 0> icosahedra and Zr-centered <0, 0, 12, 4, 0> polyhedra are among the most stable motifs in Cu-rich Cu-Zr MGs^36,37. In this work, ~21.4% of the Cu atoms in the six MGs are surrounded by icosahedra and 6.0% of the Zr atoms are <0, 0, 12, 4, 0>. A baseline model can then simply predict the atoms centered in icosahedra or <0, 0, 12, 4, 0> as high E_act atoms and all others as low E_act. We find that the AUC-ROC achieved by such a baseline model is not satisfactory, i.e., 0.673 and 0.557 in the H-E_act and L-E_act tasks, respectively (Table 1). As seen in Supplementary Table 2, the TPR (recall) of this baseline model in classifying the highest 5% E_act atoms is ~0.48, indicating that only a fraction of ~0.48 of the highest E_act atoms are among the icosahedra and <0, 0, 12, 4, 0> atoms. Not surprisingly, this heuristic model works worse in classifying the lowest E_act atoms, as icosahedra and <0, 0, 12, 4, 0> are aimed at prototyping the most stable motifs, and not forming those motifs does not necessarily mean that an atom is soft. This results in a large FPR and ultimately a small AUC-ROC of 0.557 in the L-E_act task (Supplementary Table 2). As to the second baseline model trained from the Voronoi indices, the prediction is better, with AUC-ROC of 0.750 and 0.628, respectively (Table 1, the ROC curves are presented in Supplementary Table 2). 
We see that by allowing the model to decide from the detailed Voronoi indices instead of from several predefined motifs only, the model can capture more subtle structural information and make better decisions in both tasks. These two sets of models are basically based on the well-established Voronoi indices and are relatively simple to set up, forming the baseline models in our tasks; and ideally, any proposed ML models should well outperform the baseline models.
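
The first baseline amounts to a lookup rule on the Voronoi index, which can be sketched as below. The atoms and labels are invented for illustration and the function names are hypothetical. The last helper relies on a general property of hard 0/1 predictions: their ROC curve has a single interior point, so the area reduces to (1 + TPR - FPR) / 2.

```python
# Voronoi indices (n3, n4, n5, n6, n>6) of the two stable motifs.
STABLE_MOTIFS = {(0, 0, 12, 0, 0), (0, 0, 12, 4, 0)}


def predict_hard(voronoi_index):
    """Baseline rule: 1 (high E_act) if the atom centers a stable motif."""
    return int(tuple(voronoi_index) in STABLE_MOTIFS)


def tpr_fpr(y_true, y_pred):
    """True and false positive rates as defined in Eq. (2)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp / (tp + fn), fp / (fp + tn)


def hard_label_auc(tpr, fpr):
    """AUC-ROC of a classifier that outputs only 0/1 labels."""
    return (1.0 + tpr - fpr) / 2.0


# Toy atoms: Voronoi index and whether the atom is truly in the hard class.
voronoi = [(0, 0, 12, 0, 0), (0, 0, 12, 4, 0), (0, 2, 8, 2, 0),
           (0, 0, 12, 0, 0), (0, 3, 6, 4, 0), (0, 2, 8, 4, 0)]
is_hard = [1, 1, 1, 0, 0, 0]

predicted = [predict_hard(v) for v in voronoi]
tpr, fpr = tpr_fpr(is_hard, predicted)
baseline_auc = hard_label_auc(tpr, fpr)
```

The toy case shows both failure modes discussed above: a hard atom outside the two motifs lowers the TPR, and a motif-centered atom that is not hard raises the FPR.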

Next, we combine a group of SRO features as the third structural representation for comparison, including characteristic motif signatures and Voronoi indices³⁵ as used in the baseline models, coordination number (CN) within a cutoff distance (4.0 Å) or in a Voronoi polyhedron, Voronoi volume, and bond-orientational order parameters (q_l and w_l, where l = 4, 6, 8, and 10)³⁸. This representation achieves an AUC-ROC of 0.807 in the H-E_act task and 0.634 in the L-E_act task (Table 1, see Supplementary Table 2 for ROC curves). The inclusion of bond-orientational order features accounts for the increase of AUC-ROC compared with baseline model 2. The L-E_act task remains harder than the H-E_act task for the structural representation to predict. Beyond SRO, interestingly, if we further augment the SRO features with the coarse-grained MRO features (taking statistics between nearest neighbors¹⁹, as applied in the interstice representation), the predictive ability is greatly enhanced (Table 1, see Supplementary Table 2 for ROC curves). This suggests that it is important to bring MRO into the prediction scheme (the importance of MRO will be discussed in more detail later).

As another important group of structural representations, we adopt four representations that are originally designed for fitting ML potentials: (i) radial symmetry functions^{17,18,20,22,23,24,39}; (ii) bispectrum coefficients of density functions^40,41; (iii) moment tensor potential (MTP)^42,43; and (iv) smooth-overlap of atomic positions (SOAP)⁴⁴. Please see Methods for details. The ML results are summarized in Table 1 and ROC curves are shown in Supplementary Table 2. We see that these four representations can all well predict the high E_act atoms (AUC-ROC > 0.90), while the scores in predicting low E_act atoms are lower. The MTP and SOAP descriptors achieve the best scores in this group of structure representations. Going beyond the radial symmetry functions that only contain radial information, including angular information in the MTP and SOAP descriptors increases the prediction accuracy, yet does not induce a very significant improvement. This can be because in MGs, due to the removal of crystallographic restraints, the angular distribution tends to be close to that preferred in poly-tetrahedral packing without significant variation. Comparatively speaking, incorporating an effective representation of MRO, which has been demonstrated to pose a huge effect on the glass properties, has improved the prediction performance to a greater degree. This is demonstrated in Table 1 for the excellent accuracy of the interstice distribution representation with MRO incorporated, as well as the remarkable increase of accuracy for the simple SRO features when augmented by MRO ones. Besides, in previous studies, Schoenholz et al.¹⁷ used the radial symmetry function representation to classify atoms with high propensity for hopping (soft end) in L-J liquids and achieved a very high recall of ~90%. 
The relatively lower accuracy in the current L-E_act task (which also corresponds to the soft end) suggests that identifying atoms susceptible to β relaxation in solid-state MGs could be harder than in the parent supercooled liquids, given that the same set of features achieves a lower score in the former problem. Other possible factors are (i) an inherent difference in prediction accuracy between Cu-Zr MGs described by an EAM potential and supercooled liquids described by a pairwise L–J potential, and (ii) the combination of different compositions and quenching rates in a single dataset, which may increase the ambiguity for the radial symmetry functions.

Finally, we compare the results of the pure structural representations with the results of three physical signatures, namely flexibility volume V_flex⁴⁵, and atomic and coarse-grained shear moduli G⁴⁶ (see Methods). Table 1 summarizes their prediction scores and the ROC curves are presented in Supplementary Table 2. These signatures require detailed knowledge of interatomic potentials to calculate and thus are not pure structural representations. Among the physical descriptors, V_flex fares much better than atomic or coarse-grained G in correlating with E_act. We find that some pure structural representations (interstice, SRO + coarse-grained MRO, and the four ML potential representations) are still very competitive compared with these physical signatures (Table 1), further advocating the use of proper structural representation, with the aid of ML, to establish the structure-property relationship in MGs. The interstice distribution features achieve the highest prediction score in both the H-E_act and L-E_act tasks. Such quantitative benchmarks are important for obtaining a clear picture of the structure-property relations proposed in MGs. We also note that, strictly speaking, the relative performance of each representation can be task-specific. Thus, for a future task of interest, we recommend conducting rigorous benchmarking of this kind to locate the best representation for maximal ML performance.

Impact of medium-range environment on activated events

Thus far, we demonstrate that our ML model, employing the interstice features that start from static atomic positions only, can well predict the heterogeneity of thermal-activated elementary excitations in Cu-Zr MGs. We next look into how the ML models make decisions based on the input features.

ML algorithms such as XGBOOST allow quantification of feature importance, which evaluates how each descriptor improves the performance measure, e.g., Gini index for XGBOOST, and thus can be particularly useful for model interpretation. For ease of interpretation, we first remove some highly linearly correlated features (Pearson correlation coefficient > 0.70) and then reduce the feature number to 10 by a brute-force recursive feature-elimination procedure: i) train a model with N features and evaluate its performance; ii) remove each of the N features in turn, retrain an ML model with the remaining N - 1 features, and calculate the performance loss (if any) compared to the original model with N features; iii) eliminate the feature whose removal causes the least performance loss. The rationale is that dropping an unimportant feature should not degrade the performance significantly. We recursively repeat the above procedure until the feature dimension is reduced to 10.
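
The elimination loop in steps i) to iii) can be sketched generically. The scoring function here is a hypothetical stand-in: in a real run it would retrain the classifier on the given feature subset and return a cross-validated AUC-ROC, whereas the toy version just adds up invented per-feature contributions so that the example runs instantly.

```python
def backward_elimination(features, score_fn, n_keep):
    """Repeatedly drop the feature whose removal costs the least score."""
    kept = list(features)
    while len(kept) > n_keep:
        base = score_fn(kept)                      # step i
        losses = {f: base - score_fn([g for g in kept if g != f])
                  for f in kept}                   # step ii
        kept.remove(min(kept, key=losses.get))     # step iii
    return kept


# Hypothetical per-feature contributions standing in for a retrained model.
toy_gain = {"a": 0.30, "b": 0.02, "c": 0.20, "d": 0.01}


def toy_score(subset):
    """Stand-in for 'retrain and evaluate'; returns a pseudo AUC-ROC."""
    return 0.5 + sum(toy_gain[f] for f in subset)


selected = backward_elimination(["a", "b", "c", "d"], toy_score, n_keep=2)
```

Each pass needs one retraining per remaining feature, which is why the procedure is described as brute-force and why the correlated features are filtered out first.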

Figure 4a visualizes the final 10 features and their Pearson correlation matrix. We abbreviate the subscript “interstice” as “is”; and for several distance interstice features, the subscript “dist” in d_is-dist indicates that the nearest neighbors are determined by a cutoff distance rather than by the default Voronoi tessellation. The 10 features exhibit low Pearson correlation coefficients (the maximum is 0.63). Interestingly, we find that 9 out of the 10 surviving features describe the interstice distribution in the medium range (i.e., with “MRO” in the feature name). This again suggests that MRO contributes greatly to the decision making. According to the feature importance, MRO_mean Std(V_is) and MRO_mean Std(d_is-dist) are the most important features in the L-E_act and H-E_act tasks, respectively (Fig. 4b). These two metrics evaluate the average variation of the tetrahedron volume interstice and bond distance interstice in the medium range around an atom. This emphasizes the importance of local structure anisotropy, persisting to the medium range, to the glass property. For the L-E_act task, MRO_mean Std(V_is) stands out with a very high importance, whereas for the H-E_act task, the feature importances are distributed more evenly.

We then select typical hard and soft Cu (Zr) atoms and show the distribution of tetrahedron volume interstice, V_is, and bond distance interstice, d_is-dist, in their local environment to demonstrate the inherent structural contrast between the hard and soft atoms. Typical atoms with high E_act (~2.9 eV) and low E_act (~0.7 eV) are selected, and the red and purple histograms show the spread of interstices, V_is and d_is-dist, present in the coordination polyhedron (SRO) and in the neighboring clusters (MRO), respectively (Fig. 5a and b). We find that the V_is and d_is-dist distributions in the SRO of the high E_act atoms (Fig. 5a) are distinctly more centered than those of the low E_act ones (Fig. 5b). For the low E_act atoms, there often exist some tetrahedra or bond segments that have a very low or very high content of interstice. This would lower the stability of the local environment and predispose the atom to respond to thermal excitation. Remarkably, this trend persists to the medium range (purple histograms). As quantified by Fig. 4b, the MRO interstice distribution is even more important than the SRO one. The sharp contrast in the interstice distribution illustrates the foundation of our ML success in distinguishing the characteristic atoms.

**Fig. 5: Structural contrast between the high and low E_act atoms.**

Next, we use principal component analysis (PCA)⁴⁷ to project the information in the high-dimensional feature space (R¹⁰, the ten features in Fig. 4) into a low-dimensional space (R²) to visualize the inherent data structure of the site environment signatures (Fig. 5c). PCA is a dimensionality reduction method that uses an orthogonal transformation to convert possibly correlated features into uncorrelated variables with the key information preserved, and it is fully unsupervised (it uses no class labels and needs no training)⁴³. From Fig. 5c, we see that the high E_act and low E_act atoms do tend to reside in very different regions (the ratios of variance explained by principal components 1 and 2 are 0.303 and 0.209, respectively). Returning to the supervised ML results above, strong structural contrast at both the hard and soft ends is also revealed (Figs. 2 and 3). Here, both supervised and unsupervised analyses suggest a highly inhomogeneous MG structure, with distinctive hard (solid-like) and soft (liquid-like) atoms dispersed inside.
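The projection described above can be sketched with a singular value decomposition. This is an illustrative example on synthetic data: the function name `pca_project`, the two synthetic groups, and the choice to center (but not rescale) the features are assumptions, since the preprocessing used for Fig. 5c is not stated here.

```python
import numpy as np


def pca_project(features, n_components=2):
    """Project rows of `features` onto the leading principal components.

    Returns the projected coordinates and the explained-variance ratios.
    """
    X = np.asarray(features, dtype=float)
    X_centered = X - X.mean(axis=0)  # PCA requires mean-centered columns
    # Rows of Vt are the orthonormal principal axes, ordered by singular value.
    _, singular_values, Vt = np.linalg.svd(X_centered, full_matrices=False)
    explained_ratio = singular_values**2 / np.sum(singular_values**2)
    scores = X_centered @ Vt[:n_components].T
    return scores, explained_ratio[:n_components]


# Synthetic stand-in: two groups of sites in a 10-dimensional feature space.
rng = np.random.default_rng(1)
hard_like = rng.normal(loc=-1.0, size=(200, 10))
soft_like = rng.normal(loc=+1.0, size=(200, 10))
scores_demo, ratio_demo = pca_project(np.vstack([hard_like, soft_like]))
print(scores_demo.shape, ratio_demo)
```

No class labels enter the calculation; any separation of the two groups along the first component arises from the feature values alone, which is the sense in which the analysis is unsupervised.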

Transferability to identifying shear transformation propensity

As mentioned earlier, in addition to thermally activated events, another important type of elementary excitation is the local shear transformation activated by stress^{48,49,50,51,52,53}. The units with low stress resistance are usually referred to as shear transformation zones (STZs). As discussed in the Introduction, the thermal- and stress-activated excitations can both be interpreted in the framework of β processes; however, the atom-specific response can vary because the stimuli differ in character (uniform vs non-uniform, protocol-independent vs protocol-dependent). This prompts us to ask: how would our ML models, trained to predict the thermal excitation propensity, perform when they are used to identify STZs? Can the models work well when transferred to a different task?

This “between-task” test is challenging in several ways: (i) STZs and E_act are fundamentally different properties, triggered by different stimuli and thus yielding different data; (ii) the features considered important for predicting E_act may not be optimal for identifying STZs. Point (ii) is very likely, as in a previous work using the interstice features to identify STZs in MGs, only ~50% of the most important features were MRO features¹⁹, much lower than the ~90% in the E_act case (Fig. 4). Driven by this question, we simulate athermal quasi-static (AQS) shear deformation of a typical Cu₆₄Zr₃₆ glass quenched at 10⁹ K s⁻¹ (Methods). We calculate the interstice features of each atom and apply the model trained on the L-E_act problem (which focuses on the soft end) to derive the probability estimate p_l of each atom. Intuitively, as p_l correlates positively with the tendency of an atom to be easily activated by the thermal stimulus (Figs. 2 and 3), it may also correlate positively with the susceptibility of the atom to activation by stress.

We calculate the non-affine displacement (${\mathrm{D}}^{2}_{\min}$) relative to the undeformed state, at 4.0% shear strain, as an indicator of the plastic susceptibility of each atom. The correlation between ${\mathrm{D}}^{2}_{\min}$ and p_l is presented in Fig. 6a. Given the “long-tail” distribution of ${\mathrm{D}}^{2}_{\min}$, box plots are used to present the correlation. Box plots are useful for such skewed distributions because they clearly mark the median (a line inside the box), the 25% and 75% quantiles (lower and upper ends of the box), 1.5 times the inter-quartile range (whiskers extending outside the box), and the outliers (points outside the whiskers). The left panel of Fig. 6a shows the complete box plot, in which some outliers extend so far that the box section is squeezed. We therefore highlight the squeezed section, which contains the vast majority of the data, in the right panel of Fig. 6a. A positive correlation between p_l and ${\mathrm{D}}^{2}_{\min}$ is clearly observed, supporting our assumption that these two types of activation could share some structural origins. As a quantitative test, we use p_l to separate the STZs, defined as the atoms with the largest 5% ${\mathrm {D}}^2_{\min}$, from the rest of the glass, similar to the setting of the L-E_act task. We vary the threshold of p_l used to designate the positive/negative classes in this new STZ task, calculate the TPRs and FPRs, and derive the ROC curve in Fig. 6b. The area under the ROC curve, AUC-ROC, is 0.810, a reasonable score for such a transfer test given that the model was trained only on the L-E_act task. This quantitative test provides additional support for the feasibility of this “between-task” generalization.
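The threshold sweep behind such an ROC curve can be sketched as follows. This is a minimal illustration with synthetic numbers: the arrays `p_l_demo` and `d2min_demo`, the helper names, and the simple tie handling (scores are assumed distinct) are assumptions rather than the authors' implementation.

```python
import numpy as np


def top_fraction_labels(values, fraction=0.05):
    """Mark the samples whose value lies in the top `fraction` as positives."""
    values = np.asarray(values, dtype=float)
    return values >= np.quantile(values, 1.0 - fraction)


def roc_curve_points(scores, labels):
    """FPR and TPR obtained by lowering the score threshold one sample at a time."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    order = np.argsort(-scores, kind="stable")  # highest score first
    sorted_labels = labels[order]
    true_pos = np.cumsum(sorted_labels)
    false_pos = np.cumsum(~sorted_labels)
    tpr = np.concatenate(([0.0], true_pos / true_pos[-1]))
    fpr = np.concatenate(([0.0], false_pos / false_pos[-1]))
    return fpr, tpr


def auc_roc(scores, labels):
    """Area under the ROC curve by the trapezoidal rule."""
    fpr, tpr = roc_curve_points(scores, labels)
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))


# Synthetic stand-in: a long-tailed "D2min" that loosely increases with "p_l".
rng = np.random.default_rng(2)
p_l_demo = rng.uniform(size=2000)
d2min_demo = np.exp(2.0 * p_l_demo + rng.normal(size=2000))
stz_demo = top_fraction_labels(d2min_demo, fraction=0.05)
auc_demo = auc_roc(p_l_demo, stz_demo)
print(stz_demo.sum(), auc_demo)
```

An AUC-ROC of 0.5 corresponds to random guessing and 1.0 to perfect separation, so the reported value of 0.810 lies well above chance.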

**Fig. 6: Transferring ML model to identifying STZs.**

As discussed in the Introduction, the accuracy of STZ recognition (for example, refs. ¹⁸ and ¹⁹) is usually lower than that of identifying the thermally activated atoms (ref. ¹⁷ and this work), especially when the same feature representation is used (radial symmetry functions^{17,18} or interstice distribution features¹⁹). Several factors can cause this performance difference. One is the increased internal noise of the STZ data when the data are collected under a single loading condition. As discussed in the Introduction, stress-activated plastic heterogeneity is quite sensitive to loading conditions such as the loading mode and direction^{25,26}; thus, data from a single loading condition could carry non-trivial noise. For the thermal activation data used in this work, the absence of non-uniform stress eliminates the loading-related noise, and probing a sufficient number of elementary ART events can guarantee a well-converged E_act as an indicator of the susceptibility to thermal excitation. In addition, upon deformation, the activation of STZs proceeds progressively, that is, not all soft atoms move in a given straining step; therefore, a relatively large strain is usually required to collect sufficient plastic events. However, this can introduce more cascade activation events, which weaken the control exerted by the initial undeformed structure, and the long-range elastic field present during deformation would also increase the length scale of plastic heterogeneity, taking it beyond the SRO and MRO that the structural representation can describe.

Discussion

For the ML exploration of atomic-level structure-property relationships in amorphous alloys, a commonly pursued goal is the development of novel structural representations and machine learning schemes. This paper, instead, focuses on another important aspect – finding a suitable target property, with minimal data noise, to convincingly test the power of ML in correlating the structure with the property. Through what has been presented above, we have demonstrated that the thermally activated elementary excitation is an excellent choice in this regard. Compared with previous ML models on shear transformations in glasses, the merits of our present success on thermally activated events in MGs are multifold:

i. We reached a high accuracy for ML prediction of elementary excitation in MGs. In this work, ML can accurately identify atoms with the highest 5% and lowest 5% thermal activation energy in a dataset merged from six different MGs, reaching an AUC-ROC of 0.942 and 0.888, respectively. These scores are significantly higher than those achieved in predicting the propensity of shear transformation. As discussed, this is mainly because the thermal activation does not suffer from the effect of non-uniform, oriented stress^{25,26}, and because the data noise can be reduced by a well-converged exploration of elementary excitations. The importance of noise reduction also has implications for constructing high-fidelity glass datasets in the future.
ii. Our ML model is able to identify both locally favored and unfavored structural motifs, rather than only the latter as in previous ML literature^{17,18,19,20,21}. This is aided by the explicit and sufficient ART perturbation tests around each atom, and by the Gaussian-like distribution of thermal activation energetics that gives sufficient resolution at both the soft and hard ends. Among the variety of pure structural representations and physical signatures that we benchmarked, our interstice distribution representation performs best in both ML tasks.
iii. We have demonstrated that the data from multiple compositions or processing histories can be combined to connect with underlying structural signatures. This results from the comparable magnitude/range of activation barriers for different compositions and processing histories in the same MG system. Such treatment can notably increase the variety of local environments surveyed, and allows for structure-property relation mining in more general terms.
iv. Our analysis provides a repertoire of descriptors that are essential to the ML decision. We demonstrate how the ML models make decisions based on the interstice features and interpret why these features work in representing the inherent structural contrast in MGs. Our data-centric results also highlight the importance of MRO in determining the activation heterogeneity, which has implications for the underlying glass physics. Very recently, Bapst et al.⁵⁴ have built graph neural networks that learn, from a large amount of data, to encode the atomic environment via message-passing through an expanded neighborhood. The models achieved impressive scores in predicting the atomic motion in supercooled liquids and the shear-induced events. While such deep learning techniques can provide greater versatility and representational power, ML techniques based on physics-oriented descriptors still have their benefits. For example, interpretability is important for gaining insight into the underlying physics. In this regard, a structural representation such as the interstice distribution features used in this work is fully transparent: what each feature represents is easy to interpret, and the structural insights gained can transfer. Meanwhile, structural representations are often not material- or class-specific, i.e., they are quite general and can be applied in the same way to any glass system, making it easier to judge whether the framework will work outside the training environment.
v. We have conducted a quantitative “between-task” transferring test that successfully transfers the model fitted for pinpointing the atoms with low thermal activation energy to identifying STZs upon AQS shear deformation. This success points to some common structural origins of the thermally activated and stress-activated β processes. It would be interesting to extend such quantitative transferring tests to more glass properties in the future. Although many atom-specific properties have been studied to date, many of them may be intercorrelated; thus, even though an ML model is trained and tested on one task, it may generalize to further tasks and gain a wider range of utility. Establishing quantitative tests over a wider range of properties can also sharpen the general understanding of structure-property correlations in MGs.

Taken together, these advances underscore the structural impact on the β processes and their heterogeneity, and the insights shed light on the role of β processes as a basic unit event underlying a variety of properties of MGs^{10,11,12}, including local plastic deformation^{13,14}, atomic hopping mediating diffusion¹⁵, and structural relaxation/rejuvenation¹⁶. Our discovery, enabled by the well-designed site environment representation and dedicated ML models, is a step toward establishing a concrete structure-property relationship for MGs. We have made our MG configurations and thermal activation energy data public in figshare with the DOI of https://doi.org/10.6084/m9.figshare.12485795, which could serve as a valuable benchmark for future ML studies in MG research.

Methods

MG sample preparation by MD simulation

Molecular dynamics (MD) simulations using LAMMPS⁵⁵ were employed to prepare and analyze the Cu-Zr metallic glass models, using a set of optimized embedded-atom-method (EAM) potentials⁵⁶. Cu₆₄Zr₃₆, Cu₅₀Zr₅₀, and Cu₈₀Zr₂₀ samples containing 10,000 or 5,000 atoms (for the 5,000-atom case, two different samples were prepared under the same processing condition) were quenched to room temperature (300 K) from equilibrium liquids above the corresponding melting points. The quenching was performed at a rate of 10⁹–10¹² K s⁻¹, as marked in Fig. 1b, using a Nose–Hoover thermostat with zero external pressure. Periodic boundary conditions (PBC) were applied in all three directions during MD simulation⁵⁷. The timestep was 1 fs.
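To give a feel for the computational cost implied by these quench rates and the 1 fs timestep, the following back-of-the-envelope sketch counts the MD steps of a linear quench. The starting temperature of 2000 K is a hypothetical placeholder, since the text only states that the liquids were equilibrated above the melting point.

```python
def quench_steps(t_start_k, t_end_k, rate_k_per_s, timestep_s=1e-15):
    """Number of MD steps needed for a linear quench at a constant cooling rate."""
    return round((t_start_k - t_end_k) / (rate_k_per_s * timestep_s))


T_START_K = 2000.0  # hypothetical liquid temperature, NOT taken from the paper

for rate in (1e9, 1e10, 1e11, 1e12):
    steps = quench_steps(T_START_K, 300.0, rate)
    print(f"{rate:.0e} K/s -> {steps:,} steps")
```

Under this assumption, the slowest quench (10⁹ K s⁻¹) requires on the order of 10⁹ steps, a thousand times more than the fastest one.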

Activation-relaxation technique (ART)

Initial perturbations in ART were introduced by applying random displacements to a small group of atoms (an atom and its nearest neighbors)^{27,28}. The magnitude of the displacement was fixed, while the direction was randomly chosen. When the curvature of the PEL was found to cross the chosen threshold, the system was pushed towards the saddle point using the Lanczos algorithm. The saddle point was considered to be found when the overall force of the total system fell below 0.01 eV Å⁻¹. The corresponding activation energy is thus the difference between the saddle point energy and the initial state energy. The searches were performed using the ART nouveau package^{27,28,58}. For each group of atoms, we employed ~50 successful ART searches with different random perturbation directions.

Radial symmetry functions

For an atom i, the radial symmetry functions are described as^{17,18,20,22,23,24,39},

$$G_\alpha (i;r) = \mathop {\sum}\limits_{j \in \alpha } {e^{ - (r_{ij} - r)^2/2\sigma ^2}}$$

(3)

where α represents an atom species in the system (Cu or Zr), r_ij is the distance between atoms i and j, r is the probing distance at which the function is evaluated, and σ is set to 0.2 Å. The sums are taken over all atoms j whose distance to i is within a cutoff R^c (6.5 Å). This set of features can be considered as the Gaussian-smoothed partial pair correlation functions at different r values. Here, we vary r from 1.0 to 8.0 Å with a bin size of 0.2 Å (35 bins), generating 35 features each for the i–Cu and i–Zr pairs. We then use the 70 features as input to train ML models on the same data and cross-validation splits to classify the high E_act and low E_act atoms.
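Equation (3) can be evaluated directly with NumPy, as in the following sketch. The function name, the orthorhombic periodic box with minimum-image distances, and the placement of the 35 r values at 1.0, 1.2, ..., 7.8 Å are assumptions made for illustration; σ = 0.2 Å and the 6.5 Å cutoff follow the text.

```python
import numpy as np

# Assumed placement of the 35 probing distances (1.0, 1.2, ..., 7.8 angstrom).
R_VALUES = 1.0 + 0.2 * np.arange(35)


def radial_symmetry_functions(positions, species, box, center,
                              r_values=None, sigma=0.2, r_cut=6.5):
    """Evaluate G_alpha(center; r) of Eq. (3) for every species alpha.

    Assumes an orthorhombic periodic box whose edges exceed 2 * r_cut, so the
    minimum-image convention gives the correct neighbor distances.
    """
    r_values = R_VALUES if r_values is None else np.asarray(r_values, dtype=float)
    positions = np.asarray(positions, dtype=float)
    species = np.asarray(species)
    box = np.asarray(box, dtype=float)

    delta = positions - positions[center]
    delta -= box * np.round(delta / box)  # minimum-image convention
    dist = np.linalg.norm(delta, axis=1)
    is_neighbor = dist <= r_cut
    is_neighbor[center] = False  # exclude the central atom itself

    features = {}
    for alpha in np.unique(species):
        r_ij = dist[is_neighbor & (species == alpha)]
        gaussians = np.exp(-(r_ij[:, None] - r_values[None, :]) ** 2 / (2.0 * sigma**2))
        features[str(alpha)] = gaussians.sum(axis=0)
    return features


# Toy configuration: one Cu neighbor at 2.6 angstrom and one Zr neighbor at 3.0 angstrom.
positions_demo = np.array([[0.0, 0.0, 0.0], [2.6, 0.0, 0.0], [0.0, 3.0, 0.0]])
species_demo = np.array(["Cu", "Cu", "Zr"])
G_demo = radial_symmetry_functions(positions_demo, species_demo,
                                   box=[20.0, 20.0, 20.0], center=0)
print({name: values.round(3) for name, values in G_demo.items()})
```

Concatenating the Cu and Zr arrays for one atom gives the 70-component feature vector mentioned above.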

Bispectrum coefficients of density functions

The coefficients of the bispectrum of the neighbor density mapped onto the 3-sphere are order parameters that can characterize the radial and angular distribution of the neighbors of an atom⁴². We follow the implementation of the Spectral Neighbor Analysis Potential (SNAP), which uses the bispectrum as its basis⁴¹. The bispectrum coefficients are calculated using the “compute sna/atom” command implemented by Thompson et al. in LAMMPS⁵⁵. We set twojmax to 6 and rfac0 to 0.99363. The scaling factor of the cutoff radius, rcutfac, the cutoff radii, R_Cu/R_Zr, and the neighbor weights, w_Cu/w_Zr, are optimized by grid search and set to 4.0, 0.7/0.8 and 1.0/0.9 for predicting the high E_act atoms, and to 4.0, 1.0/1.0 and 1.0/1.0 for predicting the low E_act atoms.

Moment tensor potential (MTP)

The MTP introduces the moment tensor descriptors^43,44,

$$M_{\mu ,\nu }({\mathbf{n}}_i) = \mathop {\sum}\limits_j {f_\mu \left( {\left| {r_{ij}} \right|,z_i,z_j} \right)} \underbrace {r_{ij} \otimes \ldots \otimes r_{ij}}_{\nu \,{\mathrm{times}}}$$

(4)

to characterize the radial (f_μ) and angular information ($r_{ij} \otimes \ldots \otimes r_{ij}$) of the neighborhood ${\mathbf{n}}_i$. The moments are then contracted to a set of basis functions B_α that are invariant to permutations, rotations, and reflections. In practice, all the basis functions whose level of multiplication (lev B_α) ≤ lev_max are included. The site energies are then expanded as a linear combination of the basis functions. In this work, we set lev_max to 20 and the size of the radial basis to 4, which gives 288 basis functions. The radial parameters in the radial functions, the linear regression coefficients, and the weights of the species (Cu and Zr) are fitted through regression of E_act using a modified version of the MLIP package⁴⁴. The predicted E_act values for the test atoms are then used to derive the ROC curve and AUC-ROC for the present classification tasks (i.e., the TPRs and FPRs are derived by varying the E_act threshold that designates the positive/negative classes, and the area underneath the curve is calculated). The same set of test atoms is used for each CV split, and the remaining atoms are all used, without undersampling, for training.

Smooth-overlap of atomic positions (SOAP)

In the SOAP formalism, the neighbor density is expanded in radial basis functions R_n(r) and spherical harmonics Y_lm as the angular basis set⁴⁰:

$$\rho _i(r) = \mathop {\sum}\limits_j {e^{ - (r_{ij} - r)^2/2\sigma ^2}} f_c(r_{ij}) = \mathop {\sum}\limits_{nlm} {c_{nlm}^i} R_n(r)Y_{lm}({\hat{\mathbf r}})$$

(5)

SOAP has also achieved notable success in fitting ML potentials. In practical applications, the number of descriptors depends on n_max (the number of radial basis functions) and l_max (the maximum degree of the spherical harmonics), as noted in Table 1. Here we set n_max = 6 and l_max = 8. The cutoff radius for determining the neighbors, R_c, and the standard deviation of the Gaussian expansion, σ, are set to 4.5 Å and 0.5 Å, respectively. The SOAP descriptors are derived using DScribe⁵⁹.

Flexibility volume and atomic shear moduli

The flexibility volume $V_{{\mathrm{flex}},i}$ of atom i is defined as⁴⁵:

$$V_{{\mathrm{flex}},i} = \left\langle {\left( {x_i(t) - \bar x_i} \right)^2} \right\rangle \times V_i^{1/3}$$

(6)

where $\bar x_i$ and $x_i(t)$ are the equilibrium position and the instantaneous position at time t of atom i, and V_i is the corresponding atomic volume. The mean square displacement was evaluated on short time scales, where it is flat with time and contains the vibrational but not the diffusional contribution. Each sample was kept at equilibrium under a microcanonical ensemble (NVE) at room temperature for the calculation, which was averaged over 100 independent runs, all starting from the same configuration but with momenta assigned randomly from the appropriate Maxwell-Boltzmann distribution.
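Equation (6) translates into a few lines of NumPy, sketched below for a single trajectory. The function name, the use of the time-averaged position as the equilibrium position, and the toy input are assumptions; averaging over the 100 independent runs described above would be an additional outer mean.

```python
import numpy as np


def flexibility_volume(trajectory, atomic_volumes):
    """Per-atom flexibility volume, Eq. (6): vibrational MSD times V_i^(1/3).

    trajectory     : (n_frames, n_atoms, 3) unwrapped positions in angstrom
    atomic_volumes : (n_atoms,) atomic volumes in cubic angstrom
    """
    traj = np.asarray(trajectory, dtype=float)
    # Assumption: the time-averaged position stands in for the equilibrium position.
    mean_position = traj.mean(axis=0)
    msd = ((traj - mean_position) ** 2).sum(axis=2).mean(axis=0)
    return msd * np.cbrt(np.asarray(atomic_volumes, dtype=float))


# Toy input: one atom oscillating by +/-0.1 angstrom along x, atomic volume 8 cubic angstrom.
trajectory_demo = np.array([[[0.1, 0.0, 0.0]], [[-0.1, 0.0, 0.0]]])
v_flex_demo = flexibility_volume(trajectory_demo, atomic_volumes=[8.0])
print(v_flex_demo)
```

The result has units of Å³, since a mean square displacement (Å²) is multiplied by a length (Å).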

Atomic shear moduli at room temperature were evaluated using the fluctuation method. For a canonical (NVT) ensemble, elastic constants can be calculated as the sum of three contributions:

$$C_{ijkl}^T = C_{ijkl}^I + C_{ijkl}^{II} + C_{ijkl}^{III}$$

(7)

where the superscripts I, II, and III denote the fluctuation, kinetic, and Born contributions, respectively (see ref. ⁴⁶ for more details). To reduce the statistical error in our simulated samples, the average atomic shear modulus (G) is evaluated as

$$G = \frac{{C_{44} + C_{55} + C_{66}}}{3}$$

(8)

The local moduli tensor is computed at the coarse-grained scale using the average atomic shear moduli of the center atom and its nearest neighbors.
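A minimal sketch of Eq. (8) and of the neighbor coarse-graining step is given below. The unweighted mean over the center atom and its neighbors, the function names, and the toy neighbor lists are assumptions; the text does not specify any weighting.

```python
import numpy as np


def average_shear_modulus(c44, c55, c66):
    """Average shear modulus of Eq. (8)."""
    return (c44 + c55 + c66) / 3.0


def coarse_grain(per_atom_values, neighbor_lists):
    """Unweighted mean over each atom and its listed nearest neighbors (an assumption)."""
    values = np.asarray(per_atom_values, dtype=float)
    return np.array([
        values[[i, *neighbors]].mean()
        for i, neighbors in enumerate(neighbor_lists)
    ])


# Toy input: three atoms with hypothetical per-atom moduli (GPa) and neighbor lists.
g_atoms_demo = [average_shear_modulus(30.0, 33.0, 27.0), 20.0, 10.0]
g_coarse_demo = coarse_grain(g_atoms_demo, neighbor_lists=[[1, 2], [0], [0]])
print(g_coarse_demo)
```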

Athermal quasi-static (AQS) simulation

We employ the athermal quasi-static (AQS) mode to simulate the shear deformation of the glass⁶⁰. At each deformation step, an affine strain of 10⁻⁴ is imposed along the +xy direction, followed by an energy minimization using the conjugate-gradient method. The initial configuration is the inherent structure of the equilibrated glass sample. The simulations were conducted using LAMMPS⁵⁵, and periodic boundary conditions (PBC) were applied in all three directions. The plastic events were monitored using the non-affine displacement (${\mathrm {D}}^2_{\min}$)⁴⁹. This is done by tracking the atomic strain of each atom during deformation, and decomposing the local deformation into the best affine fit and the non-affine residue.
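The decomposition into a best affine fit and a non-affine residue can be sketched as a least-squares problem for a single atom. The function name and the toy neighbor shell are assumptions, and no normalization by the number of neighbors is applied; conventions differ on that point.

```python
import numpy as np


def d2min(ref_vectors, cur_vectors):
    """Non-affine residue of one atom's neighborhood.

    ref_vectors : (n_neighbors, 3) vectors from the atom to its neighbors, undeformed state
    cur_vectors : (n_neighbors, 3) the same vectors in the deformed state
    Returns (D2min, J), where J is the best-fit local deformation gradient.
    """
    d0 = np.asarray(ref_vectors, dtype=float)
    d = np.asarray(cur_vectors, dtype=float)
    # Least-squares solution of d0 @ A ~= d; A is the transpose of J in the
    # convention d_j ~= J @ d0_j.
    A, *_ = np.linalg.lstsq(d0, d, rcond=None)
    residual = d - d0 @ A
    return float(np.sum(residual**2)), A.T


# Toy neighbor shell: six neighbors at unit distance along +/- x, y, z.
d0_demo = np.array([[1, 0, 0], [-1, 0, 0], [0, 1, 0],
                    [0, -1, 0], [0, 0, 1], [0, 0, -1]], dtype=float)
F_demo = np.eye(3)
F_demo[0, 1] = 0.04  # simple xy shear of 4%

affine_demo = d0_demo @ F_demo.T
d2_affine, J_affine = d2min(d0_demo, affine_demo)

perturbed_demo = affine_demo.copy()
perturbed_demo[0] += [0.0, 0.0, 0.1]  # one neighbor makes an extra non-affine move
d2_perturbed, _ = d2min(d0_demo, perturbed_demo)
print(d2_affine, d2_perturbed)
```

A purely affine shear gives a vanishing residue, while the extra displacement of a single neighbor produces a finite value, which is the signature used to flag plastic events.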

Data availability

The datasets used in this work have been made public in figshare with the DOI of https://doi.org/10.6084/m9.figshare.12485795.

Code availability

The codes for deriving the interstice representation can be publicly accessed in amlearn¹⁹ (https://github.com/Qi-max/amlearn) and matminer³⁰ (https://github.com/hackingmaterials/matminer).

References

Greer, A. L. Metallic Glasses. In Physical Metallurgy: Fifth Edition. 305–385 (Elsevier, 2014).
Schroers, J. Bulk metallic glasses. Phys. Today 66, 32–37 (2013).
Egami, T. Atomic level stresses. Prog. Mater. Sci. 56, 637–653 (2011).
Hirata, A. et al. Direct observation of local atomic order in a metallic glass. Nat. Mater. 10, 28–33 (2011).
Cheng, Y. Q. & Ma, E. Atomic-level structure and structure-property relationship in metallic glasses. Prog. Mater. Sci. 56, 379–473 (2011).
Goldstein, M. Viscous liquids and the glass transition: a potential energy barrier picture. J. Chem. Phys. 51, 3728–3739 (1969).
Debenedetti, P. G. & Stillinger, F. H. Supercooled liquids and the glass transition. Nature 410, 259–267 (2001).
Wales, D. J. A microscopic basis for the global appearance of energy landscapes. Science 293, 2067–2070 (2001).
Johari, G. P. & Goldstein, M. Viscous liquids and the glass transition. II. Secondary relaxations in glasses of rigid molecules. J. Chem. Phys. 53, 2372–2388 (1970).
Yu, H.-B., Wang, W.-H. & Samwer, K. The β relaxation in metallic glasses: an overview. Mater. Today 16, 183–191 (2013).
Qiao, J. C. & Pelletier, J. M. Dynamic mechanical relaxation in bulk metallic glasses: a review. J. Mat. Sci. Technol. 30, 523–545 (2014).
Yu, H.-B., Richert, R. & Samwer, K. Structural rearrangements governing Johari-Goldstein relaxations in metallic glasses. Sci. Adv. 3, e1701577 (2017).
Fan, Y., Iwashita, T. & Egami, T. How thermally activated deformation starts in metallic glass. Nat. Commun. 5, 5083 (2014).
Wang, Z., Sun, B. A., Bai, H. Y. & Wang, W. H. Evolution of hidden localized flow during glass-to-liquid transition in metallic glass. Nat. Commun. 5, 5823 (2014).
Yu, H. B., Samwer, K., Wu, Y. & Wang, W. H. Correlation between β relaxation and self-diffusion of the smallest constituting atoms in metallic glasses. Phys. Rev. Lett. 109, 095508 (2012).
Zhu, F. et al. Intrinsic correlation between β-relaxation and spatial heterogeneity in a metallic glass. Nat. Commun. 7, 11516 (2016).
Schoenholz, S. S., Cubuk, E. D., Sussman, D. M., Kaxiras, E. & Liu, A. J. A structural approach to relaxation in glassy liquids. Nat. Phys. 12, 469–471 (2016).
Cubuk, E. D. et al. Identifying structural flow defects in disordered solids using machine-learning methods. Phys. Rev. Lett. 114, 108001 (2015).
Wang, Q. & Jain, A. A transferable machine-learning framework linking interstice distribution and plastic heterogeneity in metallic glasses. Nat. Commun. 10, 5537 (2019).
Cubuk, E. D. et al. Structure-property relationships from universal signatures of plasticity in disordered solids. Science 358, 1033–1037 (2017).
Harrington, M., Liu, A. J. & Durian, D. J. Machine learning characterization of structural defects in amorphous packings of dimers and ellipses. Phys. Rev. E. 99, 022903 (2019).
Sussman, D. M., Schoenholz, S. S., Cubuk, E. D. & Liu, A. J. Disconnecting structure and dynamics in glassy thin films. Proc. Natl Acad. Sci. USA 114, 10601–10605 (2017).
Ma, X. et al. Heterogeneous activation, local structure, and softness in supercooled colloidal liquids. Phys. Rev. Lett. 122, 28001 (2019).
Landes, F. P. et al. Attractive versus truncated repulsive supercooled liquids: the dynamics is encoded in the pair correlation function. Phys. Rev. E 101, 010602 (2020).
Barbot, A. et al. Local yield stress statistics in model amorphous solids. Phys. Rev. E. 97, 33001 (2018).
Schwartzman-Nowik, Z., Lerner, E. & Bouchbinder, E. Anisotropic structural predictor in glassy materials. Phys. Rev. E. 99, 60601 (2019).
Barkema, G. T. & Mousseau, N. Event-based relaxation of continuous disordered systems. Phys. Rev. Lett. 77, 4358–4361 (1996).
Rodney, D. & Schuh, C. Distribution of thermally activated plastic events in a flowing glass. Phys. Rev. Lett. 102, 235503 (2009).
Lee, M., Lee, C. M., Lee, K. R., Ma, E. & Lee, J. C. Networked interpenetrating connections of icosahedra: effects on shear transformations in metallic glass. Acta Mater. 59, 159–170 (2011).
Ward, L. et al. Matminer: an open source toolkit for materials data mining. Comput. Mater. Sci. 152, 60–69 (2018).
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 785–794 (Association for Computing Machinery, 2016).
Bradley, A. P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 30, 1145–1159 (1997).
Zadrozny, B. & Elkan, C. Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers. In Proceedings of the 18th International Conference on Machine Learning (2001).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Okabe, A., Boots, B., Sugihara, K. & Chiu, S. N. Spatial Tessellations: Concepts and Applications of Voronoi Diagrams (2009).
Ding, J., Patinet, S., Falk, M. L., Cheng, Y. & Ma, E. Soft spots and their structural signature in a metallic glass. Proc. Natl Acad. Sci. USA 111, 14052–14056 (2014).
Ding, J., Cheng, Y. Q. & Ma, E. Full icosahedra dominate local order in Cu64Zr34 metallic glass and supercooled liquid. Acta Mater. 69, 343–354 (2014).
Steinhardt, P. J., Nelson, D. R. & Ronchetti, M. Bond-orientational order in liquids and glasses. Phys. Rev. B 28, 784–805 (1983).
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Thompson, A. P., Swiler, L. P., Trott, C. R., Foiles, S. M. & Tucker, G. J. Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials. J. Comput. Phys. 285, 316–330 (2015).
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Shapeev, A. V. Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Novikov, I. S., Gubaev, K., Podryabinkin, E. V. & Shapeev, A. V. The MLIP package: Moment tensor potentials with mpi and active learning. Preprint at https://arxiv.org/abs/2007.08555 (2020).
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Ding, J. et al. Universal structural parameter to quantitatively predict metallic glass properties. Nat. Commun. 7, 13733 (2016).
Cheng, Y. Q. & Ma, E. Configurational dependence of elastic modulus of metallic glass. Phys. Rev. B 80, 64104 (2009).
Tipping, M. E. & Bishop, C. M. Probabilistic Principal Component Analysis. J. R. Stat. Soc. Ser. B (Statistical Methodol.) 61, 611–622 (1999).
Argon, A. S. Plastic deformation in metallic glasses. Acta Metall. 27, 47 (1979).
Falk, M. L. & Langer, J. S. Dynamics of viscoplastic deformation in amorphous solids. Phys. Rev. E. 57, 7192–7205 (1998).
Tsamados, M., Tanguy, A., Goldenberg, C. & Barrat, J. L. Local elasticity map and plasticity in a model Lennard-Jones glass. Phys. Rev. E 80, 026112 (2009).
Greer, A. L., Cheng, Y. Q. & Ma, E. Shear bands in metallic glasses. Mater. Sci. Eng. R. Rep. 74, 71–132 (2013).
Hufnagel, T. C., Schuh, C. A. & Falk, M. L. Deformation of metallic glasses: recent developments in theory, simulations, and experiments. Acta Mater. 109, 375–393 (2016).
Wisitsorasak, A. & Wolynes, P. G. Dynamical theory of shear bands in structural glasses. Proc. Natl Acad. Sci. USA 114, 1287–1292 (2017).
Bapst, V. et al. Unveiling the predictive power of static structure in glassy systems. Nat. Phys. 16, 448–454 (2020).
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Cheng, Y. Q., Ma, E. & Sheng, H. W. Atomic level structure in multicomponent bulk metallic glass. Phys. Rev. Lett. 102, 245501 (2009).
Allen, M. P. & Tildesley, D. J. Computer simulation of liquids. (Clarendon Press, Oxford, 1987).
Marinica, M. C., Willaime, F. & Mousseau, N. Energy landscape of small clusters of self-interstitial dumbbells in iron. Phys. Rev. B. 83, 9 (2011).
Himanen, L. et al. DScribe: Library of descriptors for machine learning in materials science. Comput. Phys. Commun. 247, 106949 (2020).
Maloney, C. E. & Lemaître, A. Amorphous systems in athermal, quasistatic shear. Phys. Rev. E. 74, 016118 (2006).

Acknowledgements

Q.W. and E.M. are supported at JHU by U.S. Department of Energy (DOE), DOE-BES-DMSE, under grant DE-FG02-19ER46056. Q.W. also acknowledges the support of National Natural Science Foundation of China (51701190). J.D. acknowledges the Chinese Thousand-Youth-Talent Program, and the Young Talent Startup Program of Xi’an Jiaotong University. A.S. and E.P. are supported by the Russian Science Foundation (grant number 18-13-00479).

Author information

Authors and Affiliations

Department of Materials Science and Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
Qi Wang & Evan Ma
Center for Advancing Materials Performance from the Nanoscale (CAMP-Nano), State Key Laboratory for Mechanical Behavior of Materials, Xi’an Jiaotong University, Xi’an, 710049, China
Jun Ding
Department of Computer Science, Johns Hopkins University, Baltimore, MD, 21218, USA
Longfei Zhang
Center for Energy Science and Technology, Skolkovo Institute of Science and Technology, 3 Nobel Street, Moscow, 143026, Russia
Evgeny Podryabinkin & Alexander Shapeev

Authors

Qi Wang
Jun Ding
Longfei Zhang
Evgeny Podryabinkin
Alexander Shapeev
Evan Ma

Contributions

Q.W. and J.D. initiated the plan for this study. Q.W. designed and analyzed the machine learning models (except for MTP). J.D. conducted the ART simulations and the flexibility volume and shear moduli analyses. L.Z. and Q.W. benchmarked the SOAP and bispectrum coefficients. E.P. and A.S. modified the MLIP package for benchmarking the MTP. Q.W., J.D. and E.M. discussed the results and wrote the manuscript.

Corresponding authors

Correspondence to Qi Wang or Jun Ding.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, Q., Ding, J., Zhang, L. et al. Predicting the propensity for thermally activated β events in metallic glasses via interpretable machine learning. npj Comput Mater 6, 194 (2020). https://doi.org/10.1038/s41524-020-00467-4

Received: 24 June 2020
Accepted: 18 November 2020
Published: 15 December 2020
DOI: https://doi.org/10.1038/s41524-020-00467-4
