Autonomously revealing hidden local structures in supercooled liquids

Boattini, Emanuele; Marín-Aguilar, Susana; Mitra, Saheli; Foffi, Giuseppe; Smallenburg, Frank; Filion, Laura

doi:10.1038/s41467-020-19286-8

Download PDF

Article
Open access
Published: 30 October 2020

Autonomously revealing hidden local structures in supercooled liquids

Nature Communications volume 11, Article number: 5479 (2020) Cite this article

5029 Accesses
70 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Few questions in condensed matter science have proven as difficult to unravel as the interplay between structure and dynamics in supercooled liquids. To explore this link, much research has been devoted to pinpointing local structures and order parameters that correlate strongly with dynamics. Here we use an unsupervised machine learning algorithm to identify structural heterogeneities in three archetypical glass formers—without using any dynamical information. In each system, the unsupervised machine learning approach autonomously designs a purely structural order parameter within a single snapshot. Comparing the structural order parameter with the dynamics, we find strong correlations with the dynamical heterogeneities. Moreover, the structural characteristics linked to slow particles disappear further away from the glass transition. Our results demonstrate the power of machine learning techniques to detect structural patterns even in disordered systems, and provide a new way forward for unraveling the structural origins of the slow dynamics of glassy materials.

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

John Jumper, Richard Evans, … Demis Hassabis

Shell buckling for programmable metafluids

Article 03 April 2024

Adel Djellouli, Bert Van Raemdonck, … Katia Bertoldi

Neural operators for accelerating scientific simulations and design

Article 08 April 2024

Kamyar Azizzadenesheli, Nikola Kovachki, … Anima Anandkumar

Introduction

The connection between structure and dynamics in supercooled liquids and glasses is one of the most intriguing puzzles in condensed matter physics. The conundrum: close to the glass transition, the dynamics slow down dramatically and become heterogeneous^1,2 while the structure appears largely unperturbed. Largely unperturbed, however, is not the same as unperturbed, and many studies have attempted to identify slow local structures by exploiting dynamical information^3,4. Unsupervised machine learning (UML) techniques may provide a novel way forward for shedding light on this problem.

Machine learning (ML) techniques are rapidly becoming a game-changer in the study of materials. Examples include speeding up computationally expensive calculations⁵, accurately distinguishing different crystal phases^6,7, and even developing design rules for structural and material properties⁸. An exciting development is the design of UML algorithms that can autonomously classify particles based on patterns in their local environment^9,10,11, even in disordered systems¹². A key strength of these UML approaches is that they can find variations in local structure without any a priori knowledge of what might appear, opening the door to finding new, unanticipated structures.

The idea of an autonomous algorithm that picks out structural heterogeneities is a particularly appealing one in the context of supercooled liquids. In this field, the last few years have seen a frantic hunt for local structural features that can be interpreted as the underlying cause for dynamical heterogeneities. To this end, a number of studies have examined the prevalence and lifetimes of a large variety of locally favored structures^13,14, correlated dynamics with local order parameters based on, e.g., tetrahedrality or packing efficiency^15,16,17, and have looked at the dynamical effects of promoting specific local features^18,19,20.

A major advance in correlating structure and dynamics was made through the use of supervised ML techniques. In particular, a number of studies have demonstrated that support-vector machines could be taught to recognize more mobile particles in several glass formers^21,22,23,24. More recently, it was shown that even better dynamical predictions could be made using both convolutional neural networks and graph neural networks²⁵. However, in order to train these algorithms, data linking structure to future dynamics had to be used. Methods that can autonomously detect purely structural heterogeneities offer an unbiased fresh look at the problem.

Here we show that a simple, efficient UML algorithm that we recently developed¹⁰ for detecting crystalline structure can be harnessed to detect structural heterogeneities in glasses. Our algorithm—which requires only a single snapshot as input—uses bond-order parameters (BOPs) to encode the local environments of the particles. Combining a neural network-based autoencoder with Gaussian mixture models, it then autonomously designs a structural order parameter capable of detecting the largest structural variation in the system.

We test the performance of the algorithm on three archetypical glass forming systems: binary hard spheres, Wahnström, and Kob–Andersen. These three model systems have been extensively studied in the context of fundamental glass formers, and have proven extremely valuable in unraveling many aspects of the glass transition (see e.g. refs. ^3,26,27). Additionally, extensive past research has indicated that both binary hard spheres and the Wahnström model display a strong correlation between local structure and dynamical slowdown^3,28, while these correlations are more nebulous for the Kob–Andersen model²⁸. Collectively, these models provide an ideal playground for testing the ability of our UML technique to find local structural features in supercooled liquids.

Results

Setting up the UML algorithm

We construct configurations for our UML analysis by running molecular dynamics simulations of three glass formers inside the glassy regime. The glass formers we consider are all three-dimensional models and include binary hard spheres, Wahnström, and Kob–Andersen (see “Methods” for more details). We then select one equilibrated configuration in the glassy regime for each glass former to start our UML analysis.

The UML method we explore here is based on an algorithm we recently developed¹⁰ for detecting crystalline structures. As shown in Fig. 1, this analysis consists of three steps. First, the local environment of each particle is encoded into a vector of eight BOPs (see “Methods”). This local environment includes information regarding (approximately) the first two shells of neighbors. Secondly, an autoencoder is used to lower the dimensionality of this vector. The autoencoder is a neural network trained to reproduce its input as its output. This neural network is especially designed to contain a “bottleneck” with a lower dimensionality than the input vector, such that the network is forced to compress the information, and subsequently decompress it again. After training the autoencoder, we only retain the compression part of the network, and use it as our dimensionality reducer. Note that this algorithm allows for non-linear transformations to a lower dimension. Third, the particles are then grouped in this lower-dimensional space using Gaussian mixture models. A full description of this algorithm is given in the Supplementary Methods.

**Fig. 1: Schematic representation of the unsupervised machine learning method.**

This UML algorithm has two main parameters that need to be chosen: (i) the dimensionality of the bottleneck of the autoencoder c and (ii) the number of Gaussian components N_G used to fit the distribution in the lower-dimensional space. To choose the dimensionality of the bottleneck, we require that the autoencoder reproduces at least 75% of the variance of the input data. In Fig. 2a–c we show the fraction of variance explained (FVE, see “Methods”) by the autoencoder as a function of c for the chosen snapshots from all three models. Based on this analysis, we choose c = 2 for both the binary hard spheres and the Wahnström models, and c = 4 for the Kob–Andersen model. To determine the number of Gaussians N_G, we measure the Bayesian Information Criterion (BIC, see Supplementary Methods) as a function of N_G, and plot it for each snapshot in Fig. 2d–f. The optimum number of Gaussians corresponds to the minimum in the BIC analysis, which for all models is found at N_G = 2. As each Gaussian can be seen as generating a cluster in the data, this means that the UML identifies two clusters of particles for all three models.

In addition to these two parameters, the depth and width of the neural network forming the autoencoder can be varied. In this paper we always use a network geometry of one hidden layer of dimension 40 for both the encoder and the decoder parts of the network. In the Supplementary Note 1, we explore the behavior of our UML algorithm, with respect to, e.g., the dimensionality of the input vector, repeated network trainings, network structure, and the dimensionality of the bottleneck c. We find the algorithm to be largely robust with respect to changes in these parameters.

Extracting a scalar order parameter P_red

To encode the UML clustering into a single scalar order parameter, each particle is assigned a probability to belong to one of the two clusters. To this end, we label the two clusters with different colors—white and red, see Fig. 1. For each particle i, we then define P_red(i) as the probability that the particle belongs to the red cluster based on the Gaussian mixture model (see “Methods”). This results in a single scalar order parameter between 0 and 1, with values near 0 indicating particles whose environment more closely matches the white cluster, and values near 1 indicating particles whose environment more closely matches the red cluster. Note that by definition this order parameter captures the largest structural heterogeneities in the system, as found by the UML approach. We would like to stress that in contrast with supervised ML studies of glasses^21,22, our approach uses no dynamical information, and is trained on a single static snapshot for each system.

Using this scalar order parameter, we analyze the three glassy configurations, and color each particle based on P_red. The results are shown in Fig. 3a–c. In all three systems, the system shows clear structural heterogeneity, consisting of regions of both environments.

**Fig. 3: Structural analysis and correlations with dynamics in three archetypical glass formers.**

Correlating P_red with dynamics

The question now is whether the structural variations detected by P_red are correlated with the dynamics. To probe this, we measure the dynamic propensity D_i(δt) of particle i: a measure for how mobile particle i will be over the next time interval δt (see “Methods”), which has proven useful in supercooled liquids^15,16,29,30. In Fig. 3j–l, we plot the Spearman’s correlation coefficient between P_red and D_i(δt), as a function of the time interval δt. As one might expect, this correlation is weak both for very short time scales, where particles are simply rattling within their cages, and for long time scales where the system loses memory of its initial configuration. It peaks slightly below the structural relaxation time τ_α, indicating that we have indeed identified structures connected to the structural relaxation.

To further investigate the correlation between the UML order parameter and the dynamics, in Fig. 3d–f, we color the particles according to their dynamic propensity, with δt chosen to correspond to the maximum in the correlation. Comparing to Fig. 3a–c, it is clear that regions of high dynamic propensity correspond to high values of P_red, indicating that the particles identified as part of the red cluster also largely correspond to the faster particles in the system. The correlation can be further improved by averaging P_red over particles within a small local region, similar to what was found in previous studies^15,16,31. This is shown in the snapshots in Fig. 3g–i and the solid lines in Fig. 3j–l, where we show the results for a local averaging radius r_c = 2σ. In all cases, the correlation between the averaged ${\bar{P}}_{{\rm{red}}}$ and D_i peaks very close to τ_α. This is slightly later than the unaveraged version, likely because we are now looking at larger regions, which will take more time to rearrange. Note that here we resolved the inherent symmetry between red and white clusters, by always labeling as red the cluster that on average turns out to be faster.

As also found in previous work²⁸, Kob–Andersen seems to be the model whose behavior is less well captured by our analysis. This might be related to the attraction that could induce heterogeneities over large length scales due to the proximity of a gas–liquid phase coexistence^32,33. This kind of effect would not be fully captured by our (highly local) observables.

To summarize, in all cases the structural heterogeneities identified by our UML order parameter correlate significantly with the local dynamics. This leads to two intriguing questions: (i) are these structural correlations to the dynamics strong or weak in comparison to literature? and (ii) how does the method perform when compared to supervised learning algorithms specifically designed to predict dynamics?

With respect to the first question (i), in the case of binary hard spheres, previous literature has shown a strong correlation between local tetrahedrality and both global and local dynamics. At the same state point shown in Fig. 3 for binary hard spheres, the correlation between local tetrahedrality and local dynamics was shown to reach ≈0.63 for small particles¹⁵. Our UML order parameter significantly exceeds this with a value of ≈0.72. For Wahnström, and Kob–Andersen, Hocky et al.²⁸ examined the correlation between locally favored structures and dynamic propensity. Although the state points are not exactly the same as ours, they found a slightly stronger correlation for Wahnström, and a slightly weaker one for Kob–Andersen.

With respect to the second question (ii), we will focus on the Kob–Andersen system where the most extensive data exist. In a recent article, Bapst et al.²⁵ examined both a variety of physics-based methods for predicting propensity (based on soft modes³⁴, the Debye–Waller factor³⁰, or potential energy^29,35), as well as state of the art supervised ML algorithms^21,25. As shown in Supplementary Fig. 7, at the strongest supercooling ${\bar{P}}_{{\rm{red}}}$ (with a correlation of 0.4) correlates better than any of the physics-based methods (best correlation: 0.35), but worse than the supervised ML algorithms (range of correlations: 0.4–0.6). The higher correlations in the supervised algorithms were expected as these algorithms explicitly fit dynamic propensity as a function of structural features. However, in contrast to the UML algorithm, such fitting requires a large amount of data and computational effort.

Variation of P_red with supercooling

Thus far, we have shown that P_red is capable of predicting local variations in the dynamics in each of our systems. A natural next question is whether the same order parameter can be used globally, i.e. to capture the onset of dynamical arrest as the glass former is supercooled. This can be done by checking whether the “slow” structural group becomes more dominant as we move closer to the glass transition, similar to what was seen in, e.g., refs. ^15,23,36. To confirm that the UML-designed order parameters are able to capture the onset of dynamical arrest without the need for retraining, we use the exact same UML order parameter trained on the snapshots of Fig. 3 on systems equilibrated at lower degrees of supercooling: lower packing fractions η for hard spheres, and higher temperatures T^* for the other two models. In Fig. 4, we plot 〈P_red〉, defined as the globally averaged value of P_red, as a function of the degree of supercooling for each glass former. In all cases, 〈P_red〉 increases monotonically as the system moves out of the glassy regime. Hence, the structures we identify as white (slow) at strong supercooling disappear as we move away from the glass transition—clearly showing that the UML order parameter identifies local structures that are important for the dynamical slowdown. Interestingly, as shown in the insets in Fig. 4, the relationship between 〈P_red〉 and the structural relaxation time is exponential for both the hard sphere and the Wahnström system.

**Fig. 4: Structural order parameter as a function of supercooling.**

Predictive power as a function of supercooling

It is well known that the dynamical heterogeneities of supercooled liquids become weaker and shift to shorter time scales as we move away from the glass transition. As a result, the correlation between structure and dynamics should also become weaker^3,15,16. To test how the predictive power of our UML analysis depends on the degree of supercooling, we perform a new UML analysis on each of the glass formers at different packing fractions and temperatures. Specifically, for each state point we find a new projection and classification, and determine the correlation between ${\bar{P}}_{{\rm{red}}}$ and D_i. Note that this is different from our analysis for Fig. 4 as there we used only a single-order parameter for each system. By retraining the order parameter, we correlate the dynamical heterogeneity with the largest structural variation that we can find at each individual state point. In Fig. 5 we show that indeed the correlations become weaker and shift to shorter times (along with τ_α) as we move away from the glassy regime.

**Fig. 5: Correlation between structure and dynamics for different supercoolings.**

In Fig. 5 we have kept the local averaging radius r_c fixed at 2σ_A for the calculation of ${\bar{P}}_{{\rm{red}}}$. However, in practice, the maximum in the correlation depends on the choice of r_c, as shown in the Supplementary Note 4. Specifically, the optimal value of r_c tends to increase with the degree of supercooling. It might be tempting to interpret this growth as a growing static length scale, a topic of significant debate in the glass community (see, e.g., refs. ^{3,37,38,39,40}). However, it is important to realize that this optimum value is determined by using dynamical data (i.e. the correlation with the dynamic propensity). As such, the optimum in r_c should be seen as a dynamic length scale. In contrast, one can straightforwardly extract a static correlation length from our order parameter P_red, by determining its autocorrelation function as a function of distance. As shown in the Supplementary Note 5, this method gives an essentially constant correlation length for each of our three systems, with little to no growth with increasing supercooling. Hence, our method does not yield a clearly growing, purely static length scale.

Characterizing the local structure of the different clusters

As the UML is based on a description of local environments in terms of BOPs⁴¹, a natural question to ask is how the two identified clusters differ in terms of their BOPs. In Fig. 6, we plot the mean value of all BOPs q₁, … , q₈ for both red (more mobile, P_red > 0.5) and white clusters (P_red < 0.5) for the three snapshots depicted in Fig. 3. Perhaps surprisingly, we do not observe dramatic differences in the average BOPs of the two clusters. The small variations that are seen, however, exhibit a few notable trends. Specifically for all three models, q₂, q₃, q₄, q₅, and q₈ are higher in the red, more mobile cluster. Additionally, for both hard spheres and Wahnström, q₆ is the highest in the slow cluster; note that q₆ is often connected to close packed crystal structures like face-centered cubic and hexagonal-close packed.

**Fig. 6: Variation of bond-order with P_red.**

As a second avenue for differentiating the local structure in each group we use topological cluster classification (TCC)⁴². This algorithm detects a set of pre-defined clusters corresponding to low-energy (or high-packing) structures in a few model systems. As shown in Fig. 7, we find that for the hard sphere and Wahnström systems, P_red negatively correlates strongly with local structures built up out of one or more tetrahedra, while it positively correlates with TCC clusters built from square pyramids. For the Kob–Andersen mixture, P_red still negatively correlates best with tetrahedral environments, but correlations are significantly weaker. Interestingly, TCC detects essentially no clusters that correlate positively with P_red, suggesting that these particles have local environments not detected by TCC—indicating that our UML is picking up on structures not included in the low-energy (or high-packing) structures built into TCC. This is one area where the UML approach shines: it is not restricted by a priori assumptions about the features that are considered in the clustering.

**Fig. 7: Correlation between P_red and TCC clusters.**

Discussion

Overall, our results are consistent with the idea that at least part of the dynamical slowdown is driven by the emergence of a set of locally favored structures, in agreement with earlier observations on these models (see, e.g., refs. ^19,43,44). As the system is pushed closer to the glass transition, these structures become more stable, due to a more favorable local packing or potential energy, with a profound impact on both local and global dynamics³. What is intriguing is the observation that this variation in local structure can be effectively captured by a one-dimensional machine-learned order parameter, and that this structural order parameter strongly correlates with the dynamics.

Interestingly, our Gaussian clustering approach indicates that the structural features of all of our systems, once projected onto the lower-dimensional space, are best described by two Gaussians. This is consistent with a previously proposed picture where supercooled liquids can be interpreted as a mixture of two competing structural populations²⁰. Note that a similar two-state picture has been extremely successful in understanding the glassy behavior of supercooled water^31,45, where the competing local structures are ostensibly linked to different thermodynamic phases at extreme supercooling.

The UML analysis used here is both extremely simple to implement as well as very efficient to run. The input is a single snapshot, and the analysis involves standard BOP calculations, fitting a small neural network-based autoencoder, and Gaussian mixture models; all of which are fast, standard methods that are commonly available in open-source libraries. The total analysis costs only a few minutes of computational time. While the simplicity of this algorithm is one of its main strengths, this also implies that there are many potential routes towards expanding this method—for instance, adding information about the local density to the local descriptors, and/or projecting the information onto higher dimensional order parameters. Such additions might increase the accuracy, although likely at the cost of the simplicity and speed. Already, a number of UML techniques exist that classify particles based on local structure^9,10,11,12, using different definitions of local structures, and different approaches for classification. It will be interesting to see which of these performs best in purely heterogeneous environments like glasses.

In conclusion, we have demonstrated that a simple and fast autoencoder-based UML approach is a powerful tool in the development of new structural order parameters in supercooled liquids. We demonstrated that the structural heterogeneities captured by this order parameter are strongly correlated with the dynamical heterogeneities in all three glass formers studied here, creating a new way forward for unraveling the microscopic origins of dynamical slowdown in supercooled liquids.

Methods

Models

We consider three model glass formers in three dimensions: binary hard spheres, Wahnström²⁶, and Kob–Andersen²⁷. Both Wahnström and Kob–Andersen are binary mixtures of Lennard–Jones (LJ) particles.

The binary hard-sphere model we consider is a mixtures of 30% large A particles and 70% small B particles, with size ratio σ_B/σ_A = 0.85. The mode coupling theory (MCT) packing fraction for this model is η_MCT = 0.584 (see Supplementary Note 7).

The Wahnström model²⁶ is an equimolar (50–50%) mixture of A and B particles. The LJ interaction strength between all pairs of particles is identical (ϵ_AA = ϵ_AB = ϵ_BB), but the B particles are slightly larger than the A particles (σ_BB = 1.2σ_AA and σ_AB = 1.1σ_AA). The LJ potential is truncated and shifted at the minimum in the potential, such that the interactions are purely repulsive. The MCT temperature is k_BT_MCT/ϵ_AA = 0.652 (see Supplementary Note 7).

The Kob–Andersen model²⁷ is a non-additive mixture of 80% (large) A particles and 20% (small) B particles. The interaction parameters are σ_BB = 0.88σ_AA, σ_AB = 0.8σ_AA, ϵ_BB = 0.5ϵ_AA, and ϵ_AB = 1.5ϵ_AA. The LJ potential is truncated and shifted at a cutoff distance r_c,ij = 2.5σ_ij (where i, j ∈ {A, B}), such that the attractive part of the potential is retained. The MCT temperature for this model is k_BT_MCT/ϵ_AA = 0.42 (see Supplementary Note 7).

For both Wahnström and Kob–Andersen, we define the reduced number density ${\rho }^{* }=\rho {\sigma }_{AA}^{3}$ and reduced temperature T^* = k_BT/ϵ_AA, with k_B Boltzmann’s constant.

Simulations

For all models, we use molecular dynamics simulations in the canonical ensemble. In the case of hard spheres, the simulations are performed using an event-driven approach. For Wahnström and Kob–Andersen, we use the simulation package LAMMPS⁴⁶.

Dynamic propensities are calculated as an isoconfigurational ensemble average of the absolute displacement of each particle. In other words, we perform at least 32 independent simulations starting from the same initial configuration, but with randomly chosen velocities for all particles. The dynamic propensity of particle i after a time interval δt is then defined as

$${D}_{i}(\delta t)={\langle \left|{{\bf{r}}}_{i}(\delta t)-{{\bf{r}}}_{i}(0)\right|\rangle }_{{\rm{c}}},$$

(1)

where r_i(t) is the position of particle i at time t, and the average is taken over the independent runs.

In order to obtain the relaxation time τ_α, we calculate the self-intermediate scattering function (ISF) for the Wahnström and the Kob–Andersen systems, and the total intermediate scattering function for the hard spheres:

$$F(q,t)=\frac{\left\langle{\sum }_{j,k}\exp \left\{i{\bf{q}}\left[{{\bf{r}}}_{j}(t)-{{\bf{r}}}_{k}(0)\right]\right\}\right\rangle}{\left\langle{\sum }_{j,k}\exp \left\{i{\bf{q}}\left[{{\bf{r}}}_{j}(0)-{{\bf{r}}}_{k}(0)\right]\right\}\right\rangle},$$

(2)

where r_i is the position of particle i and q is a wave vector. We calculate the ISF at an inverse wavelength q = ∣q∣ corresponding to the first peak of the structure factor. After that, we fit the long-time decay of the ISF with a stretched exponential function $\gamma \exp \left[-{(t/{\tau }_{\alpha })}^{\beta }\right]$, where γ, β, and the relaxation time τ_α are fit parameters.

Local environment description

To characterize the local environment of each particle, we use an averaged version of the local BOPs introduced by Steinhardt et al.⁴¹. First, we define for any given particle i the complex quantities

$${q}_{lm}(i)=\frac{1}{{N}_{b}(i)}\sum _{j\in {{\mathcal{N}}}_{b}(i)}{Y}_{l}^{m}({{\bf{r}}}_{ij}),$$

(3)

where ${Y}_{l}^{m}({{\bf{r}}}_{ij})$ are the spherical harmonics of order l, with m an integer that runs from m = −l to m = +l. Additionally, r_ij is the vector from particle i to particle j, and ${{\mathcal{N}}}_{b}(i)$ is the set of nearest neighbors of particle i, which we will define later. Note that ${{\mathcal{N}}}_{b}(i)$ contains N_b(i) particles. Then, the rotationally invariant BOPs, q_l, are defined as⁴¹

$${q}_{l}(i)=\sqrt{\frac{4\pi }{2l+1}\mathop{\sum }\limits_{m=-l}^{l}| {q}_{lm}(i){| }^{2}}.$$

(4)

Finally, we define an average ${\bar{q}}_{l}(i)$ as

$${\bar{q}}_{l}(i)=\frac{1}{{N}_{b}(i)+1}\left[{q}_{l}(i)+\sum _{k\in {{\mathcal{N}}}_{b}(i)}{q}_{l}(k)\right].$$

(5)

Note that by taking this average, ${\bar{q}}_{l}(i)$ depends on the positions of not only the nearest neighbor shell of particles but also the second neighbor shell. Additionally, the quantities in Eq. (5) differ from the averaged BOPs introduced by Lechner and Dellago⁴⁷ where first the averaging is performed on the non-rotational-invariant q_lm, and then rotational-invariant quantities are built.

Our description of the local environment of particle i consists of an eight-dimensional vector,

$${\bf{Q}}(i)=(\{{\bar{q}}_{l}(i)\}),$$

(6)

with l ∈ [1, 8].

The set of nearest neighbors of each particle is identified with a parameter-free criterion called SANN (solid angle nearest neighbor)⁴⁸ for the hard spheres and Wahnström models. In this approach, an effective individual cutoff radius, r_c(i), is found for every particle i in the system based on its local environment. This method is not inherently symmetric, i.e., j might be a neighbor of i while i is not a neighbor of j. However, symmetry can be enforced by either adding j to the neighbors of i or removing i from the neighbors of j. In this study, we applied the latter solution. For the Kob–Andersen mixture, we obtained better results with a fixed cutoff radius (see Supplementary Note 8 for a comparison).

Unsupervised ML

The UML approach used here follows the method outlined in ref. ¹⁰. A detailed description is provided in the Supplementary Methods.

Fraction of variance explained

The optimal number of nodes in the bottleneck layer of the autoencoder, c, is determined by computing the FVE by the reconstruction,

$$\,{\text{FVE}}\,=1-\frac{\mathop{\sum }\nolimits_{i = 1}^{N}{\left\Vert {\bf{Q}}(i)-{\hat{\bf{Q}}}(i)\right\Vert }^{2}}{\mathop{\sum }\nolimits_{i = 1}^{N}{\left\Vert {\bf{Q}}(i)-{\bar{{\bf{Q}}}}\right\Vert }^{2}},$$

(7)

where $\bar{{\bf{Q}}}$ is the mean input vector and N is the number of particles. To choose c we require that this fraction is at least 75%.

Definition of P_red

As described in the main text, for all three models, after using the UML method to reduce the dimensionality of the data, we then cluster the data using Gaussian mixture models. In all cases, the optimum number of clusters was found to be two, which we label red and white. We then used this clustering to create a scalar order parameter P_red which quantifies the local structure around the particles as follows:

$${P}_{{\rm{red}}}=\frac{{g}_{{\rm{red}}}}{{g}_{{\rm{white}}}+{g}_{{\rm{red}}}},$$

(8)

where g_white(red) is the value of the fitted Gaussian peak associated with the white (red) cluster. Note that by definition P_white + P_red = 1. Hence, as we go from the white peak to the red peak and beyond (where g_white is very small), P_red monotonically increases. Note that even though dynamics are not used in this analysis, a posteriori we always assigned the label red to the cluster with the most mobile particles.

TCC analysis

To correlate P_red with locally favored structures in the three model systems, we use the TCC algorithm⁴². Specifically, we count for each particle the number of clusters it is involved in of each type detected by the algorithm. We then calculate the correlation between the number of clusters of a given type a particle is a part of, and P_red. Note that in its original form, TCC does not accurately count simple clusters (specifically tetrahedra, square pyramids, and pentagonal pyramids) which are subsumed into larger combinations of such clusters. Here we have adapted the algorithm to correct for this choice.

Data availability

The data associated with this research is available upon request.

Code availability

The simulation and analysis codes associated with this research are available upon request.

References

Ediger, M. D. Spatially heterogeneous dynamics in supercooled liquids. Annu. Rev. Phys. Chem. 51, 99–128 (2000).
Article ADS CAS Google Scholar
Berthier, L., Biroli, G., Bouchaud, J.-P., Cipelletti, L. & van Saarloos, W. Dynamical Heterogeneities in Glasses, Colloids, and Granular Media, Vol. 150 (OUP, Oxford, 2011).
Royall, C. P. & Williams, S. R. The role of local structure in dynamical arrest. Phys. Rep. 560, 1–75 (2015).
Article ADS MathSciNet CAS Google Scholar
Tanaka, H., Tong, H., Shi, R. & Russo, J. Revealing key structural features hidden in liquids and glasses. Nat. Rev. Phys. 1, 333–348 (2019).
Article Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article ADS CAS Google Scholar
Geiger, P. & Dellago, C. Neural networks for local structure detection in polymorphic systems. J. Chem. Phys. 139, 164105 (2013).
Article ADS CAS Google Scholar
Boattini, E., Ram, M., Smallenburg, F. & Filion, L. Neural-network-based order parameters for classification of binary hard-sphere crystal structures. Mol. Phys. 116, 3066–3075 (2018).
Article ADS CAS Google Scholar
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature 559, 547–555 (2018).
Article ADS CAS Google Scholar
Reinhart, W. F., Long, A. W., Howard, M. P., Ferguson, A. L. & Panagiotopoulos, A. Z. Machine learning for autonomous crystal structure identification. Soft Matter 13, 4733 (2017).
Article ADS CAS Google Scholar
Boattini, E., Dijkstra, M. & Filion, L. Unsupervised learning for local structure detection in colloidal systems. J. Chem. Phys. 151, 154901 (2019).
Article ADS CAS Google Scholar
Spellings, M. & Glotzer, S. C. Machine learning for crystal identification and discovery. AIChE J. 64, 2198–2206 (2018).
Article CAS Google Scholar
Paret, J., Jack, R. L. & Coslovich, D. Assessing the structural heterogeneity of supercooled liquids through community inference. J. Chem. Phys. 152, 144502 (2020).
Article ADS CAS Google Scholar
Malins, A., Eggers, J., Royall, C. P., Williams, S. R. & Tanaka, H. Identification of long-lived clusters and their link to slow dynamics in a model glass former. J. Chem. Phys. 138, 12A535 (2013).
Article CAS Google Scholar
Leocmach, M. & Tanaka, H. Roles of icosahedral and crystal-like order in the hard spheres glass transition. Nat. Commun. 3, 974 (2012).
Article ADS CAS Google Scholar
Marín-Aguilar, S., Wensink, H. H., Foffi, G. & Smallenburg, F. Tetrahedrality dictates dynamics in hard sphere mixtures. Phys. Rev. Lett. 124, 208005 (2020).
Article ADS Google Scholar
Tong, H. & Tanaka, H. Revealing hidden structural order controlling both fast and slow glassy dynamics in supercooled liquids. Phys. Rev. X 8, 011041 (2018).
CAS Google Scholar
Tong, H. & Tanaka, H. Structural order as a genuine control parameter of dynamics in simple glass formers. Nat. Commun. 10, 1–10 (2019).
Article CAS Google Scholar
Taffs, J. & Royall, C. P. The role of fivefold symmetry in suppressing crystallization. Nat. Commun. 7, 1–7 (2016).
Article CAS Google Scholar
Marín-Aguilar, S., Wensink, H. H., Foffi, G. & Smallenburg, F. Slowing down supercooled liquids by manipulating their local structure. Soft Matter 15, 9886–9893 (2019).
Article ADS Google Scholar
Speck, T., Malins, A. & Royall, C. P. First-order phase transition in a model glass former: coupling of local structure and dynamics. Phys. Rev. Lett. 109, 195703 (2012).
Article ADS CAS Google Scholar
Cubuk, E. D. et al. Identifying structural flow defects in disordered solids using machine-learning methods. Phys. Rev. Lett. 114, 108001 (2015).
Article ADS CAS Google Scholar
Schoenholz, S. S., Cubuk, E. D., Sussman, D. M., Kaxiras, E. & Liu, A. J. A structural approach to relaxation in glassy liquids. Nat. Phys. 12, 469–471 (2016).
Article CAS Google Scholar
Schoenholz, S. S., Cubuk, E. D., Kaxiras, E. & Liu, A. J. Relationship between local structure and relaxation in out-of-equilibrium glassy systems. Proc. Natl Acad. Sci. USA 114, 263–267 (2017).
Article ADS CAS Google Scholar
Landes, F. P., Biroli, G., Dauchot, O., Liu, A. J. & Reichman, D. R. Attractive versus truncated repulsive supercooled liquids: the dynamics is encoded in the pair correlation function. Phys. Rev. E 101, 010602 (2020).
Article ADS CAS Google Scholar
Bapst, V. et al. Unveiling the predictive power of static structure in glassy systems. Nat. Phys. 16, 448–454 (2020).
Article CAS Google Scholar
Wahnström, G. Molecular-dynamics study of a supercooled two-component Lennard-Jones system. Phys. Rev. A 44, 3752 (1991).
Article ADS Google Scholar
Kob, W. & Andersen, H. C. Testing mode-coupling theory for a supercooled binary Lennard-Jones mixture I: the van Hove correlation function. Phys. Rev. E 51, 4626 (1995).
Article ADS CAS Google Scholar
Hocky, G. M., Coslovich, D., Ikeda, A. & Reichman, D. R. Correlation of local order with particle mobility in supercooled liquids is highly system dependent. Phys. Rev. Lett. 113, 157801 (2014).
Article ADS CAS Google Scholar
Berthier, L. & Jack, R. L. Structure and dynamics of glass formers: predictability at large length scales. Phys. Rev. E 76, 041509 (2007).
Article ADS CAS Google Scholar
Widmer-Cooper, A. & Harrowell, P. Predicting the long-time dynamic heterogeneity in a supercooled liquid on the basis of short-time heterogeneities. Phys. Rev. Lett. 96, 185701 (2006).
Article ADS CAS Google Scholar
Shi, R., Russo, J. & Tanaka, H. Origin of the emergent fragile-to-strong transition in supercooled water. Proc. Natl Acad. Sci. USA 115, 9444–9449 (2018).
Article ADS CAS Google Scholar
Sastry, S. Liquid limits: Glass transition and liquid-gas spinodal boundaries of metastable liquids. Phys. Rev. Lett. 85, 590 (2000).
Article ADS CAS Google Scholar
Berthier, L. & Tarjus, G. Nonperturbative effect of attractive forces in viscous liquids. Phys. Rev. Lett. 103, 170601 (2009).
Article ADS CAS Google Scholar
Widmer-Cooper, A., Perry, H., Harrowell, P. & Reichman, D. R. Irreversible reorganization in a supercooled liquid originates from localized soft modes. Nat. Phys. 4, 711–715 (2008).
Article CAS Google Scholar
Doliwa, B. & Heuer, A. What does the potential energy landscape tell us about the dynamics of supercooled liquids and glasses? Phys. Rev. Lett. 91, 235501 (2003).
Article ADS CAS Google Scholar
Royall, C. P., Williams, S. R., Ohtsuka, T. & Tanaka, H. Direct observation of a local structural mechanism for dynamic arrest. Nat. Mater. 7, 556 (2008).
Article ADS CAS Google Scholar
Karmakar, S., Dasgupta, C. & Sastry, S. Growing length and time scales in glass-forming liquids. Proc. Natl Acad. Sci. USA 106, 3675–3679 (2009).
Article ADS CAS Google Scholar
Berthier, L., Biroli, G., Bouchaud, J.-P. & Tarjus, G. Can the glass transition be explained without a growing static length scale? J. Chem. Phys. 150, 094501 (2019).
Article ADS CAS Google Scholar
Wyart, M. & Cates, M. E. Does a growing static length scale control the glass transition? Phys. Rev. Lett. 119, 195501 (2017).
Article ADS Google Scholar
Sausset, F. & Tarjus, G. Growing static and dynamic length scales in a glass-forming liquid. Phys. Rev. Lett. 104, 065701 (2010).
Article ADS CAS Google Scholar
Steinhardt, P. J., Nelson, D. R. & Ronchetti, M. Bond-orientational order in liquids and glasses. Phys. Rev. B 28, 784 (1983).
Article ADS CAS Google Scholar
Malins, A., Williams, S. R., Eggers, J. & Royall, C. P. Identification of structure in condensed matter with the topological cluster classification. J. Chem. Phys. 139, 234506 (2013).
Article ADS CAS Google Scholar
Turci, F., Speck, T. & Royall, C. P. Structural-dynamical transition in the Wahnström mixture. Eur. Phys. J. E 41, 54 (2018).
Article CAS Google Scholar
Crowther, P., Turci, F. & Royall, C. P. The nature of geometric frustration in the Kob-Andersen mixture. J. Chem. Phys. 143, 044503 (2015).
Article ADS CAS Google Scholar
Caupin, F. & Anisimov, M. A. Thermodynamics of supercooled and stretched water: unifying two-structure description and liquid-vapor spinodal. J. Chem. Phys. 151, 034503 (2019).
Article ADS CAS Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article ADS CAS MATH Google Scholar
Lechner, W. & Dellago, C. Accurate determination of crystal structures based on averaged local bond order parameters. J. Chem. Phys. 129, 114707 (2008).
Article ADS CAS Google Scholar
Van Meel, J. A., Filion, L., Valeriani, C. & Frenkel, D. A parameter-free, solid-angle based, nearest-neighbor algorithm. J. Chem. Phys. 136, 234107 (2012).
Article ADS CAS Google Scholar

Download references

Acknowledgements

We gratefully acknowledge Alfons van Blaaderen for many useful discussions. L.F. and E.B. acknowledge funding from The Netherlands Organisation for Scientific Research (NWO) (Grant No. 16DDS004), and L.F. acknowledges funding from NWO for a Vidi grant (Grant No. VI.VIDI.192.102). S.M.-A. acknowledges support from the Consejo Nacional de Ciencia y Tecnología (CONACyT scholarship No. 340015/471710). S.M. and G.F. acknowledge IFCPR/CEFIPRA for support through project No. 5704-1.

Author information

Authors and Affiliations

Soft Condensed Matter, Debye Institute of Nanomaterials Science, Utrecht University, Utrecht, Netherlands
Emanuele Boattini & Laura Filion
Université Paris-Saclay, CNRS, Laboratoire de Physique des Solides, 91405, Orsay, France
Susana Marín-Aguilar, Saheli Mitra, Giuseppe Foffi & Frank Smallenburg

Authors

Emanuele Boattini
View author publications
You can also search for this author in PubMed Google Scholar
Susana Marín-Aguilar
View author publications
You can also search for this author in PubMed Google Scholar
Saheli Mitra
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe Foffi
View author publications
You can also search for this author in PubMed Google Scholar
Frank Smallenburg
View author publications
You can also search for this author in PubMed Google Scholar
Laura Filion
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.F and F.S. together suggested the study. L.F., F.S., G.F., E.B., and S.M.-A. designed the research. S.M.-A. and S.M. performed the simulations, and E.B. and S.M.-A. performed the data analysis. All authors contributed to the interpretation of the results and wrote the paper.

Corresponding author

Correspondence to Laura Filion.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks François Landes and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Boattini, E., Marín-Aguilar, S., Mitra, S. et al. Autonomously revealing hidden local structures in supercooled liquids. Nat Commun 11, 5479 (2020). https://doi.org/10.1038/s41467-020-19286-8

Download citation

Received: 31 March 2020
Accepted: 05 October 2020
Published: 30 October 2020
DOI: https://doi.org/10.1038/s41467-020-19286-8

This article is cited by

Fast crystal growth of ice VII owing to the decoupling of translational and rotational ordering
- Xuan Zhang
- Yifeng Yao
- Kenji Mochizuki
Communications Physics (2023)
Modern computational studies of the glass transition
- Ludovic Berthier
- David R. Reichman
Nature Reviews Physics (2023)
A Thermodynamic Perspective on Polymer Glass Formation
- Wen-Sheng Xu
- Zhao-Yan Sun
Chinese Journal of Polymer Science (2023)
Robust prediction of force chains in jammed solids using graph neural networks
- Rituparno Mandal
- Corneel Casert
- Peter Sollich
Nature Communications (2022)
Unsupervised topological learning approach of crystal nucleation
- Sébastien Becker
- Emilie Devijver
- Noël Jakse
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.