Simulating the NaK Eutectic Alloy with Monte Carlo and Machine Learning

Reitz, Douglas M.; Blaisten-Barojas, Estela

doi:10.1038/s41598-018-36574-y

Download PDF

Article
Open access
Published: 24 January 2019

Simulating the NaK Eutectic Alloy with Monte Carlo and Machine Learning

Scientific Reports volume 9, Article number: 704 (2019) Cite this article

2161 Accesses
5 Citations
Metrics details

Subjects

Abstract

Combining atomistic simulations and machine learning techniques can expedite significantly the materials discovery process. We present an application of such methodological combination for the prediction of the melting transition and amorphous-solid behavior of the NaK alloy at the eutectic concentration. We show that efficient prediction of these properties is possible via machine learning methods trained on the topological local structural properties. The configurations resulting from Monte Carlo annealing of the NaK eutectic alloy are analyzed with topological attributes based on the Voronoi tessellation and using expectation-maximization clustering and Random Forest classification. We show that the Voronoi topological fingerprints make an accurate and fast prediction of the alloy thermal behavior by cataloguing the atomic configurations into three distinct phases: liquid, amorphous solid, and crystalline solid. Melting is found at 230 K by the sharp split of configurations classified as crystalline solid and as liquid. With the proposed metrics, an arrest-motion temperature is identified at 130–140 K through a top down clustering of the atomic configurations catalogued as amorphous solid. This statistical learning paradigm is not restricted to eutectic alloys or thermodynamics, extends the utility of topological attributes in a significant way, and harnesses the discovery of new material properties.

Impact of lattice relaxations on phase transitions in a high-entropy alloy studied by machine-learning potentials

Article Open access 01 May 2019

Tatiana Kostiuchenko, Fritz Körmann, … Alexander Shapeev

Phase prediction and experimental realisation of a new high entropy alloy using machine learning

Article Open access 23 March 2023

Swati Singh, Nirmal Kumar Katiyar, … Shrikrishna N. Joshi

Machine-learning informed prediction of high-entropy solid solution formation: Beyond the Hume-Rothery rules

Article Open access 07 May 2020

Zongrui Pei, Junqi Yin, … Michael C. Gao

Introduction

The characterization of metallic amorphous solids is more complex than the identification of crystalline matter¹. Currently, bulk amorphous metals have become useful engineering materials in several applications despite the fact that their microscopic properties at the medium and local range are not as well understood². For example, bulk amorphous alloys exhibit high strength, peculiar elastic properties, and other unusual engineering characteristics³. Concurrently, several machine learning techniques have been introduced in the field of condensed matter for enhancing the understanding of phenomena in materials design^4,5,6,7. In this article we demonstrate that a topological inspection of the structure of the eutectic sodium-potassium (NaK) alloy using machine learning analyses predicts excellently the solidification fate of the liquid eutectic alloy leading to crystalline and amorphous solids. Indeed, our findings are in full agreement with our Metropolis Monte Carlo (MMC) simulations using the second moment approximation (SMA) potential⁸ described in upcoming sections.

When looking for compositions of binary metal alloys with high glass forming ability, the eutectic and slightly off-eutectic compositions is a good place to start^9,10. Eutectic NaK is a binary alloy formed by 22% Na and 78% K in atomic weight^11,12. This interesting alloy is liquid at room temperature, solidifies at temperatures below 260 K¹², and is used as a coolant in nuclear reactors among other applications¹³. Experimental measurements of the mass density have been performed for the NaK liquid phase from room to higher temperatures at the eutectic concentration¹⁴. Numerical simulations of the NaK nanoalloy at various relative concentrations¹⁵ showed that the magical nanostructure size with minimum excess energy corresponded to the eutectic composition. However, there are no theoretical/computational studies of the extended condensed phases of eutectic NaK.

Our preliminary MMC simulations of the eutectic NaK alloy gave the thermal behavior of the enthalpy and the mass density along a computational procedure that annealed the system spanning high-to-low temperatures¹⁶. Now we analayze the thermodynamic parameters (structure, enthalpy, system volume) exhaustively from the MMC atomistic simulations with the goal of neatly identifying the liquid, crystalline, and amorphous phases that develop along the annealing process. An independent machine learning analysis using exclusively the atomic Cartesian coordinates obtained in the simulation runs is undertaken with the goal of verifying the identification of the three condensed phases predicted by the simulation. Our machine learning strategy is based on topological parameters determined by Voronoi-tessellating¹⁷ the 3D space occupied by the atoms inside the computational box. The Voronoi tessellation has been utilized for identifying topological characteristics of liquids^18,19,20, glasses^2,21,22, zeolites²³, clusters²⁴, among several others.

This article is organized as follows. In the Model and methods section the model potential for the NaK alloy is provided along with a description of the Monte Carlo methodology used in the atomistic simulations. The definition of the topological attributes used in the machine learning analyses is also included in this section. The section entitled Energetics and structure of eutectic NaK describes the thermodynamic properties obtained along the thermal annealing process of the NaK alloy obtained with atomistic simulations and the structural analysis performed to characterize the three different phases detected. The atomic positions in each configuration obtained in this section constitute the dataset that we used for further study. Description of the topological attributes for the machine learning approaches of the dataset is given in the Data analyses of the topological attributes, along with the data-based analyses of principal component, unsupervised learning data clustering, and data classification approaches. Our results highlight the ability of machine learning in analyzing intrinsic thermodynamic behavior, and at the same time providing valuable guidance for inspection of other metal alloys in condensed phases. This work is concluded in Conclusion with a discussion of the results.

Model and Methods

Model potential

The Second Moment Approximation (SMA) model potential is a many-body potential that approximates the local environment of every atom mimicking the distribution of electronic states in a d-band by a bonding term U_el, supplemented with the U_rep Born-Mayer term for the short-range repulsion⁸. The SMA is a classical version of the tight-binding approach. As such, the SMA differs significantly from pair-additive classical models and has a characteristic very soft spheres repulsive wall. The SMA analytical expression is a sum, U_coh = U_rep + U_el, with

$${U}_{rep}=\sum _{i=1}^{N}\,\sum _{j=1,i\ne j}^{N}\,{\varepsilon }_{ij}{e}^{-{p}_{ij}(\tfrac{{r}_{ij}}{{r}_{{0}_{ij}}}-1)}$$

(1)

$${U}_{el}=-\,\sum _{i=1}^{N}\,{\{\sum _{j=1,i\ne j}^{N}{{\zeta }_{ij}}^{2}{e}^{-2{q}_{ij}(\tfrac{{r}_{ij}}{{r}_{{0}_{ij}}}-1)}\}}^{1/2}$$

(2)

where r_ij are the interatomic distances and N is the total number of atoms. The parameters for pure Na and K were developed in previous work⁸. Combination rules were employed for the Na-K pairs, with the geometric mean of the K and Na parameters for ζ₀, ε₀, p, q and a weighted arithmetic mean for ${r}_{{0}_{NaK}}=\frac{{N}_{Na}}{N}{r}_{{0}_{Na}}+\frac{{N}_{K}}{N}{r}_{{0}_{K}}$ (N_K, N_Na being the number of K and Na atoms). Table 1 lists all SMA parameters used in this work.

Table 1 The SMA Parameters for atomic pairs K-K⁸, Na-Na⁸, and Na-K¹⁶.

Full size table

Although this classical modeling of the atomic interactions is not unique, we believe that the parametrization of the SMA is very appropriate for describing soft metals as the NaK alloy.

Metropolis Monte Carlo atomistic simulations

Currently, the science community employing atomistic simulations for researching condensed phases of materials in thermal equilibrium recur primarily to Molecular Dynamics (MD) and Metropolis Monte Carlo (MMC)²⁵, two extensively used methods. Across time, atomistic simulations have become popular because of the research investments to produce software packages that automate the multitude of algorithms needed in these simulations. The increase in popularity of MD over MMC has been driven by the ease to computer parallelize the algorithms for solving the MD underlying ordinary differential equations. On the contrary, implementing Markov Chain Monte Carlo methods have faced the bottleneck of the intrinsically serial Markov chain process²⁶. Over time, novel computational techniques have been developed commensurate with the advances of computer hardware resulting in several MMC packages^27,28,29.

In this work, we employ our in-house MMC implementation²⁷. The MMC algorithm allows calculation of system properties averages at a temperature T by performing an importance sampling of the system states with energy E_i and probability P_i = exp(−E_i/k_BT)/Q. Here, k_B is the Boltzmann constant and Q is the partition function of the system. Each sampled state is a configuration of the system given by the coordinates of all atoms composing the system. The generated sequence of samples are linked through a Markov chain that requires ratios of probabilities between two consecutive samples for transitioning between them. Thus, the algorithm eliminates the need of calculating the partition function Q. The acceptance or rejection for transitioning from state i to state j is given by min(1, P_j/P_i). We used the isobaric-isothermal (NPT) version of the MMC²⁵.

MMC NPT simulations were run for a system of 2000 atoms with periodic boundary conditions and a cutoff radius of 2.381 nm at a constant pressure of 101.325 kPa. At the alloy eutectic concentration, the computational box had 648 sodium atoms and 1352 potassium atoms. The SMA model potential was used to compute the potential energy of the atomic configurations. The initial configuration had the sodium atoms randomly distributed in the sites of a perfect bcc lattice at the pure potassium experimental density. The remaining sites were populated with potassium atoms. A new system configuration is generated once each of the 2000 atoms was attempted a move of a fixed length step in random direction. The magnitude of the atomic movements was dynamically adjusted throughout the simulation to maintain approximately a 50% rejection rate of attempted atomic moves. The volume change of the computational box was attempted once every passage over the 2000 attempted atomic moves at constant volume. Typically, in order to obtain the average values reported in the next section, 2 million passages through the full 2000 atoms were attempted after the system was sought to be in equilibrium.

For assessing cooling rates, we determine a rough estimate of the time equivalent to one MMC passage over all atoms (referred to as lattice-step). A system of N = 2000 potassium atoms was first NPT-equilibrated at 337 K and 101.325 kPa. Next, six NVT runs at the same temperature were run from different initial configurations to collect the atomic mean square displacement (MSD) as a function of lattice-steps. From the ${\rm{M}}{\rm{S}}{\rm{D}}\,({\rm{t}})={\sum }_{m=1}^{6}\,{\sum }_{i=1}^{N}$ ${[{{\bf{r}}}_{{\bf{i}}}(t)-{{\bf{r}}}_{{\bf{i}}}{({t}_{o})}_{m}]}^{2}/(6N)=6{\rm{t}}{{\rm{D}}}_{self}$ and the potassium empirical value³⁰ D_self = 3.59 × 10⁻⁹ m²/s, we very roughly estimate 33 × 10⁶ lattice-steps ≈1 μs in the liquid phase. This estimate will be used in the next section when referring to cooling rates.

Energetics and Structure of Eutectic NaK

The NaK alloy system was NPT equilibrated at 101.325 kPa and a high temperature of 700 K. Next, the system was annealed at constant pressure resulting in two branches of the the enthalpy and system volume below 230 K, as shown in Fig. 1. These two extensive properties of soft materials display such behavior^31,32,33, which is predictable for a eutectic alloy. The lower enthalpy/volume branch was associated to crystalline packing. The higher enthalpy/volume branch corresponds to liquid states that were supercooled below 230 K and became an amorphous solid below approximately 140 K^31,32,33.

Averages in Fig. 1 were calculated over 2 million lattice-steps after the system was equilibrated at each temperature. Using the estimate of 1 MMC lattice-step ≈0.03 ps, the following cooling/warming rates were applied along the annealing process. A first cooling process gave rise to the liquid-amorphous branch of higher enthalpy and volume in Fig. 1 by: (i) cooling from 700 to 150 K at a rate of 287 K/μs, (ii) re-cooling from 170 to 150 K at a slower rate of 100 K/μs enabled the system to reach a state of crystal character at 160 K (lowest temperature the system reached a crystal state from the upper branch), (iii) cooling from 150 to 50 K at 380 K/μs. Secondly, starting from the state with crystal character at 160 K, an annealing process gave rise to the crystalline branch of lower enthalpy and volume in Fig. 1 by: (i) warming-up from 160 to 229 K at 180 K/μs until at 230 K the system reverted to the liquid-amorphous branch (229 K was the highest temperature at which the system remained crystalline), (ii) slow cooling from 229 to 50 K at a rate of 73 K/μs.

In Fig. 1, the amorphous-liquid branch displays a smooth behavior resulting from cooling at various rates. Meanwhile, the crystalline branch shows the two states at 160 K and 229 K between which the warming-up process was performed plus the states resulting from a slow cooling-down from 229 K to 50 K. The insets of Fig. 1 depict the warm-up states with green triangles and the cool-down states in black dots along the crystalline branch evidencing a hysteresis as observed in other alloys³⁴. The annealing hysteresis is narrow in eutectic alloys because both metals melt simultaneously. The temperature above which the system did not remain crystalline while warming-up was 230 K. This temperature is lower than 260 K, the experimental melting temperature¹². On the other hand, the passage from liquid to supercooled liquid occurred gradually in the 240 K to 140 K range. Around 130–140 K the upper branch had an inflection point consistent with an arrest in volume changes in configuration space and the system became a long-lived metastable amorphous solid^31,33. Properties of the liquid were in good agreement with experimental values, indicating that the SMA potential gives a realistic representation of the liquid alloy. The liquid heat capacity C_p = 1019 J(kgK)⁻¹ was calculated from a fit to the enthalpy slope between 290–310 K, agreeing well with the experimental value of 977 J(kgK)⁻¹ at 298 K¹⁴. Additionally, the calculated equilibrium density of the liquid at 350 K was 0.89 g/cm³, comparing well with the experimental value¹⁴ of 0.86 g/cm³. No crystal structure measurements were found in the literature.

Analysis of the pair correlation function reveals clearly the different structural characteristics of the two types of solids, amorphous and crystalline, as illustrated in Figs 2 and 3, respectively. The g_K−K(r) and g_Na−Na(r) were calculated with NVT MMC at 110 K and 1002.5 kg/m³ for the crystalline solid and 992.57 kg/m³ for the amorphous solid. A clear difference between these two solids is seen in the g_K−K(r) 2nd through 4rd peaks, where the peaks in the crystalline solid match well with the bcc lattice, while the amorphous solid displays the characteristic structural loss with a double bumped-second peak. The crystal peaks are broadened because the potassium atoms occupy only 70% of the sample volume and the rest is occupied by sodium atoms. The g_Na−Na(r) shows differences between crystal and amorphous, although not as clear as for the K-K pairs. In Figs 2 and 3, the system snapshots of the two solid systems at 110 K show visually the difference between them. To help the visualization, the snapshots in these figures depict the computational box replicated three times in each Cartesian direction. The crystalline structure in Fig. 2 shows the periodic array of the atoms and the segregated Na cluster (grey) immersed in the potassium matrix (violet). This Na cluster was shaped as a raft with 3–4 atomic layers of thickness, as desiring to form a lamellar structure with large surface area. By contrast, the solid system depicted in Fig. 3 was amorphous throughout the occupied volume, had several segregated smaller Na clusters, and a significantly larger number of isolated Na atoms. Snapshots in Figs. 2, 3 were drawn with Ovito³⁵.

Further analysis of the two solids structure was done at 140 K with the Adaptive Common Neighbor Analysis (a-CNA) algorithm^36,37,38 that yielded a fingerprint of the local environment of each atom. In the crystalline solid, 55% of atoms had a bcc entourage and about 7% were fcc or hcp. The remaining atoms had no crystal symmetries in their local surroundings due to boundary atoms between the K matrix and the Na cluster. In contrast, in the amorphous solid none of the atoms had a bcc crystal entourage, only 10 atoms had fcc-, and 2 had ico- surroundings. Altogether, an insignificant number of atoms in the amorphous solid was characterized by a local surrounding with definite crystal signature.

In summary, the structural analyses provided additional support that two types of solids were obtained during the annealing process as previously differentiated by their thermodynamics. There is no experimental evidence that the eutectic NaK solidifies only as a crystal. Here, we have predicted that a metastable amorphous solid state is also reachable below 140 K along an annealing process. The lower enthalpy branch ranging from 50–230 K pertained to a crystalline matrix of potassium atoms spanning the computational box and encasing an extended cluster of sodium atoms. Meanwhile, the inflection point in the supercooled liquid-amorphous branch around 140 K suggested that below that temperature the system had transitioned to a solid, amorphous state. The transition resembled a glass transition. However, the amorphous solid did not display a homogeneous distribution of Na atoms in the K amorphous matrix as expected in a glass. Instead, an amorphous matrix of K atoms encased several small clusters of Na atoms that were themselves also amorphous. Therefore, we associated the inflection point with a temperature at which the supercooled liquid was viscous enough to transform definitely into a solid with amorphous structure but not necessarily a glass. Such temperature depends on the cooling rate^31,32,33. We refer to it as T_a in following sections.

Data Analysis Based on a Machine Learning Protocol

The current advent of publicly available trajectory data from numerous atomistic simulations drives interest to the implementation of smart tools that could extract information beyond the calculation that generated them. A goal of our data analyses was to find out if a machine learning protocol based exclusively on the knowledge of atomic coordinates collected along the MMC trajectories would provide a plausible physical description without knowing any of the thermodynamics findings described in the previous section. Machine learning analyses require a choice of attributes, also known as descriptors or features. We decided on the use of the Voronoi cells properties as sole attributes for the machine learning strategy described in this section because a Voronoi tessellation only requires knowledge of the coordinates of a set of points.

Topological attributes based on Voronoi tessellation

A Voronoi tessellation is a segmentation of the available space into cells that fill such space densely¹⁷. This tessellation generates groupings of planes into convex polyhedra (Voronoi cells) such that all points on the cell are closer to the central site than to any other site. The resulting intersection of these planes is a Voronoi cell. We defined the machine learning attributes that described the atomic packing topology based on the Voronoi tessellation of the computational box volume, such that the position of each atom in the computational box was at the center of a Voronoi cell. Thus, a characteristic polyhedron enveloped each atom. Voronoi cells are characterized by the total number of faces CN (coordination number) and the number of 3-, 4-, 5-, and 6-edged faces comprising the cell. For example, CNn = 〈n₃, n₄, n₅, n₆〉 specifies a Voronoi cell with n faces, out of which there are n₃ 3-edged faces, n₄ 4-edged faces, etc. Certain Voronoi cells^2,22,39 have proven useful in identifying glasses because they give rise to Z and Ze atomic arrangements where the nearest-neighbor atoms to the central atom are connected with fivefold bonds. We have tracked the actual Voronoi cells instead of their associated polytetrahedral atomic arrangements and labeled them as cells of type Z or Ze according to their classification². All other Voronoi cells were identified by their CN type. Voro++⁴⁰ with periodic boundary conditions was used to generate the tessellation of the atomic configurations saved from the MMC simulation runs. Configurations corresponding to the green points in Fig. 1 insets were not included.

Not only all encountered Voronoi cells were identified, but also several properties derived from them were defined as attributes. For example, attribute f5 was defined as the number of 5-edged cell faces. Along our study, 48 different attributes were found in the structures gathered from the MMC simulation runs. Table 2 provides a list of them. The Voronoi tessellation of the computational box containing 2000 atoms was performed for 18672 saved MMC configurations and the 48 topological attributes were calculated. Configurations from the green points in Fig. 1 inset were not included. Each of the 48 attributes was found a certain number of times in a given configuration, which defined its frequency of occurrence. These occurrence frequencies were entered in a data table of 18672 rows by 48 columns. Therefore, the dataset spanned a 48-dimensional space.

Table 2 Type of the topological attributes used in this study.

Full size table

Attribute selection

Reducing the number of attributes is a basic selection process in machine learning. There are various methods for selecting the most significant attributes based on different types of ranking processes. We used the Laplacian score⁴¹ for ranking the 48 topological attributes described in Table 2. This method acts as a filter for selecting attributes based on their ability of locality preservation. The 26 top Laplacian-ranked attributes and their scores are listed in Table 3. Our machine learning study was based on these 26 attributes such that the data table was reduced to have 18672 rows by 26 columns. The Appendix provides a figure depicting the ten top attributes and the data table is given in the Supplementary Information.

Table 3 The 26 highly ranked topological attributes based on their Laplacian score.

Full size table

Principal component analysis

In order to eliminate correlations among the top 26 attributes listed in Table 3, a principal component analysis (PCA)⁴² was performed. As a result, a set of 26 linearly uncorrelated attributes, called principal components (PC), were obtained. The principal components are linear combinations of the 26 original attributes that maximize their variance. This criterion is equivalent to minimizing the error function defined as the sum of squares in a regression analysis⁴³. The first three PCs, PC1, PC2, PC3, yielded variances of 97%, with contributions of 63%, 32%, and 2%, respectively. A projection of the original 18672 data set onto the planes of the leading PCs allows for a type of data clustering, as visually shown in Fig. 4. An inspection of the data points in each of the two clusters based on what we know from the thermodynamics study, indicates that in the PC1-PC2 plane, the negative values correspond to crystalline structures (depicted blue) and the positive values correspond to the amorphous solid plus liquid data with no clear split between them. This same type of two-cluster split is visual in the PC1–PC3 plot of Fig. 4. Although it is common practice to use this approach for clustering data, clearly such clustering analysis is unable of discerning between the amorphous solid data and the liquid data. The PCA is also used for dimensionality reduction. In our case, there is a clear possibility of reducing the data space dimensions from 26 to 3. Next section describes an unsupervised learning algorithm for clustering the data making use of the reduced dimensionality of the dataset.

Clustering of data

There are several machine learning clustering algorithms for unsupervised learning. We selected the expectation maximization (EM)^44,45, a two phase iterative method to find an estimate of the maximum likelihood of model parameters. The EM algorithm attempts first to find an expected estimate of parameters for defining the log probability of the observed data, followed by a maximization of the log probability with respect to the parameters. The EM is appropriate for our data set because of its ability to create clusters sustaining a disjoint partition of the data when the data can be modeled by a mixture of Gaussian functions. The EM algorithm as implemented in Weka⁴⁶ was adopted. The input attributes for the EM clustering were defined to be the projection of the original dataset onto the three predominant PCs, yielding a data table of 18672 rows by 3 columns. These attributes were linearly uncorrelated. The EM number of clusters to split the data was set to three, maximum number possible with three attributes. The resulting clusters were named cluster-blue, cluster-red, and cluster-black.

For visualizing the clustered data we recurred to identify the temperature at which each of the 18672 configurations was produced and its corresponding enthalpy and system density. Each configuration was given a color depending upon which of the three EM clusters it belonged to, blue for cluster-blue, red for cluster-red and black for cluster-black. Figure 5 shows how the data clustered. We remark that none of these thermodynamic values (temperature, enthalpy, density, volume) entered as attributes in the EM clustering or the PCA. Clearly, we see that cluster-blue corresponds to crystalline structures from our MMC simulations, cluster-black corresponds to amorphous solid structures, and cluster-red contains the liquid and supercooled liquid. Cluster-blue data are sharply separated from the rest, while there is a small region of temperature overlap between cluster-red and cluster-black in the 170–240 K temperature range. Indeed, cluster-red (liquid) has no configurations below 160 K. Likewise cluster-black (amorphous solid) has no configurations above 240 K.

In summary, this unsupervised machine learning approach is very successful. By only inspecting the topological characteristics of the simulated structures, the approach is able to assign the structures to the correct thermodynamics behavior obtained in the traditional simulations. To verify how stable is the data split determined by the clustering method, in the next section we create a data classification model defining three classes as the three clusters obtained in this section.

Random Forests classification

To detect the quality of the clustering data split, we recurred to defining three classes based on the EM clustering and create a model classifier with them. We proceeded to sample a smaller dataset with 1934 points picked randomly from the 18672 data points. A table was constructed with the 3 PC attributes of the 1934-points dataset. This reduced size dataset was used for training a classification model with three classes: liquid, crystal, amorphous, depending upon which EM cluster the samples belonged to. The Random Forest classifier⁴⁷, as implemented in Weka⁴⁶, was used to build our model classifier with 100 random trees, trained with 10-fold validation. The evaluation on the training set had a mean absolute error of 0.002 with no confusion.

Once the 3-class classification model was created with the 1934-dataset, the remaining 16738 points were individually classified using the newly established data model. As a result, the crystalline structures classified with 100% accuracy into the crystal class. Meanwhile there was a minor confusion between the amorphous and the liquid classes, with the liquid class having 99.8% accuracy and the amorphous class a 98.9% accuracy. In total only 55 structures were classified within a class different than the EM cluster to which they belonged. This is shown in the confusion matrix given in Table 4.

Table 4 Confusion matrix for the Random Forest classification model.

Full size table

There are 55 samples out of the confusion matrix diagonal. The 20 samples wrongly classified as liquid were distributed between 180 K and 240 K. Likewise the 35 structures wrongly classified as amorphous solid were spread from 170 K to 250 K. Summarizing, the Random Forest model built with 10% of the available MMC configurations was appropriate for classifying the remaining 90% of the configurations into the three classes that originated from the unsupervised clustering analysis: crystal, amorphous solid, and liquid. Our results demonstrate explicitly the power of machine learning in estimating thermodynamic behavior and simultaneously providing valuable guidance to machine learning of metal alloys condensed phases.

As illustrated in Fig. 5, configurations belonging to the amorphous and liquid classes displayed thermodynamic a smooth temperature dependence in Fig. 1. Therefore, a final top down analysis of these two types of configurations was performed with the EM algorithm to yield a first layer of a hierarchical clustering⁴⁸. The amorphous class displayed a sharp split of samples into two sub-clusters. By identifying each sample with its temperature during the simulation, Fig. 6a illustrates visually the results. The crossing region between130–170 K was identified as the amorphous system transitioning to a volume-arrested amorphous solid. The liquid class displayed a softer split of samples into two clusters, as shown in Fig. 6b. The crossing region between 220–300 K was identified as temperatures where the system was pre-melting and melting. Since the SMA potential of the pure metals predict melting temperature higher than experiment⁸, it is expected that the alloy melting region spreads beyond the experimental 260 K. The system behavior in these transition regions was embedded in the attributes used, which we chose not to inspect before applying the latest clustering. The Appendix includes the temperature behavior of ten top attributes.

As a validation of conclusions in Fig. 6, a moving interface NPT simulation⁴⁹ was implemented by creating an initial hybrid system with half of the computational box containing the crystalline solid configuration at 150 K and the other half with the liquid configuration at several temperatures between 200–250 K. It was clearly seen that for all temperatures below 220 K the crystalline solid prevailed by solidifying the liquid half of the box. Meanwhile for temperatures 220–250 K, the liquid configuration overcame. As the temperature neared 220 K from above, a steep increase in MMC lattice steps were required.

In summary, this machine learning process has revealed the mechanism that the material underwent along annealing only based on the topological attributes generated from the available configurations of the system. This is a remarkable success of domain-based data analytics that opens up the possibility for analyzing the ever increasing number of simulation configurations that are becoming available to the condensed matter community.

Conclusions

To conclude, we have shown that inspection of important thermodynamic properties of materials in the condensed phase is achievable by fusing the notions of topological attributes of the system and machine learning methods. Using a dataset consisting of 18672 configurations obtained from NPT MMC simulations of the NaK eutectic alloy, we have presented a machine learning protocol that allows us to reveal a mapping between independently accessible attributes of a system and its various thermodynamic properties. Firstly, we have shown that three phases of the eutectic NaK alloy can be identified, liquid, crystalline solid, and amorphous solid along the annealing procedure with NPT MMC simulations. From the current simulations, 18672 configurations registering the Cartesian coordinates of 2000 atoms were saved for further analysis. Secondly, based on the Voronoi tessellation, a set of 48 topological attributes were calculated for each configuration. Thirdly, with machine learning techniques these topological attributes were reduced to 26 based on ranking, and further to 3 through principal component analysis. The latter were used to cluster the configurations into three data clusters. These data clusters reproduced almost perfectly the liquid, amorphous, and crystalline condensed phases determined with the simulations. As a fourth step, a verification of the validity of the splitting into data clusters was carried out with the Random Forest classification. Analysis of these classes and the connection to the temperatures at which the configurations were obtained allowed to validate the clustering process and provided a robust estimate of the temperature range at which the system melts, 220–310 K, and at which the system transitions into a amorphous-like solid at 130–170 K.

The methodology presented here is relevant for identifying (or screening) unknown materials with a targeted combination of topological properties in an efficient manner with high fidelity. Our results highlight the ability of machine learning analyses for unraveling the embedded topological aspects of configuration space when inspecting condensed matter systems.

Appendix

Ten of the top 48 topological attributes are depicted in Fig. 7(a,b). Average values per atom of the frequency of occurrence of each attribute at each simulation temperature are given for all attributes except V_K and V_Na. For the latter two, a sum of all cell volumes of type K or Na is divided by the average volume of the full system. Colored points depict configurations belonging to the three EM clusters: liquid (red), crystalline solid (blue), amorphous solid (grey). Attributes in Fig. 7(a) displayed higher values for the crystalline structures below 200 K, while attributes in Fig. 7(b) favored the amorphous solid below 200 K. Note that f5, f7, Z and Ze in Fig. 7(b) have their highest and almost constant value for temperatures below 140 K and decrease at higher temperatures. This observation, plus the visible inflection point of the enthalpy and volume at that temperature, suggested that the supercooled liquid had transitioned to a solid amorphous state below approximately 140 K. The V_K and V_Na illustrate the volume split assigned by the tessellation to the K and Na atoms, respectively. Clearly shown is a Na volume contraction in the amorphous solid with respect to the crystalline solid, while the opposite effect is visible in the K volume.

Data Availability

The Supplementary Information provides the dataset used in this work, with attributes as columns and data points as rows.

References

Chen, H. Glassy metals. Rep. Prog. Phys. 43, 353–432 (1980).
Article ADS MathSciNet Google Scholar
Cheng, Y. & Ma, E. Atomic-level structure and structure–property relationship in metallic glasses. Prog. Mater. Sci. 56, 379–473 (2011).
Article CAS Google Scholar
Johnson, W. L. Bulk amorphous metal-An emerging engineering material. JOM 54, 40–43 (2002).
Article CAS Google Scholar
Behler, J. & Parrinello, M. Generalized Neural-Network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article ADS Google Scholar
Wang, L. Discovering phase transitions with unsupervised learning. Phys. Rev. B 94, 195105 (2016).
Article ADS Google Scholar
Torlai, G. & Melko, R. G. Learning thermodynamics with Boltzmann machines. Phys. Rev. B 94, 165134 (2016).
Article ADS Google Scholar
Carrasquilla, J. & Melko, R. Machine learning phases of matter. Nat. Physics 13, 431–434 (2017).
Article ADS CAS Google Scholar
Li, Y., Blaisten-Barojas, E. & Papaconstantopoulos, D. A. Structure and dynamics of alkali-metal clusters and fission of highly charged clusters. Phys. Rev. B 57, 15519 (1998).
Article ADS CAS Google Scholar
Lu, Z. P., Shen, J., Xing, D. W., Sun, J. F. & Liu, C. T. Binary eutectic clusters and glass formation in ideal glass-forming liquids. App. Phys. Lett. 89, 071910 (2006).
Article ADS Google Scholar
Ma, D., Tan, H., Wang, D., Li, Y. & Ma, E. Strategy for pinpointing the best glass-forming alloys. App. Phys. Lett. 86, 191906 (2005).
Article ADS Google Scholar
Kean, C. H. Pressure-temperature phase diagram of Na-K alloys and the effect of pressure on the resistance of the liquid phase. Phys. Rev. 55, 750–754 (1939).
Article ADS CAS Google Scholar
Foust, O. J. Sodium-NaK Engineering Handbook, vol. 1 (Gordon & Breach, Science Publishers, New York, 1972).
Natesan, K., Reed, C. & Mattas, R. Assessment of alkali metal coolants for the ITER blanket. Fusion Engineering and Design, Proc. 3rd. Inter. Symp. Fusion Nuclear Tech. 27, 457–466 (1995).
CAS Google Scholar
O’Donnell, W., Papanikolaou, P. & Reed, C. The thermophysical and transport properties of eutectic NaK near room temperature. SciTech Connect (Report: Argonne National Lab., IL, USA) (1989).
Aguado, A. & López, J. M. Structure determination in 55-atom Li–Na and Na–K nanoalloys. J. Chem. Phys. 133, 094302 (2010).
Article ADS Google Scholar
Reitz, D. & Blaisten-Barojas, E. Monte carlo study of the crystalline and amorphous NaK alloy. Procedia Comput. Sci. 108, 1215–1221, International Conference on Computational Science, ICCS 2017, 12–14 June 2017, Zurich, Switzerland (2017).
Voronoi, G. Nouvelles applications des paramètres continus à la théorie des formes quadratiques. J. Reine Angew. Math. 133, 97–138 (1908).
MathSciNet MATH Google Scholar
Bernal, J. Geometry of the structure of monatomic liquids. Nature 439, 141–147 (1959).
Article ADS Google Scholar
Rahman, A. Liquid structure and self-diffusion. J. Chem. Phys. 45, 2585–2592 (1966).
Article ADS CAS Google Scholar
Finney, J. L. Random packings and the structure of simple liquids. I. The geometry of random close packing. Proc. R. Soc. London, Ser. A 319, 479–493 (1970).
Article ADS CAS Google Scholar
Nelson, D. R. Order, frustration, and defects in liquids and glasses. Phys. Rev. B 28, 5515–5535 (1959).
Article ADS MathSciNet Google Scholar
Sheng, H. W., Luo, W. K., Alamgir, F. M., Bai, J. M. & Ma, E. Atomic packing and short-to-medium-range order in metallic glasses. Nature 439, 419–425 (2006).
Article ADS CAS Google Scholar
Yang, S., Lach-hab, M., Vaisman, I. I. & Blaisten-Barojas, E. Identifying zeolite frameworks with a machine learning approach. J. Phys. Chem. C 113, 21721 (2009).
Article CAS Google Scholar
Malins, A., Williams, S. R., Eggers, J. & Royall, C. P. Identification of structure in condensed matter with the topological cluster classification. J. Chem. Phys. 139, 234506 (2013).
Article ADS Google Scholar
Frenkel, D. & Smit, B. Understanding Molecular Simulation: from Algorithms to Applications (Academic Press, 2nd edition, 2001).
Editors: Gilks, W. R., Richardson, S. & Spiegelhalter, D. Markov Chain Monte Carlo in Practice (Chapman & Hall, reprinted by CRC Press, Baton Rouge, New York, 1998).
Hall, C., J., W. & Blaisten-Barojas, E. The Metropolis Monte Carlo method with CUDA enabled Graphic Processing Units. J. Comp. Phys. 258, 871–879 (2014).
Article ADS CAS Google Scholar
Shah, J. K. et al. Cassandra: An open source Monte Carlo package for molecular simulation. J. Comput. Chem. 38, 1727–1739 (2017).
Article CAS Google Scholar
Purton, J. A., C. J. & Parker, S. DLMONTE: A general purpose program for parallel Monte Carlo simulation. Molecular Simulation 39, 1240–52 (2013).
Article CAS Google Scholar
Hsieh, M. & Swalin, R. Diffusion studies in liquid potassium and rubidium. Acta Met. 22, 219–226 (1974).
Article CAS Google Scholar
Kauzmann, W. The nature of the glassy state and the behavior of liquids at low temperatures. Chem. Rev. 43, 219–256 (1948).
Article CAS Google Scholar
Debenedetti, P. G. & Stilinger, F. H. Supercooled liquids and the glass transition. Nature 410, 259–267 (2001).
Article ADS CAS Google Scholar
Berthier, L. et al. Configurational entropy measurements in extremely supercooled liquids that break the glass ceiling. PNAS 114, 11356–11361 (2017).
Article ADS CAS Google Scholar
Lopasso, E. M., C. A., Caro, M. & Turchi, P. E. A. Phase diagram of an empirical potential: The case of Fe-Cu. Phys. Rev. B 68, 214205 (2003).
Article ADS Google Scholar
Belonoshko, A. B. & Dubrovinsky, L. S. Molecular dynamics of NaCl (B1 and B2) and MgO (B1) melting: Two-phase simulation. American Mineralogist 81, 303–316 (1996).
Article ADS CAS Google Scholar
Stukowski, A. Visualization and analysis of atomistic simulation data with Ovito the Open Visualization Tool. Modell. Simul. Mater. Sci. Eng. 18, 015012 (2010).
Honeycutt, J. D. & Andersen, H. C. Molecular dynamics study of melting and freezing of small Lennard-Jones clusters. J. Phys. Chem. 91, 4950–4963 (1987).
Article CAS Google Scholar
Faken, D. & Jónsson, H. Systematic analysis of local atomic structure combined with 3D computer graphics. Comput. Mater. Sci. 2, 279–286 (1994).
Article CAS Google Scholar
Stukowski, A. Structure identification methods for atomistic simulations of crystalline materials. Modell. Simul. Mater. Sci. Eng. 20, 045021 (2012).
Article ADS Google Scholar
Cheng, Y. Q., Ma, E. & Sheng, H. W. Atomic level structure in multicomponent bulk metallic glass. Phys. Rev. Lett. 102, 245501 (2009).
Article ADS CAS Google Scholar
Rycroft, C. H. Voro++: A three-dimensional voronoi cell library in c++. Chaos 19, 041111 (2009).
Article ADS Google Scholar
He, X., Cai, D. & Niyogi, P. Laplacian score for feature selection. In Advances in Neural Information Processing Systems, vol. 17 (MIT Press, Cambridge, 2004).
Duterman, G. H. Principal Components Analysis. (SAGE Publications, Newbury Park, 1989).
Google Scholar
Jong, J.-C. & Kotz, S. On a relation between principal components and regression analysis. Am. Stat. 53, 349–351 (1999).
MathSciNet Google Scholar
Dempster, A. P., Laird, N. M. & Rubin, D. B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Series B Stat. Methodol. 39, 1–38 (1977).
MathSciNet MATH Google Scholar
Wu, C. J. On the convergence properties of the em algorithm. Ann. Stat. 11, 95–103 (1983).
Article MathSciNet Google Scholar
Hall, M. et al. The WEKA data mining software: An update. SIGKDD Explor. Newsl. 11, 10–18 (2009).
Article Google Scholar
Breiman, L. Random forests. Machine Learning 45, 5–32 (2001).
Article Google Scholar
Lach-hab, M., Yang, S., Vaisman, I. & Blaisten-Barojas, E. Novel approach for clustering zeolite crystal structures. Molecular Informatics 29, 297–301 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge partial support from the National Science Foundation CHE-0626111 and the Thomas F. & Kate Miller Jeffress Memorial Trust. We are thankful to the Open Access Publishing Fund of George Mason University (GMU) for covering the cost of this publication. Computations were done in the Argo cluster, Office of Research Computing, GMU.

Author information

Authors and Affiliations

Center for Simulation and Modeling (formerly, Computational Materials Science Center) and Department of Computational and Data Sciences, George Mason University, Fairfax, Virginia, 22030, USA
Douglas M. Reitz & Estela Blaisten-Barojas

Authors

Douglas M. Reitz
View author publications
You can also search for this author in PubMed Google Scholar
Estela Blaisten-Barojas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.M.R. and E.B.-B. contributed equally. All authors reviewed the manuscript.

Corresponding author

Correspondence to Estela Blaisten-Barojas.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Dataset 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Reitz, D.M., Blaisten-Barojas, E. Simulating the NaK Eutectic Alloy with Monte Carlo and Machine Learning. Sci Rep 9, 704 (2019). https://doi.org/10.1038/s41598-018-36574-y

Download citation

Received: 12 April 2018
Accepted: 22 November 2018
Published: 24 January 2019
DOI: https://doi.org/10.1038/s41598-018-36574-y

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.