Deep neural networks for accurate predictions of crystal stability

Ye, Weike; Chen, Chi; Wang, Zhenbin; Chu, Iek-Heng; Ong, Shyue Ping

doi:10.1038/s41467-018-06322-x

Published: 18 September 2018

Weike Ye¹,
Chi Chen²,
Zhenbin Wang²,
Iek-Heng Chu² &
…
Shyue Ping Ong ORCID: orcid.org/0000-0001-5726-2587²

Nature Communications volume 9, Article number: 3800 (2018)

Abstract

Predicting the stability of crystals is one of the central problems in materials science. Today, density functional theory (DFT) calculations remain comparatively expensive and scale poorly with system size. Here we show that deep neural networks utilizing just two descriptors, the Pauling electronegativity and ionic radii, can predict the DFT formation energies of C₃A₂D₃O₁₂ garnets and ABO₃ perovskites with low mean absolute errors (MAEs) of 7–10 meV atom⁻¹ and 20–34 meV atom⁻¹, respectively, well within the limits of DFT accuracy. Further extension to mixed garnets and perovskites with little loss in accuracy can be achieved using a binary encoding scheme, addressing a critical gap in the extension of machine-learning models from fixed-stoichiometry crystals to the infinite universe of mixed-species crystals. Finally, we demonstrate the potential of these models to rapidly traverse vast chemical spaces to accurately identify stable compositions, accelerating the discovery of novel materials with potentially superior properties.

Introduction

The formation energy of a crystal is a key metric of its stability and synthesizability. It is typically defined relative to constituent unary/binary phases (E_f) or the stable linear combination of competing phases in the phase diagram (E_hull, or energy above convex hull)¹. In recent years, machine learning (ML) models trained on density functional theory (DFT)² calculations have garnered widespread interest as a means to scale quantitative predictions of materials properties^3,4,5,6,7, including energies of crystals. However, most previous efforts at predicting E_f or E_hull of crystals^5,8,9,10,11,12 using ML models have yielded mean absolute errors (MAEs) of 70–100 meV atom⁻¹, falling far short of the necessary accuracy for useful crystal stability predictions. This is because approximately 90% of the crystals in the Inorganic Crystal Structure Database (ICSD) have E_hull < 70 meV atom⁻¹ (ref. 13), and the errors of DFT-calculated formation energies of ternary oxides from binary oxides relative to experiments are ~24 meV atom⁻¹ (ref. 14).

We propose to approach the crystal stability prediction problem by using artificial neural networks (ANNs)¹⁵, i.e., algorithms that are loosely modeled on the animal brain, to quantify well-established chemical intuition. The Pauling electronegativity and ionic radii guide much of our understanding about the bonding and stability of crystals today, for example, in the form of Pauling’s five rules¹⁶ and the Goldschmidt tolerance factor for perovskites¹⁷. Though these rules are qualitative in nature, their great success points to the potential existence of a direct relationship between crystal stability and these descriptors.

To probe these relationships, we choose, as our initial model system, the garnets, a large family of crystals with widespread technological applications such as luminescent materials for solid-state lighting¹⁸ and lithium superionic conductors for rechargeable lithium-ion batteries^19,20. Garnets have the general formula C₃A₂D₃O₁₂, where C, A and D denote the three cation sites with Wyckoff symbols 24c (dodecahedron), 16a (octahedron) and 24d (tetrahedron), respectively, in the prototypical cubic $Ia\overline 3 d$ garnet crystal shown in Fig. 1a. The distinct coordination environments of the three sites result in different minimum ionic radii ratios (and hence, species preference) according to Pauling’s first rule. We further demonstrate the generalizability of our approach to the ABO₃ perovskites (Fig. 1b), another broad class of technologically important crystals^{21,22,23,24,25}.

In this work, we show that ANNs using only the Pauling electronegativity²⁶ and ionic radii²⁷ of the constituent species as the input descriptors can achieve extremely low MAEs of 7–10 meV atom⁻¹ and 20–34 meV atom⁻¹ in predicting the formation energies of garnets and perovskites, respectively. We also introduce two alternative approaches to extend such ANN models beyond simple unmixed crystals to the much larger universe of mixed cation crystals—a rigorously defined averaging scheme for the electronegativity and ionic radii for modeling complete cation disorder, and a novel binary encoding scheme to account for the effect of cation orderings with minimal increase in feature dimension. Finally, we demonstrate the application of the NN models in accurately and efficiently identifying stable compositions out of thousands of garnet and perovskite candidates, greatly expanding the space for the discovery of materials with potentially superior properties.

Results

Model construction and definitions

We start with the hypothesis that the formation energy E_f of a C₃A₂D₃O₁₂ garnet is some unknown function f of the Pauling electronegativities (χ) and Shannon ionic radii (r) of the species in the C, A, and D sites, i.e.,

$$E_f = f\left( {\chi _{\mathrm{C}},\,r_{\mathrm{C}},\chi _{\mathrm{A}},r_{\mathrm{A}},\,\chi _{\mathrm{D}},r_{\mathrm{D}}} \right)$$

(1)

Here, we define E_f as the change in energy in forming the garnet from binary oxides with elements in the same oxidation states, i.e., $E_f^{\mathrm{oxide}}$, as opposed to the formation energy from the elements, $E_f^{\mathrm{element}}$, more commonly used in previous works^8,9,10,11. Using the Ca₃Al₂Si₃O₁₂ garnet (grossular) as an example, $E_f^{\mathrm{oxide}}$ is given by the energy of the reaction: 3CaO + Al₂O₃ + 3SiO₂ → Ca₃Al₂Si₃O₁₂. This choice of definition of E_f is motivated by two reasons. First, binary oxides are frequently used as synthesis precursors. Second, our definition ensures that garnets that share elements in the same oxidation states have E_f that are referenced to the same binary oxides, minimizing well-known DFT errors. In contrast, $E_f^{\mathrm{element}}$ and E_hull are both poor target metrics for an ML model. $E_f^{\mathrm{element}}$ suffers from non-systematic DFT errors associated with the incomplete cancellation of the self-interaction error in redox reactions²⁸, while E_hull is defined with respect to the linear combination of stable phases at the C₃A₂D₃O₁₂ composition in the C-A-D-O phase diagram, which can vary unpredictably even for highly similar chemistries. Henceforth, the notation E_f in this work refers to $E_f^{\mathrm{oxide}}$ unless otherwise stated. The binary oxides used to calculate the E_f for garnets and perovskites are listed in Supplementary Tables 1 and 2, respectively.
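
As a minimal sketch of this definition, the per-atom formation energy from binary oxides can be computed from total energies as below. The function name and all energy values are hypothetical placeholders chosen for easy arithmetic; they are not DFT results from this work.

```python
def formation_energy_from_oxides(e_product, oxide_energies, coefficients, n_atoms):
    """Return E_f (eV per atom) for the reaction sum_i c_i * oxide_i -> product.

    e_product and oxide_energies are total energies in eV per formula unit;
    n_atoms is the number of atoms in one formula unit of the product.
    """
    e_reactants = sum(c * e for c, e in zip(coefficients, oxide_energies))
    return (e_product - e_reactants) / n_atoms


# Placeholder energies in eV per formula unit (NOT real DFT values).
E_CAO, E_AL2O3, E_SIO2 = -12.0, -37.0, -24.0
E_GROSSULAR = -147.0  # Ca3Al2Si3O12 has 20 atoms per formula unit

# 3 CaO + Al2O3 + 3 SiO2 -> Ca3Al2Si3O12
e_f = formation_energy_from_oxides(
    e_product=E_GROSSULAR,
    oxide_energies=[E_CAO, E_AL2O3, E_SIO2],
    coefficients=[3, 1, 3],
    n_atoms=20,
)
print(f"E_f = {1000 * e_f:.1f} meV/atom")
```

A negative value indicates that, with these placeholder numbers, the garnet is lower in energy than the corresponding mixture of binary oxides.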

Based on the universal approximation theorem²⁹, we may model the unknown function f(χ_C,r_C,χ_A,r_A,χ_D,r_D), which is clearly non-linear (see Supplementary Fig. 1), using a feed-forward ANN, as depicted in Fig. 2. The loss function and evaluation metric are chosen to be the mean squared error (MSE) and MAE, respectively. We will denote the architecture of the ANN using $n^{i}-n^{[1]}-n^{[2]}-\cdots-1$, where $n^{i}$ and $n^{[l]}$ are the numbers of neurons in the input layer and the $l$th hidden layer, respectively.
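
The following NumPy sketch illustrates what a feed-forward network in this notation computes, together with the MSE loss and MAE metric. It is an illustration only: the ReLU hidden activation, the random initialization, and the random inputs are assumptions, since the text here does not specify them, and no training is performed.

```python
import numpy as np

rng = np.random.default_rng(0)


def init_layers(sizes):
    """Create (weights, bias) pairs for an architecture such as [6, 24, 1]."""
    return [
        (rng.normal(scale=0.1, size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(sizes[:-1], sizes[1:])
    ]


def forward(x, layers):
    """Feed-forward pass: ReLU on hidden layers (an assumption), linear output."""
    for w, b in layers[:-1]:
        x = np.maximum(x @ w + b, 0.0)
    w, b = layers[-1]
    return (x @ w + b).ravel()


def mse(y_true, y_pred):
    """Mean squared error, used as the loss function."""
    return float(np.mean((y_true - y_pred) ** 2))


def mae(y_true, y_pred):
    """Mean absolute error, used as the evaluation metric."""
    return float(np.mean(np.abs(y_true - y_pred)))


layers = init_layers([6, 24, 1])   # six descriptors, one hidden layer, one output
x_demo = rng.normal(size=(5, 6))   # five random stand-in descriptor vectors
y_demo = rng.normal(size=5)        # random stand-in targets
y_hat = forward(x_demo, layers)
print(y_hat.shape, mse(y_demo, y_hat), mae(y_demo, y_hat))
```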

Neural network model for unmixed garnets

We developed an initial ANN model for unmixed garnets, i.e., garnets with only one type of species each in C, A, and D. A data set comprising 635 unmixed garnets was generated by performing full DFT relaxation and energy calculations (see Methods) on all charge-neutral combinations of allowed species (Supplementary Table 3) on the C, A, and D sites³⁰. This data set was randomly divided into training, validation, and test data in the ratio of 64:16:20. Using 50 rounds of repeated random sub-sampling cross validation, we find that a 6-24-1 ANN architecture yields a small root mean square error (RMSE) of 12 meV atom⁻¹, as well as the smallest standard deviation in the RMSE among the 50 sub-samples (Supplementary Fig. 2a). The training, validation and test MAEs for the optimized 6-24-1 model are ~7–10 meV atom⁻¹ (Fig. 3a), an order of magnitude lower than the ~100 meV atom⁻¹ achieved in previous ML models^5,8,9,10. For comparison, the error in the DFT E_f of garnets relative to experimental values is around 14 meV atom⁻¹ (Supplementary Table 4). Similar RMSEs are obtained for deep neural network (DNN) architectures containing two hidden layers (Supplementary Fig. 2b), indicating that a single-hidden-layer architecture is sufficient to model the relationship between E_f and the descriptors.

Averaged neural network models for mixed garnets

To extend our model to mixed garnets, i.e., garnets with more than one type of species in the C, A, and D sites, we explored two alternative approaches: one based on averaging of descriptors, and another based on expanding the number of descriptors to account for the effect of species ordering. The data set for mixed garnets was created using the same species pool, but allowing two species to occupy one of the sites. Mixing on the A sites was set at a 1:1 ratio, and that on the C and D sites was set at a 2:1 ratio, generating garnets of the form C₃A′A″D₃O₁₂ (211 compositions), C′C″₂A₂D₃O₁₂ (445 compositions), and C₃A₂D′D″₂O₁₂ (116 compositions). For each composition, we calculated the energies of all symmetrically distinct orderings within a single primitive unit cell of the garnet. All orderings must belong to a subgroup of the $Ia\overline 3 d$ garnet space group.

In the first approach, we characterized each C, A, or D site using weighted averages of the ionic radii and electronegativities of the species present in each site, given by the following expressions (see Methods):

$$r_{\mathrm{avg}} = xr_{\mathrm{X}} + \left( {1 - x} \right)r_{\mathrm{Y}}$$

(2)

$$\chi _{\mathrm{avg}} = \chi _{\mathrm{O}} - \sqrt {x\left( {\chi _{\mathrm{X}} - \chi _{\mathrm{O}}} \right)^2 + \,\left( {1 - x} \right)\left( {\chi _{\mathrm{Y}} - \chi _{\mathrm{O}}} \right)^2}$$

(3)

where X and Y are the species present in a site with fraction x and (1−x), respectively, and O refers to the element oxygen. The implicit assumption in this “averaged” ANN model is that species X and Y are completely disordered, i.e., different orderings of X and Y result in negligible DFT energy differences.
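
Equations 2 and 3 translate directly into code. The sketch below is illustrative: the function names are hypothetical, the Ca/Mg pair at x = 0.5 is chosen only as an example, and the electronegativities are the standard Pauling values (O 3.44, Ca 1.00, Mg 1.31).

```python
import math

CHI_O = 3.44  # Pauling electronegativity of oxygen


def average_radius(r_x, r_y, x):
    """Eq. 2: fraction-weighted linear average of the ionic radii."""
    return x * r_x + (1.0 - x) * r_y


def average_electronegativity(chi_x, chi_y, x, chi_o=CHI_O):
    """Eq. 3: effective electronegativity of a site holding X (x) and Y (1 - x)."""
    spread = x * (chi_x - chi_o) ** 2 + (1.0 - x) * (chi_y - chi_o) ** 2
    return chi_o - math.sqrt(spread)


chi_ca, chi_mg = 1.00, 1.31  # Pauling electronegativities of Ca and Mg
chi_mix = average_electronegativity(chi_ca, chi_mg, x=0.5)
print(f"chi_avg = {chi_mix:.4f}")
```

Because Eq. 3 averages the squared differences from oxygen, the result for cations less electronegative than oxygen is never larger than the simple arithmetic mean of χ_X and χ_Y, and it reduces to χ_X when x = 1.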

Using the same 6-24-1 ANN architecture, we fitted an “averaged” model using the energy of the ground state ordering of the 635 unmixed and 772 mixed garnets. We find that the training, validation, and test MAEs of the optimized model are 22, 26, and 26 meV atom⁻¹, respectively (Supplementary Fig. 3a). These MAEs are about double that of the unmixed ANN model, but still comparable to the error of the DFT E_f relative to experiments. The larger MAEs may be attributed to the fact that the effect of species orderings on the crystal energy is not accounted for in this “averaged” model.

Ordered neural network model for mixed garnets

In the second approach, we undertook a more ambitious effort to account for the effect of species orderings on crystal energy. Here, we discuss the results for species mixing on the C site only, for which the largest number of computed compositions and orderings is available. For 2:1 mixing, there are 20 symmetrically distinct orderings within the primitive garnet cell, which can be encoded using a 5-bit binary array [b₀,b₁,b₂,b₃,b₄]. This binary encoding scheme is significantly more compact than the commonly used one-hot encoding scheme, and hence minimizes the increase in the descriptor dimensionality. We may then modify Eq. 1 as follows:

$$E_f = f\left( {\chi _{C^\prime },\,r_{C^\prime },\,\chi _{C^{\prime\prime} },\,r_{C^{\prime\prime} },\,\chi _A,\,r_A,\;\chi _D,\,r_D,\,b_0,\,b_1,\,b_2,\,b_3,\,b_4} \right)$$

(4)

where the electronegativities and ionic radii of both species on the C sites are explicitly represented. In contrast to the “averaged” model, we now treat the 20 ordering-E_f pairs at each composition as distinct data points. Each unmixed composition was also included as 20 data points with the same descriptor values and E_f, but different binary encodings.
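
A minimal sketch of how such a 13-dimensional input could be assembled is given below. The mapping from an ordering index to bits is an assumption (the plain binary representation of an index from 0 to 19, most significant bit first); the text does not state which bit pattern is assigned to which ordering. The descriptor values are placeholders.

```python
def encode_ordering(index, n_bits=5):
    """Map an ordering index (0..19 here) to a list of bits [b0, ..., b4].

    Assumption: plain binary representation, most significant bit first.
    """
    if not 0 <= index < 2 ** n_bits:
        raise ValueError("index does not fit in n_bits")
    return [int(bit) for bit in format(index, f"0{n_bits}b")]


def one_hot(index, n_classes=20):
    """One-hot encoding of the same index, shown for comparison of length."""
    return [1 if i == index else 0 for i in range(n_classes)]


def build_features(chi_r_pairs, ordering_index):
    """Concatenate (chi, r) for C', C'', A and D with the 5 ordering bits."""
    flat = [value for pair in chi_r_pairs for value in pair]
    return flat + encode_ordering(ordering_index)


# Placeholder (chi, r) pairs for C', C'', A, D; not tabulated values.
site_descriptors = [(1.0, 1.1), (1.3, 0.9), (1.6, 0.5), (1.9, 0.3)]
features = build_features(site_descriptors, ordering_index=19)
print(len(features), features[-5:], len(one_hot(19)))
```

With this scheme the ordering adds 5 inputs to the 8 site descriptors, whereas a one-hot encoding of 20 orderings would add 20.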

We find that a two-hidden-layer DNN is necessary to model this more complex composition-ordering-energy relationship. The final optimized 13-22-8-1 model exhibits overall training, validation and test MAEs of ~11–12 meV atom⁻¹ on the entire unmixed and mixed data set (Supplementary Fig. 3b). The comparable MAEs of this extended DNN model and the unmixed ANN model are clear evidence that the DNN model has successfully captured the additional effect of orderings on E_f. We note that the average standard deviation of the predicted E_f of different orderings of unmixed compositions using this extended DNN model is only 2.8 meV atom⁻¹, indicating that the DNN has also learned the fact that orderings of the same species on a particular site have little effect on the energy. Finally, similar MAEs can be achieved for A and D site mixing (Supplementary Fig. 3c and 3d) using the same approach.
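
To give a sense of model size, the number of trainable parameters implied by an architecture string can be counted as follows. This sketch assumes fully connected layers with one bias per neuron, which is the usual convention but is not stated explicitly in the text; the function name is hypothetical.

```python
def count_parameters(architecture):
    """Count weights and biases of a fully connected net such as '13-22-8-1'."""
    sizes = [int(n) for n in architecture.split("-")]
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes[:-1], sizes[1:]))


for arch in ("6-24-1", "13-22-8-1"):
    print(arch, count_parameters(arch))
```

Under these assumptions the 6-24-1 model has 193 parameters and the 13-22-8-1 model has 501.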

Stability classification of garnets using ANN models

While E_f is a good target metric for a predictive ANN model, the stability of a crystal is ultimately characterized by its E_hull. Using the predicted E_f from our DNN models and pre-calculated DFT data from the Materials Project³¹, we have computed E_hull by constructing the 0 K C-A-D-O phase diagrams. From Fig. 4a, we may observe that the extended C-mixed DNN model achieves >90% accuracy in classifying stable/unstable unmixed garnets at a strict E_hull threshold of 0 meV atom⁻¹, and that the accuracy rises rapidly with increasing threshold. Similarly, high classification accuracies of greater than 90% are achieved for all three types of mixed garnets. Given the great flexibility of the garnet prototype in accommodating different species, there are potentially millions of undiscovered compositions. Even using our restrictive protocol of single-site mixing in specified ratios, 8427 mixed garnet compositions can be generated, of which 2307 are predicted to have E_hull of 0 meV atom⁻¹, i.e., to be potentially synthesizable (Supplementary Fig. 4a). A web application that computes E_f and E_hull for any garnet composition using the optimized DNNs has been made publicly available for researchers at http://crystals.ai.
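
The classification accuracy at a given threshold can be sketched as below. It assumes that a composition is labelled stable when its E_hull is at or below the threshold and that accuracy is the fraction of compositions for which the DFT-based and model-based labels agree; the E_hull values are made up for illustration.

```python
import numpy as np


def stability_accuracy(e_hull_dft, e_hull_pred, threshold=0.0):
    """Fraction of compositions with matching stable/unstable labels.

    A composition is labelled stable if E_hull <= threshold (meV/atom).
    """
    dft_stable = np.asarray(e_hull_dft) <= threshold
    pred_stable = np.asarray(e_hull_pred) <= threshold
    return float(np.mean(dft_stable == pred_stable))


# Made-up E_hull values in meV/atom, for illustration only.
e_hull_dft = [0.0, 0.0, 12.0, 45.0, 130.0]
e_hull_pred = [0.0, 8.0, 15.0, 40.0, 120.0]

acc_strict = stability_accuracy(e_hull_dft, e_hull_pred, threshold=0.0)
acc_loose = stability_accuracy(e_hull_dft, e_hull_pred, threshold=20.0)
print(acc_strict, acc_loose)
```

In this toy example a small prediction error near the hull flips one label at the strict threshold, while a looser threshold absorbs it.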

Neural network models for unmixed and mixed perovskites

To demonstrate that our proposed approach is generalizable and not specific to the garnet crystal prototype, we have constructed similar neural network models using a data set of 240 unmixed, 222 A-mixed and 80 B-mixed ABO₃ perovskites generated using the species in Supplementary Table 5. We find that a 4-12-1 single-hidden-layer neural network is able to achieve MAEs of 21–34 meV atom⁻¹ in the predicted E_f for unmixed perovskites (Fig. 3c), while two 10-24-1 neural networks are able to achieve MAEs of 22–39 meV atom⁻¹ in the E_f of the mixed perovskites (Supplementary Fig. 5). These MAEs are far lower than those of prior ML models of unmixed perovskites, which generally have MAEs of close to 100 meV atom⁻¹ or higher^9,16. As shown in Fig. 3b, the accuracy of classifying stable versus unstable perovskites exceeds 80% at a strict E_hull threshold of 0 meV atom⁻¹ and remains above 70% at a loosened E_hull threshold of 30 meV atom⁻¹. During the review of this work, a new study by Li et al.³² reported achieving comparable MAEs of ~28 meV atom⁻¹ in predicting the E_hull of perovskites using a kernel ridge regression model. However, this performance was achieved using a set of 70 descriptors, with model performance dropping sharply with fewer than 70 descriptors. Furthermore, Li et al.’s model is restricted to perovskites with E_hull < 400 meV atom⁻¹ and only a single ordering for each mixed perovskite, while in this work, the highest E_hull is 747 meV atom⁻¹ for the perovskite data set and all symmetrically distinct orderings on the A and B sites within a √2×√2×1 orthorhombic conventional perovskite unit cell (ten structures each) are considered.

Discussion

To summarize, we have shown that NN models can quantify the relationship between traditionally chemically intuitive descriptors, such as the Pauling electronegativity and ionic radii, and the energy of a given crystal prototype. A key advantage of our proposed NN models is that they rely only on an extremely small number (two) of site-based descriptors, i.e., no structural degrees of freedom are considered beyond the ionic radii of a particular species in a site and the ordering of the cations in the mixed oxides. This is in stark contrast to most machine-learning models in the literature utilizing a large number of correlated descriptors, which render such models highly susceptible to overfitting, or machine-learning force-fields, which can incorporate structural and atomic degrees of freedom but at a significant loss of transferability to different compositions. Most importantly, we derive two alternative approaches—a rigorously defined averaging scheme to model complete cation disorder and a binary encoding scheme to account for the effect of orderings—to extend high-performing unmixed deep learning models to mixed cation crystals with little/no loss in error performance and minimal increase in descriptor dimensionality. It should be noted that our NN models are still restricted to the garnet and perovskite compositions (with or without cation mixing) with no vacancies, though further extensions to other common crystal structure prototypes and to account for vacancies should in principle be possible. Finally, we show how predictive models of E_f can be combined with existing large public databases of DFT computed energies to predict E_hull and hence, phase stability. These capabilities can be used to efficiently traverse large chemical spaces of unmixed and mixed crystals to identify stable compositions and orderings, greatly accelerating the potential for novel materials discovery.

Methods

DFT calculations

All DFT calculations were performed using the Vienna ab initio simulation package (VASP) within the projector augmented-wave approach^33,34. Calculation parameters were chosen to be consistent with those used in the Materials Project, an open database of pre-computed energies for all known inorganic materials³¹. The Perdew-Burke-Ernzerhof generalized gradient approximation exchange-correlation functional³⁵ and a plane-wave energy cut-off of 520 eV were used. Energies were converged to within 5 × 10⁻⁵ eV atom⁻¹, and all structures were fully relaxed. For mixed compositions, symmetrically distinct orderings within the 80-atom primitive garnet unit cell and the 40-atom √2×√2×1 orthorhombic perovskite supercell were generated using the enumlib library³⁶ via the Python Materials Genomics package³⁷.
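
The short calculation below illustrates why a symmetry-aware enumeration is needed. It counts the cation sites in the 80-atom primitive garnet cell and the number of raw ways to place the minority species for single-site mixing, before any symmetry reduction. This is simple combinatorics, not a substitute for enumlib, and the variable names are hypothetical.

```python
from math import comb

ATOMS_PER_FORMULA = 3 + 2 + 3 + 12        # C3 A2 D3 O12 -> 20 atoms
formula_units = 80 // ATOMS_PER_FORMULA   # formula units in the 80-atom cell

n_c_sites = 3 * formula_units             # C sites
n_a_sites = 2 * formula_units             # A sites
n_d_sites = 3 * formula_units             # D sites

# 2:1 mixing on C or D puts the minority species on one third of the sites;
# 1:1 mixing on A puts each species on half of the sites.
raw_c = comb(n_c_sites, n_c_sites // 3)
raw_a = comb(n_a_sites, n_a_sites // 2)
raw_d = comb(n_d_sites, n_d_sites // 3)
print(n_c_sites, raw_c, n_a_sites, raw_a, n_d_sites, raw_d)
```

For C-site mixing, symmetry reduces the raw arrangements counted here to the 20 symmetrically distinct orderings used in the ordered model.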

Training of ANNs

Training of the ANNs was carried out using the Adam optimizer³⁸ at a learning rate of 0.2, with the mean square error of E_f as the loss metric. For each architecture, we ran with a random 64:16:20 split of training, validation and test data, i.e., random sub-sampling cross validation.
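
The random sub-sampling scheme can be sketched with NumPy as follows. The rounding rule (truncating the training and validation sizes and assigning the remainder to the test set), the seed, and the function name are assumptions made for this illustration.

```python
import numpy as np


def random_split(n_samples, rng, fractions=(0.64, 0.16)):
    """Shuffle indices and cut them into training, validation and test sets."""
    indices = rng.permutation(n_samples)
    n_train = int(fractions[0] * n_samples)
    n_val = int(fractions[1] * n_samples)
    return (
        indices[:n_train],
        indices[n_train:n_train + n_val],
        indices[n_train + n_val:],
    )


rng = np.random.default_rng(42)
# 50 independent random splits of the 635 unmixed garnets.
splits = [random_split(635, rng) for _ in range(50)]
train_idx, val_idx, test_idx = splits[0]
print(len(train_idx), len(val_idx), len(test_idx))
```

Each split would be used to train and evaluate one model, and the spread of the resulting errors across the splits indicates how sensitive an architecture is to the particular partition of the data.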

Electronegativity averaging

Pauling’s definition of electronegativity is based on an “additional stabilization” of a heteronuclear bond X–O compared to average of X–X and O–O bonds, as follows.

$$\left( {\chi _{\mathrm{X}} - \chi _O} \right)^2 = E_d\left( {{\mathrm{XO}}} \right) - \frac{{E_d\left( {{\mathrm{XX}}} \right) + E_d\left( {{\mathrm{OO}}} \right)}}{2}$$

where χ_X and χ_O are the electronegativities of species X and O, respectively, and E_d is the dissociation energy of the bond in parentheses. Here, O refers to oxygen.

For a disordered site containing species X and Y in the fractions x and (1−x), respectively, we obtain the following:

$$\begin{aligned}\left( {\chi _{\mathrm{X}_x\mathrm{Y}_{1 - x}} - \chi _{\mathrm{O}}} \right)^{2} &= xE_d\left( {{\mathrm{XO}}} \right) + \left( {1 - x} \right)E_d\left( {{\mathrm{YO}}} \right) - \frac{{xE_d\left( {{\mathrm{XX}}} \right) + \left( {1 - x} \right)E_d\left( {{\mathrm{YY}}} \right) + E_d\left( {{\mathrm{OO}}} \right)}}{2}\\ &= x\left( {\chi _{\mathrm{X}} - \chi _{\mathrm{O}}} \right)^2 + \left( {1 - x} \right)\left( {\chi _{\mathrm{Y}} - \chi _{\mathrm{O}}} \right)^2\end{aligned}$$

We then obtain the effective electronegativity for the disordered site as follows:

$${\mathrm{\chi }}_{\mathrm{X}_x\mathrm{Y}_{1 - x}} = \chi _{\mathrm{O}} - \sqrt {x\left( {\chi _{\mathrm{X}} - \chi _{\mathrm{O}}} \right)^2 + \left( {1 - x} \right)\left( {\chi _{\mathrm{Y}} - \chi _{\mathrm{O}}} \right)^2}$$

Data availability

The datasets generated during and/or analysed during the current study are available in the GitHub repository https://github.com/materialsvirtuallab/garnetdnn as well as the Dryad Digital Repository (doi: 10.5061/dryad.760r5b6). A web application that estimates E_f and E_hull for any given garnet or perovskite composition using the optimized DNNs is available at http://crystals.ai/.

References

1. Ong, S. P., Wang, L., Kang, B. & Ceder, G. Li−Fe−P−O₂ phase diagram from first principles calculations. Chem. Mater. 20, 1798–1807 (2008).
2. Hohenberg, P. & Kohn, W. Inhomogeneous electron gas. Phys. Rev. 136, B864 (1964).
3. Pilania, G., Wang, C., Jiang, X., Rajasekaran, S. & Ramprasad, R. Accelerating materials property predictions using machine learning. Sci. Rep. 3, 2810 (2013).
4. Lee, J., Seko, A., Shitara, K., Nakayama, K. & Tanaka, I. Prediction model of band gap for inorganic compounds by combination of density functional theory calculations and machine learning techniques. Phys. Rev. B 93, 115104 (2016).
5. Schmidt, J. et al. Predicting the thermodynamic stability of solids combining density functional theory and machine learning. Chem. Mater. 29, 5090–5103 (2017).
6. Pilania, G. et al. Machine learning bandgaps of double perovskites. Sci. Rep. 6, 19375 (2016).
7. Isayev, O. et al. Universal fragment descriptors for predicting properties of inorganic crystals. Nat. Commun. 8, 15679 (2017).
8. Meredig, B. et al. Combinatorial screening for new materials in unconstrained composition space with machine learning. Phys. Rev. B 89, 094104 (2014).
9. Faber, F. A., Lindmaa, A., Von Lilienfeld, O. A. & Armiento, R. Machine learning energies of 2 million elpasolite (ABC₂D₆). Phys. Rev. Lett. 117, 135502 (2016).
10. Ward, L., Agrawal, A., Choudhary, A. & Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. npj Comput. Mater. 2, 16028 (2016).
11. Ward, L. et al. Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations. Phys. Rev. B 96, 024104 (2017).
12. Seko, A., Hayashi, H., Nakayama, K., Takahashi, A. & Tanaka, I. Representation of compounds for machine-learning prediction of physical properties. Phys. Rev. B 95, 144110 (2017).
13. Sun, W. et al. The thermodynamic scale of inorganic crystalline metastability. Sci. Adv. 2, e1600225 (2016).
14. Hautier, G., Ong, S. P., Jain, A., Moore, C. J. & Ceder, G. Accuracy of density functional theory in predicting formation energies of ternary oxides from binary oxides and its implication on phase stability. Phys. Rev. B - Condens. Matter Mater. Phys. 85, 155208 (2012).
15. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
16. Pauling, L. The principles determining the structure of complex ionic crystals. J. Am. Chem. Soc. 51, 1010–1026 (1929).
17. Goldschmidt, V. M. Die Gesetze der Krystallochemie. Naturwissenschaften 14, 477–485 (1926).
18. Nakamura, S. Present performance of InGaN-based blue/green/yellow LEDs. Proc. SPIE 3002, 26–35 (1997).
19. O’Callaghan, M. P., Lynham, D. R., Cussen, E. J. & Chen, G. Z. Structure and ionic-transport properties of lithium-containing garnets Li₃Ln₃Te₂O₁₂ (Ln = Y, Pr, Nd, Sm−Lu). Chem. Mater. 18, 4681–4689 (2006).
20. Peng, H., Wu, Q. & Xiao, L. Low temperature synthesis of Li₅La₃Nb₂O₁₂ with cubic garnet-type structure by sol–gel process. J. Sol.-Gel Sci. Technol. 66, 175–179 (2013).
21. Kobayashi, K.-I., Kimura, T., Sawada, H., Terakura, K. & Tokura, Y. Room-temperature magnetoresistance in an oxide material with an ordered double-perovskite structure. Nature 395, 677–680 (1998).
22. Cava, R. J. et al. Bulk superconductivity at 91 K in single-phase oxygen-deficient perovskite Ba₂YCu₃O_{9−δ}. Phys. Rev. Lett. 58, 1676–1679 (1987).
23. Cohen, R. E. Origin of ferroelectricity in perovskite oxides. Nature 358, 136–138 (1992).
24. Grinberg, I. et al. Perovskite oxides for visible-light-absorbing ferroelectric and photovoltaic materials. Nature 503, 509–512 (2013).
25. Green, M. A., Ho-Baillie, A. & Snaith, H. J. The emergence of perovskite solar cells. Nat. Photonics 8, 506–514 (2014).
26. Pauling, L. The nature of the chemical bond. IV. The energy of single bonds and the relative electronegativity of atoms. J. Am. Chem. Soc. 54, 3570–3582 (1932).
27. Shannon, R. D. Revised effective ionic radii and systematic studies of interatomic distances in halides and chalcogenides. Acta Cryst. A32, 751–767 (1976).
28. Wang, L., Maxisch, T. & Ceder, G. Oxidation energies of transition metal oxides within the GGA + U framework. Phys. Rev. B 73, 195107 (2006).
29. Hornik, K. Approximation capabilities of multilayer feedforward networks. Neural Netw. 4, 251–257 (1991).
30. Granat-struktur, D., Ubersicht, E., Kationen, D. & Ionenverteilung, D. Crystal chemistry of the garnet. Z. für Krist. - Cryst. Mater. 47, 1989 (1999).
31. Jain, A. et al. Commentary: the materials project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
32. Li, W., Jacobs, R. & Morgan, D. Predicting the thermodynamic stability of perovskite oxides using machine learning models. Comput. Mater. Sci. 150, 454–463 (2018).
33. Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
34. Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
35. Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
36. Hart, G. L. W., Nelson, L. J. & Forcade, R. W. Generating derivative structures at a fixed concentration. Comput. Mater. Sci. 59, 101–107 (2012).
37. Ong, S. P. et al. Python materials genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
38. Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://arxiv.org/pdf/1412.6980 (2016).
39. Chollet, F. et al. Keras. http://keras.io (2015).
40. Abadi, M. M. et al. TensorFlow: large-scale machine learning on heterogeneous distributed systems. http://www.tensorflow.org/ (2015).

Acknowledgements

This work is supported by the Samsung Advanced Institute of Technology (SAIT)’s Global Research Outreach (GRO) Program. The authors also acknowledge data and software resources provided by the Materials Project, funded by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering Division under Contract No. DE-AC02-05-CH11231: Materials Project program KC23MP, and computational resources provided by Triton Shared Computing Cluster (TSCC) at the University of California, San Diego, the National Energy Research Scientific Computing Centre (NERSC), and the Extreme Science and Engineering Discovery Environment (XSEDE) supported by National Science Foundation under Grant No. ACI-1053575. The authors would also like to express their gratitude to Professors Darren Lipomi and David Fenning from the University of California, San Diego, and Dr Anubhav Jain from Lawrence Berkeley National Laboratory for helpful comments on the manuscript.

Author information

Authors and Affiliations

Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Dr, Mail Code 0303, La Jolla, CA, 92093-0448, USA
Weike Ye
Department of NanoEngineering, University of California San Diego, 9500 Gilman Dr, Mail Code 0448, La Jolla, CA, 92093-0448, USA
Chi Chen, Zhenbin Wang, Iek-Heng Chu & Shyue Ping Ong

Contributions

S.P.O., W.Y. and C.C. proposed the concept. W.Y. carried out the calculations and analysis with help from C.C., Z.W. and I.C. W.Y. prepared the initial draft of the manuscript. All authors contributed to the discussions and revisions of the manuscript.

Corresponding author

Correspondence to Shyue Ping Ong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ye, W., Chen, C., Wang, Z. et al. Deep neural networks for accurate predictions of crystal stability. Nat Commun 9, 3800 (2018). https://doi.org/10.1038/s41467-018-06322-x

Received: 30 July 2018
Accepted: 27 August 2018
Published: 18 September 2018
DOI: https://doi.org/10.1038/s41467-018-06322-x
