Crystal structure prediction by combining graph network and optimization algorithm

Cheng, Guanjian; Gong, Xin-Gao; Yin, Wan-Jian

doi:10.1038/s41467-022-29241-4

Download PDF

Article
Open access
Published: 21 March 2022

Crystal structure prediction by combining graph network and optimization algorithm

Nature Communications volume 13, Article number: 1492 (2022) Cite this article

18k Accesses
36 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Crystal structure prediction is a long-standing challenge in condensed matter and chemical science. Here we report a machine-learning approach for crystal structure prediction, in which a graph network (GN) is employed to establish a correlation model between the crystal structure and formation enthalpies at the given database, and an optimization algorithm (OA) is used to accelerate the search for crystal structure with lowest formation enthalpy. The framework of the utilized approach (a database + a GN model + an optimization algorithm) is flexible. We implemented two benchmark databases, i.e., the open quantum materials database (OQMD) and Matbench (MatB), and three OAs, i.e., random searching (RAS), particle-swarm optimization (PSO) and Bayesian optimization (BO), that can predict crystal structures at a given number of atoms in a periodic cell. The comparative studies show that the GN model trained on MatB combined with BO, i.e., GN(MatB)-BO, exhibit the best performance for predicting crystal structures of 29 typical compounds with a computational cost three orders of magnitude less than that required for conventional approaches screening structures through density functional theory calculation. The flexible framework in combination with a materials database, a graph network, and an optimization algorithm may open new avenues for data-driven crystal structural predictions.

Topological representations of crystalline compounds for the machine-learning prediction of materials properties

Article Open access 05 February 2021

A geometric-information-enhanced crystal graph network for predicting properties of materials

Article Open access 06 September 2021

Regularized machine learning on molecular graph model explains systematic error in DFT enthalpies

Article Open access 13 July 2021

Introduction

Predicting crystal structure at a given chemical composition prior to experimental synthesis has attracted significant interest in condensed matter science. Earlier attempts based on empirical rules provided qualitative descriptions of structures, for example, Pauling’s five rules for ionic crystals¹, Goldschmidt’s tolerance factor for perovskite formability², and dimensional descriptors to classify the zinc-blend (ZB)/wurtzite (WZ) and rock-salt (RS) structures for binary semiconductor compounds^3,4. Owing to reliable energy calculation via density functional theory (DFT), the current state-of-the-art approaches for crystal structure prediction (CSP) mainly combine DFT calculations with structural searching algorithms such as (quasi-) random search^5,6, simulated annealing⁷, genetic algorithm^8,9,10,11, particle-swarm optimization (PSO)^12,13, and differential evolutionary process¹⁴. These approaches extensively explore the structural candidates via searching algorithms and by adopting DFT-calculated energy as a stability metric. The necessary DFT calculations involve the evaluation of numerous structural candidates in the process of structure searching and are thus time-consuming. For example, 70 and 120 DFT structural optimizations are required to determine the ZB structure of GaAs (eight atoms in the cell)¹⁰ and α-quartz structure of SiO₂ (six atoms in the cell)¹², respectively.

The advancement of machine learning (ML) in materials science has recently focused on its applications in predicting materials properties such as formation enthalpies (ΔH)^15,16, Gibbs free energies¹⁷, bandgaps^18,19, wave function and electron density²⁰, X-ray absorption spectra²¹, and phase transitions²². The accuracy of this approach is close to that of quantum mechanics calculations; however, the computational costs are orders of magnitude lower. In addition to the influence of compositional atoms, the influence of their spatial arrangement, i.e., crystal structure, on materials properties has recently been analyzed via structural characterizing approaches such as the Wyckoff-species matrix-based method²³, Voronoi tessellation method²⁴, and graph network^18,19. A crystal (Crys) can be represented by a vector ({v_i}_i=1,N, {R_i}_i=1,N, L), where {v_i} and {R_i} are elemental features and coordinates of the ith atom, N is the total number of atoms in a periodic cell, and L is the vector (a, b, c, α, β, γ) defining the cell shape. In these approaches, crystal structures are transformed to a physically meaningful and algorithm-readable data formats, such as a symmetry-invariant matrix²³, bond configurations²⁴, or crystal graphs¹⁸, enabling the establishment of a correlation model between a crystal and its formation enthalpy as follows:

$$\varDelta H=f({{{{{\rm{Crys}}}}}}({\{{{{{{{\bf{v}}}}}}}_{i}\}}_{i=1,N},{\{{{{{{{\bf{R}}}}}}}_{i}\}}_{i=1,N},{{{{{\bf{L}}}}}}))$$

(1)

In principle, CSP can be efficiently performed using Eq. (1) by optimizing ({R_i}_i=1,N, L) to minimize ΔH at a given {v_i}_i=1,N. This approach replaces DFT calculations with the ML model; therefore, it has the potential to significantly accelerate the CSP.

Despite this potential advantage, the practical approach of ML-based CSP still has challenges²⁵. First, the ML model should have a sensitive response to the crystal structure; therefore, the fixed-structure model^15,26 and symmetry-invariant model²³, which have a constraint on the crystal structures, are inapplicable or limited in determining the ground state structure (GSS) that may have arbitrary cell shape and atomic coordinates. Second, the high accuracy of DFT calculations benefit from the systematic cancellation of errors relative to the experiment, and the claimed DFT-level accuracies of the ML models are obtained from training data composed of stable crystal structures²⁷. The extension of ML models to structural searching is questionable because most structural candidates in the searching process are metastable or unstable, and their relative energies are crucial in determining the GSS. Finally, an appropriate optimization algorithm compatible with the ML model is required.

In this study, we constructed a framework that establishes a graph network (GN) model between crystal structures and their formation enthalpies at the given database, and this GN model was then combined with an optimization algorithm (OA) for CSP. The framework (a database + a GN model + an OA) is flexible that allows variance in materials database, crystal graph representation, and OA. In this study, we adopted GN developed by Chen et al.¹⁹ as it was designed for both molecules and crystals, facilitating the future extension of the framework to molecules. The Open Quantum Materials Database (OQMD)²⁸ of version 1.3 and Matbench dataset of formation energy (MatB)²⁹, have been used separately to train the GN model and random searching (RAS), PSO and Bayesian optimization (BO) has been implemented as OAs. The performance of different combinations have been investigated and compared to predict the crystal structures of 29 octet binary compounds as listed in Table 1, including group IV crystals (C, Si), group I–VII crystal (I = Li, Na, K, Rb, Cs; VII = F, Cl), group II–VI crystal (II = Be, Mg, Ca, Sr, Ba, Zn, Cd; VI = O, S) and typical photovoltaic semiconductors GaAs, CdTe and CsPbI₃ (an inorganic representative for perovskite photovoltaics). The comparative studies show that the GN model trained on MatB combined with BO, i.e., GN(MatB)-BO, can predict crystal structures with the best accuracy and extremely low computational cost. The flexibility of graph network, database, and optimization algorithm in the approach facilitate further development and improvement of this approach. This study may open a new avenue for data-driven crystal structural prediction.

Table 1 The performance of GN-OA with different combinations of databases (OQMD and MatB) and optimization algorithms (RAS, PSO, BO) for crystal structure prediction of 29 typical compounds.

Full size table

Results

Crystal graph

In the original GN³⁰, a graph is defined by three ingredients, i.e., nodes (v_i), edges connecting nodes (e_k), and the global attributes (u), which are naturally borrowed to crystal graph as atoms, pairs, and macroscopic attributes (e.g., pressure, temperature)^19,31. Considering that multiple atoms and pairs exist in a crystal, crystal graph is numerically represented by G({v_i}_i=1,nv, {e_k}_k=1:ne, u), where v_i and e_k are the elemental and pair attributes of ith atom and kth pair, and nv and nk are the number of atoms and pairs, respectively, in the cell. In MEGNet¹⁹, v and e are the atomic numbers and spatial distance, represented by N_v- and N_e-dimensional vectors (N_v and N_e are hyperparameters) learned from model training, respectively. Accordingly, an embedding layer with a N_v × nv matrix (Fig. 1c) was added after atomic attribute {v_i} as input for GN (Fig. 1j). A nv × nv × N_e matrix (Fig. 1d) was added after {e_k} (Fig. 1k), where nv × nv represents the pair connectivity between two atoms and each pair is represented by an expanded distance with Gaussian basis numerically represented by N_e points. In comparison to the fixed features, N_v- and N_e- dimensional vectors can be considered as elemental and pair features that were learned during the model training process. The learned elemental embeddings have been shown to encode the elemental periodicity and can be transferred to predict different properties¹⁹.

**Fig. 1: Flowchart of GN-OA approach.**

Database and data split

Two benchmark datasets, OQMD of version 1.3²⁸, and MatB²⁹ have been used for GN model training and evaluation. For OQMD, data cleaning was performed to exclude data with incomplete information and restrictions: (i) the number of atoms in the unit cell (<50), (ii) PBE as exchange-correlation functional, and (iii) kinetic energy cutoff (520 eV), making data as reliable and comparable as possible. Accordingly, more than 320,000 data points have been obtained, including ~40,000 experimentally known ones and ~280,000 hypothetical ones, covering 85 elements, 7 lattice systems, and 167 space groups. For MatB, we used the Matbench v0.1 dataset²⁹ that is derived from data cleaning in Materials Project. For properties of formation energy, it included ~132,000 data points, covering 84 elements, 7 lattice systems, and 227 space groups. The distributions of the number of elements and atoms in each database are shown in Fig. 1a, b. For both OQMD and MatB, the same ratio of data split has been adopted, i.e., training set (50%), validation set (12.5%), and test set (37.5%), to construct GN models for CSP. In all the training, validation, and test process, the data of 29 binary compounds studied in this work, have been excluded.

GN model

As shown in Fig. 1h, the GN model was constructed to establish the correlation between the crystals and their formation enthalpies¹⁹. Crystal graph represented by matrix {v_i} (Fig. 1c, j) and {e_k} (Fig. 1d, k) are the input of GN model and formation enthalpy ΔH is the output. There could be m MEGNet layers (m is hyperparameter) that make up the MEGNet blocks (Fig. 1l) to update the matrix {v_i} and {e_k}. The set2set layers (Fig. 1m, n) are used to learn a representation vector from the matrix {v_i} and {e_k}. Then use the concatenate layer (Fig. 1o) to combine these vectors, go through a fully connected layer (Fig. 1p) composed of l dense layers (l is hyperparameter), and get ΔH (Fig. 1q). Since the symmetries and invariances are included in the current GN model and the pair features are established on the connectivity between two atoms. Cell rotation or symmetry permutation would not change the features, and thus, the GN model. We train GN model using the data in two respective databases, i.e., OQMD²⁸, and MatB²⁹, leading to two different GN models, GN(OQMD) and GN(MatB). By optimizing hyperparameters in Supplementary Table 1, the best performing one in each model was selected to minimize the errors between the GN-predicted and DFT-calculated ΔH on the test set as the results shown in Fig. 2. The results show that GN(OQMD) has less MAE (16.07 meV/atom) than GN(MatB) (31.66 meV/atom). MAE of GN(MatB) is close to the previous report of 32.7 meV/atom²⁹. Such a tiny difference of 1 meV on the same MatB dataset may originate from different data split. The insets in Fig. 2 show a systematic decrease of the MAE as the number of training data. Better performance of OQMD can be ascribed to its larger database (~320,000 DFT-calculated data for inorganic compounds), which is more than twice than MatB. Despite less MAE of GN(OQMD), as shown later, its performance on CSP is inferior to GN(MatB), indicating possible overfitting of GN(OQMD).

Symmetry constraint

The wealth of experimental data shows that most of the crystal structures at low temperature have symmetry operations³² and adding symmetry constraint would accelerate CSP. Meanwhile, most crystal structures in training data, either OQMD or MatB, are symmetrical (with space group spanning from P2 to P230). In this work, we treat CSP with symmetry constraint, by adding two additional structural features, crystal symmetry S and the occupancy of Wyckoff position W_i for the ith atom, which are chosen through 229 space groups and associated 1506 Wyckoff positions³³. As shown in Fig. 1e, the procedure firstly chose a symmetry S among P2 to P230 and then the lattice parameters L are generated within the chosen symmetry. Secondly, a combination of Wyckoff positions {W_i} at given symmetry is selected to meet the number of atoms given in a cell. The atomic coordinates {R_i} are then given by the selected Wyckoff position {W_i} and lattice parameters L. Space group S and corresponding {W_i} are variables upon optimizations during CSP with symmetry constraint to generate Crys({v_i}, S, {W_i}, {R_i}, L) (Fig. 1f). For practical implementation, we added an additional constraint (4.0 V_a > V > 1.0 V_a, where V_a is the volume sum of compositional atoms) to avoid the generation of unreasonable structures with extremely small or large volumes.

Optimization algorithms

CSP is an optimization problem to identify S, {W_i}, {R_i}, and L at a given chemical composition {v_i} to minimize ΔH. After constructing crystal structure Crys({v_i}, S, {W_i}, {R_i}, L) (Fig. 1f), a structural analysis is performed and convert Crys({v_i}, S, {W_i}, {R_i}, L) to crystal graph G({v_i},{e_k}) (Fig. 1g) and its formation enthalpy is obtained by GN model ΔH = f [G({v_i},{e_k})] (Fig. 1h).

Ideally, if one could enumerate all possible crystal structure Crys({v_i}, S, {W_i}, {R_i}, L), do crystal graph conversions to G({v_i},{e_k}), obtain their formation enthalpy by GN model ΔH = f [G({v_i},{e_k})], the problem of CSP was simply solved by choosing the crystal structures with the lowest ΔH. However, the enumeration of all possible structures is a long-standing challenge. Here, we adopted three OAs (Fig. 1i), RAS, PSO, and BO, since RAS and PSO are successful algorithms applied in DFT-based CSP^5,12 and BO has been shown to be compatible with the black-box ML model, demonstrating a great capability to identify the global minimum^34,35, and has been recently combined with DFT calculations for CSP in fixed crystal systems³⁶. Here, we applied BO via a Gaussian mixture model based on the tree of Parzen estimators (TPE)³⁷ to explore the structural space. Compared to the normal BO algorithm based on the Gaussian process, which performs better in low-dimensional space (number of features <20), the TPE-based Gaussian mixture model demonstrated higher efficiency in high-dimensional space³⁷.

As shown in the right panel of Fig. 1, for a given number of atoms in a cell, n initial structures Crys({v_i}, S, {W_i}, {R_i}, L) were randomly generated, and their corresponding elemental and pair attributes were obtained by structural analysis to convert the crystal structure to crystal graph G({v_i},{e_j}). Accordingly, ΔH’s were predicted using the GN model to obtain n pairs of (Crys, ΔH_Crys). After that, the approach will iteratively go the loop of structural searching from Fig. 1f–i and then back to 1f by OA. Different OAs will generate new structures in Fig. 1i, f in different ways. For RAS, the new structure was generated in a stochastic way and did not depend on the searching history. For PSO, in each iteration, a set of n structures were generated as a new generation by tracking two extremes (Crys, ΔH_Crys) values (pbest and gbest)³⁸. We used scikit-opt (https://github.com/guofei9987/scikit-opt) and choose the momentum parameter ω as 0.8, the cognitive and social parameters are 0.5 and 0.5, respectively. For BO, a new structure with potentially low ΔH was recommended and the recommendation model was re-trained based on all previous pairs of (Crys, ΔH_Crys) in a manner of active learning. We employ TPE-based BO as implemented in Hyperopt³⁷, (https://github.com/hyperopt/hyperopt) and choose observation quantile γ as 0.25³⁴ and a maximum number of trails to 200.

Applications

The GN-OA approach was then applied to identifying the crystal structures of 29 compounds listed in Table 1. There are more than 300 types of prototype structures for AB compounds²⁸; two representatives of these are tetrahedral-coordination ZB/WZ and octahedral-coordination RS structures. Predicting ZB/WZ and RS structures proves the ability of CSP from ionic to covalent systems³⁹.

As aforementioned, the framework of approach is flexible that we adopted OQMD and MatB respectively to train GN model and RAS, PSO, and BO for the optimization algorithm. Here, we take CaS for example, to compare the performance of RAS, PSO and BO on CSP with GN model trained on MatB. The characteristics of three OAs can be clearly seen in the evolution of ∆H on the iteration steps in Fig. 3a. The ∆H distributes randomly in energy scale (Y-axis in Fig. 3a) for RAS. While, PSO can quickly find the low-ΔH configurations (exploitation). But its problem is that it may be stuck in the local minimum as shown that most of the PSO-selected structures after 1500 steps are close to each other and located around the energy of local minimum, as shown by a sharp DOS (density of structures) at a local minimum. In contrast, BO is an algorithm that has a balance between exploitation and exploration, as shown by double peaks in DOS, indicating that it has a higher ability to jump out of one particular local minimum (exploration). In this case, GN(MatB)-RAS and GN(MatB)-BO find the correct GSS at the 2503th and 372th iteration step, respectively, while GN(MatB)-PSO cannot find correct GSS within 5000 steps. For GN(MatB)-BO, the GSS was found at 207th step (Fig. 3f) with a lattice constant of 6.50 Å and then the GN(MatB)-BO show ability to optimize the lattice constant to 5.77 Å as shown in Fig. 1g, close to 5.72 Å of DFT-calculated value.

**Fig. 3: The process and performance of GN-OA.**

The approaches of GN-RAS, GN-PSO, and GN-BO were then applied to CSP for 28 other compounds. The results are summarized in Table 1. It was observed that: (i) like the case shown for CaS, the accuracy of OA for CSP follow the sequence that BO > RAS > PSO, whether the GN is trained on OQMD or MatB; (ii) GN model trained on MatB generally show better accuracy for CSP than that trained on OQMD, whether RAS, PSO or BO was adopted. As a result, GN(MatB)-BO shows the best performance. The corresponding ΔH evolution of GN(MatB)-RAS, GN(MatB)-PSO, and GN(MatB)-BO for all 29 compounds are shown in Supplementary Figs. 3–10. For 25 compounds that GN(MatB)-BO can correctly predict, GN(MatB)-BO can predict their lattice constants and absolute energy differences (|ΔH_DFT − ΔH_GN|) with averaged 2.24% error and 20.8 meV/atom, respectively, to DFT-calculated values, as shown in Fig. 4.

**Fig. 4: The comparisons of GSS derived from GN-OA and DFT.**

In comparison to DFT-based approach

Accuracy and efficiency are two criteria for a CSP approach. It should be noted that the accuracy of the current GN-OA approach is inferior to that of the DFT-based approach in terms of non-100% prediction accuracy and the variation of lattice parameters. In fact, the GN model is trained based on the DFT-calculated data; thus, it cannot surpass the accuracy of DFT results. In compromise with the accuracy, GN(MatB)-BO finished those tasks with much higher efficiency than DFT-based CSP, as shown in Fig. 5. Here, we compare the computational cost of DFT-PSO and GN(MatB)-BO to predict 25 compounds and found that GN(MatB)-BO has a computational cost three orders of magnitude lower than DFT-based approach. DFT-PSO typically requires 60–80 DFT calculations (Si and CsPbI₃ as the example shown in Supplementary Fig. 11) to find the GSS, which is consistent with previous reports of 70 and 120 DFT structural optimizations to find the GSS of GaAs (eight atoms in the cell)¹⁰ and SiO₂ (six atoms in the cell)¹², respectively.

**Fig. 5: The comparison of computational cost.**

Discussion

To the best of our knowledge, this is the first study to establish a GN-OA framework for CSP, which contains three essential parts: (1) a database consisting of crystal structures and the formation energies; (2) a GN constructing the correlation model between crystal structure and formation energies; (3) an OA to search the crystal structures with minimum formation energy. These three parts are all fast-developing research frontiers and certainly not perfect at present; therefore, the limitations of the current GN-OA approach are also apparent, such as the failure to predict the GSS of some crystals and the deviation of predicted lattice parameters. There are two failure modes. One is the failure of GN model to put the GSS as the lowest ∆H, such as CdS (Supplementary Fig. 12), and the other is the failure of OA not visit the GSS with the lowest ∆H, such as GN-PSO for CaS (Supplementary Fig. 13). Meanwhile, their advantage is that any progress of these three aspects may help in improving the efficiency and accuracy of GN-OA approach.

In this study, we adopted and compared OQMD and MatB databases, mainly containing stable or metastable structures (global or local minimums in PES). However, during the structural searching process, most structures are unstable (away from the minimums). The addition of the energetic data of these unstable structures should help the model to capture the entire PES landscape, thereby improving the efficiency and accuracy of GN-OA approach. Notably, generating energy landscape on numerous unstable structures is a necessary step for generating ML potential⁴⁰. In principle, CSP based on ML potential should be more accurate; however, ML potential is generated on fixed types of elemental combinations [{v_i} is constant], for example of aluminum⁴⁰, while the GN model for CSP should be universal for all elements [{v_i} is a variable]. For constant {v_i}, 6000–10,000 DFT calculations are required to generate an applicable ML potential. It is an open question that how many DFT data are required to generate a reasonable GN model and how to combine existing DFT data trained for ML potential generation into GN model. This requires further investigation.

n this study, we adopted the framework of MEGNet¹⁹ as a crystal graph. Since the first development of crystal graph (CGCNN)¹⁸, many studies are being conducted to further improve the crystal graph, such as improved crystal graph convolutional neural network (iCGCNN)⁴¹, directional message passing neural network (DimeNet)⁴², atomictic line graph neural network (ALIGNN)^{19,43,44,45,46,47}, which was reviewed in a recent paper⁴⁸. The implementation of those crystal graphs in GN-OA framework or further development of crystal graphs may further improve the accuracy.

We show that BO algorithm when combined with GN model is superior to PSO and RAS, which are often combined with DFT for CSP. Notably, BO is also replaceable. An optimization algorithm that is compatible with black-box GN model needs further exploration.

A platform will be established to allow the users to combine their crystal representation, database, and structural searching approach to optimize GN-OA approach for CSP. In addition to the database, crystal graph, and optimization algorithm, opportunities are given to technical improvements, such as algorithm parallelization and optimization, which may also improve the accuracy and efficiency.

In summary, we constructed a flexible framework that used a graph network to establish the ML model between crystal structures and their formation enthalpies at the given database, and this model was then combined with an optimization algorithm for CSP. The framework was then applied to predict the crystal structures of 29 typical compounds. The comparative studies of multiple combinations of database, GN model, and optimization algorithm showed that GN model trained on MatB combined with Bayesian optimization structural searching [GN(MatB)-BO], although with less accuracy than DFT results, can predict crystal structures with computational cost three orders of magnitude less than DFT-based approaches. Meanwhile, the limitations of the current GN-OA approach are also apparent. In terms of methodology, several directions need further development, including crystal structure characterization, structural searching, and algorithm parallelization, to predict more complicated and unknown structures more efficiently. The current study may open a new avenue for data-driven crystal structural prediction without using the expensive DFT calculations during structural searching.

Data availability

All relevant data are included in this article and its Supplementary Information files.

Code availability

The code for GN-OA approach is available on http://www.comates.group/links?software=gn_oa. All GN-based results reported in this work can be reproduced by this code. The DFT-based results are produced by CALYPSO code.

References

Pauling, L. The principles determining the structure of complex ionic crystals. J. Am. Chem. Soc. 51, 1010–1026 (1929).
Article CAS Google Scholar
Goldschmidt, V. M. Die Gesetze der Krystallochemie. Naturwissenschaften 14, 477–485 (1926).
Article ADS CAS Google Scholar
Van Vechten, J. A. Quantum dielectric theory of electronegativity in covalent systems. I. Electronic dielectric constant. Phys. Rev. 182, 891–905 (1969).
Article ADS Google Scholar
PHILLIPS, J. C. Ionicity of the chemical bond in crystals. Rev. Mod. Phys. 42, 317–356 (1970).
Article ADS CAS Google Scholar
Pickard, C. J. & Needs, R. J. Ab initiorandom structure searching. J. Phys. Condens. Matter 23, 053201 (2011).
Article ADS Google Scholar
He, C. et al. Complex low energy tetrahedral polymorphs of group IV elements from first principles. Phys. Rev. Lett. 121, 175701 (2018).
Article ADS CAS Google Scholar
Pannetier, J., Bassas-Alsina, J., Rodriguez-Carvajal, J. & Caignaert, V. Prediction of crystal structures from crystal chemistry rules by simulated annealing. Nature 346, 343–345 (1990).
Article ADS CAS Google Scholar
Deaven, D. M. & Ho, K. M. Molecular geometry optimization with a genetic algorithm. Phys. Rev. Lett. 75, 288–291 (1995).
Article ADS CAS Google Scholar
Glass, C. W., Oganov, A. R. & Hansen, N. USPEX—Evolutionary crystal structure prediction. Comput. Phys. Commun. 175, 713–720 (2006).
Article ADS CAS Google Scholar
Trimarchi, G. & Zunger, A. Global space-group optimization problem: finding the stablest crystal structure without constraints. Phys. Rev. B 75, 104113 (2007).
Article ADS Google Scholar
Zhao, X. et al. Exploring the structural complexity of intermetallic compounds by an adaptive genetic algorithm. Phys. Rev. Lett. 112, 045502 (2014).
Article ADS CAS Google Scholar
Wang, Y., Lv, J., Zhu, L. & Ma, Y. Crystal structure prediction via particle-swarm optimization. Phys. Rev. B 82, 094116 (2010).
Article ADS Google Scholar
Peng, F. et al. Hydrogen clathrate structures in rare earth hydrides at high pressures: possible route to room-temperature superconductivity. Phys. Rev. Lett. 119, 107001 (2017).
Article ADS Google Scholar
Zhang, Y.-Y., Gao, W., Chen, S., Xiang, H. & Gong, X.-G. Inverse design of materials by multi-objective differential evolution. Comput. Mater. Sci. 98, 51–55 (2015).
Article CAS Google Scholar
Faber, F. A., Lindmaa, A., von Lilienfeld, O. A. & Armiento, R. Machine learning energies of 2 million elpasolite (A B C 2 D 6) crystals. Phys. Rev. Lett. 117, 135502 (2016).
Article ADS Google Scholar
Li, Z., Xu, Q., Sun, Q., Hou, Z. & Yin, W.-J. Thermodynamic stability landscape of halide double perovskites via high-throughput computing and machine learning. Adv. Funct. Mater. 29, 1807280 (2019).
Article Google Scholar
Bartel, C. J. et al. Physical descriptor for the Gibbs energy of inorganic crystalline solids and temperature-dependent materials chemistry. Nat. Commun. 9, 4168 (2018).
Article ADS Google Scholar
Xie, T. & Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 120, 145301 (2018).
Article ADS CAS Google Scholar
Chen, C., Ye, W., Zuo, Y., Zheng, C. & Ong, S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 31, 3564–3572 (2019).
Article CAS Google Scholar
Tsubaki, M. & Mizoguchi, T. Quantum deep field: data-driven wave function, electron density generation, and atomization energy prediction and extrapolation with machine learning. Phys. Rev. Lett. 125, 206401 (2020).
Article ADS CAS Google Scholar
Carbone, M. R., Topsakal, M., Lu, D. & Yoo, S. Machine-learning X-ray absorption spectra to quantitative accuracy. Phys. Rev. Lett. 124, 156401 (2020).
Article ADS CAS Google Scholar
Liu, Y.-H. & van Nieuwenburg, E. P. L. Discriminative cooperative networks for detecting phase transitions. Phys. Rev. Lett. 120, 176401 (2018).
Article ADS CAS Google Scholar
Jain, A. & Bligaard, T. Atomic-position independent descriptor for machine learning of material properties. Phys. Rev. B 98, 214112 (2018).
Article ADS Google Scholar
Ward, L. et al. Including crystal structure attributes in machine learning models of formation energies via Voronoi tessellations. Phys. Rev. B 96, 024104 (2017).
Article ADS Google Scholar
Ryan, K., Lengyel, J. & Shatruk, M. Crystal structure prediction via deep learning. J. Am. Chem. Soc. 140, 10158–10168 (2018).
Article Google Scholar
Oliynyk, A. O., Adutwum, L. A., Harynuk, J. J. & Mar, A. Classifying crystal structures of binary compounds AB through cluster resolution feature selection and support vector machine analysis. Chem. Mater. 28, 6672–6681 (2016).
Article CAS Google Scholar
Bartel, C. J. et al. A critical examination of compound stability predictions from machine-learned formation energies. Npj Comput. Mater. 6, 97 (2020).
Article ADS Google Scholar
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD). JOM 65, 1501–1509 (2013).
Article CAS Google Scholar
Dunn, A., Wang, Q., Ganose, A., Dopp, D. & Jain, A. Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm. Npj Comput. Mater. 6, 1–10 (2020).
Google Scholar
Battaglia, P. W. et al. Relational inductive biases, deep learning, and graph networks. Preprint at https://arxiv.org/abs/1806.01261 (2018).
Chen, C., Zuo, Y., Ye, W., Li, X. & Ong, S. P. Learning properties of ordered and disordered materials from multi-fidelity data. Nat. Comput. Sci. 1, 46–53 (2021).
Article Google Scholar
Bergerhoff, G., Hundt, R., Sievers, R. & Brown, I. D. The inorganic crystal structure data base. J. Chem. Inf. Model. 23, 66–69 (1983).
CAS Google Scholar
Hahn, T., Shmueli, U. & Wilson, A. International tables for crystallography. (D. Reidel Pub. Co.; Sold and distributed in the USA and Canada by Kluwer Academic Publishers Group, 1984).
Bergstra, J. S., Bardenet, R., Bengio, Y. & Kégl, B. In Advances in Neural Information Processing Systems 24 (eds. Shawe-Taylor, J. et al.) 2546–2554 (Curran Associates, Inc., 2011).
Seko, A. et al. Prediction of low-thermal-conductivity compounds with first-principles anharmonic lattice-dynamics calculations and Bayesian optimization. Phys. Rev. Lett. 115, 205901 (2015).
Article ADS Google Scholar
Yamashita, T. et al. Crystal structure prediction accelerated by Bayesian optimization. Phys. Rev. Mater. 2, 013803 (2018).
Article Google Scholar
Bergstra, J., Yamins, D. & Cox, D. Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures. In International Conference on Machine Learning 115–123 (PMLR, 2013).
Kennedy, J. & Eberhart, R. Particle swarm optimization. In Proc. ICNN’95 - International Conference on Neural Networks Vol. 4, 1942–1948 (IEEE, 1995).
Ghiringhelli, L. M., Vybiral, J., Levchenko, S. V., Draxl, C. & Scheffler, M. Big data of materials science: critical role of the descriptor. Phys. Rev. Lett. 114, 105503 (2015).
Article ADS Google Scholar
Smith, J. S. et al. Automated discovery of a robust interatomic potential for aluminum. Nat. Commun. 12, 1257 (2021).
Article ADS CAS Google Scholar
Park, C. W. & Wolverton, C. Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery. Phys. Rev. Mater. 4, 063801 (2020).
Article CAS Google Scholar
Klicpera, J., Groß, J. & Günnemann, S. Directional message passing for molecular graphs. Preprint at https://arxiv.org/abs/2003.03123 (2020).
Schütt, K. T., Sauceda, H. E., Kindermans, P.-J., Tkatchenko, A. & Müller, K.-R. SchNet – A deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018).
Article ADS Google Scholar
Klicpera, J., Becker, F. & Günnemann, S. GemNet: universal directional graph neural networks for molecules. In conference paper at 35th Conference on Neural Information Processing Systems (NeurIPS 2021) (2021).
Shuaibi, M. et al. Rotation invariant graph neural networks using spin convolutions. Preprint at https://arxiv.org/abs/2106.09575 (2021).
Godwin, J. et al. Very deep graph neural networks via noise regularisation. Preprint at https://arxiv.org/abs/2106.07971 (2021).
Chanussot, L. et al. Open catalyst 2020 (OC20) dataset and community challenges. ACS Catal. 11, 6059–6072 (2021).
Article CAS Google Scholar
Choudhary, K. et al. Recent advances and applications of deep learning methods in materials science. Preprint at https://arxiv.org/abs/2110.14820 (2021).

Download references

Acknowledgements

W.Y. acknowledged funding support by National Key Research and Development Program of China (Grant No. 2020YFB1506400), National Natural Science Foundation of China (Grant No. 11974257), Jiangsu Distinguished Young Talent Funding (Grant No. BK20200003), Yunnan Provincial Key S&T Program (Grant No. 202002AB080001-1), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD). DFT calculations were carried out at the National Supercomputer Center in Tianjin [TianHe-1(A)].

Author information

Authors and Affiliations

College of Energy, Soochow Institute for Energy and Materials InnovationS (SIEMIS), and Jiangsu Provincial Key Laboratory for Advanced Carbon Materials and Wearable Energy Technologies, Soochow University, Suzhou, 215006, China
Guanjian Cheng & Wan-Jian Yin
Shanghai Qi Zhi Institute, Shanghai, 200030, China
Guanjian Cheng, Xin-Gao Gong & Wan-Jian Yin
Key Laboratory for Computational Physical Sciences (MOE), Institute of Computational Physical Sciences, Fudan University, Shanghai, 200438, China
Xin-Gao Gong
Light Industry Institute of Electrochemical Power Sources, Soochow University, Suzhou, 215006, China
Wan-Jian Yin

Authors

Guanjian Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Xin-Gao Gong
View author publications
You can also search for this author in PubMed Google Scholar
Wan-Jian Yin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.Y. conceived the idea. G.C. wrote the code and conducted the calculations. X.G., G.C., and W.Y. discussed the results and wrote the manuscript.

Corresponding author

Correspondence to Wan-Jian Yin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Patrick Riley, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cheng, G., Gong, XG. & Yin, WJ. Crystal structure prediction by combining graph network and optimization algorithm. Nat Commun 13, 1492 (2022). https://doi.org/10.1038/s41467-022-29241-4

Download citation

Received: 15 August 2021
Accepted: 07 March 2022
Published: 21 March 2022
DOI: https://doi.org/10.1038/s41467-022-29241-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.