Materials Prediction via Classification Learning

Balachandran, Prasanna V.; Theiler, James; Rondinelli, James M.; Lookman, Turab

doi:10.1038/srep13285

Download PDF

Article
Open access
Published: 25 August 2015

Materials Prediction via Classification Learning

Prasanna V. Balachandran¹,
James Theiler²,
James M. Rondinelli³ &
…
Turab Lookman¹

Scientific Reports volume 5, Article number: 13285 (2015) Cite this article

Subjects

Abstract

In the paradigm of materials informatics for accelerated materials discovery, the choice of feature set (i.e. attributes that capture aspects of structure, chemistry and/or bonding) is critical. Ideally, the feature sets should provide a simple physical basis for extracting major structural and chemical trends and furthermore, enable rapid predictions of new material chemistries. Orbital radii calculated from model pseudopotential fits to spectroscopic data are potential candidates to satisfy these conditions. Although these radii (and their linear combinations) have been utilized in the past, their functional forms are largely justified with heuristic arguments. Here we show that machine learning methods naturally uncover the functional forms that mimic most frequently used features in the literature, thereby providing a mathematical basis for feature set construction without a priori assumptions. We apply these principles to study two broad materials classes: (i) wide band gap AB compounds and (ii) rare earth-main group RM intermetallics. The AB compounds serve as a prototypical example to demonstrate our approach, whereas the RM intermetallics show how these concepts can be used to rapidly design new ductile materials. Our predictive models indicate that ScCo, ScIr and YCd should be ductile, whereas each was previously proposed to be brittle.

Recent advances and applications of machine learning in solid-state materials science

Article Open access 08 August 2019

Jonathan Schmidt, Mário R. G. Marques, … Miguel A. L. Marques

Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm

Article Open access 15 September 2020

Alexander Dunn, Qi Wang, … Anubhav Jain

Identifying domains of applicability of machine learning models for materials science

Article Open access 04 September 2020

Christopher Sutton, Mario Boley, … Matthias Scheffler

Introduction

The advent of pseudopotential theory for solids marked a key period in the computation and application of orbital radii concepts. Simon and Bloch first utilized model potentials in pseudopotential calculations and showed their relevance in reproducing properties of solids that depend explicitly on valence energy, such as ionization energies, equilibrium geometries, force constants and dissociation energies^1,2. They defined orbital radii by finding the classical turning points or central balance points from a model nonlocal hard-core pseudopotential [V(r)], where the sum of the repulsive centrifugal and “Pauli forces” exactly cancel out the attractive Coulombic force exerted by the nucleus^3,4. The nonlocal nature of the pseudopotential gives an angular momentum (l) dependence, yielding a radius (in units of angstroms) at V(r) = 0 for the valence electrons in s-, p-, d- and f-like orbitals for an atom or a free ion. There are different forms of V(r) and their exact derivations are well documented in the literature^1,2,5,6,7.

Our interest in orbital radii for materials informatics, i.e., the growing field focused on using information science methods to understand condensed matter systems, is motivated by several reasons. First, orbital radii are based on model pseudopotential [V(r)] fits to spectroscopic data, which gives a straightforward reference frame for extracting relative chemical and structural trends⁸. Second, they are transferable from one compound to another. Third, for a given atom (or an ion), depending on its electronic configuration, radii for s-, p-, d-, or f-like orbitals exist unlike the empirical scales^9,10 where atomic (or ionic) radii are a single-valued quantity. As a result, this relatively simple physical basis provides more flexibility in exploring and understanding electronic and/or atomic structure—property relationships in complex materials. For example, the orbital radii are able to capture the distance to maximum radial charge density, which determines the interatomic distances and angles in crystals^5,9.

Although the orbital radii are not directly measured from experiments and unsuitable for full electronic structure calculations, it has been hypothesized that linear combinations of localized s-, p-, d- and f-like orbital radii (or their reciprocal) give a qualitative measure of bond covalency and orbital electronegativity^5,8. The underlying rationale for considering linear combinations of orbital radii is rooted deeply in the valence bond theory of solids proposed by Pauling¹¹, which provides a conceptual framework for discussing crystal structures and their energetic trends in terms of overlap or hybridization of atomic orbitals³. Burdett and Price also used the extended Hckel theory to argue the linkage between electronic energies and linear combinations of reciprocal orbital radii that capture the orbital interactions between ss, pp, sp and ps contributions to the stabilization energy¹².

Bloch and Simon were among the earliest proponents of utilizing orbital radii for studying structural trends in solids³. They demonstrated that in s-p bonded elemental solids, the fractional difference between the maxima of s- and p-radial functions could serve as an “index” (i.e. feature) that separates covalent, fcc, hcp and bcc structures³. Later, St. John and Bloch extended the approach to classify crystal structures of wide band gap octet AB compounds⁴, following the original work of Mooser and Pearson¹³ that uses average principal quantum numbers and Pauling scale electronegativity differences¹⁴. One of the key contributions from St. John and Bloch is the suggestion of a functional form for linearly combining the s- and p-like orbital character of the radii, which led to the nomenclature⁵ of r_σ and r_π as the two principal feature sets for AB compound classification:

While r_σ (equation 1) captures the electronegativity difference between A- and B-atoms, the feature r_π (equation 2) was identified to represent the directional nature of bonding through hybridization between s- and p-orbitals of A- and B-atoms. Since then Phillips^15,16,17, Chelikowsky⁵, Littlewood¹⁸, Zunger^19,20, Cohen²¹, Andreoni²², Burdett¹², Rabe²³ and others^24,25,26,27 have independently applied these orbital radii concepts, including the r_σ and r_π form without post-facto reformulation, to classify structures and properties of many other binary and multicomponent crystalline compounds with varying degrees of accuracy and success. As the understanding evolved, modifications to the original form of r_σ and r_π were suggested, largely based on intuition from domain knowledge or physical understanding of the known materials.

Without such understanding, however, the successful formulation of predictive theories for new compounds becomes a non-trivial endeavor. We therefore pose the question: Can one improve the predictive capabilities of the orbital radii by uncovering the natural form in which they should be combined? Plainly, is there a simple and tractable mathematical formalism that allows one to automatically construct linear combinations of orbital radii from data, irrespective of the chemical and/or structural complexity of the material? Are there other linear combinations of orbital radii, in addition to the r_σ and r_π, for AB compounds? We are not the first to raise these questions^12,22; there are many debates in the literature about the validity of r_σ and r_π and suggestions to explore new functional forms that complement r_σ and r_π²².

In this work, we use machine learning (ML) methods to show that linear combinations of orbital radii can be constructed directly from the data without requiring domain knowledge of a materials class. The wide band gap AB compounds form an ideal starting point to address our main question, because the structures of these materials are well-characterized. We show in the AB family that the functional form of r_σ and r_π can be reproduced solely from data using ML and furthermore, new linear combinations are uncovered which give more versatility to the orbital radii approach for extracting structure-property relationships. Next, we apply the method to the more complex RM intermetallics (R and M are rare-earth and main group or transition metal element, respectively), where at present r_σ and r_π are poorly defined or do not exist. We classify their mechanical properties into two groups, namely ductile and brittle and uncover orbital radii-based ‘selection-rules’ that give excellent agreement with computationally intensive density functional theory (DFT) calculations. We demonstrate the predictive power of our methods by identifying new and yet to be synthesized ductile RM intermetallic compounds: We predict ScIr to be ductile, although it was originally proposed as brittle in the literature. Results from ML, electronic structure calculations and Zener anisotropy ratio data support this prediction. Furthermore, we predict using ML that ScCo and YCd are potential ductile materials albeit our electronic structure calculations and Zener anisotropy ratio data are not as conclusive. These results have major implications beyond searching for materials with improved mechanical properties; the unbiased formulation of orbital radii for classification and property prediction can be readily applied to the electronic, magnetic and optical function of more complex materials where only limited models of the properties are available, enabling an expansion of new materials for experimentation.

Results

Data sets

AB compounds

The data set for this work is made of 55 AB compounds taken from the recent work of Saad et al.²⁵. Each AB compound is labeled uniquely in one of three crystal structure forms: rocksalt (R), wurtzite (W), or zinc blende (Z) as shown in Fig. 1. There were a total of 28, 8 and 19 AB compounds in R-, W- and Z-structures, respectively. The R-structure has a 6-fold nearest neighbor coordination environment, whereas W- and Z-structures have 4-fold coordination. The stacking sequence and bond distortions in the AB₄ tetrahedra distinguish the W- from Z-structures. For instance, in W- and Z-structures we observe a stacking sequence of —abab— (Fig. 1b) and —abcabc— (Fig. 1c), respectively. The AB₄ tetrahedron in the Z-structure are ideal, i.e., each has four equidistant A–B bond lengths, whereas in W-structure only three of the four A—B bonds are of equal length owing to the polar symmetry. Note that in Saad et al., six (out of 55) AB compounds were originally labeled as dual structures, where the ground state structures were designated as borderline due to close energetic competition. We performed DFT calculations on those six compounds and re-labeled them uniquely to a single structure-type based on the phase with the lowest energy (amongst the three R, W and Z-structures). See Supplementary Note 1 and Supplementary Table 1 for the total energies.

We use two sets of orbital radii scales for the feature set: Chelikowsky (C)²⁵ and Waber-Cromer (WC)²⁸. Orbital radii from the Chelikowsky scale are defined by classical turning points [V(r_l) = 0] of hard core nonlocal pseudopotentials total energies from DFT in the local-density approximation (LDA). The Waber and Cromer scale, on the other hand, uses the self-consistent Dirac-Slater eigenfunctions and the orbital radii correspond to the principal maxima in the charge-density distribution function for a specific orbital of an atom. This orbital radii scale is for neutral elements.

In the Chelikowsky scale, the p-orbital radii (r_p) for an atom is always greater than that of the s-orbital radii (r_s), however, this it is not the case in the Waber-Cromer scale. For instance, consider the Cd atom: Waber-Cromer tabulated the values of 4p, 4d and 5s orbital radii as 0.445, 0.505 and 1.18 Å, respectively; in this formalism, r_4p < r_5s. However, in the Chelikowsky scale r_s and r_p for the Cd atom are 0.67 and 1.26 Å, respectively; here, r_p > r_s. In the case of Si atom, Waber-Cromer scale lists the 3s and 3p orbital radii as 0.904 and 1.068 Å, respectively and the Chelikowsky scale has r_s and r_p as 0.66 and 0.88 Å, respectively. We maintain consistency between the two scales in our data set by tabulating the smaller orbital radii first, followed by the larger one (irrespective of their principal quantum number). In both scales, the pseudopotential model, [V(r)], replicates only the valence electronic states.

We also follow the conventional notation of r_s and r_p for the Chelikowsy scale, whereas we revise our notation to r_i and r_o for the Waber-Cromer scale; r_i and r_o stand for radius of inner and outermost orbitals, respectively. We only consider the radii of s- and p-orbitals and neglect the d-orbital in both scales. Although the Chelikowsky’s scale has been used before for classifying the crystal structures of AB compounds²⁵, this is the first time that the Waber-Cromer’s scale is explored for these purposes. One of the main advantages of the Waber-Cromer scale is that the orbital radii data have been tabulated for the majority of elements in the periodic table—including lanthanides and actinides.

RM intermetallics

Our training data set for classification learning comprises 30 RM compounds, with mechanical properties experimentally measured through tensile and impact tests (19 are ductile and 11 are brittle)²⁹. We construct a data set for ML using the same label as reported by Gschneidner et al.²⁹ and characterize each RM compound using only the Waber-Cromer orbital radii scale. We were are unable to explore the Chelikowsky scale for this problem because it does not contain radii for the rare-earth elements. For the R-atom, we used the s-, p-, d- and f-orbital radii; whereas for the M-atom we used s-, p- and d-orbital radii. We also constructed an additional virtual or unexplored set of 113 RM compounds. Our objective is to build a classification model on the training set and, in turn, use it to predict the mechanical properties (ductile or brittle) of unexplored 113 compounds. Note that all 143 compounds are assumed to be fully stoichiometric and to have the cubic B2 CsCl crystal structure-type (see Fig. 2).

Crystal Structure Classification of Wide Band Gap AB compounds

In Fig. 3, the bi-variate statistical correlation map between the s-(i-) and p-(o-) orbital radii of A- and B-atoms for the two scales are shown. Negative and positive signs indicate inverse and direct correlation, respectively. Figure 3 uncovers key similarities and differences between the two scales. In the Chelikowsky scale, s- and p-orbital radii for both A- and B-atoms correlate strongly. On the other hand, in the Waber-Cromer scale only and for the B-atoms show strong correlation. In the case of the A-atom, we find that only correlates strongly with both and . This is one of the important differences between the Waber-Cromer and Chelikowsky scales.

A scatter plot between (from Waber-Cromer scale) and (from Chelikowsky scale) is shown in Fig. 4, where the underlying periodic trends are apparent (identified based on prior knowledge about these elements): (i) Be, Li, Na, K, Rb, Mg, Ca and Sr (alkali and alkaline earth elements) (ii) Zn and Cd (transition series elements) and (iii) B, Si, Ga, Al and In (post-transition series elements). We observe piecewise linear relationships within each elemental series, but collectively the correlation coefficient is small. As noted before, the orbital radii from the Chelikowsky scale has its origin in nonlocal pseudopotentials from DFT within the LDA, while the Waber-Cromer scale is the principal maxima in the charge density from the Dirac-Slater wavefunction model. We attribute the absence of correlation between and to the two different V(r) pseudopotential models.

After correlation analysis, we constructed two separate data sets (each dataset is now a 54 × 4 matrix), one for each orbital radii scale. Each data set was then subjected to principal component analysis (PCA), where we first autoscaled the data (i.e. each column vector was normalized to have zero mean and unit variance) and calculated the sample covariance matrix (Σ). Eigenvalue decomposition of the Σ-matrix produces two new matrices: eigenvalues and eigenvectors. Since we have a 54 × 4 data matrix, the eigenvalue decomposition procedure produces four eigenvalues (in the form of a diagonal matrix) and four eigenvectors. Each eigenvector [also referred to as principal component (PC) loading] is a linear combination of the weighted contribution of the r_s (r_i) and r_p (r_o) orbital radii of A- and B-atoms and the eigenvalues indicate the % variance captured by the corresponding eigenvectors. We then project the autoscaled data points onto the loadings, which are called the PC scores. Since there are four eigenvectors, we have four PC scores. The difference seen in Fig. 3 between the two scales is manifested in the %-variance data given by the eigenvalues of each PC (Table 1), which suggests that the data structures in both scales are quite different.

Table 1 Percentage variance explained (in descending order) by the eigenvalues of Chelikowsky and Waber-Cromer orbital radii scales.

Full size table

A rule of thumb is to plot % variance as a function of number of PC’s and locate the elbow in the curve³⁰. The elbow generally determines the number of PC’s to be considered for further analysis. In this paper, we consider all PC’s for classification learning.

The functional forms for the s- and p-orbital radii combinations obtained from the Chelikowsky scale eigenvectors are given in Table 2. Notice the striking similarity in the functional form of C-PC1 and . Similarly, C-PC3 and C-PC4 have the functional form of , albeit with large difference in the coefficients or weights; however for the given data set, C-PC3 and C-PC4 capture only small amount of variance in the data set (see Table 1).

Table 2 Linear combinations of orbital radii from PCA for Chelikowsky (C) and Waber-Cromer (WC) scales.

Full size table

The C-PC2 is an interesting feature as it captures the orbital radii sum (r_s + r_p) of the A- and B-atoms, yet its functional form has hitherto not been explored. Furthermore, it accounts for 44.38% of the variance in the data set (Table 1). Normally, the difference between r_s and r_p is often used to infer the degree of hybridization between s- and p-orbitals for an atom; the smaller the difference, the greater the potential for sp-hybridization and vice versa. However, the functional form of C-PC2 corresponds to an orbital radii sum. We attribute the physical picture of [r_s + r_p] to describe the core radius of an atom implying that C-PC2 captures the core radii sum of the atom set (A and B). A scatter plot between r_s + r_p for the A-atom and its corresponding Shannon ionic radius¹⁰ for the (monovalent and divalent) cations and anions in 6-fold coordination shows a linear relationship (see Supplementary Note 2 and Supplementary Figure 1). As a result, we infer that C-PC2 captures the physics describing the relative close-packed tendencies of various AB compounds.

In Fig. 5a, the correlation plot of four PC scores along with the r_σ and r_π features for the Chelikowsky scale is shown. The correlation coefficient () between r_σ and r_π features is found to be 0.63. As expected, we find that r_σ correlates strongly with C-PC1 ; interestingly, r_π is found to correlate with C-PC2 . We remark that the sign of the PCs are arbitrary; thus the sign of the correlation (positive, direct or negative, inverse) is not meaningful.

Functional forms for the four eigenvectors from the Waber-Cromer scale are given in Table 2. In terms of trends, as shown in Fig. 5b, we find that WC-PC2 correlates directly with , whereas WC-PC3 is found to correlate inversely with . We use an asterisk symbol (*) in r_σ and r_π for the Waber-Cromer scale to differentiate it from that of the Chelikowsky scale. As noted earlier, in the Waber-Cromer scale we use the inner (r_i) and outermost (r_o) orbital radii, whose principal quantum number can be different. Originally, the r_σ and r_π were conceived for s- and p-orbital radii that resemble the pseudopotential model of the Chelikowsky scale.

We now classify the crystal structures of AB compounds to one of the three R, W and Z using the PC scores from both scales and compare them with the canonical r_σ and r_π descriptors. We utilize decision trees and support vector machine (SVM) algorithms for ML (see Methods section). The performance of classifiers were assessed using the full training set and leave-one-out cross validation (LOO-CV), which involves training the ML algorithm on all but one data point and then applying that classifier to the left-out data point. The process is repeated until each compound is left out exactly once and its crystal structure or properties predicted from the knowledge of the remaining compounds. LOO-CV is a common procedure used in ML when the training data sets are smaller in size (such as those in this paper). The results from classification learning are given in Table 3. Notice the similarity in the classification accuracies between PCA and [r_σ, r_π] features in both the scales. The largest difference in terms of accuracy is found to be ~5.5% between [r_σ, r_π] and PCA in the Chelikowsky scale with SVM. In all other cases, the performances are similar.

Table 3 Accuracy (in terms of % correctly classified) of AB compounds using both orbital radii scales.

Full size table

In Table 4, we list the most common misclassified AB compounds from the full training set for both scales. We performed DFT calculations (see Methods section) on these five misclassified compounds and the results are also tabulated in Table 4. Except CdSe, the lowest energy structures for the rest of the AB compounds agree well with those of Saud et al. We identify CdSe to have the zinc blende (Z) structure from our DFT calculations, although it was originally labeled to be wurtzite (W) by Saud et al. Note that the energy difference between Z- and W-structures is as small as 2 meV/formula unit (f.u). The feature set from the Waber-Cromer scale and PC-scores of both scales using decision trees classify CdSe as Z. One of the reasons that AB compounds with A = Cd show difficulty in our classification could be attributed to the omission of d-orbital radii in our feature set. Atom- and orbital-projected density of states (PDOS) spectra for CdSe in W and Z-structures show that the top of the valence band is comprised of 4d-states, indicating its dominant role in chemical bonding (see Supplementary Note 3 and Supplementary Figure 2). These 4d-states reduce the band gap, thereby increasing the covalent character or bond polarizability, which is not readily captured by the ML model from the given training set. To capture the effect of Cd 4d-states on the band gap, we considered MgTe and CdTe. We calculated the band gap of MgTe in W-structure to be 2.4 eV. For the exact same crystal structure (without relaxing the internal coordinates and unit cell geometry), we substituted Cd-atom in the place of Mg-atom and re-calculated the band gap to be 0.8 eV.

Table 4 Summary of misclassified AB compounds by using the full training set and their lowest energy structures from our DFT-PBEsol calculations using ultrasoft pseudopotentials.

Full size table

The compound MgTe was also misclassified in our classification learning. Note that Saud et al. also reported difficulty in classifying it, even though they used an exhaustive list of feature sets (9 features) and explored a completely different set of ML methods. Our DFT calculations reveal that the energy difference between W–Z and W–R structures in MgTe are of the order of 3.8 and 2.6 meV/f.u., respectively, indicating that MgTe lies on the borderline, making it difficult for our ML methods to accurately assess its relative position in the phase space.

Why is MgTe more difficult to classify using ML methods? In our data set, there are a total of six AB compounds with Te-atom in the B-site. Among the six, three of them (BeTe, CdTe and ZnTe) have Z, two of them (CaTe and SrTe) have R and MgTe has W ground state structure. We now examine the electronic structure of MgTe with CaTe for the three crystal structures (Fig. 6). CaTe is an ideal choice, because of the similarity in the valence electron configuration between the Group 2 elements: Mg with ([Ne]3s²) and Ca with ([Ar]4s²). The ground state for CaTe was determined to be the R-structure, in agreement with Saud et al.; the energy difference between R–W and R–Z is 302 and 621 meV/f.u., respectively.

Noticeable differences in the partial-densities of states (PDOS) are found in the bandwidth of Te 5p-orbitals for the two compounds, where they are relatively narrow in CaTe compared to the broader spectral features in MgTe. In the R- (Fig. 6a,d) and Z-structures (Fig. 6c,f), where all Ca-Te and Mg-Te bond lengths are equidistant, the electronic states of 5p_x, 5p_y and 5p_z orbitals overlap. Key differences occur in the W-structure (Fig. 6b,e), where the 5p-orbitals splits into 5p_z and 5p_x,y (we use the notation 5p_x,y to denote the fact that 5p_x and 5p_y overlap). Furthermore, the difference in the centers of masses between 5p_z and 5p_x,y are also more pronounced in the two compounds for the W-structure. These subtle changes in the electronic structure affect the energetics, thereby favoring one structure over the other. We conclude that the orbital radii scales (from both Chelikowsky and Waber-Cromer) are probably insensitive to the relative energetic competition (and orbital splitting) of Te 5p_x, 5p_y and 5p_z-states seen in MgTe relative to the other five ATe compounds, which eventually favors the R- or Z-structure. We conjecture that this problem could be potentially alleviated by adding more AB compounds that behave similar to MgTe, but remains to be explored further.

To summarize this section on AB compounds, we have shown that PCA can be used to construct linear combinations of orbital radii. The classification performance of these linear combinations from PCA with respect to [r_σ, r_π] is an important outcome of our work with potential implications for ML beyond binary wide band gap AB compounds. It is also important to recognize that orbital radii can themselves serve as the data matrix for ML (without the need for considering the linear combinations). One of the problems with such a feature set is that these orbital radii could show a high degree of statistical correlation (as seen in Fig. 3), indicating redundancy of information. PCA removes the correlation (redundancy) by finding linear combinations of tightly connected orbital radii and the PCs (eigenvectors) are orthogonal to one another (see Fig. 5). The orthogonality condition ensures that the features are independent (under the assumption that the physical process is governed by Gaussian or Normal distribution), which in turn allows us to explore a broad range of ML methods without compromising the interpretability or performance.

Mechanical Properties Classification of Intermetallic RM compounds

We now classify the mechanical properties of intermetallic RM compounds, where R and M represent chemistries of rare-earths and main group or transition metal elements, respectively. Unlike the AB compounds (discussed earlier) that have wide band gap and no partially filled d- or f-like orbitals in their valence electron configuration, these RM compounds are metallic with delocalized electrons; the d- and f-like orbitals control the electronic and magnetic states. Furthermore, these B2-RM intermetallics have unique mechanical properties; normally, intermetallic compounds are brittle (i.e., no plastic deformation observed in the stress-strain curve); however, Gschneidner et al.³¹ discovered a family of RM intermetallics that were experimentally found to show high ductility and high fracture-toughness at room temperature. One of the intriguing aspects of these materials is that not all RM intermetallics in B2 structure-type are ductile; some of them are also brittle, indicating a delicate balance in the structure-chemistry-property relationships.

Previous plane-wave pseudopotential-based DFT calculations²⁹ have shown the relative importance of the electronic states of M-atoms near the Fermi level (E_F) to classify the ductile and brittle mechanical behavior. From atom- and orbital-PDOS, it was found that in ductile RM compounds there are no bands of p- or d-orbitals of the M-atom that exhibit directional (anisotropic) character near the E_F. It was concluded that a given RM is expected to be ductile when the valence electronic states of M-atoms are primarily s-like with d-bands located at 1 eV or more below the E_F. The objective of this work is to use orbital radii scales, construct their linear combinations and apply ML methods to uncover classification rules that separate ductile and brittle RM intermetallics. We ask the following questions: Could orbital radii and ML methods capture the physics described by DFT calculations with substantially less computational overhead and complexity? Can this undertanding be used to predict new and previously unexplored ductile RM intermetallics for experimentation?

In Fig. 7, the correlation plot of the Waber-Cromer orbital radii for the 143 RM compounds is shown. As noted before we do not use the Chelikowsky scale for this problem, because the radii are not given for rare-earth elements. Notice the strong statistical correlation seen in the orbital radii of R- and M-elements, similar to Fig. 3 for AB compounds. For the R-atom, and show direct or positive correlation, whereas the and show inverse or negative correlation. In the case of the M-atom, the and show strong positive correlation and are not linearly related to . Since strong statistical correlation implies redundancy of information, we also apply PCA to construct their linear combinations. As a result, we built two separate data sets for classification learning: one based on the raw orbital radii data alone and in the other, we performed PCA to construct linear combinations of orbital radii. Unlike the AB compounds, for which we knew a priori about r_σ and r_π, there are no such linear combinations for RM compounds in the literature. We construct these linear combinations for the first time from PCA.

In Table 5, the linear combination of orbital radii (PC’s) and % variance explained for each of those combinations are given. The first four PC’s together capture ~94% variance in the data set and we focus our attention there. Notice that RM-PC1 and PC3 contain linear combinations of orbital radii of only R-elements. In RM-PC1, the effect of and orbital radii is pronounced, whereas in RM-PC3 dominates; in both RM-PC1 and PC3, the effect of is non-trivial; it is unlikely that the functional form could have been surmised. On the other hand, RM-PC2 and PC4 contain linear combinations of orbital radii of only M-elements. More specifically, RM-PC2 captures the contributions from and and RM-PC4 captures the role of orbital radii. From Table 5, we infer that in the given data set there is no mixing of orbital radii of R and M elements. We also note that the family of B2 intermetallics extends beyond the RM chemistries considered in this work (e.g., NiTi). When one constructs a comprehensive data set with all known material B2 structure-types (including the RM compounds), then one may be able to discover some orbital mixing.

Table 5 Linear combinations of orbital radii from PCA for the RM intermetallics using Waber-Cromer scale and % variance explained by each of those linear combinations.

Full size table

After correlation analysis and PCA, we now focus our attention on classifying the ductile and brittle mechanical properties of the 30 known RM intermetallics. We utilize decision trees and SVM for classification learning. With decision trees, we obtain classification accuracies of 86.7% and 96.7% based on LOO-CV and full training set, respectively, with the raw orbital radii scale (without any linear combinations). On the other hand, we attain marginally better classification accuracy of 90% from LOO-CV with the linear combinations of orbital radii from PCA. The performance of SVMs (for various combinations of feature sets) were comparable to that of the decision trees, where we attained classification accuracies of 86.7 and 93.3% based on LOO-CV and full training set, respectively, with the raw orbital radii scale. Similarly, with the linear combinations from PCA we obtained accuracies of 86.7% and 90% based on LOO-CV and full training set, respectively. In Fig. 8a,b, the decision trees from raw orbital radii scale and PCA, respectively, are shown.

Interestingly, both decision trees identify features associated with only M-elements to be critical for ductile and brittle mechanical property classification—in excellent agreement with previous DFT calculations²⁹. The conditions and RM-PCA > 1.3419 in Fig. 8a,b, respectively, for the brittle property correspond to RM intermetallics that have Zn- and Mg-atoms in the M-site. This result is consistent with Gschneidner et al., who also suggested that Zn- and Mg-based compounds should exhibit similar mechanical behavior. Moreover, it was suggested that Cd-compounds are also expected to behave similar to Zn- and Mg-based compounds. However, we differ in our interpretation for the YCd compound; our analysis indicates some form of interaction between (or ) and orbitals and thus different deformation properties.

A classification accuracy of <100% indicates that there are misclassified instances (see Table 6). The ScIr compound was misclassified as ductile by both decision trees and also SVM; additionally, ScCo and YCd were also misclassified as ductile by the decision tree shown in Fig. 8b that uses the linear combinations obtained from PCA. SVM also identifies ScCo and YCd as misclassified instances.

Table 6 Summary of most commonly misclassified RM compounds from classification learning.

Full size table

We performed DFT calculations for the three misclassified compounds (ScIr, ScCo and YCd) and compared their electronic structures to compounds that were correctly identified by our classification learning model to be ductile (ScCu) and brittle (YZn). In Fig. 9, we show the orbital-PDOS for the d-, p-, and s-states of the M-atom for brittle (Fig. 9a–c), misclassified (Fig. 9d–f) and ductile (Fig. 9g–i) compounds. In brittle YZn, clearly, the Zn-p-states (Fig. 9b) dominate the E_F; the d-states (Fig. 9a) are fully occupied and are at ~7 eV below the E_F. On the other hand, in ductile ScCu the Cu-d-states (Fig. 9g) are located between 2–4 eV below the E_F and there are some contributions from p- (Fig. 9h) and s-states (Fig. 9i) at the E_F. These results agree well with those of Gschneidner et al., although that reference does not report the local s- and p-states.

In the case of misclassified ScCo, there are more Co-d states near the E_F (Fig. 9d) relative to the Cu-analogue. The center of mass of the Co 3d-band is also shifted more towards the E_F; the spectral signatures and peak positions of p- and the s-orbital states (in Fig. 9e,f, respectively) are comparable to that of the Cu-analogue. These electronic structure results, in conjunction with the rules given by Gschneidner et al., appear to indicate that ScCo is probably brittle. If this result holds true, then our classification model is genuinely misclassifying the ScCo compound.

With ScIr (Fig. 9d–f), the bandwidth associated with the Ir 5d-orbitals is found to be larger (due to its spatially extended nature) and its center of mass is shifted away from the E_F (relative to the Co-analogue). At the same time, in the 0 to 1 eV range (above E_F), the 5d-spectral features resemble those of the Co-analogue. Similarly, the Ir p- and s-states in ScIr resemble those of the ScCo electronic structure. Note that we do not account for electron-correlation or spin-orbit coupling in our calculations. From our own DFT calculations and applying the rules of Gschneidner et al., it is not obvious whether ScIr is ductile or brittle.

Unlike a phase transition (e.g. ferroelectricity, where the order parameter is polarization), which is accompanied by a change in symmetry of the order parameter, here we are not dealing with any similar phase change concomitant with ductile and/or brittle behavior. In our case, either an RM intermetallic is ductile or brittle and therefore, we cannot write a phenomenological free energy expansion in terms of the order parameter. Alternatively, we can employ the recently developed mesoscale dislocation mechanics³² that uses energy-based stability criterion to validate our misclassifications. According to this approach, the necessary and sufficient crystallographic conditions that must be satisfied by a B2 material for enhanced ductility are that should be the dominant slip direction, yet slip should also be possible with the formation of anti-phase boundaries (APBs) and APBs should have bistable existence on both and planes, respectively. The notation and follow the conventions of Miller indices for representing families of crystallographic equivalent directions and planes, respectively. Detailed investigation on a range of B2 materials revealed that elastically anisotropic B2 alloys do not satisfy the necessary and sufficient conditions; hence they are brittle. On the other hand, ductile B2 alloys are nearly isotropic. Sun and Johnson suggested the use of the Zener anisotropy ratio \ [A , where c₁₁, c₁₂ and c₄₄ are the elastic constants of the B2 cubic structure], as a qualitative figure of merit to classify the mechanical properties³². In ductile B2 systems (similar to those explored in this paper), the value of A should be close to one³². Wang et al.³³ have calculated the A-ratio for ScIr as 1.584 from DFT calculations (using GGA approximation), which is closer to that of the ductile ScCu (1.5) as opposed to that of the brittle YZn (1.985). Based on the A-ratio, we validate our findings for ScIr and predict it to be ductile. Therefore, we recommend experimental re-evaluation of its mechanical properties.

In the case of YCd, its electronic structure is found to be similar to that of YZn. Furthermore, the A-ratio for YCd is reported as 2.894, which is much larger than that of YZn (1.985) indicating that it is probably brittle. Unfortunately, Wang et al. do not report the A-ratio of ScCo compound for us to compare. We also note that Gschneidner et al. have not reported any quantitative mechanical experiments data on the three misclassified materials.

The reasons for observing misclassifications in our ML could be attributed to the following factors: (i) small size of the data set, (ii) insufficient examples of materials in our training set that resemble the mechanical properties of ScCo, ScIr and YCd compounds, and/or (iii) the Waber-Cromer orbital radii may not necessarily contain the physics representative of the complex electronic structures of ScCo, ScIr and YCd. Strictly, we recommend re-evaluation of the mechanical properties of ScCo, ScIr and YCd before considering the application of our results for designing new materials. Nonetheless, based on the fact that our ML approach identifies key electronic structure features that agree strongly with the state-of-the-art DFT calculations, gives us some confidence in applying the classification rules shown in Fig. 8 to predict the mechanical properties of the remaining unexplored 113 RM compounds. Note that our classification rules are applicable only for RM compounds with B2 crystal structure-type and we have assumed all 113 RM compounds to have B2 crystal structure-type, which may not be necessarily true. We predict 57 out of 113 compounds to be ductile from Fig. 8a that have M = Cu, Ni, Au, Ag, Pd, Pt and Ir. Similarly from Fig. 8b, 77 compounds are predicted to be ductile. As discussed earlier, in addition to M = Cu, Ni, Au, Ag, Pd, Pt and Ir, even M = Cd and Co compounds are predicted to be ductile from Fig. 8b.

Discussion

We have shown how to construct linear combinations of orbital radii solely from data without any a priori assumption about their functional form. We demonstrated our ML approach on two broad materials classes that include insulators (AB compounds) and metallic materials (RM intermetallics). We first tested the performance of PCA-derived linear combinations in classifying the crystal structures of AB compounds and found that they perform equally well as the canonical r_σ and r_π feature sets. We identified misclassified instances and provided a rationale for such occurrences (in the process we addressed some of the potential shortcomings of orbital radii scales and small data sets, in general).

We then extended the ML principles gleaned from relatively simple AB compounds, to more electronically complex RM intermetallics with the objective of classifying mechanical deformation behavior. We found excellent agreement between the classification rules extracted from ML and insights from DFT calculations. We identified that the behavior of Cd-based RM compounds could differ from that of the Zn- and Mg-based compounds. Previous DFT work predicted similar behavior among Cd-, Zn- and Mg-based compounds²⁹. Although the accuracies of our ML models were not 100%, we identified ScIr (originally considered as a brittle material) to be potentially ductile, requiring re-evaluation of its mechanical properties. We also predicted several new ductile RM intermetallics using classification learning. It was interesting to find that the orbital radii features of atom-R were not identified to be critical for classifying ductile from brittle RM compounds, particularly when our DFT+U calculations for the ductile DyCu compound showed evidence for high density of Dy 4f-states at the Fermi level (see Supplementary Note 3). This finding could have important implications for targeted materials design, because we now have an additional degree of freedom to dope other rare-earth elements at the R-site without affecting the mechanical properties of the parent alloy. Broadly, our informatics work shows how by leveraging available small data sets, ML methods and accurate electronic structure calculations it is possible to enable materials discovery; the concepts described here can be readily applied beyond the binary systems and properties explored in this paper.

We clarify the relevance of our informatics approach at a time when high-throughput first principles calculations, such as those found in The Materials Project³⁴ and AFLOWLIB³⁵, have attracted significant attention. We are fully aware that one could employ accurate ab initio calculations to evaluate the relative energetics of simple compounds, such as the wide band gap AB alloys explored in this work. In fact, we have shown in this paper that classification learning, indeed, achieves accuracy comparable to first principles based methods. Our work reinforces data-driven informatics based learning as an alternative paradigm to high-throughput first principles calculations for rapidly identifying new and previously unexplored material compositions with targeted properties. This crucial finding makes our approach highly attractive for the computational design of complex materials with defects, solid solutions and multicomponent alloys, whose structures and compositions are not reported in online repositories such as International Crystal Structure Database (ICSD)³⁶. Furthermore, we also utilize experimental data (where available) as the starting point for our classification learning, as demonstrated using RM intermetallics, which is a departure from the high-throughput literature where databases from first principles calculations are frequently mined. Clearly, in our approach the accuracies of the reported experimental data in the literature are critical towards assessing the success of our ML models (similar to the importance of the accuracies of density functional theory or other theories in computing properties within the high-throughput framework). We also acknowledge that our reliance on experimental data introduces additional complexities for ML in the form of handling small or tiny data sets and uncertainties in error measures, which we have shown can be addressed by employing physically meaningful feature selection/extraction methods, cross-validation schemes and ab initio calculations of misclassified instances. We believe that to achieve accelerated materials design and discovery, it is crucial to consider complexities and uncertainties associated with data coming from experiments and we note that the ML approaches described in this paper are a significant step in that direction.

Methods

Machine Learning (ML)

Classification is a machine learning (ML) approach that separates data into pre-defined classes. In classification, we have a data matrix X of dimension m × n. The rows of X are m chemical compositions and columns are n features. Each chemical composition in X belongs to a class specified by another categorial attribute (Y) called the class label, Y = (y₁, y₂, …, y_m) with p distinct labels . The source for class labels could be either experiments or high-fidelity computations. In this particular case, the classification problem is referred to as supervised learning, because the class label attribute (Y) is a part of the data set. A subtlety to this description is that the column vector Y could also represent a numeric attribute, in which case the supervised learning is referred to as regression or prediction. The objective of supervised-ML is to find a function, that maps features (matrix X) onto Y. The mapping functions determine the decision boundaries that help separate one class (y_i) from the other(s). Our interest in supervised learning is motivated from earlier studies^{37,38,39,40,41}, which showed that ML can be used for predicting new materials with specific functionalities in an accelerated manner. In this work, we use orbital radii as features to which ML is applied.

The three ML methods we use are principal component analysis (PCA)³⁰, decision trees⁴² and support vector machines (SVM)⁴³. PCA is used for constructing the linear combinations of orbital radii; decision trees and SVM are used for supervised classification learning. These classification ML methods determine the functional relationship, . While decision trees partition the data in a linear fashion using vertical and horizontal lines, SVM is a non-linear method. We use the R-package for PCA⁴⁴. Decision trees are performed using the J48 algorithm⁴⁵ as implemented in the Weka platform⁴⁶ and with the default values of the hyper-parameters. SVMs are performed using the RBF-kernel as implemented in the scikit-learn python module⁴⁷ and the hyper-parameters are optimized through cross-validation. Regarding the choice of kernels for SVM’s, the RBF-kernel is standard practice relative to other popular kernels (such as linear and polynomial) and has the advantage that it is “universally consistent”, which means that with enough data and appropriate choice of kernel hyper-parameters, the SVM RBF-kernel can find the Bayes optimal classifier for any distribution⁴⁸. This is not to say that it will be the optimal classifier all the time, the “no-free-lunch” theorem prevents one from making such a statement⁴⁹.

Classification accuracies are calculated as the ratio of number of compounds correctly classified to the total number of compounds that are used for training. We trained the decision tree and SVM algorithms in two ways: (i) on the full training set and (ii) with leave-one-out cross validation (LOO-CV). The LOO-CV approach involves training the ML algorithm on all but one data point and then applying that classifier to the left-out data point. The process is repeated until each compound is left out exactly once and its crystal structure or properties predicted from the knowledge of the remaining compounds. LOO-CV is a common procedure used in ML when the training data sets are smaller in size. Additional details necessary to reproduce the ML are given in Supplementary Note 4.

Density Functional Theory

Density functional theory (DFT) calculations were performed within the generalized gradient approximation (GGA) as implemented in Quantum ESPRESSO⁵⁰. For the AB solids, the PBEsol exchange-correlation functional⁵¹ was used and the core and valence electrons were treated with ultrasoft pseudopotentials⁵². The Brillouin zone integration was performed using a 12 × 12 × 12 Monkhorst-Pack k-point mesh⁵³ centered at Γ and 60 Ry plane-wave cutoff. For modeling the RM intermetallics, the PBE exchange-correlation functional⁵⁴ was used. The core and valence electrons were treated with projector augmented-wave (PAW) pseudopotentials⁵⁵. We chose these exchange-correlation functionals and pseudopotentials to directly compare our results with available electronic structure calculations²⁹. All RM compounds were treated as spin unpolarized except those containing Dy. For the Dy-atom, we used the PAW pseudopotential generated by Topsakal and Wentzcovitch⁵⁶ that explicitly treats the 4f-orbitals as valence states. Collinear ferromagnetic spin order was imposed on the Dy-atom. We compared the electronic structure of the intermetallic compound with and without Hubbard-U correction on the Dy manifold; we chose a U_f value of 5 eV⁵⁶. (see Supplementary Note 3 and Supplementary Figure 3). For the DFT+U calculations, the standard Dudarev implementation⁵⁷ was used. In compounds that do not contain Dy, non spin-polarized calculations were performed. The Brillouin zone integration was performed using a 10 × 10 × 10 Monkhorst-Pack k-point mesh⁵³ centered at Γ and 60 Ry plane-wave cutoff.

For both AB and RM compounds, to yield optimally smooth pseudopotentials, we used the Troullier-Martins pseudization method⁵⁸. The scalar relativistic pseudopotentials were generated using the atomic package⁵⁰ with the inclusion of nonlinear core corrections. For the density of states calculations, 14 × 14 × 14 Monkhorst-Pack k-point mesh centered at Γ was used. The atomic positions and the cell volume were allowed to change until an energy convergence threshold of 10⁻⁸ eV and Hellmann-Feynman forces less than 2 meV/Å, respectively, were achieved. The space groups of the optimized structures were determined using FINDSYM⁵⁹ and the resulting crystal structures were visualized in VESTA⁶⁰.

Additional Information

How to cite this article: Balachandran, P. V. et al. Materials Prediction via Classification Learning. Sci. Rep. 5, 13285; doi: 10.1038/srep13285 (2015).

References

Simons, G. New Model Potential for Pseudopotential Calculations. The Journal of Chemical Physics 55, 756–761 (1971).
ADS CAS Google Scholar
Simons, G. & Bloch, A. Pauli-Force Model Potential for Solids. Phys. Rev. B 7, 2754–2761 (1973).
ADS Google Scholar
Bloch, A. N. & Simons, G. Structural Index for Elemental Solids. Journal of the American Chemical Society 94, 8611–8613 (1972).
CAS Google Scholar
John, J. & Bloch, A. N. Quantum-Defect Electronegativity Scale for Nontransition Elements. Phys. Rev. Lett. 33, 1095–1098 (1974).
ADS Google Scholar
Chelikowsky, J. R. & Phillips, J. C. Quantum-defect theory of heats of formation and structural transition energies of liquid and solid simple metal alloys and compounds. Phys. Rev. B 17, 2453–2477 (1978).
ADS CAS Google Scholar
Cohen, M. L. Pseudopotentials and Crystal Structure. In O’Keeffe & Navrotsky, A. (eds.) Structure and Bonding in Crystals I 25–48 (Elsevier Science, 1981).
Bloch, A. N. & Schatteman, G. C. Quantum-Defect Orbital Radii and the Structural Chemistry of Simple Solids. In O’Keeffe & Navrotsky, A. (eds.) Structure and Bonding in Crystals I 49–71 (Elsevier Science, 1981).
Phillips, J. C. Quantum Theory and Crystal Chemistry. In O’Keeffe & Navrotsky, A. (eds.) Structure and Bonding in Crystals I 13–24 (Elsevier Science, 1981).
Slater, J. C. Atomic Radii in Crystals. The Journal of Chemical Physics 41, 3199–3204 (1964).
ADS CAS Google Scholar
Shannon, R. D. Revised effective ionic radii and systematic studies of interatomic distances in halides and chalcogenides. Acta Crystallographica Section A 32, 751–767 (1976).
ADS Google Scholar
Pauling, L. A Resonating-Valence-Bond Theory of Metals and Intermetallic Compounds. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 196, 343–362 (1949).
ADS CAS MATH Google Scholar
Burdett, J. K. & Price, S. L. An interpretation of structural sorting diagrams for AB type compounds using molecular orbital ideas. Journal of Physics and Chemistry of Solids 43, 521–531 (1982).
ADS CAS Google Scholar
Mooser, E. & Pearson, W. B. On the crystal chemistry of normal valence compounds. Acta Crystallographica 12, 1015–1022 (1959).
CAS Google Scholar
Pauling, L. The Nature of the Chemical Bond. IV. The Energy of Single Bonds and the Relative Electronegativity of Atoms. Journal of the American Chemical Society 54, 3570–3582 (1932).
CAS MATH Google Scholar
Phillips, J. C. & Van Vechten, J. A. Spectroscopic Analysis of Cohesive Energies and Heats of Formation of Tetrahedrally Coordinated Semiconductors. Phys. Rev. B 2, 2147–2160 (1970).
ADS Google Scholar
Phillips, J. Structural pseudoion form factors. Solid State Communications 22, 549–550 (1977).
ADS CAS Google Scholar
Machlin, E. S., Chow, T. P. & Phillips, J. C. Structural Stability of Suboctet Simple Binary Compounds. Phys. Rev. Lett. 38, 1292–1295 (1977).
ADS CAS Google Scholar
Littlewood, P. B. Structure and bonding in narrow gap semiconductors. Critical Reviews in Solid State and Materials Sciences 11, 229–285 (1983).
ADS Google Scholar
Zunger, A. Systematization of the stable crystal structure of all AB-type binary compounds: A pseudopotential orbital-radii approach. Phys. Rev. B 22, 5839–5872 (1980).
ADS CAS Google Scholar
Paudel, T. R., Zakutayev, A., Lany, S., D’Avezac, M. & Zunger, A. Doping Rules and Doping Prototypes in A2BO4 Spinel Oxides. Advanced Functional Materials 21, 4493–4501 (2011).
CAS Google Scholar
Cohen, M. L. Electronic Charge Densities in Semiconductors: Electron density calculations give new insights into the origins of the properties of solids. Science 179, 1189–1195 (1973).
ADS CAS PubMed Google Scholar
Andreoni, W. & Galli, G. Unified structural classification of AB2 molecules and solids from valence electron orbital radii. Physics and Chemistry of Minerals 14, 389–395 (1987).
ADS CAS Google Scholar
Rabe, K. M. Quantum Diagrams and Prediction of New Materials. Journal of Alloys and Compounds 197, 131–135 (1993).
CAS Google Scholar
Lencer, D. et al. A map for phase-change materials. Nat. Mater 7, 972–977 (2008).
ADS CAS PubMed Google Scholar
Saad, Y. et al. Data mining for materials: Computational experiments with AB compounds. Phys. Rev. B 85, 104104 (2012).
ADS Google Scholar
Seko, A., Maekawa, T., Tsuda, K. & Tanaka, I. Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids. Phys. Rev. B 89, 054303 (2014).
ADS Google Scholar
Ghiringhelli, L. M., Vybiral, J., Levchenko, S. V., Draxl, C. & Scheffler, M. Big data of materials science: Critical role of the descriptor. Phys. Rev. Lett. 114, 105503 (2015).
ADS PubMed Google Scholar
Waber, J. T. & Cromer, D. T. Orbital Radii of Atoms and Ions. The Journal of Chemical Physics 42, 4116–4123 (1965).
ADS CAS Google Scholar
Gschneidner, K. et al. Influence of the electronic structure on the ductile behavior of B2 CsCl-type AB intermetallics. Acta Materialia 57, 5876–5881 (2009).
CAS Google Scholar
Ringnér, M. What is principal component analysis? Nat Biotech 26, 303–304 (2008).
Google Scholar
Gschneidner, K. et al. A family of ductile intermetallic compounds. Nat. Mater 2, 587–591 (2003).
ADS CAS PubMed Google Scholar
Sun, R. & Johnson, D. D. Stability maps to predict anomalous ductility in B2 materials. Phys. Rev. B 87, 104107 (2013).
ADS Google Scholar
Wang, X. F., Jones, T. E., Li, W. & Zhou, Y. C. Extreme Poisson’s ratios and their electronic origin in B2 CsCl-type AB intermetallic compounds. Phys. Rev. B 85, 134108 (2012).
ADS Google Scholar
Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Materials 1, 011002 (2013).
ADS Google Scholar
Curtarolo, S. et al. AFLOWLIB.ORG: A distributed materials property repository from high-throughput ab initio calculations. Computational Materials Science 58, 227–235 (2012).
CAS Google Scholar
Belsky, A., Hellenbrandt, M., Karen, V. L. & Luksch, P. New developments in the Inorganic Crystal Structure Database (ICSD): accessibility in support of materials research and design. Acta Crystallographica Section B 58, 364–369 (2002).
Google Scholar
Balachandran, P. V., Broderick, S. R. & Rajan, K. Identifying the inorganic gene for high-temperature piezoelectric perovskites through statistical learning. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Science 467, 2271–2290 (2011).
ADS CAS Google Scholar
Balachandran, P. V., Puggioni, D. & Rondinelli, J. M. Crystal-Chemistry Guidelines for Noncentrosymmetric A2BO4 Ruddlesden-Popper Oxides. Inorganic Chemistry 53, 336–348 (2014).
CAS PubMed Google Scholar
Meredig, B. et al. Combinatorial screening for new materials in unconstrained composition space with machine learning. Phys. Rev. B 89, 094104 (2014).
ADS Google Scholar
Meredig, B. & Wolverton, C. Dissolving the Periodic Table in Cubic Zirconia: Data Mining to Discover Chemical Trends. Chemistry of Materials 26, 1985–1991 (2014).
CAS Google Scholar
Isayev, O. et al. Materials cartography: Representing and mining materials space using structural and electronic fingerprints. Chemistry of Materials 27, 735–743 (2015).
CAS Google Scholar
Kingsford, C. & Salzberg, S. L. What are decision trees? Nat Biotech 26, 1011–1013 (2008).
CAS Google Scholar
Noble, W. S. What is a support vector machine? Nat Biotech 24, 1565–1567 (2006).
CAS Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2012).
Quinlan, R. C4.5: Programs for Machine Learning (Morgan Kaufmann Publishers, San Mateo, CA, 1993).
Hall, M. et al. The WEKA Data Mining Software: An Update. SIGKDD Explor. Newsl. 11, 10–18 (2009).
Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Steinwart, I. On the influence of the kernel on the consistency of support vector machines. J. Mach. Learn. Res. 2, 67–93 (2002).
MathSciNet MATH Google Scholar
Wolpert, D. H. The Lack of A Priori Distinctions Between Learning Algorithms. Neural Computation 8, 1341–1390 (1996).
Google Scholar
Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. Journal of Physics: Condensed Matter 21, 395502 (19pp) (2009).
PubMed Google Scholar
Perdew, J. P. et al. Restoring the Density-Gradient Expansion for Exchange in Solids and Surfaces. Phys. Rev. Lett. 100, 136406 (2008).
ADS PubMed Google Scholar
Vanderbilt, D. Soft self-consistent pseudopotentials in a generalized eigenvalue formalism. Phys. Rev. B 41, 7892–7895 (1990).
ADS CAS Google Scholar
Monkhorst, H. J. & Pack, J. D. Special points for Brillouin-zone integrations. Phys. Rev. B 13, 5188–5192 (1976).
ADS MathSciNet Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
ADS CAS PubMed Google Scholar
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
ADS Google Scholar
Topsakal, M. & Wentzcovitch, R. Accurate projected augmented wave (PAW) datasets for rare-earth elements (RE=La-Lu). Computational Materials Science 95, 263–270 (2014).
CAS Google Scholar
Dudarev, S. L., Botton, G. A., Savrasov, S. Y., Humphreys, C. J. & Sutton, A. P. Electron-energy-loss spectra and the structural stability of nickel oxide: An LSDA+U study. Phys. Rev. B 57, 1505–1509 (1998).
ADS CAS Google Scholar
Troullier, N. & Martins, J. L. Efficient pseudopotentials for plane-wave calculations. Phys. Rev. B 43, 1993–2006 (1991).
ADS CAS Google Scholar
Stokes, H. T. & Hatch, D. M. FINDSYM: program for identifying the space-group symmetry of a crystal. Journal of Applied Crystallography 38, 237–238 (2005).
CAS Google Scholar
Momma, K. & Izumi, F. VESTA: a three-dimensional visualization system for electronic and structural analysis. Journal of Applied Crystallography 41, 653–658 (2008).
CAS Google Scholar

Download references

Acknowledgements

P.V.B., T.L. and J.T. acknowledge funding support from the Los Alamos National Laboratory (LANL) Laboratory Directed Research and Development (LDRD) DR (#20140013DR) on Materials Informatics. J.M.R. acknowledges support from NSF-DMR 1454688. P.V.B. thanks J. Hogden for comments on the paper. P.V.B. thanks M. Sanati for bringing the RM intermetallics problem to our attention and M. Topsakal for assistance with the Dy-pseudopotentials. P.V.B. also thanks J. Gubernatis and G. Pilania for insightful discussions. DFT calculations were performed using the Institutional Computing (IC) resources at LANL.

Author information

Authors and Affiliations

Theoretical Division, Los Alamos National Laboratory, Los Alamos, 87545, NM, USA
Prasanna V. Balachandran & Turab Lookman
Intelligence and Space Research, Los Alamos National Laboratory, Los Alamos, 87545, NM, USA
James Theiler
Department of Materials Science and Engineering, Northwestern University, Evanston, 60208, IL, USA
James M. Rondinelli

Authors

Prasanna V. Balachandran
View author publications
You can also search for this author in PubMed Google Scholar
James Theiler
View author publications
You can also search for this author in PubMed Google Scholar
James M. Rondinelli
View author publications
You can also search for this author in PubMed Google Scholar
Turab Lookman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The informatics ideas were formulated by P.V.B., J.T., J.M.R. and T.L. P.V.B. built the data sets, performed principal component analysis and decision tree classification. P.V.B. and J.M.R. performed density functional calculations. J.T. performed the support vector machine classification. All authors analyzed the results and contributed to the writing of the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Balachandran, P., Theiler, J., Rondinelli, J. et al. Materials Prediction via Classification Learning. Sci Rep 5, 13285 (2015). https://doi.org/10.1038/srep13285

Download citation

Received: 02 April 2015
Accepted: 16 July 2015
Published: 25 August 2015
DOI: https://doi.org/10.1038/srep13285

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.