Abstract
Electronic density of states (DOS) is a key factor in condensed matter physics and material science that determines the properties of metals. Firstprinciples densityfunctional theory (DFT) calculations have typically been used to obtain the DOS despite the considerable computation cost. Herein, we report a fast machine learning method for predicting the DOS patterns of not only bulk structures but also surface structures in multicomponent alloy systems by a principal component analysis. Within this framework, we use only four features to define the composition, atomic structure, and surfaces of alloys, which are the dorbital occupation ratio, coordination number, mixing factor, and the inverse of miller indices. While the DFT method scales as O(N^{3}) in which N is the number of electrons in the system size, our pattern learning method can be independent on the number of electrons. Furthermore, our method provides a pattern similarity of 91 ~ 98% compared to DFT calculations. This reveals that our learning method will be an alternative that can break the tradeoff relationship between accuracy and speed that is well known in the field of electronic structure calculations.
Similar content being viewed by others
Introduction
Electronic density of states (DOS) plays a tremendously important role in determining the properties of metals^{1}. Researchers in the fields of solidstate and condensed matter physics carefully diagnose density distributions of free electrons in metals to understand scientific concepts that are hidden in such density distributions (e.g., the dband center theory)^{2} and to develop new materials^{3,4}.
Quantum mechanical approaches (e.g., density functional theory) shed light on the nature of electrons in metals, and firstprinciples density functional theory (DFT) calculations are successful methods to develop the electronic DOS of metals. Although quantum mechanical methods provide a high accuracy, they have the disadvantage of a severe computational workload, which originates from the complexity of manybody systems^{5}. Thus, many researchers are seeking a fast method to predict electronic structures of materials with a high accuracy^{6,7,8,9}.
Within quantum mechanical frameworks, their high computational cost limits the system size that can be studied. To circumvent such causalitybased frameworks, an inductive method can be realized by utilizing data and statistical learning algorithms^{10,11,12,13,14,15,16,17}. Recently, a machinelearning approach was pursued to address different quantum mechanical problems^{18,19}, and in particular, to predict the electronic structures of alloys, e.g., to predict the DOS values at the Fermi level^{20} or the dband centers^{21}. However, to date, these attempts have been limited to the prediction of only single value, and no machinelearning technique is available for the prediction of DOS patterns that includes both the value and shape.
Herein, we propose a new perspective on the representation of DOS that has been regarded as multidimensional digital data from onedimensional continuous curves. Using principal component analysis, we identified highly correlated DOS patterns for various metal systems and proposed features to determine the correlation between the DOS patterns and the atomic structures of materials in a linear subspace. We successfully reproduced the DOS patterns of alloys usually found by quantum mechanical approaches, which is independent of the number of electrons in the system. Furthermore, our method achieves a small loss of accuracy (Accuracy >90%) compared to DFT calculations. The DOS pattern learning method can provide a breakthrough in the tradeoff relationship between accuracy and speed, which is well known in the field of electronic structure calculations. Moreover, the approach is applicable for predicting DOS patterns in not only bulk structures but also of surfaces in multicomponent alloy systems.
Results
When mapping DOS patterns from the atomic structures of alloys, there is a mathematical puzzle, i.e., the number of input material labels (e.g., compositions, crystal structures, and lattice parameters) is much smaller than the number of output DOS values at the corresponding energy levels. Accordingly, we first compressed the output information by digitizing an analog signal of the DOS in a rectangular window to one multidimensional vector, as shown in Fig. 1a. Next, we applied principal component analysis (PCA), an unsupervised learning technique, to reduce the highdimensional data to a lowdimensional data set^{22,23}. Then, we could build a model to represent the DOS patterns.
Learning process of DOS patterns
In the learning process of the DOS pattern (ρ), PCA was employed in which, we implemented Python code with matrices operation package NumPy^{24} for the analysis. Mathematically, this code finds the maximum variance of linearly independent eigenvectors. Prior to the analysis, DOS image vectors were digitized in a rectangular window. In our study, we considered an energy range from −10 eV to 5 eV and a DOS range from 0 to 3. We standardized the DOS image vectors of the training data by obtaining the normalized matrix Y in which the i^{th} row (y_{i}) of Y is \(\,{{\bf{x}}}_{{\boldsymbol{i}}}\bar{{\bf{x}}}\), where \(\bar{{\bf{x}}}\) is the mean of each column vector of X. Then, we calculated the eigenvectors, u_{p} = \({({u}_{1},{u}_{2},\ldots ,{u}_{M\times N})}_{p}\), and the corresponding eigenvalues, λ_{p}, were calculated by the covariance matrix, S = Y^{T}Y, according to Eq. (1):
Here, the eigenvectors are called principal components (PCs), and the corresponding eigenvalues describe the data variance along the PCs.
The original vector x can be reconstructed by using the following Eq. (2):
where P is the number of PCs and p is their index. Thus, coefficient α_{p}of the eigenvectors can be computed by \({{\bf{y}}}^{{\rm{T}}}{{\bf{u}}}_{p}+{\bar{{\bf{x}}}}^{{\rm{T}}}{{\bf{u}}}_{p}\), and it corresponds to the coordinate values on the linear subspace that is composed of PCs.
In the learning process using the PCA, we identified the linear subspace for which the orthogonal projections of the image vector, x, have a maximum variance, and we learned the eigenvectors, u, of the training systems in the linear subspace (Fig. 1b). The original image vectors can be reconstructed by \(\sum _{p=1}^{P}{\alpha }_{p}{{\bf{u}}}_{p}\).
Predicting process of DOS patterns
During the predicting process for the DOS pattern (a new image vector, x′) of a test alloy, as shown in Fig. 1c, we estimated the new coefficients, \({\alpha ^{\prime} }_{p}\), via a linear interpolation between α_{p} of the two training systems that is most similar to the test composition, where features relevant to the electron occupation and atomic configuration were considered (Supplementary Figs S1 and S2). Using \(\sum _{p=1}^{P}{\alpha ^{\prime} }_{p}{{\bf{u}}}_{p}\), we obtained a new image vector, x′, and transformed from the x′ to the DOS probability matrix, X′, the elements of which are the probable values of each DOS levels at the given energy interval.
To predict DOS patterns, only a single DOS value must be determined by a given energy interval. Thus, we defined the DOS probability matrix originating from the DOS image vector. The DOS image vector, \({\bf{x}}{\boldsymbol{^{\prime} }}=({x^{\prime} }_{1},{x^{\prime} }_{2},\ldots ,{x^{\prime} }_{M\times N})\), calculated by the PCs and the estimated coefficients, was transformed to the DOS image matrix, I′, with M columns and N rows in a gridbased rectangular window, and its size is the same as the size used in the learning process (Supplementary Fig. S6). To define the DOS probability matrix, we considered only positive entries in the I′, and the other entries were regarded as zero. Moreover, we normalized all of the entries of the DOS levels at each energy interval. Then, we defined the DOS probability matrix, X′, with M columns and N rows, as given by
where \({x^{\prime} }_{m,n}\) is the positive entry value of the column vector in X′, and m and n are the matrix indices.
To predict the DOS pattern with X′, one should determine a single DOS value at each energy interval. Therefore, we obtained the estimated DOS, which is ρ′ and is given by
where E_{m} is the m^{th} energy interval, and ρ_{n} is the n^{th} DOS level.
Application into binary alloy systems
To test our pattern learning (PL) method, it was first applied to a CuNi system. Thermodynamically, this alloy system shows a complete solid solution, indicating that the Cu and Ni atoms in the alloys are homogeneously mixed in a facecentered cubic (fcc) structure regardless of the composition. Thus, it is expected that the DOS of the alloy system follows intrinsic electronic structures of Cu and Ni crystals and their composition can be a key feature for the representation of their DOS patterns. Therefore, we define the dorbital electron occupation rate (n_{d}) as an alloy compositiondependent feature that represents local DOS patterns of the dorbitals. Moreover, all of the pristine Cu and Ni and their alloys have a fcc crystal structure, indicating that the effect of the atomic structure on the DOS pattern is not significant. Accordingly, to predict the DOS patterns in this system, we considered only n_{d} as a feature. After training the DOS patterns for various CuNi compositions {Cu, Cu_{0.75}Ni_{0.25}, Cu_{0.5}Ni_{0.5}, Cu_{0.25}Ni_{0.75}, Ni}, we predicted the DOS of Cu_{0.375}Ni_{0.625} as the test alloy (Fig. 2a) by considering three PCs. A comparison with the DFT results revealed that our method obtained the pattern similarity of 95% (σ = 0.95). However, the calculation time is less than 1 minute even on 1 core of an Intel Xeon CPU, whereas the DFT method requires approximately 2 hours on 16 cores of the CPU.
In contrast to the CuNi system, the CuFe system was also considered because the crystal structures of Cu and Fe are different and their alloys do not exhibit a complete solid solution. This implies that features based on the atomic structures in addition to n_{d} are required for the DOS representation. We introduced the coordination number (CN) and a mixing factor (F_{mix}) as features to distinguish the atomic structures. The CN was obtained by dividing the number of all bonds between two atoms by the total number of atoms in the system, where the bonds were calculated using the covalent atomic radii. F_{mix} indicates the ratio of the number of different pair bonds in the alloy system to the total number of bonds. Using F_{mix}, one can distinguish the atomic distributions in alloy systems that have the same CN (Supplementary Fig. S2).
To represent the DOS patterns for the test data, the coefficient \({\alpha ^{\prime} }_{p}\) should be determined. Since the eigenvectors obtained after PCA correspond to the PC vectors, the distributed coefficients lying on identical eigenvectors were correlated with each other. Thus, we generated linear regression lines between the α_{p} of the training data in which we focused on the linear regression line between two training data near the test composition. Then, using the features of the training and test systems, we estimated the \({\alpha ^{\prime} }_{k}\) contributions of n_{d}, CN, and F_{mix} (\(\,{\alpha }_{p}^{^{\prime} {{\rm{n}}}_{{\rm{d}}}},\,{\alpha }_{p}^{^{\prime} {\rm{CN}}},\,{\alpha }_{p}^{^{\prime} {{\rm{F}}}_{{\rm{mix}}}}\)) for the test system using the linear regression line (Fig. 1c). We defined the set of features as Φ = {n_{d}, CN, F_{mix}}. Here, it was assumed that the three features have equal weights so that
where β_{φ} is 1/3 for all the features. A detailed description of the estimation of the coefficients is also provided in Section 3 of the Supplementary Information (Table S1 and Fig. S7).
Using these three features (n_{d}, CN, and F_{mix}), the DOS of Cu_{0.375}Fe_{0.625} was predicted (Fig. 2b). The use of only n_{d} leads to the pattern similarity of 78%, while the use of all three features improves the pattern similarity up to 95%. Even in the previously examined CuNi system, consideration of CN and F_{mix} in addition to n_{d} can slightly improve the pattern similarity up to 96% for Cu_{0.375}Ni_{0.625} (Supplementary Fig. S8). Furthermore, we calculated DOS patterns for new test data, Cu_{0.625}Ni_{0.375} and Cu_{0.625}Fe_{0.375}, and then obtained the pattern similarities of 95% and 97%, respectively (Supplementary Fig. S9).
To highlight a novelty of our PL method, we additionally calculated the DOS patterns by a linear interpolation of the DOS patterns of the two nearest neighbors without PCA. Compared to the DFT calculation, the linear interpolation method shows the pattern similarities of 90% for Cu_{0.375}Ni_{0.625} and 88% for Cu_{0.375}Fe_{0.625} (Supplementary Fig. S10), which are lower accuracies than those predicted by our PL method.
Application into multicomponent alloy systems
To extend our method to multicomponent alloy systems, we also developed a method to represent the DOS patterns of ternary systems, using the example of the CuNiPt system (Fig. 3). Figure 3a shows a triangular composition diagram of the CuNiPt system, where a total of 15 compositions were considered as the training set: pure 3, binary 9, and ternary 3. Similar to the previous binary cases, by determining the coefficients (α_{p}) of the PCs for a ternary test composition, one can represent its DOS pattern. First, we selected three training compositions that were located closest to the test composition and calculated the distances (d) between the test composition and the three training compositions (Fig. 3b). Then, the \({\alpha ^{\prime} }_{p}\) for the DOS representation were estimated by using the features and α_{p} at the training compositions, where it was assumed on physical grounds that the DOS pattern of the test composition is represented by the highest weight for the training composition that is nearest to the test composition. According to coherentpotential approximation (CPA)^{25,26} that has been extensively used in calculating electronic structures of various alloy systems, the effective or coherent potential lattice can be represented by the average behavior of the AB binary alloy. However, the singlesite nature of the CPA limits its applicability to systems with negligible shortrange order and local lattice relaxation effects^{27}. When estimating the \({\alpha ^{\prime} }_{p}\) for the DOS representation of the test ternary alloy on the basis of the theory, we consider the three training compositions that is nearest to the test composition, rather than the pure compositions. The atomic structures at the training compositions that is nearest to the test composition include more similar atomic distribution information (e.g., atomic ordering or lattice relaxation) to the test alloy than those of the pure metals or others.
The basic idea for an ABC ternary case is similar to that of the binary case. In this case, we define the set of features as Φ = {n_{d,A}, n_{d,B}, n_{d,C}, CN_{norm}, F_{mix}}. The number of feature values for n_{d} depends on the number of elements in the multicomponents case. Here, we also considered the differences (d_{ij}) in the feature values between the test and the adjacent three training compositions (Supplementary Fig. S11) as given by:
where i and j are the selected data of the ABC alloy system, and n_{d,A}, n_{d,B}, n_{d,C}, CN_{norm}, and F_{mix} are feature values corresponding to the data. Of the three material features (n_{d}, CN, and F_{mix}), the n_{d} and F_{mix} values range from 0 to 1, whereas CN is greater than 1. To obtain units in the same range, we considered the normalized value of CN (CN_{norm}) by dividing the CN value by 12, which is based on the fact that the maximum CN value in the alloy system is 12 for a fcc structure. When the composition and crystal structure of a test alloy are more similar to the training data, the differences in the feature values decreases. We defined Ω as the set of the nearest three training data, ν as the test data, and ν′ as the training data. To estimate the PC coefficient (\({\alpha ^{\prime} }_{k,\nu }\)) for the test data, three weights (w) of the coefficients of the three training systems were calculated based on d_{ij} using Eq. (7):
The range of \({w}_{{\rm{\nu }}{\rm{\nu }}^{\prime} }\) is from 0 to 1. Then, the estimated PC coefficients for the test alloy were calculated by
For the example of the CuNiPt system shown in Fig. 3, the n_{d}, CN, and F_{mix} of the training and test data are summarized in Supplementary Table S2. This approach was tested for two compositions: Cu_{0.03}Ni_{0.03}Pt_{0.94} (Fig. 3c) and Cu_{0.32}Ni_{0.34}Pt_{0.34} (Fig. 3d), and we determined that our method obtains the pattern similarity of 96%.
Using the similar procedure, our method can be readily extended to quaternary or quinary alloy systems. As an example, we considered the quinary system of CuNiPtFeCr that is on an extension line of the ternary CuNiPt system discussed in Fig. 3. We trained the DOS patterns for the 15 ternary CuNiPt compositions in Fig. 3 and the 3 quaternary/quinary CuNiPtFeCr compositions {Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.12}, Cu_{0.315}Ni_{0.315}Pt_{0.25}Cr_{0.12}, and Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.06}Cr_{0.06}}, and then predicted the DOS patterns of Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.03}Cr_{0.09} and Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.09}Cr_{0.03} as test quinary systems, in which the atomic structures of the training and test quaternary/quinary systems are shown in Supplementary Fig. S4. Compared to the DFT results, our method reveals the superior pattern similarity: 97% for Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.03}Cr_{0.09} and 96% for Cu_{0.315}Ni_{0.315}Pt_{0.25}Fe_{0.09}Cr_{0.03} (Supplementary Fig. S12). This reveals a high utility of our method. In particular, it can be applied to the field of not only a metallic catalysis but also a high entropy alloy (HEA), in which the HEA is usually consisted of five or more metallic elements. Here, we need to compare our PL method with the CPA^{25,26} that has been extensively used in calculating the electronic structure calculations in multicomponent random solid solutions. Recently, the electronic structures of various HEAs were calculated by the exact muffintin orbitals (EMTO) method in combination with the CPA^{27,28}. The EMTOCPA is undoubtedly an accurate and efficient method. However, the computational complexity of the EMTOCPA scales linearly with the number of atoms in the supercell^{29}, while our PL method is independent on the number of atoms in the supercell (the details will be discussed in the Discussion section). This implies that as a supercell size increases our PL method can be more efficient than the EMTOCPA method.
Application into surface structures
By expanding the scheme that was applied to bulk structures, we studied the representation of the DOS patterns for surface structures of alloys. In particular, we used method to represent the DOS patterns of highindex surfaces based on those of lowindex surfaces. Here, it is important to find a feature to define the surface structures, with which we can estimate the PC coefficients for a test surface. In Fig. 4a, a highindex surface, (211), can be regarded as a surface to connect two lowindex surfaces, (011) and (111). Moreover, the step alignment of atoms on the (211) surface plane is generated by a combination of the atom alignments on the (011) and (111) surface plane. In this regard, we employed a lattice plane vector by using the miller indices, which consist of three integers, h, k, and l, and where the notation of the lattice plane vector is written by (hkl). Then, we defined the lattice plane that intercepts three points, \({\overrightarrow{{\boldsymbol{L}}}}_{1}\)/h, \({\overrightarrow{{\boldsymbol{L}}}}_{2}\)/k, and \({\overrightarrow{{\boldsymbol{L}}}}_{3}\)/l, where \({\overrightarrow{{\boldsymbol{L}}}}_{1}\),\(\,{\overrightarrow{{\boldsymbol{L}}}}_{2}\), and \({\overrightarrow{{\boldsymbol{L}}}}_{3}\) are the lattice vectors in a conventional unit cell. Therefore, we considered the inverses of the miller indices, 1/h, 1/k, and 1/l, as the features regarding the surface plane orientations, and they are denoted by h′, k′, and l′, respectively. If one of the miller indices is zero, the feature value is set to be zero to avoid an infinity value.
Figure 4b shows the threedimensional (h′, k′, l′) vector space representing the inverses of miller indices for the training and test surfaces. The training samples include seven lattice plane vectors where all of the miller indices are lower than 2; {(001), (010), (100), (011), (101), (110), (111)}. The vectors correspond to lowindex surface plane vectors. Then, after adding the origin vector, (000), and connecting all vectors of the training samples and the origin, a cubic geometrical figure can be obtained. The test sample is the vector of which the miller indices is larger than one, which corresponds to a highindex surface plane vector. Thus, the high index surface plane vectors can lie on an edge or face in the cube figure. In Fig. 4a, the alignments of atoms on the (211) surface plane is a combination of the atom alignments on the (011) and (111) surface planes. Therefore, we can estimate the DOS pattern for the (211) surface with those for the (011) and (111) surfaces. Here, for the three vectors, the miller indices k and l are the same, indicating that one can be distinguished only using the miller index h. During the predicting process of our method (Fig. 1c), we only used the h′ value to determine the PC coefficients, \({\alpha ^{\prime} }_{p(211)}\), by the linearly interpolating between the two coefficients for the (011) and (111) surfaces after performing the PCA using all training DOS data.
To validate our method for surface structures, we tested (211) surfaces of the pure Cu metal (Fig. 4c) and the Cu_{0.375}Ni_{0.625} alloy (Fig. 4d). For the Cu case, our method provided the pattern similarity of 93% compared to DFT calculation (Fig. 4c), where we considered only three DOS data for Cu(001), (011), and (111) surfaces as the training data. When predicting the DOS pattern for the (211) surface of the Cu_{0.375}Ni_{0.625} alloy (see Supplementary Fig. S13), we considered (001), (011), and (111) surfaces of five CuNi alloys; {Cu, Cu_{0.75}Ni_{0.25}, Cu_{0.5}Ni_{0.5}, Cu_{0.25}Ni_{0.75}, Ni}. We first predicted the DOS patterns for (001), (011) and (111) surfaces of the Cu_{0.375}Ni_{0.625} alloy with three features (n_{d}, CN, and F_{mix}), which is similar to the method used in the bulk case. Then, we performed PCA one more time for the DOS patterns for the lowindex surfaces of the Cu_{0.375}Ni_{0.625} alloy that were obtained after the first pattern learning method. Then, using the inverse value of the miller index as a feature, we predicted the DOS pattern for the (211) surface of the Cu_{0.375}Ni_{0.625} alloy, and obtained the pattern similarity of 97% (Fig. 4d). In DFT calculations, the slab calculations are much more timeconsuming than bulk calculations. However, our method provides a similar calculation speed for both bulk and surface systems. For example, for the Cu(211) and Cu_{0.375}Ni_{0.625}(211) surface, the DFT calculation takes 2 hours on 36 cores of the CPU, while our method is still less than 5 minute even on 1 core of the CPU.
Discussion
Although highperformance computing machines have been used practically thus far, we could still tackle largescale firstprinciples calculations of more than hundreds atoms using the limited computing power. Regarding the computation cost, it is wellknown that the DFT method scales as O(N^{3}), where N is the number of electrons in the system^{30}. Indeed, a similar trend was observed in this work (Fig. 5a). However, our method remarkably shows a higher speed than DFT and requires only 1 minute regardless of N, although it depends on the training data size since our method needs to scan the entire training data. In addition, tight binding (TB) or density functional TB (DFTB) methods that are an approximate quantum mechanical approach are undoubtedly an efficient method to calculate electronic structures of materials. However, they require massive calculations for eigenvalue distributions of very large matrices^{31,32,33,34}. Thus, their computational complexity scales linearly with a dimension of the matrix, which depends on the number of atoms or orbitals in the alloy system^{35}. However, our PL method is independent of the system size, indicating that our method can show a higher speed than the TB or DFTB method.
Moreover, Fig. 5b shows the accuracy of our method for various binary alloy systems composed of 5 transition metals (Cu, Ni, Ru, Pd, Pt), and found that the pattern similarity is as high as 91~98%. Our method outperforms DFT calculations in terms of the calculation speed, and it loses little information compared to the DFT electronic structures. This clearly reveals that our method will be an alternative to break the tradeoff relationship between accuracy and speed, which is well known in the field of electronic structure calculations. And, compared to TB or DFTB method, our PL method has another competence. Such TB approaches basically need a number of training DFT data to determine many TB parameters. For example, to accurately calculate the electronic structure of bulk Rh, 29 TB parameters fitted by ~7,400 training data were required^{36}. However, our PL method used the limited training data. As an example, the DOS (96% similarity) of an arbitral CuNi binary alloy can be obtained with only 5 training data (Supplementary Fig. S8).
One of the novelties of this work is that the electronic DOS that was originally a function of an energy level can be expressed with a simple model in a linear combination form of few (three or four) PC bases. Here, we highlight the use of only three or four PCs. Although the available number of PCs in a learning process of DOS patterns is as many as the dimension of the DOS image vectors, the number of PCs highly contributing to the representation of the DOS is few, where the contribution can be evaluated with the eigenvalue for each PC. Thus, four PCs are enough to recognize the diversity of the DOS patterns in the training data. For example, in the binary alloy systems of Fig. 2 where five training data are considered, the contribution of each PC to the representation of the DOS pattern is 32.9% for the 1^{st} PC, 25.7% for the 2^{nd} PC, 22.2% for the 3^{rd} PC, and 19.1% for the 4^{th} PC, which indicates that the contribution of the remaining PCs is very miniscule (less than 0.1%). This clearly shows that only four PCs in the PCA are sufficient to represent DOS patterns.
The performance (calculation speed and accuracy) of our PL method is affected by the number of PCs and the grid size. Interestingly, due to the overfitting problem^{37}, the use of three PCs provides the most accurate DOS patterns in CuNi alloy systems (Supplementary Fig. S14), although at least four PCs are required to fully represent the DOS patterns in training data as discussed in the above paragraph. The grid size also affects the pattern similarity (accuracy) and calculation time of our method (Supplementary Fig. S14). The use of a higher (or finer) grid size provides a higher accuracy, even though an improvement in the pattern similarity for a grid with a higher density than a 100 × 100 grid (M = 100) is not significant. However, for M > 100, the calculation time is significantly high. Thus, we should employ appropriate values with respect to the number of PCs and grid size to guarantee high pattern similarity (>90%) and low calculation time (<1 min).
To our knowledge, in this work, we presented the first machinelearning approach for calculating electronic DOS patterns (both of value and shape) with a strong accuracy and a fast speed. Moreover, our approach can handle a variety of spectrum image data of materials (Xray photoelectron spectroscopy, Xray diffraction, Raman spectrum, etc.). Toward an era of datadriven material design, the importance of material databases will continue to increase; however, the accumulation of data will be a serious bottleneck. In this regard, the fast generation of material databases will be a key in the future. Application of our PCAbased method into various imagetype data will provide rapid and accurate prediction of various material properties in place of DFT calculations or other experimental measurements. Therefore, it is anticipated that our model will accelerate the construction of largescale material databases as well as the design of materials in various fields such as catalysts and electronic devices^{38,39}.
Methods
Data selection
To represent the DOS pattern for a test alloy system with the learning model developed in this work, the relevant training data are required. The data include alloy compositions, crystal structures, and DOS patterns. In general, the more training data would provide a more accurate representation. However, since the main purpose of this work is to introduce a new scheme for obtaining DOS patterns by a machinelearning method, we used a limited data set. In a binary AB system, we considered five data sets (two pure structures and three alloy ones). For the pure cases, we used the experimental crystal structure. For the alloy systems, the A_{0.25}B_{0.75}, A_{0.5}B_{0.5}, and A_{0.75}B_{0.25} compositions were considered. Here, based on the thermodynamic phase diagram of the alloy system, the crystal structures of the three compositions were determined. If there exists an intermetallic phase at the alloy composition, we preferentially considered an intermetallic crystal structure (e.g., L1_{0} for Pt_{0.5}Ni_{0.5}, and L1_{2} for Pt_{0.25}Ni_{0.75} and Pt_{0.75}Ni_{0.25}). On the other hand, for the cases where the intermetallic phase does not exist, atomically randomly mixed structures (i.e., solidsolution phases) were considered with two crystal structures of pure A and B. Among these two structures, we selected the more stable structure as determined by the DFT calculations. The compositions and atomic structures considered in bulk and slab structures study as training and test data are described in Supplementary Figs S3 and S4, respectively. Then, in slab structures study, the compositions of surface layers are considered as same as A_{x}B_{1−x}. The DOS patterns of the training structures were also obtained from the DFT calculations.
DFT calculation of electronic structures
All electronic structure calculations were performed using the Vienna ab initio simulation package (VASP)^{40,41}. The exchangecorrelation energy was described by the revised PerdewBurkeErnzerhof (RPBE) exchange functional^{42,43}. The electronic wave functions were expanded in the planewave basis set with a kinetic energy cutoff of 520 eV. The effect of the core electrons was modeled by projector augmentedwave (PAW) potentials^{44}. The Brillouin zone was sampled using a MonkhorstPack kpoint mesh, and the kpoint sampling was set to 8 × 8 × 8 for bulk structures and 4 × 4 × 1 for slab structures. The bulk crystal structures were modeled using a 2 × 2 × 2 supercell (e.g., fcc: 32 atoms and bcc: 16 atoms), and the slab crystal structures were simulated periodically with four layer cells. In slab structures calculations, a large vacuum spacing >15 Å was used to prevent interslab interactions, and the top most surface layer and subsurface layer of the computational cell were geometrically relaxed such that the maximum force on each atom was less than 0.05 eV Å^{−1}. Their DOS patterns were obtained after a geometry optimization process. We focused on the local DOS of the d orbitals in metals for simplicity, and every DOS normalized by the number of atoms in a periodic system was described as DOS = f (E − E_{f}), where E − E_{f} is the relative energy shift from the Fermi level (E_{f}). In addition, during the DFT calculations, we turned on the spin polarization effect to consider the magnetic properties of the metals. In representing the DOS patterns via our learning model, we applied our model separately for the up spin and the down spin.
Pattern similarity calculation
The pattern similarity of our learning model was calculated through a comparison with the DFT results, in which the l^{2}norm was used. The pattern similarity σ is defined as follows:
where ρ′ and ρ are the DOS patterns obtained by our learning method and calculated by the DFT method, respectively. When σ is closer to 1, our method becomes more accurate.
Data Availability
The data that support the plots within this paper and other findings of this study are available from the corresponding author on request.
References
Martin, R. M. Electronic Structure: Basic Theory and Practical Methods (Cambridge Univ. Press, 2004).
Nørskov, J. K., Bligaard, T., Rossmeisl, J. & Christensen, C. H. Towards the computational design of solid catalysts. Nat. Chem. 1, 37–46 (2009).
Seo, D., Shin, H., Kang, K., Kim, H. & Han, S. S. Firstprinciples design of hydrogen dissociation catalysts based on isoelectronic metal solid solutions. J. Phys. Chem. Lett. 5, 1819–1824 (2014).
Ma, X., Li, Z., Achenie, L. E. K. & Xin, H. Machinelearningaugmented chemisorption model for CO_{2} electroreduction catalyst screening. J. Phys. Chem. Lett. 6, 3528–3533 (2015).
Ratcliff, L. E. et al. Challenges in large scale quantum mechanical calculations. WIREs Comput Mol Sci 7, 1–24 (2017).
Galli, G. Quantum molecular dynamics simulations. Curr. Opin. Solid State Mater. Sci. 1, 864–874 (1996).
Saad, Y., Chelikowsky, J. R. & Shontz, S. M. Numerical methods for electronic structure calculations of materials. SIAM Rev. 52, 3–54 (2010).
Goedecker, S. Linear scaling electronic structure methods. Rev. Mod. Phys. 71, 1085–1123 (1999).
Ordej, P. OrderN tightbinding methods for electronicstructure and molecular dynamics. Comput. Mater. Sci. 12, 157–191 (1998).
Lecun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Jordan, M. I. & Mitchell, T. M. Machine learning: Trends, perspectives, and prospects. Science 349, 255–260 (2015).
Carleo, G. & Troyer, M. Solving the quantum manybody problem with artificial neural networks. Science 606, 602–606 (2017).
Biamonte, J. et al. Quantum machine learning. Nature 549, 195–202 (2017).
Carrasquilla, J. & Melko, R. G. Machine learning phases of matter. Nat. Phys. 13, 431–434 (2017).
Nieuwenburg, E. P. L. V., Liu, Y. & Huber, S. D. Learning phase transitions by confusion. Nat. Phys. 13, 435–440 (2017).
Snyder, J. C., Rupp, M., Hansen, K., Mu, K. & Burke, K. Finding density functionals with machine learning. Phys. Rev. Lett. 108, 253002 (2012).
Ghiringhelli, L. M., Vybiral, J., Levchenko, S. V., Draxl, C. & Scheffler, M. Big data of materials science: Critical role of the descriptor. Phys. Rev. Lett. 114, 105503 (2015).
Brockherde, F. et al. Bypassing the KohnSham equations with machine learning. Nat. Commun. 8, 872 (2017).
Arsenault, L., Lopezbezanilla, A. & Millis, A. J. Machine learning for manybody physics: The case of the Anderson impurity model. Phys. Rev. B 90, 155136 (2014).
Schütt, K. T., Glawe, H., Brockherde, F., Sanna, A. & Gross, E. K. U. How to represent crystal structures for machine learning: Towards fast prediction of electronic properties. Phys. Rev. B 89, 205118 (2014).
Takigawa, I., Shimizu, K., Tsuda, K. & Takakusagi, S. Machinelearning prediction of the dband center for metals and bimetals. RSC Adv. 6, 52587–52595 (2016).
Mueller, T., Kusne, A. G. & Ramprasad, R. Machine learning in materials science: Recent progress and emerging applications. Rev. in Comput. Chem. 29, 186–273 (2016).
Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning 534–552 (Springer, 2009).
Ramani, V. et al. Massively multiplex singlecell HiC. Nat. Meth. 14, 263–266 (2017).
Soven, P. Coherentpotential model of substitutional disordered alloys. Phys. Rev. 156, 809–813 (1967).
Gyorffy, B. L. Coherentpotential approximation for a nonoverlappingmuffintin potential model of random substitutional alloys. Phys. Rev. B 5, 2382–2384 (1972).
Tian, F., Varga, L. K., Chen, N., Delczerg, L. & Vitos, L. Ab initio investigation of highentropy alloys of 3d elements. Phys. Rev. B 87, 075144 (2013).
Tian, F., Varga, L. K., Chen, N., Shen, J. & Vitos, L. Ab initio design of elastically isotropic TiZrNbMoV_{x} highentropy alloys. J. Alloys Compd. 599, 19–25 (2014).
Peil, O. E., Ruban, A. V. & Johansson, B. Selfconsistent supercell approach to alloys with local environment effects. Phys. Rev. B 85, 165140 (2012).
Whitfield, J. D., Love, P. J. & AspureGuzik, A. Computational complexity in electronic structure. Phys. Chem. Chem. Phys. 15, 397–411 (2013).
Cleri, F. & Rosato, V. Tightbinding potentials for transition metals and alloys. Phys. Rev. B 48, 22–33 (1993).
Usman, M., Broderick, C. A., Lindsay, A. & O’Reilly, E. P. Tightbinding analysis of the electronic structure of dilute bismide alloys of GaP and GaAs. Phys. Rev. B 84, 245202 (2011).
Mukherjee, S., MoránLópez, J. L., Kumar, V. & Bennemann, K. H. Electronic theory for surface segregation in Cu_{x}Ni_{1−x} alloy. Phys. Rev. B 25, 730–737 (1982).
Wahiduzzaman, M. et al. DFTB parameters for the periodic table: Part 1, electronic structure. J. Chem. Theory Comput. 9, 4006–4017 (2013).
Hams, A. & Raedt, H. D. Fast algorithm for finding the eigenvalue distribution of very large matrices. Phys. Rev. E 62, 4365–4377 (2000).
Barreteau, C. & Spanjaard, D. Electronic structure and total energy of transition metals from an spd tightbinding method: Application to surfaces and clusters of Rh. Phys. Rev. B 58, 9721–9731 (1998).
Dominggos, P. A few useful things to know about machine learning. Communications of the ACM 55, 78–87 (2012).
Kolb, B., Lentz, L. C. & Kolpak, A. M. Discovering charge density functionals and structureproperty relationships with PROPhet: A general framework for coupling machine learning and first principles methods. Sci. Rep. 7, 1–9 (2017).
Hill, J. et al. Materials science with largescale data and informatics: Unlocking new opportunities. MRS Bulletin 41, 399–409 (2017).
Kresse, G. & Joubert, D. From Ultrasoft pseudopotentials to the projector augmentedwave method. Phys. Rev. B 59, 1758–1775 (1999).
Kresse, G. & Furthmiiller, J. Efficiency of abinitio total energy calculations for metals and semiconductors using a planewave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 3, 3865–3868 (1996).
Hammer, B., Hansen, L. B. & Nørskov, J. K. Improved adsorption energetics within densityfunctional theory using revised PerdewBurkeErnzerhof functionals. Phys. Rev. B 59, 7413–7421 (1999).
Blochl, P. E. Projector augmentedwave. Phys. Rev. B 50, 17953–17979 (1994).
Acknowledgements
This work was supported by Creative Materials Discovery Program through the National Research Foundation of Korea (NRF2016M3D1A1021140). We acknowledge the financial supports of the Korea Institute of Science and Technology (Grant No. 2E28000). This work was supported by Samsung Research Funding & Incubation Center of Samsung Electronics under Project Number SRFCMA180103.
Author information
Authors and Affiliations
Contributions
B.C.Y. and S.S.H. conceived and designed the research. D.K. and C.K. provided theoretical support. B.C.Y. performed the research. B.C.Y., D.K., C.K., and S.S.H. analyzed the data. B.C.Y. and S.S.H. wrote the manuscript with feedback from all authors. S.S.H. managed the project.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yeo, B.C., Kim, D., Kim, C. et al. Pattern Learning Electronic Density of States. Sci Rep 9, 5879 (2019). https://doi.org/10.1038/s41598019422779
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598019422779
This article is cited by

Generalizing deep learning electronic structure calculation to the planewave basis
Nature Computational Science (2024)

Density Functional Theory Study of Removal of 4(((4(Phenyldiazenyl) Phenyl) Imino) Methyl) Benzoic Acid Azo Dye by Boron NitrideNanotube, to Investigate a Model for Industrial Wastewater Treatment
Water, Air, & Soil Pollution (2024)

A Comparative Study of Graphynes for Sensing of Gasses (SO2 & Acetylene) Using DFT Calculations and MD Simulations
Indian Journal of Microbiology (2024)

Featureaware unsupervised lesion segmentation for brain tumor images using fast data density functional transform
Scientific Reports (2023)

Density of states prediction for materials discovery via contrastive learning from probabilistic embeddings
Nature Communications (2022)