Unlocking phonon properties of a large and diverse set of cubic crystals by indirect bottom-up machine learning approach

Rodriguez, Alejandro; Lin, Changpeng; Shen, Chen; Yuan, Kunpeng; Al-Fahdi, Mohammed; Zhang, Xiaoliang; Zhang, Hongbin; Hu, Ming

doi:10.1038/s43246-023-00390-3

Published: 15 August 2023

Communications Materials volume 4, Article number: 61 (2023)

Abstract

Although first-principles-based anharmonic lattice dynamics is one of the most common methods to obtain phonon properties, it is impractical for high-throughput searches of target thermal materials. We develop an elemental spatial density neural network force field as a bottom-up approach to accurately predict atomic forces of ~80,000 cubic crystals spanning 63 elements. The primary advantage of our indirect machine learning model is the accessibility of phonon transport physics at the same level as first principles, allowing simultaneous prediction of comprehensive phonon properties from a single model. Training on 3182 first principles data and screening 77,091 unexplored structures, we identify 13,461 dynamically stable cubic structures with ultralow lattice thermal conductivity below 1 W m⁻¹ K⁻¹, among which 36 structures are validated by first principles calculations. We propose mean square displacement and bonding-antibonding as two low-cost descriptors to ease the demand for expensive first principles calculations when screening for ultralow thermal conductivity. Our model also quantitatively reveals the correlation between off-diagonal coherence and diagonal populations and identifies the distinct crossover from particle-like to wave-like heat conduction. Our algorithm is promising for accelerating the discovery of novel phononic crystals for emerging applications, such as thermoelectrics, superconductivity, and topological phonons for quantum information technology.

Introduction

The dynamics of phonons, the quanta of lattice vibrations, play a critical role in various technologies ranging from heat dissipation in modern semiconductors¹ to thermal barrier coatings in turbine blades². In general, applications involving heat transfer require either extreme phonon impedance or conduction, which is often the performance-limiting property. Another application of phonons is quantum communication, which can be realized with microwave-frequency phonons in devices such as acoustic resonators³ and is of significant interest for the generation of remote entanglement and the secure transmission of information. In recent years, materials scientists have sought new materials with excellent phonon properties through a combination of laboratory synthesis and computational prediction using density functional theory (DFT). Although the former requires decades of trial and error guided by the intuition of experienced chemists⁴, the latter has advanced the discovery of new materials and the understanding of microscopic phonon transport thanks to the availability of high-performance computers. Indeed, a tremendous number of previous studies in thermal science have been dedicated to bulk, interfacial, low-dimensional, and layered materials^5,6,7,8,9,10. Many investigations of thermal transport in materials have uncovered structure-thermal property relationships, including those arising from bond strength, structure, and chemistry^11,12,13,14,15,16,17. Additionally, unique phonon interactions that promote anharmonicity and lower the lattice thermal conductivity (κ_L) may arise in certain crystals, including rattler atoms^17,18,19, ferroelectric instability²⁰, and electron-phonon interactions^21,22. Notably, some studies also advance the theoretical description of phonon transport, e.g., the addition of fourth-order anharmonicity²³ and the temperature-dependent effective potential, which captures temperature effects on the interatomic force constants (IFCs)^12,24,25.

Despite the robust nature of DFT in predicting thermal transport properties, the explicit treatment of electronic degrees of freedom entails significant computational costs when potentially thousands of candidates must be screened for target phonon properties. Most previous studies are limited to one or a few materials, for which results can be obtained within a few days, depending on the available hardware resources. However, with the advent of materials genome databases such as the Materials Project⁴ and the Open Quantum Materials Database (OQMD)²⁶, the DFT evaluation of phase stability and properties of interest for thousands to millions of previously unexplored materials puts several years of delay on the synthesis of novel materials. This is especially magnified for phonon properties. For instance, κ_L, one of the most important phonon properties, is computationally demanding to obtain by DFT, because it requires calculations on a large number of supercells with different atomic displacements, which are then processed to give the IFCs for the Boltzmann transport equation (BTE) simulation²⁷.

In response, data-driven techniques such as machine learning (ML) have surfaced in materials science to address the demanding costs of DFT, effectively trading some accuracy for a significant speed-up. The basic assumption of ML-based models for predicting DFT-level properties is a finite cutoff, whereby atomic interactions beyond the cutoff are neglected. This allows the computational cost to scale linearly with the number of atoms, as opposed to the cubic scaling of DFT. With ML, prediction of target properties requires a physically informative set of inputs as the descriptors. For a sufficiently accurate model, descriptors should satisfy several requirements, including (a) a distinguishable representation for each system, (b) descriptiveness of the similarities and discrepancies between systems, (c) completeness to sufficiently differentiate systems, and (d) simplicity to ease calculation time²⁸. Given these requirements, several methods have been proposed over the past few years to confront computationally costly phonon properties. For example, progress has recently been made in the ML prediction of κ_L, either directly or indirectly. Direct prediction refers to an ML model, or several models in sequence, with κ_L as the final output. The majority of studies fall under this category. Several ML models have been trained on 110 half-Heusler compounds by compiling elemental, compound, and compound-elemental descriptors, obtaining high validation accuracy and revealing the bond distance as the most important descriptor for κ_L²⁹. Diamond-like materials were explored for ultra-high κ_L using a small training dataset through transfer learning of the three-phonon scattering channel volume, or P₃ for short³⁰. On the other hand, the indirect prediction approach first predicts lower-level properties that eventually lead to κ_L, including the atomic forces and the IFCs. 
To date, little research has been seen in this subarea. Notably, providing lower-level physics introduces several advantages over the direct method. Firstly, with the atomic forces and/or IFCs, one can compute the full phonon properties, e.g., phonon dispersions, temperature-dependent κ_L, and scattering rates. This allows for in-depth study of the phonon properties at the high-throughput level without needing to rely strictly on pure DFT. The temperature dependent κ_L is especially desirable for high-temperature applications whereas most studies involving ML focus only on room temperature. Secondly, because κ_L is not directly computed, variables involved in the BTE calculation may be modulated, such as the inclusion of higher order anharmonicity beyond 3rd order and off-diagonal contribution. Thirdly, atomic forces and IFCs are much more abundant than single κ_L values associated with each structure, allowing for improved training and potential transferability of information among diverse structures.

Here, we demonstrate a bottom-up approach by applying the Elemental Spatial Density Neural Network Force Field (Elemental-SDNNFF) to high-quality phonon property prediction for large materials databases. Previously, we applied the model to a smaller set of 11,866 half, quaternary, and full Heusler structures spanning 55 elements³¹. In this work, we expand the model to a more complex set of 77,091 cubic structures containing 16 structural prototypes and 63 chemical species. This is made possible by the model's flexibility in distinguishing many unique atomic environments for the high-throughput calculation of full phonon properties. Because the forces are provided, the dynamical stability of these structures can be determined from their predicted phonon dispersions, which is not possible in direct methods. The model is initially trained on a small subset of 3107 structures and is iteratively improved on a larger dataset of 77,091 structures with active learning. Data augmentation is incorporated, whereby equivalent atomic environments are rotated to provide a ~3× boost to the total number of atomic forces for training. The final model is then deployed to predict the complete phonon properties of the remaining stable structures, at speeds three orders of magnitude faster than full DFT calculations for systems containing more than 10² atoms, as shown in Supplementary Fig. S1. We then focus on high-quality phonon property prediction, addressing several major challenges for high-throughput material property prediction. Specifically, by predicting the atomic forces, harmonic properties (such as phonon dispersions, specific heat, and the third-order scattering channel volume) and anharmonic properties (such as the lattice thermal conductivity) are evaluated simultaneously with a single neural network model, while providing deep physical insight and agreement with DFT. 
In addition, previous models for Heusler structures were trained efficiently by nature of providing only three structural prototypes to the model. Here, although the structure diversity is significantly more complex, the model maintains prediction of meaningful atomic forces and corresponding phonon properties. Overall, our model is capable of growing and adapting to new datasets for exploration of previously unseen materials.
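
The rotation-based data augmentation mentioned above can be illustrated with a minimal sketch. The text does not state which rotations are applied, so the choice here (the three cyclic axis permutations, i.e., rotations by 0°, 120°, and 240° about the cubic [111] axis) is only an assumption that happens to give a 3× increase; the names `ROTATIONS`, `rotate`, and `augment` are hypothetical. The key point is that the neighbor positions and the force label must be rotated together.

```python
# Sketch of rotational data augmentation for force training data.
# ASSUMPTION: three proper rotations about [111]; the paper's actual
# rotation set is not specified in the text.

ROTATIONS = [
    ((1, 0, 0), (0, 1, 0), (0, 0, 1)),  # identity
    ((0, 0, 1), (1, 0, 0), (0, 1, 0)),  # (x, y, z) -> (z, x, y)
    ((0, 1, 0), (0, 0, 1), (1, 0, 0)),  # (x, y, z) -> (y, z, x)
]


def rotate(vec, rot):
    """Apply a 3x3 rotation matrix (nested tuples) to a 3-vector."""
    return tuple(sum(rot[i][j] * vec[j] for j in range(3)) for i in range(3))


def augment(neighbors, force):
    """Return rotated copies of one atomic environment and its force.

    neighbors: list of (element, (dx, dy, dz)) relative to the central atom.
    force:     (fx, fy, fz) acting on the central atom.
    """
    samples = []
    for rot in ROTATIONS:
        rotated_env = [(el, rotate(d, rot)) for el, d in neighbors]
        samples.append((rotated_env, rotate(force, rot)))
    return samples
```

Because forces transform as vectors, each rotated copy is a physically valid training sample that costs no additional DFT calculation.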

Results

Training dataset from active learning rounds

To develop an ML model for evaluating atomic forces, it is crucial to prepare a sufficiently large and diverse dataset of atomic environments. In theory, there are limitless combinations of atomic environments, considering the structural symmetries, chemistries, and displacements that one may encounter during the evaluation of atomic forces. This is especially true for random or stochastic displacement methods for generating IFCs, such as compressive sensing lattice dynamics (CSLD)³². As ML methods, specifically supervised learning, are interpolative by nature, selecting training or reference data to cover seemingly infinite possible atomic environments requires a human-free, self-informative approach for efficient model training from costly DFT calculations. The crystalline structures analyzed here are taken from the OQMD database^26,33 and are categorized into 16 prototypes spanning four cubic space groups (Table 1). Some structures are split from the pool for initial training and evaluated with DFT for their atomic forces. From DFT, the IFCs and subsequent phonon properties are also gathered for comparison. Accordingly, the remaining structures without DFT data are marked as “unexplored” and left for the model evaluation stage.

Table 1 Distribution of all cubic structures used for training and screening.

Figure 1 shows the overall workflow for generating the dataset in Table 1 and training the Elemental-SDNNFF model for phonon property prediction. First, structures from materials databases are gathered and filtered for low formation energy and energy above hull to increase the probability of thermodynamic stability. Then, supercells are perturbed by a small atomic displacement and the dataset is split into training and active learning structures. Thereafter, the training set is evaluated by DFT and a set, or committee, of models is trained on the corresponding forces. Here, we exploit the poor extrapolation capability of neural networks by evaluating untrained structures and comparing the forces predicted by the committee members. Unseen structures with high force variance across the committee indicate poor representation of the local atomic environments in the supercell and are proposed for the next round of training. These structures are evaluated by DFT and added to the training of the models, forming a closed loop. After several rounds, the models are deployed for force evaluation and phonon calculations of large materials databases. For more details about structure generation and the active learning procedure, refer to the “Methods” section.
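
The committee-based selection step in this workflow can be sketched as follows. This is only an illustration of the general query-by-committee idea, not the authors' implementation: the disagreement measure (largest per-atom standard deviation of the predicted force across models), the threshold, and the function names are assumptions.

```python
import statistics

# Sketch of committee-disagreement selection for active learning.
# ASSUMPTION: disagreement = max over atoms of the force standard deviation
# across committee members; the paper's exact criterion is in its Methods.


def force_disagreement(committee_forces):
    """committee_forces[m][a] = (fx, fy, fz) from model m for atom a."""
    n_atoms = len(committee_forces[0])
    worst = 0.0
    for a in range(n_atoms):
        variance = 0.0
        for c in range(3):
            values = [model[a][c] for model in committee_forces]
            variance += statistics.pvariance(values)
        worst = max(worst, variance ** 0.5)
    return worst


def select_for_dft(predictions, threshold):
    """Return ids of structures whose committee disagreement exceeds threshold.

    predictions: dict mapping structure id -> committee_forces.
    """
    return sorted(
        sid for sid, forces in predictions.items()
        if force_disagreement(forces) > threshold
    )
```

Structures returned by `select_for_dft` would be sent to DFT and added to the training set before the next round, closing the loop described above.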

**Fig. 1: Workflow for training Elemental-SDNNFF models.**

Prediction of phonon properties

To benchmark the performance of the trained model for phonon property prediction, we first examine the atomic force accuracy on a small subset of 400 untrained structures. We also compare the performance to CHGNet, which was recently proposed as a universal potential energy surface model³⁴. As shown in Supplementary Fig. S3, we find force root mean square errors (RMSE) of 29.3 and 121 meV/Å for the Elemental-SDNNFF and CHGNet models, respectively, showcasing the competitive performance of our model and its consistency with the training RMSE of 18.6 meV/Å. The errors relative to DFT for the phonon dispersion and the corresponding κ_L at 300 K are shown in Fig. 2. In Fig. 2a, the RMSE of the frequency is divided by the frequency range of the corresponding dispersion, which normalizes the data so that they can be merged into a single histogram, and is shown as a percentage. The average error is 1.88%, which is excellent, as seen in the insets of sample dispersions in Fig. 2a. In Fig. 2b, the log values of the DFT and predicted κ_L yield an R² of 0.89 with a mean absolute error (MAE) of 0.254 log(W m⁻¹ K⁻¹), meaning that the predicted κ_L is on average within a factor of 1.795 of the DFT value; this is illustrated by the structures lying within the dashed lines, which mark a factor of two from perfect agreement. The prediction capability is competitive with the MAE of 0.12 and R² of 0.87 reported for a random forest model trained on ~10³ materials³⁵. Additionally, 103 untrained structures with κ_L from DFT are evaluated by our model for validation and are compared in the inset of Fig. 2a. Of these structures, 67 are predicted to have κ_L < 1 W m⁻¹ K⁻¹, and 36 of them remain within the same range according to DFT. Notably, at the lower end of the κ_L range, the model tends to underpredict κ_L, increasingly so as the predictions approach the ultralow range of 0.1 W m⁻¹ K⁻¹. 
This is due to the highly sensitive nature of the phonon transport toward the extrema of the κ_L, specifically from the quality of atomic forces in displaced supercells and eventually the 3rd order IFCs³⁶. Nonetheless, the materials with prediction under 0.1 W m⁻¹ K⁻¹ are likely to remain within 1 W m⁻¹ K⁻¹ range and our model is effective at filtering candidates with ultralow κ_L.
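
The two error metrics used above can be written out explicitly. The sketch below is a plain restatement of the definitions in the text, with hypothetical function names: an MAE computed on log₁₀(κ_L) corresponds to a multiplicative factor of 10^MAE, and the dispersion error is the frequency RMSE divided by the frequency range.

```python
import math

# Sketch of the error metrics described in the text (function names are
# illustrative, not taken from the authors' code).


def log10_mae(predicted, reference):
    """Mean absolute error between log10 of predicted and reference values."""
    errors = [abs(math.log10(p) - math.log10(r))
              for p, r in zip(predicted, reference)]
    return sum(errors) / len(errors)


def typical_factor(mae_log10):
    """Multiplicative deviation implied by an MAE on the log10 scale."""
    return 10.0 ** mae_log10


def normalized_rmse_percent(pred_freqs, ref_freqs):
    """Frequency RMSE divided by the reference frequency range, in percent."""
    n = len(ref_freqs)
    rmse = math.sqrt(
        sum((p - r) ** 2 for p, r in zip(pred_freqs, ref_freqs)) / n
    )
    return 100.0 * rmse / (max(ref_freqs) - min(ref_freqs))
```

For the reported MAE, `typical_factor(0.254)` evaluates to roughly 1.795, which is the factor quoted in the text.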

**Fig. 2: Summary of phonon dispersion and lattice thermal conductivity predictions.**

The advantage of our bottom-up ML approach for phonons stems from the wealth of information that standard phonon calculation packages produce when provided with a set of predicted atomic forces. By default, the phonon frequencies and scattering matrix elements required for the iterative BTE solution are computed in advance. From the phonon frequencies, quantities such as the speed of sound, the constant volume heat capacity, and the three-phonon scattering phase space may be readily computed. Here, we compare these properties from our neural network model with those from DFT calculations to better understand the underlying contributions to the predicted κ_L. The constant volume heat capacity (c_v) is directly involved in computing κ_L, along with the phonon group velocities and scattering rates. Additionally, the speed of sound (V_s) is a partial representative of the group velocities for long-wavelength acoustic modes in crystals²⁰. The three-phonon scattering phase space (P₃) is a quantitative measure of the number of three-phonon scattering channels. Unlike c_v and V_s, a higher P₃ is indicative of larger scattering rates and thus lower κ_L³⁰. However, akin to c_v and V_s, P₃ requires only the second-order IFCs and is therefore computationally inexpensive once the model is trained. Finally, the mean square displacement (MSD) of vibrating atoms, usually observed in finite-temperature molecular dynamics, may also be computed from the phonon frequencies and eigenmodes⁹. To describe each structure with a single value, only the maximum MSD among all atoms in the primitive cell is assigned.
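
To make the link between phonon frequencies and these harmonic quantities concrete, the sketch below evaluates the textbook harmonic expressions for a single mode: the heat capacity k_B x² eˣ/(eˣ − 1)² with x = hν/(k_B T), and the mean square displacement ħ/(2mω) coth(ħω/2k_B T) of one quantum oscillator. It is a simplification under stated assumptions: in a crystal, c_v is a sum of the mode term over a Brillouin-zone mesh, and the atomic MSD additionally weights each mode by its eigenvector, which is omitted here. Function names are hypothetical.

```python
import math

H = 6.62607015e-34          # Planck constant, J s
HBAR = H / (2.0 * math.pi)  # reduced Planck constant, J s
KB = 1.380649e-23           # Boltzmann constant, J/K
AMU = 1.66053906660e-27     # atomic mass unit, kg


def mode_heat_capacity(freq_thz, temperature):
    """Harmonic heat capacity of one phonon mode, in J/K."""
    if freq_thz <= 0.0 or temperature <= 0.0:
        return 0.0
    x = H * freq_thz * 1e12 / (KB * temperature)
    # Written with exp(-x) to avoid overflow at low temperature.
    return KB * x * x * math.exp(-x) / math.expm1(-x) ** 2


def oscillator_msd(freq_thz, mass_amu, temperature):
    """Mean square displacement of a single harmonic oscillator, in m^2.

    ASSUMPTION: one isolated mode; eigenvector weights are not included.
    """
    omega = 2.0 * math.pi * freq_thz * 1e12
    y = HBAR * omega / (2.0 * KB * temperature)
    return HBAR / (2.0 * mass_amu * AMU * omega) / math.tanh(y)
```

Both functions show the qualitative behavior discussed in the text: low-frequency (soft) modes carry the full classical heat capacity k_B and produce large displacements, since the MSD grows as 1/ω² in the classical limit.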

In Fig. 3, the comparison of c_v, V_s, P₃, and maximum MSD between the neural network model and DFT is shown with the corresponding R² and MAE values. Exceptional agreement is found for c_v, followed by V_s, P₃, and log(max MSD). The mode-weighted global property c_v takes the mode-dependent phonon frequency as a direct input and is summed over a dense sampling of the Brillouin zone; given the small error in the phonon frequencies, it expectedly has the best agreement with DFT³⁷. The maximum MSD is constructed similarly and thus also agrees well with DFT. Although V_s is also computed directly from the dispersions, it requires the gradient of the phonon frequency with respect to the wave vector and is consequently more sensitive to the predicted atomic forces than c_v. Finally, P₃ also uses the phonon frequencies directly but involves a counting of three-phonon collisions that satisfy energy and momentum conservation. In other words, the error propagated from the atomic forces into the phonon frequencies is compounded, resulting in the largest error among the harmonic phonon properties considered. Interestingly, while Fig. 3a–c for the most part show an even spread of error, the scatter plot for the maximum MSD in Fig. 3d shows increasing disagreement at higher maximum MSD. This is because a higher maximum MSD, corresponding to softer phonon modes, usually indicates lower κ_L, and in this regime the second-order IFCs are more sensitive to the atomic displacements and corresponding forces. In such a case, we anticipate that increasing the atomic displacements can better capture the anharmonicity and hence the potential energy surfaces.

**Fig. 3: Prediction of harmonic properties compared with density functional theory.**

Quantification of ultralow κ_L with predicted properties

Out of the 77,091 cubic materials set aside for evaluation, 27,059 structures are predicted by our trained Elemental-SDNNFF model to have no imaginary phonon frequencies and are thus potentially dynamically stable. These structures are then evaluated to predict the full phonon properties, including c_v, V_s, P₃, maximum MSD, and κ_L. In the previous section, we showed how the trained model reproduces DFT-level IFCs while maintaining a speed-up on the order of ~10³. This was made possible by the initial training set and subsequent active learning iterations, which generated millions of data points to fine-tune the model for handling many structures and chemistries. Despite replacing DFT with a machine-learned model, the process of computing anharmonic IFCs and then iteratively solving the BTE is still time consuming for several tens of thousands of hypothetical structures. Therefore, given the large set of phonon properties evaluated by our model, it is desirable to quantify trends of κ_L with structural and harmonic properties in order to search among materials with known dynamical stability. Previous studies suggest that κ_L is strongly correlated with several physical parameters, including the volume of the unit cell V_cell³⁵, specific heat capacity c_v³⁸, sound velocity V_s¹⁵, three-phonon scattering phase space volume P₃³⁰, and thermal MSD³⁹. Hence, we have experimented with linear combinations of c_v, V_s, P₃, max MSD, and V_cell to correlate with κ_L. We found that the max MSD by itself performs best as a descriptor for κ_L of crystals. The reason is most likely the major contribution of harmonic phonons to the thermal MSD, and through it to κ_L, compared with the other parameters listed above. Additionally, the MSD may be computed as a function of temperature and is therefore more useful for observing temperature-dependent trends. A generally inverse-linear relationship is observed between log κ_L and log(max MSD) (Fig. 4). 
Note that log(max MSD) is normalized here to the range [0, 1] based on values found from the DFT set for ease of comparison. Figure 4a provides evidence of linearity through comparison of κ_L and the max MSD. The fitted red line shows a decreasing trend of κ_L with increasing max MSD. Structures with extremely high maximum MSD are indicative of rattling atoms, for which strong phonon-phonon scattering and ultralow κ_L are prevalent³⁹. Given the agreement of the max MSD between DFT and predictions in Fig. 3d, Fig. 4b shows the predictions of both κ_L and maximum MSD for 25,901 unexplored structures out of the pool of 27,059 stable structures; the remaining structures failed in the BTE calculations, so their κ_L is not plotted. Again, κ_L decreases with increasing descriptor value. We do notice that the newly fitted blue line has a steeper slope than the red line from DFT (also shown in the same plot for comparison), and the difference between the two lines grows with increasing maximum MSD. The most probable cause is the underprediction of κ_L at the lower extreme (Fig. 2b) and the increased error in the MSD at the higher extreme (Fig. 3d). Although the ultralow κ_L may be underestimated on the log scale, these predictions remain highly beneficial for quickly marking structures with potential as thermal insulators. To quickly filter ultralow κ_L materials, a maximum MSD threshold is set such that the value of the fitted line is 1.795 W m⁻¹ K⁻¹. This value is chosen deliberately, knowing that predicted values of κ_L are within a factor of 1.795 of the DFT value, which aids the later filtration of structures below 1 W m⁻¹ K⁻¹. As such, the maximum MSD filter is set to 0.076 Å², or 0.464 on the normalized log(max MSD) plot. Specifically, we found 9306 total structures with normalized log(max MSD) higher than 0.464. Out of these structures, the κ_L of 8873 (95.4%) structures is less than 1 W m⁻¹ K⁻¹. 
For normalized log(max MSD) less than 0.464, out of 16,596 structures, the κ_L of 4590 is less than 1 W m⁻¹ K⁻¹. This means that the filter captures 66% (8873 out of 13,461) of the structures with κ_L less than 1 W m⁻¹ K⁻¹. Thus, the maximum MSD is a reliable descriptor for indicating highly unique structures with out-of-trend values of atomic displacement and correspondingly ultralow κ_L. The critical maximum MSD value of 0.076 Å² may be used in future work to identify potential candidates for thermal insulators among cubic structures.
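
The two percentages quoted above answer different questions: how many structures passing the MSD filter really have low κ_L (precision), and how many of all low-κ_L structures the filter finds (recall). The sketch below computes both; the function names are hypothetical, and `filter_stats` expects a list of (max MSD, κ_L) pairs rather than the paper's actual data.

```python
# Sketch of evaluating a max-MSD threshold as a screening filter.
# Function names are illustrative; no data from the paper is embedded here.


def filter_stats(records, msd_cut, kappa_cut=1.0):
    """Return (precision, recall) of the rule 'max MSD > msd_cut'.

    records: iterable of (max_msd, kappa_l) pairs for individual structures.
    """
    records = list(records)
    above = [(m, k) for m, k in records if m > msd_cut]
    low_kappa = [(m, k) for m, k in records if k < kappa_cut]
    hits = [(m, k) for m, k in above if k < kappa_cut]
    precision = len(hits) / len(above) if above else 0.0
    recall = len(hits) / len(low_kappa) if low_kappa else 0.0
    return precision, recall


def stats_from_counts(n_above, n_hits, n_low_total):
    """Same quantities computed from aggregate counts."""
    return n_hits / n_above, n_hits / n_low_total
```

With the counts given in the text, `stats_from_counts(9306, 8873, 13461)` returns a precision of about 0.95 and a recall of about 0.66, so the threshold is highly selective but misses roughly a third of the ultralow-κ_L structures.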

**Fig. 4: Results of the thermal mean squared displacements against the lattice thermal conductivity.**

The rattling effect has been shown to induce large MSD in many systems. To identify elements with a high probability of acting as rattler atoms, we plot the average MSD of each element across the pool of 25,901 structures in Supplementary Fig. S4. Hydrogen and all alkali metal elements, including Li, Na, K, Rb, and Cs, have the highest average MSD among all 63 elements covered here. Some alkaline earth metal elements, including Mg, Ca, Sr, and Ba, have medium MSD. Interestingly, halogen elements including Cl, Br, and I also possess high MSD. N stands out among the nonmetallic elements, with the next largest being C, P, and Si. Tl also stands out as a semimetallic element next to Pb and Sn with significantly lower MSD. Metallic elements including Hg, Cd, Ag, and Au are the highest in their category, although their MSD is much lower than that of their alkali metal counterparts.

To visualize the spread of predicted κ_L in the dataset, Supplementary Fig. S5 displays the t-distributed stochastic neighbor embedding (t-SNE) using Elemental-SDNNFF structure input vectors. For simplicity, only one point per structure is used, corresponding to a single Elemental-SDNNFF vector centered at the unit cell rather than on a per-atom basis. In Supplementary Fig. S5a, the distribution of space groups shows an overlap of structures with space group numbers 216 and 225, whereas a majority of structures with space group numbers 221 and 227 form visible clusters. This is sensible given that space group 216 differs from space group 225 by just a vacant lattice site (ABC vs. ABC₂) or a different element on the same lattice site (ABCD vs. ABC₂). Supplementary Fig. S5b focuses on the predicted κ_L, where observable regions of thermally insulating materials (blue) are highly contrasted with thermally conductive materials (red). Mainly, the upper left of the figure contains a mixture of space group 216 and 225 structures with ultralow κ_L, with some additional blue regions along the bottom and top outer edges mostly corresponding to 221 and 225. This is further manifested in Supplementary Fig. S5c, where regions are instead highlighted by the predicted normalized log(max MSD). To highlight the relationship between κ_L and max MSD, we subtract the value from unity to match the properties based on color. As seen from the comparison between Supplementary Fig. S5b and c, both figures form nearly identical structure mappings of the predicted κ_L and normalized log(max MSD) values, supporting their strong correlation. Additionally, the congregation of certain out-of-trend structures with extremely high or low κ_L indicates a correlation between the Elemental-SDNNFF input vector and phonon conductors and insulators, suggesting a strong structure-property relationship for phonon transport. 
Overall, the t-SNE plots encompass the wide range of unique structures and physics manifested when applying machine learned atomic force fields such as the Elemental-SDNNFF.

Insight from bonding and anti-bonding analysis

In the previous section, we related thermal displacements to κ_L but did not discuss the effect of chemical bonding. Here, we further analyze our predicted structures with the Crystal Orbital Hamilton Population (COHP)⁴⁰ to quantify the contributions of the bonding and antibonding states. To assign single values of bonding and antibonding to each structure, we integrate the COHP curves for each atomic pair, as evaluated by the LOBSTER code, and take the average⁴¹. Figure 5 displays the resulting bonding and antibonding values, colored by the log of the predicted κ_L to reveal trends. Notably, at low bonding values (e.g., <200) and high antibonding values (e.g., >1), only κ_L < 3 W m⁻¹ K⁻¹ exists. This region combines low interatomic bonding strength and high phonon anharmonicity, resulting in ultralow κ_L. Our observation that high antibonding indicates strong phonon anharmonicity is consistent with recent studies using full DFT calculations on other systems^16,42,43,44. On the other hand, the high bonding (>200) region seems to contain all ranges of κ_L. This is understandable from a physical point of view: κ_L is governed by two major mechanisms, interatomic bonding strength and phonon anharmonicity, so the bonding and antibonding contributions may compete. It is also worth pointing out that COHP is a low-cost indicator of κ_L, requiring only DFT calculations on primitive cells after structure optimization, which is very promising for filtering structures with anomalously low κ_L.
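
One way to turn COHP curves into the two scalar descriptors is sketched below. This is an assumption-laden illustration, not the authors' procedure: it takes −COHP(E) sampled on an ascending energy grid (the common plotting convention, where positive values are bonding and negative values antibonding), integrates the positive and negative parts separately over occupied states up to the Fermi level with the trapezoidal rule, and averages over atomic pairs. The integration limits, sign handling, and function names are all hypothetical, and reading LOBSTER output files is not shown.

```python
# Sketch: bonding / antibonding descriptors from -COHP curves.
# ASSUMPTIONS: ascending energy grid, -COHP sign convention, integration
# over occupied states only. The paper's exact recipe is not given here.


def integrate_cohp(energies, minus_cohp, e_fermi=0.0):
    """Return (bonding, antibonding) integrals for one atomic pair."""
    bonding = 0.0
    antibonding = 0.0
    for i in range(len(energies) - 1):
        e0, e1 = energies[i], energies[i + 1]
        if e1 > e_fermi:
            break  # stop at the Fermi level
        width = e1 - e0
        y0, y1 = minus_cohp[i], minus_cohp[i + 1]
        # Trapezoidal rule on the positive and negative parts separately.
        bonding += 0.5 * width * (max(y0, 0.0) + max(y1, 0.0))
        antibonding += 0.5 * width * (max(-y0, 0.0) + max(-y1, 0.0))
    return bonding, antibonding


def structure_descriptor(pair_curves, e_fermi=0.0):
    """Average the pair integrals; pair_curves is a list of (E, -COHP)."""
    totals = [integrate_cohp(e, c, e_fermi) for e, c in pair_curves]
    n = len(totals)
    return (sum(b for b, _ in totals) / n, sum(a for _, a in totals) / n)
```

Each structure then maps to one (bonding, antibonding) point, which is the kind of two-dimensional map shown in Fig. 5.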

**Fig. 5: Bonding-antibonding map as related to lattice thermal conductivity.**

Off-diagonal thermal transport analysis

Recently, two different mechanisms for phonon transport in solids have been discussed^45,46. In crystalline materials, heat carriers propagate and scatter in a particle-like manner, as described by the Peierls-Boltzmann transport picture for phonon wave-packet dynamics. Such populations have a well-defined energy (frequency) and can therefore be interpreted as particle-like excitations with a well-defined wave vector (q) and mode index (s); the corresponding lattice thermal conductivity is denoted κ_P. In contrast, in glasses, heat carriers behave in a wave-like manner, hopping via Zener-like tunneling between quasi-degenerate vibrational eigenstates, as described by the Allen-Feldman theory. Such coherences do not have an absolute energy and cannot be related to a single eigenstate. Rather, they describe oscillations between pairs of eigenstates and correspond to an evolution that does not preserve the nature of the single-particle excitation. Very recently, the contribution of such coherences to the overall lattice thermal conductivity, which describes the wave-like interband tunneling of phonons and is dubbed two-channel thermal transport or the off-diagonal contribution of the heat-flux operator, has been theoretically formulated and experimentally validated in ultralow κ_P materials^47,48,49. With the phonon properties of a large set of crystals predicted herein, it is natural to explore the two-channel thermal transport behavior of these materials.

The contribution of the off-diagonal components ($\kappa_c^{\alpha\beta}$) to the total thermal conductivity is obtained by^48,50:

$$\begin{aligned} \kappa_c^{\alpha\beta} ={} & \frac{\hbar^2}{k_B T^2 N \Omega} \sum_{\boldsymbol{q}} \sum_{j \ne j'} \frac{\omega_j(\boldsymbol{q}) + \omega_{j'}(\boldsymbol{q})}{2} V_{j,j'}^{\alpha}(\boldsymbol{q}) V_{j,j'}^{\beta}(\boldsymbol{q}) \\ & \times \frac{\omega_j(\boldsymbol{q}) n_j(\boldsymbol{q}) \left(n_j(\boldsymbol{q}) + 1\right) + \omega_{j'}(\boldsymbol{q}) n_{j'}(\boldsymbol{q}) \left(n_{j'}(\boldsymbol{q}) + 1\right)}{4 \left(\omega_j(\boldsymbol{q}) - \omega_{j'}(\boldsymbol{q})\right)^2 + \left(\Gamma_j(\boldsymbol{q}) + \Gamma_{j'}(\boldsymbol{q})\right)^2} \times \left(\Gamma_j(\boldsymbol{q}) + \Gamma_{j'}(\boldsymbol{q})\right) \end{aligned}$$

(1)

where ħ, k_B, T, N, and Ω are the reduced Planck constant, Boltzmann constant, absolute temperature, the number of q-points sampled in reciprocal space, and volume of the unit cell, respectively. ω_j (q), Γ_j (q), and n_j (q) are the frequency, linewidth, and equilibrium Bose-Einstein distribution function for wave vector q and branch j, respectively. $V_{j,j'}^{\alpha}(\boldsymbol{q})$ denotes the off-diagonal elements (j ≠ j′) of the velocity matrix along direction α, and can be calculated by:

$$V_{j,j'}^{\alpha}(\boldsymbol{q})=\frac{1}{\omega_{j}(\boldsymbol{q})+\omega_{j'}(\boldsymbol{q})}\left\langle e_{j}(\boldsymbol{q})\right|\nabla_{q}^{\alpha}\boldsymbol{D}(\boldsymbol{q})\left|e_{j'}(\boldsymbol{q})\right\rangle$$

(2)

where e_j(q) and D(q) are the phonon eigenvector and the dynamical matrix, respectively. In this work, we modified the original ShengBTE code⁵¹ to output the off-diagonal elements of the velocity matrix, from which the off-diagonal thermal conductivity is calculated using Eq. (1).
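
To make the structure of Eq. (1) concrete, the following is a minimal NumPy sketch that evaluates it term by term for arrays of frequencies, linewidths, and velocity-matrix elements. It is not the modified ShengBTE routine used in this work. The function names, array layout, SI units, and toy input values are all assumptions made for illustration, and the velocity-matrix elements are treated as real numbers.

```python
import numpy as np

HBAR = 1.054571817e-34  # reduced Planck constant (J s)
K_B = 1.380649e-23      # Boltzmann constant (J/K)


def bose_einstein(omega, temperature):
    """Equilibrium occupation n_j(q) for angular frequencies omega (rad/s)."""
    return 1.0 / np.expm1(HBAR * omega / (K_B * temperature))


def kappa_coherence(omega, gamma, vel, temperature, volume):
    """Evaluate Eq. (1) as written in the text (hypothetical helper).

    omega, gamma : (Nq, Nb) frequencies and linewidths (rad/s), gamma > 0
    vel          : (Nq, Nb, Nb, 3) velocity-matrix elements (m/s), taken as real
    volume       : unit-cell volume (m^3)
    Returns the (3, 3) tensor kappa_c in W m^-1 K^-1.
    """
    n_q, n_b = omega.shape
    n = bose_einstein(omega, temperature)
    heat = omega * n * (n + 1.0)                      # omega_j n_j (n_j + 1)

    w_sum = omega[:, :, None] + omega[:, None, :]     # omega_j + omega_j'
    w_diff = omega[:, :, None] - omega[:, None, :]    # omega_j - omega_j'
    g_sum = gamma[:, :, None] + gamma[:, None, :]     # Gamma_j + Gamma_j'
    h_sum = heat[:, :, None] + heat[:, None, :]

    weight = 0.5 * w_sum * h_sum * g_sum / (4.0 * w_diff**2 + g_sum**2)
    weight = weight * (1.0 - np.eye(n_b))             # drop the j == j' terms

    prefactor = HBAR**2 / (K_B * temperature**2 * n_q * volume)
    return prefactor * np.einsum("qjk,qjka,qjkb->ab", weight, vel, vel)


# Toy input: one q-point, three branches, only branches 0 and 1 coupled along x.
omega = 2 * np.pi * 1e12 * np.array([[1.0, 1.2, 3.0]])
gamma = 2 * np.pi * 1e11 * np.ones_like(omega)
vel = np.zeros((1, 3, 3, 3))
vel[0, 0, 1, 0] = vel[0, 1, 0, 0] = 500.0
kappa = kappa_coherence(omega, gamma, vel, temperature=300.0, volume=1e-28)
print(kappa[0, 0])
```

The Lorentzian-like factor in `weight` is largest when two branches are closer in frequency than the sum of their linewidths, which is why quasi-degenerate bands dominate κ_c.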

To calculate the lattice thermal conductivity contributed by coherences (κ_c), we randomly chose 1000 structures from each of three pools of materials with low (<1 W m⁻¹ K⁻¹), medium (1–10 W m⁻¹ K⁻¹), and high (>10 W m⁻¹ K⁻¹) κ_P, as obtained from the traditional BTE solution, and reused the parameters and force constants from the model prediction step. Of these, 869 low-, 995 medium-, and 999 high-κ_P BTE calculations converged successfully, with average off-diagonal ratios κ_c/κ_P of 2.59, 5.02 × 10⁻², and 4.35 × 10⁻³, respectively. The κ_c/κ_P ratios predicted by our Elemental-SDNNFF model agree well with the DFT results (Supplementary Fig. S6), which again demonstrates the accuracy of our model. In Fig. 6, we plot the percentage of the off-diagonal contribution, defined as κ_c/κ_total with κ_total = κ_c + κ_P. Good agreement is observed between the Elemental-SDNNFF predictions on unseen data and the DFT results from the training data. A strong, approximately linear correlation is found between the percentage of κ_c and κ_total. Generally speaking, the lower κ_P is, the higher the percentage of the off-diagonal contribution κ_c tends to be. For materials with extremely low κ_P, e.g., κ_P ~0.1 W m⁻¹ K⁻¹, κ_c can contribute as much as 50% or even 70% of κ_total, showing that coherences can play a dominant role even at room temperature. A similar phenomenon has been found in perovskite CsPbBr₃ at room temperature and in La₂Zr₂O₇ at high temperature^48,49. We also note that, for the same κ_total, the κ_c contribution can differ considerably among structures, leading to a very wide spread of the log-scale percentage contribution in Fig. 6. This implies that the precise off-diagonal contribution percentage depends on the detailed phonon band structure and mode-level phonon anharmonicity of each material, rather than on the absolute value of κ_P alone.

Figure 6 also shows that a considerable number of structures exhibit dominant wave-like heat conduction, rather than the particle-like conduction predicted by the traditional Peierls BTE. Our model clearly identifies the crossover from particle-like to wave-like heat conduction (dashed line in Fig. 6).
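
Because κ_total = κ_c + κ_P, the percentage plotted in Fig. 6 is a simple function of the ratio r = κ_c/κ_P quoted above. As a worked illustration (applying the conversion to the reported pool-averaged ratios is an assumption made here for orientation only, since the average of per-structure percentages need not coincide with it):

$$\frac{\kappa_{c}}{\kappa_{\mathrm{total}}}=\frac{\kappa_{c}}{\kappa_{c}+\kappa_{P}}=\frac{r}{1+r},\qquad r\equiv\frac{\kappa_{c}}{\kappa_{P}}$$

$$r=2.59\;\Rightarrow\;\frac{r}{1+r}\approx 0.72,\qquad r=5.02\times 10^{-2}\;\Rightarrow\;\frac{r}{1+r}\approx 0.048,\qquad r=4.35\times 10^{-3}\;\Rightarrow\;\frac{r}{1+r}\approx 0.0043$$

Under this definition, equal contributions from the two channels (r = 1) correspond to 50%, so any structure with κ_c > κ_P lies above the 50% level.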

**Fig. 6: The off-diagonal contribution against the total lattice thermal conductivity at room temperature from two sources of data.**

To examine the mode-level contributions to κ_P and κ_c, three low-κ_P materials with different coherence contributions (NaKAs, KIrCs₂Cl₆, and CuPdSr₂, with space group nos. 216, 225, and 216, respectively) were chosen from the untrained pool for comparison. The frequency-dependent relaxation time is plotted in Fig. 7. The materials are listed in order of increasing κ_P, with (predicted, DFT-validated) values of (0.037, 0.075), (0.101, 0.127), and (0.68, 0.935) W m⁻¹ K⁻¹, respectively. The corresponding (predicted, DFT) κ_c values are (0.222, 0.205), (0.252, 0.226), and (0.07289, 0.08282) W m⁻¹ K⁻¹, respectively. Comparing the predicted (left column in Fig. 7) and DFT (right column in Fig. 7) results shows that the general trends of relaxation time against frequency are captured, again confirming the accuracy of our ML model. Nevertheless, the κ_P contributions in the low-frequency range are visibly more pronounced, in both marker size and density, in the DFT results (Fig. 7b, d, f) than in the predicted results (Fig. 7a, c, e). This agrees with the previously observed underprediction of κ_P for structures with κ_P < 1 W m⁻¹ K⁻¹. In addition, NaKAs and KIrCs₂Cl₆ have similar κ_c values, which arise from high mode-level κ_c contributions across the entire frequency range. For CuPdSr₂, the vast majority of modes have a low coherence contribution, which reduces κ_c. Here the κ_c contributions are overshadowed by the dominant mode-level κ_P throughout the entire frequency range, leading to the lower κ_c/κ_total ratio seen in Fig. 6.
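
As a quick arithmetic check on the values quoted above, the short Python sketch below converts each reported (κ_P, κ_c) pair into the coherence fraction κ_c/κ_total used in Fig. 6. The numbers are copied from the text. The dictionary layout and the function name `coherence_fraction` are illustrative choices, not part of the original workflow.

```python
# (kappa_P, kappa_c) in W m^-1 K^-1, as quoted in the text.
reported = {
    "NaKAs":     {"predicted": (0.037, 0.222),  "DFT": (0.075, 0.205)},
    "KIrCs2Cl6": {"predicted": (0.101, 0.252),  "DFT": (0.127, 0.226)},
    "CuPdSr2":   {"predicted": (0.68, 0.07289), "DFT": (0.935, 0.08282)},
}


def coherence_fraction(kappa_p, kappa_c):
    """Fraction of the total conductivity carried by coherences."""
    return kappa_c / (kappa_c + kappa_p)


fractions = {
    name: {source: coherence_fraction(*values) for source, values in pairs.items()}
    for name, pairs in reported.items()
}

for name, by_source in fractions.items():
    print(f"{name:10s} predicted {by_source['predicted']:.1%}   DFT {by_source['DFT']:.1%}")
```

For NaKAs and KIrCs₂Cl₆ the fraction exceeds one half in both the predicted and the DFT data, whereas for CuPdSr₂ it is roughly 10% (predicted) and 8% (DFT), consistent with the qualitative description above.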

**Fig. 7: The frequency-dependent relaxation time for NaKAs, KIrCs₂Cl₆, and CuPdSr₂.**

Conclusion

We demonstrate a bottom-up machine learning approach that accurately and efficiently predicts comprehensive phonon properties of ~80,000 crystalline materials by evaluating atomic forces with sufficient DFT representation. The query-by-committee active learning scheme allows iterative improvement of the models through simultaneous prediction of atomic forces in the unseen pool of displaced structures. The final model is deployed to construct IFCs for both observed and unseen structures. Given the good accuracy for phonon properties such as vibrational frequencies and κ_L, we exploit the abundant pool of 25,901 structures that are predicted to be thermodynamically stable by our model and identify thermal insulators with simple descriptors. MSD and bonding/antibonding states are two computationally efficient descriptors for screening ultralow κ_L. Given the high accuracy for harmonic-dominated properties such as the maximum MSD, we also propose our model as a means of generating high-quality data for direct prediction of phonon properties by other machine learning models, for example to predict materials with extremely high or low lattice thermal conductivity. Our model also provides physical insight into the off-diagonal contribution to overall phonon transport, demonstrating the general trend of high coherence contributions to the total thermal conductivity in low-κ_L structures and the crossover from particle-like to wave-like heat conduction in diverse structures. The precise coherence contribution percentage depends on the detailed phonon band structure and mode-level phonon anharmonicity of each material, rather than on the absolute value of κ_L alone. Our algorithm can be extended and adapted to even larger sets of unseen materials and is promising for accelerating the discovery of crystals for emerging phonon-mediated applications.

Methods

Training dataset generation

We first filter structures by elements, formation energy, and energy above hull, and then optimize the remaining structures (blue path in Fig. 1). The purpose of the filtering is to reduce the structure count and to increase the likelihood of stability in subsequent DFT calculations of phonon dispersions. After structure optimization, we replicate the primitive cells into supercells and displace all atoms by a fixed 0.03 Å in random directions to create diversity in the atomic environments and facilitate training; this is also standard practice for IFC calculations. Additionally, for IFC fitting with CSLD, such random displacements help mitigate the inherently poor energy conservation of direct force field models, as demonstrated in previous works^36,52. Specifically, because forces are predicted directly, IFC fitting with methods such as the finite difference method (FDM) does not guarantee zero or near-zero forces for atoms at equilibrium. Such atoms are abundant in FDM, where only one or two atoms are displaced for IFCs up to third order, which causes significant disagreement in the force-sensitive lattice thermal conductivity. Stochastic methods such as CSLD mitigate the energy conservation issue by displacing all atoms, which produces a noise-canceling effect for IFCs subsequently fitted to the predicted forces. Afterwards, we randomly select a small fraction of the displaced supercell structures to serve as the initial training data. These are passed to DFT calculations, and the resulting ground-truth atomic forces and local atomic environments are passed to five initialized models with similar architecture but different weights. Here, we used a small subset of existing DFT data previously calculated for phonon properties. Although we do not introduce techniques for choosing the initial training structures, we recommend approaches such as principal component analysis (PCA), which has been shown to improve model representation, especially for out-of-trend structures⁵³.
After the DFT step, we perform data augmentation in which atomic environments are rotated according to nearest-neighbor rules, generating a ~2–3× increase in the existing dataset (see schematic in Supplementary Fig. S2). This enhances the diversity of the training dataset beyond the already abundant N × D dataset, where N is the number of atoms per simulation (i.e., per supercell) and D is the number of DFT simulations. In future work, we plan to introduce rotational equivariance into the Elemental-SDNNFF model so that rotated atomic environments are handled automatically, without the need for nearest-neighbor rules and rotation matrices⁵⁴. This should substantially improve training efficiency by reducing the training set size while maintaining the current rotational covariance of the force field.
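
The random-displacement step described in this section (every atom shifted by a fixed 0.03 Å along an independent random direction) can be sketched in a few lines of NumPy. This is an illustrative sketch only: the function name, the use of Cartesian coordinates in Å, the toy simple-cubic supercell, and the neglect of periodic wrapping are all assumptions, and the actual supercell generation used in this work may differ.

```python
import numpy as np


def displace_all_atoms(positions, amplitude=0.03, rng=None):
    """Shift every atom by `amplitude` (in Å) along its own random direction.

    positions : (n_atoms, 3) Cartesian coordinates in Å (hypothetical layout).
    """
    rng = np.random.default_rng() if rng is None else rng
    # Normalized Gaussian vectors are uniformly distributed on the unit sphere.
    directions = rng.normal(size=positions.shape)
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)
    return positions + amplitude * directions


# Hypothetical 2 x 2 x 2 simple-cubic supercell with a 4 Å lattice constant.
grid = np.arange(2) * 4.0
supercell = np.array([[x, y, z] for x in grid for y in grid for z in grid])

displaced = displace_all_atoms(supercell, rng=np.random.default_rng(42))
shift_lengths = np.linalg.norm(displaced - supercell, axis=1)
print(shift_lengths)  # every entry equals the fixed amplitude
```

Because every atom moves, no atom remains at its equilibrium position, which is the property the text relies on when contrasting this scheme with FDM.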

Active learning details

Taking advantage of the interpolative nature of neural networks, the five models serve as a committee that judges the atomic environments of the remaining untrained (active learning) structures. High uncertainty in the predicted forces of a structure indicates poor representation of the corresponding atomic environments and DFT forces. The uncertainty is evaluated by⁵⁵:

$$\varepsilon_{i}=\sqrt{\left\langle \left\| f_{i}^{m}-\bar{f}_{i}\right\|^{2}\right\rangle },\quad \bar{f}_{i}=\left\langle f_{i}^{m}\right\rangle$$

(3)

where ε_i is the uncertainty indicator for atom i, $f_{i}^{m}$ is the force predicted by model m, and $\bar{f}_{i}$ is the average force across all models in the committee. We take max(ε_i) for each displaced supercell and, among all supercells associated with a unique structure, choose the one with the highest max(ε_i) for subsequent DFT evaluation, which promotes diversity in atomic environments. We set the uncertainty threshold to ε_i > 50 meV/Å, well above the force error of the model, to ensure that only structures poorly represented by the committee are selected. Structures with uncertainty above the threshold are passed to further DFT calculations and added to the training data, forming a closed loop of iterative self-improvement (red circled arrows in Fig. 1). Once the number of recommended structures converges to a near-constant value, we deploy the model to evaluate atomic forces and subsequent phonon properties for all unseen structures (green path in Fig. 1). For more details about the model and the active learning procedure, we refer the reader to our previous work³¹. After seven rounds of active learning combined with data augmentation, 29.4 million atomic environments were successfully trained into the network (Supplementary Fig. S2).
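
The selection rule built on Eq. (3) can be illustrated with a short NumPy sketch. It assumes that the committee forces are stored as arrays of shape (models, atoms, 3) in eV/Å and uses hand-built toy forces. The function names and the dictionary layout are hypothetical and do not reproduce the actual Elemental-SDNNFF code.

```python
import numpy as np

THRESHOLD = 0.050  # eV/Å, i.e. the 50 meV/Å threshold quoted in the text


def atomic_uncertainty(forces):
    """Eq. (3): forces has shape (n_models, n_atoms, 3); returns (n_atoms,)."""
    mean_force = forces.mean(axis=0)                       # committee average
    sq_dev = np.sum((forces - mean_force) ** 2, axis=-1)   # ||f_i^m - mean||^2
    return np.sqrt(sq_dev.mean(axis=0))                    # average over models


def select_for_dft(committee_forces, threshold=THRESHOLD):
    """Pick, per structure, the supercell with the largest max(eps_i).

    committee_forces maps a structure id to a list of force arrays, one per
    displaced supercell. Only structures above the threshold are returned,
    as {structure id: index of the chosen supercell}.
    """
    selected = {}
    for structure_id, supercells in committee_forces.items():
        scores = [atomic_uncertainty(f).max() for f in supercells]
        best = int(np.argmax(scores))
        if scores[best] > threshold:
            selected[structure_id] = best
    return selected


def toy_forces(scale):
    """Five models, four atoms; the models disagree only on atom 0 along x."""
    forces = np.zeros((5, 4, 3))
    forces[:, 0, 0] = scale * np.arange(5.0)
    return forces


committee_forces = {
    "A": [toy_forces(0.0), toy_forces(0.01)],
    "B": [toy_forces(0.0), toy_forces(0.1), toy_forces(0.01)],
}
selected = select_for_dft(committee_forces)
print(selected)
```

In this toy case only structure "B" contains a supercell whose largest atomic uncertainty exceeds the threshold, so a single supercell of "B" would be sent to DFT in the next active learning round.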

Data availability

The main data supporting the findings of this study, including those generated by full DFT calculations, are available from the corresponding author upon reasonable request.

Code availability

The trained neural network model and source codes used to evaluate atomic forces in displaced supercell structures are available from the corresponding author upon reasonable request.

References

Qin, G., Qin, Z., Wang, H. & Hu, M. Anomalously temperature-dependent thermal conductivity of monolayer GaN with large deviations from the traditional 1/T law. Phys. Rev. B 95, 1–10 (2017).
Padture, N. P., Gell, M. & Jordan, E. H. Thermal barrier coatings for gas-turbine engine applications. Science 296, 280–284 (2002).
Dumur et al. Quantum communication with itinerant surface acoustic wave phonons. npj Quantum Inf. 7, 1–5 (2021).
Jain, A. et al. Commentary: The materials project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Tang, D. S., Qin, G. Z., Hu, M. & Cao, B. Y. Thermal transport properties of GaN with biaxial strain and electron-phonon coupling. J. Appl. Phys. 127, 035102 (2020).
Qin, G. & Hu, M. Accelerating evaluation of converged lattice thermal conductivity. npj Comput. Mater. 4, 3 (2018).
Zhou, Y., Xiong, S., Zhang, X., Volz, S. & Hu, M. Thermal transport crossover from crystalline to partial-crystalline partial-liquid state. Nat. Commun. 9, 1–8 (2018).
Qin, G., Qin, Z., Wang, H. & Hu, M. Lone-pair electrons induced anomalous enhancement of thermal transport in strained planar two-dimensional materials. Nano Energy 50, 425–430 (2018).
Togo, A. & Tanaka, I. First principles phonon calculations in materials science. Scr. Mater. 108, 1–5 (2015).
Lindsay, L., Katre, A., Cepellotti, A. & Mingo, N. Perspective on ab initio phonon thermal transport. J. Appl. Phys 126, 1–20 (2019).
Li, J. F., Liu, W. S., Zhao, L. D. & Zhou, M. High-performance nanostructured thermoelectric materials. NPG Asia Mater. 2, 152–158 (2010).
Ma, T., Chakraborty, P., Guo, X., Cao, L. & Wang, Y. First-principles modeling of thermal transport in materials: achievements, opportunities, and challenges. Int. J. Thermophys. 41, 9 (2020).
Anand, S., Wood, M., Xia, Y., Wolverton, C. & Snyder, G. J. Double half-Heuslers. Joule 3, 1226–1238 (2019).
He, J. et al. Ultralow thermal conductivity in full Heusler semiconductors. Phys. Rev. Lett. 117, 1–6 (2016).
Roekeghem, A., Carrete, J., Oses, C., Curtarolo, S. & Mingo, N. High-throughput computation of thermal conductivity of high-temperature solid phases: the case of oxide and fluoride perovskites. Phys. Rev. X 6, 1–10 (2016).
Ding, J. et al. Soft anharmonic phonons and ultralow thermal conductivity in Mg3(Sb, Bi)2 thermoelectrics. Sci. Adv. 7, 1–8 (2021).
Pal, K., Xia, Y., He, J. & Wolverton, C. Intrinsically low lattice thermal conductivity derived from Rattler cations in an AMM′Q 3 family of chalcogenides. Chem. Mater. 31, 8734–8741 (2019).
Pal, K. et al. Accelerated discovery of a large family of quaternary chalcogenides with very low lattice thermal conductivity. npj Comput. Mater. 7, 82 (2021).
Eivari, H. A., Sohbatzadeh, Z., Mele, P. & Assadi, M. H. N. Low thermal conductivity: fundamentals and theoretical aspects in thermoelectric applications. Mater. Today Energy 21, 100744 (2021).
Ghosh, T., Dutta, M., Sarkar, D. & Biswas, K. Insights into low thermal conductivity in inorganic materials for thermoelectrics. J. Am. Chem. Soc. 144, 10099–10118 (2022).
Zhu, Z., Xi, J. & Yang, J. Significant reduction in lattice thermal conductivity in a p-type filled skutterudite due to strong electron-phonon interactions. J. Mater. Chem. A 10, 13484–13491 (2022).
Li, C., Ravichandran, N. K., Lindsay, L. & Broido, D. Fermi surface nesting and phonon frequency gap drive anomalous thermal transport. Phys. Rev. Lett. 121, 175901 (2018).
Kang, J. S., Li, M., Wu, H., Nguyen, H. & Hu, Y. Experimental observation of high thermal conductivity in boron arsenide. Science 361, 575–578 (2018).
Han, Z., Yang, X., Li, W., Feng, T. & Ruan, X. FourPhonon: an extension module to ShengBTE for computing four-phonon scattering rates and thermal conductivity. Comput. Phys. Commun. 270, 108179 (2022).
Zeng, Z., Chen, C., Zhang, C., Zhang, Q. & Chen, Y. Critical phonon frequency renormalization and dual phonon coexistence in layered Ruddlesden-Popper inorganic perovskites. Phys. Rev. B 105, 1–7 (2022).
Kirklin, S. et al. The open quantum materials database (OQMD): assessing the accuracy of DFT formation energies. npj Comput. Mater. 1, 15010 (2015).
McGaughey, A. J. H., Jain, A., Kim, H. Y. & Fu, B. Phonon properties and thermal conductivity from first principles, lattice dynamics, and the Boltzmann transport equation. J. Appl. Phys. 125, 011101 (2019).
Wei, H., Bao, H. & Ruan, X. Perspective: Predicting and optimizing thermal transport properties with machine learning methods. Energy AI 8, 100153 (2022).
Bhattacharjee, D., Kundavu, K., Saraswat, D., Raghuvanshi, P. R. & Bhattacharya, A. Thorough descriptor search to machine learn the lattice thermal conductivity of half-Heusler compounds. ACS Appl. Energy Mater. 5, 8913–8922 (2022).
Ju, S. et al. Exploring diamondlike lattice thermal conductivity crystals via feature-based transfer learning. Phys. Rev. Mater. 5, 1–2 (2021).
Rodriguez, A. et al. Million-scale data integrated deep neural network for phonon properties of heuslers spanning the periodic table. npj Comput. Mater. 9, 20 (2023).
Zhou, F., Nielson, W., Xia, Y. & Ozoliņš, V. Compressive sensing lattice dynamics. I. General formalism. Phys. Rev. B 100, 184308 (2019).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD). JOM 65, 1501–1509 (2013).
Deng, B. et al. CHGNet: pretrained universal neural network potential for charge-informed atomistic modeling. 1–12 (2023).
Zhu, T. et al. Charting lattice thermal conductivity for inorganic crystals and discovering rare earth chalcogenides for thermoelectrics. Energy Environ. Sci. 14, 3559–3566 (2021).
Rodriguez, A., Liu, Y. & Hu, M. Spatial density neural network force fields with first-principles level accuracy and application to thermal transport. Phys. Rev. B 102, 35203 (2020).
Togo, A., Chaput, L. & Tanaka, I. Distributions of phonon lifetimes in Brillouin zones. Phys. Rev. B Condens. Matter Mater. Phys. 91, 094306 (2015).
Carrete, J., Li, W., Mingo, N., Wang, S. & Curtarolo, S. Finding unprecedentedly low-thermal-conductivity half-heusler semiconductors via high-throughput materials modeling. Phys. Rev. X 4, 1–9 (2014).
Jain, A., Veeravenkata, H. P., Godse, S. & Srivastava, Y. High-throughput computational discovery of 40 ultralow thermal conductivity and 20 highly anisotropic crystalline materials. (2022).
Dronskowski, R. & Bloechl, P. E. Crystal orbital Hamilton populations (COHP): energy-resolved visualization of chemical bonding in solids based on density-functional calculations. J. Phys. Chem. 97, 8617–8624 (1993).
Maintz, S., Deringer, V. L., Tchougréeff, A. L. & Dronskowski, R. LOBSTER: a tool to extract chemical bonding from plane-wave based DFT. J. Comput. Chem. 37, 1030–1035 (2016).
Xia, Y. et al. High-throughput study of lattice thermal conductivity in binary rocksalt and zinc blende compounds including higher-order anharmonicity. Phys. Rev. X 10, 41029 (2020).
Yuan, K., Zhang, X., Chang, Z., Tang, D. & Hu, M. Antibonding induced anharmonicity leading to ultralow lattice thermal conductivity and extraordinary thermoelectric performance in CsK2X (X = Sb, Bi). J. Mater. Chem. C 15822–15832 https://doi.org/10.1039/d2tc03356a (2022).
Chang, Z. et al. Zintl phase compounds Mg3Sb2−xBix (x = 0, 1, and 2) monolayers: electronic, phonon and thermoelectric properties from ab initio calculations. Front. Mech. Eng. 8, 1–11 (2022).
Iotti, R. C., Ciancio, E. & Rossi, F. Quantum transport theory for semiconductor nanostructures: a density-matrix formulation. Phys. Rev. B Condens. Matter Mater. Phys. 72, 1–21 (2005).
Rossi, F. & Kuhn, T. Theory of ultrafast phenomena in photoexcited semiconductors. Rev. Mod. Phys. 74, 895–950 (2002).
Mukhopadhyay, S. et al. Two-channel model for ultralow thermal conductivity of crystalline Tl3VSe4. Science 360, 1455–1458 (2018).
Simoncelli, M., Marzari, N. & Mauri, F. Unified theory of thermal transport in crystals and glasses. Nat. Phys. 15, 809–813 (2019).
Simoncelli, M., Marzari, N. & Mauri, F. Wigner formulation of thermal transport in solids. Phys. Rev. X 12, 41011 (2022).
Zeng, Z. et al. Nonperturbative phonon scatterings and the two-channel thermal transport in Tl3VSe4. Phys. Rev. B 103, 1–7 (2021).
Li, W., Carrete, J., Katcho, N. A. & Mingo, N. ShengBTE: a solver of the Boltzmann transport equation for phonons. Comput. Phys. Commun. 185, 1747–1758 (2014).
Fu, X. et al. Forces are not enough: benchmark and critical evaluation for machine learning force fields with molecular simulations. 1–25 (2022).
Tranås, R., Løvvik, O. M., Tomic, O. & Berland, K. Lattice thermal conductivity of half-Heuslers with density functional theory and machine learning: enhancing predictivity by active sampling with principal component analysis. Comput. Mater. Sci. 202, 110938 (2022).
Batzner, S. et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 1–11 (2022).
Zhang, L., Lin, D. Y., Wang, H., Car, R. & Weinan, E. Active learning of uniformly accurate interatomic potentials for materials simulation. Phys. Rev. Mater. 3, 23804 (2019).


Acknowledgements

A.R. acknowledges the financial support by the Department of Energy, Office of Nuclear Energy, Integrated University Program Graduate Fellowship (IUP) under award no. DE-NE-0000095 and NASA SC Space Grant Consortium REAP Program (Award No.: 521383-RP-SC004). Research reported in this work was supported in part by NSF under awards 2030128 and 2110033, SC EPSCoR Program under award number (23-GC01), and an ASPIRE grant from the Office of the Vice President for Research at the University of South Carolina (project 80005046).

Author information

Authors and Affiliations

Department of Mechanical Engineering, University of South Carolina, Columbia, SC, 29208, USA
Alejandro Rodriguez, Mohammed Al-Fahdi & Ming Hu
Theory and Simulation of Materials (THEOS), École Polytechnique Fédérale de Lausanne, CH-1015, Lausanne, Switzerland
Changpeng Lin
Institute of Materials Science, Technical University of Darmstadt, 64287, Darmstadt, Germany
Chen Shen & Hongbin Zhang
School of Energy and Power Engineering, Dalian University of Technology, Dalian, 116024, China
Kunpeng Yuan & Xiaoliang Zhang

Authors

Alejandro Rodriguez
Changpeng Lin
Chen Shen
Kunpeng Yuan
Mohammed Al-Fahdi
Xiaoliang Zhang
Hongbin Zhang
Ming Hu

Contributions

M.H. conceived the idea and designed and supervised the study. A.R. performed the neural network potential training and testing as well as the active learning loop. C.L. wrote the code utilizing compressive sensing lattice dynamics (CSLD) for fitting the 2nd- and 3rd-order IFCs. A.R. and C.S. performed DFT calculations. K.Y. revised the ShengBTE code to quantify the coherence contribution to overall thermal transport. M.A. performed the COHP calculations and bonding/antibonding analysis. A.R. prepared the draft of the manuscript. X.Z., H.Z., and M.H. revised the manuscript. All the authors contributed to discussions and interpretation of results in the manuscript.

Corresponding author

Correspondence to Ming Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Materials thanks Sheng Gong and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Milica Todorović and Aldo Isidori.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Rodriguez, A., Lin, C., Shen, C. et al. Unlocking phonon properties of a large and diverse set of cubic crystals by indirect bottom-up machine learning approach. Commun Mater 4, 61 (2023). https://doi.org/10.1038/s43246-023-00390-3


Received: 26 February 2023
Accepted: 04 August 2023
Published: 15 August 2023
DOI: https://doi.org/10.1038/s43246-023-00390-3