## Abstract

The knowledge of the vibrational properties of a material is of key importance to understand physical phenomena such as thermal conductivity, superconductivity, and ferroelectricity among others. However, detailed experimental phonon spectra are available only for a limited number of materials, which hinders the large-scale analysis of vibrational properties and their derived quantities. In this work, we perform *ab initio* calculations of the full phonon dispersion and vibrational density of states for 1521 semiconductor compounds in the harmonic approximation based on density functional perturbation theory. The data is collected along with derived dielectric and thermodynamic properties. We present the procedure used to obtain the results, the details of the provided database and a validation based on the comparison with experimental data.

### Metadata summary

Design Type(s) | |
---|---|

Measurement Type(s) | |

Technology Type(s) | |

Factor Type(s) | |

Sample Characteristic(s) |

Machine-accessible metadata file describing the reported data (ISA-tab format)

## Background & Summary

The phonon spectrum of a material describes the dynamics of its constituent atoms in the harmonic approximation, in the framework of the long-established theory of lattice vibrations^{1,2}. The details of the lattice dynamics are of key importance as many properties can not be explained by static models. A simple example is the set of thermal properties extracted from the phonon density of states (DOS), such as the vibrational contribution to the entropy of the system and the heat capacity^{3,4,5}. But the vibrational properties of crystalline solids are also needed to investigate a number of other materials features, such as the thermal conductivity^{6,7,8}, the conventional phonon-mediated superconductivity^{9,10,11} and the ferroelectric and ferroelastic transitions^{12,13,14,15}. Additionally, they provide information for the investigation of the phase stability of compounds^{16} through the inspection of imaginary phonon modes and for the interpretation of Raman experimental spectra^{17}.

Experimental phonon band structures are available only for a limited set of compounds and, in some cases, only for specific points of the Brillouin zone. Density functional theory (DFT) offers the possibility to obtain the vibrational properties of materials using frozen-phonon^{18} or molecular dynamics^{19,20}. Alternatively, density functional perturbation theory (DFPT) is an accurate and efficient tool to calculate the lattice dynamics^{21}. Given the requirements of these simulations, it is just recently, with the increase of computational power and the diffusion of high-throughput (HT) frameworks^{22,23,24}, that handling and analysing large numbers of phonon calculations has been made possible.

Standard studies reporting phonon calculations based on DFT usually consider few materials in selected phases. More recently efforts have been devoted to evaluate the phonon band structures for a large number of compounds (Atsushi Togo's phonon database http://phonondb.mtl.kyoto-u.ac.jp and ref. 25). In the present work, we report the full phonon band structures and derived quantities for 1521 semiconducting inorganic crystals, obtained following the procedure and the approximations detailed in our previous work^{26}. The phonons are related to the second order derivatives of the energies with respect to the atomic displacements. However, to obtain the correct behavior for long range interactions in the case of polar materials, the coupling between the displacements and the electric field^{27} must also be considered. The latter is related to the mixed second order derivatives of the energy with respect to the electric field and atomic displacements. These derivatives, as well as those purely with respect to the electric field, can be efficiently calculated in the framework of DFPT. They give access to the Born effective charges (BECs) and the dielectric tensors, respectively. Here, we provide this full set of second order derivatives in an open database. The schematic overview of available properties and the procedure to obtain them is outlined in Fig. 1.

This derivatives database offers the possibility to analyze the lattice dynamics of a large number of compounds, generated with uniform approximations and under a validated procedure. These results are part of the Materials Project^{28} (MP) which uses HT methods to predict material properties for the discovery and design of new compounds.

The remainder of the paper is organized as follows. First, we define the phonon properties calculated and the procedure employed to obtain them. We then describe the structure of the data present in the database and give a graphical representation. Finally, we provide a validation of the results based on a comparison with experimental data.

## Methods

### Methodology and definitions

Most of the methodology and notations follow closely ref. 29. For a generic point **q** in the Brillouin zone the phonon frequencies *ω*_{q,m} and eigenvectors *U*_{m}(* qκ*′

*β*) can be obtained by solving of the generalized eigenvalue problem $$\begin{array}{}\text{(1)}& \sum _{\kappa \prime \beta}{\tilde{C}}_{\kappa \alpha ,\kappa \prime \beta}(\mathbf{q}){U}_{m}(\mathbf{q}\kappa \prime \beta )={M}_{\kappa}{\omega}_{\mathbf{q},m}^{2}{U}_{m}(\mathbf{q}\kappa \alpha ),\end{array}$$ where

*κ*labels the atoms in the cell,

*α*and

*β*are cartesian coordinates and ${\tilde{C}}_{\kappa \alpha ,\kappa \prime \beta}(\mathbf{q})$ are the interatomic force constants in reciprocal space, which are related to the second derivatives of the energy with respect to atomic displacements. These values have been obtained by performing a Fourier interpolation of those calculated on a regular grid of

*q*-points obtained with DFPT.

To correctly describe the limit **q**→0 for polar materials, with the splitting between longitudinal and transverse optical modes (LO and TO, respectively) the dipole-dipole interaction has been taken into account. This requires the knowledge of both the BEC and the dielectric tensors^{29}. The BEC tensor can be linked either to the change of polarisation $\mathcal{P}$ induced by the periodic displacement *τ*_{κα}, or to the force *F*_{κ,α} induced on atom *κ* by an electric field ${\mathcal{E}}_{\beta}$
$$\begin{array}{}\text{(2)}& {Z}_{\kappa ,\beta \alpha}^{*}={\mathrm{\Omega}}_{0}\frac{\partial \mathcal{P}}{\partial {\tau}_{\kappa \alpha}}=\frac{\partial {F}_{\kappa ,\alpha}}{\partial {\mathcal{E}}_{\beta}}=\frac{{\partial}^{2}E}{\partial {\tau}_{\kappa \alpha}\partial {\mathcal{E}}_{\beta}},\end{array}$$
where *E* is the total energy, and is an observable quantity.

In the theoretical formulation, the interatomic force constants $\tilde{C}$ and BEC tensors *Z** must satisfy a series of sum rules. The first, following from the invariance of total energy with respect to translations (known as acoustic sum rule, ASR)
$$\begin{array}{}\text{(3)}& \sum _{\kappa}{\tilde{C}}_{\kappa \alpha ,\kappa \prime \beta}\left(\mathbf{q}=0\right)=0,\end{array}$$
implies that the acoustic modes at Γ are identically zero. The second rule
$$\begin{array}{}\text{(4)}& \sum _{\kappa}{Z}_{\kappa ,\beta \alpha}^{*}=0,\end{array}$$
guarantees that the charge neutrality is fulfilled at the level of the BECs (CNSR). Both are imposed during the interpolation process to improve the results, but the actual deviation from the exact condition can be used to estimate the degree of convergence of the calculation (see below).

The second derivatives with respect to the electric field $\mathcal{E}$ allow one to also obtain the dielectric permittivity tensor resulting from the electronic polarization, usually noted as ${\epsilon}_{\alpha \beta}^{\infty}$. This, together with the values of the phonon frequencies at the center of the Brillouin zone *ω*^{Γ}_{m} and the oscillator strength tensor *f*_{m,αβ}, gives the static dielectric tensor
$$\begin{array}{}\text{(5)}& {\epsilon}_{\alpha \beta}^{0}={\epsilon}_{\alpha \beta}^{\infty}+4\pi {\displaystyle \sum _{m}}\frac{{f}_{m,\alpha \beta}^{2}}{{\left({\omega}_{m}^{\mathrm{\Gamma}}\right)}^{2}}.\end{array}$$

Given the phonon DOS *g*(*ω*)
$$\begin{array}{}\text{(6)}& g(\omega )=\frac{1}{3nN}{\displaystyle \sum _{\mathbf{q},l}}\delta \left(\omega -\omega (\mathbf{q},l)\right),\end{array}$$
where *n* is the number of atoms per unit cell and *N* is the number of unit cells, several thermodynamic quantities can be obtained in the harmonic approximation: the Helmholtz free energy Δ*F*, the phonon contribution to the internal energy Δ*E*_{ph}, the constant-volume specific heat *C*_{v} and the entropy *S*. The explicit expressions are given by^{3}:
$$\begin{array}{}\text{(7)}& \mathrm{\Delta}F=3nN{k}_{B}T{\int}_{0}^{{\omega}_{L}}\mathrm{ln}\left(2\mathrm{sinh}\frac{\hslash \omega}{2{k}_{B}T}\right)g(\omega )d\omega \end{array}$$
$$\begin{array}{}\text{(8)}& \mathrm{\Delta}{E}_{\mathrm{ph}}=3nN\frac{\hslash}{2}{\int}_{0}^{{\omega}_{L}}\omega \mathrm{coth}\left(\frac{\hslash \omega}{2{k}_{B}T}\right)g(\omega )d\omega \end{array}$$
$$\begin{array}{}\text{(9)}& {C}_{v}=3nN{k}_{B}{\int}_{0}^{{\omega}_{L}}{\left(\frac{\hslash \omega}{2{k}_{B}T}\right)}^{2}{\mathrm{csch}}^{2}\left(\frac{\hslash \omega}{2{k}_{B}T}\right)g(\omega )d\omega \end{array}$$
$$\begin{array}{}\text{(10)}& S=3nN{k}_{B}{\int}_{0}^{{\omega}_{L}}\left(\frac{\hslash \omega}{2{k}_{B}T}\mathrm{coth}\left(\frac{\hslash \omega}{2{k}_{B}T}\right)-\mathrm{ln}\left(2\mathrm{sinh}\frac{\hslash \omega}{2{k}_{B}T}\right)\right)g(\omega )d\omega ,\end{array}$$
where *k*_{B} is the Boltzmann constant and *ω*_{L} is the largest phonon frequency.

Notice that in cases where imaginary frequencies are present in the system, these thermodynamic properties are ill defined and will not be calculated.

All the DFT and DFPT calculations presented in this work are performed with the ABINIT software package^{30,31,32}. The PBEsol^{33} semilocal generalized gradient approximation for the exchange-correlation functional (XC), that has proven to provide accurate phonon frequencies compared to experimental data^{34}, has been used for all the simulations. Norm-conserving pseudopotentials^{35}, generated with the appropriate XC functional, are taken for all the elements from the pseudopotentials table PseudoDojo version 0.3 (ref. 36). The plane wave cutoff is chosen based on the hardest element for each compound, according to the values suggested in the PseudoDojo table.

The Brillouin zone has been sampled using equivalent *k*-point and *q*-point grids that respect the symmetries of the crystal and with a density of approximately 1500 points per reciprocal atom, as suggested in ref. 26. The *q*-point grid is always Γ-centered. All the structures are relaxed with strict convergence criteria, i.e. until all the forces on the atoms are below 10^{−6} Ha/Bohr and the stresses are below 10^{−4} Ha/Bohr^{3}.

For all the materials, the primitive cells and the band structures are defined according to the conventions of Setyawan and Curtarolo^{37}.

### Numerical precision estimation

We have identified a set of indicators that can give hints about the level of numerical precision of our results. The main ones are the aforementioned breaking of the acoustic and charge neutrality sum rules. While these properties are explicitly imposed, the breaking can be usually reduced by increasing the plane wave cutoff. This suggest that a large breaking could signal a lack of convergence with respect to this parameter.

We have also observed^{26} that the presence of small negative frequencies for the acoustic phonon frequencies in the close proximity of the Γ point could be associated with poor choices of the *k* or *q* point grids. In particular, we have observed that these are hardly ever a signal of a real incommensurate instability.

Despite the presence of these indications of a possible lack of convergence, the results obtained from the calculations with the imposition of the sum rules can still be reliable or give rather accurate values away from problematic regions, providing useful information, especially for screening purposes over large sets of data. Potentially problematic calculations are thus included in the database along with the numerical value of the ASR and CNSR. Three flags will help quickly identify these cases. One is set if the largest acoustic mode at Γ is larger than 30 cm^{−1}, when the ASR is not explicitly imposed. A second one signals that the value of
$$\begin{array}{}\text{(11)}& {\displaystyle \underset{\alpha ,\beta}{\mathrm{max}}}\left({\displaystyle \sum _{\kappa}}{Z}_{\kappa ,\beta \alpha}^{*}\right)\end{array}$$
is larger than 0.2, that corresponds to the breaking of the CNSR. The last one indicates the presence of negative frequencies just in the region 0<|* q*|<0.05 in fractional coordinates along the high symmetry lines. Materials with likely real instabilities, showing negative (imaginary) frequencies also beyond this limit, do not have this flag set.

### Workflow

The workflow employed to handle our HT calculation is outlined in Fig. 2. The structures present in the MP database^{28,38} are taken as starting point, considering only semiconducting and insulating materials. Since these have been optimized within the projector augmented wave framework and for a different XC functional (PBE), we first perform a full relaxation of the system with strict convergence parameters. The following step consists in running the DFPT simulations to obtain the second derivatives of the energy with respect to the different perturbations considered. These calculations are carried out in parallel over all the perturbations and all the *q*-points. If the calculations are completed correctly, the set of derivatives is then used to generate the phonon band structure and DOS, along with the derived quantities, using a Fourier interpolation scheme. Unsuccessful calculations are analyzed and rerun if possible or discarded otherwise.

At this point the results undergo the controls defined in the previous section, i.e. the breaking of the ASR and CNSR and the presence of small negative frequencies close to Γ, and the corresponding flags are set in the record. A further flag is added if the band structure shows any negative frequency with absolute value larger than 5 cm^{−1}, to signal the likely presence of an instability.

Finally, all the records are inserted into the MP database. From this they will be made available on the MP website and a JSON (JavaScript Object Notation) data document is generated for each record. A copy of all the JSON documents is available for download from the Figshare repository (Data Citation 1: Figshare https://doi.org/10.6084/m9.figshare.c.3938023).

### Code availability

The open source code ABINIT^{30,31,32} is used throughout this work for calculations of phonon properties. ABINIT is distributed under the GNU General Public Licence. The workflows used to run the simulations are implemented using FireWorks as workflow manager^{39} (https://github.com/materialsproject/fireworks) and specific workflows are available in the Abiflows package (https://github.com/abinit/abiflows). The Pymatgen^{40} and AbiPy (https://github.com/abinit/abipy) python packages are used to generate inputs and analyze the results. Pymatgen is released under the MIT (Massachusetts Institute of Technology) License and is open source. AbiPy is released under the GNU GPL license. FireWorks is released under a modified BSD license.

## Data Records

The calculated phonon properties and derived quantities for 1521 materials are made available in this work. The materials include only inorganic solid semiconductors and insulators, with 1508 of these having less than 13 atomic sites per cell. The second order derivatives of the energy in the ABINIT derivative database file format (DDB) and the processed results in the JSON format can be downloaded from the Figshare repository (Data Citation 1: Figshare https://doi.org/10.6084/m9.figshare.c.3938023). The data will additionally be made accessible through the Materials Project website (www.materialsproject.org) where we will provide static and interactive plots of the phonon dispersion.

### Second order derivatives of the total energy

The outputs of the DFPT calculation are the second order derivatives of the energies with respect to atomic displacements on a regular grid in the Brillouin zone and the second order derivatives of the energies with respect to static homogeneous electric field. These quantities are stored in the ABINIT DDB file format which is a human readable text format. A DDB file for each material is available in the Figshare repository (Data Citation 1: Figshare https://doi.org/10.6084/m9.figshare.c.3938023).

### Processed data

The processed data for each of the calculated material is stored as a JSON document (Data Citation 1: Figshare https://doi.org/10.6084/m9.figshare.c.3938023). JSON is a textual lightweight data-interchange format, that can be easily parsed by machines. It is built on two kinds of structures i) a collection of key/value pairs and ii) an ordered list of values. These structures can be nested. Each JSON file contains the data for a single material with the top level keys described in Table 1. The content of the second level is detailed in the Tables 2,3,4,5,6.

The metadata defined in Table 2 provides a description of the material and its characteristics, as well as details about the approximations employed in the calculation. The phonon properties are reported as described in Table 3. The phonon band structure is available for *n*_{qpts} *q*-points along the high-symmetry path, as defined in ref. 37, for each of the modes (*n*_{modes}=3×*n*_{sites}). The DOS is reported with two keys describing the list of frequencies and the corresponding values of the DOS. The thermodynamic properties obtained from the integration of the DOS are calculated on a uniformly spaced list of *n*_{T} temperatures with each property as a list of corresponding values as shown in Table 4. In the case where large negative frequencies are present in the material, the thermodynamic properties have not been calculated and the values corresponding to the *thermo* key at the top level (see Table 1) is empty. Dielectric properties and BECs are given according to the description provided in Table 5.

The estimations of the breaking of the sum rules are given, as defined in previous sections, in Tables 3 and 5. Two flags signal the cases where these values are considered large and the corresponding keys are given in Table 6, with two more flags concerning the presence of negative frequencies.

### Graphical representation of results

Figures 3 and 4 are examples of graphical representation of the data stored in the database. The data reported under the *phonon* key of the JSON file (Table 3) can be used to plot the phonon band structure and DOS of each calculated material. An example showing the comparison of phonon band structures and DOS for three different phases of SiO_{2} (*β*-cristobalite, stishovite and *α*-quartz), is reported in Fig. 3.

Figure 4 illustrates the correlation between the average phonon frequency *ω*^{−}, calculated as
$$\begin{array}{}\text{(12)}& \overline{\omega}=\frac{\int \omega \mathit{g}\left(\omega \right)d\omega}{\int \mathit{g}\left(\omega \right)d\omega},\end{array}$$
and the average atomic mass of the compound, that we define as
$$\begin{array}{}\text{(13)}& \overline{m}={\left(\frac{1}{n}{\displaystyle \sum _{\kappa}}\sqrt{{M}_{\kappa}}\right)}^{-2},\end{array}$$
to better fit the values in equation (1). Only the cases where no negative frequencies are present in the phonon spectrum have been considered. As expected based on the relation between masses and frequencies given by equation (1), heavier elements are usually associated with lower average frequencies and the data follows the trend *ω*^{−}~1/(*m*^{−})^{1/2}. The data displays a spread around the hyperbolic fit, since the phonon frequencies are the outcome of the interplay of the whole set of interatomic force constants and of the different masses of the elements composing the material. It can be noticed that some trends can be recognized with respect to the masses of the components. Systems with non-uniform masses (identified by the small ratio *m*_{min}/*m*^{−}), tend to lay on a different hyperbolic curve with respect to more uniformly weighted systems.

## Technical Validation

The methodology used in this work and the choice of the sampling of the Brillouin zone have been analyzed in a previous study^{26}. There, it has been shown that properly converged results can be obtained with a sampling of the Brillouin zone that respects the symmetry of the system and with a density of 1500 points per reciprocal atom, both for the *k*-point and *q*-point sampling. The results have been validated by comparing the sound velocities obtained from the phonon band structures with those calculated from the elastic tensors and checking the errors of the calculated vibrational entropy with respect to experimental data (see equation (10)). The tests showed a satisfactory accuracy, especially for screening purposes.

The pseudopotentials available in the PseudoDojo table^{36} have been evaluated by checking the Δ value with respect to the results obtained from all electron DFT codes^{41,42}. However, since for the PBEsol functional these reference values do not exist, we used input parameters almost equivalent to those obtained for the PBE table to generate the PBEsol pseudopotentials. Further tests have been carried out on these new pseudopotentials to ensure a limited breaking of the ASR, exclude the presence of ghost states in the occupied and empty regions, and convergence of phonon modes at Γ. All these tests have been used to determine optimal values for the suggested energy cutoffs.

As a further validation, we present the comparison of the phonon frequencies at the Γ point calculated in this work with the available experimental data for 53 compounds.^{34,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70} The majority of the materials considered are cubic systems with few atoms per supercells. However at least one material per crystal system is present in the test set, with system sizes up to 12 atoms per unit cell.

We considered all the frequencies at Γ for which the experimental data are available and matched them with the calculated values according to the symmetry of the modes. In Fig. 5 we report the relative errors of each of these frequencies with respect to the experimental data. With a mean relative error (MRE) over all the frequencies of −3.6%, it can clearly be seen that, on average, the simulations underestimate the values of the phonon frequencies. This underestimation, however, has been observed for different generalized gradient approximations (GGA) to the XC functional^{34} and is not limited to the case of the PBEsol employed in this work. Satisfactorily, almost all the errors are within 10%, with just 15 frequencies exceeding this threshold. In these cases, the calculated values are in good agreement with other simulations present in literature, showing that the origin of the disagreement does not lie in a lack of precision in our procedure. In particular, the worst examples in the set, with errors of −38% and −31.5%, belong to rock salt BaO (mp-1342) and to cubic ZrO_{2} (mp-1565), respectively. It has been shown^{71} that the large error in the TO mode of BaO is the result of the occurrence of strong anharmonicities, so that such frequencies cannot be accurately reproduced using the harmonic approximation. On the other hand, since cubic zirconia is unstable in normal conditions, the experimental values are obtained from yttria-stabilized samples. This is likely to perturb the phonon frequencies with respect to the ideal values.

With a mean absolute relative error (MARE) of 4.6%, we then conclude that our test set is in reasonably good agreement with the experimental data available.

Finally, in order to strengthen our choice of using the PBEsol XC functional, we evaluated the error of the values of the entropy (see equation (10)) at ~300*K* obtained in this work with respect to experimental data. These can be compared with the errors obtained previously using the Perdew-Burke-Ernzerhof (PBE)^{72} approximation in ref. 26. The relative errors with respect to experimentally measured data are shown for both the functionals in Fig. 6 for 27 compounds. As already remarked, the agreement is quite good, considering also that at room temperature the thermal expansion and the anharmonic effects, not included in our simulations, can already be playing a role.

While for a few materials the error obtained for the PBEsol values are larger than that coming from the PBE calculations, the former provide in general a better agreement, with a MARE over all the materials of 2.97% for PBEsol versus 4.25% for PBE. This analysis further confirms the higher accuracy of the PBEsol approximation with respect to PBE.

## Usage Notes

We present the processed phonon and dielectric properties of 1521 semiconducting materials. These can be used, for example, to identify trends in vibrational frequencies and thermodynamic properties. The data is provided in JSON files allowing to quickly extract information from the dataset.

If more detailed knowledge of specific quantities is required, we refer the users to the DDB files. These contain all the second order derivatives of the energies with respect to atomic perturbations on a set of points in the irreducible zone and the second order derivatives of the energies with respect to a static homogeneous electric field. These files can be processed using the *anaddb* code^{31} to extract several quantities. These include, (i) phonon frequencies at any point in the Brillouin zone, with and without the imposition of the sum rules discussed above, (ii) phonon band structures along custom paths in the Brillouin zone, (iii) interatomic force constants in real space (iv) projected phonon density of states. Ample documentation is available on the ABINIT website (https://www.abinit.org) and a python interface to run the postprocessing tool and plot the results is provided in the AbiPy package. Documentation for the usage of this python package is available online (http://pythonhosted.org/abipy, https://github.com/abinit/abitutorials/blob/master/abitutorials/ddb.ipynb). Example scripts are provided in the Figshare database (Data Citation 1: Figshare https://doi.org/10.6084/m9.figshare.c.3938023).

## Additional information

**How to cite this article**: Petretto, G. et al. High-throughput density functional perturbation theory phonons for inorganic materials. *Sci. Data* 5:180065 doi: 10.1038/sdata.2018.65 (2018).

**Publisher’s note**: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

- 1.
Born, M. & Huang, K.

*Dynamical Theory of Crystal Lattices*(Oxford University Press, 1954). - 2.
Brüesch, P.

*Phonons: Theory and Experiments I. Lattice Dynamics and Models of Interatomic Forces*(Springer-Verlag, 1982). - 3.
Lee, C. & Gonze, X. Ab initio calculation of the thermodynamic properties and atomic temperature factors of SiO

_{2}α-quartz and stishovite.*Phys. Rev. B***51**, 8610–8613 (1995). - 4.
Karki, B. B., Wentzcovitch, R. M., de Gironcoli, S. & Baroni, S. High-pressure lattice dynamics and thermoelasticity of MgO.

*Phys. Rev. B***61**, 8793–8800 (2000). - 5.
Togo, A., Chaput, L., Tanaka, I. & Hug, G. First-principles phonon calculations of thermal expansion in Ti

_{3}SiC_{2}, Ti_{3}AlC_{2}, and Ti_{3}GeC_{2}.*Phys. Rev. B***81**, 174301 (2010). - 6.
Seko, A.

*et al.*Prediction of low-thermal-conductivity compounds with first-principles anharmonic lattice-dynamics calculations and bayesian optimization.*Phys. Rev. Lett.***115**, 205901 (2015). - 7.
Lindsay, L.

*et al.*Phonon thermal transport in strained and unstrained graphene from first principles.*Phys. Rev. B***89**, 155426 (2014). - 8.
Romero, A. H., Gross, E. K. U., Verstraete, M. J. & Hellman, O. Thermal conductivity in PbTe from first principles.

*Phys. Rev. B***91**, 214310 (2015). - 9.
Savrasov, S. Y. & Andersen, O. K. Linear-response calculation of the electron-phonon coupling in doped CaCuO

_{2}.*Phys. Rev. Lett.***77**, 4430–4433 (1996). - 10.
Connétable, D.

*et al.*Superconductivity in doped*sp*^{3}semiconductors: The case of the clathrates.*Phys. Rev. Lett.***91**, 247001 (2003). - 11.
Giustino, F Electron-phonon interactions from first principles.

*Rev. Mod. Phys.***89**, 015003 (2017). - 12.
Ghosez, P., Gonze, X., Lambin, P. & Michenaud, J.-P. Born effective charges of barium titanate: Band-by-band decomposition and sensitivity to structural features.

*Phys. Rev. B***51**, 6765–6768 (1995). - 13.
Zhong, W., King-Smith, R. D. & Vanderbilt, D. Giant LO-TO splittings in perovskite ferroelectrics.

*Phys. Rev. Lett.***72**, 3618–3621 (1994). - 14.
Bousquet, E., Spaldin, N. A. & Ghosez, P. Strain-induced ferroelectricity in simple rocksalt binary oxides.

*Phys. Rev. Lett.***104**, 037601 (2010). - 15.
Togo, A., Oba, F. & Tanaka, I. First-principles calculations of the ferroelastic transition between rutile-type and CaCl

_{2}-type SiO_{2}at high pressures.*Phys. Rev. B***78**, 134106 (2008). - 16.
Togo, A. & Tanaka, I. Evolution of crystal structures in metallic elements.

*Phys. Rev. B***87**, 184104 (2013). - 17.
Decremps, F., Pellicer-Porres, J., Saitta, A. M., Chervin, J.-C. & Polian, A. High-pressure raman spectroscopy study of wurtzite ZnO.

*Phys. Rev. B***65**, 092101 (2002). - 18.
Yin, M. T. & Cohen, M. L. Theory of lattice-dynamical properties of solids: Application to Si and Ge.

*Phys. Rev. B***26**, 3259 (1982). - 19.
Kohanoff, J., Andreoni, W. & Parrinello, M. Zero-point-motion effects on the structure of C

_{60}.*Phys. Rev. B***46**, 4371 (1992). - 20.
Kohanoff, J. Phonon spectra from short non-thermally equilibrated molecular dynamics simulations.

*Comput. Mater. Sci.***2**, 221 (1994). - 21.
Baroni, S., de Gironcoli, S., Dal Corso, A & Giannozzi, P Phonons and related crystal properties from density-functional perturbation theory.

*Rev. Mod. Phys.***73**, 515–562 (2001). - 22.
Jain, A.

*et al.*FireWorks: a dynamic workflow system designed for high-throughput applications.*Concurrency and Computation: Practice and Experience***27**, 5037–5059 (2015). - 23.
Curtarolo, S.

*et al.*AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations.*Computational Materials Science***58**, 227–235 (2012). - 24.
Pizzi, G., Cepellotti, A., Sabatini, R., Marzari, N. & Kozinsky, B. AiiDA: automated interactive infrastructure and database for computational science.

*Computational Materials Science***111**, 218–230 (2016). - 25.
Legrain, F., Carrete, J., van Roekeghem, A., Curtarolo, S. & Mingo, N. How chemical composition alone can predict vibrational free energies and entropies of solids.

*Chemistry of Materials***29**, 6220–6227 (2017). - 26.
Petretto, G., Gonze, X., Hautier, G. & Rignanese, G.-M. Convergence and pitfalls of density functional perturbation theory phonons calculations from a high-throughput perspective.

*Computational Materials Science***144**, 331–337 (2018). - 27.
Giannozzi, P., de Gironcoli, S., Pavone, P. & Baroni, S. Ab initio calculation of phonon dispersions in semiconductors.

*Phys. Rev. B***43**, 7231–7242 (1991). - 28.
Jain, A.

*et al.*Commentary: The Materials Project: A materials genome approach to accelerating materials innovation.*APL Materials***1**, 011002 (2013). - 29.
Gonze, X. & Lee, C. Dynamical matrices, Born effective charges, dielectric permittivity tensors, and interatomic force constants from density-functional perturbation theory.

*Phys. Rev. B***55**, 10355–10368 (1997). - 30.
Gonze, X.

*et al.*First-principles computation of material properties: the ABINIT software project.*Computational Materials Science***25**, 478–492 (2002). - 31.
Gonze, X.

*et al.*ABINIT: First-principles approach to material and nanosystem properties.*Computer Physics Communications***180**, 2582–2615 (2009). - 32.
Gonze, X.

*et al.*Recent developments in the ABINIT software package.*Computer Physics Communications***205**, 106 (2016). - 33.
Perdew, J. P.

*et al.*Restoring the density-gradient expansion for exchange in solids and surfaces.*Phys. Rev. Lett.***100**, 136406 (2008). - 34.
He, L.

*et al.*Accuracy of generalized gradient approximation functionals for density-functional perturbation theory calculations.*Phys. Rev. B***89**, 064305 (2014). - 35.
Hamann, D. R. Optimized norm-conserving Vanderbilt pseudopotentials.

*Phys. Rev. B***88**, 085117 (2013). - 36.
van Setten, M. J.

*et al.*The PseudoDojo: Training and grading a 85 element optimized norm-conserving pseudopotential table.*Computer Physics Communications***226**, 39–54 (2018). - 37.
Setyawan, W. & Curtarolo, S. High-throughput electronic band structure calculations: Challenges and tools.

*Computational Materials Science***49**, 299–312 (2010). - 38.
Ong, S. P.

*et al.*The materials application programming interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles.*Computational Materials Science***97**, 209–215 (2015). - 39.
Jain, A., Hautier, G., Ong, S. P. & Persson, K. New opportunities for materials informatics: Resources and data mining techniques for uncovering hidden relationships.

*Journal of Materials Research***31**, 977–994 (2016). - 40.
Ong, S. P.

*et al.*Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis.*Computational Materials Science***68**, 314–319 (2013). - 41.
Lejaeghere, K., Speybroeck, V. V., Oost, G. V. & Cottenier, S. Error estimates for solid-state density-functional theory predictions: An overview by means of the ground-state elemental crystals.

*Critical Reviews in Solid State and Materials Sciences***39**, 1–24 (2014). - 42.
Lejaeghere, K.

*et al.*Reproducibility in density functional theory calculations of solids.*Science***351**(2016). - 43.
Madelung, O.

*Semiconductors: Data Handbook*(Springer Berlin Heidelberg, 2004). - 44.
Galtier, M., Montaner, A. & Vidal, G. Phonons optiques de CaO, SrO, BaO au centre de la zone de brillouin á 300 et 17K.

*Journal of Physics and Chemistry of Solids***33**, 2295–2302 (1972). - 45.
Jacobs, P. W. & Vernon, M. L. Phonon dispersion and defect energies for rubidium chloride, bromide, iodide, and sulphide.

*Canadian Journal of Chemistry***76**, 1540–1547 (1998). - 46.
Vijayaraghavan, P. R., Nicklow, R. M., Smith, H. G. & Wilkinson, M. K. Lattice dynamics of silver chloride.

*Phys. Rev. B***1**, 4819–4826 (1970). - 47.
Dolling, G., Smith, H. G., Nicklow, R. M., Vijayaraghavan, P. R. & Wilkinson, M. K. Lattice dynamics of lithium fluoride.

*Phys. Rev***168**, 970–979 (1968). - 48.
Buhrer, W. Crystal dynamics of caesium fluoride.

*Journal of Physics C: Solid State Physics***6**, 2931 (1973). - 49.
Reid, J. S., Smith, T. & Buyers, W. J. L. Phonon frequencies in NaBr.

*Phys. Rev. B***1**, 1833–1844 (1970). - 50.
Ahmad, A. A. Z., Smith, H. G., Wakabayashi, N. & Wilkinson, M. K. Lattice dynamics of cesium chloride.

*Phys. Rev. B***6**, 3956–3961 (1972). - 51.
Raunio, G., Almqvist, L. & Stedman, R. Phonon dispersion relations in NaCl.

*Phys. Rev***178**, 1496–1501 (1969). - 52.
Raunio, G. & Almqvist, L. Dispersion relations for phonons in KCl at 80 and 300°k.

*physica status solidi (b)***33**, 209–215 (1969). - 53.
Hayes, R. R. & Rieder, K. H. Raman scattering from RbF and RbBr.

*Phys. Rev. B***8**, 5972–5976 (1973). - 54.
Wagner, V.

*et al.*Optical and acoustical phonon properties of BeTe.*Journal of Crystal Growth***184**, 1067–1071 (1998). - 55.
Yamashita, N., Michitsuji, Y. & Asano, S. Photoluminescence spectra and vibrational structures of the SrS:Ce

^{3+}and SrSe:Ce^{3+}phosphors.*Journal of The Electrochemical Society***134**, 2932–2934 (1987). - 56.
Chen, J. & Shen, W. Z. Raman study of phonon modes and disorder effects in Pb

_{1-x}Sr_{x}Se alloys grown by molecular beam epitaxy.*Journal of Applied Physics***99**, 013513 (2006). - 57.
Hofmeister, A. M., Keppel, E. & Speck, A. K. Absorption and reflection infrared spectra of MgO and other diatomic compounds.

*Monthly Notices of the Royal Astronomical Society***345**, 16–38 (2003). - 58.
Bühner, W. & Hälg, W. Crystal dynamics of cesium iodide.

*physica status solidi (b)***46**, 679–686 (1971). - 59.
Wagner, V.

*et al.*Lattice dynamics and bond polarity of Be-chalcogenides a new class of ii-vi materials.*physica status solidi (b)***215**, 87–91 (1999). - 60.
Rolandson, S. & Raunio, G. Lattice dynamics of CsBr.

*Phys. Rev. B***4**, 4617–4623 (1971). - 61.
Verstraete, M. & Gonze, X. First-principles calculation of the electronic, dielectric, and dynamical properties of CaF

_{2}.*Phys. Rev. B***68**, 195123 (2003). - 62.
Mestres, N., Calleja, J., Aliev, F. & Belogorokhov, A. Electron localization in the disordered conductors TiNiSn and HfNiSn observed by raman and infrared spectroscopies.

*Solid State Communications***91**, 779–784 (1994). - 63.
Duman, S., Sütlü, A., BağcÄ, S., Tütüncü, H. M & Srivastava, G. P Structural, elastic, electronic, and phonon properties of zinc-blende and wurtzite BeO.

*Journal of Applied Physics***105**, 033719 (2009). - 64.
Kanney, L. B., Gillis, N. S. & Raich, J. C. Lattice dynamics and phase transition in sodium azidea.

*The Journal of Chemical Physics***67**, 81–85 (1977). - 65.
Massa, N. E., Mitra, S. S., Prask, H., Singh, R. S. & Trevino, S. F. Infrared-active lattice vibrations in alkali azides.

*The Journal of Chemical Physics***67**, 173–179 (1977). - 66.
Fadda, G., Zanzotto, G. & Colombo, L. First-principles study of the effect of pressure on the five zirconia polymorphs. II. static dielectric properties and raman spectra.

*Phys. Rev. B***82**, 064106 (2010). - 67.
Rignanese, G.-M., Detraux, F., Gonze, X. & Pasquarello, A. First-principles study of dynamical and dielectric properties of tetragonal zirconia.

*Phys. Rev. B***64**, 134301 (2001). - 68.
Kranert, C., Sturm, C., Schmidt-Grund, R. & Grundmann, M. Raman tensor elements of

*β*-Ga2O3.*Scientific reports***6**(2016). - 69.
Machon, D., McMillan, P. F., Xu, B. & Dong, J. High-pressure study of the

*β*-to-*α*transition in Ga_{2}O_{3}.*Phys. Rev. B***73**, 094125 (2006). - 70.
Wang, C. Y.

*et al.*Phase stabilization and phonon properties of single crystalline rhombohedral indium oxide.*Crystal Growth & Design***8**, 1257–1260 (2008). - 71.
Chen, S. & Bongiorno, A. Boundary conditions in periodic density functional calculations of insulating materials.

*Phys. Rev. B***83**, 165125 (2011). - 72.
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple.

*Phys. Rev. Lett.***77**, 3865–3868 (1996).

## Data Citations

- 1.
Petretto, G.

*et al.**Figshare*https://doi.org/10.6084/m9.figshare.c.3938023 (2018)

## Acknowledgements

G.P., X.G. and G.-M.R. acknowledge support from the Communauté française de Belgique through the BATTAB project (ARC 14/19-057). G.-M.R. is also grateful to the F.R.S.-FNRS for the financial support. Financial support was provided from F.R.S.-FNRS through the PDR Grants HiT4FiT (T.1031.14) and HTBaSE (T.1071.15). H.P.C.M acknowledges support by the National Research Fund, Luxembourg (Project OTPMD). This work was funded by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering Division under Contract No. DE-AC02-05-CH11231 : Materials Project program KC23MP. Additional support was provided by the Center for Next Generation Materials by Design, an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, Basic Energy Sciences under Awards DE-AC02-05CH11231 and DE-AC36-089028308. This work made use of resources of the National Energy Research Scientific Computing Center (NERSC), supported by the Office of Basic Energy Sciences of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

## Author information

## Affiliations

### Institute of Condensed Matter and Nanoscience (IMCN), Université catholique de Louvain, B-1348 Louvain-la-neuve, Belgium

- Guido Petretto
- , Henrique P.C. Miranda
- , Matteo Giantomassi
- , Michiel J. van Setten
- , Xavier Gonze
- , Geoffroy Hautier
- & Gian-Marco Rignanese

### Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA

- Shyam Dwaraknath
- , Donald Winston
- & Kristin A. Persson

### Department of Materials Science and Engineering, University of California, Berkeley, California 94720, USA

- Kristin A. Persson

## Authors

### Search for Guido Petretto in:

### Search for Shyam Dwaraknath in:

### Search for Henrique P.C. Miranda in:

### Search for Donald Winston in:

### Search for Matteo Giantomassi in:

### Search for Michiel J. van Setten in:

### Search for Xavier Gonze in:

### Search for Kristin A. Persson in:

### Search for Geoffroy Hautier in:

### Search for Gian-Marco Rignanese in:

### Contributions

G.P. performed the phonon calculations, developed the algorithm and the code, performed the data verification and analysis and wrote the paper. S.D. wrote the code to upload the raw data on the Materials Project website and collaborated on the development of web interface. H.P.C.M. developed the visualization tool of the phonon modes and the web interface on the Materials Project website. D.W. developed the web interface on the Materials Project website. M.G. developed the HT implementation and the pseudopotentials and collaborated on data verification. M.J.v.S. developed the HT implementation and the pseudopotentials. X.G. was involved in supervising the phonon calculations. K.A.P. was involved in supervising and planning the work and its integration with the Materials Project effort. G.H. was involved in supervising and planning the work and its integration with the Materials Project effort. G.-M.R. supervised and planned the work and collaborated on data verification. All the authors contributed to the writing of the paper.

### Competing interests

The authors declare no competing interests.

## Corresponding authors

Correspondence to Guido Petretto or Geoffroy Hautier or Gian-Marco Rignanese.

## Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.