Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization

Yu, Maituo; Yang, Shuyang; Wu, Chunzhi; Marom, Noa

doi:10.1038/s41524-020-00446-9


Article
Open access
Published: 27 November 2020


npj Computational Materials volume 6, Article number: 180 (2020)



Abstract

Within density functional theory (DFT), adding a Hubbard U correction can mitigate some of the deficiencies of local and semi-local exchange-correlation functionals, while maintaining computational efficiency. However, the accuracy of DFT+U largely depends on the chosen Hubbard U values. We propose an approach to determining the optimal U parameters for a given material by machine learning. The Bayesian optimization (BO) algorithm is used with an objective function formulated to reproduce the band structures produced by more accurate hybrid functionals. This approach is demonstrated for transition metal oxides, europium chalcogenides, and narrow-gap semiconductors. The band structures obtained using the BO U values are in agreement with hybrid functional results. Additionally, comparison to the linear response (LR) approach to determining U demonstrates that the BO method is superior.


Introduction

Density functional theory (DFT) is the workhorse of electronic structure simulations. In particular, semi-local exchange-correlation functionals, such as the generalized gradient approximation (GGA) of Perdew, Burke, and Ernzerhof (PBE)^1,2, are widely used in high-throughput materials discovery efforts^3,4,5. However, due to the self-interaction error (SIE), local and semi-local functionals systematically underestimate band gaps, occasionally to the extent that semiconductors are erroneously predicted to be metallic^6,7. One way to mitigate SIE is to include a fraction of exact (Fock) exchange in a hybrid functional^8,9, such as the Heyd-Scuseria-Ernzerhof (HSE) functional^10,11. Hybrid functionals produce improved band gaps; however, their relatively high computational cost may be prohibitive for large systems, such as interface models containing several hundred atoms, and/or for screening a large number of materials.

The DFT+U method is an alternative approach, first introduced by Anisimov et al.¹² and further developed by Dudarev et al.⁷ A Hubbard-like model is adopted to correct the self-interaction error as follows:

$${E}_{{\mathrm{tot}}}={E}_{{\mathrm{DFT}}}+\frac{U-J}{2}\mathop{\sum }\limits_{m,\sigma }\left({n}_{m,\sigma }-{n}_{m,\sigma }^{2}\right),$$

(1)

where n is the atomic-orbital occupation number, m is the orbital momentum, σ is a spin index, U represents the on-site Coulomb repulsion, and J represents the exchange interaction. The exchange interaction may be incorporated into the Coulomb term to define the effective Hubbard U as U_eff = U − J⁷. The accuracy of DFT+U calculations hinges on the choice of the system-dependent parameter, U_eff.
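As a minimal sketch of how the correction in Eq. 1 behaves, the snippet below evaluates the penalty term for a set of diagonal occupation numbers, with one entry per orbital and spin. The function name and the occupation values are illustrative assumptions, not part of the original work or of any DFT code.

```python
def hubbard_correction(occupations, u_eff):
    """Energy correction of Eq. 1 (in eV) for diagonal occupations.

    occupations: iterable of occupation numbers n_{m,sigma} in [0, 1],
                 one entry per orbital m and spin sigma.
    u_eff:       effective Hubbard parameter U - J in eV.
    """
    return 0.5 * u_eff * sum(n - n * n for n in occupations)


# Illustrative occupations only (not taken from any calculation).
integer_occ = [1.0, 1.0, 1.0, 0.0, 0.0]
fractional_occ = [1.0, 1.0, 0.5, 0.5, 0.0]

print(hubbard_correction(integer_occ, 6.8))      # integer occupations
print(hubbard_correction(fractional_occ, 6.8))   # fractional occupations
print(hubbard_correction(fractional_occ, -7.5))  # negative U_eff
```

The term vanishes for fully occupied or empty orbitals, is positive for fractional occupations when U_eff > 0, and changes sign when U_eff < 0.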

Often, U_eff is determined empirically by searching for values that reproduce experimental results, such as the band gap of a given material. The empirical approach will inevitably fail if no experimental data are available, which is frequently the case in materials discovery efforts. Several approaches have been proposed for determining the U parameter from first principles¹³. One popular approach is the linear response (LR) method, based on constrained DFT (CDFT) proposed by Cococcioni and de Gironcoli¹⁴. In this approach, linear behavior of the total energy with respect to the occupation number is imposed to correct the unphysical curvature of local and semi-local functionals. The effective U parameter is then given by the difference between the inverse non-interacting density response, ${\chi }_{0}^{-1}$, and the inverse interacting density response, χ⁻¹, which correspond, respectively, to the second derivatives of the non-charge-self-consistent DFT energy, E, and the charge-self-consistent DFT energy, E^KS, with respect to the localized occupation of a single site, q_I:

$${U}_{{\mathrm{eff}}}=\frac{{\partial }^{2}E[\{{q}_{I}\}]}{\partial {q}_{I}^{2}}-\frac{{\partial }^{2}{E}^{{\mathrm{KS}}}[\{{q}_{I}\}]}{\partial {q}_{I}^{2}}={({\chi }_{0}^{-1}-{\chi }^{-1})}_{II}$$

(2)

To simulate the variation of occupations in an infinite crystal, a super-cell is constructed. The size of the super-cell is increased until the value of U_eff converges, which may result in a significant computational cost. A modified self-consistent formulation of LR has been proposed by Kulik et al.¹⁵. Another method for determining the Hubbard U parameter from first principles is the unrestricted Hartree-Fock (UHF) approach proposed by Mosey et al.^16,17. Within this approach, UHF calculations are performed for an electrostatically embedded finite-sized cluster to simulate the bulk material. Similar to the super-cell size in the LR method, the U parameters have to be converged with increasing cluster size. A third approach is the constrained random-phase approximation (cRPA)^18,19,20, which is significantly more computationally expensive.

Here, we propose an approach for determining U_eff based on Bayesian optimization (BO). To demonstrate the performance of our approach, we have chosen three classes of materials, for which semi-local functionals are known to perform poorly: transition metal monoxides, europium chalcogenides, and narrow-gap semiconductors. The results and the computational cost of GGA+U_BO are also compared to the LR method for determining U_eff. We show that for materials of all three classes, BO with a well-designed objective function produces band structures of comparable quality to a hybrid functional. The band structures obtained using GGA+U_BO are either similar to or better than those obtained with GGA+U_LR. In all cases BO is more computationally efficient than LR.

Results and discussion

Bayesian optimization

BO²¹ is a machine learning algorithm that performs global optimization of a black-box function by guessing the shape of the function and then iteratively improving that guess by sequentially sampling points that are promising and/or have high information content. A Bayesian statistical model is used to emulate the objective function. A Gaussian process (see additional information in the Supplementary Discussion) is a common choice for BO algorithms²² because it also quantifies the uncertainty associated with each prediction. Future function evaluations are decided by an acquisition function²³. BO is superior to grid search because it utilizes acquired data to make informed sampling decisions, thus requiring fewer expensive function evaluations (e.g., DFT calculations). The successful application of Bayesian optimization requires an appropriate objective function. Here, the surrogate objective function, $f(\overrightarrow{U})$, is formulated such that its maximum corresponds to the U_eff values that best reproduce the band gap, E_g, and the band structure obtained from HSE:

$$f(\overrightarrow{U})=-{\alpha }_{1}{({{\rm{E}}}_{{\rm{g}}}^{{\rm{HSE}}}-{{\rm{E}}}_{{\rm{g}}}^{{\rm{PBE+U}}})}^{2}-{\alpha }_{2}{\left(\Delta {\rm{Band}}\right)}^{2}$$

(3)

Here, $\overrightarrow{U}=[{U}^{1},{U}^{2},\ldots ,{U}^{n}]$ is the vector of U_eff values applied to different atomic species and Uⁱ ∈ [−10, 10] eV. ΔBand is defined similarly to ref. ²⁴ as the root-mean-square error of the PBE+U band structure with respect to HSE:

$$\Delta {\rm{Band}}=\sqrt{\frac{1}{{N}_{{\mathrm{E}}}}\mathop{\sum }\limits_{i = 1}^{{N}_{k}}\mathop{\sum }\limits_{j = 1}^{{N}_{b}}{({\epsilon }_{{\mathrm{HSE}}}^{j}[{k}_{i}]-{\epsilon }_{{\mathrm{PBE}}+U}^{j}[{k}_{i}])}^{2}}$$

(4)

N_E represents the total number of eigenvalues, ϵ, included in the comparison, N_k is the number of k-points, and N_b is the number of bands selected for comparison. To avoid double counting the band gap difference in the calculation of ΔBand, the valence band maximum (VBM) and conduction band minimum (CBM) are shifted to zero for both the PBE+U and HSE band structures. Hence, ΔBand captures differences in the qualitative features of the band structures produced by PBE+U vs. HSE, independently of the difference in the band gap. Hybrid functionals have been used in the past as a reference for DFT+U²⁵. Here, the HSE functional has been chosen as the reference thanks to its well-established accuracy for various materials^26,27,28. The method can be easily adapted to use any other reference band structure, including different hybrid functionals or many-body perturbation theory. The coefficients α₁ and α₂ may be used to assign different weights to the band gap vs. the band structure. We set α₁ = 0.25 and α₂ = 0.75 as default. Additional analysis of the sensitivity of the U parameters and the resulting band structures to the choice of α₁ and α₂ is provided in the Supplementary Discussion.
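The objective function of Eqs. 3 and 4 can be sketched in a few lines. The example below assumes that the eigenvalues are already available as nested lists (k-points by bands), split into valence and conduction sets; the function names, the data layout, and the toy numbers are assumptions made for illustration and do not reproduce the authors' code.

```python
import math


def align_bands(valence, conduction):
    """Shift the VBM and the CBM to zero, as described for Eq. 4.

    valence, conduction: lists over k-points of lists of eigenvalues (eV).
    Returns the band gap and the two shifted band sets.
    """
    vbm = max(max(row) for row in valence)
    cbm = min(min(row) for row in conduction)
    shifted_v = [[e - vbm for e in row] for row in valence]
    shifted_c = [[e - cbm for e in row] for row in conduction]
    return cbm - vbm, shifted_v, shifted_c


def delta_band(ref, trial):
    """Root-mean-square eigenvalue difference (Eq. 4) between aligned bands."""
    diffs = [(a - b) ** 2
             for ref_row, trial_row in zip(ref, trial)
             for a, b in zip(ref_row, trial_row)]
    return math.sqrt(sum(diffs) / len(diffs))


def objective(hse, pbe_u, alpha1=0.25, alpha2=0.75):
    """Eq. 3; hse and pbe_u are (valence, conduction) tuples of equal shape."""
    gap_ref, v_ref, c_ref = align_bands(*hse)
    gap_trial, v_trial, c_trial = align_bands(*pbe_u)
    d_band = delta_band(v_ref + c_ref, v_trial + c_trial)
    return -alpha1 * (gap_ref - gap_trial) ** 2 - alpha2 * d_band ** 2


# Toy eigenvalues (eV) for 2 k-points, 2 valence bands, 1 conduction band.
hse = ([[-1.0, 0.0], [-1.5, -0.5]], [[4.0], [5.0]])
pbe_u = ([[-1.2, 0.0], [-1.5, -0.4]], [[3.0], [4.2]])
print(objective(hse, pbe_u))
```

Because both band sets are shifted before the comparison, a rigid error in the gap enters only through the first term, which is the separation of concerns that the weights α₁ and α₂ rely on.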

BO is applied to maximize the objective function, as illustrated in Fig. 1. To initialize the calculation, the geometry information and input settings of VASP are required. A Gaussian process with the radial basis function kernel (see additional information in the Supplementary Discussion) is used as the statistical model to fit the objective function²⁹. The Gaussian process defines a mean, μ, and a standard deviation, σ, for every point, $\overrightarrow{U}$, and updates those parameters from the training data in each iteration²⁹. The upper confidence bound (UCB)²⁹ acquisition function is used to predict the value that would be generated by evaluation of the objective function at a new point and decide what value of $\overrightarrow{U}$ to sample in the nth iteration:

$${\vec U _n} = \mathop {\arg \max }\limits_{\vec U } \,\mu (\vec U ) + \kappa \sigma (\vec U )$$

(5)

The hyperparameter κ controls the trade-off between exploration and exploitation. Here, we set κ = 1. In each iteration VASP is called to evaluate $f(\overrightarrow{{U}_{n}})$. The posterior probability distribution and acquisition function are updated until the maximal number of iterations, N, is reached. Then, the code outputs the value of $\overrightarrow{U}$ that maximizes $f(\overrightarrow{U})$. The reference HSE calculation is performed only once. The total computational cost of BO amounts to performing one HSE calculation and N PBE+U calculations. The computational cost of updating the BO posterior probability distribution and acquisition function in each iteration is negligible compared to the cost of DFT calculations. Because N is typically small and all calculations are performed for one unit cell (as opposed to a supercell or a large cluster), the computational cost of determining U_eff by BO is often lower than that of determining U_eff by the aforementioned first principles methods. Below, we demonstrate the performance of PBE+U_BO for NiO, EuTe, and InAs. In the Supplementary Discussion, we provide additional examples for EuS, InP, InSb, GaSb, Ge, NiO, MnO, FeO, and CoO. The LR method of determining U_eff is used for comparison. We note that all DFT+U results presented here are based on the implementation of the Dudarev formalism in VASP. Different DFT+U implementations may yield different results³⁰.
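The loop described above (Gaussian process with a radial basis function kernel, UCB acquisition with κ = 1, a fixed number of iterations) can be illustrated with a one-dimensional, dependency-free sketch. Everything specific here is an assumption: the kernel length scale, the jitter, the two initial samples, the candidate grid, and the toy objective that stands in for a PBE+U run scored with Eq. 3. It is not the authors' implementation.

```python
import math


def rbf(a, b, length=2.0):
    """Radial basis function kernel for scalar inputs (unit signal variance)."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)


def cholesky(mat):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(mat)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = mat[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def solve_lower(low, rhs):
    """Forward substitution: solve low @ x = rhs."""
    x = []
    for i, row in enumerate(low):
        x.append((rhs[i] - sum(row[k] * x[k] for k in range(i))) / row[i])
    return x


def gp_fit(xs, ys, jitter=1e-6):
    """Factorize the kernel matrix once; targets are centered on their mean."""
    y_mean = sum(ys) / len(ys)
    k_mat = [[rbf(a, b) + (jitter if i == j else 0.0)
              for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    low = cholesky(k_mat)
    return low, solve_lower(low, [y - y_mean for y in ys]), y_mean


def gp_predict(model, xs, x_new):
    """Posterior mean and standard deviation at x_new."""
    low, w, y_mean = model
    v = solve_lower(low, [rbf(a, x_new) for a in xs])
    mean = y_mean + sum(vi * wi for vi, wi in zip(v, w))
    var = max(1.0 - sum(vi * vi for vi in v), 0.0)  # rbf(x, x) = 1
    return mean, math.sqrt(var)


def bayes_opt(objective, lower=-10.0, upper=10.0, n_iter=15, kappa=1.0):
    """Maximize objective(u) on [lower, upper] using the UCB rule of Eq. 5."""
    grid = [lower + i * (upper - lower) / 200 for i in range(201)]
    xs = [lower, upper]  # two initial samples (an assumption)
    ys = [objective(x) for x in xs]
    for _ in range(n_iter):
        model = gp_fit(xs, ys)

        def ucb(u):
            mean, std = gp_predict(model, xs, u)
            return mean + kappa * std

        u_next = max((g for g in grid if g not in xs), key=ucb)
        xs.append(u_next)
        ys.append(objective(u_next))  # one PBE+U run per iteration in practice
    best = max(range(len(xs)), key=lambda i: ys[i])
    return xs[best], ys[best]


def toy_objective(u):
    """Hypothetical stand-in for one PBE+U calculation scored with Eq. 3."""
    return -0.01 * (u - 6.8) ** 2


best_u, best_f = bayes_opt(toy_objective)
print(f"best U_eff = {best_u:.1f} eV, f = {best_f:.4f}")
```

In a real workflow the call to `toy_objective` would be replaced by a DFT+U calculation, which is why the number of iterations, rather than the surrogate update, dominates the cost.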

Transition metal oxide: NiO

Transition metal oxides are among the materials most often studied with DFT+U¹⁵. In particular, NiO and other transition metal monoxides have been shown to be poorly described by the PBE functional because of their strongly correlated d electrons^31,32,33,34. Our PBE result shown in Fig. 2a is no exception. The PBE band gap of 0.73 eV is considerably underestimated compared to the HSE result of 4.26 eV shown in Fig. 2b. The latter is close to the experimental band gap of 4.0–4.3 eV^35,36. For both PBE and HSE, the valence band maximum (VBM) is located at point T. However, there are qualitative differences in the structure of the valence band and the location of the conduction band minimum (CBM). The contribution of different states to the PBE band structure, shown in Fig. 2a, suggests that the reason could be that the Ni 3d bands are located below the Ni 4s bands leading to an inverted band ordering at the Γ point, which should have been the CBM. Thus, we start by applying the Hubbard U correction to the Ni d states. Based on the HSE band structure, we included the top 10 valence bands and the bottom 4 conduction bands in the optimization. As shown in Fig. 3, the BO algorithm converges within 13 iterations and finds the optimum at ${U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}$= 6.8 eV. This leads to rearrangement of the top valence bands and moves the first two conduction bands upward, as shown in Fig. 2c. These changes increase the gap to 3.36 eV and correct the position of the CBM. In comparison, the LR method produces a ${U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}$ value of 5.4 eV, which yields a similar band structure with a somewhat smaller gap of 3.20 eV and the CBM incorrectly located between T and K, as shown in Fig. 2d.

**Fig. 2: Band structures of NiO obtained using different methods.**

Although applying a Hubbard U correction to the Ni d states leads to a significant improvement over pure PBE, some residual differences between the PBE+U and HSE results may be attributed to the non-negligible contribution of the oxygen 2p states, as shown in Fig. 2a³⁷. Therefore, a two-dimensional (2D) BO was performed. As illustrated in Fig. 4a, it results in Hubbard U values of ${U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}$= 5.9 eV and ${U}_{{\mathrm{eff}}}^{O,p}$= 9.4 eV after 55 iterations. This yields a gap of 3.70 eV and a band structure in closer agreement with HSE, as shown in Fig. 2e. Another indication of the closer agreement with HSE is an increase of the objective function from −0.37 to −0.11 eV². In comparison, LR produces a Hubbard U value of ${U}_{{\mathrm{eff}}}^{O,p}$= 8.3 eV for the O 2p states. With the two-parameter LR, the band gap increases to 3.53 eV and the CBM is in the right position, as shown in Fig. 2f. For NiO, the accuracy of the BO and LR methods for determining U is similar; however, the BO method is more efficient. For one-parameter optimization on the same number of CPU cores, LR with a 3 × 3 × 3 super-cell takes about 4.5 times longer than BO, as detailed in Supplementary Table 3. When two parameters are considered, LR is more computationally expensive than BO by a factor of eight.

Europium chalcogenide: EuTe

In addition to the d-block transition metals, f-block elements are considered as strongly correlated and are often treated with DFT+U^38,39. EuTe, a ferromagnetic semiconductor with a gap of 2.0 eV (refs. ^40,41), is a representative example. As shown in Fig. 5a, the PBE functional erroneously predicts EuTe to be metallic. Analysis of the contributions of different orbitals shows that the conduction bands are mostly formed by the d and s states of Eu. The 7 valence bands closest to the Fermi level are formed by the f states of Eu. The p states of Te are found in a separate manifold below the Eu-4f derived bands^42,43,44. The vanishing gap can be attributed to the overlap between bands dominated by the 4f and 5d states of Eu. The HSE calculation, shown in Fig. 5b, produces an indirect band gap of 1.24 eV with the VBM located at the Γ point.

**Fig. 5: Band structures of EuTe obtained using different methods.**

BO was performed with the Hubbard U correction applied to the 4f orbitals of Eu. Twenty eigenvalues around the Fermi level (10 above and 10 below) were included in the optimization. The resulting band structure with ${U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}$ = 7.1 eV is plotted in Fig. 5c. The Hubbard U correction pushes the Eu 4f bands away from the Fermi level and opens a band gap of 0.71 eV. The PBE+U_BO calculation reproduces the qualitative features of the HSE band structure. In comparison, the LR method produces a value of ${U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}$ = 5.5 eV. As shown in Fig. 5d, this results in a somewhat smaller band gap of 0.56 eV. In this case, BO also produces similar results to LR at a fraction of the computational cost. The total amount of CPU time required for LR with a 3 × 3 × 3 super-cell is higher than that for BO by a factor of 9, as shown in Supplementary Table 3. Based on Fig. 5a, the d states of Eu contribute significantly to the bottom conduction bands. Therefore, it may be possible to further reduce the difference between the HSE and PBE+U results by considering the d orbital of Eu (refs. ^45,46) in 2D BO. However, applying U corrections to two orbitals of the same element is not implemented in VASP.

Narrow gap semiconductor: InAs

Although all the examples discussed so far involve strongly correlated electrons, SIE also manifests in PBE calculations of narrow-gap semiconductors^47,48,49. The band structure of InAs, shown in Fig. 6a, is a representative example, where PBE produces no band gap⁵⁰. HSE, shown in Fig. 6b, gives a band gap of 0.37 eV, which is comparable to the experimental gap of 0.417 eV^51,52. We find that conducting BO with a single U_eff applied to the p orbitals of either In or As yields no band gap. Therefore, 2D BO is performed to optimize ${U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}$ and ${U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}$ simultaneously. The resulting band structure with ${U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}$ = −0.5 eV and ${U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}$ = −7.5 eV is shown in Fig. 6c. PBE+U_BO produces a band gap of 0.31 eV and a band structure in good agreement with HSE. Negative U_eff values are theoretically permissible when the exchange term, J, is larger than the on-site Coulomb repulsion, U, as suggested in refs. ^53,54,55,56,57. It has been argued that negative values of U_eff are appropriate for delocalized states, such as the In s and As p states, where the exchange-correlation hole is overestimated by GGA. In comparison, the LR method, which does not permit negative values of U_eff, yields ${U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}$ = 0.7 eV and ${U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}$ = 3.3 eV. This produces a band structure with no gap, as shown in Fig. 6d. Thus, for InAs, BO is not only more efficient, but also more accurate than LR. In the Supplementary Discussion, we further demonstrate the transferability of the U values found by BO for bulk InAs to a slab of InAs with 11 atomic layers.

In summary, we have developed a method of determining the optimal Hubbard U parameter in DFT+U by using the Bayesian optimization machine learning algorithm. The objective function was formulated to reproduce as closely as possible the band gap and the qualitative features of the band structure obtained with a hybrid functional. We have demonstrated robust performance for several materials, including transition metal oxides, Eu chalcogenides, and narrow-gap semiconductors. PBE+U_BO consistently produces band structures comparable to HSE. Furthermore, BO is more efficient than the linear response method and performs better, particularly in cases that call for negative values of U_eff. Based on this, we conclude that PBE+U_BO can provide the accuracy of a hybrid functional at the computational cost of a semi-local functional. This may enable conducting simulations for larger systems, such as surfaces and interfaces, which would be unfeasible with a hybrid functional.

Methods

Computational details

All DFT calculations were performed using the Vienna Ab Initio Simulation Package (VASP) code with the projector augmented wave (PAW) method^58,59,60. Spin-orbit coupling (SOC) was included in the calculations of transition metal oxides, Eu chalcogenides, and narrow-gap semiconductors⁶¹. Details of the lattice parameter and energy cutoff used for each compound are provided in Supplementary Table 1. The Brillouin zone was sampled using an 8 × 8 × 8 k-point grid for PBE and PBE+U calculations and 6 × 6 × 6 for HSE calculations. The coordinates of the high-symmetry k-points used for plotting the band structures are provided in Supplementary Table 2.
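For readers who want to try a U_eff value in VASP, the sketch below formats the standard DFT+U tags of an INCAR file for the simplified (Dudarev) scheme, in which only U − J matters. The helper name, the species ordering, and the choice to put U_eff into LDAUU with LDAUJ set to zero are assumptions made for illustration; the original input files are not reproduced here, and a magnetic cell may require more than one species entry per element.

```python
def ldau_tags(species, u_eff):
    """Return INCAR lines for Dudarev-type DFT+U (LDAUTYPE = 2).

    species: element symbols in POSCAR order.
    u_eff:   dict mapping symbol -> (angular momentum l, U_eff in eV);
             species missing from the dict get no correction (LDAUL = -1).
    """
    ang = [u_eff[s][0] if s in u_eff else -1 for s in species]
    vals = [u_eff[s][1] if s in u_eff else 0.0 for s in species]
    return [
        "LDAU = .TRUE.",
        "LDAUTYPE = 2",
        "LDAUL = " + " ".join(str(a) for a in ang),
        "LDAUU = " + " ".join(f"{u:.2f}" for u in vals),
        "LDAUJ = " + " ".join("0.00" for _ in species),
        # Larger LMAXMIX is recommended when d (4) or f (6) states are corrected.
        f"LMAXMIX = {6 if 3 in ang else 4 if 2 in ang else 2}",
    ]


# 2D BO values for NiO quoted in the text (Ni d and O p).
for line in ldau_tags(["Ni", "O"], {"Ni": (2, 5.9), "O": (1, 9.4)}):
    print(line)
```

A BO driver would regenerate these lines with a new LDAUU entry before each PBE+U run.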

Linear response calculations

For LR calculations, a small potential was applied to the target state of a single site. Non-charge-self-consistent and charge-self-consistent calculations of the state occupation were performed as the potential was varied from −0.04 eV to 0.04 eV in increments of 0.02 eV. The derivatives of the occupation with respect to the potential give the non-interacting and interacting response matrices, used in Eq. 2 to calculate U_eff (see also Yang et al.⁶²). To avoid interactions between periodic images of the perturbed atom, a 3 × 3 × 3 super-cell was constructed.
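The post-processing step of the LR procedure can be sketched for a single perturbed site, where the response matrices of Eq. 2 reduce to scalars. The occupations below are synthetic numbers generated from assumed response values, not results from any calculation, and the sign convention (occupation decreasing as the applied potential increases) is also an assumption, since codes differ on it. A full treatment would build and invert the response matrices over all sites in the super-cell.

```python
def slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return num / den


def linear_response_u(potentials, occ_bare, occ_scf):
    """Scalar, single-site form of Eq. 2: U_eff = 1/chi0 - 1/chi (eV)."""
    chi0 = slope(potentials, occ_bare)  # non-charge-self-consistent response
    chi = slope(potentials, occ_scf)    # charge-self-consistent response
    return 1.0 / chi0 - 1.0 / chi


# Potential shifts from the Methods section: -0.04 to 0.04 eV in 0.02 eV steps.
potentials = [-0.04, -0.02, 0.0, 0.02, 0.04]
# Synthetic occupations built from assumed responses of -0.5 and -0.1 per eV.
occ_bare = [8.0 - 0.5 * a for a in potentials]
occ_scf = [8.0 - 0.1 * a for a in potentials]

print(linear_response_u(potentials, occ_bare, occ_scf))
```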

Data availability

Data will be available from the corresponding author upon reasonable request.

Code availability

The BO code developed here is available at: https://github.com/maituoy/BayesianOpt4dftu.

References

Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple [phys. rev. lett. 77, 3865 (1996)]. Phys. Rev. Lett. 78, 1396–1396 (1997).
Curtarolo, S. et al. The high-throughput highway to computational materials design. Nat. Mater. 12, 191–201 (2013).
Jain, A. et al. The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: The open quantum materials database (oqmd). JOM 65, 1501–1509 (2013).
Hüfner, S. Electronic structure of NiO and related 3d-transition-metal compounds. Adv. Phys. 43, 183–356 (1994).
Dudarev, S. L., Botton, G. A., Savrasov, S. Y., Humphreys, C. J. & Sutton, A. P. Electron-energy-loss spectra and the structural stability of nickel oxide: An lsda+u study. Phys. Rev. B 57, 1505–1509 (1998).
Perdew, J. P. Climbing the ladder of density functional approximations. MRS Bull. 38, 743–750 (2013).
Becke, A. D. A new mixing of Hartree-Fock and local density-functional theories. J. Chem. Phys. 98, 1372–1377 (1993).
Heyd, J. & Scuseria, G. E. Hybrid functionals based on a screened Coulomb potential. J. Chem. Phys. 118, 8207–8215 (2003).
Heyd, J., Scuseria, G. E. & Ernzerhof, M. Erratum: hybrid functionals based on a screened coulomb potential [J. CHEM. Phys. 118, 8207 (2003)]. J. Chem. Phys. 124, 219906 (2006).
Anisimov, V. I., Zaanen, J. & Andersen, O. K. Band theory and Mott insulators: Hubbard U instead of Stoner I. Phys. Rev. B 44, 943–954 (1991).
Yu, K. & Carter, E. A. Communication: comparing ab initio methods of obtaining effective U parameters for closed-shell materials. J. Chem. Phys. 140, 121105 (2014).
Cococcioni, M. & de Gironcoli, S. Linear response approach to the calculation of the effective interaction parameters in the LDA+U method. Phys. Rev. B 71, 035105 (2005).
Kulik, H. J., Cococcioni, M., Scherlis, D. A. & Marzari, N. Density functional theory in transition-metal chemistry: a self-consistent hubbard u approach. Phys. Rev. Lett. 97, 103001 (2006).
Mosey, N. J. & Carter, E. A. Ab initio evaluation of coulomb and exchange parameters for DFT+U calculations. Phys. Rev. B 76, 155123 (2007).
Mosey, N. J., Liao, P. & Carter, E. A. Rotationally invariant ab initio evaluation of Coulomb and exchange parameters for DFT+U calculations. J. Chem. Phys. 129, 14103 (2008).
Aryasetiawan, F., Karlsson, K., Jepsen, O. & Schönberger, U. Calculations of Hubbard U from first-principles. Phys. Rev. B 74, 125106 (2006).
Miyake, T. & Aryasetiawan, F. Screened coulomb interaction in the maximally localized wannier basis. Phys. Rev. B 77, 085122 (2008).
Şaşíoğlu, E., Friedrich, C. & Blügel, S. Effective coulomb interaction in transition metals from constrained random-phase approximation. Phys. Rev. B 83, 121101 (2011).
Frazier, P. I. A tutorial on bayesian optimization. arXiv https://arxiv.org/abs/1807.02811 (2018).
Brochu, E., Cora, V. M., & De Freitas, N. A tutorial on bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint at https://arxiv.org/abs/1012.2599 (2010).
Snoek, J., Larochelle, H. & Adams, R. P. Practical bayesian optimization of machine learning algorithms. In Advances In Neural Information Processing Systems. 2951–2959 (Neural Information Processing Systems Foundation, Inc., 2012).
Huhn, W. P. & Blum, V. One-hundred-three compound band-structure benchmark of post-self-consistent spin-orbit coupling treatments in density functional theory. Phys. Rev. Mater. 1, 033803 (2017).
Topsakal, M. & Wentzcovitch, R. M. Accurate projected augmented wave (PAW) datasets for rare-earth elements (RE=La-Lu). Comput. Mater. Sci. 95, 263–270 (2014).
Zhang, G.-X., Reilly, A. M., Tkatchenko, A. & Scheffler, M. Performance of various density-functional approximations for cohesive properties of 64 bulk solids. New J. Phys. 20, 063020 (2018).
Garza, A. J. & Scuseria, G. E. Predicting band gaps with hybrid density functionals. J. Phys. Chem. Lett. 7, 4165–4170 (2016).
Friedrich, C., Betzinger, M., Schlipf, M., Blügel, S. & Schindlmayr, A. Hybrid functionals and gw approximation in the flapw method. J. Phys. Condens. Matter 24, 293201 (2012).
Williams, C. K. I. & Rasmussen, C. E. Gaussian Processes For Machine Learning. Vol. 2 (MIT press Cambridge, 2006).
Kick, M., Reuter, K. & Oberhofer, H. Intricacies of DFT+U, not only in a numeric atom centered orbital framework. J. Chem. Theory Comput. 15, 1705–1718 (2019).
Ye, L.-H., Luo, N., Peng, L.-M., Weinert, M. & Freeman, A. J. Dielectric constant of NiO and LDA+U. Phys. Rev. B 87, 075115 (2013).
Pask, J. E., Singh, D. J., Mazin, I. I., Hellberg, C. S. & Kortus, J. Structural, electronic, and magnetic properties of MnO. Phys. Rev. B 64, 024403 (2001).
Deng, H.-X., Li, J., Li, S.-S., Xia, J.-B., Walsh, A. & Wei, S.-H. Origin of antiferromagnetism in CoO: a density functional theory study. Appl. Phys. Lett. 96, 162508 (2010).
Dufek, P., Blaha, P., Sliwko, V. & Schwarz, K. Generalized-gradient-approximation description of band splittings in transition-metal oxides and fluorides. Phys. Rev. B 49, 10170 (1994).
Hüfner, S., Osterwalder, J., Riesterer, T. & Hulliger, F. Photoemission and inverse photoemission spectroscopy of NiO. Solid State Commun. 52, 793–796 (1984).
Sawatzky, G. A. & Allen, J. W. Magnitude and origin of the band gap in NiO. Phys. Rev. Lett. 53, 2339 (1984).
Himmetoglu, B. & Wentzcovitch, R. M. First-principles study of electronic and structural properties of CuO. Phys. Rev. B 84, 115108 (2011).
Jiang, H., Gomez-Abal, R. I., Rinke, P. & Scheffler, M. Localized and Itinerant states in lanthanide oxides united by GW@LDA+U. Phys. Rev. Lett. 102, 126403 (2009).
Jiang, H., Gomez-Abal, R. I., Rinke, P. & Scheffler, M. First-principles modeling of localized d states with the GW@LDA+U approach. Phys. Rev. B 82, 045108 (2010).
Wachter, P. The optical electrical and magnetic properties of the europium chalcogenides and the rare earth pnictides. Crit. Rev. Solid State Mater. Sci. 3, 189–241 (1972).
Ghosh, D. B., De, M. & De, S. K. Electronic structure and magneto-optical properties of magnetic semiconductors: Europium monochalcogenides. Phys. Rev. B 70, 115211 (2004).
Larson, P. & Lambrecht, W. R. L. Electronic structure and magnetism of europium chalcogenides in comparison with gadolinium nitride. J. Phys. Condens. Matter 18, 11333–11345 (2006).
Shi, S. Q. et al. Electronic structure and magnetism of EuX (X = O, S, Se and Te): A first-principles investigation. EPL 83, 4–9 (2008).
Schlipf, M., Betzinger, M., Ležaić, M., Friedrich, C. & Blügel, S. Structural, electronic, and magnetic properties of the europium chalcogenides: a hybrid-functional DFT study. Phys. Rev. B 88, 94433 (2013).
Ingle, N. J. C. & Elfimov, I. S. Influence of epitaxial strain on the ferromagnetic semiconductor EuO: first-principles calculations. Phys. Rev. B 77, 121202 (2008).
An, J. M., Barabash, S. V., Ozolins, V., van Schilfgaarde, M. & Belashchenko, K. D. First-principles study of phase stability of Gd-doped EuO and EuS. Phys. Rev. B 83, 064105 (2011).
Kim, Y.-S., Hummer, K. & Kresse, G. Accurate band structures and effective masses for InP, InAs, and InSb using hybrid functionals. Phys. Rev. B 80, 035203 (2009).
Massidda, S. et al. Structural and electronic properties of narrow-band-gap semiconductors: InP, InAs, and InSb. Phys. Rev. B 41, 12079–12085 (1990).
Soluyanov, A. A. et al. Optimizing spin-orbit splittings in InSb Majorana nanowires. Phys. Rev. B 93, 115317 (2016).
Malyi, O. I., Dalpian, G. M., Zhao, X.-G., Wang, Z. & Zunger, A. Realization of predicted exotic materials: the burden of proof. Mater. Today 32, 35–45 (2019).
Vurgaftman, I., Meyer, J. R. & Ram-Mohan, L. R. Band parameters for III-V compound semiconductors and their alloys. J. Appl. Phys. 89, 5815–5875 (2001).
Madelung, O., Rössler, U. & Schulz, M. Group IV Elements, IV-IV and III-V Compounds. Part b - Electronic, Transport, Optical and Other Properties. Landolt-Börnstein - Group III Condensed Matter, 41A1β (Heidelberg Springer, Berlin, 2002).
Micnas, R., Ranninger, J. & Robaszkiewicz, S. Superconductivity in narrow-band systems with local nonretarded attractive interactions. Rev. Mod. Phys. 62, 113–171 (1990).
Hase, I. & Yanagisawa, T. Madelung energy of the valence-skipping compound BaBiO₃. Phys. Rev. B 76, 174103 (2007).
Nakamura, H., Hayashi, N., Nakai, N., Okumura, M. & Machida, M. First-principle electronic structure calculations for magnetic moment in iron-based superconductors: an LSDA+ negative U study. Phys. C Supercond. 469, 908–911 (2009).
Persson, C. & Mirbt, S. Improved electronic structure and optical properties of sp-hybridized semiconductors using LDA+U SIC. Braz. J. Phys. 36, 286–290 (2006).
Cococcioni, M. The LDA+U approach: a simple Hubbard correction for correlated ground states. Correlated Electrons: From Models to Materials Modeling and Simulation. Vol. 2, Ch. 4 (Verlag des Forschungszentrum Jülich, Jülich, Germany, 2012).
Kresse, G. Ab initio molecular dynamics for liquid metals. J. Non Cryst. Solids 192–193, 222–229 (1995).
Rolland, A., Pouget, L., Brunel, M., Loas, G. & Alouini, M. Bridging the THz to RF gap by four-wave mixing in a highly nonlinear fiber. In Proc. 2013 Conference on Lasers and Electro-Optics, CLEO 2013 59, 11–19 (2013).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953 (1994).
Steiner, S., Khmelevskyi, S., Marsmann, M. & Kresse, G. Calculation of the magnetic anisotropy with projected-augmented-wave methodology and the case study of disordered Fe₁₋ₓCoₓ alloys. Phys. Rev. B 93, 224425 (2016).
Yang, S., Wu, C. & Marom, N. Topological properties of SnSe/EuS and SnTe/CaTe interfaces. Phys. Rev. Mater. 4, 034203 (2020).

Acknowledgements

Work on III–V semiconductors was funded by the National Science Foundation (NSF) through grant OISE-1743717. Work on transition metal oxides and Eu chalcogenides was funded by the U.S. Department of Energy through grant DE-SC0019274. This research used resources of the National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility operated under Contract No. DE-AC02-05CH11231.

Author information

These authors contributed equally: Maituo Yu, Shuyang Yang.

Authors and Affiliations

Department of Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Maituo Yu, Shuyang Yang, Chunzhi Wu & Noa Marom
Department of Physics, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Noa Marom
Department of Chemistry, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Noa Marom

Contributions

M.Y. and S.Y. implemented the algorithm, performed the calculations, and collected and analyzed the data. M.Y. and S.Y. contributed equally to this work. C.W. performed some of the calculations. N.M. led the project. All authors contributed to writing the manuscript.

Corresponding author

Correspondence to Noa Marom.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yu, M., Yang, S., Wu, C. et al. Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization. npj Comput Mater 6, 180 (2020). https://doi.org/10.1038/s41524-020-00446-9

Received: 06 June 2020
Accepted: 27 October 2020
Published: 27 November 2020
DOI: https://doi.org/10.1038/s41524-020-00446-9
