Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization

Abstract

Within density functional theory (DFT), adding a Hubbard U correction can mitigate some of the deficiencies of local and semi-local exchange-correlation functionals, while maintaining computational efficiency. However, the accuracy of DFT+U largely depends on the chosen Hubbard U values. We propose an approach to determining the optimal U parameters for a given material by machine learning. The Bayesian optimization (BO) algorithm is used with an objective function formulated to reproduce the band structures produced by more accurate hybrid functionals. This approach is demonstrated for transition metal oxides, europium chalcogenides, and narrow-gap semiconductors. The band structures obtained using the BO U values are in agreement with hybrid functional results. Additionally, comparison to the linear response (LR) approach to determining U demonstrates that the BO method is superior.

Introduction

Density functional theory (DFT) is the work horse of electronic structure simulations. In particular, semi-local exchange-correlation functionals, such as the generalized gradient approximation (GGA) of Perdew, Burke, and Ernzerhof (PBE)1,2 are widely used for high-throughput materials discovery efforts3,4,5. However, due to the self-interaction error (SIE), local and semi-local functionals systematically underestimate band gaps, occasionally to the extent that semiconductors are erroneously predicted to be metallic6,7. One way to mitigate SIE is including a fraction of excact (Fock) exchange in a hybrid functional8,9, such as the Heyd-Scuseria-Ernzerhof (HSE) functional10,11. Hybrid functionals produce improved band gaps; However, their relatively high computational cost may be prohibitive for large systems, such as interface models containing several hundred atoms, and/or for screening a large number of materials.

The DFT+U method is an alternative approach, first introduced by Anisimov et al.12 and further developed by Dudarev et al.7 A Hubbard-like model is adopted to correct the self-interaction error as follows:

$${E}_{{\mathrm{tot}}}={E}_{{\mathrm{DFT}}}+\frac{U-J}{2}\mathop{\sum }\limits_{\sigma}{n}_{m,\sigma }-{n}_{m,\sigma }^{2},$$
(1)

where n is the atomic-orbital occupation number, m is the orbital momentum, σ is a spin index, U represents the on-site Coulomb repulsion, and J represents the exchange interaction. The exchange interaction may be incorporated into the Coulomb term to define the effective Hubbard U as Ueff = U − J7. The accuracy of DFT+U calculations hinges on the choice of the system dependent parameter, Ueff.

Often, Ueff is determined empirically by searching for values that reproduce experimental results, such as the band gap of a given material. The empirical approach will inevitably fail if no experimental data are available, which is frequently the case in materials discovery efforts. Several approaches have been proposed for determining the U parameter from first principles13. One popular approach is the linear response (LR) method, based on constrained DFT (CDFT) proposed by Cococcioni and de Gironcoli14. In this approach, linear behavior of the total energy with respect to the occupation number is imposed to correct the unphysical curvature of local and semi-local functionals. The effective U parameter is then given by the difference between the inverse non-interacting density response, \({\chi }_{0}^{-1}\), and the inverse interacting density response, χ−1, which correspond, respectively, to the second derivatives of the non-charge-self-consistent DFT energy, E, and the charge-self-consistent DFT energy, EKS, with respect to the localized occupation of a single site, qI:

$${U}_{{\mathrm{eff}}}=\frac{{\partial }^{2}E[\{{q}_{I}\}]}{\partial {q}_{I}^{2}}-\frac{{\partial }^{2}{E}^{{\mathrm{KS}}}[\{{q}_{I}\}]}{\partial {q}_{I}^{2}}={({\chi }_{0}^{-1}-{\chi }^{-1})}_{II}$$
(2)

To simulate the variation of occupations in an infinite crystal, a super-cell is constructed. The size of the super-cell is increased until the value of Ueff converges, which may result in a significant computational cost. A modified self-consistent formulation of LR has been proposed by Kulik et al.15. Another method for determining the Hubbard U parameter from first principles is the unrestricted Hartree-Fock (UHF) approach proposed by Mosey et al.16,17. Within this approach, UHF calculations are performed for an electrostatically embedded finite-sized cluster to simulate the bulk material. Similar to the super-cell size in the LR method, the U parameters have to be converged with increasing cluster size. A third approach is the constrained random-phase approximation (cRPA)18,19,20, which is significantly more computationally expensive.

Here, we propose an approach for determining Ueff based on Bayesian optimization (BO). To demonstrate the performance of our approach, we have chosen three classes of materials, for which semi-local functionals are known to perform poorly: transition metal monoxides, europium chalcogenides, and narrow-gap semiconductors. The results and the computational cost of GGA+UBO are also compared to the LR method for determining Ueff. We show that for materials of all three classes, BO with a well-designed objective function produces band structures of comparable quality to a hybrid functional. The band structures obtained using GGA+UBO are either similar to or better than those obtained with GGA+ULR. In all cases BO is more computationally efficient than LR.

Results and discussion

Bayesian optimization

BO21 is a machine learning algorithm that performs global optimization of a black box function by guessing the shape of the function and then iteratively improving it by sequentially sampling points that are promising and/or have high information content. A Bayesian statistical model is used to emulate the objective function. Gaussian process (see additional information in the Supplementary Discussion) is a common choice for BO algorithms22 because it also quantifies the uncertainty associated with each prediction. Future function evaluations are decided by an acquisition function23. BO is superior to grid search because it utilizes acquired data to make informed sampling decisions, thus requiring fewer expensive function evaluations (e.g., DFT calculations). The successful application of Bayesian optimization requires an appropriate objective function. Here, the surrogate objective function, \(f(\overrightarrow{U})\), is formulated such that its maximum corresponds to the Ueff values that best reproduce the band gap, Eg, and the band structure obtained from HSE:

$$f(\overrightarrow{U})=-{\alpha }_{1}{({{\rm{E}}}_{{\rm{g}}}^{{\rm{HSE}}}-{{\rm{E}}}_{{\rm{g}}}^{{\rm{PBE+U}}})}^{2}-{\alpha }_{2}{\left(\Delta {\rm{Band}}\right)}^{2}$$
(3)

Here, \(\overrightarrow{U}=[{U}^{1},{U}^{2},\ldots ,{U}^{n}]\) is the vector of Ueff values applied to different atomic species and Ui [−10, 10] eV. ΔBand is defined similarly to ref. 24 as the mean squared error of the PBE+U band structure with respect to HSE:

$$\Delta {\rm{Band}}=\sqrt{\frac{1}{{N}_{{\mathrm{E}}}}\mathop{\sum }\limits_{i = 1}^{{N}_{k}}\mathop{\sum }\limits_{j = 1}^{{N}_{b}}{({\epsilon }_{{\mathrm{HSE}}}^{j}[{k}_{i}]-{\epsilon }_{{\mathrm{PBE}}+U}^{j}[{k}_{i}])}^{2}}$$
(4)

NE represents the total number of eigenvalues, ϵ, included in the comparison, Nk is the number of k-points, and Nb is the number of bands selected for comparison. To avoid double counting the band gap difference in the calculation of ΔBand, the valence band maximum (VBM) and conduction band minimum (CBM) are shifted to zero for both the PBE+U and HSE band structures. Hence, ΔBand captures differences in the qualitative features of the band structures produced by PBE+U vs. HSE, independently of the difference in the band gap. Hybrid functionals have been used in the past as a reference for DFT+U25. Here, the HSE functional has been chosen as the reference thanks to its well-established accuracy for various materials26,27,28. The method can be easily adapted to use any other reference band structure, including different hybrid functionals or many-body perturbation theory. The coefficients α1 and α2 may be used to assign different weights to the band gap vs. the band structure. We set α1 = 0.25 and α2 = 0.75 as default. Additional analysis of the sensitivity of the U parameters and the resulting band structures to the choice of α1 and α2 is provided in the Supplementary Discussion.

BO is applied to maximize the objective function, as illustrated in Fig. 1. To initialize the calculation, the geometry information and input settings of VASP are required. Gaussian process with the radial basis function kernel (see additional information in the Supplementary Discussion) is used as the statistical model to fit the objective function29. The Gaussian process defines a mean, μ, and a standard deviation, σ, for every point, \(\overrightarrow{U}\), and updates those parameters from the training data in each iteration29. The upper confidence bound (UCB)29 acquisition function is used to predict the value that would be generated by evaluation of the objective function at a new point and decide what value of \(\overrightarrow{U}\) to sample in the nth iteration:

$${\vec U _n} = \mathop {\arg \max }\limits_{\vec U } \,\mu (\vec U ) + \kappa \sigma (\vec U )$$
(5)
Fig. 1: Bayesian optimization.
figure1

Workflow of the algorithm for determining Ueff.

The hyperparameter κ controls the trade-off between exploration and exploitation. Here, we set κ = 1. In each iteration VASP is called to evaluate \(f(\overrightarrow{{U}_{n}})\). The posterior probability distribution and acquisition function are updated until the maximal number of iterations, N, is reached. Then, the code outputs the value of \(\overrightarrow{U}\) that maximizes \(f(\overrightarrow{U})\). The reference HSE calculation is performed only once. The total computational cost of BO amounts to performing one HSE calculation and N PBE+U calculations. The computational cost of updating the BO posterior probability distribution and acquisition function in each iteration is negligible compared to the cost of DFT calculations. Because N is typically small and all calculations are performed for one unit cell (as opposed to a supercell or a large cluster), the computational cost of determining Ueff by BO is often lower than that of determining Ueff by the aforementioned first principles methods. Below, we demonstrate the performance of PBE+UBO for NiO, EuTe, and InAs. In the Supplementary Discussion, we provide additional examples for EuS, InP, InSb, GaSb, Ge, NiO, MnO, FeO, and CoO. The LR method of determining Ueff is used for comparison. We note that all DFT+U results presented here are based on the implementation of the Dudarev formalism in VASP. Different DFT+U implementations may yield different results30.

Transition metal oxide: NiO

Transition metal oxides are among the materials most often studied with DFT+U15. In particular, NiO and other transition metal monoxides have been shown to be poorly described by the PBE functional because of their strongly correlated d electrons31,32,33,34. Our PBE result shown in Fig. 2a is no exception. The PBE band gap of 0.73 eV is considerably underestimated compared to the HSE result of 4.26 eV shown in Fig. 2b. The latter is close to the experimental band gap of 4.0–4.3 eV35,36. For both PBE and HSE, the valence band maximum (VBM) is located at point T. However, there are qualitative differences in the structure of the valence band and the location of the conduction band minimum (CBM). The contribution of different states to the PBE band structure, shown in Fig. 2a, suggests that the reason could be that the Ni 3d bands are located below the Ni 4s bands leading to an inverted band ordering at the Γ point, which should have been the CBM. Thus, we start by applying the Hubbard U correction to the Ni d states. Based on the HSE band structure, we included the top 10 valence bands and the bottom 4 conduction bands in the optimization. As shown in Fig. 3, the BO algorithm converges within 13 iterations and finds the optimum at \({U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}\)= 6.8 eV. This leads to rearrangement of the top valence bands and moves the first two conduction bands upward, as shown in Fig. 2c. These changes increase the gap to 3.36 eV and correct the position of the CBM. In comparison, the LR method produces a \({U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}\) value of 5.4 eV, which yields a similar band structure with a somewhat smaller gap of 3.20 eV and the CBM incorrectly located between T and K, as shown in Fig. 2d.

Fig. 2: Band structures of NiO obtained using different methods.
figure2

a PBE with the projected contributions of Ni d states, Ni s states, and O p states colored in green, yellow, and red, respectively; b HSE; c PBE+U with one-parameter BO; d PBE+U with one-parameter LR; e PBE+U with two-parameter BO; and f PBE+U with two-parameter LR. The VBM is referenced to 0 eV.

Fig. 3: 1D BO for NiO.
figure3

a The Gaussian process predicted mean values and b the acquisition function after 13 BO steps.

Although applying a Hubbard U correction to the Ni d states leads to a significant improvement over pure PBE, some residual differences between the PBE+U and HSE results may be attributed to the non-negligible contribution of the oxygen 2p states, as shown in Fig. 2a37. Therefore, a two-dimensional (2D) BO was performed. As illustrated in Fig. 4a, it results in Hubbard U values of \({U}_{{\mathrm{eff}}}^{{\mathrm{Ni}},d}\)= 5.9 eV and \({U}_{{\mathrm{eff}}}^{O,p}\)= 9.4 eV after 55 iterations. This yields a gap of 3.70 eV and a band structure in closer agreement with HSE, as shown in Fig. 2e. Another indication for the closer agreement with HSE is an increase of the objective function from −0.37 to −0.11 eV2. In comparison, LR produces a Hubbard U value of \({U}_{{\mathrm{eff}}}^{O,p}\)= 8.3 eV for the O 2p states. With the two-parameter LR, the band gap increases to 3.53 eV and the CBM is in the right position, as shown in Fig. 2f. For NiO, the accuracy of the BO and LR methods for determining U is similar, however the BO method is more efficient. For one-parameter optimization on the same number of CPU cores, LR with a 3 × 3 × 3 super-cell takes about 4.5 times longer than BO, as detailed in Supplementary Table 3. When two parameters are considered, LR is more computationally expensive than BO by a factor of eight.

Fig. 4: 2D BO for NiO.
figure4

a The Gaussian process predicted mean values, and b the acquisition function after 55 BO steps.

Europium chalcogenide: EuTe

In addition to the d-block transition metals, f-block elements are considered as strongly correlated and are often treated with DFT+U38,39. EuTe, a ferromagnetic semiconductor with a gap of 2.0 eV (refs. 40,41), is a representative example. As shown in Fig. 5a, the PBE functional erroneously predicts EuTe to be metallic. Analysis of the contributions of different orbitals shows that the conduction bands are mostly formed by the d and s states of Eu. The 7 valence bands closest to the Fermi level are formed by the f states of Eu. The p states of Te are found in a separate manifold below the Eu-4f derived bands42,43,44. The vanishing gap can be attributed to the overlap between bands dominated by the 4f and 5d states of Eu. The HSE calculation, shown in Fig. 5b, produces an indirect band gap of 1.24 eV with the VBM located at the Γ point.

Fig. 5: Band structures of EuTe obtained using different methods.
figure5

a PBE with the projected contributions of Eu f states, Eu d states, and Te p states colored in green, red, and yellow, respectively; b HSE; c PBE+U with \({U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}\) from BO; and d PBE+U with \({U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}\) from LR. The VBM is referenced to 0 eV.

BO was performed with the Hubbard U correction applied to the 4f orbitals of Eu. 20 eigenvalues around the Fermi level (10 above and 10 below) were included in the optimization. The resulting band structure with \({U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}\) = 7.1 eV is plotted in Fig. 5c. The Hubbard U correction pushes the Eu 4f bands away from the Fermi level and opens a band gap of 0.71 eV. The PBE+UBO calculation reproduces the qualitative features of HSE band structure. In comparison, the LR method produces a value of \({U}_{{\mathrm{eff}}}^{{\mathrm{Eu}},f}\) = 5.5 eV. As shown in Fig. 5d, this results in a somewhat smaller band gap of 0.56 eV. In this case, BO also produces similar results to LR at a fraction of the computational cost. The total amount of CPU time required for LR with a 3 × 3 × 3 super-cell is higher than BO by a factor of 9, as shown in Supplementary Table 3. Based on Fig. 5a, the d states of Eu contribute significantly to the bottom conduction bands. Therefore, it may possible to further reduce the difference between the HSE and PBE+U results by considering the d orbital of Eu (refs. 45,46) in 2D BO. However, applying U corrections to two orbitals of the same element is not implemented in VASP.

Narrow gap semiconductor: InAs

Although all the examples discussed so far involve strongly correlated electrons, SIE also manifests in PBE calculations of narrow-gap semiconductors47,48,49. The band structure of InAs, shown in Fig. 6a, is a representative example, where PBE produces no band gap50. HSE, shown in Fig 6b, gives a band gap of 0.37 eV, which is comparable to the experimental gap of 0.417 eV51,52. We find that conducting BO with a single Ueff applied to the p orbitals of either In or As yields no band gap. Therefore, 2D BO is performed to optimize \({U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}\) and \({U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}\) simultaneously. The resulting band structure with \({U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}\) = −0.5 eV and \({U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}\) = −7.5 eV is shown in Fig. 6c. PBE+UBO produces a band gap of 0.31 eV and a band structure in good agreement with HSE. Negative Ueff values are theoretically permissible when the exchange term, J, is larger than the on-site Coulomb repulsion, U, as suggested in refs. 53,54,55,56,57. It has been argued that negative values of Ueff are appropriate for delocalized states, such as the In s and As p states, where the exchange-correlation hole is overestimated by GGA. In comparison, the LR method, which does not permit negative values of Ueff, yields \({U}_{{\mathrm{eff}}}^{{\mathrm{In}},5p}\) = 0.7 eV and \({U}_{{\mathrm{eff}}}^{{\mathrm{As}},4p}\) = 3.3 eV. This produces a band structure with no gap, as shown in Fig. 6d. Thus, for InAs BO is not only more efficient, but also more accurate than LR. In the Supplementary Discussion, we further demonstrate the transferability of the U values found by BO for bulk InAs to a slab of InAs with 11 atomic layers.

Fig. 6: Band structures of InAs obtained using different methods.
figure6

a PBE with the projected contributions of In p states, In s states, and As p states colored in green, red, and yellow, respectively; b HSE; c PBE+U with \({U}_{{\mathrm{eff}}}^{{\mathrm{In}},p}\) and \({U}_{{\mathrm{eff}}}^{{\mathrm{As}},p}\) from BO; d PBE+U with \({U}_{{\mathrm{eff}}}^{{\mathrm{In}},p}\) and \({U}_{{\mathrm{eff}}}^{{\mathrm{As}},p}\) from LR. The VBM is referenced to 0 eV.

In summary, we have developed a method of determining the optimal Hubbard U parameter in DFT+U by using the Bayesian optimization machine learning algorithm. The objective function was formulated to reproduce as closely as possible the band gap and the qualitative features of the band structure obtained with a hybrid functional. We have demonstrated robust performance for several materials, including transition metal oxides, Eu chalcogenides, and narrow-gap semiconductors. PBE+UBO consistently produces band structures comparable to HSE. Furthermore, BO is more efficient than the linear response method and performs better, particularly in cases that call for negative values of Ueff. Based on this, we conclude that PBE+UBO can provide the accuracy of a hybrid functional at the computational cost of a semi-local functional. This may enable conducting simulations for larger systems, such as surfaces and interfaces, which would be unfeasible with a hybrid functional.

Methods

Computational details

All DFT calculations were performed using the Vienna Ab Initio Simulation Package (VASP) code with the projector augmented wave (PAW) method58,59,60. Spin-orbit coupling (SOC) was included in the calculations of transition metal oxides, Eu chalcogenides, and narrow-gap semiconductors61. Details of the lattice parameter and energy cutoff used for each compound are provided in Supplementary Table 1. The Brillouin zone was sampled using an 8 × 8 × 8 k-point grid for PBE and PBE+U calculations and 6 × 6 × 6 for HSE calculations. The coordinates of the high-symmetry k-points used for plotting the band structures are provided in Supplementary Table 2.

Linear response calculations

For LR calculations, a small potential was applied to the target state of a single site. Non-charge-self-consistent and charge-self-consistent calculations of the state occupation were performed as the potential was varied from −0.04 eV to 0.04 eV in increments of 0.02 eV. The derivatives of the occupation with respect to the potential give the non-interacting and interacting response matrices, used in Eq. 2 to calculate Ueff (see also Yang et al.62). To avoid interactions between periodic images of the perturbed atom, a 3 × 3 × 3 super-cell was constructed.

Data availability

Data will be available from the corresponding author upon reasonable request.

Code availability

The BO code developed here is available at: https://github.com/maituoy/BayesianOpt4dftu.

References

  1. 1.

    Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).

    CAS  Article  Google Scholar 

  2. 2.

    Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple [phys. rev. lett. 77, 3865 (1996)]. Phys. Rev. Lett. 78, 1396–1396 (1997).

    CAS  Article  Google Scholar 

  3. 3.

    Curtarolo, S. et al. The high-throughput highway to computational materials design. Nat. Mater. 12, 191–201 (2013).

    CAS  Article  Google Scholar 

  4. 4.

    Jain, A. et al. The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).

    Article  CAS  Google Scholar 

  5. 5.

    Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: The open quantum materials database (oqmd). JOM 65, 1501–1509 (2013).

    CAS  Article  Google Scholar 

  6. 6.

    Hüfner, S. Electronic structure of NiO and related 3d-transition-metal compounds. Adv. Phys. 43, 183–356 (1994).

    Article  Google Scholar 

  7. 7.

    Dudarev, S. L., Botton, G. A., Savrasov, S. Y., Humphreys, C. J. & Sutton, A. P. Electron-energy-loss spectra and the structural stability of nickel oxide: An lsda+u study. Phys. Rev. B 57, 1505–1509 (1998).

    CAS  Article  Google Scholar 

  8. 8.

    Perdew, J. P. Climbing the ladder of density functional approximations. MRS Bull. 38, 743–750 (2013).

    CAS  Article  Google Scholar 

  9. 9.

    Becke, A. D. A new mixing of Hartree-Fock and local density-functional theories. J. Chem. Phys. 98, 1372–1377 (1993).

    CAS  Article  Google Scholar 

  10. 10.

    Heyd, J. & Scuseria, G. E. Hybrid functionals based on a screened Coulomb potential. J. Chem. Phys. 118, 8207–8215 (2003).

    CAS  Article  Google Scholar 

  11. 11.

    Heyd, J., Scuseria, G. E. & Ernzerhof, M. Erratum: hybrid functionals based on a screened coulomb potential [J. CHEM. Phys. 118, 8207 (2003)]. J. Chem. Phys. 124, 219906 (2006).

    Article  CAS  Google Scholar 

  12. 12.

    Anisimov, V. I., Zaanen, J. & Andersen, O. K. Band theory and Mott insulators: Hubbard U instead of Stoner I. Phys. Rev. B 44, 943–954 (1991).

    CAS  Article  Google Scholar 

  13. 13.

    Yu, K. & Carter, E. A. Communication: comparing ab initio methods of obtaining effective U parameters for closed-shell materials. J. Chem. Phys. 140, 121105 (2014).

  14. 14.

    Cococcioni, M. & de Gironcoli, S. Linear response approach to the calculation of the effective interaction parameters in the LDA+U method. Phys. Rev. B 71, 035105 (2005).

    Article  CAS  Google Scholar 

  15. 15.

    Kulik, H. J., Cococcioni, M., Scherlis, D. A. & Marzari, N. Density functional theory in transition-metal chemistry: a self-consistent hubbard u approach. Phys. Rev. Lett. 97, 103001 (2006).

    Article  CAS  Google Scholar 

  16. 16.

    Mosey, N. J. & Carter, E. A. Ab initio evaluation of coulomb and exchange parameters for DFT+U calculations. Phys. Rev. B 76, 155123 (2007).

    Article  CAS  Google Scholar 

  17. 17.

    Mosey, N. J., Liao, P. & Carter, E. A. Rotationally invariant ab initio evaluation of Coulomb and exchange parameters for DFT+U calculations. J. Chem. Phys. 129, 14103 (2008).

  18. 18.

    Aryasetiawan, F., Karlsson, K., Jepsen, O. & Schönberger, U. Calculations of Hubbard U from first-principles. Phys. Rev. B 74, 125106 (2006).

    Article  CAS  Google Scholar 

  19. 19.

    Miyake, T. & Aryasetiawan, F. Screened coulomb interaction in the maximally localized wannier basis. Phys. Rev. B 77, 085122 (2008).

    Article  CAS  Google Scholar 

  20. 20.

    Şaşíoğlu, E., Friedrich, C. & Blügel, S. Effective coulomb interaction in transition metals from constrained random-phase approximation. Phys. Rev. B 83, 121101 (2011).

    Article  CAS  Google Scholar 

  21. 21.

    Frazier, P. I. A tutorial on bayesian optimization. arXiv https://arxiv.org/abs/1807.02811 (2018).

  22. 22.

    Brochu, E., Cora, V. M., & De Freitas, N. A tutorial on bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint at https://arxiv.org/abs/1012.2599 (2010).

  23. 23.

    Snoek, J., Larochelle, H. & Adams, R. P. Practical bayesian optimization of machine learning algorithms. In Advances In Neural Information Processing Systems. 2951–2959 (Neural Information Processing Systems Foundation, Inc., 2012).

  24. 24.

    Huhn, W. P. & Blum, V. One-hundred-three compound band-structure benchmark of post-self-consistent spin-orbit coupling treatments in density functional theory. Phys. Rev. Mater. 1, 033803 (2017).

    Article  Google Scholar 

  25. 25.

    Topsakal, M. & Wentzcovitch, R. M. Accurate projected augmented wave (PAW) datasets for rare-earth elements (RE=La-Lu). Comput. Mater. Sci. 95, 263–270 (2014).

    CAS  Article  Google Scholar 

  26. 26.

    Zhang, G.-X., Reilly, A. M., Tkatchenko, A. & Scheffler, M. Performance of various density-functional approximations for cohesive properties of 64 bulk solids. New J. Phys. 20, 063020 (2018).

    Article  CAS  Google Scholar 

  27. 27.

    Garza, A. J. & Scuseria, G. E. Predicting band gaps with hybrid density functionals. J. Phys. Chem. Lett. 7, 4165–4170 (2016).

    CAS  Article  Google Scholar 

  28. 28.

    Friedrich, C., Betzinger, M., Schlipf, M., Blügel, S. & Schindlmayr, A. Hybrid functionals and gw approximation in the flapw method. J. Phys. Condens. Matter 24, 293201 (2012).

    Article  CAS  Google Scholar 

  29. 29.

    Williams, C. K. I. & Rasmussen, C. E. Gaussian Processes For Machine Learning. Vol. 2 (MIT press Cambridge, 2006).

  30. 30.

    Kick, M., Reuter, K. & Oberhofer, H. Intricacies of DFT+U, not only in a numeric atom centered orbital framework. J. Chem. Theory Comput. 15, 1705–1718 (2019).

    CAS  Article  Google Scholar 

  31. 31.

    Ye, L.-H., Luo, N., Peng, L.-M., Weinert, M. & Freeman, A. J. Dielectric constant of NiO and LDA+U. Phys. Rev. B 87, 075115 (2013).

    Article  CAS  Google Scholar 

  32. 32.

    Pask, J. E., Singh, D. J., Mazin, I. I., Hellberg, C. S. & Kortus, J. Structural, electronic, and magnetic properties of MnO. Phys. Rev. B 64, 024403 (2001).

    Article  CAS  Google Scholar 

  33. 33.

    Deng, H.-X., Li, J., Li, S.-S., Xia, J.-B., Walsh, A. & Wei, S.-H. Origin of antiferromagnetism in CoO: a density functional theory study. Appl. Phys. Lett. 96, 162508 (2010).

    Article  CAS  Google Scholar 

  34. 34.

    Dufek, P., Blaha, P., Sliwko, V. & Schwarz, K. Generalized-gradient-approximation description of band splittings in transition-metal oxides and fluorides. Phys. Rev. B 49, 10170 (1994).

    CAS  Article  Google Scholar 

  35. 35.

    Hüfner, S., Osterwalder, J., Riesterer, T. & Hulliger, F. Photoemission and inverse photoemission spectroscopy of NiO. Solid State Commun. 52, 793–796 (1984).

    Article  Google Scholar 

  36. 36.

    Sawatzky, G. A. & Allen, J. W. Magnitude and origin of the band gap in NiO. Phys. Rev. Lett. 53, 2339 (1984).

    CAS  Article  Google Scholar 

  37. 37.

    Himmetoglu, B. & Wentzcovitch, R. M. First-principles study of electronic and structural properties of CuO. Phys. Rev. B 84, 115108 (2011).

    Article  CAS  Google Scholar 

  38. 38.

    Jiang, H., Gomez-Abal, R. I., Rinke, P. & Scheffler, M. Localized and Itinerant states in lanthanide oxides united by GW@LDA+U. Phys. Rev. Lett. 102, 126403 (2009).

    Article  CAS  Google Scholar 

  39. 39.

    Jiang, H., Gomez-Abal, R. I., Rinke, P. & Scheffler, M. First-principles modeling of localized d states with the GW@LDA+U approach. Phys. Rev. B 82, 045108 (2010).

    Article  CAS  Google Scholar 

  40. 40.

    Wachter, P. The optical electrical and magnetic properties of the europium chalcogenides and the rare earth pnictides. Crit. Rev. Solid State Mater. Sci. 3, 189–241 (1972).

    CAS  Article  Google Scholar 

  41. 41.

    Ghosh, D. B., De, M. & De, S. K. Electronic structure and magneto-optical properties of magnetic semiconductors: Europium monochalcogenides. Phys. Rev. B 70, 115211 (2004).

    Article  CAS  Google Scholar 

  42. 42.

    Larson, P. & Lambrecht, W. R. L. Electronic structure and magnetism of europium chalcogenides in comparison with gadolinium nitride. J. Phys. Condens. Matter 18, 11333–11345 (2006).

    CAS  Article  Google Scholar 

  43. 43.

    Shi, S. Q. et al. Electronic structure and magnetism of EuX (X = O, S, Se and Te): A first-principles investigation. EPL 83, 4–9 (2008).

    Google Scholar 

  44. 44.

    Schlipf, M., Betzinger, M., Ležaić, M., Friedrich, C. & Blügel, S. Structural, electronic, and magnetic properties of the europium chalcogenides: a hybrid-functional DFT study. Phys. Rev. B 88, 94433 (2013).

    Article  CAS  Google Scholar 

  45. 45.

    Ingle, N. J. C. & Elfimov, I. S. Influence of epitaxial strain on the ferromagnetic semiconductor EuO: first-principles calculations. Phys. Rev. B 77, 121202 (2008).

    Article  CAS  Google Scholar 

  46. 46.

    An, J. M., Barabash, S. V., Ozolins, V., van Schilfgaarde, M. & Belashchenko, K. D. First-principles study of phase stability of Gd-doped EuO and EuS. Phys. Rev. B 83, 064105 (2011).

    Article  CAS  Google Scholar 

  47. 47.

    Kim, Y.-S., Hummer, K. & Kresse, G. Accurate band structures and effective masses for InP, InAs, and InSb using hybrid functionals. Phys. Rev. B 80, 035203 (2009).

    Article  CAS  Google Scholar 

  48. 48.

    Massidda, S. et al. Structural and electronic properties of narrow-band-gap semiconductors: InP, InAs, and InSb. Phys. Rev. B 41, 12079–12085 (1990).

    CAS  Article  Google Scholar 

  49. 49.

    Soluyanov, A. A. et al. Optimizing spin-orbit splittings in InSb Majorana nanowires. Phys. Rev. B 93, 115317 (2016).

    Article  CAS  Google Scholar 

  50. 50.

    Malyi, O. I., Dalpian, G. M., Zhao, X.-G., Wang, Z. & Zunger, A. Realization of predicted exotic materials: the burden of proof. Mater. Today 32, 35–45 (2019).

    Article  CAS  Google Scholar 

  51. 51.

    Vurgaftman, I., Meyer, J. R. & Ram-Mohan, L. R. Band parameters for III-V compound semiconductors and their alloys. J. Appl. Phys. 89, 5815–5875 (2001).

    CAS  Article  Google Scholar 

  52. 52.

    Madelung, O., Rössler, U. & Schulz, M. Group IV Elements, IV-IV and III-V Compounds. Part b - Electronic, Transport, Optical and Other Properties. Landolt-Börnstein - Group III Condensed Matter, 41A1β (Heidelberg Springer, Berlin, 2002).

  53. 53.

    Micnas, R., Ranninger, J. & Robaszkiewicz, S. Superconductivity in narrow-band systems with local nonretarded attractive interactions. Rev. Mod. Phys. 62, 113–171 (1990).

    CAS  Article  Google Scholar 

  54. 54.

    Hase, I. & Yanagisawa, T. Madelung energy of the valence-skipping compound BaBiO3. Phys. Rev. B 76, 174103 (2007).

    Article  CAS  Google Scholar 

  55. 55.

    Nakamura, H., Hayashi, N., Nakai, N., Okumura, M. & Machida, M. First-principle electronic structure calculations for magnetic moment in iron-based superconductors: an LSDA+ negative U study. Phys. C Supercond. 469, 908–911 (2009).

    CAS  Article  Google Scholar 

  56. 56.

    Persson, C. & Mirbt, S. Improved electronic structure and optical properties of sp-hybridized semiconductors using LDA+U SIC. Braz. J. Phys. 36, 286–290 (2006).

    CAS  Article  Google Scholar 

  57. 57.

    Cococcioni, M. The LDA+U approach: a simple Hubbard correction for correlated ground states. Correlated Electrons: From Models to Materials Modeling and Simulation. Vol. 2, Ch. 4 (Verlag des Forschungszentrum Jülich, Jülich, Germany, 2012).

  58. 58.

    Kresse, G. Ab initio molecular dynamics for liquid metals. J. Non Cryst. Solids 192193, 222–229 (1995).

    Article  Google Scholar 

  59. 59.

    Rolland, A., Pouget, L., Brunel, M., Loas, G. & Alouini, M. Bridging the THz to RF gap by four-wave mixing in a highly nonlinear fiber. In Proc. 2013 Conference on Lasers and Electro-Optics, CLEO 2013 59, 11–19 (2013).

  60. 60.

    Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953 (1994).

    Article  Google Scholar 

  61. 61.

    Steiner, S., Khmelevskyi, S., Marsmann, M. & Kresse, G. Calculation of the magnetic anisotropy with projected-augmented-wave methodology and the case study of disordered Fe1−xCox alloys. Phys. Rev. B 93, 224425 (2016).

    Article  CAS  Google Scholar 

  62. 62.

    Yang, S., Wu, C. & Marom, N. Topological properties of SnSe/EuS and SnTe/CaTe interfaces. Phys. Rev. Mater. 4, 034203 (2020).

    CAS  Article  Google Scholar 

Download references

Acknowledgements

Work on III–V semiconductors was funded by the National Science Foundation (NSF) through grant OISE-1743717. Work on transition metal oxides and Eu chalcogenides was funded by the U.S. Department of Energy through grant DE-SC0019274. This research used resources of the National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility operated under Contract No. DE-AC02-05CH11231.

Author information

Affiliations

Authors

Contributions

M.Y. and S.Y. implemented the algorithm, performed the calculations, collected, and analyzed the data. M.Y. and S.Y. contributed equally to this work. C.W. performed some of the calculations. N.M. led the project. All authors contributed to writing the manuscript.

Corresponding author

Correspondence to Noa Marom.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Yu, M., Yang, S., Wu, C. et al. Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization. npj Comput Mater 6, 180 (2020). https://doi.org/10.1038/s41524-020-00446-9

Download citation

Further reading

Search

Quick links