A machine learning route between band mapping and band structure

Xian, R. Patrick; Stimper, Vincent; Zacharias, Marios; Dendzik, Maciej; Dong, Shuo; Beaulieu, Samuel; Schölkopf, Bernhard; Wolf, Martin; Rettig, Laurenz; Carbogno, Christian; Bauer, Stefan; Ernstorfer, Ralph

doi:10.1038/s43588-022-00382-2

Download PDF

Resource
Open access
Published: 30 December 2022

A machine learning route between band mapping and band structure

Nature Computational Science volume 3, pages 101–114 (2023)Cite this article

7750 Accesses
3 Citations
13 Altmetric
Metrics details

Subjects

This article has been updated

Abstract

The electronic band structure and crystal structure are the two complementary identifiers of solid-state materials. Although convenient instruments and reconstruction algorithms have made large, empirical, crystal structure databases possible, extracting the quasiparticle dispersion (closely related to band structure) from photoemission band mapping data is currently limited by the available computational methods. To cope with the growing size and scale of photoemission data, here we develop a pipeline including probabilistic machine learning and the associated data processing, optimization and evaluation methods for band-structure reconstruction, leveraging theoretical calculations. The pipeline reconstructs all 14 valence bands of a semiconductor and shows excellent performance on benchmarks and other materials datasets. The reconstruction uncovers previously inaccessible momentum-space structural information on both global and local scales, while realizing a path towards integration with materials science databases. Our approach illustrates the potential of combining machine learning and domain knowledge for scalable feature extraction in multidimensional data.

Representing individual electronic states for machine learning GW band structures of 2D materials

Article Open access 03 February 2022

Towards fully automated GW band structure calculations: What we can learn from 60.000 self-energy evaluations

Article Open access 29 January 2021

Machine learning the Hubbard U parameter in DFT+U using Bayesian optimization

Article Open access 27 November 2020

Main

Modeling and characterization of the electronic band structure (BS) of a material play essential roles in materials design¹ and device simulation². The BS exists in momentum space, Ω(k_x, k_y, k_z, E), and imprints the multidimensional and multivalued functional relations between the energy (E) and momenta (k_x, k_y, k_z) of periodically confined electrons³. Photoemission band mapping⁴ (Fig. 1a) using momentum- and energy-resolved photoemission spectroscopy (PES), including angle-resolved PES (ARPES)^5,6 and multidimensional PES^7,8, measures the BS as an intensity-valued multivariate probability distribution directly in Ω. The proliferation of band-mapping datasets and their public availability brought about by recent hardware upgrades^7,8,9,10 have ushered in possibilities regarding the comprehensive benchmarking of theories and experiments, which is especially challenging for multiband materials with complex band dispersions^11,12,13. The available methods for interpreting photoemission spectra fall into two categories: physics-based methods, which require least-squares fitting of one-dimensional lineshapes, named energy or momentum distribution curves (EDCs or MDCs), and analytical models^5,14,15. Although physics-informed data models guarantee high accuracy and interpretability, upscaling the pointwise fitting (or estimation) to large, densely sampled regions in momentum space (for example, including 10⁴ or more momentum locations) presents challenges due to the limited numerical stability and efficiency. Therefore, their use is limited to selected momentum locations determined heuristically from physical knowledge of the materials and experimental settings. Image-processing-based methods apply data transformations to improve the visibility of dispersive features^16,17,18,19. They are more computationally efficient and can operate on entire datasets, yet offer only visual enhancement of the underlying band dispersion. They do not allow reconstruction and are therefore insufficient for truly quantitative benchmarking or archiving.

A method balancing the two approaches will extract the band dispersion with sufficiently high accuracy and be scalable to multidimensional datasets, therefore providing the basis for distilling structural information from complex band-mapping data and for building efficient tools for annotating and understanding spectra. In this regard we propose a computational framework (Fig. 1b) for global reconstruction of the photoemission (or quasiparticle) BS as a set of energy (or electronic) bands, formed by energy values (that is, band loci) connected along momentum coordinates. This local connectedness assumption is more valid than using local maxima of photoemission intensities, because local maxima are not always good indicators of band loci²⁰. We exploit the connection between theory and experiment in our framework, based on a probabilistic machine-learning^21,22 model, to approximate the intensity data from band-mapping experiments. The gist of the model is rooted in Bayes rule:

$${p}({{{{X}}}}| {{{\mathcal{D}}}})\propto {p}({{{\mathcal{D}}}}| {{{{X}}}}){p}({{{{X}}}}),$$

(1)

where X are the random variables to be inferred and the data ${{{\mathcal{D}}}}$ are mapped directly onto unknowns and experimental observables. We assign the energy values of the photoemission BS as the model’s variables to extract from data, and a nearest-neighbor (NN) Gaussian distribution as the prior, p(X), to describe the proximity of energy values at nearby momenta. The EDC at every momentum grid point relates to the likelihood, ${p}({{{\mathcal{D}}}}| {{{{X}}}})$, when we interpret the photoemission intensity probabilistically. The optimum is obtained via maximum a posteriori (MAP) estimation in probabilistic inference²¹ (Methods and Supplementary Fig. 2). Given the form of the NN prior, the posterior, ${p}({{{{X}}}}| {{{\mathcal{D}}}})$, in the current setting forms a Markov random field (MRF)^21,23,24, which encapsulates the energy-band continuity assumption and the measured intensity distribution of photoemission in a probabilistic graphical model. In one benefit, the probabilistic formulation can incorporate imperfect physical knowledge algebraically in the model or numerically as the initialization (that is, warm start; Methods) of the MAP estimation, without requiring the de facto ground truth and training as in supervised machine learning²⁵. In another benefit, the graphical model representation allows convenient optimization and extension to other dimensions (Supplementary Fig. 1 and Supplementary Section 1).

To demonstrate the effectiveness of the method, we first reconstructed the entire 3D dispersion surface, E(k_x, k_y), of all 14 valence bands within the projected first Brillouin zone (in (k_x, k_y, E) coordinates) of the semiconductor tungsten diselenide (WSe₂), spanning ~7 eV in energy and ~3 Å⁻¹ along each momentum direction. We also adapted the informatics tools to BS data to sample and compare the reconstructed and theoretical BSs globally. The accuracy of the reconstruction was validated using synthetic data and the extracted local structural parameters along with pointwise fitting. The available data and BS informatics enable a detailed comparison of band dispersion at a resolution of <0.02 Å⁻¹. We performed various tests and benchmarking on datasets of other materials and simulated data, where ground truth is available to evaluate the accuracy and computational efficiency.

Results

BS reconstruction and digitization

Our main example is the 2D layered semiconductor WSe₂, with its hexagonal lattice and bilayer stacking periodicity (denoted 2H-WSe₂), as a model system for band-mapping experiments^11,26,27. Earlier valence-band mapping and reconstruction in ARPES experiments on WSe₂ demonstrated a high degree of similarity between theory and experiments^11,26,27, but a quantitative assessment within the entire (projected) Brillouin zone is still lacking. The valence BS of 2H-WSe₂ contains 14 strongly dispersive energy bands, formed by a mixture of the 5d⁴ and 6s² orbitals of the W atoms and the 4p⁴ orbitals of the Se atoms, in its hexagonal unit cell. The strong spin–orbit coupling (SOC) due to these heavy elements produces large momentum- and spin-dependent energy splitting and modifications to the BS^11,28.

We use a 2D MRF to model the loci of an energy band within the intensity-valued 3D band-mapping data, regarded as a collection of momentum-ordered EDCs. This is graphically represented by a rectangular grid overlaid on the momentum axes with indices (i, j) (where i, j are non-negative integers), as shown in step (3) of Fig. 1b. The undetermined band energy of the EDC at (i, j), with the associated momentum coordinates (k_x, i, k_y, j), is considered a random variable, ${\tilde{E}}_{i,\,j}$, of the MRF. Together, the probabilistic model is characterized by a joint distribution, expressed as the product of the likelihood and the Gaussian prior in equation (1). To maintain its simplicity, we do not explicitly account for the intensity modulations of various origins (such as imbalanced transition matrix elements²⁰) in the original band-mapping data, which cannot be remediated by upgrading the photon source or detector. Instead, we pre-process the data to minimize their effects on the reconstruction (Fig. 1c–f). The pre-processing steps include (1) intensity symmetrization and (2) contrast enhancement²⁹, followed by (3) Gaussian smoothing (Methods), after which the continuity of band-like features is restored. The EDCs from the pre-processed data, ${\tilde{I}}$, are used effectively as the likelihood to calculate the MRF joint distribution:

$${p}(\{{\tilde{E}}_{i,\,j}\})={\frac{1}{Z}\mathop{\prod}\limits_{ij}\tilde{I}({k}_{x,\,i},\,{k}_{y,\,j},\,{\tilde{E}}_{i,\,j})\cdot \mathop{\prod}\limits_{(i,\,j)(l,\,m)| {{{\rm{NN}}}}}\exp \left[-\frac{{({\tilde{E}}_{i,\,j}-{\tilde{E}}_{l,\,m})}^{2}}{2{\eta }^{2}}\right]}.$$

(2)

Here, Z is a normalization constant, η is a hyperparameter defining the width of the Gaussian prior, ∏_ij denotes the product over all discrete momentum values sampled in the experiment, and ∏_{(i, j)(l, m)∣NN} is the product over all NN terms. A detailed derivation of equation (2) is given in Supplementary Section 1. Reconstruction of the photoemission BS is carried out sequentially for all bands and relies on local optimization of the MRF’s variables, ${\{{\tilde{E}}_{i,\,j}\}}$.

To optimize over large graphical models, we adopt multiple parallelization schemes to achieve efficient operations on scalable computing hardware. A single band reconstruction involving optimization over 10⁴ random variables is achieved within seconds and hyperparameter tuning within tens of minutes (Methods and Supplementary Figs. 3 and 4). In comparison, pointwise fitting often requires individual hand-tuning and is therefore difficult to scale up to whole bands within a meaningful timeframe. To correctly resolve band crossings and nearly degenerate energies, we inject relevant physical knowledge into the optimization by using density functional theory (DFT) BS calculations with semi-local approximation³⁰ as a starting point for the reconstruction. The calculation qualitatively involves physical symmetry information for WSe₂, albeit not quantitatively reproducing the experimental quasiparticle BSs at all momentum coordinates. As shown with four DFT calculations with different exchange-correlation functionals³⁰ to initiate the reconstruction for WSe₂ and in various cases using synthetic data with known ground truths (Methods, Supplementary Table 3 and Supplementary Figs. 4–8), the reconstruction algorithm is not particularly sensitive to the initialization as long as the information about band crossings is present. The current framework can also support initialization from more advanced electronic-structure methods, such as GW³¹ or those including electronic self-energies renormalized by electron–phonon coupling³², where semi-local approximation yields not only quantitatively, but also qualitatively wrong quasiparticle BSs compared with the experiment. However, a systematic benchmarking of theory and experiment goes beyond the scope of this work.

The 14 reconstructed valence bands of WSe₂ initialized by the local density approximation (LDA)-level DFT are shown in Fig. 2b–d and Supplementary videos. To globally compare the computed and reconstructed bands at a consistent resolution, we expand the BS in orthonormal polynomial bases³³, which are global shape descriptors and unbiased by the underlying electronic detail. The geometric featurization of band dispersion allows multiscale sampling and comparison using coefficient (or feature) vectors³⁴. We chose Zernike polynomials (ZPs) to decompose the 3D dispersion surfaces (Fig. 3 and Methods) because of their existing adaptations to various boundary conditions³⁵.

**Fig. 2: Band reconstruction from WSe₂ photoemission data.**

**Fig. 3: Digitization and comparison of WSe₂ BSs.**

In Fig. 3a,b, the band dispersions show generally decreasing dependence (seen from the magnitude of coefficients) on basis terms with increasing complexities (Fig. 3a), and the majority of dispersion is encoded into a subset of the terms (Fig. 3b). This observation implies that moderate smoothing may be applied to remove high-frequency features to improve the reconstruction in the case of limited-quality data (acquired without sufficient accumulation time), which is often unavoidable when materials exhibit vacuum degradation, or during experimental parameter tuning. The example in Fig. 3b and additional numerical evidence in Supplementary Fig. 14 illustrate the approximation capability of the hexagonal ZPs. These coefficients act as geometric fingerprints of the energy band dispersion, enabling the use of similarity or distance metrics (Methods) for their comparison³⁴. In Fig. 3c, the positive cosine similarity confirms the strong shape (or dispersion) resemblance of the seven pairs of spin-split energy bands in the reconstructed BS of WSe₂, and the low negative values, such as those for bands 1–2 and 13–14, reflect the opposite directions of their respective dispersion (Fig. 2d). These observations are consistent with the outcome obtained from DFT calculations (Supplementary Fig. 13).

Computational metrics and performance

To quantify the computational advantages of the machine-learning-based reconstruction approach, we examine the outcome from diverse perspectives related to consistency, accuracy and cost. To assess the consistency of reconstruction in its entirety, we introduce a BS distance metric (Methods), invariant to the global energy shift frequently used to adjust the energy zero, to quantify the differences in band dispersion and the relative spacing between bands, which are the two major sources of variation between theories and experiments. The distance is calculated using the geometric fingerprints to bypass interpolation errors while reconciling the coordinate spacing difference between reconstructed and theoretical BSs, essential for differentiating BS data from heterogeneous sources in materials science databases^36,37. The results in Fig. 3d refer to the valence BS of WSe₂ discussed in this work, with the distances (Methods) and their spread (that is, standard errors) displayed in the upper and lower triangles, respectively. A high degree of consistency exists among the reconstructions (pairwise distance no larger than 60 ± 8 meV per band), regardless of the level of DFT calculation used for initialization, indicating the robustness of the probabilistic reconstruction algorithm, whereas the distances between the DFT calculations are much larger, both in energy shifts and their spread. As shown in Fig. 3d and Supplementary Fig. 5, the learning algorithm can effectively reduce the epistemic uncertainty³⁸ between theories to obtain a consistent reconstruction.

To demonstrate the computational advantage of the MRF reconstruction over traditional line-fitting methods, we benchmarked the outcome over selected regions in synthetic photoemission data. The regions are chosen based on their importance, and we limit the size to have a manageable computing time (about an hour on our computing cluster, at maximum, for a single run), determined by the slower method, and to allow for hyperparameter tuning, which requires tens of runs. The line-fitting approach uses the Levenberg–Marquardt least-squares optimization³⁹ with bound constraints for multicomponent photoemission spectra composed of a series of lineshape functions. We used the benchmark established in ref. ⁴⁰ for pointwise line fitting, employing high-performance computing and two synthetic datasets with known ground-truth dispersion, representing the local and global settings of the BS reconstruction problem (Supplementary Section 2.5). The synthetic data were based on a BS at the LDA-DFT level around the K-point and along the high-symmetry line of the Brillouin zone. To limit the hardware requirements, we used only distributed multicore-CPU computing for performance benchmarking. The estimated computing times are normalized to the per-band per-spectrum level⁴⁰. The accuracy of the reconstruction is calculated using the same-resolution root-mean-squared (r.m.s.) error, and the (in)stability is quantified by the standard deviation (s.d.) of the residuals, which measures surface roughness⁴¹. The benchmarking results are compiled in Fig. 4 and Supplementary Table 2. They show that, compared with pointwise line fitting, the MRF reconstruction offers a considerable reduction in both normalized computing time and hyperparameter tuning time, while achieving consistently higher accuracy and stability in all but the two-band case. The combination of accuracy and stability in MRF reconstruction is due to the connectivity built into the prior, whereas in the pointwise fitting approach, information is not explicitly shared among neighbors. Because the number of bands reflects the complexity of the multicomponent spectra, near-constant normalized computing time and hyperparameter tuning time (Fig. 4a,b) in the MRF reconstruction, regardless of the number of bands (or spectral components), allow us to scale up the computation to datasets comprising 10⁴ to 10⁵ or more spectra. The substantial gain in computational efficiency is a result of the inherent divide-and-conquer strategy in our BS reconstruction problem formulation and the associated distributed optimization method in the algorithm design. Comparatively, the distributed pointwise fitting exhibits a quasi-linear computational scaling with respect to the number of bands. When hyperparameter tuning is taken into account, in practice it is only feasible for fitting small datasets with up to 10³ multicomponent spectra⁴⁰.

**Fig. 4: Performance evaluation on benchmarks.**

Extended use cases and applications

The band dispersions recovered from photoemission data are often examined locally near dispersion extrema. We show in Fig. 5 that, besides providing the global structural information, the reconstruction improves the robustness of traditional pointwise lineshape fitting in extended regions of the momentum space, when used as an initial guess, because BS calculations may exhibit appreciable momentum-dependent deviations from experimental data that prevent them from being a sufficiently good starting point. Pointwise fitting in turn acts as the refinement of local details not explicitly included in the probabilistic reconstruction model, which prioritizes efficiency. This sequential approach recovers large regions in the Brillouin zone at high energy resolution, without laborious hand-tuning of the fitting parameters per photoemission spectrum. Adopting this approach to WSe₂, we first recovered a compendium of local BS parameters (Supplementary Table 4). The trigonal warping parameters of the first two valence bands around the ${\overline{{{{\rm{K}}}}}}$-point are 5.8 eV Å³ and 3.9 eV Å³, respectively, confirming the magnitude difference between these spin-split bands predicted by theory²⁸. The warping signature extends further to high-energy bands. Dispersion fitting around the saddle point ${\overline{{{{\rm{M}}}}}}^{{\prime} }$ (and ${\overline{{{{\rm{M}}}}}}$) of the BS reveals that the gap opened by the spin–orbit interaction extends beyond it anisotropically on the dispersion surfaces, with the minimum gap at 338 meV, markedly larger than in the DFT results, which predict degeneracy²⁸. We expect this observation to contribute to the spin-dependent optical absorption due to the association of the saddle point, in energy dispersion, with a van Hove singularity^28,42.

**Fig. 5: Local BS parameters of WSe₂.**

In addition to WSe₂, we performed BS reconstruction on two other photoemission datasets for other classes of material. The first dataset is from bismuth tellurium selenide (Bi₂Te₂Se), a topological insulator, measured using the same laboratory photoemission set-up (Fig. 6a–e) as for the WSe₂ dataset. Although we used only simple numerical functions (Gaussian and paraboloid) to initialize the MRF reconstruction, the outcome demonstrates correct discrete momentum–space symmetry and details of energy dispersion down to the concave-shaped hexagonal warping in the band energy contours around the Dirac point⁴³. Four energy bands, including the two low-energy valence bands, a surface-state energy band, and a partially occupied conduction band, were recovered using our approach for Bi₂Te₂Se. The second is the bulk gold (Au) photoemission dataset measured at a synchrotron X-ray source (Fig. 6f,g). We used DFT calculations as the initialization to reconstruct four of the bulk energy bands, which are usually very challenging to extract by hand-tracing or parametric function-fitting, due in part to blurring (k_z dispersion) from the 3D characteristics of the electrons in the metallic bulk. Further discussions on these two materials and their band reconstructions are provided in Supplementary Section 3.

**Fig. 6: Band reconstruction for Bi₂Te₂Se and Au(111).**

Discussion

The reconstruction approach described here provides a quantitative connection between empirical band dispersion (${E}_{b}^{{{{\rm{emp}}}}}$) obtained from photoemission band mapping and the theoretical counterparts (${E}_{b}^{{{{\rm{theory}}}}}$) through various orders of momentum-dependent ‘perturbations’ (${{\Delta }}{E}_{b}^{(n)}$). The connection may be expressed as

$$\begin{array}{lll}{E}_{b}^{{{{\rm{emp}}}}}({{{\bf{k}}}},\,{{\varSigma }})&\approx &{E}_{b}^{{{{\rm{theory}}}}}({{{\bf{k}}}},\,{{\varSigma }})+{{\Delta }}{E}_{b}^{(0)}+{{\Delta }}{E}_{b}^{(1)}({{{\bf{k}}}},\,{{\varSigma }})+{{\Delta }}{E}_{b}^{(2)}({{{\bf{k}}}},\,{{\varSigma }})+...\\ &=&{E}_{b}^{{{{\rm{theory}}}}}({{{\bf{k}}}},\,{{\varSigma }})+\mathop{\sum}\limits_{n}{{\Delta }}{E}_{b}^{(n)}({{{\bf{k}}}},\,{{\varSigma }})\\ &=&{E}_{b}^{{{{\rm{theory}}}}}({{{\bf{k}}}},\,{{\varSigma }})+{{\Delta }}{E}_{b}({{{\bf{k}}}},{{\varSigma }}).\end{array}$$

(3)

In equation (3), b is the band index, Σ represents electron self-energy, the zeroth-order term (${{\Delta }}{E}_{b}^{(0)}$) means a rigid shift, and higher-order terms have increasing momentum-dependent nonlinearities. Our results here demonstrate that this formulation leads to practical band reconstruction, which recovers the accumulated perturbations (ΔE_b) in equation (3) for every experimentally resolvable energy band. The outcome with current reconstruction accuracy and stability should assist interpretation of deep-lying bands, parametrizing multiband Hamiltonian models⁴⁴. The data size reduction by over 5,000 times from 3D band-mapping data to geometric features vectors (Methods) facilitates database integration^37,45.

Apart from the benefits, we want to outline three limitations of our reconstruction approach. First, the reconstruction approach does not work ab initio and requires knowing the number of energy bands, N_b, as implicated by equation (3) for an indexed band (b = 1, 2, ..., N_b). Although in simple datasets with up to several bands, N_b can be estimated using prior knowledge of the material or from visual inspection, correctly estimating N_b in complex datasets still requires calculated BSs. Second, when the electron self-energy modulation is substantial, separating the so-called bare-band dispersion (that is, single-particle dispersion) from the quasiparticle dispersion is needed to understand the material physics⁴⁶. This requires re-evaluating the BS reconstruction concept and considering the full spectral function (Supplementary Section 1.1) explicitly to account for non-standard lineshapes. Nevertheless, the outcome of our current approach may act as a trial solution for disentangling the bare-band dispersion relation from the electron self-energy⁴⁶. Because the local connectedness assumption in equation (2) remains largely valid, our reconstruction may still recover the quasiparticle dispersion. We demonstrate this in Supplementary Fig. 10 using simulated photoemission data with a kink anomaly, a strong modification of dispersion from electron self-energy^5,6. Third, an appropriate initialization may be expensive or impossible to obtain, either due to the computational cost, if higher-level theories (such as DFT with hybrid functionals and GW) are required, or due to the complexity of the materials system, including undetermined microscopic interactions, sample defects or structural disorder, creating strong intensity blurring from k_z dispersion and so on. These scenarios will remain challenging for band reconstruction.

Besides our demonstrations, we anticipate additional use cases. These include (1) online monitoring⁴⁷ of band-mapping experiments in the study of materials’ phase transitions⁴⁸ or functioning devices⁴⁹, where changes in atomic structure or carrier mobility are often accompanied by detectable changes in the electronic structure (including band dispersion), resulting in I(k, E, t) with time (t) dependence in addition to momentum (k) and energy. There is also (2) spatial mapping of BS variations for electronic devices via scanning photoemission measurements^50,51, resulting in I(k, E, x) with spatial (x) dependence. In cases (1) and (2), a fast reconstruction and evaluation framework may be used in a feedback loop to steer or optimize experimental conditions. The next use case is (3) implementation of the reconstruction across various materials and to band-mapping data⁷ conditioned on external parameters, including temperature, photon energy, dynamical time delay, and spin as resolved quantities, which will generate comprehensive knowledge about the (non)equilibrium electronic structure of materials to benchmark theories. Moreover, the reconstruction method is (4) transferable to extracting the band dispersion of other quasiparticles (phonons⁵², polaritons⁵³ and so on⁵⁴) in periodic systems, given the availability of corresponding multidimensional datasets. Finally, (5) the analogy between band mapping and spatially resolved spectral imaging, which produces location-dependent spectra, or I(x, y, E) suggests that the reconstruction algorithm may find use in teasing out the spatial (x, y) variation of the spectral shifts, complementary to the outcome of clustering algorithms⁵⁵.

The increasing amount of publicly accessible and reusable datasets from materials-science communities⁴⁵ motivates future extensions to the model with other types of informative prior that account for the full complexity of the physical signal while maintaining computational efficiency. Overall, the multidisciplinary methodology provides an example of building next-generation high-throughput materials-characterization toolkits combining learning algorithms with physical knowledge⁵⁶ to arrive at a comprehensive understanding of materials properties that has been unattainable so far.

Methods

Band-mapping measurements of WSe₂

Multidimensional PES experiments were conducted with a laser-driven, high-harmonic-generation-based XUV light source⁹ operating at 21.7 eV and 500 kHz and a METIS 1000 (SPECS) momentum microscope featuring a delay-line detector coupled to a time-of-flight drift tube^8,57. The experiment captures photoelectrons directly in their 3D coordinates, (k_x, k_y, E)^7,8. Single-crystal samples of WSe₂ (>99.995% pure) were purchased from HQ Graphene and were used directly for measurements without further purification. Before measurements, the WSe₂ samples were attached to the Cu substrate with conductive epoxy resin (EPO-TEK H20E). The samples were cleaved by cleaving pins attached to the sample surface upon transfer into the measurement chamber, which operated at an ambient pressure of 10⁻¹¹ mbar during photoemission experiments. No effect of surface termination was observed in the measured WSe₂ photoemission spectra, similar to previous experimental observations^11,26. For the valence-band-mapping experiments, the energy focal plane of the photoelectrons within the time-of-flight drift tube was set close to the top valence band. Although effects of sample degradation have been reported²⁷ during the course of long-duration angular scanning in ARPES measurements, with our high-repetition-rate photon source⁹ and the fast electronics of the momentum microscope, band mapping of WSe₂ achieves a sufficient signal-to-noise ratio for valence-band reconstruction within only tens of minutes of data acquisition, without the need for angular scanning and subsequent reconstruction from momentum–space slices.

Data processing and reconstruction

The raw data, in the form of single-electron events recorded by the delay-line detector, were pre-processed using home-developed software packages⁵⁸. The events were first binned to the (k_x, k_y, E) grid with dimensions of 256 × 256 × 470 to cover the full valence-band range in WSe₂ within the projected Brillouin zone (PBZ), which amounts to a pixel size of ~0.015 Å⁻¹ along the momentum axes and ~18 meV along the energy axis. The bin sizes are within the limits of the momentum resolution (<0.01 Å⁻¹) and energy resolution (<15 meV) of the photoelectron spectrometer⁵⁹.

Data binning was carried out in conjunction with the necessary lens distortion correction⁶⁰ and calibrations, as described in ref. ⁵⁸. The outcome provided a sufficient level of granularity in momentum space to resolve the fine features in band dispersion while achieving higher signal-to-noise ratio than when using single-event data directly. Afterwards, we applied intensity symmetrization to the data along the six-fold rotation symmetry and mirror symmetry axes¹¹ of the photoemission intensity pattern in (k_x, k_y) coordinates, followed by contrast enhancement using the multidimensional extension of the contrast limited adaptive histogram equalization (MCLAHE) algorithm, where the intensities in the image are transformed by a look-up table built from the normalized cumulative distribution function of local image patches²⁹. Finally, we applied Gaussian smoothing to the data along the k_x, k_y and E axes with s.d. of 0.8, 0.8 and 1 pixels (or ~0.012 Å⁻¹, 0.012 Å⁻¹, and 18 meV), respectively.

After data pre-processing, we sequentially reconstructed every energy band of WSe₂ from the photoemission data using the MAP approach described in the main text. The reconstruction requires tuning of three hyperparameters: (1) momentum scaling and (2) the rigid energy shift to coarse-align the computed energy band, for example, from DFT, to the photoemission data, and (3) the width of the NN Gaussian prior (η in equation (2)). Hyperparameter tuning is also carried out individually for each band to adapt to a specific environment. An example of hyperparameter tuning is given in Supplementary Fig. 4. The MAP reconstruction method involves optimization of the band-energy random variables, ${\{{\tilde{E}}_{i,\,j}\}}$, to maximize the posterior probability, ${p}={p}(\{{\tilde{E}}_{i,\,j}\})$, or to minimize the negative log-probability loss function, ${{{\mathcal{L}}}}:= {-\log p}$, obtained from equation (2) as is used in our actual implementation:

$${{{\mathcal{L}}}}(\{{\tilde{E}}_{i,\,j}\})={-\mathop{\sum}\limits_{i,\,j}\log I({k}_{x,\,i},\,{k}_{y,\,j},\,{\tilde{E}}_{i,\,j})+\mathop{\sum}\limits_{(i,\,j),\,(l,\,m)| {{{\rm{NN}}}}}\frac{{({\tilde{E}}_{i,\,j}-{\tilde{E}}_{l,\,m})}^{2}}{2{\eta }^{2}}+{{{\rm{const.}}}}}$$

(4)

We implemented the optimization using a parallelized version of the iterated conditional mode⁶¹ method in TensorFlow⁶² to run on multicore computing clusters and GPUs. The parallelization involves a checkerboard coloring scheme (or coding method) of the graph nodes⁶³ and subsequent hierarchical grouping of colored nodes, which allows alternating updates on different subgraphs (that is, subsets of the nodes) of the MRF during optimization. Typically, the optimization process in the reconstruction of one band converges within and therefore is terminated after 100 epochs, which takes ~7 s on a single NVIDIA GTX980 GPU for the above-mentioned data size. Details on the parallelized implementation are provided in Supplementary Section 1. In addition, because symmetry information is not explicitly included in the MRF model, the reconstructed bands generally require further symmetrization, such as refinement or post-processing, to be ready for database integration.

We have described our approach of using BS calculations to initialize the MAP optimization as a warm start. The term ‘warm start’ in the context of numerical optimization generally refers to the initialization of an optimization using the outcome of an associated but more solvable problem (for example, a surrogate model) obtained beforehand that yields an approximate answer, instead of starting from scratch (cold start). Warm-starting an optimization improves the effective use of prior knowledge and its convergence rate³⁹. In the current context, we regard the BS reconstruction from photoemission band-mapping data as the optimization problem to warm start, and the outcome from an electronic-structure calculation can produce a sufficiently good approximate to the solution of the optimization problem. For WSe₂, straightforward DFT calculations with semi-local approximation (which in itself involves explicit optimizations such as geometric optimization of the crystal structures) are sufficient, but our approach is not limited to DFT. Therefore, the use of ‘warm start’ in our application is conceptually well-aligned with the origin of the term.

To validate the MAP reconstruction algorithm in a variety of scenarios, we used synthetic photoemission data where the nominal ground-truth BSs are available. The BSs are constructed using analytic functions, model Hamiltonians or DFT calculations. The initializations are generated by tuning the numerical parameters used to generate the ground-truth BSs. The procedures and results are presented in Supplementary Section 2. In simple cases, such as single or well-isolated bands, the reconstruction yields a close solution to the ground truth, even with a flat band initialization. In the more general multiband scenario with congested bands and band crossings (or anti-crossings), an approximate dispersion (or shape) of the band and the crossing information is required in the initialization (warm start) to converge to a realistic solution. We further tested the robustness of the initializations by (1) scaling the energies of the ground truth and (2) using DFT calculations with different exchange-correlation (XC) functionals, to capture sufficient variability of available BS calculations in the real world. We quantify the variations in the initializations and the performance of the reconstruction using the average error (equation (9) or Fig. 4b), calculated with respect to the ground truth. Among the different numerical experiments, we find that the optimization converges consistently to a set of bands that better match the experimental data than the initialization. This is manifested in the fact that the average errors of the initializations are reduced to a similar level in the corresponding reconstruction outcomes, a trend seen over all bands, regardless of their dispersion. In the synthetic data with an energy spacing of ~18 meV, the average error in the reconstruction is on the order of 40–50 meV for each band, which amounts to an average inaccuracy of <3 bins along the energy dimension at a momentum location. The inaccuracy is, however, dependent on the bin sizes used in pre-processing and the fundamental resolution in the experiment. We have made the code for the MAP reconstruction algorithm and the synthetic data generation publicly accessible from the online repository Fuller⁶⁴ for broader applications.

Visualization strategies

Band-mapping and BS data contain unique multidimensional data structures in materials science that are often presented with specific visualizations motivated by the underlying solid-state physics and symmetry properties. In this Article we select a fixed set of 2D and 3D visualization techniques to illustrate their links and allow comparison with other photoemission studies of the same materials. Typically, ARPES data⁶ of the form I(E, k) are sampled and visualized along a particular path (the k-path⁶⁵) in momentum space^26,27, where only specific high-symmetry positions are labeled with capital letters³. A canonical k-path exists for each space group symmetry setting⁶⁵. Photoemission band mapping generates datasets with a dimensionality of three or higher, and often contains a lower symmetry (in intensity I) as a result of the photoemission matrix elements²⁰ and the experimental conditions. These factors lead to more flexibility in data representation⁵⁸ and motivate the use of alternate k-paths that capture the complexity of the photoemission spectra. In Fig. 1c–f for WSe₂ and Fig. 6a–c for Bi₂Te₂Se, we combine 3D volumetric rendering and 2D k-path views to illustrate both the data symmetry and the intensity modulations present in the data.

To visualize the band dispersion surfaces, E_b(k_x, k_y) (b = 1, 2, ...), we combine 3D stacked surfaces and 2D image sequences, as exemplified in Fig. 2b,d for WSe₂ and Fig. 6d,e for Bi₂Te₂Se. This paired visualization approach balances the strengths and shortcomings of different viewpoints to achieve a comprehensive representation of the data type. The 3D stacked surface representation highlights the entirety and complexity of the data, but often contains occluded regions imperceptible from a fixed viewing direction. The 2D-image-sequence representation includes all energy dispersion information, yet loses the inter-relationship on the energy scale between energy bands, which matters in the event of (anti)crossings. In combining these two approaches, we typically choose the same color map and scale to maintain referenceability between the two representations. For each energy band, the full color scale is used to cover its energy range, becoming the normalized energy (norm. ener.) scale, which illustrates the local detail of the dispersion that otherwise may be hard to discern.

BS calculations

Electronic BSs were calculated within (generalized) DFT using the LDA^66,67, the generalized-gradient approximation (GGA-PBE)⁶⁸ and GGA-PBEsol⁶⁹), and the hybrid XC functional HSE06⁷⁰, which incorporates a fraction of the exact exchange. All calculations were performed with the all-electron, full-potential numeric-atomic orbital code, FHI-aims⁷¹. They were conducted for the geometries obtained by fully relaxing the atomic structure with the respective XC functional to keep the electronic and atomic structures consistent. SOC was included in a perturbational fashion⁷². The momentum grid used for the calculation was equally sampled with a spacing of 0.012 Å⁻¹ in both k_x and k_y directions, which covers the irreducible part of the first Brillouin zone at k_z = 0.35 Å⁻¹, estimated using the inner potential of WSe₂ from a previous measurement¹¹. The calculated BS is symmetrized to fill the entire hexagonal Brillouin zone used to initialize the BS reconstruction and synthetic data generation. We note here that, for MAP reconstruction, the momentum grid size used in the theoretical calculations (such as DFT at various levels as used here) need not be identical to that of the data (or instrument resolution), and in such cases an appropriate upsampling (or downsampling) should be applied to the calculation to match the momentum resolution. Further details are presented in Supplementary Section 4.

BS informatics

The shape feature-space representation of each electronic band is derived from the decomposition

$${E}_{b}({{{\bf{k}}}})=\mathop{\sum}\limits_{l}{a}_{l}{\phi }_{l}({{{\bf{k}}}})={{{\bf{a}}}}\cdot {{{\bf{\Phi }}}}.$$

(5)

Here, k = (k_x, k_y) represents the momentum coordinate, E_b(k) is the single-band dispersion relation (for example, the dispersion surface in 3D), and a_l and ϕ_l(k) are the coefficient and its associated basis term, respectively. The latter are grouped separately into the feature vector, a = (a₁, a₂, ...) and the basis vector, Φ = (ϕ₁, ϕ₂, ...). The orthonormality of the basis is guaranteed within the PBZ of the material:

$${\int}_{{{{\bf{k}}}}\in {{{\varOmega }}}_{{{{\rm{PBZ}}}}}}{\phi }_{m}({{{\bf{k}}}}){\phi }_{n}({{{\bf{k}}}}){\rm{d}}{{{\bf{k}}}}={\delta }_{mn}.$$

(6)

For the hexagonal PBZ of WSe₂, the basis terms are hexagonal ZPs constructed using a linear combination of the circular ZPs via Gram–Schmidt orthonormalization within a regular (that is, equilateral and equiangular) hexagon³⁵. A similar method can be used to generate the ZP-derived orthonormal basis adapted to other boundary conditions³⁵. The representation in feature space³⁴ provides a way to quantify the difference (or distance) d between energy bands or BSs at different resolutions or scales, without additional interpolation. To quantify the shape similarity between energy bands E_b and ${E}_{{b}^{{\prime} }}$, we calculate the cosine similarity using the feature vectors

$${d}_{\cos }({E}_{b},{E}_{{b}^{{\prime} }})=\frac{{{{\bf{a}}}}\cdot {{{{\bf{a}}}}}^{{\prime} }}{| {{{\bf{a}}}}| \cdot | {{{{\bf{a}}}}}^{{\prime} }| },$$

(7)

where the cosine similarity is bounded within [−1, 1], with a value of 0 describing orthogonality of the feature vectors and a value of 1 and −1 describing parallel and anti-parallel relations between them, respectively, both indicating high similarity. The use of cosine similarity in feature space allows comparison of dispersion while being unaffected by their magnitudes. In comparing the dispersion between single energy bands using equation (7), the first term in the polynomial expansion, or the hexagonal equivalent of the Zernike piston⁷³, is discarded as it only represents a constant energy offset (with zero spatial frequency) instead of dispersion, which is characterized by a combination of finite and nonzero spatial frequencies.

The electronic BS is a collection of energy bands ${E}_{B}=\{{E}_{{b}_{i}}\}$ (i = 1, 2, ...). To quantify the distance between two BSs, ${E}_{{B}_{1}}=\{{E}_{{b}_{1,\,i}}\}$ and ${E}_{{B}_{2}}=\{{E}_{{b}_{2,\,i}}\}$, containing the same number of energy bands while ignoring their global energy difference, we first subtract the energy grand mean (that is, the mean of the energy means of all bands within the region of the BS for comparison). We then compute the Euclidean distance, or the ℓ²-norm, for the ith pair of bands, d_b, i:

$${d}_{b,\,i}({E}_{{b}_{1,\,i}},{E}_{{b}_{2,\,i}})=\parallel {{\tilde{{{{\bf{a}}}}}}_{1,\,i}-{\tilde{{{{\bf{a}}}}}}_{2,\,i}\parallel }_{2}=\sqrt{\mathop{\sum}\limits_{l}{({\tilde{a}}_{1,\,il}-{\tilde{a}}_{2,\,il})}^{2}}.$$

(8)

Here, ${\tilde{{{{\bf{a}}}}}}$ denotes the feature vector after subtracting the energy grand mean, so that any global energy shift is removed. We define the BS distance as the average distance over all N_b pairs of bands, or ${d}_{B}({E}_{{B}_{1}},\,{E}_{{B}_{2}})$ = ${\mathop{\sum }\nolimits_{i}^{{N}_{b}}{d}_{b,\,i}({E}_{{b}_{1},\,i},{E}_{{b}_{2},\,i})/{N}_{b}}$. The values of ${d}_{B}({E}_{{B}_{1}},\,{E}_{{B}_{2}})$ are shown in the upper triangle of Fig. 3d and their corresponding standard errors (over the 14 valence bands of WSe₂) in the lower triangle. The distance in equation (8) is independent of basis and allows energy bands calculated on different resolutions or from different materials with the same symmetry (for example, differing only by Brillouin zone size) to be compared.

We use same-resolution error metrics to evaluate the approximation quality of the expansion basis and to quantify the reconstruction outcome with a known ground-truth BS. Specifically, we define the average approximation error (with energy unit), η_avg, for each energy band using the energy difference at every momentum location:

$${\eta }_{{{{\rm{avg}}}}}({E}_{{{{\rm{approx}}}}},{E}_{{{{\rm{recon}}}}})=\sqrt{\frac{1}{{N}_{k}}\mathop{\sum}\limits_{{{{\bf{k}}}}\in {{{\varOmega }}}_{{{{\rm{PBZ}}}}}}{({E}_{{{{\rm{approx}}}},\,{{{\bf{k}}}}}-{E}_{{{{\rm{recon}}}},\,{{{\bf{k}}}}})}^{2}},$$

(9)

where N_k is the number of momentum grid points and the summation runs over the PBZ. In addition, we construct the relative approximation error, η_rel, following the definition of the normwise error⁷⁴ in matrix computation:

$${\eta }_{{{{\rm{rel}}}}}({E}_{{{{\rm{approx}}}}},\,{E}_{{{{\rm{recon}}}}})=\frac{{\left\Vert {E}_{{{{\rm{approx}}}}}-{E}_{{{{\rm{recon}}}}}\right\Vert }_{2}}{{\left\Vert {E}_{{{{\rm{recon}}}}}\right\Vert }_{2}}.$$

(10)

Equations (9) and (10) are used to compute the curves in Fig. 3b as a function of the number of basis terms included in the approximation. The relevant code for the representation using hexagonal ZPs and the computation of the metrics is also accessible in the public repository Fuller⁶⁴.

Data reduction

The raw data and intermediate results are stored in the HDF5 format⁵⁸. The file sizes quoted here for reference are calculated from storage as double-precision floats or integers (for indices). The photoemission band-mapping data of WSe₂ (256 × 256 × 470 bins) have a size of ~235 MB (240,646 kB) after binning from single-event data (7.8 GB or 8,176,788 kB). The reconstructed valence bands at the same resolution occupy ~3 MB (3,352 kB) in storage, and the size further decreases to 46 kB when we store the shape feature vector associated with each band. If only the top-100 coefficients (ranked by the absolute values of their amplitudes) and their indices in the feature vectors are stored, the data amounts to 24 kB. For the case of WSe₂, the top-100 coefficients can approximate the band dispersion with a relative error (equation (10)) of <0.8% for every energy band, as shown in Supplementary Fig. 14.

Data availability

The electronic-structure calculations for WSe₂ are available from the NOMAD repository (https://doi.org/10.17172/NOMAD/2020.03.28-1)⁷⁵. The raw and processed photoemission datasets used in this work for WSe₂ (https://doi.org/10.5281/zenodo.7314278)⁷⁶, Bi₂Te₂Se (https://doi.org/10.5281/zenodo.7317667)⁷⁷ and Au(111) (https://doi.org/10.5281/zenodo.7305241 including DFT calculation)⁷⁸ are available on Zenodo. Source data are provided with this paper.

Code availability

The code developed for band structure reconstruction, including examples, is available on GitHub (https://github.com/mpes-kit/fuller)⁷⁹.

Change history

16 January 2023
In the version of this article initially published online, refs. 76–79 were not presented as doi-based references, while the Editor recognition statement was missing. The changes have been made in the HTML and PDF versions of the article.

References

Isaacs, E. B. & Wolverton, C. Inverse band structure design via materials database screening: application to square planar thermoelectrics. Chem. Mater. 30, 1540–1546 (2018).
Article Google Scholar
Marin, E. G., Perucchini, M., Marian, D., Iannaccone, G. & Fiori, G. Modeling of electron devices based on 2-D materials. IEEE Trans. Electron Devices 65, 4167–4179 (2018).
Article Google Scholar
Bouckaert, L. P., Smoluchowski, R. & Wigner, E. Theory of Brillouin zones and symmetry properties of wave functions in crystals. Phys. Rev. 50, 58–67 (1936).
Article MATH Google Scholar
Chiang, T.-C. & Seitz, F. Photoemission spectroscopy in solids. Ann. Phys. 10, 61–74 (2001).
Article Google Scholar
Damascelli, A., Hussain, Z. & Shen, Z.-X. Angle-resolved photoemission studies of the cuprate superconductors. Rev. Mod. Phys. 75, 473–541 (2003).
Article Google Scholar
Zhang, H. et al. Angle-resolved photoemission spectroscopy. Nat. Rev. Methods Primers 2, 54 (2022).
Article Google Scholar
Schönhense, G., Medjanik, K. & Elmers, H.-J. Space-, time- and spin-resolved photoemission. J. Electron Spectros. Relat. Phenomena 200, 94–118 (2015).
Article Google Scholar
Medjanik, K. et al. Direct 3D mapping of the Fermi surface and Fermi velocity. Nat. Mater. 16, 615–621 (2017).
Article Google Scholar
Puppin, M. et al. Time- and angle-resolved photoemission spectroscopy of solids in the extreme ultraviolet at 500-kHz repetition rate. Rev. Sci. Instrum. 90, 023104 (2019).
Article Google Scholar
Gauthier, A. et al. Tuning time and energy resolution in time-resolved photoemission spectroscopy with nonlinear crystals. J. Appl. Phys. 128, 093101 (2020).
Article Google Scholar
Riley, J. M. et al. Direct observation of spin-polarized bulk bands in an inversion-symmetric semiconductor. Nat. Phys. 10, 835–839 (2014).
Article Google Scholar
Bahramy, M. S. et al. Ubiquitous formation of bulk Dirac cones and topological surface states from a single orbital manifold in transition-metal dichalcogenides. Nat. Mater. 17, 21–28 (2018).
Article Google Scholar
Schröter, N. B. M. et al. Chiral topological semimetal with multifold band crossings and long Fermi arcs. Nat. Phys. 15, 759–765 (2019).
Article Google Scholar
Valla, T. et al. Evidence for quantum critical behavior in the optimally doped cuprate Bi₂Sr₂CaCu₂O_8 + δ. Science 285, 2110–2113 (1999).
Article Google Scholar
Levy, G., Nettke, W., Ludbrook, B. M., Veenstra, C. N. & Damascelli, A. Deconstruction of resolution effects in angle-resolved photoemission. Phys. Rev. B 90, 045150 (2014).
Article Google Scholar
Zhang, P. et al. A precise method for visualizing dispersive features in image plots. Rev. Sci. Instrum. 82, 043712 (2011).
Article Google Scholar
He, Y., Wang, Y. & Shen, Z.-X. Visualizing dispersive features in 2D image via minimum gradient method. Rev. Sci. Instrum. 88, 073903 (2017).
Article Google Scholar
Peng, H. et al. Super resolution convolutional neural network for feature extraction in spectroscopic data. Rev. Sci. Instrum. 91, 033905 (2020).
Article Google Scholar
Kim, Y. et al. Deep learning-based statistical noise reduction for multidimensional spectral data. Rev. Sci. Instrum. 92, 073901 (2021).
Article Google Scholar
Moser, S. An experimentalist’s guide to the matrix element in angle resolved photoemission. J. Electron Spectros. Relat. Phenomena 214, 29–52 (2017).
Article Google Scholar
Murphy, K. P. Machine Learning: A Probabilistic Perspective (MIT Press, 2012).
Ghahramani, Z. Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015).
Article Google Scholar
Wang, C., Komodakis, N. & Paragios, N. Markov random field modeling, inference and learning in computer vision and image understanding: a survey. Comput. Vis. Image Underst. 117, 1610–1627 (2013).
Article Google Scholar
Comer, M. & Simmons, J. The Markov random field in materials applications: a synoptic view for signal processing and materials readers. IEEE Signal Process. Mag. 39, 16–24 (2022).
Article Google Scholar
Kaufmann, K. et al. Crystal symmetry determination in electron diffraction using machine learning. Science 367, 564–568 (2020).
Article Google Scholar
Traving, M. et al. Electronic structure of WSe₂: a combined photoemission and inverse photoemission study. Phys. Rev. B 55, 10392–10399 (1997).
Article Google Scholar
Finteis, T. et al. Occupied and unoccupied electronic band structure of WSe₂. Phys. Rev. B 55, 10400–10411 (1997).
Article Google Scholar
Kormányos, A. et al. k⋅p theory for two-dimensional transition metal dichalcogenide semiconductors. 2D Mater. 2, 022001 (2015).
Article Google Scholar
Stimper, V., Bauer, S., Ernstorfer, R., Scholkopf, B. & Xian, R. P. Multidimensional contrast limited adaptive histogram equalization. IEEE Access 7, 165437–165447 (2019).
Article Google Scholar
Perdew, J. P. & Schmidt, K. Jacob’s ladder of density functional approximations for the exchange-correlation energy. In AIP Conference Proceedings (Eds Doren, V. V. et al.) 1–20 (AIP, 2001).
Golze, D., Dvorak, M. & Rinke, P. The GW Compendium: a practical guide to theoretical photoemission spectroscopy. Front. Chem. 7, 377 (2019).
Article Google Scholar
Zacharias, M., Scheffler, M. & Carbogno, C. Fully anharmonic nonperturbative theory of vibronically renormalized electronic band structures. Phys. Rev. B 102, 045126 (2020).
Article Google Scholar
Zhang, D. & Lu, G. Review of shape representation and description techniques. Pattern Recognit. 37, 1–19 (2004).
Article Google Scholar
Khotanzad, A. & Hong, Y. Invariant image recognition by Zernike moments. IEEE Trans. Pattern Anal. Mach. Intell. 12, 489–497 (1990).
Article Google Scholar
Mahajan, V. N. & Dai, G.-m. Orthonormal polynomials in wavefront analysis: analytical solution. J. Opt. Soc. Am. A 24, 2994–3016 (2007).
Article Google Scholar
Himanen, L., Geurts, A., Foster, A. S. & Rinke, P. Data driven materials science: status, challenges and perspectives. Adv. Sci. 6, 1900808 (2019).
Article Google Scholar
Horton, M. K., Dwaraknath, S. & Persson, K. A. Promises and perils of computational materials databases. Nat. Comput. Sci. 1, 3–5 (2021).
Article Google Scholar
Kiureghian, A. D. & Ditlevsen, O. Aleatory or epistemic? Does it matter? Struct. Safety 31, 105–112 (2009).
Article Google Scholar
Nocedal, J. & Wright, S. J. Numerical Optimization 2nd edn (Springer, 2006).
Xian, R. P., Ernstorfer, R. & Pelz, P. M. Scalable multicomponent spectral analysis for high-throughput data annotation. Preprint at https://arxiv.org/abs/2102.05604 (2021).
Smith, M. W. Roughness in the Earth Sciences. Earth Sci. Rev. 136, 202–225 (2014).
Article Google Scholar
Guo, H. et al. Double resonance Raman modes in monolayer and few-layer MoTe₂. Phys. Rev. B 91, 205415 (2015).
Article Google Scholar
Heremans, J. P., Cava, R. J. & Samarth, N. Tetradymites as thermoelectrics and topological insulators. Nat. Rev. Mater. 2, 17049 (2017).
Article Google Scholar
Ehrhardt, M. & Koprucki, T. (eds) Multi-band effective mass approximations. In Lecture Notes in Computational Science and Engineering Vol. 94 (Springer, 2014).
Scheffler, M. et al. FAIR data enabling new horizons for materials research. Nature 604, 635–642 (2022).
Article Google Scholar
Kordyuk, A. A. et al. Bare electron dispersion from experiment: self-consistent self-energy analysis of photoemission data. Phys. Rev. B 71, 214513 (2005).
Article Google Scholar
Noack, M. M. et al. Gaussian processes for autonomous data acquisition at large-scale synchrotron and neutron facilities. Nat. Rev. Phys. 3, 685–697 (2021).
Article Google Scholar
Beaulieu, S. et al. Ultrafast dynamical Lifshitz transition. Sci. Adv. 7, eabd9275 (2021).
Article Google Scholar
Curcio, D. et al. Accessing the spectral function in a current-carrying device. Phys. Rev. Lett. 125, 236403 (2020).
Article Google Scholar
Wilson, N. R. et al. Determination of band offsets, hybridization, and exciton binding in 2D semiconductor heterostructures. Sci. Adv. 3, e1601832 (2017).
Article Google Scholar
Ulstrup, S. et al. Nanoscale mapping of quasiparticle band alignment. Nat. Commun. 10, 3283 (2019).
Article Google Scholar
Ewings, R. et al. Horace: software for the analysis of data from single crystal spectroscopy experiments at time-of-flight neutron instruments. Nucl. Instrum. Methods Phys. Res. A 834, 132–142 (2016).
Article Google Scholar
Whittaker, C. E. et al. Exciton polaritons in a two-dimensional Lieb lattice with spin-orbit coupling. Phys. Rev. Lett. 120, 097401 (2018).
Article Google Scholar
Frölich, A., Fischer, J., Wolff, C., Busch, K. & Wegener, M. Frequency-resolved reciprocal-space mapping of visible spontaneous emission from 3D photonic crystals. Adv. Opt. Mater. 2, 849–853 (2014).
Article Google Scholar
Amenabar, I. et al. Hyperspectral infrared nanoimaging of organic samples based on Fourier transform infrared nanospectroscopy. Nat. Commun. 8, 14402 (2017).
Article Google Scholar
von Rueden, L. et al. Informed machine learning—a taxonomy and survey of integrating prior knowledge into learning systems. IEEE Trans. Knowl. Data Eng. 35, 614–633 (2023).
Oelsner, A. et al. Microspectroscopy and imaging using a delay line detector in time-of-flight photoemission microscopy. Rev. Sci. Instrum. 72, 3968–3974 (2001).
Article Google Scholar
Xian, R. P. et al. An open-source, end-to-end workflow for multidimensional photoemission spectroscopy. Sci. Data 7, 442 (2020).
Article Google Scholar
SPECS GmbH. METIS 1000 Brochure (SPECS, 2019); https://www.specs-group.com/fileadmin/user_upload/products/brochures/SPECS_Brochure-METIS_RZ_web.pdf
Xian, R. P., Rettig, L. & Ernstorfer, R. Symmetry-guided nonrigid registration: the case for distortion correction in multidimensional photoemission spectroscopy. Ultramicroscopy 202, 133–139 (2019).
Article Google Scholar
Kittler, J. & Föglein, J. Contextual classification of multispectral pixel data. Image Vision Comput. 2, 13–29 (1984).
Article Google Scholar
Abadi, M. et al. TensorFlow: large-scale machine learning on heterogeneous distributed systems. Preprint at https://arxiv.org/abs/1603.04467 (2016).
Li, S. Markov Random Field Modeling in Image Analysis 3rd edn (Springer, 2009).
Stimper, V. & Xian, R. P. Fuller. https://github.com/mpes-kit/fuller
Hinuma, Y., Pizzi, G., Kumagai, Y., Oba, F. & Tanaka, I. Band structure diagram paths based on crystallography. Comput. Mater. Sci. 128, 140–184 (2017).
Article Google Scholar
Ceperley, D. M. & Alder, B. J. Ground state of the electron gas by a stochastic method. Phys. Rev. Lett. 45, 566–569 (1980).
Article Google Scholar
Perdew, J. P. & Wang, Y. Accurate and simple analytic representation of the electron-gas correlation energy. Phys. Rev. B 45, 13244–13249 (1992).
Article Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Article Google Scholar
Perdew, J. P. et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. Lett. 100, 136406 (2008).
Article Google Scholar
Heyd, J., Scuseria, G. E. & Ernzerhof, M. Hybrid functionals based on a screened Coulomb potential. J. Chem. Phys. 118, 8207–8215 (2003).
Article Google Scholar
Blum, V. et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 180, 2175–2196 (2009).
Article MATH Google Scholar
Huhn, W. P. & Blum, V. One-hundred-three compound band-structure benchmark of post-self-consistent spin-orbit coupling treatments in density functional theory. Phys. Rev. Mater. 1, 033803 (2017).
Article Google Scholar
Wyant, J. C. & Creath, K. in Applied Optics and Optical Engineering Vol. Xl (eds Shannon, R R. & Wyant, J. C.) 1–53 (Academic Press, 1992).
Watkins, D. S. Fundamentals of Matrix Computations 3rd edn (Wiley, 2010).
Zacharias, M. & Carbogno, C. First-principles calculations for 2H-WSe₂. NOMAD Repository https://nomad-lab.eu/prod/rae/gui/dataset/id/CS7f_obIQd6hE3-2JHfSuw (2020).
Xian, R. P. et al. Dataset of photoemission valence-band mapping and band reconstruction of 2H-WSe₂. Zenodo https://doi.org/10.5281/zenodo.7314278 (2022).
Dendzik, M. et al. Excited-state photoemission band mapping data of the topological insulator Bi₂Te₂Se. Zenodo https://doi.org/10.5281/zenodo.7317667 (2022).
Dendzik, M. et al. Synchrotron bulk photoemission data from Au(111) and DFT calculations. Zenodo https://doi.org/10.5281/zenodo.7305241 (2022).
Xian, R. P. et al. Fuller: code and examples for the band structure reconstruction workflow. Zenodo https://doi.org/10.5281/zenodo.7325584 (2022).

Download references

Acknowledgements

We thank M. Scheffler for fruitful discussions and S. Schülke and G. Schnapka at Gemeinsames Netzwerkzentrum (GNZ) in Berlin and M. Rampp at Max Planck Computing and Data Facility (MPCDF) in Garching for support on the computing infrastructure. The work was partially supported by BiGmax, the Max Planck Society’s Research Network on Big-Data-Driven Materials-Science, the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grants 740233 and ERC-2015-CoG-682843), the German Research Foundation (DFG) through the Emmy Noether programme under grant no. RE 3977/1, the SFB/TRR 227 ‘Ultrafast Spin Dynamics’ (project-ID 328545488, projects A09 and B07) and the NOMAD pillar of the FAIR-DI e.V. association. We thank M. Bremholm for providing the Bi₂Te₂Se samples, and Ph. Hofmann and M. Bianchi for their support in obtaining Au(111) photoemission data. M.D. acknowledges support from the Göran Gustafssons Foundation. S. Beaulieu acknowledges financial support from the Banting Fellowship from the Natural Sciences and Engineering Research Council (NSERC) in Canada.

Funding

Open access funding provided by Max Planck Society.

Author information

R. Patrick Xian
Present address: Department of Mechanical Engineering, University College London, London, UK
Marios Zacharias
Present address: Université de Rennes, INSA Rennes, CNRS, Institut FOTON, Rennes, France
Maciej Dendzik
Present address: Department of Applied Physics, KTH Royal Institute of Technology, Stockholm, Sweden
Samuel Beaulieu
Present address: Université de Bordeaux-CNRS-CEA, CELIA, Talence, France
Stefan Bauer
Present address: Division of Decision and Control Systems, KTH Royal Institute of Technology, Stockholm, Sweden
These authors contributed equally: R. Patrick Xian, Vincent Stimper.

Authors and Affiliations

Fritz Haber Institute of the Max Planck Society, Berlin, Germany
R. Patrick Xian, Marios Zacharias, Maciej Dendzik, Shuo Dong, Samuel Beaulieu, Martin Wolf, Laurenz Rettig, Christian Carbogno & Ralph Ernstorfer
Department of Empirical Inference, Max Planck Institute for Intelligent Systems, Tübingen, Germany
Vincent Stimper, Bernhard Schölkopf & Stefan Bauer

Authors

R. Patrick Xian
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Stimper
View author publications
You can also search for this author in PubMed Google Scholar
Marios Zacharias
View author publications
You can also search for this author in PubMed Google Scholar
Maciej Dendzik
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Dong
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Beaulieu
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard Schölkopf
View author publications
You can also search for this author in PubMed Google Scholar
Martin Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Laurenz Rettig
View author publications
You can also search for this author in PubMed Google Scholar
Christian Carbogno
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Bauer
View author publications
You can also search for this author in PubMed Google Scholar
Ralph Ernstorfer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.P.X. and R.E. conceived and coordinated the project. The photoemission band-mapping experiments were supervised by L.R., R.E. and M.W. S.D. and S. Beaulieu acquired the data on WSe₂, and M.D. acquired the data on Bi₂Te₂Se and Au(111). M.Z., M.D. and C.C. performed the DFT BS calculations. R.P.X. and M.D. processed the raw data. R.P.X. devised the BS digitization, algorithm validation schemes and metrics, and performed computational benchmarking. V.S. designed and implemented the machine-learning algorithm under the supervision of S. Bauer and B.S., along with input from R.P.X. R.P.X. and V.S. co-wrote the first draft of the manuscript with contributions from M.Z. and M.D. All authors contributed to discussions and revision of the manuscript to its final version.

Corresponding authors

Correspondence to R. Patrick Xian, Vincent Stimper, Stefan Bauer or Ralph Ernstorfer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Jie Pan, in collaboration with the Nature Computational Science team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–14, Tables 1–4, derivations of formulae, extended numerical validations and discussion.

Supplementary Video 1

Left side shows the position of the cut viewed from the first projected Brillouin zone of WSe₂. Right side shows the corresponding 2D cut in (k_y, E) coordinates from volumetric band mapping data, overlaid with the DFT calculation performed at the LDA level (LDA-DFT), used to initialize the reconstruction, and the resulting 14 reconstructed valence bands.

Supplementary Video 2

Left side shows the position of the cut viewed from the first projected Brillouin zone of WSe₂. Right side shows the corresponding 2D cut in (k_x, E) coordinates from volumetric band mapping data, overlaid with the DFT calculation performed at the LDA level (LDA-DFT), used to initialize the reconstruction, and the resulting 14 reconstructed valence bands.

Supplementary Video 3

The video explores the reconstructed valence bands from photoemission band mapping data on WSe₂ using LDA-level DFT calculation as the initialization. It illustrates the generation of an exploded view of the bands from the original reconstruction, the bands viewed collectively from different angles and the individual view of each band.

Source data

Source Data Fig. 1

Numerical data contained in Fig. 1c–f.

Source Data Fig. 2

Numerical data contained in Fig. 2c,d.

Source Data Fig. 3

Numerical data contained in Fig. 3.

Source Data Fig. 4

Numerical data contained in Fig. 4.

Source Data Fig. 5

Numerical data contained in Fig. 5c,e.

Source Data Fig. 6

Numerical data contained in Fig. 6b–d,f,g.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xian, R.P., Stimper, V., Zacharias, M. et al. A machine learning route between band mapping and band structure. Nat Comput Sci 3, 101–114 (2023). https://doi.org/10.1038/s43588-022-00382-2

Download citation

Received: 01 March 2022
Accepted: 17 November 2022
Published: 30 December 2022
Issue Date: January 2023
DOI: https://doi.org/10.1038/s43588-022-00382-2