Unveiling quasiparticle dynamics of topological insulators through Bayesian modelling

Tokuda, Satoru; Souma, Seigo; Segawa, Kouji; Takahashi, Takashi; Ando, Yoichi; Nakanishi, Takeshi; Sato, Takafumi

doi:10.1038/s42005-021-00673-6

Article
Open access
Published: 27 July 2021

Communications Physics volume 4, Article number: 170 (2021)

Abstract

Quasiparticle - a key concept to describe interacting particles - characterizes electron-electron interaction in metals (Fermi liquid) and electron pairing in superconductors. While this concept essentially relies on the simplification of hard-to-solve many-body problem into one-particle picture and residual effects, a difficulty in disentangling many-body effects from experimental quasiparticle signature sometimes hinders unveiling intrinsic low-energy dynamics, as highlighted by the fierce controversy on the origin of Dirac-band anomaly in graphene and dispersion kink in high-temperature superconductors. Here, we propose an approach to solve this fundamental problem - the Bayesian modelling of quasiparticles. We have chosen a topological insulator TlBi(S,Se)₂ as a model system to formulate an inverse problem of quasiparticle spectra with semiparametric Bayesian analysis, and successfully extracted one-particle and many-body characteristics, i.e. the intrinsic energy gap and unusual lifetime in Dirac-quasiparticle bands. Our approach is widely applicable to clarify the quasiparticle dynamics of quantum materials.

Introduction

Low-energy excitations in interacting electronic systems are known to be characterized by quasiparticles. The concept of quasiparticle was originally proposed in Landau’s Fermi-liquid theory wherein strongly interacting electrons share a similar behavior with weakly interacting counterparts. Instead of strongly interacting bare electrons (holes), one can define dressed electrons (holes) as elementary excitations (i.e., quasiparticles), which can be understood by extending the framework of single-particle approximation. While the Fermi-liquid theory successfully captured low-energy dynamics of normal metals and ³He, the quasiparticle concept is nowadays applied widely in solids such as electron systems interacting with lattice vibrations (phonons) and spin excitations (magnons). Angle-resolved photoemission spectroscopy (ARPES) has played a pivotal role in uncovering key quasiparticle properties by capturing the energy dispersion (E–k relation) and lifetime of, e.g., Bogoliubov quasiparticles associated with the superconducting Cooper pairing in high-temperature superconductors^1,2,3 and mass-renormalized quasiparticles caused by strong electron–phonon coupling on metal surfaces and quasi-two-dimensional (quasi-2D) materials^4,5,6. As highlighted by these examples, for the understanding of the origin and mechanism of exotic physical properties of novel materials, it is crucial to experimentally establish the nature of quasiparticles.

To elucidate the quasiparticle dynamics, it is desirable to be able to unambiguously extract the original single-particle band dispersion (bare-band dispersion, E_k) and many-body effects (self-energy, Σ) from the ARPES data. Both of these physical quantities are directly linked to the ARPES spectrum through the spectral function expressed as,

$$A\left(k,\omega\right) = \frac{1}{\pi}\frac{-\mathrm{Im}\,\Sigma\left(\mathbf{k},\omega\right)}{\left[\omega - E_k - \mathrm{Re}\,\Sigma\left(\mathbf{k},\omega\right)\right]^2 + \left[\mathrm{Im}\,\Sigma\left(\mathbf{k},\omega\right)\right]^2}$$

(1)

where ω is the energy with respect to the Fermi level (E_F). Many attempts have been hitherto made to extract the intrinsic Σ from ARPES data by assuming a reasonable shape of E_k. For example, E_k was referenced to the band calculation obtained with the local density approximation (e.g., refs. ^7,8), or it was empirically approximated with a polynomial function (e.g., linear or parabola^9,10). While such data analysis certainly gave insights into the quasiparticle dynamics, one often faced a serious problem in clarifying the nature of many-body interactions. This is represented by the fierce debates on the absence or appearance of an intrinsic energy gap at the Dirac point in epitaxial single-layer graphene^11,12, which is critical for feasible application of graphene as a semiconductor device. Also, the origin of dispersion kink in cuprate superconductors (phononic, magnetic, or others) has been controversial for more than a decade¹³, and its relationship with the high-T_c mechanism is yet to be clarified. These controversies partially originate from a few assumptions one had to make to extract E_k and Σ.
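As a minimal numerical sketch of Eq. (1), the function below evaluates the spectral function at a single (k, ω) point for given values of E_k, ReΣ, and ImΣ. The function name and the numbers used (a bare band at −0.40 eV, a 10 meV real shift, 20 meV broadening) are illustrative assumptions, not values from this study. It shows that the peak sits at ω = E_k + ReΣ with height 1/(π|ImΣ|) and falls to half that height at a detuning of |ImΣ|.

```python
import math


def spectral_function(omega, bare_energy, re_sigma, im_sigma):
    """Eq. (1): A = (1/pi) * (-Im Sigma) / ((omega - E_k - Re Sigma)^2 + (Im Sigma)^2)."""
    detuning = omega - bare_energy - re_sigma
    return (-im_sigma / math.pi) / (detuning ** 2 + im_sigma ** 2)


# Illustrative numbers only (eV); Im Sigma is negative for a retarded self-energy.
E_K, RE_SIGMA, IM_SIGMA = -0.40, 0.01, -0.02

# Height at the renormalized peak position omega = E_k + Re Sigma.
peak_height = spectral_function(E_K + RE_SIGMA, E_K, RE_SIGMA, IM_SIGMA)

# Height one |Im Sigma| away from the peak (the half-maximum point).
half_height = spectral_function(E_K + RE_SIGMA + abs(IM_SIGMA), E_K, RE_SIGMA, IM_SIGMA)
```

The peak position therefore tracks E_k + ReΣ rather than E_k itself, which is why the bare band and the self-energy have to be estimated together.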

To overcome these problems, we apply semiparametric Bayesian modeling to ARPES data. We start by introducing the basics of Bayesian analysis through a simple demonstration. As a prototypical example, we model the ARPES intensity of a topological insulator (TI) TlBi(S,Se)₂¹⁴ based on the parametric form of bare-band dispersion and nonparametric forms of any other elements to perform the semiparametric Bayesian analysis of spectral function. We provide a clear insight into one-particle and many-body characteristics of TlBi(S,Se)₂ by successfully extracting bare-band dispersion and self-energy.

Results

Basics of Bayesian analysis

First, we explain the basic concept of Bayesian analysis by showing its application to an energy distribution curve (EDC) contributed by multiple bands. A common approach in extracting the peak positions (i.e., contributing energy bands) is to find a good reproduction of the experimental EDCs by simulated EDCs using the least-square method. However, this method cannot pin down which class of model (e.g., how many bands are contributing) is the best for given data, often posing a question on the basic applicability of the model itself. To demonstrate this problem, we show in Fig. 1a a representative experimental EDC (dots) together with the result of numerical fittings (red curves) using the least-square method assuming the existence of intrinsic single, double, and triple Lorentzian peaks (blue curves; the number of peaks K = 1–3) that represent three different classes of models. One can immediately recognize that the model with triple Lorentzian peaks shows the best fit to the experimental data. This is natural because the inclusion of more peaks (more parameters) always leads to a decrease in mean square error (shown by red line in Fig. 1b). However, it does not validate that the actual number of peaks is three. One needs to select the most appropriate model (in this case, the number of peaks) to suitably reproduce the EDC. Importantly, such selection should not include arbitrariness and must not rely on “human eyes.” The Bayesian framework, as an extension of the least squares, enables the evaluation of the model’s appropriateness itself in terms of the posterior probability derived from the chain rule of probability, called Bayes’ formula (see Methods). One can see from Fig. 1b that the double-peak model (K = 2) has the highest probability (77%) among K = 1–5.
Whereas the least-square method determines a unique solution of the parameter set as a global minimum of the mean square error, the Bayesian framework treats the “statistical ensemble” of numerous solutions as random variables with the Boltzmann distribution, called the posterior probability distribution of the parameter set. Namely, many solutions for peak position and peak width are plotted in the parameter space and colored with the posterior probability density proportional to the Boltzmann factor, where the point with the highest posterior probability density corresponds to the best-fit parameters (least-square solution) as highlighted in Fig. 1c [note that “marginal” posterior probability density is plotted in Fig. 1c, because the intensity of each peak (not shown) is also a model parameter]. Like the canonical ensemble, a statistical ensemble of Bayesian analysis is conditioned on three principal factors: the number of observed data points (e.g., the number of meshes in an ARPES image), the number of parameters in each model class, and the “temperature.” This temperature has nothing to do with the actual temperature of the sample and is related to the signal-to-noise ratio of observed data, affecting the uncertainty of model parameters. If the “temperature” is treated as a hyper-parameter, it can be estimated by maximizing the “partition function,” which is connected with the posterior probability of model class (Fig. 1b) through Bayes’ formula (see Methods).

**Fig. 1: Application of Bayesian analysis to the energy distribution curve.**

Semiparametric model of ARPES intensity

Now that the basic scheme of the Bayesian analysis is demonstrated for a single spectrum, we apply this analysis to the actual 2D ARPES intensity through a semiparametric modeling of quasiparticle bands. For a testbed system, we have chosen TlBi(S_1−xSe_x)₂ (x = 0.8)¹⁴, where a slight intensity suppression around the Dirac point is seen as shown in Fig. 2a, but the origin of such unusual gap-like behavior has been a target of intensive debates^{15,16,17,18,19,20,21,22,23,24}. Since the Dirac gap is a prerequisite for realizing some exotic topological quantum phenomena^25,26,27, it is important to establish whether the bare band E_k of the Dirac-cone state is gapless (Fig. 2b) or gapped (Fig. 2c) to pin down the mechanism of unusual Dirac-band anomaly (it is noted here, if the chemical potential is situated within the gap, physical properties are mainly governed by the “quasiparticle band gap,” which is a combination of the bare-band gap and the self-energy effect). It is worth noting that such absence or appearance of the Dirac gap is also critical for the classification of magnetic–TI and axion–insulator phases, as highlighted by a fierce debate on whether or not the surface state hosts the Dirac gap associated with the time-reversal symmetry-breaking magnetic order in MnBi₂Te₄ and related compounds (see, e.g., refs. ^{28,29,30,31,32,33}). In our Bayesian analysis, we treat such distinct gapless and gapped states as different model classes and judge the validity of the models for given ARPES data. As shown in the bottom of Fig. 2a–c, we assume that E_k is represented by the function E_s(k) for the band index s = ±1 (+1 for upper, −1 for lower Dirac cones), parametrized by the binding energy (E_B) at the Dirac point ω_DP, the band asymmetry α (this parameter is associated with the effective-mass asymmetry of bulk conduction and valence bands^34,35), a band parameter γ, and the half-width of band gap Δ (Δ = 0 for gapless, Δ > 0 for gapped). 
Our first goal is to extract the actual single-particle spectral function A_s(k, ω) from the ARPES data by estimating a parameter set of {ω_DP, α, γ, Δ} ({ω_DP, α, γ} for Δ = 0 case) and also obtain a concrete form of self-energy Σ(k, ω) (for simplicity, we assume that Σ is k independent because the k range of interest is sufficiently small).

**Fig. 2: Semiparametric Bayesian modeling of the gapped Dirac-cone surface state.**

As highlighted in Fig. 2d, the ARPES intensity is composed not only of intrinsic spectral function A_s(k, ω) for the Dirac-cone states but also of the spectral weight from other features such as bulk states and spectral background. Here all these “background” states are represented by a single ω-dependent function B(ω). We also assumed independent matrix-elements M_s(k) and M_B(k) for the photoelectron intensities¹³ for the Dirac cone and the background, respectively, and simulated the total ARPES intensity I(k, ω) by neglecting the instrumental resolution. Noticeably, it is not necessary to assume any particular analytical forms of Σ(ω), B(ω), M_s(k), and M_B(k) (note that Σ is chosen to satisfy the Kramers–Kronig relation under assumption of the particle–hole symmetry) and we could approximately treat them as vectors whose elements are values of each function at every observed point (ω, k), such as {ImΣ(ω₁), ImΣ(ω₂), …, ImΣ(ω_n)} at {ω₁, ω₂, …, ω_n}.

Semiparametric Bayesian analysis of TlBi(S,Se)₂

Based on the above modeling of an ARPES image, we formulate the semiparametric Bayesian analysis of the overall spectral quantities, E_s(k), Σ(ω), B(ω), M_s(k), and M_B(k) and implement this analysis by a basic algorithm for Bayesian analyses (see Methods). First, we have validated our methodology by a demonstration using mimic ARPES images in which the electronic structures are predefined as ground truths (see Supplementary Note 1 and Supplementary Figure 1). Then we have applied our methodology to the actual ARPES image containing sufficiently fine E–k mesh (111 × 111 for Fig. 2a) and have succeeded in simultaneously estimating all the spectral quantities [specifically, 559 (558) scalar variables for the gapped (gapless) case].

Now we examine whether the Dirac-cone state is gapped or not by using gapless (Δ = 0) and gapped (Δ > 0) models that take into account all the above spectral contributions in the Bayesian analysis. Such examination is an obvious advantage of the Bayesian analysis and can hardly be carried out by the standard least-square method. One can immediately recognize in the left inset of Fig. 3a that the probability for Δ = 0 is negligibly small as opposed to the case for Δ > 0 (100% within computational uncertainty), indicating that the Dirac gap is indeed realized in TlBi(S_0.2Se_0.8)₂, consistent with the previous study¹⁴. Then we estimated the posterior probability distribution of Δ for the gapped models, as shown by histogram in Fig. 3a, where the vertical axis corresponds to the probability density. As can be seen, estimated Δ values are sharply distributed at 44.3 ± 0.3 meV, as better visualized in the magnified view in the right inset. This suggests that the energy gap can be estimated with higher accuracy and reliability through our Bayesian analysis compared to the EDC analysis applied thus far to TlBi(S_1−xSe_x)₂^14,16.

**Fig. 3: Extraction of essential band parameters from the semiparametric Bayesian analysis.**

The histogram for Δ shown in Fig. 3a is obtained by integrating all the other (558) parameters so that the posterior probability distribution for the other parameters cannot be seen from the plot. We show in Fig. 3b the posterior probability distribution against three essential band parameters ω_DP, Δ, and γ in the three-dimensional (3D) density scatter plot (note that the other 556 parameters are integrated out). One can see that the data points are sharply focused in the narrow region of the (ω_DP, γ, Δ) parameter space. This is also visualized by plotting the distribution against two parameter sets, i.e., Δ and ω_DP in Fig. 3c (Δ and γ in Fig. 3d) in the 2D density scatter plots, which are obtained by integrating γ (ω_DP). From these results, we obtain (ω_DP, Δ, γ) = (0.397 ± 0.001 eV, 44.3 ± 0.3 meV, 3.31 ± 0.02 eV·Å) as the mean and standard deviation of distributed parameter sets. This demonstrates that the Bayesian analysis is useful not only to estimate the intrinsic band parameters from ARPES data but also to see a correlation between different band parameters; these characteristics can hardly be obtained by the conventional data analysis.

One can further confirm the validity of the band model used in our Bayesian analysis by seeing that the experimental data are very well reproduced numerically. A side-by-side comparison of the experimental ARPES image and the numerical semiparametric regression function I(k, ω) (the mean values of band parameters are used) in Fig. 3e, f signifies the almost identical intensity distribution except for a higher noise level in the experiment. Such a good matching is highlighted by the obviously weak and featureless subtracted intensity in Fig. 3g. Because all the parameters are obtained from our Bayesian analysis, now we are able to show any of A_s(k, ω), M_s(k), M_B(k), B(ω), and Σ(ω) by a 2D intensity image (see Supplementary Note 2 and Supplementary Figure 2). As an example, we show in Fig. 3h, i spectral functions for the upper and lower Dirac cones A₊₁(k, ω) and A₋₁(k, ω), independently. The result signifies that the dispersion of each Dirac-cone branch is rounded around the Dirac point due to the Dirac-gap opening.

To highlight the degree of agreement between the experiment and Bayesian modeling, we plot in Fig. 3j the experimental EDC at the Γ point (blue open circles) together with the numerically fitted EDC (red solid curve) that includes the background B(ω) besides the peaks from the upper and lower Dirac cones. One can see that the experimental EDC is well reproduced by the fitting curve. The apparent difference in the energy position between the upper and lower peaks demonstrates the existence of a finite Dirac gap, as also corroborated by the extracted bare-band dispersions E₊₁(k) and E₋₁(k) shown by dashed curves in Fig. 3h, i. This suggests that the energy gap opens in the original bare band and further indicates that the experimental suppression of spectral weight at the Dirac point cannot be understood by assuming the strongly ω-dependent self-energy effect for the gapless Dirac cone, distinct from the case of graphene on SiC¹¹.

One might expect that the application of Bayesian analysis to a single experimental EDC would be sufficient for concluding a finite Dirac gap in the bare-band dispersion. However, this is not the case because the background shape can be arbitrarily chosen for the sake of just numerically reproducing the single EDC. The analysis of 2D ARPES image itself, in which the background (and matrix element and self-energy as well) is a continuous function of k and ω, is essential for extracting the intrinsic band parameters. Also, the contribution from the ω-dependent self-energy that causes the peak shift and asymmetry in the spectral line shape can never be captured by the analysis of single EDC.

One-body and many-body characteristics of TlBi(S,Se)₂

Thanks to the extraction of intrinsic Dirac gap through our Bayesian analysis, we can access the intrinsic many-body interactions, which was not possible in the previous studies owing essentially to the uncertainty in determining the bare-band dispersion. We plot in Fig. 4a the real and imaginary parts of self-energy ReΣ(ω) and |ImΣ(ω)| simultaneously extracted with Δ = 44.3 ± 0.3 meV from our Bayesian analysis for the gapped-state model, compared with those obtained from the gapless-state model where Δ is intentionally fixed to 0 meV (the invalid case; see also Fig. 4c). One can see the overall smooth ω dependence of both ReΣ and ImΣ for Δ = 44.3 ± 0.3 meV (Fig. 4a), whereas there exists an unusual hump feature around ω_DP in both ReΣ and ImΣ for Δ = 0 (Fig. 4b). Such anomaly is unphysical and associated with an artifact originating from the assumption of gapless Dirac-cone state despite a finite Dirac gap. In fact, when the Δ value is properly incorporated in Fig. 4a, such anomaly disappears. One can see from the self-energy plot in Fig. 4a that |ImΣ| which reflects the quasiparticle scattering rate (inversely proportional to the quasiparticle lifetime) has a broad maximum at around ω ~ 0.15 eV, whereas it shows a minimum at ~0.4 eV, around ω_DP. The lower scattering rate on approaching ω_DP is reasonable when we consider the available phase space of the Dirac-cone states, because the phase space should monotonically increase on moving away from ω_DP due to the expansion of equi-energy contour in k space, as can be seen from Fig. 4d. It is emphasized, however, that the broad hump seen in |ImΣ| cannot be understood by this argument, requiring the presence of additional scattering channel. As a possible source of this channel, we point out the bulk conduction band that has a bottom at ω_CB ~ 0.15 eV (see Fig. 2a). 
When ω is located in the energy range of bulk conduction band (i.e., ω < ω_CB), the surface-bulk inter-band scattering would take place besides the intra-surface-band scattering, leading to the nonmonotonic behavior of |ImΣ| around ω_CB. As shown in Fig. 4a, one can also recognize that |ReΣ| becomes maximally 40 meV, comparable to the size of Dirac gap. This suggests that the influence of self-energy effects cannot be neglected in the band dispersion; in particular, the bare-band dispersion cannot be determined by simply tracing the peak maxima of EDCs.

**Fig. 4: Signature of many-body effects in the Dirac electrons.**

Discussion

The present study sheds light on the fiercely debated origin of Dirac gap in TIs. The Dirac gap of TlBi(S_1−xSe_x)₂ (as well as those seen in magnetically doped TIs) has been interpreted in terms of many different scenarios standing either on the intrinsically massive Dirac fermions or the massless ones. The former involves the hybridization between surface and interface Dirac cones³⁶, hybridization with impurity bands¹⁹, local symmetry breaking²¹, disorder-driven topological phase transition²⁰, and chemical-inhomogeneity-induced smearing of band inversion²³. The latter based on the massless Dirac fermions can be associated with the extremely strong coupling with collective modes (including phonon¹⁷ and plasmaron¹¹), spin dephasing²⁴, exciton pairing²², the final-state effect, etc. The present study that applies the semiparametric Bayesian modeling to ARPES data suggests that the latter approach is unlikely to be responsible for the Dirac gap. To be more specific, taking into account the intrinsic gap on the bare band as well as the behavior of self-energy around ω_DP in Fig. 4a that can be basically explained in terms of the phase-space argument for the ordinary Dirac electrons, it is suggested that many-body effects such as the electron–electron scattering and the electron–mode coupling, as intensively discussed in strongly correlated systems like high-temperature superconductors, are not responsible for the formation of massive Dirac fermions in TlBi(S_1−xSe_x)₂. It is thus inferred that the observed gap in TlBi(S_1−xSe_x)₂ is different from the “gap” seen in the Dirac-cone band of graphene that was suggested to be associated with the many-body interactions¹¹.

Since the above consideration supports an intrinsically massive Dirac fermion in TlBi(S_1−xSe_x)₂, it would be useful to compare the present result with the results for magnetic TIs where the Dirac gap is expected to open due to the time-reversal symmetry breaking but not due to the exotic many-body interactions. The magnitude of experimental Dirac gap in the magnetic TIs such as MnBi₂Te₄, e.g., refs. ^{28,29,30,31,32,33}, and TIs proximitized with ferromagnets is very small or even undetectable by ARPES, in contrast to the sizable Dirac-gap magnitude of 2Δ = 88.6 meV revealed by the Bayesian analysis for TlBi(S_0.2Se_0.8)₂. This result, together with the fact that TlBi(S_1−xSe_x)₂ shows no magnetic order, suggests that a possibility of local time-reversal symmetry breaking due to the local magnetic order is ruled out to account for the observed Dirac gap.

The semiparametric Bayesian modeling of ARPES data proposed in this study can be widely applicable to various Dirac-electron systems where the interplay among the Dirac gap, symmetry breaking, and many-body interactions is of interest, as represented by the Dirac-band anomaly in magnetic TIs, axion insulators, and graphene. Also, when the appropriate analytical form of bare band is established, the Bayesian-based approach would work effectively in a wider variety of systems characterized by the band anomaly occurring in a small energy scale, such as the spin–orbit gap due to the band inversion, the small band splitting associated with the spin–orbit coupling, and the dispersion kink due to the electron–mode coupling.

Methods

Bayes’ formula

Throughout our analyses, the chain rule of probability Pr(B|A) = Pr(A|B) Pr(B)/Pr(A) for random variables A and B, called Bayes’ formula, was utilized. For the parameter estimation in the EDC analysis, B corresponds to the set {width, position, intensity} for each peak (parameter set), while A corresponds to the set {EDC data, the model class (peak number K), “temperature”} (note that the inset to Fig. 1c represents this Pr(B|A), referred to as posterior probability distribution of the parameter set). Under Bayes’ formula, what one needs to carry out is the modeling of Pr(A|B) and Pr(B), called here the likelihood function and prior probability distribution, respectively. Once these models are formulated, their appropriateness can also be evaluated by Bayes’ formula with the relation Pr(A) = Σ_B Pr(A|B) Pr(B). For the model selection in the EDC analysis, the relation Pr(K|EDC data) = Pr(EDC data|K) Pr(K)/Pr(EDC data) was utilized, where Pr(K|EDC data) is referred to as the posterior probability of the model class (Fig. 1b). Note that Pr(K|EDC data) is derived by integrating out “temperature” in Pr(A), since Pr(A) = Pr(EDC data, “temperature”|K) Pr(K) holds.

Bayesian analysis of EDC

In the EDC analysis (Fig. 1), the posterior probability distribution for the parameter set was formulated by p ∝ exp(−nβ MSE)ϕ, where n, β, ϕ and MSE are the number of data points constructing EDC, a hyper-parameter (inverse temperature), the prior probability distribution, and the mean square error, respectively. The MSE for each K is defined by a difference between EDC data and sum of all Lorentzian functions. The function ϕ was set as the continuous uniform distribution whose support is [0, 0.1] (eV) for the peak width, [−0.5, −0.3] (eV) for the peak position and [0, 1] (a.u.) for the peak intensity.
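The posterior p ∝ exp(−nβ MSE)ϕ described above can be sketched for a K-peak Lorentzian model as follows. This is a hedged illustration, not the authors' code: the function names are hypothetical, the Lorentzian is parametrized with `width` as the half width at half maximum (an assumption, since the text does not fix the convention), and a width of exactly zero is treated as outside the support to avoid a division by zero. The prior ranges are the uniform supports quoted in the text.

```python
import math

# Supports of the uniform priors quoted in the text.
WIDTH_RANGE = (0.0, 0.1)       # eV
POSITION_RANGE = (-0.5, -0.3)  # eV
INTENSITY_RANGE = (0.0, 1.0)   # a.u.


def lorentzian_sum(omega, peaks):
    """Sum of Lorentzians; `peaks` is a list of (width, position, intensity)."""
    return sum(a * w ** 2 / ((omega - c) ** 2 + w ** 2) for w, c, a in peaks)


def log_posterior(peaks, omegas, edc, beta):
    """Unnormalized log of p ∝ exp(-n * beta * MSE) * phi for one model class K."""
    for w, c, a in peaks:
        inside = (WIDTH_RANGE[0] < w <= WIDTH_RANGE[1]
                  and POSITION_RANGE[0] <= c <= POSITION_RANGE[1]
                  and INTENSITY_RANGE[0] <= a <= INTENSITY_RANGE[1])
        if not inside:
            return -math.inf  # zero prior density outside the support
    n = len(omegas)
    mse = sum((y - lorentzian_sum(x, peaks)) ** 2 for x, y in zip(omegas, edc)) / n
    return -n * beta * mse  # the uniform prior contributes only a constant


# Synthetic, noise-free EDC built from two hypothetical peaks (K = 2).
omegas = [-0.6 + 0.005 * i for i in range(81)]
true_peaks = [(0.02, -0.45, 0.8), (0.03, -0.35, 0.5)]
edc = [lorentzian_sum(x, true_peaks) for x in omegas]

at_truth = log_posterior(true_peaks, omegas, edc, beta=100.0)
shifted = log_posterior([(0.02, -0.44, 0.8), (0.03, -0.35, 0.5)], omegas, edc, beta=100.0)
outside = log_posterior([(0.2, -0.45, 0.8)], omegas, edc, beta=100.0)
```

A sampler would draw parameter sets in proportion to the exponential of this quantity; the least-squares solution is the point where it is largest.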

Formulation of semiparametric Bayesian analysis

In the analysis of 2D ARPES image (Figs. 3 and 4), we assumed that the intensity Y_ij of ARPES image at each pixel (k_i, ω_j) for i = 1, …, m and j = 1, …, n is given by $Y_{ij} = I(k_i,\omega_j;w_0,E_k) + \xi_{ij}$, where I is the intensity function defined by an equation in Fig. 2d, E_k is the analytical form of the bare-band dispersion, and w₀ is the set of other elements including parameters of bare-band dispersion, the self-energy, the matrix elements, and the “background.” The random variable ξ_ij is observation noise subject to the Gaussian distribution whose mean and variance are 0 and β₀⁻¹ > 0, respectively. In other words, Y_ij is assumed to be subject to the conditional probability density function

$$p(y_{ij}\,|\,k_i,\omega_j;w,E_s,\beta) := \sqrt{\frac{\beta}{2\pi}}\,\exp\left(-\frac{\beta}{2}\left(y_{ij} - I(k_i,\omega_j;w,E_s)\right)^2\right)$$

(2)

with w = w₀, E_s = E_k, and β = β₀. Since w₀, E_k and β₀ are unknown in practice and should be estimated, we treat them as random elements w, E_s and β subject to the posterior probability density function

$$p\left(w\,|\,D^{mn},E_s,\beta\right) = \frac{\phi(w)}{Z(E_s,\beta)}\prod_{i=1}^{m}\prod_{j=1}^{n} p(Y_{ij}\,|\,k_i,\omega_j;w,E_s,\beta)$$

(3)

where D^mn = {Y_ij, k_i, ω_j} is a data set of ARPES image, ϕ(w) an arbitrary prior probability density function, and $Z(E_s,\beta) := p(\{y_{ij}\}\,|\,\{k_i\},\{\omega_j\},E_s,\beta)$ the partition function. Note that w consists of 2n + 3m + 4 (or 2n + 3m + 3) scalar values for the gapped (or gapless) state: the binding energy at the Dirac point ω_DP, the band asymmetry α, a band parameter γ, the half-width of band gap Δ (Δ = 0 for the gapless state), the imaginary part of self-energy {ImΣ(ω_j)}, the matrix elements {M₊₁(k_i), M₋₁(k_i), M_B(k_i)}, and the “background” {B(ω_j)}. The function ϕ was set as follows: the exponential distribution whose mean is 10 (eV·Å²) for α, 4 (eV·Å) for γ, 0.1 (eV) for |ImΣ| and 0.2 (a.u.) for M_s(k), the continuous uniform distribution whose support is [0, 0.5] (eV) for ω_DP, [0, 0.25] (eV) for Δ, [0, 1] (a.u.) for M_B(k), and [0, 10] (a.u.) for B(ω).
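The parameter count 2n + 3m + 4 can be checked against the 111 × 111 mesh quoted in the Results. The short sketch below (with a hypothetical helper name) tallies the scalar band parameters, the two ω-indexed vectors, and the three k-indexed vectors.

```python
def count_parameters(n_energy, n_momentum, gapped):
    """Number of scalar values in w for an m x n ARPES image."""
    band = 4 if gapped else 3           # omega_DP, alpha, gamma (+ Delta if gapped)
    energy_vectors = 2 * n_energy       # Im Sigma(omega_j) and B(omega_j)
    momentum_vectors = 3 * n_momentum   # M_{+1}(k_i), M_{-1}(k_i), M_B(k_i)
    return band + energy_vectors + momentum_vectors


gapped_count = count_parameters(111, 111, gapped=True)
gapless_count = count_parameters(111, 111, gapped=False)
```

With n = m = 111 this reproduces the 559 (gapped) and 558 (gapless) scalar variables stated in the Results.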

We should also mention that $p\left(w\,|\,D^{mn},E_s,\beta\right) \propto f\left(w;E_s,\beta\right)$ holds for the function

$$f\left(w;E_s,\beta\right) := \phi\left(w\right)\exp\left(-\frac{nm\beta}{2}\,\mathrm{MSE}\left(w;E_s\right)\right),$$

(4)

with the MSE function

$$\mathrm{MSE}\left(w;E_s\right) := \frac{1}{nm}\sum_{i=1}^{m}\sum_{j=1}^{n}\left(Y_{ij} - I\left(k_i,\omega_j;w,E_s\right)\right)^2.$$

(5)

The Bayesian analysis treats the statistical ensemble of w subject to $p\left(w\,|\,D^{mn},E_s,\beta\right)$ as an extension of the least-squares method. The mean and standard deviation of $p\left(w\,|\,D^{mn},E_s,\beta\right)$ are adopted as the estimator and its error bar, respectively.
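Equations (4) and (5) translate directly into a few lines of code. In the sketch below, `intensity` is any callable standing in for I(k, ω; w, E_s); the concrete form of I is given in Fig. 2d and is not reproduced here, so the toy model used in the usage lines is purely a placeholder assumption.

```python
def mse(image, ks, omegas, intensity):
    """Eq. (5): mean squared residual over the m x n pixels of the image."""
    m, n = len(ks), len(omegas)
    total = sum((image[i][j] - intensity(ks[i], omegas[j])) ** 2
                for i in range(m) for j in range(n))
    return total / (n * m)


def log_f(image, ks, omegas, intensity, beta, log_prior=0.0):
    """Logarithm of Eq. (4): log phi(w) - (n * m * beta / 2) * MSE(w; E_s)."""
    n_pixels = len(ks) * len(omegas)
    return log_prior - 0.5 * n_pixels * beta * mse(image, ks, omegas, intensity)


# Toy usage: a placeholder model and an "image" offset from it by 0.5 everywhere.
ks = [0.0, 0.1]
omegas = [-0.1, 0.0, 0.1]


def toy_model(k, omega):
    return k + omega  # placeholder only, not the intensity function of Fig. 2d


image = [[toy_model(k, w) + 0.5 for w in omegas] for k in ks]
toy_mse = mse(image, ks, omegas, toy_model)
toy_log_f = log_f(image, ks, omegas, toy_model, beta=2.0)
```

Working with the logarithm of f avoids underflow, since nmβ·MSE/2 can be large for an image with many pixels.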

We also estimated E_s and β by treating them as random elements subject to the conditional probability distribution function

$$p\left(E_s,\beta\,|\,D^{mn}\right) = \frac{Z\left(E_s,\beta\right)}{\sum_{\{E_s\}}\int Z\left(E_s,\beta\right)d\beta}$$

(6)

where {E_s} is a collection of candidate forms of E_s. Note that this equation is derived from Bayes’ formula such that p(E_s, β) is a uniform distribution. In particular, E_s and β that maximize $p\left(E_s,\beta\,|\,D^{mn}\right)$ are adopted as estimators. This type of estimator is known as the empirical Bayes estimator^37,38,39. We also quantify the uncertainty of each E_s by the marginal probability

$$p\left(E_s\,|\,D^{mn}\right) = \int p\left(E_s,\beta\,|\,D^{mn}\right)d\beta,$$

(7)

as shown in the inset of Fig. 3a (see also Fig. 1b).
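Equations (6) and (7) can be sketched numerically once log Z(E_s, β) is known on a grid of β for each candidate band model. The code below is an illustration under stated assumptions: the integral over β is approximated with the trapezoidal rule (the text does not specify the quadrature), the model labels are hypothetical, and the log Z values are made up for the usage example.

```python
import math


def model_class_posterior(betas, log_z_by_model):
    """Eqs. (6)-(7): integrate Z(E_s, beta) over beta, then normalize over {E_s}."""
    # Subtract the global maximum so that exponentiation cannot overflow.
    shift = max(max(values) for values in log_z_by_model.values())
    integrals = {}
    for name, log_z in log_z_by_model.items():
        z = [math.exp(v - shift) for v in log_z]
        # Trapezoidal rule on a possibly non-uniform beta grid.
        integrals[name] = sum(0.5 * (z[i] + z[i + 1]) * (betas[i + 1] - betas[i])
                              for i in range(len(betas) - 1))
    total = sum(integrals.values())
    return {name: value / total for name, value in integrals.items()}


# Made-up log Z values: the second model is a factor of 3 less likely at every beta.
betas = [0.0, 1.0, 2.0]
log_z_gapped = [0.0, 2.0, 1.0]
log_z_gapless = [v - math.log(3.0) for v in log_z_gapped]
posterior = model_class_posterior(betas, {"gapped": log_z_gapped,
                                          "gapless": log_z_gapless})
```

Because only ratios of Z enter, any common offset in log Z cancels, which is why the shift by the global maximum leaves the result unchanged.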

Algorithm of semiparametric Bayesian analysis

The computation of $p\left(w\,|\,D^{mn},E_s,\beta\right)$ was performed by the exchange Monte Carlo method^40,41 (see also Supplementary Table 1), where β is discretized as 128 points consisting of 0 and 127 logarithmically spaced points in the interval [1.5 × 10⁻¹⁰, 1.5 × 10²]. The total Monte Carlo sweeps were 10,000 after the burn-in, where the obtained sequence $\{w_l^t\}$ for t = 1, …, 10,000 and l = 1, …, 128 is regarded as a statistical ensemble of w subject to $p\left(w\,|\,D^{mn},E_s,\beta_l\right)$. Figures 1c and 3a–d are the density scatter plots of $\{w_l^t\}$ at the β that maximizes $Z\left(E_s,\beta\right)$.
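Two ingredients of the exchange Monte Carlo setup described above can be sketched compactly: the β ladder and the acceptance probability for exchanging states between two replicas. The ladder follows the numbers in the text; the swap rule is the standard Metropolis criterion for replica exchange applied to targets proportional to ϕ(w) exp(−nmβ MSE/2), and is offered as an assumption about the implementation, whose details are in the Supplementary Table rather than the main text. Function names are hypothetical.

```python
import math


def beta_ladder(n_log=127, lo=1.5e-10, hi=1.5e2):
    """beta = 0 followed by n_log logarithmically spaced points in [lo, hi]."""
    step = (math.log10(hi) - math.log10(lo)) / (n_log - 1)
    return [0.0] + [10 ** (math.log10(lo) + i * step) for i in range(n_log)]


def swap_probability(beta_a, beta_b, mse_a, mse_b, n_pixels):
    """Acceptance probability for exchanging the states held at beta_a and beta_b.

    mse_a and mse_b are the MSE values of the states currently at beta_a and beta_b.
    """
    log_ratio = 0.5 * n_pixels * (beta_b - beta_a) * (mse_b - mse_a)
    return 1.0 if log_ratio >= 0 else math.exp(log_ratio)


ladder = beta_ladder()

# A colder replica (larger beta) that already holds the better fit rarely swaps...
unlikely = swap_probability(1.0, 2.0, mse_a=0.5, mse_b=0.3, n_pixels=100)
# ...whereas a better fit found at the hotter replica is always passed down.
certain = swap_probability(1.0, 2.0, mse_a=0.3, mse_b=0.5, n_pixels=100)
```

The β = 0 replica samples the prior alone, which lets the ladder explore the whole parameter space while the large-β replicas refine the fit.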

We also calculate $p\left(E_s,\beta\,|\,D^{mn}\right)$ via the bridge sampling^42,43, as shown by

$$Z\left(E_s,\beta_l\right) = \prod_{l'=1}^{l-1}\frac{Z\left(E_s,\beta_{l'+1}\right)}{Z\left(E_s,\beta_{l'}\right)} = \prod_{l'=1}^{l-1}\left\langle\exp\left(-\frac{nm}{2}\left(\beta_{l'+1} - \beta_{l'}\right)\mathrm{MSE}\left(w;E_s\right)\right)\right\rangle_{\beta_{l'}},$$

(8)

where $\langle Q\rangle_\beta$ denotes the average of an arbitrary quantity Q over $p\left(w\,|\,D^{mn},E_s,\beta\right)$ and is approximated by the sample mean of the obtained sequence $\{Q_l^t\}$.
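Equation (8) can be evaluated from the MSE values recorded at each β during sampling. The sketch below accumulates log Z along the ladder, replacing each ensemble average by a sample mean computed in log space for numerical stability. The function name and the input layout (`mse_samples[l]` holding the MSE values of the states sampled at `betas[l]`) are assumptions made for illustration.

```python
import math


def log_partition_function(betas, mse_samples, n_pixels):
    """Eq. (8): log Z(E_s, beta_l) for every l, relative to the first beta."""
    log_z = [0.0]
    for l in range(len(betas) - 1):
        exponents = [-0.5 * n_pixels * (betas[l + 1] - betas[l]) * v
                     for v in mse_samples[l]]
        shift = max(exponents)  # log-mean-exp with the maximum factored out
        mean = sum(math.exp(e - shift) for e in exponents) / len(exponents)
        log_z.append(log_z[-1] + shift + math.log(mean))
    return log_z


# Sanity check with a constant MSE of 0.2, where the averages are trivial.
betas = [0.0, 0.5, 1.0]
mse_samples = [[0.2, 0.2], [0.2, 0.2], [0.2, 0.2]]
log_z = log_partition_function(betas, mse_samples, n_pixels=10)
```

For a constant MSE the result reduces to −(nm/2)·β·MSE at each ladder point, which gives a quick check that the accumulation is wired correctly before it is applied to real samples.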

Data availability

The data and information within this paper are available from the corresponding authors upon request.

Code availability

The computer codes to generate the results are available from the corresponding authors upon request.

References

1. Bogoljubov, N. N. On a new method in the theory of superconductivity. Nuovo Cim. 7, 794–805 (1958).
2. Campuzano, J. C. et al. Direct observation of particle-hole mixing in the superconducting state by angle-resolved photoemission. Phys. Rev. B 53, R14737 (1996).
3. Matsui, H. et al. BCS-like Bogoliubov quasiparticles in high-T_c superconductors observed by angle-resolved photoemission spectroscopy. Phys. Rev. Lett. 90, 217002 (2003).
4. Valla, T., Fedorov, A. V., Johnson, P. D. & Hulbert, S. L. Many-body effects in angle-resolved photoemission: quasiparticle energy and lifetime of a Mo(110) surface state. Phys. Rev. Lett. 83, 2085 (1999).
5. Hengsberger, M., Purdie, D., Segovia, P., Garnier, M. & Baer, Y. Photoemission study of a strongly coupled electron-phonon system. Phys. Rev. Lett. 83, 592 (1999).
6. Lanzara, A. et al. Evidence for ubiquitous strong electron–phonon coupling in high-temperature superconductors. Nature 412, 510–514 (2001).
7. Bogdanov, P. V. et al. Evidence for an energy scale for quasiparticle dispersion in Bi₂Sr₂CaCu₂O₈. Phys. Rev. Lett. 85, 2581 (2000).
8. Meevasana, W. et al. Extracting the spectral function of the cuprates by a full two-dimensional analysis: Angle-resolved photoemission spectra of Bi₂Sr₂CuO₆. Phys. Rev. B 77, 104506 (2008).
9. Johnson, P. D. et al. Doping and temperature dependence of the mass enhancement observed in the cuprate Bi₂Sr₂CaCu₂O_{8+δ}. Phys. Rev. Lett. 87, 177007 (2001).
10. Kordyuk, A. A. et al. Bare electron dispersion from experiment: Self-consistent self-energy analysis of photoemission data. Phys. Rev. B 71, 214513 (2005).
11. Bostwick, A. et al. Observation of plasmarons in quasi-freestanding doped graphene. Science 328, 999–1002 (2010).
12. Zhou, S. Y. et al. Substrate-induced bandgap opening in epitaxial graphene. Nat. Mater. 6, 770–775 (2007).
13. Damascelli, A., Hussain, Z. & Shen, Z. X. Angle-resolved photoemission studies of the cuprate superconductors. Rev. Mod. Phys. 75, 473 (2003).
14. Sato, T. et al. Unexpected mass acquisition of Dirac fermions at the quantum phase transition of a topological insulator. Nat. Phys. 7, 840–844 (2011).
15. Xu, S. Y. et al. Topological phase transition and texture inversion in a tunable topological insulator. Science 332, 560–564 (2011).
16. Souma, S. et al. Spin polarization of gapped Dirac surface states near the topological phase transition in TlBi(S_{1-x}Se_x)₂. Phys. Rev. Lett. 109, 186804 (2012).
17. Li, Z. & Carbotte, J. P. Phonon structure in dispersion curves and density of states of massive Dirac fermions. Phys. Rev. B 88, 045417 (2013).
18. Habe, T. & Asano, Y. Gapped energy spectra around the Dirac node at the surface of a three-dimensional topological insulator in the presence of the time-reversal symmetry. J. Phys. Soc. Jpn. 82, 064704 (2013).
19. Sánchez-Barriga, J. et al. Nonmagnetic band gap at the Dirac point of the magnetic topological insulator (Bi_{1-x}Mn_x)₂Se₃. Nat. Commun. 7, 10559 (2016).
20. Brahlek, M. et al. Disorder-driven topological phase transition in Bi₂Se₃ films. Phys. Rev. B 94, 165104 (2016).
21. Tanaka, I. Influence of rotational symmetry breaking on topological insulators. Ann. Phys. 396, 71–77 (2018).
22. Wang, J. R., Liu, G. Z. & Zhang, C. J. Excitonic pairing and insulating transition in two-dimensional semi-Dirac semimetals. Phys. Rev. B 95, 075129 (2017).
23. Zhang, W. et al. Topological phase transition with nanoscale inhomogeneity in (Bi_{1-x}In_x)₂Se₃. Nano Lett. 18, 2677–2682 (2018).
24. Qi, J., Liu, H., Jiang, H. & Xie, X. C. Dephasing effects in topological insulators. Front. Phys. 14, 43403 (2019).
25. Hasan, M. Z. & Kane, C. L. Colloquium: topological insulators. Rev. Mod. Phys. 82, 3045 (2010).
26. Qi, X.-L. & Zhang, S.-C. Topological insulators and superconductors. Rev. Mod. Phys. 83, 1057 (2011).
27. Ando, Y. Topological insulator materials. J. Phys. Soc. Jpn. 82, 102001 (2013).
28. Li, H. et al. Dirac surface states in intrinsic magnetic topological insulators EuSn₂As₂ and MnBi_{2n}Te_{3n+1}. Phys. Rev. X 9, 041039 (2019).
29. Chen, B. et al. Intrinsic magnetic topological insulator phases in the Sb doped MnBi₂Te₄ bulks and thin flakes. Nat. Commun. 10, 4469 (2019).
30. Gong, Y. et al. Experimental realization of an intrinsic magnetic topological insulator. Chin. Phys. Lett. 36, 076801 (2019).
31. Otrokov, M. M. et al. Prediction and observation of an antiferromagnetic topological insulator. Nature 576, 416 (2019).
32. Hao, Y. J. et al. Gapless surface Dirac cone in antiferromagnetic topological insulator MnBi₂Te₄. Phys. Rev. X 9, 041038 (2019).
33. Chen, Y. J. et al. Topological electronic structure and its temperature evolution in antiferromagnetic topological insulator MnBi₂Te₄. Phys. Rev. X 9, 041040 (2019).
34. Zhang, H. et al. Topological insulators in Bi₂Se₃, Bi₂Te₃, and Sb₂Te₃ with a single Dirac cone on the surface. Nat. Phys. 5, 438–442 (2009).
35. Lu, H.-Z., Shan, W.-Y., Yao, W., Niu, Q. & Shen, S.-Q. Massive Dirac fermions and spin physics in an ultrathin film of topological insulator. Phys. Rev. B 81, 115407 (2010).
36. Zhang, Y. et al. Crossover of the three-dimensional topological insulator Bi₂Se₃ to the two-dimensional limit. Nat. Phys. 6, 584–588 (2010).
37. MacKay, D. J. Bayesian interpolation. Neural Comput. 4, 415–447 (1992).
38. Bishop, C. M. Pattern Recognition and Machine Learning (Springer, 2006).
39. Tokuda, S. et al. Simultaneous estimation of noise variance and number of peaks in Bayesian spectral deconvolution. J. Phys. Soc. Jpn. 86, 024001 (2017).
40. Geyer, C. J. Markov Chain Monte Carlo Maximum Likelihood. In Computing Science and Statistics: Proc. 23rd Symposium on the Interface 156–163 (American Statistical Association, 1991).
41. Hukushima, K. & Nemoto, K. Exchange Monte Carlo method and application to spin glass simulations. J. Phys. Soc. Jpn. 65, 1604–1608 (1996).
42. Meng, X.-L. & Wong, W. H. Simulating ratios of normalizing constants via a simple identity: a theoretical exploration. Stat. Sin. 6, 831–860 (1996).
43. Gelman, A. & Meng, X.-L. Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Stat. Sci. 13, 163–185 (1998).

Acknowledgements

This work was supported by JST-CREST (no. JPMJCR18T1), Grant-in-Aid for Scientific Research on Innovative Areas “Topological Materials Science” (JSPS KAKENHI Grant number JP15H05853), Grant-in-Aid for Scientific Research on Innovative Areas “Discrete Geometric Analysis for Materials Design” (JSPS KAKENHI Grant number JP18H04472), and Grant-in-Aid for Early-Career Scientists (JSPS KAKENHI Grant number 20K19889). The work in Cologne was funded by the Deutsche Forschungsgemeinschaft (German Research Foundation) - Project number 277146847 - CRC 1238 (Subproject A04).

Author information

Authors and Affiliations

Research Institute for Information Technology, Kyushu University, Kasuga, Japan
Satoru Tokuda
Mathematics for Advanced Materials-OIL, AIST, Sendai, Japan
Satoru Tokuda & Takeshi Nakanishi
Center for Spintronics Research Network, Tohoku University, Sendai, Japan
Seigo Souma, Takashi Takahashi & Takafumi Sato
Advanced Institute for Materials Research (WPI-AIMR), Tohoku University, Sendai, Japan
Seigo Souma, Takashi Takahashi & Takafumi Sato
Department of Physics, Kyoto Sangyo University, Kyoto, Japan
Kouji Segawa
Department of Physics, Tohoku University, Sendai, Japan
Takashi Takahashi & Takafumi Sato
Institute of Physics II, University of Cologne, Köln, Germany
Yoichi Ando

Contributions

The work was planned and carried forward through discussion among S.T., S.S., T.N., T.T. and T.S. S.T. carried out the semiparametric Bayesian analysis. K.S. and Y.A. carried out the sample growth. S.S. and T.S. performed the ARPES measurements. S.T. and T.S. finalized the manuscript with input from all the authors.

Corresponding authors

Correspondence to Satoru Tokuda or Takafumi Sato.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Communications Physics thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Tokuda, S., Souma, S., Segawa, K. et al. Unveiling quasiparticle dynamics of topological insulators through Bayesian modelling. Commun Phys 4, 170 (2021). https://doi.org/10.1038/s42005-021-00673-6

Received: 12 January 2021
Accepted: 07 July 2021
Published: 27 July 2021
DOI: https://doi.org/10.1038/s42005-021-00673-6
