Introduction

Maxwell’s equations are in the foundation of modern optical and electrical technologies. To apply Maxwell’s equations in conventional matter, it is necessary to specify constituent relations, describing the polarization Pc(E) and magnetization Mc(B) as a function of the applied electric and magnetic fields, respectively. Soon after the theoretical prediction1,2,3 and experimental discovery of two-dimensional (2D) and three-dimensional (3D) topological insulators (TIs)4,5, it has been recognized that the constituent relations in this new phase of quantum matter contain additional cross-terms Pt(B) and Mt(E) when time-reversal symmetry is weakly broken6.

Here N is an integer and α≈1/137 is the fine structure constant. The derivation of equation (1) is based on the topological field theory of time-reversal invariant insulators6. Its intriguing consequences are the universal Faraday rotation angle =α, when a linearly polarized electromagnetic radiation passes through the top and bottom topological surfaces6,7, and magnetic monopole images, induced by electrical charges in proximity to a topological surface8. However, experimental verification of these topological magnetoelectric effects (TMEs) has been lacking. As the modified Maxwell’s equations describing electrodynamics of TIs are applicable in the low-energy limit, optical experiments should be performed at terahertz (THz) or sub-THz frequencies9,10,11,12. Qualitatively, equation (1) applied to the magnetic and electric fields of the primary THz radiation results in a perpendicular polarized secondary THz radiation. The sum of the primary and secondary radiation can be viewed as the rotation of the polarization plane, that is, as the Faraday effect. We would like to note that the quantum Faraday effect and the TME are basically different manifestations of the same axion physics13.

In real samples, the TME may be screened by nontopological contributions13,14,15. In fact, quantized Faraday rotation has been detected in 2D electron gas16 and graphene17 in the quantum Hall effect (QHE) regime. However, in both experiments the Faraday rotation takes a value θF=−4α/(1+nsub), that is, it depends on the refractive index of the substrate nsub and hence is not fundamental.

Here we report on the observation of the universal Faraday rotation angle equal to the fine structure constant α. Strained HgTe layers grown on CdTe, that are investigated in the present work, are shown to be a 3D TI18 with surface-dominated charge transport19 that was observed in THz experiments as well9,11,20. To eliminate the material details, we perform measurements under antireflection conditions, such that the transmission through the CdTe substrate is approaching 100% (ref. 21).

Results

THz Faraday effect in 2D systems

The observed Faraday rotation angle =α (for N=0) comes from two spatially separated topological surfaces in a 3D TI. This corresponds to the half-quantized Hall conductivity e2/(2h) per surface or, equivalently, to the TME occurring at each surface separately. Therefore, the observed Faraday effect 2(N+1/2)α is intimately related to the TME, which distinguishes qualitatively our 3D TI from 2D or quasi-2D materials. There is also a quantitative difference. Even without the substrate (nsub=1), the Faraday rotation in graphene would be quantized as 4(N+1/2)α, including the spin and valley degeneracies. The minimum Faraday rotation angle is then =2α (for N=0)17. For a conventional 2D electron gas, such as hosted in GaAs/AlGaAs heterostructures, the quantization of the Faraday angle is expected to be 2, where the factor of 2 comes from the equal contributions of the up- and down-spin subsystems, which independently exhibit the integer QHE. This is because in GaAs/AlGaAs heterostructures, the Zeeman splitting for magnetic fields below is negligible compared to the THz photon energy. Therefore, the minimum Faraday rotation angle would be also =2α (for N=1)16, which is twice larger than our result.

Sample details

The strained HgTe film is a 58 nm thick HgTe layer embedded between two Cd0.7Hg0.3Te layers (Fig. 1a). The Cd0.7Hg0.3Te layers have a thickness of 51 nm (lower layer) and 11 nm (top/cap layer), respectively. The purpose of these layers is to provide the identical crystalline interface for top and bottom surface of the HgTe films as well as to protect the HgTe from oxidization and adsorption. This leads to an increase in carrier mobility with a simultaneous decrease in carrier density compared to uncaped samples18. The transport characterization on a standard Hall bar sample shows a carrier density at 0 V gate of 1.7 × 1011 cm−2 and a carrier mobility of 2.2 × 105 cm2 V−1 s−1. The optical measurements are carried out on a sample fitted with a 110 nm thick multilayer insulator of SiO2/Si3N4 and a 4 nm thick Ru film. The Ru film (oxidized in the air) is used as a semitransparent top-gate electrode22. The gate leads to 15% suppression of the transmission signal, which can be taken into account as field-independent contribution to the conductivity. The properties of the gate material have been investigated in a separate experiment.

Figure 1: THz magnetooptics of a strained HgTe 3D TI.
figure 1

(a) A scheme of the experimental set-up (only one arm of the Mach–Zehnder interferometer is shown). The strained HgTe layer, which is a 3D TI, is sandwiched between (Cd,Hg)Te protecting layers. The top-gate electrode, consisting of a SiO2/Si3N4 multilayer insulator and a thin conducting Ru film, is semitransparent at THz frequencies. The THz radiation (v=0.35 THz) is linearly polarized, and the Faraday rotation (θF) and ellipticity (ηF) are measured as a function of the magnetic field strength B for different gate voltages UG. (b,c) Transmission spectra in the parallel and crossed polarizer configurations, respectively. The gate voltage is colour-coded, and the experimental curves are shifted for clarity. Notations in b,c: e denotes the CR of the topological surface states of electron character, s1 and s2 denote extra resonances with opposite phase to that of the e-CR as discussed in the text.

THz spectroscopy

The transmittance experiments at THz frequencies (0.1 THz<v<1 THz) are carried out in a Mach–Zehnder interferometer arrangement23,24, allowing measurement of the amplitude and the phase shift of the electromagnetic radiation in a geometry with controlled polarization (Fig. 1a). The monochromatic THz radiation is provided by a backward-wave oscillator. The THz power on the sample is in between 10 and 100 μW with the focal spot of 0.2 cm2. Using wire grid polarizers, the complex transmission coefficient t=e is obtained both in parallel tp (Fig. 1b) and cross tc (Fig. 1c) polarization geometries, providing full information about the transmitted light. External magnetic fields are applied using a split-coil superconducting magnet. The experiments are carried out in Faraday geometry, that is, with B applied parallel to the propagation direction of the THz radiation. The ac conductivity tensor at THz angular frequency ω=2πν is obtained from the experimental data by inverting the Berreman equations25 for the complex transmission coefficient through a thin conducting film on an insulating substrate. The explicit expressions used in the calculations are given in Methods section.

In general case, the light propagating along the z direction can be characterized by the orthogonal x and y components of the electric and magnetic fields, which can be written in the form of a 4D vector V. The interconnection between vectors V1 and V2, corresponding to different points in space separated by a distance , is given by . Here is a 4 × 4 transfer matrix. For an insulating substrate of thickness and dielectric constant ɛ, this is the identity matrix provided is an integer. We find in a separate experiment on a bare CdTe substrate that this condition is fulfilled for v≈0.35 THz, and all the measurements presented here are performed at this frequency to minimize the contribution to the Faraday signal from the substrate. The corresponding photon energy of 1.4 meV is much smaller than the energy gap in strained HgTe (above 10 meV)18, and equations (1) are a good approximation.

For normal incidence, the fields across the conducting interface are connected by the Maxwell equation . Here the eiωt time dependence is assumed for all fields. As the wavelength of 856 μm for v=0.35 THz is much larger than the HgTe layer thickness, we use the limit of thin film, and the corresponding transfer matrix is determined by the diagonal (σxx) and Hall (σxy) components of the conductivity tensor . Within the Drude-like model, these components for one type of charge carriers can be written in the form13,26

Here Ωc is the cyclotron resonance (CR) frequency, σ0 is the dc conductivity, and τ is the scattering time. For classical conductors, the CR frequency is written as Ωc=eB/me, where me is the effective electron cyclotron mass.

The total transfer matrix relates vectors V on both sides of the sample and hence contains full information about the transmission and reflection coefficients. Thus, when is the identity matrix, the influence of the substrate is minimized, and the THz response is dominated by the ac transport properties of the HgTe layer, in accord with equations (2) and (3). The calculation of the complex transmission coefficients tp and tc based on the transfer matrix formalism as well as the exact form of the transfer matrices are presented in Methods section.

Magnetic field dependence of the THz transmission is dominated by a sharp CR of surface electrons (e) Ωce at (Fig. 1b,c). Below we demonstrate their Dirac-like character and that they are responsible for the universal Faraday rotation. Remarkably, the observation of the CR both in tp and tc indicates a high purity of our HgTe layer. The scattering time is significantly longer than the inverse THz frequency , and according to equations (2) and (3) the ac conductivity reveals a resonance-like behaviour σxx, σxy1/(ω2).

Further features are broad resonances at and at indicated in Fig. 1 as s1 and s2, respectively. The phase of the corresponding THz transmission coefficient φc in the vicinity of these resonances has the opposite sign with respect to that of the e-CR. Remarkably, the s1 and s2 resonances disappear with applying positive gate voltage (Figs 1 and 2b). We associate them with either interband Landau-level transitions or thermally activated states as discussed below.

Figure 2: Charge carriers in strained HgTe.
figure 2

(a) The band structure of the Cd0.7Hg0.3Te/HgTe heterostructure close to the Γ-point for UG=1.9 V. The chemical potential (green line) crosses the Dirac-like surface state (red solid line) in the band gap corresponding to the electron CR Ωce. The dashed red line shows the linear dispersion of the 2D surface state without hybridization with the heavy-hole band. (b) 2D dc conductivity σ0 of different charge carriers (e and Σs=s1+s2), obtained by Drude-like fits to equations (2) and (3) of the magnetooptical spectra. The dimensionless values are given relative to the impedance of free space . The inset shows the e-CR in terms of Ωce/B as a function of the gate voltage UG.

Band structure analysis

To understand the origin of the experimentally observed resonances, we analyse the band structure of tensile strained Cd0.7Hg0.3Te/HgTe layer as shown in Fig. 2a. It is obtained similar to ref. 19 within the tight binding approximation of the 6 × 6—Kane Hamiltonian27,28. Due to reduced point symmetry at the boundary between the Cd0.7Hg0.3Te and HgTe layers, an additional interface potential is allowed in the Hamiltonian29. This potential is used to shift the Dirac point closer to the valence band edge, so that the tight binding results are in good agreement with recent angle-resolved photoemission spectroscopy (ARPES) experiments18,30 and ab initio calculations31 on HgTe. Final position of the Dirac point is at energy around −40 meV (that is, it is burried in the the valence band) in agreement with18,30,31. The Dirac-like surface states are located in the band gap between the light-hole (LH, conduction) and heavy-hole (HH, valence) subbands (see the red line in Fig. 2a), and they are not perfectly linear due to the hybridization with the heavy-hole band. The dashed red line in Fig. 2a shows the linear dispersion of the 2D surface state without hybridization with the heavy-hole band. This is a hypothetical curve, since in the realistic material the hybridization with the heavy-hole band is non-zero. The camel back of the heavy-hole band originates from coupling of this band to the electron-like valence band and is therefore a hallmark of the inverted band structure of HgTe. In accordance with previous transport data, the chemical potential crosses the topological surface states for a large range of gate voltages19. The total electron density in Fig. 2a is , representing the experimental situation at . For simplicity, we assume here the same density at the top and bottom surfaces. Using the general formula for a quasi-classical CR32 , where E(k) is the energy dispersion, B is the magnetic field and A is the area enclosed by the wave vector k, we calculate for the topological surface state .

Quantized THz Hall effect

Experimentally, simultaneous fit of the real and imaginary parts of tp and tc allows the extraction of all transport characteristics, that is, conductivity, charge carrier density, scattering time and CR frequency21. The inset of Fig. 2b shows experimentally determined electron CR as a function of gate voltage, which perfectly agrees with the theoretical value for the topological Dirac-like surface states. Since only surface states are observed in transport experiments on the similar structures19, a possible explanation of the appearance of additional resonances is interband Landau-level transitions between the HH bulk bands and topological surface states. Such transitions are generally allowed as can be shown using the Kubo formula. Another possibility would be thermally activated transport between the camel back of the HH bulk band and the surface states. This is generally possible since the THz field may well induce heating of the carriers, resulting in a higher effective temperature compared to that of the lattice.

From the obtained scattering time and the CR positions in the magnetooptical spectra of Fig. 1b,c, one can calculate the mobility μ=τΩc/B. The surface states demonstrate high mobility , which agrees with the dc transport data. Since the e-CR and s1,s2 resonances occur at different magnetic fields, their contributions to the ac transport can be clearly separated, as presented in Fig. 2b. The striking feature of this plot is that the ac conductivity of the surface states dominates at large gate voltages. In what follows, we concentrate therefore on , while remaining weak contribution from the interband Landau-level transitions/thermally activated transitions are subtracted as explained in Supplementary Fig. 1.

Figure 3a demonstrates the real part of the Hall conductivity σxy. The overall behaviour is provided by the high-field tail of the classical Drude model, that is, equation (3), resulting in a rapid suppression of σxy with growing magnetic field. In addition to the classical behaviour, regular oscillations in ∂σxy/∂B can be recognized, which are linear in inverse magnetic field (Fig. 3b). The slope of the linear behaviour changes with gate voltage, reflecting gate dependence of the electron density per surface. These QHE oscillations are extrapolated to 1/2, indicating Dirac character of the surface electrons33. The oscillations of ∂σxy/∂B in Fig. 3a are superimposed by the 1/B tail of the classical electron CR and therefore are not well pronounced. The visibility can be significantly improved by inserting the sample in a Fabry–Pérot resonator, as we have previously demonstrated for a similar structure11.

Figure 3: THz QHE of the surface states.
figure 3

(a) The real part of the THz Hall conductivity σxy of two surfaces in units of 2e2/h, obtained at (symbols). The vertical solid lines indicate the positions of the Hall plateaus, estimated from the maxima in ∂σxy/∂B. Theoretical calculations represented by the thin line are performed as explained in the text. The inset presents the same experimental and theoretical curves in the whole magnetic field range, including the surface carrier CR at 0.4 T. (b) The plateau number as a function of the inverse magnetic field for different gate voltages. The lines are linear fits to N=hna/eB+1/2, where na corresponds to the carrier density on a single surface.

In magnetic fields above 5 T, the Hall conductivity clearly shows a plateau close to σxy=e2/h, corresponding to a value (1/2)e2/h per surface (Fig. 3a). Another plateau close to (3/2)e2/h per surface is also recognizable at a magnetic field of 3 T. It is superimposed on the CR tail and therefore tilted. The steps in σxy loose their regularity in lower-magnetic fields, as can be qualitatively explained by a finite THz frequency ω in magnetooptical experiments. As mentioned above, the overall behaviour of σxy(ω) is provided by the classical curve of equation (3), and the real part of σxy can be approximated as σxy(ω)≈σ0Ωce/[(ω2)τ], which in the limit reduces to the expression σxy=ne/B, being a multiple of e2/h. In low magnetic fields, the CR frequency Ωce becomes comparable to the THz frequency ω, destroying the regularities in σxy(ω).

Since in strained HgTe, the Fermi level lies in the bulk band gap (see Fig. 2a and ref. 19), we attribute the observed THz QHE to the formation of the 2D Landau levels at the top and bottom surfaces of the HgTe layer (Fig. 1a). This interpretation is further substantiated by our theoretical analysis of the ac quantum Hall conductivity σxy(ω) calculated from the Kubo formula for both top and bottom surface states within the Dirac model13,34.

Our two-surface Dirac model describes well the surface carrier CR (the inset of Fig. 3a). The lengths of the theoretical Hall plateaus in the high magnetic field region (Fig. 3a) correlate correctly with the positions of the extrema in the derivative &∂;σxy/∂B. However, the model predicts much sharper transitions between the QHE plateaus, as observed in the experiment. One of two possible explanations is the heating of the surface carriers by the THz field, resulting in a higher effective temperature compared to that of the lattice. Such a heating can occur due to inefficient energy relaxation in the electronic system through the emission of LO phonons at low temperatures35. The best fit of our experimental data is obtained with T=25 K (Fig. 3a). Another explanation is based on spatial fluctuations of the surface carrier densities, which are likely to occur in our samples due to their large lateral sizes compared to the typical Hall bars used in the dc measurements. The experimental data of Fig. 3a can alternatively be well fitted assuming cold carriers (T=1.8 K) with density fluctuations within 10% relative to their nominal values. As the fits are nearly indistinguishable, we cannot quantitatively determine the contributions of both mechanisms leading to the smearing of the THz QHE plateaus.

The stronger the field, the closer the Hall conductivity to the quantized values expected for a two-surface Dirac system

where Na,b are the integer numbers of the highest occupied Landau levels at the top and bottom surfaces, with nanb being the corresponding carrier densities (Φ0=h/ is the magnetic flux quantum). Upon approaching the CR, the Hall conductivity deviates from the quantized values in equation (4) due to the predominance of the intraband transitions between the Landau levels. From the fitting procedure, we extract the nominal carrier densities and . The total surface carrier density na+nb agrees well with that obtained from Drude-like fits of magnetooptical spectra. Another fitting parameter is the classical (Drude) surface conductivity σa=σb≈50e2/h. Its large value indicates high-surface carrier mobility, insuring that the condition for the quantum Hall regime, , is met for B>1 T. Here is the characteristic Landau-level spacing for a Dirac system, τa,b are the scattering times of the top and bottom carriers, R0=h/(2e2) is the resistance quantum, and vF is the Fermi velocity of the Dirac surface states.

Quantized THz Faraday effect

Having established that the THz response of the topological surface states in high magnetic fields B>5 T is determined by the conductivity quantum G0=e2/h (Na=Nb=0), we turn to the central result of this work, the THz Faraday effect. Owing to the TME of equation (1), an oscillating electric Exeiωt (magnetic Hyeiωt) field of the linearly polarized THz radiation induces in a 3D TI an oscillating magnetic αHxeiωt (electric αEyeiωt) field. The generated, in such a way, secondary THz radiation is polarized perpendicular to the primary polarization and its amplitude is α times smaller. This can be viewed as a rotation of the initial polarization by an angle . Indeed, Fig. 4a clearly demonstrates that the Faraday angle in high magnetic fields is close to this fundamental value.

Figure 4: Quantized THz Faraday rotation of Dirac fermions.
figure 4

Faraday rotation (a) and Faraday ellipticity (b) in a 3D HgTe TI as a function of the external magnetic field for different gate voltages (colour-coded). The horizontal solid line in a indicates the universal Faraday rotation angle . (c) Gate voltage dependence of the Faraday rotation in a magnetic field of 5 T. The dashed lines in a,c provide the calculated value of the Faraday rotation with realistic parameters of the experiment (including the top-gate electode) and assuming the HgTe conductivity exactly equal to σxy=e2/h. The short-dashed line in c indicates the level. The error bars are estimated taking into account the accuracy of the original transmission data and the uncertainties due to subtraction of weak residual contribution (Supplementary Fig. 1).

We rigorously characterize the THz Faraday effect, and the Faraday ellipticity ηF is shown in Fig. 4b. It is relatively small < in high magnetic fields, but does not reach zero. This observation indicates that while the TME dominates, the interaction of TIs with THz radiation is not a completely dissipationless process in our samples. Remarkably, the universal value of the Faraday angle remains robust against the gate voltage. This is demonstrated in Fig. 4c, where α for .

Discussion

The observed terahertz Faraday rotation equal to the fine structure constant α=e2/2hc=e2μ0c/2h is a direct consequence of the TME, confirming axion electrodynamics of 3D topological insulators. We use monochromatic terahertz spectroscopy, providing complete amplitude and phase reconstruction, which can be applied to investigate topological phenomena in various systems, including graphene, 2D electron gas, layered superconductors and recently experimentally discovered Weyl semimetals36. Picoradian angle resolution can be achieved using a balanced detection scheme37, and the universal Faraday rotation in combination with the magnetic flux quantum Φ0=h/ and the conductivity quantum G0=e2/h are suggested14 to use for a metrological definition of the three basic physical constants, e, h and c (given that the vacuum permeability is equal exactly ).

Note added in proof: Okada et al. and Wu et al. reported on the quantized Faraday rotation in TIs independently to this work and around a similar time in arXiv. We note that Okada et al.38 observed only the trajectory towards the fine structure constant. Wu et al.39 used interface doping to reduce the carrier concentration and put the chemical potential in the bulk.

Methods

Experimental technique

Magnetooptical experiments in the THz frequency range (100 GHz<ν<1,000 GHz) have been carried out in a Mach–Zehnder interferometer arrangement23, which allows measurements of the amplitude and the phase shift in a geometry with controlled polarization of radiation.

Theoretical analysis of magnetooptical spectra

To analyse the experimental transmission spectra, we follow the formalism described by Berreman24,25. The THz light propagating along the z direction can be characterized by the (orthogonal) components of electric (Ex, Ey) and magnetic (Hx, Hy) fields, which may be combined in form of a four-component vector V=(Ex, Ey, Hx, Hy). The propagation of light between two points in space separated by a distance and characterized by vectors V1 and V2 can be described via 4 × 4 transfer matrix as . To provide a simple example, for an isotropic dielectric substrate the transfer matrix is simplified to:

Here Z= and is the wave vector. In the following, we assume μ=1. The Berreman procedure is in general not limited to the case of normal incidence. However, in such geometry the choice of tangential field components simplifies the treatment. Electric and magnetic fields across the interfaces are connected by the Maxwell equation . Here is the complex conductivity tensor of the material and the time dependence in form eiωt is assumed.

The full transfer matrix describes the transmission and reflection coefficients, which can be calculated using another basis. In this basis, the vector V consists of (i) the amplitude of the linearly polarized wave (Ex) propagating in the positive direction, (ii) the amplitude of the wave with the same polarization propagating in the negative direction, and (iii) and (iv) of two waves with perpendicular polarization (Ey). The propagation matrix in the new basis is , with

The present experiment is described by a linearly polarized incident wave and by two components of the transmitted (t) and reflected (r) waves, respectively. The equation connecting all waves is given by:

Here the tp and tc are the complex transmittance amplitudes within parallel and crossed polarizers geometry, rp and rc are respective reflectivity coefficients. The Faraday rotation θ and Faraday ellipticity η are obtained from the transmission amplitudes , and phase shifts φp, φc as

To interpret the experimental data, we use the ac conductivity tensor obtained in the classical (Drude) limit from the Kubo conductivity of topological surface states7,15. We note that for a 2D conducting film on an isotropic dielectric substrate, the complex transmission can be obtained analytically. For a substrate with permittivity ɛ, the final equations for the spectra in parallel (tp) and crossed (tc) polarizers are given by:

where

Here is equal to the substrate thickness, Z0≈377 Ohm is the impedance of free space, Z= is the relative impedance of the substrate and k= is the wave vector of the radiation in the substrate. The components of the conductivity tensor, σxx(ω), and σxy(ω), are given by equations (2) and (3). As can be clearly seen, equations (9) and (10) can be inverted analytically to get the solution for the complex conductivity thus avoiding the numerical procedure.

As has been discussed previously6, in case of a film on a substrate the universal value of the Faraday rotation angle should be modified by the refractive index of the substrate. We note that an infinite substrate has been assumed in these calculations. In the present case of finite substrate, exact transmission matrix formalism has been utilized thus automatically taking into account the influence of the substrate and of the Ru-gate. Moreover, monochromatic radiation has been used in the experiments and the frequency of this radiation has been selected close to a maximum of the Fabry–Pérot resonances in the substrate (). In this case, equations (11) and (12) can be simplified to

and the influence of the substrate is minimized.

Data availability

The data that support the findings of this study are available from the corresponding authors upon reasonable request.

Additional information

How to cite this article: Dziom, V. et al. Observation of the universal magnetoelectric effect in a 3D topological insulator. Nat. Commun. 8, 15197 doi: 10.1038/ncomms15197 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.