Machine learning the microscopic form of nematic order in twisted double-bilayer graphene

Sobral, João Augusto; Obernauer, Stefan; Turkel, Simon; Pasupathy, Abhay N.; Scheurer, Mathias S.

doi:10.1038/s41467-023-40684-1

Download PDF

Article
Open access
Published: 17 August 2023

Machine learning the microscopic form of nematic order in twisted double-bilayer graphene

Nature Communications volume 14, Article number: 5012 (2023) Cite this article

2056 Accesses
3 Citations
27 Altmetric
Metrics details

Subjects

Abstract

Modern scanning probe techniques, such as scanning tunneling microscopy, provide access to a large amount of data encoding the underlying physics of quantum matter. In this work, we show how convolutional neural networks can be used to learn effective theoretical models from scanning tunneling microscopy data on correlated moiré superlattices. Moiré systems are particularly well suited for this task as their increased lattice constant provides access to intra-unit-cell physics, while their tunability allows for the collection of high-dimensional data sets from a single sample. Using electronic nematic order in twisted double-bilayer graphene as an example, we show that incorporating correlations between the local density of states at different energies allows convolutional neural networks not only to learn the microscopic nematic order parameter, but also to distinguish it from heterostrain. These results demonstrate that neural networks are a powerful method for investigating the microscopic details of correlated phenomena in moiré systems and beyond.

Moiré nematic phase in twisted double bilayer graphene

Article 23 December 2021

Hyperbolic enhancement of photocurrent patterns in minimally twisted bilayer graphene

Article Open access 12 March 2021

Imaging moiré deformation and dynamics in twisted bilayer graphene

Article Open access 10 January 2022

Introduction

Driven by the impressive improvements in machine learning (ML) in the last couple of years, exploring its potential for quantum many-body physics has recently become the subject of intense research^1,2. For instance, ML provides powerful tools to solve inverse problems that occur frequently in physics^3,4,5,6: given a model, it is often straightforward with conventional many-body techniques to compute observables that can be measured experimentally, whereas the often needed inverse problem of extracting the model and underlying microscopic physics from observations is much more challenging and typically even formally ill-defined. A second example of a large class of applications of ML in physics is ML-assisted analysis of experiments, in particular of those yielding image-like data like scanning tunneling microscopy (STM)^7,8,9,10, photoemission¹¹, and others^{12,13,14,15,16,17,18}.

In the context of applying ML algorithms to data from imaging techniques like STM, van der Waals moiré superlattices^19,20 are particularly promising for three reasons: (i) they display a huge variety of correlated quantum-many-body phenomena, such as interaction-induced insulating phases²¹, magnetism²², superconductivity²³, electronic nematic order^24,25,26,27, which can also coexist microscopically^27,28. Despite intense research on these phenomena over several decades, e.g., in the pnictides or cuprates, their origin and relations are still the subject of ongoing debates. However, compared to these microscopic crystalline quantum materials, moiré superlattices are (ii) highly tunable; for instance, the density of carriers can be varied within a single sample just by applying a gate voltage (as opposed to chemical doping) and even the interactions can be tuned²⁹. This allows producing large data sets of measurements on a single sample, containing a lot of information on microscopic physics. This aspect, which is crucial for data-driven approaches, is further enhanced by (iii) the large moiré unit cells of these systems compared to that of microscopic crystals, increasing the relative spatial resolution of scanning probe techniques significantly. This enables experiments to probe the structure of the wave functions within the unit cell and thus provides access to microscopic physics compared to conventional quantum materials. For instance, in the extreme limit of only one degree of freedom (Wannier state or pixel) per unit cell, the broken rotational symmetry of the electron liquid—the defining property of electronic nematic order^30,31—is not visible as a consequence of translational symmetry and thus requires a careful analysis of the behavior around impurities³².

In this work, we explore these advantages of moiré superlattices for extracting or learning effective field-theoretical descriptions of their correlated many-body physics from STM data. This can be viewed as an inverse problem and is also conceptually related to the goal of Hamiltonian learning in quantum simulation^{33,34,35,36,37,38}, albeit in rather different regimes and based on different measurement schemes. As a concrete example, we use electronic nematic order in twisted double-bilayer graphene (TDBG)^{39,40,41,42,43,44,45}. This moiré system consists of two AB-stacked bilayers of graphene that are twisted against each other; as one can see in Fig. 1a, it exhibits the point group D₃, generated by threefold rotation C₃ along the out-of-plane z-axis and twofold rotation C_2x along the in-plane x-axis. Evidence of electronic nematic order has been observed in previous STM experiments^42,46 which clearly exhibit stripe-like features breaking the C₃ symmetry spontaneously for certain electron concentrations. While simple limiting cases have been compared with the data in Samajdar et al.⁴⁶, there is no systematic analysis of the microscopic form of nematicity in the system. To fill this gap, we consider the more general case in which all leading terms on the graphene and moiré scale describing nematic order in a continuum-model description of TDBG⁴⁷ are included. In addition, as it is common in graphene moiré systems^{24,25,26,42,48}, we also allow for finite strain. The Hamiltonian defining the changes in TDBG resulting from nematic order and strain depends on a set of parameters β, which we reconstruct from STM data using convolutional neural networks (CNN) in a supervised learning procedure. As such, our study differs significantly from recent works, which focused on detecting the presence or absence of nematic order³² or performed a phenomenological data analysis of STM measurements⁴⁹ with ML, rather than extracting the underlying microscopic physics as we do here.

**Fig. 1: TDBG, LDOS maps, and nematicity.**

Results

Nematic order in TDBG

The non-interacting band structure of TDBG features two moiré minibands per spin and valley close to charge neutrality, where a variety of correlation-driven phenomena can emerge^{39,40,41,42,43,44,45}. In Fig. 1b, these minibands are denoted as valence (VFB) and conduction flat bands (CFB). The band structure shown is obtained from continuum-model calculations close to half-filling of the CFB (band filling ν = 0.475), where electronic nematic order was observed to be the strongest⁴², see Supplementary Note 1 for more details. STM experiments probe the band structure and wave functions of a system by providing direct access to the spatial and energy dependence of the local density of states (LDOS). Most commonly, the LDOS is studied either for a fixed position r₀ over a range of different energies, ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$, or for a fixed energy ω₀ covering a spatial region of the system, ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$. The behavior of ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ and ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$ following from the continuum model for TDBG for three different energies and high-symmetry positions in the moiré unit cell is shown in Fig. 1c. The C₃ rotational and translational symmetry of the moiré lattice can be clearly seen in ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$. Meanwhile, C_2x is broken, albeit weakly, as a consequence of the electric field required to control the electron filling to be close to the middle of the CFB in an open-faced STM sample geometry⁴².

In graphene moiré systems, there are two fundamentally distinct sources of C₃ symmetry breaking—strain and electronic nematic order. Postponing the discussion of the former below, electronic nematic order^30,31 refers to the spontaneous rotational symmetry breaking as a result of electronic correlations. While recent works also indicate the possibility of nematic charge-density wave states in TDBG^43,50, where moiré translational symmetry is simultaneously broken, we here focus on translationally symmetric nematic order since the STM data of Rubio-Verdú et al.⁴² preserves moiré translations. The underlying nematic order parameter we study is a time-reversal- and moiré-translation-invariant vector ${{{{{{{\boldsymbol{\Phi }}}}}}}}=\Phi {\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{\varphi }$, ${\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{\varphi }=(\cos 2\varphi,\sin 2\varphi )$, transforming under the irreducible representation E of D₃ (or of C₃, taking into account the weak C_2x breaking); Φ and φ stand for the intensity and orientation of the nematic director, respectively. The microscopic form of nematicity can be modeled by a coupling of Φ to a fermionic bilinear and reads in its most general form in a continuum-model description as⁴⁶

$$\begin{array}{c}{{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}={\int}_{{{{{{{{\boldsymbol{r}}}}}}}}}{\int}_{\Delta {{{{{{{\boldsymbol{r}}}}}}}}}{{{{{{{\boldsymbol{\Phi }}}}}}}}\cdot {{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\sigma,\ell,s,\eta ;{\sigma }^{{\prime} },{\ell }^{{\prime} },{s}^{{\prime} },{\eta }^{{\prime} }}\left({{{{{{{\boldsymbol{r}}}}}}}},\Delta {{{{{{{\boldsymbol{r}}}}}}}}\right)\\ \times {c}_{\sigma,\ell,s,\eta }^{{{{\dagger}}} }\left({{{{{{{\boldsymbol{r}}}}}}}}+\Delta {{{{{{{\boldsymbol{r}}}}}}}}\right){c}_{{\sigma }^{{\prime} },{\ell }^{{\prime} },{s}^{{\prime} },{\eta }^{{\prime} }}({{{{{{{\boldsymbol{r}}}}}}}})+\,{{\mbox{H.c.}}}\,,\end{array}$$

(1)

where c^† and c are the electronic creation and annihilation operators. This general form encompasses couplings between the two sublattices s = A, B of the microscopic graphene sheets, the four graphene layers ℓ = 1, …, 4, the valley η = ± and spin σ = ↑, ↓ degrees of freedom in the tensorial form factor ${{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\sigma,\ell,s,\eta ;{\sigma }^{{\prime} },{\ell }^{{\prime} },{s}^{{\prime} },{\eta }^{{\prime} }}({{{{{{{\boldsymbol{r}}}}}}}},\Delta {{{{{{{\bf{r}}}}}}}})$; its two components are required to transform in the same way as Φ under all symmetries of the system. In the following, we will take ϕ to be trivial in the spin and diagonal in the valley indices, ${{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\sigma,\ell,s,\eta ;{\sigma }^{{\prime} },{\ell }^{{\prime} },{s}^{{\prime} },{\eta }^{{\prime} }}={\delta }_{\sigma,{\sigma }^{{\prime} }}{\delta }_{\eta,{\eta }^{{\prime} }}{{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\ell,s;{\ell }^{{\prime} },{s}^{{\prime} }}(\eta )$. This is motivated by the weak spin-orbit coupling in graphene^51,52 and the lack of indications of interaction-induced spin-orbit coupling, which is also strongly constrained⁵³. Furthermore, the intervalley-coherent nematicity is known to lead to stronger effects on the remote bands⁴⁶ that were not observed experimentally⁴².

Since we are working with a continuum theory, the space of possible couplings ϕ in Equation (1) is technically infinite-dimensional. As such, a complete reconstruction of ϕ from experimental data is impossible given the finite resolution and energy range of the available data. On top of this, it is not required either as we are primarily interested in understanding the low-energy behavior of the system. In the spirit of gradient expansions commonly used in continuum low-energy field theories, we will therefore only keep the leading terms in Φ. There is, however, a subtlety associated with the presence of an additional moiré length scale. We will therefore have to consider two basic classes of nematic orders, referred to as graphene (GN) and moiré (MN) nematicity^42,46.

In the case of MN, nematic order is associated with the moiré scale, i.e., we choose $\Delta {{{{{{{\boldsymbol{r}}}}}}}}={{{{{{{{\bf{R}}}}}}}}}_{{m}_{1},{m}_{2}}={m}_{1}{{{{{{{{\bf{L}}}}}}}}}_{1}^{M}+{m}_{2}{{{{{{{{\bf{L}}}}}}}}}_{2}^{M}$ in Equation (1), ${m}_{j}\in {\mathbb{Z}}$, with moiré lattice vectors ${{{{{{{{\bf{L}}}}}}}}}_{j}^{M}$, to represent the non-trivial transformation behavior of ϕ under C₃. We can thus take it to be diagonal in the remaining internal indices, yielding

$$\begin{array}{c}{{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{MN}}}}}}}}}=\frac{1}{2}{\Phi }_{{{{{{{{\rm{MN}}}}}}}}}{\int}_{{{{{{{{\boldsymbol{r}}}}}}}}}\mathop{\sum}\limits_{{m}_{1},{m}_{2}\in {\mathbb{Z}}}{\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{{\varphi }_{{{{{{{{\rm{MN}}}}}}}}}}\cdot {{{{{{{{\boldsymbol{\phi }}}}}}}}}_{{m}_{1},{m}_{2}}({{{{{{{\boldsymbol{r}}}}}}}})\\ \times {c}_{\alpha }^{{{{\dagger}}} }({{{{{{{\boldsymbol{r}}}}}}}}+{{{{{{{{\boldsymbol{R}}}}}}}}}_{{m}_{1},{m}_{2}})\,{c}_{\alpha }({{{{{{{\boldsymbol{r}}}}}}}})+\,{{\mbox{H.c.}}}\,,\end{array}$$

(2)

with multi-index α = (σ, ℓ, s, η). We further focus on the lowest moiré-lattice harmonic by setting ${\phi }_{{m}_{1},{m}_{2}}({{{{{{{\bf{r}}}}}}}})={\phi }_{{m}_{1},{m}_{2}}$ and only keeping the terms with the shortest possible ${{{{{{{{\bf{R}}}}}}}}}_{{m}_{1},{m}_{2}}$. Intuitively, MN order can be thought of as a distortion of the effective inter-moiré-unit-cell hopping matrix elements, as illustrated schematically in the lower right panel of Fig. 1d.

Conversely, GN acts as a local order parameter, Δr = 0 in Equation (1), without any explicit reference to the moiré scale,

$${{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{GN}}}}}}}}}={\Phi }_{{{{{{{{\rm{GN}}}}}}}}}{\int}_{{{{{{{{\boldsymbol{r}}}}}}}}}{\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{{\varphi }_{{{{{{{{\rm{GN}}}}}}}}}}\cdot {{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\ell,s;{\ell }^{{\prime} },{s}^{{\prime} }}(\eta ;{{{{{{{\boldsymbol{r}}}}}}}}){c}_{\ell,s}^{{{{\dagger}}} }({{{{{{{\boldsymbol{r}}}}}}}}){c}_{{\ell }^{{\prime} },{s}^{{\prime} }}({{{{{{{\boldsymbol{r}}}}}}}}).$$

(3)

Here, the correct transformation properties of ϕ result from its structure in the internal indices. Focusing on the local intra-layer contributions and the leading (constant) basis function, the most general form reads as

$${{{{{{{{\boldsymbol{\phi }}}}}}}}}_{\ell,s;{\ell }^{{\prime} },{s}^{{\prime} }}(\eta ;{{{{{{{\boldsymbol{r}}}}}}}})={\delta }_{\ell,{\ell }^{{\prime} }}{\psi }_{\ell }\left(\begin{array}{c}{({e}^{i{\alpha }_{\ell }\eta {\rho }_{z}}{\rho }_{x})}_{s{s}^{{\prime} }}\\ \eta {({e}^{i{\alpha }_{\ell }\eta {\rho }_{z}}{\rho }_{y})}_{s{s}^{{\prime} }}\end{array}\right),$$

(4)

where Pauli matrices in sublattice space are represented by ρ_j; α_l, and ψ_l are real-valued parameters. As shown schematically in the upper left panel of Fig. 1d, one can think of GN as the nematic distortion of the bonds of the individual graphene layers in a way that preserves the graphene translational symmetry.

We emphasize that GN and MN should not be viewed as distinct phases; they break the same symmetries and as such in general mix. We thus take ${{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{MN}}}}}}}}}+{{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{GN}}}}}}}}}$ to describe nematicity in TDBG in the following, which depends on the set of parameters β = {α_ℓ, ψ_ℓ, Φ_MN, Φ_GN, φ_MN, φ_GN}. The computation of the LDOS for a specific set of parameters can be done straightforwardly from the continuum model. The resulting spatial dependence of the LDOS, ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$, is also shown in Fig. 1d for two different values of β. As opposed to the plots without nematic order, C₃ is now broken, leading to stripes in the VFB, while translational symmetry is still preserved. The inverse problem—inferring the value of the parameters β from a given LDOS pattern—is a much more challenging task. Our goal in the following sections will be to use ML, in particular, CNNs to learn the set β directly from LDOS images.

Data sets and learning stage

Using CNNs to solve this inverse problem can be interpreted as a supervised learning task², i.e., a regression-like procedure using synthetic LDOS data labeled by their respective value of nematicity parameters β. More specifically, our CNNs take as inputs 65 × 65 pixels of LDOS images and apply consecutive transformations (represented by a set of weights between each layer) in order to extract meaningful correlations that represent the set β. One example of the CNN image inputs is shown in Fig. 2a. The complete data set consists of 12,000 images which are divided into training (60%), validation (20%), and test (20%) subgroups. Each image is generated for a randomly sampled set of nematic parameters β and the intensities in the LDOS are modified with the addition of Gaussian noise (see Supplementary Note 1). The motivation for noise is twofold: to avoid overfitting⁵⁴ and to test the stability against and performance of the procedure with noise, which is inevitably present in experimental data. For a detailed description of the CNN architecture, see the Methods section and Fig. 2a.

**Fig. 2: CNN architecture and nematic director prediction.**

The learning procedure is then defined by the minimization of the loss function with respect to the CNN’s weights in a backward propagation procedure⁵⁵. The loss function can be represented as the mean-squared error (MSE), which is defined as the difference between the true and expected set of parameters β in ${{{{{{{\rm{MSE}}}}}}}}=\mathop{\sum }\nolimits_{j}^{N}{({\beta }_{j}^{{{{{{{{\rm{true}}}}}}}}}-{\beta }_{j}^{{{{{{{{\rm{predicted}}}}}}}}})}^{2}/N$, with N representing the number of samples in the training or validation data sets. Finally, we consider the adaptive moment estimation (ADAM) for the minimization of the loss function, with a learning rate of 0.001 and batch size equal to 64⁵⁶. After the completion of the training stage, the algorithm is ready to be deployed to previously unseen data, returning as outputs the parameters β^predicted.

Orientation of the nematic director

As a first investigation, we consider the task of predicting the orientation φ of the nematic director from ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ images at a single energy in the VFB (ω₀ = − 15 meV, see Fig. 1b). For this, we consider a data set with randomly generated MN and GN intensities Φ_MN, Φ_GN ∈ [0.001, 0.1] eV, and φ_MN = φ_GN = φ ∈ [0, π]. Furthermore, ψ_l = 1 and α_l = 0 for all layers. The relation between the shape of the LDOS at single energy ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ and φ is highly non-trivial for two reasons: even for a given form of nematicity, changing φ generically not just merely rotates the LDOS pattern, due to the lattice, but leads to complex distortions of its structure. Additionally, by sampling ${{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{MN}}}}}}}}}+{{{{{{{{\mathcal{H}}}}}}}}}_{{{{{{{{\boldsymbol{\Phi }}}}}}}}}^{{{{{{{{\rm{GN}}}}}}}}}$, even if the same bond direction is favored over the C₃-related ones in the LDOS pattern of two samples, the underlying φ can be rather different. As can be seen in the three sample LDOS plots in Fig. 2b with different values of φ, the correspondence between φ and ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ is complex and not apparent to the human eye.

Using the angles φ as labels to the data is the most straightforward choice, but leads to inaccurate predictions around 0 and π due to the periodicity in the definition of the nematic order parameter, ${\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{\varphi }=(\cos 2\varphi,\sin 2\varphi )={\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{\varphi+\pi }$. To circumvent this feature, we use the two-component label ${\hat{{{{{{{{\boldsymbol{\Phi }}}}}}}}}}_{\varphi }$ instead of φ in the training process and then fold the network’s prediction back to φ with the arctan2 function⁵⁷. The results, shown in Fig. 2b, are consistent with the true labels, including at the boundaries of φ’s domain. This shows that even when the precise nature of nematicity (predominantly MN or GN or an admixture of the two) is not known, the director orientation φ can be accurately predicted with our CNN setup from ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ at a single energy. We have checked that the few outliers in Fig. 2b are directly related to small nematic intensities, where φ has virtually no impact on the LDOS and is, thus, impossible to predict.

Form of nematicity

After successfully learning the director orientation φ in the presence of different nematicities, we proceed into investigating the finer details of these couplings by learning the parameters β = {Φ_MN, Φ_GN, α_l} defined in Equations ((2)–(4)). To this end, we consider ψ_l = 1 and α_l = α for all layers. For concreteness, we set φ_MN = φ_GN = φ = 2π/3, which is one of the possible discrete orientations (φ_MN = φ_GN = 2π/3, π/6 and symmetry related) of the nematic director in the presence of C_2x. The data set now consists of randomly generated MN and GN intensities Φ_MN, Φ_GN ∈ [0.001, 0.1] eV, and α ∈ [0, π]. The intensity values are chosen such that the stripes in the VFB resemble the experimental results⁴². As with φ, instead of learning the angular variable α directly, the arctan2 mapping is also applied.

Using only the LDOS at a single energy (i.e., one ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ channel) in the ML architecture for this task does not produce accurate predictions. Additionally, both hyperparameter optimization and architecture modifications did not lead to any significant improvement, implying that nematic order impacts the electronic structure in complex ways that cascade across energy scales. In fact, this is also intuitively clear since, for example, the samples marked by a star and pentagon in Fig. 3a have fundamentally different nematic couplings and yet exhibit visually similar ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ images at the VFB energy.

**Fig. 3: Predicting the form of nematicity.**

In experiments, one can typically obtain single-point spectra [${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$] and real-space LDOS images at fixed energies [${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$]. We can therefore include additional input channels corresponding to ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ and ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$ for different energies ω₀ and points r₀, respectively. In the second case, the individual point spectra are transformed to scaleogram images for consistency with the input data for CNNs^5,58, see upper left inset in Fig. 3a and Supplementary Fig. 1. The new architecture is then formed by four channels with ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ inputs at fixed energies ω₀ = (−35, −15, 1, 23) meV within the flat and remote bands, such that they resemble visually the corresponding ones in the experimental data of Rubio-Verdú et al.⁴², and three channels for ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$ scaleogram inputs at stacking positions r₀ = (BAAC, ABAB, ABCA), cf. Fig. 1c. Each channel is passed through parallel Conv-Batch-MaxPool layers as in Fig. 2a, but instead of flattening each channel separately, they are concatenated to a Dense-Dropout stage before the last layer (Fig. 3a).

In Fig. 3b–d, predictions on the test data set are represented for (b) α, and (c) the moiré and (d) graphene nematic intensities; as can be seen, very good agreement is found between the reconstructed and true parameters. The outliers in α are related to small Φ_GN (brighter colors). From Equations (3) and (4), it is clear that for small Φ_GN, minimal changes will be induced in the LDOS, irrespective of the true value of the phase governed by α. This is a similar behavior to what was observed for outliers in the nematic director prediction. The results of Fig. 3 demonstrate that the microscopic form of nematicity can be extracted from the LDOS if significant energy dependence is included in the input data set.

Including strain

As already alluded to above, another possible source of C₃ breaking is strain^48,59,60,61, which is believed to be a ubiquitous property of graphene moiré superlattices at small twist angles. Breaking the same symmetries as nematic order, strain can obscure the experimental identification of nematic order and their precise interplay is still under debate^24,25,26,62. Experiments indicate^{24,25,26,42,48} that the most relevant form of strain in graphene superlattices such as twisted bilayer graphene (TBG) or TDBG is uniaxial heterostrain. In this case, the matrices ${{{{{{{{\mathcal{E}}}}}}}}}_{j}$ describing the in-plane metric deformation of the coordinates in the jth rotated Bernal bilayer of TDBG are of the form

$${{{{{{{{\mathcal{E}}}}}}}}}_{2}=-{{{{{{{{\mathcal{E}}}}}}}}}_{1}=\frac{1}{2}R{({\theta }_{\epsilon })}^{-1}\left(\begin{array}{cc}-\epsilon &0\\ 0&v\epsilon \\ \end{array}\right)R({\theta }_{\epsilon }).$$

(5)

Here v = 0.16 is the Poisson ratio for graphene and R(θ_ϵ) is the 2 × 2 matrix describing rotations of 2D vectors by angle θ_ϵ. We see that uniaxial heterostrain is characterized by two variables, the strain intensity ϵ and the direction of strain, parameterized by the angle θ_ϵ.

In the following, we allow for the simultaneous presence of uniaxial heterostrain and nematic order, leading to two additional parameters, ϵ and θ_ϵ, in β. We will study whether our ML approach is still able to extract the microscopic form of nematicity and also learn the relative strength and direction of strain. Note that the form of nematicity is still given by Equations ((2)–(4)), with the only difference that we replace ${{{{{{{{\bf{L}}}}}}}}}_{j}^{M}$ in the definition of ${{{{{{{{\bf{R}}}}}}}}}_{{m}_{1},{m}_{2}}$ by the strained moiré lattice vectors. The data set for this task is built with nematic intensities Φ_MN, Φ_GN ∈ [0.001, 0.1] eV, with the addition of strain parameters ϵ ∈ [0, 0.8]% and θ_ϵ ∈ [0, π/3]. Here, α_l = 0, ψ_l = 1 and φ = φ_MN = φ_GN = 2π/3. The domain for the strain intensities is chosen based on typical values observed in TBG²⁴, and for θ_ϵ on the periodicity of the unstrained system as θ_ϵ → θ_ϵ + π/3⁶¹. The ML architecture employed in this section is the same as in the previous investigation (Fig. 3a).

In Fig. 4a–d, predictions on the test data set are shown for ϵ (a), θ_ϵ (b), and the nematic intensities (c, d). At first sight, the result for the strain angle in Fig. 4b looks as if the procedure ceased to work since there are many data points where the true and predicted value of θ_ϵ differ significantly. However, when indicating the true strain intensity label ϵ for each prediction, it becomes clear that the outliers are related to small values of ϵ (brighter colors). As such, this behavior is not a shortcoming of the learning procedure but actually a feature of strain: for small enough ϵ in Equation (5), the angle θ_ϵ has no meaning. We have checked that removing the samples with small strain ϵ from the training and test data set will lead to accurate predictions of θ_ϵ (see Supplementary Fig. 2). The stability that we find for our learning procedure in the presence of virtually vanishing ϵ is, however, important when applying it to experimental data, where the strength of strain is unknown.

**Fig. 4: Distinguishing strain and nematicity.**

Most importantly, we see in Fig. 4c, d that the nematic couplings can still be accurately predicted when varying strain is present. The MAE is equally distributed in these cases, in contrast to the strain intensity prediction. This shows that not only nematic order can be identified when strain is present, but also its internal structure and the strength of strain that is present at the same time can be resolved when using different channels consisting of both ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$ and ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ as inputs. This allows the networks to take into account correlations between different energies in the STM data, which in turn conveys the crucial microscopic physics, enabling the model to disambiguate between lattice and electronic effects.

Experimental data

After demonstrating the effectiveness of CNNs on learning microscopic parameters {β_i} from a synthetic (theoretical) data set D${}_{{{{{{{{\rm{th}}}}}}}}}({\beta }_{1},\cdots \,,{\beta }_{{N}_{{{{{{{{\rm{th}}}}}}}}}})$ with N_th samples, we now proceed into applying the trained ML architecture for predictions of the a priori unknown sets of parameters $\{{\beta }_{i}^{{\prime} }\}$ in an experimental data set D${}_{\exp }({\beta }_{1}^{{\prime} },\cdots \,,{\beta }_{{N}_{\exp }}^{{\prime} })$. For concreteness, we use the same synthetic training data set as in Supplementary Note 2, where only the nematic and strain intensities are predicted, i.e., β = {Φ_MN, Φ_GN, ϵ}. The data set D${}_{\exp }$ is constituted of both scaleograms ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\bf{r}}}}}}}}}_{0}}(\omega )$ and ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ maps for different fillings of the CFB (n_s). More details about the preprocessing of the experimental data D${}_{\exp }$ can be found in the Supplementary Fig. 3.

In Fig. 5, predictions of the trained CNN for the set $\{{\beta }_{i}^{{\prime} }\}$ show non-zero values of nematicity (a) and strain (b) for all fillings of the CFB. For n_s ≥ 0.47 (gray region), the experimental data shows the most pronounced signatures of broken rotational symmetry to the human eye, which was previously interpreted as electronic nematic order^42,46. Here the CNN predicts MN to dominate over GN, although both are finite (as expected by symmetry). As can be seen in Fig. 5c, the parameters predicted by the CNN nicely reproduce the key features in the experimental data, including the strong stripes in the VFB and the much weaker, albeit finite, signatures of nematicity in the other bands.

**Fig. 5: Application to experimental data.**

For smaller fillings, n_s < 0.47, the experimental data still exhibit distortions that break C₃, see Supplementary Fig. 4, but no clear stripe-like features appear. The CNN tries to assign different anisotropy sources to these distorted regions, but the agreement between theoretical prediction and experiment is less accurate than for larger n_s. It is clearly possible that, indeed, a crossover from primarily MN to GN occurs when lowering n_s, as predicted by the neural network, see Fig. 5a, in particular, since nematic order is also a plausible instability in non-twisted bilayer graphene^29,63. However, we believe that additional experimental data and refined theoretical models are required to conclude whether this is really the case.

In contrast to this interplay between the nematic couplings, strain remains relatively constant for all n_s, and slightly decreases in Fig. 5b for n_s ≥ 0.47 as it approaches the same order of magnitude of ϵ ∈ [0.003 − 0.1%] that is expected for the experimental samples in ${D}_{\exp }$⁴². We note that at low fillings the value of strain that is predicted by the neural network is nevertheless significantly greater than the value extracted from experimental topography. This is likely a consequence of subtle differences between the continuum-model calculations and the experimental spectroscopy, which the network attempts to accommodate by including finite strain.

Discussion

We constructed and demonstrated a ML procedure that can extract the form of the nematic order parameter in TDBG from LDOS data. The key ingredient was the use of several channels that capture the correlations among different energies. Our work has several important implications. First, it shows that the presence and even the strength and internal structure of nematic order can be extracted when the sample exhibits significant heterostrain; this is a crucial aspect for moiré systems where the issue of distinguishing between nematicity and strain has been the subject of debate. Second, our analysis also shows which type of STM data is needed and most useful to extract information about nematicity: as we have seen, the LDOS maps at a single energy, ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\boldsymbol{r}}}}}}}})$, are not enough to deduce the form of the nematic order parameter and—contrary to what one might have expected—point spectra, i.e., ${{{{{{{{\mathcal{D}}}}}}}}}_{{{{{{{{{\boldsymbol{r}}}}}}}}}_{0}}(\omega )$, contain a lot of helpful complementary information for that task (see also the second model discussed in the Supplementary Note 5). Additionally, by studying the influence of inhomogeneous disorder in ${{{{{{{{\mathcal{D}}}}}}}}}_{{\omega }_{0}}({{{{{{{\bf{r}}}}}}}})$ maps, we show in Supplementary Note 4 that our ML procedure is highly robust against potential impurities, demonstrating further its generality and ability to disentangle random factors from microscopic physics. We emphasize that this form of solid-state Hamiltonian learning, i.e., of parameterizing the leading terms of a set of microscopic order parameters (like nematic order) or perturbations (such as strain) and extracting their form using multi-channel CNNs can be more broadly applied to other systems—see Supplementary Note 5 where we discuss a toy model for twisted bilayer graphene—and other forms of instabilities, such as the correlated insulators^64,65 or superconductivity. As such, this could open up ways of revealing the form and role of nematic order and other phases for the physics of quantum materials.

Methods

Details on the ML architecture

The implementation of the ML architecture for Fig. 2a was done with the TensorFlow library⁶⁶. Each convolutional layer is followed by batch normalization and max pooling layers (Conv-Batch-MaxPool). The batch normalization layers normalize the input weights in each stage, and also reduce the number of epochs necessary for convergence⁶⁷. This process is repeated four times, with the convolutional layers having a kernel size of 3 × 3 and strides set to 1. The filters follow a sequence of 16−32−32−16 with rectified linear unit (ReLU) activation functions⁶⁸. Padding is set to zero such that the reduction of dimensionality is performed only by the MaxPool layers. In turn, these have both strides and pool sizes set to 2 × 2. After a Flatten stage, dense layers lead to a dropout before the final layer with filters equal to the number of parameters in β. The Flatten layer transforms the data to a one-dimensional shape, and the Dropout reduces overfitting by setting a percentage of 20% adjusted weights to zero⁶⁹. Tests on variations of this architecture and the influence of its components on the performance of the predictions are described in Supplementary Note 2.

Metrics for parity plots

The additional metrics R² and mean absolute percentage error (MAPE) were calculated via R${}^{2}=\mathop{\sum }\nolimits_{j}^{N}{({\beta }_{j}^{{{{{{{{\rm{predicted}}}}}}}}}-{\bar{\beta }}^{{{{{{{{\rm{true}}}}}}}}})}^{2}/{({\beta }_{j}^{{{{{{{{\rm{true}}}}}}}}}-{\bar{\beta }}^{{{{{{{{\rm{true}}}}}}}}})}^{2}/N$ and MAPE$=\mathop{\sum }\nolimits_{j}^{N}\left|({\beta }_{j}^{{{{{{{{\rm{true}}}}}}}}}-{\beta }_{j}^{{{{{{{{\rm{predicted}}}}}}}}})/{\beta }_{j}^{{{{{{{{\rm{true}}}}}}}}}\right|/N$, where N stands for the number of samples in the test data set, and ${\bar{\beta }}^{{{{{{{{\rm{true}}}}}}}}}=\mathop{\sum }\nolimits_{j}^{N}{\beta }_{j}^{{{{{{{{\rm{true}}}}}}}}}/N$ is the standard mean over the parameters being learned.

Data availability

The theoretical and experimental data sets used and generated in this study are available in the Zenodo database under the accession code https://zenodo.org/record/7698738.

Code availability

The source codes used in this study are available in our Github repository⁷⁰.

References

Carleo, G. et al. Machine learning and the physical sciences. Rev. Mod. Phys. 91, 045002 (2019).
Article ADS CAS Google Scholar
Dawid, A. et al. Modern applications of machine learning in quantum sciences. Preprint at https://doi.org/10.48550/arXiv.2204.04198 (2022).
Lee, J., Carbone, M. R. & Yin, W. Machine learning the spectral function of a hole in a quantum antiferromagnet. Phys. Rev. B 107, 205132 (2023).
Article ADS CAS Google Scholar
Dubois, A. et al. Untrained physically informed neural network for image reconstruction of magnetic field sources. Phys. Rev. Appl. 18, 064076 (2022).
Article ADS CAS Google Scholar
Berthusen, N. F., Sizyuk, Y., Scheurer, M. & Orth, P. Learning crystal field parameters using convolutional neural networks. SciPost Phys. 11, 011 (2021).
Article ADS MathSciNet CAS Google Scholar
Chertkov, E. & Clark, B. K. Computational inverse method for constructing spaces of quantum models from wave functions. Phys. Rev. X 8, 031029 (2018).
CAS Google Scholar
Choudhary, K. et al. Computational scanning tunneling microscope image database. Sci. Data 8, 57 (2021).
Article PubMed PubMed Central Google Scholar
Joucken, F. et al. Denoising scanning tunneling microscopy images of graphene with supervised machine learning. Phys. Rev. Mater. 6, 123802 (2022).
Article CAS Google Scholar
Wang, C. et al. Machine learning identification of impurities in the STM images. Chin. Phys. B, 116805 (2020).
Article ADS Google Scholar
Zhang, Y. et al. Machine learning in electronic-quantum-matter imaging experiments. Nature 570, 484–490 (2019).
Article CAS PubMed Google Scholar
Liu, J., Huang, D., Yang, Y. F. & Qian, T. Removing grid structure in angle-resolved photoemission spectra via deep learning method. Phys. Rev. B 107, 165106 (2023).
Article ADS CAS Google Scholar
Bohrdt, A. et al. Classifying snapshots of the doped Hubbard model with machine learning. Nat. Phys. 15, 921–924 (2019).
Article CAS Google Scholar
Khan, A., Lee, C.-H., Huang, P. Y. & Clark, B. K. Leveraging generative adversarial networks to create realistic scanning transmission electron microscopy images. npj Comput Mater. 9, 85 (2023).
Article ADS Google Scholar
Chen, X. et al. Machine learning for optical scanning probe nanoscopy. Adv. Mater. 2022, 2109171 (2022).
Google Scholar
Ede, J. M. Deep learning in electron microscopy. Mach. Learn.: Sci. Technol. 2, 011004 (2021).
Google Scholar
Iwasawa, H., Ueno, T., Masui, T. & Tajima, S. Unsupervised clustering for identifying spatial inhomogeneity on local electronic structures. npj Quantum Mater. 7, 2397–4648 (2022).
Article Google Scholar
Burzawa, L., Liu, S. & Carlson, E. W. Classifying surface probe images in strongly correlated electronic systems via machine learning. Phys. Rev. Mater. 3, 033805 (2019).
Article CAS Google Scholar
Basak, S. et al. Deep learning Hamiltonians from disordered image data in quantum materials. Phys. Rev. B 107, 205121 (2023).
Article ADS CAS Google Scholar
Balents, L., Dean, C. R., Efetov, D. K. & Young, A. F. Superconductivity and strong correlations in moiré flat bands. Nat. Phys. 16, 725–733 (2020).
Article CAS Google Scholar
Andrei, E. Y. & MacDonald, A. H. Graphene bilayers with a twist. Nat. Mater. 19, 1265–1275 (2020).
Article ADS CAS PubMed Google Scholar
Cao, Y. et al. Correlated insulator behaviour at half-filling in magic-angle graphene superlattices. Nature 556, 80–84 (2018).
Article ADS CAS PubMed Google Scholar
Sharpe, A. L. et al. Emergent ferromagnetism near three-quarters filling in twisted bilayer graphene. Science 365, 605–608 (2019).
Article ADS CAS PubMed Google Scholar
Cao, Y. et al. Unconventional superconductivity in magic-angle graphene superlattices. Nature 556, 43–50 (2018).
Article ADS CAS PubMed Google Scholar
Kerelsky, A. et al. Maximized electron interactions at the magic angle in twisted bilayer graphene. Nature 572, 95–100 (2019).
Article ADS CAS PubMed Google Scholar
Jiang, Y. et al. Charge order and broken rotational symmetry in magic-angle twisted bilayer graphene. Nature 573, 91–95 (2019).
Article ADS CAS PubMed Google Scholar
Choi, Y. et al. Electronic correlations in twisted bilayer hraphene near the magic angle. Nat. Phys. 15, 1174–1180 (2019).
Article CAS Google Scholar
Cao, Y. et al. Nematicity and competing orders in superconducting magic-angle graphene. Science 372, 264–271 (2021).
Article ADS CAS PubMed Google Scholar
Lin, J.-X. et al. Zero-field superconducting diode effect in small-twist-angle trilayer graphene. Nat. Phys. 18, 1221–1227 (2022).
Article CAS Google Scholar
Liu, X. et al. Tuning electron correlation in magic-angle twisted bilayer graphene using coulomb Screening. Science 371, 1261–1265 (2021).
Article ADS CAS PubMed Google Scholar
Fradkin, E., Kivelson, S. A., Lawler, M. J., Eisenstein, J. P. & Mackenzie, A. P. Nematic fermi fluids in condensed matter physics. Annu. Rev. Condens. Matter Phys. 1, 153–178 (2010).
Article ADS CAS Google Scholar
Fernandes, R. M., Chubukov, A. V. & Schmalian, J. What drives nematic order in iron-bsed superconductors? Nat. Phys. 10, 97–104 (2014).
Article CAS Google Scholar
Goetz, J. B., Zhang, Y. & Lawler, M. Detecting nematic order in STM/STS data with artificial intelligence. SciPost Phys. 8, 087 (2020).
Article ADS MathSciNet CAS Google Scholar
Granade, C. E., Ferrie, C., Wiebe, N. & Cory, D. G. Robust online Hamiltonian learning. N. J. Phys. 14, 103013 (2012).
Article MathSciNet MATH Google Scholar
Wiebe, N., Granade, C., Ferrie, C. & Cory, D. G. Hamiltonian learning and certification using quantum resources. Phys. Rev. Lett. 112, 190501 (2014).
Article ADS PubMed Google Scholar
Wang, J. et al. Experimental quantum Hamiltonian learning. Nat. Phys. 13, 551–555 (2017).
Article Google Scholar
Valenti, A., van Nieuwenburg, E., Huber, S. & Greplova, E. Hamiltonian learning for quantum error correction. Phys. Rev. Res. 1, 033092 (2019).
Article CAS Google Scholar
Kokail, C. et al. Quantum variational learning of the entanglement Hamiltonian. Phys. Rev. Lett. 127, 170501 (2021).
Article ADS CAS PubMed Google Scholar
Yu, W., Sun, J., Han, Z. & Yuan, X. Robust and efficient Hamiltonian learning. Quantum 7, 1045 (2023).
Article Google Scholar
Cao, Y. et al. Tunable correlated states and spin-polarized phases in twisted bilayer-bilayer graphene. Nature 583, 215–220 (2020).
Article ADS CAS PubMed Google Scholar
Liu, X. et al. Tunable spin-polarized correlated states in twisted double bilayer graphene. Nature 583, 221–225 (2020).
Article ADS CAS PubMed Google Scholar
Shen, C. et al. Correlated states in twisted double bilayer graphene. Nat. Phys. 16, 520–525 (2020).
Article CAS Google Scholar
Rubio-Verdú, C. et al. Moiré nematic phase in twisted double bilayer graphene. Nat. Phys. 18, 196–202 (2022).
Article Google Scholar
He, M. et al. Chirality-dependent topological states in twisted double bilayer graphene. Preprint at https://doi.org/10.48550/arXiv.2109.08255 (2021).
Kuiri, M. et al. Spontaneous time-reversal symmetry breaking in twisted double bilayer graphene. Nat. Commun. 13, 6468 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Su, R., Kuiri, M., Watanabe, K., Taniguchi, T. & Folk, J. Superconductivity in twisted double bilayer graphene stabilized by WSe₂. Preprint at https://doi.org/10.48550/arXiv.2109.08255 (2022).
Samajdar, R. et al. Electric-field-tunable electronic nematic order in twisted double-bilayer graphene. 2D Mater. 8, 034005 (2021).
Article CAS Google Scholar
Koshino, M. Band structure and topological properties of twisted double bilayer graphene. Phys. Rev. B 99, 235406 (2019).
Article ADS CAS Google Scholar
Huder, L. et al. Electronic spectrum of twisted graphene layers under heterostrain. Phys. Rev. Lett. 120, 156405 (2018).
Article ADS CAS PubMed Google Scholar
Taranto, W. et al. Unsupervised learning of two-component nematicity from STM data on magic angle bilayer graphene. Preprint at https://doi.org/10.48550/arXiv.2203.04449 (2022).
Wilhelm, P., Lang, T., Scheurer, M. & & Läuchli, A. Non-coplanar magnetism, topological density wave order and emergent symmetry at half-integer filling of moiré Chern bands. SciPost Phys. 14, 040 (2023).
Kane, C. L. & Mele, E. J. Quantum spin Hall effect in graphene. Phys. Rev. Lett. 95, 226801 (2005).
Article ADS CAS PubMed Google Scholar
Min, H. et al. Intrinsic and Rashba spin-orbit interactions in graphene sheets. Phys. Rev. B 74, 165310 (2006).
Article ADS Google Scholar
Kiselev, E. I., Scheurer, M. S., Wölfle, P. & Schmalian, J. Limits on dynamically generated spin-orbit coupling: absence of ℓ = 1 Pomeranchuk instabilities in metals. Phys. Rev. B 95, 125122 (2017).
Article ADS Google Scholar
Goodfellow, I. J., Shlens, J. & Szegedy, C. Explaining and harnessing adversarial examples. Preprint at https://doi.org/10.48550/arXiv.1412.6572 (2015).
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323, 533–536 (1986).
Article ADS MATH Google Scholar
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://doi.org/10.48550/arXiv.1412.6980 (2017).
Fischer, P., Dosovitskiy, A. and Brox, T. Image orientation estimation with convolutional networks. German Conference on Pattern Recognition. (Springer International Publishing, Cham, 2015).
Mallat, S. A Wavelet Tour of Signal Processing. (Academic Press, 1999)
Nguyen, V. H. & Dollfus, P. Strain-induced modulation of Dirac cones and van Hove singularities in a twisted graphene bilayer. 2D Mater. 2, 035005 (2015).
Article Google Scholar
Yan, W. et al. Strain and curvature induced evolution of electronic band structures in twisted graphene bilayer. Nat. Commun. 4, 2159 (2013).
Article ADS PubMed Google Scholar
Bi, Z., Yuan, N. F. Q. & Fu, L. Designing flat bands by strain. Phys. Rev. B 100, 035448 (2019).
Article ADS CAS Google Scholar
Scheurer, M. S. Spectroscopy of graphene with a magic twist. Nature 572, 40–41 (2019).
Article ADS CAS PubMed Google Scholar
Cvetkovic, V., Throckmorton, R. E. & Vafek, O. Electronic multicriticality in bilayer graphene. Phys. Rev. B 86, 075467 (2012).
Article ADS Google Scholar
Nuckolls, K. P. et al. Quantum textures of the many-body wavefunctions in magic-angle graphene. Preprint at https://doi.org/10.48550/arXiv.2303.00024 (2023).
Kim, H. et al. Imaging inter-valley coherent order in magic-angle twisted trilayer graphene. Preprint at https://doi.org/10.48550/arXiv.2304.10586 (2023).
Abadi, M. et al. TensorFlow: Large-scale machine learning on heterogeneous distributed systems. Preprint at https://doi.org/10.48550/arXiv.1603.04467 (2016).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. Preprint at https://doi.org/10.48550/arXiv.1502.03167 (2015).
Fukushima, K. Cognitron: a self-organizing multilayered neural network. Biol. Cybern. 20, 121–136 (1975).
Article PubMed Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
MathSciNet MATH Google Scholar
Sobral, J. A., Obernauer, S., Turkel, S., Pasupathy, A. N. & Scheurer, M. S. joaosds/nematic-learning (v1.0) https://zenodo.org/record/8132134 (2023).

Download references

Acknowledgements

J.A.S. and M.S.S. acknowledge funding by the European Union (ERC-2021-STG, Project 101040651—SuperCorr). Views and opinions expressed are however those of the authors only and do not necessarily reflect those of the European Union or the European Research Council Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. This publication was funded by the German Research Foundation (DFG) grant “Open Access Publication Funding / 2023-2024 / University of Stuttgart” (512689491). Salary support is also provided by the National Science Foundation via grant DMR-2004691 (S.T.) and by the Office of Basic Energy Sciences, Materials Sciences, and Engineering Division, U.S. Department of Energy under Contract No. DE-SC0012704 (A.N.P.). J.A.S. is grateful for discussions with J.P. Valeriano, Sayan Banerjee, Patrick Wilhelm, Igor Reis, and Pedro H.P. Cintra. M.S.S. also thanks R. Samajdar, R. Fernandes, and J. Venderbos for a previous collaboration on nematic order in TDBG⁴⁶.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute for Theoretical Physics III, University of Stuttgart, 70550, Stuttgart, Germany
João Augusto Sobral & Mathias S. Scheurer
Institute for Theoretical Physics, University of Innsbruck, A-6020, Innsbruck, Austria
João Augusto Sobral, Stefan Obernauer & Mathias S. Scheurer
Department of Physics, Columbia University, 10027, New York, NY, USA
Simon Turkel & Abhay N. Pasupathy
Condensed Matter Physics and Materials Science Division, Brookhaven National Laboratory, 11973, Upton, NY, USA
Abhay N. Pasupathy

Authors

João Augusto Sobral
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Obernauer
View author publications
You can also search for this author in PubMed Google Scholar
Simon Turkel
View author publications
You can also search for this author in PubMed Google Scholar
Abhay N. Pasupathy
View author publications
You can also search for this author in PubMed Google Scholar
Mathias S. Scheurer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ML and continuum-model calculations for TDBG were performed by J.A.S., S.T., and M.S.S. Analogous calculations for the minimal model of TBG in the SI were done by S.O.; The experimental data set for TDBG from STM measurements was obtained by S.T and A.N.P.; Preprocessing of the experimental data was done by J.A.S. and S.T.; M.S.S. planned and supervised the project. All authors contributed to the writing of the manuscript.

Corresponding author

Correspondence to João Augusto Sobral.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Gabriel Schleder, and the other, anonymous, reviewer for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sobral, J.A., Obernauer, S., Turkel, S. et al. Machine learning the microscopic form of nematic order in twisted double-bilayer graphene. Nat Commun 14, 5012 (2023). https://doi.org/10.1038/s41467-023-40684-1

Download citation

Received: 06 March 2023
Accepted: 01 August 2023
Published: 17 August 2023
DOI: https://doi.org/10.1038/s41467-023-40684-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.