Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Dictionary learning in Fourier-transform scanning tunneling spectroscopy

## Abstract

Modern high-resolution microscopes are commonly used to study specimens that have dense and aperiodic spatial structure. Extracting meaningful information from images obtained from such microscopes remains a formidable challenge. Fourier analysis is commonly used to analyze the structure of such images. However, the Fourier transform fundamentally suffers from severe phase noise when applied to aperiodic images. Here, we report the development of an algorithm based on nonconvex optimization that directly uncovers the fundamental motifs present in a real-space image. Apart from being quantitatively superior to traditional Fourier analysis, we show that this algorithm also uncovers phase sensitive information about the underlying motif structure. We demonstrate its usefulness by studying scanning tunneling microscopy images of a Co-doped iron arsenide superconductor and prove that the application of the algorithm allows for the complete recovery of quasiparticle interference in this material.

## Introduction

The past few decades have seen dramatic advances in the understanding of the structure of materials via scattering and microscopy techniques. Scattering techniques are useful when perfect periodicity exists in a material, while microscopy is well suited for specimens that lack periodicity. Recent advances in microscopy techniques, when coupled with improved computing power, have enabled the scientific community to generate massive, multi-dimensional spatial images of specimens as a function of control parameters, such as time, energy, and applied stimulus. Examples of such advanced tools include super-resolution optical microscopy to inspect the structure of proteins beyond the diffraction limit1, scanning transmission electron microscopy to examine the chemical structure of materials at the atomic scale2, and scanning tunneling microscopy (STM) to visualize the quantum electronic structure of surfaces with atomic resolution3. Fundamentally, a microscope image represents the interaction between the probe and the specimen, and often times sophisticated analysis must be performed to uncover the scientific content present in the image. Specimens of interest for STM studies include metals4, two-dimensional (2-D) materials5, unconventional superconductors6, topological materials7,8, and charge9 and spin10,11 ordered materials, among others. Image analysis of these materials has provided several unique insights into the quantum electronic structure and interactions present within them. Many microscopy techniques utilize the Fourier transform (FT)12,13 for analysis, revealing the characteristic wavelengths present in the image, which are then related to a scientific theory of the specimen being studied. When perfect periodicity exists in an image, the FT provides a concise and accurate description of the image6,7,9,14,14,15,16,17,18. However, when applied to aperiodic images, the FT suffers from phase noise leading to a fundamental loss of information19,20. With the proliferation of new computing techniques, one may wonder if the maturation of optimization algorithms can be leveraged to extract more information from a microscopy image than through the FT.

In this work, we consider a class of images that are of particular importance to microscopy—those that can be perceived as a basic motif, called a kernel, that is repeated aperiodically across the image. Examples of kernels include electronic scattering patterns around atomic defects (in STM) and fluorescence from individual proteins (in optical microscopy). We present the development of an algorithm, based on nonconvex optimization, for analyzing such images that quantitatively extracts the principal motifs present in an image. We demonstrate that this algorithm can elucidate fundamentally new information unavailable through traditional FT analysis. While our methods are generally applicable to a wide range of microscopy techniques, in this work we focus on its application to STM.

## Results

### STM and scanning tunneling spectroscopy

STM and scanning tunneling spectroscopy produces 2-D spectroscopic maps of the local density of states (LDoS) at position x with energy ω, forming a three-dimensional dataset. The contrast in these images stems from local spatial variations of the LDoS, denoted as δρ(xω). Measurements in which δρ(xω) is ascribed to material defects that cause electron scattering and interference4 are particularly interesting, and such maps are often called quasiparticle interference (QPI) maps. Analysis of QPI maps has uncovered information on the dispersion relations and scattering processes in semimetals21,22, high-temperature superconductors6,14,18,23, and other systems. Let us suppose that the LDoS pattern created by a single defect located at x with energy ω is δρ0(xω), and that the STM image is composed from N defects located at x1x2, …, xN:

$$\delta \rho ({\bf{x}},\omega)=\mathop {\sum}\limits_{j=1}^{N}{c}_{j}\delta {\rho }_{0}({\bf{x}}-{{\bf{x}}}_{j},\omega),$$
(1)

where cj are constants. A real-world example of such an image is shown in Fig. 1a, obtained on the pnictide superconductor NaFeAs18.

According to scattering theory, the FT of the QPI image of an individual defect, δρ0(qω) = ∫dxeiqxδρ0(xω), is correlated with the underlying electronic structure of the material14,15,16. Many quantum materials previously studied by STM, examples of which include superconducting cuprates6, pnictides18 and chalcogenides10, charge density wave materials9, topological insulators7, and correlated oxides24 have sufficient disorder so that the LDoS signatures of different defects overlap. In this situation, it is not possible to identify the isolated defect signature through inspection. Instead, the traditional analysis9 proceeds by taking the FT of the entire STM image δρ(xω) in (1):

$$\delta \rho ({\bf{q}},\omega)=\int {\rm{d}}{\bf{x}}\ {e}^{-i{\bf{q}}\cdot {\bf{x}}}\delta \rho ({\bf{x}},\omega)=\delta {\rho }_{0}({\bf{q}},\omega)\mathop {\sum}\limits_{j=1}^{N}{c}_{j}\exp \{-i{\bf{q}}\cdot {{\bf{x}}}_{j}\}.$$
(2)

While the quantity of interest for QPI analysis is δρ0(qω), the experimental FT image contains a frequency-varying, complex-valued phase factor, $${\mathcal{P}}({\bf{q}})\equiv {\sum }_{j=1}^{N}{c}_{j}\exp \{-i{\bf{q}}\cdot {{\bf{x}}}_{j}\}$$. This is illustrated in Fig. 1b, where the real part of the FT (Re-FT) displays wild oscillations due to $${\mathcal{P}}({\bf{q}})$$. To mitigate this, the magnitude of the FT (mag-FT) is taken, and the analysis proceeds by assuming that $${\mathcal{P}}({\bf{q}})$$ is approximately constant in magnitude so that $$\left|{\mathcal{P}}({\bf{q}})\right|\approx \bar{c}\sqrt{N}$$, where $$\bar{c}$$ is the average value of the cj. The result of this procedure is illustrated in Fig. 1c, showing that debilitating noise still persists in the FT after taking the modulus. Moreover, the procedure of obtaining the mag-FT effectively eliminates half of the useful information in the complex-valued FT, annihilating all of the phase information from electron scattering processes originally present in real space. Intense peaks and contours in the real and imaginary parts of δρ(qω) are experimental indicators of dominant scattering wavevectors and order parameter symmetries, which can reveal important properties about the superconducting gap function sign structure25,26 and surface states of topological insulators27,28. However, random phase noise fluctuations in experimental QPI spectra make comparisons with theoretical QPI calculations difficult29.

An improved analysis technique to FT-STM would identify the location and the LDoS signature associated with each defect in a quantitatively rigorous fashion that respects experimental and material-specific constraints. For instance, defects remain fixed in position across a series of STM images in which the measurement bias voltage is varied (see Fig. 2). In this work, we present an analysis technique based on nonconvex optimization that possesses these desirable features while being broadly applicable to other forms of microscopy and image analysis.

### Connection with sparse blind deconvolution

Our algorithm is based on a deconvolutional procedure illustrated in Fig. 2. The image (denoted as $${\mathcal{Y}}$$) in Fig. 2a was produced by simulating the effect of quasiparticle scattering from numerous point defects randomly distributed across the image. At a given bias voltage, the image consists of a recurrent scattering pattern (called the kernel $${{\mathcal{A}}}_{0}$$) convolved with the locations and relative weights of each defect (called the activation map $${{\mathcal{X}}}_{0}$$) as illustrated in Fig. 2a and represented as $${\mathcal{Y}}={{\mathcal{A}}}_{0}\star {{\mathcal{X}}}_{0}$$. The underlying challenge in our analysis is to invert the procedure—starting with an STM image, determine the kernel and its corresponding activation map. The kernel and activation map are easily identified by inspection in Fig. 2a; however, this becomes a highly non-trivial problem in the presence of many overlapping kernels and experimental noise. Similar convolutional models are used in neuroscience to model neuron spike patterns19 and in systems biology to capture responses of the endocrine system20. In contrast, our algorithm focuses on a 2-D signal.

When $${{\mathcal{A}}}_{0}$$ contains multiple slices, with each slice corresponding to a different bias voltage, we mathematically express the proposed model for STM measurements by collecting the convolutions for each voltage slice using the notation

$${\mathcal{Y}}={{\mathcal{A}}}_{0} \otimes {{\mathcal{X}}}_{0}+{\mathcal{Z}},$$
(3)

which is schematically depicted in Fig. 2b. The activation map $${{\mathcal{X}}}_{0}$$ is shared globally across all measurement biases and $${\mathcal{Z}}$$ is an additive noise tensor. The task of recovering both $${{\mathcal{A}}}_{0}$$ and $${{\mathcal{X}}}_{0}$$ given $${\mathcal{Y}}$$ is known as the sparse blind deconvolution (SBD) problem.

### Formulating the SBD problem

Over the past decade, a wealth of heuristics and applications for sparse signal recovery have been developed, often leading to efficient algorithms in theory as well as in practice30,31 (see Supplementary Note 2). We investigate the following heuristic for producing estimates $$\hat{{\mathcal{A}}}$$ and $$\hat{{\mathcal{X}}}$$ for $${{\mathcal{A}}}_{0}$$ and $${{\mathcal{X}}}_{0}$$, by posing an optimization problem based on (3):

$$\hat{{\mathcal{A}}}=\arg \ \min _{{\mathcal{A}}} \min _{{\mathcal{X}}}\left[{\psi }_{\lambda }\left({\mathcal{A}},{\mathcal{X}}\right)\equiv \frac{1}{2}{\left\Vert {\mathcal{A}} \otimes {\mathcal{X}}-{\mathcal{Y}}\right\Vert }_{\mathrm{F}}^{2}+\lambda r({\mathcal{X}}),\right]$$
(4)

which allows one to recover $$\hat{{\mathcal{X}}}=\arg \ {\min }_{{\mathcal{X}}}{\psi }_{\lambda }(\hat{{\mathcal{A}}},{\mathcal{X}})$$.

This is similar to prem .vious formulations proposed for various SBD applications: the Frobenius norm term $${\left\Vert \cdot \right\Vert }_{\mathrm{F}}^{2}$$ promotes data fidelity upon minimization $$({\mathcal{Y}}\simeq \hat{{\mathcal{A}}} \otimes \hat{{\mathcal{X}}})$$, and a regularization term $$r(\hat{{\mathcal{X}}})$$ is chosen, such as the 1 norm, so that the minimization encourages $$\hat{{\mathcal{X}}}$$ to be sparse, with λ ≥ 0 governing the trade-off between the two objectives. However, most SBD applications focus on signal enhancement that uses the convolutional model as a rough guideline, leading to a weak notion of accurate estimation32. In contrast, the convolutional model fits naturally into the STM setting, in which robust, consistent results are paramount for scientific investigation. These considerations prompt a number of choices that are not emphasized in previous heuristics, such as the domain of $${\mathcal{A}}$$, form of $$r(\hat{{\mathcal{X}}})$$, and refinement of the estimates.

In order to solve this optimization problem, we present the SBD-STM algorithm:

Algorithm 1 Complete SBD-STM procedure

Input:

• Observation $${\mathcal{Y}}\in {{\mathbb{R}}}^{{n}_{1}\times {n}_{2}\times s}$$, kernel size $$\left({m}_{1},{m}_{2}\right)$$, initial λ0 ≥ 0, decay rate $$\alpha \in \left[0,1\right)$$, and final λend ≥ 0.

Initial phase:

1. 1.

Randomly initialize: $${{\mathcal{A}}}^{\left(0\right)}\in {\mathcal{S}}={{\mathbb{S}}}^{{m}_{1}\times {m}_{2}\times s}$$.

2. 2.

$${{\mathcal{A}}}_{* }^{\left(0\right)},{{\mathcal{X}}}_{* }^{\left(0\right)}\leftarrow$$ASolve$$({{\mathcal{A}}}^{(0)},{\lambda }_{0},{\mathcal{Y}})$$.

Refinement phase:

1. 1.

Lifting: Get $${{\mathcal{A}}}^{\left(1\right)}\in {S}^{\prime}={{\mathbb{S}}}^{{m}_{1}^{\prime}\times {m}_{2}^{\prime}\times s}$$ by zero-padding the edges of $${{\mathcal{A}}}_{* }^{\left(0\right)}$$ with a border of width $$\left\lfloor \frac{{m}_{i}}{2}\right\rfloor$$.

2. 2.

Set λ1 = λ0.

3. 3.

Continuation: Repeat for k = 1, 2, …  until λkλend,

1. (a)

$${{\mathcal{A}}}_{* }^{\left(k\right)},{{\mathcal{X}}}_{* }^{\left(k\right)}\leftarrow$$ASolve$$\left({{\mathcal{A}}}^{\left(k\right)},{\lambda }_{k},{\mathcal{Y}},{{\mathcal{X}}}_{* }^{\left(k-1\right)}\right)$$,

2. (b)

Centering:

1. i.

Find the size m1 × m2 submatrix of $${{\mathcal{A}}}_{* }^{\left(k\right)}$$ that maximizes the Frobenius (square) norm across all m1 × m2 submatrices.

2. ii.

Get $${{\mathcal{A}}}^{\left(k+1\right)}$$ by shifting $${{\mathcal{A}}}_{* }^{\left(k\right)}$$ so that the chosen m1 × m2 restriction is in the center, removing and zeropadding entries as needed.

3. iii.

Normalize $${{\mathcal{A}}}^{\left(k+1\right)}$$ so it lies in $${{\mathcal{S}}}^{\prime}$$.

4. iv.

Shift $${{\mathcal{X}}}_{* }^{\left(k\right)}$$ along the anti-parallel vector to the shift of $${{\mathcal{A}}}_{* }^{\left(k\right)}$$.

3. (c)

Set λk+1 = αλk.

Output:

1. 1.

Extract $$\hat{{\mathcal{A}}}\in {\mathcal{S}}$$ by extracting the restriction of the final $${{\mathcal{A}}}^{\left(k+1\right)}$$ to the center m1 × m2 window.

2. 2.

Find the corresponding activation map $$\hat{{\mathcal{X}}}\in {{\mathbb{R}}}^{{n}_{1}\times {n}_{2}}$$ by solving $${\min }_{{\mathcal{X}}}{\psi }_{{\lambda }_{k}}(\hat{{\mathcal{A}}},{\mathcal{X}})$$.

Function Asolve

Input:

• Current kernel, $${{\mathcal{A}}}_{{\rm{in}}}$$, current sparsity parameter, λin, the observation $${\mathcal{Y}}$$, current activation map $${{\mathcal{X}}}_{{\rm{in}}}$$ (Refinement Phase)

Minimization

if Initial Phase then

1. 1.

Initialize $${\mathcal{X}}$$ as a zero matrix of size (n1n2)

2. 2.

$${{\mathcal{X}}}_{1}\leftarrow$$ Minimize $${\mathcal{X}}$$ for $${\psi }_{{\lambda }_{{\rm{in}}}}({{\mathcal{A}}}_{{\rm{in}}},{\mathcal{X}})$$ using the FISTA algorithm33.

else

$${{\mathcal{X}}}_{1}\leftarrow {{\mathcal{X}}}_{{\rm{in}}}$$

end if

1. 1.

$${{\mathcal{A}}}_{{\rm{out}}}\leftarrow$$ Minimize $${\mathcal{A}}$$ for $${\psi }_{{\lambda }_{{\rm{in}}}}({\mathcal{A}},{{\mathcal{X}}}_{1})$$ using the Riemannian Trust-Region Method (RTRM) over the sphere34.

2. 2.

$${{\mathcal{X}}}_{{\rm{out}}}\leftarrow$$ Minimize $${\mathcal{X}}$$ for $${\psi }_{{\lambda }_{{\rm{in}}}}({{\mathcal{A}}}_{{\rm{out}}},{\mathcal{X}})$$ using FISTA.

Output:

• $${{\mathcal{A}}}_{{\rm{out}}}$$, $${{\mathcal{X}}}_{{\rm{out}}}$$.

See Supplementary Notes 2 and 3 for further discussion on formulating and solving Eq. (4), and Supplementary Note 4 for how our approach to SBD can be applied to image deblurring by using an objective similar to Eq. (4).

### The blind deconvolution approach

To demonstrate the strength of SBD-STM, consider the situation illustrated in Fig. 3. We generated a simulated observation $${\mathcal{Y}}$$ using a ground truth scattering pattern $${{\mathcal{A}}}_{0}$$ similar to Fig. 1d and a dense, randomly generated activation map $${{\mathcal{X}}}_{0}$$ shown in Fig. 3b. Convolving the truth data with the activation map and adding significant white noise with variance η so that the signal-to-noise ratio (SNR) is less than unity, with $${\rm{SNR}}\equiv \frac{{\mathrm{var}}({{\mathcal{A}}}_{0})}{\eta }$$, we generate the image shown in Fig. 3c. With many overlapping kernels and substantial noise, it is a futile task to accurately identify the underlying kernel and activation map of the image through visual inspection. However, as shown in Fig. 3d, e, SBD-STM successfully recovers a kernel $$\hat{{\mathcal{A}}}$$ and its associated activation map $$\hat{{\mathcal{X}}}$$ that closely resemble the truth data. The results shown in Fig. 3 were obtained with a fixed λ = 0.1. The scaling of the activation map entries is due to the choice of λ, and the noise also introduces blurring in the activation map35. Despite this, the overall features of the activation map and the recovered kernel are remarkably similar to the ground truth.

In Fourier space, the Re-FT of $${\mathcal{Y}}$$ is missing crucial features of the true Re-FT spectrum and has noise fluctuations  ≈100 times that of the true transform. However, the Re-FT of the SBD-STM recovered kernel is consistent with the true Re-FT in both its structure and amplitude.

In our implementation, SBD-STM yields an activation map shared across all bias voltages. This not only reveals the spatial distribution of the defect kernels but also naturally improves the accuracy of the SBD-STM recovered kernels at bias energies with noisy measurements. Consequently, SBD-STM returns more physically meaningful results when data from multiple biases are simultaneously analyzed than if each constant-bias slice of $${\mathcal{Y}}$$ were individually analyzed. SBD-STM results on a simulated noisy STM dataset with 41 bias voltages are found in Supplementary Note 5, demonstrating that SBD-STM is successful in optimizing the objective function with STM constraints in mind.

### Performance characterization

Before invoking SBD-STM on experimental data, we must understand its limitations and domain of applicability. The complexity of the STM deconvolution problem varies depending on the SNR and the overlap tendency of nearby defects. A series of numerical experiments on simulated data were performed to investigate the effects of defect concentration θ—the probability that any entry of $${{\mathcal{X}}}_{0}$$ is truly non-zero—and additive measurement noise on the expected success of SBD-STM. Simulated STM images are produced in a similar fashion as in Fig. 3, and the performance of SBD-STM was assessed as a function of four adjustable parameters—the image size n ≡ n1 × n2, kernel size m ≡ m1 × m2, kernel concentration θ, and SNR. Details of the data generation and simulation work are contained in Supplementary Notes 1 and 6. To assess the accuracy of kernel recovery in real space, we define the real-space recovery error metric as $$\epsilon ({\hat{{\mathcal{A}}}}_{\theta ,\eta },{{\mathcal{A}}}_{0})\equiv \frac{2}{\pi }\arccos |\langle {\hat{{\mathcal{A}}}}_{\theta ,\eta },{{\mathcal{A}}}_{0}\rangle|$$, with $$\langle {\hat{{\mathcal{A}}}}_{\theta ,\eta },{{\mathcal{A}}}_{0}\rangle$$ denoting the inner product of the vectorizations of $${\hat{{\mathcal{A}}}}_{\theta ,\eta }$$ and $${{\mathcal{A}}}_{0}$$, which are the recovered and truth kernels, respectively. Figure 4a depicts a normalized defect size $$\frac{m}{n}$$ vs. concentration θ phase diagram to explore the interplay between $$\frac{m}{n}$$ and θ on real-space algorithmic accuracy $$\epsilon ({\hat{{\mathcal{A}}}}_{\theta },{{\mathcal{A}}}_{0})$$ in noise-free (η = 0) simulated measurements. We observe a phase transition in SBD-STM performance in the $$\frac{m}{n}-\theta$$ plane. The bottom left of the plot captures situations in which the defects have negligible probability of overlapping, facilitating the near-perfect deconvolution of the noise-free image. Increasing either $$\frac{m}{n}$$ or θ introduces error in the kernel recovery due to increased overlapping between defects. Practically, $$\frac{m}{n}$$ can be reduced by increasing the overall STM measurement area n in an attempt to perform deconvolution-by-inspection. However, at high defect concentrations θ or moderate noise levels, this strategy cannot guarantee success, while SBD-STM can still return reliable estimates.

Next, we briefly discuss the SBD-STM performance as a function of noise in the signal. Figure 4b shows the evolution of $$\epsilon ({\hat{{\mathcal{A}}}}_{\theta ,\eta },{{\mathcal{A}}}_{0})$$ as a function of defect concentration θ for three values of SNR ranging from noise free to noise dominated (SNR = 0.792). At high noise levels, performance error fluctuations in the far left of the plot appear because of the statistically futile challenge of accurately identifying low-density, low-intensity motifs under high levels of noise. The error curves appear to converge into a narrow band when θ 0.01, demonstrating that the algorithm is robust to a wide range of SNRs for higher concentrations. By θ ≈ 0.2, the defect concentrations are sufficiently dense so that virtually all defect kernels are overlapping, causing SBD-STM to collapse and return unreliable estimates. These trends persist when the kernel size and SNR are further increased, as described in Supplementary Note 6. Altogether, our simulations show that in a wide range of parameters (Fig. 4a), SBD-STM performs splendidly (ϵ < 0.1) and consistently outperforms the usual FT-STM methodology even in the presence of considerable noise, as seen in Fig. 4c–j.

### Application to real data

The results obtained above on synthetic data show that SBD-STM is able to recover kernels even at high defect density and in the presence of significant white noise, where alternate techniques such as manual detection fail. However, one might still question whether the method will work on real STM data, which might have other types of noise or errors, which we have not accounted for in our synthetic data. In order to investigate this fully, we now apply SBD-STM to investigate a set of experimentally obtained STM images of NaFe1−xCoxAs at different values of x = 0.0, 0.015 and 0.02. At x = 0.0 (parent compound), no additional cobalt dopants are present in the lattice, and the only defects present are those intrinsic to the crystal, which are at a concentration of about 1%. This compound has been previously studied via STM and the data have been analyzed using the standard FT technique18. To contrast with the standard FT-STM analysis, we implement SBD-STM on raw experimental data to demonstrate the significant improvement in data fidelity of the Re-FT. Shown in Fig. 5b, c are the recovered Kernel and activation map, respectively, from the map shown in Fig. 5a. Figure 5d, e show the Re-FT of the entire image and of the kernel, respectively, over the same range in Fourier space. The phase sensitive recovery of the FT in Fig. 5e when compared to Fig. 5d is immediately apparent. We note that for this particular doping the individual kernels are well separated, and the kernel can be isolated directly by eye from the large area image in Fig. 5a. We show the application of SBD-STM to this sample to illustrate that the recovered kernel is indeed what one would expect by direct measurement around an individual defect.

We now turn to the sample with x = 0.015, a STS image of which is shown in Fig. 5f. At this doping, we see that there are regions of high-density clustering of the kernels, where the clustering is dense enough such that the individual kernels overlap and become difficult to resolve. In Fig. 5g, h, we show the corresponding recovery for the kernel and activation map, respectively. The Re-FT of the entire map and the recovered kernel are shown in Fig. 5i, j, respectively. At this defect level, we can very occasionally see isolated defects, and the recovered kernel is seen to nicely match with the differential conductance around these isolated defects.

Finally, we consider a sample that is optimally doped (highest Tc) with a nominal cobalt concentration of x = 0.02. Shown in Fig. 5k is an STS image obtained on this sample at T = 5.9 K. At this doping level, there are no isolated impurities present anywhere in the sample, and the kernel cannot be manually recovered. The SBD-STM is able to recover the kernel and activation map as shown in Fig. 5l, m respectively. The Re-FT of the entire image and of the recovered kernel are shown in Fig. 5n, o, respectively. We note that the kernel recovered at this doping level is much closer to four-fold symmetry than at lower cobalt dopings. This is consistent with previous measurements of the phase diagram of NaFe1−xCoxAs36, including those by STM37 that have shown that the transition from orthorhombic to tetragonal symmetry happens before optimal doping. We see from the series of data in Fig. 5 that SBD-STM works over the entire doping range that is relevant to STM experiments and is able to recover high-quality kernels with phase-sensitive FTs.

From the results shown on both synthetic and real experimental STM data, we can see that SBD-STM provides a complete recovery of kernels in real space and therefore a phase sensitive recovery of the FT in reciprocal space. Within the formulation of QPI, the phase of the FT in the QPI signal is dependent on the incoming and outgoing quasiparticle’s Green’s function as well as the potential of the impurity. The availability of phase information can give us new insight into individual materials that is not available simply from the magnitude of the QPI signal. In the remainder of this paper, we consider one such new insight into the physics of NaFe1−xCoxAs. NaFe1−xCoxAs, like many of the pnictides displays a superconducting dome as a function of cobalt doping. The maximum Tc at optimal doping (x = 0.02) reaches 18 K. As with the other pnictides, determining the symmetry of the superconducting order parameter in this compound is of much current interest. In this context, recent theoretical work on QPI in the superconducting state of the pnictides29,38,39 has opened up the possibility of distinguishing different superconducting order parameters from their QPI signature. We follow the procedure outlined in the recent work by Altenfeld et al. 39, where the (real) Fourier transform of the QPI signal around a single defect δρ(qω) is integrated over all q-space to produce the quantity δρ(ω). This quantity is then anti-symmetrized with respect to energy relative to the Fermi level to produce a quantity δρ(ω) = δρ(ω) − δρ(−ω). It is shown29,39 that in the case of s± pairing, δρ(ω) is large and of constant sign over the energy range near the superconducting gap. Conversely, in the case of s++ pairing, δρ(ω) is expected to be small and have a sign change around the gap energy. In order to carry out the integration over q described in this procedure, it is required that we have access to the phase of the QPI signal. One way of achieving this is to directly image around an isolated dopant or defect where the complete phase sensitive pattern can be measured. Such STS imaging has recently been performed on iron chalcogenides40,41. However, this method has not been applied to the iron arsenides, especially at optimal doping where the defect or dopant density is high and the phase information in the FT was not previously available. Armed with the phase-sensitivity of SBD-STM, we now analyze differential conductance maps of near optimally doped NaFe1−xCoxAs to investigate the superconducting order parameter at optimal doping.

We start with a dataset that consists of 21 raw STS images from −10 to +10 meV in 1 meV increments on an optimally doped NaFe1−xCoxAs sample at T = 5.9 K. One of these raw images at ω = −10 meV is shown in Fig. 6a (additional images are described in Supplementary Note 10). Notice that no individual motifs can be resolved by eye. We then proceed to recover the kernels $$\hat{{\mathcal{A}}}({\bf{r}},\omega)$$ at each energy using SBD-STM. An example of this recovery is shown in Fig. 6b, which is the recovered SBD-STM kernel $$\hat{{\mathcal{A}}}$$(rω = − 10 meV), recovered from the raw STS image in Fig. 6a. The recovered kernel lacks the strong anisotropy of recovered kernels from the underdoped regime, suggesting that electronic nematicity is not strong at this doping. Assuming that SBD-STM has worked correctly, the recovered $$\hat{{\mathcal{A}}}({\bf{r}},\omega)$$ is identical to the real-space QPI signal δρ(rω). We then take the real part of the 2-D Fourier transform $$\hat{{\mathcal{A}}}({\bf{q}},\omega)$$, as shown in Fig. 6c at ω = −10 meV. This FT has the full-phase information present, and we can then integrate over q and antisymmetrize with respect to energy:

$${\hat{{\mathcal{A}}}}^{-}(\omega)=\sum _{{\bf{q}}}{\mathrm{Re}}(\hat{{\mathcal{A}}}({\bf{q}},\omega))-{\mathrm{Re}}(\hat{{\mathcal{A}}}({\bf{q}},-\omega)).$$
(5)

We perform this procedure at each energy, and plot the resultant $${\hat{{\mathcal{A}}}}^{-}(\omega)$$ in Fig. 6d. In Fig. 6e, we show the spatially averaged differential conductance from the same datasets as a function of energy, revealing the superconducting gap. From the two coherence peaks, we calculate a 2Δ of 11 meV. We can clearly see from Fig. 6d that $${\hat{{\mathcal{A}}}}^{-}(\omega)$$ is peaked near the superconducting gap, and has no sign change in the energy range near the gap. For comparison we performed theoretical calculations of the anti-symmetrized correction to the LDoS, δρ(ω) as shown in the inset of Fig. 6d following the original prescription29. Here, we use the electronic structure of Co-doped NaFeAs previously measured by angle-resolved photoemission spectroscopy (ARPES)42,43 and fitted to the 10 orbital tight-binding model44. The values of the superconducting gap on each band were taken to be Δh = 6.5 meV, Δe = 6.8 meV on the electron and the smaller hole pockets, respectively42, and ΔH = 3.5 meV on the larger hole pocket43. Further details are given in Supplementary Note 9. As expected, the behavior of the $${\hat{{\mathcal{A}}}}^{-}(\omega)$$ for the sign-changing gaps between electron and hole pockets follows the predicted behavior of the δρ(ω) for an s+−-pairing state in which this quantity does not change in an energy range between the gaps on the hole and electron pockets, while for sign-preserving gaps this quantity is generally small, with an alternating sign between the gaps. A similar behavior is found in the experiment, as shown in the main Fig. 6d. This procedure illustrates some of the new physical insight into STS image data that can be obtained once the complete phase information in the QPI signal is available for analysis.

## Discussion

In its current implementation, SBD-STM addresses the problem of identifying a single motif across a series of images. Beyond the identification of real-space motifs in microscopy images, SBD-STM can also be applied to problems in which the motif is sparse in the appropriately chosen space, such as sparsity in the spatial gradient for natural image deblurring45,46. Moreover, the flexibility of the convolutional data model in (3) affords the natural generalization, $${\mathcal{Y}}={\sum }_{j=1}^{M}{{\mathcal{A}}}_{0}^{\left(j\right)} \otimes {{\mathcal{X}}}_{0}^{\left(j\right)}+{\mathcal{Z}}$$, which expands the scope of SBD-STM to identify multiple distinct kernels in any series of images. In particular, STM images that contain various short-range orders, such as charge or spin density waves, would be amenable to a similar analysis47,48,49,50. SBD-STM recovered results from these STM measurements can be directly compared with theoretical predictions51,52,53 to understand the nature of competing orders in superconductors and other strongly correlated materials. Other analysis methodologies47,50,54 have been recently proposed to improve FT data fidelity and provide some phase-sensitive information on the structure of ordered phases. These alternative approaches provide compelling information under suitable conditions, but their results are still vulnerable to phase noise contamination. The correct implementation of SBD-STM to such cases remains an open but solvable problem.

## Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

## References

1. Betzig, E. et al. Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 1642–1645 (2006).

2. Muller, D. A. Structure and bonding at the atomic scale by scanning transmission electron microscopy. Nat. Mater. 8, 263–270 (2009).

3. Binnig, G. & Rohrer, H. Surface imaging by scanning tunneling microscopy. Ultramicroscopy 1, 157–160 (1983).

4. Crommie, M. F., Lutz, C. P. & Eigler, D. M. Imaging standing waves in a two-dimensional electron gas. Nature 363, 524–527 (1993).

5. Rutter, G. M. et al. Scattering and interference in epitaxial graphene. Science 317, 219–222 (2007).

6. Hoffman, J. E. et al. Imaging quasiparticle interference in Bi2Sr2CaCu2O8+x. Science 297, 1148–1151 (2002).

7. Roushan, P. et al. Topological surface states protected from backscattering by chiral spin texture. Nature 460, 1106–1109 (2009).

8. Hanneken, C. et al. Electrical detection of magnetic skyrmions by tunnelling non-collinear magnetoresistance. Nat. Nanotechnol. 10, 1039–1042 (2015).

9. Arguello, C. J. et al. Quasiparticle interference, quasiparticle interactions, and the origin of the charge density wave in 2h-nbse2. Phys. Rev. Lett. 114, 037001 (2015).

10. Hanaguri, T., Niitaka, S., Kuroki, K. & Takagi, H. Unconventional s-wave superconductivity in Fe(Se,Te). Science 328, 474–476 (2010).

11. Enayat, M. et al. Real-space imaging of the atomic-scale magnetic structure of Fe1+yTe. Science 345, 653–656 (2014).

12. Kimoto, K., Kurashima, K., Nagai, T., Ohwada, M. & Ishizuka, K. Assessment of lower-voltage {TEM} performance using 3d fourier transform of through-focus series. Ultramicroscopy 121, 31–37 (2012).

13. Jaffe, J. S. & Glaeser, R. M. Difference fourier analysis of “surface features” of bacteriorhodopsin using glucose-embedded and frozen-hydrated purple membrane. Ultramicroscopy 23, 17–28 (1987).

14. Wang, Q.-H. & Lee, D.-H. Quasiparticle scattering interference in high-temperature superconductors. Phys. Rev. B 67, 020511 (2003).

15. Fiete, G. A. & Heller, E. J. Colloquium : theory of quantum corrals and quantum mirages. Rev. Mod. Phys. 75, 933–948 (2003).

16. Kivelson, S. A. et al. How to detect fluctuating stripes in the high-temperature superconductors. Rev. Mod. Phys. 75, 1201–1241 (2003).

17. McElroy, K. et al. Relating atomic-scale electronic phenomena to wave-like quasiparticle states in superconducting Bi2Sr2CaCu2O8+δ. Nature 422, 592–596 (2003).

18. Rosenthal, E. P. et al. Visualization of electron nematicity and unidirectional antiferroic fluctuations at high temperatures in NaFeAs. Nat. Phys. 10, 225–232 (2014).

19. Pillow, J. W., Shlens, J., Chichilnisky, E. J. & Simoncelli, E. P. A model-based spike sorting algorithm for removing correlation artifacts in multi-neuron recordings. PLoS ONE 8, e62123 (2013).

20. Faghih, R. T., Dahleh, M. A., Adler, G. K., Klerman, E. B. & Brown, E. N. Deconvolution of serum cortisol levels by using compressed sensing. PLoS ONE 9, e85204 (2014).

21. Vonau, F. et al. Evidence of hole–electron quasiparticle interference in ErSi2 semimetal by fourier-transform scanning tunneling spectroscopy. Phys. Rev. Lett. 95, 176803 (2005).

22. Simon, L., Vonau, F. & Aubel, D. A phenomenological approach of joint density of states for the determination of band structure in the case of a semi-metal studied by ft-sts. J. Phys. 19, 355009 (2007).

23. Wang, F. & Lee, D.-H. The electron-pairing mechanism of iron-based superconductors. Science 332, 200–204 (2011).

24. Wang, Z. et al. Quasiparticle interference and strong electron mode coupling in the quasi-one-dimensional bands of Sr2RuO4. Nat. Phys. 13, 799 (2017).

25. Hanaguri, T. et al. Coherence factors in a high-tc cuprate probed by quasi-particle scattering off vortices. Science 323, 923–926 (2009).

26. Chi, S. et al. Sign inversion in the superconducting order parameter of LiFeAs inferred from bogoliubov quasiparticle interference. Phys. Rev. B 89, 104522 (2014).

27. Zhang, T. et al. Experimental demonstration of topological surface states protected by time-reversal symmetry. Phys. Rev. Lett. 103, 266803 (2009).

28. Okada, Y. et al. Direct observation of broken time-reversal symmetry on the surface of a magnetically doped topological insulator. Phys. Rev. Lett. 106, 206805 (2011).

29. Hirschfeld, P. J., Altenfeld, D., Eremin, I. & Mazin, I. I. Robust determination of the superconducting gap sign structure via quasiparticle interference. Phys. Rev. B 92, 184513 (2015).

30. Candes, E. J. & Tao, T. Decoding by linear programming. IEEE Trans. Inf. Theory 51, 4203–4215 (2005).

31. Elad, M. Sparse and redundant representation modeling—what next? IEEE Signal Process. Lett. 19, 922–928 (2012).

32. Levin, A., Weiss, Y., Durand, F. & Freeman, W. T. Understanding and evaluating blind deconvolution algorithms. In 2009 IEEE Conference on Computer Vision and Pattern Recognition 1964–1971 (2009).

33. Beck, A. & Teboulle, M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2, 183–202 (2009).

34. Absil, P.-A., Baker, C. G. & Gallivan, K. A. Trust-region methods on riemannian manifolds. Found. Comput. Math. 7, 303–330 (2007).

35. Candès, E. J., Wakin, M. B. & Boyd, S. P. Enhancing sparsity by reweighted 1 minimization. J. Fourier Anal. Appl. 14, 877–905 (2008).

36. Parker, D. R. et al. Control of the competition between a magnetic phase and a superconducting phase in cobalt-doped and nickel-doped nafeas using electron count. Phys. Rev. Lett. 104, 057007 (2010).

37. Cai, P. et al. Doping dependence of the anisotropic quasiparticle interference in NaFe1−xCoxAs iron-based superconductors. Phys. Rev. Lett. 112, 127001 (2014).

38. Martiny, J. H. J., Kreisel, A., Hirschfeld, P. J. & Andersen, B. M. Robustness of a quasiparticle interference test for sign-changing gaps in multiband superconductors. Phys. Rev. B 95, 184507 (2017).

39. Altenfeld, D., Hirschfeld, P. J., Mazin, I. I. & Eremin, I. Detecting sign-changing superconducting gap in lifeas using quasiparticle interference. Phys. Rev. B 97, 054519 (2018).

40. Sprau, P. O. et al. Discovery of orbital-selective cooper pairing in fese. Science 357, 75–80 (2017).

41. Du, Z. et al. Sign reversal of the order parameter in (Li1 xFex)OHFe1−yZnySe. Nat. Phys. 14, 134–139 (2017).

42. Liu, Z.-H. et al. Unconventional superconducting gap in nafe0.95co0.05as observed by angle-resolved photoemission spectroscopy. Phys. Rev. B 84, 064519 (2011).

43. Thirupathaiah, S. et al. Weak-coupling superconductivity in electron-doped nafe0.95co0.05as revealed by arpes. Phys. Rev. B 86, 214508 (2012).

44. Eschrig, H. & Koepernik, K. Tight-binding models for the iron-based superconductors. Phys. Rev. B 80, 104503 (2009).

45. Fergus, R., Singh, B., Hertzmann, A., Roweis, S. T. & Freeman, W. T. Removing camera shake from a single photograph. ACM Trans. Graph. 25, 787–794 (2006).

46. Levin, A., Weiss, Y., Durand, F. & Freeman, W. Understanding blind deconvolution algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 33, 2354–2367 (2011).

47. Fujita, K. et al. Direct phase-sensitive identification of a d-form factor density wave in underdoped cuprates. Proc. Natl Acad. Sci. USA 111, E3026–E3032 (2014).

48. Allan, M. P. et al. Anisotropic impurity states, quasiparticle scattering and nematic transport in underdoped Ca(Fe1−xCox)2 As2. Nat. Phys. 9, 220–224 (2013).

49. Cai, P. et al. Visualizing the microscopic coexistence of spin density wave and superconductivity in underdoped NaFe1−xCoxAs. Nat. Commun. 4, 1596 (2013).

50. Hamidian, M. H. et al. Atomic-scale electronic structure of the cuprate d-symmetry form factor density wave state. Nat. Phys. 12, 150–156 (2016).

51. Knolle, J., Eremin, I., Akbari, A. & Moessner, R. Quasiparticle interference in the spin-density wave phase of iron-based superconductors. Phys. Rev. Lett. 104, 257001 (2010).

52. Wang, Y., Agterberg, D. F. & Chubukov, A. Coexistence of charge-density-wave and pair-density-wave orders in underdoped cuprates. Phys. Rev. Lett. 114, 197001 (2015).

53. Schattner, Y., Gerlach, M. H., Trebst, S. & Berg, E. Competing orders in a nearly antiferromagnetic metal. Phys. Rev. Lett. 117, 097002 (2016).

54. DallaTorre, E. G., He, Y. & Demler, E. Holographic maps of quasiparticle interference. Nat. Phys. 12, 1052–1056 (2016).

## Acknowledgements

We thank Ethan Rosenthal and Erick Andrade for help with STM data acquisition, and Andrew Millis and Rafael Fernandes for discussions. M.A.M. and I.M.E. are thankful to Sergey Borisenko for providing the tight-binding parametrization of the ARPES data in NaFe1−xCoxAs. This work is supported by the National Science Foundation Bigdata program (Grant number IIS-1546411). Support for STM equipment and operations is provided by the Air Force Office of Scientific Research (Grant number FA9550-16-1-0601). The work of I.M.E. was carried out with financial support from the Ministry of Science and Higher Education of the Russian Federation in the framework of Increase Competitiveness Program of NUST MISiS Grant No. K2-2017-085.

## Author information

Authors

### Contributions

S.C.C. and J.Y.S. applied the SBD algorithm to STM data. Y.L., Z.C., J.S., and Y.Z. developed the SBD algorithm. M.A.M. and I.M.E. performed theoretical calculations of QPI in NaFeAs. J.N.W. and A.N.P. advised.

### Corresponding authors

Correspondence to John N. Wright or Abhay N. Pasupathy.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Cheung, S.C., Shin, J.Y., Lau, Y. et al. Dictionary learning in Fourier-transform scanning tunneling spectroscopy. Nat Commun 11, 1081 (2020). https://doi.org/10.1038/s41467-020-14633-1

• Accepted:

• Published:

• DOI: https://doi.org/10.1038/s41467-020-14633-1

• ### Multi-atom quasiparticle scattering interference for superconductor energy-gap symmetry determination

• Rahul Sharma
• Andreas Kreisel
• Peter O. Sprau

npj Quantum Materials (2021)