Self-supervised machine learning pushes the sensitivity limit in label-free detection of single proteins below 10 kDa

Dahmardeh, Mahyar; Mirzaalian Dastjerdi, Houman; Mazal, Hisham; Köstler, Harald; Sandoghdar, Vahid

doi:10.1038/s41592-023-01778-2

Download PDF

Article
Open access
Published: 27 February 2023

Self-supervised machine learning pushes the sensitivity limit in label-free detection of single proteins below 10 kDa

Nature Methods volume 20, pages 442–447 (2023)Cite this article

10k Accesses
21 Citations
62 Altmetric
Metrics details

Subjects

Abstract

Interferometric scattering (iSCAT) microscopy is a label-free optical method capable of detecting single proteins, localizing their binding positions with nanometer precision, and measuring their mass. In the ideal case, iSCAT is limited by shot noise such that collection of more photons should extend its detection sensitivity to biomolecules of arbitrarily low mass. However, a number of technical noise sources combined with speckle-like background fluctuations have restricted the detection limit in iSCAT. Here, we show that an unsupervised machine learning isolation forest algorithm for anomaly detection pushes the mass sensitivity limit by a factor of 4 to below 10 kDa. We implement this scheme both with a user-defined feature matrix and a self-supervised FastDVDNet and validate our results with correlative fluorescence images recorded in total internal reflection mode. Our work opens the door to optical investigations of small traces of biomolecules and disease markers such as α-synuclein, chemokines and cytokines.

MiLoPYP: self-supervised molecular pattern mining and particle localization in situ

Article Open access 09 September 2024

ECLiPSE: a versatile classification technique for structural and morphological analysis of 2D and 3D single-molecule localization microscopy data

Article Open access 10 September 2024

Analysis of the Human Protein Atlas Weakly Supervised Single-Cell Classification competition

Article Open access 29 September 2022

Main

Analysis of nanometer-scale matter is of the utmost importance for a variety of biomedical investigations^1,2,3,4,5. During the last 100 years many clever techniques have been invented for characterization of macromolecules, for example, to resolve structure, map dynamics, assess chemical composition, and measure physical quantities such as size and weight. Methods based on nuclear magnetic resonance spectroscopy, electrophoresis, mass spectrometry, electron microscopy, fluorescence imaging and plasmon resonance spectroscopy have introduced decisive information, but each approach also has its limitations. Thus, new innovations are continuously sought to push the measurement boundaries. Optical methods are desirable in this quest because they can be non-invasive and compatible with real-time studies. Indeed, the optical cross-section of matter is intrinsically large enough to enable the detection of single molecules and proteins in direct extinction measurements^6,7,8, in which the incident field (or a fraction of it) interferes with the tiny amount of light that is coherently scattered by the nano-object of interest^9,10,11.

The interferometric signal that is generated by the scattered light (iSCAT) not only enables the detection and sensing of sub-wavelength nanoparticles such as single proteins, but it also provides information on the particle size⁸. Indeed, iSCAT measurements have recently been calibrated to determine protein mass^11,12, leading to a technology that is now also offered commercially (Refeyn Ltd., Oxford, UK). Given that the sensitivity of iSCAT is ultimately limited by shot noise^10,13,14, one could expect to detect an arbitrarily small amount of matter if only one could collect a sufficiently large number of photons. In practice, however, the dynamics of residual background fluctuations prevents one from reaching this ideal situation¹⁴, and hence proteins lighter than approximately 40 kDa have not been detected¹¹. In this work, we report on a substantial improvement in iSCAT detection sensitivity to the range of 9 kDa using machine learning approaches for anomaly detection¹⁵. We benchmark and validate our results by performing fluorescence detection in total internal reflection (TIRF) mode. More sophisticated analysis of the signal and background might enable the sensitivity limit to be extended even further in the near future.

The iSCAT signal

Figure 1a shows the iSCAT sensing set-up. A laser beam centered at a wavelength of 445 nm illuminates a sample that consists of an aqueous buffer on a microscope coverglass. A fraction of the incident light is reflected at this interface and is used as the reference in its interference with the scattered light from the nano-object under study^10,13. The detected optical power on the camera is

$${P}_{{{{\rm{d}}}}}\propto | {E}_{{{{\rm{r}}}}}{| }^{2}+| {E}_{{{{\rm{s}}}}}{| }^{2}+2| {E}_{{{{\rm{r}}}}}| | {E}_{{{{\rm{s}}}}}| \cos \phi ,$$

(1)

where E_r = rE_i, E_s = sE_i and E_i denote the electric fields of the reference, scattered and incident light fields, respectively, and ϕ signifies the phase difference between the latter two quantities. To add fluorescence imaging capabilities, a laser beam centered at a wavelength of 631 nm is used to illuminate the sample in TIRF mode through the same microscope objective. The fluorescence signal is filtered through a dichroic mirror and is imaged on a second camera. Figure 1b shows an example of the iSCAT image for 100 nm polymer beads bound to the coverglass. In Fig. 1c we show the TIRF image of the same beads, which contained fluorescent dyes.

The scattered field of a nanoparticle is proportional to the incident field via its polarizability (α) so that s ∝ α. For small nano-objects, the intensity of the scattered light (∣E_s∣²) becomes negligible compared with the other terms in equation (1). Hence, the iSCAT contrast (C) of a small particle can be formulated as

$$C=\frac{{P}_{{{{\rm{d}}}}}-{P}_{{{{\rm{r}}}}}}{{P}_{{{{\rm{r}}}}}} \sim 2\frac{| {E}_{{{{\rm{s}}}}}| }{| {E}_{{{{\rm{r}}}}}| }\cos \phi =2\frac{s}{r}\cos \phi \,,$$

(2)

where P_d and P_r refer to the detected and reference powers, respectively. Considering that α is proportional to the particle volume and assuming a constant density for protein matter, one can conclude that C is linearly proportional to the particle mass⁸. Thus, the iSCAT signal provides a measure for mass photometry^11,12.

The iSCAT image in Fig. 1b was recorded in one frame in a short exposure time of 20 μs. Visualizing single proteins, however, requires longer integration times and an elaborate analysis to account for the speckle-like background features that are caused by coherent scattering from slight imperfections of the sample surface¹⁴. In brief, this analysis exploits the temporal change of the signal as a protein lands on the sensor substrate to eliminate the static background of the sample by comparing each video frame with its neighbors. In practice, a series of careful steps establish an algorithm that performs a differential rolling average (DRA) of several hundred camera frames, followed by the application of various tools to identify the point spread function (PSF) of individual proteins and determine their iSCAT contrasts^8,12. It was found that the integration time for each protein event cannot be extended beyond a few seconds due to residual background dynamics. As a result, the detection sensitivity reaches a plateau at a molecular weight of approximately 40–50 kDa. The analysis procedure and its limitations are given in a recent publication¹⁴ as well as in Supplementary Information, Section 3.

Machine learning

Computer vision and machine learning methods have recently been used in microscopy applications with an emphasis on correcting the background or enhancing the signal. For background correction, conventional computer vision methods have been used, exploiting temporal and spatial information in two independent steps^16,17. In addition, scientists have applied supervised¹⁸ and unsupervised¹⁹ deep neural networks (DNNs) in machine learning. For example, supervised DNNs were used to extract spatiotemporal features in localization microscopy and particle tracking^20,21. Supervised algorithms are, however, limited in scope because they require knowledge of the ground truth, which in turn implies full knowledge of the signal and noise properties. An example for getting around this restriction has recently been reported, in which an unsupervised DNN based on FastDVDnet²² was used to denoise an image series²³.

In this work, we exploit self-supervised FastDVDnet in a different tailor-made scheme whereby we first denoise our images and then subtract the de-noised frame from the frame of interest to identify the PSFs of the rare landing proteins. Here, a frame t in the DRA video is analyzed by comparing it with its neighboring frames t − k and t + k with suitable stride k (Supplementary Table 1). Next, we classify the outcome using isolation forest (iForest)²⁴, which is an unsupervised algorithm in anomaly detection. Anomaly detection encompasses a general class of algorithms in which one first establishes a ‘normal’ signal and then identifies deviations or ‘anomalies’. The normal signal in our experiment is the residual background speckle image obtained by averaging over multiple frames immediately before and after the frames that contain a protein landing event. The output of iForest thus becomes a vector of true (anomalous) and false (normal) values for each pixel. iForest has been successfully applied to computer vision, signal processing and communication applications^15,25,26,27. We present a brief overview of various concepts relevant to our work in the Supplementary Information.

To gain more direct insight into the underlying physical criteria in anomaly detection, we also explore a user-defined approach in which one chooses a set of temporal (mean, standard deviation and so on) and spatial features (for example, PSF-like figures) that are evaluated for a certain pixel range in each image frame (Supplementary Figs. 3–8). A given frame t is then re-shaped for each feature into a one-dimensional vector with elements representing the pixel values of that frame. Next, a feature matrix is composed of the one-dimensional vectors that are produced from the aforementioned frames (Supplementary Figs. 8a, and 9a), and the resulting feature matrix is fed to iForest for classification. We note that the initial choice of the user-defined features is based on physical considerations such as the PSF size, typical DRA window and camera frame rate, which mostly depend on the optical system and not on the protein under study. In other words, the user can explore various options, which can be validated on independent data, for example, on larger proteins. Furthermore, the usefulness of the chosen features can be assessed in simulations of synthetic data. Nevertheless, the success and efficiency of the user-defined feature matrix depend on the aptitude and judgment of the user. Hence, we rely on the DNN for our final conclusions given that it does not require critical input from the user.

Results

Before we apply our analysis to the detection of very small proteins, we investigate its performance on a bovine serum albumin (BSA) sample, which, with a molecular mass of approximately 66 kDa, is one of the smallest proteins that can be detected with existing techniques. In Fig. 2a we show an example of the raw image recorded on the iSCAT camera, and Fig. 2b shows a single protein from that measurement after a typical DRA analysis on 1,500 neighboring frames. Although a small protein is successfully detected, the image also shows background fluctuations that are not fully eliminated by the existing algorithm, possibly due to various electronic, mechanical or fluidic sources of noise (Supplementary Information, Section 1 and ref. ¹⁴).

**Fig. 2: Benchmarking methods for BSA (66 kDa).**

For our current discussion, it suffices to consider the residual signal fluctuations as ‘noise’ in the recognition of the particle contrast, which acts as the ‘signal’. Thus, the problem can be reduced to the challenge of deciphering image attributes at a given signal-to-noise ratio (SNR). In our set-up, proteins with a molecular mass of 40 kDa, which is the lowest that has been reported in the literature¹¹, have an SNR of ~3, whereby the noise level is defined as the root mean square (RMS) of the residual background fluctuations. Here, it is important to note that the resulting speckle noise is not white because the spatial variations of the background are governed by the same instrument response function that determines the system PSF. This structured background makes it particularly difficult to identify the signal¹⁸. In this work, we show that the application of machine learning algorithms enables us to detect proteins as small as 9 kDa, corresponding to an SNR of ~1.4 in our set-up.

To improve the robustness of the results, we labeled the proteins under study with ATTO 647 dye molecules with a negligible molecular weight of approximately 0.7 kDa and negligible extinction coefficient at the iSCAT illumination wavelength, so that we can monitor them via the accompanying TIRF detection (Fig. 2c). To check the purity of the protein samples after labeling, we ran a gel electrophoresis (sodium dodecylsulfate–polyacrylamide gel electrophoresis) (Supplementary Information, Section 2). We note that variations in the number of fluorophores per protein do not disturb the study because we aim to identify only the protein. We found that co-illumination of the red and blue laser beams led to fast photobleaching, preventing us from performing simultaneous iSCAT and TIRF measurements (Supplementary Information, Section 10). We, thus, interlaced the two recording periods with typical repetition cycles of 30 s.

We now examine the same measurements using anomaly detection. Figure 2d presents the location of the resulting anomalous pixels for all frames that were used to detect the protein under discussion by using a user-defined feature matrix (Supplementary Fig. 8). To suppress false-positive events, we apply a morphological operation to eliminate unconnected anomalous pixels in each frame. In the case of the data in Fig. 2d, the morphological operation considered anomalous pixels that were accompanied by at least one more neighboring pixel (Supplementary Table 1). Next, the image in Fig. 2d is convolved with a Gaussian function that fits our experimental PSF, corresponding to half-width at half maximum of 2.5 pixels. We then implement a binary mask with a radius of 5 pixels about the center of mass of the resulting distribution to restrict a detection region for one landing event (Fig. 2e). In other words, two detection events are counted as such only if their binary masks do not overlap. A comparison with the conventional DRA and TIRF measurements (Fig. 2b,c) shows very good agreement with the outcome of anomaly detection based on user-defined criteria.

Figure 2f shows the result of anomaly detection based on the DNN approach for the protein landing event of Fig. 2b. It can be seen that as opposed to the user-defined scenario in Fig. 2d, the DNN approach can effectively isolate the entirety of a PSF in each frame, significantly increasing the detection yield. We note that to eliminate artifacts near the borders and corners of a frame, we considered only the data inside a circular mask of radius R = 33 pixels.

Having established the principle of our new methodology, we now showcase its performance by measuring proteins not previously detectable. Figure 3a–c shows examples of three TIRF images, which confirm the presence of proteins with a molecular mass of 21, 18 and 9 kDa, respectively. In Fig. 3d–f we present the corresponding DRA-treated images. To guide the eye, we placed circles at the locations of protein landing events as determined from the centers of the PSFs in their corresponding TIRF images. Distinguishing the protein PSF from the speckle-like background appears not to be within reach in any of the cases. Remarkably, however, the data in Fig. 3g–i show that anomaly detection based on user-defined features can identify the protein landing events. The success of this procedure can be traced to the fact that by combining temporal and spatial features in the feature matrix, the algorithm imposes simultaneous temporal and spatial restrictions that distinguish true landing events from other uncorrelated temporal and spatial fluctuations (Supplementary Information, Section 5). Figure 3j–l shows the probability maps of the events obtained from an unsupervised DNN analysis, and Fig. 3m–o plots the corresponding outcome of iForest classification. Both the user-defined and the DNN approaches succeed in detecting the protein events in the data presented in Fig. 3. The advantage of the latter method is, however, that it does not rely on optimal choices in the feature matrix. We compare the performances of the two methods in more detail in the Supplementary Information.

**Fig. 3: Detection of very small proteins with molecular mass of 21 kDa (left column), 18 kDa (middle column) and 9 kDa (right column).**

To elucidate the advantage of the DNN further, we synthetically lowered the SNR of the landing event discussed in Fig. 2 by reducing the DRA window size. Figure 4a,b shows the outcome of two DRA averaging window sizes of 750 and 250 frames, respectively. As shown in Fig. 4d, the user-defined approach is not able to detect the protein with the same feature matrix criteria as before. Figure 4e,f, however, shows that the DNN approach remains successful.

**Fig. 4: User-defined versus DNN performance at different SNR.**

We have presented several cases in which iSCAT detection of protein landing events was confirmed by TIRF images. The modulation of the iSCAT contrast in the speckle-like background, however, may cause false-positive events or mask a true event. Similarly, landing events might be absent in the TIRF channel, for example, due to photobleaching or imperfect labeling. Consequently, the yield in obtaining a one-to-one correspondence between the TIRF and the iSCAT data is low in our interlaced measurements (Supplementary Information, Section 10). One such example is shown in Fig. 3g,j,m, in which anomaly detection detects two proteins while TIRF finds only one of them. Figure 5a shows another example of several events captured in the iSCAT and TIRF channels recorded within 20 s. In Fig. 5b we show the coincidence map of the two signals obtained by constructing the pixel-wise product of the localized events. We note, however, that the average rate of landing events was comparable in the iSCAT and TIRF channels with 0.2–2 proteins per second and 1–5 proteins per second, respectively, showing that we do not over-count in the iSCAT channel. Furthermore, by performing simulations, we estimated the false-positive signals in our algorithms to be less than approximately 10% (Supplementary Fig. 11). In practice, one can choose to apply more stringent morphological operations on the DNN output to reduce the false-positive events at the cost of the detection yield (Supplementary Table 3). For instance, we included only events with at least three connected pixels in each frame for the 9 kDa data to minimize the chances of counting unwanted events (Supplementary Table 1).

**Fig. 5: Comparison of the coincidence yields between iSCAT and TIRF.**

Protein mass photometry

The task of the anomaly detection algorithms (user-defined or DNN) discussed above is to identify protein landing events. Once the PSFs of individual particles have been localized, their iSCAT contrasts can be extracted as in previous reports to arrive at their mass information¹⁴ (Supplementary Fig. 3). In brief, the hot region identified by anomaly detection is searched using difference of Gaussian to localize the PSF of the protein (Supplementary Fig. 8g). We then extract the temporal value of the localized PSF center intensity directly from DRA to form a V-shaped landing trajectory (Supplementary Fig. 8c). The sides of the V-shaped trace are fitted with two lines and the intersect is used to assign the base line, which is then used to determine a contrast¹⁴.

The blue histograms in Fig. 5c–e show the distribution of the iSCAT contrasts obtained from 21, 18 and 9 kDa protein samples, respectively, following the full DNN-based anomaly detection algorithm. In addition, the orange histograms in Fig. 5c–e show the spread of the contrasts obtained for the iSCAT events that coincided with an event detected in the TIRF channel. We find that although the yield is lower for coincidences, the main modes of the histograms are very well aligned. We note that the distribution towards higher contrasts can be attributed to small populations of oligomeric states of the protein, protein aggregates, or sample impurities¹². The Gaussian mixture model²⁸ was used to identify the underlying subpopulations^14,29.

The contrast of the main histogram mode was estimated using maximum likelihood estimation, analogous to the procedure in localization microscopy³⁰. We then used bootstrapping to estimate the confidence interval in this assignment (sampling cycles >1,000). The deduced contrast can be related to mass if one assumes a common density and refractive index for proteins^8,12,14. Because the parameters r and s (Eq. (1)) can vary between individual iSCAT set-ups, one needs to establish a calibration ladder, much in the spirit of the read-out procedure in gel electrophoresis. Figure 6a presents such a library, which contains the data from protein samples with nominal molecular mass of 220, 66, 21, 18 and 9 kDa. The error bars represent the precision in each assignment, and the line shows the result of a linear fit to the data.

In Fig. 6b we plot the accuracy (in units of kDa), determined as the difference between the measured mean value and the quantity suggested by the fit. Figure 6c presents the percentage precision. It is evident that both accuracy and precision become less robust for the smallest protein size. We also note a slight offset at the intercept of the linear fit on the vertical axis. We attribute this to the fact that the background fluctuations cannot be fully eliminated, thus, affecting the base line and the contrast value¹⁴. Nevertheless, the linear model in Fig. 6a has an RMS deviation of 1.0 × 10⁻⁵, which is one of the lowest values reported for such protein libraries^12,29.

Discussion and outlook

In 2014 iSCAT was successful in the label-free detection of single 500 kDa (myosin 5a)³¹ and 66 kDa (BSA)⁸ proteins. Since then, the sensitivity limit has been somewhat improved to 55 kDa¹² and approximately 40 kDa¹¹, whereby the application of a spatial mask in the Fourier plane was considered to be instrumental for favoring the scattered signal^12,32,33. In our current work, we use an anomaly detection machine learning algorithm to substantially push the sensitivity limit to proteins as small as 9 kDa. Moreover, we achieve this without using a spatial mask.

Label-free and real-time analysis of small proteins is very promising for ultrasensitive diagnostics of disease markers such as interleukins or other cytokines in bodily fluids³⁴. In addition, a range of fundamental studies such as assembly of biological nanostructures³⁵, cell secretion^36,37 and protein aggregation³⁸ would greatly benefit from this methodology. iSCAT detection of biomolecules can be further advanced through improvements in physical measurements, for example, by using CMOS (complementary metal oxide semiconductor) cameras with larger well capacity and lower dark noise, or using a higher quality substrate surface to lower the iSCAT background. The methodology presented in this work also holds promise for efforts in cryogenic electron microscopy and fluorescence microscopy with low SNR. As machine learning approaches become more established in microscopy^{18,19,20,21,23,39}, one can expect further advances in the computational analysis of label-free sensing. A first measure, for example, could involve replacing iForest with an end-to-end DNN⁴⁰.

Methods

Protein sample preparation and labeling

All proteins used in this study are commercially available in a highly pure quality. Human plasma fibronectin (220 kDa) was purchased from Sigma Aldrich (cat. no. FC010). UltraPure BSA was purchased from Life Technologies (cat. no. AM2616). The structure of BSA corresponds to 66 kDa. The product used in this study was specified by the manufacturer at 67–68 kDa. Recombinant protein G (21 kDa) was purchased from Fisher Scientific (cat. no. 21193). Recombinant Escherichia coli Skp protein (18 kDa) and recombinant human interleukin (IL)-8 protein (9 kDa) were purchased from Abcam (cat. nos. ab97397 and ab9631, respectively). Proteins were diluted or buffer exchanged (desalted) into labeling buffer containing 50 mM HEPES and 25 mM KCl (pH 7.8), prior to the labeling reaction, using a 7K MWCO (molecular weight cut-off) Zeba desalting column (ThermoFisher, cat. no. 89882). Proteins were unspecifically labeled via their exposed primary amines using the ATTO 647 fluorophore containing the reactive group NHS (N-hydroxysuccinimidyl) ester (cat. no. 18373-1MG-F, Sigma Aldrich). Proteins were mixed with dyes at a ratio of 1:1 for 2 h at room temperature, and then desalted from the excess of dye using the same desalting columns. Proteins were further filtered using a 100 nm syringe filter (Whatmann Anotop 10, cat. no. WHA68091002, Sigma Aldrich). The labeling efficiency was then estimated using an absorption spectrometer (Nanodrop 2000, ThermoFisher). The labeling efficiency ranged between 40% and 80% for different protein samples. SDS–PAGE was used to assess protein purity, labeling and the approximate molecular weight (Supplementary Information). Based on the manufacturer information, most of these proteins are found in their monomeric states. In the case of Skp protein it can form a trimer assembly, however at the concentration of our measurements (~10 nM) it is mainly in the monomeric state⁴¹. To establish the protein ladder we read the contrast for the main (lowest) mode of the iSCAT histogram. We note that if proteins do form large assemblies, their larger iSCAT contrasts become noticeable in our experiments.

Coverglass functionalization

To prepare the surface of the coverglass for protein binding, it was sonicated in isopropyl alcohol and ethanol for 5 min each, followed by 10 min of oxygen plasma. The sample was then mounted and left to stabilize for a few hours.

Protein injection and data acquisition

Each labeled protein sample was diluted down to approximately 10 nM in concentration, and 10 μl was manually injected by micropipetting on top of the iSCAT field of view. This is then immediately followed by starting the iSCAT camera data acquisition, which triggers the blue iSCAT laser. After approximately 20 s of data acquisition, the blue laser is switched off and the red laser (TIRF channel) is switched on for 10 s. This is then followed by several cycles of interlaced iSCAT and TIRF data acquisition, to reach a satisfactory data volume for meaningful statistics. Depending on the protein size, the iSCAT camera was set to run at 5–15 kHz at an exposure time of 20 μs.

Optical set-up

A continuous-wave laser centered at 445 nm (iBeam smart, Toptica) is collimated and focused onto the back focal plane of an oil-immersion microscope objective (α Plan-Apochromat ×100, NA 1.46, Zeiss). A coverglass is positioned at the focus of the microscope objective using a piezo positioner (Nano-LPQ, Mad City Labs). The iSCAT field is imaged using a scientific CMOS camera (MV1-D1024E-160-CL, Photonfocus).

TIRF illumination was done with a laser beam at 631 nm, which was directed into the iSCAT pathway via a dichroic mirror (D1, Chroma ZT647rdc-UF3) mounted on a translation stage and a second dichroic mirror (D2, Chroma T480spxxr-UF3). The fluorescence signal was collected via the same microscope objective that was used for the iSCAT measurements. D2 separated the fluorescence from the iSCAT path and transmitted it through D1 onto a CCD (charge-coupled device) camera (Hamamatsu Orca Flash). Here, we also used a band pass filter (ET700/75) in front of the camera (S1).

Statistics and reproducibility

Single-protein sensitivity is achieved only when thousands of frames are averaged in the analysis procedure described here. Each detection event in Figs. 2–4 is by definition a single-molecule event and as such is not reproducible. However, in a given video containing millions of frames, hundreds of single-protein events are registered, which are nominally equivalent. The histograms in Fig. 5c–e are formed by considering all such individual recordings. The data points in Fig. 6a are read from such histograms.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data are available upon reasonable request. Source data are provided with this paper.

Code availability

The scripts reported in this paper have been deposited at PiSCAT⁴²: https://github.com/SandoghdarLab.

References

Hong, M., Zhang, Y. & Hu, F. Membrane protein structure and dynamics from NMR spectroscopy. Annu. Rev. Phys. Chem. 63, 1–24 (2012).
Article CAS PubMed Google Scholar
Zhu, Z., Lu, J. J. & Liu, S. Protein separation by capillary gel electrophoresis: a review. Anal. Chim. Acta 709, 21–31 (2012).
Article CAS PubMed Google Scholar
Bai, X.-C., McMullan, G. & Scheres, S. H. How cryo-EM is revolutionizing structural biology. Trends Biochem. Sci. 40, 49–57 (2015).
Article CAS PubMed Google Scholar
Sauer, M. & Heilemann, M. Single-molecule localization microscopy in eukaryotes. Chem. Rev. 117, 7478–7509 (2017).
Article CAS PubMed Google Scholar
Kaushik, A. Advances in nanosensors for biological and environmental analysis: book review. Biosensors 9, 101 (2019).
Article PubMed Central Google Scholar
Sandoghdar, V. Nano-optics in 2020 ± 20. Nano Lett. 20, 4721–4723 (2020).
Article CAS PubMed Google Scholar
Kukura, P., Celebrano, M., Renn, A. & Sandoghdar, V. Single-molecule sensitivity in optical absorption at room temperature. J. Phys. Chem. Lett. 1, 3323–3327 (2010).
Article CAS Google Scholar
Piliarik, M. & Sandoghdar, V. Direct optical sensing of single unlabelled proteins and super-resolution imaging of their binding sites. Nat. Commun. 5, 4495 (2014).
Article CAS PubMed Google Scholar
Lindfors, K., Kalkbrenner, T., Stoller, P. & Sandoghdar, V. Detection and spectroscopy of gold nanoparticles using supercontinuum white light confocal microscopy. Phys. Rev. Lett. 93, 037401 (2004).
Article CAS PubMed Google Scholar
Taylor, R. W. & Sandoghdar, V. Interferometric scattering microscopy: seeing single nanoparticles and molecules via Rayleigh scattering. Nano Lett. 19, 4827–4835 (2019).
Article CAS PubMed PubMed Central Google Scholar
Priest, L., Peters, J. S. & Kukura, P. Scattering-based light microscopy: from metal nanoparticles to single proteins. Chem. Rev. 121, 11937–11970 (2021).
Article CAS PubMed PubMed Central Google Scholar
Young, G. et al. Quantitative mass imaging of single biological macromolecules. Science 360, 423–427 (2018).
Article CAS PubMed PubMed Central Google Scholar
Taylor, R. W. & Sandoghdar, V. in Label-Free Super-Resolution Microscopy (ed. Astratov, V.) Ch. 2 (Springer International Publishing, 2019).
Dastjerdi, H. M. et al. Optimized analysis for sensitive detection and analysis of single proteins via interferometric scattering microscopy. J. Phys. D. Appl. Phys. 55, 054002 (2021).
Article Google Scholar
Pang, G., Shen, C., Cao, L. & Hengel, A. V. D. Deep learning for anomaly detection: a review. ACM Comput. Surv. 54, 38 (2021).
Google Scholar
Cheng, C.-Y. & Hsieh, C.-L. Background estimation and correction for high-precision localization microscopy. ACS Photonics 4, 1730–1739 (2017).
Article CAS Google Scholar
Spindler, S., Sibold, J., Gholami Mahmoodabadi, R., Steinem, C. & Sandoghdar, V. High-speed microscopy of diffusion in pore-spanning lipid membranes. Nano Lett. 18, 5262–5271 (2018).
Article CAS PubMed Google Scholar
Möckl, L., Roy, A. R., Petrov, P. N. & Moerner, W. E. Accurate and rapid background estimation in single-molecule localization microscopy using the deep neural network BGnet. Proc. Natl Acad. Sci. USA 117, 60–67 (2019).
Article PubMed PubMed Central Google Scholar
Wang, F., Henninen, T. R., Keller, D. & Erni, R. Noise2atom: unsupervised denoising for scanning transmission electron microscopy images. Appl. Microsc. 50, 23 (2020).
Article PubMed PubMed Central Google Scholar
Speiser, A. et al. Deep learning enables fast and dense single-molecule localization with high accuracy. Nat. Methods 18, 1082–1090 (2021).
Article CAS PubMed PubMed Central Google Scholar
Špačková, B. et al. Label-free nanofluidic scattering microscopy of size and mass of single diffusing molecules and nanoparticles. Nat. Methods 19, 751–758 (2022).
Article PubMed PubMed Central Google Scholar
Tassano, M., Delon, J. & Veit, T. Fastdvdnet: towards real-time deep video denoising without flow estimation. In Proc. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1354–1363 (IEEE, 2020).
Sheth, D. Y. et al. Unsupervised deep video denoising. In Proc. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 1759–1768 (IEEE, 2021).
Ting, K., Liu, F. & Zhou, Z. Isolation forest. In Proc. 8th IEEE International Conference on Data Mining (ICDM), 413–422 (IEEE, 2008).
Midtvedt, B. et al. Quantitative digital microscopy with deep learning. Appl. Phys. Rev. 8, 011310 (2021).
Article CAS Google Scholar
Xu, Y., Wu, T., Gao, F., Charlton, J. R. & Bennett, K. M. Improved small blob detection in 3D images using jointly constrained deep learning and Hessian analysis. Sci. Rep. 10, 326 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chandola, V., Banerjee, A. & Kumar, V. Anomaly detection: a survey. ACM Comput. Surv. 41, 15 (2009).
Article Google Scholar
McLachlan, G. J. & Peel, D. Finite Mixture Models (John Wiley & Sons, 2004).
Sonn-Segev, A. et al. Quantifying the heterogeneity of macromolecular machines by mass photometry. Nat. Commun. 11, 1772 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mortensen, K. I., Churchman, L. S., Spudich, J. A. & Flyvbjerg, H. Optimized localization analysis for single-molecule tracking and super-resolution microscopy. Nat. Methods 7, 377–381 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ortega Arroyo, J. et al. Label-free, all-optical detection, imaging, and tracking of a single protein. Nano Lett. 14, 2065–2070 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cole, D., Young, G., Weigel, A., Sebesta, A. & Kukura, P. Label-free single-molecule imaging with numerical-aperture-shaped interferometric scattering microscopy. ACS Photonics 4, 211–216 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liebel, M., Hugall, J. T. & van Hulst, N. F. Ultrasensitive label-free nanosensing and high-speed tracking of single proteins. Nano Lett. 17, 1277–1281 (2017).
Article CAS PubMed Google Scholar
Seruga, B., Zhang, H., Bernstein, L. J. & Tannock, I. F. Cytokines and their relationship to the symptoms and outcome of cancer. Nat. Rev. Cancer. 8, 887–899 (2008).
Article CAS PubMed Google Scholar
Garmann, R. F., Goldfain, A. M. & Manoharan, V. N. Measurements of the self-assembly kinetics of individual viral capsids around their RNA genome. Proc. Natl Acad. Sci. USA 116, 22485–22490 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, M. & Schekman, R. Unconventional secretion, unconventional solutions. Science 340, 559–561 (2013).
Article CAS PubMed Google Scholar
McDonald, M. P. et al. Visualizing single-cell secretion dynamics with single-protein sensitivity. Nano Lett. 18, 513–519 (2018).
Article CAS PubMed Google Scholar
Ross, C. A. & Poirier, M. A. Protein aggregation and neurodegenerative disease. Nat. Med. 10 (Suppl.), S10–S17 (2004).
Li, Y., Xue, Y. & Tian, L. Deep speckle correlation: a deep learning approach toward scalable imaging through scattering media. Optica 5, 1181–1190 (2018).
Article Google Scholar
Wang, D., Wang, X. & Lv, S. An overview of end-to-end automatic speech recognition. Symmetry 11, 1018 (2019).
Article Google Scholar
Pan, S., Yang, C. & Zhao, X. S. Affinity of skp to OmpC revealed by single-molecule detection. Sci. Rep. 10, 14871 (2020).
Article CAS PubMed PubMed Central Google Scholar
Dastjerdi, H. M., Mahmoodabadi, R. G., Bär, M., Sandoghdar, V. & Köstler, H. PiSCAT: a Python package for interferometric scattering microscopy. J. Open Source Softw. 7, 4024 (2022).
Article Google Scholar

Download references

Acknowledgements

The authors thank A. Shkarin for help with the iSCAT camera acquisition software (pyLabLib Cam-control), M. Schwab for the mechanical components and T. Utikal for technical support. The authors also thank F. Wieser, H. Mirzaalian, M. Blessing, J. Zirkelbach, K. König and K. Kasaian for providing valuable comments on the manuscript. V.S. and H.M.D. acknowledge fruitful discussions with R. Gholami Mahmoodabadi. This work was supported by the Max Planck Society.

Funding

This work was supported by the Max Planck Society. Open access funding provided by the Max Planck Society.

Author information

These authors contributed equally: Mahyar Dahmardeh, Houman Mirzaalian Dastjerdi.

Authors and Affiliations

Max Planck Institute for the Science of Light, Erlangen, Germany
Mahyar Dahmardeh, Houman Mirzaalian Dastjerdi, Hisham Mazal & Vahid Sandoghdar
Max-Planck-Zentrum für Physik und Medizin, Erlangen, Germany
Mahyar Dahmardeh, Houman Mirzaalian Dastjerdi, Hisham Mazal & Vahid Sandoghdar
Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Houman Mirzaalian Dastjerdi & Harald Köstler
Erlangen National High Performance Computing Center (NHR@FAU), Erlangen, Germany
Harald Köstler
Department of Physics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Vahid Sandoghdar

Authors

Mahyar Dahmardeh
View author publications
You can also search for this author in PubMed Google Scholar
Houman Mirzaalian Dastjerdi
View author publications
You can also search for this author in PubMed Google Scholar
Hisham Mazal
View author publications
You can also search for this author in PubMed Google Scholar
Harald Köstler
View author publications
You can also search for this author in PubMed Google Scholar
Vahid Sandoghdar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.D. built the iSCAT set-up and conducted all measurements. H.M. prepared and labeled the protein samples. M.D. and H.M. augmented the fluorescence measurement. H.M.D. constructed the analysis pipeline. M.D. and H.M.D. analyzed the data. V.S. and H.K. supervised the project. V.S., M.D. and H.M.D. wrote the paper.

Corresponding author

Correspondence to Vahid Sandoghdar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information:

Nature Methods thanks Nikolas Hundt, Giovanni Volpe, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Rita Strack, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–19, Supplementary Tables 1–4.

Reporting Summary

Source data

Source Data Fig. 5c

iSCAT contrast of binding events for a protein mass of 21kDa detected only via iSCAT (group 1) and also in TIRF (group 2) as one zip file including a *.mat data file with Matlab script to plot the histograms.

Source Data Fig. 5d

iSCAT contrast of binding events for a protein mass of 18kDa detected only via iSCAT (group 1) and also in TIRF (group 2) as one zip file including a *.mat data file with Matlab script to plot the histograms.

Source Data Fig. 5e

iSCAT contrast of binding events for a protein mass of 9kDa detected only via iSCAT (group 1) and also in TIRF (group 2) as one zip file including a *.mat data file with Matlab script to plot the histograms.

Source Data Fig. 6a

A MATLAB script which plots the iSCAT contrast of different protein samples vs. the molecular mass in units of kDa.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dahmardeh, M., Mirzaalian Dastjerdi, H., Mazal, H. et al. Self-supervised machine learning pushes the sensitivity limit in label-free detection of single proteins below 10 kDa. Nat Methods 20, 442–447 (2023). https://doi.org/10.1038/s41592-023-01778-2

Download citation

Received: 20 May 2022
Accepted: 06 January 2023
Published: 27 February 2023
Issue Date: March 2023
DOI: https://doi.org/10.1038/s41592-023-01778-2

This article is cited by

Single-molecule FRET for probing nanoscale biomolecular dynamics
- Daniel Nettels
- Nicola Galvanetto
- Benjamin Schuler
Nature Reviews Physics (2024)
Label-free detection and profiling of individual solution-phase molecules
- Lisa-Maria Needham
- Carlos Saavedra
- Randall H. Goldsmith
Nature (2024)
Molecular fingerprinting of biological nanoparticles with a label-free optofluidic platform
- Alexia Stollmann
- Jose Garcia-Guirado
- Romain Quidant
Nature Communications (2024)
The cyanobacterial protein VIPP1 forms ESCRT-III-like structures on lipid bilayers
- Sichen Pan
- Karin Gries
- Simon Scheuring
Nature Structural & Molecular Biology (2024)
Spatial redundancy transformer for self-supervised fluorescence image denoising
- Xinyang Li
- Xiaowan Hu
- Qionghai Dai
Nature Computational Science (2023)