Analyzing protein dynamics from fluorescence intensity traces using unsupervised deep learning network

Yuan, Jinghe; Zhao, Rong; Xu, Jiachao; Cheng, Ming; Qin, Zidi; Kou, Xiaolong; Fang, Xiaohong

doi:10.1038/s42003-020-01389-z

Download PDF

Article
Open access
Published: 12 November 2020

Analyzing protein dynamics from fluorescence intensity traces using unsupervised deep learning network

Jinghe Yuan ORCID: orcid.org/0000-0001-6456-7696¹,
Rong Zhao²,
Jiachao Xu¹,
Ming Cheng¹,
Zidi Qin³,
Xiaolong Kou¹ &
…
Xiaohong Fang^1,3

Communications Biology volume 3, Article number: 669 (2020) Cite this article

2039 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We propose an unsupervised deep learning network to analyze the dynamics of membrane proteins from the fluorescence intensity traces. This system was trained in an unsupervised manner with the raw experimental time traces and synthesized ones, so neither predefined state number nor pre-labelling were required. With the bidirectional Long Short-Term Memory (biLSTM) networks as the hidden layers, both the past and future context can be used fully to improve the prediction results and can even extract information from the noise distribution. The method was validated with the synthetic dataset and the experimental dataset of monomeric fluorophore Cy5, and then applied to extract the membrane protein interaction dynamics from experimental data successfully.

Differences in ligand-induced protein dynamics extracted from an unsupervised deep learning approach correlate with protein–ligand binding affinities

Article Open access 19 May 2022

Deep-LASI: deep-learning assisted, single-molecule imaging analysis of multi-color DNA origami structures

Article Open access 17 October 2023

Accurate Prediction of Protein Structural Flexibility by Deep Learning Integrating Intricate Atomic Structures and Cryo-EM Density Information

Article Open access 02 July 2024

Introduction

Many membrane proteins form multimeric complexes before they carry out diverse essential cellular functions¹. Investigating their organization and dynamics is very essential for understanding these functions². Photobleaching event counting, based on the irreversible and stochastic loss of fluorescence, is a powerful tool for the stoichiometry analysis of protein complexes composed of many subunits². By measuring the fluorophore photobleaching process, the information on the subunit number in a complex and the photobleaching step location can be obtained³. However, photobleaching event counting is not able to extract the dynamic information of membrane protein interactions, such as durations of protein association, and transition rates of proteins between different aggregation states, etc. Yet, these dynamic properties are unarguably of most interest, as they underlie protein function within cells⁴. By analyzing diffusion states, these dynamics can be extracted from the diffusion tracks, and diffusion state analysis is becoming a particularly interesting tool^5,6,7,8. Actually, the fluorescence intensity traces can be used not only to the photobleaching event counting, but also to the protein dynamic extracting by monitoring the fluorescence intensity changes (step-jump). The main difference of dynamic finding from photobleaching event counting is that the dynamic finding is to track the processes of molecule association and dissociation, which is presented as the molecule diffusion state changes or fluorescence intensity jumps.

No matter for photobleaching event counting or for dynamic finding, extracting the step-like information embedded in heavy noise from fluorescence intensity traces is still a challenging task^9,10. When the noise amplitude is larger than the step-drop, it is very difficult to discern between step and noise. Moreover, even with higher signal-noise-ratios, filtering out the fluorophore blinking driven by core fluorophore instabilities from the true steps is also a big obstacle.

For photobleaching event counting, many methods have been developed to find the drop events from the fluorescence intensity traces. The simplest way may be to use filters such as Chung and Kennedy’s nonlinear filter¹¹, or Haar wavelet-based filter^12,13 to reduce noise and detect the drops by observing. To improve the convenience and accuracy, more automatic approaches, including data treated with wavelet transformation^14,15, multiscale products analysis¹⁶, running t-test algorithm¹⁷, step-fitting based on χ² distribution¹⁸, hidden Markov models (HMM)^19,20 and other algorithms have been developed to analyze single-molecule data. Among these methods, HMM have been widely used because the discrete transitions between states can be considered as memoryless (Markovian) processes and the time context is taken into account²⁰. However, all these approaches above need initial selecting parameters, which can be very subjective even unreasonable. Furthermore, in HMM methods, the memoryless hypothesis may be not practical. Even with these mathematical aids, for complexes with >5 subunits, the distributions of bleaching steps for n and n + 1 subunits look similar, and detection of discrete steps becomes more difficult^21,22.

Deep-neural networks (DNNs) have achieved excellent success on various kinds of difficult learning tasks. We also developed a combining method of using convolutional and long short-term memory (LSTM) deep-learning neural network (CLDNN) for photobleaching event counting²³. The convolutional layers take charge of feature extraction of step-drop (photobleaching) events and LSTM recurrent layers distinguish between photobleaching and photoblinking. This CLDNN does not require user-specified parameters and gives very high accuracy. However, CLDNN still belongs to supervised-learning framework, which needs large labeled training sets of input-target pairs to train this network. In practice, it is often difficult even impossible to label the experimental data without subjectivity. In fact, extracting step-jump events (such as photobleaching event counting or dynamic finding) can be regarded as mapping of sequences to sequences, and LSTM alone is qualified to the two tasks of extracting jump information and distinguishing photobleaching from photoblinking. After getting rid of the convolutional layers, the sate paths (hence the drop location and the transition rates between different states) can be extracted as an extra benefit.

In this work, instead of the diffusion states analysis, with the fluorescence intensity traces, we propose an unsupervised-learning framework of discriminator-generator network (DGN) to analyze the aggregation states and association dynamics of membrane proteins²⁴. This framework can be used not only for photobleaching event counting, but also for the dynamic finding. This proposed DGN network consists of two bidirectional Long Short-Term Memory (biLSTM) networks²⁵, in which, a biLSTM as discriminator is used to map the input time trace of fluorophore intensity to a hidden state vector of a fixed dimensionality, and then another biLSTM as generator to recover the input time trace from this hidden state vector. The discriminator and generator of the proposed framework are jointly trained to maximize the conditional probability of predicting itself with the input time trace, and thus find the hidden state (fluorophore count sequence) behind the time trace.

This framework was trained in an unsupervised manner based only on raw time traces and without the need for any supervisor annotation. Unsupervised learning has two obvious benefits, one is that it avoids the prejudice of selecting training data, and another is that we need not to label the traces for training the nets, which is often difficult or even impossible.

LSTM does not suffer on very long-time trace²⁶, so can disregard the memoryless hypothesis in hidden Markovian model (HMM). And biLSTM can access both past and future context to improve the prediction results^25,27.

Our framework refers to three types of unsupervised-learning framework, auto-encoding variational Bayes (VAEs)²⁸, Generative Adversarial Nets (GAN)²⁹, and the translation frameworks^30,31. VAEs method needs to build the model density, it will not work in the case where we are dealing with distributions supported by low-dimensional manifolds³². It has two main differences from GAN as well. The first one is that the framework of GAN consists of a generative model and a discriminative model sequentially, which core goal is to train the generative model to generate some samples based on the probability distribution, however, our framework has an inverse structure and our main goal is to optimize the discriminator to find the hidden states. The second difference is that we use recurrent neural network (RNN) model LSTM to map the time sequences but GAN used multilayer perceptron in both the generative model and discriminative model. The main difference from translation frameworks is that we used sequential hidden states instead of the summary hidden states. To the best of our knowledge, this is the first time such a system has been demonstrated and we believe our work will provide a new paradigm for dynamic analysis of membrane proteins.

This proposed framework comes with many advantages relative to previous modeling frameworks. The first one is that no Markov hypothesis needed, which is essential for the HMM methods^33,34,35. The second one is that the unsupervised training results with neither pre-knowledge (such as step number) nor pre-labeling of experimental data are required²³. Remarkably, the third one is that the embedded LSTM units can not only reduce the interferences from all types of noise, but also even extract step information from the noise distribution, which is impossible for human identifier.

Results

Network architecture

We assume the fluorophore number as the hidden state, the changing process of emitting fluorophores as the hidden process, h, which is not necessary Markovian process. As in Fig. 1, the generative process is the process to generate the fluorescence intensity trace x under the hidden process h. Now our most concerned task is to extract the hidden process, h from the measured fluorescence intensity trace x, named discriminative process as in Fig. 1. Statistically, in the whole framework combining the discriminative process and generative process, it means to maximize the probability

$$P\left( x \right) = \mathop {\prod}\limits_{t = 1}^T {P\left( {\left. {x_t} \right|x_{t - 1}, \cdots ,x_1} \right)},$$

(1)

where t is the time notation and T is the sequence length.

**Fig. 1: The graphical model of combing the discriminator and generator.**

This can be done with our proposed framework of RNN discriminator-generator. We used a biLSTM layer and a softmax layer for the discriminator and a biLSTM layer and a linear layer for the generator.

As the discriminator reads each element of the input sequence x sequentially, the hidden state (output of the discriminator) changes according to equation (2).

$$h_t = f\left( {h_{t - 1},x_t} \right).$$

(2)

After reading the whole input sequence, we get the hidden state sequence of the same length.

The generator of the proposed model is trained to generate the output sequence $\left\{ {\widehat x_1,\widehat x_2, \cdots ,\widehat x_t, \cdots ,\widehat x_T} \right\}$ to approximate the input sequence {x₁, x₂,···x_t,···x_T} by predicting the element x_t with the given hidden sequence {h₁, h₂,···, h_t} and previously predicted sequence $\left\{ {\widehat x_1,\widehat x_2, \cdots ,\widehat x_{t - 1}} \right\}$. So $\widehat x_t$ is also conditioned on h_t and $\widehat x_{t - 1}$, Hence, the output of the generator at moment t is,

$$\widehat x_t = g\left( {h_t,\widehat x_{t - 1}} \right).$$

(3)

The discriminator and generator of the proposed framework are jointly trained in an unsupervised manner to minimize quadratic loss function

$$L = \frac{1}{{2T}}\mathop {\sum}\limits_{t = 1}^T {\left( {\widehat x_t - x_t} \right)^2}.$$

(4)

Once the training is completed, the discriminator can be independently used to estimate the hidden sequence (fluorophore number sequence), meanwhile the generator to generate time sequence under given the hidden sequence.

Training the network

Just as training GANs, training DGNs is delicate and unstable as well. Inspired by the DE-GAN³⁶, we extracted the mean fluorescence intensity of single-fluorescent molecule from the experimental datasets with gaussian mixture model method³⁷ (Supplementary Methods and Supplementary Fig. 1), and then synthesized datasets with this mean fluorescence intensity and taking into consideration of the three origins of noise, namely, the Poisson distributed shot noise, Gaussian distributed random noise, and fluorescence blinking (Fig. 2 and Supplementary Methods). To describe the noise level, as in our previous report²³, we adopted the average of the adjusted signal-to-noise ratio (aSNR)¹⁰ defined as

$$a{\mathrm{SNR}} = \frac{{2\left( {\mu _{i + 1} - \mu _i} \right)}}{{\sigma _i + \sigma _{i + 1}}},\,i = 1,2, \cdots N - 1,$$

(5)

where μ_i and σ_i are the mean and standard deviation (SD) of the i^th fluorescence state (a fluorescence state is the plateau of data points between two steps) and N is the fluorescence state number of the fluorescence time trace.

**Fig. 2: The training and performances of DGN on both photobleaching event counting and dynamic finding with the fluorescence intensity traces.**

To facilitate the convergence of DGN, we enhanced the loss function by adding a hidden-space loss function (cross-entropy loss function) for the synthesized datasets

$$L_h = \frac{1}{T}\mathop {\sum}\nolimits_{t = 1}^T {y_t{\mathrm{ln}}\left( {h_t} \right)},$$

(6)

where h_t is the output of the discriminator and y_t is the ground truth of the synthetic data, and amended the backpropagation error of the discriminator as

$$\delta _d = \left( {1 - \alpha } \right)\delta _g + \alpha \delta _h,$$

(7)

where δ_g is the backpropagation error of experimental datasets propagating from the generator to the output layer of discriminator, and δ_h is the backpropagation error of synthesized datasets generated at the output layer of discriminator. α is a weighted factor

$$\alpha = 0.5e^{ - \frac{{2k}}{K}},$$

(8)

where k is current training loop and K is the total training loops.

To train DGN by this improved strategy, the networks is forced to carry information of the synthesized datasets at the beginning, and more information of the experimental datasets is used gradually but guaranteeing the convergence in the direction trained with the synthesized datasets.

For both photobleaching event counting and dynamic finding with the fluorescence intensity traces, we built two datasets consist of a synthesized subset and an experimental subset used for training, validating and testing the proposed framework, respectively³⁸.

Datasets with different signal-to-noise values were synthesized (Supplementary Fig. 2). Then this synthesized dataset was randomly divided into three subsets again, namely, the training subset, validating subset and testing subset. The experimental subsets were collected from the experimental data of actual single-molecule movies. These experimental subsets were also divided into three subsets, namely, the training subset, validating subset and testing subset. Both the synthesized and experimental training subsets together is input to the nets for training and both the synthesized and experimental validating subsets together for validation. The hypothetical maximum state number is set as 10 for photobleaching counting and 5 for dynamic finding, respectively (Fig. 2(a)).

Both direction LSTM of the discriminator include 32 LSTM units. Its parameters were optimized by minimizing cross-entropy loss function using mini-batch gradient descent with batch size 8 to maintain a balance between the robustness of stochastic gradient descent and the efficiency of batch gradient descent. Weights are randomly initialized. To reduce the overfitting and guarantee convergence, the techniques known as L2 regularization³⁹ and weights thresholding⁴⁰ were used.

There are 16 LSTM units in both direction LSTM of the generator. Its parameters were optimized by minimizing mean square error loss function using mini-batch gradient descent with batch size 8. Weights are randomly initialized as well. The techniques of L2 regularization and weights thresholding were used too. The parameters presented in the network are detailed in Supplementary Table.1.

Model evaluation

We trained and validated this DGN networks as in Fig. 2(a). Supplementary Fig. 3 shows the loss curves for training and validation, respectively. After 60 epochs, the training and validation loss is 0.033 and 0.058, respectively, for photobleaching event counting, and after 34 epochs, the training and validation loss is 0.026 and 0.15, respectively, for dynamic finding. The validation loss is larger than the training loss and going down uniformly, indicating that the overfitting had not appeared.

We checked the classification ability of the discriminator with the synthesized testing subset. The normalized confusion matrixes of the hidden sequence demonstrate the predicted accuracies (Supplementary Fig. 4 for photobleaching event counting).

We then visualized the outputs of the discriminator with two examples of synthesized testing data (Fig. 3(a, b)), in which, Fig. 3(a) is for photobleaching counting and Fig. 3(b) for dynamic finding. In Fig. 3(a) and Fig. 3(b), the black curves are the synthesized fluorescence intensity traces, the green broken line is the corresponding hidden state paths and the purple broken line is the predicted hidden state paths. For the photobleaching counting (Fig. 3(a)), even for the hidden states with very short duration (such as the state 3 with 2 frames only), our discriminator can give accurate estimates, which is very difficult to do this for the HMM methods. Remarkably, this discriminator is even able to extract hidden state path from the noise distribution, for example, the fluorescence intensity traces between the state 8 and state 5 is slope-like, it is impossible to discern the state paths for human identifier and it is usually excluded with some filters (such as polynomial fitting in our CLDNN²³). For the dynamic finding (Fig. 3(b)), the discriminator gave almost perfect estimates of the hidden state paths. As expected, the LSTM layers distinguished the actual state paths from the blinking events.

**Fig. 3: Valuating the framework with both synthesized and experimental testing data.**

In the photobleaching data analysis (Fig. 2(b)), researchers also care about the predicting accuracy on statistic distribution of bleaching steps, which indicates the protein population. Herein, we tested 1000 synthesized samples (with aSNR 2.98 and equal percentage 10%) as in Fig. 3(c). We achieved a quite good estimate of the statistic distribution of bleaching steps. The maximum error appeared at the state 7 as about 0.99%.

Further, with the experimental data from monomeric fluorophore Cy5, we checked the DGN net. As expected, there are a vast majority (86.81 ± 0.38%) of monomeric Cy5 molecules bleached with single photobleaching step and only a small fraction (<14%) was bleached with two or more steps because of more Cy5 fluorophores locating within the region of diffraction limit (Fig. 3(d)). This distribution of bleaching step is similar to that of monomeric YFP immobilized on the glass⁴¹.

To check the generalizability of our model, for photobleaching counting, we tested other eight datasets (1000 samples for each dataset) with different SNR (aSNR is 0.67, 0.79, 1.04, 1.35, 1.69, 2.33, 2.98, 3.69, and 4.74, respectively). The black curve (marked as “DGN 10 States”) in Fig. 4(a) shows the accuracies under these aSNRs (the accuracy defined as the lowest accuracy of the 10 states as in Supplementary Fig. 4). Even at the noise level aSNR = 1.69, DGN obtained the accuracy of 71.4 ± 4.04 % yet. We also checked the accuracy values of first 5 states. The orange curve (marked as “DGN 5 States”) in Fig. 4(a) presents the accuracies under these aSNRs. Under the noise level aSNR = 3.69, we got the accuracy of 94.4 ± 0.31%, which was about the same signal to noise level in our single-molecule imaging data.

**Fig. 4: The predicting accuracy versus aSNRs.**

For dynamic finding (Fig. 2(b)), we also tested seven datasets (1000 samples for each dataset) with aSNR 0.68, 0.84, 1.23, 1.71, 2.44, 3.84, and 5.45, respectively. The blue curve (marked as “DGN 5 States DF”) in Fig. 4(a) shows the accuracies of the 5 states under these aSNRs. For example, at the noise level aSNR = 2.44, DGN obtained the accuracy of 74.0 ± 1.79 % yet.

Comparison with different algorithms

So far, we have not found any publications of dynamic finding based on fluorescence intensity traces, we focused on the comparison of different algorithms for photobleaching counting.

To compare DGN with other previous methods (such as HMM³, NoRSE⁴², PIF⁴³, and CLDNN²³ etc.), we synthesized additional test datasets with states from 2 to 5 (because most previous methods can only deal with the questions with maximum state number 5) and aSNRs at 1.18, 1.40, 1.87, 2.40, 2.97, 3.81, and 4.42, respectively (1000 samples for each dataset without zero-step traces)³⁸. We retrained the DGN net with hypothetical maximum state number of 10 but tested it with these synthesized test datasets. Figure 4(b) shows the accuracies of these methods under the different aSNRs. From Fig. 4(b), the HMM (blue curve), NoRSE (green curve) and PIF (orange curve) have similar performances with the synthesized test datasets. Our DGN method (black curve) shows much higher accuracies than these three methods with these synthesized testing datasets. For example, when aSNR = 1.40, DGN achieved 79.6% accuracy, but HMM got accuracy 30.9% only. We think the reason is that unlike HMM, biLSTM has long term memory and can extract the information from both past and future context, so even under heavy noise, it can extract the step features from the noise distribution. We also compared our DGN method with our CLDNN method (pink curve) reported previously²³, and found that DGN had similar accuracies with CLDNN. However, the CLDNN has more complicated structure including two convolutional layers and two LSTM layers, in which, the convolutional layers are charge of feature extraction and the LSTM layers discern between the photobleaching and the photoblinking. The drawback is that the convolutional layers ignore the context information from the time traces hence fail to give full play of LSTM. In DGN method, the biLSTM layer can make full use of the context information, and is able to extract state paths even according to the noise distribution^5,25.

Although most photobleaching event counting is applied to proteins with low subunit number, we think DGN is able to quantify protein oligomers or complex with high-order states. We further tested the performance of DGN to the datasets with much more steps (50 states as in Supplementary Fig. 5). In Supplementary Fig. 5, the aSNR of the 7 datasets are 4.03, 3.22, 2.36, 1.97, 1.56, 1.08, and 0.84, respectively. Under the noise level aSNR 4.03, we can obtain the estimate accuracy 52.62% of the state 49.

Photobleaching event counting from experimental data

As in Fig. 2(b), after imaging individual GFP tagged transforming growth factor-β (TGF-β) type II receptors (TβRII) on cell membrane, we applied DGN to analyze 1139 traces (400 frames for each trace) from 21 resting MCF7 cells without TGF-β stimulation, and 1618 traces from 17 MCF7 cells with TGF-β stimulation. TβRII is a crucial component in TGF-β signal transduction, which regulates many important cellular processes, including cell growth, differentiation, migration, and apoptosis⁴⁴. It is believed that TβRII can exist as monomer in resting cells. With the binding of TGF-β, TβRII can interact with TGF-β type I receptor (TβRI) to form the signaling complex (mainly tetramer consisting of two TβRII and two TβRI) and initiate the intracellular signaling^45,46,47.

In Fig. 5(a), the black curve recorded the photobleaching trace of a single EGFP tagged TβRII molecule that imaged with custom-built total internal reflection fluorescence microscope (TIRFM), and the purple broken line is the predicted state path. Noticeably, there is a transitory state 2 in the trace.

**Fig. 5: The performance of DGN on experimental photobleaching traces.**

Results from statistical photobleaching steps clearly showed that, in the resting cells, the fraction of TβRII monomers (State 2) and dimers (State 3) is 41.8 ± 0.59 and 38.0 ± 0.87%, respectively, and there are only little trimers and tetramers. After ligand stimulation, monomers undergo severe oligomerization, the dimers (State 3), trimers (State 4) and tetramers (State 5) increased to 34.0 ± 0.87, 30.3 ± 1.22 and 13.8 ± 0.58%, respectively, however, the monomers decreased to 8.2 ± 0.40% (Fig. 5(b)), suggesting that after the TGF-β stimulation, more monomeric TβRII oligomerize to dimers and other oligomers.

Dynamic finding from experimental data

We retrained this framework to analyze single-molecule fluorescence intensity traces of TβRII in living cells (Fig. 2(b)). The fluorescence intensity of observed single molecules is expected to be affected by the association or disassociation process of TβRII, leading to multiple fluorescent states existence.

The analysis of single-molecule TβRII-EGFP traces on Hela cells revealed that TβRII exhibited two dominant fluorescent states. Thus, we referred the low- and high-fluorescent states as monomer and oligomer (TβRII/TβRII or TβRI/TβRI/TβRII/TβRII), respectively (Fig. 6(a)).

**Fig. 6: The estimated dynamics from experimental fluorescence intensity traces before and after TGF-β (TGF) stimulation, and without and with Nystatin (Nys) treatment.**

Upon the TGF-β stimulation, the state occupancies of oligomers increased from 19.36 ± 0.27% (red bar of Fig. 6(a)) to 24.90 ± 0.68% (green bar of Fig. 6(a)), while that of monomers decreased from 77.61 ± 0.54 to 63.69 ± 0.61%. This shows more TβRII have aggregated to form dimers or heterotetramers for signal activation, which is consistent with the previous studies about TβRII signaling mechanism^45,48. In the meantime, the transition rates (the transition rates (a_ij) is the probability of transition from state i to state j in unit time) between different aggregation states can be characterized quantitatively as well (Table 1). In resting cells, the transition rate from monomer to dimer or heterotetramer is a₁₂ = 1.47 ± 0.086 s⁻¹ and the reversed transition rate is a₂₁ = 1.28 ± 0.047 s⁻¹, and after ligand stimulation, they changed to a₁₂ = 1.62 ± 0.039 s⁻¹, and a₂₁ = 1.03 ± 0.032 s⁻¹, correspondingly. The increasement of transition rate from monomer to oligomer and the decrement of the reversed transition rate suggest that ligand would drive the balance move more forward to oligomer formation.

Table 1 State transition rates between monomer and oligomer before and after TGF-β (TGF) stimulation, and without and with Nystatin (Nys) treatment.

Full size table

After obtained the state paths of all single molecules, we fitted the duration time of the oligomeric state with a single exponential decay and found that the oligomer duration time was prolonged from 0.54 s (red fitting curve of Fig. 6(b)) to 0.61 s (green fitting curve of Fig. 6(b)) after TGF-β stimulation, indicating that the ligand can stabilize the oligomeric state.

In addition, it has also been suggested that lipid-rafts, the cholesterol-rich membrane microdomains, have effect on TGF-β receptor aggregation. By analyzing the diffusion behavior of single-molecule tracking trajectories of TβRI-EGFP, previous research showed that the formation of heteromeric receptor complexes would be hindered when lipid-rafts were disrupted⁴⁹. We also applied our method to analyze 4459 TβRII-EGFP tracking trajectories under similar conditions. The transfected cells were incubated with 50 µg/ml Nystatin for 30 min before the addition of TGF-β, which is able to disrupt the lipid-rafts. Different from the results without Nystatin treatment, the ligand stimulation on the Nystatin treated cells resulted in a smaller increase in the oligomeric component (21.55 ± 0.52%) (blue bar of Fig. 6(a)). The transition rates from monomer to oligomer and in the reverse direction are larger than that of normal cells with TGF-β stimulation (Table 1), but the oligomer duration time is obviously less than that of normal cells with or without TGF-β stimulation. It indicates that disruption of the lipid-rafts can make TGF-β receptor association or disassociation more active, and the oligomers are difficult to stably exist.

Discussion

In summary, we have designed a discriminator-generator RNN model to analyze the hidden states sequences from the fluorescence intensity traces. This system can be trained in unsupervised manner with the raw experimental time traces, so neither predefined state number nor pre-labeling are need. With the biLSTM as the hidden layers of the discriminator and the generator, both the past and future context can be used fully to improve the prediction results. No Markovian hypothesis is embedded in, so it can even be used to treat time sequences of non-Markovian process. It’s worth noting that, from the simulation, the biLSTM units can not only reduce the interferences from all types of noise, but also extract step information from the noise distribution.

This system can be used not only for photobleaching event counting, but also for the dynamic finding. We checked its performances on the synthetic test dataset and the experimental dataset of monomeric fluorophore Cy5, and then used it to experimental single-molecule fluorescence intensity traces successfully.

For photobleaching event counting, comparing with HMM, NoRSE and PIF, DGN has outstanding performance because it has longer term memory and can make use the context fully, even extract step information from only the noise distribution, which is impossible for human identifier. Comparing with CLDNN, DGN can not only count the photobleaching steps but also give the state paths (hence step location), which is very important for calculating the transition rates between polymerization states.

For dynamic finding, DGN estimated all the dynamic properties such as durations of protein association, transition rates during protein interactions and state occupancies of different protein aggregation states, which are very important, as they underlie protein function within cells. We not only find that ligand can drive the balance forward to oligomer formation, but also find that disruption of the lipid-rafts can make TGF-β receptor association or disassociation more active, and oligomers are difficult to stably exist.

Methods

Cell culture and transfection

Full-length human TβRII cDNA were subcloned into the HindIII and BamHI sites of pEGFP-N1 (Clontech), yielding the TβRII-EGFP plasmids⁴⁵. The plasmid was confirmed by DNA sequencing.

HeLa and MCF7 cells were purchased from Cell Resource Center, IBMS, CAMS/PUMC and cultured in Dulbecco’s modified Eagle’s medium (DMEM, Gibco) supplemented with 10% fetal bovine serum (Hyclone) and antibiotics (50 mg/ml streptomycin, 50 U/ml penicillin) at 37 °C in a 5% CO₂ atmosphere. Cells were seeded in a 35-mm glass-bottom dish for 16 h and then transfected with 0.5 μg TβRII-EGFP plasmids in the DMEM medium for 4 h, respectively. Transfection was performed using Lipofectamine 2000 (Invitrogen) according to manufacturer’s instructions.

For the ligand stimulation experiments, Cells transfected with TβRII-EGFP were added with 10 ng/ml TGF-β1 (R&D) in phenol red-free DMEM for 15 min at 37 °C before fluorescence imaging.

Single-molecule imaging

Single-molecule fluorescence imaging was performed by a home-built TIRF microscope using an inverted Olympus IX71 microscope equipped with a total internal reflective fluorescence illuminator, a 100×/1.45 NA Plan Apochromatic TIRF objective and an electron-multiplying charge-coupled device (EMCCD) camera (Andor iXon DU-897D BV)^46,50. EGFP molecules were excited by a 488 nm laser at 1 mW (60 W/cm2) (Melles Griot, Carlsbad, CA, USA). The collected fluorescent signals were passed through a filter HQ 525/50 (Chroma Technology), and then directed to the EMCCD camera. The gain of the EMCCD camera was set at 300. Movies of 400 frames were acquired for each sample at a frame rate of 10 Hz.

Single-molecule tracking with U-Track software

Time-lapse sequences of single-molecule image were acquired and then tracked with U-Track methods as described in ref. ⁵¹ By fitting Gaussian kernels to approximate the two-dimensional point spread function of the microscope objective around local intensity maxima, the sub-pixel localization is achieved. To construct the trajectories, the algorithm first links the detected particles between consecutive frames, and then links the generated track segments to simultaneously close gaps and capture particle merging and splitting events.

Statistics and reproducibility

Bootstrap method was used to ensure the statistics and reproducibility.

To estimate the state distribution of the experimental data from monomeric fluorophore Cy5, five sub-datasets were generated by randomly extracting 72% of the total 2224 trajectories, the means and standard deviations were obtained as in Fig. 3(c), in which, the standard deviations from the bootstrap analysis are typically <0.5%.

To check the generalizability for photobleaching counting, we tested nine datasets (1000 samples for each dataset) with different SNR (aSNR is 0.67, 0.79, 1.04, 1.35, 1.69, 2.33, 2.98, 3.69, and 4.74, respectively). For each dataset, five sub-datasets were generated by randomly extracting 72% of the total trajectories, the means and standard deviations of accuracy were obtained as in Fig. 4(a).

To analyze the state distributions before and after TGF-β stimulation, five cell groups were generated randomly from the total experimental cells, the means and standard deviations were obtained as in Fig. 5(b).

To analyze the state distributions in live cells, five cell groups were generated randomly as well from the total experimental cells, the means and standard deviations were obtained as in Fig. 6(a).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All the data of this manuscript are archived in the public repositories Dryad (https://doi.org/10.5061/dryad.4qrfj6q64) by the link: https://datadryad.org/stash/share/9liw8QzlGcUgwdIz5nahbp6wVjXhZvD03ZKp_w5bURg.³⁸

Code availability

The custom codes described in the paper is deposited in the public repositories Zenodo (https://doi.org/10.5281/zenodo.4030065) by the link: https://zenodo.org/record/4030065#.X2Bj_WgzYuU.²⁴

References

Atanasova, M. & Whitty, A. Understanding cytokine and growth factor receptor activation mechanisms[J]. Crit. Rev. Biochem. Mol. Biol. 47, 502–530 (2012).
Article CAS PubMed Google Scholar
Ulbrich, M. H. & Isacoff, E. Y. Subunit counting in membrane-bound proteins. Nat. Methods 4, 319–321 (2007).
Article CAS PubMed Google Scholar
Yuan, J. et al. Analysis of the steps in single-molecule photobleaching traces by using the hidden markov model and maximum-likelihood clustering[J]. Chem. Asian J. 9, 2303–2308 (2014).
Article CAS PubMed Google Scholar
Kusumi, A. et al. Tracking single molecules at work in living cells[J]. Nat. Chem. Biol. 10, 524 (2014).
Article CAS PubMed Google Scholar
Jaqaman, K. et al. Robust single-particle tracking in live-cell time-lapse sequences[J]. Nat. Methods 5, 695–702 (2008).
Article CAS PubMed Google Scholar
Sergé, A. et al. Dynamic multiple-target tracing to probe spatiotemporal cartography of cell membranes[J]. Nat. Methods 5, 687–694 (2008).
Article CAS PubMed Google Scholar
Hiramoto-Yamaki, N. et al. Ultrafast diffusion of a fluorescent cholesterol analog in compartmentalized plasma membranes[J]. Traffic 15, 583–612 (2014).
Article CAS PubMed Google Scholar
Jaqaman, K. et al. Cytoskeletal control of CD36 diffusion promotes its receptor and signaling function[J]. Cell 146, 593–606 (2011).
Article CAS PubMed Google Scholar
Aggarwal, T. et al. Detection of steps in single molecule data[J]. Cellular and Molecular. Cell. Mol. Bioeng. 5, 14–31 (2012).
Article CAS PubMed Google Scholar
Tsekouras, K. et al. A novel method to accurately locate and count large numbers of steps by photobleaching[J]. Mol. Biol. Cell 27, 3601–3615 (2016).
Article CAS PubMed Google Scholar
Chung, S. H. & Kennedy, R. A. Forward-backward non-linear filtering technique for extracting small biological signals from noise[J]. J. Neurosci. Methods 40, 71–86 (1991).
Article CAS PubMed Google Scholar
Knight, A. E. & Molloy, J. E. Coupling ATP hydrolysis to mechanical work.[J]. Nat. Cell Biol. 1, E87 (1999).
Article CAS PubMed Google Scholar
Ulbrich, M. H. & Isacoff, E. Y. Subunit counting in membrane-bound proteins[J]. Nat. Methods 4, 4–4 (2007).
Article Google Scholar
Nakajo, K. et al. Stoichiometry of the KCNQ1-KCNE1 ion channel complex[J]. Proc. Natl Acad. Sci. 107, 18862–18867 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. Jump and sharp cusp detection by wavelets[J]. Biometrika 82, 385–397 (1995).
Article Google Scholar
Sadler, B. M. & Swami, A. Analysis of multiscale products for step detection and estimation[J]. IEEE Trans. Inf. Theory 45, 1043–1051 (1999).
Article Google Scholar
Carter, N. J. & Cross, R. A. Mechanics of the kinesin step[J]. Nature 435, 308–312 (2005).
Article CAS PubMed Google Scholar
Kerssemakers, J. W. J. et al. Assembly dynamics of microtubules at molecular resolution[J]. Nature 442, 709–712 (2006).
Article CAS PubMed Google Scholar
Andrec, M., Levy, R. M. & Talaga, D. S. Direct determination of kinetic rates from single-molecule photon arrival trajectories using hidden markov models[J]. J. Phys. Chem. A 107, 7454–7464 (2003).
Article CAS PubMed PubMed Central Google Scholar
Messina, T. C. et al. Hidden Markov model analysis of multichromophore photobleaching[J]. J. Phys. Chem. B 110, 16366–16376 (2006).
Article CAS PubMed Google Scholar
Das, S. K. et al. Membrane protein stoichiometry determined from the step-wise photobleaching of dye-labelled subunits[J]. Chembiochem 8, 994–999 (2007).
Article CAS PubMed Google Scholar
Ha, T. Single-molecule methods leap ahead. [J]. Nat. Methods 11, 1015 (2014).
Article CAS PubMed Google Scholar
Xu, J. et al. Automated stoichiometry analysis of single-molecule fluorescence imaging traces via deep learning[J]. J. Am. Chem. Soc. 141, 6976–6985 (2019).
Article CAS PubMed Google Scholar
Yuan, J. Analyzing protein dynamics from fluorescence intensity traces using unsupervised deep learning network. Zenodo https://doi.org/10.5281/zenodo.4030065 (2020).
Graves, A. & Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18, 602–610 (2005).
Article PubMed Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory[J]. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Thireou, T. & Reczko, M. Bidirectional long short-term memory networks for predicting the subcellular localization of eukaryotic proteins. IEEE/ACM Trans. Comput. Biol. Bioinforma. 4, 441–446 (2007).
Article CAS Google Scholar
Kingma, D. P. & Welling, M. “Auto-Encoding Variational Bayes”, the International Conference on Learning Representations (ICLR) 2014.
Goodfellow, I. J. et al. Generative adversarial nets[C]. International Conference on Neural Information Processing Systems (MIT Press, 2014).
Kalchbrenner N. & Blunsom P. Recurrent continuous translation models[C]. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 1700–1709 (Association for Computational Linguistics, 2013).
Brown, P. F. et al. The mathematics of statistical machine translation: Parameter estimation[J]. Comput. Linguist. 19, 263–311 (1993).
Google Scholar
Arjovsky, M., Chintala, S., & Bottou, L. “Wasserstein generative adversarial networks”. Proceedings of the 34th International Conference on MachineLearning, Sydney, Australia, PMLR 70, 2017.
Eddy, S. R. What is a hidden Markov model?[J]. Nat. Biotechnol. 22, 1315–1316 (2004).
Article CAS PubMed Google Scholar
Gagniuc, P. A. Markov Chains: from Theory to Implementation and Experimentation. (John Wiley & Sons, 2017).
Persson, F., Linden, M., Unoson, C. & Elf, J. Extracting intracellular diffusive states and transition rates from single-molecule tracking data. Nat. Methods 10, 265–269 (2013).
Article PubMed Google Scholar
Zhong, G. et al. Generative adversarial networks with decoder-encoder output noise. Neural Netw. 127, 19–28 (2020).
Article PubMed Google Scholar
Bishop, C. M. Pattern recognition and machine learning. Cristopher M. Bishop. 490p (Springer, 2006).
Yuan, J. Analyzing protein dynamics from fluorescence intensity traces using unsupervised deep learning network, v3, Dryad, Dataset, https://doi.org/10.5061/dryad.4qrfj6q64 (2020).
Ng, A. Y. Feature selection, L 1 vs. L 2 regularization, and rotational invariance[C]. Proceedings of the Twenty-first International Conference on Machine Learning. Vol. 78 (ACM, 2004).
Yan, X. et al. Weight thresholding on complex networks. Phys. Rev. E 98, 042304 (2018).
Chen, J. et al. Probing cellular protein complexes using single-molecule pull-down. Nature 473, 484–488 (2011).
Article PubMed CAS Google Scholar
Reuel, N. F. et al. NoRSE: noise reduction and state evaluator for high-frequency single event traces. Bioinformatics 28, 296–7 (2012).
Article CAS PubMed Google Scholar
McGuire, H., Aurousseau, M. R. P., Bowie, D. & Blunck, R. Automating single subunit counting of membrane proteins in mammalian cells. J. Biol. Chem. 287, 35912–35921 (2012).
Article CAS PubMed Google Scholar
Clarke, D. C. & Liu, X. Decoding the quantitative nature of TGF-β/Smad signaling[J]. Trends Cell Biol. 18, 430–442 (2008).
Article CAS PubMed Google Scholar
Zhang, W. et al. Single-molecule imaging reveals transforming growth factor-β-induced type II receptor dimerization. Proc. Natl Acad. Sci. USA 106, 15679–83. (2009).
Article CAS PubMed Google Scholar
Zhang, W. et al. Monomeric type I and type III transforming growth factor-beta receptors and their dimerization revealed by sin-gle-molecule imaging. Cell Res. 20, 1216–1223 (2010).
Article CAS PubMed Google Scholar
Huang, T. et al. TGF-beta signaling is mediated by two autonomously functioning TβRI: TβRII pairs. EMBO J. 30, 1263–1276 (2011).
Article CAS PubMed Google Scholar
Zhao, R. et al. Quantitative single-molecule study of TGF-β/ Smad signaling. Acta Biochim. Biophys. Sin. 50, 1–9 (2017).
Google Scholar
Ma, X. et al. Lateral diffusion of TGF-beta type I receptor studied by single-molecule imaging. Biochem. Biophys. Res. Commun. 356, 67–71 (2007).
Article CAS PubMed Google Scholar
Zhang, M. et al. Single-molecule imaging reveals the stoichiometry change of epidermal growth factor receptor during transactivation by β_2-adrenergic receptor. Sci. China (Chem.) 10, 52–59 (2017).
Google Scholar
Li, N. et al. Single-molecule imaging reveals the activation dynamics of intracellular protein Smad3 on cell membrane. Sci. Rep. 6, 33469 (2016).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (22077124, 21735006, 91939301) and the Chinese Academy of Sciences.

Author information

Authors and Affiliations

Key Laboratory of Molecular Nanostructure and Nanotechnology, CAS Research/Education Center for Excellence in Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, 100190, Beijing, China
Jinghe Yuan, Jiachao Xu, Ming Cheng, Xiaolong Kou & Xiaohong Fang
Division of Chemical Metrology and Analytical Science, National Institute of Metrology, 100029, Beijing, China
Rong Zhao
University of Chinese Academy of Sciences, 100049, Beijing, China
Zidi Qin & Xiaohong Fang

Authors

Jinghe Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Rong Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jiachao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Ming Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Zidi Qin
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolong Kou
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohong Fang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Y. designed the study, conducted analysis, curated the data, formulated methodology, and wrote the paper. R.Z., J.X., M.C., Z.Q., and X.K. performed the single-molecule imaging and tracking experiments. X.F. supervised the project, and reviewed and edited the paper.

Corresponding authors

Correspondence to Jinghe Yuan or Xiaohong Fang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yuan, J., Zhao, R., Xu, J. et al. Analyzing protein dynamics from fluorescence intensity traces using unsupervised deep learning network. Commun Biol 3, 669 (2020). https://doi.org/10.1038/s42003-020-01389-z

Download citation

Received: 06 November 2019
Accepted: 13 October 2020
Published: 12 November 2020
DOI: https://doi.org/10.1038/s42003-020-01389-z

This article is cited by

Single-molecule FRET for probing nanoscale biomolecular dynamics
- Daniel Nettels
- Nicola Galvanetto
- Benjamin Schuler
Nature Reviews Physics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.