Multi-parameter photon-by-photon hidden Markov modeling

Harris, Paul David; Narducci, Alessandra; Gebhardt, Christian; Cordes, Thorben; Weiss, Shimon; Lerner, Eitan

doi:10.1038/s41467-022-28632-x

Download PDF

Article
Open access
Published: 22 February 2022

Multi-parameter photon-by-photon hidden Markov modeling

Nature Communications volume 13, Article number: 1000 (2022) Cite this article

4506 Accesses
16 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Single molecule Förster resonance energy transfer (smFRET) is a unique biophysical approach for studying conformational dynamics in biomacromolecules. Photon-by-photon hidden Markov modeling (H²MM) is an analysis tool that can quantify FRET dynamics of single biomolecules, even if they occur on the sub-millisecond timescale. However, dye photophysical transitions intertwined with FRET dynamics may cause artifacts. Here, we introduce multi-parameter H²MM (mpH²MM), which assists in identifying FRET dynamics based on simultaneous observation of multiple experimentally-derived parameters. We show the importance of using mpH²MM to decouple FRET dynamics caused by conformational changes from photophysical transitions in confocal-based smFRET measurements of a DNA hairpin, the maltose binding protein, MalE, and the type-III secretion system effector, YopO, from Yersinia species, all exhibiting conformational dynamics ranging from the sub-second to microsecond timescales. Overall, we show that using mpH²MM facilitates the identification and quantification of biomolecular sub-populations and their origin.

Fluorescence resonance energy transfer at the single-molecule level

Article 28 March 2024

Unscrambling fluorophore blinking for comprehensive cluster detection via photoactivated localization microscopy

Article Open access 05 October 2020

Two-colour single-molecule photoinduced electron transfer fluorescence imaging microscopy of chaperone dynamics

Article Open access 29 November 2021

Introduction

The role of structural dynamics in biomolecular function has come to the forefront of biophysical research^1,2. Biomolecules in solution exhibit structural dynamics at a hierarchy of timescales and modes, from bond rotations to movements of entire globular domains, occurring at times from picoseconds to seconds and longer³. In many cases, the stages in the biomolecular function are promoted by different sub-populations of closely-related structures, or conformations. Examples include coupling of catalytic activity to domain dynamics in some enzymes^4,5, the dynamics of the DNA bubble in transcription initiation to support transcription start site selection^6,7, DNA mismatch repair⁸, protein translocation⁹, chaperone action¹⁰, the allosteric regulation of the AAA+ disaggregase¹¹, active membrane transport^{12,13,14,15,16,17}, and many other important biochemical processes, in which structural dynamics is coupled to or influences biological function^1,2. Thus, methods capable of identifying and characterizing distinctly time-separated structural sub-populations of biomolecules are of great interest in biomolecular sciences and structural biology.

NMR- and EPR-based methods^18,19,20,21 as well as single molecule methods^{22,23,24,25,26} have come to the forefront in the field of dynamic structural biology, each with their own advantages and limitations. Single molecule methods allow probing one biomolecule at a time while tracking multiple experimental parameters simultaneously. This approach provides access to conformational heterogeneity, real-time kinetics, and identification of rare conformational states otherwise masked due to ensemble averaging.

One of the most popular single molecule approaches relies on the phenomenon of Förster resonance energy transfer (FRET), single molecule FRET (smFRET)²⁷, where the biomolecule of interest is site-specifically labeled at two strategic residues with two fluorescent dyes, which can exhibit transfer of excitation energy from the donor dye to the acceptor dye with a probability (or efficiency; E), which is inversely proportional to the sixth power of the distance between the dyes, according to the Förster relation^28,29,30. The FRET efficiency can be determined either ratiometrically, through the donor and acceptor fluorescence intensities, or through the use of fluorescence lifetime-based methods. Ratiometric methods yield an initial raw efficiency, E_raw (see Supplementary Eq. 1), to which correction factors must be applied, such as leakage of donor photons into the acceptor channel, direct excitation of the acceptor by the donor light source, differences in donor and acceptor fluorescence quantum yields and detection efficiencies (better known as the γ -factor), in order to yield accurate E^31,32,33. Lifetime-based approaches do not require such corrections, but rely on pulsed laser sources and time-correlated single photon counting modules³⁴. SmFRET has proven to be a powerful tool to disentangle conformational sub-populations of biomacromolecules undergoing dynamic transitions over a range of timescales³. Nevertheless, smFRET remains limited by the time resolution and observation time of the apparatus³. A popular approach is the observation of individual freely-diffusing molecules through the excitation volume of a confocal microscope^1,2. Here the observation time of a single molecule is on the order of a few milliseconds, with possible time resolution of dynamics as rapid as nanoseconds using advanced analyses of photon statistics within single molecule photon bursts (Fig. 1a, b). Some of the latter methods include photon distribution analysis, or probability distribution analysis (PDA)^{35,36,37,38,39,40}, burst variance analysis (BVA)⁴¹, FRET two-kernel density estimator (FRET-2CDE)⁴², analysis of two-dimensional histograms of donor fluorescence lifetimes, and ratiometric FRET efficiencies of bursts, also known as FRET lines^34,43,44, fluorescence correlation spectroscopy (FCS)^45,46 coupled to FRET^47,48,49, maximum likelihood approaches^{50,51,52,53,54}, such as hidden Markov modeling^4,7,55,56 (HMM), and photon recoloring^57,58. These have been summarized in recent reviews of the field^1,2.

**Fig. 1: Cartoon representations of data acquisition, and biological systems examined in this work.**

Photon-by-photon hidden Markov modeling (H²MM)⁵⁶ is a maximum likelihood method^57,59 that adopts the HMM machinery, while working directly with the photon data without binning into fluorescence intensity time traces, other than the clock time of the acquisition card, e.g. 50 ns for nsALEX, 12.5 ns for μsALEX. H²MM can extract the number of states involved in the underlying FRET dynamics, their mean E_raw values and transition rate constants. Nevertheless, while advanced smFRET setups often detect multiple fluorescence parameters beyond the intensities, such as in alternating laser excitation (ALEX)^60,61 or in multi-color smFRET-based measurements^{62,63,64,65,66,67,68,69}, H²MM in its current iteration only uses the raw FRET efficiency of a single donor-acceptor pair of dyes.

Here, we introduce multi-parameter H²MM (mpH²MM), which enables incorporation of multiple parameters in the analysis, through additional photon streams. We demonstrate this concept with two types of ALEX experiments: microsecond ALEX (μsALEX) and nanosecond ALEX (nsALEX; known also as pulsed interleaved excitation, PIE)^60,61. We applied this approach to different biomacromolecular complexes with dynamics ranging from the sub-second to microsecond timescales: (i) a DNA hairpin loop⁷⁰, (ii) the maltose binding protein MalE from E. coli, and (iii) YopO, a type-III-secretion system effector from pathogenic Yersinia species⁷¹ (Fig. 1c–e, Supplementary Fig. 1). Our results and analysis demonstrate that mpH²MM is able to quantitatively report sub-populations based on both the ALEX-relevant mean parameters, E_raw and the stoichiometry, S_raw (see Supplementary Eq. 2), as well as their transition rate constants, demonstrating FRET-relevant conformational transitions, as well as FRET-irrelevant photophysical transitions. We also present the H2MM_C python package⁷², with a backend written in C, for data processing, which is approximately two orders of magnitude faster than the previous implementation of H²MM in matlab⁵⁶.

Importantly, throughout this work, we make the clear distinction between sub-populations and states, where the latter is referred to the state models used to describe the dynamically interconverting sub-populations resolved from the data. This distinction is important, since thermodynamic states are single potential wells, and it is possible that the identified sub-populations are actually a group of states that interconvert much faster than the time resolution of the measurements. It should also be noted that we use the term parameter in multi-parameter H²MM to refer to parameters derived from ratios of sums of photons in different photon steams (e.g., E and S). These are distinct from state model parameters (e.g., rate constants, mean E).

Results

Verification of mpH²MM against simulated data

Analysis with single parameter H²MM (spH²MM) and mpH²MM can be performed using any given state model. Therefore, we must select the most likely state model among several, differing in their number of states and number of transition rate constants. Discriminating over- and under-fitted state models from the most likely model has proven difficult in the past^7,73. Previously, we proposed the modified Bayes information criterion (BIC’), which does not provide an extremum-based decision on the most likely state model⁷. In the current work, we implement the integrated complete likelihood (ICL)^74,75, which gets a minimum value for the most likely state model, as the primary criterion for state model selection.

Using simulated smFRET data, where the ground truth of the number and properties of the states is known, we find that the ICL is more reliable than the BIC’ at selecting the most reliable state model (see Supplementary Fig. 2, and Jupyter notebooks in supplementary dataset⁷²). Yet, there are instances in the simulated data, and in real data sets, we describe later, where the selection of the most likely state model based on ICL is of a model with too few states, relative to our prior knowledge of the system. Therefore, we always consider the ICL first, then BIC’, and take into account the prior knowledge of the system when selecting the most likely state model (see Supplementary Note 2 for expanded discussion, Supplementary Fig. 2).

To verify the validity of the multi-parameter approach, we perform a series of simple simulations (see supplementary Jupyter notebook mpH2MMsimulations⁷⁶). We compare results of spH²MM and mpH²MM analyses of simulated data where the acceptor excitation photon stream was either included or excluded. Using this data, we find that selecting the most likely state model based on the ICL parameter reliably identifies the correct ground truth state model, and this model accurately reproduces the transition rate constants, E_raw and S_raw values used in the simulation (Supplementary Figs. 3 and 4, E_raw, Supplementary Table 1, S_raw values defined in Supplementary Eqs. 6 and 7, respectively). In contrast, spH²MM is less reliable, and depending on the circumstances, it is unable to distinguish states with similar E_raw values, which are easily distinguished in mpH²MM by their S_raw values. Further, without the information about S_raw, interpretation of the models is more difficult, even if the correct number of states and their accurate E_raw values are recovered in spH²MM.

DNA hairpin exhibiting millisecond dynamics

As a first biological test system for mpH²MM, we used a DNA hairpin system introduced by Tsukanov et al. with a loop containing 31 adenines and a six base-pair stem⁷⁰. The opening and closing rate constants of the hairpin vary as a function of the GC content of the stem as well as the sodium chloride (NaCl) concentration⁷⁰. When appropriately labeled with a FRET donor and acceptor pair of dyes (ATTO 550 and ATTO 647N, respectively), the open and closed hairpin sub-populations exhibit distinct low and high mean E_raw values, respectively. The hairpin containing two GCs out of the six stem bases, which we term HP3, exhibited opening and closing rates of a few milliseconds, depending on the NaCl concentration in the buffer. Such a DNA construct with well-characterized and tunable transition rates serves as an ideal model system to test and characterize the performance of mpH²MM.

We first perform nsALEX measurements⁶¹ with this construct at a concentration of 300 mM NaCl, where a mix of both open and closed states are expected to interchange dynamically⁷⁰. As a qualitative test for FRET dynamics occurring within bursts, we use burst variance analysis (BVA)⁴¹, which compares the expected variance in E_raw based on shot noise (the static FRET semi-circle) against the actual variance in E_raw. BVA of the HP3 data shows clear deviation from the static FRET semi-circle, suggesting that individual HP3 molecules are undergoing FRET dynamics as they traverse the confocal volume, which we term within-burst FRET dynamics (Fig. 2a). E-τ_D plots⁴⁴ also indicate within-burst dynamics (see Supplementary Fig. 5). However, without the prior knowledge of the DNA hairpin behavior as a two-state FRET system, and without knowing how many more sub-populations unrelated to FRET may exist, it is not necessarily clear how many distinct sub-populations are involved in within-burst dynamics. In visual examination of the 2D E-S plot, three sub-populations are apparent: (i) an open hairpin sub-population with mean E_raw of 0.2, (ii) a closed hairpin sub-population, with a E_raw of 0.65, both open and closed sub-populations have mean S_raw of 0.5, and (iii) a third sub-population with a mean E_raw of 0, and mean S_raw of 1, where the acceptor is either in a dark state, or missing altogether (Fig. 2b). The 2D E-S plot also exhibits bursts with intermediate E_raw values, bridging between the open and closed hairpin sub-populations. As these bursts are particularly dynamic in the BVA analysis, these are bursts where the hairpin is undergoing opening and closing transitions while crossing the confocal volume.

**Fig. 2: mpH²MM results for DNA hairpin at 300 mM NaCl.**

Analyses of this data with spH²MM and mpH²MM show different patterns in the ICL values of the state models. The ICL is minimized for spH²MM models for a two-state model, while it is minimized for a four-state model when using mpH²MM. Visual inspection of the one-dimensional projection of burst data onto the E_raw parameter immediately suggests an explanation for this discrepancy, as it appears as only two sub-populations. The donor-only or dark acceptor state, and the open hairpin state exhibit similar low E_raw values and are difficult to distinguish as sub-populations based solely on E_raw. This projection reflects the data accessible to spH²MM, the donor excitation streams, and thus the open hairpin and dark acceptor states are expected to have nearly identical FRET signatures with regard to the streams accessible to spH²MM, thus leading to the false inference of only two states. The open hairpin FRET sub-population and the dark acceptor states are, however, quite distinct with regard to the acceptor excitation stream, which is accessible to mpH²MM.

In the ICL-based selected four-state model retrieved by mpH²MM, two out of the four states match nicely with the states in the ICL-based selected model from spH²MM model, having similar E_raw values. Their S_raw values are ~ 0.5 (Fig. 2b red circles), as expected for molecules undergoing FRET. The third and fourth states in the model can be matched to dark acceptor and dark donor sub-populations, respectively. The third state has a E_raw value ~ 0 and a S_raw value ~ 1 (Fig. 2b, top left red circle). This state has a clear sub-population of bursts associated with it in the 2D E-S plot. The fourth state has an intermediate E_raw value, and a very low S_raw value of ~ 0.17 (Fig. 2b, bottom red circle, Supplementary Table 2). There is no obvious sub-population visually observed in the E-S plots to which this would correspond, but the E_raw and S_raw values are consistent with this being a dark donor state. More importantly, comparing the parameters of the state models retrieved by spH²MM and mpH²MM, we find that the transition rate constants derived using mpH²MM are closer to those found by Tsukanov et al.⁷⁰ than those extracted using spH²MM (Supplementary Table 3, and supplementary .csv files of all state models found by H²MM analysis⁷²). The transition rate constants provide a clue as to why the fourth state does not show up in the E-S plots as a distinct sub-population, as the transition rates predict rare transitions to it, and rapid transitions away from it. Thus, populating the fourth state occurs only briefly and rarely in bursts undergoing rapid dynamics, such that it does not appear as a clear sub-population in the E-S plots (Supplementary Table 3, and supplementary .csv file⁷²).

The Viterbi algorithm finds the most likely state path through each burst, given a state model and its parameter values (Fig. 2d and Supplementary Fig. 6a–e). We use this to classify bursts by which states are present within each burst (Fig. 2d, Supplementary Fig. 6f), and separate photons into dwells, for which E_raw and S_raw can be defined (Fig. 2d, Supplementary Eqs. 8 and 9 in Supplementary Note 1.3). Additional analysis of dwells and their durations is provided in Supplementary Fig. 7. Visual examination of the burst-based E-S plot (Fig. 2e) shows that the Viterbi algorithm reasonably classifies most bursts that have E_raw and S_raw values close to the predicted value of a given sub-population as only having that state present, as well as bursts with intermediate E_raw and S_raw that are predicted to include dwells of multiple states. Notably, there are only a few bursts classified as having dwells solely in the dark donor state (Fig. 2e), keeping with what is predicted by the transition rates, and indeed, few dwells are even found in this state (Fig. 2f, Supplementary Fig. 6g). The scarcity of the donor dark state in the Viterbi analysis serves to both confirm this observation and prove the sensitivity of mpH²MM at the same time. In summary, using spH²MM, we do not properly decouple the FRET-relevant information from the FRET-irrelevant dye transitions to fluorophore dark states for the DNA hairpin data, which influences the accuracy of the retrieved values for the E_raw and rate constant parameters. On the other hand, using mpH²MM assists in the proper decoupling of the FRET-relevant information from the FRET-irrelevant ones and in gaining accurate parameter values. See Supplementary Figs. 8–19 for additional hairpin data acquired at different concentrations of NaCl. Now that we have verified mpH²MM with a well-defined biomolecular system of the DNA hairpin, HP3, we move to explore its usefulness in other biomacromolecular systems.

Quantifying the dynamics of a substrate-binding protein

In the previous example, we examined a system that exhibits intrinsic conformational dynamics, hence dynamics that is not induced by binding of a ligand. Now, we test mpH²MM on a system with conformational dynamics that is induced by substrate binding. For this, we select the periplasmic maltose binding protein, MalE from E. coli⁷⁷, which is the extracellular component of the maltose ABC importer MalFGK₂-E⁷⁷. MalE is a bilobed protein with a structural core built from a periplasmic binding protein (PBP)-like II domain. Two rigid domains, D₁ and D₂, are separated by a two-segment β -strand hinge and are complemented by a C-terminal embellishment that facilitates structural dynamics between open and closed states¹⁷. This allows for MalE to close upon substrate binding, similar to a venus fly-trap. For our nsALEX smFRET measurements we produced a MalE double-cysteine variant with labels at the outer sides of the two lobes, specifically residues T36C and S352C. As shown previously, this enables tracking of the opening and closing dynamics in single MalE molecules^17,78. We test three concentrations of the substrate maltose: none (apo), 1 μM (close to the K_D value⁷⁷) and 1 mM (holo). FRET histograms, using a dual channel burst search (DCBS)³⁶ filter exhibit three sub-populations: (i) a minor, low E_raw sub-population at E_raw of 0.1, (ii) a major sub-population with an intermediate E_raw of 0.5, and (iii) a major sub-population with a high E_raw of 0.7. We use DCBS because the donor- and acceptor-only sub-populations are very strong, and otherwise overwhelm the nsALEX data. Since we apply DCBS, bursts of the high S_raw and low E_raw values cannot represent molecules with permanently dark acceptor, but could be the result of either a real conformation, or of frequent acceptor blinking. With increasing maltose concentration, the fraction of the ~ 0.5 E_raw sub-population decreases, while the fraction of the ~ 0.7 E_raw sub-population increases (Fig. 3).

The BVA plot exhibits evidence of within-burst dynamics, so mpH²MM analysis of within-burst dynamics is warranted (Fig. 3, top row).

In mpH²MM analysis, the ICL-based model selection identifies the five-state model for 1 μM maltose, and the four-state model for 1 mM maltose. Examining these models, we find that all contain a single high S_raw state and a single low S_raw state, with the high S_raw state also having an E_raw of 0, and importantly, no bursts exist in these ranges due to the use of a DCBS filter (Supplementary Tables 4 and 5). Therefore, we can conclude that these states are the result of transition in the donor and acceptor dyes for the low and high S_raw states, respectively. The transition rates of the models and Viterbi analysis both show that these states are appreciably populated (Fig. 3, bottom row, Supplementary Figs. 20–22, Supplementary Table 7, and supplementary .csv file⁷²), thus the use of mpH²MM analysis is vital here. Depending on the maltose concentration, the ICL of spH²MM analysis predicts different numbers of states for each concentration, and the E_raw values of the states within these models are far less consistent (Fig. 3, vertical bars). These states are often similar to states found by mpH²MM, but their interpretation would be ambiguous if we did not have mpH²MM for additional information. In other cases, spH²MM-based states appear as a fusion of two states found by mpH²MM.

As another example of how vital mpH²MM analysis is in this case, consider the BVA signature of the apo form. Our analysis shows that the FRET dynamics for E_raw is not due to the actual dynamics between the open conformations of MalE. This is clear since the ~ 0.7 E_raw sub-population is not identified if maltose is not supplied. Such interpretation cannot be made from spH²MM results, due to the less consistent prediction of the number of states, and the parameters of those models. Therefore, we can confirm that MalE undergoes large-scale conformational dynamics linked to its function, mostly induced by the binding of maltose, hence it follows an induced-fit binding mechanism.

Adapting to μsALEX: microsecond dynamics of YopO

Finally, we demonstrate how to apply mpH²MM with μsALEX experiments. For this, we use the type-III secretion effector from Yersinia species, YopO⁷¹. We measure the conformational dynamics of a double-cysteine variant of YopO, with dyes labeling residues L113C and L497C. These labeling positions are expected to change distances upon binding to actin. Burst selection is performed using the DCBS filter, for the same reasons as in the MalE data - there are strong blinking dynamics that overwhelm the analysis otherwise. Interestingly, in the absence of actin there appears to be a single FRET sub-population in E-S plots, with tails towards dark donor and dark acceptor sub-populations. Nevertheless, BVA shows these bursts have a variance above the expected static FRET semi-circle (Fig. 4a, top panel), and hence within-burst dynamics. In the presence of bound actin (60 μM), a main sub-population is present with a shift toward lower E_raw values, and the BVA plot suggests no signature of within-burst dynamics at that main sub-population (Fig. 4b, top panel).

Using this μsALEX data with mpH²MM, the alternation period proves to be an obstacle, causing mpH²MM analysis to fail without a key adjustment to the data. Unlike in nsALEX, multiple photons can be detected during a given alternation period of the donor or acceptor excitation lasers. This results in photons originating from donor excitation that are temporally separated from photons originating from acceptor excitation in a periodic pattern, resulting in alternating periods where no photons originating from donor excitation are detected, and alternatively periods where no photons originating from acceptor excitation are detected. When we first apply mpH²MM to μsALEX data, we find that instead of detecting states with meaningful S_raw values, all states have S_raw values of either 0 or 1, and transition rates are all very similar to the alternation rate, meaning that mpH²MM detects the alternation rate instead of actual conformational dynamics (Supplementary Fig. 23a, b).

In that respect, to enable meaningful μsALEX analysis via mpH²MM that incorporates photons originating from acceptor excitation, we introduce a shift so that the times of the acceptor excitation photons overlap with the photons originating from donor excitation (see Supplementary Note 1.1.1). By doing so, the alternation period is no longer detected and meaningful dynamics with E_raw and S_raw values can be recovered (Fig. 4, Supplementary Fig. 23c). The usefulness of this analysis is evidenced by the detection of dark donor and dark acceptor states. Thus application of mpH²MM even to μsALEX data usually yields better results than with spH²MM. However, caution must be taken to avoid artefacts due to the alternation period. For instance, if the timescale of a transition approaches that of the alternation period, S_raw values may be biased or averaged together due to the shift (for in-depth discussion on this topic, see Supplementary Note 1.1.2).

Applying mpH²MM to analyze the measured data of YopO in the presence of actin, the most likely model is clearly a four-state model, using an alternation period of 50 μs (20 kHz alternation rate). The ICL-based model selection identifies four states, while the BIC’-based selection shows the four-state model to be close to the 0.005 threshold, and the five-state model can be further disregarded based on its reasonableness. Selection is more difficult for the apo results, as the two criteria disagree with ICL-based model selection that identifies three states and BIC’-based model selection that identifies five. Therefore, the most likely model is either the three-, four-, or five-state model, and examination of these models and prior knowledge of the data is necessary. The three-state model predicts states that appear as dark donor and acceptor, and a single FRET state. This model can be ruled out because the BVA shows significant dynamics around the single FRET population, and thus the single FRET state is insufficient to explain the BVA signature. The five-state model, on the other hand, suffers from the opposite problem - there are two states with very low S_raw values, where it appears as though the dark donor state has split into two. The four-state model, however, is reasonable, showing two FRET states, dark donor and dark acceptor states (Supplementary Tables 6 and 7). Transition rates between the high and low FRET states are 12,400 s⁻¹ and 6,000 s⁻¹ for transitions from high E_raw to low E_raw states, and for transitions from low E_raw to high E_raw states, respectively. These dynamics, however, approach the timescale of the alternation rate (20 kHz; for detailed discussion and examination, see Supplementary Note 1.1.2 as well as Supplementary Figs. 24, 25). Based on the analyses of the Viterbi-derived dwell times, error analysis of data sub-samples, and comparison with the results of mpH²MM analysis employed on other measurements using different alternation periods, we conclude that these transition rates are not artifacts, and reflect true FRET transitions in the data (see Supplementary Note 1.1.2, Supplementary Figs. 26–28 for comparison of different alternation periods, Supplementary Tables 8 and 9 for optimized model of different alternation period).

The timescale of the FRET dynamics being faster than burst duration by two orders of magnitude explains the appearance of the data in the FRET histogram as a single FRET population, yet with a signature of within-burst dynamics in the BVA plot. Inspecting the results of the mpH²MM analysis, the meaning of the within-burst FRET dynamics of YopO in the absence of actin becomes clear - it exhibits transitions in the tens of microseconds between two main FRET states intertwined with rapid transitions to dark donor and dark acceptor states. Each burst that lasts a few milliseconds contains multiple dwells in the underlying states and transitions between them, and so the bursts are averaged-out as a single main population. When comparing these results, with the analysis results of YopO in the presence of bound actin, it becomes clear that the lower E_raw state of the two FRET states in the absence of actin is stabilized upon actin binding. Therefore, we can conclude that YopO conformational dynamics relevant to actin binding occurs intrinsically, regardless of the presence of actin, and that actin stabilizes and locks one of the pre-existing conformations.

Without using mpH²MM, it would have been difficult to accurately report on this dynamics, as the FRET within-burst dynamics is intertwined with FRET-irrelevant transitions to dark states. It should be noted that we have successfully decoupled conformational and photophysical dynamics in μsALEX data without the use of fluorescence lifetimes.

Discussion

MpH²MM increases both the information content of the results and the sensitivity of the H²MM algorithm to differences in the photon streams that are too subtle when examining only a single parameter. We have shown that mpH²MM is able to disentangle dark acceptor states from low FRET states that have structural meaning. We have exhibited the advantage of using mpH²MM to elucidate an accurate quantitative picture on two proteins with two types of conformational dynamics that serve their function: (1) MalE with conformational dynamics induced by maltose binding, and (2) YopO with conformational dynamics occurring intrinsically, with actin binding stabilizing one of the states. In both cases, the overall picture is complicated by having the FRET-relevant transitions intertwined with the FRET-irrelevant dye transition to dark states, and not taking these into account could result in wrongly elucidated quantities and potentially wrong interpretations. Of note is the rapid conformational dynamics on the order of tens of microseconds in YopO when actin was absent. The exact description of the dynamics was possible using mpH²MM on μsALEX, and hence did not necessarily require analysis of the correlation of donor fluorescence lifetimes with ratiometric FRET values, as can be done using FRET lines fits to E-τ_D 2D plots in lifetime-based smFRET⁴⁴. As μsALEX and nsALEX setups are now commonly used, the acceptor excitation stream is usually available, therefore, mpH²MM maximizes the use of available data for characterizing rapidly interconverting sub-populations.

MpH²MM is, therefore, a powerful tool for the quantification of rapid conformational dynamics in a variety of systems, while also extracting information that can be used to extract inter-dye distance distributions. The integration of the acceptor excitation photon stream is critical in this process, as we have shown that spH²MM often conflates photophysical and conformational states, leading to incorrect E_raw and transition rate constants. Comparing a given protein or other biomacromolecular system with different ligands, or concentrations of ligands, it is possible to discriminate when a system demonstrates intrinsic conformational dynamics or conformational changes triggered by ligands. MpH²MM provides accurate quantitative measures of both transition rates and mean E_raw values, the latter of which can be converted into accurate mean FRET efficiency values with the proper correction factors for the system⁷⁹. Such information can then be converted into mean inter-dye distances, which provide invaluable information for FRET-based integrative structural models^7,33,79.

The success of integrating the acceptor excitation stream into our analysis, suggests that a similar approach could also be employed in camera-based smFRET applications. Alternating laser excitation, and HMM algorithms are commonly employed in analysis of data, although the information of the acceptor excitation stream is discarded in these analyses, and only used for truncating trajectories upon acceptor bleaching. The present mpH²MM algorithm is inappropriate for such data as the information is based on intensity and a constant frame rate, instead of single photon arrivals with variable interphoton times. Alternatively, the introduction of SPAD arrays⁸⁰ should allow analysis of immobilized molecules with single photon precision, which would allow for analysis of such data directly using mpH²MM.

mpH²MM is also not restricted to our demonstrated application in two detector setups with nsALEX and μsALEX. The most obvious application of mpH²MM beyond ALEX, is with the multiple photon streams in multi-parameter fluorescence detection (MFD)^34,43, or with multi-color smFRET-based measurements^{62,63,64,65,66,67,68,69}. Here, three or even four spectrally-distinct dyes are attached to the biomolecule of interest, and each produces a distinct photon stream. This enables the simultaneous observation of multiple inter-dye FRET efficiencies at once. If qualitative tests indicate that such a system is undergoing within-burst dynamics, mpH²MM is well-suited to extract the transfer efficiencies relevant to the underlying dynamically interconverting sub-populations. Applying these methods is as simple as assigning an index to each photon stream. We include a supplementary Jupyter notebook using a developer version of FRETBursts⁸¹ that accepts fluorescence anisotropy information from multi-parameter fluorescence detection, or from MFD coupled to pulsed interleaved excitation^34,43, and demonstrate mpH²MM’s ability to disentangle fluorescence anisotropies on data kindly provided by Cao et al.⁸². Values within the emission probability matrix can then be used as intensities to calculate all relevant ratiometric values. Multiple conformational sub-populations interconverting at sub-millisecond timescales could be simultaneously measured and disentangled with such a setup. Information on fluorescence anisotropy could also be incorporated, which, depending on the labeling scheme could report on dye steric restriction or oligomeric state of the system in question.

In this work, we used two ratiometric parameters drawn from ratios of photon counts of the photon streams available in ALEX-based measurements within the mpH²MM framework. In some smFRET measurements, such as in nsALEX, the photon nanotimes, which are the basis for fluorescence lifetime data, can also be considered as a parameter within the mpH²MM framework. However, unlike E_raw and S_raw, which are approximately binomially distributed, photon nanotimes distribute exponentially or sometimes according to a sum of exponentials. To transform photon nanotime data into a parameter that is also centrally distributed, and hence one that can be used within the mpH²MM framework, we propose a method for mapping the non-binomially distributed lifetime to a binomially distributed parameter amenable to mpH²MM (see Supplementary Note 3 for further details).

The new H2MM_C python package makes H²MM analysis much more practical, most analysis, for up to six states, take less time than the data acquisition times, given our modest hardware (a 2 year old middle-tier gaming laptop). See Supplementary Note 4 and Supplementary Tables 10 and 11 for system requirements and the duration of calculations in this paper. The supplied Jupyter notebooks provide examples for how to execute mpH²MM using FRETBursts. Experimenters using other platforms must utilize their knowledge of the fine details of their data to properly filter and cast their data into the simple and general format that the H2MM_C package⁷⁶ accepts. We also provide an in-depth tutorial available on Zenodo⁸³.

Methods

Production of YopO and MalE variants

The double-cysteine variant YopO L113C/L497C is produced and purified as described in Peter et al.⁷¹ and kindly provided by Gregor Hagelüken and Martin Peter, Institute of Structural Biology (University of Bonn). The double-cysteine variant MalE T36C/S352C is generated and purified according to methods reported previously¹⁷.

Labeling of MalE

The MalE variant T36C/352C is stochastically-labeled with Alexa Flour™ 555 and Alexa Fluor™ 647 dye derivatives as described in Peter et al. and deBoer et al.^17,84. The His₆-MalE double variant (200 μg) is incubated with 1 mM DTT and loaded immediately after on 200 μL (wet volume) Ni-Sepharose 6 Fast Flow resin, pre-equilibrated with labeling buffer 1 (50 mM Tris-HCl pH 7.4, 50 mM KCl). After a washing step with 50 column volumes labeling buffer 1, the loaded resin is incubated overnight at 4 ^∘C with 5-fold excess (25 nmol of each fluorophore dissolved in 1 mL of labeling buffer 1. Next, the resin is further washed with 50 column volumes labeling buffer 1 to remove the excess unbound fluorophores. Labeled protein is eluted with 800 μL elution buffer (50 mM Tris-HCl pH 8.0, 50 mM KCl, 500 mM imidazole) and further purified by size-exclusion chromatography (ÄKTA pure system, Superdex 75 Increase 10/300 GL column, GE Healthcare). Protein concentration is determined using the protein extinction coefficient and corrected for direct absorption of the fluorophores at 280 nm. Labeling efficiencies are estimated to be at least 60% for each fluorophore individually and donor-acceptor pairing at least 20%.

Labeled MalE is stored in 50 mM Tris-HCl pH7.4, 50 mM KCl and 1 mgmL⁻¹ bovine serum albumin (BSA) at 4 ^∘C for no more than 3 days. Concentrations ranged between 10 to 100 nM.

Labeling of YopO

The protein variant YopO L113C/L497C is stochastically-labeled with fluorophore-linked maleimide derivatives, as described previously⁸⁴. Briefly, 200 μg of protein is incubated with 5 mM DTT at 4 ^∘C for 30 min, to prevent oxidation of the cysteine thiol groups. The protein is loaded onto a PD Mini-Trap G-25 column (GE Healthcare) pre-equilibrated with Buffer A (50 mM Tris-HCl pH 7.4, 50 mM KCl) and subsequently eluted with 1 mL of Buffer A by gravity gel filtration, in order to eliminate the excess of DTT. The eluted protein is incubated overnight at 4 ^∘C with 50 nmol, respectively, of Alexa Fluor™ 555- and Alexa Fluor™ 647- C₂ maleimide (ThermoFisher Scientific). Excess dyes are removed again by gravity gel filtration using a PD Min-Trap G-25 column, as described above. The labeled protein is further purified from residual dyes and soluble aggregates by size-exclusion chromatography (SEC), with a Superdex™ 75 Increase 10/300 GL column, on an ÄKTA pure system (GE Healthcare). Protein concentration is determined using the protein extinction coefficient and corrected for direct absorption of the fluorophores at 280 nm.

Labeling efficiencies are estimated to be at least 60% for each fluorophore individually and donor-acceptor pairing at least 20%.

Experimental setup

Experimental setup for studies of HP3

We performed the nsALEX smFRET measurements of the doubly-labeled DNA hairpin construct⁷⁰ in the presence of 50, 100, 200, 250, 300, and 350 mM sodium chloride, using a confocal-based setup (ISS^TM, USA) assembled on top of an Olympus IX73 inverted microscope stand. We use a pulsed picosecond fiber laser (λ = 532 nm, pulse width of 100 ps FWHM, operating at 20 MHz repetition rate and 100 μW measured at the back aperture of the objective lens) for exciting the Cy3B donor dye (FL-532-PICO, CNI, China), and a pulsed picosecond diode laser (λ = 642 nm, pulse width of 100 ps FWHM, operating at 20 MHz repetition rate and 60 μW measured at the back aperture of the objective lens) for exciting the ATTO 647N acceptor dye (QuixX^® 642-140 PS, Omicron, GmbH), delayed by 25 ns. The laser beams pass through a polarization maintaining optical fiber and then further shaped by a linear polarizer and a halfwave plate. A dichroic beam splitter with high reflectivity at 532 and 640 nm (ZT532/640rpc, Chroma, USA) reflects the light through the optical path to a high numerical aperture (NA) super apochromatic objective (60X, NA = 1.2, water immersion, Olympus, Japan), which focuses the light onto a small confocal volume. The microscope collects the fluorescence from the excited molecules through the same objective, and focuses it with an achromatic lens (f = 100 mm) onto a 100 μm diameter pinhole (variable pinhole, motorized, tunable from 20 μm to 1 mm), and then re-collimates it with an achromatic lens (f = 100 mm). Then, donor and acceptor fluorescence are split between two detection channels using a dichroic mirror with a cutoff wavelength at λ = 652 nm (FF652-Di01-25x36, Semrock Rochester NY, USA). We further filter the donor and acceptor fluorescence from other light sources 585/40 nm (FF01-585/40-25, Semrock Rochester NY, USA) and 698/70 nm (FF01-698/70-25, Semrock Rochester NY, USA) band-pass filters, respectively, and detect the donor and acceptor fluorescence signals using two hybrid photomultipliers (Model R10467U-40, Hamamatsu, Japan), routed through a 4-to-1 router to a time-correlated single photon counting (TCSPC) module (SPC-150, Becker & Hickl, GmbH) as its START signal (the STOP signal is routed from the laser controller). We perform data acquisition using the VistaVision software (version 4.2.095, 64-bit, ISS^TM, USA) in the time-tagged time-resolved (TTTR) file format. After acquiring the data, we transform it into the photon HDF5 file format⁸⁵ for easy dissemination of raw data to the public, and easy input in the FRETBursts analysis software.

Experimental setup for studies of MalE

The nsALEX measurements on MalE are performed using a home-built setup, assembled around an Olympus IX73 inverted microscope stand. We use a picosecond pulsed diode laser (λ = 532 nm, pulse width of 100 ps FWHM, operating at 20 MHz repetition rate and 32 μW at the back aperture of the objective) for exciting the Alexa Fluor™ 555 donor (LDH-P-FA-530B, Picoquant GmbH), and a picosecond pulsed diode laser (λ = 640 nm, pulse width of 90 ps FWHM, operating at 20 MHz repetition rate, and 20 μW at the back aperture of the objective) to excite the Alexa Fluor™ 647 acceptor (LDH-D-C-640, Picoquant, GmbH), driven by the same PDL828 “Sepia II” (Picoquant, GmbH) controller. The laser light is guided into the microscope by a dual-edge beamsplitter (ZT532/640rpc Chroma/AHF, GmbH) and focused to a diffraction-limited excitation spot by an oil immersion objective (UPLSAPO 60XO, Olypus). The emitted light is collected through the same objective, spatially filtered through a 50 μm pinhole, and spectrally split into donor and acceptor channels by a single-edge dichroic mirror (H643 LPXR, AHF). The emission is filtered (donor: BrightLine HC 582/75, Semrock/AHF, acceptor: Longpass 647 LP Edge Basic, Semrock/AHF) and the signal is recorded with avalanche photodiodes (SPCM-AQRH-34, Excelitas) and a TCSPC module (HydraHarp400, Picoquant, GmbH). Data was acquired with Picoquant SymPhoTime 64 v2.7.

Coverslips are passivated with 1 mg mL⁻¹ BSA in PBS buffer before adding around 100 μL of sample. MalE stock solution is diluted to ~ 50 pM concentration in 50 mM Tris-HCl pH 7.4, 50 mM KCl, and either, none, 1 μM or 1 mM of the ligand maltose.

Experimental setup for studies of YopO

The μsALEX measurements of YopO are performed using the setup in Gebhardt et al.⁸⁶. These are conducted on the same home-built microscope as the MalE experiments, built around an Olympus IX71 base, although the lasers and dichroics are replaced as described below. We use a continuous wave λ = 532 nm diode laser (OBIS 532-10-LS, Coherent, USA) laser with 60 μW power measured at the back aperture of the objective to excite the donor Alexa Fluor™ 555 dye, and a continuous wave λ = 640 nm diode laser (OBIS 640-100-LX, Coherent, USA) with 25 μW power measured at the back aperture of the objective. The lasers are distally modulated by TTL pulses with an alternating frequency of 10 kHz, 20 kHz, and 100 kHz, for an alternation period of 100 μs 50 μs, and 10 μs, respectively. The lasers are combined and coupled into a polarization maintaining single-mode patch cable (P-3-488PM-FC2, Thorelabs, USA). The laser light is reflected into the objective by a dual-edge dichroic mirror (ZT532/640rpc, Chroma/AHF) and focused by a water immersion objective (UPlanSApo 60/1.2w, Olympus, GmbH). The dichroic mirrors, fluorescent filters and avalanche photodiodes are identical to those used for acquisiton of MalE data.

Coverlips are passivated with BSA as in MalE measurements. 100 μL of YopO solution, diluted to between 50 pM and 80 pM is used for each measurement in 50 mM Tris-HCl pH 7.4, 50 mM KCl. For measurements with actin, the buffer also contained 50 μM non-muscle human actin protein (Cytoskeleton, Inc) and 0.2 mM ATP and 0.2 mM CaCl₂.

Data is acquired using labVIEW v7.1 software as presented in Ingargiola et al.⁸⁷.

Burst selection

All data processing and analysis is performed using Jupyter Notebooks available in supplementary dataset, along with the accompanying photon-HDF5 files containing the raw data⁷². We perform burst search and selection using the FRETBursts analysis software⁸⁸. The background is assessed per each 30 s of acquisition, and bursts are identified as time periods were the instantaneous photon count rate of a sliding window of m = 10 consecutive photons is at least F = 6 times higher than the background rate. Bursts in the normal selection are selected if they include at least 30 photons in total between all streams. Visualizations are performed using FRETBursts’ dplot function, or matplotlib when greater customization is desired.

Single and multi-parameter H²MM analysis

Bursts identified by FRETBursts are then converted into a format readable by the H2MM_C software⁷⁶, by a simple function supplied in the Jupyter notebooks available in supplementary dataset⁷², this function is also responsible for applying the shift to acceptor excitation photons in μsALEX experiments (Supplementary Note 1.1.1). In spH²MM, only photons arising from donor excitation are considered, assigned to either donor or acceptor streams, identified by index 0 or 1, respectively, depending on at which detector they arrived. MpH²MM also considers photons arriving during acceptor excitation, assigning these photons an index of 2. All H²MM calculations are performed within the Jupyter notebooks, available in supplementary dataset⁷², using the Python package by Paul David Harris⁷⁶. We use the H²MM algorithm (both single- and multi-parameter) to test how well different state models describe the data.

Model selection

To choose the best model, we primarily use the ICL^74,75, where the state model reaching a minimal ICL is generally considered the one that describes the data best, with minimal free parameters. We always calculate sufficient numbers of state models to ensure ICL is minimized. The ICL parameter is defined in Eq. (1):

$$ICL(m)=-2\ln ({{{{{{{\bf{p}}}}}}}}({{{{{{{\bf{y}}}}}}}},\hat{{{{{{{{\bf{s}}}}}}}}}| m,\hat{{\lambda }_{m}}))+K\ln (n)$$

(1)

where $\ln ({{{{{{{\bf{p}}}}}}}}({{{{{{{\bf{y}}}}}}}},\hat{{{{{{{{\bf{s}}}}}}}}}| m,\hat{{\lambda }_{m}}))$ is the posterior probability of the most likely state path, as determined by the Viterbi algorithm, K is the number of free parameters in the model, and n is the number of photons in all bursts in the data set. K is calculated as in Eq. (2):

$$K={q}^{2}+(r-1)q-1$$

(2)

where q is the number of states the state model represents, and r is the number of photon streams used for the calculation of all of the parameters that are assessed. For spH²MM, r = 2, while for nsALEX mpH²MM, r = 3. The ICL is preferable as an extremum-based criterion over the previously proposed threshold based on the modified Bayes Information Criterion (BIC’)⁷. See supplementary dataset⁷² for Jupyter notebooks testing the reliability of ICL with simulated data sets generated using PyBroMo⁸⁹ (https://github.com/OpenSMFS/PyBroMo/releases/tag/0.8.1; was utilized in previous works^7,85,90). We use the Viterbi algorithm to find the most likely state path based on the posterior probability.

Viterbi analysis

From the state path, photons are separated into dwells, each of which can be assigned a duration, a mean E_raw, and for mpH²MM, a mean S_raw. This also allows bursts to be classified by which and how many states are present. As one measure of error, we use the weighted standard deviation and the weighted standard error of the E_raw and S_raw as a proxy for the standard error of the H²MM model (see Supplementary Note 1.3 for full derivation).

Error analysis by variance of subsets

Analysis of the variance of subsets is another method to assess the error of parameters (see Supplementary Note 1.4 for detailed description). This is implemented as a function in the Jupyter notebooks in the supplementary dataset⁷². This is an attractive approach, as it does not depend on any most likely state path like in the Vieterbi based approach. This method, however, is significantly more computationally expensive than the Viterbi approach.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The photon-HDF5 data, Jupyter notebooks, .csv data and H2MM_C code that support the findings of this study are available in the Zenodo and github repositories with the identifiers at: https://doi.org/10.5281/zenodo.5566809, and https://doi.org/10.5281/zenodo.5535302^72,76.

Code availability

The H2MM_C library used in this study is available on github https://github.com/harripd/H2MMpythonlib (commit 1f3d0a84f149d21a740161372526eb3742027602). The FRETbursts used in this study is available on github https://github.com/harripd/FRETBursts (commit 315c60d3791aa93cf2ec6e880003174c8192fc88). The phconvert code used n this study is available on github https://github.com/Photon-HDF5/phconvert v0.9 (commit 3a86e58f11f77e21c2a02a1d9453060db6811c9c). The PyBroMo code used in this study is available on github: https://github.com/tritemio/PyBroMo v0.8.1 (commit 8403ae750ff68796ef4118dd497478cf54355382). labVIEW code is available on github: https://github.com/multispot-software/MultichannelTimestamper.

References

Lerner, E. et al. Toward dynamic structural biology: Two decades of single-molecule Förster resonance energy transfer. Science 359, eaan1133 (2018).
Article PubMed PubMed Central Google Scholar
Lerner, E. et al. FRET-based dynamic structural biology: Challenges, perspectives and an appeal for open-science practices. eLife 10, e60416 (2021).
Article CAS PubMed PubMed Central Google Scholar
Schuler, B. & Hofmann, H. Single-molecule spectroscopy of protein folding dynamics-expanding scope and timescales. Curr. Opin. Struct. Biol. 23, 36–47 (2013).
Article CAS PubMed Google Scholar
Aviram, H. Y. et al. Direct observation of ultrafast large-scale dynamics of an enzyme under turnover conditions. Proc. Natl. Acad. Sci. 115, 3243–3248 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mazal, H. & Haran, G. Single-molecule FRET methods to study the dynamics of proteins at work. Curr. Opin. Biomed. Eng. 12, 8–17 (2019).
Article PubMed PubMed Central Google Scholar
Robb, N. C. et al. The transcription bubble of the RNA polymerase - promoter open complex exhibits conformational heterogeneity and millisecond-scale dynamics: implications for transcription start-site selection. J. Mol. Biol. 425, 875–885 (2013).
Article CAS PubMed Google Scholar
Lerner, E., Ingargiola, A. & Weiss, S. Characterizing highly dynamic conformational states: The transcription bubble in RNAP-promoter open complex as an example. J. Chem. Phys. 148, 123315 (2018).
Article ADS PubMed PubMed Central Google Scholar
Cristóvão, M. et al. Single-molecule multiparameter fluorescence spectroscopy reveals directional MutS binding to mismatched bases in DNA. Nucleic Acids Res. 40, 5448–5464 (2012).
Article PubMed PubMed Central Google Scholar
Fessl, T. et al. Dynamic action of the Sec machinery during initiation, protein translocation and termination. eLIFE 7, e35112 (2018).
Article PubMed PubMed Central Google Scholar
Calabrese, A. N. et al. Inter-domain dynamics in the chaperone SurA and multi-site binding to its outer membrane protein clients. Nat. Commun. 11, 2155 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Mazal, H. et al. Tunable microsecond dynamics of an allosteric switch regulate the activity of a AAA+ disaggregation machine. Nat. Commun. 10, 1438 (2019).
Article ADS PubMed PubMed Central Google Scholar
Zhao, Y. et al. Single-molecule dynamics of gating in a neurotransmitter transporter homologue. Nature 465, 188–193 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhao, Y. et al. Substrate-modulated gating dynamics in a Na+-coupled neurotransmitter transporter homologue. Nature 474, 109–113 (2011).
Article CAS PubMed PubMed Central Google Scholar
Erkens, G. B., Hänelt, I., Goudsmits, J. M. H., Slotboom, D. J. & van Oijen, A. M. Unsynchronised subunit motion in single trimeric sodium-coupled aspartate transporters. Nature 502, 119–123 (2013).
Article ADS CAS PubMed Google Scholar
Gouridis, G. et al. Conformational dynamics in substrate-binding domains influences transport in the ABC importer GlnPQ. Nat. Struct. Mol. Biol. 22, 57–64 (2015).
Article CAS PubMed Google Scholar
Husada, F. et al. Conformational dynamics of the ABC transporter McjD seen by single-molecule FRET. EMBO J. 37, 1–13 (2018).
Article Google Scholar
de Boer, M. et al. Conformational and dynamic plasticity in substrate-binding proteins underlies selective transport in ABC importers. eLIFE 8, e44652 (2019).
Article PubMed PubMed Central Google Scholar
Anthis, N. J. & Clore, G. M. Visualizing transient dark states by NMR spectroscopy. Q. Rev. Biophysics. 48, 35–116 (2015).
Article CAS Google Scholar
Clore, G. M. & Iwahara, J. Theory, practice, and applications of paramagnetic relaxation enhancement for the characterization of transient low-population states of biological macromolecules and their complexes. Chem. Rev. 109, 4108–4139 (2009).
Article CAS PubMed PubMed Central Google Scholar
Palmer, A. G. NMR characterization of the dynamics of biomacromolecules. Chem. Rev. 104, 3623–3640 (2004).
Article CAS PubMed Google Scholar
Ravera, E. et al. Insights into domain-domain motions in proteins and RNA from solution NMR. Acc. Chem. Res. 47, 3118–3126 (2014).
Article CAS PubMed PubMed Central Google Scholar
Su, Q. P. & Ju, L. A. Biophysical nanotools for single-molecule dynamics. Biophysical Rev. 10, 1349–1357 (2018).
Article CAS Google Scholar
Bavishi, K. & Hatzakis, N. Shedding light on protein folding, structural and functional dynamics by single molecule studies. Molecules 19, 19407–19434 (2014).
Article PubMed PubMed Central Google Scholar
Medina, E., R. Latham, D. & Sanabria, H. Unraveling protein’s structural dynamics: from configurational dynamics to ensemble switching guides functional mesoscale assemblies. Curr. Opin. Struct. Biol. 66, 129–138 (2021).
Article CAS PubMed Google Scholar
Mandal, S. S. Force spectroscopy on single molecules of life. ACS Omega. 5, 11271–11278 (2020).
Article CAS PubMed PubMed Central Google Scholar
Dimura, M. et al. Quantitative FRET studies and integrative modeling unravel the structure and dynamics of biomolecular systems. Curr. Opin. Struct. Biol. 40, 163–185 (2016).
Article CAS PubMed Google Scholar
Ha, T. et al. Probing the interaction between two single molecules: fluorescence resonance energy transfer between a single donor and a single acceptor. Proc. Natl Acad. Sci. 93, 6264–6268 (1996).
Article ADS CAS PubMed PubMed Central Google Scholar
Förster, T. Zwischenmolekulare energiewanderung und fluoreszenz. Ann. der Phys. 437, 55–75 (1948).
Article ADS MATH Google Scholar
Förster, T. 10th spiers memorial lecture. transfer mechanisms of electronic excitation. Discuss. Faraday Soc. 27, 7 (1959).
Article Google Scholar
Stryer, L. & Haugland, R. P. Energy transfer: a spectroscopic ruler. Proc. Natl Acad. Sci. 58, 719–726 (1967).
Article ADS CAS PubMed PubMed Central Google Scholar
Dahan, M. et al. Ratiometric measurement and identification of single diffusing molecules. Chem. Phys. 247, 85–106 (1999).
Article CAS Google Scholar
Deniz, A. A. et al. Single-pair fluorescence resonance energy transfer on freely diffusing molecules: Observation of Forster distance dependence and subpopulations. Proc. Natl Acad. Sci. 96, 3670–3675 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, N. K. et al. Accurate FRET measurements within single diffusing biomolecules using alternating-laser excitation. Biophysical J. 88, 2939–2953 (2005).
Article ADS CAS Google Scholar
Rothwell, P. J. et al. Multiparameter single-molecule fluorescence spectroscopy reveals heterogeneity of HIV-1 reverse transcriptase:primer/template complexes. Proc. Natl Acad. Sci. 100, 1655–1660 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Aviram, M., Felekyan, S., Gaiduk, A. & Seidel, C. A. Separating structural heterogeneities from stochastic variations in fluorescence resonance energy transfer distributions via photon distribution analysis. J. Phys. Chem. B 110, 6970–6978 (2006).
Article Google Scholar
Nir, E. et al. Shot-noise limited single-molecule FRET histograms: comparison between theory and experiments. J. Phys. Chem. B 110, 22103–22124 (2006).
Article CAS PubMed PubMed Central Google Scholar
Kalinin, S., Felekyan, S., Antonik, M. & Seidel, C. A. Probability distribution analysis of single-molecule fluorescence anisotropy and resonance energy transfer. J. Phys. Chem. B 111, 10253–10262 (2007).
Article CAS PubMed Google Scholar
Kalinin, S., Felekyan, S., Valeri, A. & Seidel, C. A. Characterizing multiple molecular states in single-molecule multiparameter fluorescence detection by probability distribution analysis. J. Phys. Chem. B 112, 8361–8374 (2008).
Article CAS PubMed Google Scholar
Kalinin, S., Valeri, A., Antonik, M., Felekyan, S. & Seidel, C. A. Detection of structural dynamics by FRET: a photon distribution and fluorescence lifetime analysis of systems with multiple states. J. Phys. Chem. B 114, 7983–7995 (2010).
Article CAS PubMed Google Scholar
Santoso, Y., Torella, J. P. & Kapanidis, A. N. Characterizing single-molecule FRET dynamics with probability distribution analysis. ChemPhysChem 11, 2209–2219. (2010).
Article Google Scholar
Torella, J. P., Holden, S. J., Santoso, Y., Hohlbein, J. & Kapanidis, A. N. Identifying molecular dynamics in single-molecule fret experiments with burst variance analysis. Biophysical J. 100, 1568–1577 (2011).
Article ADS CAS Google Scholar
Tomov, T. E. et al. Disentangling subpopulations in single-molecule FRET and ALEX experiments with photon distribution analysis. Biophysical J. 102, 1163–1173 (2012).
Article ADS CAS Google Scholar
Sisamakis, E., Valeri, A., Kalinin, S., Rothwell, P. J. & Seidel, C. A. Accurate single-molecule FRET studies using multiparameter fluorescence detection. In Methods in Enzymology, vol. 475, 455-514 (Elsevier Inc., 2010), 1 edn. https://doi.org/10.1016/S0076-6879(10)75018-7 https://linkinghub.elsevier.com/retrieve/pii/S0076687910750187.
Barth, A. et al. Unraveling multi-state molecular dynamics in single-molecule FRET experiments- Part I: Theory of FRET-Lines (2021). http://arxiv.org/abs/2107.14770.
Magde, D., Elson, E. & Webb, W. Thermodynamic fluctuations in a reacting system-measurement by fluorescence correlation spectroscopy. Phys. Rev. Lett. 29, 705 (1972).
Article ADS CAS Google Scholar
Rigler, R.et al. Fluorescence correlation spectroscopy with high count rate and low background: analysis of translational diffusion. In: Accounts of Chemical Research 22.10 (3 Oct. 1993), pp. 169–175. https://doi.org/10.1007/BF00185777
Widengren, J., Schweinberger, E., Berger, S. & Seidel, C. A. Two new concepts to measure fluorescence resonance energy transfer via fluorescence correlation spectroscopy: theory and experimental realizations. J. Phys. Chem. A 105, 6851–6866 (2001).
Article CAS Google Scholar
Torres, T. & Levitus, M. Measuring conformational dynamics: a new FCS-FRET approach. J. Phys. Chem. B 111, 7392–7400 (2007).
Article CAS PubMed Google Scholar
Gurunathan, K. & Levitus, M. FRET fluctuation spectroscopy of diffusing biopolymers: contributions of conformational dynamics and translational diffusion. J. Phys. Chem. B 114, 980–986 (2010).
Article CAS PubMed PubMed Central Google Scholar
Köllner, M. & Wolfrum, J. How many photons are necessary for fluorescence-lifetime measurements? Chem. Phys. Lett. 200, 199–204 (1992).
Article ADS Google Scholar
Zander, C. et al. Detection and characterization of single molecules in aqueous solution. Appl. Phys. B 63, 517–523 (1996).
Article ADS CAS Google Scholar
Maus, M. et al. An experimental comparison of the maximum likelihood estimation and nonlinear least-squares fluorescence lifetime analysis of single molecules. Anal. Chem. 73, 2078–2086 (2001).
Article CAS PubMed Google Scholar
Nettels, D., Gopich, I. V., Hoffmann, A. A. & Schuler, B. Ultrafast dynamics of protein collapse from single-molecule photon statistics. Proc. Natl Acad. Sci. 104, 2655–2660 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Chung, H. S., McHale, K., Louis, J. M. & Eaton, W. A. Single-molecule fluorescence experiments determine protein folding transition path times. Science. 335, 981–984 (2012).
Article ADS CAS PubMed Google Scholar
Keller, B. G., Kobitski, A., Jäschke, A., Nienhaus, U. G. & Noé, F. Complex RNA folding kinetics revealed by single-molecule FRET and hidden markov models. J. Am. Chem. Soc. 136, 4534–4543 (2014).
Article CAS PubMed PubMed Central Google Scholar
Pirchi, M. et al. Photon-by-photon hidden markov model analysis for microsecond single-molecule FRET kinetics. J. Phys. Chem. B 120, 13065–13075 (2016).
Article CAS PubMed Google Scholar
Gopich, I. V. & Szabo, A. Theory of the energy transfer efficiency and fluorescence lifetime distribution in single-molecule FRET. Proc. Natl Acad. Sci. 109, 7747–7752 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Ingargiola, A., Weiss, S. & Lerner, E. Monte carlo diffusion-enhanced photon inference: distance distributions and conformational dynamics in single-molecule FRET. J. Phys. Chem. B 122, 11598–11615 (2018).
Article CAS PubMed Google Scholar
Gopich, I. V. & Szabo, A. Decoding the pattern of photon colors in single-molecule FRET. J. Phys. Chem. B 113, 10965–10973 (2009).
Article CAS PubMed PubMed Central Google Scholar
Müller, B. K., Zaychikov, E., Bräuchle, C. & Lamb, D. C. Pulsed interleaved excitation. Biophysical J. 89, 3508–3522 (2005).
Article ADS Google Scholar
Laurence, T. A., Kong, X., Jager, M. & Weiss, S. Probing structural heterogeneities and fluctuations of nucleic acids and denatured proteins. Proc. Natl Acad. Sci. 102, 17348–17353 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Hohng, S., Joo, C. & Ha, T. Single-molecule three-color FRET. Biophysical J. 87, 1328–1337 (2004).
Article ADS CAS Google Scholar
Clamme, J.-P. & Deniz, A. A. Three-color single-molecule fluorescence resonance energy transfer. ChemPhysChem 6, 74–77 (2005).
Article CAS PubMed Google Scholar
Lee, N. K., Koh, H. R. & Kim, S. K. Folding of 8-17 deoxyribozyme studied by three-color alternating-laser excitation of single molecules. J. Am. Chem. Soc. 129, 15526–15534 (2007).
Article CAS PubMed Google Scholar
Lee, N. K. et al. Three-color alternating-laser excitation of single molecules: monitoring multiple interactions and distances. Biophysical J. 92, 303–312 (2007).
Article ADS CAS Google Scholar
Lee, S., Lee, J. & Hohng, S. Single-molecule three-color FRET with both negligible spectral overlap and long observation time. PLoS One 5, e12270 (2010).
Article ADS PubMed PubMed Central Google Scholar
Stein, I. H., Steinhauser, C. & Tinnefeld, P. Single-molecule four-color FRET visualizes energy-transfer paths on DNA origami. J. Am. Chem. Soc. 133, 719–726 (2011).
Article Google Scholar
Yim, S. W. et al. Four-color alternating-laser excitation single-molecule fluorescence spectroscopy for next-generation biodetection assays. Clin. Chem. 58, 707–716 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ratzke, C., Hellenkamp, B. & Hugel, T. Four-colour FRET reveals directionality in the Hsp90 multicomponent machinery. Nat. Commun. 5, 4192 (2014).
Article ADS CAS PubMed Google Scholar
Tsukanov, R., Tomov, T. E., Berger, Y., Liber, M. & Nir, E. Conformational dynamics of DNA hairpins at millisecond resolution obtained from analysis of single-molecule FRET histograms. J. Phys. Chem. B 117, 16105–16109 (2013).
Article CAS PubMed Google Scholar
Peter, M. F. et al. Studying conformational changes of the yersinia Type-III-secretion effector YopO in solution by integrative structural biology. Structure 27, 1416–1426 (2019).
Article Google Scholar
Harris, P. D. et al. Multi-parameter photon-by-photon hidden markov modeling dataset. https://zenodo.org/record/5902313 (2021).
Harris, P. D., Hamdan, S. M. & Habuchi, S. Relative contributions of base stacking and electrostatic repulsion on DNA nicks and gaps. J. Phys. Chem. B 124, 10663–10672 (2020).
Article CAS PubMed Google Scholar
Biernacki, C., Celeux, G. & Govaert, G. Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans. Pattern Anal. Mach. Intell. 22, 719–725 (2000).
Article Google Scholar
Celeux, G. & Durand, J.-B. Selecting hidden Markov model state number with cross-validated likelihood. Computational Stat. 23, 541–564 (2008).
Article MathSciNet MATH Google Scholar
Harris, P. D. H2MMpythonlib: simulated models (2021). https://zenodo.org/record/5535302.
Mächtel, R., Narducci, A., Griffith, D. A., Cordes, T. & Orelle, C. An integrated transport mechanism of the maltose ABC importer. Res. Microbiol. 170, 321–337 (2019).
Article PubMed PubMed Central Google Scholar
Kim, E. et al. A single-molecule dissection of ligand binding to a protein with intrinsic dynamics. Nat. Chem. Biol. 9, 313–318 (2013).
Ingargiola, A. Applying corrections in single-molecule FRET. bioRxiv083287 (2017). https://www.biorxiv.org/content/early/2017/02/01/083287.
Zickus, V. et al. Fluorescence lifetime imaging with a megapixel SPAD camera and neural network lifetime estimation. Sci. Rep. 10, 20986 (2020).
Article CAS PubMed PubMed Central Google Scholar
Harris, P. D. Fretbursts development version (2021). https://github.com/harripd/FRETBursts/tree/polarization.
Cao, A.-M. et al. Allosteric modulators enhance agonist efficacy by increasing the residence time of a GPCR in the active state. Nat. Commun. 12, 5426 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Harris, P. D. H2mm tutorial (2021). https://doi.org/10.5281/zenodo.5566886.
Peter, M. F. et al. Cross-validation of distance measurements in proteins by PELDOR/DEER and single-molecule FRET. bioRxiv 2020.11.23.394080 (2020). http://biorxiv.org/content/early/2020/11/23/2020.11.23.394080.abstract.
Ingargiola, A., Laurence, T., Boutelle, R., Weiss, S. & Michalet, X. Photon-HDF5. Biophysical J. 110, 25–33 (2016).
Article ADS Google Scholar
Gebhardt, C. et al. Molecular and spectroscopic characterization of green and red cyanine fluorophores from the alexa fluor and AF series. ChemPhysChem 22, 1566–1583 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ingargiola, A. et al. Multispot single-molecule FRET: High-throughput analysis of freely diffusing molecules. PLoS One 12, e0175766 (2017).
Article PubMed PubMed Central Google Scholar
Ingargiola, A., Lerner, E., Chung, S. Y., Weiss, S. & Michalet, X. FRETBursts: An open source toolkit for analysis of freely-diffusing Single-molecule FRET. PLoS ONE 11, 1–27 (2016).
Article Google Scholar
Ingargiola, A. FOpenSMFS/PyBroMo: Version 0.8.1. zenodo.org (2019).
Hagai, D. & Lerner, E. Systematic assessment of burst impurity in confocal-based single-molecule fluorescence detection using Brownian motion simulations. In: Molecules24 (2019) ISSN: 14203049. https://doi.org/10.3390/molecules24142557.

Download references

Acknowledgements

We thank Gregor Hagelücken and Martin Peter from the Institute of Structural Biology (University of Bonn, GER) for providing YopO. We would like to thank Robert Quast and Emmanuel Margeat for insightful discussions regarding the implementation of mpH²MM for the analysis of 4-detector nsALEX measurements (2-color smFRET, with fluorescence anisotropies), based on their existing data⁸². We would also like to thank Demain Lieberman for his helpful discussion regarding implementation of H²MM code, and Bill Harris for his help in enabling the H2MM_C code to work on Windows and Linux. This paper was supported by the National Institutes of Health (NIH, grant R01 GM130942 to S.W. and E.L. as a subaward), the National Science Foundation (NSF, grants 1818147 and 1842951 to S.W.), the Human Frontiers Science Program (HFSP, grant RGP0061/2019 to S.W.), the Israel Science Foundation (ISF, grant 3565/20 to E.L., within the KillCorona – Curbing Coronavirus Research Program), the Milner Fund (to E.L.), and the Hebrew University of Jerusalem (start-up funds to E.L.). Work in the lab of T.C. was financed by Deutsche Forschungsgemeinschaft (SFB863, project A13 and GRK2062, project C03), an ERC Starting Grant (No. 638536 – SM-IMPORT to T.C.) and by the Center of Nanoscience Munich (CeNS).

Author information

Authors and Affiliations

Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, Faculty of Mathematics & Science, The Edmond J. Safra Campus, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
Paul David Harris & Eitan Lerner
Physical and Synthetic Biology. Faculty of Biology, Ludwig-Maximilians-Universität München, Großhadernerstr. 2-4, 82152, Planegg-Martinsried, Germany
Alessandra Narducci, Christian Gebhardt & Thorben Cordes
Department of Chemistry and Biochemistry, and Department of Physiology, University of California, Los Angeles, CA, USA
Shimon Weiss
CaliforniaNanoSystems Institute, University of California, Los Angeles, CA, USA
Shimon Weiss
The Center for Nanoscience and Nanotechnology, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
Eitan Lerner

Authors

Paul David Harris
View author publications
You can also search for this author in PubMed Google Scholar
Alessandra Narducci
View author publications
You can also search for this author in PubMed Google Scholar
Christian Gebhardt
View author publications
You can also search for this author in PubMed Google Scholar
Thorben Cordes
View author publications
You can also search for this author in PubMed Google Scholar
Shimon Weiss
View author publications
You can also search for this author in PubMed Google Scholar
Eitan Lerner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.L. performed HP3 nsALEX measurements. C.G. performed MalE nsALEX measurements. A.N. and T.C. performed YopO μsALEX experiments. P.D.H. analyzed data and contributed analytical tools. P.D.H. & E.L. designed and performed the research and composed the initial manuscript. P.D.H., A.N., C.G., T.C., S.W. & E.L. discussed the data and contributed to the final version of the manuscript.

Corresponding authors

Correspondence to Paul David Harris or Eitan Lerner.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Harris, P.D., Narducci, A., Gebhardt, C. et al. Multi-parameter photon-by-photon hidden Markov modeling. Nat Commun 13, 1000 (2022). https://doi.org/10.1038/s41467-022-28632-x

Download citation

Received: 15 April 2021
Accepted: 03 February 2022
Published: 22 February 2022
DOI: https://doi.org/10.1038/s41467-022-28632-x

This article is cited by

Dynamic stability of Sgt2 enables selective and privileged client handover in a chaperone triad
- Hyunju Cho
- Yumeng Liu
- Shu-ou Shan
Nature Communications (2024)
Cross-validation of distance measurements in proteins by PELDOR/DEER and single-molecule FRET
- Martin F. Peter
- Christian Gebhardt
- Gregor Hagelueken
Nature Communications (2022)
A blind benchmark of analysis tools to infer kinetic rate constants from single-molecule FRET trajectories
- Markus Götz
- Anders Barth
- Sonja Schmid
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.