Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Diffraction-limited molecular cluster quantification with Bayesian nonparametrics

A preprint version of the article is available at bioRxiv.

Abstract

Life’s fundamental processes involve multiple molecules operating in close proximity within cells. To probe the molecular composition of such small (diffraction-limited) regions, experiments often report on the total fluorescence intensity emitted from labeled molecules within. Methods exist to enumerate total fluorophore numbers (for example, step counting by photobleaching); however, these methods cannot treat photophysical dynamics nor learn their associated kinetic rates. Here we propose a method to simultaneously enumerate fluorophores and determine their photophysical properties. Although our focus here is on photophysical dynamics, such dynamics can also serve as a proxy for other types of dynamics such as the kinetics of assembly and disassembly of clusters. As the number of active fluorescent molecules at any given time is unknown, we rely on Bayesian nonparametrics to derive our kinetic estimates. We provide a versatile framework for enumerating up to 100 fluorophores from brightness time traces, benchmarked on synthetic and real datasets.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Description of the data generation.
Fig. 2: A graphical representation of the inverse model.
Fig. 3: Inference on simulated data.
Fig. 4: Comparison against other methods.
Fig. 5: Inference on real data.

Similar content being viewed by others

Data availability

The data analyzed in this project from ref. 23 were provided by D.-P. Herten, K. Yserentant and J. Hummert of the University of Birmingham, and can be obtained from D.-P. Herten (D.Herten@bham.ac.uk). Brightness time traces of the ROIs can be found on Zenodo50. Original movies are also available on Zenodo51,52,53,54,55. Source data are provided with this paper.

Code availability

Code can be found on Zenodo50.

References

  1. Jung, S.-R., Fujimoto, B. S. & Chiu, D. T. Quantitative microscopy based on single-molecule fluorescence. Curr. Opin. Chem. Biol. 39, 64–73 (2017).

    Article  Google Scholar 

  2. Lee, A., Tsekouras, K., Calderon, C., Bustamante, C. & Pressé, S. Unraveling the thousand word picture: an introduction to super-resolution data analysis. Chem. Rev. 117, 7276–7330 (2017).

    Article  Google Scholar 

  3. Leung, B. O. & Chou, K. C. Review of super-resolution fluorescence microscopy for biology. Appl. Spect. 65, 967–980 (2011).

    Article  Google Scholar 

  4. Cho, W.-K. et al. Mediator and RNA polymerase II clusters associate in transcription-dependent condensates. Science 361, 412–415 (2018).

    Article  Google Scholar 

  5. Cisse, I. I. et al. Real-time dynamics of RNA polymerase II clustering in live human cells. Science 341, 664–667 (2013).

    Article  Google Scholar 

  6. Thevathasan, J. V. et al. Nuclear pores as versatile reference standards for quantitative superresolution microscopy. Nat. Methods 16, 1045–1053 (2019).

    Article  Google Scholar 

  7. Wang, H. et al. Rubisco condensate formation by ccmm in β-carboxysome biogenesis. Nature 566, 131–135 (2019).

    Article  Google Scholar 

  8. Guo, Y. E. et al. Pol II phosphorylation regulates a switch between transcriptional and splicing condensates. Nature 572, 543–548 (2019).

    Article  Google Scholar 

  9. Banani, S. F., Lee, H. O., Hyman, A. A. & Rosen, M. K. Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 18, 285–298 (2017).

    Article  Google Scholar 

  10. Gabitto, M. et al. A Bayesian nonparametric approach to super-resolution single-molecule localization. Ann. Appl. Stat. 15, 1742–1766 (2021).

  11. Verdaasdonk, J. S., Lawrimore, J. & Bloom, K. in Methods in Cell Biology Vol. 123 (eds Waters, J. C. & Wittman, T.) 347–365 (Elsevier, 2014).

  12. Ulbrich, M. H. & Isacoff, E. Y. Subunit counting in membrane-bound proteins. Nat. Methods 4, 319–321 (2007).

  13. Aspelmeier, T., Egner, A. & Munk, A. Modern statistical challenges in high-resolution fluorescence microscopy. Annu. Rev. Stat. Appl. 2, 163–202 (2015).

  14. Nino, D., Djayakarsana, D. & Milstein, J. N. Nanoscopic stoichiometry and single-molecule counting. Small Methods 3, 1900082 (2019).

  15. Tsekouras, K., Custer, T. C., Jashnsaz, H., Walter, N. G. & Pressé, S. A novel method to accurately locate and count large numbers of steps by photobleaching. Mol. Biol. Cell 27, 3601–3615 (2016).

    Article  Google Scholar 

  16. Staudt, T. et al. Statistical molecule counting in super-resolution fluorescence microscopy: towards quantitative nanoscopy. Stat. Sci. 35, 92–111 (2020).

  17. Garry, J., Li, Y., Shew, B., Gradinaru, C. C. & Rutenberg, A. D. Bayesian counting of photobleaching steps with physical priors. J. Chem. Phys. 152, 024110 (2020).

    Article  Google Scholar 

  18. McGuire, H., Aurousseau, M. R., Bowie, D. & Blunck, R. Automating single subunit counting of membrane proteins in mammalian cells. J. Biol. Chem. 287, 35912–35921 (2012).

    Article  Google Scholar 

  19. Zhang, H. & Guo, P. Single molecule photobleaching (SMPB) technology for counting of RNA, DNA, protein and other molecules in nanoparticles and biological complexes by TIRF instrumentation. Methods 67, 169–176 (2014).

    Article  Google Scholar 

  20. Chen, Y., Deffenbaugh, N. C., Anderson, C. T. & Hancock, W. O. Molecular counting by photobleaching in protein complexes with many subunits: best practices and application to the cellulose synthesis complex. Mol. Biol. Cell 25, 3630–3642 (2014).

    Article  Google Scholar 

  21. Kalafut, B. & Visscher, K. An objective, model-independent method for detection of non-uniform steps in noisy signals. Comput. Phys. Commun. 179, 716–723 (2008).

    Article  Google Scholar 

  22. Xu, J. et al. Automated stoichiometry analysis of single-molecule fluorescence imaging traces via deep learning. J. Am. Chem. Soc.141, 6976–6985 (2019).

  23. Hummert, J. et al. Photobleaching step analysis for robust determination of protein complex stoichiometries. Mol. Biol. Cell 32, ar35 (2021).

    Article  Google Scholar 

  24. Gelman, A. et al. Bayesian Data Analysis (Chapman and Hall/CRC, 2013).

  25. Von Toussaint, U. Bayesian inference in physics. Rev. Modern Phys. 83, 943–999 (2011).

  26. Zhou, M., Hannah, L., Dunson, D. & Carin, L. in Artificial Intelligence and Statistics (eds Lawrence, N. D. & Girolami, M.)1462–1471 (PMLR, 2012).

  27. Thibaux, R. & Jordan, M. I. in Artificial Intelligence and Statistics 564–571 (2007).

  28. Glembockyte, V., Lin, J. & Cosa, G. Improving the photostability of red-and green-emissive single-molecule fluorophores via Ni2+ mediated excited triplet-state quenching. J. Phys. Chem. B 120, 11923–11929 (2016).

    Article  Google Scholar 

  29. Lu, H., Zhang, H., Yang, S. & Zheng, Z. Camera parameters auto-adjusting technique for robust robot vision. In 2010 IEEE International Conference on Robotics and Automation 1518–1523 (IEEE, 2010).

  30. Bishop, C. M. Pattern Recognition and Machine Learning (Springer, 2006).

  31. Rollins, G. C., Shin, J. Y., Bustamante, C. & Pressé, S. Stochastic approach to the molecular counting problem in superresolution microscopy. Proc. Natl Acad. Sci. USA 112, E110–E118 (2015).

    Article  Google Scholar 

  32. Shroff, H., White, H. & Betzig, E. Photoactivated localization microscopy (PALM) of adhesion complexes. Curr. Protoc. Cell Biol. 41, 4–21 (2008).

  33. Rust, M. J., Bates, M. & Zhuang, X. Stochastic optical reconstruction microscopy (STORM) provides sub-diffraction-limit image resolution. Nat. Methods 3, 793 (2006).

    Article  Google Scholar 

  34. Wampler, J. E. & Kutz, K. Quantitative fluorescence microscopy using photomultiplier tubes and imaging detectors. Methods Cell Biol. 29, 239–267 (1988).

    Article  Google Scholar 

  35. Zhang, Z., Wang, Y., Piestun, R. & Huang, Z.-L. Characterizing and correcting camera noise in back-illuminated SCMOS cameras. Opt. Express 29, 6668–6690 (2021).

  36. Hirsch, M., Wareham, R. J., Martin-Fernandez, M. L., Hobson, M. P. & Rolfe, D. J. A stochastic model for electron multiplication charge-coupled devices—from theory to practice. PLoS ONE 8, e53671 (2013).

    Article  Google Scholar 

  37. Huang, F. et al. Video-rate nanoscopy using scmos camera–specific single-molecule localization algorithms. Nat. Methods 10, 653 (2013).

    Article  Google Scholar 

  38. Robbins, M. S. & Hadwen, B. J. The noise performance of electron multiplying charge-coupled devices. IEEE Trans. Electron Devices 50, 1227–1232 (2003).

    Article  Google Scholar 

  39. Gael, J. V., Teh, Y. W. & Ghahramani, Z. The infinite factorial hidden Markov model. In Advances in Neural Information Processing Systems 1697–1704 (NeurIPS, 2009).

  40. Tavakoli, M. et al. Pitching single-focus confocal data analysis one photon at a time with Bayesian nonparametrics. Phys. Rev. X 10, 011021 (2020).

    Google Scholar 

  41. Jazani, S. et al. An alternative framework for fluorescence correlation spectroscopy. Nat. Commun. 10, 3662 (2019).

  42. Gilks, W. R. & Wild, P. Adaptive rejection sampling for Gibbs sampling. J. R. Stat. Soc. C 41, 337–348 (1992).

  43. Geman, S. & Geman, D. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. In IEEE Transactions on Pattern Analysis and Machine Intelligence 721–741 (IEEE, 1984).

  44. Robert, C. & Casella, G. Monte Carlo Statistical Methods (Springer, 2013).

  45. Sgouralis, I. & Pressé, S. An introduction to infinite HMMs for single-molecule data analysis. Biophys. J. 112, 2021–2029 (2017).

    Article  Google Scholar 

  46. Sgouralis, I. & Pressé, S. Icon: an adaptation of infinite HMMs for time traces with drift. Biophys. J. 112, 2117–2126 (2017).

    Article  Google Scholar 

  47. Kilic, Z., Sgouralis, I. & Pressé, S. Residence time analysis of RNA polymerase transcription dynamics: a Bayesian sticky HMM approach. Biophys. J. 120, 1665–1679 (2021).

    Article  Google Scholar 

  48. Sivia, D. & Skilling, J. Data Analysis: A Bayesian Tutorial (OUP, 2006).

  49. Betancourt, M., Byrne, S., Livingstone, S. & Girolami, M. The geometric foundations of Hamiltonian Monte Carlo. Bernoulli 23, 2257–2298 (2017).

    Article  MathSciNet  Google Scholar 

  50. Bryan, J. IV, Sgroualis, I. & Pressé, S. jsbryan4/FluorophoreCounter: organized data files. Zenodo https://doi.org/10.5281/zenodo.5829710 (2022).

  51. Bryan IV, J., Sgroualis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics: calibration data. Zenodo https://doi.org/10.5281/zenodo.5834051 (2022).

  52. Bryan IV, J., Sgroualis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics: 35 binding site data. Zenodo https://doi.org/10.5281/zenodo.5834725 (2022).

  53. Bryan IV, J., Sgroualis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics: 20 binding site data A. Zenodo https://doi.org/10.5281/zenodo.5834272 (2022).

  54. Bryan IV, J., Sgroualis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics: 20 binding site data B. Zenodo https://doi.org/10.5281/zenodo.5834730 (2022).

  55. Bryan IV, J., Sgroualis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics: 20 binding site data C. Zenodo https://doi.org/10.5281/zenodo.5834798 (2022).

Download references

Acknowledgements

We thank D.-P. Herten, K. Yserentant and J. Hummert of University of Birmingham for their collaboration and excellent data. S.P. acknowledges support from the NIH (grant numbers R01GM134426 and R01GM130745) and NSF (award number 1719537).

Author information

Authors and Affiliations

Authors

Contributions

S.P. and I.S. conceived of the project. J.S.B. performed the coding and development. S.P. oversaw all aspects of the project.

Corresponding author

Correspondence to Steve Pressé.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks Jean-Baptiste Masson, Ruobo Zhou and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Jie Pan, in collaboration with the Nature Computational Science team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–45 and Sections 1–4.

Source data

Source Data Fig. 3

Contains five csv files plus a README explaining how to use the csv files to reconstruct the figure. The five csvs are as follows: fig3_data_trace.txt, measured brightness trace for an ROI; fig3_MAP_trace.txt, inferred MAP brightness trace; fig3_num_flor_ground_truth.txt, ground truth number of fluorophores in each ROI; fig3_num_flor_samples.txt, MCMC samples of the number of fluorophores in each ROI; fig3_error.txt, difference between the MCMC samples and the ground truth.

Source Data Fig. 4

Contains 40 csv files plus a README explaining how to use the csv files to reconstruct the figure. The 40 csv files are organized as follows: all files start with ‘fig4_’. After that, the file name will include either ‘base’, ‘startdark’ or ‘highnoise’, indicating which dataset it is from. After that, the file name will include either ‘trace’, ‘numflor’ or ‘error’, indicating what type of data it is. The file name will end with either ‘change_point’, ‘our_method’, ‘ruler_method’, ‘two_state’ or ‘data’, indicating where the data comes from. All ‘trace’ csv files are brightness versus time traces. The data trace is the measurement for an ROI. All ‘numflor’ csv files are estimates for the number of fluorophores in each ROI, with ‘data’ indicating the ground truth. (Note that ‘our_method’ is larger than the others because we provide estimates for many iterations of the MCMC chain) All ‘error’ csv files are the differences between the method estimate for the number of fluorophores in the ROI and the actual number of fluorophores.

Source Data Fig. 5

Contains 12 csv files plus a README explaining how to use the csv files to generate the figure. The csv files are organized as follows: there are six ‘fig5_XX_site_ground_truth.txt’ files, which have the expected distribution for the number of fluorophores; there are six ‘fig5_XX_site_samples.txt’, which are MCMC samples for the number of fluorophores in each ROI. XX indicates which dataset the file is from and can be one of the following: 20, 40, 80, 35, 70, 140.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bryan IV, J.S., Sgouralis, I. & Pressé, S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics. Nat Comput Sci 2, 102–111 (2022). https://doi.org/10.1038/s43588-022-00197-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s43588-022-00197-1

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing