A particle-filter framework for robust cryo-EM 3D reconstruction

Hu, Mingxu; Yu, Hongkun; Gu, Kai; Wang, Zhao; Ruan, Huabin; Wang, Kunpeng; Ren, Siyuan; Li, Bing; Gan, Lin; Xu, Shizhen; Yang, Guangwen; Shen, Yuan; Li, Xueming

doi:10.1038/s41592-018-0223-8

Article
Published: 30 November 2018

A particle-filter framework for robust cryo-EM 3D reconstruction

Mingxu Hu ORCID: orcid.org/0000-0003-3603-3966^1,2,3^na1,
Hongkun Yu ORCID: orcid.org/0000-0002-2591-6328^3,4^na1,
Kai Gu⁵^na1,
Zhao Wang^1,3,4,
Huabin Ruan^1,2,
Kunpeng Wang^3,4,
Siyuan Ren^3,4,
Bing Li^3,4,
Lin Gan^3,4,
Shizhen Xu^3,4,
Guangwen Yang ORCID: orcid.org/0000-0002-8673-8254^3,4,
Yuan Shen ORCID: orcid.org/0000-0002-9396-1964⁵ &
…
Xueming Li ORCID: orcid.org/0000-0002-8451-9947^1,2,6

Nature Methods volume 15, pages 1083–1089 (2018)Cite this article

6396 Accesses
30 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Single-particle electron cryomicroscopy (cryo-EM) involves estimating a set of parameters for each particle image and reconstructing a 3D density map; robust algorithms with accurate parameter estimation are essential for high resolution and automation. We introduce a particle-filter algorithm for cryo-EM, which provides high-dimensional parameter estimation through a posterior probability density function (PDF) of the parameters given in the model and the experimental image. The framework uses a set of random support points to represent such a PDF and assigns weighting coefficients not only among the parameters of each particle but also among different particles. We implemented the algorithm in a new program named THUNDER, which features self-adaptive parameter adjustment, tolerance to bad particles, and per-particle defocus refinement. We tested the algorithm by using cryo-EM datasets for the cyclic-nucleotide-gated (CNG) channel, the proteasome, β-galactosidase, and an influenza hemagglutinin (HA) trimer, and observed substantial improvement in resolution.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Illustration of support points in the particle filter and the LFs in the rotation subspace of the CNG dataset.**

**Fig. 2: Support points in three subspaces.**

**Fig. 3: Comparison of the resolutions among different processing options for three datasets.**

**Fig. 4: z-height distribution of proteasome particles in ice.**

**Fig. 5: Comparison of the 3D reconstructions of the 40°-tilted dataset of the influenza HA trimer²⁷.**

Estimating conformational landscapes from Cryo-EM particles by 3D Zernike polynomials

Article Open access 11 January 2023

A Bayesian approach to extracting free-energy profiles from cryo-electron microscopy experiments

Article Open access 01 July 2021

A minority of final stacks yields superior amplitude in single-particle cryo-EM

Article Open access 10 December 2023

Data availability

The entry codes of the datasets from EMPIAR are EMPIAR-10025, EMPIAR-10061, and EMPIAR-10097. The entry codes of the density maps from the EMDB are EMD-6656, EMD-2984, EMD-6287, and EMD-8731. The entry codes of the structure models from the Protein Data Bank are PDB 1PMA, PDB 3WHE, PDB 5H3O, and PDB 5A1A. The calculated density maps that support the findings of this study are available as Supplementary Data and from the corresponding author upon reasonable request.

References

Merk, A. et al. Breaking cryo-EM resolution barriers to facilitate drug discovery. Cell 165, 1698–1707 (2016).
Article CAS PubMed Central Google Scholar
Cheng, Y. Single-particle cryo-EM at crystallographic resolution. Cell 161, 450–457 (2015).
Article CAS PubMed Central Google Scholar
Cheng, Y., Grigorieff, N., Penczek, P. A. & Walz, T. A primer to single-particle cryo-electron microscopy. Cell 161, 438–449 (2015).
Article CAS PubMed Central Google Scholar
Sigworth, F. J. A maximum-likelihood approach to single-particle image refinement. J. Struct. Biol. 122, 328–339 (1998).
Article CAS PubMed Central Google Scholar
Scheres, S. H. W. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Article CAS PubMed Central Google Scholar
Scheres, S. H. A Bayesian view on cryo-EM structure determination. J. Mol. Biol. 415, 406–418 (2012).
Article CAS PubMed Central Google Scholar
Scheres, S. H. Classification of structural heterogeneity by maximum-likelihood methods. Methods Enzymol. 482, 295–320 (2010).
Article CAS PubMed Central Google Scholar
Henderson, R. Avoiding the pitfalls of single particle cryo-electron microscopy: Einstein from noise. Proc. Natl Acad. Sci. USA 110, 18037–18041 (2013).
Article CAS PubMed Central Google Scholar
Rosenthal, P. B. & Rubinstein, J. L. Validating maps from single particle electron cryomicroscopy. Curr. Opin. Struct. Biol. 34, 135–144 (2015).
Article CAS PubMed Central Google Scholar
Henderson, R. et al. Tilt-pair analysis of images from a range of different specimens in single-particle electron cryomicroscopy. J. Mol. Biol. 413, 1028–1046 (2011).
Article CAS PubMed Central Google Scholar
Stewart, A. & Grigorieff, N. Noise bias in the refinement of structures derived from single particles. Ultramicroscopy 102, 67–84 (2004).
Article CAS PubMed Central Google Scholar
Grigorieff, N. FREALIGN: high-resolution refinement of single particle structures. J. Struct. Biol. 157, 117–125 (2007).
Article CAS PubMed Central Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Central Google Scholar
Arulampalam, M. S., Maskell, S., Gordon, N. & Clapp, T. A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking. IEEE Trans. Signal Process. 50, 174–188 (2002).
Article Google Scholar
Douc, R. & Cappe, O. Comparison of resampling schemes for particle filtering. in Proc. Int. Symp. Image Signal Process Anal. 64–69 (IEEE, Piscataway, NJ, USA, 2005).
Gustafsson, F. et al. Particle filters for positioning, navigation, and tracking. IEEE Trans. Signal Process. 50, 425–437 (2002).
Article Google Scholar
Li, M. et al. Structure of a eukaryotic cyclic-nucleotide-gated channel. Nature 542, 60–65 (2017).
Article CAS PubMed Central Google Scholar
Mindell, J. A. & Grigorieff, N. Accurate determination of local defocus and specimen tilt in electron microscopy. J. Struct. Biol. 142, 334–347 (2003).
Article PubMed Central Google Scholar
Gong, X. et al. Structural insights into the Niemann-Pick C1 (NPC1)-mediated cholesterol transfer and Ebola infection. Cell 165, 1467–1478 (2016).
Article CAS Google Scholar
Campbell, M. G., Veesler, D., Cheng, A., Potter, C. S. & Carragher, B. 2.8 Å resolution reconstruction of the Thermoplasma acidophilum 20S proteasome using cryo-electron microscopy. eLife 4, e06380 (2015).
Article Google Scholar
Bartesaghi, A. et al. 2.2 Å resolution cryo-EM structure of β-galactosidase in complex with a cell-permeant inhibitor. Science 348, 1147–1151 (2015).
Article CAS PubMed Central Google Scholar
Iudin, A., Korir, P. K., Salavert-Torres, J., Kleywegt, G. J. & Patwardhan, A. EMPIAR: a public archive for raw electron microscopy image data. Nat. Methods 13, 387–388 (2016).
Article CAS PubMed Central Google Scholar
Noble, A. J. et al. Routine single particle cryoEM sample and grid characterization by tomography. eLife 7, e34257 (2018).
Article PubMed Central Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article CAS PubMed Central Google Scholar
Brilot, A. F. et al. Beam-induced motion of vitrified specimen on holey carbon film. J. Struct. Biol. 177, 630–637 (2012).
Article CAS PubMed Central Google Scholar
Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Article CAS PubMed Central Google Scholar
Tan, Y. Z. et al. Addressing preferred specimen orientation in single-particle cryo-EM through tilting. Nat. Methods 14, 793–796 (2017).
Article CAS PubMed Central Google Scholar
Kay, S. M., Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory (Prentice Hall, Upper Saddle River, NJ, USA, 1993).
Weinstein, E. & Weiss, A. J. A general-class of lower bounds in parameter-estimation. IEEE Trans. Inf. Theory 34, 338–342 (1988).
Article Google Scholar
Xu, W., Baggeroer, A. B. & Richmond, C. D. Bayesian bounds for matched-field parameter estimation. IEEE Trans. Signal Process. 52, 3293–3305 (2004).
Article Google Scholar
Scheres, S. H. Beam-induced motion correction for sub-megadalton cryo-EM particles. eLife 3, e03665 (2014).
Article PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by funds from The National Key Research and Development Program (2016YFA0501102 and 2016YFA0501902 to X.L.), National Natural Science Foundation of China (31722015 and 31570730 to X.L., and 61672312 to G.Y.), Advanced Innovation Center for Structural Biology (to X.L., Y.S., and G.Y.), Tsinghua-Peking Joint Center for Life Sciences (to X.L.), One-Thousand Talent Program through the State Council of China (to X.L. and Y.S.), and Intel Parallel Computing Center project (to X.L.). We thank Y. Cheng (University of California San Francisco) for providing the T20S proteasome sample, and X. Zhou and C. Lin (Tsinghua University) for collecting the T20S proteasome data on CCD camera. We thank X. Lin (Intel) for help in optimizing the C++ code of THUNDER. We acknowledge the National Supercomputing Center in Wuxi and the Tsinghua University Branch of the China National Center for Protein Sciences Beijing for providing facility support in computation.

Author information

These authors contributed equally: Mingxu Hu, Hongkun Yu, Kai Gu.

Authors and Affiliations

MOE Key Laboratory of Protein Science, School of Life Sciences, Tsinghua University, Beijing, China
Mingxu Hu, Zhao Wang, Huabin Ruan & Xueming Li
Advanced Innovation Center for Structural Biology, Tsinghua University, Beijing, China
Mingxu Hu, Huabin Ruan & Xueming Li
National Supercomputing Center in Wuxi, Wuxi, China
Mingxu Hu, Hongkun Yu, Zhao Wang, Kunpeng Wang, Siyuan Ren, Bing Li, Lin Gan, Shizhen Xu & Guangwen Yang
Department of Computer Science and Technology, Tsinghua University, Beijing, China
Hongkun Yu, Zhao Wang, Kunpeng Wang, Siyuan Ren, Bing Li, Lin Gan, Shizhen Xu & Guangwen Yang
Department of Electronic Engineering, Tsinghua University, Beijing, China
Kai Gu & Yuan Shen
Tsinghua-Peking Joint Center for Life Sciences, Beijing, China
Xueming Li

Authors

Mingxu Hu
View author publications
You can also search for this author in PubMed Google Scholar
Hongkun Yu
View author publications
You can also search for this author in PubMed Google Scholar
Kai Gu
View author publications
You can also search for this author in PubMed Google Scholar
Zhao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Huabin Ruan
View author publications
You can also search for this author in PubMed Google Scholar
Kunpeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan Ren
View author publications
You can also search for this author in PubMed Google Scholar
Bing Li
View author publications
You can also search for this author in PubMed Google Scholar
Lin Gan
View author publications
You can also search for this author in PubMed Google Scholar
Shizhen Xu
View author publications
You can also search for this author in PubMed Google Scholar
Guangwen Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xueming Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.L., Y.S., and G.Y. conceived the project; Y.S., M.H., X.L., H.Y., and K.G. designed the algorithm; M.H. and H.Y. implemented the algorithm; M.H. and H.Y. designed and wrote the major part of the program with full functions of 3D reconstruction; K.W., S.R., B.L., L.G., and S.X. wrote part of the program; H.R. optimized and accelerated the CPU version; Z.W. wrote the GPU version; M.H. and X.L. performed the tests; X.L. wrote the manuscript; and all authors revised the manuscript.

Corresponding authors

Correspondence to Guangwen Yang, Yuan Shen or Xueming Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Integrated supplementary information

Supplementary Figure 1 3D classification of the CNG dataset.

a-d) are four classes calculated from 211,826 particles after one round of filtering by 2D classification in our previous work¹⁷. The side view (left) and top view (right) are shown. The model with C4 symmetry (PDB entry code: 5h3o) was docked in the maps. The uncovered models, pointed to by orange arrows, indicate the missing part of the subunit in the trans-membrane region of the CNG channel.

Supplementary Figure 2 3D classifications by THUNDER.

a) Density maps calculated from the dirty dataset of the proteasome. b) Density maps calculated from the dirty dataset of the β-galactosidase. Four classes were calculated with ellipsoid initial models. No symmetry was applied.

Supplementary Figure 3 FSC curves of CNG reconstructions with various numbers of support points after the scanning phase (the first phase of the iteration).

The tests were performed in the a) rotation subspace and b) x-y translation subspace. The nearly identical FSC curves with various numbers of support points implied that a small amount of support points can give accurate parameter estimation.

Supplementary Figure 4 Support points on different likelihood functions.

a) The support points distributed on a strong single peak of the LF which was calculated from a “good” particle image. b) The support points distributed on multiple strong peaks of the LF which was calculated from a “bad” particle image.

Supplementary Figure 5 Representative side-chain maps of the β-galactosidase reconstruction with various options.

Some aromatic rings in β-galactosidase showed clearer holes in the center than that in the published map (EMDB entry code: 2984) (Supplementary Fig. 5).

Supplementary Figure 6 Comparison of the resolution and estimated tilting angle by different programs for the 40^o-tilted dataset of the influenza HA trimer 27.

a) FSC curves corresponding to different method of local defocus determination. Two tests were performed to evaluate the defocus refinement by THUNDER. For the first one, CTFTilt¹⁸ was used to obtain the initial defocus estimation, and then THUNDER with defocus refinement was used to reprocess the dataset. The final resolution of the reconstruction was reported at 3.69 Å with most side-chains visible (Fig. 5b). The defocus refinement contributed a 0.9 Å resolution improvement (see the red dotted and solid lines). Another test was performed with the defocus value (estimated by GCTF²⁶) from the downloaded parameter file (EMPIAR entry code: 10097), and reported 3.81 Å resolution before the defocus refinement. This result is better than the published one, but slightly worse than the one above, indicating less defocus accuracy of GCTF than that of the defocus refinement by THUNDER. Unexpectedly, the defocus refinement on the basis of the GCTF results did not obviously improve the resolution. Considering the defocus refinement is just a local refinement, a possible reason is that the initial defocus error of some particles is beyond the working range of the defocus refinement. b) Histogram of tilt angles of micrographs. To further validate the estimated defocus, we made statistics of the tilt angles of micrographs, measured by plane fitting of the per-particle defocus values from the same micrograph. The histograms corresponding to CTFTilt and the defocus refinement of THUNDER are pretty identical and closed to the expected 40^o tilt. The one from GCTF (from the downloaded parameter file) has a different angle distribution from the other three, which might imply errors in measured defocus.

Supplementary Figure 7 The defocus refinement at different resolution range.

a) 3D reconstruction of the Thermoplasma acidophilum 20S proteasome calculated by THUDNER, and b) corresponding FSC curves before and after the defocus refinement. c, d and e) the histogram of the defocus statistics before and after the defocus refinement for the proteasome CCD dataset, CNG dataset and the high-resolution proteasome dataset from EMPIAR, respectively. To test the performance of the defocus refinement of THUNDER on low-quality and low-resolution dataset, we collected a dataset of proteasome on a Tecnai F20 200kV microscope with Gatan US4000 CCD camera. This dataset contains 42 untilted micrographs and 44 tilted micrographs (20^o tilted). The initial defocus was measured by CTFFind3 for untilted micrographs and CTFTilt for tilted micrographs (the per-particle defocus was estimated according to the measured tilting angle and tilting axis from CTFTilt). Because the per-particle defocus values measured directly from the entire micrograph might be very coarse, it was expected to see more significant improvement after defocus refinement. The final reconstruction at 8.1 Å resolution were calculated from 21404 particles. Comparing the FSC curves before and after defocus refinement, there is just a small improvement after defocus refinement. A further test was then designed by adding ~ 500 Å defocus shift to the refined defocus of each particle and then examining whether the defocus refinement by THUNDER could recover them. c) The histograms of defocus distribution. The black curve is the distribution of the initial defocus, and blue curve is that after a shift of ~ 500 Å, and the red curve is that after the defocus refinement. The vertical dashed lines indicate the mean of defocus distributions. It can be seen that the defocus is partially recovered after defocus refinement. We also did the same tests for d) the CNG and e) the high-resolution proteasome dataset from EMPIAR. It is seen that the defocus can be recovered nearly perfectly. These results indicate that the defocus refinement by THUNDER can also work for the data collected by a CCD camera. But the performance of defocus refinement is limited by the low sensitivity of 3D reconstruction to the defocus errors at ~ 9 Å resolution level.

Supplementary Figure 8 3D reconstruction of three datasets.

The a) CNG density map, b) proteasome density map and c) β-galactosidase density map are shown. The left images are the whole density maps with the resolution value labeled on the bottom. The right are the representative secondary structures segmented from the maps deposited in EMDB (top) and from the maps on the far left solved by THUNDER (bottom).

Supplementary Figure 9 Resolution comparisons of 3D reconstructions calculated by different programs.

a) FSC curves of the clean dataset of the proteasome. b) FSC curves of the dirty dataset of the proteasome. Three options of THUNDER, the grading and the defocus refinement enabled (red solid lines), only the grading enabled (red dashed lines), and both the grading and the defocus refinement disabled (red dotted lines), were tested, and the corresponding results were compared with RELION 2.1 (blue dotted lines) and cryoSPARC 0.6.5 (turquoise dotted lines). The clean dataset has less bad particles then the dirty one. Therefore, the resolutions with and without the particle grading are much closed, and also similar to the results from RELION and cryoSPARC. The dirty dataset should contain more bad particles, as expected, the improvement by the particle grading is more obvious than that with the clean dataset. Meantime, the improvements relative to THUNDER and cryoSPARC are also larger than that with the clean dataset, which further demonstrate the advantage of the bad particle tolerance. For the image processing, the default settings of three programs were used, and D7 symmetry was applied during 3D reconstructions. THUNDER used an ellipsoid map as initial model, and RELION used a cylinder with the similar size of the proteasome as initial model. For cryoSPARC, all particles in each dataset were first classified to one 3D class to generate one initial model, and then “Homogenous Refinement” were performed with this initial model.

Supplementary Figure 10 Scalability of the parallel computing of THUNDER.

Three dirty datasets were used to measure the computing time of a) 3D refinement and b) 3D classification of the CNG dataset, c) 3D refinement and d) 3D classification of the proteasome dataset, e) 3D refinement and f) 3D classification of the β-galactosidase dataset. Each job was submitted to 8, 16, 32 and 64 computing nodes, respectively. The computing time versus the number of computing nodes shows nearly ideal linear trend (red curves). Each computing node has two E5–2680v3 CPU and 128GB DDR4 RAM. The current version of THUNDER needs to load the entire dataset into RAM to reduce the pressure on the storage I/O, which requires that the sum of the available RAM spaces from all computing nodes is larger than the size of the dataset. Due this limitation, the β-galactosidase job on 8 computing nodes failed, and hence was not tested. The data loading strategy will be changed in the next version of THUNDER to avoid this problem.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hu, M., Yu, H., Gu, K. et al. A particle-filter framework for robust cryo-EM 3D reconstruction. Nat Methods 15, 1083–1089 (2018). https://doi.org/10.1038/s41592-018-0223-8

Download citation

Received: 17 October 2017
Accepted: 23 October 2018
Published: 30 November 2018
Issue Date: December 2018
DOI: https://doi.org/10.1038/s41592-018-0223-8

This article is cited by

A robust normalized local filter to estimate compositional heterogeneity directly from cryo-EM maps
- Björn O. Forsberg
- Pranav N. M. Shah
- Alister Burt
Nature Communications (2023)
EPicker is an exemplar-based continual learning approach for knowledge accumulation in cryoEM particle picking
- Xinyu Zhang
- Tianfang Zhao
- Xueming Li
Nature Communications (2022)
Structure deformation and curvature sensing of PIEZO1 in lipid membranes
- Xuzhong Yang
- Chao Lin
- Bailong Xiao
Nature (2022)
3D reconstruction from cryo-EM projection images using two spherical embeddings
- Yonggang Lu
- Jiaxuan Liu
- Jing He
Communications Biology (2022)
Accelerating the cryo-EM structure determination in RELION on GPU cluster
- Xin You
- Hailong Yang
- Depei Qian
Frontiers of Computer Science (2022)