The fast development of single-particle cryogenic electron microscopy (cryo-EM) has made it more feasible to obtain the 3D structure of well-behaved macromolecules with a molecular weight higher than 300 kDa at ~3 Å resolution. However, it remains a challenge to obtain the high-resolution structures of molecules smaller than 200 kDa using single-particle cryo-EM. In this work, we apply the Cs-corrector-VPP-coupled cryo-EM to study the 52 kDa streptavidin (SA) protein supported on a thin layer of graphene and embedded in vitreous ice. We are able to solve both the apo-SA and biotin-bound SA structures at near-atomic resolution using single-particle cryo-EM. We demonstrate that the method has the potential to determine the structures of molecules as small as 39 kDa.
With recent technical breakthroughs, cryogenic electron microscopy (cryo-EM) has rapidly become one of the most powerful and efficient technologies to investigate the structures of macromolecules at near-atomic resolution. Of the various cryo-EM structural determination methods, single-particle analysis (SPA) has drawn the most attention from structural biologists because of its relatively well-established methods for specimen preparation, data collection, image processing, and structural determination1,2,3. Thanks to the significant improvements in the recording speed and detective quantum efficiency of the direct electron detection cameras, more information at both low and high resolutions can be recovered from raw movie stacks, thus improving the reconstruction accuracy4. New algorithms based on Bayesian statistics have also greatly improved the efficiency of extracting signal from noisy micrographs and heterogeneous datasets5,6,7,8,9. Currently, it has become increasingly routine to reconstruct a well-behaved macromolecule with good homogeneity, rigidity, and random orientations in ice and a molecular weight larger than 300 kDa at ~3 Å resolution. In contrast, it remains a challenge to solve high-resolution structure of proteins with a smaller molecular weight, especially those below 100 kDa, using SPA Cryo-EM. The major hurdle lies in the weak contrast of the small-sized molecules embedded in vitreous ice using conventional transmission electron microscopy (CTEM). Another major obstacle remaining in SPA cryo-EM is the adsorption of proteins to the air–water interface (AWI) of the thin layer of solution during the cryo-specimen preparation3,10,11. Until now, the smallest protein resolved by CTEM using SPA at near-atomic resolution is the 2.9 Å resolution structure of the 64 kDa methemoglobin12,13.
Recent hardware developments have introduced to cryo-EM new electron optical apparatuses, including energy filter, Cs-corrector, and Volta phase plate (VPP), to further improve the imaging quality. The VPP can introduce an extra phase shift to the contrast transfer function (CTF) of the objective lens, thus increasing the low-frequency signal of weak-phase objects such as frozen-hydrated biological molecules14,15,16. With new algorithms supporting the CTF determination and correction of micrographs taken with VPP6,17,18, it was shown that VPP can be used to study various structures at near-atomic resolution, including the 64 kDa hemoglobin at 3.2 Å resolution19,20,21,22. Using a combination of VPP and the Cs-corrector, we demonstrated that the structure of apo-ferritin can be solved at near-atomic resolution in both under- and over-focus modes of the objective lens23.
In this work, we use SPA cryo-EM with VPP and Cs-corrector to determine the structure of SA with a molecular weight of ~52 kDa. Different from hemoglobin that consists mostly of α-helices, SA is constituted by mainly β-strands. Our work demonstrates that the VPP can be used in SPA to resolve SA in both the apo-state and biotin-bound state at near-atomic resolution. We also find that graphene films can serve as good supporting materials to keep the SA in multiple orientations for the high-resolution structural determination. Our results prove in principle the capability of SPA cryo-EM to solve the atomic models of small-sized proteins and their ligand-bound complexes. This development would be of potential application in structure-based drug discovery.
Preparation of frozen-hydrated SA specimen on graphene film
In this study, we used a single-crystalline monolayer graphene over a Quantifoil R0.6/1 gold grid as the supporting film to facilitate the cryogenic SA specimen preparation (see Methods for more details). Using a modified version of our previous imaging strategy to combine the Cs-corrector and VPP for cryo-EM, we were able to collect high-resolution datasets of vitrified SA specimens with high efficiency (Methods). When examined under the VPP-Cs-corrector-coupled Titan Krios at 300 kV with phase shift ranging from 30° to 120°, the SA specimens demonstrated monodisperse particles with a high contrast that could be easily identified and picked using automatic algorithms (Fig. 1a). We found that the single-crystalline graphene with a monolayer of carbon atoms introduced very low background noise to the specimen and could also serve as a good reference for the assessment of the cryo-EM image quality and motion correction with its hexagonal lattice signal24,25,26. After the motion correction of the raw movie stacks of the specimen, we calculated the Fourier transform of the motion-corrected micrographs. In micrographs with good quality, we observed clear reflection spots at 2.13 Å resolution in a hexagonal pattern corresponding to the graphene lattice at its first order (Supplementary Fig. 1), indicating a successful motion correction with high-resolution information recovered to at least 2.13 Å. It is worth noting that these reflection spots were not clear or sharp enough without the proper motion correction (Supplementary Fig. 1). Therefore, the sharpness of the reflection spots of single-crystalline graphene in the Fourier transform can serve as a good indicator to judge the quality of the micrographs and the motion-correction efficiency. We also examined the Fourier transforms of various areas on the same specimen grid and found that most of them demonstrated a consistently hexagonal lattice diffraction pattern in the same orientation, indicating the presence of a single-crystalline graphene film over the grid (Supplementary Fig. 1D).
Single-particle reconstruction of SA by VPP-cryo-EM
Using the automatic particle picking algorithm Gautomatch, we extracted ~710,000 and 1,350,000 particle images from the good motion-corrected micrographs of SA in the absence and presence of biotin, respectively, and applied a 120 Å Fourier high-pass filter to the particles prior to further processing (Supplementary Fig. 2). The high-pass filter turned out to be necessary for the correct alignment of the particle images (Supplementary Fig. 2), probably reducing the low-frequency background bias, in agreement with our previous results27. Reference-free two-dimensional (2D) alignment and classification from such datasets yielded 2D class averages with clear secondary structural features that matched the atomic model of the SA protein (Supplementary Fig. 2B and 2C). Using an initial model generated de novo by the Stochastic Gradient Descent (SGD) method in Relion6, we performed multiple rounds of three-dimensional (3D) classifications to screen the best particles for the final 3D refinement and reconstruction (Supplementary Figs. 3 and 4). In the end, we obtained a reconstruction of apo-SA at 3.3 Å resolution (with D2 symmetry applied during the refinement, Fig. 1c) from a final dataset composed of ~24,000 particles (Supplementary Table 1) and a reconstruction of the SA–biotin complex at 3.2 Å resolution (with D2 symmetry applied during the refinement, Fig. 1d) from a final dataset comprising ~45,000 particles (Fig. 1e, Supplementary Table 1). We also performed reconstructions of the two different states without imposing any symmetry (Supplementary Fig. 5A). These reconstructions have a very similar map quality to those calculated with the D2 symmetry, albeit with slightly lower resolutions (Supplementary Fig. 5C).
The 3D reconstructions of SA in its apo- and biotin-bound states were both clear enough to depict all the secondary structural elements and most of the side chains (Figs. 2 and 3, Supplementary Movie 1). The atomic model of SA solved previously by X-ray crystallography (PDB 1MEP28) can fit into the EM densities with a correlation coefficient ~0.74, indicating the structural fidelity of SA in its crystallographic and soluble forms. The density of biotin in the SA–biotin reconstruction can be precisely identified with the unambiguous docking of biotin’s atomic model (Fig. 2). Compared with the biotin-bound SA, the density corresponding to loop 46–51 in the EM map of apo-SA was missing (Fig. 2), indicating that this lid-like loop is flexible without ligand binding. In contrast, this loop can be clearly defined in the EM map of biotin-bound SA, in which the major side chains (ASN23, SER27, TYR43, ASN49, and SER88) forming a stable hydrogen bond network around the biotin ligand are well resolved (Fig. 2).
Focused classification analysis of the biotin-binding pocket
A critical problem in drug discovery is to identify the ligand-binding site of target proteins. We wondered whether the ligand-binding site could be determined via image processing in small proteins such as SA without prior knowledge29,30,31,32,33. As SA is a tetramer and has four biotin-binding sites in each protein, we treated each SA monomer (with one binding pocket) as an asymmetric unit and used the angular information from the reconstruction with D2 symmetry to align the four asymmetric units from the same particle to a given orientation. This step generated a dataset four times larger and comprising roughly aligned asymmetric particles, thus called the asymmetric particle dataset. After a local search refinement with C1 symmetry, the asymmetric particle dataset was subjected to 35 iterations of 3D classification into 4 classes in a skip-alignment mode in Relion. Without specifically focusing on the binding pocket, a soft mask slightly larger than the SA monomer was applied in either the refinement or classification. We performed this 3D skip-alignment classification analysis of the apo-SA and biotin-SA datasets separately, and found a rather small occupancy variance around the biotin-binding pocket among the different classes in each dataset (Supplementary Fig. 6A and 6B), demonstrating unambiguously the lack of biotin in all the monomers of apo-SA and the full occupancy of biotin in all the monomers of biotin-SA. This result occurs because SA has a very strong binding affinity to biotin and the condition of the biotin-SA specimen allowed the full occupancy of the protein’s ligand-binding sites. The ligand occupancy, however, may not be full for other proteins and other conditions. Thus, we tried to test whether we could extract the ligand-binding information by image processing from particles with a partial ligand occupancy. We mixed the apo-SA and biotin-bound SA datasets, and analysed them as one dataset for the 3D refinement. The reconstruction of the mixed dataset demonstrated a structure showing a biotin-like density in the binding pocket at 3.1 Å resolution (Fig. 4a, b, Supplementary Fig. 5B). From this mixed dataset of apo-SA and biotin-SA, the 3D skip-alignment classification of the asymmetric unit into four classes illustrated distinct differences in the biotin-binding pocket (Fig. 4c). Although Class II was vacant of biotin, the other three classes all had partial biotin occupancy in the binding pocket. We further refined the 3D reconstructions using particles in Class II (Fig. 4d) or the merged Class I–III–IV (Fig. 4e) individually. The refined 3D maps showed more clearly that the Class II reconstruction lacked a density corresponding to loop 46–51 and the biotin ligand (Fig. 4d, red circle), whereas the Class I–III–IV reconstruction maintained both clearly (Fig. 4e, blue circle).
We further randomly split the biotin-SA dataset into 20 subsets (9140 monomers in each subset) and mixed different numbers of them with the apo-SA dataset to generate 20 mixed datasets with different ratios of biotin-SA/apo-SA. We then performed the 3D refinement of these mixed datasets. As the biotin-SA/apo-SA ratio increased, the density of loop 46–51 and biotin molecules in the reconstructions became clearer and was recognizable when the ratio was higher than 0.5 (Supplementary Fig. 6C). From the mixed dataset of M5 with a biotin-SA/apo-SA ratio of 0.5, we could further classify it to separate the biotin-bound SA structural features (Supplementary Fig. 6D). The results above implicated the capability of the heterogeneity analysis for the ligand-binding detection of proteins as small as SA by single-particle cryo-EM.
Reconstruction of sub-tetrameric SA from subtracted dataset
Although the 52 kDa SA is the smallest protein that has been resolved at near-atomic resolution using SPA cryo-EM until now, we were wondering whether SPA cryo-EM is capable of reconstructing even smaller proteins. We used the particle segmentation and subtraction algorithms34,35 that are currently available in Relion to generate monomeric (13 kDa), dimeric (26 kDa), and trimeric (39 kDa) SA datasets from raw biotin-SA datasets in silica (Fig. 5a). The subtracted SA datasets had smaller molecular weights and broke the intrinsic D2 symmetry of SA and therefore the signal for the proper alignment was even weaker.
We first tested whether there was enough signal in the subtracted dataset for 2D classification with the correct angular information. The angular information of each subtracted particle was calculated in accordance with its relative orientation in the original tetrameric SA particle as well as the angular information of that tetrameric SA image in the final tetramer reconstruction. When using the correct angular information without alignment, all three subtracted datasets generated good 2D class averages with correct shapes and features (Fig. 5b, Skip Align). By removing all the angular information, we performed the reference-free 2D alignment and classification of the three datasets from scratch in Relion. In this procedure, the 13 kDa monomeric dataset generated 2D class averages with roughly correct outlines but much noisier features than the perfectly aligned controls in different views (Fig. 5b, Search Align, left panel), suggesting more alignment error in the reference-free alignment. The 26 kDa dimeric dataset generated one well-aligned view (Fig. 5b, Search Align, middle panel), whereas the other views were misaligned. The 39 kDa trimeric dataset generated correct shapes and features in multiple views (Fig. 5b, right panel as representatives), indicating a successful reference-free alignment.
To verify whether the subtracted datasets can still generate valid 3D reconstructions, we used the correct angular information to perform local 3D refinement. Indeed, given the correct angular information, all the three subtracted datasets yielded correct reconstructions (Fig. 5c). We further tested whether the images from those three datasets had enough signals to search for the correct angular information without any prior knowledge. The 39 kDa trimeric dataset had enough signals to generate a correct 3D reconstruction via a global angular search from scratch (Fig. 5c). In contrast, the monomeric and dimeric datasets failed to reconstruct high-resolution structures in global refinement (Fig. 5d), probably due to the lack of sufficient signals to align.
The 3D refinement results of the three datasets were consistent with the 2D classification, indicating that: (1) all datasets contained enough signals for reconstruction at high resolution if the angular information is correct and (2) the 39 kDa trimeric SA images already contained enough signals for image processing from scratch to obtain a high-resolution structure. The results also indicated that good 2D class averages with clear features would provide a high possibility of successful reconstruction. In our results, the 26 kDa dimeric dataset could generate high-quality 2D class averages of certain orientations but not all of the views. The lack of accuracy of the alignments in the other orientations probably caused the failure of its 3D refinement. We infer that the major constituents of the β-strands in SA made the alignment difficult in some orientations. Nevertheless, the successful reconstruction of the trimer dataset indicated the capability of solving an asymmetric protein structure with a molecular weight ~39 kDa at near-atomic resolution by SPA cryo-EM.
Distribution of SA particles in the vitrified specimen
We noticed that even after the careful scrutiny of the SA particle images by 2D classification to remove all obvious junk or bad particles, only ~20% (79,289 vs. 378,987 for the apo-SA, Supplementary Fig. 3) of the seemingly good particles contributed to the correct high-resolution reconstruction after the 3D classification. Indeed, despite our various efforts on image processing procedures, the other 80% of the particle images did not generate reconstructions with clear secondary structural details, even though they appeared very similar in our eyes to the good particles for the high-resolution reconstructions. We confirmed that the original micrographs containing these particles were of high quality.
We set out to investigate what made the difference for the particles to contribute to the high-resolution reconstruction. It has been hypothesized that the adsorption of protein molecules on the AWI may cause the denaturation or partial unfolding of the protein3,10,11. We were wondering whether the location of the particles in the thin layer of vitreous ice caused the variation of the image quality for the high-resolution reconstruction. We therefore performed the electron tomography of the same grid for the single-particle data collection of SA on the graphene-supporting film using VPP-Cs-corrector-coupled cryo-EM. The 3D reconstructions of the tomograms were clear enough for us to depict the SA particle distribution in the specimen (Supplementary Movies 2 and 3, Fig. 6, and Supplementary Fig. 7). It is interesting to see that the SA particles distributed mainly in two different layers along the z-direction, one on the AWI and the other on the graphene–water interface (GWI) (Fig. 6a, Supplementary Fig. 7). There were very few particles between these two layers. This observation suggests that during the specimen preparation, the SA molecules either stuck to the GWI or got adsorbed onto the AWI. Surprisingly, the particles on the GWI had an uneven distribution, mostly in clustering areas (Fig. 6a, red arrow) and only a few in lacuna areas (Fig. 6a, blue arrow). In contrast, the particles on AWI showed a more uniform and dispersed distribution (Fig. 6b). Such a phenomenon was observed in both relatively thick (~50 nm, Supplementary Fig. 7C, Supplementary Movie 2) and thin (~10 nm, Supplementary Fig. 7D, Supplementary Movie 3) ice. The electron tomography analysis implied that the micrographs of SA single particles collected at a zero-degree tilt actually reflected the superposition of the particles on both GWI and AWI.
We analysed all the micrographs that were used for the single-particle reconstruction by sorting them by the percentage of good particles classified in the correct high-resolution reconstruction. We found that the best micrographs with the highest percentages of good particle images had uneven particle distributions (Fig. 6c), and that the good particles contributing to the correct high-resolution reconstruction came mostly from the clustering areas with a similar pattern to those on the GWI, as revealed by electron tomography (Fig. 6d). The particles in the more uniform and dispersed areas contributed much less to the final high-resolution reconstruction.
Intrigued by the observation of the particle distribution, we went through the entire apo-SA dataset to verify the potential correlation between the particle distribution on the grids and their contribution to the high-resolution reconstructions. Out of the 1450 micrographs of apo-SA, we manually selected 749 micrographs with clear features of particle clustering and extracted 212,105 particles from these micrographs using the particle position information calculated from the previous reference-free 2D alignment and classification of the entire apo-SA dataset. Based on the location of each particle, the 212,105 particles were manually divided into two subsets with 134,606 particles in the clustering regions (subset A) and 77,499 particles in the uniformly dispersed regions (subset B) (Fig. 6a–d). Ideally, such a division should put most of the particles on the GWI in subset A and leave subset B with mainly particles on the AWI. We subsequently correlated the particles in the two subsets to the particles in the different steps during the 3D classification and refinement of the entire apo-SA dataset (Supplementary Fig. 3). This assigned 44,326 particles from the 749 micrographs that contributed to the 3.5 Å resolution reconstruction after the first round of 3D classification. We immediately noticed that among the 44,326 particles, 35,676, accounting for >80%, were from subset A and only 8,650, accounting for <20%, were from subset B (Fig. 6e, f, Supplementary Table 2). It is also worth noting that the percentage of particles retained in the best 3D class from subset A is 26.5% (35,676/134,606; Supplementary Table 2), higher than that from subset B, 11.1% (8,650/77,499; Supplementary Table 2). The fact that the contribution of the particles from subset A to the best 3D class increased from 63.5% (134,606/212,105) to 80.5% (35,676/44,326) after the classification indicates that a larger portion of the molecules on the GWI were well-preserved.
To further understand the difference between the two subsets of particles, we performed a reference-free 2D classification on them and found that the subset B particles exhibited more severe preferential orientations than those of subset A (Fig. 6g, Supplementary Fig. 8A, B). We also compared the 3D reconstructions of the particles in subsets A and B, either using angular information from the previous 3D refinement of the entire apo-SA dataset or by recalculating them from scratch. The maps from subset A consistently demonstrated the correct features of the SA molecule, whereas the maps from subset B were of poor quality (Supplementary Figs. 8C, D, 9).
To further investigate the effect of AWI on the structure of SA molecules, we performed cryo-EM analysis of apo-SA on regular holey carbon grids without graphene support. SPA of these SA molecules demonstrated a strong preferential orientation, which is similar to that of subset B in the above analysis (Supplementary Fig. 10).
As SPA cryo-EM has become a powerful method for solving structure of supramolecular complexes with a large molecular weight, the question of how small a molecule can be solved at a near-atomic resolution by this method has drawn more attention. Here we demonstrated that using VPP and Cs-corrector, SPA cryo-EM can solve SA, with a molecular weight of ~50 kDa, at ~3 Å resolution, good enough to determine the ligand-binding site. By combination with particle subtraction analysis, we could push the lower boundary of the molecular weight further to at least a 39 kDa asymmetric tetramer at ~3 Å resolution. Although a 3 Å resolution reconstruction is not sufficient to accurately assign every atom from a specific small molecule, with additional information on the possible conformations of the molecule, we could identify the binding pocket and possible interactions (such as hydrogen bonds) between the protein and ligands. More recent progress in algorithm development has enabled the solution of macromolecules with a large molecular weight and high symmetry at better than 2 Å resolution36,37,38. Previous theoretical predictions suggested that SPA could determine the atomic structure of proteins with a molecular weight as small as ~20–40 kDa39,40,41. It is foreseeable that macromolecules as small as streptavidin or even smaller can be solved at a high-enough resolution to build atomic models of the ligand de novo. Such a scenario would make the cryo-EM structure-based drug discovery more trivial. More importantly, the unique power of single-particle cryo-EM in dealing with heterogeneous ligand occupancy and conformations in a single specimen could help accelerate the drug-screening process without the need for crystallization trials or crystal soaking, which are both time- and material-consuming. Our focused classification approach to quantify the ligand density of the dataset suggested a limited capability to obtain a full occupancy of the bound ligand. This is likely to be due to the low signal to noise ratio (SNR) of the images and the current algorithm’s limitation. With the developments of hardware and software in the future, one may solve the structures of a target macromolecule co-existing with multiple ligand candidates in the same specimen and obtain multiple ligand-occupied states solved simultaneously within a few days of data collection and computation.
In this work, we used the Cs-corrector in combination with the VPP for data acquisition, which has been demonstrated by us to allow imaging at the under-focus or over-focus of the objective lens for cryo-EM23. However, our results in this work do not suggest a necessity of the Cs-corrector for the high-resolution structural determination of small protein complexes. As shown in the works by Khoshouei et al.19 and Herzik et al.13, the 64 kDa hemoglobin can be solved at ~3 Å resolution without the Cs-corrector or VPP.
This work solves a near-atomic resolution structure of a protein smaller than 100 kDa with a supporting film. Using single-crystalline graphene as the supporting film may bring us the following benefits, at least for the single-particle studies of SA: (1) reducing the ice thickness (ice noise) without introducing a strong background noise (Supplementary Fig. 7C, 7D) and (2) attracting protein particles near the GWI and reducing the adsorption onto the AWI. We also noticed that the orientation distributions of apo-SA and biotin-SA were slightly different. This difference is probably an effect on the surface property of GWI by 4 mM free biotin molecule in the biotin-SA solution. In the total two apo-SA grids and four biotin-SA grids prepared in two batches 40 days apart from each other, we found that the distribution of particles in the specimens and orientations, as well as the image quality were reproducible in our hands.
In agreement with other studies11,42, we found that the SA particles in our frozen-hydrated specimens either stay on the GWI or become adsorbed onto the AWI, but very few stay in the bulk of the ice. This finding suggests that during the period of specimen preparation, the SA molecules move quite fast into the two interfaces and probably do not come back into the liquid bulk once hitting the interfaces43,44. The uneven distribution of SA particles on the GWI indicates a non-uniformed interfacial property of the graphene surface in our experiments. We do not have a good explanation for this phenomenon, whereas possible causes might include heterogeneous hydrophilicity, adsorbates contamination, or more complicated graphene–water interfacial interactions on the graphene surface45,46. Taking advantage of the different distribution patterns of SA particles on the GWI and AWI, we could roughly separate the dataset in two groups. Our current results suggest that the particles staying on the GWI were better preserved with their high-resolution structural features, whereas those adsorbed to the AWI were preferentially oriented. To prevent the proteins from hitting the AWI too fast, it may be helpful to reduce the Brownian motion rate. This process could be achieved by reducing the temperature of the protein solution, increasing the viscosity of the solution, or increasing the thickness of the liquid layer over the holes on the EM grid. However, these may all unavoidably reduce the contrast of the molecules in the cryo-EM. An ultimate solution to prevent macromolecules from hitting the AWI is by blocking the AWI with either a supporting film such as the graphene or some inert surfactant that does not have any impact on the macromolecules’ structures, as suggested by Glaeser and colleagues10,47,48. Anchoring the macromolecules with certain affinity tags to the graphene or other electron-transparent supporting materials would be an alternative solution49. Only when we prevent the denaturation of the macromolecules at the AWI can we take full advantage of the single-particle cryo-EM analysis in deciphering the distribution of molecular machines in their conformational landscape beyond the static atomic models.
Cryo-EM sample preparation
For the biotin-free apo-SA cryo-sample, 1 mg ml−1 commercially available streptavidin solution (New England Biolabs) was diluted to 0.2 mg ml−1 in 25 mM Tris-HCl buffer (pH 7.5, 75 mM NaCl). After centrifugation (12,000 × g, 15 min), a 4 μl diluted protein sample (0.2 mg ml−1) was added to a pre-glow-discharged 300 mesh Quantifoil Au R0.6/1 graphene-coated grid for specimen preparation in a Vitrobot Mark IV (FEI Company). The graphene-coated grids were prepared as previously described25. Briefly, a large‐area single-crystalline graphene grown on a copper foil produced by the chemical vapor deposition method was transferred to a Quantifoil Au holey carbon grid using a polymer-free transfer method with isopropanol solution. The copper foil was etched off by the (NH4)2S2O8 aqueous solution and washed away completely to generate a highly clean single-crystalline graphene supporting film on the Quantifoil Au holey carbon grid. Immediately before making the cryo-EM specimen, the graphene-coated grid was glow-discharged for 10 s at a low level in a Harrick Plasma instrument after its chamber was evacuated for 2 min from air. In the Vitrobot Mark IV, the humidity was set at 100%, the protein solution was applied to the grid and there was a wait of 10 s before blotting. A blot force of −1 and blot time of 1 s were applied to blot the grid after waiting. After blotting, the grid was plunged into pre-cooled liquid ethane at a liquid nitrogen temperature.
For a biotin-bound SA cryo-EM specimen, the same streptavidin solution as above was supplemented with biotin (Sigma-Aldrich, St Louis, MO, USA) to a final concentration of 0.2 mg ml−1 SA and 4 mM biotin. The biotin-SA solution was incubated on ice for 1 h and then centrifuged at 12,000 × g for 20 min. When preparing the cryo-sample, the waiting time before blotting was 2 s and the blot force was −2. The rest of the steps were the same as those in the preparation of the apo-SA specimen.
Data collection on VPP-Cs-corrector-coupled EM
All the data were collected on the same 300 kV Cs-corrected Titan Krios microscope, which is equipped with an FEI Volta phase plate (FEI) and a K2 Summit direct electron detector with GIF Bio-Quantum Energy Filters (Gatan). After the cryo-specimens were loaded into the microscope, we first performed the basic alignment of the microscope. Then, we tuned the Cs-corrector and VPP at eucentric-focus with approximately −0.6 μm defocus from the eucentric height at ×195,000 magnification (TEM mode, micro-probe) using a previously published procedure23. The microscope with a well-tuned Cs-corrector was then changed to EFTEM mode and the low-dose module exposure mode was set to nanoprobe mode with a 50 μm C2 aperture at ×215,000 magnification (EFTEM mode). The K2 detector was gain-corrected and the energy filter was fully tuned at the exposure condition. We have updated the previous version of AutoEMation so that it can perform fully automatic VPP data collection, as well as VPP position changes and initial phase-shift buildup every ~40 images, as previously established23. During the data collection period, the objective lens was set at eucentric-focus and the specimen was adjusted and imaged at a Z-position of −0.8 μm from the eucentric height for all the exposure holes within an 8 μm radius area. Thirty-two-frame super-resolution movies were collected in a 2.56 s exposure time with a total dose of 50 e− Å−2 and pixel size of 0.26325 Å at the specimen level. Using this method, the data collection speed was ~ 80 images per hour. In total, we collected 1450 movie stacks for apo-SA in a 1-day session and 3309 movie stacks for biotin-SA in a 2-day session.
We used SerialEM to collect VPP electron tomographic data on exactly the same apo-SA cryo-specimen as used in the SPA data collection. Tilt series were collected from −54° to 54° with a 3° interval at ×64,000 magnification (EFTEM mode, pixel size 1.772 Å at the specimen level). For each tilt, the exposure time is 1.0 s with 8 frames using a total dose of 3.38 e− Å−2 in super-resolution mode; therefore, each set of tilt series has a total dose of 125 e− Å−2.
The super-resolution raw frames of the K2 camera were integrated to MRC format stacks by a local-written program Dat2MRC (developed by Bo Shen, unpublished). MotionCorr (-bin 2 -fod 4 -bft 200 -ssr 1 -ssc 1 -pbx 192) was first used for the full-frame alignment and generated bin2-movie stacks for the initial examination4. After the initial examination of the movie stacks for good CTF and astigmatism, the good uncorrected bin2-movie stacks were further processed by MotionCorr2 for a 5 × 5 patch drift correction with dose weighting (-PixSize 0.5265 -kV 300 -Iter 30 -Patch 5 5 -FmDose 1.56 -Bft 200 -Group 3)50. The summed bin4-images were generated with a pixel size of 1.053 Å after the MotionCorr2 correction. The non-dose-weighted images were used for the CTF estimation of the defocus, astigmatism, and phase-shift parameters by Gctf17. The CTF fitting of each micrograph was examined and screened by checking the Thon ring fitting accuracy manually. The dose-weighted images were used for particle picking and reconstruction. For the apo-SA dataset, 709,967 particles were automatically picked by Gautomatch (developed by Kai Zhang, http://www.mrc-lmb.cam.ac.uk/kzhang/Gautomatch/) from 1385 micrographs. For the biotin-SA dataset, 1,346,980 particles were picked by Gautomatch from 3272 micrographs. After the particles were extracted by Relion, a 120 Å high-pass filter was applied to the particle stacks by relion_image_handler for a better 2D classification performance. The initial model was generated de novo by the 3D initial model in Relion using the SGD method. For each dataset, multiple rounds of 2D or 3D classification were performed in Relion to screen the best particles producing the two 3D reconstructions with a D2 symmetry of apo-SA at about 3.3 Å resolution (23,991 particles) and 3.2 Å resolution (45,686 particles) based on the gold-standard fourier shell correlation (FSC) criterion. In addition, a 3.1 Å resolution reconstruction could be generated by combining the two datasets together with the final refined particles (69,677 particles in total). The local resolution was estimated by the program blocres (Bsoft package)51. Directional FSC profiles were calculated on the Remote 3DFSC Processing Server (https://3dfsc.salk.edu/)52.
For the asymmetric SPA, the star files of the related particles were extended four times by the program relion_particle_symmetry_expand with D2 symmetry. Then, the new star files were input into Relion for the skip-align 3D classification and the particle subtraction following the standard process. Ligand occupancies were calculated by counting valid voxel numbers within a masked biotin region from normalized maps using USCF-Chimera. Three individual 3D classification results were used for estimating the mean value and SD.
To generate the subtracted datasets, the densities to be subtracted were manually adjusted in UCSF-Chimera53. The subtracted particles were then re-refined with either local angular search (within 1.8°) or global search from scratch (initial 7.5°). For the global search refinement, the initial models were generated from the target apo-SA maps with 20 Å low-pass filtering.
For the tomography reconstruction, the tilt series raw stacks were first drift-corrected by MotionCor2. The fiducial-free alignment and tomogram reconstruction were done by IMOD’s standard procedure54. The final tomograms were generated with an eight-time binning (pixel size 7.088 Å) from super-resolution images.
Model fitting and refinement
The atomic model of biotin-SA (PDB 1MEP) was fit into the EM density maps as a rigid body in UCSF-Chimera. The crystal structure fit well in the high-resolution EM density maps. Based on the map densities, we mutated and refined some side chains manually in Coot55 and run one round of real space refinement in PHENIX56.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data supporting the findings of this manuscript are available from the corresponding authors upon reasonable request. A reporting summary for this Article is available as a Supplementary Information file. The source data underlying Figs. 4c, 6e–g and Supplementary Figs 6A, B, D, 8 A–D are provided as a Source Data file. The accession numbers for the EM maps, models, and raw movie stack of streptavidin reported in this paper are EMD-0689, EMD-0690, PDB-6J6J, PDB-6J6K, EMPIAR-10269, and EMPIAR-10270.
Program Dat2MRC is available from https://github.com/sailorsb/dat2mrc.
Cheng, Y., Grigorieff, N., Penczek, P. A. & Walz, T. A primer to single-particle cryo-electron microscopy. Cell 161, 438–449 (2015).
Bai, X.-C., McMullan, G. & Scheres, S. H. How cryo-EM is revolutionizing structural biology. Trends Biochem. Sci. 40, 49–57 (2015).
Glaeser, R. M. How good can cryo-EM become? Nat. Methods 13, 28–32 (2016).
Li, X. et al. Electron counting and beam-induced motion correction enable near-atomic-resolution single-particle cryo-EM. Nat. Methods 10, 584 (2013).
Scheres, S. H. W. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Kimanius, D., Forsberg, B. O., Scheres, S. H. & Lindahl, E. Accelerated cryo-EM structure determination with parallelisation using GPUs in RELION-2. eLife 5, e18722 (2016).
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290 (2017).
Lyumkis, D., Brilot, A. F., Theobald, D. L. & Grigorieff, N. Likelihood-based classification of cryo-EM images using FREALIGN. J. Struct. Biol. 183, 377–388 (2013).
Hu, M. et al. A particle-filter framework for robust cryo-EM 3D reconstruction. Nat. Methods 15, 1083–1089 (2018).
Taylor, K. A. & Glaeser, R. M. Retrospective on the early development of cryoelectron microscopy of macromolecules and a prospective on opportunities for the future. J. Struct. Biol. 163, 214–223 (2008).
Noble, A. J. et al. Routine single particle CryoEM sample and grid characterization by tomography. eLife 7, e34257 (2018).
Herzik Jr, M. A., Wu, M. & Lander, G. C. Achieving better-than-3-Å resolution by single-particle cryo-EM at 200 keV. Nat. Methods 14, 1075–1078 (2017).
Herzik, M. A., Wu, M. & Lander, G. C. High-resolution structure determination of sub-100 kDa complexes using conventional cryo-EM. Nat. Commun. 10, 1032 (2019).
Danev, R., Buijsse, B., Khoshouei, M., Plitzko, J. M. & Baumeister, W. Volta potential phase plate for in-focus phase contrast transmission electron microscopy. Proc. Natl Acad. Sci. USA 111, 15635–15640 (2014).
Danev, R. & Baumeister, W. Cryo-EM single particle analysis with the Volta phase plate. Elife 5, e13046 (2016).
Danev, R., Tegunov, D. & Baumeister, W. Using the Volta phase plate with defocus for cryo-EM single particle analysis. eLife 6, e23006 (2017).
Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Rohou, A. & Grigorieff, N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Khoshouei, M., Radjainia, M., Baumeister, W. & Danev, R. Cryo-EM structure of haemoglobin at 3.2 Å determined with the Volta phase plate. Nat. Commun. 8, 16099 (2017).
Chua, E. Y. et al. 3.9 Å structure of the nucleosome core particle determined by phase-plate cryo-EM. Nucleic Acids Res. 44, 8013–8019 (2016).
Khoshouei, M. et al. Volta phase plate cryo-EM of the small protein complex Prx3. Nat. Commun. 7, 10534 (2016).
Liang, Y.-L. et al. Phase-plate cryo-EM structure of a class B GPCR–G-protein complex. Nature 546, 118–123 (2017).
Fan, X. et al. Near-atomic resolution structure determination in over-focus with Volta phase plate by Cs-corrected cryo-EM. Structure 25, 1623–1630.e3 (2017).
Palovcak, E. et al. A simple and robust procedure for preparing graphene-oxide cryo-EM grids. J. Struct. Biol. 204, 80–84 (2018).
Zhang, J. et al. Single crystals: clean transfer of large graphene single crystals for high‐intactness suspended membranes and liquid cells. Adv. Mater. 29, 1700639 (2017).
Russo, C. J. & Passmore, L. A. Controlling protein adsorption on graphene for cryo-EM using low-energy hydrogen plasmas. Nat. Methods 11, 649 (2014).
Taylor, D. W. et al. Substrate-specific structural rearrangements of human Dicer. Nat. Struct. Mol. Biol. 20, 662 (2013).
Hyre, D. E. et al. Cooperative hydrogen bond interactions in the streptavidin-biotin system. Protein Sci. 15, 459–467 (2010).
Penczek, P. A., Frank, J. & Spahn, C. M. A method of focused classification, based on the bootstrap 3D variance analysis, and its application to EF-G-dependent translocation. J. Struct. Biol. 154, 184–194 (2006).
Unverdorben, P. et al. Deep classification of a large cryo-EM dataset defines the conformational landscape of the 26S proteasome. Proc. Natl Acad. Sci. USA 111, 5544–5549 (2014).
von Loeffelholz, O. et al. Focused classification and refinement in high-resolution cryo-EM structural analysis of ribosome complexes. Curr. Opin. Struct. Biol. 46, 140–148 (2017).
Zhang, C. et al. Analysis of discrete local variability and structural covariance in macromolecular assemblies using Cryo-EM and focused classification. Ultramicroscopy 203, 170–180 (2018).
Loveland, A. B., Demo, G., Grigorieff, N. & Korostelev, A. A. Ensemble cryo-EM elucidates the mechanism of translation fidelity. Nature 546, 113 (2017).
Bai, X.-c, Rajendra, E., Yang, G., Shi, Y. & Scheres, S. H. Sampling the conformational space of the catalytic subunit of human γ-secretase. Elife 4, e11182 (2015).
Zhou, Q., Zhou, N. & Wang, H.-W. Particle segmentation algorithm for flexible single particle reconstruction. Biophys. Rep. 3, 43–55 (2017).
Bartesaghi, A. et al. Atomic resolution cryo-EM structure of β-galactosidase. Structure 26, 848–856 (2018).
Tan, Y. Z. et al. Sub-2 Å Ewald curvature corrected structure of an AAV2 capsid variant. Nat. Commun. 9, 3628 (2018).
Merk, A. et al. Breaking cryo-EM resolution barriers to facilitate drug discovery. Cell 165, 1698–1707 (2016).
Henderson, R. The potential and limitations of neutrons, electrons and X-rays for atomic resolution microscopy of unstained biological molecules. Q. Rev. Biophys. 28, 171–193 (1995).
Glaeser, R. M. Review: electron crystallography: present excitement, a nod to the past, anticipating the future. J. Struct. Biol. 128, 3–14 (1999).
Rosenthal, P. B. & Henderson, R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. J. Mol. Biol. 333, 721–745 (2003).
Dubochet, J., Adrian, M., Lepault, J. & McDowall, A. Emerging techniques: Cryo-electron microscopy of vitrified biological specimens. Trends Biochem. Sci. Vers. Ed. 10, 143–146 (1985).
Naydenova, K. & Russo, C. J. Measuring the effects of particle orientation to improve the efficiency of electron cryomicroscopy. Nat. Commun. 8, 629 (2017).
Glaeser, R. M. Proteins, interfaces, and cryo-EM grids. Curr. Opin. Colloid Interface Sci. 34, 1–8 (2018).
Ke, X., Peigen, C. & Heath, J. R. Graphene visualizes the firstwater adlayers on mica at ambient conditions. Science 329, 1188–1191 (2010).
Ko, H. C. et al. High-resolution characterization of preferential gas adsorption at the graphene-water interface. Langmuir 32, 11164–11171 (2016).
Han, B. G., Watson, Z., Cate, J. H. & Glaeser, R. M. Monolayer-crystal streptavidin support films provide an internal standard of cryo-EM image quality. J. Struct. Biol. 200, 307–313 (2017).
Glaeser, R. M. et al. Factors that Influence the formation and stability of thin, cryo-EM specimens. Biophys. J. 110, 749–755 (2016).
Liu, N. et al. Bioactive functionalized monolayer graphene for high-resolution cryo-electron microscopy. J. Am. Chem. Soc. 141, 4016–4025 (2019).
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 https://doi.org/10.1038/nmeth.4193 https://www.nature.com/articles/nmeth.4193#supplementary-information (2017).
Heymann, J. B. & Belnap, D. M. Bsoft: image processing and molecular modeling for electron microscopy. J. Struct. Biol. 157, 3–18 (2007).
Tan, Y. Z. et al. Addressing preferred specimen orientation in single-particle cryo-EM through tilting. Nat. Methods 14, 793–796 https://doi.org/10.1038/nmeth.4347 https://www.nature.com/articles/nmeth.4347#supplementary-information (2017).
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Mastronarde, D. N. & Held, S. R. Automated tilt series alignment and tomographic reconstruction in IMOD. J. Struct. Biol. 197, 102–113 (2016).
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. Sect. D Biol. Crystallogr. 66, 486–501 (2010).
Adams, P. D. et al. PHENIX: A Comprehensive Python‐Based System for Macromolecular Structure Solution (American Cancer Society, 2010).
We thank Xiaomin Li and Tao Yang at the Tsinghua University Branch of the National Protein Science Facility (Beijing) for their technical support on the Cryo-EM and High-Performance Computation platforms. We thank Zhipu Luo at Soochow University for his help in atomic model refinement. This work was supported by grant (2016YFA0501100 to H.W. and J.L., 2016YFA0200101 to H.P.) from the Ministry of Science and Technology of China, grant (Z161100000116034 to H.W.) from the Beijing Municipal Science & Technology Commission, and grant (21525310 to H.P.) from the National Natural Science Foundation of China.
The authors declare no competing interests.
Journal peer review information: Nature Communications thanks Radostin Danev and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Fan, X., Wang, J., Zhang, X. et al. Single particle cryo-EM reconstruction of 52 kDa streptavidin at 3.2 Angstrom resolution. Nat Commun 10, 2386 (2019). https://doi.org/10.1038/s41467-019-10368-w
Megabodies expand the nanobody toolkit for protein structure determination by single-particle cryo-EM
Nature Methods (2021)
Cryo-EM structure of coronavirus-HKU1 haemagglutinin esterase reveals architectural changes arising from prolonged circulation in humans
Nature Communications (2020)
Protein & Cell (2020)
Nature Communications (2019)