The methods of DNA nanotechnology enable the rational design of custom shapes that self-assemble in solution from sets of DNA molecules. DNA origami, in which a long template DNA single strand is folded by many short DNA oligonucleotides, can be employed to make objects comprising hundreds of unique DNA strands and thousands of base pairs, thus in principle providing many degrees of freedom for modelling complex objects of defined 3D shapes and sizes. Here, we address the problem of accurate structural validation of DNA objects in solution with cryo-EM based methodologies. By taking into account structural fluctuations, we can determine structures with improved detail compared to previous work. To interpret the experimental cryo-EM maps, we present molecular-dynamics-based methods for building pseudo-atomic models in a semi-automated fashion. Among other features, our data allows discerning details such as helical grooves, single-strand versus double-strand crossovers, backbone phosphate positions, and single-strand breaks. Obtaining this higher level of detail is a step forward that now allows designers to inspect and refine their designs with base-pair level interventions.
Programmable self-assembly with DNA is a route to nanofabrication1,2,3,4,5,6,7,8 with applications emerging in a variety of fields9,10. The self-assembly reactions of such objects can yield monodisperse products11 and the underlying design concepts in principle enable specifying target structures with chemical accuracy on the level of single bases, within objects designed to contain several thousand bases. As in any other field aimed at creating items of technology, also the structures of objects built with the methods of DNA nanotechnology must be validated experimentally. However, the accuracy and the depth of the available structural data in DNA nanotechnology remains poor compared to the structural data that are routinely generated in other fields of study such as de novo protein design12. Here, we addressed the problem of accurate structural validation with cryo-EM-based methodologies for determining the structures of up to megadalton-scale DNA objects in solution, together with molecular-dynamics-based methods for building pseudoatomic models in a semiautomated fashion. Our methods yield structures that afford improved detail compared to our own previous work13 and to those of others8,14 (see Supplementary Table S1), and now allow discerning details such as helical grooves, single-strand versus double-strand crossovers, backbone phosphate positions, and nick sites. Access to data with such a level of detail enables performing iterative geometry refinements on the level of strands or individual base pairs, as we will show. We predict that this capability will enable the field to move toward more advanced functionalities that require the accurate relative positioning of functional groups, such as molecular recognition, proximity-enhanced templated synthesis, near-field photonic effects, or even enzyme-like catalysis. Likewise, DNA–template-assisted structural determination of proteins14,15,16 presumably will benefit from these improvements in cryo-EM methodology for DNA origami.
We determined cryo-EM maps for a library of multilayer DNA objects in honeycomb and in square-lattice packing (Fig. 1 and Supplementary Figs. S1–26). The library of objects includes four brick-like multilayer DNA origami objects (Fig. 1a) with increasing internal floppiness, a barrel-like 126-helix bundle (Fig. 1b), two variants of a multidomain object called Twisttower (Fig. 1c), a design variant v2 of a previously reported object called the Pointer (Fig. 1d), a small 16-helix bundle (Fig. 1e), four variants of a hinged-beam-like object (Fig. 1f), two variants of a dumbbell-like object (Fig. 1g), and five variants of a six-helix tube featuring asymmetric markers at either end as reporters for twist and one ten-helix tube (Fig. 1h). We also attempted to solve structures of variants of single-layer DNA origami tiles in square-lattice design (Rothemund Rectangle1), but were unsuccessful due to excessive conformational heterogeneity (Supplementary Fig. S27). The micrographs showed that the original, uncorrected tile with a crossover density corresponding to a twist density of 10.67 base pairs per turn assumes wrapped-up-like shapes in solution. The high degree of flexibility and the wrapped-up shape is in accordance with previous findings from simulations, on-support atomic force microscopy (AFM), negative-stain electron microscopy (EM), and small-angle X-ray scattering (SAXS) data17,18,19,20, and should be taken into account for in-solution applications. Exemplary cryo-EM micrographs and acquisition details for each object are given in Supplementary Figs. S1–S27 and in Supplementary Table S2, respectively.
Global twist deformation in a square-lattice
The object termed Twisttower (Fig. 1c) is a complex of four cuboid-like domains fused together. The cuboids feature 2 × 2, 4 × 4, 6 × 6, and 8 × 8 helices, respectively, in a quadratic cross-sectional arrangement (Fig. 2a). The Twisttower design allowed us to systematically study twist deformations, and how they may be removed, in a square-lattice packing context as a function of cross-sectional area. In the cryo-EM map that we determined, each cuboid domain exhibited an independent right-handed twist deformation around the helical axis of the domain (Fig. 2a). Such twist deformations are expected to arise a priori for square-lattice designs with default eight-base-pair crossover spacing because the design specifics create right-handed internal torques4. These torques are produced by helical underwinding from 10.5 to 10.67 base pairs per turn imposed by the square-lattice connectivity. The global twist observed in each Twisttower domain decreased with the increasing cross-sectional area (Fig. 2a, left). To remove the global twist deformation, the internal design specifics must be changed such that the right-handed internal torques are mechanically balanced by counteracting left-handed torques. This may be achieved by locally deviating from the default eight-base-pair crossover spacing design rules following previously discussed concepts4, by reducing the average bases between crossovers to achieve the native 10.5 base pairs per turn. We thus designed and solved a cryo-EM structure of a refined variant of the Twisttower in which we eliminated the global twist deformations. As a result, we obtained a more orderly object with a more regular square-lattice structure (Fig. 2a, right).
Global twist deformation in a honeycomb lattice
The majority of other objects in our library were multilayer DNA origami in honeycomb-lattice packing. Nearly all cryo-EM maps that we determined for objects built using the default seven-base-pair strand crossover spacing prescribed by honeycomb-design rules displayed global right-handed twist deformations (Fig. 1a, f left, 1 g left, 1 h, Fig. 2b–f). The appearance of these twist deformations is noteworthy because the crossovers in seven-base-pair helical intervals create helical connections in a threefold symmetry that closely matches with the natural 10.5 base pairs per turn B-DNA twist density. Hence, in contrast to square-lattice designs, honeycomb designs do not impose a priori helical deformations and twist buildup is not necessarily expected2,4. The extent of the global twist deformation seen in our panel of honeycomb-lattice structures also decreased with the increasing cross-sectional area, consistent with the behavior we saw for a twist in the domains of the square-lattice Twisttower. The 126-helix bundle (Fig. 2b), which featured the widest cross section in our panel, did not display any detectable global twist. The slenderest object, the six-helix tube, exhibited the largest twist deformation, with a 90° total twist over 100 base pairs, in a design implementation that places strand breaks directly at crossover sites for improved folding yield (Supplementary Fig. S28)21,22. A second variant of the six-helix tube had a negligible global twist but instead curvature, and assembled with poor yield (Fig. 2c, bottom). This second variant was designed with legacy rules where strand breaks are placed systematically away from crossovers2. Both six-helix-tube variants use the same default honeycomb seven-base-pair crossover spacing. Our observations made with these two six-helix-tube variants highlight that not only folding behavior but also solution shape may sensitively depend on the internal design details, emphasizing the need for structural validation. We also designed and analyzed the structure of a third, further refined six-helix-tube variant featuring base-pair deletions every 21 bases. This third variant folds well and does not exhibit a global twist deformation nor curvature (Fig. 2c, top and Supplementary Fig. S28).
We previously developed the hinged-beam-like object (Figs. 1f, 2d, left) for positioning fluorophores and reactive groups23, as well as for measuring forces between nucleosomes24. Here, we determined a cryo-EM map for this object and the map reveals a pronounced right-handed twist deformation in the two beams of the object. Our cryo-EM map thereby invalidates the straight geometrical model that we previously assumed23,24 to calculate point-to-point distances. Since the assumed versus actual point-to-point distance changes upon angle change between the beams are negligibly affected by the twist deformation, conclusions drawn in the previous work are not affected. We titrated the density of counter-twist-producing modifications necessary in the hinged-beam object to entirely remove the global twist deformations and solved cryo-EM structures for each design iteration (Fig. 2d, left to right), thus yielding a design variant that now does correspond to the previously assumed straight geometrical model.
Empirical global twist correction
Our set of cryo-EM solution structures furthermore allows constructing an empirical guide for estimating the expected global twist deformations, and for refining the objects to achieve negligible global twist deformations. To this end, we plotted the estimated polar moment of inertia (the torsional stiffness) versus the observed global twist per base along the helical direction in each cryo-EM map that we determined (Fig. 2e). The graph also gives the effective helical twist density that is imposed by design context (square or honeycomb lattice) and the number of counter-torque-producing base-pair deletions or insertions that we installed (Fig. 2e). Researchers can estimate the polar moment of inertia based on the helical cross section of the planned object and then use to plot to read off the required average segment-length deviation from default honeycomb or square-lattice crossover spacing to approximately yield zero twist per base.
In order for the counter-torque-based design refinement to succeed, helices must be connected by crossovers in order to transmit the twist-countering torques. To illustrate this requirement, we built two variants of a dumbbell-shaped multilayer DNA origami featuring rotor-like indicator domains at either end (Fig. 2f). The central axis segment of the dumbbell consisted of 24 parallel, but unconnected helices. The cryo-EM structures that we solved for the two variants of the dumbbell overlapped closely and had the same twist deformation even though we installed differing local helical twist density in-between the rotors (Fig. 2f). This shows that due to the lack of crossovers in the dumbbell axis, torque transmission arising by helical under- or overwinding is not effective.
Since we just saw that crossovers are necessary to transmit the torques, we wondered whether strains originating from crossovers constitute the root cause of twist in honeycomb objects. If this were true, making crossovers floppier should reduce twist deformations. To test this hypothesis, we designed and analyzed a set of multilayer DNA origami 48-helix-brick variants in which we added unpaired thymidines at all staple-strand crossovers (0T, 1T, 2T, and 4T). All variants displayed right-handed global twist deformations of comparable extent (Fig. 2g–i), which suggests that strains produced at crossovers do not cause the twist. Since helical details could be discerned in the majority of the helices of the map obtained with the 0T variant, we could determine the handedness of the global twist deformation based on the fact that B-DNA is right-handed. Helical details were lost in the EM maps already upon adding one T per crossover, but we presume that the twist deformations observed for the variants with more T’s at the crossovers have the same handedness as the 0T variant. Whereas the global twist remained unaffected, adding the thymidines made the objects swell (Fig. 2h, i). The interhelical spacing and the effective diameter of helices increased with T addition, which we attribute to the increasing fluctuations enabled by the increasingly floppy interhelical junctions.
Previous work indicated that the spatial distribution of single-strand backbone discontinuities (nicks), which could constitute torsionally weak points in double-helical domains, influenced the twist angle of six-helix tubes as seen by AFM adhered to a solid support25. To test the relevance of this design parameter in solution, we determined cryo-EM maps of variants of 42-helix brick-like objects in honeycomb-lattice packing (Supplementary Figs. S21–26). In one variant, the staple-strand nicks were distributed randomly; in the second variant, the nicks were aligned on a set of (virtual) cross-sectional planes. The third variant was like the second variant but had additional unpaired thymidines at the nick sites for UV-point welding26. The resulting cryo-EM maps all overlapped closely and featured the identical global twist deformation. Hence, nick-site distribution appears inconsequential with respect to twist buildup. Previous computational studies17,27,28 indicate that for lattice-based DNA origami, the stacking interaction at nick sites might be strong enough to compensate for the missing backbone connection. Introducing gaps might have a greater influence on the overall twist compared to nicks. We note that the irradiation of the objects with UV light eradicated the twist deformation, based on cryo-EM maps that we solved after exposing the samples to 310-nm light (Supplementary Fig. S29), which corroborates previous findings with single-layer structures seen adhered on solid supports29.
Revealing higher-resolution features
We used 3D classification with a large number of classes to uncover the spectrum of shapes sampled by a given DNA origami object. Exemplarily, we found that the Twisttower samples an ensemble of conformations in which the different domains move relative to each other by bending and twist deformations at the domain interfaces. In particular, the 2 × 2 and the 4 × 4 domains show relative displacements with up to 12-nm (extreme to the extreme) amplitude (Fig. 3a and Supplementary Movie M1). Furthermore, we also observed breathing motions in which the entire helical lattice expands and shrinks, as exemplified in data obtained with the Pointer-v2 object (Fig. 3b and Supplementary Movies M2, M3)13. The Pointer v2 also exhibited other types of structural fluctuations akin to domain motions. These fluctuations are design-specific and depend on the global shape as well as the topology of the nanostructures. As these motions are driven by the thermal fluctuation of the individual helices, they are also dependent on temperature and salt- and buffer conditions, as well as the overall folding quality of the nanostructure ensemble. Folding defects, either caused by partially unhybridized or defect oligonucleotides, can locally influence the mechanical properties of a helical segment and distort the global shape or act as a hinge for a domain motion. Hence, DNA origami are not rigid objects, instead, they display substantial structural heterogeneity. The heterogeneity may be coarsely classified as relative domain motions similar to those seen in proteins and as helical lattice breathing, which does not exist in proteins. Ignoring these motions and reconstructing the structure of a target object as a single static entity will blur high-resolution detail that might have been present in the data.
Multibody refinement in the presence of internal motions
To systematically deal with internal motions, we adapted focused refinement methodologies30. We demonstrate the efficacy of this approach exemplarily with the results obtained with the Twisttower (Fig. 3c). Figure 3c, top left, shows the result of a 3D refinement that assumes the whole object as one rigid body. While the quality of this cryo-EM map is already superior to any other DNA origami structure published thus far, the helical interfaces at the periphery of the object and substantial portions of the 4 × 4 domain remain blurred. The 2 × 2 domain can hardly be discerned at all. To deal with relative domain motions, we divided the Twisttower object into its domains (i.e., the 2 × 2, 4 × 4, 6 × 6, and 8 × 8 regions) and used multibody refinement30 to separately reconstruct the 3D structure of these domains pretending that now the domains, but not the entire Twisttower, were rigid bodies. The resulting 3D maps for the separate domains offered improved detail. In particular, helical grooves can now be seen in all regions of the domains and in particular at the periphery (Fig. 3c, insets on the right). A Frankenstein map of the Twisttower (Fig. 3c, bottom), which is generated by merging the maps from the different focused refinements, allows appreciating that now virtually all parts of the Twisttower are resolved with high detail (see also Supplementary Fig. S30). We applied this analysis also to the twist-corrected variant of the Twisttower (Supplementary Fig. S31) and the hinged-beam-like object v4 (Supplementary Fig. S32). To show the efficacy of the multibody refinement even when the object does not have clear domains, we applied the multibody approach to three brick-like objects (Supplementary Figs. 3d, S33–S35) and the 126-helix bundle (Supplementary Fig. S36). Simply dividing the whole object into smaller rigid bodies (Fig. 3d, right) enabled reconstructing all regions of the objects in greater detail including the peripheries (Fig. 3d, top vs. bottom).
Focal scanning refinement
The maps can still be further improved by using a focal scanning two-body refinement approach that allows dealing with helical lattice-breathing motions. To this end, we further divided the previously defined domains into two segments, one being a focal cross-sectional element (e.g., a domain consisting of 2 × 2 double helices, each 32-base-pairs long) to be reconstructed with higher detail (Fig. 4a), and the other comprising the rest of the domain, surrounding the focal element. We thus obtained improved level of detail in the focal element (Fig. 4a, left vs. right). This two-rigid-body refinement approach can then be iterated by scanning the small focus area across the entire object (Fig. 4b). The results from these separate localized refinements may be inspected separately, each revealing a portion of the target structure with high detail, and all of them may again be combined into a Frankenstein map (Fig. 4b, bottom).
Using the focal scanning refinement procedure, we were able to reconstruct details of 2 × 2 parts of the Twisttower object with a global resolution of 4.3 Å and local resolutions below 4 Å (Supplementary Fig. S37 and Supplementary Movie M4). At this level of detail, not only helical grooves can be seen but also the molecular details of the helices emerge. Backbone phosphates manifest as bumps (Fig. 4a, right); the covalent single-strand versus double-strand backbone connections between helices can be recognized as thin bonds and discriminated from each other (Fig. 4c, d, respectively). Nicks that lack phosphates manifest as depressions or kinks in the groove boundaries (Fig. 4e). The thus-refined maps now also deliver detail at the very periphery of the objects, including density from single-strand tails and peripheral crossovers (Fig. 4f). Obtaining this high level of detail is a major step forward that now allows designers to inspect and refine their designs with base-pair-level interventions.
Pseudo-atomic model construction
The availability of high-quality structural data immediately presents a challenge, which is the interpretation of the map with atomic models and forming a link to the actual design. To build such a model, a suitable initial model is required, which can then be systematically corrected to optimally fit into the electron density. For creating initial models, we used the atomic-detail predictions produced by ENRG-MD31. Fitting the initial model into the EM densities is complicated by the pseudo-periodic lattice structure of the DNA origami EM maps. Many local minima exist in which a fit can get stuck. To carry out the fitting, we used molecular dynamics flexible fitting32, in which the electron-density map acts as an attractive force field, and developed a cascaded relaxation33 protocol, in which we relieved restraints sequentially from the ENRG-MD force field as the correlation between measured EM map and the fitted atomic model improved (see “Methods”). Figure 5a shows snapshots of our cascaded relaxation procedure (see also Supplementary Movie M5) for a region within the Twisttower object. Our methodology allows pseudo-atomic model construction in a semiautomated fashion within ~12 h compute time on a standard desktop computer, which may be compared to the several weeks it took to manually construct the atomic model of the previously reported Pointer object13. We constructed pseudoatomic models for six different objects (Fig. 5b). The fitting was validated using Fourier shell correlation against the cryo-EM half-maps (Supplementary Fig. S38) and the models show good cross-correlation with the cryo-EM maps (Supplementary Table S3). We also refitted the previously reported Pointer cryo-EM map with our semiautomated approach, and the computed atomic model closely matches the manually constructed one13 (Supplementary Fig. S39).
Context-based zoning and cropping of cryo-EM maps
We developed a viewer tool34 to form a link between the experimentally determined cryo-EM map, the fitted atomic model, and the strand diagram prepared by the designer to build the DNA origami object under study. The tool links the actual geometry of the object as seen in the cryo-EM map and annotated by the fitted atomic model in terms of cartesian atomic coordinates with the connectivity index system used by designers in DNA origami-strand diagrams (helical and base indices). The tool allows cropping or zoning the map to dissect it into elements of interest and displaying it together with the corresponding segments of the atomic model. For instance, we used the tool to systematically segment all measured maps into the constituent lattice layers to visually inspect the maps and the quality of the atomic model that was fitted to the map (Supplementary Fig. S40). We used these dissections to reveal unexpected or misfolded structural features that otherwise would have been hidden in the depths of the full cryo-EM maps (Supplementary Fig. S41).
In conclusion, our set of cryo-EM structures covers a range of different DNA origami designs and provides insight into structural changes that follow from subtle variations of these designs. Thereby, our dataset (see Supplementary Table S4 for EMBD and PDB IDs) provides the constraints needed for parameterizing computational structure prediction methods, whether coarse-grained or with atomistic detail17,31,35,36. Exemplarily, we computed the deviations of atomistic ENRG-MD predictions31 from our atomic models, which were all >9.6 Å (RMSD) (see Supplementary Table S3) and thus clearly above the resolution of the experimental maps used for fitting. The strongest deviations between experimental data and prediction occurred at locations where the design deviates from idealized lattice rules, e.g., at sites with omitted crossovers or with sudden changes in the helical cross section (Supplementary Fig. S42).
Our set of atomic coordinates obtained from fitting six DNA origami electron-density maps comprises ~100,000 base-pair coordinates, roughly equally distributed over all the possible base-pair step sequences, which may be compared to ~65,000 DNA-only base-pair coordinates currently available in the protein data bank (PDB) (Feb 2020). The depth of data could also allow mining for sequence–structural relationships in B-DNA that could help advance DNA nanotechnology from the current sequence-agnostic design methods to more refined approaches that optimize sequences for target backbone coordinates, akin to strategies used in de novo protein design37.
Our results underline the importance of structure validation in solution. For example, the presumed geometry of one of our own previously reported objects turned out to be not quite correct. It is likely that the actual geometries of many other previously reported objects deviate from the idealized expectations. This is particularly relevant in applications where twist deformations could affect the results (e.g., refs. 38,39,40,41) and with objects having slender cross sections such as six-helix tubes. In fact, six-helix tubes in honeycomb-lattice packing are popular objects that are used in a variety of contexts, ranging from NMR-based structural analysis42 and liquid crystals41 to single-molecule manipulation43,44. Our solution structures of six-helix tubes revealed a strong dependence of their shapes on design details.
We also demonstrated how iterative cycles of design and validation can help to correct an object to meet desired specifications, highlighting the programmability of DNA nanotechnology. Beyond validation of global shapes, we are convinced that revealing high-resolution features of DNA origami structures as we showed here with our focal scanning refinement (Fig. 4) will open new possibilities for the field. Now, researchers can zoom into regions of interest of a DNA origami chassis to iteratively refine them, for example, to tune the relative position and orientation of functional moieties or to model reactive centers.
The reaction mixtures contained homemade scaffold DNA, purchased staple oligonucleotides (Eurofins MWG and IDT), and folding buffer (1 × FOBx) at pH 8, including 5 mM Tris, 1 mM EDTA, 5 mM NaCl, and × mM MgCl2 (see Data D1, Supplementary Table S5). The mixtures were subjected to 15 min of constant heating at 65 °C followed by a stepwise thermal annealing ramp using a Tetrad thermal cycling device (MJ Research, now Bio-Rad). The folding products were purified and concentrated using PEG purification and filter purification/concentration. The used type and concentration of scaffolds and oligonucleotides, concentration of MgCl2, annealing ramps, and purification/concentration protocols depended on the type of structure (see Supplementary Table S5). The PEG purification22 was performed by mixing the folding reaction in a one-to-one ratio with a 15% PEG 8000, 5 mM TRIS, 1 mM EDTA, and 500 mM NaCl solution, and centrifuged for 30 min at 20,000 rcf. Afterward, the supernatant was discarded and the pellet dissolved in 1 × FOB. For the filter purification/concentration, the sample was diluted with 1 × FOB to a final MgCl2 concentration of 5 mM. The Amicon Ultra 0.5-ml, 50-kDa cut-off filters (Millipore) were rinsed with 1 × FOB5. The sample was added to each filter and subjected to a centrifugation step at 10 k rcf for 5 min. After several washing steps consisting of removing of the flow-through, refilling of the filters to 500 µl with 1 × FOB5, and a centrifugation step, the filters were placed upside-down in fresh tubes and subjected to another centrifugation step.
Excess staple DNA strands were removed from the reaction mixture by performing one round of polyethylene glycol (PEG) precipitation. The resulting pellets were dissolved in HPLC buffer (1 mM EDTA, 5 mM TrisBase, and 200 mM NaCl, pH 8) containing 5 mM MgCl2. Then, we subjected the sample to HPLC (Agilent Technologies 1260/1290 infinity) using the column (Agilent Bio SEC-5: 5 µm, 2000 A, 21.2 × 300 mm) at a flow rate of 2 ml/min and collected fractions of the monomer peak (30–35 min). Due to dilution of the sample, we used ultrafiltration (30 K Amicon Ultra-15 mL from Merck Millipore) to concentrate the sample and to exchange the buffer to folding buffer (1 mM EDTA, 5 mM TrisBase, 5 mM NaCl, and 5 mM MgCl2, pH 8).
For UV-point welding26, we used a 300-W xenon light source (MAX-303 from Asahi Spectra) with a high-transmission band-pass filter centered around 310 nm (XAQA310 from Asahi Spectra). We used a light guide (Asahi Spectra) to couple the light into the sample by placing it directly on top of a 0.65-ml reaction tube. Unless otherwise indicated, the brick-like samples were irradiated for 120 min. Samples were irradiated in folding buffer (5 mM Tris, 1 mM EDTA, and 5 mM NaCl), including 30 mM MgCl2, unless otherwise stated. After irradiation, the buffer was exchanged to folding buffer, including 5 mM MgCl2.
Cryo-grid preparation and image acquisition
The purified and the concentrated sample was applied to glow-discharged C-Flat 2/1 4 C (Protochips) or C-Flat 1.2/1.3 4 C grids (Protochips) and plunge-frozen using a Vitrobot Mark IV (FEI, now Thermo Scientific) at the following settings: temperature of 22 °C, the humidity of 100%, 0-s wait time, 2–4-s blot time, −1 blot force, and 0-s drain time (Supplementary Table S2). For the Pointer object and the 16-helix bundle, homemade graphene oxide-coated holey carbon grids were used. Graphene oxide dispersion in H2O (Sigma) was diluted to 0.2 mg/ml in H2O and spun at 300 g for 30 s to remove large aggregates. Quantifoil R1.2/1.3 holey grids were glow-discharged for 1 min, and 3 µl of the graphene suspension was added to the grids for 1 min. Grids were subsequently blotted briefly using Whatman No1 filter paper and washed three times on 20-µl drops of H2O (twice on the graphene side and once on the reverse side). Grids were then used for plunge-freezing without further treatment45. The data were acquired on a Titan Krios G2 electron microscope operated at 300 kV equipped with a Falcon 2, later upgraded to a Falcon 3, direct detector using the EPU software (FEI, now Thermo Scientific). The acquisition parameters for the individual data sets are summarized in Supplementary Table S2.
Cryo-EM data processing
The image processing was performed in Relion 246 and 330. The micrographs were motion-corrected and contrast-transfer function estimated using MotionCor247 and CTFFIND3 and CTFFIND448, respectively. The particles were picked using the Relion and Cryolo49 autopickers. For the autopicking procedure in Relion, a few thousand particles were manually picked and subjected to reference-free 2D classification to create templates. The autopicked particles were extracted from the micrographs and subjected to multiple rounds of 2D and 3D classification to remove falsely picked grid contaminations and damaged particles and to address structural heterogeneity. A refined 3D map was reconstructed using a low-resolution initial model created in Relion. The particles were polished (per-particle motion correction and dose weighting), and a polished 3D-refined map was reconstructed. The map was post processed using a low-pass-filtered mask to calculate the FCSs and estimate the global resolution. Based on the local resolution estimation implemented in Relion, the map was also locally low-pass filtered.
The procedure was performed using multibody refinement in Relion 330. The consensus maps were divided into custom parts using the eraser tool in UCSF Chimera50. The parts were low-pass filtered, binarized, and multiple layers of soft-edge voxels were added to create the masks for multibody refinement. The multibody-refined maps were post processed using low-pass-filtered masks to calculate the FCSs and locally low-pass filtered based on their estimated local resolution. For each component of the principal component analysis, particles were distributed into ten equally populated bins according to their eigenvalues to create maps representing the motion of the respective components. For the subsequent focused two-body refinement of the Twisttower, the particles were re-extracted at the centers of projections of the four domains, with smaller subarea boxes but original pixel size from the micrographs, subtracting the signal from the other bodies. The resulting smaller boxes allow more efficient processing. From the resulting four subtracted particle sets, refined maps of the individual bodies were reconstructed. To focus on a small subvolume, the map was divided into a map containing the region of interest and a second map containing the rest using UFSF Chimera. To address the remaining motion, multibody refinement with low-pass-filtered, soft-edged masks was performed instead of the simpler approach of partial signal subtraction and focused refinement.
The initial pseudoatomic models were calculated using the idealized coordinates provided by ENRG-MD31, with the caDNAno 2.3 strand diagram6 and the nucleotide sequence as input. Models were manually prealigned with the cryo-EM maps using VMD version 1.9.3_MacOSX51. Approximate alignment of the center of mass and the helical direction is sufficient, as the procedure is capable of realigning the model. Before starting the fitting procedure, the steepest-descent energy minimization of up to 4800 steps was used to improve the model’s geometry. All simulations were performed using the program NAMD 2.12_Linux52 and the CHARMM3653,54 force field for nucleic acids. The VMD packages mdff_0.5 and volutil_1.3 were used to prepare the grid-based potentials from the cryo-EM maps. All simulations were performed in the NVT ensemble, used periodic boundary conditions, smooth truncation of Lennard–Jones and short-ranged Coulomb interactions at 10 Å, and a switch distance of 8 Å. No PME long-range corrections were applied and Langevin forces for all nonhydrogen atoms with damping of 0.1 1/ps were used to keep the temperature constant. Simulations were performed in a vacuum with dielectric constant 1. For the Twisttower, the twist-corrected Twisttower, and the barrel-like 126-helix bundle, composite maps from the multibody refinement were used for fitting.
ENRG-MD-driven cascaded flexible fitting
The cascaded relaxation procedure is based on molecular-dynamics flexible fitting (MDff)32,55 and consists of three phases. First, the helices of the initial model are globally aligned. MDff is performed with a weight on the EM density of 0.3 kcal/mol. To avoid the strong local minima generated by the lattice, the model is fitted to a series of cryo-maps of sequentially improving resolution, an approach called cascading flexible fitting (cMDff)33. Maps are low-pass filtered using a gaussian blur of up to 22 Å. This initial cascade consists of eight maps with an overall resolution of 22–10 Å. Each map is used to perform 12,000 steps of MDff. As the strong deformations of the model during these early stages can significantly distort the geometry of the model, the elastic network (EN) provided by ENRG-MD31 is used throughout this process. The harmonic bonds not only provide extra stability, but they also significantly speed up the dynamics of the structure. Consequently, the initial cascade can correct strong misalignment of the map and initial model, although large deviations might necessitate more time to align properly. This is the case for the Twisttower, where the different domains cannot be aligned with the initial model correctly. Additionally, the 2 × 2 sections are twisted by more than 90 degrees, necessitating a prolongation of the early alignment process. For the Twisttower, the cascade started at 24 Å and 60,000 steps.
During this general helical alignment step, local deformations, especially close to holiday junctions, can occur. These deformations are a consequence of the nonglobal nature of the MDff minimization combined with the restraints of ENRG-MD, which do not permit large structural deviations from B-DNA. These harmonic restraints can be categorized into long-range interhelical, short-range intrahelical, and bonds connecting the Watson–Crick base pairs. To resolve these local deformations, the grid-based potential is switched off for 12,000 steps, locally relaxing the structure using all EN restraints31. Then, a second cascade is performed using eight maps of overall resolution from 16 Å to the final resolution of the map. MDff is performed with 6000 steps for each map. As the model is already closer to the hypothetical solution structure, the EN is no longer required to suppress strong deformations. Reducing the number of harmonic bonds increases the sampled face space of the procedure and improves the model’s accuracy. Consequently, long-range bonds of the EN are turned off for maps with a resolution of 14 Å or better. Finally, the model is fitted against the original map for 12,000 steps with only base-pair-restraining bonds still in place. Afterward, the weight on the EM density is increased to 1.0 kcal/mol for 18,000 energy- minimization steps. The selection of the different harmonic bonds is performed in the context of the strand diagram.
Model validation and stereochemical quality
To assess the quality of the fit, the masked cross-correlation coefficient (ccc) was calculated using the mdff_0.5 package in VMD51 (Supplementary Table S3), using the map resolution reported in Supplementary Table S3. The masks were generated with the viewer tool34. Additionally, helical properties like helical rise, twist, and base-pair orientation were calculated and monitored throughout the fitting procedure to assess outliers and big changes in the model’s geometry. We also used a half-map-based validation approach to prevent overfitting56. Fourier Shell Correlation for the final fitted model and the experimental cryo-EM map was calculated using Relion46 (see Supplementary Fig. S38). The simulated maps were generated in VMD.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
All maps and fitted models that support the findings of this study are available in the EMDB57 and in the Protein Data Bank (PDB)58, respectively: EMD-11379, EMD-11378, EMD-11881, EMD-11170, EMD-11367, EMD-11387, EMD-11343, EMD-11344, EMD-11345, EMD-11355, EMD-11351, EMD-11352, EMD-11353, EMD-11354, EMD-11346, EMD-11348, EMD-11349, EMD-11350, EMD-11159, EMD-11168, EMD-11294, EMD-10993, EMD-11295, EMD-11296, EMD-11297, EMD-11298, 7ARV, 7ARY, 7ARE, 7AS5, 7ARQ, 7ART. Identifiers with corresponding names of the structures are also listed in Supplementary Table 5. Raw cryo-EM data are available from the corresponding author upon reasonable request. Source data are provided with this paper.
The viewer tool for context-based zoning and cropping of cryo-EM maps is free open-source software under the GNU Public License version 3 and can be downloaded from https://github.com/elija-feigl/FitViewer34. It is developed as a Python-based Jupyter Notebook59. Atomic model trajectories and coordinate files are read/written by the python package MDAnalysis 0.20.160, cryo-EM data are handled by the package mrcfile 1.1.261. A custom version of the package autodesk/nanodesign (https://github.com/elija-feigl/nanodesign_dietz) is used for reading the caDNAno strand diagrams.
Rothemund, P. W. Folding DNA to create nanoscale shapes and patterns. Nature 440, 297–302 (2006).
Douglas, S. M. et al. Self-assembly of DNA into nanoscale three-dimensional shapes. Nature 459, 414–418 (2009).
Andersen, E. S. et al. Self-assembly of a nanoscale DNA box with a controllable lid. Nature 459, 73–76 (2009).
Dietz, H., Douglas, S. M. & Folding, W. M. Shih DNA into twisted and curved nanoscale shapes. Science 325, 725–730 (2009).
Ke, Y. et al. Multilayer DNA origami packed on a square lattice. J. Am. Chem. Soc. 131, 15903–15908 (2009).
Douglas, S. M. et al. Rapid prototyping of 3D DNA-origami shapes with caDNAno. Nucleic Acids Res. 37, 5001–5006 (2009).
Benson, E. et al. DNA rendering of polyhedral meshes at the nanoscale. Nature 523, 441–444 (2015).
Veneziano, R. et al. Designer nanoscale DNA assemblies programmed from the top down. Science 352, 1534 (2016).
Hong, F., Zhang, F., Liu, Y. & Yan, H. DNA Origami: scaffolds for creating higher order structures. Chem. Rev. 117, 12584–12640 (2017).
Ramezani, H. & Dietz, H. Building machines with DNA molecules. Nat. Rev. Genet. 21, 5–26 (2020).
Sobczak, J. P., Martin, T. G., Gerling, T. & Dietz, H. Rapid folding of DNA into nanoscale shapes at constant temperature. Science 338, 1458–1461 (2012).
Huang, P. S., Boyken, S. E. & Baker, D. The coming of age of de novo protein design. Nature 537, 320–327 (2016).
Bai, X. C., Martin, T. G., Scheres, S. H. & Dietz, H. Cryo-EM structure of a 3D DNA-origami object. Proc. Natl Acad. Sci. USA 109, 20012–20017 (2012).
Dong, Y. et al. Folding DNA into a lipid-conjugated nanobarrel for controlled reconstitution of membrane proteins. Angew. Chem. 57, 2072–2076 (2018).
Martin, T. G. et al. Design of a molecular support for cryo-EM structure determination. Proc. Natl Acad. Sci. USA 113, E7456–E7463 (2016).
Aksel, T., Yu, Z., Cheng, Y. & Douglas, S. M. Molecular goniometers for single-particle cryo-EM of DNA binding proteins. Nature Biotechnology. https://doi.org/10.1038/s41587-020-0716-8 (2020).
Kim, D. N., Kilchherr, F., Dietz, H. & Bathe, M. Quantitative prediction of 3D solution shape and flexibility of nucleic acid nanostructures. Nucleic Acids Res. 40, 2862–2868 (2012).
Woo, S. & Rothemund, P. W. Programmable molecular recognition based on the geometry of DNA nanostructures. Nat. Chem. 3, 620–627 (2011).
Baker, M. A. B. et al. Dimensions and global twist of single-layer DNA origami measured by small-angle X-ray scattering. ACS Nano 12, 5791–5799 (2018).
Mallik, L. et al. Electron microscopic visualization of protein assemblies on flattened DNA origami. ACS Nano 9, 7133–7141 (2015).
Martin, T. G. & Dietz, H. Magnesium-free self-assembly of multi-layer DNA objects. Nat. Commun. 3, 1103 (2012).
Wagenbauer, K. F. et al. How we make DNA origami. ChemBioChem 18, 1873–1885 (2017).
Funke, J. J. & Dietz, H. Placing molecules with Bohr radius resolution using DNA origami. Nat. Nanotechnol. 11, 47–52 (2016).
Funke, J. J. et al. Uncovering the forces between nucleosomes using DNA origami. Sci. Adv. 2, e1600974 (2016).
Lee, J. Y. et al. Investigating the sequence-dependent mechanical properties of DNA nicks for applications in twisted DNA nanostructure design. Nucleic Acids Res. 47, 93–102 (2019).
Gerling, T., Kube, M., Kick, B. & Dietz, H. Sequence-programmable covalent bonding of designed DNA assemblies. Sci. Adv. 4, eaau1157 (2018).
Pan, K. et al. Lattice-free prediction of three-dimensional structure of programmed DNA assemblies. Nat. Commun. 5, 5578 (2014).
Pan, K., Bricker, W. P., Ratanalert, S. & Bathe, M. Structure and conformational dynamics of scaffolded DNA origami nanoparticles. Nucleic Acids Res. 45, 6284–6298 (2017).
Chen, H., Li, R., Li, S., Andreasson, J. & Choi, J. H. Conformational effects of UV light on DNA origami. J. Am. Chem. Soc. 139, 1380–1383 (2017).
Nakane, T., Kimanius, D., Lindahl, E. & Scheres, S. H.Characterisation of molecular motions in cryo-EM single-particle data by multi-body refinement in RELION. eLife 7, e36861 (2018).
Maffeo, C., Yoo, J. & Aksimentiev, A. De novo reconstruction of DNA origami structures through atomistic molecular dynamics simulation. Nucleic Acids Res. 44, 3013–3019 (2016).
Trabuco, L. G., Villa, E., Mitra, K., Frank, J. & Schulten, K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure 16, 673–683 (2008).
Singharoy, A. et al. Molecular dynamics-based refinement and validation for sub-5 A cryo-electron microscopy maps. eLife 5, e16105 (2016).
Castro, C. E. et al. A primer to scaffolded DNA origami. Nat. Methods 8, 221–229 (2011).
Snodin, B. E. K., Schreck, J. S., Romano, F., Louis, A. A. & Doye, J. P. K. Coarse-grained modelling of the structural properties of DNA origami. Nucleic Acids Res. 47, 1585–1597 (2019).
Boyken, S. E. et al. De novo design of protein homo-oligomers with modular hydrogen-bond network-mediated specificity. Science 352, 680–687 (2016).
Kuzyk, A. et al. DNA-based self-assembly of chiral plasmonic nanostructures with tailored optical response. Nature 483, 311–314 (2012).
Schmied, J. J. et al. Fluorescence and super-resolution standards based on DNA origami. Nat. Methods 9, 1133–1134 (2012).
Acuna, G. P. et al. Fluorescence enhancement at docking sites of DNA-directed self-assembled nanoantennas. Science 338, 506–510 (2012).
Siavashpouri, M. et al. Molecular engineering of chiral colloidal liquid crystals using DNA origami. Nat. Mater. 16, 849–856 (2017).
Douglas, S. M., Chou, J. J. & Shih, W. M. DNA-nanotube-induced alignment of membrane proteins for NMR structure determination. Proc. Natl Acad. Sci. USA 104, 6644–6648 (2007).
Kauert, D. J., Kurth, T., Liedl, T. & Seidel, R. Direct mechanical measurements reveal the material properties of three-dimensional DNA origami. Nano Lett. 11, 5558–5563 (2011).
Pfitzner, E. et al. Rigid DNA beams for high-resolution single-molecule mechanics. Angew. Chem. 52, 7766–7771 (2013).
Bokori-Brown, T. G. M. Monika, Naylor, ClaireE., Basak, AjitK., Titball, RichardW. & Savva, ChristosG. Cryo-EM structure of lysenin pore elucidates membrane insertion by an aerolysin family protein. Nat. Commun. 7, 11293 (2016).
Kimanius, D., Forsberg, B. O., Scheres, S. H. & Lindahl, E. Accelerated cryo-EM structure determination with parallelisation using GPUs in RELION-2. eLife 5, e18722 (2016).
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Rohou, A. & Grigorieff, N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Wagner, T. et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun. Biol. 2, 218 (2019).
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. Graph 14(33–38), 27–38 (1996).
Phillips, J. C. et al. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 1781–1802 (2005).
Hart, K. et al. Optimization of the CHARMM additive force field for DNA: improved treatment of the BI/BII conformational equilibrium. J. Chem. Theory Comput. 8, 348–362 (2012).
Vanommeslaeghe, K. et al. CHARMM general force field: a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J. Comput. Chem. 31, 671–690 (2010).
McGreevy, R. et al. xMDFF: molecular dynamics flexible fitting of low-resolution X-ray structures. Acta Crystallogr. D. Biol. Crystallogr. 70, 2344–2355 (2014).
Igaev, M., Kutzner, C., Bock, L. V., Vaiana, A. C. & Grubmuller, H. Automated cryo-EM structure refinement using correlation-driven molecular dynamics. eLife 8, e43542 (2019).
Lawson, C. L. et al. EMDataBank.org: unified data resource for CryoEM. Nucleic Acids Res. 39, D456–D464 (2011).
Berman, H. M. et al. The protein data bank. Nucleic Acids Res. 28, 235–242 (2000).
Kluyver, T. et al. Jupyter Notebooks-a publishing format for reproducible computational workflows. In ELPUB, 87–90. (2016).
Michaud-Agrawal, N., Denning, E. J., Woolf, T. B. & Beckstein, O. MDAnalysis: a toolkit for the analysis of molecular dynamics simulations. J. Comput. Chem. 32, 2319–2327 (2011).
Burnley, T., Palmer, C. M. & Winn, M. Recent developments in the CCP-EM software suite. Acta Crystallogr. D. Struct. Biol. 73, 469–477 (2017).
This work was supported by a European Research Council Consolidator Grant to H.D. (GA no. 724261), the Deutsche Forschungsgemeinschaft through grants provided within the Gottfried-Wilhelm-Leibniz Program and the SFB863 TPA9 Project ID 111166240 (to H.D.), and the UK Medical Research Council (MC_UP_A025_1013 to S.H.W.S.). Additional support came from the Max Planck School Matter to Life (a joint program of BMBF and Max Planck Society) to H.D. and E.F.
Open Access funding enabled and organized by Projekt DEAL.
The authors declare no competing interests.
Peer review information Nature Communications thanks Chunhai Fan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Kube, M., Kohler, F., Feigl, E. et al. Revealing the structures of megadalton-scale DNA complexes with nucleotide resolution. Nat Commun 11, 6229 (2020). https://doi.org/10.1038/s41467-020-20020-7
This article is cited by
Nature Communications (2023)
Nature Protocols (2022)