Main

There has been considerable recent progress in the design of protein nanomaterials including cyclic oligomers2,3,4, polyhedral nanocages5,6,7,8, one-dimensional (1D) fibres9,10, 2D sheets11,12 and 3D crystals10,13 by docking together8 or fusing6,14 protein monomers or cyclic oligomers. Although powerful, these methods have two limitations that arise from the irregularity of almost all protein structures. First, because the shapes of the constituent components are generally complex, they cannot be assembled into higher-order structures on the basis of simple geometric principles; instead, large-scale sampling calculations are required to identify shape-complementary interactions for each case, and there is no guarantee that designable interfaces can be found. Second, as for the myriad protein complexes in nature, the size of a designed protein assembly cannot be readily scaled; it is nearly impossible to make a smaller or larger but otherwise nearly identical version of assemblies generated using current design methods. By contrast, designed materials that extend along just one dimension, such as α-helical coiled coils and repeat proteins, can be grown or shrunk by simply varying the length of the chain. There is a rich history of designing coiled coils using simple geometric principles; this extensibility and designability have made them widely used constituents of designed protein materials15.

We set out to develop a general approach for designing expandable higher-order protein nanomaterials with the simplicity and programmability of coiled-coil engineering. We reasoned that if a modular and regular toolkit of building blocks and interactions could be generated consisting of linear building blocks constructed from (1) repeating sequence elements that extend without twisting as additional sequence repeats are added (Fig. 1b,c(top),d), (2) curved building blocks that trace out arcs of circles of different radii (Fig. 1c(bottom),d) and (3) non-covalent arrangements that hold two building blocks in pre-specified relative orientations (Fig. 1e), then building up new nanostructures could in principle be carried out by inspection in a manner analogous to blueprinting a house frame (Fig. 1a). Furthermore, as with a house frame, the regular structures of the constituent building blocks could enable scaling the dimensions of the final architecture (area or volume) by simply altering the size of the constituent monomers, and structural reinforcement by placement of additional buttressing elements.

Fig. 1: Overview of THR protein blocks and interaction modules.
figure 1

a, Building a house frame from standardized wooden building blocks. b, THR internal geometry. Blocks are constructed from idealized straight α-helices with an angle of rotation between adjacent helices of Δθ; the remaining degrees of freedom that contribute to the repeat trajectory are also indicated. c, Changing Δθ (while holding the other parameters constant) specifically changes the curvature of the repeat trajectory. d, Single-chain THR modules. e, THR interaction modules. Image of house frame (a, top) by mmaxer, Can Stock Photo.

Design of twistless helix repeats

Natural and previously designed proteins exhibit a wide range of helical geometries with local irregularities, kinks and deviations from linearity16 that make it difficult to achieve the properties illustrated in Fig. 1 that enable simple nanomaterial scaling (beyond the one dimension accessed by varying the number of repeats in a repeat protein or coiled coil). To achieve these properties, we designed a series of new building blocks constructed from ideal α-helices with all helical axes aligned. Restricting helical geometry to ideal straight helices with zero helical twist in principle considerably limits what types of structure could be built, but this is more than compensated by the great simplification of downstream material design, as illustrated below. We construct twistless helix repeat (THR) protein blocks from identical straight α-helices (typically 2–4 helices in each unit); the length of the blocks can be varied simply by varying the number of repeat units. In contrast to existing natural and designed repeat proteins17, THRs are constructed to enable modular nanomaterial design: linear blocks are perfectly straight, allowing nanomaterials to be extended and contracted with no alteration in the angles between the constituent monomers; curve blocks have smoothly curving trajectories that stay in-plane; and turn as well as interaction modules enable placement of two blocks in precise relative orientations with angles appropriate for regular material design.

We blueprint THRs by explicit placement of these straight helix structural elements using an extension of the principles used in coiled-coil and helical bundle design16,18. A first helix a0, part of the zeroth repeat, is placed at the origin and aligned to the z axis. A copy of a0 called a1 is then placed at a new location to set the rigid body transformation between the zeroth and first (and all subsequent) repeat units. After this, any other helices (b0, c0 …) that will be part of the repeating unit are placed as appropriate between a0 and a1 to provide more helices to pack against for stability, and the helices are connected with loops19; repetition of this basic unit then generates backbones with the desired geometries17 (Fig. 1b,c). As the helices are perfectly straight and parallel to the z axis, the overall repeat protein trajectory is fully defined by the following transformation parameters from a0 to a1: the distance of displacement in the xy plane from helical axis to helical axis (d), the change in displacement in the z axis direction (Δh) and the change in helix phase (Δθ; Fig. 1b). The remaining degrees of freedom for the positions of helices b0, c0 …, which define the internal geometry of the repeat, are extensively sampled, sequences are designed using Rosetta FastDesign or ProteinMPNN19,20, and designs are selected for experimental characterization on the basis of packing and sequence–structure consistency metrics (Methods). We obtained synthetic genes encoding the selected designs, expressed them in Escherichia coli and purified the proteins using nickel–nitrilotriacetic acid immobilized metal affinity chromatography. Designs that were solubly expressed were analysed by size-exclusion chromatography (SEC) to determine oligomerization state, and in the case of assemblies a subset was analysed by negative-stain electron microscopy (ns-EM). Experimental success rates and structural homogeneity for different classes of designs are summarized in Supplementary Figs. 1 and 2 and Supplementary Discussion.

To generate straight, linear THRs, we set Δθ to zero. As illustrated in Fig. 2a,b, this results in perfectly straight repeat proteins in which each repeat unit is translated but not rotated relative to the previous unit. There are two subclasses: setting Δh = 0 generates repeat proteins with each repeat unit simply displaced in the xy plane (Fig. 2a), whereas setting Δh to a non-zero value generates repeat proteins that also step along the z axis (Fig. 2b). We tested 33 linear THRs (with Δh = 0) with helices either about 20 or about 40 residues in height; 23 of 33 tested designs were solubly expressed, and 13 of 19 designs analysed by SEC were primarily monomeric as designed (Supplementary Figs. 1a,b and 2). Structural characterization of the linear building blocks by X-ray crystallography individually and/or cryogenic EM (cryo-EM) in the context of assemblies (see below) revealed that both the detailed internal structures and the overall straight linear geometry were successfully achieved. The backbone root mean square deviations (RMSDs) between the design models and crystal structures of three 20-residue helix designs (THR1, THR2 and THR3) and two 40-residue helix designs (THR5 and THR6) were 0.8, 0.8, 0.4, 0.6 and 1.3 Å, respectively, and in all five cases the relative rotation of successive repeats is nearly zero (Fig. 2a and Supplementary Fig. 6a). We found that we could not only control Δθ = 0, but also program values of the inter-repeat distance d: the crystal structure of a design with d set to a compact helix packing value of 8.7 Å had a very close value of 8.6–8.8 Å at its central interior (THR3), in contrast to most others designed at 10.0 Å (Supplementary Fig. 6b). For structural validation of blocks with non-zero Δh, the cryo-EM structure of an assembly constructed from such a block (THR4) exhibited a linear stair-stepping structure nearly identical to the design model, (backbone RMSD of 1.0 Å; Fig. 2b and Supplementary Fig. 1a).

Fig. 2: 1D and 2D shapes from THRs.
figure 2

a,b, The linear THR designs (rainbow) are nearly identical to the experimentally determined structures (grey). Side-chain sticks between α-carbon and β-carbon are shown to indicate helical phasing. a, Left: the 2.5-Å-resolution crystal structure of the short, linear THR1 has a 0.8 Å CA RMSD to the design. The inset below shows repeat packing in the THR interior. Right: the 2.7-Å-resolution crystal structure of the tall, linear THR5 has a 0.6 Å CA RMSD to the design. b, Bottom: Comparison of the stair-stepping linear THR4 design model to the cryo-EM structure (determined as part of a nanocage assembly; Supplementary Fig. 16). The CA RMSD between the cryo-EM structure and the design model is 1.0 Å. c, C4 and C3 polygons generated from four-helix turn module THRs as illustrated on the left. C4 square 90_C4_B (middle) and C3 triangle 120_C3_A (right) oligomers with representative ns-EM 2D class averages for comparison (raw EM micrographs are in Supplementary Fig. 1f). Chain breaks are at the ends of the rainbow sections. Scale bar, 4 nm (for the design models); class averages are not to scale. d, Uncapped curve THRs generate cyclic ring oligomers. The 12-repeat ring design (tested as C4) R12B has a cryo-EM 3D reconstruction overlaid on the model; the two are nearly identical. A 2D class average with the individual straight helices resolved is shown left of the ring. e, The 20-repeat ring design (tested as C4) R20A has an ns-EM reconstruction density overlaid on the model, and a raw micrograph is shown inside. Scale bar, 10 nm. f, The 30-repeat ring design (tested as C6) R30A represented in a similar manner to e. Scale bar in e, 10 nm (for the design models with reconstruction maps overlaid in df); class averages are not to scale. The asymmetric unit is coloured in rainbow.

To generate turn blocks, we blueprint an additional helix c0 lined up with a0 and a1 that can be assigned any specified phase difference, which can be utilized in fusion operations to produce a turn that is equal to θc − θa (Supplementary Fig. 5d,e). As for all of the THR blocks described here, because of the ideality of the block construction, the same sequence interactions can be used for the intra-block and inter-block interactions; we refer to blocks in which the terminal repeats have identical sequences to the internal repeats as uncapped, and those in which the terminal helices have polar outward-facing residues to prevent self-association (like the linear blocks above) as capped. We experimentally characterized uncapped turn modules that generate rotations of 360/n, in which n is 3, 4, 5 or 6; if the geometry is correct, these should oligomerize to form closed polygons with n subunits. ns-EM 2D class averages of the n = 3 designs clearly show the designed triangular shape with flattened corners (Fig. 2c and Supplementary Fig. 1f), and for n = 4, the designed square shapes (Fig. 2c and Supplementary Fig. 1f) including fine details such as the lower density around the corner helix are observed. For n = 5 and n = 6, success rates were lower, probably because their hinge regions involved less-extensive helix–helix interactions, but we did obtain designs with the expected polygonal structures for both after using reinforced corners on the C6 design (Supplementary Fig. 1f and Supplementary Discussion). Thus, by controlling the phase rotations between adjacent helices, turns can be encoded while maintaining overall parallel helical architecture. We also made polygonal designs with combinations of linear THRs and new straight helix-heterodimer corner junctions instead of turn modules (Supplementary Discussion and Supplementary Figs. 1g, 9 and 10).

To generate curve THRs, we incorporate a phase change (Δθ) between repeating elements (Fig. 1c) that generates a curved trajectory rather than a linear one. We choose Δθ to be a factor of 360° so that perfectly closed rings can be generated. The size of the closed ring can be controlled by specifying Δθ and the distance d between repeats (Supplementary Fig. 7). To access a broad range of d parameter values, we add additional helices to the repeat unit; for circular rings we used four helices per repeat unit. A full curve THR ring with n repeats can be divided into smaller chains each with m repeats, in which m is a factor of n; n/m uncapped repeats can associate to generate the full ring with cyclic symmetry21. To facilitate gene synthesis and protein production, we characterized such split oligomeric versions of the rings rather than synthesizing very long single chains. We designed rings with 12, 18, 20 and 30 repeats ranging from 9 to 22 nm in outside diameter. The 12- and 20-repeat rings were tested as C4 designs, whereas the 18- and 30-repeat rings were tested as C6 designs. Designs for all four ring sizes were remarkably uniform with ns-EM micrographs densely covered with circular assemblies with few to no defects or alternative structures present (Supplementary Fig. 7). Two-dimensional class averages showed that designs for all four sizes were close to the intended size (Fig. 2d; 10, 1 and 9 unique designs yielded distinct ring shapes for 18-, 20- and 30-repeat rings (Supplementary Figs. 1e and 2)). The smallest rings with 12 repeats have solvent-exposed helices exterior to the ring placed to facilitate outward-facing fusions without disrupting the core packing of the ring; these are clearly visible in the 2D class averages and 5.2-Å-resolution cryo-EM reconstruction of R12B (Fig. 2d and Supplementary Fig. 1e) that matches the designed patterning of the helices. ns-EM of the 18-, 20- and 30-repeat rings (with outside diameters of 12, 14 and 22 nm respectively) showed that many designs formed remarkably monodisperse populations of ring-like structures closely consistent with the design models (Fig. 2e,f and Supplementary Fig. 1e). ns-EM class averages of these designs had the smooth and round shape of the design models, and were in most but not all cases homogeneous (some designs assembled into closed-ring species that ranged by ±1 chain of the desired number, resulting in some slightly oblong shapes; Supplementary Fig. 1e). These designs highlight the control over ring curvature that can be achieved by specifying building block repeat rotation parameters.

The simplicity of our blocks in principle enables the reinforcing of designed materials using struts rigidly linking distinct structural elements. As a first test of this, we sought to build concentric ring assemblies from pairs of rings that have different sizes but repeat numbers that share large common denominators. For example, 2 repeat units of a 20-repeat ring can be combined with 3 repeat units of a 30-repeat ring as 10 copies generate a complete ring in both cases (Fig. 3a, left). Rings were segmented into matching cyclic symmetries, the rotation and z displacement of one ring relative to the other was sampled, and linear THRs were placed to connect the inner and outer rings. We constructed single-component C10 concentric ring assemblies by connecting a three-repeat-unit curved block and a two-repeat-unit curved block that both generate a 36° (360°/10) rotation with a radially oriented strut. Two-dimensional class averages of ns-EM images of the designed strutted assemblies show both rings clearly present (Fig. 3a, right; some 11-subunit rings were observed in addition to the target 10-subunit structure). We similarly connected three repeat units with a 20° rotation per repeat, and five repeat units with a 12° rotation per repeat, with a radial strut; the resulting composite subunits map out a 60° rotation of inner and outer rings such that six subunits generate a full 360° ring. The resulting two-component C6 strutted assembly yielded 2D class averages that showed both rings with all chains present, and a 5.1-Å cryo-EM reconstruction was very close to the design model (RMSD 2.7 Å) with very similar outer diameter (19.7 nm versus 20.1 nm; Fig. 3b and Supplementary Fig. 8c). The helix positioning in the inner ring and the strut are also very close to the design model (Supplementary Fig. 8c, insets). Thus, the modularity of the THRs enables designing complex structures by inspection, and enables buttressing to increase structural robustness (Supplementary Discussion and Supplementary Fig. 8).

Fig. 3: Design of strutted double rings.
figure 3

a,b, Two different size rings built from curve THRs for which integral multiples generate the same rotation can be concentrically nested and connected by struts. a, Three repeats of an outer ring (12° per repeat) are combined with two repeats of an inner ring (18° per repeat) that both generate a 36° rotation. Connection of the two pieces with a linear THR generates a C10 single-component ring (strut_C10_8); an asymmetric unit is highlighted in the second ring image. An ns-EM 3D reconstruction in C10 symmetry is shown overlaid with the design model next to 2D class averages and a representative micrograph. b, Five repeats of an outer ring (12° per repeat) are combined with three repeats of an inner ring (20° per repeat) that both generate a 60° rotation. Connection of the two pieces with a linear THR and an additional chain break in the outer ring generates a C6 two-component ring (strut_C6_21); an asymmetric unit is highlighted in the second ring image with the two chains in different colors. A cryo-EM 3D reconstruction in C6 symmetry is shown overlaid with the design model next to cryo-EM 2D class averages and a representative ns-EM micrograph (additional cryo-EM details are provided in Supplementary Fig. 8c). Scale bars, 20 nm (a,b). An asymmetric unit is outlined on top of the design model, and repeats are sectioned with dashed lines.

Expandable nanomaterials

The regularity of our blocks in principle enables scaling the size of nanomaterial designs simply by changing the number of repeats in the constituent THRs without altering any of the inter-block interfaces. How the THRs must be aligned to enable expandability differs for each architecture, as described below.

To construct expandable cyclic assemblies, the linear THRs must be placed such that the propagation axis is normal to the cyclic symmetry axis. For cyclic designs with this property (those built from turn modules or heterodimers; see Supplementary Discussion), adding or removing repeats simply changes the length of the oligomer edge without affecting the interface between monomers. We tested this expandability with a C4 ‘square’ (sC4) oligomer for which we had obtained a cryo-EM reconstruction with 1.6-Å-backbone RMSD (Supplementary Fig. 10). This subunit consists of a central linear THR flanked by straight-helix heterodimers that produce a 90° turn. To expand this structure, we inserted two additional repeat units (six helices) into the linear THR portion of the subunit. Cryo-EM 2D class averages for both the original and expanded square show close agreement to the design models and clear expansion; the helices clearly remain aligned to the z axis as designed (Fig. 5a).

Architectures with polyhedral nanocage symmetry can be similarly expanded provided that the linear THR propagation axis is parallel to the plane formed by the two symmetry axes spanned by the THR (Supplementary Fig. 11a). To generate such architectures, and enable further access to construction in three dimensions, we designed out-of-plane interactions between building blocks. We first focused on designing C2 symmetric interfaces in which the angles between linear THRs correspond to the angles needed to generate regular polyhedral symmetry (Fig. 4) when combined with planar C3 or C4 components, while also satisfying expandability criteria. For an octahedral ‘cube’ (O4) built from flat objects with C4 symmetry that lie on the ‘cube faces’, this angle is 90°. For tetrahedra (T3), octahedra (O3) and icosahedra (I3) built from flat C3-symmetric objects, the out-of-plane handshake angles that are needed to join the flat objects are 70.5°, 109.5° and 138.2°, respectively22,23. Handshake C2 homodimers were generated by fixing this out-of-plane angle and keeping the linear THR propagation axes parallel to each other, sampling only the offset spacing between the THRs8 (Supplementary Fig. 12).

Fig. 4: Modular construction of protein nanocages from THRs.
figure 4

a, In box: regular nanocages are constructed from curve THR rings with linear arms projecting outwards that can be linearly extended (left) and designed handshake C2 interfaces (right) that hold the rings at the angle required for the desired polyhedral symmetry. THR chains that are fused into one chain are shown in blue, with identical backbone and sequence areas used for concatenation indicated in dotted outlines. beC2 handshake modules generating the required angles (top left in each panel, be) are combined with either C3 (bd) or C4 (e) versions of the ring (shown below the box in a) to construct nanocages in be. Design models are shown as helix cylinders, with the asymmetric unit in blue, and the remaining copies in purple. Each cage is overlaid with the 3D cryo-EM reconstruction or ns-EM (marked with an asterisk here and in subsequent figures) reconstruction, with representative paired 2D classes on the right (left is experimental; right is computed from design model). The blue dots indicate the location of the handshake angle in the cage. b, The T3 tetrahedral design cage_T3_101 uses a C3 ring and a 70.5° C2 handshake. c, The O3 octahedral design cage_O3_20 uses a C3 ring and a 109.5° C2 handshake. d, The I3 icosahedral design cage_I3_8 uses a C3 ring and a 138.2° C2 handshake. e, Th O4 octahedral design cage_O4_34 uses a C4 ring and a 90.0° C2 handshake. Scale bars, 20 nm (b), 27 nm (c), 52 nm (d) and 22 nm (e).

To generate expandable nanocages, flat cyclic components that form the faces of the cages were linked through noncovalent handshake interactions at the specified angle. For the flat cyclic component, we used a ring design with 12 repeats (R12B; Fig. 2c) constructed from curve units, and split the 12 repeats into either 3 subunits with 4 repeats each (C3) or 4 subunits with 3 repeats each (C4; Supplementary Fig. 3d), depending on the desired polyhedral symmetry architecture. We then fused linear THR arms onto each subunit constrained to point outward parallel to a radial vector emanating from the symmetry axis, but offset such that when the C2 interface is formed, the C2 axis is along a radial vector (Fig. 4a and Supplementary Fig. 12). Tetrahedral, octahedral ‘cubic’ and icosahedral structures with C3 rings at respective axes (T3, O3 and I3), and octahedral structures with C4 rings at the respective axes (O4) were constructed by incorporation of the appropriate C2 interface. For example, to make a ‘cubic’ octahedral nanocage, we incorporate into the C4 ring arm the 90° C2 handshake module (Fig. 4e) by simple sequence concatenation. Synthetic genes were obtained for 13 nanocage designs; all 13 expressed solubly, 10 had SEC elution profiles that suggested cage formation, 8 yielded particles with the expected size by ns-EM, and 7 gave 2D class averages and symmetric 3D reconstructions that resembled the design models. Successful designs for each architecture are shown in Fig. 4b–e and Supplementary Figs. 1j and 12. Designed geometric features including the spindle-like twofold handshake interface and the flat ‘in-plane’ ring areas with distinct holes are clearly evident. For the T3 and O4 cages, the correct species dominated, but in O3 and I3 cages there were noticeable populations of species that were either partially formed or broken under ns-EM conditions (Supplementary Fig. 13). A 7.5-Å cryo-EM reconstruction and an experimental model were obtained for the cubic cage built from tetrameric rings on the faces (cage_O4_34) that were very close to the design model, with the straight helices clearly evident and only very slight deviations in the arm alignment (Fig. 4e and Supplementary Fig. 14). A 4.0-Å cryo-EM reconstruction and an experimental model for the tetrahedral cage_T3_101 were similarly very close (Fig. 4b and Supplementary Fig. 41). These results illustrate the robustness of structures that can be assembled from our regularized building blocks using simple ‘snapping together’ of complementary pieces in three dimensions, and show that with additional reinforcing mechanisms such as cooperativity, structural specificity can be achieved without traditional ‘knob-and-hole’ helix–helix interactions24.

We tested the expansion in all three dimensions of the cubic design (Fig. 4e and Supplementary Fig. 14) by increasing the number of repeat units in the linear arm. We generated four different sizes of the cage_O4_34 by increasing the number of THR helices in the arm by +0, +4, +8 or +12 helices (Fig. 5b and Supplementary Fig. 13). For all sizes, ns-EM 2D class averages (Fig. 5b, bottom row) show all three symmetrical views with the designed increases in size but otherwise close preservation of architecture. Three-dimensional ns-EM reconstructions were consistent with corresponding design models, with the overall cube shape and ring circular pore clearly visible in each of the sizes (Fig. 5b, top row). The first three sizes of cage show primarily intact assemblies across the ns-EM grids; for the largest size (+12), some incomplete assembly was also observed (Supplementary Fig. 13). Additional single-component expandable nanocage designs are described in Supplementary Figs. 13, 16 and 17 and the Supplementary Discussion.

Fig. 5: Extendable THR-based nanomaterials.
figure 5

a, An extendable C4 square. Left: the base design (sC4); right, an expanded version with six additional helices per chain (sC4_+6). Far left and right: representative cryo-EM micrographs (scale bars, 100 nm) with adjacent 2D class averages. A 3D reconstruction is superimposed on the base size design model. Chain breaks are in the red heterodimer region. The length of the side of the square (bar labeled x) is indicated beneath the 2D class averages. b, Expandable O4 octahedral handshake nanocage, cage_O4_34. Top left to right, design models and cryo-EM or ns-EM reconstructions (the asterisk denotes ns-EM on +8 and +12) of designs extended by 0, 4, 8 and 12 helices. Bottom (left): ns-EM 2D class averages along three symmetry axes (rows) for increasing size designs (columns). The inserted helices are shown in yellow for each case. Scale bars, 30 nm (for 2D class averages). The length of a side of each cube (x, measured between the outside corners of the handshake) determined from the EM map volumes is indicated beneath the class averages. c, An expandable two-component O43 nanocage cage_O43_129. Bottom row: the second size (+4; lowest deviation cryo-EM reconstruction) is shown with a symmetrized design model. For the other sizes, individual oligomers were fitted into the EM density (cryo-EM for +0 and +8 sizes; ns-EM for +12 size) so that rotational deviations from the ideal designs can be analysed (Supplementary Fig. 22). The extendable trimer component is in blue and the constant tetramer component is in red. Right, above 3D cage models: ns-EM 2D class averages along the three symmetry axes for each of the four sizes. The dimension x is the experimentally determined distance between the centre of mass of neighbouring C4 and C3 components. Scale bar, 25 nm (for 2D class averages).

We next designed two-component expandable nanocages by locking the rotation degrees of freedom of a THR-containing building block to maintain the expandability constraint (Methods), and then docking it against a freely sampling partner oligomer to form an O43 architecture (Fig. 5c, Supplementary Discussion and Supplementary Figs. 1923). Expandability over four different sizes was achieved with cage_O43_129 (+0, +4, +8 and +12 helices). The internal structure of the oligomers is clearly resolved in cryo-EM reconstructions for the first three sizes and in ns-EM reconstruction of the largest size; the distance between the centre of mass of the tetramer component to the centre of mass of the trimer component across the different sizes is 7.9, 9.4, 11.3 and 11.7 nm respectively (Fig. 5c and Supplementary Fig. 21). Views down each of the three symmetry axes (twofold, threefold and fourfold) are clear for each size (except for the threefold view in the largest size) with slight rotational deviations of the fourfold cyclic component compared to the design model, whereas the rotation of the threefold cyclic component holding the THR remains unperturbed as designed (Fig. 5c and Supplementary Fig. 22). A fifth size (16 additional helices) assembled into cage-like structures but the populations were too heterogeneous for detailed characterization (Supplementary Fig. 21).

For unbounded architectures that extend along one or more axes, extensibility requires that the linear THR propagation axes be parallel to the extension axes. We constructed an antiparallel assembly with an overall train track shape from THR modules (Fig. 6a). The ‘rails’ of the track are linear THRs that are uncapped to allow for unbounded linear assembly end-to-end, and C2 ‘ties’ dock onto branch interfaces on the sides of the rails, organizing them into strutted antiparallel pairs. Adding repeats to the rails increases the spacing between ties (along the helical axis) and adding repeats to the ties increases the separation distance between rails along a different axis (Fig. 6b). We used 12-helix addition to the rail to double the spacing between ties, and 8-helix addition to the tie to roughly double the length of the tie. For the four combinations of component sizes, we obtained ns-EM 2D class averages consistent with the design models (compare Fig. 6b and Fig. 6d). Train track assembly was robust to fusion of mScarlet-i on rails both at termini and in an internal loop (Supplementary Fig. 24b), and sfGFP on the ties25,26, as monitored by ns-EM, with density observed for the GFP, Supplementary Fig. 24c).

Fig. 6: Designed train track fibres.
figure 6

a, Components of the train track designs: a branch module, a split module and a C2 interface module. b, Train track designs. The asymmetric unit is outlined in the top left design; red and blue are unique protein chains. Taking advantage of the extensibility of the building blocks, four different train track designs were created, starting with the design at the top left. Top right: increasing the spacing between rungs by expanding the split module. Bottom left, increasing the length of the rungs by expanding the C2 module. Bottom right: increasing both rung spacing and length by expanding both the split and C2 modules. Inserted segments are in yellow. c, An ns-EM micrograph of the design at the top left of b. Scale bar, 25 nm. d, ns-EM 2D class averages of the four designs in b, shown in the same overall layout. The design models are overlaid on the class averages at 1:1 scale. Scale bar, 25 nm.

Discussion

On determining the first low-resolution model of the structure of a globular protein (myoglobin), John Kendrew wrote in 1958 that “Perhaps the most remarkable features of the molecule are its complexity and its lack of symmetry. The arrangement seems to be almost totally lacking in the kind of regularities which one instinctively anticipates”27. More than six decades of structural biology research have shown this to be a generally appropriate description of protein structure1. Figures 25 show that this complexity is not an inherent feature of the polypeptide chain: the simplicity and regularity of our designed materials approaches that of the wooden beams used for constructing a house frame. This enables the resizing of designed materials in two and three dimensions simply by changing the numbers of repeat units in the THR modules with little or no need for detailed design calculations; previously this has been possible only with coiled coils and repeat proteins with open helical symmetries (propagating along a single axis)9,15,28. The flat surfaces and regular geometry have immediate applications to the design of bio-mineralizing systems: THR monomers presenting carboxylate groups in regular arrays nucleate the mineralization of carbonate into calcite29, and expandable THR systems such as the cubic assemblies in Fig. 3 presenting such arrays could provide a route to hierarchical protein–mineral hybrid materials.

There are exciting paths forwards to further increase the capabilities of our programmable THR platform. First, our current multi-subunit assemblies have high symmetry, and assembly of arbitrary nanostructures would require breaking symmetry—one approach to achieving this would be to build heterodimeric and heterotrimeric interfaces between THRs, which would enable considerable shape diversification and addressability of each protein chain30. This would allow access to a broad range of asymmetric nanostructures, as with DNA nanotechnology bricks, tiles and slats31,32,33,34,35, but with the higher precision and greater functionality of proteins. Second, the materials generated here all form through self-assembly, but as the number of components increases the overall yield of the desired product could decrease. This limitation could potentially be overcome by stepwise solid-phase assembly with crosslinking after addition of each THR component (as in solid-phase peptide or DNA synthesis, but in three dimensions with the location of addition specified by non-covalent interactions between the THRs; the analogue in construction is nailing lumber pieces together after alignment). The combination of symmetry breaking and stepwise assembly would enable the design of a very wide range of protein nanomaterials based on simple geometric sketches that could be readily genetically modified to present a wide variety of functional domains in precisely controllable relative orientations.

Methods

Computational and experimental methods are all provided in the Supplementary Information.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.