From structural polymorphism to structural metamorphosis of the coat protein of flexuous filamentous potato virus Y

Kavčič, Luka; Kežar, Andreja; Koritnik, Neža; Žnidarič, Magda Tušek; Klobučar, Tajda; Vičič, Žiga; Merzel, Franci; Holden, Ellie; Benesch, Justin L. P.; Podobnik, Marjetka

doi:10.1038/s42004-024-01100-x


Article
Open access
Published: 17 January 2024


Communications Chemistry volume 7, Article number: 14 (2024)

Abstract

The structural diversity and tunability of the capsid proteins (CPs) of various icosahedral and rod-shaped viruses have been well studied and exploited in the development of smart hybrid nanoparticles. However, the potential of CPs of the widespread flexuous filamentous plant viruses remains to be explored. Here, we show that we can control the shape, size, RNA encapsidation ability, symmetry, stability and surface functionalization of nanoparticles through structure-based design of CP from potato virus Y (PVY). We provide high-resolution insight into CP-based self-assemblies, ranging from large polymorphic or monomorphic filaments to smaller annular, cubic or spherical particles. Furthermore, we show that we can prevent CP self-assembly in bacteria by fusion with a cleavable protein, enabling controlled nanoparticle formation in vitro. Understanding the remarkable structural diversity of PVY CP not only provides possibilities for the production of biodegradable nanoparticles, but may also advance future studies of CP’s polymorphism in a biological context.

Introduction

Single-stranded RNA (ssRNA) viruses account for nearly half of all plant viruses. Most of them have only one type of structural protein, a capsid (or coat) protein (CP), and form either rod-shaped or flexuous filamentous virions¹. The latter are much more common², with viruses of the genus Potyvirus (family Potyviridae) representing the largest group³. Potyviruses have a major economic impact and are responsible for more than half of the world’s viral crop damage⁴. Their genomic positive-sense ssRNA of about 10 kb generally encodes ten proteins³, including the CP, whose copies form a flexuous filamentous capsid with left-handed helical symmetry around the viral ssRNA, as shown by the cryo-EM structures of watermelon mosaic virus (WMV), potato virus Y (PVY), and turnip mosaic virus (TuMV)^5,6,7. CP consists of a highly conserved globular core flanked by two large extended regions with a high frequency of structural disorder (Fig. 1a)^5,6,7,8. The C-terminal high intrinsic disorder region (C-IDR) is partially conserved and packaged in the lumen of the virion, supported by the helical ssRNA scaffold. The N-terminal IDR (N-IDR) exhibits very low amino acid conservation. It is exposed on the outer surface of the virion and is critical for the flexible nature of virions, connecting the CP units longitudinally and perpendicular to the filament axis^5,6,7. This is in contrast to viruses from the genera of rigid rod-shaped tobamoviruses or hordeiviruses, where the relatively short C- and N-terminal structural elements are exposed on the outer surface of CP, and the connection between the CP units is established by the wedge-shaped CP cores^9,10.

**Fig. 1: Structural polymorphism of recombinant PVY CP.**

Potyviral CP plays a role in virtually every step of the viral infection cycle, from transmission of the virion by aphids, virus assembly and disassembly, regulation of genome amplification, protein translation, to cell-to-cell and long-distance movement¹¹. The structural context in which CP acts during the different phases of the viral cycle is not yet known; however, the intrinsic structural plasticity of CP^5,6,7,8, contributed mainly by both IDRs, seems to play a crucial role^6,8.

The presence of different structural states or pleomorphism of viral capsids is quite common in enveloped^12,13, icosahedral¹⁴ or even some helical viruses^10,15,16 and has been associated with certain stages of their life cycle^17,18. In addition, structural polymorphism is a well-known feature of recombinantly produced virus-like particles (VLPs) derived from icosahedral or rod-shaped viruses^14,15,16. CP and its mode of self-assembly can be modulated using structural synthetic virology approaches, resulting in symmetric nanoparticles of different shapes and sizes with specific material properties that have great potential for medical, biotechnological, or smart material applications^14,19,20,21.

Although the structures of several flexuous filamentous potexviruses^22,23,24,25,26 and potyviruses^5,6,7 have been recently determined, information on the structural diversity of these viruses and their VLPs is lacking^6,7,23,26,27. In our work, we have investigated the structural landscape of self-assemblies formed by potyviral CP. While the structural analysis of natural supramolecular complexes formed by CPs during the viral life cycle is challenging due to the very complex and dynamic natural context, the successful production of recombinant potyviral VLPs has been reported for different expression systems, mainly plants or bacteria²⁸. Interestingly, the cryo-EM structure of PVY VLPs prepared from bacteria, determined at 4.1 Å resolution, showed a markedly different architecture of VLP filaments than the structure of PVY virions, as they consisted of stacked octameric CP rings and did not contain RNA⁶. On the other hand, the structure of TuMV VLPs produced by transient expression in tobacco at 8.0 Å resolution indicated an RNA-free filamentous arrangement of CP units in left-handed helical symmetry⁷. Finally, the structure of VLPs determined at 2.6 Å resolution based on CP of another potyvirus, sweet potato feathery mottle virus (SPFMV), and produced in tobacco by transient expression in the presence of a replicating RNA, showed a virus-like architecture, with ssRNA directing the helical arrangement of CPs along the filament²⁷. These studies showed that a specific potyviral CP self-assembles into filaments of a single architectural type under selected experimental conditions, but this type differed among the three experimental arrangements, suggesting that polymorphism may also exist within a species of CP under certain conditions. In this study, we found that the wild type PVY CP can indeed form three architecturally distinct types of VLPs simultaneously.
Furthermore, through structure-based engineering of PVY CP, we discovered that we can control the formation of a wide range of highly ordered supramolecular assemblies, their architecture, RNA encapsidation, and molecular properties. These can range from various filamentous to ring-shaped, cubic or spherical assemblies with high symmetry, most of which form without a template. To avoid spontaneous CP self-assembly in a complex bacterial environment, we have developed a system for CP self-assembly in vitro that allows the controlled formation of nanoparticles with desired properties. This remarkable structural diversity of PVY CP nanoparticles makes them great candidates for nanobiotechnological applications. Moreover, the high-resolution details about the structural plasticity of PVY CP could pave the way for a better understanding of CP polymorphism in a biological context.

Results

Recombinant PVY CP self-assembles into three architecturally distinct types of VLPs

To investigate the potential of PVY CP to form polymorphic assemblies, we produced PVY VLPs in bacteria. A comprehensive analysis of the cryo-EM data revealed two new filament architectures (Fig. 1b, c; Table 1; Supplementary Figs. 1a, 2) in addition to the predominant RNA-free stacked ring assembly (VLP^r) that we had previously observed⁶. Of the picked particles, 25% exhibited left-handed helical symmetry with no RNA packed inside (VLP^h), similar to TuMV VLPs⁷. The remaining 8% of picked particles also exhibited left-handed helical symmetry and encapsidated RNA (VLP^h+RNA), closely resembling the PVY virion⁶ (Fig. 1c; Supplementary Table 1) and SPFMV VLPs²⁷. An improved data analysis procedure seemed to play a crucial role here, as such a distribution of polymorphic filaments was also obtained by reprocessing our previous data⁶ (Supplementary Fig. 1b).

Table 1 Cryo-EM data collection, refinement and validation statistics.

The C-IDR is structurally defined only in VLP^h+RNA filaments, i.e. in CP^h+RNA, where the helical scaffold of RNA supports the cone-like organization of C-IDRs in the lumen of the filament (Fig. 1c, d). In the absence of RNA in VLP^r and VLP^h, the C-IDR in CP^r and CP^h is disordered, with no traceable cryo-EM density beyond A222 (Supplementary Fig. 2a). The fold of the CP core domain is conserved between the polymorphic filaments, except for the conserved RNA-binding loop S125-G130⁶, which adopts different conformations in the presence or absence of RNA (Fig. 1d). In all three structures, there was no cryo-EM density for the first 41 residues of N-IDR, exposed on the outer surface of the filaments (Supplementary Fig. 2a). Beyond H42, N-IDR in CP^h+RNA folds similarly to that in PVY virus (Supplementary Fig. 2b), whereas N-IDR in CP^r and CP^h takes a different turn at K53 (Fig. 1d). Thus, the structural plasticity of PVY CP, in particular the two IDRs and the conserved RNA-binding loop S125-G130, enables the polymorphism of PVY VLPs produced in bacteria. In the absence of ssRNA, the N-IDRs adopt two slightly different conformations, as seen in CP^r and CP^h (Supplementary Fig. 2c) allowing the formation of two different types of RNA-free filaments (Supplementary Fig. 2d), with the stacked octameric ring assembly being the more stable and therefore predominant form.

C-terminal truncation of CP reduces the architectural diversity of VLPs

Potyviral CP with deleted C-IDR still forms filaments^6,29. Because C-IDR is structurally defined in wild type VLPs only in the presence of ssRNA (VLP^h+RNA), we examined how the absence of C-IDR affects filament architecture.

We prepared VLPs consisting of the CP units lacking 40 C-terminal residues, i.e. the whole C-IDR (CP^ΔC40), and analyzed them by cryo-EM (Fig. 1e). Interestingly, we detected only two different architectures of filaments: 65% of them had the stacked-ring architecture (VLP^ΔC40:r) and 35% had left-handed helical symmetry and encapsidated RNA (VLP^ΔC40:h+RNA) (Fig. 1e; Supplementary Fig. 3). The structures of the CP^ΔC40:r and CP^ΔC40:h+RNA subunits and the helical parameters of their VLPs were comparable to those of their wild type counterparts (Supplementary Fig. 4a; Table 1). This indicates that luminal C-IDR is not essential either for filament formation or RNA encapsidation. The absence of C-IDR facilitates the accessibility of the RNA-binding site, resulting in a significantly increased proportion of VLP^ΔC40:h+RNA filaments. VLP^ΔC40:r and VLP^ΔC40:h+RNA filaments are long, flexible and hollow nanotubes with an inner diameter of 4.1 nm and 3.7 nm, respectively (Fig. 1e; Supplementary Fig. 4b).
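The shift in polymorph distribution described above can be made explicit with a short calculation. The sketch below uses only the percentages quoted in the text; the wild type VLP^r share is not stated and is inferred here as the remainder, under the assumption that the three classes account for all picked particles. Note also that the wild type values refer to picked particles and the CP^ΔC40 values to filaments, so the ratio is only a rough comparison.

```python
# Class shares (percent) as quoted in the text.
wild_type = {"h": 25.0, "h+RNA": 8.0}
# Assumption: the three wild type classes sum to 100%, so VLP^r is the remainder.
wild_type["r"] = 100.0 - sum(wild_type.values())

delta_c40 = {"r": 65.0, "h+RNA": 35.0}


def fold_change(mutant, reference, cls):
    """Ratio of a class's share in the mutant to its share in the reference."""
    return mutant[cls] / reference[cls]


rna_fold = fold_change(delta_c40, wild_type, "h+RNA")
print(f"Inferred wild type VLP^r share: {wild_type['r']:.0f}%")
print(f"h+RNA share changes {rna_fold:.2f}-fold upon C-IDR deletion")
```

Under these assumptions, the stacked-ring share stays nearly constant, while the RNA-encapsidating share grows largely at the expense of the RNA-free helical form, which was not detected for CP^ΔC40.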

To prevent encapsidation of ssRNA, we further truncated the C-terminus, excluding 60 C-terminal residues (CP^ΔC60) containing both the C-IDR and the α8-helix, which is placed opposite to the RNA-binding loop and forms part of the RNA-binding cleft (Fig. 1d). This indeed led to formation of RNA-free filaments. Interestingly, the filaments had monomorphic helical architecture, VLP^ΔC60:h, with unique helical parameters (Supplementary Figs. 3, 4c; Table 1). Because we found that 19 C-terminal residues of this construct, including those of the α7-helix (Fig. 1d), were not defined by cryo-EM density, we prepared another deletion mutant, CP^ΔC79, excluding these residues (Fig. 1f; Supplementary Fig. 3). The VLP^ΔC79:h filaments had a similar structure to VLP^ΔC60:h, but a better-defined C-terminal part (Fig. 1g; Supplementary Fig. 4d; Table 1). The disrupted RNA-binding cleft in CP^ΔC60/CP^ΔC79 thus lost the ability to bind RNA, and consequently the RNA-binding loop S125-G130 adopted the conformation found in the RNA-free filaments VLP^h and in VLP^r (Fig. 1g; Supplementary Fig. 4d). On the other hand, the fold of the N-IDR in CP^ΔC60/CP^ΔC79 resembled that found in RNA-encapsidating filaments, consistent with the CP-CP distances along the VLP^ΔC79:h filament being closer to VLP^h+RNA than to VLP^h (Fig. 1h). Thus, disruption of the RNA-binding cleft by large truncation of the C-terminal part of CP prevents RNA binding and also leads to the formation of monomorphic VLPs. VLP^ΔC79:h filaments are compact flexible hollow nanotubes with an inner channel diameter of 5.4 nm (Fig. 1f), with thermal stability higher than that of wild type VLPs and comparable to that of PVY virions (Fig. 1i).

Simultaneous truncation of both CP IDRs leads to formation of stable octameric rings

The CP N-IDR plays an important role in filament assembly by participating in inter- and intra-ring interactions⁶. Moreover, we have shown here that the structural plasticity of this region enables variability in the packing arrangement of CP subunits in filaments, and thus polymorphism (Fig. 1d; Supplementary Fig. 2c). We have previously shown that truncation of the N-IDR at G40 results in an insoluble protein, whereas CP^ΔN49 and CP^ΔN49C40, the latter with both IDRs truncated, self-assemble into single octameric rings and their short stacks⁶. To facilitate sample preparation for further structural analysis, we attached the His₆-tag to the C-terminus of CP^ΔN49C40 (trCP) (Fig. 2a). Cryo-EM revealed that the affinity-purified sample consisted predominantly of trCP double octameric rings assembled in a head-to-tail (H2T) orientation (Fig. 2a; Supplementary Fig. 5). In addition, we observed some shorter filaments (<5%), with RNA-free stacked-ring or helical architecture, with helical parameters similar to those of VLP^ΔC79:h (Fig. 2a; Supplementary Fig. 6a).

**Fig. 2: CP with truncated IDRs preferentially forms (double) octameric rings.**

In wild type VLP^r filaments, the N-IDR is responsible for the axial connection of the octameric rings, with no obvious interactions between the core regions of the CPs (Supplementary Fig. 2c)⁶. The truncation of N-IDR reduces the axial separation of the rings by 3.9 Å and shifts the twist angle in H2T double rings compared with VLP^r filaments (Fig. 2b; Table 1). The blob of density in the center of the two rings (Fig. 2c), which was already observed in the 2D class averages, was assigned to a cluster of His₆-tags, because it was absent in the 2D class averages after the removal of the His₆-tag, which also led to the dissociation of double rings into single rings (Fig. 2c).

The octameric ring exhibits pronounced charge anisotropy, with positive (P-side) and negative (N-side) charge predominating on the opposite surfaces (Fig. 2d), which explains the ionic strength-dependent size distribution of the self-assembled particles (Supplementary Fig. 6b).

Single amino acid substitutions at the N-side restore filament formation

To investigate how disturbance of electrostatics affects the interactions between the H2T double rings, we substituted individual nonconserved amino acids in the core region pointing to the interface between the two trCP rings (Fig. 2e; Supplementary Fig. 7).

The substitutions on the N-side of the ring, trCP^L99C, trCP^K153E and trCP^E150C, showed a markedly increased tendency to form RNA-free assemblies larger than double rings (Fig. 3a-d; Supplementary Figs. 8, 9). The formation of double rings was negligible in the case of trCP^L99C and trCP^K153E, and instead we observed the formation of exclusively RNA-free filaments with helical (predominant form) or stacked ring architecture (Fig. 3b, c; Supplementary Fig. 8). In the case of trCP^E150C, the assortment of particles was more heterogeneous, ranging from double rings to filaments. trCP^E150C filaments accounted for only around 45% of the observed particles, with helical and stacked ring architectures represented to a similar extent (Supplementary Fig. 9). Interestingly, among various types of particles in the trCP^E150C sample, we detected a significant proportion of two novel architectures (Fig. 3d). One of them had a central cube-shaped body composed of six orthogonally arranged octameric rings, which grew outwards by stacking copies of the rings to form cross-shaped junctions (Fig. 3d middle). In the second architectural type, the octameric rings joined to form a central spherical body on whose surface additional rings stacked in at least one direction (Fig. 3d right; Supplementary Fig. 9c). Overall, single amino acid substitutions of selected nonconserved residues on the N-side increased the stickiness of the surface and restored the formation of filamentous assemblies.

**Fig. 3: N-side mutations resume formation of filaments and lead to novel architectures of filament junctions.**

Single amino acid substitutions at the P-side lead to the formation of flipped double rings, cubic and spherical particles

Single amino acid substitutions at the P-side led to the formation of architecturally more homogeneous particles (Fig. 4a; Supplementary Fig. 10). trCP^K176E, trCP^G193D and trCP^G193C assembled exclusively into double octameric rings, with cryo-EM reconstruction of trCP^K176E revealing a head-to-head (H2H) arrangement of the two rings (Fig. 4b, c; Supplementary Figs. 10a, b, 11). Again, the two rings are held together by His₆-tags (Supplementary Fig. 11c–f), but their central axis is slightly tilted compared with the trCP H2T double rings (Fig. 4c). However, no further stacking of H2H double rings or formation of filaments was detected, possibly due to the fact that both N-side surfaces in the double ring are exposed to the exterior.

**Fig. 4: P-side mutations lead to novel octameric-ring assemblies, such as H2H double rings, cubes and spheres.**

SEC analysis of the P-side mutants trCP^K176C, trCP^K176S, and trCP^K177E indicated the formation of larger particles than double rings (Fig. 4a; Supplementary Figs. 12, 13). Cryo-EM 2D class averages showed that most of these particles had a cubic shape (Fig. 4b; Supplementary Fig. 10c). 3D reconstruction of trCP^K176C with an overall resolution of 3.0 Å revealed the cubes consisted of six orthogonally arranged rings (Fig. 4c; Supplementary Fig. 12), with no additional stacking of rings as found in trCP^E150C (Fig. 3d middle). In the case of trCP^K177E, around 30% of the particles had a spherical shape, consisting of 9 rings (Fig. 4c; Supplementary Fig. 13), similar to the spherical core of the particles formed by trCP^E150C, but again without further stacking of rings on the exposed N-side.

Overall, selected amino acid substitutions of the nonconserved residues on the P-side facilitated the association of the octameric rings, with the P-sides involved in the interactions and the N-sides exposed to the exterior. This led to the formation of smaller particles, such as H2H double rings, and cubic or spherical assemblies of rings.

The cubic particles are stabilized by hydrophobic interactions and contain CP-derived cargo

To better understand what drives the orthogonal assembly of the trCP-derivatives, we analyzed the interactions in the locally refined 3D reconstruction of the trCP^K176C cubes of 3.2 Å resolution (Supplementary Fig. 12; Table 1). This revealed that the hydrophobic interactions between the P-sides of the ring pairs on the C2 symmetry axis are crucial for stable assembly (Fig. 5a; Supplementary Fig. 14a). Two subunits of each interacting ring contribute to stabilization, one through the residues of the α5-helix from the core and the other through N-IDR residues, including the α1-helix (Fig. 5a). No disulfide bond was observed between the rings, because the C176 residues in the adjacent rings are too far apart. M54 in N-IDR is crucial for maintaining the interactions, as replacement by Cys in trCP^M54C+K176C resulted in the formation of H2H double rings instead of cubes (Supplementary Fig. 14b, c).

**Fig. 5: The orthogonal assembly of octameric rings into cubes is driven by electrostatics and stabilized by hydrophobic interactions.**

The center of each octameric ring contained a blob of density, which disappeared after removal of the His₆-tags (trCP^K176C-noHis) without affecting the cubic architecture (Fig. 5b; Supplementary Fig. 15a). Another blob of density was observed in the center of all cubic assemblies (Fig. 5c; Supplementary Fig. 15b), indicating the presence of putative cargo. Native mass spectrometry analysis of trCP^K176C (Supplementary Fig. 15c) revealed charge series around 10,000 m/z, consistent with a double ring, and unresolvable peaks around 18,500 m/z, which we tentatively assign to a fully formed cubic particle. To circumvent the challenge posed by this heterogeneity, we obtained mass photometry data for trCP^K176C, trCP^K176C-noHis and trCP^K176S (Supplementary Fig. 15d–f). We measured masses of ~1.3 MDa for each, a mass higher than expected based on 48 copies of the protomers and consistent with a central cargo of approximately 250–350 kDa (Fig. 5c; Supplementary Fig. 15d, f). When the cubes were disassembled under denaturing conditions, no significant impurities were identified in the denatured spectra besides the mass of the monomer (Fig. 5d; Supplementary Fig. 15e, g). A small population of covalently associated dimers was present only in trCP^K176C and trCP^K176C-noHis. However, these CP dimers did not originate from the octameric rings assembling the cubes, as no disulfide bonds were observed within or between the octameric rings (Fig. 5a). Although we cannot assess at this point how important the central protein mass is for self-assembly, our results clearly indicate that no molecular species other than the subunits of the trCP mutant are required for the formation of these cubic particles.
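The mass balance behind the cargo estimate can be written out as a back-of-the-envelope check. The sketch below uses only numbers given in the text (six octameric rings per cube, ~1.3 MDa measured mass, 250–350 kDa cargo) and back-calculates the protomer mass these figures imply. The protomer mass itself is not quoted in the text, so the resulting range is an inference, not a reported value.

```python
RINGS_PER_CUBE = 6          # six orthogonally arranged rings (from the text)
SUBUNITS_PER_RING = 8       # octameric rings (from the text)
MEASURED_MASS_KDA = 1300.0  # ~1.3 MDa by mass photometry (from the text)
CARGO_RANGE_KDA = (250.0, 350.0)  # estimated central cargo (from the text)

protomers = RINGS_PER_CUBE * SUBUNITS_PER_RING


def implied_protomer_mass(total_kda, cargo_kda, n_protomers):
    """Protomer mass implied if the shell accounts for total minus cargo."""
    return (total_kda - cargo_kda) / n_protomers


# A larger cargo leaves less mass for the shell, hence a lighter protomer.
lightest = implied_protomer_mass(MEASURED_MASS_KDA, CARGO_RANGE_KDA[1], protomers)
heaviest = implied_protomer_mass(MEASURED_MASS_KDA, CARGO_RANGE_KDA[0], protomers)

print(f"Protomers per cube: {protomers}")
print(f"Implied protomer mass: {lightest:.1f} to {heaviest:.1f} kDa")
```

The same relation can be run in the other direction: given a measured monomer mass from the denatured spectra, the cargo mass follows as the measured total minus 48 times the monomer mass.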

Given the highly symmetrical distribution and exposure of the C-termini on the surface of the octameric rings in the cubes, we replaced the C-terminal His₆-tag on trCP^K176C with the SpyTag³⁰ (trCP^K176C-SpyTag). Cubic particles similar to those with C-terminal His₆-tags were formed (Supplementary Fig. 16).

Given the rather unexpected result of self-assembly of the trCP-mutants into cubes, we investigated whether the preferential orthogonal assembly of the K176C mutant compared with the wild type trCP could be predicted by the coarse-grained molecular dynamics simulations (Fig. 5e). Starting from randomly distributed octameric trCP^noHis or trCP^K176C-noHis rings in aqueous solution, the trCP^K176C-noHis rings were indeed more prone to form orthogonal ring assemblies (triplets) (Fig. 5e; Supplementary Fig. 17) than the nonmutant trCP^noHis.

In summary, the cubic assemblies of selected P-side trCP mutants are composed exclusively of CP-derived units. Their 48 surface-exposed C-termini can be modified to carry (removable) affinity tags such as the His₆-tag or the SpyTag.

Self-assembly can be controlled by fusion of heterologous proteins with CP

The supramolecular assemblies described above were purified directly from bacterial cell lysates. Next, we developed a system to prevent the self-assembly process in the expression system and instead trigger it in a controlled environment in vitro. To prevent the formation of filaments with C-IDRs packed in the lumen of the filament (Fig. 1c), we fused the 43-kDa maltose-binding protein (MBP) to the C-terminus of CP (Fig. 6a). Indeed, the CP-MBP fusion did not form filaments, and we were able to isolate the monomeric CP-MBP units (Supplementary Fig. 18a). The purified monomeric fraction of CP-MBP was then exposed in vitro to the tobacco etch virus (TEV) protease, which released MBP from CP, resulting in the formation of RNA-free filaments (ivVLP^WT) (Supplementary Fig. 18b, c). We then applied this procedure to the CP^ΔC40 fusion with MBP (CP^ΔC40-MBP). In contrast to the VLP^ΔC40 formed in bacteria (Fig. 1e), the filaments produced in vitro were architecturally nearly homogenous, with 97% of the RNA-free stacked-ring architecture (ivVLP^ΔC40:r) (Fig. 6b; Supplementary Fig. 18d, e). Furthermore, this concept was successfully used for the in vitro triggered assembly of nanocubes, ivtrCP^K176C (Fig. 6c; Supplementary Fig. 18f–h). Also in this case, cryo-EM 3D reconstruction revealed a cargo in the center of the cubes (Supplementary Fig. 18h).

**Fig. 6: In vitro triggered self-assembly of engineered nanoparticles.**

In summary, spontaneous self-assembly of CP and its derivatives in the bacterial expression system can be prevented by fusion of a heterologous protein at their C-termini. In vitro triggered self-assembly by proteolytic release of the fused protein leads to the formation of highly ordered RNA-free nanoparticles.

VLPs can be further stabilized by introducing disulfide bonds between CPs

It has already been shown for filamentous protein or peptide self-assemblies^31,32,33 that the introduction of Cys residues at the interfaces axially connecting the subunits increases the stability of such particles. To investigate this possibility in the case of flexible PVY VLP filaments, we introduced disulfide bonds between adjacent CP subunits based on the VLP^r structural model (Fig. 7a). The double Cys mutants of the full-length CP, T43C+D136C, L99C+K176C, E150C+G193C and S39C+E72C, successfully formed VLPs (Supplementary Fig. 19a), with SDS-PAGE analysis indicating disulfide bond formation (Fig. 7b). With the exception of VLP^T43C+D136C, these filaments had longer median lengths (Fig. 7c) and melting temperatures elevated by 5–10 °C (Supplementary Fig. 19b, c) compared with the wild type VLPs. Moreover, VLP^L99C+K176C, VLP^E150C+G193C, and VLP^S39C+E72C filaments survived incubation at 60 °C for 10 min under oxidizing conditions but not under reducing conditions, whereas VLP and VLP^T43C+D136C disintegrated in both cases (Fig. 7d). VLP^L99C+K176C, VLP^E150C+G193C, and VLP^S39C+E72C filaments were structurally polymorphic (Supplementary Fig. 20), and exhibited similar architecture to wild type VLPs, except that the VLP^h+RNA form was essentially negligible. We could confirm the formation of disulfide bonds between adjacent rings only in the asymmetric reconstruction of their stacked ring forms (Fig. 7e). This suggests that not all adjacent Cys are paired, revealing the quasi-equivalence of subunits in the flexible filaments³⁴. However, the uneven distribution of disulfide bonds along the filament could also be, at least to some extent, the result of the extreme sensitivity of disulfide bonds to electron radiation damage³⁵. Nevertheless, such interlocking brought adjacent rings in VLP^L99C+K176C:r and VLP^E150C+G193C:r 3.2 Å and 2.0 Å closer, respectively, than in VLP^r (Fig. 7f).
This was not observed in VLP^S39C+E72C:r due to stapling of the CPs by structurally plastic N-IDRs (Fig. 7a). No interconnecting cryo-EM density was observed in the VLP^h filaments, likely due to helical averaging along the filament. Overall, VLPs can be further thermally stabilized by introducing disulfide bonds between selected residue positions at axial CP-CP interfaces.

**Fig. 7: Stabilization of VLPs by introducing disulfide bonds between CP units in filaments.**

Monomorphic RNA-encapsidating VLPs can be generated by a single amino acid substitution at the N-IDR/CP-core interface of adjacent CP units

Unlike the other CP double cysteine mutants, VLP^T43C+D136C had a more uniform distribution of filament lengths and was unstable at 60 °C (Fig. 7c, d). Cryo-EM revealed exclusively RNA-packing filaments with left-handed helical symmetry and an overall resolution of 2.4 Å (Fig. 8a; Supplementary Fig. 21; Table 1), which to our knowledge is the highest resolution for potyviral VLPs. The structure of CP^T43C+D136C and the thermal stability profile of the respective filaments strongly resembled those of PVY virus (Supplementary Fig. 22a, b; Supplementary Table 1).

**Fig. 8: Analysis of ssRNA packaged in VLPs formed by CP^T43C+D136C.**

The cryo-EM density for VLP^T43C+D136C was defined starting at residue V44 (Table 1), indicating the absence of the disulfide bond between C43 and C136 and thus the redundancy of one of the introduced cysteines. Indeed, negative staining TEM (nsTEM) of cell lysates revealed that VLP^D136C resembled VLP^T43C+D136C, whereas the purified VLP^T43C showed stacked-ring filaments with a length similar to that of wild type VLPs (Supplementary Fig. 22c). Further cryo-EM analysis of purified VLP^D136C confirmed monomorphic RNA-packing filaments (Supplementary Fig. 22d).

D136 is located in the β-hairpin of the CP core region. Together with E139 from the same β-hairpin and R46 from the N-IDR of the adjacent CP, it forms a triangle of conserved charged residues (Fig. 8b; Supplementary Fig. 7). Replacement of R46 or E139 by Ala resulted in the exclusive (VLP^R46A) or predominant (VLP^E139A) formation of RNA-encapsidating filaments (Fig. 8b; Supplementary Fig. 22e, f). In RNA-free VLPs composed of wild type CP, each CP subunit is linked to four adjacent subunits, with N-IDRs acting as clutches (Supplementary Fig. 2c). Disruption of these interactions by mutations in the R46/D136/E139 triangle favors the VLP^h+RNA type of assembly, in which the loss of interaction between the β-hairpin and the N-IDR is compensated for by the extensive interaction network between 13 CP subunits and CP-RNA interactions present in ‘h+RNA’ filaments. Therefore, it was not surprising that the CP-construct CP^ΔC60:T43C+D136C, which integrates both the inability to bind RNA and the weakened N-IDR binding, was not soluble (Supplementary Fig. 22g). These results demonstrate that we can produce monomorphic RNA-encapsidating VLPs with a narrow length distribution by simple modifications of the CP-CP interface at the N-IDR-core contact.

CP encapsidates ssRNA with limited specificity

Within the narrow length distribution range of VLP^T43C+D136C filaments, we detected four distinct maxima. The first was at 61 nm, which is close to the theoretical length of filaments (65 nm) encapsidating the 807 nt long CP^T43C+D136C coding sequence (CDS) (Fig. 8c). The others were at 134 nm, 199 nm, and 267 nm, approximately multiples of the first. This could be due to longitudinal fusion of the filaments, as is commonly observed in potyviruses such as PVY (Supplementary Fig. 23) or potato virus A³⁶. Previous studies already suggested that recombinant potyviral CP encapsidates its own mRNA^27,37. To verify this, we performed analysis of RNA extracted from VLP^T43C+D136C filaments. Using RNA-free VLP^ΔC60 as a control, we showed that 98% of RNA recovered from the purified VLP^T43C+D136C sample was the RNA extracted from the filaments (Supplementary Fig. 24a). Reverse transcription quantitative PCR (RT-qPCR) (Supplementary Fig. 24b) showed that CP^T43C+D136C mRNA was present at much higher levels than the idnT background gene, reported to be stably expressed in E. coli upon heterologous protein overexpression³⁸. To obtain a quantitative overview of all RNA transcripts encapsidated in VLPs, we employed nanopore direct RNA sequencing. This showed that ~70% of the RNA packaged in VLP^T43C+D136C belonged to CP^T43C+D136C mRNA and ~30% was assigned to bacterial RNAs (Fig. 8d; Supplementary Fig. 24c). Among all coding sequences (CDS), CP^T43C+D136C was strongly predominant (~75%), with roughly even coverage of the entire sequence (Supplementary Fig. 24d, e). Some bacterial genes, such as hns, were also detected to a significant extent (11.6%) (Fig. 8e; Supplementary Fig. 24d).
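The relation between the observed length maxima and the theoretical single-transcript filament can be checked numerically. The sketch below takes the four maxima, the 65 nm theoretical length and the 807 nt CDS length from the text, and estimates how many end-to-end transcript-length units each maximum corresponds to. Rounding to the nearest integer is an illustrative choice, not the authors' analysis.

```python
THEORETICAL_NM = 65.0   # theoretical length for one encapsidated CDS (from the text)
CDS_LENGTH_NT = 807     # CP^T43C+D136C coding sequence length (from the text)
OBSERVED_MAXIMA_NM = [61.0, 134.0, 199.0, 267.0]  # length maxima (from the text)

# Axial length per nucleotide implied by the two values above.
nm_per_nt = THEORETICAL_NM / CDS_LENGTH_NT

copies = []
for maximum in OBSERVED_MAXIMA_NM:
    ratio = maximum / THEORETICAL_NM
    n_units = round(ratio)  # nearest whole number of single-transcript filaments
    deviation = maximum - n_units * THEORETICAL_NM
    copies.append(n_units)
    print(f"{maximum:5.0f} nm: ratio {ratio:.2f}, "
          f"nearest {n_units} unit(s), deviation {deviation:+.0f} nm")

print(f"Implied axial length per nucleotide: {nm_per_nt:.4f} nm")
```

Each maximum falls within a few nanometres of a whole multiple of the theoretical length, which is the pattern expected if filaments carrying one transcript fuse longitudinally.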

These experiments suggest that the specificity of RNA encapsidation by CP is limited. Next, we investigated whether we could encapsidate the mRNA of interest into the filaments formed by CP^T43C+D136C. As a heterologous gene of interest, we chose the gene encoding p97, a human protein forming ~600 kDa hexamers³⁹ that differ in architecture from VLPs. We first attempted to encapsidate p97 mRNA by adding in vitro transcribed mRNA to the CP-MBP system described above. However, after the release of MBP, no RNA was encapsidated, likely due to secondary or tertiary structural elements in the RNA produced in vitro that prevent CP from self-assembling around it. To address this issue, we used a bacterial co-expression system so that the nascent CP mRNA and the p97 mRNA were produced in temporal and spatial proximity (Fig. 8f). In this system, two heterologous mRNAs are transcribed, one for CP^T43C+D136C and p97 (CP+p97) and the other for p97 only (p97). CP^T43C+D136C and p97 proteins were successfully produced (Fig. 8f), and the VLP purification protocol allowed successful separation of filaments from p97 hexamers (Supplementary Fig. 25a, b).

RNA transcripts were identified and quantified by nanopore direct RNA sequencing of RNA extracted either from cells (total cell RNA) or from purified VLPs (RNA from VLPs). This initially showed enrichment of CP, p97 and some bacterial CDS in VLPs compared with total cell RNA (Supplementary Fig. 25c). However, a detailed analysis of sequencing coverage along the CP and p97 CDS revealed important differences between the two samples. Namely, whereas reads mapping to CP CDS were very abundant in both samples (Fig. 8g), coverage of p97 was significantly lower in RNA from VLPs. We also noted a marked decrease in p97 coverage after position 1500 in the total cell RNA sample (Fig. 8g, vertical dotted line). A sharp decrease was also observed at this position in the RNA from VLPs, with virtually no coverage in the following region, indicating the absence of the full-length p97 sequence in purified VLPs (Fig. 8g; Supplementary Note 1).

Because of the uneven coverage along both transcripts (Fig. 8g), we performed CDS quantification with the coverage near the 3’ end as a proxy for the level of the full-length sequence (Methods). This confirmed a very low abundance of full-length p97 transcripts in VLPs (1.8%) despite relatively high levels of p97 mRNA (19.8%) in total cell RNA (Fig. 8h; Supplementary Fig. 25d). Thus, coupling of the synthesis of p97 mRNA with the production of CP did not result in efficient encapsidation of p97 mRNA in VLPs in bacteria. Interestingly, bacterial hns mRNA was highly represented in VLPs (12.2%), a level significantly higher than in the total cell RNA (3.7%). However, such enrichment in VLPs was not observed for CP or p97 (Supplementary Fig. 25d). Overall, the specificity of the recombinant PVY CP for the encapsidated RNA is not limited to CP mRNA.
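
The contrast between hns and p97 becomes clearer when the quoted percentages are expressed as a ratio of abundance in VLPs to abundance in total cell RNA. The following is a minimal sketch of that arithmetic. Using a plain ratio as the enrichment measure is an assumption made for illustration (the study itself works with adjusted TPM values), and the function name is hypothetical.

```python
def fold_enrichment(pct_in_vlps, pct_in_total_rna):
    """Ratio of a transcript's share in VLP RNA to its share in total cell RNA."""
    return pct_in_vlps / pct_in_total_rna

# Percentages quoted in the text for the co-expression experiment
hns = fold_enrichment(12.2, 3.7)    # bacterial hns mRNA
p97 = fold_enrichment(1.8, 19.8)    # full-length p97 mRNA

print(f"hns: {hns:.2f}-fold, p97: {p97:.2f}-fold")
```

A value above 1 indicates enrichment in VLPs and a value below 1 indicates depletion, so hns comes out roughly threefold enriched while full-length p97 is depleted by about an order of magnitude.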

Discussion

Powerful methodological approaches to high-resolution analysis have helped to discover that symmetric supramolecular assemblies of many viruses, storage or transport cages¹⁴, cytoskeleton^40,41,42, flagella^43,44, amyloid fibers⁴⁵ and others can exist in structurally polymorphic states, with each type of self-assembly usually associated with a specific biological function. Structural polymorphism can also be applied to many recombinant VLPs, protein cages, and (artificial) peptide assemblies, providing a large repertoire of molecular platforms for vaccines, drug delivery systems, nanoreactors, biomaterials or nanomachines^{14,19,21,46,47,48}. In particular, CPs from plant viruses represent a great resource for such nanoparticles, as they are biodegradable and usually nonpathogenic to mammals⁴⁹. Among them, the most studied are ssRNA viruses such as rod-shaped TMV, and icosahedral cowpea chlorotic mottle virus (CCMV), whose CPs represent highly tunable molecular platforms for the production of nucleoprotein assemblies with remarkable architectures and material properties^20,21,49.

In this study, we show how the intrinsic structural plasticity of CP from the flexuous filamentous virus PVY enables the formation of a wide assortment of highly-ordered nanoparticles, whose structural and chemical properties can be tailored by simple modifications. Unlike rigid rod-shaped TMV nanoparticles, which generally require an RNA template for stable formation²¹, most of the PVY CP types of nanoparticles shown here self-assemble without a template.

Our results can be summarized in seven points. First, recombinant PVY CP can simultaneously form three architecturally distinct types of VLPs (Fig. 1b, c). These filaments are mostly RNA-free, of either stacked-ring or helical architecture, with only a small fraction of the RNA-encapsidating filaments resembling the native virion. We have shown that the major source of structural plasticity and consequently polymorphism is provided by both IDRs and the conserved RNA binding loop S125-G130. The low proportion of RNA-encapsidated VLPs suggests that the efficiency of assembly of RNA-free filaments is higher than that of RNA-encapsidating ones under the given conditions. The RNA-free filaments with stacked-ring architecture are predominant and thus represent the most stable form of CP self-assembly. Interestingly, only a slight change in N-IDR conformation leads to the formation of another type of RNA-free filaments with left-helical symmetry. The three types of polymorphic VLPs formed by the wild type PVY CP could potentially mimic different CP assemblies of structurally labile helical virions during different phases of the viral life cycle, such as virion assembly or disassembly and viral cell-to-cell or long-distance transport^11,50; however, future in-depth studies of virus-associated structures in planta are required to confirm this. Second, we showed that most of the RNA packaged in recombinant VLPs was CP mRNA (Fig. 8c–e), which may reflect the large amount of CP mRNA produced by overexpression in bacteria. However, notable amounts of packaged RNAs in VLPs were of bacterial origin, with some CDSs even more enriched in VLPs than CP or the eukaryotic gene p97, compared with their levels in total cell RNA (Supplementary Fig. 25d). The capability of the potyviral CP to encapsidate heterologous viral RNA under certain conditions in vivo has been reported before^51,52.
While recombinant CP shows limited specificity, it is expected that in plants, in order to prevent wasting viral resources, the interplay between viral and/or host factors dictates the packaging of the viral ssRNA into stable virions^11,53. More detailed studies are needed to understand whether the limited specificity of recombinant CP is due to the specific nucleotide sequence, RNA length, proximity of freshly overexpressed CP to heterologous RNA molecules, or a combination of these factors.

Third, we show that RNA can be encapsidated in VLPs even in the absence of C-IDR. Potyviral C-IDR has been shown to be critical for viral replication and the regulated shift from translation to replication^6,54,55. Here we show that C-IDR does not play an essential structural role in filament formation or RNA encapsidation; however, it does affect the fine structural details of the filament architecture (Fig. 1e–i). Fourth, in addition to the ability of wild type PVY CP to simultaneously form filaments of different architectures, this protein, and thus its self-assembly, is also highly tunable. Simple modifications in CP lead to a lower degree of polymorphism and even to the formation of monomorphic filaments, or filaments with novel architectures (Figs. 1e–i, 7, and 8a, b). Structure-based design can be used to produce purely RNA-encapsidating VLPs with relatively narrow length-distribution or exclusively RNA-free filaments with broad length-distributions (Fig. 7c). In both cases, the lumen of the filament can either be filled with C-IDRs or hollow in their absence. Fifth, we show that we can achieve a striking change in quaternary structure, i.e. structural metamorphosis, by simple genetic modifications of CP. By deletions and/or single-site mutations, we can reduce or even prevent the filament formation and instead produce single or double octameric rings of CP as well as highly ordered cubic or spherical self-assemblies of these rings, which can be further modified to form cross-shaped assemblies (Figs. 3–5). Sixth, we show that the outer surfaces of CP-derived nanoparticles, especially double rings, cubes, or spheres, can be equipped with surface exposed affinity tags such as His₆-tag⁵⁶ or Spy-tag³⁰, thereby providing symmetric platforms for further functionalization (Figs. 2c and 5b, Supplementary Fig. 16).
Finally, we have developed a system in which CP-derivatives are fused with a heterologous protein attached to its C-terminus to obtain nanoparticles of enhanced purity, which are assembled under defined and controlled in vitro conditions (Fig. 6). Such fusion proteins with IDRs not engaged in self-assemblies could be used to study molecular interactions between individual CPs and other viral or plant host molecules, such as HCPro⁵⁷, Argonaute⁵⁸, or RNA⁵⁹.

In summary, the intrinsic structural plasticity of PVY CP allows a remarkable structural diversity of its supramolecular assemblies. The high-resolution data obtained in this study and the possibility of structure-based design of nanoparticles with novel architectures and tailored properties make PVY CP an excellent candidate for nanobiotechnological applications, such as vaccine and biosensor development, cargo storage and delivery, medical imaging, or energy and nanostructured materials^49,60. Bacteria represent a preferred expression system as they allow efficient and cost-effective production of nanoparticles. Although the detailed information on the structural diversity of PVY CP shown here is based on nanoparticles produced in bacteria, it may facilitate future studies on the role of PVY CP in its natural environment.

Methods

Molecular cloning of CP variants

The wild type PVY CP from a complementary DNA (cDNA) of PVY-NTN strain (GenBank accession no. KM396648), and its double deletion mutant without (CP^ΔN49C40) or with C-terminal His₆-tag, preceded by the TEV protease cleavage site (termed truncated CP, trCP), were previously cloned in vectors pT7-7 (CP) and pET28a (CP^ΔN49C40, trCP), respectively⁶. The C-IDR deletion constructs were cloned using a classical restriction enzyme-based approach and inserted into the pET28a vector. To obtain constructs with introduced mutations, site-directed mutagenesis was performed using the inverse PCR method^61,62 with one or two oligonucleotides (nucleotide sequences available upon request).

For CP-MBP constructs, the sequence encoding maltose-binding protein (MBP) with a C-terminal N-rich linker and “factor Xa” cleavage site was obtained from the pMAL-c2X vector backbone. This sequence was inserted between the TEV protease cleavage site and the His₆-tag at the C-terminus of the His₆-tagged CP construct, cloned previously⁶. Cloning of CP^ΔC40-MBP and trCP^K176C-MBP constructs was done via the Gibson cloning method^63,64 (NEB).

For co-expression experiment, CP^T43C+D136C and human p97 (kindly provided by Dr. Marta Popović, Ruđer Bošković Institute, Croatia) were cloned in the pRSFDuet-1 dual expression system vector (pRSFDuet1-Cdc45) using PCR and Gibson assembly^63,64. RNA-packing CP^T43C+D136C was cloned into the first multiple cloning site (MCS) and p97 into the second MCS, while the connecting region was identical to the commercial pRSFDuet-1 backbone. All sequences were verified by nucleotide sequencing (Eurofins Genomics or GENEWIZ).

Expression and purification of CP variants

E. coli BL21(DE3) cells, transformed with plasmids containing CP constructs, were grown to an OD₆₀₀ of 0.8–1.2 in 2× YT medium (16 g l⁻¹ tryptone, 10 g l⁻¹ yeast extract, 5 g l⁻¹ NaCl) supplemented with 5 mM MgCl₂ and 2 mM CaCl₂. Gene expression was induced with 0.1 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG) and the cells were grown overnight at 20 °C.

Non-His₆-tagged variants forming VLPs were purified as described previously⁶ with minor modifications, and all purification steps were done at 4 °C. In brief, the harvested cells were lysed by sonication on ice in phosphate-buffered saline (PBS) (1.8 mM KH₂PO₄, 10.1 mM Na₂HPO₄, 140 mM NaCl, 2.7 mM KCl, pH 7.4) and centrifuged at 20,000 × g for 40 min. The lysate was incubated for 30 min in the mixture of 4% PEG 8000 and 500 mM NaCl. Following centrifugation for 30 min at 14,000 × g, the pellet with VLPs was resuspended in PBS by gentle overnight shaking. Remaining solid material was removed by 30 min centrifugation at 35,000 × g. The soluble fraction with enriched VLPs was loaded on a 20–60% sucrose density gradient and ultracentrifuged at 117,000 × g for 6 h in a Beckman 50 Ti rotor. All fractions of the gradient were collected and analyzed with SDS-PAGE to identify fractions containing CP. Selected fractions were pooled, dialyzed for 24 h against PBS, concentrated using Amicon Ultra centrifugal filters with a 100-kDa molecular weight cut-off to the final concentration of 1–3 mg ml⁻¹ and supplied with glycerol up to 5% v/v (final concentration) before storage at −80 °C.

To achieve higher purity of the VLP samples used for RNA extraction, an additional purification step of ammonium sulfate precipitation^65,66 was implemented before the standard VLP purification procedure described above. After cell lysis and centrifugation, ammonium sulfate was added to the soluble fraction to 15% (w/v) concentration. Following stirring for 30 min, the precipitated proteins were pelleted with 15 min centrifugation at 13,400 × g. The process was repeated in a stepwise manner with 5% (w/v) increase in ammonium sulfate concentration up to a final 30% (w/v). The pellets obtained at different concentrations of ammonium sulfate were resuspended in PBS and subjected to SDS-PAGE analysis. Fractions enriched in either p97 or CP were buffer-exchanged using PD-10 desalting columns and dialysis tubing (12–14 kDa cut-off), with the CP-enriched fraction further purified as described above for filamentous VLPs. All steps of the purification procedure were done at 4 °C. Final samples were concentrated to 1–2 mg ml⁻¹ and stored at −80 °C.

His₆-tagged proteins (trCP, CP-MBP) were isolated from cells by sonication on ice in PBS with 10 mM imidazole, followed by 40-min centrifugation at 50,000 × g and Ni-NTA chromatography. The non-specifically bound proteins were washed from the column and the His₆-tagged proteins eluted with PBS containing 300 mM imidazole. The eluted fractions were dialyzed against PBS overnight, concentrated using Amicon Ultra (30-kDa or 100-kDa cut-off), and loaded on the size exclusion column Superdex 200 10/300 GL (24 ml) or Superdex 200 16/60 PG (120 ml) (GE Healthcare) with PBS as the running buffer. Fractions containing the desired trCP variant, as judged by SDS-PAGE, were pooled and concentrated using Amicon Ultra centrifugal filters or Pierce™ Protein Concentrators, both with 100-kDa molecular weight cut-off, to the concentration of 1–7 mg ml⁻¹ for various assemblies. For CP-MBP, the size exclusion chromatography (SEC) step was performed on HiLoad Superdex 200 16/600 at conditions identical to those used for separation of the column manufacturer’s size standards (GE Healthcare). In the case of trCP, trCP^K176E and trCP^K176C, His₆-tags were removed using the TEV protease in 1:10–1:20 (TEV:trCP) molar ratio overnight at 20 °C, followed by the second Ni-NTA chromatography or SEC at room temperature. Fractions containing the proteins with cleaved tags were concentrated to 4 mg ml⁻¹ and stored at −80 °C. Sample purity and protein folding were checked with SDS-PAGE and circular dichroism spectroscopy, respectively.

In vitro self-assembly of VLP filaments and cubic particles

In all types of CP*-MBP fusions, CP* self-assembly was initiated with the addition of TEV protease to the purified CP*-MBP in a molar ratio of 1:10–1:20 (TEV:CP*-MBP) and left overnight at 4 °C. For CP-MBP, the sample after TEV protease cleavage was loaded onto a Ni-NTA column to separate the cleaved His₆-tagged MBP and the non-cleaved CP-MBP fusion from the freshly self-assembled VLPs. For CP^ΔC40-MBP, assembled VLPs were purified using the standard VLP isolation procedure described above. For trCP^K176C-MBP, additional purification was done by SEC using a Superdex 200 16/600 (120 ml) column. The purified samples were concentrated with Amicon Ultra centrifugal filters (100-kDa cut-off), and the presence of filaments before and after TEV protease cleavage was monitored by negative staining transmission electron microscopy (nsTEM) for CP-MBP or cryo-electron microscopy (cryo-EM) for CP^ΔC40-MBP and trCP^K176C-MBP.

Thermal stability assay

The thermal stability of the proteins was determined by differential scanning fluorimetry (DSF) at a protein concentration of approximately 0.1 mg ml⁻¹ in the presence of 2× SYPRO Orange (Thermo Fisher Scientific)⁶⁷. Samples were subjected to temperatures from 25 °C to 95 °C at a gradient of 1 °C min⁻¹. Temperature melting profiles were acquired with the LightCycler 480 system (Roche). Samples were measured in triplicate in two independent measurements. Melting temperatures T_m were determined as minimum values from the first derivative of the measured data curves in OriginPro2023 (OriginLab). All results are expressed as means ± standard deviation (SD) with their comparison performed by one-way ANOVA (analysis of variance) followed by Tukey’s multiple comparison test. A value of p < 0.001 was considered statistically significant. All source data with detailed statistical analysis are provided in the Supplementary Data file.
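
How a melting temperature is read off a derivative curve can be illustrated with a short sketch. It assumes the derivative is plotted as −dF/dT, a common instrument convention in which the steepest fluorescence rise appears as a minimum. The text above states only that minima of the first derivative were used, so the sign convention, the function name and the synthetic curve are all assumptions for illustration and do not reproduce the OriginPro analysis.

```python
import math

def melting_temperature(temps, fluorescence):
    """Return the temperature at which -dF/dT is lowest (steepest fluorescence rise)."""
    best_temp, best_value = None, None
    for i in range(1, len(temps) - 1):
        # Central-difference estimate of dF/dT at temps[i]
        slope = (fluorescence[i + 1] - fluorescence[i - 1]) / (temps[i + 1] - temps[i - 1])
        if best_value is None or -slope < best_value:
            best_temp, best_value = temps[i], -slope
    return best_temp

# Synthetic two-state unfolding curve (NOT measured data), midpoint set to 60 °C
temps = list(range(25, 96))   # 25-95 °C in 1 °C steps, as in the assay
curve = [1.0 / (1.0 + math.exp(-(t - 60) / 2.0)) for t in temps]
print(melting_temperature(temps, curve))
```

Real DSF traces are noisier than this synthetic sigmoid, so some smoothing before differentiation would normally be needed.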

Native-PAGE

Characterization of the protein assemblies in native conditions was performed on 4–16% Native-PAGE Bis-Tris gels (Thermo Fisher Scientific). Samples were mixed with 4× Native PAGE sample buffer (Thermo Fisher Scientific) and run in the Dark Blue Cathode Buffer (Thermo Fisher Scientific) for 60 min at 150 V and for 40–60 min at 250 V according to the manufacturer’s instructions. The gels were fixed with 40% methanol (v/v) and 10% acetic acid (v/v) and destained with 8% acetic acid (v/v).

Negative staining transmission electron microscopy (nsTEM)

For visualization, the final concentration of CP construct was approximately 1.5–3 µM. Copper mesh grids (SPI Supplies) were Formvar-coated, stabilized with carbon and glow-discharged (EM ACE200, Leica Microsystems). The VLP sample (5–20 μl) was applied to a grid, left to soak for 5 min, blotted, washed and contrasted with 1% (w/v) uranyl acetate (aqueous solution). Grids were imaged at 80 kV by CM 100 transmission electron microscope (Philips), equipped with Orius SC 200 camera (Gatan) and Digital Micrograph software 2.1.1 or by TALOS L120 (Thermo Fisher Scientific), operating at 100 kV, equipped with camera Ceta 16 M and Velox v3.0 (Thermo Fisher Scientific).

Filament length distribution analysis

Filament lengths were measured from nsTEM micrographs using the Fiji (ImageJ 1.53c) software suite⁶⁸ after manually tracing multiple points along at least 200 flexuous filaments. The violin plots of filament length distribution were produced using OriginPro2023 (OriginLab), with median values and the range between the 25th and 75th percentiles designated on the plots with a white circle and a black rectangular box, respectively. Values ‘n’ above each violin correspond to the number of measured filaments. Histograms of filament length distribution were analyzed using a Gaussian distribution fit in Origin2018 (OriginLab) and plotted in MATLAB R2021b (MathWorks) with values above the peaks provided as mean ± SD. All filament length measurements are provided in the Supplementary Data file.
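
Tracing a flexuous filament through several points amounts to measuring the length of a polyline. A minimal sketch of that calculation follows. It is not the Fiji implementation, and the function name, the example coordinates and the pixel calibration are hypothetical.

```python
import math

def traced_length(points, nm_per_pixel=1.0):
    """Sum the straight-line distances between consecutive traced points."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        total += math.hypot(x1 - x0, y1 - y0)
    return total * nm_per_pixel

# Hypothetical trace of one bent filament, coordinates in pixels
trace = [(0, 0), (3, 4), (3, 10)]
print(traced_length(trace, nm_per_pixel=0.5))
```

Adding more points along a curved filament brings the polyline length closer to the true contour length, which is why several points per filament are traced.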

Extraction of the total cell or VLP-encapsidated RNA

Extraction of total RNA from cells after induced overnight expression was done using the RNeasy Kit with optimized protocol for extraction from E. coli adapted from RNAprotect® Bacteria Reagent Handbook (Qiagen)⁶⁹. Specifically, cell lysis was performed enzymatically using lysozyme from chicken egg white (Sigma) and proteinase K (NEB), followed by standard RNeasy protocol with on-column DNase I treatment (Roche) and final elution in RNase-free water.

RNA extraction from the purified VLP samples was performed based on the previously published protocol of extraction from Potyvirus particles⁷⁰. In brief, the sample was incubated in the presence of 1% SDS (w/v) at 55 °C for 5 min, followed by phenol-chloroform extraction. The extracted RNA was then precipitated by the addition of 0.5 initial sample volume of 7.5 M ammonium acetate and 2.5 volumes of cold absolute ethanol at −20 °C for 1 h, followed by 25 min centrifugation at 12,000 × g. After washing the pellet with 70% ethanol (v/v) and air-drying for 15–30 min at room temperature, the precipitated RNA was resuspended in DEPC-treated water and treated with Turbo DNase rigorous protocol (Thermo Fisher) to remove any potential DNA contaminants, followed by isolation with RNA Clean & Concentrator-5 kit (Zymo Research) and storage at −80 °C.

RNA quantification with reverse transcription quantitative polymerase chain reaction

RNA was reverse transcribed using random hexamer oligos (IDT) and SuperScript IV reverse transcriptase (Thermo Fisher) following the manufacturer’s protocol. After RT, cDNA was diluted 10× and used in the PCR. The final qPCR reaction was performed using Fast SYBR Green master mix (Thermo Fisher) in 6 μM primer mix for either CP^T43C+D136C or idnT as a control gene, found to be stably expressed in E. coli upon induction of protein overexpression³⁸. The reaction was measured using the LightCycler 480 system (Roche) with the following conditions: 92 °C for 3 min followed by 40 cycles of 3 s at 92 °C and 30 s at 60 °C. The measurements were made in two biological replicates, each with three technical replicates. C_t values were obtained using automatic threshold detection by the software (Roche), and the mean C_t value and standard deviation were calculated in Origin2018 (OriginLab).
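
The C_t summary described above can be sketched in a few lines. The mean and standard deviation follow the text. The conversion of a C_t difference into a fold difference is a common approximation that assumes perfect doubling per cycle and equal primer efficiency, and it is not stated in the Methods. The C_t values and function names below are hypothetical placeholders.

```python
from statistics import mean, stdev

def summarize_ct(ct_values):
    """Mean and sample standard deviation of replicate C_t values."""
    return mean(ct_values), stdev(ct_values)

def relative_level(ct_target, ct_reference, efficiency=2.0):
    """Fold difference of target over reference, ASSUMING equal amplification efficiency."""
    return efficiency ** (ct_reference - ct_target)

# Hypothetical replicate C_t values (NOT data from the study)
cp_mean, cp_sd = summarize_ct([14.8, 15.0, 15.2])
idnt_mean, idnt_sd = summarize_ct([24.9, 25.0, 25.1])
print(cp_mean, cp_sd, idnt_mean, idnt_sd, relative_level(cp_mean, idnt_mean))
```

A lower C_t means the transcript crossed the detection threshold earlier, so a target with a C_t ten cycles below the reference would be about a thousandfold more abundant under these assumptions.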

Polyadenylation, direct RNA sequencing and bioinformatic analysis

E. coli poly(A) polymerase (NEB) was used for the poly(A)-tailing reaction. Purified RNA was polyadenylated for 1 min at 37 °C in the following reaction mix: 10 µl total RNA, with 2 µl 10× E. coli poly(A) polymerase buffer, 2 µl ATP, 5 µl nuclease-free water and 1 µl E. coli poly(A) polymerase (NEB). The reaction was stopped by the addition of 5 µl of 50 mM EDTA. Polyadenylated RNA was cleaned using 2.5× sample volume of AMPure XP beads (Beckman Coulter) and eluted in 10 µl nuclease-free water.

Direct RNA sequencing was performed using the Direct RNA Sequencing protocol (SQK-RNA002) for MinION adapted to sequencing using a Flongle flow cell (Oxford Nanopore Technologies). For adapter ligation, 1 µl of T4 DNA ligase (Thermo Fisher) was used and SuperScript IV (Invitrogen) was used for reverse transcription. For RNA adapter ligation 4 µl NEBNext Quick Ligation Reaction buffer (NEB), 2 µl RNA Adapter (RMX), 1.5 µl T4 DNA ligase were added and the total reaction volume was brought to 20 µl. Finally, RNA was cleaned using 1× sample volume of AMPure XP beads and eluted in 9 µl Elution Buffer (EB). The eluate was then loaded on a Flongle R9.4.1 flow cell.

Raw read files were base-called using guppy version 6.0.0 in high-accuracy mode (rna_r9.4.1_70bps_hac.cfg) with filtering set to a minimum Q-score of 7. Base-called reads were mapped either to E. coli genomic coding sequences (CDS) or to its genome. In the first case, base-called reads were mapped to E. coli BL21(DE3) genomic coding sequences (genome NCBI Reference Sequence NZ_CP081489.1) to which custom coding sequences of CP and p97 from the expression vector pRSF-Duet1 were added manually. Mapping was performed using minimap2⁷¹ with “-ax map-ont -k14” parameters. Mapped reads were filtered using samtools v1.6⁷², and only reads with a MAPQ score of 60 were retained. Reads mapped to E. coli CDS were counted using NanoCount⁷³ with default parameters, where filtering for reads that map within 50 nt of the 3’-end of the reference was enabled (3’ filtering). Estimated count values per coding sequence were used to calculate adjusted transcripts per million (TPM) values for only those transcripts that were present in both biological replicates. Values for transcripts not present in both biological replicates were discarded. A two-sided Mann-Whitney-Wilcoxon test was performed over adjusted TPM values between both replicates of each measured RNA sample (CP^T43C+D136C expression: p. val. = 3.9 × 10⁻⁷ comparing ‘RNA from VLPs’ replicates; CP^T43C+D136C-p97 co-expression: p. val. = 0.027 comparing ‘total cell RNA’ replicates, p. val. = 0.204 comparing ‘RNA from VLPs’ replicates). Adjusted TPM values were averaged between biological replicates and used further in downstream analyses. Per base coverage was computed using bedtools v2.30.0. Per base coverage was further normalized by dividing by the sum of coverage of all bases mapped and multiplying by a million bases. Coverage plots were plotted using the seaborn Python library. Smoothed per base coverage means were calculated as the mean value of 40 consecutive bases.
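
The coverage normalization and smoothing described at the end of this paragraph can be sketched as follows. This is an illustration and not the authors' script: the function names are hypothetical, and averaging over non-overlapping 40-base bins is an assumption, since the text could equally describe a sliding window.

```python
def normalize_coverage(coverage, total_mapped_bases):
    """Scale per-base coverage to coverage per million mapped bases."""
    return [c / total_mapped_bases * 1_000_000 for c in coverage]

def smooth(values, window=40):
    """Mean of consecutive, non-overlapping bins (ASSUMED binning; last bin may be shorter)."""
    means = []
    for start in range(0, len(values), window):
        chunk = values[start:start + window]
        means.append(sum(chunk) / len(chunk))
    return means

# Toy example with made-up depths along a short reference
depth = [10, 12, 14, 16, 8, 6]
normalized = normalize_coverage(depth, total_mapped_bases=sum(depth))
print(smooth(normalized, window=2))
```

Normalizing by the total number of mapped bases makes coverage profiles comparable between samples that were sequenced to different depths.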

Mapping the reads to the E. coli BL21(DE3) genome (NCBI Reference NZ_CP081489.1) to which CP and p97 coding sequences were added as additional chromosomes, was performed using minimap2 with “-ax map-ont -k14” parameters. Mapped reads were filtered using samtools v1.6 for the MAPQ score of 60. The amount of rRNA reads in each of the samples was calculated by intersecting the filtered mapped sequences with a genomic GTF file using bedtools intersect. Relative amounts of different RNA species as bp % values were calculated by adding gene-specific base pairs (bps) and comparing them to the sum of all mapped bps.
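
The bp % calculation in the last sentence reduces to dividing the bases assigned to each RNA species by the total number of mapped bases. A minimal sketch is given below, with a hypothetical function name and made-up numbers rather than values from the sequencing runs.

```python
def bp_percentages(mapped_bp_by_species):
    """Convert mapped base pairs per RNA species into percentages of all mapped bps."""
    total = sum(mapped_bp_by_species.values())
    return {name: 100.0 * bp / total for name, bp in mapped_bp_by_species.items()}

# Made-up counts of mapped base pairs (NOT data from the study)
example = {"CP mRNA": 700, "rRNA": 100, "other bacterial RNA": 200}
print(bp_percentages(example))
```

Counting bases instead of reads gives longer transcripts proportionally more weight, which matters for nanopore data where read lengths vary widely.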

Cryo-EM grid preparation and data acquisition

For grid preparation, 3 μl of the sample with a concentration of around 1 mg ml⁻¹ for the filamentous particles or 3–4 mg ml⁻¹ for non-filamentous assemblies was applied to glow-discharged Quantifoil 200-mesh R2/2 holey carbon grids (Quantifoil) followed by vitrification in Vitrobot Mark IV (Thermo Fisher Scientific). With the exception of VLP^ΔC40 and trCP^K176C, the samples were imaged on a Glacios transmission electron microscope operated at 200 kV and equipped with a Falcon 3 direct electron detector (Thermo Fisher Scientific). Data sets were acquired at a nominal magnification of 150,000 corresponding to a calibrated pixel size of 0.950 Å and defocus range between −0.8 and −2.1 μm with a total dose of around 40 e⁻ Å⁻².

For VLP^ΔC40 and trCP^K176C, cryo-EM data were collected on a Titan Krios transmission electron microscope (Thermo Fisher Scientific) operated at 300 kV at CEITEC, Brno, Czech Republic. The VLP^ΔC40 data set was acquired in linear mode with a Falcon2 (Thermo Fisher Scientific) direct electron detector at a nominal magnification of 75,000 corresponding to a calibrated pixel size of 1.063 Å and defocus range between −1.3 and −0.4 µm, with 40 frames collected within 1.02 s exposure giving a total dose of 84 e⁻ Å⁻². The trCP^K176C data set was acquired on a K2 Summit direct electron detector (Gatan) operating in counting mode at a nominal magnification of 165,000 corresponding to a pixel size of 0.822 Å and defocus range between −0.3 and −3.6 μm. 32-frame movies were collected during 4 s exposure time with a total dose of 32 e⁻ Å⁻².

Cryo-EM image processing

The detailed workflow for each dataset-specific reconstruction is presented in Supplementary Figs. 1, 3, 5, 8, 9, 11–13, 20 and 21. In general, more than 500 movies were collected for each sample and used for cryo-EM data processing, performed in cryoSPARC v3.3 or 4.1^74,75,76 except for VLP^ΔC40 and VLP^T43C+D136C, where RELION-3.1⁷⁷ was used.

For disulfide bond-stapled VLPs, cryo-EM reconstructions (Supplementary Fig. 20) were performed using C1 symmetry with additional selection of subclasses based on observed extensive conformational variability using 3D Variability analysis⁷⁵ and 3D classification. Final sharpened non-symmetric cryo-EM maps were checked for connecting density.

The resolutions of the final cryo-EM maps, in some cases locally sharpened with DeepEMhancer v0.13⁷⁸, were determined based on the gold-standard FSC criterion of 0.143⁷⁹. Local resolutions were calculated using BlockRes⁸⁰, and cryo-EM densities were visualized in UCSF Chimera 1.16⁸¹ and ChimeraX 1.5. Details, EMPIAR and EMDB codes are provided in Table 1 and Supplementary Tables 1 and 2.
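
For readers unfamiliar with the FSC criterion, the sketch below shows how a resolution value is read from a Fourier shell correlation curve at the 0.143 threshold. It uses simple linear interpolation between shells and is not the implementation used by cryoSPARC or RELION. The function name and the example curve are hypothetical.

```python
def resolution_at_threshold(spatial_freq, fsc, threshold=0.143):
    """Resolution (1/frequency) where the FSC curve first drops below the threshold."""
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            # Linear interpolation between the two shells that bracket the threshold
            frac = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            freq = spatial_freq[i - 1] + frac * (spatial_freq[i] - spatial_freq[i - 1])
            return 1.0 / freq
    return None  # curve never crosses the threshold

# Made-up FSC curve; spatial frequencies in 1/Å
freqs = [0.1, 0.2, 0.3, 0.4]
fsc_values = [1.0, 0.9, 0.243, 0.043]
print(resolution_at_threshold(freqs, fsc_values))
```

The returned value is in Å when the spatial frequencies are given in Å⁻¹.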

Model building

PDB ID codes of initial models used for model building are provided in Table 1. In each case, the initial model was fitted into the reconstructed cryo-EM map using UCSF Chimera 1.16 with one central CP unit and all the neighbors in direct contact subjected to several iterative cycles of manual refinement using WinCoot 0.9.8.1⁸² and real-space refinement with secondary structure and geometry restraints in Phenix 1.20.1 package⁸³. For helical filaments with RNA, the segment of 5 uracils from the PVY virion (PDB ID: 6HXX) was fitted into the empty density of one CP and subjected to the same iterative cycles of refinement as for the protein components. Molprobity⁸⁴ was used for validation of individual models after each cycle.

For trCP^K176C, atomic models were built in the cryo-EM map after local refinement (EMD-17063) with two distinct protomer structures, one in C2 symmetric contact and the adjacent one. Model from the locally-refined cryo-EM map (PDB ID 8OPK) was rigid body-docked into the globally-refined cryo-EM map (EMD-17062) to obtain the atomic model of the entire cubic particle (PDB ID 8OPJ). Final 3D models were visualized in UCSF Chimera 1.16 and ChimeraX 1.5⁸⁵. The surface electrostatic potential of the trCP was calculated by APBS 3.4.1⁸⁶. Detailed statistics of model building and refinement are presented in Table 1.

Mass spectrometry and photometry

To denature the assembly and determine an accurate monomer mass, constructs trCP^K176C, trCP^K176C-noHis and trCP^K176S in PBS were diluted into a solution of 50% acetonitrile and 2% formic acid, to a final concentration of 5 µM of the cubic 48mer. For the native spectrum of trCP^K176C, the sample was buffer-exchanged into 200 mM ammonium acetate (pH 6.9) using Bio-Spin 6 columns (Bio Rad) and sprayed at the same concentration. Nanoelectrospray mass spectrometry data were acquired using a QExactive UHMR mass spectrometer (ThermoFisher) using gold-plated 1.2 mm OD capillaries prepared in-house, as previously described⁸⁷. Resultant spectra were deconvolved and analyzed using UniDec⁸⁸.

To acquire mass photometry data⁸⁹, the constructs were diluted with PBS to 50 nM and measured on a Refeyn Two^MP mass photometer and analyzed using DiscoverMP v2.5.0 (Refeyn Ltd).

Molecular dynamics simulations (MD)

We took the atomic model of one ring from CP^ΔC40:r and truncated it on the N-terminus (ΔN49) to simulate the trCP^noHis. For trCP^K176C-noHis starting model, an additional K176C mutation was introduced. We constructed two coarse-grained (CG) systems by arranging eight randomly oriented rings (either trCP^noHis or trCP^K176C-noHis) with a minimum initial pairwise spacing of 16 nm between rings (i.e., 1.5 times the ring diameter). Rings were immersed in a 50 × 50 × 50 nm³ cubic box of water in which neutralizing counterions were eventually added. Each CG model of the ring was generated from the corresponding atomic model by using Martinize2 protocol⁹⁰. An elastic network⁹¹ was applied to maintain the overall internal structure of an individual ring. All CG-MD simulations were performed with GROMACS 2019.6⁹² and Martini 3.0 force field⁹³. The systems were energy minimized with the steepest descent algorithm (50.000 steps), followed by a brief NPT (keeping the number, pressure and temperature constant) equilibration cycle to relax the initial configurations (200.000 steps of 5 fs). Afterward, the systems were simulated for 10 μs in an NPT ensemble with periodic boundary conditions (500.000.000 steps of 20 fs). The temperature was maintained at 300 K and pressure at 1 bar by coupling the dynamics using V-rescale thermostat⁹⁴ and Berendsen barostat⁹⁵. The cut-off value for the Coulomb and van der Waals interactions was set to 1.1 nm, and a relative dielectric constant was set to 15. The rings freely diffuse in the solution until they hit each other and occasionally form a contact. The formation of ring clusters was analyzed by first identifying all ring-triplets sharing all three pairwise contacts to each other for each given instant of time (trajectory frame). 
As an order parameter revealing the shape of the clusters, we introduced the absolute value of the scalar triple product, p = |(n₁, n₂, n₃)| = |n₁ · (n₂ × n₃)|, where n_i is the unit vector along the normal of the i-th ring. The value p = 0 corresponds to planar arrangements of ring triplets, while p = 1 corresponds to mutually orthogonal rings forming the corner of a cube. The coordinates of the initial (equilibrated) and final (after 10 μs) structures are available upon request in GROMACS format.
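The cluster analysis described above can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function names, the representation of contacts as a set of index pairs, and the toy normals are assumptions made for illustration, and the criterion used to decide that two rings are in contact is not specified in the text and is left to the caller.

```python
from itertools import combinations
from math import sqrt


def unit(v):
    """Scale a 3-vector to unit length."""
    norm = sqrt(sum(c * c for c in v))
    return tuple(c / norm for c in v)


def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def order_parameter(n1, n2, n3):
    """p = |n1 . (n2 x n3)| for three ring normals (normalized here)."""
    n1, n2, n3 = unit(n1), unit(n2), unit(n3)
    return abs(dot(n1, cross(n2, n3)))


def fully_connected_triplets(n_rings, contacts):
    """Ring-index triplets in which all three pairs are in contact.

    `contacts` is a set of frozenset({i, j}) pairs for one trajectory frame.
    The contact definition itself is an assumption supplied by the caller.
    """
    return [t for t in combinations(range(n_rings), 3)
            if all(frozenset(pair) in contacts
                   for pair in combinations(t, 2))]


# Toy frame (hypothetical data): rings 0, 1, 2 touch pairwise and have
# mutually orthogonal normals; ring 3 touches ring 2 only.
normals = {0: (1, 0, 0), 1: (0, 1, 0), 2: (0, 0, 1), 3: (0, 0, 1)}
contacts = {frozenset(p) for p in [(0, 1), (0, 2), (1, 2), (2, 3)]}

triplets = fully_connected_triplets(4, contacts)
p_values = {t: order_parameter(*(normals[i] for i in t)) for t in triplets}
print(p_values)
```

Because of the absolute value, p does not change when any ring normal is flipped, so the arbitrary choice of which face of a ring its normal points away from does not affect the result.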

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Cryo-EM maps and atomic models have been deposited in the Electron Microscopy Data Bank (EMDB) and wwPDB, respectively, with EMDB/PDB accession codes: EMD-17046/8OPA, EMD-17047/8OPB, EMD-17048/8OPC, EMD-17049/8OPD, EMD-17050/8OPE, EMD-17051/8OPF, EMD-17052/8OPG, EMD-17053/8OPH, EMD-17054, EMD-17055, EMD-17056, EMD-17057, EMD-17058, EMD-17059, EMD-17060, EMD-17061, EMD-17062/8OPJ, EMD-17063/8OPK, EMD-17064, EMD-17065, EMD-17066, EMD-17067, EMD-17068, EMD-17069, EMD-17070, EMD-17071, EMD-17072/8OPL, EMD-17073, EMD-17074 and EMD-17075, with corresponding structures and atomic models provided in Table 1 and Supplementary Tables 1 and 2. Raw cryo-EM datasets have been deposited in the Electron Microscopy Public Image Archive (EMPIAR) with accession codes EMPIAR-11545 (EMD-17046/8OPA, EMD-17047/8OPB, EMD-17048/8OPC), EMPIAR-11546 (EMD-17049/8OPD, EMD-17050/8OPE), EMPIAR-11547 (EMD-17052/8OPG), EMPIAR-11548 (EMD-17053/8OPH, EMD-17054, EMD-17055), EMPIAR-11549 (EMD-17062/8OPJ, EMD-17063/8OPK) and EMPIAR-11550 (EMD-17072/8OPL). RNA nanopore sequencing data have been deposited in the European Nucleotide Archive (ENA) with accession code PRJEB61146. All data are available in the main text, figures and supplementary information. Source data are provided with this paper as a Supplementary Data file. Additional data related to this paper may be requested from the authors.

References

Stubbs, G. & Kendall, A. Helical viruses. Adv. Exp. Med. Biol. 726, 631–658 (2012).
Carstens, E. B. Introduction to virus taxonomy. In Virus Taxonomy: Ninth Report of the International Committee on Taxonomy of Viruses (ed. King, A. M. Q., Adams, M. J., Carstens, E. B. & Lefkowitz, E.) 3–20 (Elsevier Inc, 2012).
Inoue-Nagata, A. K. et al. ICTV virus taxonomy profile: potyviridae 2022. J. Gen. Virol. 103, 001738 (2022).
Yang, X., Li, Y. & Wang, A. Research advances in potyviruses: from the laboratory bench to the field. Ann. Rev. Phytopathol. 59, 1–29 (2021).
Zamora, M. et al. Potyvirus virion structure shows conserved protein fold and RNA binding site in ssRNA viruses. Sci. Adv. 3, eaao2182 (2017).
Kežar, A. et al. Structural basis for the multitasking nature of the potato virus Y coat protein. Sci. Adv. 5, eaaw3808 (2019).
Cuesta, R. et al. Structure of Turnip mosaic virus and its viral-like particles. Sci. Rep. 9, 1–6 (2019).
Charon, J., Theil, S., Nicaise, V. & Michon, T. Protein intrinsic disorder within the Potyvirus genus: from proteome-wide analysis to functional annotation. Mol. Biosyst. 12, 634–652 (2016).
Ge, P. & Zhou, Z. H. Hydrogen-bonding networks and RNA bases revealed by cryo electron microscopy suggest a triggering mechanism for calcium switches. Proc. Natl Acad. Sci. 108, 9637–9642 (2011).
Clare, D. K. et al. Novel inter-subunit contacts in barley stripe mosaic virus revealed by cryo-electron microscopy. Structure 23, 1815–1826 (2015).
Martínez-Turiño, S. & García, J. A. Potyviral coat protein and genomic RNA: a striking partnership leading virion assembly and more. Adv. Virus Res. 108, 165–211 (2020).
Li, S., Hill, C. P., Sundquist, W. I. & Finch, J. T. Image reconstructions of helical assemblies of the HIV-1 CA protein. Nature 407, 409–413 (2000).
Obr, M. & Schur, F. K. M. Chapter Five - Structural analysis of pleomorphic and asymmetric viruses using cryo-electron tomography and subtomogram averaging. In Complementary Strategies to Understand Virus Structure and Function (ed. Rey, F. A.) 117–159 (Academic Press, 2019).
Lie, F., Szyszka, T. N. & Lau, Y. H. Structural polymorphism in protein cages and virus-like particles. J. Mater. Chem. B 11, 6516–6526 (2023).
Gonnin, L. et al. Structural landscape of the respiratory syncytial virus nucleocapsids. Nat. Commun. 14, 5732 (2023).
Wang, F. et al. Spindle-shaped archaeal viruses evolved from rod-shaped ancestors to package a larger genome. Cell 185, 1297–1307 (2022).
Ganser-Pornillos, B. K., Yeager, M. & Sundquist, W. I. The structural biology of HIV assembly. Curr. Opin. Struct. Biol. 18, 203–217 (2008).
Heymann, J. B. et al. Dynamics of herpes simplex virus capsid maturation visualized by time-lapse cryo-electron microscopy. Nat. Struct. Mol. Biol. 10, 334–341 (2003).
Wang, F., Gnewou, O., Solemanifar, A., Conticello, V. P. & Egelman, E. H. Cryo-EM of helical polymers. Chem. Rev. 122, 14055–14065 (2022).
Seitz, I. et al. DNA-origami-directed virus capsid polymorphism. Nat. Nanotechnol. 18, 1205–1212 (2023).
Wege, C. & Koch, C. From stars to stripes: RNA-directed shaping of plant viral protein templates—structural synthetic virology for smart biohybrid nanostructures. Wiley Interdiscip. Rev. Nanomed. Nanobiotechnol. 12, e1591 (2020).
Grinzato, A. et al. Atomic structure of potato virus X, the prototype of the Alphaflexiviridae family. Nat. Chem. Biol. 16, 564–569 (2020).
Thuenemann, E. C. et al. A replicating viral vector greatly enhances accumulation of helical virus-like particles in plants. Viruses 13, 885 (2021).
Yang, S. et al. Crystal structure of the coat protein of the flexible filamentous papaya mosaic virus. J. Mol. Biol. 422, 263–273 (2012).
DiMaio, F. et al. The molecular basis for flexibility in the flexible filamentous plant viruses. Nat. Struct. Mol. Biol. 22, 642–644 (2015).
Donchenko, E. K. et al. Structure and properties of virions and virus-like particles derived from the coat protein of Alternanthera mosaic virus. PLoS One 12, e0183824 (2017).
Chase, O. et al. CryoEM and stability analysis of virus-like particles of potyvirus and ipomovirus infecting a common host. Commun. Biol. 6, 1–14 (2023).
Zeltins, A. Construction and characterization of virus-like particles: a review. Mol. Biotechnol. 53, 92–107 (2013).
Yuste-Calvo, C., Ibort, P., Sánchez, F. & Ponz, F. Turnip mosaic virus coat protein deletion mutants allow defining dispensable protein domains for ‘in Planta’ eVLP formation. Viruses 12, 661 (2020).
Zakeri, B. et al. Peptide tag forming a rapid covalent bond to a protein, through engineering a bacterial adhesin. Proc. Natl Acad. Sci. 109, e690–e697 (2012).
Miranda, F. F. et al. A self-assembled protein nanotube with high aspect ratio. Small 5, 2077–2084 (2009).
Pieri, L. et al. Atomic structure of Lanreotide nanotubes revealed by cryo-EM. Proc. Natl Acad. Sci. 119, e2120346119 (2022).
Ballister, E. R., Lai, A. H., Zuckermann, R. N., Cheng, Y. & Mougous, J. D. In vitro self-assembly of tailorable nanotubes from a simple protein building block. Proc. Natl Acad. Sci. 105, 3733–3738 (2008).
Caspar, D. L. & Klug, A. Physical principles in the construction of regular viruses. Cold Spring Harb. Symp. Quant. Biol. 27, 1–24 (1962).
Hattne, J. et al. Analysis of global and site-specific radiation damage in cryo-EM. Structure 26, 759–766 (2018).
De, S. et al. Potato virus A particles – a versatile material for self-assembled nanopatterned surfaces. Virology 578, 103–110 (2023).
Joseph, J. & Savithri, H. S. Determination of 3’-terminal nucleotide sequence of pepper vein banding virus RNA and expression of its coat protein in Escherichia coli. Arch. Virol. 144, 1679–1687 (1999).
Zhou, K. et al. Novel reference genes for quantifying transcriptional responses of Escherichia coli to protein overexpression by quantitative PCR. BMC Mol. Biol. 12, 18 (2011).
Banerjee, S. et al. 2.3 Å resolution cryo-EM structure of human p97 and mechanism of allosteric inhibition. Science 351, 871–875 (2016).
Sui, H. & Downing, K. H. Structural basis of interprotofilament interaction and lateral deformation of microtubules. Structure 18, 1022–1031 (2010).
Galkin, V. E., Orlova, A., Schröder, G. F. & Egelman, E. H. Structural polymorphism in F-actin. Nat. Struct. Mol. Biol. 17, 1318–1323 (2010).
Weber, M. S. et al. Structural heterogeneity of cellular K5/K14 filaments as revealed by cryo-electron microscopy. Elife 10, e70307 (2021).
Kreutzberger, M. A. B. et al. Convergent evolution in the supercoiling of prokaryotic flagellar filaments. Cell 185, 3487–3500.e14 (2022).
Wang, F. et al. A structural model of flagellar filament switching across multiple bacterial species. Nat. Commun. 8, 960 (2017).
Yang, Y. et al. Cryo-EM structures of amyloid-β 42 filaments from human brains. Science 375, 167–172 (2022).
Röder, J., Dickmeis, C. & Commandeur, U. Small, smaller, nano: new applications for potato virus X in nanotechnology. Front. Plant Sci. 10, 158 (2019).
Zhu, J. et al. Protein assembly by design. Chem. Rev. 121, 13701–13796 (2021).
Sharma, M. et al. Shape-morphing of an artificial protein cage with unusual geometry induced by a single amino acid change. ACS Nanosci. Au 2, 404–413 (2022).
Eiben, S. et al. Plant virus-based materials for biomedical applications: trends and prospects. Adv. Drug Del. Rev. 145, 96–118 (2019).
Solovyev, A. G. & Makarov, V. V. Helical capsids of plant viruses: architecture with structural lability. J. Gen. Virol. 97, 1739–1754 (2016).
Tóbiás, I., Palkovics, L., Tzekova, L. & Balázs, E. Replacement of the coat protein gene of plum pox potyvirus with that of zucchini yellow mosaic potyvirus: characterization of the hybrid potyvirus. Virus Res. 76, 9–16 (2001).
Bourdin, D. et al. Evidence that heteroencapsidation between two potyviruses is involved in aphid transmission of a non-aphid-transmissible isolate from mixed infections. Phytopathology 81, 1459–1464 (1991).
Gallo, A., Valli, A., Calvo, M. & García, J. A. A functional link between RNA replication and virion assembly in the potyvirus plum pox virus. J. Virol. 92, e02179–17 (2018).
Lõhmus, A., Hafrén, A. & Mäkinen, K. Coat protein regulation by CK2, CPIP, HSP70, and CHIP is required for potato virus A replication and coat protein accumulation. J. Virol. 91, e01316 (2017).
Ivanov, K. I. et al. Phosphorylation of the potyvirus capsid protein by protein kinase CK2 and its relevance for virus infection. Plant Cell 15, 2124–2139 (2003).
Reichel, A. et al. Noncovalent, site-specific biotinylation of histidine-tagged proteins. Anal. Chem. 79, 8590–8600 (2007).
Valli, A. A., Gallo, A., Rodamilans, B., López-Moya, J. J. & García, J. A. The HCPro from the Potyviridae family: an enviable multitasking Helper Component that every virus would like to have. Mol. Plant Pathol. 19, 744–763 (2018).
Saha, S., Lõhmus, A., Dutta, P., Pollari, M. & Mäkinen, K. Interplay of HCPro and CP in the regulation of Potato Virus A RNA expression and encapsidation. Viruses 14, 1233 (2022).
Besong-Ndika, J., Ivanov, K. I., Hafrèn, A., Michon, T. & Mäkinen, K. Cotranslational coat protein-mediated inhibition of potyviral RNA translation. J. Virol. 89, 4237–4248 (2015).
Wen, A. M. & Steinmetz, N. F. Design of virus-based nanomaterials for medicine, biotechnology, and energy. Chem. Soc. Rev. 45, 4074–4126 (2016).
Ochman, H., Gerber, A. S. & Hartl, D. L. Genetic applications of an inverse polymerase chain reaction. Genetics 120, 621–623 (1988).
Triglia, T., Peterson, M. G. & Kemp, D. J. A procedure for in vitro amplification of DNA segments that lie outside the boundaries of known sequences. Nucleic Acids Res. 16, 8186 (1988).
Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).
Gibson, D. G. et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science 329, 52–56 (2010).
Wingfield, P. Protein precipitation using ammonium sulfate. In Current Protocols in Protein Science vol. APPENDIX 3 A.3F.1-A.3F.8 (NIH Public Access, 1998).
Waluga, T., Zein, M., Jördening, H. J. & Scholl, S. Simulation of reaction integrated adsorption of trienzymatic synthesized laminaribiose. Chem. Ing. Tech. 86, 119–128 (2014).
Niesen, F. H., Berglund, H. & Vedadi, M. The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability. Nat. Protoc. 2, 2212–2221 (2007).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Qiagen. Protocol 4: Enzymatic lysis and proteinase K digestion of bacteria. In RNAprotect® Bacteria Reagent Handbook 30–33 (Qiagen, 2020).
Berger, P. H. & Shiel, P. J. Potyvirus isolation and RNA extraction. Methods Mol. Biol. 81, 151–160 (1998).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Gleeson, J. et al. Accurate expression quantification from nanopore direct RNA sequencing with NanoCount. Nucleic Acids Res. 50, e19 (2022).
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. CryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Punjani, A. & Fleet, D. J. 3D variability analysis: resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. J. Struct. Biol. 213, 107702 (2021).
Punjani, A., Zhang, H. & Fleet, D. J. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat. Methods 17, 1214–1221 (2020).
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife 7, e42166 (2018).
Sanchez-Garcia, R. et al. DeepEMhancer: a deep learning solution for cryo-EM volume post-processing. Commun. Biol. 4, 874 (2021).
Scheres, S. H. W. & Chen, S. Prevention of overfitting in cryo-EM structure determination. Nat. Methods 9, 853–854 (2012).
Cardone, G., Heymann, J. B. & Steven, A. C. One number does not fit all: mapping local variations in resolution in cryo-EM reconstructions. J. Struct. Biol. 184, 226–236 (2013).
Pettersen, E. F. et al. UCSF Chimera - a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr. D Struct. Biol. 75, 861–877 (2019).
Williams, C. J. et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
Jurrus, E. et al. Improvements to the APBS biomolecular solvation software suite. Protein Sci. 27, 112–128 (2018).
Kondrat, F. D. L., Struwe, W. B. & Benesch, J. L. P. Native mass spectrometry: towards high-throughput structural proteomics. In Structural Proteomics: High-Throughput Methods: Second Edition (ed. Owens, R. J.) 349–371 (Humana, New York, 2015).
Marty, M. T. et al. Bayesian deconvolution of mass and ion mobility spectra: from binary interactions to polydisperse ensembles. Anal. Chem. 87, 4370–4376 (2015).
Young, G. et al. Quantitative mass imaging of single biological macromolecules. Science 360, 423–427 (2018).
Kroon, P. C. Martinize 2—VerMoUTH. In Aggregate, Automate, Assemble (ed. Kroon, P.) 16–53 (University of Groningen, 2020).
Periole, X., Cavalli, M., Marrink, S. J. & Ceruso, M. A. Combining an elastic network with a coarse-grained molecular force field: Structure, dynamics, and intermolecular recognition. J. Chem. Theory Comput. 5, 2531–2543 (2009).
Abraham, M. J. et al. Gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015).
Souza, P. C. T. et al. Martini 3: a general purpose force field for coarse-grained molecular dynamics. Nat. Methods 18, 382–388 (2021).
Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 126, 014101 (2007).
Berendsen, H. J. C., Postma, J. P. M., Van Gunsteren, W. F., Dinola, A. & Haak, J. R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81, 3684–3690 (1984).
Hu, G. et al. flDPnn: Accurate intrinsic disorder prediction with putative propensities of disorder functions. Nat. Commun. 12, 4438 (2021).
Sehnal, D. et al. MOLE 2.0: Advanced approach for analysis of biomacromolecular channels. J. Cheminform. 5, 39 (2013).

Acknowledgements

We thank Jelka Lenarčič and Tanja Peric for technical assistance with cloning, Izidor Friškovec and Marija Srnko for technical support with protein purification and biochemical characterization, and Dr. Matic Kisovec for technical support with cryo-EM data analysis. We thank Dr. Miha Modic and Dr. Oscar G. Wilkins for advice on nanopore RNA sequence analysis and Prof. Dr. Jernej Ule for use of nanopore sequencing in his ‘RNA networks’ laboratory. We thank Dr. Ana Crnković for advice on the dual-expression system and Dr. Ion Gutierrez Aguirre for kindly providing the PVY virus. CIISB research infrastructure projects LM2018127 and LM2015043, funded by MEYS CR, are gratefully acknowledged for financial support of measurements at CF CryoEM. We thank the National Institute of Chemistry Cryo-EM Facility supported by the Slovenian Research Agency Infrastructure Program IO-0003. We thank the Slovenian Research Agency for funding (grant numbers P1-0391 (M.P., A.K.), J1-4410 (M.P., A.K., L.K.), PhD fellowship (N.K.), P4-0165 (M.T.Ž.), J1-2467 (M.T.Ž.), P1-0010 (F.M.)). We thank the National Institute of Chemistry, Slovenia, for the PhD ‘Janko Jamnik’ fellowships to L.K. and T.K. We thank the European Union’s Horizon 2020 research and innovation program for the grant 835300-RNPdynamics (T.K., Ž.V.) and the Biotechnology and Biological Sciences Research Council for the grant BBSRC sLoLa BB/W00349X/1 (E.H., J.L.P.B.).

Author information

Authors and Affiliations

Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Ljubljana, Slovenia
Luka Kavčič, Andreja Kežar, Neža Koritnik, Tajda Klobučar, Žiga Vičič & Marjetka Podobnik
PhD Program ‘Chemical Sciences’, Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
Luka Kavčič
PhD Program ‘Biomedicine’, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
Neža Koritnik
Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, Slovenia
Magda Tušek Žnidarič
PhD Program ‘Biosciences’, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
Tajda Klobučar
Theory Department, National Institute of Chemistry, Ljubljana, Slovenia
Franci Merzel
Department of Chemistry, University of Oxford, Oxford, UK
Ellie Holden & Justin L. P. Benesch
Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, UK
Ellie Holden & Justin L. P. Benesch

Authors

Luka Kavčič
Andreja Kežar
Neža Koritnik
Magda Tušek Žnidarič
Tajda Klobučar
Žiga Vičič
Franci Merzel
Ellie Holden
Justin L. P. Benesch
Marjetka Podobnik

Contributions

The study was conceptualized by M.P. and L.K. Molecular cloning was conducted by L.K., A.K., and N.K. Protein expression and purification were conducted by L.K. and A.K. Cryo-EM grid preparation and screening were conducted by L.K. and A.K. Data processing, model building, and analysis were conducted by L.K. and A.K. L.K. conducted biophysical assays. Negative staining transmission electron microscopy grid preparation and screening were conducted by M.T.Ž. RT-qPCR and RNA sequencing were performed by L.K., T.K. and Ž.V. Mass spectrometry and mass photometry were conducted by J.L.P.B. and E.H. MD simulations were conducted by F.M. The manuscript was prepared by L.K. and M.P. All authors provided critical feedback and helped shape the research, analysis and manuscript.

Corresponding author

Correspondence to Marjetka Podobnik.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Chemistry thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Description of Additional Supplementary Files

Supplementary data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kavčič, L., Kežar, A., Koritnik, N. et al. From structural polymorphism to structural metamorphosis of the coat protein of flexuous filamentous potato virus Y. Commun Chem 7, 14 (2024). https://doi.org/10.1038/s42004-024-01100-x

Received: 20 September 2023
Accepted: 05 January 2024
Published: 17 January 2024
DOI: https://doi.org/10.1038/s42004-024-01100-x
