Article | Open | Published:

# Nonequilibrium self-assembly dynamics of icosahedral viral capsids packaging genome or polyelectrolyte

## Abstract

The survival of viruses partly relies on their ability to self-assemble inside host cells. Although coarse-grained simulations have identified different pathways leading to assembled virions from their components, experimental evidence is severely lacking. Here, we use time-resolved small-angle X-ray scattering to uncover the nonequilibrium self-assembly dynamics of icosahedral viral capsids packaging their full RNA genome. We reveal the formation of amorphous complexes via an en masse pathway and their relaxation into virions via a synchronous pathway. The binding energy of capsid subunits on the genome is moderate (~7kBT0, with kB the Boltzmann constant and T0 = 298 K, the room temperature), while the energy barrier separating the complexes and the virions is high (~ 20kBT0). A synthetic polyelectrolyte can lower this barrier so that filled capsids are formed in conditions where virions cannot build up. We propose a representation of the dynamics on a free energy landscape.

## Introduction

The simplest icosahedral viruses can be viewed as nanometer-scaled protein shells called capsids encasing the genome in the form of nucleic acids. The genome is packaged during replication in the host cell1 through an efficient self-assembly process2,3,4. The fact that some viruses are able to self-assemble from purified components and to package materials in vitro has promoted the development of engineered viral capsids enclosing synthetic polymers5,6, heterologous nucleic acids7,8,9,10, nanoemulsion droplets11, gold12,13, and magnetic14 nanoparticles to name a few. Therefore, understanding the molecular mechanisms of viral self-assembly can help devise therapeutic strategies for inhibiting viral replication15,16 and design functional biocompatible nanocontainers17 for drug delivery and medical imaging. Furthermore, viruses can also help address fundamental questions on the physical principles underlying multicomponent self-assembly, notably by elucidating the dynamic pathways leading to error-free complex structures18,19.

Capsid subunits are subject to a delicate balance between a short-range, mostly hydrophobic, attraction and a long-range electrostatic repulsion, while the subunit–genome interaction is electrostatically attractive2,20,21. Molecular dynamics simulations22,23,24 have shed light on two pathways that depend on the interaction energies between components25,26: (i) in the first pathway, a piece of capsid is formed on the genome then grows by sequential binding of capsid subunits. Subunit binding and genome packaging occur in a synchronous manner. This pathway takes place mainly for strong subunit–subunit attractive interactions and is often called nucleation growth or nucleation elongation in the field; (ii) in the other pathway, subunits adsorb onto the genome en masse in a short time frame and the resulting amorphous complex relaxes into an ordered, filled capsid. This pathway is preferentially followed for strong subunit–genome attractive interactions.

The self-assembly and disassembly dynamics of empty icosahedral viral capsids has been investigated by time-course static light scattering27,28,29, resistive-pulse sensing through nanofluidic channel30, atomic force microscopy31,32, and time-resolved small-angle X-ray scattering (TR-SAXS)33,34,35,36. In the presence of genome, by contrast, data are scarce because the dynamics is intrinsically more complex and the coexistence of different components renders the interpretation difficult. Kler et al.37 observed by TR-SAXS the synchronous pathway during the packaging of short RNA segments in SV40-derived nanoparticles. However, because SV40 is a DNA virus and its genome is actually 10-fold larger than the segments used in that study, the observations may not be relevant in a biological context. Single molecule fluorescence correlation microscopy38,39 suggested the en masse pathway for the genome packaging in satellite tobacco necrosis virus capsid, but the technique provides the hydrodynamic radius as a function of time, and thereby lacks information on the morphology and on the mass of species.

The cowpea chlorotic mottle virus (CCMV) is an icosahedral, single-stranded (ss)RNA plant virus widely used in physical6,8,28 and nanotechnological9,10,19,40 studies. Its T = 3 capsid is made up of 90 chemically identical protein dimers—hereafter called subunits—enclosing the genome shared between four ssRNA segments. In the absence of genome, the subunits remain dissociated at neutral pH, but spontaneously form empty capsids at acidic pH due to globally enhanced attractive interactions41,42. With genome, recent investigations8,43 have shown that, while subunits barely interact with it at neutral pH and high ionic strength, amorphous nucleoprotein complexes (NPCs) are formed when the ionic strength is lowered. In turn, at acidic pH, these NPCs relax into well-ordered, icosahedral virions. Roughly speaking, the ionic strength controls the subunit–genome interactions and the pH acts on the subunit–subunit interactions.

In this study, we conduct a quantitative investigation on the nonequilibrium self-assembly dynamics of CCMV capsids packaging their full genetic material. Using TR-SAXS with a high brilliance synchrotron source, we establish the existence of the en masse and synchronous pathways by tuning the subunit–subunit and subunit–genome interactions, and we estimate several related nonequilibrium quantities. We also investigate the role played by the cargo (genome or polyelectrolyte) in the pathway selection.

## Results

### NPCs self-assemble through an en masse pathway

At pH 7.5 and an ionic strength of 0.5 M, CCMV subunits are in the form of dissociated dimers42 with a radius of gyration Rg of 33 Å (Supplementary Fig. 1). CCMV genome consists of four ssRNA segments distributed in such a way that 2980 nucleotides on average are packaged per virion. Therefore, for the sake of simplicity, we hereafter refer to ssRNA molecules of this reference size as the genome. Static scattering measurements on purified genome gave a Rg of 140 ± 11 (s.e.m.) Å (Supplementary Fig. 2).

The subunits have two positively charged, flexible, N-terminal arms (Fig. 1a, inset) that allow them to bind on the genome by electrostatic interactions. We performed static measurements by small-angle neutron scattering (SANS) with a mixture of subunits and genome in 68% D2O at pD 7.5. In this fraction of heavy water, the contrast between RNA and the solvent vanishes, and solely the scattering intensity arising from the proteins is detected6. The subunit-to-genome mass ratio ρ in a native virion is about 3.6, but we performed experiments with ρ around 6, which has been found to be the minimal ratio ensuring the complete packaging of genome43. Figure 1a shows that whereas the scattering intensity was near the noise level at an ionic strength of 0.5 M, it increased by one order of magnitude at 0.1 M, which can be ascribed to the binding of subunits on the genome, thus forming NPCs (see drawings in Fig. 1a). By assuming that the solution contained free subunits and Nth-order complexes (N = 0, 1, 2…, + ∞), each of which was made of N subunits bound on the genome, we can infer an upper limit $$\left\langle N \right\rangle _{\mathrm{up}}$$ of the mean value of N from the forward scattering intensities I0I(q = 0) (see Methods). Accordingly, we estimated that each NPC contained on average a maximum of $$\left\langle N \right\rangle _{\mathrm{up}}$$ = 75 ± 31 (s.e.m.) subunits. The extraction of $$\left\langle N \right\rangle _{\mathrm{up}}$$ from the scattering data does not require any prior knowledge on the distribution of N, and therefore, this independent-model quantity was used across this study to probe the binding of subunits on the genome in various conditions.

We carried out TR-SAXS experiments by rapidly mixing purified subunits and genome with a final ionic strength of 0.1 M. Figure 1b depicts the assembly of NPCs for ρ = 6. Subunits were bound on the genome very rapidly and their mean number per NPC remained stable for more than 10 min at about 77 subunits. A single exponential decay function indicated a binding time τbind of 28 ms. By contrast, the variation of Rg occurred over a structural relaxation time τstruc of 48 s, that is, three orders of magnitude higher than τbind. We used a double exponential decay for fitting Rg: the first phase might reflect a compaction of the genome due to the presence of subunits, while the second phase might result from the partial association of bound subunits into capsid subassemblies. These findings suggested that NPCs were formed through an en masse pathway (see drawing of Fig. 1b): subunits were first captured by the genome in ~100 ms, then NPCs slowly self-organized over several minutes. Interestingly, other experiments performed in the same conditions showed that τstruc was reduced at small values of ρ (Supplementary Fig. 3): the compaction of the genome, which is thought to correspond to the fast phase, may slow down when a large number of subunits bind on it as is the case at high values of ρ. The subsequent self-organization of capsid subassemblies, that is, the second phase, may be affected accordingly.

The experiment depicted on Fig. 1c shows a solution containing NPCs pre-assembled at an ionic strength of 10 mM and ρ =5.5, and rapidly diluted twofold. $$\left\langle N \right\rangle _{\mathrm{up}}$$ decreased slowly from 75 to 58 subunits per NPC with an unbinding time of 204 s. The variation of Rg was not significant over the period of observation (~1200 s). We think that it was due to two competing effects: the NPCs tended to be less compact upon the release of subunits, but at the same time, the released subunits did not contribute any longer to the radius of gyration of NPCs. Rg remained therefore roughly unchanged. This experiment demonstrates that dilution is sufficient to release subunits from NPCs within a few minutes and suggests a weak binding energy between the subunits and the genome.

The morphology of NPCs at equilibrium is shown in Fig. 2a by transmission electron cryomicroscopy (cryo-TEM): we can see amorphous species with an equivalent diameter of 27 ± 4 (s.d.) nm and an average aspect ratio of 1.13, that is, a size which is close to that of the native virion (~29 nm in diameter). We carried out an ab initio shape determination of NPCs with a model of densely packed beads from the scattering pattern collected at the end of the assembly experiment performed at ρ = 6. The low resolution structure appeared elongated with a length of ~380 Å and a cross-sectional diameter of ~230 Å (Fig. 2b–d), in fair agreement with the cryo-TEM images.

In order to get a quantitative insight into the binding thermodynamics and the diffusion of subunits during the NPC self-assembly, we analyzed the scattering intensities with a simple templated assembly model (see Supplementary Methods), that is,

$$C_N + S\ \mathop{\leftrightarrows}\limits_{{k^ - }}^{{k^ + }}\ C_{N + 1},$$
(1)

where S denotes the free subunits, CN are the Nth-order complexes with N = 0, 1, 2…, + ∞ and C0 stands for the bare genome. k+ and k are the forward and backward reaction rate constants, respectively. At equilibrium, the mean number of subunits per NPC is given by $$\left\langle N \right\rangle$$(t→ + ∞) ≈ (cS − K−1)/cG, where cS and cG are the total molar concentrations of subunits and genome respectively, and K−1 = k/k + is the inverse of equilibrium constant, also equal to the critical subunit concentration above which NPCs are formed (Supplementary Fig. 4). For the sake of simplicity, we kept k+ and k constant, which implied that the effects of the interactions between subunits, whether they were bound on the genome or free, were averaged over N. Figure 2e (black discs) shows $$\left\langle N \right\rangle$$ estimated near equilibrium for experiments carried out at various ρ, by using an analytical expression relating it with I0(t→ + ∞) through the templated assembly model (see Supplementary Methods). The total subunit concentration cS was fixed in all the experiments, which implied that $$\left\langle N \right\rangle$$(t→ + ∞)(1−K−1/cS)ρ, and thus we estimated K−1 from a linear fit (black line) to be 13.6 ± 1.3 (s.e.m.) µM. The fact that the experimental curve was linear indicated that the NPCs were still far from being saturated by bound subunits. At higher values of ρ, we expect the number of bound subunits to reach a limit and $$\left\langle N \right\rangle$$(t→ + ∞) will exhibit a plateau (Supplementary Fig. 5). It follows from K−1 that the binding free energy between the subunits and NPCs was Gbind −17 kJ mol−1 = −6.7kBT0 (see Supporting Information for the expression of Gbind), with kB the Boltzmann constant and T0 the room temperature (298 K). A linear fit of $$\left\langle N \right\rangle _{\mathrm{up}}$$ as a function of ρ (light gray line) returned a K−1 of 10.9 ± 1.8 (s.e.m.) µM and Gbind −6.9kBT0, that is, a very similar value. This value is moderate and indicates that the subunits were indeed loosely bound to the NPCs. Recall, however, that Gbind also comprises an effective contribution arising from the subunit–subunit interactions.

Furthermore, if the binding of subunits was diffusion-limited, the forward reaction rate constant entering the templated assembly model would be expressed by k+ = 4πNARCDS, where RC is a typical radius for NPCs taken as 120 Å, DS is the apparent diffusion coefficient of subunits, and NA is the Avogadro constant. By fitting I0 collected from an experiment at ρ = 8 with the intensity given by the templated assembly model (Supplementary Fig. 6), we estimated DS to be 4.9 × 10−12 m2 s−1, while the diffusion coefficient of a single subunit computed by molecular dynamics simulations amounted to 3.0 × 10−10 m2 s−1, that is, two orders of magnitude higher. This discrepancy indicates an energy barrier, possibly arising from the subunits in NPCs hindering the insertion of incoming free subunits through steric and electrostatic repulsions. This barrier likely also arises from an orientation dependence of the incoming subunits as well as from a dependence on the subunit conformation, notably at the level of the RNA-binding domain. At any rate, the binding process is partly reaction-limited.

### NPCs relax into virions via a synchronous pathway

By lowering the pH of a solution of NPCs, the subunit–subunit interactions become more attractive. As a result, the NPCs become metastable and slowly relax into species structurally identical to native virions8. In the experiment depicted on Fig. 3a, we mixed a solution of pre-assembled NPCs with a buffer bringing the pH to 5.2 and the ionic strength to 55 mM. $$\left\langle N \right\rangle _{\mathrm{up}}$$ increased from 75 to about 85 in more than 1 h, which gave a long binding time of 3100 s. The radius of gyration did not change significantly during this time frame (a variation of a few angstroms compared to 11 nm is below the typical resolution of small-angle scattering). However, a close inspection of the form factors of NPCs show the arches becoming more pronounced over time, indicating a structural self-organization (Fig. 3b). A comparison with the form factor of purified virions showed that the relaxing NPCs had a size and a structure close to those of native virions, and that empty capsids were not in detectable amounts (Supplementary Fig. 8). The evolution of the form factor at its first minimum reflects the dynamics of this self-organization process and we estimated the structural relaxation time τstruc to be 2920 s (Fig. 3c). Note that we used again a double exponential decay function for the fittings: the first phase might be fast because several binding sites in the NPCs were readily accessible to the free subunits, while the second phase might take more time because the dynamics became limited by the slow construction of the capsid. Since τbind ≈ τstruc, the system likely proceeded through a synchronous mechanism where the capsid was building up by capturing free subunits while packaging the genome. Simultaneously, a few bound subunits could continue being exchanged with the bulk solution or even transferred between NPCs through collisions. We carried out the same relaxation experiment at various temperatures ranging from 10 °C up to 40 °C. For an energetically activated process, the binding time should follow an Arrhenius-like law, that is, τbind exp(Ea/kBT), where Ea is the activation energy. Figure 3d plots the logarithm of τbind as a function of 1/T and shows a linear relationship consistent with the existence of an energy barrier Ea estimated to be 20kBT0. We also investigated the shortest decay time of the binding process and we found out that the fast phase might be activated as well with an energy of 11kBT0 (see Supplementary Fig. 9a). At all the temperatures, τstruc remained of the same order of magnitude as τbind, including for the fast phase (Supplementary Figs. 9b, 10, 11). We checked out the morphology of the species by cryo-TEM after a few days of incubation (Fig. 3e). Like the native virions, they displayed a diameter of 29 ± 1 (s.d.) nm and a hollow core arising from the fact that the genome is packed against the interior surface of the capsid45 due to electrostatic interactions.

### PSS-filled capsids are formed via an en masse pathway

Poly(styrene sulfonic acid) (PSS) is a flexible negatively charged synthetic polyelectrolyte, whose linear charge density (~ 0.33e Å−1) is close to that of ssRNA. We used 600-kDa PSS which carried about 3100 negative charges on average, that is, an amount similar to that of the viral genome. However, subunits and PSS mixed at pH 7.5 produce spherical species6. Figure 4a shows the time evolution of $$\left\langle N \right\rangle _{\mathrm{up}}$$ estimated by assuming an average of 1.5 PSS chains packaged per capsid, as previously established by SANS6. $$\left\langle N \right\rangle _{\mathrm{up}}$$ plateaued at ~70—which was consistent with the presence of objects smaller than native virions (90 subunits)—with a binding time τbind of 17 ms very close to the τbind = 28 ms obtained with the genome in the same conditions. Likewise, the radius of gyration relaxed in τstruc = 12 s, which was of the same order of magnitude as the value of 48 s found with the genome. The time evolution of Rg exhibited two phases, which were most likely of the same nature as those observed with the genome (Fig. 1b). As a consequence, since τbindτstruc, PSS-filled capsids self-assembled via an en masse pathway: the subunits were rapidly bound on the polyelectrolyte and the resulting complexes slowly self-organized into spherical species. Cryo-TEM images in Fig. 4b show that the final capsids were well assembled but slightly less ordered than the native virions, which suggested that the capsids comprised defects. The vast majority had a diameter of 23 ± 1 (s.d.) nm and only a few species were larger than 29 nm, in good agreement with our previous results6. The fact that capsids were able to close up in the presence of PSS but not with genome at pH 7.5 could be explained by the hydrophobicity and the flexibility of PSS. PSS is known to be insoluble in water when its charge fraction drops below 30% owing to its hydrophobic backbone46. We therefore think that the RNA-binding domains of the bound subunits might neutralize a large fraction of the negative charges carried by the PSS. The latter, which is very flexible, might then collapse through a coil–globule transition that could allow the bound subunits to contact each other and to build up a closed shell, despite the energy barrier related to their electrostatic repulsion and to the conformational change required to arrange onto an icosahedral lattice. A recent theoretical study demonstrated that flexible, linear chains require less free energy than stiff, branched ones to be confined in a capsid47, which is in qualitative agreement with our results. Like virions, PSS-filled capsids are stable against dilution, which indicates that, at the end of assembly, the subunits making up the capsid are no longer able to exchange with the bulk solution.

## Discussion

The present study shows experimentally how the self-assembly dynamics of icosahedral virions is influenced by the subunit–subunit and subunit–genome interactions. Garmann et al.8,48 reported that during the assembly of CCMV NPCs performed with exogenous RNA derived from the genome of brome mosaic virus and of tobacco mosaic virus, all subunits were bound to RNA regardless of the subunit-to-RNA mass ratio well beyond the ratio of 3.6 found in native virions. This finding entails the irreversibility of the binding and, subsequently, a high binding energy of subunits on NPCs. In contrast, the binding energy was found here, with the native genome, to be moderate (~7kBT0) despite its electrostatic nature. This is more consistent with competition experiments where subunits initially bound to exogenous RNA eventually migrated toward viral RNA as long as the NPCs had not yet relaxed into virions43. Such a moderate binding energy may thus be advantageous for the virus, allowing it to select its genome specifically. It is also in agreement with our observation of 75 subunits per NPC at most in conditions where the subunit-to-genome molar ratio was 150 (ρ = 6) because a significant fraction of subunits remained free. Yet, these NPCs relaxed into well-formed virions, that is, comprising 90 subunits, upon lowering the pH. We emphasize that, at pH 7.5, the subunits have a negative net charge and the attractive force with the genome is ensured by the subunit dipole moment. Molecular dynamics simulations revealed that the dipole moment was reduced subsequently to the folding of the RNA-binding domains against the body of the proteins (Supplementary Fig. 13), thus contributing to reduced subunit–genome interactions.

In agreement with coarse-grained simulations24, our measurements provided evidence that the en masse and synchronous pathways were selected according to the interactions tuned here by the pH and ionic strength. In host cells, cooperativity between subunits and genome may be required to package and therefore to protect the genome within a short time frame. A separate synchronous step in turn prevents misassembly—including exogeneous RNA packaging—and kinetic traps. In this respect, note that when we tried to perform a one-step assembly by mixing subunits and genome while simultaneously lowering the pH and the ionic strength, a lot of misassembled and aggregated objects were formed. What may drive the assembly in vivo is still unclear. A possible candidate was divalent cations Ca2+ that are known to stabilize the capsid44. However, our experiments performed in the presence of Ca2+ did not reveal any significant change in the self-assembly dynamics of NPCs (Supplementary Fig. 14). Surprisingly, increasing subunit–subunit interactions by acidification was not required for PSS-filled capsids, which demonstrates that, in certain conditions, the subunits are able to build up a closed shell in a single step at neutral pH. We hypothesize therefore that the assembly occurs in vivo at neutral pH and that divalent cations have little role to play at this stage. NPCs may be formed gradually as the subunits are expressed, which permits an efficient selectivity for the genome, and then, when the number of subunits per NPC reaches a high enough value—probably well above 100—the NPCs irreversibly relax into icosahedral virions. This scheme has the advantage that it provides a simple mechanism—namely, capsid protein expression—to switch to the assembly step of the viral cycle, a crucial regulation level for positive-sense (ss)RNA viruses such as CCMV whose RNA serve not only as genome but also as messenger RNA and replication templates.

Figure 5 summarizes the self-assembly dynamics on a free energy landscape parametrized by two reaction coordinates, that is, the number of subunit–subunit contacts (x-axis) and the number of subunit–genome contacts (y-axis). Along the x-axis, the dynamics is reaction-limited with a high energy barrier to overcome as encountered in the synchronous pathway. Along the y-axis, the dynamics is diffusion-limited only as long as only a few subunits are bound on the genome. Beyond a certain number of subunits, the dynamics becomes reaction-limited because more collisions of incoming subunits with NPCs are required for successful insertions. The free energy landscape can be tailored by adjusting the pH and ionic strength, but also depends on the flexibility and hydrophobicity of the polyelectrolyte to package.

Elucidating the dynamic pathways pertaining to viral replication will be useful to combat viral infection. But viruses are also a great source of inspiration for biomimetic nanomaterials, especially for synthetic delivery systems made of oppositely charged components such as polymer–DNA and lipid–DNA complexes used for gene delivery applications49. The atomic precision with which their building blocks form so complex a structure and their stability in a variety of adverse environments should foster their use outside their natural context. The present work will hopefully contribute to strengthening the efforts devoted to understanding, modeling, and harnessing the assembly of a wide range of viruses, viral nanocages, and synthetic delivery systems.

## Methods

### Sample preparation

CCMV was grown in blackeye cowpea leaves (Vigna ungiculata), then capsid proteins and genomic RNA were extracted from purified virions6,35,36. More precisely, 140 g of infected leaves were mixed and stirred with 300 mL of 0.15 M sodium acetate, pH 4.8, and 300 mL of ice-cold chloroform prior to 10 min of centrifugation at 10,000 × g. The virions in the supernatant were precipitated by adding NaCl to a final concentration of 20 mM and 8% (v w−1) of poly(ethylene glycol) (MW 8000). The solution was centrifuged at 10,000 × g for 10 min. The pellet was resuspended and the solution centrifuged at 8000 × g for 10 min. The supernatant was centrifuged through a 20% (v w−1) sucrose cushion at 150,000 × g for 2 h and the virions in the pellet were stored at −80 °C. For protein purification, virions were dialyzed against 50 mM Tris-HCl, pH 7.5, 0.5 M CaCl2, 1 mM EDTA, pH 8, and 1 mM dithiothreitol (DTT). The solution was centrifuged at 150,000 × g for 18 h and pure proteins collected in the supernatant were stored at 4 °C in 50 mM sodium acetate, pH 4.8, 0.5 M NaCl, and 1 mM EDTA. Viral RNA was purified by mixing 0.1 mL of virions with 1 mL of TRIzol® reagent (Life Technologies, France). The sample was centrifuged at 12,000 × g for 15 min and the aqueous phase was removed prior to addition of 0.5 mL of isopropanol. After a 10-min centrifugation at 12,000 × g, the pellet was washed with 1 mL of 75% ethanol and centrifuged at 7500 × g for 5 min. The RNA pellet was resuspended in RNase-free water and stored at −80 °C until use. Virions, capsid proteins, and genomic RNA were deemed pure for A260/A280 > 1.6, A280/A260 > 1.65, and A260/A280 > 1.8, respectively.

Six hundred kilodaltons PSS was sulfonated6 from polystyrene (Polymer Source, Canada). Briefly, 100 mg of polystyrene was dissolved in 15 mL of cyclohexane. Four microliters of a mixture of concentrated sulfuric acid and phosphoric acid was added and the solution was agitated for 2 h at 50 °C. Ten milliliters of ice water was added, and then the aqueous phase was filtered and dialyzed against pure sterile water for 2 weeks. PSS was freeze-dried and stored until use. Elemental analyses indicated an average degree of sulfonation of 95%.

All samples were dialyzed against their final buffer no more than 2 days before experiments and stored at 4 °C. For the assembly of NPCs, subunits were dispersed in 50 mM Tris-HCl, pH 7.5, 0.45 M NaCl, 1 mM DTT, 0.5 mM phenylmethylsulfonyl fluoride (PMSF), while genome and PSS were in 10 mM Tris-HCl, pH 7.5. The final concentration of subunits was 1 g L−1 (24.6 µM) and the amount of genome was adjusted according to the desired subunit-to-genome mass ratio. An additional volume of 10 mM Tris-HCl, pH 7.5—with 30 mM CaCl2 for the experiment with divalent cations—was used to adjust the ionic strength to 0.1 M. For the relaxation of NPCs into virions, pre-assembled NPCs were prepared in 10 mM Tris-HCl, pH 7.5, 1 mM DTT, 0.5 mM PMSF with a subunit concentration of 1.72 g L−1 (42.3 µM), and they were mixed with an equal volume of 10 mM sodium acetate, pH 4.8, and 90 mM NaCl. The temperature of the experiments was 20 °C unless otherwise stated.

### Time-resolved small-angle X-ray scattering

Scattering patterns were recorded at the ID02 beamline of the European Synchrotron Radiation Facility (Grenoble, France). The sample-to-detector distance was set to 3.0 m, which provided scattering wavenumbers q ranging from 3.3 × 10−3 to 0.26 Å−1. The experimental setup included a stopped-flow apparatus (BioLogic SFM-400, France) equipped with four independent syringes. Beam exposure time was set to 5 ms. The two-dimensional scattering images were radially averaged and the uncertainty on the scattering intensity was computed as the standard deviation of the photon counts in the pixels within each bin. Intensities were converted into absolute units after subtraction of the contribution of buffer solutions. The forward scattering intensity and the radius of gyration were determined by using the DATRG tool of the ATSAS suite50. The ab initio reconstruction of NPCs was carried out with the program DAMMIN50, which minimized the discrepancy between the calculated scattering pattern of a model of densely packed beads and the experimental pattern of NPCs smoothed by the program GNOM50.

The forward scattering intensity I0I(q = 0) of a NPC solution can be written as:

$$I_0 = \Delta b_{\mathrm{S}}^2X_{{\mathrm S}} + \mathop {\sum }\limits_{N = 0}^{ + \infty } X_N\left( {\Delta b_{\mathrm{G}} + \Delta b_{\mathrm{S}}N} \right)^2 = f\left( {\left\langle N \right\rangle ,\left\langle {N^2} \right\rangle } \right),$$
(2)

with ΔbG and ΔbS the excess scattering lengths of genome and subunits respectively, XN the molar concentration of Nth-order NPCs (X0XG the molar concentration of bare genome) and XS the molar concentration of free subunits. In reality, the limit to infinity is a convenient approximation and N should be bounded by a maximal value (see Supplementary Methods for further discussion). The mean number of subunits per NPC is defined as $$\left\langle N \right\rangle = \mathop {\sum }\limits_{N = 0}^{ + \infty } NX_N{\mathrm{/}}\!\mathop {\sum }\limits_{N = 0}^{ + \infty } X_N$$. Small-angle scattering is an ensemble-averaging technique and it is thereby not possible to infer the distribution of N solely from measured intensities. In particular, we cannot calculate $$\left\langle N \right\rangle$$ from Eq. (2) without the knowledge of the standard deviation of N. However, we can find an upper limit $$\left\langle N \right\rangle _{\mathrm{up}}$$, which verifies $$\left\langle N \right\rangle _{\mathrm{up}}\, \ge \left\langle N \right\rangle$$ regardless the probability distribution of N (see Supplementary Methods for justification):

$$\left\langle N \right\rangle _{\mathrm{up}} \equiv \sqrt { {\Gamma}^2 + \frac{{I_0 - I_0^ \ast }}{{\Delta b_{\mathrm{S}}^2c_{\mathrm{G}}}}} - \Gamma,$$
(3)

with Г = ΔbGbS − 1/2 and $$I_0^ \ast \equiv \Delta b_{\mathrm{G}}^2c_{\mathrm{G}} + \Delta b_{\mathrm{S}}^2c_{\mathrm{S}}$$ the forward scattering intensity produced by the mixture of subunits and genome prior to complexation. cG and cS are the total molar concentrations of genome and subunits, respectively.

### Small-angle neutron scattering

SANS measurements were carried out at the PAXY beamline of the Laboratoire Léon Brillouin (Saclay, France). Two wavelengths of 6 and 8.5 Å, combined with sample-to-detector distances of 1, 3, and 5 m, were employed to cover a range of q values from 4 × 10−3 to 0.5 Å−1. Samples were exposed for 1 h and the scattering intensities were recorded on a two-dimensional multi-wire detector. The scattering images were radially averaged and the uncertainty on the scattering intensity was computed as the standard deviation of the neutron counts in the pixels within each bin. Buffer solutions containing 68% D2O were used to contrast match the genome.

### Cryo-transmission electron microscopy

Three microliters of sample solution was deposited onto a holey carbon grid (Quantifoil R2/2) ionized by glow discharge. The grid was blotted with a filter paper and directly plunged into liquid ethane cooled down by liquid nitrogen using a FEI Vitrobot. The grids were stored in liquid nitrogen until use. The frozen samples were transferred into a Gatan 626 cryo-holder and observed at −180 °C via a JEOL JEM-2010 microscope equipped with a 200-kV field emission gun. The samples were imaged with a magnification of × 50,000 using a minimal dose system and the images were collected with a Gatan Ultrascan 4K CCD camera at 1.5 and 2 µm of defocus.

### Data availability

The authors declare that the data supporting the findings of this study are available within the article and its Supplementary Information files or from the corresponding author on reasonable request.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Change history

• ### 05 September 2018

This Article was originally published without the accompanying Peer Review File. This file is now available in the HTML version of the Article; the PDF was correct from the time of publication.

## References

1. 1.

Speir, J. A. & Johnson, J. E. Nucleic acid packaging in viruses. Curr. Opin. Struct. Biol. 22, 65–71 (2012).

2. 2.

Bruinsma, R. F., Comas-Garcia, M., Garmann, R. F. & Grosberg, A. Y. Equilibrium self-assembly of small RNA viruses. Phys. Rev. E 93, 032405 (2016).

3. 3.

Bruinsma, R. F. & Klug, W. S. Physics of viral shells. Annu. Rev. Condens. Matter Phys. 6, 245–268 (2015).

4. 4.

Verdier, T., Foret, L. & Castelnovo, M. Modeling the kinetics of open self-assembly. J. Phys. Chem. B 120, 6411–6420 (2016).

5. 5.

Sikkema, F. D. et al. Monodisperse polymer-virus hybrid nanoparticles. Org. Biomol. Chem. 5, 54–57 (2007).

6. 6.

Tresset, G. et al. Weighing polyelectrolytes packaged in viruslike particles. Phys. Rev. Lett. 113, 128305 (2014).

7. 7.

Azizgolshani, O., Garmann, R. F., Cadena-Nava, R., Knobler, C. M. & Gelbart, W. M. Reconstituted plant viral capsids can release genes to mammalian cells. Virology 441, 12–17 (2013).

8. 8.

Garmann, R. F., Comas-Garcia, M., Knobler, C. M. & Gelbart, W. M. Physical principles in the self-assembly of a simple spherical virus. Acc. Chem. Res. 49, 48–55 (2016).

9. 9.

Garmann, R. F. et al. A simple RNA–DNA scaffold templates the assembly of monofunctional virus-like particles. J. Am. Chem. Soc. 137, 7584–7587 (2015).

10. 10.

Kwak, M. et al. Virus-like particles templated by DNA micelles: a general method for loading virus nanocarriers. J. Am. Chem. Soc. 132, 7834–7835 (2010).

11. 11.

Chang, C. B., Knobler, C. M., Gelbart, W. M. & Mason, T. G. Curvature dependence of viral protein structures on encapsidated nanoemulsion droplets. ACS Nano 2, 281–286 (2008).

12. 12.

Goicochea, N. L., De, M., Rotello, V. M., Mukhopadhyay, S. & Dragnea, B. Core-like particles of an enveloped animal virus can self-assemble efficiently on artificial templates. Nano. Lett. 7, 2281–2290 (2007).

13. 13.

Liu, A. J. et al. Protein cages as containers for gold nanoparticles. J. Phys. Chem. B 120, 6352–6357 (2016).

14. 14.

Huang, X. L. et al. Self-assembled virus-like particles with magnetic cores. Nano. Lett. 7, 2407–2416 (2007).

15. 15.

Perilla, J. R., Hadden, J. A., Goh, B. C., Mayne, C. G. & Schulten, K. All-atom molecular dynamics of virus capsids as drug targets. J. Phys. Chem. Lett. 7, 1836–1844 (2016).

16. 16.

Zlotnick, A. & Mukhopadhyay, S. Virus assembly, allostery and antivirals. Trends Microbiol. 19, 14–23 (2011).

17. 17.

Douglas, T. & Young, M. Viruses: making friends with old foes. Science 312, 873–875 (2006).

18. 18.

Whitelam, S. & Jack, R. L. The statistical mechanics of dynamic pathways to self-assembly. Annu. Rev. Phys. Chem. 66, 143–163 (2015).

19. 19.

Liljestrom, V., Mikkila, J. & Kostiainen, M. A. Self-assembly and modular functionalization of three-dimensional crystals from oppositely charged proteins. Nat. Commun. 5, 4445 (2014).

20. 20.

Kegel, W. K. & van der Schoot, P. Competing hydrophobic and screened-coulomb interactions in hepatitis B virus capsid assembly. Biophys. J. 86, 3905–3913 (2004).

21. 21.

Tresset, G. et al. Two-dimensional phase transition of viral capsid gives insights into subunit interactions. Phys. Rev. Appl. 7, 014005 (2017).

22. 22.

Elrad, O. M. & Hagan, M. F. Encapsulation of a polymer by an icosahedral virus. Phys. Biol. 7, 045003 (2010).

23. 23.

Hagan, M. F. Modeling viral capsid assembly. Adv. Chem. Phys. 155, 1–67 (2014).

24. 24.

Perlmutter, J. D., Perkett, M. R. & Hagan, M. F. Pathways for virus assembly around nucleic acids. J. Mol. Biol. 426, 3148–3165 (2014).

25. 25.

Zlotnick, A., Porterfield, J. Z. & Wang, J. C. Y. To build a virus on a nucleic acid substrate. Biophys. J. 104, 1595–1604 (2013).

26. 26.

Perlmutter, J. D. & Hagan, M. F. Mechanisms of virus assembly. Annu. Rev. Phys. Chem. 66, 217–239 (2015).

27. 27.

Zlotnick, A., Aldrich, R., Johnson, J. M., Ceres, P. & Young, M. J. Mechanism of capsid assembly for an icosahedral plant virus. Virology 277, 450–456 (2000).

28. 28.

Johnson, J. M. et al. Regulating self-assembly of spherical oligomers. Nano. Lett. 5, 765–770 (2005).

29. 29.

Chen, C., Kao, C. C. & Dragnea, B. Self-assembly of brome mosaic virus capsids: insights from shorter time-scale experiments. J. Phys. Chem. A 112, 9405–9412 (2008).

30. 30.

Harms, Z. D., Selzer, L., Zlotnick, A. & Jacobson, S. C. Monitoring assembly of virus capsids with nanofluidic devices. ACS Nano 9, 9087–9096 (2015).

31. 31.

Castellanos, M., Pérez, R., Carrillo, P. J. P., de Pablo, P. J. & Mateu, M. G. Mechanical disassembly of single virus particles reveals kinetic intermediates predicted by theory. Biophys. J 102, 2615–2624 (2012).

32. 32.

Medrano, M. et al. Imaging and quantitation of a succession of transient intermediates reveal the reversible self-assembly pathway of a simple icosahedral virus capsid. J. Am. Chem. Soc. 138, 15385–15396 (2016).

33. 33.

Tuma, R., Tsuruta, H., French, K. H. & Prevelige, P. E. Detection of intermediates and kinetic control during assembly of bacteriophage P22 procapsid. J. Mol. Biol. 381, 1395–1406 (2008).

34. 34.

Tresset, G. et al. Norovirus capsid proteins self-assemble through biphasic kinetics via long-lived stave-like intermediates. J. Am. Chem. Soc. 135, 15373–15381 (2013).

35. 35.

Law-Hine, D. et al. Reconstruction of the disassembly pathway of an icosahedral viral capsid and shape determination of two successive intermediates. J. Phys. Chem. Lett. 6, 3471–3476 (2015).

36. 36.

Law-Hine, D., Zeghal, M., Bressanelli, S., Constantin, D. & Tresset, G. Identification of a major intermediate along the self-assembly pathway of an icosahedral viral capsid by using an analytical model of a spherical patch. Soft Matter 12, 6728–6736 (2016).

37. 37.

Kler, S. et al. RNA encapsidation by SV40-derived nanoparticles follows a rapid two-state mechanism. J. Am. Chem. Soc. 134, 8823–8830 (2012).

38. 38.

Borodavka, A., Tuma, R. & Stockley, P. G. Evidence that viral RNAs have evolved for efficient, two-stage packaging. Proc. Natl Acad. Sci. USA 109, 15769–15774 (2012).

39. 39.

Patel, N. et al. Revealing the density of encoded functions in a viral RNA. Proc. Natl Acad. Sci. USA 112, 2227–2232 (2015).

40. 40.

Comellas-Aragonès, M. et al. A virus-based single-enzyme nanoreactor. Nat. Nanotechnol. 2, 635–639 (2007).

41. 41.

Adolph, K. W. & Butler, P. J. G. Studies on assembly of a spherical plant virus. 1. States of aggregation of isolated protein. J. Mol. Biol. 88, 327 (1974).

42. 42.

Lavelle, L. et al. Phase diagram of self-assembled viral capsid protein polymorphs. J. Phys. Chem. B 113, 3813–3819 (2009).

43. 43.

Comas-Garcia, M., Cadena-Nava, R. D., Rao, A. L. N., Knobler, C. M. & Gelbart, W. M. In vitro quantification of the relative packaging efficiencies of single-stranded RNA molecules by viral capsid protein. J. Virol. 86, 12271–12282 (2012).

44. 44.

Speir, J. A., Munshi, S., Wang, G. J., Baker, T. S. & Johnson, J. E. Structures of the native and swollen forms of cowpea chlorotic mottle virus determined by x-ray crystallography and cryoelectron microscopy. Structure 3, 63–78 (1995).

45. 45.

Fox, J. M. et al. Comparison of the native CCMV virion with in vitro assembled CCMW virions by cryoelectron microscopy and image reconstruction. Virology 244, 212–218 (1998).

46. 46.

Baigl, D., Seery, T. A. P. & Williams, C. E. Preparation and characterization of hydrosoluble, partially charged poly(styrenesulfonate)s of various controlled charge fractions and chain lengths. Macromolecules 35, 2318–2326 (2002).

47. 47.

Li, S. Y., Erdemci-Tandogan, G., van der Schoot, P. & Zandi, R. The effect of RNA stiffness on the self-assembly of virus particles. J. Phys. Condens. Matter 30, 044002 (2018).

48. 48.

Garmann, R. F. et al. Role of electrostatics in the assembly pathway of a single-stranded RNA virus. J. Virol. 88, 10472–10479 (2014).

49. 49.

de Ilarduya, C. T., Sun, Y. & Duezguenes, N. Gene delivery by lipoplexes and polyplexes. Eur. J. Pharm. Sci. 40, 159–170 (2010).

50. 50.

Franke, D. et al. ATSAS 2.8: a comprehensive data analysis suite for small-angle scattering from macromolecular solutions. J. Appl. Crystallogr. 50, 1212–1225 (2017).

## Acknowledgements

We thank Emmanuel Trizac for fruitful discussions on electrostatics. M.C. is supported by the “IDI 2016” project funded by the IDEX Paris-Saclay, ANR-11-IDEX-0003-02. D.L.-H. was partly funded by the Triangle de la Physique (contract 2012-065T). J.C. acknowledges a scholarship from the China Scholarship Council (CSC). G.T. acknowledges financial support from the Agence Nationale de la Recherche (contract ANR-16-CE30-0017-01). The electron microscopy imaging is supported by “Investissements d’Avenir” LabEx PALM (ANR-10-LABX-0039-PALM). We also acknowledge the European Synchrotron Radiation Facility (ESRF) and the Laboratoire Léon Brillouin (LLB) for allocation of synchrotron and neutron beam time on ID02 and PAXY beamlines, respectively. Molecular dynamics simulations have been performed at the “Centre de Calcul Scientifique en Région Centre” facility (CCRS-Orléans, France) under the CASCIMODOT program.

## Author information

### Affiliations

1. #### Laboratoire de Physique des Solides, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91405, Orsay Cedex, France

• Maelenn Chevreuil
• , Didier Law-Hine
• , Jingzhi Chen
• , Doru Constantin
• , Jéril Degrouard
• , Mehdi Zeghal
•  & Guillaume Tresset
2. #### Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91198, Gif-sur-Yvette Cedex, France

• Maelenn Chevreuil
•  & Stéphane Bressanelli
3. #### Laboratoire Léon Brillouin (LLB), UMR 12 CEA-CNRS, Université Paris-Saclay, CEA-Saclay, 91191, Gif-sur-Yvette Cedex, France

• Sophie Combet
4. #### European Synchrotron Radiation Facility (ESRF), 71 avenue des Martyrs, 38000, Grenoble, France

• Johannes Möller

### Contributions

M.C., D.L.-H., and G.T. designed the research. M.C., D.L.-H., and G.T. prepared the samples. M.C., D.L.-H., J.C., S.B., S.C., D.C., J.D., J.M., M.Z., and G.T. performed the experiments. M.C., D.L.-H., D.C., S.B., and G.T. analyzed and contributed to interpreting the data. G.T. wrote the paper.

### Competing interests

The authors declare no competing interests.

### Corresponding author

Correspondence to Guillaume Tresset.