Structural dissection of human metapneumovirus phosphoprotein using small angle x-ray scattering

Renner, Max; Paesen, Guido C.; Grison, Claire M.; Granier, Sébastien; Grimes, Jonathan M.; Leyrat, Cédric

doi:10.1038/s41598-017-14448-z

Download PDF

Article
Open access
Published: 01 November 2017

Structural dissection of human metapneumovirus phosphoprotein using small angle x-ray scattering

Max Renner¹,
Guido C. Paesen¹,
Claire M. Grison²,
Sébastien Granier²,
Jonathan M. Grimes^1,3 &
…
Cédric Leyrat ORCID: orcid.org/0000-0003-0189-0562²

Scientific Reports volume 7, Article number: 14865 (2017) Cite this article

2091 Accesses
16 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The phosphoprotein (P) is the main and essential cofactor of the RNA polymerase (L) of non-segmented, negative‐strand RNA viruses. P positions the viral polymerase onto its nucleoprotein–RNA template and acts as a chaperone of the nucleoprotein (N), thereby preventing nonspecific encapsidation of cellular RNAs. The phosphoprotein of human metapneumovirus (HMPV) forms homotetramers composed of a stable oligomerization domain (P_core) flanked by large intrinsically disordered regions (IDRs). Here we combined x-ray crystallography of P_core with small angle x-ray scattering (SAXS)-based ensemble modeling of the full-length P protein and several of its fragments to provide a structural description of P that captures its dynamic character, and highlights the presence of varyingly stable structural elements within the IDRs. We discuss the implications of the structural properties of HMPV P for the assembly and functioning of the viral transcription/replication machinery.

Ensemble description of the intrinsically disordered N-terminal domain of the Nipah virus P/V protein from combined NMR and SAXS

Article Open access 11 November 2020

Marco Schiavina, Edoardo Salladini, … Sonia Longhi

Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus

Article Open access 22 November 2023

Jack D. Whitehead, Hortense Decool, … Max Renner

High-resolution cryo-EM structure of the Pseudomonas bacteriophage E217

Article Open access 08 July 2023

Fenglin Li, Chun-Feng David Hou, … Gino Cingolani

Introduction

Acute respiratory tract infections constitute an important cause of mortality in children under five, with ~1.5 million fatalities reported in 2008¹. Human metapneumovirus (HMPV), first described in the Netherlands in 2001, is a major agent of viral respiratory illness and pneumonia worldwide². Although most often asymptomatic in healthy adults, HMPV can be severe in immunocompromised and elderly populations^3,4. After the closely related respiratory syncytial virus (RSV), HMPV is recognized to be the second most important cause of viral bronchiolitis and pneumonia in young children^4,5. It has been posited that HMPV evolved from an avian metapneumovirus-like ancestor and that there has been a zoonotic cross-species transmission event from birds to humans around 200 years ago⁶. Both HMPV and RSV are classified as members of the Pneumoviridae, recently elevated to family status within the viral order Mononegavirales ⁷. The Mononegavirales order harbours numerous significant human pathogens such as Ebola virus (family Filoviridae), measles virus (family Paramyxoviridae), and rabies virus (family Rhabdoviridae).

All mononegaviruses possess common transcription/replication strategies and a similar genome organisation⁸. In members of Mononegavirales the single-stranded (ss), negative-sense (−) RNA genome is protected by a sheath of oligomerised viral nucleoproteins N (NP in Filoviridae) which form the nucleocapsid^9,10,11. This packaged form of the RNA genome prevents antiviral signalling and degradation by host nucleases, but also serves as the template for transcription and replication by the L polymerase^8,9. The HMPV genome (~13 kB) encodes 9 genes, N-P-M-F-M2-SH-G-L, ordered from 3′- to 5′-end of the RNA which are transcribed sequentially by L^12,13 within cytoplasmic viral inclusion bodies that serve as viral transcription/replication factories^14,15,16. To transcribe viral mRNAs from the genome and to replicate progeny genomes L requires the essential ancillary factor P^17,18. In addition, processive transcription into full-length and polycistronic mRNAs also requires the antitermination factor M2–1^19,20. Components of the replicase/transcriptase complexes, consisting of the N, L, P, and M2-1 proteins, are important targets for the development of antiviral therapeutics²¹.

The polymerase cofactor P (VP35 in Filoviridae) serves as a central hub of the replicase/transcriptase by bringing together their various elements, but also performs key functions throughout the viral life cycle. In the case of HMPV, the phosphoprotein was shown to play an important role in direct cell-to-cell viral spread by co-localizing with actin and inducing membrane deformations in bronchial airway cells²². A role in manipulating the host immune response by preventing RIG-I-mediated sensing of HMPV viral 5′ triphosphate RNA has also been demonstrated for the phosphoprotein of the HMPV B1 strain²³. Pneumoviral P interacts with both L and the nucleocapsid, thereby tethering the polymerase to its template^24,25,26,27. During transcription, P also recruits the M2-1 antiterminator to the polymerase, where M2-1 is thought to bind nascent viral mRNA emerging from L^28,29,30. In replication, P supplies growing progeny nucleocapsids with naive, RNA-free nucleoproteins (N⁰) for immediate, co-transcriptional packaging. This is achieved via an N-terminal region of P which binds to N⁰, thereby preventing premature RNA encapsidation and oligomerisation^31,32,33,34. A hallmark of mononegavirus P proteins is their propensity to form distinct functional oligomers. In HMPV and RSV a central oligomerization domain facilitates the formation of P tetramers^35,36,37. Polymerase cofactors from members of Mononegavirales feature conditionally-folded molecular recognition elements (MoREs) and extended intrinsically disordered regions (IDRs) which cannot be described by a single unique conformation in solution^{26,37,38,39,40,41}. This characteristic seriously hampers the structural understanding of full-length P proteins by x-ray crystallography or cryo-electron microscopy and, consequently, to date these techniques have not delivered a structural description of any full-length mononegavirus polymerase cofactor. However, the use of structural information from isolated stable domains combined with small angle x-ray scattering (SAXS) and molecular dynamics simulations (MDS) can reveal an ensemble description of the entire protein. Here we have used this integrated approach to structurally dissect the functional regions of HMPV P.

Results

Structures of P_core from two new crystal forms

We have previously reported the crystallographic structure of the HMPV P_core domain (P residues ~169–194) at an intermediate resolution of 3.1 Å³⁷. Serendipitously (see Materials and Methods), we obtained two additional crystal forms of P_core with improved resolution, allowing us to build a higher quality model (100% Ramachandran favoured, see Table 1). The first crystal (form 1, space group P2₁) diffracted to a resolution of 1.6 Å, while the second crystal (form 2, space group P2₁2₁2₁) gave rise to diffraction data up to 2.2 Å. In all of the crystal forms, including the original crystal (PDB ID: 4BXT)³⁷, the asymmetric unit is composed of two tetrameric, helical coiled-coils packing against each other in varying orientations (Fig. 1A–C). Sample electron density maps of the higher resolution P_core structures are shown in Supplementary Fig. S1.

Table 1 Crystallographic data collection and refinement statistics.

Full size table

Mapping of the sequence conservation within the Pneumoviridae onto P_core reveals the strict conservation of the hydrophobic amino acids (Leu and Ile residues) lining the interior of the coiled-coil, indicating that all family members share a similar P oligomerization region (Fig. 1D, strictly conserved residues in purple). Furthermore, Arg175 is strictly conserved in all Pneumoviridae and plays a role in stabilizing the quaternary structure of the tetramer by forming a salt bridge with either an Asp or Glu on the neighbouring helix. Notably, residues Ser184 and Glu181 are also invariable despite being exposed to the solvent and not directly involved in oligomerization. It is tempting to speculate that these residues may instead be involved in interactions with other viral components, for instance the L polymerase⁴².

In one of the tetramers within crystal form 2 we observed a single, solvent-exposed helix which possessed markedly blurred electron density and, as a result, significantly higher crystallographic B-factors than the three other protomers of the coiled-coil (Fig. 1E, white arrow). This suggests that, in absence of packing constraints within the crystal (as is the case for this solvent-exposed helix), the protomers of the P_core region are somewhat flexible and there is a certain degree of malleability of the tetramer interface. Alignment of all six tetramers from crystal forms 1 and 2 and the original crystal confirmed that there are small-scale breathing motions within the coiled-coil (Fig. 1F). This is fully consistent with the overall lower buried surface area (~4800 Å) and predicted dissociation energy (~21 kcal/mol) for HMPV P than in the more extended paramyxoviral coiled-coil oligomerization domains. For comparison, the buried surface area and predicted dissociation energy of the Nipah virus P oligomerization domain have been reported to lie between 15000–20000 Å and 140–200 kcal/mol⁴³. These data indicate a flexibly associated tetramer of HMPV P protomers.

SAXS characterization of the HMPV P constructs

Next, we turned to SAXS in order to obtain structural information in the solution state for the different functional regions of P (Fig. 2). In total 4 P constructs were characterized by SAXS: 1) the N-terminal region (P_1–60) which encompasses the N⁰ binding site, 2) the central region (P_135–237) composed of the M2-1 binding site, the tetramerization domain P_core and a putative nucleoprotein binding region, 3) the P_135–294 construct, which includes potential L and N binding regions at its C-terminus, and 4) the full-length P protein (P_1–294) (domain annotations are shown in Fig. 2A and Supplementary Fig. S2). Our approach of splitting the protein into different fragments enables more accurate X-ray scattering profiles to be obtained compared to focussing on the full-length protein only, thus increasing the overall structural information content that can be extracted from the SAXS data. For each construct, high-quality SAXS profiles could be obtained (Fig. 2B), and samples were free from aggregates as evidenced by the linearity of the Guinier region (Fig. 2C). Parameters extracted from the SAXS data are summarized in Table 2. Molecular weights were estimated based on calculation of the concentration-independent volume of correlation V_C, as defined in⁴⁴, and were found to be consistent with the theoretical molecular weight expected for each construct. In the case of the full-length P, a SEC-MALLS profile was also recorded, which confirmed the sample monodispersity, oligomerization state, and molecular weight (Supplementary Fig. S3, MW_MALLS = 125 kDa ± 12 kDa and MW_theor. = 134.8 kDa). The SAXS-derived radii of gyration (R_g) were independent of concentration for P_1–60 and P_135–237. On the contrary P_135–294 and P_1–294, respectively, were observed to significantly compact or expand with increasing protein concentration indicating the presence of a structure factor contribution to the data, which was treated through data merging of different protein concentrations (Table 2).

Table 2 SAXS-derived parameters.

Full size table

We used normalized Kratky plots to obtain a semi-quantitative view of the level of intrinsic disorder present in each construct. As can be seen in Fig. 2D, all constructs showed a mixed profile indicating a balance between order and disorder, with the exception of P_1–60 which displayed a typical intrinsically disordered protein (IDP) Kratky plot. This is likely due to the absence of the tetramerization domain, the most ordered region in P. A similar behaviour was observed when the experimental R_g of each construct was placed on a R_g versus number of residues plot and compared with the empirical laws available for both globular⁴⁵ and intrinsically disordered proteins⁴⁶ (Fig. 2E). P_1–60 appeared as a canonical IDP while the tetrameric constructs showed an intermediate behavior between fully folded proteins and IDPs. Of note, the full-length P_1–294 was slightly offset towards the IDP curve compared to P_135–237 and P_135–294, consistent with an increase in intrinsic disorder upon inclusion of the N-terminal region. We then used computational modeling in combination with ensemble optimization to derive SAXS-validated ensembles of atomic models for each construct.

The N-terminal region of P shows α-helical propensity and is in equilibrium between extended and more compact conformers

All-atom models of P_1–60 were generated using the state-of-the-art program flexible-meccano⁴⁷ and used for ensemble optimization of the available experimental SAXS curves, resulting in a high-quality fit to the data (χ_exp ~ 0.6 - Table 2, Fig. 3A). Because no significant concentration dependence of the R_g was observed, an average R_g distribution was calculated and is shown in Fig. 3B, revealing an equilibrium between two populations of different sizes centered around 2.2 nm and 4.0 nm. Representative models of these two populations are shown in Fig. 3C. A significant fraction of the selected models (~20%) displayed α-helical structure in the previously reported N⁰ binding region (Fig. 3C) indicating that this MoRE is able to partially fold as an α-helix in the absence of its binding partner.

Ensemble analysis of the central region of P indicates the presence of loose tertiary structure located C-terminally of P_core

Ensembles of atomic models of P_135–237 were generated by combining classical and coarse-grained atomistic molecular dynamics simulations and using the available SAXS-validated models of P_158–237 as templates (see material and methods)³⁷. The two constructs differ only by the presence of the putative M2–1 binding region approximately located between residues 135 and 158, which mainly adopts extended conformations within the selected ensembles, although residual α-helical structure between residues 138–145 is present in a small fraction of the models. The ensemble optimization results show that the selected ensembles (Fig. 4A) successfully reproduced the experimental data (0.6 < χ_exp < 1.1 - Table 2, Fig. 4C). The R_g distribution highlights the equilibrium between two main populations (Fig. 4B) which differ in the conformation of their C-terminal regions (Fig. 4A, inset). Residues 204–219 (which overlap with a recently proposed secondary N-binding site in RSV²⁶) tend to form an α-helix (α_204–219) that packs against the P_core mainly through hydrophobic interactions. This relatively unstable structural motif is present in about 30% of the selected models and gives rise to a somewhat more compact population, shifting down the R_g values by about 1 nm compared to the other, more extended population. Interestingly, these ordered conformations of residues 204–219 have originally been observed when modeling P_158–237 and were found to be relatively stable for several hundreds of nanoseconds of classical MDS both in our previous work³⁷ and in this study (not shown). It is worth noting that when performing the ensemble optimization using a pool in which the compact population of models was excluded, we could observe a significant deterioration of the χ_exp value for the highest concentration SAXS curve from 1.09 to 1.49, indicating that the presence of a packed α_204–219 in the ensemble models is important to correctly reproduce the experimental SAXS profile.

To further assess the validity of our ensemble analysis procedure, SAXS profiles of P_135–237 were recorded at increasing concentrations of guanidinium hydrochloride (Gdn-HCl) (Fig. 5 and Supplementary Fig. S4). Addition of 1, 2 and 3 M Gdn-HCl resulted in the progressive disappearance of the most compact population and a concomitant increase in the proportion of extended models (Fig. 5A and B). As can be seen in Fig. 5A, the population of models where α_204–219 packs against the neighboring P_core is completely absent in the 1 M Gdn HCl selected ensemble, while residual α-helical structure in this region is retained. At higher Gdn-HCl concentrations, these secondary structure elements are also lost. Interestingly, P_core was found to remain stable at all Gdn-HCl concentrations indicating a highly stable tetrameric core.

The C-terminal region of P may stabilize the loose tertiary structure present in the central region

Models of P_135–294 were obtained using coarse-grained atomistic MDS based on the P_135–237 modeling results. Again, the quality of the fit to the experimental data was very good (χ _exp = 0.5 - Table 2, Fig. 6A). The R_g distribution is dominated by a main population centered around 5–6 nm in equilibrium with more extended models (Fig. 6C). At high protein concentrations, we observed a significant decrease of the measured R_g from 5.66 to 5.36 nm (Table 2). This concentration-dependent R_g decrease might be linked to the presence of a highly negatively charged patch present in the C-terminal part of the molecule (Supplementary Fig. S5), leading to long range intermolecular repulsion. About 55% of selected models appear to have the packed α_204–219, suggesting that the C-terminal region of the protein might stabilize the cap of helices located C-terminally to the tetramerization domain (and also observed with the P_135–237 construct). The 75 C-terminal amino acids adopt various disordered conformations, with a significant fraction of the selected models displaying a short α-helical motif near their C-terminus in the predicted region (residues 253–262, at the beginning of the putative L-binding site) (Fig. 6B). It is however not possible to ascertain the presence of this motif based on SAXS data alone for such a large construct, as only a modest fraction of the scattering arises from this region of the molecule.

Ensemble structure of the full-length phosphoprotein

Finally, we generated an ensemble of atomic models of P_1–294 using coarse-grained MDS based on the results of the P₆₀ and P_135–294 modeling and performed ensemble optimization against the experimental SAXS data. We were able to adequately fit the data with χ_exp = 1.7 (Fig. 7A and Table 2). A representative ensemble of models optimized using the merged SAXS curve is shown in Fig. 7C. The full-length phosphoprotein appears as a large, tentacular molecule with maximal intramolecular distances (D_max) roughly comprised between 25 and 35 nm, and a partitioning of the N-terminal and C-terminal intrinsically disordered extensions on each side of P_core. The R_g distribution shows a main population centered around 7–8 nm (Fig. 7B) and about 60% of selected models displayed a packed α_204–219. Contrary to P_135–294, higher protein concentrations lead to higher apparent R_g values that increase from 7.1 to 7.5 nm in the (relatively narrow) concentration range used (Table 2). This concentration-dependent increase in R_g indicates attractive interparticle interference which possibly results from the presence of both highly negatively and positively charged patches located in the N-terminal and C-terminal regions (Supplementary Fig. S5).

Discussion

The essential polymerase cofactor P from Pneumoviridae is a multifunctional hub which interacts with numerous components of the viral RNA-synthesis machinery. It also possesses large IDRs and can therefore not easily be characterized by classical structural biology methods, such as X-ray crystallography. NMR studies of the RSV P protein have shown that IDRs account for around 80% of the protein²⁶. Analysis of whole eukaryotic genomes has suggested that as much as 41% of protein sequences contain IDRs of significant length⁴⁸ and research on IDPs indicates that these regions are frequently involved in protein-protein interactions (reviewed in⁴⁹). Pneumoviridae P is commonly described as possessing an N-terminal IDR (mainly associated with binding RNA-free N) and a C-terminal IDR (mainly associated with binding N-RNA), which are separated by a central tetramerization module, P_core ^24,32,37,50. In this study we present a description of the structure and dynamics of HMPV P, using a combination of crystallography of stable/bound domains and SAXS ensemble analysis. We show that full-length phosphoprotein exists in solution as an ensemble of inter-converting conformers with a tentacular architecture. The four long N-terminal IDRs and the four somewhat shorter C-terminal IDRs are partitioned by the central tetramerization domain owing to the parallel orientation of P_core α-helices. The P protein tetramer (1176 residues) has huge dimensions, with an average R_g of 7.5 nm and an average D_max of about 30 nm. For comparison, the entire L polymerase (2109 residues) of vesicular stomatitis virus (VSV), which is likely similar in size as HMPV L polymerase, displays a R_g of 3.9 nm and a D_max of about 13 nm, based on its cryo-electron microscopy structure⁵¹. Undoubtedly, the large dimensions and flexibility of the P protein reflect its functions in viral transcription and replication which require the simultaneous binding and adequate positioning of the N⁰, the M2-1 anti-terminator, the L polymerase and the nucleocapsid. In addition, simultaneous binding of P to multiple N-RNA subunits may influence the conformation and curvature of the nucleocapsid.

Conformation of the N-terminal region

The disordered N-terminal region of P has been implicated in chaperoning the N protein, keeping it in an RNA-free state prior to nucleocapsid assembly, for a range of negative strand viruses^{32,33,39,52,53}. This is important as N may otherwise unproductively bind to host nucleic acids. N-chaperoning is facilitated by a MoRE located towards the very N-terminus of the P-protein, which folds upon binding nascent and RNA-free nucleoproteins^31,39. Kratky analysis of the first 60 N-terminal residues of HMPV P (P₆₀) confirmed the disordered character of this region. However, in our ensemble analysis 20% of the models exhibited helical secondary structure within residues 13–28, corresponding to the same helix that is observed in the crystal structure of HMPV P bound to N₀ ³². Interestingly, a recent NMR study similarly observed a 20% helical propensity in the equivalent residues (amino acids 12–24) of RSV P²⁶. Such pre-formed secondary structure elements in the apo form have been suggested to lead to increased association rates to the binding partner via conformational selection⁴⁹. Rapid on-rates may be especially important in the case of N-chaperoning as N⁰ needs to be captured by P before unspecific RNA-uptake and polymerization can take place, leading to a dead-end state for N. Following initial binding, the pre-formed helix may act as a nucleus from which the N-P binding interface is extended to what is observed in the crystallographic structure (encompassing P residues 1–28) via a dock-and-coalesce mechanism⁵⁴.

Conformation of the M2-1 binding region

Pneumoviral P proteins recruit the additional processivity factor M2-1 to the viral transcription machinery, forming an extended non-globular complex^28,29,55. HMPV M2-1 is a highly dynamic, tetrameric, RNA-binding protein featuring a CCCH-type zinc-finger on each protomer, which is involved in recognition of nucleic acids^30,56. The M2-1 protein is thought to bind to nascent RNA emerging from the viral polymerase, thus inhibiting premature termination of transcripts via an as-of-yet undetermined mechanism^{19,20,57,58,59}. Mutational studies have mapped RSV P residues 100–120 as the M2-1 interaction site⁶⁰, which roughly corresponds to the region around residues 135–158 in HMPV. In contrast to the significant fraction of pre-formed N⁰-binding region described above, our ensembles display only marginal residual secondary structure within the M2-1 binding site of apo P. In-line with this observation, the NMR study of RSV P could detect a small segment of five residues (RSV amino acids 98–103) with only 8% helical propensity at the M2-1 binding region²⁶. In this case, recruitment of M2-1 by P may involve folding-upon-binding of an otherwise highly unstructured region. M2-1 and P may thus form a so-called fuzzy complex, in which substantial flexibility is retained even within the bound form⁶¹.

Conformation of the C-terminal IDR

Previous crystallographic and solution scattering studies of Nipah virus^43,62 and Sendai virus P⁶³ have highlighted a cap-like structure of helices folding back onto the N-terminal part of the oligomerization domain. However, the cap was not observed in the crystal structures of the helical coiled-coil oligomerization domains of Measles virus P⁶⁴, Mumps virus P⁶⁵, or HMPV P³⁷. Although not present in the crystal, in our HMPV P solution ensembles we find a compact subpopulation of conformers which consistently features a cap of helices (amino acids 204–219) located C-terminally of the tetramerization domain. It is quite possible that the cap is a specific feature of pneumoviral P proteins that exists in solution but is insufficiently stable, at least in HMPV, to be observed in the crystalline state. Deletion studies for bovine respiratory syncytial virus (BRSV) P have suggested that the region corresponding to the cap in HMPV may represent a secondary nucleoprotein binding site, besides the primary one at the very C-terminus of P⁶⁶. In addition, a recent NMR study of RSV P has identified a helical region partially overlapping with the corresponding cap-forming residues in HMPV²⁶. The authors observed line broadening in this helix upon adding the N-terminal domain of the nucleoprotein, strongly indicating that these do indeed interact. It is worth noting that the directly adjacent oligomerization region has been suggested to interact with the large L polymerase, at least in BRSV⁴². The secondary N-binding site of the cap-helices may thus bring L and N into close vicinity and may facilitate the interplay of N with L during the viral RNA-synthesis cycle. This is in line with mutational studies in RSV which have revealed that mutation of a conserved charged cluster within the cap to alanines (corresponding HMPV residues R215/E216/E217) abrogated all reporter gene activity in a minigenome assay⁶⁷. The R215/E216/E217 cluster of the cap-structure in HMPV P is solvent accessible in our ensembles and may thus be positioned ideally to interact with further viral components.

Our structural ensembles of P_135–294 and P_1–294 feature a C-terminal α-helix in the predicted region (residues 253–262), which is part of a putative L-binding site that was mapped by mutagenesis studies in HRSV²⁷. Although it is difficult to validate the presence of this secondary structure element on the basis of SAXS data alone due to the weak contribution to the scattering signal arising from this region of the molecule (within the experimental SAXS profiles), it is worth mentioning that this region was suggested to fold upon binding to the L polymerase based on the effect of point mutations on HRSV RNA synthesis²⁷. However, this region of the P protein is poorly conserved between RSV and HMPV owing to the insertion of a highly acidic stretch of about 18 residues in the middle of the L binding site in the HMPV P sequence (Supplementary Fig. S2), and no residual α-helical structure could be detected in the equivalent region of RSV P by NMR²⁶.

Long range effects modulating P conformational ensembles

The structural characterization of P_135–294 and P_1–294 also revealed the ability of specific IDRs of the P protein to influence the conformation of other parts of the protein. In particular it was observed that a significantly higher proportion of models featuring the α_204–219 cap-like structure is present in P_135–294 and P_1–294 structural ensembles compared to P_135–237 (55–60% versus 30%). The ability of the C-terminal region of P to stabilize this α-helical structure might explain its absence from the crystal structures of P_core (as the P constructs used for crystallization required C-terminal degradation in order to yield diffracting crystals, which in turn destabilizes the cap-like structure). We hypothesize that this property might arise from the highly acidic nature of the C-terminal region of P which contains a large number of negatively charged residues arranged in repetitive blocks (for example residues 140–170 and residues 263–294, Supplementary Figs S2 and S5), leading to intramolecular electrostatic repulsion. Interestingly, the full-length P protein SAXS profiles display higher apparent R_g values at high protein concentrations, indicating attractive interparticle interference. Based on the distribution of charged residues along the P sequence, it is tempting to speculate that the presence of basic patches of residues within the N-terminal IDR (Supplementary Fig. S5) combined with the acidic C-terminal IDR is responsible for this attraction.

Possible implications for viral inclusion bodies formation

The presence of a structure factor contribution in the SAXS profiles of P within the relatively small concentration range used for measurements (0.3 to 2 mg/ml) shows that the protein has a propensity to affect the structure of the liquid itself when present at high concentrations. This property might be relevant to viral replication in physiological conditions as pneumoviral replication was shown to occur in segregated cytoplasmic inclusion bodies which harbour the components of the replication machinery and concentrate viral proteins^68,69. Furthermore, expression of N and P was necessary and sufficient to induce the formation of cytoplasmic inclusions in HMPV and HRSV^15,16,70, but also in more distantly related viruses such as human parainfluenza virus type 3⁷¹ or rabies virus^72,73. Recently, there is great interest in the ability of proteins with IDRs to form phase-separated micro-compartments without the need of lipid bilayers, especially in conjunction with molecular crowding^74,75 (cellular examples of such membrane-less compartments are Cajal bodies or cytoplasmic stress granules). These phase-separated organelles feature higher local concentrations of a subset of factors which are often relevant for a functionally related pathway. Typically, proteins with low-complexity regions, low sequence diversity, multivalent binding properties, and blocks of oppositely-charged residues are potentially able to form these biomolecular condensates under specific conditions. The amino-acid compositions of many mononegavirus P proteins satisfy these prerequisites, with HMPV P possessing low-complexity regions, and being composed of over 35% Lys, Arg, Glu, and Asp, often arranged in repetitive blocks. Because each N molecule displays 2 P binding sites, it is tempting to speculate that binding of P proteins to N-RNA and N⁰ in cells would result in very high local concentrations of P. This, in turn might facilitate the formation of phase-separated microenvironments, constituting highly specialized viral RNA-synthesis and replication micro-factories. Phosphorylation of constituent proteins has previously been shown to control phase-separating behaviour^76,77. Similarly, phosphorylation of P-proteins might modulate their general phase-separating behaviour, instead of or along with specific protein-protein interactions, thus controlling inclusion body formation.

In summary, we have shown that HMPV P is a highly dynamic protein that is composed of a stable helical tetramerization domain flanked by large N-terminal and C-terminal IDRs that sample a large volume in solution and display varying degrees of preformed structural elements. We found that the N-terminal N⁰ binding site contained a significant proportion of α-helical structure and that the region C-terminally adjacent to the tetramerization domain had a tendency to form a cap-like helical structure that mapped to a putative N- RNA binding site. Our results further suggested that this cap-like structure might be stabilized by the presence of the full C-terminal IDR, and that the full-length phosphoprotein’s basic and acidic patches of residues may play a role in viral inclusion body formation by inducing long range intermolecular attraction and facilitating the formation of phase-separated microenvironments.

Material and Methods

Protein cloning, expression and purification

The regions of the HMPV P gene (strain NL1-00, A1, GenBank: AAK62966.1) encompassing phosphoprotein residues 1–60, 135–237, 135–294, and 1–294 were amplified by polymerase chain reaction and cloned into pOPINF (P 1–60, 135–237 and 135–294) or pOPINE (P1-294) plasmids⁷⁸ using the In-Fusion system (TAKARA CLONTECH) following the instructions of the manufacturer. Recombinant proteins expressed from pOPINF feature a N-terminal (His)₆-tag followed by a 3C protease site, while pOPINE has a C-terminal (His)₆-tag. All constructs were verified by nucleotide sequencing.

The (His)₆-tagged constructs were transformed into Rosetta2 E. coli cells for recombinant expression. E. coli were grown at 37 °C in terrific broth (TB) in presence of appropriate antibiotics to an OD₆₀₀ of ~0.8 and expression was induced by addition of 1 mM β-D-1-thiogalactopyranoside (IPTG). Following induction, the cells were incubated at 18 °C overnight while shaking and were subsequently harvested by centrifugation (18 °C, 20 min, 4000×g). Cell pellets were resuspended in 20 mM Tris, pH 7.5, 500 mM NaCl and then lysed by sonication. The lysate was centrifuged for 45 min at 4 °C and 50000×g. The cleared lysate was syringe-filtered (0.45 μm pore size, MILLIPORE) and transferred onto a column packed with pre-equilibrated Ni²⁺-NTA Agarose (QIAGEN). Following several washes, the (His)₆-tagged samples were eluted from the beads with 20 mM Tris, pH 7.5, 150 mM NaCl, 300 mM imidazole. Finally, the samples were subjected to size exclusion chromatography and buffer exchanged into 20 mM Tris, pH 7.5, 150 mM NaCl. For small-angle X-ray scattering experiments, proteins were concentrated on-site with centrifugal filter units (MILLIPORE).

Small angle X-ray scattering experiments

SAXS measurements were performed on beamline BM29 (P_1–60, P_135–237 and P_135–294) and former beamline ID14-3 (P_1–294) at the European Synchrotron Radiation Facility (ESRF), Grenoble, France. Samples were kept at 20 °C and data were collected at a wavelength of 0.0995 nm and a sample-to-detector distance of 1 m. 1D scattering profiles were generated and buffer subtraction was carried out by the automated data processing pipeline available at BM29 (ID14-3). The radius of gyration was determined with the program PRIMUS⁷⁹ according to the Guinier approximation at low Q values, and molecular weights were estimated based on⁴⁴. In the case of P_135–294 and P_1–294, which displayed concentration dependent changes in the SAXS profile at low Q values, a merged curve was produced by combining the low Q region of data measured at low concentration with the high Q region of the high concentration SAXS profiles. These particle interference-free merged curves were used for subsequent analysis.

Structural modelling and molecular dynamics simulations

Different strategies were used to obtain all atom models for each construct. 1000 models of the backbone of monomeric P_1–60 were obtained using the program flexible-meccano⁴⁷, with an α-helical propensity of 50% for residues 14–26 which were observed to form an α-helix when complexed with the HMPV N protein³². Protein side chains were then added using the program SCCOMP⁸⁰.

Initial models of P_135–237 were built based on three families of models extracted from our previous HMPV P modeling study that were found to accurately reproduce SAXS data for this P_158–237 ³⁷. All three models are composed of a tetrameric coiled coil ranging from residues 168 to 198, with disordered residues at the N-terminus that have residual α-helical structure. In the first model, residues 202–219 adopt α-helical structures that pack laterally against the C-terminal part of the coiled coil region, while residues 220–237 are extended. In the second model, residues 208–237 form an α-helix consistently with secondary structure predictions, and in the third model, residues 168–237 are in an extended conformation with no secondary or tertiary structure. The region composed of residues 135–158 (including the N-terminal His6-3C site) was obtained based on a LOMETS model⁸¹ adopting a relatively extended conformation which was grafted onto the three initial models of P_158–237. Two additional starting models were generated by shortening the coiled coil by one helix turn at its C-terminus in the third model (to match the x-ray structure) and by further removing residual helical structure in the N-terminal region (that had been carried out from previous classical M.D. simulations). All five starting models of P_158–237 were then simulated in GROMACS⁸² using either an atomistic coarse-grained structure-based model (SBM)^83,84, or explicit solvent classical molecular dynamics simulations (MDS).

In the case of the SBM MDS, a timestep of 0.0005 time units was used and the simulation was coupled to a temperature bath via Langevin dynamics. A single 100 ns trajectory was obtained for each starting model, and snapshots were extracted every 50 ps leading to an ensemble of 5000 models. In the case of classical MDS, we generated multiple trajectories for an aggregated simulation time of ~660 ns. MDS was performed using the amber99SBws forcefield⁸⁵ which has been developed to reproduce the properties of intrinsically disordered proteins. At the beginning of each simulation, the protein was immersed in a box of SPC/E water, with a minimum distance of 0.9 nm between protein atoms and the edges of the box. 150 mM of NaCl were then added using genion. Long range electrostatics were treated with the particle-mesh Ewald summation⁸⁶. Bond lengths were constrained using the P-LINCS algorithm. The integration time step was 5 fs. The v-rescale thermostat and the Parrinello–Rahman barostat were used to maintain a temperature of 300 K and a pressure of 1 atm. Each system was energy minimized using 1,000 steps of steepest descent and equilibrated for 500 ps with restrained protein heavy atoms prior to production simulations. Snapshots were extracted every 200 ps from each trajectory, leading to the generation of ~3300 additional models of P_135–237.

In order to generate models of P_135–294 and of P_1–294, we adopted a similar strategy in which residues 238–294 and residues 1–134 were grafted onto the existing P_135–237 models with or without the α-helical secondary structure elements (residues 14–26 and residues 251–262). Model types that were not selected through ensemble optimization of P_135–237 were not considered for this procedure. We then used the SBM approach to generate ensembles for P_135–294 and P_1–294, yielding ~5000 P_135–294 and ~8000 P_1–294 models.

Ensemble optimization

For each model from each ensemble, theoretical SAXS patterns were calculated with the program CRYSOL⁸⁷ and ensemble optimization fitting was performed with GAJOE^88,89. GAJOE uses a genetic algorithm to select from a large pool of conformers optimized sub-ensembles that minimize the discrepancy between the experimental and calculated curves χ _exp according to the following equation:

$$\chi {exp}^{2}=\frac{1}{K\,-\,1}\sum _{j=1}^{K}{[\frac{\mu I({Q}_{j})-{I}_{exp}({Q}_{j})}{\sigma ({Q}_{j})}]}^{2}$$

(1)

where K is the number of points in the experimental curve, σ is the standard deviation and µ is a scaling factor. The optimum selected ensemble size and relative weights of the models were determined automatically by GAJOE. For each curve, the ensemble optimization procedure was repeated for a minimum of 20 times, from which the Rg distributions of the optimized ensembles were built.

Crystallization and data collection

Our initial goal was the crystallization of a complex of the HMPV M2-1 protein bound to a P construct including the putative M2-1 binding region. The expression and purification of HMPV M2-1 has been described previously³⁰. In an attempt to crystallize the M2-1 – P_135–237 complex, vapour diffusion crystallization trials of a 1:1 mixture of these two proteins at 7 mg/ml in 20 mM Tris, pH 7.5, 150 mM NaCl were set up using a Cartesian Technologies pipetting system⁹⁰. Although we were not able to grow crystals of a complex, crystals which later proved to harbour only the P oligomerization region could be obtained after extended time periods. The P2₁ crystal (form 1) of HMPV P_core grew at 20 °C after 291–344 days with mother liquor containing 20% polyethylene glycol (PEG) 6000, 200 mM NaCl, and 100 mM Tris, pH 8.0. The P2₁2₁2₁ crystal (form 2) grew at 20 °C after between 132 and 185 days with mother liquor containing 25% PEG 3350, 200 mM MgCl, and 100 mM Tris, pH 8.5. Crystals were frozen in liquid nitrogen after being cryoprotected with 25% glycerol. Diffraction data were recorded on beamlines I03 (P2₁ crystal) and I04 (P2₁2₁2₁ crystal) at Diamond Light Source, Didcot, UK. Data reduction was carried out automatically with XIA2⁹¹.

Structure determination and refinement

The HMPV P_core data sets were phased by molecular replacement with PHASER⁹² using the previously published structure (pdbID:4BXT). The structures from both crystal forms were subjected to multiple rounds of manual building in COOT⁹³ and refinement in PHENIX⁹⁴. We made use of translation-libration-screw (TLS) parameters and 8-fold torsion-angle non-crystallographic symmetry (NCS) restraints as implemented in PHENIX⁹⁴. For the 1.6 Å data of crystal form 1 we additionally carried out anisotropic atomic displacement parameter (ADP) refinement. The structures were validated with the wwPDB Validation Service (https://validate-rcsb-1.wwpdb.org/). Refinement statistics are given in Table 1. The final coordinates and structure factors have been deposited in the PDB with accession codes 5OIX and 5OIY.

Structure and sequence analyses

Structure-related figures were prepared with the PyMOL Molecular Graphics System (DeLano Scientific LLC). Protein interfaces were analysed with the PISA webserver⁹⁵. Mapping of sequence conservation onto the P_core structure was carried out with the ConSurf server⁹⁶ using P sequences from the Pneumoviridae family members human metapneumovirus (HMPV), avian metapneumovirus (AMPV), canine pneumonia virus (CPV), murine pneumonia virus (MPV), bovine respiratory syncytial virus (BRSV), and human respiratory syncytial virus (HRSV). Sequences were aligned using using PROMALS3D⁹⁷ and Jalview⁹⁸ in order to analyse the conservation of the protein binding sites that were identified in RSV. The putative functional regions have been assigned based on homology and previously reported studies^{16,26,27,42,60,66,99,100}. The linear net charge per residue (NCPR) for HMPV P was calculated using the Classification of Intrinsically Disordered Ensemble Regions (CIDER) webserver¹⁰¹.

Data availability

Coordinates and structure factors have been deposited in the Protein Data Bank with accession numbers 5OIX and 5OIY. The SAXS datasets generated during the current study are available from the corresponding author on reasonable request.

References

Black, R. E. et al. Global, regional, and national causes of child mortality in 2008: a systematic analysis. Lancet 375, 1969–1987, https://doi.org/10.1016/S0140-6736(10)60549-1 (2010).
Article PubMed Google Scholar
van den Hoogen, B. G. et al. A newly discovered human pneumovirus isolated from young children with respiratory tract disease. Nature medicine 7, 719–724, https://doi.org/10.1038/89098 (2001).
Article PubMed CAS Google Scholar
van den Hoogen, B. G. et al. Prevalence and clinical symptoms of human metapneumovirus infection in hospitalized patients. The Journal of infectious diseases 188, 1571–1577, https://doi.org/10.1086/379200 (2003).
Article PubMed Google Scholar
Schildgen, V. et al. Human Metapneumovirus: lessons learned over the first decade. Clin Microbiol Rev 24, 734–754, https://doi.org/10.1128/CMR.00015-11 (2011).
Article PubMed PubMed Central Google Scholar
Esposito, S. & Mastrolia, M. V. Metapneumovirus Infections and Respiratory Complications. Semin Respir Crit Care Med 37, 512–521, https://doi.org/10.1055/s-0036-1584800 (2016).
Article PubMed Google Scholar
de Graaf, M., Osterhaus, A. D., Fouchier, R. A. & Holmes, E. C. Evolutionary dynamics of human and avian metapneumoviruses. J Gen Virol 89, 2933–2942, https://doi.org/10.1099/vir.0.2008/006957-0 (2008).
Article PubMed CAS Google Scholar
Afonso, C. L. et al. Taxonomy of the order Mononegavirales: update 2016. Arch Virol 161, 2351–2360, https://doi.org/10.1007/s00705-016-2880-1 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ortin, J. & Martin-Benito, J. The RNA synthesis machinery of negative-stranded RNA viruses. Virology 479-480, 532–544, https://doi.org/10.1016/j.virol.2015.03.018 (2015).
Article CAS PubMed Google Scholar
Ruigrok, R. W., Crepin, T. & Kolakofsky, D. Nucleoproteins and nucleocapsids of negative-strand RNA viruses. Curr Opin Microbiol 14, 504–510, https://doi.org/10.1016/j.mib.2011.07.011 (2011).
Article CAS PubMed Google Scholar
Gutsche, I. et al. Structural virology. Near-atomic cryo-EM structure of the helical measles virus nucleocapsid. Science 348, 704–707, https://doi.org/10.1126/science.aaa5137 (2015).
Article ADS CAS PubMed Google Scholar
Tawar, R. G. et al. Crystal structure of a nucleocapsid-like nucleoprotein-RNA complex of respiratory syncytial virus. Science 326, 1279–1283, https://doi.org/10.1126/science.1177634 (2009).
Article ADS CAS PubMed Google Scholar
Sutherland, K. A., Collins, P. L. & Peeples, M. E. Synergistic effects of gene-end signal mutations and the M2-1 protein on transcription termination by respiratory syncytial virus. Virology 288, 295–307, https://doi.org/10.1006/viro.2001.1105 (2001).
Article CAS PubMed Google Scholar
Noton, S. L. & Fearns, R. Initiation and regulation of paramyxovirus transcription and replication. Virology 479-480, 545–554, https://doi.org/10.1016/j.virol.2015.01.014 (2015).
Article CAS PubMed Google Scholar
Norrby, E., Marusyk, H. & Orvell, C. Morphogenesis of respiratory syncytial virus in a green monkey kidney cell line (Vero). J Virol 6, 237–242 (1970).
CAS PubMed PubMed Central Google Scholar
Derdowski, A. et al. Human metapneumovirus nucleoprotein and phosphoprotein interact and provide the minimal requirements for inclusion body formation. J Gen Virol 89, 2698–2708, https://doi.org/10.1099/vir.0.2008/004051-0 (2008).
Article CAS PubMed PubMed Central Google Scholar
Garcia-Barreno, B., Delgado, T. & Melero, J. A. Identification of protein regions involved in the interaction of human respiratory syncytial virus phosphoprotein and nucleoprotein: significance for nucleocapsid assembly and formation of cytoplasmic inclusions. J Virol 70, 801–808 (1996).
CAS PubMed PubMed Central Google Scholar
Yu, Q., Hardy, R. W. & Wertz, G. W. Functional cDNA clones of the human respiratory syncytial (RS) virus N, P, and L proteins support replication of RS virus genomic RNA analogs and define minimal trans-acting requirements for RNA replication. J Virol 69, 2412–2419 (1995).
CAS PubMed PubMed Central Google Scholar
Fearns, R. & Plemper, R. K. Polymerases of paramyxoviruses and pneumoviruses. Virus Res, https://doi.org/10.1016/j.virusres.2017.01.008 (2017).
Collins, P. L., Hill, M. G., Cristina, J. & Grosfeld, H. Transcription elongation factor of respiratory syncytial virus, a nonsegmented negative-strand RNA virus. Proc Natl Acad Sci USA 93, 81–85 (1996).
Article ADS CAS PubMed PubMed Central Google Scholar
Fearns, R. & Collins, P. L. Role of the M2-1 transcription antitermination protein of respiratory syncytial virus in sequential transcription. J Virol 73, 5852–5864 (1999).
CAS PubMed PubMed Central Google Scholar
Cox, R. & Plemper, R. K. The paramyxovirus polymerase complex as a target for next-generation anti-paramyxovirus therapeutics. Front Microbiol 6, 459, https://doi.org/10.3389/fmicb.2015.00459 (2015).
Article PubMed PubMed Central Google Scholar
El Najjar, F. et al. Human metapneumovirus Induces Reorganization of the Actin Cytoskeleton for Direct Cell-to-Cell Spread. PLoS pathogens 12, e1005922, https://doi.org/10.1371/journal.ppat.1005922 (2016).
Article PubMed PubMed Central CAS Google Scholar
Goutagny, N. et al. Cell type-specific recognition of human metapneumoviruses (HMPVs) by retinoic acid-inducible gene I (RIG-I) and TLR7 and viral interference of RIG-I ligand recognition by HMPV-B1 phosphoprotein. J Immunol 184, 1168–1179, https://doi.org/10.4049/jimmunol.0902750 (2010).
Article CAS PubMed Google Scholar
Tran, T. L. et al. The nine C-terminal amino acids of the respiratory syncytial virus protein P are necessary and sufficient for binding to ribonucleoprotein complexes in which six ribonucleotides are contacted per N protein protomer. J Gen Virol 88, 196–206, https://doi.org/10.1099/vir.0.82282-0 (2007).
Article CAS PubMed Google Scholar
Galloux, M. et al. Characterization of a viral phosphoprotein binding site on the surface of the respiratory syncytial nucleoprotein. J Virol 86, 8375–8387, https://doi.org/10.1128/JVI.00058-12 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pereira, N. et al. New Insights into Structural Disorder in Human Respiratory Syncytial Virus Phosphoprotein and Implications for Binding of Protein Partners. J Biol Chem 292, 2120–2131, https://doi.org/10.1074/jbc.M116.765958 (2017).
Article CAS PubMed Google Scholar
Sourimant, J. et al. Fine mapping and characterization of the L-polymerase-binding domain of the respiratory syncytial virus phosphoprotein. J Virol 89, 4421–4433, https://doi.org/10.1128/JVI.03619-14 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tran, T. L. et al. The respiratory syncytial virus M2-1 protein forms tetramers and interacts with RNA and P in a competitive manner. J Virol 83, 6363–6374, https://doi.org/10.1128/JVI.00335-09 (2009).
Article CAS PubMed PubMed Central Google Scholar
Blondot, M. L. et al. Structure and functional analysis of the RNA- and viral phosphoprotein-binding domain of respiratory syncytial virus M2-1 protein. PLoS pathogens 8, e1002734, https://doi.org/10.1371/journal.ppat.1002734 (2012).
Article CAS PubMed PubMed Central Google Scholar
Leyrat, C., Renner, M., Harlos, K., Huiskonen, J. T. & Grimes, J. M. Drastic changes in conformational dynamics of the antiterminator M2-1 regulate transcription efficiency in Pneumovirinae. Elife 3, e02674, https://doi.org/10.7554/eLife.02674 (2014).
Article PubMed PubMed Central Google Scholar
Leyrat, C. et al. Structure of the vesicular stomatitis virus N(0)-P complex. PLoS pathogens 7, e1002248, https://doi.org/10.1371/journal.ppat.1002248 (2011).
Article CAS PubMed PubMed Central Google Scholar
Renner, M. et al. Nucleocapsid assembly in pneumoviruses is regulated by conformational switching of the N protein. Elife 5, e12627, https://doi.org/10.7554/eLife.12627 (2016).
Article PubMed PubMed Central Google Scholar
Yabukarski, F. et al. Structure of Nipah virus unassembled nucleoprotein in complex with its viral chaperone. Nat Struct Mol Biol 21, 754–759, https://doi.org/10.1038/nsmb.2868 (2014).
Article CAS PubMed Google Scholar
Galloux, M. et al. Identification and characterization of the binding site of the respiratory syncytial virus phosphoprotein to RNA-free nucleoprotein. J Virol 89, 3484–3496, https://doi.org/10.1128/JVI.03666-14 (2015).
Article CAS PubMed PubMed Central Google Scholar
Llorente, M. T. et al. Structural properties of the human respiratory syncytial virus P protein: evidence for an elongated homotetrameric molecule that is the smallest orthologue within the family of paramyxovirus polymerase cofactors. Proteins 72, 946–958, https://doi.org/10.1002/prot.21988 (2008).
Article CAS PubMed Google Scholar
Llorente, M. T. et al. Structural analysis of the human respiratory syncytial virus phosphoprotein: characterization of an alpha-helical domain involved in oligomerization. J Gen Virol 87, 159–169, https://doi.org/10.1099/vir.0.81430-0 (2006).
Article CAS PubMed Google Scholar
Leyrat, C., Renner, M., Harlos, K. & Grimes, J. M. Solution and crystallographic structures of the central region of the phosphoprotein from human metapneumovirus. PLoS One 8, e80371, https://doi.org/10.1371/journal.pone.0080371 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Jensen, M. R. et al. Structural disorder within sendai virus nucleoprotein and phosphoprotein: insight into the structural basis of molecular recognition. Protein Pept Lett 17, 952–960 (2010).
Article CAS PubMed Google Scholar
Leyrat, C. et al. The N(0)-binding region of the vesicular stomatitis virus phosphoprotein is globally disordered but contains transient alpha-helices. Protein Sci 20, 542–556, https://doi.org/10.1002/pro.587 (2011).
Article CAS PubMed PubMed Central Google Scholar
Leyrat, C. et al. Ensemble structure of the modular and flexible full-length vesicular stomatitis virus phosphoprotein. J Mol Biol 423, 182–197, https://doi.org/10.1016/j.jmb.2012.07.003 (2012).
Article CAS PubMed Google Scholar
Erales, J., Beltrandi, M., Roche, J., Mate, M. & Longhi, S. Insights into the Hendra virus NTAIL-XD complex: Evidence for a parallel organization of the helical MoRE at the XD surface stabilized by a combination of hydrophobic and polar interactions. Biochim Biophys Acta 1854, 1038–1053, https://doi.org/10.1016/j.bbapap.2015.04.031 (2015).
Article CAS PubMed Google Scholar
Khattar, S. K., Yunus, A. S. & Samal, S. K. Mapping the domains on the phosphoprotein of bovine respiratory syncytial virus required for N-P and P-L interactions using a minigenome system. J Gen Virol 82, 775–779, https://doi.org/10.1099/0022-1317-82-4-775 (2001).
Article CAS PubMed Google Scholar
Bruhn, J. F. et al. Crystal structure of the nipah virus phosphoprotein tetramerization domain. J Virol 88, 758–762, https://doi.org/10.1128/JVI.02294-13 (2014).
Article PubMed PubMed Central CAS Google Scholar
Rambo, R. P. & Tainer, J. A. Accurate assessment of mass, models and resolution by small-angle scattering. Nature 496, 477–481, https://doi.org/10.1038/nature12070 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Narang, P., Bhushan, K., Bose, S. & Jayaram, B. A computational pathway for bracketing native-like structures fo small alpha helical globular proteins. Physical chemistry chemical physics: PCCP 7, 2364–2375 (2005).
Article ADS CAS PubMed Google Scholar
Bernado, P. & Svergun, D. I. Structural analysis of intrinsically disordered proteins by small-angle X-ray scattering. Mol Biosyst 8, 151–167, https://doi.org/10.1039/c1mb05275f (2012).
Article CAS PubMed Google Scholar
Ozenne, V. et al. Flexible-meccano: a tool for the generation of explicit ensemble descriptions of intrinsically disordered proteins and their associated experimental observables. Bioinformatics 28, 1463–1470, https://doi.org/10.1093/bioinformatics/bts172 (2012).
Article CAS PubMed Google Scholar
Dunker, A. K., Obradovic, Z., Romero, P., Garner, E. C. & Brown, C. J. Intrinsic protein disorder in complete genomes. Genome Inform Ser Workshop Genome Inform 11, 161–171 (2000).
CAS PubMed Google Scholar
Mollica, L. et al. Binding Mechanisms of Intrinsically DisorderedProteins: Theory, Simulation, and Experiment. Front Mol Biosci 3, 52, https://doi.org/10.3389/fmolb.2016.00052 (2016).
Article PubMed PubMed Central Google Scholar
Noval, M. G., Esperante, S. A., Molina, I. G., Chemes, L. B. & Prat-Gay, G. Intrinsic Disorder to Order Transitions in the Scaffold Phosphoprotein P from the Respiratory Syncytial Virus RNA Polymerase Complex. Biochemistry 55, 1441–1454, https://doi.org/10.1021/acs.biochem.5b01332 (2016).
Article CAS PubMed Google Scholar
Liang, B. et al. Structure of the L Protein of Vesicular Stomatitis Virus from Electron Cryomicroscopy. Cell 162, 314–327, https://doi.org/10.1016/j.cell.2015.06.018 (2015).
Article CAS PubMed PubMed Central Google Scholar
Leung, D. W. et al. An Intrinsically Disordered Peptide from Ebola Virus VP35 Controls Viral RNA Synthesis by Modulating Nucleoprotein-RNA Interactions. Cell Rep 11, 376–389, https://doi.org/10.1016/j.celrep.2015.03.034 (2015).
Article CAS PubMed PubMed Central Google Scholar
Guryanov, S. G., Liljeroos, L., Kasaragod, P., Kajander, T. & Butcher, S. J. Crystal Structure of the Measles Virus Nucleoprotein Core in Complex with an N-Terminal Region of Phosphoprotein. J Virol 90, 2849–2857, https://doi.org/10.1128/JVI.02865-15 (2015).
Article PubMed CAS Google Scholar
Zhou, H. X., Pang, X. & Lu, C. Rate constants and mechanisms of intrinsically disordered proteins binding to structured targets. Physical chemistry chemical physics: PCCP 14, 10466–10476, https://doi.org/10.1039/c2cp41196b (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Esperante, S. A. et al. Fine modulation of the respiratory syncytial virus M2-1 protein quaternary structure by reversible zinc removal from its Cys(3)-His(1) motif. Biochemistry 52, 6779–6789, https://doi.org/10.1021/bi401029q (2013).
Article CAS PubMed Google Scholar
Tanner, S. J. et al. Crystal structure of the essential transcription antiterminator M2-1 protein of human respiratory syncytial virus and implications of its phosphorylation. Proc Natl Acad Sci USA 111, 1580–1585, https://doi.org/10.1073/pnas.1317262111 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hardy, R. W., Harmon, S. B. & Wertz, G. W. Diverse gene junctions of respiratory syncytial virus modulate the efficiency of transcription termination and respond differently to M2-mediated antitermination. J Virol 73, 170–176 (1999).
CAS PubMed PubMed Central Google Scholar
Hardy, R. W. & Wertz, G. W. The product of the respiratory syncytial virus M2 gene ORF1 enhances readthrough of intergenic junctions during viral transcription. J Virol 72, 520–526 (1998).
CAS PubMed PubMed Central Google Scholar
Hardy, R. W. & Wertz, G. W. The Cys(3)-His(1) motif of the respiratory syncytial virus M2-1 protein is essential for protein function. J Virol 74, 5880–5885 (2000).
Article CAS PubMed PubMed Central Google Scholar
Mason, S. W. et al. Interaction between human respiratory syncytial virus (RSV) M2-1 and P proteins is required for reconstitution of M2-1-dependent RSV minigenome activity. J Virol 77, 10670–10676 (2003).
Article CAS PubMed PubMed Central Google Scholar
Esperante, S. A., Paris, G. & de Prat-Gay, G. Modular unfolding and dissociation of the human respiratory syncytial virus phosphoprotein p and its interaction with the m(2-1) antiterminator: a singular tetramer-tetramer interface arrangement. Biochemistry 51, 8100–8110, https://doi.org/10.1021/bi300765c (2012).
Article CAS PubMed Google Scholar
Blocquel, D., Beltrandi, M., Erales, J., Barbier, P. & Longhi, S. Biochemical and structural studies of the oligomerization domain of the Nipah virus phosphoprotein: evidence for an elongated coiled-coil homotrimer. Virology 446, 162–172, https://doi.org/10.1016/j.virol.2013.07.031 (2013).
Article CAS PubMed Google Scholar
Tarbouriech, N., Curran, J., Ruigrok, R. W. & Burmeister, W. P. Tetrameric coiled coil domain of Sendai virus phosphoprotein. Nat Struct Biol 7, 777–781, https://doi.org/10.1038/79013 (2000).
Article CAS PubMed Google Scholar
Communie, G. et al. Structure of the tetramerization domain of measles virus phosphoprotein. J Virol 87, 7166–7169, https://doi.org/10.1128/JVI.00487-13 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cox, R. et al. Structural and functional characterization of the mumps virus phosphoprotein. J Virol 87, 7558–7568, https://doi.org/10.1128/JVI.00653-13 (2013).
Article CAS PubMed PubMed Central Google Scholar
Khattar, S. K., Yunus, A. S., Collins, P. L. & Samal, S. K. Deletion and substitution analysis defines regions and residues within the phosphoprotein of bovine respiratory syncytial virus that affect transcription, RNA replication, and interaction with the nucleoprotein. Virology 285, 253–269, https://doi.org/10.1006/viro.2001.0960 (2001).
Article CAS PubMed Google Scholar
Lu, B. et al. Identification of temperature-sensitive mutations in the phosphoprotein of respiratory syncytial virus that are likely involved in its interaction with the nucleoprotein. J Virol 76, 2871–2880 (2002).
Article CAS PubMed PubMed Central Google Scholar
Li, D. et al. Association of respiratory syncytial virus M protein with viral nucleocapsids is mediated by the M2-1 protein. J Virol 82, 8863–8870, https://doi.org/10.1128/JVI.00343-08 (2008).
Article CAS PubMed PubMed Central Google Scholar
Ghildyal, R., Mills, J., Murray, M., Vardaxis, N. & Meanger, J. Respiratory syncytial virus matrix protein associates with nucleocapsids in infected cells. J Gen Virol 83, 753–757, https://doi.org/10.1099/0022-1317-83-4-753 (2002).
Article CAS PubMed Google Scholar
Garcia, J., Garcia-Barreno, B., Vivo, A. & Melero, J. A. Cytoplasmic inclusions of respiratory syncytial virus-infected cells: formation of inclusion bodies in transfected cells that coexpress the nucleoprotein, the phosphoprotein, and the 22K protein. Virology 195, 243–247, https://doi.org/10.1006/viro.1993.1366 (1993).
Article CAS PubMed Google Scholar
Zhang, S. et al. An amino acid of human parainfluenza virus type 3 nucleoprotein is critical for template function and cytoplasmic inclusion body formation. J Virol 87, 12457–12470, https://doi.org/10.1128/JVI.01565-13 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lahaye, X. et al. Functional characterization of Negri bodies (NBs) in rabies virus-infected cells: Evidence that NBs are sites of viral transcription and replication. J Virol 83, 7948–7958, https://doi.org/10.1128/JVI.00554-09 (2009).
Article CAS PubMed PubMed Central Google Scholar
Nikolic, J. et al. Negri bodies are viral factories with properties of liquid organelles. Nat Commun 8, 58, https://doi.org/10.1038/s41467-017-00102-9 (2017).
Article ADS PubMed PubMed Central Google Scholar
Banani, S. F., Lee, H. O., Hyman, A. A. & Rosen, M. K. Biomolecular condensates: organizers of cellular biochemistry. Nat Rev Mol Cell Biol 18, 285–298, https://doi.org/10.1038/nrm.2017.7 (2017).
Article CAS PubMed Google Scholar
Uversky, V. N. Protein intrinsic disorder-based liquid-liquid phase transitions in biological systems: Complex coacervates and membrane-less organelles. Adv Colloid Interface Sci 239, 97–114, https://doi.org/10.1016/j.cis.2016.05.012 (2017).
Article CAS PubMed Google Scholar
Li, P. et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 483, 336–340, https://doi.org/10.1038/nature10879 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Banjade, S. & Rosen, M. K. Phase transitions of multivalent proteins can promote clustering of membrane receptors. Elife 3, https://doi.org/10.7554/eLife.04123 (2014).
Berrow, N. S. et al. A versatile ligation-independent cloning method suitable for high-throughput expression screening applications. Nucleic Acids Res 35, e45, https://doi.org/10.1093/nar/gkm047 (2007).
Article ADS PubMed PubMed Central CAS Google Scholar
Svergun, D. I., Konarev, P. V., Volkov, V. V., Sokolova, A. V. & Koch, M. H. J. PRIMUS: a Windows PC-based system for small-angle scattering data analysis. J Appl Crystallogr 36, 1277–1282, https://doi.org/10.1107/S0021889803012779 (2003).
Article Google Scholar
Eyal, E., Najmanovich, R., McConkey, B. J., Edelman, M. & Sobolev, V. Importance of solvent accessibility and contact surfaces in modeling side-chain conformations in proteins. J Comput Chem 25, 712–724, https://doi.org/10.1002/jcc.10420 (2004).
Article CAS PubMed Google Scholar
Wu, S. & Zhang, Y. LOMETS: a local meta-threading-server for protein structure prediction. Nucleic Acids Res 35, 3375–3382, https://doi.org/10.1093/nar/gkm251 (2007).
Article CAS PubMed PubMed Central Google Scholar
Hess, B., Kutzner, C., van der Spoel, D. & Lindahl, E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. Journal of Chemical Theory and Computation 4, 435–447, https://doi.org/10.1021/ct700301q (2008).
Article CAS PubMed Google Scholar
Noel, J. K., Whitford, P. C., Sanbonmatsu, K. Y. & Onuchic, J. N. SMOG@ctbp: simplified deployment of structure-based models in GROMACS. Nucleic Acids Res 38, W657–661, https://doi.org/10.1093/nar/gkq498 (2010).
Article CAS PubMed PubMed Central Google Scholar
Whitford, P. C. et al. An all-atom structure-based potential for proteins: bridging minimal models with all-atom empirical forcefields. Proteins 75, 430–441, https://doi.org/10.1002/prot.22253 (2009).
Article CAS PubMed PubMed Central Google Scholar
Best, R. B., Zheng, W. & Mittal, J. Balanced Protein-Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association. J Chem Theory Comput 10, 5113–5124, https://doi.org/10.1021/ct500569b (2014).
Article CAS PubMed PubMed Central Google Scholar
Essmann, U. et al. A smooth particle mesh Ewald method. Journal of Chemical Physics 103, 8577–8593 (1995).
Article ADS CAS Google Scholar
Svergun, D., Barberato, C. & Koch, M. H. J. CRYSOL-a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J Appl Crystallogr 28, 768–773 (1995).
Article CAS Google Scholar
Bernado, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. Structural characterization of flexible proteins using small-angle X-ray scattering. J Am Chem Soc 129, 5656–5664, https://doi.org/10.1021/ja069124n (2007).
Article CAS PubMed Google Scholar
Tria, G., Mertens, H. D., Kachala, M. & Svergun, D. I. Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering. IUCrJ 2, 207–217, https://doi.org/10.1107/S205225251500202X (2015).
Article CAS PubMed PubMed Central Google Scholar
Walter, T. S. et al. A procedure for setting up high-throughput nanolitre crystallization experiments. Crystallization workflow for initial screening, automated storage, imaging and optimization. Acta Crystallogr D Biol Crystallogr 61, 651–657, https://doi.org/10.1107/S0907444905007808 (2005).
Article PubMed CAS Google Scholar
Winter, G., Lobley, C. M. & Prince, S. M. Decision making in xia2. Acta Crystallogr D Biol Crystallogr 69, 1260–1273, https://doi.org/10.1107/S0907444913015308 (2013).
Article CAS PubMed PubMed Central Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J Appl Crystallogr 40, 658–674, https://doi.org/10.1107/S0021889807021206 (2007).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr 60, 2126–2132, https://doi.org/10.1107/S0907444904019158 (2004).
Article PubMed CAS Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr 66, 213–221, https://doi.org/10.1107/S0907444909052925 (2010).
Article CAS PubMed PubMed Central Google Scholar
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J Mol Biol 372, 774–797, https://doi.org/10.1016/j.jmb.2007.05.022 (2007).
Article CAS PubMed Google Scholar
Ashkenazy, H., Erez, E., Martz, E., Pupko, T. & Ben-Tal, N. ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res 38, W529–533, https://doi.org/10.1093/nar/gkq399 (2010).
Article CAS PubMed PubMed Central Google Scholar
Pei, J. & Grishin, N. V. PROMALS3D: multiple protein sequence alignment enhanced with evolutionary and three-dimensional structural information. Methods Mol Biol 1079, 263–271, https://doi.org/10.1007/978-1-62703-646-7_17 (2014).
Article PubMed PubMed Central Google Scholar
Waterhouse, A. M., Procter, J. B., Martin, D. M., Clamp, M. & Barton, G. J. Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191, https://doi.org/10.1093/bioinformatics/btp033 (2009).
Article CAS PubMed PubMed Central Google Scholar
Slack, M. S. & Easton, A. J. Characterization of the interaction of the human respiratory syncytial virus phosphoprotein and nucleocapsid protein using the two-hybrid system. Virus Res 55, 167–176, https://doi.org/10.1016/S0168-1702(98)00042-2 (1998).
Article CAS PubMed Google Scholar
Mallipeddi, S. K., Lupiani, B. & Samal, S. K. Mapping the domains on the phosphoprotein of bovine respiratory syncytial virus required for N-P interaction using a two-hybrid system. J Gen Virol 77(Pt 5), 1019–1023 (1996).
Article CAS PubMed Google Scholar
Holehouse, A. S., Das, R. K., Ahad, J. N., Richardson, M. O. & Pappu, R. V. CIDER: Resources to Analyze Sequence-Ensemble Relationships of Intrinsically Disordered Proteins. Biophys J 112, 16–21, https://doi.org/10.1016/j.bpj.2016.11.3200 (2017).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007–2013) under SILVER grant agreement n^° 260644, and MRC grant (MR/L017709/1). MR is funded by a Wellcome Trust Fellowship (204703/Z/16/Z), CMG benefited from the support of the Labex EpiGenMed, an « Investissements d’avenir » program, reference ANR-10-LABX-12–01, and JMG by the WT (200835/Z/16/Z). This work was supported by a Wellcome Trust administrative support grant (203141/Z/16/Z). The authors would like to thank the staff of beamline BM29 and former beamline ID14-3 at the European Synchrotron Radiation Facility (Grenoble, France) for assistance with SAXS data collection. The authors would also like to thank Diamond Light Source for beamtime (proposal MX10627), and the staff of beamlines I03 and I04 for assistance with crystal testing and data collection. Finally, the authors would like to thank Dr. Nicolas Martinez for helpful discussions.

Author information

Authors and Affiliations

Division of Structural Biology, Wellcome Trust Centre for Human Genetics, Oxford University, Roosevelt Drive, Oxford, OX3 7BN, UK
Max Renner, Guido C. Paesen & Jonathan M. Grimes
Institut de Génomique Fonctionnelle, CNRS UMR-5203 INSERM U1191, University of Montpellier, Montpellier, France
Claire M. Grison, Sébastien Granier & Cédric Leyrat
Science Division, Diamond Light Source Ltd., Diamond House, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
Jonathan M. Grimes

Authors

Max Renner
View author publications
You can also search for this author in PubMed Google Scholar
Guido C. Paesen
View author publications
You can also search for this author in PubMed Google Scholar
Claire M. Grison
View author publications
You can also search for this author in PubMed Google Scholar
Sébastien Granier
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan M. Grimes
View author publications
You can also search for this author in PubMed Google Scholar
Cédric Leyrat
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.R., C.L. and J.M.G. conceived and designed research. M.R., G.C.P. and C.L. performed experiments. M.R., C.M.G., S.G., J.M.G. and C.L. analyzed data. M.R., J.M.G. and C.L. wrote the paper with contributions from G.C.P., C.M.G. and S.G.

Corresponding authors

Correspondence to Jonathan M. Grimes or Cédric Leyrat.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Figure S1–5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Renner, M., Paesen, G.C., Grison, C.M. et al. Structural dissection of human metapneumovirus phosphoprotein using small angle x-ray scattering. Sci Rep 7, 14865 (2017). https://doi.org/10.1038/s41598-017-14448-z

Download citation

Received: 24 July 2017
Accepted: 10 October 2017
Published: 01 November 2017
DOI: https://doi.org/10.1038/s41598-017-14448-z

This article is cited by

Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus
- Jack D. Whitehead
- Hortense Decool
- Max Renner
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Ensemble description of the intrinsically disordered N-terminal domain of the Nipah virus P/V protein from combined NMR and SAXS

Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus

High-resolution cryo-EM structure of the Pseudomonas bacteriophage E217

Introduction

Results

Structures of Pcore from two new crystal forms

SAXS characterization of the HMPV P constructs

The N-terminal region of P shows α-helical propensity and is in equilibrium between extended and more compact conformers

Ensemble analysis of the central region of P indicates the presence of loose tertiary structure located C-terminally of Pcore

The C-terminal region of P may stabilize the loose tertiary structure present in the central region

Ensemble structure of the full-length phosphoprotein

Discussion

Conformation of the N-terminal region

Conformation of the M2-1 binding region

Conformation of the C-terminal IDR

Long range effects modulating P conformational ensembles

Possible implications for viral inclusion bodies formation

Material and Methods

Protein cloning, expression and purification

Small angle X-ray scattering experiments

Structural modelling and molecular dynamics simulations

Ensemble optimization

Crystallization and data collection

Structure determination and refinement

Structure and sequence analyses

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary Figure S1–5

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus

Comments

Search

Quick links

Structures of P_core from two new crystal forms

Ensemble analysis of the central region of P indicates the presence of loose tertiary structure located C-terminally of P_core