Ensemble description of the intrinsically disordered N-terminal domain of the Nipah virus P/V protein from combined NMR and SAXS

Schiavina, Marco; Salladini, Edoardo; Murrali, Maria Grazia; Tria, Giancarlo; Felli, Isabella C.; Pierattelli, Roberta; Longhi, Sonia

doi:10.1038/s41598-020-76522-3

Article
Open access
Published: 11 November 2020

Scientific Reports volume 10, Article number: 19574 (2020)

Abstract

Using SAXS and NMR spectroscopy, we herein provide a high-resolution description of the intrinsically disordered N-terminal domain (PNT, aa 1–406) shared by the Nipah virus (NiV) phosphoprotein (P) and V protein, two key players in viral genome replication and in evasion of the host innate immune response, respectively. The use of multidimensional NMR spectroscopy allowed us to assign as much as 91% of the residues of this intrinsically disordered domain whose size constitutes a technical challenge for NMR studies. Chemical shifts and nuclear relaxation measurements provide the picture of a highly flexible protein. The combination of SAXS and NMR information enabled the description of the conformational ensemble of the protein in solution. The present results, beyond providing an overall description of the conformational behavior of this intrinsically disordered region, also constitute an asset for obtaining atomistic information in future interaction studies with viral and/or cellular partners. The present study can thus be regarded as the starting point towards the design of inhibitors that by targeting crucial protein–protein interactions involving PNT might be instrumental to combat this deadly virus.

Introduction

The Nipah virus (NiV), together with its close relative Hendra virus (HeV), is a zoonotic paramyxovirus responsible for severe encephalitis in humans. NiV and HeV have been classified in the Henipavirus genus¹, which also comprises the later discovered Cedar virus (CedV)². Because of their high pathogenicity, broad host range, high interspecies transmission and the lack of therapeutics and vaccines, henipaviruses are classified as biosafety level 4 (BSL-4) pathogens and are considered potential bioterrorism agents.

The genome of henipaviruses is made of a non-segmented, single-stranded RNA molecule of negative polarity that is encapsidated by a regular array of nucleoprotein (N) monomers to form a helical nucleocapsid. This N:RNA complex, and not naked RNA, is the substrate used by the viral polymerase for both transcription and replication. The viral polymerase is a complex consisting of the large (L) protein, which bears all the enzymatic activities, and the phosphoprotein (P). Through its interaction with both L and the nucleocapsid, the P protein acts as a tether and recruits L onto the N:RNA template. In addition, P also serves as a chaperone for both L³ and N, in that it is required for proper folding/maturation of L and maintains N in a monomeric, RNA-free form⁴. P is therefore a pivotal protein endowed with multiple functions critical for both transcription and replication.

The repertoire of P functions is further expanded by the peculiar coding capacity of the P gene. Indeed, beyond the P protein, the P gene also codes for the V and W proteins that are generated through the addition of either one (protein V) or two (protein W) non-templated guanosines at the editing site of the P messenger. The addition of these guanosines triggers a downstream frame-shift. The P, V, and W proteins therefore share a common N-terminal region (referred to as PNT) that constitutes a bona fide domain (i.e. a genuine functional unit) as inferred from the genetic organization of the P gene (Fig. 1A). The Henipavirus V and W proteins are key players in the evasion of the interferon (IFN)-mediated response via an antagonist activity of IFN signaling^5,6. V and W bind to STAT1, a key signal transducer in the IFN-mediated antiviral response, through their common PNT region⁷. Binding of STAT1 by V leads to inhibition of STAT1 translocation into the nucleus, whereas binding to W leads to sequestration of STAT1 in the nucleus⁷. The NiV P protein is endowed with anti-IFN function as well, indicating that the PNT domain common to P, V, and W is responsible for the IFN antagonist activity.

We previously showed that PNT from both NiV and HeV is intrinsically disordered¹⁴. The P C-terminal region (PCT) consists of an alternation of disordered regions and ordered regions (Fig. 1A). In fact, PCT comprises a disordered region (aa 407–469), referred to as « spacer » and overlapping with the reading frame of the C-terminal domain of V and W, a structured region responsible for the oligomerization of P (PMD, aa 470–578)^8,15, a disordered linker and a structured region (X domain, XD, aa 660–709) with a triple α-helical bundle fold (Fig. 1A)⁹. We characterized the NiV and HeV V proteins and showed that PNT conserves its disorder also within the V protein, while the zinc-finger domain (ZnFD) has a predominant β conformation¹⁰ in agreement with predictions^16,17. In that study, we also experimentally showed that the NiV and HeV V proteins interact with DDB1—a cellular protein whose binding to V promotes STAT1 degradation—and concluded that the ZnFD plays a crucial role in strengthening this interaction¹⁰.

Although our previous study on the V proteins from both NiV and HeV¹⁰ has contributed to illuminating the conformational behavior of these proteins and has provided a conceptual asset to design new antiviral strategies to combat the ability of these viruses to escape the innate immune response, a site-resolved description of these proteins is still lacking. A major hurdle in this respect arises from the presence of their intrinsically disordered PNT region, which prevents crystallization and whose size constitutes a challenge for NMR studies.

To fill this gap in knowledge, and as a first step towards an atomic description of the NiV V protein, we decided to investigate NiV PNT by combining small-angle X-ray scattering (SAXS) and NMR spectroscopy. When we started this project, high-resolution data were already available for the first 35 residues of NiV PNT. Yabukarski and co-workers indeed solved the X-ray structure of a complex (herein referred to as N⁰–P_1–50) made of a truncated form of NiV N unable to self-assemble (aa 32–383) and of the first 50 residues of NiV P⁴ (Fig. 1A). Only the first 35 residues of P were defined in the electron density. This Molecular Recognition Element (MoRE) forms two α-helices (αP1a, aa 1–19; αP1b, aa 21–28) separated by a kink (Fig. 1A). In that study, the P region encompassing the first 100 residues (P_1–100) was also investigated by NMR spectroscopy on a small construct. Although the HN correlation NMR spectrum is typical of a disordered protein, the secondary-structure propensities (SSPs) calculated from Cα and Cβ secondary chemical shifts indicate the presence of five fluctuating α-helices. Upon addition of the N protein, only residues 50–100 remain detectable, indicating that they remain flexible in the complex and that the N⁰-binding region of P encompasses residues 1 to 50⁴. This latter region contains two motifs conserved in Paramyxoviridae members (i.e. soyuz1 and soyuz2)¹⁸ (see Fig. 1B).

When we were about to complete our NMR and SAXS characterization of NiV PNT, a study was published reporting the structural description of the NiV P protein¹³. In that study, the authors combined NMR spectroscopy, SAXS, and X-ray crystallography and obtained an ensemble description of this large protein. Notably, they decided to use a “divide et impera” approach to disentangle the NMR spectral complexity of the large disordered N-terminal region (residues 1–474). They thus generated several overlapping constructs that altogether cover the entire P protein and then assigned the HSQC spectrum of each construct corresponding to the P disordered regions and to XD. The chemical shift-derived SSPs of the fragments revealed that the long N-terminal intrinsically disordered region and the linker connecting PMD to XD dynamically sample multiple conformations while possessing short regions of residual secondary structure. The comparison between the HSQC spectrum of the full-length NiV P protein with the HSQC spectra of the individual P sub-constructs unveiled an overall good superimposition supporting a scenario where the N-terminal moiety remains highly flexible and retains a conformational behavior in the tetramer similar to that observed in the isolated sub-constructs, with negligible inter-chain contacts¹³.

Herein, we report the results we obtained using cutting-edge, multidimensional NMR spectroscopy approaches applied to the entire PNT domain. In spite of the challenging nature of such an assignment, we succeeded in assigning as much as 91% of the peaks of this long intrinsically disordered domain. The combination of SAXS and NMR information allowed us to obtain an ensemble description of the conformational behavior of the protein in solution.

Results and discussion

NMR residue-specific structural and dynamic characterization of NiV PNT

NiV PNT is a very large intrinsically disordered protein (412 residues including the C-terminal His tag and the initial methionine), as can be inferred by inspection of the two 2D spectra, the ¹H–¹⁵N HSQC (Fig. 2A) and the ¹³C′–¹⁵N CON (Fig. 2B). The latter also reveals signals of proline residues and is characterized by improved chemical shift dispersion, two features that are very useful for the study of intrinsically disordered proteins (IDPs). An assignment strategy that combines ¹³C-detected with ¹H-detected NMR is thus very helpful in providing sufficient information to enable sequence-specific assignment of the resonances of the full-length protein. A series of triple-resonance 3D/5D NMR experiments, specifically designed for IDPs and based on either ¹H^N or ¹³C detection, were acquired^19,20 (Supplementary Table S1). The automated projection spectroscopy (APSY) approach was exploited in most of the multidimensional experiments to reduce the experimental time while preserving high spectral resolution in the indirect dimensions^21,22.

The ¹³C′–¹⁵N CON spectrum of NiV PNT, shown in Fig. 2B, can be considered the reference spectrum for a series of multidimensional CON-based experiments in which correlations to additional nuclear spins in the indirect dimensions of the experiments provide the needed information to achieve the sequence-specific assignment. These multidimensional spectra include the ¹³C-detected 3D (H)CBCACON²³, (H)CBCANCO²⁴, (H)COCON^25,26, and 5D (HCA)CONCACON²⁷ spectra. Inspection of the spectra constituted the starting point for the sequence-specific assignment^22,27.

This information was complemented with that available through ¹H^N-detected multidimensional NMR experiments. The 3D BEST-TROSY (BT) triple-resonance experiments²⁸ HN(CO)CACB, HNCACB, HNCANNH, HNCO, and HNCOCANNH were acquired to obtain the complementary information needed to complete the assignment, augmented by a 5D BT-HN(COCAN)CONH experiment²⁹ used to resolve a few ambiguities and to confirm the chemical shift assignment obtained with the 3D spectra.

We could successfully assign the resonances of H^N, N, C′, C^α, and C^β nuclei of as many as 369 residues, including all proline residues (Fig. 2B), thus leading to a percentage of 91% of assigned residues for the entire protein. The chemical shifts of the previously mentioned nuclei are reported in Supplementary Table S2. The assignment is less complete in the N- and C-terminal moieties of the protein, in particular in the stretches encompassing residues 14–27, 41–49 and 342–346, because of increased transverse relaxation, residual signal overlap, or both. The availability of heteronuclear chemical shifts enables the presence of residual secondary structure to be inferred³⁰. The neighbor-corrected structural propensities (ncSSPs) were computed by comparing the experimentally measured chemical shifts of N, C′, C^α, and C^β nuclei to the values expected for the corresponding random coils³¹ using the tool available online at https://st-protein02.chem.au.dk/ncSPC³². The propensities to adopt α-helical and β-extended conformations, as obtained using the method of Mulder³², are shown in Fig. 3A and confirm that the protein is intrinsically disordered, with SSP values in the ±0.2 range for the majority of the primary sequence except for the regions encompassing residues 69–73, 237–240, 341–349, and 400–403.

Segments with SSP values between 0.05 and 0.2 indicate very modest deviations from random coil behavior. Moreover, the detection of local structural propensity in a polypeptide crucially depends on the choice of the reference chemical shifts for the random coil state and on possible bias induced by experimental conditions (e.g. buffer, pH, experimental temperature, etc.). Additional information is thus required to characterize the structural and dynamic properties of the protein in addition to nuclear chemical shifts.
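The dependence on the random coil reference noted above can be made concrete with a minimal sketch of a secondary chemical shift calculation. This is not the ncSPC algorithm: it only shows the underlying idea of subtracting a random coil reference from an observed shift and combining the C^α and C^β contributions. The function names and all numerical values are hypothetical and are not taken from Supplementary Table S2.

```python
# Illustrative only: this is NOT the ncSPC algorithm, just the underlying idea
# of a secondary chemical shift (observed shift minus random coil reference).

def secondary_shift(observed_ppm, random_coil_ppm):
    """Return the secondary chemical shift in ppm."""
    return observed_ppm - random_coil_ppm


def ca_minus_cb(delta_ca, delta_cb):
    """Combine C-alpha and C-beta secondary shifts into one score.

    Positive values point towards helical propensity, negative values
    towards extended (beta) propensity.
    """
    return delta_ca - delta_cb


# Hypothetical observed and random coil shifts (ppm) for a single residue.
observed = {"CA": 53.4, "CB": 18.7}
reference = {"CA": 52.5, "CB": 19.1}

delta_ca = secondary_shift(observed["CA"], reference["CA"])
delta_cb = secondary_shift(observed["CB"], reference["CB"])
score = ca_minus_cb(delta_ca, delta_cb)
```

Changing the reference values by a few tenths of a ppm changes the sign or size of `score`, which is why modest propensities need support from other observables.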

To investigate backbone dynamics of NiV PNT, we measured heteronuclear ¹H–¹⁵N NOE, ¹⁵N R₂, and ¹⁵N R₁ values. We could successfully determine relaxation rates for most of the assigned cross-peaks observed in the 2D ¹H–¹⁵N HSQC spectrum and discarded those whose overlap would have rendered the analysis ambiguous. The relaxation rates show an overall trend consistent with the considerably disordered conformation of PNT. ¹H–¹⁵N NOE values are all well below 0.5, indicating high flexibility of the backbone (Supplementary Figure S1A). Contrary to the ¹⁵N R₁ values that are rather uniform along the polypeptide chain (Supplementary Figure S1B), higher than modal ¹⁵N R₂ values were observed in several regions of the primary sequence, i.e. 9–52, 110–141, 207–220, 226–256, 261–269 and 308–406 (Fig. 3B). The segments encompassing residues 9–52 and the C-terminal part of the polypeptide chain (340–406) are regions that are partially unassigned in our experimental conditions. This observation, as well as the absence of correlations in the multidimensional spectra, confirms that these regions are affected by dynamical phenomena leading to high R₂ values, such as the presence of transiently populated secondary structural elements or chemical exchange broadening. The positive SSP values for the residues that can be observed in the regions 339–357 and 367–373 suggest the presence of transiently formed α-helices in these regions. The following stretch of about 30 amino acids, from residue 376 to the end of the protein primary sequence, is particularly rich in positively charged amino acids (22%), which suggests the possibility that the high nuclear relaxation rates observed in the absence of pronounced local secondary structural elements are due to the occurrence of transient electrostatic interactions (Fig. 1B and Supplementary Figure S2). Interestingly, the region encompassing residues 226–256 and 261–269, which overall shows higher than modal ¹⁵N R₂, is rich in negatively charged residues (33%) and could indeed be engaged in transient interaction with the C-terminal, positively charged stretch (376–406).
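For readers less familiar with relaxation analysis, the sketch below shows one common way to turn a series of peak intensities into a transverse relaxation rate: a log-linear fit of a mono-exponential decay, I(t) = I₀·exp(−R₂·t). It is an assumption that a mono-exponential model is adequate; the source does not state which fitting procedure was used. The delays and intensities are synthetic.

```python
import math


def fit_r2(delays_s, intensities):
    """Estimate R2 (s^-1) from I(t) = I0 * exp(-R2 * t) by a log-linear fit."""
    xs = list(delays_s)
    ys = [math.log(value) for value in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Ordinary least-squares slope of ln(I) versus t.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return -slope


# Synthetic, noise-free decay with a hypothetical R2 of 5 s^-1.
delays = [0.02, 0.06, 0.10, 0.16, 0.24]
peaks = [1000.0 * math.exp(-5.0 * t) for t in delays]
r2 = fit_r2(delays, peaks)
```

With real data, a weighted non-linear fit and an error estimate per residue would normally replace this unweighted fit.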

It is also interesting to note that this latter region (i.e. 376–406) is rich in positively charged amino acids (7 such residues) and also contains 4 prolines (Fig. 1B and Supplementary Figure S2). This particular amino acid composition, in which proline residues alternate with amino acids sharing the same charge (positive in this case), could thus have a role in promoting conformations with a local net charge even in the absence of well-defined secondary structural elements. The same pattern of regions rich in proline residues and in amino acids sharing the same charge is found in other regions, such as ²⁶⁷PEDEEP²⁷², although in this case the amino acids are negatively charged (Fig. 1B and Supplementary Figure S2). The NMR data thus suggest a possible cross-talk between regions enriched in residues with opposite charge. The 238–256 region is negatively charged and can establish interactions with positively charged regions, such as the C-terminal one encompassing residues 376–406 (Fig. 1B and Supplementary Figure S2).

Additional amino acid stretches showing higher than modal ¹⁵N R₂ values are 110–141 and 207–220. These do not show evidence of transiently populated secondary structural elements nor highly charged segments. However, looking at the amino acid composition, in particular at amino acid types that are quite rare in IDPs³³, one can observe a number of aromatic residues in these regions. These account for 22% of the residues in region 110–141 and 23% in region 207–220, indicating that these regions feature a higher amount of aromatic residues with respect to what is observed for the whole protein (6%) (Fig. 1B and Supplementary Figure S2). Aromatic residues are bulky amino acids and might be involved in local transient interactions without inducing specific secondary structural elements explaining the higher ¹⁵N transverse relaxation rates observed, as also observed for other IDPs³⁴ in which aromatic residues have been shown to promote compact states.
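The composition percentages quoted for individual regions can be reproduced with a few lines of code. The sketch below is a generic helper, not the authors' script. It assumes that "aromatic" means Phe, Trp and Tyr (the source does not list the residue types it counts), and it uses a hypothetical toy sequence, not the NiV PNT sequence.

```python
# Assumed residue classes (one-letter codes); the source does not define them.
AROMATIC = set("FWY")
NEGATIVE = set("DE")


def fraction_in_region(sequence, start, end, residue_set):
    """Fraction of residues in sequence[start..end] (1-based, inclusive)
    that belong to residue_set."""
    segment = sequence[start - 1:end]
    return sum(1 for aa in segment if aa in residue_set) / len(segment)


# Hypothetical toy sequence used only to exercise the function.
toy_sequence = "GSFYWAEEDD"
aromatic_fraction = fraction_in_region(toy_sequence, 1, 10, AROMATIC)
negative_fraction = fraction_in_region(toy_sequence, 7, 10, NEGATIVE)
```

The same helper applies to the charged and Ser/Thr-rich segments discussed in this section by swapping the residue set.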

Additional information that can be obtained by NMR derives from the investigation of exchange processes of amide protons with the solvent. Several CLEANEX experiments were acquired with different mixing times to measure the exchange rate constants³⁵. Intensities of cross peaks were measured to obtain an estimate of k_ex (s⁻¹) (Supplementary Figure S1C). Thus, k_ex reports on the exchange of amide protons with the solvent on a per-residue basis. While this observable has been used extensively to characterize globular proteins and to identify amide protons that are buried in hydrophobic cores and/or involved in hydrogen bonds through the determination of “protection factors”, in the case of IDPs it is not straightforward to interpret hydrogen exchange data in terms of structural effects, as these are not easy to disentangle from other effects such as local electrostatic potential, nature of the amino acid, and possible effects of neighboring amino acids in the primary sequence^36,37. On the other hand, there is no doubt that this observable is rich in information^36,37,38 and that, as we make progress in understanding the different contributions and in predicting exchange values for random coil conformations³⁹, it might become a useful tool to obtain information about the structural and dynamic properties of IDPs.
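As a rough illustration of how an exchange rate can be extracted from a mixing-time series, the sketch below uses the initial-rate approximation, in which the normalized build-up I(τ)/I₀ grows linearly as k_ex·τ at short mixing times. This is a simplifying assumption that neglects relaxation during mixing; it is not presented as the fitting model used in the study. The mixing times and intensity ratios are synthetic.

```python
def kex_initial_slope(mixing_times_s, intensity_ratios):
    """Least-squares slope through the origin of I(tau)/I0 versus tau.

    In the initial-rate regime the slope approximates k_ex (s^-1).
    """
    numerator = sum(t * r for t, r in zip(mixing_times_s, intensity_ratios))
    denominator = sum(t * t for t in mixing_times_s)
    return numerator / denominator


# Synthetic build-up generated with a hypothetical k_ex of 1.58 s^-1.
mixing_times = [0.005, 0.010, 0.015, 0.020]
ratios = [1.58 * t for t in mixing_times]
kex = kex_initial_slope(mixing_times, ratios)
```

At longer mixing times the build-up curves bend over, and a full build-up model with relaxation terms is needed.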

Inspecting the observed k_ex for NiV PNT as a function of the primary sequence reveals that some regions present a k_ex that is higher than the modal value of 1.58 s⁻¹, while some others are well below this value. Higher values are observed for residues 27–30, between two regions with large, positive SSP values at the edges of two transiently populated helices. Another region with high k_ex is the one encompassing residues 119–147, rich in aromatic residues (17%; Fig. 1B and Supplementary Figure S2), where slightly higher ¹⁵N R₂ values were identified, in agreement with the presence of a solvent-exposed region. Another region with significantly high rates is the one spanning residues 182–194, a region that is very rich in serine and threonine residues (38% in this segment versus 16% in the whole protein), two amino acids that are often solvent exposed and are characterized by high exchange rates with the solvent in general³⁹ (Fig. 1B and Supplementary Figure S2). The regions presenting lower k_ex values (slow chemical exchange) are those encompassing residues 97–122 and 239–263, which are rich in negatively charged residues (31% in the first segment, 40% in the second one) (Fig. 1B and Supplementary Figure S2), a feature that is known to reduce exchange^37,40.

The NMR data clearly suggest that NiV PNT is disordered and very flexible with few short and transient secondary structure elements. The N-terminus shows moderate positive secondary structure propensity for residues 68–81, while some hints of the occurrence of other transiently populated α-helices can be obtained from the combination of ¹⁵N R₂, SSP and k_ex values in the region encompassing residues 10–30. This region contains a segment (1–29) that adopts a helix-kink-helix conformation in the crystal structure of the N⁰–P complex⁴. Subsequent NMR studies confirmed the presence of two transiently populated α-helices at the N-terminus of NiV PNT also in solution and in the absence of the binding partner^4,13. The presence of these helical elements mildly populated in our experimental conditions might be responsible for the conformational exchange processes likely causing the broadening of the signals in the full construct used here. Nevertheless, a few amino acids are detectable in this region and do show SSP values that indicate α-helical propensity (Fig. 3A). The more efficient transverse relaxation in these regions, as compared to the studies performed on shorter constructs, could be due to subtle differences arising from intramolecular interactions.

The overall behavior of PNT in solution might also be influenced by the experimental conditions. Indeed, the buffer used in ref. 13 includes compounds that mitigate protein self-association and aggregation (150 mM NaCl, 50 mM arginine, 50 mM glutamate)⁴¹. It should be pointed out, however, that in other cases the same buffer was reported to induce protein compaction by minimizing the effect of intramolecular long-range electrostatic interactions⁴². The causes of these conflicting effects are not well understood⁴².

Similar considerations also hold for the 340–355 region that was shown by Jensen and co-workers to partly sample an α-helical conformation¹³. Although we could not map the core region of the transient α-helix, we mapped the flanking residues that, also in this case, confirm the presence of a transient α-helix in this region. This transiently populated α-helix may correspond to a binding site for one or multiple partners whose identification will require future studies. A possible partner could be the unassembled form of the N protein, by analogy with measles virus. Indeed, the counterpart of this region in measles virus P corresponds to a transiently populated α-helix (α4, aa 190–198) that weakly binds N⁰ (ref. 43).

The transiently populated α-helices are in good agreement with predictions obtained with both FESS (the fast secondary structure predictor implemented in FELLS⁴⁴; Supplementary Figure S2) and PSIPRED⁴⁵ (Supplementary Figure S3A). In terms of predicted MoREs (see Supplementary Figure S3A), a quite good agreement was found with the experimental data, as well as with already mapped binding sites, including the STAT1 (aa 110–140)¹³ and STAT2 (aa 110–140 and 230–237) binding sites⁴⁶. The 110–140 region shown to constitute a weak binding site for STAT1¹³ does not exhibit significant SSP values (all the values are below 0.2), a finding that suggests that this region might correspond to an I-MoRE (Irregular-MoRE), i.e. a region that may remain at least partially disordered after binding to the partner. This latter hypothesis is further corroborated by the fact that addition of STAT1 only triggers a decrease in the resonance intensities of the 110–140 region with no concomitant peak shift¹³.

An analysis performed on the ELM database (https://elm.eu.org/)⁴⁷ highlighted the presence of numerous Short Linear Motifs (SLiMs) along the sequence. SLiMs, also known as linear motifs (LMs), are short stretches of adjacent amino acids mediating protein–protein interactions and occurring within IDRs⁴⁸. SLiMs play crucial roles in cell regulation and SLiM mimicry is often used by viruses to hijack their host cellular machinery⁴⁹. A nice correlation was found between the presence of SLiMs and the dynamic behavior of NiV PNT. Indeed, 22 SLiMs were identified (Supplementary Table S3) and most of them (18) are located within the regions displaying high R₂ values (Supplementary Figure S3B). As an example, the previously mentioned C-terminal region (aa 376–406), enriched in proline residues and in positively charged residues, features several (i.e. 7) SLiMs. These SLiMs are mainly involved in interactions with kinases, phosphatases, SH3 and WW regulatory domains, consistent with a role in hijacking key cell regulatory processes. These interactions can now be investigated at atomic resolution through ¹³C-detected experiments tailored for proline residues that allow acquiring quick snapshots in a clean spectral region^50,51.
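Because SLiMs are usually described as short sequence patterns, a motif scan can be sketched with regular expressions. The example below is only an illustration of the principle. The two patterns are simplified textbook consensus motifs (PxxP for SH3 ligands and PPxY for WW ligands), not the curated expressions of the ELM resource, and the toy sequence is hypothetical rather than taken from NiV PNT.

```python
import re

# Simplified consensus patterns, assumed for illustration only.
MOTIFS = {
    "SH3_PxxP": r"P..P",
    "WW_PPxY": r"PP.Y",
}


def find_motifs(sequence, motifs=MOTIFS):
    """Return (name, 1-based start, matched text) for every occurrence.

    A lookahead is used so that overlapping matches are also reported.
    """
    hits = []
    for name, pattern in motifs.items():
        for match in re.finditer(rf"(?=({pattern}))", sequence):
            hits.append((name, match.start() + 1, match.group(1)))
    return sorted(hits, key=lambda hit: hit[1])


# Hypothetical proline- and lysine-rich toy sequence.
toy_sequence = "AKPRPKPLSPPAYG"
hits = find_motifs(toy_sequence)
```

Short patterns such as these match frequently by chance, which is why motif predictions are usually weighed against context such as disorder and conservation.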

SAXS ensemble characterization of NiV PNT

Synchrotron solution SAXS measurements were performed to gain insight into the overall conformation and motion of NiV PNT. Linearity in the Guinier region at low angles (sR_g < 1.0) revealed good data quality, with no indication of protein aggregation (Fig. 4A, inset). The molecular mass determined from the forward scattering intensity at zero angle, I(0), was ~47 kDa, indicating a monomer in solution, in agreement with the value (45.3 kDa) computed from the amino acid sequence using the ExPASy server (https://web.expasy.org/protparam/). Although not significantly different, the R_g extracted from P(r) is slightly larger (~63 Å) than the one extracted from Guinier analysis (~61 Å). The theoretical R_g value expected for an IDP of the same length (412 amino acids), calculated using Flory’s power law \(R_g^{\mathrm{IDP}} = R_0 N^{\nu}\), where R₀ is 2.54 ± 0.01 Å, N is the number of amino acids and \(\nu\) is 0.522 ± 0.001⁵², is ~59 Å. The experimentally determined R_g is therefore very close to the value expected for an IDP (for comparison, the R_g expected for a random coil conformation would be ~72 Å⁵³).
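The Guinier analysis mentioned above rests on the approximation I(s) ≈ I(0)·exp(−s²R_g²/3) at low angles, so that ln I(s) is linear in s². The following sketch fits that line on synthetic, noise-free data. It is a generic illustration, not the software used in the study, and the input values (R_g of 61 Å, arbitrary I(0), chosen s range) are assumptions made for the example.

```python
import math


def guinier_fit(s_values, intensities):
    """Fit ln I(s) = ln I(0) - (Rg^2 / 3) * s^2 and return (Rg, I(0))."""
    xs = [s * s for s in s_values]
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return math.sqrt(-3.0 * slope), math.exp(intercept)


# Synthetic data restricted to s * Rg < 1.0 (s in 1/Angstrom, Rg in Angstrom).
rg_true, i0_true = 61.0, 100.0
s_grid = [0.004 + 0.001 * k for k in range(12)]
i_grid = [i0_true * math.exp(-((s * rg_true) ** 2) / 3.0) for s in s_grid]
rg_fit, i0_fit = guinier_fit(s_grid, i_grid)
```

On experimental data, curvature of this plot at the lowest angles is the usual sign of aggregation or interparticle effects.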

P(r) yields a maximal dimension, D_max, of ~ 230 Å (Fig. 4B) with a long tail in the P(r) function, suggesting that the protein tends to assume an overall non-compact conformation⁵⁴. The overall SAXS parameters for NiV PNT are listed in Supplementary Table S4.
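The R_g obtained from the pair distance distribution follows from its second moment, R_g² = ∫r²P(r)dr / (2∫P(r)dr). The sketch below evaluates this with the trapezoid rule and checks it against the analytical P(r) of a homogeneous sphere, for which R_g = √(3/5)·R. The sphere is used only as a numerical check with a known answer and is not meant to represent NiV PNT; the function names and the radius are assumptions.

```python
import math


def rg_from_pr(r_values, p_values):
    """Rg from P(r) via Rg^2 = int r^2 P(r) dr / (2 * int P(r) dr)."""
    numerator = 0.0
    denominator = 0.0
    for i in range(len(r_values) - 1):
        dr = r_values[i + 1] - r_values[i]
        # Trapezoid rule on both integrals.
        numerator += 0.5 * dr * (
            r_values[i] ** 2 * p_values[i]
            + r_values[i + 1] ** 2 * p_values[i + 1]
        )
        denominator += 0.5 * dr * (p_values[i] + p_values[i + 1])
    return math.sqrt(numerator / (2.0 * denominator))


def sphere_pr(r, radius):
    """Unnormalized P(r) of a homogeneous sphere, valid for 0 <= r <= 2R."""
    x = r / radius
    return r * r * (1.0 - 0.75 * x + x ** 3 / 16.0)


radius = 50.0  # hypothetical radius in Angstrom; D_max is 2 * radius
grid = [2.0 * radius * k / 2000 for k in range(2001)]
rg_sphere = rg_from_pr(grid, [sphere_pr(r, radius) for r in grid])
```

Because this estimate uses the whole curve, it is less sensitive to the lowest-angle points than the Guinier value, which is one reason the two can differ slightly.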

The flexible nature of NiV PNT was qualitatively assessed by using the normalized Kratky plot, where the absence of a well-defined bell shape indicates a protein with intrinsically disordered regions (Fig. 4C). To achieve further insights into the conformational behavior of NiV PNT, data obtained from NMR and SAXS experiments were combined in search of an ensemble that quantitatively describes the conformational behavior of NiV PNT in solution. In a first step, a pool of 50,000 structures was generated based on the NMR data. To this end, the ncSSP values whose absolute value exceeds 0.05 were provided as inputs to Flexible-Meccano⁵⁵. Subsequently, from the NMR-based pool, we used the Ensemble Optimization Method (EOM) 2.0 to generate a conformational sub-ensemble that best fits the experimental SAXS data. In order to minimize over-fitting, EOM attempts to minimize the number of conformers able to fit the experimental data and usually generates ensembles consisting of 5 to 40 conformers. Based on the high flexibility expected, no repetition of conformations in the ensemble was allowed. The scattering curve back-calculated from the selected ensemble (Fig. 4A, cyan curve) fits the experimental SAXS data well, as judged from both χ² and CorMap⁵⁶ (χ² = 0.744, P = 0.101). The resulting final R_g distribution, broader than the one generated from the NMR-based pool, indicates that NiV PNT exists in solution as a randomly distributed ensemble of non-compact and highly flexible conformations. Note that successive and independent selections by EOM 2.0 consistently yielded similar R_g distributions, thus attesting to the reproducibility of the results (data not shown). The flexibility of the ensemble was quantified as Ensemble_Rflex = ~91% (NMR-Pool_Rflex = ~83%).
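The scoring step at the heart of an ensemble selection can be sketched as follows: the scattering curves of the selected conformers are averaged, scaled to the experimental curve by least squares, and compared through a reduced χ². This is a schematic illustration of the principle only. It is not the EOM 2.0 implementation, it omits the search over sub-ensembles, and all curves below are made-up numbers.

```python
def ensemble_average(curves):
    """Point-by-point average of equally weighted scattering curves."""
    n = len(curves)
    return [sum(values) / n for values in zip(*curves)]


def reduced_chi2(i_exp, sigma, i_calc):
    """Reduced chi-square after optimal linear scaling of i_calc to i_exp.

    Returns (chi2, scale), with chi2 = sum(((Iexp - c*Icalc)/sigma)^2)/(N - 1).
    """
    weights = [1.0 / (s * s) for s in sigma]
    scale = sum(w * e * c for w, e, c in zip(weights, i_exp, i_calc)) / sum(
        w * c * c for w, c in zip(weights, i_calc)
    )
    residual = sum(
        w * (e - scale * c) ** 2 for w, e, c in zip(weights, i_exp, i_calc)
    )
    return residual / (len(i_exp) - 1), scale


# Hypothetical curves for two conformers and a matching "experimental" curve.
conformer_curves = [[10.0, 8.0, 5.0, 2.0], [6.0, 4.0, 3.0, 2.0]]
average_curve = ensemble_average(conformer_curves)
experimental = [16.0, 12.0, 8.0, 4.0]
errors = [1.0, 1.0, 1.0, 1.0]
chi2, scale = reduced_chi2(experimental, errors, average_curve)
```

A selection algorithm would repeat this scoring for many candidate sub-ensembles and keep those with the lowest χ².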

The average R_g value of the ensemble (66 Å) slightly exceeds the theoretical value expected from Flory’s power law (59 Å). This discrepancy may reflect sequence specificities, i.e. specific sequence attributes such as proline content and charge decoration (i.e. net charge per residue, fraction of charged residues and linear distribution of opposite charges) that were shown to be major determinants of IDP conformational properties^57,58,59,60,61,62. For an additional discussion of the observed discrepancy see Supplementary text.
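The theoretical value used in this comparison can be reproduced directly from Flory's power law with the parameters quoted earlier in this section (R₀ = 2.54 Å, ν = 0.522, N = 412). The short sketch below does the arithmetic; the function name is an assumption, and the uncertainties on R₀ and ν are ignored.

```python
def flory_rg(n_residues, r0=2.54, nu=0.522):
    """Radius of gyration (Angstrom) from Flory's power law, Rg = R0 * N**nu."""
    return r0 * n_residues ** nu


rg_idp = flory_rg(412)        # chain length of the NiV PNT construct
excess = 66.0 - rg_idp        # ensemble average minus theoretical value
```

The difference of roughly 7 Å is the discrepancy discussed in the paragraph above.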

A similar investigation was conducted by using a random pool generated by EOM 2.0 without NMR-based restraints (χ² = 0.745, P = 0.192) (Fig. 4A, red curve). EOM 2.0 yielded similar results, with randomness quantified as Ensemble_Rflex = ~92% (Random-Pool_Rflex = ~83%), such that no significant differences were observed when using different pools (Fig. 5).
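A flexibility score of this kind can be understood as a normalized Shannon entropy of the R_g distribution: a uniform distribution over all bins scores 1 (100%), and a single populated bin scores 0. The sketch below implements that idea. It is written in the spirit of R_flex, but the exact definition, binning and normalization used by EOM 2.0 are assumptions here and should be checked against the EOM documentation. The R_g values are hypothetical.

```python
import math


def entropy_flexibility(rg_values, n_bins, rg_min, rg_max):
    """Normalized Shannon entropy (0 to 1) of a binned Rg distribution."""
    counts = [0] * n_bins
    width = (rg_max - rg_min) / n_bins
    for rg in rg_values:
        # Clamp so that values on or beyond the edges fall in the end bins.
        index = min(max(int((rg - rg_min) / width), 0), n_bins - 1)
        counts[index] += 1
    total = len(rg_values)
    entropy = 0.0
    for count in counts:
        if count:
            p = count / total
            entropy -= p * math.log(p, n_bins)  # logarithm in base n_bins
    return entropy


# Hypothetical Rg values (Angstrom): one spread evenly, one fully peaked.
spread = entropy_flexibility([35.0, 45.0, 55.0, 65.0], 4, 30.0, 70.0)
peaked = entropy_flexibility([61.0, 62.0, 63.0, 64.0], 4, 30.0, 70.0)
```

Comparing the score of a selected ensemble with that of its starting pool, as done above, indicates whether selection broadened or narrowed the distribution.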

Although SAXS is poorly sensitive to secondary structure and although the presence of secondary structure elements has been reported to have a limited impact on protein compaction^57,63, we sought to investigate the impact of the occurrence of the N-terminal transiently populated α-helix on chain compaction. To this end, using Flexible-Meccano, we generated an initial pool in which an α-helical conformation was imposed on residues 1–30, with the experimental ncSSPs. Furthermore, we also investigated the final R_g distribution while imposing a kink centered at residue 20 (as observed in the crystal structure of the N⁰–P_1–50 complex) and providing NMR restraints. All the resulting sub-ensembles fit the scattering data equally well as judged from the obtained χ² values, and even for these two cases no differences were observed in terms of the resulting R_g distributions (Supplementary Figure S4). This probably reflects the fact that SAXS is a low-resolution technique that provides relevant information in terms of chain compactness but fails to capture secondary structure propensities. In addition, previous findings from others and us have highlighted that regular secondary structure does not overly contribute to protein compaction (i.e. more compact forms do not necessarily exhibit an increased content in regular secondary structure)^57,63. As such, it is not surprising that the presence of transiently populated α-helices in the initial ensemble does not have a significant impact on the final SAXS-derived sub-ensembles. Nevertheless, comparing SAXS-only ensembles to ensembles generated by the combined use of NMR and SAXS restraints is of interest given the growing interest in ensemble descriptions of IDPs. In addition, combining NMR and SAXS data makes it possible to at least partly overcome over-fitting, as already discussed by Mertens and Svergun⁶⁴.

Conclusions

Although IDPs/IDRs are notoriously prone to proteolytic degradation, we succeeded in generating a full-length NiV PNT sample stable enough to allow the acquisition of all the NMR spectra required for the assignment. Although the length of this intrinsically disordered domain is very challenging for NMR studies, the use of multidimensional NMR spectroscopy and of ¹³C-detected experiments allowed us to assign as much as 91% of the residues, including prolines. The combination of SAXS and NMR data led to ensemble models of NiV PNT showing its conformational heterogeneity.

The presence of such a large disordered region shared by both the V and P proteins is likely related to their promiscuity. Structural disorder is known to serve as a determinant of protein interactivity^65,66,67. Indeed V interacts with various cellular partners, such as DDB1¹⁰, STAT proteins^13,46,68, CRM1^46,69, PLK1⁷⁰ and, possibly, nuclear factors such as IRF3. All these interactions play a crucial role in the ability of the virus to counteract the antiviral innate immune response of the host. The P protein binds to multiple partners as well, including N and the L protein (for a review see⁷¹). Therefore, intrinsic disorder represents an ergonomic solution for the virus to encode fewer proteins with more functions while keeping the genome size as small as possible. It is therefore not surprising to find large IDRs in proteins having a broad molecular partnership, such as the Henipavirus V and P proteins, and, more generally, in viral proteins^72,73.

Intrinsic disorder also represents a strategy to alleviate constraints imposed by evolution on overlapping reading frames (as in the case of PNT, which overlaps with the C protein) and to buffer the deleterious effect of mutations (i.e. IDRs/IDPs are more tolerant of substitutions than globular proteins) (for reviews see^71,73). It is also well established that intrinsic disorder allows affinity and specificity to be uncoupled (i.e. it enables interactions with low affinity and decent specificity) (see^33,74,75 and references cited therein). However, the occurrence of residual disorder in complexes involving IDPs (in the form of fuzzy appendages) and partial preconfiguration of binding motifs before binding afford a way to attenuate the entropic penalty associated with the disorder-to-order transition, thereby ultimately modulating the binding affinity. The involvement of IDRs in protein–protein interactions that need to be finely tuned offers an exquisite means to modulate the strength of those interactions: by tuning the extent of preconfiguration of the binding motifs and/or the length of flanking fuzzy appendages, the virus can reach an optimal binding strength. It is therefore conceivable that the presence within NiV PNT of long disordered regions flanking binding motifs, including SLiMs, as well as binding sites for N⁰ or STAT proteins, may play a role in regulating these critical interactions. The flexibility of these flanking regions would make it possible to regulate both the exposure of binding sites specifically recognized by multiple viral and cellular partners, and the affinity of these interactions. Ultimately, this would result in their ability to orchestrate virus replication, through hijacking of cellular pathways and evasion of the IFN response.

The present results constitute an asset for obtaining atomistic information in future interaction studies involving NiV PNT. Indeed, the availability of the chemical shifts of full-length NiV PNT will be instrumental in mapping the residues involved in binding to partners such as DDB1 and/or STAT1 in the context of either the isolated PNT domain or the V protein. Once detailed information is available from interaction studies, inhibitors can be rationally conceived. Therefore, the present study can be regarded as a starting point towards the design of inhibitors abrogating the ability of this virus to escape the innate immune response. Given the high similarity between NiV and HeV PNT (56%), it is conceivable that the results of these studies could be extrapolated to some extent to HeV as well.

Methods

Protein expression and purification

The NiV PNT construct, encoding residues 1–406 of the NiV P protein with a C-terminal hexahistidine tag, has already been described¹⁴. Expression of unlabeled NiV PNT was carried out as previously described¹⁴.

Isotopically labeled (either ¹⁵N–¹³C or ¹⁵N) NiV PNT samples were prepared by growing transformed E. coli T7 cells (New England Biolabs, Ipswich, MA, USA) bearing the pRARE plasmid (Novagen, Madison, WI, USA) at 37 °C in LB medium supplemented with 100 μg mL⁻¹ ampicillin and 34 μg mL⁻¹ chloramphenicol. When the OD₆₀₀ reached 0.6, the culture was centrifuged at 4000 rpm for 10 min and the pellet was resuspended in ¼ of the initial volume of M9 medium (6 g L⁻¹ of Na₂HPO₄, 3 g L⁻¹ of KH₂PO₄, 0.5 g L⁻¹ of NaCl, 0.246 g L⁻¹ of MgSO₄) supplemented with 1 g L⁻¹ of ¹⁵NH₄Cl and 2 g L⁻¹ of either glucose or ¹³C-glucose. After one hour at 37 °C, IPTG was added to a final concentration of 0.5 mM, and the cells were subsequently grown at 37 °C for 3.5 h.

NiV PNT was purified as described¹⁴, except that 6 M GuHCl was added after the lysis step, both to denature bacterial proteases and to recover the recombinant protein from inclusion bodies as well, with the aim of improving protein stability and yield. After 1 h incubation at 4 °C, the sample was clarified and the supernatant was purified through immobilized metal affinity chromatography (IMAC). The fractions containing the recombinant protein were combined and then loaded onto a Superdex 75 h 16/60 column (GE Healthcare). The elution buffer was 10 mM sodium phosphate pH 6.5, supplemented with 5 mM EDTA and 5 mM DTT. The fractions containing the protein were collected and stored at −20 °C.

NMR spectroscopy experiments

Immediately before NMR studies, the NiV PNT sample was thawed, centrifuged at 12,000 rpm to remove any possible protein aggregates and then concentrated using 30 kDa Amicon Ultra Centrifugal Filters (Merck Millipore, Darmstadt, Germany). All NMR spectra for assignment were acquired on ¹³C–¹⁵N-isotopically enriched NiV PNT at a concentration of about 200 µM.

The ¹H-detected spectra were acquired at 288 K with a 22.3 T Bruker Avance III 950 NMR spectrometer; the ¹³C-detected spectra were acquired at 288 K with a 16.4 T Bruker Avance NEO 700 NMR spectrometer equipped with a cryogenically cooled probehead optimized for ¹³C-direct detection. Experimental details are reported in Supplementary Table S1.

All the heteronuclear relaxation experiments (R₁, R₂ and ¹H–¹⁵N NOEs) were acquired using ¹⁵N-labeled NiV PNT at about 100 µM. The spectra were recorded at 288 K on a 16.4 T Bruker Avance NEO 700 NMR spectrometer equipped with a cryogenically cooled triple resonance probehead. The ¹⁵N R₁ and R₂ experiments were acquired with 8 scans (2048 × 256 points) and a relaxation delay of 3.0 s. To determine the ¹⁵N R₁ the following delays were used: 20 ms, 60 ms, 120 ms, 180 ms, 250 ms, 400 ms, 500 ms, 600 ms, 750 ms, and 900 ms. To determine the ¹⁵N R₂ the following delays were used: 32 ms, 64 ms, 96 ms, 128 ms, 160 ms, 190 ms, 260 ms, 320 ms, 380 ms, 440 ms, and 500 ms. The ¹H–¹⁵N NOE experiments were acquired with 64 scans (2048 × 288 points) and a relaxation delay of 6.0 s.

The CLEANEX experiments were acquired on a 100 µM sample, with 16 scans (2048 × 248 points) and a relaxation delay of 3.0 s. The following delays were used: 5 ms, 10 ms, 20 ms, and 30 ms.

NMR data processing and analysis

NMR data sets were processed using the Bruker TopSpin 4.0.6 software. CARA⁷⁶ and its tool XEASY⁷⁷ were used to analyze and annotate the spectra.

The ¹⁵N relaxation rates (R₁ and R₂) were determined by fitting the cross-peak intensities, measured as a function of the variable delay, to a single-exponential decay using the Bruker Dynamic Center 2.4, available as stand-alone ancillary software of TopSpin by Bruker. ¹H–¹⁵N NOE values were obtained as the ratio between peak intensities in spectra recorded with and without ¹H saturation.
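The fitting step described above can be illustrated with a minimal sketch. It is not the algorithm used by the Bruker Dynamic Center, whose internals are not described here; it assumes a noise-free decay I(t) = I₀·exp(−R·t) and uses a simple log-linear least-squares fit. The function names, the synthetic intensities and the rate of 5 s⁻¹ are hypothetical and serve only to exercise the code with the R₂ delays listed in the Methods.

```python
import math


def fit_single_exponential(delays_s, intensities):
    """Log-linear least-squares fit of I(t) = I0 * exp(-R * t).

    Returns (I0, R), with R in s^-1. All intensities must be positive.
    This is an illustrative stand-in, not the vendor fitting routine.
    """
    n = len(delays_s)
    if n < 2 or n != len(intensities):
        raise ValueError("need at least two (delay, intensity) pairs")
    log_i = [math.log(i) for i in intensities]
    t_mean = sum(delays_s) / n
    y_mean = sum(log_i) / n
    sxx = sum((t - t_mean) ** 2 for t in delays_s)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(delays_s, log_i))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    return math.exp(intercept), -slope


def heteronuclear_noe(i_saturated, i_reference):
    """NOE value as the intensity ratio with / without 1H saturation."""
    return i_saturated / i_reference


# R2 delays from the Methods, converted from ms to s.
r2_delays = [d / 1000.0 for d in (32, 64, 96, 128, 160, 190, 260, 320, 380, 440, 500)]

# Synthetic, noise-free intensities for one hypothetical residue (R2 = 5 s^-1).
TRUE_R2 = 5.0
TRUE_I0 = 1.0e6
synthetic = [TRUE_I0 * math.exp(-TRUE_R2 * t) for t in r2_delays]

i0_fit, r2_fit = fit_single_exponential(r2_delays, synthetic)
print(f"fitted R2 = {r2_fit:.3f} s^-1, I0 = {i0_fit:.3e}")
```

On real, noisy peak intensities a weighted non-linear fit would normally be preferred, since taking logarithms distorts the error distribution at long delays.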

The secondary structure propensity (SSP) from heteronuclear chemical shifts was determined by using the neighbor corrected structural propensity calculator (ncSPC) tool³² available online at https://st-protein02.chem.au.dk/ncSPC. The Mulder random coil chemical shift library³¹ was chosen for the analysis, and the average window size was left to the standard value of 5.
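The general idea behind chemical-shift-based propensity scores, namely subtracting a random coil reference from each observed shift and smoothing over a sequence window, can be sketched as follows. This is not the ncSPC algorithm, which combines several nuclei and applies its own neighbor correction and scaling; the function names and all shift values below are hypothetical, and only the window size of 5 is taken from the text.

```python
def secondary_shifts(observed, random_coil):
    """Per-residue secondary shift: observed minus random coil (ppm).

    Unassigned residues are passed as None and stay None.
    """
    return [None if obs is None else obs - rc
            for obs, rc in zip(observed, random_coil)]


def window_average(values, window=5):
    """Centered moving average that skips unassigned (None) residues.

    Windows are truncated at the chain termini.
    """
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        chunk = [v for v in values[lo:hi] if v is not None]
        smoothed.append(sum(chunk) / len(chunk) if chunk else None)
    return smoothed


# Hypothetical C-alpha shifts (ppm) for a seven-residue stretch; one unassigned.
observed_ca = [58.9, 59.1, 58.7, None, 56.4, 56.1, 56.3]
random_coil_ca = [56.5, 56.6, 56.4, 56.2, 56.3, 56.2, 56.4]

delta = secondary_shifts(observed_ca, random_coil_ca)
smoothed = window_average(delta, window=5)
print(smoothed)
```

Positive smoothed Cα secondary shifts are commonly read as a tendency towards helical conformations, negative ones towards extended conformations.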

SAXS experiments and analysis

Samples were concentrated using 30 kDa Amicon Ultra Centrifugal Filters (Merck Millipore, Darmstadt, Germany) and loaded onto a Superdex 75 h 16/60 column using 20 mM Tris/HCl pH 8.0, 0.3 M NaCl, 5 mM DTT as elution buffer. Synchrotron X-ray scattering data were collected at the ESRF BM29 beamline (Grenoble) using a PILATUS 1M pixel detector (DECTRIS, Baden, Switzerland) at a sample-detector distance of 2.87 m and a wavelength of 0.0992 nm. This setup covers a range of momentum transfer of 0.028 < s < 4.525 nm⁻¹ (s = 4π sin(θ)/λ, where 2θ is the scattering angle). Samples were loaded using a robotic sample changer⁷⁸ and measured at 20 °C at concentrations ranging from 0.5 to 1.2 mg/mL. Ten independent frames of 1 s each were collected for each sample, and data were automatically reduced using an in-house pipeline. Further analysis was done using ATSAS 3.0.1⁷⁹. As no concentration dependence was observed (Supplementary Figure S5), the curve with the least noise (at 1.2 mg/mL) was used for the SAXS analysis. The forward scattering I(0) and the R_g were calculated using the Guinier approximation, assuming that, at very small angles (s < 1.3/R_g), the intensity is represented as I(s) = I(0) · exp(−(sR_g)²/3)^80,81. The forward scattering intensities were calibrated using water as reference. Linearity in the Guinier region was used to exclude sample aggregation, and the pair-distance distribution function, P(r), from which the D_max and the R_g were estimated, was computed using GNOM⁸². Qualitative assessment of compactness versus structural disorder was made by transforming the scattering profiles into the so-called normalised Kratky representation [(sR_g)² · I(s)/I(0) vs. sR_g]⁸³. The molecular mass (MM) was derived by placing the scattered intensity on an absolute scale using liquid water as calibrant.
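The Guinier analysis and the normalised Kratky transformation described above can be sketched in a few lines. This is an illustration of the formulas only, not the ATSAS implementation: it assumes data sorted by increasing s, fits ln I(s) against s² by ordinary least squares, and shrinks the fitting range until all points satisfy s·R_g < 1.3. The function names, the synthetic curve and the R_g of 6.0 nm are hypothetical.

```python
import math


def momentum_transfer(two_theta_rad, wavelength_nm):
    """s = 4*pi*sin(theta)/lambda, where 2*theta is the scattering angle."""
    return 4.0 * math.pi * math.sin(two_theta_rad / 2.0) / wavelength_nm


def _line_fit(xs, ys):
    """Ordinary least-squares slope and intercept of y = a*x + b."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean


def guinier_fit(s, intensity, s_rg_max=1.3, max_iter=50):
    """Fit ln I(s) = ln I(0) - (Rg^2 / 3) * s^2 on points with s*Rg < s_rg_max.

    Returns (I0, Rg, number_of_points_used). Assumes s is sorted ascending.
    """
    n_use = len(s)
    i0 = rg = None
    for _ in range(max_iter):
        xs = [v * v for v in s[:n_use]]
        ys = [math.log(i) for i in intensity[:n_use]]
        slope, intercept = _line_fit(xs, ys)
        if slope >= 0:
            raise ValueError("non-negative Guinier slope; Rg is undefined")
        rg = math.sqrt(-3.0 * slope)
        i0 = math.exp(intercept)
        # Keep at least three points so that the line fit stays defined.
        n_new = max(3, sum(1 for v in s if v * rg < s_rg_max))
        if n_new == n_use:
            break
        n_use = n_new
    return i0, rg, n_use


def normalised_kratky(s, intensity, i0, rg):
    """Return (s*Rg, (s*Rg)^2 * I(s)/I(0)) pairs."""
    return [(v * rg, (v * rg) ** 2 * i / i0) for v, i in zip(s, intensity)]


# Synthetic, noise-free Guinier-like curve with hypothetical parameters.
TRUE_RG_NM = 6.0
TRUE_I0 = 100.0
s_values = [0.03 + 0.01 * k for k in range(98)]  # nm^-1
intensities = [TRUE_I0 * math.exp(-((v * TRUE_RG_NM) ** 2) / 3.0) for v in s_values]

i0_fit, rg_fit, n_points = guinier_fit(s_values, intensities)
kratky = normalised_kratky(s_values, intensities, i0_fit, rg_fit)
print(f"Rg = {rg_fit:.3f} nm, I(0) = {i0_fit:.2f}, points used = {n_points}")
```

For experimental data the fit would also need to propagate the intensity errors and discard the lowest-angle points affected by the beam stop, which this sketch leaves out.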

Protein flexibility was quantified using EOM 2.0⁸⁴, which assumes the coexistence in solution of a range of Cα-only conformations whose average scattering intensity fits the experimental data. A genetic algorithm (GA) is used to select ensembles with varying numbers of conformers. The GA is repeated n independent times, and the ensemble with the lowest discrepancy is considered the best solution out of the n final ensembles. Furthermore, repetition of the GA allows the computation of R_g distributions, from which structural information about flexibility can be extracted. The width of the distribution is used to derive the flexibility of the particles, whereby a narrow distribution indicates a rather rigid particle and broader distributions are associated with higher flexibility. Using EOM 2.0, systematic quantification of the flexibility was made using the metric R_flex, which computes the Shannon information entropy of the distributions⁸⁴. Experimental error-independent goodness-of-fit was also confirmed using the software CorMap, which estimates the differences between one-dimensional spectra independently of explicit error estimates, using only data point correlations⁵⁶. All the software used for the SAXS data analysis is part of the ATSAS 3.0 package⁷⁹.
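The link between the width of an R_g distribution and an entropy-based flexibility score can be illustrated with a generic normalised Shannon entropy. This sketch does not reproduce the exact definition, binning or sign convention of R_flex in EOM 2.0; the function names, the bin settings and the R_g values are hypothetical.

```python
import math


def histogram(values, lo, hi, n_bins):
    """Count values falling in n_bins equal-width bins spanning [lo, hi]."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for v in values:
        if lo <= v <= hi:
            idx = min(int((v - lo) / width), n_bins - 1)
            counts[idx] += 1
    return counts


def normalised_entropy(counts):
    """Shannon entropy of a histogram, using log base n_bins.

    The result lies between 0 (all conformers in one bin, rigid-like)
    and 1 (uniform over all bins, maximally broad).
    """
    total = sum(counts)
    n_bins = len(counts)
    if total == 0 or n_bins < 2:
        raise ValueError("need a non-empty histogram with at least two bins")
    entropy = 0.0
    for c in counts:
        if c:
            p = c / total
            entropy -= p * math.log(p, n_bins)
    return entropy


# Hypothetical Rg values (nm) for a narrow and a broad selected ensemble.
narrow_rg = [5.9, 6.0, 6.0, 6.1, 6.0, 5.95, 6.05, 6.0]
broad_rg = [3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5, 10.5]

narrow_score = normalised_entropy(histogram(narrow_rg, 3.0, 11.0, 8))
broad_score = normalised_entropy(histogram(broad_rg, 3.0, 11.0, 8))
print(f"narrow: {narrow_score:.2f}, broad: {broad_score:.2f}")
```

In practice such a score is compared between the selected ensemble and the initial random pool, so that the pool provides the reference for what a fully flexible chain would look like.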

Accession numbers

The chemical shifts have been deposited in the Biological Magnetic Resonance Data Bank (BMRB) under the code 50370. SAXS data have been deposited in the Small Angle Scattering Biological Data Bank (SASBDB)⁸⁵ under code SASDJB5. The ensemble derived using SAXS constraints only and the one derived from the combined use of SAXS and NMR constraints have been deposited within the Protein Ensemble Database (PED-DB, https://proteinensemble.org/)⁸⁶ under accession numbers PED00176 and PED00177, respectively. All-atom PDB files, required for deposition in the PED-DB, were generated from EOM Cα-only PDB files using the PD2ca2main server (http://www.sbg.bio.ic.ac.uk/~phyre2/PD2_ca2main/)⁸⁷.

References

Wang, L. F. et al. The exceptionally large genome of Hendra virus: Support for creation of a new genus within the family Paramyxoviridae. J. Virol. 74, 9972–9979 (2000).
Marsh, G. A. et al. Cedar virus: A novel Henipavirus isolated from Australian bats. PLoS Pathog. 8, e1002836 (2012).
Bloyet, L. M. et al. HSP90 chaperoning in addition to phosphoprotein required for folding but not for supporting enzymatic activities of measles and Nipah virus L polymerases. J. Virol. 90, 6642–6656 (2016).
Yabukarski, F. et al. Structure of Nipah virus unassembled nucleoprotein in complex with its viral chaperone. Nat. Struct. Mol. Biol. 21, 754–759 (2014).
Park, M. S. et al. Newcastle disease virus (NDV)-based assay demonstrates interferon-antagonist activity for the NDV V protein and the Nipah virus V, W, and C proteins. J. Virol. 77, 1501–1511 (2003).
Fontana, J. M., Bankamp, B. & Rota, P. A. Inhibition of interferon induction and signaling by paramyxoviruses. Immunol. Rev. 225, 46–67 (2008).
Shaw, M. L., Garcia-Sastre, A., Palese, P. & Basler, C. F. Nipah virus V and W proteins have a common STAT1-binding domain yet inhibit STAT1 activation from the cytoplasmic and nuclear compartments, respectively. J. Virol. 78, 5633–5641 (2004).
Bruhn-Johannsen, J. F. et al. Crystal structure of the Nipah virus phosphoprotein tetramerization domain. J. Virol. 88, 758–762 (2014).
Habchi, J. et al. Characterization of the interactions between the nucleoprotein and the phosphoprotein of Henipaviruses. J. Biol. Chem. 286, 13583–13602 (2011).
Salladini, E., Delauzun, V. & Longhi, S. The Henipavirus V protein is a prevalently unfolded protein with a zinc-finger domain involved in binding to DDB1. Mol. Biosyst. 13, 2254–2267 (2017).
DeLano, W. L. The PyMOL molecular graphics system. Proteins: Struct. Funct. Bioinf. 30, 442–454 (2002).
Wootton, J. C. Non-globular domains in protein sequences: Automated segmentation using complexity measures. Comput. Chem. 18, 269–285 (1994).
Jensen, M. R. et al. Structural description of the Nipah virus phosphoprotein and its interaction with STAT1. Biophys. J. 118, 2470–2488 (2020).
Habchi, J., Mamelli, L., Darbon, H. & Longhi, S. Structural disorder within henipavirus nucleoprotein and phosphoprotein: From predictions to experimental assessment. PLoS ONE 5, e11684 (2010).
Blocquel, D., Beltrandi, M., Erales, J., Barbier, P. & Longhi, S. Biochemical and structural studies of the oligomerization domain of the Nipah virus phosphoprotein: Evidence for an elongated coiled-coil homotrimer. Virology 446, 162–172 (2013).
Karlin, D., Ferron, F., Canard, B. & Longhi, S. Structural disorder and modular organization in Paramyxovirinae N and P. J. Gen. Virol. 84, 3239–3252 (2003).
Li, T., Chen, X., Garbutt, K. C., Zhou, P. & Zheng, N. Structure of DDB1 in complex with a paramyxovirus V protein: Viral hijack of a propeller cluster in ubiquitin ligase. Cell 124, 105–117 (2006).
Karlin, D. & Belshaw, R. Detecting remote sequence homology in disordered proteins: Discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins. PLoS ONE 7, e31719 (2012).
Brutscher, B. et al. NMR methods for the study of intrinsically disordered proteins structure, dynamics, and interactions: General overview and practical guidelines. Adv. Exp. Med. Biol. 870, 49–122 (2015).
Felli, I. C. & Pierattelli, R. Recent progress in NMR spectroscopy: Toward the study of intrinsically disordered proteins of increasing size and complexity. IUBMB Life 64, 473–481 (2012).
Hiller, S., Fiorito, F., Wuthrich, K. & Wider, G. Automated projection spectroscopy (APSY). Proc. Natl. Acad. Sci. USA 102, 10876–10881 (2005).
Murrali, M. G. et al. ¹³C APSY-NMR for sequential assignment of intrinsically disordered proteins. J. Biomol. NMR 70, 167–175 (2018).
Bermel, W. et al. Protonless NMR experiments for sequence-specific assignment of backbone nuclei in unfolded proteins. J. Am. Chem. Soc. 128, 3918–3919 (2006).
Bermel, W. et al. H-start for exclusively heteronuclear NMR spectroscopy: The case of intrinsically disordered proteins. J. Magn. Reson. 198, 275–281 (2009).
Felli, I. C., Pierattelli, R., Glaser, S. J. & Luy, B. Relaxation-optimised Hartmann–Hahn transfer using a specifically Tailored MOCCA-XY16 mixing sequence for carbonyl–carbonyl correlation spectroscopy in ¹³C direct detection NMR experiments. J. Biomol. NMR 43, 187–196 (2009).
Mateos, B., Konrat, R., Pierattelli, R. & Felli, I. C. NMR characterization of long-range contacts in intrinsically disordered proteins from paramagnetic relaxation enhancement in ¹³C direct-detection experiments. ChemBioChem 20, 335–339 (2019).
Bermel, W. et al. High-dimensionality ¹³C direct-detected NMR experiments for the automatic assignment of intrinsically disordered proteins. J. Biomol. NMR 57, 353–361 (2013).
Solyom, Z. et al. BEST-TROSY experiments for time-efficient sequential resonance assignment of large disordered proteins. J. Biomol. NMR 55, 311–321 (2013).
Piai, A. et al. “CON–CON” assignment strategy for highly flexible intrinsically disordered proteins. J. Biomol. NMR 60, 209–218 (2014).
Marsh, J. A., Singh, V. K., Jia, Z. & Forman-Kay, J. D. Sensitivity of secondary structure propensities to sequence differences between alpha- and gamma-synuclein: Implications for fibrillation. Protein Sci. 15, 2795–2804 (2006).
Tamiola, K., Acar, B. & Mulder, F. A. Sequence-specific random coil chemical shifts of intrinsically disordered proteins. J. Am. Chem. Soc. 132, 18000–18003 (2010).
Tamiola, K. & Mulder, F. A. Using NMR chemical shifts to calculate the propensity for structural order and disorder in proteins. Biochem. Soc. Trans. 40, 1014–1020 (2012).
Habchi, J., Tompa, P., Longhi, S. & Uversky, V. N. Introducing protein intrinsic disorder. Chem. Rev. 114, 6561–6588 (2014).
Mateos, B. et al. The ambivalent role of proline residues in an intrinsically disordered protein: From disorder promoters to compaction facilitators. J. Mol. Biol. 432, 3093–3111 (2020).
Hwang, T. L., van Zijl, P. C. & Mori, S. Accurate quantitation of water-amide proton exchange rates using the phase-modulated CLEAN chemical EXchange (CLEANEX-PM) approach with a Fast-HSQC (FHSQC) detection scheme. J. Biomol. NMR 11, 221–226 (1998).
Chevelkov, V., Xue, Y., Rao, D., Forman-Kay, J. & Skrynnikov, N. ¹⁵N H/D-SOLEXSY experiment for accurate measurement of amide solvent exchange rates: Application to denatured drkN SH3. J. Biomol. NMR 46, 227–244 (2010).
Croke, R., Sallum, C., Watson, E., Watt, E. & Alexandrescu, A. Hydrogen exchange of monomeric alpha-synuclein shows unfolded structure persists at physiological temperature and is independent of molecular crowding in Escherichia coli. Protein Sci. 17, 1434–1445 (2008).
Smith, A., Zhou, L. & Pielak, G. Hydrogen exchange of disordered proteins in Escherichia coli. Protein Sci. 24, 706–713 (2015).
Bai, Y., Milne, J., Mayne, L. & Englander, S. Primary structure effects on peptide group hydrogen exchange. Proteins 17, 75–86 (1993).
Pontoriero, L., Schiavina, M., Murrali, M., Pierattelli, R. & Felli, I. Monitoring the interaction of α-synuclein with calcium ions through exclusively heteronuclear nuclear magnetic resonance experiments. Angew. Chem. Int. Ed. Engl. 59, 18537–18545 (2020).
Golovanov, A., Hautbergue, G., Wilson, S. & Lian, L. A simple method for improving protein solubility and long-term stability. J. Am. Chem. Soc. 126, 8933–8939 (2004).
Blobel, J. et al. Protein loop compaction and the origin of the effect of arginine and glutamic acid mixtures on solubility, stability and transient oligomerization of proteins. Eur. Biophys. J. 40, 1327–1338 (2011).
Milles, S. et al. An ultraweak interaction in the intrinsically disordered replication machinery is essential for measles virus function. Sci. Adv. 4, eaat7778 (2018).
Piovesan, D., Walsh, I., Minervini, G. & Tosatto, S. C. E. FELLS: Fast estimator of latent local structure. Bioinformatics 33, 1889–1891 (2017).
Jones, D. T. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195–202 (1999).
Rodriguez, J. J., Cruz, C. D. & Horvath, C. M. Identification of the nuclear export signal and STAT-binding domains of the Nipah virus V protein reveals mechanisms underlying interferon evasion. J. Virol. 78, 5358–5367 (2004).
Kumar, M. et al. ELM-the eukaryotic linear motif resource in 2020. Nucleic Acids Res. 48, D296–D306 (2020).
Van Roey, K. et al. Short linear motifs: Ubiquitous and functionally diverse protein interaction modules directing cell regulation. Chem. Rev. 114, 6733–6778 (2014).
Davey, N. E., Trave, G. & Gibson, T. J. How viruses hijack cell regulation. Trends Biochem. Sci. 36, 159–169 (2011).
Murrali, M. G., Piai, A., Bermel, W., Felli, I. C. & Pierattelli, R. Proline fingerprint in intrinsically disordered proteins. ChemBioChem 19, 1625–1629 (2018).
Schiavina, M. et al. Taking simultaneous snapshots of intrinsically disordered proteins in action. Biophys. J. 117, 46–55 (2019).
Bernado, P. & Svergun, D. I. Structural analysis of intrinsically disordered proteins by small-angle X-ray scattering. Mol. Biosyst. 8, 151–167 (2012).
Fitzkee, N. & Rose, G. Reassessing random-coil statistics in unfolded proteins. Proc. Natl. Acad. Sci. USA 101, 12497–12502 (2004).
Bernado, P. Effect of interdomain dynamics on the structure determination of modular proteins by small-angle scattering. Eur. Biophys. J. 39, 769–780 (2010).
Ozenne, V. et al. Flexible-meccano: A tool for the generation of explicit ensemble descriptions of intrinsically disordered proteins and their associated experimental observables. Bioinformatics 28, 1463–1470 (2012).
Franke, D., Jeffries, C. M. & Svergun, D. Correlation map, a goodness-of-fit test for one-dimensional X-ray scattering spectra. Nat. Methods 12, 419–422 (2015).
Marsh, J. A. & Forman-Kay, J. D. Sequence determinants of compaction in intrinsically disordered proteins. Biophys. J. 98, 2383–2390 (2010).
Mao, A. H., Crick, S. L., Vitalis, A., Chicoine, C. L. & Pappu, R. V. Net charge per residue modulates conformational ensembles of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA 107, 8183–8188 (2010).
Das, R. K. & Pappu, R. V. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc. Natl. Acad. Sci. USA 110, 13392–13397 (2013).
Das, R. K., Ruff, K. M. & Pappu, R. V. Relating sequence encoded information to form and function of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 32, 102–112 (2015).
Tedeschi, G. et al. Conformational response to charge clustering in synthetic intrinsically disordered proteins. Biochim. Biophys. Acta Gen. Subj. 1862, 2204–2214 (2018).
Bianchi, G., Longhi, S., Grandori, R. & Brocca, S. Relevance of electrostatic charges in compactness, aggregation, and phase separation of intrinsically disordered proteins. Int. J. Mol. Sci. 21, E6208 (2020).
Blocquel, D., Habchi, J., Gruet, A., Blangy, S. & Longhi, S. Compaction and binding properties of the intrinsically disordered C-terminal domain of Henipavirus nucleoprotein as unveiled by deletion studies. Mol. Biosyst. 8, 392–410 (2012).
Mertens, H. D. T. & Svergun, D. I. Combining NMR and small angle X-ray scattering for the study of biomolecular structure and dynamics. Arch. Biochem. Biophys. 628, 33–41 (2017).
Dunker, A. K., Cortese, M. S., Romero, P., Iakoucheva, L. M. & Uversky, V. N. Flexible nets. FEBS J. 272, 5129–5148 (2005).
Uversky, V. N., Oldfield, C. J. & Dunker, A. K. Showing your ID: Intrinsic disorder as an ID for recognition, regulation and cell signaling. J. Mol. Recognit. 18, 343–384 (2005).
Haynes, C. et al. Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput. Biol. 2, e100 (2006).
Rodriguez, J. J., Parisien, J. P. & Horvath, C. M. Nipah virus V protein evades alpha and gamma interferons by preventing STAT1 and STAT2 activation and nuclear accumulation. J. Virol. 76, 11476–11483 (2002).
Rodriguez, J. J., Wang, L. F. & Horvath, C. M. Hendra virus V protein inhibits interferon signaling by preventing STAT1 and STAT2 nuclear accumulation. J. Virol. 77, 11842–11845 (2003).
Ludlow, L. E., Lo, M. K., Rodriguez, J. J., Rota, P. A. & Horvath, C. M. Henipavirus V protein association with Polo-like kinase reveals functional overlap with STAT1 binding and interferon evasion. J. Virol. 82, 6259–6271 (2008).
Longhi, S., Bloyet, L. M., Gianni, S. & Gerlier, D. How order and disorder within paramyxoviral nucleoproteins and phosphoproteins orchestrate the molecular interplay of transcription and replication. Cell. Mol. Life Sci. 74, 3091–3118 (2017).
Uversky, V. N. & Longhi, S. Flexible Viruses: Structural Disorder in Viral Proteins (Wiley, New York, 2012).
Xue, B. et al. Structural disorder in viral proteins. Chem. Rev. 114, 6880–6911 (2014).
Bugge, K. et al. Interactions by disorder—A matter of context. Front. Mol. Biosci. 7, 110 (2020).
Olsen, J., Teilum, K. & Kragelund, B. Behaviour of intrinsically disordered proteins in protein–protein complexes with an emphasis on fuzziness. Cell. Mol. Life Sci. 74, 3175–3183 (2017).
Keller, R. The Computer Aided Resonance Assignment Tutorial (Cantina Verlag, Goldau, 2004).
Bartels, C., Xia, T. H., Billeter, M., Guntert, P. & Wuthrich, K. The program XEASY for computer-supported NMR spectral analysis of biological macromolecules. J. Biomol. NMR 6, 1–10 (1995).
Round, A. R. et al. Automated sample-changing robot for solution scattering experiments at the EMBL Hamburg SAXS station X33. J. Appl. Crystallogr. 41, 913–917 (2008).
Franke, D. et al. ATSAS 2.8: A comprehensive data analysis suite for small-angle scattering from macromolecular solutions. J. Appl. Crystallogr. 50, 1212–1225 (2017).
Guinier, A. La diffraction des rayons X aux tres petits angles; application a l’etude de phenomenes ultramicroscopiques. Ann. Phys. (Paris) 12, 161–237 (1939).
Guinier, A. & Fournet, F. Small Angle Scattering of X-rays (Wiley Interscience, New York, 1955).
Svergun, D. Determination of the regularization parameters in indirect-transform methods using perceptual criteria. J. Appl. Cryst. 25, 495–503 (1992).
Durand, D. et al. NADPH oxidase activator p67(phox) behaves in solution as a multidomain protein with semi-flexible linkers. J. Struct. Biol. 169, 45–53 (2010).
Tria, G., Mertens, H. D. T., Kachala, M. & Svergun, D. Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering. IUCrJ 2, 202–217 (2015).
Valentini, E., Kikhney, A. G., Previtali, G., Jeffries, C. M. & Svergun, D. I. SASBDB, a repository for biological small-angle scattering data. Nucleic Acids Res. 43, D357-363 (2015).
Lazar, T. et al. PED in 2021: A major update of the protein ensemble database for intrinsically disordered proteins. Nucl. Acid Res. (Database issue) in press, (2021).
Moore, B. L., Kelley, L. A., Barber, J., Murray, J. W. & MacDonald, J. T. High-quality protein backbone reconstruction from alpha carbons using Gaussian mixture models. J. Comput. Chem. 34, 1881–1889 (2013).

Acknowledgements

This work was carried out with the financial support of the CNRS and of the Fondazione CR Firenze to RP. E. S. was supported by a joint doctoral fellowship from the Direction Générale de l’Armement (DGA) and Aix-Marseille University. The authors acknowledge the support of the Italian government programme “MIUR—Dipartimenti di Eccellenza 2018-2022” to the Department of Chemistry “Ugo Schiff” of the University of Florence. The authors also acknowledge the support and the use of resources of Instruct-ERIC, a Landmark ESFRI, through the R&D pilot scheme and iNEXT (H2020 Grant # 653706) for providing access to the CERM research infrastructure. The authors are thankful to Julien Perard for his help in SAXS data collection, to the ESRF synchrotron for beamtime allocation and to Gerlind Sulzenbacher (AFMB lab) for efficiently managing the AFMB BAG.

Author information

These authors contributed equally: Marco Schiavina, Edoardo Salladini and Maria Grazia Murrali.

Authors and Affiliations

Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy
Marco Schiavina, Maria Grazia Murrali, Isabella C. Felli & Roberta Pierattelli
Department of Chemistry “Ugo Schiff”, University of Florence, Via della Lastruccia 3-13, 50019, Sesto Fiorentino, Italy
Marco Schiavina, Maria Grazia Murrali, Giancarlo Tria, Isabella C. Felli & Roberta Pierattelli
Florence Center for Electron Nanoscopy (FloCEN), University of Florence, Via della Lastruccia 3-13, 50019, Sesto Fiorentino, Italy
Giancarlo Tria
Lab. Architecture et Fonction des Macromolécules Biologiques (AFMB), UMR 7257, Aix-Marseille University and CNRS, 163 Avenue de Luminy, Case 932, Marseille, France
Edoardo Salladini & Sonia Longhi

Contributions

S.L., R.P., and I.C.F. conceived and planned the experiments. E.S. purified and labeled the protein for NMR and SAXS experiments, and performed all the SAXS measurements. E.S. and M.G.M. acquired all the NMR spectra. E.S., M.G.M. and M.S. performed the sequence-specific assignment and analyzed the data under the guidance of R.P. and I.C.F. G.T. processed and analyzed SAXS data. All the authors analyzed the data. S.L. wrote the first draft of the paper, and all the authors contributed to the writing.

Corresponding authors

Correspondence to Isabella C. Felli, Roberta Pierattelli or Sonia Longhi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Schiavina, M., Salladini, E., Murrali, M.G. et al. Ensemble description of the intrinsically disordered N-terminal domain of the Nipah virus P/V protein from combined NMR and SAXS. Sci Rep 10, 19574 (2020). https://doi.org/10.1038/s41598-020-76522-3

Received: 01 July 2020
Accepted: 27 October 2020
Published: 11 November 2020
DOI: https://doi.org/10.1038/s41598-020-76522-3
