Molecular evolution and genetic variations of V and W proteins derived by RNA editing in Avian Paramyxoviruses

Rao, Pachineella Lakshmana; Gandham, Ravi Kumar; Subbiah, Madhuri

doi:10.1038/s41598-020-66252-x

Article
Open access
Published: 12 June 2020

Pachineella Lakshmana Rao¹,
Ravi Kumar Gandham¹ &
Madhuri Subbiah¹

Scientific Reports volume 10, Article number: 9532 (2020)

A Publisher Correction to this article was published on 02 October 2020

This article has been updated

Abstract

The newly assigned subfamily Avulavirinae in the family Paramyxoviridae includes avian paramyxoviruses (APMVs) isolated from a wide variety of avian species across the globe. To date, 21 species of APMVs have been reported and their complete genome sequences are available in GenBank. The APMV genome is a single-stranded, negative-sense, non-segmented RNA comprising six transcriptional units (except APMV-6 with seven units), each coding for a structural protein. Additionally, by co-transcriptional RNA editing of the phosphoprotein (P) gene, two mRNAs coding for accessory viral proteins, V and W, are generated along with the unedited P mRNA. However, in APMV-11, the unedited mRNA codes for V protein while the +2 edited mRNA translates to P protein, similar to members of subfamily Rubulavirinae in the same family. Such RNA editing enables paramyxoviruses to maximize the coding capacity of their small genome. The three proteins of the P gene, P, V and W, share an identical N terminal but varied C terminal sequences that contribute to their unique functions. Here, we analyzed the P gene editing site and the V and W sequences of all 21 APMV species known so far (55 viruses) using bioinformatics and report their genetic variations and molecular evolution. The variations observed in the sequence and hexamer phase positions of the P gene editing sites are likely to influence the levels and relative proportions of P, V and W protein expression, which could explain the differences in the pathogenicity of APMVs. The V protein sequences of APMVs had conserved motifs similar to V proteins of other paramyxoviruses, including the seven cysteine residues involved in MDA5 interference, STAT1 degradation and interferon antagonism. Conversely, W protein sequences of APMVs were distinct. Higher sequence homology was observed in both V and W proteins between strains of the same species than between species, except in APMV-3, which was the most divergent APMV species. 
The estimates of synonymous and non-synonymous substitution rates suggested negative selection pressure on the V and W proteins within species indicating their low evolution rate. The molecular clock analysis revealed higher conservation of V protein sequence compared to W protein indicating the important role played by V protein in viral replication, pathogenesis and immune evasion. However, we speculate the genetic diversity of W proteins could impact the degree of pathogenesis, variable interferon antagonistic activity and the wide host range exhibited by APMV species. Phylogenetically, V proteins of APMVs clustered into three groups similar to the recent classification of APMVs into three new genera while no such pattern could be deciphered in the analysis of W proteins except that strains of same species grouped together. This is the first comprehensive study describing in detail the genetic variations and the molecular evolution of P gene edited, accessory viral proteins of Avian paramyxoviruses.

Introduction

Avian paramyxoviruses (APMVs) are a group of paramyxoviruses known to infect a variety of bird species across the globe. To date, 21 species of APMVs have been identified and the list is expected to grow with increased viral surveillance in wild and domesticated birds. Under the recent ICTV 2019 classification, these viruses belong to three genera, Metaavulavirus (APMV-2, -5, -6, -7, -8, -10, -11, -14, -15 and -20), Orthoavulavirus (APMV-1, -9, -12, -13, -16, APV-A, APV-B and APV-C) and Paraavulavirus (APMV-3 and -4), within the new subfamily Avulavirinae under family Paramyxoviridae in the order Mononegavirales¹. APMV-1 to -9 were isolated before 1980, APMV-10 to -13 were identified by 2015 and all the other APMVs were reported in recent years².

Avian paramyxoviruses are enveloped viruses with a single-stranded, non-segmented, negative-sense RNA genome of 13 to 17 kb². The prototype virus, APMV-1 of genus Orthoavulavirus, also known as Newcastle disease virus (NDV), is the most extensively studied virus in this group. NDV causes a severe, economically important disease in poultry. There are five pathotypes of NDV based on the clinical signs exhibited by infected chickens: (a) viscerotropic velogenic or highly virulent, pantropic NDV causing severe mortality; (b) neurotropic velogenic or highly virulent NDV, specifically causing neurological illness and high mortality; (c) mesogenic or moderately virulent NDV, with mortality as high as 50% and reduced egg production; (d) lentogenic NDV of either respiratory or enteric type, with low virulence and causing a small reduction in egg production; and (e) asymptomatic or avirulent NDV. However, the pathotype classification is not always clear-cut³. More than a thousand strains of NDV have been isolated and sequenced, and they exhibit a wide spectrum of virulence. The viral RNA genome carries six genes arranged in tandem, each coding for a structural protein, in the order 3’-N-P-M-F-HN-L-5’^4,5. N is the nucleocapsid protein; in most paramyxoviruses each N protomer binds exactly 6 nucleotides of genomic and antigenomic RNA, thus imposing a hexamer phase on the entire RNA genome. In nature, the genome length of paramyxoviruses is polyhexameric (6n + 0), which is necessary for efficient replication; this is called the ‘rule of six’^6,7. 
N together with P, phosphoprotein and L, large polymerase protein, forms viral RNA dependent RNA polymerase complex essential for viral genome transcription and replication; M, matrix protein, is seen within the envelope, aids in virus assembly and budding; two viral glycoproteins, F, fusion protein and HN, hemagglutinin-neuraminidase protein, are studded on the envelope and assist with fusion of virus with host membrane and receptor binding, respectively⁴. NDV F protein is known as the virulence determinant; the virulent strains have unique multiple basic amino acids, at least three arginine (R) or lysine (K) residues, at fusion protein cleavage site starting at amino acid position 113, and a phenylalanine residue at position 117³. APMV-6 is also known to express an additional small hydrophobic (SH) protein from SH gene located between F and HN genes⁸. Further, by co-transcriptional RNA editing of P gene, two mRNAs, V and W are expressed^4,9,10,11,12. Also, in certain paramyxoviruses, by a process of alternative transcription initiation in P gene (+1 reading frame), accessory C proteins are generated^13,14. Thus by these mechanisms, paramyxoviruses are able to efficiently utilize over 95% of their small RNA genome for expression of viral proteins¹².
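
The ‘rule of six’ and the hexamer phase described above reduce to simple modular arithmetic. The following is a minimal Python sketch, not a method used in this study. The function names are hypothetical, and it assumes that hexamers are counted contiguously from nucleotide 1 of the genome; the lengths and position used are illustrative values only.

```python
def follows_rule_of_six(genome_length: int) -> bool:
    """Return True when the genome length is polyhexameric (6n + 0)."""
    return genome_length > 0 and genome_length % 6 == 0


def hexamer_phase(position: int) -> int:
    """Return the phase (1 to 6) of a 1-based nucleotide position.

    Assumption: positions 1-6 form the first hexamer, 7-12 the second,
    and so on, counted from the first nucleotide of the genome.
    """
    if position < 1:
        raise ValueError("position must be 1-based and positive")
    return (position - 1) % 6 + 1


# Illustrative values only (not taken from Table 1).
print(follows_rule_of_six(15186))  # 15186 = 6 * 2531
print(follows_rule_of_six(15187))  # one nucleotide too long
print(hexamer_phase(2285))         # phase of a hypothetical position
```

A genome whose length fails this check, as reported later for APV-A, -B and -C, has no well-defined hexamer phase under this counting scheme.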

The P gene carries a slippery sequence, a stretch of adenosine (A) and guanosine (G) nucleotides called the ‘editing site’, where 1 or 2 G nucleotides are inserted during transcription of the P gene by the stuttering viral polymerase, which reiteratively reads the template base^12,15. In NDV, addition of a single G nucleotide leads to a +1 frameshift in the ORF, generating V mRNA with a frequency of 25 to 35%; addition of two G nucleotides leads to a +2 frameshift, generating W mRNA with a frequency of 2 to 8.5%; and the unedited mRNA (60-70%) codes for P protein^12,16. Among the paramyxoviruses, members of the subfamily Rubulavirinae and APMV-11 of genus Metaavulavirus encode V protein from their unedited transcript, while P protein is coded by a +2 frameshift and W protein by a +1 frameshift^17,18. The three proteins translated from the P gene mRNAs (P, V and W) share a common N terminal sequence and differ in both length and amino acid composition in their C terminal regions. Their specific functions are dictated by their unique C terminal sequences. Studies on the V protein of APMV-1 and other paramyxoviruses have revealed that it is multifunctional: it targets STAT1 for degradation, interferes with MDA5, acts as an interferon antagonist^19,20,21,22, inhibits apoptosis^23,24, assists in viral replication^20,25 and plays important roles in tissue tropism, virulence determination^11,20,26 and host range restriction²¹. In contrast, there is very limited information about the function of W protein. The W protein of Nipah virus has been shown to affect viral pathogenesis and to help the virus evade host immunity^27,28,29. In APMV-1, the nuclear localization of W protein and its incorporation into the virion have been recently reported^16,30.
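
The effect of G insertions on the reading frame can be illustrated with a toy transcript. The sketch below is a hedged illustration only: the 18-nucleotide sequence, the insertion index and the function names are hypothetical and do not come from any APMV genome. The mRNA is written in the DNA alphabet (T in place of U), and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def insert_g(mrna: str, site: int, n_g: int) -> str:
    """Insert n_g non-templated G residues at index `site` of the mRNA."""
    return mrna[:site] + "G" * n_g + mrna[site:]


def translate(mrna: str) -> str:
    """Translate from the first nucleotide until a stop codon or the end."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy transcript: ATG AAA AAG GGC ACC TGA
TOY_MRNA = "ATGAAAAAGGGCACCTGA"
EDIT_SITE = 11  # index just after the G run (assumed for illustration)

for n_g in (0, 1, 2):
    print(n_g, translate(insert_g(TOY_MRNA, EDIT_SITE, n_g)))
```

In this toy case all three products begin with the same residues up to the insertion point and diverge afterwards, mirroring the shared N terminal and unique C terminal regions of P, V and W.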

The complete genome sequences of all 21 species of APMV have been described individually^17,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62. A comprehensive comparative analysis of the complete genomes and structural genes of 20 species of avulaviruses has been recently published². Phylogenomic analysis of the 20 APMVs identified three clades: clade I included APMV-2, -5, -6, -7, -8, -10, -11, -14, -15 and -20 (currently classified under the new genus Metaavulavirus), clade II comprised APMV-1, -9, -12, -13, -16, APV-A, -B and -C (currently assigned to genus Orthoavulavirus) and clade III included APMV-3 and -4 (currently under the new genus Paraavulavirus)^1,2. One of the viruses, previously classified as APMV-17 (South Korean “avian paramyxovirus 17”), has now been proposed as a separate species (APMV-21) based on phylogenies of complete genomes, complete F and L genes, and PASC and STD analysis^2,63. Nevertheless, very little is known about the P gene edited accessory viral proteins of APMVs. We examined 55 viruses belonging to all 21 APMV species identified to date and discuss here the genetic diversity and molecular evolution of the P gene edited proteins, V and W.

Materials and methods

Sequence information

The full length sequences of P gene available for all the 21 species of APMVs were obtained from National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/). Additionally, publications reporting the complete genome sequence of these viruses were also referred for identification of P gene editing site, for prediction of sequences of V and W proteins. A total of 55 viruses belonging to 21 species of APMVs were analyzed in this study which included (their GenBank accession numbers are provided in Table 1) four strains each of avirulent APMV-1, moderately virulent APMV-1 and highly virulent APMV-1; eight isolates of APMV-6; five isolates each of APMV-2 and APMV-8; four isolates of APMV-10; three isolates of APMV-13; two isolates each of APMV-3, APMV-4, APMV-5; one isolate each of APMV-7, APMV-9, APMV-11, APMV-12, APMV-14, APMV-15, APMV-16, APMV-20, APMV-21 and APV-A, APV-B and APV-C. Detailed information of these isolates along with metadata such as hosts, year and location of isolation in addition to their sequence information are provided in Table 1. The complete sequences of P gene ORF, V proteins and W proteins used in this study that were either directly collected from NCBI or derived by prediction using DNASTAR software suite are provided in Supplementary Files S1, S2 and S3.

Table 1 Detailed information on V and W proteins of APMV species and strains analyzed in this study.

Sequence alignment, comparison and prediction of conserved motifs/domains

Multiple sequence alignments of V and W proteins were performed using the TCOFFEE multiple alignment algorithm, mode ‘expresso’ and the sequence similarities were colored through ESPript^64,65,66. All residues/amino acid positions mentioned in the results and discussion correspond to APMV-1 strain KJ808820.1, the strain that appears first in the alignment file. Individual sequences were also analyzed in NCBI’s interface, conserved domain (CD)-search⁶⁷ and aligned sequences were run in DREME version 5.0.5 software⁶⁸ to identify conserved motifs/domains. The intraclade amino acid percentage identity was estimated using Megalign software from DNASTAR.
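
As an illustration of what a pairwise percentage identity measures, a minimal sketch follows. It is not the Megalign implementation, whose exact formula may differ. The assumptions here are that the two sequences are already aligned to equal length, that `-` marks a gap, and that columns with a gap in either sequence are excluded from the comparison; the function name and the example sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over gap-free aligned columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        matches += a == b
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned fragments: 5 gap-free columns, 4 of them identical.
print(percent_identity("MKKG-T", "MKRGAT"))
```

Other conventions (for example, dividing by the full alignment length) give lower values for gapped alignments, so identities from different tools are not always directly comparable.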

Prediction of Nuclear Localization Signal (NLS) and Nuclear Export Signal (NES) in V and W proteins

The nuclear localization signal (NLS) in V and W proteins of APMV species were identified using online tool, cNLS mapper with a cut-off score of 5.0 that predicted NLS specific to the importin αβ pathway⁶⁹. The presence of nuclear export signal (NES) in V and W proteins of APMV species was predicted using online tool, NetNES 1.1 server that predicted leucine-rich NES using a combination of neural networks and hidden Markov models⁷⁰ and using LocNES that predicted the classical NESs in CRM1 cargoes⁷¹.

Phylogenetic analysis and evolutionary divergence

Phylogenetic analysis was performed using MEGA7 software. For drawing the phylogenetic trees, evolutionary history was inferred by using the Maximum Likelihood method with JTT matrix-based model⁷² for V proteins and Dayhoff matrix based model for W proteins⁷³. For drawing the phylogenetic tree of V proteins, bootstrap consensus tree inferred from 500 replicates was taken to represent the evolutionary history of the taxa analyzed⁷⁴. Branches corresponding to partitions, reproduced in less than 80% bootstrap replicates, were collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. Initial trees for heuristic search were obtained automatically by applying Neighbor-Joining and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then topology with superior log likelihood value was selected. A discrete gamma distribution was used for V proteins tree to model evolutionary rate differences among sites (16 categories (+G, parameter = 2.4693)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 5.67% sites). For drawing the phylogenetic tree of W proteins, a discrete gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.1488)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 0.70% sites). Analysis of both the trees involved 55 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 141 and 71 positions in the final dataset for drawing V and W proteins’ phylogenetic trees, respectively.

The estimates of evolutionary divergence over sequence pairs between groups were analyzed for V and W proteins of all 21 APMV species using MEGA7⁷⁵. Briefly, based on the maximum likelihood fits of 56 different amino acid substitution models, the final analyses were conducted in MEGA7 using JTT matrix-based model⁷² for V proteins and Dayhoff matrix based model⁷³ for W proteins. The rate variation among sites was modeled with a gamma distribution (shape parameter = 1). The analysis included 55 amino acid sequences. All positions containing gaps and missing data were eliminated.

Selection pressure analysis

The number of nonsynonymous substitutions per nonsynonymous site (dN), the number of synonymous substitutions per synonymous site (dS), and the dN/dS ratios for the nucleotide sequences of V and W proteins of all 21 species were analyzed for the entire sequence and separately for their shared N terminal and unique C terminal regions. The shared N-terminal portion of the three proteins was taken as the sequence up to the RNA editing site (KKG motif). The C terminal regions of V and W proteins of all 21 species were taken as the sequence after the RNA editing site (KKG motif). The dN/dS ratios of the nucleotide sequences of the 21 APMV species were estimated with DnaSP v6.12.03 software⁷⁶. A protein was considered to be under positive (diversifying) selection when the dN/dS ratio was >1 and under negative (purifying) selection when the dN/dS ratio was <1.
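
The decision rule stated above can be written as a short function. This is a sketch of the interpretation step only; it does not reproduce how DnaSP estimates dN and dS, and the function name and input values are hypothetical.

```python
def classify_selection(dn: float, ds: float) -> str:
    """Classify selection pressure from dN and dS using the dN/dS ratio."""
    if ds <= 0:
        raise ValueError("dS must be positive for the ratio to be defined")
    omega = dn / ds
    if omega > 1:
        return "positive (diversifying)"
    if omega < 1:
        return "negative (purifying)"
    return "neutral"


# Hypothetical values, not taken from Table 6.
print(classify_selection(0.02, 0.10))
print(classify_selection(0.30, 0.10))
```

When dS is zero, as can happen for nearly identical strains, the ratio is undefined and the function raises an error rather than returning a class.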

Evolutionary rate analysis

To estimate the evolutionary rates of V and W nucleotide sequences in different APMV species, substitution rate analysis was performed with BEAST v1.10.4 software⁷⁷. The GTR substitution model with the G + I site heterogeneity model was identified as the best fit by MEGA7 and was used to estimate the substitution rates of V and W sequences. The coalescent constant-size tree prior was used for individual species and for all species combined. The uncorrelated relaxed clock with a lognormal distribution was implemented. An MCMC chain length of 4 × 10⁸ cycles was used so that the ESS value exceeded 200, indicating convergence, except for W proteins of APMV-8 strains, where an MCMC chain length of 2 × 10⁸ cycles was used. The final data analysis was performed using Tracer v1.7.1 software.

Results

RNA editing site and prediction of V and W protein sequences

Previous reports suggested identical P gene RNA editing sites for APMV-1, -2, -4, -5, -6, -7, -8, -9, -10, -12, -13, -15, -20, APV-A, -B and -C and varied P gene editing site sequences for APMV-3, -11, -14 and -16. We observed the following conserved patterns in the RNA editing sequences among APMVs: U₃C₆ for APMV-4; U₄C₄ for APMV-14; U₄C₅ for APMV-20; U₅C₃ for APMV-1 (all strains except KJ736742.1 and KJ808820.1), APMV-8 and APMV-10; U₅C₄ for APMV-1 (strains KJ736742.1 and KJ808820.1) and APMV-2; U₅C₆ for APMV-15; U₆C₃ for APMV-5 (strain GU206351.1), APMV-7, -9, -12, -13, -16, -19, -21; U₆C₂ for APMV-5 (strain LC168750.1); U₆C₄ for APMV-6, -17, -18 and UUCUUC₅ for APMV-11. Further, variations were observed in the cis-acting sequence at the editing site: the sequences immediately upstream of the editing site were ³’AA in APMV-2, -3, -4, -5, -7, -8, -10, -11, -14, -15, -17, -18, -19, -20 and ³’GA in APMV-1, -9, -12, -13, -16 and -21 (all orthoavulaviruses), while ³’AG was conserved among APMV-6 strains.
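
The run-length notation used above (for example U₅C₃) can be derived mechanically from a template sequence. The sketch below is illustrative only: the function name is hypothetical, the input strings are constructed to match the patterns named in the text rather than copied from GenBank records, and plain digits stand in for the subscripts.

```python
from itertools import groupby


def run_notation(template: str) -> str:
    """Collapse consecutive identical bases into base-plus-count form."""
    return "".join(
        f"{base}{len(list(run))}" for base, run in groupby(template.upper())
    )


# Constructed examples matching patterns named in the text.
print(run_notation("UUUUUCCC"))    # five U followed by three C
print(run_notation("UUCUUCCCCC"))  # interrupted U run, as in APMV-11
```

For the interrupted APMV-11 site this notation spells out every run, whereas the text abbreviates only the final C run (UUCUUC₅).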

The hexamer phase of the start of the template C run in the P gene editing site (Table 1) revealed that this position was conserved within each species except in APMV-3 and for one strain of APMV-6 (KT962980.1). The hexamer phasing positions for APV -A, -B, -C were not determined as their genome lengths did not conform to ‘rule of six’. The start of the C run was at hexamer position 1 for APMV-20; at hexamer position 2 for APMV -4, -11 and -14; at hexamer position 3 for APMV -2, -3 (EU782025), -5, -8 and -10; at hexamer position 4 for APMV-3 (EU403085) and APMV-7; at hexamer position 5 for APMV-1, one strain of APMV-6 (KT962980.1), APMV -9, -12, -13, and -21 and at hexamer position 6 for APMV -15, -16 and all strains of APMV-6 except one strain (KT962980.1).

All APMVs except APMV-11 expressed P protein from unedited mRNA. The editing site of APMV-11 resembled that of other paramyxoviruses that insert 2G for generating P mRNA or the ‘genomic V’ viruses¹². The P protein of APMV-11 is derived from 2G nucleotides insertion, W protein from single G nucleotide addition and unedited mRNA expresses the V protein¹⁷. The V and W protein sequences of the other 20 APMV species were predicted by insertion of single G and two G nucleotides at the P gene RNA editing site, respectively.

Amino acid sequence analysis: percentage identity and conservations

The V and W protein sequences of all 21 APMV species shared a common N terminal region with P protein. Amino acid sequence variation was lowest within the first 60 amino acids of the N terminal region. The N-terminal portions of P, V and W proteins of metaavulaviruses and orthoavulaviruses showed closer identity than those of paraavulaviruses. In ortho- and paraavulaviruses, the N- and C-terminal regions showed homology of up to 93%. Metaavulaviruses showed 100% identity in all their C- and N-terminal regions. The C terminal region of W protein sequences showed 0.0–100% identity, as both the amino acid composition and the length of the C-terminal portion varied more among species (Table 2). As described previously, the soyuz1 and soyuz2 motifs were observed within the N terminal region in all APMVs except APMV-3 strains⁷⁸. Additionally, conserved domains (CD) were predicted in APMV-1 mesogenic strain Komarov (CD between aa 25 and 167) for large tegument protein UL36 (superfamily member PHA03247), in APMV-14 (CD between aa 31 and 144) for gene regulated by oestrogen in breast cancer, GREB1 (superfamily member pfam15782), and in APMV-21 (CD between aa 53 and 120) for tumor necrosis factor receptor superfamily member cd13415.

Table 2 Intraclade percentage amino acid sequence identity for P-gene products in APMV species.

Comparison of V protein sequences of APMV species

The V protein of APV-C was the shortest (221 aa, MW: 23.73 kDa) and that of APMV-21 was the longest (304 aa, MW: 31.98 kDa) among the 21 APMV species. The length of V protein (in aa) conserved within species was as follows: APMV-2 strains (232 aa), APMV-4 strains (224 aa), APMV-5 strains (277 aa), APMV-8 strains (238 aa), APMV-10 strains (246 aa) and APMV-13 strains (241 aa). However, in APMV-1, -3 and -6, variation in the length of V protein was observed between strains within the same species. Between species, the similarity in the V protein length was observed as follows: V protein length of 252 aa was observed in APMV-14, -15 and one strain of APMV-3; APMV-11 and -5 showed 277 aa long V protein; APMV -9 and -20 had 263 aa long V protein while the V protein length of APMV-16 and two lentogenic strains of APMV-1 were 245 aa. The lowest amino acid identity (10.4%) was observed between V proteins of APMV-3 strain Wisconsin and APV-A. The lowest amino acid divergence was noticed between APV-B and APV-C (70.4) and both showed identity of 53.7% at amino acid level which was the highest between APMV species (Supplementary File S4).

The multiple sequence alignment of V protein sequences of APMV species revealed higher amino acid conservation in both the N terminal region (mainly in the first 60 amino acids) and the C terminal region (Fig. 1). All viruses in this study had the following conserved motifs similar to V proteins of other known paramyxoviruses: (i) the KKG motif in the N terminal region, which is the coding sequence at the P gene mRNA editing site (residues 132-134, corresponding to APMV-1 strain KJ808820.1, the first strain in the alignment file), except in APMV-3, APMV-4, APMV-12, APMV-13, APMV-14 and APMV-20; (ii) the HRRE motif (residues 177-180, corresponding to APMV-1 strain KJ808820.1); (iii) the WCNP motif (residues 195-198, corresponding to APMV-1 strain KJ808820.1); and (iv) the conserved cysteine-rich domain with seven cysteine residues.

Interestingly, the following amino acids were also conserved in the C terminal region in the majority of APMV species, with a few exceptions. Proline residues at five positions: (a) position 175 in ten species (except in APMV-3, -4, -6, -7, -8, -9, -10, -11, -15, -20 and -21), (b) position 198 in all APMVs, (c) position 202 (except in APMV-4), (d) position 207 in all APMVs and (e) position 218 in all APMVs. Glycine residues at three positions: (a) position 176 (except in APMV-4 and -6), (b) position 188 (except in certain strains of APMV-1 and in APMV-7, -9, -11, -12, -13, -14, -15, -16, APV-A and -B) and (c) position 215 (except in a single lentogenic strain of APMV-1, KM885162, all strains of APMV-3 and all strains of APMV-10; interestingly, in all these viruses the glycine residue was replaced with an arginine residue). Serine residues at two positions: (a) position 182 (except in APMV-4, -11, -20) and (b) position 194 (except in APMV-3, -4, -5, -6, -12, -14). Arginine residue at position 208 (except in APMV-3, -4, -5, -7, -8, -11, -13, -16, APV-A, -B and -C). Leucine residue at position 223 (except in APMV-5, -6, -7, -8 and -11). Aspartic acid residue at position 227 (except in APMV-2, -3, -7, -9, -11 and -21).

The percentage amino acid conservation in the C terminal region of V proteins of all 21 APMV species was between 30% (in the longest V protein, that of APMV-21) and 48% (in the shortest V protein, that of APV-C). NLS were predicted in V proteins of APMV-5 and APMV-20 by cNLS mapper with a cut-off score of 5.0. NES were identified only in APMV-5 strains (Table 3b).

Table 3 Predicted nuclear localization (NLS) and nuclear export signals (NES) in W (3a) and V (3b) proteins of 21 APMV species.

Comparison of W protein sequences of APMV species

The length of W protein varied between 125 and 227 amino acids (aa) with calculated molecular weights between 13.30 kDa and 24.38 kDa. APMV-3 strain Netherland and APMV-7 had the shortest W protein (125 aa) and two strains of APMV-1, mesogenic strain KX761866.1 and velogenic strain KJ808820.1 (227 aa) had the longest W protein.

The length (in aa) of W protein was conserved among all the strains of APMV-2 (207 aa), all the strains of APMV-4 (137 aa), all the strains of APMV-5 (187 aa), all the strains of APMV-10 (172 aa) and all the strains of APMV-13 (150 aa). Also, similarity in the W protein length was noticed between the following species: W protein length of 172 aa was observed in all strains of APMV-10 and all strains of APMV-8 except strain FJ215863.2; W protein length of 177 aa was deduced in one strain of APMV-1 (JQ015296.1) and 3 strains of APMV-6 (EU622637.2, AY029299.1, EF569970.1), while the W protein length of APMV-12 and APMV-5 were 187 aa. Variations in the W protein length between strains within the same species were observed in APMV-1 (227, 221, 196, 183, 179, 177, 137 aa), APMV-3 (125, 127 aa), APMV-6 (157, 162, 177, 197 aa) and APMV-8 (172, 203 aa). APMV-1 strains analyzed in this study, had the longest unique C terminal region when compared to other APMV species (Table 1 and Fig. 2).

The lowest amino acid percentage identity (2.4) was observed between W proteins of APMV-12 and APMV-3 strain Netherland. Incidentally, APMV-3 strain Netherland also showed the lowest amino acid identity with W proteins of other APMV species. The lowest amino acid divergence (83.7) was noticed between APMV-1 isolate HN1007 (KX761866.1) and APMV-16, their W protein amino acid identity was 48.6% which was the highest homology observed between APMV species (Supplementary File S5).

The NLS were identified in five out of the twelve strains of APMV-1, in one of the eight strains of APMV-6 (GQ406232.1) and in APMV-9 in their C terminal region while NLS in W protein of APMV-20 was observed in the N terminal region (shared with P and V proteins). The presence of NES were predicted in these viruses except in one strain of APMV-1 (JQ015296.1) and APMV-20 (Table 3a).

Phylogenetic tree and evolutionary distance analysis

Based on their V protein sequences, the APMV species formed three distinct phylogenetic groups: group 1 consisted of APMV-3 strains, group 2 consisted of APMV-1, -9, -12, -13, -16, -21, APV-A, -B and -C (all orthoavulaviruses) and group 3 consisted of APMV-2, -5, -6, -7, -8, -10, -11, -14, -15, -20 and -4 (all metaavulaviruses and one paraavulavirus) (Fig. 3). The highest evolutionary divergence of 2.20 was observed between APMV-3 & APMV-4 and APMV-3 & APMV-12, followed by a divergence value of 1.91 between APMV-3 & APMV-5 and APMV-3 & APMV-11. The lowest divergence was between APV-A and APV-B (0.47), followed by APMV-9 & APMV-21 (0.50). The distance between strains of the same species was greatest in APMV-3 (0.4), followed by APMV-1 (0.297), consistent with their lower percentage of amino acid homology (Table 4).

Table 4 Estimates of Evolutionary Divergence over Sequence Pairs between Groups, analyzed for V proteins of 21 APMV species.

The phylogenetic tree obtained from the analysis of W protein sequences showed clustering of strains of the same species (Fig. 4). The evolutionary distance analyses of W proteins revealed that APMV-3 is more divergent than the other APMV species. The highest evolutionary divergence was noticed between APMV-3 & APV-C (10.322), followed by APMV-3 & APMV-13 (8.594) and APMV-3 & APMV-12 (8.270). The lowest divergence was observed between APMV-9 & APMV-21 (0.400), followed by APV-A & APV-B (0.407). The distance between strains of the same species was greatest in APMV-3 (0.619), followed by APMV-1 (0.256), which was also apparent from their lower percentage of amino acid homology (Table 5).

Table 5 Estimates of Evolutionary Divergence over Sequence Pairs between Groups, analyzed for W proteins of 21 APMV species.

Selection pressure analysis

The dN/dS ratio was used to determine the natural selection pressure acting on the P gene edited products. The dN/dS ratio was estimated with DnaSP v6.12.03 for APMV species represented by more than one strain. The dN/dS values were significantly less than 1 for both V and W sequences (complete, N- and C-terminal regions) of most species, indicating that they are under negative selection pressure. Only the C terminal region of V proteins of APMV-3 strains showed positive selection, with dN/dS > 1 (Table 6).

Table 6 Selection Pressure Analysis of V and W proteins of 21 APMV species.

Evolutionary rate analysis

The substitution rates of the V and W nucleotide sequences of APMV species represented by more than two strains were estimated with the uncorrelated relaxed lognormal clock in BEAST. APMV-10 was represented by four strains that were 98.65% to 100% identical to each other, and hence its substitution rate could not be determined. The substitution rate was highest in APMV-13, followed by APMV-2 and APMV-6, for both V and W proteins. The overall substitution rate was 7.37 × 10⁻⁵ for V protein and 8.07 × 10⁻⁵ for W protein (Table 7).

Table 7 Evolutionary Rate analysis by Molecular Clock- Estimated nucleotide substitution rates for V and W nucleotide sequences of all 21 APMV species.

Discussion

Avian paramyxoviruses are known to infect a variety of bird species across the globe. Currently 21 species (previously called serotypes) of APMVs are characterized, and more viruses could be identified in future with improved viral surveillance programs. Paramyxoviruses, with their small genomes, have a unique strategy of maximizing their genomic information by expressing viral proteins through co-transcriptional RNA editing. This helps to avoid the error catastrophe caused by the higher mutation rates often associated with larger genomes. Additionally, these viruses follow the ‘rule of six’ for efficient replication. Though detailed studies on APMV structural genes and complete genomes are available, comparative knowledge of the accessory proteins expressed through RNA editing is lacking. In this study, using a bioinformatics approach, we analyzed the P gene editing site and predicted and studied the protein sequences of the edited products, V and W, of all 21 APMV species (55 viruses) known to date.

The hexamer phasing at the P gene editing site within each virus group is conserved⁷. We observed conserved hexamer phasing between certain APMV species and also within each APMV species, except in APMV-3 and for one strain of APMV-6. The hexamer phase is known to regulate the mRNA editing pattern; the effect is subtle but important. For example, in human and bovine parainfluenza virus type 3 (PIV-3), in which the hexamer positions at the P gene editing site are 2 to 5, a higher mRNA editing frequency (~70%) and a larger number of G insertions (1 to 6 at equal frequencies) are observed, while the lowest mRNA editing frequency (~30%), with only 1 to 3 G insertions, occurs in Sendai virus, in which the hexamer phase position at the P gene editing site is 1^7,79,80. Based on the hexamer phasing position, it is anticipated that in all APMVs except APMV-20 the editing frequency could be extensive, with the possibility of a larger number of G insertions. However, the cis-acting sequence of the P gene editing site in APMV-20 (³’AA) suggests a higher mRNA editing frequency and an increased number of G insertions, as reported in human and bovine PIV-3^81,82. Thus APMVs seem to follow the PIV-3 RNA editing phenotype.

Another interesting observation is the unique editing site sequence of APMV-11 (³’A₄UUCUUC₅), in which the unedited mRNA translates to V protein, and it has been suggested that an mRNA with a 2G insertion translates to P protein¹⁷. In rubulaviruses, with a P gene editing site sequence of ³’A₃UUCUC₄, realignment of the nascent mRNA/template hybrid during a 1G insertion would mean non-permissible A:C base pairing; hence the minimum insertion expected is 2G⁸³. The base pairing between the nascent chain and the template genome to form a hybrid is important to prevent transcriptional slippage by the polymerase⁸⁰. Similarly, in APMV-11, 1G and 2G insertions would lead to unstable A:C base pairing; hence a 3G insertion (V protein) could be the minimum number of insertions expected, while a 4G insertion would translate to W protein and a 5G insertion would lead to P protein synthesis. It remains to be explored whether APMV-11 expresses more V protein (from both unedited mRNA and 3G insertions) than other paramyxoviruses. Furthermore, it will be interesting to study whether deletions, in addition to G insertions, could occur in APMV-11 and other APMVs with longer C runs at the editing site, as described previously with recombinant Sendai virus and PIV-3 minigenomes⁸³.
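
The reading-frame bookkeeping behind these statements can be sketched in a few lines of Python. The sketch assumes only that each inserted G shifts the downstream reading frame by one step in the cyclic order P, V, W, so that three insertions return to the starting frame. The names `FRAME_ORDER` and `edited_product` are hypothetical, and the sketch does not model whether a given insertion is permissible at a particular editing site.

```python
FRAME_ORDER = ("P", "V", "W")  # each inserted G moves one step along this cycle


def edited_product(n_inserted_g: int, unedited: str = "P") -> str:
    """Return the protein (P, V or W) encoded after inserting n G residues.

    `unedited` is the protein encoded by the unedited mRNA: "P" for most
    APMVs and "V" for APMV-11, as described in the text.
    """
    if n_inserted_g < 0:
        raise ValueError("number of inserted G residues must be non-negative")
    if unedited not in FRAME_ORDER:
        raise ValueError("unedited product must be one of P, V or W")
    start = FRAME_ORDER.index(unedited)
    return FRAME_ORDER[(start + n_inserted_g) % 3]


# Compare a typical APMV (unedited mRNA encodes P) with APMV-11 (encodes V).
for n in range(6):
    print(n, edited_product(n), edited_product(n, unedited="V"))
```

With `unedited="V"`, insertions of 2, 3, 4 and 5 G residues map to P, V, W and P respectively, which matches the assignments given above for APMV-11.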

Three factors are known to decide the editing phenotype (i.e. the number of G insertions, deletions and the frequency of mRNA editing), which in turn can influence virus pathogenicity^81,82,84,85: the editing site (its sequence and the length of its C runs), the sequences immediately upstream of the editing site (the cis-acting sequence) and the hexamer phase positions. Among APMVs, variations are observed in (i) the P gene editing site, (ii) the hexamer phase at the editing site and (iii) the sequences immediately upstream of the editing site, all of which are likely to determine the expression levels and relative proportions of P, V and W proteins in the APMVs, which in turn could explain their differences in replication, pathogenicity and virus-host interactions.

With respect to the length and amino acid composition of V and W proteins, variation was much greater between species than within species, which was also reflected in their dN/dS estimates. The V proteins were more conserved than the W proteins. Higher sequence identity for V proteins was observed between strains of the same species (the exception being APMV-3) than between species. Phylogenetically, V protein analysis grouped the APMV species similarly to the individual gene-based phylogeny²: all the members of genus Orthoavulavirus clustered into one group, all members of genus Metaavulavirus along with one of the avian paraavulaviruses (APMV-4) formed the second group, and the avian paraavulavirus APMV-3 formed an outgroup. This was further affirmed by evolutionary distance analysis. The phylogenetic analysis and the evolutionary distance data of both V and W proteins clearly showed that APMV-3 strains are the most divergent.

The N terminal region of V and W proteins, which is shared with P protein, showed the highest conservation among APMVs. In their C terminal region, the V proteins of paramyxoviruses carry conserved arginine and isoleucine residues upstream of seven highly conserved cysteine residues (the zinc binding domain), which are known to play important roles in MDA5 interference, STAT1 degradation and blocking interferon signaling to evade host immunity^86,87,88. The V proteins of all 21 APMV species analyzed in this study had the seven cysteine residues and, remarkably, many other amino acids were also conserved in their C terminal region. Though similar observations have been made earlier in other paramyxovirus V proteins, their functional importance is not yet known⁸⁶.

The V and W genes were found to be under negative selection pressure, with dN/dS < 1 in all the species. This shows the conserved nature of the non-structural viral proteins within the species and probably indicates their functional importance, which is yet to be completely explored. Furthermore, the substitution rates determined by molecular clock analysis varied across the APMV species and were higher between species than within species. The substitution rates for W proteins were higher than those for V proteins, except for APMV-1, where more strains were available for comparison. Though the substitution rate was slightly higher for the V nucleotide sequence (5.15 × 10⁻⁴) of APMV-1 compared to the W nucleotide sequence (1.0 × 10⁻⁴), it did not lead to changes in amino acid sequence, as is evident from the dN/dS ratio estimates suggesting negative selection pressure. The higher conservation of the V protein sequence implies its significant role in virus biology, such as replication, pathogenesis and immune evasion.
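
The interpretation of dN/dS used here can be made explicit with a minimal Python sketch, assuming the usual convention that a ratio below 1 indicates negative (purifying) selection, a ratio of 1 neutrality and a ratio above 1 positive selection. The function name and the input values are hypothetical; this is not the software or the data used to obtain the estimates reported in this study.

```python
import math


def classify_selection(dn: float, ds: float) -> str:
    """Classify selection pressure from nonsynonymous (dN) and synonymous (dS) rates."""
    if ds <= 0:
        raise ValueError("dS must be positive to form the dN/dS ratio")
    if dn < 0:
        raise ValueError("dN must be non-negative")
    omega = dn / ds
    if math.isclose(omega, 1.0, rel_tol=1e-9):
        return "neutral"
    return "negative" if omega < 1.0 else "positive"


# Hypothetical rates, for illustration only.
print(classify_selection(0.02, 0.10))
```

A gene in which synonymous changes accumulate faster than nonsynonymous ones, as in the hypothetical call above, would be classified as being under negative selection.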

In contrast to V proteins, the APMV W proteins were highly disordered, showed little sequence conservation and had higher divergence values. The evolutionary data analysis of W proteins suggested higher sequence identity among strains of the same species and higher variability between species. There were no conserved sequences or motifs in the C terminal region of W proteins, except that most of them carried a large number of basic amino acids, suggesting that W protein is highly basic, as described previously¹². The exceptions were one strain of APMV-1 (KJ736742.1), APMV-4, APMV-7 and APV-C, all of which had shorter W proteins. This genetic diversity seen in the W proteins may determine the degree of pathogenesis, variable interferon antagonistic activity and the wide host range exhibited by the APMV species.

The likelihood of W mRNA occurrence could be less than that of V mRNA because of the two unstable base pairs created by the two mismatches (2G) during polymerase stuttering. This skepticism becomes more compelling, and raises doubts as to whether W protein is expressed at all, in those APMV species whose predicted W protein sequences have only a few amino acid residues in their C terminal region: one (in APMV-7), two (in APMV-4, APMV-13 & APMV-15), three (in APMV-1 isolate R75/98), four (in APMV-6, JX522537.1 & APV-C) or five (in APMV-5 & APMV-16) amino acids. However, equal or higher frequencies of 1G and 2G insertions during RNA editing have been reported in certain paramyxoviruses such as Nipah virus and bovine parainfluenza virus type 3^89,90.

The presence of W mRNA of APMV-1 was first reported in 1993, with a frequency of about 10%¹², and W protein expression from APMV-1 lentogenic strain Clone 30 and from APMV-1 lentogenic strain La Sota and velogenic strain SG10 was recently confirmed^16,91. We had earlier shown, using a plasmid system, that the W protein of APMV-1 mesogenic strain Komarov compartmentalized in the nucleus³⁰, and the same has also been documented during virus infection in cells in the above two studies. Here, we report that NLS and NES of W proteins were predicted only in certain APMV species. In addition, we could identify an NLS in only five out of twelve strains of APMV-1, implying that not all the W proteins of APMV-1 strains localize in the nucleus. The W protein sequence analysis of nearly 1000 strains of APMV-1 in our lab shows variations in W protein length between strains (unpublished data), which was also reported recently in an analysis of 286 strains of NDV⁹¹. Furthermore, the W proteins of only about 50% of the strains analyzed by us are predicted to localize to the nucleus (data not shown), leading us to speculate that these differences in W proteins may contribute to the wide spectrum of pathogenicity and virulence observed in Newcastle disease.

Among paramyxoviruses, the W proteins of Nipah and Hendra viruses are the best characterized. The nuclear localization of the W protein of Nipah virus was found to modify p53 expression and activity⁹², sequester inactive STAT1 within the nucleus⁹³, prevent IRF3 phosphorylation, inhibit IFN signaling mediated both by the virus and by TLR-3²⁹, modulate host immunity, and influence the disease course and viral pathogenesis, specifically neurovirulence^27,28,94. Intriguingly, neither the lack of W protein nor its cytoplasmic localization in APMV-1 strain Clone 30 had any effect on viral replication in cell culture¹⁶. Though no conserved motifs could be identified between the W proteins of APMVs and Nipah virus, it will be interesting to study whether similar roles are executed by the W proteins of any or all of the APMV species. To our knowledge, this is the first comprehensive and comparative evolutionary study of the P gene edited accessory viral proteins of APMVs. The information obtained from this study will enable the design of future studies to understand the specific functions of conserved motifs/amino acids of V and W proteins and to decipher their evolutionary significance for the virus as well as the host.

Change history

02 October 2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Amarasinghe, G. K. et al. Taxonomy of the order Mononegavirales: update 2019. Archives of Virology 164, 1967–1980 (2019).
Aziz-ul-Rahman, M. M. & Shabbir, M. Z. Comparative evolutionary and phylogenomic analysis of Avian avulaviruses. Molecular Phylogenetics and Evolution 127, 931–951 (2018).
OIE. Infection with Newcastle disease virus. In OIE - Terrestrial Animal Health Code. Preprint at, http://www.oie.int/fileadmin/Home/eng/Health_standards/tahc/current/chapitre_nd.pdf (2019).
Lamb, R. A. & Parks, G. D. Paramyxoviridae: the viruses and their replication. In Fields Virology (eds Fields, B. N., Knipe, D. M. & Howley, P. M.). Lippincott, Williams, and Wilkins (2007).
Millar, N. S. & Emmerson, P. T. Molecular Cloning and Nucleotide Sequencing of Newcastle Disease Virus. Newcastle Disease 8, 79–97 (1988).
Calain, P. & Roux, L. The rule of six, a basic feature for efficient replication of Sendai virus defective interfering RNA. Journal of Virology 67, 4822–4830 (1993).
Kolakofsky, D. et al. Paramyxovirus RNA synthesis and the requirement for hexamer genome length: the rule of six revisited. Journal of Virology 72, 891–899 (1998).
Chang, P. C. et al. Complete nucleotide sequence of avian paramyxovirus type 6 isolated from ducks. Journal of General Virology 82, 2157–2168 (2001).
Chambers, P. & Samson, A. C. Non-structural proteins in Newcastle disease virus-infected cells. Journal of General Virology 58(Pt 1), 1–12 (1982).
Locke, D. P. et al. Newcastle disease virus phosphoprotein gene analysis and transcriptional editing in avian cells. Virus Research 69, 55–68 (2000).
Mebatsion, T., Verstegen, S., De Vaan, L. T. C., Römer-Oberdörfer, A. & Schrier, C. C. A recombinant Newcastle Disease Virus with low-level V protein expression is immunogenic and lacks pathogenicity for chicken embryos. Journal of Virology 75, 420–428 (2001).
Steward, M., Vipond, I. B., Millar, N. S. & Emmerson, P. T. RNA editing in Newcastle disease virus. Journal of General Virology 74, 2539–2547 (1993).
Giorgi, C., Blumberg, B. M. & Kolakofsky, D. Sendai virus contains overlapping genes expressed from a single mRNA. Cell 35, 829–836 (1983).
Kolakofsky, D., Vidal, S. & Curran, J. Paramyxovirus RNA Synthesis and P Gene Expression. In Kingsbury, D. W. (Ed.), The Paramyxoviruses, The Viruses. Springer, Boston, MA (1991).
Vidal, S., Curran, J. & Kolakofsky, D. A stuttering model for paramyxovirus P mRNA editing. The EMBO Journal 9, 2017–2022 (1990).
Karsunke, J. et al. W protein expression by Newcastle disease virus. Virus Research 263, 207–216 (2019).
Briand, F. X., Henry, A., Massin, P. & Jestin, V. Complete genome sequence of a novel avian paramyxovirus. Journal of Virology 86, 7710–7710 (2012).
Paterson, R. G. & Lamb, R. A. RNA editing by G-nucleotide insertion in Mumps virus P-gene mRNA transcripts. Journal of Virology 64, 4137–4145 (1990).
Childs, K. S., Andrejeva, J., Randall, R. E. & Goodbourn, S. Mechanism of mda-5 inhibition by paramyxovirus V proteins. Journal of Virology 83, 1465–73 (2009).
Huang, Z., Krishnamurthy, S., Panda, A. & Samal, S. K. Newcastle disease virus V protein is associated with viral pathogenesis and functions as an alpha interferon Antagonist. Journal of Virology 77, 8676–8685 (2003).
Park, M. S., García-Sastre, A., Cros, J. F., Basler, C. F. & Palese, P. Newcastle Disease Virus V protein is a determinant of host range restriction. Journal of Virology 77, 9522–9532 (2003).
Park, M. S. et al. Newcastle disease virus (NDV)-based assay demonstrates interferon-antagonist activity for the NDV V protein and the Nipah virus V, W, and C proteins. Journal of Virology 77, 1501–1511 (2003).
Chu, Z. et al. Newcastle Disease Virus V protein inhibits cell apoptosis and promotes viral replication by targeting CacyBP/SIP. Frontiers in Cellular and Infection Microbiology 8, 304 (2018).
Wang, C. et al. Newcastle disease virus V protein inhibits apoptosis in DF-1 cells by downregulating TXNL1. Veterinary Research 49, 102 (2018).
Chu, Z. et al. Newcastle disease virus V protein promotes viral replication in HeLa cells through the activation of MEK/ERK signaling. Viruses 10, 489 (2018).
Alamares, J. G., Elankumaran, S., Samal, S. K. & Iorio, R. M. The interferon antagonistic activities of the V proteins from two strains of Newcastle disease virus correlate with their known virulence properties. Virus Research 147, 153–157 (2010).
Satterfield, B. A. et al. The immunomodulating V and W proteins of Nipah virus determine disease course. Nature Communications 6, 7483 (2015).
Satterfield, B. A., Geisbert, T. W. & Mire, C. E. Inhibition of the host antiviral response by Nipah virus: current understanding and future perspectives. Future Virology 11, 331–344 (2016).
Shaw, M. L., Cardenas, W. B., Zamarin, D., Palese, P. & Basler, C. F. Nuclear localization of the Nipah virus W protein allows for inhibition of both virus-and toll-like receptor 3-triggered signaling pathways. Journal of Virology 79, 6078–6088 (2005).
Vaidyanathan, S. P., Gawai, S. & Subbiah, M. Poster Presentation, ICID 2016: Elucidation of the role of non-structural viral protein (W) of Newcastle disease virus. International Journal of Infectious Diseases 45, 337–338 (2016).
Abolnik, C., De Castro, M. & Rees, J. Full genomic sequence of an African Avian Paramyxovirus Type 4 strain isolated from a wild duck. Virus Genes 45, 537–541 (2012).
Fereidouni, S. et al. Next-generation sequencing of five new avian paramyxoviruses 8 isolates from Kazakhstan indicates a low genetic evolution rate over four decades. Archives of Virology 163, 331–336 (2018).
Heiden, S., Grund, C., Höper, D., Mettenleiter, T. C. & Römer-Oberdörfer, A. Pigeon paramyxovirus type 1 variants with polybasic F protein cleavage site but strikingly different pathogenicity. Virus Genes 49, 502–506 (2014).
Hiono, T., Matsuno, K., Tuchiya, K., Lin, Z., Okamatsu, M. & Sakoda, Y. Complete genome sequence of the avian paramyxovirus serotype 5 strain APMV-5/budgerigar/Japan/TI/75. Genome Announcements 4, e01005-16 (2016).
Jeong, J. et al. Complete genome sequence of a novel avian paramyxovirus isolated from wild birds in South Korea. Archives of Virology 163, 223–227 (2018).
Karamendin, K. et al. Novel avian paramyxovirus isolated from gulls in Caspian seashore in Kazakhstan. PLoS One 12, 1–15 (2017).
Karamendin, K. et al. Complete genome sequence of avian paramyxovirus strain APMV-6/red-crested pochard/Balkhash/5842/2013 from Kazakhstan. Genome Announcements 3, e00158–15 (2015).
Kim, S. H., Subbiah, M., Samuel, A. S., Collins, P. L. & Samal, S. K. Roles of the fusion and hemagglutinin-neuraminidase proteins in replication, tropism, and pathogenicity of avian paramyxoviruses. Journal of Virology 85, 8582–8596 (2011).
Kumar, S., Nayak, B., Collins, P. L. & Samal, S. K. Complete genome sequence of avian paramyxovirus type 3 reveals an unusually long trailer region. Virus Research 137, 189–197 (2008).
Kumar, S., Nayak, B., Samuel, A. S., Xiao, S., Collins, P. L. & Samal, S. K. Complete genome sequence of avian paramyxovirus-3 strain Wisconsin: Evidence for the existence of subgroups within the serotype. Virus Research 149, 78–85 (2010).
Kydyrmanov, A. I. et al. Novel avian paramyxovirus isolated from gulls in Caspian seashore in Kazakhstan. PLoS One 12(12), e0190339 (2017).
Lee, H. J. et al. A novel avian paramyxovirus (Putative Serotype 15) isolated from wild birds. Frontiers in Microbiology 8, 786 (2017).
Li, X., Zhang, S., Wang, H., Zhao, J. & Zhang, G. Genomic characterization of two avian paramyxovirus type 2 isolates from chickens in China. Virus Genes 43, 55–59 (2011).
Miller, P. J. et al. Evidence for a new avian paramyxovirus serotype 10 detected in Rockhopper Penguins from the Falkland Islands. Journal of Virology 84, 11496–11504 (2010).
Nayak, B., Kumar, S., Collins, P. L. & Samal, S. K. Molecular characterization and complete genome sequence of avian paramyxovirus type 4 prototype strain duck/Hong Kong/D3/75. Virology Journal 5, 124 (2008).
Neira, V. et al. Novel avulaviruses in penguins, Antarctica. Emerging Infectious Diseases 23, 1212–1214 (2017).
Paldurai, A., Subbiah, M., Kumar, S., Collins, P. L. & Samal, S. K. Complete genome sequences of avian paramyxovirus type 8 strains goose/Delaware/1053/76 and pintail/Wakuya/20/78. Virus Research 142, 144–153 (2009).
Tirumurugaan, K. G. et al. Genotypic and pathotypic characterization of Newcastle disease viruses from India. PLoS One 6, e28414 (2011).
Samuel, A. S., Kumar, S., Madhuri, S., Collins, P. L. & Samal, S. K. Complete sequence of the genome of avian paramyxovirus type 9 and comparison with other paramyxoviruses. Virus Research 142, 10–18 (2009).
Samuel, A. S., Paldurai, A., Kumar, S., Collins, P. L. & Samal, S. K. Complete genome sequence of avian paramyxovirus (APMV) serotype 5 completes the analysis of nine APMV serotypes and reveals the longest APMV genome. PLoS One 5, 1–13 (2010).
Subbiah, M., Nayak, S., Collins, P. L. & Samal, S. K. Complete genome sequences of avian paramyxovirus serotype 2 (APMV-2) strains Bangor, England and Kenya: Evidence for the existence of subgroups within serotype 2. Virus Research 152, 85–95 (2010).
Subbiah, M., Xiao, S., Collins, P. L. & Samal, S. K. Complete sequence of the genome of avian paramyxovirus type 2 (strain Yucaipa) and comparison with other paramyxoviruses. Virus Research 137, 40–48 (2008).
Subbiah, M. et al. Pathogenesis of two strains of avian paramyxovirus serotype 2, Yucaipa and Bangor, in chickens and turkeys. Avian Diseases 54, 1050–1057 (2010).
Terregino, C. et al. Antigenic and genetic analyses of isolate APMV/wigeon/Italy/3920-1/2005 indicate that it represents a new avian paramyxovirus (APMV-12). Archives of Virology 158, 2233–2243 (2013).
Thampaisarn, R. et al. Characterization of avian paramyxovirus serotype 14, a novel serotype, isolated from a duck fecal sample in Japan. Virus Research 228, 46–57 (2017).
Thomazelli, L. M. et al. Novel avian paramyxovirus (APMV-15) isolated from a migratory bird in South America. PLoS One 12, 1–7 (2017).
Tian, Z. et al. Complete nucleotide sequence of Avian Paramyxovirus type 6 strain JL isolated from mallard ducks in China. Journal of Virology 86, 13112 (2012).
Tsunekuni, R., Ito, H., Otsuki, K., Kida, H. & Ito, T. Genetic comparisons between lentogenic Newcastle disease virus isolated from waterfowl and velogenic variants. Virus Genes 40, 252–255 (2010).
Wu, W. et al. Molecular and antigenic characteristics of Newcastle disease virus isolates from domestic ducks in China. Infection, Genetics and Evolution 32, 34–43 (2015).
Xiao, S. et al. Complete genome sequence of avian paramyxovirus type 7 (strain Tennessee) and comparison with other paramyxoviruses. Virus Research 145, 80–91 (2009).
Xiao, S. et al. Complete genome sequences of avian paramyxovirus serotype 6 prototype strain Hong Kong and a recent novel strain from Italy: Evidence for the existence of subgroups within the serotype. Virus Research 150, 61–72 (2010).
Yamamoto, E., Ito, H., Tomioka, Y. & Ito, T. Characterization of novel avian paramyxovirus strain APMV/Shimane67 isolated from migratory wild geese in Japan. The Journal of Veterinary Medical Science 77, 1079–1085 (2015).
Aziz-ul-Rahman & M. Z. Shabbir. One (1) new species in the genus Avulavirus (Mononegavirales: Paramyxoviridae). Taxonomic proposal document submitted to ICTV, code assigned 2019.014M (2019).
Armougom, F. et al. Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Research 34, W604–W608 (2006).
Notredame, C., Higgins, D. G. & Heringa, J. T-Coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217 (2000).
Robert, X. & Gouet, P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Research 42, W320–W324 (2014).
Marchler-Bauer, A. et al. CDD: NCBI’s conserved domain database. Nucleic Acids Research 43, D222–D226 (2015).
Bailey, T. L. DREME: motif discovery in transcription factor ChIP-seq data. Bioinformatics 27, 1653–1659 (2011).
Kosugi, S., Hasebe, M., Tomita, M. & Yanagawa, H. Systematic identification of cell cycle-dependent yeast nucleocytoplasmic shuttling proteins by prediction of composite motifs. Proceedings of the National Academy of Sciences 106, 10171–10176 (2009).
la Cour, T. et al. Analysis and prediction of leucine-rich nuclear export signals. Protein Engineering, Design and Selection 17, 527–536 (2004).
Xu, D. et al. LocNES: a computational tool for locating classical NESs in CRM1 cargo proteins. Bioinformatics 31, 1357–1365 (2015).
Jones, D. T., Taylor, W. R. & Thornton, J. M. The rapid generation of mutation data matrices. Computer Applications in the Biosciences 8, 275–282 (1992).
Schwartz, R. M. & Dayhoff, M. O. Matrices for detecting distant relationships. In Atlas of protein sequence and structure 5, 353–358 (1979).
Felsenstein, J. Confidence Limits On Phylogenies: An Approach Using the Bootstrap. Evolution 39, 783–791 (1985).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Molecular Biology and Evolution 33, 1870–1874 (2016).
Rozas, J. et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Molecular biology and evolution 34, 3299–3302 (2017).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evolution 4, vey016 (2018).
Karlin, D. & Belshaw, R. Detecting remote sequence homology in disordered proteins: Discovery of conserved motifs in the N-termini of mononegavirales phosphoproteins. PLoS One 7, e31719 (2012).
Kolakofsky, D. Paramyxovirus RNA synthesis, mRNA editing, and genome hexamer phase: A review. Virology 498, 94–98 (2016).
Iseni, F. et al. Chemical modification of nucleotide bases and mRNA editing depend on hexamer or nucleoprotein phase in Sendai virus nucleocapsids. RNA 8, 1056–1067 (2002).
Hausmann, S., Garcin, D., Morel, A. S. & Kolakofsky, D. Two nucleotides immediately upstream of the essential A6G3 slippery sequence modulate the pattern of G insertions during Sendai virus mRNA editing. Journal of Virology 73, 343–351 (1999).
Hausmann, S., Garcin, D., Delenda, C. & Kolakofsky, D. The versatility of paramyxovirus RNA polymerase stuttering. Journal of Virology 73, 5568–5576 (1999).
Jacques, J. P., Hausmann, S. & Kolakofsky, D. Paramyxovirus mRNA editing leads to G deletions as well as insertions. The EMBO Journal 13, 5496–503 (1994).
Mebatsion, T., Verstegen, S., De Vaan, L. T., Römer-Oberdörfer, A. & Schrier, C. C. A recombinant Newcastle disease virus with low-level V protein expression is immunogenic and lacks pathogenicity for chicken embryos. Journal of Virology 75, 420–428 (2001).
Mebatsion, T., de Vaan, L. T., de Haas, N., Römer-Oberdörfer, A. & Braber, M. Identification of a mutation in editing of defective Newcastle disease virus recombinants that modulates P-gene mRNA editing and restores virus replication and pathogenicity in chicken embryos. Journal of Virology 77, 9259–9265 (2003).
Ramachandran, A. & Horvath, C. M. Dissociation of paramyxovirus interferon evasion activities: universal and virus-specific requirements for conserved V protein amino acids in MDA5 interference. Journal of Virology 84, 11152–11163 (2010).
Steward, M., Samson, A. C., Errington, W. & Emmerson, P. T. The Newcastle disease virus V protein binds zinc. Archives of Virology 140, 1321–1328 (1995).
Qiu, X. et al. Newcastle Disease Virus V Protein Targets Phosphorylated STAT1 to Block IFN-I Signaling. PloS One 11, e0148560–e0148560 (2016).
Kulkarni, S. et al. Nipah Virus edits its P Gene at High Frequency to Express the V and W Proteins. Journal of Virology 83, 3982–3987 (2009).
Pelet, T., Curran, J. & Kolakofsky, D. The P gene of bovine parainfluenza virus 3 expresses all three reading frames from a single mRNA editing site. The EMBO Journal 10, 443–448 (1991).
Yang, Y. et al. Appropriate amount of W protein of avian avulavirus 1 benefits viral replication and W shows strain-dependent subcellular localization. Virology 538, 71–85 (2019).
Martinez-Gil, L., Vera-Velasco, N. M. & Mingarro, I. Exploring the Human-Nipah Virus Protein-Protein Interactome. Journal of Virology 91, e01461-17 (2017).
Ciancanelli, M. J., Volchkova, V. A., Shaw, M. L., Volchkov, V. E. & Basler, C. F. Nipah Virus Sequesters Inactive STAT1 in the Nucleus via a P Gene-Encoded Mechanism. Journal of Virology 83, 7828–7841 (2009).
Yoneda, M. et al. The nonstructural proteins of Nipah virus play a key role in pathogenicity in experimentally infected animals. PloS One 5, e12709–e12709 (2010).


Acknowledgements

The authors thank their lab members, National Institute of Animal Biotechnology and Department of Biotechnology, Government of India for supporting the study. This work was funded by the National Institute of Animal Biotechnology core grant (#C0007) and the Department of Biotechnology, Government of India (#BT/PR8740/AGR/36/772/2013).

Author information

Authors and Affiliations

National Institute of Animal Biotechnology, Hyderabad, 500032, Telangana, India
Pachineella Lakshmana Rao, Ravi Kumar Gandham & Madhuri Subbiah

Authors

Pachineella Lakshmana Rao
Ravi Kumar Gandham
Madhuri Subbiah

Contributions

M.S. conceived and designed the study, P.L.R. collected the data, M.S., P.L.R. and R.G. analyzed the data, M.S., P.L.R., R.G. drafted the manuscript.

Corresponding author

Correspondence to Madhuri Subbiah.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information S1

Supplementary information S2

Supplementary information S3

Supplementary information S4

Supplementary information S5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Rao, P.L., Gandham, R.K. & Subbiah, M. Molecular evolution and genetic variations of V and W proteins derived by RNA editing in Avian Paramyxoviruses. Sci Rep 10, 9532 (2020). https://doi.org/10.1038/s41598-020-66252-x


Received: 09 December 2019
Accepted: 06 May 2020
Published: 12 June 2020
DOI: https://doi.org/10.1038/s41598-020-66252-x
