Abstract
Zika virus (ZIKV) belongs to a class of neurotropic viruses that have the ability to cause congenital infection, which can result in microcephaly or fetal demise. Recently, the RNA-binding protein Musashi-1 (Msi1), which mediates the maintenance and self-renewal of stem cells and acts as a translational regulator, has been associated with promoting ZIKV replication, neurotropism, and pathology. Msi1 predominantly binds to single-stranded motifs in the 3′ untranslated region (UTR) of RNA that contain a UAG trinucleotide in their core. We systematically analyzed the properties of Musashi binding elements (MBEs) in the 3′UTR of flaviviruses with a thermodynamic model for RNA folding. Our results indicate that MBEs in ZIKV 3′UTRs occur predominantly in unpaired, single-stranded structural context, thus corroborating experimental observations by a biophysical model of RNA structure formation. Statistical analysis and comparison with related viruses show that ZIKV MBEs are maximally accessible among mosquito-borne flaviviruses. Our study addresses the broader question of whether other emerging arboviruses can cause similar neurotropic effects through the same mechanism in the developing fetus by establishing a link between the biophysical properties of viral RNA and teratogenicity. Moreover, our thermodynamic model can explain recent experimental findings and predict the Msi1-related neurotropic potential of other viruses.
Similar content being viewed by others
Introduction
Flaviviruses are an emerging group of arboviruses belonging to the Flaviviridae family. Researchers have been describing recent outbreaks of these viruses that have not been previously detected for decades1,2,3.
The genus Flavivirus comprises more than 70 species that are mainly transmitted by mosquitoes and ticks, typically classified into four groups: Mosquito-borne flaviviruses (MBFVs), tick-borne flaviviruses (TBFVs), insect-specific flaviviruses (ISFVs), that do not have vertebrate hosts, and no known arthropod vector flaviviruses (NKVs), which typically infect bats and rodents. Flaviviruses represent a global health threat, including emerging and re-emerging human pathogens such as Dengue (DENV), Yellow fever (YFV), Japanese encephalitis (JEV), West Nile (WNV), Tick-borne encephalitis (TBEV) and Zika (ZIKV) viruses4,5.
Initially isolated in 1947 from a sentinel rhesus macaque in the Ziika forest, Uganda, ZIKV has not been associated with severe disease, apart from skin rashes, body pain, and fever. Likewise, ZIKV has been circulating across equatorial zones in Africa and Asia for 60 years, until the first outbreak was reported in Yap Island, Micronesia in 2007. Subsequently, the virus spread eastwards to French Polynesia and other Pacific islands in 2013 and reached the Americas in 20156,7. There are two main ZIKV lineages, the original African (type strain MR766) and an Asian (type strain FSS13025)8,9, the latter also comprising American strains such as PE243.
Background
The 2015–2017 outbreak in the Americas raised the possibility of a link between ZIKV infection and congenital abnormalities, which included placental damage, intrauterine growth restrictions, eye diseases and microcephaly in children as well as acute motor axonal neuropathy-type Guillain-Barré syndrome in adults10. While MBFVs are typically transmitted by host-vector interaction, vertical transmission from mother to child during pregnancy via transplacental infection has been reported11.
The neurotropic potential of ZIKV-related flaviviruses has been known since the 1970s, when Saint Louis encephalitis virus (SLEV) has been attributed to a severe neurological disorder in infected mice12,13. Vertical transmission has been observed with JEV in mice14 and human15 and a case of human fetal infection have been reported after YFV vaccination16. Other transmission pathways of ZIKV include blood transfusions and sexual transmission17,18. Despite enormous efforts in studying ZIKV infections in the last years, the biological reasoning and mechanisms behind arbovirus congenital neurotropism remain elusive.
Flavivirus genome organization
Flaviviruses have the structure of an enveloped sphere of approximately 50 nm diameter. They are single-stranded positive-sense RNA viruses of 10–12 kb in size, and their genomic RNA (gRNA) encodes a single open reading frame (ORF) flanked by highly structured untranslated regions (UTRs). Upon translation of the ORF, a polyprotein is produced which is processed by viral and cellular enzymes, yielding structured (C, prM, E) and unstructured proteins (NS1, NS2A, NS2B, NS3, NS4A, 2K, NS4B, NS5). Both flavivirus UTRs are crucially related to regulation of the viral life cycle, mediating processes such as genome circularization, viral replication and packaging19,20,21,22.
Flaviviruses hijack the host mRNA degradation pathway
The central role of flavivirus 3′UTR in modulating cytopathicity and pathogenicity became apparent when an accumulation of both gRNA and viral long non-coding RNA (lncRNA) has been observed upon infection. These lncRNAs, also known as subgenomic flaviviral RNAs (sfRNAs)23, are stable decay intermediates derived from exploiting the host’s mRNA degradation machinery24.
sfRNAs are produced by partial degradation of viral gRNA by Xrn1, a host 5′-3′ exoribonuclease that is associated with the endogenous mRNA turnover machinery25,26. The enzyme stalls at highly conserved RNA structures in the viral 3′UTR, so-called Xrn1-resistant RNAs (xrRNAs), resulting in sfRNAs of variable lengths27,28. Xrn1-resistant RNAs and sfRNAs appear to be ubiquitously present in many flaviviruses. They have been described in MBFVs, including DENV29, YFV30, JEV31, and ZIKV32, TBFVs23,33, and recently in ISFVs and NKVs34,35. There is typically more than one xrRNA, given the diverse molecular architecture of different flavivirus 3′UTRs. Pseudoknot interactions have been proposed in some, but not all flavivirus xrRNAs32,36. While they may form transiently under certain conditions28, conclusive validation of their ubiquitous presence is missing. Hence, we will exclude them in this work. Earlier studies in our group have identified conserved RNA structural elements in viral 3′UTRs37,38,39,40,41, some of which have later been attributed to xrRNA functionality23. Stem-loop (SL) as well as dumbbell (DB) structures are found in 3′UTRs of flaviviruses in single or double copies (Fig. 1) and have been associated with quantitative protection of downstream viral RNA42.
The inhibition of Xrn1 by viral RNA yields sfRNAs that affect many cellular processes, both in the vector and the host43. In mosquitoes, sfRNA interacts directly with the predominant innate immune response pathway, RNA interference (RNAi), by serving as a template for microRNA (miRNA) biogenesis44. Conversely, in host cells sfRNA modulates the anti-viral interferon response45, e.g., by binding proteins to inhibit the translation of interferon-stimulated genes46. Moreover, sfRNA has been shown to inhibit Xrn1 and Dicer activity, thereby altering host mRNA levels47,48.
At the same time, a variety of host proteins bind the 3′UTR of flaviviruses, thereby mediating viral replication, polyprotein translation or the anti-viral immune response (see Table 1 in ref.43 for a comprehensive overview of host proteins that bind flavivirus 3′UTR/sfRNA). Although notoriously underrepresented in literature, one can expect that many of these proteins also bind sfRNA due to sequence and structure conservation.
Subgenomic flaviviral RNA interacts with Musashi
One of these groups of host factors is the Musashi (Msi) protein family. Msi is a highly conserved family of proteins in vertebrates and invertebrates that act as a translational regulator of target mRNAs and is involved in cell proliferation and differentiation. While the two Msi paralogs in mammals, Musashi-1 (Msi1) and Musashi-2 (Msi2), are expressed in stem cells49,50,51 and overexpressed in tumors and leukemias52, they are absent in differentiated tissue. Moreover, Msi1 is involved in the regulation of blood-testis barrier proteins and spermatogenesis in mice53. Musashi proteins have two RNA recognition motif (RRM) domains, whose sequence specificity has been determined by an in vitro selection method and NMR spectroscopy51,54,55. The trinucleotide sequence UAG, whose thermodynamic binding specificity was determined by fluorescence polarization assays, has been identified as core Musashi binding element (MBE). Nucleotides enclosing the main MBE recognition motif make minor contributions to binding affinity56. While earlier SELEX experiments identified the binding aptamer sequence (G/A)UnAGU (n = 1 − 3)51, iCLIP experiments with Msi1 in human glioblastoma cells confirmed the preferential binding of Msi1 to single-stranded (stem-loop) UAG sequences in 3′UTRs, but not in coding regions57. Zearfoss et al.56 observed that both GUAGU and AUAGU are recognized by mouse Msi1, whereas Drosophila Msi1 has a higher affinity for GUAGU. NMR-derived structures of the two Msi1 RNA recognition motifs in complex with RNA also show that both RNA-binding domains bind GUAGU (PDB IDs 2RS2 and 5X3Z).
In summary, there is a strong consensus in the literature that UAG is central to all proposed Musashi binding motifs. Therefore, we focus our calculations around this trinucleotide, and provide evidence that the availability of UAG in pentanucleotides expands to the accessibility of the entire motif.
Musashi is involved in flavivirus neurotropism
An interesting, yet understudied hypothesis is the possibility that the stem cell regulator protein Musashi could be related to ZIKV tropism. Based on the identification of a MBE in the 3′UTR of the ZIKV genome10, de Bernardi Schneider et al.7 reported the presence of the same element with a higher binding affinity for human Msi1 in all ZIKV sequences that belong to the Asia-Pacific-Americas clade in an in silico screen and implied that there could be a change of tropism for the viral lineage. Chavali et al.58 tested the possibility of Msi1 interaction with the ZIKV genome in vivo and found that Msi1 not only interacts with ZIKV, but also enhances viral replication. They noted that ZIKV RNA could compete with endogenous targets for binding Msi1 in the brain of the developing fetus, thereby dysregulating the expression of genes required for neural stem cell development. Based on their data the authors concluded that Msi1 is involved in ZIKV neurotropism and pathology and raised the question whether MBEs present in other flavivirus genomes could exhibit similar functionality. In a recent study, Platt et al.59 investigated whether ZIKV-related arboviruses can cause congenital infection and fetal pathology in utero in immunocompetent mice. They tested two emerging neurotropic flaviviruses, WNV, and Powassan virus (POWV), as well as two alphaviruses, Chikungunya virus (CHIKV) and Mayaro virus (MAYV). All four viruses caused placental infection, however, only WNV and POWV resulted in fetal demise, indicating that ZIKV is not unique among flaviviruses in its capacity to be transplacentally transmitted and cause fetal neuropathology.
In this contribution, we systematically analyze the Musashi-related neurotropic potential of well-curated flavivirus genomes in silico. We investigate structural features of MBEs in viral 3′UTRs by a thermodynamic model of RNA structure formation and work out the biophysical properties of conserved RNA structures harboring MBEs in order to build a theoretical ground for future in vivo studies.
Materials and Methods
Dataset
Sequence data for the present study was acquired from the public National Center for Biotechnology Information (NCBI) refseq database (https://www.ncbi.nlm.nih.gov/refseq/) on 15 December 2017. We filtered for all complete viral genomes under taxonomy ID 11051 (genus Flavivirus), resulting in 72 genomes, 51 of which had 3′UTR sequences and annotation available (Table 1).
The core Musashi binding element is only three nucleotides long, hence one can expect to observe a certain number of UAG trinucleotides by chance in any viral 3′UTR. Table 1 shows the number of MBEs present in 3′UTR regions of viral genomes analyzed here as well as the ratio \({R}_{UAG}={O}_{UAG}\)/\({E}_{UAG}\), i.e., observed versus expected frequencies. Assuming that all four nucleotides (A, U, G, C) occur independently and with equal probability, the expected probability to observe a subsequence of length l is equal to (1/4)l. For \(l=3\), this is equal to 1/64. More realistically, the frequency of each nucleotide \(i\in \{A,U,G,C\}\) in an RNA sequence of length L is \({F}_{i}={N}_{i}\)/\(L\), where, Ni is the nucleotide count of i. For any trinucleotide XYZ, the expected trinucleotide frequency EXYZ is then computed from mononucleotide frequencies as \({E}_{XYZ}={F}_{X}\ast {F}_{Y}\ast {F}_{Z}\).
The refseq genome for Spondweni virus (SPONV, accession number NC_029055.1) does not include a 3′UTR sequence. Since SPONV is phylogenetically closely related to ZIKV60, we were looking to include this sequence into our analysis. Nikos Vasilakis (Univ. of Texas Medical Branch, Galveston, TX, USA) generously provided SPONV sequence data. The 338 nt 3′UTR sequence of the SA-Ar strain (see Supplementary Material) has been added to the set of flavivirus sequences analyzed here.
Kama virus (KAMV) does not contain UAG trinucleotides in the 3′UTRs, consequently it has been discarded from our dataset. The remaining virus species contain between 1 and 19 MBEs in their 3′UTRs.
Opening energy directly relates to single-strandedness
The biophysical model employed here is based on a description of RNA at the level of secondary structures, building upon the thermodynamic nearest neighbor energy model as implemented in the ViennaRNA Package61. This allows for computing equilibrium properties of RNA such as the single most stable, minimum free energy (MFE) structure, as well as the partition function \({\mathscr{Z}}\). The latter makes an evaluation of the thermodynamic ensemble of RNA structures available and is defined as the sum over all Boltzmann factors of individual structures s
where E(s) is the free energy of the structure, R the universal gas constant and T the thermodynamic temperature of the system. The equilibrium probability of a secondary structure s is then defined as
The partition function \({\mathscr{Z}}\) can be computed efficiently via dynamic programming62 and allows calculation of individual base pair probabilities, even for large sequences63. In this line, the accessibility (i.e., the probability that a region \(i\ldots j\) along the RNA is single-stranded) can be derived from the partition function (Eq. 1)64. Likewise, the opening energy (i.e., the free energy required to force the region to be single-stranded) can be computed as
The opening energy of a region within an RNA is directly related to local RNA secondary structure. In this line, low opening energy is a reliable indicator for single-strandedness. We employ the sliding window approach of RNAplfold63 to compute local pairing probabilities of UAG trinucleotide motifs to assess the likelihood of single-strandedness of and around MBEs. RNAplfold is part of the ViennaRNA Package61 and can compute the accessibilities or single-strandedness of all intervals of an RNA in cubic time65. We select 97 nt windows upstream and downstream of MBEs in viral 3′UTRs and compute local pairing probabilities for base pairs within 100 nt windows. Opening energies for trinucleotides are then evaluated from averaged pairing probabilities with RNAplfold.
The significance of a calculated MBE opening energy is assessed by comparison with a large number of randomized sequences of the same length and same base or dinucleotide composition. We compute the opening energies of trinucleotides both in a genomic as well as a shuffled sequence context and apply a z score statistics. The normalized z score is defined as
where Eopen(XYZ) is the opening energy of trinucleotide XYZ in its genomic context, μ and σ are the mean and standard deviations, respectively, of the opening energies of XYZ computed over a large sample of randomized sequences. Randomization with regard to keeping sequence composition is achieved here by applying dinucleotide shuffling to the 97 nt windows upstream and downstream of MBEs, while keeping XYZ in place. The same idea applies to calculations of pentanucleotide motifs.
The approach outlined above is implemented in the Perl utility plfoldz.pl, which is available from https://github.com/mtw/plfoldz. The script employs the ViennaRNA61 scripting language interface for thermodynamics calculations, the ViennaNGS66 suite for extraction of genomic loci and the uShuffle Perl bindings67 for k-let shuffling. The tool reports for each requested trinucleotide the opening energy in a genomic context as well as an opening energy z score obtained from n shuffling events of upstream and downstream sequences. Here, n = 10,000 dinucleotide shuffling events were used.
Characteriztaion of MBEs within xrRNAs
To localize MBEs within homologous substructures in flavivirus 3′UTRs we constructed infernal68 covariance models for conserved xrRNA elements. The structural RNA alignments underlying the infernal models were computed with locarna69 and further analyzed with RNAalifold61 and RNAaliSplit70.
Results
MBEs are highly accessible in ZIKV 3′UTRs
The Musashi family of proteins preferentially bind single-stranded UAG motifs in 3′UTRs57. To evaluate the thermodynamics of Msi-UAG affinity more broadly, we set out to analyze the single-strandedness of all possible trinucleotides in ZIKV genomes. To this end, we computed the opening energies of all trinucleotide motifs present in the coding sequence (CDS) and 3′UTR of the African (ZIKV-UG) and Asian/American (ZIKV-BR) Zika strains. A z score was calculated for each occurrence of trinucleotide XYZ according to Eq. 4, thereby normalizing the opening energy of XYZ in its genomic context with n = 10,000 dinucleotide-shuffled upstream and downstream regions of 97 nt, using 100 nt windows in RNAplfold.
Negative opening energy z scores indicate increased accessibility, i.e., UAG trinucleotides in viruses with overall low z scores are likely to occur in an unpaired structural context within the 3′UTR. Through the distribution of z scores, sorted by median z score (Fig. 2) we were able to see three aspects standing out. First, the distribution of z scores is markedly divergent among CDS and 3′UTR. The interquartile ranges of opening energy z scores are homogeneous within the CDS region, while dispersion is varied within the 3′UTR. We hypothesize that this is caused by a different sequence composition that manifests in highly variable opening energies. It could, however, also be an artifact of the different sample sizes based on the divergent trinucleotide count in CDS and 3′UTR, respectively. Second, UAG is the most accessible trinucleotide in the 3′UTR of ZIKV-BR and among the highest accessible trinucleotides in the 3′UTR of ZIKV-UG. This is striking as it corroborates previous experimental evidence of Musashi affinity to ZIKV58 by means of a thermodynamic model, thus underlining a possible role of Msi1 in ZIKV neurotropism. Moreover, the UAG trinucleotide is neither enriched nor depleted in the 3′UTRs of ZIKV-BR and ZIKV-UG (Table 1). Third, the canonical start codon AUG appears to the far right end of the scale in both ZIKV-BR and ZIKV-UG 3′UTRs, i.e., it is among the least accessible trinucleotides. This suggests evolutionary pressure on keeping the start codon in a paired structural context within the 3′UTR, thereby prohibiting accessibility to ribosomes and disabling undesirable leaky translation start from these AUG triplets.
We also tested the accessibility of larger Musashi recognition motifs. To this end, we employed the same approach outlined above for all pentanucleotides found in the 3′UTRs of ZIKV-BR and ZIKV-UG, respectively. The distributions of opening energy z scores (Supplementary Data, Fig. S1) are in good agreement with our results derived for trinucleotides as well as previous experimental data, suggesting that core a UAG is among the most accessible motifs. In particular, our data shows that NUAGN is the most accessible pentanucleotide in the 3′UTR of ZIKV-BR and among the highest accessible pentanucleotides in the 3′UTR of ZIKV-UG, similar to the situation found for trinucleotides. UAG appears to be conserved in an unpaired structural context not only by itself but also in a larger sequence context of enclosing nucleotides, which exhibit high accessibility upon the presence of a central UAG Musashi recognition element. This finding is in line with the reported Musashi recognition pentamers GUAGU and AUAGU.
MBE accessibility in related viruses
To assess the Musashi-related neurotropic potential of other flaviviruses, we evaluated the accessibilities of MBEs in related species. To this end, all 435 UAG trinucleotide motifs within 3′UTRs in the refseq dataset were identified, grouped by vector specificity and subjected to the computational approach outlined above (97 nt upstream/downstream windows, n = 10,000 dinucleotide shufflings).
Msi1 preferentially binds single stranded RNA57, consequently UAG motifs that contribute with low z scores have a high affinity for Msi1 binding. Within the MBFV group, the Asian/American lineage Zika virus (ZIKV-BR) has the lowest median z score, followed by Saint Louis encephalitis virus (SLEV), Nounané virus (NOUV) and the African lineage Zika virus (ZIKV-UG). Among others, two lineages of West Nile virus (WNV1, WNV2) and Yellow fever virus (YFV) appear with a negative median z score. ZIKV-BR turns out to be the only isolate among MBFVs that has just negative z score values in our simulations, i.e., all UAG motifs within the 3′UTR of the Brazilian ZIKV isolate appear in an unpaired structural context. Likewise, Karshi virus (KSIV), Alkhumra hemorrhagic fever virus (ALKV) and Langat virus (LGTV) have a strictly negative z scores distribution among the TBFV group. POWV, Omsk hemorrhagic fever (OHFV) and Louping ill virus (LIV) have negative mean opening energy z scores. Interestingly, UAG trinucleotides are relatively depleted in all TBFV species analyzed here (Table 1). Among NKVs, Montana myotis leukoencephalitis virus (MMLV) and Entebbe bat virus (ENTV) show negative mean opening energy z scores. Culex flavivirus (CxFV), Cell fusing agent virus (CFAG), Parramatta River virus (PaRV) and Ochlerotatus caspius flavivirus (OCFVPT) tend to have singe-stranded MBEs among the ISFVs. Here, OCFVPT is the only isolate with a strictly negative z score distribution (Fig. 3).
The number of UAG trinucleotide motifs in 3′UTRs of the refseq dataset lies between 1 and 19 (Table 1). The overall range of opening energy z scores is not equal among different flavivirus groups. While the lower bound is between −1.65 and −1.93 among all groups, MBFVs and ISFVs show markedly higher upper bounds than TBFVs and NKVs, respectively. Absolute values of computed MBE opening energy z scores are listed in Table 2.
Conserved xrRNAs contain MBEs
Several species appear to the right of the plots in Fig. 3 due to the sorting by median z score. However, they comprise a non-negligible number of accessible MBEs, as indicated by negative opening energy z scores. Examples are (re-)emerging species like JEV and Usutu virus (USUV), which contain 15 and 19 MBEs, respectively.
To investigate this further, we assigned each MBE in our dataset to one of the conserved elements stem-loop (SL), dumbbell (DB) and 3′ stem-loop (3SL) (Fig. 1) by means of covariance models. Analysis of RNA sequence and structure conservation revealed that the majority of virus isolates in our dataset contain only a single UAG motif within their SL and 3SL elements. Conversely, DB elements, which are conserved in MBFVs and NKVs (Fig. 4), stand out among conserved RNA structures in flavivirus 3′UTRs. They contain a pair of MBEs, separated by a 4 nt spacer, within a perfectly conserved sequence motif of approx. 20 nt length in their distal stem-loop structure. We hypothesize that this pair of conserved UAG motifs interacts with the two RNA-binding domains in Musashi proteins. Figure 5 shows the consensus secondary structure of flavivirus DB elements.
Discussion
Our findings lead to the conclusion that the accessibility of UAG motifs calculated through opening energies in Flavivirus 3′UTRs is indicative of the Musashi-related neurotropic potential of virus species. Our computational analyses show that there is little difference in the distribution of opening energies for all trinucleotides within the polyprotein (CDS) region of ZIKV. When comparing CDS and 3′UTR regions, we see a difference in behavior, as different trinucleotides do possess different opening energies, UAG being highly accessible in ZIKV. Although it is not possible to quantify the impact of the accessibility on the patient phenotype, it is interesting to see the UAG motifs in the Brazilian ZIKV isolate more accessible than in the Ugandan ZIKV isolate. This result raises the question once again if the increased pathogenicity seen in ZIKV today is due to changes in the sequence over time or simply lack of better surveillance71.
Previous experiments lead toward the idea that ZIKV is unique among flaviviruses regarding the clinical outcomes resulting from congenital infection72,73. Although our results indicate that this may be true for well-studied viruses such as DENV and WNV, other viruses which have not caused recent outbreaks may have been neglected.
Looking in depth at other viruses, Nounané virus (NOUV), a dual-host affiliated insect-specific flavivirus is found among the viruses with high MBE accessibility. NOUV was isolated in Cote d’Ivoire in 2004 from Uranotaenia mashonaensis, a Culicidae mosquito not known to harbor flaviviruses before74. While replication has been tested in human and non-human cell lines, vertebrate infection and pathogenesis could not be observed75.
Within the TBFV serocomplex, KSIV has the lowest overall MBE opening energies. Originally isolated from Ornithodoros papillipes ticks in Uzbekistan in 197276 it currently does not present history of infection in humans. Conversely, Powassan virus (POWV), another TBFV with negative MBE opening energy, first isolated in Powassan, Ontario, Canada in 1959 from a child who died of acute encephalitis77 can cause transplacental infection59 and has been associated with severe neuropathology and death in mice and human78.
Given that UAG can be regarded as the primary Msi1 binding motif, we can argue that ZIKV has the highest affinity for binding Msi1 among all MBFVs. Platt et al.59 showed that besides ZIKV, the neurotropic flaviviruses WNV and POWV, as well as the alphaviruses Chikungunya (CHIKV) and Mayaro (MAYV) infect placenta and fetus in immunocompetent, wild-type mice. However, only WNV was shown to infect the placenta and the fetal central nervous system, causing injury to the developing brain.
Congenital infection in humans is documented for WNV, JEV, YFV, and ZIKV. CHIKV and MAYV did not show this behavior. In this line, our results are in agreement with experimental studies that reported teratogenicity for SLEV12,13, WNV59,79, YFV16,80 and POWV59.
Bizarre neurological manifestations were also observed in patients infected by Ntaya virus (NTAV)81, a neurotropic virus from the Japanese encephalitis serocomplex, as well as WNV in humans82,83 and in mice84, USUV85,86 and DENV87,88. The fact that these viruses line up more on the positive side of the opening energy plots in Fig. 3 does not mean that they should not be neurotropic. It merely highlights that there might be additional mechanisms causing neuropathogenicity.
MBEs are conserved in flavivirus 3′UTR elements
Flavivirus DB elements do not only show structural conservation over the MBFV and NKV serocomplexes, but even maintain their primary sequence within a region of approx. 20 nt of the distal stem-loop (Figs 4 and 5). The combination of covariation and primary sequence conservation within a single RNA element underlines the importance of DB elements in flavivirus pathogenicity. It could also be indicative of a special role of DB element regions in the minus-strand synthesis during flavivirus replication.
UAG trinucleotides are the core nucleotides within MBE motifs to contribute the highest binding energy56. Our current analysis underlines that there seems to be evolutionary pressure on keeping UAG motifs within the DB elements unpaired. In ZIKV we see that not only the UAGs within DB elements but also those that overlap with SL elements show negative opening energy z-score.
Msi1 presence in different cells
The presence of Msi1 proteins in both sperm and neural precursor cells highlights the importance of studying the Msi1-MBE interaction in flaviviruses. Given that Msi1 has been shown to enhance ZIKV replication58, this interaction could be a critical reason why ZIKV persists in sperm for a long time after the individual has been infected89,90, allowing the virus to be transmitted sexually and also why the virus would harbor itself in neuronal cells, allowing it to interfere with and dysregulate neurodevelopment.
A possible role of Musashi in the flavivirus life cycle
Msi1, which binds to the 3′UTR of target mRNAs, has been shown to repress translation initiation by competing with the translation initiation factor eIF4G for binding to poly(A)-binding protein (PABP), thereby inhibiting the assembly of the 80S ribosomal unit91. Ribosome profiling experiments have corroborated this down-regulatory effect of Msi1, while keeping mRNA levels92. This allows for a speculative explanation of the findings by Chavali et al.58, i.e., that Msi1 enhances ZIKV replication, and a possible role of Msi1 in the viral life cycle: Flaviviruses need to “donate” a few copies of the quasispecies ensemble for Xrn1 degradation and subsequent sfRNA production. In this line, Msi1 could serve as an agent that provides a reasonable amount of gRNAs that are not translated but subject to Xrn1 degradation. The resulting sfRNAs can then down-regulate the host response46,93.
Conclusion
We studied a specific aspect of flavivirus congenital pathogenicity, i.e., the neurotropic effect inferred by the presence of MBEs in the 3′UTR of flavivirus genomes. Employing an established biophysical model of RNA structure formation, we analyzed the thermodynamic properties of MBEs in silico. Our results underline experimental studies suggesting that ZIKV is not alone in its capacity to cause severe neuropathology to infants through the MBE mechanism. While several tick-borne and mosquito-borne flavivirus species like Karshi virus (KSIV), Alkhumra hemorrhagic fever virus (ALKV) or Nounané virus (NOUV) line up with ZIKV in our theoretical model, their tropism might have been overseen due to the lack of reported significant outbreaks. However, some of them appear to have similar neurotropic potential and thus might be potent emerging pathogens.
The approach presented here could in principle be used for developing a tool to predict the Musashi-related neurotropic potential of novel viruses or (re-)emerging strains of known viruses. Combination of opening energy z scores with large scale epidemiologic data could be employed in a machine learning framework that also considers structural conservation and homology of flavivirus 3′UTR elements. Such a tool could play a role in categorizing viruses.
Data Availability
The plfoldz.pl Perl Utility for computing RNA opening energy z scores is available from https://github.com/mtw/plfoldz.
References
Tognarelli, J. et al. A report on the outbreak of Zika virus on Easter Island, South Pacific, 2014. Arch Virol 161, 665–668 (2016).
Malone, R. W. et al. Zika virus: medical countermeasure development challenges. PLoS Neglect Trop D 10, e0004530 (2016).
Hotez, P. J. & Murray, K. O. Dengue, West Nile virus, Chikungunya, Zika and now Mayaro? PLoS Neglect Trop D 11, e0005462 (2017).
Gubler, D., Kuno, G. & Markoff, L. Flaviviruses. Fields Virology 1, 1153–1252 (2007).
Weaver, S. C. et al. Zika virus: History, emergence, biology, and prospects for control. Antivir Res 130, 69–80 (2016).
Song, B.-H., Yun, S.-I., Woolley, M. & Lee, Y.-M. Zika virus: history, epidemiology, transmission, and clinical presentation. J Neuroimmunol 308, 50–64 (2017).
de Bernardi Schneider, A. et al. Molecular evolution of Zika virus as it crossed the Pacific to the Americas. Cladistics 33, 1–20 https://doi.org/10.1111/cla.12178 (2017).
Gong, Z., Xu, X. & Han, G.-Z. The diversification of Zika virus: Are there two distinct lineages? Genome Biol Evol 9, 2940–2945 (2017).
Simonin, Y., van Riel, D., Van de Perre, P., Rockx, B. & Salinas, S. Differential virulence between Asian and African lineages of Zika virus. PLoS Negl Trop D 11, e0005821 (2017).
Klase, Z. A. et al. Zika fetal neuropathogenesis: etiology of a viral syndrome. PLoS Neglect Trop D 10, e0004877 (2016).
Platt, D. J. & Miner, J. J. Consequences of congenital Zika virus infection. Curr Opin Virol 27, 1–7 (2017).
Andersen, A. & Hanson, R. Experimental transplacental transmission of St. Louis encephalitis virus in mice. Infect Immun 2, 320–325 (1970).
Andersen, A. & Hanson, R. Intrauterine infection of mice with St. Louis encephalitis virus: immunological, physiological, neurological, and behavioral effects on progeny. Infect Immun 12, 1173–1183 (1975).
Fujisaki, Y., Miura, Y., Sugimori, T., Murakami, Y. & Miura, K. Experimental studies on vertical infection of mice with Japanese encephalitis virus. IV. Effect of virus strain on placental and fetal infection. NatL I Anim Health Q 23, 21–26 (1983).
Chaturvedi, U. et al. Transplacental infection with Japanese encephalitis virus. J Infect Dis 141, 712–715 (1980).
Tsai, T., Paul, R., Lynberg, M. & Letson, G. Congenital yellow fever virus infection after immunization in pregnancy. J Inf Dis 168, 1520–1523 (1993).
Foy, B. D. et al. Probable non–vector-borne transmission of Zika virus, Colorado, USA. Emerg Infect Dis 17, 880 (2011).
Musso, D. et al. Potential sexual transmission of Zika virus. Emerg Infect Dis 21, 359 (2015).
Hahn, C. S. et al. Conserved elements in the 3′ untranslated region of flavivirus RNAs and potential cyclization sequences. J Mol Biol 198, 33–41 (1987).
Mukhopadhyay, S., Kuhn, R. J. & Rossmann, M. G. A structural perspective of the flavivirus life cycle. Nat Rev Microbiol 3, 13 (2005).
Villordo, S. M., Alvarez, D. E. & Gamarnik, A. V. A balance between circular and linear forms of the dengue virus genome is crucial for viral replication. RNA 16, 2325–2335 (2010).
de Borba, L. et al. Overlapping local and long-range RNA-RNA interactions modulate dengue virus genome cyclization and replication. J Virol 89, 3430–3437 (2015).
Pijlman, G. P. et al. A highly structured, nuclease-resistant, noncoding RNA produced by flaviviruses is required for pathogenicity. Cell Host Microbe 4, 579–591 (2008).
Akiyama, B. M., Eiler, D. & Kieft, J. S. Structured RNAs that evade or confound exonucleases: function follows form. Curr Opin Struct Biol 36, 40–47 (2016).
Jones, C. I., Zabolotskaya, M. V. & Newbury, S. F. The 5′-3′ exoribonuclease Xrn1/Pacman and its functions in cellular processes and development. Wires RNA 3, 455–468 (2012).
Antic, S., Wolfinger, M. T., Skucha, A., Hosiner, S. & Dorner, S. General and miRNA-mediated mRNA degradation occurs on ribosome complexes in Drosophila cells. Mol Cell Biol MCB–01346, https://doi.org/10.1128/MCB.01346-14 (2015).
Funk, A. et al. RNA structures required for production of subgenomic flavivirus RNA. J Virol 84, 11407–11417 (2010).
Chapman, E. G., Moon, S. L., Wilusz, J. & Kieft, J. S. RNA structures that resist degradation by Xrn1 produce a pathogenic Dengue virus RNA. elife 3, e01892 (2014).
Liu, R. et al. Identification and characterization of small sub-genomic RNAs in dengue 1–4 virus-infected cell cultures and tissues. Biochem Bioph Res Co 391, 1099–1103 (2010).
Silva, P. A., Pereira, C. F., Dalebout, T. J., Spaan, W. J. & Bredenbeek, P. J. An RNA pseudoknot is required for production of yellow fever virus subgenomic RNA by the host nuclease XRN1. J Virol 84, 11395–11406 (2010).
Fan, Y.-H. et al. Small noncoding RNA modulates Japanese encephalitis virus replication and translation in trans. Virol J 8, 492 (2011).
Akiyama, B. M. et al. Zika virus produces noncoding RNAs using a multi-pseudoknot structure that confounds a cellular exonuclease. Science aah3963 (2016).
Schnettler, E. et al. Induction and suppression of tick cell antiviral RNAi responses by tick-borne flaviviruses. Nucleic Acids Res 42, 9436–9446 (2014).
MacFadden, A. et al. Mechanism and structural diversity of exoribonuclease-resistant RNA structures in flaviviral RNAs. Nat Commun 9, 119 (2018).
Ochsenreiter, R., Hofacker, I. L. & Wolfinger, M. T. Functional RNA Structures in the 3′UTR of Tick-borne, Insect-specific and No Known Vector Flaviviruses. Viruses 11, 298 (2019).
Sztuba-Solinska, J. et al. Structural complexity of Dengue virus untranslated regions: cis-acting RNA motifs and pseudoknot interactions modulating functionality of the viral genome. Nucleic Acids Res 41, 5075–5089 (2013).
Rauscher, S., Flamm, C., Mandl, C. W., Heinz, F. X. & Stadler, P. F. Secondary structure of the 3′-noncoding region of flavivirus genomes: comparative analysis of base pairing probabilities. RNA 3, 779–791 (1997).
Hofacker, I. L. et al. Automatic detection of conserved RNA structure elements in complete RNA virus genomes. Nucleic Acids Res 26, 3825–3836 (1998).
Witwer, C., Rauscher, S., Hofacker, I. L. & Stadler, P. F. Conserved RNA secondary structures in picornaviridae genomes. Nucleic Acids Res 29, 5079–5089 (2001).
Hofacker, I. L., Stadler, P. F. & Stocsits, R. R. Conserved RNA secondary structures in viral genomes: a survey. Bioinformatics 20, 1495–1499 (2004).
Thurner, C., Witwer, C., Hofacker, I. L. & Stadler, P. F. Conserved RNA secondary structures in flaviviridae genomes. J Gen Virol 85, 1113–1124 (2004).
Kieft, J. S., Rabe, J. L. & Chapman, E. G. New hypotheses derived from the structure of a flaviviral Xrn1-resistant RNA: Conservation, folding, and host adaptation. RNA Biology 12, 1169–1177 (2015).
Roby, J. A., Pijlman, G. P., Wilusz, J. & Khromykh, A. A. Noncoding subgenomic flavivirus RNA: multiple functions in West Nile virus pathogenesis and modulation of host responses. Viruses 6, 404–427 (2014).
Hussain, M. et al. West Nile virus encodes a microRNA-like small RNA in the 3′ untranslated region which up-regulates GATA4 mRNA and facilitates virus replication in mosquito cells. Nucleic Acids Res 40, 2210–2223 (2011).
Schuessler, A. et al. West Nile virus noncoding subgenomic RNA contributes to viral evasion of the type I interferon-mediated antiviral response. J Virol 86, 5708–5718 (2012).
Manokaran, G. et al. Dengue subgenomic RNA binds TRIM25 to inhibit interferon expression for epidemiological fitness. Science 350, 217–221 (2015).
Moon, S. L. et al. A noncoding RNA produced by arthropod-borne flaviviruses inhibits the cellular exoribonuclease XRN1 and alters host mRNA stability. RNA (2012).
Schnettler, E. et al. Non-coding flavivirus RNA displays RNAi suppressor activity in insect and mammalian cells. J Virol JVI–01104 (2012).
Sakakibara, S.-I. et al. Mouse-Musashi-1, a neural RNA-binding protein highly enriched in the mammalian CNS stem cell. Dev Biol 176, 230–242 (1996).
Sakakibara, S.-I., Nakamura, Y., Satoh, H. & Okano, H. RNA-binding protein Musashi2: developmentally regulated expression in neural precursor cells and subpopulations of neurons in mammalian CNS. J Neurosci 21, 8091–8107 (2001).
Imai, T. et al. The neural RNA-binding protein Musashi1 translationally regulates mammalian numb gene expression by interacting with its mRNA. Mol Cell Biol 21, 3888–3900 (2001).
Kharas, M. G. et al. Musashi-2 regulates normal hematopoiesis and promotes aggressive myeloid leukemia. Nat Med 16, 903 (2010).
ErLin, S. et al. Musashi-1 maintains blood–testis barrier structure during spermatogenesis and regulates stress granule formation upon heat stress. Mol Biol Cell 26, 1947–1956 (2015).
Ohyama, T. et al. Structure of Musashi1 in a complex with target RNA: the role of aromatic stacking interactions. Nucleic Acids Res 40, 3218–3231 (2012).
Iwaoka, R. et al. Structural Insight into the Recognition of r(UAG) by Musashi-1 RBD2, and Construction of a Model of Musashi-1 RBD1-2 Bound to the Minimum Target RNA. Molecules 22, 1207 (2017).
Zearfoss, N. R. et al. A conserved three-nucleotide core motif defines Musashi RNA binding specificity. J Biolo Chem 289, 35530–35541 (2014).
Uren, P. J. et al. RNA-binding protein Musashi1 is a central regulator of adhesion pathways in glioblastoma. Mol Cell Biol 35, 2965–2978 (2015).
Chavali, P. L. et al. Neurodevelopmental protein Musashi 1 interacts with the Zika genome and promotes viral replication. Science eaam9243 (2017).
Platt, D. J. et al. Zika virus-related neurotropic flaviviruses infect human placental explants and cause fetal demise in mice. Sci Transl Med 10, eaao7090 (2018).
Haddow, A. D. et al. Genetic Characterization of Spondweni and Zika Viruses and Susceptibility of Geographically Distinct Strains of Aedes aegypti, Aedes albopictus and Culex quinquefasciatus (Diptera: Culicidae) to Spondweni Virus. PLoS Neglect Trop D 10, e0005083 (2016).
Lorenz, R. et al. ViennaRNA Package 2.0. Algorithm Mol Biol 6, 26 (2011).
McCaskill, J. S. The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers 29, 1105–1119 (1990).
Bernhart, S. H., Hofacker, I. L. & Stadler, P. F. Local RNA base pairing probabilities in large sequences. Bioinformatics 22, 614–615 (2005).
Lorenz, R., Wolfinger, M. T., Tanzer, A. & Hofacker, I. L. Predicting RNA secondary structures from sequence and probing data. Methods 103, 86–98, https://doi.org/10.1016/j.ymeth.2016.04.004 (2016).
Bernhart, S. H., Mückstein, U. & Hofacker, I. L. RNA Accessibility in cubic time. Algorithm Mol Biol 6, 3 (2011).
Wolfinger, M. T., Fallmann, J., Eggenhofer, F. & Amman, F. ViennaNGS: A toolbox for building efficient next-generation sequencing analysis pipelines. F1000Research 4:50, https://doi.org/10.12688/f1000research.6157.2 (2015).
Jiang, M., Anderson, J., Gillespie, J. & Mayne, M. uShuffle: a useful tool for shuffling biological sequences while preserving the k-let counts. BMC Bioinformatics 9, 192 (2008).
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935, https://doi.org/10.1093/bioinformatics/btt509 (2013).
Will, S., Reiche, K., Hofacker, I. L., Stadler, P. F. & Backofen, R. Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering. PLoS Comp Biol 3, e65 (2007).
Wolfinger, M. T. Bio::RNA::RNAaliSplit 0.09, https://doi.org/10.5281/zenodo.2532826, Https://github.com/mtw/Bio-RNA-RNAaliSplit (2019).
de Bernardi Schneider, A. & Wolfinger, M. T. Preventing disease outbreaks with computational biology, how far can we go? NCT CBNW Newsletter 58, https://doi.org/10.5281/zenodo.1463018 (2018).
Richard, A. S. et al. AXL-dependent infection of human fetal endothelial cells distinguishes Zika virus from other pathogenic flaviviruses. Proc Natl Acad Sci USA 201620558 (2017).
Kakooza-Mwesige, A., Mohammed, A. H., Kristensson, K., Juliano, S. L. & Lutwama, J. J. Emerging viral infections in Sub-Saharan Africa and the Developing nervous System: A Mini Review. Front Neurol 9, 82 (2018).
Junglen, S. et al. A new flavivirus and a new vector: characterization of a novel flavivirus isolated from uranotaenia mosquitoes from a tropical rain forest. J Virol 83, 4462–4468 (2009).
Huhtamo, E. et al. Novel flaviviruses from mosquitoes: Mosquito-specific evolutionary lineages within the phylogenetic group of mosquito-borne flaviviruses. Virology 464, 320–329 (2014).
Lvov, D. et al. ”Karshi” virus, a new flavivirus (Togaviridae) isolated from Ornithodoros papillipes (Birula, 1895) ticks in Uzbek SSR. Arch Virol 50, 29–36 (1976).
McLean, D. & Donohue, W. Powassan virus: isolation of virus from a fatal case of encephalitis. Canad Med Assoc J 80, 708 (1959).
Piantadosi, A. et al. Emerging cases of Powassan virus encephalitis in New England: clinical presentation, imaging, and review of the literature. Clin Infect Dis 62, 707–713 (2015).
O’Leary, D. R. et al. Birth outcomes following West Nile Virus infection of pregnant women in the United States: 2003–2004. Pediatrics 117, e537–e545 (2006).
Nishioka, D. A. et al. Yellow fever vaccination during pregnancy and spontaneous abortion: a case-control study. Trop Med Int Health 3, 29–33 (1998).
Woodruff, A., Bowen, E. & Platt, G. Viral infections in travellers from tropical Africa. Br Med J 1, 956–958 (1978).
for Disease Control, C. (CDC, P. et al. Intrauterine West Nile virus infection–New York, 2002. MMWR. Morb Mortal W 51, 1135 (2002).
Alpert, S. G., Fergerson, J. & Noël, L.-P. Intrauterine West Nile virus: ocular and systemic findings. Am J Ophthalmol 136, 733–735 (2003).
Julander, J. G. et al. Treatment of West Nile virus-infected mice with reactive immunoglobulin reduces fetal titers and increases dam survival. Antivir Res 65, 79–85 (2005).
Salinas, S. et al. Deleterious effect of Usutu virus on human neural cells. PLoS Neglect Trop Dis 11, e0005913 (2017).
Bassi, M. R., Sempere, R. N., Meyn, P., Polacek, C. & Arias, A. Extinction of Zika virus and Usutu virus by lethal mutagenesis reveals different patterns of sensitivity to three mutagenic drugs. Antimicrob Agents Ch AAC–00380 (2018).
Yin, X., Zhong, X. & Pan, S. Vertical transmission of dengue infection: the first putative case reported in China. Rev Inst Med Trop SP 58 (2016).
Ranjan, R., Kumar, K. & Nagar, N. Congenital dengue infection: Are we missing the diagnosis? Pediatr Infect Dis J 8, 120–123 (2016).
Sakakibara, S.-I. et al. RNA-binding protein Musashi family: roles for CNS stem cells and a subpopulation of ependymal cells revealed by targeted disruption and antisense ablation. Proc Nat Acad Sci USA 99, 15194–15199 (2002).
D’Ortenzio, E. et al. Evidence of sexual transmission of Zika virus. New Engl J Med 374, 2195–2198 (2016).
Kawahara, H. et al. Neural RNA-binding protein Musashi1 inhibits translation initiation by competing with eIF4G for PABP. J Cell Biol 181, 639–653 (2008).
Katz, Y. et al. Musashi proteins are post-transcriptional regulators of the epithelial-luminal cell state. Elife 3 (2014).
Clarke, B., Roby, J., Slonchak, A. & Khromykh, A. Functional non-coding RNAs derived from the flavivirus 3′ untranslated region. Virus Res 206, 53–61 (2015).
Weinberg, Z. & Breaker, R. R. R2R-software to speed the depiction of aesthetic consensus RNA secondary structures. BMC Bioinformatics 12, 3 (2011).
Acknowledgements
We thank Nikos Vasilakis for providing Spondweni virus sequences. We further thank Ivo Hofacker for fruitful discussions. This work was partly funded by the Austrian science fund FWF project F43 “RNA regulation of the transcriptome”.
Author information
Authors and Affiliations
Contributions
A.B.S. and M.T.W. conceived the study, conducted the in silico experiments, analysed the results and wrote the manuscript. Both authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Schneider, A.d.B., Wolfinger, M.T. Musashi binding elements in Zika and related Flavivirus 3′UTRs: A comparative study in silico. Sci Rep 9, 6911 (2019). https://doi.org/10.1038/s41598-019-43390-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-43390-5
This article is cited by
-
Zika virus RNA structure controls its unique neurotropism by bipartite binding to Musashi-1
Nature Communications (2023)
-
Theoretical studies on RNA recognition by Musashi 1 RNA-binding protein
Scientific Reports (2022)
-
Guapiaçu virus, a new insect-specific flavivirus isolated from two species of Aedes mosquitoes from Brazil
Scientific Reports (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.