ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis

Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

doi:10.1038/srep32713

Article
Open access
Published: 16 September 2016

Scientific Reports volume 6, Article number: 32713 (2016) Cite this article

Abstract

The current Zika virus (ZIKV) outbreak, which has spread across several areas of Africa, Southeast Asia and the Pacific islands, has been declared a global health emergency by the World Health Organization (WHO). The virus causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform, ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to ZIKV genomic, proteomic and therapeutic knowledge. It comprises whole genome sequences and their respective functional information regarding proteins, genes and structural content. Additionally, it delivers sophisticated analyses such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at the whole genome and proteome level in a user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also propose potential therapeutically important constituents, namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates.

Introduction

Zika virus (ZIKV) is a flavivirus belonging to the family Flaviviridae and is responsible for the current outbreak spreading over several areas of Africa, Southeast Asia and the Pacific islands. Zika infection was declared an emergency epidemic threat worldwide by the World Health Organization (WHO) in early 2016 (http://www.who.int/en/). ZIKV is a mosquito-borne virus transmitted through monkeys and mosquitoes of the genus Aedes¹, with humans as occasional hosts. The majority of infections are asymptomatic; symptomatic cases involve a mild illness called Zika fever, with headache, rash, malaise, chills etc². However, recent epidemiological studies suggest its association with neurological defects such as Guillain-Barre syndrome^3,4 and microcephaly^5,6. Besides zoonotic and mother-to-child transmission of the virus, Zika is also considered a sexually and transfusion-transmitted illness^7,8. General life cycle events of ZIKV are depicted in Supplementary Figure S1.

The virus was first isolated from a serum sample of a rhesus monkey² and then, in 1948, from a group of A. africanus mosquitoes in the Zika forest². Subsequently, ZIKV infection was identified in African countries, i.e., Uganda (1948)⁹, Nigeria (1971)¹⁰, Sierra Leone (1972)¹¹, Gabon (1975)¹² and the southeastern part of the Central African Republic (1979)¹³, and later in French Polynesia (2013–14)¹⁴ and Brazil (2015)¹⁵. Additionally, similar cases were reported in Asia and the Pacific, e.g., Malaysia (1996)¹⁶, Yap State of Micronesia (2007)¹⁷, Pakistan¹⁸ and Cambodia (2010)¹⁹.

ZIKV contains a single-stranded positive-sense RNA genome of about 11 kb²⁰. It encloses one open reading frame that encodes a polypeptide of 3419 amino acids, flanked by two non-coding regions (NCRs), i.e., the 5′ and 3′ NCRs²⁰. The viral polyprotein is proteolytically processed into three major structural proteins, namely capsid, precursor of membrane and envelope protein (E), seven non-structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, NS5)²¹ and glycoprotein^22,23. The symptoms of ZIKV infection are similar to those of other arboviral diseases such as dengue, so diagnosis based on symptoms is unreliable for specific identification. Therefore, laboratory diagnosis is vital to obtain conclusive results^17,24. Hence, appropriate selection of molecular diagnostic primers is important for routine ZIKV or flavivirus identification.

To date there is no specific antiviral drug treatment for ZIKV infection; only the symptoms can be mitigated²¹. Furthermore, vaccine and drug development is a multifaceted process, and delivering specific anti-ZIKV vaccines may take several years^25,26. Moreover, the tedious conventional vaccine development strategies make the situation worse²⁷. Thus, in silico approaches are beneficial in revealing potential vaccine candidates²⁸. Hence, an integrative approach comprising proteome-scale screening and immunoinformatics is applied here for predicting putative yet promising vaccine candidates. The epitope-driven vaccine development approach has proved advantageous against several infections^29,30,31,32, and recently an epitope-based vaccine called “RTS,S” (also known as Mosquirix^TM)^33,34 has moved to phase-III trials, utilizing an engineered T-cell epitope of the causative protozoan parasite (http://www.malariavaccine.org/). After successful clinical trials, it will be the first commercial vaccine against malaria^33,34.

Alternatively, there are other strategies to develop effective therapeutic regimens. For example, RNA interference (RNAi) technology is extensively used for gene silencing. Small interfering RNAs (siRNAs) are being tested as new potential therapeutics³⁵ against various pathogens and disorders^36,37,38. They are often employed for focused anti-viral therapies³⁹ against viruses including Hepatitis C virus (HCV) and Ebola^40,41,42,43. Presently, over twenty siRNA-based therapeutics are in clinical trials⁴⁴, including normal as well as chemically modified siRNAs (cmsiRNAs)⁴⁵. These include SPC2996 for leukemia, EZN3042 for solid tumors and SPC3649 for HCV infection⁴⁴.

Additionally, microRNAs (miRNAs) play an important role in viral infections and activation of the innate immune response⁴⁶. Therefore, systematic genome-wide screening of the ZIKV genome for miRNAs may assist in designing anti-viral therapeutics, including anti-miRs against ZIKV miRNAs. Further, predicting miRNA targets (in human and ZIKV) may help in understanding disease progression. Recently, computational studies predicting siRNAs⁴⁷ as well as epitopes⁴⁸ for ZIKV have been carried out, but no database exists that describes all the predicted siRNAs and epitopes in a well-defined and comprehensive manner. Lately, the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated proteins (Cas) approach has been developed for genome editing^49,50,51. In this approach, small guide RNAs (sgRNAs) are used to alter the genomes of various organisms, from humans to viruses. Hence, it is also crucial to have a complete list of sgRNAs for specific and efficient targeting of ZIKV through the CRISPR/Cas technique.

In addition, a therapeutic switching approach can be used to suggest potential drugs that may combat Zika infection. For decades, this strategy of drug repositioning has been extensively exploited to assign novel applications to existing drugs for different diseases^52,53. Since no drug is available to treat Zika, nor has any drug entered the drug discovery process, a computational approach leading to therapeutic switching could provide valuable insights in revealing potential drugs that may be effective against Zika infection.

However, to the best of our knowledge, no resource is available that is devoted to ZIKV comparative genomics, therapeutics and related analysis. Thus, to better understand these aspects, we have developed an integrated web-based multi-omics platform, ZikaVR. The resource mainly provides therapeutically essential components such as putative epitopes, siRNAs, miRNAs, sgRNAs (CRISPR/Cas9 targets), molecular diagnostic primers, related drug candidates and drug information for therapeutic interventions against ZIKV. Additionally, we have developed a graphical genome browser, the “ZikaVR browser”, for the collective representation of annotation and regulatory information.

Utility and Discussion

ZikaVR was built using a systematic approach in which the findings of our analyses have been assimilated into a single resource. It is a well-structured and interactive platform that supports a high-performance genome browser along with extensive comparative genomics information and therapeutically important components. It is organized into divisions such as genomes, annotation browser, genes and proteins, epitope map, phylogenomics, molecular diagnostic primers and therapeutics, the last of which contains sub-divisions, i.e., vaccine epitopes, siRNAs, miRNAs, sgRNAs, drug targets etc. Further, it supports various tools for analysis and visualization of genomic content, namely Zblast, Align viewer, Blockcon, Genoplotter, Str3D and Physicoprop (Fig. 1). This resource is intended to assist scientists and pharmaceutical agencies in designing experiments for the development of vaccines and drugs against ZIKV.

ZikaVR genomes, proteomes and browser

All genomic information and annotations of ZIKV were compiled to provide an informative user interface. To navigate through the genomes and proteomes, we set up the ZikaVR browser, which provides dynamic graphical visualization of annotations (Fig. 2) and is powered by JBrowse, as also implemented in ViralEpi v1.0⁵⁴ and HPVbase⁵⁵. It supports flexible and smooth zooming, scrolling and browsing at different levels to display detailed information. Along with this, individual genome sequence analysis is also presented in static mode, with a circular representation of viral genomes (Supplementary Figure S2) and distinct analysis outcomes. Additionally, the resource provides an advanced genome search page for easy retrieval of sequence data. Users can search and categorize ZIKV genomes based on their status (complete or partial), geographical area (Africa, Asia etc.), country (Uganda, Thailand etc.), year and length.

Structural elucidation of Zika virus proteins

A fundamental requisite for understanding the ZIKV infection mechanism and its underlying processes is protein structure information. In pursuit of potential vaccines and drugs against Zika, promising targets have to be identified. For this purpose, Zika protein sequences (Capsid, Envelope, Membrane glycoprotein, NS1, NS2A, NS2B, NS3, NS4A, NS4B and NS5) were subjected to in silico structure prediction. Recently, a few structures of Zika proteins^56,57,58,59,60 have been reported in the Protein Data Bank (PDB) (http://www.rcsb.org/pdb/home/home.do), and these structures were used as templates in our modeling procedure along with other structure templates. Overall, 840 tertiary structures of ZIKV proteins were modeled. The PDB IDs used as templates to model ZIKV proteins are listed in Supplementary Table S1. All the predicted structures for Zika proteins are provided in the resource with a Jmol visualization facility and can be downloaded as PDB files. These protein structures will help in estimating the binding of drugs to potential drug targets⁶¹.

Phylogenomics

We analyzed whole genome sequences of ZIKVs (85) and other viruses of the genus Flavivirus (10) to infer evolutionary relationships and patterns. Here, we used a comprehensive approach that includes phylogenetic analysis, codon usage bias and context analysis, along with the implications of conserved and variable regions for diagnostics and therapeutics. The consolidated approach used in this study could provide better insight into evolutionary patterns and classification.

Phylogenetic analysis

All the 95 genomes (Fig. 3) and 94 proteomes (Fig. 4) were taken from five groups of viruses, i.e., the Spondweni virus (SPOV), Dengue virus, Yellow fever virus, Japanese encephalitis virus and Semliki Forest virus groups. In the Maximum likelihood (ML) analysis, ZIKVs belonged to the Spondweni group and were arranged in different clades (Figs 3 and 4). They originate from various geographical regions such as South America, North America, Asia, Uganda, Central African Republic, Senegal etc. Similar phylogenetic patterns were reported in previous studies based on different gene regions^1,62,63,64,65,66. SPOV shows a close relation to ZIKV by both methods, whereas, as expected, the outgroups from the Togaviridae group (CHIKV and RRV) were distinct from the Flaviviridae group.

Codon usage bias and context

The pattern and usage frequencies of codons vary between and within genomes. They are affected by various factors, mainly gene length, nucleotide composition bias, G+C content, recombination events and rates, expression level etc. This biased usage of synonymous codons can help to indicate and understand patterns of genome evolution among species. We calculated codon context using the Anaconda software, which computes residual values for each codon pair in a genome. The residual value indicates the association between the two codons of each context through a chi-square test. Average residual values were calculated for the total number of codon pairs. Each value in a cell of the frequency table was converted into a two-colored map as shown in Fig. 5. In the matrix, green (value more than +3) and red (value less than −3) represent the preferred and rare codons, respectively.

The cluster pattern in the matrix shows differences as well as commonalities of codon context between species. Black cells signify residual values within the range of −3 to +3, i.e., codon contexts that show no bias. We have also represented codon preference as a histogram in which blue corresponds to rare codons and black to preferred codons (Fig. 6). This histogram shows that CAU, GGA and UGA are the most preferred codons, while ACG, UCG and UUA are among the rare codons in the NC_012532.1 strain of ZIKV. Detailed information on preferred and rare codons of other ZIKV strains is provided in the ZikaVR resource. Our results provide useful insights into codon context and usage bias patterns that may facilitate a better understanding of genome organization (Figs 5 and 6).
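The residual-and-colour scheme described above can be illustrated with a minimal sketch. It assumes a standard adjusted (Haberman) residual on the codon-pair contingency table; the exact statistic computed by Anaconda is not given in the text, so the formula, the function names and the toy sequence are all assumptions made for illustration only.

```python
from collections import Counter
from math import sqrt

def codon_pair_residuals(cds):
    """Adjusted residual for every (first codon, second codon) context.

    Assumption: residual = (O - E) / sqrt(E * (1 - row/N) * (1 - col/N)),
    the usual adjusted residual of a contingency table.
    """
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    pairs = Counter(zip(codons, codons[1:]))   # observed adjacent codon pairs
    n = sum(pairs.values())
    row, col = Counter(), Counter()
    for (first, second), count in pairs.items():
        row[first] += count
        col[second] += count
    residuals = {}
    for first in row:
        for second in col:
            expected = row[first] * col[second] / n
            denom = sqrt(expected * (1 - row[first] / n) * (1 - col[second] / n))
            if denom > 0:
                residuals[(first, second)] = (pairs[(first, second)] - expected) / denom
    return residuals

def colour(residual, cutoff=3.0):
    """Map a residual to the colour scheme: green above +cutoff, red below -cutoff."""
    if residual > cutoff:
        return "green"
    if residual < -cutoff:
        return "red"
    return "black"

if __name__ == "__main__":
    toy = "AAACCCAAACCCAAACCC"  # hypothetical sequence, not a ZIKV gene
    for context, value in sorted(codon_pair_residuals(toy).items()):
        print(context, round(value, 3), colour(value))
```

On real coding sequences the table has up to 64 × 64 cells, and only contexts with strongly positive or negative residuals would be coloured green or red.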

Glycosylation patterns in ZIKV

In this study, we predicted glycosylation sites (N-, O- and C-linked) in all the ZIKV strains. N-glycosylation is a post-translational modification (PTM) that plays an important role in viral processes such as proteolytic processing, protein trafficking, virulence, immune evasion, virus assembly, receptor binding etc⁶⁷.

For N-linked glycosylation (N-GlcNAc), we found the Asn-X-Ser/Thr motif in the envelope, NS2A, NS3, NS4B and NS5 proteins among all strains. In NS3, N-GlcNAc sites were present at positions 158, 249 and 568, in NS4B at 64 and 216, and in NS5 at 214, in all strains isolated from different hosts (human, monkey and mosquitoes). However, N-GlcNAc sites at position Asn-154 in envelope and at 149 in NS2A sequences were restricted to strains from human hosts. In ZIKV and West Nile virus strains, the deletion of the potential glycosylation site in the envelope protein could be related to serial passages of the virus in mouse brain, as previously reported^66,68,69.
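As a rough illustration of what the Asn-X-Ser/Thr motif means at the sequence level, the sketch below scans a protein for the sequon with a regular expression. This is not the neural-network prediction used in this study; it only locates candidate motifs. Excluding proline at the X position is the conventional rule and is an assumption here, as are the function name and the toy peptide.

```python
import re

# N-X-S/T sequon. The lookahead lets overlapping sequons be reported.
# Assumption: X may be any residue except proline (conventional rule).
SEQUON = re.compile(r"(?=N[^P][ST])")

def find_sequons(protein):
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]

if __name__ == "__main__":
    toy = "MKNVSAANPTGGNNTS"  # hypothetical peptide, not a ZIKV sequence
    print(find_sequons(toy))
```

A motif hit is only a necessary condition for N-glycosylation; whether a site is actually occupied depends on context that a pattern match cannot capture.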

In addition, we detected O-linked glycosylation (O-GalNAc), which is responsible for various biological activities such as virus/bacteria-host interactions, ligand recognition, signal transduction etc⁷⁰. The most common O-GalNAc positions are: envelope protein (7, 47, 48, 170, 173), membrane glycoprotein (6, 8, 56), NS1 (290, 293), NS2A (2, 3, 19), NS2B (1, 5), NS3 (34, 135, 225, 245), NS4A (36), NS4B (51) and NS5 (215). Furthermore, we identified C-linked glycosylation sites in membrane glycoprotein, NS2B, NS3 and NS5. In the membrane glycoprotein sequence (AHL43503), a glycosylation site was found at only one position (115). NS2B and NS3 were predicted to be glycosylated at positions 121 and 234, respectively. Similarly, in the NS5 protein, we found glycosylation at residue 702 in some strains (isolated from sentinel rhesus monkey and Aedes opok). The function of C-mannosylation is still not clear; however, it may have a crucial role in secretion and enzymatic activity⁷¹. In recent years, the role of glycosylation in viral infection has become an emerging field in drug design⁶⁷. Some studies link altered glycosylation sites with diseases (such as cancer), facilitating the development of biomarkers or therapeutic targets^72,73.

Molecular diagnostics

We compiled a list of all the oligonucleotide primer pairs used for the detection of ZIKV to date. All the information related to the primers, i.e., their sequence, orientation, genomic positions with respect to the reference genome, and GenBank IDs of the strains that have been experimentally isolated using the respective primers, is available on the website.

The primers obtained were tested for their specificity against all 85 ZIKV genomes, as listed in Table 1 and Supplementary Table S2. A genome was considered to be amplified by a primer pair only if both the forward and reverse primers mapped to it completely. Of the 6 ZIKV-specific primer pairs obtained, the pair Unnamed1 and Unnamed2 showed exact complementarity against the maximum number of genomes, i.e., 82, and has also previously been shown to detect all 37 strains of ZIKV used in the respective study^24,74. Also, the poor complementarity of this pair against 10 other related flaviviral genomes suggests that it may be used as a pan-ZIKV pair.
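The mapping criterion described above (both primers must match completely) can be sketched as follows. This is a simplified illustration under stated assumptions: primers are non-degenerate, the forward primer is matched on the given strand, the reverse primer is matched through its reverse complement downstream of the forward site, and all names and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def amplifies(genome, forward, reverse):
    """True if both primers match the genome exactly and in the right order."""
    genome = genome.upper()
    start = genome.find(forward.upper())
    if start == -1:
        return False
    # The reverse primer anneals to the plus strand as its reverse complement.
    site = reverse_complement(reverse)
    return genome.find(site, start + len(forward)) != -1

def count_amplified(genomes, forward, reverse):
    """Number of genomes (dict of name -> sequence) amplified by the pair."""
    return sum(amplifies(seq, forward, reverse) for seq in genomes.values())

if __name__ == "__main__":
    genomes = {  # hypothetical toy sequences, not ZIKV genomes
        "toy1": "AAACCCGGGTTTACGTACGATCGA",
        "toy2": "AAACCCGGGTTTACGTACGATCCA",
    }
    print(count_amplified(genomes, "AAACCC", "TCGATCG"))
```

Degenerate primers would need an IUPAC-aware comparison instead of the exact string search used here.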

Table 1 ZIKV specific primers to amplify the genomic regions of the reference genome.

Additionally, six universal Flaviviridae primer pairs (Supplementary Table S2) used for detection of ZIKV strains in the literature were also analyzed. Of these six, one universal degenerate pair, unifor and unirev, was predicted to amplify 69 of the 85 analyzed ZIKV genomes. This primer pair also matched 7 of the 10 related outgroups, i.e., West Nile virus, Spondweni virus, Japanese encephalitis virus and Dengue virus 1, 2, 3 and 4. Thus, where a patient needs to be tested for a number of flaviviruses that cause similar symptoms, this primer pair can be used. Furthermore, 108 normal and 145 degenerate potential primer pairs were designed for the 85 ZIKV genomes. The majority of these primers could detect more than 90% of the genomes. A selected primer pair for each genomic region is provided in Table 2. Gene names and numbers of designed primer pairs are listed in Supplementary Table S3, and the complete list is provided on the web resource. The compendium of experimentally used primers, together with those designed under stringent conditions in this study, can be used for ZIKV detection and thus may be extremely useful for diagnosis.

Table 2 Designed ZIKV specific primers for each genomic region.

Potential Therapeutics

Putative vaccine candidates

In this study, we attempted to identify potential Zika epitopes comprising B-cell epitopes, T-cell epitopes and promiscuous MHC binders. 9-mer peptides were generated to analyze the Zika genome, and peptides that mapped to the human proteome were eliminated. A total of 641 B-cell epitopes, with their respective peptide sequences, Lbtope scores and B-cell confidence, are listed on the website. The peptides WGNGCGLFG and VDRGWGNGC scored highest in B-cell epitope prediction and are proposed as targets. A total of 1458 T-cell epitope-based peptides were identified in ZIKV and are reported in ZikaVR. Overall, 6725 MHC class I and 1631 MHC class II epitopes were also predicted and are displayed on the website. The information includes peptide sequences, MHC class I and II alleles and their counts. Further, to assess the impact of these binding peptides, the results from the IFNepitope and IL4pred methods have also been integrated in the resource. The interferon-gamma inducing and interleukin-4 inducing peptides provide insights into the downstream immune processes. In the analysis, 722 peptides binding to MHC class II alleles were predicted to induce interferon-gamma, and the method used for prediction is also shown, while 1169 peptides binding to MHC class II alleles were predicted to stimulate interleukin-4.

Moreover, the study revealed a few experimentally characterized epitopes, validated through B-cell, T-cell and MHC assays, among these putative epitopes. Information on these Zika epitopes, the assays used for their validation and the number of reported evidences is provided in ZikaVR. The epitopes TYQNKVVKVL, YFHRRDLRL and YMWLGARFL were reported through B-cell, T-cell and MHC assays to activate the majority of arms of the immune system. Further, based on our in silico high-throughput analysis, we recommend 32 potential vaccine epitope candidates (Table 3).

Table 3 List of recommended 32 potential vaccine epitope candidates.

RNA based therapeutics

Small interfering RNAs and microRNAs

We extracted 10776 putative siRNAs using the VIRsiRNApred and desiRm software, with predicted efficacy in inhibiting the target mRNAs ranging from 0 to 100 percent. The immunomodulatory impact of these siRNAs, as predicted by the imRNA program, indicates their potential role in further invoking the immune system. Of the siRNAs predicted using VIRsiRNApred, 521 showed 70 percent or more silencing efficacy. Moreover, off-targets of the siRNAs in the human genome are provided along with the predicted siRNAs. A representative set of efficient siRNAs is provided in Supplementary Table S4. For the potential siRNAs obtained by desiRm with a higher efficacy score (i.e., >0.80), immunomodulatory roles were also deduced; a list of representative efficient siRNAs is given in Supplementary Table S5. The server provides an interactive view where one can access detailed siRNA-related information including sequence, efficacy scores and immunomodulatory scores. Users can check the conservation of a ZIKV siRNA against other viruses using the “siTarConserve” tool in the VIRsiRNApred web server⁷⁵. siRNAs have been used to target viruses with positive results, including flaviviruses such as Dengue, West Nile virus, Japanese encephalitis virus etc³⁶.

Additionally, 15 ZIKV pre-miRNAs were predicted using VMir, and mature 5p and 3p sequences were extracted from each pre-miRNA using the Mature Bayes tool (30 mature ZIKV-miRNAs) (Table 4). Apart from the mature miRNA sequences and their locations in the pre-miRNA and the ZIKV genome, we extracted the GC content and free energy secondary structures of all mature miRNAs. The minimum free energy (MFE) secondary structures are provided on the web server along with other details. Further, orthologous miRNAs and potential targets were identified. Using the TargetScan script and the seed-align tool of VIRmiRNA, we listed 202 orthologous miRNAs, mainly from viruses such as Epstein–Barr virus (EBV), Human herpesvirus 6B (HHV-6), White Spot Syndrome virus (WSSV) etc., as well as from Drosophila melanogaster and Homo sapiens. The seeds of most of the orthologous miRNAs matched ZIKV-MR32-3p, followed by ZIKV-MD77-5p and ZIKV-MD34-3p. Moreover, we identified 688 experimentally validated targets. Most of the orthologous targets were reported for ebv-miR-bart7, which is orthologous to ZIKV-MR66.
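The idea of seed-based orthology can be illustrated with a short sketch. It assumes that the seed is nucleotides 2 to 8 of the mature miRNA and that two miRNAs are treated as seed orthologs when their seeds are identical; how the seed-align tool of VIRmiRNA defines these is not stated in the text, and the names and sequences below are hypothetical.

```python
def seed(mirna, start=2, end=8):
    """Seed of a mature miRNA (1-based positions start..end), written as RNA."""
    return mirna.upper().replace("T", "U")[start - 1:end]

def seed_orthologs(query, known):
    """Names of known miRNAs (dict of name -> sequence) sharing the query seed."""
    target = seed(query)
    return sorted(name for name, sequence in known.items() if seed(sequence) == target)

if __name__ == "__main__":
    query = "AUGGCAUCCGUAAGCUAGCUA"  # hypothetical mature miRNA
    known = {
        "toy-miR-1": "CUGGCAUCAAAAAAAAAAAAA",  # same seed as the query
        "toy-miR-2": "AUGGCAUGCGUAAGCUAGCUA",  # seed differs at one position
    }
    print(seed(query), seed_orthologs(query, known))
```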

Table 4 List of predicted and analyzed 30 mature ZIKV-miRNAs.

Single guide RNAs (sgRNAs)

Based on our analysis, we obtained 1898 sgRNAs in the complete genome of ZIKV. The output is presented in tabular form, displaying the sgRNA sequence, PAM, strand, i.e., sense/antisense (+/−), start and end position of the 23-residue sgRNA in the genome and its total GC%. This should help to identify CRISPR targets against ZIKV prior to experimental procedures and save time.
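A table of this kind can be approximated with the sketch below. It assumes that a 23-residue target consists of a 20-nt protospacer followed by an NGG PAM (the usual SpCas9 convention), scans both strands, and reports 1-based coordinates on the sense strand. The tool and filters actually used in this study are not described here, so the pattern, the field names and the toy sequence are assumptions.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Assumption: 20-nt protospacer + NGG PAM = 23 nt; lookahead keeps overlapping hits.
TARGET = re.compile(r"(?=([ACGT]{20})([ACGT]GG))")

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def find_sgrnas(genome):
    """List candidate sgRNA targets on both strands of a genome sequence."""
    genome = genome.upper().replace("U", "T")
    n = len(genome)
    hits = []
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        for match in TARGET.finditer(seq):
            spacer, pam = match.group(1), match.group(2)
            i = match.start()
            if strand == "+":
                start, end = i + 1, i + 23
            else:
                # Convert antisense coordinates back to the sense strand.
                start, end = n - i - 22, n - i
            gc = 100.0 * sum(base in "GC" for base in spacer + pam) / 23
            hits.append({"sgrna": spacer, "pam": pam, "strand": strand,
                         "start": start, "end": end, "gc_percent": round(gc, 2)})
    return hits

if __name__ == "__main__":
    toy = "A" * 20 + "TGG"  # hypothetical 23-nt sequence with one target
    for hit in find_sgrnas(toy):
        print(hit)
```

A practical pipeline would additionally screen each candidate for off-targets in the host genome, which this sketch does not attempt.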

Identification of potential drugs via therapeutic switching

To identify putative drugs that may prove promising against Zika infection, the Zika genome was mapped to the targets of existing drugs in DrugBank for closely related viruses, as shown in the drug targets section under the Therapeutics option. Selection was based on identity and coverage with ZIKV, with a minimum threshold of 52% identity set for filtering. The majority of known drug targets, such as the genome polyprotein (DENV-2 and DENV-3), mapped to the Zika genome polyprotein with around 80% identity and have well-known small molecules/drugs against them in DrugBank. These compounds include ribavirin monophosphate (DrugBank ID: DB01693), S-adenosyl-L-homocysteine (DrugBank ID: DB01752) and alpha-L-fucose (DrugBank ID: DB04473) (Table 5). The precise pharmacological action of most of these compounds against Zika infection has not yet been established. However, ribavirin monophosphate is known to possess anti-viral activity, either through lethal mutagenesis or through inhibition of inosine monophosphate dehydrogenase (IMPDH), which leads to a decline in intracellular GTP levels⁷⁶. The decreased GTP levels interfere with viral growth by limiting viral protein synthesis. On the other hand, S-adenosyl-L-homocysteine is believed to halt the maturation of viral mRNA, thus displaying anti-viral activity through selective inhibition of methyltransferases⁷⁷. In the drug repositioning analysis, the known drug targets in DrugBank with maximum identity and coverage (Table 5) were aligned to all the Zika genomes available at ZikaVR. This suggests that the proposed drugs may target all the Zika strains, but to varying degrees depending on their similarity with the known drug targets. This strategy of drug repositioning has delivered promising drug candidates^52,53 that, upon validation and successful clinical trials, may be effectively used against ZIKV.

Table 5 The proposed drugs for Zika virus via drug repositioning.

Analysis tools

ZikaVR also provides analysis and visualization tools to explore genomic and structural information. These include (1) Zblast: finds similarity and aligns a query sequence to the ZIKV genomes and genes. Its output is similar to standard BLAST output, along with a tabular representation. (2) Align viewer: an alignment visualization tool to interactively visualize, edit and manipulate multiple sequence alignments. (3) Blockcon: allows users to select conserved regions of DNA or protein sequences from a multiple sequence alignment for use in phylogenetic analysis. Here, we have implemented the Gblocks program⁷⁸ to provide an easy-to-use server with maximum functionality. Users can also download different results using the download option. (4) Genoplotter: a dot plot analysis tool to compare two genome sequences in a two-dimensional plot, powered by the Gepard V1.30 program⁷⁹. (5) Str3D: a tool to visualize the 3-dimensional structure of proteins, implemented using Jmol (www.jmol.org/), an open-source Java viewer. (6) Physicoprop: provides a graphical view of important physico-chemical properties of epitopes or peptides^80,81.

Materials and Methods

Genomic and proteomic data collection

ZIKV whole genome and proteome sequences were retrieved from the NCBI and GenBank databases. A total of 333 sequences were obtained up to May 2016; these were manually checked for the presence or absence of well-reported ORFs and categorized. After curation, 94 complete (9 with ambiguous nucleotides (Ns)) and 239 partial genomes were provided. A comprehensive advanced search option is implemented on the server for easy retrieval and classification of sequence data. From the genomic data, the following information was extracted: strain, isolate, isolation source, genome size, region, geographical area, host and year. Nucleotide and protein sequences of all the full-length ZIKV genomes were investigated. Our analysis comprised two phases: the first was the full-length genomic analysis, and in the second we analyzed each gene sequence at the nucleotide (nt) and amino acid (aa) level.

Structural elucidation of Zika proteins

We modeled 10 proteins of ZIKV to determine their tertiary structures. Proteomes were divided into individual proteins using protein boundaries. First, we identified templates for each protein by performing a BLAST search against the PDB database and selected the templates with an e-value less than 0.01. If the number of hits was more than 10, we selected the top 10 templates. We used the MODELLER software⁸² to build the homology model for each protein. However, no templates were available for the ‘NS2A’ and ‘NS4A’ proteins. For these, we first clustered the respective sequences at a 95% identity cutoff using the CD-HIT software⁸³ to select representative sequences of the individual proteins. We obtained 2 representative sequences for ‘NS2A’ and 1 for ‘NS4A’. We used the online I-TASSER structure prediction server^84,61 to predict the tertiary structure of the representative protein sequences. Next, we used the first model predicted by I-TASSER as a template to model the rest of the respective protein sequences.
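The template selection rule described above can be sketched as a simple filter. The sketch assumes that BLAST hits are available as (template identifier, e-value) pairs and that "top 10" means the ten hits with the lowest e-values; the ranking criterion is not stated in the text, and the identifiers used below are hypothetical.

```python
def select_templates(hits, max_evalue=0.01, top_n=10):
    """Keep hits with e-value below the cutoff, then the top_n best-scoring ones.

    hits: iterable of (template_id, evalue) pairs.
    Assumption: hits are ranked by ascending e-value.
    """
    passing = [hit for hit in hits if hit[1] < max_evalue]
    passing.sort(key=lambda hit: hit[1])
    return [template_id for template_id, _ in passing[:top_n]]

if __name__ == "__main__":
    hits = [("tmplA", 1e-50), ("tmplB", 0.5), ("tmplC", 1e-5)]  # hypothetical hits
    print(select_templates(hits))
```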

Multiple sequence alignment

In the whole genome study, all genomes were aligned using MEGA version 6.06⁸⁵. Multiple sequence alignment was conducted using the ClustalW program⁸⁶ with default parameters to explore the conserved sites among distinct ZIKV genomes, defined by a criterion of 80% or higher conservation.
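One way to read the 80% conservation criterion is sketched below: an alignment column counts as conserved when its most frequent residue (ignoring columns dominated by gaps) is present in at least 80% of the sequences. This interpretation, the function name and the toy alignment are assumptions; the exact definition used in this study is not given.

```python
from collections import Counter

def conserved_columns(alignment, threshold=0.8):
    """Return 1-based columns whose most common residue reaches the threshold."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    columns = []
    for j in range(length):
        column = [seq[j].upper() for seq in alignment]
        residue, count = Counter(column).most_common(1)[0]
        # Columns where the gap character dominates are not reported.
        if residue != "-" and count / len(alignment) >= threshold:
            columns.append(j + 1)
    return columns

if __name__ == "__main__":
    toy = ["ACGT-", "ACGA-", "ACTA-", "ACGA-", "AATAT"]  # hypothetical alignment
    print(conserved_columns(toy))
```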

Phylogenetic analysis

Genomes (85) and proteomes (84) of ZIKVs, along with 10 viruses of the genus Flavivirus, were analyzed to deduce the evolutionary relationships among them. Phylogenetic trees were constructed with the ML algorithm in MEGA 6.06⁸⁵. First, the 95 viral genomes and 94 proteomes (1 genome was non-functional) were aligned using the ClustalW algorithm integrated in MEGA 6.06. The General Time Reversible (GTR) model with a discrete Gamma distribution (+G) was used to build the ML tree for genomes. Likewise, the LG model with a discrete Gamma distribution (+G) was employed for the ML tree for proteomes. Statistical support for both trees was calculated by bootstrap analysis using 1000 pseudo-replicates.

Codon usage bias and context study

We compared and summarized several measures of codon usage, such as RSCU (relative synonymous codon usage) values, nucleotide contents and ENC (effective number of codons) values⁸⁷. The number of times a codon is used for each amino acid (raw frequency) was also used to analyze codon bias. Complete genomic sequences were analyzed using the CUSP (Create a codon usage table) program of EMBOSS (The European Molecular Biology Open Software Suite, Cambridge, UK). Additionally, we analyzed rare and preferred codon distribution and codon context among ZIKV strains using the Anaconda program⁸⁸.
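To make the RSCU measure concrete, the following Python sketch computes it from a coding sequence using the usual definition: the observed count of a codon divided by the mean count of all synonymous codons for the same amino acid. It is an independent illustration and does not reproduce the EMBOSS CUSP output; the function name `rscu` and the use of the standard genetic code are assumptions of this sketch.

```python
from collections import Counter, defaultdict
from itertools import product
from typing import Dict

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> Dict[str, float]:
    """Return RSCU values for every codon of each amino acid seen in cds."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group synonymous codons by amino acid, skipping stop codons.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from the sequence
        for c in family:
            # observed count / (family total / family size)
            values[c] = counts[c] * len(family) / total
    return values
```

An RSCU of 1.0 means a codon is used as often as expected under uniform synonymous usage; values above or below 1.0 indicate preferred or avoided codons.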

Glycosylation sites

We investigated capsid, envelope, membrane glycoproteins, NS1, NS2A, NS2B, NS3, NS4A, NS4B, NS5 protein sequences of ZIKV strains using NetCGlyc v1.0⁸⁹, NetOGlyc v4.0⁹⁰ and NetNGlyc v3.1^91,92. These algorithms are based on neural networks to predict C-mannosylated, mucin type GalNAc O-linked and N-linked glycosylation sites respectively.

Molecular diagnostic primers

The literature was thoroughly examined for experimentally used primers for the detection and diagnosis of ZIKV infections. These PCR primers were extracted and checked for specificity against about 85 ZIKV genomes. Additionally, potential candidate primers were designed for these genomes using the PrimerDesign-M tool⁹³ with default parameters, except for the following. Primers were designed for multiple fragments in a given region of interest based on the multiple sequence alignment, and columns with more than 5% gaps were not considered. The flexible option for fragment overlap was selected. The complexity limit was set to 2 (i.e., one degenerate position allowed).
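The gap criterion mentioned above (columns with more than 5% gaps are not considered) can be sketched in Python as follows. This illustrates the stated rule only and is not the internal logic of PrimerDesign-M; `usable_columns` is a hypothetical helper name.

```python
from typing import List


def usable_columns(alignment: List[str], max_gap_fraction: float = 0.05) -> List[int]:
    """Return 0-based indices of columns whose gap fraction is within the limit.

    alignment: equal-length aligned sequences, with '-' marking gaps.
    """
    if not alignment:
        return []
    n_seqs = len(alignment)
    keep = []
    for i in range(len(alignment[0])):
        gaps = sum(1 for seq in alignment if seq[i] == "-")
        # Columns with MORE than max_gap_fraction gaps are dropped.
        if gaps / n_seqs <= max_gap_fraction:
            keep.append(i)
    return keep
```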

Epitopes

Epitope identification began by generating overlapping 9-mer peptides from 5 proteins encoded by the ZIKV genome, namely polyprotein, envelope protein, glycoprotein, NS3 and NS5. These peptides were analyzed for their immune potential and restricted to 9-mers that are specific to the virus and absent from the human genome, thus lowering the risk of self-tolerance. For this, human thousand proteomes were constructed by translating sequences from The 1000 Genomes Project into proteins⁹⁴, and viral peptides exhibiting 100% identity with the human thousand proteomes were eliminated from the final analysis. After obtaining ZIKV-specific 9-mers, the next objective was to narrow the search to peptides that may induce an immune response and produce memory cells against the virus in humans. To assess the immunomodulatory impact of these peptides, three major classes of epitopes were considered: B-cell epitopes, T-cell epitopes and MHC allele binding peptides.
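The peptide generation and human-identity filter described above can be sketched in Python. This is a simplified illustration, not the authors' scripts: the function names are hypothetical, and "100% identity" is assumed to mean an exact 9-mer match anywhere in a human protein sequence.

```python
from typing import Iterable, List


def overlapping_peptides(protein: str, k: int = 9) -> List[str]:
    """Return all overlapping k-mers of a protein (step size 1)."""
    return [protein[i:i + k] for i in range(len(protein) - k + 1)]


def virus_specific_peptides(viral_protein: str,
                            human_proteins: Iterable[str],
                            k: int = 9) -> List[str]:
    """Return viral k-mers with no exact match in any human protein."""
    # Index every human k-mer once so each lookup is a set membership test.
    human_kmers = set()
    for human_protein in human_proteins:
        human_kmers.update(overlapping_peptides(human_protein, k))

    seen = set()
    specific = []
    for peptide in overlapping_peptides(viral_protein, k):
        # Skip peptides identical to a human k-mer, and skip duplicates.
        if peptide not in human_kmers and peptide not in seen:
            seen.add(peptide)
            specific.append(peptide)
    return specific
```

For a proteome-scale human set, the k-mer index would be large; building it once and reusing it for all viral proteins keeps the filter practical.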

B-cell epitopes

The linear and conformational B-cell epitopes in ZIKV were predicted using the LBtope⁹⁵ and CBTOPE⁹⁶ methods, respectively. LBtope is built on a large dataset of experimentally validated B-cell epitopes and non-epitopes. To increase the reliability of prediction, a cut-off of 60% was chosen for this method. CBTOPE predicts conformational B-cell epitopes of an antigen from its primary sequence; a threshold of −0.3 was chosen for prediction. The results from both methods were integrated and are shown on the website.

MHC allele binding peptides

In the present study, MHC class I binders in ZIKV were predicted using the ProPred1 tool⁹⁷, and the top 4% were selected. This method uses a matrix-based approach to predict peptides that bind MHC class I alleles. Because these peptide binders of MHC molecules are known to activate cytotoxic T lymphocytes (CTL), they are referred to as putative CTL epitopes, which were detected via CTLPred⁹⁸. Likewise, MHC class II binders were predicted using ProPred⁹⁹ to estimate the potential T helper (Th) epitopes in ZIKV. The top 3% of identified peptides were proposed as promiscuous MHC class II binders.

T-cell epitopes and immune response prediction

After identifying MHC binders that act as probable T-cell epitopes, CTL epitopes were also predicted separately using CTLPred, which is built on artificial neural network (ANN) and support vector machine (SVM) modules. This tool works directly on the antigen primary sequence and skips the MHC class I binder prediction step. For the detection of CTL epitopes in ZIKV, the SVM-based module was used with default parameters. All predicted epitopes were also searched in the Immune Epitope Database (IEDB), the largest database of experimentally validated epitopes or antigenic regions¹⁰⁰. The experimentally confirmed IEDB epitopes were mapped to Zika antigens to reveal potential epitopes.

In addition, the cytokines induced by MHC class II binders were estimated. The IFNepitope method was applied at its default threshold of 0 to identify predicted peptides that can stimulate Th1 cells (T-helper cell type I)¹⁰¹, leading to the release of interferon-gamma (IFN-γ). Similarly, antigenic regions that trigger Th2 (T-helper cell type II) cells to release the cytokine interleukin-4 (IL4) were predicted using IL4pred (default threshold of 0.2)¹⁰². These predicted epitopes, and their subsequent effect on the release of immune regulators, facilitate the design of vaccine candidates by providing a broader understanding of the progression of Zika infection.

Small interfering RNAs and microRNAs

Several databases of viral siRNAs are available, e.g. VIRsiRNAdb³⁶, but no siRNA has so far been designed for ZIKV. We employed VIRsiRNApred⁷⁵ and the DesiRm software¹⁰³ to predict siRNAs against the ZIKV reference genome along with off-target information. Highly potent siRNAs (efficacy > 0.80) were then identified, together with their immunomodulatory impact as predicted by the imrna program¹⁰⁴. Furthermore, we used the VMir algorithm¹⁰⁵ to detect putative microRNA hairpins (pre-miRNAs). All VMir predictions were carried out using the default parameters. The MatureBayes tool¹⁰⁶ was used to identify mature miRNAs from the hairpin pre-miRNAs. Targets of the predicted ZIKV miRNAs were predicted using the Tar-Find tool of VIRmiRNA¹⁰⁷. The secondary structure of ZIKV miRNAs was computed using the RNAfold program of the Vienna Package¹⁰⁸. Additionally, we applied orthology-based prediction of miRNAs and their targets: similar to the TargetScan methodology, we aligned seed sequences of ZIKV miRNAs to experimentally known viral and cellular miRNAs using the seed-align tool of VIRmiRNA.

Single guide RNAs (sgRNAs) identification

For this purpose, we developed an in-house Perl script to identify all possible sgRNAs in the ZIKV genome on the basis of the protospacer adjacent motif (PAM). The script scans for all “NGG” motifs on both strands (forward and reverse) of the ZIKV genome and extracts the 20 nucleotides upstream of each motif as putative sgRNAs or CRISPR targets.
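The PAM-based scan can be sketched as follows. The authors' script was written in Perl and is not shown in the paper, so this Python version is only an illustration of the described logic; the function names, the tuple layout of the results and the convention of reporting reverse-strand coordinates relative to the reverse complement are assumptions.

```python
import re
from typing import List, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sgrnas(genome: str, spacer_len: int = 20) -> List[Tuple[str, int, str, str]]:
    """Find candidate sgRNAs as the spacer_len nt directly upstream of each NGG.

    Returns (strand, spacer_start, spacer, pam) tuples. For the '-' strand,
    spacer_start is a 0-based offset on the reverse-complemented sequence.
    """
    genome = genome.upper().replace("U", "T")  # accept RNA-style input
    results = []
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        # A zero-width lookahead reports overlapping PAM sites as well.
        for match in re.finditer(r"(?=[ACGT]GG)", seq):
            pam_start = match.start()
            if pam_start >= spacer_len:  # need a full-length spacer upstream
                spacer = seq[pam_start - spacer_len:pam_start]
                pam = seq[pam_start:pam_start + 3]
                results.append((strand, pam_start - spacer_len, spacer, pam))
    return results
```

Candidates found this way are only motif matches; ranking them by predicted activity or off-target risk would require additional tools.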

Drug Repositioning

For this purpose, the Zika genome was mapped to existing drugs in DrugBank¹⁰⁹ for related viruses. The ZIKV genome was thus screened against DrugBank drugs that may already have cleared toxicity and other safety regulations for closely related viruses. This approach of therapeutic switching is judicious because it reduces the costs incurred during clinical trials. The analysis indicates that well-characterized drugs for related infections could be tested against Zika as well, reducing the complexity of identifying effective drugs and simplifying the drug discovery process.

Development and implementation of ZikaVR

The ultimate challenge was to build a dedicated portal integrating the predictions from the above analyses with already available knowledge on ZIKV. We designed an integrated resource that may assist the scientific community in developing therapeutics against ZIKV. The platform runs on the Linux operating system using Apache HTTP Server (version 2.2.17). The back-end is supported by MySQL (version 5.0.51b) for storage and management of data. The interface was created using HTML5, CSS3, PHP (version 5.2.14) and JavaScript (version 1.7), as previously implemented^55,43,110,111, which supports its use across a wide range of devices such as laptops, mobiles and tablets. A number of in-house Perl and Python scripts were written for predicting putative epitopes, siRNAs, drug targets and potential drugs for ZIKV. A generalized workflow for ZikaVR is depicted in Fig. 1 to give an overview of the resource.

Future developments

The advent of next generation sequencing (NGS) platforms makes it possible to decipher disease-specific single nucleotide variants (SNVs), mutations, viral integrations, epigenetic events etc. In future, we will develop and provide NGS data analysis tools and pipelines to study these events. Additionally, as viral variations influence pathogenicity, we will catalog known variations relevant to distinct diseases, which could provide a comprehensive basis for personalized medicine. Other information can also be added, such as epitope structures and their visualization. This may guide researchers in the development of effective therapeutic solutions and vaccines. We will continue to maintain ZikaVR and update it quarterly or half-yearly.

Conclusions

The intensification of the Zika epidemic critically demands international collaborative efforts to reduce the risk of further spread in affected regions and to prevent the threat from entering non-affected countries. Research on ZIKV, its mode of transmission, its pattern of establishing infection and the mechanisms by which it induces neurological abnormalities is the fundamental need of the hour. An interdisciplinary approach can address the issue of determining efficient vaccine candidates against ZIKV infection. Currently, only a limited number of in silico studies have been carried out on ZIKV and Zika etiology, and no such resource or compendium exists to date. In ZikaVR, the majority of the tools and methods used for the analysis were established by our own group. These methods were preferred over other equally good software packages because standalone versions are available for analyzing large numbers of sequences and their performance is comparable. ZikaVR provides a unique blend of an interactive genomic annotation browser, comparative genomics and therapeutic analysis. Components such as 3D structures, whole genome alignments, phylogenetic studies, genomic rearrangement and syntenic regions, and codon usage and context are useful for comparative and evolutionary analysis, which is especially important for diverse applications, i.e., epidemiological studies, taxonomy, comparative genomics, structural analysis etc. Further, molecular entities such as primers are critically essential for diagnostics. Based on our in-depth computational analysis, we recommend and provide a list of 32 potential vaccine epitope candidates. Similarly, siRNAs, miRNAs, sgRNAs and repositioned drug candidates are also proposed. We developed a user-friendly interface and a dynamic resource with sophisticated analysis tools.
It is anticipated that ZikaVR will provide a valuable and comprehensive resource on the genomic, evolutionary and therapeutic aspects of ZIKV, making it usable by the wider research community. The predicted therapeutic targets in this study could be utilized for designing effective vaccines and drugs to combat ZIKV.

Additional Information

How to cite this article: Gupta, A. K. et al. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis. Sci. Rep. 6, 32713; doi: 10.1038/srep32713 (2016).

References

Hayes, E. B. Zika virus outside Africa. Emerg Infect Dis 15, 1347–1350, doi: 10.3201/eid1509.090442 (2009).
Dick, G. W., Kitchen, S. F. & Haddow, A. J. Zika virus. I. Isolations and serological specificity. Trans R Soc Trop Med Hyg 46, 509–520 (1952).
Musso, D., Nilles, E. J. & Cao-Lormeau, V. M. Rapid spread of emerging Zika virus in the Pacific area. Clin Microbiol Infect 20, O595–O596, doi: 10.1111/1469-0691.12707 (2014).
Oehler, E. et al. Zika virus infection complicated by Guillain-Barre syndrome–case report, French Polynesia, December 2013. Euro Surveill 19 (2014).
Tetro, J. A. Zika and microcephaly: causation, correlation, or coincidence? Microbes Infect 18, 167–168, doi: 10.1016/j.micinf.2015.12.010 (2016).
Schuler-Faccini, L. et al. Possible Association Between Zika Virus Infection and Microcephaly - Brazil, 2015. MMWR Morb Mortal Wkly Rep 65, 59–62, doi: 10.15585/mmwr.mm6503e2 (2016).
Musso, D. et al. Potential sexual transmission of Zika virus. Emerg Infect Dis 21, 359–361, doi: 10.3201/eid2102.141363 (2015).
Foy, B. D. et al. Probable non-vector-borne transmission of Zika virus, Colorado, USA. Emerg Infect Dis 17, 880–882, doi: 10.3201/eid1705.101939 (2011).
McCrae, A. W. & Kirya, B. G. Yellow fever and Zika virus epizootics and enzootics in Uganda. Trans R Soc Trop Med Hyg 76, 552–562 (1982).
Fagbami, A. H. Zika virus infections in Nigeria: virological and seroepidemiological investigations in Oyo State. J Hyg (Lond) 83, 213–219 (1979).
Robin, Y. & Mouchet, J. Serological and entomological study on yellow fever in Sierra Leone. Bull Soc Pathol Exot Filiales 68, 249–258 (1975).
Jan, C., Languillat, G., Renaudet, J. & Robin, Y. [A serological survey of arboviruses in Gabon]. Bull Soc Pathol Exot Filiales 71, 140–146 (1978).
Saluzzo, J. F., Gonzalez, J. P., Herve, J. P. & Georges, A. J. Serological survey for the prevalence of certain arboviruses in the human population of the south-east area of Central African Republic (author’s transl). Bull Soc Pathol Exot Filiales 74, 490–499 (1981).
Cao-Lormeau, V. M. et al. Zika virus, French polynesia, South pacific, 2013. Emerg Infect Dis 20, 1085–1086, doi: 10.3201/eid2006.140138 (2014).
Campos, G. S., Bandeira, A. C. & Sardi, S. I. Zika Virus Outbreak, Bahia, Brazil. Emerg Infect Dis 21, 1885–1886, doi: 10.3201/eid2110.150847 (2015).
Kilbourn, A. M. et al. Health evaluation of free-ranging and semi-captive orangutans (Pongo pygmaeus pygmaeus) in Sabah, Malaysia. J Wildl Dis 39, 73–83, doi: 10.7589/0090-3558-39.1.73 (2003).
Lanciotti, R. S. et al. Genetic and serologic properties of Zika virus associated with an epidemic, Yap State, Micronesia, 2007. Emerg Infect Dis 14, 1232–1239, doi: 10.3201/eid1408.080287 (2008).
Darwish, M. A., Hoogstraal, H., Roberts, T. J., Ahmed, I. P. & Omar, F. A sero-epidemiological survey for certain arboviruses (Togaviridae) in Pakistan. Trans R Soc Trop Med Hyg 77, 442–445 (1983).
Heang, V. et al. Zika virus infection, Cambodia, 2010. Emerg Infect Dis 18, 349–351, doi: 10.3201/eid1802.111224 (2012).
Kuno, G. & Chang, G. J. Full-length sequencing and genomic characterization of Bagaza, Kedougou, and Zika viruses. Arch Virol 152, 687–696, doi: 10.1007/s00705-006-0903-z (2007).
Centers for Disease Control and Prevention, http://www.cdc.gov/zika/symptoms/.
Chambers, T. J., Hahn, C. S., Galler, R. & Rice, C. M. Flavivirus genome organization, expression, and replication. Annu Rev Microbiol 44, 649–688, doi: 10.1146/annurev.mi.44.100190.003245 (1990).
Lindenbach, B. D. & Rice, C. M. Molecular biology of flaviviruses. Adv Virus Res 59, 23–61 (2003).
Faye, O. et al. One-step RT-PCR for detection of Zika virus. J Clin Virol 43, 96–101, doi: 10.1016/j.jcv.2008.05.005 (2008).
Dyer, O. Zika vaccine could be in production by year’s end, says maker. BMJ 352, i630, doi: 10.1136/bmj.i630 (2016).
Cohen, J. INFECTIOUS DISEASE. The race for a Zika vaccine is on. Science 351, 543–544, doi: 10.1126/science.351.6273.543 (2016).
Fauci, A. S. & Morens, D. M. Zika Virus in the Americas–Yet Another Arbovirus Threat. N Engl J Med 374, 601–604, doi: 10.1056/NEJMp1600297 (2016).
Moriel, D. G. et al. Genome-based vaccine development: a short cut for the future. Hum Vaccin 4, 184–188 (2008).
Sette, A. & Fikes, J. Epitope-based vaccines: an update on epitope identification, vaccine design and delivery. Curr Opin Immunol 15, 461–470 (2003).
Ben-Yedidia, T. & Arnon, R. Epitope-based vaccine against influenza. Expert Rev Vaccines 6, 939–948, doi: 10.1586/14760584.6.6.939 (2007).
Koshy, R. & Inchauspe, G. Evaluation of hepatitis C virus protein epitopes for vaccine development. Trends Biotechnol 14, 364–369, doi: 10.1016/0167-7799(96)10049-4 (1996).
Sintchenko, V. Infectious disease informatics (Springer, 2010).
Efficacy and safety of RTS,S/AS01 malaria vaccine with or without a booster dose in infants and children in Africa: final results of a phase 3, individually randomised, controlled trial. Lancet 386, 31–45, doi: 10.1016/s0140-6736(15)60721-8 (2015).
Oyarzun, P. & Kobe, B. Recombinant and epitope-based vaccines on the road to the market and implications for vaccine design and production. Hum Vaccin Immunother 0, doi: 10.1080/21645515.2015.1094595 (2015).
Davis, M. E. et al. Evidence of RNAi in humans from systemically administered siRNA via targeted nanoparticles. Nature 464, 1067–1070, doi: 10.1038/nature08956 (2010).
Thakur, N., Qureshi, A. & Kumar, M. VIRsiRNAdb: a curated database of experimentally validated viral siRNA/shRNA. Nucleic Acids Res 40, D230–D236, doi: 10.1093/nar/gkr1147 (2012).
Ozcan, G., Ozpolat, B., Coleman, R. L., Sood, A. K. & Lopez-Berestein, G. Preclinical and clinical development of siRNA-based therapeutics. Adv Drug Deliv Rev 87, 108–119, doi: 10.1016/j.addr.2015.01.007 (2015).
Burnett, J. C., Rossi, J. J. & Tiemann, K. Current progress of siRNA/shRNA therapeutics in clinical trials. Biotechnol J 6, 1130–1146, doi: 10.1002/biot.201100054 (2011).
Haasnoot, P. C., Cupac, D. & Berkhout, B. Inhibition of virus replication by RNA interference. J Biomed Sci 10, 607–616, doi: 73526 (2003).
Tiemann, K. & Rossi, J. J. RNAi-based therapeutics–current status, challenges and prospects. EMBO Molecular Medicine 1, 142–151, doi: 10.1002/emmm.200900023 (2009).
Ashfaq, U. A. et al. siRNAs: Potential therapeutic agents against Hepatitis C Virus. Virology Journal 8, 276–276, doi: 10.1186/1743-422x-8-276 (2011).
Geisbert, T. W. et al. Postexposure protection of non-human primates against a lethal Ebola virus challenge with RNA interference: a proof-of-concept study. Lancet 375, 1896–1905, doi: 10.1016/s0140-6736(10)60357-1 (2010).
Dhanda, S. K., Chaudhary, K., Gupta, S., Brahmachari, S. K. & Raghava, G. P. A web-based resource for designing therapeutics against Ebola Virus. Sci Rep 6, 24782, doi: 10.1038/srep24782 (2016).
Lares, M. R., Rossi, J. J. & Ouellet, D. L. RNAi and small interfering RNAs in human disease therapeutic applications. Trends Biotechnol 28, 570–579, doi: 10.1016/j.tibtech.2010.07.009 (2010).
Dar, S. A., Thakur, A., Qureshi, A. & Kumar, M. siRNAmod: A database of experimentally validated chemically modified siRNAs. Sci Rep 6, 20031, doi: 10.1038/srep20031 (2016).
Lodish, H. F., Zhou, B., Liu, G. & Chen, C. Z. Micromanagement of the immune system by microRNAs. Nat Rev Immunol 8, 120–130, doi: 10.1038/nri2252 (2008).
Shawan, M. M. A. K. et al. Design and Prediction of Potential RNAi (siRNA) Molecules for 3′UTR PTGS of Different Strains of Zika Virus: A Computational Approach. Nat. Sci 13, 37–50 (2015).
Shawan, M. M. A. et al. In Silico Modeling and Immunoinformatics Probing Disclose the Epitope Based PeptideVaccine Against Zika Virus Envelope Glycoprotein. Indian Journal of Pharmaceutical and Biological Research 2, 44 (2014).
Mali, P. et al. RNA-guided human genome engineering via Cas9. Science 339, 823–826, doi: 10.1126/science.1232033 (2013).
Price, A. A., Sampson, T. R., Ratner, H. K., Grakoui, A. & Weiss, D. S. Cas9-mediated targeting of viral RNA in eukaryotic cells. Proc Natl Acad Sci USA 112, 6164–6169, doi: 10.1073/pnas.1422340112 (2015).
Kaur, K., Tandon, H., Gupta, A. K. & Kumar, M. CrisprGE: a central hub of CRISPR/Cas-based genome editing. Database (Oxford) 2015, bav055, doi: 10.1093/database/bav055 (2015).
Dudley, J. T., Deshpande, T. & Butte, A. J. Exploiting drug-disease relationships for computational drug repositioning. Brief Bioinform 12, 303–311, doi: 10.1093/bib/bbr013 (2011).
Ashburn, T. T. & Thor, K. B. Drug repositioning: identifying and developing new uses for existing drugs. Nat Rev Drug Discov 3, 673–683, doi: 10.1038/nrd1468 (2004).
Khan, M. S., Gupta, A. K. & Kumar, M. ViralEpi v1.0: a high-throughput spectrum of viral epigenomic methylation profiles from diverse diseases. Epigenomics 8, 67–75, doi: 10.2217/epi.15.95 (2016).
Kumar Gupta, A. & Kumar, M. HPVbase–a knowledgebase of viral integrations, methylation patterns and microRNAs aberrant expression: As potential biomarkers for Human papillomaviruses mediated carcinomas. Sci Rep 5, 12522, doi: 10.1038/srep12522 (2015).
Kostyuchenko, V. A. et al. Structure of the thermally stable Zika virus. Nature 533, 425–428, doi: 10.1038/nature17994 (2016).
Sirohi, D. et al. The 3.8 A resolution cryo-EM structure of Zika virus. Science 352, 467–470, doi: 10.1126/science.aaf5316 (2016).
Song, H., Qi, J., Haywood, J., Shi, Y. & Gao, G. F. Zika virus NS1 structure reveals diversity of electrostatic surfaces among flaviviruses. Nat Struct Mol Biol 23, 456–458, doi: 10.1038/nsmb.3213 (2016).
Dai, L. et al. Structures of the Zika Virus Envelope Protein and Its Complex with a Flavivirus Broadly Protective Antibody. Cell Host Microbe 19, 696–704, doi: 10.1016/j.chom.2016.04.013 (2016).
Tian, H. et al. The crystal structure of Zika virus helicase: basis for antiviral drug design. Protein Cell 7, 450–454, doi: 10.1007/s13238-016-0275-4 (2016).
Roy, A., Kucukural, A. & Zhang, Y. I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5, 725–738, doi: 10.1038/nprot.2010.5 (2010).
Ye, Q. et al. Genomic characterization and phylogenetic analysis of Zika virus circulating in the Americas. Infect Genet Evol 43, 43–49, doi: 10.1016/j.meegid.2016.05.004 (2016).
Shen, S. et al. Phylogenetic analysis revealed the central roles of two African countries in the evolution and worldwide spread of Zika virus. Virol Sin 31, 118–130, doi: 10.1007/s12250-016-3774-9 (2016).
Lednicky, J. et al. Zika Virus Outbreak in Haiti in 2014: Molecular and Clinical Data. PLoS Negl Trop Dis 10, e0004687, doi: 10.1371/journal.pntd.0004687 (2016).
Lanciotti, R. S., Lambert, A. J., Holodniy, M., Saavedra, S. & Signor Ldel, C. Phylogeny of Zika Virus in Western Hemisphere, 2015. Emerg Infect Dis 22, 933–935, doi: 10.3201/eid2205.160065 (2016).
Haddow, A. D. et al. Genetic characterization of Zika virus strains: geographic expansion of the Asian lineage. PLoS Negl Trop Dis 6, e1477, doi: 10.1371/journal.pntd.0001477 (2012).
Vigerust, D. J. & Shepherd, V. L. Virus glycosylation: role in virulence and immune interactions. Trends Microbiol 15, 211–218, doi: 10.1016/j.tim.2007.03.003 (2007).
Chambers, T. J., Halevy, M., Nestorowicz, A., Rice, C. M. & Lustig, S. West Nile virus envelope proteins: nucleotide sequence analysis of strains differing in mouse neuroinvasiveness. J Gen Virol 79 (Pt 10), 2375–2380, doi: 10.1099/0022-1317-79-10-2375 (1998).
Faye, O. et al. Molecular evolution of Zika virus during its emergence in the 20(th) century. PLoS Negl Trop Dis 8, e2636, doi: 10.1371/journal.pntd.0002636 (2014).
Van den Steen, P., Rudd, P. M., Dwek, R. A. & Opdenakker, G. Concepts and principles of O-linked glycosylation. Crit Rev Biochem Mol Biol 33, 151–208, doi: 10.1080/10409239891204198 (1998).
Goto, Y. et al. C-mannosylation of human hyaluronidase 1: possible roles for secretion and enzymatic activity. Int J Oncol 45, 344–350, doi: 10.3892/ijo.2014.2438 (2014).
Stowell, S. R., Ju, T. & Cummings, R. D. Protein glycosylation in cancer. Annu Rev Pathol 10, 473–510, doi: 10.1146/annurev-pathol-012414-040438 (2015).
Li, X., Wang, X., Tan, Z., Chen, S. & Guan, F. Role of Glycans in Cancer Cells Undergoing Epithelial-Mesenchymal Transition. Front Oncol 6, 33, doi: 10.3389/fonc.2016.00033 (2016).
Faye, O. et al. Quantitative real-time PCR detection of Zika virus and evaluation with field-caught mosquitoes. Virol J 10, 311, doi: 10.1186/1743-422x-10-311 (2013).
Qureshi, A., Thakur, N. & Kumar, M. VIRsiRNApred: a web server for predicting inhibition efficacy of siRNAs targeting human viruses. J Transl Med 11, 305, doi: 10.1186/1479-5876-11-305 (2013).
Crotty, S., Cameron, C. & Andino, R. Ribavirin’s antiviral mechanism of action: lethal mutagenesis? J Mol Med (Berl) 80, 86–95, doi: 10.1007/s00109-001-0308-0 (2002).
Balzarini, J., De Clercq, E., Serafinowski, P., Dorland, E. & Harrap, K. R. Synthesis and antiviral activity of some new S-adenosyl-L-homocysteine derivatives. J Med Chem 35, 4576–4583 (1992).
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 17, 540–552 (2000).
Krumsiek, J., Arnold, R. & Rattei, T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23, 1026–1028, doi: 10.1093/bioinformatics/btm039 (2007).
Rajput, A., Gupta, A. K. & Kumar, M. Prediction and analysis of quorum sensing peptides based on sequence features. PLoS One 10, e0120066, doi: 10.1371/journal.pone.0120066 (2015).
Thakur, N., Qureshi, A. & Kumar, M. AVPpred: collection and prediction of highly effective antiviral peptides. Nucleic Acids Res 40, W199–W204, doi: 10.1093/nar/gks450 (2012).
Webb, B. & Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr Protoc Bioinformatics 47, 5.6.1–5.6.32, doi: 10.1002/0471250953.bi0506s47 (2014).
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152, doi: 10.1093/bioinformatics/bts565 (2012).
Yang, J. et al. The I-TASSER Suite: protein structure and function prediction. Nat Methods 12, 7–8, doi: 10.1038/nmeth.3213 (2015).
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol 30, 2725–2729, doi: 10.1093/molbev/mst197 (2013).
Chenna, R. et al. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res 31, 3497–3500 (2003).
Behura, S. K. & Severson, D. W. Comparative analysis of codon usage bias and codon context patterns between dipteran and hymenopteran sequenced genomes. PLoS One 7, e43111, doi: 10.1371/journal.pone.0043111 (2012).
Moura, G. et al. Comparative context analysis of codon pairs on an ORFeome scale. Genome Biol 6, R28, doi: 10.1186/gb-2005-6-3-r28 (2005).
Julenius, K. NetCGlyc 1.0: prediction of mammalian C-mannosylation sites. Glycobiology 17, 868–876, doi: 10.1093/glycob/cwm050 (2007).
Julenius, K., Molgaard, A., Gupta, R. & Brunak, S. Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. Glycobiology 15, 153–164, doi: 10.1093/glycob/cwh151 (2005).
Gupta, R. & Brunak, S. Prediction of glycosylation across the human proteome and the correlation to protein function. Pac Symp Biocomput 310–322 (2002).
Blom, N., Sicheritz-Ponten, T., Gupta, R., Gammeltoft, S. & Brunak, S. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4, 1633–1649, doi: 10.1002/pmic.200300771 (2004).
Yoon, H. & Leitner, T. PrimerDesign-M: a multiple-alignment based multiple-primer design tool for walking across variable genomes. Bioinformatics 31, 1472–1474, doi: 10.1093/bioinformatics/btu832 (2015).
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65, doi: 10.1038/nature11632 (2012).
Singh, H., Ansari, H. R. & Raghava, G. P. Improved method for linear B-cell epitope prediction using antigen’s primary sequence. PLoS One 8, e62216, doi: 10.1371/journal.pone.0062216 (2013).
Ansari, H. R. & Raghava, G. P. Identification of conformational B-cell Epitopes in an antigen from its primary sequence. Immunome Res 6, 6, doi: 10.1186/1745-7580-6-6 (2010).
Singh, H. & Raghava, G. P. ProPred1: prediction of promiscuous MHC Class-I binding sites. Bioinformatics 19, 1009–1014 (2003).
Bhasin, M. & Raghava, G. P. Prediction of CTL epitopes using QM, SVM and ANN techniques. Vaccine 22, 3195–3204, doi: 10.1016/j.vaccine.2004.02.005 (2004).
Singh, H. & Raghava, G. P. ProPred: prediction of HLA-DR binding sites. Bioinformatics 17, 1236–1237 (2001).
Kim, Y. et al. Immune epitope database analysis resource. Nucleic Acids Res 40, W525–W530, doi: 10.1093/nar/gks438 (2012).
Dhanda, S. K., Vir, P. & Raghava, G. P. Designing of interferon-gamma inducing MHC class-II binders. Biol Direct 8, 30, doi: 10.1186/1745-6150-8-30 (2013).
Dhanda, S. K., Gupta, S., Vir, P. & Raghava, G. P. Prediction of IL4 inducing peptides. Clin Dev Immunol 2013, 263952, doi: 10.1155/2013/263952 (2013).
Ahmed, F. & Raghava, G. P. Designing of highly effective complementary and mismatch siRNAs for silencing a gene. PLoS One 6, e23443, doi: 10.1371/journal.pone.0023443 (2011).
Chaudhary, K., Nagpal, G., Dhanda, S. K. & Raghava, G. P. Prediction of Immunomodulatory potential of an RNA sequence for designing non-toxic siRNAs and RNA-based vaccine adjuvants. Sci Rep 6, 20678, doi: 10.1038/srep20678 (2016).
Sullivan, C. S. & Grundhoff, A. Identification of viral microRNAs. Methods Enzymol 427, 3–23, doi: 10.1016/s0076-6879(07)27001-6 (2007).
Gkirtzou, K., Tsamardinos, I., Tsakalides, P. & Poirazi, P. MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors. PLoS One 5, e11843, doi: 10.1371/journal.pone.0011843 (2010).
Qureshi, A., Thakur, N., Monga, I., Thakur, A. & Kumar, M. VIRmiRNA: a comprehensive resource for experimentally validated viral miRNAs and their targets. Database (Oxford) 2014, doi: 10.1093/database/bau103 (2014).
Hofacker, I. L. & Stadler, P. F. Memory efficient folding algorithms for circular RNA secondary structures. Bioinformatics 22, 1172–1176, doi: 10.1093/bioinformatics/btl023 (2006).
Law, V. et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res 42, D1091–D1097, doi: 10.1093/nar/gkt1068 (2014).
Rajput, A., Kaur, K. & Kumar, M. SigMol: repertoire of quorum sensing signaling molecules in prokaryotes. Nucleic Acids Res 44, D634–D639, doi: 10.1093/nar/gkv1076 (2016).
Qureshi, A., Thakur, N., Tandon, H. & Kumar, M. AVPdb: a database of experimentally validated antiviral peptides targeting medically important viruses. Nucleic Acids Res 42, D1147–D1153, doi: 10.1093/nar/gkt1191 (2014).

Acknowledgements

Financial support for this work was provided by the Council of Scientific and Industrial Research, India (BSC0121) and the Department of Biotechnology, Government of India (GAP0001). Open access charges were covered by the CSIR-Institute of Microbial Technology, Chandigarh, India.

Author information

Amit Kumar Gupta, Karambir Kaur, Akanksha Rajput, Sandeep Kumar Dhanda, Manika Sehgal and Md. Shoaib Khan contributed equally to this work.

Authors and Affiliations

Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, 160036, India
Amit Kumar Gupta, Karambir Kaur, Akanksha Rajput, Sandeep Kumar Dhanda, Manika Sehgal, Md. Shoaib Khan, Isha Monga, Showkat Ahmad Dar, Sandeep Singh, Gandharva Nagpal, Salman Sadullah Usmani, Anamika Thakur, Gazaldeep Kaur, Shivangi Sharma, Aman Bhardwaj, Abid Qureshi, Gajendra Pal Singh Raghava & Manoj Kumar

Contributions

M.K. conceptualized and designed the study; A.G., A.T., K.K., S.K.D. and S.S.U. performed data collection and curation; A.G., A.Q. and S.S.U. designed the web interface; K.K. and A.G. executed codon analysis; S.D. and G.N. carried out siRNA design; A.R. implemented phylogenetics; S.K.D., M.S.K., G.N. and A.B. were involved in vaccine epitope analysis; S.K.D. performed drug repositioning; S. Singh implemented protein structure prediction; A.T. contributed to the design of the genome plot; I.M. executed miRNA analysis; G.K. and S.D. designed and analyzed diagnostic primers; S. Sharma performed glycosylation site prediction; A.G., S.K.D., G.P.S.R. and M.K. performed data analysis and interpretation; A.G., M.S. and K.K. wrote the manuscript; G.P.S.R. and M.K. were involved in proofreading; G.P.S.R. and M.K. supervised the project.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Cite this article

Gupta, A., Kaur, K., Rajput, A. et al. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis. Sci Rep 6, 32713 (2016). https://doi.org/10.1038/srep32713

Received: 25 April 2016
Accepted: 11 August 2016
Published: 16 September 2016
DOI: https://doi.org/10.1038/srep32713
