Review Article

Systems virology: host-directed approaches to viral pathogenesis and drug targeting

Nature Reviews Microbiology volume 11, pages 455–466 (2013)


High-throughput molecular profiling and computational biology are changing the face of virology, providing a new appreciation of the importance of the host in viral pathogenesis and offering unprecedented opportunities for better diagnostics, therapeutics and vaccines. Here, we provide a snapshot of the evolution of systems virology, from global gene expression profiling and signatures of disease outcome, to geometry-based computational methods that promise to yield novel therapeutic targets, personalized medicine and a deeper understanding of how viruses cause disease. To realize these goals, pipettes and Petri dishes need to join forces with the powers of mathematics and computational biology.

Key points

  • Systems biology approaches are required to advance our understanding of virus–host interactions, how these interactions cause disease and, ultimately, how to improve diagnostics, therapeutics and vaccines.

  • Over the past decade, the field of systems virology has evolved from using first-generation microarrays to the integration of multidimensional data sets. This has resulted in significant findings, including the identification of gene expression signatures that are predictive of viral pathogenesis and vaccine efficacy, insights into how viruses disrupt cellular metabolism, and the mapping of virus–host interactomes.

  • To fulfil its initial promise of revolutionizing our understanding of virus–host interactions, the field of systems virology must move beyond just the listing of molecules that are differentially expressed following viral infection; it must now look to define the relationships between key host molecules and their interactions with viral components.

  • Several key computational challenges must be addressed in order to move into this new phase of systems virology, including consideration of nonlinear relationships such as the dynamics of the system, the integration of multidimensional data sets and the identification of causal relationships.

  • Virologists, computer scientists and mathematicians must combine their skills and expertise in applying systems approaches to untangle the complex question of how viruses kill.





The authors thank L. Josset for generating the networks in figure 3, and M. Heise and M. Ferris for providing the data used in box 3. Research in the author's laboratory is supported by Public Health Service grants R2400011172, R2400011157, P30DA015625, P51RR00166 and U54AI081680, and by federal funds from the US National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under contract HHSN272200800060C.

Author information


  1. Department of Microbiology and Washington National Primate Research Center, University of Washington, Box 358070, Seattle, Washington 98195, USA.

    • G. Lynn Law
    • Marcus J. Korth
    • Arndt G. Benecke
    • Michael G. Katze
  2. Université Pierre et Marie Curie, Centre National de la Recherche Scientifique UMR7224, 7–9 Quai Saint Bernard, Bat. A, B 75005 Paris, France.

    • Arndt G. Benecke



Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Michael G. Katze.


Complement system

Blood proteins that react with one another in a cascade to aid the ability of phagocytic cells to eliminate microorganisms. Complement proteins also have a role in the development of inflammation.

Small nucleolar RNAs

(snoRNAs). RNAs that guide the modification (for example, methylation or pseudouridylation) of other RNAs, particularly ribosomal RNAs.

PIWI-interacting RNAs

(piRNAs). Small RNAs that are thought to be involved in gene silencing through the formation of ribonucleoprotein complexes with PIWI proteins.


RIP-seq

Immunoprecipitation of RNA-binding proteins followed by high-throughput sequencing of the bound RNA.


CLIP-seq

(Crosslinking immunoprecipitation followed by high-throughput sequencing). A screening method used to identify RNA sequences that interact with either RNA-binding proteins or other RNAs.

Metabolic flux profiling

A measurement approach that uses liquid chromatography–tandem mass spectrometry to quantify the rate of conversion of biochemical molecules in a metabolic network after perturbing the system. Systems-level metabolic flux profiling is a high-throughput approach to quantifying changes in metabolic activity.

Short hairpin RNA

(shRNA). A type of RNA that forms a tight hairpin which has the ability to silence gene expression through RNAi.

Unfolded-protein response

A cellular stress response to the accumulation of unfolded proteins in the ER. The response is characterized by a signal transduction pathway that aims to restore homeostasis by limiting protein biosynthesis and increasing the abundance of molecular chaperones involved in protein folding.

Expression quantitative trait loci

(eQTLs). Genomic loci, as identified by gene expression profiling, that regulate mRNA expression. eQTLs are mapped by computationally connecting DNA sequence variation with variation in gene expression, providing information on how host genetics affects the function of molecular networks.
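As a rough illustration of what "computationally connecting DNA sequence variation with variation in gene expression" can mean, the sketch below regresses the expression of one transcript on allele dosage at two loci and reports which locus explains more of the variance. The locus names, genotypes and expression values are invented toy data, and simple least-squares regression is an assumption made for brevity; practical eQTL mapping typically adds covariates, models of population structure and multiple-testing correction.

```python
def linear_fit(dosage, expression):
    """Least-squares fit of expression on allele dosage.

    Returns (slope, r_squared) for a single locus.
    """
    n = len(dosage)
    mean_x = sum(dosage) / n
    mean_y = sum(expression) / n
    sxx = sum((x - mean_x) ** 2 for x in dosage)
    syy = sum((y - mean_y) ** 2 for y in expression)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dosage, expression))
    slope = sxy / sxx
    r_squared = (sxy * sxy) / (sxx * syy)
    return slope, r_squared


# Hypothetical toy data: allele dosage (0, 1 or 2 copies) at two loci
# for six animals, and the expression level of one transcript.
genotypes = {
    "locus_1": [0, 0, 1, 1, 2, 2],
    "locus_2": [0, 1, 2, 0, 1, 2],
}
expression = [1.0, 1.2, 2.1, 1.9, 3.0, 3.2]

# Fit each locus separately and keep the one explaining the most variance.
results = {name: linear_fit(dosage, expression) for name, dosage in genotypes.items()}
best_locus = max(results, key=lambda name: results[name][1])

for name, (slope, r_squared) in results.items():
    print(f"{name}: slope={slope:.2f}, R^2={r_squared:.2f}")
print("Candidate eQTL:", best_locus)
```

In this toy example the first locus tracks expression closely and the second only weakly, which is the kind of contrast an eQTL scan looks for across many loci and transcripts.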

Structural equation modelling

A multivariate analysis technique for testing and estimating causal relationships among variables.

Betweenness centrality

A measure of the location of a gene in a network. Genes with high betweenness centrality, referred to as bottleneck genes, are located between and therefore connect different portions of the network (that is, different subnetworks).
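To make the idea of a bottleneck gene concrete, the following sketch computes betweenness centrality on a small, invented network in which two tightly connected modules are joined by a single gene. The node labels and the network itself are hypothetical, and the implementation (Brandes' algorithm for an unweighted, undirected graph) is one common way to compute this measure, not necessarily the method used in the studies discussed here.

```python
from collections import deque


def betweenness_centrality(adjacency):
    """Unnormalized betweenness centrality (Brandes' algorithm).

    `adjacency` maps each node to a list of its neighbours in an
    unweighted, undirected graph.
    """
    scores = {node: 0.0 for node in adjacency}
    for source in adjacency:
        # Breadth-first search from the source, counting shortest paths.
        order = []
        predecessors = {node: [] for node in adjacency}
        n_paths = {node: 0 for node in adjacency}
        n_paths[source] = 1
        distance = {source: 0}
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adjacency[v]:
                if w not in distance:
                    distance[w] = distance[v] + 1
                    queue.append(w)
                if distance[w] == distance[v] + 1:
                    n_paths[w] += n_paths[v]
                    predecessors[w].append(v)
        # Walk back from the farthest nodes, accumulating path dependencies.
        dependency = {node: 0.0 for node in adjacency}
        for w in reversed(order):
            for v in predecessors[w]:
                dependency[v] += n_paths[v] / n_paths[w] * (1.0 + dependency[w])
            if w != source:
                scores[w] += dependency[w]
    # Each unordered pair was visited from both ends in an undirected graph.
    return {node: score / 2.0 for node, score in scores.items()}


# Hypothetical network: module {A, B, C} and module {E, F, G} joined via D.
network = {
    "A": ["B", "C"],
    "B": ["A", "C"],
    "C": ["A", "B", "D"],
    "D": ["C", "E"],
    "E": ["D", "F", "G"],
    "F": ["E", "G"],
    "G": ["E", "F"],
}

scores = betweenness_centrality(network)
bottleneck = max(scores, key=scores.get)
print(scores)
print("Bottleneck gene:", bottleneck)
```

Here the connecting node D lies on every shortest path between the two modules, so it receives the highest score even though it has fewer direct connections than its neighbours C and E.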


Epistasis

The phenomenon in which the effects of one gene are modified by one or more other genes.

Network topology

The arrangement and connections of the various components of a network.
