Perspective

What would you do if you could sequence everything?

Published online:

Abstract

It could be argued that the greatest transformative aspect of the Human Genome Project has been not the sequencing of the genome itself, but the resultant development of new technologies. A host of new approaches has fundamentally changed the way we approach problems in basic and translational research. Now, a new generation of high-throughput sequencing technologies promises to again transform the scientific enterprise, potentially supplanting array-based technologies and opening up many new possibilities. By allowing DNA/RNA to be assayed more rapidly than previously possible, these next-generation platforms promise a deeper understanding of genome regulation and biology. Significantly enhancing sequencing throughput will allow us to follow the evolution of viral and bacterial resistance in real time, to uncover the huge diversity of novel genes that are currently inaccessible, to understand nucleic acid therapeutics, to better integrate biological information for a complete picture of health and disease at a personalized level and to move to advances that we cannot yet imagine.

  • Subscribe to Nature Biotechnology for full access:

    $250

    Subscribe

Additional access options:

Already a subscriber?  Log in  now or  Register  for online access.

References

  1. 1.

    Whole-genome re-sequencing. Curr. Opin. Genet. Dev. 16, 545–552 (2006).

  2. 2.

    Genomes for all. Sci. Am. 294, 46–54 (2006).

  3. 3.

    & Physical approaches to DNA sequencing and detection. Rev. Mod. Phys. 80, 141–165 (2008).

  4. 4.

    The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 (2008).

  5. 5.

    et al. Single-Molecule DNA Sequencing of a Viral Genome. Science 320, 106–109 (2008).

  6. 6.

    International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).

  7. 7.

    Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature 450, 203–218 (2007).

  8. 8.

    et al. Whole-genome sequencing and variant discovery in C. elegans. Nat. Methods 5, 183–188 (2008).

  9. 9.

    et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–876 (2008).

  10. 10.

    , , , & New developments in ancient genomics. Trends Ecol. Evol. 23, 386–393 (2008).

  11. 11.

    & Genetic basis of fitness differences in natural populations. Nature 452, 169–175 (2008).

  12. 12.

    The road to genome-wide association studies. Nat. Rev. Genet. 9, 314–318 (2008).

  13. 13.

    et al. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 409, 928–933 (2001).

  14. 14.

    International HapMap Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005).

  15. 15.

    International HapMap Consortium. A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851–861 (2007).

  16. 16.

    Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).

  17. 17.

    et al. Susceptibility to coronary artery disease and diabetes is encoded by distinct, tightly linked SNPs in the ANRIL locus on chromosome 9p. Hum. Mol. Genet. 17, 806–814 (2008).

  18. 18.

    , & Singleton SNPs in the human genome and implications for genome-wide association studies. Eur. J. Hum. Genet. 16, 506–515 (2008).

  19. 19.

    et al. Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Science 305, 869–872 (2004).

  20. 20.

    , & Schizophrenia: a common disease caused by multiple rare alleles. Br. J. Psychol. 190, 194–199 (2007).

  21. 21.

    & The new cytogenetics: blurring the boundaries with molecular biology. Nat. Rev. Genet. 6, 782–792 (2005).

  22. 22.

    et al. Detection of large-scale variation in the human genome. Nat. Genet. 36, 949–951 (2004).

  23. 23.

    et al. Copy number variation: new insights in genome diversity. Genome Res. 16, 949–961 (2006).

  24. 24.

    et al. Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays. Genome Res. 16, 1575–1584 (2006).

  25. 25.

    et al. Global variation in copy number in the human genome. Nature 444, 444–454 (2006).

  26. 26.

    et al. The diploid genome sequence of an individual human. PLoS Biol. 5, e254 (2007).

  27. 27.

    , , , & Structural variants: changing the landscape of chromosomes and design of disease studies. Hum. Mol. Genet. 15 Spec No 1, R57–R66 (2006).

  28. 28.

    & Copy-number variation and association studies of human disease. Nat. Genet. 39, S37–S42 (2008).

  29. 29.

    et al. The consensus coding sequences of human breast and colorectal cancers. Science 314, 268–274 (2006).

  30. 30.

    et al. The genomic landscapes of human breast and colorectal cancers. Science 318, 1108–1113 (2007).

  31. 31.

    et al. Functional Classification Analysis of Somatically Mutated Genes in Human Breast and Colorectal Cancers. Genomics 91, 508–511 (2008).

  32. 32.

    et al. Distinct classes of chromosomal rearrangements create oncogenic ETS gene fusions in prostate cancer. Nature 448, 595–599 (2007).

  33. 33.

    et al. Quantitative monitoring by polymerase colony assay of known mutations resistant to ABL kinase inhibitors. Oncogene 27, 775–782 (2008).

  34. 34.

    et al. DNA bar coding and pyrosequencing to identify rare HIV drug resistance mutations. Nucleic Acids Res. 35, e91 (2007).

  35. 35.

    , , , & Characterization of mutation spectra with ultra-deep pyrosequencing: application to HIV-1 drug resistance. Genome Res. 17, 1195–1201 (2007).

  36. 36.

    & XDR-TB genome sequencing: a glimpse of the microbiology of the future. Future Microbiol. 3, 111–113 (2008).

  37. 37.

    et al. High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nat. Genet. 40, 987–993 (2008).

  38. 38.

    , , & Clonal integration of a polyomavirus in human Merkel cell carcinoma. Science 319, 1096–1100 (2007).

  39. 39.

    et al. A new arenavirus in a cluster of fatal transplant-associated diseases. N. Engl. J. Med. 358, 991–998 (2008).

  40. 40.

    et al. A metagenomic survey of microbes in honey bee colony collapse disorder. Science 318, 283–287 (2007).

  41. 41.

    et al. Symbiosis insights through metagenomic analysis of a microbial consortium. Nature 443, 950–955 (2006).

  42. 42.

    et al. Dissecting biological “dark matter” with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth. Proc. Natl. Acad. Sci. USA 104, 11889–11894 (2007).

  43. 43.

    et al. Diverse syntrophic partnerships from deep-sea methane vents revealed by direct cell capture and metagenomics. Proc. Natl. Acad. Sci. USA 105, 7052–7057 (2008).

  44. 44.

    et al. Functional metagenomic profiling of nine biomes. Nature 452, 629–632 (2008).

  45. 45.

    et al. 'Systems toxicology' approach identifies coordinated metabolic responses to copper in a terrestrial non-model invertebrate, the earthworm Lumbricus rubellus. BMC Biol. 6, 25 (2008).

  46. 46.

    & Genomics technology for assessing soil pollution. J. Biol. 7, 19 (2008).

  47. 47.

    et al. The human microbiome project. Nature 449, 804–810 (2007).

  48. 48.

    et al. High-resolution mapping of DNA hypermethylation and hypomethylation in lung cancer. Proc. Natl. Acad. Sci. USA 105, 252–257 (2007).

  49. 49.

    et al. Integrated epigenomic analyses of neuronal MeCP2 reveal a role for long-range interaction with active genes. Proc. Natl. Acad. Sci. USA 104, 19416–19421 (2007).

  50. 50.

    & The methylome: approaches for global DNA methylation profiling. Trends Genet. 24, 231–237 (2008).

  51. 51.

    , , , & Methylation-specific oligonucleotide microarray: a new potential for high-throughput methylation analysis. Genome Res. 12, 158–164 (2002).

  52. 52.

    et al. Distinct epigenetic changes in the stromal cells of breast cancers. Nat. Genet. 37, 899–905 (2005).

  53. 53.

    et al. Epigenetic inactivation of the premature aging Werner syndrome gene in human cancer. Proc. Natl. Acad. Sci. USA 103, 8822–8827 (2006).

  54. 54.

    , & Quantitative assessment of DNA methylation: potential applications for disease diagnosis, classification, and prognosis in clinical settings. J. Mol. Med. 84, 365–377 (2006).

  55. 55.

    et al. Epigenetic differences arise during the lifetime of monozygotic twins. Proc. Natl. Acad. Sci. USA 102, 10604–10609 (2005).

  56. 56.

    & A blueprint for a Human Epigenome Project: the AACR Human Epigenome Workshop. Cancer Res. 65, 11241–11246 (2005).

  57. 57.

    Momentum building for human epigenome project. J. Natl. Cancer Inst. 98, 84–86 (2006).

  58. 58.

    The necessity of a human epigenome project. Carcinogenesis 27, 1121–1125 (2006).

  59. 59.

    et al. DNA methylation profiling of human chromosomes 6, 20 and 22. Nat. Genet. 38, 1378–1385 (2006).

  60. 60.

    et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in Arabidopsis. Cell 126, 1189–1201 (2006).

  61. 61.

    et al. Nucleosome organization in the Drosophila genome. Nature 453, 358–362 (2008).

  62. 62.

    et al. A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. Genome Res. 18, 1051–1063 (2008).

  63. 63.

    & Genome-wide analysis of protein-DNA interactions. Annu. Rev. Genomics Hum. Genet. 7, 81–102 (2006).

  64. 64.

    & ChIPping away at gene regulation. EMBO Rep. 9, 337–343 (2008).

  65. 65.

    & Chromatin state maps: new technologies, new insights. Curr. Opin. Genet. Dev. 18, 109–115 (2008).

  66. 66.

    et al. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods 4, 651–657 (2007).

  67. 67.

    et al. Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies. Genome Res. 17, 898–909 (2007).

  68. 68.

    , , & Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502 (2007).

  69. 69.

    et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).

  70. 70.

    , , & Genome-wide prediction of conserved and nonconserved enhancers by histone acetylation patterns. Genome Res. 17, 74–81 (2007).

  71. 71.

    et al. Monoubiquitinated H2B is associated with the transcribed region of highly expressed genes in human cells. Nat. Cell Biol. 10, 483–488 (2008).

  72. 72.

    et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007).

  73. 73.

    et al. Genome-scale DNA methylation maps of pluirpotent and differentiated cells. Nature 454, 766–770 (2008).

  74. 74.

    et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 122, 947–956 (2005).

  75. 75.

    et al. Control of developmental regulators by Polycomb in human embryonic stem cells. Cell 125, 301–313 (2006).

  76. 76.

    et al. Systems biology approaches identify ATF3 as a negative regulator of Toll-like receptor 4. Nature 441, 173–178 (2006).

  77. 77.

    et al. Three microarray platforms: an analysis of their concordance in profiling gene expression. BMC Genomics 6, 63 (2005).

  78. 78.

    Encode Project Consortium. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007).

  79. 79.

    et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science 296, 916–919 (2002).

  80. 80.

    et al. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science 308, 1149–1154 (2005).

  81. 81.

    et al. Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays. Genome Res. 15, 987–997 (2005).

  82. 82.

    et al. Control of leaf morphogenesis by microRNAs. Nature 425, 257–263 (2003).

  83. 83.

    et al. MicroRNA expression profiles classify human cancers. Nature 435, 834–838 (2005).

  84. 84.

    et al. The let-7 MicroRNA family members mir-48, mir-84, and mir-241 function together to regulate developmental timing in Caenorhabditis elegans. Dev. Cell 9, 403–414 (2005).

  85. 85.

    & MicroRNA signatures in human cancers. Nat. Rev. Cancer 6, 857–866 (2006).

  86. 86.

    & MicroRNA-cancer connection: the beginning of a new tale. Cancer Res. 66, 7390–7394 (2006).

  87. 87.

    et al. The microRNAs miR-373 and miR-520c promote tumour invasion and metastasis. Nat. Cell Biol. 10, 202–210 (2008).

  88. 88.

    et al. Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res. 18, 610–621 (2008).

  89. 89.

    et al. Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA. Nature 451, 202–206 (2008).

  90. 90.

    et al. Genome-wide atlas of gene expression in the adult mouse brain. Nature 445, 168–176 (2007).

  91. 91.

    et al. Gene expression profiles in anatomically and functionally distinct regions of the normal aged human brain. Physiol. Genomics 28, 311–322 (2006).

  92. 92.

    et al. A survey of genetic human cortical gene expression. Nat. Genet. 39, 1494–1499 (2007).

  93. 93.

    , , & A microarray configuration to quantify expression levels and relative abundance of splice variants. Nucleic Acids Res. 33, e47 (2005).

  94. 94.

    et al. Genome-wide analysis of transcript isoform variation in humans. Nat. Genet. 40, 225–231 (2008).

  95. 95.

    Alternative splicing: new insights from global analyses. Cell 126, 37–47 (2006).

  96. 96.

    , , & Serial analysis of gene expression. Science 270, 484–487 (1995).

  97. 97.

    et al. SAGE is far more sensitive than EST for detecting low-abundance transcripts. BMC Genomics 5, 1–4 (2004).

  98. 98.

    et al. LongSAGE profiling of nine human embryonic stem cell lines. Genome Biol. 8, R113 (2007).

  99. 99.

    , , , & Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628 (2008).

  100. 100.

    et al. Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat. Methods 5, 613–619 (2008).

  101. 101.

    et al. Transcriptome analysis for Caenorhabditis elegans based on novel expressed sequence tags. BMC Biol. 6, 30 (2008).

  102. 102.

    et al. Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. Biotechniques 45, 81–94 (2008).

  103. 103.

    et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320, 1344–1349 (2008).

  104. 104.

    et al. Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature 453, 1239–1243 (2008).

  105. 105.

    & Genetics of global gene expression. Nat. Rev. Genet. 7, 862–872 (2006).

  106. 106.

    et al. Genetics of gene expression surveyed in maize, mouse and man. Nature 422, 297–302 (2003).

  107. 107.

    et al. Variations in DNA elucidate molecular networks that cause disease. Nature 452, 429–435 (2008).

  108. 108.

    et al. A regulatory SNP causes a human genetic disease by creating a new transcriptional promoter. Science 312, 1215–1217 (2006).

  109. 109.

    et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008).

  110. 110.

    , , & Optimization of probe length and the number of probes per gene for optimal microarray analysis of gene expression. Nucleic Acids Res. 32, e99 (2004).

  111. 111.

    The beginning of the end for microarrays? Nat. Genet. 5, 585–586 (2008).

  112. 112.

    & Bioinformatics challenges of new sequencing technology. Trends Genet. 24, 142–149 (2008).

Download references

Acknowledgements

We thank Jennifer MacArthur (Helicos BioSciences) for assistance and critical reading of the manuscript and Stephen Quake (Stanford University) for critical reading and comments.

Author information

Affiliations

  1. Helicos BioSciences, One Kendall Square, Cambridge, Massachusetts 02139, USA.

    • Avak Kahvejian
    •  & John F Thompson
  2. Dana Farber Cancer Institute, 44 Binney Street, Boston, Massachusetts 02115, USA.

    • John Quackenbush

Authors

  1. Search for Avak Kahvejian in:

  2. Search for John Quackenbush in:

  3. Search for John F Thompson in:

Competing interests

A.K. and J.F.T. are employees and J.Q. is on the scientific advisory board of Helicos BioSciences Corp.

Corresponding author

Correspondence to John Quackenbush.