Innovation | Published:

RNA-Seq: a revolutionary tool for transcriptomics

Nature Reviews Genetics volume 10, pages 5763 (2009) | Download Citation

Subjects

Abstract

RNA-Seq is a recently developed approach to transcriptome profiling that uses deep-sequencing technologies. Studies using this method have already altered our view of the extent and complexity of eukaryotic transcriptomes. RNA-Seq also provides a far more precise measurement of levels of transcripts and their isoforms than other methods. This article describes the RNA-Seq approach, the challenges associated with its application, and the advances made so far in characterizing several eukaryote transcriptomes.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

References

  1. 1.

    , & Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays. Science 296, 907–910 (2002).

  2. 2.

    et al. A high-resolution map of transcription in the yeast genome. Proc. Natl Acad. Sci. USA 103, 5320–5325 (2006).

  3. 3.

    et al. Empirical analysis of transcriptional activity in the Arabidopsis genome. Science 302, 842–846 (2003).

  4. 4.

    et al. Global identification of human transcribed sequences with genome tiling arrays. Science 306, 2242–2246 (2004).

  5. 5.

    et al. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science 308, 1149–1154 (2005).

  6. 6.

    & Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations. BMC Bioinformatics 7, 276 (2006).

  7. 7.

    , & Toward a universal microarray: prediction of gene expression through nearest-neighbor probe sequence identification. Nucleic Acids Res. 35, e99 (2007).

  8. 8.

    , & Gene discovery in dbEST. Science 265, 1993–1994 (1994).

  9. 9.

    et al. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 14, 2121–2127 (2004).

  10. 10.

    , , & Serial analysis of gene expression. Science 270, 484–487 (1995).

  11. 11.

    & Tag-based approaches for transcriptome research and genome annotation. Nature Methods 2, 495–502 (2005).

  12. 12.

    et al. CAGE: cap analysis of gene expression. Nature Methods 3, 211–222 (2006).

  13. 13.

    & [Cap analysis gene expression: CAGE]. Tanpakushitsu Kakusan Koso 49, 2688–2693 (2004) (in Japanese).

  14. 14.

    et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl Acad. Sci. USA 100, 15776–15781 (2003).

  15. 15.

    et al. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nature Biotechnol. 18, 630–634 (2000).

  16. 16.

    et al. A spatial dissection of the Arabidopsis floral transcriptome by MPSS. BMC Plant Biol. 8, 43 (2008).

  17. 17.

    et al. Massively parallel signature sequencing (MPSS) as a tool for in-depth quantitative gene expression profiling in all organisms. Brief. Funct. Genomic Proteomic 1, 95–104 (2002).

  18. 18.

    et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320, 1344–1349 (2008).

  19. 19.

    et al. Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature 453, 1239–1243 (2008).

  20. 20.

    , , , & Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods 5, 621–628 (2008).

  21. 21.

    et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008).

  22. 22.

    et al. Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nature Methods 5, 613–619 (2008).

  23. 23.

    , , , & RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 11 Jun 2008 (doi: 10.1101/gr.079558.108).

  24. 24.

    et al. Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. Biotechniques 45, 81–94 (2008).

  25. 25.

    & The new paradigm of flow cell sequencing. Genome Res. 18, 839–846 (2008).

  26. 26.

    , , , & SNP discovery via 454 transcriptome sequencing. Plant J. 51, 910–918 (2007).

  27. 27.

    et al. Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol. Ecol. 17, 1636–1647 (2008).

  28. 28.

    , , & Gene discovery and annotation using LCM-454 transcriptome sequencing. Genome Res. 17, 69–73 (2007).

  29. 29.

    et al. Dynamic transcriptome of Schizosaccharomyces pombe shown by RNA–DNA hybrid mapping. Nature Genet. 40, 977–986 (2008).

  30. 30.

    , et al. Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome. Genome Biol. 9, R3 (2008).

  31. 31.

    , , & SOAP: short oligonucleotide alignment program. Bioinformatics 24, 713–714 (2008).

  32. 32.

    , & Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 19 Aug 2008 (doi: 10.1101/gr.078212.108).

  33. 33.

    , & Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics 9, 128 (2008).

  34. 34.

    et al. Whole-genome sequencing and variant discovery in C. elegans. Nature Methods 5, 183–188 (2008).

  35. 35.

    et al. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nature Genet. 40, 722–729 (2008).

  36. 36.

    et al. Biological function of unannotated transcription during the early development of Drosophila melanogaster. Nature Genet. 38, 1151–1158 (2006).

  37. 37.

    Translational regulation of GCN4 and the general amino acid control of yeast. Annu. Rev. Microbiol. 59, 407–450 (2005).

  38. 38.

    & The RNA binding protein Pub1 modulates the stability of transcripts containing upstream open reading frames. Cell 101, 741–751 (2000).

  39. 39.

    & MicroRNA biogenesis: drosha can't cut it without a partner. Curr. Biol. 15, R61–64 (2005).

  40. 40.

    How does RNA editing affect dsRNA-mediated gene silencing? Cold Spring Harb. Symp. Quant. Biol. 71, 285–292 (2006).

  41. 41.

    et al. A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome. Science 321, 956–960 (2008).

  42. 42.

    et al. Large-scale analysis of the yeast genome by transposon tagging and gene disruption. Nature 402, 413–418 (1999).

  43. 43.

    , , , & High-throughput methods for the large-scale analysis of gene function by transposon tagging. Methods Enzymol. 328, 550–574 (2000).

Download references

Acknowledgements

We thank D. Raha for many valuable comments.

Author information

Affiliations

  1. Zhong Wang and Michael Snyder are at the Department of Molecular, Cellular and Developmental Biology, and Mark Gerstein is at the Department of Molecular, Biophysics and Biochemistry, Yale University, 219 Prospect Street, New Haven, Connecticut 06520, USA.

    • Zhong Wang
    • , Mark Gerstein
    •  & Michael Snyder

Authors

  1. Search for Zhong Wang in:

  2. Search for Mark Gerstein in:

  3. Search for Michael Snyder in:

Corresponding author

Correspondence to Michael Snyder.

Glossary

Cap analysis of gene expression

(CAGE). Similar to SAGE, except that 5′-end information of the transcript is analysed instead of 3′-end information.

Contigs

A group of sequences representing overlapping regions from a genome or transcriptome.

dsRNA editing

Site-specific modification of a pre-mRNA by dsRNA-specific enzymes that leads to the production of variant mRNA from the same gene.

Genomic tiling microarray

A DNA microarray that uses a set of overlapping oligonucleotide probes that represent a subset of or the whole genome at very high resolution.

Massively parallel signature sequencing

(MPSS). A gene expression quantification method that determines 17–20-bp 'signatures' from the ends of a cDNA molecule using multiple cycles of enzymatic cleavage and ligation.

MicroRNA

(miRNA). Small RNA molecules that are processed from small hairpin RNA (shRNA) precursors that are produced from miRNA genes. miRNAs are 21–23 nucleotides in length and through the RNA-induced silencing complex they target and silence mRNAs containing imperfectly complementary sequence.

Piwi-interacting RNAs

(piRNA). Small RNA species that are processed from single-stranded precursor RNAs. They are 25–35 nucleotides in length and form complexes with the piwi protein. piRNAs are probably involved in transposon silencing and stem-cell function.

Quantitative PCR

(qPCR). An application of PCR to determine the quantity of DNA or RNA in a sample. The measurements are often made in real time and the method is also called real-time PCR.

Sequencing depth

The total number of all the sequences reads or base pairs represented in a single sequencing experiment or series of experiments.

Serial analysis of gene expression

(SAGE). A method that uses short 14–20-bp sequence tags from the 3′ ends of transcripts to measure gene expression levels.

Short interfering RNA

(siRNA). RNA molecules that are 21–23 nucleotides long and that are processed from long double-stranded RNAs; they are functional components of the RNAi-induced silencing complex. siRNAs typically target and silence mRNAs by binding perfectly complementary sequences in the mRNA and causing their degradation and/or translation inhibition.

Spike-in RNA

A few species of RNA with known sequence and quantity that are added as internal controls in RNA-Seq experiments.

About this article

Publication history

Published

DOI

https://doi.org/10.1038/nrg2484

Further reading