Letter | Published:

The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells

Nature Biotechnology volume 32, pages 381386 (2014) | Download Citation

Abstract

Defining the transcriptional dynamics of a temporal process such as cell differentiation is challenging owing to the high variability in gene expression between individual cells. Time-series gene expression analyses of bulk cells have difficulty distinguishing early and late phases of a transcriptional cascade or identifying rare subpopulations of cells, and single-cell proteomic methods rely on a priori knowledge of key distinguishing markers1. Here we describe Monocle, an unsupervised algorithm that increases the temporal resolution of transcriptome dynamics using single-cell RNA-Seq data collected at multiple time points. Applied to the differentiation of primary human myoblasts, Monocle revealed switch-like changes in expression of key regulatory factors, sequential waves of gene regulation, and expression of regulators that were not known to act in differentiation. We validated some of these predicted regulators in a loss-of function screen. Monocle can in principle be used to recover single-cell gene expression kinetics from a wide array of cellular processes, including differentiation, proliferation and oncogenic transformation.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

Accessions

Primary accessions

Gene Expression Omnibus

References

  1. 1.

    et al. Single-cell mass cytometry of differential immune and drug responses across a human hematopoietic continuum. Science 332, 687–696 (2011).

  2. 2.

    et al. Single-cell transcriptomics reveals bimodality inexpression and splicing in immune cells. Nature 498, 236–240 (2013).

  3. 3.

    et al. Resolution of cell fate decisions revealed by single-cell gene expression analysis from zygote to blastocyst. Dev. Cell 18, 675–685 (2010).

  4. 4.

    , , & CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Reports 2, 666–673 (2012).

  5. 5.

    et al. Single-cell expression analyses during cellular reprogramming reveal an early stochastic and a late hierarchic phase. Cell 150, 1209–1222 (2012).

  6. 6.

    et al. Tracing the derivation of embryonic stem cells from the inner cell mass by single-cell RNA-Seq analysis. Cell Stem Cell 6, 468–478 (2010).

  7. 7.

    et al. Full-length mRNA-seq from single-cell levels of RNA and individual circulating tumor cells. Nat. Biotechnol. 30, 777–782 (2012).

  8. 8.

    The interpretation of interaction in contingency tables. J. R. Stat. Soc. Series B Stat. Methodol. 13, 238–241 (1951).

  9. 9.

    , & Single-cell sequencing-based technologies will revolutionize whole-organism science. Nat. Rev. Genet. 14, 618–630 (2013).

  10. 10.

    , & Reconstructing the temporal ordering of biological samples using microarray data. Bioinformatics 19, 842–850 (2003).

  11. 11.

    & Extracting dynamics from static cancer expression data. IEEE/ACM Trans. Comput. Biol. and Bioinf. 5, 172–182 (2008).

  12. 12.

    , & Discovering biological progression underlying microarray samples. PLOS Comput. Biol. 7, e1001123 (2011).

  13. 13.

    et al. Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE. Nat. Biotechnol. 29, 886–891 (2011).

  14. 14.

    & Myoblast fusion: lessons from flies and mice. Development 139, 641–656 (2012).

  15. 15.

    The circuitry of a master switch: Myod and the regulation of skeletal muscle gene transcription. Development 132, 2685–2695 (2005).

  16. 16.

    Expression profiling and identification of novel genes involved in myogenic differentiation. FASEB J. 10.1096/fj.03-0568fje (2004).

  17. 17.

    & Independent component analysis: algorithms and applications. Neural Netw. 411–430 (2000).

  18. 18.

    et al. visNe enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat. Biotechnol. 31, 545–552 (2013).

  19. 19.

    et al. Muscle injury activates resident fibro/adipogenic progenitors that facilitate myogenesis. Nat. Cell Biol. 12, 153–163 (2010).

  20. 20.

    An initial blueprint for myogenic differentiation. Genes Dev. 19, 553–569 (2005).

  21. 21.

    et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 489, 83–90 (2012).

  22. 22.

    & ChromHMM: automating chromatin state discovery and characterization. Nat. Methods 9, 215–216 (2012).

  23. 23.

    , & Autoactivation of Xenopus MyoD transcription and its inhibition by USF. Cell Growth Differ. 8, 275–282 (1997).

  24. 24.

    & The multiple roles of CUX1: insights from mouse models and cell-based assays. Gene 412, 84–94 (2008).

  25. 25.

    et al. XBP1 controls diverse cell type- and condition-specific transcriptional regulatory networks. Mol. Cell 27, 53–66 (2007).

  26. 26.

    & Computational morphology of curves. Vis. Comput. 11, 105–112 (1994).

  27. 27.

    Curve reconstruction in arbitrary dimension and the traveling salesman problem. Proc. 8th Discrete Geometry and Computational Imagery Conference(DGCI) 164–176 (1999).

  28. 28.

    Curve reconstruction, the traveling salesman problem, and Menger's theorem on length. Discrete Comput. Geom. 24, 577–603 (2000).

  29. 29.

    & Testing for the consecutive ones property, interval graphs, and graph planarity using PQ-tree algorithms. J. Comput. Syst. Sci. 13, 335–379 (1976).

  30. 30.

    et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).

  31. 31.

    et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and genefusions. Genome Biol. 14, R36 (2013).

  32. 32.

    & Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

  33. 33.

    et al. GENCODE: producing a reference annotation for ENCODE. Genome Biol. 7 (suppl. 1), S4.1–S4.9 (2006).

  34. 34.

    et al. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat. Biotechnol. 31, 46–53 (2013).

  35. 35.

    Estimation of relationships for limited dependent variables. Econometrica 26, 24–36 (1958).

  36. 36.

    & Generalized additive models, Stat. Sci. 1, 297–318 (1986).

  37. 37.

    W. & Generalized Additive Models. J. R. Stat. Soc. Series B Stat. Methodol. 58, 481–493 (1996).

  38. 38.

    et al. A lentiviral RNAi library for human and mouse genes applied to an arrayed viral high-content screen. Cell 124, 1283–1298 (2006).

  39. 39.

    et al. miR-31 modulates dystrophin expression: new implications for Duchenne muscular dystrophy therapy. EMBO Rep. 12, 136–141 (2011).

  40. 40.

    & Comprehensive algorithm for quantitative real-time polymerase chain reaction. J. Comput. Biol. 12, 1047–1064 (2005).

  41. 41.

    & BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

  42. 42.

    & Camera: a competitive gene set test accounting for inter-gene correlation. Nucleic Acids Res. 40, e133 (2012).

Download references

Acknowledgements

We are grateful to S. Bordbar, C. Zhu, A. Wagers and the Broad RNAi platform for technical assistance and to M. Soumillon for discussions. C.T. was supported by a Damon Runyon Cancer Research Foundation Fellowship. D.C. was supported by a Human Frontier Science Program Fellowship. D.C. and T.S.M. were supported by the Harvard Stem Cell Institute. This work was supported by US National Institutes of Health grants 1DP2OD00667, P01GM099117 and P50HG006193-01. This work was also supported in part by the Single Cell Genomics initiative, a collaboration between the Broad Institute and Fluidigm Inc.

Author information

Author notes

    • Cole Trapnell
    •  & Davide Cacchiarelli

    These authors contributed equally to this work.

Affiliations

  1. Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts, USA.

    • Cole Trapnell
    • , Davide Cacchiarelli
    • , Michael Morse
    • , Tarjei S Mikkelsen
    •  & John L Rinn
  2. The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.

    • Cole Trapnell
    • , Davide Cacchiarelli
    • , Jonna Grimsby
    • , Prapti Pokharel
    • , Michael Morse
    • , Niall J Lennon
    • , Tarjei S Mikkelsen
    •  & John L Rinn
  3. Harvard Stem Cell Institute, Harvard University, Cambridge, Massachusetts, USA.

    • Davide Cacchiarelli
    •  & Tarjei S Mikkelsen
  4. Fluidigm Corporation, South San Francisco, California, USA.

    • Shuqiang Li
    •  & Kenneth J Livak
  5. Department of Pathology, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA.

    • John L Rinn

Authors

  1. Search for Cole Trapnell in:

  2. Search for Davide Cacchiarelli in:

  3. Search for Jonna Grimsby in:

  4. Search for Prapti Pokharel in:

  5. Search for Shuqiang Li in:

  6. Search for Michael Morse in:

  7. Search for Niall J Lennon in:

  8. Search for Kenneth J Livak in:

  9. Search for Tarjei S Mikkelsen in:

  10. Search for John L Rinn in:

Contributions

C.T. and D.C. conceived the strategy of ordering individual cells by developmental progress. C.T. designed and wrote Monocle and performed the computational analysis. D.C., C.T., J.G., P.P., S.L. and M.M. performed the experiments. D.C., C.T. and J.L.R. designed the study. C.T., D.C., J.G., N.J.L., K.J.L., T.S.M. and J.L.R. wrote the manuscript.

Competing interests

K.J.L. and S.L. are employees of Fluidigm, Inc. Fluidigm manufactures an instrument used in this study.

Corresponding author

Correspondence to John L Rinn.

Supplementary information

PDF files

  1. 1.

    Supplementary Text and Figures

    Supplementary Figures 1–13 and Supplementary Tables 1 and 2

Zip files

  1. 1.

    Supplementary Source Code

    Source code

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/nbt.2859

Further reading