Protocol | Published:

Genome-wide copy number analysis of single cells

Nature Protocols volume 7, pages 10241041 (2012) | Download Citation

  • A Corrigendum to this article was published on 25 February 2016

This article has been updated


Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells in which information regarding genetic heterogeneity is lost. Here we present a protocol that allows for the genome-wide copy number analysis of single nuclei isolated from mixed populations of cells. Single-nucleus sequencing (SNS), combines flow sorting of single nuclei on the basis of DNA content and whole-genome amplification (WGA); this is followed by next-generation sequencing to quantize genomic intervals in a genome-wide manner. Multiplexing of single cells is discussed. In addition, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes 3 d from flow cytometry to sequence-ready DNA libraries.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Change history

  • 24 February 2016

     In the version of this article initially published, the units for the concentration of NaCl in the NST buffer described in the Reagent Setup section were incorrect. The correct unit should be mM. The error has been corrected in the HTML and PDF versions of the article.


Sequence Read Archive


  1. 1.

    , & Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability. Nat. Rev. Genet. 8, 639–646 (2007).

  2. 2.

    et al. High-resolution copy-number variation map reflects human olfactory receptor diversity and evolution. PLoS Genet. 4, e1000249 (2008).

  3. 3.

    et al. Diet and the evolution of human amylase gene copy number variation. Nat. Genet. 39, 1256–1260 (2007).

  4. 4.

    et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42, 937–948 (2010).

  5. 5.

    et al. Strong association of de novo copy number mutations with autism. Science 316, 445–449 (2007).

  6. 6.

    et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).

  7. 7.

    et al. Signatures of mutation and selection in the cancer genome. Nature 463, 893–898 (2010).

  8. 8.

    et al. Genomic architecture characterizes tumor progression paths and fate in breast cancer patients. Sci. Transl. Med. 2, 38ra47 (2010).

  9. 9.

    , , , & DNA amplifications in breast cancer: genotypic-phenotypic correlations. Future Oncol. 6, 967–984 (2010).

  10. 10.

    & The array CGH and its clinical applications. Drug Discov. Today 13, 760–770 (2008).

  11. 11.

    , , , & A census of amplified and overexpressed human cancer genes. Nat. Rev. Cancer 10, 59–64 (2010).

  12. 12.

    & Array comparative genomic hybridization and its applications in cancer. Nat. Genet. 37 (suppl.): S11–17 (2005).

  13. 13.

    et al. Clonal heterogeneity in childhood myelodysplastic syndromes—challenge for the detection of chromosomal imbalances by array-CGH. Genes Chromosomes Cancer 49, 885–900 (2010).

  14. 14.

    Sequencing technologies—the next generation. Nat. Rev. Genet. 11, 31–46 (2010).

  15. 15.

    The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 (2008).

  16. 16.

    , & Advances in understanding cancer genomes through second-generation sequencing. Nat. Rev. Genet. 11, 685–696 (2010).

  17. 17.

    et al. High-resolution mapping of copy-number alterations with massively parallel sequencing. Nat. Methods 6, 99–103 (2009).

  18. 18.

    et al. Personalized copy number and segmental duplication maps using next-generation sequencing. Nat. Genet. 41, 1061–1067 (2009).

  19. 19.

    et al. Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature 461, 809–813 (2009).

  20. 20.

    et al. Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature 464, 999–1005 (2010).

  21. 21.

    et al. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature 467, 1109–1113 (2010).

  22. 22.

    et al. Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature 467, 1114–1117 (2010).

  23. 23.

    et al. Genetic variegation of clonal architecture and propagating cells in leukaemia. Nature 469, 356–361 (2011).

  24. 24.

    et al. Tumour evolution inferred by single-cell sequencing. Nature 472, 90–94 (2011).

  25. 25.

    , , & Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).

  26. 26.

    et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 15, 2078–2079 (2009).

  27. 27.

    & A faster cicular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics 23, 657–63 (2007).

  28. 28.

    et al. Doublet discrimination in DNA cell-cycle analysis. Cytometry 46, 296–306 (2001).

  29. 29.

    , , , & Degradation of chromosomal DNA during apoptosis. Cell Death Differ 10, 108–116 (2003).

  30. 30.

    et al. Massive genomic rearragement acquired in a single catastrophic event during cancer development. Cell 144, 27–40 (2011).

Download references


We acknowledge all the members of the Hicks and Wigler labs for discussions during the technology development. We thank W. McCombie, E. Ghiban and L. Gelley for their advice and technical assistance, and A. Gordon for informatics support and assistance with figures. N.N. is supported by grants from Texas STARS and the Alice Kleberg Reynolds Foundation. This work was supported by grants to M.W. and J.H. from the Department of the Army (W81XWH04-1-0477), the Breast Cancer Research Foundation, and the Simons Foundation. M.W. is an American Cancer Society Research Professor.

Author information


  1. Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA.

    • Timour Baslan
    • , Jude Kendall
    • , Linda Rodgers
    • , Hilary Cox
    • , Mike Riggs
    • , Asya Stepansky
    • , Jennifer Troge
    • , Kandasamy Ravi
    • , Diane Esposito
    • , Michael Wigler
    •  & James Hicks
  2. Molecular and Cellular Biology Program, Stony Brook University, Stony Brook, New York, USA.

    • Timour Baslan
  3. Ontario Institute for Cancer Research, Toronto, Ontario, Canada.

    • B Lakshmi
  4. Department of Genetics, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.

    • Nicholas Navin
  5. Department of Bioinformatics and Computational Biology, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.

    • Nicholas Navin


  1. Search for Timour Baslan in:

  2. Search for Jude Kendall in:

  3. Search for Linda Rodgers in:

  4. Search for Hilary Cox in:

  5. Search for Mike Riggs in:

  6. Search for Asya Stepansky in:

  7. Search for Jennifer Troge in:

  8. Search for Kandasamy Ravi in:

  9. Search for Diane Esposito in:

  10. Search for B Lakshmi in:

  11. Search for Michael Wigler in:

  12. Search for Nicholas Navin in:

  13. Search for James Hicks in:


T.B., J.K., N.N., J.H. and M.W. designed methods and analyzed data. T.B., L.R., H.C., M.R., A.S. and J.T. performed experiments. K.R., D.E. and B.L. contributed to the technology development. J.K., N.N., T.B., J.H. and M.W. developed informatic analysis. T.B., J.K. and J.H. wrote the paper.

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to James Hicks.

Supplementary information

PostScript documents

  1. 1.

    Supplementary Figure 1

    Outline of the informatics section. Provide a concise outline of all the steps with input-program-output labels.

Zip files

  1. 1.

    Supplementary Data

    Input/output data for the informatics procedure of the protocol required to reproduce copy number profiles from single cell sequencing data. Supplementary Data Legend outlines all the data with descriptions.

  2. 2.

    Supplementary Methods

    Contains all programs required to process single cell sequencing data for copy number determination, from genome preparation to sequence mapping and copy number estimation. Supplementary Methods Legend outlines all the programs and their utilities.

Text files

  1. 1.

    Supplementary Note

    Descriptions of data and program files. Text providing a brief overview of the data and program files included in the Supplementary Methods and Supplementary Data.

About this article

Publication history



Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.