Somatic coding mutations in human induced pluripotent stem cells

Journal name:
Date published:
Published online


Defined transcription factors can induce epigenetic reprogramming of adult mammalian cells into induced pluripotent stem cells. Although DNA factors are integrated during some reprogramming methods, it is unknown whether the genome remains unchanged at the single nucleotide level. Here we show that 22 human induced pluripotent stem (hiPS) cell lines reprogrammed using five different methods each contained an average of five protein-coding point mutations in the regions sampled (an estimated six protein-coding point mutations per exome). The majority of these mutations were non-synonymous, nonsense or splice variants, and were enriched in genes mutated or having causative effects in cancers. At least half of these reprogramming-associated mutations pre-existed in fibroblast progenitors at low frequencies, whereas the rest occurred during or after reprogramming. Thus, hiPS cells acquire genetic modifications in addition to epigenetic modifications. Extensive genetic screening should become a standard procedure to ensure hiPS cell safety before clinical use.

Accession codes

Primary accessions

Sequence Read Archive


  1. Takahashi, K. & Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126, 663676 (2006)
  2. Yu, J. et al. Induced pluripotent stem cell lines derived from human somatic cells. Science 318, 19171920 (2007)
  3. Mayshar, Y. et al. Identification and classification of chromosomal aberrations in human induced pluripotent stem cells. Cell Stem Cell 7, 521531 (2010)
  4. Hong, H. et al. Suppression of induced pluripotent stem cell generation by the p53–p21 pathway. Nature 460, 11321135 (2009)
  5. Li, H. et al. The Ink4/Arf locus is a barrier for iPS cell reprogramming. Nature 460, 11361139 (2009)
  6. Kawamura, T. et al. Linking the p53 tumour suppressor pathway to somatic cell reprogramming. Nature 460, 11401144 (2009)
  7. Utikal, J. et al. Immortalization eliminates a roadblock during cellular reprogramming into iPS cells. Nature 460, 11451148 (2009)
  8. Marión, R. M. et al. A p53-mediated DNA damage response limits reprogramming to ensure iPS cell genomic integrity. Nature 460, 11491153 (2009)
  9. Ruiz, S. et al. A high proliferation rate is required for somatic cell reprogramming and maintenance of human embryonic stem cell identity. Curr. Biol. 21, 4552 (2011)
  10. Porreca, G. J. et al. Multiplex amplification of large sets of human exons. Nature Methods 4, 931936 (2007)
  11. Deng, J. et al. Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming. Nature Biotechnol. 27, 353360 (2009)
  12. Bashiardes, S. et al. Direct genomic selection. Nature Methods 2, 6369 (2005)
  13. Gnirke, A. et al. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nature Biotechnol. 27, 182189 (2009)
  14. Levy, S. et al. The diploid genome sequence of an individual human. PLoS Biol. 5, e254 (2007)
  15. Drmanac, R. et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science 327, 7881 (2009)
  16. Ng, S. B. et al. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461, 272276 (2009)
  17. Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nature Protocols 4, 10731081 (2009)
  18. Forbes, S. A. et al. The Catalogue of Somatic Mutations in Cancer (COSMIC). Curr. Protocols Hum. Genet. 10, 10.11 (2008)
  19. Shah, S. P. et al. Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature 461, 809813 (2009)
  20. Futreal, P. A. et al. A census of human cancer genes. Nature Rev. Cancer 4, 177183 (2004)
  21. Hamosh, A., Scott, A. F., Amberger, J. S., Bocchini, C. A. & McKusick, V. A. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 33, D514D517 (2005)
  22. Druley, T. E. et al. Quantification of rare allelic variants from pooled genomic DNA. Nature Methods 6, 263265 (2009)
  23. Ahuja, D., Saenz-Robles, M. T. & Pipas, J. M. SV40 large T antigen targets multiple cellular pathways to elicit cellular transformation. Oncogene 24, 77297745 (2005)
  24. Yu, J. et al. Human induced pluripotent stem cells free of vector and transgene sequences. Science 324, 797801 (2009)
  25. Pleasance, E. D. et al. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 463, 191196 (2010)
  26. Lee, W. et al. The mutation spectrum revealed by paired genome sequences from a lung cancer patient. Nature 465, 473477 (2010)
  27. Ding, L. et al. Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature 464, 9991005 (2010)
  28. Dennis, G., Jr et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol. 4, P3 (2003)
  29. Lee, J. H. et al. A robust approach to identifying tissue-specific gene expression regulatory variants using personalized human induced pluripotent stem cells. PLoS Genet. 5, e1000718 (2009)
  30. Park, I. H. et al. Reprogramming of human somatic cells to pluripotency with defined factors. Nature 451, 141146 (2008)
  31. Chan, E. M. et al. Live cell imaging distinguishes bona fide human iPS cells from partially reprogrammed cells. Nature Biotechnol. 27, 10331037 (2009)
  32. Dimos, J. T. et al. Induced pluripotent stem cells generated from patients with ALS can be differentiated into motor neurons. Science 321, 12181221 (2008)
  33. Rodriguez-Piza, I. et al. Reprogramming of human fibroblasts to induced pluripotent stem cells under xeno-free conditions. Stem Cells 28, 3644 (2010)
  34. Aasen, T. et al. Efficient and rapid generation of induced pluripotent stem cells from human keratinocytes. Nature Biotechnol. 26, 12761284 (2008)
  35. Stewart, S. A. et al. Lentivirus-delivered stable gene silencing by RNAi in primary cells. RNA 9, 493501 (2003)
  36. Warren, L. et al. Highly efficient reprogramming to pluripotency and directed differentiation of human cells with synthetic modified mRNA. Cell Stem Cell 7, 618630 (2010)
  37. Akagi, T., Sasai, K. & Hanafusa, H. Refractory nature of normal human diploid fibroblasts with respect to oncogene-mediated transformation. Proc. Natl Acad. Sci. USA 100, 1356713572 (2003)
  38. Cowan, C. A. et al. Derivation of embryonic stem-cell lines from human blastocysts. N. Engl. J. Med. 350, 13531356 (2004)
  39. Boulting, G. L. et al. A functionally characterized test set of human induced pluripotent stem cells. Nature advance online publication. doi:10.1038/nbt.1783 (3 February 2011)
  40. Zhang, K. et al. Digital RNA allelotyping reveals tissue-specific and allele-specific gene expression in human. Nature Methods 6, 613618 (2009)
  41. Meena Kishore, S., Vincent, T. K. C. & Pandjassarame, K. Distributions of exons and introns in the human genome. In Silico Biol. 4, 387393 (2004)

Download references

Author information

  1. These authors contributed equally to this work.

    • Athurva Gore &
    • Zhe Li


  1. Department of Bioengineering, Institute for Genomic Medicine and Institute of Engineering in Medicine, University of California at San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA

    • Athurva Gore,
    • Zhe Li,
    • Ho-Lim Fung &
    • Kun Zhang
  2. Department of Cellular and Molecular Medicine and Howard Hughes Medical Institute, University of California at San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA

    • Jessica E. Young,
    • Isabel Canto,
    • Mason A. Israel,
    • Melissa L. Wilbert &
    • Lawrence S. B. Goldstein
  3. Division of Pediatric Hematology/Oncology, Children’s Hospital Boston and Dana Farber Cancer Institute, Boston, Massachusetts 02115, USA

    • Suneet Agarwal,
    • Yuin-Han Loh,
    • Philip D. Manos &
    • George Q. Daley
  4. Department of Anatomy, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA

    • Jessica Antosiewicz-Bourget,
    • Junying Yu &
    • James A. Thomson
  5. Center of Regenerative Medicine, 08003 Barcelona, Spain

    • Alessandra Giorgetti,
    • Nuria Montserrat &
    • Juan Carlos Izpisua Belmonte
  6. Howard Hughes Medical Institute, Harvard Stem Cell Institute, Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA

    • Evangelos Kiskinis &
    • Kevin Eggan
  7. Department of Genetics, Harvard Medical School, Boston, Massachusetts 02135, USA

    • Je-Hyuk Lee
  8. Salk Institute for Biological Studies, La Jolla, California 92037, USA

    • Athanasia D. Panopoulos,
    • Sergio Ruiz &
    • Juan Carlos Izpisua Belmonte
  9. The J. Craig Venter Institute, Rockville, Maryland 20850, USA

    • Ewen F. Kirkness
  10. Immune Disease Institute, Children’s Hospital Boston, Boston, Massachusetts 02115, USA

    • Derrick J. Rossi


L.S.B.G. and K.Z. co-directed the study. A. Gore, Z.L., L.S.B.G. and K.Z. designed the experiments. J.E.Y., S.A., J.A.-B., I.C., A. Giorgetti, M.A.I., E.K., J.-H.L., Y.-H.L., P.D.M., N.M., A.D.P., S.R., M.L.W., J. Yu, J.C.I.B., D.J.R., J.A.T., K.E., G.Q.D. and L.S.B.G. biopsied, cultured and derived hiPS cell lines. Z.L. performed DNA extraction. A. Gore, Z.L. and K.Z. performed exome library construction, DigiQ library construction and validation Sanger sequencing. H.-L.F. performed Illumina sequencing. A. Gore and K.Z. performed bioinformatic and statistical analysis with contributions from E.F.K. A. Gore, Z.L., L.S.B.G., G.Q.D. and K.Z. wrote the manuscript with contributions from all other authors.

Competing financial interests

The authors declare no competing financial interests.

Corresponding authors

Correspondence to:

Sequencing results for the mutations reported here are included in Supplementary Figure 1. Raw Illumina sequencing reads are available from the NCBI ShortRead Archive, accession SRP005709, except for lines derived from Hib11, Hib17, Hib29, CF, HFFxF, dH1F fibroblasts as the original donors were not consulted about public release of their genome data.

Author details

Supplementary information

PDF files

  1. Supplementary Information (8.7M)

    The file contains Supplementary Figures 1-5 with legends and a Supplementary Note.

Excel files

  1. Supplementary Tables (73K)

    The file contains Supplementary Tables 1-3

Additional data