It is well established that autism spectrum disorders (ASD) have a strong genetic component; however, for at least 70% of cases, the underlying genetic cause is unknown1. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes—so-called sporadic or simplex families2,3—we sequenced all coding regions of the genome (the exome) for parent–child trios exhibiting sporadic ASD, including 189 new trios and 20 that were previously reported4. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19)4, for a total of 677 individual exomes from 209 families. Here we show that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD5. Moreover, 39% (49 of 126) of the most severe or disruptive de novo mutations map to a highly interconnected β-catenin/chromatin remodelling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes: CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3 and SCN1A. Combined with copy number variant (CNV) data, these results indicate extreme locus heterogeneity but also provide a target for future discovery, diagnostics and therapeutics.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.


Data deposits

Access to the raw sequence reads can be found at the NCBI database of Genotypes and Phenotypes (dbGaP) and National Database for Autism Research under accession numbers phs000482.v1.p1 and NDARCOL0001878, respectively.


  1. 1.

    & Solving the autism puzzle a few pieces at a time. Neuron 70, 806–808 (2011)

  2. 2.

    et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron 70, 863–885 (2011)

  3. 3.

    et al. Rare de novo and transmitted copy-number variation in autistic spectrum disorders. Neuron 70, 886–897 (2011)

  4. 4.

    et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nature Genet. 43, 585–589 (2011)

  5. 5.

    , , , & Advancing paternal age and risk of autism: new evidence from a population-based study and a meta-analysis of epidemiological studies. Mol. Psychiatry 16, 1203–1212 (2010)

  6. 6.

    & The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron 68, 192–195 (2010)

  7. 7.

    Rate, molecular spectrum, and consequences of human mutation. Proc. Natl Acad. Sci. USA 107, 961–968 (2010)

  8. 8.

    et al. Exome sequencing supports a de novo mutational paradigm for schizophrenia. Nature Genet. 43, 864–868 (2011)

  9. 9.

    et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature (this issue)

  10. 10.

    et al. De novo copy number variants associated with intellectual disability have a paternal origin and age bias. J. Med. Genet. 48, 776–778 (2011)

  11. 11.

    & Autism genetics: strategies, challenges, and opportunities. Autism Res. 1, 4–17 (2008)

  12. 12.

    , , & Axonal netrin-Gs transneuronally determine lamina-specific subdendritic segments. Proc. Natl Acad. Sci. USA 104, 14801–14806 (2007)

  13. 13.

    et al. Disruption of Netrin G1 by a balanced chromosome translocation in a girl with Rett syndrome. Eur. J. Hum. Genet. 13, 921–927 (2005)

  14. 14.

    et al. CHD8 suppresses p53-mediated apoptosis through histone H1 recruitment during early embryogenesis. Nature Cell Biol. 11, 172–182 (2009)

  15. 15.

    , , & CHD8 is an ATP-dependent chromatin remodeling factor that regulates β-catenin target genes. Mol. Cell. Biol. 28, 3894–3904 (2008)

  16. 16.

    et al. CHD8 interacts with CHD7, a protein which is mutated in CHARGE syndrome. Hum. Mol. Genet. 19, 2858–2866 (2010)

  17. 17.

    Etiological heterogeneity in autism spectrum disorders: more than 100 genetic and genomic disorders and still counting. Brain Res. 1380, 42–77 (2011)

  18. 18.

    et al. Truncation of the Down syndrome candidate gene DYRK1A in two unrelated patients with microcephaly. Am. J. Hum. Genet. 82, 1165–1170 (2008)

  19. 19.

    et al. A copy number variation morbidity map of developmental delay. Nature Genet. 43, 838–846 (2011)

  20. 20.

    et al. Delineation of a critical region on chromosome 18 for the del(18)(q12.2q21.1) syndrome. Am. J. Med. Genet. A. 146A, 1330–1334 (2008)

  21. 21.

    et al. De novo mutations of SETBP1 cause Schinzel-Giedion syndrome. Nature Genet. 42, 483–485 (2010)

  22. 22.

    et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220 (2010)

  23. 23.

    , , & DADA: Degree-aware algorithms for network-based disease gene prioritization. BioData Mining 4, 19 (2011)

  24. 24.

    & The ups and downs of Wnt signaling in prevalent neurological disorders. Oncogene 25, 7545–7553 (2006)

  25. 25.

    et al. Tbr1 regulates regional and laminar identity of postmitotic neurons in developing neocortex. Proc. Natl Acad. Sci. USA 107, 13129–13134 (2010)

  26. 26.

    , , , & Massively parallel exon capture and library-free resequencing across 16 genomes. Nature Methods 6, 315–316 (2009)

  27. 27.

    et al. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature 474, 380–384 (2011)

  28. 28.

    et al. Protein interactome reveals converging molecular pathways among autism disorders. Sci. Transl. Med. 3, 86ra49 (2011)

  29. 29.

    et al. Rare de novo variants associated with autism implicate a large functional network of genes involved in formation and function of synapses. Neuron 70, 898–907 (2011)

  30. 30.

    & Wnt signaling: multiple functions in neural development. Cell. Mol. Life Sci. 62, 1100–1108 (2005)

  31. 31.

    & The non-apoptotic role of p53 in neuronal biology: enlightening the dark side of the moon. EMBO Rep. 10, 576–583 (2009)

  32. 32.

    & Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009)

  33. 33.

    et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genet. 43, (2011)

  34. 34.

    et al. mrsFAST: a cache-oblivious algorithm for short-read mapping. Nature Methods 7, 576–577 (2010)

  35. 35.

    , , , & Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods 5, 621–628 (2008)

  36. 36.

    NIMH Human Genetics Initiative: 2003 update. Am. J. Psychiatry 160, 621–622 (2003)

  37. 37.

    & The World Mental Health (WMH) survey initiative version of the World Health Organization (WHO) Composite International Diagnostic Interview (CIDI). Int. J. Methods Psychiatr. Res. 13, 93–121 (2004)

  38. 38.

    et al. The ClinSeq Project: piloting large-scale genome sequencing for research in genomic medicine. Genome Res. 19, 1665–1674 (2009)

  39. 39.

    , & A comparison between screened NIMH and clinically interviewed control samples on neuroticism and extraversion. Mol. Psychiatry 13, 122–130 (2008)

  40. 40.

    et al. A genome-wide association study implicates diacylglycerol kinase eta (DGKH) and several other genes in the etiology of bipolar disorder. Mol. Psychiatry 13, 197–207 (2008)

  41. 41.

    et al. Population analysis of large copy number variants and hotspots of human genetic disease. Am. J. Hum. Genet. 84, 148–161 (2009)

  42. 42.

    et al. Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls. Nature 464, 713–720 (2010)

  43. 43.

    , , , & Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27, 431–432 (2011)

  44. 44.

    , , , & Computing topological parameters of biological networks. Bioinformatics 24, 282–284 (2008)

  45. 45.

    et al. Target-enrichment strategies for next-generation sequencing. Nature Methods 7, 111–118 (2010)

  46. 46.

    & Estimating the number of species - a Review. J. Am. Stat. Assoc. 88, 364–373 (1993)

  47. 47.

    & Estimating the number of classes via sample coverage. J. Am. Stat. Assoc. 87, 210–217 (1992)

Download references


We would like to thank and recognize the following ongoing studies that produced and provided exome variant calls for comparison: NHLBI Lung Cohort Sequencing Project (HL 1029230), NHLBI WHI Sequencing Project (HL 102924), NIEHS SNPs (HHSN273200800010C), NHLBI/NHGRI SeattleSeq (HL 094976), and the Northwest Genomics Center (HL 102926). We are grateful to all of the families at the participating Simons Simplex Collection (SSC) sites, as well as the principal investigators (A. Beaudet, R. Bernier, J. Constantino, E. Cook, E. Fombonne, D. Geschwind, E. Hanson, D. Grice, A. Klin, R. Kochel, D. Ledbetter, C. Lord, C. Martin, D. Martin, R. Maxim, J. Miles, O. Ousley, K. Pelphrey, B. Peterson, J. Piggot, C. Saulnier, M. State, W. Stone, J. Sutcliffe, C. Walsh, Z. Warren and E. Wijsman). We also acknowledge M. State and the Simons Simplex Collection Genetics Consortium for providing Illumina genotyping data, T. Lehner and the Autism Sequencing Consortium for providing an opportunity for pre-publication data exchange among the participating groups. We appreciate obtaining access to phenotypic data on SFARI Base. This work was supported by the Simons Foundation Autism Research Initiative (SFARI 137578 and 191889; E.E.E., J.S. and R.B.) and NIH HD065285 (E.E.E. and J.S.). E.B. is an Alfred P. Sloan Research Fellow. E.E.E. is an Investigator of the Howard Hughes Medical Institute.

Author information


  1. Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA

    • Brian J. O’Roak
    • , Laura Vives
    • , Santhosh Girirajan
    • , Emre Karakoc
    • , Niklas Krumm
    • , Bradley P. Coe
    • , Roie Levy
    • , Arthur Ko
    • , Choli Lee
    • , Joshua D. Smith
    • , Emily H. Turner
    • , Ian B. Stanaway
    • , Benjamin Vernot
    • , Maika Malig
    • , Carl Baker
    • , Joshua M. Akey
    • , Elhanan Borenstein
    • , Mark J. Rieder
    • , Deborah A. Nickerson
    • , Jay Shendure
    •  & Evan E. Eichler
  2. Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, Washington 98195, USA

    • Beau Reilly
    •  & Raphael Bernier
  3. Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA

    • Elhanan Borenstein
  4. Santa Fe Institute, Santa Fe, New Mexico 87501, USA

    • Elhanan Borenstein
  5. Howard Hughes Medical Institute, Seattle, Washington 98195, USA

    • Evan E. Eichler


  1. Search for Brian J. O’Roak in:

  2. Search for Laura Vives in:

  3. Search for Santhosh Girirajan in:

  4. Search for Emre Karakoc in:

  5. Search for Niklas Krumm in:

  6. Search for Bradley P. Coe in:

  7. Search for Roie Levy in:

  8. Search for Arthur Ko in:

  9. Search for Choli Lee in:

  10. Search for Joshua D. Smith in:

  11. Search for Emily H. Turner in:

  12. Search for Ian B. Stanaway in:

  13. Search for Benjamin Vernot in:

  14. Search for Maika Malig in:

  15. Search for Carl Baker in:

  16. Search for Beau Reilly in:

  17. Search for Joshua M. Akey in:

  18. Search for Elhanan Borenstein in:

  19. Search for Mark J. Rieder in:

  20. Search for Deborah A. Nickerson in:

  21. Search for Raphael Bernier in:

  22. Search for Jay Shendure in:

  23. Search for Evan E. Eichler in:


E.E.E., J.S. and B.J.O. designed the study and drafted the manuscript. E.E.E. and J.S. supervised the study. R.B., B.R. and B.J.O. analysed the clinical information. R.B., L.V., S.G., E.K., N.K. and B.P.C. contributed to the manuscript. S.G., N.K., B.P.C., A.K., C.B., M.M. and L.V. generated and analysed CNV data. B.J.O. and L.V. performed MIP resequencing and mutation validations. I.B.S., E.H.T., B.J.O. and J.S. developed MIP protocol and analysis. B.V. and J.M.A. generated loci-specific mutation rate estimates. R.L. and E.B. performed PPI network analysis and simulations. E.K. performed DADA analysis. C.L. performed Illumina sequencing. J.D.S., I.B.S., E.H.T. and C.L. analysed sequence data. B.P.C. performed IPA analysis. B.J.O., E.K. and N.K. developed the de novo analysis pipelines and analysed sequence data. D.A.N., M.J.R., J.D.S. and E.H.T. supervised exome sequencing and primary analysis.

Competing interests

E.E.E. is on the scientific advisory boards for Pacific Biosciences, Inc and SynapDx Corp. J.S. is a member of the scientific advisory board or serves as a consultant for Aria Diagnostics, Stratos Genomics, Good Start Genetics, and Adaptive TCR. B.J.O. is an inventor on patent PCT/US2009/30620: mutations in contactin associated protein 2 are associated with increased risk for idiopathic autism.

Corresponding authors

Correspondence to Jay Shendure or Evan E. Eichler.

Supplementary information

PDF files

  1. 1.

    Supplementary Information

    This file contains Supplementary Discussion; Supplementary Figures 1–13; Supplementary Tables 2, 4, 6-13; and Supplementary References.

Excel files

  1. 1.

    Supplementary Tables

    This file contains Supplementary Tables 1, 3 and 5 which give detailed information on exome capture, sequence coverage, paternal age, de novo mutation sites, and functional annotations.

About this article

Publication history






Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.