Abstract
Spontaneously arising (de novo) mutations have an important role in medical genetics. For diseases with extensive locus heterogeneity, such as autism spectrum disorders (ASDs), the signal from de novo mutations is distributed across many genes, making it difficult to distinguish disease-relevant mutations from background variation. Here we provide a statistical framework for the analysis of excesses in de novo mutation per gene and gene set by calibrating a model of de novo mutation. We applied this framework to de novo mutations collected from 1,078 ASD family trios, and, whereas we affirmed a significant role for loss-of-function mutations, we found no excess of de novo loss-of-function mutations in cases with IQ above 100, suggesting that the role of de novo mutations in ASDs might reside in fundamental neurodevelopmental processes. We also used our model to identify ∼1,000 genes that are significantly lacking in functional coding variation in non-ASD samples and are enriched for de novo loss-of-function mutations identified in ASD cases.
Access options
Subscribe to Journal
Get full journal access for 1 year
$225.00
only $18.75 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
from$8.99
All prices are NET prices.
References
- 1.
Ng, S.B. et al. Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat. Genet. 42, 790–793 (2010).
- 2.
Iossifov, I. et al. De novo gene disruptions in children on the autistic spectrum. Neuron 74, 285–299 (2012).
- 3.
Neale, B.M. et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485, 242–245 (2012).
- 4.
O'Roak, B.J. et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485, 246–250 (2012).
- 5.
Sanders, S.J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485, 237–241 (2012).
- 6.
O'Roak, B.J. et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nat. Genet. 43, 585–589 (2011).
- 7.
Antonarakis, S.E. CpG dinucleotides and human disorders. in Encyclopedia of Life Sciences (John Wiley & Sons, Chichester, UK, 2006).
- 8.
Petrovski, S., Wang, Q., Heinzen, E.L., Allen, A.S. & Goldstein, D.B. Genic intolerance to functional variation and the interpretation of personal genomes. PLoS Genet. 9, e1003709 (2013).
- 9.
de Ligt, J. et al. Diagnostic exome sequencing in persons with severe intellectual disability. N. Engl. J. Med. 367, 1921–1929 (2012).
- 10.
Rauch, A. et al. Range of genetic mutations associated with severe non-syndromic sporadic intellectual disability: an exome sequencing study. Lancet 380, 1674–1682 (2012).
- 11.
O'Roak, B.J. et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338, 1619–1622 (2012).
- 12.
Betancur, C. Etiological heterogeneity in autism spectrum disorders: more than 100 genetic and genomic disorders and still counting. Brain Res. 1380, 42–77 (2011).
- 13.
Darnell, J.C. et al. FMRP stalls ribosomal translocation on mRNAs linked to synaptic function and autism. Cell 146, 247–261 (2011).
- 14.
Sanders, S.J. et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron 70, 863–885 (2011).
- 15.
Xu, B. et al. De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia. Nat. Genet. 44, 1365–1369 (2012).
- 16.
Bustamante, C.D. et al. Natural selection on protein-coding genes in the human genome. Nature 437, 1153–1157 (2005).
- 17.
McDonald, J.H. & Kreitman, M. Adaptive protein evolution at the Adh locus in Drosophila. Nature 351, 652–654 (1991).
- 18.
Epi4K Consortium & Epilepsy Phenome/Genome Project. De novo mutations in epileptic encephalopathies. Nature 501, 217–221 (2013).
- 19.
DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
- 20.
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
- 21.
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
- 22.
Koren, A. et al. Differential relationship of DNA replication timing to different forms of human mutation and variation. Am. J. Hum. Genet. 91, 1033–1040 (2012).
- 23.
Kryukov, G.V., Pennacchio, L.A. & Sunyaev, S.R. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am. J. Hum. Genet. 80, 727–739 (2007).
- 24.
Campbell, C.D. et al. Estimating the human mutation rate using autozygosity in a founder population. Nat. Genet. 44, 1277–1281 (2012).
- 25.
Conrad, D.F. et al. Variation in genome-wide mutation rates within and between human families. Nat. Genet. 43, 712–714 (2011).
Acknowledgements
All data from published studies are available in the respective publications. All newly generated data and computational tools used in this paper will be available online as downloadable material. We have also constructed a website to query genes that provides information on constraint and the de novo mutations found in the specified gene across published studies of de novo mutation. We would like to thank E. Daly and M. Chess for their contributions to data analysis and the construction of the website, respectively. We acknowledge the following resources and families who contributed to them: the National Institute of Mental Health (NIMH) repository (U24MH068457); the Autism Genetic Resource Exchange (AGRE) Consortium, a program of Autism Speaks (1U24MH081810 to C.M. Lajonchere); The Autism Simplex Collection (TASC) (grant from Autism Speaks); the Simons Foundation Autism Research Initiative (SFARI) Simplex Collection (grant from the Simons Foundation); and The Autism Consortium (grant from the Autism Consortium). This work was directly supported by US National Institutes of Health (NIH) grants R01MH089208 (M.J.D.), R01MH089025 (J.D.B.), R01MH089004 (G.D.S.), R01MH089175 (R.A.G.) and R01MH089482 (J.S.S.) and was supported in part by US NIH grants P50HD055751 (E.H.C.), R01MH057881 (B.D.) and R01MH061009 (J.S.S.). We acknowledge partial support from grants U54HG003273 (R.A.G.) and U54HG003067 (E. Lander). We thank T. Lehner (NIMH), A. Felsenfeld (National Human Genome Research Institute) and P. Bender (NIMH) for their support and contribution to the project. E.B., J.D.B., B.D., M.J.D., R.A.G., K. Roeder, A.S., G.D.S. and J.S.S. are lead investigators in the ARRA Autism Sequencing Collaboration (AASC). We would also like to thank the NHLBI GO Exome Sequencing Project (ESP) and its ongoing studies that produced and provided exome variant calls on the web: the Lung GO Sequencing Project (HL-102923), the Women's Health Initiative (WHI) Sequencing Project (HL-102924), the Broad GO Sequencing Project (HL-102925), the Seattle GO Sequencing Project (HL-102926) and the Heart GO Sequencing Project (HL-103010).
Author information
Affiliations
Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA.
- Kaitlin E Samocha
- , Elise B Robinson
- , Jack A Kosmicki
- , Andrew Kirby
- , Daniel G MacArthur
- , Shaun M Purcell
- , Benjamin M Neale
- & Mark J Daly
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.
- Kaitlin E Samocha
- , Elise B Robinson
- , Christine Stevens
- , Andrew Kirby
- , Daniel G MacArthur
- , Stacey B Gabriel
- , Shaun M Purcell
- , Benjamin M Neale
- & Mark J Daly
Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA.
- Kaitlin E Samocha
- , Elise B Robinson
- , Christine Stevens
- , Benjamin M Neale
- & Mark J Daly
Program in Genetics and Genomics, Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts, USA.
- Kaitlin E Samocha
Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, USA.
- Stephan J Sanders
Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA.
- Stephan J Sanders
Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA.
- Aniko Sabo
- , Eric Boerwinkle
- & Richard A Gibbs
Psychiatric and Neurodevelopmental Genetics Unit, Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA.
- Lauren M McGrath
- , Shaun M Purcell
- & Aarno Palotie
Center for Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA.
- Jack A Kosmicki
- & Dennis P Wall
Department of Pathology, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA.
- Jack A Kosmicki
- & Dennis P Wall
Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.
- Karola Rehnström
- & Aarno Palotie
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK.
- Karola Rehnström
- & Aarno Palotie
Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA.
- Swapan Mallick
Synapdx, Lexington, Massachusetts, USA.
- Mark DePristo
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Shaun M Purcell
- & Joseph D Buxbaum
Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Shaun M Purcell
- & Joseph D Buxbaum
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Shaun M Purcell
- & Joseph D Buxbaum
Human Genetics Center, University of Texas Health Science Center at Houston, Houston, Texas, USA.
- Eric Boerwinkle
Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Joseph D Buxbaum
Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Joseph D Buxbaum
Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
- Joseph D Buxbaum
Department of Psychiatry, University of Illinois at Chicago, Chicago, Illinois, USA.
- Edwin H Cook Jr
Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
- Gerard D Schellenberg
Center for Molecular Neuroscience, Vanderbilt University, Nashville, Tennessee, USA.
- James S Sutcliffe
Department of Psychiatry, University of Pittsburgh Medical School, Pittsburgh, Pennsylvania, USA.
- Bernie Devlin
Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.
- Kathryn Roeder
Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.
- Kathryn Roeder
Authors
Search for Kaitlin E Samocha in:
Search for Elise B Robinson in:
Search for Stephan J Sanders in:
Search for Christine Stevens in:
Search for Aniko Sabo in:
Search for Lauren M McGrath in:
Search for Jack A Kosmicki in:
Search for Karola Rehnström in:
Search for Swapan Mallick in:
Search for Andrew Kirby in:
Search for Dennis P Wall in:
Search for Daniel G MacArthur in:
Search for Stacey B Gabriel in:
Search for Mark DePristo in:
Search for Shaun M Purcell in:
Search for Aarno Palotie in:
Search for Eric Boerwinkle in:
Search for Joseph D Buxbaum in:
Search for Edwin H Cook in:
Search for Richard A Gibbs in:
Search for Gerard D Schellenberg in:
Search for James S Sutcliffe in:
Search for Bernie Devlin in:
Search for Kathryn Roeder in:
Search for Benjamin M Neale in:
Search for Mark J Daly in:
Contributions
K.E.S., B.M.N. and M.J.D. conceived and designed the mutational model and constraint methods. K.E.S. and E.B.R. executed the analyses. K.E.S., E.B.R., L.M.M., J.A.K., S.M., A.K., D.P.W., D.G.M., S.M.P., J.D.B., B.D. and K. Roeder contributed to analysis concepts and methods. K.E.S., S.J.S., C.S., A.S., K. Rehnström, S.B.G., M.D., A.P., E.B., J.D.B., E.H.C., R.A.G., G.D.S., J.S.S., B.D., K. Roeder, B.M.N. and M.J.D. contributed autism sequencing, evaluation and manuscript comments. K.E.S., E.B.R., B.M.N. and M.J.D. performed the primary writing.
Competing interests
The authors declare no competing financial interests.
Corresponding author
Correspondence to Mark J Daly.
Integrated supplementary information
Supplementary information
PDF files
- 1.
Supplementary Text and Figures
Supplementary Figures 1–3, Supplementary Tables 3–9 and Supplementary Note.
Excel files
- 1.
Gene-specific probabilities of mutation.
The per-gene probabilities of mutation are listed for each gene (transcript specified) by mutation type. Probabilities of mutation are given per chromosome and have been transformed by log10. “NA” is listed when there is no probability of mutation due usually to low coverage.
- 2.
Top 1,003 constrained genes.
The gene-specific information listed includes transcript and identifier, chromosome, transcription start position, number of coding bases, probabilities of a synonymous and missense mutation (given per chromosome), the number of observed and expected synonymous and missense variants, the signed Z scores for the deviation for both synonymous and missense variants, and the ratio of missing missense variation (“ratio_missing”).
Rights and permissions
To obtain permission to re-use content from this article visit RightsLink.
About this article
Further reading
-
A framework for the investigation of rare genetic disorders in neuropsychiatry
Nature Medicine (2019)
-
Clinically-relevant postzygotic mosaicism in parents and children with developmental disorders in trio exome sequencing data
Nature Communications (2019)
-
MYRF haploinsufficiency causes 46,XY and 46,XX disorders of sex development: bioinformatics consideration
Human Molecular Genetics (2019)
-
Next-Generation Sequencing in Autism Spectrum Disorder
Cold Spring Harbor Perspectives in Medicine (2019)
-
Rare copy number variation in extremely impulsively violent males
Genes, Brain and Behavior (2019)