Abstract
Discovery of most autosomal recessive disease-associated genes has involved analysis of large, often consanguineous multiplex families or small cohorts of unrelated individuals with a well-defined clinical condition. Discovery of new dominant causes of rare, genetically heterogeneous developmental disorders has been revolutionized by exome analysis of large cohorts of phenotypically diverse parent-offspring trios1,2. Here we analyzed 4,125 families with diverse, rare and genetically heterogeneous developmental disorders and identified four new autosomal recessive disorders. These four disorders were identified by integrating Mendelian filtering (selecting probands with rare, biallelic and putatively damaging variants in the same gene) with statistical assessments of (i) the likelihood of sampling the observed genotypes from the general population and (ii) the phenotypic similarity of patients with recessive variants in the same candidate gene. This new paradigm promises to catalyze the discovery of novel recessive disorders, especially those with less consistent or nonspecific clinical presentations and those caused predominantly by compound heterozygous genotypes.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Deciphering Developmental Disorders Study. Large-scale discovery of novel genetic causes of developmental disorders. Nature 519, 223–228 (2015).
De Rubeis, S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215 (2014).
Wright, C.F. et al. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data. Lancet 385, 1305–1314 (2015).
Najmabadi, H. et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature 478, 57–63 (2011).
Köhler, S. et al. The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. Nucleic Acids Res. 42, D966–D974 (2014).
Zemojtel, T. et al. Effective diagnosis of genetic disease by computational phenotype analysis of the disease-associated genome. Sci. Transl. Med. 6, 252ra123 (2014).
Alazami, A.M. et al. Loss of function mutation in LARP7, chaperone of 7SK ncRNA, causes a syndrome of facial dysmorphism, intellectual disability, and primordial dwarfism. Hum. Mutat. 33, 1429–1434 (2012).
Akawi, N.A., Al-Jasmi, F., Al-Shamsi, A.M., Ali, B.R. & Al-Gazali, L. LINS, a modulator of the WNT signaling pathway, is involved in human cognition. Orphanet J. Rare Dis. 8, 87 (2013).
Kvarnung, M. et al. A novel intellectual disability syndrome caused by GPI anchor deficiency due to homozygous mutations in PIGT. J. Med. Genet. 50, 521–528 (2013).
Nakashima, M. et al. Novel compound heterozygous PIGT mutations caused multiple congenital anomalies–hypotonia-seizures syndrome 3. Neurogenetics 15, 193–200 (2014).
Shinwari, J.M. et al. Recessive mutations in COL25A1 are a cause of congenital cranial dysinnervation disorder. Am. J. Hum. Genet. 96, 147–152 (2015).
Faletra, F. et al. Autosomal recessive Stickler syndrome due to a loss of function mutation in the COL9A3 gene. Am. J. Med. Genet. A. 164A, 42–47 (2014).
de Vries, B.B., Pals, G., Odink, R. & Hamel, B.C. Homozygosity for a FBN1 missense mutation: clinical and molecular evidence for recessive Marfan syndrome. Eur. J. Hum. Genet. 15, 930–935 (2007).
Van Dijk, F.S. et al. Compound-heterozygous Marfan syndrome. Eur. J. Med. Genet. 52, 1–5 (2009).
Davey, M.G. et al. The chicken talpid3 gene encodes a novel protein essential for Hedgehog signaling. Genes Dev. 20, 1365–1377 (2006).
Szymanska, K., Hartill, V.L. & Johnson, C.A. Unraveling the genetics of Joubert and Meckel-Gruber syndromes. J. Pediatr. Genet. 3, 65–78 (2014).
Bangs, F. et al. Generation of mice with functional inactivation of talpid3, a gene first identified in chicken. Development 138, 3261–3272 (2011).
Scheffner, M. & Kumar, S. Mammalian HECT ubiquitin-protein ligases: biological and pathophysiological aspects. Biochim. Biophys. Acta 1843, 61–74 (2014).
Brown, S.D. & Moore, M.W. The International Mouse Phenotyping Consortium. past and future perspectives on mouse phenotyping. Mamm. Genome 23, 632–640 (2012).
Guimier, A. et al. MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates. Nat. Genet. doi:10.1038/ng.3376 (5 October 2015).
Ahokas, K. et al. Matrix metalloproteinase-21, the human orthologue for XMMP, is expressed during fetal development and in cancer. Gene 301, 31–41 (2002).
Feng, Y. et al. Mammalian protein arginine methyltransferase 7 (PRMT7) specifically targets RXR sites in lysine- and arginine-rich regions. J. Biol. Chem. 288, 37010–37025 (2013).
Leroy, C. et al. The 2q37-deletion syndrome: an update of the clinical spectrum including overweight, brachydactyly and behavioural features in 14 new patients. Eur. J. Hum. Genet. 21, 602–612 (2013).
Williams, S.R. et al. Haploinsufficiency of HDAC4 causes brachydactyly mental retardation syndrome, with brachydactyly type E, developmental delays, and behavioral problems. Am. J. Hum. Genet. 87, 219–228 (2010).
Musante, L. & Ropers, H.H. Genetics of recessive cognitive disorders. Trends Genet. 30, 32–39 (2014).
Smedley, D. et al. PhenoDigm: analyzing curated annotations to associate animal models with human diseases. Database (Oxford) 2013, bat025 (2013).
MacArthur, D.G. et al. Guidelines for investigating causality of sequence variants in human disease. Nature 508, 469–476 (2014).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
McLaren, W. et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–2070 (2010).
Lim, L.P. & Burge, C.B. A computational analysis of sequence features involved in recognition of short introns. Proc. Natl. Acad. Sci. USA 98, 11193–11198 (2001).
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Pettersen, E.F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Li, Y. et al. Global genetic analysis in mice unveils central role for cilia in congenital heart disease. Nature 521, 520–524 (2015).
Liu, X. et al. Interrogating congenital heart defects with noninvasive fetal echocardiography in a mouse forward genetic screen. Circ Cardiovasc Imaging 7, 31–42 (2014).
Kim, A.J. et al. Microcomputed tomography provides high accuracy congenital heart disease diagnosis in neonatal and fetal mice. Circ Cardiovasc Imaging 6, 551–559 (2013).
Skarnes, W.C. et al. A conditional knockout resource for the genome-wide study of mouse gene function. Nature 474, 337–342 (2011).
Ryder, E. et al. Molecular characterization of mutant mouse strains generated from the EUCOMM/KOMP-CSD ES cell resource. Mamm. Genome 24, 286–294 (2013).
White, J.K. et al. Genome-wide generation and systematic phenotyping of knockout mice reveals new roles for many genes. Cell 154, 452–464 (2013).
Karp, N.A., Melvin, D. & Mott, R.F. Robust and sensitive analysis of mouse knockout phenotypes. PLoS ONE 7, e52410 (2012).
Acknowledgements
We thank the families for their participation and patience. We are grateful to the Exome Aggregation Consortium for making their data available. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund (grant HICF-1009-003), a parallel funding partnership between the Wellcome Trust and the UK Department of Health, and the Wellcome Trust Sanger Institute (grant WT098051). The views expressed in this publication are those of the author(s) and not necessarily those of the Wellcome Trust or the UK Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South Research Ethics Committee and GEN/284/12, granted by the Republic of Ireland Research Ethics Committee). The research team acknowledges the support of the National Institutes for Health Research, through the Comprehensive Clinical Research Network. The authors wish to thank the Sanger Mouse Genetics Project for generating and providing mouse phenotyping information, N. Karp for statistical input on the mouse data and V. Narasimhan for making the bcftools roh algorithm available. D.R.F. is funded through an MRC Human Genetics Unit program grant to the University of Edinburgh. Work on the Mmp21-mutant mouse models was supported by US National Institutes of Health grant U01-HL098180 to C.W.L. V.P. was funded by a fellowship from the DFG German Research Foundation.
Author information
Authors and Affiliations
Consortia
Contributions
N.A., J.M., S.C., T.W.F., W.D.J., D.K., J.L., A. Sifrim, J.C.B., D.R.F. and M.E.H. developed analytical methods and/or analyzed human genotype and phenotype data. M. Blyth, A.F.B., M. Balasubramanian, T.C., C.D., N.F., J.G., E.H., S.J., A.K., M.L., M.O'R., D.O., E.R., A. Smith, P.T. and J.W. phenotyped patients. R.F., G.G., S.S.G., N.K., C.L., V.P. and C.W.L. generated and analyzed model organism data. M.A., D.M., E.P. and D.R. performed validation experiments. G.J.S. performed protein structure analysis. C.F.W., H.V.F., J.C.B., D.R.F. and M.E.H. supervised the experimental and analytical work. M.E.H., D.R.F., N.A., J.M. and C.W.L. wrote the manuscript. D.R.F. and M.E.H. jointly supervised the project.
Corresponding author
Ethics declarations
Competing interests
M.E.H. is a consultant for and shareholder in Congenica, Ltd, which provides genetic diagnostic services.
Additional information
A full list of members appears in the Supplementary Note.
Integrated supplementary information
Supplementary Figure 1 Singleton ratio for last G in exon compared to LOF and missense variants.
Comparing the proportion of variants belonging to different functional classes that are singletons among the DDD parents.
Supplementary Figure 2 Comparison of cumulative allele frequencies for loss-of-function and functional variants in DDD versus ExAC.
The cumulative allele frequencies per gene obtained from unaffected DDD parents of European ancestry are on the x axis (log scaled), and the cumulative allele frequencies from the NFE subset of the ExAC data set are on the y axis (log scaled). The dashed line identifies equivalence for DDD and ExAC derived cumulative allele frequencies.
Supplementary Figure 3 Rates of autozygosity per gene within the probands
The distribution of the proportion of probands autozygous across a given gene is shown. The grey vertical lines show the autozygosity proportions for the genes with P < 5 × 10-4, which fall in the bulk of distribution. The X-axis uses a logarithmic scale.
Supplementary Figure 4 QQ plot from testing genes for enrichment of families with rare biallelic synonymous variants.
Comparison of QQ plots from four different tests of genotype enrichment for rare, biallelicly inherited synonymous genotypes. Black dots show the QQ plot from testing for enrichment using cumulative allele frequencies from the NFE subset of ExAC (shaded black), without correcting for population structure or autozygosity. Red dots show the QQ plot after correcting for population structure within the DDD probands by considering all possible combinations of counts among the four ancestral continental populations defined in ExAC. Blue dots show a QQ plot after correcting for gene-specific autozygosity in the probands. Gray dots show a QQ plot after correcting for both population structure and autozygosity; this plot is the closest match to the null expectation (shown as a dashed red line).
Supplementary Figure 5 QQ plot from testing the similarity of Human Phenotype Ontology terms among probands.
Comparison of QQ plots from testing the similarity of HPO terms within a test data set. The figure includes QQ plots from testing probands in genes with recurrent de novo variants (standard) and a distribution from testing the same genes but with randomly sampled individuals (permuted). The permuted genes matched proband numbers to the number of recurrent de novo variants in each gene. The minimum P value for these tests is limited to 1 × 10−5, which is determined by the number of iterations (100,000) used to test the similarity of HPO terms.
Supplementary Figure 6 Protein modeling of missense variants in PRMT7.
(a) The overall structure of the human PRMT7 model was based on mouse Prmt7 (PDB ID 4C4A). The protein contains two catalytic modules (shown by a horizontal line), each containing an AdoMet-binding domain (orange), a C-terminal β barrel (red) and a dimerization domain (blue). Residues that are mutated in DDD patients are shown in magenta. The donor homolog S-adenosyl homocysteine (SAH) was bound to the mouse structure and is also shown here. (b) The predicted consequences of the missense mutations detected in this study on PRMT7 protein structure and function. Arg32Thr. Arg32 sits at the entrance of the N-terminal catalytic module (orange and red) but mainly interacts with residues from the C-terminal catalytic module (blue, raspberry and olive). R32T mutation results in the loss of a hydrogen bond and changes the nature of the donor-binding pocket. Arg387Gly. E478Q-mutated proteins have less than 0.1% the activity of the native protein (Acta Crystallogr. D Biol. Crystallogr. 70, 2401–2412, 2014). This is attributed to loss of the critical hydrogen bonds that this residue makes with Arg387 and Arg378. Mutation R387G will result in the loss of these hydrogen bonds, which will likely affect the activity of the protein. Trp494Arg. Trp494 sits on an α helix that forms part of the predicted donor-binding site of the C-terminal catalytic module surrounded by hydrophobic residues. The W494R mutation is predicted to cause conformational change around this region and lead to changes in the donor-binding pocket.
Supplementary Figure 7 Skeletal phenotyping of 10-d-old PRMT7 loss-of-function mice: reduced body size, bone defects and reduced digit length.
Prmt7tm1a/tm1a mice have severely reduced body size (a,b,h) and weight (g) compared to control littermates. Prmt7tm1a/wt mice are not affected and resemble wild-type littermates (data not shown). Skeletal staining of Prmt7wt/wt and Prmt7tm1a/tm1a mice was examined at P10 for full skeleton (b), hind paws (c), forelimbs (d), skull (e) and rib cage (f). Mice display growth retardation (b–f) and brachydactyly of the fifth metatarsal bone (c,i). The null mice present a duplication of the first rib (f; 100% of Prmt7tm1a/tm1a mice at P10, n = 5). Although all digit lengths were reduced in the mutant mice, when normalized to overall body size reduction (using radius length as a proxy), only the fifth metatarsal shows statistically significant disproportional reduction in length compared to controls (i). Metacarpal bones normalized to radius were not significantly reduced in length in Prmt7tm1a/tm1a mice at this stage (P10; data not shown). Scale bars in b–f indicate 5 mm.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–7, Supplementary Tables 1, 2 and 4, and Supplementary Note. (PDF 1748 kb)
Supplementary Table 3
Clinical data for four novel recessive genes. (XLSX 65 kb)
Ciliary motility in the Mmp21 Miri mutant.
Videomicroscopy of the embryonic node from an Mmp21 Miri mutant shows robust ciliary motility and leftward fluid flow similar to that seen in the embryonic node of a wild-type littermate control. Flow videos are shown at 200% the speed of real time to facilitate the visualization of bead movement, while cilia motion videos are shown at 15% the speed of real time to allow for better visualization of nodal ciliary motion. (MOV 2845 kb)
Rights and permissions
About this article
Cite this article
Akawi, N., McRae, J., Ansari, M. et al. Discovery of four recessive developmental disorders using probabilistic genotype and phenotype matching among 4,125 families. Nat Genet 47, 1363–1369 (2015). https://doi.org/10.1038/ng.3410
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/ng.3410
This article is cited by
-
Short stature in PRMT7 Mutations: first evidence of response to growth hormone treatment
European Journal of Human Genetics (2023)
-
De novo disruptive heterozygous MMP21 variants are potential predisposing genetic risk factors in Chinese Han heterotaxy children
Human Genomics (2022)
-
PRMT7 ablation in cardiomyocytes causes cardiac hypertrophy and fibrosis through β-catenin dysregulation
Cellular and Molecular Life Sciences (2022)
-
Discovery of a genetic module essential for assigning left–right asymmetry in humans and ancestral vertebrates
Nature Genetics (2022)
-
A resource of targeted mutant mouse lines for 5,061 genes
Nature Genetics (2021)