Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Letter
  • Published:

Discovery of four recessive developmental disorders using probabilistic genotype and phenotype matching among 4,125 families

Abstract

Discovery of most autosomal recessive disease-associated genes has involved analysis of large, often consanguineous multiplex families or small cohorts of unrelated individuals with a well-defined clinical condition. Discovery of new dominant causes of rare, genetically heterogeneous developmental disorders has been revolutionized by exome analysis of large cohorts of phenotypically diverse parent-offspring trios1,2. Here we analyzed 4,125 families with diverse, rare and genetically heterogeneous developmental disorders and identified four new autosomal recessive disorders. These four disorders were identified by integrating Mendelian filtering (selecting probands with rare, biallelic and putatively damaging variants in the same gene) with statistical assessments of (i) the likelihood of sampling the observed genotypes from the general population and (ii) the phenotypic similarity of patients with recessive variants in the same candidate gene. This new paradigm promises to catalyze the discovery of novel recessive disorders, especially those with less consistent or nonspecific clinical presentations and those caused predominantly by compound heterozygous genotypes.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1
Figure 2: Clinical and neuroradiological features associated with biallelic variants in KIAA0586.
Figure 3: Clinical and neuroradiological features associated with biallelic variants in HACE1.
Figure 4: Features associated with biallelic variants in MMP21 in humans and Mmp21 in mice.
Figure 5: Features associated with biallelic variants in PRMT7 in humans and Prmt7 in mice.

Similar content being viewed by others

Accession codes

Accessions

Protein Data Bank

References

  1. Deciphering Developmental Disorders Study. Large-scale discovery of novel genetic causes of developmental disorders. Nature 519, 223–228 (2015).

  2. De Rubeis, S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215 (2014).

    Article  CAS  Google Scholar 

  3. Wright, C.F. et al. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data. Lancet 385, 1305–1314 (2015).

    Article  Google Scholar 

  4. Najmabadi, H. et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature 478, 57–63 (2011).

    Article  CAS  Google Scholar 

  5. Köhler, S. et al. The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. Nucleic Acids Res. 42, D966–D974 (2014).

    Article  Google Scholar 

  6. Zemojtel, T. et al. Effective diagnosis of genetic disease by computational phenotype analysis of the disease-associated genome. Sci. Transl. Med. 6, 252ra123 (2014).

    Article  Google Scholar 

  7. Alazami, A.M. et al. Loss of function mutation in LARP7, chaperone of 7SK ncRNA, causes a syndrome of facial dysmorphism, intellectual disability, and primordial dwarfism. Hum. Mutat. 33, 1429–1434 (2012).

    Article  CAS  Google Scholar 

  8. Akawi, N.A., Al-Jasmi, F., Al-Shamsi, A.M., Ali, B.R. & Al-Gazali, L. LINS, a modulator of the WNT signaling pathway, is involved in human cognition. Orphanet J. Rare Dis. 8, 87 (2013).

    Article  Google Scholar 

  9. Kvarnung, M. et al. A novel intellectual disability syndrome caused by GPI anchor deficiency due to homozygous mutations in PIGT. J. Med. Genet. 50, 521–528 (2013).

    Article  CAS  Google Scholar 

  10. Nakashima, M. et al. Novel compound heterozygous PIGT mutations caused multiple congenital anomalies–hypotonia-seizures syndrome 3. Neurogenetics 15, 193–200 (2014).

    Article  Google Scholar 

  11. Shinwari, J.M. et al. Recessive mutations in COL25A1 are a cause of congenital cranial dysinnervation disorder. Am. J. Hum. Genet. 96, 147–152 (2015).

    Article  CAS  Google Scholar 

  12. Faletra, F. et al. Autosomal recessive Stickler syndrome due to a loss of function mutation in the COL9A3 gene. Am. J. Med. Genet. A. 164A, 42–47 (2014).

    Article  Google Scholar 

  13. de Vries, B.B., Pals, G., Odink, R. & Hamel, B.C. Homozygosity for a FBN1 missense mutation: clinical and molecular evidence for recessive Marfan syndrome. Eur. J. Hum. Genet. 15, 930–935 (2007).

    Article  CAS  Google Scholar 

  14. Van Dijk, F.S. et al. Compound-heterozygous Marfan syndrome. Eur. J. Med. Genet. 52, 1–5 (2009).

    Article  CAS  Google Scholar 

  15. Davey, M.G. et al. The chicken talpid3 gene encodes a novel protein essential for Hedgehog signaling. Genes Dev. 20, 1365–1377 (2006).

    Article  CAS  Google Scholar 

  16. Szymanska, K., Hartill, V.L. & Johnson, C.A. Unraveling the genetics of Joubert and Meckel-Gruber syndromes. J. Pediatr. Genet. 3, 65–78 (2014).

    Article  CAS  Google Scholar 

  17. Bangs, F. et al. Generation of mice with functional inactivation of talpid3, a gene first identified in chicken. Development 138, 3261–3272 (2011).

    Article  CAS  Google Scholar 

  18. Scheffner, M. & Kumar, S. Mammalian HECT ubiquitin-protein ligases: biological and pathophysiological aspects. Biochim. Biophys. Acta 1843, 61–74 (2014).

    Article  CAS  Google Scholar 

  19. Brown, S.D. & Moore, M.W. The International Mouse Phenotyping Consortium. past and future perspectives on mouse phenotyping. Mamm. Genome 23, 632–640 (2012).

    Article  CAS  Google Scholar 

  20. Guimier, A. et al. MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates. Nat. Genet. doi:10.1038/ng.3376 (5 October 2015).

    Article  CAS  Google Scholar 

  21. Ahokas, K. et al. Matrix metalloproteinase-21, the human orthologue for XMMP, is expressed during fetal development and in cancer. Gene 301, 31–41 (2002).

    Article  CAS  Google Scholar 

  22. Feng, Y. et al. Mammalian protein arginine methyltransferase 7 (PRMT7) specifically targets RXR sites in lysine- and arginine-rich regions. J. Biol. Chem. 288, 37010–37025 (2013).

    Article  CAS  Google Scholar 

  23. Leroy, C. et al. The 2q37-deletion syndrome: an update of the clinical spectrum including overweight, brachydactyly and behavioural features in 14 new patients. Eur. J. Hum. Genet. 21, 602–612 (2013).

    Article  CAS  Google Scholar 

  24. Williams, S.R. et al. Haploinsufficiency of HDAC4 causes brachydactyly mental retardation syndrome, with brachydactyly type E, developmental delays, and behavioral problems. Am. J. Hum. Genet. 87, 219–228 (2010).

    Article  CAS  Google Scholar 

  25. Musante, L. & Ropers, H.H. Genetics of recessive cognitive disorders. Trends Genet. 30, 32–39 (2014).

    Article  CAS  Google Scholar 

  26. Smedley, D. et al. PhenoDigm: analyzing curated annotations to associate animal models with human diseases. Database (Oxford) 2013, bat025 (2013).

    Article  Google Scholar 

  27. MacArthur, D.G. et al. Guidelines for investigating causality of sequence variants in human disease. Nature 508, 469–476 (2014).

    Article  CAS  Google Scholar 

  28. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    Article  CAS  Google Scholar 

  29. DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).

    Article  CAS  Google Scholar 

  30. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

    Article  Google Scholar 

  31. 1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).

  32. McLaren, W. et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–2070 (2010).

    Article  CAS  Google Scholar 

  33. Lim, L.P. & Burge, C.B. A computational analysis of sequence features involved in recognition of short introns. Proc. Natl. Acad. Sci. USA 98, 11193–11198 (2001).

    Article  CAS  Google Scholar 

  34. Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).

    Article  CAS  Google Scholar 

  35. Pettersen, E.F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).

    Article  CAS  Google Scholar 

  36. Li, Y. et al. Global genetic analysis in mice unveils central role for cilia in congenital heart disease. Nature 521, 520–524 (2015).

    Article  CAS  Google Scholar 

  37. Liu, X. et al. Interrogating congenital heart defects with noninvasive fetal echocardiography in a mouse forward genetic screen. Circ Cardiovasc Imaging 7, 31–42 (2014).

    Article  Google Scholar 

  38. Kim, A.J. et al. Microcomputed tomography provides high accuracy congenital heart disease diagnosis in neonatal and fetal mice. Circ Cardiovasc Imaging 6, 551–559 (2013).

    Article  Google Scholar 

  39. Skarnes, W.C. et al. A conditional knockout resource for the genome-wide study of mouse gene function. Nature 474, 337–342 (2011).

    Article  CAS  Google Scholar 

  40. Ryder, E. et al. Molecular characterization of mutant mouse strains generated from the EUCOMM/KOMP-CSD ES cell resource. Mamm. Genome 24, 286–294 (2013).

    Article  CAS  Google Scholar 

  41. White, J.K. et al. Genome-wide generation and systematic phenotyping of knockout mice reveals new roles for many genes. Cell 154, 452–464 (2013).

    Article  CAS  Google Scholar 

  42. Karp, N.A., Melvin, D. & Mott, R.F. Robust and sensitive analysis of mouse knockout phenotypes. PLoS ONE 7, e52410 (2012).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank the families for their participation and patience. We are grateful to the Exome Aggregation Consortium for making their data available. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund (grant HICF-1009-003), a parallel funding partnership between the Wellcome Trust and the UK Department of Health, and the Wellcome Trust Sanger Institute (grant WT098051). The views expressed in this publication are those of the author(s) and not necessarily those of the Wellcome Trust or the UK Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South Research Ethics Committee and GEN/284/12, granted by the Republic of Ireland Research Ethics Committee). The research team acknowledges the support of the National Institutes for Health Research, through the Comprehensive Clinical Research Network. The authors wish to thank the Sanger Mouse Genetics Project for generating and providing mouse phenotyping information, N. Karp for statistical input on the mouse data and V. Narasimhan for making the bcftools roh algorithm available. D.R.F. is funded through an MRC Human Genetics Unit program grant to the University of Edinburgh. Work on the Mmp21-mutant mouse models was supported by US National Institutes of Health grant U01-HL098180 to C.W.L. V.P. was funded by a fellowship from the DFG German Research Foundation.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

N.A., J.M., S.C., T.W.F., W.D.J., D.K., J.L., A. Sifrim, J.C.B., D.R.F. and M.E.H. developed analytical methods and/or analyzed human genotype and phenotype data. M. Blyth, A.F.B., M. Balasubramanian, T.C., C.D., N.F., J.G., E.H., S.J., A.K., M.L., M.O'R., D.O., E.R., A. Smith, P.T. and J.W. phenotyped patients. R.F., G.G., S.S.G., N.K., C.L., V.P. and C.W.L. generated and analyzed model organism data. M.A., D.M., E.P. and D.R. performed validation experiments. G.J.S. performed protein structure analysis. C.F.W., H.V.F., J.C.B., D.R.F. and M.E.H. supervised the experimental and analytical work. M.E.H., D.R.F., N.A., J.M. and C.W.L. wrote the manuscript. D.R.F. and M.E.H. jointly supervised the project.

Corresponding author

Correspondence to Matthew E Hurles.

Ethics declarations

Competing interests

M.E.H. is a consultant for and shareholder in Congenica, Ltd, which provides genetic diagnostic services.

Additional information

A full list of members appears in the Supplementary Note.

Integrated supplementary information

Supplementary Figure 1 Singleton ratio for last G in exon compared to LOF and missense variants.

Comparing the proportion of variants belonging to different functional classes that are singletons among the DDD parents.

Supplementary Figure 2 Comparison of cumulative allele frequencies for loss-of-function and functional variants in DDD versus ExAC.

The cumulative allele frequencies per gene obtained from unaffected DDD parents of European ancestry are on the x axis (log scaled), and the cumulative allele frequencies from the NFE subset of the ExAC data set are on the y axis (log scaled). The dashed line identifies equivalence for DDD and ExAC derived cumulative allele frequencies.

Supplementary Figure 3 Rates of autozygosity per gene within the probands

The distribution of the proportion of probands autozygous across a given gene is shown. The grey vertical lines show the autozygosity proportions for the genes with P < 5 × 10-4, which fall in the bulk of distribution. The X-axis uses a logarithmic scale.

Supplementary Figure 4 QQ plot from testing genes for enrichment of families with rare biallelic synonymous variants.

Comparison of QQ plots from four different tests of genotype enrichment for rare, biallelicly inherited synonymous genotypes. Black dots show the QQ plot from testing for enrichment using cumulative allele frequencies from the NFE subset of ExAC (shaded black), without correcting for population structure or autozygosity. Red dots show the QQ plot after correcting for population structure within the DDD probands by considering all possible combinations of counts among the four ancestral continental populations defined in ExAC. Blue dots show a QQ plot after correcting for gene-specific autozygosity in the probands. Gray dots show a QQ plot after correcting for both population structure and autozygosity; this plot is the closest match to the null expectation (shown as a dashed red line).

Supplementary Figure 5 QQ plot from testing the similarity of Human Phenotype Ontology terms among probands.

Comparison of QQ plots from testing the similarity of HPO terms within a test data set. The figure includes QQ plots from testing probands in genes with recurrent de novo variants (standard) and a distribution from testing the same genes but with randomly sampled individuals (permuted). The permuted genes matched proband numbers to the number of recurrent de novo variants in each gene. The minimum P value for these tests is limited to 1 × 10−5, which is determined by the number of iterations (100,000) used to test the similarity of HPO terms.

Supplementary Figure 6 Protein modeling of missense variants in PRMT7.

(a) The overall structure of the human PRMT7 model was based on mouse Prmt7 (PDB ID 4C4A). The protein contains two catalytic modules (shown by a horizontal line), each containing an AdoMet-binding domain (orange), a C-terminal β barrel (red) and a dimerization domain (blue). Residues that are mutated in DDD patients are shown in magenta. The donor homolog S-adenosyl homocysteine (SAH) was bound to the mouse structure and is also shown here. (b) The predicted consequences of the missense mutations detected in this study on PRMT7 protein structure and function. Arg32Thr. Arg32 sits at the entrance of the N-terminal catalytic module (orange and red) but mainly interacts with residues from the C-terminal catalytic module (blue, raspberry and olive). R32T mutation results in the loss of a hydrogen bond and changes the nature of the donor-binding pocket. Arg387Gly. E478Q-mutated proteins have less than 0.1% the activity of the native protein (Acta Crystallogr. D Biol. Crystallogr. 70, 2401–2412, 2014). This is attributed to loss of the critical hydrogen bonds that this residue makes with Arg387 and Arg378. Mutation R387G will result in the loss of these hydrogen bonds, which will likely affect the activity of the protein. Trp494Arg. Trp494 sits on an α helix that forms part of the predicted donor-binding site of the C-terminal catalytic module surrounded by hydrophobic residues. The W494R mutation is predicted to cause conformational change around this region and lead to changes in the donor-binding pocket.

Supplementary Figure 7 Skeletal phenotyping of 10-d-old PRMT7 loss-of-function mice: reduced body size, bone defects and reduced digit length.

Prmt7tm1a/tm1a mice have severely reduced body size (a,b,h) and weight (g) compared to control littermates. Prmt7tm1a/wt mice are not affected and resemble wild-type littermates (data not shown). Skeletal staining of Prmt7wt/wt and Prmt7tm1a/tm1a mice was examined at P10 for full skeleton (b), hind paws (c), forelimbs (d), skull (e) and rib cage (f). Mice display growth retardation (bf) and brachydactyly of the fifth metatarsal bone (c,i). The null mice present a duplication of the first rib (f; 100% of Prmt7tm1a/tm1a mice at P10, n = 5). Although all digit lengths were reduced in the mutant mice, when normalized to overall body size reduction (using radius length as a proxy), only the fifth metatarsal shows statistically significant disproportional reduction in length compared to controls (i). Metacarpal bones normalized to radius were not significantly reduced in length in Prmt7tm1a/tm1a mice at this stage (P10; data not shown). Scale bars in bf indicate 5 mm.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–7, Supplementary Tables 1, 2 and 4, and Supplementary Note. (PDF 1748 kb)

Supplementary Table 3

Clinical data for four novel recessive genes. (XLSX 65 kb)

Ciliary motility in the Mmp21 Miri mutant.

Videomicroscopy of the embryonic node from an Mmp21 Miri mutant shows robust ciliary motility and leftward fluid flow similar to that seen in the embryonic node of a wild-type littermate control. Flow videos are shown at 200% the speed of real time to facilitate the visualization of bead movement, while cilia motion videos are shown at 15% the speed of real time to allow for better visualization of nodal ciliary motion. (MOV 2845 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Akawi, N., McRae, J., Ansari, M. et al. Discovery of four recessive developmental disorders using probabilistic genotype and phenotype matching among 4,125 families. Nat Genet 47, 1363–1369 (2015). https://doi.org/10.1038/ng.3410

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/ng.3410

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing