Inferring compound heterozygosity from large-scale exome sequencing data

Guo, Michael H.; Francioli, Laurent C.; Stenton, Sarah L.; Goodrich, Julia K.; Watts, Nicholas A.; Singer-Berk, Moriel; Groopman, Emily; Darnowsky, Philip W.; Solomonson, Matthew; Baxter, Samantha; Tiao, Grace; Neale, Benjamin M.; Hirschhorn, Joel N.; Rehm, Heidi L.; Daly, Mark J.; O’Donnell-Luria, Anne; Karczewski, Konrad J.; MacArthur, Daniel G.; Samocha, Kaitlin E.

doi:10.1038/s41588-023-01608-3

Article
Published: 06 December 2023

Nature Genetics volume 56, pages 152–161 (2024)

Abstract

Recessive diseases arise when both copies of a gene are impacted by a damaging genetic variant. When a patient carries two potentially causal variants in a gene, accurate diagnosis requires determining that these variants occur on different copies of the chromosome (that is, are in trans) rather than on the same copy (that is, in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. Here we developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in the Genome Aggregation Database (v2, n = 125,748 exomes). Our approach estimates phase with 96% accuracy, both in trio data and in patients with Mendelian conditions and presumed causal compound heterozygous variants. We provide a public resource of phasing estimates for coding variants and counts per gene of rare variants in trans that can aid interpretation of rare co-occurring variants in the context of recessive disease.
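To make the idea of inferring phase from population genotypes concrete, the following is a minimal sketch, not the authors' implementation. It assumes the classical two-locus expectation–maximization (EM) estimate of haplotype frequencies from a 3×3 table of genotype counts, and it assumes that P_trans is the share of the double-heterozygote probability assigned to the *trans* configuration. The function names, the table layout and the example counts are hypothetical.

```python
from typing import Dict, List

# A/B = reference alleles, a/b = alternate alleles at variants 1 and 2.
HAPLOTYPES = ("AB", "Ab", "aB", "ab")


def em_haplotype_freqs(n: List[List[float]],
                       max_iter: int = 1000,
                       tol: float = 1e-12) -> Dict[str, float]:
    """Estimate haplotype frequencies for two biallelic variants.

    n[i][j] is the number of individuals carrying i alternate alleles at
    variant 1 and j alternate alleles at variant 2 (hypothetical layout).
    """
    n_individuals = sum(sum(row) for row in n)
    if n_individuals <= 0:
        raise ValueError("genotype table is empty")

    # Every genotype except the double heterozygote n[1][1] has known phase.
    unambiguous = {
        "AB": 2 * n[0][0] + n[0][1] + n[1][0],
        "Ab": 2 * n[0][2] + n[0][1] + n[1][2],
        "aB": 2 * n[2][0] + n[1][0] + n[2][1],
        "ab": 2 * n[2][2] + n[1][2] + n[2][1],
    }
    double_het = n[1][1]

    freqs = {h: 0.25 for h in HAPLOTYPES}  # uninformative starting point
    for _ in range(max_iter):
        # E-step: split double heterozygotes between cis (AB/ab) and trans (Ab/aB).
        cis_w = freqs["AB"] * freqs["ab"]
        trans_w = freqs["Ab"] * freqs["aB"]
        denom = cis_w + trans_w
        p_cis = cis_w / denom if denom > 0 else 0.5

        counts = dict(unambiguous)
        counts["AB"] += double_het * p_cis
        counts["ab"] += double_het * p_cis
        counts["Ab"] += double_het * (1.0 - p_cis)
        counts["aB"] += double_het * (1.0 - p_cis)

        # M-step: renormalize over the 2N chromosomes.
        new_freqs = {h: counts[h] / (2.0 * n_individuals) for h in HAPLOTYPES}
        delta = max(abs(new_freqs[h] - freqs[h]) for h in HAPLOTYPES)
        freqs = new_freqs
        if delta < tol:
            break
    return freqs


def p_trans(freqs: Dict[str, float]) -> float:
    """Probability that a double heterozygote carries the variants in trans."""
    cis_w = freqs["AB"] * freqs["ab"]
    trans_w = freqs["Ab"] * freqs["aB"]
    denom = cis_w + trans_w
    return trans_w / denom if denom > 0 else float("nan")


if __name__ == "__main__":
    # Hypothetical counts: each variant is seen alone, never together.
    table = [[90, 5, 0], [5, 0, 0], [0, 0, 0]]
    print(p_trans(em_haplotype_freqs(table)))
```

In this sketch, a pair of variants that is only ever observed in separate individuals yields a high P_trans, whereas a pair that is only ever observed together yields a low one. With no informative haplotypes at all, the function returns NaN rather than guessing.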

**Fig. 1: Overview of phasing approach using the expectation–maximization method in gnomAD.**

**Fig. 2: Phasing accuracy as a function of variant AF.**

**Fig. 3: Phasing accuracy using population-specific versus cosmopolitan P_trans estimates.**

**Fig. 4: Phasing accuracy as a function of distance between variant pairs.**

**Fig. 5: Counts of genes with variants in *trans* in gnomAD.**

Data availability

The gnomAD v2 dataset can be accessed at https://gnomad.broadinstitute.org. We made use of prior quality control processing of these and related data. In addition, we downloaded HapMap2 genetic maps from https://github.com/joepickrell/1000-genomes-genetic-maps.

We provide both web-based look-up tools and downloads for the data generated here. A look-up tool to find the likely co-occurrence pattern between two rare (global AF in gnomAD exomes <5%) coding, flanking intronic (from position −1 to −3 in acceptor sites and +1 to +8 in donor sites) or 5′/3′ UTR variants can be found at https://gnomad.broadinstitute.org/variant-cooccurrence

Additionally, we display the per-gene counts tables that provide the details of the number of individuals with two rare variants, stratified by AF and functional consequence, on each gene’s main page. One table provides the details of counts of individuals with two heterozygous variants and includes the predicted phase, while the second table provides the details of individuals with homozygous variants. Both can be found by clicking on the ‘Variant Co-occurrence’ tab on each gene’s main page.

All variant co-occurrence tables can be downloaded from https://gnomad.broadinstitute.org/downloads#v2-variant-cooccurrence

Code availability

The code used to compute P_trans estimates for variant pairs and to determine the number of individuals carrying rare, compound heterozygous variants can be found at https://github.com/broadinstitute/gnomad_chets

The code has also been uploaded to Zenodo (https://doi.org/10.5281/zenodo.10034663).

Acknowledgements

We thank all members of the gnomAD team for helpful comments and suggestions, and we particularly recognize the members of the gnomAD methods and browser teams who worked hard over many years to provide cleaned datasets, easy-to-use browsers and visualizations. This work was supported by the National Human Genome Research Institute (NHGRI; U24HG011450 to H.L.R. and M.J.D.; UM1HG008900 to D.G.M. and H.L.R.; U01HG011755 to A.O.-L. and H.L.R.).

Author information

These authors contributed equally: Michael H. Guo, Laurent C. Francioli.

Authors and Affiliations

Department of Neurology, Hospital of the University of Pennsylvania, Philadelphia, PA, USA
Michael H. Guo
Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Michael H. Guo, Laurent C. Francioli, Sarah L. Stenton, Julia K. Goodrich, Nicholas A. Watts, Moriel Singer-Berk, Emily Groopman, Philip W. Darnowsky, Matthew Solomonson, Samantha Baxter, Jessica Alföldi, Irina M. Armean, Samantha M. Baxter, Sarah E. Calvo, Katherine R. Chao, Sinéad Chapman, Siwei Chen, Ryan L. Collins, Beryl Cummings, Stacey Donnelly, Patrick T. Ellinor, Eleina England, Tõnu Esko, Emily Evangelista, Sanna Gudmundsson, Namrata Gupta, Zan Koenig, Kristen M. Laricchia, Emily Lipscomb, Steven A. Lubitz, Alicia R. Martin, James B. Meigs, Eric V. Minikel, Vamsi K. Mootha, Anne H. O’Donnell-Luria, William Phu, Timothy Poterba, Dan Rhodes, Andrea Saltzman, Jeremiah Scharf, Molly Schleicher, Eleanor Seaby, Rachel G. Son, Christine Stevens, Michael E. Talkowski, Yekaterina Tarasova, Christopher Vittal, Arcturus Wang, Qingbo Wang, James S. Ware, Nicola Whiffin, Michael W. Wilson, Mary T. Yohannes, Grace Tiao, Benjamin M. Neale, Joel N. Hirschhorn, Heidi L. Rehm, Mark J. Daly, Anne O’Donnell-Luria, Konrad J. Karczewski, Daniel G. MacArthur & Kaitlin E. Samocha
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Laurent C. Francioli, Sarah L. Stenton, Julia K. Goodrich, Nicholas A. Watts, Matthew Solomonson, Jessica Alföldi, Irina M. Armean, Sinéad Chapman, Siwei Chen, Sanna Gudmundsson, Kristen M. Laricchia, Anne H. O’Donnell-Luria, Aarno Palotie, Timothy Poterba, Cotton Seed, Christine Stevens, Christopher Vittal, Arcturus Wang, Michael W. Wilson, Grace Tiao, Benjamin M. Neale, Heidi L. Rehm, Mark J. Daly, Anne O’Donnell-Luria, Konrad J. Karczewski, Daniel G. MacArthur & Kaitlin E. Samocha
Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA, USA
Sarah L. Stenton, Emily Groopman, Sanna Gudmundsson, Anne H. O’Donnell-Luria, William Phu & Anne O’Donnell-Luria
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Benjamin M. Neale
The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Benjamin M. Neale & Konrad J. Karczewski
Departments of Genetics and Pediatrics, Harvard Medical School, Boston, MA, USA
Joel N. Hirschhorn
Division of Endocrinology, Boston Children’s Hospital, Boston, MA, USA
Daniel Chasman & Joel N. Hirschhorn
Center for Basic and Translational Obesity Research, Boston Children’s Hospital, Boston, MA, USA
Joel N. Hirschhorn
Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Heidi L. Rehm, Anne O’Donnell-Luria, Konrad J. Karczewski & Kaitlin E. Samocha
Institute for Molecular Medicine Finland (FIMM), Helsinki, Finland
Sinéad Chapman, Steven McCarroll, Aarno Palotie, Timothy Poterba, Jeremiah Scharf, Cotton Seed, Christine Stevens, Michael E. Talkowski, Arcturus Wang & Mark J. Daly
Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, New South Wales, Australia
Daniel G. MacArthur
Centre for Population Genomics, Murdoch Children’s Research Institute, Melbourne, Victoria, Australia
Daniel G. MacArthur
University of Miami Miller School of Medicine, Gastroenterology, Miami, FL, USA
Maria Abreu
Unidad de Investigacion de Enfermedades Metabolicas, Instituto Nacional de Ciencias Medicas y Nutricion, Mexico City, Mexico
Carlos A. Aguilar Salinas
Peninsula College of Medicine and Dentistry, Exeter, UK
Tariq Ahmad, Sarah E. Calvo, Ryan L. Collins, Sekar Kathiresan, Anne H. O’Donnell-Luria, Jeremiah Scharf & Michael E. Talkowski
Division of Preventive Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Christine M. Albert
Division of Cardiovascular Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Christine M. Albert
Department of Cardiology, Parma University Hospital, Parma, Italy
Diego Ardissino
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Irina M. Armean
Department of Biology Faculty of Natural Sciences, University of Haifa, Haifa, Israel
Gil Atzmon
Departments of Medicine and Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
Gil Atzmon
Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Eric Banks, David Benjamin, Louis Bergelson, Kristian Cibulskis, Miguel Covarrubias, James Emery, Yossi Farjoun, Kiran Garimella, Laura D. Gauthier, Jeff Gentry, Andrea Haessly, Thibault Jeandet, Diane Kaplan, Trevyn Langsford, Christopher Llanwarne, Ruchi Munshi, Sam Novod, Nikelle Petrillo, David Roazen, Valentin Ruano-Rubio, Nareh Sahakian, Megan Shand, Jonathan T. Smith, Jose Soto, Kathleen Tibbetts, Charlotte Tolonen, Gordon Wade & Ben Weisburd
Department of Quantitative Health Sciences, Lerner Research Institute Cleveland Clinic, Cleveland, OH, USA
John Barnard
Sorbonne Université, APHP, Gastroenterology Department Saint Antoine Hospital, Paris, France
Laurent Beaugerie
NHLBI and Boston University’s Framingham Heart Study, Framingham, MA, USA
Emelia J. Benjamin
Department of Medicine, Boston University Chobanian and Avedisian School of Medicine, Boston, MA, USA
Emelia J. Benjamin
Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
Emelia J. Benjamin
Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
Michael Boehnke
National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, USA
Lori L. Bonnycastle
The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Erwin P. Bottinger, Judy Cho & Ruth J. F. Loos
Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden
Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden
Center for Diabetes Research, Wake Forest School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden
Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
Matthew J. Bown & Nilesh J. Samani
NIHR Leicester Biomedical Research Centre, Glenfield Hospital, Leicester, UK
Matthew J. Bown & Nilesh J. Samani
Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Steven Brant
Harvard School of Public Health, Boston, MA, USA
Hannia Campos
Central American Population Center, San Pedro, Costa Rica
Hannia Campos
Department of Epidemiology and Biostatistics, Imperial College London, London, UK
John C. Chambers
Department of Cardiology, Ealing Hospital, NHS Trust, Southall, UK
John C. Chambers & Jaspal Kooner
Imperial College Healthcare NHS Trust, Imperial College London, London, UK
John C. Chambers & Jaspal Kooner
Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong, China
Juliana C. Chan & Ronald C. W. Ma
Department of Medicine, Harvard Medical School, Boston, MA, USA
Daniel Chasman, Bruce Cohen, Jose Florez, Gad Getz, Sekar Kathiresan, James B. Meigs & Dost Ongur
Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Rex L. Chisholm
University of Cambridge, Cambridge, UK
Rajiv Chowdhury & John Danesh
Departments of Cardiovascular, Medicine Cellular and Molecular Medicine Molecular Cardiology, Quantitative Health Sciences, Cleveland Clinic, Cleveland, OH, USA
Mina K. Chung
Department of Pediatrics, Columbia University Irving Medical Center, New York City, NY, USA
Wendy K. Chung
Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York City, NY, USA
Wendy K. Chung
Department of Medicine, Columbia University Medical Center, New York City, NY, USA
Wendy K. Chung
McLean Hospital, Belmont, MA, USA
Bruce Cohen & Dost Ongur
Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
Ryan L. Collins & Beryl Cummings
Genomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Kristen M. Connolly
Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
Adolfo Correa
Department of Epidemiology, Colorado School of Public Health, Aurora, CO, USA
Dana Dabelea
Department of Medicine and Pharmacology, University of Illinois at Chicago, Chicago, IL, USA
Dawood Darbar
Vanderbilt University Medical Center, Nashville, TN, USA
Joshua Denny
Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX, USA
Ravindranath Duggirala
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Josée Dupuis
National Heart Lung and Blood Institute’s Framingham Heart Study, Framingham, MA, USA
Josée Dupuis
Cardiac Arrhythmia Service and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
Patrick T. Ellinor & Steven A. Lubitz
Cardiovascular Epidemiology and Genetics Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
Roberto Elosua
CIBER CV, Barcelona, Spain
Roberto Elosua
Department of Medicine, Medical School University of Vic-Central, University of Catalonia, Barcelona, Spain
Roberto Elosua & Jaume Marrugat
Institute for Cardiogenetics, University of Lübeck, Lübeck, Germany
Jeanette Erdmann
German Research Centre for Cardiovascular Research, Hamburg/Lübeck/Kiel, Lübeck, Germany
Jeanette Erdmann
University Heart Center Lübeck, Lübeck, Germany
Jeanette Erdmann
Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia
Tõnu Esko & Andres Metspalu
Victor Chang Cardiac Research Institute, Darlinghurst, New South Wales, Australia
Diane Fatkin
Faculty of Medicine, UNSW Sydney, Kensington, New South Wales, Australia
Diane Fatkin
Cardiology Department, St Vincent’s Hospital, Darlinghurst, New South Wales, Australia
Diane Fatkin
Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Steven Ferriera & Namrata Gupta
Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Jose Florez
Programs in Metabolism and Medical & Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jose Florez
Institute of Clinical Molecular Biology (IKMB), Christian-Albrechts-University of Kiel, Kiel, Germany
Andre Franke
Helsinki University and Helsinki University Hospital Clinic of Gastroenterology, Helsinki, Finland
Martti Färkkilä
Bioinformatics Program MGH Cancer Center and Department of Pathology, Boston, MA, USA
Stacey Gabriel & Gad Getz
Cancer Genome Computational Analysis, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Gad Getz
Department of Psychiatry and Behavioral Sciences, Boston Children’s Hospital and Harvard Medical School, Boston, MA, USA
David C. Glahn
Harvard Medical School Teaching Hospital, Boston, MA, USA
David C. Glahn
Department of Endocrinology and Metabolism, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
Benjamin Glaser
Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
Stephen J. Glatt
Institute for Genomic Medicine, Columbia University Medical Center Hammer Health Sciences, New York City, NY, USA
David Goldstein
Department of Genetics & Development, Columbia University Medical Center, Hammer Health Sciences, New York City, NY, USA
David Goldstein
Centro de Investigacion en Salud Poblacional, Instituto Nacional de Salud Publica, Cuernavaca, Mexico
Clicerio Gonzalez
Lund University, Lund, Sweden
Leif Groop
Institute for Molecular Medicine Finland (FIMM), HiLIFE University of Helsinki, Helsinki, Finland
Leif Groop, Jaakko Kaprio, Aarno Palotie, Samuli Ripatti, Tiinamaija Tuomi & Maija Wessman
Lund University Diabetes Centre, Malmö, Sweden
Christopher Haiman
Washington School of Medicine, St Louis, MO, USA
Ira Hall & Nathan Stitziel
Human Genetics Center University of Texas Health Science Center at Houston, Houston, TX, USA
Craig Hanis
Department of Neurology, Columbia University, New York City, NY, USA
Matthew Harms
Institute of Genomic Medicine, Columbia University, New York City, NY, USA
Matthew Harms
Institute of Biomedicine, University of Eastern Finland, Kuopio, Finland
Mikko Hiltunen
Department of Psychiatry, Helsinki University Central Hospital Lapinlahdentie, Helsinki, Finland
Matti M. Holi
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Christina M. Hultman & Patrick F. Sullivan
Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Christina M. Hultman & Eimear Kenny
Bonei Olam, Center for Rare Jewish Genetic Diseases, Brooklyn, NY, USA
Chaim Jalas
Department of Neurology, Helsinki University, Central Hospital, Helsinki, Finland
Mikko Kallela
Cardiovascular Disease Initiative and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Sekar Kathiresan
Charles Bronfman Institute for Personalized Medicine, New York City, NY, USA
Eimear Kenny
Division of Genome Science, Department of Precision Medicine, National Institute of Health, Cheongju, Republic of Korea
Bong-Jo Kim & Young Jin Kim
MRC Centre for Neuropsychiatric Genetics & Genomics, Cardiff University School of Medicine, Cardiff, UK
George Kirov, Michael C. O’Donovan & Michael J. Owen
National Heart and Lung Institute Cardiovascular Sciences, Hammersmith Campus, Imperial College London, London, UK
Jaspal Kooner
Department of Health THL-National Institute for Health and Welfare, Helsinki, Finland
Seppo Koskinen
Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, Center for Outcomes Research and Evaluation Yale-New Haven Hospital, New Haven, CT, USA
Harlan M. Krumholz
Division of Pediatric Gastroenterology, Emory University School of Medicine, Atlanta, GA, USA
Subra Kugathasan
Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Soo Heon Kwak & Kyong Soo Park
The University of Eastern Finland, Institute of Clinical Medicine, Kuopio, Finland
Markku Laakso
Kuopio University Hospital, Kuopio, Finland
Markku Laakso
Department of Genetics, Yale School of Medicine, New Haven, CT, USA
Nicole Lake & Monkol Lek
Department of Clinical Chemistry Fimlab Laboratories and Finnish Cardiovascular Research Center-Tampere Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
Terho Lehtimäki & Kari M. Mattila
The Mindich Child Health and Development, Institute Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Ruth J. F. Loos
National Autonomous University of Mexico, Mexico City, Mexico
Teresa Tusie Luna
Salvador Zubirán National Institute of Health Sciences and Nutrition, Mexico City, Mexico
Teresa Tusie Luna
Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong, China
Ronald C. W. Ma
Hong Kong Institute of Diabetes and Obesity, The Chinese University of Hong Kong, Hong Kong, China
Ronald C. W. Ma
University of California San Francisco Parnassus Campus, San Francisco, CA, USA
Gregory M. Marcus
Cardiovascular Research REGICOR Group, Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
Jaume Marrugat
Department of Genetics, Harvard Medical School, Boston, MA, USA
Steven McCarroll
Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK
Mark I. McCarthy
Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
Mark I. McCarthy
Oxford NIHR Biomedical Research Centre, Oxford University Hospitals, NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
Mark I. McCarthy
John P. Hussman Institute for Human Genomics, Leonard M. Miller School of Medicine, University of Miami, Miami, FL, USA
Jacob McCauley
The Dr. John T. Macdonald Foundation Department of Human Genetics, Leonard M. Miller School of Medicine, University of Miami, Miami, FL, USA
Jacob McCauley
F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute Cedars-Sinai Medical Center, Los Angeles, CA, USA
Dermot McGovern
Atherogenomics Laboratory University of Ottawa, Heart Institute, Ottawa, Ontario, Canada
Ruth McPherson
Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA
James B. Meigs
Department of Clinical Sciences University, Hospital Malmo Clinical Research Center, Lund University, Malmö, Sweden
Olle Melander
University of Arizona Health Science, Tucson, AZ, USA
Deborah Meyers
Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Braxton D. Mitchell
Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
Vamsi K. Mootha
International Centre for Diarrhoeal Disease Research, Dhaka, Bangladesh
Aliya Naheed
Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Saman Nazarian & Dan Rader
Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Saman Nazarian
Lund University, Department of Clinical Sciences, Skåne University Hospital, Malmö, Sweden
Peter M. Nilsson
Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
Yukinori Okada & Qingbo Wang
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
Yukinori Okada
Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
Yukinori Okada
Instituto Nacional de Medicina Genómica, (INMEGEN), Mexico City, Mexico
Lorena Orozco
Medical Research Institute, Ninewells Hospital and Medical School University of Dundee, Dundee, UK
Colin Palmer
Wake Forest School of Medicine, Winston-Salem, NC, USA
Nicholette D. Palmer
Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Republic of Korea
Kyong Soo Park
Department of Psychiatry Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
Carlos Pato
Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Ann E. Pulver
Children’s Hospital of Philadelphia, Philadelphia, PA, USA
Dan Rader
Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK
Nazneen Rahman
University of Washington, Seattle, WA, USA
Alex Reiner
Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Alex Reiner
Medical Research Center, Oulu University Hospital, Oulu Finland and Research Unit of Clinical Neuroscience Neurology University of Oulu, Oulu, Finland
Anne M. Remes
Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
Stephen Rich
Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Stephen Rich
Research Center Montreal Heart Institute, Montreal, Québec, Canada
John D. Rioux
Department of Medicine, Faculty of Medicine Université de Montréal, Montreal, Québec, Canada
John D. Rioux
Department of Public Health, Faculty of Medicine, University of Helsinki, Helsinki, Finland
Samuli Ripatti & Erkki Vartiainen
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Samuli Ripatti & J. Gustav Smith
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
Dan M. Roden
Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Dan M. Roden
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Jerome I. Rotter & Kent D. Taylor
Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Danish Saleheen
Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
Danish Saleheen
Center for Non-Communicable Diseases, Karachi, Pakistan
Danish Saleheen
National Institute for Health and Welfare, Helsinki, Finland
Veikko Salomaa & Jaana Suvisaari
Deutsches Herzzentrum, München, Germany
Heribert Schunkert
Technische Universität München, München, Germany
Heribert Schunkert
Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
Sebastian Schönherr
Duke Molecular Physiology Institute, Durham, NC, USA
Svati H. Shah
Division of Cardiovascular Medicine, Nashville VA Medical Center, Vanderbilt University School of Medicine, Nashville, TN, USA
Moore B. Shoemaker
Division of Endocrinology, National University Hospital, Singapore City, Singapore
Tai Shyong
NUS Saw Swee Hock School of Public Health, Singapore City, Singapore
Tai Shyong
Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Edwin K. Silverman
Harvard Medical School, Boston, MA, USA
Edwin K. Silverman
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Pamela Sklar
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Pamela Sklar
Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Pamela Sklar
The Wallenberg Laboratory/Department of Molecular and Clinical Medicine, Institute of Medicine, Gothenburg University and the Department of Cardiology, Sahlgrenska University Hospital, Gothenburg, Sweden
J. Gustav Smith
Department of Cardiology, Wallenberg Center for Molecular Medicine and Lund University Diabetes Center, Clinical Sciences, Lund University and Skåne University Hospital, Lund, Sweden
J. Gustav Smith
Institute of Clinical Medicine, Neurology, University of Eastern Finland, Kuopio, Finland
Hilkka Soininen
Gastroenterology Department, Sorbonne Université, INSERM, Centre de Recherche Saint-Antoine, CRSA, AP-HP, Saint Antoine Hospital, Paris, France
Harry Sokol
INRA, UMR1319 Micalis & AgroParisTech, Jouy en Josas, France
Harry Sokol
Paris Center for Microbiome Medicine (PaCeMM), FHU, Paris, France
Harry Sokol
Department of Twin Research and Genetic Epidemiology, King’s College London, London, UK
Tim Spector
The McDonnell Genome Institute at Washington University, Seattle, WA, USA
Nathan Stitziel
Departments of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA
Patrick F. Sullivan
Saw Swee Hock School of Public Health, National University of Singapore, National University Health System, Singapore City, Singapore
E. Shyong Tai & Yik Ying Teo
Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, Singapore City, Singapore
E. Shyong Tai
Duke-NUS Graduate Medical School, Singapore City, Singapore
E. Shyong Tai
Life Sciences Institute, National University of Singapore, Singapore City, Singapore
Yik Ying Teo
Department of Statistics and Applied Probability, National University of Singapore, Singapore City, Singapore
Yik Ying Teo
Center for Behavioral Genomics, Department of Psychiatry, University of California, San Diego, San Diego, CA, USA
Ming Tsuang
Institute of Genomic Medicine, University of California, San Diego, San Diego, CA, USA
Ming Tsuang
Endocrinology, Abdominal Center, Helsinki University Hospital, Helsinki, Finland
Tiinamaija Tuomi
Institute of Genetics, Folkhälsan Research Center, Helsinki, Finland
Tiinamaija Tuomi
Juliet Keidan Institute of Pediatric Gastroenterology, Shaare Zedek Medical Center, The Hebrew University of Jerusalem, Jerusalem, Israel
Dan Turner
Instituto de Investigaciones Biomédicas, UNAM, Mexico City, Mexico
Teresa Tusie-Luna
Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
Teresa Tusie-Luna
Department of Psychiatry and Human Behavior, University of California Irvine, Irvine, CA, USA
Marquis Vawter
National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London, UK
James S. Ware
Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London, UK
James S. Ware
Radcliffe Department of Medicine, University of Oxford, Oxford, UK
Hugh Watkins
Department of Gastroenterology and Hepatology, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Rinse K. Weersma
Folkhälsan Institute of Genetics, Folkhälsan Research Center, Helsinki, Finland
Maija Wessman
National Heart & Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, London, UK
Nicola Whiffin
Cardiovascular Research Centre, Royal Brompton & Harefield Hospitals NHS Trust, London, UK
Nicola Whiffin
Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, USA
James G. Wilson
Program in Infectious Disease and Microbiome, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Ramnik J. Xavier
Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, MA, USA
Ramnik J. Xavier

Authors

Michael H. Guo
Laurent C. Francioli
Sarah L. Stenton
Julia K. Goodrich
Nicholas A. Watts
Moriel Singer-Berk
Emily Groopman
Philip W. Darnowsky
Matthew Solomonson
Samantha Baxter
Grace Tiao
Benjamin M. Neale
Joel N. Hirschhorn
Heidi L. Rehm
Mark J. Daly
Anne O’Donnell-Luria
Konrad J. Karczewski
Daniel G. MacArthur
Kaitlin E. Samocha

Consortia

gnomAD Project Consortium

Maria Abreu, Carlos A. Aguilar Salinas, Tariq Ahmad, Christine M. Albert, Jessica Alföldi, Diego Ardissino, Irina M. Armean, Gil Atzmon, Eric Banks, John Barnard, Samantha M. Baxter, Laurent Beaugerie, Emelia J. Benjamin, David Benjamin, Louis Bergelson, Michael Boehnke, Lori L. Bonnycastle, Erwin P. Bottinger, Donald W. Bowden, Matthew J. Bown, Steven Brant, Sarah E. Calvo, Hannia Campos, John C. Chambers, Juliana C. Chan, Katherine R. Chao, Sinéad Chapman, Daniel Chasman, Siwei Chen, Rex L. Chisholm, Judy Cho, Rajiv Chowdhury, Mina K. Chung, Wendy K. Chung, Kristian Cibulskis, Bruce Cohen, Ryan L. Collins, Kristen M. Connolly, Adolfo Correa, Miguel Covarrubias, Beryl Cummings, Dana Dabelea, Mark J. Daly, John Danesh, Dawood Darbar, Joshua Denny, Stacey Donnelly, Ravindranath Duggirala, Josée Dupuis, Patrick T. Ellinor, Roberto Elosua, James Emery, Eleina England, Jeanette Erdmann, Tõnu Esko, Emily Evangelista, Yossi Farjoun, Diane Fatkin, Steven Ferriera, Jose Florez, Laurent C. Francioli, Andre Franke, Martti Färkkilä, Stacey Gabriel, Kiran Garimella, Laura D. Gauthier, Jeff Gentry, Gad Getz, David C. Glahn, Benjamin Glaser, Stephen J. Glatt, David Goldstein, Clicerio Gonzalez, Julia K. Goodrich, Leif Groop, Sanna Gudmundsson, Namrata Gupta, Andrea Haessly, Christopher Haiman, Ira Hall, Craig Hanis, Matthew Harms, Mikko Hiltunen, Matti M. Holi, Christina M. Hultman, Chaim Jalas, Thibault Jeandet, Mikko Kallela, Diane Kaplan, Jaakko Kaprio, Konrad J. Karczewski, Sekar Kathiresan, Eimear Kenny, Bong-Jo Kim, Young Jin Kim, George Kirov, Zan Koenig, Jaspal Kooner, Seppo Koskinen, Harlan M. Krumholz, Subra Kugathasan, Soo Heon Kwak, Markku Laakso, Nicole Lake, Trevyn Langsford, Kristen M. Laricchia, Terho Lehtimäki, Monkol Lek, Emily Lipscomb, Christopher Llanwarne, Ruth J. F. Loos, Steven A. Lubitz, Teresa Tusie Luna, Ronald C. W. Ma, Daniel G. MacArthur, Gregory M. Marcus, Jaume Marrugat, Alicia R. Martin, Kari M. Mattila, Steven McCarroll, Mark I. McCarthy, Jacob McCauley, Dermot McGovern, Ruth McPherson, James B. Meigs, Olle Melander, Andres Metspalu, Deborah Meyers, Eric V. Minikel, Braxton D. Mitchell, Vamsi K. Mootha, Ruchi Munshi, Aliya Naheed, Saman Nazarian, Benjamin M. Neale, Peter M. Nilsson, Sam Novod, Anne H. O’Donnell-Luria, Michael C. O’Donovan, Yukinori Okada, Dost Ongur, Lorena Orozco, Michael J. Owen, Colin Palmer, Nicholette D. Palmer, Aarno Palotie, Kyong Soo Park, Carlos Pato, Nikelle Petrillo, William Phu, Timothy Poterba, Ann E. Pulver, Dan Rader, Nazneen Rahman, Heidi L. Rehm, Alex Reiner, Anne M. Remes, Dan Rhodes, Stephen Rich, John D. Rioux, Samuli Ripatti, David Roazen, Dan M. Roden, Jerome I. Rotter, Valentin Ruano-Rubio, Nareh Sahakian, Danish Saleheen, Veikko Salomaa, Andrea Saltzman, Nilesh J. Samani, Kaitlin E. Samocha, Jeremiah Scharf, Molly Schleicher, Heribert Schunkert, Sebastian Schönherr, Eleanor Seaby, Cotton Seed, Svati H. Shah, Megan Shand, Moore B. Shoemaker, Tai Shyong, Edwin K. Silverman, Moriel Singer-Berk, Pamela Sklar, J. Gustav Smith, Jonathan T. Smith, Hilkka Soininen, Harry Sokol, Matthew Solomonson, Rachel G. Son, Jose Soto, Tim Spector, Christine Stevens, Nathan Stitziel, Patrick F. Sullivan, Jaana Suvisaari, E. Shyong Tai, Michael E. Talkowski, Yekaterina Tarasova, Kent D. Taylor, Yik Ying Teo, Grace Tiao, Kathleen Tibbetts, Charlotte Tolonen, Ming Tsuang, Tiinamaija Tuomi, Dan Turner, Teresa Tusie-Luna, Erkki Vartiainen, Marquis Vawter, Christopher Vittal, Gordon Wade, Arcturus Wang, Qingbo Wang, James S. Ware, Hugh Watkins, Nicholas A. Watts, Rinse K. Weersma, Ben Weisburd, Maija Wessman, Nicola Whiffin, Michael W. Wilson, James G. Wilson, Ramnik J. Xavier & Mary T. Yohannes

Contributions

M.H.G., L.C.F., S.L.S., J.K.G., A.O.-L., K.J.K., D.G.M. and K.E.S. conceived and designed experiments. M.H.G., L.C.F., S.L.S. and J.K.G. performed the analyses. N.A.W., P.W.D. and M.S. developed visualizations for the web browser. E.G. and M.S.-B. performed variant curation. S.B., G.T., B.M.N., J.N.H., H.L.R., M.J.D., A.O.-L. and K.J.K. provided data and analysis suggestions. J.N.H., D.G.M. and K.E.S. supervised the work. M.H.G., L.C.F., S.L.S., J.K.G. and K.E.S. completed the primary writing of the manuscript with input and approval of the final version from all other authors.

Corresponding author

Correspondence to Kaitlin E. Samocha.

Ethics declarations

Competing interests

L.C.F. is currently an employee of, and owns stock in, Vertex Pharmaceuticals. B.M.N. is a member of the scientific advisory board at Deep Genomics and Neumora (f/k/a RBNC Therapeutics). H.L.R. has received support from Illumina and Microsoft to support rare disease gene discovery and diagnosis. M.J.D. is a founder of Maze Therapeutics and Neumora Therapeutics (f/k/a RBNC Therapeutics). A.O.-L. has consulted for Tome Biosciences and Ono Pharma USA and is a member of the scientific advisory board for Congenica and the Simons Foundation SPARK for Autism study. K.J.K. is a consultant for Tome Biosciences and Vor Biosciences and a member of the Scientific Advisory Board of Nurture Genomics. D.G.M. is a paid advisor to GlaxoSmithKline, Insitro, Variant Bio and Overtone Therapeutics and has received research support from AbbVie, Astellas, Biogen, BioMarin, Eisai, Google, Merck, Microsoft, Pfizer and Sanofi-Genzyme. K.E.S. has received support from Microsoft for work related to rare disease diagnostics. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Arnaldur Gylfason, Tobias Marschall and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Publicly available browser for sharing phasing data.

a, Sample gnomAD browser output for two variants (GRCh37 1-55505647-G-T and 1-55523855-G-A) in the gene PCSK9. At the top, a table subdivided by genetic ancestry group displays how many individuals in gnomAD v2 from each genetic ancestry group are consistent with the two variants occurring on different haplotypes (trans) and how many are consistent with their occurring on the same haplotype (cis). Below that, a 3×3 table contains the nine possible combinations of genotypes for the two variants of interest. The number of individuals in gnomAD v2 falling in each combination is shown, colored by whether the combination is consistent with the variants falling on different haplotypes (red), on the same haplotype (blue) or is indeterminate (purple). The estimated haplotype counts for the four possible haplotypes for the two variants, as calculated by the EM algorithm, are displayed on the bottom right. The probability of being in trans for this particular pair of variants is >99%. b, Variant co-occurrence tables on the gene landing page. For each gene (GBA1 shown), the top table lists the number of individuals carrying pairs of rare heterozygous variants by inferred phase, allele frequency (AF) and predicted functional consequence. The number of individuals with homozygous variants is tabulated in the same manner and presented as a comparison below. AF thresholds of ≤ 5%, ≤ 1% and ≤ 0.5% are displayed across six predicted functional consequences (combinations of pLoF, various evidence strengths of predicted pathogenicity for missense variants, and synonymous variants). Both variants in the variant pair must be annotated with a consequence at least as severe as the consequence listed (that is, pLoF + strong missense also includes pLoF + pLoF).
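
To make the relationship between the 3×3 genotype table and the four estimated haplotype counts in panel a concrete, the following is a minimal Python sketch of a textbook two-locus EM estimate. It is an illustration under stated assumptions, not the code used for the browser: the function names `em_haplotype_counts` and `p_trans`, the equal-split initialization, the convergence settings and the toy counts are all hypothetical, and the trans probability is assumed here to be the share of the trans haplotype product in the sum of the cis and trans products.

```python
def em_haplotype_counts(gt, n_iter=200, tol=1e-9):
    """Estimate the four haplotype counts from a 3x3 genotype table.

    gt[i][j] = number of individuals carrying i alternate alleles at
    variant A and j alternate alleles at variant B (i, j in 0..2).
    Returns [ref-ref, ref-alt, alt-ref, alt-alt] counts, where the first
    position refers to variant A and the second to variant B.
    """
    # Every genotype except the double heterozygote gt[1][1] determines
    # its two haplotypes unambiguously.
    base = [
        2 * gt[0][0] + gt[0][1] + gt[1][0],  # ref at A, ref at B
        2 * gt[0][2] + gt[0][1] + gt[1][2],  # ref at A, alt at B
        2 * gt[2][0] + gt[1][0] + gt[2][1],  # alt at A, ref at B
        2 * gt[2][2] + gt[1][2] + gt[2][1],  # alt at A, alt at B
    ]
    n_double_het = gt[1][1]
    # Assumed starting point: split double heterozygotes equally
    # between the cis and trans configurations.
    counts = [b + n_double_het / 2 for b in base]
    for _ in range(n_iter):
        # E-step: expected fraction of double heterozygotes in cis.
        # Using counts instead of frequencies is fine because the
        # normalizing constant cancels in the ratio.
        cis_w = counts[0] * counts[3]
        trans_w = counts[1] * counts[2]
        denom = cis_w + trans_w
        frac_cis = cis_w / denom if denom > 0 else 0.5
        n_cis = n_double_het * frac_cis
        n_trans = n_double_het - n_cis
        # M-step: add the expected double-heterozygote haplotypes.
        new = [base[0] + n_cis, base[1] + n_trans,
               base[2] + n_trans, base[3] + n_cis]
        converged = max(abs(a - b) for a, b in zip(new, counts)) < tol
        counts = new
        if converged:
            break
    return counts


def p_trans(counts):
    """Assumed trans probability: trans product over cis + trans products."""
    cis_w = counts[0] * counts[3]
    trans_w = counts[1] * counts[2]
    total = cis_w + trans_w
    return trans_w / total if total > 0 else float("nan")


# Toy table: each variant is only ever seen without the other.
trans_like = [[1000, 10, 0], [10, 0, 0], [0, 0, 0]]
# Toy table: the two variants are only ever seen together.
cis_like = [[1000, 0, 0], [0, 10, 0], [0, 0, 0]]

print(p_trans(em_haplotype_counts(trans_like)))
print(p_trans(em_haplotype_counts(cis_like)))
```

In the first toy table no haplotype carrying both alternate alleles is ever supported, so the estimate points to trans; in the second, the variants only co-occur, so the EM iterations push the double heterozygotes toward the cis configuration.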

Supplementary information

Supplementary Information

Supplementary Note and Supplementary Figs. 1–9.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1 and 2: The dataset includes phasing information for diagnostic variants from CMG patients and manual curation of rare, compound heterozygous loss-of-function variants.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Guo, M.H., Francioli, L.C., Stenton, S.L. et al. Inferring compound heterozygosity from large-scale exome sequencing data. Nat Genet 56, 152–161 (2024). https://doi.org/10.1038/s41588-023-01608-3

Received: 24 March 2023
Accepted: 08 November 2023
Published: 06 December 2023
Issue Date: January 2024
DOI: https://doi.org/10.1038/s41588-023-01608-3

This article is cited by

A genomic mutational constraint map using variation in 76,156 human genomes
- Siwei Chen
- Laurent C. Francioli
- Konrad J. Karczewski
Nature (2024)
