A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies

Li, Zilin; Li, Xihao; Zhou, Hufeng; Gaynor, Sheila M.; Selvaraj, Margaret Sunitha; Arapoglou, Theodore; Quick, Corbin; Liu, Yaowu; Chen, Han; Sun, Ryan; Dey, Rounak; Arnett, Donna K.; Auer, Paul L.; Bielak, Lawrence F.; Bis, Joshua C.; Blackwell, Thomas W.; Blangero, John; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer A.; Cade, Brian E.; Conomos, Matthew P.; Correa, Adolfo; Cupples, L. Adrienne; Curran, Joanne E.; de Vries, Paul S.; Duggirala, Ravindranath; Franceschini, Nora; Freedman, Barry I.; Göring, Harald H. H.; Guo, Xiuqing; Kalyani, Rita R.; Kooperberg, Charles; Kral, Brian G.; Lange, Leslie A.; Lin, Bridget M.; Manichaikul, Ani; Manning, Alisa K.; Martin, Lisa W.; Mathias, Rasika A.; Meigs, James B.; Mitchell, Braxton D.; Montasser, May E.; Morrison, Alanna C.; Naseri, Take; O’Connell, Jeffrey R.; Palmer, Nicholette D.; Peyser, Patricia A.; Psaty, Bruce M.; Raffield, Laura M.; Redline, Susan; Reiner, Alexander P.; Reupena, Muagututi’a Sefuiva; Rice, Kenneth M.; Rich, Stephen S.; Smith, Jennifer A.; Taylor, Kent D.; Taub, Margaret A.; Vasan, Ramachandran S.; Weeks, Daniel E.; Wilson, James G.; Yanek, Lisa R.; Zhao, Wei; Rotter, Jerome I.; Willer, Cristen J.; Natarajan, Pradeep; Peloso, Gina M.; Lin, Xihong

doi:10.1038/s41592-022-01640-x

Article
Published: 27 October 2022

A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies

Zilin Li ORCID: orcid.org/0000-0003-1521-8945^1,2^na1,
Xihao Li ORCID: orcid.org/0000-0001-8151-0106¹^na1,
Hufeng Zhou¹,
Sheila M. Gaynor¹,
Margaret Sunitha Selvaraj ORCID: orcid.org/0000-0002-2751-9254^3,4,5,
Theodore Arapoglou¹,
Corbin Quick¹,
Yaowu Liu⁶,
Han Chen^7,8,
Ryan Sun⁹,
Rounak Dey¹,
Donna K. Arnett¹⁰,
Paul L. Auer¹¹,
Lawrence F. Bielak ORCID: orcid.org/0000-0002-3443-8030¹²,
Joshua C. Bis¹³,
Thomas W. Blackwell¹⁴,
John Blangero¹⁵,
Eric Boerwinkle^7,16,
Donald W. Bowden¹⁷,
Jennifer A. Brody ORCID: orcid.org/0000-0001-8509-148X¹³,
Brian E. Cade^4,18,19,
Matthew P. Conomos ORCID: orcid.org/0000-0001-9744-0851²⁰,
Adolfo Correa²¹,
L. Adrienne Cupples^22,23,
Joanne E. Curran ORCID: orcid.org/0000-0002-6898-155X¹⁵,
Paul S. de Vries⁷,
Ravindranath Duggirala¹⁵,
Nora Franceschini²⁴,
Barry I. Freedman ORCID: orcid.org/0000-0003-0275-5530²⁵,
Harald H. H. Göring¹⁵,
Xiuqing Guo ORCID: orcid.org/0000-0002-5264-5068²⁶,
Rita R. Kalyani²⁷,
Charles Kooperberg ORCID: orcid.org/0000-0002-7986-8560²⁸,
Brian G. Kral²⁷,
Leslie A. Lange²⁹,
Bridget M. Lin³⁰,
Ani Manichaikul³¹,
Alisa K. Manning^5,32,33,
Lisa W. Martin³⁴,
Rasika A. Mathias²⁷,
James B. Meigs^4,5,35,
Braxton D. Mitchell^36,37,
May E. Montasser³⁸,
Alanna C. Morrison⁷,
Take Naseri³⁹,
Jeffrey R. O’Connell³⁶,
Nicholette D. Palmer ORCID: orcid.org/0000-0001-8883-2511¹⁷,
Patricia A. Peyser ORCID: orcid.org/0000-0002-9717-8459¹²,
Bruce M. Psaty ORCID: orcid.org/0000-0002-7278-2190^13,40,41,
Laura M. Raffield⁴²,
Susan Redline^18,19,43,
Alexander P. Reiner ORCID: orcid.org/0000-0002-1427-4470^28,40,
Muagututi’a Sefuiva Reupena⁴⁴,
Kenneth M. Rice ORCID: orcid.org/0000-0002-3071-7278²⁰,
Stephen S. Rich ORCID: orcid.org/0000-0003-3872-7793³¹,
Jennifer A. Smith ORCID: orcid.org/0000-0002-3575-5468^12,45,
Kent D. Taylor²⁶,
Margaret A. Taub⁴⁶,
Ramachandran S. Vasan ORCID: orcid.org/0000-0001-7357-5970^23,47,
Daniel E. Weeks ORCID: orcid.org/0000-0001-9410-7228⁴⁸,
James G. Wilson⁴⁹,
Lisa R. Yanek ORCID: orcid.org/0000-0001-7117-1075²⁷,
Wei Zhao¹²,
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium,
TOPMed Lipids Working Group,
Jerome I. Rotter²⁶,
Cristen J. Willer ORCID: orcid.org/0000-0001-5645-4966^50,51,52,
Pradeep Natarajan^3,4,5,
Gina M. Peloso ORCID: orcid.org/0000-0002-5355-8636^22,23 &
…
Xihong Lin ORCID: orcid.org/0000-0001-7067-7752^1,4,53

Nature Methods volume 19, pages 1599–1611 (2022)Cite this article

9491 Accesses
27 Citations
47 Altmetric
Metrics details

Subjects

Abstract

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale

Article 24 August 2020

Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies

Article 23 December 2022

SAIGE-GENE+ improves the efficiency and accuracy of set-based rare variant association tests

Article Open access 22 September 2022

Data availability

This paper used the TOPMed Freeze 5 WGS data and phenotype data of lipids, CRP, eGFR, FG, FI and TL. The genotype and phenotype data are available in dbGAP. The TOPMed data were from the following 14 studies, under the provided accession numbers:

Framingham Heart Study (phs000974.v1.p1), Old Order Amish Study (phs000956.v1.p1), Jackson Heart Study (phs000964.v1.p1), Multi-Ethnic Study of Atherosclerosis (phs001416.v1.p1), GWAS of Adiposity in Samoans (phs000972) and Women’s Health Initiative (phs001237), Atherosclerosis Risk in Communities Study (phs001211), Cleveland Family Study (phs000954), Cardiovascular Health Study (phs001368), Diabetes Heart Study (phs001412), Genetic Study of Atherosclerosis Risk (phs001218), Genetic Epidemiology Network of Arteriopathy (phs001345), Genetics of Lipid Lowering Drugs and Diet Network (phs001359) and San Antonio Family Heart Study (phs001215).

The functional annotation data are publicly available and were downloaded from: GRCh38 CADD v1.4 (https://cadd.gs.washington.edu/download), ANNOVAR dbNSFP v3.3a (https://annovar.openbioinformatics.org/en/latest/user-guide/download/), LINSIGHT (https://github.com/CshlSiepelLab/LINSIGHT/), FATHMM-XF (http://fathmm.biocompute.org.uk/fathmm-xf/), CAGE (https://fantom.gsc.riken.jp/5/data/), GeneHancer (https://www.genecards.org/) and Umap/Bismap (https://bismap.hoffmanlab.org/). In addition, recombination rate and nucleotide diversity were obtained from work by Gazal et al.⁵⁴. The tissue-specific functional annotations were downloaded from ENCODE (https://www.encodeproject.org/report/?type=Experiment). The assembled functional annotation data from these sources are available at http://favor.genohub.org/.

Code availability

STAARpipeline is implemented as an open-source R package available at https://github.com/xihaoli/STAARpipeline/ (ref. ⁵⁵) and https://content.sph.harvard.edu/xlin/software.html. STAARpipelineSummary is implemented as an open-source R package available at https://github.com/xihaoli/STAARpipelineSummary/ (ref. ⁵⁶) and https://content.sph.harvard.edu/xlin/software.html. The scripts used to generate the results have been archived on Zenodo using https://doi.org/10.5281/zenodo.6871408 (ref. ⁵⁷). Data analysis was performed in R (3.6.1). STAAR v0.9.6, STAARpipeline v0.9.6 and STAARpipelineSummary v0.9.6 were used in simulation and real data analysis, and seqMeta v1.6.7 was used in simulation. Wget v1.14 was used for downloading the annotation data. FAVORannotator v1.0.0 (https://github.com/xihaoli/STAARpipeline-Tutorial/) was used to functionally annotate the whole-genome data.

References

Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wainschtein, P. et al. Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data. Nat. Genet. 54, 263–273 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hernandez, R. D. et al. Ultrarare variants drive substantial cis heritability of human gene expression. Nat. Genet. 51, 1349–1355 (2019).
Article CAS PubMed PubMed Central Google Scholar
Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021).
Article CAS PubMed PubMed Central Google Scholar
Flannick, J. et al. Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls. Nature 570, 71–76 (2019).
Article CAS PubMed PubMed Central Google Scholar
Van Hout, C. V. et al. Exome sequencing and characterization of 49,960 individuals in the UK Biobank. Nature 586, 749–756 (2020).
Article PubMed PubMed Central Google Scholar
Zhang, F. & Lupski, J. R. Non-coding genetic variants in human disease. Hum. Mol. Genet. 24, R102–R110 (2015).
Article CAS PubMed PubMed Central Google Scholar
Khurana, E. et al. Role of non-coding sequence variants in cancer. Nat. Rev. Genet. 17, 93–108 (2016).
Article CAS PubMed Google Scholar
Lee, P. H. et al. Principles and methods of in-silico prioritization of non-coding regulatory variants. Hum. Genet. 137, 15–30 (2018).
Article CAS PubMed Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article Google Scholar
Moore, J. E. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710 (2020).
Article PubMed PubMed Central Google Scholar
Bansal, V., Libiger, O., Torkamani, A. & Schork, N. J. Statistical analysis strategies for association studies involving rare variants. Nat. Rev. Genet. 11, 773–785 (2010).
Article PubMed PubMed Central Google Scholar
Lee, S., Abecasis, G. R., Boehnke, M. & Lin, X. Rare-variant association analysis: study designs and statistical tests. Am. J. Hum. Genet. 95, 5–23 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kiezun, A. et al. Exome sequencing and the genetic basis of complex traits. Nat. Genet. 44, 623 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, B. & Leal, S. M. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Hum. Genet. 83, 311–321 (2008).
Article CAS PubMed PubMed Central Google Scholar
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).
Article CAS PubMed PubMed Central Google Scholar
Li, X. et al. Dynamic incorporation of multiple in silico functional annotations empowers rare-variant association analysis of large whole-genome sequencing studies at scale. Nat. Genet. 52, 969–983 (2020).
Article CAS PubMed PubMed Central Google Scholar
Morrison, A. C. et al. Practical approaches for whole-genome sequence analysis of heart-and blood-related traits. Am. J. Hum. Genet. 100, 205–215 (2017).
Article CAS PubMed PubMed Central Google Scholar
Li, Z. et al. Dynamic scan procedure for detecting rare-variant association regions in whole-genome sequencing studies. Am. J. Hum. Genet. 104, 802–814 (2019).
Article CAS PubMed PubMed Central Google Scholar
He, Z., Xu, B., Buxbaum, J. & Ionita-Laza, I. A genome-wide scan statistic framework for whole-genome sequence data analysis. Nat. Commun. 10, 3018 (2019).
Article PubMed PubMed Central Google Scholar
Natarajan, P. et al. Deep-coverage whole genome sequences and blood lipids among 16,324 individuals. Nat. Commun. 9, 3391 (2018).
Article PubMed PubMed Central Google Scholar
Li, Z., Liu, Y. & Lin, X. Simultaneous detection of signal regions using quadratic scan statistics with applications to whole genome association studies. J. Am. Stat. Assoc. 117, 823–834 (2022).
Article CAS PubMed Google Scholar
Bocher, O. & Génin, E. Rare-variant association testing in the non-coding genome. Hum. Genet. 139, 1345–1362 (2020).
Article PubMed Google Scholar
Fishilevich, S. et al. GeneHancer: genome-wide integration of enhancers and target genes in GeneCards. Database 2017, bax028 (2017).
FANTOM Consortium. A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
Google Scholar
Andersson, R. et al. An atlas of active enhancers across human cell types and tissues. Nature 507, 455–461 (2014).
Article CAS PubMed PubMed Central Google Scholar
Breslow, N. E. & Clayton, D. G. Approximate inference in generalized linear mixed models. J. Am. Stat. Assoc. 88, 9–25 (1993).
Google Scholar
Chen, H. et al. Control for population structure and relatedness for binary traits in genetic association studies via logistic mixed models. Am. J. Hum. Genet. 98, 653–666 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. et al. Efficient variant set mixed model association tests for continuous and binary traits in large-scale whole-genome sequencing studies. Am. J. Hum. Genet. 104, 260–274 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhou, H., Arapoglou, T., Li, X., Li, Z. & Lin, X.. FAVOR Essential Database. https://doi.org/10.7910/DVN/1VGTJI (Harvard Dataverse V1, 2022).
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Article CAS PubMed PubMed Central Google Scholar
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).
Article CAS PubMed Google Scholar
Kinsella, R. J. et al. Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database 2011, bar030 (2011).
Povysil, G. et al. Rare-variant collapsing analyses for complex traits: guidelines and applications. Nat. Rev. Genet. 20, 747–759 (2019).
Article CAS PubMed Google Scholar
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
Article PubMed PubMed Central Google Scholar
Huang, Y.-F., Gulko, B. & Siepel, A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat. Genet. 49, 618–624 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rogers, M. F. et al. FATHMM-XF: accurate prediction of pathogenic point mutations via extended features. Bioinformatics 34, 511–513 (2017).
Article PubMed Central Google Scholar
Liu, Y. et al. ACAT: a fast and powerful P value combination method for rare-variant analysis in sequencing studies. Am. J. Hum. Genet. 104, 410–421 (2019).
Article CAS PubMed PubMed Central Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Article CAS PubMed Google Scholar
Stilp, A. M. et al. A system for phenotype harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program. Am. J. Epidemiol. 190, 1977–1922 (2021).
Article PubMed PubMed Central Google Scholar
Moutsianas, L. et al. The power of gene-based rare-variant methods to detect disease-associated variation and test hypotheses about complex disease. PLoS Genet. 11, e1005165 (2015).
Article PubMed PubMed Central Google Scholar
Raffield, L. M. et al. Allelic heterogeneity at the CRP locus identified by whole-genome sequencing in multi-ancestry cohorts. Am. J. Hum. Genet. 106, 112–120 (2020).
Article CAS PubMed Google Scholar
Lin, B. M. et al. Whole-genome sequence analyses of eGFR in 23,732 people representing multiple ancestries in the NHLBI trans-omics for precision medicine (TOPMed) consortium. EBioMedicine 63, 103157 (2021).
Article CAS PubMed PubMed Central Google Scholar
DiCorpo, D. et al. Whole-genome sequence association analysis of fasting glucose and fasting insulin levels in diverse cohorts from the NHLBI TOPMed Program. Commun. Biol. 5, 756 (2022).
Article CAS PubMed PubMed Central Google Scholar
Taub, M. A. et al. Genetic determinants of telomere length from 109,122 ancestrally diverse whole-genome sequences in TOPMed. Cell Genom. 2, 100084 (2022).
Article CAS PubMed Google Scholar
Schaffner, S. F. et al. Calibrating a coalescent simulation of human genome sequence variation. Genome Res. 15, 1576–1583 (2005).
Article CAS PubMed PubMed Central Google Scholar
Lee, S., Wu, M. C. & Lin, X. Optimal tests for rare-variant effects in sequencing association studies. Biostatistics 13, 762–775 (2012).
Article PubMed PubMed Central Google Scholar
Zaidi, A. A. & Mathieson, I. Demographic history mediates the effect of stratification on polygenic scores. Elife 9, e61548 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gogarten, S. M. et al. Genetic association testing using the GENESIS R/Bioconductor package. Bioinformatics 35, 5346–5348 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zheng, X. & Davis, J. W. SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models. Bioinformatics 37, 728–730 (2020).
Article Google Scholar
Peloso, G. M. et al. Association of low-frequency and rare coding-sequence variants with blood lipids and coronary heart disease in 56,000 whites and blacks. Am. J. Hum. Genet. 94, 223–232 (2014).
Article CAS PubMed PubMed Central Google Scholar
Moon, S., Lee, Y., Won, S. & Lee, J. Multiple genotype-phenotype association study reveals intronic variant pair on SIDT2 associated with metabolic syndrome in a Korean population. Hum. Genomics 12, 48 (2018).
Article CAS PubMed PubMed Central Google Scholar
Dong, C. et al. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum. Mol. Genet. 24, 2125–2137 (2015).
Article CAS PubMed Google Scholar
Gazal, S. et al. Linkage disequilibrium–dependent architecture of human complex traits shows action of negative selection. Nat. Genet. 49, 1421–1427 (2017).
Article CAS PubMed PubMed Central Google Scholar
Li, X. & Li, Z. xihaoli/STAARpipeline: STAARpipeline_v0.9.6. version 0.9.6 https://doi.org/10.5281/zenodo.6871504 (2022).
Li, X. & Li, Z. xihaoli/STAARpipelineSummary: STAARpipelineSummary_v0.9.6. version 0.9.6 https://doi.org/10.5281/zenodo.6871524 (2022).
Li, X. & Li, Z. xihaoli/STAARpipeline-Tutorial: v0.9.6. version 0.9.6 https://doi.org/10.5281/zenodo.6871408 (2022).

Download references

Acknowledgements

This work was supported by grants R35-CA197449, U19-CA203654, R01-HL113338, U01-HG012064 and U01-HG009088 (to X. Lin); R01-HL142711 and R01-HL127564 (to P.N. and G.M.P.); R35-HL135824 (to C.J.W.); 75N92020D00001, HHSN268201500003I, N01-HC-95159, 75N92020D00005, N01-HC-95160, 75N92020D00002, N01-HC-95161, 75N92020D00003, N01-HC-95162, 75N92020D00006, N01-HC-95163, 75N92020D00004, N01-HC-95164, 75N92020D00007, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-000040, UL1-TR-001079, UL1-TR-001420, UL1-TR001881, DK063491, R01-HL071051, R01-HL071205, R01-HL071250, R01-HL071251, R01-HL071258, R01-HL071259 and UL1-RR033176 (to J.I.R. and X.G.); U01-HL72518, HL087698, HL49762, HL59684, HL58625, HL071025, HL112064, NR0224103 and M01-RR000052 (to the Johns Hopkins General Clinical Research Center); NO1-HC-25195, HHSN268201500001I, 75N92019D00031 and R01-HL092577-06S1 (to R.S.V. and L.A.C.); the Evans Medical Foundation and the Jay and Louis Coffman Endowment from the Department of Medicine, Boston University School of Medicine (to R.S.V.); HHSN268201800001I and U01-HL137162 (to K.M.R. and M.P.C.); R01-HL133040 (to D.E.W.); R35-HL135818, R01-HL113338, and HL436801 (to S.R.); KL2TR002490 (to L.M.R.); R01-HL92301, R01-HL67348, R01-NS058700, R01-AR48797 and R01-AG058921 (to N.D.P. and D.W.B.); R01-DK071891 (to N.D.P., B.I.F. and D.W.B.); M01-RR07122 and F32-HL085989 (General Clinical Research Center of the Wake Forest University School of Medicine); the American Diabetes Association and P60-AG10484 (Claude Pepper Older Americans Independence Center of Wake Forest University Health Sciences); U01-HL137181 (to J.R.O.); and R01-HL141944 (to R.A.M.). R.A.M. receives support as the Sarah Miller Coulson Scholar in the Johns Hopkins Center for Innovative Medicine; HHSN268201600018C, HHSN268201600001C, HHSN268201600002C, HHSN268201600003C and HHSN268201600004C (to C.L.K.); R01-HL113323, U01-DK085524, R01-HL045522, R01-MH078143, R01-MH078111, and R01-MH083824 (to H.H.H.G., R.D., J.E.C. and J.B.); R01- DK117445 and R01-MD012765 (to N.F. and B.M.L.); U01-DK078616 and R01-DK078616 (to J.B.M. and A.K.M.); 18CDA34110116 from American Heart Association (to P.S.d.V.); HHSN268201800010I, HHSN268201800011I, HHSN268201800012I, HHSN268201800013I, HHSN268201800014I and HHSN268201800015I (to A.C.); R01-HL153805 and R03-HL154284 (to B.E.C.); HHSN268201700001I, HHSN268201700002I, HHSN268201700003I, HHSN268201700005I and HHSN268201700004I (to E.B.); U01-HL072524, R01-HL104135-04S1, U01-HL054472, U01-HL054473, U01-HL054495, U01-HL054509 and R01-HL055673-18S1 (to D.K.A.). Molecular data for the TOPMed program were supported by the NHLBI. Core support including centralized genomic read mapping and genotype calling, along with variant QC metrics and filtering were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract HHSN268201800002I). Core support including phenotype harmonization, data management, sample-identity QC and general program coordination were provided by the TOPMed Data Coordinating Center (R01HL-120393, U01HL-120393 and contract HHSN268201800001I). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed. We gratefully acknowledge the support from The Samoan Obesity, Lifestyle and Genetic Adaptations Study Group. The full study-specific acknowledgements are detailed in the Supplementary Note.

Author information

These authors contributed equally: Zilin Li, Xihao Li.

Authors and Affiliations

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Zilin Li, Xihao Li, Hufeng Zhou, Sheila M. Gaynor, Theodore Arapoglou, Corbin Quick, Rounak Dey, Eric Van Buren, Jingwen Zhang & Xihong Lin
Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, USA
Zilin Li
Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
Margaret Sunitha Selvaraj & Pradeep Natarajan
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Margaret Sunitha Selvaraj, Brian E. Cade, James B. Meigs, Brian Cade, Pradeep Natarajan & Xihong Lin
Department of Medicine, Harvard Medical School, Boston, MA, USA
Margaret Sunitha Selvaraj, Alisa K. Manning, James B. Meigs & Pradeep Natarajan
School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
Yaowu Liu
Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Han Chen, Eric Boerwinkle, Paul S. de Vries, Alanna C. Morrison & Paul de Vries
Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Han Chen
Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Ryan Sun
Dean’s Office, University of Kentucky, College of Public Health, Lexington, KY, USA
Donna K. Arnett
Division of Biostatistics, Institute for Health & Equity and Cancer Center, Medical College of Wisconsin, Milwaukee, WI, USA
Paul L. Auer
Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
Lawrence F. Bielak, Patricia A. Peyser, Jennifer A. Smith, Wei Zhao, Larry Bielak, Patricia Peyser & Jennifer Smith
Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
Joshua C. Bis, Jennifer A. Brody, Bruce M. Psaty, Joshua Bis, Jennifer Brody, Rozenn Lemaitre & Bruce Psaty
Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
Thomas W. Blackwell
Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
John Blangero, Joanne E. Curran, Ravindranath Duggirala & Harald H. H. Göring
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Eric Boerwinkle
Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden, Nicholette D. Palmer & Nicholette Palmer
Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
Brian E. Cade, Susan Redline & Brian Cade
Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
Brian E. Cade, Susan Redline & Brian Cade
Department of Biostatistics, University of Washington, Seattle, WA, USA
Matthew P. Conomos & Kenneth M. Rice
Jackson Heart Study, Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
Adolfo Correa
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
L. Adrienne Cupples, Gina Peloso, Yuxuan Wang & Gina M. Peloso
Framingham Heart Study, National Heart, Lung, and Blood Institute and Boston University, Framingham, MA, USA
L. Adrienne Cupples, Ramachandran S. Vasan, Gina Peloso & Gina M. Peloso
Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
Nora Franceschini
Department of Internal Medicine, Nephrology, Wake Forest University School of Medicine, Winston-Salem, NC, USA
Barry I. Freedman
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Xiuqing Guo, Kent D. Taylor, Jerome Rotter & Jerome I. Rotter
GeneSTAR Research Program, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Rita R. Kalyani, Brian G. Kral, Rasika A. Mathias, Lisa R. Yanek, Rita Kalyani, Rasika Mathias & Lisa Yanek
Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Charles Kooperberg, Alexander P. Reiner & Alex Reiner
Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Leslie A. Lange & Brian Kral
Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
Bridget M. Lin, Leslie Lange & Stephen Rich
Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
Ani Manichaikul & Stephen S. Rich
Metabolism Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Alisa K. Manning & Lisa Martin
Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA, USA
Alisa K. Manning
Division in Cardiology, George Washington School of Medicine and Health Sciences, Washington, DC, USA
Lisa W. Martin
Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA
James B. Meigs
Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Braxton D. Mitchell, Jeffrey R. O’Connell & Jeff O’Connell
Geriatrics Research and Education Clinical Center, Baltimore VA Medical Center, Baltimore, MD, USA
Braxton D. Mitchell
Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
May E. Montasser
Ministry of Health, Government of Samoa, Apia, Samoa
Take Naseri
Department of Epidemiology, University of Washington, Seattle, WA, USA
Bruce M. Psaty, Alexander P. Reiner, Bruce Psaty & Alex Reiner
Departments of Health Systems and Population Health, University of Washington, Seattle, WA, USA
Bruce M. Psaty & Bruce Psaty
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Laura M. Raffield
Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Susan Redline
Lutia I Puava Ae Mapu I Fagalele, Apia, Samoa
Muagututi’a Sefuiva Reupena
Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Jennifer A. Smith & Jennifer Smith
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Margaret A. Taub
Department of Medicine, Boston University School of Medicine, Boston, MA, USA
Ramachandran S. Vasan
Department of Human Genetics and Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
Daniel E. Weeks
Division of Cardiology, Beth Israel Deaconess Medical Center, Boston, MA, USA
James G. Wilson & James Wilson
Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
Cristen Willer & Cristen J. Willer
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Cristen Willer & Cristen J. Willer
Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
Cristen Willer & Cristen J. Willer
Department of Statistics, Harvard University, Cambridge, MA, USA
Xihong Lin
New York Genome Center, New York, NY, USA
Namiko Abe, Karen Bunting, Bo-Juen Chen, Heather Geiger, Soren Germer, Melissa Marton, Nicolas Robine, Alexi Runnels, Tanja Smith, Lara Winterkorn & Michael Zody
University of Michigan, Ann Arbor, MI, USA
Gonçalo Abecasis, Matthew Flickinger, Colin Gross, Sharon Kardia, Jonathon LeFaive, Jacob Pleiness, Albert Vernon Smith, Daniel Taliun, Peter VandeHaar, Jiongming Wang, Ketian Yu, Sebastian Zoellner, Ida Surakka & Brooke Wolford
Broad Institute, Cambridge, MA, USA
Francois Aguet, Kristin Ardlie, Mark Chaffin, Seung Hoan Choi, Stacey Gabriel, Namrata Gupta, Carolina Roselli, Seyedeh Maryam Zekavat, Elizabeth Atkinson, Romit Bhattacharya, Sarah Calvo, So Mi Cho, Jacqueline Dron, Amanda Elliott, Hilary Finucane, Andrea Ganna, Mary Haas, Masahiro Kanai, Amit Khera, Sumeet Khetarpal, Derek Klarin, Satoshi Koyama, Vamsi Mootha, Tetsushi Nakao, Kaavya Paruchuri, Aniruddh Patel, Mark Trinder, Md Mesbah Uddin, Sarah Urbut & Zhi Yu
Cedars Sinai, Boston, MA, USA
Christine Albert
Children’s Hospital of Philadelphia, University of Pennsylvania, Philadelphia, PA, USA
Laura Almasy
Emory University, Atlanta, GA, USA
Alvaro Alonso, Rich Johnston, Lawrence S. Phillips & Zhaohui Qin
University of Maryland, Baltimore, MD, USA
Seth Ament, Amber Beitelshees, Christy Chang, Coleen Damcott, Scott Devine, Mao Fu, Da-Wei Gong, Yue Guan, Elliott Hong, Michael Kessler, Joshua Lewis, Patrick McArdle, Tim O’Connor, James Perry, Toni Pollin, Robert Reed, Kathleen Ryan, Amol Shetty, Elizabeth Streeten, Simeon Taylor, Huichun Xu, Dawei Gong, Jicai Jiang, John McLenithan, Carole Sztalryd & Norann Zaghloul
University of Washington, Seattle, WA, USA
Peter Anderson, Jai Broome, Colleen Davis, Leslie Emery, Chris Frazar, Stephanie M. Fullerton, Stephanie Gogarten, Deepti Jain, Craig Johnson, Alyna Khan, Cathy Laurie, Cecelia Laurie, David Levine, Josh Smith, Nona Sotoodehnia, Adrienne M. Stilp, Adam Szpiro, Timothy A. Thornton, David Tirschwell, Fei Fei Wang, Bruce Weir, Quenna Wong, Gail Jarvik & Kerri Wiggins
University of Mississippi Medical Center, Jackson, MI, USA
Pramod Anugu, Lynette Ekunwe, Yan Gao, Hao Mei & Nancy Min
National Institutes of Health, Bethesda, MD, USA
Deborah Applebaum-Bowden & Paule Valery Joseph
Johns Hopkins University, Baltimore, MD, USA
Dan Arking, Dimitrios Avramopoulos, Emily Barron-Casella, Terri Beaty, Lewis Becker, James Casella, Kimberly Jones, Barry Make, Rakhi Naik, Ingo Ruczinski, Steven Salzberg, Margaret Taub & Dhananjay Vaidya
Duke University, Durham, NC, USA
Allison Ashley-Koch & Marilyn Telen
University of Alabama, Birmingham, AL, USA
Stella Aslibekyan, Bertha Hidalgo & Marguerite Ryan Irvin
Stanford University, Stanford, CA, USA
Tim Assimes, Chris Gignoux, Marco Perez & Michael Snyder
Providence Health Care, Medicine, Vancouver, BC, Canada
Najib Ayas
Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA
Adithya Balasubramanian, Huyen Dinh, Harsha Doddapaneni, Shannon Dugan-Perez, Jesse Farek, Richard Gibbs, Yi Han, Jianhong Hu, Ziad Khan, Sandra Lee, Vipin Menon, Ginger Metcalf, Zeineen Momin, Donna Muzny, Caitlin Nessner, Osuji Nkechinyere, Geoffrey Okwuonu, Mahitha Rajendran, Sejal Salvi, Jireh Santibanez, Jennifer Watt & Christie Ballantyne
Cleveland Clinic, Cleveland, OH, USA
John Barnard & Serpil Erzurum
Tempus, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Kathleen Barnes
Columbia University, New York, NY, USA
R. Graham Barr, Danish Saleheen & Andrew Moran
The Emmes Corporation, LTRC, Rockville, MD, USA
Lucas Barwick
Cleveland Clinic, Quantitative Health Sciences, Cleveland, OH, USA
Gerald Beck
Johns Hopkins University, Medicine, Baltimore, MD, USA
Diane Becker
National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Rebecca Beer, Weiniu Gan, Cashell Jaquish, Andrew Johnson, Dan Levy, James Luo, Julie Mikulla, George Papanicolaou, Pankaj Qasba & Christopher O’Donnell
Boston University, Massachusetts General Hospital, Boston University School of Medicine, Boston, MA, USA
Emelia Benjamin
University of Pittsburgh, Pittsburgh, PA, USA
Takis Benos, Mark Geraci, Mark Gladwin, Ryan L. Minster, Frank Sciurba, Jenna Carlson & Samantha Rosenthal
Fundação de Hematologia e Hemoterapia de Pernambuco - Hemope, Recife, Brazil
Marcos Bezerra
University of Utah, Obstetrics and Gynecology, Salt Lake City, UT, USA
Nathan Blue
National Jewish Health, National Jewish Health, Denver, CO, USA
Russell Bowler
Medical College of Wisconsin, Pediatrics, Milwaukee, WI, USA
Ulrich Broeckel
University of Texas Health at Houston, Pediatrics, Houston, TX, USA
Deborah Brown
University of California, San Francisco, San Francisco, CA, USA
Esteban Burchard, Ryan Hernandez, Shannon Kelly & Elad Ziv
Stanford University, Biomedical Data Science, Stanford, CA, USA
Carlos Bustamante
University of Washington, Biostatistics, Seattle, WA, USA
Erin Buth, Ben Heavner, Susanne May, Caitlin McHugh, Sarah C. Nelson & Kayleen Williams
University of Colorado at Denver, Denver, CO, USA
Jonathan Cardwell, Sameer Chavan, Michelle Daya, Shanshan Gao, Daniel Grine, John Hokanson, Ethan Lange, Susan Mathai, Bonnie Neltner, Meher Preethi Boorgula, Pamela Russell, David Schwartz, Aniket Shetty, Tarik Walker, Avram Walts & Ivana Yang
Brigham & Women’s Hospital, Boston, MA, USA
Vincent Carey, Juan P. Casas Romero, Michael Cho, Dawn DeMeo, Auyon Ghosh, Brian Hobbs, Meryl LeBoff, Jiwon Lee, JoAnn Manson, Dandi Qiao, Edwin Silverman, Tamar Sofer, Jody Sylvia, Carla Wilson & Shamil R. Sunyaev
University of Montreal, Quebec City, Quebec, USA
Julie Carrier
University of Mississippi, Medicine, Jackson, MS, USA
April Carson
Washington State University, Pullman, WA, USA
Cara Carty
University of California, Los Angeles, Los Angeles, CA, USA
Richard Casaburi, Carolyn Crandall & Karol Watson
Brigham & Women’s Hospital, Medicine, Boston, MA, USA
Peter Castaldi & Matt Moll
National Taiwan University, Taipei, Taiwan
Yi-Cheng Chang
Brigham & Women’s Hospital, Division of Preventive Medicine, Boston, MA, USA
Daniel Chasman
University of Virginia, Charlottesville, VA, USA
Wei-Min Chen, Charles Farber, Josyf C. Mychaleckyj & Aakrosh Ratan
Lundquist Institute, Torrance, CA, USA
Yii-Der Ida Chen, Xiaohui Li & Henry Lin
National Taiwan University, National Taiwan University Hospital, Taipei, Taiwan
Lee-Ming Chuang
Cleveland Clinic, Cleveland Clinic, Cleveland, OH, USA
Mina Chung
National Health Research Institute Taiwan, Miaoli County, Taiwan
Ren-Hua Chung & I-Shou Chang
Broad Institute, Metabolomics Platform, Cambridge, MA, USA
Clary Clish
Cleveland Clinic, Immunity and Immunology, Cleveland, OH, USA
Suzy Comhair
University of Vermont, Burlington, VT, USA
Elaine Cornell & Jon Peter Durda
National Jewish Health, Denver, CO, USA
James Crapo, Elizabeth Regan & Snow Xueyan Zhao
University of Michigan, Internal Medicine, Ann Arbor, MI, USA
Jeffrey Curtis & Sarah Graham
Vitalant Research Institute, San Francisco, CA, USA
Brian Custer
University of Illinois at Chicago, Chicago, IL, USA
Dawood Darbar
University of Chicago, Chicago, IL, USA
Sean David
Mayo Clinic, Health Quantitative Sciences Research, Rochester, MN, USA
Mariza de Andrade
Washington University in St Louis, Department of Medicine, Cardiovascular Division, St. Louis, MO, USA
Lisa de las Fuentes
Vanderbilt University, Nashville, TN, USA
Michael DeBaun & Yingchang Lu
University of Cincinnati, Cincinnati, OH, USA
Ranjan Deka
University of North Carolina, Chapel Hill, NC, USA
Qing Duan, Yun Li, Kari North & Ann Von Holle
Washington University in St Louis, Genetics, St Louis, MO, USA
Susan K. Dutcher
Brown University, Providence, RI, USA
Charles Eaton & Stephen McGarvey
Harvard University, Channing Division of Network Medicine, Cambridge, MA, USA
Adel El Boueiz
Massachusetts General Hospital, Boston, MA, USA
Patrick Ellinor, Steven Lubitz, Lu-Chen Weng, Corneliu Bodea & James Pirruccello
National Jewish Health, Center for Genes, Environment and Health, Denver, CO, USA
Tasha Fingerlin
University of Texas Health at Houston, Houston, TX, USA
Myriam Fornage, James Hixson & Goo Jun
Washington University in St Louis, St Louis, MI, USA
Lucinda Fulton, C. Charles Gu, D. C. Rao, Karen Schwander & Yun Ju Sung
Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Margery Gass, Mary Pettinger & Ying Zhou
Icahn School of Medicine at Mount Sinai, New York, NY, USA
Bruce Gelb, Eimear Kenny, Girish Nadkarni, Michael Preuss, Ron Do & Marie Verbanck
Beth Israel Deaconess Medical Center, Boston, MA, USA
Robert Gerszten
Boston Children’s Hospital, Harvard Medical School, Department of Psychiatry, Boston, MA, USA
David Glahn
University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Sharon Graw, Luisa Mestroni & Matthew Taylor
Mass General Brigham, Obstetrics and Gynecology, Boston, MA, USA
Kathryn J. Gray
University of Mississippi, Cardiology, Jackson, MI, USA
Michael Hall
University of Calgary, Medicine, Calgary, Alberta, Canada
Patrick Hanly
University of Maryland, Genetics, Philadelphia, PA, USA
Daniel Harris
Yale University, Department of Chronic Disease Epidemiology, New Haven, CT, USA
Nicola L. Hawley
Tulane University, New Orleans, LA, USA
Jiang He & Changwei Li
University of Washington, Epidemiology, Seattle, WA, USA
Susan Heckbert & Nicholas Smith
Wake Forest Baptist Health, Winston-Salem, NC, USA
David Herrington
Brigham & Women’s Hospital, Channing Division of Network Medicine, Boston, MA, USA
Craig Hersh
University of Iowa, Iowa City, IA, USA
Karin Hoth, Robert Wallace & Wei Bao
National Health Research Institute Taiwan, Institute of Population Health Sciences, NHRI, Miaoli County, Taiwan
Chao (Agnes) Hsiung
Tri-Service General Hospital National Defense Medical Center, Taipei City, Taiwan
Yi-Jen Hung
Blood Works Northwest, Seattle, WA, USA
Haley Huston & Sarah Ruuska
Taichung Veterans General Hospital Taiwan, Taichung City, Taiwan
Chii Min Hwu, Wen-Jane Lee & Wayne Hui-Heng Sheu
Oklahoma State University Medical Center, Internal Medicine, DIvision of Endocrinology, Diabetes and Metabolism, Columbus, OH, USA
Rebecca Jackson
Blood Works Northwest, Research Institute, Seattle, WA, USA
Jill Johnsen
University of Michigan, Biostatistics, Ann Arbor, MI, USA
Hyun Min Kang & Joshua Weinstock
Albert Einstein College of Medicine, New York, NY, USA
Robert Kaplan & Sylvia Smoller
Harvard University, Cambridge, MA, USA
Wonji Kim & Sean McFarland
McGill University, Montréal, Quebec, Canada
John Kimoff
University of Colorado at Denver, Epidemiology, Aurora, CO, USA
Greg Kinney
Blood Works Northwest, Medicine, Seattle, WA, USA
Barbara Konkle
Loyola University, Public Health Sciences, Maywood, IL, USA
Holly Kramer
Harvard School of Public Health, Biostatistics, Boston, MA, USA
Christoph Lange
Boston University, University of Massachusetts Chan Medical School, Worcester, MA, USA
Honghuang Lin
Brown University, Epidemiology and Medicine, Providence, RI, USA
Simin Liu
Duke University, Cardiology, Durham, NC, USA
Yongmei Liu
Stanford University, Cardiovascular Institute, Stanford, CA, USA
Yu Liu
Icahn School of Medicine at Mount Sinai, The Charles Bronfman Institute for Personalized Medicine, New York, NY, USA
Ruth J. F. Loos
Boston University, Boston, MA, USA
Kathryn Lunetta
Ohio State University, Division of Pulmonary, Critical Care and Sleep Medicine, Columbus, OH, USA
Ulysses Magalang
University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
Michael Mahaney
University of Alabama at Birmingham, Birmingham, AL, USA
Merry-Lynn McDonald
University of Washington, Genome Sciences, Seattle, WA, USA
Daniel McGoldrick
RTI International, North Carolina, NC, USA
Becky McNeil & Ravi Mathur
University of Arizona, Tucson, AZ, USA
Deborah A. Meyers
Stanford University, Center For Sleep Sciences and Medicine, Palo Alto, CA, USA
Emmanuel Mignot
National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
Mollie Minear
Oklahoma Medical Research Foundation, Genes and Human Disease, Oklahoma City, OK, USA
Courtney Montgomery
Howard University, Washington, DC, USA
Sergei Nekhai
University of Washington, Department of Genome Sciences, Seattle, WA, USA
Deborah Nickerson
University at Buffalo, Buffalo, NY, USA
Heather Ochs-Balcom
University of Pennsylvania, Division of Sleep Medicine/Department of Medicine, Philadelphia, PA, USA
Allan Pack
Stanford University, Stanford Cardiovascular Institute, Stanford, CA, USA
David T. Paik & Joseph Wu
University of Minnesota, Minneapolis, MN, USA
James Pankow, Michael Tsai & Scott Vrieze
RTI International, Biostatistics and Epidemiology Division, Research Triangle Park, NC, USA
Cora Parker
University of Texas Rio Grande Valley School of Medicine, Edinburg, TX, USA
Juan Manuel Peralta
Fred Hutchinson Cancer Research Center, Fred Hutch and UW, Seattle, WA, USA
Ulrike Peters
Johns Hopkins University, Cardiology/Medicine, Baltimore, MD, USA
Wendy Post
University of Colorado at Denver, Medicine, Denver, CO, USA
Julia Powers Becker
University of Colorado at Denver, CCPM, Denver, CO, USA
Nicholas Rafaels
Northwestern University, Chicago, IL, USA
Laura Rasmussen-Torvik & John Wilkins
New York Genome Center, New York Genome Center, New York City, NY, USA
Catherine Reeves
University of Ottawa, Sleep Research Unit, University of Ottawa Institute for Mental Health Research, Ottawa, Ontario, Canada
Rebecca Robillard
Vanderbilt University, Medicine, Pharmacology, Biomedicla Informatics, Nashville, TN, USA
Dan Roden
Universidade de Sao Paulo, Faculdade de Medicina, Sao Paulo, Brazil
Ester Cerdeira Sabino
University of Maryland, Pathology, Seattle, WA, USA
Shabnam Salimi
Lundquist Institute, TGPS, Torrance, CA, USA
Kevin Sandow
Harvard University, Division of Hematology/Oncology, Boston, MA, USA
Vijay G. Sankaran
Harvard Medical School, Genetics, Boston, MA, USA
Christine Seidman
Harvard Medical School, Boston, MA, USA
Jonathan Seidman & Roby Joehanes
Université Laval, Quebec City, Quebec, Canada
Frédéric Sériès
Emory University, Pediatrics, Atlanta, GA, USA
Vivien Sheehan
Emory University, Human Genetics, Atlanta, GA, USA
Stephanie L. Sherman
Vanderbilt University, Medicine/Cardiology, Nashville, TN, USA
M. Benjamin Shoemaker
UMass Memorial Medical Center, Worcester, MA, USA
Brian Silver
University of Saskatchewan, Saskatoon, Saskatchewan, Canada
Robert Skomro
Wake Forest Baptist Health, Biostatistical Sciences, Winston-Salem, NC, USA
Beverly Snively
University of Colorado at Denver, Genomic Cardiology, Aurora, CO, USA
Garrett Storm
Brigham & Women’s Hospital, Channing Department of Medicine, Boston, MA, USA
Jessica Lasky Su
Stanford University, Genetics, Stanford, CA, USA
Hua Tang
University of Washington, Department of Genome Sciences, Seattle, WA, USA
Machiko Threlkeld
Fred Hutchinson Cancer Research Center, Cancer Prevention Division of Public Health Sciences, Seattle, WA, USA
Lesley Tinker
University of Pennsylvania, Genetics, Philadelphia, PA, USA
Sarah Tishkoff
University of Alabama, Biostatistics, Birmingham, AL, USA
Hemant Tiwari
University of Washington, Department of Biostatistics, Seattle, WA, USA
Catherine Tong
University of Vermont, Pathology & Laboratory Medicine, Burlington, VT, USA
Russell Tracy
University of Southern California, USC Methylation Characterization Center, University of Southern California, Los Angeles, CA, USA
David Van Den Berg
Brigham & Women’s Hospital, Mass General Brigham, Boston, MA, USA
Heming Wang
Brigham & Women’s Hospital, Channing Division of Network Medicine, Department of Medicine, Boston, MA, USA
Scott T. Weiss
Indiana University, Epidemiology, Indianapolis, IN, USA
Jennifer Wessel
Henry Ford Health System, Detroit, MI, USA
L. Keoki Williams
University of Pittsburgh, Medicine, Pittsburgh, PA, USA
Yingze Zhang
Case Western Reserve University, Department of Population and Quantitative Health Sciences, Cleveland, OH, USA
Xiaofeng Zhu
Virginia Commonwealth University, Richmond, VA, USA
Ana F. Diallo
Westat, Atlanta, GA, USA
Caitlin Floyd, Scott Heemann & Amy Miller
Saarland University Medical Center, Homburg, Germany
Bernhard Haring
Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, USA
Blanca Himes
Verve Therapeutics, Cambridge, MA, USA
Sekar Kathiresan

Authors

Zilin Li
View author publications
You can also search for this author in PubMed Google Scholar
Xihao Li
View author publications
You can also search for this author in PubMed Google Scholar
Hufeng Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Sheila M. Gaynor
View author publications
You can also search for this author in PubMed Google Scholar
Margaret Sunitha Selvaraj
View author publications
You can also search for this author in PubMed Google Scholar
Theodore Arapoglou
View author publications
You can also search for this author in PubMed Google Scholar
Corbin Quick
View author publications
You can also search for this author in PubMed Google Scholar
Yaowu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Han Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Rounak Dey
View author publications
You can also search for this author in PubMed Google Scholar
Donna K. Arnett
View author publications
You can also search for this author in PubMed Google Scholar
Paul L. Auer
View author publications
You can also search for this author in PubMed Google Scholar
Lawrence F. Bielak
View author publications
You can also search for this author in PubMed Google Scholar
Joshua C. Bis
View author publications
You can also search for this author in PubMed Google Scholar
Thomas W. Blackwell
View author publications
You can also search for this author in PubMed Google Scholar
John Blangero
View author publications
You can also search for this author in PubMed Google Scholar
Eric Boerwinkle
View author publications
You can also search for this author in PubMed Google Scholar
Donald W. Bowden
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer A. Brody
View author publications
You can also search for this author in PubMed Google Scholar
Brian E. Cade
View author publications
You can also search for this author in PubMed Google Scholar
Matthew P. Conomos
View author publications
You can also search for this author in PubMed Google Scholar
Adolfo Correa
View author publications
You can also search for this author in PubMed Google Scholar
L. Adrienne Cupples
View author publications
You can also search for this author in PubMed Google Scholar
Joanne E. Curran
View author publications
You can also search for this author in PubMed Google Scholar
Paul S. de Vries
View author publications
You can also search for this author in PubMed Google Scholar
Ravindranath Duggirala
View author publications
You can also search for this author in PubMed Google Scholar
Nora Franceschini
View author publications
You can also search for this author in PubMed Google Scholar
Barry I. Freedman
View author publications
You can also search for this author in PubMed Google Scholar
Harald H. H. Göring
View author publications
You can also search for this author in PubMed Google Scholar
Xiuqing Guo
View author publications
You can also search for this author in PubMed Google Scholar
Rita R. Kalyani
View author publications
You can also search for this author in PubMed Google Scholar
Charles Kooperberg
View author publications
You can also search for this author in PubMed Google Scholar
Brian G. Kral
View author publications
You can also search for this author in PubMed Google Scholar
Leslie A. Lange
View author publications
You can also search for this author in PubMed Google Scholar
Bridget M. Lin
View author publications
You can also search for this author in PubMed Google Scholar
Ani Manichaikul
View author publications
You can also search for this author in PubMed Google Scholar
Alisa K. Manning
View author publications
You can also search for this author in PubMed Google Scholar
Lisa W. Martin
View author publications
You can also search for this author in PubMed Google Scholar
Rasika A. Mathias
View author publications
You can also search for this author in PubMed Google Scholar
James B. Meigs
View author publications
You can also search for this author in PubMed Google Scholar
Braxton D. Mitchell
View author publications
You can also search for this author in PubMed Google Scholar
May E. Montasser
View author publications
You can also search for this author in PubMed Google Scholar
Alanna C. Morrison
View author publications
You can also search for this author in PubMed Google Scholar
Take Naseri
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey R. O’Connell
View author publications
You can also search for this author in PubMed Google Scholar
Nicholette D. Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Patricia A. Peyser
View author publications
You can also search for this author in PubMed Google Scholar
Bruce M. Psaty
View author publications
You can also search for this author in PubMed Google Scholar
Laura M. Raffield
View author publications
You can also search for this author in PubMed Google Scholar
Susan Redline
View author publications
You can also search for this author in PubMed Google Scholar
Alexander P. Reiner
View author publications
You can also search for this author in PubMed Google Scholar
Muagututi’a Sefuiva Reupena
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth M. Rice
View author publications
You can also search for this author in PubMed Google Scholar
Stephen S. Rich
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer A. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Kent D. Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Margaret A. Taub
View author publications
You can also search for this author in PubMed Google Scholar
Ramachandran S. Vasan
View author publications
You can also search for this author in PubMed Google Scholar
Daniel E. Weeks
View author publications
You can also search for this author in PubMed Google Scholar
James G. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Lisa R. Yanek
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jerome I. Rotter
View author publications
You can also search for this author in PubMed Google Scholar
Cristen J. Willer
View author publications
You can also search for this author in PubMed Google Scholar
Pradeep Natarajan
View author publications
You can also search for this author in PubMed Google Scholar
Gina M. Peloso
View author publications
You can also search for this author in PubMed Google Scholar
Xihong Lin
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium

Namiko Abe
, Gonçalo Abecasis
, Francois Aguet
, Christine Albert
, Laura Almasy
, Alvaro Alonso
, Seth Ament
, Peter Anderson
, Pramod Anugu
, Deborah Applebaum-Bowden
, Kristin Ardlie
, Dan Arking
, Allison Ashley-Koch
, Stella Aslibekyan
, Tim Assimes
, Dimitrios Avramopoulos
, Najib Ayas
, Adithya Balasubramanian
, John Barnard
, Kathleen Barnes
, R. Graham Barr
, Emily Barron-Casella
, Lucas Barwick
, Terri Beaty
, Gerald Beck
, Diane Becker
, Lewis Becker
, Rebecca Beer
, Amber Beitelshees
, Emelia Benjamin
, Takis Benos
, Marcos Bezerra
, Nathan Blue
, Russell Bowler
, Ulrich Broeckel
, Jai Broome
, Deborah Brown
, Karen Bunting
, Esteban Burchard
, Carlos Bustamante
, Erin Buth
, Jonathan Cardwell
, Vincent Carey
, Julie Carrier
, April Carson
, Cara Carty
, Richard Casaburi
, Juan P. Casas Romero
, James Casella
, Peter Castaldi
, Mark Chaffin
, Christy Chang
, Yi-Cheng Chang
, Daniel Chasman
, Sameer Chavan
, Bo-Juen Chen
, Wei-Min Chen
, Yii-Der Ida Chen
, Michael Cho
, Seung Hoan Choi
, Lee-Ming Chuang
, Mina Chung
, Ren-Hua Chung
, Clary Clish
, Suzy Comhair
, Elaine Cornell
, Carolyn Crandall
, James Crapo
, Jeffrey Curtis
, Brian Custer
, Coleen Damcott
, Dawood Darbar
, Sean David
, Colleen Davis
, Michelle Daya
, Mariza de Andrade
, Lisa de las Fuentes
, Michael DeBaun
, Ranjan Deka
, Dawn DeMeo
, Scott Devine
, Huyen Dinh
, Harsha Doddapaneni
, Qing Duan
, Shannon Dugan-Perez
, Jon Peter Durda
, Susan K. Dutcher
, Charles Eaton
, Lynette Ekunwe
, Adel El Boueiz
, Patrick Ellinor
, Leslie Emery
, Serpil Erzurum
, Charles Farber
, Jesse Farek
, Tasha Fingerlin
, Matthew Flickinger
, Myriam Fornage
, Chris Frazar
, Mao Fu
, Stephanie M. Fullerton
, Lucinda Fulton
, Stacey Gabriel
, Weiniu Gan
, Shanshan Gao
, Yan Gao
, Margery Gass
, Heather Geiger
, Bruce Gelb
, Mark Geraci
, Soren Germer
, Robert Gerszten
, Auyon Ghosh
, Richard Gibbs
, Chris Gignoux
, Mark Gladwin
, David Glahn
, Stephanie Gogarten
, Da-Wei Gong
, Sharon Graw
, Kathryn J. Gray
, Daniel Grine
, Colin Gross
, C. Charles Gu
, Yue Guan
, Namrata Gupta
, Michael Hall
, Yi Han
, Patrick Hanly
, Daniel Harris
, Nicola L. Hawley
, Jiang He
, Ben Heavner
, Susan Heckbert
, Ryan Hernandez
, David Herrington
, Craig Hersh
, Bertha Hidalgo
, James Hixson
, Brian Hobbs
, John Hokanson
, Elliott Hong
, Karin Hoth
, Chao (Agnes) Hsiung
, Jianhong Hu
, Yi-Jen Hung
, Haley Huston
, Chii Min Hwu
, Marguerite Ryan Irvin
, Rebecca Jackson
, Deepti Jain
, Cashell Jaquish
, Jill Johnsen
, Andrew Johnson
, Craig Johnson
, Rich Johnston
, Kimberly Jones
, Hyun Min Kang
, Robert Kaplan
, Sharon Kardia
, Shannon Kelly
, Eimear Kenny
, Michael Kessler
, Alyna Khan
, Ziad Khan
, Wonji Kim
, John Kimoff
, Greg Kinney
, Barbara Konkle
, Holly Kramer
, Christoph Lange
, Ethan Lange
, Cathy Laurie
, Cecelia Laurie
, Meryl LeBoff
, Jiwon Lee
, Sandra Lee
, Wen-Jane Lee
, Jonathon LeFaive
, David Levine
, Dan Levy
, Joshua Lewis
, Xiaohui Li
, Yun Li
, Henry Lin
, Honghuang Lin
, Simin Liu
, Yongmei Liu
, Yu Liu
, Ruth J. F. Loos
, Steven Lubitz
, Kathryn Lunetta
, James Luo
, Ulysses Magalang
, Michael Mahaney
, Barry Make
, JoAnn Manson
, Melissa Marton
, Susan Mathai
, Susanne May
, Patrick McArdle
, Merry-Lynn McDonald
, Sean McFarland
, Daniel McGoldrick
, Caitlin McHugh
, Becky McNeil
, Hao Mei
, Vipin Menon
, Luisa Mestroni
, Ginger Metcalf
, Deborah A. Meyers
, Emmanuel Mignot
, Julie Mikulla
, Nancy Min
, Mollie Minear
, Ryan L. Minster
, Matt Moll
, Zeineen Momin
, Courtney Montgomery
, Donna Muzny
, Josyf C. Mychaleckyj
, Girish Nadkarni
, Rakhi Naik
, Sergei Nekhai
, Sarah C. Nelson
, Bonnie Neltner
, Caitlin Nessner
, Deborah Nickerson
, Osuji Nkechinyere
, Kari North
, Tim O’Connor
, Heather Ochs-Balcom
, Geoffrey Okwuonu
, Allan Pack
, David T. Paik
, James Pankow
, George Papanicolaou
, Cora Parker
, Juan Manuel Peralta
, Marco Perez
, James Perry
, Ulrike Peters
, Lawrence S. Phillips
, Jacob Pleiness
, Toni Pollin
, Wendy Post
, Julia Powers Becker
, Meher Preethi Boorgula
, Michael Preuss
, Pankaj Qasba
, Dandi Qiao
, Zhaohui Qin
, Nicholas Rafaels
, Mahitha Rajendran
, D. C. Rao
, Laura Rasmussen-Torvik
, Aakrosh Ratan
, Robert Reed
, Catherine Reeves
, Elizabeth Regan
, Rebecca Robillard
, Nicolas Robine
, Dan Roden
, Carolina Roselli
, Ingo Ruczinski
, Alexi Runnels
, Pamela Russell
, Sarah Ruuska
, Kathleen Ryan
, Ester Cerdeira Sabino
, Danish Saleheen
, Shabnam Salimi
, Sejal Salvi
, Steven Salzberg
, Kevin Sandow
, Vijay G. Sankaran
, Jireh Santibanez
, Karen Schwander
, David Schwartz
, Frank Sciurba
, Christine Seidman
, Jonathan Seidman
, Frédéric Sériès
, Vivien Sheehan
, Stephanie L. Sherman
, Amol Shetty
, Aniket Shetty
, Wayne Hui-Heng Sheu
, M. Benjamin Shoemaker
, Brian Silver
, Edwin Silverman
, Robert Skomro
, Albert Vernon Smith
, Josh Smith
, Nicholas Smith
, Tanja Smith
, Sylvia Smoller
, Beverly Snively
, Michael Snyder
, Tamar Sofer
, Nona Sotoodehnia
, Adrienne M. Stilp
, Garrett Storm
, Elizabeth Streeten
, Jessica Lasky Su
, Yun Ju Sung
, Jody Sylvia
, Adam Szpiro
, Daniel Taliun
, Hua Tang
, Margaret Taub
, Matthew Taylor
, Simeon Taylor
, Marilyn Telen
, Timothy A. Thornton
, Machiko Threlkeld
, Lesley Tinker
, David Tirschwell
, Sarah Tishkoff
, Hemant Tiwari
, Catherine Tong
, Russell Tracy
, Michael Tsai
, Dhananjay Vaidya
, David Van Den Berg
, Peter VandeHaar
, Scott Vrieze
, Tarik Walker
, Robert Wallace
, Avram Walts
, Fei Fei Wang
, Heming Wang
, Jiongming Wang
, Karol Watson
, Jennifer Watt
, Joshua Weinstock
, Bruce Weir
, Scott T. Weiss
, Lu-Chen Weng
, Jennifer Wessel
, Kayleen Williams
, L. Keoki Williams
, Carla Wilson
, Lara Winterkorn
, Quenna Wong
, Joseph Wu
, Huichun Xu
, Ivana Yang
, Ketian Yu
, Seyedeh Maryam Zekavat
, Yingze Zhang
, Snow Xueyan Zhao
, Xiaofeng Zhu
, Elad Ziv
, Michael Zody
& Sebastian Zoellner

TOPMed Lipids Working Group

Gonçalo Abecasis
, Donna K. Arnett
, Stella Aslibekyan
, Tim Assimes
, Elizabeth Atkinson
, Christie Ballantyne
, Wei Bao
, Amber Beitelshees
, Romit Bhattacharya
, Larry Bielak
, Joshua Bis
, Corneliu Bodea
, Eric Boerwinkle
, Donald W. Bowden
, Jennifer Brody
, Brian Cade
, Sarah Calvo
, Jenna Carlson
, I-Shou Chang
, Yii-Der Ida Chen
, So Mi Cho
, Seung Hoan Choi
, Ren-Hua Chung
, Adolfo Correa
, L. Adrienne Cupples
, Coleen Damcott
, Paul de Vries
, Ana F. Diallo
, Ron Do
, Jacqueline Dron
, Amanda Elliott
, Hilary Finucane
, Caitlin Floyd
, Mao Fu
, Andrea Ganna
, Dawei Gong
, Sarah Graham
, Mary Haas
, Bernhard Haring
, Jiang He
, Scott Heemann
, Blanca Himes
, James Hixson
, Marguerite Ryan Irvin
, Gail Jarvik
, Jicai Jiang
, Roby Joehanes
, Paule Valery Joseph
, Goo Jun
, Rita Kalyani
, Masahiro Kanai
, Sharon Kardia
, Sekar Kathiresan
, Amit Khera
, Sumeet Khetarpal
, Derek Klarin
, Charles Kooperberg
, Satoshi Koyama
, Brian Kral
, Leslie Lange
, Cathy Laurie
, Rozenn Lemaitre
, Zilin Li
, Xihao Li
, Changwei Li
, Xihong Lin
, Yingchang Lu
, Michael Mahaney
, Ani Manichaikul
, Lisa Martin
, Rasika Mathias
, Ravi Mathur
, Stephen McGarvey
, John McLenithan
, Julie Mikulla
, Amy Miller
, Braxton D. Mitchell
, May E. Montasser
, Vamsi Mootha
, Andrew Moran
, Alanna C. Morrison
, Tetsushi Nakao
, Pradeep Natarajan
, Kari North
, Jeff O’Connell
, Christopher O’Donnell
, Nicholette Palmer
, Kaavya Paruchuri
, Aniruddh Patel
, Gina Peloso
, James Perry
, Ulrike Peters
, Mary Pettinger
, Patricia Peyser
, James Pirruccello
, Toni Pollin
, Michael Preuss
, Bruce Psaty
, Susan Redline
, Robert Reed
, Alex Reiner
, Stephen Rich
, Samantha Rosenthal
, Jerome Rotter
, Margaret Sunitha Selvaraj
, Wayne Hui-Heng Sheu
, Jennifer Smith
, Tamar Sofer
, Adrienne M. Stilp
, Shamil R. Sunyaev
, Ida Surakka
, Carole Sztalryd
, Hua Tang
, Kent D. Taylor
, Mark Trinder
, Michael Tsai
, Md Mesbah Uddin
, Sarah Urbut
, Eric Van Buren
, Marie Verbanck
, Ann Von Holle
, Heming Wang
, Yuxuan Wang
, Kerri Wiggins
, John Wilkins
, Cristen Willer
, James Wilson
, Brooke Wolford
, Huichun Xu
, Lisa Yanek
, Zhi Yu
, Norann Zaghloul
, Seyedeh Maryam Zekavat
, Jingwen Zhang
& Ying Zhou

Contributions

Z.L., X. Li and X. Lin designed the experiments. Z.L., X. Li, H.Z. and X. Lin performed the experiments. Z.L., X. Li, H.Z., S.M.G., M.S.S., T.A., C.Q., Y.L., H.C., R.S., R.D., D.K.A., L.F.B., J.C.B., T.W.B., J.B., E.B., D.W.B., J.A.B., B.E.C., M.P.C., A.C., L.A.C., J.E.C., P.S.d.V., R.D., B.I.F., H.H.H.G., X.G., R.R.K., C.L.K., B.G.K., L.A.L., A.W.M., L.W.M., B.D.M., M.E.M., A.C.M., T.N., J.R.O., N.D.P., P.A.P., B.M.P., L.M.R., S.R., A.P.R., M.S.R., K.M.R., S.S.R., J.A.S., K.D.T., R.S.V., D.E.W., J.G.W., L.R.Y., W.Z., J.I.R., C.J.W., P.N., G.M.P. and X. Lin acquired, analyzed or interpreted data. G.M.P., P.N. and NHLBI TOPMed Lipids Working Group provided administrative, technical or material support. Z.L., X. Li, S.M.G. and X. Lin drafted the manuscript and revised it according to co-authors’ suggestions. All authors critically reviewed the manuscript, suggested revisions as needed, and approved the final version.

Corresponding authors

Correspondence to Zilin Li or Xihong Lin.

Ethics declarations

Competing interests

S.M.G. is now an employee of Regeneron Genetics Center. J.B.M. is an Academic Associate for Quest Diagnostics R&D. For B.D.M., the Amish Research Program receives partial support from Regeneron Pharmaceuticals. M.E.M. reports grants from Regeneron Pharmaceutical unrelated to the present work. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. L.M.R. is a consultant for the TOPMed Administrative Coordinating Center (through Westat). S.R. reports support from Jazz Pharma, Eli Lilly and Apnimed, unrelated to the present work. The spouse of C.J.W. works at Regeneron Pharmaceuticals. P.N. reports investigator-initiated grants from Amgen, Apple, AstraZeneca, Boston Scientific and Novartis, personal fees from Apple, AstraZeneca, Blackstone Life Sciences, Foresite Labs, Novartis and Roche/Genentech, is a co-founder of TenSixteen Bio, is a shareholder of geneXwell and TenSixteen Bio, and reports spousal employment at Vertex, all unrelated to the present work. X. Lin is a consultant of AbbVie Pharmaceuticals and Verily Life Sciences. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Yukinori Okada and the other, anonymous, reviewer for their contribution to the peer review of this work. Primary Handling editor: Lin Tang, in collaboration with the Nature Methods team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Rare variant (MAF < 0.01) distribution in the discovery phase using TOPMed cohorts (n = 21,015).

Variant categories are defined by GENCODE VEP categories.

Extended Data Fig. 2 Manhattan plots and Q-Q plots for unconditional gene-centric noncoding analysis and sliding window analysis of high-density lipoprotein cholesterol (HDL-C) in the discovery phase (n = 21,015).

a, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding gene. The horizontal line indicates a genome-wide STAAR-O P value threshold of 3.57 × 10⁻⁷. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {20,000 \times 7} \right) = 3.57 \times 10^{ - 7}\)). Different symbols represent the STAAR-O P value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. b, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding gene. Different symbols represent the STAAR-O P-value of the gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). c, Manhattan plots for unconditional gene-centric noncoding analysis of ncRNA gene. The horizontal line indicates a genome-wide STAAR-O P value threshold of 2.50 × 10⁻⁶. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/20,000 = 2.50 \times 10^{ - 6}\)). d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of ncRNA gene. e, Manhattan plot for 2-kb sliding windows. The horizontal line indicates a genome-wide P value threshold of 1.88 × 10⁻⁸. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {2.66 \times 10^6} \right) = 1.88 \times 10^{ - 8}\)). f, Quantile-quantile plot for 2-kb sliding windows. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, STAAR-O is a two-sided test.

Extended Data Fig. 3 Manhattan plots and Q-Q plots for unconditional gene-centric noncoding analysis and sliding window analysis of low-density lipoprotein cholesterol (LDL-C) in the discovery phase (n=21,015).

a, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 3.57 × 10⁻⁷. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {20,000 \times 7} \right) = 3.57 \times 10^{ - 7}\)). Different symbols represent the STAAR-O P-value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. b, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding gene. Different symbols represent the STAAR-O P-value of the gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). c, Manhattan plots for unconditional gene-centric noncoding analysis of ncRNA gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 2.50 × 10⁻⁶. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/20,000 = 2.50 \times 10^{ - 6}\)). d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of ncRNA gene. e, Manhattan plot for 2-kb sliding windows. The horizontal line indicates a genome-wide P-value threshold of 1.88 × 10⁻⁸. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {2.66 \times 10^6} \right) = 1.88 \times 10^{ - 8}\)). f, Quantile-quantile plot for 2-kb sliding windows. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, STAAR-O is a two-sided test.

Extended Data Fig. 4 Manhattan plots and Q-Q plots for unconditional gene-centric noncoding analysis and sliding window analysis of triglycerides (TGs) in the discovery phase (n=21,015).

a, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 3.57 × 10⁻⁷. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {20,000 \times 7} \right) = 3.57 \times 10^{ - 7}\)). Different symbols represent the STAAR-O P-value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. b, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding gene. Different symbols represent the STAAR-O P-value of the gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). c, Manhattan plots for unconditional gene-centric noncoding analysis of ncRNA gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 2.50 × 10⁻⁶. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/20,000 = 2.50 \times 10^{ - 6}\)). d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of ncRNA gene. e, Manhattan plot for 2-kb sliding windows. The horizontal line indicates a genome-wide P-value threshold of 1.88 × 10⁻⁸. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {2.66 \times 10^6} \right) = 1.88 \times 10^{ - 8}\)). f, Quantile-quantile plot for 2-kb sliding windows. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, STAAR-O is a two-sided test.

Extended Data Fig. 5 Manhattan plots and Q-Q plots for unconditional gene-centric noncoding analysis and sliding window analysis of total cholesterol (TC) in the discovery phase (n=21,015).

a, Manhattan plots for unconditional gene-centric noncoding analysis of protein-coding gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 3.57 × 10⁻⁷. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {20,000 \times 7} \right) = 3.57 \times 10^{ - 7}\)). Different symbols represent the STAAR-O P-value of the protein-coding gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). Promoter_CAGE and promoter_DHS are the promoters with overlap of Cap Analysis of Gene Expression (CAGE) sites and DNase hypersensitivity (DHS) sites for a given gene, respectively. Enhancer_CAGE and enhancer_DHS are the enhancers in GeneHancer predicted regions with the overlap of CAGE sites and DHS sites for a given gene, respectively. b, Quantile-quantile plots for unconditional gene-centric noncoding analysis of protein-coding gene. Different symbols represent the STAAR-O P-value of the gene using different functional categories (upstream, downstream, UTR, promoter_CAGE, promoter_DHS, enhancer_CAGE, enhancer_DHS). c, Manhattan plots for unconditional gene-centric noncoding analysis of ncRNA gene. The horizontal line indicates a genome-wide STAAR-O P-value threshold of 2.50 × 10⁻⁶. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/20,000 = 2.50 \times 10^{ - 6}\)). d, Quantile-quantile plots for unconditional gene-centric noncoding analysis of ncRNA gene. e, Manhattan plot for 2-kb sliding windows. The horizontal line indicates a genome-wide P-value threshold of 1.88 × 10⁻⁸. The significant threshold is defined by multiple comparisons using the Bonferroni correction (\(0.05/\left( {2.66 \times 10^6} \right) = 1.88 \times 10^{ - 8}\)). f, Quantile-quantile plot for 2-kb sliding windows. In panels, a, c and e, the chromosome number are indicated by the colors of dots. In all panels, STAAR-O is a two-sided test.

Supplementary information

Supplementary Information

Supplementary Figs. 1–20 and Supplementary Note

Reporting Summary

Peer Review File

Supplementary Tables 1–17

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, Z., Li, X., Zhou, H. et al. A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies. Nat Methods 19, 1599–1611 (2022). https://doi.org/10.1038/s41592-022-01640-x

Download citation

Received: 06 November 2021
Accepted: 06 September 2022
Published: 27 October 2022
Issue Date: December 2022
DOI: https://doi.org/10.1038/s41592-022-01640-x

This article is cited by

Genetic variation across and within individuals
- Zhi Yu
- Tim H. H. Coorens
- Pradeep Natarajan
Nature Reviews Genetics (2024)
Cauchy combination methods for the detection of gene–environment interactions for rare variants related to quantitative phenotypes
- Xiaoqin Jin
- Gang Shi
Heredity (2023)
Adjusting for common variant polygenic scores improves yield in rare variant association analyses
- Sean J. Jurgens
- James P. Pirruccello
- Patrick T. Ellinor
Nature Genetics (2023)
Whole-Genome Sequencing Analysis of Human Metabolome in Multi-Ethnic Populations
- Elena V. Feofanova
- Michael R. Brown
- Bing Yu
Nature Communications (2023)
Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies
- Xihao Li
- Corbin Quick
- Xihong Lin
Nature Genetics (2023)