Genetic diversity fuels gene discovery for tobacco and alcohol use

Saunders, Gretchen R. B.; Wang, Xingyan; Chen, Fang; Jang, Seon-Kyeong; Liu, Mengzhen; Wang, Chen; Gao, Shuang; Jiang, Yu; Khunsriraksakul, Chachrit; Otto, Jacqueline M.; Addison, Clifton; Akiyama, Masato; Albert, Christine M.; Aliev, Fazil; Alonso, Alvaro; Arnett, Donna K.; Ashley-Koch, Allison E.; Ashrani, Aneel A.; Barnes, Kathleen C.; Barr, R. Graham; Bartz, Traci M.; Becker, Diane M.; Bielak, Lawrence F.; Benjamin, Emelia J.; Bis, Joshua C.; Bjornsdottir, Gyda; Blangero, John; Bleecker, Eugene R.; Boardman, Jason D.; Boerwinkle, Eric; Boomsma, Dorret I.; Boorgula, Meher Preethi; Bowden, Donald W.; Brody, Jennifer A.; Cade, Brian E.; Chasman, Daniel I.; Chavan, Sameer; Chen, Yii-Der Ida; Chen, Zhengming; Cheng, Iona; Cho, Michael H.; Choquet, Hélène; Cole, John W.; Cornelis, Marilyn C.; Cucca, Francesco; Curran, Joanne E.; de Andrade, Mariza; Dick, Danielle M.; Docherty, Anna R.; Duggirala, Ravindranath; Eaton, Charles B.; Ehringer, Marissa A.; Esko, Tõnu; Faul, Jessica D.; Fernandes Silva, Lilian; Fiorillo, Edoardo; Fornage, Myriam; Freedman, Barry I.; Gabrielsen, Maiken E.; Garrett, Melanie E.; Gharib, Sina A.; Gieger, Christian; Gillespie, Nathan; Glahn, David C.; Gordon, Scott D.; Gu, Charles C.; Gu, Dongfeng; Gudbjartsson, Daniel F.; Guo, Xiuqing; Haessler, Jeffrey; Hall, Michael E.; Haller, Toomas; Harris, Kathleen Mullan; He, Jiang; Herd, Pamela; Hewitt, John K.; Hickie, Ian; Hidalgo, Bertha; Hokanson, John E.; Hopfer, Christian; Hottenga, JoukeJan; Hou, Lifang; Huang, Hongyan; Hung, Yi-Jen; Hunter, David J.; Hveem, Kristian; Hwang, Shih-Jen; Hwu, Chii-Min; Iacono, William; Irvin, Marguerite R.; Jee, Yon Ho; Johnson, Eric O.; Joo, Yoonjung Y.; Jorgenson, Eric; Justice, Anne E.; Kamatani, Yoichiro; Kaplan, Robert C.; Kaprio, Jaakko; Kardia, Sharon L. R.; Keller, Matthew C.; Kelly, Tanika N.; Kooperberg, Charles; Korhonen, Tellervo; Kraft, Peter; Krauter, Kenneth; Kuusisto, Johanna; Laakso, Markku; Lasky-Su, Jessica; Lee, Wen-Jane; Lee, James J.; Levy, Daniel; Li, Liming; Li, Kevin; Li, Yuqing; Lin, Kuang; Lind, Penelope A.; Liu, Chunyu; Lloyd-Jones, Donald M.; Lutz, Sharon M.; Ma, Jiantao; Mägi, Reedik; Manichaikul, Ani; Martin, Nicholas G.; Mathur, Ravi; Matoba, Nana; McArdle, Patrick F.; McGue, Matt; McQueen, Matthew B.; Medland, Sarah E.; Metspalu, Andres; Meyers, Deborah A.; Millwood, Iona Y.; Mitchell, Braxton D.; Mohlke, Karen L.; Moll, Matthew; Montasser, May E.; Morrison, Alanna C.; Mulas, Antonella; Nielsen, Jonas B.; North, Kari E.; Oelsner, Elizabeth C.; Okada, Yukinori; Orrù, Valeria; Palmer, Nicholette D.; Palviainen, Teemu; Pandit, Anita; Park, S. Lani; Peters, Ulrike; Peters, Annette; Peyser, Patricia A.; Polderman, Tinca J. C.; Rafaels, Nicholas; Redline, Susan; Reed, Robert M.; Reiner, Alex P.; Rice, John P.; Rich, Stephen S.; Richmond, Nicole E.; Roan, Carol; Rotter, Jerome I.; Rueschman, Michael N.; Runarsdottir, Valgerdur; Saccone, Nancy L.; Schwartz, David A.; Shadyab, Aladdin H.; Shi, Jingchunzi; Shringarpure, Suyash S.; Sicinski, Kamil; Skogholt, Anne Heidi; Smith, Jennifer A.; Smith, Nicholas L.; Sotoodehnia, Nona; Stallings, Michael C.; Stefansson, Hreinn; Stefansson, Kari; Stitzel, Jerry A.; Sun, Xiao; Syed, Moin; Tal-Singer, Ruth; Taylor, Amy E.; Taylor, Kent D.; Telen, Marilyn J.; Thai, Khanh K.; Tiwari, Hemant; Turman, Constance; Tyrfingsson, Thorarinn; Wall, Tamara L.; Walters, Robin G.; Weir, David R.; Weiss, Scott T.; White, Wendy B.; Whitfield, John B.; Wiggins, Kerri L.; Willemsen, Gonneke; Willer, Cristen J.; Winsvold, Bendik S.; Xu, Huichun; Yanek, Lisa R.; Yin, Jie; Young, Kristin L.; Young, Kendra A.; Yu, Bing; Zhao, Wei; Zhou, Wei; Zöllner, Sebastian; Zuccolo, Luisa; Batini, Chiara; Bergen, Andrew W.; Bierut, Laura J.; David, Sean P.; Gagliano Taliun, Sarah A.; Hancock, Dana B.; Jiang, Bibo; Munafò, Marcus R.; Thorgeirsson, Thorgeir E.; Liu, Dajiang J.; Vrieze, Scott

doi:10.1038/s41586-022-05477-4

Article
Open access
Published: 07 December 2022

Gretchen R. B. Saunders¹^na1,
Xingyan Wang²^na1,
Fang Chen ORCID: orcid.org/0000-0003-3108-8204²^na1,
Seon-Kyeong Jang¹^na1,
Mengzhen Liu ORCID: orcid.org/0000-0001-6550-6959¹^na1,
Chen Wang ORCID: orcid.org/0000-0002-2727-7240²^na1,
Shuang Gao²,
Yu Jiang³,
Chachrit Khunsriraksakul²,
Jacqueline M. Otto¹,
Clifton Addison⁴,
Masato Akiyama^5,6,
Christine M. Albert^7,8,
Fazil Aliev⁹,
Alvaro Alonso¹⁰,
Donna K. Arnett¹¹,
Allison E. Ashley-Koch^12,13,
Aneel A. Ashrani¹⁴,
Kathleen C. Barnes^15,16,
R. Graham Barr¹⁷,
Traci M. Bartz^18,19,
Diane M. Becker²⁰,
Lawrence F. Bielak²¹,
Emelia J. Benjamin^22,23,
Joshua C. Bis¹⁸,
Gyda Bjornsdottir²⁴,
John Blangero²⁵,
Eugene R. Bleecker²⁶,
Jason D. Boardman²⁷,
Eric Boerwinkle²⁸,
Dorret I. Boomsma²⁹,
Meher Preethi Boorgula¹⁵,
Donald W. Bowden³⁰,
Jennifer A. Brody¹⁸,
Brian E. Cade^31,32,33,
Daniel I. Chasman⁸,
Sameer Chavan¹⁵,
Yii-Der Ida Chen³⁴,
Zhengming Chen^35,36,
Iona Cheng^37,38,
Michael H. Cho^39,40,
Hélène Choquet⁴¹,
John W. Cole^42,43,
Marilyn C. Cornelis⁴⁴,
Francesco Cucca⁴⁵,
Joanne E. Curran²⁵,
Mariza de Andrade⁴⁶,
Danielle M. Dick⁹,
Anna R. Docherty^47,48,49,
Ravindranath Duggirala²⁵,
Charles B. Eaton⁵⁰,
Marissa A. Ehringer^51,52,
Tõnu Esko⁵³,
Jessica D. Faul⁵⁴,
Lilian Fernandes Silva⁵⁵,
Edoardo Fiorillo⁵⁶,
Myriam Fornage^28,57,
Barry I. Freedman⁵⁸,
Maiken E. Gabrielsen⁵⁹,
Melanie E. Garrett^12,13,
Sina A. Gharib^18,60,61,
Christian Gieger⁶²,
Nathan Gillespie⁴⁸,
David C. Glahn⁶³,
Scott D. Gordon⁶⁴,
Charles C. Gu⁶⁵,
Dongfeng Gu⁶⁶,
Daniel F. Gudbjartsson^24,67,
Xiuqing Guo⁶⁸,
Jeffrey Haessler⁶⁹,
Michael E. Hall⁷⁰,
Toomas Haller⁵³,
Kathleen Mullan Harris⁷¹,
Jiang He^72,73,
Pamela Herd⁷⁴,
John K. Hewitt^51,75,
Ian Hickie⁷⁶,
Bertha Hidalgo⁷⁷,
John E. Hokanson⁷⁸,
Christian Hopfer⁷⁹,
JoukeJan Hottenga²⁹,
Lifang Hou⁴⁴,
Hongyan Huang^80,81,
Yi-Jen Hung⁸²,
David J. Hunter⁸³,
Kristian Hveem^59,84,85,
Shih-Jen Hwang⁸⁶,
Chii-Min Hwu⁸⁷,
William Iacono¹,
Marguerite R. Irvin⁷⁷,
Yon Ho Jee⁸⁰,
Eric O. Johnson^88,89,
Yoonjung Y. Joo^44,90,
Eric Jorgenson⁹¹,
Anne E. Justice^92,93,
Yoichiro Kamatani^5,94,
Robert C. Kaplan^69,95,
Jaakko Kaprio⁹⁶,
Sharon L. R. Kardia²¹,
Matthew C. Keller^51,75,
Tanika N. Kelly^72,73,
Charles Kooperberg^19,69,
Tellervo Korhonen⁹⁶,
Peter Kraft^80,81,
Kenneth Krauter⁹⁷,
Johanna Kuusisto^98,99,
Markku Laakso⁹⁸,
Jessica Lasky-Su¹⁰⁰,
Wen-Jane Lee¹⁰¹,
James J. Lee¹,
Daniel Levy⁸⁶,
Liming Li¹⁰²,
Kevin Li¹⁰³,
Yuqing Li³⁷,
Kuang Lin³⁵,
Penelope A. Lind^104,105,106,
Chunyu Liu¹⁰⁷,
Donald M. Lloyd-Jones¹⁰⁸,
Sharon M. Lutz^109,110,
Jiantao Ma^86,111,
Reedik Mägi⁵²,
Ani Manichaikul¹¹²,
Nicholas G. Martin⁶⁴,
Ravi Mathur⁸⁸,
Nana Matoba^5,113,
Patrick F. McArdle¹¹⁴,
Matt McGue¹,
Matthew B. McQueen¹¹⁵,
Sarah E. Medland¹⁰⁴,
Andres Metspalu⁵³,
Deborah A. Meyers²⁶,
Iona Y. Millwood^35,36,
Braxton D. Mitchell^114,116,
Karen L. Mohlke¹¹⁷,
Matthew Moll^39,40,
May E. Montasser¹¹⁴,
Alanna C. Morrison²⁸,
Antonella Mulas⁵⁶,
Jonas B. Nielsen^59,118,
Kari E. North⁹³,
Elizabeth C. Oelsner¹⁷,
Yukinori Okada^119,120,121,122,
Valeria Orrù⁵⁶,
Nicholette D. Palmer³⁰,
Teemu Palviainen⁹⁶,
Anita Pandit¹⁰³,
S. Lani Park¹²³,
Ulrike Peters^69,124,
Annette Peters^125,126,127,
Patricia A. Peyser²¹,
Tinca J. C. Polderman^128,129,
Nicholas Rafaels¹⁵,
Susan Redline^31,32,130,
Robert M. Reed¹³¹,
Alex P. Reiner^69,124,
John P. Rice¹³²,
Stephen S. Rich¹¹²,
Nicole E. Richmond⁷⁸,
Carol Roan¹³³,
Jerome I. Rotter⁶⁸,
Michael N. Rueschman³¹,
Valgerdur Runarsdottir¹³⁴,
Nancy L. Saccone^65,135,
David A. Schwartz¹³⁶,
Aladdin H. Shadyab¹³⁷,
Jingchunzi Shi¹³⁸,
Suyash S. Shringarpure¹³⁸,
Kamil Sicinski¹³³,
Anne Heidi Skogholt⁵⁹,
Jennifer A. Smith^21,54,
Nicholas L. Smith^124,139,140,
Nona Sotoodehnia^18,141,
Michael C. Stallings^51,75,
Hreinn Stefansson²⁴,
Kari Stefansson^24,142,
Jerry A. Stitzel⁵¹,
Xiao Sun⁷²,
Moin Syed¹,
Ruth Tal-Singer¹⁴³,
Amy E. Taylor^144,145,146,
Kent D. Taylor⁶⁸,
Marilyn J. Telen¹²,
Khanh K. Thai⁴¹,
Hemant Tiwari¹⁴⁷,
Constance Turman^80,81,
Thorarinn Tyrfingsson¹³⁴,
Tamara L. Wall¹⁴⁸,
Robin G. Walters^35,36,
David R. Weir⁵⁴,
Scott T. Weiss¹⁰⁰,
Wendy B. White¹⁴⁹,
John B. Whitfield⁶⁴,
Kerri L. Wiggins¹⁵⁰,
Gonneke Willemsen²⁹,
Cristen J. Willer^151,152,153,
Bendik S. Winsvold^59,154,155,
Huichun Xu¹¹⁴,
Lisa R. Yanek²⁰,
Jie Yin⁴¹,
Kristin L. Young¹⁵⁶,
Kendra A. Young⁷⁸,
Bing Yu²⁸,
Wei Zhao²¹,
Wei Zhou^153,157,
Sebastian Zöllner^158,159,
Luisa Zuccolo^144,146,160,
23andMe Research Team,
The Biobank Japan Project,
Chiara Batini¹⁶¹,
Andrew W. Bergen^162,163,
Laura J. Bierut¹³²,
Sean P. David^164,165,
Sarah A. Gagliano Taliun^166,167,168,
Dana B. Hancock⁸⁸,
Bibo Jiang²,
Marcus R. Munafò^144,145,169,
Thorgeir E. Thorgeirsson²⁴,
Dajiang J. Liu ORCID: orcid.org/0000-0001-6553-858X²^na2 &
…
Scott Vrieze ORCID: orcid.org/0000-0003-3861-7930¹^na2

Nature volume 612, pages 720–724 (2022)

This article has been updated

Abstract

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury^1,2,3,4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries⁵. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.

Main

We developed a multi-ancestry meta-regression method to meta-analyse ancestrally diverse genome-wide association study (GWAS) summary statistics from 60 cohorts with 3,383,199 individuals (Supplementary Table 1; see Supplementary Fig. 1 for an overview of the project), representing major clines of recent human ancestry (Fig. 1a). The method uses meta-regression to account for per-study axes of genetic ancestry variation, combined with a random effect to capture further unexplained heterogeneity in the effect of a given genetic variant. Although ancestry here is continuous, we also performed secondary analyses of continental groups reflecting four ancestry clines, including individuals of African (AFR; maximum n = 119,589) and American (AMR; n = 286,026) recently admixed ancestries primarily from the United States; individuals of East Asian ancestries (EAS; n = 296,438) primarily from the United States, People’s Republic of China and Japan; and individuals of European ancestries (EUR; n = 2,669,029) from the United States, Europe and Australia (see Extended Data Fig. 1 and Supplementary Note). Smoking phenotypes were selected to represent different stages of tobacco use and addiction, including initiation, the onset of regular use, amount smoked and cessation. Measures of onset included whether an individual ever smoked regularly (smoking initiation (SmkInit); n = 3,383,199) and the age at which the individual began smoking regularly (AgeSmk; n = 728,826). Amount smoked among current and former regular smokers was measured as cigarettes smoked per day (CigDay; n = 784,353). Smoking cessation (SmkCes; n = 1,400,535) contrasted current versus former smokers. Alcohol use was measured in most studies as drinks per week (DrnkWk; n = 2,965,643).

**Fig. 1: Ancestry composition and effect size moderation.**

Multi-ancestry meta-analysis

Using our multi-ancestry meta-analysis, we identified 2,143 associated loci across all phenotypes (sentinel variant P < 5 × 10⁻⁹), with 3,823 independently associated variants (Extended Data Fig. 2, Supplementary Tables 2 and 3 and Supplementary Figs. 2 and 3). Of these, 1,346 loci and 2,486 independent variants were associated with SmkInit, 33 loci (39 variants) with AgeSmk, 140 loci (243 variants) with CigDay, 128 loci (206 variants) with SmkCes and 496 loci (849 variants) with DrnkWk. Approximately 64% (n = 1,364) of loci were phenotype-specific, five loci were associated with all four smoking phenotypes but not with DrnkWk, and five loci were associated with all five phenotypes. All sentinel variants within identified loci had high posterior probabilities that their effect would replicate in a sufficiently powered study according to a trans-ancestry extension of our GWAS cross-validation technique⁶. Only 17 sentinel variants (0.7%) had such posterior probabilities of less than 0.99 and were therefore removed from the counts above and from further consideration (additional details on these 17 variants are shown in Supplementary Fig. 4).

Inclusion of diverse ancestry may improve the discovery of new variants through a combination of increased genetic variation, larger sample sizes and improved fine-mapping due to diverse patterns of linkage disequilibrium (LD). We quantified gains in power from the use of our multi-ancestry model over a simpler ancestry-naive fixed-effects model excluding the ancestry meta-regression. Comparing the number of associated variants, we found 721 additional independent variants that were identified only by the multi-ancestry meta-regression analysis. Both sets of models were fit to the same data, such that the larger number of significantly associated variants identified with the multi-ancestry model indicates increased power from accounting for axes of genetic variation and residual heterogeneity. Included among these 721 were newly associated variants in genes related to nervous system function (for example, NRXN1) including glutamatergic (GRIN2A) neurotransmission, which is of relevance to neurocircuitry in addiction^7,8.

To isolate likely causal variants, we used a fine-mapping procedure (see Supplementary Note) that leverages variation in LD across ancestry groups to construct 90% credible intervals. We identified 597 loci (27.9%) in which the 90% credible intervals included fewer than five variants, including 192 loci (9.0%) with a single fine-mapped variant. Overall, credible intervals contained medians of 9–19 variants and median spans of 32–78 kb across phenotypes (Supplementary Table 4). Compared with the EUR-stratified GWAS (described in the next section), the trans-ancestry fine-mapping increased the number of 90% credible intervals containing fewer than five variants by 27.6%, and containing a single variant by 41.2%. Across all 2,143 loci, 1,330 (62.1%) loci had a reduced number of variants in the credible intervals in the multi-ancestry analysis. To determine the gain in resolution attributable to increased sample size (versus LD differences), we ‘downsampled’ the multi-ancestry analysis by removing EUR ancestry cohorts until the total sample size was approximately equal to that of the EUR-stratified analysis and regenerated fine-mapping results. Using the 1,330 loci with improved resolution in multi-ancestry analysis, we found that the credible intervals were reduced from a median of 22 variants in the EUR-stratified analysis to 12 variants in the downsampled multi-ancestry analysis, suggesting that approximately 55% of the observed improvement in fine-mapping is attributable to larger multi-ancestry sample sizes alone. These findings highlight the utility of both increased sample size and diverse ancestry in fine-mapping variants for these complex behavioural phenotypes.

To characterize genes prioritized from fine-mapping, we conducted a series of functional enrichment analyses. We first selected intervals fine-mapped to fewer than five variants from the multi-ancestry results and mapped each variant to the nearest gene to identify ‘high-priority’ genes. Relative to genes mapped from variants with posterior inclusion probabilities (PIP) < 0.01, the high-priority genes were enriched across brain and nerve tissues (Extended Data Fig. 3a and Supplementary Table 5). Within the brain, cell-type enrichment of the high-priority genes was observed for projecting glutamatergic neurons from the cortex, hippocampus and amygdala (telencephalon excitatory projection neurons) and projection GABA neurons from medium spiny neurons of the striatum (telencephalon inhibitory projecting neurons), along with neurons in various subcortical structures such as the hypothalamus and midbrain, consistent with aspects of the mesolimbic theory of addiction^7,8 (Extended Data Fig. 3b). Finally, these high-priority genes that were strongly associated with substance use were enriched in gene pathways related to neurogenesis, neuronal development, neuronal differentiation and synaptic function. The neurodevelopmental aspect of the high-priority genes could indicate a role for these genes in processes that predispose individuals to risk of substance use and/or may contribute to brain circuit rewiring during drug use.
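
To illustrate how a 90% credible interval can be formed from per-variant posterior inclusion probabilities, the following minimal Python sketch ranks variants by PIP and accumulates them until the target coverage is reached. This is a generic construction and not necessarily the procedure described in the Supplementary Note; the function name, variant labels and PIP values are hypothetical.

```python
def credible_set(pips, coverage=0.90):
    """Return the smallest set of variants whose PIPs sum to >= coverage.

    pips: dict mapping variant ID -> posterior inclusion probability
          (assumed to sum to about 1 within the locus).
    """
    # Rank variants from highest to lowest PIP.
    ranked = sorted(pips.items(), key=lambda item: item[1], reverse=True)
    selected, total = [], 0.0
    for variant, pip in ranked:
        selected.append(variant)
        total += pip
        if total >= coverage:
            break
    return selected


# Hypothetical PIPs for a single locus (illustrative values only).
example = {"varA": 0.62, "varB": 0.30, "varC": 0.05, "varD": 0.03}
print(credible_set(example))
```

Under this construction, a locus counts as fine-mapped to a single variant when one variant alone carries at least 90% of the posterior probability.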

The multi-ancestry meta-analysis method also allowed for tests of whether a variant effect size differed (that is, was moderated) by ancestry along four ancestry dimensions estimated from multidimensional scaling (MDS) of allele frequencies from each participating study (Fig. 1a). Roughly, the first axis represents an EAS ancestry cline, the second axis an AFR cline, the third a EUR cline (north to south EUR) and the fourth an AMR cline. There was minimal evidence of effect size moderation by ancestry for most independent variants, ranging from 76.6% (187 variants) in CigDay to 85.0% (175 variants) in SmkCes. Another 7.7–18.1% showed modest evidence for moderation. Finally, roughly 3.6% of all independent variants, reflecting 136 variants from 84 distinct loci, showed strong evidence of effect size moderated by ancestry (complete results are shown in Supplementary Table 2). Comparisons between the variants with strong evidence for effect size moderation by ancestry and those with no evidence suggested that the identification of these 136 variants was not driven to a large extent by differences in imputation quality, LD scores or Fst (fixation index) across ancestries.

Across phenotypes, 88 of these 136 variants showed moderation by the first axis of ancestry variation (approximate EAS cline; Fig. 1b, left), 29 variants by the second axis (approximate AFR cline; Fig. 1b, middle) and 10 variants by the fourth axis (approximate AMR cline; Fig. 1b, right). Nine variants showed differences in effect size moderated by the third axis (EUR cline). Only the effect of one variant was moderated by three or more ancestry clines (EAS, AFR and AMR): rs1229984, a missense variant in the alcohol dehydrogenase gene ADH1B, which has been shown to be protective against alcohol consumption⁹. An increase on any of these clines was associated with a reduced effect size of this allele, on average. For example, if there are two people who both carry one copy of the protective T allele for this variant but are separated by 1 s.d. on MDS component 1 (EAS cline), the person with a lower value on that MDS cline would be expected to drink 0.3 fewer drinks than the person with a higher MDS value, despite the same rs1229984 genotype in ADH1B.
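
The moderation test described above can be pictured as a regression of per-study effect estimates on per-study ancestry coordinates. The sketch below is a simplified illustration only: it fits an inverse-variance weighted least-squares meta-regression and omits the random effect for residual heterogeneity that the full method includes. The function name, the single MDS axis and all numerical values are hypothetical.

```python
import numpy as np


def meta_regression(betas, ses, mds):
    """Weighted least-squares meta-regression of per-study effects on ancestry axes.

    betas: per-study effect estimates, shape (K,)
    ses:   per-study standard errors, shape (K,)
    mds:   per-study ancestry coordinates, shape (K, J)
    Returns coefficients [intercept, slope_1, ..., slope_J].
    NOTE: the random effect for residual heterogeneity is omitted here.
    """
    betas = np.asarray(betas, dtype=float)
    weights = 1.0 / np.asarray(ses, dtype=float) ** 2  # inverse-variance weights
    design = np.column_stack([np.ones(len(betas)), np.asarray(mds, dtype=float)])
    # Solve the weighted normal equations (X' W X) b = X' W y.
    xtw = design.T * weights
    return np.linalg.solve(xtw @ design, xtw @ betas)


# Hypothetical data: five studies placed along one ancestry axis.
mds_axis1 = np.array([[-1.0], [-0.5], [0.0], [0.5], [1.0]])
betas = -0.5 + 0.3 * mds_axis1[:, 0]  # effect weakens as the MDS value rises
ses = np.array([0.05, 0.04, 0.02, 0.04, 0.05])
intercept, slope = meta_regression(betas, ses, mds_axis1)
print(round(float(intercept), 3), round(float(slope), 3))
```

A slope that differs from zero corresponds to moderation by that axis: two carriers of the same allele separated by one unit on the axis would be expected to differ in the allele's effect by the value of the slope.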

To further evaluate causal genes and relevant tissues through which associated variants may be operating, we applied a trans-ancestry transcriptome-wide association study (TWAS) analysis to each phenotype across 49 tissues derived from the GTEx Consortium¹⁰. Using a P value threshold Bonferroni-corrected for the total number of genes and tissues within a phenotype, we found 1,167 genes significantly associated with SmkInit, 21 genes with AgeSmk, 203 genes with CigDay, 188 genes with SmkCes and 504 genes with DrnkWk (resulting in 1,705 unique genes across phenotypes; Supplementary Table 6). For each of our five phenotypes, matrix decomposition parallel analysis¹¹ of the per-tissue P value correlation matrix suggested two components: one explaining 53.7–55.2% of the variance in P values, and another explaining 3.5–3.8% of the variance in P values. Similar loading patterns were observed for all phenotypes such that all tissues loaded strongly (all loadings > 0.12) on the first component, suggesting that it represents a general effect across tissues, whereas only brain tissues had strong loadings on the second component (all loadings > 0.12), indicating the importance of brain-specific gene expression effects for these tobacco and alcohol use phenotypes. Pathway enrichment analyses of the TWAS-associated genes identified 1,029 unique gene pathways across phenotypes that were broadly enriched across tissues (Supplementary Table 7), including many of obvious relevance to neurotransmission and neurodevelopment.
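
As a worked example of the Bonferroni correction described above, the short sketch below computes a per-test threshold for all gene-tissue pairs within one phenotype. The family-wise alpha of 0.05 and the count of 20,000 genes per tissue are assumptions for illustration; only the 49 tissues come from the text, and in practice the number of testable genes varies by tissue.

```python
def bonferroni_threshold(n_genes, n_tissues, alpha=0.05):
    """Per-test P value threshold after correcting for all gene-tissue pairs."""
    n_tests = n_genes * n_tissues
    return alpha / n_tests


# 49 tissues is from the text; 20,000 genes is a hypothetical round number.
threshold = bonferroni_threshold(n_genes=20_000, n_tissues=49)
print(f"{threshold:.2e}")
```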

To further illustrate several variants within genes of interest, we integrated findings described above to select variants for which there was evidence of association across analytic methods and for which the availability of diverse ancestries was clearly relevant. Illustrative variants were chosen in a similar way as described for the enrichment analyses above: (1) we extracted variants from multi-ancestry fine-mapped credible intervals containing fewer than five variants, and (2) we cross-referenced the resulting variants with the multi-ancestry TWAS cis-expression quantitative trait loci and their significantly associated genes. We highlight five of the 52 genes that resulted from this process.

We found the nicotinic gene cluster CHRNA5–A3–B4 to be significantly associated with SmkInit¹² with a fine-mapped 90% credible interval that shrank from 53 variants in EUR-stratified results to just two variants in multi-ancestry results (rs2869055 and rs28438420; Supplementary Table 4). These variants are not in high LD (r² = 0.31 for both variants) with the well-known variant rs16969968 in this gene cluster. By contrast, this locus was fine-mapped to two variants in high LD with rs16969968 for CigDay (r² = 0.84 and 0.86), suggesting that the variants underlying this signal for smoking initiation may be distinct from those for cigarettes per day. We also found a novel association between SmkInit and CACNA1B, which encodes a voltage-gated calcium channel (Ca_v2.2) that controls neuronal neurotransmitter release and has been associated with cocaine reinstatement¹³, increased aggression and vigilance, and reduced startle and exploration¹⁴. CACNA1B is linked to multiple psychiatric disorders, including schizophrenia, bipolar disorder and autism spectrum disorders^15,16,17.

CigDay was associated with variants in neurturin (NRTN), a type of glial cell line-derived neurotrophic factor involved in the development and survival of dopamine neurons¹⁸. This gene has been studied in relation to Parkinson disease for its potential to restore dopamine neurocircuitry¹⁹. Likewise, PAK6 was another novel gene strongly associated with CigDay in TWAS results and was fine-mapped to just three variants in the 90% credible interval. PAK6 encodes a p21-activated kinase that is highly expressed in the striatum and hippocampus, has been implicated in the migration of GABAergic interneurons²⁰ as well as the modulation of dopaminergic neurotransmission²¹, and is involved in locomotor activity and cognitive function²². PAK6 has been robustly associated with schizophrenia²³ and neurodegenerative diseases^24,25, such as Parkinson disease and Alzheimer disease, further highlighting its role in synaptic changes. Finally, we found a novel association between ECE2 and DrnkWk. ECE2 is involved in cortical development²⁶ as well as the processing of several neuroendocrine peptides, including neurotensin and substance P²⁷, and may also have a role in amyloid-β processing²⁸. ECE2 also generates peptides such as BAM 12 (which shows κ-opioid receptor selectivity) and BAM 22 (which shows μ-opioid receptor selectivity), suggesting a link with pain transmission²⁷.

Genetic correlation and polygenic scores

To evaluate heritability, genetic correlation and polygenic scoring, we generated ancestry-stratified GWAS meta-analysis results for each of the four continental groups: AFR, AMR, EAS and EUR (Supplementary Table 2 lists ancestry-stratified loci). Heritability and cross-phenotype genetic correlations were generally similar in sign and modest in magnitude in each ancestry (Fig. 2a and Supplementary Tables 8 and 9). Smoking phenotypes were moderately genetically correlated with each other (|r_g| = 0.30–0.63) and with DrnkWk (|r_g| = 0.16–0.27). Genetic correlations for the same phenotype between each of the largest contributing cohorts and all remaining cohorts (restricted to EUR ancestries only) were generally high for each smoking phenotype (mean r_g of 0.93) and DrnkWk (mean r_g of 0.72), indicating that these measures were reliable across cohorts (Supplementary Table 9).

**Fig. 2: Within-ancestry and across-ancestry performance of polygenic scores in an independent target sample (Add Health³⁵).**

To characterize the multifactorial genetic aetiology of tobacco and alcohol use, we computed genetic correlations of our EUR-stratified results with 1,141 medical, biomarker and behavioural phenotypes from the UK Biobank²⁹ (Supplementary Tables 10 and 11). An affinity propagation clustering algorithm³⁰ was used to aid interpretability by grouping UK Biobank phenotypes such that each of the five current phenotypes was an exemplar (Supplementary Fig. 5). SmkInit and AgeSmk clustered together, as did SmkCes and CigDay, with all four forming a broad higher-level smoking cluster. SmkInit showed high positive genetic correlations with addiction to any substance, neighbourhood material deprivation and diagnosis of chronic obstructive pulmonary disease, and a high negative correlation with age at first sexual intercourse (|r_g| = 0.57–0.64). For AgeSmk, the largest genetic correlations were with reproductive phenotypes such as age at first birth (r_g = 0.69–0.71) and measures of years of education and attainment (r_g = 0.58–0.69). CigDay and SmkCes were most highly positively correlated with respiratory and cardiovascular diseases and cancers (r_g = 0.52–0.72), highlighting their genetic link to adverse disease outcomes. Finally, DrnkWk was most strongly correlated with problematic drinking behaviours (r_g = 0.52–0.70), indicating extensive overlap in the genetic architecture of DrnkWk and measures of alcohol use, problems and alcohol use disorder. This is consistent with previous findings of strong but imperfect genetic correlations (for example, r_g = 0.8) between alcohol consumption and alcohol use disorder from large-scale GWAS^31,32. We note, however, that genetic correlations can be difficult to interpret^33,34 as they may be affected by genetic confounding, mediation effects or sampling bias.

We used the ancestry-stratified meta-analysis results to construct ancestry-specific polygenic risk scores in Add Health³⁵, an independent target sample of individuals of diverse ancestries from the United States (n = 2,199 AFR, 1,132 AMR, 525 EAS and 6,092 EUR). To evaluate within-ancestry and across-ancestry performance of polygenic scores, we iteratively fit a multiple regression model and evaluated the incremental predictive accuracy of each ancestry-based score, over and above scores already entered into the model (that is, first including the AMR-based score, then adding the AFR-based, EAS-based and EUR-based scores one at a time to evaluate incremental prediction accuracy). EUR-based scores in EUR ancestries outperformed ancestry-matched scores in non-EUR ancestries (Fig. 2a) and showed significantly stronger associations with most phenotypes in EUR ancestries than in non-EUR ancestries (described by decile plots and tested by modelling an interaction between the EUR-based polygenic risk score and the target sample ancestry group), consistent with expectations³⁶ (Fig. 2b,c). For each ancestry and phenotype, the EUR-based score on its own outperformed the ancestry-matched score on its own (Supplementary Table 12). These results highlight the relative utility of current polygenic scores for EUR ancestries versus all others. In interpreting these results, however, we note that some comparisons may be underpowered to identify differences in the variance explained by polygenic scores between ancestries. Finally, EUR-based scores overpredicted tobacco and alcohol use for individuals of non-EUR ancestry and underpredicted for individuals of EUR ancestry, although this prediction bias is readily eliminated through statistical correction with genetic principal components.
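
The incremental evaluation described above can be sketched as a sequence of nested regressions, recording the gain in R² as each ancestry-based score enters the model. The example below is a simplified illustration on simulated data: it omits covariates such as genetic principal components, and the function names, effect sizes and sample size are hypothetical.

```python
import numpy as np


def r_squared(y, predictors):
    """R^2 from an ordinary least-squares fit of y on predictors plus an intercept."""
    y = np.asarray(y, dtype=float)
    columns = [np.ones(len(y))] + [np.asarray(p, dtype=float) for p in predictors]
    design = np.column_stack(columns)
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residuals = y - design @ coef
    centred = y - y.mean()
    return 1.0 - (residuals @ residuals) / (centred @ centred)


def incremental_r2(y, scores, order):
    """Gain in R^2 as each score is added to the model in the given order."""
    gains, entered, previous = {}, [], 0.0
    for name in order:
        entered.append(scores[name])
        current = r_squared(y, entered)
        gains[name] = current - previous
        previous = current
    return gains


# Simulated scores and phenotype (illustrative values only).
rng = np.random.default_rng(0)
n = 500
scores = {name: rng.standard_normal(n) for name in ("AMR", "AFR", "EAS", "EUR")}
phenotype = 0.3 * scores["EUR"] + 0.1 * scores["AMR"] + rng.standard_normal(n)
gains = incremental_r2(phenotype, scores, order=["AMR", "AFR", "EAS", "EUR"])
for name, gain in gains.items():
    print(f"{name}: {gain:.4f}")
```

Because the models are nested, the gains sum to the R² of the full model, and the gain attributed to a score depends on the order in which it is entered.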

Summary

Tobacco and alcohol use are heritable behaviours that can be radically affected by environmental factors, including cultural context³⁷ and public health policies^38,39. Despite this, we found that a large majority of associated genetic variants showed homogeneous effect size estimates across diverse ancestries, suggesting that the genetic variants associated with substance use affect such individuals similarly. The limited extent of variant effect size heterogeneity, coupled with similar heritability estimates and cross-trait genetic correlations, indicates that the genetic architecture underlying substance use is not markedly different across ancestries. There are some potentially interesting exceptions of ancestrally heterogeneous effects in genes such as ADH1B and CACNA1B. By contrast, polygenic scores generally performed well in EUR ancestries but with mixed-to-limited results in other ancestries, suggesting that portability of such scores across ancestries remains challenging, even when discovery sample sizes across all ancestries are more than 100,000. Explanations for this apparent discrepancy have been proposed⁴⁰, but more stringent and sensitive tests will be required to draw strong conclusions about such patterns of heredity.

Most individuals of EUR, AFR and AMR ancestries in the current study live in the United States and Europe and share somewhat similar environments regarding tobacco and alcohol availability and policies surrounding use of these substances, and all included individuals were adults. Further increases in genetic diversity and consideration of environmental moderators, including cultural factors, will continue to add to our understanding of the genetic architecture of both substance use and related behaviours and diseases.

Methods

Here we describe an overview of the methods used to conduct the association, fine-mapping and downstream in silico functional analysis. Additional details can be found in the Supplementary Note.

Generation of summary statistics and ancestry considerations

Except for TOPMed studies, in which the genetic data were derived from deep whole-genome sequencing, participants in all studies were genotyped on genome-wide arrays. The majority of studies imputed their genotypes to the Haplotype Reference Consortium⁴¹ (for EUR ancestries) or 1000 Genomes⁴² (Supplementary Table 1). GWAS summary statistics were generated in each study sample typically using RVTESTS⁴³, BOLT-LMM⁴⁴ or SAIGE⁴⁵ with covariates of sex, age, age squared and genetic principal components according to an analysis plan detailed in the Supplementary Note. Studies composed primarily of closely related individuals (for example, family studies) first regressed out covariates, inverse-normalized the residuals as necessary and then tested additive variant effects under a linear mixed model with a genetic kinship matrix for all phenotypes. Some studies of unrelated individuals followed the same analysis for quasi-continuous phenotypes (AgeSmk, CigDay and DrnkWk), but estimated additive genetic effects under a logistic model for binary phenotypes (SmkInit and SmkCes).
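
The inverse-normalization step mentioned above can be illustrated with a rank-based transform. The sketch below is one common variant and is an assumption about the details: the 0.5 rank offset and the averaging of tied ranks may differ from what individual studies applied, and the function name and input values are hypothetical.

```python
from statistics import NormalDist


def inverse_normal_transform(values):
    """Rank-based inverse-normal transform; ties receive their average rank."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied values.
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # 1-based average rank of the run
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    normal = NormalDist()
    # The 0.5 offset keeps quantiles strictly inside (0, 1).
    return [normal.inv_cdf((r - 0.5) / n) for r in ranks]


# Hypothetical residuals after regressing out covariates.
residuals = [3.0, 1.0, 2.0]
print([round(z, 3) for z in inverse_normal_transform(residuals)])
```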

We used terminology and acronyms from the 1000 Genomes Project⁴² to describe ancestry. The majority of participating cohorts stratified their sample by ancestry before generation of summary statistics. Cohorts composed of substantial samples of multiple ancestry groups provided summary statistics stratified by ancestry, as well as results based on all individuals regardless of ancestry for use in the multi-ancestry meta-analyses. As TOPMed served multiple functions in the present study, including as an LD reference panel, we detailed the ancestry analyses and classification of TOPMed data in the Supplementary Note. For example, for both ancestry-stratified and multi-ancestry conditional analysis, we created TOPMed reference panels for estimating LD. We first created ancestry-stratified reference samples, resulting in matched ancestry reference sample sizes of n = 28,665 AFR, n = 19,737 AMR, n = 4,918 EAS and n = 51,656 EUR. To create a TOPMed-based reference sample for multi-ancestry analyses, we combined the matched ancestry individuals, resulting in a diverse ancestry reference panel (n = 104,976) that matches the ancestry proportions of the included cohorts to estimate LD.

Extensive quality control and filtering were performed on the summary statistics from each cohort. We removed studies with a sample size of less than 100, and those with genomic control values greater than 1.1 or less than 0.9 and a sample size of less than 10,000 (per study sample size and genomic control values are listed in Supplementary Table 1), as well as variants with an imputation quality of less than 0.3.

Ancestry-stratified meta-analyses

Ancestry-stratified meta-analyses were performed using the software package rareGWAMA (see URLs for software use). Specifically, the method aggregated weighted Z-score statistics, that is,

$${Z}_{{\rm{META}}}=\frac{{\sum }_{k}{w}_{k}{Z}_{k}}{{({\sum }_{k}{w}_{k}^{2})}^{1/2}},$$

where ${Z}_{k}$ is the Z-score statistic in study k. The weight ${w}_{k}$ is defined by ${w}_{k}=\sqrt{{N}_{k}{p}_{k}(1-{p}_{k}){R}_{k}^{2}}$, where ${N}_{k}$ is the sample size, ${p}_{k}$ is the variant allele frequency and ${R}_{k}^{2}$ is the imputation quality in study k. This method accounts for between-study heterogeneity in phenotype measures, imputation accuracy, allele frequencies and sample sizes.
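As a minimal sketch of the weighted Z-score formula above (not the rareGWAMA implementation), the combination for a single variant could be written as follows. The function name and the toy inputs are illustrative assumptions.

```python
import math

def meta_z(z_scores, sample_sizes, allele_freqs, imputation_r2):
    """Combine per-study Z-scores with weights w_k = sqrt(N_k * p_k * (1 - p_k) * R_k^2)."""
    weights = [
        math.sqrt(n * p * (1.0 - p) * r2)
        for n, p, r2 in zip(sample_sizes, allele_freqs, imputation_r2)
    ]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator

# Toy values (made up): two studies, same allele frequency, perfect imputation.
z_meta = meta_z([2.0, 3.0], [10_000, 40_000], [0.2, 0.2], [1.0, 1.0])
print(round(z_meta, 4))
```

With these inputs the larger study receives twice the weight of the smaller one, because the weight scales with the square root of the sample size.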

Multi-ancestry meta-analyses

Multi-ancestry meta-analyses were performed using mixed-effects meta-regression for optimal trans-ancestry meta-analysis (MEMO) implemented in rareGWAMA (see URLs for software use). The full model is ${b}_{{jk}}={\sum }_{l=0}^{L}{C}_{{lk}}{\gamma }_{{jl}}+{e}_{{jk}}+{{\epsilon }}_{{jk}},$ where ${b}_{{jk}}$ is the genetic effect estimate for the jth variant in the kth study, and ${C}_{{lk}}$ is the lth ancestry component for the kth study. Note that we set ${C}_{0k}=1$, so ${\gamma }_{j0}$ serves as the intercept. The regression coefficient ${\gamma }_{{jl}}$ captures the effect of the lth axis of genetic variation for the jth variant, with ${\gamma }_{j0}$ as an intercept in the model, and ${e}_{{jk}}\sim N\left(0,{\tau }^{2}\right)$ is the random effect that captures unexplained effect size heterogeneity after adjusting for genetic variation. Finally, ${{\epsilon }}_{{jk}}\sim N\left(0,{s}_{{jk}}^{2}\right)$ is the random error term, where ${s}_{{jk}}^{2}$ is the variance of the genetic effect estimate ${b}_{{jk}}$. This method models heterogeneity of effects attributable to ancestry as well as a random effect to capture residual heterogeneity. The MEMO model contains fixed-effect, random-effect and meta-regression models as special cases. Specifically, removing the random effect ${e}_{{jk}}$ results in a regular meta-regression model, removing the covariates of genetic variation (${C}_{{lk}})$, but retaining ${e}_{{jk}}$ results in a random-effect meta-analysis model, whereas removing both ${e}_{{jk}}$ and ${C}_{{lk}}$ results in a fixed-effect meta-analysis model.
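To illustrate the simplest special case named above, the fixed-effect model (no random effect and no ancestry covariates beyond the intercept) reduces to the standard inverse-variance weighted estimate of ${\gamma }_{j0}$. The sketch below assumes per-study effect estimates and their variances for one variant; the function name and toy numbers are illustrative and are not part of MEMO or rareGWAMA.

```python
import math

def fixed_effect_meta(effects, variances):
    """Inverse-variance weighted estimate of the intercept gamma_j0 and its standard error."""
    inverse_variances = [1.0 / v for v in variances]
    total_weight = sum(inverse_variances)
    gamma0 = sum(b * w for b, w in zip(effects, inverse_variances)) / total_weight
    standard_error = math.sqrt(1.0 / total_weight)
    return gamma0, standard_error

# Toy values (made up): two studies with equal precision.
gamma0, se = fixed_effect_meta([0.1, 0.3], [0.01, 0.01])
print(gamma0, se)
```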

Per-study ancestry variation, ${C}_{{lk}}$, was calculated using multidimensional scaling (MDS) on the basis of allele frequency. We defined the genetic distance between two studies, k and k′, across J variants as ${d}_{k{k}^{{\prime} }}=\sqrt{{\sum }_{j}{\left({f}_{{jk}}-{f}_{j{k}^{{\prime} }}\right)}^{2}}$, where ${f}_{{jk}}$ and ${f}_{j{k}^{{\prime} }}$ are the allele frequencies of the jth variant in studies k and k′, respectively. We fit models with 0, 1, 2, 3 and 4 MDS components and combined the results using a minimal P value approach (see Extended Data Fig. 1a for a visual representation of the first four MDS components).
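The between-study distance defined above can be sketched as follows. The study names and allele frequencies are made up for illustration, and the MDS step itself is not shown; the resulting pairwise distances would be the input to it.

```python
import math

def genetic_distance(freqs_k, freqs_k_prime):
    """Euclidean distance between two studies' allele-frequency vectors (same variant order)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(freqs_k, freqs_k_prime)))

def distance_matrix(study_freqs):
    """Pairwise distances for a dict mapping study name -> list of allele frequencies."""
    names = sorted(study_freqs)
    return {(a, b): genetic_distance(study_freqs[a], study_freqs[b]) for a in names for b in names}

# Hypothetical studies with frequencies for two variants.
distances = distance_matrix({"study_A": [0.1, 0.5], "study_B": [0.4, 0.1]})
print(distances[("study_A", "study_B")])
```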

To better ensure robustness, for each phenotype, we filtered variants from the meta-analytic results to variants that were present in at least three studies, had an effective sample size (sample size multiplied by imputation accuracy) to maximum sample size ratio of ≥ 0.1, and minor allele frequency (MAF) > 0.001 in the multi-ancestry and EUR-stratified meta-analysis or MAF > 0.01 for AMR-stratified, AFR-stratified and EAS-stratified meta-analysis, given the expected drop off in imputation accuracy for those ancestries. These filters reduce potential artefacts arising from sparse data or poor imputation and retain variants with reasonable statistical power.
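The variant filters described above can be expressed as a single predicate. This is a sketch under simplifying assumptions: the analysis labels are hypothetical names, and the effective sample size is computed from one aggregate sample size and imputation accuracy per variant rather than being accumulated across studies.

```python
# MAF thresholds from the text; the dictionary keys are hypothetical labels.
MAF_THRESHOLDS = {"multi": 0.001, "EUR": 0.001, "AMR": 0.01, "AFR": 0.01, "EAS": 0.01}

def passes_filters(n_studies, sample_size, imputation_accuracy, max_sample_size, maf, analysis):
    """Return True if a variant meets the study-count, effective-N ratio and MAF filters."""
    effective_n = sample_size * imputation_accuracy
    return (
        n_studies >= 3
        and effective_n / max_sample_size >= 0.1
        and maf > MAF_THRESHOLDS[analysis]
    )

# The same variant passes in the EUR-stratified analysis but not in the AFR-stratified one.
print(passes_filters(5, 50_000, 0.9, 100_000, 0.005, "EUR"))
print(passes_filters(5, 50_000, 0.9, 100_000, 0.005, "AFR"))
```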

With increasing imputation accuracy and the inclusion of variants with MAF down to 0.1% (for EUR), genome-wide significant variants were identified using a threshold of P < 5 × 10⁻⁹, to account for approximately 10 million independent tests. The threshold was chosen based on previous work on low-frequency variants^5,46,47. All statistical tests are two-sided unless otherwise stated.

Robustness and replicability of signals

We applied genomic control correction for low-frequency variants (MAF < 1%) in both multi-ancestry and ancestry-stratified meta-analyses. Genomic control correction for common variants was not applied: the correction assumes sparsity, whereas elevated genomic control values are expected with high polygenicity and very large sample sizes⁴⁸, so such a correction may be overly conservative. To evaluate this decision, we estimated the replicability of associated loci using a trans-ancestry extension of an existing method⁶. This method, ‘RATES’, incorporates cohort-level summary statistics (single-nucleotide polymorphism (SNP) effect sizes and their corresponding standard errors), along with allele frequency-based MDS components per study to assign a posterior probability that each sentinel variant effect would replicate in a sufficiently powered study. To further evaluate robustness of our results, we estimated LD score regression (LDSC) intercepts and attenuation ratios to account for bias in the intercept test when sample sizes become extreme, as in the present case. Results were within expected limits and consistent with a limited effect of population stratification on the meta-analysis results⁴⁴ (Supplementary Table 8). Then, we compared the sign of SNP effect size estimates between EUR-stratified results and within-sibling GWAS results from the UK Biobank, finding sign concordance estimates of 63.4–80% across phenotypes, all of which were significantly higher than would be expected if our results were driven entirely by population stratification or cryptic relatedness and were consistent in magnitude with other large-scale association studies⁴⁹. Finally, given reduced power in the within-sibling GWAS, we additionally compared the sign of SNP effect size estimates between EUR-stratified 23andMe summary statistics (the largest participating cohort) and EUR-stratified summary statistics with all cohorts except 23andMe, finding sign concordance estimates of 94.3–100%. See the Supplementary Note for further details on the methods and full results, including the list of excluded variants and loci.
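A sign concordance estimate of the kind reported above is simply the fraction of variants whose effect estimates point in the same direction in two sets of results. The sketch below assumes both effect lists refer to the same variants and the same effect alleles; the function name and toy values are illustrative.

```python
def sign_concordance(effects_a, effects_b):
    """Fraction of variants with the same effect direction in two result sets."""
    # Variants with an exactly zero estimate in either set carry no direction and are skipped.
    pairs = [(a, b) for a, b in zip(effects_a, effects_b) if a != 0 and b != 0]
    same_direction = sum(1 for a, b in pairs if (a > 0) == (b > 0))
    return same_direction / len(pairs)

# Toy effect sizes (made up): three of four variants agree in sign.
print(sign_concordance([0.1, -0.2, 0.3, 0.05], [0.2, -0.1, -0.1, 0.01]))
```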

Conditional analyses and locus definitions

We performed sequential forward selection to identify independently associated variants in each locus⁵⁰ for ancestry-stratified and multi-ancestry results. The procedure begins by including only the top association signal into a set of independently associated variants (ϕ) per locus. Conditional analysis is then conducted on the remaining variants, conditioning on variants in ϕ. If any of these conditional signals remained significant (that is, P < 5 × 10⁻⁹), we added the top signal to the set ϕ. The process iterates until there are no remaining significantly associated variants. The method requires an external genomic reference panel to estimate LD patterns. For ancestry-stratified conditional analyses, we used ancestry-matched individuals from TOPMed to estimate LD (sample sizes given previously). For multi-ancestry conditional analyses, we used the diverse ancestry TOPMed reference panel (n = 104,976) that matched the ancestry proportions of the included cohorts.
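The control flow of the forward selection procedure can be sketched as below. The conditional association test itself requires summary statistics and a reference LD panel and is not reproduced here; `conditional_p` is a hypothetical callback standing in for it, and the toy lookup table is made up.

```python
P_THRESHOLD = 5e-9  # genome-wide significance threshold used in the text

def forward_select(marginal_p, conditional_p, threshold=P_THRESHOLD):
    """Build the set of independently associated variants for one locus.

    marginal_p: dict mapping variant -> marginal P value.
    conditional_p: hypothetical callable taking the list of selected variants and
        returning a dict of variant -> P value conditional on that list.
    """
    top = min(marginal_p, key=marginal_p.get)
    selected = [top]  # start from the top association signal
    while True:
        remaining = {v: p for v, p in conditional_p(selected).items() if v not in selected}
        if not remaining:
            break
        best = min(remaining, key=remaining.get)
        if remaining[best] >= threshold:
            break  # no remaining significant conditional signal
        selected.append(best)
    return selected

def toy_conditional(selected):
    # Made-up conditional P values keyed by the variants already selected.
    table = {("rs1",): {"rs2": 1e-10, "rs3": 0.2}, ("rs1", "rs2"): {"rs3": 0.5}}
    return table[tuple(selected)]

print(forward_select({"rs1": 1e-30, "rs2": 1e-12, "rs3": 1e-10}, toy_conditional))
```

In this toy locus rs3 is marginally significant but is explained by rs1, so only rs1 and rs2 are retained.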

Loci were defined based in part on the conditional analysis, using a multi-step approach. First, consistent with previous GWAS meta-analysis⁵ in EUR ancestries, we identified all 1-Mb windows surrounding sentinel variants and collapsed overlapping windows. This resulted in a total of 1,449 such windows. For each window, we then used our ancestry-aware conditional analysis⁵¹ (described previously) with an ancestry-matched reference panel from TOPMed to enumerate all independent variants within each window. Then, for each independent variant, we defined a locus as the region including all variants in LD of r² > 0.1, based on the same ancestry-matched TOPMed reference panel (Supplementary Table 3 and Supplementary Fig. 3). Overlapping loci were then collapsed. This procedure avoids conventional definitions of a locus based on work in EUR ancestries and is tailored to the multi-ancestry data at hand.
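The first step, collapsing overlapping windows around sentinel variants, amounts to interval merging per chromosome. The sketch below assumes a window extends 500 kb on either side of each sentinel (1 Mb in total); the text does not state whether the 1-Mb window is the total width or the flank, so `half_width` is an adjustable assumption, as are the toy coordinates.

```python
def merge_windows(sentinels, half_width=500_000):
    """Collapse overlapping windows around sentinel variants.

    sentinels: iterable of (chromosome, position) tuples.
    Returns a list of (chromosome, start, end) tuples.
    """
    windows = sorted(
        (chrom, max(0, pos - half_width), pos + half_width) for chrom, pos in sentinels
    )
    merged = []
    for chrom, start, end in windows:
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            previous = merged[-1]
            merged[-1] = (chrom, previous[1], max(previous[2], end))
        else:
            merged.append((chrom, start, end))
    return merged

# Toy sentinels (made up): the first two windows on chromosome 1 overlap.
print(merge_windows([("1", 1_000_000), ("1", 1_600_000), ("1", 5_000_000), ("2", 1_000_000)]))
```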

Allelic effect size moderation

We evaluated evidence of effect size moderation by ancestry in the multi-ancestry model for each independent variant. To do so, we extended the MEMO model into a mixture model that separated variants with homogeneous effects (models with only an intercept term) from those with possible heterogeneous effects (on at least one axis of genetic variation). We considered six sub-models including the null model, and the models in which the number of included components varied from 0 to 4.

$$L(y)=\prod _{a}\left\{{p}_{a}^{{\rm{NULL}}}\,p({b}_{j}|{\rm{NULL}})+{p}_{a}^{{\rm{ALT}}}\sum _{j\in {S}_{a}}\left[{q}_{j0}\,p({b}_{j}|M{R}_{0}(j))+\ldots +{q}_{j4}\,p({b}_{j}|M{R}_{4}(j))\right]\right\},$$

where $p({b}_{j}|{\rm{N}}{\rm{U}}{\rm{L}}{\rm{L}})$ and $p\left({b}_{j}| M{R}_{l}\right)$ are the likelihoods of the variant j effect sizes under the null model and the meta-regression models with l axes of genetic variation, respectively; ${p}_{a}^{{\rm{NULL}}}$ and ${p}_{a}^{{\rm{ALT}}}$ are the probabilities of locus a carrying zero or at least one causal variant, respectively. The term ${q}_{{jl}}$ is the probability that the model with l axes of genetic variation best fit the data. We selected the model with the largest posterior probability for each variant as the best-fitting model to capture the genetic effect heterogeneity. Variants in which the zero component model was selected (that is, all models with at least one component were rejected) were considered to have homogeneous effects across ancestry. Among the remaining variants, we considered which one of the meta-regression models (that is, 1–4 components) best described the extent of effect heterogeneities based on the posterior probabilities for each model. In addition, we required that strongly heterogeneous variants had an MDS component effect that was significantly different from zero and were polymorphic in two or more ancestry-stratified cohorts to ease interpretation of heterogeneous effects. For example, a variant in which the model with two components best fit the data was considered at least weakly heterogeneous. If this variant also had a component two effect significantly different than zero (${\gamma }_{j2}\ne 0,$ from above) and was polymorphic in at least two ancestries, it was considered strongly heterogeneous.

Fine-mapping

On the basis of the selected genetic effect model (above), for each variant in a locus, we calculated the approximate Bayes factor as ${\varLambda }_{j}={\rm{\exp }}\left[\frac{{X}_{j}-\left(T+1\right){\rm{\log }}K}{2}\right]$, where ${X}_{j}$ denotes the chi-squared test statistic for variant j, T denotes the number of axes of genetic variation included in the best-fitting model (that is, 0–4 MDS components) and K denotes the number of studies contributing to the GWAS. Using the approximate Bayes factor, we then calculated the posterior inclusion probability (PIP) for each variant as ${\pi }_{j}=\frac{{\varLambda }_{j}}{{\sum }_{i}{\varLambda }_{i}}$, where i indexes the variants in the locus. Finally, we derived 90% credible intervals by ranking variants within a locus by their posterior inclusion probability and selecting variants until the cumulative posterior inclusion probability reached 0.90.
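The fine-mapping calculation for one locus can be sketched as follows. The code works with log Bayes factors and subtracts the maximum before exponentiating, a numerical-stability choice made here rather than a step taken from the text; function names and toy statistics are illustrative.

```python
import math

def posterior_inclusion_probs(chi2_stats, n_axes, n_studies):
    """PIPs for the variants in one locus.

    chi2_stats: chi-squared statistic X_j per variant.
    n_axes: number of MDS axes T in the best-fitting model, per variant.
    n_studies: number of contributing studies K.
    """
    log_bf = [
        (x - (t + 1) * math.log(n_studies)) / 2.0 for x, t in zip(chi2_stats, n_axes)
    ]
    largest = max(log_bf)
    scaled = [math.exp(value - largest) for value in log_bf]  # avoids overflow
    total = sum(scaled)
    return [s / total for s in scaled]

def credible_set(pips, coverage=0.90):
    """Indices of variants, ranked by PIP, until cumulative PIP reaches the coverage."""
    order = sorted(range(len(pips)), key=lambda i: pips[i], reverse=True)
    chosen, cumulative = [], 0.0
    for i in order:
        chosen.append(i)
        cumulative += pips[i]
        if cumulative >= coverage:
            break
    return chosen

# Toy locus (made up): three variants, intercept-only model, 20 studies.
pips = posterior_inclusion_probs([40.0, 30.0, 10.0], [0, 0, 0], 20)
print(credible_set(pips))
```

When every variant in a locus shares the same T, the penalty term cancels in the ratio and the PIPs depend only on the chi-squared statistics.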

For EUR-stratified fine-mapping, we approximated the Bayes factor as above with T set to 0. Fine-mapping was conducted in EUR-stratified results, using identical loci as in multi-ancestry fine-mapping, to describe the increased resolution attributable to diverse ancestry inclusion and differences in sample size.

Functional enrichment analysis was conducted to test whether high-priority genes identified in the fine-mapping results were expressed in specific tissue types or enriched in certain cell types or gene pathways. High-priority genes were defined as those mapped from variants in credible intervals containing fewer than five variants. That is, for each variant in credible intervals with fewer than five variants, we used the UCSC genome annotation database to assign genes. We assigned intergenic variants to the nearest gene. We mapped genes from variants with PIP < 0.01 (as ‘control’ genes) in the same way. Functional enrichment was then evaluated by estimating a relative risk (as described and implemented previously⁵²), defined as the ratio of the proportion of genes mapped from variants in credible intervals with fewer than five variants that are in a given annotation category to the proportion of genes mapped from variants, within associated loci, with PIP < 0.01 in the same annotation category. Annotation categories were derived from GTEx tissue expression⁵³, central nervous system cell types⁵⁰ and gene pathways⁵⁴.
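The relative risk defined above is a ratio of two proportions. A minimal sketch, with made-up gene labels and no uncertainty estimate, is:

```python
def enrichment_relative_risk(priority_genes, control_genes, annotated_genes):
    """Ratio of the annotated proportion among high-priority genes to that among control genes."""
    priority_rate = len(priority_genes & annotated_genes) / len(priority_genes)
    control_rate = len(control_genes & annotated_genes) / len(control_genes)
    return priority_rate / control_rate

# Hypothetical gene sets: 2 of 4 priority genes and 1 of 8 control genes are annotated.
priority = {"G1", "G2", "G3", "G4"}
control = {"G5", "G6", "G7", "G8", "G9", "G10", "G11", "G12"}
annotated = {"G1", "G2", "G5"}
print(enrichment_relative_risk(priority, control, annotated))
```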

TWAS

TWAS were performed using a trans-ancestry method. In brief, this method fits a series of meta-regression models including the first four axes of genetic variation (MDS components), similar to that of our multi-ancestry meta-analysis model minus the random-effect term. Genetic effect estimates from these four models were then used to estimate phenotypic effects of each variant. Together, with variant weights taken from PrediXcan⁵⁵ based on 49 tissues from GTEx¹⁰ release version 8 (which includes up to 15% of individuals of non-EUR ancestry), the phenotypic effect estimates were used to construct a single TWAS statistic for each MDS component. A minimum P value approach⁵⁶ was then applied to combine all four TWAS statistic P values. Finally, we used a Cauchy combination test⁵⁷ to combine P values across all available tissues for each gene. The final, combined P value was subjected to a Bonferroni correction for 22,121 genes in 49 tissues. We present our TWAS results based on per gene P values combined across all available tissues, resulting in a 5 (phenotype) × 22,121 (gene) matrix of P values. Pathway enrichment was also conducted using a weighted regression approach⁵⁸ with the TWAS per-tissue P values to quantify the enrichment of identified genes in each pathway.
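The Cauchy combination step can be sketched as below, following the published form of the test statistic (a weighted sum of tan((0.5 − p)π) terms, referred to a standard Cauchy distribution). Equal weights are assumed here because the text does not state the weights used; the function name and toy P values are illustrative.

```python
import math

def cauchy_combination(p_values, weights=None):
    """Combine P values with the Cauchy combination test (equal weights by default)."""
    if weights is None:
        weights = [1.0 / len(p_values)] * len(p_values)
    statistic = sum(w * math.tan((0.5 - p) * math.pi) for w, p in zip(weights, p_values))
    # Normalise by the weight sum so the statistic is standard Cauchy under the null.
    return 0.5 - math.atan(statistic / sum(weights)) / math.pi

# Toy per-tissue P values (made up) for one gene.
print(cauchy_combination([0.001, 0.2, 0.6]))
```

A single P value is returned unchanged, and identical P values combine to the same value, which gives a quick sanity check of an implementation.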

Heritability and genetic correlations

LDSC⁵⁹ was used to estimate heritability of our five phenotypes for EAS and EUR ancestries using a standard 1-cM window size. For ancestries with more recent admixture (AFR and AMR ancestries), we used covariate-adjusted LDSC⁶⁰ for the same analyses in which in-sample LD scores were calculated using ancestry-matched TOPMed reference samples and adjusted by the first 50 principal components. For more recently admixed AFR and AMR ancestries, which tend to show longer-range LD, we used a 20-cM window size when calculating LD scores. For both LDSC and covariate-adjusted LDSC, variants were subset to HapMap3 (ref. ⁶¹) with MAF > 0.05, as recommended for this approach.

We calculated genetic correlations between our five phenotypes and 4,065 UK Biobank phenotypes (both restricted to EUR ancestry) using bivariate LDSC with 1000 Genomes-based pre-calculated EUR LD scores for HapMap3 variants. We excluded phenotypes with heritability Z-scores less than 3 (reflecting near-zero heritability); phenotypes with genetic correlations with our phenotypes less than −0.8 or greater than 0.8, to remove phenotypes approaching redundancy with our target tobacco and alcohol use measures (for example, cigarettes per day versus packs per day); and phenotypes whose genetic correlations could not be estimated, largely owing to negative heritability estimates. This left 1,141 UK Biobank phenotypes. Affinity propagation clustering⁶², a message-passing algorithm based on exemplars that identifies their corresponding set of clusters, was then used to further interpret the pattern of genetic correlations and multifactorial nature of substance use. A Bonferroni-corrected P value threshold for 1,141 UK Biobank phenotypes was used to identify genetic correlations that were significantly different from zero.

Polygenic scoring

Polygenic risk scores were computed using LDpred for each ancestry group separately, an approach that incorporates the correlation between genetic variants to re-weight effect size estimates⁶³. We used an independent prediction cohort, Add Health³⁵, to validate each score. Add Health is a nationally representative sample of US adolescents enrolled in grades 7 through 12 during the 1994–1995 school year. The mean birth year of respondents was 1979 (s.d. = 1.8) and the mean age at assessment (here, wave 4) was 29.0 years (s.d. = 1.8), which is comparable, in general, to the age of participants in the 23andMe cohort but younger, on average, than those in other cohorts. Add Health is composed of individuals from the same four major ancestral groups (defined with reference to 1000 Genomes; see Supplementary Note for details) comprising our ancestry-stratified results (EUR, AFR, AMR and EAS). Phenotypic descriptive statistics are given in Supplementary Table 12. Across the full Add Health sample, approximately 41% had ever smoked regularly and reported an average of 7.3 cigarettes per day. For each polygenic score, we used only HapMap3 variants and those with MAF > 0.01. We used each Add Health ancestry group as its own LD reference panel for construction of each polygenic score, after removing related individuals, except for EAS, for which we used 1000 Genomes owing to the small sample size in Add Health.

Prediction accuracy of each polygenic score was estimated by taking the difference in the coefficient of determination (R²) between a base model that included only the covariates of age, sex, age × sex interaction, and the first ten genetic principal components, and a full model that additionally included the polygenic score. All scores were scaled to have a mean of zero and standard deviation of one.
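The incremental R² described above can be sketched with ordinary least squares for a continuous phenotype. This is an illustration only: it uses a small pure-Python solver of the normal equations, a single covariate instead of the full covariate set, and made-up data; how R² was obtained for binary phenotypes is not stated in this section and is not addressed here.

```python
def ols_r2(columns, y):
    """R^2 of an ordinary least squares fit of y on the predictor columns plus an intercept."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(design[0])
    # Augmented normal equations [X'X | X'y].
    aug = [
        [sum(design[r][i] * design[r][j] for r in range(n)) for j in range(k)]
        + [sum(design[r][i] * y[r] for r in range(n))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        pivot = max(range(c, k), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(k):
            if r != c:
                factor = aug[r][c] / aug[c][c]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[c])]
    beta = [aug[i][k] / aug[i][i] for i in range(k)]
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in design]
    mean_y = sum(y) / n
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot

def incremental_r2(covariates, score, y):
    """Difference in R^2 between the full model (covariates + score) and the base model."""
    return ols_r2(covariates + [score], y) - ols_r2(covariates, y)

# Made-up data: the phenotype depends only on the polygenic score.
age = [20.0, 30.0, 40.0, 50.0, 60.0, 35.0]
score = [-1.0, 0.5, 1.0, -0.5, 0.0, 2.0]
phenotype = [1.0 + 2.0 * s for s in score]
print(incremental_r2([age], score, phenotype))
```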

URLs for software use

Ethics

Ethical review and approval were provided by the University of Minnesota institutional review board. All human participants provided informed consent.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

GWAS summary statistics can be downloaded online (https://doi.org/10.13020/przg-dp88) with more information available here: https://genome.psych.umn.edu/index.php/GSCAN. We have provided association results for variants that passed quality-control filters in the multi-ancestry and ancestry-stratified results for each of the five substance use phenotypes, excluding data provided by 23andMe. Ancestry-stratified polygenic score weights based on ancestry-stratified summary statistics are also provided. 23andMe results are available directly from the company.

Code availability

All software used to perform these analyses is publicly available. Software tools used are listed in the main text and Methods.

Change history

26 January 2023
An amendment to the underlying article code was made to enable an author name to appear correctly in PubMed.

References

World Health Organization. Tobacco. WHO https://www.who.int/news-room/fact-sheets/detail/tobacco (2022).
World Health Organization. Alcohol. WHO https://www.who.int/news-room/fact-sheets/detail/alcohol (2022).
World Health Organization. The top 10 causes of death. WHO https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death (2020).
Griswold, M. G. et al. Alcohol use and burden for 195 countries and territories, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet 392, 1015–1035 (2018).
Liu, M. et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat. Genet. 51, 237–244 (2019).
McGuire, D. et al. Model-based assessment of replicability for genome-wide association meta-analysis. Nat. Commun. 12, 1964 (2021).
Volkow, N. D., Koob, G. F. & McLellan, A. T. Neurobiologic advances from the brain disease model of addiction. N. Engl. J. Med. 374, 363–371 (2016).
Koob, G. F. & Volkow, N. D. Neurocircuitry of Addiction. Neuropsychopharmacology 35, 217–238 (2010).
Bierut, L. J. et al. ADH1B is associated with alcohol dependence and alcohol consumption in populations of European and African ancestry. Mol. Psychiatry 17, 445–450 (2012).
The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Horn, J. L. A rationale and test for the number of factors in factor analysis. Psychometrika 30, 179–185 (1965).
Berrettini, W. H. & Doyle, G. A. The CHRNA5–A3–B4 gene cluster in nicotine addiction. Mol. Psychiatry 17, 856–866 (2012).
Buchta, W. C. et al. Dynamic CRMP2 regulation of CaV2.2 in the prefrontal cortex contributes to the reinstatement of cocaine seeking. Mol. Neurobiol. 57, 346–357 (2020).
Andrade, A. et al. Genetic associations between voltage-gated calcium channels and psychiatric disorders. Int. J. Mol. Sci. 20, 3537 (2019).
Purcell, S. M. et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009).
Moskvina, V. et al. Gene-wide analyses of genome-wide association data sets: evidence for multiple common risk alleles for schizophrenia and bipolar disorder and for overlap in genetic risk. Mol. Psychiatry 14, 252–260 (2009).
Liao, X. & Li, Y. Genetic associations between voltage-gated calcium channels and autism spectrum disorder: a systematic review. Mol. Brain 13, 96 (2020).
Koskela, M. et al. Update of neurotrophic factors in neurobiology of addiction and future directions. Neurobiol. Dis. 97, 189–200 (2017).
Domanskyi, A., Saarma, M. & Airavaara, M. Prospects of neurotrophic factors for Parkinson’s disease: comparison of protein and gene therapy. Hum. Gene Ther. 26, 550–559 (2015).
Zhang, K., Wang, Y., Fan, T., Zeng, C. & Sun, Z. S. The p21-activated kinases in neural cytoskeletal remodeling and related neurological disorders. Protein Cell 13, 6–25 (2020).
Civiero, L. & Greggio, E. PAKs in the brain: function and dysfunction. Biochim. Biophys. Acta 1864, 444–453 (2018).
Nekrasova, T., Jobes, M. L., Ting, J. H., Wagner, G. C. & Minden, A. Targeted disruption of the Pak5 and Pak6 genes in mice leads to deficits in learning and locomotion. Dev. Biol. 322, 95–108 (2008).
Landek-Salgado, M. A., Faust, T. E. & Sawa, A. Molecular substrates of schizophrenia: homeostatic signaling to connectivity. Mol. Psychiatry 21, 10–28 (2016).
Civiero, L. et al. Leucine-rich repeat kinase 2 interacts with p21-activated kinase 6 to control neurite complexity in mammalian brain. J. Neurochem. 135, 1242–1256 (2015).
Ma, Q.-L. et al. p21-Activated kinase-aberrant activation and translocation in Alzheimer disease pathogenesis. J. Biol. Chem. 283, 14132–14143 (2008).
Buchsbaum, I. Y. et al. ECE2 regulates neurogenesis and neuronal migration during human cortical development. EMBO Rep. 21, e48204 (2020).
Mzhavia, N., Pan, H., Che, F.-Y., Fricker, L. D. & Devi, L. A. Characterization of endothelin-converting enzyme-2. Implication for a role in the nonclassical processing of regulatory peptides. J. Biol. Chem. 278, 14704–14711 (2003).
Baranello, R. J. et al. Amyloid-β protein clearance and degradation (ABCD) pathways and their role in Alzheimer’s disease. Curr. Alzheimer Res. 12, 32–46 (2015).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Bodenhofer, U., Kothmeier, A. & Hochreiter, S. APCluster: an R package for affinity propagation clustering. Bioinformatics 27, 2463–2464 (2011).
Mallard, T. T. et al. Item-level genome-wide association study of the alcohol use disorders identification test in three population-based cohorts. Am. J. Psychiatry 179, 58–70 (2022).
Zhou, H. et al. Genome-wide meta-analysis of problematic alcohol use in 435,563 individuals yields insights into biology and relationships with other traits. Nat. Neurosci. 23, 809–818 (2020).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Kraft, P., Chen, H. & Lindström, S. The use of genetic correlation and Mendelian randomization studies to increase our understanding of relationships between complex traits. Curr. Epidemiol. Rep. 7, 104–112 (2020).
Harris, K. M. et al. Cohort profile: the National Longitudinal Study of Adolescent to Adult Health (Add Health). Int. J. Epidemiol. 48, 1415–1415k (2019).
Martin, A. R. et al. Human demographic history impacts genetic risk prediction across diverse populations. Am. J. Hum. Genet. 100, 635–649 (2017).
Hermalin, L. The Age Prevalence of Smoking among Chinese Women: A Case of Arrested Diffusion (Population Studies Center, 2010).
Flor, L. S., Reitsma, M. B., Gupta, V., Ng, M. & Gakidou, E. The effects of tobacco control policies on global smoking prevalence. Nat. Med. 27, 239–243 (2021).
Burton, R. et al. A rapid evidence review of the effectiveness and cost-effectiveness of alcohol control policies: an English perspective. Lancet 389, 1558–1580 (2017).
Mathieson, I. The omnigenic model and polygenic prediction of complex traits. Am. J. Hum. Genet. 108, 1558–1563 (2021).
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
1000 Genomes Project Consortium et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Zhan, X., Hu, Y., Li, B., Abecasis, G. R. & Liu, D. J. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics 32, 1423–1426 (2016).
Loh, P.-R., Kichaev, G., Gazal, S., Schoech, A. P. & Price, A. L. Mixed-model association for biobank-scale datasets. Nat. Genet. 50, 906–908 (2018).
Zhou, W. et al. Efficiently controlling for case–control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).
Chen, Z. & Liu, Q. A new approach to account for the correlations among single nucleotide polymorphisms in genome-wide association studies. Hum. Hered. 72, 1–9 (2011).
Gao, X., Becker, L. C., Becker, D. M., Starmer, J. D. & Province, M. A. Avoiding the high Bonferroni penalty in genome-wide association studies. Genet. Epidemiol. 34, 100–105 (2010).
Yang, J. et al. Genomic inflation factors under polygenic inheritance. Eur. J. Hum. Genet. 19, 807–812 (2011).
Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018).
Bryois, J. et al. Genetic identification of cell types underlying brain complex traits yields insights into the etiology of Parkinson’s disease. Nat. Genet. 52, 482–493 (2020).
Jiang, Y. et al. Proper conditional analysis in the presence of missing data: application to large scale meta-analysis of tobacco use phenotypes. PLoS Genet. 14, e1007452 (2018).
Kanai, M. et al. Insights from complex trait fine-mapping across diverse populations. Preprint at medRxiv https://doi.org/10.1101/2021.09.03.21262975 (2021).
Lonsdale, J. et al. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 49, D325–D334 (2021).
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Lin, D.-Y. & Tang, Z.-Z. A general framework for detecting disease associations with rare variants in sequencing studies. Am. J. Hum. Genet. 89, 354–367 (2011).
Liu, Y. & Xie, J. Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures. J. Am. Stat. Assoc. 115, 393–402 (2020).
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
Bulik-Sullivan, B. K. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Luo, Y. et al. Estimating heritability and its enrichment in tissue-specific gene sets in admixed populations. Hum. Mol. Genet. 30, 1521–1534 (2021).
Altshuler, D. M. et al. Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 (2010).
Frey, B. J. & Dueck, D. Clustering by passing messages between data points. Science 315, 972–976 (2007).
Vilhjálmsson, B. J. et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576–592 (2015).

Acknowledgements

This study was designed and carried out by the GWAS and Sequencing Consortium of Alcohol and Nicotine use (GSCAN). It was conducted by using the UK Biobank Resource under application number 16651. This study was supported by funding from US National Institutes of Health awards R56HG011035, R01DA044283, R01DA042755 and U01DA041120 to S.V., and R01GM126479, R56HG011035, R03OD032630, R01HG011035 and R56HG012358 to D.J.L. G.R.B.S. was also supported by National Institutes of Health award T32DA050560. D.J.L. and X.W. were in part supported by the Penn State College of Medicine’s Biomedical Informatics and Artificial Intelligence Program in the Strategic Plan. A full list of acknowledgements is provided in the Supplementary Note.

Author information

These authors contributed equally: Gretchen R.B. Saunders, Xingyan Wang, Fang Chen, Seon-Kyeong Jang, Mengzhen Liu, Chen Wang
These authors jointly supervised this work: Dajiang J. Liu, Scott Vrieze

Authors and Affiliations

Department of Psychology, University of Minnesota, Minneapolis, MN, USA
Gretchen R. B. Saunders, Seon-Kyeong Jang, Mengzhen Liu, Jacqueline M. Otto, William Iacono, James J. Lee, Matt McGue, Moin Syed & Scott Vrieze
Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
Xingyan Wang, Fang Chen, Chen Wang, Shuang Gao, Chachrit Khunsriraksakul, Bibo Jiang & Dajiang J. Liu
Department of Epidemiology & Population Health at Stanford University, Stanford, CA, USA
Yu Jiang
Jackson Heart Study (JHS) Graduate Training and Education Center (GTEC), Department of Epidemiology and Biostatistics, School of Public Health, Jackson State University, Jackson, MS, USA
Clifton Addison
Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Masato Akiyama, Yoichiro Kamatani & Nana Matoba
Department of Ocular Pathology and Imaging Science, Kyushu University Graduate School of Medical Sciences, Fukuoka, Japan
Masato Akiyama
Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Christine M. Albert
Division of Preventive Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Christine M. Albert & Daniel I. Chasman
Department of Psychiatry, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
Fazil Aliev & Danielle M. Dick
Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Alvaro Alonso
Dean’s Office and Department of Epidemiology, College of Public Health, University of Kentucky, Lexington, KY, USA
Donna K. Arnett
Department of Medicine and Duke Comprehensive Sickle Cell Center, Duke University School of Medicine, Durham, NC, USA
Allison E. Ashley-Koch, Melanie E. Garrett & Marilyn J. Telen
Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, NC, USA
Allison E. Ashley-Koch & Melanie E. Garrett
Division of Hematology, Department of Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
Aneel A. Ashrani
Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Kathleen C. Barnes, Meher Preethi Boorgula, Sameer Chavan & Nicholas Rafaels
Tempus, Chicago, IL, USA
Kathleen C. Barnes
Department of Medicine, Columbia University Medical Center, New York, NY, USA
R. Graham Barr & Elizabeth C. Oelsner
Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
Traci M. Bartz, Joshua C. Bis, Jennifer A. Brody, Sina A. Gharib & Nona Sotoodehnia
Department of Biostatistics, University of Washington, Seattle, WA, USA
Traci M. Bartz & Charles Kooperberg
Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Diane M. Becker & Lisa R. Yanek
Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
Lawrence F. Bielak, Sharon L. R. Kardia, Patricia A. Peyser, Jennifer A. Smith & Wei Zhao
Department of Medicine, Boston Medical Center, Boston University School of Medicine, Boston, MA, USA
Emelia J. Benjamin
Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
Emelia J. Benjamin
deCODE Genetics/Amgen, Inc., Reykjavik, Iceland
Gyda Bjornsdottir, Daniel F. Gudbjartsson, Hreinn Stefansson, Kari Stefansson & Thorgeir E. Thorgeirsson
Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
John Blangero, Joanne E. Curran & Ravindranath Duggirala
Department of Medicine, University of Arizona, Tucson, AZ, USA
Eugene R. Bleecker & Deborah A. Meyers
Institute of Behavioral Science, University of Colorado Boulder, Boulder, CO, USA
Jason D. Boardman
Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Eric Boerwinkle, Myriam Fornage, Alanna C. Morrison & Bing Yu
Netherlands Twin Register, Dept Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Dorret I. Boomsma, JoukeJan Hottenga & Gonneke Willemsen
Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
Donald W. Bowden & Nicholette D. Palmer
Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
Brian E. Cade, Susan Redline & Michael N. Rueschman
Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
Brian E. Cade & Susan Redline
Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
Brian E. Cade
Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Yii-Der Ida Chen
Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Zhengming Chen, Kuang Lin, Iona Y. Millwood & Robin G. Walters
MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Zhengming Chen, Iona Y. Millwood & Robin G. Walters
Department of Epidemiology & Biostatistics, University of California, San Francisco, CA, USA
Iona Cheng & Yuqing Li
UCSF Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, CA, USA
Iona Cheng
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Michael H. Cho & Matthew Moll
Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Michael H. Cho & Matthew Moll
Kaiser Permanente Northern California (KPNC), Division of Research, Oakland, CA, USA
Hélène Choquet, Khanh K. Thai & Jie Yin
Department of Neurology, Baltimore Veterans Affairs Medical Center, Baltimore, MD, USA
John W. Cole
Division of Vascular Neurology, Department of Neurology, University of Maryland School of Medicine, Baltimore, MD, USA
John W. Cole
Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Marilyn C. Cornelis, Lifang Hou & Yoonjung Y. Joo
University of Sassari, Sassari SS, Italy
Francesco Cucca
Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
Mariza de Andrade
Department of Psychiatry, University of Utah School of Medicine, Salt Lake City, UT, USA
Anna R. Docherty
Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Virginia, USA
Anna R. Docherty & Nathan Gillespie
Huntsman Mental Health Institute, Salt Lake City, UT, USA
Anna R. Docherty
Department of Family Medicine, Brown University, Providence, RI, USA
Charles B. Eaton
Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
Marissa A. Ehringer, John K. Hewitt, Matthew C. Keller, Michael C. Stallings & Jerry A. Stitzel
Department of Integrative Physiology, University of Colorado Boulder, Boulder, CO, USA
Marissa A. Ehringer & Reedik Mägi
Institute of Genomics, University of Tartu, Tartu, Estonia
Tõnu Esko, Toomas Haller & Andres Metspalu
Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Jessica D. Faul, Jennifer A. Smith & David R. Weir
Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
Lilian Fernandes Silva
Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy
Edoardo Fiorillo, Antonella Mulas & Valeria Orrù
Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
Myriam Fornage
Department of Internal Medicine-Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA
Barry I. Freedman
K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
Maiken E. Gabrielsen, Kristian Hveem, Jonas B. Nielsen, Anne Heidi Skogholt & Bendik S. Winsvold
Division of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, University of Washington, Seattle, WA, USA
Sina A. Gharib
Center for Lung Biology, Department of Medicine, University of Washington, Seattle, WA, USA
Sina A. Gharib
Research Unit Molecular Epidemiology, Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Christian Gieger
Department of Psychiatry & Behavioral Sciences, Boston Children’s Hospital & Harvard Medical School, Boston, MA, USA
David C. Glahn
Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Australia
Scott D. Gordon, Nicholas G. Martin & John B. Whitfield
Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA
Charles C. Gu & Nancy L. Saccone
Department of Epidemiology and Key Laboratory of Cardiovascular Epidemiology, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Dongfeng Gu
School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
Daniel F. Gudbjartsson
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Xiuqing Guo, Jerome I. Rotter & Kent D. Taylor
Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Jeffrey Haessler, Robert C. Kaplan, Charles Kooperberg, Ulrike Peters & Alex P. Reiner
Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
Michael E. Hall
Department of Sociology and the Carolina Population Center, University of North Carolina, Chapel Hill, NC, USA
Kathleen Mullan Harris
Department of Epidemiology, Tulane University, New Orleans, LA, USA
Jiang He, Tanika N. Kelly & Xiao Sun
Translational Sciences Institute, Tulane University, New Orleans, LA, USA
Jiang He & Tanika N. Kelly
McCourt School of Public Policy, Georgetown University, Washington, DC, USA
Pamela Herd
Department of Psychology and Neuroscience, University of Colorado Boulder, Boulder, CO, USA
John K. Hewitt, Matthew C. Keller & Michael C. Stallings
Youth Mental Health & Technology Team, Brain and Mind Centre, University of Sydney, Sydney, Australia
Ian Hickie
Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
Bertha Hidalgo & Marguerite R. Irvin
Department of Epidemiology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
John E. Hokanson, Nicole E. Richmond & Kendra A. Young
Department of Psychiatry, University of Colorado Anschutz Medical Center, Denver, CO, USA
Christian Hopfer
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Hongyan Huang, Yon Ho Jee, Peter Kraft & Constance Turman
Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Hongyan Huang, Peter Kraft & Constance Turman
Institute of Preventive Medicine, National Defense Medical Center, New Taipei City, Taiwan
Yi-Jen Hung
Nuffield Department of Population Health, University of Oxford, Oxford, UK
David J. Hunter
HUNT Research Center, Department of Public Health and Nursing, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
Kristian Hveem
Department of Research, Innovation and Education, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway
Kristian Hveem
Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
Shih-Jen Hwang, Daniel Levy & Jiantao Ma
Section of Endocrinology and Metabolism, Department of Medicine, Taipei Veterans General Hospital, Taipei, Taiwan
Chii-Min Hwu
GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, NC, USA
Eric O. Johnson, Ravi Mathur & Dana B. Hancock
Fellow Program, RTI International, Research Triangle Park, NC, USA
Eric O. Johnson
Institute of Data Science, Korea University, Seoul, South Korea
Yoonjung Y. Joo
Regeneron Genetics Center, Tarrytown, NY, USA
Eric Jorgenson
Department of Population Health Sciences, Geisinger, Danville, PA, USA
Anne E. Justice
Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Anne E. Justice & Kari E. North
Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
Yoichiro Kamatani
Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
Robert C. Kaplan
Institute for Molecular Medicine Finland - FIMM, University of Helsinki, Helsinki, Finland
Jaakko Kaprio, Tellervo Korhonen & Teemu Palviainen
Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO, USA
Kenneth Krauter
Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
Johanna Kuusisto & Markku Laakso
Center for Medicine and Clinical Research, Kuopio University Hospital, Kuopio, Finland
Johanna Kuusisto
Brigham and Women’s Hospital, Department of Medicine, Channing Division of Network Medicine, Boston, MA, USA
Jessica Lasky-Su & Scott T. Weiss
Department of Medical Research, Taichung Veterans General Hospital, Taichung City, Taiwan
Wen-Jane Lee
Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
Liming Li
Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Kevin Li & Anita Pandit
Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Australia
Penelope A. Lind & Sarah E. Medland
School of Biomedical Sciences, Faculty of Medicine, University of Queensland, Brisbane, Australia
Penelope A. Lind
School of Biomedical Sciences, Queensland University of Technology, Brisbane, Australia
Penelope A. Lind
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Chunyu Liu
Departments of Preventive Medicine, Medicine, and Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Donald M. Lloyd-Jones
Department of Population Medicine, Harvard Pilgrim Health Care Institute, Boston, MA, USA
Sharon M. Lutz
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Sharon M. Lutz
Division of Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA, USA
Jiantao Ma
Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, VA, USA
Ani Manichaikul & Stephen S. Rich
Department of Genetics, UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Nana Matoba
Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Patrick F. McArdle, Braxton D. Mitchell, May E. Montasser & Huichun Xu
Department of Integrative Physiology, University of Colorado, Boulder, CO, USA
Matthew B. McQueen
Geriatrics Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD, USA
Braxton D. Mitchell
Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
Karen L. Mohlke
Department of Internal Medicine, Division of Cardiovascular Medicine, University of Michigan, Ann Arbor, MI, USA
Jonas B. Nielsen
Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Yukinori Okada
Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
Yukinori Okada
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
Yukinori Okada
Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
Yukinori Okada
Population Sciences of the Pacific Program, University of Hawaii Cancer Center, Honolulu, HI, USA
S. Lani Park
Department of Epidemiology, University of Washington, Seattle, WA, USA
Ulrike Peters, Alex P. Reiner & Nicholas L. Smith
Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Annette Peters
Institute for Medical Information Processing, Biometry and Epidemiology, Ludwig Maximilians University Munich, Munich, Germany
Annette Peters
German Centre for Cardiovascular Research, DZHK, Partner Site Munich, Munich, Germany
Annette Peters
Department of Clinical Developmental Psychology, Vrije Universiteit, Amsterdam, The Netherlands
Tinca J. C. Polderman
Department of Child and Adolescent Psychiatry, Amsterdam UMC, Amsterdam, The Netherlands
Tinca J. C. Polderman
Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Susan Redline
Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Robert M. Reed
Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, USA
John P. Rice & Laura J. Bierut
Center for Demography of Health and Aging, University of Wisconsin-Madison, Madison, WI, USA
Carol Roan & Kamil Sicinski
SAA-National Center of Addiction Medicine, Vogur Hospital, Reykjavik, Iceland
Valgerdur Runarsdottir & Thorarinn Tyrfingsson
Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
Nancy L. Saccone
Division of Pulmonary Sciences and Critical Care Medicine; Department of Medicine and Immunology, University of Colorado, Aurora, CO, USA
David A. Schwartz
Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, La Jolla, CA, USA
Aladdin H. Shadyab
23andMe, Inc, Sunnyvale, CA, USA
Jingchunzi Shi & Suyash S. Shringarpure
Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
Nicholas L. Smith
Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA, USA
Nicholas L. Smith
Division of Cardiology, Department of Medicine, University of Washington, Seattle, WA, USA
Nona Sotoodehnia
Faculty of Medicine, University of Iceland, Reykjavik, Iceland
Kari Stefansson
COPD Foundation, Washington, DC, USA
Ruth Tal-Singer
MRC Integrative Epidemiology Unit, Population Health Sciences, University of Bristol, Bristol, UK
Amy E. Taylor, Luisa Zuccolo & Marcus R. Munafò
National Institute for Health Research Biomedical Research Centre at the University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK
Amy E. Taylor & Marcus R. Munafò
Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Amy E. Taylor & Luisa Zuccolo
Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
Hemant Tiwari
Department of Psychiatry, University of California San Diego, San Diego, CA, USA
Tamara L. Wall
Jackson Heart Study Undergraduate Training and Education Center, Tougaloo College, Tougaloo, MS, USA
Wendy B. White
Department of Medicine, University of Washington, Seattle, WA, USA
Kerri L. Wiggins
Department of Internal Medicine, Division of Cardiology, University of Michigan, Ann Arbor, MI, USA
Cristen J. Willer
Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
Cristen J. Willer
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Cristen J. Willer & Wei Zhou
Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital, Oslo, Norway
Bendik S. Winsvold
Department of Neurology, Oslo University Hospital, Oslo, Norway
Bendik S. Winsvold
Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Kristin L. Young
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Wei Zhou
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Sebastian Zöllner
Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
Sebastian Zöllner
Health Data Science Centre, Fondazione Human Technopole, Milan, Italy
Luisa Zuccolo
Department of Population Health Sciences, University of Leicester, Leicester, UK
Chiara Batini
Oregon Research Institute, Springfield, OR, USA
Andrew W. Bergen
BioRealm, LLC, Walnut, CA, USA
Andrew W. Bergen
Outcomes Research Network & Department of Family Medicine, NorthShore University HealthSystem, Evanston, IL, USA
Sean P. David
Department of Family Medicine, University of Chicago, Chicago, IL, USA
Sean P. David
Department of Medicine, Université de Montréal, Montréal, Québec, Canada
Sarah A. Gagliano Taliun
Department of Neurosciences, Université de Montréal, Montréal, Québec, Canada
Sarah A. Gagliano Taliun
Research Centre, Montréal Heart Institute, Montréal, Québec, Canada
Sarah A. Gagliano Taliun
School of Psychological Science, University of Bristol, Bristol, UK
Marcus R. Munafò

Authors

Gretchen R. B. Saunders
Xingyan Wang
Fang Chen
Seon-Kyeong Jang
Mengzhen Liu
Chen Wang
Shuang Gao
Yu Jiang
Chachrit Khunsriraksakul
Jacqueline M. Otto
Clifton Addison
Masato Akiyama
Christine M. Albert
Fazil Aliev
Alvaro Alonso
Donna K. Arnett
Allison E. Ashley-Koch
Aneel A. Ashrani
Kathleen C. Barnes
R. Graham Barr
Traci M. Bartz
Diane M. Becker
Lawrence F. Bielak
Emelia J. Benjamin
Joshua C. Bis
Gyda Bjornsdottir
John Blangero
Eugene R. Bleecker
Jason D. Boardman
Eric Boerwinkle
Dorret I. Boomsma
Meher Preethi Boorgula
Donald W. Bowden
Jennifer A. Brody
Brian E. Cade
Daniel I. Chasman
Sameer Chavan
Yii-Der Ida Chen
Zhengming Chen
Iona Cheng
Michael H. Cho
Hélène Choquet
John W. Cole
Marilyn C. Cornelis
Francesco Cucca
Joanne E. Curran
Mariza de Andrade
Danielle M. Dick
Anna R. Docherty
Ravindranath Duggirala
Charles B. Eaton
Marissa A. Ehringer
Tõnu Esko
Jessica D. Faul
Lilian Fernandes Silva
Edoardo Fiorillo
Myriam Fornage
Barry I. Freedman
Maiken E. Gabrielsen
Melanie E. Garrett
Sina A. Gharib
Christian Gieger
Nathan Gillespie
David C. Glahn
Scott D. Gordon
Charles C. Gu
Dongfeng Gu
Daniel F. Gudbjartsson
Xiuqing Guo
Jeffrey Haessler
Michael E. Hall
Toomas Haller
Kathleen Mullan Harris
Jiang He
Pamela Herd
John K. Hewitt
Ian Hickie
Bertha Hidalgo
John E. Hokanson
Christian Hopfer
JoukeJan Hottenga
Lifang Hou
Hongyan Huang
Yi-Jen Hung
David J. Hunter
Kristian Hveem
Shih-Jen Hwang
Chii-Min Hwu
William Iacono
Marguerite R. Irvin
Yon Ho Jee
Eric O. Johnson
Yoonjung Y. Joo
Eric Jorgenson
Anne E. Justice
Yoichiro Kamatani
Robert C. Kaplan
Jaakko Kaprio
Sharon L. R. Kardia
View author publications
You can also search for this author in PubMed Google Scholar
Matthew C. Keller
Tanika N. Kelly
Charles Kooperberg
Tellervo Korhonen
Peter Kraft
Kenneth Krauter
Johanna Kuusisto
Markku Laakso
Jessica Lasky-Su
Wen-Jane Lee
James J. Lee
Daniel Levy
Liming Li
Kevin Li
Yuqing Li
Kuang Lin
Penelope A. Lind
Chunyu Liu
Donald M. Lloyd-Jones
Sharon M. Lutz
Jiantao Ma
Reedik Mägi
Ani Manichaikul
Nicholas G. Martin
Ravi Mathur
Nana Matoba
Patrick F. McArdle
Matt McGue
Matthew B. McQueen
Sarah E. Medland
Andres Metspalu
Deborah A. Meyers
Iona Y. Millwood
Braxton D. Mitchell
Karen L. Mohlke
Matthew Moll
May E. Montasser
Alanna C. Morrison
Antonella Mulas
Jonas B. Nielsen
Kari E. North
Elizabeth C. Oelsner
Yukinori Okada
Valeria Orrù
Nicholette D. Palmer
Teemu Palviainen
Anita Pandit
S. Lani Park
Ulrike Peters
Annette Peters
Patricia A. Peyser
Tinca J. C. Polderman
Nicholas Rafaels
Susan Redline
Robert M. Reed
Alex P. Reiner
John P. Rice
Stephen S. Rich
Nicole E. Richmond
Carol Roan
Jerome I. Rotter
Michael N. Rueschman
Valgerdur Runarsdottir
Nancy L. Saccone
David A. Schwartz
Aladdin H. Shadyab
Jingchunzi Shi
Suyash S. Shringarpure
Kamil Sicinski
Anne Heidi Skogholt
Jennifer A. Smith
Nicholas L. Smith
Nona Sotoodehnia
Michael C. Stallings
Hreinn Stefansson
Kari Stefansson
Jerry A. Stitzel
Xiao Sun
Moin Syed
Ruth Tal-Singer
Amy E. Taylor
Kent D. Taylor
Marilyn J. Telen
Khanh K. Thai
Hemant Tiwari
Constance Turman
Thorarinn Tyrfingsson
Tamara L. Wall
Robin G. Walters
David R. Weir
Scott T. Weiss
Wendy B. White
John B. Whitfield
Kerri L. Wiggins
Gonneke Willemsen
Cristen J. Willer
Bendik S. Winsvold
Huichun Xu
Lisa R. Yanek
Jie Yin
Kristin L. Young
Kendra A. Young
Bing Yu
Wei Zhao
Wei Zhou
Sebastian Zöllner
Luisa Zuccolo
Chiara Batini
Andrew W. Bergen
Laura J. Bierut
Sean P. David
Sarah A. Gagliano Taliun
Dana B. Hancock
Bibo Jiang
Marcus R. Munafò
Thorgeir E. Thorgeirsson
Dajiang J. Liu
Scott Vrieze

Consortia

23andMe Research Team

Jingchunzi Shi & Suyash S. Shringarpure

The Biobank Japan Project

Masato Akiyama, Yoichiro Kamatani, Nana Matoba & Yukinori Okada

Contributions

D.J.L. and S.V. designed, led and oversaw the study. G.R.B.S. and X.W. were the lead analysts for the study, and they were assisted by D.J.L., S.V., F. Chen, S.-K.J., M. Liu and C.W. Phenotype definitions were developed by L.J.B., M.C.C., J. Kaprio, E.J., D.J.L., M. McGue, M.R.M., S.V. and L.Z. Software development was carried out by X.W., D.J.L., F. Chen and C.W. Multi-ancestry meta-analyses were performed by X.W. Ancestry-stratified meta-analyses were performed by G.R.B.S. and M. Liu. Conditional analyses were performed by X.W. and G.R.B.S. Fine-mapping and allelic heterogeneity analyses were performed by X.W. and G.R.B.S. Replicability analyses were performed by C.W., S.-K.J. and G.R.B.S. Multi-ancestry TWAS were performed by F. Chen. Heritability and genetic correlation analyses were performed by S.-K.J. Polygenic scoring analyses were performed by G.R.B.S. Bioinformatics analyses were performed and interpreted by F. Chen, S.-K.J., G.R.B.S., S.V. and J.A. Stitzel. Figures were created by M. Liu, G.R.B.S., S.-K.J. and S.V. M. Liu and S.V. coordinated among participating cohorts. M.A.E. and M.C.K. helped with data access. G.R.B.S. coordinated authorship and acknowledgement details. C.B., A.W.B., L.B., S.P.D., S.A.G.T., D.B.H., M.R.M. and T.E.T. provided helpful advice and feedback on study design and the manuscript. All authors contributed to and critically reviewed the manuscript. G.R.B.S., X.W., S.-K.J., F. Chen, C.W., D.J.L. and S.V. made major contributions to the writing and editing.

Corresponding authors

Correspondence to Dajiang J. Liu or Scott Vrieze.

Ethics declarations

Competing interests

The spouse of N.L. Saccone is listed as an inventor on issued U.S. patent 8080371 ‘Markers for addiction’, covering the use of certain single-nucleotide polymorphisms in determining the diagnosis, prognosis and treatment of addiction. M.H.C. has received grant funding from GSK and Bayer, and speaking or consulting fees from AstraZeneca, Illumina and Genentech. R.T.-S. is a former employee and current shareholder of GSK and is currently a non-executive member of the ENA Respiratory board of directors. She reports personal fees from Teva, Immunomet, Vocalis Health and ENA Respiratory (until January 2021). D.A.S. is the founder and chief scientific officer of Eleven P15, a company focused on the early diagnosis and treatment of pulmonary fibrosis. J.B.N. and E.J. are employed by Regeneron Pharmaceuticals, Inc. The spouse of C.J.W. is employed by Regeneron Pharmaceuticals, Inc. L.J.B. is listed as an inventor on issued U.S. patent 8080371 ‘Markers for addiction’, covering the use of certain single-nucleotide polymorphisms in determining the diagnosis, prognosis and treatment of addiction. The 23andMe Research Team, including J.S. and S.S.S., are employees of 23andMe, Inc., and hold stock and/or stock options in 23andMe. T.E.T., D.F.G., H.S., G.B. and K. Stefansson are employees of deCODE genetics/AMGEN. M. Moll received grant support from Bayer. A.W.B. is listed as a co-inventor on a U.S. patent application ‘Biosignature discovery for substance use disorder using statistical learning’ assigned to BioRealm, LLC, and serves as a scientific advisor and consultant to BioRealm, LLC. All other authors declare no competing interests.

Peer review

Peer review information

Nature thanks David Balding, Ditte Demontis, Eske Derks and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Ancestry space of studies contributing to meta-analysis (panel a), versus individuals from TOPMed and 1000 Genomes (panel b).

The meta-regression within the MEMO model requires specification of ancestry clines. To ensure consistency in the meaning of ancestry clines across all five MEMO analyses (one for each phenotype) we created a single multidimensional scaling solution based on allele frequencies from all phenotypes in all participating cohorts. These solutions are plotted in panel a (circles correspond to TOPMed cohorts, squares are all other cohorts which used imputed microarray genotypes, and triangles are 1000 Genomes ancestry groups). Colors of points correspond to the primary assigned ancestry of each cohort (studies with < 90% of individuals coming from a single ancestry group are shown in grey). Panel b shows projection of principal components (after OADP transformation) of TOPMed individuals onto PCs of 1000 Genomes individuals, in colored triangles. Each 1000 Genomes individual is colored by their known ancestry. This PC information was used in assigning ancestry to TOPMed individuals for the purpose of reference panel creation (individuals of South Asian ancestry were not included in analyses). The PCs in panel b were reordered or reversed in some cases to align with panel a. These transformations are noted in the axis labels.

Extended Data Fig. 2 Multi-ancestry meta-analysis Manhattan plots.

Black horizontal line corresponds to P = 5 × 10⁻⁹, the GWAS significance threshold used for all analyses. Note that some y-axis scales are discontinuous to better illustrate variants with very small P-values (e.g., the Drinks per Week y-axis is cut at 30 with a maximum value of 307.7, denoting a P-value of 1.9 × 10⁻³⁰⁸). All P-values are from two-sided statistical tests.
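The y-axis of a Manhattan plot is −log₁₀(P), so the quoted axis values can be checked with simple arithmetic. The following is a minimal Python sketch, not code from the study; the helper name `neg_log10_p` is an illustrative assumption. It works from the mantissa and exponent separately so that extremely small P-values do not depend on floating-point range.

```python
import math

def neg_log10_p(mantissa: float, exponent: int) -> float:
    """Return -log10(mantissa * 10**exponent) without forming the tiny product."""
    return -(math.log10(mantissa) + exponent)

# Genome-wide significance threshold quoted in the caption: P = 5e-9
threshold = neg_log10_p(5.0, -9)

# Smallest P-value quoted in the caption: P = 1.9e-308
peak = neg_log10_p(1.9, -308)

print(round(threshold, 2), round(peak, 1))
```

Under these assumptions the threshold line sits at roughly 8.3 on the y-axis, and the smallest quoted P-value maps to roughly 307.7, which matches the maximum value given in the caption.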

Extended Data Fig. 3 Tissue expression and brain cell type enrichment in high priority genes.

Panel a shows tissue expression enrichment in ‘high priority’ genes. We define high priority genes here as those located nearest to the variants in fine-mapped credible intervals containing fewer than five variants. These genes were compared to ‘control’ genes identified in the same way, but from variants in credible intervals with PIP < 0.01 from the trans-ancestry fine-mapping. The x-axis denotes GTEx tissue types. The y-axis represents relative risk estimates comparing high priority to control genes. Panel b shows similar relative risk comparisons with 39 brain cell types. Data are presented as relative risk values with error bars denoting bootstrapped 95% confidence intervals. Further details on estimating relative risk are included in the Supplementary Note section ‘Functional enrichment’.
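The exact estimator is described in the Supplementary Note and is not reproduced here. As a rough illustration only, the Python sketch below assumes that relative risk is the ratio of two proportions (the share of high priority genes flagged for a tissue divided by the share of control genes flagged) and that the interval is a percentile bootstrap over genes. The function names and the toy data are hypothetical.

```python
import random

def relative_risk(high_flags, control_flags):
    """Ratio of the proportion of flagged genes in the two gene sets."""
    p_high = sum(high_flags) / len(high_flags)
    p_ctrl = sum(control_flags) / len(control_flags)
    return p_high / p_ctrl

def bootstrap_ci(high_flags, control_flags, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval, resampling each gene set with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        high = rng.choices(high_flags, k=len(high_flags))
        ctrl = rng.choices(control_flags, k=len(control_flags))
        if sum(ctrl) == 0:  # ratio undefined for this replicate; skip it
            continue
        estimates.append(relative_risk(high, ctrl))
    estimates.sort()
    lo = estimates[int((alpha / 2) * len(estimates))]
    hi = estimates[int((1 - alpha / 2) * len(estimates)) - 1]
    return lo, hi

# Toy data (made up): 1 = gene counted as expressed in the tissue, 0 = not.
high_priority = [1] * 30 + [0] * 20   # 60% flagged
control = [1] * 40 + [0] * 60         # 40% flagged

rr = relative_risk(high_priority, control)
ci_low, ci_high = bootstrap_ci(high_priority, control)
print(rr, ci_low, ci_high)
```

An interval that excludes 1 would suggest enrichment (or depletion) of the high priority genes relative to controls for that tissue or cell type, under the assumptions stated above.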

Supplementary information

Supplementary Information

This file contains Supplementary Notes, Supplementary Figs. 1–5, Supplementary References, and Acknowledgements – see Contents page for details.

Reporting Summary

Supplementary Tables

Supplementary Tables 1–12.

Peer Review File

Rights and permissions

Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Saunders, G.R.B., Wang, X., Chen, F. et al. Genetic diversity fuels gene discovery for tobacco and alcohol use. Nature 612, 720–724 (2022). https://doi.org/10.1038/s41586-022-05477-4

Received: 09 March 2022
Accepted: 25 October 2022
Published: 07 December 2022
Issue Date: 22 December 2022
DOI: https://doi.org/10.1038/s41586-022-05477-4
