Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations

Daya, Michelle; Rafaels, Nicholas; Brunetti, Tonya M.; Chavan, Sameer; Levin, Albert M.; Shetty, Aniket; Gignoux, Christopher R.; Boorgula, Meher Preethi; Wojcik, Genevieve; Campbell, Monica; Vergara, Candelaria; Torgerson, Dara G.; Ortega, Victor E.; Doumatey, Ayo; Johnston, Henry Richard; Acevedo, Nathalie; Araujo, Maria Ilma; Avila, Pedro C.; Belbin, Gillian; Bleecker, Eugene; Bustamante, Carlos; Caraballo, Luis; Cruz, Alvaro; Dunston, Georgia M.; Eng, Celeste; Faruque, Mezbah U.; Ferguson, Trevor S.; Figueiredo, Camila; Ford, Jean G.; Gan, Weiniu; Gourraud, Pierre-Antoine; Hansel, Nadia N.; Hernandez, Ryan D.; Herrera-Paz, Edwin Francisco; Jiménez, Silvia; Kenny, Eimear E.; Knight-Madden, Jennifer; Kumar, Rajesh; Lange, Leslie A.; Lange, Ethan M.; Lizee, Antoine; Maul, Pissamai; Maul, Trevor; Mayorga, Alvaro; Meyers, Deborah; Nicolae, Dan L.; O’Connor, Timothy D.; Oliveira, Ricardo Riccio; Olopade, Christopher O.; Olopade, Olufunmilayo; Qin, Zhaohui S.; Rotimi, Charles; Vince, Nicolas; Watson, Harold; Wilks, Rainford J.; Wilson, James G.; Salzberg, Steven; Ober, Carole; Burchard, Esteban G.; Williams, L. Keoki; Beaty, Terri H.; Taub, Margaret A.; Ruczinski, Ingo; Mathias, Rasika A.; Barnes, Kathleen C.

doi:10.1038/s41467-019-08469-7

Download PDF

Article
Open access
Published: 20 February 2019

Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations

Michelle Daya¹,
Nicholas Rafaels¹,
Tonya M. Brunetti¹,
Sameer Chavan¹,
Albert M. Levin²,
Aniket Shetty¹,
Christopher R. Gignoux¹,
Meher Preethi Boorgula¹,
Genevieve Wojcik ORCID: orcid.org/0000-0001-7206-8088³,
Monica Campbell¹,
Candelaria Vergara ORCID: orcid.org/0000-0003-3297-8057⁴,
Dara G. Torgerson⁵,
Victor E. Ortega⁶,
Ayo Doumatey⁷,
Henry Richard Johnston⁸,
Nathalie Acevedo⁹,
Maria Ilma Araujo¹⁰,
Pedro C. Avila ORCID: orcid.org/0000-0002-1358-7256¹¹,
Gillian Belbin¹²,
Eugene Bleecker¹³,
Carlos Bustamante³,
Luis Caraballo⁹,
Alvaro Cruz¹⁴,
Georgia M. Dunston¹⁵,
Celeste Eng⁵,
Mezbah U. Faruque¹⁶,
Trevor S. Ferguson ORCID: orcid.org/0000-0002-2393-1452¹⁷,
Camila Figueiredo¹⁸,
Jean G. Ford¹⁹,
Weiniu Gan²⁰,
Pierre-Antoine Gourraud²¹,
Nadia N. Hansel⁴,
Ryan D. Hernandez²²,
Edwin Francisco Herrera-Paz ORCID: orcid.org/0000-0001-9437-6620^23,24,
Silvia Jiménez⁹,
Eimear E. Kenny¹²,
Jennifer Knight-Madden¹⁷,
Rajesh Kumar²⁵,
Leslie A. Lange¹,
Ethan M. Lange¹,
Antoine Lizee²¹,
Pissamai Maul²⁶,
Trevor Maul²⁶,
Alvaro Mayorga²⁷,
Deborah Meyers¹³,
Dan L. Nicolae²⁸,
Timothy D. O’Connor²⁹,
Ricardo Riccio Oliveira³⁰,
Christopher O. Olopade³¹,
Olufunmilayo Olopade²⁸,
Zhaohui S. Qin ORCID: orcid.org/0000-0002-1583-146X³²,
Charles Rotimi ORCID: orcid.org/0000-0001-5759-053X⁷,
Nicolas Vince ORCID: orcid.org/0000-0002-3767-6210²¹,
Harold Watson³³,
Rainford J. Wilks¹⁷,
James G. Wilson³⁴,
Steven Salzberg ORCID: orcid.org/0000-0002-8859-7432³⁵,
Carole Ober³⁶,
Esteban G. Burchard²²,
L. Keoki Williams³⁷,
Terri H. Beaty ORCID: orcid.org/0000-0002-7644-1705³⁸,
Margaret A. Taub³⁹,
Ingo Ruczinski³⁹,
Rasika A. Mathias⁴^na1,
Kathleen C. Barnes¹^na1 &
CAAPA

Nature Communications volume 10, Article number: 880 (2019) Cite this article

10k Accesses
61 Citations
87 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 04 September 2019

This article has been updated

Abstract

Asthma is a complex disease with striking disparities across racial and ethnic groups. Despite its relatively high burden, representation of individuals of African ancestry in asthma genome-wide association studies (GWAS) has been inadequate, and true associations in these underrepresented minority groups have been inconclusive. We report the results of a genome-wide meta-analysis from the Consortium on Asthma among African Ancestry Populations (CAAPA; 7009 asthma cases, 7645 controls). We find strong evidence for association at four previously reported asthma loci whose discovery was driven largely by non-African populations, including the chromosome 17q12–q21 locus and the chr12q13 region, a novel (and not previously replicated) asthma locus recently identified by the Trans-National Asthma Genetic Consortium (TAGC). An additional seven loci reported by TAGC show marginal evidence for association in CAAPA. We also identify two novel loci (8p23 and 8q24) that may be specific to asthma risk in African ancestry populations.

Targeted analysis of genomic regions enriched in African ancestry reveals novel classical HLA alleles associated with asthma in Southwestern Europeans

Article Open access 08 December 2021

A whole genome sequencing study of moderate to severe asthma identifies a lung function locus associated with asthma risk

Article Open access 02 April 2022

Mixed-model admixture mapping identifies smoking-dependent loci of lung function in African Americans

Article 13 December 2019

Introduction

Asthma is a complex disease where the interplay between genetic factors and environmental exposures controls susceptibility and disease progression. In the U.S., ethnic minorities are disproportionally affected by asthma. For example, African Americans and Puerto Ricans have higher asthma-related morbidity and mortality rates compared to European Americans^1,2,3. In addition to environmental, cultural, and socio-economic risk factors, genetic factors, possibly from a common background ancestry, likely underlie some of these disparities in the health burden of asthma in the U.S. Despite the relatively high burden of disease, representation of African ancestry populations in asthma genome-wide association studies (GWAS) has been limited, and in GWAS’s performed to date including individuals of African ancestry, the samples have been modest and underpowered to detect true associations. For example, the largest asthma GWAS focused solely on African ancestry populations included only 819 asthma cases⁴. Recently, the Trans-National Asthma Genetic Consortium (TAGC) reported 18 asthma-associated loci based on a meta-analysis of 142,486 subjects, but only 2149 cases and 6055 controls included in this study were of African ancestry. Only two genome-wide significant associations^5,6 have been reported for asthma from GWAS conducted in African ancestry populations to date^4,5,6,7,8. The discovery of genetic risk factors for asthma in African ancestry populations has been further hampered by lack of representation of African ancestry in imputation reference panels, and legacy commercial genotyping arrays that have until very recently not provided adequate coverage of linkage disequilibrium (LD) patterns in African ancestry populations.

To address these disparities in asthma genetics research and the paucity of information on African genetic diversity, we previously established the Consortium on Asthma among African ancestry Populations in the Americas (CAAPA)⁹. Because of the lack of adequate representation of African ancestry in imputation reference panels, we first performed whole-genome sequencing (WGS) of samples collected from 880 individuals who self-reported African ancestry from 19 North, Central and South American and Caribbean populations (446 individuals from nine African American populations, 43 individuals from Central America, 121 individuals from three South American populations, and 197 individuals from four Caribbean populations), as well as individuals from continental West Africa (45 Yoruba-speaking individuals from Ibadan, Nigeria and 28 individuals from Gabon). These whole-genome sequences were made publicly available through dbGAP (accession code phs001123.v1.p1) and were incorporated into the reference panel on the Michigan imputation server (a free genotype imputation service, https://imputationserver.sph.umich.edu).

Previously we performed coverage analysis of the novel variation identified in the CAAPA sequence data, and found only 69% of common SNP variants and 41% of low-frequency SNP variants identified by CAAPA can be tagged by traditional GWAS arrays at r² ≥ 0.8¹⁰. In addition, lack of coverage of low frequency variants (minor allele frequency [MAF] between 0.01–0.05) in GWAS arrays negatively impacts the imputation of low frequency variants. To address these issues, we used the CAAPA sequence data to develop the African Diaspora Power Chip (ADPC) in partnership with Illumina, Inc., a gene-centric SNP genotyping array designed to complement commercially available genome-wide chips, thereby improving tagging and coverage of African ancestry genetic variation¹⁰. The array included ~495,000 SNPs, with a MAF enriched for low frequency variants. Subsequently, the content of the ADPC was incorporated into Illumina’s Multi-Ethnic Global Array (MEGA)¹⁰.

Using the ADPC, we genotyped CAAPA participants from nine studies (seven African American studies, one African Caribbean [Barbados], and one South American [Puerto Rico]). We combined the ADPC data with existing commercial genome-wide genotype data, and imputed additional genotypes using the CAAPA reference panel. Subsequently we used the MEGA to genotype additional African ancestry asthma studies with no previously existing genome-wide genotype data (one African American, three South American, and two Caribbean studies), and similarly imputed genotypes on these subjects using the CAAPA reference panel. Sample populations, their ascertainment, and clinical characteristics are described in detail in the Supplementary Note 1, Table 1 and Supplementary Table 1. We then used these data to perform a GWAS of asthma in individuals of African ancestry (7009 asthmatic cases and 7645 controls). We also performed admixture mapping, a technique that leverages local ancestry to identify regions of the genome where ancestry from a particular ancestral population is inherited more frequently among affected compared to unaffected individuals.

Table 1 Studies included in the CAAPA association analysis

Full size table

Our GWAS results recapitulate 11 of the 18 loci recently reported by TAGC, including the chromosome 9p24 (IL33), 15q22 (RORA), and 17q12–q21 loci, as well as a locus on chromosome 12q13 (STAT6) reported as novel but not replicated by TAGC. We identify two loci on chromosome 8 not previously reported as being associated with asthma, and one local ancestry peak on chromosome 6q22 is genome-wide significant in our admixture mapping study. We speculate that the common asthma-associated variants on chromosome 8p23 implicate the ARHGEF10 or MYOM2 genes, while a low frequency protective variant on chromosome 8q24 is intronic to TATDN1, a gene with increased expression in human airway smooth muscle cells stimulated with interleukin. The TCF21 gene, which has previously been shown to be differentially expressed in bronchial biopsies of asthmatics compared to controls, is a likely candidate in the admixture mapping peak. However, further replication efforts are necessary to provide robust evidence of replication for these chromosome 6 and 8 loci.

Results

Association analysis

Studies included in the asthma association analysis represent a diverse spectrum of African ancestry (Fig. 1a, Supplementary Fig. 1, Supplementary Table 2) with median African ancestry proportions in non-asthmatics as low as 0.17 in subjects from Puerto Rico (GALA II) and as high as 0.90 in subjects from Jamaica and Barbados (JAAS and BAGS). In addition, the studies had different objectives and differed in potentially relevant factors such as age of onset and diagnostic criteria (see Supplementary Note 1 and Supplementary Table 1). For this reason, we performed tests for association separately for each dataset, and then combined the results using MR-MEGA, a novel meta-analysis approach that models allelic effects as a function of axes of genetic variation¹¹. In this way, heterogeneous associations across genetically distinct populations are not penalized, and the degree of heterogeneity due to ancestry, as well as residual effects due to differences in study design, can be assessed. The results of the genome-wide meta-analysis are summarized in Fig. 1b and c, and associations with MR-MEGA association p < 10⁻⁶ (the same cut-off employed by the EVE GWAS⁶) are summarized in Table 2 and Supplementary Table 9. There were seven loci with associations smaller than this threshold. Associations at two loci were genome-wide significant (p < 5 × 10⁻⁸), including a locus at chromosome 8p23 not previously reported by any asthma GWAS^7,12, and the chromosome 17q12–q21 locus, which is regarded as one of the most consistent asthma association findings to date¹³. Two of the loci with p < 10⁻⁶ (but not attaining conventional genome-wide significance) were reported recently by TAGC in the multi-ethnic meta-analysis of 23,948 asthmatics and 118,538 controls⁶. The remaining three loci involved low-frequency SNPs (MAF between 0.01–0.05), and the accuracy of SNP imputation could only be verified for one locus on chromosome 8q24 (Supplementary Table 10).

Table 2 Lead SNP summary

Full size table

Associations novel to CAAPA

Several SNPs intronic to a gene encoding a long non-coding RNA on chromosome 8p23 have MR-MEGA association p < 10⁻⁶, and two of these associations were genome-wide significant. While this association was observed in multiple African American samples, the strongest effects were observed in three non-U.S. studies from Barbados (BAGS), Cartagena, Colombia (PGCA), and Puerto Rico (GALA II; Supplementary Fig. 15). While the associated SNPs in this region do not overlap with any expression quantitative trait locis (eQTLs) in the publicly available databases we mined, long-range chromatin interaction and expression data in relevant tissues (lymphoblastoid cells, fetal lung fibroblast cells, and lung) implicate two different genes, ARHGEF10 and MYOM2, ~600–800 KB downstream from the most significant SNP rs13277810 that potentially explain these observed associations (Supplementary Note 13, Supplementary Fig. 19)^14,15,16. Expression of ARHGEF10 has been associated with exacerbations in chronic obstructive pulmonary disease¹⁷, and this gene may also be a target of the well-known type-2 inflammation cytokine interleukin 33¹⁸, while genetic variants in MYOM2 are predictive of lung function in an isolated European ancestry (Hutterite) population¹⁹. We tested whether SNP rs13277810 is associated with asthma using African American genetic data from three cardiovascular studies as well as African American and Hispanic genetic data from the BioMe biobank. Although there was strong evidence for replication of this SNP association in BioMe Hispanics (SNP association Z-score p = 7.15 × 10⁻⁴, with consistent effect direction), the SNP was not associated with asthma in any of the other studies, and the TAGC asthma GWAS summary statistics also did not support this association in Europeans (Table 3). Most CAAPA asthmatics had childhood onset asthma (Supplementary Table 1), which is likely not the case for the cardiovascular disease studies. Roughly a third of BioMe Hispanics are Puerto Rican (Supplementary Note 2), and Puerto Rican asthmatics are likely to have childhood onset asthma²⁰. We therefore re-tested this association in CAAPA, and compared results when including versus excluding adult onset asthmatics. Because the effect size of the association decreased when adult onset asthmatics were excluded (Supplementary Table 17), we could not confirm that age of asthma onset accounts for the lack of replication. Despite the potential biological plausibility of this chromosome 8p23 association, it may be a false positive. Alternatively, lack of replication could be due to differences in study design (ascertainment of subjects and asthma case-control classification), stronger effects in non-U.S. African ancestry populations, and the small number of asthmatics available in the replication data.

Table 3 Summary of tests for replication for two novel loci on chromosome 8 with p-values < 10⁻⁶ in the CAAPA meta-analysis

Full size table

The remaining three loci yielding MR-MEGA association p < 10⁻⁶ have not previously been reported by any asthma GWAS, and the identified SNPs were all low frequency variants (0.01 < MAF < 0.05). According to allele frequencies from the 1000 Genomes Project (TGP), these SNPs are present in African but not European populations. The imputation quality of the most significant SNPs at these regions is only moderate (median RSQ statistics of 0.79, 0.61, and 0.77 for rs114647118 [8q24], rs73952947 [18q12], and rs73595000 [19q13], respectively, see Supplementary Data 1); therefore we performed TaqMan genotyping to confirm results in the JAAS (Jamaica) and BAGS (Barbados) samples (Supplementary Table 10). The veracity of imputation and asthma association could only be verified for rs114647118 on chromosome 8q24 (Supplementary Tables 10 and 11). Carriers of the minor allele at SNP rs114647118 were less likely to have asthma, and this pattern was observed in all CAAPA studies (Supplementary Fig. 15). SNP rs114647118 is intronic to TATDN1, a gene with increased expression in human airway smooth muscle cells stimulated with interleukin 17A²¹. However, this association did not replicate in the three African American cardiovascular studies (Table 3). Unfortunately, data were not available in TAGC or BioMe to test for replication.

Comparison to previous asthma GWAS

We compared results from the recent TAGC multi-ethnic meta-analysis, the largest and most definitive GWAS performed in observed asthma cases and controls to date, with the CAAPA meta-analysis⁷. TAGC reported 18 loci associated with asthma, categorizing nine loci as known asthma susceptibility genes (known), five as new asthma loci (new), two as new signals at loci previously associated with asthma in ancestry-specific populations (ancestry-specific), plus two as asthma signals previously reported for asthma with hay fever (asthma + hay fever) (Table 2 and Supplementary Table 12)⁷. Because some of the CAAPA studies were included in the TAGC asthma meta-analysis (of the 2149 asthmatics and 6055 non-asthmatics in TAGC, 1601 asthmatics and 2375 non-asthmatics were from studies included in the CAAPA discovery, and 548 asthmatics and 3680 non-asthmatics were from studies included in the CAAPA replication), and to contrast African and European asthma susceptibility loci, we compared CAAPA associations with those observed in TAGC Europeans. Only two of the TAGC loci, on chromosome 12q13 and 5q31.1, were not genome-wide significant in TAGC Europeans as judged by their random effect p-values (p = 1.6 × 10⁻⁷ and p = 1.6 × 10⁻⁶, respectively), although the fixed-effect p-values for these associations were genome-wide significant (p = 5.5 × 10⁻⁹ and p = 8.5 × 10⁻¹⁰, respectively). Both these loci were not reported by any prior GWAS of asthma, and their p-values in the other ethnic groups represented in TAGC were marginal (p = 0.05, 0.22, and 0.60 for the chromosome 12q13 TAGC lead SNP and p = 0.04, 5.5 × 10⁻³, and 0.27 for the chromosome 5q31.1 TAGC lead SNP in Africans, Japanese, and Latinos, respectively). We therefore conclude the genome-wide significance of these loci in the TAGC multi-ethnic meta-analysis is largely due to their strong associations observed in subjects of European ancestry.

TAGC summary statistics were merged with the CAAPA meta-analysis results, and associations of 810 SNPs with significant fixed-effect p-values in TAGC Europeans, Bonferroni corrected for the number of overlapping SNPs in the merged dataset, were assessed for replication in the CAAPA meta-analysis. Three known asthma loci replicated in CAAPA, after a Bonferroni correction for 810 association tests: chromosomes 9p24 (RANBP6, IL33), 15q22 (RORA,NARG2,VPS13C), and 17q12–q21 (Supplementary Table 13).

The TAGC lead SNP rs167769 on chromosome 12q13 is intronic to STAT6, a transcription factor that affects Th2 lymphocyte responses mediated by IL-4 and IL-13^7,22. This was a new association reported by TAGC, not previously implicated in any asthma GWAS, although we note this SNP has been reported as a putatively causal SNP discovered by GWAS of lung function^23,24, and a number of linkage studies have pinpointed this chromosomal region in the early days of genome-wide investigations of asthma and atopy^25,26,27,28. In addition, markers in STAT6 have been identified by a number of candidate gene association studies^29,30,31,32. However, prior to the CAAPA analyses, this locus had not been replicated in independent asthma GWAS datasets. The lead SNP rs3122929 in CAAPA at this locus is in strong LD with the TAGC lead SNP rs167769 in Europeans from the TGP (r² = 0.93) and nearly achieves genome-wide significance in CAAPA (MR-MEGA association p = 9.1 × 10⁻⁷). In addition, the association observed in TAGC subjects of African ancestry was marginal (fixed effect p = 0.05). With the increased sample size available through CAAPA, our meta-analysis provides further evidence of the association between the 12q13 region and asthma, confirming its contribution to asthma risk in African ancestry populations.

We also assessed windows ±10 KB from each of the 18 TAGC lead SNPs for replication in CAAPA (±200 KB for the chr17q12–q21 locus, due to the extended LD in this region), as well as 2 SNPs previously reported as achieving genome-wide significance in African ancestry populations^5,6 (Fig. 1c). Considering a Bonferroni-corrected significance threshold for 20 tests (one for each of these prior loci), an additional seven TAGC loci showed evidence of association in the CAAPA meta-analysis (Table 2, loci not in bold font). This includes a novel TAGC locus on chromosome 6p21 implicating human leukocyte antigen genes. In addition, for the known chromosome 5q31 TAGC locus, one of the CAAPA associations passing this significance threshold involved a SNP in strong LD with the TAGC lead SNP in TGP Europeans (rs1295686, r² with TAGC lead SNP rs20541 = 0.96, Supplementary Table 18).

The genome-wide meta-analysis of asthma in multi-ethnic populations previously performed by the EVE consortium reported a genome-wide significant African ancestry-specific association for SNP rs1101999, which is intronic to the PYHIN1 gene⁶. This was the first African ancestry asthma association reported by a GWAS. This SNP has a high minor allele frequency in African populations (0.31 in the TGP phase 3 AFR population, n = 1322), but a very rare or low frequency in other populations (MAF < 0.005 in TGP phase 3 EAS, EUR, and SAS, n = 1008, 1006, and 978, respectively, MAF = 0.04 in AMR, n = 694), and the role this gene may play in asthma remains unclear. SNP rs1101999 was marginally associated with asthma in the CAAPA meta-analysis (MR-MEGA association p = 6.4 × 10⁻³), but the strength of association was much reduced from its original report (Supplementary Fig. 13, Supplementary Tables 12 and 14). Recently, the eMERGE (electronic medical records and genomics) network conducted an asthma GWAS in biobank subjects, and reported one genome-wide significant association in African Americans, SNP rs11788591, intronic to the PTGES gene. This association did not replicate in the CAAPA meta-analysis (MR-MEGA association p = 0.23, Supplementary Table 12, Fig. 1c, Supplementary Fig. 13).

Genome-wide significant associations on chr17q12–q21

Associations between SNPs on chr17q12–21 and risk of asthma showed evidence of ancestry heterogeneity (46 of the 54 SNPs with association p < 10⁻⁶ had significant heterogeneity, p < 0.05). In general, the magnitude of the effect size increases as the average proportion of European ancestry in the study increased (Supplementary Table 9; most of the corresponding ß0 values were close to zero, whereas ß1, which captures the effect along the axis of genetic variation separating African and European ancestry, showed an increase in effect size magnitude as European ancestry increased). We also observe that the higher the average proportion of African ancestry, the smaller the magnitude of the effect size (Fig. 2b, forest plot of the lead association ordered by average African ancestry). An exception was observed among Honduran subjects who self-reported as Garifuna (HONDAS). HONDAS has a large average African (77%), a higher Native American (20%), and a very small European component (3%). Interestingly, the lead SNP in CAAPA was the same SNP reported by a meta-analysis of asthma in Puerto Rican children³³, distinct from the lead SNPs reported by the multi-ethnic EVE and TAGC GWAS^6,7. The most significant and largest effect size magnitudes were observed for the studies that had higher European and Native American components; however, we speculate that this may reflect risk for asthma inherited from a Native American genetic background given the minimal European component in the HONDAS population. We note this trend is not as strong in PGCA, the CAAPA study with the highest proportion of Native American ancestry (29%), which may be due in part to the heterogeneous patterns of LD in this chromosomal region.

In addition to the smaller effect size magnitude in studies with high African ancestry, we also observed much weaker associations (Supplementary Fig. 11 vs. Supplementary Fig. 12), and no chr17q12–q21 associations with p < 10⁻⁶ were observed in an inverse-variance meta-analysis of African American samples (Supplementary Table 4), despite a relatively large sample size of 3651 cases and 4222 controls. These observations are consistent with a recent report by Stein et al. summarizing the chromosome 17q12–q21 locus, which included an analysis and discussion of the relatively weak associations observed at this locus among African Americans¹³. They posited that the reduced strength of association may be due to an overall lower MAF spectrum in African Americans in this region (which reduces statistical power to detect association), breakdown of LD on African haplotypes, and different asthma endotypes (viral exposures) in children. Based on the reduced effect size magnitude observed in studies with higher average African ancestry, we additionally posit a smaller effect size magnitude is observed on African ancestry haplotypes (which could in turn be due to breakdown of LD and reduced correlation between tagging and causal variants). To investigate this, we extracted 17 putatively causal SNPs reported in the Stein et al. review, plus an additional five SNPs from the CAAPA meta-analysis with MR-MEGA association p < 10⁻⁶ and r² < 0.8 between all 17 SNPs in TGP European and African populations. We then stratified subjects for whom local ancestry estimates were available based on the number of copies of African ancestry at each of these SNPs, and tested for association between each SNP and asthma separately for each local ancestry group (0, 1, or 2 copies of African ancestry, Supplementary Data 2). Figure 2c shows a trend of decreasing effect size magnitude as the number of copies of African ancestry increased, for most of these 22 candidate SNPs, suggesting (but not conclusively proving) smaller effect size magnitudes on African ancestry haplotypes.

Consistent with Stein et al., the degree to which LD breaks down in CAAPA samples increased with average African ancestry (Fig. 2a and Supplementary Fig. 18). One example of how this breakdown could affect association is the rs12936231–rs4065275 haplotype and its association with ORMDL3 expression. Specifically, Stein et al. notes that the rs12936231-C and rs4065275-G alleles were associated with high expression of ORMDL3 in peripheral blood cells from non-African populations, but the rs4065275-G allele was not associated with expression of ORMDL3 in Yoruban (African) lymphoblastoid cell lines. This is possibly due to the rs12936231-C and rs4065275-G haplotype almost always being present in this combination on non-African haplotypes, while the rs12936231-G and rs4065275-G haplotype in non-Africans was rare (3%). In contrast, the rs12936231-G and rs4065275-G haplotype is common in Africans (19%)¹³. We examined the rs12936231–rs4065275 haplotype for association with asthma in the three local ancestry groups (0, 1, or 2 copies of African ancestry), but no significant associations were observed in any of the groups (Supplementary Table 16).

Finally, we note genome-wide significant SNPs at the chromosome 17q12–q21 region in CAAPA ranged between positions 37,908,867–38,089,717 (Fig. 2a), which included the ORMDL3/GSDMB haplotype block (hg19, see Supplementary Table 9), at least 32 kb from the most significant SNP (at position 37,876,835) in TAGC. However, in the TAGC pediatric sub-group analysis the strongest association was observed 3.6 kb proximal to GSDMB, so we speculate the relatively strong association in CAAPA at the ORMDL3/GSDMB region reflects the large proportion of childhood onset asthmatics included in CAAPA (Supplementary Table 1). The ORMDL3/GSDMB region is strongly associated with childhood onset asthma^13,34, and to rule out that the weaker associations in high African ancestry populations may be due to the inclusion of adult onset asthmatics in some of the CAAPA studies, we re-tested the association between the 22 candidate SNPs and asthma, using only those studies where age of onset was available, or that were pediatric studies. We compared the association results when including versus excluding adult onset asthmatics, and the strength of the associations remained marginal when adult onset asthmatics were excluded (Supplementary Table 15).

Associations with total serum IgE

We also examined whether genetic associations with asthma overlap with atopy by testing lead SNPs from Table 2 for association with total serum IgE (tIgE) using 4132 subjects for whom this phenotype was available (CAAPA lead SNPs and lead SNPs from TAGC that replicated in CAAPA; associations were tested separately in cases and controls and then combined using meta-analysis). Four of the SNPs correlated with levels of tIgE in asthmatics (with the asthma risk allele associated with increased levels of tIgE, Supplementary Table 19), suggesting that these asthma risk alleles may lead to an increased Th2 immune response. This includes SNP rs1420101 in the IL1R1 locus (a gene that is known to correlate with eosinophilia³⁵), SNP rs10519067 intronic to the RORA gene (encoding a transcription factor that regulates the growth of group 2 innate lymphoids, a key cell type in the memory Th2 cell response³⁶), and SNPs rs907092 and rs2952156 in the chr17q12–21 locus. These results are consistent with previous studies that have shown both shared and unique associations between these phenotypes^{37,38,39,40,41}.

Admixture mapping

In addition to performing an asthma GWAS, we also leveraged local ancestry to identify regions of the genome where ancestry from a particular ancestral population was inherited more frequently in affected versus unaffected individuals. This technique, called admixture mapping, can be used as a complementary approach to association mapping in admixed populations to uncover associations not detectable by SNP tests alone⁴². Because the genetic structure of the Barbados population is similar to that of African Americans (Fig. 1a and Supplementary Fig. 1), and because of overlap in genotype array coverage, we combined the African American studies genotyped on the ADPC and the Barbados study (BAGS) and used this combined dataset for admixture mapping discovery. CAAPA studies genotyped on the MEGA, additional BAGS subjects genotyped on Illumina’s Omni array, and African American subjects from BioMe were leveraged to replicate our admixture mapping discoveries. The distribution of p-values from the discovery admixture mapping tests for association is summarized in Supplementary Fig. 20 (local ancestry dose association p-values reported by EMMAX). The QQ plot shows little evidence of systematic test statistic inflation. The deflated inflation factor of 0.90 appears to be largely driven by 0.1 < p < 0.5, whereas the number of p < 0.02 is greater than expected, suggesting association results are enriched with local ancestry segments showing differences in ancestry between cases and controls. Only one segment of local ancestry, ranging from base pair positions 134,149,974–134,300,365 on chromosome 6, crossed the multiple testing threshold (see Methods). This segment falls within a local ancestry peak on chromosome 6q22.31–23.2, with increased African ancestry associated with increased risk of asthma (Supplementary Table 20). Genes falling in this segment include the Transcription Factor 21 (TCF21) and TATA-Box Binding Protein Like 1 (TBPL1) genes. The TCF21 gene has been shown to be differentially expressed in bronchial biopsies of asthmatics compared to controls⁴³. The most significant SNP showing association in this region in the admixture mapping discovery group (rs111966851, Supplementary Fig. 6) has a MAF of 0.308 and 0.006 in Africans and Europeans, respectively (n = 1322 AFR and n = 1006 EUR TGP phase 3), which corroborates the idea that African ancestry in this segment may increase risk for asthma. For replication, local ancestry segments overlapping with the peak segments were tested for association with asthma in CAAPA studies with similar genetic structure to the discovery studies (i.e., African Americans and Jamaicans, see population structure in Fig. 1a and Supplementary Fig. 1), additional BAGS (Barbados) subjects, and African American subjects from BioMe. None of these segments were associated with asthma (Supplementary Table 21), and SNP rs111966851 (the most significant SNP in this region in the admixture mapping discovery) was also not associated with asthma in the CAAPA studies excluded from the admixture mapping discovery (Supplementary Fig. 16).

Discussion

We report a large GWAS of asthma in African ancestry populations; prior studies included only 763–3037 asthmatic subjects^4,5,6,7. Eleven of the 18 loci recently identified in the TAGC meta-analysis show evidence of association in CAAPA, including strong evidence for four different regions. This includes the region around STAT6 on chromosome 12q13, a novel region identified by TAGC and not replicated to date, as well as the well-known chromosome 17q12–21 region, which reached genome-wide significance in our analysis. It has been posited that disparities in asthma susceptibility can partly be explained by genetic risk factors^4,44,45. In recapitulating associations mainly discovered in European ancestry populations (a result that has previously not been well-quantified in the literature), our results suggest that at the very least, common genetic variation may not strongly contribute to asthma disparities. However, our data show the chromosome 17q12–21 associations have smaller effect size magnitudes on African ancestry haplotypes. In addition, we built a genetic risk score for the 18 asthma loci reported by TAGC, and found that although asthmatics had a statistically significant higher risk score compared to controls (Supplementary Fig. 24), the effect was too small to build a predictive risk score for asthma in CAAPA.

In addition to recapitulating asthma genes discovered largely in non-African populations, we identified two loci on chromosome 8 not previously reported by asthma GWAS, and through admixture mapping identified a region on chromosome 6q22. In our inclusion of the largest sample of African ancestry individuals in our discovery GWAS, we were unfortunately limited in sample sizes of African ancestry individuals available for replication with consequent limitations in power. The most significant SNP on chromosome 8p23 reached genome-wide significance and was replicated in Hispanics from BioMe. However, our attempts to replicate this same locus in African Americans were unsuccessful; the number of cases compared to controls was considerably smaller (4–18 times smaller) as the replication studies were not primarily ascertained for asthma. Similarly, we also failed to replicate the low frequency variant on chromosome 8q24 (only 398 African American asthmatics with genetic data were available) and the admixture mapping signal (only 845 cases were available, of which 498 were African American).

The CAAPA meta-analysis includes data from 15 independent studies and is the largest asthma GWAS focused on African ancestry populations to date. Unfortunately, as is the case for other complex diseases for which morbidity disproportionately affects underrepresented and underserved populations^46,47,48, a legacy of underrepresentation or exclusion of minorities from federally-funded studies has rendered comparatively robust non-European datasets rare to nonexistent. A recent analysis of ancestry represented in the GWAS Catalog¹² concluded that non-European, non-Asian groups combined account for less than 4% of individuals included in the catalog⁴⁶. The analysis also found that African ancestry individuals contributed 7% of all catalog associations, despite only comprising 2.4% of the catalog, highlighting the value of GWAS conducted in African ancestry populations for enabling scientific discoveries. Furthermore, the authors stressed the importance of assessing the generalizability of genetic disease associations across populations, and the value that low-LD African ancestry individuals contribute to multi-ethnic fine-mapping of genetic associations. Despite the considerable federal support for the CAAPA initiative, we do recognize that the CAAPA sample size falls considerably short of the recent mega studies comprised of asthma datasets in the hundreds of thousands^7,49. Furthermore, replication had to be sought in studies not primarily ascertained for asthma and with limited sample sizes. Important insights drawn from this study include the demonstration that many of the genetic loci associated with asthma in European ancestry populations may also be at play in African ancestry populations, and a clearer understanding of the LD patterns among African ancestry populations in 17q21. Potentially novel loci discovered by this meta-analysis are as yet not replicated within this study, but warrant follow-up. Importantly, the advent of institutional biobanks with access to multi-ethnic patient populations⁵⁰, as well as efforts by institutions such as the National Institute of Health to reduce health and research disparities⁵¹ promise to greatly expand representation of well-characterized patients of African ancestry in the near future, allowing for robust follow-up of these CAAPA findings. The improved availability of African ancestry whole-genome sequence imputation reference panels available through initiatives such as the NHLBI-supported TOPMed program⁵² should also provide high quality imputation of low and rare frequency variation in African ancestry populations, which will empower future studies. Lastly, we note better availability of other -omics datasets representing diverse ethnicities, such as transcriptomic data in tissue types relevant to asthma, will be needed to enable discoveries by utilizing the next generation of analysis tools^53,54,55.

Methods

Study oversight

NIH guidelines for conducting human genetic research were followed. The Institutional Review Boards (IRB) of Johns Hopkins University (GRAAD, BASS and BAGS), Howard University (CRAD and HUFS), Wake Forest University (SARP), the University of California, San Francisco (coordinating center for the SAGE II and GALA II studies), the Western Institutional Review Board for the recruitment in Puerto Rico (GALA II Puerto Ricans), Children’s Hospital and Research Center Oakland and Kaiser Permanente-Vallejo Medical Center (SAGE II), the University of Chicago (CAG), University of the West Indies, Mona, Jamaica and Cave Hill Campus, Barbados (BAGS), University of Mississippi Medical Center (JHS), Henry Ford Health System (SAPPHIRE), the Universidad Católica de Honduras in San Pedro Sula (HONDAS), Federal University of Bahia (BIAS and ProAR), the University of Cartagena (PGCA), all reviewed and approved this study. All participants provided written informed consent.

Genome-wide ancestry estimation and analysis

Unrelated phase 3 1000 Genomes Project (TGP) subjects of European ancestry (CEU, n = 84) and African ancestry (YRI, n = 84), as well as unrelated Native American (NAT) subjects from Mao et al.⁵⁶ (n = 43), were used as reference panels for the genome-wide ancestry analysis. A combined dataset of SNPs common to the reference panels plus all genotyped SNPs in the CAAPA datasets (with <1% missing genotypes in each dataset) were created, after which a LD pruning step was performed (leaving 20,482 SNPs). (Note for the SAPPHIRE dataset, the combined genome-wide ancestry analysis described here was restricted to the 730 asthmatic cases for which ADPC [African Diaspora Power Chip] data were available.) Principal components were then formed from the genotypes of the CAAPA and reference subjects by the R Bioconductor package GENESIS. GENESIS uses PC-AiR to calculate principal components to account for cryptic and known relatedness between subjects⁵⁷. PC1 distinguished African ancestry from European and Native American ancestry and PC2 distinguished Native American ancestry from European and African ancestry. The results are summarized in Supplementary Fig. 1. The combined dataset was also used to estimate the proportion of genome-wide ancestry deriving from the three source populations represented by the reference panels, for each CAAPA subject. This was done using the software program ADMIXTURE. Because BAGS, HUFS, and BIAS include families, and ADMIXTURE assumes all subjects are unrelated, only the founders in these studies (n = 226, n = 997, and n = 179, respectively) were included for the ADMIXTURE analysis. These results are summarized in Fig. 1a and Supplementary Table 2. Finally, the GENESIS PCA was repeated separately for each of the CAAPA datasets, excluding the reference populations, using a set of LD pruned SNPs, in order to test for differences in ancestry between cases and controls within each dataset. The association between the first 2 PCs and asthma is summarized in Supplementary Table 2. As expected, the first principal component explained most of the variance in the genetic data (Supplementary Fig. 2) in all of the datasets. Relevant PCs were included in the asthma association models, in order to adjust for population structure (see Association analysis).

Association analysis

Because the studies included in the primary asthma association analysis represent a diverse spectrum of African ancestry (Fig. 1a and Supplementary Table 2), and had different objectives and differed in potentially relevant factors such as age of onset and diagnosis (Supplementary Note 1 and Supplementary Table 1), we performed tests for association separately for each dataset, and then combined the association results using MR-MEGA, a novel meta-regression approach that models allelic effects as a function of axes of genetic variation¹¹. In this way, heterogeneous associations across genetically distinct populations are not penalized, and the degree of heterogeneity due to ancestry can be assessed, as well as residual effects that may be due to differences in study design.

Statistical models fitted to the GWAS + ADPC datasets

Logistic mixed effects models were used to test for association between imputed allelic dosage and asthma, using the GENESIS R Bioconductor package. GENESIS uses a penalized quasi-likelihood approximation to the generalized linear mixed model. SNP association p-values are estimated using a score test, which tests for model fit improvement if the SNP is added to the null model. GENESIS uses PC-relate to estimate a kinship matrix excluding other sources of variance such as population structure⁵⁸, and PC-AiR to calculate principal components accounting for cryptic and known relatedness between subjects⁵⁷. The kinship matrix and principal components were calculated using a dataset of LD pruned genotyped SNPs. This kinship matrix was included as random effect in the null model, and principal components were included as fixed-effect covariates (the first principal component, as well as any of the top 10 principal components associated with asthma status [p-value < 0.05], were included).

Statistical models fitted to the SAPPHIRE dataset

ADPC data were only available for 730 SAPPHIRE asthmatics, but no controls. GWAS array data from the Affymetrix Axiom AFR array were available for 1325 cases and 566 controls (Table 1). The Henry Ford Health System group performed tests for association for this study, and shared summary statistics with CAAPA. To generate the summary statistics, logistic regression was used to test for association between imputed allelic dosage and asthma, using the PLINK software package^59,60. The GENESIS R Bioconductor package was used to estimate a kinship matrix, to ensure that the coefficient of relatedness between each pair of subjects was below 0.25. Principal components were also calculated using GENESIS. Principal components included in the regression models were chosen using the same strategy used in the CAAPA GWAS + ADPC association analysis.

Statistical models fitted to the MEGA datasets

Asthma association tests in the MEGA datasets used the same statistical models described for the GWAS + ADPC datasets. However, because some of the populations genotyped on the MEGA have a broader ancestry spectrum (Fig. 1a, Supplementary Fig. 1), the association analysis pipeline included as fixed-effect covariates the first principal component, any other principal components identified by the elbow method (Supplementary Fig. 2) as explaining a large percentage of variance, as well as any of the top 20 principal components associated with asthma status [p-value < 0.05].

Processing and assessment of individual association results

First, association results of SNPs with low imputation accuracy were removed from the individual datasets. The chosen filter was informed by a study that quantified imputation accuracy in African Americans⁶¹ and filtering was done based on minor allele frequency (MAF) and the per-SNP estimation of the squared correlation between imputed allele dosages and true unknown genotypes (Rsq). Associations between asthma and SNPs with MAF ≤ 0.005 were excluded if Rsq ≤ 0.5, as were associations with SNPs with MAF > 0.005 if Rsq ≤ 0.3. SNPs with a minor allele count ≤10 were also excluded. Next, inflation factors and QQ plots were used to assess the individual study association results (Supplementary Figs. 3 and 4). Large inflation of these test statistics was not observed. All of the CAAPA datasets were therefore included in the meta-analysis, described below.

Meta-analysis

MR-MEGA uses multi-dimensional scaling to infer genetic axes of variation across studies, and models allelic effect across studies using a linear regression model of the allelic effect of each study along each genetic axis of variation, weighting the contribution of a study by the inverse variance of the allelic effect from the study. The model is described in detail in the Materials and Methods section of the MR-MEGA publication¹¹. As input, the software requires the association odds ratio (OR) and its 95% confidence interval (CI), which were calculated per SNP and study as follows, where U is the Score statistic of the model fitted by GENESIS for the SNP, and i is its variance:

$${\mathrm{OR}} = e^{U/i}$$

(1)

$$95\% \,{\mathrm{CI}} = e^{U/i \pm 1.96\sqrt 1 /i}$$

(2)

These formulas are based on recommendations from Zhou et al.⁶². For SAPPHIRE, the odds ratio reported by the software was used as is, and the 95% CI was calculated as follows, using the standard error (SE) reported by the software:

$$95\% \,{\mathrm{CI}} = e^{{\mathrm{log}}({\mathrm{OR}}) \pm 1.96{\mathrm{SE}}}$$

(3)

The MR-MEGA software requires specification of the number of genetic axes of variation to be used in the meta-analysis. Most of the CAAPA populations have ancestry from African and European populations, with some Native American ancestry (Fig. 1a). Because the mean Native American ancestry represented is small, however, and the first axis generally separates populations of high and low African ancestry (Supplementary Fig. 5), only 1 axis of genetic variation was used in the meta-analysis.

Inverse-variance meta-analysis

MR-MEGA estimates p-values by comparing the deviance of the regression model with coefficients equal to zero, to the deviance of the model with the coefficients unconstrained, and combined odds ratios/effect sizes across studies are not estimated. For this reason, all cross-study odds ratios reported in this paper, e.g., Table 2 and the forest plots in the supplementary material, were estimated using inverse-variance meta-analysis. Inverse-variance meta-analysis was also used to combine association results of all the African American studies in order to generate Supplementary Table 4, combined SNP association results for the admixture mapping peak in the admixture mapping discovery dataset (Supplementary Fig. 6), and assessing associations on chromosome 17q12–21 in studies with high and low African ancestry (Supplementary Figs. 11 and 12, Supplementary Data 2).

Replication in cardiovascular disease studies

Imputed GWAS array data (described previously⁶³) from three cardiovascular disease studies with asthma information were used to assess replication of novel associations in CAAPA (CARDIA⁶⁴, MESA⁶⁵, and ARIC⁶⁶, see Supplementary Note 2 and Table 3). The imputed allelic dose of SNPs rs13277810 and rs114647118 was extracted, and logistic regression (implemented in the R software packages) was used to test for association between allelic dose and asthma. Principal components associated with asthma (p-value < 0.1) were included as covariates in the models.

Replication in BioMe

Summary statistics of tests for association between GWAS array data and asthma in African Americans and Hispanics from BioMe were also used to assess replication (see Supplementary Note 2 and Table 3). These summary statistics were generated using the logistic regression implementation in PLINK⁵⁹. ImpG-Summary⁶⁷ and the CAAPA WGS reference panel (Supplementary Table 3, Supplementary Note 3) were used to impute associations surrounding rs13277810, separately for African Americans and Hispanics (predicted r² for rs13277810 was high, 0.911 in both the African American and Hispanic datasets). Associations for the region surrounding the low frequency SNP rs114647118 were also imputed, but the imputation quality of this SNP, as well as other low frequency SNPs in high LD with it, was insufficient to assess replication (predicted r² = 0.586 and 0.581 for all SNPs assessed, in the African American and Hispanic datasets, respectively).

TAGC SNP-by-SNP comparison

The CAAPA meta-analysis was compared to the TAGC multi-ethnic meta-analysis, the largest and most definitive asthma GWAS to date⁷. The association results of the lead SNPs reported by TAGC were compared by extracting the corresponding CAAPA meta-analysis p-values, and using inverse-variance meta-analysis to calculate odds ratios and 95% CI for all SNPs (Table 2 and Supplementary Table 12). Note that two of the TAGC lead SNPs were not present in CAAPA: rs2855812 was filtered out of the CAAPA WGS reference panel because this SNP is located in a segmental duplication region, and rs11071558 was filtered out from the CAAPA reference panel because it is tri-allelic. Because the largest population represented in TAGC is European (19,954 of the 23,948 cases and 107,715 of the 118,538 controls), and because some of the CAAPA studies were included in the TAGC multi-ethnic meta-analysis, a direct comparison of association significance and effect size was only done for the TAGC European meta-analysis, and not the multi-ethnic meta-analysis. Together with the ancestry heterogeneity p-value estimated by MR-MEGA, this is also useful for gleaning the effect of asthma-associated SNPs in Europeans versus Africans.

TAGC replication in CAAPA

In addition to the SNP-by-SNP comparison described above, the TAGC summary statistics were downloaded and merged with the CAAPA meta-analysis results. Associations of all SNPs with significant fixed-effect p-values in TAGC Europeans were assessed in the CAAPA meta-analysis, after applying a Bonferroni correction for the number of overlapping SNPs in the merged dataset. A Bonferroni correction for the number of SNPs assessed in CAAPA was applied as a stringent correction for claiming robust replication of loci reported by TAGC in CAAPA (Supplementary Table 13).

Additional replication in CAAPA

Windows ±10 KB from each of the 18 TAGC lead SNPs were extracted from the CAAPA meta-analysis results (±200 KB for the chr17q12–21 locus, due to the extended LD in this region). A ±10-KB window surrounding two SNPs reported as genome-wide significant in African ancestry GWAS was also extracted^5,6. A Bonferroni-corrected significance threshold for 20 tests (one for each of these prior loci) was used as a measure of additional evidence of replication in the CAAPA meta-analysis (Fig. 1c and Table 2, loci not in bold font).

LD block replication in CAAPA

SNPs within the same LD block and in high LD with the TAGC lead SNPs (r² > 0.8) in Europeans from the TGP were also selected for comparison, and their association results were inspected (Supplementary Table 18). In this way, no or marginal replication of TAGC lead SNPs in CAAPA due to differences in LD patterns between Europeans and Africans can be assessed. The LD block surrounding each lead TAGC SNP was identified using TGP phase 3 data, population EUR, representing European ancestry. Thus, the effect of SNPs that may be causal in Europeans but not the lead SNP reported in the TAGC European analysis can still be tested for replication in CAAPA. Flanking SNPs ±10 KB from each TAGC lead SNP with MAF >0.05 in the CAAPA WGS reference panel and that intersected with the CAAPA meta-analysis association results as well as TGP phase 3 variants were selected, and Gabriel’s algorithm implemented in Haploview^68,69 was then used to identify LD blocks present in the window of SNPs. CAAPA meta-analysis p-values for SNPs falling in the LD block containing the lead SNP, and that had r² > 0.8 with the lead SNP, were extracted, and these SNPs and their asthma associations are summarized in Supplementary Table 18. The chr17q12–21 locus achieved genome-wide significance in CAAPA and was therefore excluded from this analysis.

EVE replication sensitivity analysis

A sensitivity analysis of the African ancestry-specific genome-wide association reported by the EVE consortium for SNP rs1101999⁶ was also done (Supplementary Table 12). CAAPA studies included in the EVE meta-analysis were meta-analyzed together, and all CAAPA studies not included in the EVE meta-analysis were meta-analyzed together. MR-MEGA was used for this meta-analysis, as CAAPA studies with relatively high proportions of European ancestry could be included, without penalizing the strength of the association should these high European ancestry populations show a different pattern of association compared to populations with high African and low European ancestry.

Tests for association with total serum IgE

Lead SNPs from Table 2 (reported by CAAPA, as well as TAGC SNPs with evidence for replication in CAAPA) were tested for association with total serum IgE (tIgE), adopting the approach used in the EVE consortium’s meta-analysis of genetic association with tIgE⁴⁰. The analysis was stratified based on asthma case-control status and originating study (eight groups of asthmatics and six groups of controls, see Supplementary Table 1 for a summary of the distribution of tIgE in these groups). Linear mixed-effect models implemented in the R Bioconductor GENESIS software package were used for association testing between the allelic dose at each SNP and Studentized residuals of log10 transformed tIgE (adjusted for age and sex). Similar to the asthma association models, the models included a kinship matrix as random effect and principal components as fixed-effect covariates. Finally, association test statistics from the analysis strata were combined using inverse-variance meta-analysis, to yield combined statistics for non-asthmatics, asthmatics, and non-asthmatics + asthmatics (Supplementary Table 19).

Admixture mapping discovery

Local ancestry inference was performed using RFMix⁷⁰, and the pipeline is described in Supplementary Note 12. Custom scripts were used to convert the RFMix local ancestry calls to an EMMAX dosage TPED file, where the encoded dosage is defined as having 0, 1, or 2 copies of African ancestry at a particular local ancestry segment. Tests for association between the local ancestry dosage values and asthma case-control status were then done using the linear mixed-effect models implemented in the EMMAX software package. Phenotypes for subjects from SAPPHIRE, which included only asthmatics and no controls as well as the three population outliers identified by the genome-wide ancestry analysis, were set to missing. These models included dataset as a fixed-effect covariate, and a Balding-Nichols kinship matrix as random effect. The method described by Gao et al. was used to estimate the number of effective tests that should be used in a Bonferroni correction for multiple testing⁷¹. Briefly, a local ancestry correlation matrix, and corresponding eigenvalues, was calculated using local ancestry dosage values (0/1/2 copies of African ancestry) per chromosome. The number of effective tests for a chromosome was then set to the number of eigenvectors that explain 99.5% of the variance in the local ancestry data (for n local ancestry segments, find the largest k such that $\mathop {\sum}\nolimits_{i = 1}^k {{\kern 1pt} {\mathrm{eigenvalues}}} {\mathrm{/}}\mathop {\sum}\nolimits_{i = 1}^n {{\kern 1pt} {\mathrm{eigenvalues}}} \le 0.995$). The total number of effective tests was then set to the sum of the number of effective tests calculated for each of the 22 autosomes. Using this method, a Bonferroni-corrected p-value threshold of 0.05/(262 total number of effective tests) = 1.9 × 10⁻⁴ was used to claim statistical significance.

Admixture mapping replication

The segments including the start and end position of the admixture mapping discovery peak (chr6:134,149,974–134,300,365) were selected for replication. Logistic regression was used to test for association between the number of copies of African ancestry (dosage value of 0, 1, 2) and case-control status, separately for each replication dataset (BASS, BAGS, JAAS, and BioMe). The base R package was used to fit the model to the BioMe data, adjusting for the first and fifth principal components, as these principal components were also associated with asthma status (p-values < 0.1). The R Bioconductor package GENESIS was used to fit models to the BASS, BAGS, and JAAS dataset, including a kinship matrix as random effect and principal components as fixed effects, as described for the SNP association analysis. The results for the segment including/closest to the midpoint of the admixture mapping discovery peak (134,225,170) was combined using inverse-variance meta-analysis.

Admixture mapping power calculations

We used the Genetic Association Study (GAS) Power Calculator⁷² to perform post-hoc power calculations for admixture mapping. As BAGS and HUFS include related subjects, their effective number of cases and controls were estimated as $n_{\mathrm{cases}} = 1{\mathrm{/}}(1 + 2\bar r)$ and $n_{\mathrm{controls}} = 1{\mathrm{/}}(1 + 2\bar r)$, where $\bar r$ is the mean kinship coefficient between relatives in the particular study⁷³ (the kinship matrix estimated by GENESIS (see Statistical models fitted to the GWAS + ADPC datasets) was used to calculate $\bar r$; $\bar r = 0.30$ for BAGS, $\bar r = 0.33$ for HUFS). Given 2380 cases and 3255 controls, at a significance level of p = 1.9 × 10⁻⁴, disease prevalence of 0.1, disease allele frequency of 0.2 (the mean number of copies of non-African ancestry), we had ≥89% power to detect a relative risk ≥1.20 in our discovery dataset. For our replication dataset, given 777 cases and 2263 controls, at a significance level of p = 0.05, disease allele frequency of 0.15 (accounting for the lower mean number of copies of non-African ancestry in BAGS and JAAS), we had ≥70% power to detect a relative risk ≥1.2. However, for a relative risk of 1.12 (the ratio of non-European local ancestry in controls versus cases for the segment identified in the discovery), we only had 34% power for replication.

Software

Detailed information on the software packages and software versions used for analyses are listed in Supplementary Table 24.

Data availability

The legacy GWAS array ADPC and MEGA data that support the findings of this study have been deposited in dbGAP with the accession code phs001123.v2.p1. These data can be accessed through dbGAP. Specific data use limitations: GRU-IRB (General Research Use, IRB approval required). Only ADPC data are available for the SAGE II, GALA II, and SAPPHIRE datasets; in addition, no phenotype data are available for these three datasets. Summary statistics from the meta-analysis are also available through the GWAS catalog¹² [https://www.ebi.ac.uk/gwas/downloads/summary-statistics].

Change history

04 September 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Moorman, J. E. et al. National surveillance of asthma: United States, 2001–2010. Vital Health Stat. 3, 1–58 (2012).
Google Scholar
Moorman, J. E. et al. National surveillance for asthma—United States, 1980–2004. Mmwr. Surveill. Summ. 56, 1–54 (2007).
PubMed Google Scholar
Keet, C. A. et al. Neighborhood poverty, urban residence, race/ethnicity, and asthma: rethinking the inner-city asthma epidemic. J. Allergy Clin. Immunol. 135, 655–662 (2015).
Article PubMed PubMed Central Google Scholar
Mathias, R. A. et al. A genome-wide association study on African-ancestry populations for asthma. J. Allergy Clin. Immunol. 125, 336–346 e4 (2010).
Article CAS PubMed Google Scholar
Almoguera, B. et al. Identification of four novel loci in asthma in European American and African American populations. Am. J. Respir. Crit. Care. Med. 195, 456–463 (2017).
Article PubMed PubMed Central Google Scholar
Torgerson, D. G. et al. Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations. Nat. Genet. 43, 887–892 (2011).
Article CAS PubMed PubMed Central Google Scholar
Demenais, F. et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat. Genet. 50, 42–53 (2018).
Article CAS PubMed Google Scholar
White, M. J. et al. Novel genetic risk factors for asthma in African American children: precision medicine and the SAGE II study. Immunogenetics 68, 391–400 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mathias, R. A. et al. A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome. Nat. Commun. 7, 12522 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Johnston, H. R. et al. Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome. Sci. Rep. 7, 46398 (2017).
Article ADS PubMed PubMed Central Google Scholar
Magi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
Article CAS PubMed PubMed Central Google Scholar
MacArthur, J. et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 45, D896–D901 (2017).
Article CAS PubMed Google Scholar
Stein, M. M. et al. A decade of research on the 17q12–21 asthma locus: piecing together the puzzle. J. Allergy Clin. Immunol. 142, 749–764 (2018).
Article PubMed Google Scholar
Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
Article CAS PubMed PubMed Central Google Scholar
Martin, J. S. et al. HUGIn: Hi-C unifying genomic interrogator. Bioinformatics 33, 3793–3795 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ay, F., Bailey, T. L. & Noble, W. S. Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 24, 999–1011 (2014).
Article CAS PubMed PubMed Central Google Scholar
Singh, D. et al. Altered gene expression in blood and sputum in COPD frequent exacerbators in the ECLIPSE cohort. PLoS One 9, e107381 (2014).
Article ADS PubMed PubMed Central Google Scholar
Pinto, S. M. et al. Quantitative phosphoproteomic analysis of IL-33-mediated signaling. Proteomics 15, 532–544 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yao, T. C. et al. Genome-wide association study of lung function phenotypes in a founder population. J. Allergy Clin. Immunol. 133, 248–255 e1–10 (2014).
Centers for Disease Control and Prevention (CDC) National Asthma Control Program. Asthma in Puerto Rico. State Data Profiles https://www.cdc.gov/asthma/stateprofiles/Asthma_in_PR.pdf (2011).
Dragon, S., Hirst, S. J., Lee, T. H. & Gounni, A. S. IL-17A mediates a selective gene expression profile in asthmatic human airway smooth muscle cells. Am. J. Respir. Cell Mol. Biol. 50, 1053–1063 (2014).
Article PubMed PubMed Central Google Scholar
Goenka, S. & Kaplan, M. H. Transcriptional regulation by STAT6. Immunol. Res. 50, 87–96 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wain, L. V. et al. Genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets. Nat. Genet. 49, 416–425 (2017).
Article CAS PubMed PubMed Central Google Scholar
Soler Artigas, M. et al. Genome-wide association and large-scale follow up identifies 16 new loci influencing lung function. Nat. Genet. 43, 1082–1090 (2011).
Article PubMed Google Scholar
The Collaborative Study on the Genetics of Asthma (CSGA). A genome-wide search for asthma susceptibility loci in ethnically diverse populations. Nat. Genet. 15, 389–392 (1997).
Article Google Scholar
Barnes, K. C. et al. Dense mapping of chromosome 12q13.12–q23.3 and linkage to asthma and atopy. J. Allergy Clin. Immunol. 104, 485–491 (1999).
Article CAS PubMed Google Scholar
Barnes, K. C. et al. Linkage of asthma and total serum IgE concentration to markers on chromosome 12q: evidence from Afro-Caribbean and Caucasian populations. Genomics 37, 41–50 (1996).
Article CAS PubMed Google Scholar
Nickel, R. et al. Evidence for linkage of chromosome 12q15–q24.1 markers to high total serum IgE concentrations in children of the German Multicenter Allergy Study. Genomics 46, 159–162 (1997).
Article CAS PubMed Google Scholar
Kavalar, M. S. et al. Association of ORMDL3, STAT6 and TBXA2R gene polymorphisms with asthma. Int. J. Immunogenet. 39, 20–25 (2012).
Article CAS PubMed Google Scholar
Duetsch, G. et al. STAT6 as an asthma candidate gene: polymorphism-screening, association and haplotype analysis in a Caucasian sib-pair study. Hum. Mol. Genet. 11, 613–621 (2002).
Article CAS PubMed Google Scholar
Wang, Q. et al. Association of polymorphisms of STAT6 and SO(2) with Chinese childhood asthma: a case-control study. Biomed. Environ. Sci. 24, 670–677 (2011).
CAS PubMed Google Scholar
Godava, M., Vrtel, R. & Vodicka, R. STAT6—polymorphisms, haplotypes and epistasis in relation to atopy and asthma. Biomed. Pap. Med. Fac. Univ. Palacky. Olomouc Czech. Repub. 157, 172–180 (2013).
Article CAS PubMed Google Scholar
Yan, Q. et al. A meta-analysis of genome-wide association studies of asthma in Puerto Ricans. Eur. Respir. J. 49, 1601505 (2017).
Article PubMed Google Scholar
Kreiner-Moller, E. et al. 17q21 gene variation is not associated with asthma in adulthood. Allergy 70, 107–114 (2015).
Article CAS PubMed Google Scholar
Esnault, S. et al. Identification of genes expressed by human airway eosinophils after an in vivo allergen challenge. PLoS One 8, e67560 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Lima, L. C. et al. Genetic variants in RORA are associated with asthma and allergy markers in an admixed population. Cytokine 113, 177–184 (2019).
Article CAS PubMed Google Scholar
Sunyer, J., Anto, J. M., Castellsague, J., Soriano, J. B. & Roca, J. Total serum IgE is associated with asthma independently of specific IgE levels. The Spanish Group of the European Study of Asthma. Eur. Respir. J. 9, 1880–1884 (1996).
Article CAS PubMed Google Scholar
Sears, M. R. et al. Relation between airway responsiveness and serum IgE in children with asthma and in apparently normal children. N. Engl. J. Med. 325, 1067–1071 (1991).
Article CAS PubMed Google Scholar
Palmer, L. J. et al. Independent inheritance of serum immunoglobulin E concentrations and airway responsiveness. Am. J. Respir. Crit. Care. Med. 161, 1836–1843 (2000).
Article CAS PubMed Google Scholar
Levin, A. M. et al. A meta-analysis of genome-wide association studies for serum total IgE in diverse study populations. J. Allergy Clin. Immunol. 131, 1176–1184 (2013).
Article CAS PubMed Google Scholar
Burrows, B., Martinez, F. D., Halonen, M., Barbee, R. A. & Cline, M. G. Association of asthma with serum IgE levels and skin-test reactivity to allergens. N. Engl. J. Med. 320, 271–277 (1989).
Article CAS PubMed Google Scholar
Galanter, J. M. et al. Genome-wide association study and admixture mapping identify different asthma-associated loci in Latinos: the Genes-environments & Admixture in Latino Americans study. J. Allergy Clin. Immunol. 134, 295–305 (2014).
Article CAS PubMed PubMed Central Google Scholar
Vaillancourt, V. T., Bordeleau, M., Laviolette, M. & Laprise, C. From expression pattern to genetic association in asthma and asthma-related phenotypes. BMC Res. Notes 5, 630 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gould, W. et al. Factors predicting inhaled corticosteroid responsiveness in African American patients with asthma. J. Allergy Clin. Immunol. 126, 1131–1138 (2010).
Article CAS PubMed PubMed Central Google Scholar
Sulovari, A., Chen, Y. H., Hudziak, J. J. & Li, D. Atlas of human diseases influenced by genetic variants with extreme allele frequency differences. Hum. Genet. 136, 39–54 (2017).
Article PubMed Google Scholar
Morales, J. et al. A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog. Genome Biol. 19, 21 (2018).
Article PubMed PubMed Central Google Scholar
West, K. M., Blacksher, E. & Burke, W. Genomics, health disparities, and missed opportunities for the nation’s research agenda. JAMA 317, 1831–1832 (2017).
Article PubMed PubMed Central Google Scholar
Popejoy, A. B. & Fullerton, S. M. Genomics is failing on diversity. Nature 538, 161–164 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Ferreira, M. A. et al. Shared genetic origin of asthma, hay fever and eczema elucidates allergic disease biology. Nat. Genet. 49, 1752–1757 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wolford, B. N., Willer, C. J. & Surakka, I. Electronic health records: the next wave of complex disease genetics. Hum. Mol. Genet. 27, R14–R21 (2018).
National Institutes of Health. Health Disparities (NIH, UK, 2010).
Brody, J. A. et al. Analysis commons, a team approach to discovery in a big-data environment for genetic epidemiology. Nat. Genet. 49, 1560–1563 (2017).
Article CAS PubMed PubMed Central Google Scholar
Schlauch, D., Glass, K., Hersh, C. P., Silverman, E. K. & Quackenbush, J. Estimating drivers of cell state transitions using gene regulatory network models. BMC Syst. Biol. 11, 139 (2017).
Article PubMed PubMed Central Google Scholar
Ferreira, M. A. et al. Gene-based analysis of regulatory variants identifies 4 putative novel asthma risk genes related to nucleotide synthesis and signaling. J. Allergy Clin. Immunol. 139, 1148–1157 (2017).
Article CAS PubMed Google Scholar
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mao, X. et al. A genomewide admixture mapping panel for Hispanic/Latino populations. Am. J. Hum. Genet. 80, 1171–1178 (2007).
Article CAS PubMed PubMed Central Google Scholar
Conomos, M. P., Miller, M. B. & Thornton, T. A. Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness. Genet. Epidemiol. 39, 276–293 (2015).
Article PubMed PubMed Central Google Scholar
Conomos, M. P., Reiner, A. P., Weir, B. S. & Thornton, T. A. Model-free estimation of recent genetic relatedness. Am. J. Hum. Genet. 98, 127–148 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Liu, E. Y. et al. Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women’s Health Initiative. Genet. Epidemiol. 36, 107–117 (2012).
Article PubMed PubMed Central Google Scholar
Zhou, B., Shi, J. & Whittemore, A. S. Optimal methods for meta-analysis of genome-wide association studies. Genet. Epidemiol. 35, 581–591 (2011).
Article PubMed PubMed Central Google Scholar
Lettre, G. et al. Genome-wide association study of coronary heart disease and its risk factors in 8,090 African Americans: the NHLBI CARe Project. PLoS Genet. 7, e1001300 (2011).
Article CAS PubMed PubMed Central Google Scholar
Friedman, G. D. et al. CARDIA: study design, recruitment, and some characteristics of the examined subjects. J. Clin. Epidemiol. 41, 1105–1116 (1988).
Article ADS CAS PubMed Google Scholar
Bild, D. E. et al. Multi-Ethnic Study of Atherosclerosis: objectives and design. Am. J. Epidemiol. 156, 871–881 (2002).
Article PubMed Google Scholar
The ARIC investigators. The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. Am. J. Epidemiol. 129, 687–702 (1989).
Article Google Scholar
Pasaniuc, B. et al. Fast and accurate imputation of summary statistics enhances evidence of functional enrichment. Bioinformatics 30, 2906–2914 (2014).
Article CAS PubMed PubMed Central Google Scholar
Barrett, J. C., Fry, B., Maller, J. & Daly, M. J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265 (2005).
Article CAS PubMed Google Scholar
Gabriel, S. B. et al. The structure of haplotype blocks in the human genome. Science 296, 2225–2229 (2002).
Article ADS CAS PubMed Google Scholar
Maples, B. K., Gravel, S., Kenny, E. E. & Bustamante, C. D. RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. Am. J. Hum. Genet. 93, 278–288 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gao, X., Starmer, J. & Martin, E. R. A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet. Epidemiol. 32, 361–369 (2008).
Article PubMed Google Scholar
Johnson, J. L. & Abecasis, G. R. GAS Power Calculator: web-based power calculator for genetic association studies. Preprint at bioRxiv https://doi.org/10.1101/164343 (2017).
Yang, Y. et al. Effective sample size: quick estimation of the effect of related samples in genetic case-control association analyses. Comput. Biol. Chem. 35, 40–49 (2011).
Article MathSciNet CAS PubMed PubMed Central MATH Google Scholar

Download references

Acknowledgements

We thank Goncalo Abecasis for coordinating inclusion of the CAAPA reference panel on the Michigan Imputation Server, Todd Deppe, Estelle Giraud, Cindy Lawley from Illumina for genotyping services, and Pat Oldewurtel for administrative and technical support.

Author information

These authors contributed equally: Rasika A. Mathias, Kathleen C. Barnes.

Authors and Affiliations

Department of Medicine, University of Colorado Denver, Aurora, CO, 80045, USA
Michelle Daya, Nicholas Rafaels, Tonya M. Brunetti, Sameer Chavan, Aniket Shetty, Christopher R. Gignoux, Meher Preethi Boorgula, Monica Campbell, Leslie A. Lange, Ethan M. Lange, Kathleen C. Barnes & Iain Königsberg
Department of Public Health Sciences, Henry Ford Health System, Detroit, MI, 48202, USA
Albert M. Levin
Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
Genevieve Wojcik, Carlos Bustamante, Shaila Musharoff, Suyash S. Shringarpure & Francisco M. De La Vega
Department of Medicine, Johns Hopkins University, Baltimore, MD, 21224, USA
Candelaria Vergara, Nadia N. Hansel, Rasika A. Mathias, Cassandra Foster & Li Gao
Department of Medicine, University of California San Francisco, San Francisco, CA, 94143, USA
Dara G. Torgerson, Celeste Eng & Maria Pino-Yanes
Center for Human Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, 27157, USA
Victor E. Ortega
Center for Research on Genomics & Global Health, National Institutes of Health, Bethesda, MD, 20892, USA
Ayo Doumatey & Charles Rotimi
Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA
Henry Richard Johnston
Institute for Immunological Research, Universidad de Cartagena, Cartagena, 130000, Colombia
Nathalie Acevedo, Luis Caraballo & Silvia Jiménez
Immunology Service, Universidade Federal da Bahia, Salvador, 401110170, Brazil
Maria Ilma Araujo
Department of Medicine, Northwestern University, Chicago, IL, 60611, USA
Pedro C. Avila, Leslie Grammer & Robert Schleimer
Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Gillian Belbin & Eimear E. Kenny
Department of Medicine, University of Arizona College of Medicine, Tucson, AZ, 85724, USA
Eugene Bleecker & Deborah Meyers
Universidade Federal da Bahia, Salvador, 401110170, Brazil
Alvaro Cruz
Department of Microbiology, Howard University College of Medicine, Washington, DC, 20059, USA
Georgia M. Dunston
National Human Genome Center, Howard University College of Medicine, Washington, DC, 20059, USA
Mezbah U. Faruque
Caribbean Institute for Health Research, The University of the West Indies, Kingston, 00007, Jamaica
Trevor S. Ferguson, Jennifer Knight-Madden & Rainford J. Wilks
Departamento de Biorregulacao, Universidade Federal da Bahia, Salvador, 401110170, Brazil
Camila Figueiredo
Department of Medicine, Einstein Medical Center, Philadelphia, PA, 19141, USA
Jean G. Ford
National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Weiniu Gan
Université de Nantes, INSERM, Centre de Recherche en Transplantation et Immunologie, UMR, 1064, ATIP-Avenir, Equipe 5, Nantes, France
Pierre-Antoine Gourraud, Antoine Lizee & Nicolas Vince
Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94143, USA
Ryan D. Hernandez, Esteban G. Burchard & Zachary A. Szpiech
Facultad de Medicina, Universidad Católica de Honduras, San Pedro Sula, 21102, Honduras
Edwin Francisco Herrera-Paz, Gerardo Ayestas, Said Omar Leiva Erazo, Pamela Landaverde-Torres, Rosella Martinez, Luis F. Mayorga, Hector Ramos, Allan Saenz & Gloria Varela
Universidad Tecnológica Centroamericana (UNITEC), Facultad de Ciencias Médicas, Tegucigalpa, Honduras
Edwin Francisco Herrera-Paz
Department of Pediatrics, Northwestern University, Chicago, IL, 60611, USA
Rajesh Kumar
Genetics and Epidemiology of Asthma in Barbados, The University of the West Indies, Chronic Disease Research Centre, Jemmots Lane, St. Michael, BB11115, Barbados
Pissamai Maul & Trevor Maul
Centro de Neumologia y Alergias, San Pedro Sula, 21102, Honduras
Alvaro Mayorga
Department of Medicine, University of Chicago, Chicago, IL, 60637, USA
Dan L. Nicolae, Olufunmilayo Olopade & Oluwafemi Oluwole
Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
Timothy D. O’Connor & Wei Song
Laboratório de Patologia Experimental, Centro de Pesquisas Gonçalo Moniz, Salvador, 40296-710, Brazil
Ricardo Riccio Oliveira
Department of Medicine and Center for Global Health, University of Chicago, Chicago, IL, 60637, USA
Christopher O. Olopade
Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, 30322, USA
Zhaohui S. Qin & Yijuan Hu
Faculty of Medical Sciences, The University of the West Indies, Queen Elizabeth Hospital, Bridgetown, St. Michael, BB11000, Barbados
Harold Watson
Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, 39216, USA
James G. Wilson
Departments of Biomedical Engineering and Biostatistics, Johns Hopkins University, Baltimore, MD, 21205, USA
Steven Salzberg
Department of Human Genetics, University of Chicago, Chicago, IL, 60637, USA
Carole Ober
Center for Individualized and Genomic Medicine Research, Henry Ford Health System, Detroit, MI, 48202, USA
L. Keoki Williams
Department of Epidemiology, Bloomberg School of Public Health, JHU, Baltimore, MD, 21205, USA
Terri H. Beaty
Department of Biostatistics, Bloomberg School of Public Health, JHU, Baltimore, MD, 21205, USA
Margaret A. Taub & Ingo Ruczinski
Centre de Recherches Médicales de Lambaréné, BP:242, Lambaréné, 13901, Gabon
Ayola Akim Adegnika & Ulysse Ateba-Ngoa
Department of Chemical Pathology, University of Ibadan, Ibadan, 900001, Nigeria
Ganiyu Arinola
Faculty of Medicine, University of Iceland, 101, Reykjavík, Iceland
Hrafnhildur Bjarnadóttir
Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA
Adolfo Correa & Solomon Musani
Pulmonary and Critical Care Medicine, Morehouse School of Medicine, Atlanta, GA, 30310, USA
Marilyn G. Foreman
Data and Statistical Sciences, AbbVie, North Chicago, IL, 60064, USA
Jingjing Gao
Illumina, Inc., San Diego, CA, 92122, USA
Mark Hansen
Department of Medicine, Vanderbilt University, Nashville, TN, 37232, USA
Tina Hartert & Lorraine B. Ware
Department of Preventive Medicine, Northwestern University, Chicago, IL, 60611, USA
Kwang-Youn A. Kim
Instituto de Investigaciones Immunologicas, Universidad de Cartagena, Cartagena, 130000, Colombia
Javier Marrugo, Beatriz Martinez & Catherine Meza
Facultad de Ciencias de la Salud, Universidad Tecnológica Centroamericana (UNITEC), San Pedro Sula, 21102, Honduras
Delmy-Aracely Mejia-Mejia
Department of Child Health, The University of the West Indies, Kingston, 00007, Jamaica
Maureen Samms-Vaughan
Department of Medicine, Johns Hopkins University, Baltimore, MD, 21287, USA
Alan F. Scott
Biomedical Sciences Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
Raul Torres
Centro Medico de la Familia, San Pedro Sula, 21102, Honduras
Olga Marina Vasquez
Department of Parasitology, Leiden University Medical Center, Leiden, 02333, Netherlands
Maria Yazdanbakhsh

Authors

Michelle Daya
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Rafaels
View author publications
You can also search for this author in PubMed Google Scholar
Tonya M. Brunetti
View author publications
You can also search for this author in PubMed Google Scholar
Sameer Chavan
View author publications
You can also search for this author in PubMed Google Scholar
Albert M. Levin
View author publications
You can also search for this author in PubMed Google Scholar
Aniket Shetty
View author publications
You can also search for this author in PubMed Google Scholar
Christopher R. Gignoux
View author publications
You can also search for this author in PubMed Google Scholar
Meher Preethi Boorgula
View author publications
You can also search for this author in PubMed Google Scholar
Genevieve Wojcik
View author publications
You can also search for this author in PubMed Google Scholar
Monica Campbell
View author publications
You can also search for this author in PubMed Google Scholar
Candelaria Vergara
View author publications
You can also search for this author in PubMed Google Scholar
Dara G. Torgerson
View author publications
You can also search for this author in PubMed Google Scholar
Victor E. Ortega
View author publications
You can also search for this author in PubMed Google Scholar
Ayo Doumatey
View author publications
You can also search for this author in PubMed Google Scholar
Henry Richard Johnston
View author publications
You can also search for this author in PubMed Google Scholar
Nathalie Acevedo
View author publications
You can also search for this author in PubMed Google Scholar
Maria Ilma Araujo
View author publications
You can also search for this author in PubMed Google Scholar
Pedro C. Avila
View author publications
You can also search for this author in PubMed Google Scholar
Gillian Belbin
View author publications
You can also search for this author in PubMed Google Scholar
Eugene Bleecker
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Bustamante
View author publications
You can also search for this author in PubMed Google Scholar
Luis Caraballo
View author publications
You can also search for this author in PubMed Google Scholar
Alvaro Cruz
View author publications
You can also search for this author in PubMed Google Scholar
Georgia M. Dunston
View author publications
You can also search for this author in PubMed Google Scholar
Celeste Eng
View author publications
You can also search for this author in PubMed Google Scholar
Mezbah U. Faruque
View author publications
You can also search for this author in PubMed Google Scholar
Trevor S. Ferguson
View author publications
You can also search for this author in PubMed Google Scholar
Camila Figueiredo
View author publications
You can also search for this author in PubMed Google Scholar
Jean G. Ford
View author publications
You can also search for this author in PubMed Google Scholar
Weiniu Gan
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Antoine Gourraud
View author publications
You can also search for this author in PubMed Google Scholar
Nadia N. Hansel
View author publications
You can also search for this author in PubMed Google Scholar
Ryan D. Hernandez
View author publications
You can also search for this author in PubMed Google Scholar
Edwin Francisco Herrera-Paz
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
Eimear E. Kenny
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Knight-Madden
View author publications
You can also search for this author in PubMed Google Scholar
Rajesh Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Leslie A. Lange
View author publications
You can also search for this author in PubMed Google Scholar
Ethan M. Lange
View author publications
You can also search for this author in PubMed Google Scholar
Antoine Lizee
View author publications
You can also search for this author in PubMed Google Scholar
Pissamai Maul
View author publications
You can also search for this author in PubMed Google Scholar
Trevor Maul
View author publications
You can also search for this author in PubMed Google Scholar
Alvaro Mayorga
View author publications
You can also search for this author in PubMed Google Scholar
Deborah Meyers
View author publications
You can also search for this author in PubMed Google Scholar
Dan L. Nicolae
View author publications
You can also search for this author in PubMed Google Scholar
Timothy D. O’Connor
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Riccio Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Christopher O. Olopade
View author publications
You can also search for this author in PubMed Google Scholar
Olufunmilayo Olopade
View author publications
You can also search for this author in PubMed Google Scholar
Zhaohui S. Qin
View author publications
You can also search for this author in PubMed Google Scholar
Charles Rotimi
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Vince
View author publications
You can also search for this author in PubMed Google Scholar
Harold Watson
View author publications
You can also search for this author in PubMed Google Scholar
Rainford J. Wilks
View author publications
You can also search for this author in PubMed Google Scholar
James G. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Steven Salzberg
View author publications
You can also search for this author in PubMed Google Scholar
Carole Ober
View author publications
You can also search for this author in PubMed Google Scholar
Esteban G. Burchard
View author publications
You can also search for this author in PubMed Google Scholar
L. Keoki Williams
View author publications
You can also search for this author in PubMed Google Scholar
Terri H. Beaty
View author publications
You can also search for this author in PubMed Google Scholar
Margaret A. Taub
View author publications
You can also search for this author in PubMed Google Scholar
Ingo Ruczinski
View author publications
You can also search for this author in PubMed Google Scholar
Rasika A. Mathias
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen C. Barnes
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

CAAPA

Ayola Akim Adegnika
, Ganiyu Arinola
, Ulysse Ateba-Ngoa
, Gerardo Ayestas
, Hrafnhildur Bjarnadóttir
, Adolfo Correa
, Said Omar Leiva Erazo
, Marilyn G. Foreman
, Cassandra Foster
, Li Gao
, Jingjing Gao
, Leslie Grammer
, Mark Hansen
, Tina Hartert
, Yijuan Hu
, Iain Königsberg
, Kwang-Youn A. Kim
, Pamela Landaverde-Torres
, Javier Marrugo
, Beatriz Martinez
, Rosella Martinez
, Luis F. Mayorga
, Delmy-Aracely Mejia-Mejia
, Catherine Meza
, Solomon Musani
, Shaila Musharoff
, Oluwafemi Oluwole
, Maria Pino-Yanes
, Hector Ramos
, Allan Saenz
, Maureen Samms-Vaughan
, Robert Schleimer
, Alan F. Scott
, Suyash S. Shringarpure
, Wei Song
, Zachary A. Szpiech
, Raul Torres
, Gloria Varela
, Olga Marina Vasquez
, Francisco M. De La Vega
, Lorraine B. Ware
& Maria Yazdanbakhsh

Contributions

M.D., N.R., and T.M.B, analysed the data, interpreted the data, and wrote the paper. S.C., A.M.L., A.S., M.P.B., G.W., H.R.J. and G.B. analysed and interpreted the data. M.C. performed genotyping. K.C.B., R.A.M., T.H.B. and M.A.T. conceived the experiments, interpreted the data, and wrote the paper. C.V., C.R.G., V.E.O., A.D., D.G.T., N.A., M.I.A., P.C.A., E.B., C.B., L.C., A.C., G.M.D., C.E., M.U.F., T.S.F., C.F., J.G.F., W.G., P.A.G., N.N.H., R.D.H., E.F.H., S.J., E.E.K., J.K., R.K., L.A.L., E.M.L., A.L., P.M., T.M., A.M., D.M., D.L.N., T.D.O., R.R.O., C.O.O., O.O., Z.S.Q., C.R., N.V., H.W., R.J.W., J.G.W., S.S., C.O., E.G.B., L.K.W. and I.R. contributed to interpretation of results and critically reviewed the manuscript. CAAPA provided infrastructure, biospecimens and associated phenotype data, and intellectual input for the overall design and execution of the study.

Corresponding author

Correspondence to Kathleen C. Barnes.

Ethics declarations

Competing interests

All authors declare no competing interests.

Additional information

Journal peer review information: Nature Communications thanks Martin Tobin and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A full list of consortium members appears at the end of the paper.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Daya, M., Rafaels, N., Brunetti, T.M. et al. Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations. Nat Commun 10, 880 (2019). https://doi.org/10.1038/s41467-019-08469-7

Download citation

Received: 21 June 2018
Accepted: 08 January 2019
Published: 20 February 2019
DOI: https://doi.org/10.1038/s41467-019-08469-7

This article is cited by

DNA-binding factor footprints and enhancer RNAs identify functional non-coding genetic variants
- Simon C. Biddie
- Giovanna Weykopf
- Wendy A. Bickmore
Genome Biology (2024)
Evaluation of penalized and machine learning methods for asthma disease prediction in the Korean Genome and Epidemiology Study (KoGES)
- Yongjun Choi
- Junho Cha
- Sungkyoung Choi
BMC Bioinformatics (2024)
Multi-omics in nasal epithelium reveals three axes of dysregulation for asthma risk in the African Diaspora populations
- Brooke Szczesny
- Meher Preethi Boorgula
- Rasika A. Mathias
Nature Communications (2024)
Identification of asthma-related genes using asthmatic blood eQTLs of Korean patients
- Dong Jun Kim
- Ji Eun Lim
- Bermseok Oh
BMC Medical Genomics (2023)
Multi-ancestry meta-analysis and fine-mapping in Alzheimer’s disease
- Julie Lake
- Caroline Warly Solsberg
- Hampton L. Leonard
Molecular Psychiatry (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Association analysis

Associations novel to CAAPA

Comparison to previous asthma GWAS

Genome-wide significant associations on chr17q12–q21

Associations with total serum IgE

Admixture mapping

Discussion

Methods

Study oversight

Genome-wide ancestry estimation and analysis

Association analysis

Statistical models fitted to the GWAS + ADPC datasets

Statistical models fitted to the SAPPHIRE dataset

Statistical models fitted to the MEGA datasets

Processing and assessment of individual association results

Meta-analysis

Inverse-variance meta-analysis

Replication in cardiovascular disease studies

Replication in BioMe

TAGC SNP-by-SNP comparison

TAGC replication in CAAPA

Additional replication in CAAPA

LD block replication in CAAPA

EVE replication sensitivity analysis

Tests for association with total serum IgE

Admixture mapping discovery

Admixture mapping replication

Admixture mapping power calculations

Software

Data availability

Change history

04 September 2019

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

CAAPA

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links