Identification of genetic effects underlying type 2 diabetes in South Asian and European populations

Loh, Marie; Zhang, Weihua; Ng, Hong Kiat; Schmid, Katharina; Lamri, Amel; Tong, Lin; Ahmad, Meraj; Lee, Jung-Jin; Ng, Maggie C. Y.; Petty, Lauren E.; Spracklen, Cassandra N.; Takeuchi, Fumihiko; Islam, Md. Tariqul; Jasmine, Farzana; Kasturiratne, Anuradhani; Kibriya, Muhammad; Mohlke, Karen L.; Paré, Guillaume; Prasad, Gauri; Shahriar, Mohammad; Chee, Miao Ling; de Silva, H. Janaka; Engert, James C.; Gerstein, Hertzel C.; Mani, K. Radha; Sabanayagam, Charumathi; Vujkovic, Marijana; Wickremasinghe, Ananda R.; Wong, Tien Yin; Yajnik, Chittaranjan S.; Yusuf, Salim; Ahsan, Habibul; Bharadwaj, Dwaipayan; Anand, Sonia S.; Below, Jennifer E.; Boehnke, Michael; Bowden, Donald W.; Chandak, Giriraj R.; Cheng, Ching-Yu; Kato, Norihiro; Mahajan, Anubha; Sim, Xueling; McCarthy, Mark I.; Morris, Andrew P.; Kooner, Jaspal S.; Saleheen, Danish; Chambers, John C.

doi:10.1038/s42003-022-03248-5

Download PDF

Article
Open access
Published: 07 April 2022

Identification of genetic effects underlying type 2 diabetes in South Asian and European populations

Communications Biology volume 5, Article number: 329 (2022) Cite this article

6137 Accesses
22 Citations
38 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 05 May 2022

This article has been updated

Abstract

South Asians are at high risk of developing type 2 diabetes (T2D). We carried out a genome-wide association meta-analysis with South Asian T2D cases (n = 16,677) and controls (n = 33,856), followed by combined analyses with Europeans (n_eff = 231,420). We identify 21 novel genetic loci for significant association with T2D (P = 4.7 × 10⁻⁸ to 5.2 × 10⁻¹²), to the best of our knowledge at the point of analysis. The loci are enriched for regulatory features, including DNA methylation and gene expression in relevant tissues, and highlight CHMP4B, PDHB, LRIG1 and other genes linked to adiposity and glucose metabolism. A polygenic risk score based on South Asian-derived summary statistics shows ~4-fold higher risk for T2D between the top and bottom quartile. Our results provide further insights into the genetic mechanisms underlying T2D, and highlight the opportunities for discovery from joint analysis of data from across ancestral populations.

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Associations of dietary patterns with brain health from behavioral, neuroimaging, biochemical and genetic analyses

Article Open access 01 April 2024

Introduction

Type 2 diabetes (T2D) is a major public health problem that currently affects 425 million people worldwide¹. The burden of diabetes is especially high in South Asia, the most densely populated region of the world (~1.9 billion people in 2019, ~24.9% of world population)². India alone is home to 72 million people with T2D¹. In urban settings, risk of T2D in South Asians is four-fold higher than Europeans³, with T2D developing 10 years earlier⁴. The increased risk of T2D in South Asians is not accounted for by traditional risk factors such as obesity, diet and physical inactivity, which has led to the hypothesis that South Asians may have increased genetic susceptibility⁵.

To date, sequence variants at more than 400 genetic loci have been implicated in the aetiology of T2D through genome-wide association studies (GWAS) that have been carried out predominantly amongst people of European ancestry^{6,7,8,9,10,11,12}. Although many of these risk relationships are cosmopolitan, population-specific studies have revealed genetic variants associated with T2D that are common in their respective ethnic groups, but rare or absent elsewhere^{3,13,14,15,16,17,18,19}. Recent studies have highlighted the extensive genetic diversity that exists in South Asia^{20,21,22,23,24,25,26}, and which provides the basis for investigating the possibility of distinct disease relationships, in addition to cosmopolitan effects. There is a well recognised need for deeper investigation of the genetic basis for T2D and other chronic diseases in South Asian populations²⁷.

Results

Genome-wide meta-analysis

As part of the DIAbetes Meta-ANalysis of Trans-Ethnic association studies (DIAMANTE) project, we combined data from 16 South Asian case-control and population-based cohorts, to complete the largest discovery genome-wide meta-analysis for T2D in this ethnic group to date (16,677 cases and 33,856 controls). Characteristics of participants, information on the genotyping arrays and imputation are detailed in Supplementary Data 1 and 2. We performed two primary association analyses in body mass index (BMI) unadjusted and BMI-adjusted models. At a P-value threshold of 5 × 10⁻⁸, a total of 372 and 440 SNPs, distributed across 10 and 14 gene loci, reached statistical significance in the BMI-unadjusted and adjusted models, respectively (Supplementary Data 3; Supplementary Figs. 1, 2). All SNPs identified at this stage were from genetic loci known to be associated with T2D (Supplementary Data 4), arguing against an important contribution of population-specific variants to the risk of T2D in South Asians.

Next, we carried out a combined analysis of GWAS data for South Asians and Europeans to identify genetic effects that are shared between the populations. To the best of our knowledge at the point of analysis, we selected all potentially novel SNP-phenotype associations at P < 1 × 10⁻³ in either the BMI unadjusted (N = 17,944) or the BMI-adjusted (N = 17,215) model amongst South Asians, for further testing. We limited the analysis to the SNPs with the lowest P-values for association with T2D in South Asians, to reduce the burden of multiple testing, and to help prioritise genetic variants that may make a more important contribution in South Asians. We performed fixed effects meta-analyses to combine results from South Asians with data from Europeans within the DIAMANTE consortium (effective sample size = 231,420 and 157,384 for BMI-unadjusted and adjusted models, respectively). We chose to combine data with Europeans, as the major global population group most closely genetically related to South Asians^10,16,28,29. The genetic correlation between South Asians and Europeans for T2D is 0.89 (SE: 0.06), which is higher than reported between East Asians and Europeans at 0.62 (SE: 0.09)³⁰, providing further support for this choice. We found that 218 and 185 novel SNPs across 14 and 11 loci reached genome-wide significance (P = 4.7 × 10⁻⁸ to 5.2 × 10⁻¹²) in the BMI-unadjusted and adjusted models, respectively, (Supplementary Data 5 and 6). Together these represent 21 novel genetic loci, of which 12 showed a high degree of statistical stringency (P < 5 × 10⁻⁹). Sentinel SNP for each locus was defined as the SNP with the lowest p-values within a ±500kb region. Conditional analysis confirms the 21 sentinel SNPs to be associated with T2D, independent of previously reported T2D genetic relationships (Supplementary Data 7; Table 1). Regional plots for the 21 loci are shown in Supplementary Figs. 3–44. Whilst the majority of our newly identified associations represent common genetic variants with modest effect size (OR for T2D < 1.2), we also identify a low-frequency genetic variant (rs76141923), which has a moderately-high risk on T2D (OR = 1.40; 95% CI: 1.26–1.57, minor allele frequency in SAS: 1%). There was no evidence for heterogeneity between the results for the association of the genetic variants with T2D between the analyses with and without BMI adjustment (all P > 0.003; Bonferroni corrected for 21 loci; Supplementary Data 8). Sex-specific analysis did not reveal any additional significant findings (Supplementary Data 9).

Table 1 Genetic loci newly identified to be associated with Type 2 Diabetes (T2D).

Full size table

We used Genome-wide Complex Trait Analysis (GCTA) of the results for South Asians to seek for additional independent signals at the 21 novel loci; there were no signals that reached a locus-wide significance threshold of P < 1 × 10⁻⁵. The heritability for T2D in South Asians is estimated at ~12% by GREML in GCTA using our largest available cohort (LOLIPOP-IA610). The total T2D (trait) variance explained by the independent T2D-associated loci (both known and novel) after COJO was estimated at 10.6%. Taking a genome-wide approach, heritability was estimated at 35 and 50% in the IA317 and IA610 South Asian datasets, respectively. This is similar to that reported in Europeans^31,32.

Cross-ancestry comparisons

We next tested our 21 novel genetic loci for association with T2D in East Asian, African and Hispanic populations, ethnic groups not included in the discovery meta-analysis. Effective sample sizes in East Asians were 211,793 and 135,780 for the BMI-unadjusted and adjusted analyses, respectively, providing high statistical power (>90%) to identify the effect sizes observed in our South Asian samples, at P = 0.05. We found that 12 of our sentinel SNPs replicated in East Asians in the BMI-unadjusted model (P < 0.05 and same direction of effect, Supplementary Data 10). Amongst the nine sentinel SNPs that did not replicate, three show evidence for heterogeneity of effect (Supplementary Figs. 45–48), while four are low frequency or monomorphic, in East Asians. Findings were similar in the BMI-adjusted model (Supplementary Data 11; Supplementary Figs. 46–49). Effective sample sizes in African and Hispanic populations were lower (BMI unadjusted: N = 36,991 and 32,037; BMI adjusted N = 36,345 and 20,499, respectively). Although power to replicate the associations identified in South Asians ranged from 30 to 95% (mean 51%, Supplementary Fig. 50), we found that only one of the sentinel SNPs replicated in Africans, while none replicated in Hispanics (Supplementary Figs. 45–48).

There was limited evidence for heterogeneity of effect on T2D, between South Asians and Europeans at the 21 novel genetic loci (Supplementary Data 10 and 11, Supplementary Figs. 45–48); this is not unexpected in light of our strategy comprising meta-analysis of data from the two populations as the basis for discovery. We also find no evidence for heterogeneity of effect on the risk of sentinel SNPS on risk of T2D between South Asians and Europeans, at 237 (82%) of the 288 genetic loci reported to be associated with T2D (Supplementary Data 12 and 13, Supplementary Figs. 51 and 52). Taken together, the above results suggest that there is little heterogeneity between Europeans and South Asians, which supports out approach of meta-analysis across these population groups, to maximise power for the identification of shared effects. To extend on this analysis, we investigated all SNPs at P < 1 × 10⁻³ in either the BMI unadjusted (N = 17,944) or the BMI-adjusted (N = 17,215) model in the South Asians (Supplementary Data 14 and 15).

Functional genomic analyses

We identified potential candidate genes based upon proximity (genes within 10 kb of the sentinel SNP), and functional genomic information (gene expression and regulatory DNA methylation) (Table 1). We searched for cis expression quantitative trait loci (eQTLs) using eQTL data generated from whole blood gene expression data from 31,684 individuals by the eQTLGen Consortium³³. We found an enrichment of cis eQTLs among the novel SNPs (P < 1.8 × 10⁻³⁰⁸, compared to expectation under the null hypothesis [1-sample t-test]), with 47 unique significant cis eQTLs present for 16 of the 21 sentinel SNPs or their proxies (r² ≥ 0.8) at a Bonferroni corrected p-value threshold of <6.8 × 10^fto (Table 1; Supplementary Data 16). To investigate the robustness of the identified eQTLs in South Asians, we performed lookups in our eQTL dataset derived from 693 South Asians, where we replicated 37% of the eQTLs across 9 of the loci (16 out of 43 eQTLs available in the South Asian dataset). Among the 63% that did not replicate, none of them were sufficiently powered to detect the effect at 80% power and 5% significance level. We further supplemented our analysis by interrogating islet-specific cis eQTLs^8,34,35. Here we observed 50 unique significant cis eQTLs with 17 of the novel sentinel SNPs (P < 0.05; Supplementary Data 17; enrichment [1-sample t-test] P = 3.0 × 10⁻¹³²), of which 13 replicated those that were found in the eQTLGen analysis, suggesting that the novel loci implicates both generic and tissue-specific eQTLs. In addition, at three of the novel loci, T2D signals colocalized with respective cis-eQTLs for CHMP4B, PDHB and LRIG1 in both BMI-unadjusted and adjusted models (Table 1; Supplementary Data 18 and 19).

To better understand the disturbances in genomic regulation underlying T2D, we tested the association of the sentinel SNPs with DNA methylation, in South Asians (N = 1841). DNA methylation was quantified in peripheral blood by Illumina HumanMethylation450 array³⁶. We found that 19 of the sentinel SNPs have one or more cis methylation quantitative trait loci (methQTL) associations at P < 8.2 × 10⁻⁶ (P < 0.05 after Bonferroni correction for the 6,065 SNP-CpG association tests; Supplementary Data 20), which represents a ~2-fold enrichment in cis methQTLs observed (enrichment [1-sample t-test]: P < 1.8 × 10⁻³⁰⁸). All methQTLs replicated in independent testing in 1354 South Asians (P < 0.05 with Bonferroni correction and consistent direction of effect; Supplementary Data 21). We mapped methQTLs to potential candidate genes using Illumina annotation files, supplemented by gene proximity, and noted that at nine loci, although we found both significant eQTLs and methQTLs with our sentinel SNPs, there was no evidence for colocalization observed (Table 1; Supplementary Data 22). In part, this may reflect the lower sample size available for functional genomic studies in South Asians.

Functional annotation and cross-trait association

Detailed functional annotation for sentinel SNPs at the 21 novel loci is provided in Supplementary Data 23. VEP (Variant Effect Predictor) was used to annotate the sentinel SNPs and their proxies (r² > 0.8) for nonsynonymous or splice site variations, and the presence of transcription factor binding sites (TFBS)³⁷. None of the SNPs or their proxies was nonsynonymous. Two were splice site variants, seven SNPs overlap known TFBS and SNPs at four loci overlap CpG islands (Supplementary Data 23).

We used HaploReg v4.1 (Broad Institute) to identify SNPs overlapping regulatory regions (protein binding and regulatory motifs, promoter and enhancer histone marks, and DNase I hypersensitive sites (DHS))³⁸. There was significant enrichment for SNPs and/or proxies for promoter histone marks in at least one tissue (n = 13; P = 1.92 × 10⁻¹³⁰), as well as SNPs or proxies overlapping with enhancer histone marks or DHS in at least one tissue (n = 19; P = 2.16 × 10⁻⁴⁵ and n = 17; P = 1.52 × 10⁻¹⁹, respectively). PhenoScanner, an on-line tool which searches 88 complex trait GWAMAs and three GWAS catalogues, was used to annotate the 21 novel loci for association with other complex traits (P < 1 × 10⁻⁵)^39,40. As has been observed previously, we found associations with numerous metabolic phenotypes such as systolic and diastolic blood pressure/hypertension, waist circumference and BMI (Supplementary Data 24)^{41,42,43,44,45,46}.

Polygenic risk scores for risk stratification of T2D in South Asians

Studies in predominantly European populations have demonstrated the potential for polygenic risk scores to identify people at high risk of complex disease, who may show greater benefit from interventions to prevent disease development^47,48. To advance the use of genomic information for risk stratification in South Asian populations, who are at high risk of T2D and other major chronic diseases, we used the LDpred algorithm to derive a polygenic risk score for T2D. We used South Asian summary statistics from the current meta-analysis to derive a South Asian specific polygenic risk score (SA-PRS) for T2D. We compared performance of this model to a polygenic risk for T2D based on European summary statistics from the DIAMANTE consortium (EUR-PRS)⁴⁹. The best-performing SA-PRS was selected based upon an initial validation dataset (IA610; n = 2019 cases and 3696 controls), and identified as the model with maximum area under the receiver-operator curve (AUC; proportion of causal variants, p = 0.01; Supplementary Data 25). For the European-derived summary statistics model, we selected the model previously reported to be optimal with p = 0.01⁴⁷. We replicated model performance of the SA-PRS and EUR-PRS in two independent testing sets (SINDI; n = 974 cases and 1168 controls; LOLIPOP-GSA: n = 1000 cases and controls each).

We found that the SA-PRS model, which is based on population-specific South Asian summary statistics, shows better predictive power for T2D than a risk score based on European summary statistics in both validation and testing (Validation: AUC: 0.62 [95% CI: 0.60–0.63] vs 0.55 [95% CI: 0.54–0.57], P = 1.0E−10; Testing: AUC: 0.59 [95% CI: 0.58–0.61] vs 0.55 [95% CI: 0.53–0.56], P = 2.2E−4, Supplementary Data 26). The prevalence of T2D increased from 19 to 53% across decile bin, with risk increasing as risk score increases (p = 1.6 × 10⁻⁵; trend test). Indeed, the SA-PRS identified the top quartile of our validation population as having 4.03 (95% CI:3.36–4.85; P < 4.5 × 10⁻⁵⁰) higher risk for T2D relative to the bottom quartile (Supplementary Fig. 53). We further showed that there were no significant improvement in AUC using summary statistics obtained from meta-analysis of South Asians and Europeans (Validation AUC: P = 1.000; Testing AUC: P = 0.356; Supplementary Data 26). Our results confirm the potential for genomic information to identify South Asian individuals susceptible to T2D.

Discussion

We carried out a genome-wide association meta-analysis of T2D in South Asians, leveraging power for discovery by inclusion of data from additional European samples, thereby revealing 21 genetic loci newly associated with T2D at the point of analysis. Leading genetic variants at these genetic loci associated with T2D are enriched for location in DNA protein-binding sites, regulatory motifs, DHS, promoter and enhancer histone marks, and for association with gene regulatory methylation marks in blood, and with gene expression in blood and pancreatic islets. At three of the novel loci, we found that T2D signals colocalized with respective cis-eQTLs for CHMP4B, PDHB and LRIG1.

Genetic variants in LRGI1 have been previously shown to be linked to birth weight, blood pressure and cardiovascular endpoints^50,51,52,53, while methylation of CpG sites in LRIG1 has been linked to obesity in children, as well as birthweight and maternal adiposity^54,55,56. Indeed, it has been recently shown that LRIG1 regulates lipid metabolism via BMP signalling, and that LRIG1 variants are strongly associated with increased BMI, but with lower risk of T2D⁵⁷. DHB encodes one of the components of PDH, which catalyses the irreversible oxidative decarboxylation of pyruvate into acetyl-CoA and reduces NAD+ to NADH. Levels of PDH impact upon the proportion of energy supplied by glucose relative to that from other energy-producing molecules such as lipids and amino acids, thereby playing a key role in the development of diabetes⁵⁸. At the same time, variants in PDHB have been linked to BMI-adjusted waist circumference and LDL cholesterol levels^59,60.

For the nine genes where we found both significant eQTLs and methQTLs with our sentinel SNPs, we did not observe significant colocalization. This may reflect the choice of blood as the tissue for investigation, or poor coverage of the methylation markers, given that the methylation array is sparse and covers only ~2% of the methylome, and therefore do not identify the most important causal methylation site. In addition, despite being one of the largest DNA methylation datasets available, the sample size for the DNA methylation analysis is ~3200, which may limit the power of the colocalization experiment.

Although our observations support the view that the genetic variants we identify to be associated with T2D are likely to be of functionally importance, and operate primarily through an effect on genomic regulation, which is consistent with findings from other large-scale genetic association studies^9,61,62, there is no direct evidence that DNA methylation is part of the casual chain leading to T2D.

Our study includes data from 50,533 South Asians; this represents the largest study of T2D genetics in this population to date. We leverage discovery through combined analysis of our leading results with data from a parallel study of 898,130 Europeans, confirming the utility of combining results from different ethnic groups through increased sample size. We also used data from large-scale GWAS of T2D in East Asian, African and Hispanic populations to examine whether our associations are cosmopolitan or specific. We show that many of the sentinel SNPs are not associated with T2D in East Asians, despite their large sample size and high statistical power. In most instances, the sentinel SNPs which appear specific to South Asians are low frequency, rare or even monomorphic in East Asians. Similarly, only a minority of our sentinel SNPs were associated with T2D in African and Mexican populations. Our results provide further support for the presence of population-specific genetic variation contributing to chronic disease, and the strong rationale for large-scale studies focused on this ethnic group that have at least equivalent sample size to current efforts in Europeans. Such studies will be an important source of discovery, and will help to reverse the current bias in genomic research towards investigations of predominantly European populations²⁷.

We also investigate the application of polygenic risk scores for the prediction of T2D in South Asian populations. We show that polygenic risk scores based on South Asian specific GWA data are more predictive for T2D than scores based on results from European specific studies. Our South Asian polygenic risk score identifies the top quartile of the population to be at ~4-fold higher risk for T2D relative to the bottom quartile, confirming the potential for genetic risk scores to identify susceptible individuals. In addition, we extend previous observations and show that the polygenic risk scores for prediction of T2D in South Asians, show improved performance when based on South Asian rather than European association test results. This inferior performance of the PRS model based on European data likely reflects heterogeneity of effect, relative to that observed in South Asians.

In summary, we identify 21 novel genetic loci associated with T2D at the point of analysis. Whilst many may be cosmopolitan, several do not appear to be shared by East Asian, African and Hispanic populations. Our results provide further insights into the genetic mechanisms underlying T2D, and reinforce the importance of extending the investigation of susceptibility to T2D and other chronic diseases to non-European populations.

Methods

Statistics and reproducibility

Sixteen cohorts of South Asians with 50,533 participants were included in the analysis (16,677 cases and 33,856 non-prediabetic normal controls). Effective sample size (Neff) was calculated as 4/(1/N_cases + 1/N_controls), where N_cases is the number of cases and N_controls is the number of controls. For genome-wide meta-analyses, two primary models of logistic regression association analyses were carried out with or without BMI adjustment. Age, sex, and cohort-specific covariates were included where applicable, with the first 3–15 study-specific principal components included as covariates to correct for population substructure. No technical replicates were included in the analysis. The current sample size allows us to detect significant associations at P-value < 5 × 10⁻⁸ for SNPs with MAF ≥ 0.05 and odds ratios (OR) ≥ 1.21, or MAF ≥ 0.20 and OR ≥ 1.11 at 80% power, taking an additive disease model.

Cohort information, definition of T2D cases and normoglycaemic controls

A total of 50,533 participants (16,677 cases and 33,856 non-prediabetic normal controls) across sixteen South Asians cohorts were included in the meta-analysis. T2D cases were defined as having any one of the following: medical history of T2D or T2D treatment, fasting plasma glucose concentration ≥7.0 mmol/L, plasma glucose concentration at 2 h of OGTT ≥ 11.1 mmol/L, or HbA1c ≥ 6.5%. Normoglycaemic controls were defined as meeting all of the following criteria (where data are available): no history of T2D or T2D treatment, fasting plasma glucose <6.1 mmol/L, plasma glucose concentration at 2 h of OGTT < 7.8 mmol/L, and HbA1c < 6.0%. Prediabetic participants who are neither T2D cases nor normoglycaemic controls were removed from 12 out of 16 cohorts analysed in this study. All cohort studies were approved by the relevant institutional review boards, and conducted according to the Declaration of Helsinki. All participants of each study provided written informed consent. An overview of the study design and number of novel loci discovered at each analysis stage is available in Supplementary Fig. 54. The list of participating South Asian cohorts and corresponding sample characteristics are available in Supplementary Data 1.

Genotyping and imputation

All study centres performed genome-wide genotyping with standard commercial genotyping platforms. Quality control (QC) of samples and genetic markers, mainly single nucleotide polymorphisms (SNPs), and imputation to 1000 Genomes (1KG) Project Phase 3 cosmopolitan reference haplotypes were conducted at each study centre¹⁶, except for UKBB which was imputed to the Haplotype Reference Consortium (HRC)⁶³. Markers in UKBB that did not overlap with 1KG were removed from UKBB. Standard QC for samples included removing samples with low call rate (e.g. <95%), extreme heterozygosity, mismatch of sex, and those of duplicates, relatedness, and population outliers. QC for variants included removing variants with low call rate (<95%), Hardy-Weinberg equilibrium P-value < 1 × 10⁻⁶, or minor allele frequency (MAF) < 1%.

Before pre-phasing and imputation, markers were aligned to NCBI build 37 locations of the human genome using the strand files and scripts developed by the University of Oxford. A further check of variants IDs, alleles, and frequencies matching with the 1KG Phase 3 reference panel was then applied using a script developed by the University of Oxford. This stringent QC reduces errors due to strand misalignment and duplicates by removing and/or updating SNPs that do not agree in position, allele or frequency with the 1KG relevant ethnic group data. Standard pre-phasing and imputation approaches were applied including pre-phasing using SHAPEIT and imputation using IMPUTE2 or MACH/minimac locally^64,65,66, or using the Michigan Imputation Server for pre-phasing and imputation. Imputation quality was examined, and allele frequencies were checked against the 1KG reference panel for South Asian population, and those significantly deviating from reference population (>0.20) were examined and any quality issue solved. All cohorts passed imputation QC. Association analyses were then performed by each study centre upon satisfactory imputation quality. In total, there were 32,869,683 SNPs available for analysis after quality control.

Genome-wide meta-analyses

Two primary models of logistic regression association analyses were carried out with SNPTEST (for imputation with IMPUTE2), mach2dat (for imputation with MACH/minimach), or other appropriate programs (sex-combined, with or without BMI adjustment)^65,66. Age, sex, and cohort-specific covariates were included where applicable. The first 3–15 study-specific principal components were included as covariates to correct for population substructure. Details of genotyping arrays and imputation are summarised in Supplementary Data 2.

Association summary statistics were filtered for QC before meta-analysis. Criteria for inclusion of a variant in association meta-analysis included imputation info ≥ 0.4 for IMPUTE2 and 0.3 for MACH/minimac^66,67,68, minor allele count (MAC) ≥ 6, P-values for Hardy-Weinberg equilibrium ≥ 1 × 10⁻⁶, standard error (SE) > 0 and <10, and P-values > 0 and ≤1. Meta-analyses combining association summary statistics from all cohorts were performed using the inverse-variance weighted fixed-effect model as implemented in the METAL program⁶⁹. Genomic control (GC) adjustment was applied in each cohort before meta-analysis when the GC factor (lambda) was greater than one.

In sex-combined models with or without BMI adjustment, SNPs with MAF ≥ 0.01, P-values < 1 × 10⁻³, not in a known T2D loci (>500 kb from the known variants previously published), and total sample size exceeding half of the maximum sample size, were included for further testing. This yielded a total of 18,537 and 17,929 potentially novel SNPs for the sex-combined BMI unadjusted or adjusted model at P-values < 1 × 10⁻³, respectively (Supplementary Data 3). The European DIAMANTE consortium data were used for the meta-analyses. Meta-analyses of summary statistics on the above SNPs in South Asians and in European DIAMANTE data were performed with METAL using the inverse-variance weighted fixed effects approach⁶⁹. SNPs with meta-analysis P-values < 5 × 10⁻⁸ were considered statistically significant. Results are summarised in Supplementary Data 5 and 6. Genetic correlation for T2D between South Asians and Europeans were calculated using popcorn³⁰.

Conditional analysis

In an effort to investigate if the newly identified variants were independent of known associations, we performed a conditional regression conditional on known SNPs associated with T2D in the same loci (±1Mb) using the -cojo-cond function in the Genome-wide Complex Trait Analysis (GCTA)—conditional and joint analysis(COJO) program⁷⁰. GCTA version 1.91.0 beta was used in this study. This approach uses the summary statistics from meta-analysis, and linkage disequilibrium data from a GWAS study or reference haplotypes, which we defined as IA610 here. Heritability for T2D was estimated using GREML in GCTA using the same cohort (IA610) as reference.

Detection of distinct association signals

To detect multiple distinct association signals at each novel locus (+/−1Mb region), we performed approximate conditional analyses using GCTA with genome-wide meta-analysis summary statistics from South Asian cohorts in BMI-adjusted or unadjusted models, and with LD estimated from IA610. The cojo-slct option was used for a stepwise model selection to select independently associated SNPs in each region. We used cojo-p 1e-5 to set the threshold p-value to declare a locus-wide significant hit. The default multiple regression R2 (0.9) on the selected SNPs was used for the cut off value to control collinearity. SNPs with allele frequency differences >0.2 between association SNP and the reference SNP were removed from analysis.

Cross-ancestry comparison

We then systematically investigated the association with T2D for these 21 novel loci in other ethnic-specific studies, including participants of East Asian, African-American and Hispanic ancestry. Sentinel SNPs were looked up in both BMI-unadjusted and adjusted models where possible. Results are shown in Supplementary Data 10 and 11.

For East Asians, the T2D meta-analyses were performed with studies participating in the Asian Genetic Epidemiology Network (AGEN), a consortium of genetic epidemiology studies of T2D and related traits conducted in individuals of East Asian ancestry. The first phase of the East Asian meta-analysis included 77,418 T2D cases and 356,122 controls from 20 GWAS and three biobanks, China Kadoorie Biobank (CKB), Korea Biobank Array (KBA), and Biobank Japan (BBJ) (N_eff = 211,793)^71,72,73,74. In Phase 2 of the meta-analysis, a subset from Phase 1 (56,267 T2D cases and 227,155 controls; N_eff = 139,701) were analysed in BMI-adjusted and sex-specific models. Included studies were genotyped on either commercially available or customised Affymetrix or Illumina genome-wide genotyping arrays. Array quality control criteria implemented within each study, including variant call rate and Hardy-Weinberg equilibrium. The genotype scaffold for each study was then imputed to the 1000 Genomes Phase 1 or 3 reference panel using minimac3 or IMPUTEv2^16,67,75,76. In Phase 1, all studies were imputed to 1000 Genomes Phase 3. In Phase 2, all studies were imputed to 1000 Genomes Phase 3 except for Biobank Japan (BBJ), which was imputed to the 1000 Genomes Phase 1 reference panel⁷⁶. Institutional review boards approved all study protocols at their respective sites, and written informed consent was obtained from all participants.

Data of African–American ancestry was based upon the MEDIA (Meta-analysis of Type 2 Diabetes in African Americans) Consortium, a collaborative effort to combine T2D GWAS data from individuals of African ancestry. The current study includes 18 GWAS cohorts (15,043 T2D cases and 22,318 controls), with genotype data imputed to either 1000 Genomes phase 1 or phase 3.

For the Hispanic/Latino ancestry meta-analyses, 14 studies with a total of 13,151 cases and 21,511 controls were included. The component studies consisted of the BioMe Biobank⁷⁷, the Genetics of Latinos Diabetic Retinopathy study⁷⁸, Hispanic Community Health Study and Study of Latinos⁷⁹, the Mexican-American Hypertension-Insulin Resistance Family Study^80,81,82, Los Angeles Latino Eye Study⁸³, Mexican-American Coronary Artery Disease Study^84,85, Mexico City studies 1 and 2⁸⁶, Multi-Ethnic Study of Atherosclerosis⁸⁷, the Non-Insulin-Dependent Diabetes Mellitus-Atherosclerosis Study⁸⁴, San Antonio Family Heart Study^88,89, the Slim Initiative in Genomic Medicine for the Americas study^17,18,90, Starr County Health Studies⁹¹, and the Women’s Health Initiative study⁹². All studies comprised self-reported Hispanic/Latino individuals with varying country of origin and admixture proportions. Quality control of genome-wide genotype data, imputation to 1000 Genomes Project reference datasets phase 1 or 3^16,76, and association analyses were performed for each study. Results from association analyses adjusted for sex were provided for all studies, and when available, results from additional analyses adjusted for both sex and body mass index, and sex-stratified analyses both with and without adjustment for body mass index were provided for meta-analyses as well. Association results for each study were filtered to remove low-frequency variants (minor allele count <14) and low-quality variants (minimac3 r² < 0.3 or IMPUTE2 info <0.4)^67,75,93. Meta-analysis was performed using MR-MEGA⁹⁴, including one axis of genetic variation as captured by study-level measures of mean allele frequency differences.

To check for heterogeneity between ethnic populations, we performed meta-analyses of summary statistics on the SNPs with METAL using the inverse-variance weighted fixed effects approach.

Effects of BMI adjustment and Sex-differentiated meta-analysis

The difference between BMI-adjusted and unadjusted models was tested using a matched analysis within the same subset of 15 cohorts, where the Z-score is calculated as below to account for correlation between the two models⁶.

$$(\beta -{\beta }_{1})/{{{{{\rm{sqrt}}}}}}({{{{{{\rm{SE}}}}}}}^{2}-{{{{{{{\rm{SE}}}}}}}_{1}}^{2}{-}2\,{\times}\,\rho \,{\times}\,{{{{{\rm{SE}}}}}}\,{\times}\,{{{{{{\rm{SE}}}}}}}_{1}),$$

(1)

where β and β₁ are effect sizes for BMI-unadjusted and adjusted models, respectively, SE and SE₁ are the corresponding standard errors, and ρ is the estimated correlation between β and β₁ obtained from all SNPs from this study (ρ = 0.96).

Sex-differentiated meta-analysis allowing for heterogeneity of allelic effects between males and females was conducted to detect sex-differentiated effects with the GWAMA program⁹⁵.

Expression quantitative trait loci (eQTL) analysis

Using eQTL data generated from whole blood gene expression data from 31,684 individuals by the eQTLGen Consortium³³, we examined the cis-associations of the novel variants and their proxies (r² ≥ 0.8 in South Asians) with expression levels of nearby genes (±1 Mb) in all tissues to explore the potential functionality of novel variants. Additional eQTL lookups were performed for (i) whole blood in South Asians from LOLIPOP (n = 693) and (ii) in pancreatic islets (n = 420) to explore metabolically relevant tissue-specific relationships⁹⁶. We also performed permutation testing to determine the extent of enrichment of cis eQTLs observed. We first determined the number of our novel sentinel SNPs that showed significant cis eQTLs. Expectation under the null hypothesis was then determined by permutation testing using a set of matched background SNPs. SNPs were matched for MAF (±2% in South Asians) and distance to the nearest gene (±10 kb). P-values were calculated by comparing the observed number of SNPs that displayed significant cis eQTLs to the mean expectation under the null hypothesis from permutation testing using a 1-sample t test (N = 1000 permutations of 21 SNPs).

Expression quantitative trait loci (eQTL) analysis in South Asians

Transcriptome-wide measurements of gene expression in blood were undertaken in South Asian (N = 693) participants of the LOLIPOP study, using the Illumina HumanHT-12 v4 BeadChip array according to manufacturer’s protocol. Expression values were summarised to gene level estimates by averaging the log2 transformed expression levels of probes mapping to the same gene. To quantify the relationship between genetic variation and gene expression we first derived residuals for gene expression using linear regression of gene expression levels against sex, age, RNA integrity number, RNA conversion batch and RNA extraction batch. Expression residuals were then used as outcome variables in a linear regression model with SNP dosage as the independent variable, corresponding to the following linear model formulae: Gene ~ SNP + sex + age + RIN + RNA_Conv_Batch + RNA_Extract_Batch. Data analysis was performed using Matrix eQTL⁹⁷.

methQTL analysis

For the discovery phase, we measured DNA methylation in 1,841 South Asians using peripheral blood sample collected at baseline from the London Life Sciences Prospective Population Study (LOLIPOP) (see Cohort Description—Supplementary Note 1); for the replication phase we studied 1354 South Asians using blood sample collected at the follow-up visit. All participants were unrelated. DNA methylation was quantified using the Illumina HumanMethlation450K array according to the manufacturer’s instructions. Bisulfite conversion of genomic DNA was performed using the EZ DNA methylation kit according to the manufacturer’s instructions (Zymo Research, Orange, CA). Bead intensity was retrieved using the minfi software package, and a detection P value of <10⁻¹⁶ was used for marker calling. Of the 486,427 positions assayed by the array, we excluded markers with call rates <98% (N = 4,684), or that do not measure methylation at CpG sites (N = 4006). This left 466,186 autosomal and 11,329 sex chromosome markers for analysis. Markers reported to be cross-hybridising were retained but flagged. Samples with gender inconsistency and/or with low marker call rate (<98%) were filtered⁹⁸. Genotyping was done with a combination of Illumina genotyping arrays (HumanHap300, Human-Hap610, OmniExpress and OmniExomeExpress). Genotypes were called with Illumina Genome Studio and imputation performed using the IMPUTEv2 software package and 1000 Genomes Project cosmopolitan reference panel (ALL_1000G_phase1integrated_v3_impute_macGT1). Standard GWAS quality control criteria were applied, including filtering for call-rate, minor allele frequency, info score and Hardy–Weinberg equilibrium.

Data normalisation was performed separately within each cohort and percentage methylation at each CpG site was calculated. Residuals were then derived from a linear regression of the percentage methylation (outcome) with technical and clinical predictors. These include age, sex, estimates of white-blood cell subpopulations and principal components of control-probe intensities. Residuals were used as outcomes in association testing. Association testing for the discovery stage was performed separately for each study population and genotyping platform using QuickTest. Summary statistics were combined by fixed-effect meta-analysis using METAL⁶⁹. Statistical significance was set to P < 7.7 × 10⁻⁶, which corresponds to P < 0.05 after Bonferroni correction for the 6,494 SNP-CpG association tests.

We also performed permutation testing to determine the extent of enrichment of cis methQTLs observed. We first determined the number of our novel sentinel SNPs that showed significant cis methQTLs. Expectations under the null hypothesis was then determined by permutation testing using a set of matched background SNPs. SNPs were matched for MAF (±2% in South Asians) and distance to the nearest gene (±10 kb). P-values were calculated by comparing the observed number of SNPs that displayed significant cis methQTLs to the mean expectation under the null hypothesis from permutation testing using a 1-sample t-test (N = 1,000 permutations of 21 SNPs).

Colocalization analysis

We used coloc 2.3-1 to check for colocalization of the T2D association signals at our 21 loci with eQTL signals from eQTLgen⁹⁹. For each locus, we examined all SNPs available in both datasets within 1 Mb of the sentinel SNP identified in our T2D GWAS, and ran coloc.abf with default parameters and priors. The summary statistics from the joint meta-GWAS of South Asians and Europeans were used for the SNP-T2D axis. We also checked for colocalization between eQTL and methQTL associations. We considered there to be sufficient evidence for colocalization when coloc PP4 > 0.6 (posterior probability for shared underlying causal variant).

Functional annotation and cross-trait associations

We annotated the sentinel SNPs and their proxies for regulatory regions (promoter and enhancer histone marks, DNase I hypersensitivity, protein binding and regulatory motifs) with HaploRegv4.1 (Broad Institute) using the haploR package in R (version 3.6.0)³⁸. VEP (Variant Effect Predictor) was used for the identification of transcription factor binding sites and nonsynonymous and splicing variants³⁷. EpiExplorer and the UCSC Genome Browser were used to annotate CpG islands¹⁰⁰. We performed permutation testing to determine the degree of enrichment for various regulatory regions. We first determined the number of novel sentinel SNPs that were annotated for the respective features, followed by generating expectations under the null hypothesis by permutation testing using a set of matched background SNPs. SNPs are matched for MAF (±2% in South Asians) and distance to the nearest gene (±10 kb). P-values were calculated by comparing the observed number of SNPs that were annotated for the respective features to the mean expectation under the null hypothesis from permutation testing using a 1-sample t-test (N = 1000 permutations of 21 SNPs).

Polygenic risk score

For PRS derivation, we applied the LDpred algorithm, a Bayesian approach that calculates posterior mean effect for all variants based on a prior (effect size and level of statistical significance in the prior GWAS) and subsequent shrinkage based on linkage disequilibrium (LD) with other variants nearby⁴⁹. DNA polymorphisms with ambiguous strand (A/T or C/G) were removed from the score derivation.

We developed two models for PRS derivation, one based upon South Asian-derived summary statistics (current study) and another based upon European-derived summary statistics for T2D associations (DIAMANTE). For the South Asian-derived summary statistics, we reperformed our genome-wide meta-analysis across only 14 cohorts, excluding two cohorts to be utilised as validation and independent testing series for LDpred (validation: IA610; n = 2,019 cases and 3696 controls and testing: SINDI: n = 974 cases and 1168 controls). A third independent cohort was also used for testing (LOLIPOP_GSA: n = 900 cases and 919 controls). South Asians and Europeans from the 1000 Genomes Phase 3 were used as the reference panel for LD calculation for the South Asian- and European-derived PRS models, respectively.

LD radius was set to M/3000 (default value in LDpred) in both cases, whereby M is the total number of SNPs used in the corresponding analysis. For the tuning parameter p, which refers to the proportion of variants assumed to be causal (non-zero effects), a range of values were tested (1, 0.3, 0.1, 0.03, 0.01, 0.003, 0.001, 0.0003, 0.0001) and the SNP weights generated used to calculate PRS. The optimal PRS for the South Asian-derived summary statistics model was chosen according to the area under the receiver-operator curve (AUC) based upon a logistic regression model⁴⁷. For the European-derived summary statistics model, we selected the model with p = 0.01, as previously reported to be optimum⁴⁷. The proportion of variance explained was calculated for each model using Nagelkerke’s pseudo-R2 metric. To plot the risk gradient, we first divided the validation population into 10 bins according to decile of the PRS, and plotted the prevalence of T2D within each bin.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Summary statistics data is available via GWAS catalogue. Study Accession IDs are GCST90093109 and GCST90093110 for the BMI-unadjusted and BMI-adjusted models, respectively. Full summary statistics for cis eQTL analyses were downloaded from eQTLGen website (https://eqtlgen.org/cis-eqtls.html).

Change history

05 May 2022
A Correction to this paper has been published: https://doi.org/10.1038/s42003-022-03404-x

References

International Diabetes Federation. IDF Diabetes Atlas 8th edn. (International Diabetes Federation, 2017).
United Nations. World Population Prospects: The 2019 Revision (United Nations, 2019).
Kooner, J. S. et al. Genome-wide association study in individuals of South Asian ancestry identifies six new type 2 diabetes susceptibility loci. Nat. Genet. 43, 984–989 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sattar, N. & Gill, J. M. Type 2 diabetes in migrant south Asians: mechanisms, mitigation, and management. Lancet Diabetes Endocrinol. 3, 1004–1016 (2015).
Article PubMed Google Scholar
Zheng, Y., Ley, S. H. & Hu, F. B. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 14, 88–98 (2018).
Article PubMed Google Scholar
Scott, R. A. et al. An expanded genome-wide association study of type 2 diabetes in Europeans. Diabetes 66, 2888–2902 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fuchsberger, C. et al. The genetic architecture of type 2 diabetes. Nature 536, 41–47 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mahajan, A. et al. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes. Nat. Genet. 50, 559–571 (2018).
Article CAS PubMed PubMed Central Google Scholar
Xue, A. et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat. Commun. 9, 2941 (2018).
Article PubMed PubMed Central Google Scholar
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Article CAS PubMed PubMed Central Google Scholar
Spracklen, C. N. et al. Identification of type 2 diabetes loci in 433,540 East Asian individuals. Nature 582, 240–245 (2020).
Article CAS PubMed PubMed Central Google Scholar
Goodarzi, M. O. & Rotter, J. I. Genetics insights in the relationship between type 2 diabetes and coronary heart disease. Circ. Res. 126, 1526–1548 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kwak, S. H. et al. Nonsynonymous variants in PAX4 and GLP1R are associated with type 2. Diabetes East Asian Popul. Diabetes 67, 1892–1902 (2018).
CAS Google Scholar
Cho, Y. S. et al. Meta-analysis of genome-wide association studies identifies eight new loci for type 2 diabetes in east Asians. Nat. Genet. 44, 67–72 (2011).
Article PubMed PubMed Central Google Scholar
Zhao, W. et al. Identification of new susceptibility loci for type 2 diabetes and shared etiological pathways with coronary heart disease. Nat. Genet. 49, 1450–1457 (2017).
Article CAS PubMed PubMed Central Google Scholar
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article PubMed Google Scholar
Sigma Type 2 Diabetes Consortium. et al. Association of a low-frequency variant in HNF1A with type 2 diabetes in a Latino population. JAMA 311, 2305–2314 (2014).
Article Google Scholar
Sigma Type 2 Diabetes Consortium. et al. Sequence variants in SLC16A11 are a common risk factor for type 2 diabetes in Mexico. Nature 506, 97–101 (2014).
Article Google Scholar
Spracklen, C. N. et al. Identification of type 2 diabetes loci in 433,540 East Asian individuals. Nature Under review.
GenomeAsia 100K consortium. The GenomeAsia 100K Project: Enabling Genetic Discoveries Across Asia. Nature Accepted.
Wu, D. et al. Large-scale whole-genome sequencing of three diverse Asian populations in Singapore. Cell Accepted.
Risch, N., Tang, H., Katzenstein, H. & Ekstein, J. Geographic distribution of disease mutations in the Ashkenazi Jewish population supports genetic drift over selection. Am. J. Hum. Genet. 72, 812–822 (2003).
Article CAS PubMed PubMed Central Google Scholar
Lamason, R. L. et al. SLC24A5, a putative cation exchanger, affects pigmentation in zebrafish and humans. Science 310, 1782–1786 (2005).
Article CAS PubMed Google Scholar
Kwiatkowski, D. P. How malaria has affected the human genome and what human genetics can teach us about malaria. Am. J. Hum. Genet. 77, 171–192 (2005).
Article CAS PubMed PubMed Central Google Scholar
Sabeti, P. C. et al. Positive natural selection in the human lineage. Science 312, 1614–1620 (2006).
Article CAS PubMed Google Scholar
Bersaglieri, T. et al. Genetic signatures of strong recent positive selection at the lactase gene. Am. J. Hum. Genet. 74, 1111–1120 (2004).
Article CAS PubMed PubMed Central Google Scholar
Popejoy, A. B. & Fullerton, S. M. Genomics is failing on diversity. Nature 538, 161–164 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chambers, J. C. et al. The South Asian genome. PLoS ONE 9, e102645 (2014).
Article PubMed PubMed Central Google Scholar
Wong, L. P. et al. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing. PLoS Genet. 10, e1004377 (2014).
Article PubMed PubMed Central Google Scholar
Brown, B. C. Asian Genetic Epidemiology Network Type 2 Diabetes Consortium, Ye, C. J., Price, A. L. & Zaitlen, N. Transethnic genetic-correlation estimates from summary statistics. Am. J. Hum. Genet. 99, 76–88 (2016).
Avery, A. R. & Duncan, G. E. Heritability of type 2 diabetes in the washington state twin registry. Twin Res. Hum. Genet. 22, 95–98 (2019).
Article PubMed PubMed Central Google Scholar
Willemsen, G. et al. The Concordance and Heritability of Type 2 Diabetes in 34,166 Twin Pairs From International Twin Registers: The Discordant Twin (DISCOTWIN) Consortium. Twin Res. Hum. Genet 18, 762–771 (2015).
Article PubMed Google Scholar
Võsa, U. et al. Unraveling the polygenic architecture of complex traits using blood eQTL meta-analysis. bioRxiv https://doi.org/10.1101/447367 (2018).
Varshney, A. et al. Genetic regulatory signatures underlying islet gene expression and type 2 diabetes. Proc. Natl Acad. Sci. USA 114, 2301–2306 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wood, A. R. et al. A genome-wide association study of IVGTT-based measures of first-phase insulin secretion refines the underlying physiology of type 2 diabetes variants. Diabetes 66, 2296–2309 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bibikova, M. et al. High density DNA methylation array with single CpG site resolution. Genomics 98, 288–295 (2011).
Article CAS PubMed Google Scholar
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
Article PubMed PubMed Central Google Scholar
Ward, L. D. & Kellis, M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 40, D930–D934 (2012).
Article CAS PubMed Google Scholar
Staley, J. R. et al. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics 32, 3207–3209 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kamat, M. A. et al. PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics https://doi.org/10.1093/bioinformatics/btz469 (2019).
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
Article CAS PubMed PubMed Central Google Scholar
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Justice, A. E. et al. Genome-wide meta-analysis of 241,258 adults accounting for smoking behaviour identifies novel loci for obesity traits. Nat. Commun. 8, 14977 (2017).
Article PubMed PubMed Central Google Scholar
Akiyama, M. et al. Genome-wide association study identifies 112 new loci for body mass index in the Japanese population. Nat. Genet. 49, 1458–1467 (2017).
Article CAS PubMed Google Scholar
Randall, J. C. et al. Sex-stratified genome-wide association studies including 270,000 individuals show sexual dimorphism in genetic loci for anthropometric traits. PLoS Genet. 9, e1003500 (2013).
Article CAS PubMed PubMed Central Google Scholar
Speliotes, E. K. et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42, 937–948 (2010).
Article CAS PubMed PubMed Central Google Scholar
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kathiresan, S. et al. Polymorphisms associated with cholesterol and risk of cardiovascular events. N. Engl. J. Med. 358, 1240–1249 (2008).
Article CAS PubMed Google Scholar
Vilhjalmsson, B. J. et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576–592 (2015).
Article CAS PubMed PubMed Central Google Scholar
Warrington, N. M. et al. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat. Genet. 51, 804–814 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kichaev, G. et al. Leveraging polygenic functional enrichment to improve GWAS power. Am. J. Hum. Genet. 104, 65–75 (2019).
Article CAS PubMed Google Scholar
Evans, D. S. et al. Fine-mapping, novel loci identification, and SNP association transferability in a genome-wide association study of QRS duration in African Americans. Hum. Mol. Genet. 25, 4350–4368 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sotoodehnia, N. et al. Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction. Nat. Genet. 42, 1068–1076 (2010).
Article CAS PubMed PubMed Central Google Scholar
Sharp, G. C. et al. Maternal BMI at the start of pregnancy and offspring epigenome-wide DNA methylation: findings from the pregnancy and childhood epigenetics (PACE) consortium. Hum. Mol. Genet. 26, 4067–4085 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kupers, L. K. et al. Meta-analysis of epigenome-wide association studies in neonates reveals widespread differential DNA methylation associated with birthweight. Nat. Commun. 10, 1893 (2019).
Article PubMed PubMed Central Google Scholar
Huang, R. C. et al. Genome-wide methylation analysis identifies differentially methylated CpG loci associated with severe obesity in childhood. Epigenetics 10, 995–1005 (2015).
Article CAS PubMed PubMed Central Google Scholar
Herdenberg, C. et al. LRIG proteins regulate lipid metabolism via BMP signaling and affect the risk of type 2 diabetes. Commun Biol 4, 90 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, X. et al. Conditional knockout of pyruvate dehydrogenase in mouse pancreatic betacells causes morphological and functional changes. Mol. Med. Rep. 21, 1717–1726 (2020).
CAS PubMed PubMed Central Google Scholar
Christakoudi, S., Evangelou, E., Riboli, E. & Tsilidis, K. K. GWAS of allometric body-shape indices in UK Biobank identifies loci suggesting associations with morphogenesis, organogenesis, adrenal cell renewal and cancer. Sci. Rep. 11, 10688 (2021).
Article CAS PubMed PubMed Central Google Scholar
Richardson, T. G. et al. Evaluating the relationship between circulating lipoprotein lipids and apolipoproteins with risk of coronary heart disease: a multivariable Mendelian randomisation analysis. PLoS Med. 17, e1003062 (2020).
Article PubMed PubMed Central Google Scholar
Gallagher, M. D. & Chen-Plotkin, A. S. The Post-GWAS era: from association to function. Am. J. Hum. Genet. 102, 717–730 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kato, N. et al. Trans-ancestry genome-wide association study identifies 12 genetic loci influencing blood pressure and implicates a role for DNA methylation. Nat. Genet. 47, 1282–1293 (2015).
Article CAS PubMed PubMed Central Google Scholar
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
Article CAS PubMed PubMed Central Google Scholar
Delaneau, O., Marchini, J., Genomes Project, C. & Genomes Project, C. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel. Nat. Commun. 5, 3934 (2014).
Article CAS PubMed Google Scholar
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet 39, 906–913 (2007).
Article CAS PubMed Google Scholar
Li, Y., Willer, C. J., Ding, J., Scheet, P. & Abecasis, G. R. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol. 34, 816–834 (2010).
Article PubMed PubMed Central Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article PubMed PubMed Central Google Scholar
Howie, B., Fuchsberger, C., Stephens, M., Marchini, J. & Abecasis, G. R. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat. Genet. 44, 955–959 (2012).
Article CAS PubMed PubMed Central Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–S363 (2013).
Article Google Scholar
Suzuki, K. et al. Identification of 28 new susceptibility loci for type 2 diabetes in the Japanese population. Nat. Genet. 51, 379–386 (2019).
Article CAS PubMed Google Scholar
Kim, Y. & Han, B. G., Ko, G. E. S. g. Cohort profile: The Korean Genome and Epidemiology Study (KoGES) Consortium. Int J. Epidemiol. 46, 1350 (2017).
Article CAS PubMed PubMed Central Google Scholar
Moon, S. et al. The Korea Biobank Array: design and identification of coding variants associated with blood biochemical traits. Sci. Rep. 9, 1382 (2019).
Article PubMed PubMed Central Google Scholar
Imamura, M. et al. Genome-wide association studies in the Japanese population identify seven novel loci for type 2 diabetes. Nat. Commun. 7, 10531 (2016).
Article CAS PubMed PubMed Central Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Article PubMed Google Scholar
Tayo, B. O. et al. Genetic background of patients from a university medical center in Manhattan: implications for personalized medicine. PLoS ONE 6, e19166 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kuo, J. Z. et al. Systemic soluble tumor necrosis factor receptors 1 and 2 are associated with severity of diabetic retinopathy in Hispanics. Ophthalmology 119, 1041–1046 (2012).
Article PubMed Google Scholar
Qi, Q. et al. Genetics of type 2 diabetes in U.S. Hispanic/Latino individuals: results from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). Diabetes 66, 1419–1425 (2017).
Article CAS PubMed PubMed Central Google Scholar
Xiang, A. H. et al. Evidence for joint genetic control of insulin sensitivity and systolic blood pressure in hispanic families with a hypertensive proband. Circulation 103, 78–83 (2001).
Article CAS PubMed Google Scholar
Cheng, L. S. et al. Coincident linkage of fasting plasma insulin and blood pressure to chromosome 7q in hypertensive hispanic families. Circulation 104, 1255–1260 (2001).
Article CAS PubMed Google Scholar
Pojoga, L. H. et al. Variants of the caveolin-1 gene: a translational investigation linking insulin resistance and hypertension. J. Clin. Endocrinol. Metab. 96, E1288–E1292 (2011).
Article CAS PubMed PubMed Central Google Scholar
Varma, R. et al. Prevalence of diabetic retinopathy in adult Latinos: the Los Angeles Latino eye study. Ophthalmology 111, 1298–1306 (2004).
Article PubMed Google Scholar
Palmer, N. D. et al. Genetic variants associated with quantitative glucose homeostasis traits translate to type 2 diabetes in Mexican Americans: The GUARDIAN (Genetics Underlying Diabetes in Hispanics) Consortium. Diabetes 64, 1853–1866 (2015).
Article CAS PubMed Google Scholar
Goodarzi, M. O. et al. Determination and use of haplotypes: ethnic comparison and association of the lipoprotein lipase gene and coronary artery disease in Mexican-Americans. Genet Med 5, 322–327 (2003).
Article CAS PubMed Google Scholar
Parra, E. J. et al. Genome-wide association study of type 2 diabetes in a sample from Mexico City and a meta-analysis of a Mexican-American sample from Starr County, Texas. Diabetologia 54, 2038–2046 (2011).
Article CAS PubMed Google Scholar
Bild, D. E. et al. Multi-ethnic study of atherosclerosis: objectives and design. Am. J. Epidemiol. 156, 871–881 (2002).
Article PubMed Google Scholar
MacCluer, J. W. et al. Genetics of atherosclerosis risk factors in Mexican Americans. Nutr. Rev. 57, S59–S65 (1999).
Article CAS PubMed Google Scholar
Cai, G. et al. Genome-wide scans reveal quantitative trait Loci on 8p and 13q related to insulin action and glucose metabolism: the San Antonio Family Heart Study. Diabetes 53, 1369–1374 (2004).
Article CAS PubMed Google Scholar
Mercader, J. M. et al. A loss-of-function splice acceptor variant in IGF2 is protective for type 2 diabetes. Diabetes 66, 2903–2914 (2017).
Article CAS PubMed PubMed Central Google Scholar
Below, J. E. et al. Genome-wide association and meta-analysis in populations from Starr County, Texas, and Mexico City identify type 2 diabetes susceptibility loci and enrichment for expression quantitative trait loci in top signals. Diabetologia 54, 2047–2055 (2011).
Article CAS PubMed PubMed Central Google Scholar
Prentice, R. L. et al. Design of the Women’s Health Initiative clinical trial and observational study. The Women’s Health Initiative Study Group. Control Clin. Trials 19, 61–109 (1998).
Article Google Scholar
Howie, B., Marchini, J. & Stephens, M. Genotype imputation with thousands of genomes. G3 (Bethesda) 1, 457–470 (2011).
Article Google Scholar
Magi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
Article CAS PubMed PubMed Central Google Scholar
Magi, R. & Morris, A. P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinform. 11, 288 (2010).
Article Google Scholar
Vinuela, A. et al. Genetic variant effects on gene expression in human pancreatic islets and their implications for T2D. Nat. Commun. 11, 4912 (2020).
Article PubMed PubMed Central Google Scholar
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chambers, J. C. et al. Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study. Lancet Diabetes Endocrinol. 3, 526–534 (2015).
Article CAS PubMed PubMed Central Google Scholar
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet 10, e1004383 (2014).
Article PubMed PubMed Central Google Scholar
Halachev, K., Bast, H., Albrecht, F., Lengauer, T. & Bock, C. EpiExplorer: live exploration and global analysis of large epigenomic datasets. Genome Biol. 13, R96 (2012).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The BPC study was supported by the US National Institute of Environmental Health Sciences Grants #P42 ES10349 and #P30 ES09089. The EpiDREAM study was funded by a grant from the Canadian Institutes of Health Research University Industry competition with partner funding from the GlaxoSmithKline and Sanofi Aventis Global, Sanofi Aventis Canada, Genome Quebec Innovation Centre, Heart and Stroke Foundation of Canada. The GRCCDS study comprises various cohorts that are supported by Council of Scientific Industrial Research (CSIR), Ministry of Science and Technology, Government of India, India and Wellcome Trust, London, UK. We are grateful to the patients and subjects who voluntarily participated in the study. We also thankfully acknowledge other research who have supported the study. The INDICO study was majorly supported by Council of Scientific and Industrial Research (CSIR), Government of India through CARDIOMED project Grant Number: BSC0122 provided to CSIR-Institute of Genomics and Integrative Biology. The study was also partially funded by the Department of Science and Technology-PURSE-II (DST/SR/PURSE II/11) given to Jawaharlal Nehru University. We are very much thankful to all the volunteers participated in the study. The INTERHEART study was funded by the Canadian Institutes of Health Research, the Heart and Stroke Foundation of Ontario, and the International Clinical Epidemiology Network (INCLEN); unrestricted grants from several pharmaceutical companies (with major contributions from AstraZeneca, Novartis, Hoechst Marion Roussel [now Aventis], Knoll Pharmaceuticals [now Abbott], Bristol-Myers Squibb, King Pharma, and Sanofi-Synthelabo); and various national bodies in different countries (see Online Appendix; http://image.thelancet.com/extras/04art8001webappendix2.pdf). The LOLIPOP study is supported by the National Institute for Health Research (NIHR) Comprehensive Biomedical Research Centre Imperial College Healthcare NHS Trust, the British Heart Foundation (SP/04/002), the Medical Research Council (G0601966, G0700931), the Wellcome Trust (084723/Z/08/Z, 090532 & 098381) the NIHR (RP-PG-0407-10371), the NIHR Official Development Assistance (ODA, award 16/136/68), the European Union FP7 (EpiMigrant, 279143) and H2020 programs (iHealth-T2D, 643774). We acknowledge the support of the MRC-PHE Centre for Environment and Health, and the NIHR Health Protection Research Unit on Health Impact of Environmental Hazards. The work was carried out in part at the NIHR/Wellcome Trust Imperial Clinical Research Facility. The views expressed are those of the author(s) and not necessarily those of the Imperial College Healthcare NHS Trust, the NHS, the NIHR, or the Department of Health. We thank the participants and research staff who made the study possible. J.C.C. is supported by the Singapore Ministry of Health’s National Medical Research Council under its Singapore Translational Research Investigator (STaR) Award (NMRC/STaR/0028/2017). Danish Saleheen (PROMIS) has received funding from NHLBI, NINDS, the British Heart Foundation, Pfizer, Regeneron, Genentech, and Eli Lilly pharmaceuticals. Genotyping in PROMIS was funded by the Wellcome Trust, UK, and Pfizer. Fieldwork in the PROMIS study was supported through funds available to investigators at the Center for Non-Communicable Diseases, Pakistan, and the University of Cambridge, UK. Biomarker assays in PROMIS have been funded through grants awarded by the National Institutes of Health (RC2HL101834 and RC1TW008485) and the Fogarty International (RC1TW008485). The RHS study was supported by the Grant of National Center for Global Health and Medicine (NCGM). The SINDI study is supported by the National Medical Research Council (NMRC), Singapore (grants 0796/2003, 1176/2008, 1149/2008, STaR/0003/2008, 1249/2010, CG/SERI/2010, CIRG/1371/2013, and CIRG/1417/2015), and Biomedical Research Council (BMRC), Singapore (08/1/35/19/550 and 09/1/35/19/616). This work was conducted using the UK Biobank resource under application number 9161. M.M. serves on advisory panels for Pfizer, NovoNordisk, Zoe Global; has received honoraria from Pfizer, NovoNordisk and Eli Lilly; has stock options in Zoe Global; has received research funding from Abbvie, Astra Zeneca, Boehringer Ingelheim, Eli Lilly, Janssen, Merck, NovoNordisk, Pfizer, Roche, Sanofi Aventis, Servier, Takeda. Spracklen CN is supported by American Heart Association Postdoctoral Fellowship 15POST24470131 and 17POST33650016; L.E.P. and J.E.B. are supported by NIH R01AG061351, 5U01DK105554; A.P.M. is supported by NIH U01DK105535. M.C.Y.N. is supported by R01DK066358 and U01DK105556.

Author information

These authors contributed equally: Marie Loh, Weihua Zhang, Jaspal S. Kooner, Danish Saleheen, John C. Chambers.

Authors and Affiliations

Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, 308232, Singapore
Marie Loh, Hong Kiat Ng & John C. Chambers
Department of Epidemiology and Biostatistics, Imperial College London, London, W2 1PG, UK
Marie Loh, Weihua Zhang & John C. Chambers
Department of Cardiology, Ealing Hospital, London North West Healthcare NHS Trust, Middlesex, UB1 3HW, UK
Weihua Zhang, Jaspal S. Kooner & John C. Chambers
Institute of Computational Biology, Deutsches Forschungszentrum für Gesundheit und Umwelt, Helmholtz Zentrum München, 85764, Neuherberg, Germany
Katharina Schmid
Department of Informatics, Technical University of Munich, 85748, Garching bei München, Neuherberg, Germany
Katharina Schmid
Department of Medicine, McMaster University, Hamilton, ON, Canada
Amel Lamri, Hertzel C. Gerstein, Salim Yusuf & Sonia S. Anand
Population Health Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
Amel Lamri, Guillaume Paré, Hertzel C. Gerstein, Salim Yusuf & Sonia S. Anand
The University of Chicago, Biological Sciences Division, Public Health Sciences, 5841 South Maryland Avenue, MC2000, Chicago, IL, 60637, USA
Lin Tong, Farzana Jasmine, Muhammad Kibriya, Mohammad Shahriar & Habibul Ahsan
Genomic Research on Complex diseases, CSIR-Centre for Cellular and Molecular Biology (CSIR-CCMB), Hyderabad, India
Meraj Ahmad, K. Radha Mani & Giriraj R. Chandak
Translational Medicine and Human Genetics, Department of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Jung-Jin Lee
Department of Medicine, Mayo Hospital, Lahore, Pakistan
Jung-Jin Lee & Donald W. Bowden
Center for Genomics and Personalized Medicine Research, Center for Diabetes Research, Wake Forest School of Medicine, Winston-Salem, NC, 37215, USA
Maggie C. Y. Ng
Vanderbilt Genetics Institute, Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Maggie C. Y. Ng, Lauren E. Petty & Jennifer E. Below
Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
Lauren E. Petty & Jennifer E. Below
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
Cassandra N. Spracklen & Karen L. Mohlke
Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA, 01003, USA
Cassandra N. Spracklen
Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
Fumihiko Takeuchi & Norihiro Kato
U Chicago Research Bangladesh, House#4, Road#2b, Sector#4, Uttara, Dhaka, 1230, Bangladesh
Md. Tariqul Islam
Department of Public Health, Faculty of Medicine, University of Kelaniya, Kelaniya, Sri Lanka
Anuradhani Kasturiratne & Ananda R. Wickremasinghe
Department of Pathology and Molecular Medicine, McMaster University, Hamilton, ON, Canada
Guillaume Paré
Academy of Scientific and Innovative Research, CSIR-Institute of Genomics and Integrative Biology Campus, New Delhi, 110020, India
Gauri Prasad & Dwaipayan Bharadwaj
Systems Genomics Laboratory, School of Biotechnology, Jawaharlal Nehru University, New Delhi, 110067, India
Gauri Prasad & Dwaipayan Bharadwaj
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
Miao Ling Chee, Charumathi Sabanayagam, Tien Yin Wong & Ching-Yu Cheng
Department of Medicine, Faculty of Medicine, University of Kelaniya, Kelaniya, Sri Lanka
H. Janaka de Silva
Department of Medicine, McGill University, Montreal, QC, Canada
James C. Engert
Department of Human Genetics, McGill University, Montreal, QC, Canada
James C. Engert
Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
Hertzel C. Gerstein, Salim Yusuf & Sonia S. Anand
Ophthalmology & Visual Sciences Academic Clinical Program (Eye ACP), Duke-NUS Medical School, Singapore, Singapore
Charumathi Sabanayagam, Tien Yin Wong & Ching-Yu Cheng
Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, Pennsylvania, 19104, USA
Marijana Vujkovic
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Tien Yin Wong & Ching-Yu Cheng
Diabetes Unit, King Edward Memorial Hospital and Research Centre, Pune, India
Chittaranjan S. Yajnik
Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, 48109, USA
Michael Boehnke
Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, 37215, USA
Donald W. Bowden
JSS Academy of Health Education of Research, Mysuru, India
Giriraj R. Chandak
Science and Engineering Research Board, Department of Science and Technology, Ministry of Science and technology, Government of India, New Delhi, India
Giriraj R. Chandak
Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
Anubha Mahajan & Mark I. McCarthy
Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
Anubha Mahajan & Mark I. McCarthy
Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
Xueling Sim
Oxford NIHR Biomedical Research Centre, Churchill Hosptial, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
Mark I. McCarthy
Department of Biostatistics, University of Liverpool, Liverpool, L69 3GL, UK
Andrew P. Morris
Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia
Andrew P. Morris
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Andrew P. Morris
Imperial College Healthcare NHS Trust, Imperial College London, London, W12 0HS, UK
Jaspal S. Kooner & John C. Chambers
MRC-PHE Centre for Enviroment and Health, Imperial College London, London, W2 1PG, UK
Jaspal S. Kooner & John C. Chambers
National Heart and Lung Institute, Imperial College London, London, W12 0NN, UK
Jaspal S. Kooner
Center for Non-Communicable Diseases, Karachi, Pakistan
Danish Saleheen
Department of Medicine, Columbia University Irving Medical Center, New York, NY, 10032, USA
Danish Saleheen
Department of Cardiology, Columbia University Irving Medical Center, New York, NY, 10032, USA
Danish Saleheen

Authors

Marie Loh
View author publications
You can also search for this author in PubMed Google Scholar
Weihua Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Kiat Ng
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Schmid
View author publications
You can also search for this author in PubMed Google Scholar
Amel Lamri
View author publications
You can also search for this author in PubMed Google Scholar
Lin Tong
View author publications
You can also search for this author in PubMed Google Scholar
Meraj Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Jung-Jin Lee
View author publications
You can also search for this author in PubMed Google Scholar
Maggie C. Y. Ng
View author publications
You can also search for this author in PubMed Google Scholar
Lauren E. Petty
View author publications
You can also search for this author in PubMed Google Scholar
Cassandra N. Spracklen
View author publications
You can also search for this author in PubMed Google Scholar
Fumihiko Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar
Md. Tariqul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Farzana Jasmine
View author publications
You can also search for this author in PubMed Google Scholar
Anuradhani Kasturiratne
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Kibriya
View author publications
You can also search for this author in PubMed Google Scholar
Karen L. Mohlke
View author publications
You can also search for this author in PubMed Google Scholar
Guillaume Paré
View author publications
You can also search for this author in PubMed Google Scholar
Gauri Prasad
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Shahriar
View author publications
You can also search for this author in PubMed Google Scholar
Miao Ling Chee
View author publications
You can also search for this author in PubMed Google Scholar
H. Janaka de Silva
View author publications
You can also search for this author in PubMed Google Scholar
James C. Engert
View author publications
You can also search for this author in PubMed Google Scholar
Hertzel C. Gerstein
View author publications
You can also search for this author in PubMed Google Scholar
K. Radha Mani
View author publications
You can also search for this author in PubMed Google Scholar
Charumathi Sabanayagam
View author publications
You can also search for this author in PubMed Google Scholar
Marijana Vujkovic
View author publications
You can also search for this author in PubMed Google Scholar
Ananda R. Wickremasinghe
View author publications
You can also search for this author in PubMed Google Scholar
Tien Yin Wong
View author publications
You can also search for this author in PubMed Google Scholar
Chittaranjan S. Yajnik
View author publications
You can also search for this author in PubMed Google Scholar
Salim Yusuf
View author publications
You can also search for this author in PubMed Google Scholar
Habibul Ahsan
View author publications
You can also search for this author in PubMed Google Scholar
Dwaipayan Bharadwaj
View author publications
You can also search for this author in PubMed Google Scholar
Sonia S. Anand
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer E. Below
View author publications
You can also search for this author in PubMed Google Scholar
Michael Boehnke
View author publications
You can also search for this author in PubMed Google Scholar
Donald W. Bowden
View author publications
You can also search for this author in PubMed Google Scholar
Giriraj R. Chandak
View author publications
You can also search for this author in PubMed Google Scholar
Ching-Yu Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Norihiro Kato
View author publications
You can also search for this author in PubMed Google Scholar
Anubha Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Xueling Sim
View author publications
You can also search for this author in PubMed Google Scholar
Mark I. McCarthy
View author publications
You can also search for this author in PubMed Google Scholar
Andrew P. Morris
View author publications
You can also search for this author in PubMed Google Scholar
Jaspal S. Kooner
View author publications
You can also search for this author in PubMed Google Scholar
Danish Saleheen
View author publications
You can also search for this author in PubMed Google Scholar
John C. Chambers
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Participant recruitment, characterisation and data generation. BPC: L.T., M.T.I., F.J., M.K., M.S., H.A.; EpiDREAM: H.C.G., A.L., G.Paré, S.Y., S.S.A.; GRCCDS: M.A., K.R.M., C.S.Y., G.R.C.; INDICO: A.M.; G.Prasad, D.B.; INTERHEART: A.L., G.Paré, J.C.E., S.Y., S.S.A.; LOLIPOP: M.L., W.Z., H.K.N., J.S.K., J.C.C.; PROMIS: J-J L, M.V., D.S.; RHS: F.T., A.K., H.J.deSilva, A.R.W., N.K.; SINDI: M.L.C., C.S., T.Y.W., C-Y.C.; UKBB: A.M., M.I.M. Meta-analysis results/lookup in ethnic populations.: African-American: M.CY.N., D.W.B.; East Asian: C.N.S., K.L.M., X.S., European: M.B., A.M.; Hispanic: L.E.P., J.E.B.; Trans-ethnic: A.P.M Statistical analyses: W.Z., M.L., K.S.; Study design and manuscript writing: M.L., J.S.K., D.S. and J.C.C. All authors have given approval to the final version of the paper.

Corresponding authors

Correspondence to Jaspal S. Kooner, Danish Saleheen or John C. Chambers.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Yang Wu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: George Inglis.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1-26

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Loh, M., Zhang, W., Ng, H.K. et al. Identification of genetic effects underlying type 2 diabetes in South Asian and European populations. Commun Biol 5, 329 (2022). https://doi.org/10.1038/s42003-022-03248-5

Download citation

Received: 06 January 2020
Accepted: 08 March 2022
Published: 07 April 2022
DOI: https://doi.org/10.1038/s42003-022-03248-5

This article is cited by

Parental and child genetic burden of glycaemic dysregulation and early-life cognitive development: an Asian and European prospective cohort study
- Jian Huang
- Michelle Z. L. Kee
- Jonathan Yinhao Huang
Translational Psychiatry (2024)
Lessons and Applications of Omics Research in Diabetes Epidemiology
- Gechang Yu
- Henry C. H. Tam
- Ronald C. W. Ma
Current Diabetes Reports (2024)
The ‘Insertion/Deletion’ Polymorphism, rs4340 and Diabetes Risk: A Pilot Study from a Hospital Cohort
- Manali Shah
- Anjali Gupta
- A. Rosalind Marita
Indian Journal of Clinical Biochemistry (2024)
Genetics and epigenetics of diabetes and its complications in India
- Ankita Priyadarshini
- Riya Madan
- Sadhan Das
Human Genetics (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.