Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study

Cai, Lina; Wheeler, Eleanor; Kerrison, Nicola D.; Luan, Jian’an; Deloukas, Panos; Franks, Paul W.; Amiano, Pilar; Ardanaz, Eva; Bonet, Catalina; Fagherazzi, Guy; Groop, Leif C.; Kaaks, Rudolf; Huerta, José María; Masala, Giovanna; Nilsson, Peter M.; Overvad, Kim; Pala, Valeria; Panico, Salvatore; Rodriguez-Barranco, Miguel; Rolandsson, Olov; Sacerdote, Carlotta; Schulze, Matthias B.; Spijkerman, Annemieke M. W.; Tjonneland, Anne; Tumino, Rosario; van der Schouw, Yvonne T.; Sharp, Stephen J.; Forouhi, Nita G.; Riboli, Elio; McCarthy, Mark I.; Barroso, Inês; Langenberg, Claudia; Wareham, Nicholas J.

doi:10.1038/s41597-020-00716-7

Download PDF

Data Descriptor
Open access
Published: 13 November 2020

Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study

Scientific Data volume 7, Article number: 393 (2020) Cite this article

11k Accesses
17 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Type 2 diabetes (T2D) is a global public health challenge. Whilst the advent of genome-wide association studies has identified >400 genetic variants associated with T2D, our understanding of its biological mechanisms and translational insights is still limited. The EPIC-InterAct project, centred in 8 countries in the European Prospective Investigations into Cancer and Nutrition study, is one of the largest prospective studies of T2D. Established as a nested case-cohort study to investigate the interplay between genetic and lifestyle behavioural factors on the risk of T2D, a total of 12,403 individuals were identified as incident T2D cases, and a representative sub-cohort of 16,154 individuals was selected from a larger cohort of 340,234 participants with a follow-up time of 3.99 million person-years. We describe the results from a genome-wide association analysis between more than 8.9 million SNPs and T2D risk among 22,326 individuals (9,978 cases and 12,348 non-cases) from the EPIC-InterAct study. The summary statistics to be shared provide a valuable resource to facilitate further investigations into the genetics of T2D.

Measurement(s)	type 2 diabetes mellitus
Technology Type(s)	case-cohort study • genome wide association study
Factor Type(s)	genotype dosage • genetic principal components • study centre • Age • Sex
Sample Characteristic - Organism	Homo sapiens
Sample Characteristic - Location	Europe

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12981821

Identification of genetic effects underlying type 2 diabetes in South Asian and European populations

Article Open access 07 April 2022

Identification of type 2 diabetes loci in 433,540 East Asian individuals

Article 06 May 2020

Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation

Article 12 May 2022

Background & Summary

Diabetes is one of the fastest-growing health challenges of the 21^st century. The most common form of diabetes, type 2 diabetes (T2D), is a complex multifactorial disease which can lead to further severe health consequences such as cardiovascular diseases and premature death. In 2019, 463 million people worldwide were living with diabetes according to the International Diabetes Federation, and this number is expected to rise to 700 million by 2045¹. Genome-wide association studies (GWAS) have made considerable progress in identifying genetic risk factors and in providing evidence for more in-depth understanding of the biological and pathological pathways underlying T2D. A recent study performed a meta-analysis of T2D across 32 GWAS of European ancestry participants and identified 243 genome-wide significant loci (403 distinct genetic variants) associated with T2D risk². The summary statistics from this meta-analysis are publicly available; however, the GWAS results for each participating study, including EPIC-InterAct, cannot be acquired easily.

To date, a growing body of comprehensive methods has been developed for downstream analyses of GWAS. Sharing of summary statistics can help enable these analyses, for example, by providing researchers with a more convenient way to look-up genetic association effect estimates to conduct causal inference analyses using methods such as two-sample Mendelian Randomization which assumes samples are non-overlapping^3,4. In addition, sharing GWAS results can help researchers to further their understanding of the shared genetic basis of T2D with other traits of interest, to perform fine-mapping to pinpoint the causal genetic variants or identify genetic loci shared with other risk factors and disease outcomes. Therefore, the aim of this current work was to provide a reference dataset for researchers to utilize in order to conduct further genetic analyses, generate hypotheses and improve understanding of the aetiology, the biological pathways and mechanisms of T2D and related metabolic and cardiovascular diseases.

Methods

Study design and participants

The EPIC-InterAct study is a large-scale prospective study nested in the European Prospective Investigation into Cancer (EPIC) study, facilitating the investigation of genetic and lifestyle factors on the risk of T2D among European populations. A total of 26 research centres located in eight different European countries (France, Italy, Spain, UK, the Netherlands, Germany, Sweden, and Denmark) were included. The study design, sample collection and genotyping have been described in detail previously^5,6.

In brief, the EPIC-InterAct study adopted a nested case-cohort design. A total of 340,234 participants with stored blood and information reported on diabetes status from the wider EPIC study were followed up for 3.99 million person-years. During the follow-up, researchers from participating study centres ascertained and verified 12,403 incident cases of T2D through self-reported history of T2D, doctor diagnosed T2D and diabetes medication use, linkage to primary care registers, secondary care registers, medication use (pharmacy/ drug registers), hospital admissions and mortality data or local and national diabetes and pharmaceutical registers⁵. To select a representative sub-cohort, a total of 16,835 participants were randomly selected at baseline with numbers proportional to the number of participants in each participating centre. Participants with prevalent (n = 548), unknown (n = 129) and post-censoring diabetes status (n = 4) were excluded, with a total of 16,154 diabetes-free individuals remaining in the EPIC-InterAct sub-cohort (Fig. 1).

DNA samples and genotyping platforms

Blood samples were collected at recruitment and stored in liquid nitrogen at the International Agency for Research into Cancer (IARC) in Lyon, France, or in local biorepositories except for Umeå where −80 °C freezers were used. DNA was extracted and quantified, with details of sample handling described elsewhere^5,7.

Available EPIC-InterAct DNA samples were genotyped using two genotyping platforms. A total of 10,023 EPIC-InterAct participants were randomly selected for genome-wide genotyping using the Illumina 660W-Quad BeadChip (Illumina, Inc., San Diego, California) at the Wellcome Trust Sanger Institute with the number of individuals selected per centre being proportional to the percentage of total cases in that centre, except the Danish participants who did not have available DNA samples at the time⁷. Samples were excluded if they had a low call rate (<95.4%), a lack of concordance with previous genotyping results, a mismatch between self-reported sex and the sex inferred from genetic data (X chromosome heterozygosity) or missing data, or they were autosomal heterozygosity outliers, overall array intensity outliers, ethnic outliers (non-European ancestry) or duplicate samples. Related individuals in the Illumina 660 W genotyping array group were identified based on an identity by descent (IBD) pi-hat threshold of 0.1875 (mid-point between second-degree (0.25) and third-degree (0.125) relatives), and those with the largest number of relatives or the lowest call rate were removed preferentially. A total of 9,290 samples genotyped on the Illumina 600 W array passed initial sample quality control (QC).

A total 13,474 individuals from the remaining of EPIC-InterAct samples (including the Danish samples) were genotyped using the Illumina core-exome 12v1 and 24v1 arrays at Cambridge Genomic Services in the Department of Pathology at the University of Cambridge. The two core-exome arrays are very similar; hence the genotype data were merged for further analyses. Following comparable QC procedures as above, a total of 13,202 samples genotyped using the core-exome arrays passed initial sample QC.

Following initial sample QC, an additional 166 participants who had relatives (IBD pi-hat threshold of 0.1875) across the different genotyping arrays (Illumina 660 W vs Illumina core-exome) were excluded, and a total of 22,326 individuals were included in the downstream genetic analyses (Fig. 1; Table 1).

Table 1 Sample size of the EPIC-InterAct T2D GWAS analysis by diabetes outcome status and genotyping array.

Full size table

Genotype imputation

Prior to imputation, single nucleotide polymorphisms (SNPs) were removed if they had Hardy Weinberg p-value < 10⁻⁶ or were not found in the Haplotype Reference Consortium (HRC) reference panel version 1.0⁸, were A/T or G/C with minor allele frequency (MAF) >0.4, had an allele frequency difference >0.2 with the reference panel, or were short insertion-deletion mutations (indels). A total of 553,115 and 366,044 SNPs passed pre-imputation SNP QC in the Illumina 660W-Quad BeadChip and combined Illumina core-exome arrays, respectively. Imputation was performed using the HRC reference panel and IMPUTE v2.3.2 software⁹. Monomorphic and singleton SNPs and those with imputation quality (info) <0.3 were excluded prior to genetic analyses.

Genome-wide association meta-analysis

For genome-wide association analysis of T2D, all 22,326 included individuals in the EPIC-InterAct study were of European ancestry, including 9,978 type 2 diabetes cases (including 616 cases from the sub-cohort) and 12,348 non-cases from the sub-cohort, among whom 9,178 participants were genotyped on the Illumina 660 W array and 13,148 using the core-exome array (Fig. 1; Table 1). The mean follow-up time for the EPIC-InterAct cases included in the analyses was 6.8 years (standard deviation (s.d.) =3.3 years), and 12.2 years (s.d. =2.0 years) for the sub-cohort.

We used logistic regression to test genome-wide associations with T2D, rather than Prentice- weighted Cox regression that takes into account the case-cohort design of EPIC-InterAct. Logistic regression was chosen both for computational efficiency and because it has been shown to have greater power than Prentice-weighted Cox regression to detect SNP-disease associations¹⁰. All T2D incident cases including those from the sub-cohort were coded as ‘1’, and non-cases from the sub-cohort were coded as ‘0’. To estimate the association between T2D and each genetic variant, we performed logistic regression under an additive genetic model, adjusting for age, sex, study centre and the first four genetic principal components to account for population structure using QUICKTEST Version 0.98¹¹. Dummy variables for each study centre (combining the six centres in France due to the small sample size in each French centre) were included in the model to account for the differences between participants from each country and the potential confounding by larger scale relatedness between participants from each study centre. Genome-wide analyses were performed separately for each genotyping array and combined using an inverse variance weighted fixed-effect meta-analysis in METAL¹². The final meta-analysis had an effective sample size¹² of up to 21,924.

Ethics statement

The EPIC-InterAct study was approved by the local ethics committee in the participating countries and the Internal Review Board of the International Agency for Research on Cancer. All participants gave written informed consent. The study was coordinated by the Medical Research Council Epidemiology Unit at the University of Cambridge.

Data Records

Genome-wide association summary statistics from the meta-analysis of T2D in the EPIC-InterAct study and Cox-regression analysis results for the 370 top T2D SNPs from the recently published DIAMANTE study² are available to download from the Dryad Digital Repository (https://doi.org/10.5061/dryad.qnk98sfcg)¹³.

The genome-wide summary statistics are in tab-delimited TXT format, including rsID (based on the HRC reference panel), chromosome, position (using the reference genome GRCh37 (hg19)), effect allele, other allele, frequency of effect allele, effect estimate, standard error of the effect estimate, p-value, assessment of heterogeneity across the two genotyping arrays, total sample size and effective sample size for the SNP.

The Cox-regression analysis results are in tab-delimited TXT format, including MarkerName (hg19), rsID (based on the HRC reference panel), chromosome, position (using the reference genome GRCh37 (hg19)), effect allele, other allele, frequency of effect allele, beta, standard error of beta, hazard ratio (HR), lower-bound of 95% confidence interval (CI) of HR, upper-bound of 95% confidence interval (CI) of HR, p-value, imputation quality, total sample size.

Alternatively, the genome-wide summary statistics data is also available in NHGRI-EBI’s GWAS Catalog with accession ID GCST90006934¹⁴. It can be downloaded via the following ftp link: ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90006934.

In addition, access to individual-level EPIC data is available through the International Agency for Research on Cancer (IARC): https://epic.iarc.fr/access/, where there is a controlled-access repository. A clear and open access request mechanism and data use agreement is in place.

Technical Validation

For the meta-analysis, only SNPs with minor allele frequency (MAF) > 0.5%, imputation information score > 0.4, Hardy-Weinberg Equilibrium p-value > 1 × 10⁻⁶ and association effect standard error < 10 from each genotyping platform were included. After the meta-analysis, 31 SNPs with heterogeneity p-value < 1 × 10⁻⁵ were excluded. A total of 8,924,492 SNPs remained in the shared meta-analysis results. The numbers of genetic variants in each MAF bin are shown in Table 2.

Table 2 Number of SNPs in each minor allele frequency (MAF) bin in the final EPIC-InterAct T2D GWAS meta-analysis result after quality control.

Full size table

The Manhattan plot is shown in Fig. 2. The quantile-quantile plot (Fig. 3) showed no evidence of inflation from confounding or other biases, supported by the LD score regression¹⁵ intercept, which was very close to 1 (1.0054); therefore, no genomic control correction was performed. As a positive control, the top independent genome-wide significant signal from the meta-analysis was the well-established TCF7L2 variant rs7903146¹⁶ (p = 1.30 × 10⁻³⁸).

Because logistic regression may potentially yield inflated effect estimates when applied in a case-cohort study¹⁰, we compared the strength of associations from the GWAS meta-analysis (logistic regression) and Prentice-weighted Cox-regression analyses adjusting for sex, study centre and first four principal components with age as the underlying time-scale variable for established T2D genetic variants. A total of 370 SNPs from the recently published DIAMANTE study² are available in our HRC imputed EPIC-InterAct genotype data. Among these, 175 SNPs with p-value < 0.05 in the EPIC-InterAct meta-analysis results were included in the comparison. The Pearson correlation coefficient between the log of hazard ratios from the Cox-regression model and the log of odds ratios from logistic regression models was 0.98 (p = 3.1 × 10⁻¹²⁶) (Fig. 4), showing the effects are highly comparable.

Code availability

IMPUTE v2.3.2: https://mathgen.stats.ox.ac.uk/impute/impute_v2.html QUICKTEST Version 0.98: http://toby.freeshell.org/software/quicktest.shtml METAL: https://genome.sph.umich.edu/wiki/METAL All other analyses, including the Prentice-weighted Cox-regression analyses, were performed using R 3.4.2¹⁷.

References

International Diabetes Federation. IDF Diabetes Atlas, 9th edn. https://www.diabetesatlas.org (2019).
Mahajan, A. et al. Fine-mapping of an expanded set of type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps Individual study design and principal investigators Europe PMC Funders Group. Nat. Genet. 50, 1505–1513 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pierce, B. L. & Burgess, S. Efficient design for mendelian randomization studies: Subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 178, 1177–1184 (2013).
Article PubMed PubMed Central Google Scholar
Bowden, J., Smith, G. D., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314 (2016).
Article PubMed PubMed Central Google Scholar
The InterAct Consortium. et al. The InterAct Project: an examination of the interaction of genetic and lifestyle factors on the incidence of type 2 diabetes in the EPIC Study. Diabetologia 54, 2272–2282 (2011).
Article PubMed Central Google Scholar
Forouhi, N. G. & Wareham, N. J. The EPIC-InterAct Study: A study of the interplay between genetic and lifestyle behavioral factors on the risk of type 2 diabetes in European populations. Curr. Nutr. Rep. 3, 355–363 (2014).
Article CAS PubMed PubMed Central Google Scholar
Langenberg, C. et al. Gene-lifestyle interaction and type 2 diabetes: the EPIC InterAct Case-Cohort Study. PLoS Med. 11 (2014).
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
Article CAS PubMed PubMed Central Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article PubMed PubMed Central Google Scholar
Staley, J. R. et al. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design. Eur. J. Hum. Genet. 25, 854–862 (2017).
Article PubMed PubMed Central Google Scholar
Kutalik, Z. et al. Methods for testing association between uncertain genotypes and quantitative traits. Biostatistics 12, 1–17 (2011).
Article PubMed Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cai, L. et al. EPIC-Data from: Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study. Dryad Digital Repository. https://doi.org/10.5061/dryad.qnk98sfcg (2020).
GWAS Catalog. https://identifiers.org/gcst:GCST90006934 (2020).
Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sladek, R. et al. A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature 445, 881–885 (2007).
Article ADS CAS PubMed Google Scholar
R Core Team. R: A language and environment for statistical computing. (2014).

Download references

Acknowledgements

We thank all EPIC participants and staff for their contribution to the study. We thank Nicola Kerrison (MRC Epidemiology Unit, Cambridge) for managing the data for the InterAct Project and staff from the Laboratory Team, Field Epidemiology Team, and Data Functional Group of the MRC Epidemiology Unit in Cambridge, UK, for carrying out sample preparation, DNA provision and quality control, genotyping, and data-handling work. The funding of the EPIC-InterAct study was provided by the EU FP6 Programme [grant number Integrated Project LSHM_CT_2006_037197]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors acknowledge support from the Medical Research Council Epidemiology Unit (grants MC_UU_12015/1 and MC_UU_12015/5) and Wellcome WT206194.

Author information

Mark I. McCarthy
Present address: 1 DNA Way, South San Francisco, CA, 94080, USA

Authors and Affiliations

MRC Epidemiology Unit, University of Cambridge, Cambridge, United Kingdom
Lina Cai, Eleanor Wheeler, Nicola D. Kerrison, Jian’an Luan, Stephen J. Sharp, Nita G. Forouhi, Claudia Langenberg & Nicholas J. Wareham
Clinical Pharmacology Centre, Queen Mary University of London, London, United Kingdom
Panos Deloukas
Department of Clinical Sciences, Clinical Research Center, Skåne University Hospital, Lund University, 20502, Malmö, Sweden
Paul W. Franks, Leif C. Groop & Peter M. Nilsson
Department of Public Health and Clinical Medicine, Umeå University, 90187, Umeå, Sweden
Paul W. Franks & Olov Rolandsson
Ministry of Health of the Basque Government, Public Health Division of Gipuzkoa, Biodonostia Health Research Institute, Donostia-San Sebastian, Spain
Pilar Amiano
Navarra Public Health Institute, Pamplona, Spain
Eva Ardanaz
IdiSNA, Navarra Institute for Health Research, Pamplona, Spain
Eva Ardanaz
Centro de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
Eva Ardanaz, José María Huerta & Miguel Rodriguez-Barranco
Unit of Nutrition and Cancer, Catalan Institute of Oncology - ICO, Nutrition and Cancer Group, Bellvitge Biomedical Research Institute - IDIBELL, L’Hospitalet de Llobregat, Barcelona, 08908, Spain
Catalina Bonet
Digital Epidemiology and e-Health Research Hub, Department of Population Health, Luxembourg Institute of Health, 1A-B, rue Thomas Edison, L-1445, Strassen, Luxembourg
Guy Fagherazzi
Center of Epidemiology and Population Health UMR 1018, Inserm, Paris South - Paris Saclay University, Gustave Roussy Institute, Villejuif, France
Guy Fagherazzi
Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 581, 69120, Heidelberg, Germany
Rudolf Kaaks
Department of Epidemiology, Murcia Regional Health Council, IMIB-Arrixaca, Murcia, Spain
José María Huerta
Cancer Risk Factors and Life-Style Epidemiology Unit, Institute for Cancer Research, Prevention and Clinical Network - ISPRO, Florence, Italy
Giovanna Masala
Department of Public Health, Aarhus University, Bartholins Allé 2, DK-8000, Aarhus C, Denmark
Kim Overvad
Department of Cardiology, Aalborg University Hospital, Sdr. Skovvej 15, DK-9000, Aalborg, Denmark
Kim Overvad
Epidemiology and Prevention Unit, Fondazione IRCCS Istituto Nazionale dei Tumori di Milano, Milan, Italy
Valeria Pala
Dipartimento di Medicina Clinica e Chirurgia, Federico II University, via Pansini 5, 80131, Naples, Italy
Salvatore Panico
Escuela Andaluza de Salud Pública (EASP), Granada, Spain
Miguel Rodriguez-Barranco
Instituto de Investigación Biosanitaria ibs.Granada, Granada, Spain
Miguel Rodriguez-Barranco
Unit of Cancer Epidemiology, Città della Salute e della Scienza University-Hospital and Center for Cancer Prevention (CPO), Turin, Italy
Carlotta Sacerdote
Department of Molecular Epidemiology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany
Matthias B. Schulze
German Center for Diabetes Research (DZD), Neuherberg, Germany
Matthias B. Schulze
Institute of Nutrition Science, University of Potsdam, Nuthetal, Germany
Matthias B. Schulze
National Institute for Public Health and the Environment (RIVM), PO Box 1, 3720 BA, Bilthoven, The Netherlands
Annemieke M. W. Spijkerman
Danish Cancer Society Research Center, Strandboulevarden 49, 2100, Copenhagen, Denmark
Anne Tjonneland
Cancer Registry and Histopathology Department, Azienda Sanitaria Provinciale No 7, Piazza Igea Nr 1, 97100, Ragusa, Italy
Rosario Tumino
Associazone Iblea per la Ricerca Epidemiologica -Organizazione Non Lucrativa di Utilità Sociale, Piazza Amcione No 2, 97100, Ragusa, Italy
Rosario Tumino
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, 3584 CG, Utrecht, the Netherlands
Yvonne T. van der Schouw
School of Public Health, Imperial College London, London, United Kingdom
Elio Riboli
Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
Mark I. McCarthy
Oxford Centre for Diabetes, University of Oxford, Oxford, United Kingdom
Mark I. McCarthy
Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, United Kingdom
Mark I. McCarthy
University of Exeter Medical School, Exeter, United Kingdom
Inês Barroso

Authors

Lina Cai
View author publications
You can also search for this author in PubMed Google Scholar
Eleanor Wheeler
View author publications
You can also search for this author in PubMed Google Scholar
Nicola D. Kerrison
View author publications
You can also search for this author in PubMed Google Scholar
Jian’an Luan
View author publications
You can also search for this author in PubMed Google Scholar
Panos Deloukas
View author publications
You can also search for this author in PubMed Google Scholar
Paul W. Franks
View author publications
You can also search for this author in PubMed Google Scholar
Pilar Amiano
View author publications
You can also search for this author in PubMed Google Scholar
Eva Ardanaz
View author publications
You can also search for this author in PubMed Google Scholar
Catalina Bonet
View author publications
You can also search for this author in PubMed Google Scholar
Guy Fagherazzi
View author publications
You can also search for this author in PubMed Google Scholar
Leif C. Groop
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Kaaks
View author publications
You can also search for this author in PubMed Google Scholar
José María Huerta
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Masala
View author publications
You can also search for this author in PubMed Google Scholar
Peter M. Nilsson
View author publications
You can also search for this author in PubMed Google Scholar
Kim Overvad
View author publications
You can also search for this author in PubMed Google Scholar
Valeria Pala
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Panico
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Rodriguez-Barranco
View author publications
You can also search for this author in PubMed Google Scholar
Olov Rolandsson
View author publications
You can also search for this author in PubMed Google Scholar
Carlotta Sacerdote
View author publications
You can also search for this author in PubMed Google Scholar
Matthias B. Schulze
View author publications
You can also search for this author in PubMed Google Scholar
Annemieke M. W. Spijkerman
View author publications
You can also search for this author in PubMed Google Scholar
Anne Tjonneland
View author publications
You can also search for this author in PubMed Google Scholar
Rosario Tumino
View author publications
You can also search for this author in PubMed Google Scholar
Yvonne T. van der Schouw
View author publications
You can also search for this author in PubMed Google Scholar
Stephen J. Sharp
View author publications
You can also search for this author in PubMed Google Scholar
Nita G. Forouhi
View author publications
You can also search for this author in PubMed Google Scholar
Elio Riboli
View author publications
You can also search for this author in PubMed Google Scholar
Mark I. McCarthy
View author publications
You can also search for this author in PubMed Google Scholar
Inês Barroso
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Langenberg
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Wareham
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: L.C., E.W., N.K., J.L., I.B., P.D., C.L., N.J.W.; analysed the data: L.C., N.K., J.L.; wrote and contributed to the writing of the first manuscript draft: L.C., E.W., N.K., J.L., C.L., N.J.W.; all authors contributed to the interpretation of the data, revised the article critically for important intellectual content, and approved the final version of the paper to be published; NJW is the guarantor of this work and, as such, has full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Corresponding author

Correspondence to Nicholas J. Wareham.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Cai, L., Wheeler, E., Kerrison, N.D. et al. Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study. Sci Data 7, 393 (2020). https://doi.org/10.1038/s41597-020-00716-7

Download citation

Received: 05 May 2020
Accepted: 04 September 2020
Published: 13 November 2020
DOI: https://doi.org/10.1038/s41597-020-00716-7