Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M.; Agarwala, Vineeta; Gaulton, Kyle J.; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J.; Rivas, Manuel A.; Perry, John R. B.; Sim, Xueling; Blackwell, Thomas W.; Robertson, Neil R.; Rayner, N William; Cingolani, Pablo; Locke, Adam E.; Tajes, Juan Fernandez; Highland, Heather M.; Dupuis, Josee; Chines, Peter S.; Lindgren, Cecilia M.; Hartl, Christopher; Jackson, Anne U.; Chen, Han; Huyghe, Jeroen R.; van de Bunt, Martijn; Pearson, Richard D.; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M.; Gamazon, Eric R.; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A.; Below, Jennifer E.; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L.; Pasko, Dorota; Parker, Stephen C. J.; Varga, Tibor V.; Green, Todd; Beer, Nicola L.; Day-Williams, Aaron G.; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J.; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P.; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F.; Han, Bok-Ghee; Jenkinson, Christopher P.; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C. Y.; Palmer, Nicholette D.; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E.; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D.; Neale, Benjamin M.; Purcell, Shaun; Butterworth, Adam S.; Howson, Joanna M. M.; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K. L.; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H. T.; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E.; Rybin, Dennis; Farook, Vidya S.; Fowler, Sharon P.; Freedman, Barry I.; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J.; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K.; Puppala, Sobha; Scott, William R.; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A.; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C.; Mangino, Massimo; Bonnycastle, Lori L.; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L.; Herder, Christian; Groves, Christopher J.; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A.; Doney, Alex S. F.; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J.; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E.; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H.; Stirrups, Kathleen; Wood, Andrew R.; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O.; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P.; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B.; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N. A.; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M.; Syvänen, Ann-Christine; Bergman, Richard N.; Bharadwaj, Dwaipayan; Bottinger, Erwin P.; Cho, Yoon Shin; Chandak, Giriraj R.; Chan, Juliana CN; Chia, Kee Seng; Daly, Mark J.; Ebrahim, Shah B.; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A.; Lehman, Donna M.; Jia, Weiping; Ma, Ronald C. W.; Pollin, Toni I.; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J. F.; Small, Kerrin S.; Ried, Janina S.; DeFronzo, Ralph A.; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J.; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W.; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R.; Gloyn, Anna L.; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D.; Hattersley, Andrew T.; Bowden, Donald W.; Collins, Francis S.; Atzmon, Gil; Chambers, John C.; Spector, Timothy D.; Laakso, Markku; Strom, Tim M.; Bell, Graeme I.; Blangero, John; Duggirala, Ravindranath; Tai, E. Shyong; McVean, Gilean; Hanis, Craig L.; Wilson, James G.; Seielstad, Mark; Frayling, Timothy M.; Meigs, James B.; Cox, Nancy J.; Sladek, Rob; Lander, Eric S.; Gabriel, Stacey; Mohlke, Karen L.; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J.; Morris, Andrew P.; Kang, Hyun Min; Altshuler, David; Burtt, Noël P.; Florez, Jose C.; Boehnke, Michael; McCarthy, Mark I.

doi:10.1038/sdata.2017.179

Download PDF

Data Descriptor
Open access
Published: 19 December 2017

Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

Jason Flannick^1,2^na1,
Christian Fuchsberger³^na1,
Anubha Mahajan⁴^na1,
Tanya M. Teslovich³,
Vineeta Agarwala^2,5,
Kyle J. Gaulton ORCID: orcid.org/0000-0003-1318-7161⁴,
Lizz Caulkins²,
Ryan Koesterer²,
Clement Ma³,
Loukas Moutsianas⁴,
Davis J. McCarthy^4,6,
Manuel A. Rivas⁴,
John R. B. Perry^4,7,8,9,
Xueling Sim³,
Thomas W. Blackwell³,
Neil R. Robertson^4,10,
N William Rayner^4,10,11,
Pablo Cingolani^12,13,
Adam E. Locke ORCID: orcid.org/0000-0001-6227-198X³,
Juan Fernandez Tajes⁴,
Heather M. Highland¹⁴,
Josee Dupuis^15,16,
Peter S. Chines¹⁷^na3,
Cecilia M. Lindgren^2,4,
Christopher Hartl²,
Anne U. Jackson³,
Han Chen^15,18,
Jeroen R. Huyghe³,
Martijn van de Bunt ORCID: orcid.org/0000-0002-6744-6125^4,10,
Richard D. Pearson⁴,
Ashish Kumar^4,19,
Martina Müller-Nurasyid ORCID: orcid.org/0000-0002-7898-2353^20,21,22,23,
Niels Grarup ORCID: orcid.org/0000-0001-5526-1070²⁴,
Heather M. Stringham ORCID: orcid.org/0000-0002-2991-6392³,
Eric R. Gamazon ORCID: orcid.org/0000-0003-4204-8734²⁵,
Jaehoon Lee²⁶,
Yuhui Chen⁴,
Robert A. Scott⁸,
Jennifer E. Below²⁷,
Peng Chen²⁸,
Jinyan Huang²⁹,
Min Jin Go³⁰,
Michael L. Stitzel³¹,
Dorota Pasko⁷,
Stephen C. J. Parker³²,
Tibor V. Varga ORCID: orcid.org/0000-0002-2383-699X³³,
Todd Green²,
Nicola L. Beer ORCID: orcid.org/0000-0002-4964-7150¹⁰,
Aaron G. Day-Williams¹¹,
Teresa Ferreira⁴,
Tasha Fingerlin³⁴,
Momoko Horikoshi^4,10,
Cheng Hu³⁵,
Iksoo Huh²⁶,
Mohammad Kamran Ikram^36,37,38,
Bong-Jo Kim³⁰,
Yongkang Kim²⁶,
Young Jin Kim³⁰,
Min-Seok Kwon³⁹,
Juyoung Lee³⁰,
Selyeong Lee²⁶,
Keng-Han Lin³,
Taylor J. Maxwell²⁷,
Yoshihiko Nagai^13,40,41,
Xu Wang²⁸,
Ryan P. Welch ORCID: orcid.org/0000-0001-6378-1295³,
Joon Yoon ORCID: orcid.org/0000-0002-9509-119X³⁹,
Weihua Zhang^42,43,
Nir Barzilai⁴⁴,
Benjamin F. Voight ORCID: orcid.org/0000-0002-6205-9994^45,46,
Bok-Ghee Han³⁰,
Christopher P. Jenkinson^47,48,
Teemu Kuulasmaa⁴⁹,
Johanna Kuusisto^49,50,
Alisa Manning²,
Maggie C. Y. Ng^51,52,
Nicholette D. Palmer^51,52,53,
Beverley Balkau⁵⁴,
Alena Stančáková⁴⁹,
Hanna E. Abboud⁴⁷^na3,
Heiner Boeing⁵⁵,
Vilmantas Giedraitis ORCID: orcid.org/0000-0003-3423-2021⁵⁶,
Dorairaj Prabhakaran⁵⁷,
Omri Gottesman⁵⁸,
James Scott⁵⁹,
Jason Carey²,
Phoenix Kwan³,
George Grant²,
Joshua D. Smith⁶⁰,
Benjamin M. Neale ORCID: orcid.org/0000-0003-1513-6077^2,61,
Shaun Purcell^2,62,63,
Adam S. Butterworth⁶⁴,
Joanna M. M. Howson⁶⁴,
Heung Man Lee⁶⁵,
Yingchang Lu⁵⁸,
Soo-Heon Kwak⁶⁶,
Wei Zhao⁶⁷,
John Danesh^11,64,68,
Vincent K. L. Lam⁶⁵,
Kyong Soo Park ORCID: orcid.org/0000-0003-3597-342X⁶⁹,
Danish Saleheen^70,71,
Wing Yee So⁶⁵,
Claudia H. T. Tam⁶⁵,
Uzma Afzal⁴²,
David Aguilar⁷²,
Rector Arya⁷³,
Tin Aung^36,37,38,
Edmund Chan⁷⁴,
Carmen Navarro^75,76,77,
Ching-Yu Cheng^28,36,37,38,
Domenico Palli ORCID: orcid.org/0000-0002-5558-2437⁷⁸,
Adolfo Correa⁷⁹,
Joanne E. Curran⁸⁰,
Dennis Rybin¹⁵,
Vidya S. Farook⁸¹,
Sharon P. Fowler⁴⁷,
Barry I. Freedman⁸²,
Michael Griswold⁸³,
Daniel Esten Hale⁷³,
Pamela J. Hicks^51,52,53,
Chiea-Chuen Khor ORCID: orcid.org/0000-0002-1128-4729^{28,36,37,84,85},
Satish Kumar ORCID: orcid.org/0000-0002-1969-4431⁸⁰,
Benjamin Lehne⁴²,
Dorothée Thuillier⁸⁶,
Wei Yen Lim²⁸,
Jianjun Liu^28,85,
Marie Loh^42,87,88,
Solomon K. Musani⁸⁹,
Sobha Puppala⁸¹,
William R. Scott⁴²,
Loïc Yengo⁸⁶,
Sian-Tsung Tan^43,59,
Herman A. Taylor⁷⁹,
Farook Thameem⁴⁷,
Gregory Wilson⁹⁰,
Tien Yin Wong^36,37,38,
Pål Rasmus Njølstad^91,92,
Jonathan C. Levy¹⁰,
Massimo Mangino ORCID: orcid.org/0000-0002-2167-7470^9,93,
Lori L. Bonnycastle¹⁷,
Thomas Schwarzmayr⁹⁴,
João Fadista⁹⁵,
Gabriela L. Surdulescu⁹,
Christian Herder ORCID: orcid.org/0000-0002-2050-093X^96,97,
Christopher J. Groves¹⁰,
Thomas Wieland⁹⁴,
Jette Bork-Jensen²⁴,
Ivan Brandslund^98,99,
Cramer Christensen¹⁰⁰,
Heikki A. Koistinen ORCID: orcid.org/0000-0001-7870-070X^{101,102,103,104},
Alex S. F. Doney¹⁰⁵,
Leena Kinnunen¹⁰¹,
Tõnu Esko^{2,106,107,108},
Andrew J. Farmer¹⁰⁹,
Liisa Hakaste^102,110,111,
Dylan Hodgkiss⁹,
Jasmina Kravic⁹⁵,
Valeri Lyssenko⁹⁵,
Mette Hollensted²⁴,
Marit E. Jørgensen¹¹²,
Torben Jørgensen^113,114,115,
Claes Ladenvall⁹⁵,
Johanne Marie Justesen ORCID: orcid.org/0000-0002-0484-8522²⁴,
Annemari Käräjämäki^116,117,
Jennifer Kriebel ORCID: orcid.org/0000-0003-4270-018X^97,118,119,
Wolfgang Rathmann^97,120,
Lars Lannfelt⁵⁶,
Torsten Lauritzen¹²¹,
Narisu Narisu¹⁷,
Allan Linneberg^113,122,123,
Olle Melander¹²⁴,
Lili Milani ORCID: orcid.org/0000-0002-5323-3102¹⁰⁶,
Matt Neville^10,125,
Marju Orho-Melander¹²⁶,
Lu Qi^127,128,
Qibin Qi^127,129,
Michael Roden^96,97,130,
Olov Rolandsson¹³¹,
Amy Swift¹⁷,
Anders H. Rosengren⁹⁵,
Kathleen Stirrups¹¹,
Andrew R. Wood⁷,
Evelin Mihailov¹⁰⁶,
Christine Blancher¹³²,
Mauricio O. Carneiro²,
Jared Maguire²,
Ryan Poplin²,
Khalid Shakir²,
Timothy Fennell²,
Mark DePristo²,
Martin Hrabé de Angelis^97,133,134,
Panos Deloukas ORCID: orcid.org/0000-0001-9251-070X^11,135,136,
Anette P. Gjesing²⁴,
Goo Jun^3,27,
Peter Nilsson¹³⁷,
Jacquelyn Murphy²,
Robert Onofrio²,
Barbara Thorand^97,118,
Torben Hansen^24,138,
Christa Meisinger^97,118,
Frank B. Hu^29,127,
Bo Isomaa^110,139,
Fredrik Karpe^10,125,
Liming Liang^18,29,
Annette Peters^23,97,118,
Cornelia Huth^97,118,
Stephen P O'Rahilly¹⁴⁰,
Colin N. A. Palmer¹⁴¹,
Oluf Pedersen²⁴,
Rainer Rauramaa¹⁴²,
Jaakko Tuomilehto ORCID: orcid.org/0000-0002-8306-6202^{143,144,145,146},
Veikko Salomaa¹⁴⁶,
Richard M. Watanabe^147,148,149,
Ann-Christine Syvänen¹⁵⁰,
Richard N. Bergman¹⁵¹,
Dwaipayan Bharadwaj¹⁵²,
Erwin P. Bottinger⁵⁸,
Yoon Shin Cho¹⁵³,
Giriraj R. Chandak¹⁵⁴,
Juliana CN Chan^65,155,156,
Kee Seng Chia²⁸,
Mark J. Daly⁶¹,
Shah B. Ebrahim⁵⁷,
Claudia Langenberg⁸,
Paul Elliott ORCID: orcid.org/0000-0002-7511-5684^42,157,
Kathleen A. Jablonski¹⁵⁸,
Donna M. Lehman⁴⁷,
Weiping Jia³⁵,
Ronald C. W. Ma ORCID: orcid.org/0000-0002-1227-803X^65,155,156,
Toni I. Pollin¹⁵⁹,
Manjinder Sandhu^11,64,
Nikhil Tandon¹⁶⁰,
Philippe Froguel ORCID: orcid.org/0000-0003-2972-0784^86,161,
Inês Barroso ORCID: orcid.org/0000-0001-5800-4520^11,140,
Yik Ying Teo^28,162,163,
Eleftheria Zeggini ORCID: orcid.org/0000-0003-4238-659X¹¹,
Ruth J. F. Loos ORCID: orcid.org/0000-0002-8532-5087⁵⁸,
Kerrin S. Small ORCID: orcid.org/0000-0003-4566-0005⁹,
Janina S. Ried²⁰,
Ralph A. DeFronzo⁴⁷,
Harald Grallert^97,118,119,
Benjamin Glaser¹⁶⁴,
Andres Metspalu¹⁰⁶,
Nicholas J. Wareham⁸,
Mark Walker¹⁶⁵,
Eric Banks²,
Christian Gieger^20,118,119,
Erik Ingelsson^4,166,
Hae Kyung Im ORCID: orcid.org/0000-0003-0333-5685²⁵,
Thomas Illig^119,167,168,
Paul W. Franks^33,127,131,
Gemma Buck¹³²,
Joseph Trakalo¹³²,
David Buck¹³²,
Inga Prokopenko ORCID: orcid.org/0000-0003-1624-7457^4,10,161,
Reedik Mägi¹⁰⁶,
Lars Lind¹⁶⁹,
Yossi Farjoun¹⁷⁰,
Katharine R. Owen^10,125,
Anna L. Gloyn ORCID: orcid.org/0000-0003-1205-1844^4,10,125,
Konstantin Strauch^20,22,
Tiinamaija Tuomi^{102,110,111,171},
Jaspal Singh Kooner^43,59,172,
Jong-Young Lee ORCID: orcid.org/0000-0002-0092-9958³⁰,
Taesung Park^26,39,
Peter Donnelly^4,6,
Andrew D. Morris^173,174,
Andrew T. Hattersley¹⁷⁵,
Donald W. Bowden^51,52,53,
Francis S. Collins¹⁷,
Gil Atzmon^44,176,
John C. Chambers^42,43,172,
Timothy D. Spector⁹,
Markku Laakso^49,50,
Tim M. Strom^94,177,
Graeme I. Bell¹⁷⁸,
John Blangero⁸⁰,
Ravindranath Duggirala⁸¹,
E. Shyong Tai^28,74,179,
Gilean McVean^4,180,
Craig L. Hanis²⁷,
James G. Wilson¹⁸¹,
Mark Seielstad ORCID: orcid.org/0000-0001-5783-1401^182,183,
Timothy M. Frayling⁷,
James B. Meigs¹⁸⁴,
Nancy J. Cox²⁵,
Rob Sladek^13,40,185,
Eric S. Lander¹⁸⁶,
Stacey Gabriel²,
Karen L. Mohlke ORCID: orcid.org/0000-0001-6721-153X¹⁸⁷,
Thomas Meitinger^94,177,
Leif Groop ORCID: orcid.org/0000-0002-0187-3263^95,171,
Goncalo Abecasis³,
Laura J. Scott³,
Andrew P. Morris^4,106,188,
Hyun Min Kang¹,
David Altshuler^{1,2,107,189,190,191}^na2,
Noël P. Burtt²,
Jose C. Florez^2,62,189,190,
Michael Boehnke³^na2 &
…
Mark I. McCarthy^4,10,125^na2

Scientific Data volume 4, Article number: 170179 (2017) Cite this article

9597 Accesses
21 Citations
16 Altmetric
Metrics details

Subjects

An Erratum to this article was published on 23 January 2018

Abstract

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

Design Type(s)	individual genetic characteristics comparison design • parallel group design • data integration objective
Measurement Type(s)	genetic sequence variation analysis
Technology Type(s)	whole genome sequencing • exome sequencing
Factor Type(s)	ethnic group
Sample Characteristic(s)	Homo sapiens • Finland • Germany • United Kingdom • Sweden • United States of America • South Korea • Singapore • Israel

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Background & Summary

Genome wide association studies (GWAS) have provided a valuable but incomplete window into the genetic basis of type 2 diabetes (T2D)¹. Common (minor allele frequency [MAF]>5%) variants at over 100 loci have been robustly associated with disease risk, but most have not yet been translated to causal variants, effector transcripts, or disease mechanisms². Because common variants from GWAS have modest effect sizes, and because those previously published explain in aggregate only 10–15% of the genetic basis of T2D³, it has been hypothesized that variants unexplored by GWAS might have a greater impact on efforts to understand or treat T2D^4,5.

To produce a more complete catalogue of rare, low-frequency, and common variants, the GoT2D and T2D-GENES consortia analysed whole-exome and genome sequence data in up to 12,940 individuals (6,504 T2D cases and 6,436 controls; Fig. 1a, Tables 1 and 2)³. First, to interrogate lower-frequency (MAF>0.5%) variation genome-wide, 2,657 Northern and Central European individuals were selected by GoT2D (1,326 cases, 1,331 controls) and characterized with a combination of low-pass (~5x) whole-genome sequencing, deep (~82x) whole-exome sequencing, and high-density (2.5M) SNP genotyping. Genetic variants from these assays were incorporated into a phased integrated panel (WGS panel), capturing an estimated 99% of variants, genome-wide, present in more than 0.5% of individuals (Table 3). Second, additional individuals from 10 cohorts spanning five ethnicities (European, Hispanic, South Asian, East Asian, and African American) were characterized by deep (~82x) whole-exome sequencing by T2D-GENES. The resultant T2D-GENES exome sequence data were combined with the GoT2D exome sequence data to produce a second panel of variation (WES panel), capturing an estimated 99.7% of coding variants present in more than 0.5% of the combined 12,940 individuals (Tables 4 and 5).

**Figure 1: Overview of data and analysis generation.**

Table 1 Summary of studies included in WGS panel.

Full size table

Table 2 Summary of studies included in WES panel.

Full size table

Table 3 Summary of variants in the WGS panel.

Full size table

Table 4 Summary of variant annotations in the WES panel.

Full size table

Table 5 Summary of coding variant frequencies in the WES panel.

Full size table

Each variant was tested for association with T2D under an additive genetic model. To increase power, variants were then assessed in larger sample sizes via one of two means (Fig. 1b). Coding variants were analysed in 79,854 additional individuals (28,305 T2D cases, 51,549 controls) via the Illumina Exome Array, which captures 81.6% of European MAF>0.5% coding variants in the WES panel. Non-coding variants (and coding variants absent from the Exome Array) were analysed in up to 44,414 additional individuals (11,645 cases, 32,769 controls) via statistical genotype imputation; after quality control, this analysis included 89% of variants observed in three or more individuals in the WGS panel. Each variant was tested for association with T2D in the additional individuals, under an additive genetic model, and association statistics were then combined with those from the sequence data via meta-analysis.

Collectively, these analyses suggest a limited role for low-frequency variation in the genetic basis of T2D³. However, they also demonstrate an ability to identify novel hypotheses about the effects of gene inactivation⁶, a resource of coding variants for calibrating cellular assays⁷, and a catalogue of noncoding variants for use in statistical or functional fine mapping of GWAS signals³. The WGS panel also provides a novel resource for genotype imputation⁸, with increased resolution for T2D-specific variants relative to the 1000 Genomes (1000G) reference panel, as well as a means to calibrate simulation-based models of population history⁹ or disease genetic architecture¹⁰.

Methods

These methods are a modified version of the descriptions contained in Fuchsberger et al.³.

Ethics statement

All human research was approved by the relevant institutional review boards and conducted according to the Declaration of Helsinki. All participants provided written informed consent.

WGS GoT2D integrated panel generation

Ascertainment of individuals

Individuals were sampled from four studies: the Finland-United States Investigation of NIDDM Genetics (FUSION) Study (493 cases, 486 controls), KORA (101 cases, 104 controls), the UKT2D Consortium (322 cases, 322 controls), and the Malmö-Botnia Study (410 cases, 419 controls). All individuals were of Northern or Central European ancestry. Cases were preferentially lean, had (relatively) early onset T2D, or had a familial history of T2D; controls by comparison were preferentially overweight or had low fasting glucose levels¹¹. To decrease the likelihood of selecting T2D cases who in fact had type 1 diabetes (T1D) or monogenic forms of diabetes (such as Maturity Onset Diabetes of the Young), cases with an age of diagnosis below 35, testing positive for GAD antibodies, or with a first-degree relative known to have T1D were not included. Statistics of the 2,657 individuals ultimately included in the association analysis are provided in Table 1.

Many of these individuals were measured for cardiometabolic phenotypes other than T2D, including glucose and insulin, anthropometrics, lipids, and blood pressure (Table 6).

Table 6 Additional cardiometabolic phenotypes measured in individuals included in the WGS and WES panels.

Full size table

DNA sample preparation

De-identified DNA samples were sent to the Broad Institute in Cambridge, MA, USA (Malmö-Botnia and FUSION), the Wellcome Trust Centre for Human Genetics in Oxford, UK (UKT2D), or the Helmholtz Zentrum München in Germany (KORA). DNA quantity was measured by Picogreen (all samples) to ensure sufficient total DNA and minimum concentrations for downstream experiments. Samples (Malmö-Botnia, FUSION, UKT2D) were then genotyped on a Sequenom iPLEX assay for a set of 24 SNPs (one X chromosome and 23 autosomal SNPs), with only samples with high-quality genotypes advanced for subsequent sequencing or genotyping.

Exome sequencing

Genomic DNA was sheared, end repaired, ligated with barcoded Illumina sequencing adapters, amplified, size selected, and subjected to in-solution hybrid capture using the Agilent SureSelect Human All Exon v2.0 (Malmö-Botnia, FUSION, UK2T2D) or v3.0 (KORA) bait set (Agilent Technologies, USA). Resulting Illumina exome sequencing libraries were qPCR quantified, pooled, and sequenced with 76-bp paired-end reads using Illumina GAII or HiSeq 2,000 sequencers to ~82-fold mean coverage.

Genome sequencing

Whole-genome Illumina sequencing library construction was performed as described for exome sequencing above, except that genomic DNA was sheared to a larger target size and hybrid capture was not performed. The resulting libraries were size selected to contain fragment insert sizes of 380 bp±20% (Malmö-Botnia, FUSION, KORA) and 420 bp±25% (UKT2D) using gel electrophoresis or the SAGE Pippin Prep (Sage Science, USA). Libraries were qPCR quantified, pooled, and sequenced with 101-bp paired-end reads using Illumina GAII or HiSeq 2,000 sequencers to ∼5-fold mean coverage.

HumanOmni2.5 array genotyping and quality control (QC)

SNP array genotyping was performed by the Broad Genetic Analysis Platform. DNA samples were placed on 96-well plates and assayed using the Illumina HumanOmni2.5-4v1_B SNP array. Genotypes were then called using Illumina GenomeStudio v2010.3 with default clusters. SNPs with GenTrain score <0.6, cluster separation score <0.4, or call rate <97% were considered technical failures at the genotyping laboratory and excluded from further analysis. Next, 85 individuals with a genotype call rate below 98%, low genetic fingerprint (24-marker panel) concordance, or estimated gender discordance were excluded from further analysis. Finally, SNPs monomorphic across all individuals, failed by the 1000G Omni 2.5 QC filter, or with Hardy-Weinberg equilibrium P<10⁻⁶ were excluded from analysis.

Processing, quality control, and variant calling of sequence data

Sequence data were processed and aligned to the human genome (build hg19) using the Picard (http://broadinstitute.github.io/picard/), BWA¹², and GATK^13,14 software packages, following best-practice pipelines.

Sequencing coverage of each individual was computed based on the fraction of target bases with >20 reads aligned (exome sequencing) or average number of reads aligned across all bases genome-wide (genome sequencing). Based on these metrics, we excluded from further analysis exome sequence data (from 151 individuals) with coverage ≤20x in >20% of the target bases and genome sequence data (from 68 individuals) with average coverage ≤5x.

Possible DNA contamination of sequence data was assessed using verifyBamID¹⁵, either by direct comparison of sequence and HumanOmni2.5 array genotypes (where available) or by indirect estimates of contamination based on HumanOmni2.5 array allele frequencies. DNA samples with estimated contamination >2% using either method were excluded from further analysis (data from 7 individuals in the exome sequencing dataset and 59 individuals in the genome sequencing dataset). Uncontaminated DNA sample swaps were also detected via comparison of sequence and array data and corrected prior to variant calling.

To identify single nucleotide variants (SNVs) from the whole-genome sequence data, we used two independent SNV calling pipelines: GotCloud¹⁶ and the GATK UnifiedGenotyper¹⁴. We merged unfiltered SNV calls across the two call sets and then processed the merged site list through the SVM and VQSR filtering algorithms implemented by those pipelines. SNVs that failed both filtering algorithms were excluded from further analysis. To identify SNVs from the whole-exome sequence data, we used the GATK UnifiedGenotyper best-practices pipeline¹⁴.

To identify short insertions and deletions (indels) from the whole-genome sequence data, we called variants using the GATK UnifiedGenotyper best-practices pipeline. Because indels are known to have high false positive rates¹⁷, we applied more stringent criteria for indel QC than for SNV QC, excluding indels that failed either the SVM or VQSR filtering algorithms. To identify indels from the whole-exome sequence data, we used the GATK UnifiedGenotyper best-practices pipeline¹⁴.

To identify structural variants (SVs, or >100-bp deletions) from the whole-genome sequence data, we used GenomeSTRiP¹⁸. To increase sensitivity after initial discovery of SVs, we merged the discovered sites with deletions identified in 1,092 sequenced individuals from the 1000G Project¹⁷ and then genotyped the merged site lists across the whole-genome sequenced individuals. After applying the default filtering implemented in GenomeSTRiP, pass-filtered sites variable in any of the individuals were identified as candidate variant sites. Among these candidate sites, we excluded variants in known immunoglobin loci to reduce the impact of possible cell-line artifacts. We did not call SVs from the whole-exome sequence data.

Integrated panel generation

We merged variants discovered from the three experimental platforms into one site list. For individuals who had data from each of the three platforms, we then calculated genotype likelihoods across all sites separately by platform: for the whole-genome sequence data, we used GotCloud; for the exome sequence data, we used the GATK UnifiedGenotyper; and for the HumanOmni2.5 data, we converted hard genotype calls into genotype likelihoods assuming a genotype error rate of 10⁻⁶. If a site was not assayed by one of the three platforms, it was ignored in likelihood calculation for that platform.

We then calculated combined genotype likelihoods as the product of the genome, exome, and HumanOmni2.5 likelihoods, assuming independence across platforms. Following a strategy originally developed for the 1000G Phase 1 project¹⁷, we then phased the integrated likelihoods using Beagle¹⁹ (with 10,000 SNVs per chunk and 1,000 overlapping SNVs between consecutive chunks) and refined phased genotypes using Thunder²⁰ as implemented in GotCloud (with 400 states).

Using the genotypes from the integrated panel, we performed principal component analysis (PCA) separately for each of the three variant types (SNVs, indels, SVs), using EPACTS on an LD-pruned (r²<0.20) set of MAF>0.01 autosomal variants (with variants in large high-LD regions^21,22 or with Hardy-Weinberg P<10⁻⁶ removed). Inspecting the first ten PCs for each variant type, we identified 43 outlier individuals based on PCs from SNVs and indels and 136 additional outliers based on PCs from SVs; these 179 individuals were excluded from further analysis. Additionally, 38 individuals with close relationships with other study individuals (estimated genome-wide identity-by-descent proportion of alleles shared >0.20) were excluded from further analysis.

The final WGS panel contains genotypes from 2,874 individuals at 26.85M SNVs, 1.59M indels, and 11.88 K SVs. The final analysis set includes genotypes from 2,657 individuals at 26.20M SNVs, 1.50M indels, and 8.88K SVs SVs (Table 3).

WES (GoT2D+T2D-GENES Multiethnic) panel generation

Ascertainment of individuals

In addition to the individuals within the WGS panel, additional individuals, 10,242 of which were included in the final analysis, were chosen for whole-exome sequencing from 10 studies: the Jackson Heart Study (500 African-American cases, 526 matched controls), the Wake Forest School of Medicine Study (518 African-American cases, 530 matched controls), the Korea Association Research Project (526 East-Asian cases, 561 matched controls), the Singapore Diabetes Cohort Study and Singapore Prospective Study Program (486 East-Asian cases, 592 matched controls), Ashkenazi (506 European cases, 355 matched controls), the Metabolic Syndrome in Men (METSIM) Study (484 European cases, 498 matched controls), the San Antonio Mexican American Family Studies (272 Hispanic cases, 218 matched controls), the Starr County Texas study (749 Hispanic cases, 704 matched controls), the London Life Sciences Population (LOLIPOP) Study (531 South-Asian cases, 538 matched controls), and the Singapore Indian Eye Study (563 South-Asian cases, 585 matched controls). Potential T1D or MODY cases were excluded via similar approaches as for the whole-genome sequencing experiment. Statistics of the individuals ultimately included in the association analysis are provided in Table 2.

As for the WGS panel, many individuals were measured for additional cardiometabolic phenotypes (Table 6).

Exome sequencing

DNA samples were obtained and sequenced in the same manner as described for the exome sequencing component of the WGS panel.

Processing, QC, and variant calling

As for the exome sequence data within the WGS panel, sequence data for the WES panel were processed and aligned to the human genome (build hg19) using the Picard, BWA¹², and GATK^13,14 software packages and best-practice pipelines. Genotype likelihoods were computed controlling for contamination. Hard calls (the GATK-called genotypes but set as missing at a genotype quality [GQ] <20 threshold¹⁴) and dosages (the expected value of the genotype, defined as Pr(RX|data)+2Pr(XX|data), where R is the reference and X the alternative allele) were computed for each individual at each variant site. Hard calls were used only for quality control, while dosages were used in downstream association analyses. Multi-allelic SNVs and indels were dichotomized by collapsing alternate alleles into one category.

Individuals were excluded from analysis if they were outliers on one of multiple metrics: poor array genotype concordance (where available), high number of variant alleles or singletons, high or low allele balance (average proportion of non-reference alleles at heterozygous sites), or excess mean heterozygosity or ratio of heterozygous to homozygous genotypes. Within this reduced set of individuals, we then further excluded variants based on hard call rate (<90% in any cohort), deviation from Hardy-Weinberg equilibrium (P<10⁻⁶ in any ancestry group), or differential call rate between T2D cases and controls (P<10⁻⁴ in any ancestry group).

The final WES panel contains genotypes for 13,008 individuals at 2.93M SNVs and 111.9 K indels. The set ultimately included in coding variant association analysis (after removal of individuals with close relatives or of uncertain ancestry) contains genotypes for 12,940 individuals at 2.89M SNVs and 110.2 K indels (Tables 4 and 5).

Assaying variants in larger sample sizes

Imputation from the WGS panel

We carried out genotype imputation, using existing SNP array data, from the WGS panel into 44,414 individuals (11,645 T2D cases and 32,769 controls) from 13 studies participating in the DIAGRAM consortium. Each study performed quality control independently. A more detailed description of the analyzed individuals is available elsewhere³.

Exome array genotyping from the WES panel

We considered 28,305 T2D cases and 51,549 controls from 13 studies of European ancestry, each genotyped with the Illumina exome array. Studies independently called genotypes using the Illumina GenCall algorithm (http://www.illumina.com/Documents/products/technotes/technote_gencall_data_analysis_software.pdf ) or Birdseed²³. Individuals were excluded if they had a low call rate (<99%), excess heterozygosity, high singleton counts, evidence of non-European ancestry, discrepancy between recorded and genotyped sex, or discordance with prior SNP array or genotyping platform fingerprint data (where available). Variants were excluded if they had a low call rate (<99%), deviation from Hardy-Weinberg equilibrium (P<10⁻⁶), GenTrain score <0.6, cluster separation score <0.4, or a suspect intensity plot based on manual inspection. After quality control, missing genotypes were re-called using zCall²⁴, and additional quality control was performed to exclude poorly genotyped individuals (call rate <99% or excess heterozygosity) or variants (call rate <99%). A more detailed description of the analyzed individuals is available elsewhere³.

Association analysis

WGS panel single variant analysis

For each variant in the WGS panel, we tested for association between genotype and T2D in the 2,657 sequenced individuals. We used a logistic regression framework (assuming an additive genetic model) with the Firth bias-corrected likelihood ratio test^25,26 to test for significance. Tests were adjusted for sex, the first two PCs computed based on genotypes from the HumanOmni2.5M array, and an indicator function for observed temporal stratification based on sequencing date and center.

Analysis of imputed datasets

In each of the thirteen studies within which variants from the WGS panel were imputed, SNVs with minor allele count (MAC)≥1 were tested for T2D association under an additive genetic model. Association tests were adjusted for study-specific covariates and performed using either the Firth, likelihood ratio, or score tests as implemented in EPACTS (https://genome.sph.umich.edu/wiki/EPACTS) or SNPTEST²⁷. Residual population stratification for each study was accounted for using genomic control²⁸. Association statistics were then combined across studies, using a fixed-effects sample-size weighted meta-analysis as implemented in METAL²⁹.

WES panel single variant analysis

For each variant in the WES panel, we tested for association between genotype and T2D in the 12,940 sequenced individuals. We computed separate association statistics for each ancestry group using EMMAX³⁰. Additionally, we performed association tests using the Wald statistic, adjusting for ethnic-specific principal components after exclusion of related individuals. For each test, we calculated genomic control inflation factors and corrected association summary statistics (P-values and standard errors) to account for residual population structure.

We subsequently performed a fixed-effects meta-analysis of ancestry-specific association summary statistics for each variant using (i) a sample-size weighting of P-values from the EMMAX analysis and (ii) an inverse-variance weighting of effect size estimates from the Wald analysis. For the final results, P-values were taken from the EMMAX analysis, and effect size estimates from the Wald analysis.

Analysis of exome array datasets

In each study within which exome array genotyping was applied, variants were tested for association with T2D via both the EMMAX and Wald tests. For the Wald test, related individuals were excluded and statistics were adjusted for study-specific principal components. For each study, P-values and standard errors were corrected based on the calculated genomic control inflation factor.

Variants were then combined via a fixed-effects meta-analysis. EMMAX P-values were combined via a sample-size weighted analysis, and Wald effect sizes were combined via an inverse-variant weighted analysis. For the final results, P-values were taken from the EMMAX analysis, and effect sizes were estimated from the Wald analysis.

Gene-level analysis

We first generated four variant lists (‘masks’) based on functional annotations and observed allele frequencies. Annotations were computed based on transcripts in ENSEMBL 66 (GRCh37.66) using CHAoS v0.6.3, SnpEFF v3.1³¹, and VEP v2.7³². We then identified variants predicted by at least one of the three algorithms in at least one mapped transcript to be protein-truncating ('for example, nonsense, frameshift, essential splice site), denoted PTVs, or other protein-altering (for example, missense, in-frame indel, non-essential splice site), denoted missense. We additionally used a previously described procedure³³ to identify subsets of missense variants bioinformatically predicted to be deleterious: those annotated as damaging by each of Polyphen2-HumDiv, PolyPhen2-HumVar, LRT, Mutation Taster, and SIFT were considered to meet ‘strict’ criteria, while those annotated as damaging by one of these algorithms was considered to meet ‘broad’ criteria. We then calculated the MAF of each variant based on the highest frequency across each of the five ancestry groups. We finally combined these annotations to produce four masks: the PTV-only mask included PTVs, the PTV+NS_strict mask included variants in the PTV-only mask as well as those meeting ‘strict’ criteria for deleteriousness, the PTV+NS_broad mask included variants in the PTV-only mask as well as those with MAF<1% meeting ‘broad’ criteria for deleteriousness, and the PTV+missense mask included variants in the PTV+NS_broad mask as well as those with MAF<1% annotated as missense.

We performed gene-level analysis using the MetaSKAT software package (v0.32)³⁴, employing the SKAT v0.93 library to perform a SKAT-O³⁵ analysis within each ancestry group as well as across all ancestry groups via meta-analysis. Within each ancestry group, we assumed homogenous allele frequencies and genetic affects and adjusted for ethnic-specific axes of genetic variation after exclusion of 96 related individuals. For the meta-analysis, we used the MetaSKAT option to analyze genotype-level data, allowing for heterogeneity of allele frequencies and genetic effects between ancestry groups. All analyses were completed using the recommended ρ vector for SKAT-O: (0, 0.12, 0.22, 0.32, 0.52, 0.5, 1).

Code availability

All analyses were performed using publically available software packages, using versions and parameters as described above.

Data Records

Genotypes and phenotypes from the WGS and WES panels are available at the European Genome-phenome Archive (EGA, Data Citation 1 and Data Citation 2) and the database of Genotypes and Phenotypes (dbGAP, Data Citation 3 to Data Citation 12).

The data in EGA are covered under a single data use agreement, which complies with all of the cohort-specific data use restrictions. While this does limit data access according to the criteria of the most restrictive cohort, it is the only mechanism through which the entire WES and WGS panels are available to investigators. Additionally, the EGA contains data from one cohort that could not be released to a US-based repository. To download either the WGS or WES panel from the EGA, investigators must obtain approval from a data access committee (DAC, t2dgenes-got2d-dac@broadinstitute.org) to analyze data from all cohorts included in the study. The requester will receive an application packet that includes a project proposal document and a Data Transfer Agreement (DTA). The requester must then provide to the DAC a short description of their study, the proposed use of the data, an approval from the Institution's IRB, and a signed DTA. Assuming IRB approval and an executed DTA, the process for obtaining final approval from the DAC takes 4–6 weeks. Once approved, investigators can download either a single VCF file with genotypes from all individuals in the WGS panel (Data Citation 1) or a single VCF file with genotypes from all individuals in the WES panel (Data Citation 2).

If investigators cannot obtain approval to analyze all cohorts in the WES panel (e.g., commercial uses) they can download cohort-specific data from dbGAP. Each cohort in dbGAP is subject to distinct data use restrictions, and investigators can obtain separate VCF files, as well as the raw sequence reads, for each of the cohorts.

The WGS and WES panels are accompanied by exclusion lists of variants and individuals (Data Citation 1 and Data Citation 2). The WGS VCF file contains data from the full set of 2,874 individuals and 28.45M variants that passed QC, with additional lists provided containing the 2,657 individuals and 17.69M variants included in association analysis. The WES VCF file contains the full set of 13,008 individuals that passed QC, with additional lists provided containing the 3.04M variants that passed QC (the VCF includes a small number of variants that failed QC) and the 12,940 samples and 2.93M variants included in association analysis. Additionally, the WES dataset includes lists of variants included in gene-level analysis for each of the four analyzed masks.

Five datasets of association statistics are also available for download. Association statistics for variants in the WGS panel are available from the whole-genome sequenced individuals or from those with imputed genotypes (Data Citation 1). Association statistics in the WES panel are available from the whole-exome sequenced individuals or from those genotyped on the exome array (Data Citation 2). Additionally, gene-level association statistics from the whole-exome sequenced individuals are available for each of the four variant masks (Data Citation 2).

A description of the datasets is available in Table 7. All association statistics are also available for browsing and searching via the public Type 2 Diabetes Knowledge Portal at www.type2diabetesgenetics.org. Through the portal, users can construct queries to find variants satisfying specified annotations and association thresholds, both across the WES and WGS analyses as well as other GWAS datasets. Users can also dynamically construct a set of variants within a gene and obtain a P-value from aggregate association analysis within the WES individuals.

Table 7 Summary of datasets.

Full size table

Technical Validation

Evaluation of variants in the WGS panel

We evaluated the variant sensitivity (fraction of true variant sites detected) of the WGS panel, based on the 2,538 individuals with data from all three experimental platforms (low-pass whole-genome sequencing, whole-exome sequencing, and HumanOmni2.5M array genotyping). To assess the sensitivity of low-pass whole-genome sequencing alone, we computed the fraction of variants detected from whole-exome sequencing that were also detected by low-pass whole-genome sequencing. Sensitivity estimates were 99.8, 99.0, and 48.2% for common (MAF>5%), low-frequency (0.5%<MAF<5%), and rare (MAF<0.5%) SNVs, respectively, and >99.9, 93.8, and 17.9% for common, low-frequency, and rare short indels, respectively. We also assessed the coding SNV sensitivity of low-pass whole-genome sequence data combined with exome sequence data, based on the proportion of HumanOmni2.5 SNVs detected by either sequencing platform. Because HumanOmni2.5 SNVs are enriched for common variants, we calculated an averaged sensitivity at each allele count, weighted by the number of exome-detected variants given the allele count. Sensitivity estimates were 99.9, 99.7, and 83.9% for common, low-frequency, and rare variants, respectively. These sensitivity estimates provide lower bounds on the sensitivity of the full WGS panel, which combines HumanOmni2.5 SNP array data as well as the two types of sequence data (Fig. 2a).

**Figure 2: Summary of key quality control metrics for WGS and WES panels.**

We further evaluated the genotype accuracy of the WGS panel for each of the three classes of variant (SNVs, indels, and SVs). Across chromosome 20, concordance of low-pass whole-genome-sequence-based SNV genotypes with exome-sequence-based genotypes was 99.86%, with homozygous reference, heterozygous, and homozygous non-reference concordances of 99.97, 98.34, and 99.72%, respectively. Concordance between exome-sequence-based SNV genotypes and HumanOmni2.5 genotypes was 99.4%, with homozygous reference, heterozygous, and homozygous non-reference concordances of 99.97, 99.69, and 99.88%, respectively. For indels genotyped with both low-pass whole-genome-sequence data and exome-sequence data, concordance was 99.4%, with homozygous reference, heterozygous, and homozygous non-reference concordances of 99.8, 95.8, and 98.6%, respectively.

To evaluate the genotype accuracy of SVs detected from the low-pass whole-genome sequence data, we took advantage of the 181 individuals in our study who were previously included in the WTCCC array-CGH based structural variant detection experiment³⁶. Taking the WTCCC data as a gold standard, we estimated genotype accuracy across 1,047 overlapping SVs (with reciprocal overlap >0.8) genome-wide. The overall genotype concordance was 99.8%, with homozygous reference, heterozygous, and homozygous non-reference concordances of 99.9, 99.6, and 99.7%, respectively.

Evaluation of variants in the WES panel

We assessed the overall sequencing quality of individuals in the WES panel by computing distributions of global statistics, stratified by reported ancestry (Fig. 2b). After quality control, the number of non-reference variants, mean heterozygosity, and average allele balance (fraction of non-reference reads at heterozygous sites) per individual approximately matched a Gaussian distribution within each ancestry. Concordance between genotypes from exome-sequence data and those from independent SNP arrays was above 99% for the vast majority of individuals, with non-reference concordance above 99.5% for individuals genotyped on the (highest-quality) OMNI array.

We also assessed bulk properties of indels within the WES panel. The length distribution of indels showed an excess of variants with lengths a multiple of three, as expected. Additionally, principal components computed from indels alone closely matched those computed from SNVs and indels together (Fig. 2c).

Evaluation of imputation from the WGS panel

We computed the mean imputation quality as measured by the average squared correlation between imputed genotypes and actual genotypes from leave-one-out cross-validation analysis. For variants of allele count ≈100 or above in the WGS panel (corresponding to a frequency of 1.8%), average r² values were in excess of 0.8 for Finnish individuals and in excess of 0.6 for British individuals (Fig. 3a).

**Figure 3: Completeness of additional variant genotyping.**

Evaluation of exome array sensitivity

We assessed the overlap of variants present on the exome array with those observed in the WES panel. As the exome array primarily contains SNVs that are predicted to be protein altering, we focused on nonsense, essential splice site, and missense variants; only variants passing QC in both sequence and array data were included in the assessment. The fraction of variants in the WES panel on the exome array was highest for Europeans, at 81.6%, and lowest in African-Americans, at 49.0% (Fig. 3b).

Evaluation of association tests

We used the genetic power calculator (http://zzz.bwh.harvard.edu/gpc/) to estimate power to detect T2D association for each of the single variant analyses. All calculations assumed a T2D prevalence of 8%. Figure 4 shows power estimates under optimistic scenarios, in which a variant is present in the WGS panel (Fig. 4a), observed at equal frequency in all ancestries in the WES panel (Fig. 4b), imputed with high (r²=0.8) accuracy (Fig. 4c), or included on the exome array (Fig. 4d). We also computed power for less optimistic scenarios³; if, for example, a MAF 1% variant is present in only one ancestry, it must have an odds ratio of ~3.5 to achieve significance of P=10⁻⁴, rather than an odds ratio of ~1.8 were it to be present in all ancestries.

**Figure 4: Power of single variant analysis in the WGS panel, WES panel, imputation, and exome array analyses.**

For gene-based tests in the WES panel, we used a simulated haplotype data set (http://cran.r-project.org/web/packages/SKAT/vignettes/SKAT.pdf) and estimated power as a function of (i) the phenotypic variance, under a liability scale, explained by additive genetic effects and (ii) the percentage of variants that were causal (50% or 100%). As for single-variant power calculations, we considered variants of constant frequency across all five ancestry groups, as well as variants specific to one ancestry group. These calculations suggested³ that, even under optimistic scenarios, genes must explain >1% of genetic variance in order to achieve a moderately significant (P<10⁻³) association in the WES panel.

To ensure that association statistics were well calibrated, we computed quantile-quantile (QQ) plots comparing observed statistics to those expected under the null distribution. The vast majority of statistics matched the expected distribution (suggesting good calibration of the association tests) with a deviation from the null for common variant associations from the WGS panel, imputed genotypes, and exome array genotypes (suggesting power to detect known positive control T2D-associated non-coding common variants).

Usage Notes

The WGS and WES panels may be useful for simulation-based approaches that require individual-level genotypes and phenotypes. In this case the full list of ‘QC+’ variants (not merely those included in the T2D analysis) should be used, as the association analysis omitted very rare variants that might be useful in other settings. The WGS panel can also serve as a reference panel for genotype imputation, particularly in cases where an excess of haplotypes from T2D cases are required. Although more recent and larger efforts such as the Haplotype Reference Consortium⁸ will provide greater imputation power for most use cases, the WGS panel is not restricted based on minor allele count and includes indels and SVs.

The most valuable data from the T2D-GENES and GoT2D studies are likely the catalogues of T2D association statistics for low-frequency and common variation. These statistics may prove useful for fine mapping or functional studies of T2D GWAS signals, in which enumeration of potential causal variants is required, or for ‘reverse genetic’ approaches, in which estimates of the phenotypic effects of variants with strong molecular effects are desired. For this usage, we advise investigators to query the T2D Knowledge Portal at www.type2diabetesgenetics.org as a first step, as its goal is to provide a simple and continuously updated means to query these and other association statistics. The portal is designed specifically for queries about individual variants or those within a single gene or genomic locus, as well as variant- or gene-level analyses for which investigators desire to adjust included variants, covariates, or individuals. Users should note that data from the T2D-GENES and GoT2D studies are included in, and thus should not be combined with data from, the Exome Aggregation Consortium (exac.broadinstitute.org).

Should investigators desire access to all association statistics, genome-wide, the files at EGA should be used (as the T2D Knowledge Portal does not support bulk download of association statistics). For single variant associations, a statistic in the largest available sample size should be used. Coding variant association statistics present in both the WES panel analysis and the exome chip analysis can be safely combined via meta-analysis, as can non-coding variant association statistics present in both the WGS panel analysis and the imputation-based analysis; statistics should not, however, be combined across the non-coding and coding variant analyses. Association results from the sequence or exome chip data should not need to be filtered, but it is advisable to filter results from the imputation-based analysis according to a threshold on imputation quality (e.g., r²>0.3). For gene-level analyses, investigators should first use the aggregate association statistics and then dissect results by examining variant-level statistics for each variant in the mask.

The pre-computed statistics should be sufficient for most investigators. Cases where recalculation of associations may be appropriate include (a) conditional analyses, such as variant association controlling for additional individual phenotypes or genotypes at different variants, (b) association analyses with phenotypes other than T2D, and (c) novel statistical tests. In these cases, usage of the tests and inclusion of the covariates described in the methods section is recommended, and only variants and individuals present in the final ‘QC+’ analysis lists should be included.

Additional information

How to cite this article: Flannick, J. et al. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls. Sci. Data 4:170179 doi: 10.1038/sdata.2017.179 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article ADS CAS Google Scholar
Flannick, J. & Florez, J. C. Type 2 diabetes: genetic data sharing to advance complex disease research. Nature Reviews Genetics 17, 535–549 (2016).
Article CAS Google Scholar
Fuchsberger, C. et al. The genetic architecture of type 2 diabetes. Nature 536, 41–47 (2016).
Article ADS CAS Google Scholar
Bodmer, W. & Bonilla, C. Common and rare variants in multifactorial susceptibility to common diseases. Nature Genetics 40, 695–701 (2008).
Article CAS Google Scholar
Cirulli, E. T. & Goldstein, D. B. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nature Reviews Genetics 11, 415–425 (2010).
Article CAS Google Scholar
Flannick, J. et al. Loss-of-function mutations in SLC30A8 protect against type 2 diabetes. Nature Genetics 46, 357–363 (2014).
Article CAS Google Scholar
Majithia, A. R. et al. Rare variants in PPARG with decreased activity in adipocyte differentiation are associated with increased risk of type 2 diabetes. Proceedings of the National Academy of Sciences of the United States of America 111, 13127–13132 (2014).
Article ADS CAS Google Scholar
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nature Genetics 48, 1279–1283 (2016).
Article CAS Google Scholar
Wang, S. R. et al. Simulation of Finnish population history, guided by empirical genetic data, to assess power of rare-variant tests in Finland. American Journal of Human Genetics 94, 710–720 (2014).
Article CAS Google Scholar
Agarwala, V., Flannick, J., Sunyaev, S., Go, T. D. C. & Altshuler, D. Evaluating empirical bounds on complex disease genetic architecture. Nature Genetics 45, 1418–1427 (2013).
Article CAS Google Scholar
Guey, L. T. et al. Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants. Genetic Epidemiology 35, 236–246 (2011).
PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research 20, 1297–1303 (2010).
Article CAS Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics 43, 491–498 (2011).
Article CAS Google Scholar
Jun, G. et al. Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. American Journal of Human Genetics 91, 839–848 (2012).
Article CAS Google Scholar
Jun, G., Wing, M. K., Abecasis, G. R. & Kang, H. M. An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data. Genome Research 25, 918–925 (2015).
Article CAS Google Scholar
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Article ADS Google Scholar
Handsaker, R. E., Korn, J. M., Nemesh, J. & McCarroll, S. A. Discovery and genotyping of genome structural polymorphism by sequencing on a population scale. Nature Genetics 43, 269–276 (2011).
Article CAS Google Scholar
Browning, S. R. & Browning, B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. American Journal of Human Genetics 81, 1084–1097 (2007).
Article CAS Google Scholar
Li, Y., Sidore, C., Kang, H. M., Boehnke, M. & Abecasis, G. R. Low-coverage sequencing: implications for design of complex trait association studies. Genome Research 21, 940–951 (2011).
Article CAS Google Scholar
Price, A. L. et al. Long-range LD can confound genome scans in admixed populations. American Journal of Human Genetics 83, 132–135, author reply 135-139 (2008).
Article CAS Google Scholar
Weale, M. E. Quality control for genome-wide association studies. Methods in Molecular Biology 628, 341–372 (2010).
Article CAS Google Scholar
Korn, J. M. et al. Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nature Genetics 40, 1253–1260 (2008).
Article CAS Google Scholar
Goldstein, J. I. et al. zCall: a rare variant caller for array-based genotyping: genetics and population analysis. Bioinformatics 28, 2543–2545 (2012).
Article CAS Google Scholar
Firth, D. Bias reduction of maximum-likelihood-estimates. Biometrika 80, 27–38 (1993).
Article MathSciNet MATH Google Scholar
Ma, C., Blackwell, T., Boehnke, M., Scott, L. J. & Go, T. D. i. Recommended joint and meta-analysis strategies for case-control association testing of single low-count variants. Genetic Epidemiology 37, 539–550 (2013).
Article Google Scholar
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nature Genetics 39, 906–913 (2007).
Article CAS Google Scholar
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Article CAS MATH Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS Google Scholar
Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nature Genetics 42, 348–354 (2010).
Article CAS Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Article CAS Google Scholar
McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biology 17, 122 (2016).
Article Google Scholar
Purcell, S. M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–190 (2014).
Article ADS CAS Google Scholar
Lee, S., Teslovich, T. M., Boehnke, M. & Lin, X. General framework for meta-analysis of rare variants in sequencing association studies. American Journal of Human Genetics 93, 42–53 (2013).
Article CAS Google Scholar
Lee, S., Wu, M. C. & Lin, X. Optimal tests for rare variant effects in sequencing association studies. Biostatistics 13, 762–775 (2012).
Article Google Scholar
Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).
Article Google Scholar

Data Citations

The European Genome-phenome Archive EGAS00001001459 (2016)
The European Genome-phenome Archive EGAS00001001460 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001097.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001099.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001098.v1.p1 (2016)
Duggirala, R. dbGAP phs000849.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001096.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001095.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001093.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001100.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs001102.v1.p1 (2016)
Altshuler, D., Boehnke, M., McCarthy, M., & Florez, J. dbGAP phs000840.v1.p1 (2016)

Download references

Acknowledgements

Grant support and acknowledgments are listed in the Supplementary Information.

Author information

Jason Flannick, Christian Fuchsberger and Anubha Mahajan: These authors contributed equally to this work.
David Altshuler, Michael Boehnke and Mark I. McCarthy: These authors jointly supervised this work.
Peter S. Chines and Hanna E. Abboud: Deceased.

Authors and Affiliations

Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts, USA
Jason Flannick, Hyun Min Kang & David Altshuler
Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, USA
Jason Flannick, Vineeta Agarwala, Lizz Caulkins, Ryan Koesterer, Cecilia M. Lindgren, Christopher Hartl, Todd Green, Alisa Manning, Jason Carey, George Grant, Benjamin M. Neale, Shaun Purcell, Tõnu Esko, Mauricio O. Carneiro, Jared Maguire, Ryan Poplin, Khalid Shakir, Timothy Fennell, Mark DePristo, Jacquelyn Murphy, Robert Onofrio, Eric Banks, Stacey Gabriel, David Altshuler, Noël P. Burtt & Jose C. Florez
Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
Christian Fuchsberger, Tanya M. Teslovich, Clement Ma, Xueling Sim, Thomas W. Blackwell, Adam E. Locke, Anne U. Jackson, Jeroen R. Huyghe, Heather M. Stringham, Keng-Han Lin, Ryan P. Welch, Phoenix Kwan, Goo Jun, Goncalo Abecasis, Laura J. Scott & Michael Boehnke
Nuffield Department of Medicine, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
Anubha Mahajan, Kyle J. Gaulton, Loukas Moutsianas, Davis J. McCarthy, Manuel A. Rivas, John R. B. Perry, Neil R. Robertson, N William Rayner, Juan Fernandez Tajes, Cecilia M. Lindgren, Martijn van de Bunt, Richard D. Pearson, Ashish Kumar, Yuhui Chen, Teresa Ferreira, Momoko Horikoshi, Erik Ingelsson, Inga Prokopenko, Anna L. Gloyn, Peter Donnelly, Gilean McVean, Andrew P. Morris & Mark I. McCarthy
Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
Vineeta Agarwala
Department of Statistics, University of Oxford, Oxford, UK
Davis J. McCarthy & Peter Donnelly
Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK
John R. B. Perry, Dorota Pasko, Andrew R. Wood & Timothy M. Frayling
MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
John R. B. Perry, Robert A. Scott, Claudia Langenberg & Nicholas J. Wareham
Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
John R. B. Perry, Massimo Mangino, Gabriela L. Surdulescu, Dylan Hodgkiss, Kerrin S. Small & Timothy D. Spector
Radcliffe Department of Medicine, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK
Neil R. Robertson, N William Rayner, Martijn van de Bunt, Nicola L. Beer, Momoko Horikoshi, Jonathan C. Levy, Christopher J. Groves, Matt Neville, Fredrik Karpe, Inga Prokopenko, Katharine R. Owen, Anna L. Gloyn & Mark I. McCarthy
Department of Human Genetics, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
N William Rayner, Aaron G. Day-Williams, John Danesh, Kathleen Stirrups, Panos Deloukas, Manjinder Sandhu, Inês Barroso & Eleftheria Zeggini
School of Computer Science, McGill University, Montreal, Quebec, Canada
Pablo Cingolani
McGill University and Génome Québec Innovation Centre, Montreal, Quebec, Canada
Pablo Cingolani, Yoshihiko Nagai & Rob Sladek
Human Genetics Center, The University of Texas Graduate School of Biomedical Sciences at Houston, The University of Texas Health Science Center at Houston, Houston, Texas, USA
Heather M. Highland
Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, USA
Josee Dupuis, Han Chen & Dennis Rybin
National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts, USA
Josee Dupuis
Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
Peter S. Chines, Lori L. Bonnycastle, Narisu Narisu, Amy Swift & Francis S. Collins
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
Han Chen & Liming Liang
Chronic Disease Epidemiology, Swiss Tropical and Public Health Institute, University of Basel, Basel, Switzerland
Ashish Kumar
Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Martina Müller-Nurasyid, Janina S. Ried, Christian Gieger & Konstantin Strauch
Department of Medicine I, University Hospital Grosshadern, Ludwig-Maximilians-Universität, Munich, Germany
Martina Müller-Nurasyid
Chair of Genetic Epidemiology, IBE, Faculty of Medicine, LMU Munich, Germany
Martina Müller-Nurasyid & Konstantin Strauch
DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
Martina Müller-Nurasyid & Annette Peters
The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Niels Grarup, Jette Bork-Jensen, Mette Hollensted, Johanne Marie Justesen, Anette P. Gjesing, Torben Hansen & Oluf Pedersen
Department of Medicine, Section of Genetic Medicine, The University of Chicago, Chicago, Illinois, USA
Eric R. Gamazon, Hae Kyung Im & Nancy J. Cox
Department of Statistics, Seoul National University, Seoul, Republic of Korea
Jaehoon Lee, Iksoo Huh, Yongkang Kim, Selyeong Lee & Taesung Park
Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas, USA
Jennifer E. Below, Taylor J. Maxwell, Goo Jun & Craig L. Hanis
Saw Swee Hock School of Public Health, National University of Singapore, National University Health System, Singapore, Singapore
Peng Chen, Xu Wang, Ching-Yu Cheng, Chiea-Chuen Khor, Wei Yen Lim, Jianjun Liu, Kee Seng Chia, Yik Ying Teo & E. Shyong Tai
Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
Jinyan Huang, Frank B. Hu & Liming Liang
Center for Genome Science, Korea National Institute of Health, Chungcheongbuk-do, Republic of Korea
Min Jin Go, Bong-Jo Kim, Young Jin Kim, Juyoung Lee, Bok-Ghee Han & Jong-Young Lee
The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut, USA
Michael L. Stitzel
Departments of Computational Medicine & Bioinformatics and Human Genetics, University of Michigan, Ann Arbor, Michigan, USA
Stephen C. J. Parker
Department of Clinical Sciences, Lund University Diabetes Centre, Genetic and Molecular Epidemiology Unit, Lund University, Malmö, Sweden
Tibor V. Varga & Paul W. Franks
Department of Epidemiology, Colorado School of Public Health, University of Colorado, Aurora, Colorado, USA
Tasha Fingerlin
Department of Endocrinology and Metabolism, Shanghai Diabetes Institute, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China
Cheng Hu & Weiping Jia
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
Mohammad Kamran Ikram, Tin Aung, Ching-Yu Cheng, Chiea-Chuen Khor & Tien Yin Wong
Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, National University Health System, Singapore, Singapore
Mohammad Kamran Ikram, Tin Aung, Ching-Yu Cheng, Chiea-Chuen Khor & Tien Yin Wong
The Eye Academic Clinical Programme, Duke-NUS Graduate Medical School, Singapore, Singapore
Mohammad Kamran Ikram, Tin Aung, Ching-Yu Cheng & Tien Yin Wong
Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
Min-Seok Kwon, Joon Yoon & Taesung Park
Department of Human Genetics, McGill University, Montreal, Quebec, Canada
Yoshihiko Nagai & Rob Sladek
Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
Yoshihiko Nagai
Department of Epidemiology and Biostatistics, Imperial College London, London, UK
Weihua Zhang, Uzma Afzal, Benjamin Lehne, Marie Loh, William R. Scott, Paul Elliott & John C. Chambers
Department of Cardiology, Ealing Hospital NHS Trust, Southall, Middlesex, UK
Weihua Zhang, Sian-Tsung Tan, Jaspal Singh Kooner & John C. Chambers
Departments of Medicine and Genetics, Albert Einstein College of Medicine, New York, USA
Nir Barzilai & Gil Atzmon
Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania—Perelman School of Medicine, Philadelphia, Pennsylvania, USA
Benjamin F. Voight
Department of Genetics, University of Pennsylvania—Perelman School of Medicine, Philadelphia, Pennsylvania, USA
Benjamin F. Voight
Department of Medicine, University of Texas Health Science Center, San Antonio, Texas, USA
Christopher P. Jenkinson, Hanna E. Abboud, Sharon P. Fowler, Farook Thameem, Donna M. Lehman & Ralph A. DeFronzo
Research, South Texas Veterans Health Care System, San Antonio, Texas, USA
Christopher P. Jenkinson
Faculty of Health Sciences, Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
Teemu Kuulasmaa, Johanna Kuusisto, Alena Stančáková & Markku Laakso
Kuopio University Hospital, Kuopio, Finland
Johanna Kuusisto & Markku Laakso
Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA
Maggie C. Y. Ng, Nicholette D. Palmer, Pamela J. Hicks & Donald W. Bowden
Center for Diabetes Research, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA
Maggie C. Y. Ng, Nicholette D. Palmer, Pamela J. Hicks & Donald W. Bowden
Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA
Nicholette D. Palmer, Pamela J. Hicks & Donald W. Bowden
Centre for Research in Epidemiology and Population Health, Inserm U1018, Villejuif, France
Beverley Balkau
German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany
Heiner Boeing
Department of Public Health and Caring Sciences, Geriatrics, Uppsala University, Uppsala, Sweden
Vilmantas Giedraitis & Lars Lannfelt
Centre for Chronic Disease Control, New Delhi, India
Dorairaj Prabhakaran & Shah B. Ebrahim
The Charles Bronfman Institute for Personalized Medicine, The Icahn School of Medicine at Mount Sinai, New York, USA
Omri Gottesman, Yingchang Lu, Erwin P. Bottinger & Ruth J. F. Loos
National Heart and Lung Institute, Cardiovascular Sciences, Hammersmith Campus, Imperial College London, London, UK
James Scott, Sian-Tsung Tan & Jaspal Singh Kooner
Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA
Joshua D. Smith
Department of Medicine, Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
Benjamin M. Neale & Mark J. Daly
Department of Medicine, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
Shaun Purcell & Jose C. Florez
Department of Psychiatry, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, USA
Shaun Purcell
Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
Adam S. Butterworth, Joanna M. M. Howson, John Danesh & Manjinder Sandhu
Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong, China
Heung Man Lee, Vincent K. L. Lam, Wing Yee So, Claudia H. T. Tam, Juliana CN Chan & Ronald C. W. Ma
Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
Soo-Heon Kwak
Department of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Wei Zhao
Department of Public Health and Primary Care, NIHR Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, UK
John Danesh
Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, and College of Medicine, Seoul National University, Seoul, Republic of Korea
Kyong Soo Park
Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Danish Saleheen
Center for Non-Communicable Diseases, Karachi, Pakistan
Danish Saleheen
Cardiovascular Division, Baylor College of Medicine, Houston, Texas, USA
David Aguilar
Department of Pediatrics, University of Texas Health Science Center, San Antonio, Texas, USA
Rector Arya & Daniel Esten Hale
Department of Medicine, Yong Loo Lin School of Medicine, National University of Singapore, National University Health System, Singapore, Singapore
Edmund Chan & E. Shyong Tai
Department of Epidemiology, Murcia Regional Health Council, IMIB-Arrixaca, Murcia, Spain
Carmen Navarro
CIBER Epidemiología y Salud Pública (CIBERESP), Spain
Carmen Navarro
Unit of Preventive Medicine and Public Health, School of Medicine, University of Murcia, Spain
Carmen Navarro
Cancer Research and Prevention Institute (ISPO), Florence, Italy
Domenico Palli
Department of Medicine, University of Mississippi Medical Center, Jackson, Mississippi, USA
Adolfo Correa & Herman A. Taylor
South Texas Diabetes and Obesity Institute, Regional Academic Health Center, University of Texas Health Science Center at San Antonio/University of Texas Rio Grande Valley, Brownsville, Texas, USA
Joanne E. Curran, Satish Kumar & John Blangero
Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas, USA
Vidya S. Farook, Sobha Puppala & Ravindranath Duggirala
Department of Internal Medicine, Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA
Barry I. Freedman
Center of Biostatistics and Bioinformatics, University of Mississippi Medical Center, Jackson, Mississippi, USA
Michael Griswold
Department of Paediatrics, Yong Loo Lin School of Medicine, National University of Singapore, National University Health System, Singapore, Singapore
Chiea-Chuen Khor
Division of Human Genetics, Genome Institute of Singapore, A*STAR, Singapore, Singapore
Chiea-Chuen Khor & Jianjun Liu
CNRS-UMR8199, Lille University, Lille Pasteur Institute, Lille, France
Dorothée Thuillier, Loïc Yengo & Philippe Froguel
Institute of Health Sciences, University of Oulu, Oulu, Finland
Marie Loh
Translational Laboratory in Genetic Medicine (TLGM), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Marie Loh
Jackson Heart Study, University of Mississippi Medical Center, Jackson, Mississippi, USA
Solomon K. Musani
College of Public Services, Jackson State University, Jackson, Mississippi, USA
Gregory Wilson
Department of Clinical Science, KG Jebsen Center for Diabetes Research, University of Bergen, Bergen, Norway
Pål Rasmus Njølstad
Department of Pediatrics, Haukeland University Hospital, Bergen, Norway
Pål Rasmus Njølstad
NIHR Biomedical Research Centre at Guy’s and St Thomas’ Foundation Trust, London, UK
Massimo Mangino
Institute of Human Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Thomas Schwarzmayr, Thomas Wieland, Tim M. Strom & Thomas Meitinger
Department of Clinical Sciences, Diabetes and Endocrinology, Lund University Diabetes Centre, Malmö, Sweden
João Fadista, Jasmina Kravic, Valeri Lyssenko, Claes Ladenvall, Anders H. Rosengren & Leif Groop
Institute of Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University, Düsseldorf, Germany
Christian Herder & Michael Roden
German Center for Diabetes Research (DZD), München-Neuherberg, Germany
Christian Herder, Jennifer Kriebel, Wolfgang Rathmann, Michael Roden, Martin Hrabé de Angelis, Barbara Thorand, Christa Meisinger, Annette Peters, Cornelia Huth & Harald Grallert
Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
Ivan Brandslund
Department of Clinical Biochemistry, Vejle Hospital, Vejle, Denmark
Ivan Brandslund
Department of Internal Medicine and Endocrinology, Vejle Hospital, Vejle, Denmark
Cramer Christensen
Department of Health, National Institute for Health and Welfare, Helsinki, Finland
Heikki A. Koistinen & Leena Kinnunen
Abdominal Center: Endocrinology, University of Helsinki and Helsinki University Central Hospital, Helsinki, Finland
Heikki A. Koistinen, Liisa Hakaste & Tiinamaija Tuomi
Minerva Foundation Institute for Medical Research, Helsinki, Finland
Heikki A. Koistinen
Department of Medicine, University of Helsinki and Helsinki University Central Hospital, Helsinki, Finland
Heikki A. Koistinen
Division of Cardiovascular and Diabetes Medicine, Medical Research Institute, Ninewells Hospital and Medical School, Dundee, UK
Alex S. F. Doney
Estonian Genome Center, University of Tartu, Tartu, Estonia
Tõnu Esko, Lili Milani, Evelin Mihailov, Andres Metspalu, Reedik Mägi & Andrew P. Morris
Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
Tõnu Esko & David Altshuler
Division of Endocrinology, Boston Children's Hospital, Boston, Massachusetts, USA
Tõnu Esko
Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
Andrew J. Farmer
Folkhälsan Research Centre, Helsinki, Finland
Liisa Hakaste, Bo Isomaa & Tiinamaija Tuomi
Research Programs Unit, Diabetes and Obesity, University of Helsinki, Helsinki, Finland
Liisa Hakaste & Tiinamaija Tuomi
Steno Diabetes Center, Gentofte, Denmark
Marit E. Jørgensen
Research Centre for Prevention and Health, Capital Region of Denmark, Glostrup, Denmark
Torben Jørgensen & Allan Linneberg
Department of Public Health, Institute of Health Sciences, University of Copenhagen, Copenhagen, Denmark
Torben Jørgensen
Faculty of Medicine, Aalborg University, Aalborg, Denmark
Torben Jørgensen
Department of Primary Health Care, Vaasa Central Hospital, Vaasa, Finland
Annemari Käräjämäki
Diabetes Center, Vaasa Health Care Center, Vaasa, Finland
Annemari Käräjämäki
Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Jennifer Kriebel, Barbara Thorand, Christa Meisinger, Annette Peters, Cornelia Huth, Harald Grallert & Christian Gieger
Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Jennifer Kriebel, Harald Grallert, Christian Gieger & Thomas Illig
Institute for Biometrics and Epidemiology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University, Düsseldorf, Germany
Wolfgang Rathmann
Department of Public Health, Section of General Practice, Aarhus University, Aarhus, Denmark
Torsten Lauritzen
Department of Clinical Experimental Research, Rigshospitalet, Glostrup, Denmark
Allan Linneberg
Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Allan Linneberg
Department of Clinical Sciences, Hypertension and Cardiovascular Disease, Lund University, Malmö, Sweden
Olle Melander
Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Trust, Oxford, UK
Matt Neville, Fredrik Karpe, Katharine R. Owen, Anna L. Gloyn & Mark I. McCarthy
Department of Clinical Sciences, Diabetes and Cardiovascular Disease, Genetic Epidemiology, Lund University, Malmö, Sweden
Marju Orho-Melander
Department of Nutrition, Harvard School of Public Health, Boston, Massachusetts, USA
Lu Qi, Qibin Qi, Frank B. Hu & Paul W. Franks
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, Massachusetts, USA
Lu Qi
Department of Epidemiology and Population Health, Albert Einstein College of Medicine, New York, USA
Qibin Qi
Division of Endocrinology and Diabetology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
Michael Roden
Department of Public Health and Clinical Medicine, Umeå University, Umeå, Sweden
Olov Rolandsson & Paul W. Franks
Nuffield Department of Medicine, High Throughput Genomics, Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
Christine Blancher, Gemma Buck, Joseph Trakalo & David Buck
Institute of Experimental Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Martin Hrabé de Angelis
Center of Life and Food Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
Martin Hrabé de Angelis
William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK
Panos Deloukas
Princess Al-Jawhara Al-Brahim Centre of Excellence in Research of Hereditary Disorders (PACER-HD), King Abdulaziz University, Jeddah, Saudi Arabia
Panos Deloukas
Department of Clinical Sciences, Medicine, Lund University, Malmö, Sweden
Peter Nilsson
Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark
Torben Hansen
Department of Social Services and Health Care, Jakobstad, Finland
Bo Isomaa
Metabolic Research Laboratories, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
Stephen P O'Rahilly & Inês Barroso
Pat Macpherson Centre for Pharmacogenetics and Pharmacogenomics, Medical Research Institute, Ninewells Hospital and Medical School, Dundee, UK
Colin N. A. Palmer
Foundation for Research in Health, Exercise and Nutrition, Kuopio Research Institute of Exercise Medicine, Kuopio, Finland
Rainer Rauramaa
Center for Vascular Prevention, Danube University Krems, Krems, Austria
Jaakko Tuomilehto
Diabetes Research Group, King Abdulaziz University, Jeddah, Saudi Arabia
Jaakko Tuomilehto
Dasman Diabetes Institute, Dasman, Kuwait
Jaakko Tuomilehto
National Institute for Health and Welfare, Helsinki, Finland
Jaakko Tuomilehto & Veikko Salomaa
Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
Richard M. Watanabe
Department of Physiology & Biophysics, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
Richard M. Watanabe
Diabetes and Obesity Research Institute, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
Richard M. Watanabe
Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
Ann-Christine Syvänen
Cedars-Sinai Diabetes and Obesity Research Institute, Los Angeles, California, USA
Richard N. Bergman
Functional Genomics Unit, CSIR-Institute of Genomics & Integrative Biology (CSIR-IGIB), New Delhi, India
Dwaipayan Bharadwaj
Department of Biomedical Science, Hallym University, Chuncheon, Republic of Korea
Yoon Shin Cho
CSIR-Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India
Giriraj R. Chandak
Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong, China
Juliana CN Chan & Ronald C. W. Ma
Hong Kong Institute of Diabetes and Obesity, The Chinese University of Hong Kong, Hong Kong, China
Juliana CN Chan & Ronald C. W. Ma
MRC-PHE Centre for Environment and Health, Imperial College London, London, UK
Paul Elliott
The Biostatistics Center, The George Washington University, Rockville, Maryland, USA
Kathleen A. Jablonski
Department of Medicine, Division of Endocrinology, Diabetes and Nutrition, and Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA
Toni I. Pollin
Department of Endocrinology and Metabolism, All India Institute of Medical Sciences, New Delhi, India
Nikhil Tandon
Department of Genomics of Common Disease, School of Public Health, Imperial College London, London, UK
Philippe Froguel & Inga Prokopenko
Life Sciences Institute, National University of Singapore, Singapore, Singapore
Yik Ying Teo
Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore
Yik Ying Teo
Endocrinology and Metabolism Service, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
Benjamin Glaser
The Medical School, Institute of Cellular Medicine, Newcastle University, Newcastle, UK
Mark Walker
Department of Medical Sciences, Molecular Epidemiology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
Erik Ingelsson
Hannover Unified Biobank, Hannover Medical School, Hanover, Germany
Thomas Illig
Department of Human Genetics, Hannover Medical School, Hanover, Germany
Thomas Illig
Department of Medical Sciences, Uppsala University, Uppsala, Sweden
Lars Lind
Data Sciences and Data Engineering, Broad Institute, Cambridge, Massachusetts, USA
Yossi Farjoun
Finnish Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland
Tiinamaija Tuomi & Leif Groop
Imperial College Healthcare NHS Trust, Imperial College London, London, UK
Jaspal Singh Kooner & John C. Chambers
Clinical Research Centre, Centre for Molecular Medicine, Ninewells Hospital and Medical School, Dundee, UK
Andrew D. Morris
The Usher Institute to the Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, UK
Andrew D. Morris
University of Exeter Medical School, University of Exeter, Exeter, UK
Andrew T. Hattersley
Department of Natural Science, University of Haifa, Haifa, Israel
Gil Atzmon
Institute of Human Genetics, Technische Universität München, Munich, Germany
Tim M. Strom & Thomas Meitinger
Departments of Medicine and Human Genetics, The University of Chicago, Chicago, Illinois, USA
Graeme I. Bell
Cardiovascular & Metabolic Disorders Program, Duke-NUS Medical School Singapore, Singapore, Singapore
E. Shyong Tai
Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
Gilean McVean
Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, Mississippi, USA
James G. Wilson
Department of Laboratory Medicine & Institute for Human Genetics, University of California, San Francisco, San Francisco, California, USA
Mark Seielstad
Blood Systems Research Institute, San Francisco, California, USA
Mark Seielstad
General Medicine Division, Massachusetts General Hospital and Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
James B. Meigs
Division of Endocrinology and Metabolism, Department of Medicine, McGill University, Montreal, Quebec, Canada
Rob Sladek
Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
Eric S. Lander
Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA
Karen L. Mohlke
Department of Biostatistics, University of Liverpool, Liverpool, UK
Andrew P. Morris
Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
David Altshuler & Jose C. Florez
Department of Medicine, Diabetes Research Center (Diabetes Unit), Massachusetts General Hospital, Boston, Massachusetts, USA
David Altshuler & Jose C. Florez
Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
David Altshuler

Authors

Jason Flannick
View author publications
You can also search for this author in PubMed Google Scholar
Christian Fuchsberger
View author publications
You can also search for this author in PubMed Google Scholar
Anubha Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Tanya M. Teslovich
View author publications
You can also search for this author in PubMed Google Scholar
Vineeta Agarwala
View author publications
You can also search for this author in PubMed Google Scholar
Kyle J. Gaulton
View author publications
You can also search for this author in PubMed Google Scholar
Lizz Caulkins
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Koesterer
View author publications
You can also search for this author in PubMed Google Scholar
Clement Ma
View author publications
You can also search for this author in PubMed Google Scholar
Loukas Moutsianas
View author publications
You can also search for this author in PubMed Google Scholar
Davis J. McCarthy
View author publications
You can also search for this author in PubMed Google Scholar
Manuel A. Rivas
View author publications
You can also search for this author in PubMed Google Scholar
John R. B. Perry
View author publications
You can also search for this author in PubMed Google Scholar
Xueling Sim
View author publications
You can also search for this author in PubMed Google Scholar
Thomas W. Blackwell
View author publications
You can also search for this author in PubMed Google Scholar
Neil R. Robertson
View author publications
You can also search for this author in PubMed Google Scholar
N William Rayner
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Cingolani
View author publications
You can also search for this author in PubMed Google Scholar
Adam E. Locke
View author publications
You can also search for this author in PubMed Google Scholar
Juan Fernandez Tajes
View author publications
You can also search for this author in PubMed Google Scholar
Heather M. Highland
View author publications
You can also search for this author in PubMed Google Scholar
Josee Dupuis
View author publications
You can also search for this author in PubMed Google Scholar
Peter S. Chines
View author publications
You can also search for this author in PubMed Google Scholar
Cecilia M. Lindgren
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Hartl
View author publications
You can also search for this author in PubMed Google Scholar
Anne U. Jackson
View author publications
You can also search for this author in PubMed Google Scholar
Han Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen R. Huyghe
View author publications
You can also search for this author in PubMed Google Scholar
Martijn van de Bunt
View author publications
You can also search for this author in PubMed Google Scholar
Richard D. Pearson
View author publications
You can also search for this author in PubMed Google Scholar
Ashish Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Martina Müller-Nurasyid
View author publications
You can also search for this author in PubMed Google Scholar
Niels Grarup
View author publications
You can also search for this author in PubMed Google Scholar
Heather M. Stringham
View author publications
You can also search for this author in PubMed Google Scholar
Eric R. Gamazon
View author publications
You can also search for this author in PubMed Google Scholar
Jaehoon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Yuhui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Robert A. Scott
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer E. Below
View author publications
You can also search for this author in PubMed Google Scholar
Peng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jinyan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Min Jin Go
View author publications
You can also search for this author in PubMed Google Scholar
Michael L. Stitzel
View author publications
You can also search for this author in PubMed Google Scholar
Dorota Pasko
View author publications
You can also search for this author in PubMed Google Scholar
Stephen C. J. Parker
View author publications
You can also search for this author in PubMed Google Scholar
Tibor V. Varga
View author publications
You can also search for this author in PubMed Google Scholar
Todd Green
View author publications
You can also search for this author in PubMed Google Scholar
Nicola L. Beer
View author publications
You can also search for this author in PubMed Google Scholar
Aaron G. Day-Williams
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Tasha Fingerlin
View author publications
You can also search for this author in PubMed Google Scholar
Momoko Horikoshi
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Hu
View author publications
You can also search for this author in PubMed Google Scholar
Iksoo Huh
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Kamran Ikram
View author publications
You can also search for this author in PubMed Google Scholar
Bong-Jo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Yongkang Kim
View author publications
You can also search for this author in PubMed Google Scholar
Young Jin Kim
View author publications
You can also search for this author in PubMed Google Scholar
Min-Seok Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Juyoung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Selyeong Lee
View author publications
You can also search for this author in PubMed Google Scholar
Keng-Han Lin
View author publications
You can also search for this author in PubMed Google Scholar
Taylor J. Maxwell
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihiko Nagai
View author publications
You can also search for this author in PubMed Google Scholar
Xu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ryan P. Welch
View author publications
You can also search for this author in PubMed Google Scholar
Joon Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Weihua Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Nir Barzilai
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin F. Voight
View author publications
You can also search for this author in PubMed Google Scholar
Bok-Ghee Han
View author publications
You can also search for this author in PubMed Google Scholar
Christopher P. Jenkinson
View author publications
You can also search for this author in PubMed Google Scholar
Teemu Kuulasmaa
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Kuusisto
View author publications
You can also search for this author in PubMed Google Scholar
Alisa Manning
View author publications
You can also search for this author in PubMed Google Scholar
Maggie C. Y. Ng
View author publications
You can also search for this author in PubMed Google Scholar
Nicholette D. Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Beverley Balkau
View author publications
You can also search for this author in PubMed Google Scholar
Alena Stančáková
View author publications
You can also search for this author in PubMed Google Scholar
Hanna E. Abboud
View author publications
You can also search for this author in PubMed Google Scholar
Heiner Boeing
View author publications
You can also search for this author in PubMed Google Scholar
Vilmantas Giedraitis
View author publications
You can also search for this author in PubMed Google Scholar
Dorairaj Prabhakaran
View author publications
You can also search for this author in PubMed Google Scholar
Omri Gottesman
View author publications
You can also search for this author in PubMed Google Scholar
James Scott
View author publications
You can also search for this author in PubMed Google Scholar
Jason Carey
View author publications
You can also search for this author in PubMed Google Scholar
Phoenix Kwan
View author publications
You can also search for this author in PubMed Google Scholar
George Grant
View author publications
You can also search for this author in PubMed Google Scholar
Joshua D. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin M. Neale
View author publications
You can also search for this author in PubMed Google Scholar
Shaun Purcell
View author publications
You can also search for this author in PubMed Google Scholar
Adam S. Butterworth
View author publications
You can also search for this author in PubMed Google Scholar
Joanna M. M. Howson
View author publications
You can also search for this author in PubMed Google Scholar
Heung Man Lee
View author publications
You can also search for this author in PubMed Google Scholar
Yingchang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Soo-Heon Kwak
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhao
View author publications
You can also search for this author in PubMed Google Scholar
John Danesh
View author publications
You can also search for this author in PubMed Google Scholar
Vincent K. L. Lam
View author publications
You can also search for this author in PubMed Google Scholar
Kyong Soo Park
View author publications
You can also search for this author in PubMed Google Scholar
Danish Saleheen
View author publications
You can also search for this author in PubMed Google Scholar
Wing Yee So
View author publications
You can also search for this author in PubMed Google Scholar
Claudia H. T. Tam
View author publications
You can also search for this author in PubMed Google Scholar
Uzma Afzal
View author publications
You can also search for this author in PubMed Google Scholar
David Aguilar
View author publications
You can also search for this author in PubMed Google Scholar
Rector Arya
View author publications
You can also search for this author in PubMed Google Scholar
Tin Aung
View author publications
You can also search for this author in PubMed Google Scholar
Edmund Chan
View author publications
You can also search for this author in PubMed Google Scholar
Carmen Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Ching-Yu Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Domenico Palli
View author publications
You can also search for this author in PubMed Google Scholar
Adolfo Correa
View author publications
You can also search for this author in PubMed Google Scholar
Joanne E. Curran
View author publications
You can also search for this author in PubMed Google Scholar
Dennis Rybin
View author publications
You can also search for this author in PubMed Google Scholar
Vidya S. Farook
View author publications
You can also search for this author in PubMed Google Scholar
Sharon P. Fowler
View author publications
You can also search for this author in PubMed Google Scholar
Barry I. Freedman
View author publications
You can also search for this author in PubMed Google Scholar
Michael Griswold
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Esten Hale
View author publications
You can also search for this author in PubMed Google Scholar
Pamela J. Hicks
View author publications
You can also search for this author in PubMed Google Scholar
Chiea-Chuen Khor
View author publications
You can also search for this author in PubMed Google Scholar
Satish Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Lehne
View author publications
You can also search for this author in PubMed Google Scholar
Dorothée Thuillier
View author publications
You can also search for this author in PubMed Google Scholar
Wei Yen Lim
View author publications
You can also search for this author in PubMed Google Scholar
Jianjun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Marie Loh
View author publications
You can also search for this author in PubMed Google Scholar
Solomon K. Musani
View author publications
You can also search for this author in PubMed Google Scholar
Sobha Puppala
View author publications
You can also search for this author in PubMed Google Scholar
William R. Scott
View author publications
You can also search for this author in PubMed Google Scholar
Loïc Yengo
View author publications
You can also search for this author in PubMed Google Scholar
Sian-Tsung Tan
View author publications
You can also search for this author in PubMed Google Scholar
Herman A. Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Farook Thameem
View author publications
You can also search for this author in PubMed Google Scholar
Gregory Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Tien Yin Wong
View author publications
You can also search for this author in PubMed Google Scholar
Pål Rasmus Njølstad
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan C. Levy
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Mangino
View author publications
You can also search for this author in PubMed Google Scholar
Lori L. Bonnycastle
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Schwarzmayr
View author publications
You can also search for this author in PubMed Google Scholar
João Fadista
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela L. Surdulescu
View author publications
You can also search for this author in PubMed Google Scholar
Christian Herder
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Groves
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Wieland
View author publications
You can also search for this author in PubMed Google Scholar
Jette Bork-Jensen
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Brandslund
View author publications
You can also search for this author in PubMed Google Scholar
Cramer Christensen
View author publications
You can also search for this author in PubMed Google Scholar
Heikki A. Koistinen
View author publications
You can also search for this author in PubMed Google Scholar
Alex S. F. Doney
View author publications
You can also search for this author in PubMed Google Scholar
Leena Kinnunen
View author publications
You can also search for this author in PubMed Google Scholar
Tõnu Esko
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Farmer
View author publications
You can also search for this author in PubMed Google Scholar
Liisa Hakaste
View author publications
You can also search for this author in PubMed Google Scholar
Dylan Hodgkiss
View author publications
You can also search for this author in PubMed Google Scholar
Jasmina Kravic
View author publications
You can also search for this author in PubMed Google Scholar
Valeri Lyssenko
View author publications
You can also search for this author in PubMed Google Scholar
Mette Hollensted
View author publications
You can also search for this author in PubMed Google Scholar
Marit E. Jørgensen
View author publications
You can also search for this author in PubMed Google Scholar
Torben Jørgensen
View author publications
You can also search for this author in PubMed Google Scholar
Claes Ladenvall
View author publications
You can also search for this author in PubMed Google Scholar
Johanne Marie Justesen
View author publications
You can also search for this author in PubMed Google Scholar
Annemari Käräjämäki
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Kriebel
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Rathmann
View author publications
You can also search for this author in PubMed Google Scholar
Lars Lannfelt
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Lauritzen
View author publications
You can also search for this author in PubMed Google Scholar
Narisu Narisu
View author publications
You can also search for this author in PubMed Google Scholar
Allan Linneberg
View author publications
You can also search for this author in PubMed Google Scholar
Olle Melander
View author publications
You can also search for this author in PubMed Google Scholar
Lili Milani
View author publications
You can also search for this author in PubMed Google Scholar
Matt Neville
View author publications
You can also search for this author in PubMed Google Scholar
Marju Orho-Melander
View author publications
You can also search for this author in PubMed Google Scholar
Lu Qi
View author publications
You can also search for this author in PubMed Google Scholar
Qibin Qi
View author publications
You can also search for this author in PubMed Google Scholar
Michael Roden
View author publications
You can also search for this author in PubMed Google Scholar
Olov Rolandsson
View author publications
You can also search for this author in PubMed Google Scholar
Amy Swift
View author publications
You can also search for this author in PubMed Google Scholar
Anders H. Rosengren
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen Stirrups
View author publications
You can also search for this author in PubMed Google Scholar
Andrew R. Wood
View author publications
You can also search for this author in PubMed Google Scholar
Evelin Mihailov
View author publications
You can also search for this author in PubMed Google Scholar
Christine Blancher
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio O. Carneiro
View author publications
You can also search for this author in PubMed Google Scholar
Jared Maguire
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Poplin
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Shakir
View author publications
You can also search for this author in PubMed Google Scholar
Timothy Fennell
View author publications
You can also search for this author in PubMed Google Scholar
Mark DePristo
View author publications
You can also search for this author in PubMed Google Scholar
Martin Hrabé de Angelis
View author publications
You can also search for this author in PubMed Google Scholar
Panos Deloukas
View author publications
You can also search for this author in PubMed Google Scholar
Anette P. Gjesing
View author publications
You can also search for this author in PubMed Google Scholar
Goo Jun
View author publications
You can also search for this author in PubMed Google Scholar
Peter Nilsson
View author publications
You can also search for this author in PubMed Google Scholar
Jacquelyn Murphy
View author publications
You can also search for this author in PubMed Google Scholar
Robert Onofrio
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Thorand
View author publications
You can also search for this author in PubMed Google Scholar
Torben Hansen
View author publications
You can also search for this author in PubMed Google Scholar
Christa Meisinger
View author publications
You can also search for this author in PubMed Google Scholar
Frank B. Hu
View author publications
You can also search for this author in PubMed Google Scholar
Bo Isomaa
View author publications
You can also search for this author in PubMed Google Scholar
Fredrik Karpe
View author publications
You can also search for this author in PubMed Google Scholar
Liming Liang
View author publications
You can also search for this author in PubMed Google Scholar
Annette Peters
View author publications
You can also search for this author in PubMed Google Scholar
Cornelia Huth
View author publications
You can also search for this author in PubMed Google Scholar
Stephen P O'Rahilly
View author publications
You can also search for this author in PubMed Google Scholar
Colin N. A. Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Oluf Pedersen
View author publications
You can also search for this author in PubMed Google Scholar
Rainer Rauramaa
View author publications
You can also search for this author in PubMed Google Scholar
Jaakko Tuomilehto
View author publications
You can also search for this author in PubMed Google Scholar
Veikko Salomaa
View author publications
You can also search for this author in PubMed Google Scholar
Richard M. Watanabe
View author publications
You can also search for this author in PubMed Google Scholar
Ann-Christine Syvänen
View author publications
You can also search for this author in PubMed Google Scholar
Richard N. Bergman
View author publications
You can also search for this author in PubMed Google Scholar
Dwaipayan Bharadwaj
View author publications
You can also search for this author in PubMed Google Scholar
Erwin P. Bottinger
View author publications
You can also search for this author in PubMed Google Scholar
Yoon Shin Cho
View author publications
You can also search for this author in PubMed Google Scholar
Giriraj R. Chandak
View author publications
You can also search for this author in PubMed Google Scholar
Juliana CN Chan
View author publications
You can also search for this author in PubMed Google Scholar
Kee Seng Chia
View author publications
You can also search for this author in PubMed Google Scholar
Mark J. Daly
View author publications
You can also search for this author in PubMed Google Scholar
Shah B. Ebrahim
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Langenberg
View author publications
You can also search for this author in PubMed Google Scholar
Paul Elliott
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen A. Jablonski
View author publications
You can also search for this author in PubMed Google Scholar
Donna M. Lehman
View author publications
You can also search for this author in PubMed Google Scholar
Weiping Jia
View author publications
You can also search for this author in PubMed Google Scholar
Ronald C. W. Ma
View author publications
You can also search for this author in PubMed Google Scholar
Toni I. Pollin
View author publications
You can also search for this author in PubMed Google Scholar
Manjinder Sandhu
View author publications
You can also search for this author in PubMed Google Scholar
Nikhil Tandon
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Froguel
View author publications
You can also search for this author in PubMed Google Scholar
Inês Barroso
View author publications
You can also search for this author in PubMed Google Scholar
Yik Ying Teo
View author publications
You can also search for this author in PubMed Google Scholar
Eleftheria Zeggini
View author publications
You can also search for this author in PubMed Google Scholar
Ruth J. F. Loos
View author publications
You can also search for this author in PubMed Google Scholar
Kerrin S. Small
View author publications
You can also search for this author in PubMed Google Scholar
Janina S. Ried
View author publications
You can also search for this author in PubMed Google Scholar
Ralph A. DeFronzo
View author publications
You can also search for this author in PubMed Google Scholar
Harald Grallert
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Glaser
View author publications
You can also search for this author in PubMed Google Scholar
Andres Metspalu
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Wareham
View author publications
You can also search for this author in PubMed Google Scholar
Mark Walker
View author publications
You can also search for this author in PubMed Google Scholar
Eric Banks
View author publications
You can also search for this author in PubMed Google Scholar
Christian Gieger
View author publications
You can also search for this author in PubMed Google Scholar
Erik Ingelsson
View author publications
You can also search for this author in PubMed Google Scholar
Hae Kyung Im
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Illig
View author publications
You can also search for this author in PubMed Google Scholar
Paul W. Franks
View author publications
You can also search for this author in PubMed Google Scholar
Gemma Buck
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Trakalo
View author publications
You can also search for this author in PubMed Google Scholar
David Buck
View author publications
You can also search for this author in PubMed Google Scholar
Inga Prokopenko
View author publications
You can also search for this author in PubMed Google Scholar
Reedik Mägi
View author publications
You can also search for this author in PubMed Google Scholar
Lars Lind
View author publications
You can also search for this author in PubMed Google Scholar
Yossi Farjoun
View author publications
You can also search for this author in PubMed Google Scholar
Katharine R. Owen
View author publications
You can also search for this author in PubMed Google Scholar
Anna L. Gloyn
View author publications
You can also search for this author in PubMed Google Scholar
Konstantin Strauch
View author publications
You can also search for this author in PubMed Google Scholar
Tiinamaija Tuomi
View author publications
You can also search for this author in PubMed Google Scholar
Jaspal Singh Kooner
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Young Lee
View author publications
You can also search for this author in PubMed Google Scholar
Taesung Park
View author publications
You can also search for this author in PubMed Google Scholar
Peter Donnelly
View author publications
You can also search for this author in PubMed Google Scholar
Andrew D. Morris
View author publications
You can also search for this author in PubMed Google Scholar
Andrew T. Hattersley
View author publications
You can also search for this author in PubMed Google Scholar
Donald W. Bowden
View author publications
You can also search for this author in PubMed Google Scholar
Francis S. Collins
View author publications
You can also search for this author in PubMed Google Scholar
Gil Atzmon
View author publications
You can also search for this author in PubMed Google Scholar
John C. Chambers
View author publications
You can also search for this author in PubMed Google Scholar
Timothy D. Spector
View author publications
You can also search for this author in PubMed Google Scholar
Markku Laakso
View author publications
You can also search for this author in PubMed Google Scholar
Tim M. Strom
View author publications
You can also search for this author in PubMed Google Scholar
Graeme I. Bell
View author publications
You can also search for this author in PubMed Google Scholar
John Blangero
View author publications
You can also search for this author in PubMed Google Scholar
Ravindranath Duggirala
View author publications
You can also search for this author in PubMed Google Scholar
E. Shyong Tai
View author publications
You can also search for this author in PubMed Google Scholar
Gilean McVean
View author publications
You can also search for this author in PubMed Google Scholar
Craig L. Hanis
View author publications
You can also search for this author in PubMed Google Scholar
James G. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Mark Seielstad
View author publications
You can also search for this author in PubMed Google Scholar
Timothy M. Frayling
View author publications
You can also search for this author in PubMed Google Scholar
James B. Meigs
View author publications
You can also search for this author in PubMed Google Scholar
Nancy J. Cox
View author publications
You can also search for this author in PubMed Google Scholar
Rob Sladek
View author publications
You can also search for this author in PubMed Google Scholar
Eric S. Lander
View author publications
You can also search for this author in PubMed Google Scholar
Stacey Gabriel
View author publications
You can also search for this author in PubMed Google Scholar
Karen L. Mohlke
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Meitinger
View author publications
You can also search for this author in PubMed Google Scholar
Leif Groop
View author publications
You can also search for this author in PubMed Google Scholar
Goncalo Abecasis
View author publications
You can also search for this author in PubMed Google Scholar
Laura J. Scott
View author publications
You can also search for this author in PubMed Google Scholar
Andrew P. Morris
View author publications
You can also search for this author in PubMed Google Scholar
Hyun Min Kang
View author publications
You can also search for this author in PubMed Google Scholar
David Altshuler
View author publications
You can also search for this author in PubMed Google Scholar
Noël P. Burtt
View author publications
You can also search for this author in PubMed Google Scholar
Jose C. Florez
View author publications
You can also search for this author in PubMed Google Scholar
Michael Boehnke
View author publications
You can also search for this author in PubMed Google Scholar
Mark I. McCarthy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Author contributions are described in the Supplementary Information.

Corresponding author

Correspondence to Jason Flannick.

Ethics declarations

Competing interests

Ralph A DeFronzo has been a member of advisory boards for Astra Zeneca, Novo Nordisk, Janssen, Lexicon, Boehringer-Ingelheim, received research support from Bristol Myers Squibb, Boehringer- Ingelheim, Takeda and Astra Zeneca, and is a member of speaker’s bureaus for Novo-Nordisk and Astra Zeneca. Jose C Florez has received consulting honoraria from Pfizer and PanGenX. Erik Ingelsson is an advisor and consultant for Precision Wellness, Inc., and advisor for Cellink for work unrelated to the present project. Mark McCarthy has received consulting and advisory board honoraria from Pfizer, Lilly, and NovoNordisk. Gilean McVean and Peter Donnelly are co-founders of Genomics PLC, which provides genome analytics.

ISA-Tab metadata

Supplementary information

Supplementary Information (PDF 257 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Flannick, J., Fuchsberger, C., Mahajan, A. et al. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls. Sci Data 4, 170179 (2017). https://doi.org/10.1038/sdata.2017.179

Download citation

Received: 31 May 2017
Accepted: 02 November 2017
Published: 19 December 2017
DOI: https://doi.org/10.1038/sdata.2017.179

This article is cited by

Whole genome sequence association analysis of fasting glucose and fasting insulin levels in diverse cohorts from the NHLBI TOPMed program
- Daniel DiCorpo
- Sheila M. Gaynor
- Alisa K. Manning
Communications Biology (2022)
Relationship between insulin sensitivity and gene expression in human skeletal muscle
- Hemang M. Parikh
- Targ Elgzyri
- Ola Hansson
BMC Endocrine Disorders (2021)
Loss of Znt8 function in diabetes mellitus: risk or benefit?
- Carla P. Barragán-Álvarez
- Eduardo Padilla-Camberos
- Nestor E. Díaz-Martínez
Molecular and Cellular Biochemistry (2021)
Multi-omics analysis identifies CpGs near G6PC2 mediating the effects of genetic variants on fasting glucose
- Ren-Hua Chung
- Yen-Feng Chiu
- Chao A. Hsiung
Diabetologia (2021)
Mutations and variants of ONECUT1 in diabetes
- Anne Philippi
- Sandra Heller
- Alexander Kleger
Nature Medicine (2021)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Ethics statement

WGS GoT2D integrated panel generation

Ascertainment of individuals

DNA sample preparation

Exome sequencing

Genome sequencing

HumanOmni2.5 array genotyping and quality control (QC)

Processing, quality control, and variant calling of sequence data

Integrated panel generation

WES (GoT2D+T2D-GENES Multiethnic) panel generation

Ascertainment of individuals

Exome sequencing

Processing, QC, and variant calling

Assaying variants in larger sample sizes

Imputation from the WGS panel

Exome array genotyping from the WES panel

Association analysis

WGS panel single variant analysis

Analysis of imputed datasets

WES panel single variant analysis

Analysis of exome array datasets

Gene-level analysis

Code availability

Data Records

Technical Validation

Evaluation of variants in the WGS panel

Evaluation of variants in the WES panel

Evaluation of imputation from the WGS panel

Evaluation of exome array sensitivity

Evaluation of association tests

Usage Notes

Additional information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links