Introduction

Type 2 diabetes is a complex metabolic disorder resulting from complex interaction of a large number of genetic and environmental factors. Linkage studies for type 2 diabetes have provided promising and replicated findings on 1q21–q231, 2 and 20q13.3, 4, 5, 6, 7 These robust replications suggested the plausibility of localization of single or multiple susceptibility genes on these chromosomal regions. Subsequently, several studies performed fine mapping of these candidate regions but yielded little success.8 The extensive work of the 1q21 consortium found association of a few genes with type 2 diabetes on chromosome 1 but could not provide a confirmatory signal.8, 9, 10 Moreover, investigation of candidate genes on these regions have provided inconsistent and/or only modest association of a few genes including KCNJ9,11 KCNJ10,12 PKLR,13 CRP,14 LMNA,15 CASQ1,16 DUSP12,17 HNF4A,18 DOK519 and PTPN120 with type 2 diabetes. These candidate regions still remain unexplored in Indian population, which is a high-risk group for type 2 diabetes21 that necessitates a systematic evaluation of these promising candidate regions in this population.

The associated genetic variants for complex disorders are expected to have moderate-to-small effects. This is evident from the findings of genome wide association studies on type 2 diabetes where majority of the variants identified till date have small-to-moderate effects and are located in non-coding regions.22 The same has also been observed for loci associated with different traits where 88% of the associated signals have moderate-to-weak effect and are located in non-coding regions.23 Interestingly, a recent study has shown abundance of heritable factors altering gene expression in trait-associated loci and suggested that genetic risk factors for complex traits often affect phenotype by altering gene expression.24 This brings up a strong case for a focused study on regulatory variants with respect to complex traits. Comprehending this, we investigated association of 207 common variants in regulatory regions (proximal promoter, untranslated and flanking regions) of genes in type 2 diabetes-linked regions 1q21–23 and 20q13 with type 2 diabetes and its related traits in Indians in a two-stage case-control study.

Materials and methods

Subjects

Study involved participation of 4200 unrelated subjects of Indo-European ethnicity, recruited from urban regions in and around Delhi, North India. Initial association analysis was performed in 2115 subjects comprising 1073 patients with type 2 diabetes and 1042 controls. Replication was assessed in 2085 subjects of which 1047 were type 2 diabetes patients and 1038 were controls. Type 2 diabetes was diagnosed as per WHO (World Health Organization) criteria.25 Non-diabetic subjects of 40 years without family history of diabetes who had glycated hemoglobin level 6.0% and fasting glucose level <110 mg dl−1 were enrolled as controls. The inclusion and exclusion criteria for type 2 diabetes patients and control subjects are provided in details previously.26 Recruited subjects underwent extensive anthropometric and biochemical measurements as provided earlier,26 and their characteristics are presented in Table 1.

Table 1 Anthropometric and clinical characteristics of the study populations

The study was carried out in accordance with the principles of Helsinki Declaration. The study was approved by the ethics committee of all the participating institutions (Institute of Genomics and Integrative Biology (CSIR), All India Institute of Medical Sciences, SMS Medical College and Hospital and University College of Medical Sciences). Informed written consent was obtained from all the participants.

Selection of genes and single nucleotide polymorphisms (SNPs)

On 1q21–23, strongest linkage with type 2 diabetes had been observed at marker D1S484 with Lod-score of 5.3.27 We screened 2 Mb genomic region around D1S484 spanning 158 Mb–160 Mb on chromosome 1 (NCBI build 36.3). On 20q13, 6 Mb region around the strongest marker D20S196 (Lod-score 5.3), spanning 46 Mb–52 Mb (NCBI build 36.3) was explored because of low gene density.6 Information regarding all the genes within the regions was obtained from Ensembl database (v38) (http://www.ensembl.org/index.html). The 2 Mb region of 1q21–23 had 59 genes whereas 37 genes were located within 6 Mb region of 20q13.

Genes were searched for SNPs in 5′untranslated region , 3′untranslated region, 3 kb upstream and downstream of the genes, in NCBI variation database (dbSNP build 126: http://www.ncbi.nlm.nih. gov/SNP). SNPs in untranslated regions of the genes and tagSNPs were given preference, and at least one SNP in 5′ and 3′ regions for each gene was included, if available. A total of 207 SNPs comprising 131 SNPs of 59 genes from 1q21–23 and 76 SNPs of 37 genes from 20q13 (Supplementary Table 1) were genotyped in 2115 individuals in stage 1 (Supplementary Figures S1 and S2).

Genotyping

Genotyping in stage 1 was performed using Illumina GoldenGate assay (Illumina Inc., San Diego, CA, USA). Genotyping data were subjected to extensive quality control that included genotype confidence score >0.25, SNP call rate >0.9, GenTrans score >0.6, cluster separation score >0.4, Hardy Weinberg equilibrium (P>0.05 in controls), minor allele frequency >0.05. Ninety samples had genotype call for <90% of the SNPs and were excluded from the analysis. A total of 150 SNPs were available for further analysis, having average call rate of 99.5%, with concordance of >99.9% in genotype calls in 7% duplicates.

In stage 2 for replication, four top signals from stage 1: rs11265455 (SLAMF1), rs1062827 (F11R), rs12565932 (F11R) and rs1333062 (ITLN1) were genotyped using iPLEX (Sequenom, San Deigo, CA, USA). Average genotyping success rate was 96% with >99.8% consistency rate in genotype calls in 5% duplicates.

Statistical analysis

Statistical analyses were mainly performed using PLINK v. 1.07 (http://www.pngu.mgh.harvard.edu/purcell/plink/). Hardy Weinberg equilibrium was tested using χ2 analyses. Association of SNPs with type 2 diabetes was assessed using logistic regression under additive model adjusted for age, sex and body mass index (BMI). Odds ratios (ORs) were calculated with respect to risk allele. Meta-analysis was performed by combining summary estimates both under random effect and fixed effect models. P value <3.3 × 10−4 was considered significant after multiple testing correction for independent SNPs (0.05/138). Association of SNPs with quantitative traits was determined using linear regression adjusted for age, sex and BMI as appropriate, in control subjects after inverse normal transformation of the data to achieve normal distribution. A P value <2.0 × 10−5 (0.05/(17 × 138)) was considered significant after multiple testing correction for quantitative traits analysis. Statistical power of the study was estimated using QUANTO version 1.2 (http://hydra.usc.edu/gxe/).28 Our sample size in the initial phase had 70–97% power to detect the association with OR of 1.3–1.5 assuming minor allele frequency of 0.20.

Results

On 1q21–23, 10 SNPs showed association with type 2 diabetes (P<0.05), with most significant association at rs11265455 in 5′ flank of SLAMF1 (OR=1.32, P=1.1 × 10−3). The highest OR was obtained for 3′untranslated region variants (rs1062827 and rs12565932) of F11R (OR=1.36, P=1.5 × 10−3 and 1.35, P=1.8 × 10−3) (Table 2). On 20q13, only nominal associations at rs5602 (PTGIS) was observed (P=0.026). Allelic and genotypic distributions of all the SNPs investigated for association with type 2 diabetes are provided in Supplementary Table 1.

Table 2 Association of SNPs on 1q21–23 and 20q13 with type 2 diabetes in North Indian population

As obesity status is shown to modulate genetic susceptibility of type 2 diabetes previously,26, 29 we performed BMI-stratified association analysis after segregating subjects as normal-weight (BMI<23 kg m−2) and overweight/obese (BMI23 kg m−2).30 The most significant association was found at rs881291 of ATP1A4 (OR=1.42, P=3.8 × 10−3) on 1q21–23 in normal-weight subjects and at rs1333062 of ITLN1 (OR=1.28, P=3.4 × 10−3) in overweight/obese subjects (Supplementary Table 2). Interestingly, the association of rs1333062 (ITLN1) in BMI-stratified analysis was found to be stronger than the entire data set.

For replication analysis, we selected SNPs rs11265455, rs1062827 and rs12565932, which showed association with type 2 diabetes at P<5.0 × 10−3. In addition, we also selected an interesting signal rs1333062 that was nominally associated in entire data set (P=0.039) but showed stronger association with type 2 diabetes in BMI-stratified analysis (P<5.0 × 10−3). Associations of rs11265455 in SLAMF1 and rs1333062 in ITLN1 were confirmed in replication analyses that were corroborated by the meta-analysis of two study populations (ORr=1.29, Pr=3.9 × 10−5 and ORr=1.19, Pr=1.8 × 10−4, respectively) (Table 2). Association of both the variants remained significant after correcting for multiple tests. As observed in stage 1, we found stronger association of ITLN1 variant rs1333062 with type 2 diabetes among overweight/obese subjects in BMI-stratified analyses in stage 2 (OR=1.28 P=7.8 × 10−3) and meta-analysis (ORr=1.25, Pr=8.8 × 10−5) (Table 3). The combined analysis also showed significant association of rs1333062 with type 2 diabetes among overweight/obese subjects (OR=1.25, P=8.7 × 10−5). For F11R SNPs rs12565932 and rs1062827, only nominal association under fixed effect model of meta-analysis was observed (ORf=1.19 and 1.18, Pf=0.011 and 0.016, respectively). Similar results were obtained when the analysis was performed by combining the raw genotyping data from two stages (Table 2).

Table 3 BMI-stratified analysis of SNPs associated with type 2 diabetes in North Indians

We investigated the functional effects of SLAMF1 and ITLN1 variants through cis-gene expression quantitative trait loci (cis-eQTL) analysis using transcriptome data from University of Chicago's eQTL data repository (http://www.eqtl.uchicao.edu/cgi-bin/gbrowse/eqtl). The ITLN1 variant rs1333062 is in strong linkage disequilibrium (LD) with rs11265519 (r2=0.89), which is shown to be associated with the expression of LY9 (P=2.4 × 10−7), C4ORF11 (P=1.7 × 10−6), AF009262 (P=2.9 × 10−6), OR5B21 (P=1.0 × 10−6), IGSF4B (P=5.0 × 10−4) and SLAMF8 (P=3.6 × 10−3) in liver. We found an eQTL rs7550771 near SLAMF1, but this eQTL was not in LD with the associated variant rs11265455.

Next, we investigated association of SNPs with metabolic traits that are risk factors for type 2 diabetes, including obesity (waist circumference, hip circumference, waist-to-hip ratio), glucose homeostasis (fasting plasma glucose, glycated hemoglobin, fasting plasma insulin, C-peptide, homeostasis model assessment of insulin resistance (HOMA-IR)), dyslipidemia (total cholesterol, triglyceride, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol) and inflammation (high sensitivity C-reactive protein) (Figures 1 and 2). For obesity-related phenotypes, strongest signal for association was observed at rs1008687 of SPATA2 (β=−1.13 kg m−2, P=3.4 × 10−5 for BMI) on 20q13 (Figure 2). We also found association of rs1008687 with HOMA-IR and fasting plasma insulin. The strongest association with measures of glucose homeostasis was observed at rs16858733 of FCRLB on 1q21-23 (P=3.3 × 10−4 for fasting plasma glucose, P=2.5 × 10−3 for fasting plasma insulin and P=3.1 × 10−4 for HOMA-IR). SNPs showing putative association with quantitative clinical traits are listed in Supplementary Table 3.

Figure 1
figure 1

Association of SNPs on 1q21–23 with type 2 diabetes-related clinical traits. Legend: The –log10 P values are plotted against the chromosomal location (NCBI build 36.3). A full color version of this figure is available at the Journal of Human Genetics journal online.

Figure 2
figure 2

Association of SNPs on 20q13 with type 2 diabetes-related clinical traits. Legend: The –log10 P values are plotted against the chromosomal location (NCBI build 36.3). A full color version of this figure is available at the Journal of Human Genetics journal online.

Discussion

Investigation of genetic risk factors for type 2 diabetes in Indians, which is the high-risk group for type 2 diabetes and related metabolic traits, is very imperative. However, efforts on these lines are very limited, often restricted to replication of genome wide association studies findings. Taking lead from genome wide linkage scans, many studies investigated the association of genetic variants from linked 1q21 region with type 2 diabetes in European populations; however, there have been no such efforts from India. The present study was designed to explore high-risk Indian population for type 2 diabetes genetic risk factors from two of the strongly linked chromosomal regions 1q21–23 and 20q13.

We demonstrate association of common variants of SLAMF1 and ITLN1 on 1q21–23 with type 2 diabetes. SLAMF1 encodes signaling lymphocytic activation molecule family member 1 (SLAMF1), also called as CD150. Expression of SLAMF1 is shown to be raised on peripheral blood B cells and synovial fluid from patients with rheumatoid arthritis.31 Here, we found association of rs11265455 in proximal region of SLAMF1 with type 2 diabetes. ITLN1 encodes Intelectin 1 (Omentin 1), an adipokine that enhances insulin-stimulated glucose uptake in adipocytes.32 Plasma levels of Intelectin 1 are inversely correlated with BMI, waist circumference and insulin resistance.33 A study that investigated association of Val109Asp variant of ITLN1 with type 2 diabetes in Caucasians failed to detect the same.34 On contrary, our analysis revealed association of rs1333062 in 3′ flank region of ITLN1, which is in complete LD with Val109Asp of ITLN1 (D′=1.0 and r2=1.0 both for GIH (Gujarati Indians in Houston, Texas) and CEU (Utah residents with Northern and Western European ancestry from the CEPH collection) populations from HapMap data). The observed association of rs1333062 might reflect functional consequence of the missense variant in LD in type 2 diabetes. Moreover, eQTL analysis showed that rs1333062 is in strong LD with rs11265519, which is associated with the expression of nearby genes including LY9, IGSF4B and SLAMF8 in liver. These evidences support our findings that suggest ITLN1 as a susceptibility locus for type 2 diabetes.

Suggestive associations of variants of F11R were also observed in initial analysis, but could not be replicated. Previous reports of association of F11R variants with hypertension and obesity35 support our observations. Also, meta-analysis corroborated our initial findings, but the inconsistency in stage1 and stage 2 warrants further investigation of these variants in other populations. We also identified genomic segments on 1q21–23 and 20q13 influencing type 2 diabetes-related traits, such as obesity, glucose homeostasis and dyslipidemia, which may further be fine mapped to find susceptibility genes for these intermediate phenotypic traits. Notably, we have reported all potential associations of SNPs with metabolic traits at nominal significance (P<0.05). Though increasing the likelihood of false-positive associations, this might provide lead for further investigations of these regions in other populations.

Investigated chromosomal regions harbors a number of positional candidate genes including APOA2, DUSP12, ATF6, KCNJ9 and KCNJ10. Consistent with earlier association of DUSP12 variant with glucose in Chinese population,36 we also found similar result in North Indians. Contradictory results regarding association of ATF6 with type 2 diabetes have been reported earlier.37, 38, 39 Our study detected nominal association of an ATF6 variant with type 2 diabetes and BMI. PTPN1 is shown as an important contributor to risk of type 2 diabetes, obesity and metabolic syndrome;20 however, no association could be observed in our population. APOA2, KCNJ9 and KCNJ10, the well-known biological candidates for type 2 diabetes, did not show any association with type 2 diabetes or any related trait in our study.

Given the scope of the present study, investigation of the selected candidate regions is restricted only to the smaller regions of observed linkage, which generally extend to 10 Mb. Here we adopted a gene-centric approach and focused on the regulatory regions of the genes located on these candidate loci. The role of coding variants in complex disorders cannot be ruled out, but the present knowledge and evidences suggest more likelihood of regulatory variants being involved in complex diseases. With the power of the study, we expected higher likelihood of finding association in putative regulatory regions and hence, mainly focused on those variants. Thus, further investigation of the identified genes for variants in coding and intronic regions is required to establish their etiologic role, if any. Moreover, we would like to mention that though using the present approach we identified association of two genes with type 2 diabetes, further examination of tag SNPs that provide better coverage of the variations in the candidate regions might facilitate identification of additional susceptibility variants.

A study by Rosenberg, et al.40 stated that false-positives due to genetic heterogeneity in diverse Indian population could be smaller than expected. Still, we made considerable efforts to avoid population stratification by recruiting subjects in both the phases with great caution with regards to their ethnicity and geographical regions, forming a homogenous cluster as reported by the Indian Genome Variation Consortium.41 The multidimensional scaling based on 608 unlinked markers genotyped in our initial study population also clearly shows that the study population belongs to one cluster (Supplementary Figure S3).

In conclusion, the present study demonstrates association of common variants of SLAMF1 and ITLN1 with type 2 diabetes in North Indians. Further studies validating the association of these variants in our populations and exploring functional significance are warranted.