Introduction

Recent genome-wide association (GWA) studies and their meta-analyses of type II diabetes (T2D) in populations of European origin have lead to the identification of multiple new loci with no prior implications in T2D pathogenesis.1, 2, 3, 4, 5, 6 The association of some of these new loci detected during the first round of GWA studies was replicated in other populations of non-European origin. First detected by linkage scan,7 the TCF7L2 gene was identified as the strongest candidate gene to be associated with T2D through GWA studies (odds ratio (OR): 1.35) and has been replicated in subsequent studies in many different ethnicities.8, 9, 10, 11, 12, 13, 14, 15, 16 The association of other newly emerged GWAS loci, such as IGF2BP2, CDKAL1, CDKN2A/B, HHEX, SLC30A8 and KCNJ11, was replicated in Japanese;17, 18 IGF2BP2, SLC30A8, HHEX, CDKAL1, CDKN2A/B and FTO in Asians from Hong Kong and Korea;19 IGF2BP2, CDKAL1, CDKN2A/B, HHEX and SLC30A8 in Han Chinese;20 and IGF2BP2, PPARG2 and FTO in Indian Sikhs.21

More recently, a meta-analysis of three large GWA studies by the Diabetes Genetics Replication and Meta-analysis (DIAGRAM) Consortium identified six additional T2D loci (NOTCH2, THADA, ADAMTS9, JAZF1, CDC123/CAMKID and TSPAN8/LGRS) to be strongly associated with increased susceptibility to T2D with modest ORs (1.15−1.3).6 Three of these six loci (JAZF1, CDC123/CAMKID and TSPAN8/LGRS) revealed significant association with surrogate measures of insulin release and impaired β-cell function in an independent study of Caucasians from Denmark.22 Another Caucasian study from Southern Germany could not replicate the association of any of these six loci with T2D in their large population-based cohort, except that a weak association was detected in ADAMTS9 with insulin sensitivity and insulin secretion.23

Given the considerable differences in allele frequencies among different ethnic groups,3, 24, 25, 26 and variations in diabetic phenotype across ethnicities,27 an evaluation of these novel loci in T2D in other ethnic populations is important. The aim of this replication study is to determine whether the variations in these six new loci contribute to the risk of T2D in a high-risk community of Asian Indian Sikhs.

Materials and methods

Human subjects

The study subjects are part of the ongoing Sikh Diabetes Study (SDS).28 This study focuses on an endogamous community of Khatri Sikhs living in the Northern states of India, including Punjab, Haryana, Himachal Pradesh, Delhi and Jammu and Kashmir. The DNA and serum samples of 680 T2D case (375 men and 305 women) and 637 normoglycemic (NG) control (320 men and 317 women) subjects were used in this investigation. Of these 680 cases, 357 subjects were from family material (one index case from each family) and the remaining 323 individuals were unrelated T2D cases from the same Sikh community and geographic location. The cases were 25 years with mean age at the time of recruitment (mean±s.d.) of 55.4±11.0 years. The diagnosis of T2D was confirmed by scrutinizing medical records for symptoms, use of medication and measuring fasting glucose levels following the guidelines of the American Diabetes Association.29, 30 T2D was defined as a medical record indicating either (1) a fasting plasma glucose level 126 mg per 100 ml or 7.0 mmol l−1 after a minimum 12-h fast, or (2) a 2-h post glucose level (2-h oral glucose tolerance test (OGTT)) 200 mg per 100 ml or 11.1 mmol l−1 on more than one occasion with symptoms of diabetes. Impaired fasting glucose was defined as fasting plasma glucose >100 mg per 100 ml (5.6 mmol l−1) but <126 mg per 100 ml (7.0 mmol l−1). Impaired glucose tolerance was defined as a 2-h OGTT 140 mg per 100 ml (7.8 mmol l−1) but <200 mg per 100 ml (11.1 mmol l−1). In the absence of medical record information, we confirmed self-reported T2D cases by performing a 2-h OGTT following the criteria of the World Health Organization (WHO) (75 g oral load of glucose). Subjects with impaired fasting glucose or impaired glucose tolerance were considered as ‘pre-diabetics’ and were not included in this study. Body mass index (BMI) was calculated as (weight (kg)/height (meter)2). Waist and hip circumferences were taken with a metal tape at the abdomen and at the hip.

The NG participants were from the same Khatri Sikh community and were recruited from the same geographic origin as the T2D patients, as described earlier.28 The majority of the subjects were recruited from the state of Punjab in Northern India. Individuals of South, East and Central Indian origin or those with type I diabetes or family members with type I diabetes or rare forms of T2D denoted maturity-onset diabetes of young and secondary diabetes (for example, diabetes due to hemochromatosis or pancreatitis) were excluded. Of the 637 NG controls (320 men and 317 women), 262 were non-diabetic spouses of diabetic patients, and the remaining 375 subjects were unrelated non-T2D individuals, who had no family history of T2D, but some had other chronic illnesses such as hypertension, coronary heart disease or arthritis. The selection of controls was based on a fasting glycemia <100 mg per 100 ml (<5.6 mmol l−1) or a 2-h glucose <140 mg per 100 ml (<7.8 mmol l−1). The average age of NG controls (mean±s.d.) was 50.6±13.5 years. In general, Sikhs do not smoke for religious and cultural reasons and about 50% of them are lifelong vegetarians. Clinical characteristics of the SDS subjects used for this investigation are summarized in Table 1. All blood samples were obtained at the baseline visit and all participants provided written informed consent for investigations. All SDS protocols and consent documents were reviewed and approved by the University of Oklahoma Institutional Review Boards as well as by the Human Subject Protection Committees at the participating hospitals and institutes in India.

Table 1 Clinical characteristics of the study population stratified by gender and disease (mean±s.d.)

Metabolic assays

Quantitation of lipids (total cholesterol, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, very low-density lipoprotein cholesterol and triglycerides) and insulin levels was measured in serum. Lipids were quantified by using standard enzymatic methods (Roche, Basel, Switzerland). Insulin was measured by a radioimmunoassay (Diagnostic Products, Los Angeles, CA, USA). All quantitative parameters were determined by following the manufacturer's instructions using a Hitachi 902 auto-analyzer (Roche, Basel, Switzerland). Homeostasis model assessment (HOMA) for insulin resistance (HOMA-IR) was calculated as (fasting glucose × fasting insulin/22.5). Pancreatic β-cell function (HOMA-B) was calculated as (fasting insulin × 20)/(fasting glucose−3.5), as described by Matthews et al.31

SNP genotyping

DNA was extracted from buffy coats using QiaAmp blood kits (Qiagen, Chatsworth, CA, USA) or by the salting out procedure.32 Genotyping for all investigated single nucleotide polymorphisms (SNPs) was performed using TaqMan Pre-Designed or TaqMan Made-to-Order SNP genotyping assays from Applied Biosystems (ABI, Foster City, CA, USA). TaqMan genotyping reactions were performed on an ABI 7900HT genetic analyzer using 2 μl of (10 ng μl−1) genomic DNA following the manufacturer's instructions. Fluorescence was detected on an ABI prism 7900HT sequence detection system (ABI). Genotypes were scored by analyzing data on both real-time and allele discrimination assay platforms using SDS software provided by the ABI. Data quality for SNP genotyping was checked by establishing reproducibility of control DNA samples. For quality control, 30 replicate positive controls and 8 negative controls were included in each run to match the concordance, and the discrepancy in the concordance was <0.2%. Genotyping success rate ranged between 95 and 99.3% for all the investigated SNPs.

Statistical analysis

The Hardy–Weinberg equilibrium was evaluated using Pearson's χ2 1 d.f. goodness-of-fit test separately for cases and controls. The allele frequencies in T2D cases were compared with those in controls using a χ2 test or Fisher's exact probability test, where appropriate. Statistical evaluations for testing genetic effects of association between the case–control status and each individual SNP, measured by the ORs and its corresponding 95% confidence limits, were estimated using unconditional logistic regression before and after adjusting for age, gender and other covariates. Association analyses were performed assuming codominant, dominant and recessive models using SNPassoc33 and SPSS (v. 15.0) for Windows. In all analyses, the common homozygote genotype in the control population was defined as the reference category. Statistical power was assessed using the Genetic Power Calculator34. We also applied Bonferroni's correction for multiple testing.

A multiple linear regression analysis was used to examine the impact of these variants on quantitative risk variables for T2D, including fasting insulin, glucose and lipid levels. Skewed variables were detected by Shapiro-Wilk's test for the continuous traits. Subsequently, triglycerides, total cholesterol, low-density lipoprotein cholesterol, fasting glucose, fasting insulin, HOMA-IR and HOMA-B were normalized by log-transformation before statistical comparisons, and all P-values were derived from analyses of transformed data. Significant covariates for each dependent trait were identified by using Spearman's correlation and step-wise multiple linear regression with an overall 5% level of significance. These analyses were performed using dominant, codominant and recessive genetic models and, the best fitting model was chosen for each SNP. Analyses were adjusted for the confounding effects of age, BMI, sex, medication and also for disease status in the combined analysis, where appropriate. For lipid traits, the individuals on lipid-lowering medications were excluded.

Results

We genotyped 680 T2D patients and 637 NG controls. These subjects were genotyped for NOTCH2 (rs10923931), THADA (rs7578597), ADAMTS9 (rs4607103), JAZF1 (rs864745), CDC123/CAMKID (rs12779790) and TSPAN/LGR5 (rs7961581) SNPs and analyzed for association with T2D and quantitative sub-phenotypes of T2D. The case–control cohort used in this investigation was matched for ethnicity, culture and geographical locations. Table 1 shows the clinical and physical characteristics of study subjects stratified by gender and disease. The information on the genomic location of the investigated SNPs and minor allele frequencies is summarized in Table 2. The genotype distribution of all investigated SNPs was in the Hardy–Weinberg equilibrium in both cases and controls.

Table 2 Association of candidate SNPs with T2D

Of these six SNPs, only CDC123/CAMKID revealed a significant evidence of association with T2D in Sikhs after appropriately adjusting for the confounding variables of age, sex and BMI. The adjusted OR for CDC123/CAMKID under dominant genetic model was 1.27 (95% CI: 1.02–1.57; P=0.031). A marginally significant trend was also observed under the codominant model (OR: 1.27; 95% CI: 1.00–1.67; P=0.052). The variant in NOTCH2 also revealed a weak association with T2D under the codominant model (OR: 1.21; 95% CI: 1.00–1.47; P=0.053) (Table 2). However, after applying Bonferroni's correction testing for six SNPs for multiple comparisons, none of the investigated SNPs remained statistically significant. No other variant from the remaining loci was associated with T2D in our study. The general estimates of power in this sample to detect the effects of variants with ORs between 1.2 and 1.5 at α=0.05 were 65, 70 and >82% when the frequency of the risk allele was 0.15, 0.20 and >0.30, respectively. Decreasing the OR to 1.1 reduced these power estimates substantially to 30, 33 and >40%.

We next explored the association of these investigated SNPs with T2D-related quantitative traits controlling insulin action (HOMA-IR) and insulin secretion (HOMA-B), adiposity and lipid-related traits. The multiple linear regression analysis after adjusting confounding variable of age, BMI and sex revealed a significant association of the CDC123/CAMKID (rs12779790) variant with fasting insulin levels both in T2D cases (P=0.009) and NG controls (P=0.030), and also in a combined sample of T2D cases+NG controls after adjusting for the effects of disease status along with other significant covariates (P=0.003). As shown in Table 3, the mean values of fasting insulin levels (95% CI) (pmol l−1) markedly decreased among GG homozygous individuals with G (risk) allele in both NG controls (AA: 62.0 (56.3–67.4); AG: 61.0 (52.8–75.7); and GG: 26.4 (18.1–37.5); P=0.030) and T2D cases (AA: 49.3 (45.1–53.5); AG: 40.3 (34.7–46.5); and GG: 34.0 (19.9–51.6); P=0. 009). Our results also revealed significant evidence of impaired β-cell function among G-allele carriers in both T2D cases (P=0.008) and a combined sample (P=0.026) but not in NG controls (P=0.572) (Table 3). There was a gradual decline in pancreatic β-cell function among G (risk) allele carriers, as suggested by mean (95% CI) HOMA-B levels (AA: 20.5 (18.6–22.7); AG: 16.0 (13.3–19.1); and GG: 13.5 (5.2–34.9); P=0.008) among T2D cases, and (AA: 38.9 (35.6–42.7); AG: 32.5 (27.1–38.8); and GG: 23.8 (13.2–40.9); P=0.026) in the combined cohort after controlling for the effects of disease status and medication. We also observed a weak association of NOTCH2 variants with fasting insulin levels among T2D patients (P=0.033) but not in NG controls (P=0.720) (Supplementary Table 1s). None of the remaining SNPs revealed association with any metabolic trait related to insulin resistance, insulin secretion, T2D or obesity (Supplementary Tables 1s–5s). Conservative estimates of the statistical power for quantitative analysis were calculated assuming an additive genetic model and a 0.005 level of significance after Bonferroni's adjustment for considering 10 independent variables. The power to detect inter-genotypic differences in insulin levels as small as 10% was well in excess of 85% at allele frequency ranging from 0.13 to 0.40. HOMA-B levels with differences of 20% were detectable with 57% power for allele frequencies of 0.13 and they increased to 80% for allele frequencies of about 0.20.

Table 3 Association of CDC123/CAMKID (rs12779790) with anthropometric and metabolic traits controlling glucose homeostasis and lipid metabolism

Discussion

In this investigation, we have tested the association of six latest candidate SNPs identified by the meta-analysis of GWA data6 in our T2D case–control cohort of Khatri Sikhs of Asian Indian origin. Of the six SNPs, only CDC123/CAMKID; (rs12779790) replicated earlier evidence of association with T2D and other metabolic sub-phenotypes controlling insulin secretion in our sample. By applying Bonferroni's correction, the observed association with T2D phenotype would no longer be statistically significant for CDC123/CAMKID. However, the observed association of this variant with quantitative measures of insulin secretion in our sample suggests that the association is true. Our results in the multiple linear regression analysis revealed a significant association of this SNP with fasting insulin in NG controls (P=0.030; R2=0.017), T2D cases (P=0.009; R2=0.020) as well as in a combined sample (P=0.003; R2=0.016) under an additive genetic model. Moreover, the P-value associated with fasting insulin levels remained statistically significant after applying Bonferroni's correction in the combined cohort (P=0.03). It is noted that the impairment in the β-cell function was more pronounced among T2D patients (P=0.008; R2=0.017) but not in NG controls (P=0.572; R2=0.001), perhaps because of the increased demand for insulin during acute conditions of hyperglycemia.

From these results, it appears that the genetic variation in CDC123/CAMKID affects glucose-stimulated insulin release and may be responsible in impaired β-cell function that may subsequently contribute to putative T2D risk in Sikhs. However, this variant explained only 2% of the overall contribution of this gene in controlling fasting insulin levels in this sample. These findings suggest that perhaps the causal variant in this gene still remains to be identified. Interestingly, the same SNP has been shown to affect pancreatic β-cell function in a large population-based cohort from Denmark.22 The authors showed association of G (risk) allele with insulin secretion under additive (P=0.02) and recessive (P=8 × 10–5) genetic models. However, at the same time, the association of this variant could not be reproduced with T2D or impaired β-cell function in another Caucasian study from Southern Germany.23 The possible reason for this inconsistency in confirming the role of these loci with diabetic and pre-diabetic phenotypes within Euro-Caucasians could be due to the genuine genetic heterogeneity existing among ‘Caucasians’ because of wide geographic locations and genetic histories. In the current sample of Asian Indian Sikhs, false-positive or false-negative associations arising from genetic heterogeneity owing to population admixture are less likely to be the reason for non-replication of association with these loci in our sample. Our carefully ascertained, relatively homogeneous case–control sample of Asian Indian Sikhs belongs to a single geographic location and is collected from an endogamous community of urban Sikhs from the Khatri community.

Another possible reason for inconsistency among the Euro-Caucasian-based studies is a lack of sufficient statistical power to detect the small contribution of these loci (ORs: 1.15–1.3)6 to the complex phenotype of T2D.35 Unfortunately, this case–control sample of Sikhs is similarly hampered by low statistical power at the lower end of this range of ORs. The minimum ORs detectable with 80% power for risk allele frequencies of 0.15, 0.20 and 0.30 were 1.25, 1.22 and 1.19, respectively. Consequently, the lack of association of some previously reported loci with T2D in this Sikh population may not be conclusive that these genetic variants have no effect on risk for T2D.

A further reason for the lack of association of these SNPs with risk for T2D could be that in the Euro-Caucasian studies the reported SNPs were not functional but rather were in linkage disequilibrium (LD) with nearby SNPs, and perhaps because of ethnic differences, this LD was not present in our sample. Clearly, there was a significant difference (P0.001) in the distribution of risk allele frequencies in our Sikh sample compared with Caucasians reported in a meta-analysis study6 (NOTCH2: 0.20 vs 0.10; THADA: 0.85 vs 0.90; ADAMTS9: 0.50 vs 0.76; JAZF1: 0.68 vs 0.50; CDC123/CAMKID: 0.13 vs 0.18; TSPAN8/LGR5: 0.35 vs 0.27, respectively, for Sikhs and Caucasians). Therefore, it is likely that the causal variant in these loci could be ethnic-specific and could be present elsewhere in the same or nearby gene. Furthermore, genetic differences in phenotype heterogeneity and characterization also cannot be ruled out.

GWA studies allow the detection of novel genes with previously unknown roles in the biology of a disease. The detection of new loci may help unravel the disease pathophysiology in greater detail.35 However, it is still challenging to detect the true candidates using GWA studies in complex diseases, especially for the loci that are common because of ethnic variations.36 CDC123/CAMKID is an evolutionary conserved protein involved in controlling cell cycle and its levels of expression are regulated by nutrients in Saccharomyces cerevisiae.20 However, its role in T2D pathophysiology is unclear. This variant (rs12779790) lies in the inter-genic region (about 90 kb) from CDC123/CAMKID and (63.5 kb) from CAMKID. It is also possible that the causative variant is located in CAMKID or other nearby gene regions that may be affecting insulin secretion or insulin sensitivity. Given the sparse knowledge available on the biological function of these genes, it is unclear how these genes are involved in the T2D pathogenesis. Unlike CDC123/CAMKID, no other variant revealed significant association with quantitative variables involved in glucose homeostasis (fasting glucose, fasting insulin, HOMA-IR or HOMA-B), adiposity or lipid levels, except for the weak association of NOTCH2 variants affecting fasting insulin levels (P=0.033) in T2D cases (Supplementary Table 1s). It is noted that NOTCH2 is expressed in ductal cells of branching pancreatic buds and plays a key role in fetal pancreatic development. In addition, modulation of NOTCH signaling can affect pancreatic β-cell formation. Although our data could not replicate the positive association of NOTCH2 with T2D, marginally significant association of T (risk) allele with reduced insulin secretion suggests the possible role of NOTCH2 variation in affecting β-cell function in Sikhs. Perhaps a causal variant in this gene with a greater risk is yet to be identified. Our findings also confirmed the negative association revealed in JAZF1, THADA and ADAMTS9 with pre-diabetic phenotypes in another Caucasian sample from Southern Germany.23 Our results also agree with the negative findings reported in THADA and ADAMTS9 with surrogate measures of insulin release and insulin action in Caucasian study from Denmark.22 The inability of this study as well as of the earlier studies to clearly replicate meta-analysis findings may suggest a weak association of these loci to T2D.

Nevertheless, false-negative results in our data set also cannot be ruled out because of limited statistical power to detect the role of some of these SNPs in T2D or in β-cell function (insulin secretion) or insulin sensitivity. Precise phenotype data, including the measurements of insulin secretion (such as intravenous glucose tolerance test) or hyperglycemic clamps and C-peptide for more efficient estimation of insulin release, may better define the role of these variants in β-cell function and insulin sensitivity. Positive association detected in the CDC123/CAMKID variant with fasting insulin levels and impaired β-cell function emphasizes the importance of abnormal β-cell function in the pathogenesis of T2D. These results await further confirmation by an ethnicity-matched larger study. To our knowledge, ours is the first replication study reporting the role of these six new loci with T2D in a population sample from the Indian subcontinent.