Introduction

Obesity is defined as a medical condition in which excess body fat accumulation can impact health negatively.1 It has become one of the leading disorders afflicting mankind worldwide.2 Because of recent explosion of obesity, World Health Organization (WHO) has designated obesity a global epidemic. Overweight and obesity have been estimated to cause as much as 3.4 million deaths every year, with 4% of years of life lost and 4% of DALYs (disability-adjusted life years).3 Obesity is a risk factor for various complication including diabetes, hypertension, cardiovascular diseases and so on.4

Obesity is a heterogeneous disorder involving a complex interaction of a multitude of factors including environmental, behavioral and genetic factors, none of which are completely understood.5 Environmental factors include availability and abundance of food, place of living and socioeconomic status, and behavior in context to obesity largely refers to eating and physical activity that in turn are outputs of a complex integration of different nutritional, metabolic, neuronal and hormonal signals in the brain.6

Genetic studies on obesity started as a result of the observation that family clustering occurs in obesity. Early twin studies indicated that heritability estimates for twins are similar no matter whether they are reared apart or together and confirmed that 25–50% body mass index (BMI) variation observed within family studies and 40–90% within twin studies could be attributed to genetic factors.7, 8, 9 A landmark discovery in the field of obesity genetics was the identification of mutations in the leptin gene in grossly obese children followed by identification of mutations in other genes involved in energy regulation pathways, laying down the basis for monogenic obesity.10 However, it soon became clear that these mutations are very rare and could not explain the recent explosion of obesity and there must be some other mechanism responsible for common forms of obesity. The first genome-wide association study in 2007 opened the way to solve the issue, and in the same year, three independent GWA studies found the same gene, FTO, showing association with obesity.11, 12, 13 This discovery laid the ground for the identification of many variants in other noncandidate genes and it is now accepted that common forms of obesity are product of the combined net effect of many risk variants in different genes.14, 15

The association of common FTO variants, especially rs9939609, with obesity has been confirmed in Caucasians; however, the results in Asian populations are conflicting.16, 17, 18, 19, 20, 21, 22, 23, 24, 25 In Pakistan, there is limited research in the field of obesity genetics. A recent study in the Pakistani population indicated the association of this SNP with obesity in women and proposed that the SNP may play its role by affecting plasma glucose and leptin levels.26 However, no study investigated the relationship between the variant and the lipid profile in obese individuals. A recent genome-wide association study in subjects of South Asian ancestry identified six new loci associated with type II diabetes and these associations have been replicated only in Chinese and Japanese populations.27, 28, 29 As obesity is a risk factor for diabetes, it is possible that these SNPs may mediate their effect by affecting BMI. Because of this biological relevance and rationale, the aim of the current study was to analyze the role of these SNPs and the FTO variant rs9939609 in obesity in subjects from Pakistan and to correlate them with anthropometric traits and plasma lipid profile to elucidate the effect, if any, of these SNPs on lipid parameters.

Materials and methods

Study subjects

The study was a case–control observational type carried out at the Center for Cardiovascular Genetics, Institute of Cardiovascular Science, University College London, and Department of Microbiology and Molecular Genetics, University of the Punjab. The study included a total of 475 subjects (250 cases and 225 controls). Study subject recruitment was done by random sampling from hospitals and general population of Punjab, Pakistan. Subjects signed a written consent and filled in a detailed questionnaire regarding demographic information, lifestyle, exercise habits and family history of obesity. The inclusion criteria were BMI and waist-to-hip ratio (WHR) cutoffs defined for Asian populations previously (BMI >23 kg/m2 as overweight and >26 kg/m2 as obese).30, 31 Exclusion criteria included pregnancy, presence of malignancies and recent infections. The study was approved by the institutional ethics committee (Ethical Committee, School of Biological Sciences, University of the Punjab, Pakistan) and all procedures were carried out in accordance to Helsinki declaration.

Anthropometric measurements

The measurement of body weight (kg), height (m) and waist and hip circumference (cm) was according to standard procedures as described previously.32 BMI (kg/m2) and WHR were calculated for each study subject.

Blood sampling and biochemical analyses

Blood samples were taken after 8–12 h of fasting. Half sample was used for DNA isolation and the other half was used to obtain serum. Serum was separated by centrifuging gel vacutainers at 14 000 r.p.m. for 10 min, and samples were collected in sterilized eppendorf and screened for any infectious agents (HBV, HCV, HIV). Positive samples were discarded and safe samples were used for lipid profile determination. Serum total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C) and low-density lipoprotein cholesterol (LDL-C) were measured using commercially available kits (Spectrum Diagnostics, Obour, Egypt). Epoch, Biotek microplate reader (Biotek Instruments, Winooski, VT, USA) was used for all optical density measurements.

Genotyping

Genomic DNA was isolated from blood leukocytes using Wizard Genomic DNA purification kit (Promega, Madison, WI, USA). DNA was quantified using Nanodrop (ND-8000, Thermo Fisher Scientific, Waltham, MA, USA), made to a 5 ng/μl concentration and spotted on 384-well MicroAmp plates by liquid handling system robot (Biomerk FX, Beckman Coulter, Brea, CA, USA). Genotyping was done by TaqMan allelic discrimination assay using primers and probes given in Supplementary Table 1. The real-time PCR reaction mixture consisted of 1 × KAPA probe fast qPCR master mix (KAPABiosystems, Wilmington, DE, USA), 100 nM of each primer, 100 nM of each probe, ROX high (0.4 μl/20 μl reaction mixture), 5 ng DNA and PCR grade water as needed. The PCR program consisted of the following steps: 50 oC for 2 min, denaturation at 95 oC for 10 min, 40 cycles of amplification at 95 oC for 15 s and 60 oC for 1 min. PCR was done on Bio-Rad C1000 thermal cycler and the results were called using sequence detection software (SDS) version 2.0 on ABI Prism 7900HT (Applied Biosystems/Life Technologies, Waltham, MA, USA).

Gene score calculation

The genotypes of the subjects were coded as 0, 1 and 2 for homozygous protective, heterozygous and homozygous risk genotypes, assuming that both alleles are codominant and the effect of one allele is not masked by the presence of another allele. The count for all SNPs was added to get the sum of risk alleles that each individual possesses, termed the unweighted gene score, that was multiplied by published effect size to get a weighted gene score.28

Statistical analysis

Data analysis was done using the Statistical Package for Social Sciences (IBM SPSS statistics, version 22, Armonk, NY, USA). Data were analyzed for mean, SD and normality of quantitative variables. Quantitative variables showing skewness were log transformed and results are presented as geometric means with approximate SD or SE. The study population was tested for Hardy–Weinberg equilibrium (HWE). Allele/genotype frequencies were calculated and compared between cases and controls using the χ2 test. Odds ratio (OR) and 95% confidence intervals were calculated using logistic regression models. Age and gender association with obesity were checked by Mann–Whitney U-test and Pearson’s χ2 test, respectively, and independent sample t-tests were used to compare the other continuous variables by obesity. The association of variants with obesity was determined using logistic regression, whereas anthropometric (BMI, weight, waist and hip circumference and WHR) and lipid traits were analyzed by analysis of covariance (ANCOVA) with adjustment for age and gender. Linear regression was used to calculate rise/fall in lipid trait levels per risk allele with age and gender included as covariates in the models. In order to test for the hypothesis that the effect of T2D variants in disease progression may be mediated by BMI, a mediation analysis was performed. Mediation analysis was undertaken using the user written command binary_mediation in Stata version 13 (StataCorp, College Station, TX, USA) to estimate direct and indirect effects using the product of coefficients approach. This method uses logistic regression and ordinary least squares and hence the coefficients are rescaled to be comparable before the indirect effects are computed. The program is based on the methods described by MacKinnon and Dwyer.33 To consider the effect of testing multiple genotypes, for each variable we calculated the false discovery rates (FDR), as shown in the tables for comparisons, that were significant at a P-value of <0.05. All but one comparison remained significant at a FDR of 10%. We did not include multiple phenotypes in this calculation as many of the examined variables are highly correlated (eg, BMI and weight and height, TC and LDL-C and TG and HDL are inversely correlated). The results should be considered as exploratory and interpreted in light of the number of tests conducted. Results will need confirmation in other studies. The allele/genotype frequency data have been submitted to GWAS central (http://www.gwascentral.org/).

Results

The characteristics of the study population are summarized in Table 1. The proportion of individuals with a family history of obesity, hypertension, cardiovascular problems and diabetes in the obese group is 63.6%, 25.5%, 12% and 32%, respectively, whereas in controls the values are 8.4%, 0.4%, 0% and 1.3%, respectively. The diabetic patients were on medication, but the exact information on which medicine is being used could not be traced in the majority of patients as a large proportion of the samples came from the general population with insufficient literacy to remember the medicine’s name. The number of males and females in the obese group (n=250) was 139 (55.6%) and 111 (44.4%), respectively, whereas in the control group (n=225) there were 118 (52%) males and 107 (48%) females.

Table 1 Study population anthropometric and biochemical characteristics

Association of selected SNPs with obesity

The genotype distributions for all SNPs was in HWE (P>0.05) for all variants, and the minor allele frequencies of all variants were comparable to the original studies (Table 2). The effect of the gene score (the total number of risk alleles present in an individual) on obesity is shown in Figure 1. Compared with subjects with ≤3 risk alleles, those with ≥9 had an odds ratio for obesity of 5.7 (P-value=0.03). To explore which of the SNPs were contributing to this effect we examined the genotype frequencies of the six SNPs in cases and controls (Table 2). The genotype distribution differed significantly between cases and controls for rs1802295 (VPS26A; P=0.02), rs7178572 (HMG20A; P=0.007) rs2028299 (AP3S2; P=0.04) and rs4812829 (HNF4A; P=0.02), whereas risk allele frequencies differed significantly between cases and controls for rs1802295 (VPS26A; P=0.03) and rs9939609 (FTO; P=0.01; Table 2).

Table 2 Minor allele frequencies and odds ratios of selected SNPs
Figure 1
figure 1

Figure showing the relationship of increasing number of risk alleles, the frequency of subjects carrying the respective number of risk alleles and odds ratios.

Association of SNPs with anthropometric traits

As shown in Table 3, rs3923113, rs2028299, rs4812829 and rs9939609 did not show a significant association with any of the anthropometric traits, rs16861329 showed an association with hip circumference, rs1802295 (VPS26A) was associated with BMI, weight and hip/waist circumference and rs7178572 (HMG20A) showed a significant association with weight and hip circumference. Even in the absence of a significant association for some traits, it was clear that the presence of risk alleles for some variants slightly changed the BMI and WHR values showing the quantitative effect of variants.

Table 3 Association of SNPs with anthropometric traits

Association of SNPs with lipid profile

The association of selected SNPs with lipid traits (TC, TG, LDL-C and HDL-C) and their quantitative contribution per risk allele are shown in Table 4. When plotted against the unweighted gene score, rise or fall in lipid trait values is indicated in Figure 2 and Supplementary Figure 1. A mediation analysis for looking at the mediation effect of lipids on obesity showed that rs1802295 was the only SNP associated with both obesity and any lipid trait.

Table 4 Association of SNPs with lipid traits
Figure 2
figure 2

Effect of increasing number of risk alleles on mean total cholesterol.

Gene score

The total SNP score was calculated for controls and cases, adding the individual risk score and were compared by independent sample t-test. Histograms of unweighted gene score show that in controls 8% individuals had <4 risk alleles and only 57% individuals carried ≥6 risk alleles, whereas in cases only 5% subjects had <4 risk alleles and 58% individuals carried ≥6 risk alleles. In cases, mean gene score (5.93) did not differ significantly from mean gene score in controls (5.76) (Figure 3 and Supplementary Figure 2). The histogram of unweighted gene score is given as Supplementary Figure 3. The mediation analysis rejected the assumption of mediation of the T2D variants by affecting BMI, with neither the weighted or unweighted gene scores showing any significant association with diabetes between cases and controls. However, when tested individually, only rs1802295 showed a significant association with diabetes, but in the direction opposite to the published effect.

Figure 3
figure 3

Obesity by unweighted gene score. Mean unweighted gene score in cases vs controls, mean (SD)=5.76 (1.45) in cases vs 5.93 (1.54) in controls, P=0.29.

Discussion

Pakistan, as a low-income country, initially faced the problem of malnutrition and undernutrition, but an improvement in life standards and availability of a variety of palatable foods at relatively low price resulted in an increase in overall body weight that became more evident because of decrease in physical activity. Owing to illiteracy and psychological stigma, obesity has not been considered a disease until recently, hindering research to understand what factors, biochemical as well as genetic, are involved in the development of obesity in Pakistan. In this context, it is needed to establish a genetic panel representing common variants predisposing to common forms of obesity in this region of the world.

The study aimed to seek insight into the possible mechanism through which six of the reported type II diabetes susceptibility loci and one well-established FTO variant may lead to obesity and subsequently to diabetes. As obesity is a risk factor for diabetes, we genotyped these variants in obese as well as nonobese subjects in order to clarify whether or not these variants are associated with obesity. The extensively studied variant rs9939609 in the FTO gene was selected in order to clarify the inconsistent association patterns observed for the variant with obesity and its minor allele frequency in previous studies.26 The SNP is reported to have a high minor allele frequency in Caucasians (0.4) but very low in Asian population (0.1), whereas its association with obesity and type II diabetes has been confirmed in many populations but not in Chinese Han subjects.11, 18, 19, 20, 21, 22, 23, 24, 25, 26, 34, 35, 36, 37, 38 The six GWAS SNPs were found to be associated with type II diabetes and have been replicated for Chinese and Japanese populations only for association with type II diabetes.27, 28, 29 However, these SNPs have not been previously studied with respect to obesity, and the Pakistani population represents an interesting ethnic group because of the many cultural and social restrictions and with an increasing rate of obesity and type II diabetes. It was important to study whether or not these SNPs have any role in obesity in the Pakistani population.

The results of our study, presented in Table 2, indicate a modest association of the selected variants with obesity, with odds ratios ranging from 1.06 for rs7178572 (HMG20A) to 1.54 for rs9939609 (FTO), with 2/7 of the SNPs also showing association with weight and 1/7 with BMI. However, it is evident that the mechanism of the effect of these SNPs on obesity is only partly through influencing plasma lipid levels, with only 2/7 showing an association with LDL-Cl levels and 1/7 an association with HDL-C, most notably with the FTO gene showing no association with any plasma lipid trait. These results are supported by the function and differential expression of the genes in which these SNPs are located in specific tissues.

We used the unweighted and weighted gene score to study the combined effect of risk alleles on obesity. This is a robust approach, particularly when sample size is small, as it gives information regarding the additive effects of multiple variants in different genes in the same individual. As the weighted gene score takes into account the published effect sizes from large studies, the effect size assigned to each variant is independent of the effect estimated from the current small study such that the power problem is somewhat overcome. We had selected 7 variants in 7 different genes, and this means that an individual could have a maximum of 14 risk alleles. By using a gene score approach, we found that there is a threshold of having at least >4 alleles in order to have a considerable effect on body weight. This may be due to the fact that if only ≤4 risk alleles are present, even the combined effect is so small that the body’s homeostatic mechanisms can maintain an ideal body weight, but if ≥5 alleles are present, the combined effect is pronounced and the result is an increased risk of weight gain and obesity. The effect may be accounted for by the small sample size, and these results need replication in a larger sample size.

The SNP rs3923113 is close to the growth factor receptor-bound protein 14 (GRB14) gene, a strong candidate for the observed association with obesity and type II diabetes. The gene product is an adapter protein that binds to insulin and insulin-like growth factor receptors inhibiting tyrosine kinase signaling.39, 40 GRB14−/− mice are lean and have improved insulin sensitivity. The rs3923113 risk allele is associated with reduced insulin sensitivity and T2D in South Asians, indicating a gain of function.28 This SNP is in LD with another SNP rs10195252 that has been shown to be associated with central adiposity and GRB14 expression in adipose tissue.41

The rs16861329 is an intronic SNP in sialyltransferase 6 galactosidase 1 protein (ST6GAL1) gene, the product of which is an enzyme located in the Golgi apparatus. Although this gene was not linked to T2D previously, glycosylation due to addition of sialic acid is reported to influence glucose metabolism via affecting insulin signaling and trafficking.42 Alternatively, an association with obesity may arise due to the fact that this SNP is located close to the adiponectin (ADIPOQ) gene that encodes adiponectin, a hormone known to increase insulin sensitivity.43

The SNP rs1802295 is in the vacuolar protein sorting-associated protein 26A (VPS26A) gene that encodes a subunit of a large multimeric protein complex involved in transport to trans-Golgi complex.44, 45 This gene is expressed in adipose, pancreatic and some other tissues, and although its relationship with glucose metabolism has not been clearly understood, it is considered as a candidate gene for susceptibility to obesity and T2D because of abundant expression in adipose tissue.46

The fourth SNP rs2028299 is in close proximity to the adapter-related protein complex (AP3S2). This gene is expressed in pancreatic and adipose tissue, and the product is involved in vesicle transport and sorting in these tissues.47 The SNP may directly affect body weight by disturbing vesicular transport in smooth endoplasmic reticulum in adipose tissue or can act indirectly by affecting other genes, that is, it is used as a marker for variation in the perilipin 1 (PLIN-1) gene. The product of PLIN-1 is involved in coating of fat droplets in adipose tissue and regulation of lipolysis, as this gene lies close to AP3S2.48 Many variations in PLIN1 have been found to be associated with obesity in mouse models.49, 50

We have observed a significant association of the SNP rs7178572 with weight, HC and WHR but with serum HDL-C only. This is an intronic SNP in the high-mobility group protein 20A (HMG20A) gene that encodes a regulatory protein controlling gene expression by histone modification and is important in neural development. The SNP may play its role by affecting the expression of genes involved in lipid metabolism, thereby leading to dyslipidemia and obesity.51, 52

The SNP rs4812829 is strongly associated with LDL-C only. It is intronic in the hepatocyte nuclear factor 4 (HNF4A) gene, a strong candidate for association with obesity and T2D. The product of the gene is a transcription factor strongly expressed in liver.53 HNF4A mutations have been reported to cause mature-onset diabetes of young (MODY) type I, involving impaired β-cell function and insulin sensitivity.54 The risk allele of rs4812829 has been found to be associated with reduced β-cell function in subjects of South Asian ancestry.28

The most extensively studied variant of fat mass and obesity (FTO) gene is an intronic SNP, rs9939609. This SNP is known to predispose to obesity, type II diabetes and cardiovascular diseases and has also been linked to psychological disorders.55, 56, 57, 58 A longitudinal study in Brazil found a 0.07 kg/m2/year weight gain for each additional risk allele.59 It has been previously reported that the overall risk of obesity is increased ~1.3-fold per risk allele. This variant is known to affect body weight by disturbing glucose metabolism.26 Interestingly, we have found no association of this variant with any of the lipid traits, indicating that the variant has a direct role on energy regulation to increase body weight.

The study had the limitation of small sample size and therefore limited power. This problem was overcome by using a gene score approach. Although we observed a linear effect of the obesity gene score on lipid traits, the effect on obesity was not linear, which may be due to power problems, and therefore the observed association should be replicated in larger studies in the future. The lead GWAS SNP at a locus may not itself be functional, but rather may act as a marker of a functional variant in one of the nearby genes. Correctly identifying the functional SNP and the gene will be valuable as it will improve the size of effect for risk prediction and correctly identify novel targets for future therapeutic interventions.

Conclusion

The SNPs in the current study had high minor allele frequencies and moderate effect sizes. In future, studies with larger sample sizes should be conducted to not only clarify the effect of already known SNPs but also discover additional novel variants that are involved in affecting this group of related health problems, including obesity, diabetes, hypertension and cardiovascular disorders, and to find at least some common variants with moderate/large effect sizes in the Pakistani population in order to establish a risk score for obesity. Individuals having risk alleles greater than the threshold may be given advice to modify lifestyle, diet and exercise habits so that the genetic predisposition may not ‘switch on’.