Introduction

Obesity is a major public health problem worldwide and affects individuals across all age groups and ethnicities1. According to the World Health Organization (WHO), there are up to one billion overweight individuals worldwide, of which 300 million would be classified as clinically obese (www.who.int). Moreover, the situation in most developed countries is more severe. Obesity is a leading risk factor for many health problems such as heart disease, type 2 diabetes and most cancers2; furthermore, it accounts for up to 7.8% of the total health care expense in developed countries3. Numerous studies have attempted to illustrate the pathogenesis of obesity.

A multitude of genes associated with obesity have been discovered, with one of the most important being SOCS3 (suppressor of cytokine signaling 3), which is located in the chromosome region 17q24-17q25. SOCS3 works as a feedback inhibitor for a range of cytokine signals and inhibits the function of leptin and downstream steps in the insulin signaling pathway to regulate energy balance4, 5, 6, 7. Although this function has been confirmed in animal models, the association data from human studies are relatively limited. Talbert et al have shown that the SOCS3 gene is related to body mass index (BMI), visceral adipose (VAT), and waist circumference (WAIST) in Hispanic families8. However, in the study performed by Jamshidi et al9 on the association analyses of common polymorphisms (rs4969169, rs12953258, and rs8064821) in SOCS3 between two normal female twins that examined body weight, insulin sensitivity or lipid profile, none of the three polymorphisms were found to be associated with obesity. In addition, the study on the association analysis of two polymorphisms in the coding sequence and promoter region of the SOCS3 gene (including -920 C>A (rs12953258) and -1044 C>A) in German children and adolescents with extreme obesity suggested that no association was observed10.

This report focuses on the association between the SOCS3 gene and obesity-related traits in the Kazak, Uygur and Han nationalities from the Xinjiang province of China. Two SNP polymorphisms were selected according to the data from tag-polymorphisms of the HapMap project and previous studies11, 12. One polymorphism is the A+930→G (rs4969168) polymorphism that is located in the 3′UTR of the SOCS3 gene, which is a haplotype tagging SNP (htSNP) that sufficiently covers the genetic variation of the entire gene. The other SNP selected for this study is rs9892622 A→G, a high-frequency variant near the gene, which is located in the upstream promoter region of the SOCS3 gene and appears to be a reasonable functional candidate for the SOCS3 locus and obesity.

Materials and methods

Subjects

Subjects (n=2711) were recruited from three races of Xinjiang in China. All subjects passed the normal medical examination and were selected randomly, including 1045 Kazakh, 804 Uygur and 862 Han individuals within the age range of 19–87 years old. The local ethics committee approved the study, and all participants provided informed consent. Standing height (height) was measured (to the nearest 0.01 m) with a stadiometer. Body weight (weight) was measured (to the nearest 0.01 kg) with a weighing machine. Body mass index (BMI) was calculated as the weight in kilograms divided by the square of the height in meters. Lastly, waist circumference (WAIST), hip circumference (HIP), and waist to hip ratio (WHR) were determined as previously described13.

DNA preparation

Genomic DNA was extracted from peripheral blood leukocytes of all samples by the phenol-chloroform nucleic acid extraction method. The genomic DNA concentrations of samples were measured by using a NanoDrop Spectrophotometer (Thermo Fisher, Boston, MA, USA) and were qualified for analysis when the ratio of wavelengths 260/280 ranged between 1.7–2.0. Next, genomic DNA samples were diluted to 10 ng/μL as the working concentration.

Polymorphisms selection and genotyping

Two SNP variants rs4969168 and rs9892622 were selected. Rs4969198 was selected as a tagSNP to cover all of the related polymorphisms on SOCS3 that are based on the recently released HapMap data. Because common variants are susceptible to common diseases, we chose another potential functional locus with a relatively high frequency, rs9892622, in the promoter region of SOCS3. Genotyping was performed using the 5′-nuclease TaqMan-MGB assay in 384-well plates and the ABI PRISM 7900HT Sequence Detection system (Applied Biosystems, Foster City, CA, USA). The raw data were read at the end-point and were clustered into three groups that represented the three genotypes.

Statistical analysis

The data were analyzed with SPSS (version 13.0). Before any other analysis, the maximum likelihood estimate of allele frequencies was tested for departures from Hardy-Weinberg Equilibrium proportions (HWE) using a chi-squared goodness of fit test to make sure that there were no typing errors or bias from population structure. The distribution of obesity-related quantitative traits (BMI, height, weight, WHR, WAIST and HIP), characteristic variants (age, gender, etc) and genotypes among the three nationalities were analyzed by one-way ANOVA with SNK test. A t-test (independent samples) was used to investigate the distribution of variants in different genders. Stratified analysis by nationality and gender was performed to explore the interaction between environmental and genetic factors. Additionally, linear regression analysis was used to determine the importance of obesity-related traits. To avoid over-correction induced by measuring too many obesity-related traits, only the variants of age, nationality, gender and genotypes were involved in the regression analysis. The P-value confidence level was set at 0.05 for the above tests.

Results

Sample characteristics

After removing samples with missing data, a total population with n=2667 samples was left, which comprised the three races, Kazakh, Uygur and Han; the number of samples from each race and their percentage of the total sample population were 1017 (38.14%), 798 (29.92%), and 852 (31.94%), respectively. The distribution of sample characteristics (mean±SD of obesity-related traits and distribution of genotypes of SNP polymorphisms) in each nationality are displayed in Table 1. The statistical differences of these variants among nationalities were determined by the Fisher LSD test. The data showed that all of the variants (age, BMI, weight, height, WHR, WAIST, HIP, rs4969168, and rs9892622) were statistically different among the three nationalities and indicated that analysis of the interaction of gene polymorphisms and environmental factors in a total population would be inappropriate; rather, a separate analysis of the effects of interaction within individual populations of each nationality would be more appropriate.

Table 1 Sample characteristics.

The distribution of variants, except polymorphisms, in different genders was also statistically analyzed by independent samples in a t-test (Table 2). It was found that the variants, except WHR, were significantly different between males and females. The distributions of genotypic polymorphisms in different genders were analyzed by a χ2 square test; no difference was found in the genotypic frequencies between genders, which indicated that no typing bias or error existed.

Table 2 Distribution of variants in different genders.

Genotyping and allele frequencies

The genotyping success rates of both rs4969168 and rs9892622 were 97.4%. The allele frequencies of SOCS3 rs4969168 and rs9892622 for the total population and each subpopulation (classified by race) were collected (Table 1). In the Kazakh, Uygur and Han subpopulations, the allele frequencies of rs4969168 were 0.582, 0.625, and 0.572, respectively, and the allele frequencies of rs9892622 were 0.453, 0.499, and 0.452, respectively. There was no departure from the Hardy-Weinberg equilibrium in any of the groups (P>0.05). One-way ANOVA was used to determine that the distribution of polymorphisms was statistically different among the three nationalities.

Association of the SOCS3 gene with obesity-related quantitative traits

Overall analysis

The association of the two polymorphisms (rs4969168 and rs9892622) with BMI, WHR, height, weight, WAIST, and HIP was determined by one-way ANOVA with SNK test, but no evidence of association was found between each of the two polymorphisms and the above quantitative traits in the total population.

Stratified analysis of nationality

Because obesity is due to the interaction of environmental and genetic factors and the above data showed differences between nationalities or gender regarding obesity-related traits, it was supposed that environmental background, race, gender or age might have a significant interaction with the polymorphisms, which may lead to obesity or related traits. Therefore, stratified analysis was performed based on the interaction between nationalities and polymorphisms. Although no significant difference was observed among the three genotypes of rs4969168 in the total population, the data indicated that a remarkable difference existed within the subpopulations (Table 3 and 4). Significant differences were found in the mean of BMI, weight, and WAIST among the pair-wise comparison of the genotypes of rs4969168. Although most compared pairs were not found to be statistically significant, there were noticeable trends (Table 3). Based on the data from the Han group, as the frequency of the G allele increased, the BMI, weight and WAIST measurements decreased, which suggests that the G allele may play a role in maintaining a reduced body weight in the Han group. However, the opposite trend was seen in the Kazakh group but not in the Uygur group. The potential cause of this trend will be further covered in the latter part of the discussion.

Table 3 Mean of BMI, Weight, WAIST and WHR of each genotype of rs4969168 in each nationality.
Table 4 P-value of pairwised comparison among genotypes of rs4969168.

For rs9892622, a significant difference for obesity-related traits was not observed in the total population. Statistically meaningful results only appeared in the male subpopulation of the Uygur group, but no obvious trend could be summarized from the datasets (Table 5 and 6).

Table 5 Mean of BMI, Weight, WAIST, and WHR of each genotype of rs9892622 in each nationality.
Table 6 P-value of pairwised comparison among genotypes of rs9892622.

Linear regression analysis

Linear regression analysis confirmed the previous results in which nationality and gender were included, but age and the two polymorphisms excluded, in the model with P<0.05. In the following stratified analysis by nationality or gender, the two polymorphisms were also included with P<0.05, but age was excluded from the model. Because nationality seemed to be the most effective factor in our data, we performed a linear regression analysis for each nationality. In the Han population, gender was the first factor involved in the regression (P=0.004), followed by rs4969168 (P=0.015).

Discussion

With the release of data from the HapMap phase III project and completion of several whole genome association studies on several common diseases, scientists realized that tagSNPs in target genes could be selected as candidate loci to cover all of the polymorphisms involved in the haploblocks of the target gene.

SOCS3 is one of several genes involved in cytokine signaling and regulates the function of proteins downstream, inhibits the insulin signaling pathway and stimulates the upregulation of TNFα in adipose tissue14. The protein encoded by SOCS3 is also important in energy balance and regulation4, 5, 6, 7. This result has been confirmed in animal models, but the association data in humans have been relatively limited. Research by Talbert et al has shown that the SOCS3 gene is related to body mass index (BMI), visceral adipose (VAT), and waist circumference (WAIST) in Hispanic families8. However, in a study reported by Jamshidi9 on the association analyses of common polymorphisms (rs4969169, rs12953258 and rs8064821) in SOCS3 with body weight, insulin sensitivity or lipid profile in normal female twins, none of these three polymorphisms was found to be associated with obesity. In a previous study, an association analysis of two polymorphisms (-920 C>A (rs12953258) and -1044 C>A) was performed, one in the coding sequence and another in the promoter region, in the SOCS3 gene in German children and adolescents with extreme obesity10. As a result, there was no association observed between the two polymorphisms in SOCS3 and extreme obesity. Weight is a complex phenotype that is affected by several biological pathways; the most important two pathways are the neural control system of food intake and energy balance regulation. Thus, one explanation to these ambivalent results is that the pathogeny of obesity differs among populations.

Another potential explanation is that different haploblock distributions appear in different ethnic groups or in different nationalities. Consequently, pathogenic loci linked to tagSNPs can be partially or totally different, and in some cases, special ethnic tagSNPs would lose their ability of catching pathogenic loci in other ethnic studies, which seems to have happened in our study. Our data suggest that there is no association between rs4969168 and obesity-related traits in the total population (P>0.05). However, a significant association was observed for BMI and weight in the Han population and male subgroup, which indicates that the polymorphism rs4969168 seems to have a different association effect on different nationalities and different genders. In addition, the data showed that the allele A of rs4969168 was associated with increased BMI and weight in the Han group. Moreover, the population with the AA genotype of rs4969168 is more susceptible to weight-gain in the Han population, especially in males. However, a totally opposite result was discovered in the Kazakh population, which indicates that the mechanisms may likely be distinct between these two nationalities. Our results are reasonable given that SOCS3 serves as both a pivotal and hub function in a complex regulatory system, which further suggests that SOCS3 may also play a dual role in weight control.

Similar to rs4969168, there was no evidence for an association between rs9892622 and obesity-related traits in the total population, but a significant association appeared in the stratified analysis. Interestingly, distinct from rs4969168, rs9892622 was associated with BMI, weight and WAIST in the Kazakh and Uygur populations rather than the Han population.

In conclusion, whether weight control is attributed to different pathogenic factors or the distribution of ethnic haploblocks, the genetic background seems to certainly play a basic role. According to our current data, both rs4969168 and rs9892622 are associated with obesity, and the association effect is different according to nationality and gender. Further study is required to fine-map the pathogenic loci and discover the functional impact of SOCS3 in obesity.

Author contribution

Wei TANG designed the research, executed the experiments and wrote the paper; Yong-quan SHI and Zhi-min LIU designed the research and wrote the paper; Jun-jie ZOU and Jiao-yang ZHENG contributed analytic tools; and Xiang-fang CHEN and Hua-zong ZENG analyzed the data.