Introduction

Chronic obstructive pulmonary disease (COPD) is currently the leading cause of decreases in disability-adjusted life years (DALY). Globally, it is projected to be the third most important cause of DALY by the year 20201, and it is estimated that COPD affects nearly 8.2% of the Chinese adult population2. Cigarette smoking is the major environmental risk factor for COPD; however, only approximately 15% of smokers develop clinically relevant airflow obstruction3. The variation in the susceptibility to cigarette smoke, in combination with the familial inheritance pattern of COPD, suggests that there may be a genetic component to the development of COPD4. Multiple studies in diverse populations have demonstrated the genetic contribution to the variability in pulmonary function and the familial aggregation of COPD. As expected, segregation analysis suggests that multiple genes may be involved in COPD susceptibility. The associations between COPD and polymorphisms in genes with potential importance in COPD pathogenesis have been investigated5; however, only α1-antitrypsin has been unequivocally identified as relevant to the development of COPD. Recently, polymorphisms in the CHRNA3-CHRNA5-IREB2, HHIP, and FAM13A loci have been found to be associated with COPD by genome-wide association studies (GWAS)6,7,8.

Fibroblast growth factor 7 (FGF7), also known as keratinocyte growth factor (KGF), has a variety of effects in the lung. FGF7 stimulates epithelial cell proliferation in the airway and alveoli9,10 and affects the morphology of the developing lung. The disruption of FGF7 receptor function significantly reduces airway branching during development11. The over-expression of FGF7 in the airways of developing mice severely alters lung growth12. FGF7 also stimulates fluid and electrolyte secretion. When applied to fetal lung explants, FGF7 stimulates dramatic intra-luminal swelling and expansion13. FGF7 may also play a role in airway repair following injury. FGF7 induces alveolar type II pneumocyte proliferation in vitro and in vivo9. FGF7 stimulates surfactant protein and phospholipid expression as well as the transepithelial transport of fluids and electrolytes, minimizes injury, enhances the repair of damaged epithelia, and may dampen the epithelial response to inflammatory mediators14. Exogenous FGF7 treatment has been used as a protective agent after oxidant or bleomycin-induced lung injury15,16. Endogenous FGF7 mRNA expression is induced in neonatal rabbits exposed to hyperoxia, and FGF7 protein expression is increased in adult respiratory distress syndrome17, suggesting that FGF7 may play a role in lung repair. Thus, this protein could help repair the damaged lungs of individuals who are smokers and who are at a risk of developing COPD.

In addition, FGF7 has been identified to play roles in several clinical diseases or phenotypes, including nonsyndromic cleft lip and palate18, thyroid volume and goiter risk19. Moreover, FGF7 was identified as a COPD susceptibility locus in a recent report20.

With these considerations in mind, we hypothesized that polymorphisms in the FGF7 gene might modulate susceptibility to COPD. To test this hypothesis, we investigated the association between FGF7 gene polymorphisms and the risk of COPD in a Chinese Han population.

Materials and methods

Subjects

As described previously21, 279 patients with COPD and 367 age-matched non-COPD control subjects were recruited for this study. The subjects in both groups were unrelated ethnic Han Chinese individuals recruited from Chengdu city or surrounding regions in the Sichuan Province of western China. Each subject was personally interviewed face-to-face by trained interviewers who collected the patient's demographic data as well as information related to risk factors such as tobacco smoking. The recruitment and the clinical analyses were conducted at the Department of Respiratory Medicine in West China Hospital of Sichuan University; clinical analyses were performed according to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) criteria22. COPD patients were enrolled when they suffered from cough, sputum production and dyspnea at least upon exertion and showed chronic irreversible airflow limitation defined by an FEV1 (forced expiratory volume in 1 s) to FVC (forced vital capacity) ratio <70%, and FEV1 predicted <80% after the inhalation of a β2-agonist. Patients were excluded from this study if they had other significant respiratory disease, such as bronchial asthma, bronchiectasis, lung cancer, or pulmonary tuberculosis. The age-matched non-COPD control subjects were volunteers who came to the West China Hospital of Sichuan University for physical examination only. The inclusion criteria for controls were as follows: (1) FEV1/FVC ratio >70%, FEV1% and FVC% predicted >80% and (2) without pulmonary disease. Individuals were excluded if they had a history of chronic lung disease, atopy, an acute pulmonary infection in the 4 weeks before assessment for this study, or a family history of COPD.

This study was approved by the Ethics Committee of the West China Hospital, Sichuan University, and written informed consent was obtained from all subjects before their participation in the study. The investigator explained the nature, purpose and risks of the study and provided the subject with a copy of the information sheet.

SNP selection

Based on the linkage disequilibrium and haplotype block analysis of the HapMap project data (http://www.hapmap.org; Public Release #24/Phase II, Nov 2008), we excluded SNPs with low heterozygosity (below 0.1), with low minor allele frequency (below 0.05), or without genotype information. Finally, we chose two TagSNPs at the FGF7 locus, rs10519225 and rs7170426, using the program Tagger with a cut-off of 0.8 for r2 and a MAF (minor allele frequency) of >0.1. rs10519225 is located at bp 49720778 of chromosome 15 (build 37.3) and captures SNPs, including rs17479589, rs11855798, rs17478785, rs4389093, rs10519225, rs12916839, rs12595692, rs4480740, rs17478694, rs12148764, rs2899434, rs7174135, rs9920722, and rs10519224. Meanwhile, rs7170426 is located at bp 49751853 (build 37.3) and captures SNPs, including rs11630629, rs2899433, and rs7167041. Overall, SNPs rs10519225 and rs7170426 provided 62% coverage of the genetic information at the FGF7 locus of the Han Chinese population based on MAF >10% and r2 >0.8.

SNP genotyping

Venous blood was collected from each subject in sterile tubes with EDTA-Na2 anticoagulants and stored at -20 °C. Genomic DNA was extracted from the stored blood using a commercial extraction kit (Bioteke Corporation, Beijing, China) according to the manufacturer's instructions. SNPs were genotyped using the ABI SNaPshot method (Applied Biosystems, CA, USA). PCR was performed using the specific primers: rs10519225 forward 5′-GAAGAAGAGCATTGAGAAC-3′, rs10519225 reverse 5′-TCTGTCTTCCACATCTGTC-3′, rs7170426 forward 5′-GAACCTGCATTTCAGCATTG-3′, and rs7170426 reverse 5′-CTACAAGTCTCAAGCACAGC-3′. The specific SNaPshot primers were used for a SNaPshot reaction using the purified PCR products as templates. The following primers were used for the SNaPshot reaction: rs10519225: 5′-ATTCTGCACCTGACAGAAATAACACTGCTCTTTAAGCTGA-3′ and rs7170426: 5′-TTTATGTAAATTTATAGAGTTTGAGGAATAATTAAAACTAGTATTTCT-3′. The SNaPshot reaction products were then analyzed on an ABI 3130 Genetic Analyzer (Applied Biosystems, CA, USA). To confirm the genotyping results, 10% of the samples were randomly selected and re-genotyped by direct sequencing using a BigDye terminator (Applied Biosystems, CA, USA); no more than a 2% discrepancy was observed for any SNP between the two genotyping methods.

Statistical analysis

The demographic and clinical data between the COPD patients and the control subjects were compared using the chi-squared test and Student's t-test. Hardy-Weinberg equilibrium was tested with a goodness of fit chi-squared test (with one degree of freedom) to compare the observed and expected genotype frequencies. The differences between the COPD patients and the controls with respect to the genotype distributions were analyzed using one-way analysis of variance for univariate analysis and logistic regression for multivariate analysis. Age and sex were used as covariates in the multivariable analyses. A two-sided significance level of q<0.05 was used for all significant tests. Statistical analyses were performed in SPSS version 17.0 and Microsoft Excel. To correct for testing for multiple dependent parameters, the false discovery rate (FDR) was calculated using the Benjamini-Hochberg procedure23. The FDR significance level was set at q<0.05. We used SVS7 (SNP & Variation Suite 7, available at http://www.goldenhelix.com/SNP_Variation/index.html) to calculate the linkage disequilibrium (LD) (r2) and performed the haplotype association.

Results

General characteristics

The baseline characteristics and the results of the pulmonary function tests for the 279 patients with COPD and 367 control subjects were presented in Table 1. All patients had FEV1 values <80% of predicted and thus were diagnosed with moderate-to-severe COPD according to the Global Initiative for Chronic Obstructive Lung Disease (classification of severity: mild=FEV1 ≥80% of predicted; moderate=FEV1 ≥30% to <80% of predicted; and severe=FEV1 <30% of predicted). The COPD cases and control subjects did not significantly differ in sex, age or smoking history. The FEV1, FEV1/predicted and FEV1/FVC were significantly lower in the COPD case subjects compared with the controls (P<0.01).

Table 1 Description of study population.

rs10519225 is associated with COPD

All subjects were genotyped at both rs10519225 and rs7170426 using the SNaPshot method. The genotype distributions are indicated in Table 2. The distributions of each genotype of rs10519225 and rs7170426 in COPD patients and controls were compatible with the Hardy-Weinberg equilibrium (χ2=1.752 and 0.347 for COPD patients, respectively; χ2=0.652 and 0.347 for controls, respectively; all P values were higher than 0.05). For rs10519225, the frequencies of the GG, GA, and AA genotypes were 63.8%, 33.7%, and 2.5% in COPD patients and 70.8%, 26.2%, and 3.0% in controls, respectively. For rs7170426, the observed frequencies of the AA, AG, and GG genotypes were 51.6%, 39.1%, and 9.3% in COPD patients and 55.9%, 37.6%, and 6.5% in controls, respectively. In the univariate analysis, the genotypes of rs10519225 were distributed dramatically differently between COPD patients and healthy controls (P=0.022), whereas the distribution of rs7170426 was not different between two groups. Genotypes containing the minor rs10519225 A allele were associated with COPD susceptibility in an additive pattern (Table 2). After adjustment for age and sex, the association of rs10519225 and COPD remained significant (P=0.011, FDR q=0.022). Multivariate binary logistic regression to estimate the magnitude of effect showed that the odds ratio (OR) for the presence of at least one minor allele of rs10519225 (additive model) was 1.535 (95% CI: 1.104–2.134) (Table 2).

Table 2 Associations of genotypes in rs10519225 and rs7170426 with COPD.

Association between haplotype of rs10519225 and rs7170426 with COPD

LD analysis showed that rs10519225 and rs7170426 were in weak LD, with r2=0.033, D'=0.232, and D' 95% CI (0.150–0.520). Multivariate logistic regression was used to incorporate age, sex, and different rs10519225 and rs7170426 haplotypes into the models. As shown in Table 3, the G–A haplotype of rs10519225–rs7170426 was associated with a lower risk of COPD under the additive model (OR=0.751, 95% CI: 0.598-0.944, P=0.012, FDR q=0.048), whereas none of the other haplotypes were found to have significant effects.

Table 3 Associations of haplotypes between rs10519225 and rs7170426 with COPD*.

Discussion

Case-control association studies of a candidate gene can be a very powerful approach toward identifying the genetic causes of complex diseases such as COPD24. In this case-control study, we evaluated the possible association of FGF7 gene polymorphisms with susceptibility to COPD in a Chinese Han population for the first time. Our results suggested that the rs10519225 polymorphism in the FGF7 gene correlated with modestly higher risks of COPD, with an OR of 1.54 (95% CI: 1.104–2.134), whereas the rs7170426 polymorphism was not correlated with COPD risk. Haplotype analysis, in which several SNPs within the same gene are evaluated simultaneously, can provide more information than a single SNP and thus elevates the statistical power of the analysis25. LD analysis showed that these loci were in weak LD. Therefore, we assayed each of the haplotypes constructed by polymorphisms rs10519225 and rs7170426. The G–A haplotype was associated with a lower risk of COPD in the additive genetic model (OR=0.751, 95% CI: 0.598–0.944, P=0.012, FDR q=0.048). This suggested that the G-A haplotype might serve as a protective genetic factor against COPD in Chinese Han population. Nonetheless, further studies are required to confirm this hypothesis. These results indicated that certain polymorphisms in the FGF7 gene may be crucial in the pathogenesis of COPD. Intriguingly, the human FGF7 gene is located on chromosome 15 at position q21.2, and a previous genome-wide linkage study for generalized COPD identified chromosome 15q21 as meeting genome-wide criteria for suggestive linkage26. Moreover, another polymorphism within the FGF7 gene, rs4480740, was recently reported to be associated with COPD in an additional population20. We assessed the extent of LD between rs10519225, rs7170426, and rs4480740 in the Han Chinese population using HapMap project data. Both rs10519225 and rs7170426 were found to be in strong LD with rs4480740 (Figure 1). Although we did not carry out this association study in an independent population, the strong LD between rs10519225 and rs4480740 suggested that the association of rs10519255 with COPD observed in the current study was unlikely to have occurred by chance.

Figure 1
figure 1

Relative position of SNPs and linkage disequilibrium map for FGF7 in the Han Chinese population populations studied. Polymorphisms are identified by their dbSNP rs numbers, and their relative positions are marked by vertical lines within the white horizontal bar. The numbers within squares indicate the D' value, expressed as a percentile.

It should be acknowledged that the SNPs associated with the statistical signal might simply play a role as a surrogate marker for the causal functional SNP or SNPs. That is, rs10519225 in FGF7 gene could be in linkage disequilibrium with another polymorphism of the gene that impacts FGF7 activity. Moreover, rs10519225 is located within an intronic region of FGF7, which is usually removed during the gene-splicing process. Intronic SNPs may modify gene function by affecting the regulation of gene expression27. The dysfunction of FGF7 may pose a risk for the improper development of the airway and alveoli. It has been proposed that persons with smaller initial lung volumes are more likely to develop overt manifestations of disease during the accelerated decline of lung function in COPD. Alternatively, FGF7 may play a role in disease susceptibility through its role in the protection against the oxidative stress response specifically in the lung epithelium.

The direction of association with disease is noteworthy; rs10519225 heterozygosity was associated with susceptibility to COPD in this study. The finding of heterozygote susceptibility is unusual in genetic disease association studies, but is well described in the study of genetic susceptibility to human pulmonary or airway diseases, including COPD and asthma28,29,30.

We are aware that the significant results in this study could prove to be false positives because of the relatively small sample size, but even with a larger sample, the functional and biological impacts of the described polymorphisms would require further study. Possible gene-gene and gene-environment interactions also pose a challenge for genetic analysis of COPD association studies. Further studies using larger populations are needed, and other variants in the FGF7 gene should be investigated to clarify the association of FGF7 and individual susceptibility to the development of COPD.

In conclusion, we have investigated the link between FGF7 gene polymorphisms and the risk of COPD in the Chinese Han population for the first time. The rs10519225 genotype frequencies are significantly different in COPD patients and controls, suggesting that this SNP may be useful for predicting COPD susceptibility. Similarly, the rs10519225G-rs7170426A haplotype may be protective against COPD in the Chinese Han population. Further functional characterization of these SNPs is required to clarify the significance of FGF7 in the pathogenesis of COPD.

Author contribution

Si-cheng XU, Jiang-ying KUANG, Jin LIU, Chun-lan MA, and Yu-lin FENG contributed to the study design and genotyping; Zhi-guang SU contributed to the study design and data analysis and wrote the manuscript.