Introduction

Lung cancer is the leading cause of cancer-related death worldwide and responsible for approximately 1.3 million deaths each year1. Despite the great progress made in several areas of oncology, the prognosis of lung cancer remains dismal2. The exact cause and mechanism of lung cancer are still under investigation. Epidemiological studies have demonstrated tobacco smoking as well as exposure to environmental tobacco smoke in healthy non-tobacco users as the major risk factor for lung cancer3. However, not all smokers develop lung cancer and a fraction of life long non-smokers will die from lung cancer indicating that genetic factors may play a significant role in determining the susceptibility to lung cancer3,4.

The matrix metalloproteases (MMPs) are zinc-dependent endopeptidases that degrade the extracellular matrix collagens and belong to a larger family of proteases known as the metzincin superfamily. 5,6 Matrix metalloproteinase-1 (MMP-1) may degrade a broad range of substrates including the interstitial types I, II, III collagens as well as casein and contribute to tumor growth and spread by altering the cellular microenvironment to favor tumor formation. 5,6,7,8 Among secreted MMPs, MMP-2 and MMP-9 are known to play a major role in cancer invasion and metastasis development by their ability to degrade type IV collagen9. Furthermore, overexpression of MMP13 has been related to more aggressive tumors and poor prognosis in lung cancer10,11.

Polymorphisms in the regulatory regions of MMPs have been associated with changes in the expression level of these genes in different human cancer12,13,14. Up to now, many molecular epidemiological studies have investigated the association between the MMPs polymorphism and lung cancer risk15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31. However, the results remain controversial and ambiguous. Several meta-analysis have been performed to assess MMPs polymorphism in lung cancer, but these analyses are mainly based on traditional approaches, which would lead to multiple comparisons or erroneous mode specification without prior biological evidence. Therefore, we conducted this meta-analysis based 22 case-control studies by a comprehensive statistical strategy of a logistic regression and a model-free approach32,33.

Materials and Methods

Search Strategy

We searched for relevant studies up to May 2014 through the PubMed, Embase, Wanfang (http://www.wanfangdata.com.cn), China National Knowledge Infrastructure Platform (CNKI; http://www.cnki.net) database with the following terms and their combinations: “lung cancer/carcinoma”, “polymorphism/variant” and “metalloproteinase/MMP”. We tried to identify potentially relevant studies from the whole reference lists by orderly reviewing title, abstract and full text.

Selection criteria

The inclusion criteria were as follows: a) case-control studies focused on the association of MMP1, MMP2, MMP9 or MMP13 polymorphism and lung cancer; b) genotype and allele data available. Studies were excluded for following reasons: a) unpublished papers, reviews and duplication of publications; b) data unavailable for calculating genotype or allele frequencies; c) no control population. Additionally, investigations of departure from Hardy-Weinberg equilibrium (HWE) was excluded from the final analysis. If more than one article was published using the same case series, we selected the study with the largest sample size.

Data extraction

All the available data were extracted from each study by two investigators (H X L and X Y L) independently according to the inclusion criteria listed above. For each study, we recorded the first author, year of publication, country of origin, ethnicity, the method of genotyping, the number of cases and controls and genotype distributions in cases and controls.

Statistical analysis

Hardy-Weinberg equilibrium was examined by chi-square goodness-of-fit test (P > 0.05) using gene frequencies of the healthy individuals. Metagen (http://bioinformatics.biol.uoa.gr/~pbagos/metagen/) was used by selecting the genetic model. Two parameters, θ2 and θ3, were calculated using the formula: log it (πij) = αi + θ2zi2 + θ3zi3 and OR AB/AA = exp (θ2), OR BB/AA = exp (θ3); where αi is the indicator of study-specific fixed-effect; θ2 and θ3 are dummy variables of genotypes AB and BB. The appropriate genetic model was identified using the following criteria:(i) No association: θ2 = θ3; (ii) Dominant model: θ2 = θ3 > 0; (iii) Recessive model: θ2 = 0 and θ3 > 0; (iv) Additive model: 2θ2 = θ3; (v) Co-dominant model: θ3 > θ2 > 0; (vi) Complete overdominant model: θ2 > 0 and θ3 = 0. Finally, once the most appropriate genetic model was identified, the pooled OR with corresponding 95% confidence interval (95% CI) was estimated in logistic regression model. Additionally, Zintzaras reported a novel method to calculate the generalized odds ratio (ORG) based on a genetic model-free approach, which may overcome the short-comings of multiple model testing or erroneous model specification33. Thus, the ORG calculations were also performed.

The heterogeneity of the studies was assessed using the Cochran’s Q test (considered significant for P < 0.10) and was quantified by the I2 statistic. Both fixed effects (Mantel-Haenszel) and random effects (Der Simonian and Laird) models were used to combine the data. Relative influence of each study on the pooled estimate was assessed by omitting one study at a time for sensitivity analysis. Publication bias was evaluated by visual inspection of symmetry of Begg’s funnel plot and assessment of Egger’s test (P < 0.05 was regarded as representative of statistical significance). Statistical analyses were done in ORGGASMA, metan and metagen in STATA software, version 11.0 (STATA Corp., College Station, TX, USA) and all tests were two-sided.

Results

Characteristics of the studies

There were 330 papers relevant to the search words. The flow chart of selection of studies and reasons for exclusion is presented in Fig. 1. Overall, 18 publications with 22 case-control studies including 8202 cases and 7578 controls were available for this analysis. Seven studies with 3996 cases and 3507 controls for MMP1-1607 1G/2G polymorphism, five studies with 2004 cases and 1967 controls for MMP2-1306 C/T polymorphism, three studies with 1229 cases and 1303 controls for MMP2-735 C/T polymorphism, four studies with 1202 cases and 1039 controls for MMP9-1562 C/T polymorphism and three studies with 1221 cases and 1225 controls for MMP13-77A/G polymorphism. Study characteristics are summarized in Table 1. The genotype distributions in the controls of all studies were consistent with HWE.

Table 1 Characteristics of studies included in this meta-analysis.
Figure 1
figure 1

Flow diagram of studies identification.

Quantitative synthesis

There was a variation in the 2G allele frequency of the MMP1-1607 1G/2G polymorphism among the controls across different ethnicities, ranging from 0.46 to 0.71. For Asian controls, the 2G allele frequency was 0.56, which was slightly higher than that in Caucasian controls (0.53, P = 0.791; Fig. 2A). Another variation was in the T allele frequency of the MMP2-1306 C/T polymorphism among the controls across different ethnicities, ranging from 0.17 to 0.19. For Asian controls, the T allele frequency was 0.17, which was slightly lower than that in Caucasian controls (0.18, P = 0.249; Fig. 2B).

Figure 2
figure 2

Frequencies of the variant alleles among control subjects stratified by ethnicity. (a) MMP1 -1607 2G allele; (b) MMP2 -1306 T allele.

Five common SNPs occurred in MMP1, MMP2, MMP9 and MMP13 sequences were included in the quantitative synthesis and detail results were shown in Table 2. For the MMP1 -1607 1G/2G polymorphism, the pooled OR1G2G/1G1G and OR2G2G/1G1G were 1.08(95%CI = 0.96-1.21) and 1.16(95%CI = 1.02-1.33), respectively, suggesting an additive model was assessed using traditional method. Overall, no significant association with lung cancer risk was detected for MMP1 -1607 1G/2G polymorphism in additive model and heterogeneity between studies was observed in the overall comparison. In subgroup analysis based on ethnicity, however, the heterogeneity disappeared and a significantly increased risk was found in Asians(OR = 1.34, 95%CI:1.18-1.53) (Fig. 3). Based on the model-free approach, significant result was also produced for MMP1 -1607 1G/2G polymorphism and lung cancer risk among Asians(ORG = 1.41, 95%CI:1.21-1.65). No significant association was found in subgroup analyses based on the source of control and sample size (Table 2).

Table 2 Meta-analysis of MMP1,MMP2,MMP9 and MMP13 polymorphism and lung cancer risk.
Figure 3
figure 3

Odds ratios (OR) and 95% confidence interval (CI) of individual studies and pooled data for the association of MMP1-1607 1G/2G polymorphism and lung cancer risk in additive model.

For the MMP2 -1306 C/T polymorphism, the pooled ORCT/CC and ORTT/CC were 0.54(95%CI = 0.47-0.63) and 0.53(95%CI = 0.33-0.85), respectively, suggesting no appropriate genetic model was assessed using traditional method. Based on the model-free approach, significant result was found in the overall comparison (ORG = 0.64, 95%CI:0.46-0.87) and among Asians (ORG = 0.49, 95%CI:0.42-0.57), but not among Caucasians(ORG = 1.09, 95%CI:0.74-1.59) (Fig. 4). Stratified by source of control, a significantly risk was found in the population-based studies, however, no significant association was observed in the hospital-based studies (Table 2). When stratifying by sample size, a significant association was found in sample size ≥ 500 studies (Table 2). No significant heterogeneity between studies was observed in subgroup analyses.

Figure 4
figure 4

Forest plot of the generalized odds ratio (ORG) and 95% confidence intervals (CIs) of studies on the association between lung cancer and the MMP2-1306 C/T polymorphism based on model-free approach.

For the MMP2 -735 C/T polymorphism, the pooled ORCT/CC and ORTT/CC were 0.70(95%CI = 0.59-0.83) and 0.75(95%CI = 0.51-1.10), respectively, suggesting no appropriate genetic model was assessed using traditional method. Based on the model-free approach, significant result was found in the overall comparison (ORG = 0.72, 95%CI:0.62-0.84) and among Asians(ORG = 0.71, 95%CI:0.61-0.84), but not among Caucasians(ORG = 0.85, 95%CI:0.44-1.67). Stratified by source of control, a significantly risk was found in the population-based studies, however, no significant association was observed in the hospital-based studies (Table 2). When stratifying by sample size, a significant association was found in sample size ≥ 500 studies (Table 2). No significant heterogeneity between studies was observed in the overall comparisons as well as in subgroup analyses.

For the MMP9 -1562 C/T polymorphism, the pooled ORCT/CC and ORTT/CC were 1.14(95%CI = 0.70-1.87) and 0.46(95%CI = 0.11-2.00), respectively, suggesting a complete overdominant model was assessed using traditional method. Overall, no significant association with lung cancer risk was detected for MMP9 -1562 C/T polymorphism in complete overdominant model and heterogeneity between studies was observed in the overall comparison. In subgroup analysis based on ethnicity, however, a significantly decreased risk was found in Asians(OR = 0.40, 95%CI:0.25-0.64), suggesting homozygotes were at a lesser risk of lung cancer than heterozygotes. Based on the model-free approach, significant result was also found in Asians(ORG = 2.73, 95%CI:1.74-4.27), suggesting lung cancer cases with higher mutational load than healthy individuals have higher risk for lung cancer susceptibility. Stratified by source of control, a significantly risk was found in the population-based studies, however, no significant association was observed in the hospital-based studies (Table 2). When stratifying by sample size, no significant association was found (Table 2). Heterogeneity between studies was observed in the overall comparisons and subgroup analysis based on sample size, but not in subgroup analysis based on ethnicity.

For the MMP13 -77A/G polymorphism, the pooled ORAG/AA and ORGG/AA were 0.99(95%CI = 0.83-1.18) and 1.32(95%CI = 1.03-1.67), respectively, suggesting a recessive model was assessed using traditional method. In recessive model, no significant association with lung cancer risk was detected for MMP13 -77A/G polymorphism in overall comparison and subgroup analysis (Fig. 5). Based on the model-free approach, no significant result was also found in overall comparison and subgroup analysis (Table 2).

Figure 5
figure 5

Forest plot of the generalized odds ratio (ORG) and 95% confidence intervals (CIs) of studies on the association between lung cancer and the MMP13-77 A/G polymorphism in recessive model.

Sensitive analysis

Sensitivity analyses were performed to assess the influence of individual dataset on the pooled ORs by sequential removing each eligible study. As seen in Fig. 6, any single study was omitted, while the overall statistical significance does not change, indicating that our results are statistically robust.

Figure 6
figure 6

Sensitivity analysis: examining the influence of individual studies to pooled odds ratios (OR). (a) MMP1-1607 1G/2G polymorphism in additive model; (b) MMP2-1306 C/T polymorphism for generalized odds ratio (ORG).

Publication bias

Begg’s funnel plot and Egger’s test were performed to assess publication bias among the literatures. As shown in Fig. 7, there was no evidence of publication bias for MMP1 -1607 1G/2G in additive model (Begg’s test P = 1.000; Egger’s test P = 0.703) and MMP2 -1306 C/T in generalized odds ratio (Begg’s test P = 0.221; Egger’s test P = 0.076).

Figure 7
figure 7

Begg’s funnel plot for publication bias test. Each point represents a separate study for the indicated association. (a) Funnel plot for additive model of MMP1-1607 1G/2G polymorphism; (b) Funnel plot for generalized odds ratio (ORG) of MMP2-1306 C/T polymorphism.

Discussion

Meta-analysis is a powerful statistical tool to resolve the discrepancies across individual studies by integrating existing published data. At present, the majority of meta-analyses of genetic association studies are usually conducted by comparing genotype frequencies between cases and controls under various genetic models. However, these genetic models are not independent and a priori knowledge or biological justification for model selection is seldom available34,35. Therefore, we performed this meta-analysis about MMP1, 2, 9 and 13 polymorphisms and lung cancer risk by a comprehensive strategy, including logistic regression and model-free approach32,33, to avoid erroneous model specification and multiple model tests with the risk of an inflated Type I error rate.

In the current study, a total of 22 case-control studies with 8202 cases and 7578 controls were included in the meta-analysis12,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31 and the association between MMP1-1607 1G/2G, MMP2-1306 C/T, MMP2-735 C/T, MMP9 -1562 C/T and MMP13 -77A/G polymorphisms and lung cancer risk was explored. Our results suggest that MMP1-1607 1G/2G, MMP2-1306 C/T, MMP2-735 C/T, MMP9 -1562 C/T polymorphisms were significantly associated with lung cancer risk among Asian population, but there is no association found between MMP13 -77A/G polymorphism and susceptibility to lung cancer.

This finding may be biologically plausible. MMPs play roles in many important physiological and pathological processes including cancer and lung inflammation through degradation of basal membranes and extracellular matrix24,25,36. Expression of MMPs has been linked to a wide range of cancer types including lung cancer and has been reported to be correlated with tumor invasion and poor prognosis24,25,26,27. In recent years, several SNPs (MMP1-1607 1G/2G, MMP2-1306 C/T, MMP2-735 C/T, MMP9 -1562 C/T and MMP13 -77A/G) in the promoter region of the MMP genes have been reported26,27,28,31. Functional analyses of these polymorphisms in MMP genes have found their modulatory effect on transcriptional activity, leading to alterations in the gene expression13,14,36,37. For MMP1-1607 1G/2G polymorphism, the promoter with the 2G allele has significantly stronger transcriptional activity compared with the 1G promoter, because the 2G allele creates a transcription factor binding site and increases transcription capacity38. It has been demonstrated that individuals with CC genotype of both MMP2 -735 C/T and -1306 C/T polymorphisms have higher promoter activity and higher MMP-2 enzyme activity compared with those with the TT genotype and thus may have potentially higher risk of lung cancer39,40. The MMP9 -1562 C to T substitution has been shown to up-regulate the promoter activity and the presence of the -1562T allele has also been found to associated with the decease of the capacity of a putative transcription repressor protein with a subsequent increase in gene expression41. Results from six independent transfection experiments in vitro with MMP13 -77A/G constructs indicated that the constructs with A had about twice as much transcriptional activity as the constructs with G in the same location37. It has been suggested that these SNPs are associated with the development of different human cancer13,14,15,16,42,43.

MMP polymorphisms and lung cancer risk have been investigated by several meta-analyses44,45,46,47. Recently, Hu et al conducted a comprehensive meta-analysis about five MMP polymorphisms and lung cancer susceptibility and found that the MMP1-1607 1G/2G and MMP2-1306 C/T confer significantly susceptibility to lung cancer and MMP1-1607’s effect was dependent on ethnicity, consistent with the results of this meta-analysis44. Compared with Hu’s work, we excluded three studies deviating from Hardy-Weinberg equilibrium(HWE)7,48,49, identified more eligible studies21,28,29,30 and performed a detailed analysis by logistic regression and model-free approach. We also found some significant associations that were not observed in Hu’s study, one example of which was that we found the MMP2 -735 C/T decreased lung cancer risk for Asians, whereas no significant result was found for Caucasians. On the other hand, we also analyzed the MMP13 -77A/G polymorphism. Compared with several other meta-analysis about MMP polymorphisms and lung cancer risk reported by Guo XT et al45, Hu J et al 46 and Wang J et al 47, we identified more eligible studies, evaluated more SNPs(MMP1-1607 1G/2G, MMP2-1306 C/T, MMP2-735 C/T, MMP9 -1562 C/T and MMP13 -77A/G) and performed analysis by a comprehensive strategy, while they only analyzed single polymorphism and lung cancer risk.

Some heterogeneity factors between studies that could limit the strengths of the meta-analysis should be addressed. First, ethnicity was one of the most important factors that could lead to heterogeneity because of the diverse genetic backgrounds and environmental factors in different ethnicities. Second, the source of the controls was another factor that could lead to heterogeneity. Population-based controls could be more reliable than hospital-based controls because the genotype distributions in hospital-based controls may be deviated from normal. In this study, significant heterogeneity was found in three of the five polymorphisms. For these polymorphisms, the heterogeneity disappeared in subgroup analysis based on ethnicity, suggesting that ethnicity of the studied population are the major source of the heterogeneity.

The current study has some inevitable limitations that should be acknowledged. First, only published studies were included in this meta-analysis, which may have biased our results. Second, there was significant heterogeneity among included studies. Even though we used the random-effect model to calculate pool ORs, the precision of outcome would be affected. Third, our results were based on an unadjusted estimated, a more precise analysis would have been conducted if more detailed individual data were available.

In summary, we concluded that the MMP1-1607 1G/2G, MMP2-1306 C/T, MMP2-735 C/T, MMP9 -1562 C/T polymorphisms were risk factors for lung cancer among Asians, while MMP13 -77A/G polymorphism was not associated with lung cancer risk. However, future well designed large studies, particularly stratified by gene-gene and gene-environment interactions might be necessary to clarify the possible role of the MMP polymorphisms in the susceptibility to lung cancer.

Additional Information

How to cite this article: Li, H. et al. Association of matrix metalloproteinase family gene polymorphisms with lung cancer risk: logistic regression and generalized odds of published data. Sci. Rep. 5, 10056; doi: 10.1038/srep10056 (2015).