Introduction

The candidate-gene approach, which is hypothesis-driven and builds on a priori knowledge of the gene’s biological functional impact on the trait or disease in question, has been extensively adopted in genetic association studies.1 However, one of the compelling problems facing most association studies is the lack of consistency in effect estimates. This can most likely be attributed to genetic heterogeneity across races/ethnicities, limited statistical power stemming from an insufficient sample size and lack of adjustment for potential confounders. Even in most studies where a significant association is reported, the odds ratios for individual genetic variants are often <2.2, 3 Such low-effect estimates are not surprising given that the pathogenesis of complex diseases such as hypertension usually involves a multistep, multigenic process. Furthermore, it is unlikely that any one individual genetic variant could explain a large proportion of variability in disease susceptibility. Therefore, tests for joint impacts of multiple candidate variants may provide valuable information in the search for hypertension-susceptibility genes.

For this project, we focused on the components of the matrix metalloproteinase (MMP) family as hypertension candidate genes. MMPs are zinc-dependent endopeptidases that belong to a large family of structurally related proteases capable of degrading all types of extracellular matrix proteins.4 Extracellular matrix proteins have a supporting and structural role in the vascular wall, and their high content has been found to be associated with an increased risk of hypertension and target organ damage.5 MMPs can also modify extracellular matrix proteins and exert a significant impact on many physiological and pathological processes.5, 6 Consequently, the genes encoding MMPs are regarded as logical candidates for determining the molecular mechanisms underlying hypertension. MMPs are classified into collagenases (MMP-1, -8, -13, -18), stromelysins (MMP-3, -10, -11), gelatinases (MMP-2, -9) and membrane-type MMPs.7 Each MMP family member is coded by a separate gene and has a different tissue distribution and bioactive role.8 The regulatory expression levels of MMP genes may involve mRNA stability mechanisms in the cytoplasm. Genetic defects that have been reported to affect the binding site of transcription factors or cause the loss of function of MMP enzymes may also be involved.9, 10, 11 Considering the polymorphic nature of the encoding genes of MMP family members, it is of added interest to determine which MMP genetic variants have the functional potential to affect the final bioavailability of the family members and thus the progression of hypertension.

To generate more information, we sought to examine the association between a comprehensive panel of genetic variants in the MMP family genes and the risk of hypertension in a large northeastern Han Chinese population. In total, 10 genetic variants were selected from five genes encoding MMP-1 (rs1799750, rs1144393), MMP-2 (rs243865, rs243864), MMP-3 (rs3025058, rs617819, rs679620), MMP-9 (rs3918242, rs2664538) and MMP-12 (rs2276109). All study variants have previously been examined in published association studies and are located in the promoter or coding regions, with minor allele frequencies of at least 5%.12, 13, 14, 15, 16 In the present study, we examined the combined effects of risk alleles of all the examined variants and evaluated their high-order gene-to-gene interactions.

Methods

Study design

This was a hospital-based case-control study, as we have previously reported.17, 18, 19

Study population

All study participants were recruited from Qiqihar city, Heilongjiang province, China. The study was approved by the institutional review board of Qiqihar Medical University, and written informed consent was obtained from each participant at the time of enrollment. This study was performed in agreement with the guidelines outlined in the Declaration of Helsinki.

Blood pressure was measured using a calibrated mercury sphygmomanometer (Baumanometer, W.A. Baum Co, Inc., Copiague, NY, USA) with an appropriate adult cuff size by certified examiners. Hypertension was defined as a mean systolic blood pressure >140 mm Hg or a diastolic blood pressure >90 mm Hg or the current use of antihypertensive drugs. As recommended by Tobin et al.,20 for participants taking antihypertensive medication, blood pressure was determined by adding 15 and 10 mm Hg to the systolic and diastolic blood pressure, respectively.

All study participants were assigned to the hypertensive group or normotensive group based on the results of clinical and laboratory examinations. The hypertensive group consisted of inpatients or outpatients of the Second Affiliated Hospital of Qiqihar Medical University. Hypertensive patients with clinical evidence of secondary hypertension and renal disease were excluded. The normotensive controls, who underwent a medical examination at the same hospital, were clinically confirmed to be free of hypertension and had a negative family history of hypertension in their first-degree relatives. All study participants were genetically unrelated local residents of Han Chinese descent who were recruited consecutively between June 2008 and December 2012.

Sample size

The hypertensive group included 1009 sporadic patients with a mean age of 64.48 (s.d.: 8.53) years. Men comprised 54.31% of this study population. The remaining participants (n=756) whose blood pressure was normal formed the age-, gender- and ethnicity-matched control group.

Measurement

Age, gender, body weight and height were recorded at the time of enrollment. For hypertensive patients, age referred to the age of first onset of hypertension. Body weight was measured in light clothing and bare feet. Body mass index (BMI) was calculated as weight in kilograms divided by height in meters squared. Blood samples were collected after overnight fasting, and the sera were isolated immediately by centrifugation. The plasma levels of triglycerides, total cholesterol, high-density lipoprotein cholesterol, blood urea nitrogen, creatinine and urea acid were determined enzymatically using the available kits and auto-analyzers. High-sensitivity C-reactive protein was measured using a particle-enhanced immunoturbidimetric method.

Genotyping

Genomic DNA was extracted from peripheral blood leukocytes by a standard phenol-chloroform method and stored at −40 °C until required for batch genotyping. The genotypes of the 10 examined variants were determined by the PCR-ligase detection reaction (PCR–LDR) method as previously described.21

PCR reactions were conducted in an EDC-810 Amplifier (Dongsheng Innovation Biotech, Beijing, China). The cycling parameters were as follows: 94 °C for 2 min; 35 cycles of 94 °C for 20 s, 60 °C for 20 s and 72 °C for 20 s; and a final extension step at 72 °C for 3 min.

For each variant, two specific probes were synthesized to discriminate specific bases, with one common probe labeled with 6-carboxy-fluorescein at the 3′ end and phosphorylated at the 5′ end. The multiplex ligation reaction was conducted in a reaction volume of 10 μl containing 2 μl of PCR product, 1 μl of 10 × Taq DNA ligase buffer, 1 μM of each discriminating probe and 5 U of Taq DNA ligase. The ligation parameters were 30 cycles of 94 °C for 30 s and 56 °C for 3 min. After the reaction, 1 μl of LDR reaction product was mixed with 1 μl of ROX passive reference and 1 μl of loading buffer before being denatured at 95 °C for 3 min and chilled rapidly in ice water. The fluorescent products of the LDR were differentiated using an ABI 3730XL sequencer (Applied Biosystems, Foster City, CA, USA).

To test the accuracy of this PCR-LDR method, 96 DNA samples were randomly selected and run in duplicates with 100% concordance.

Statistical analysis

The Student’s t-test was used to test for differences between patients and controls for quantitative variables. The χ2 test was used to assess the goodness-of-fit of the observed allele frequencies with the expected frequencies according to the Hardy–Weinberg equilibrium and the genotype/allele frequencies between the two groups. Statistical power was estimated by the PS (Power and Sample Size Calculations) software (version 3.0, Department of Biostatistics, Vanderbilt University, Nashville, TN, USA).

Given that no definitive previous associations between all examined variants and hypertension risk were available after reviewing the literature, we primarily treated the minor-variant allele at each locus as the ‘risk’ allele. Each genotype of the 10 examined variants was regressed in an unconditional logistic regression model under the assumptions of additive (major homozygotes vs. heterozygotes vs. minor homozygotes), dominant (major homozygotes vs. heterozygotes plus minor homozygotes) and recessive (major homozygotes plus heterozygotes vs. minor homozygotes) models of inheritance after controlling for age, gender and BMI. Risk estimates are expressed as the odds ratio (OR) and its 95% confidence interval (95% CI). If the OR was less than unity for the minor-variant allele, the alternative major allele was reassigned as the risk allele. The best-fitting model among three assumed genetic models was the one with the smallest adjusted P-value. In cases where the genotype count for the homozygous variant genotype was less than five in both patients and controls, only the dominant model with the highest statistical power was considered.

The cumulative effects of multiple variants in the MMP family genes on hypertension risk were assessed by an unfavorable genotype analysis using six variants (rs1144393, rs243865, rs3025058, rs679620, rs2664538 and rs2276109) that exhibited a borderline significant association in a single-locus analysis (P for the best-fitting model <0.10). The unfavorable genotype referred to the genotype carrying the risk allele(s). The best-fitting model was the additive model for rs1144393 (unfavorable genotype: WW and WM), rs243865 (WW and WM) and rs679620 (MM and WM), the dominant model for rs3025058 (WW+WM) and rs2276109 (WW) and the recessive model for rs2664538 (WW+WM), where WW is homozygous wild types, WM is heterozygotes and MM is homozygous variants. In the additive model, an unfavorable genotype carrying one and two risk alleles was assigned a score of 0.5 and 1, respectively, and in the other two models, an unfavorable genotype was assigned a score of 1. The number of unfavorable genotypes was summed for each participant. The statistical analyses described above were completed using the STATA software (version 11.2, StataCorp, College Station, TX, USA) in Windows. A two-tailed P<0.05 was considered statistically significant.

The potential interactions of multiple variants in the MMP family genes were identified and characterized by using an open-source data-mining multifactor dimensionality reduction (MDR) approach (version 3.0, available at the website www.epistasis.org).22, 23 This approach aims to construct all possible combinations of the above six variants. The accuracy of each MDR model is evaluated by a Bayes classifier in the context of a 10-fold cross-validation. A single best model simultaneously has the maximum testing accuracy and cross-validation consistency (a measure of the number of times of 10 divisions of the data set that the best model is extracted). Statistical significance was evaluated using a 1000-fold permutation test to compare observed testing accuracies with those expected under the null hypothesis of null association. The permutation test corrects for multiple testing by repeating the entire analyses on 1000 data sets that are consistent with the null hypothesis.

Results

Baseline characteristics

Demographic and clinical variables are compared between hypertensive patients and normotentive controls in Table 1. The distribution of age (P=0.751) and gender (P=0.843) did not significantly differ between patients and controls. The levels of BMI (P<0.001), systolic and diastolic blood pressure (both P<0.001), fasting glucose (P<0.001), triglycerides (P=0.024), creatinine (P=0.002) and high-sensitivity C-reactive protein were significantly higher, whereas the high-density lipoprotein cholesterol level was significantly lower in patients than in controls (P<0.001 for all). The proportion of patients undergoing antihypertensive treatment was 14.17%.

Table 1 Baseline characteristics of all the study subjects

Single-locus analysis

Table 2 shows the genotype distributions and allele frequencies of the 10 examined variants in the MMP family genes between hypertensive patients and normotensive controls. No deviations from the Hardy–Weinberg equilibrium were observed in normotensive controls for all the 10 variants examined (P>0.05). The genotype and allele distributions of rs243865, rs3025058, rs679620 and rs2276109 differed significantly between patients and controls at the significant level of 0.05, whereas with a Bonferroni corrected α of 0.05/10 (here, 10 is the total number of examined variants), only rs3025058 (P for genotype and allele: <0.001 and <0.001, respectively) and rs679620 (P for genotype and allele: 0.004 and <0.001, respectively) differed significantly between the two groups. The power to reject the null hypothesis of no differences in risk allele frequencies of rs3025058 and rs679620 between the two groups was 98.4% and 91.1%, respectively.

Table 2 Genotype distributions and allele frequencies of the 10 examined variants in the 5 MMP family genes between hypertensive patients and normotensive controls

The risk prediction of the 10 examined variants for hypertension risk under the three genetic models is presented in Table 3. After the Bonferroni’s correction for multiple tests (Bonferroni’s significance threshold P=0.05/10), the odds of having hypertension was significant for rs3025058 under the additive (OR=1.33; 95% CI: 1.16–1.53; P<0.001) and dominant (OR=1.43; 95% CI: 1.18–1.73; P<0.001) models and was significant for rs679620 under the additive (OR=1.27; 95% CI: 1.1–1.46; P<0.001) model after controlling for age, gender and BMI. Risk estimates were marginally significant for rs3025058 under the recessive (OR=1.52; 95% CI: 1.13–2.03; P=0.005) model and for rs2276109 under the additive (OR=0.77; 95% CI: 0.65–0.93; P=0.005) and dominant (OR=0.75; 95% CI: 0.62–0.92; P=0.005) models. No significance was attained for the association of the other variants with hypertension.

Table 3 Risk prediction of the 10 examined variants in the 5 MMP family genes for hypertension risk under three genetic models of inheritance

Cumulative effect analysis

To examine whether the effects of multiple variants in the MMP family genes were additive on hypertension risk, a cumulative analysis of unfavorable genotypes of significant variants was performed (Table 4). Given our sample size, we trichotomized each participant into the low-, medium- and high-risk groups based on the tertile distribution of the total risk score of unfavorable genotypes in controls, and we set participants with a score of 3.5 as the reference group. Overall, there was a trend towards an increased risk of hypertension with increasing numbers of unfavorable genotypes. Compared with the reference group, participants with scores spanning (3.5, 4) and > 4 had ORs that increased to 1.61 (95% CI: 1.25–2.51; P<0.001) and 1.92 (95% CI: 1.54–2.39; P<0.001) after controlling for age, gender and BMI, respectively. Moreover, each additional increase in risk score was significantly associated with a 44% increase in hypertension risk (95% CI: 1.29–1.6; P<0.001) after adjustment.

Table 4 Combined effect analysis by the number of unfavorable genotypes from six significant variants and hypertension riska

Genetic interaction analysis

To further explore the potential interaction of six significant variants, we used the promising data-mining analytical approach MDR. As summarized in Table 5, each best model across all possible combinations is assessed by the testing accuracy, cross-validation consistency and significance level. Out of the six derived MDR models, a three-locus model including rs3025058, rs679620 and rs243865 was chosen as the overall best model with the maximum testing accuracy of 0.6605 and cross-validation consistency of 10. This best model was significant at P=0.0022, suggesting that a model this good or better was observed less than 3 out of the 1000 permutations and thus was unlikely to hinge on the null hypothesis of null association.

Table 5 Summary of MDR analysis on six significant variants in the five MMP family genes

Discussion

In this study, we systematically evaluated the individual and joint effects of the 10 genetic variants in the 5 MMP family genes on the risk of hypertension in a large northeastern Han Chinese population. The most noteworthy finding was that the true association between individual variants and hypertension risk may not be revealed until the combined analyses of multiple variants from genes involving a specific physiological or cellular function are performed. Moreover, we proposed a three-locus model that can best characterize genetic interactions of multiple variants in the MMP family genes. Our results not only highlight the complexity of the joint effects of genetic variation on disease risk but also provide evidence supporting the logical functional role of the MMP family genes in the pathogenesis of hypertension.

It is well known that hypertension is an established independent risk factor for cardiovascular disease, which is characterized by an increase in extracellular matrix content.24 A substantial body of evidence supports a close relationship between high extracellular matrix content and hypertension.5, 25 It is reasonable to expect that the degradation of extracellular matrix proteins by MMPs may contribute to the pathogenesis of cardiovascular disease as well as hypertension. Recently, Schmid-Schonbein6 has written an excellent editorial commentary on the emerging opportunities for MMPs in the optimal management of hypertension and its complications. Therefore, understanding the genetic and/or environmental determinants of MMP activation should be a high priority.

Assessing the association between the MMP family genes and cardiovascular disease has been widely used to identify the regions of the genome and candidate variants that contribute to disease susceptibility. However, these associations have always been debated because of inconsistent reports. Recently, we have undertaken a systematic review of the literature by means of a meta-analysis on the association of the MMP family multiple variants with coronary artery disease, and we found strong evidence of a disease susceptibility for the MMP-3 and MMP-9 genes.12 In the present study, we confirmed the positive association of two functional variants of the MMP-3 gene with hypertension risk in a single-locus analysis, reinforcing an independent leading role of the MMP-3 gene in the development of cardiovascular disease. Moreover, our interaction analysis identified another promoter variant rs243865 in the MMP-2 gene that significantly interacted with the MMP-3 gene in the best prediction of hypertension. It is of interest to note that rs243865 showed no statistical association with hypertension in our single-locus analysis, suggesting that there is evidence of epistasis, that is, the effect of one variant may not be disclosed if the effect of another locus is not considered.26 Although our data failed to detect a relationship between MMP-9 genetic variants and hypertension risk, there was evidence that MMP-2 and MMP-9 levels were inversely correlated with arterial stiffness in healthy participants.27 Furthermore, blockade of MMP-9 activity resulted in an increase in pulse pressure.28 We cannot therefore exclude the possibility that a causal variant exists in the MMP-9 gene that is not recognized by the markers selected in association with hypertension or other specific forms of cardiovascular disease. Fully addressing this possibility will require a more comprehensive genetic approach, such as the selection of tagging variants in most key genes that are involved in a specific physiological or cellular function. This approach would offer more convincing evidence for the relevance of the examined genes in disease risk.

In many cases, genetic association studies are complicated by genetic heterogeneity and insufficient study power. On the one hand, genetic heterogeneity is an inevitable problem in disease identification strategies,29 and it may explain in part the inconsistent associations found across ethnic groups. One potential strategy to decrease genetic heterogeneity is to use homogeneous populations.30 In fact, our study population is characterized by genetic homogeneity and geographic stability. In addition, the subjects are most likely uniform in terms of environmental exposures, including habitual dietary intake of high amount of salt and fat31 and a lower rate of hypertension recognition and treatment. These factors render this population more appropriate for enhancing our understanding of the genetic predisposition to essential hypertension. Conversely, as recommended by Cardon and Bell32, recruiting 1000 individuals in each group is required to yield a firm conclusion. Although our sample size of 1009 hypertensive patients and 756 normotensive controls fails to meet this requirement, an a priori power calculation suggested that we had >90% power to detect the significant genetic variants of realistic effect sizes.

Several limitations should be considered when interpreting our results. First, because this was a retrospective case–control study, our data cannot prove the existence of a cause–effect relationship between examined genetic variants and hypertension.33 Second, this study, which involved 1765 subjects, may be underpowered to demonstrate small risk effects. Third, the fact that only 10 common variants were selected from the promoter or coding regions of the 5 MMP family genes leaves an opening for the incorporation of more functional variants, especially the low-penetrance variants from the MMP family genes. Fourth, data on the types of antihypertensive drugs used and the treatment period were not available, precluding further analyses of drug effects on our findings. Fifth, the MDR method used in this study has some underlying drawbacks, including computational intensiveness, indistinct interpretation, lack of sensitivity and heterogeneity-free assumption.34, 35 Finally, the fact that our study participants were of northeastern Han Chinese descent limits the generalizability of our findings and necessitates further confirmation in other ethnic populations.

In summary, our findings suggest that the true association between individual variants and the risk of hypertension may not be revealed until combined analyses of multiple variants from genes involving a specific physiological or cellular function are performed. Nevertheless, for practical reasons, we hope that this study will serve to encourage more comprehensive evaluation of a larger panel of genetic variants in the MMP family genes in other independent groups, thus furthering our understanding of the genetic underpinnings of hypertension and related cardiovascular diseases.