Introduction

Cervical cancer (CC) is one of the most common causes of cancer mortality for women worldwide. According to the International Agency for Research on Cancer (IARC) (GLOBOCAN 2012), an estimated of 266,000 deaths from CC occurred worldwide in 2012, accounting for 7.5% of all female cancer deaths1. Although infection with high risk human papillomavirus (HR-HPV) has been accepted as the major risk factor of CC, individual genetic heritability also plays a fundamental role in the development of CC2,3,4.

Long noncoding RNAs (lncRNAs) are a kind of non-coding RNAs longer than 200 nucleotides5, 6. The ENCODE project suggests that about 76% of the human genome is transcribed to a series of lncRNAs7. Recently, lncRNAs have gained more and more attention due to their complex and extensive regulatory functions5, 6, 8. LncRNAs functions vitally in a broad range of cellular processes, such as cell growth, survival, migration and invasion9, 10. Deregulated lncRNAs have been reported to be involved in pathogenesis of cancers6, 11. LncRNA HOX transcript antisense intergenic RNA (HOTAIR) locates in the HOXC locus, and involves epigenetic regulation of transcription in a 40 kb region of HOXD10. HOTAIR participates in the development and progression of a variety of malignances including breast cancer10, colorectal cancer12, and gastric cancer13. A number of studies showed that HOTAIR was overexpressed in CC and associated with progression and poor prognosis of CC and could be served as a new biomarker for overall survival14, 15.

Recently, considerable efforts have been made to investigate the effect of genetic variations in the lncRNA genes on the susceptibility of various tumors. Verhaegh et al.16 first reported that the tagging SNP (tagSNP) rs2839698 of H19 was significantly associated with the decreased risk of bladder cancer. Hereafter, a number of lncRNA SNPs, such as rs6434568 in the PCGEM1 and rs11655237 in exon 4 of LINC00673 genes, have been found to confers susceptibility to tumorigenesis17, 18. In our previous studies, we observed that two SNPs, rs7958904 and rs4759314, in the HOTAIR gene were significantly associated with colorectal19 and gastric cancer20, respectively. Given the important role of HOTAIR in carcinogenesis and its dysregulation in CC, we hypothesized that the HOTAIR SNPs could influence the susceptibility of CC. As a result, we identified rs7958904, located in exon 6 of HOTAIR, was significantly associated with increased risk of CC.

Materials and Methods

Ethics statement

The study was approved by the institutional review board of Southeast University. Each subject signed an informed consent. The research protocol was carried out in accordance with the approved guidelines.

Study subjects

There were 1209 cases and 1348 healthy controls enrolled in this study. Among them, 571 cases and 657 controls were recruited from hospitals in Nanjing and Wuxi cities between January 2007 and December 2010, which has been described elsewhere21. The other 638 cases and 691 controls were recruited from Nantong hospitals between January 2009 and December 2014. The patients were all histologically confirmed and those who had metastasized cancers from other origin were excluded. The healthy controls were enrolled from people seeking for health care in the same hospitals. They were genetically unrelated to the cases and had no history of gynecologic tumors.

SNP selection and Genotyping

The overall flow of SNP selection was described in our previous study19. Briefly, the three tagging SNPs (i.e., rs4759314, rs7958904, and rs874945) were picked up using Haploview 4.0 software and the threshold for analysis was set as r 2 > 0.8. The genotypes of all SNPs were determined by TaqMan allelic discrimination methods. The random 10% of samples were repeatedly genotyped and the results were 100% concordant.

Genotype imputation

The Cancer Genome Atlas (TCGA) SNP array genotyping has been done using Affymetrix Genome-wide human SNP array 6.022. Imputation for the three SNPs was performed using IMPUTE2 v2.3.2 with the 1000 Genomes Phase 3 as reference data.

Cell proliferation assay

The assay was performed as described previously19 with a slight modification: approximately 5 × 103 and 2 × 103 transfected HeLa and SiHa cells, respectively, were plated in 96-well plates, and their proliferation rate was assessed using MTT at the time points of 0, 48, 72 h. The absorbance was measured at 552 nm using a spectrophotometer (Bio-Red-680, Bio-Red, USA).

Statistical analyses

Hardy-Weinberg equilibrium (HWE) for each SNP among the controls was tested using a goodness-of-fit χ2 test. The Pearson χ2 test was used to test the differences in the frequency distribution of alleles and genotypes of SNPs. The association between SNPs and risk of CC was evaluated by multivariate unconditional logistic regression analyses with odds ratios (ORs) and 95% confidence intervals (95% CIs). The adjusted covariates included age, parity, and menopausal status. Data with homogenous variances were analyzed by using the t-test or ANOVA with Tukey-HSD multiple comparison test. In case of inequality of variances, data would first be subjected to log2 transformation, and then be handled with t-test or ANOVA if the new resultant variances were equal. Otherwise, the difference would be analyzed with Wilcoxon rank sum test. All analyses were performed using R 3.2.5 and Stata 11.0 (StataCorp LP, College Station, TX) and a P value of less than 0.05 was considered statistically significant.

Results

Association of HOTAIR tagging SNPs and CC susceptibility

The genotype frequencies of 3 tagging SNPs (i.e., rs4759314, rs7958904 and rs874945) among the controls were all in accordance with HWE (P = 0.294, 0.083 and 0.757, respectively). As shown in Table 1, the difference of genotype distribution of rs7958904 between the case and controls was of statistical significance (P = 0.029). The CC genotype was associated with a 1.58-fold increased risk (95% CI = 1.10–2.28; Table 1) of CC as compared with the GG genotype. The GC genotype conferred no significant risk of CC compared with the GG genotype (adjusted OR = 1.01, 95% CI = 0.85–1.21; Table 1). When a recessive model was employed, the CC genotype was associated with a 1.57-fold increased risk (95% CI = 1.10–2.25) of CC as compared with the GG/GC genotypes (Table 2). The difference of the rs4759314 genotype distribution was Quasi significant between the cases and controls (P = 0.071; Table 1). Further studies with large samples are warranted to re-confirm the association of rs4759314 with CC risk. However, no significant association was observed between the rs874945 polymorphism and risk of CC (P = 0.434).

Table 1 Association of HOTAIR SNPs with risk of CC.
Table 2 Stratified analysis of rs7958904 genotypes and CC risk by selected variables.

Stratified analyses of rs7958904 and risk of CC

Further stratified analyses by demographic and clinical variables showed that the association of rs7958904 with CC risk was more prominent among the subgroups of age > 49 years, parity ≤ 1, and having abortion (P = 0.026, OR = 1.70, 95% CI = 1.00–2.88; 0.036, 1.60, 1.05–2.46; and 0.028, 1.73, 1.07–2.80, respectively; Table 2). We also evaluated the relationship between rs7958904 and Stage II of CC, and found that the polymorphism conferred a 2.06-fold increased risk of CC (P = 0.002, OR = 2.06, 95% CI = 1.29–3.31).

Allele-specific effect of rs7958904 on HOTAIR expression in CC tissues

We next investigated the allele-specific effects of SNPs on HOTAIR expression by exploring the TCGA database which contains SNPs and RNA-seq information for multiple tumors including CC. Because rs4759314, rs7958904 and rs874945 were not enrolled in the TCGA SNPs data (Genome-Wide Human SNP Array 6.0), we obtained imputed genotypes of the three SNPs by imputation to the 1000 Genomes Project for the TCGA CESC SNPs data. As shown in Fig. 1, there was higher HOTAIR expression in the tumors with rs7958904 CC genotype than with wild GG genotype (P = 0.046). Although tumors heterozygous for rs7958904 had increased expression of HOTAIR compared with homozygous tumors, the difference was not statistically significant (P = 0.179). In addition, we observed no allele-specific effects of rs4759314 and rs874945 on HOTAIR expression.

Figure 1
figure 1

Genetic effects of SNPs on HOTAIR expression in TCGA database. Their differences were compared by ANOVA with Tukey-HSD multiple comparison test. RSEM, RNA-seq by Expectation- Maximization.

Effects of HOTAIR rs7958904 on cell proliferation

Previously, we identified a growth-inhibiting role of rs7958904 C allele in colorectal cancer LoVo cells19. To determine the underlying mechanism for rs7958904 conferring an increased risk of CC, we also performed a cell proliferation assay to test the effect of rs7958904 on CC cell proliferation. Interestingly, MTT assay showed higher proliferation rate of both HeLa and SiHa cells transfected with rs7958904 C allele than with G allele (Fig. 2), which was opposite to the findings in the LoVo cells.

Figure 2
figure 2

Effects of HOTAIR rs7958904 alleles on CC cell proliferation. Increased cell proliferation was observed in the CC cells transfected with plasmids containing mutant C allele over the wild G allele.

Discussion

In the present study, we performed a case-control study to investigate the association of 3 tagging SNPs with risk of CC, and found that the rs7958904 polymorphism conferred an increased risk of CC. Functional assays showed higher HOTAIR expression in CC tissues with rs7958904 CC genotype than with GG genotype. MTT assay demonstrated a growth-promoting role of rs7958904 C allele on CC cells.

HOTAIR has been found to be deregulated in a variety of tumors, including lung23, colon12, stomach13, ovarian24, and esophageal cancers25 since it was first identified in breast cancer to promote tumor invasiveness and metastasis in 201010, HOTAIR is also found to be overexpressed in CC and predict a poor prognosis of patients14, 15. Jing et al.26 reported that HOTAIR enhanced aggressive biological behaviors and induced radio-resistance via inhibiting p21, suggesting an onco-lncRNA in CC. Accumulating evidence has suggested that genetic variants in the lncRNAs can modulate individual susceptibility to cancer17, 27. Zhang et al.28 first evaluated the relationship between the tagging SNPs (i.e., rs920778, rs1899663 and 4759314) of HOTAIR and risk of esophageal squamous cell carcinoma (ESCC). Thereafter, several studies have investigated the association of HOTAIR SNPs with multiple cancer susceptibility29.

To date, there are two studies on the association of HOTAIR SNPs and risk of CC30, 31. Qiu et al.31 reported that rs920778 was significantly associated with the development and progression of CC. Guo et al.30 evaluated the three tagging SNPs identified by Zhang et al.28 in relation to CC susceptibility, and also found a strong association between rs920778 and risk of CC. In the present study, we evaluated the association of three tagging SNPs (i.e., rs4759314, rs7958904 and 874945) with risk of CC, and found only rs7958904 was significantly associated with risk of CC. We previously identified rs7958904 was not markedly associated with gastric cancer20 but related to a decreased risk of colorectal cancer19. We observed a growth-promoting role of rs7958904 C allele on CC cells, which was opposed to its growth-inhibiting effect in colorectal cancer LoVo cells. The opposite effect of rs7958904 on cancer risk may reflect the complex function of HOTAIR and its genetic variation is complex and depends on cell-type context. Another explanation for rs7958904 in relation to CC susceptibility is that the real functional SNP is rs920778, which is in high LD (r 2 = 1) with rs7958904. Guo et al.30 found rs920778 T allele could enhance intronic enhancer activity and increase HOTAIR expression in CC cells. Further studies on the interaction effect of rs7958904 C allele and rs920778 T allele on biological behavior of CC cells are warranted.

The rs4759314 polymorphism locates in the first intron of HOTAIR. We previously observed a significant association between rs4759314 and risk of gastric cancer20. However, no significant association was observed between rs4759314 and risk of several tumors including breast, colorectal, and esophageal cancer29. The rs874945 polymorphism locates in the 3’ near gene of HOTAIR. In our previous studies, there was no significant association of rs874945 with risk of gastric or colorectal cancer19, 20. Because rs874945 and rs1899663 are in high LD (r 2 = 0.904)20, the association of rs1899663 with cancer susceptibility was also reviewed. No significant association was observed between rs1899663 and risk of cervical, gastric, breast, and esophageal cancer29, 30.

In conclusion, we identified rs7958904 in the exon of HOTAIR significantly was associated with increased risk of CC and could influence CC cell proliferation. Further functional studies on the effect of rs7958904 C allele and its interaction with rs920778 T allele on biological behavior of CC cells are are needed to confirm and extend our findings.