Introduction

Gastric cancer (GC) is a prevalent disease of the digestive system1,2,3. It is the fifth most prevalent cancer (5.7%) and the third leading cause of cancer-related mortality (8.2%)4. Despite declining incidence in some parts of the world, GC remains a crucial challenge because most cases are diagnosed at advanced stages and therefore have a poor prognosis5,6. Thus, reliable biomarkers of GC must be identified for early diagnosis, effective therapy, and prognosis evaluation.

Single nucleotide polymorphisms (SNPs) can profoundly influence gene function and expression and contribute to carcinogenesis. Genome-wide association studies, which scan the whole genome for common genetic variants, have identified over 450 SNPs related to susceptibility to different cancer types7. Only 7% of these loci lie in protein-coding regions, whereas 93% are located in non-coding regions8,9. Non-coding RNAs are major regulators of biological processes including translation, transcription, epigenetic gene expression, splicing, cell cycle, embryogenesis, stem cell pluripotency and reprogramming, and regulation of the immune response10,11. Aberrant expression of long non-coding RNAs (lncRNAs) may contribute to different cancers12,13,14, and various lncRNAs are associated with different cancer types15,16,17,18,19.

HOX transcript antisense RNA (HOTAIR), a well-known oncogenic lncRNA, is highly expressed in GC tissues and has been recognized as a critical prognostic biomarker for major cancers, including GC. HOTAIR inhibition not only reduces tumor invasiveness but also reverses the epithelial-mesenchymal transition (EMT) in GC cells by regulating N-cadherin, E-cadherin, vimentin, and the transcription factor Snail. HOTAIR targets miR-126 to activate multidrug resistance-associated protein 1/phosphatidylinositol 3-kinase (PI3-K)/Akt signaling and thus promotes cisplatin resistance in GC. Specifically, it directly inhibits miR-126, promoting the expression of PI3-K regulatory subunit beta and vascular endothelial growth factor A. Therefore, HOTAIR-targeted therapies may improve the prognosis and survival of patients with GC15,20,21,22.

The HOXA transcript at the distal tip (HOTTIP), transcribed from the 5′ tip of the HOXA cluster, is a cancer-related lncRNA23,24. Recruitment of HOXA13–HOTAIR and HOXA13–HOTTIP to different sites in the promoter of bone morphogenetic protein 7 (BMP7) is critical for the oncogenic fate of human gastric cells24. HOTTIP is significantly overexpressed in GC cell lines, and its down-regulation inhibits cell proliferation, reduces cell invasion and migration, and promotes apoptosis25.

Ardabil Province is a very high-risk area in North-West Iran (age-standardized rates (ASRs), 51.8/100,000 and 24.9/100,000 for males and females, respectively), with one of the highest cardia GC (CGC) rates worldwide. Hence, in a case–control study from Ardabil, we genotyped seven tagSNPs, four in HOTAIR (i.e., rs17720428, rs7958904, rs1899663, and rs4759314) and three in HOTTIP (i.e., rs3807598, rs17501292, and rs1859168), to assess their associations with the risk of GC. In addition, to explore SNP-SNP interactions, we analyzed all possible pairwise combinations of the HOTAIR and HOTTIP SNPs in relation to GC susceptibility.

Results

General characteristics of the study subjects

The GC and control groups each consisted of 300 subjects, of whom 74.7% were males. The average age (mean ± SD (min–max)) was 66.54 ± 10.43 (34–88) and 66.48 ± 9.71 (38–91) years for cases and controls, respectively. Age and gender did not differ significantly between the case and control groups (p = 0.37 and p = 1.00, respectively), indicating that the groups were well matched with respect to these parameters. The genotype distributions of the SNPs in cases and controls were consistent with Hardy–Weinberg equilibrium (Table 1). By anatomic site of tumor origin, 52.5% of GC patients had CGC, 35% had non-cardia GC (NCGC), and 12.5% had both CGC and NCGC. By histopathologic features, 49.7%, 19.7%, and 30.6% of tumors were intestinal-, diffuse-, and indeterminate-type GC, respectively. Moreover, 0.8%, 5.2%, 45.5%, and 48.5% of patients were diagnosed at stages I, II, III, and IV, respectively. The clinical and demographic characteristics of the subjects are presented in Table 2.

Table 1 Exact test for Hardy–Weinberg equilibrium (p-value).
Table 2 Baseline characteristics of total 300 GC patients and 300 cancer-free controls.

The association between HOTAIR/HOTTIP tagSNPs and GC risk

The average call rate for the 600 analyzed samples was 99.84%, showing high call rates and high reproducibility. Three SNPs of HOTAIR (i.e., rs17720428; rs7958904; rs1899663) were associated with an increased risk of GC. It was found that the rs17720428 polymorphism was associated with the risk of GC, assuming allelic, dominant, and log-additive models of inheritance. The findings revealed that the rs17720428 G allele was significantly associated with the increased risk of GC (G vs. T, OR = 1.27, 95% CI = 1.01–1.61; p = 0.04). In the dominant model, subjects carrying the TG + GG genotype of rs17720428, as compared with those carrying the TT genotype, had a significantly higher risk of GC (OR = 1.5, 95% CI: 1.08–2.1; p = 0.01).

The rs7958904 SNP was associated with the risk of GC in allelic, co-dominant, dominant, and log-additive models of inheritance. The rs7958904 C allele was significantly associated with the increased risk of GC (C vs. G, OR = 1.31, 95% CI: 1.04–1.65; p = 0.02). Subjects carrying the CC or GC genotype of rs7958904, as compared with those carrying the GG genotype in the co-dominant model, showed an increased risk of GC (CC vs. GG, OR = 1.54, 95% CI: 1.07–2.22 and GC vs. GG, OR = 1.64, 95% CI: 1.03–2.62; p = 0.04). In the dominant model, subjects carrying the GC + CC genotype of rs7958904 showed an increased risk of GC in comparison with those carrying the GG genotype (OR = 1.57, 95% CI: 1.1–2.22; p = 0.01).

The rs1899663 SNP was associated with the risk of GC, assuming allelic, dominant and log-additive models of inheritance. The findings indicated that the rs1899663 T allele was significantly associated with the increased risk of GC (T vs. G, OR = 1.27, 95% CI: 1.01–1.61; p = 0.04). The GT + TT genotype of rs1899663, in comparison with GG genotype, had a significantly higher risk of GC in the dominant model (OR = 1.5, 95% CI: 1.08–2.08; p = 0.02). No significant associations were observed between the rs4759314 SNP and GC susceptibility (Table 3). No evidence regarding the association between the HOTTIP tagSNPs (i.e., rs3807598, rs17501292, and rs1859168) and GC risk was found in any of the genetic models (p > 0.05; Table 4). The HOTAIR and HOTTIP variants were not associated with any clinicopathologic characteristics (Table 5). Moreover, the frequency of each HOTAIR/HOTTIP tagSNP did not show a significant difference between patients having stage I-II and stage III-IV disease (Table 6).

Table 3 Genotype and allele frequencies of HOTAIR SNPs in cases and controls, and genotype- and allelotype-specific risks.
Table 4 Genotype and allele frequencies of HOTTIP SNPs in cases and controls, and genotype- and allelotype-specific risks.
Table 5 Subgroup analysis of clinical characteristics for the association of SNPs with GC risk.
Table 6 Relationship of clinical stage with HOTAIR/HOTTIP polymorphisms in GC patients.

The association of haplotype in two lncRNA genes with GC risk

According to Table 7, the results of haplotype analysis showed that the G-C-T-A haplotype of HOTAIR rs17720428, rs7958904, rs1899663, and rs4759314, respectively, increased the risk of GC by 1.31-fold (95% CI: 1.03–1.67; p = 0.029). No haplotype of the three HOTTIP tagSNPs was associated with the risk of GC (p > 0.05).

Table 7 Association of HOTAIR/HOTTIP haplotypes with GC risk, calculated using SNPStats.

SNP-SNP interaction models for lncRNA polymorphisms

To explore SNP-SNP interactions, all possible pairwise combinations of the HOTAIR and HOTTIP tagSNPs were analyzed. The interaction of HOTAIR rs17720428 TG with HOTTIP rs1859168 CC potentially increased the risk of GC (OR = 1.76, 95% CI: 1.22–2.54; p = 0.003). In addition, the interaction of HOTAIR rs7958904 with HOTTIP rs1859168 potentially increased the risk of GC (rs7958904 GC-rs1859168 CC, OR = 1.85, 95% CI: 1.25–2.73, p = 0.002; rs7958904 CC-rs1859168 CC, OR = 1.86, 95% CI: 1.14–3.06, p = 0.01). Interestingly, the interaction of HOTAIR rs1899663 with HOTTIP rs1859168 strongly increased the risk of GC (rs1899663 GT-rs1859168 CC, OR = 4.3, 95% CI: 2.75–6.7; rs1899663 TT-rs1859168 CC, OR = 9.37, 95% CI: 5.43–16.18; rs1899663 TT-rs1859168 CA, OR = 6.59, 95% CI: 2.12–20.51; all p-values < 0.001) (Table 8).

Table 8 The two-way interaction of HOTAIR and HOTTIP polymorphism in the risk of GC.
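
As a rough illustration of what "all possible pair combinations" could mean here, the following minimal Python sketch enumerates the SNP pairs and the joint genotype cells for one pair. The text does not state whether only cross-gene pairs or all pairs among the seven tagSNPs were tested, so both readings are shown as assumptions; the function name `genotype_cells` is hypothetical, and the allele labels are taken from the genotypes written in the text.

```python
from itertools import combinations, product

# tagSNP identifiers as listed in the study.
HOTAIR_SNPS = ["rs17720428", "rs7958904", "rs1899663", "rs4759314"]
HOTTIP_SNPS = ["rs3807598", "rs17501292", "rs1859168"]

# Reading 1 (assumption): only cross-gene pairs, one SNP from each lncRNA.
cross_gene_pairs = list(product(HOTAIR_SNPS, HOTTIP_SNPS))

# Reading 2 (assumption): every unordered pair among all seven tagSNPs.
all_pairs = list(combinations(HOTAIR_SNPS + HOTTIP_SNPS, 2))


def genotype_cells(alleles_a, alleles_b):
    """Return the 3 x 3 joint genotype labels for two biallelic SNPs.

    Each argument is a (first_allele, second_allele) tuple; the labels follow
    the order used in the text, e.g. ("G", "T") gives GG, GT, TT.
    """
    def genotypes(first, second):
        return [first + first, first + second, second + second]

    return [(ga, gb)
            for ga in genotypes(*alleles_a)
            for gb in genotypes(*alleles_b)]


# Example: HOTAIR rs1899663 (G/T) with HOTTIP rs1859168 (C/A).
cells = genotype_cells(("G", "T"), ("C", "A"))
print(len(cross_gene_pairs), len(all_pairs), len(cells))  # 12 21 9
```

Each joint genotype cell (for example rs1899663 TT with rs1859168 CA) would then be compared against a reference cell to obtain an odds ratio, which is how the entries reported above are labeled.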

The potential impact of each SNP on the establishment or destruction of the miRNA binding site

Bioinformatic analysis showed that the HOTAIR rs17720428/rs7958904 and HOTTIP rs17501292 tagSNPs cause miRNA target gain and loss. Moreover, the HOTTIP rs1859168 polymorphism could lead to miRNA target gain while the rs3807598 polymorphism could lead to miRNA target loss. For the HOTAIR rs1899663 and rs4759314 tagSNPs, no miRNA target gain or loss was recognized (Table 9).

Table 9 The potential impact of each SNP on the establishment or destruction of the miRNA binding site.

Discussion

Evidence indicates that the aberrant expression of lncRNAs may contribute to various malignancies26,27. Moreover, polymorphisms in lncRNAs may influence their expression and confer GC susceptibility28,29. SNPs in lncRNAs may affect different biological processes by altering biological pathways. Studies have confirmed the roles of lncRNAs as critical regulators of tumorigenesis30. The current study explored whether the tagSNPs of HOTAIR (i.e., rs17720428, rs7958904, rs1899663, and rs4759314) and HOTTIP (i.e., rs3807598, rs17501292, and rs1859168) affect GC development. The G allele and TG + GG genotype of rs17720428 in HOTAIR significantly increased the risk of GC (G vs. T, OR = 1.27; TG + GG vs. TT, OR = 1.5, respectively). We also showed that the T allele and GT + TT genotype of rs1899663 in HOTAIR were associated with a higher GC risk (T vs. G, OR = 1.27; GT + TT vs. GG, OR = 1.5).

The C allele of rs7958904 in HOTAIR was associated with an increased risk of GC (C vs. G, OR = 1.31). Subjects carrying the CC or GC genotype of rs7958904 had a considerably increased risk of GC compared with those carrying the GG genotype (CC vs. GG, OR = 1.54; GC vs. GG, OR = 1.64). In addition, subjects carrying the GC + CC genotype of rs7958904 had a significantly increased risk of GC compared with individuals carrying the GG genotype (OR = 1.57). It has been shown that the HOTAIR rs7958904 CC genotype is associated with a higher cervical cancer risk in comparison with the GG/GC genotypes (OR = 1.57). Data from the TCGA database revealed that cervical cancer tissues with the rs7958904 CC genotype had higher HOTAIR expression than those with the GG genotype. Hence, HOTAIR rs7958904 may affect cervical cancer susceptibility by modulating cervical cancer cell proliferation31. Genetic and environmental factors may act additively with SNPs, and understanding gene–gene and gene–environment interactions is a prerequisite for highly effective prevention.

Du et al. demonstrated that the HOTAIR SNP rs4759314 was significantly associated with an increased risk of GC (OR = 1.39). The HOXC11 and HOTAIR expression levels in subjects with the AG genotype were much higher than in those with the AA genotype. In the same vein, the promoter activity of the G allele was higher than that of the A allele29. Finally, a meta-analysis by Tao et al. showed that the HOTAIR rs4759314 polymorphism may play a role in GC susceptibility32. However, all of the included studies were conducted in Chinese populations and therefore could not give an overview of its status in other populations. In contrast, we did not find any significant association between the HOTAIR rs4759314 SNP and GC susceptibility. This may indicate that some HOTAIR risk SNPs are ancestry-specific; however, this is only a hypothesis and needs to be tested by studying this SNP in other types of cancer in Ardabil and in different ethnic population groups with GC.

Only one HOTAIR haplotype (G-C-T-A) was associated with the risk of GC (OR = 1.31). Studies have shown that different HOTAIR variants (e.g., rs920778, rs7958904, and rs874945) are associated with different cancers, including GC, colorectal cancer, breast cancer, and esophageal cancer33. Knockdown of HOTAIR can inhibit GC cell growth, alter cell cycle distribution, and increase P21 and P53 protein levels15.

HOTTIP knockdown in GC cells hindered cell proliferation, invasion, and migration. Additionally, HOTTIP down-regulation reduced the expression of homeobox protein Hox-A13 (HOXA13) in GC cell lines. HOXA13 affected the HOTTIP-induced malignant phenotypes of GC cells. Both HOXA13 and HOTTIP were up-regulated in GC tissues compared with adjacent noncancerous tissues25. In the present study, none of the HOTTIP SNPs (i.e., rs3807598, rs17501292, and rs1859168) was associated with the risk of GC. In contrast, Hu et al. showed that HOTTIP rs1859168 A > C was associated with a notably decreased risk of pancreatic cancer (PC) (CC vs. AA: OR = 0.71). The C allele of HOTTIP rs1859168 significantly reduced the relative luciferase activity in comparison with the A allele in three PC cell lines. Therefore, the functional rs1859168 A > C polymorphism could reduce the risk of PC by downregulating HOTTIP expression34. The discrepancy between the two studies suggests that some HOTTIP risk SNPs may be tissue-specific. However, further studies in different cancer cell lines are required to confirm this hypothesis. In hepatocellular carcinoma (HCC) patients, HOTTIP rs2071265 was related to earlier recurrence. HOTTIP suppression in liver cancer cell lines decreased the rate of cell invasion and increased chemosensitivity35. The interaction of the HOTTIP rs17501292 and MALAT1 rs619586 polymorphisms was associated with a decreased risk of HCC (OR = 0.3)33.

In the present study, although none of the HOTTIP SNPs increased the risk of GC, the SNP-SNP interactions of HOTAIR with HOTTIP were strongly associated with risk of GC. The SNP-SNP interaction of HOTAIR rs17720428 TG with HOTTIP rs1859168 CC increased the risk of GC (OR = 1.76). In addition, the SNP-SNP interaction of HOTAIR rs7958904 with HOTTIP rs1859168 increased the risk of GC (rs7958904 GC-rs1859168 CC, OR = 1.85; rs7958904 CC-rs1859168 CC, OR = 1.86). Interestingly, the SNP-SNP interaction of HOTAIR rs1899663 with HOTTIP rs1859168 strongly increased the risk of GC (rs1899663 GT-rs1859168 CC, OR = 4.3; rs1899663 TT-rs1859168 CC, OR = 9.37; rs1899663 TT-rs1859168 CA, OR = 6.59). To verify the findings and validate the results, further studies in diverse ethnicities and functional analysis are required.

In our research, the stratified analysis of the association of the HOTAIR and HOTTIP tagSNPs with clinicopathologic characteristics (such as tumor origin and intestinal-, diffuse-, or indeterminate-type GC) revealed no significant association in any subgroup. An important problem in GC is that most patients are diagnosed at an advanced stage36. In the present study, which was confined to Ardabil (a very high-risk area for GC in Northwestern Iran), 0.8%, 5.2%, 45.5%, and 48.5% of patients were diagnosed at stages I, II, III, and IV, respectively. The frequency of each HOTAIR or HOTTIP tagSNP did not differ significantly between patients with stage I-II and stage III-IV disease. This may be explained by the fact that almost all the patients recruited in the study (94%) were at an advanced stage (III-IV), with a poor prognosis.

The reciprocal influence of lncRNAs and microRNAs on each other's function is emerging as a factor that shapes gene expression programs. LncRNA tagSNPs can create or destroy miRNA binding site(s) on the lncRNA. Some lncRNAs act as molecular decoys or sponges of microRNAs, sequestering them and thereby favoring the expression of otherwise suppressed target mRNAs. Other lncRNAs compete with miRNAs for interaction with shared target mRNAs, causing derepression of gene expression. LncRNAs can also be precursors of miRNAs that silence target mRNAs. In contrast, little is known about the influence of microRNAs on lncRNA function, although they can target lncRNAs for degradation37,38. Here, using bioinformatic analysis, we showed that the HOTAIR rs17720428/rs7958904 and HOTTIP rs17501292/rs1859168/rs3807598 tagSNPs could lead to miRNA target gain and/or loss. However, for the HOTAIR rs1899663 and rs4759314 tagSNPs, no miRNA target gain or loss was recognized.

For a small number of the miRNAs listed in Table 9, the functional role in cancer, although not necessarily in GC, has recently been characterized in some detail; these include miR-615-3p, miR-874-5p, miR-506-5p, miR-769-3p, miR-1252-5p, and especially miR-216a-5p. For example, miR-615-3p can promote the EMT and metastasis of breast cancer by targeting the protein interacting with C kinase 1 (PICK1)/TGFBRI axis39. MicroRNA-874-mediated inhibition of the major G1/S phase cyclin, cyclin E1 (CCNE1), does not occur in osteosarcomas. MiR-874 also inhibits tumor metastasis in hepatocellular carcinoma by targeting the δ opioid receptor (DOR)/epidermal growth factor receptor (EGFR)/extracellular signal-regulated kinase (ERK) pathway40,41. MiR-506 inhibits the proliferation and invasion of (i) colorectal cancer by targeting ubiquitin-like with plant homeodomain and RING finger domains 1 (UHRF1) via the KISS1/PI3K/NF-kB signaling axis and (ii) nasopharyngeal carcinoma by targeting Forkhead box Q1 (FOXQ1), and it is also epigenetically silenced in pancreatic cancer42,43,44. During reoxygenation, miR-769-3p down-regulates N-myc downstream-regulated gene 1 (NDRG1) and enhances apoptosis45. By targeting miR-1252-5p, the lncRNA AL161431.1 can facilitate cellular proliferation and migration via MAPK signaling in endometrial carcinoma46.

The function of miR-216a-5p, which acts as a tumor suppressor, has also been studied in depth in various cancers. It inhibits cell proliferation and metastasis by targeting the Janus kinase 2 (JAK2)/signal transducer and activator of transcription 3 (STAT3)-mediated EMT process in GC and by targeting p21-activated protein kinase 2 (PAK2) in breast cancer. It also inhibits cell proliferation and induces apoptosis by targeting tectonic family member 1 (TCTN1) in esophageal squamous cell carcinoma. Moreover, low expression of miR-216a results in the upregulation of tetraspanin 1 (TSPAN1), which contributes to pancreatic cancer progression via transcriptional regulation of integrin alpha 2 (ITGA2)47,48,49,50.

MiR-615-3p and miR-1252-5p, which have lost their potential binding sites on HOTAIR (due to the rs7958904 polymorphism) and HOTTIP (due to the rs17501292 polymorphism), respectively, are thought to be oncogenic, whereas the other four molecules (miR-874-5p, miR-506-5p, miR-769-3p, and miR-216a-5p) play the role of tumor suppressors. MiR-615-3p and miR-1252-5p may retain their oncogenic effect because of the loss of their binding sites on HOTAIR and HOTTIP, respectively; however, the possible mechanism(s) is unknown and requires functional studies. MiR-769-3p has a potential binding site on HOTAIR due to the rs17720428 polymorphism, which was associated with GC in the present study. Interestingly, all three of miR-874-5p, miR-506-5p, and miR-216a-5p have a possible binding site on HOTTIP due to the rs1859168 polymorphism. In the present study, the SNP-SNP interaction of HOTAIR rs1899663 with HOTTIP rs1859168 was strongly associated with GC, which may be due to the destruction of these molecules, which are thought to function as tumor suppressors. However, functional studies are needed to determine whether this binding actually occurs and what role the binding of these molecules to HOTTIP plays in the progression to GC. These studies should be performed in the presence of the HOTTIP rs1859168 tagSNP while controlling for the presence of the HOTAIR rs1899663 polymorphism.

Taken together, we showed that the HOTAIR rs17720428, rs7958904, and rs1899663 tagSNPs and their interactions with the HOTTIP rs1859168 polymorphism were significantly associated with GC risk. Specifically, novel SNP-SNP interactions between HOTAIR and HOTTIP tagSNPs have a larger impact on GC risk than individual SNP effects, thereby providing valuable information for revealing potential biological mechanisms of GC development.

Materials and methods

Study subjects

A hospital-based case–control study was conducted from October 2017 to February 2019. A total of 300 cases were selected from patients undergoing endoscopic examination at the Imam Khomeini Hospital in Ardabil. One control was sought for each case, frequency matched to the case group by 5-year age groups and gender. The controls were randomly selected from subjects who received routine physical examinations in the same hospital and had no self-reported history of cancer at any site. Gastroduodenal disease was diagnosed on the basis of histopathologic and endoscopic results. GC diagnoses were categorized by anatomic sub-site based on the International Classification of Diseases, 10th Revision (ICD-10) as cardia (ICD-10 code C16.0) and non-cardia (ICD-10 codes C16.1–C16.9, including unspecified and overlapping subsites)51. According to the Lauren classification, histologic subtypes were assessed as diffuse-type, intestinal-type, and other/unspecified histologies52. Finally, tumors were staged using the AJCC 8th edition TNM staging system for GC, which has shown improved efficiency in GC prognosis53. The study was conducted on the basis of the ethical principles of human research expressed in the 1975 Declaration of Helsinki. All participants signed an informed consent form. This study was approved by the Ethics Committee of the National Institute for Medical Research Development (NIMAD; IR.NIMAD.REC.1396.097).

SNP selection and genotyping

Genetic polymorphism data for the entire sequence of the lncRNAs were obtained from the dbSNP database (https://www.ncbi.nlm.nih.gov/projects/SNP/). The lncRNA HOTTIP gene sequences were downloaded from the 1000 Genomes Browser (https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/) after extending the gene by 2 kb of upstream and downstream flanking sequence. The selection criteria were: (i) linkage disequilibrium (LD) r² lower than 0.8, (ii) minor allele frequency (MAF) higher than 0.05, and (iii) a Hardy–Weinberg equilibrium (HWE) p-value higher than 0.05. Seven eligible tagSNPs were chosen and included in the final analysis: four SNPs for HOTAIR (i.e., rs17720428, rs7958904, rs1899663, and rs4759314) and three SNPs for HOTTIP (i.e., rs3807598, rs17501292, and rs1859168). Venous blood samples were taken from each participant into an ethylenediaminetetraacetic acid (EDTA)-containing tube and stored at -80 °C. Using the QIAamp DNA blood mini kit (QIAGEN, Germany), genomic DNA was extracted from 200 µL peripheral blood samples as previously described54. All samples were genotyped on the Infinium HTS platform according to the standard protocol (https://www.illumina.com/Documents/products/workflows/workflow_infinium_ii.pdf) with a customized Illumina Infinium GSA BeadChip, a robust, high-quality assay. This SNP microarray uses known nucleotide sequences as probes to hybridize with the tested DNA sequences, allowing qualitative and quantitative SNP analysis. Data quality control was performed using Genome Studio. The call rate cut-off was set at 98% because it is an off-the-shelf array.
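
The three selection criteria above can be illustrated with a minimal Python sketch. It is not the pipeline used in the study, which is not described at this level of detail: the function names, the genotype-count inputs, and the `max_r2_with_selected` argument (an LD value assumed to be computed elsewhere) are all hypothetical. The HWE p-value is computed with a textbook exact test that conditions on the observed allele counts and sums the probabilities of all heterozygote counts no more likely than the observed one.

```python
from math import exp, lgamma, log


def minor_allele_frequency(n_aa, n_ab, n_bb):
    """Minor allele frequency from genotype counts (AA, AB, BB)."""
    total_alleles = 2 * (n_aa + n_ab + n_bb)
    freq_a = (2 * n_aa + n_ab) / total_alleles
    return min(freq_a, 1.0 - freq_a)


def hwe_exact_p(n_aa, n_ab, n_bb):
    """Exact HWE test p-value, conditioning on the observed allele counts."""
    n = n_aa + n_ab + n_bb
    n_a = 2 * n_aa + n_ab  # copies of allele A
    n_b = 2 * n_bb + n_ab  # copies of allele B

    def log_prob(het):
        # log P(het heterozygotes | n individuals, n_a copies of allele A)
        hom_a = (n_a - het) // 2
        hom_b = (n_b - het) // 2
        return (het * log(2.0) + lgamma(n + 1)
                - lgamma(hom_a + 1) - lgamma(het + 1) - lgamma(hom_b + 1)
                + lgamma(n_a + 1) + lgamma(n_b + 1) - lgamma(2 * n + 1))

    # The heterozygote count must have the same parity as the allele count.
    probs = {het: exp(log_prob(het))
             for het in range(n_a % 2, min(n_a, n_b) + 1, 2)}
    p_obs = probs[n_ab]
    # Small relative tolerance so ties with the observed table are included.
    return min(1.0, sum(p for p in probs.values() if p <= p_obs * (1 + 1e-9)))


def passes_tag_snp_filters(n_aa, n_ab, n_bb, max_r2_with_selected):
    """Apply the thresholds stated in the text: r2 < 0.8, MAF > 0.05, HWE p > 0.05."""
    return (max_r2_with_selected < 0.8
            and minor_allele_frequency(n_aa, n_ab, n_bb) > 0.05
            and hwe_exact_p(n_aa, n_ab, n_bb) > 0.05)


# Hypothetical genotype counts, for illustration only (not study data).
print(passes_tag_snp_filters(25, 50, 25, max_r2_with_selected=0.3))
```

Computing pairwise LD r² itself requires phased or estimated haplotype frequencies, so it is left as an input here rather than derived from genotype counts.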

Statistical and bioinformatic analysis

Genotyping results for each SNP were evaluated for significant departure from Hardy–Weinberg equilibrium. Differences in the frequency distributions of genotypes and demographic characteristics (categorical variables) were assessed using the Pearson chi-square test or Fisher’s exact test. The strength of association was estimated using odds ratios (ORs) and 95% confidence intervals (CIs). All genetic models of inheritance, including the dominant, recessive, co-dominant, over-dominant, and log-additive models, were evaluated for the seven SNPs. Each model makes different assumptions about the genetic effect. Haplotype frequencies for HOTAIR and HOTTIP were estimated with SNPStats (https://www.snpstats.net/start.htm) using the expectation-maximization algorithm. The pairwise lncRNA SNP-SNP interactions were calculated. Statistical analyses were performed with SPSS version 19.0 (IBM, Chicago, USA). The correlations between each genetic variant and the clinical features of GC were investigated. All statistical tests were two-sided, and p < 0.05 was considered statistically significant. The potential impact of each SNP on the establishment or destruction of miRNA binding sites was analyzed using the lncRNASNP2 database55.
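
To make the genetic models concrete, the following Python sketch shows one common way to collapse genotype counts into a 2 x 2 table under a given model and to compute an unadjusted odds ratio with a Wald 95% confidence interval. It is an illustrative assumption, not the study's analysis, which was run in SNPStats and SPSS: the function names and the genotype counts are hypothetical, the co-dominant model (each non-reference genotype compared separately with the reference homozygote) is not collapsed here, and the log-additive model, which requires logistic regression, is omitted.

```python
from math import exp, log, sqrt


def odds_ratio_ci(exposed_cases, unexposed_cases,
                  exposed_controls, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval (z = 1.96 for 95%)."""
    odds_ratio = ((exposed_cases * unexposed_controls)
                  / (unexposed_cases * exposed_controls))
    # Standard error of log(OR) from the four cell counts.
    se = sqrt(1 / exposed_cases + 1 / unexposed_cases
              + 1 / exposed_controls + 1 / unexposed_controls)
    return odds_ratio, exp(log(odds_ratio) - z * se), exp(log(odds_ratio) + z * se)


def collapse(counts, model):
    """Map (ref_hom, het, alt_hom) counts to (exposed, unexposed) under a model."""
    ref_hom, het, alt_hom = counts
    if model == "dominant":      # het + alt_hom vs. ref_hom
        return het + alt_hom, ref_hom
    if model == "recessive":     # alt_hom vs. ref_hom + het
        return alt_hom, ref_hom + het
    if model == "overdominant":  # het vs. both homozygotes
        return het, ref_hom + alt_hom
    if model == "allelic":       # counts alleles rather than subjects
        return het + 2 * alt_hom, 2 * ref_hom + het
    raise ValueError(f"unsupported model: {model}")


def model_odds_ratio(case_counts, control_counts, model):
    """Odds ratio and Wald CI for cases vs. controls under the given model."""
    exposed_cases, unexposed_cases = collapse(case_counts, model)
    exposed_controls, unexposed_controls = collapse(control_counts, model)
    return odds_ratio_ci(exposed_cases, unexposed_cases,
                         exposed_controls, unexposed_controls)


# Hypothetical genotype counts (ref_hom, het, alt_hom); not data from this study.
cases, controls = (100, 150, 50), (130, 130, 40)
for model in ("allelic", "dominant", "recessive", "overdominant"):
    odds_ratio, lower, upper = model_odds_ratio(cases, controls, model)
    print(f"{model}: OR = {odds_ratio:.2f}, 95% CI: {lower:.2f}-{upper:.2f}")
```

An interval that excludes 1 corresponds to a two-sided Wald test that is significant at roughly the 0.05 level, which is how the ORs and CIs in the Results can be read.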