Introduction

Breast cancer is the most common diagnosed cancer and the leading cause of cancer death in Chinese women1. It alone is expected to account for fifteen percent of all new cancers in women with about 4292,000 newly diagnosed invasive cancer cases in 2015 in China2.

Noncoding RNAs (ncRNAs) which function by means other than directing the production of proteins were a distinguishing feature of metazoan genomes3. Numerous studies have underlined the regulatory role of microRNAs (miRNAs) in the development of cancers, and their variants are reported to be related to various cancer4,5,6. LncRNAs ranging from 200 nucleotides (nt) to over 10 kb are spliced, polyadenylated, and are roughly as diverse in a given cell type as protein-coding mRNAs7,8,9,10. In addition to the small regulatory RNAs, emerging studies indicate that lncRNAs play critical roles in various biological processes ranging from embryonic development to human diseases, including controlling cell cycle progression, apoptosis, invasion, and migration11. The aberrant expressions of several lncRNAs in various cancers indicate that lncRNAs may be play roles in tumor carcinogenesis12,13,14. Moreover, recent studies have shownthe important roles of lncRNAs and their genetic variants played in tumor carcinogenesis. Yan et al. found that TC genotype of rs10463297 in lncRNA SRA could increase BC risk compared with CC genotype15. Reactivation of the H19 gene has been observed in bladder tumors, and TC genotype of rs2839698 in H19 was found to decrease bladder cancer risk16. Yao et al. suggested that individuals with rs7958904 CC genotype in HOTAIR had asignificantlydecreasedriskofcolorectalcancer17.

Large intergenic noncoding RNA regulator of reprogramming (lincRNA-ROR, Linc-ROR), at 2.6 kb in length, was first identified as a regulator for reprogramming of differentiated cells to induced pluripotent stem cells (iPSCs)18. Linc-ROR also functions as a microRNA sponge to prevent the core transcription factors (TFs) Oct4, Nanog, and Sox2 from miRNA-mediated suppression in self-renewing human embryonic stem cells (ESCs)19 and studies indicated that Linc-ROR could act as a p53 repressor in response to DNAdamage20. All the evidence intrigued us to speculate that Linc-ROR might also have a role in cancer progression and we found several studies have focused the role of Linc-ROR in the development of cancers, including breast cancer21,22,23,24.

Reproductive factors are closely related to the development of breast cancer and the research on interaction between reproductive factors and susceptibility gene has been done25. However, up to now, research combining the SNPs in Linc-ROR and reproductive factors on BC risk has not been reported. In our current case-control study, we selected four tagger SNPs (rs6420545, rs4801078, rs1942348 and rs9636089) in Linc-ROR by bioinformatics method and explored a potential correlation between those potentially functional polymorphisms and risk of BC in Chinese women. Furthermore, analysis of qRT - PCR was applied to detect the relative expression level of Linc-ROR in plasma and the interactions of gene reproductive factors were analyzed using MDR method.

Results

Characteristics of patients and controls

The demographic and clinical characteristics of 968 subjects were displayed in Table 1. The mean age of patients and matched controls were 48.41 ± 10.21 and 49.23 ± 10.15 respectively (P = 0.21). The mean age at menarche of BC patients (14.48 ± 1.82) was significantly higher than mean age at menarche of controls (14.08 ± 1.78). More than 4 pregnancies for women increased the risk of BC (OR: 2.01, 95%CI: 1.32–3.06). However, there is no difference in the distributions of the other characteristics.

Table 1 Thebaseline characteristics of total 484 BC patients and 484 cancer-free controls.

The SNPs genotypes were associated with BC

Table 2 showed the distributions of genotypes in BC patients and control group for four selected SNPs. All of SNPs genotypes in control group meet Hardy-Weinberg equilibrium (P = 0.99 for rs1942348, 0.77 for rs4801078, 0.94 for rs6420545 and 0.99 for rs9636089). TT (OR: 1.79; 95%CI: 1.20–2.68) genotype of rs4801078in Linc-ROR increased BC risk incodominant model. Also, TT (OR: 1.84, 95%CI: 1.28–2.66) genotype of rs4801078 in Linc-ROR sho wed increased risk of BC in recessive model.

Table 2 The association between four SNPs genotypes and risk of breast cancer.

Stratified analysis

We evaluated the relationship of SNPs genotypes and BC risk stratified by the reproductive factors (Table 3). The positive effect of rs6420545 CT + TT genotype was more significant in the subjects (P = 0.005) with age at menarche ≤13 (OR: 0.95, 95%CI: 0.92–0.99) and rs19 CT + TT genotype were more evident in the subjects (P = 0.045) with age >50 (OR: 0.60, 95%CI: 0.37–0.99). The protective role of CT + CC genotypes for rs9636089 were more obvious in the women (P = 0.021) with age > 50 (OR: 0.57, 95%CI: 0.35–0.92).In addition, the CT + TT genotypes of rs4801078revealed a significant higher risk of BC in the participant with age >50 (OR: 2.14, 95%CI: 1.33–3.45) and age of menarche >13 (OR: 1.52, 95%CI: 1.06–2.19).

Table 3 Stratification analysis of the fiveSNPs and BC susceptibility.

The association of receptor status and the four SNPs genotypes

The association between SNPs in Linc-ROR and the receptors status in cases were displayed in Table 4. After adjusted for reproductive factors, only a boundary significant association between rs6420545 CT (OR: 0.66; 95%CI: 0.99–2.76) genotype with PR status of patients was detected.

Table 4 The Associations between four SNPs and ER, PR and HER-2 Status of Breast Cancer Patients.

Haplotype analysis

The Haplotype analysis for polymorphisms in Linc-ROR was showed in Table 5.Four haplotypes were showed in the table (a total of 16 haplotypes) and all those frequencies < 0.03 in cases or controls has been dropped in this analysis. Trs6420545Trs4801078Trs1942348Trs9636089 was the most frequent haplotype in the cases (38.5%) and controls (35.3%). Compared with the controls, the frequency of haplotype Trs6420545Crs4801078Trs1942348Trs9636089 was lower in cases (OR: 0.72, 95%CI: 0.54–0.97).

Table 5 Haplotype analysis of four SNPs in Linc-ROR.

The analysis of Gene reproductive factors interactions

The interactions of gene and reproductive factors among four SNPs (rs6420545, rs4801078, rs1942348 and rs9636089) and reproductive factors were revealed in Table 6. Four models in MDR were showed and we found the interaction of rs4801078, number of pregnancy and menopausal status (Trs4801078, number of pregnancy ≥2 and post-menopausal) might increase risk for breast cancer by 2.78 times (OR: 2.78; 95%CI: 2.74–3.61).

Table 6 Interaction results between the SNPs and reproductive factors by MDR.

The results of Benjamini-Hochberg (BH) correction for false discovery rate (FDR)

We applied the BH correction to control FDR (Table 7). The q-value indicated TT genotype and T allele of rs4801078in Linc-ROR could still increase BC risk. Besides, the relationship between CT + TT genotypes of rs4801078and BC was still noteworthy in the participant with age >50.

Table 7 Results of Benjamini-Hochberg (BH) correction.

The relative expression of Linc-ROR in plasma

We investigated the association between rs4801078 genotypes and Linc-ROR mRNA expression level by the real-time PCR amplification reactions in 150 subjects. There were 38 subjects with CC genotype, 64 with CT genotype and 48 with TT genotype (Fig. 1). The relative expression of Linc-ROR mRNA in CT + TT group (1.94 ± 0.27) was significantly higher than the CC group (1.21 ± 0.19) and that indicated the SNPs in Linc-ROR might play a role in the expression level of Linc-ROR mRNA.

Figure 1
figure 1

The relative expression of linc-ROR in plasma Note: linc-ROR expression was assessed by qRT-PCR in plasma. Data was evaluated statistically usingthe one-way ANOVA with the Tukey method and represent the mean ± SD from the experiments in triplicate.

Discussion

This case-control study was the first to demonstrate the association between potential regulatory variants in Linc-ROR and BC risk, and we found TT (OR: 1.79; 95%CI: 1.195–2.68) genotype of rs4801078 in Linc-ROR had a significant association with the higher risk of BC in Chinese population. Furthermore, analysis of qRT-PCR showed the relative expression of Linc-ROR mRNA in rs4801078CT + TT group (1.94 ± 0.27) was higher than the CC group (1.21 ± 0.19). In addition, we found the frequency of haplotype Trs6420545Crs4801078Trs1942348Trs9636089was lower in cases than in controls (OR: 0.72, 95%CI: 0.54–0.97) and the interactions of rs4801078, number of pregnancy and menopausal status might increase BC risk (OR: 2.78; 95%CI: 2.74–3.61).

LncRNAs have received widespread attention and are observed to play pivotal roles in tumorigenesis and progression of human cancers. It has already been revealed that some lncRNAs, such as HOTAIR, H19 and MALAT1, are potential biomarkers in cancer diagnosis and prognosis. Among them, Linc-ROR, first discovered in 2010, are also found to have strong association with tumorigenesis, metastasis and poor therapeutic response of malignant tumors26,27,28. Recent studies found that Linc-ROR was upregulated in pancreatic cancer tissues and decreased Linc-ROR expression could inhibit pancreatic cancer cell proliferation, invasion, and tumourigenicity22,29,30. In another study, the researchers found Linc-ROR confers gemcitabine resistance to pancreatic cancer cells at least partly via inducing autophagy31. Zhou et al. found that Linc-ROR had an important role during endometrial carcinogenesis by acting as a miR-145 “sponge” to inhibit mediation of the differentiation of endometrial tumorspheres32. A recent study suggested the function of Linc-ROR exerted in LAD cells depended on the sponging of miR-145 and it led to the chemotherapy resistance and EMT phenotypes of docetaxel-resistant LAD cells33. The qRT-PCR showed a significant up-regulation of Linc-ROR and its variants 2 (P = 0.025) and 4 (P = 0.0002) in esophageal squamous cell carcinoma34. Li et al. found that Linc-ROR was significantly upregulated in nasopharyngeal carcinoma tissues and the enrichment of Linc-ROR played acritical functional role in chemoresistance by suppressing P53 signal pathway35. A recent study also provided several new mechanistic insights into acquired chemoresistance in HCC and they found Linc-ROR acting as mediators are involved in modulation of cellular responses to chemotherapy36. However, the role of Linc-ROR in glioma is the opposite of other tumors. Feng et al. suggested that Linc-ROR might act as a novel tumor suppressor gene in glioma by inhibiting the proliferation of cancer cell, self-renewal of GSCs and the KLF4 expression37.

In addition to the mentioned malignant tumor, the role of Linc-ROR in breast cancer has also been reported. In 2014, Hou et al. found that Linc-ROR could function as an important regulator of epithelial-to-mesenchymal transition and promote breast cancer progression and metastasis through regulation of miRNAs21. In 2016, the study of Chen et al. investigated the role of Linc-ROR in the chemotherapy resistance of human BC cells and its mechanism23. The effect of the Linc-ROR on epithelial-to-mesenchymal transition was verified to contribute to the chemotherapy resistance and invasion of breast cancer cells23. In the same year, One study concluded that Linc-ROR suppressed gemcitabine-induced autophagy and apoptosis in breast cancer cells by silencing miR-34a expression38. Recently, Zhang et al. found down-regulated Linc-ROR could enhance the sensibility of breast cancer cells to tamoxifen by increasing miR-205 expression and suppressing the expressions of ZEB1 and ZEB239. Zhao et al. suggested the expression levels of Linc-ROR were significantly higher in breast cancer and combination of the Linc-ROR with the conventional biomarkers might produce better diagnostic ability40. Together, these results indicate that Linc-ROR might have a crucial impact on the development of BC and it is necessary to investigate the association between regulatory variants in Linc-ROR and BC.

In recent years, a large amount of SNPs in lncRNAs have been found to be related to carcinogenesis41. For example, genetic variants in MALAT1A were suggested to be associated with breast cancer42 and colorectal cancer risk27. A combined analysis of genome-wide association study (GWAS) and meta-analysis identified a novel and significant association between rs16941835 in lncRNARP11-58A18.1 and CRC susceptibility43. Not only that, but those SNPs in lncRNAs were implied to have function in regulating the expression level of lncRNAs in the process of cancer. A genome-wide association study indicated that variant SNPs in a long noncoding RNA MIR2052HG showed a dose-dependent increase in MIR2052HG expression as well as increased binding of ERα to the EREs by performing estradiol treatment44. Another research showed the rs2147578 in lnc-LAMC2–1:1 were significantly associated with increased CRC risk by influencing the binding of lnc-LAMC2–1:1 and miR-128–3p45. In addition, the latest findings were reported that the SNP rs2027701 of Linc-ROR in the lncRNA-p53 regulatory network had significant associations with the risk of neutropaenia42. These results inspire us to study the role of Linc-ROR tagger SNPs in the expression of Linc-ROR mRNA and the the process of breast cancer.

However, to date, no study about the tagger SNPs of Linc-ROR has ever been reported in BC. In our case-control study, we found TT (OR: 1.79; 95%CI: 1.20–2.68) genotype of rs4801078 in Linc-ROR had a significant association with the higher BC risk in codominant model (OR: 1.79; 95%CI: 1.20–2.68) and recessive model (OR: 1.80, 95%CI: 1.28–2.66). Moreover, the expression of Linc-ROR mRNA was closely related with the alleles of rs4801078. The results of qRT-PCR revealed the relative expression of Linc-ROR mRNA in rs4801078 CT + TT group (1.94 ± 0.27) was significantly higher than the CC group (1.21 ± 0.19). No meaningful association between the other three SNPs (rs1942348, rs6420545 and rs9636089) and the risk of BC was observed in the major models, however, women with the mutant alleles of rs1942348, rs6420545 and rs9636089 had a lower BC risk in the subgrounps with age at menarche ≤ 13 (OR: 0.95, 95%CI: 0.92–0.99 for rs6420545) and age >50 (OR: 0.60, 95%CI: 0.37–0.99 for rs1942348; OR: 0.57, 95%CI: 0.35–0.92 for rs9636089). In addition, the Haplotype analysis showed haplotype Trs6420545Crs4801078Trs1942348Trs9636089 could decrease BC risk (OR: 0.72, 95%CI: 0.54–0.97). Our results suggest that the regulatory polymorphisms in Linc-ROR might influence breast cancer risk and large sample studies carrying in other races are needed to verify our discovery.

BC is a complex disease and epidemiological studies suggest that the occurrence of BC is usually due to the gene-environmental factors interaction46. The reproductive factors have been proved to be related with BC, including menarche age, menopausal state, number of pregnancy, number of abortion, breast-feeding history for born baby47,48. In addition, a large number of SNPs in susceptibility genes have been found to be related to BC risk. There have been few reports of the interactions between SNPs in susceptibility genes and breast cancer influence factors46,49. However, the analysis of such gene-environment interaction is rarely reported in Chinese population and no study about the interaction between variants in Linc-ROR and the reproductive factors was found. In the current study, we found the interaction of rs4801078, number of pregnancy and menopausal status might increase BC risk by 2.78 times. We all know that reproductive factors usually represent a difference in hormone levels for women. Recent research suggested that Linc-ROR functions as an onco-lncRNA to promote estrogen-independent growth of ER+ breast cancer by the MAPK/ERK signaling pathway24. Moreover, another research showed the Linc-ROR expression levels in plasma were correlated with estrogen receptor (P < 0.05) and progesterone receptor (P < 0.05)40. Considering that the result of our study “functional polymorphisms in Linc-ROR might be associated with expression of Linc-ROR”, we assume that functional polymorphisms in Linc-ROR might have interaction with reproductive factors and the MDR analysis of our study proved the idea. Our results suggested that the gene-reproductive factors interaction might contribute to BC risk and further studies about the gene-reproductive factors interaction were warranted.

The strengths and limitations of the article are as follows: the new and pathological diagnosed BC patients avoids the prevalence-incidence bias; the selection bias is effectively controlled by random selection of the control group in the community; age and gender-matched controls and strict inclusion criteria reduce confounding bias; data was collected through the questionnaire by uniformly trained investigators and information was double entered in the database; moreover, randomly selected ten percent DNA samples were directly sequenced and each sample was repeated three times independently in qRT-PCR tests for the repeatability; nevertheless, in some subgroups, the statistical power is reduced due to the relatively small sample size; in addition, since early menarche was an established risk factor for breast cancer, the higher mean age at menarche in cases showed that there was still selection bias in our study.

To sum up, we found rs4801078 TT genotype in Linc-ROR had a significant association with the higher risk of BC and analysis of qRT -PCR showed the expression of Linc-ROR mRNA was closely related with the alleles of rs4801078. Rs4801078 might be related to BC by interacting with the number of pregnancy and menopausal status. Our results suggested that tagger SNPs in Linc-ROR and the interaction of tagger SNP-reproductive factors might contribute to BC risk, and the alleles of rs4801078 have positive correlation with Linc-ROR mRNA expression. Prospective studies with large sample and furthermechanism research are warranted to explore the role of the variants in Linc-ROR.

Methods

Subjects

A total of 968 participants were enrolled in the genetic epidemiology study between 2013 and 2015 from a community-based study in Henan province attended by 20000 individuals. Due to the low prevalence of breast cancer in the program, most of BC patients came from the First Affiliated and the Third Affiliated Hospital of Zhengzhou University. The inclusion criteria included the newly pathological diagnosed BC patients in Henan province without any other malignant tumor. 484 frequency matched cancer-free controls were selected from the program randomly. The inclusion criteria of the control group was healthy people without chronic diseases history and age-appropriate frequency matched (±2 years). All the participants were unrelated.

The demographic data and some potential BC risk factors were collected from a structured questionnaire by face to face interviews. The information of receptor status (estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor-2 (HER-2) status) came from pathological report. Some potential BC risk factors in the current study included age, age of menarche, menopause age, menopausal state (premenopausal, postmenopausal), gravidity, number of abortions, history of breast-feeding (yes, no) and BC family history in first-degree relatives (yes, no). Written informed consents were obtained from all participants. The study was approved by the ethical review committee of Zhengzhou University Committee for Medical and Health Research Ethics and all experiments were performed in accordance with relevant guidelines and regulations.

DNA extraction

We collected 5 ml venous blood from each subject in ethylene diamine tetra acetic acid (EDTA), part of the blood cells were used to extract genomic DNA with the DNA Extraction Kit (TIANGEN BIOTECH, Beijing). Separated plasma and DNA samples were stored at −80 °C.

SNP selection and genotyping

All tagger SNPs of Linc-ROR were selected by Haploview software basing on Chinese Han population data of HapMap Project (HapMapRel 28, NCBI B36) as well as 1000 Genome Project with minor allele frequency (MAF) > 0.1 in Chinese Han population (Table 8).

Table 8 PCR information of the four SNPs.

rs6420545 and rs9636089were genotyped by polymerase chain reaction-restriction fragment-length polymorphism (PCR-RFLP) while the genotyping of rs1942348 and rs4801078were conducted by created restriction site PCR (CRS-RFLP) assays in our research. Primer 5.0 software was used to design the primer pairs of the four SNPs for PCR amplification and the NCBI BLAST was performed to evaluate the specificity of primers. Gradient PCR reactions was performed to standardize the DNA amplification conditions and optimize the annealing temperature for the primers set (Table 1). Rs6420545, rs9636089 rs1942348 and rs4801078 were digested by restriction enzymes of NsiI, MwoI, BlpI and BsmAI (Thermo Scientific) respectively which were selected by WATCUT website (http://watcut.uwaterloo.ca/watcut/watcut/template.php).In order to control the quality, we randomly selected 10% of the DNA samples to be sequenced (BGI Sequencing, Beijing).

The relative expression of the Linc-ROR in plasma

One hundred and fifty individuals from the controls were selected and qRT-PCR test was used to detect the relative expression of Linc-ROR in plasma by the Eco Real-Time PCR System (Illumina, USA). The conditions for the reaction of real-time PCR are as follows: 1) initial denaturation: 95 °C for 30 s; 2) 40 cycles of denaturation: 95 °C for 10 s; 3) anneal: 60 °C for 30 s. Our assay provided GAPDH as a reference gene and the 2−ΔΔCt method was performed to calculate the relative expression of Linc-ROR. The real-time PCR amplification of each sample was repeated three times.

Analysis

The sample size (n = 459) of the study was calculated by the software power analysis and sample size (PASS)with the minimum allele frequency (0.25) and the study power (0.9). We assess the representativeness of the control population using a goodness-of-fit χ2-test (Hardy–Weinberg equilibrium).Categorical variables and continuous variables were calculated by Chi-squared and Student’s t test respectively to assess distribution departure in two groups. Unconditional logistic regression analysis was applied to estimate the relationship between SNPs and BC (or receptor status) with adjusted for the potential BC risk factors. Stratified analysis for the potential risk factors mentioned was made in different subgroups to estimate the relationship between SNPs and BC risk. MDR method was conducted to calculate the gene-reproductive factors interactions and online SHEsis (http://analysis.bio-x.cn/myAnalysis.php) was used to analyze the difference of haplotype frequencies in both patients and controls. Relative expressions of the gene Linc-ROR were presented as mean ± standard deviation (\(\overline{X}\) ± SD) and the one-way ANOVA was applied to assess the difference. The Benjamini-Hochberg (BH) correction was used to control false discovery rate (FDR).The SPSS 21.0 statistical software package (Analysis software, Shanghai, co., LTD, 6761805c6989326cbf14) was used for all statistics analyses, and two-side P value less than 0.05 was regarded as significant.

Data availability

The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request.