Main

Over the last decade, advances in genome-wide analyses of the eukaryotic transcriptome have revealed that most of the human genome is transcribed. In fact, the genome was found to be more complicated than merely composed of a collection of protein-coding genes, as it shows extensive antisense, overlapping, and non-coding RNA (ncRNA) expression.1, 2, 3 Although previously considered to be transcriptional noise, recent evidence suggests that ncRNA may have a major biological role in cellular development and metabolism.4, 5 One of which, the newly discovered long non-coding RNAs (lncRNAs), are defined as endogenous cellular RNAs of more than 200 nucleotides in length composed of subsets of polyadenylated and non-polyadenylated transcripts differentially accumulating in the nucleus and cytoplasm.5, 6, 7 However, lncRNAs overlapping with or interspersed between coding and non-coding transcripts have been found to lack protein-coding capacity.6, 8 From a genetic point of view, lncRNAs fall into five broad categories: sense, antisense, bidirectional, intronic, and intergenic.4

Similar to protein-coding genes and miRNAs, lncRNAs have key roles in tumorigenesis. These molecules have been shown to have a functional role in a number of fundamental processes associated with cancer including cell cycle regulation, apoptosis, the DNA damage response, and metastasis (Table 1). A recent study has proposed six hallmarks of cancer that together form the fundamental principle of malignant transformation. The same study summarizes the conceptual basis of each hallmark as it relates to different lncRNAs. These basic hallmarks include the ability to evade growth suppressors, activate invasion and metastasis, sustain proliferative signaling, sustain replicative immortality, induce angiogenesis, and resist cell death.9

Table 1 Examples of lncRNAs with various modes of action in human cancers

One function of lncRNAs in cancer: mediating invasion and metastasis

The capability to invade and form distant metastases involves complicated interactions and regulatory mechanisms. Therefore, improvements in morbidity and mortality will require effective treatments targeting metastatic disease. Despite the knowledge gained from more than a century of metastasis research, the molecular mechanisms are still not fully understood. It was indicated that high-grade epithelial carcinomas showed a more invasive phenotype and distant metastases, with most patients dying from these metastases rather than from the primary tumor.10 Invasion and metastasis involves various genes and multiple steps.11 Each step in invasion and/or metastasis requires the coordinated temporal expression of genes and spatiotemporal expression of proteins. Cancer cells usually undergo morphological alterations and change their cell–cell or cell–matrix interactions. Taken together, these processes enable the cells to successfully undergo the first steps of the multistep process of invasion and metastasis.

During their transition via the lymphatic system and blood circulation, cancer cells escape immune surveillance and demonstrate anchorage-independent growth and survival. Then, the cancer cells invade their target tissues to form micrometastases and eventually form a secondary tumor. As metastatic cancers are frequently refractory to the therapeutic treatments that are effective on the primary cancers,12 it is critical to explore the knowledge of the specific regulatory mechanisms underlying tumor invasion and metastasis. In this review, we summarize the respective biological mechanisms of different lncRNAs related to cancer invasion and/or metastasis (Table 2).

Table 2 Examples of potential lncRNAs associated with cancer invasion and/or metastasis

lncRNAs in PRC-dependent transcriptional programs

One aspect of cancer development involves the alteration of gene expression patterns due to epigenetic changes. The polycomb group (PcG) proteins function in multiprotein complexes referred to as polycomb repressive complexes (PRCs), which repress gene expression via chromatin modification. PcG proteins bind to and repress the promoters of genes that encode proteins with key roles in cell fate determination and embryonic development. In cancer, PcG target genes are frequently epigenetically silenced by DNA methylation,13 and this silencing may be due to the high level of PcG proteins in cancer.14

HOTAIR and PCAT-1—PRC2 pathway

Long intergenic ncRNAs (lincRNAs) are a class of pervasively transcribed genes that can serve as the interface between DNA and chromatin-modification complexes.15, 16 Certain lincRNAs tether PRCs to nearby genes in cis, for example, in developmental dosage compensation and imprinting.17 The lncRNA HOTAIR is the first example of a lincRNA that regulates genes at a distance. HOTAIR, expressed from the developmental HOXC locus located on chromosome 12q13.13, binds to the PRC2 complex and targets the complex to HOXD genes located on a different chromosome, resulting in the cross-regulation of HOX genes for proper positional identity.15, 18 Gupta et al19 recently revealed an important role for HOTAIR in breast cancer metastasis. HOTAIR is highly induced in metastatic breast cancer, and a high level of HOTAIR expression in primary breast tumors is a powerful predictor of eventual metastasis and mortality.19, 20 HOTAIR overexpression induced the genome-wide localization of the PRC2 subunit enhancer of zeste homolog 2 (EZH2) to many genes, leading to altered histone H3 lysine 27 methylation and the downregulation of multiple metastasis suppressor genes, including HOXD10, JAM2, and PCDH10 (Figure 1).19, 20, 21, 22 Conversely, the loss of HOTAIR can inhibit cancer invasiveness, particularly in cells that possess excessive PRC2 activity.21 Kogo et al18 also suggested that HOTAIR expression is associated with the genome-wide reprogramming of PRC2 function in both breast cancer and colorectal carcinoma (CRC), whereby the upregulation of this lncRNA may be a critical element for metastatic progression.

Figure 1
figure 1

Proposed metastatic mechanism of HOTAIR in cancer.

Similar regulatory mechanisms may occur in prostate cancer-associated ncRNA transcripts-1 (PCAT-1), another lncRNA that was first found to be highly expressed in prostate cancer. PCAT-1 resides in the 8q24 ‘gene desert’ locus, within the vicinity of well-studied SNPs associated with prostate cancer risk and approximately 725 kb upstream of the c-MYC oncogene, suggesting that this locus may be linked to additional aspects of cancer biology.23 Prensner et al24 recently showed that PCAT-1 is upregulated in a subset of metastatic and high-grade localized cancers and can promote cancer cell proliferation by repressing the PRC2 complex. It was suggested that PCAT-1 might also be coordinated with the oncoprotein EZH2, which encodes a core PRC2 protein that contributes to the metastatic phenotype. Knockdown of EZH2 causes the dramatic upregulation of PCAT-1 expression,24 and PCAT-1 overexpression leads to increased cell proliferation in vitro and is associated with metastasis in prostate cancer. The PCAT-1 upregulation observed in prostate cancer may represent an adaptive cellular mechanism for overcoming controlled proliferation and promoting metastasis.25, 26 Furthermore, a recent report has demonstrated that PCAT-1 is similarly upregulated in CRC and is highly related to CRC metastasis.25 However, further investigation is still required to illustrate the regulatory mechanism of PCAT-1.

MALAT1—PRC1 pathway

MALAT1 (metastasis-associated lung adenocarcinoma transcript 1), also later referred to as NEAT2 (nuclear-enriched abundant transcript 2), is a prognostic marker for metastasis and patient survival in non-small cell lung cancer.9, 27 This ncRNA is extremely abundant in many human cell types and is highly conserved across several species, underscoring its functional importance. MALAT1 has been linked to a variety of cancers of the breast, liver, and nasopharynx.28, 29, 30

The molecular mechanism of MALAT1 action is currently under debate. Previous studies have identified MALAT1 as a regulator of the alternative splicing (AS) of a subset of genes.31 Other studies have revealed that MALAT1 is an active player in lung cancer metastasis and have found that the genetic regulation of metastasis-associated genes, rather than AS, is the critical MALAT1 mechanism in metastasis.32, 33 MALAT1 was found to interact with the demethylated form of CBX4 (chromobox homolog 4), also referred to as polycomb 2 (Pc2), a component of the polycomb repressive complex 1 (PRC1). E2F1 is a transcription factor that has a crucial role in cell cycle control and the action of tumor suppressor proteins. Binding of MALAT1 to unmethylated Pc2 promotes E2F1 SUMOylation, leading to the relocalization of growth control proteins between polycomb bodies and interchromatin granules, which are areas of genetic silencing and active gene expression, respectively. MALAT1 resides in these subnuclear structures and functions as an activator of gene expression, potentially by mediating the assembly of coactivator complexes.32 One study identified a set of genes that might act to promote lung cancer metastasis, reporting that the expression of these genes depends on MALAT1.34, 35, 36, 37 These MALAT1 target genes were either associated with metastasis or represented critical regulators of metastasis establishment, and were strongly reduced by the loss of MALAT1.34 These reports support the hypothesis that MALAT1 is an activator of metastasis and affects metastatic processes.

In summary, HOTAIR and MALAT1 regulate a broad subset of genes involved in cancer cell invasion and metastasis. On the one hand, these two lncRNAs are part of the ‘metastatic signature’ of gene expression; on the other hand, they contribute to shaping this signature. Regardless, further studies are necessary to elucidate the mechanism of PCAT-1 regulation. Taken together, these findings reveal an lncRNA–PRC pathway for cancer invasion and metastasis and suggest that lncRNAs are not only prognostic biomarkers for metastasis development but may also be major players in disease progression by regulating metastatic gene expression programs.

lncRNAs in signal transduction pathways

Cancer often arises when normal cellular growth goes awry due to defects in critical signal transduction pathways.38, 39 Although lncRNAs have recently emerged as novel biomarkers in cancer development and progression, many of them have been found to be associated with fundamental biological processes and signaling networks (ie, the p53 pathway). Huarte et al40 have demonstrated that numerous lncRNAs are key constituents in the p53-dependent transcriptional pathway.41 However, the means by which lncRNAs function in cancer invasion and/or metastasis via different signal transduction pathways is still unclear and must be further investigated and clarified. Here, we summarize the particular lncRNAs that have been studied with regard to this aspect.

LET–HIF-1α pathway

LncRNA-low expression in tumor (lncRNA-LET), also known as NPTN intronic transcript 1, is significantly underexpressed in hepatocellular carcinoma (HCC), CRC, and squamous cell lung carcinoma tissues compared with its expression in peritumoral tissue. Furthermore, a lower lncRNA-LET level was found to be associated with tumor metastasis and determined to have tumor suppressor activity.42 Hypoxic microenvironments have a critical role in the metastasis of various cancers,12, 43 and it has been demonstrated that a hypoxic microenvironment suppresses lncRNA-LET through histone deacetylase 3 (HDAC3) activation, resulting in reduced histone H3 and H4 acetylation levels in the lncRNA-LET promoter region. NF90 is a double-stranded RNA-binding protein implicated in the translational control of many target mRNAs, including hypoxia-inducible factor 1 alpha (HIF-1α) subunit and cell division cycle 42 (CDC42)44, 45 The lncRNA-LET transcript was found to associate with NF90 to promote NF90 protein ubiquitination and subsequent degradation, and the downregulation of lncRNA-LET was found to be a key step in the stabilization of nuclear factor 90 protein via the ubiquitin–proteasome pathway. The association between lncRNA-LET and NF90 indirectly regulates the protein levels of HIF-1α and CDC42, and the HIF-1α/HDAC3/lncRNA-LET/NF90 signaling pathway might provide a positive feedback loop that augments the HIF-1α response, leading to hypoxia-induced cancer cell invasion and tumor metastasis (Figure 2).12 These results indicate that lncRNA-LET acts as a tumor suppressor and may promote HCC metastasis when downregulated, suggesting that it may be an important target for tumor therapy.

Figure 2
figure 2

Proposed metastatic mechanism of lncRNA-LET in cancer.

MALAT1, H19 and CCAT2—Wnt/β-catenin pathway

Recent studies are helping to unravel the association between EMT and lncRNA. EMT is a key step toward cancer metastasis. E-cadherin (CDH1) is a critical cell-to-cell adhesion molecule and aids in the assembly of epithelial cell layers; therefore, increased CDH1 expression inhibits invasion and metastasis formation. Accordingly, the loss of E-cadherin expression is a hallmark of the EMT process and is likely required for enhanced tumor cell migration.46 The Wnt signaling pathway has a particularly tight link with EMT.47 Canonical Wnt signaling is activated on the binding of Wnt ligands to Frizzled (Fzd)/LDL-related protein coreceptor complexes, which leads to the activation of Fzd and the nuclear translocation of β-catenin. The nuclear translocation of β-catenin in turn activates the expression of Slug, which triggers EMT and a proinvasive gene expression profile.47, 48

As mentioned above, MALAT1 functions as an active player in cancer metastasis by interacting with the demethylated form of Pc2, a component of PRC1, thereby promoting E2F1 SUMOylation and leading to relocalization of growth control proteins. However, another report has suggested that MALAT1 promotes EMT by activating the Wnt/β-catenin pathway in bladder cancer cells: the downregulation of MALAT1 resulted in a decrease in the levels of Slug, as well as ZEB1 and ZEB2 (known as transcriptional repressors of E-cadherin), and an increase in CDH1 levels, indicating that MALAT1 promotes EMT by activating Wnt signaling.49

LncRNA H19 may be involved in a similar regulatory mechanism. H19, an imprinted and maternally expressed oncofetal gene, is located on chromosome 11p15.5 near the IGF II (encoding insulin-like growth factor II) locus.50 H19 is expressed abundantly in many cancers and only marginally in nearly all normal adult tissues.50 Adriaenssens et al51 have suggested that H19 has a role in epithelial–mesenchymal interactions and thus in morphogenesis and/or migration of epithelial cells. A previous study has shown that H19 is highly expressed in hepatic metastases from a range of human carcinomas,52 although the underlying mechanism has not been clearly shown. A recent study suggested that upregulated H19 enhances bladder cancer metastasis by associating with EZH2, resulting in Wnt/β-catenin activation and the subsequent downregulation of CDH1.53

Recent research has identified another novel lncRNA-CCAT2 that maps to 8q24, is highly overexpressed in colorectal cancer, and promotes tumor growth, metastasis, and chromosomal instability.54 It has been demonstrated that MYC, miR-17-5p, and miR-20a are upregulated by CCAT2 through TCF7L2-mediated transcriptional regulation, and the physical interaction between CCAT2 and TCF7L2 results in the enhancement of Wnt signaling activity. CCAT2 is also a Wnt downstream target, which suggests the existence of a feedback loop.54

Intronic lncRNAs (PPP3CB, MAP3K14, and DAPK1 loci) and MAPK pathway

The activity of the RAS/MAPK pathway was shown to be significantly elevated in both primary and metastatic lesions, and the inhibition of RAS/MAPK signaling significantly reduced the metastatic progression of transplanted stem/progenitor cells. These findings suggest that RAS/MAPK activation could have a critical role in promoting cancer progression and metastasis.55

Some of the earliest and most universal genetic alterations observed in pancreatic cancer are activating mutations in the oncogene KRAS.56 At least 99% of pancreatic intraepithelial neoplasia lesions harbor mutations in KRAS, suggesting that its activation is an important initiating step in the carcinogenesis of most pancreatic cancers.57 Activating mutations in the KRAS gene result in a loss of the intrinsic GTPase activity of the Ras protein, and constitutive signaling occurs, even in the absence of extracellular signals.58 Activated Ras is involved in a number of signaling pathways, including the RAF/MAPK pathway, resulting in Ras–Raf–MAPK signaling.59, 60

Recently, pancreatic ductal adenocarcinoma (PDAC) was studied to identify the gene expression profiles of lncRNAs correlated with pancreatic cancer and metastasis.61, 62, 63 A subset of intronic lncRNAs (PPP3CB, MAP3K14, and DAPK1 loci) expressed in pancreatic tissues was identified; the abundance of these lncRNAs was correlated with PDAC metastasis and were enriched in genes associated with the MAPK pathway.64, 65 Nevertheless, further studies will be necessary to reveal the possible biological functions and molecular mechanisms of these lncRNAs in PDAC tumorigenesis and/or progression, and these intronic lncRNAs might be novel candidate biomarkers of pancreatic tumors.

The data accumulated thus far have suggested that lncRNAs may mediate cancer invasion and metastasis via different signal transduction pathways. These results advance our understanding of the role of lncRNAs as regulators of various signaling pathways and offer new avenues for therapeutic intervention against cancer progression.

lncRNAs as diagnostic and prognostic markers

Cancer is a complicated, multifactorial, multistep disease. Desirable molecular markers of malignancy are important diagnostic and prognostic tools that can aid in patient management in oncology clinics. Numerous molecular markers for various cancers have been discovered and validated over the last few decades, and lncRNAs may be new potential markers based on their roles in different pathways. Several lncRNAs have been suggested to be dysregulated in various cancers, such as breast cancer, colorectal cancer, HCC, prostate cancer, bladder cancer, and lung cancer (Table 2),18, 19, 20, 21, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34 indicating that lncRNAs may become a promising biomarker in cancer diagnosis and prognosis.

Cancer metastasis portends a poor prognosis and cannot be reliably predicted. In the setting of metastasis, patients usually have a poorer prognosis and shorter survival period, and few patients achieve a durable remission with the currently available therapies. Tumor metastasis cannot be reliably predicted based on patients’ clinical manifestations, pathologic findings, or other currently available laboratory tests. Although different methods have been used to predict clinical outcomes for patients with metastasis on the basis of clinical and pathologic features, these approaches do not incorporate the more complex biological features of individual patients.66

Recent studies have shown that the metastatic capability of cancer is conferred by genetic changes that occur relatively early in tumorigenesis and that metastatic dissemination may occur continually throughout the course of primary tumor development.67, 68, 69 Therefore, it is scientifically and clinically relevant to identify metastasis-specific molecular biomarkers to predict metastasis. Numerous proteinaceous molecular markers for various cancer metastases have been discovered and validated over the last few decades, and lncRNAs are emerging as new factors that have potential roles as diagnostic and prognostic predictors via both oncogenic and tumor-suppressive pathways. For example, the increased expression of the lncRNA HOTAIR was shown to be associated with metastasis in breast cancer patients, demonstrating a unique association with patient prognosis.20 Besides, HOTAIR expression levels were found to be correlated with metastasis in colorectal cancer and many other malignant neoplasms in liver, stomach, nasopharynx, esophagus, and skin, thus becoming a predictor of tumor recurrence in patients.18, 20, 70, 71, 72, 73 In addition, the inhibition of MALAT1, which is associated with metastasis and poor prognosis, might be an effective therapeutic strategy for metastatic cancer. Importantly, the inhibition of MALAT1 does not affect normal cells because this protein is not required for the viability of these cells.9, 28, 33, 34 Recently, Wheeler et al.74 reported a novel strategy for effectively abolishing MALAT1 in vivo using antisense oligonucleotides. This result strongly supports the notion that the elimination of MALAT1 using gene therapy has an excellent potential for curing metastatic cancers. Besides non-small lung cancer, MALAT1 was also found to involve in the progression of other tumors. For example, Ren S et al75 demonstrated that siRNA targeting MALAT1 elicited delayed tumor growth and reduced metastasis of prostate cancer xenografts in castrated male nude mice, suggesting that MALAT1 may be needed to maintain prostate tumorigenicity and may serve as a potential therapeutic target for castration-resistant prostate cancer. Like HOTAIR, MALAT1 was also known to be misregulated in many other cancer metastases, including CRC and bladder cancer.49, 76 HOTAIR and MALAT1 both act as oncogenic lncRNAs, whereas some other lncRNAs were found to exert tumor-suppressive roles in cancer invasion and metastasis. For instance, lncRNA-LET was discovered to be significantly underexpressed in HCC, CRC, as well as squamous cell lung carcinoma tissues compared with its expression in peritumoral tissue, and a lower lncRNA-LET level was found to be associated with tumor metastasis and determined to have tumor suppressor activity.42 Another lncRNA called lincRNA-BM742401 was downregulated in cancer, and its downregulation was associated with poor survival in gastric cancer patients. Ectopic overexpression of BM742401 inhibited metastasis-related phenotypes and decreased the concentration of extracellular MMP9, suggesting that BM742401 is also a potential lncRNA marker and therapeutic target77 (Table 3).

Table 3 lncRNAs as therapy targets for metastasis in different cancer types

One single lncRNA may indicate poor/good prognosis of various tumors, such as HOTAIR, MALAT1, and LET mentioned above. On the other hand, various lncRNAs may be over/underexpressed in one single tumor type. For example, in CRC, HOTAIR, PCAT-1, MALAT1, CCAT2 were discovered to act as oncogenes and overexpressions of these lncRNAs indicate a poor prognosis by promoting cancer metastasis. While Lower LET and ncRAN expression levels were significantly associated with poor disease outcome for CRC patients, thereby having tumor-suppressive roles in cancer42, 78 (Table 3).

Overall, an understanding of the precise molecular mechanisms by which lncRNAs function in various invasive and metastatic tumors will be critical for the development of new potential strategies for early diagnosis and therapy. It is necessary to revisit established methods for targeting RNA molecules and discuss new agents and strategies of lncRNAs as biomarkers in cancer invasion and metastasis.

Conclusions and perspectives

Much progress has been made in recent years concerning the biological mechanisms of cancer invasion and/or metastasis. The idea that invasion is regulated by the developmental regulatory program known as EMT is a hot topic. Indeed, EMT processes, which are characterized by diminished epithelial and increased mesenchymal characteristics, endow epithelial cells with an enhanced migratory potential and have been implicated in many pathological processes requiring cell migration.79 EMT is a key step toward cancer metastasis: in most carcinomas, the invasion of cancer cells into the surrounding tissues involves their molecular reprogramming as part of EMT.9 In this review, we related that some lncRNAs, such as MALAT1 and H19, were found to promote EMT by activating the Wnt/β-catenin pathway in bladder cancer cells.49, 53 A recent study has identified the role of HOTAIR within the context of EMT and has suggested that HOTAIR inhibition could reverse the EMT process in gastric carcinoma cells. Furthermore, the expression of Snail, one of the major transcription factors regulating EMT, could be induced by HOTAIR overexpression, implying that HOTAIR could act as a potential regulator of EMT.80 It is possible that other lncRNAs could be important during EMT or could act as regulators of other EMT-related genes. In addition, it is possible that lncRNAs could be involved in the complex interplay between tumor cells and the tumor stroma or that they could also influence inflammatory cell behavior. Further investigations and studies will be required to confirm these possibilities.

In this review, we highlighted the several lncRNAs that have functional roles in cancer-associated invasive and metastatic processes. An understanding of the interrelationships between regulatory genes and gene products (including lncRNA) and how these relationships are modulated within the microenvironmental context is necessary to unravel the complex tapestry of cancer invasion and/or metastasis. Further studies are needed to identify the genetic networks and pathways that are regulated by abnormally expressing lncRNAs to fully understand the molecular basis of cancer cell invasion and metastasis, In addition, it will become essential to explore the potential utility of lncRNAs in cancer as diagnostic and prognostic markers, as well as the potential of developing lncRNA-mediated therapy.