Evidence suggests Insulin-like growth factor 1 (IGF1) signaling is involved in the initiation and progression of a subset of breast cancers by inducing cell proliferation and survival. Although the signaling cascade following IGF1 receptor activation is well-studied, the key elements of the transcriptional response governing IGF1’s actions are not well understood. Recent studies reveal that the majority of the genome is transcribed and that there are more long non-coding RNAs (lncRNAs) than protein coding genes, several of which are dysregulated in human cancer. However, studies on the regulation and mechanism of action of these lncRNAs are in their infancy. Here we show that IGF1 alters the expression levels of a subset of lncRNAs. SNHG7, a member of the small nucleolar host gene family, is a highly-expressed lncRNA that is consistently and significantly down-regulated by IGF1 signaling by a post-transcriptional mechanism through the MAPK pathway. SNHG7 regulates proliferation of breast cancer cell lines in a dose-dependent manner, and silencing SNHG7 expression causes cell cycle arrest in G0/G1. Intriguingly, SNHG7 alters the expression of many IGF1 signaling intermediates and IGF1-regulated genes suggesting a feedback mechanism to tightly regulate the IGF1 response. Finally, we show in clinical data that SNHG7 is overexpressed in tumors of a subset of breast cancer patients and that these patients have lower disease-free survival than patients without elevated SNHG7 expression. We propose that SNHG7 is a lncRNA oncogene that is controlled by growth factor signaling in a feedback mechanism to prevent hyperproliferation, and that this regulation can be lost in the development or progression of breast cancer.
Substantial evidence implicates IGF1 signaling in the initiation and development of a number of cancers including breast cancer1,2,3,4. The signaling initiated by IGF1 binding to IGF1R, a receptor tyrosine kinase, is well known. IGF1R activation induces a phosphorylation cascade through IRS1 and IRS2, which stimulates the MAPK and PI3K/AKT pathways3. Ultimately, IGF1 signaling leads to a robust and temporal transcriptional response5,6—10% of all protein coding genes6—and an array of biological processes including cell proliferation and survival7. While the signaling and biological responses elicited by IGF are well-known, the IGF-regulated genes and the molecular mechanisms that govern those biological responses are largely unclear. Furthermore, there has not been a comprehensive examination of IGF1-induced transcriptome changes using RNA sequencing. This is critical given that IGF regulates a vast number of protein coding genes and recent large-scale omics studies including ENCODE demonstrate that there are more non-coding transcripts than coding8,9,10.
Long non-coding RNAs (lncRNAs) are a diverse class of RNA molecules that are loosely defined by an arbitrary length of greater than 200 nucleotides and the apparent lack of protein coding potential10,11,12,13,14,15,16,17. The number of lncRNAs, although debated in the literature, at least rivals the number of protein coding genes10. While the vast majority were recently identified and do not have a known function, several lncRNAs including XIST18,19,20, HOTAIR12, and H1921,22, have been studied for decades. From those and recent studies, it is evident that lncRNAs are important regulators of a variety of cellular processes including transcriptional regulation, chromatin structure, RNA stability, and cell proliferation. lncRNAs regulate these biological processes through a variety of novel mechanisms that often are due to the ability of lncRNAs to bind to DNA, RNA, and proteins and act as guides, scaffolds, and decoys23. Further, the dysregulation of lncRNAs is implicated in the development and progression of many diseases including breast cancer11,12,13,24,25,26,27,28,29. Therefore, it is imperative to identify and characterize the regulation and functional significance of novel lncRNAs to understand basic biological processes and the pathogenesis and treatment of diseases such as breast cancer.
There has not been a comprehensive examination of regulation of lncRNAs by IGF1, but IGF/Insulin signaling represses the expression of CRNDE30, a lncRNA highly expressed in colorectal cancer and gliomas31,32. In this report, we aimed to further understand the molecular mechanisms of the biological functions of IGF1 and to leverage the extensive knowledge of IGF1 as a model system to identify and characterize growth factor regulated lncRNAs that are functionally critical in breast cancer. Here, we demonstrate through whole transcriptome RNAseq that IGF1 signaling regulates a subset of lncRNAs that are altered in breast cancer, including the known but unstudied lncRNA, SNHG7, which is amplified or overerxpressed in ~5% of breast tumors in TCGA. Further, we show that SNHG7 is downregulated by IGF via a post-transcriptional mechanism through MAPK and controls proliferation in a dose-dependent manner. SNHG7, in part, tightly controls proliferation by altering mRNA levels of both IGF1 signaling intermediates and downstream IGF1 regulated genes. Thereby, we identified a novel fine-tuning feedback mechanism of growth factor induced proliferation and gene expression response that is disrupted in the tumors of a subset of breast cancer patients.
IGF regulates lncRNAs that are dysregulated in breast cancer
The MCF7 cell line is a model breast cancer cell line that is robustly responsive to IGF1. Addition of IGF1 to serum deprived MCF7 cells leads to rapid activation of AKT/PI3K and MAPK pathways, expression changes of 1000 s of genes, and proliferation6. To identify lncRNAs regulated by IGF1 signaling that may be critical for proliferation of breast cancer cells, we examined the transcriptional response induced by the addition of IGF1 to serum starved MCF7 cells after 3 and 8 hrs using whole transcriptome RNAseq. The Tuxedo package33 was used for transcriptome assembly and differential gene expression analysis. The reads were aligned and transcriptomes assembled using the GRCh38 genome build with all annotated Gencode v2134 transcripts allowing for novel transcript detection. Additionally, reads that mapped to tRNAs, snoRNAs, miRNAs, and rRNAs were masked during transcript assembly to ensure proper expression calls of lncRNAs that are ‘hosts’ for small noncoding RNAs (Fig. 1A). When small ncRNA reads were not removed, expression of host lncRNA genes were often miscalculated because of the abundant reads of the small ncRNAs that are present in their introns, but are not part of the mature lncRNA (data not shown). IGF1 signaling significantly (q < 0.05) induced a greater than 1.5-fold change in 1067 and 2061 annotated (Gencode v.21) genes at 3 and 8 hrs respectively (Fig. S1A,B; Supplementary Table 1). Individual gene expression changes were validated by qPCR in the same, and in an independent set of RNA (Fig. S1C). The global changes in gene expression observed correlated with the changes shown by expression microarray in our previous study6 (data not shown). Also, as expected, pathway analysis of IGF1-regulated genes at 3 and 8 hrs (FDR < 0.05;FC > 2.0) revealed that these transcripts were involved in activation of proliferation, survival, and cancer development, as well as, inhibition of cell death (Fig. S1D). Collectively, the qPCR and pathway analyses demonstrate the quality and validity of the RNA-seq data.
To determine if any of the differentially expressed genes were lncRNAs, we used a conservative approach of extracting any IGF-regulated gene that was annotated as a lncRNA in Gencode v.21 that was not merged with a protein coding gene during transcript assembly (Fig. 1A), thus excluding many highly-overlapping antisense lncRNAs that were not properly aligned due to the use of an unstranded RNAseq library. This revealed that the expressions of 225 previously annotated lncRNAs with a minimum fpkm of 1 at either 3 or 8 hrs were significantly altered by IGF1 treatment at 3 or 8 hrs with nearly an equal number upregulated as downregulated (Fig. 1B; Supplementary Table 2). Consistent with mRNA regulation by IGF in this and our previous study, slightly more were significantly regulated at 8 hrs than 3 hrs (Fig. 1C). The expression of 65 annotated lncRNAs changed at both 3 and 8 hrs suggesting early and sustained control by IGF1 signaling (Fig. 1C). To identify cancer relevant, IGF-induced lncRNAs, we sought to examine the alteration of these lncRNAs in The Cancer Genome Atlas (TCGA) breast cancer data (http://cancergenome.nih.gov/). Of the 65 lncRNAs only 12 had a “KNOWN” gene status by GENCODE meaning the annotation is identical to a known and curated gene in Entrez and is reported in TCGA. Examination of the 12 lncRNAs in the TCGA breast cancer data through the cBIO portal35,36 revealed that 11 of them have copy number or gene expression alterations in a subset of breast cancer patients (Table 1). Interestingly, the dysregulation of one lncRNA, SNHG7, is enriched in a patient population with a poorer prognosis. SNHG7 is altered in ~5% of all breast cancer tumors in TCGA (70 of 1105 samples; 67 overexpressed or amplified). Patients with overexpressed or amplified SNHG7 had a statistically significant poorer disease-free survival (Fig. 1D and Table 1; logrank test p-value = 0.0139; N = 7 of 45 with altered SNHG7 relapsed vs. 61 of 866). This demonstrates that SNHG7 is potentially translationally relevant and was selected for further study. In addition, analysis of gene expression data extracted for all TCGA breast cancer samples demonstrates that the expression of many of the 12 IGF-regulated lncRNAs are significantly enriched in a specific molecular subtype of breast cancer (Fig. 1E). For example, SNHG15 is significantly enriched in the basal subtype (Figs. 1E and S2A,B). The regulation of SNHG7 and SNHG15 by IGF1 was confirmed with qPCR (Fig. 1F).
SNHG7 is downregulated post transcriptionally by IGF via the MAPK pathway
Because SNHG7 is highly expressed, robustly regulated by IGF1 signaling, and is altered in a subset of breast cancer patients that correlate with survival, it was investigated further. SNHG7 is a relatively understudied lncRNA and is a snoRNA Host Gene (SNHG). SNHGs are highly structured genes (noncoding or protein coding) that have snoRNAs that are spliced and processed from their introns after they are transcribed, often resulting in two functional RNA species—1) snoRNAs and 2) mRNAs or lncRNAs. For example, the well-characterized tumor suppressor lncRNA GAS5, which is down-regulated in breast cancer, is a SNHG that has multiple snoRNAs that are processed from its introns. The snoRNAs are functional, but it is the mature GAS5 lncRNA that controls apoptosis by regulating glucocorticoid receptor signaling24.
SNORA43 and SNORA17 are the snoRNAs hosted in two of the introns of SNHG7 (Figs. 2A and S3A). After the snoRNAs are spliced out of the primary SNHG7 transcript they are further processed to become functional snoRNAs. However, the mature SNHG7 transcript is conserved among primates (Fig. S3A), highly and ubiquitously expressed (Fig. S3B; GTEX data37), unlikely to encode for a protein as indicated by low PhyloCSF38 (negative for all 6 frames) and txCDsPredict (576.00) scores (images adapted from UCSC Genome Browser39), and is predicted to be highly structured (Fig. S3C; image adapted from Forna’s RNAFold40) suggesting it is noncoding and has biological functions independent of the snoRNAs. Both 5′ and 3′ Rapid Amplification of cDNA Ends (RACE) confirmed that there are at least two main REFseq annotated isoforms expressed in MCF7 cells that differ by one intron (Figs. 2A and S3A red and blue and S3D). In this report, the 5 exon, 4 intron isoform is referred to as SNHG7-I (Fig. S3A red) and the 4 exon, 3 intron isoform is referred to as SNHG7-NI (Fig. S3A blue). The 3rd RefSeq SNHG7 isoform (Fig. S3A no color) was not detected by RACE. Subcellular fractionation followed by qPCR demonstrates SNHG7 is predominantly expressed in the cytoplasm (Fig. 2B).
To test the kinetic regulation of SNHG7 by IGF1, MCF7 cells were treated with IGF1 for various lengths of time. The expression of SNHG7 is significantly and continuously down-regulated by IGF1 signaling for 24 hrs (Figs. 2C and S3E). Conversely, when IGF1 signaling is inhibited with an IGF1R kinase inhibitor (BMS-754807) in MCF7 cells growing logarithmically in full serum, SNHG7 levels increase (Fig. 2D). This further suggests that the expression of SNHG7 is tightly regulated by IGF1 signaling. This regulation is not unique to MCF7 cells as SNHG7 is also regulated by IGF1 in the immortalized but non-transformed MCF10A cells (Fig. 2E). Additionally, there is a significant negative correlation (r = −0.2727; p < 0.05) between RNA levels of SNHG7 and IGF1R (Fig. 2F) as determined by RNAseq data published for a set of 56 breast cancer cell lines41 reanalyzed through the pipeline described above, suggesting the regulation of SNHG7 by IGF signaling is common in breast cancer cell lines.
While mature SNHG7 is downregulated by IGF1 signaling, the snoRNAs contained within the introns of SNHG7 are not significantly reduced (Fig. 2G), suggesting post-transcriptional regulation of mature SNHG7 instead of transcriptional regulation of the primary transcript. To determine if this is the case, serum starved MCF7 cells were treated with Actinomycin D before addition of IGF1 or vehicle. The inhibition of transcription did not ablate the reduction of SNHG7 expression by IGF1 (Fig. 2H) suggesting that IGF1 alters SNHG7 expression by reducing the stability of the transcript and not through transcriptional repression. The reduction of SNHG7 levels after Actinomycin treatment (Fig. 2H DMSO/Ctl vs. Actinomycin/Ctl) demonstrates transcription was effectively inhibited. Combined, these results suggest that the regulation of the mature transcript is not merely a mechanism to change the expression of the snoRNAs in the introns, but rather a tight regulation of the levels of the mature SNHG7 lncRNA.
SNHG7 is a 5′terminal oligopyrimidine (5′TOP) gene similar to Gas5. It is known that Gas5 lncRNA levels and other 5′TOP genes are destabilized by translation42. Given that IGF1 signaling regulates translation, we tested if IGF1 regulates SNHG7 levels through translation. Surprisingly, we observed that inhibition of translation with cycloheximide did not prevent IGF1 from decreasing the levels of SNHG7 (Fig. 2I), so we examined the effects of signaling intermediates. Two of the primary downstream signaling pathways of IGF1R are PI3K/AKT/mTOR and MAPK. Small molecule inhibitors of PI3K, MEK, and mTOR were used to examine how IGF1 alters the stability of SNHG7. Inhibition of PI3K and mTOR had little effect on IGF1’s control of SNHG7 levels, while inhibition of MEK fully prevented alterations of SNHG7 levels by IGF1 signaling in serum starved MCF7 cells (Fig. 2I) indicating MEK signaling in the destabilization of SNHG7. Collectively, these results (Fig. 2) suggest a novel mechanism whereby IGF1 significantly down-regulates the expression of SNHG7 through posttranscriptional alteration of SNHG7 mature RNA stability via the MAPK pathway.
SNHG7 is necessary and sufficient for breast cancer cell proliferation
IGF1 signaling regulates proliferation of breast cancer cells. To determine if SNHG7 has similar effects, we examined the response of proliferation to altered SNHG7 levels. A pool of independently designed siRNA duplexes significantly reduced mature SNHG7 expression without altering the expression of the snoRNAs hosted in the introns (Fig. 3A). The proliferation of MCF7 cells in full serum with reduced SNHG7 expression was drastically reduced as scored by both a fluorometric assay measuring DNA content (Fig. 3B) and by counting cells with a hemacytometer using trypan blue exclusion (Fig. S4A,B). Proliferation of both other cell lines examined, MDA-MB-231 (Fig. S4C,D) and MCF10A (Fig. 3C) were also significantly reduced by RNAi targeting SNHG7. The inhibition of proliferation in these cells is due to the reduction of SNHG7 levels and not an off-target effect as demonstrated by the ability of 3 different individual siRNA duplexes (Fig. 3D) that target SNHG7 to all inhibit proliferation (Fig. 3E). Interestingly, these data suggest that there is a dose-dependent response to SNHG7 levels as the individual duplexes that were most efficient at inhibiting SNHG7 levels also inhibited proliferation the most (Fig. 3D,E). A live/dead assay demonstrated that the reduction in cell numbers by siSNHG7 treatment in full serum is due to a decrease in proliferation (Fig. S4E) and not an increase in cell death (Fig. S4F). While control treated cells in full serum continued to increase in number, siSNHG7 treated cells do not (Fig. S4E). However, the number of dead cells is not significantly different between treatment groups (Fig. S4F) of cells growing in full serum. FACS analysis with propidium iodine staining indicates that by 3 days, siSNHG7 treated MCF7 cells begin to arrest in G0/G1 (Fig. 3F), which also suggests SNHG7 is regulating proliferation and not death in cells growing in full serum. Furthermore, siSNHG7 treatment of MCF7 cells led to a significant reduction in IGF1-stimulated proliferation in serum free media (Fig. S4G; siSNHG7 + IGF vs siCtl + IGF). However, siSNHG7 treatment in serum-free media without IGF1 results in a decrease in cell numbers (Fig. S4G; siSNHG7 + Veh vs siCtl + Veh; non-linear regression p < 0.0001) suggesting that MCF7 cells may not be able to survive without SNHG7 and growth factor signaling. Finally, reducing the expression of SNHG7 had no effect on the sensitivity of MCF7 cells to the dual-kinase IGF1R/InsR inhibitor, BMS-754807 (Fig. S4H). However, once again it is obvious that reduced SNHG7 expression decreases basal proliferation (Fig. S4H; siCtl vs siSNHG7 at 10−9M). Together these data demonstrate that SNGH7 is necessary for full proliferation of breast cancer cell lines.
To test if SNHG7 is sufficient to induce or enhance proliferation, the two main isoforms of SNHG7 identified by RACE were cloned from cDNA of MCF7 cells. Two polyclonal MCF7 cell lines stably expressing SNHG7 were generated for each isoform (Fig. 3G) and non-linear regression analysis of proliferation data demonstrated that MCF7 cells overexpressing either isoform proliferated faster than cells expressing empty vector (Fig. 3H; doubling time = 1.746-2.183 days for SNHG7 overexpressing cells vs. 2.684-2.89 days for empty vector cells p < 0.0001). Therefore, SNHG7 is both necessary and sufficient for proliferation and regulates it in a dose-dependent manner. Furthermore, as described above, SNHG7 is overexpressed or amplified in ~5% of all breast cancer tumors in TCGA and correlates significantly with poorer disease-free survival (Fig. 1D and Table 1). This suggests that SNHG7 may act as an oncogene under certain conditions driving poor prognosis through the regulation of proliferation.
IGF/SNHG7 feedback through regulation of common transcripts
Proliferation in response to IGF is regulated, at least in part, through the vast transcriptional changes downstream of IGF signaling. It is apparent that SNHG7 is also important for proliferation (Fig. 3). To determine if SNHG7 controls proliferation through the alteration of similar transcripts as IGF1, we examined the expression of four known IGF1 regulated genes after knockdown of SNHG7 (versus scramble control) and in an SNHG7 overexpressing cell line (versus a vector control). Like IGF1 stimulation (Fig. 4A dark green), overexpression of SNHG7 (Fig. 4A dark blue) resulted in higher expression of LIF and EGR3 and lower expression of IRS2 and SOCS2 compared to empty vector control cells (Fig. 4A). Reduction of SNHG7 expression (Fig. 4A dark red) caused the opposite effect, decreased expression of LIF and EGR3 and increased expression of IRS2 and SOCS2 (Fig. 4A). Together these data suggest that IGF1 and SNHG7 regulate the expression and direction of expression of similar transcripts.
To examine if this pattern is comprehensive we performed RNAseq following reduced expression of SNHG7 by RNAi. The expressions of 4,341 genes were significantly altered (Fig. 4B and Supplementary Table 3; FDR < 0.05) with 1308 annotated genes changing by at least 1.5-fold. The regulation of expression of several genes was confirmed with qPCR (Fig. S5). At a global level, there was a negative correlation between genes regulated by IGF1 induction and knockdown of SNHG7 (Fig. 4C). Gene Set Enrichment Analyses (GSEA43) demonstrated that IGF1-regulated genes (8 hrs; FDR < 0.05) are highly enriched in genes regulated by siSNHG7 treatment (FDR < 0.05; Fig. 4D top), and genes regulated by siSNHG7 are enriched for IGF1-regulated genes (Fig. 4D bottom). Collectively, these data demonstrate that IGF1 and SNHG7 control the transcript levels of a similar set of genes and suggest that SNHG7, in part, regulates proliferation through the control of a similar transcriptome response as IGF. Additionally, Ingenuity Pathway Analysis (IPA) of siSNHG7-regulated genes showed that the top canonical pathways are Molecular Mechanisms of Cancer and IGF1 Signaling (p = 3.93E-09; 42/97 molecules altered; Fig. 4E) and the top molecular and cellular function is Cellular Growth and Proliferation. This further validates that SNHG7 is important in cancer development and proliferation. Likewise, it reveals that SNHG7 directly regulates the expression of IGF signaling transcripts (Fig. 4E) in addition to downstream targets in a manner that enhances the response of IGF1 signaling. However, RNAseq following IGF1 induction of siSNHG7 treated cells demonstrated that lack of SNHG7 did not prevent IGF from activating its signaling cascade (data not shown) or from regulating induction or repression of most transcripts (Fig. 4F; differences in siCtl_ctl and siCtl_IGF vs differences in siSNHG7_ctl vs. siSNHG7_IGF; Fig. S6). However, the overall levels of the transcripts were altered by reduction of SNHG7 expression leading to an attenuated IGF1 effect on transcripts (Fig. 4F siCtl_ctl vs siSNGH7_ctl; Fig. 4F siCtl_IGF vs siSNHG7_IGF; Supplementary Table 4) much like that on proliferation (Fig. S4G). This implies a fine-tuning feedback mechanism whereby IGF1 signaling decreases the expression of SNHG7, which is a positive regulator of IGF1 signaling intermediates and downstream targets through an independent regulation mechanism.
Finally, there are well-known issues with using breast cancer clinical data from TCGA due to short-term and limited follow-up of the patients44. Accordingly, we sought to confirm the clinical impact of extreme levels of SNHG7 in the tumors of breast cancer patients in the METABRIC45 dataset that includes rich and long-term clinical data from over 2000 patients. However, the METABRIC gene expression dataset was calculated by microarray analysis, making it impossible to know the direct levels of SNHG7 and many other lncRNAs. For that reason, we used a guilt-by-association technique to infer the levels of SNHG7 in each of the patients. The top 100 upregulated and downregulated genes by siSNHG7, determined by fold change with a FDR < 0.05, were used as an ‘SNHG7 signature’ and a Gene Set Variation Analysis43,46 was performed to provide a score to each breast cancer tumor in the METABRIC dataset. Kaplan-Meier analysis demonstrates that patients with tumors with the highest decile of SNHG7 scores (indicative of high SNHG7 levels) have a statistically significant poorer disease-free survival (logrank test p-value = 0.00079) than those with lower scores (Fig. S7). This further argues that SNHG7 has an important biological and clinical role in breast cancer.
We leveraged the knowledge of IGF1 signaling and biology as a model system to identify a lncRNA, SNHG7, that is important for proliferation and breast cancer biology. By doing so we uncovered a novel fine-tuning feedback mechanism between IGF1 and SNHG7 that tightly regulates RNA expression and cell proliferation. As summarized in a schematic in Fig. 5, our data shows that in addition to the regulation of many protein coding genes, IGF, which is necessary for proliferation, downregulates the expression of SNHG7. Our results also implicate SNHG7 in the regulation of expression of an enriched set of IGF1-regulated genes and of IGF1 signaling intermediates (Fig. 5 left). Additionally, there is a dose-response correlation between SNHG7 levels and proliferation. Therefore, when IGF1 signaling is active it alters gene expression (including downregulation of SNHG7) to increase proliferation (Fig. 5 middle). However, by reducing SNHG7, which regulates a similar set of genes as IGF1, and also numerous IGF1 signaling intermediates, the amplitude of IGF1-regulated genes is muted (Fig. 5 middle). When this feedback mechanism is overwhelmed, for example by the overexpression of SNHG7 or the disruption of SNHG7 regulation by IGF1 (indicated by an x), it leads to enhanced proliferation at least in part through differences in overall magnitude of IGF targets (Fig. 5 right – induced genes are expressed higher; repressed genes are repressed lower).
It is paradoxical that IGF1 would repress SNHG7, which controls the expression of many of the same genes (in the same direction) and is necessary for proliferation, while simultaneously inducing proliferation. However, our results and others6 show that IGF1 signaling reduces the expression of IRS2, an immediate downstream signaling scaffold, and increases the expression of numerous phosphatases (DUSPs) that dephosphorylate and inactivate many of the kinases downstream of IGF1R. Thus, IGF1 regulation of SNHG7 expression is an example of a systems biology feedback mechanism to auto-attenuate IGF1 signaling. Further, our knock-down experiments that completely inhibit proliferation reduce SNHG7 levels much lower than IGF1 signaling does (90% vs. 40%) suggesting there is a critical amount of SNHG7 necessary for proliferation. Therefore, we propose that IGF1 regulates SNHG7 levels as a feed-back mechanism to fine-tune the transcriptional response and proliferation induced by IGF1 to prevent hyperproliferation or transformation/progression. If this is true, we would predict that high levels of SNHG7 could lead to hyperproliferation. Accordingly, SNHG7 is overexpressed or amplified in ~5% of TCGA breast cancer patients, and these patients have worse disease-free survival than those without SNHG7 alterations.
In this report, we also describe a novel posttranscriptional mechanism of regulation of SNHG7 through alterations in stability via the MAPK pathway. SNHG7 is a 5′TOP gene like Gas5, which are regulated by nonsense mediated decay (NMD) through translation47. While SNHG7 levels are altered by mTOR and translational inhibition (data not shown), it is clear that IGF1/MAPK regulation of SNHG7 levels is independent of translation induced by IGF1 because inhibition of translation, mTOR, and PI3K/AKT did not prevent IGF1 mediated downregulation of SNHG7. This suggests an additional mechanism of regulation of 5′TOP genes that requires further investigation.
Our results that IGF-regulated lncRNAs, including SNHG7 and SNHG15, are important for biology, enriched in breast cancer subtypes, and correlate with survival are consistent with recent studies. A large number of functionally important lncRNAs were shown to be regulated by estrogen signaling25, but ours is the first study that examined regulation of lncRNAs by IGF. Additionally, through reanalysis of TCGA data, others have demonstrated that certain lncRNAs are enriched in specific breast cancer subtypes and lncRNAs alone can accurately stratify patients into molecular subtypes48,49,50,51. In fact, lncRNAs were shown to be more subtype specific than protein coding genes and some correspond to patient survival, suggesting their utility as biomarkers49. It is still unclear if SNHG7 or other IGF-regulated lncRNAs can be used as biomarkers or targeted for therapy. However, further understanding of the IGF1/SNHG7 system, the mechanisms of SNHG7 functions, and the characterization of other IGF1-regulated lncRNAs clearly will impact our understanding of both basic and breast cancer biology.
Cell culture, treatments, and transfections
MCF7, MDA-MB-231, T47D, and MCF10A cells were obtained by ATCC and all experiments were performed within 25 passages. MCF7 and MDA-MB-231 cells were maintained in DMEM + 10%FBS; T47D in RPMI-1640 + 10%FBS; and MCF10A cells in DMEM:F12(1:1) + 5%HS, 20 ng/ml EGF, 0.5 mg/ml hydrocortisone, 100 ng/ml cholera toxin, and 10ug/ml Insulin. For IGF induction experiments, all cells were washed 2x in PBS and serum deprived in modified IMEM + 10 mM Hepes, 1ug/ml transferrin, 1ug/ml fibronectin, and 2mM l-glutamine for 16 hrs before addition of 100 ng/mL or equal volume of 10 mM HCl as a vehicle control. To determine the mechanism of SNHG7 regulation, serum starved cells were pretreated for 1-2 hrs with 10ug/ml actinomycin to prevent transcription, 1ug/ml rapamycin (mTOR inhibitor), 50uM U0126 (MEK inhibitor), 500 nM Wortmanin (PI3K inhibitor), or 50ug/ml cycloheximide (translational inhibitor) before addition of IGF1. BMS-754807 at 10 uM was used an IGF1R inhibitor. MCF7 cells expressing either SNHG7 isoform or vector alone were created by cloning and then transfecting (Fugene 6) the respective SNHG7 isoform from MCF7 generated cDNA using the GeneRacer Kit (Thermofisher) after 3′ RACE (see Supplementary Methods for primers and additional details). Individual polyclonal lines were isolated following 2 weeks of selection with 1ug/ml G418.
Total RNA from biological triplicates was isolated, quality was determined (Bioanalyzer), rRNA was depleted (RiboMinus), multiplexed paired-end libraries were prepared (Illumina TruSeq), and sequencing was performed on an Illumina HiSeq (IGF RNAseq) or NextSeq (siSNHG7 RNAseq). Quality of the sequencing was determined by running FastQC. Differential gene expression was calculated by mapping reads to hg19 with Tophat2 (masking reads to miRNAs, snRNAs, snoRNAs, rRNAs, and tRNAs) to a concatenated.gtf of UCSC known genes and lincRNA annotations published by the Broad Institute52 and assembled using Cufflinks allowing for novel gene discovery. To determine IGF1 regulated lncRNAs as listed in Fig. 1, the raw reads were reanalyzed to a newer and more comprehensive annotations. Reads were mapped to GRCh38 with Tophat2 as documented above using Gencode v.21 annotations and again assembled using Cufflinks allowing for novel gene discovery. For all analyses differential gene expression was determined with Cuffdiff and gene names were converted with custom scripts as needed. A conservative list of IGF-regulated lncRNAs was generated by extracting any differentially expressed gene with a Cufflinks prescribed lncRNA annotation (Gencode v.21). If that gene also had a protein coding gene annotation, it was not considered a lncRNA. Heatmaps of differentially expressed lncRNAs were generated in MeV after the described normalizations. Preranked Gene Set Enrichment Analysis46 was performed according to instructions comparing IGF-regulated genes to those altered by siSNHG7 treatment. Ingenuity Pathway Analysis was performed according to protocol using genes significantly regulated (FDR < 0.01) by siSNHG7 treatment compared to control. All reads are deposited in SRA with accession numbers: PRJNA514323, PRJNA515247, and PRJNA515028.
After treatment at the indicated times, cells were harvested, RNA was isolated, cDNA was generated, and qPCR were performed as described previously53. Relative RNA levels were calculated using the ΔΔCT method compared to RPL19 as the reference gene. All experiments were conducted in biological and technical triplicates. For subcellular localization, logarithmically growing cells were trypsinized, pellet was washed x2 in PBS, and cells were lysed in buffer RLN (50 mM Tris-HCl pH 8.0, 140 mM NaCl, 1.5 mM MgCl2, 0.5% NP40). After the cytoplasm was removed, the nuclear pellet was washed x2 in Buffer RLN before addition of buffer RLT (Qiagen). RNA from both fractions were isolated using Qiazol following manufactures’ protocol.
All cells were reverse transfected using 50-100 nM final concentration of either individual or 2-4 pooled oligos from Dharmacon (see Supplementary methods for sequences) using RNAi Max at a final concentration of 3ul/ml. All assays were performed ~72 hrs after siRNA treatment unless otherwise indicated.
Cells treated as described were seeded in 96-well dishes with at 5-6 biological replicates. At the indicated times following treatment, plates were harvested and proliferation was scored with the FluoReporter (ThermoFisher) assay by quantitation of dsDNA according to manufacturers’ instructions on the Victor X4 (PerkinElmer). Proliferation was also scored via counting cells with a hemocytometer (Fig. S4A) using Trypan Blue exclusion in triplicate plated MCF7 cells in 6-well dishes.
Cell cycle assay
MCF7 cells were reverse transfected with siSNHG7, nontargeting control, or nothing in biological triplicates. After 3 days, the cells were collected, fixed in 70% ethanol for 1 hr, stained with 100ug/mL propidium iodide for 1 hr, and then analyzed by flow cytometry. The percentage of cells in each phase of the cell cycle was calculated according to protocol.
Gualberto, A. & Pollak, M. Emerging role of insulin-like growth factor receptor inhibitors in oncology: early clinical trial results and future directions. Oncogene. 28(34), 3009–21 (2009).
Pollak, M. Insulin and insulin-like growth factor signalling in neoplasia. Nat Rev Cancer 8(12), 915–28 (2008).
Boone, D. N. & Lee, A. V. Targeting the insulin-like growth factor receptor: developing biomarkers from gene expression profiling. Crit Rev Oncog 17(2), 161–73 (2012).
Pollak, M. The insulin and insulin-like growth factor receptor family in neoplasia: an update. Nat Rev Cancer 12(3), 159–69 (2012).
Rajski, M. et al. IGF-I induced genes in stromal fibroblasts predict the clinical outcome of breast and lung cancer patients. BMC Med. 8, 1 (2010).
Creighton, C. J. et al. Insulin-like growth factor-I activates gene transcription programs strongly associated with poor breast cancer prognosis. J Clin Oncol 26(25), 4078–85 (2008).
Pollak, M. Insulin and insulin-like growth factor signalling in neoplasia. Nat Rev Cancer 8(12), 915–28 (2008).
Guttman, M. et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 458(7235), 223–7 (2009).
Iyer, M. K. et al. The landscape of long noncoding RNAs in the human transcriptome. Nat Genet. 47(3), 199–208 (2015).
Djebali, S. et al. Landscape of transcription in human cells. Nature. 489(7414), 101–8 (2012).
Schmitt, A. M. & Chang, H. Y. Long Noncoding RNAs in Cancer Pathways. Cancer Cell. 29, 452–63. (2016).
Gupta, R. A. et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 464(7291), 1071–6 (2010).
Yuan, J. et al. A long noncoding RNA activated by TGF-β promotes the invasion-metastasis cascade in hepatocellular carcinoma. Cancer Cell. 25(5), 666–81 (2014).
Rinn, J. L. & Chang, H. Y. Genome regulation by long noncoding RNAs. Annu Rev Biochem 81, 145–66 (2012).
Guttman, M. & Rinn, J. L. Modular regulatory principles of large non-coding RNAs. Nature. 482(7385), 339–46 (2012).
Guttman, M., Russell, P., Ingolia, N. T., Weissman, J. S. & Lander, E. S. Ribosome profiling provides evidence that large noncoding RNAs do not encode proteins. Cell. 154(1), 240–51 (2013).
Ulitsky, I. and DB. lincRNAs: Genomics, Evolution, and Mechanisms. Cell. 154(1), 26–46 (2013).
Lee, J. T., Davidow, L. S. & Warshawsky, D. Tsix, a gene antisense to Xist at the X-inactivation centre. Nat Genet. 21(4), 400–4 (1999).
Engreitz, J. M. et al. The Xist lncRNA exploits three-dimensional genome architecture to spread across the X chromosome. Science. 341(6147), 1237973 (2013).
Brown, C. J. et al. The human XIST gene: Analysis of a 17 kb inactive X-specific RNA that contains conserved repeats and is highly localized within the nucleus. Cell. 71(3), 527–42 (1992).
Marisa, S. BARTOLOMEI SZ& SMT. Parental imprinting of the mouse H19 gene. Nature. 351, 153–5 (1991).
Sur, I. & Taipale, J. Parental imprinting of the mouse H19 gene. Nat Rev Cancer 351(6322), 153–5 (2016).
Wang, K. C. & Chang, H. Y. Molecular mechanisms of long noncoding RNAs. Mol Cell. 43(6), 904–14 (2012).
Mourtada-Maarabouni, M., Pickard, M., Hedge, V., Farzaneh, F. & Williams, G. GAS5, a non-protein-coding RNA, controls apoptosis and is downregulated in breast cancer. Oncogene. 28373, 195–208 (2009).
Sun, M., Gadad, S. S., Kim, D. S. & Kraus, W. L. Discovery, Annotation, and Functional Analysis of Long Noncoding RNAs Controlling Cell-Cycle Gene Expression and Proliferation in Breast Cancer Cells. Mol Cell. 59(4), 698–711 (2015).
Hansji, H., Leung, E. Y., Baguley, B. C., Finlay, G. J. & Askarian-Amiri, M. E. Keeping abreast with long non-coding RNAs in mammary gland development and breast cancer. Front Genet. 5(OCT), 1–15 (2014).
Ji, P. et al. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene. 22(39), 8031–41 (2003).
Askarian-Amiri, M. E. et al. SNORD-host RNA Zfas1 is a regulator of mammary development and a potential marker for breast cancer. RNA. 17(5), 878–91 (2011).
Prensner, J. R. & Chinnaiyan, A. M. The emergence of lncRNAs in cancer biology. Cancer Discovery 1, 391–407 (2011).
Ellis, B. C., Graham, L. D. & Molloy, P. L. CRNDE, a long non-coding RNA responsive to insulin/IGF signaling, regulates genes involved in central metabolism. Biochim Biophys Acta - Mol Cell Res. 1843(2), 372–86 (2014).
Graham, L. D. et al. Colorectal Neoplasia Differentially Expressed (CRNDE), a Novel Gene with Elevated Expression in Colorectal Adenomas and Adenocarcinomas. Genes and Cancer 2(8), 829–40 (2011).
Ellis, B. C., Molloy, P. L. & Graham, L. D. CRNDE: A long non-coding RNA involved in CanceR Neurobiology, and DEvelopment. Frontiers in Genetics 3, 1–15 (2012).
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 7(3), 562–78 (2012).
Harrow, J. et al. GENCODE: The reference human genome annotation for the ENCODE project. Genome Res. 22(9), 1760–74 (2012).
Cerami, E. et al. The cBio Cancer Genomics Portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discov 2(5), 401–4 (2012).
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 6(269), 1–19 (2013).
GTEx Consortium. The Genotype-Tissue Expression (GTEx) Project. Nat Genet. 45(6), 580–5 (2013).
Lin, M. F., Jungreis I. & Kellis, M. PhyloCSF: A comparative genomics method to distinguish protein coding and non-coding regions. Bioinformatics. 27(13) (2011).
Kent, W. J. et al. The human genome browser at USCS. Genome Res. 12(6), 996–1006 (2002).
Kerpedjiev, P., Hammer, S. & Hofacker, I. L. Forna (force-directed RNA): Simple and effective online RNA secondary structure diagrams. Bioinformatics. 31(20), 3377–9 (2015).
Daemen, A. et al. Modeling precision treatment of breast cancer. Genome Biol. 14(10), R110 (2013).
Jefferies, H. B. J., Fumagalli, S., Dennis, P. B., Reinhard, C., Pearson, R. B. & Thomas, G. Rapamycin suppresses 5′TOP mRNA translation through inhibition of p70(s6k). EMBO J 16(12), 3693–704 (1997).
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS. 102(43), 15545–50 (2005).
Liu J et al. An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell. (2018).
Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature. 486(7403), 346–52 (2012).
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-Seq data. BMC Bioinformatics., 14 (2013).
Williams, G. T., Mourtada-Maarabouni, M. & Farzaneh, F. A critical role for non-coding RNA GAS5 in growth arrest and rapamycin inhibition in human T-lymphocytes. Biochem Soc Trans 39(2), 482–6 (2011).
Van Grembergen, O. et al. Defrance M FF. RNAs, Portraying breast cancers with long noncoding. Sci Adv. 2, 9 (2016).
Yan, X. et al. Comprehensive Genomic Characterization of Long Non-coding RNAs across Human Cancers. Cancer Cell. 28(4), 529–40. (2015).
Su, X. et al. Comprehensive analysis of long non-coding RNAs in human breast cancer clinical subtypes. Oncotarget. 5(20), 9864–76 (2014).
Warburton, A. J. & Boone, D. N. Insights from Global Analyses of Long Noncoding RNAs in Breast Cancer. Current Pathobiology Reports 5(1), 23–34 (2017)
Cabili, M. N. et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 25(18), 1915–1927 (2011).
Boone, D. N., Qi, Y., Li, Z. & Hann, S. R. Egr1 mediates p53-independent c-Myc-induced apoptosis via a noncanonical ARF-dependent transcriptional mechanism. Proc Natl Acad Sci U S A 108(2), 632–7 (2011).
Koboldt, D. C. et al. Comprehensive molecular portraits of human breast tumours. Nature. 490(7418), 61–70 (2012).
We thank the Breast Cancer Research Foundation (AVL), National Cancer Institute of the National Institutes of Health award number R01CA94118 (AVL) and P30CA047904 (AVL), and a Susan G Komen for the Cure postdoctoral fellowship, award number PDF12229789 (DB). AVL is a recipient of Scientific Advisory Council award from Susan G. Komen for the Cure, and a Hillman Foundation Fellow. We would also like to thank William Horn and the Genomics Research Core at the University of Pittsburgh as well as McGill University for performing the RNA sequencing and Nolan Priedigkeit and Ryan Hartmaier for carefully editing the manuscript.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Boone, D.N., Warburton, A., Som, S. et al. SNHG7 is a lncRNA oncogene controlled by Insulin-like Growth Factor signaling through a negative feedback loop to tightly regulate proliferation. Sci Rep 10, 8583 (2020). https://doi.org/10.1038/s41598-020-65109-7
Biochimica et Biophysica Acta (BBA) - Reviews on Cancer (2021)
Small nucleolar RNA host genes promoting epithelial–mesenchymal transition lead cancer progression and metastasis
IUBMB Life (2021)
Biomedicine & Pharmacotherapy (2021)