The systematic analysis of coding and long non-coding RNAs in the sub-chronic and chronic stages of spinal cord injury

Duran, Raquel Cuevas-Diaz; Yan, Han; Zheng, Yiyan; Huang, Xingfan; Grill, Raymond; Kim, Dong H.; Cao, Qilin; Wu, Jia Qian

doi:10.1038/srep41008

Download PDF

Article
Open access
Published: 20 January 2017

The systematic analysis of coding and long non-coding RNAs in the sub-chronic and chronic stages of spinal cord injury

Raquel Cuevas-Diaz Duran^1,2^na1,
Han Yan^1,2^na1,
Yiyan Zheng^1,2,
Xingfan Huang³,
Raymond Grill⁴^nAff5,
Dong H. Kim¹,
Qilin Cao^1,2 &
…
Jia Qian Wu^1,2

Scientific Reports volume 7, Article number: 41008 (2017) Cite this article

3209 Accesses
46 Citations
Metrics details

Subjects

Abstract

Spinal cord injury (SCI) remains one of the most debilitating neurological disorders and the majority of SCI patients are in the chronic phase. Previous studies of SCI have usually focused on few genes and pathways at a time. In particular, the biological roles of long non-coding RNAs (lncRNAs) have never been characterized in SCI. Our study is the first to comprehensively investigate alterations in the expression of both coding and long non-coding genes in the sub-chronic and chronic stages of SCI using RNA-Sequencing. Through pathway analysis and network construction, the functions of differentially expressed genes were analyzed systematically. Furthermore, we predicted the potential regulatory function of non-coding transcripts, revealed enriched motifs of transcription factors in the upstream regulatory regions of differentially expressed lncRNAs, and identified differentially expressed lncRNAs homologous to human genomic regions which contain single-nucleotide polymorphisms associated with diseases. Overall, these results revealed critical pathways and networks that exhibit sustained alterations at the sub-chronic and chronic stages of SCI, highlighting the temporal regulation of pathological processes including astrogliosis. This study also provided an unprecedented resource and a new catalogue of lncRNAs potentially involved in the regulation and progression of SCI.

Identification of key autophagy-related genes and pathways in spinal cord injury

Article Open access 19 March 2024

Zhen Shang, Weipeng Shi, … Tengbo Yu

Gene co-expression network analysis in human spinal cord highlights mechanisms underlying amyotrophic lateral sclerosis susceptibility

Article Open access 11 March 2021

Jerry C. Wang, Gokul Ramaswami & Daniel H. Geschwind

MicroRNA-182 improves spinal cord injury in mice by modulating apoptosis and the inflammatory response via IKKβ/NF-κB

Article Open access 31 May 2021

Min Fei, Zheng Li, … Zixian Chen

Introduction

Spinal cord injury (SCI) is one of the most debilitating neurological diseases. In the United States, SCI affects more than 300,000 people, and approximately 11,000 new SCI cases occur every year¹. The majority of SCI patients are in the chronic phase of SCI because of the lack of cure². Despite the efforts devoted to treatment and patient care, there are still no effective therapeutic solutions for SCI. Understanding the underlying cellular and molecular mechanisms, and particularly the regulation of SCI pathophysiological events in a systemic manner, is critical for developing promising treatment strategies.

So far, few studies have attempted to understand the altered expression of genes related to SCI at a global level and most of these studies have used microarrays^3,4. Compared to microarrays, RNA-Sequencing (RNA-Seq) possesses several advantages, such as a larger dynamic range of detection, higher sensitivity and specificity, and an enhanced ability to interrogate any location in the genome⁵. Previous work in our lab has demonstrated the power of RNA-Seq technology in characterizing the transcriptomic alterations in mouse contusive SCI models via integrated transcriptomic and network analyses, which revealed new pathways and candidate molecular targets for acute and sub-acute SCI⁶. In this study, we investigated the molecular mechanisms of the sub-chronic and chronic SCI in rat models by examining the changes in expression of both protein-coding and long non-coding genes at 1 month (1 M), 3 months (3 M), and 6 months (6 M) after injury, respectively. Our results demonstrated that a high level of transcriptional disturbance persists during the sub-chronic and chronic injury phases, with many genes enriched in pathways such as immune and inflammatory responses, as well as gliosis.

Genome-wide analyses have indicated that non-coding RNAs comprise a major part of the genome and revealed another essential dimension of gene regulation⁷. A large number of lncRNAs (over 200 nt in length) have been discovered in recent years and been shown to play critical roles in various biological processes including central nervous system development^8,9 and diseases^10,11. Rat lncRNAs have never been studied in SCI. In the current study, we thoroughly annotated the characteristics of lncRNAs in the rat genome, and predicted the potential regulatory function of these non-coding transcripts by correlating the differential expression patterns of lncRNAs with those of protein-coding genes. Further, we searched for transcription factor (TF) motifs enriched in the upstream regulatory regions of these differentially expressed (DE) lncRNAs, and identified DE lncRNAs that are homologous to human genomic regions which contain single-nucleotide polymorphisms (SNPs) associated with neurological diseases.

In summary, this is the first comprehensive study using RNA-Seq to analyze the transcriptomic alterations of both coding and long non-coding genes in the sub-chronic and chronic stages of SCI. It provided a new perspective for the SCI field and a catalogue of lncRNAs potentially involved in the regulation and progression of this disorder.

Results

Overview of the rat transcriptome

We mapped the sequenced reads to the Ensembl rat reference genome and transcriptome (Rnor6). Due to a lack of a comprehensive rat lncRNAs database, we combined the known and predicted lncRNA annotations from Ensembl and NCBI. On average, samples had 33.6 million reads with a standard deviation of 7.7 million reads (see Supplementary Table S1 for a detailed description of the number of reads, mapping rate, and processing batch for all samples). Approximately 91.1% of all fragments were mapped to the rat reference genome. Pairwise Pearson correlation coefficients among all samples were calculated based on the log2 transformed and quantile normalized FPKM (Fragments Per Kilobase of transcript per Million mapped reads) value of 30,443 genes. The average correlation coefficient for pairwise comparisons of replicates at each time point was 0.984 with a standard error of 0.005, indicating a high level of consistency among biological replicates. The Pearson correlation matrix for all samples is included in Supplementary Table S1. Based on Euclidean distance of transformed FPKM values, a consensus dendrogram was constructed (Fig. 1A). The most discriminant tree branches represent control and SCI samples; Within the SCI samples, 3 M and 6 M had the highest similarity. Using the Washington University Epigenome Browser¹², we set up an interactive, searchable database to display gene expression levels of SCI samples at all time points and made it freely available to the research community. Protein-coding genes and lncRNAs are exhibited as separate tracks (http://jiaqianwulab.org/ratSCI/ratSCI.html).

**Figure 1: Gene expression profiles at 1 M, 3 M, and 6 M post-SCI compared to sham controls.**

Gene expression levels (FPKM) and raw count data matrices were generated for downstream analyses (Supplementary Table S2). A gene was considered expressed if the FPKM > 1 in at least one of the samples. This filtering process yielded 13,847 expressed protein-coding genes and 1,035 expressed lncRNA genes.

Transcriptomic dynamics in the sub-chronic and chronic phases of SCI

Analysis of differential gene expression was performed by comparing the expression of all genes at each time point, both protein-coding and long non-coding, to that of the control group. The numbers of DE genes (DEGs) in each category are summarized in Table 1. A list of DE protein-coding (4,633) and lncRNA (277) genes from all time points was compiled and used for further analysis. A complete list of DEGs at each time point and those that either overlap among all three time points or are unique to one time point is provided in Supplementary Table S3. Expression profiles of genes that code for selected specific cell markers^13,14,15 are available in Supplementary Table S4.

Table 1 Summary of differentially expressed genes (DEGs) in rat SCI.

Full size table

Enriched canonical pathways at each time point generated by using Ingenuity Pathway Analysis (IPA), including gene members for each pathway, are available in Supplementary Table S5. As shown in the Venn diagram (Fig. 1B), 2,055 DEGs overlapped among all three time points. The most enriched pathways throughout the sub-chronic and chronic stages include fibrosis, immune responses, and inflammatory responses. Notably, cell cycle-related pathways are highly enriched in 401 genes that were differentially expressed only at 1 M.

Examining genes with the highest fold-change at each time point can provide insights into the major pathological events at different stages of SCI. Some of the top 10 DEGs (Supplementary Table S6) are well annotated in SCI or other neurological disorders, while others, including the lncRNAs, are less studied and require further investigation. Matrix metalloproteinase-12 (Mmp12), an extracellular matrix enzyme involved in blood-brain barrier disruption and CNS repair¹⁶, was significantly upregulated from 1 M through 6 M. Expression of Scd1, which encodes an enzyme controlled by sterol regulatory element binding proteins (SREBPs), was significantly decreased at all three time points examined. Interestingly, other genes that are regulated by SREBP in a similar manner and involved in myelination¹⁷, including Hmgcr, Hmgcs1, and Fdps, together with genes coding for SREBPs, were also significantly downregulated at multiple time points. Therefore we selected a subset of the top 10 DEGs (Gpnmb, Lilrb4, and Scd1) and aforementioned Scd1-associated genes for qPCR validation of differential expression. The fold-change in expression indicated by qPCR is very similar to the FPKM fold-change determined by RNA-Seq (Fig. 2).

**Figure 2: Verification of differential expression of selected genes by qPCR.**

Identification of enriched gene sets by clustering analysis

Hierarchical clustering was performed to obtain an overview of the temporal expression dynamics of DEGs with fold-change (FC) > 2 and FDR < 0.01. The analysis resulted in five clusters among 4,910 genes (4,633 protein-coding and 277 lncRNA) (Fig. 1C; Supplementary Table S7). Cluster 1 consists of 790 protein-coding genes and 79 lncRNAs whose expression displayed gradual downregulation from control to sub-chronic and chronic stages. Enriched gene sets in this cluster are related to the neuronal system (synapse, synaptic transmission, and axon), myelin sheath, and cholesterol biosynthesis, and are likely downregulated due to neural cell death and demyelination that occur upon SCI. Cluster 2 consists of 540 protein-coding genes and 30 lncRNAs that exhibited significant upregulation at 3 M and reached their highest degree of upregulation at 6 M, and is enriched mostly in genes involved in cellular motility. The expression of Cluster 3, which includes 880 protein-coding genes and 46 lncRNAs, displayed gradual upregulation over time. Genes in this cluster are enriched in cell-to-cell interaction pathways including extracellular matrix, focal adhesion, and wound healing. The expression of Cluster 4, consisting of 1,420 protein-coding genes and 60 lncRNAs, reached its highest level of upregulation at 1 M, and then gradually returned to the baseline levels. Genes in cluster 4 are highly enriched for epigenetic modification mechanisms such as histone methylation, chromatin modification, methyltransferase activity, and RNA processing. Cluster 5 comprises 1,003 protein-coding genes and 62 lncRNAs that were robustly upregulated at 1 M and remained moderately upregulated at 3 M and 6 M. Cluster 5 is mainly enriched for genes involved in immune and inflammatory responses.

Characteristics of lncRNAs in the rat genome

Distribution of lncRNA loci

We categorized rat lncRNAs according to the location of their genes with respect to the most proximal protein-coding genes, based on a previous classification¹⁸ with modifications (Fig. 3). The lncRNA genes were first defined as ‘genic’ if they intersect a protein-coding gene, or ‘intergenic’ if they do not. Intergenic lncRNAs were further classified as ‘convergent’ (IC) if transcribed from the same strand or divergent (ID) if transcribed from the opposite strand. Genic lncRNAs were further subdivided into ‘genic exonic’, ‘genic intronic’, or ‘overlapping’, depending on whether they overlap with the exons or introns of a protein-coding gene. Exonic lncRNA genes, which overlap with at least one exon of a protein-coding gene, were categorized as GES if they lie on the same strand as the coding gene or as GEAS if they lie on the opposite strand. Intronic lncRNA genes, which overlap with the intronic region of a protein-coding gene, were further categorized as GIS if they lie on the same strand as the intron of a coding gene, or otherwise called GIAS. If a protein-coding gene is located completely within the intronic region of a lncRNA gene, this lncRNA is categorized as ‘overlapping’, and further classified as GOS if transcribed from the same strand as the coding gene, or GOAS if transcribed from the opposite strand.

**Figure 3: Classification of rat lncRNAs.**

Based on the combined lncRNA annotations, the majority of rat lncRNAs (8,875 out of 10,889 or 81.5%) were intergenic (37% IC and 44% ID). The remaining 2,014 lncRNA genes were genic lncRNAs [6.4% exonic (GES 1.9%, GEAS 4.5%), 7.3% intronic (GIS 1.8%, GIAS 5.5%), and 4.8% overlapping (GOS 1.5%, GOAS 3.3%), Fig. 3]. A complete list of annotated lncRNAs, including genomic locations and classifications is available in Supplementary Table S8.

ID lncRNAs are located closer to protein-coding genes than are IC lncRNAs

Distribution of the distances from intergenic lncRNAs to their nearest protein-coding gene neighbors showed that ID lncRNAs are closer to their neighbors than are IC lncRNAs. Approximately 21.6% (1,039 out of 4,816) of ID transcripts and 10.4% (424 out of 4,059) of IC transcripts lie within 5 kb of a protein-coding gene (median distance 641 bp compared with 2,235 bp, respectively, t-test p < 1.41⁻²⁹, Fig. 4A).

**Figure 4: Characteristics of lncRNAs in the rat genome.**

lncRNAs exhibit specific exonic structure and transcript size compared to protein-coding genes

In our combined annotation there is a compelling tendency of lncRNA transcripts (40%) to have only two exons, whereas for protein-coding transcripts only 7.5% have two exons (Fig. 4B). These results are concordant with previous observations on human and rat lncRNAs^18,19. The transcript size also differs between lncRNAs and protein-coding genes (Fig. 4C). Overall, lncRNA transcripts are shorter than protein-coding genes (median transcript size 1,065 bp compared with 1,810 bp; t-test, p < 1.23⁻¹⁴⁵).

Differential expression of lncRNAs after SCI

After finding the DE protein-coding and lncRNA genes with DESeq, we compared their expression patterns at different time points using Kolmogorov-Smirnoff test. As illustrated in Fig. 4D, both expressed and DE lncRNAs generally had lower FPKM values compared to expressed and DE protein-coding genes across all time points (p < 2.2⁻¹⁶). Meanwhile, when comparing the FPKM distributions of all the expressed lncRNAs or DE lncRNAs of SCI samples to those of control samples, the expression levels are significantly higher in SCI samples (1 M, 3 M, and 6 M compared with control for all of the expressed lncRNAs: p < 0.0015, p < 4.88⁻⁸, and p < 1.8⁻⁶; and for DE lncRNAs: p < 3.94⁻⁵, 2.89⁻¹⁰, and 5.62⁻⁸, respectively). Similarly, the FPKM distributions of protein-coding genes are also significantly higher in SCI samples than control samples (for all of the expressed protein-coding genes, 1 M, 3 M, and 6 M compared with control: p < 8.17⁻¹⁹, p < 1.1⁻⁴², and p < 7.07⁻²⁶; and for DE protein-coding genes: p < 3.47⁻⁴⁷, p < 1.34⁻⁹², and p < 3.14⁻⁶², respectively). As observed in Fig. 4E, most of the 277 DE lncRNAs were classified as ID, IC, or GEAS. Same is true for the 1,035 expressed lncRNAs.

Inferring potential functions of DE lncRNAs in rat SCI

By correlating the expression profiles of DE lncRNAs with those of protein-coding genes, a ranked list of co-expressed protein-coding genes for each lncRNA was generated. Significantly enriched gene sets were identified using a Gene Set Enrichment Analysis (GSEA) with a false discovery rate (FDR) < 0.25. Figure 5A shows a list of selected DE lncRNAs with high normalized enrichment score (NES) based on the correlation of their expression with that of significantly enriched gene sets. The association matrix of significant gene sets and DE lncRNAs can be found in Supplementary Table S9. Enriched gene sets were classified into five categories including signaling pathways (S), immune response (IR), epigenetic modification (EM), nervous system (N), and extracellular matrix (ECM). A few representative DE lncRNAs in this list were selected for further validation by qPCR to verify their differential expression at multiple time points in the RNA-Seq data (Fig. 2).

The correlation between the expression of DE lncRNAs and that of their closest protein-coding genes was also investigated. All 277 DE lncRNAs were further filtered using two criteria: 1) the nearest protein-coding gene neighbor is also a DEG; and 2) the correlation of expression of the lncRNA and its neighboring protein-coding gene was significant (Pearson correlation, p < 0.05). This filtering yielded 77 lncRNAs including 16 GEAS, 2 GES, 3 GIAS, 2 GIS, 6 GOAS, 1 GOS, 22 IC, and 25 ID. Among these 77 lncRNAs, 51 (66%) are located within 5 kb of their significantly co-expressed protein-coding neighbor, 38 of which are transcribed from the antisense strand and 13 from the same strand. The expression of most of these neighboring DE lncRNA and protein-coding gene pairs was positively correlated (50 out of 51, 98%). Figure 5B shows representative lncRNAs and their significantly co-expressed protein-coding neighboring genes. A matrix of DE lncRNAs including lncRNA class, their closest protein-coding neighboring gene, the distance between them, and the correlation of their expression is available in Supplementary Table S10. Two example DE lncRNAs with expression significantly correlated to that of their DE protein-coding gene neighbors are depicted in further details (Fig. 5C). ENSRNOG00000051791 is an IC lncRNA that is transcribed from the same strand as Cdh11, which codes for a cadherin membrane protein that mediates calcium-dependent cell-cell adhesion²⁰. As shown in the genomic browser view, ENSRNOG00000051791 and Cdh11 are only 76 bp apart and display similar temporal changes in expression. LOC102547088, another DE lncRNA, is categorized as GEAS and overlaps with one exon of Tchp, a protein-coding gene with the ability to inhibit cell growth or act as a pro-apoptotic agent during cell stress²¹.

TF binding motifs enriched in the regulatory regions of DE lncRNAs

The regulatory regions of DE lncRNAs (5 kb upstream and 1 kb downstream regions) were searched for TF binding motifs using FIMO (FDR < 0.05)²² with position weight matrices (PWM) obtained from ENCODE²³, and a total of 200 motifs for 117 TFs were found. This list was further filtered to obtain TF motifs found in more than 100 lncRNA regulatory regions (Fig. 6A). Interestingly, 53% (9 out of 17) of these TFs are in our list of DEGs in SCI, including STAT3, RREB1, and SPI1. An association matrix depicting the presence of TF motifs in the regulatory regions of DE lncRNAs can be found in Supplementary Table S11.

**Figure 6: TF motifs found in the upstream regulatory regions of DE lncRNAs.**

To predict whether TFs with binding motif(s) in lncRNA regulatory regions are likely to regulate the lncRNAs, the correlations between the expression of the 117 TFs and that of the 277 DE lncRNAs were calculated. About 38.7% (12,563 out of 32,409) of the possible pairs are significantly correlated in their expression (Pearson coefficient r > |0.75|, p-value < 0.01), among which 71% are positively correlated and 29% are negatively correlated (Fig. 6B). Refer to Supplementary Table S11 for correlation coefficients between expression of TFs and that of DE lncRNAs.

Rat lncRNAs homologous to disease-SNPs-harboring human genomic regions

An increasing number of studies are associating lncRNAs with human diseases²⁴. Recently, genome-wide association studies (GWAS) have compiled information on a large number of disease-associated SNPs²⁵. Notably, the majority of these SNPs were mapped to non-coding regions²⁶. We lifted over rat SCI-associated DE lncRNAs to human genomic regions (hg38), yielding 224 homologous regions. Next, the GWAS SNP annotations were combined with the ClinVar genomic variation database to search for GWAS/ClinVar SNPs entries annotated in the aforementioned regions. We identified 23 rat DE lncRNAs that are homologous to human genomic regions that harbor disease-associated SNPs. For example, SNPs related to psychosis and Alzheimer’s disease were found in the human genomic region homologous to LOC103693573. Similarly, LOC102555675 was homologous to regions harboring important SNPs involved in bipolar disorder in humans. The list of SNPs found in human homologs of rat DE lncRNAs is shown in Supplementary Table S12, including the rat DE lncRNAs, the genomic locations of their human homologs, the SNPs identified in GWAS/ClinVar, the trait or disease, the mapped human gene, and the reference PubMed ID.

Discussion

Following mechanical and physical trauma caused by a contusive impact on the spinal cord tissue, a series of pathophysiological events occur at molecular and cellular levels. It is essential to understand how these events are orchestrated at a systemic level. The data presented here revealed critical pathways and networks that exhibit sustained alterations at the sub-chronic and chronic stages of SCI such as fibrosis and inflammatory responses, as well as pointed to the temporal regulation of important molecular and cellular machinery that potentially contribute to inhibition of axonal regeneration such as astrogliosis and compromised remyelination. The results of our analysis agree with recent findings by weighted gene coexpression network analysis (WGCNA) on microarray data from mouse spinal cord crush injury²⁷ and a Bayesian network analysis (BNA) of RNA-Seq data from rat spinal cord transection injury²⁸. Furthermore, our thorough annotations of rat lncRNAs, extensive analyses of DE lncRNAs to predict their potential regulatory functions in SCI, and identification of human disease-associated variants homologous to rat SCI-associated lncRNAs provided valuable resources for future functional investigations.

A major barrier to the treatment of chronic SCI is the increased astrogliosis that inhibits axonal growth and regeneration²⁹. Our data demonstrated that genes related to gliosis are among the highly upregulated DE genes in the sub-chronic and chronic phases of SCI. Expression of CSPGs (Ncan, Smc3, and Ptprz1), Tenascin C (Tnc), Slit3³⁰, and reactive astrocyte marker Gfap increased at all time points, and expression of Ephrin-B2 (Efnb2) increased at 3 M. Additionally, among the canonical pathways generated from DEGs that overlap in 1 M, 3 M, and 6 M, ‘fibrosis’ ranked as the most enriched. Upregulation of genes that code for collagen IV, laminin, and nidogen in our data confirmed the excessive synthesis of ECM components as a scaffold of basement membrane. The expression of multiple genes in the critical fibrosis signaling pathways TGFβ, WNT, and YAP/TAZ³¹, including Tgfb1, Smad-2, 3, & 4, Wnt, Yap, and Ctnnb1, increased from 1 M through 6 M. Many genes involved in fibrosis are also associated with glial scar formation. For example, Egr1 (early growth response 1), a gene that was upregulated at three time points, encodes a TF that can regulate Ptprz1 expression³² and modulate pathological matrix remodeling by enhancing collagen accumulation³³.

Network analysis provided further insights in molecular signaling that regulates gliosis after the sub-chronic and chronic SCI. We analyzed DE gene set common to 1 M, 3 M, and 6 M using Ingenuity Pathway Analysis (IPA) to generate networks depicting direct physical interactions such as binding and phosphorylation as well as indirect relationships inferred from literature in the Ingenuity Knowledge Base. In one of the most enriched networks (Fig. 7A), molecules with the greatest number of interactions include TGFBR2, GFAP, STAT3, and EGR1, among others, many of which are involved in astrogliosis and fibrosis. Transforming growth factor β receptor 2 (TGFBR2), which showed a 7–9-fold increase in gene expression from 1 M to 6 M post-SCI, had the most connections in the network, suggesting its important roles in gliosis. This is consistent with previous studies which suggest its ligand, TGF-β, functions as an essential regulatory factor of astrocyte function³⁴ and gliosis by inducing synthesis of chondroitin sulfate proteoglycans³⁵.

**Figure 7: Enriched network and canonical pathway generated with DEGs that are important in chronic SCI.**

The expression of bone morphogenetic proteins (BMPs)³⁶ and their receptors, which are members of the TGF-β superfamily, have been previously reported to increase 2 days after SCI and lasted for 1 month³⁷. RNA-Seq data in the present study extends these findings to show that increased expression of BMP ligands (Bmp1, 2, 4, 5, 6, and 7) as well as BMP receptors (Bmpr1a, 1b, and 2) persists during chronic SCI at 3 M and 6 M post injury. BMP ligands bind to a complex of BMP receptors to initiate the transphosphorylation of receptor 1 which phosphorylates the receptor-activated Smads (R-Smads, Smad1, 5, and 9). The phosphorylated R-Smads form complexes with the co-Smad (Smad4) and move into the nucleus, where they combine with various transcriptional co-activators or co-repressors to regulate the transcription of target genes such as the inhibitors of differentiation (ID2, 3, and 4)^38,39. Consistent with the activation of BMP pathways, our study shows not only increased expression of BMPs and their receptors but also upregulation of R-Smads and co-Smad as well as their target genes such as ID2, 3, and 4. Interestingly, our study also shows upregulation of inhibitory Smads (I-Smads, Smad6/7) at 3 M. I-Smads are critical BMP responsive genes^40,41,42. Expression of I-Smads, which are induced by BMP canonical pathway could prevent the further activation of R-Smads and inhibit BMP signaling. Thus, R-Smad/co-Smad and I-Smad form a negative feedback loop for the fine regulation of BMP signaling (Fig. 7B). At 6 M after SCI, BMP canonical pathway genes remain upregulated while I-Smads become significantly downregulated. Overall, these data indicate that BMP signaling is significantly increased in chronic SCI. Previous studies have implied that the BMP pathway can increase the proliferation and differentiation of astrocytes at the expense of oligodendrogenesis during development⁴³ and after SCI^44,45. The persistent upregulation of BMP signaling might be involved in maintaining the glial scar after chronic SCI. Moreover, BMP signaling also plays an important role in regulating angiogenesis during embryonic and postnatal development⁴⁶. Therefore, our finding of sustained upregulation of the BMP pathway in chronic SCI requires further studies to achieve a greater understanding on the long-term effects of SCI on blood vessel integrity and replacement.

Another highly connected molecule in Fig. 7A, signal transducer STAT3, is connected indirectly to TGFBR2 and directly to GFAP. STAT3 has been proposed as a critical regulator of astrogliosis and axonal growth. Knockout of STAT3 in astrocytes resulted in decreased astrocyte reactivation and astroglial scar formation⁴⁷, whereas activation of STAT3 in neurons was reported to enhance the axonal regeneration, especially in the initial phase of regeneration after injuries⁴⁸. Interestingly, activation of STAT3 can upregulate the gene expression of Socs3, which can in turn inhibit STAT3 activity and impede axonal sprouting/regeneration⁴⁹, possibly serving as a negative feedback to STAT3 activation to fine-tune its regulation in astrogliosis and axon regrowth after injury. In fact, both Stat3 and Socs3 were upregulated from 1 M through 3 M and Socs3 stayed upregulated at 6 M. It is important to note that the transcriptomic alterations of the DEGs reported in our study represent changes in a mixture of cell types in the spinal cord tissue. Therefore, future studies such as co-localization with cell specific markers and classic genetic manipulation will help us to understand the functions and the underlying mechanisms of STAT3 and SOCS3 in axonal regeneration and astrogliosis. Additionally, SOCS3 is connected with transcription factor ATF3 which has been found to be induced in sensory and motor neurons after SCI⁵⁰. ATF3 binds DNA sites as a homo- or hetero- dimer with JUN⁵⁰. Interestingly, our analysis showed that both STAT3 and JUN are differentially expressed, have high number of binding motifs in the regulatory regions of DE lncRNAs, and have significant expression correlation with a subset of DE lncRNAs (Fig. 6). We hypothesize that these transcription factors may be involved in the upstream regulatory functions of lncRNAs in SCI.

To better understand the alterations in gene expression after SCI and their functional significance, it is essential to comprehend the transcriptional regulation of the genes involved. Systematic characterization and functional prediction of lncRNAs will significantly facilitate our understanding of transcriptional regulation. We created a combined annotation of 10,889 lncRNA transcripts derived from ENSEMBL and NCBI repositories. The majority of lncRNA annotations are predictions based on the methylation state of chromatin regions and the evaluation of coding potential. Our analyses showed that even though the genomic structure of lncRNAs is similar to that of protein-coding genes, lncRNAs are predominantly two-exon transcripts with a shorter median transcript length. Based on the current annotation databases, approximately two times as many intergenic divergent lncRNAs are located within less than 5 kb of their protein-coding neighbors as are intergenic convergent lncRNAs.

Our temporal analyses of the SCI transcriptome identified 277 DE lncRNAs in the sub-chronic and chronic stages of SCI. For example, Miat is a DE lncRNA that has been studied previously in other biological contexts. Miat is related to neuron commitment, development, and survival⁵¹, as well as oligodendrocyte lineage specification of neural stem cells⁸. Our results showed a downregulation of Miat expression after SCI, which could be due to neuronal death. Further investigations are necessary to validate this hypothesis. We would like to point out that in the present study, polyadenylated RNAs were selected using the polyA-based RNA-Seq library kit; therefore, lncRNAs that were not polyadenylated were not captured.

Through guilt-by-association methods, we were able to infer that DE lncRNAs are associated with functions related to signaling cascades, epigenetic modification, immune responses, nervous system, and extracellular matrix, all of which are highly relevant to SCI. To unravel the potential functional relevance of the proximity of lncRNAs to nearby protein-coding genes, we identified 51 pairs of DE lncRNAs and protein-coding gene neighbors that either overlap or lie within 5 kb and have statistically significantly correlated expression (Pearson correlation, p-value < 0.01). This finding suggests a role for these lncRNAs in cis-regulation of nearby protein-coding genes⁵².

To understand the upstream regulatory mechanisms of DE lncRNAs, we investigated the TF binding motifs present in their regulatory regions. Our results demonstrated that binding motifs for TFs relevant to gliosis (STAT3) and immune response (JUN⁵³ and SP1⁵⁴) are highly represented in the regulatory regions of DE lncRNAs, and that changes in the expression of these TFs are highly correlated with a subset of DE lncRNAs.

In summary, a high level of transcriptomic disturbance persisted in the sub-chronic and chronic phases of rat SCI. Our systematic examination and analysis of post-SCI transcriptional alterations in rat has identified important pathways and networks for the pathological progression of SCI, and pinpointed novel target genes for further investigation, including a number of interesting lncRNA candidates with potentially important regulatory functions and human disease homologs.

Methods

All methods were carried out in accordance with relevant guidelines and regulations. All experimental protocols were approved by the Institutional Biosafety Committee at the University of Texas Health Science Center at Houston. Animal usage and manipulations were performed in accordance with the Public Health Service Policy on Humane Care and Use of Laboratory Animals, Guide for the Care and Use of Laboratory Animals, and with the approval of the Animal Welfare Committee at the University of Texas Health Science Center at Houston.

Rat SCI model

A total of 36 female Sprague-Dawley rats (12–14 weeks of age) were used. Tissues from three rats were pooled to form one biological replicate (n). Animals were randomly assigned to either the control group or to one of the three time points (9 rats [3 biological replicates]): 1 M, 3 M, and 6 M. SCI surgeries were performed as described previously⁵⁵. Briefly, a moderate (150 kdyn) contusive injury was induced to the 9^th thoracic vertebra (T9). Animals in the sham control group received a dorsal laminectomy without a contusive injury.

At 1 month, 3 months, and 6 months post-SCI, animals were intracardially perfused with 0.01 M PBS under anesthesia. A segment of the spinal cord tissue (0.5 mm) at the epicenter was dissected and snap-frozen. Animals in the control group were sacrificed at 6 months post-SCI and a section of the spinal cord tissue at an equivalent location was collected.

RNA isolation, library construction, and RNA sequencing

Total RNA was extracted from the spinal cord tissue using TRIzol reagent (Invitrogen) following the manufacturer’s instructions. RNA quality was determined using a Bioanalyzer (Agilent), and all RNA integrity numbers (RINs) were greater than 8. About 150–300 ng total RNA was used to construct each RNA-Seq library. RNA samples were poly-A selected and paired-end sequencing libraries were constructed using the TruSeq RNA Sample Prep Kit, as described in the TruSeq RNA Sample Preparation V2 Guide (Illumina), and sequenced using the Illumina HiSeq 2000 sequencer. Library preparation and sequencing were performed in different batches based on the SCI time points and the sequencing availability. Batch assignments are included in Supplementary Table S1. Hierarchical clustering was performed to identify any possible batch effects.

Read mapping and quantification

Before read mapping, the quality of raw sequenced reads was verified using FastQC⁵⁶. Quality control metrics are listed in Supplementary Table S1. Read mapping, transcript assembly, and expression estimation were performed as described in our previous publication¹⁴. Reads were mapped to the rat reference genome Rnor6 downloaded from Ensembl (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF_000001895.5_Rnor_6.0/). The 100-bp paired-end reads were aligned to the reference genome using TopHat v2.1.0⁵⁷ with default parameters. Mapped reads were assembled using Cufflinks v2.2.1⁵⁸, and FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values were obtained for genes and transcripts annotated. Our annotation included 39,183 transcripts spanning 30,443 genes (22,075 protein-coding and 8,368 lncRNA genes). Any FPKM < 0.1 was set to 0.1 to avoid ratio inflation⁵⁹. Read counts for annotated genes and transcripts were obtained using HTSeq-count⁶⁰.

lncRNA annotation and categorization

We surveyed lncRNA public databases with known and predicted annotations. All gene and transcript biotypes labelled as ‘lincRNA’ from the Ensembl rat genome annotation were collected. Additionally, lncRNA annotations from NCBI repository (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF_000001895.5_Rnor_6.0/) were downloaded. The intersect function from Bedtools suite⁶¹ was used to avoid redundancy. Annotations were filtered by removing lncRNAs smaller than 200 nt and those with overlapping exons. The resulting lncRNA set consisted of 10,889 transcripts (2,901 from Ensembl and 7,988 from NCBI) spanning 8,370 loci. We categorized these lncRNAs according to their positions with respect to their closest protein-coding genes, based on previously published classification¹⁸ with modifications.

Analysis of transcriptomic gene expression profiles and differential expression

The similarities between all samples were assessed using a consensus dendrogram generated from the Euclidean distances between log2-transformed quantile-normalized FPKM values of all samples. One outlier was identified in the 6 M group, so it was excluded from subsequent analyses. For more details on outlier identification, refer to “Sample correlation assessment” in Supplementary Methods.

Genes with FPKM 1 in at least one of the samples were included in the analysis of differential gene expression. Expression profiles of SCI samples at each time point were compared against those of the control group using the DESeq package⁶². Genes were classified as differentially expressed (DEGs) if they 1) had an FPKM > 1 in at least one sample, 2) exhibited an expression fold-change (FC) > 2 (mean normalized counts), and 3) their DESeq statistical test was significant with FDR < 0.01. Genes that met these three criteria were included in the downstream analyses.

Pathway analysis

DEGs at each time point were imported into Ingenuity Pathway Analysis (IPA, http://www.ingenuity.com/products/ipa) to generate enriched pathways and networks. DEGs at each time point were also cross-compared to create lists of genes commonly expressed between time points or uniquely expressed at certain time points, as illustrated using Venn diagrams⁶³ (Fig. 1B).

Clustering temporal gene expression profiles

Hierarchical cluster analysis was performed using Ward’s method based on a Euclidean distance matrix of normalized log2-transformed quantile normalized FPKM values of DEGs. Genes within each cluster were used to estimate the enrichment of gene sets through a hypergeometric statistical test (phyper R function), downloaded from gene ontologies pre-built for rat⁶⁴ and from the Molecular Signatures Database (MSigDB)⁶⁵. Gene sets with an FDR < 0.05 were considered enriched. The number of clusters was selected so that the profiles of enriched gene sets showed the least redundancy. A heatmap was generated to depict these DEGs clusters and their corresponding enriched gene sets.

Predicting potential functions of DE lncRNAs in SCI

We adopted a co-expression analysis of lncRNA genes and protein-coding genes to infer the potential functions of lncRNAs using a ‘guilt-by-association’ method⁶⁶. The relationship between the expression of DE lncRNAs and protein-coding genes was assessed using Pearson correlation. A ranked list of protein-coding genes for each DE lncRNA was obtained. Gene Set Enrichment Analysis (GSEA)⁶⁷ was used to identify significantly enriched gene sets corresponding to gene ontologies and canonical pathways from MSigDB⁶⁵ for each lncRNA. Gene sets with an FDR < 0.25 (as recommended in the GSEA manual) were used to create an association matrix. A heatmap of a subset of DE lncRNAs was obtained showing the normalized enrichment score (NES) for selected significantly enriched gene sets.

Correlation of expression of lncRNAs and protein-coding gene neighbors

To identify possible interactions between DE lncRNAs and their neighboring protein-coding genes, we first selected those pairs in which both the lncRNA and its protein-coding neighbor were differentially expressed and had a significant correlation (Pearson coefficient r > , p < 0.01). We further filtered the list leaving only pairs that either overlap or are less than 5 kb apart. The Integrative Genomics Viewer (IGV)⁶⁸ was used to browse the alignment files for DE lncRNAs and neighboring protein-coding genes.

Identification of TF binding using ENCODE motifs

To identify TFs potentially involved in the regulation of DE lncRNAs, motif analysis was performed. The motif data set at ENCODE²³ (http://compbio.mit.edu/encode-motifs) was downloaded. Motifs were used to scan the regulatory regions of DE lncRNAs using FIMO²² with an FDR < 0.05.

The regulatory regions of lncRNAs were defined as 5 kb upstream and 1 kb downstream from lncRNA transcription start sites. If any lncRNA was shorter than 1 kb, then its length was used as the cut-off value for the downstream region. Sequences were obtained using the BSgenome package⁶⁹. A Pearson correlation matrix was computed for gene expression levels between TFs and lncRNAs with TF binding sites.

Find rat lncRNAs homologous to disease-SNPs-harboring human genomic regions

To investigate the functional relevance of DE lncRNAs from our study to human diseases, we downloaded a collection of Genome-Wide Association Studies (GWAS)⁷⁰ and Clinically Relevant Sequence Variations (ClinVar)⁷¹ to obtain a database of single-nucleotide polymorphisms (SNPs) linked to human diseases and traits. We lifted-over the DE lncRNAs from rat (Rnor6) to human (hg38) genome using the UCSC liftOver Utility⁷². A table was constructed of DE lncRNAs in rat SCI, their homologous regions in human, and the diseases or traits linked to these regions (Supplementary Table S12).

Reverse transcription, quantitative PCR (qPCR) verification, and statistical analysis

The differential expression of selected protein-coding genes and lncRNAs was validated by qPCR. Complementary DNAs (cDNAs) were synthesized from the same RNA samples as those used for the RNA-Seq experiment using SuperScript II (Invitrogen #18064–014) and random primers following the manufacturer’s instructions. SYBR^® Green (Bio-Rad #172–5122) was used as detector in qPCR reactions. Primer sequences are available in Supplementary Table S13. GAPDH was used as an endogenous control. The 2^-ΔΔCt method was used for relative quantification of expression.

Student’s t-tests were performed to compare gene expression at each time point to that of control, and significance was established when p < 0.05.

Data availability.

(1) Raw RNA-Seq datasets have been deposited in Gene Expression Omnibus (GEO) database under accession number GSE93249.

(2) Sequence alignments and lncRNA annotations are publicly available to the research community for browsing (http://jiaqianwulab.org/ratSCI/ratSCI.html).

Additional Information

How to cite this article: Cuevas-Diaz Duran, R. et al. The systematic analysis of coding and long non-coding RNAs in the sub-chronic and chronic stages of spinal cord injury. Sci. Rep. 7, 41008; doi: 10.1038/srep41008 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Liverman, C. T., Altevogt, B. M., Joy, J. E. & Johnson, R. T. Spinal Cord Injury. doi: 10.17226/11253 (National Academies Press, 2005).
Fehlings, M. G. & Perrin, R. G. The timing of surgical intervention in the treatment of spinal cord injury: a systematic review of recent clinical evidence. Spine (Phila. Pa. 1976). 31, S28–35 discussion S36 (2006).
Google Scholar
Byrnes, K. R., Washington, P. M., Knoblach, S. M., Hoffman, E. & Faden, A. I. Delayed inflammatory mRNA and protein expression after spinal cord injury. J. Neuroinflammation 8, 130 (2011).
CAS PubMed PubMed Central Google Scholar
Nesic, O. et al. DNA microarray analysis of the contused spinal cord: effect of NMDA receptor inhibition. J. Neurosci. Res. 68, 406–423 (2002).
CAS PubMed Google Scholar
Dong, X., You, Y. & Wu, J. Q. Building an RNA Sequencing Transcriptome of the Central Nervous System. Neuroscientist, doi: 10.1177/1073858415610541 (2015).
Chen, K. et al. RNA-seq characterization of spinal cord injury transcriptome in acute/subacute phases: a resource for understanding the pathology at the systems level. PLoS One 8, e72567 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101–8 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Mercer, T. R. et al. Long noncoding RNAs in neuronal-glial fate specification and oligodendrocyte lineage maturation. BMC Neurosci. 11, 14 (2010).
PubMed PubMed Central Google Scholar
Dong, X. et al. Comprehensive Identification of Long Non-coding RNAs in Purified Cell Types from the Brain Reveals Functional LncRNA in OPC Fate Determination. PLoS Genet. 11 (2015).
PubMed PubMed Central Google Scholar
Faghihi, M. A. et al. Expression of a noncoding RNA is elevated in Alzheimer’s disease and drives rapid feed-forward regulation of β-secretase. Nat. Med. 14, 723–730 (2008).
CAS PubMed PubMed Central Google Scholar
Daughters, R. S. et al. RNA gain-of-function in spinocerebellar ataxia type 8. PLoS Genet. 5, e1000600 (2009).
PubMed PubMed Central Google Scholar
Zhou, X. et al. The Human Epigenome Browser at Washington University. Nat. Methods 8, 989–90 (2011).
CAS PubMed PubMed Central Google Scholar
Kozlenkov, A. et al. Differences in DNA methylation between human neuronal and glial cells are concentrated in enhancers and non-CpG sites. Nucleic Acids Res. 42, 109–27 (2014).
CAS PubMed Google Scholar
Zhang, Y. et al. An RNA-Sequencing Transcriptome and Splicing Database of Glia, Neurons, and Vascular Cells of the Cerebral Cortex. J. Neurosci. 34, 11929–47 (2014).
CAS PubMed PubMed Central Google Scholar
Darmanis, S. et al. A survey of human brain transcriptome diversity at the single cell level. Proc. Natl. Acad. Sci. 112, 7285–7290 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Wells, J. E. A. et al. An adverse role for matrix metalloproteinase 12 after spinal cord injury in mice. J. Neurosci. 23, 10107–10115 (2003).
CAS PubMed PubMed Central Google Scholar
Fernø, J., Skrede, S., Vik-Mo, A. O., Håvik, B. & Steen, V. M. Drug-induced activation of SREBP-controlled lipogenic gene expression in CNS-related cell lines: marked differences between various antipsychotic drugs. BMC Neurosci. 7, 69 (2006).
PubMed PubMed Central Google Scholar
Derrien, T. et al. The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789 (2012).
CAS PubMed PubMed Central Google Scholar
Wang, F. et al. Characteristics of Long Non-coding RNAs in the Brown Norway Rat and Alterations in the Dahl Salt-Sensitive Rat. (Supplemental Information). Sci. Rep. 4, 7146 (2014).
CAS PubMed PubMed Central Google Scholar
Maglott, D., Ostell, J., Pruitt, K. D. & Tatusova, T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 33, D54–8 (2005).
CAS PubMed Google Scholar
The UniProt Consortium. UniProt: a hub for protein information. Nucleic Acids Res. 43, D204–12 (2014).
Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: Scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018 (2011).
CAS PubMed PubMed Central Google Scholar
Kheradpour, P. & Kellis, M. Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments. Nucleic Acids Res. 42, 2976–87 (2014).
CAS PubMed Google Scholar
Wapinski, O. & Chang, H. Y. Long noncoding RNAs and human disease. Trends in Cell Biology 21, 354–361 (2011).
CAS PubMed Google Scholar
McCarthy, M. I. & Hirschhorn, J. N. Genome-wide association studies: Potential next steps on a genetic journey. Hum. Mol. Genet. 17 (2008).
Pennisi, E. Genomics 1000 Genomes Project gives new map of genetic diversity. Science 330, 574–575 (2010).
CAS PubMed Google Scholar
Duan, H. et al. Transcriptome analyses reveal molecular mechanisms underlying functional recovery after spinal cord injury. Proc. Natl. Acad. Sci. USA. 112, 13360–5 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Di Narzo, A. F. et al. Decrease of mRNA Editing after Spinal Cord Injury is Caused by Down-regulation of ADAR2 that is Triggered by Inflammatory Response. Sci. Rep. 5, 12615 (2015).
ADS CAS PubMed Google Scholar
Yiu, G. & He, Z. Glial inhibition of CNS axon regeneration. Nat. Rev. Neurosci. 7, 617–27 (2006).
CAS PubMed PubMed Central Google Scholar
Yuan, W. et al. The mouse SLIT family: secreted ligands for ROBO expressed in patterns that suggest a role in morphogenesis and axon guidance. Dev. Biol. 212, 290–306 (1999).
CAS PubMed Google Scholar
Piersma, B., Bank, R. A. & Boersema, M. Signaling in Fibrosis: TGF-β, WNT, and YAP/TAZ Converge. Front. Med. 2, 59 (2015).
Google Scholar
Beck, H., Semisch, M., Culmsee, C., Plesnila, N. & Hatzopoulos, A. K. Egr-1 regulates expression of the glial scar component phosphacan in astrocytes after experimental stroke. Am. J. Pathol. 173, 77–92 (2008).
CAS PubMed PubMed Central Google Scholar
Wu, M. et al. Essential roles for early growth response transcription factor Egr-1 in tissue fibrosis and wound healing. Am. J. Pathol. 175, 1041–55 (2009).
CAS PubMed PubMed Central Google Scholar
Schachtrup, C. et al. Nuclear pore complex remodeling by p75(NTR) cleavage controls TGF-β signaling and astrocyte functions. Nat. Neurosci. 18, 1077–80 (2015).
CAS PubMed PubMed Central Google Scholar
Jahan, N. & Hannila, S. S. Transforming growth factor β-induced expression of chondroitin sulfate proteoglycans is mediated through non-Smad signaling pathways. Exp. Neurol. 263, 372–84 (2015).
CAS PubMed Google Scholar
Ara, J. et al. Bone morphogenetic proteins 4, 6, and 7 are up-regulated in mouse spinal cord during experimental autoimmune encephalomyelitis. J. Neurosci. Res. 86, 125–35 (2008).
CAS PubMed Google Scholar
Chen, J., Leong, S.-Y. & Schachner, M. Differential expression of cell fate determinants in neurons and glial cells of adult mouse spinal cord after compression injury. Eur. J. Neurosci. 22, 1895–906 (2005).
PubMed Google Scholar
Katagiri, T. & Watabe, T. Bone Morphogenetic Proteins. Cold Spring Harb. Perspect. Biol. 8 (2016).
Miyazono, K., Kamiya, Y. & Morikawa, M. Bone morphogenetic protein receptors and signal transduction. J. Biochem. 147, 35–51 (2010).
CAS PubMed Google Scholar
Moustakas, A. et al. Smad regulation in TGF-beta signal transduction. Nat. Rev. Mol. Cell Biol. 267, 169–178 (2000).
Google Scholar
Itoh, S., Itoh, F., Goumans, M. J. & Ten Dijke, P. Signaling of transforming growth factor-beta family members through Smad proteins. Eur. J. Biochem. 267, 6954–67 (2000).
CAS PubMed Google Scholar
Massagué, J. How cells read TGF-beta signals. Nat. Rev. Mol. Cell Biol. 1, 169–78 (2000).
PubMed Google Scholar
Gomes, W. A., Mehler, M. F. & Kessler, J. A. Transgenic overexpression of BMP4 increases astroglial and decreases oligodendroglial lineage commitment. Dev. Biol. 255, 164–77 (2003).
CAS PubMed Google Scholar
Cheng, X. et al. Bone morphogenetic protein signaling and olig1/2 interact to regulate the differentiation and maturation of adult oligodendrocyte precursor cells. Stem Cells 25, 3204–14 (2007).
CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Astrocytes from the Contused Spinal Cord Inhibit Oligodendrocyte Differentiation of Adult Oligodendrocyte Precursor Cells by Increasing the Expression of Bone Morphogenetic Proteins. J. Neurosci. 31, 6053–6058 (2011).
CAS PubMed PubMed Central Google Scholar
Jin, Y., Kaluza, D. & Jakobsson, L. VEGF, Notch and TGFβ/BMPs in regulation of sprouting angiogenesis and vascular patterning. Biochem. Soc. Trans. 42, 1576–83 (2014).
CAS PubMed Google Scholar
Okada, S. et al. Conditional ablation of Stat3 or Socs3 discloses a dual role for reactive astrocytes after spinal cord injury. Nat. Med. 12, 829–34 (2006).
CAS PubMed Google Scholar
Bareyre, F. M. et al. In vivo imaging reveals a phase-specific role of STAT3 during central and peripheral nervous system axon regeneration. Proc. Natl. Acad. Sci. USA 108, 6282–7 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Park, K. W., Lin, C.-Y. & Lee, Y.-S. Expression of suppressor of cytokine signaling-3 (SOCS3) and its role in neuronal death after complete spinal cord injury. Exp. Neurol. 261, 65–75 (2014).
CAS PubMed Google Scholar
Tsujino, H. et al. Activating transcription factor 3 (ATF3) induction by axotomy in sensory and motoneurons: A novel neuronal marker of nerve injury. Mol. Cell. Neurosci. 15, 170–82 (2000).
CAS PubMed Google Scholar
Aprea, J. et al. Transcriptome sequencing during mouse brain development identifies long non-coding RNAs functionally involved in neurogenic commitment. EMBO J. 32, 3145–60 (2013).
CAS PubMed PubMed Central Google Scholar
Rinn, J. L. & Chang, H. Y. Genome regulation by long noncoding RNAs. Annu. Rev. Biochem. 81, 145–166 (2012).
CAS PubMed Google Scholar
Wang, A. et al. Transcription factor complex AP-1 mediates inflammation initiated by Chlamydia pneumoniae infection. Cell. Microbiol. 15, 779–94 (2013).
CAS PubMed Google Scholar
Chen, G., Fang, X. & Yu, M. Regulation of gene expression in rats with spinal cord injury based on microarray data. Mol. Med. Rep. 12, 2465–72 (2015).
CAS PubMed PubMed Central Google Scholar
Cao, Q. et al. Functional and electrophysiological changes after graded traumatic spinal cord injury in adult rat. Exp. Neurol. 191 (2005).
PubMed Google Scholar
FastQC: A quality control tool for high throughput sequence data. Available at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (Accessed: 1st January 2016).
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: Discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–78 (2012).
CAS PubMed PubMed Central Google Scholar
Quackenbush, J. Microarray data normalization and transformation. Nat. Genet. 32 Suppl, 496–501 (2002).
CAS PubMed Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq-A Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
CAS PubMed Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
CAS PubMed PubMed Central Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
CAS PubMed PubMed Central Google Scholar
Oliveros, J. V. E. N. N. Y. An interactive tool for comparing lists with Venn Diagrams. BioinfoGP of CNB-CSIC Available at: http://bioinfogp.cnnb.csic.es/tools/venny/index.html (Accessed: 2nd February 2016) (2007).
Powell, J. A. C. GO2MSIG, an automated GO based multi-species gene set generator for gene set enrichment analysis. BMC Bioinformatics 15, 146 (2014).
PubMed PubMed Central Google Scholar
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–40 (2011).
CAS PubMed PubMed Central Google Scholar
Guttman, M. et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458, 223–7 (2009).
ADS CAS PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102, 15545–15550 (2005).
ADS CAS PubMed PubMed Central Google Scholar
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
CAS PubMed PubMed Central Google Scholar
H., P. BSgenome: Infrastructure for Biostrings-based genome data packages. R Packag. version (2009).
Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42 (2014).
Landrum, M. J. et al. ClinVar: Public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42 (2014).
PubMed PubMed Central Google Scholar
Meyer, L. R. et al. The UCSC Genome Browser database: Extensions and updates 2013. Nucleic Acids Res. 41 (2013).
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

J.Q.W., H.Y., and R.C.D.D. were supported by grants from the National Institutes of Health R01 NS088353; The Staman Ogilvie Fund-Memorial Hermann Foundation; Mission Connect, a program of the TIRR Foundation; the UTHealth BRAIN Initiative and CTSA UL1 TR000371; and a grant from the University of Texas System Neuroscience and Neurotechnology Research Institute (Grant #362469). QLC and YZ were supported by NIH R01 NS061975; NSF-CBET 1134449; The Staman Ogilvie Fund-Memorial Hermann Foundation; Mission Connect, a program of the TIRR Foundation; and UTHealth Senator Lloyd & B. A. Bentsen Center for Stroke Research. The authors would like to thank Ms. Mary Ann Cushman for editing the manuscript.

Author information

Raymond Grill
Present address: Present address: Department of Neurobiology and Anatomical Sciences, University of Mississippi Medical Center, Jackson, MS 39216, USA.,
Raquel Cuevas-Diaz Duran and Han Yan: These authors contributed equally to this work.

Authors and Affiliations

The Vivian L. Smith Department of Neurosurgery, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, 77030, TX, USA
Raquel Cuevas-Diaz Duran, Han Yan, Yiyan Zheng, Dong H. Kim, Qilin Cao & Jia Qian Wu
Center for Stem Cell and Regenerative Medicine, UT Brown Foundation Institute of Molecular Medicine, Houston, 77030, TX, USA
Raquel Cuevas-Diaz Duran, Han Yan, Yiyan Zheng, Qilin Cao & Jia Qian Wu
Department of Bioengineering, Rice University, Houston, 77005, TX, USA
Xingfan Huang
Department of Integrative Biology and Pharmacology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, 77030, TX, USA
Raymond Grill

Authors

Raquel Cuevas-Diaz Duran
View author publications
You can also search for this author in PubMed Google Scholar
Han Yan
View author publications
You can also search for this author in PubMed Google Scholar
Yiyan Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Xingfan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Raymond Grill
View author publications
You can also search for this author in PubMed Google Scholar
Dong H. Kim
View author publications
You can also search for this author in PubMed Google Scholar
Qilin Cao
View author publications
You can also search for this author in PubMed Google Scholar
Jia Qian Wu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Q.W., Q.L.C., and D.H.K. conceived the project. H.Y. and Y.Z. performed the experiments. R.G. contributed reagents/materials. R.C.D.D. and X.H. carried out bioinformatics and statistical analysis. H.Y., R.C.D.D., Q.L.C., and J.Q.W. participated in writing the manuscript. All authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Qilin Cao or Jia Qian Wu.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (DOC 51 kb)

Supplementary Table S1 (XLS 64 kb)

Supplementary Table S2 (XLS 11454 kb)

Supplementary Table S3 (XLS 20525 kb)

Supplementary Table S4 (XLS 1123 kb)

Supplementary Table S5 (XLS 295 kb)

Supplementary Table S6 (XLS 24 kb)

Supplementary Table S7 (XLS 580 kb)

Supplementary Table S8 (XLS 1585 kb)

Supplementary Table S9 (XLS 21103 kb)

Supplementary Table S10 (XLS 120 kb)

Supplementary Table S11 (XLS 608 kb)

Supplementary Table S12 (XLS 50 kb)

Supplementary Table S13 (XLS 22 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Duran, RD., Yan, H., Zheng, Y. et al. The systematic analysis of coding and long non-coding RNAs in the sub-chronic and chronic stages of spinal cord injury. Sci Rep 7, 41008 (2017). https://doi.org/10.1038/srep41008

Download citation

Received: 01 July 2016
Accepted: 14 December 2016
Published: 20 January 2017
DOI: https://doi.org/10.1038/srep41008

This article is cited by

Identification of hub genes in the subacute spinal cord injury in rats
- Lei Yan
- Jiawei Fu
- Zhiming Cui
BMC Neuroscience (2022)
Coding and long non-coding gene expression changes in the CNS traumatic injuries
- Xizi Wu
- Haichao Wei
- Jia Qian Wu
Cellular and Molecular Life Sciences (2022)
LncRNA MIAT Promotes Spinal Cord Injury Recovery in Rats by Regulating RBFOX2-Mediated Alternative Splicing of MCL-1
- Xin He
- Jianan Zhang
- Dingjun Hao
Molecular Neurobiology (2022)
Transcriptome profiling of long noncoding RNAs and mRNAs in spinal cord of a rat model of paclitaxel-induced peripheral neuropathy identifies potential mechanisms mediating neuroinflammation and pain
- Yuanyuan Li
- Chengyu Yin
- Boyi Liu
Journal of Neuroinflammation (2021)
OLIG2 regulates lncRNAs and its own expression during oligodendrocyte lineage formation
- Haichao Wei
- Xiaomin Dong
- Jia Qian Wu
BMC Biology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.