Introduction

Spinal cord injury (SCI) is a type of damage to the nerve tract that runs from the lower back to the brain.1 As a high-cost neurological disability,2 SCI can even lead to permanent paralysis and loss of sensation.3 Evidence has shown that SCI is correlated with lots of genetic factors. It has been reported that interleukin-10 (IL-10) can reduce injury-induced inflammation, thus limiting the extent of neuronal injury and the development of injury-induced pain behaviours following excitotoxic SCI.4, 5 Schwann cell transplantation combined with glial cell line-derived neurotrophic factor administration may contribute to axonal regeneration and myelin formation following SCI.6 Permeable peptide 1-superoxide dismutase 1 protein can significantly decrease apoptosis of motor neurons induced by reactive oxygen species, which is helpful for functional recovery following SCI.7, 8 Signal transducer and activator of transcription 3 is an important mediator of reactive astrocytes during the healing process of SCI and may be a promising target for intervention therapy in central nervous system injury.9, 10 Sry-related HMG box-9 (SOX9) inhibition decreases the levels of chondroitin sulphate proteoglycans and enhances motor function, indicating that SOX9 inhibition is a promising therapeutic strategy for SCI.11 However, the genetic mechanisms underlying SCI have not been comprehensively revealed. Thus, it is urgent to investigate the pathogenesis of SCI and develop new therapies.

The need for biologically relevant animal SCI models has focused on the development of animal injury models. Techniques such as weight drop, clip compression and chemically mediated SCI have been successfully introduced into laboratory animal models.12 Among these injury models, the clip compression model is introduced as one of the nontransection models of SCI in rodents.12, 13 The clip compression model has the ability to mimic injury at different levels by adjusting the force and duration of clip application.12, 14 Using the clip compression model, Chamankhah et al.12 perform microarray data analysis, time-series expression profile clustering and functional enrichment analysis to provide an overall picture of biological processes and the relevant molecular events underlying SCI model rats. However, the mechanisms of SCI are not entirely clear. As different analysis procedures and methods can generate different results, we further screened the top 100 temporal genes using the data from Chamankhah et al.12 Then, gene clustering of the top 100 temporal genes was performed. The potential functions of these temporal genes were predicated by functional, pathway and protein domain enrichment analyses. In addition, a pathway–gene crosslinking network was constructed to investigate the interaction relationships between these temporal genes and pathways.

Materials and methods

Microarray data

The expression profile of GSE45006 deposited by Chamankhah et al.12 was downloaded from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) database, which was based on the platform of GPL 1355 [Rat230_2] Affymetrix Rat Genome 230 2.0 Array. GSE45006 included 20 SCI samples (including samples at 1 day, 3 days, 1 week, 2 weeks and 8 weeks after injury, four repetitions for each time point) and 4 normal samples. The rats used in this study were female Wistar rats (250 g, Charles River Laboratories (Wilmington, MA, USA), four injured rats for each time point and four sham rats). Injuries were caused by the aneurysm clip method.15, 16 Briefly, under a 1:1 mixture of O2/N2O, the rats were drugged with halothane (1–2%), and the surgical area was disinfected using 70% ethanol and betadine. An incision was made in the midline of thoracic area (T4-T9), and superficial muscles and skin were retracted. Rats were subject to a T6–T8 laminectomy, and they subsequently received a 35-g clip (Walsh) compression injury at T7 for 1 min. The rats had a cut sutured and were administered Clavamox for 7 days, standard post-operative analgesia treatments and saline (0.9%; 5 ml).

Temporal genes screening

After GSE45006 was downloaded, microarray data were pre-processed according to the following processes. Combined with the annotation information in the platform, probe names were transformed into gene symbols. Using aggregate function methods,17 the mean value of probes mapped with the same gene was obtained and taken as the gene expression value. Using the KMN method of the impute package18 in R (k was set to 10), the probe without a corresponding value was added with difference value. Then, quantile normalisation was conducted by the median method of the preprocessCore package19 in R to obtain a normalised matrix.

The analysis of variance method of the genefilter package20 in R was used to filter genes with a larger interblock difference than intra-group difference. The significance threshold was set to 0.05. As an improved and sensitive empirical Bayes approach, the Bayesian Estimation of Temporal Regulation (BETR) was utilised to measure differential expression in time-series data.21 The BETR algorithm of the BETR package21 in R was applied to calculate probabilities of differentially expressed genes in microarray time-course data. A probability >0.95 was used as the cutoff criterion. The classification algorithm random forest is appropriate for gene expression profile data, given that it can be used for a larger number of variables and problems with more than two classes. The algorithm performs well regardless of whether predictive variables are noise, and evaluates variable importance.22 Using the randomForest package22 in R, the top 100 temporal genes were identified by the random forest method.

Expression pattern clustering of temporal genes

The gplots package23 in R was used to perform heatmap analysis for the expression patterns of the top 100 temporal genes and to construct two-dimensional hierarchical clustering figures. Subsequently, Pearson correlation coefficient, which is one of the most common coefficients used to evaluate the dependence of two variables,24 was employed to cluster the expression patterns of the top 100 temporal genes.

Functional, pathway and protein domain enrichment analysis

Gene Ontology (GO) is one of the most important tools used to represent and process information concerning genes, gene functions and gene products.25 The Kyoto Encyclopedia of Genes and Genomes (KEGG) is a database comprised of known genes and their biochemical functionalities.26 Using the clusterProfiler package27 in R, GO and KEGG pathway enrichment analyses were performed for the top 100 temporal genes. P-values were adjusted using the Benjamini and Hochberg test,28 and an adjusted P-value <0.05 was used as the cutoff criterion. In addition, the TargetMine online tool29 was used to conduct GO, KEGG pathway and protein domain enrichment analyses for the genes in clusters. P-values were adjusted using the Holm–Bonferroni method,30 and an adjusted P-value <0.05 was used as the threshold.

Pathway–gene crosslinking network construction

Cytoscape is a popular software package that can integrate high-throughput expression data or other molecular networks with biomolecular interaction networks into a unified biological network.31 After pathway enrichment analysis was performed for the genes in clusters, Cytoscape software31 (http://www.cytoscape.org) was used to visualise the pathway–gene crosslinking network.

Results

Temporal genes analysis

After analysis using the betr package in R, a total of 1907 temporal genes with a probability 0.95 were identified from the microarray data of GSE45006. Then, the top 100 temporal genes were obtained using the randomForest package in R.

Expression pattern clustering of temporal genes

Two-dimensional hierarchical clustering features of expression patterns of the top 100 temporal genes are presented in a heatmap (Figure 1). The hierarchical clustering tree (Figure 2) showed that the top 100 temporal genes were classified into six clusters (cluster 1–6) according to their expressions in samples at each time point. The heatmaps of the six clusters are presented in Figure 3. From 1 day to 8 weeks after injury, the expressions of genes in cluster 1 and cluster 2 decreased firstly and then gradually increased, in contrast to the genes in cluster 6. After the introduction of clip compression injury and up to 8 weeks after injury, the expressions of genes in cluster 3 gradually decreased along with the experimental duration. Gene expression levels in normal samples were significantly reduced compared with those in SCI models at each time point. In cluster 4, gene expression levels in SCI models at 1 day and 3 days post injury were significantly increased compared with those in SCI models at other time points and normal samples. The expression of genes in cluster 5 gradually increased, and gene expression levels in normal samples were significantly increased compared with those in SCI models at each time point.

Figure 1
figure 1

Heatmap for the top 100 temporal genes. A full colour version of this figure is available at the Spinal Cord journal online.

Figure 2
figure 2

Clustering tree for the top 100 temporal genes.

Figure 3
figure 3

Heatmaps for the top 100 temporal genes in the six clusters (cluster 1, cluster 2, cluster 3, cluster 4, cluster 5 and cluster 6). A full colour version of this figure is available at the Spinal Cord journal online.

Functional, pathway and protein domain enrichment analysis

The top 10 GO terms enriched for the top 100 temporal genes are listed in Table 1. Most of the GO terms were in biological process categories, including localisation (P-value=8.29E-08), single-organism process (P-value=2.52E-07) and response to organic substance (P-value=9.93E-07). The KEGG pathways enriched for the top 100 temporal genes are listed in Table 2, including arginine and proline metabolism (P-value=2.37E-03), colorectal cancer (P-value=2.37E-03) and glycerolipid metabolism (P-value=8.63E-03).

Table 1 The top 10 enriched functions for the top 100 temporal genes
Table 2 The enriched pathways for the top 100 temporal genes

Using the TargetMine online tool, GO functional, KEGG pathway and protein domain enrichment analyses were performed for the genes in the six clusters (Table 3). Two genes in cluster 1 (CD4 and VCAM1) were involved in immunoglobulin C2-set domain (P-value=1.48E-02). Three genes in cluster 2 (CTSH, RNPEP and TPP1) were enriched in molecular function of exopeptidase activity (P-value=4.20E-02). Ten genes in cluster 3 were involved in cellular component of cytosol (P-value=1.33E-02). Moreover, arginase 1 (ARG1) and nitric oxide synthase 3 (NOS3) in cluster 4 were enriched in biological processes of arginine catabolic process (P-value=3.77E-03), arginine metabolic process (P-value=5.65E-03) and glutamine family amino acid catabolic process (P-value=4.53E-02). There were no terms enriched for the genes in cluster 5 and cluster 6.

Table 3 The enriched functions and protein domains for genes in the six clusters

Construction of a pathway–gene crosslinking network

The pathway–gene crosslinking network was constructed following pathway enrichment analysis for the top 100 temporal genes in clusters (Figure 4). As a result, transforming growth factor beta 2 (TGFβ2); TGFβ3, aldehyde dehydrogenase 2 (ALDH2); and aldehyde dehydrogenase 3 family, member A2 (ALDH3A2) were involved in numerous pathways. Both TGFB2 and TGFB3 were associated with map05146 amoebiasis pathway. In addition, ALDH3A2 and ALDH2 are associated with both map01100 metabolic pathways and map00330 arginine and proline metabolism pathways. In addition, ARG1 is the unique connection to the map05146, map01100 and map00330 pathways (Figure 4).

Figure 4
figure 4

The pathway–gene crosslinking network for genes in the six clusters (cluster 1, cluster 2, cluster 3, cluster 4, cluster 5 and cluster 6). Purple rectangles represent genes in cluster 1. Pink rectangles indicate genes in cluster 2. Yellow rectangles represent genes in cluster 3. Green rectangles represent genes in cluster 4. Blue rectangles indicate genes in cluster 5. Orange rectangles represent genes in cluster 6. Ovals indicate pathways. A full colour version of this figure is available at the Spinal Cord journal online.

Discussion

In this study, 1907 temporal genes in the SCI models with a probability 0.95 were obtained via comparisons with normal samples. Hierarchical clustering showed that the top 100 temporal genes that could be divided into six clusters (cluster 1, cluster 2, cluster 3, cluster 4, cluster 5 and cluster 6) exhibited different expression patterns during the time series. Most of the enriched functions for the top 100 temporal genes were in biological process categories. TGFβ2, TGFβ3, ALDH2 and ALDH3A2 were correlated with numerous pathways in the pathway–gene crosslinking network.

The treatment of SCI rats with TGFβ reduces mononuclear phagocyte numbers around the injury site, reducing their contribution to secondary damage by decreasing lesion volume.32 Anti-TGFβ1 aggravates secondary injury by inhibiting the anti-inflammatory effect of TGFβ1.33 Lagord et al.34 reported the temporal expression and cellular localisation of TGFβ following SCI, suggesting that TGFβ1 regulates the inflammatory and neuronal responses, whereas TGFβ2 moderates glial/collagen scarring. Thus, TGFβ2 correlates with the deposition of scar tissue in SCI lesion rather than TGFβ1 does. In this study, TGFβ2 and TGFβ3 expression levels were successively evaluated at 1–3 days and 1–8 weeks following SCI injury. These findings indicate that TGFβ2 and TGFβ3 exhibit a close correlation with the repair of SCI. TGFβ2 expression levels post SCI were decreased by agmatine treatment, suggesting the therapeutic potential of inhibition of TGFβ2 in the treatment of SCI.35 As a mitochondrial enzyme metabolises numerous aldehydes, ALDH2 mediates nociception and can be used as a molecular target for controlling pain.36 ALDH1A1 protects dopaminergic neurons in substantia nigra pars compacta through inhibiting the formation of cytotoxic α-synuclein oligomers and accumulation of dopamine aldehyde intermediates.37 Thus, ALDH2 and ALDH3A2 might also have important roles in the repair process after SCI.

There are three types of NOS enzymes (constitutive Ca2+/calmodulin-dependent neuronal NOS, nNOS; endothelial NOS, eNOS; and inducible calcium-independent isoform, iNOS). These enzymes have a crucial role in the progression of SCI, with degradation of l-arginine to l-citrulline and NO.38 Arginase regulates NO levels by competing with NOS for the substrate l-arginine.39 Upregulations of eNOS, iNOS and NAD(P)H oxidase cooperate to sustain oxidative and nitrosative stress in the damaged spinal cord tissue.40 In addition, the expression of ARG1 may provide a balance for NOS amounts.41 Thus, inhibition of arginase may be a promising treatment strategy for neurogenic bladder overactivity in SCI.42

Different types of NOS enzymes act in different periods of SCI in rats. In detail, nNOS induces the initial maximal NO increase, and iNOS causes the second wave of NO production.43, 44, 45 NO induces breakdown of the blood–spinal cord barrier, motor dysfunction and oedema formation in SCI, indicating that inhibitions of nNOS and NO production is essential for neuroprotection.38, 46 Lee et al.47 demonstrated that peripheral nerve grafts with acidic fibroblast growth factor treatment markedly induced nNOS protein, and significantly increased eNOS protein compared with injury groups. As previously reported, TGFβ is associated with l-arginine-dependent NO-mediated mechanisms.48 TGFβ2 and TGFβ3 induce Sertoli cell NOS production, suggesting that TGFβ2 and TGFβ3 regulate tight junction-barrier function in Sertoli cells.49 In this current study, TGFβ2 and TGFβ3 were also associated with arginine catabolic processes through ARG1. These findings might provide a new insight of correlation between TGFβ and arginine catabolic processes for investigations of SCI mechanism.

Despite the comprehensive analysis, several limitations in the present study should be noted. First, this study was based on the expression profiles data downloaded from the GEO database. Second, the sample size was relatively small, with only four repetitions for each time point. Moreover, all the predictive results were obtained from bioinformatics analysis and must be further confirmed.

In conclusion, we conducted an integrated bioinformatics analysis of genes potentially related to SCI. A total of 1907 temporal genes were identified in the SCI samples. Among the top 100 temporal genes, TGFβ2, TGFβ3, ALDH2, ALDH3A2, ARG1 and NOS3 are potentially associated with the mechanisms of SCI. These findings might promote the understanding of the pathogenesis of SCI and lay the foundation for the targeted therapy of SCI. However, further experimental studies are still necessary to unravel their mechanisms of action in SCI.

Data archiving

There were no data to deposit.