Main

Colorectal cancer (CRC) is one of the most common cancers in Western countries and it is the second leading cause of cancer-related deaths among adults (Smith et al, 2010). For patients who undergo potentially curative resection, prognosis is adversely impacted by disease recurrence, which is often associated with distant metastasis (Mina and Sledge, 2011). It is established that CRC patients with stage III disease benefit from adjuvant chemotherapy, whereas such benefit in stage II patients has remained an area of controversy (Bertagnolli et al, 2011, O’Connor et al, 2011). 5-Fluorouracil (5-FU)-based chemotherapy remains the mainstay for treating CRC patients, and its combination with oxaliplatin is considered as the standard of care chemotherapeutic treatment in stage III disease (Kopetz et al, 2008). A major problem in the treatment of CRC is the intrinsic or acquired drug resistance that can ultimately lead to death of the patient (Goldberg et al, 2004). Therefore, prognostic markers, which can facilitate stratification of stage II and III patients that are at high risk for developing tumour recurrence and metastasis, are an imperative need, as availability of such tools may have a significant impact on the clinical management of CRC patients.

MicroRNAs (miRNAs) are a class of small noncoding RNAs that have been implicated in the initiation, progression and metastasis in various human malignancies (Calin and Croce, 2006, Baek et al, 2008). MiRNAs can function as either oncogenes or tumour suppressors, can control the expression of hundreds of downstream gene targets and are resistant to RNAse-mediated degradation making them inherently stable within a variety of clinical specimens—all of these features, which have made them attractive molecular signatures for the development of various disease biomarkers, as well as potential therapeutic targets in cancer (Nicoloso et al, 2009; Spizzo et al, 2009). Interestingly, recent evidence also suggests that expression alterations of specific miRNAs may also be associated with determining chemosensitivity and chemoresistance in human cancer cells (Kagan et al, 1995; Liu et al, 2010; Tomimaru et al, 2010). In this regard, recently, the expression levels of miR-34a, miR-10b, miR-19b and miR-143 were shown to be related to responsiveness to 5-FU-based chemotherapy in CRC cell lines (Borralho et al, 2009; Akao et al, 2011; Kurokawa et al, 2012; Nishida et al, 2012). Such in vitro findings provide a rationale for interrogating the expression of miRNAs in clinical specimens from patients with CRC, which might facilitate development of specific prognostic and predictive biomarkers that can help identify CRC patients who will or will not respond to specific chemotherapies.

The introduction of newer technological platforms including microarrays and next-generation sequencing have revolutionised the approaches aimed towards a better understanding of the molecular events related to tumour progression, distant spread and response to chemotherapies (Boyer et al, 2006; Lujambio et al, 2008). However, till date, only a very small number of studies have attempted genome-wide approaches for the identification of novel prognostic and predictive miRNA biomarkers in clinical specimens from patients with CRC, who have been treated with specific chemotherapies (Schepeler et al, 2008; Schetter et al, 2008). More importantly, to the best of our knowledge, translatability of markers developed is such studies have been hampered due to lack of validation of discovered biomarkers in independent patient cohorts. In view of these shortcomings in the existing literature, we designed the present two phase study: in the first phase, we performed a systematic, genome-wide discovery of miRNAs that may serve as prognostic and predictive biomarkers in patients treated with 5-FU-based chemotherapy. In the second phase, we validated the expression of discovered biomarkers in another large, independent cohort of patients enrolled in a clinical trial where patients were treated with 5-FU-based chemotherapy regimens. Using such an approach, we have identified miR-320e, as a novel prognostic biomarker, which is associated with adverse clinical outcome in stage III CRC patients treated with 5-FU-based adjuvant chemotherapy. These findings have important clinical implications for the personalised management of patients with CRC.

Materials and Methods

Patients

The discovery cohort

A cohort of 100 stage III colon cancer tissues that included 50 patients with tumour recurrence and 50 without recurrence within 3 years of treatment. These patients were enrolled as part of the NCCTG N0147, a phase III trial of FOLFOX alone or combined with cetuximab as adjuvant chemotherapy (Alberts et al, 2012); however, the clinical specimens in this study were limited to the FOLFOX alone study arm. Median patient follow-up on this trial was 4.1 years. Of the 47 patients without tumour recurrence and 50 with recurrence from whom miRNA expression profiling was successfully obtained, 53.2% and 56% were treated with FOLFOX alone, respectively. This study was approved by the institutional review boards of all participating centres, and a written informed consent was obtained from all patients. Data are summarised either as mean with 95% confidence intervals (CIs) on the log2 scale, or these values were exponentiated to generate fold-change with a CI.

The validation cohort

This group included a total of 237 CRC patients (65 stage II, 102 stage III and 70 stage IV), who were enrolled as part of a clinical trial conducted at the Hospital Clinic, Barcelona, Spain. Patients in this clinical trial were enrolled between 1996 and 2008, and the median follow-up was 4.7 years in patients who were alive at last follow-up, and 3.5 years for all patients. All stage II and III patients were treated with 5-FU-based adjuvant chemotherapy following resection of the primary tumour; all stage IV patients were treated with 5-FU and oxaliplatin (Takahashi et al, 2012). Normal colonic mucosa (n=20) from healthy individuals was obtained from the Pathology Department of the Hospital Clinic of Barcelona and included as controls. This study was approved by the Institutional Review Board of Hospital Clinic, Barcelona, Spain, and a written informed consent was obtained from all patients.

RNA extraction

Total RNA including the miRNA fraction was extracted from formalin-fixed paraffin-embedded tissues using the RecoverAll Total Nucleic Acid Isolation Kit (Ambion, Inc., Austin, TX, USA) according to the manufacturer’s protocol.

MiRNA expression microarrays

MiRNA microarray expression profiling analysis included interrogation of 2221 miRNAs, using the Affymetrix GeneChip miRNA 2.0 Arrays (Santa Clara, CA, USA). Each sample was labelled using the Genisphere FlashTag Biotin HSR kit (Hatfield, PA, USA). Briefly, 1 mg of total RNA was incubated with ATP and poly-A-polymerase to add a 3′-polyA tail. A ligation reaction was then performed to covalently attach to the miRNA population a multiple-biotin molecule containing a 3′-DNA dendrimer. Labelled samples were subsequently processed according to the manufacturer’s instructions. Following hybridisation for 16 h at 48 °C, the arrays were washed and stained in an Affymetrix Fluidics station 450, scanned using an Affymetrix 3000 7G scanner and the expression changes between cases and controls were analysed.

Quantification of miRNA expression by real-time RT–PCR

Expression of miR-320e in the validation cohort was performed utilising the TaqMan reverse transcription–PCR (qRT–PCR) method with the predesigned TaqMan miRNA expression assays (Applied Biosystems Inc., Foster City, CA, USA) using a StepOnePlus Real-Time PCR System (Applied Biosystems). All the experiments were done in duplicate. Results were expressed as 2−ΔCt, and the results were normalised to miR-16. To keep consistent measurements throughout all plates, two independent RNA cell line samples were loaded as internal controls along with each PCR run, and the results from each plate were normalised according to data obtained from these internal controls.

Statistical analysis

The quality of the raw Affymetrix miRNA data was assessed using boxplots and residual MVA plots for each subject pre- and post-normalisation (Eckel et al, 2005). Per-probe and -patient distributions of detection calls were also examined, where signal intensities with detection P<0.06 were considered to be detected. The non-human probes were excluded before normalisation, leaving 2221 human miRNA transcripts for analyses. Data were analysed on the log2 scale, and were normalised using quantile normalisation (Bolstad et al, 2003). Differential expression between tumours with and without recurrence within 3 years was performed via logistic regression by incorporating various variables: both unadjusted and adjusted individually for deficient DNA mismatch repair (dMMR) vs proficient MMR (pMMR) status, presence/absence of a BRAFV600E or KRAS mutation, right vs left side and metastatic regional lymph nodes <4 or >=4. FDR values were calculated, and the significance level was set at FDR q<0.05 (Storey and Tibshirani, 2003). Probes were filtered out of consideration if the standard deviation (s.d.) in cases and controls combined for a given probe was less than the 95th percentile s.d. (0.164) of those probes that were not detected in ≥45% of the samples.

Distributions in the validation cohort are presented as mean (s.d.) or median (range) depending on level of skewness in the data for continuous variables or as counts (percentage) for categorical variables. Nonparametric tests (rank sum, Kruskal Wallace) were used to compare expression between groups. Kaplan–Meier and Cox regression models were used to assess association with time to event outcomes that included overall survival (OS), defined as time from diagnosis to death, and disease-free survival (DFS) defined as time from diagnosis to disease recurrence or death. Owing to the scale of miR320e expression, hazard ratios (HRs) are presented as the change in risk of a person at the 75th percentile relative to a person at the 25th percentile of miR-320e distribution. This can be interpreted as the increase in risk due to a patient moving from the middle of the bottom half of the miR-320e distribution to the middle of the top half of the distribution. Bootstrapping was used to calculate an optimism-corrected c-index, analogous to the area under the receiver operating characteristic curve (Steyerberg et al, 2011). A c-index of 0.5 denotes random predictions, whereas a c-index of 1 denotes perfect predictions.

Results

MiR-320e is frequently upregulated in CRC patients with recurrence to 5-FU-based chemotherapy

The discovery phase

We performed systematic discovery for miRNA biomarkers using Affymetrix microarrays in 100 stage III patients treated with 5-FU-based adjuvant chemotherapy, during the course of NCCTG N0147 trial. High-quality miRNA data were obtained on all but three patients without tumour recurrence at 3 years of follow-up. The median age (range) for these 97 patients was 59 (25–81) years, of which 46 (47.4%) were male. There were 48 (49.5%) primary tumours that were located in the distal colon and 49 (50.5%) tumours that were proximal (Table 1).

Table 1 Characteristics of patients in the discovery and validation cohorts

Of the 2221 miRNA probes analysed via microarray analysis, one met statistical significance criteria with unadjusted analyses: miR-320e, with P<0.0001 and a corresponding FDR q-value of 0.0497 (Supplementary Table 1). MiR-320e was upregulated in cases with recurrence vs no recurrence (controls; Figure 1). Specifically, the mean level of miR-320e expression (mean±s.d.) on the log2 scale in patients that recurred was 8.07±0.46 vs 7.72±0.33 compared with patients without recurrence, resulting in a raw fold change of +1.27-fold. After adjusting for sidedness, the mutational status of BRAF and KRAS, and number of metastatic regional lymph nodes (< or ≥4), miR-320e expression was significantly higher in cases vs controls (P=0.0003).

Figure 1
figure 1

MiR-320e expression in the discovery cohort. Log2 normalised expression levels for miR-320e in stage III colorectal cancers treated with 5-FU-based chemotherapy. Cases (patients with tumour recurrence) had significantly higher miR-320e expression compared with controls (patients without recurrence). The red line indicates median value for expression.

The validation phase

To further confirm our discovery phase results for miR-320e expression, we perform validation of these data in an independent cohort of 237 patients with stage II–IV CRC. The median (range) age of the patients was 65 (32–82) years, and of these 141 (59.5%) patients were male. Of the 237 CRCs, 69 (29.1%) were located in the proximal colon, 155 (65.4%) were distal and 13 (5.5%) were located in the rectum. In all, 65 (27.43%) of the cases were stage II, 102 (43.04%) were stage III and 70 (29.5%) were stage IV (Table 1).

MiR320e results were expressed as 2−ΔCt and normalised with the expression of miR-16. The median level of miR-320e expression in primary CRC tissues was almost 13-fold higher than in normal colonic mucosa (median (range): 0.000286 (0.0000238–0.0017000) vs 0.0037 (0–0.028), P<0.0001)). There was a gradual increasing trend for miR-320e expression with advancing tumour stage (Figure 2A). Specifically, significant miR-320e upregulation was found in CRC patients with lymph node metastases, that is, stage III (0.00357 (0.00013–0.022) vs 0.00200 (0–0.028); P=0.0266) or with distant metastasis, that is, stage IV (0.0056 (0.0011–0.02) vs 0.00200 (0–0.028); P<0.0001) compared with patients without regional or distant metastases, that is, stage II.

Figure 2
figure 2

MiR-320e expression in normal colonic mucosa and CRC tissues from the validation cohort. MiR-320e results were expressed as 2−ΔCt and normalised to miR-16. (A) Colonic mucosa vs primary tumours–stage II–IV, miR-320e expression in colonic mucosa from healthy controls (NC) and in stage II, III and IV CRCs; the number of patients (N) and median expression (median) are listed below the graph. (B) Recurrence or death within 3 years vs non-recurrence or death within 3 years–stage II and III, miR-320e expression; the number of patients (N) and median expression (median) are listed below the graph. The red line indicates median value for expression.

Higher-expression miR320e is associated with poor outcome in patients with stage II and III CRC

To determine the potential prognostic significance of miR-320e expression, Cox regression models were utilised with the miRNA expression levels considered as continuous variables. There were 167 stage II and III CRC patients in the validation cohort with sufficient data for these analyses. All of these patients were treated with 5-FU-based adjuvant chemotherapy for 6 months subsequent to tumour resection. Of these patients, 65 out of 167 (38.9%) were stage II and the remainder were stage III. Median follow-up for these patients was 4.4 years (range: 0.24–14.4 years), and was 4.7 years for patients alive at last follow-up (range: 0.69–14.4 years). The 2- and 5-year Kaplan–Meier DFS estimates were 78.32% (72.29, 84.84) and 63.96% (56.58, 72.30), respectively. The 2- and 5-year OS estimates were 91.57% (95% CI=87.44, 95.89) and 78.52% (95% CI=72.15, 85.46), respectively.

Multivariate models were employed to interrogate the statistical significance of results in these 167 stage II/III patients, and to assess the association of miR-320e expression with DFS and OS, after adjusting for tumour location and TNM stage. After adjustment for TNM stage and tumour location in stage II and III patients, an increase of 0.0052 units of miR-320e expression (that is, an increase from the 25th percentile to the 75th percentile of the miR-320e distribution) was associated with an increased HR for DFS (HR=1.39, 95% CI=1.11–1.74, P=0.0036) and OS (HR=1.46, 95% CI=1.11–1.91, P=0.0059; Table 2, Figure 3). The optimism-corrected c-indices were 0.63 and 0.62 for DFS and OS, respectively.

Table 2 Multivariable analysis of the association between DFS or OS and miR-320e expression in stage II/III CRC patients, adjusting for tumour location and stage
Figure 3
figure 3

Predicted DFS and OS functions. Predicted DFS (A) and OS (B) from the Cox regression model containing TNM stage, tumour location and miR-320e expression to visualise the hazard ratios reported in Table 2. Tumour location was set to the left side for calculating the predicted functions. Stage II curves are blue; stage III curves are red. Q1 (solid line) denotes patients with miR-320e expression at the 25th percentile, whereas Q3 (dashed line) denotes patients with miR-320e expression at the 75th percentile. Thus, the curves represent survival functions for patients in the middle of the bottom half of the miR-320e distribution vs those in the middle of the top half of the miR-320e distribution.

When categorising stage II/III the patients into those who had recurred or died within 3 years vs those who were recurrence-free and alive at ≥3 years of follow-up, miR-320e was not significantly different between these groups (0.00410 (0.00013–0.02200) vs 0.00290 (0.00023–0.02800)), P=0.1457; Figure 2B). However, when the cohort was restricted to stage III patients, the level of miR-320e expression was found to be significantly different between patients who had recurred or died within 3 years vs those who were alive and recurrence free after 3 or more years of follow-up (0.00578 (0.00013–0.02200) vs 0.00260 (0.00025–0.01800), P=0.0132).

High miR-320e expression associated with advancing CRC stage

Analysis of the clinicopathological features associated with tumour miR-320e expression levels was also investigated. The variables included in this analysis were age, gender, tumour location, tumour size, tumour grade, mucinous, histology, stage and lymph node metastasis. TNM stage was found to be significantly associated with expression level of miR-320e (P=0.0266). Age was not significantly correlated with miR-320e expression levels (r=0.0626 P=0.4220). Neither tumour location nor lymph node metastasis were significantly associated with the expression level of miR-320e (Table 3). Interestingly, there was a trend towards a positive association between miR-320e and tumour size (P=0.1047) and with lymph node metastasis (P=0.0626); however, these associations were not statistically significant at the 5% level (Table 3).

Table 3 miR-320e expression levels and clinico-pathologic characteristics of stage II and III CRC patients in the validation cohort

Discussion

This clinical study incorporated both a systematic discovery and an independent validation of a miRNA biomarker, miR-320e, which was found to be associated with CRC recurrence in two clinical trial cohorts of patients treated with 5-FU-based adjuvant chemotherapy. We discovered a number of factors that support miR-320e as a potential and novel prognostic biomarker in CRC patients. First, miR-320e expression was significantly increased in resected primary tumours compared with colon mucosa from healthy controls. We observed a gradual increase in miR-320e expression with advancing TNM stages (II–IV). In this regard, there was a significant increase in miR-320e levels in primary tumours from patients with lymph node (stage III) or distant metastasis (IV), compared with stage II tumours. Second, we were able to corroborate the miR-320e findings in the discovery cohort with those found in the validation cohort. Third, increased miR-320e expression was significantly associated with worse DFS and OS among patients diagnosed with stage II/III CRCs, and in stage III CRCs alone, even after adjusting for lymph node status.

MiR-320e belongs to the miR-320 family of miRNAs that encompass a total of 37 sequences. The best studied family member from this cluster is miR-320a (frequently referred to as miR-320), which was reported to have a tumour-suppressive role in prostate, breast and colon cancer (Schepeler et al, 2008, Bronisz et al, 2011, Hsieh et al, 2013). MiR-320a is encoded within a CpG-rich promoter region of the cell cycle gene POLR3D in the antisense orientation (Kim et al, 2008), and is reportedly regulated by epigenetic methylation silencing (Lee et al, 2009). MiR-320a is located within a chromosomal region at 8p21.3, which is a frequent target for loss of heterozygosity during cancer progression (Kagan et al, 1995). In contrast, miR-320e is an intragenic miRNA located at chromosome 19 q13.32 within intron-3 of the PRKD2 gene, which encodes a kinase D (PKD) serine/threonine protein that appears to have a role in the development and progression of leukemia (Yang et al, 2013) and breast cancer (Onodera et al, 2012). Currently, much less is known about the functional role of miR-320e in human cancer compared with the rest of the miR-320 family, which is not surprising given that our study for the first time has identified its role in CRC through a systematic discovery process. Expression of miR-320e was identified in melanoma and in human embryonic stem cells by next-generation sequencing (Goff et al, 2009, Stark et al, 2010); however, the mechanism responsible for miR-320e deregulation and its functional role remain unknown. To elucidate the mechanisms responsible for miR-320e upregulation, the identification of miRNA gene targets will be crucial to our understanding of the role of miR-320e in CRC. Prominent bioinformatic tools, such as TargetScan, miRBase and DianaLab, have identified five predicted targets for miR-320e, including PP2R2C, IRF6, ONECUT2, CMCL1 and CPEB2 genes–all of which have been shown to display tumour-suppressor activities in different cancer types (Banerjee et al, 2007; Fournier et al, 2010; Botti et al, 2011; Nairismagi et al., 2012; Laudadio et al, 2012; Beukers et al, 2013; Bluemn et al, 2013). Further functional studies are necessary to investigate the mechanistic role of miR-320e in colon cancer progression, metastasis and chemotherapeutic response.

Our findings are in line with some of the previous reports highlighting the potential significance of miRNAs as potential prognostic biomarkers in CRC. The first study in this regard reported that high expression of miR-21 in CRC patients associated with poor survival (Schetter et al, 2008). Since then, although several others overexpressed miRNAs, including miR-10b (Nishida et al, 2012), miR-17-92a cluster (Valladares-Ayerbes et al, 2011), miR-29a (Weissmann-Brenner et al, 2012), miR-31 (Yang et al, 2013) and miR-182 (Liu et al, 2013), as well as few downregulated miRNAs such as miR-143 (Guo et al, 2013) and miR-124 (Jinushi et al, 2014) have been proposed to have prognostic usefulness, currently miR-21 is considered as one of the best miRNA-based prognostic biomarkers in CRC.

Strengths of our study are that tumour specimens used for the discovery cohort are from a randomised clinical trial conducted through a cancer cooperative group in the United States. Thus, they are well annotated and from patients meeting strict eligibility criteria where detailed follow-up data were collected. Importantly, patients received the FOLFOX regimen, which is standard adjuvant therapy for stage III colon cancer patients. In addition, we have replicated our findings in the discovery cohort in an independent European clinical trial cohort. Validation in an independent cohort is considered one of the highest levels of validation (Steyerberg et al, 2011). Together, these strengths mitigate the high risk of false discovery in our discovery cohort resulting from testing over 2221 miRNA probes. Although we present provocative evidence for the clinical relevance of miR-320e in CRC, our study has limitations. There is a need for further verification of the relevance of our findings to CRC patients, as well as studies to understand the functional importance of this miRNA. Because of unavailability of matched serum/blood specimens from patients analysed in this study, we were unable to explore the potential clinical usefulness of miR-320e as a non-invasive biomarker–a concept, which might lead to further enhancement of its diagnostic and prognostic utility in future. Last, our study focused only on the prognostic value of miR-320e, and did not address its potential utility as a predictive biomarker. However, this issue could not be evaluated in our study cohorts that lacked untreated control arms.

In conclusion, our study first reports the clinical and prognostic relevance of miR-320e in patients with CRC. Expression of miR-320e was frequently upregulated, particularly in advanced stage tumours, and was associated with recurrence and survival in CRC in patients treated with 5-FU-based adjuvant chemotherapy. Evidence that miR-320e expression may serve as a novel prognostic biomarker was shown whereby elevated levels were associated with poorer DFS and OS rates. In future studies, bioinformatic and transcriptomic approaches, as well as functional analysis, will be necessary to investigate the mechanistic role of miR-320e in colon cancer.