Main

Colorectal cancer (CRC) remains the second main cause of cancer death in western countries (Jemal et al, 2011). Despite significant advances in the management of CRC, early diagnosis and timely surgery of an already malignant but not yet invasive tumour still represent the best- and only-cure options (Center et al, 2009; Ferlay et al, 2010; Gellad and Provenzale, 2010). Available methodologies for early detection are still based on traditional screening methods such as the Faecal Occult Blood Test (FOBT) in average-risk populations followed by colonoscopy in the case of positivity. However, FOBT is neither very sensitive nor particularly specific, and colonoscopy, though specific and sensitive, is not suitable for screening in the general population because of low compliance rate and high costs. Therefore, there is an increased need for new markers for earlier disease detection and a more efficient monitoring during the follow-up period (McLoughlin and O'Morain, 2006; Burch et al, 2007).

Plasma/serum circulating microRNAs (miRNAs) are promising novel minimally invasive biomarkers for diagnosing and monitoring human cancers (Calin and Croce, 2006; Chen et al, 2008; Filipowicz et al, 2008; Mitchell et al, 2008). They are small non-coding RNAs (18–25 nucleotides long) that can promote mRNA degradation or regulate the translation of specific protein-coding genes (Kosaka et al, 2010). microRNAs have an important regulatory role in a wide range of biological and pathological processes and are involved in tumorigenesis and in the development of various cancers, and miRNA signatures related to tumour classification, diagnosis, and disease progression have been identified (Calin and Croce, 2006). Tumour-derived miRNAs are resistant to endogenous ribonuclease activity, as well as extreme pH and temperature and can therefore be present in plasma/serum in a remarkably stable form (Mitchell et al, 2008). Initially, this was explained by the packaging of miRNAs in some kinds of secretory particles including apoptotic bodies and exosomes in plasma/serum (Kosaka et al, 2010). Recent studies also found miRNAs in protein complexes including argonaute (AGO) or high-density lipoproteins (Arroyo et al, 2011; Vickers et al, 2011). In addition, the expression level of serum/plasma miRNAs is reproducible and consistent among individuals and miRNAs are measurable with a much greater sensitivity than proteins (Chen et al, 2008; Mitchell et al, 2008). The presence of circulating miRNAs that could be used for early detection of cancer has successfully been investigated in several malignancies such as prostate cancer, lymphoma, ovarian cancer, lung and breast cancer, gastric, oesophageal, and colorectal cancer (Cortez et al, 2011).

The identification of disease-specific miRNAs is hampered by the fact that blood also contains miRNAs derived from the haemolysis of cells of different hematopoietic origins, whose levels correlate with the blood cell count. Some of these cellular miRNAs are included in the lists of tumour-related miRNA circulating biomarkers reported in the literature, and differences in their levels might be related to blood cell events rather than the presence of cancer (Pritchard et al, 2012). One possible test method to avoid misidentification of non-specific miRNAs might be the measurement of haemoglobin levels as markers of haemolysis in plasma samples to determine whether they are suitable for miRNA analysis (Kirschner et al, 2011).

In this study, we investigated whether the plasma concentration of a group of miRNAs that were differently expressed in tumour vs normal colon tissues (Reid et al, 2012) could be used to screen for the presence of CRC or for monitoring recurrence. Our results provided evidence that the plasma level of miR-378 could be used to discriminate CRC patients from healthy individuals, irrespective of the level of haemoglobin of plasma samples and confirmed that miRNA analysis in plasma samples represents a useful non-invasive tool to detect CRC as well as to monitor recurrence.

Materials and methods

Patient cohort

The study design, approved by the institutional review board of Fondazione IRCCS Istituto Nazionale dei Tumori Milano (INT), includes a total of 65 patients (cases). Peripheral blood was collected in EDTA tube the day before surgery (T0) and for 46 patients also 4–6 months after surgical resection (T1), during the scheduled medical examinations. Patients who had received preoperative radiotherapy or chemotherapy were excluded. Overall, a series of 70 blood donors recruited through the Transfusion Medicine Service of our Institute and the Associazione Volontari Italiani Sangue of Milan who appeared to be disease free by routine serological and instrumental analyses were considered as a normal control. The distribution of sex and age was quite comparable in cases and controls.

All samples were collected at INT between 2009 and 2011 and each patient and donor provided written informed consent to donate blood samples for research purposes. Tumours were classified according to Dukes classification (Dukes, 1932), 54% of them were stages I and II, the remaining were at stages III and IV. Plasma was immediately separated from the cellular fraction by centrifugation at 3000 g for 10 min at room temperature and stored at −80 °C. An additional centrifugation (1000 g for 5 min at 4 °C) was performed before RNA extraction to remove cell debris.

Free haemoglobin concentrations in plasma samples were analysed by spectrophotometric analysis. The ratio between the absorbance peaks at 414 and 375 nm, suggestive of the presence of free haemoglobin, was used for quantitative determination of haemolysis (Kirschner et al, 2011). Samples with a haemolysis ratio of 2 were considered as haemolysis free. Starting from the overall cohort of 135 subjects (65 cases and 70 controls) the training and the validation sets were selected.

Nucleic acids extraction and plasma RNA quantification

Total RNA (including small RNAs) was extracted from 400 μl of plasma. Trizol reagent (Invitrogen, Carlsbad, CA, USA) and the miRNeasy Mini Kit (Qiagen, Chatsworth, CA, USA) were used for samples included in the training set, with the following modifications: Trizol was added to plasma samples in a ratio of 3 : 1. After chloroform addition and centrifugation at 16 100 g for 15 min at 4 °C, nucleic acids were precipitated at −80 °C with 1.5 volumes of 96% ethanol and isolated on miRNeasy columns. DNAse treatment was carried out following the manufacturer’s instructions. Qiazol Lysis reagent (Qiagen) added with the MS2 RNA Carrier (Roche Applied Science, Milan, Italy) and the miRNeasy Mini Kit modified by Exiqon (Vedbaek, Denmark; ‘RNA Purification from blood and Serum Protocol’) were used for samples from the validation set and for samples collected after surgery. Qiazol mixture was combined to plasma in a ratio of 3.75 : 1. Centrifugation was done at 12 000 g for 15 min at 4 °C and samples were washed three times with 500 μl of RPE buffer. RNA was eluted in 40 μl of water pre-warmed at 42 °C.

miRNA quantitative real-time PCR

Taqman MicroRNA Assays were used for miRNA quantification. In brief, 2 μl of RNA was reverse transcribed to cDNA using the High-Capacity cDNA Reverse Transcription Kit and miRNA-specific primers (Applied Biosystems, Foster City, CA, USA) according to the manufacturer’s instructions in a reaction volume of 15 μl. qPCR was done using the FAST chemistry (Applied Biosystems) with the manufacturer provided miRNA-specific assays in ABI PRISM 7900 HT Real-Time PCR system (Applied Biosystems). The assays analysed are listed in Table 1.

Table 1 List of the miRNAs analysed in the plasma samples; miR-16, in italics, was used as a normaliser

miR-29a and miR-92 were tested using the Qiagen technology as reported by Huang et al (2010) and Ng et al (2009). Briefly, 4 μl of RNA was reverse transcribed to cDNA using the miScript Reverse Transcription Kit (Qiagen) according to the manufacturer’s instructions in a reaction volume of 20 μl. SYBR Green chemistry (Qiagen) was used to perform the qPCR with the manufacturer provided miScript Universal Primer and miScript Primer Assay in LightCycler 480 II Roche.

Statistical analysis

The relative quantity (RQ) of each miRNA was calculated using the comparative threshold cycle (Ct) method with the equation 2ΔCt, where ΔCt=Ct miRNA−Ct miRNA-16 (Livak and Schmittgen, 2001). The association between the RQ, expressed in logarithmic scale (log2 RQ=−ΔCt), of each miRNA and the risk of colon cancer was evaluated by resorting to a univariate logistic regression model (Hosmer and Lemeshow, 1989) that estimates odds ratio (OR) and its 95% confidence intervals (CIs). We implemented this model in the training set to first select the promising miRNAs. These miRNAs were then evaluated in the validation set. Next, receiver operating characteristic (ROC) curves were implemented for the validated miRNA(s) and the area under the ROC curve (AUC), with the corresponding 95% CI, was estimated to assess their predictive capability (Harrell et al, 1982). For the 46 patients where peripheral blood was collected both the day before surgery (T0) and 4–6 months after surgery (T1) the RQ of the validated miRNA(s) was compared using a mixed regression model (McCulloch and Searle, 2001) with patient as the random factor and time (T0 vs T1) as the fixed factor. Statistical analyses were performed by the SAS software v. 9.2 (SAS Institute Inc., Cary, NC, USA).

Results

Selection of tumour-associated miRNAs

Twenty-five miRNAs differentially expressed in tumour vs normal colon tissue (Schetter et al, 2008; Reid et al, 2012) or previously found in plasma of CRC patients (Ng et al, 2009; Huang et al, 2010) were selected to evaluate their possible use as circulating biomarkers of CRC (see Table 1). A pilot analysis, performed on RNA extracted from plasma of 10 patients with CRC and from 10 healthy donors, showed that 17 of the analysed miRNAs were expressed at detectable levels (mean Ct values <34) also in the plasma (see Table 1). Among them, miR-885-5p presented a detection rate of <50% in both control and CRC plasma samples and was discarded.

Identification of CRC-related plasma circulating miRNAs

Expression of the 16 selected miRNAs was measured in the training and validation sets. The miRNAs that resulted different among patients and healthy individuals also in the validation set were tested on plasma collected 4–6 months after surgery from disease-free patients.

Training set

In Figure 1 are reported the ORs and the corresponding 95% CI obtained from the univariate logistic model implemented for each of the 16 miRNAs evaluated in the training set consisting of plasma samples from 36 patients with cancer and from 41 donors. The four miRNAs with an OR significantly different from 1.00 were miR-21, miR-135b, miR-378, and miR-204. The distributions of the RQ of these miRNAs in cases and controls are shown in Figure 2 by box-plots. miR-204 and miR-378 were not detected in two controls and miR-135 in three controls. All the data were available for miR-21. The median RQ was higher in cases than in controls for miR-135b, miR-21, and miR-378.

Figure 1
figure 1

Results of the logistic analysis in the training set. Odds ratio (OR) and its 95% confidence interval (CI) obtained from the univariate logistic model implemented for each of the 16 miRNAs evaluated in the training set.

Figure 2
figure 2

Distribution of the relative quantity of the four relevant miRNAs in the training set. Box-plots reflect the distribution in cases and controls of the relative quantity of the miRNAs with odds ratio statistically significant in the training set. Each box indicates the 25th and 75th percentiles of the distribution. The horizontal line inside the box indicates the median, and the whiskers indicate the extreme measured values. Each individual value is represented by a dot.

Validation set

In Table 2 are presented the results of the univariate logistic models and the AUC for the four selected miRNAs in the validation set, consisting in a total of 58 samples including 29 cases from cancer patients and 29 from controls. From this analysis, it appears that the OR of miR-21 and miR-378 remains statistically significant in the validation set. For both these miRNAs, we observed a significant predictive capability to discriminate between case and control with a lower limit of AUC 95% CI>0.5 (Table 2). In particular, miR-378 has the highest predictive capability (AUC 95% CI: 0.68–0.91). Figure 3 reports the ROC curves (panels B and D) and the box-plots (panels A and C) of the RQ of the two validated miRNAs. The level of correlation between the two validated miRNAs was moderate (Spearman’s correlation coefficient: 0.26; 95% CI: 0.005–0.488), and in the logistic model including both miRNAs only miR-378 maintains its significance with an OR similar to that obtained in the univariate model (miR-21: OR=1.59, 95% CI: 0.69–3.64; miR-378: OR=4.03, 95% CI: 1.67–9.73).

Table 2 Results of the univariate logistic model and AUC of each miRNA in the validation set
Figure 3
figure 3

Distribution of the relative quantity of the two validated miRNAs and the corresponding receiver operating characteristic (ROC) curves. (A and C) The box-plots reflect the distribution in cases and controls of the relative quantity of miR-378 (A) and miR-21 (C) in the validation set. Each box indicates the 25th and 75th percentiles of the distribution. The horizontal line inside the box indicates the median, and the whiskers indicate the extreme measured values. (B and D) The ROC curves derived from the univariate logistic analysis in the validation set for miR-378 (B) and miR-21 (D), respectively.

Analysis of miRNA expression after CRC surgery

To assess the possible value of the two validated miRNAs in monitoring outcome after surgery, we measured their expression levels in plasma from 46 patients undergoing planned surgery, collected the day before surgery (T0) and 4–6 months after surgery (T1) during scheduled clinical evaluation, and the relative expression levels at the two time points were compared. All the patients analysed at T1 were disease free. A statistically significant reduction was observed for miR-378 after surgery (P-value=<0.0001), whereas the expression of miR-21 was not significantly different at the two considered time points (P-value=0.8611). Figure 4 reports the box-plot of the relative expression of miR-378 according to the two time points.

Figure 4
figure 4

Distribution of the relative quantity of miR-378 in plasma collected before and after surgery. Box-plot reflects the distribution of the relative quantity of miR-378 in plasma collected the day before surgery (T0) and 4–6 months after it (T1) in disease-free patients. Each box indicates the 25th and 75th percentiles of the distribution. The horizontal line inside the box indicates the median, and the whiskers indicate the extreme measured values. Each individual value is represented by a dot and the two points corresponding to the same individual are connected by a line.

Identification of CRC-related circulating miRNAs in the haemolysis-free subgroup

Data were analysed by considering only haemolysis-free cases (haemolysis ratio 2, measured as described in Materials and Methods).

From the training set, we selected the 24 cancer patients and 19 donor samples with a haemolysis ratio of −2 and implemented a univariate logistic regression model for each of the 16 considered miRNAs. The only two miRNAs with an OR significantly different from 1.00 were miR-21 (OR: 2.83; 95% CI: 1.42–5.63; P-value=0.003) and miR-378 (OR: 3.43; 95% CI: 1.27–9.26; P-value=0.015). For both of them, median relative expression was higher in cases than in controls (see Supplementary Figure 1). Expression of miR-21 and miR-378 was compared by considering 18 cancer patients and 20 controls with a haemolysis ratio of 2 from the validation set. The results of the univariate logistic models showed that only miR-378 remains statistically significant (OR: 3.03; 95% CI: 1.17–7.86, P-value=0.023). The ROC analysis confirmed also the significant predictive capability of miR-378 (AUC 0.74; 95% CI: 0.58–0.90). Supplementary Figure 2 reports the ROC curve (panel B) and the box-plot (panel A) of the relative expression of miR-378.

Analysis of the expression levels of miR-378 in plasma collected after surgery from the 21 patients with a haemolysis ratio of 2 at both collection times (the day before surgery and 4–6 months after it) confirmed a statistically significant reduction in the expression of miR-378 after cancer removal (P-value=0.0002). Supplementary Figure 3 reports the box-plot of the relative expression of miR-378 according to the two time points.

Discussion

Tumour-specific alterations in nucleic acids detectable in the plasma of cancer patients are promising non-invasive biomarkers for diagnosis and for monitoring of various cancers (Frattini et al, 2008). Tumour-derived miRNAs have been found in plasma, where they show tissue-specific signatures and exhibit high stability compared with circulating DNA and mRNA (Mitchell et al, 2008), highlighting their possible use as new biomarkers for identifying patients with cancer.

To find possible CRC-derived circulating miRNAs, we focused on plasma levels of a set of 23 tumour-associated miRNAs differently expressed between matched normal and tumoral colorectal tissues we had previously identified (Reid et al, 2012). Among the top miRNAs differentially expressed in tumours reported in the work of Schetter et al (2008), we included miR-93, as well as miR-29a, miR-92, and miR-203 that have been described in plasma from CRC patients (Ng et al, 2009; Huang et al, 2010). Sixteen of the analysed miRNAs were expressed at detectable levels also in plasma, supporting the hypothesis that CRC cells can release miRNAs. Four of them, miR-135b, miR-204, miR-21, and miR-378, were present at significant different levels in plasma from cancer patients, in comparison with cancer-free individuals. miR-21 and miR-378 confirmed these results in an independent cohort of samples, selected with the same criteria as the test set, thus providing strong evidence that miR-21 and miR-378 are valid plasma markers of the presence of CRC. Among the two miRNAs, miR-378 has the highest predictive capability (AUC=0.796). Results were further confirmed when 12 more cases and 13 control samples were added to the validation set (data not shown). In addition, plasma levels of miR-378 significantly decreased after surgical removal of CRCs, indicating that this miRNA is also a potential marker for monitoring therapeutic outcome.

A comparison between the predictive capability of the two identified miRNAs and other known CRC biomarkers shows that miR-378 predicts the presence of CRC similarly to DNA methylation markers (Lange et al, 2012), while miR-21 has a predictive value similar to ctDNA (Perrone et al, in press).

No significant association was observed between miR-378 and miR-21 expression among the different tumour stages of the cases, categorised as early (stages I and II) and late (stages III and IV) (data not shown), suggesting that the two miRNAs could potentially be used as novel non-invasive biomarkers for early CRC detection after successful treatment. This possibility needs to be further investigated in larger case series. In addition, patients included in this study are under follow-up at our Institute and therefore we will also be able to assess the predictive value of recurrence of the two markers.

Previous studies identified CRC-related circulating miRNAs: Ng et al (2009) found that miR-92a was significantly increased in CRC patients plasma and its levels decreased after surgery, suggesting that plasma miRNAs may be used to monitor cancer patients. Huang et al (2010) found that miR-29a and miR-92a levels were higher in plasma from patients with advanced CRC than in plasma samples from healthy individuals and differentiated advanced adenomas from normal controls. miR-29a was also found higher in serum of CRC patients carrying liver metastases (Wang and Gu, 2012). Other researchers found an association between CRC and expression of miR-221 in directly amplified plasma (Pu et al, 2010). The three miRNAs were included in the set we analysed, but were not specifically associated with CRC, as also found by Faltejskova et al (2012). Possible reasons for these discrepancies could be the different methods used for RNA extraction and miRNA detection, although miR-92a was analysed using the same technology of Huang et al (2010) and Ng et al (2009), or differences in the endogenous controls used for normalisation. The genetic variations among different ethnic groups and environmental factors may also contribute to the differing results.

Another strong possible explanation of the discrepancies comes from recent studies on the origin of the miRNAs reported in the literature as circulating cancer biomarkers. These studies indicate that the majority of those miRNAs do not have a tumour cell-specific origin and may derive from the rupture of blood cells still present in plasma (haemolysis) because of collection procedures or specimen processing conditions (Pritchard et al, 2012), thus impacting on their possible use as tumour biomarkers. Interestingly, miR-92a, miR-29a, and miR-221 are expressed by red blood cells (Pritchard et al, 2012) and miR-92a is strongly affected by haemolysis. The fact that the range of haemolysis ratio in our cases was quite narrow (from 1.03 to 2.53 for cases and from 1.11 to 3.05 for controls in the training set and from 1.18 to 2.90 for cases and from 1.03 to 2.74 for controls in the validation set) could also explain why we did not confirm the three miRNAs as CRC biomarkers.

When we considered samples with haemolysis ratio 2 (24 cases and 19 controls in the training set and 18 cases and 20 controls in the validation set), only miR-378 was confirmed as a CRC biomarker also able to monitor therapeutic outcome. Since miR-21 has recently been found expressed by red blood cells (Kirschner et al, 2013), we cannot exclude that the result we observed in the whole case series is linked to the few haemolysed samples present. Alternatively, the reduced number of haemolysis-free samples was insufficient to highlight differences between cases and controls.

Normalisation was done using miR-16 as an internal control, according to the initial studies on plasma circulating miRNAs (Mitchell et al, 2008), and under the same conditions as the first studies on plasma from CRC patients, to compare the obtained results (Chang et al, 2010; Huang et al, 2010). Plasma levels of miR-16 are affected by haemolysis (Pritchard et al, 2012), but the levels of haemolysis observed in our samples were so low as to justify their inclusion in our analysis. In fact, Kirschner et al (2011) have shown that, in the absence of haemolysis, the levels of miR-16 are sufficiently constant to serve as a normaliser.

Regarding the two miRNAs we found highly expressed in plasma from CRC patients, miR-21 is an oncogenic miRNA altered in many tumours that regulates the expression of multiple cancer-related target genes such as PTEN and TPM1 (Meng et al, 2007; Zhu et al, 2007). Several studies have highlighted the potential of miR-21 as a highly promising prognostic marker. In fact, we and others found that miR-21 is upregulated in CRCs tumours compared with matched non-tumour tissues (Schetter et al, 2008; Reid et al, 2012) and its expression increases during tumour progression (Schetter et al, 2008). In addition, higher miR-21 levels in primary tumours were correlated with shorter disease-free interval (Kulda et al, 2010) and high miR-21 expression was associated with poor response to adjuvant therapy and more rapid recurrence in patients with stage III disease (Schetter et al, 2008). miR-21 is highly expressed in serum from several cancers including CRC, breast, oesophageal, gastric, and lung cancer (Wang and Zhang, 2012) and has also been found higher in plasma from 30 CRC carrying patients (Kanaan et al, 2012).

miR-378 was overexpressed in plasma from CRC patients and its levels were strongly reduced after surgery; it is a target of the c-Myc oncoprotein and can promote cellular transformation, at least in part, by targeting and inhibiting TOB2, an anti-proliferative member of the BTG family that could repress cyclin D1 transcription (Feng et al, 2011). This miRNA targets the tumour suppressor genes Sufu and Fus-1 and can enhance cell survival, tumour growth, and angiogenesis (Lee et al, 2007). miR-378 was higher in serum from patients with renal cell carcinoma (Redova et al, 2012) and in serum from gastric cancer patients than in healthy individuals (Liu et al, 2012). Similarly to what has been observed for gastric and renal cancer (Chow et al, 2010; Redova et al, 2012) miR-378 is downregulated in CRC tissues (Reid et al, 2012; Mazeh et al, 2013). Regarding gastric cancer, some authors speculated that cancer cells may selectively release or capture cellular miRNAs, resulting in increased (as for miR-378) or decreased serum miRNA levels with respect to tumour tissue. Further analyses are needed to clarify the existence of a cellular mechanism that selects and regulates miRNA circulation.

Our study suggests that the ongoing search for plasma miRNA markers can be a fruitful approach. Our results, further validated in an independent cohort of samples, show that plasma circulating miR-378 could be a valuable haemolysis-free biomarker for the early detection and monitoring of CRC. Even if our observations are promising, further large-scale prospective studies are necessary to validate miR-378 clinical potential as a non-invasive, cost-effective screening tool for CRC.