Introduction

Invasive breast carcinoma accounts for half of breast ductal carcinoma in situ (DCIS) recurrences [1,2,3,4,5,6]. Several prognostic indices and molecular signatures have been devised to assess the potential risk in DCIS towards its progression into invasive disease and to anticipate disease recurrence. However, the reliability and reproducibility of these indices are still insufficient for precise management decision making. Lack of reproducibility and reliability of these signatures and indices might be a result of their focus on imprecise clinical parameters only such as Van Nuys Prognostic Index [4], or depending on molecular genetic signatures like Oncotype DX DCIS, which has been updated to incorporate other clinical data such patient age and DCIS size by the Genomic Health (www.genomichealth.com) [2]. However, the latter mRNA-based signatures ignore the potential effect of post-transcriptional and post-translational modifications resulting in the differential expression of proteins within the tumor cells and subsequent effects of such changes on their functions. One more limitation of these assays is the collective measurement of gene expression levels in the tested tumor tissue sample, with variable proportions of tumor and stromal elements, therefore, omitting or underestimating the role of tumor microenvironment. Cancer is not exclusively a disease of tumor cells, but a disease of wider derangement and crosstalk involving tumor epithelial cells, surrounding stroma and tumor microenvironment including immune infiltrate. Recently, the role of microenvironment has been magnified in cancer development and progression and considered as a main component of cancer hallmarks [7]. A supportive microenvironment could be crucial for remnant DCIS cells to survive therapy and develop invasive capabilities. Therefore, identification of more robust genetic and/or proteomic signatures incorporating the crosstalk between tumor epithelial cells and surrounding microenvironment might provide a better approach for DCIS risk assessment and hence better personalised management to avoid over- and under-treatment.

Extracellular matrix and its main constituent, collagen, plays a critical role in several biological processes in normal tissues. Collagen is aberrantly produced by cancer-associated fibroblasts, which reciprocally interact with tumor cells promoting the latter’s proliferation, migration and differentiation and, therefore, aiding cancer development and progression [8].

Collagen11A1 (COL11A1) is a type of fibrillary collagen encoded by the COL11A1 gene and is expressed mainly in cartilaginous tissue [9]. Its deficiency is associated with skeletal and chondroid disorders [10, 11]. It constitutes a minor component of the extracellular matrix and plays a role in its tensile strength. COL11A1 expression is associated with poor prognosis in non-small cell lung carcinoma and linked to its resistance to chemotherapy [12]. Moreover, COL11A1 promotes malignant cell proliferation and is associated with worse prognosis in many other types of cancer including ovarian cancer, squamous cell carcinoma of the head and neck [13, 14], pancreatic [15], gastric [16], colorectal [17] carcinomas, and some sarcomas [18]. It is also associated with metastasis in breast cancer [19] and was previously described as a diagnostic marker to differentiate between invasive and non-invasive breast cancer [20]. COL11A1 predicts invasive recurrence after primary diagnosis of breast intraductal papillomas [21], and is differentially expressed between invasive breast carcinoma and DCIS [22]. However, the role of COL11A1 in DCIS and its prognostic significance has yet to be investigated. In this study, we aimed to assess the pattern of COL11A1 expression and its association with DCIS behavior in large well-annotated DCIS cohorts composed of pure DCIS and DCIS associated with invasive disease.

Material and methods

Study cohort

A large well-characterized annotated cohort of DCIS including pure DCIS (n = 776) and DCIS mixed with invasive breast carcinoma (DCIS-Mixed) (n = 239) diagnosed between 1990 and 2012 at Nottingham City Hospital, Nottingham, United Kingdom was used in this study, as previously described [23]. To avoid selection bias, the DCIS-mixed cohort was selected with clinicopathological features comparable to the pure cohort regarding the range of age at diagnosis, DCIS nuclear grade, and presence of comedo type necrosis. All demographic, clinicopathological, management data, molecular subtype, tumor infiltrating lymphocytes density and hypoxia related markers in addition to local recurrence-free interval data were available and prospectively maintained [24,25,26]. Briefly, the molecular classes were defined based on the immunohistochemistry using estrogen and progesterone receptor, the human epidermal growth factor receptor 2 and Ki-67 proliferation index. Estrogen and progesterone receptor were considered positive if > 1% of tumor cells showed nuclear staining [27], while the human epidermal growth factor receptor 2 positivity was defined when more than 10% of tumor cells showed strong complete membranous staining (+3 score). Chromogenic in situ hybridisation technique was used to determine the gene amplification status in borderline cases [28]. Proliferation index was defined as low when < 14% of cells showed nuclear staining [29]. In addition, hypoxia inducible factor 1 alpha was evaluated using immunohistochemistry and was considered positive when > 1% of tumor cells showed nuclear staining as previously described [30] . Tumors with an average number of 20 lymphocytes/duct or more was considered as dense tumor infiltrating lymphocytes in DCIS as previously described [25]. Local recurrence-free interval was defined as the time (in months) between 6 months after the primary DCIS excision and the development of ipsilateral recurrence (either as DCIS or invasive breast carcinoma). Cases undergoing completion re-excision due to margins’ involvement or presence of residual tumor tissue within the first 6 months were not considered as disease recurrence. Moreover, all cases with contralateral tumors or ipsilateral new event but developed in another quadrant, with morphological features different from the primary tumor or with nuclear downgrade were not considered as a recurrence and censored at time of the new event occurrence. In the pure DCIS cohort and within a median follow up period of 103 months (range: 6–240), 83 (11%) cases developed recurrence [30 cases (36%) recurred as DCIS and 53 (64%) recurred as invasive disease]. The majority of the recurrences (n = 66) developed in patients treated with breast conserving surgery alone.

Immunohistochemistry

Tissue microarrays were prepared from both cohorts as previously described [23]. Briefly, 1 mm punch sets were used to construct the tissue microarray using an automated Grand Master 2.4-UG-EN tissue microarray machine (3DHISTECH, Budapest, Hungary). For better representation of cases with heterogeneous DCIS histological patterns and/or nuclear grade, different areas representative of such heterogeneous patterns were sampled. Prior to immunohistochemical staining, full-face tissue sections from 10 randomly selected cases were stained and assessed to evaluate the pattern of COL11A1 expression in malignant breast tissue and adjacent stroma and normal terminal duct-lobular units.

Primary antibody specificity for anti-human COL11A1 mouse monoclonal antibody, clone 1E8.33 [Ref#DMTX-0014, LOT#070912, Oncomatryx Biopharma S.L., Spain] was validated using Western blot on whole cell lysates of MCF7 and SKBR3 human breast cancer cell lines (obtained from the American Type Culture Collection; Rockville, MD, USA). COL11A1 antibody was used at a dilution of 1:1000, which showed a specific band at the predicted size of ~180 kDa.

Expression of COL11A1 protein was assessed by immunohistochemistry using the Novocastra NovolinkTM Polymer Detection Systems kit (Code: RE7280-K, Leica, Biosystems, UK). Tissue microarrays and full-face sections (4 µm) were stained with mouse monoclonal COL11A1 (dilution 1:150), incubated for 24 h after antigen retrieval using EDTA pH 9.0 in a water bath at 95 °C. Hyaline cartilage tissue section in bronchi was used as positive control while the negative control was applied by omitting the primary antibody and both were simultaneously stained with the staining run.

Scoring of COL11A1 expression

Cytoplasmic expression of COL11A1 in tumor epithelial cells and the surrounding stromal cells was assessed. Semi-quantitative Histo-score was applied for cytoplasmic expression of COL11A1 in epithelial cells (staining intensity was multiplied by the percentage of representative cells in the tissue for each intensity, producing a range of values between 0 and 300) [31]. The percentage of stromal fibroblasts showing cytoplasmic staining was estimated, as the intensity of staining within the scanty cytoplasm of the slender fibroblasts was difficult to assess consistently. The tumor epithelial cells and surrounding fibroblasts were assessed and scored separately. Cores containing < 15% of tumor epithelial cells and associated stroma were excluded from the scoring. All scored cores showed representative areas of specialized stroma (within two high power fields) [32] surrounding the malignant duct(s). In addition, a few cores with malignant epithelial cells without surrounding stroma were excluded. The cases excluded were unbiased. Cases with multiple cores were scored and the average final score was used for the analysis. Moreover, in the DCIS-mixed cohort, each component was evaluated and scored separately for the tumor epithelial cells and surrounding stroma. All cases were scored by two pathologists (MST and IMM) using a multiheaded microscope. For dichotomization of protein expression, outcome-dependent cut-off points for either malignant epithelial cells or stromal expression of COL11A1 were defined according to X-tile bioinformatics software (Yale University, version 3.6.1) [33] based on local recurrence-free interval in the pure DCIS cohort. High COL11A1 expression within tumor epithelial cells was considered when Histo-score was > 125, while expression in > 15% of the surrounding fibroblasts was considered as high expression.

Analysis of COL11A1 mRNA expression in breast cancer

To evaluate the prognostic role of COL11A1 in breast cancer and due to the scarcity of data on the genomic and proteomic profiles of DCIS, COL11A1 normalized mRNA expression was assessed as a potential predicative marker in the Molecular Taxonomy of Breast Cancer International Consortium cohort dataset [34], which comprises a large well characterised (n = 1980) cohort of invasive breast cancer with representative molecular profile. Furthermore, to validate the prognostic role of COL11A1 in publicly-available transcriptomic breast cancer data, analysis using the Breast Cancer Gene-Expression Miner v4.1 (bc-GenExMiner v4.1) database was carried out.

Statistical analysis

Statistical analyses were performed using SPSS v21 (Chicago, IL, USA) for Windows. Student’s t test and analysis of variance were used to correlate between COL11A1 mRNA level and other clinicopathological parameters in METABRIC data. Association with COL11A1 mRNA expression and breast cancer specific survival was performed after dichotomization of expression into high and low groups based on the median. Spearman’s Rho test was used to correlate between COL11A1 expression within the tumor and stromal cells; as well as their correlation with tumor infiltrating lymphocytes, Ki67, and the hypoxia inducible factor 1 alpha. Association between COL11A1 expression and clinicopathological parameters in pure DCIS was performed using Χ2, Mann–Whitney U and Kruskal–Wallis tests. Wilcoxon signed rank test was used to compare the expression of COL11A1 between DCIS component and invasive component within the DCIS-mixed cases. Univariate survival analysis against local recurrence-free interval was carried out using log rank test and Kaplan–Meier curves. Cox regression model was used for multivariate analysis of COL11A1 expression for all recurrences (DCIS or invasive breast cancer) and invasive recurrences. For all tests, a two-tailed p -value of less than 0.05 was considered as statistically significant.

Results

The pattern of COL11A1 expression

The evaluation of full-face tissue sections demonstrated even distribution of COL11A1 expression in the tumor epithelial cells and the surrounding specialized stroma throughout the whole section, indicating representability of tissue microarrays to assess its expression. Adjacent normal breast terminal ducto-lobular units showed negative or faint cytoplasmic staining. When present, COL11A1 was expressed in the cytoplasm of the epithelial tumor cells and/or surrounding fibroblasts (Fig. 1). Higher stromal COL11A1 expression was observed around DCIS with periductal stromal reaction.

Fig. 1
figure 1

Representative photomicrographs of COL11A1 IHC expression. a Normal breast ductolobular unit (×20) shows negative staining of COL11A1 in epithelial cells and occasional staining in the surrounding fibroblasts. b Negative COL11A1 expression (x20) in a pure DCIS case. c Strong expression of COL11A1 in tumor cells and surrounding fibroblasts (x20) in a pure DCIS case. d, e Expression of COL11A1 in a mixed case (x40) showing strong staining in invasive component within the tumor cells and/or surrounding stromal fibroblasts; and (f) negative expression of COL11A1 in a mixed case (×20)

After unbiased exclusion of uninformative cores (lost, folded or those containing < 15% tumor and surrounding stroma), the final number of cases suitable for scoring was 488 pure DCIS and 184 DCIS-mixed. COL11A1 expression showed a unimodal distribution. The median Histo-score of protein expression within the tumor was 100 in pure DCIS (range: 0–160), 110 in the DCIS component of mixed cases (range: 0–170), and 140 in invasive component of the latter (range: 0–200). For stromal expression, the median percentage of positive stromal cells was 0% in pure DCIS (range: 0–50%), 20% in the DCIS component of mixed cases (range: 0–80%) and 40% in the invasive component of the latter (range: 0–100%). Within the pure DCIS cohort, high COL11A1 expression was observed in 25 and 13% in tumor epithelial cells and surrounding fibroblasts, respectively. There was a positive linear correlation between expression of COL11A1 within the epithelial and surrounding stromal cells (Spearman’s correlation co-efficient r = 0.3, p < 0.0001).

The proportion of cases with positive COL11A1 was significantly greater in DCIS-mixed than pure DCIS, both within the tumor cells (25% of pure DCIS cases vs. 40% of DCIS mixed with invasive breast cancer, χ2 = 14.9, p < 0.0001) and stromal cells (13% for pure DCIS vs. 61% of DCIS mixed with invasion, χ2 = 26.3, p < 0.0001). Similar results were observed when the data were analyzed using a continuous scale (p < 0.0001 for both tumor epithelial cells and stromal cells). Moreover, there was a statistically significant difference between COL11A1 expression within the tumor cells and surrounding stromal fibroblasts of the DCIS component and invasive component of DCIS-mixed cohort (p < 0.0001 for both, using categorical and continuous data) (Fig. 2).

Fig. 2
figure 2

Violin plots showing differences of COL11A1 expression between pure DCIS and DCIS-mixed both in tumor cells (a) and surrounding stroma (b)

Significance of COL11A1 expression in pure DCIS

High expression of COL11A1 within the malignant epithelial cells and/or surrounding stromal fibroblasts in the pure DCIS was associated with hormonal receptor negativity, HER2 enriched and triple negative molecular subtypes. No association was observed between COL11A1 expression and the conventional clinicopathological parameters used in DCIS risk assessment such as age at diagnosis, tumor size, DCIS nuclear grade, and comedo necrosis (Table 1). Analysis of continuous data of COL11A1 expression showed similar results (Supplementary Table 1). Moreover, there was a positive linear correlation between COL11A1 expression within tumor cells and surrounding fibroblasts with tumor proliferation fraction as assessed by Ki67 (r = 0.3; p < 0.0001, r = 0.14; p = 0.01, respectively), hypoxia inducible factor 1alpha (r = 0.15; p = 0.03, r = 0.13; p = 0.01, respectively) and tumor infiltrating lymphocytes density (r = 0.15; p = 0.009, r = 0.2; p = 0.001, respectively).

Table 1 Correlation between COL11A1 expression with different clinicopathological parameters in the pure DCIS cohort

To assess the prognostic value of COL11A1 in invasive breast cancer, the Molecular Taxonomy of Breast Cancer International Consortium cohort [34] was used to correlate the levels of COL11A1 mRNA expression with the clinicopathological variables and outcome. Higher COL11A1 mRNA level was associated with younger age (p < 0.0001), lymph node metastasis (p = 0.001), HER2 positivity (p < 0.0001), HER2 enriched, triple negative and luminal B subtypes in addition to shorter breast cancer specific survival (HR = 1.2, 95% CI = 1.02–1.4, p = 0.01) (Supplementary Tables 2, and Supplementary Figure 1). Analysis using the Breast Cancer Gene-Expression Miner v4.1 (bc-GenExMiner v4.1) database showed that high COL11A1 mRNA was associated with significantly higher metastatic relapse and shorter overall survival (HR = 1.1, 95% CI = 1.02–1.3, p = 0.03; Supplementary Figure 2).

Outcome analysis in pure DCIS cohort

Higher COL11A1 expression within the epithelial tumor cells was associated with shorter local recurrence-free interval for all recurrences (DCIS and invasive breast cancer) (HR = 2.5; 95% CI = 1.4–4.2, p = 0.001) and showed a trend for invasive recurrences (HR = 1.8; 95% CI = 0.9–3.7, p = 0.08) (Fig. 3), while higher stromal expression was associated with shorter local recurrence-free interval in the whole cohort for all recurrences and invasive recurrences (HR = 15.9; 95% CI = 9.1–27.5, p < 0.0001 and (HR = 12.1; 95% CI = 6.1–24.0, p < 0.0001, respectively) (Fig. 4). Comparable results were obtained when the analysis was carried out with respect to different treatment groups (Fig. 4). In the pure DCIS cohort, patients treated with breast conserving surgery and followed by adjuvant radiotherapy showed better outcome than those treated with breast conserving surgery only either for all recurrences (HR = 0.5, 95% CI = 0.3–0.9, p = 0.014) or invasive recurrence (HR = 0.3, 95% CI = 0.2–0.8, p = 0.008) (Supplementary Figure 3). Interestingly, when the cohort was stratified according to COL11A1 expression, radiotherapy did not improve the outcome in patients with high stromal COL11A1 either in all recurrences (HR = 0.8, 95% CI = 0.5–1.6, p = 0.587) or for invasive recurrences (HR = 0.6; 95% CI = 0.2–1.8, p = 0.225) compared to patients with low COL11A1 expression (HR = 0.2; 95% CI = 0.2–0.9, p = 0.04 for all recurrences and HR = 0.4; 95% CI = 0.1–0.7, p = 0.03 for invasive recurrences) (Supplementary Figure 4). Supporting this, evaluation of the interaction between high stromal COL11A1*radiotherapy using a Cox regression model showed significant association with shorter local recurrence-free interval (HR = 4.2, 95% CI = 1.5–11.7, p = 0.006). Furthermore, high COL11A1 expression within the stromal fibroblasts was associated with poorer outcome in all DCIS groups irrespective of the DCIS nuclear grade (low, intermediate and high) for all recurrences and for invasive recurrences (all p < 0.0001). Supplementary Figure 5 shows forest plots for the univariate analysis of COL11A1 and other clinicopathological parameters and their association with outcome.

Fig. 3
figure 3

Kaplan–Meier curves showing that high expression of COL11A1 within the tumor epithelial cells is associated with ipsilateral local recurrence-free interval in the whole series either for all recurrences (a) or invasive recurrences (b), in breast conserving surgery without adjuvant radiotherapy (c: all recurrences and d: invasive recurrences) and in patients treated with breast conserving surgery followed by adjuvant radiotherapy (e: all recurrences and f: invasive recurrences)

Fig. 4
figure 4

Kaplan–Meier curves showing that high expression of COL11A1 within the surrounding stromal fibroblast is associated with ipsilateral local recurrence-free interval in the whole series either for all recurrences (a) or invasive recurrences (b), in breast conserving surgery without adjuvant radiotherapy (c: all recurrences and d: invasive recurrences) and in patients treated with breast conserving surgery followed by adjuvant radiotherapy (e: all recurrences and f: invasive recurrences)

Multivariate Cox proportional hazards analysis for the conventional clinicopathological parameters used to assess DCIS risk showed that patient’s age at diagnosis, DCIS size, nuclear grade, presence of comedo type necrosis and surgical resection margins were independent prognostic factors for disease recurrence. However, when expression of COL11A1 in stromal fibroblasts was included in the model, it was solely the independent poor prognostic factor for tumor recurrence in patients treated with breast conserving surgery regardless of known other determinants of high-risk DCIS; either for all recurrences (HR = 13.2, 95% CI = 6.9–25.4, p < 0.0001) or when the analysis was confined to invasive recurrences (HR = 11.2, 95% CI = 4.9–25.8, p < 0.0001), (Table 2).

Table 2 Multivariate survival analysis (Cox regression model) of variables (with and without COL11A1) predicting outcome in terms of ipsilateral local all recurrences (A) and invasive recurrences (B) in patients treated by breast conserving surgery in pure DCIS

Interestingly, when COL11A1 expression in the surrounding fibroblasts was incorporated with the other determinants of DCIS risk described by Van Nuys Prognostic Index [35], it provided better stratification for local recurrence risk, whereby high expression of stromal COL11A1 was associated with poorer outcome in all risk groups when compared to similar groups with low COL11A1 expression (HR = 1.9, 95% CI = 1.2–3.1; p = 0.004) (Supplementary Figure 6).

Discussion

Despite the advances in diagnostic and therapeutic modalities and the breakthrough in molecular genetics profiling, the underlying mechanisms promoting DCIS progression to invasive disease remain unclear and there is a desperate demand for better risk stratification tools. None of the currently available clinical indices or molecular signatures provide a reliable and clinically valid tool to predict DCIS risk of progression and/or recurrence to improve personalized management. Relying on the conventional methods for DCIS management by surgery with or without adjuvant radiotherapy for all patients based on such inadequate risk assessment leads to over- or under-treatment of a substantial proportion of patients. Furthermore, the biological and clinical heterogeneity of DCIS makes risk stratification quite challenging. Studying the role of DCIS microenvironment and the interaction between its various components and understanding how this affects disease behavior could resolve the DCIS dilemma and provide an adequate risk stratification model [36,37,38,39]. As crosstalk between tumor epithelial cells and extracellular matrix, including collagen remodelling, deposition and degradation is an essential key step in carcinogenesis and the progression of in situ tumors to invasive disease, studying potential markers that drive this process and their prognostic value is a credible approach to refine DCIS risk.

COL11A1 is a minor fibrillary collagen present mainly in cartilage, but it is produced at varying levels by other normal tissues as well as and under some pathological conditions. Interestingly, several studies reported that overexpression of COL11A1 by tumor cells and/or the surrounding cancer-associated fibroblasts is linked with tumor progression and poorer prognosis in a variety of cancers including invasive breast cancer [12,13,14, 16, 17, 40]. Moreover, in invasive breast cancer, some authors reported that COL11A1 is exclusively produced by invasive tumors [20], while others showed that it is expressed in normal, preinvasive and invasive tumors at different levels [40]. COL11A1 is differentially expressed between invasive breast cancer and DCIS [22], however, there is no previous study investigating its role in DCIS progression and its potential prognostic significance. Furthermore, using the Molecular Taxonomy of Breast Cancer International Consortium cohort for robust molecular data in a large number of invasive breast cancer, we have shown an association between aggressive behavior of invasive breast cancer and higher levels of COL11A1 mRNA. Taken together, these observations support our hypothesis that COL11A1 is a promising potential protein that could provide additional knowledge to DCIS behavior and might help in the stratification of disease risk.

Here we explored the expression of COL11A1 in a large well-characterized cohort of DCIS and scored the protein expression in tumor cells and surrounding stromal fibroblasts. Interestingly, high COL11A1 expression was associated with some features of high-risk DCIS, supporting its role in DCIS progression. Additionally, our data showed that COL11A1 expression was higher in DCIS co-existing with invasive breast cancer than pure DCIS, and much higher in the invasive component both within the tumor cells and in the surrounding stroma.

The prognostic value of COL11A1 was shown with a shorter local recurrence-free interval in patients with higher levels of COL11A1 expression, particularly stromal expression, independently from other clinicopathological factors. These findings were consistent for all recurrent events including DCIS and invasive breast cancer, and also when the analysis was confined to invasive recurrences only, which provides more evidence that COL11A1 could play a key role in DCIS progression to invasive disease. Although it was thought that COL11A1 is exclusively produced by cancer-associated fibroblasts, other studies showed that COL11A1 was expressed in tumor cells as well as the surrounding stroma [22, 40]. Our study showed that expression of COL11A1 in tumor epithelial and stromal cells is associated with disease recurrence; a finding that might reflect the epithelial-stromal interaction and its role in progression of DCIS. This interaction is supported by the obvious increase of COL11A1 expression in stromal and epithelial cells surrounding the invasive component compared to those surrounding the DCIS component in mixed cases or those surrounding pure DCIS. In our cohort, we have observed that higher expression COL11A1 is associated with periductal stromal reaction around DCIS, a finding that was reported to be associated with invasive recurrence in DCIS [41]. However, further functional studies are highly recommended to understand the underlying mechanisms and functions of COL11A1 expression in carcinogenesis and tumor progression either from the tumor cells or the surrounding stroma.

Multiple authors have noted a group of low-grade DCIS with indolent appearance and low proliferation index that yet carries progression potential to invasive breast cancer [42,43,44]. An explanation of disease progression based exclusively on intrinsic epithelial tumor cell-related factors is inadequate and identifying other candidate potential markers that could explain progression of different phenotypes and hence better risk stratification is highly warranted. Our findings showed that there was no association between high COL11A1 expression and other conventional clinicopathological factors that are described as a surrogate for high-risk DCIS such as younger age at diagnosis, high grade DCIS, comedo type necrosis and mode of presentation. In addition, high expression of stromal COL11A1 showed an association with recurrence regardless the grade of the DCIS or method of therapy. These findings show that COL11A1 is a very promising protein for better DCIS risk stratification and understanding of the disease biology. Moreover, incorporation of COL11A1 with the other clinicopathological factors described in the Van Nuys Prognostic Index provided a better stratification of different risk groups. These findings indicate that COL11A1 is a marker that could be used to better define high-risk DCIS and identify of patients who need more radical treatment e.g. breast conserving surgery with wider excision margins or mastectomy.

Although the effect of radiotherapy in reduction of DCIS recurrence in undeniable [45,46,47,48], there is still a considerable proportion of patients treated with breast conserving surgery and followed by adjuvant radiotherapy who develop disease recurrence. There are no data available for DCIS patients at higher susceptibility to show poor outcome after adjuvant radiotherapy management. Our findings showed that high expression of stromal COL11A1 is associated with higher rate of recurrence even with radiotherapy compared to DCIS with lower levels of expression, which indicates that COL11A1 could provide a signature for radiotherapy resistance in DCIS. However, the number of cases with recurrence after radiotherapy, especially invasive recurrence, is low, and this finding needs to be assessed in an independent cohort. Studies showed that dense stroma is associated with poor response to radiotherapy in cancer treatment [49, 50], which might provide a similar explanation in DCIS. These findings warrant more validation and mechanistic studies to decipher the underlying mechanisms and biology.

Our data support a key role for COL11A1 in progression to invasive disease and yet high levels of COL11A1 were highly significantly related to invasive recurrence. Thus far, understanding of the biological processes which involve COL11A1 in cancer progression is lacking, although we hypothesize that high stromal COL11A1 may in some way enable individual tumor cell invasion and/or remnant cell survival beyond the surgical margin, leading to invasive recurrence. Collagen provides a scaffold for the tumor cells and helps in tumor growth, angiogenesis and invasion [8]. These findings are supported by the association between COL11A1 and proliferation and hypoxia related proteins in our cohort. The link between hypoxia and increased collagen deposition and remodelling through prolyl-hydroxylases and lysyl oxidase enzymes is well studied even in DCIS [26, 51, 52]. One of the potential mechanisms that COL11A1 could use in promoting cellular proliferation is its association with the Smad signalling pathway via binding to integrin [12]. In addition, it was thought that collagen acts as a barrier against tumor invasion, but now it is proven that it has multifaceted roles and could promote tumor progression. Increased collagen deposition promotes tumor progression through destabilization of cell polarity and cell-cell adhesion, and enhancement of growth factor signaling. Tissue tension regulates the epithelial–mesenchymal transition through crosslink between tumor cells and stromal cells [53, 54].

Although the role of collagen in inflammatory processes and tissue healing is obvious, the specific role of COL11A1 in such conditions is unclear. Our data showed a correlation between higher COL11A1 expression mainly in stromal cells and dense inflammatory cell infiltrates. The underlying biology is warranted to be investigated, as we previously reported that dense tumor infiltrating lymphocytes have poor prognostic significance in DCIS, a phenomenon different from invasive disease and for which the underlying mechanisms are unclear [25]. COL11A1 may interact with the inflammatory related markers; for instance, cytokines and or interleukins, and affect DCIS behavior.

Conclusion

Extracellular matrix remodelling is an essential feature associated with DCIS that could lead to either promoting or circumventing its progression to invasive disease. COL11A1 might have a potential role in DCIS aggressiveness through its collagen remodeling and regulatory mechanism in cellular proliferation. Additional functional studies to decipher the role of COL11A1 and its mechanism of action in DCIS behavior are indicated. Applying COL11A1 in clinical practice, especially stromal expression, would provide a highly promising prognostic indicator for DCIS invasive recurrence and may be a potential predictive marker for radiotherapy response. As a consequence, the group of patients with high stromal COL11A1 (13% of cases) could require more extensive surgery, regardless of other factors.

Limitations of the study

This study has been carried out on tissue microarray sections, which might underestimate the role of tumor heterogeneity. However, all cases in our cohort were histologically reviewed before tissue microarray construction and we used multiple cores for cases with heterogeneous grades or morphological patterns. Moreover, our cohort did not include any patients treated with endocrine therapy, and the number of recurrences after radiotherapy was small. Further validation studies on larger cohorts especially with radiotherapy treated patients is highly warranted.