Introduction

Ductal carcinoma in situ (DCIS) of the breast accounts for 10–20% of all newly detected breast cancer cases, which is a non-obligate precursor of invasive ductal carcinoma (IDC)1,2. Patients diagnosed with DCIS always have a favorable prognosis with a 20-year breast cancer-specific mortality rate of approximately 3.3%3.

Several trials were conducted to compare the active surveillance with conventional surgery for low-risk DCIS4. However, nearly a quarter of patients diagnosed with pure DCIS at core needle biopsy (CNB) upstaged to IDC at surgery, which is defined as pathological underestimation5,6. Carefully selecting eligible patients for watchful waiting and excluding those with an occult invasive component is critical. Meanwhile, pure DCIS has no potential of metastasis, thus lymph node staging at surgery is not routinely indicated2,7. But for patients with a pathological underestimation, re-operation for axillary status evaluation is required.

Currently, the systemic individualized treatment of breast cancer is guided by molecular subtype, which was defined according to the expression status of estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2), and Ki-67. The molecular phenotype presenting in DCIS could independently predict the disease outcome and invasive recurrence. For some micro-invasion lesion which accounts for only 1% of all breast cancer cases, the small size of the invasive foci makes it difficult to perform the pathological examination. For those patients, whether the pathological interpretation of in situ component could be taken as a reference for the micro-invasive component remains to be studied.

Several studies had reported predictive factors related to pathological underestimation, including clinical, radiological, and histopathological characteristics. However, much less has been reported on the comparison of the status of ER, PR, HER2, and Ki-67 between paired CNB sample and surgical specimen in patients with a final diagnosis of both DCIS and IDC separately5,7,8,9,10,11,12. In the current study, we aim to evaluate the pathological underestimation in patients diagnosed with DCIS at CNB as well as to investigate the accordance rate for biomarkers between paired CNB samples and surgical specimens.

Results

Patient population and baseline characteristics

A total of 1226 patients diagnosed with DCIS at biopsy were reviewed, and 735 patients were included eventually (Fig. 1). Pure DCIS was confirmed in 343 (46.7%) patients after surgery and the other 392 patients (53.3%) upstaged to IDC (including microinvasion). Baseline characteristics were listed in Table 1. The median age at diagnosis was 52 (range, 24–89) years with 417 patients (56.7%) > 50 years. Over the 718 patients with data of BMI, 561 patients (78.1%) were < 25.00 kg/m2. Among all patients, 340 (46.3%) were pre- or peri-menopausal, and 395 (53.7%) were postmenopausal. Totally, 161 patients (21.9%) had benign breast disease history, 36 patients (4.9%) had past malignant tumor history, and 57 (7.8%) patients had a family history of breast cancer. There were 667 patients (90.7%) having a mass at clinical examination, 392 patients (53.3%) having calcification on imaging manifestation, and 99 patients (13.5%) having nipple discharge or erosion as clinical symptom. Regarding the radiological characteristics, 34.8% (245 out of 704) of patients were ≥ 5 BI-RADS category by US, 71.5% (418 out of 585) were ≥ 4B BI-RADS category by MMG, and 32.5% (194 out of 597) were ≥ 5 BI-RADS category by MRI. A total of 691 (94.0%) patients underwent CNB and 44 (6.0%) received VAB. Regarding the pathological characteristics, low or intermediate- grade tumors accounted for 28.2%, high-grade accounted for 16.2%, and the other 409 patients were unknown for the tumor grade. The IHC results showed that 57.3% of patients were ER-positive, 44.8% of patients were PR-positive, 47.5% were HER2-positive, and 48.9% had Ki-67 ≥ 14%.

Figure 1
figure 1

Flow chart. DCIS = ductal carcinoma in situ, LN lymph node, FNAC fine needle aspiration cytology.

Table 1 Baseline characteristics of 735 patients.

Characteristics associated with pathological underestimation

Univariate analysis demonstrated that Imaging manifestations of calcification (57.4% vs. 48.7%, P = 0.022), US BI-RADS ≥ 5 (39.1% vs. 29.7%, P = 0.011), MMG BI-RADS ≥ 4B (77.8% vs. 64.2%, P < 0.001), MRI BI-RADS ≥ 5 (38.8% vs. 25.4%, P < 0.001), larger tumor size (P = 0.001), and higher nuclear grade at biopsy (P = 0.005) were significantly correlated with pathological underestimation. Furthermore, patients underwent pathological underestimation were more likely to be biopsied by CNB, with the underestimate rate 54.4% in the CNB cohort and 36.4% in the VAB cohort, respectively (P = 0.028). Regarding the IHC assessment at biopsy, PR negativity (58.8% vs. 51.0%, P < 0.001) and Ki-67 ≥ 14% (52.5% vs. 44.8%, P = 0.058) were marginally significant associated with pathological underestimation (Table 1).

In multivariate analysis, MRI BI-RADS ≥ 5 (vs MRI BI-RADS < 5, OR 2.03, 95% CI 1.30–3.15, P = 0.002), tumor size (> 2.0 cm vs ≤ 2.0 cm, OR 1.94, 95% CI 0.93–4.08, P = 0.079; > 5.0 cm vs ≤ 2.0 cm, OR 1.79, 95% CI 1.06–3.02, P = 0.028), and nuclear grade at biopsy (high vs low/intermediate, P = OR = 2.01, 95% CI 1.27–3.18, P = 0.003) remained significantly associated with pathological underestimation. Whereas, calcification, US BI-RADS category, MMG BI-RADS category, biopsy method, PR status, and Ki-67 results at biopsy were no longer significant in multivariate analysis (P > 0.05, Table 2).

Table 2 Multivariable analysis of baseline characteristics for pathological underestimation.

Concordance between CNB and surgery for biomarkers in patients without pathological underestimation

Among 342 patients remaining DCIS after surgery, there were 45 patients with unknown ER, PR status, 175 patients with unknown or uncertain in HER2 status, and 48 patients having unknown results for the Ki-67. Evaluation of ER status between biopsy and surgery had a concordance rate of 89.6%, with a good overall agreement (κ = 0.786). The concordance rates for PR and HER2 were 91.9% and 94.8%, with a κ value of 0.839 and 0.886, respectively. When comparing with lesions at biopsy, there were more Ki-67 highly expressed lesions detected at surgery, and the concordance rate was 76.4% (κ = 0.530, Table 3, Fig. 2A).

Table 3 Concordance between biopsy and surgery for biomarkers in patients without pathological underestimation.
Figure 2
figure 2

Biomarkers concordance rates between CNB and surgery. (A) Concordance rates for ER, PR, HER2, and Ki-67 between DCIS at CNB and surgery in patients without pathological underestimation. (B) Concordance rates for ER, PR, HER2, and Ki-67 between DCIS at CNB and in situ components at surgery in patients with pathological underestimation. (C) Concordance rates for ER, PR, HER2, and Ki-67 between DCIS at CNB and invasive components at surgery in patients with pathological underestimation. (D) Concordance rates for ER, PR, HER2, and Ki-67 between in situ components and invasive components at surgery in patients with pathological underestimation. DCIS ductal carcinoma in situ, CNB core needle biopsy, ER estrogen receptor, PR progesterone receptor, HER2 human epidermal growth factor receptor 2, Ki-67 proliferation index.

When categorized into five subtypes, 13.2%, 9.0%, 19.8%, 56.3%, and 1.8% of the patients were categorized as Luminal A-like, Luminal B-like (HER2-negative), luminal B-like (HER2-positive), HER2-positive (non-luminal), and triple-negative at biopsy, respectively. There were 13 out of 32 patients (36.4%) with Luminal B-like (HER2-positive) breast cancer at biopsy classified as other subtypes at surgery. Whereas, only and 8 out of 94 patients (12.8%) with HER2-positive (non-luminal) lesions had different molecular subtypes at surgery. Overall, the concordance rate was 80.2% with a good agreement (κ value = 0.684) between biopsy and surgery (Table S1).

Photographs for discordant IHC results in terms of PR, HER2, and Ki-67 in patients without pathological underestimation were shown in Figure S1.

Concordance between biopsy and surgery for biomarkers in patients with pathological underestimation

There were 392 patients upstaged to IDC at surgery, among which 212 had IHC results for in situ component. Patients with unknown results for IHC were excluded from the analysis.

Regarding in situ component in IDC + DCIS lesions, concordance rates were 86.4% for ER status, 93.2% for PR status, 98.2% for HER2 status, and 76.3% for Ki-67 result, respectively (Fig. 2B). And the overall agreements were also good for receptor status (ER: κ = 0.712, PR: κ = 0.864, HER2: κ = 0.957) and moderate for Ki-67 results (κ = 0.522, Table 4). The molecular subtype had concordance rate of 72.9%, with only good agreement (κ value = 0.611). HER2 positive (non-luminal) DCIS had a concordance rate of 86.5% (90 out of 104 patients) between biopsy and surgery, which ranked the highest among five molecular subtypes. Luminal A-like DCIS only had a concordance rate of 63.3% (19 out of 0 patients, Table S2).

Table 4 Concordance between CNB and surgery for biomarkers in patients with pathological underestimation.

Evaluation of ER, PR, HER2, and Ki-67 at biopsy had a concordance rate of 85.0%, 87.1%, 94.3%, 71.0% with the invasive component at surgery (Fig. 2C). The overall agreements were good for receptor status (κ = 0.699, κ = 0.730, and κ = 0.870, respectively) and moderate for Ki-67 results (κ = 0.412, Table 4). The molecular subtype had a good agreement with a κ value of 0.635, and the concordance rate between biopsy and surgery was 75.0%. Among the 11 patients diagnosed with luminal B-like (HER2-negative) DCIS at biopsy only 21 (63.6%) remained the same molecular subtype at surgery. HER2-positive (non-luminal) lesions showed the highest concordance rate (86.3%, 44 out of 51 patients) among five subtypes (Table S3).

In patients who underwent pathological underestimation, discordance in IHC results was shown in Figure S2.

Concordance between in situ and invasive component for biomarkers in IDC + DCIS patients

A total of 212 patients were compared for the in situ and the invasive component of the same tumor tissue in terms of ER, PR, and Ki-67. And HER2 status was compared after excluding an additional 60 patients with uncertain results for HER2 expression. The concordance rate was 92.9% for ER status, 93.8% for PR status, 97.4% for HER2 status, and 86.5% for the Ki-67 (Fig. 2D). And the above biomarkers all showed good agreement, with κ values of 0.857, 0.875, 0.945, and 0.723, respectively (Table 5). As for the molecular subtype, the in situ and the invasive component also showed good agreement (κ = 0.820) with a concordance rate of 87.5%. Of note, for the 7 patients diagnosed with triple negative breast cancer, the concordance rate between in situ component and invasive component was 100%. However, the in-situ component classifying as Luminal B-like (HER2-positive) only had a concordance rate of 71.9% (23 out of 32) with the invasive component (Table S4).

Table 5 Concordance between in situ and invasive component for biomarkers in IDC + DCIS patients.

The Figure S3 showed the discordant IHC results in terms of ER in synchronous DCIS and IDC within a surgery sample.

Discussion

In the current study, we included 735 patients with DCIS at CNB and demonstrated that pathological underestimation was independently associated with tumor size, MRI BI-RADS category, and nuclear grade at CNB. Good concordances were observed between CNB and surgery in terms of ER, PR, and HER2 status irrespective of the presence of pathological underestimation or not. Whereas, there were more Ki-67 highly expressed lesions detected at surgery compared with at CNB. Regarding lesions with DCIS and IDC components, the overall agreement was also good for ER, PR, HER2, and Ki67 analysis between in situ and invasive components.

The majority of the patients (94.0%) in our center were biopsied with 14-G CNB, and the rate of pathological underestimation was 53.3%. However, previous studies reported that the underestimation rate was approximately 15–20% by using the VAB13,14. The method of the biopsy would influence the presence of pathological underestimation. Kim reported that the underestimation rate was 49.8% for the CNB and 29.2% for the vacuum method (P < 0.001)8. And in Park’s study, the underestimate rate was 50.0% and 18.8%, respectively (P < 0.001)15. Our result was consistent with previous data for the CNB cohort, and univariate analysis also showed that the underestimate rate was lower in patients who received VAB than CNB. Indeed, VAB by using Mammotome (8-gauge) and Mammotome elite (13-gauge) could provide more tissue samples, which can increase the diagnostic accuracy of biopsy. However, the Mammotome is not covered by the Chinese health insurance or government, and the Mammotome elite was just covered by the health care program at the end of the year 2018. Thus, when considering the treatment cost, both surgeons and patients may prefer 14-gauge CNB over VAB. Furthermore, using stereotactic biopsies instead of sonographic guidance may lower the risk of pathological underestimation (21.8% vs. 39.9%)5. It was acknowledged that DCIS is always associated with microcalcifications but not mass-like lesions. As was reported, mammography screening could increase the recall rate of suspicious microcalcifications, as well as the incidence of DCIS. However, there was lack of mammography national screening programs in China, thus the most common complaint was breast lump. In the current study, nearly 90% of the patients had mass-like lesions, thus we used the ultrasound-guided biopsy to diagnosis breast disease. Notably, even with an underestimation rate of 15%, it is still not appropriate to use the pathological results of biopsy specimen as a definitive diagnosis, as well as to safely guide the active surveillance or omit the axillary surgery. Therefore, identifying patients who are at high risk of pathological underestimation is of great importance.

Several factors were reported to be associated with pathological underestimation in previous studies, including palpability, high nuclear grade, BI-RADS category 5, HER2 positivity, Ki-67 overexpression, suspicious invasion, mammographic mass finding, and radiological tumor size ≥ 2.0 cm5,7,8,9,10,11,12. In the current study, we found that MRI BI-RADS ≥ 5, maximum mass diameter > 5.0 cm, and high nuclear grade were independent predictors for pathological underestimation. The presence of high nuclear grade and at CNB have been shown to be associated with an invasive component at surgical specimen5,16. Meanwhile, it has also been reported that high-grade DCIS developed to the invasive tumor more quickly than low-grade lesions17. This may indicate that high-grade DCIS was at a more advanced stage during progression, thus was more likely to harbor co-existing invasive component. Regarding the imaging feature, previous studies have suggested that breast MRI was highly sensitive for detecting and evaluating breast cancer of various types. DCIS often manifests as non-mass-like enhancements with segmental or ductal distribution, and presents clumped internal architecture in the MRI imaging14,18,19,20,21. The common manifestation of DCIS included lump as well as segmental distribution in the current study. Kim et al. reported that tumor size > 2.0 cm was an independent predictor of underestimation, which was consistent with the present findings8. A plausible reason may be that the larger target area for sampling increases the possibility of sampling error11. This suggests that increasing the amount of sample collection by using a thicker needle or increasing the number of cores examined may help to improve the diagnostic accuracy for large tumors.

Treatment patterns of breast cancer, including DCIS, relies on molecular subtype, which was determined by ER, PR, HER2 status, and Ki-67 index22,23,24,25,26. However, unlike invasive breast cancer, the IHC assessment of biomarker expression is less performed for DCIS at CNB. In the current study, in patients with a final diagnosis of DCIS at surgery, we observed good concordance rates regarding ER (89.6%), PR (91.9%), and HER2 (94.8%) status. Whereas, Ki-67 only showed a moderate agreement. Similar results had been constantly reported for invasive carcinoma. Tamaki et al. reported that CNB can provide reliable information on ER (κ = 0.82), PR (κ = 0.66), and HER2 (κ = 0.64) status of patients26. A meta-analysis study also demonstrated the high diagnostic accuracy of CNB in evaluating ER, PR, and HER2 status compared with open excision biopsy in breast cancer patients27. The poor agreement in terms of Ki-67 had also been recorded for invasive breast cancer28, which may due to the poorer fixation of surgical specimens compared with CNB ones and intra-tumor heterogeneity29. Meanwhile, the wound response of biopsy may also cause an increase in Ki-67 expression, since the invasive diagnostic procedure may accelerate the tumor growth30. Another point is the highly inconsistent measurement of Ki-67 among testing laboratories and pathologists, and further efforts were needed to set universally recognized cutoff for this biomarker.

In patients with a pathological underestimation, both the in situ components and the invasive components in surgery specimens showed good agreements (κ > 0.6) with CNB samples in terms of ER, PR, and HER2 status, which has rarely been evaluated in previous studies. Additionally, high concordance rates were also observed in terms of ER (92.9%), PR (93.8%), HER2 (97.4%), Ki-67 (86.5%) between the invasive component and in situ component at the surgery. Several studies investigated the genetic alteration between the in situ and invasive components of a tumor and found that they share a high degree of similarity31,32,33,34. Meanwhile, Schuetz et al. investigated the biology of transition from DCIS to IDC and identified progression-specific candidate genes35. Those to some extents indicated that the co-existing IDC was evolved from the in situ component, but not due to the intra-tumor heterogeneity. Thus, for tumors with micro-invasive foci that lack sufficient sample to complete the pathological analysis, IHC results from the in situ component can be used for tailoring subsequent treatment.

In the current study, we evaluated factors associated with pathological underestimation in a large cohort of patients diagnosed with pure DCIS at CNB and compared the concordance rate of IHC results between CNB and surgery. However, there were several potential limitations. First, selection bias may serve as an inevitable problem since this was a retrospective study, although we performed multivariate analysis to narrow this effect. Second, with data from a single institute, our results may not be appropriate when being extrapolated to other populations, and multicenter studies warrant consideration. Last but not least, the biopsy method, as well as the number of cores examined, were not considered in the current study. Thus, our results may only have meaning to estimate the upgrade of DCIS by using CNB, and further efforts were needed to evaluate the pathological underestimation by using VAB.

In conclusion, for patients diagnosed with DCIS at CNB, those who had MRI BI-RADS ≥ 5, high nuclear grade, and tumor size ≥ 5.0 cm were more likely to underwent pathological underestimation. ER, PR and HER2 status showed a high concordance rate between CNB and surgery, regardless of the presentation of pathological underestimation.

Methods

Patient population

Patients diagnosed with DCIS at biopsy from Jan. 2009 to May. 2020 at Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China were retrospectively reviewed. Data of patients were extracted from Shanghai Jiao Tong University Breast Cancer Database (SJTU-BCDB, registration number of the State Copyright Administration: 2015SR199280), including age at diagnosis, body mass index (BMI), menopausal status, history of benign breast disease, history of malignant disease, family history of breast cancer, clinical symptoms and signs, radiological data, pathological data, as well as immunohistochemical (IHC) results of ER, PR, HER2, and Ki-67. Patients were included if they met the following criteria: (1) received both biopsy and surgery in our center; (2) female gender; (3) pathologically confirmed pure DCIS at biopsy. The following exclusion criteria were applied: (1) pathologically confirmed axillary metastasis preoperatively; (2) had invasive or micro-invasive component confirmed at biopsy; (3) underwent excisional biopsy; (4) had missing data for surgery.

Clinical and radiological examination

Clinical examination was recorded according to the initial physical examination after hospitalization. Ultrasound imaging examinations used a MyLab 90 ultrasound system (Esaote, Genoa, Italy). Mammography (MMG) imaging examinations were applied a full-field digital flat-panel mammography machine (Senographe DS, GE, USA), allowing for the high-resolution bilateral collection of craniocaudal (CC) and mediolateral oblique (MLO) images. MR imaging examinations were performed by 1.5-T MR imager (Magnetom Aera; Siemens Healthcare, Erlangen, Germany). Imaging assessment was referring to the ACR BI-RADS® Atlas Fourth Edition (before 2013) and ACR BI-RADS® Atlas Fifth Edition (since 2013)36. The maximum mass diameter was defined by using ultrasound imaging examination.

CNB, vacuum-assisted biopsy (VAB), and surgical specimen

The CNB were performed by using 14-gauge (G) automated biopsy guns (Magnum, BARD, Covington, U.S). The VAB were performed by using the Mammotome system with 8-G biopsy guns (Mammotome EX, Devicor Medical Products,Inc., USA) or 13-G biopsy guns (Mammotome elite, Devicor Medical Products,Inc., USA). The procedure was guided by sonographic device (sonosite S-Women’s health). CNB samples were fixed immediately in adequate volume of 4% buffered formaldehyde for at least 6 h according to the American Society of Clinical Oncology/College of American Pathologists (ASCO/CAP) guidelines, and embedded in paraffin for histopathological analysis37. The surgical specimen was cut into 1-cm-thick slices, fixed within 1 h, and followed by paraffin embedding.

Pathological and IHC analysis

Pathological and IHC analyses were conducted at the Department of Pathology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine. Histopathological evaluation was according to the World Health Organization classification38. Pure DCIS was recognized if the neoplastic proliferation of epithelial cells was confined to the mammary ductal-lobular system. T1mic referred to DCIS with a microscopic focus of invasion ≤ 0.1 cm in the longest dimension38. DCIS + IDC was defined as the presence of both in situ component and invasive component39.

The IHC assessment of ER, PR, HER2, and Ki-67 was performed on 4 μm- formalin-fixed, paraffin-embedded tumor samples from CNB sample and surgical specimen, with the following antibodies: ER (SP1, Dako), PR (PgR 636, Dako), HER2 (4B5, Roche), and Ki-67 (MIB-1, Dako). ER-positivity (ER +) and PR-positivity (PR +) were defined as ≥ 1% positive tumor cells with nuclear staining. Ki-67 was characterized as the proportion of positive nuclear staining cells within at least 1000 tumor cells counted29. HER2-positivity (HER2 +) was referring to the 2018 ASCO/CAP guidelines, described as 3 + by IHC, or 2 + by IHC and HER2 amplified by fluorescence in situ hybridization (FISH). Cases with 2 + by IHC without results of FISH was regarded as uncertain. The molecular subtype was according to the 2013 St. Gallen breast cancer consensus as follow: Luminal A-like: ER + , PR + , HER2- and Ki-67 < 14%; Luminal B-like (HER2 negative): ER + , HER2-, PR < 20% or/and Ki-67 > 14%. Luminal B-like (HER2 positive): ER + or/and PR + , HER2 + ; HER2-positive (non-luminal): ER-, PR-, HER2 + ; Triple-negative: ER-, PR-, and HER2-40. IHC results were assessed based both on the invasive component and in situ component in DCIS + IDC groups.

Statistical analysis

Categorical variables between patients with and without underestimation were compared by using the Fisher’s exact test. Univariate and multivariate logistic regression analyses were performed to identify clinicopathological characteristics related to pathological underestimation with odds ratio (OR) and 95% confidence interval (CI). Concordance analysis of receptor status and Ki-67 between CNB and surgical specimen was calculated by the kappa test. A κ value > 0.6 is considered as having with good agreement, value between 0.4 and 0.6 suggests moderate agreement, value < 0.4 indicates fair agreement, and value < 0.2 indicates poor agreement. Two-sided P values < 0.05 were considered statistically significant. Statistical analyses were carried out by using the IBM SPSS statistics software version 25.0 (SPSS, Inc., IL, USA) and graphing were conducted by GraphPad Prism version 7.0 (GraphPad Software, CA, USA).

Ethics approval

The study was approved by the Ethical Committees of Ruijin Hospital, Shanghai Jiao Tong University School of Medicine. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Informed consent was obtained from all individual participants included in the study.