Introduction

Breast cancer is a heterogenous disease, which includes several histologic morphologies1. Invasive ductal carcinoma (IDC), the most common form of breast cancer (BCA), comprises ~80% of cases. Invasive lobular carcinoma (ILC), the second most prevalent BCA histology, represents about 10–15% of cases2,3. ILC is characterized by E-cadherin loss, and is typically hormone receptor-positive and HER2-negative2,4. Early ILC is more difficult to detect mammographically than IDC5, and advanced ILC is associated with a predilection for metastases to the peritoneum, gastrointestinal tract, and meninges4,6,7. Despite these differences, IDC and ILC are typically managed similarly.

The 21-gene Oncotype DX gene expression assay is prognostic for recurrence and predictive of chemotherapy benefit in early HR + HER2-negative BCA8,9,10,11,12. The TAILORx and RxPONDER trials established that postmenopausal women with HR + BCA and Oncotype DX recurrence score (RS) < 25 involving up to three axillary nodes do not benefit from chemotherapy, whereas premenopausal women with positive axillary nodes or with negative nodes and RS 16–25 may derive some chemotherapy benefit that has been ascribed to chemotherapy-induced amenorrhea8,13. ILC is typically associated with low to intermediate RS4,14,15,16, and with poorer response to chemotherapy than IDC17,18,19. We sought to evaluate differences in clinicopathologic characteristics, RS and chemotherapy benefit between IDC, ILC, and carcinomas of mixed histologies.

Results

Patient characteristics

As shown in Fig. 1, of 2,696,734 women with breast cancer in the NCDB database, 74,472 patients met inclusion criteria for this analysis; 62,395 (83.8%) node negative (N0) and 12,077 (16.2%) with up to three involved axillary nodes (N1). 57,615 patients (77.4%) had IDC; 8693 (11.7%) ILC; 5393 (7.2%) DLC; 2459 (3.3%) DOC; and 312 (0.4%) LOC.

Fig. 1: CONSORT diagram for patient selection.
figure 1

NCDB National Cancer Database, HR hormone receptor, N2, 4–9 involved axillary nodes, N3 10 or more involved axillary nodes, RS 21-gene recurrence score.

Associations between tumor subtypes and clinicopathologic characteristics

Associations between tumor subtypes and clinicopathologic characteristics are summarized in Table 1. Though statistically significant, variations in median age, race, and ethnicity among tumor subtypes were small in magnitude. DOC and LOC were slightly more common among Black women than other tumor subtypes, and DOC was slightly more common in Hispanics. IDC was associated with smaller tumor size. 79.2% of IDC measured 20 mm or less, compared with 65.5% of ILC, 71.2% of DLC, 74.7% DOC, and 70.4% LOC (p < 0.0001). IDC was also most likely to be high grade, while ILC was least. 18% of IDC were high grade, compared with 5.3% of ILC, 11.0% of DLC, 10.2% DOC, and 11.2% LOC (p < 0.0001).

Table 1 Patient characteristics.

Associations between tumor subtypes and RS

Associations between tumor subtypes and RS are shown in Fig. 2. Significant differences in RS distribution among tumor subtypes were seen. Lobular histology was associated with a lower incidence of high RS (>26), with rates of high RS of 7.2% and 8.7% for ILC and LOC, respectively, compared with 16.9% for IDC and 12.7% for DOC (p < 0.0001). Tumors containing both ductal and lobular features (DLC) had a 9.6% incidence of RS > 26, somewhat higher than seen in ILC and LOC, but lower than rates for IDC and DOC. DOC was associated with the highest likelihood of low RS of 0–10, with 28% of DOC having RS 0-10, compared with 22.3%, 19.7%, 21.6%, and 22.4%, for IDC, ILC, DLC, and LOC, respectively (p < 0.0001). Among the tumor subtypes, ILC was least likely to have both high and low RS.

Fig. 2: Distribution of RS among breast cancer histologic subtypes.
figure 2

Distribution of low (0–10), intermediate (11–25), and high (26–100) RS in IDC, ILC, DLC, DOC, and LOC (p < 0.0001).

Associations between tumor subtypes and cancer therapy

Associations between tumor subtypes and treatment are also summarized in Table 1. Presence of lobular histology was associated with increased likelihood of mastectomy compared to breast conserving surgery. 44.6% of patients with ILC and 44.6% of patients with LOC required mastectomy, compared with 31.5% of IDC and 36.4% of DOC. Again, DLC exhibited a mastectomy rate that was intermediate between IDC and ILC, with 42.0% of IDC patients undergoing mastectomy.

IDC was associated with increased likelihood of use of adjuvant chemotherapy. 27.4% of IDC patients received chemotherapy, compared with 19.3%, 21.9%, 20.5% and 19.2% for ILC, DLC, DOC, and LOC, respectively (p < 0.0001). Chemotherapy use was highly associated with high RS in the full patient cohort (p < 0.0001) (Fig. 3).

Fig. 3: Association of adjuvant chemotherapy use with RS.
figure 3

Association of adjuvant chemotherapy use with low (0–10), intermediate (11–25, and high (26–100) RS for entire patient cohort (no chemotherapy: n = 54,240; chemotherapy: n = 19,198) and for each histologic subtype. (IDC: no chemotherapy: n = 41,069; chemotherapy: n = 15,773; ILC: no chemotherapy: n = 6880; chemotherapy: n = 1680; DLC: no chemotherapy: n = 4129; chemotherapy: n = 1180; DOC: no chemotherapy: n = 1917; chemotherapy: n = 505; LOC: no chemotherapy: n = 245; chemotherapy: n = 60).

Associations between tumor subtypes and OS

When compared with IDC, overall survival (OS) (adjusted for age, race, ethnicity, rurality, median income, and educational level of area of residence, RS, tumor size, grade, node involvement, Charlson-Deyo comorbidity index, and treatment) for ILC was similar (HR 0.911; 95% CI 0.798, 1.039; p = 0.165). OS for DOC and LOC were also similar to IDC (HR 1.002; 95% CI 0.800, 1.254; p = 0.989 for DOC, and HR 0.530; 95% CI 0.237, 1.181; p = 0.120). In contrast, DLC had improved OS, compared with IDC (HR 0.824; 95% CI 0.696, 0.974; p = 0.024). Kaplan-Meier curves for OS in each histologic subtype are shown in Fig. 4.

Fig. 4: Kaplan–Meier curve for OS in different histological subtypes.
figure 4

Kaplan–Meier curve for OS comparing different tumor histologic subtypes. (a: all histologic subtypes, p = 0.1391; b: IDC vs ILC, p = 0.0772; c: IDC vs DLC, p = 0.0478; d: IDC vs DOC, p = 0.6363; e: IDC vs LOC, p = 0.4681).

Factors associated with OS in tumor subtypes

Factors associated with poorer OS in the full patient cohort included larger tumor size, node involvement, high grade disease, high RS, increasing age, Black race, and increasing comorbidity index (all p < 0.0001). These factors were also all significantly associated with poorer OS in IDC (all p < 0.0001). High RS (>26) remained prognostic for poorer OS in ILC (HR 1.906; 95% CI 1.151, 3.157; p = 0.0122), and in DLC (HR 2.557; 95% CI 1.398, 4.676); p = 0.0023), but not in DOC (HR 1.385; 95% CI 0.545, 3.519; p = 0.4942). Of note, a full analysis could not be performed on LOC, due to small sample size (n = 312, with 10 deaths). Other factors associated with poorer OS in ILC included larger tumor size (p = 0.0188), N1 disease (p = 0.0002), increasing age (p < 0.0001), and increasing comorbidity index (p < 0.0001). Black race and high grade disease were not associated with OS in ILC. For DLC, factors associated with poorer OS, apart from high RS, included increasing age (p < 0.0001), higher Charlson-Deyo comorbidity index (p = 0.0081) and Black race (p = 0.0025). Tumor size, grade, and node involvement were not associated with OS in DLC.

Evaluation of chemotherapy use and chemotherapy benefit

Receipt of adjuvant chemotherapy was associated with improvement in OS in the full patient cohort (HR 0.800, 95% CI 0.716, 0.894, p = 0.0001). IDC patients were more likely to receive chemotherapy than patients with other tumor histologies. 27.4% of IDC patients received adjuvant chemotherapy, compared with 19.3%, 21.9%, 20.5% and 19.2% for ILC, DLC, DOC, and LOC, respectively (p < 0.0001). Chemotherapy use was associated with increasing RS in the full patient cohort and in all tumor subtypes (all p < 0.0001, Fig. 3), although patients with ILC and RS 11–25 were more likely to receive chemotherapy than were patients with other tumor histologies. Chemotherapy use was also associated with larger tumor size, higher grade, N1 disease, Black race, and lower comorbidity score in the full patient cohort and in IDC (all p < 0.0001). Significant associations with chemotherapy use persisted for tumor size and nodal involvement for all tumor subtypes (p < 0.0001) except LOC. High grade disease was significantly associated with chemotherapy use in all tumor subtypes (p < 0.0001 for IDC, ILC, DLC and DOC, and 0.012 for LOC).

Evaluation of chemotherapy benefit in different tumor subtypes demonstrated that receipt of adjuvant chemotherapy was associated with improved OS in IDC (HR = 0.76; 95% CI 0.672, 0.864; p < 0.0001). In contrast, adjuvant chemotherapy was not associated with improvement in OS in ILC (HR = 0.986; 95% CI 0.700, 1.389; p = 0.9349), DLC (HR 1.039; 95% CI 0.673, 1.605; p = 0.8626), DOC (HR = 0.866; 95% CI 0.401, 1.871; p = 0.7148) and LOC (HR = 2.909; 95% CI 0.816, 10.372; p = 0.0996) (Fig. 5). However, because IDC is the largest subtype, the lack of statistical significance associated with chemotherapy for other subtypes may result from lack of statistical power (The minimum detectable HR associated with chemotherapy for ILC, DLC, DOC, and LOC is 0.80, 0.75, 0.63, 0.11, correspondingly, with 80% power and two-sided type I error rate of 5%.).

Fig. 5: Chemotherapy benefit in different histological subtypes.
figure 5

Evaluation of overall survival benefit from adjuvant chemotherapy in different tumor histologic subtypes (adjusted for age, race, ethnicity, rurality, median income and educational level of area of residence, RS, tumor size, grade, node involvement, and Charlson-Deyo comorbidity index).

Discussion

Although IDC and ILC are the most common BCA subtypes, IDC, represents ~80% of cases. The majority of clinical trials which inform BCA management do not distinguish between BCA subtypes, and their findings are likely driven by the behavior of IDC. We therefore sought to evaluate differences in clinicopathologic characteristics, RS distribution, and chemotherapy benefit between early HR + HER2-negative IDC and other HR + HER2-negative BCA subtypes in the NCDB database. We found statistically significant and clinically relevant differences in clinicopathologic features, RS distribution, and chemotherapy benefit between IDC, ILC and mixed ductal and lobular histologies.

As previously reported by other investigators2,3, we found that ILC was associated with larger tumors, lower histologic grade, and with increased use of mastectomy and decreased use of chemotherapy compared with IDC, but with similar OS. We also found improved adjusted OS in patients with DLC compared with IDC, which has not been previously reported.

We found a lower incidence of high RS in tumors with lobular histology compared with IDC. ILC was also associated with a lower incidence of low RS, and was therefore most likely to have intermediate RS of 11–25. High RS was prognostic for OS in the full patient cohort, and in IDC, and remained prognostic for OS in ILC and DLC, but not in DOC. In addition, we found that the OS benefit of adjuvant chemotherapy was limited to patients with IDC.

Several other investigators have evaluated RS in in non-ductal BCA histologies, and our findings complement their work. Tadros et al. evaluated RS in 610,350 tumor specimens from Genomic Health’s clinical laboratory, and found that ILC, DLC, DOC, and LOC all had significantly lower mean RS compared with IDC, but did not evaluate the impact of RS on prognosis or chemotherapy benefit in non-ductal histologies14. Christgen et al. compared RS and clinicopathologic prognostic factors in patients with node positive (N+) and high risk N0 ER+, HER2-negative lobular (n = 353) and non-lobular (n = 2232) BCA enrolled on the PlanB trial, and found that lobular tumors were associated with higher tumor and nodal stages, lower grade, and lower RS than non-lobular tumors, but similar disease-free survival (DFS). In contrast to our findings, Christgen et al. found that Grade 3 disease was associated with poorer DFS in lobular tumors, but high RS was not20.

Two studies have used SEER data to evaluate RS in BCA subtypes other than IDC. Wang et al. evaluated RS in 83,665 patients with N0 ER + BCA measuring under 5 cm, and belonging to one of eight different tumor subtypes (IDC, ILC, DLC, cribriform, tubular, mucinous, micropapillary, and intraductal papillary adenocarcinoma with invasion). They reported that IDC was more likely to have RS > 30 than IDC and DLC, but that IDC, ILC, and DLC had similar mean RS (which were higher than the mean RS of the other subtypes evaluated). RS was prognostic for breast cancer specific survival (BCSS) in IDC, ILC, and DLC, but not in other tumor subtypes21. Kizy et al. used SEER data to evaluate the prognostic and predictive impact of RS in Stage I to III ER + ILC (n = 7316), and similar to us, found that high RS was prognostic for worse OS, but did not predict chemotherapy benefit in ILC18. These studies are limited by the use of SEER data, which does not record patient comorbidities, collects incomplete information on chemotherapy and endocrine therapy use22, and did not collect HER2 status until 201018. In addition, Kizy’s study included patients with Stage III disease, for whom RS is not recommended by national clinical practice guidelines23,24, while Wang’s study used the initial cutpoints for intermediate and high RS10, which have now been largely replaced by the TAILORx cutpoints25.

Investigators have also evaluated other gene expression assays in ILC. Beumer et al. evaluated the prognostic value of MammaPrint in 217 ILC cases treated on one of five clinical trials, and found that high-risk MammaPrint was an independent poor prognostic factor for distant metastasis-free survival (DMFS), distant metastasis-free interval, and OS in ILC26. Metzger et al. compared clinicopathologic risk factors and MammaPrint results in IDC (n = 4826) and ILC (n = 487) patients enrolled on the MINDACT trial, and found that IDC and ILC had similar clinical risk distributions, but ILC was less likely than IDC to be genomically high risk. DMFS and DFS rates for IDC and ILC of similar genomic risk were similar to each other, suggesting prognostic value for MammaPrint in ILC27. The patient population for both of these studies was heterogeneous, and included ER-negative and HER2-positive cases26,27. Sestak et al. assessed the prognostic value of the EndoPredict assay in a cohort of 470 postmenopausal women with N0 and N + ILC treated with endocrine therapy on the ATAC, ABCSG-6, or ABCSG-8 trials. EPClin was prognostic for 10-year distant recurrence (DR) in ILC, and DR rates were similar in IDC and ILC patients with similar genomic risks by EpClin28. In contrast, Laenkholm compared results of the PAM50 assay in a Danish cohort of postmenopausal women with N0 or N1 ER + HER2-negative ILC (n = 340) and IDC (n = 1570), and found that ILC patients had significantly poorer 10-year DR than IDC patients with similar ROR scores29.

A different spectrum of genetic alterations in ILC compared with IDC may underlie the differing biologic behavior of the two histologies. In addition to the pathognomonic inactivation of CDH1, mutations in FOXA1 and in PIK3CA, PTEN and AKT are more common in ILC than in IDC, and GATA3 mutations are more frequent in IDC4,30,31,32. ILC is also associated with increased gains in chromosome 1q, 8q, and 16p; losses of 8p23-p21, 11q14.1-q25, and 16q; and amplifications of 1q32, 8p12, and 11q13 compared with IDC4,30,32,33. Thus, lobular-specific molecular assays may improve prognostication in ILC over currently available gene expression assays. LobSig, an ILC-specific 194-gene signature that incorporates gene expression and copy number, has recently been developed, and showed better prognostic ability in ILC than ROR and RS in a stepwise, multivariate Cox proportional hazards model4,30. Further studies to validate lobular-specific assays such as LobSig are warranted.

Our finding that the OS benefit of adjuvant chemotherapy is seen only in IDC warrants further study, especially as this finding may have been due to insufficient statistical power to detect benefit in less common BCA subtypes. While ILC is typically associated with both lower RS14,15,16 and poorer response to chemotherapy17,19, the trials that established that high RS is predictive of chemotherapy benefit in ER + HER2-negative BCA did not differentiate between tumor subtypes8,11,12, and no prospective trial has evaluated chemotherapy in ILC alone. Evaluation of histology, RS, and chemotherapy benefit in patients enrolled on large prospective clinical trials, such as TAILORx, may help further clarify this finding.

Our study has several strengths and limitations, due to use of NCDB data. Strengths include the large sample size, utilizing real-world data from CoC-accredited institutions. Limitations include the retrospective nature of the study, lack of information on cancer recurrence, relatively short duration of follow-up, and lack of central pathologic review.

In summary, our study confirmed that ILC and mixed BCA histologies are associated with distinct clinicopathologic and prognostic features compared with IDC, and with lower RS. Despite this, RS remains prognostic for OS in ILC and DLC. Further studies are warranted to evaluate our finding that chemotherapy benefit is limited to IDC.

Methods

Case selection

We utilized a dataset derived from the 2005-2016 National Cancer Database (NCDB). NCDB is a nationwide, facility-based database jointly sponsored by the American Cancer Society and the American College of Surgeons Commission on Cancer (CoC). The NCDB contains data collected on over 34,000,000 cancer cases from over 1500 CoC-accredited hospitals, representing over 70% of newly diagnosed cancers34, and 80% of newly diagnosed breast cancers in the United States35. The NCDB Participant User File (PUF) is a HIPAA-compliant data file, which is made available to investigators from CoC-accredited cancer programs who complete an application process. The data used in the study are derived from a de-identified NCDB file. The American College of Surgeons and the Commission on Cancer have not verified and are not responsible for the analytic or statistical methodology employed, or the conclusions drawn from these data by the investigator. The NCDB PUF contains de-identified patient data, including demographic information, tumor site and pathology data, first course of treatment, and mortality. The NCDB PUF does not contain information on recurrence. The NCDB began collecting information on results of gene expression assays in 2010.

The study population included women age 75 and under diagnosed between 1/1/2010 and 1/1/2014 with estrogen receptor-positive (ER+) HER2- BCA, measuring up to 5 cm, with 0–3 pathologically involved axillary nodes, treated with definitive surgery as first treatment, with tumor that contained ductal or lobular histology, and with numeric RS available. Tumor subtypes were coded as IDC, ILC, infiltrating duct and lobular carcinoma, infiltrating duct mixed with other types of carcinoma (DOC), and infiltrating lobular mixed with other types of carcinoma (LOC). Demographic information obtained included age at diagnosis, race, and ethnicity, as well as estimated annual household income, rurality and educational attainment of area of residence. Clinical characteristics included tumor size in mm, histologic grade, axillary node involvement, Charlson-Deyo comorbidity index, and numeric RS. Micrometastatic nodal involvement (pN1mi) was classified as node positive. RS was characterized as low, intermediate, or high using TAILORx cutpoints, where 0–10 was defined as low, 11–25 as intermediate, and 26–100 as high.

Statistical analysis

Associations between categorical variables were examined using the chi-square test. The Cox proportional hazards model was used to examine the difference in OS between histologic subtypes while controlling for age, race, ethnicity, Charlson-Deyo comorbidity index, median estimated annual household income of area of residence, rurality of area of residence, and educational attainment of area of residence, RS, tumor size, grade, node involvement and treatment. The estimated HR for each variable in the model, along with its 95% CI, was reported. All tests were two-sided with significance level <5%. Proportionality assumption was examined, and no violation was detected. All analyses were conducted using SAS 9.4 (SAS Institute Inc., Cary, NC, USA).

Ethics

NCDB PUF data are de-identified, and compliant with HIPAA. Hospitals, health care providers, and patients are not identified. Patient informed consent is not obtained prior to institutional data submission to NCDB. As this study utilizes de-identified patient data, with no attempt made to contact or re-identify the subjects, it is deemed exempt from oversight by the Institutional Review Board of Albert Einstein College of Medicine.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.