Introduction

Colon cancer represents the third most common malignancy among both men and women and is the second highest cause of cancer mortality in men and women, when combined, in the United States1,2. Early detection and improved treatments have resulted in not only improved overall survival in patients with colon cancer but also a decreased incidence in the United States2. Adenocarcinoma (AC) of the colon is the predominant histology, followed by neuroendocrine tumors and other rarer histologic entities3. In particular, medullary adenocarcinoma (MAC) is a very rare glandular tumor subtype of colon cancer4,5.

MAC was first described in the 1990’s6, but has historically been difficult to distinguish from poorly differentiated AC5. However, through identification of molecular markers specific to MAC5,7,8, MAC has been better defined and characterized leading to increased recognition and diagnosis in the last two decades. In published reports, the incidence rate of MAC is noted to be less than 1% of all primary colon cancers, with a preponderance in older women9,10. MAC is frequently characterized by features of aggressive tumor biology, such as lymphovascular invasion (LVI) and perineural invasion (PNI), as well as larger tumor size on presentation, yet with a decreased likelihood of progression to regional nodal and distant sites when compared with poorly differentiated AC10,11,12. At least part of the biological behavior of MAC may be explained by the finding that approximately 90% of MAC is associated with microsatellite instability (MSI)6,10,13.

Over the last two decades during which MAC has been recognized as a distinct subtype of colon cancer, there have been dramatic changes in our understanding and classification of colon cancer into integrated clinical and molecular colon cancer subtypes14,15. We analyzed data obtained through the Surveillance, Epidemiology, and End Results (SEER) program and the National Cancer Database (NCDB) to compare AC and MAC to better understand the incidence trends, clinicopathologic and treatment characteristics, and oncologic outcomes of MAC in the context of contemporary colon cancer management.

Materials and methods

This study was exempt from institutional research board approval given the use and analysis of de-identified national datasets. Our study consisted of three dataset analyses: (1) age-adjusted incidence trends over time using SEER data, (2) patient characteristics of MAC versus AC using unmatched NCDB data, and (3) prognostic characteristics of MAC versus AC using NCDB data matched 1:1 on age, sex, race/ethnicity, number of Charlson comorbidities, laterality, and disease stage.

Trends in the annual age-adjusted incidence rates of MAC and AC were calculated using SEER*Stat. Included in our analyses were patients diagnosed with MAC or AC from 2002–2017. We chose to use SEER data and the SEER*Stat application because the dataset contains a module specifically designed for age-adjusted incidence rate calculations. For the cohort analyses examining clinicopathologic characteristics and cancer outcomes, we chose the NCDB dataset because, unlike SEER, Collaborative Staging site-specific factor data on KRAS and MSI are well-captured.

The SEER 21 dataset contains cancer incidence and survival information from 21 SEER sites and represents 36.7% of the United States population (https://seer.cancer.gov/registries/data.html#a7). From 2002 to 2017, in these 21 sites, 1,236 patients were diagnosed with MAC and 471,927 were diagnosed with AC. All patients with a malignant diagnosis of MAC or AC were included in the incidence rate analysis.

NCDB data were used to compare demographic, clinicopathologic, and outcome characteristics across MAC and AC. The NCDB is a joint project of the Commission on Cancer of the American College of Surgeons and the American Cancer Society. The patient population captured within the dataset represents 70% of those with new cancer diagnoses treated at the approximately 1,500 Commission on Cancer-designated Clinical Cancer Programs across the United States. The 2019 release of NCDB colon cancer Participant User File (PUF) was queried for all patients diagnosed with MAC or AC from 2010–2016.

Given the de-identified nature of the data, this study was exempted from institutional review board approval. Figure 1 outlines the stepwise selection criteria for the NCDB patients. MAC patients were identified based on International Classification of Diseases for Oncology, 3rd Edition (ICD-O-3) morphology codes 8510 and 8513 and cases of AC were identified using ICD-O-3 morphological codes 8140–8144, 8210–8213, 8220, and 8260–8263. Patient demographic, clinicopathologic, and treatment data were analyzed. Laterality of the primary tumor was delineated by ICD-O-3 topographical codes. Laterality was defined as “right” if the cancer was located from the cecum to the transverse colon (codes C180, C182-184) and defined as “left” if the cancer was located from the splenic flexure up to the rectosigmoid (codes C185-187). Overlapping or unspecified sites were indicated by codes C188 and C189, respectively. Patients with other histologic diagnoses were excluded. Our analyses were restricted to patients diagnosed from 2010 to coincide with the NCDB addition of Collaborative Staging site-specific factor data pertaining to microsatellite stability (MSS) or instability (MSI), and KRAS gene mutation status.

Figure 1
figure 1

Stepwise patient selection criteria.

Statistical analysis

Statistical analyses were performed using SEER*Stat 8.3.616 and SAS version 9.4 TS Level 1M2 (SAS Institute, Cary, NC). Incidence rates were calculated using the number of cases over the number of individuals at risk in each of 19 age groups within the relevant SEER catchment area during a specific year. These rates were age adjusted using the 2000 US standard population (19 age groups; census P25-1130) and are expressed as per 100,000 individuals at risk17. Incidence rate changes over time are expressed as annual percent changes (APCs), which represent the log-transformed slopes across time.

To assess patient demographics and clinicopathologic characteristics, initially unmatched NCDB data were used. Due to the heterogeneity of the clinicopathologic features between MAC and AC, additional analyses of outcomes data were compared using a 1:1 matched analysis. The factors used to match the MAC patients with the AC patients included age (± 5 years), sex, race/ethnicity, number of comorbid conditions, laterality, and disease stage. All matching was performed using the GREEDY algorithm18.

Data were summarized using counts and percentages for categorical variables and median values with interquartile ranges (IQRs) for continuous data. Categorical data were compared using Pearson’s Χ2 analysis and continuous data were compared using the Wilcoxon rank-sum test. Univariate and multivariate Cox proportional hazards regression models were used to identify predictors of overall survival (OS). Multivariate OS results are depicted using forest plots. A two-sided p value of < 0.05 is considered significant.

Results

Annual incidence of MAC increased over time

From 2002–2017, the annual percent change (APC) in age-adjusted incidence rates of MAC increased by 23.8% (95% CI: 19.2–28.6) (Fig. 2A). In contrast, the APC in age-adjusted incidence rates of AC declined during the same time period (APC: − 2.9, 95% CI − 3.1 to − 2.8) (Fig. 2B).

Figure 2
figure 2

Colon medullary adenocarcinoma and adenocarcinoma incidence rates by histology. (A) Colon Medullary Adenocarcinoma. (B) Colon Adenocarcinoma.

Demographic, clinicopathologic, and treatment characteristics of MAC differ from AC

Unmatched cohort

A cohort of 1,018 patients diagnosed with MAC and 210,784 diagnosed with AC were identified for analysis. Patient, clinicopathologic, and treatment characteristics are presented in Table 1. Compared to AC patients, those with MAC were older, predominantly female, and white (all p < 0.0001). The reported diagnosis of MAC increased over time, with 50% of patients identified in the last four years of data collection (data not shown). MAC was located more often in the right colon; more frequently classified as high tumor grade and with LVI; and more likely to be Stage II disease (all p < 0.0001). Although such high-risk features were more commonly present in MAC, patients with MAC had lower rates of regional lymph node involvement (p = 0.037). Of patients with specified microsatellite status, over 81% of MAC patients had MSI versus 24% of AC patients (p < 0.0001). Among patients with known KRAS mutation status, MAC was associated with lower rates of KRAS-mutant disease (p < 0.0001). Overall, chemotherapy was administered to AC patients with greater frequency than to patients with MAC (p < 0.0001).

Table 1 Baseline demographic and clinicopathologic characteristics of participants by histology status.

Matched cohort

Included in the matched results were 1016 MAC patients matched 1:1 with AC patients on age, sex, race, number of comorbidities, laterality, and disease stage. Patient demographic, clinicopathologic, and treatment characteristics are presented in Table S1. Univariate analyses of matched variables showed good matching of the groups (p > 0.95 for all matched variables). Compared to AC, MAC lesions were significantly more likely to be high-grade with LVI, MSI, and wild-type KRAS (all p < 0.0001).

Multivariate Cox proportional hazards regression was used to look at independent risk factors for overall survival (OS). A forest plot depicting multivariate results is shown in Fig. 3. After adjusting for age group, sex, number of Charlson co-morbidities, grade, stage, laterality, LVI, PVI, MSI, KRAS, and receipt of chemotherapy, MAC was not independently associated with overall survival (OS) (HR 1.11, 95% CI 0.91–1.35).

Figure 3
figure 3

Overall survival by histology and laterality of left and right-sided colon medullary adenocarcinoma and colon adenocarcinoma.

Overall survival depends on laterality of subtypes

After stratifying the histologic groups by laterality, multivariate results on the unmatched group showed left-sided MAC tumors were associated with significantly shorter survival when compared to right-sided MAC (HR 1.66, 95% CI 1.16 − 2.38) and right-sided AC (HR 1.54, 95% CI 1.12–2.12) (Fig. 3). In contrast, left-sided AC was associated with significantly longer survival when compared to right -sided AC tumor (HR 0.93, 95% CI 0.91–0.94).

Discussion

Colon MAC is a rare subtype of colon cancer. Previous publications have estimated that the disease represents less than 1% of all colon cancers6,10. In our analysis of the SEER dataset, we confirmed the rarity of this disease. However, over the last two decades there has been an increase in the incidence of reported cases of MAC with an APC of 23.8%. Whether this rise in incidence is as a result of increased recognition and reporting of the histologic entity or due to actual increased number of cases in the population remains uncertain. That the majority of the cases captured have been in the last 4 years of our analysis suggests that better recognition of the entity, perhaps through immunohistochemical tests for mismatch repair (MMR) proteins and confirmation of MSI, may account for the incidence rise. Regardless, our finding that the incidence of MAC is rising argues that a better understanding of the clinical characteristics and oncologic outcomes of MAC, in comparison to AC, is needed to better tailor treatment to this distinct colon cancer histologic subtype.

Clinical characteristics of MAC have been previously described in small single institution studies and in large public dataset analyses4,9,11. There are distinct clinical features associated with MAC when compared to AC. MAC is more frequently characterized by local aggressiveness, including more advanced T-stage, higher histopathologic grade, and presence of LVI and PNI, although with less likelihood of nodal or distant involvement. Furthermore, MAC is predominantly found in the right colon (89%), with far higher frequency of MSI (81%) than would be expected for right-sided colon ACs.

The molecular characteristic that distinguishes MAC from AC is the predisposition to MSI5,6. As shown in our data and in support of the other studies, more than 80% of MAC patients with reported microsatellite status were positive for MSI. Microsatellite instability, first described in 1993, represents deficiency of the mismatch repair system (dMMR phenotype)19. Hereditary nonpolyposis colorectal cancer, a component of Lynch syndrome, is a consequence of germline mutations in one of the MMR genes, MLH1, MSH2, MSH6, and PMS220. In contrast, sporadic cases of colon cancer with MSI develop from senescent hypermethylation of the MLH1 gene promoter21. The clinical relevance of these differing etiologies has been explored, suggesting distinct tumor biology22,23. Although the NCDB does not have discrete information regarding sporadic or inherited etiology of MSI status, which would have enriched our analysis, the vast majority of MSI patients in our cohort were likely sporadic, especially given the older age at diagnosis, the sex distribution favoring female patients, and the overwhelmingly right-sided disease24.

MSI has been associated with improved OS in colon AC19. However, it is unclear whether colon MAC, given the high preponderance of MSI that is the hallmark of the subtype, likewise experiences improved OS compared to MSS MAC or AC. In the early description by Lanza et al., MAC was identified to be a distinct subtype with improved OS6. Subsequent studies, including a previous publication using the NCDB dataset, likewise show better OS for MAC9. However, in a recent publication from Gomez-Alvarez, OS was worse for MAC when compared with MSI high AC in a small, single institutional series11. To better understand the survival outcomes of MAC when compared to AC, we performed survival analyses. We find that overall survival was similar between MAC and AC in the NCDB dataset. To limit potential bias, we further analyzed the data by conducting a survival analysis of a 1:1 matched cohort and confirmed that overall survival was similar among MAC compared to AC (HR 1.11, 95% CI 0.91–1.35).

Sidedness or laterality in colon cancer is increasingly recognized as an important prognostic factor. A meta-analysis of colorectal cancer studies stratified by sidedness reported a 19% reduction in mortality for left-sided malignancies compared to right-sided cancers25. Embryologically, the colon develops from both the midgut and hindgut, which in conjunction with the evolution of the host’s microbiome, may lead to unique genetic phenotypes between right- and left-sided tumors14,26. Treatment strategies for colon cancer have evolved to account for differential genetic expression profiles by tumor location. One example of this is anti-epidermal growth factor receptor therapies for wild-type KRAS colon cancer, which have demonstrated a preferential survival benefit for left-sided cancers27,28. To evaluate the effect of sidedness on overall survival in colon AC and MAC patients, we grouped the unmatched dataset into right-sided AC, left-sided AC, right-sided MAC, and left-sided MAC. Compared to right-sided AC, improved survival was seen among left-sided AC (HR 0.93, 95% CI 0.91–0.94). In contrast, compared to right-sided AC, decreased survival was seen in left-sided MAC (HR 1.54, 95% CI 1.12–2.12). Similarly, when looking at MAC patients only, those with left-sided cancers had significantly worse survival than those with right-sided cancers (HR 1.66, 95% CI 1.16-2.38).

At present, resection remains upfront treatment for colon cancer regardless of subtype. However, nuances in adjuvant treatment strategies for stage II and III disease and systemic treatment regimens for stage IV disease are increasingly based on molecular markers such as MSI and KRAS status. Although we demonstrate an improvement in the survival HR in MAC patients who are treated with chemotherapy, the lack of granularity in public datasets limit the ability to characterize the agents and the regimens associated with that benefit.

The well-described limitations of databases such as the NCDB include the lack of disease recurrence data and cause of death. However, the use of a large, public dataset is necessary to aggregate adequate numbers of rare cancers such as MAC for analysis. Other limitations include data availability. Specific to our study, important tumor factors such as MSI, LVI, and KRAS status have only been collected since 2010, limiting the length of follow-up available for analysis. In addition, the low numbers of known KRAS and BRAF mutation information documented in the database precluded a more detailed analysis of MSI, KRAS, and BRAF as prognostic factors for MAC versus AC. Specific systemic treatment-related analysis remains limited by the demonstrated low sensitivity of records of such treatments in the NCDB29. Very few patients were recorded as having received immunotherapy, and a substantial minority of patients with stage III–IV disease were not noted to have received chemotherapy; suggesting both data points were likely under-reported. In addition, the NCDB lacks information on administration of targeted therapy in addition to cytotoxic chemotherapy.

Conclusions

Colon medullary adenocarcinoma is increasingly recognized and diagnosed. Medullary histology closely resembles microsatellite-unstable poorly differentiated colon adenocarcinoma with pathologic features of local aggressiveness such as larger tumor size and LVI. However, left-sided MAC demonstrates worse overall survival compared to right-sided MAC and bilateral AC. Efforts made to distinguish MAC from AC at the molecular level may improve on the ability to define each entity, when further stratified by sidedness, to determine the prognostic impact of KRAS and BRAF mutations in the setting of MSI, and to identify other prognostic and predictive molecular markers to develop more specific treatment guidelines for this rare cancer.