Introduction

Extrathyroidal extension (ETE) is defined as the direct extension of the primary thyroid cancer into the adjacent structures, including strap muscles, trachea, larynx, esophagus, recurrent laryngeal nerve and/or major vasculature1,2. While strap muscle invasion was subclassified as minimal or gross, the American Joint Committee on Cancer (AJCC) 8th edition TNM staging schema discarded the minimal ETE category entirely while defining gross strap muscle invasion alone as the T3b stage. Validation studies have corroborated the new staging system by showing better allocation of relatively low-risk patients into lower stages with excellent survival outcomes3, and high-risk patients into more advanced stages3,4,5,6,7.

The newly introduced staging factor of gross strap muscle invasion alone (T3b) has been associated with higher risk of positive resection margin, bigger tumor size, higher recurrence rate, and lower disease survival rate8,9,10. However, studies have reported that strap muscle invasion alone, both minimal and gross, may not have significant influence on overall survival11,12, particularly in tumors equal to or smaller than 40 mm13. A recent study has also suggested similar disease-free survival rate between the minimal ETE and gross strap muscle invasion alone groups in papillary thyroid carcinomas with tumor size 10- to 40-mm14. With advances in surgical techniques and adjuvant therapies13, it is possible that tumors with strap muscle invasion alone may indeed have better prognosis than previously anticipated. Therefore, the impact of strap muscle invasion alone, both minimal and gross, on the survival of thyroid cancer patients needs to be elucidated to accurately allocate patients into appropriate staging groups according to individual risk, which will ensure longer disease-free survival and prevent overtreatment.

The Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute (NCI) collects cancer incidence and survival information from population-based cancer registries encompassing nearly 34.6% of the total population of the United States15. The SEER program is regarded as a critical database for cancer survival surveillance in the United States. This openly available database has compiled cancer incidence and survival data since 1973. Accordingly, it is an ideal database for analyzing the effect of gross ETE on overall survival in thyroid cancer.

Therefore, the purpose of this study was to explore the impact of strap muscle invasion alone, both minimal and gross, on the disease-specific survival (DSS) of differentiated thyroid cancer (DTC) patients using the SEER database, and to potentially improve the survival prediction power of the TNM staging schema.

Results

Baseline characteristics

Baseline patient demographics are presented in Table 1. The median age at diagnosis was 47.0 years old (interquartile range, 36.0–57.0 years old), with a median follow-up of 55.0 months (interquartile range, 32.0–85.0 months; range, 1.0–155.0 months). A total of 18,085 (90.8%) patients were diagnosed with papillary carcinoma and its variants, while 1406 (7.1%) patients were diagnosed with follicular carcinoma and its variants. Other pathologies included oxyphilic adenocarcinoma (n = 355), non-encapsulated sclerosing carcinoma (n = 64), and clear cell adenocarcinoma (n = 4). Most patients underwent total thyroidectomy (n = 16,874, 84.7%). Adjuvant radiation therapy was performed in over half of the patients (n = 10,165, 51.0%), which included radioactive iodine (RAI) (n = 9740), radioactive implants (n = 120), beam radiation (n = 195), a combination of beam radiation with implants or isotopes (n = 66), and radiation therapy not otherwise specified (n = 44). Adjuvant chemotherapy was performed in only 57 (0.3%) patients. The distribution of T stage according to the AJCC 8th staging criteria is shown in Table 2.

Table 1 Baseline patient demographics.
Table 2 10-year DSS and T stage distribution according to the AJCC 8th TNM staging schema or modified TNM staging schema (n = 19,914).

Prognostic impact of strap muscle invasion alone on 10-year DSS

Cox proportional hazard analysis was performed to estimate the prognostic impact of strap muscle invasion on DSS in all patients (n = 19,914) and in the subgroup of patients with tumor size equal to or smaller than 40 mm (n = 17,837) (Table 3). Univariable and multivariable analyses revealed that strap muscle invasion alone as well as age, tumor size, major organ invasion, major vessel invasion, LN metastasis, distant metastasis, and chemotherapy were significant prognostic factors for DSS. Subgroup analysis of tumors equal to or smaller than 40 mm in size showed that although strap muscle invasion alone was a significant prognostic factor for DSS on univariable Cox regression analysis, it was not a significant prognostic factor for DSS on multivariable analysis.

Table 3 Cox proportional hazard analysis for disease-specific survival (DSS) according to the AJCC 8th TNM staging schema.

Competing risk analysis of cancer-caused deaths and deaths to other causes

On multivariable competing risk analysis, strap muscle invasion alone did not significantly impact death due to either cancer or other causes in all patients as well as in the subgroup of tumors equal to or smaller than 40 mm (Table 4), although it was a significant factor on univariable analysis (Supplementary Table S1). In addition, in the subgroup of patients 55 years old of age or older, multivariable competing risk analysis showed that strap muscle invasion only did not significantly impact death due to any cause, despite its significance on univariable competing risk analysis (Supplementary Table S2).

Table 4 Multivariable competing risk analysis of clinical and pathological variables for cancer–caused death according to the AJCC 8th TNM staging schema.

10-year DSS according to the T stages of the AJCC 8th TNM staging and the modified TNM staging schemas

Table 2 showed the 10-year DSS and the number of patients in each T stage according to either the AJCC 8th TNM staging schema or the “modified” TNM staging schema (discarding strap muscle invasion from the T stage criteria). While there was an overall negative correlation between T stage and 10-year DSS per the AJCC 8th staging schema, the T3b stage showed better 10-year DSS than the T3a stage (Fig. 1a). Restaging of the T3b stage (i.e., T3bNanyMany) according to the modified TNM staging schema resulted in a total of 1552 patients assigned to T1a (n = 238, 15.3%), T1b (n = 543, 35.0%), T2 (n = 506, 32.6%), and T3 (n = 265, 17.1%) stages with significant difference in DSS (Fig. 1b). Among the 1518 patients without distant metastasis (i.e., T3bNanyM0, stage II), 589 patients (38.8%) were downstaged to stage I, and there was significant difference in DSS after reallocation into either stage I or stage II (Fig. 1c,d). T stage and overall staging reallocation per the modified staging schema is shown in Fig. 2.

Figure 1
figure 1

Comparison of DSS of the T3b subgroup according to the AJCC 8th TNM staging and modified TNM staging schemas and alluvial plot of restaging according to the two staging schemas. (a,b) Comparison of the disease-specific survival (DSS) between the T3a and T3b stages according to the AJCC 8th TNM staging schema (a) and the modified TNM staging schema (b). Reallocation of the T3a and T3b stages according to the modified TNM staging schema (b) showed significantly different DSS according to tumor size only (p < 0.001). (c,d) Comparison of DSS of the T3bNanyM0 group according to the AJCC 8th TNM staging schema (c) and the modified TNM staging schema (d). The T3bNanyM0 group (c) was classified as stage II with the AJCC 8th TNM staging schema. According to the modified TNM staging schema (d), the T3bNanyM0 group was reallocated to either stage I or stage II with statistically significant difference in DSS (p = 0.015).

Figure 2
figure 2

Restaging of the AJCC 8th TNM stages according to the modified TNM staging schema. The alluvial plot shows the reallocation flow for T stages and overall stages according to the modified TNM staging schema. Orange color indicates patients with strap muscle invasion (T3bNanyM0) who were restaged according to tumor size. The number of patients in each T stage of the AJCC 8th TNM staging schema or the modified TNM staging schema are presented along the left and right margins.

Evaluation of the power of survival prediction for the AJCC 8th and the modified TNM staging schemas

The Harrell’s C concordance indices (C-indices) of the AJCC 8th and the modified TNM staging schemas was estimated as 0.9418 and 0.9405, respectively, but without statistically significant difference (p = 0.220). The proportion of variance explained (PVE) for DSS prediction with the AJCC 8th and the modified TNM staging schemas was estimated as 4.45% and 4.43%, respectively.

Discussion

This study suggested that strap muscle invasion alone of any degree was not a statistically significant prognostic factor of DSS in all tumor sizes of DTC. In particular, this finding may significantly impact tumors equal to or smaller than 40 mm, which may be overstaged by the current AJCC 8th TNM staging schema. In addition, survival analysis of the T3b group according to the AJCC 8th TNM staging schema showed that it consists of a heterogeneous group of patients with significantly different DSS according to tumor size. T3b patients reallocated to either the T1 or T2 stage according to the suggested modified staging schema showed significantly better DSS compared to those in the T3 stage.

Historically, ETE was assumed to have a positive correlation with compromised survival in DTC16,17, but with some controversy18,19. While minimal ETE was totally discarded from the AJCC 8th TNM staging schema, gross strap muscle invasion was introduced as the new T3b stage regardless of tumor size. However, there have been suggestions that even gross strap muscle invasion does not impact survival in patients with DTC8,9,10,11,13,20. Indeed, the AJCC 8th staging schema contains in-stage heterogeneity in regards to DSS prediction21. One potential source of this heterogeneity may be due to the prognostic impact of strap muscle invasion. A recent study of a total of 2804 patients and demonstrated that perithyroidal soft tissue or strap muscle invasion showed disease-free survival, overall survival and DSS comparable to those of intrathyroidal tumors equal to or smaller than 40 mm (median follow-up 59 months; range, 12–192 months), although with small number of strap muscle invasion cases (n = 61)13. Another study on 3104 patients with either papillary or follicular thyroid carcinoma revealed that gross strap muscle invasion does not significantly impact DSS in tumors equal to or smaller than 40 mm, and that DSS of the T3b stage did not significantly differ from that of the T2 stage (median follow-up 10 years; interquartile range, 8.1–12 years)11.

This population-based study utilized the SEER database15 without discrimination of tumor size to reveal that strap muscle invasion alone does not significantly impact DSS, regardless of tumor size or cause of death. This may have particular clinical impact on tumors equal to or smaller than 40 mm which may be appropriately downstaged. Although the estimated C-indices and PVE did not reveal statistically significant differences in the predictive power of the current AJCC 8th and the suggested modified TNM staging schemas, better survival curve separation was observed per the modified staging schema with statistical significance.

Invasion of strap muscle in any degree may not have significant survival impact due to the relative complexity of posterior anatomical structures such as trachea, recurrent laryngeal nerve, and prevertebral fascia. Critical laryngotracheal structure invasion leads to a higher possibility of incomplete resection and therefore to higher clinical recurrence and poorer prognosis9. On the other hand, strap muscles can be relatively easily resected in the presence of tumor invasion13,22, accounting for their lack of significant impact on DSS. Consequently, anterior or posterior ETE have been suggested as more appropriate staging factors than gross strap muscle invasion13,23. Therefore, the dismissal of both minimal and gross strap muscle invasion may lead to better allocation by the TNM staging schema and better reflection of DSS, as suggested by the modified staging schema, particularly for tumors equal to or smaller than 40 mm.

There are several limitations to our study. First, the retrospective nature of this study may have caused an inherent bias. Second, the SEER database did not distinguish between minimal and gross strap muscle invasion. However, any ETE beyond strap muscle invasion was explicitly recorded and an inference was made that the code for strap muscle invasion included both minimal and gross strap muscle invasion collectively. Thus, the purpose of this study was limited to exploring the potential changes in TNM staging with the collective dismissal of both minimal and gross strap muscle invasions. Third, the SEER database did not include recurrence data, which limited the analysis of the impact of strap muscle invasion alone on disease-free survival. Strengths of the present study include the use of a national, comprehensive database with analyses of prognostic variables in all tumor size groups as well as tumor size and age subgroups. Moreover, competing risk analysis offered a more accurate evaluation of potential prognostic variables relative to other causes of death.

In conclusion, strap muscle invasion alone of any degree does not significantly impact DSS in DTC patients, regardless of tumor size. The modified TNM staging schema suggests potential modification of the TNM staging schema to better reflect the risk of cancer-caused death and to prevent potential overstaging of tumors, particularly those equal to or smaller than 40 mm in size.

Methods

SEER database and study population

The SEER database was accessed on July 4th, 2019, and queried for all patients diagnosed with DTC alone. Inclusion criteria were as follows: (1) patients older than 18 years old, (2) patients diagnosed with only DTC according to the International Classification of Disease for Oncology, third edition (ICD-O-3) codes (Supplementary Table S3), and (3) patients who underwent surgery with or without adjuvant radiation therapy or chemotherapy. Patients with medullary carcinoma, mixed medullary carcinoma component, insular carcinoma, and anaplastic thyroid cancer were excluded. In addition, patients with unknown race, unknown tumor grade, unknown ETE, unavailable or incomplete TNM staging data, death certificate or autopsy alone, or unknown cause of death were excluded (Fig. 3). The extension item of the SEER Cancer Schema (CS) version 02.05.50, which encodes tumor ETE based on pathologic and/or clinical information, does not explicitly distinguish between minimal or gross strap muscle invasion. However, since more advanced organ invasion (i.e., recurrent laryngeal nerve, vagus nerve, major vessels, major organs or prevertebral fascia) was clearly distinguished by the CS extension coding, the invasion of omohyoid, sternohyoid, sternothyroid and/or thyrohyoid muscles (code 450) was considered to include both minimal and gross strap muscle invasion collectively. This code was regarded as the T3b stage per the AJCC 8th TNM staging schema in this study.

Figure 3
figure 3

Flow diagram of the SEER database search. Flow diagram of the SEER database with inclusion and exclusion criteria.

A total of 19,914 patients were included in the final analysis of demographic variables including age at diagnosis, race, sex, year of diagnosis and survival status. Pathologic characteristics included single or multifocal disease, tumor size, ETE, LN metastasis and distant metastasis. This retrospective study utilizing a publicly available database with de-identified records did not require informed consent from the SEER registered cases, and Institutional Review Board approval was formally waived.

Primary outcome

The primary outcome of this study was cancer-caused death from DTC. DSS was defined as the time from date of diagnosis until cancer-caused death or last censoring.

Tumor staging

Because the currently available SEER database only provides TNM staging according to the AJCC 7th or previous staging schemas, the included 19,914 patients were restaged according to the AJCC 8th TNM staging schema using the recorded pathologic characteristics. Strap muscle invasion alone (code 450 of the CS extension item in the SEER CS version 02.05.50) was categorized as T3b stage, while pericapsular soft tissue or connective tissue invasion was categorized according to tumor size only. Major organ invasion including gross invasion of the subcutaneous soft tissue, larynx, trachea, esophagus, or recurrent laryngeal nerve from any tumor size was classified as T4a stage per the AJCC 8th TNM staging schema. Likewise, major vessel and/or prevertebral fascia invasions including the encasement of carotid artery or mediastinal vessels from any tumor size were classified as T4b stage.

Based on our hypothesis that minimal and gross strap muscle invasion collectively in the absence of other risk factors does not significantly impact DSS, we suggested a “modified” staging schema thereby discarding strap muscle invasion of any degree from the T staging criteria. Therefore, T3b stage according to the AJCC 8th TNM staging schema were reallocated depending on tumor size only, into either the T1a, T1b, T2, or T3 stage per the modified staging schema. The staging of major adjacent structures extension (i.e., T4a or T4b stages) remained unchanged.

Statistical analysis

Statistical analyses were performed using commercially available software (R software, version 3.6.1 (https://www.R-project.org, R Foundation for Statistical Computing, Vienna, Austria) with the application of appropriate R packages (survival, survminer, cmprsk, compareC, ggplot2 and alluvial)24,25,26,27,28,29. The Cox proportional hazard analysis was used to examine the effects of clinical factors and histopathologic characteristics of DTC on DSS. Competing risk analysis by proportional subdistribution hazards regression modeling was performed26,30. In this study, competing risk was considered death due to other causes since deaths unrelated to DTC may obscure the ability to observe cancer-caused deaths31. Kaplan–Meier estimation and the log rank test were used to assess 10-year DSS probability and DSS curves according to T stages based on either the AJCC 8th TNM staging schema or the modified TNM staging schema. The measure of discrimination for survival prediction was estimated using C-index, and goodness-of-fit for survival prediction was estimated using the PVE for both staging schemas32. All tests were two-sided, and a p-value of less than 0.05 was considered statistically significant.