Introduction

Over the past two decades, evidence from randomized clinical trials has enabled the de-escalation of axillary surgery in the management of early breast cancer1, with the major change has begun the replacement of axillary lymph node dissection (ALND) by sentinel lymph node biopsy (SLNB). Three randomized trials, The American College of Surgeons Oncology Group (ACOSOG)-Z0011 trial, the After Mapping of the Axilla: Radiotherapy Or Surgery (AMAROS) trial, and the International Breast Cancer Study Group (IBCSG) 23–01 trial found that complete ALND could be safely omitted2,3,4.

The ACOSOG-Z0011 trial found that complete ALND did not improve locoregional recurrence-free survival rates in clinical T1-2N0M0 breast cancer patients with 1 or 2 positive sentinel lymph nodes (SLNs) who underwent breast-conserving surgery followed by whole breast radiotherapy, thus changing the standard approach to axillary surgery2. Intraoperative pathology assessment of SLNs is highly effective in detecting patients who may benefit from ALND at the same surgical procedure5. Nevertheless, a significant proportion of these patients may not need ALND, reducing the usefulness of intraoperative pathologic examination of frozen sections. Intraoperative diagnostic techniques, including frozen sections, add to the cost and complexity of the surgery and complicate surgical scheduling. Application of the criteria derived from these studies has reduced the need to identify lymph node status intraoperatively, thus reducing the performance of ALND and pathologic assessment of frozen sections of SLNs6.

Less invasive techniques are needed to predict the state of SLNs before surgery. A valuable nomogram to identify patients who do not require intraoperative frozen sections to evaluate SLNs was made in the previous study7. That nomogram did not predict the involvement of each ALN but identified patients with ≥ 3 metastatic axillary lymph nodes (ALNs). Patients with a high probability of having ≥ 3 metastatic ALNs could be identified using preoperative imaging modalities, such as computed tomography (CT) and ultrasonography (US), as well as patient demographics and characteristics. 18F-fluorodeoxyglucose (FDG) positron emission tomography (PET/CT) could not be included in the previous nomogram due to the results of PET/CT having a low positive predictive value (27.2%)7.

Current clinical guidelines and several retrospective studies have indicated that bone scan or CT scan of chest-abdomen-pelvis (CTCAP) is not clinically useful in breast cancer staging. Despite guidelines and limited clinical evidence, these imaging modalities have been commonly used for staging work-up in patients with early breast cancer in some areas, including Asia8,9,10. As neoadjuvant systemic therapy has been widely applied, the use of these imaging modalities seems to continue in practice. PET/CT is also not recommended for asymptomatic early breast cancer patients. PET/CT scanning can identify regional nodal disease and distant metastases, suggesting that PET/CT can replace multiple imaging modalities for breast cancer staging, including axilla assessment, bone scan, and CTCAP11. Chest CT can evaluate limited areas for metastases, whereas PET/CT can evaluate the entire body. Moreover, PET/CT is highly specific regarding regional lymph node involvement12 and has been used more than chest CT for evaluating ALN metastases, despite lacking sensitivity13. Recently, Hyland represented that PET/CT reduced the risk of false-positive in half and reduced work-up for incidental findings, enabling early treatment initiation14.

The previous study enrolled patients who underwent surgery between 2006 and 2011, a period during which only 19.9% of these patients underwent 18F-FDG PET/CT. These provided another reason for excluding 18F-FDG PET/CT results from the previous nomogram. This study enrolled patients who had sufficient PET/CT images from a more recent period. And using this, it aimed to create a more improved nomogram that predicts the probability of ≥ 3 ALNs.

Results

Patient and tumor characteristics, as well as treatment and preoperative imaging findings, are shown in Table 1. The mean age at diagnosis was 51.4 years (range, 24–82 years). Of the 1030 patients in the training set, 89 (8.6%) showed involvement of ≥ 3 ALNs, and 295 (28.6%) underwent ALND. 18F-FDG PET/CT findings were available for 853 patients (82.8%), with 188 (22.0%) of these patients having findings suspicious of ALN metastasis.

Table 1 Patient characteristics of the training set (N = 1030).

Univariate logistic regression analysis (Table 2) showed that having ≥ 3 ALNs was significantly associated with patient age (p = 0.048); mean tumor size on US (p < 0.001) and magnetic resonance imaging (MRI) (p < 0.001); tumor stage on US (p < 0.001); axillary ultrasonographic ALN classification (p < 0.001); chest CT findings suspicious of positive ALN (p < 0.001); and PET/CT findings suspicious of positive ALN (p < 0.001).

Table 2 Univariate logistic regression analysis for factors associated with involvement of three or more axillary lymph nodes.

Multivariate logistic regression analyses (Table 3) were performed using as factors patient age, tumor size on preoperative US (cm), ultrasonographic ALN classification, and PET/CT findings suspicious of positive ALN. Chest CT and MRI were significant factors in the univariate analysis but were excluded from multivariate analysis. Tumor size on MRI could be controversial to evaluate uniformly. These analyses showed that larger tumor size (odds ratio [OR], 1.58; 95% confidence interval [CI], 1.23–2.01; p < 0.001), higher grade ultrasonographic ALN classification (OR 2.03; 95% CI 1.61–2.56; p < 0.001) and PET/CT findings suspicious of positive ALN (OR 2.64; 95% CI 1.47–4.75; p = 0.001) were significant and independent predictors of having ≥ 3 involved ALNs.

Table 3 Multivariate logistic regression analysis for factors associated with involvement of three or more axillary lymph nodes.

The results of this multivariate analysis were used to construct a nomogram predicting the probability of involvement of ≥ 3 positive ALNs. To account for the method of constructing the developed nomogram by giving points to each variable, tumor size by preoperative US, axillary US grade, and PET/CT axillary LN uptake. Then, they were summed to give the total number of points. The factors were used to assign a probability of ≥ 3 positive ALNs for each patient using the scale at the bottom of Fig. 1. The ROC curve of the nomogram was plotted, and the cut-off number of points was used in R statistics. The optimum cut-off was 120 points, which yielded an AUC of 0.856 (95% CI, 0.815–0.897) for the training set (Fig. 2). Table 4 shows the characteristics of the 781 patients in the validation set. Their mean age at diagnosis was 51.9 years, 49 (6.3%) patients had ≥ 3 ALNs, and 137 (17.5%) underwent ALND. The AUC of the validation set was 0.866 (95% CI, 0.799–0.934) (Fig. 3). The actual probability of involvement of ≥ 3 positive ALN for each patient in the validation set was plotted against the calculated predicted probability of ≥ 3 positive ALN to evaluate the accuracy of the nomogram (Fig. 4). The overall predictive accuracy of the model was within an error range of 10%.

Figure 1
figure 1

Nomogram for predicting the probability of having three or more involved axillary lymph nodes. Abbreviations: US, ultrasonography; PET/CT, Positron emission tomography/computed tomography.

Figure 2
figure 2

The performance of the nomogram in the training set was measured using the area under the receiver operating characteristic curve (AUC). AUC of the training set: 0.856 (95% confidence interval, 0.815 to 0.897).

Table 4 Patient characteristics of the validation set (N = 781).
Figure 3
figure 3

The performance of the nomogram in the validation set was measured using the area under the receiver operating characteristic curve (AUC). AUC of the validation set: 0.866 (95% confidence interval, 0.799 to 0.934).

Figure 4
figure 4

Calibration plot of the nomogram using validation cohort.

Table 5 shows the accuracy of the nomogram in predicting the involvement of ≥ 3 ALNs by the calculated specificity and negative predictive value (NPV). This analysis excluded patients with predictive factors that could not be measured, including 195 patients in the training set and 109 in the validation set. The nomogram had a specificity of 89.2% and an NPV of 94.7% using a cut-off of 120 points in the training set. This nomogram yielded false-negative results in 37 (5.2%) patients, who were predicted by the nomogram to have ≤ 2 rather than ≥ 3 LNs. In addition, the nomogram indicated that 126 (15.1%) patients in the training set required intraoperative assessment of frozen sections of SLNs. When applied to the validation set, the nomogram had a specificity of 93.0% and an NPV of 97.2%, and false-negative results were obtained in 16 (2.6%) patients, who were predicted by the nomogram to have ≤ 2 rather than ≥ 3 LNs. The nomogram indicated that 65 (9.7%) patients in the validation set required intraoperative assessment of frozen sections of SLNs.

Table 5 Two-way contingency table analysis showing predictive accuracy of the nomogram.

The nomogram was also applied to 1067 patients who met the selection criteria of the ACOSOG-Z0011 trial (Table 6). The nomogram had a specificity of 93.8%, and an NPV was 96.4% using a cut-off of 120 points. Of the 1067 patients predicted to have ≤ 2 metastatic ALNs, only 37 (3.8%) showed false-negative results. Ninety (8.4%) patients had nomogram scores above the cut-off and required intraoperative assessment of frozen sections of SLNs.

Table 6 Predictive accuracy of the nomogram in patients who meet the ACOSOG-Z0011 criteria.

Discussion

This upgraded nomogram is useful for identifying patients who do not require intraoperative analysis of sentinel lymph nodes. And it improved the predictive accuracy. In contrast to the previous nomogram based on chest CT results, the current nomogram included 18F-FDG PET/CT results. In practice, this nomogram improved the predictive accuracy, with a specificity of 93.8% and an NPV of 96.4%.

Omission of ALND in women who satisfy the eligibility of the ACOSOG-Z0011 trial is a major step in reducing the intensity of surgery and the burden of treatment. The percentage of patients undergoing an intraoperative examination of frozen sections of SLNs also has been declining for over a decade15. Intraoperative evaluation of SLN decreased from 69 (230/335) to 26% (84/323) and from 92 (54/59) to 45% (14/31), according to observational studies which investigated the changes in medical practice after publishing the ACOSOG-Z0011 trial16. The role of frozen sections remains unclear17. Frozen sections have several drawbacks, including increased operation time. Although the SLNB procedure is not time-consuming, the turnaround time for evaluating intraoperative frozen sections is considerable compared to the total operation time. Breast cancer surgery often requires analyzing many tissues with the patient remaining anesthetized until the pathologic outcome is determined. Because the benefit of avoiding reoperative ALND is only 12.4% of patients, routine frozen sections may be indicated for only a selected group of patients, such as those with larger tumors18. Additionally, the process of obtaining intraoperative frozen sections can risk the destruction of potentially diagnostic tissue. The quality of frozen tissues may not be as high as well-fixed tissue preparations, and incomplete sections may exclude the critical subcapsular sinus6. Prior freezing may compromise the quality of paraffin section histology19.

Although noninvasive methods and preoperative imaging modalities have been evaluated20,21,22, their feasibility was insufficient to replace intraoperative frozen sections in the evaluation of SLNs. We previously reported that US classification had a sensitivity of 85% and a specificity of 78% in predicting ALN metastasis with an AUC of 0.861 (95% CI, 0.796–0.926)23. A review of 21 studies evaluating preoperative US-guided needle biopsy reported that the median sensitivity was 64% and the median specificity was 82%24. In a previous study, a prospective evaluation of 512 patients who met the criteria of ACOSOG-Z0011 trial and underwent surgery at our institution from January 2012 to June 2014 found that the intraoperative frozen section for evaluation of SLNs could be omitted in 452 (88.3%) cases. The reoperation rate (final pathology ≥ 3 LNs) was 1.6% (8/512) when the score was low and intraoperative frozen sectioning was not performed7. Recently, neoadjuvant chemotherapy has been increasingly used for early breast cancer. It may be possible to consider applying neoadjuvant chemotherapy in patients who would be predicted over three positive LNs by this nomogram depending on the subtype.

This method also has several disadvantages. First, 18F-FDG PET/CT results are required to apply the nomogram. Although several benefits have been mentioned for the use of PET/CT in early-stage breast cancer, in clinical practice, many clinicians do not schedule preoperative PET/CT in patients with early breast cancer. Recently, a study has revealed that preoperative PET/CT could predict high nodal burden with high accuracy. It appears that preoperative PET/CT is useful to perform before developing a treatment plan for patients with clinical T1-2N0 invasive breast cancer25. Additionally, the NCCN guidelines in 2021 recommend that FDG PET/CT can be performed concurrently with diagnostic CT as a work-up prior to preoperative systemic therapy if clinical T2 or clinical node-positive. In these situations, it will be possible that this nomogram can be used as one of the clinical bases for using PET/CT in early-stage breast cancer. When the patients enrolled in this study were diagnosed, they had little financial burden (5% deductible) because national insurance was applied, so most of them were taken the PET/CT. Although it is not a guideline at present, it might be one of the data for replacing the multiple CT scan of chest-pelvis-abdomen with PET/CT in regions frequently used in early breast cancer. Second, the application of this nomogram requires an experienced US operator23. The US operator measures tumor size and classifies the probability of lymph node metastasis based on the thickness of the cortex and the appearance of the fatty hilum. Contrary to MRI, the US depends on the patient body habitus and the operator's experience26. The US finding for nodal status in this study was applied to grade according to the maximum thickness of the cortex and the appearance of the fatty hilum. Therefore, it depended on the US operator and required an experienced US operator23. Third, the results of MRI were not included in the nomogram. MRI features of breast cancer can help in its diagnosis, making it a frequently used imaging modality in these patients. Non-mass enhancement on MRI may increase suspicion of an invasive lesion, particularly if the enhancement is associated with a focal lesion or exhibits a segmental distribution27. Although univariate analysis showed that larger tumor size on MRI was significantly associated with the involvement of ≥ 3 ALNs, this parameter could be replaced by tumor size in US.

According to the previous and this study, when PET/CT is taken, this nomogram applies to the patients, and for patients without PET/CT, chest CT is used to calculate the previous nomogram. If further research related to this is conducted, it is expected that taking PET/CT will be more advantageous.

In conclusion, we made a nomogram based on preoperative imaging modalities that can predict the involvement of ≥ 3 ALNs in women with early-stage breast cancer. This nomogram had excellent predictive power and was clinically useful. Intraoperative frozen sections for the detection of SLNs could be omitted in a significant number of patients who met the criteria of the ACOSOG-Z0011 trial, with a low rate of reoperation. This method could evaluate the pathology of SLNs in permanent sections, providing more accurate pathologic results, simplified surgical scheduling, saving time and costs. This is useful for identifying patients who do not require intraoperative analysis of SLNs. And, it would be considered that indications for applying neoadjuvant chemotherapy can be made through further studies based on this nomogram. This nomogram can be applied with the other purpose for luminal A and postmenopausal patients. It can select those who will proceed to upfront surgery rather than neoadjuvant chemotherapy if the patient has genomic low risk with multigene assay and limited number of involved LNs is predicted by our nomogram. Moreover, this study could be regarded as one of the backgrounds that PET/CT can replace multiple imaging modalities for preoperative work-up. This processing allows omitting unnecessary ALN assessment during operation.

Methods

Patients

The Seoul National University Hospital Breast Cancer Center (SNUHBCC) database, a relatively large, prospectively maintained web-based database that includes information on all patients who have undergone surgery for breast disease at Seoul National University Hospital since 1982 was reviewed. Detailed information regarding the SNUHBCC database that was prospectively collected after obtaining institutional review board approval has been reported28.

Patients who had clinical T1-2 and N0 invasive breast cancer and underwent preoperative 18F-FDG PET/CT were included. The training set included 1030 consecutive patients who underwent surgery between 2010 and 2013, whereas the validation set included 781 patients who underwent surgery from 2014 to 2015. Patients with a history of breast cancer, palpable ALNs, or carcinoma in situ on preoperative core needle biopsy were excluded, as were patients who received neoadjuvant chemotherapy, those with tumors over 5 cm on the preoperative US, and patients with stage IV breast cancer. Because the ACOSOG-Z0011 trial set the patients who underwent breast-conserving surgery as inclusion criteria, it was necessary to analyze patients who met the criteria to predict how clinically accurate the nomogram would be. After validating the nomogram, 1067 patients who met the inclusion criteria of the ACOSOG-Z0011 trial were selected and analyzed from the training and validation set.

Preoperative imaging

All patients underwent US and 18F-FDG PET/CT for preoperative work-up of the axilla and distant organs. All images were reported by experienced radiologists who had received information that the patients were diagnosed with invasive breast cancer. ALNs were evaluated one day before surgery by axillary US examination. Lymph nodes were prospectively classified by a radiologist. The probability of lymph node metastasis on the US was classified according to the maximum thickness of the cortex and the appearance of fatty hilum, with grade 1 indicating cortical thickness of ≤ 1.5 mm; grade 2 indicating 1.5 mm < cortical thickness ≤ 2.5 mm; grade 3 indicating 2.5 mm < cortical thickness ≤ 3.5 mm; grade 4 indicating cortical thickness > 3.5 mm with an intact fatty hilum; and grade 5 indicating cortical thickness > 3.5 mm with a loss of fatty hilum. The maximum cortical thickness was measured perpendicular to the long axis of the lymph node on a cross-sectional plane23.

18F-FDG PET/CT imaging was taken using a hybrid PET/CT scanner (Biograph 40 TruePoint; Siemens Healthcare, Knoxville, TN, USA). The patients were fasted for at least 6 h prior to being administered 18F-FDG (5.18 MBq/kg of body weight, intravenously), and imaging was performed 1 h later. CT images were acquired from the skull base to the upper thigh area for the attenuation map and lesion localization (50 mA, 120kVp, 5-mm section width, 4 mm collimation). After CT scanning, PET images of the same area were acquired in three-dimensional mode at six or seven-bed positions (1 min per bed position, 21.6 cm increments). Images were reconstructed on 128 × 128 matrices using an iterative algorithm. The PET/CT images were analyzed using a dedicated workstation and analysis software (Syngo.via, Siemens Healthcare) and interpreted by institutional nuclear medicine physicians individually as a standard-of-care examination. Positivity was defined as the PET/CT ALN uptake SUV set to 1.4 or over.

Management of ALNs

SLNs were detected intraoperatively using the SLNB technique, a radioisotope, and/or a blue dye. Alternatively, Tc-99 m antimony sulfur colloid (0.4 mCi) was intradermally injected 1 to 6 h before surgery into the quadrant in which the tumor was located. Lymphoscintigraphic images were attained about 40 min after injection, and SLNs were detected during operation by a gamma probe (NEO2000; Neoprobe Co., Dublin, USA). Immediately before surgery, 1 cc aliquots of 0.8% indigo carmine dye were injected intradermally into four subareolar areas around each areola. SLNs were defined as nodes with the hottest node, and any other nodes increased radioactivity at least 10% of the hottest node by the gamma probe and/or stained with blue dye. SLNs and suspicious metastatic nodes in the surgical field were removed; most were bisected and examined by hematoxylin and eosin staining of frozen sections during the surgery. ALND was performed when malignant cells were found in one or more sentinel lymph nodes. Postoperatively, SLNs were fixed in formalin, embedded in paraffin, and sectioned at 4 µm thickness. Patients with positive SLNBs in the training and validation underwent complete level I and II ALND.

Statistical analysis

The associations of ≥ 3 involved SLNs with patient demographic characteristics, tumor characteristics on biopsy, and preoperative work-up imaging results were evaluated by Fisher’s exact test. Multivariate logistic regression analyses were performed using combinations of continuous variables (age, tumor size on US, and ultrasonographic ALN classification) and dichotomized variables (positivity of ALN on PET/CT). A nomogram was generated based on a multivariate logistic regression model that predicted the probability of involvement of ≥ 3 ALNs. A forward stepwise selection method and likelihood ratio test were used to select a subgroup of all analyzed factors.

The performances of the nomogram were assessed using receiver operating characteristic (ROC) curve analysis and calculation of the areas under the ROC curves (AUCs). Calibration of the nomogram was evaluated by plotting the observed probabilities against the predicted probabilities calculated with the nomogram. A perfectly accurate nomogram prediction model would produce a plot where the observed and predicted probabilities for given groups fall along the 45-degree line. The distance between the pairs and the 45-degree line is a measure of the absolute error of prediction of the nomogram29. All statistical analyses were performed using SPSS ver. 21.0 (SPSS Inc., Chicago, IL) and R Software ver. 3.6.3 (http://www.r-project.org/), with p < 0.05 considered statistically significant.

Ethics approval and consent to participate

This study was approved by the Institutional Review Board of the Catholic Medical Center (IRB no. OC21EISI0076) and was conducted in accordance with the tenets of the Declaration of Helsinki. The requirement for informed consent was waived.