Introduction

Gallbladder adenocarcinoma (GBA), the sixth most common gastrointestinal cancer, is a rare disease with poor prognosis1. GBA is characterized by asymptomatic onset with rapid disease progression and is often diagnosed at advanced stages2. At present, radical surgical excision is the recommended treatment for GBA patients without distant metastasis (M0). Therefore, determining the prognostic indicators to predict outcomes for non-metastatic GBA (NM-GBA) patients after surgery will be crucial for developing possible adjuvant therapy and follow-up strategies.

Currently, survival prediction for GBA patients is based largely on the 8th edition tumor-node-metastasis (TNM) staging guidelines set forth in the American Joint Committee on Cancer (AJCC) staging system3. However, this staging system only incorporated factors that assess the local extension of the primary tumor, lymph node involvement and distant metastasis, ignoring other important factors such as age, gender, marital status and treatment4,5,6, which reduced the accuracy of the stratification.

Nomograms, incorporating and illustrating important prognostic factors in a visible way, have become a superior methods to accurately predict outcomes for patients with various cancers7,8. Previously, researchers have attempted to construct nomograms to predict the prognosis of patients with GBA9,10. Some novel LN schemes, such as lymph node ratio (LNR) and log odds of positive lymph node (LODDS), showed more accurate predictive ability than AJCC N status11,12,13. But most of these studies included GBA patients with distant metastases (M1 status)14,15. According to the 8th edition AJCC TNM-staging system, GBA patients with distant metastasis are staged as IVB and often unable to receive radical surgery, leading to extremely poor prognosis. The inclusion of M1 patients to evaluate predictive value of LNR and LODDS would result in bias. Investigating predictive capability of LNR and LODDS in NM-GBA patients is more reasonable and applicable.

To our knowledge, no previous study has examined the different LN staging/scoring systems in NM-GBA patients. Therefore, we attempted to compare the predictive value of N status, LNR, LODDS and construct a novel nomogram based on the optimal LN system and other prognostic factor to predict the prognosis of NM-GBA patients undergoing surgical treatment.

Material and methods

Patient selection

We identified patients diagnosed with GBA from January 1, 2000, to December 31, 2018 in Surveillance, Epidemiology and End Results (SEER) database. All data analyses were performed using SEER*Stat software version 8.3.9. Inclusion criteria were as follows: the International Classification of Diseases for Oncology, Third Edition (ICD-O-3) code was 8140/3, 8141/3, 8144/3, 8210/3, 8211/3, 8260-63/3, and 8310/33; primary site label was C23.9; GBA as the only or first primary tumor was confirmed by histology diagnosis. As defined by the SEER database, M1 meant that distant metastases occurred at the time of initial diagnosis. The exclusion criteria were as follows: distant metastasis; patients did not undergo surgical resection; missing information on age, ethnicity, marital status, LN status, tumor stage, radiotherapy and chemotherapy status. Our detailed workflow was shown in Fig.Ā 1.

Figure 1
figure 1

Enrollment flow chart of eligible patients in the present study.

Cohort definition and variable screening

Based on the inclusion and exclusion criteria, 1035 patients were included in the discovery cohort. By using the "createDataPartiton" function in R (version 4.1.0), the discovery cohort was divided into training and validation cohorts with a ratio of 7:3. Training cohort was used to build models, whereas validation cohort was used to compare models. A total of 15 different clinical variables were included. Staging was reclassified using best available data according to AJCC eighth edition criteria. In this study, LNR was defined as the ratio of the number of positive lymph nodes (NPLN) to the number of resected lymph nodes (NRLN). LODDS was calculated by log[(NPLNā€‰+ā€‰0.05)/(NRLN-NPLNā€‰+ā€‰0.05)].

Two continuous variables (LNR and LODDS) were all trichotomized via the X-tile software (version 3.6.1; Yale University, New Haven, CT, USA) based on the maximal log-rank chi-square value, which represented the greatest group difference in outcome probability16. In our study, LNR was categorized into LNR1 (range, 0 to 0), LNR2 (range, 0.04 to 0.88), and LNR3 (range, 0.89 to 1.0). LODDS was classified into LODDS1 (range,ā€‰āˆ’ā€‰2.9 toā€‰āˆ’ā€‰1.61), LODDS2 (range,ā€‰āˆ’ā€‰1.36 to 0.83), and LODDS3 (range, 0.89 to 2.42). Performance of LODDS, LNR, and N status were evaluated from multiple dimensions. The likelihood-ratio (LR) test was used to assess homogeneity between groups. Akaike Information Criterion (AIC) and bayesian information criterion (BIC) were applied to test goodness of fit17. Harrell concordance index (C-index) was calculated to assess the accuracy of prediction. Univariate and multivariate Cox proportional hazards regression models were utilized to detect independent prognostic factors.

Statistical analysis

The multivariate model used a forward stepwise variable selection procedure based on AIC. Model discrimination was assessed using the time-dependent area under the ROC curves (AUC) and the C-index, and calibration was measured using calibration plots. Based on previous research, C-index and AUC value greater than 0.7 indicated that the nomogram had good predictive power. Using the AJCC TNM staging system as a reference, integrated discrimination improvement (IDI) and net reclassification improvement (NRI)18,19 were calculated to assess the improvement of clinical benefits; Decision curve analysis (DCA)20 was used to evaluate the clinical utility of the nomogram. Survival analysis was performed using Kaplanā€“Meier curves and Cox proportional hazard models to compare the risk stratification effect of the nomogram with the AJCC staging system.

The primary endpoint was overall survival (OS), defined as the time from tumor excision to death or the last follow-up. Multicollinearity was checked using the variance inflation factor (VIF). Variables with VIFā€‰>ā€‰4 were removed. Categorical variables were compared by the Chi-square test, while the Kruskalā€“Wallis test was utilized to compare continuous data across groups. All tests were two-tailed, and P valuesā€‰<ā€‰0.05 were considered statistically significant. For all statistical analyses, R (version 4.1.0) was performed.

Ethical statement

This study was conducted in accordance with the ethical guidelines mandated by the Declaration of Helsinki. With use of publicly available data, this study was considered exempt from review and thus no patient written informed consent was needed. We signed the 'Surveillance, Epidemiology, and End Results Program Data Use Agreement' for accessing the SEER database legally. The author ST Jiang has gotten access to the SEER database (accession number: 16609-Nov2020).

Results

Patient characteristics

A total of 1035 patients with NM-GBA were included and randomly assigned into a training (nā€‰=ā€‰727) and validation (nā€‰=ā€‰308) cohorts. Cumulative survival in the whole cohort was 73.5% at 1Ā year, 44.2% at 3Ā year, and 33.9% at 5Ā year. Baseline characteristics of the patients included in the training and validation sets were balanced (Table 1).

Table 1 Demographic and clinical characteristics of patients with NM-GBA.

Variables screening

The C-index, LR, AIC and BIC were used to compare the predictive abilities of N stage, LNR, and LODDS. The LODDS system showed the highest C-index and LR test as well as the lowest AIC and BIC, which indicated that the superior performance of LODDS (C-index: 0.648, LR test: 133.4, AIC: 5863.572, BIC: 5872.009) over LNR (C-index: 0.637, LR test: 128, AIC: 5868.957, BIC: 5877.395) and N status (C-index: 0.622, LR test: 104.1, AIC: 5892.882, BIC: 5901.309) in terms of predicting OS for NM-GBA. To determine the optimal model, we performed forward stepwise regression with the LODDS and other ten significant modules. Finally, age, sex, chemotherapy, stage, grade, LODDS, and size were included. The univariate Cox regression analysis revealed that 9 variables (age, sex, stage, grade, LODDS, size, race, T, and marital) were significantly associated with OS in the training cohort. In multivariate Cox regression analysis, age, sex, chemotherapy, stage, grade, LODDS, and size remained as independent prognostic factors (Table 2).

Table 2 Univariate and multivariate Cox analyses on variables for the prediction of overall survival of NM-GBA patients.

Nomogram construction and validation

The new prognostic model for NM-GBA based on these variables was constructed. A nomogram displaying the predictive variables and corresponding point scales was shown in Fig.Ā 2. The nomogram estimated the survival probability for a patient based on a total score which is calculated by the addition of zero to 100 points for every individual predictor. Most patients in the present study had total risk points ranging from 137 to 384.

Figure 2
figure 2

A constructed nomogram for prognostic prediction of a patient with NM-GBA. For category variables, their distributions were reflected by the size of the box (to view boxes of the stage, the smaller one represents stage I and the bigger one represents stage II). The density plot of total points shows the distribution. The patient, a 29-year-old man, had a tumor of 24Ā mm, G1, stage II, LODDS 1 (āˆ’2.479), and did not receive chemotherapy. To use the nomogram, the specific points (black dots) of individual patients were located on each variable axis. Red lines and dots were drawn upward to determine the points received by each variable; the sum (200) of these points was located on the Total Points axis, and a line was drawn downward to the survival axes to determine the probability of 1-year (91.3%), 3-year (74.2%) and 5-year (65.0%) overall survival. NM-GBA non-metastatic gallbladder adenocarcinoma, LODDS log odds of positive lymph nodes.

The C-index of the nomogram in the training and validation cohorts were 0.730 (0.708ā€“0.752) and 0.746 (0.715ā€“0.777), respectively. The time-dependent AUC wasā€‰>ā€‰0.7 for the prediction of OS within 1, 3, and 5Ā years in both the training and validation cohorts (training cohort: 1-year AUCā€‰=ā€‰0.802 (0.766ā€“0.838), 3-year AUCā€‰=ā€‰0.803 (0.770ā€“0.835), 5-year AUCā€‰=ā€‰0.794 (0.756ā€“0.832); validation cohort: 1-year AUCā€‰=ā€‰0.784 (0.749ā€“0.819), 3-year AUCā€‰=ā€‰0.784 (0.751ā€“0.817), 5-year AUCā€‰=ā€‰0.786 (0.751ā€“0.821)), indicating favorable discrimination of the predictive model. Calibration plots demonstrated good agreement between nomogram-predicted and observed events (Fig.Ā 3aā€“f). The high calibration and discrimination performance of the nomogram was confirmed in the validation cohort.

Figure 3
figure 3

Calibration curves and decision curve analysis of the nomogram for the survival prediction of patients with NM-GBA. (aā€“f) Calibration curves of 1-year, 3-year, and 5-year OS in the training and validation cohorts. (gā€“l) 1-year, 3-year, and 5-year survival benefit of in the training and validation cohorts. NM-GBA non-metastatic gallbladder adenocarcinoma, OS overall survival.

Clinical value of the nomogram compared with the AJCC 8th edition TNM-staging system

In order to evaluate the accuracy of change in risk classification, we calculated the NRI, IDI, C-index, and AUC of the nomogram and the AJCC 8th edition TNM-staging system. When testing the nomogram in training cohort, the NRI for the 1-, 3-, and 5-year OS were 0.648 (95% CIā€‰=ā€‰0.532ā€“0.862), 0.625 (95% CIā€‰=ā€‰0.480ā€“0.757), and 0.589 (95% CIā€‰=ā€‰0.434ā€“0.739), the IDI for the 1-, 3-, and 5-year OS were 0.073 (95% CIā€‰=ā€‰0.058ā€“0.088, Pā€‰<ā€‰0.001), 0.102 (95% CIā€‰=ā€‰0.072ā€“0.132, Pā€‰<ā€‰0.001), and 0.099 (95% CIā€‰=ā€‰0.081ā€“0.117, Pā€‰<ā€‰0.001), and the change of C-index was 0.191 (95% CIā€‰=ā€‰0.169ā€“0.213, Pā€‰<ā€‰0.001). In addition, we compared the AUC values of different prediction models. The results showed that the AUC values for the nomogram were higher than the AJCC 8th edition TNM-staging system (Table 3).

Table 3 NRI, IDI, C-index, and AUC of the nomogram and the AJCC 8th edition TNM-staging system in survival prediction for NM-GBA patients.

This results indicated that the prognostic performance of the new-built model was superior than that of the traditional AJCC TNM staging system. DCA showed the nomogram could better predict the 1-year, 3-year, and 5-year OS in NM-GBA patients. Compared to the AJCC 8th edition system, the nomogram added more net benefits to almost all threshold probabilities in the training and validation cohorts (Fig.Ā 3gā€“l).

Clinical risk stratification of NM-GBA patients based on nomogram score

We finally stratified the risk of patients in the training and validation cohorts based on the total score calculated by the nomogram. Patients can be divided into four groups: nomo 1 (total scoreā€‰<ā€‰223), nomo 2 (223ā€‰ā‰¤ā€‰total pointsā€‰<ā€‰265), nomo 3 (265ā€‰ā‰¤ā€‰total pointsā€‰<ā€‰296), and nomo 4 (total pointsā€‰ā‰„ā€‰296). As shown in Fig.Ā 4, the differences between the Kaplanā€“Meier survival curves of all four groups of patients in Nomo I-IV were statistically significant (Pā€‰<ā€‰0.05). However, the differences between Stage II and I were not statistically significant. In addition, after risk stratification according to the AJCC stage, there was a significant intersection of Kaplanā€“Meier survival curves for patients in different stages. The Kaplanā€“Meier survival curves showed significant discrimination among the four risk groups of the Nomo staging system. However, the AJCC 8th edition staging system performed limited ability to identify patients at high and low risk in training and validation cohorts (Fig.Ā 4).

Figure 4
figure 4

Kaplanā€“Meier overall survival curves of patients with NM-GBA at different stages or with different risks stratified by the nomogram. (a) NM-GBA patients in the training cohort at different stages were classified according to the AJCC 8th edition TNM-staging system. (b) NM-GBA patients in the training cohort at different risks were stratified according to the nomogram. (c) NM-GBA patients in the validation cohort at different stages were classified according to the AJCC 8th edition TNM-staging system. (d) NM-GBA patients in the validation cohort at different stages were stratified according to the nomogram. NM-GBA non-metastatic gallbladder adenocarcinoma; AJCC American Joint Committee on Cancer, TNM tumor, node, metastasis.

Discussion

This study found that LODDS was the best independent indicator to describe the relationship between LN metastasis status and prognosis in patients with NM-GBA who underwent surgical treatment. The nomogram constructed using LODDS and six other variables (grade, sex, chemotherapy, size, age, and stage) could accurately predict the prognosis at 1-, 3-, and 5-years in patients with NM-GBA. Its prediction accuracy was higher than that of AJCC 8th edition TNM-staging system.

Currently, adjuvant therapy after surgery for patients with GBC is still controversial. Three expert groups had provided consensus guidelines for the adjuvant treatment of GBC after resection. The American Society of Clinical Oncology (ASCO) clinical practice guideline recommended that all patients with GBC who undergo radical resection should receive six months of adjuvant capecitabine21. However, the National Comprehensive Cancer Network (NCCN) guidelines recommended options including observation, fluoropyrimidine-based radiotherapy, fluoropyrimidine-based chemotherapy, or gemcitabine-based chemotherapy for patients with stage T1b or greater, negative margins, and negative lymph nodes. Observation only was recommended for patients with post-excisional (incidentally discovered) stage T1a lesions with negative margins22. For patients with positive margins or positive lymph nodes, options included fluoropyrimidine-based chemotherapy, gemcitabine-based chemotherapy, fluoropyrimidine-based radiotherapy, or combination therapy22. 2016 European Society of Medical Oncology (ESMO) consensus guidelines recommended that adjuvant therapy be considered for resected GBC only after a riskā€“benefit assessment23. The nomogram we developed provided an accurate and reliable risk stratification method for clinical practice. For low-risk patients, intensive observation might be a reasonable strategy, which might help patients avoid harm caused by excessive medical treatment. For high-risk patients, such as Nomo III or Nomo IV, aggressive postoperative adjuvant chemotherapy or radio-chemotherapy might be necessary. Previous studies have shown that it might improve the prognosis of patients. Therefore, we believe that risk stratification based on the nomogram could assist clinicians to identify patients for more active follow-up and adjuvant postoperative treatment.

Since a report in 2015 by Amini N et al., a total of 7 articles have analyzed the prognostic value of LODDS in gallbladder cancer (GBC). Amini N et al. firstly analyzed 1124 patients with GBA in the SEER database from 2004 to 2010. They found that LODDS performed better than AJCC N status and LNR when the NRLNā€‰ā‰„ā€‰4. In a subsequent study, they confirmed the predictive advantage of LODDS in a multicenter cohort of 214 patients with GBA24. In addition, Lee W et al.'s analysis of 398 patients with GBC showed that LODDS was the most appropriate indicator when NRLNā€‰ā‰„ā€‰625. Chen C et al. reached a different conclusion by analyzing a multicenter cohort of 226 patients with GBC. They used three different modeling methods to demonstrate that NPLN was the best LN system among various descriptions of LN status. However, they did not propose a new model based on LODDS to predict the prognosis of GBC patients. A total of three previous studies have constructed a prediction model including LODDS to predict the OS of patients with GBC. In their study, the C-index in the training set was 0.735ā€“0.752, and the C-index in validation set was 0.719ā€“0.75214,15,26. Table 4 summarized the major studies that developed LODDS-based predictive nomograms in GBC management.

Table 4 Comparison of major studies that developed predictive nomograms included LODDS in GBC.

Although all of the above nomograms perfomed better predictive ability than AJCC 8th edition TNM-staging system, they included patients with distant metastases (M1) in the study cohort. The study by Xiao et al. aimed to assess the prognostic value of different lymph node staging systems and to develop a prognostic prediction model after curative surgery26. However, in their study, patients with stage M1 accounted for 13.5%. In fact, M1 status represented that patients showed distant metastasis at the time of evaluation. These patients almostly lost the opportunity to undergo radical surgery and the extent of LN dissection varies widely. Therefore, it is more reasonable to avoid the inclusion of these patients when evaluating the predictive power of the LN staging system27. Instead, the present study only included patients with NM-GBA to more accurately assess the predictive power of LODDS. In addition, we also noticed that in previous studies, researchers did not pay attention to the collinearity of variables, which would cause the bias of the model. Therefore, in this study, we not only included as many variables as possible but also used the stepwise regression method to screen independent predictors, and calculated the collinearity of variables, thus ensuring the nomogram more stable.

A total of seven variables (age, sex, chemotherapy, stage, grade, LODDS, and size) were incorporated to construct the nomogram. In accordance with previous studies, advanced age, male gender, high tumor stage, high tumor grade, and large tumor size were independent prognostic factors associated with OS of GBC patients28,29,30. By analyzing 1137 patients with GBC in the SEER-Medicare database, Samuel J. Wang et al. found that some GBC patients could benefit from adjuvant chemotherapy31. Interestingly, in this study, chemotherapy was not significantly associated with prognosis in univariate Cox regression analysis, but it showed as independent prognostic factor in multivariate Cox regression analysis and stepwise regression analysis. The possible reason was that there might be a certain correlation between chemotherapy and other confounding factors. In univariate Cox regression analysis, the true effect of this factor was covered by the effect of other confounding factors. Only when the effect of other factors were eliminated through multivariate Cox regression analysis can the prognostic effect of chemotherapy be revealed32.

We divided patients into nomo1-4 groups according to their nomogram total points. The Kaplanā€“Meier method and Cox hazard ratio model demonstrated significant differences in OS among the four risk groups with better discrimination than the conventional staging system. In particular, the nomogram had greater ability to recognize the high-risk population than the conventional staging system. Due to the poor outcomes, particular attention should be paid to patients with total pointsā€‰ā‰„ā€‰296.

There were limitations to our study. Firstly, this study was a retrospective study based on the SEER database. Although we performed internal validation to confirm the nomogram's excellent predictive power, external validation based on multicenter cohorts would be needed in future studies. Secondly, the cut-off values of variables included in this study were obtained by X-tile software, which might not be applicable to other research. The appropriate cut-off values need to be explored by larger samples. In addition, because the SEER database only recorded cursory information on the surgical approach and chemotherapy, it was not easy to define the two rigorously in our manuscript, which might reduce the reproducibility of this study. Finally, The SEER database only included population from each state in the United States, so whether the prediction model established thus could be directly applied to studies on different population backgrounds needed further verification.

Conclusion

In conclusion, the LODDS-based nomogram has a significantly higher accuracy than the AJCC 8th edition TNM-staging to predict OS for NM-GBA patients. The novel staging system based on the nomogram score can intuitively and quantitively stratify NM-GBA patients after surgical treatment and further assist to design postoperative adjuvant chemotherapy.