Introduction

Primary liver cancer, the fourth leading cause of cancer-related mortality and the sixth most common cancer, is a global health issue1,2. Histologically, primary liver cancer includes three main subtypes: hepatocellular carcinoma (HCC), intrahepatic cholangiocarcinoma (ICC), and mixed hepatocellular cholangiocarcinoma3. HCC is the most frequent subtype of liver cancer, which accounts for more than 80% of all primary liver cancers4,5, in frequency only to HCC, ICC arises from the epithelial layer of the second-degree biliary tract and is highly malignant6. The prognosis of patients with HCC and ICC is a research hotspot because new and more effective treatment strategies need to be based on information regarding prognostic risks. Prognostic factors such as patient age, tumor grade, and the American Joint Committee on Cancer (AJCC) stage have been used to predict patients’ survival time and response to treatment7,8. However, oncologists face challenges with the use of these unconsolidated factors; therefore, it is necessary to integrate multiple prognostic factors into an easy-to-use predictive system to better inform oncologists and more accurately stratify patients.

A nomogram is a predictive tool that creates a simple graph based on a predictive statistical model9. It can be used to calculate the probability of a clinical event by considering the prognostic weight of each factor. Nomograms have been widely used to assist clinical decision- making10,11,12. This study aimed to incorporate important factors obtained from the Surveillance, Epidemiology, and End Results (SEER) database to construct and validate a nomogram for predicting the cancer-specific survival (CSS) of patients with HCC and ICC.

Methods

Ethics statement

Informed patient consent was not required for data obtained from SEER, as cancer is a publicly reportable disease in every state in the USA.

Study population

We obtained patient data from the SEER 18 Research Data + Hurricane Katrina Impacted Louisiana Cases, Nov 2018 Sub (2010–2015 varying) incidence database, using SEER*Stat version 8.3.6. Patients diagnosed with HCC or ICC (primary site code 220 or 221) were included. Cases that were not confirmed by microscopy or only by autopsy were excluded, as were those with missing or incomplete additional data. The following additional data (variables) were used in the analysis: patient age, race, and sex; AJCC staging for the extent of tumor (T), extent of spread to lymph nodes (N), and presence of metastasis (M); and tumor grade, surgery (Y/N), alpha fetoprotein (AFP) level, fibrosis score, months of survival, and cause of death. We applied the seventh edition of the AJCC staging system, which was available for patients between 2010 and 2015. Ultimately, we identified 3227 eligible patients for inclusion in our study.

Statistical analysis

For nomogram construction and validation, we randomly divided all the HCC patients into training (n = 2123) and validation (n = 915) cohorts, in a ratio of 7:313,14. Multivariate Cox proportional hazards regression analysis was performed to identify variables (P < 0.05) that significantly affected CSS in the training group. Using these identified prognostic factors, we constructed a nomogram for predicting 3- and 5-year CSS rates in HCC and ICC patients15.

The nomogram was validated internally in the training cohort and externally in the validation cohort. To evaluate the discriminative ability of the nomogram, we used the concordance index (C-index) and the receiver operating characteristic (ROC) curve and assessed the area under the curve (AUC)16,17. A C-index or AUC of 0.5 indicates a discrimination ability that is no better than chance, whereas that of 1.0 indicates a perfect discrimination ability18. Calibration curves were constructed using a bootstrap approach, with 500 resamples, to compare the predicted CSS with the CSS observed in the study.

All statistical analyses were conducted using SPSS (version 24.0; SPSS, Chicago, IL, USA) and R software (version 3.6.1; http://www.r-project.org/). A P value of less than 0.05 was considered to indicate statistical significance.

Results

Patient characteristics

A total of 3227 patients diagnosed with HCC (3038) and ICC (189) between 2010 and 2015 were included in this study. The training and validation cohorts of HCC patients consisted of 2123 and 915 cases, respectively, selected by the random split-sample method (split ratio: 7:3). In the total cohort of HCC patients, the majority of patients were under 65 years old (59.4%), white (66.5%), and male (76.0%). Furthermore, most of the patients had T1 (47.6%), N0 (95.7%), and M0 (93.5%); patients with a grade II tumor differentiation degree accounted for 50.5% of all cases. A large proportion of the patients had elevated AFP levels (64.9%) and cirrhosis (69.5%), and 46.6% of patients underwent tumor resection surgery. The characteristics of HCC patients in the training and validation cohorts were similar to those in the total cohort (Table 1).

Table 1 HCC patient characteristics in the study.

For ICC patients, the training and validation cohorts consisted of 132 and 57 cases respectively. In the total cohort of ICC patients, most of characteristics (Age, Race, Sex, AJCC stage, Surgery and Grade) were similar to HCC patients. On the contrary, the majority of ICC patients had normal AFP levels (73.5%) and fibrosis level (66.7%). The characteristics of ICC patients in the training and validation cohorts were similar to those in the total cohort (Supplementary Table S1).

Screening for prognostic factors of CSS

For HCC patients, based on univariate and multivariate Cox proportional hazards regression analyses, we identified six independent prognostic factors in the training cohort. AJCC T2/T3/T4 (hazard ratio [HR] = 1.280/2.221/3.352, P < 0.01), AJCC M1 (HR = 2.540, P < 0.001), tumor resection (HR = 0.363, P < 0.001), liver transplantation (HR = 0.096, P < 0.001), grade II/III/IV (HR = 1.234/1.998/2.214, P < 0.05), elevated AFP level (HR = 1.336, P < 0.001) and cirrhosis (HR = 1.356, P < 0.001) were all significantly associated with CSS in HCC patients (Table 2).

Table 2 Univariate and multivariate Cox regression analysis based on all variables for HCC patient cancer-specific survival (training cohort).

For ICC patients, only three independent prognostic factors in the training cohort were identified. AJCC T3 + T4 (HR = 1.913, P = 0.015), AJCC M1 (HR = 4.036, P < 0.001), and surgery (HR = 0.328, P < 0.001) were all significantly associated with CSS in ICC patients (Supplementary Table S2).

Nomogram construction

Due to the small sample size (189) and few independent prognostic factors of ICC patients, the nomogram was only constructed for HCC patients. A nomogram based on the selected prognostic factors from the training cohort was developed for the prediction of HCC patient CSS at 3 and 5 years (Fig. 1). The nomogram demonstrated that surgery contributed the most to prognosis, followed by AJCC T, AJCC M, grade, fibrosis and AFP level. Each level of every variable was assigned a score on the points scale. The total score was obtained by adding the scores of each of the selected variables. The prediction corresponding to this total score then helped in estimating the 3- and 5-year CSS for each HCC patient.

Figure 1
figure 1

The nomogram predicting CSS in patients with HCC. Each factor was given a point on the basis of the nomograms. The total points were obtained by adding the given points of all factors. The estimated 3- and 5-year probabilities of CSS of the individual patient can be easily obtained from the nomogram based on the total points.

Nomogram validation

The C-index of the nomogram (training cohort = 0.790, validation cohort = 0.806) was higher than which based on the AJCC stage (training cohort = 0.691, validation cohort = 0.675). Furthermore, the AUCs of the nomogram were higher than AJCC stage in both training (3-year AUC: 0.811 vs. 0.715, 5-year AUC: 0.793 vs.0.703, Fig. 2A,B) and validation (3-year AUC: 0.834 vs. 0.679, 5-year AUC: 0.808 vs. 0.667, Fig. 2C,D) cohorts for 3- and 5-year. These results indicate that the discrimination performance of the model was better than traditional AJCC stage in both the training and validation cohorts. The calibration plots for the 3- and 5-year CSS indicated that there was good agreement between the actual observations and predictions made using the nomogram in both the training cohort (Fig. 3) and the validation cohort (Fig. 4).

Figure 2
figure 2

ROC curves of the Nomogram and AJCC stage in prediction of prognosis at 3- (A) and 5-year (B) point in the training cohort. ROC curves of the Nomogram and AJCC stage in prediction of prognosis at 3- (C) and 5-year (D) point in the validation cohort.

Figure 3
figure 3

3- (A) and 5-years (B) calibration curves for probability of HCC patient CSS nomogram construction in training cohort (bootstrap = 500 repetitions).

Figure 4
figure 4

3- (A) and 5-years (B) calibration curves for probability of HCC patient CSS nomogram construction in validation cohort (bootstrap = 500 repetitions).

Discussion

Determining accurate tumor prognosis after definitive treatment is important19. Although the AJCC staging system is widely used for predicting prognosis in primary liver cancer patients, it has inherent defects because it neglects many significant risk factors such as race, age, and grade. Nomograms have been shown to be more accurate and user-friendly than the conventional staging system in many cancers20,21. In this study, we constructed a more comprehensive model based on a combination of various risk factors to better predict prognosis in HCC patients.

To identify independent prognostic factors, we performed univariate and multivariate Cox proportional hazards regression analysis. Univariate analysis indicated that race, sex, AJCC T, AJCC N, AJCC M, surgery, grade, AFP level, and fibrosis were potential prognostic factors for HCC patients. After multivariate analysis, only AJCC T, AJCC M, surgery, grade, AFP and fibrosis were independent prognostic factors for HCC patients and the nomogram was constructed based on those six factors. Our results showed that HCC patients who underwent liver transplantation had a better prognosis than those who underwent tumor resection. This is a useful conclusion for both doctors and patients.

For ICC patients, only AJCC T, AJCC M, and surgery were potential prognostic factors after univariate analysis and multivariate analysis. These factors were quite similar to traditional AJCC staging system and the sample size (189) of ICC patients was relatively small. Therefore it was not meaningful to construct a nomogram based on these three factors for ICC patients. A large sample size study including more risk factors in the future would be needed.

It is widely known that a model has relatively good discrimination if its C-index and AUC exceed 0.722; therefore, our model has a good discrimination ability. Furthermore, the calibration plot indicated that the CSS probabilities predicted by our nomogram were identical to the actual ones. The validation results indicate that our nomogram could be effective for application in the clinical setting.

Our study has some limitations. First, this large-sample retrospective study was based on the SEER database, which may have some inherent biases. Second, data regarding several potential important prognosis-related factors such as HBsAg, AST, CEA, and vascular invasion were not available in the SEER database. Third, due to the small sample size (189) and few independent prognostic factors of ICC patients, the nomogram was only constructed for HCC patients. Finally, our nomogram was internally validated, and it needs to be validated externally using other populations.

Conclusion

We constructed and validated a nomogram for predicting the 3- and 5-year CSS in HCC patients. The proposed nomogram considered six independent risk factors: AJCC T, AJCC M, surgery, tumor grade, AFP level and fibrosis. We have confirmed the precise calibration and excellent discrimination power of our nomogram. The predictive power of this nomogram may be improved by considering other potential important factors that we could not be obtained from the SEER database, and also by external validation.