Introduction

As a common type of malignancy (the fifth most common cancer and the third major reason of cancer-associated death worldwide1), more than 25,000 new cases of gastric cancer (GC) and 11,000 fatal cases were determined in U.S. for 20192. The onset of GC presents strong regional and gender features. Nearly 70% of patients with GC were diagnosed in developing countries, while the incidence of GC in male population is twice as much as female population3. The most prevalent type of GC is gastric adenocarcinoma4, which are further classified into different histologic subtypes according to the Lauren classification5, including diffuse (32%), intestinal (54%), and indeterminate type (15%). The initial stages of GC are usually asymptomatic and hard to be detected, so most of patients are diagnosed at advanced stages. This calls out a huge challenge which urges individualized and precise treatment for such patients.

Surgical operation brings a curative hope for the vast majority of patients and is regarded as the major foundation of holistic management of GC. Especially, the recent development of sentinel lymph node biopsy and indocyanine green fluorescence further increase the achievement ratio of stomach-sparing procedures, thus greatly improving quality of life without compromising oncological radicality6. However, while completely surgical resection can eliminate the visible lesions in the field of operation7, tumor recurrence still potentially occurs because of the residual micro-metastatic cells outside of the surgical field. Such unseen micro-metastatic cells eventually evolve into a tumor lump that can cause great suffering and burden for patients8. Thus, an effective way for the prognostic evaluation after surgery becomes urgently necessary.

Intestinal-type gastric adenocarcinoma (IGA) is the commonest subtype of GC9, which is most often diagnosed in older patients10. It’s reported that some ordinary environmental factors (including smoking, alcohol, diet, and H. pylori infection) are tightly associated with IGA11. However, the factors influencing the postoperative prognosis of IGA have not been detailly depicted yet.

Recently, the nomogram has been widely used to provide accurate evaluation for oncological disease because of its’ simple, intuitive, and practical characteristics12,13. In this study, we planned to use the Surveillance, Epidemiology, and End Results (SEER) database to develop more detailed nomogram to predict the survival of postoperative IGA patients. On the other hand, the tumour-lymph node-metastasis (TNM) stage system on basis of the anatomic extent of tumour is released by American Joint Committee on Cancer (AJCC) and have always been the benchmark in clinical practice around the world. Thus, the predication performance of our nomograms was evaluated by comparison with AJCC TNM stage system.

Methods

Patient selection

The data of IGA patients (between 2000 and 2018) was screened from SEER 18 registries database (with additional treatment fields) using SEER  Stat software (version. 8.4.0). As the Data Use Agreement to the SEER Program has been signed by us, we were allowed to access the SEER data without the need to apply for local ethical approval or declaration. The data utilized for the current study was extracted according to strict inclusion and exclusion criteria. The information of total 9459 patients were obtained following SEER variables: age at diagnosis, sex, race, marital status, surgery type, tumor grade, primary site, summary stage, AJCC TNM stage, chemotherapy, radiation therapy, the number of regional nodes examined (RNE) and regional nodes positive (RNP), tumor size and survival time. The exclusion criteria were listed below: (1) cases with non-surgical treatment or unknow, (2) unknow AJCC TNM stage at diagnosis, (3) unclear characteristic data. Figure 1 displayed the flowchart of data screen.

Figure 1
figure 1

Flow diagram of patients’ screen.

Data collection, construction and validation of the nomogram

The included IGA patients were randomly divided into training and validation cohorts at a ratio of 7:3 using completely randomized digital table. The training set was used to establish the nomogram, and then the validation set was chosen to optimize and evaluate the model parameters. In this study, we extracted 16 clinicopathological factors from the SEER database: age, sex (male and female), race (white; black; other), marital status (single; married; other), surgery type (local tumor excision; partial/subtotal/hemi-gastrectomy; near total or total gastrectomy; gastrectomy with removal of a portion of esophagus; gastrectomy with a resection in continuity with the resection of other organs; other), primary site (fundus; cardia; body; lesser; greater; gastric antrum; pylorus; other), grade (grade I; grade II; grade III; and grade IV), summary stage (localized; regional; and distant), 7th AJCC stage (I; II; III; IV), T stage (T1; T2; T3; and T4), N stage (N0; N1; N2 and N3), M stage (M0; M1), radiotherapy (yes or no), chemotherapy (yes or no), RNE and RNP, and tumor size. The follow-up data were used for overall survival (OS) and cancer-specific survival (CSS) analysis. All of thirteen prognostic factors (excluding T stage, N stage, and M stage) were included in univariate Cox regression analysis and then independent prognostic factors were obtained via multivariate Cox regression analysis based on the results of univariate Cox regression analysis (P < 0.05). Subsequently, the factors significantly associated with OS or CSS were selected to create the nomogram, while internal validation was conducted. Firstly, the performance of the nomogram was measured by calibration curves and the area under receiver operating characteristic (ROC) curve (AUC). Next, the predicative ability of the nomogram and the 7th AJCC TNM stage system was compared by C-index, net reclassification index (NRI), integrated discrimination improvement (IDI) and Decision curve analysis (DCA) curves. This study was performed under the guidance of the "TRIPOD" guideline14.

Statistical analysis

R software (version 4.2.1) was used to perform all statistical analyses. The two-sided P < 0.05 was set as the cutoff of significance.

Results

Clinical characteristics

A total of 1814 patients were included into OS analysis, which were randomly assigned training (n = 1270) and validation (n = 544) cohorts. The clinical characteristics of IGA patients were described below. In the training set, the median age at diagnosis was 72 years (range 18–98 years). There were 453 (35.7) female patients and 817 (64.3) male ones, among which white people accounted for 56.6%, while black people for 16.4% and other races for 27.0%. These patients were majorly married (59.1%), while 161 of them were single (12.7%) and 359 were of other marital status (28.3%). More than half these patients (62.6%) received near total or total gastrectomy. The primary sites were located in cardia/fundus (14.7%), gastric body (30.9%), gastric pylori (39.3%), and other part of stomach (15.1%). Moderate (47.4%) and worse (42.1%) differentiation were the commonest tumor grades, followed by well differentiation (9.5%) and undifferentiation (0.9%). The summary stage of IGA consisted of localized (40.9%), regional (46.9%) and distant cancer (12.2%). Most patients (33.3%) were clarified into stage I, 31.2% were stage III, 27.1% were stage II, and 8.4% were stage IV. Only 24.8% had the radiation record, while almost half of the patients (43.1%) had the chemotherapy record. The median RNE was 17 (range 1–76), while the median RNP was 1 (range 0–44) and median tumor size was 40 (range 1–165). In the validation set, patients displayed similar characteristics to those in the training cohort. On the other hand, there were 1513 patients in the study for CSS analysis with 1059 patients in the training set and the remaining 454 patients in the validation set. Both sets in CSS group also shared similar clinical characteristics. Table 1 summarized the clinic-pathological characteristics of patients in the OS and CSS groups.

Table 1 Patient characteristics in the study.

Nomogram construction

Two nomograms for IGA patients who receive surgery were established based on the variables screen from Cox analysis. After the multivariable Cox analysis, the outcomes in OS group revealed that the age at diagnosis, sex, race, surgery type, summary stage, grade, AJCC TNM stage, radiotherapy status, RNE and RNP can independently predict the OS of IGA patients who receive surgery (Table 2). In CSS group, the results from the multivariable Cox analysis demonstrated that the age at diagnosis, sex, race, surgery type, summary stage, grade, AJCC TNM stage, chemotherapy status, RNE and RNP are the independent risk factors of CSS for IGA patients who receive surgery (Table 3). All of these independent factors that were associated with OS and CSS were included in the prognostic nomogram created in this study (Fig. 2).

Table 2 Univariate and multivariate analysis of overall survival with IGA.
Table 3 Univariate and multivariate analysis of cancer-specific survival with IGA.
Figure 2
figure 2

Nomograms predicting 3-, 5- and 8-year OS (A) and CSS (B) of postoperative IGA patients. Note: 0, local tumor excision; 1, partial/subtotal/ hemi-gastrectomy; 2, near total or total gastrectomy; 3, gastrectomy with removal of a portion of esophagus; 4, gastrectomy with a resection in continuity with the resection of other organs; 5, other.

Validation of the nomogram for OS and CSS of IGA patients who receive surgery

Firstly, the calibration curves of these nomograms were established in OS and CSS group and the results displayed almost identical consistency of the actual likelihood with the predicted 3-, 5-, and 8-year probabilities in the training and validation set (Figs. 3, 4). Next, the results from the time-dependent AUC curves in the Cox models of OS and CSS group confirmed that AUCs were almost greater than 0.7 for the forecast of OS and CSS within eight years, suggesting the nomogram to be good discriminative ability (Figs. 5A,B and 6A,B). In addition, the AUCs of OS group in the training set, for predicting 1, 3, and 8 years were 0.788, 0.791, and 0.779, respectively (Fig. 5C). In the validation set, the AUCs at 1, 3, and 8 years were 0.787 0.813, and 0.802, respectively (Fig. 5D). Furthermore, the AUCs of the CSS group in the training cohort were 0.824 at 3 years, 0.832 at 5 years and 0.813 at 8 years, while in the validation cohort the AUCs were 0.828 at 3 years, 0.849 at five years and 0.820 at eight year (Fig. 6C,D).

Figure 3
figure 3

Calibration plots of the nomogram for 3-, 5- and 8-year OS prediction of the training cohort (AC) and internal validation cohort (DF).

Figure 4
figure 4

Calibration plots of the nomogram for 3-, 5- and 8-year CSS prediction of the training cohort (AC) and internal validation cohort (DF).

Figure 5
figure 5

Time-dependent AUC and receiver operating characteristic (ROC) curves of OS. (A,B) Time-dependent AUC of using the nomogram to OS probability within 8 years in the training cohort and validation cohort. The shading area between blue dotted curves represents 95% credible intervals. (C,D) ROC curves corresponding to 1-, 3-, and 8-year OS in the training and validation cohort, respectively.

Figure 6
figure 6

Time-dependent AUC and receiver operating characteristic (ROC) curves of CSS. (A,B) Time-dependent AUC of using the nomogram to CSS probability within 8 years in the training cohort and validation cohort. The shading area between blue dotted curves represents 95% credible intervals. (C,D) ROC curves corresponding to 1-, 3-, and 8-year CSS in the training and validation cohort, respectively.

Comparison of the values between nomograms and AJCC stage system

In order to further evaluate the clinical performance of our nomograms, their predictive capacity was directly contrast 7th AJCC TNM staging system for IGA following surgery. In OS group, the C-indexes for the nomogram in the training and validation sets (0.785 and 0.802, respectively) were larger compared to the 7th AJCC staging system (0.704 and 0.690). In CSS group, the nomogram of the training and validation cohorts (0.819 and 0.838, respectively) also had higher C-index than the 7th AJCC staging system (0.754 and 0.767). The NRI in the training set for the 3- , 5- and 8-year OS were 0.4882 (95% CI 0.3674–0.5965), 0.5262 (95% CI 0.4191–0.6501) and 0.5388 (95% CI 0.3878–0.7270), and the IDI values for the 3- , 5- and 8-year OS were 0.093 (95% CI 0.071–0.121, P < 0.001), 0.096 (95% CI 0.074–0.126, P < 0.001) and 0.130 (95% CI 0.074–0.181, P < 0.001) (Table 4). The outcomes of NRI and IDI from the contrasts between nomogram of CSS group and AJCC staging system similarly showed statistical significance (Table 5). Finally, the DCA analysis was performed to evaluate the 3-, 5-, and 8-year OS and CSS discrimination ability and the results are displayed in Figs. 7 and 8. The DCA plots showed good net benefits. All of these results were demonstrated in the validation set (Table 5), verifying the better predictive ability of our nomograms than the AJCC Stage System.

Table 4 Comparison of different models for estimating the overall survival of IGA patients.
Table 5 Comparison of different models for estimating the cancer-specific survival of IGA patients.
Figure 7
figure 7

Decision curve analysis of the nomogram in the estimation of OS of postoperative IGA patients. (AC) Training cohort. (DF) Validation cohort.

Figure 8
figure 8

Decision curve analysis of the nomogram in the estimation of CSS of postoperative IGA patients. (AC) Training cohort. (DF) Validation cohort.

Discussion

IGA is the most prevalent type of GC, which is obviously different in epidemiology, pathogenesis, prognosis, microscopic and gross appearance, and molecular characteristics from other subtypes (diffuse and intermediate type)15. For example, the incidence of diffuse-type GC was relatively higher in female and younger patients16. Importantly, a latest report indicated that for early-onset early-stage GC (diagnosed at < 50 years and limited to the mucosa or submucosa), the intestinal type showed more tight association with lymph node metastasis and worse prognosis compared to the diffuse type17. Thus, IGA is probably more noteworthy than other subtypes because of its’ higher incidence and worse prognosis. Up to now, the AJCC staging system is the widely accepted program for forecasting the prognosis of GC patients18. However, many crucial risk factors influence the OS and CSS of GC patients who received surgery as well, including age, sex, marital status, race, surgery type, primary tumor site, grade, summary stage, chemoradiotherapy and tumor size. Consequently, we constructed two nomograms to forecast the 3-, 5-, and 8-year OS and CSS of IGA patients who received surgery using the multi-center, multi-population, multi-ethnic data from the SEER database.

Our nomograms combined the AJCC staging system with basic demographics and other important oncology parameters. For all we know, these nomograms might be the first prognostic model for predicting the long-term OS and CSS (5 and 8 years) for postoperative IGA patients. In 2020, Chu et al. demonstrated that radiotherapy effectively improved the survival of patients with IGA via a SEER population‑based study19. In 2021, Tang et al. compared the difference of lymph node metastasis and prognosis between IGA and diffuse-type GC by screening SEER database as well17. Nevertheless, both studies did not try to construct a nomogram for the predication of prognosis of IGA patients. In the study, we used validation set of postoperative IGA patients from the same database to demonstrate the current nomograms. The results indicated that reliable nomograms for forecasting the 3-, 5-, and 8-year OS and CSS of postoperative IGA patients were successfully established based on good performance of nomogram validation in discriminative ability and calibration.

Several independent risk factors were incorporated into the established nomogram. The age at diagnosis is regarded as an important risk factor for prognosis of cancer patients, with survival being poorer in older patients20,21. The present study observed that the OS and CSS of postoperative IGA patients were negatively associated with age. Moreover, multivariate Cox analyses suggested that RNP and sex were statistical independent prognostic factors for the OS and CSS of postoperative IGA patients, and male patients had worse prognosis compared with female ones. According to the previous reports, race was tightly correlated with survival outcomes of GC patients. The black and white patients were indicated to have poorer prognosis than other races22. We also found that non-white or black seem to be a protective factor when compared to white or black. Currently, surgery is the only proven effective therapy for GC and intimately related to the prognosis of GC patients. This study revealed the association of the extent of surgical depletion with the prognosis of IGA patients. Furthermore, a higher AJCC stage was correlated with a worse OS and CSS, and compared with a distant summary stage, a localized summary stage was a protective factor for OS and CSS. Significantly, tumor differentiation degree was demonstrated to be associated with survival, and grade IV tumor (undifferentiated adenocarcinoma) was a risk factor for GC according to the precious results23. However, our multiple Cox analysis found that grade IV tumor (undifferentiated adenocarcinoma) was not obviously related to the OS and CSS in postoperative IGA patients. Dong and colleagues also obtained the same contradictory results24. We speculated that the small size of included patients with grade IV tumor may contribute to the conflicting phenomenon.

Other expected and noteworthy factors is radiotherapy and chemotherapy. Currently, an increasing number of studies demonstrated the role of radiotherapy in the treatment of GC patients. In the investigation of the effect of surgery plus postoperative chemoradiotherapy on the prognosis of R0 resected GC patients, Macdonald et al. found that postoperative chemoradiotherapy can prolong the median OS (from 27 to 36 months)25. Moreover, Shridhar et al. used the SEER database to investigate the effect of radiation and/or surgery on OS of patients with metastatic GC26. They demonstrated that radiation was correlated with prolonged OS in metastatic GC patients treated with surgery. Chu et al. also validated the benefits of radiation on the survival of IGA patients in a SEER population‑based study19. In the current study, we got the similar results that radiotherapy is an independent protective factor for OS of postoperative IGA patients. On the other hand, several clinical randomized controlled trials revealed the benefit of chemotherapy in advanced GC patients27,28,29. Importantly, Cheng et al. used their established GC database to evaluate the efficacy of oxaliplatin-based adjuvant chemotherapies in patients with distinct Lauren type GC after D2 gastrectomy30. Their results indicated that oxaliplatin-based adjuvant chemotherapy can obviously prolong the median disease-free survival of patients with IGA (from 18.33 months to 48.73 months). Similarly, in this predicative nomogram, we found that chemotherapy also was a statistical independent factor for CSS of postoperative IGA patients.

The included risk factors in our constructed nomograms are readily available in clinical historical records. In order to validate the accuracy of the predictive nomograms, calibration and time-dependent AUC curves were depicted. In our nomogram models, the AUC values were high (> 0.7), confirming the good discriminative ability of the models. Furthermore, we calculated and depicted the C-index, IDI, NRI, and DCA to further estimate whether the prognostic nomograms outperformed the traditional AJCC staging system. The C-index of our constructed nomograms was better than those of the AJCC staging system, demonstrating their good discrimination ability. The IDI and NRI are two more sensitive and precise indicators compared with C-index and AUC and their results reinforces the conclusion above. The IDI verified the preferably discriminative ability of the predicative models than the AJCC staging system, while the NRI suggested that the constructed model performed better than the AJCC staging system in terms of reclassifying the risk probabilities. The benefits of DCA have been reported by numerous precious studies31,32,33. In the training set and validation set in the current study, the 3-, 5-, and 8-year DCA curves showed larger net benefits than that of the traditional AJCC staging system.

Despite the nomogram performed well in predicting OS and CSS, some weaknesses of this study should be noticed. Firstly, the patient’s information collected from the SEER database, including specific radiotherapy and chemotherapy regimens, is insufficient, which probably influenced the obtained results. Secondly, the multivariate Cox regression analysis revealed grade IV tumor did not appear to be an important factor for prognosis, which obviously contradicted the clinical practice. Thirdly, other potentially important factors that could affect the survival of IGA (such as diet, smoking and alcohol consumption) were not included and thus the nomograms should be improved through further clinical trial. Fourthly, the AJCC 7th edition of the TNM stage system was used in our study for some reasons. It is widely known that the 7th edition has some shortcomings (e.g., it did not incorporate the pN3b category and its’ additional stage subgroups [stages IIB and IIIC] cannot improve predictive performance in stage-based prognosis) and thus had been replaced by 8th edition, so it's use cannot reflect real-world situations in clinic and will influence the accuracy of our model. Thus, the utilize of the 7th edition may influence the prediction results of the model. Finally, the lack of external validation by another real world weakens the reliability of the constructed models.

In conclusion, the results demonstrate that the construction of novel nomograms for forecasting OS and CSS in postoperative IGA patients is successful. The constructed nomograms not only have better predicative ability than that of the 7th AJCC staging system alone, but the indicators in the models are also routinely assessed and readily accessible in the real-world clinic. Therefore, the nomogram will assist clinicians in making personalized survival predictions and providing optimal treatment strategies for IGA patients.