Introduction

The incidence of gastroesophageal junction adenocarcinoma (GEJA) has increased markedly in Western countries over the past few decades1,2. Gastroesophageal junction cancer is a cancer in which the center of the tumor is situated in the gastroesophageal junction region. The widely used categorization of gastroesophageal junction cancer in Western countries is the Siewert classification3. The Siewert classification applies only to adenocarcinomas situated within 5 cm above or below the gastroesophageal junction. The Siewert classification divided GEJA into three types: type I is located about 1–5 cm above the esophagogastric junction; type II is located between 1 cm above and 2 cm below the esophagogastric junction; type III is located about 2–5 cm below the esophagogastric junction4,5. Cells at the gastroesophageal junction have histological features of both esophageal and gastric cells6. Therefore, its histological origin and appropriate treatment remain controversial7,8. In clinical practice, type I and type III GEJA are often treated and staged with reference to esophageal and gastric cancers9. Siewert type II GEJA is located along the borderline between the mediastinum and abdomen, they can metastasize to both thoracic and abdominal cavities. Therefore, the prognosis or metastasis of Siewert type II GEJA could be significantly different from other types of GEJA10,11,12. A previous study found that the poor prognosis of GEJA was largely attributable to early and frequent metastases13. Gastroesophageal junction carcinoma patients have a poor prognosis after metastases, with a 5-year survival rate of about 11%14. A population-based study showed that the liver is the most common site of metastasis for Siewert type II GEJA15. Consequently, it has important clinical value to construct a predictive model that can be applied to predict the risk of liver metastases from GEJA. This study tried to construct and validate a nomogram based on the SEER database for predicting the likelihood of liver metastases from Siewert type II GEJA.

Methods

Patients

We screened 3001 GEJA patients newly diagnosed between 2010 and 2015 from the SEER database who met our inclusion criteria, of which 281 developed liver metastases. The exclusion process is shown in Fig. 1. Patients included in this study must meet the following criteria: (1) Tumor size does not exceed 100 mm (2) First malignant primary indicator (3) Age between 19 and 84 (4) The pathological type is adenocarcinoma. The exclusion criteria for GEJA patients were as follows: (1) Incomplete clinical and pathological features. (2) Patients who were identified by autopsy. We extracted race, gender, year of diagnosis, T stage, N stage, tumor size, age, and bone/brain/liver/lung metastases, as well as other follow-up data, from the SEER database. This study adopted the 7th edition of the American Joint Committee on Cancer (AJCC) TNM staging. All data used in this study were anonymized and de-identified from the SEER database. Therefore, approval by an institutional review board is not required, nor is informed consent of all subjects and/or their legal guardian(s). All methods of this study were performed in accordance with the relevant regulations and guidelines.

Figure 1
figure 1

Patient screening flowchart. This figure contains how we screened 3001 Siewert type II gastroesophageal junction adenocarcinoma patients from the SEER database.

Statistical analysis and optimal cutoffs

We used x-tile v3.6.1 (Yale University) software to determine optimal cutoff values for tumor size and age16. By using X-tile, we transform all continuous variables into categorical variables. We used Fisher's exact test or chi-square to compare differences in categorical variables. To balance differences in other factors between GEJA patients with liver metastases and those without liver metastases, we performed a 1:3 PSM in R software v4.3.0. The Kaplan–Meier survival curves were applied to evaluate the difference in survival time between patients with liver metastases and those without liver metastases before and after PSM. We randomly divided the 3001 GEJA patients into a training group (n = 2101) and an internal validation group (n = 900) in a 7:3 ratio by R software. We performed univariate and multivariate logistic regression, from which we created a nomogram for predicting the possibility of liver metastases. We use ROC curve, C-index, calibration curve, and decision curve analysis to verify its validity. All statistical analyses were carried out using GraphPad Prism v8.0.2 (GraphPad Software, Inc.), SPSS v26.0 (SPSS Inc.), and R software v4.1.3 (https://www.r-project.org/). The difference is statistically significant at P < 0.05.

Results

Characteristics of GEJA patients

We included 3001 patients with GEJA diagnosed between 2010 and 2015 in this retrospective study, with 9.3% (n = 281) had liver metastases, 3.5% (n = 106) had lung metastases, 2.5% (n = 77) had bone metastases, 0.4% (n = 14) had brain metastases. Table 1 summarizes patient characteristics before and after PSM. As shown in Table 1, most of the variables have been balanced after PSM.

Table 1 Characteristics of all 3001 patients before and after PSM.

Survival analysis of liver metastases from gastroesophageal junction adenocarcinoma

Using R software v4.3.0, we performed a 1:3 propensity score matching of patients with gastroesophageal junction adenocarcinoma based on the presence or absence of liver metastases, and finally, 221 patients who had liver metastases were matched with 471 patients without liver metastases. The median follow-up for the pre-and post-PSM cohorts was 22 months (interquartile range: 10–44 months) and 11 months (interquartile range: 5–22 months), respectively. As shown in Table 1, other variables had been largely balanced. 1952 (65.0%) and 603 (87.1%) patients died during follow-up in the pre-and post-PSM cohorts, respectively. GEJA patients with liver metastases and those without liver metastases had median OS of 8.0 (95% CI 6.7–9.3) months and 30.0 (95% CI 27.7–32.3) months, respectively, in the pre-PSM cohort. (Fig. 2a) In the post-PSM cohort, they were 9.0 months (95% CI 6.5–9.5) and 14.0 months (95% CI 10.3–15.1), respectively. (Fig. 2b).

Figure 2
figure 2

Kaplan–Meier curves of the pre- and post-PSM cohort. Kalpan–Meier curves of (a) the pre-PSM cohort (b) the post-PSM cohort.

The diagnostic likelihood of liver metastases in GEJA patients

We randomly divided the patients into a training cohort and an internal validation cohort with an allocation of 7:3 ratio by R software v4.3.0. More information about the training and validation cohorts is shown in Table 2. As shown in Table 3, we performed univariate and multivariate logistic regression in the training cohort using SPSS v26.0 and finally identified age, T stage, bone metastasis, lung metastasis, and tumor size as risk factors for liver metastasis in GEJA patients.

Table 2 Patient characteristics of the training and validation cohorts.
Table 3 Univariate and multivariate logistic regression for analyzing associated factors for developing liver metastases.

Construction and validation of a predicted nomogram

Based on the risk factors for liver metastases identified by multivariate logistic regression, we created a nomogram to predict the risk of liver metastases in GEJA patients. (Fig. 3). The C-index in the training cohort is 0.816 and the C-index in the validation cohort is 0.771, which indicates that the prediction model has good discriminative ability as well as accuracy. As shown in Fig. 4, the AUC values of the nomogram in the training and validation cohorts are 0.816 and 0.771, respectively, which reflect the good predictive ability of our constructed prediction model. The calibration curves for both the training cohort (Fig. 5a) and the validation cohort (Fig. 5b) showed a good correlation between the predicted possibility of liver metastases and actually diagnosed liver metastases. A DCA was performed to determine the clinical utility of the nomogram. Figure 6 showed good positive net benefit in both the training cohort (Fig. 6a) and the internal validation cohort (Fig. 6b), indicating the good clinical applicability of the nomogram in predicting the presence of liver metastases in patients with adenocarcinoma of the gastroesophageal junction.

Figure 3
figure 3

Nomogram to predict the risk of liver metastases in gastroesophageal junction adenocarcinoma patients. From this nomogram, we can determine the risk of liver metastasis of gastroesophageal junction adenocarcinoma.

Figure 4
figure 4

ROC curves of the nomogram. ROC curves of the nomogram in the training cohort (a) and the internal validation cohort (b).

Figure 5
figure 5

calibration curves of the nomogram. The calibration curves of the nomogram for gastroesophageal junction adenocarcinoma patients in the training cohort (a), and the internal validation cohort (b).

Figure 6
figure 6

Evaluation of decision analysis (DCA) curves for nomination charts decision analysis curves (DCA) for patients with adenocarcinoma of the gastroesophageal junction in the training cohort (a) and the internal validation cohort (b).

Discussion

The survival of patients with metastatic Siewert type II GEJA is influenced by multiple factors, such as pathological type, age, metastatic pattern, degree of differentiation, and treatment17,18. The liver is the most ordinary site of distant metastases in patients with Siewert type II GEJA, and it is also the most prognostic factor in all distant metastases19. The liver is also the most ordinary metastatic site of esophageal and gastric cancers20,21. Once metastasis occurs, the prognosis of GEJA patients will be poor. Surgery is the most common method of treatment for GEJA. However, once liver metastasis occurs, the comprehensive benefit of surgery for patients will be very low, and radiotherapy and chemotherapy will become the first treatment for patients. We found that 9.3% (n = 281) of GEJA patients included in this study developed liver metastases, and 18.3% (n = 551) of GEJA patients developed distant metastases. The survival rate of GEJA patients with liver metastases was obviously lower than those without liver metastases. For patients with liver metastases, early detection and early treatment can greatly improve their survival time and quality of life. Ultrasound, CT, and PET-CT are commonly used methods to detect distant metastases in GEJA, and PET-CT can accurately exclude distant metastases22,23. However, it has been shown that 15–20% of new esophageal cancers have distant metastases that are not identified by CT, the sensitivity and specificity of detection of distant metastasis by CT are 52% and 91%, respectively24,25. In addition, these imaging tests have radiation hazards and high prices, and not every patient can afford to undergo long-term or frequent imaging tests. Therefore, constructing a nomogram to predict the risk of liver metastases in GEJA patients can better guide clinical practice. Nomograms have long been widely used in oncology because they can provide visual predictions of patient outcomes based on relevant clinical variables26. The results obtained through data analysis can demonstrate the high predictive performance as well as clinical utility of nomograms, which can reduce examination costs on the one hand and avoid radiation hazards on the other. Besides, nomograms can be used as an early and low-cost screening tool for tumor metastasis, which can be a very meaningful guide for tumor diagnosis, treatment, and prognosis. Several nomograms have previously been used to predict prognosis in patients with gastroesophageal junction cancer. However, a nomogram to predict the likelihood of liver metastases from gastroesophageal junction adenocarcinoma has not been constructed6,27,28. Therefore, based on the SEER database, we constructed a nomogram that can be used to predict the likelihood of liver metastases in GEJA patients.

In this retrospective study, a nomogram that can predict the risk of liver metastases resulting from GEJA was constructed. And its accuracy was verified by the ROC curve, C index, and calibration curve. Through univariate and multivariate logistic regression, we finally identified age, tumor size, N stage, T stage, bone metastases, and lung metastases as factors affecting liver metastases in gastroesophageal junction adenocarcinoma.

We found that the likelihood of liver metastases in GEJA patients decreased with increasing age. A previous study found that colorectal cancer incidence increases with age, but metastatic spread decreases with age29,30. And they further analyzed the possible mechanism and believed that this conclusion is the result of the interaction between the tumor microenvironment, tumor biology, the immune system, and the genome30. This is consistent with our conclusion that age is an important factor affecting cancer spread, and the risk of cancer metastasis decreases with age.

We found that tumor size was also an important factor affecting the happening of liver metastases in GEJA patients, and the larger the tumor size, the higher the risk of liver metastases. It has been previously reported that the risk of lymph node metastasis increases with increasing tumor size in patients with Siewert type II T1-T3 GEJA31. It can be seen that tumor size is related to tumor invasion, which is further demonstrated by our findings.

As shown in Fig. 3, N staging is the least influential among the many factors that affect the occurrence of liver metastases in GEJA. Previous studies have found that in non-small cell lung cancer, the rate of multiorgan metastases increases with increasing N stage32. Therefore, for GEJA patients, a higher N stage is linked with a higher risk of liver metastases. The mechanism behind it remains to be revealed in future studies.

T staging of gastroesophageal junction malignancies is based on the degree of invasion. However, to our surprise, we found that the higher the T stage, the lower the risk of liver metastases in GEJA patients. We also found that the risk of liver metastases in gastric cancer decreased with increasing T stage. Nevertheless, more evidence is needed to confirm the relationship between the T stage and liver metastases in GEJA patients.

Malignant tumors can metastasize to other organs in the body through blood spread. Therefore, it is not surprising that there is a correlation between bone metastases and lung metastases, and liver metastases. However, we found no correlation between brain and liver metastases in our study (P = 0.108). We believe this is due to the low number of patients with brain metastases in the patients included in this study (n = 14, 0.4%).

Therefore, age, tumor size, T stage, N stage, lung metastasis, and brain metastasis all affect the occurrence of liver metastases in GEJA patients. The nomogram we constructed can accurately predict the likelihood of liver metastases in GEJA patients and better guide clinical practice.

Conclusion

The survival time of GEJA patients with liver metastases was obviously lower than that of GEJA patients without liver metastases. The nomogram model developed in our study can precisely predict the possibility of GEJA patients with liver metastases.