Integrative proteomic characterization of adenocarcinoma of esophagogastric junction

Li, Shengli; Yuan, Li; Xu, Zhi-Yuan; Xu, Jing-Li; Chen, Gui-Ping; Guan, Xiaoqing; Pan, Guang-Zhao; Hu, Can; Dong, Jinyun; Du, Yi-An; Yang, Li-Tao; Ni, Mao-Wei; Jiang, Rui-Bin; Zhu, Xiu; Lv, Hang; Xu, Han-Dong; Zhang, Sheng-Jie; Qin, Jiang-Jiang; Cheng, Xiang-Dong

doi:10.1038/s41467-023-36462-8

Download PDF

Article
Open access
Published: 11 February 2023

Integrative proteomic characterization of adenocarcinoma of esophagogastric junction

Shengli Li ORCID: orcid.org/0000-0001-5430-303X^1,2^na1,
Li Yuan ORCID: orcid.org/0000-0002-6245-9437^1,3,4^na1,
Zhi-Yuan Xu^1,3,4,
Jing-Li Xu⁵,
Gui-Ping Chen⁶,
Xiaoqing Guan¹,
Guang-Zhao Pan¹,
Can Hu⁵,
Jinyun Dong¹,
Yi-An Du^1,3,4,
Li-Tao Yang^1,3,4,
Mao-Wei Ni¹,
Rui-Bin Jiang¹,
Xiu Zhu¹,
Hang Lv ORCID: orcid.org/0000-0002-9655-1573⁷,
Han-Dong Xu⁵,
Sheng-Jie Zhang¹,
Jiang-Jiang Qin ORCID: orcid.org/0000-0002-8559-616X^1,3,4 &
…
Xiang-Dong Cheng ORCID: orcid.org/0000-0003-1470-2831^1,3,4

Nature Communications volume 14, Article number: 778 (2023) Cite this article

6535 Accesses
11 Citations
Metrics details

Subjects

Abstract

The incidence of adenocarcinoma of the esophagogastric junction (AEG) has been rapidly increasing in recent decades, but its molecular alterations and subtypes are still obscure. Here, we conduct proteomics and phosphoproteomics profiling of 103 AEG tumors with paired normal adjacent tissues (NATs), whole exome sequencing of 94 tumor-NAT pairs, and RNA sequencing in 83 tumor-NAT pairs. Our analysis reveals an extensively altered proteome and 252 potential druggable proteins in AEG tumors. We identify three proteomic subtypes with significant clinical and molecular differences. The S-II subtype signature protein, FBXO44, is demonstrated to promote tumor progression and metastasis in vitro and in vivo. Our comparative analyses reveal distinct genomic features in AEG subtypes. We find a specific decrease of fibroblasts in the S-III subtype. Further phosphoproteomic comparisons reveal different kinase-phosphosubstrate regulatory networks among AEG subtypes. Our proteogenomics dataset provides valuable resources for understanding molecular mechanisms and developing precision treatment strategies of AEG.

Large-scale and high-resolution mass spectrometry-based proteomics profiling defines molecular subtypes of esophageal cancer for therapeutic targeting

Article Open access 16 August 2021

Integrative proteogenomic characterization of early esophageal cancer

Article Open access 25 March 2023

Multilevel proteomic analyses reveal molecular diversity between diffuse-type and intestinal-type gastric cancer

Article Open access 14 February 2023

Introduction

Adenocarcinoma of the esophagogastric junction (AEG) generally refers to the adenocarcinoma that occurs in the esophagogastric junction within the range of 5 cm in both directions^1,2. More than 1.5 million patients suffer from AEG each year^3,4. AEG tumors are anatomically classified into three types⁵: Siewert type I, tumors with an epicenter of 1–5 cm above the esophagogastric junction (EGJ); Siewert type II, tumors within 1 cm above and 2 cm below the EGJ; and Siewert type III, tumors within 2–5 cm below the EGJ. AEG is obviously different from gastric cancer in epidemiology, etiology, and pathological characteristics. The incidence rate of AEG has increased year by year, while that of gastric antral carcinoma has decreased significantly^6,7. According to the Lauren classification, the intestinal type was most common in AEG, and intestinal metaplasia led by gastroesophageal reflux disease (GERD) is the main risk factor for AEG^8,9. However, there are more diffuse type cases of gastric antrum carcinoma, and chronic atrophic gastritis is an important precancerous lesion of gastric antrum carcinoma⁹. In addition, Helicobacter pylori (H. pylori) infection is a recognized carcinogenic factor of gastric antrum cancer. Cytotoxigenic associated gene A (CagA) in H. pylori may significantly increase the risk of atrophic gastritis and gastric antrum cancer, but its role in AEG is controversial¹⁰. Some studies have shown that H. pylori infection can prevent GERD, Barrett’s esophagus and other reflux diseases, thus reducing the incidence of AEG to a certain extent¹¹. Currently, comprehensive treatment, including surgical resection, chemotherapy, and immunotherapy, is the most effective treatment for AEG. However, most AEG patients have locally advanced tumors or distant metastasis at diagnosis and are ineligible for surgery¹². Targeted therapies are only for patients with late-stage metastatic HER2-positive tumors, and the benefited population is very limited^12,13. With the use of PD1/PD-L1 inhibitors, the immunotherapy of AEG has made significant progress. However, due to the heterogeneity and complexity of the immune microenvironment, immunotherapy still has many challenges, such as hyperprogression¹⁴. Therefore, it is necessary to better understand the molecular mechanisms underlying AEG carcinogenesis and to identify potential prognostic indicators and drug targets.

Genomic interrogations in AEG have revealed that most AEG tumors are characterized by focal copy number variations (CNVs)^15,16. These focal CNVs are thought of as tumorigenic factors that promote chromosomal instability in AEG tumors. The TCGA Research Network analyzed 295 primary gastric adenocarcinomas using six molecular platforms, including array-based somatic copy number analysis, whole-exome sequencing, array-based DNA methylation profiling, messenger RNA sequencing, microRNA (miRNA) sequencing, and reverse-phase protein array (RPPAR)¹⁵. They classified gastric cancer into for subtypes: tumors positive for Epstein–Barr virus; microsatellite unstable tumors; genomically stable tumors; tumors with chromosomal instability, which was mainly dependent on genomics data. Cristescu et al. used transcriptomics data to describe four molecular subtypes of gastric cancer, including the mesenchymal-like type, microsatellite-unstable type, and the tumor protein 53 (TP53)-active and TP53-inactive types¹⁷. The subtyping was primarily based on gene expression signatures. Other studies related to AEG subtyping based on omics data mainly including genomics and transcriptomics data^{15,16,18,19,20,21}. In addition to various post-translational modifications, genomic changes are supposed to be translated into protein-level alterations to affect phenotypes^22,23. Increasing attention has been given to the application of proteomics and various modified proteomics approaches in the molecular typing of tumors. Multiple studies have included mass spectrometry (MS)-based proteomics analyses of various cancers, including brain cancer^24,25, gastrointestinal cancer^18,26,27, breast cancer²⁸, lung cancer^29,30,31,32, and liver cancer^33,34. These studies have revealed that proteomic signatures can provide complementary information for patient stratification and can better identify potential drug targets and disease markers. Proteomic analysis, integrated with other types of omics data, may help advance our understanding of the molecular mechanism of AEG carcinogenesis and the development of therapeutic drugs for AEG patients.

In this work, we perform comprehensive genomic, transcriptomic, proteomic, and phosphoproteomic analyses of tumor tissues and paired normal adjacent tissues (NATs) derived from 103 AEG patients. We describe integrative proteogenomic analyses of a large cohort of AEG samples and focus particularly on the clinically actionable insights revealed in the proteome and phosphorylation modifications. Based on proteomic data, we identify three different AEG subtypes that exhibit clearly significant differences in clinical and molecular features. Our study may improve current knowledge about AEG and contribute to its diagnosis, prognosis evaluation, and drug development.

Results

Molecular landscape of AEG tumor samples

To characterize a comprehensive molecular landscape in AEG tumors, we applied multi-omics profiling to the paired tumor and NAT samples from 103 patients (Supplementary Data 1), including proteomics profiling, phosphoproteomics profiling, WES, and RNA-seq (Fig. 1a). In particular, proteomics and phosphoproteomics profiling were performed on 206 samples. Of these 206 samples, 188 had been analyzed for WES, and 166 had corresponding RNA-seq data. In total, 30,053 non-synonymous single-nucleotide variants (SNVs) were identified in 94 AEG patients (Supplementary Data 2). In the present AEG cohort, the most frequently mutated cancer-related genes (derived from COSMIC v95)³⁵ were TP53 (62%), MUC16 (31%), FAT4 (22%), LRP1B (18%), ARID1A (16%), and FAT3 (16%) (Fig. 1b). We reviewed the gastroesophageal locations of cancer and retrieved 129 samples that were regarded as AEG in the TCGA esophageal and gastric carcinoma cohort³⁶. The most frequent genomic alterations in the TCGA AEG cohort were captured in our cohort (Supplementary Fig. 1a). Of note, 9 of top 10 mutated genes in our cohort were among the top mutated genes of the TCGA cohort. Genes with top 20 frequent CNVs in the TCGA cohort were also found to be frequently altered in our cohort (Supplementary Fig. 1b). The most frequent nucleotide variant across 103 AEG patients was C > T (16.7%). AEG patients of older age were found to harbor higher tumor mutation burdens (TMB) (P = 0.045, Wilcoxon rank sum test), while other clinicopathological features showed no obvious association with the TMB (Supplementary Fig. 2). Proteomics and phosphoproteomics data showed consistent quality across 206 samples (Supplementary Fig. 3a, b). In addition, principal component analysis of 206 proteomes showed clear divergence between AEG tumor and NAT samples and also showed heterogeneity among tumor samples (Supplementary Fig. 3c). On average, 8885 proteins (Fig. 1c) and 8445 phosphorylation sites (Fig. 1d) were identified from the 206 proteomes and phosphoproteomes of 103 AEG patients. From the RNA-seq data, 23,131 genes were found to be expressed in 166 AEG tumor and NAT samples on average (Fig. 1e). Overall, significantly more proteins (P = 3.8E−15, Wilcoxon rank sum test), phosphorylation sites (P = 1.6E−4, Wilcoxon rank sum test), and genes (P < 2.2E−16, Wilcoxon rank sum test) were detected in AEG tumors than in NAT samples (Supplementary Fig. 4). This observation indicates that compared with NATs, AEG tumors might show abnormally higher molecular activity. In summary, our multi-omics profiling presented a comprehensive molecular atlas of AEG.

**Fig. 1: Multi-omics landscape of adenocarcinoma of the esophagogastric junction (AEG).**

Proteomic characteristics of AEG tumors

We next investigated the disturbance of proteins in AEG tumors. Differential protein analysis revealed 2,300 upregulated and 1667 downregulated proteins in AEG tumor samples compared to paired NAT samples (Fig. 2a and Supplementary Data 3). The upregulated proteins were significantly enriched in genome regulation and instability-related biological processes, such as “spliceosome” and “DNA replication”, while downregulated proteins were more enriched in metabolism-related processes, such as “oxidative phosphorylation” and “carbon metabolism” (Fig. 2b). Furthermore, the overall protein-level integrated abundances of fifty hallmark biological processes were evaluated in each sample (see Methods). Most of the hallmarks (36 out of 50, 72%) showed significantly distinct integrated abundance between paired tumor and NAT samples (Fig. 2c). For example, the “apical junction” hallmark gene set was remarkably upregulated (P = 2.40E−16), whereas the “KRAS signaling up” hallmark gene set was significantly downregulated (P = 1.1E−3) in tumor samples (Fig. 2d). Higher integrated abundances of the “apical junction” hallmark gene set indicate a worse prognosis (P = 0.016, log-rank test), while the higher integrated abundance of “KRAS signaling up” indicated a longer overall survival time in AEG patients (P = 0.0033, log-rank test) (Fig. 2e). These results revealed extensive dysregulation of hallmark biological processes in AEG tumors, which also showed clinical significance. To examine whether these differentially expressed proteins (DEPs) were targeted by FDA-approved drugs or candidate anti-cancer compounds in clinical trials, we screened datasets of the Genomics of Drug Sensitivity in Cancer (GDSC)³⁷, Cancer Therapeutics Response Portal (CTRP)³⁸, and Broad Institute Drug Repurposing project³⁹. Of these DEPs, 252 were found to be targeted by FDA-approved drugs or candidate drugs that are currently under clinical trials (Supplementary Data 4 and Supplementary Fig. 5a). For example, the AHR protein, which could be inhibited by flutamide, was significantly upregulated in tumor samples (Supplementary Fig. 5b). AEG patients with high AHR protein levels showed markedly shorter (P = 6.7E−3, log-rank test) overall survival times than those with low levels (Supplementary Fig. 5c). To identify proteins that may play crucial roles in AEG, and can be potential drug targets, we constructed the protein-protein interactions (PPI) network of DEPs (see Methods). A PPI network of 3923 nodes and 79,088 edges was obtained (Supplementary Fig. 6a). The network topology was further analyzed to identify hub proteins, including the degree, closeness and betweenness (Supplementary Fig. 6b–d and Supplementary Data 5). To further optimize the list of protein candidates, we mapped the top 50 DEPs with the top 50 proteins with the largest degree, closeness, or betweenness, some of which were also found to be targeted by known anti-cancer compounds, such as HDAC1, HSP90AA1, and TP53 (Fig. 2f). Our analysis presented a comprehensive view of proteomic alterations in AEG tumors, and further investigation on their functions and molecular mechanisms in AEG may provide promising drug targets for this disease.

**Fig. 2: Proteomic variations in AEG tumors.**

Proteomics-based subtyping of AEG tumors

The proteomic heterogeneity among tumor samples inspired us to explore AEG subtypes based on proteomics data. A NMF algorithm was employed to cluster AEG tumor samples by using proteomics data (see Methods). Three different subtypes were identified, with 40 samples in the S-I subtype, 23 samples in the S-II subtype, and 40 samples in the S-III subtype (Fig. 3a and Supplementary Data 6). Clinicopathological characteristics, including age, sex, smoking, alcohol, Siewert type and tumor stage, exhibited no significant differences between these three AEG subtypes except for age and Siewert type. The S-I subtype was significantly associated with older age (75% ≥65 years old, P = 0.0093, Fisher’s exact test). The Siewert type II patients were more enriched in the S-I subtype, while the S-III subtype had many more Siewert type III patients (P = 0.011, Fisher’s exact test). Patients in these three subtypes showed significantly distinct overall survival times (P = 0.0011, log-rank test), wherein S-III patients had the longest survival time and S-I patients had the shortest survival time (Fig. 3b). The proteomics-based AEG subtyping remained an independent prognostic factor when adjusted for other clinicopathological characteristics in multivariate Cox regression analysis (P = 0.002, Supplementary Fig. 7). The top mutated genes showed clear distinctions among these three subtypes (Supplementary Fig. 8a-c). We next compared gene mutation frequencies among these three subtypes and found 97, 143, and 29 specifically mutated genes in the S-I, S-II, and S-III subtypes, respectively (Fig. 3C and Supplementary Data 7). For example, LEPR mutation was most common in the S-I subtype (OR = 20.1, P = 2.8E−4, Fisher’s exact test), NCKAP1 mutation was most common in the S-II subtype (OR = 10.5, P = 5.8E−3, Fisher’s exact test), and WIZ mutation was most common in the S-III subtype (OR = 10.0, P = 7.5E−3, Fisher’s exact test) (Supplementary Fig. 8d). To further integrate the genomics and proteomics data, we examined how subtype-specific mutations influence proteins (Supplementary Fig. 9 and Supplementary Data 8). The consequence of mutation on protein was evaluated by comparing the T/N (tumor/normal) values between mutation and wild-type samples as described in a previous study³². For each mutated gene, we examined changes of all the possible proteins. We identified 65,184, 3900, and 1146 significant mutation-to-protein associations in the S-I subtype, S-II subtype, and S-III subtype, respectively (Supplementary Fig. 9a). In all three subtypes, over 60% are negative associations, i.e., most mutations directly or indirectly led to the decrease of protein levels. We showed the top five mutation-protein associations of the top five mutated genes in Supplementary Fig. 9b–d. Although tumor samples exhibited dysregulation of integrated protein abundance of hallmarks in all subtypes, samples in the S-II subtype showed a decreased degree of change (Fig. 3d). The S-III subtype not only displayed a higher degree of dysregulation in tumor samples but also showed a substantial difference in abundance than the S-I and S-II subtypes. For example, the integrated abundance of the “G2M checkpoint” hallmark in the S-III subtype was significantly greater than that in the other two subtypes (P = 1.7E−3 compared to S-I subtype, P = 1.2E−4 compared to S-II subtype, Student’s t test) (Fig. 3e), while “pancreas beta cells” showed markedly lower levels in the S-III subtype (P = 1.7E−2 compared to S-I subtype, P = 4.3E−2 compared to S-II subtype, Student’s t test) (Fig. 3f). To further investigate the protein features in specific subtypes, we identified subtype signature proteins that showed subtype-specific high expression patterns (see “Methods”). Briefly, the expression levels of signature proteins in specific subtypes were significantly higher in tumor samples than in all NAT samples and tumor samples of the other subtypes. Our analysis found 36, 54, and 10 signature proteins in the S-I, S-II, and S-III subtypes, respectively. Of these, 12 signature proteins showed a significant association with patient survival time in the univariate Cox regression analysis (Fig. 3g). Seven of these 12 signature proteins showed significant prognostic associations in AEG patients (Supplementary Fig. 10). In the multivariate Cox regression analysis, FBXO44 was the most unfavorable risk factor according to the risk score, while PKD2 and CD3D were potent favorable factors. In summary, our proteomics analysis identified three different AEG subtypes that exhibited molecular and clinical distinctions.

**Fig. 3: Proteomic subtyping of AEG tumors.**

FBXO44 promotes AEG tumor progression and metastasis

In the multivariate Cox regression analysis above, FBXO44 showed a significantly high unfavorable risk score (Fig. 3g), which was a valuable candidate for further investigation. FBXO44 is a member of the F-box protein family that has been shown to play roles in human cancers⁴⁰. The FBXO44 gene showed significant dysregulation in eight of 18 different tumor types from TCGA cohorts (Supplementary Fig. 11a). FBXO44 showed upregulation in colon cancer but showed no significant expression change in stomach cancer. The FBXO44 protein exhibited significantly higher abundance in S-II AEG tumor samples than in S-II normal samples (P = 1.1E−4, Student’s t test), S-I tumor samples (P = 2.3E−3, Student’s t test), and S-III tumor samples (P = 5.3E−4, Student’s t test) (Fig. 4a). The upregulation of FBXO44 protein in tumor samples was further validated in an independent clinical cohort of 251 AEG patients (P = 1.55E−4, Student’s t test) (Fig. 4b and Supplementary Fig. 11b). Our analysis in this cohort found that FBXO44 was significantly associated with distant metastasis (χ² = 6.19, P = 0.013) and advanced TNM stage (χ² = 8.95, P = 0.030) of AEG tumor (Supplementary Fig. 12). Furthermore, we also assessed the association between FBXO44 protein level and all other available clinicopathological features of AEG patients (Supplementary Data 9). In addition to distant metastasis and advanced TNM stage, FBXO44 was found to be highly associated with older age (χ² = 5.507, P = 0.019) and high AFP level (χ² = 14.489, P < 2.00E−16). AEG patients with high levels of FBXO44 showed significantly shorter survival times than those expressing low levels of FBXO44 in both the present cohort (P = 1.5E−2, log-rank test) (Fig. 4c) and the other independent clinical cohort of 251 AEG patients (P = 7.0E−3, log-rank test) (Fig. 4d). To further confirm the role of FBXO44 in AEG, we performed overexpression (OE) and knockdown (KD) of FBXO44 in two different AEG cell lines, OE19 and SK-GT-4. In OE19 and SK-GT-4 cells, FBXO44 OE promoted cell proliferation by 1.79-fold (P = 0.031) and 1.48-fold (P = 0.029) (Fig. 4e and Supplementary Fig. 11c), increased cell invasion by 1.68-fold (P = 0.032) and 2.18-fold (P = 0.035) (Fig. 4f and Supplementary Fig. 11d), and enhanced cell migration by 2.13-fold (P = 0.004) and 1.18-fold (P = 0.018) (Fig. 4g and Supplementary Fig. 11e), respectively, compared to control cells. In contrast, FBXO44 KD inhibited cell proliferation by 68.1% (P = 0.002) and by 49.1% (P = 0.005) (Fig. 4e and Supplementary Fig. 11c), decreased cell invasion by 79.3% (P = 0.008) and 70.9% (P = 0.001) (Fig. 4f and Supplementary Fig. 11d), and reduced cell migration by 71.8% (P = 0.005) and 54.7% (P = 0.003) (Fig. 4g and Supplementary Fig. 11e) in OE19 and SK-GT-4, respectively. The oncogenic role of FBXO44 in AEG was further validated in the OE19 xenograft mouse model. We observed that FBXO44 OE increased the growth of AEG xenograft tumors by 2.54-fold (P = 0.004), whereas FBXO44 KD suppressed tumor growth by 67.17% (P = 0.029) in vivo (Fig. 4h and Supplementary Fig. 11f–h). Similar results were also observed in an OE19 orthotopic AEG mouse model (Fig. 4i, j, and Supplementary Fig. 11i, j). Of note, FBXO44 OE not only enhanced tumor growth but also increased the incidence of liver metastasis. In conclusion, our analysis revealed that a high level of FBXO44 expression is associated with a poor prognosis in AEG patients and promotes the growth and metastasis of AEG tumor cells in vitro and in vivo.

**Fig. 4: Clinical relevance and biological functions of FBXO44.**

Genomic differences among different AEG subtypes

We further examined the genomic alterations between different AEG proteomics subtypes. Mutation signatures were separately extracted in AEG subtypes (see Methods). These three subtypes showed shared and specific mutation signatures (Fig. 5a–c). In particular, S-I and S-II shared the SBS3 signature (Fig. 5a, c), which indicates defects in DNA double-strand break (DSB) repair by homologous recombination (HR). Both the S-II and S-III subtypes exhibited SBS6 mutation signatures that represent defective DNA mismatch repair (Figs. 5b and 5c). The SBS17b mutation signature was shared by the S-I and S-III subtypes (Fig. 5a, c), which displayed an exclusively high frequency of T > G nucleotide substitution. The SBS1 signature was specifically identified in the S-I subtype, which showed spontaneous or enzymatic deamination of 5-methylcytosine (Fig. 5a). The S-II subtype exclusively exhibited the mutation signature of APOBEC cytidine deaminase (the SBS2 signature) (Fig. 5b). The mutation signature of “deficiency in base excision repair due to inactivating mutations in NTHL1” (the SBS30 signature) was specifically detected in the S-III subtype (Fig. 5c). To further characterize subtype-specific genomic features, we separately conducted somatic interaction analyses in different AEG subtypes. We identified 21, 12, and 19 co-occurrence mutated gene pairs in the S-I, S-II, and S-III subtypes, respectively (Fig. 5d–f). Moreover, 2 and 4 mutually exclusive mutated gene pairs were found in the S-II and S-III subtypes, respectively. In particular, CSMD1 and ANKRD36C genes showed significant mutation co-occurrence across patients in the S-I AEG tumor subtype (Fig. 5d and Supplementary Fig. 13a). Co-occurring mutations of the MUC4 and CPED1 genes were specifically identified in the S-II subtype (Fig. 5e and Supplementary Fig. 13b). Mutations in FAT4 and PRKDC genes showed significant co-occurrence across AEG patients in the S-III subtype (Fig. 5f and Supplementary Fig. 13c). In addition, RYR2 and TTN were found to be exclusively mutated in the S-III AEG subtype (Fig. 5f and Supplementary Fig. 13d). Apart from being distinctive features among different AEG subtypes, co-occurring or exclusive mutations also implicate potential therapeutic strategies that pharmacologically target both genes or either gene of the related gene pair. Furthermore, known oncogenic pathways were examined in AEG tumors. The most frequently mutated oncogenic pathways in all subtypes were the “TP53”, “RTK-RAS”, and “Hippo” pathways (Fig. 5g). Although gene mutations in the “RTK-RAS” pathway were found in over half of the samples for individual subtypes, remarkably different sets of genes were affected in distinct subtypes (Fig. 5h). In conclusion, AEG subtypes showed clearly distinguishable genomic characteristics that might suggest different etiologic mechanisms and precision treatments for individual subtypes.

**Fig. 5: Comparisons of genomic features among the three proteomic subtypes.**

Immune infiltration in AEG tumors

To investigate the heterogeneity of the tumor microenvironment in AEG tumors, we performed cell type deconvolution analysis based on RNA-seq data. The xCell algorithm was employed to infer the relative cell abundance of 41 different cell types (see Methods). The infiltration of some cell types showed significant differences between the three AEG subtypes, such as regulatory T cells and fibroblasts, but none of them have associations with clinicopathological features of AEG patients (Fig. 6a). The S-II AEG tumor samples showed lower abundance of gamma delta T cells, regulatory T cells, and plasmacytoid dendritic cells, whereas they had higher infiltration of fibroblasts, lymphatic endothelial cells, and microvascular endothelial cells, compared to those of the S-I and S-III subtype (Supplementary Fig. 14). Comparisons of cell abundances between tumor and NAT samples in each subtype revealed pervasive changes in cell abundances across various cell types (Fig. 6b). Compared to the corresponding NAT samples, tumors in the S-II subtype had the least number of cell types, while the S-III subtype had the most cell types that showed alterations in cell abundance, especially the increase in lymphoid and myeloid cells. Some types of cells exhibited dysregulated abundances in all AEG subtypes. For example, the abundance of activated dendritic cells (aDCs) showed a significant increase in tumor samples of all three AEG subtypes (Fig. 6c). The abundance of fibroblasts was significantly decreased in the S-III subtype (FDR = 2.6E−4, Student’s t test) but showed no obvious changes in tumor samples from the S-I (FDR = 0.48, Student’s t test) and S-II (FDR = 0.98, Student’s t test) subtypes (Fig. 6d). Compared to samples in the S-I and S-II subtypes, our H&E analysis also revealed a decrease in fibroblast abundance of the S-III subtype (Fig. 6e). Given that fibroblasts may limit the immune cell infiltration to exert the immunosuppressive role in cancer⁴¹, this observation may partly explain that AEG patients in the S-I and S-II subtype had worse prognosis than those in the S-III subtype. Furthermore, we examined the expression changes in immune checkpoint genes, which were retrieved from a previous study⁴². Some immune checkpoints, such as CEACAM1, CD276, PLEC, HLA-DRB1, and LAIR1, were consistently up-regulated in all three subtypes (Fig. 6f). Subtype-specific dysregulation of immune checkpoints, such as the upregulation of CD200 and downregulation of TNFSF14 in the S-II subtype, was also observed. We further evaluated the associations between FBXO44 and immune cells or checkpoints. The high expression of FBXO44 was found associated with the low infiltration of Th2 cells and CD4⁺ Tem cells (Supplementary Fig. 15a, b), and also correlated with the high expression of immune checkpoints TNFRSF14, TNFRSF25, CD40, and VTCN1 (Supplementary Fig. 15c, d). Our analysis revealed the heterogeneity of tumor microenvironment infiltration and immune checkpoints, which suggested potential common and subtype-specific immunotherapy strategies for AEG patients.

**Fig. 6: Immune infiltration across different proteomic subtypes.**

Phosphoproteomic characterization of AEG tumors

We next investigated the alterations of phosphorylation modifications and kinase activity in AEG tumors. Differential phosphorylation analysis identified 4932 sites with increased phosphorylation (fold change > 1.5 and FDR < 0.05) and 3146 sites with decreased phosphorylation (fold change < 0.67 and FDR < 0.05) in tumor samples (Fig. 7a and Supplementary Data 10). Furthermore, sites with differential phosphorylation were identified in each subtype, revealing 1930, 601, and 2111 sites with increased phosphorylation and 1472, 645, and 1580 sites with decreased phosphorylation in the S-I, S-II, and S-III subtypes, respectively (Supplementary Data 11). The differentially phosphorylated proteins in the S-I and S-II subtypes were enriched in nuclear transport and cell organization, whereas those in the S-III subtype were enriched in chromatin modification and organization (Fig. 7b). A large proportion (35.5%) of differentially phosphorylated sites that were identified in all AEG tumor samples showed no obvious dysregulation in all subtypes (Supplementary Fig. 16a). Specifically, 2040 sites with increased phosphorylation and 1078 sites with decreased phosphorylation exhibited no significant changes in all three subtypes (Supplementary Fig. 16b). The kinase activities were then interpreted based on the differentially phosphorylated sites in each AEG subtype. Kinase-substrate enrichment analysis was performed to detect enriched kinases in different subtypes. Different AEG subtypes were enriched for distinct lists of kinases, and the same kinases showed different levels of activities in the S-I, S-II, or S-III subtypes (Fig. 7c). CDK2 and CDK7 were highly enriched in all three subtypes. The S-I subtype specifically showed enrichment of IKBKB and PRKDC. HIPK2 kinase was exclusively enriched in the S-II subtype, while CHEK2 and AURKB were specifically enriched in the S-III AEG subtype. Based on the correlations of known kinase-phosphosubstrate pairs (see Methods), we separately constructed the kinase-phosphosubstrate regulatory networks in three AEG proteomic subtypes (Fig. 7d–f and Supplementary Data 12). In both S-I and S-III subtypes, CDK1 exhibited the most significant correlations with its phosphosubstrates (Fig. 7d, f). CDK2 showed significantly positive correlations with 18 and 28 phosphosubstrates in the S-I and S-III subtypes, while no remarkable correlations were detected in the S-II subtype (Fig. 7e). We observed only one remarkable kinase-phosphosubstrate pair in the S-II subtype, wherein CSNK2A1 was significantly associated with the phosphorylation of Occludin S408 (P = 3.5E−2). Significant correlations of some kinases were found in specific subtypes, such as ATR in the S-I subtype and MAPK3 in the S-III subtype. Conclusively, our analysis revealed differences in kinase-phosphosubstrate regulatory networks between different subtypes and suggested potential personalized responses to clinical therapeutics for AEG patients.

**Fig. 7: Phosphoproteomic analyses in three AEG subtypes.**

Discussion

AEG is a gastroesophageal cancer whose incidence has notably risen in recent decades. However, there has been a lack of molecular classification and systematic characterization for AEG, which prevents the development of effective therapeutic strategies^2,13. Our study represents the attempt at proteomics-based multi-omics profiling for AEG tumors, including genomics, transcriptomics, proteomics, and phosphoproteomics. We presented the proteogenomic alterations in AEG tumors and classified AEG into three different subtypes based on proteomics data. These three AEG subtypes significantly differ in terms of clinical prognosis and molecular alterations.

It is well recognized that molecular subtyping has greatly improved our understanding of inter- and intra-tumor heterogeneity and promoted the development of personalized oncotherapy^15,43,44. Based on proteomics data, three different AEG subtypes were identified in our study. Patients with the S-I subtype have the shortest survival, whereas those with the S-III subtype have the longest survival. Stratification of patients based on survival time will help with precise clinical management and intervention strategies. Furthermore, we compared molecular features among these AEG subtypes. We identified signature proteins that exhibited exclusive high expression in specific subtypes, which could be used for subtype differential diagnosis and as potential targets of personalized treatments. Of these signature proteins, some were found to be significantly associated with AEG tumor progression. For example, FBXO44 was specifically upregulated in the S-II subtype, and its high expression is closely related to a poor prognosis in AEG patients. We experimentally validated that FBXO44 could promote the proliferation and metastasis of AEG tumor cells in vitro and in vivo. A recent study demonstrated that FBXO44 is an essential repressor of DNA replication-coupled repetitive elements in human cancer⁴⁰. The same study also showed that FBXO44 inhibition could enhance the response to anti-PD-1 therapy in immunocompetent mice bearing 4T1 cell-derived tumors. Combining our observations in AEG tumors, these results suggest that FBXO44 inhibition might overcome anti-PD-1 resistance in AEG tumors, especially for patients with the S-II subtype.

It is known that molecular alterations occurred frequently in tumor samples, but the specific alterations of proteome in AEG tumor have not yet systematically investigated. Pairwise comparisons of tumor and NAT around tumor sites are common in many multi-omics studies in gastric or colon cancer^18,26,27. By comparing to the normal samples, we identified differentially expressed proteins and altered biological processes in AEG tumor. Our analysis presented a comprehensive view of proteomic alterations in AEG tumors, and further investigation on their functions and molecular mechanisms in AEG may provide promising drug targets for this disease. The normal samples were also used to identify subtype-specific alterations. In our study, all NAT samples were collected from regions within ~2 cm around the corresponding AEG tumor sites. Paired tumor-NAT samples were derived from the same patients. To reduce the effect of inter-patient heterogeneity and identify subtype-specific tumor differences, we separately compared tumor with NAT samples in each AEG subtype.

In the hallmark gene set analysis, the “pancreas beta cell” gene set showed a significant decrease in AEG tumor samples, especially in the S-III subtype. A large number of adult stem or progenitor cells residue in the epithelium of gastrointestinal organs, which is a source of renewable insulin⁺ cells^45,46. The pancreas and gastrointestinal organs are developed from adjacent embryonic domains⁴⁷. Moreover, native antral endocrine cells and pancreatic β cells share high molecular similarity, and Ariyachet et al. demonstrated that antral stomach cells could be reprogrammed into pancreatic β cells in vivo⁴⁸. Therefore, the changes of “pancreas beta cell” gene set observed in our study might reflect changes in the epithelium. Further investigations are needed to examine our conjecture.

We examined the expression changes in immune checkpoint genes to screen potential immunotherapy targets of different AEG subtypes, which were not necessarily associated with prognosis. We observed that some of the markers may be related to the prognosis, indicating that patients of the S-III subtype may have a better response rate and treatment effect to tumor immunotherapy. Specifically, the expression of CD27 in the S-III subtype was significantly higher than that in the other types, while the expression of VTCN1 in the S-III subtype was significantly lower than that in the other types. CD27, which belongs to the tumor necrosis factor receptors, is a co-stimulatory immune checkpoint. CD27 has been demonstrated to participate in the regulation of generating and maintaining T cell immunity. Evidences have shown that CD27 was able to promote T cell function or dysfunction by regulating the production of IL-2^49,50. VTCN1, also known as B7-H4, is an immune checkpoint molecule that negatively regulates immune responses and is known to be overexpressed in many human cancers⁵¹. VTCN1 negatively regulates T cell immune response and promotes immune escape by inhibiting the proliferation, cytokine secretion, and cell cycle of T cells⁵². However, further studies are needed to confirm the specific role of these markers in the immune microenvironment of AEG.

Protein kinases have been developed as operable drug targets in the treatment of cancer^53,54. We identified hundreds of differentially phosphorylated sites in each AEG subtype, which could be utilized as possible subtype-specific drug targets. Kinase enrichment and kinase-phosphosubstrate relations were also evaluated in all AEG subtypes. Our analysis revealed shared and subtype-specific kinase enrichment and kinase-phosphosubstrate regulatory networks. These results suggest that drugs targeting different kinases might be effective in distinct AEG subtypes (for example, casein kinase II subunit alpha (CSNK2A1) could be a target in the S-II subtype). We hope that these target candidates could be experimentally and clinically explored to benefit patients with AEG tumors in the near future.

In conclusion, the multidimensional analysis in this study represents an advancement in the understanding of the molecular alterations and possible oncological mechanisms of AEG tumors. Although some of our findings need further biological and clinical validation, as the proteomics-based multi-omics characterization of AEG, these data and observations open prospective paths for biological interrogation and therapeutic exploration. Our study may also serve as a valuable resource for future drug discovery and precision clinical practice for patients with AEG tumors.

Methods

Collections and preparation of clinical specimen

This study included samples derived from 103 patients from the Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital) from April 2009 to April 2018. The Research Ethics Committees of Zhejiang Cancer Hospital approved the study (No. IRB-2021-288) and all patients provided written informed consent. The informed consent form clearly informs the patients that all clinical information such as age, sex, and TNM staging will be used for academic research and publication. These patients were all newly diagnosed patients with AEG who underwent surgical resection and had received no prior treatment for this disease, including chemotherapy, radiotherapy, targeted therapy, or biological therapy. Patients who were found to have two or more malignancies were excluded.

Patients in this cohort ranged from 40 to 87 years old; the cohort included 81 males and 22 females, 4 cases in stage I, 24 cases in stage II, 69 cases in stage III, and 6 cases in stage IV. We included 27 Siewert type I, 31 Siewert type II, and 45 Siewert type III AEG patients. More detailed clinical information of individual patients, including age, sex, smoking, and drinking status, date of surgery, Lauren type, Borrmann classification, grade of differentiation, tumor size, tumor-node-metastasis (TNM) staging, and survival status and time, are listed in the Supplementary Data 1. Pathological staging was based on the eighth edition of the American Joint Committee on Cancer’s Staging System. Tumor tissues and paired NATs were collected from the same patients at surgical resection. Of note, NAT samples were collected from regions within ~2 cm around the corresponding tumor sites. The sample size was approximately 0.5$\times$0.5 cm, and four to five tumor specimens and NATs were collected for most cases. For genomic, proteomic, and phosphoproteomic analyses, tissue specimens endured cold ischemia for less than 30 min prior to freezing in −80 °C refrigerators. For transcriptomic analysis, tissue specimens were soaked in RNA protective solution at 4 °C overnight, and then frozen in −80 °C refrigerators. Histologic sections obtained from the top and bottom portions of each specimen were reviewed by a senior board-certified pathologist to confirm the tissues as tumors or NATs. The top and bottom sections had to contain an average of 60% tumor cell nuclei with less than 20% necrosis to be deemed acceptable for this study.

Protein extraction and tryptic digestion

A total of 103 AEG tumor tissues and paired NATs were analyzed by proteomics and phosphoproteomics profiling. The samples were taken out from the −80 °C freezers and total proteins were extracted from each sample. In particular, approximately 20–60 mg of tissue sample was placed into a mortar that was pre-cooled with liquid nitrogen and fully ground to a powder under liquid nitrogen. Four volumes of lysis buffer (1% Triton X-100, 1% protease inhibitor, 1% phosphatase inhibitor) were added to the sample powder of each group for ultrasonic lysis. The debris was removed by centrifugation at 12,000 × g at 4 °C for 10 min. Finally, the supernatant was collected and transferred to a new centrifuge tube and the protein concentration was determined using the BCA protein assay (BCA Protein Assay Kit, Pierce). For digestion, the same amount of protein was extracted from each sample, and the volume of each group was adjusted with lysate. Then, 20% trichloroacetic acid was added slowly and precipitated at 4 °C for 2 h. The samples were centrifuged at 4500 × g for 5 min, the supernatant was discarded, and the precipitate was washed with pre-cooled acetone three times. After drying the protein pellets, triethyl-ammonium bicarbonate buffer was added at a concentration of 200 mM, and the pellet was ultrasonically dispersed. Then, trypsin was added at a ratio of 1:50 (protease:protein; m/m) to hydrolyze the proteins at 37 °C overnight. Dithiothreitol (DTT, 5 mM) was added as the reducing agent at 56 °C for 30 min. Finally, iodoacetamide (IAA, 11 mM) was added and incubated at room temperature in the dark for 15 min.

Phosphorylation modification enrichment

The peptides were dissolved in an enrichment buffer solution (50% acetonitrile/0.5% acetic acid). The supernatant was transferred to the pre-washed immobilized metal affinity capture (IMAC) material, placed on a rotating shaker, and incubated by gentle shaking. The IMAC microspheres with enriched phosphopeptides were collected by centrifugation, and the supernatant was removed. To remove nonspecifically adsorbed peptides, the IMAC microspheres were sequentially washed with 50% acetonitrile/6% trifluoroacetic acid and 30% acetonitrile/0.1% trifluoroacetic acid. To elute the enriched phosphopeptides from the IMAC microspheres, an elution buffer containing 10% NH₄OH was added, and the enriched phosphopeptides were eluted with vibration. The supernatant containing phosphopeptides was collected and lyophilized for LC-MS/MS analysis.

Liquid chromatography-mass spectrometry (LC-MS) analysis

The tryptic peptides were dissolved in solvent A (0.1% formic acid, 2% acetonitrile in water) and directly loaded onto a homemade reversed-phase analytical column (25-cm length, 100 μm i.d.). Liquid gradient settings for proteomic analysis: Peptides were separated with a gradient from 6% to 24% solvent B (0.1% formic acid in acetonitrile) over 70 min, 24% to 35% in 14 min, further climbing to 80% in 3 min, and then holding at 80% for the last 3 min, all at a constant flow rate of 450 nL/min on a NanoElute UHPLC system (Bruker Daltonics). Liquid gradient settings for phosphoproteomic analysis: Peptides were separated with a gradient from 2% to 22% solvent B (0.1% formic acid in acetonitrile) over 50 min, 22% to 35% over 2 min, further climbing to 80% over 4 min, and then holding at 80% for the last 4 min, all at a constant flow rate of 450 nL/min on a nanoElute UHPLC system (Bruker Daltonics). Then, the peptides were subjected to a capillary source followed by timsTOF Pro (Bruker Daltonics) mass spectrometry. The electrospray voltage applied was 1.7 kV. Precursors and fragments were analyzed at the time-of-flight (TOF) detector, with an MS/MS scan range from 100 to 1700 m/z. The timsTOF Pro was operated in parallel accumulation serial fragmentation (PASEF) mode. Precursors with charge states of 0–5 were selected for fragmentation, and 10 PASEF-MS/MS scans were acquired per cycle. The dynamic exclusion was set to 30 s/24 s (proteomic analysis/phosphoproteomic analysis).

Protein database searching

The resulting tandem mass spectrometry data were processed using the MaxQuant search engine (v.1.6.6.0)⁵⁵. Tandem mass spectra were searched against the human UniProt database⁵⁶ (20,366 entries, downloaded on May 9, 2020) concatenated with a reverse decoy database. Trypsin/P was specified as a cleavage enzyme allowing up to 2 missing cleavages. The mass tolerance for precursor ions was set as 20 ppm in the first search and 20 ppm in the main search, and the mass tolerance for fragment ions was set as 20 ppm. Carbamidomethyl on Cys was specified as a fixed modification, and acetylation on the protein N-terminal, oxidation on Met, and phosphorylation on Ser, Thr, and Tyr were specified as variable modifications. The quantitative method was set as label free quantitative (LFQ), and the FDR threshold for protein identification and peptide-spectrum match (PSM) identification was set as 1%. The protein group intensities are provided in Supplementary Data 13.

Normalization of proteomic and phosphoproteomic data

The iBAQ intensities for proteomics and phosphoproteomics data of 206 samples (103 paired tumors and NATs) were extracted from the MaxQuant result files. A 10,148 × 206 matrix was generated to represent the expression of particular proteins across samples, and a 37,773 × 206 expression matrix was obtained for particular phosphorylation sites. The proteomics and phosphoproteomics data were normalized following a previous study³⁴. More specifically, expression matrixes were then subjected to quantile normalization by using the normalized quantile functions implemented in the limma R package (version 3.46.0, R version 4.0.2)⁵⁷. Next, log2-transformation of the normalized iBAQ intensities was calculated for the following quantitative analyses. In addition, all missing values were imputed with the minimum values across individual expression matrixes. The limma package was also adopted to compute the difference in protein and phosphorylation abundances between tumor and paired NAT samples. Specifically, the difference was statistically evaluated by employing a simple linear model and moderated t-statistics by the empirical Bayes shrinkage method.

Whole-exome sequencing

WES was performed for paired tumor tissues and NATs of 94 AEG cases. Genomic DNA was isolated from tumor tissues and NATs using a DNeasy tissue kit (Qiagen, Hilden, Germany). The concentrations of genomic DNA samples were determined by using the Qubit dsDNA BR Assay (Thermo Fisher Scientific). The DNA integrity was determined by 1% agarose gel electrophoresis. Genomic DNA samples of 1-3 µg were sheared by a Bioruptor® Pico Sonication System (Diagenode SA, Belgium), and an Agilent 2100 Bioanalyzer (Agilent Technologies) was used to assess DNA fragment sizes of approximately 250 bp. These whole-genomic libraries were subsequently prepared by the SureSelectXT Target Enrichment System for Illumina Paired-End Multiplexed Sequencing Library kit (Agilent Technologies). The whole-exome sequence was captured by SureSelectXT Human All Exon V6 (Agilent Technologies) and quantified by Qubit, Agilent 2100 Bioanalyzer, and qPCR (KAPA Library Quantification Kit KR0405). The final libraries were sequenced for paired-end 150 bp using the Illumina NovaSeq 6000 Sequencing System (Illumina Inc., San Diego, CA, USA) at LC-Bio Technology Co., Ltd. Adapters and low-quality reads (q quality score < 20) were removed from raw WES reads by using fastp software (version 0.21.0)⁵⁸. Then, BWA software (version 0.7.17)⁵⁹ was utilized to align filtered reads to the human reference genome (GRCh38). Alignments were subjected to Picard tools (http://broadinstitute.github.io/picard/) to identify and mark duplicate reads. Next, local realignment was performed to correct potential alignment errors around indels. Prior to variant calling, base quality score recalibration was performed to reduce systematic biases. Then, somatic SNVs and InDels were jointly called by Mutect2 (version 4.1.9.0)⁶⁰ and Strelka2 (version 2.9.10)⁶¹. Only variants that passed both quality filtering steps were used in the follow-up analysis. The Variant Effect Predictor (VEP) tool⁶² was utilized to fetch biological information of the variant set. Called mutations with annotation information are supplied in Supplementary Data 2.

mRNA sequencing

mRNA sequencing (RNA-seq) was performed in paired tumor tissues and NATs of 83 AEG cases. Total RNA was isolated from the tumor tissues and NATs in RNA protective solution using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) following the manufacturer’s procedure. The RNA amount and purity of each sample were quantified by using a NanoDrop ND-1000 (NanoDrop, Wilmington, DE, USA). The RNA integrity was assessed by an Agilent 2100 with RIN > 7.0. For mRNA sequencing, the library was prepared on 1 μg of DNase I-treated total RNA using a TruSeq kit (Illumina), and 150-bp paired-end sequencing was performed on an Illumina HiSeq X Ten machine at LC-Bio Technology Co., Ltd. (Hangzhou, China) following the vendor’s recommended protocol. Raw sequencing RNA reads were first trimmed to remove low-quality bases and reads by using Trimmomatic software (version 0.39)⁶³ with default parameters. The filtered reads were then aligned to the human reference genome (GRCh38) by using the splice-aware aligner HISAT2 (version 2.2.1)⁶⁴. Alignment results were subjected to gene quantification with gene annotation from GENCODE (version 35)⁶⁵ by adopting StringTie software (version 2.14)⁶⁶. Gene expression levels were normalized in the unit of transcripts per million mapped reads (TPM). Genes with expression levels higher than 0.1 TPM in at least one sample remained for downstream analysis. Raw gene counts are provided in Supplementary Data 14.

Hallmark gene set analysis

The hallmark gene sets were retrieved from the Molecular Signatures Database (MSigDB)⁶⁷. These fifty gene sets were refined from a wide range of biological processes by reducing both variation and redundancy. The integrated abundance of proteins in these hallmarks was then calculated in each sample by utilizing the GSVA R package (version 1.38.2)⁶⁸. A normalized protein expression matrix was used in the calculation.

Proteomic subtype identification in AEG tumor samples

The non-negative matrix factorization (NMF) algorithm, which is a popular approach to effectively distinguish groups with different molecular features^15,34,69, was employed to identify AEG subtypes from the protein expression profiles of 103 AEG tumor samples. In particular, the consensus cluster method implemented in the NMF R package (version 0.23.0)⁷⁰ was utilized to identify the distinct proteomics patterns among individual samples. First, the proteomics profile was filtered before NMF analysis to remove proteins that were detected in less than 25% of the samples, leaving 9783 proteins. Then, the variation coefficient of each protein across all samples was calculated, and the top 25% of most variable proteins (2445) were used for unsupervised consensus clustering. Next, the NMF algorithm was performed to estimate the optimal rank in a given range from 2 to 5 using 200 interactions. A rank of 3 was selected to run the NMF clustering in 200 interactions. Missing values were imputed with the minimum value in our proteomic dataset.

Identification of signature proteins in each subtype

To identify the specific molecular alterations in our proteomic subtypes, we compared the protein abundances between tumor samples in individual subtypes with those in tumor and NAT samples of the other subtypes. The statistical significance was estimated by the empirical Bayes shrinkage method implemented in the limma package as described above. In each subtype, a protein that showed remarkably higher abundances than all NAT samples and tumor samples in the other subtypes was considered a signature protein.

TCGA gene expression analysis

The gene expression profiles of 18 different cancer types in the TCGA cohort with paired tumor and normal adjacent samples were retrieved from the Genomic Data Commons data portal⁷¹ (GDC). In each cancer type, the normalized expression matrix (in TPM unit) was adopted to perform differential expression analysis by using paired Student’s t test (as implemented in the R software). Genes with |fold change| ≥ 1.5 and FDR < 0.05 were regarded as statistically significant.

Survival analysis

The overall survival time was compared between different groups by using the log-rank test implemented in the survival package (version 3.2.3, https://CRAN.R-project.org/package=survival). The survival curves were generated by using the Kaplan–Meier method in the survminer R package (version 0.4.9, https://CRAN.R-project.org/package=survminer). Except for the analysis of subtypes, tumor patients were divided into high- and low-abundance groups by using the median abundances of individual proteins, phosphorylation sites or genes. Hazard ratios with 95% confidence intervals were calculated from the Cox proportional hazards regression analysis. Clinical variables, including age, sex, smoking history, alcohol history, Siewert type, and tumor stage, were used in the Cox regression multivariate analysis.

Protein-protein interaction network analysis

The human protein–protein interactions (PPIs) were obtained from the STRING database (v11.5)⁷². Differentially expressed proteins (DEPs) were mapped to PPIs to generate the DEP PPI network in AEG. Single nodes were removed from the network. We obtained a PPI network of 3,923 nodes and 79,088 edges. The Cytoscape (version 3.9.0) software⁷³ was used to visualize the network. The Cytoscape plugin cytoHubba⁷⁴ was utilized to calculate the degree, closeness, and betweenness of all nodes in the PPI network.

Tissue microarray (TMA) construction and immunohistochemistry analysis

A total of 251 formalin-fixed, paraffin-embedded AEG tissues and corresponding NATs from Jan 1, 2009 to Dec 31, 2017 were collected in Zhejiang Cancer Hospital. Two pathologists independently selected the most representative tumors and paired NATs, and TMAs were produced as previously described⁷⁵. Immunohistochemical staining of serial TMAs was carried out as previously described⁷⁵. After treating with 3% H₂O₂/methyl alcohol solution for 10 min at room temperature, 5% normal goat serum buffer was used to block the tissue at 37 °C for 30 min. Slides were then incubated with primary antibodies at 4 °C overnight. After washing, the slides were incubated with biotin labeled goat anti-rabbit IgG and HRP-conjugated streptavidin at 37 °C for 1 h. Immunoreaction was visualized by diaminobenzidine (DAB) (Cat#ZLI-9065, ZSGB-BIO Corp., Shanghai, China). After DAB staining, all tissues were counterstained with hematoxylin (Cat#ZLI-9609 ZSGB-BIO Corp., Shanghai, China) dehydrated and then blocked. The FBXO44 (1:300) antibody was purchased from Proteintech (Chicago, USA). Two experienced pathologists independently evaluated the slides. Brown-stained cells were considered positive. The expression of FBXO44 was assessed using the H-score system. The formula for the H-score was as follows:

$$H\,_{{{{{\rm{score}}}}}}=\sum \left({{{{\rm{IS}}}}}\times {AP}\right)$$

(1)

where IS represents the staining intensity and AP represents the percentage of positively stained tumor cells. The H-score ranged between 0 and 12. An IS between 0 and 3 was assigned for the intensity of tumor cell staining (0 for no staining; 1 for weak staining; 2 for intermediate staining; 3 for strong staining). AP depended on the percentage of positively stained cells as follows: 0 (0%), 1 (1–25%), 2 (26–50%), 3 (51–75%), and 4 (76–100%). The score was assigned using the estimated proportion of positively stained tumor cells. A score ≥6 was considered positive, and <6 was considered negative.

Cell lines and cell culture

Human AEG cell lines, including OE19 (Cat#CBP60495, OE19 was established in 1993 from a 72-year-old male patient with gastric cardia adenocarcinoma⁷⁶) and SK-GT-4 (Cat#CBP60462, SK-GT-4 was established in 1989 from the primary tumor of an 89-year-old Caucasian male with an adenocarcinoma of the distal esphagus^77,78), were obtained from Cobioer Biosciences Co., Ltd. (Nanjing, China). OE19 and SK-GT-4 cells were cultured in RPMI 1640 medium (Kino Biological and Pharmaceutical Technology Co., Ltd, Hangzhou, China) containing 10% fetal bovine serum (FBS, Gibco, Grand Island, USA) and 1% penicillin/streptomycin (Kino Co., Ltd., Hangzhou, China) at 37 °C under 5% CO₂ in a cell culture incubator. These two cell lines were identified by Short Tandem Repeat, and bacterial and fungi contamination test were negative.

Colony formation assays

FBXO44 knockdown (shFBXO44) and corresponding negative control (shCtrl) cells and FBXO44 overexpression (FBXO44) and corresponding vector cells were seeded in 6-well plates (500 cells/well) and cultured for 14 days with fresh medium. Thereafter, the cells were subjected to fixation and crystal violet (Solarbio, China) staining. Visible colonies (with >50 cells) were counted to determine the clonogenic potential of these cells.

Transwell invasion assays

For invasion assays, the upper surface of the membrane was covered by a layer of Matrigel (BD Biosciences, USA). Then, approximately 5 × 10⁴ OE19 and SK-GT-4 (Vector, FBXO44, shCtrl, and shFBXO44) cells were suspended in 200 µL serum-free medium and inoculated in the upper compartment of the transwell chamber (Corning, USA). Furthermore, 500 µL of complete medium containing 20% FBS was added to the lower chamber. After incubation for 48 h, the cells on the upper surface of the cell membrane were removed with cotton swabs, and the remaining cells were washed with PBS, stained with crystal violet (Solarbio, China), and photographed and analyzed under a microscope at 200× magnification (ix71, Olympus, Japan).

Wound healing assays

For wound healing assays, approximately 2 × 10⁶ OE19 and SK-GT-4 (Vector, FBXO44, shCtrl and shFBXO44) cells were seeded onto 6-well plates. Then, three fields of vision were randomly selected for each group and photos were taken at 200× magnification under an optical microscope (ix71, Olympus, Japan) at 0 h and 12 or 24 h after wound induction.

Mutation signature analysis

To characterize the patterns of nucleotide substitutions, the trinucleotideMatrix function implemented in the maftools R package (version 2.6.05)⁷⁹ was used to extract the matrix of nucleotide substitutions in each AEG proteomic subtype. The nucleotide substitution matrix was then decomposed to generate mutation signatures by classifying the immediate bases surrounding mutated bases into 96 substitution classes. Each identified mutation signature was compared to the COSMIC SBS signatures⁸⁰ by calculating the cosine similarity.

Identification of somatic interactions

Some genes were mutually or concomitantly mutated in individual samples. The somaticInteractions function implemented in the maftools R package was employed to detect the mutually exclusive or co-occurring gene pairs. In particular, the pair-wise Fisher’s exact test was used to identify significant gene pairs with mutual or co-occurring mutations.

Bioluminescence imaging

In vivo bioluminescence imaging was carried out by using a cooled CCD camera system (IVIS Imaging System, PerkinElmer, CA, USA) to observe tumor growth. Briefly, normal saline containing 15 mg/mL d-luciferin (Art.No.40901ES03, Yeasen Corp., Shanghai, China) was intraperitoneally injected into mice at 150 mg/kg body weight. These mice were placed in the light-tight chamber of the CCD camera system accompanying 2% isoflurane anesthesia. For luminescent image acquisition, an integration time of 1 to 60 sand binning factors of 4 was used. Signal intensity was measured according to the flux of all detected photon counts from the region tumor area using the Living Image software package (Xenogen Corp., Alameda, CA, USA).

Hematoxylin–eosin staining and immunohistochemistry

Paraformaldehyde-fixation, ethanol dehydration, transparency with xylene, and paraffin-embedding were carried out for all tissues. A hematoxylin-eosin (H&E) staining kit (Art. ZLI-9609 ZSGB-BIO Corp., Shanghai, China) was used to stain the tissue slices. The histological changes in the tumor tissues were observed with a microscope at 200× magnification. For immunohistochemistry staining, 4-μm tissue slides were treated with 1 mM EDTA buffer (pH = 9.0) for antigen retrieval. The samples were incubated with the anti-FBXO44 antibody (Cat. No. 10626-1-AP) from Proteintech (Chicago, IL, USA). They were then incubated with biotin-labeled goat-rabbit IgG and horseradish peroxidase-conjugated streptavidin for 1 h. They were then photographed with an inverted microscope at 200× magnification.

Estimation of infiltrating cell abundance

The abundances of different infiltrating cell types were calculated by using xCell (https://xcell.ucsf.edu/)⁸¹ based on transcriptomic data. In the 64 cell types curated by the xCell method, we removed those that were not relevant in AEG tissues, such as hepatocytes, keratinocytes, and osteoblast. We then removed those cell types that had a xCell score of 0 across all samples. A total of 41 cell types were involved in subsequent analysis, including 10 stromal cell types (adipocytes, astrocytes, fibroblasts, preadipocytes, pericytes, lymphatic [ly] endothelial cells, microvascular [mv] endothelial cells, smooth muscle, endothelial cells, and myocytes), 9 lymphoid cell types (central memory CD4⁺ T cells [CD4⁺ Tcm], effector memory CD4⁺ T cells [CD4⁺ Tem], CD4⁺ memory T cells, regulatory T cells [Treg], gamma delta T cells [Tgd], T helper type 1 cells [Th1], T helper type 2 cells [Th2], naïve B cells, and plasma cells), 11 myeloid cell types (basophils, dendritic cells [DC], macrophages, M1 macrophages, M2 macrophages, mast cells, activated dendritic cells [aDC], conventional dendritic cells [cDC], immature dendritic cells [iDC], plasmacytoid dendritic cells [pDC], and monocytes), 7 stem cell types (platelets, common lymphoid progenitor [CLP], common myeloid progenitor [CMP], granulocyte-macrophage progenitor [GMP], hematopoietic stem cells [HSC], megakaryocyte-erythroid progenitor [MEP], and megakaryocytes) and 4 other cell types (epithelial cells, mesangial cells, neurons, and sebocytes). Briefly, xCell inferred cell types based on gene signatures that were extracted from 1822 pure human cell type transcriptomes by a curve fitting approach. The xCell scores (relative abundances) were calculated in each sample (Supplementary Data 15) and were compared between different groups by using Student’s t test.

Kinase-substrate enrichment analysis

The kinase-substrate enrichment analysis (KSEA) was conducted by using the KSEAapp R package (version 0.99.0)⁸² with known kinase-substrate pairs derived from PhosphoSitePlus® (PSP)⁸³ and NetworKIN 3.0⁸⁴. Kinase-substrate pairs with a score of more than 1 were used in the enrichment analysis. In each subtype, Spearman correlation coefficients between different kinase proteins and paired phosphosubstrates were calculated to build the kinase-phosphosubstrate network.

Establishment of stable FBXO44 overexpression and knockdown cell lines

Human FBXO44-shRNA and FBXO44-overexpression lentiviral vectors were constructed, validated, and supplied by Shanghai Genechem Chemical Technology Co., Ltd. (Genechem, Shanghai, China). For FBXO44 silencing, among the three designed FBXO44 siRNA target sequences tested, the target sequence with the best silencing efficiency was: CCAGCAGAAGAGCGATGCCAA. After annealing, oligonucleotides were cloned into the AgeI/EcoRI sites of Luc-tagged GV344 lentivirus vectors (Genechem, Shanghai, China). After identification of the correct sequence and lentivirus packaging, OE19 and SK-GT-4 cells were infected at a multiplicity of infection (MOI) of 10 for 24 h. For FBXO44 overexpression, the cDNA of FBXO44 was sub-cloned using Taq DNA polymerase (SinoBio Biltech Co. Ltd., Shanghai, China) and inserted into the BamHI/AgeI sites of Luc-tagged GV260 lentivirus vectors (Genechem, Shanghai, China). Forward primer: AGGTCGACTCTAGAGGATCCCGCCACCATGGCTGTGGGGAACATCAAC, reverse primer: CTTCCATGGTGGCGACCGGTACGGGCAGCGGGGGCCCGATGGTGATG. After identification of the correct sequence and lentivirus packaging, OE19 and SK-GT-4 cells were infected at an MOI of 10 for 24 h.

Xenograft and orthotopic mouse models of AEG

In accordance with the protocols for experimentation on animals (National Institutes of Health Publication No. 85-23, revised 1996), the animal experiments conducted were approved by the Institutional Animal Care and Use Committee of Zhejiang Chinese Medical University (The Ethics Committee stipulates that the xenograft and orthotopic tumor volume of mice should not exceed 2000 mm³, and our experiments meet the ethical requirements.). The nude mice (male, 4 weeks old) were raised in the laboratory for a week before the experiment. Mice were fed in the Specific Pathogen Free (SPF) barrier center at the animal experimental center of Zhejiang Chinese Medical University, under standard conditions of temperature (25 ± 2 °C) and humility (50 ± 5%) in a 12 h light/12 h dark cycle with normal drink and food. A total of 5 × 10⁶ OE19 (Vector, FBXO44, shCtrl and shFBXO44) cells were injected subcutaneously to establish subcutaneous xenograft tumor models in nude mice, 6 mice in each group. The body weight, living status, and tumor size of nude mice were recorded. After 5 weeks of observation, the mice were put into the carbon dioxide anesthesia box, the carbon dioxide valve was then open, and when the animal gradually loses consciousness, the carbon dioxide concentration was increased to 100% for 2 min, and then followed by cervical dislocation. The nude mice were sacrificed, and tumors were frozen at −80 °C until use. For the orthotopic mouse model, subcutaneous tumors grown in nude mice were harvested and resected under aseptic conditions. Necrotic tissues were removed, and viable tissues were cut with scissors and minced into 1–2 mm³ fragments. Before implantation, the mice were anesthetized by an intraperitoneal injection of 0.3% pentobarbital sodium (25 μl/g body weight) (Sigma, Steinheim, Germany). A 10–15 mm midline incision was made in the upper abdomen, and the stomach was carefully exposed. Part of the serosal membrane, approximately 2 mm in diameter, in the middle of the greater curvature of the stomach was mechanically injured with a scalpel. A tumor piece was then fixed onto the injured site of the serosal surface with medical OB glue. The stomach was then returned to the peritoneal cavity, and the abdominal wall and the skin were closed with sutures. The remaining steps were the same as those in the xenograft mouse model experiments.

Statistical analysis

Statistical analysis and data visualization in this study were performed by using R software (R Foundation for Statistical Computing, Vienna, Austria; http://www.r-project.org). Unless otherwise specified, all tests were two-tailed, and a P value or FDR < 0.05 was considered to indicate statistical significance.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The proteomics and phosphoproteomics data were deposited in the ProteomeXchange database⁸⁵ with dataset identifiers PXD030667 and PXD030725, respectively. The WES and RNA-seq data were deposited in the Sequence Read Archive (SRA) database under the accession number PRJNA788008. The gene expression profiles, mutation, and CNV datasets of TCGA cohorts were retrieved from the Genomic Data Commons (GDC) data portal (https://portal.gdc.cancer.gov/). Software and publicly available resources used in this study were described in the Methods section. Other results generated in this study can be found in the Supplementary data. Source data are provided with this paper.

Code availability

Scripts and code that were used for data analysis and visualization were deposited in https://github.com/lishenglilab/AEG_Proteomics.

References

Donlon, N. E. et al. Adverse biology in adenocarcinoma of the esophagus and esophagogastric junction impacts survival and response to neoadjuvant therapy independent of anatomic subtype. Ann. Surg. 272, 814–819 (2020).
Article PubMed Google Scholar
Yamashita, H. et al. Results of a nation-wide retrospective study of lymphadenectomy for esophagogastric junction carcinoma. Gastric Cancer 20, 69–83 (2017).
Article CAS PubMed Google Scholar
Arnold, M. et al. Global burden of 5 major types of gastrointestinal cancer. Gastroenterology 159, 335–349.e315 (2020).
Article PubMed Google Scholar
Sung, H. et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249 (2021).
Article PubMed Google Scholar
Schuhmacher, C. et al. Neoadjuvant chemotherapy compared with surgery alone for locally advanced cancer of the stomach and cardia: European Organisation for Research and Treatment of Cancer randomized trial 40954. J. Clin. Oncol. 28, 5210–5218 (2010).
Article PubMed PubMed Central Google Scholar
Cao, F. et al. Current treatments and outlook in adenocarcinoma of the esophagogastric junction: a narrative review. Ann. Transl. Med. 10, 377 (2022).
Article CAS PubMed PubMed Central Google Scholar
Saito, T. et al. Treatment response after palliative radiotherapy for bleeding gastric cancer: a multicenter prospective observational study (JROSG 17-3). Gastric Cancer 25, 411–421 (2022).
Article CAS PubMed Google Scholar
Qiu, M. Z. et al. Clinicopathological characteristics and prognostic analysis of Lauren classification in gastric adenocarcinoma in China. J. Transl. Med. 11, 58 (2013).
Article PubMed PubMed Central Google Scholar
Abdi, E., Latifi-Navid, S., Zahri, S., Yazdanbod, A. & Pourfarzi, F. Risk factors predisposing to cardia gastric adenocarcinoma: Insights and new perspectives. Cancer Med. 8, 6114–6126 (2019).
Article PubMed PubMed Central Google Scholar
Tomb, J. F. et al. The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 388, 539–547 (1997).
Article ADS CAS PubMed Google Scholar
Smolka, A. J. & Schubert, M. L. Helicobacter pylori-induced changes in gastric acid secretion and upper gastrointestinal disease. Curr. Top. Microbiol. Immunol. 400, 227–252 (2017).
CAS PubMed Google Scholar
Ajani, J. A. et al. Esophageal and esophagogastric junction cancers, version 2.2019, NCCN Clinical Practice Guidelines in Oncology. J. Natl Compr. Cancer Netw. 17, 855–883 (2019).
Article CAS Google Scholar
Ajani, J. A. et al. Gastric adenocarcinoma. Nat. Rev. Dis. Prim. 3, 17036 (2017).
Article PubMed Google Scholar
Janjigian, Y. Y. et al. First-line nivolumab plus chemotherapy versus chemotherapy alone for advanced gastric, gastro-oesophageal junction, and oesophageal adenocarcinoma (CheckMate 649): a randomised, open-label, phase 3 trial. Lancet 398, 27–40 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome Atlas Research, N. Comprehensive molecular characterization of gastric adenocarcinoma. Nature 513, 202–209 (2014).
Article ADS Google Scholar
Lin, Y. et al. Genomic and transcriptomic alterations associated with drug vulnerabilities and prognosis in adenocarcinoma at the gastroesophageal junction. Nat. Commun. 11, 6091 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Cristescu, R. et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat. Med. 21, 449–456 (2015).
Article CAS PubMed Google Scholar
Mun, D. G. et al. Proteogenomic characterization of human early-onset gastric cancer. Cancer Cell 35, 111–124 e110 (2019).
Article CAS PubMed Google Scholar
Wang, K. et al. Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer. Nat. Genet. 46, 573–582 (2014).
Article CAS PubMed Google Scholar
Suh, Y. S. et al. Comprehensive molecular characterization of adenocarcinoma of the gastroesophageal junction between esophageal and gastric adenocarcinomas. Ann. Surg. 275, 706–717 (2022).
Article PubMed Google Scholar
Hao, D. et al. Integrated genomic profiling and modelling for risk stratification in patients with advanced oesophagogastric adenocarcinoma. Gut 70, 2055–2065 (2021).
Article CAS PubMed Google Scholar
Reva, B., Antipin, Y. & Sander, C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 39, e118 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sevim Bayrak, C. et al. Identification of discriminative gene-level and protein-level features associated with pathogenic gain-of-function and loss-of-function variants. Am. J. Hum. Genet. 108, 2301–2318 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. B. et al. Proteogenomic and metabolomic characterization of human glioblastoma. Cancer Cell 39, 509–528.e520 (2021).
Article CAS PubMed PubMed Central Google Scholar
Archer, T. C. et al. Proteomics, post-translational modifications, and integrative analyses reveal molecular heterogeneity within medulloblastoma subgroups. Cancer Cell 34, 396–410.e398 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, C. et al. Integrated omics of metastatic colorectal cancer. Cancer Cell 38, 734–747.e739 (2020).
Article CAS PubMed Google Scholar
Ge, S. et al. A proteomic landscape of diffuse-type gastric cancer. Nat. Commun. 9, 1012 (2018).
Article ADS PubMed PubMed Central Google Scholar
Krug, K. et al. Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy. Cell 183, 1436–1456.e1431 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xu, J. Y. et al. Integrative proteomic characterization of human lung adenocarcinoma. Cell 182, 245–261.e217 (2020).
Article CAS PubMed Google Scholar
Stewart, P. A. et al. Proteogenomic landscape of squamous cell lung cancer. Nat. Commun. 10, 3578 (2019).
Article ADS PubMed PubMed Central Google Scholar
Gillette, M. A. et al. Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma. Cell 182, 200–225.e235 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, Y. J. et al. Proteogenomics of non-smoking lung cancer in east asia delineates molecular signatures of pathogenesis and progression. Cell 182, 226–244.e217 (2020).
Article CAS PubMed Google Scholar
Dong, L. et al. Proteogenomic characterization identifies clinically relevant subgroups of intrahepatic cholangiocarcinoma. Cancer Cell 40, 70–87.e15 (2022).
Article CAS PubMed Google Scholar
Jiang, Y. et al. Proteomics identifies new therapeutic targets of early-stage hepatocellular carcinoma. Nature 567, 257–261 (2019).
Article ADS CAS PubMed Google Scholar
Sondka, Z. et al. The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Nat. Rev. Cancer 18, 696–705 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome Atlas Research, N. et al. Integrated genomic characterization of oesophageal carcinoma. Nature 541, 169–175 (2017).
Article Google Scholar
Iorio, F. et al. A landscape of pharmacogenomic interactions in cancer. Cell 166, 740–754 (2016).
Article CAS PubMed PubMed Central Google Scholar
Basu, A. et al. An interactive resource to identify cancer genetic and lineage dependencies targeted by small molecules. Cell 154, 1151–1161 (2013).
Article CAS PubMed PubMed Central Google Scholar
Corsello, S. M. et al. The Drug Repurposing Hub: a next-generation drug library and information resource. Nat. Med. 23, 405–408 (2017).
Article CAS PubMed PubMed Central Google Scholar
Shen, J. Z. et al. FBXO44 promotes DNA replication-coupled repetitive element silencing in cancer cells. Cell 184, 352–369 e323 (2021).
Article CAS PubMed Google Scholar
Barrett, R. L. & Pure, E. Cancer-associated fibroblasts and their influence on tumor immunity and immunotherapy. Elife 9, e57243 (2020).
Article CAS PubMed PubMed Central Google Scholar
Auslander, N. et al. Robust prediction of response to immune checkpoint blockade therapy in metastatic melanoma. Nat. Med. 24, 1545–1549 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome Atlas Research Network. Electronic address: wheeler@bcm.edu; Cancer Genome Atlas Research Network Comprehensive and integrative genomic characterization of hepatocellular carcinoma. Cell 169, 1327–1341.e1323 (2017).
Article Google Scholar
Liu, Y. et al. Comparative molecular analysis of gastrointestinal adenocarcinomas. Cancer Cell 33, 721–735.e728 (2018).
Article CAS PubMed PubMed Central Google Scholar
Barker, N. et al. Lgr5(+ve) stem cells drive self-renewal in the stomach and build long-lived gastric units in vitro. Cell Stem Cell 6, 25–36 (2010).
Article CAS PubMed Google Scholar
May, C. L. & Kaestner, K. H. Gut endocrine cell development. Mol. Cell Endocrinol. 323, 70–75 (2010).
Article CAS PubMed Google Scholar
Offield, M. F. et al. PDX-1 is required for pancreatic outgrowth and differentiation of the rostral duodenum. Development 122, 983–995 (1996).
Article CAS PubMed Google Scholar
Ariyachet, C. et al. Reprogrammed stomach tissue as a renewable source of functional beta cells for blood glucose regulation. Cell Stem Cell 18, 410–421 (2016).
Article CAS PubMed PubMed Central Google Scholar
Matter, M. et al. Virus-induced polyclonal B cell activation improves protective CTL memory via retained CD27 expression on memory CTL. Eur. J. Immunol. 35, 3229–3239 (2005).
Article CAS PubMed Google Scholar
Peperzak, V., Xiao, Y., Veraar, E. A. & Borst, J. CD27 sustains survival of CTLs in virus-infected nonlymphoid tissue in mice by inducing autocrine IL-2 production. J. Clin. Invest. 120, 168–178 (2010).
Article CAS PubMed Google Scholar
Iizuka, A. et al. A T-cell-engaging B7-H4/CD3-bispecific Fab-scFv antibody targets human breast cancer. Clin. Cancer Res. 25, 2925–2934 (2019).
Article CAS PubMed Google Scholar
Wang, J. Y. & Wang, W. P. B7-H4, a promising target for immunotherapy. Cell Immunol. 347, 104008 (2020).
Article CAS PubMed Google Scholar
Islam, S., Wang, S., Bowden, N., Martin, J. & Head, R. Repurposing existing therapeutics, its importance in oncology drug development: kinases as a potential target. Br. J. Clin. Pharm. 88, 64–74 (2022).
Article CAS Google Scholar
Verbaanderd, C., Meheus, L., Huys, I. & Pantziarka, P. RepurposinG Drugs in Oncology: next Steps. Trends Cancer 3, 543–546 (2017).
Article CAS PubMed Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Article CAS PubMed Google Scholar
UniProt, C. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
Article Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Article PubMed PubMed Central Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
Article CAS PubMed Google Scholar
McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).
Article PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).
Article CAS PubMed Google Scholar
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 1, 417–425 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 14, 7 (2013).
Article Google Scholar
Brunet, J. P., Tamayo, P., Golub, T. R. & Mesirov, J. P. Metagenes and molecular pattern discovery using matrix factorization. Proc. Natl Acad. Sci. USA 101, 4164–4169 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar
Gaujoux, R. & Seoighe, C. A flexible R package for nonnegative matrix factorization. BMC Bioinforma. 11, 367 (2010).
Article Google Scholar
Grossman, R. L. et al. Toward a shared vision for cancer genomic data. N. Engl. J. Med. 375, 1109–1112 (2016).
Article PubMed PubMed Central Google Scholar
Szklarczyk, D. et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res 49, D605–D612 (2021).
Article CAS PubMed Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Chin, C. H. et al. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst. Biol. 8, S11 (2014).
Article PubMed PubMed Central Google Scholar
Yuan, L. et al. p-MEK expression predicts prognosis of patients with adenocarcinoma of esophagogastric junction (AEG) and plays a role in anti-AEG efficacy of Huaier. Pharm. Res. 165, 105411 (2021).
Article CAS Google Scholar
Yin, X. et al. Diallyl disulfide inhibits the metastasis of type esophagealgastric junction adenocarcinoma cells via NF-kappaB and PI3K/AKT signaling pathways in vitro. Oncol. Rep. 39, 784–794 (2018).
CAS PubMed Google Scholar
Boonstra, J. J. et al. Verification and unmasking of widely used human esophageal adenocarcinoma cell lines. J. Natl Cancer Inst. 102, 271–274 (2010).
Article PubMed PubMed Central Google Scholar
de Both, N. J., Wijnhoven, B. P., Sleddens, H. F., Tilanus, H. W. & Dinjens, W. N. Establishment of cell lines from adenocarcinomas of the esophagus and gastric cardia growing in vivo and in vitro. Virch. Arch. 438, 451–456 (2001).
Article Google Scholar
Mayakonda, A., Lin, D. C., Assenov, Y., Plass, C. & Koeffler, H. P. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28, 1747–1756 (2018).
Article CAS PubMed PubMed Central Google Scholar
Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220 (2017).
Article PubMed PubMed Central Google Scholar
Wiredja, D. D., Koyuturk, M. & Chance, M. R. The KSEA App: a web-based tool for kinase activity inference from quantitative phosphoproteomics. Bioinformatics 33, 3489–3491 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hornbeck, P. V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–520 (2015).
Article CAS PubMed Google Scholar
Horn, H. et al. KinomeXplorer: an integrated platform for kinome biology studies. Nat. Methods 11, 603–604 (2014).
Article CAS PubMed Google Scholar
Deutsch, E. W. et al. The ProteomeXchange consortium in 2020: enabling ‘big data’ approaches in proteomics. Nucleic Acids Res. 48, D1145–D1152 (2020).
CAS PubMed Google Scholar

Download references

Acknowledgements

This study was supported by The National Key Research and Development Program of China (2021YFA0910100 to X.C.), Zhejiang Provincial Research Center for Upper Gastrointestinal Tract Cancer (JBZX-202006 to X.C.), Medical Science and Technology Project of Zhejiang Province (WKJ-ZJ-2202 to J.Q., WKJ-ZJ-2104 to X.C.), National Natural Science Foundation of China (82074245 to X.C., 81973634 to Z.X., 81903842 to J.Q.), Natural Science Foundation of Zhejiang Province (LR21H280001 to J.Q.), Science and Technology Projects of Zhejiang Province (2019C03049 to X.C.), and Program of Zhejiang Provincial TCM Sci-tech Plan (2018ZY006 to X.C., 2020ZZ005 to J.Q.). We thank the staff of the follow-up room of Zhejiang Cancer Hospital for their support to this work.

Author information

These authors contributed equally: Shengli Li, Li Yuan.

Authors and Affiliations

Department of Gastric Surgery, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institutes of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
Shengli Li, Li Yuan, Zhi-Yuan Xu, Xiaoqing Guan, Guang-Zhao Pan, Jinyun Dong, Yi-An Du, Li-Tao Yang, Mao-Wei Ni, Rui-Bin Jiang, Xiu Zhu, Sheng-Jie Zhang, Jiang-Jiang Qin & Xiang-Dong Cheng
Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 201620, China
Shengli Li
Zhejiang Provincial Research Center for Upper Gastrointestinal Tract Cancer, Zhejiang Cancer Hospital, Hangzhou, 310022, China
Li Yuan, Zhi-Yuan Xu, Yi-An Du, Li-Tao Yang, Jiang-Jiang Qin & Xiang-Dong Cheng
Zhejiang Key Lab of Prevention, Diagnosis and Therapy of Upper Gastrointestinal Cancer, Zhejiang Cancer Hospital, Hangzhou, 310022, China
Li Yuan, Zhi-Yuan Xu, Yi-An Du, Li-Tao Yang, Jiang-Jiang Qin & Xiang-Dong Cheng
First Clinical Medical College, Zhejiang Chinese Medical University, Hangzhou, 310053, China
Jing-Li Xu, Can Hu & Han-Dong Xu
Department of Gastrointestinal Surgery, the First Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, 310006, China
Gui-Ping Chen
Biological Sample Bank, the First Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, 310006, China
Hang Lv

Authors

Shengli Li
View author publications
You can also search for this author in PubMed Google Scholar
Li Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Zhi-Yuan Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jing-Li Xu
View author publications
You can also search for this author in PubMed Google Scholar
Gui-Ping Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqing Guan
View author publications
You can also search for this author in PubMed Google Scholar
Guang-Zhao Pan
View author publications
You can also search for this author in PubMed Google Scholar
Can Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jinyun Dong
View author publications
You can also search for this author in PubMed Google Scholar
Yi-An Du
View author publications
You can also search for this author in PubMed Google Scholar
Li-Tao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Mao-Wei Ni
View author publications
You can also search for this author in PubMed Google Scholar
Rui-Bin Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Xiu Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Hang Lv
View author publications
You can also search for this author in PubMed Google Scholar
Han-Dong Xu
View author publications
You can also search for this author in PubMed Google Scholar
Sheng-Jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jiang-Jiang Qin
View author publications
You can also search for this author in PubMed Google Scholar
Xiang-Dong Cheng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.C. and J.Q. designed and supervised the project; S.L. designed and performed omics data analysis and visualization; J.Q. and X.C. designed and supervised the experiments; L.Y., Z.X., and J.X. collected clinical specimens. L.Y., G.C., X.G., and G.P. conducted in vitro experiments; L.Y., C.H., and H.X. conducted in vivo experiments. L.Y. performed experimental data analysis. J.D., Y.D., L.Y., M.N., R.J., X.Z., H.L., and S.Z. interpreted the results and commented on the paper; S.L., J.Q., and X.C. wrote the paper from comments of other authors. All listed authors discussed the results and reviewed the paper.

Corresponding authors

Correspondence to Jiang-Jiang Qin or Xiang-Dong Cheng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Frank McKeon and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Supplementary Data 10

Supplementary Data 11

Supplementary Data 12

Supplementary Data 13

Supplementary Data 14

Supplementary Data 15

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, S., Yuan, L., Xu, ZY. et al. Integrative proteomic characterization of adenocarcinoma of esophagogastric junction. Nat Commun 14, 778 (2023). https://doi.org/10.1038/s41467-023-36462-8

Download citation

Received: 09 March 2022
Accepted: 02 February 2023
Published: 11 February 2023
DOI: https://doi.org/10.1038/s41467-023-36462-8

This article is cited by

The PTM profiling of CTCF reveals the regulation of 3D chromatin structure by O-GlcNAcylation
- Xiuxiao Tang
- Pengguihang Zeng
- Junjun Ding
Nature Communications (2024)
Comparative single-cell analysis reveals heterogeneous immune landscapes in adenocarcinoma of the esophagogastric junction and gastric adenocarcinoma
- Jierong Chen
- Qunsheng Huang
- Yong Li
Cell Death & Disease (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.