Bulk and single-cell transcriptome profiling reveal the metabolic heterogeneity in gastric cancer

Tao, Guoqiang; Wen, Xiangyu; Wang, Xingxing; Zhou, Qi

doi:10.1038/s41598-023-35395-y

Article
Open access
Published: 31 May 2023

Scientific Reports volume 13, Article number: 8787 (2023) Cite this article

Abstract

Metabolic reprogramming has been defined as a key hallmark of human tumors. However, metabolic heterogeneity in gastric cancer has not been elucidated. Here we separated the TCGA-STAD dataset into two metabolic subtypes. The differences between subtypes were examined in terms of transcriptomics, genomics, tumor-infiltrating cells, and single-cell resolution. We found that metabolic subtype 1 is predominantly characterized by low metabolism and high immune cell infiltration, whereas subtype 2 is mainly characterized by high metabolism and low immune cell infiltration. At single-cell resolution, we found that the high metabolism of subtype 2 is dominated by epithelial cells. Not only epithelial cells, but also various immune cells and stromal cells showed high metabolism in subtype 2 and low metabolism in subtype 1. Our study established a classification of gastric cancer metabolic subtypes and explored the differences between subtypes from multiple dimensions, especially at single-cell resolution.

Introduction

Gastric cancer (GC) is the sixth most common cancer and the third leading cause of cancer-related deaths worldwide¹. Early screening for GC currently relies on gastroscopy; however, this examination is not yet performed annually in many regions, resulting in many GC patients being diagnosed at later stages. Treatment strategies for GC have mainly relied on clinicopathological assessments of tumors, including surgery, various chemotherapy treatments, and immunotherapies targeting immune checkpoints (ICIs). However, since GC is a heterogeneous disease, these treatments only show efficacy in some patients. Currently, the classification of gastric cancer mainly relies on AJCC staging, Lauren classification, and grade, which have limitations. Therefore, it is necessary to classify tumors according to their intrinsic heterogeneity, determine their relationship with tumor treatment, and optimize tumor treatment strategies accordingly.

Metabolic reprogramming has been considered a key hallmark of human tumors^2,3, as cancer cells require changes in their metabolic processes to meet the demands of their rapidly growing biomass and energy needs^4,5. Metabolic reprogramming plays a crucial role in various tumor processes, including tumor progression, chemotherapy resistance, immune response and epithelial–mesenchymal transition^6,7,8,9,10. Identifying distinct metabolic isoforms in cancer can aid in selecting patients for investigational metabolic inhibitors and in finding new therapeutic targets¹¹. Many previous studies have categorized tumors into different metabolic subtypes^12,13,14,15. However, tumor cells exist in a microenvironment composed of stromal cells such as tumor-associated fibroblasts, immune cells, and endothelial cells. Each cell type plays an active role in tumor cell proliferation, and each has unique metabolic needs to perform specific functions. Single-cell sequencing technology makes it possible to characterize the metabolism of the various cell types in tumor tissue at single-cell resolution.

Here, we first divided gastric cancer into two metabolic subgroups based on bulk sequencing data. We then explored the differences between the subtypes from various perspectives, including their responses to chemotherapeutic agents and immune checkpoint-targeting therapies. Finally, we explored the metabolic differences of multiple cell types across subtypes at the single-cell level.

Materials and methods

Data acquisition and processing

We systematically searched publicly available gene expression datasets for GC. Samples without complete prognosis information were removed from further evaluation. In total, 9 datasets from the Gene-Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/gds/) (GSE62254¹⁶, GSE15459¹⁷, GSE57303¹⁸, GSE34942¹⁹, GSE84437²⁰, GSE26942²¹, GSE29272²², GSE28541²¹ and GSE13861²³) and one RNA-sequencing dataset (TCGA-STAD) from The Cancer Genome Atlas (TCGA; https://portal.gdc.cancer.gov/) were found. Because many of the datasets were generated on the GPL570 platform, four of them (GSE62254, GSE15459, GSE57303 and GSE34942) were merged into one dataset, named the GPL570 meta-dataset, using the “oligo” package in R²⁴. We used the oligo package for quality control analysis and the “ComBat” function in R to remove the batch effect (Fig. S1)²⁵. GSE29272, GSE28541 and GSE13861 were removed due to small sample size. All microarray data included in our study were log2 transformed. Count-level expression data and clinical data for TCGA-STAD were downloaded using the “TCGAbiolinks” package in R²⁶, and the counts were converted into transcripts per million (TPM) values. TCGA-STAD somatic mutation profiles and DNA methylation profiles from the Illumina HumanMethylation450 platform were also downloaded using “TCGAbiolinks”. Somatic mutation data were analysed using the R package “maftools”²⁷. The chi-square test was used to assess the mutational difference between the two groups. Methylation profiles were analysed using the R package “ChAMP”²⁸. Beta (β) values were calculated to assess the methylation level of each CpG site in each sample. A β value greater than 0.6 is generally considered fully methylated, 0.2–0.6 partially methylated, and less than 0.2 completely unmethylated. For the analysis of differentially methylated probes (DMPs), we first removed CpG sites that were either fully methylated in both clusters or fully unmethylated in both clusters. The |diffBeta| threshold was set at 0.15.
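The β-value categories and the |diffBeta| filter described above can be sketched as follows. This is a minimal illustration in plain Python, not the ChAMP workflow used in the study. The function names, probe IDs and toy β values are hypothetical, and two details are assumptions: each cluster is summarized by its mean β value, and values exactly at 0.2 or 0.6 are treated as partially methylated.

```python
from statistics import mean


def methylation_state(beta):
    """Classify a beta value with the thresholds quoted in the text."""
    if beta > 0.6:
        return "fully methylated"
    if beta < 0.2:
        return "unmethylated"
    return "partially methylated"  # assumption: boundaries count as partial


def differential_probes(cluster1, cluster2, min_diff=0.15):
    """Return {probe: diffBeta} for probes that pass both filters.

    cluster1 and cluster2 map probe IDs to lists of per-sample beta values.
    A probe is dropped when its mean beta is fully methylated in both
    clusters or unmethylated in both clusters, or when |diffBeta| < min_diff.
    """
    kept = {}
    for probe in cluster1:
        m1, m2 = mean(cluster1[probe]), mean(cluster2[probe])
        s1, s2 = methylation_state(m1), methylation_state(m2)
        if s1 == s2 and s1 != "partially methylated":
            continue  # uninformative: same extreme state in both clusters
        diff = m1 - m2
        if abs(diff) >= min_diff:
            kept[probe] = diff
    return kept


# Toy data (hypothetical probe IDs and values)
c1 = {"cg_a": [0.9, 0.8], "cg_b": [0.1, 0.2], "cg_c": [0.5, 0.4]}
c2 = {"cg_a": [0.95, 0.85], "cg_b": [0.5, 0.6], "cg_c": [0.45, 0.4]}
result = differential_probes(c1, c2)
```

In this toy input only `cg_b` survives: `cg_a` is fully methylated in both clusters, and `cg_c` differs by less than 0.15.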

Single-cell sequencing data from GSE183904 were selected for further analysis²⁹. This is the largest single-cell sequencing study of gastric cancer to date. Because of the large sample size, the full dataset exceeded the working limit of our equipment. Therefore, we divided the gastric cancer samples into three subgroups according to Lauren classification and randomly selected one third of the samples from each group. This yielded 8 gastric cancer single-cell sequencing samples, on which all subsequent single-cell analysis was based. For each sample, we retained genes/features detected in three or more cells and cells expressing 300 or more features. Cells with a mitochondrial RNA percentage of > 20% were filtered out. We used the “DoubletFinder” package to remove doublets³⁰. Tumor specimens inevitably also contain normal epithelial cells, so the “CopyKAT” package was used to predict malignant cells among the epithelial cells³¹.
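The three quality-control thresholds above can be illustrated with a small standard-library sketch. It is not the Seurat code used in the study. The function name `qc_filter`, the toy counts, the use of the `MT-` gene-symbol prefix to identify mitochondrial genes, and the order in which the filters are applied are all assumptions made for illustration.

```python
def qc_filter(cells, min_cells=3, min_features=300, max_mito_pct=20.0):
    """Filter a {cell_id: {gene: count}} mapping.

    Keeps genes detected in at least `min_cells` cells, then keeps cells
    with at least `min_features` detected (kept) genes and a mitochondrial
    read percentage of at most `max_mito_pct`.
    """
    # Count the number of cells in which each gene is detected.
    detected = {}
    for counts in cells.values():
        for gene, n in counts.items():
            if n > 0:
                detected[gene] = detected.get(gene, 0) + 1
    keep_genes = {g for g, k in detected.items() if k >= min_cells}

    filtered = {}
    for cell_id, counts in cells.items():
        total = sum(counts.values())
        if total == 0:
            continue  # empty droplet, nothing to keep
        # Assumption: mitochondrial genes carry the "MT-" symbol prefix.
        mito = sum(n for g, n in counts.items() if g.startswith("MT-"))
        mito_pct = 100.0 * mito / total
        n_features = sum(1 for g, n in counts.items() if n > 0 and g in keep_genes)
        if n_features >= min_features and mito_pct <= max_mito_pct:
            filtered[cell_id] = {g: n for g, n in counts.items() if g in keep_genes}
    return filtered


# Toy example with a lowered feature threshold so four tiny cells suffice.
toy = {
    "c1": {"A": 5, "B": 1, "MT-CO1": 1},
    "c2": {"A": 2, "B": 3, "MT-CO1": 1},
    "c3": {"A": 1, "MT-CO1": 8},   # mostly mitochondrial reads
    "c4": {"A": 4, "B": 2, "C": 1},  # gene C is seen in one cell only
}
kept = qc_filter(toy, min_cells=3, min_features=2, max_mito_pct=20.0)
```

With the default arguments the function applies the thresholds quoted in the text (three cells, 300 features, 20%); the toy call lowers `min_features` only so that the example stays small.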

Metabolic subgroup classification

The metabolite–protein interaction profile was obtained from a previous study¹². After processing the profile, we imported it into Cytoscape software (version 3.6.1) to extract the proteins with a degree greater than 5³² (details in Table S1). This step yielded 1202 metabolism-related genes. To better capture the malignant metabolic characteristics of tumors, we extracted the metabolic genes related to prognosis using the “survival” package (P < 0.05). Consensus clustering with 1000 iterations and 80% resampling was performed on the expression levels of these genes using the “ConsensusClusterPlus” package³³. We used the cumulative distribution function (CDF) plot and the proportion of ambiguous clustering (PAC) to determine the best K value.
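As an illustration of how PAC can guide the choice of K, the sketch below computes PAC from a consensus matrix as CDF(upper) − CDF(lower) over the off-diagonal entries and picks the K with the lowest value. The bounds 0.1 and 0.9 are a commonly used choice and are an assumption here, since the text does not state them. The function names and the toy matrices are hypothetical, and the consensus matrices themselves would come from a tool such as ConsensusClusterPlus.

```python
def consensus_cdf(matrix, c):
    """Fraction of sample pairs whose consensus index is <= c."""
    n = len(matrix)
    values = [matrix[i][j] for i in range(n) for j in range(i + 1, n)]
    return sum(1 for v in values if v <= c) / len(values)


def pac(matrix, lower=0.1, upper=0.9):
    """Proportion of ambiguous clustering: CDF(upper) - CDF(lower)."""
    return consensus_cdf(matrix, upper) - consensus_cdf(matrix, lower)


def best_k(consensus_by_k, lower=0.1, upper=0.9):
    """Return the K whose consensus matrix has the lowest PAC."""
    return min(consensus_by_k, key=lambda k: pac(consensus_by_k[k], lower, upper))


# Toy consensus matrices for 4 samples (hypothetical values).
# K = 2: every pair is either always or never co-clustered.
k2 = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
# K = 3: two pairs are co-clustered only some of the time.
k3 = [
    [1.0, 0.5, 0.0, 0.0],
    [0.5, 1.0, 0.4, 0.0],
    [0.0, 0.4, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
chosen = best_k({2: k2, 3: k3})
```

A low PAC means that few sample pairs have an intermediate consensus index, which corresponds to a flat middle segment in the CDF plot.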

The single-cell sequencing data were not accompanied by bulk sequencing. Bulk sequencing can be thought of as measuring the total expression of each gene across all cells in a tumor tissue. Therefore, after quality control of the single-cell expression matrix, the average expression of each gene across all cells in a sample was calculated to estimate that sample's bulk-level expression. We built a metabolic subtype classifier in the TCGA-STAD dataset, which was randomly divided into a training dataset and a test dataset at a ratio of 7:3. The metabolic classification model was trained on the prognosis-related metabolic genes in the training dataset using the support vector machine (SVM) algorithm and validated in the test dataset. The single-cell samples were then classified with this model.
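The pseudo-bulk averaging and the 7:3 split described above can be sketched with the standard library alone. The SVM itself is not reproduced here. The function names, the seed, the sample IDs and the toy expression values are hypothetical, and treating a gene that is missing from a cell as zero expression is an assumption.

```python
import random


def pseudo_bulk(cells):
    """Average each gene over all cells of one sample.

    `cells` is a list of {gene: expression} dicts; a gene missing from a
    cell is treated as 0.0 (assumption).
    """
    if not cells:
        raise ValueError("sample contains no cells")
    genes = set().union(*cells)
    n = len(cells)
    return {g: sum(c.get(g, 0.0) for c in cells) / n for g in sorted(genes)}


def split_train_test(sample_ids, train_parts=7, test_parts=3, seed=0):
    """Randomly split sample IDs at a train_parts:test_parts ratio."""
    ids = sorted(sample_ids)
    random.Random(seed).shuffle(ids)
    cut = len(ids) * train_parts // (train_parts + test_parts)
    return ids[:cut], ids[cut:]


# Toy usage (hypothetical values)
profile = pseudo_bulk([{"A": 2.0, "B": 0.0}, {"A": 4.0}])
train, test = split_train_test([f"S{i}" for i in range(10)])
```

The resulting per-sample profile has the same shape as one column of a bulk expression matrix, which is what allows a classifier trained on bulk data to be applied to it.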

Pathway enrichment analyses

We downloaded the latest Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway data using the R package “KEGGREST” and generated the required “gmt” format file^34,35,36. We downloaded the GSEA software (version 4.3) from the gene set enrichment analysis website (GSEA: http://software.broadinstitute.org/gsea/index.jsp). A nominal (NOM) p-value < 0.05 was considered statistically significant. Single sample gene set enrichment analysis (ssGSEA), implemented in the “GSVA” package, was used to estimate the KEGG pathway enrichment level in a single sample or cell³⁷.
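For readers unfamiliar with the “gmt” format, each line holds one gene set as tab-separated fields: the set name, a description, and then the member genes. The sketch below formats a pathway-to-genes mapping in that layout. It does not query KEGG, and the function name, the “NA” description placeholder and the short toy gene lists are assumptions for illustration only.

```python
def to_gmt_lines(pathways, description="NA"):
    """Format {pathway_name: [gene, ...]} as GMT lines.

    Each line is: name <TAB> description <TAB> gene1 <TAB> gene2 ...
    Duplicate genes are removed and genes are sorted for reproducibility.
    """
    lines = []
    for name in sorted(pathways):
        genes = sorted(set(pathways[name]))
        lines.append("\t".join([name, description] + genes))
    return lines


# Toy gene sets (deliberately incomplete gene lists)
toy_pathways = {
    "hsa00020": ["CS", "IDH1"],
    "hsa00010": ["PKM", "HK1", "HK1"],
}
gmt_lines = to_gmt_lines(toy_pathways)
```

Writing the file is then a matter of joining `gmt_lines` with newlines and saving the result with a `.gmt` extension.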

Evaluation of infiltrating immune cells in the TME

The proportions of 22 immune cell types in GC samples were estimated using the CIBERSORT algorithm (https://cibersortx.stanford.edu/) with B-mode batch correction, relative mode and 1000 permutations³⁸. Stromal and immune cell content in tumor tissues was estimated with the ESTIMATE algorithm³⁹. The Wilcoxon test was used to identify immune cell types that differed significantly between groups.

Metabolic subtype characteristic score construction

Based on the consensus clustering, we classified patients into two clusters. Differentially expressed genes (DEGs) were determined using the “DESeq2” package⁴⁰. The significance criteria for DEGs were set as a false discovery rate (FDR) < 0.01 and |log2 fold change (FC)| > 1.0. The same thresholds were applied to the differential expression analysis between cluster 1 and cluster 2 that accompanied the methylation analysis. TCGA-STAD, the GPL570 meta-dataset, GSE84437 and GSE26942 were Z-score transformed for subsequent analysis. To remove the effect of genes expressed at low levels, we removed genes with an average TPM value of less than 2 across all samples. Then, LASSO-Cox regression analysis based on the “glmnet” package in R was applied to build an optimal metabolism classification-related gene signature for GC⁴¹. The metabolic subtype characteristic score of each sample was defined from the relative expression of each signature gene and its associated Cox coefficient. The optimal cutoff value was determined with the “maxstat” package. Patients were divided into a high-score group and a low-score group, and the Kaplan–Meier (KM) method with the log-rank test was used to analyze the prognostic differences between the two groups. The prognostic or predictive accuracy of the gene signature was assessed using time-dependent receiver operating characteristic (ROC) analysis, with the area under the curve (AUC) at different cutoff times as the measure of accuracy. The model was then validated on three additional independent datasets.
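The score described above is commonly computed as a coefficient-weighted sum of expression values, score = Σ (coefficient × expression). The sketch below assumes that form, along with a Z-score transformation that uses the sample standard deviation; neither detail is spelled out in the text. The gene names, coefficients and cutoff are hypothetical placeholders, not values from the study.

```python
from statistics import mean, stdev


def zscore(values):
    """Z-score transform a list of values (sample standard deviation)."""
    mu, sd = mean(values), stdev(values)
    if sd == 0:
        return [0.0] * len(values)
    return [(v - mu) / sd for v in values]


def signature_score(expression, coefficients):
    """Weighted sum of a sample's expression over the signature genes."""
    return sum(coef * expression[gene] for gene, coef in coefficients.items())


def assign_group(score, cutoff):
    """Label a sample relative to a cutoff (for example one chosen by maxstat)."""
    return "high" if score > cutoff else "low"


# Toy usage with hypothetical genes and coefficients
z = zscore([1.0, 2.0, 3.0])
score = signature_score({"G1": 1.0, "G2": -2.0}, {"G1": 0.5, "G2": 0.25})
group = assign_group(score, cutoff=-0.1)
```

Here the toy score is 0.5 × 1.0 + 0.25 × (−2.0) = 0.0, which lies above the placeholder cutoff and is therefore labeled "high".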

Additional bioinformatic and statistical analyses

The half maximal inhibitory concentration (IC50) was estimated using the R package “pRRophetic”⁴². The Connectivity Map (CMap, https://clue.io/) was used to predict candidate small molecules based on differentially expressed genes. The TIDE algorithm was used to predict ICB responses (http://tide.dfci.harvard.edu)⁴³. All of the above analyses were performed using the R software (version 4.0.2, http://www.rproject.org). Unless otherwise stated, statistical significance was set at P < 0.05.

Results

Metabolism-associated genes and subtypes identification

The 98 metabolism-related genes obtained (their origin is described in the “Materials and methods” section) were significantly enriched in metabolic pathways, including arachidonic acid metabolism, chemical carcinogenesis, drug metabolism - other enzymes, and pentose and glucuronate interconversions (Fig. 1I). Based on the expression values of these 98 genes, we divided the TCGA-STAD cohort into two clusters, with an optimal k of 2 (Figs. 1A, S2). Kaplan–Meier analysis revealed a significant prognostic difference between the two metabolic subtypes, with patients in cluster 1 showing inferior overall survival (OS) (Fig. 1E). Importantly, our clustering result was also verified in the other three cohorts (GPL570 meta-dataset, GSE26942 and GSE84437) (Fig. 1B–D), and the difference in prognosis was also observed in these three cohorts (Fig. 1F–H). These results confirmed the metabolic heterogeneity of gastric cancer and its prognostic significance.

Metabolic classifier-specific gene enrichment pathway of gastric cancers

To understand the metabolic and functional differences between the metabolic subtypes, we performed ssGSEA and GSEA analyses. First, pathway scores for all patients were obtained with the ssGSEA algorithm. We found that 4 of the 5 signaling pathways related to the regulation of tumor metabolism differed between the 2 clusters, which may explain the source of the metabolic differences between them (Fig. 2A). Additionally, 7 of the 10 oncogenic signaling pathways differed significantly between the two clusters, demonstrating dissimilar tumor characteristics between the groups (Fig. 2B).

We then used the GSEA method to analyze the metabolic subtype-specific KEGG pathway. Both subtypes had a considerable number of specific signaling pathways (details in Table S2). Among the metabolic pathways, 3 pathways were significantly enriched in cluster 1, while 30 pathways were enriched in cluster 2 (Fig. 2C). In cluster 1, multiple intercellular communication-related signaling pathways were activated (Table S2). In addition to the activation of a considerable number of metabolic pathways, cluster 2 also showed activation of nucleotide processing and repair-related pathways (Table S2).

Metabolic classifier-specific mutation, DNA methylation and immune cell infiltration characteristics of gastric cancers

Oncogene mutations have been shown to induce reprogramming of cell-autonomous metabolism. To investigate whether the two metabolic subtypes also differ at the genomic level, we analyzed somatic mutations in the TCGA-STAD cohort. The top 20 most frequently mutated genes in the two subtypes are shown in Fig. 3A. We also analyzed genes that were differentially mutated between the two subtypes (Fig. 3B). The top 5 most frequently mutated genes in gastric cancer patients were TTN, TP53, MUC16, LRP1B and SYNE1 (Fig. 3A). Among them, TTN and SYNE1 differed between the two metabolic subtypes (Fig. 3B). The top 20 differentially mutated genes were more frequently mutated in cluster 2, indicating that the formation of metabolic subtypes may be related to gene mutations. We found that the chromosomal instability (CIN) and Epstein–Barr virus (EBV) subtypes were distributed consistently between cluster 1 and cluster 2, but cluster 1 contained more of the genomically stable (GS) subtype, while cluster 2 contained more of the microsatellite instability (MSI) subtype (Fig. 3C). Cluster 2 also had significantly higher tumor mutational burden (TMB) levels (Fig. 3D).

We removed CpG sites that have no corresponding gene, as well as CpG sites that were completely unmethylated or fully methylated in both metabolic subtypes, leaving 71,648 CpG sites with P < 0.05. After applying a deltaBeta threshold of 0.15 to further screen for differential CpG sites, we identified 1431 hypomethylated genes in cluster 1 and 2410 hypomethylated genes in cluster 2. Among the hypomethylated genes in cluster 1, 480 genes were highly expressed, while among those in cluster 2, only 25 genes were highly expressed. The results of the enrichment analysis of these genes are shown in Fig. 3E,F. Interestingly, genes with hypomethylation and high expression in cluster 1 were enriched in oncogenic signaling pathways, suggesting that abnormal methylation may lead to the activation of oncogenic signaling pathways in cluster 1. Genes with hypomethylation and high expression in cluster 2 were enriched in the HIF-1 signaling pathway, indicating that cluster 2 characteristics may be associated with hypoxia.

Interactions between tumor cells and surrounding infiltrating cells, especially stromal cells and immune cells, can either promote or inhibit tumor progression⁴⁴. We calculated stromal and immune scores using the ESTIMATE algorithm. The three scores were significantly higher in cluster 1 than in cluster 2 (Fig. 3G), suggesting that the samples in cluster 1 had more non-tumor cell components. To further explore the differences in immune cell composition, the CIBERSORT algorithm was used to assess the composition of 22 immune cell types in the TCGA-STAD cohort (Fig. 3H). Samples for which CIBERSORT generated P-values greater than 0.05 were removed. Only 7 immune cell types (T cells CD4 memory activated, T cells follicular helper, T cells regulatory (Tregs), Macrophages M0, Dendritic cells activated, Eosinophils and Neutrophils) did not differ between the two groups. All of the differing immune cell types, except NK cells resting and Mast cells activated, were more abundant in cluster 1. These results indicate that the tumor immune microenvironment may be associated with gastric cancer metabolic subtypes.

Metabolic subtype-associated treatment strategy for gastric cancer

To assess the association of metabolic subtypes with immunotherapy, we applied Tumor Immune Dysfunction and Exclusion (TIDE; http://tide.dfci.harvard.edu/) to the TCGA-STAD cohort. The cluster 1 subgroup had higher TIDE scores, indicating a higher probability of immune escape and thus a lower benefit from immunotherapy (Fig. 4A). We also compared the T cell dysfunction scores and T cell exclusion scores between the two subgroups, which supported this conclusion (Fig. 4B,C). These analyses suggest that the cluster 2 subgroup may respond better to immunotherapy.

The findings above showed that metabolic subtypes were associated with drug metabolism signaling pathways, which led us to explore their potential as a marker for predicting drug response. The Cancer Genome Project (CGP) database was used to predict chemotherapeutic response. We found 5 drugs commonly used in gastric cancer chemotherapy in the CGP database, and the estimated IC50 values of these 5 drugs differed significantly between the two subgroups (Fig. 4D–H). Patients with metabolic subtype 2 were more sensitive to the anticancer drugs 5-fluorouracil, docetaxel, mitomycin C and paclitaxel. Patients with metabolic subtype 1 were more sensitive to cisplatin.

Furthermore, we screened the CMap database for small-molecule drugs with therapeutic effects on gastric cancer, based on differentially expressed genes between the two metabolic subtypes. As a result, we identified three potential small molecule drugs for gastric cancer (dimercaptosuccinic-acid, lapatinib, tracazolate).

Metabolic subtype-associated signature is a prognostic indicator for gastric cancer

Multiple datasets demonstrated significant prognostic differences between the two metabolic subtypes. Therefore, we explored whether a metabolic subtype-related signature could be used to predict patient outcomes. First, we performed differential analysis to obtain a list of differentially expressed genes between the metabolic subtypes. Subsequently, the LASSO Cox algorithm, with an optimal λ value of 0.07, was applied to the Z-score-transformed expression profiles of the differentially expressed genes to identify the most robust prognostic genes (Fig. S3A,B). KM analysis revealed that patients with a low metabolic subtype-associated signature score had a prominent survival benefit (log-rank test, p = 5.7e−7; Fig. 5A). Furthermore, ROC analysis showed that this model can accurately predict patient survival time (Fig. 5B). We also performed KM analysis on these 11 genes separately, and found that the expression levels of nine genes were associated with the prognosis of gastric cancer patients (Fig. S4A–K).

In order to validate the stability of the model, we performed analyses in three additional cohorts. The results revealed that the low metabolic subtype-associated signature score group still had better survival in the three cohorts (GPL570 meta-dataset: HR = 1.74, 95% CI = 1.39–2.19, p = 1.2e−6; GSE26942: HR = 2.48, 95% CI = 1.63–3.77, p = 1.2e−5; GSE84437: HR = 1.61, 95% CI = 1.14–2.28, p = 6.9e−3; Fig. 5E–G).

Metabolic features of epithelial cells in the single-cell resolution

Tumor tissue contains a variety of non-tumor cells that play an important role. Therefore, we aimed to explore the differences between metabolic subtypes at single-cell resolution. Among the eight gastric cancer samples analyzed, four were intestinal-type, two were diffuse-type, and one was mixed-type (Table S3). After quality control, 16,397 cells remained, and we normalized the count data using the “LogNormalize” method built into the "Seurat" package. Subsequently, we used a classification model to assign a metabolic subtype to each patient with single-cell sequencing data. First, we calculated a matrix of the average expression values of all genes across all cells of each patient. We divided TCGA-STAD into a training set and a test set, and the SVM classification model showed high accuracy in both (training set: AUC = 0.9498, 95% CI = 0.9268–0.9728; test set: AUC = 0.9231, 95% CI = 0.8777–0.9684; Fig. 6A,B). We then classified the 8 patients into 2 metabolic subgroups according to this model (details in Table S4). The expression data used in the SVM analysis were Z-score transformed. The ssGSEA algorithm was used to assess the relevant KEGG signaling pathway levels in the 8 single-cell sequencing samples. The results were consistent with those obtained from bulk sequencing: the pathways elevated in the bulk cluster 1 subtype were activated in the single-cell cluster 1 subtype (Fig. S5A), and the same held for the pathways enriched in the cluster 2 subtype (Fig. S5B). Although the p-values were not significant because of the small sample size, Fig. S5A,B showed a clear trend, which supports the robustness of the single-cell sample classification.

All cells were annotated into 5 cell types based on relevant markers (detailed markers in Table S5, Fig. 6D): epithelial cells, T cells, B cells, stromal cells and myeloid cells. The number of cells of each type in the two metabolic subtypes is shown in Fig. 6C. The ssGSEA algorithm was used to assess the relevant KEGG signaling pathway levels in the 16,397 cells. We asked whether epithelial cells dominate the metabolic signaling pathways associated with the metabolic subtypes. Three metabolic signaling pathways were mainly enriched in metabolic subtype 1, but only one was dominated by epithelial cells (Fig. 6E). Interestingly, the majority of the 30 metabolic signaling pathways enriched in metabolic subtype 2 were dominated by epithelial cells (Fig. 6F–H).

To make the results more credible, we screened 1144 malignant epithelial cells from epithelial cells using the “CopyKAT” algorithm. GSEA analysis was used to analyze the metabolic heterogeneity of various cells in different metabolic subtypes. Six metabolic pathways were enriched in malignant epithelial cells of metabolic subtype 1, including the Glycosaminoglycan biosynthesis—chondroitin sulfate/dermatan sulfate pathway, which was consistent with bulk sequencing analysis (Fig. 6I). Then, 15 metabolic pathways were enriched in malignant epithelial cells of metabolic subtype 2, of which 8 were also enriched in subtype 1 of the bulk data (Fig. 6I). In addition, we analyzed the metabolic pathways of normal epithelial cells and malignant epithelial cells. The results indicated that normal epithelial cells did not show any significant enrichment of metabolic pathways, while malignant epithelial cells exhibited activation of 57 metabolic pathways (Table S6). Taken together, our findings highlight the metabolic heterogeneity of malignant cells.

Metabolic features of non-epithelial cells in the single-cell resolution

The non-epithelial cells were classified into 2 metabolic subtypes, based on their metabolic pathways. The four types of non-epithelial cells have both specific and shared metabolic pathways (Fig. 7A–D). In metabolic subtype 2, non-epithelial cells showed stable activation of the central carbon metabolism in cancer pathway and the lysine degradation pathway. In contrast, only a small number of metabolic pathways were significantly enriched in metabolic subtype 1, specifically in B cells and stromal cells (Fig. 7B,C). These findings indicate that not only epithelial cells but also all non-epithelial cell types exhibit the abundant metabolic activity that characterizes metabolic subtype 2.

Discussion

Due to the heterogeneity of gastric cancer, the existing treatment methods are inevitably ineffective for some patients. Therefore, it is urgent to classify gastric cancer patients based on the existing data to discover potential subtypes of gastric cancer and facilitate personalized treatment. Metabolic reprogramming of tumor cells is required for tumorigenesis and progression. Tumor cells autonomously alter their phenotype through various metabolic pathways to meet increased energetic and biosynthetic demands. Therefore, we classified gastric cancer patients into two subtypes based on metabolic gene expression and conducted a detailed analysis of their differences from the genomic, epigenetic, and single-cell dimensions.

Our analysis classified gastric cancer patients into two metabolic subgroups, with cluster 1 consistently indicating a worse prognosis. We found that metabolic subtype cluster 1 was characterized by low metabolism, while cluster 2 was characterized by abundant and higher metabolism. The pentose phosphate pathway, the glycolysis/gluconeogenesis pathway and the citrate cycle (TCA cycle) maintained high levels in cluster 2. In addition, cluster 2 was enriched in multiple amino acid metabolism-related pathways, which also indicates its high metabolic activity. Analysis of 5 tumor metabolism-related pathways may partially explain the metabolic differences between the two clustered subtypes^45,46,47,48,49. More of the 10 tumor-related pathways were highly activated in cluster 1, indicating that cluster 1 had a higher degree of malignancy and therefore a shorter survival time.

In the genomic analysis, we observed that cluster 2 exhibited a higher incidence of gene mutations, which is consistent with the activation of several nucleotide processing and repair-related pathways in cluster 2 described above. Epigenetic analysis showed that the hypomethylated genes in cluster 1 were mostly related to oncogenic signaling pathways, leading us to believe that the high malignancy of cluster 1 may be related to gene hypomethylation. Both CIBERSORT and ESTIMATE analyses demonstrated more abundant immune cell infiltration in cluster 1. However, naive cells and memory cells, as well as various immune cells with immunosuppressive effects, were more abundant in cluster 1, whereas the activated cells that exert anti-tumor effects did not differ between the two groups. This may explain why cluster 1 had high immune cell infiltration but shorter survival time.

Given the notable prognostic differences between the two metabolic subtypes, we explored subtype-specific genes as potential prognostic markers. The 6-gene prognostic model performed well not only in the TCGA-STAD cohort but also in three other independent cohorts.

Based on the two metabolic subtypes, we explored subtype-specific treatment strategies. We predicted the therapeutic effects of 5 common chemotherapeutic agents in the different metabolic subtypes. According to the IC50 estimates, cluster 1 patients were more sensitive to cisplatin, while cluster 2 patients were more sensitive to 5-fluorouracil, docetaxel, mitomycin C and paclitaxel. This information may help medical professionals select a suitable chemotherapy regimen for their patients more precisely. TIDE analysis suggested that cluster 2 patients may benefit more from immunotherapy than cluster 1 patients.

In addition, we explored the metabolic differences of various cells between the metabolic subtypes at the single-cell level. Most subtype-related metabolic pathways were dominated by epithelial cells. To strengthen the validity of our findings, we also separated malignant epithelial cells from the other epithelial cells and found that metabolism was higher in the malignant epithelial cells. Analysis of the five major cell types (epithelial cells, T cells, B cells, stromal cells and myeloid cells) then indicated that the various cells in cluster 2 are at a high metabolic level, whereas the cells in cluster 1 are at a relatively low metabolic level. Therefore, the cluster 1 subtype is a metabolically indolent gastric cancer. The low metabolic level of immune cells in cluster 1 may be associated with its poor prognosis and low immunotherapy response.

Nonetheless, our research has some limitations. Specifically, we did not investigate metabolic subtypes at the protein level, which could be an area for future research.

Conclusion

We provide a new perspective on the heterogeneity of gastric cancer from a metabolic point of view, and we reveal the characteristics of the metabolic subtypes at the genomic, DNA methylation and single-cell levels.

Data availability

The data that support the findings of this study are available in GEO (https://www.ncbi.nlm.nih.gov/geo/, GSE62254, GSE15459, GSE57303, GSE34942, GSE84437, GSE26942 and GSE183904), TCGA (https://portal.gdc.cancer.gov/repository, TCGA-STAD), and the Supporting Information.

Abbreviations

GC: Gastric cancer
ICIs: Immune checkpoints targeting immunotherapies
GEO: Gene-Expression Omnibus
TCGA: The Cancer Genome Atlas
TPM: Transcripts per million
DMPs: Differentially methylated probes
SVM: Support vector machines
KEGG: Kyoto Encyclopedia of Genes and Genomes
GSEA: Gene set enrichment analysis
ssGSEA: Single sample gene set enrichment analysis
FDR: False discovery rate
KM: Kaplan–Meier
ROC: Receiver operating characteristic
AUC: The area under the curve
IC50: The half maximal inhibitory concentration
OS: Overall survival
TIDE: Tumor Immune Dysfunction and Exclusion
CGP: The Cancer Genome Project

References

Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249. https://doi.org/10.3322/caac.21660 (2021).
Gentric, G., Mieulet, V. & Mechta-Grigoriou, F. Heterogeneity in cancer metabolism: New concepts in an old field. Antioxid. Redox Signal. 26, 462–485. https://doi.org/10.1089/ars.2016.6750 (2017).
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144, 646–674. https://doi.org/10.1016/j.cell.2011.02.013 (2011).
Vander Heiden, M. G. & DeBerardinis, R. J. Understanding the intersections between metabolism and cancer biology. Cell 168, 657–669. https://doi.org/10.1016/j.cell.2016.12.039 (2017).
Pavlova, N. N. & Thompson, C. B. The emerging hallmarks of cancer metabolism. Cell Metab. 23, 27–47. https://doi.org/10.1016/j.cmet.2015.12.006 (2016).
Sciacovelli, M. & Frezza, C. Metabolic reprogramming and epithelial-to-mesenchymal transition in cancer. FEBS J. 284, 3132–3144. https://doi.org/10.1111/febs.14090 (2017).
Hua, W., Ten Dijke, P., Kostidis, S., Giera, M. & Hornsveld, M. TGFβ-induced metabolic reprogramming during epithelial-to-mesenchymal transition in cancer. Cell Mol. Life Sci. 77, 2103–2123. https://doi.org/10.1007/s00018-019-03398-6 (2020).
Xia, L. et al. The cancer metabolic reprogramming and immune response. Mol. Cancer 20, 28. https://doi.org/10.1186/s12943-021-01316-8 (2021).
Biswas, S. K. Metabolic reprogramming of immune cells in cancer progression. Immunity 43, 435–449. https://doi.org/10.1016/j.immuni.2015.09.001 (2015).
Boroughs, L. K. & DeBerardinis, R. J. Metabolic pathways promoting cancer cell survival and growth. Nat. Cell Biol. 17, 351–359. https://doi.org/10.1038/ncb3124 (2015).
Jain, M. et al. Metabolite profiling identifies a key role for glycine in rapid cancer cell proliferation. Science 336, 1040–1044. https://doi.org/10.1126/science.1218595 (2012).
Chen, D. et al. Identification and characterization of robust hepatocellular carcinoma prognostic subtypes based on an integrative metabolite-protein interaction network. Adv. Sci. 8, e2100311. https://doi.org/10.1002/advs.202100311 (2021).
Yu, T. J. et al. Bulk and single-cell transcriptome profiling reveal the metabolic heterogeneity in human breast cancers. Mol. Ther. 29, 2350–2365. https://doi.org/10.1016/j.ymthe.2021.03.003 (2021).
Daemen, A. et al. Metabolite profiling stratifies pancreatic ductal adenocarcinomas into subtypes with distinct sensitivities to metabolic inhibitors. Proc. Natl. Acad. Sci. USA 112, E4410–E4417. https://doi.org/10.1073/pnas.1501605112 (2015).
Gentric, G. et al. PML-regulated mitochondrial metabolism enhances chemosensitivity in human ovarian cancers. Cell Metab. 29, 156–173.e10. https://doi.org/10.1016/j.cmet.2018.09.002 (2019).
Cristescu, R. et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat. Med. 21, 449–456. https://doi.org/10.1038/nm.3850 (2015).
Ooi, C. H. et al. Oncogenic pathway combinations predict clinical prognosis in gastric cancer. PLoS Genet. 5, e1000676. https://doi.org/10.1371/journal.pgen.1000676 (2009).
Qian, Z. et al. Whole genome gene copy number profiling of gastric cancer identifies PAK1 and KRAS gene amplification as therapy targets. Genes Chromosom. Cancer 53, 883–894. https://doi.org/10.1002/gcc.22196 (2014).
Chia, N. Y. et al. Regulatory crosstalk between lineage-survival oncogenes KLF5, GATA4 and GATA6 cooperatively promotes gastric cancer development. Gut 64, 707–719. https://doi.org/10.1136/gutjnl-2013-306596 (2015).
Yoon, S. J. et al. Deconvolution of diffuse gastric cancer and the suppression of CD34 on the BALB/c nude mice model. BMC Cancer 20, 314. https://doi.org/10.1186/s12885-020-06814-4 (2020).
Oh, S. C. et al. Clinical and genomic landscape of gastric cancer with a mesenchymal phenotype. Nat. Commun. 9, 1777. https://doi.org/10.1038/s41467-018-04179-8 (2018).
Wang, G. et al. Comparison of global gene expression of gastric cardia and noncardia cancers from a high-risk population in China. PLoS ONE 8, e63826. https://doi.org/10.1371/journal.pone.0063826 (2013).
Cho, J. Y. et al. Gene expression signature-based prognostic risk score in gastric cancer. Clin. Cancer Res. 17, 1850–1857. https://doi.org/10.1158/1078-0432.Ccr-10-2180 (2011).
Carvalho, B. S. & Irizarry, R. A. A framework for oligonucleotide microarray preprocessing. Bioinformatics 26, 2363–2367. https://doi.org/10.1093/bioinformatics/btq431 (2010).
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883. https://doi.org/10.1093/bioinformatics/bts034 (2012).
Colaprico, A. et al. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44, e71. https://doi.org/10.1093/nar/gkv1507 (2016).
Mayakonda, A., Lin, D. C., Assenov, Y., Plass, C. & Koeffler, H. P. Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28, 1747–1756. https://doi.org/10.1101/gr.239244.118 (2018).
Morris, T. J. et al. ChAMP: 450k chip analysis methylation pipeline. Bioinformatics 30, 428–430. https://doi.org/10.1093/bioinformatics/btt684 (2014).
Kumar, V. et al. Single-cell atlas of lineage states, tumor microenvironment, and subtype-specific expression programs in gastric cancer. Cancer Discov. 12, 670–691. https://doi.org/10.1158/2159-8290.Cd-21-0683 (2022).
McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: Doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 8, 329–337.e4. https://doi.org/10.1016/j.cels.2019.03.003 (2019).
Gao, R. et al. Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes. Nat. Biotechnol. 39, 599–608. https://doi.org/10.1038/s41587-020-00795-2 (2021).
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. https://doi.org/10.1101/gr.1239303 (2003).
Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics 26, 1572–1573. https://doi.org/10.1093/bioinformatics/btq170 (2010).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30. https://doi.org/10.1093/nar/28.1.27 (2000).
Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951. https://doi.org/10.1002/pro.3715 (2019).
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51, D587–D592. https://doi.org/10.1093/nar/gkac963 (2023).
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14, 7. https://doi.org/10.1186/1471-2105-14-7 (2013).
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782. https://doi.org/10.1038/s41587-019-0114-2 (2019).
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612. https://doi.org/10.1038/ncomms3612 (2013).
Varet, H., Brillet-Guéguen, L., Coppée, J. Y. & Dillies, M. A. SARTools: A DESeq2- and EdgeR-based R pipeline for comprehensive differential analysis of RNA-Seq data. PLoS ONE 11, e0157022. https://doi.org/10.1371/journal.pone.0157022 (2016).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Geeleher, P., Cox, N. & Huang, R. S. pRRophetic: An R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS ONE 9, e107468. https://doi.org/10.1371/journal.pone.0107468 (2014).
Jiang, P. et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat. Med. 24, 1550–1558. https://doi.org/10.1038/s41591-018-0136-1 (2018).
Higareda-Almaraz, J. C. et al. Systems-level effects of ectopic galectin-7 reconstitution in cervical cancer and its microenvironment. BMC Cancer 16, 680. https://doi.org/10.1186/s12885-016-2700-8 (2016).
Driskill, J. H. & Pan, D. The hippo pathway in liver homeostasis and pathophysiology. Annu. Rev. Pathol. 16, 299–322. https://doi.org/10.1146/annurev-pathol-030420-105050 (2021).
Leone, R. D. & Powell, J. D. Metabolism of immune cells in cancer. Nat. Rev. Cancer 20, 516–531. https://doi.org/10.1038/s41568-020-0273-y (2020).
Tambay, V., Raymond, V. A. & Bilodeau, M. MYC rules: Leading glutamine metabolism toward a distinct cancer cell phenotype. Cancers https://doi.org/10.3390/cancers13174484 (2021).
Lahalle, A. et al. The p53 pathway and metabolism: The tree that hides the forest. Cancers https://doi.org/10.3390/cancers13010133 (2021).
Ciccarese, F., Zulato, E. & Indraccolo, S. LKB1/AMPK pathway and drug response in cancer: A therapeutic perspective. Oxid. Med. Cell. Longev. 2019, 8730816. https://doi.org/10.1155/2019/8730816 (2019).


Acknowledgements

We thank Di Chen et al. for providing the list of metabolism-related genes¹².

Author information

These authors contributed equally: Guoqiang Tao, Xiangyu Wen and Xingxing Wang.

Authors and Affiliations

Department of General Surgery, Shanghai Punan Hospital, Pudong New District, Shanghai, China
Guoqiang Tao, Xiangyu Wen, Xingxing Wang & Qi Zhou

Authors

Guoqiang Tao
Xiangyu Wen
Xingxing Wang
Qi Zhou

Contributions

Conception and design, Q.Z. and G.T.; manuscript writing, Q.Z., G.T., X.Wen and X.Wang; acquisition of data, X.Wen and X.Wang; analysis and interpretation of data, Q.Z., G.T., X.Wen and X.Wang. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Qi Zhou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Tao, G., Wen, X., Wang, X. et al. Bulk and single-cell transcriptome profiling reveal the metabolic heterogeneity in gastric cancer. Sci Rep 13, 8787 (2023). https://doi.org/10.1038/s41598-023-35395-y


Received: 25 November 2022
Accepted: 17 May 2023
Published: 31 May 2023
DOI: https://doi.org/10.1038/s41598-023-35395-y
