Gene expression profiling and protein–protein network analysis revealed prognostic hub biomarkers linking cancer risk in type 2 diabetic patients

Kasera, Harshita; Shekhawat, Rajveer Singh; Yadav, Pankaj; Singh, Priyanka

doi:10.1038/s41598-023-49715-9

Download PDF

Article
Open access
Published: 18 December 2023

Gene expression profiling and protein–protein network analysis revealed prognostic hub biomarkers linking cancer risk in type 2 diabetic patients

Harshita Kasera¹^na1,
Rajveer Singh Shekhawat¹^na1,
Pankaj Yadav¹ &
…
Priyanka Singh¹

Scientific Reports volume 13, Article number: 22605 (2023) Cite this article

1182 Accesses
4 Altmetric
Metrics details

Subjects

Abstract

Type 2 diabetes mellitus (T2DM) and cancer are highly prevalent diseases imposing major health burden globally. Several epidemiological studies indicate increased susceptibility to cancer in T2DM patients. However, genetic factors linking T2DM with cancer have been poorly studied. In this study, we followed computational approaches using the raw gene expression data of peripheral blood mononuclear cells of T2DM and cancer patients available in the gene expression omnibus (GEO) database. Our analysis identified shared differentially expressed genes (DEGs) in T2DM and three common cancer types, namely, pancreatic cancer (PC), liver cancer (LC), and breast cancer (BC). The functional and pathway enrichment analysis of identified common DEGs highlighted the involvement of critical biological pathways, including cell cycle events, immune system processes, cell morphogenesis, gene expression, and metabolism. We retrieved the protein–protein interaction network for the top DEGs to deduce molecular-level interactions. The network analysis found 7, 6, and 5 common hub genes in T2DM vs. PC, T2DM vs. LC, and T2DM vs. BC comparisons, respectively. Overall, our analysis identified important genetic markers potentially able to predict the chances of PC, LC, and BC onset in T2DM patients.

Identification of candidate biomarkers and pathways associated with type 1 diabetes mellitus using bioinformatics analysis

Article Open access 01 June 2022

Biomarker screening using integrated bioinformatics for the development of “normal—impaired glucose intolerance—type 2 diabetes mellitus”

Article Open access 24 February 2024

Identification of HDAC9 and ARRDC4 as potential biomarkers and targets for treatment of type 2 diabetes

Article Open access 25 March 2024

Introduction

Type 2 diabetes mellitus (T2DM) is a highly prevalent metabolic disorder that can occur at any age, albeit widespread in the middle (i.e., 45 years) to late age individuals. Insulin resistance, a condition in which muscle, liver, and fat cells fail to use insulin properly, precedes the onset of T2DM. Eventually, the beta cells of the pancreas cannot produce enough insulin due to progressive cell mass reduction or dysfunction. T2DM is characterized by insulin resistance and hyperglycemia¹. The genome-wide association studies in the past have revealed some 403 distinct genetic variants in T2DM, which could influence beta-cell functioning, adipocytes, liver, skeletal muscle², and many other tissues. As a result, it is not surprising that chronic T2DM can lead to additional complications such as nephropathy, cardiomyopathy, retinopathy, and neuropathy³. Consequentially, many differentially expressed genetic markers that could confer T2DM susceptibility were identified⁴. Subsequent bioinformatics analysis of these differentially expressed genes has revealed the genetic association of T2DM with these co-morbidities⁵. These findings have advanced our understanding of complications arising due to T2DM and have prospective applications in designing personalized prognostic and diagnostic tools for such heterogenic human diseases.

Cancer is another heterogenic disease that is also the second leading cause of human death⁶. It is characterized by unrestricted growth of abnormal cells. In some cases, these abnormal cells could metastasize to other parts of the human body. Liver, pancreatic, and breast cancers are among the most common cancer types⁷. It is well-known that T2DM and many common cancers share several risk factors like aging, obesity, and an unhealthy lifestyle⁸. Different epidemiological studies in the past suggest that T2DM condition increases the risk of several cancers, including liver, pancreatic⁹, breast^10,11, and endometrial¹². They report standardized incidence ratios to indicate an increased risk of cancers in T2DM patients. Pancreatic and liver cancers showed the highest standardized incidence ratios in different populations of T2DM patients from Denmark, Tyrol/Austria, Taiwan, Sweden, Australia, the Chinese mainland¹³, Finland¹⁴, and Lithuania. In addition, a few meta-analysis studies reported an increased risk of breast cancer in diabetic women^10,11. There is no clear molecular understanding of T2DM link to specific cancer types yet. However, the state of insulin resistance, hyperinsulinemia, hyperglycemia, chronic inflammation, and increased oxidative stress in T2DM could probably elicit mitogenic pathways and cause these cancers¹⁵. Moreover, a few Mendelian randomization studies indicate a positive association between T2DM and the risk of pancreatic, breast, lung, liver, and kidney cancer^16,17. Despite the availability of extensive evidence from epidemiological and meta-analysis that links cancer risk to T2DM, a systematic study of the shared genetic markers possibly predisposing this risk in T2DM patients is lacking for the common cancer types, namely pancreatic (PC), liver (LC), and breast (BC) cancer.

In this work, we performed gene expression analysis to identify predominant differential expressed genes (DEGs) from the peripheral blood mononuclear cell (PBMC) samples of T2DM patients, posing a risk towards three common cancer types (PC, LC, and BC). Their functional enrichment analysis indicated the involvement of gene expression, cell transport, and oxidation pathways. The protein–protein interaction (PPI) network provided common hub genes between T2DM and the three cancer types. We identified TGFB1 as a common hub gene between T2DM and PC/LC, significantly affecting survival in cancer patients. Therefore, the identified hub genes have a potential prognostic and therapeutic value in patients with T2DM patients and high cancer risks.

Results

Shared DEGs in T2DM and three common cancer types

Gene expression data of Homo sapiens in different diseased conditions were obtained from gene expression omnibus (GEO) database. Table 1 provides the summary of three different datasets used in our study. We employed a three-tiered filtering criterion to identify the shared DEGs between T2DM and three cancer types (Fig. 1). The raw gene expression datasets were normalized for each GEO study using the GEO2R tool (Supplementary Fig. S1). We identified 94 DEGs shared between T2DM and LC using our filtering criterion (Supplementary Fig. S2). Of these, 59 DEGs were upregulated, while 35 were downregulated (Supplementary Fig. S3). Likewise, for T2DM vs. BC comparison, we identified 16 shared DEGs (Supplementary Fig. S2), including 8 upregulated and 8 downregulated (Supplementary Fig. S3). For T2DM vs. PC, the three stringent filtering criteria resulted in an insignificant number of shared DEGs. However, using FDR ≤ 0.1 at filter 1, we identified 66 shared DEGs (Supplementary Fig. S2). Interestingly, the GSE15932 dataset also has data from 8 patients suffering from T2DM and PC disease. We applied the first two filtering criteria on T2DM & PC dataset, which identified 1203 DEGs (Supplementary Fig. S2). Surprisingly, we observed 69% (46 DEGs) shared DEGs of T2DM vs. PC overlapping with the DEG identified from T2DM & PC. For further functional analysis, we proceeded with the above 46 DEGs in T2DM vs. PC, where 26 were upregulated, and 20 were downregulated (Supplementary Fig S3). Similar validation could not be performed for T2DM vs. LC and T2DM vs. BC due to the unavailability of such a dataset. We observed a strong Pearson’s correlation (0.98 for T2DM vs. PC, 0.90 for T2DM vs. LC, and 0.87 for T2DM vs. BC) for the identified shared DEGs between T2DM and three common cancer types (i.e., PC, LC, and BC). Table 2 summarizes the number of shared DEGs narrowed down after applying our filtering criteria to each dataset.

Table 1 Overview of GEO datasets used in this study. The dash (–) symbol indicates that the information is not available.

Full size table

Table 2 Summary of the number of genes filtered with three-tiered filtering criteria (statistical significance (filter 1; adjusted p-value ≤ 0.05 unless specified), biological significance (filter 2; log₂FC ≥ |0.5|) and 10% relative distance around linear regression line of the correlated gene (filter 3).

Full size table

Functional enrichment and pathway analysis of shared DEGs

To understand the functional relevance of identified common genes between T2DM and three cancers (PC, LC and BC), we utilized GO biological process and KEGG signaling pathway analysis tools from GENECODIS software. The GO biological process of the shared DEGs in T2DM vs. PC comparison showed enrichment of vesicle-mediated transport, protein export from the nucleus, engulfment of target by autophagosome, positive regulation of cellular protein metabolic process, positive regulation of protein localization to nucleus, etc. (Table 3). The KEGG pathway analysis results were insignificant for the T2DM vs. PC comparison. In the T2DM vs. LC comparison, the shared DEGs revealed enrichment in several critical biological processes such as ATP biosynthetic process, peptide modification, regulation of RNA splicing, regulated exocytosis, and neutrophil degranulation (Table 4). We also found enrichment of KEGG pathways shown in Supplementary Table S1, (section T2DM vs. LC), which includes synaptic vesicle cycle (hsa04721, p = 5.63 × 10⁻²), rheumatoid arthritis (hsa05323, p = 5.63 × 10⁻²), collecting duct acid secretion (hsa04966, p = 5.63 × 10⁻²), phagosome (hsa04145, p = 5.63 × 10⁻²), and oxidative phosphorylation (hsa00190, 9.05 × 10⁻²).

Table 3 Shows top significantly enriched GO:BP involving the identified DEGs for T2DM vs. PC patients.

Full size table

Table 4 Shows top significantly enriched GO:BP involving the identified DEGs for T2DM vs. LC comparison.

Full size table

Similarly, analysis of common DEGs in T2DM vs. BC comparison led to the enrichment of hydrogen peroxide catabolic process, cellular oxidant detoxification, autophagic cell death, regulation of DNA recombination, hemoglobin biosynthetic process, and plasminogen activation from GO biological process (Table 5). The KEGG pathway analysis results were insignificant for the T2DM vs. BC comparison. Overall, cellular transport, gene expression, and cellular oxidation pathways were affected in this analysis, which motivated us further to perform the interaction analysis at the molecular level.

Table 5 Shows top significantly enriched GO:BP involving the identified DEGs for T2DM vs. BC comparison.

Full size table

Deducing molecular level interactions using PPI network

The above analysis linked identified common genes to specific biological pathways, which intrigued us to investigate their relationship at the molecular level. We constructed the protein–protein interaction (PPI) networks of identified common DEGs between T2DM and three cancer types (PC, LC, and BC). Our analysis yielded 16 nodes (genes) in T2DM vs. PC, 25 in T2DM vs. LC, and 5 in T2DM vs. BC in the main connected PPI networks (Supplementary Table S2, Fig. 2a–c).

In the PPI network analysis, the node size reflects their degree, and node color indicates the expression pattern, thus making it possible to deduce certain hub genes for respective disease conditions. Moreover, network analysis provides information on the important hub genes and enriched processes and pathways involving major hub nodes as represented by a colored ring around each node (Fig. 2a–d). Our network analysis revealed important genes associated with more than one biological pathway, thus suggesting their evident involvement in the respective disease conditions. The important genes from the T2DM vs. PC network included, HIST2H2AA3, HIST2H2AA4, NFKBIA, SESN2, SMURF1, TGFB1, TNRC6A (Fig. 2a). The T2DM vs. BC network could not reveal the genes associated with the biological pathways due to insufficient DEGs (Fig. 2b). For the T2DM vs. LC network, ALDOA, ATP6V0D1, ATP6V0C, ATP6V0E1, TGFB1, CYBA, CTSD, and DBNL genes were considered significant (Fig. 2c).

Interestingly, a common gene, TGFB1, between T2DM vs. PC and T2DM vs. LC, is upregulated in both conditions. This gene codes for a growth factor in cell proliferation, differentiation, and death. We used these findings as a basis for further investigation of hub genes.

Hub genes identification and survival analysis

We ranked the nodes in our PPI network analysis based on eleven different topological features using the cytoHubba plugin in the Cytoscape tool (see details in the materials and methods section). Thus, we identified the top 15 genes for T2DM vs. PC and T2DM vs. LC (Supplementary Table S3). We noticed several hub genes such as HIST2H2AA4, SESN2, and TNRC6A for T2DM vs. PC and ATP6V0D1, ATP6V0C and TGFB1 for T2DM vs. LC that were top-ranked in almost all the computed topological features. This analysis was not performed for T2DM vs. BC due to the insufficient number of identified common DEGs. We further narrowed common hub genes based on their top ranking and commonness across the computed topological features. Accordingly, we could identify seven genes as the hub genes for the T2DM vs. PC comparison. The relative log₂ fold expression of these genes in two diseased conditions is represented in Fig. 3a. For the T2DM vs. LC comparison, we identified six hub genes. We found similar expression levels of these genes in T2DM and LC disease conditions, as indicated in Fig. 3b. For the T2DM vs. BC comparison, five genes identified from PPI network analysis were considered the hub genes, as represented in Fig. 3c. We identified 17 hub genes combined for the three comparisons, which could serve as potential biomarkers. Further, survival analysis was also performed for all the common hub genes using a web resource UALCAN, which analyzes publicly available cancer OMICS data^18,19. The survival analysis revealed 4 hub genes (ATP6V0C, p < 0.051; ATP6V0D1, p < 0.02; ATP6V0E1, p < 0.0002 and TGFB1, p = 0.042) with significant p-value ≤ 0.05 to be linked with poor survival (Supplementary Fig. S4). The expression levels of these four hub genes were similar to our analysis. Interestingly, we found a common hub gene TGFB1 between T2DM vs. pancreatic cancer and T2DM vs. liver cancer patients, which significantly affects survival. In the case of breast cancer samples, the diagnosis was benign, so, likely, the survival is not dependent on the identified common hub signatures for these samples.

We further validated the identified prognostic 17 hub markers from our study using three additional publicly available datasets for T2DM patients (Supplementary Fig. S5a). Our validation analysis revealed overlapping functionally-enriched gene ontology biological processes (Supplementary Fig. S5b–e). Out of the 17 hub genes, we could validate 12 genes (p-value ≤ 0.05) based on their expression profile (Supplementary Fig. S5f), despite differences in their sample sources (whole blood sample vs. PBMC) and variability in detection ranges due to different platforms (Supplementary Fig. S6).

Discussion

T2DM and cancer have burdened the health sector throughout the world. Recently, several epidemiological studies have indicated a causal link between T2DM and common cancer types. However, the genetic association of T2DM to these cancers remains largely unknown. Our work identifies a genetic association between T2DM and three common cancer types, i.e., PC, LC, and BC. Our analysis identified 7, 6, and 5 hub genes showing a correlation between T2DM and the three cancers, i.e., PC, LC, and BC, respectively.

The KEGG pathway and GO biological process analyses showed enrichment of vesicle-mediated transport, vital in tumor microenvironment remodeling and transport of secretory insulin or other circulating mediators in diabetes^20,21,22,23. Neutrophil deregulation is known to be associated with diabetes²⁴ and cancer cell progression, metastasis, and activating dormant cancer cell²⁵. Moreover, we noticed positive regulation of metabolic and cellular protein metabolic processes in T2DM vs. PC conditions. Our analysis also revealed important processes contributing to gene expression, including regulation of RNA splicing and protein degradation, which are required to maintain homeostasis. In nutshell, we identified seven hub genes (HIST2H2AA3, HIST2H2AA4, NFKBIA, SESN2, SMURF1, TGFβ1, TNRC6A) in T2DM vs. PC, six common hub genes (ATP6V0D1, ATP6V0C, ATP6V0E1, CTSD, CYBA, TGFβ1) in T2DM vs. LC and five common hub genes (ALYREF, CDKN2D, NGDN, THRAP3, UBE2M) in T2DM vs. BC, respectively (Fig. 3). Supplementary Table S4 provides the detailed function and description of these hub genes. Noteworthy, The NFKBIA gene involved in the NFKB pathway has a crucial role in the initiation and progression of PC²⁵, and its gene polymorphism has an implicit role in the prognosis of T2DM²⁶. TGFB1 is involved at an early and advanced stage in liver tumorigenesis²⁷, and TGF-β signaling plays diverse roles in β cell development and functioning that has an impactful role in diabetic condition²⁸. The V-ATPase H+ transporting genes (ATP6V0D1, ATP6V0C, ATP6V0E1) maintain the intracellular pH and thus are critical for the Warburg effect observed in the cancer cells²⁹. The CTSD gene has a role in decreasing the expression of IGFBP3, contributing to mitogenesis in hepatoma cells³⁰, and it also has increased plasma activity in T2DM male patients³¹. The ALYREF gene plays a significant role in cellular growth, apoptosis, and mitochondrial energy metabolism in BC³², and as a 5mC-related gene that could have a functional role in T2DM³³. THRAP3 deficiency sensitizes BC cells, suggesting a probable involvement in DDR³⁴. Also, THRAP3 plays a direct role in controlling diabetic gene programming by interacting with PPARγ³⁵.

Thus, our analysis provides insight into biological and molecular events that could link T2DM with three common cancer types (PC, LC, and BC). Further, the identified genetic markers hold the potential to predict the chances of cancer onset in T2DM patients. However, systematic approaches for data collection, which consider variations in genetic profiling based on ethnicity, sex, and age, could further expand our understanding. Notably, such markers in T2DM patient PBMC samples predisposing to increased cancer risk could help diagnosis at an early stage and provide benefits for developing personalized therapeutic strategies.

Material and methods

Microarray data collection

The raw gene expression data of Homo sapiens used in this study was available at the gene expression omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/) database. The studies used in this work specifically comprised the expression profile of the peripheral blood mononuclear cells (PBMCs) using the Affymetrix platform GPL570. The GSE15932 dataset included expression data of 32 PBMC samples containing 8 healthy individuals, 8 T2DM, 8 PC, and 8 samples of both T2DM and PC patients. The LC gene expression data along with their respective healthy controls were collected from GSE58208. The study comprised of gene expression analysis of PBMC samples from healthy individuals, liver cancer and hepatitis B carrier patients. The GSE27562 datasets collected the breast cancer gene expression profile and the healthy women samples. Blood was collected from 37 women who have benign breast cancer in comparison to 31 healthy individuals.

Data pre-processing and identification of DEGs

The raw gene expression data was normalized using the GEO2R tool. The prospective shared genetic markers between T2DM and three cancer types (PC, LC, and BC) were obtained by applying three filters. Our first filter is based on the adjusted p-value, indicating the statistical significance of differentially expressed genes. We considered genes having adjusted p-value ≤ 0.05 for further analysis. Our second filtering criteria is based on the relative log₂FC of gene expression, calculated for disease condition samples with respect to healthy condition samples indicating the biologically significant genes. We retained genes having absolute log₂FC ≥ |0.5| for further analysis. Lastly, we kept correlated genes falling within the 10% interval from a regression line passing through the origin (i.e., x = 0 and y = 0 on an xy plane) between log₂FC in T2DM and respective cancer types. Only upregulated or downregulated genes in both conditions were selected for further analysis. Pearson correlation was calculated for the genes narrowed down after applying filtering criteria in each condition, using the cor function in the R software.

Functional enrichment analysis of DEGs

The gene ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed to annotate the biological function of the DEGs using the online software GENECODIS. We considered a cut-off of the false discovery rate (FDR) at 0.10 to define the significance level.

Protein–protein interaction (PPI) network construction and visualization

The PPI network analysis was performed based on the Search Tool for the Retrieval of Interacting Genes (STRING, https://string-db.org), a database of known and predicted protein–protein interactions. We used genes differentially expressed in T2DM and three cancer types (PC, LC, and BC) to construct the PPI network. Interaction with a score > 0.8 was deemed statistically significant. The PPI network was created using Cytoscape (version 3.8.2), an open-source software for visualizing molecular interaction networks and biological pathways.

Hub genes identification

The hub genes were explored using the cytoHubba application in the Cytoscape tool. For this purpose, the PPI network was analyzed to compute various topological features, including degree, maximal clique, centrality, density of maximum neighborhood component, maximum neighborhood component, edge percolated component, bottleneck, eccentricity, closeness, radiality, betweenness, and stress. The top 15 nodes were considered notable genes in the network for each computed topological feature. The nodes common to all topological features were regarded as the critical hub genes or key nodes in the network.

Survival analysis

The survival analysis was performed using UALCAN to analyze expression data from publicly available databases (http://ualcan.path.uab.edu/index.html)^18,19. The correlation between hub gene expression and survival in the three cancer types was analyzed by UALCAN. The patients with cancer were split into two groups according to the expression of a particular gene (high vs. low/medium expression), and the survival time was compared between the two groups.

Validation dataset analysis

To validate the discovery dataset (GSE15932), we shortlisted 3 validation datasets (GSE23561, GSE69528, and GSE189005) matching criteria of T2DM, Homo Sapien taxid, expression profile by array, and blood samples. Raw gene expression data for GSE69528 and GSE189005 datasets were normalized using GEO2R tool. GSE23561 raw gene expression data was normalized by "normalizeBetweenArrays" command of Bioconductor package limma V3.562 in R4.3.0. Significant DEGs in T2DM were obtained after applying a filter for adjusted p-value ≤ 0.05 and log2FC ≥ |0.5|. Functional enrichment analysis of the obtained DEGs was performed using GENECODIS. For representation, only functionally enriched GO:BP overlapping with the discovery datasets were shown in Fig S5c–e. The expression profile of the 17 hub genes among discovery and validation datasets were plotted and color-coded (yellow and green shaded boxes) when found significant (p ≤ 0.05) in any of the validation datasets.

Ethical standards

The manuscript does not contain clinical studies or patient data.

Data availability

The datasets used during the current study are available on the gene expression omnibus database (URL: http://www.ncbi.nlm.nih.gov/geo/). The datasets used in this study are available with the following accession IDs: GSE15932, GSE58208 and GSE27562. The codes used in the study are available on GitHub (https://github.com/RajveerSingh27R/Cross-Phenotype-Analysis).

Abbreviations

T2DM:: Type 2 diabetes mellitus
PC:: Pancreatic cancer
LC:: Liver cancer
BC:: Breast cancer
DEGs:: Differentially expressed genes
PPI:: Protein-protein interaction
PBMC:: Peripheral blood mononuclear cells
log₂FC:: Log₂ fold change
GO:: Gene ontology
KEGG:: Kyoto encyclopedia of genes and genomes
FDR:: False discovery rate

References

DeFronzo, R. A. et al. Type 2 diabetes mellitus. Nat. Rev. Dis. Primers 1, 1–22 (2015).
Article Google Scholar
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Article PubMed PubMed Central CAS Google Scholar
Daryabor, G., Atashzar, M. R., Kabelitz, D., Meri, S. & Kalantar, K. The Effects of type 2 diabetes mellitus on organ metabolism and the immune system. Front. Immunol. 11, 1582 (2020).
Article PubMed PubMed Central CAS Google Scholar
Morris, A. P. et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat. Genet. 44, 981–990 (2012).
Article PubMed PubMed Central CAS Google Scholar
Azzam, S. K. et al. Genetic associations with diabetic retinopathy and coronary artery disease in emirati patients with type-2 diabetes mellitus. Front. Endocrinol. (Lausanne). 10, 283 (2019).
Article PubMed PubMed Central Google Scholar
Rositch, A. F. Global burden of cancer attributable to infections: The critical role of implementation science. Lancet Glob. Health 8, e153–e154 (2020).
Article PubMed Google Scholar
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2016. CA Cancer J. 66, 7–30 (2016).
Article Google Scholar
Giovannucci, E. et al. Diabetes and cancer: A consensus report. Diabetes Care 33, 1674–1685 (2010).
Article PubMed PubMed Central Google Scholar
Pan, X.-F. et al. Type 2 diabetes and risk of incident cancer in China: A prospective study among 0.5 million chinese adults. Am. J. Epidemiol. 187, 1380–1391 (2018).
Article PubMed PubMed Central Google Scholar
Boyle, P. et al. Diabetes and breast cancer risk: A meta-analysis. Br. J. Cancer 107, 1608–1617 (2012).
Article PubMed PubMed Central CAS Google Scholar
Hardefeldt, P. J., Edirimanne, S. & Eslick, G. D. Diabetes increases the risk of breast cancer: A meta-analysis. Endocr.-Related Cancer 19, 793–803 (2012).
Article Google Scholar
Saed, L. et al. The effect of diabetes on the risk of endometrial Cancer: An updated a systematic review and meta-analysis. BMC Cancer 19, 527 (2019).
Article PubMed PubMed Central Google Scholar
Wang, M. et al. Cancer risk among patients with type 2 diabetes mellitus: A population-based prospective study in China. Sci. Rep. 5, 11503 (2015).
Article PubMed PubMed Central ADS CAS Google Scholar
Saarela, K. et al. Cancer incidence among Finnish people with type 2 diabetes during 1989–2014. Eur. J. Epidemiol. 34, 259–265 (2019).
Article PubMed Google Scholar
Shlomai, G., Neel, B., LeRoith, D. & Gallagher, E. J. Type 2 diabetes mellitus and cancer: The role of pharmacotherapy. J. Clin. Oncol. 34, 4261–4269 (2016).
Article PubMed PubMed Central CAS Google Scholar
Pearson-Stuttard, J. et al. Type 2 diabetes and cancer: An umbrella review of observational and mendelian randomization studies. Cancer Epidemiol. Biomark. Prev. 30, 1218–1228 (2021).
Article CAS Google Scholar
Shen, B. et al. Association between age at diabetes onset or diabetes duration and subsequent risk of pancreatic cancer: Results from a longitudinal cohort and mendelian randomization study. Lancet Reg. Health West Pac. 30, 100596 (2022).
Article PubMed PubMed Central Google Scholar
Chandrashekar, D. S. et al. UALCAN: An update to the integrated cancer data analysis platform. Neoplasia 25, 18–27 (2022).
Article PubMed PubMed Central CAS Google Scholar
Chandrashekar, D. S. et al. UALCAN: A portal for facilitating tumor subgroup gene expression and survival analyses. Neoplasia 19, 649–658 (2017).
Article PubMed PubMed Central CAS Google Scholar
Bracey, K. M., Gu, G. & Kaverina, I. Microtubules in pancreatic β cells: Convoluted roadways toward precision. Front. Cell Develop. Biol. 10, 915206 (2022).
Wattanathamsan, O. & Pongrakhananon, V. Emerging role of microtubule-associated proteins on cancer metastasis. Front. Pharmacol. 13, 935493 (2022).
NorenHooten, N. & Evans, M. K. Extracellular vesicles as signaling mediators in type 2 diabetes mellitus. Am. J. Physiol. Cell Physiol. 318, C1189–C1199 (2020).
Article Google Scholar
Li, Y., Zhao, W., Wang, Y., Wang, H. & Liu, S. Extracellular vesicle-mediated crosstalk between pancreatic cancer and stromal cells in the tumor microenvironment. J. Nanobiotechnol. 20, 208 (2022).
Article Google Scholar
Wong, S. L. et al. Diabetes primes neutrophils to undergo NETosis, which impairs wound healing. Nat. Med. 21, 815–819 (2015).
Article PubMed PubMed Central CAS Google Scholar
Silke, J. & O’Reilly, L. A. NF-κB and pancreatic cancer; chapter and verse. Cancers (Basel) 13, 4510 (2021).
Article PubMed CAS Google Scholar
Raza, W., Ghafoor, S., Abbas, S. Z. & Muhammad, S. A. Polymorphic evaluation of NFKBIA and SRR with type 2 diabetes mellitus in population of southern Punjab. Meta Gene 26, 100803 (2020).
Article Google Scholar
Tu, S., Huang, W., Huang, C., Luo, Z. & Yan, X. Contextual regulation of TGF-β signaling in liver cancer. Cells 8, 1235 (2019).
Article PubMed PubMed Central CAS Google Scholar
Wang, H.-L., Wang, L., Zhao, C.-Y. & Lan, H.-Y. Role of TGF-beta signaling in beta cell proliferation and function in diabetes. Biomolecules 12, 373 (2022).
Article PubMed PubMed Central Google Scholar
Sun, H., Chen, L., Cao, S., Liang, Y. & Xu, Y. Warburg effects in cancer and normal proliferating cells: Two tales of the same name. Genom. Proteom. Bioinform. 17, 273–286 (2019).
Article CAS Google Scholar
Ruiz-Blázquez, P., Pistorio, V., Fernández-Fernández, M. & Moles, A. The multifaceted role of cathepsins in liver disease. J. Hepatol. 75, 1192–1202 (2021).
Article PubMed Google Scholar
Ding, L. et al. Plasma cathepsin D activity rather than levels correlates with metabolic parameters of type 2 diabetes in male individuals. Front. Endocrinol. 11, 575070 (2020).
Klec, C. et al. ALYREF, a novel factor involved in breast carcinogenesis, acts through transcriptional and post-transcriptional mechanisms selectively regulating the short NEAT1 isoform. Cell Mol. Life Sci. 79, 391 (2022).
Article PubMed PubMed Central CAS Google Scholar
Song, Y. et al. Comprehensive analysis of key m5C modification-related genes in type 2 diabetes. Front. Genet. 13, 1015879 (2022).
Beli, P. et al. Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response. Mol. Cell 46, 212–225 (2012).
Article PubMed PubMed Central CAS Google Scholar
Choi, J. H. et al. Thrap3 docks on phosphoserine 273 of PPARγ and controls diabetic gene programming. Genes Dev. 28, 2361–2369 (2014).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

HK (file number: 09/1125(0017)/2020-EMR-I) and RSS (file number: 09/1125(0019)/2021-EMR-I) are supported by the CSIR-NET fellowship. PY acknowledges the financial support from the Department of Biotechnology (project number BT/GenomeIndia/2018) and Indian Institute of Technology, Jodhpur, India (project number I/SEED/PY/20200037). PS is thankful to the Science and Engineering Research Board (ECR/2017/001410), Department of Biotechnology (BT/12/IYBA/2019/02), and Board of Research in Nuclear Sciences (55/14/02/2021-BRNS/10206) for the financial support.

Author information

These authors contributed equally: Harshita Kasera and Rajveer Singh Shekhawat.

Authors and Affiliations

Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, NH 62, Nagaur Road, Karwar, Jodhpur, Rajasthan, 342037, India
Harshita Kasera, Rajveer Singh Shekhawat, Pankaj Yadav & Priyanka Singh

Authors

Harshita Kasera
View author publications
You can also search for this author in PubMed Google Scholar
Rajveer Singh Shekhawat
View author publications
You can also search for this author in PubMed Google Scholar
Pankaj Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Priyanka Singh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.K. collected study data, performed primary DEG analysis and designed the working strategy; R.S.S. performed functional enrichment, PPI network and hub gene analysis; P.S. conceptualized the work and together with P.Y. designed and supervised the study; all authors contributed to writing the manuscript.

Corresponding authors

Correspondence to Pankaj Yadav or Priyanka Singh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kasera, H., Shekhawat, R.S., Yadav, P. et al. Gene expression profiling and protein–protein network analysis revealed prognostic hub biomarkers linking cancer risk in type 2 diabetic patients. Sci Rep 13, 22605 (2023). https://doi.org/10.1038/s41598-023-49715-9

Download citation

Received: 27 July 2023
Accepted: 11 December 2023
Published: 18 December 2023
DOI: https://doi.org/10.1038/s41598-023-49715-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Identification of candidate biomarkers and pathways associated with type 1 diabetes mellitus using bioinformatics analysis

Biomarker screening using integrated bioinformatics for the development of “normal—impaired glucose intolerance—type 2 diabetes mellitus”

Identification of HDAC9 and ARRDC4 as potential biomarkers and targets for treatment of type 2 diabetes

Introduction

Results

Shared DEGs in T2DM and three common cancer types

Functional enrichment and pathway analysis of shared DEGs

Deducing molecular level interactions using PPI network

Hub genes identification and survival analysis

Discussion

Material and methods

Microarray data collection

Data pre-processing and identification of DEGs

Functional enrichment analysis of DEGs

Protein–protein interaction (PPI) network construction and visualization

Hub genes identification

Survival analysis

Validation dataset analysis

Ethical standards

Data availability

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links