Identidication of novel biomarkers in non-small cell lung cancer using machine learning

Wang, Fangwei; Su, Qisheng; Li, Chaoqian

doi:10.1038/s41598-022-21050-5

Download PDF

Article
Open access
Published: 06 October 2022

Identidication of novel biomarkers in non-small cell lung cancer using machine learning

Fangwei Wang¹,
Qisheng Su² &
Chaoqian Li¹

Scientific Reports volume 12, Article number: 16693 (2022) Cite this article

6079 Accesses
10 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Lung cancer is one of the leading causes of cancer-related deaths worldwide, and non-small cell lung cancer (NSCLC) accounts for a large proportion of lung cancer cases, with few diagnostic and therapeutic targets currently available for NSCLC. This study aimed to identify specific biomarkers for NSCLC. We obtained three gene-expression profiles from the Gene Expression Omnibus database (GSE18842, GSE21933, and GSE32863) and screened for differentially expressed genes (DEGs) between NSCLC and normal lung tissue. Enrichment analyses were performed using Gene Ontology, Disease Ontology, and the Kyoto Encyclopedia of Genes and Genomes. Machine learning methods were used to identify the optimal diagnostic biomarkers for NSCLC using least absolute shrinkage and selection operator logistic regression, and support vector machine recursive feature elimination. CIBERSORT was used to assess immune cell infiltration in NSCLC and the correlation between biomarkers and immune cells. Finally, using western blot, small interfering RNA, Cholecystokinin-8, and transwell assays, the biological functions of biomarkers with high predictive value were validated. A total of 371 DEGs (165 up-regulated genes and 206 down-regulated genes) were identified, and enrichment analysis revealed that these DEGs might be linked to the development and progression of NSCLC. ABCA8, ADAMTS8, ASPA, CEP55, FHL1, PYCR1, RAMP3, and TPX2 genes were identified as novel diagnostic biomarkers for NSCLC. Monocytes were the most visible activated immune cells in NSCLC. The knockdown of the TPX2 gene, a biomarker with a high predictive value, inhibited A549 cell proliferation and migration. This study identified eight potential diagnostic biomarkers for NSCLC. Further, the TPX2 gene may be a therapeutic target for NSCLC.

Use tumor suppressor genes as biomarkers for diagnosis of non-small cell lung cancer

Article Open access 12 February 2021

Lung adenocarcinoma and lung squamous cell carcinoma cancer classification, biomarker identification, and gene expression analysis using overlapping feature selection methods

Article Open access 25 June 2021

Genetic network and gene set enrichment analyses identify MND1 as potential diagnostic and therapeutic target gene for lung adenocarcinoma

Article Open access 03 May 2021

Introduction

Non-small cell lung cancer (NSCLC), a subtype of lung cancer, is one of the most prevalent malignancies worldwide. According to studies, the prognosis for NSCLC is highly dependent on the stage of disease progression, and the earlier the disease is detected, the better the chances of survival within 5 years¹. Patients with early-stage lung cancer often have no obvious symptoms and thus miss the best time for treatment. Moreover, metastasis is the most devastating feature of the tumor, ultimately leading to a high mortality rate². Although some advances in lung cancer treatment and pharmaceutical research have been made, serious complications such as anemia and neutropenia persist, and recurrence rates and mortality in NSCLC patients are still not effectively controlled³. Therefore, there is an urgent need to identify reliable biomarkers in the diagnosis and prognosis of NSCLC.

With the significant advancement of microarray and sequencing technology in recent years, gene characterization based on messenger RNA expression levels has shown great promise in diagnosing cancer. For instance, the breast cancer susceptibility protein-1 (BRCA1) gene has been considered a predictor in breast cancer risk models. It is used as a clinical genetic test standard, while the tumor germination (Bd) gene is also considered a prognostic biomarker and has an independent prognostic value for disease-free survival (DFS) and overall survival (OS) in colon cancer⁴.

It has previously been reported that bioinformatics methods are used to analyze comprehensive gene expression data to obtain cancer-related biomarkers for effective prevention, diagnosis, and treatment of cancer. Machine learning (ML) methods are a branch of bioinformatics applied to various aspects of cancer research and are a hot topic in lung cancer research. For example, ML methods successfully developed and validated a predictive model for cancer-related deep vein thrombosis⁵. Lai et al. used ML methods to create gene signatures that accurately predicted prostate cancer prognosis⁶. Zheng et al. developed an integrated radiomic model using the ML method to predict the prognosis of prostate cancer patients⁷. ML methods have also been used to screen for potential biomarkers of prostate cancer and osteosarcoma^8,9. Support vector machines (SVM), a linear classifier with maximum intervals defined in the feature space, have gradually been applied to machine learning of cancer using its efficient two-class model to classify cancer with high accuracy from a large number of genomes and efficiently extract key genes to achieve high-accuracy, low-error sample classification. In recent years, many researchers have used SVM methods to study cancer and have made significant advances. For example, Luo et al. used SVM methods to construct predictive models for synchronous lung metastases (SLM) in osteosarcoma¹⁰. Su et al. identified eight genes associated with colon cancer prognosis using the SVM method¹¹. Cai et al. used SVM methods to construct radiomics-based models for diagnosing lung adenocarcinoma (LUAD)¹². SVM methods also differentiated between epithelial ovarian cancer (EOC) and surrounding tissues, aiding EOC diagnosis¹³. In addition, the SVM methods successfully screened colorectal biomarkers for personalized treatment of patients with post-operative liver metastases¹⁴. However, there have been few studies on potential NSCLC biomarkers.

In this study, we obtained differentially expressed genes (DEGs) for NSCLC from the Gene Expression Omnibus (GEO) database. We used functional enrichment analysis to identify NSCLC-related biomarkers using an ML approach, followed by infiltrative immune cell analysis and in vitro validation of the biomarkers’ functions. Our findings may be useful in the early diagnosis of NSCLC and the mechanistic study of NSCLC.

Materials and methods

Data processing and DEGs screening

We obtained three microarray datasets (GSE18842, GSE32863, and GSE21933) from the Gene Expression Omnibus (GEO) public databases, and the organism parameter was set to “Homo sapiens”. The raw data from these datasets were processed using R language statistical software (version 4.1.2)¹⁵. Using the R ‘Limma’ package¹⁶, we identified DEGs between normal lung and NSCLC tissues, with an adjusted P-value < 0.05 and |logFC|≥ 2 as statistical significance. Heatmaps and volcano plots were plotted to show the differential expression of DEGs.

Functional and pathways enrichment analysis

Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG)^17,18,19, and Disease Ontology (DO) enrichment analyses of DEGs were performed using the R ‘clusterProfiler’ package²⁰ and visualized using the R ‘ggplot2’ package²¹ with GO functional annotation, including biological process, cellular component, and molecular function terms. The R ‘clusterProfiler’ and R ‘org.Hs.eg.db’ packages²² were used to enrich DEGs using gene set enrichment analysis (GSEA). A result with an adjusted P-value < 0.05 and a false discovery rate < 0.05 was considered statistically different.

ML methods to identify NSCLC biomarkers

For biomarker screening, two ML methods were used: least absolute shrinkage and selection operator (LASSO) logistic regression²³ and support vector machine-recursive feature elimination (SVM-RFE)²⁴. The algorithm LASSO used the R ‘glmnet’ package²⁵, while the SVM-RFE algorithm used the R ‘e1071’ package²⁶. The following are the model settings: LASSOcvfit = cv.glmnet (x,y,family = ‘binomial,” alpha = 1, type.measure = ‘deviance,’ nfolds = 10). SVM = rfeControl (functions = caretFuncs, method = “cv,” methods = “svmRadial”). The point with the lowest cross-validation in the vertical axis corresponds to the biomarker genes to be found; a difference of P < 0.05 was considered statistically significant.

Validation of biomarkers

The R ‘ggpubr’ package²⁷ was used to examine the biomarker expression in GSE32864. In addition, the biomarkers’ diagnostic efficiency was validated using receiver operating characteristic (ROC) curves generated with the R ‘pROC’ package²⁸; P < 0.05 was considered statistically significant.

Assessment and correlation analysis of infiltrating immune cells

The CIBERSORT algorithm²⁹ was used to analyze the relationship between infiltrating immune cells and biomarkers; a correlation heatmap was produced using the R ‘corrplot’ package³⁰ to detect the association of each immune cell with the other cells in the LUAD sample; the violin map using the R “ggplot2” package showed differences in the expression of 22 immune cells. A Spearman correlation analysis was performed between diagnostic biomarkers and infiltrating immune cells using the R “ggstatsplot” package.

Survival analysis

The GEIPA online website (http://gepia.cancer-pku.cn/) is a dataset based on The Cancer Genome Atlas and Genotype-Tissue Expression that provides a fast and customizable web-based tool. A summary of the process is as follows: click on “survival plots” and enter the gene name, then select “LUAD,” “OS,” and “RFS,” and finally observe the P-value and output the graph; P < 0.05 was considered statistically significant.

Cell culture and cell transfection

A549 cells were purchased from the ATCC (Shanghai, China). A549 cells were cultured in Dulbecco’s modified Eagle medium (Thermo Fisher Scientific, USA) containing 10% serum (Thermo Fisher Scientific, Wilmington, DE, USA). For si-RNA transfection, A549 cells were transfected with si-TPX2 using the Lipofectamine 2000 transfection reagent (Invitrogen, Waltham, MA, USA) according to the manufacturer’s instructions. To target TPX2, the following si-RNA sequences were used: AGCCTCAGAAGATCTCTTAG (si-TPX2).

Western blotting

The total protein concentration extracted with lysis buffer containing protease inhibitors was measured using the bicinchoninic acid (BCA) protein assay kit (Beyotime Biotechnology Inc., Shanghai, China), and proteins were separated using a polyvinylidene fluoride membrane (Millipore, Billerica, MA, USA). Proteins were separated on 10% skim sodium dodecyl sulfate–polyacrylamide gel electrophoresis, blocked with 20% skim milk and incubated with primary antibody overnight at 4 °C. Western blotting was performed using an enhanced chemiluminescence detection reagent (Beyotime Biotechnology Inc., Shanghai, China.) and according to the manufacturer’s protocol. P < 0.05 was considered statistically significant.

Statistical analysis

All statistical analyses were performed using the SPSS version 21.0 software package (SPSS, Chicago, Il, USA). The data are expressed as mean ± standard deviation. Categorical variables were analyzed using the χ2 or Fisher’s exact test. For paired samples, continuous variables were analyzed using the student’s t-test, and differences between groups were analyzed using an analysis of variance calculation. When the basic assumptions of the student’s t-test were not satisfied, the Wilcoxon–Mann–Whitney test was used. P-value < 0.05 was considered to indicate a statistically significant difference.

Results

Identification of DEGs

To analyze the diagnostic genes of NSCLC, we designed a flowchart (Fig. 1). Using the ‘Limma’ package in the R language, 371 DEGs, including 165 up-regulated genes and 206 down-regulated genes, were screened in NSCLC and normal lung tissue samples of GSE18842, GSE32864, and GSE21933 according to the criteria (adjusted P-value < 0.05 and |logFC|≥ 1). The results were expressed in the heatmap (Fig. 2a) and a volcano plot (Fig. 2b).

Functional enrichment analyses of the DEGs

We investigated the possible biological functions of these 371 DEGs using the GO, KEGG, DO, and GSEA functional enrichment analyses. The GO analysis revealed that DEGs were primarily enriched in nuclear division and extracellular matrix organization, implying a link between tumor cell division and distant tumor metastasis (Fig. 3a). According to KEGG pathway analysis, the IL-17 signaling pathway, cell cycle, complement and coagulation cascades, and malaria were four significantly enriched pathways (Fig. 3b). The DO analysis revealed that these DEGs were remarkably enriched in lung disease, NSCLC, and integumentary system disease (Fig. 3c). Finally, the GSEA analysis revealed that in lung cancer tissue, complement and coagulation cascades, cytokine-cytokine receptor interactions, hematopoietic cell lineage, leukocyte transendothelial migration, lysosome, and natural killer cell-mediated cytotoxicity were highly active (Fig. 3d), whereas base excision repair, cell cycle, DNA replication, mismatch repair, p53 signaling pathway, and pyrimidine metabolism were highly active in normal nasopharyngeal tissue (Fig. 3e). All these findings indicate that these DEGs may be critical in NSCLC.

Screening for NSCLC biomarkers and validation

The LASSO logistic algorithm was used in this study to identify, 26 characteristic genes (Fig. 4a), while the SVM-RFE method was used to identify 40 characteristic genes (Fig. 4b). Eight biomarker genes were obtained as a result of the intersection, ABCA8, ADAMTS8, ASPA, CEP55, FHL1, PYCR1, RAMP3, and TPX2 genes (Fig. 4c). To further validate their potential as diagnostic biomarkers for NSCLC, we examined their expression in the GSE32863 dataset (Fig. 5), which revealed that ABCA8, ADAMTS8, ASPA, FHL1, and RAMP3 genes were down-regulated in NSCLC while CEP55, PYCR1, and TPX2 genes were up-regulated. The accuracy of these eight biomarkers in distinguishing NSCLC from normal individuals was evaluated using a receiver operating characteristic (ROC) analysis, and all eight biomarkers demonstrated high sensitivity and specificity. (The areas under the ROC curves (AUCs) = 0.999 in GSE18842 and GSE 21993 and 0.910 in GSE32863 for the ABCA8 gene; AUCs = 0.998 in GSE18842 and GSE21993 and 0.930 in GSE32863 for the ADAMTS8 gene, AUCs = 0.996 in GSE18842 and GSE 21993 and 0.941 in GSE32863 for the ASPA gene, AUCs = 0.998 in GSE18842 and GSE21993 and 0.904 in GSE32863 for the CEP55 gene, AUCs = 0.998 in GSE18842 and GSE21993 and 0.945 in GSE32863 for the FHL1 gene, AUCs = 0.998 in GSE18842 and GSE21993 and 0.921 in GSE32863 for the PYCR1 gene, AUCs = 0.993 in GSE18842 and GSE21993 and 0.936 in GSE32863 for the RAMP3 gene, AUCs = 0.998 in GSE18842 and GSE21993 and 0.895 in GSE32863 for the TPX2 gene. All P < 0.05) (Figs. 6 and 7).

Assessment of immune cell infiltration

A comprehensive and dynamic understanding of the immune microenvironment is essential to develop effective therapeutic strategies. Therefore, in this study, we investigated immune cell infiltration in NSCLC and the relationship between biomarkers and infiltrating immune cells. First, we found significant differences in the composition of the 22 infiltrating immune cell types in each tissue sample (Fig. 8a). The correlation matrix showed the strongest positive correlation between eosinophils and monocytes and the strongest negative correlation between macrophage M0 cells and monocytes (Fig. 8b). Monocytes and eosinophils were the most down-regulated cells in NSCLC, while plasma cells and macrophage M0 were the most up-regulated, and activation of neutrophils, macrophage M1, and NK cells was low (Fig. 8c). Figure 8 depicts the relationship between the expression of eight biomarkers and the infiltration of immune cells. Monocytes, eosinophils, NK cells activated, neutrophils, mast cells resting, and T cells CD4 memory resting were positively correlated with with the ABCA8 gene expression, whereas plasma cells, macrophages M0, Follicular helper T (Tfh) cells, and macrophages M1 were negatively correlated (Fig. 9). The ADAMTS8 expression levels were negatively correlated with macrophages M1 T cells follicular helper, macrophages M0, and plasma cells, and positively correlated with monocytes, eosinophils, NK cells activated, neutrophils, T cells CD8, and T cells CD4 memory resting. The with the APSA gene expression levels expression levels were negatively correlated with T cells follicular helper, macrophages M0, and plasma cells and positively correlated with monocytes, eosinophils, neutrophils, mast cells resting, and NK cells activated. With The CEP55 gene expression levels expression levels were positively correlated with plasma cells, macrophages M0, macrophages M1, and T cells follicular helper and negatively correlated with T cells CD8, NK cells activated, mast cells resting, neutrophils, eosinophils, and monocytes. With The FHL1 gene expression levels expression levels were positively correlated with monocytes, eosinophils, NK cells activated, neutrophils, T cells CD4 memory resting, mast cells resting, and T cells CD8 and negatively correlated with T cells gamma delta, T cells regulatory, T cells CD4 memory activated, T cells follicular helper, macrophages M0, and plasma cells. With the PYCR1 gene expression levels were positively correlated with plasma cells, macrophages M0, T cells regulatory, T cells follicular helper, and macrophages M1 and negatively correlated with mast cells resting, T cells CD4 memory resting, NK cells activated, neutrophils, eosinophils, and monocytes. With The RAMP3 gene expression levels were positively correlated with monocytes, eosinophils, mast cells resting, T cells CD8, NK cells activated, and neutrophils, and negatively correlated with T cells follicular helper, macrophages M0, and plasma cells. The TPX2 expression levels were positively correlated with plasma cells, macrophages M0, macrophages M1, T cells follicular helper, and B cells naive and negatively correlated with T cells CD4 memory resting, NK cells activated, mast cells resting, neutrophils, eosinophils, and monocytes (P < 0.01).

Survival analysis

To determine the potential prognostic value of these eight biomarkers, we investigated the relationship between each biomarker’s expression and OS and DFS in NSCLC. The findings revealed that ABCA8 and FHL1 genes were associated with longer OS in lung cancer patients (P < 0.05), whereas TPX2 and CEP55 genes were associated with shorter OS (P < 0.05) (Figs. 10a–d), and other biomarkers were not significantly associated with OS (P > 0.05). Notably, only the TPX2 gene was associated with poor DFS (Fig. 10e), suggesting that the TPX2 gene may have a potential prognostic value in NSCLC, which we investigated further.

Knockdown of the TPX2 gene inhibited the proliferation and migration of A549 cells

To investigate the potential role of the TPX2 gene in NSCLC, we used a western blot to compare the expression of TPX2 in A549 cells and BEAS-2B cells. We found that A549 cells had higher levels of TPX2 protein expression than BEAS-2B cells (P < 0.05) (Fig. 11a, b). Furthermore, we inhibited TPX2 expression by transfecting si-RNA-targeted TPX2 into A549 cells. The results of western blot analysis revealed that the levels of TPX2 proteins were significantly lower in A549 cells after transfection with si-TPX2 compared to si-control (P < 0.05) (Fig. 11c, d), indicating that the transfection was complete and ready for the next step of the experiment. The CCK8 proliferation assays confirmed that the TPX2 gene knockdown significantly inhibited A549 cell proliferation (P < 0.05) (Fig. 12). Moreover, the results of the transwell migration assay revealed that the relative number ratios of migrating cells were significantly lower in the TPX2 gene knockdown cells compared to si-control cells (P < 0.05) (Fig. 13), indicating that the TPX2 gene knockdown resulted in A549 cell migration ability. Therefore, we speculated that the TPX2 gene might act as an oncogene, promoting NSCLC progression. However, this conclusion needs to be verified via in vivo experiments.

Discussion

NSCLC is well known for being asymptomatic and can only be detected at an early stage through physical examination¹. As the tumor grows, develops, and spreads, serious symptoms emerge, including chest pain, breathing difficulties, liver metastases, and a slew of seriously life-threatening symptoms. Given the lack of obvious symptoms in the early stages of NSCLC, which makes diagnosis difficult, “early diagnosis and early treatment” has become the treatment consensus for NSCLC³¹. With the advancement of bioinformatics, effective analysis and exploration of cancer genes to find tumor biomarkers have become a hot topic for early cancer diagnosis expression profiles and treatment, For example, chen et al. have used new computational models in the field of miRNA to make great contributions to the pathogenesis of diseases and new drug development. For example, through the construction of Neighborhood Constraint Matrix Completion for MiRNA-Disease Association prediction (NCMCMDA), deep-belief network for miRNA-disease association prediction (DBNMDA), Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA) models to accurately predict the potential relationship between miRNA-disease and in breast neoplasms, lung neoplasms, esophageal neoplasms have been validated^32,33,34. The study of chen et al. has greatly improved the experimental efficiency and provided a new theoretical basis for the prevention, diagnosis and treatment of complex human diseases by screening disease-associated miRNAs through computational models, but such methods have not been adequately studied and described for biomarkers of NSCLC progression. Therefore, it is critical to identify a sensitive, safe, and feasible NSCLC biomarker for diagnostic and therapeutic purposes and to improve patient survival³⁵.

The bioinformatics analysis of the microarray dataset from the GEO database identified 165 up-regulated and 206 down-regulated genes between NSCLC and normal lung tissue samples. Moreover, functional analyses revealed that these DEGs were linked to lung cancer tumorigenesis and metastasis. The most significantly enriched pathway, the cell cycle signaling pathway, has been shown to have mutations that can affect the genomic and microenvironmental characteristics of LUAD patients³⁶ and can also be used as an assessment criterion for post-operative adjunctive therapy in LUAD patients³⁷. Furthermore, GSEA analysis revealed that these DEGs might affect base excision repair, the cell cycle, DNA replication, mismatch repair, and the p53 signaling pathway. The most activated p53 signaling pathway, which has been linked to oncogenic effects³⁸, promotes tumor growth in NSCLC and pancreatic ductal adenocarcinoma (PDAC)^39,40. However, these DEGs’ specific functions and molecular mechanisms need to be investigated further.

Because of its flexibility and power, ML is increasingly being used to screen novel biomarkers. We used two recently popular ML approaches, LASSO logistic regressions, and SVM-RFM, to identify the best diagnostic biomarkers for NSCLC. After combining the two methods and ROC analysis, eight NSCLC-related biomarkers with accurate predictive properties were identified (the AUCs of all these eight genes were greater than 0.89), including ADAMTS8, ABCA8, TPX2, CEP55, ASPA, FHL1, RAMP3, and PYCR1 genes. A disintegrin and metallopeptidase with thrombospondin motif type 8 (ADAMTS8) is a member of the zinc metalloproteinase family and is considered a tumor suppressor⁴¹. Wu et al. found that ADAMTS8 has been associated with clinical staging and lymph node metastasis in esophageal cancer patients⁴². ADAMTS8 can also inhibit lung cancer by targeting Vascular endothelial growth factor (VEGFA)⁴³. ATP-binding cassette subfamily A 8 (ABCA8) has been linked to tumors. Studies have shown that ABCA8 can inhibit the proliferation of breast cancer cells by regulating the AMPK/mTOR signaling pathway⁴⁴, and ABCA8 can also be used as a prognostic marker for hepatocellular carcinoma⁴⁵ and gastric adenocarcinoma⁴⁶. Centrosomal protein, 55 kD (CEP55) has been reported to be a potential biomarker and therapeutic target for PDAC and lung cancer⁴⁷. Several studies have shown that CEP55 is carcinogenic in the colon and esophageal cancers^48,49. Targeting protein for xenopus kinesin-like protein 2 (TPX2) is a cell cycle-associated gene that plays a pro-oncogenic role in hepatocellular carcinoma cells and acts synergistically with anti-cancer drugs⁵⁰. TPX2 can also be a therapeutic target for breast cancer and is associated with patient prognosis⁵¹. Current studies have shown that recombinant Helicobacter Pylori aspartate ammonia-lyase (ASPA) is an effective predictor of prognosis in colorectal cancer patients⁵². Four-and-a-half LIM domains protein (FHL1) has a diagnostic value in microscopic papillary thyroid carcinoma⁵³, and it has been reported that FHL1 acts as an inhibitor in colorectal cancer cells^54,55,56. Receptor activity-modifying protein 3 (RAMP3) is an accessory molecule that forms complexes with and regulates the function of specific G protein-coupled receptors (GPCRs). To date, studies have shown that RAMP3 is overexpressed in hepatocellular carcinoma patients and that RAMP3 is an independent prognostic factor for overall survival and RFS⁵⁷. Pyrroline-5-Carboxylate Reductase 1 (PYCR1) is a mitochondrial enzyme that is the final step in the proline biosynthetic pathway. Currently, PYCR1 has been reported to regulate tumour cell proliferation in hepatocellular carcinoma and cloud be an effective therapeutic target for multiple myeloma^58,59, as well as inhibit the development of clear cell renal cell carcinoma by causing mitochondrial dysfunction and interfering with oxidative stress pathways⁶⁰. Thus, according to the available studies, the eight biomarkers mentioned above play an important role in tumorigenesis and progression, but their exact mechanisms in NSCLC remain unknown.

Given the importance of the immune microenvironment in the development of lung cancer, we also performed immunological analyses. We found that monocytes differed most between normal and NSCLC tissue samples and correlated closely with all eight previously obtained biomarkers, implying that monocytes may be the most active immune cells in NSCLC. Monocytes have been shown to play a role in tumor progression⁶¹. For example, up-regulation of monocytes promotes CT26 tumor progression^62,63. It has also been linked to the immune checkpoint matrix metalloproteinases in hepatocellular carcinoma⁶⁴. The findings of this study suggest that monocytes may contribute to clinical immunotherapy and warrant further investigation.

Another significant finding in our study was that TPX2 expression was linked to poor prognosis, and TPX2 was up-regulated in both bioinformatics and western blot validation. We also found that TPX2 promoted the proliferation and migration of lung cancer A549 cells in vitro, suggesting that TPX2 was involved in developing NSCLC and could be a therapeutic target for NSCLC. However, the exact mechanism remains unknown, and we will investigate the role of TPX2 in vivo experiments.

ML methods have been successfully applied in cancer research in recent years, for example, for cancer classification⁶⁵, for analyzing gene chip data to screen for cancer-related biomarkers to assist doctors in making better diagnoses and decisions⁶⁶, and for cancer prediction to significantly improve prediction accuracy⁶⁵. Notably, the ML method has been demonstrated to be effective in determining a non-invasive, accurate, and reliable diagnosis of NSCLC. Li et al. used specific carbonyl volatile organic compounds in exhaled breath as a biomarker for detecting lung cancer to distinguish lung cancer patients from healthy controls and patients with benign lung nodules⁶⁶. Zhang et al. suggest that 5-hydroxymethylcytosine in circulating cell-free DNA can be used to diagnose and treat NSCLC⁶⁷. Zhang et al. concluded that five circulating micro RNAs have important prognostic capabilities in lung cancer⁶⁸. Wang et al. identified eight differentially expressed long non-coding RNAs in LUAD that could be used as potential diagnostic biomarkers⁶⁹.

This study combines machine learning methods (LASSO algorithm and SVM-RFE algorithm) with medical experiments to identify non-small cell lung cancer (NSCLC)-related biomarkers through the GEO public database, and finds that the progression of NSCLC is associated with immune cell infiltration, which greatly improves the efficiency of basic experiments for clinicians, and the screened NSCLC-related biomarkers is important for the prevention, diagnosis and treatment of NSCLC. Subsequently, we used in vitro functional assays to silence the prognostic TPX2 biomarker gene in NSCLC A549 cells and found that the proliferation and migration ability of A549 cells were significantly reduced, which further confirmed that the NSCLC-related biomarkers screened by the machine learning approach are novel and reliable. Our study provides a new idea for the pathogenesis and targeted therapy of NSCLC.

Conclusion

In conclusion, we used machine learning methods to identify eight diagnostic biomarkers for NSCLC, including ADAMTS8, ABCA8, TPX2, CEP55, ASPA, FHL1, RAMP3, and PYCR1 genes, followed by functional enrichment analysis and immune correlation analysis, and we validated the potential role of the TPX2 gene in vitro. Our findings identify new potential biomarkers for diagnosing and treating NSCLC and reveal new approaches that may have therapeutic potential for NSCLC.

Data availability

The datasets generated during the current study are available in the Gene Expression Omnibus (GEO) repository, (https://www.ncbi.nlm.nih.gov/geo/), (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE18842), (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE21933), (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE32863).

References

Meador, C. B. & Lovly, C. M. A tale of two histologies: Dissecting the biology of lineage transformation in lung cancer. Cancer Discov. 11, 2962–2964 (2021).
Article PubMed PubMed Central Google Scholar
Esfahani, M. S. et al. Inferring gene expression from cell-free DNA fragmentation profiles. Nat. Biotechnol. https://doi.org/10.1038/s41587-022-01222-4 (2022).
Article PubMed PubMed Central Google Scholar
Dai, J. et al. Sleeve resection after neoadjuvant chemoimmunotherapy in the treatment of locally advanced non-small cell lung cancer. Transl. Lung Cancer Res. 11, 188–200 (2022).
Article CAS PubMed PubMed Central Google Scholar
Basile, D. et al. Tumor budding is an independent prognostic factor in stage III colon cancer patients: A post-hoc analysis of the IDEA-France phase III trial (PRODIGE-GERCOR). Ann. Oncol. https://doi.org/10.1016/j.annonc.2022.03.002 (2022).
Article PubMed Google Scholar
Jin, S. et al. Machine learning predicts cancer-associated deep vein thrombosis using clinically available variables. Int. J. Med. Inform. 161, 104733 (2022).
Article PubMed Google Scholar
Lai, Y.-L. et al. Identification of a steroid hormone-associated gene signature predicting the prognosis of prostate cancer through an integrative bioinformatics analysis. Cancers 14, 1565 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zheng, H. et al. Multiparametric MRI-based radiomics model to predict pelvic lymph node invasion for patients with prostate cancer. Eur. Radiol. https://doi.org/10.1007/s00330-022-08625-6 (2022).
Article PubMed PubMed Central Google Scholar
Ayyad, S. M. et al. A new framework for precise identification of prostatic adenocarcinoma. Sensors 22, 1848 (2022).
Article ADS PubMed PubMed Central Google Scholar
Ding, F.-P., Tian, J.-Y., Wu, J., Han, D.-F. & Zhao, D. Identification of key genes as predictive biomarkers for osteosarcoma metastasis using translational bioinformatics. Cancer Cell. Int. 21, 640 (2021).
Article CAS PubMed PubMed Central Google Scholar
Luo, Z. et al. Radiomics analysis of multiparametric MRI for prediction of synchronous lung metastases in osteosarcoma. Front. Oncol. 12, 802234 (2022).
Article PubMed PubMed Central Google Scholar
Su, Y. et al. Colon cancer diagnosis and staging classification based on machine learning and bioinformatics analysis. Comput. Biol. Med. 145, 105409 (2022).
Article PubMed Google Scholar
Cai, J. et al. A radiomics study to predict invasive pulmonary adenocarcinoma appearing as pure ground-glass nodules. Clin. Radiol. 76, 143–151 (2021).
Article CAS PubMed Google Scholar
van Vliet-Pérez, S. M. et al. Hyperspectral imaging for tissue classification after advanced stage ovarian cancer surgery-a pilot study. Cancers 14, 1422 (2022).
Article PubMed PubMed Central Google Scholar
Granata, V. et al. EOB-MR based radiomics analysis to assess clinical outcomes following liver resection in colorectal liver metastases. Cancers 14, 1239 (2022).
Article PubMed PubMed Central Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, (Vienna, Austria, 2017). https://www.R-project.org.
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Article PubMed PubMed Central Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M. & Tanabe, M. KEGG: Integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
Article CAS PubMed Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Article CAS PubMed PubMed Central Google Scholar
Perez-Iratxeta, C., Bork, P. & Andrade-Navarro, M. A. Update of the G2D tool for prioritization of gene candidates to inherited diseases. Nucleic Acids Res. 35, W212-216 (2007).
Article PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Tibshirani, R. The lasso method for variable selection in the Cox model. Stat. Med. 16, 385–395 (1997).
Article CAS PubMed Google Scholar
Lin, X. et al. A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J. Chromatogr. B Analyt. Technol. Biomed. Life Sci. 910, 149–155 (2012).
Article CAS PubMed Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Article PubMed PubMed Central Google Scholar
Huang, M.-L., Hung, Y.-H., Lee, W. M., Li, R. K. & Jiang, B.-R. SVM-RFE based feature selection and Taguchi parameters optimization for multiclass SVM classifier. Sci. World J. 2014, 795624 (2014).
Article Google Scholar
Pathan, M. et al. FunRich: An open access standalone functional enrichment and interaction network analysis tool. Proteomics 15, 2597–2601 (2015).
Article CAS PubMed Google Scholar
Tang, X., Zhang, S., Wang, Z., Liu, J. & Ying, Z. ProcData: An R package for process data analysis. Psychometrika 86, 1058–1083 (2021).
Article MathSciNet PubMed MATH Google Scholar
Xue, G., Hua, L., Zhou, N. & Li, J. Characteristics of immune cell infiltration and associated diagnostic biomarkers in ulcerative colitis: Results from bioinformatics analysis. Bioengineered 12, 252–265 (2021).
Article CAS PubMed PubMed Central Google Scholar
Serang, S., Jacobucci, R., Brimhall, K. C. & Grimm, K. J. Exploratory mediation analysis via regularization. Struct. Equ. Modeling 24, 733–744 (2017).
Article MathSciNet PubMed PubMed Central Google Scholar
Oudkerk, M., Liu, S., Heuvelmans, M. A., Walter, J. E. & Field, J. K. Lung cancer LDCT screening and mortality reduction–evidence, pitfalls and future perspectives. Nat. Rev. Clin. Oncol. 18, 135–151 (2021).
Article PubMed Google Scholar
Chen, X., Sun, L.-G. & Zhao, Y. NCMCMDA: miRNA-disease association prediction through neighborhood constraint matrix completion. Brief. Bioinform. 22, 485–496 (2021).
Article CAS PubMed Google Scholar
Chen, X., Li, T.-H., Zhao, Y., Wang, C.-C. & Zhu, C.-C. Deep-belief network for predicting potential miRNA-disease associations. Brief. Bioinform. 22, bbaa186 (2021).
Article PubMed Google Scholar
Chen, X., Zhu, C.-C. & Yin, J. Ensemble of decision tree reveals potential miRNA-disease associations. PLoS Comput. Biol. 15, e1007209 (2019).
Article PubMed PubMed Central Google Scholar
Li, N. et al. One-off low-dose CT for lung cancer screening in China: A multicentre, population-based, prospective cohort study. Lancet. Respir. Med. 10, 378–391 (2022).
Article CAS PubMed Google Scholar
Shan, G. et al. Genomic and tumor microenvironment differences between cell cycle progression pathway altered/non-altered patients with lung adenocarcinoma. Front. Oncol. 12, 843528 (2022).
Article PubMed PubMed Central Google Scholar
Li, J. et al. Identifying 18F-FDG PET-metabolic radiomic signature for lung adenocarcinoma prognosis via the leveraging of prognostic transcriptomic module. Quant. Imaging Med. Surg. 12, 1893–1908 (2022).
Article PubMed PubMed Central Google Scholar
Su, R. et al. A pan-cancer analysis of the oncogenic role of Holliday junction recognition protein in human tumors. Open Med. (Wars) 17, 317–328 (2022).
Article CAS Google Scholar
Xiao, X. et al. Green tea-derived theabrownin suppresses human non-small cell lung carcinoma in xenograft model through activation of not only p53 signaling but also MAPK/JNK signaling pathway. J. Ethnopharmacol. 291, 115167 (2022).
Article CAS PubMed Google Scholar
Ma, Z. et al. ZMAT1 acts as a tumor suppressor in pancreatic ductal adenocarcinoma by inducing SIRT3/p53 signaling pathway. J. Exp. Clin. Cancer Res. 41, 130 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhang, K. et al. ADAMTS8 inhibits cell proliferation and invasion, and induces apoptosis in breast cancer. Onco. Targets Ther. 13, 8373–8382 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wu, Z. et al. ADAMTS8 inhibits progression of esophageal squamous cell carcinoma. DNA Cell. Biol. https://doi.org/10.1089/dna.2020.6053 (2020).
Article PubMed Google Scholar
Zhang, Y., Hu, K., Qu, Z., Xie, Z. & Tian, F. ADAMTS8 inhibited lung cancer progression through suppressing VEGFA. Biochem. Biophys. Res. Commun. 598, 1–8 (2022).
Article CAS PubMed Google Scholar
Lv, C., Yang, H., Yu, J. & Dai, X. ABCA8 inhibits breast cancer cell proliferation by regulating the AMP activated protein kinase/mammalian target of rapamycin signaling pathway. Environ. Toxicol. https://doi.org/10.1002/tox.23495 (2022).
Article PubMed Google Scholar
Zhang, J., Zhang, X., Li, J. & Song, Z. Systematic analysis of the ABC transporter family in hepatocellular carcinoma reveals the importance of ABCB6 in regulating ferroptosis. Life Sci. 257, 118131 (2020).
Article CAS PubMed Google Scholar
Guo, Y., Wang, Z. W., Su, W. H., Chen, J. & Wang, Y. L. Prognostic value and immune infiltrates of ABCA8 and FABP4 in stomach adenocarcinoma. Biomed. Res. Int. 2020, 4145164 (2020).
Article PubMed PubMed Central Google Scholar
Wang, C. et al. Identification of hub genes in pancreatic ductal adenocarcinoma using bioinformatics analysis. Iran J. Public Health 50, 2238–2245 (2021).
PubMed PubMed Central Google Scholar
Lin, Y., Chen, Y., Shen, R., Chen, D. & Lin, Y. MicroRNA-148a-3p suppresses cell proliferation and migration of esophageal carcinoma by targeting CEP55. Cell. Mol. Biol. Lett. 26, 54 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bozic, D. et al. Predicting sulforaphane-induced adverse effects in colon cancer patients via in silico investigation. Biomed. Pharmacother. 146, 112598 (2022).
Article CAS PubMed Google Scholar
Wang, X., Wang, J., Shen, H., Luo, Z. & Lu, X. Downregulation of TPX2 impairs the antitumor activity of CD8+ T cells in hepatocellular carcinoma. Cell. Death Dis. 13, 223 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kahl, I. et al. The cell cycle-related genes RHAMM, AURKA, TPX2, PLK1, and PLK4 are associated with the poor prognosis of breast cancer patients. J. Cell. Biochem. 123, 581–600 (2022).
Article CAS PubMed Google Scholar
Zhao, F. et al. Identification of sixteen metabolic genes as potential biomarkers for colon adenocarcinoma. J. BUON 26, 1252–1259 (2021).
PubMed Google Scholar
Yang, F. et al. Identification of key genes associated with papillary thyroid microcarcinoma characteristics by integrating transcriptome sequencing and weighted gene co-expression network analysis. Gene 811, 146086 (2022).
Article CAS PubMed Google Scholar
Liu, Y. et al. FHL1 Inhibits the progression of colorectal cancer by regulating the Wnt/β-catenin signaling pathway. J. Cancer 12, 5345–5354 (2021).
Article CAS PubMed PubMed Central Google Scholar
Eshibona, N. et al. Upregulation of FHL1, SPNS3, and MPZL2 predicts poor prognosis in pediatric acute myeloid leukemia patients with FLT3-ITD mutation. Leuk. Lymphoma. https://doi.org/10.1080/10428194.2022.2045594 (2022).
Article PubMed Google Scholar
Niu, C. et al. Downregulation and growth inhibitory role of FHL1 in lung cancer. Int. J. Cancer 130, 2549–2556 (2012).
Article CAS PubMed Google Scholar
Fang, A. et al. RAMP3 is a prognostic indicator of liver cancer and might reduce the adverse effect of TP53 mutation on survival. Future Oncol. 14, 2615–2625 (2018).
Article CAS PubMed Google Scholar
Zhang, J., Shang, L., Jiang, W. & Wu, W. Shikonin induces apoptosis and autophagy via downregulation of pyrroline-5-carboxylate reductase1 in hepatocellular carcinoma cells. Bioengineered 13, 7904–7918 (2022).
Article CAS PubMed PubMed Central Google Scholar
Oudaert, I. et al. Pyrroline-5-carboxylate reductase 1: A novel target for sensitizing multiple myeloma cells to bortezomib by inhibition of PRAS40-mediated protein synthesis. J. Exp. Clin. Cancer Res. 41, 45 (2022).
Article CAS PubMed PubMed Central Google Scholar
Wu, Y. et al. A mitochondrial dysfunction and oxidative stress pathway-based prognostic signature for clear cell renal cell carcinoma. Oxid. Med. Cell Longev. 2021, 9939331 (2021).
PubMed PubMed Central Google Scholar
Albakri, M. M., Huang, S.C.-C., Tashkandi, H. N. & Sieg, S. F. Fatty acids secreted from head and neck cancer induce M2-like macrophages. J. Leukoc. Biol. https://doi.org/10.1002/JLB.1A0521-251R (2022).
Article PubMed Google Scholar
Simon Davis, D. A. et al. Machine learning predicts cancer subtypes and progression from blood immune signatures. PLoS ONE 17, e0264631 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hecking, T. et al. Programmed cell death ligand-1 (PDL-1) correlates with tumor infiltration by immune cells and represents a promising target for immunotherapy in endometrial cancer. Anticancer Res. 42, 1367–1376 (2022).
Article CAS PubMed Google Scholar
Zhang, L. et al. Comprehensive analysis of the MIR4435-2HG/miR-1-3p/MMP9/miR-29-3p/DUXAP8 ceRNA network axis in hepatocellular carcinoma. Discov. Oncol. 12, 38 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cruz, J. A. & Wishart, D. S. Applications of machine learning in cancer prediction and prognosis. Cancer Inform. 2, 59–77 (2007).
PubMed PubMed Central Google Scholar
Li, M. et al. Breath carbonyl compounds as biomarkers of lung cancer. Lung Cancer 90, 92–97 (2015).
Article PubMed Google Scholar
Zhang, J. et al. 5-Hydroxymethylome in circulating cell-free DNA as a potential biomarker for non-small-cell lung cancer. Genom. Proteom. Bioinformat. 16, 187–199 (2018).
Article CAS Google Scholar
Zhang, Y.-H., Jin, M., Li, J. & Kong, X. Identifying circulating miRNA biomarkers for early diagnosis and monitoring of lung cancer. Biochim. Biophys. Acta Mol. Basis Dis. 1866, 165847 (2020).
Article CAS PubMed Google Scholar
Wang, Y. et al. Screening key lncRNAs for human lung adenocarcinoma based on machine learning and weighted gene co-expression network analysis. Cancer Biomark. 25, 313–324 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We are grateful to the GEO database for providing the platform and to the contributors for uploading their meaningful datasets. We thank Bullet Edits Limited for the linguistic editing and proofreading of the manuscript. This work was done at the First Affiliated Hospital of Guangxi Medical University.

Funding

This research was supported by Guangxi Natural Science Foundation under Grant NO. 2020GXNSFDA238003.

Author information

Authors and Affiliations

Department of Respiratory Medicine, The First Affiliated Hospital of Guangxi Medical University, Nanning, 530021, Guangxi, China
Fangwei Wang & Chaoqian Li
Department of Clinical Laboratory, The First Affiliated Hospital of Guangxi Medical University, Nanning, 530021, Guangxi, China
Qisheng Su

Authors

Fangwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qisheng Su
View author publications
You can also search for this author in PubMed Google Scholar
Chaoqian Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.F.W. and C.Q. L. designed the implementation of the research, drafted preliminary papers, and participated in investigations. W.F.W., Q.S.S. and C.Q.L. participated in research design and implementation, manuscript revision, manuscript submission, and fund acquisition. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Chaoqian Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, F., Su, Q. & Li, C. Identidication of novel biomarkers in non-small cell lung cancer using machine learning. Sci Rep 12, 16693 (2022). https://doi.org/10.1038/s41598-022-21050-5

Download citation

Received: 06 July 2022
Accepted: 22 September 2022
Published: 06 October 2022
DOI: https://doi.org/10.1038/s41598-022-21050-5

This article is cited by

Machine learning pipeline to analyze clinical and proteomics data: experiences on a prostate cancer case
- Patrizia Vizza
- Federica Aracri
- Giuseppe Tradigo
BMC Medical Informatics and Decision Making (2024)
Transcriptome profiling and metabolic pathway analysis towards reliable biomarker discovery in early-stage lung cancer
- Muthu Kumar Thirunavukkarasu
- Priyanka Ramesh
- Shanthi Veerappapillai
Journal of Applied Genetics (2024)
ABCA8 Elevation Predicts the Prognosis and Exerts the Anti-oncogenic Effects on the Malignancy of Non-small Cell Lung Cancer via TCF21-Mediated Inactivation of PI3K/AKT
- Xin Yu
- Guoqiong Zhou
- Nana Zhang
Molecular Biotechnology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Use tumor suppressor genes as biomarkers for diagnosis of non-small cell lung cancer

Lung adenocarcinoma and lung squamous cell carcinoma cancer classification, biomarker identification, and gene expression analysis using overlapping feature selection methods

Genetic network and gene set enrichment analyses identify MND1 as potential diagnostic and therapeutic target gene for lung adenocarcinoma

Introduction

Materials and methods

Data processing and DEGs screening

Functional and pathways enrichment analysis

ML methods to identify NSCLC biomarkers

Validation of biomarkers

Assessment and correlation analysis of infiltrating immune cells

Survival analysis

Cell culture and cell transfection

Western blotting

Statistical analysis

Results

Identification of DEGs

Functional enrichment analyses of the DEGs

Screening for NSCLC biomarkers and validation

Assessment of immune cell infiltration

Survival analysis

Knockdown of the TPX2 gene inhibited the proliferation and migration of A549 cells

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Machine learning pipeline to analyze clinical and proteomics data: experiences on a prostate cancer case

Transcriptome profiling and metabolic pathway analysis towards reliable biomarker discovery in early-stage lung cancer

ABCA8 Elevation Predicts the Prognosis and Exerts the Anti-oncogenic Effects on the Malignancy of Non-small Cell Lung Cancer via TCF21-Mediated Inactivation of PI3K/AKT

Comments

Search

Quick links