Expression and gene regulation network of INHBA in Head and neck squamous cell carcinoma based on data mining

Inhibin subunit beta A(INHBA) encodes an individual from the TGF-β superfamily of proteins and the ligand could be further homo-dimerized to shape activin A or hetero-dimerized to frame inhibin with inhibin beta B. We studied INHBA expression, mutations, regulation, function networks and immune infiltrates in data from patients with Head and neck squamous cell carcinoma (HNSCC) based on different open databases by utilizing multi-dimensional investigation techniques. This study gives staggered evidence for the significance of INHBA in head and neck squamous cell carcinoma and its potential role as a novel biomarker. Our outcomes propose that INHBA overexpression in HNSCC has profound impacts in the center hub of post-transcriptional regulation, which is firmly identified with protein translation. Meanwhile, we also examine the function of the identified miRNAs that were related to INHBA and molecular function of these miRNAs were mainly enhanced in transcription factor activity, transcription regulator activity. In addition, B cells of immune infiltrates affecting the prognosis and might have a prognostic significance related to INHBA in HNSCC. Our outcomes show that data mining efficiently uncovers information about INHBA expression in HNSCC and more importance establishing a foundation for further investigation of the role of INHBA in carcinogenesis.

in gastric cancer and linked to poor survival, INHBA gene silencing can gives a potential target in the treatment of gastric cancer [9][10][11] . Seder et al. indicated that INHBA may be regulated by DNA methylation in lung adenocarcinoma 12 . In addition, abnormal overexpression of INHBA was detected in various malignant tumors, including esophageal cancer, prostate cancer, and ovarian cancer [13][14][15] . It was also discovered that INHBA is involved in tumor-node-metastasis (TNM) stage and venous invasion 16 . Wang et al. discovered that INHBA was linked to cancer diameter and tumor invasion depth, and further suggested that patients with a higher expression of INHBA had a shorter disease-free survival rate 17 . Above these results propose that INHBA may be a novel proto-oncogene. However, determining the interaction between INHBA and HNSCC remains an unsolved problem. Thus, in the present study our objection is to studied INHBA expression, mutations, regulation, function networks and immune infiltrates in data from patients with HNSCC based on different open databases by utilizing multi-dimensional analysis strategies. Our results will be of great significance in clarifying the pathogenesis of HNSCC and in filtering biomarkers for diagnosis.

Results
inHBA expression in HnScc. Information in the Oncomine database uncovered that mRNA expression and DNA copy number variation of INHBA were fundamentally higher in HNSCC tissues when compare with normal tissues (P < 0.01). Box plot showing INHBA mRNA levels in, respectively, the Peng Head-Neck, Sengupta Head-Neck, Ginos Head-Neck, Ye Head-Neck and Pyeon Multi-cancer Head-Neck datasets, meanwhile, the fold change differences all were over 2 means that INHBA expression were significantly higher in HNSCC tissues (Fig. 1). Further subgroup analysis of multiple clinic pathological features of 520 HNSCC samples in the TCGA reliably indicated high transcription of INHBA. In age subgroup (normal-vs-age (21-40 yrs), normal-vs-age (41-60 yrs), normal-vs-age (61-80 yrs) and normal-vs-age (81-100 yrs)) analysis the transcription level of INHBA was essentially higher in HNSCC patients than healthy individuals. In HPV status subgroups (normal-vs-HPV − ve and normal-vs-HPV + ve) analysis; gender subgroup (normal-vs-male and normal-vs-female); race subgroup (normal-vs-Caucasian,normal-vs-AfricanAmerican and normal-vs-Asian); tumor grade subgroup (normal-vs-Grade 1, normal-vs-Grade 2, normal-vs-Grade 3 andnormal-vs-Grade 4) analysis the INHBA was also significantly higher in HNSCC patients (Fig. 2). frequency and type of inHBA alterations in HnScc. We then based on cBioPortal to explore the sorts and frequency of INHBA modifications in HNSCC from HNSCC patients sequencing data in the TCGA database. INHBA was modified in 42 of 496 (8%) HNSCC patients (Fig. 3A). These modifications were mRNA upregulation in 34 cases, amplification in 4 cases, missense mutation in 3 cases and inframe mutation in 1 case. Thus, mRNA upregulation is the most common type of INHBA in HNSCC patients. Besides, Kaplan-Meier survival analysis demonstrated statistically significant that INHBA overexpression was related to overall survival (P < 0.05) and disease/progression-free survival (P < 0.05) in HNSCC (Fig. 3C,D).
Biological interaction network of RBM8A. In order to decide the biological interaction network of INHBA in HNSCC, we applied to tab Network in cBioPortal to demonstrate INHBA neighboring genes that were changed at top 10 frequencies (Fig. 3B and Table 1). The results indicated that the most amplification gene were GDNF and GDF6; the most up-regulation and down-regulation gene was FKBP1A and MAP3K7, respectively; the most mutation gene were HRAS and the most total alteration was GDNF ( Table 1). The 50 most as often altered neighbor genes of INHBA were showed utilizing Network and the most frequent alterations were GDNF (13.1%), GDF6 (10.9%) and MAP3K7 (10.1%). To examine the function of the identified 50 neighbor genes, biological analyses were performed utilizing GO enrichment and KEGG pathway analysis via Enrichr online database. P < 0.05 as the criterion deemed statistically significant. GO analysis results demonstrated that biological processes (BP) were significantly enriched in transmembrane receptor protein serine/threonine kinase signaling pathway, positive guideline of pathway-restricted SMAD protein phosphorylation, cellular response to BMP stimulus et al.  (Fig. 4). Therefore, the biological interaction network of INHBA alterations is engaged with the activing binding, protein complex form, regulation of protein and several cancer processes.
Go and KeGG pathway examinations of co-expression genes connected with inHBA in HnScc. LinkedOmics was utilized to analyze mRNA sequencing information from HNSCC patients in the TCGA. A T-test was utilized to analyze connections among INHBA and genes differentially expressed in HNSC (Fig. 5A). The 50 critical gene sets positively and negatively connected with INHBA as appeared in the heat map ( Fig. 5B,C). Significant GO and KEGG term examination by gene set enrichment analysis (GSEA) demonstrated that genes differentially expressed in connection with INHBA were found essentially in the sensory perception of chemical stimulus, olfactory receptor activity and olfactory transduction (Fig. 6). We also list the 5 most significant target networks miRNA, kinase and transcription factor target networks of decidedly related INHBA created by GSEA and the results indicated that the most leading-edge number in kinase target, miRNA target and transcription factor target was kinase_GRK3, GTGCCAA, MIR-96 and V$CP2_01, respectively ( Table 2). miRnAs related to inHBA. According to cumulative weighted context++ score, the top 5 among 961 miRNAs family was miR-133a-3p.2/133b, miR-130-3p/301-3p/454-3p, miR-153-3p, miR-140-3p and miR-203a-3p.1 that related to gene INHBA. Conserved sites for miRNA family broadly conserved among vertebrates showed in Fig. 7A. To examine the function of the identified 961 miRNAs, biological enrichment was performed via Funrich database. Biological processes were significantly enriched in regulation of nucleic acid metabolism, cell communication and signal transduction; Cell component were primarily enriched in the nucleus, cytoplasm, golgi apparatus; Molecular function were mainly enhanced in ubiquitin-specific protease activity, transcription factor activity, transcription regulator activity, Biological pathway enriched in TRAIL signaling pathway, VEGY and VEGFR signaling pathway, glypican pathway.

Discussion
Differential expression and dysfunction of the TGF-β protein has been reported in various cancers 18,19 . INHBA encodes an individual from the TGF-β superfamily of proteins and the ligand could be further homo-dimerized to shape activin A or hetero-dimerized to frame inhibin with inhibin beta B 20 . In our previous bioinformatic analysis, we found that INHBA was overexpressed in HNSCC tissue based on online analysis tools. Until now, there is no effective biomarkers and therapeutic targets relevant to HNSCC progression and treatment response. So, it is emergency to identifying oncogenic biomarkers and elucidating the underlying mechanism of the initiation and development of HNSCC would make profound impact of the early diagnosis and effective treatment for patients with high malignancy 21 . To acquire more detailed information into the potential elements of INHBA in HNSCC and its regulatory mechanism, we performed bioinformatics analysis of open sequencing information to provide instruct future research in HNSCC.
Investigation of transcriptional sequencing information from thousands clinical samples TCGA databases comprising six geographic regions and subgroups analysis stratified based on gender, age, HPV status, gender, race and tumor grade confirmed that INHBA mRNA levels and CNVs are fundamentally higher in HNSCC when compare with normal tissue. Meanwhile, the fold change differences all were over 2 indicated strong that INHBA overexpression in HNSCC. We hypothesis that INHBA may be a potential diagnostic and prognostic marker and deserves further clinical validation. CNV affects gene expression through changes in gene sequence location and

Gene Symbol Amplification
Homozygous Deletion www.nature.com/scientificreports www.nature.com/scientificreports/ further influences disease susceptibility and disease progression 22 . High proportion of aberrant methylation being associated with cancer-associated CNAs while not somatically acquired epigenetic defects 23 . In our study, we found that mRNA upregulation is the most common alterations type of INHBA in HNSCC patients. This alteration may be caused by the overexpression of INHBA. We speculate that abnormal expression and dysfunction of INHBA result from protein modification and further cause changes in translation control and post-translation control. In addition, Kaplan-Meier survival analysis exhibited that high INHBA expression was related to overall survival and progression-free survival in HNSCC patients. The function of the identified neighbor genes of INHBA was involved in the activing binding, protein complex form, regulation of protein and several cancer processes. Thus, INHBA modifications is engaged with the center hub of post-transcriptional regulation, which is firmly related to protein translation.

Upregulation
Activins and inhibins assumed play a role in different physiological procedures through endocrine as well as autocrine or paracrine mechanisms 24 . They are also engaged with cell growth, proliferation, differentiation, apoptosis and carcinogenesis 25 . In order to start the activin cascade pathway, activin ties to a complex and triggers phosphorylation of the receptor and starts actuation of Smad proteins 26 . Studies revealed that high expression of INHBA gene was related to significantly poorer 5-year survival rate 27,28 . Previous studies have confirmed INHBA expression is related tumor invasiveness and promoting metastasis but the specific pathogenesis is unclear. Seder et al. 29 found that INHBA was overexpressed and advanced cell proliferation via promoter demethylation and histone acetylation in esophageal adenocarcinoma. Kaneda et al. 30 showed that activin A hinders vascular endothelial cell growth and further suppresses tumor angiogenesis in gastric cancer. Chang et al. 31 also discovered that activin A advances tumor invasion and metastasis in HNSCC. Our study can provide information for further potential researches on regulatory mechanism. Using GSEA enrichment analysis of INHBA can help find vital networks of target kinases, miRNAs and transcription factors. Kinases and their related signaling pathways help balance and repair genomic DNA 32 . We found that INHBA in HNSCC is linked to a network of kinases including GRK3 and CSNK1E. GRK3 specifically phosphorylates activated beta-adrenergic and related G-protein coupled  www.nature.com/scientificreports www.nature.com/scientificreports/ receptors. As an important signal regulator widely distributed in the body, GRK3 can keep cells de-sensitized during excessive signal stimulation 33 . It is reported in the literature that GRK3 is highly expressed in prostate cancer cells and can promote angiogenesis and distant metastasis of prostate cancer 34 . So we assume that INHBA and GRK3 play a synergistic role in the HNSCC. MiR-26b, is lowly expressed in various cancer tissues such as breast cancer and esophageal cancer, and participates in the growth process of tumor cells 35,36 . Jin et al. 37 first demonstrated that miR-26a/b can advance apoptosis by means of suppressing the expression of autophagy initiator ULK1. He et al. 38 indicated that miR-26 induced apoptosis and restrained autophagy based on the TGF-β1-JNK signaling pathway, recommending that miR-26 could be a useful novel target for the treatment of non-small cell lung cancer (NSCLC). We suggest that statement of miR-26a/b could induce apoptosis in HNSCC cells and needs more researches to convince. CP2 is a transcription factor that has a place with the Drosophila grainyhead-like gene family, and has been found to invigorate transcription of the a-globin gene 39 .In addition, CP2 have been implicated as being tumor suppressors in various human cancers. Despite their physiological importance, little is known about their structure and DNA binding mode 40 . In our study, we suggested that CP2 as a transcriptional factor that regulates by INHBA gene in light of HNSCC and further studies should test this theory.
Our study distinguished miRNAs that were related to INHBA using Targetscan online tool. To examine the function of the identified miRNAs, biological enrichment was performed via Funrich database. Biological processes were significantly enriched in regulation of nucleic acid metabolism; signal transduction; cell communication. Cell component were primarily enriched in the nucleus, cytoplasm, golgi apparatus. Molecular function was mainly enhanced in Transcription factor activity, transcription regulator activity. MiR-133 can significantly   www.nature.com/scientificreports www.nature.com/scientificreports/ inhibited the migration and invasion of the cancer cells 41 . Study indicate that miR-153 affects the progression of nasopharyngeal cancer (NPC) by targeting the TGF-β2/Smad2 signaling pathway 42 . Overexpression of miR-140-3p inhibited cell proliferation, migration and invasion 43,44 . While dysregulation of these miRNAs of INHBA overexpression in HNSCC is extremely rare and need more researches to confirm. A superior comprehension of the interplay between the tumor microenvironment and the infiltrating immune cells is crucial. Characterization of the adaptive immune response has been appeared to be an indispensable prognostic tool in a wide scope of carcinomas, possibly significantly more important than the present cancer staging system 45,46 . In our Immune infiltrates study, cumulative survive showed that B cells of immune infiltrates statistically significant of INHBA in HNSCC indicating that B cells significantly affecting the prognosis and might have a prognostic significance, but it is worth further research and exploration.
This study gives staggered evidence for the significance of INHBA in head and neck squamous cell carcinoma and its potential role as a novel biomarker. Our outcomes recommend that INHBA overexpression in HNSCC has profound impacts in the center hub of post-transcriptional regulation, which is firmly related to protein translation. Meanwhile, we also examine the function of the identified miRNAs that were related to INHBA and molecular function of these miRNAs were mainly enhanced in transcription factor activity, transcription regulator activity. In addition, B cells of immune infiltrates affecting the prognosis and might have a prognostic significance related to INHBA in HNSCC. This study utilizes online tools dependent on the most prevalent bioinformatics theories to perform target gene analyses on tumor data information from open databases and empowers expansive scale HNSCC genomics research and subsequent functional explore 47 . Meanwhile, our study has limitations, one is that our results not verified in clinical samples and cannot provides precise clinical data, another is that due www.nature.com/scientificreports www.nature.com/scientificreports/ to the histological types and multiple anatomical sites and of HNSCC the tumor markers may vary widely. We will do a lot of related research in the follow-up.

We studied INHBA expression, mutations, regulation, function networks and immune infiltrates in data from patients with HNSCC based on different open databases by utilizing mul-
ti-dimensional analysis strategies. Oncomine analysis. The DNA copy number and mRNA expression of INHBA in HNSCC were investigated inside the Oncomine 4.5 database. Oncomine (www.oncomine.org) contains 715 gene expression data sets and 86,733 samples, is also the biggest oncogene chip database and incorporated data mining platform worldwide 48 . This analysis drew on a series of HNSCC studies, including Peng HNSCC, Sengupta HNSCC, Ginos HNSCC, Ye HNSCC, Pyeon HNSCC, and TCGA HNSCC studies [49][50][51][52][53] . INHBA expression was involved in evaluated in HNSCC tissue in respect to its expression in normal tissue, and P < 0.05 as the cutoff criterion considered statistically significant.
UALCAN analysis. UALCAN (http://ualcan.path.uab.edu) is a user-friendly, intelligence web asset for analyzing, integrating and discovering cancer transcriptome data and in-depth analyses of TCGA gene expression information 54 . TCGA uses large-scale sequencing-based genomic analysis technology to understand the molecular mechanisms of cancer through extensive collaboration. One of the portal's highlights features is that it enables users to distinguish biomarkers or to perform in silico approval of potential genes of interest and evaluate gene expression in different tumor subgroups, such as gender, age, race, tumor grade et al. www.nature.com/scientificreports www.nature.com/scientificreports/ c-BioPortal analysis. The cBioPortal (http://cbioportal.org) 55 is an open-access asset gives visualization, analysis and download of substantial scale cancer genomics data sets which portal currently containing 245 cancer studies. We utilized c-BioPortal to analyze INHBA alterations in the TCGA HNSCC samples and shows an overview of genetic alterations per test in INHBA. Besides, using the Kaplan-Meier analysis in cBioPortal can evaluate the effect of gene expression dysregulation on the patient's overall survival and disease-free survival. A tab biological interaction network of the INHBA and their co-expression genes was analyzed and neighboring genes with alteration frequencies were included. The Enrichr (https://amp.pharm.mssm.edu/Enrichr/) (version 6.8) 56 which is a useful online platform database that incorporates biological information and gives a thorough set of functional annotation data of genes as well as proteins for users to analyze the functions or signaling pathways. So, we performed GO and KEGG pathway enrichment analyses of the most frequently modified neighbor genes using Enrichr. P-Value < 0.05 as statistically significant.
LinkedOmics analysis. LinkedOmics (http://www.linkedomics.org/ login.php) 57 is openly accessible entry that includes multi-omics information from each of the 32 TCGA cancer. The LinkCompare module utilizes visualization functions and meta-analysis to look at and integrate affiliation results created by the LinkFinder module, which allows multi-omics analysis in a cancer type or pan-cancer analysis and enables user to scan for hopeful target genes of transcriptional factors, protein kinases or microRNAs. LinkedOmics was utilized to study genes differentially expressed related to INHBA in the TCGA HNSCC cohort (n = 520) and results were analyzed statistically using T-test then GSEA was utilized to perform analyses of GO and KEGG pathways analysis.
TargetScan analysis. TargetScan (http://www.targetscan.org/vert_72/) 58 is a flexible web interface for predicts biological targets of miRNAs. TargetScanHuman considers matches to human 3′ UTRs and their orthologs, as characterized by UCSC whole-genome alignments. As an alternative, predictions are ranked by their likelihood of conserved targeting 59 . FunRich (http://www.funrich.org/) 60 is an independent programming tool intended to deal with variety of gene/protein data sets regardless of the organism and for functional enrichment and interaction network analysis of genes and proteins. Currently, TargetScan was utilized to study miRNAs differentially expressed in connection with INHBA and then we used Funrich tool for miRNA enrichment analysis, including Biological process, Cellular component, Molecular function and Biological pathways.
TIMER analysis. TIMER (https://cistrome.shinyapps.io/timer/) 61 is a comprehensive asset for systematical investigation of immune infiltrates over various malignancy types. The abundances of six immune infiltrates (CD8+ T cells, B cells, CD4+ T cells, Macrophages, Neutrphils and Dendritic cells) are assessed by our statistical method, which is approved using pathological estimations. This web server enables users to input function-specific parameters, with resulting figures dynamically showed to conveniently access the tumor immunological, clinical, and genomic features. Using Gene module to explore correlation between INHBA expression and abundance of immune infiltrates in HNSCC; Survival module to draw Kaplan-Meier plots for immune infiltrates and INHBA to picture the survival differences. We also provides the comparison of tumor infiltration levels among tumors with different somatic copy number alterations for INHBA in HNSCC. In addition, we explored the differential expression between tumor and normal tissues for INHBA over all TCGA tumors. P-Value < 0.05 was as statistically significant.