ESCC ATLAS: A population wide compendium of biomarkers for Esophageal Squamous Cell Carcinoma

Tungekar, Asna; Mandarthi, Sumana; Mandaviya, Pooja Rajendra; Gadekar, Veerendra P.; Tantry, Ananthajith; Kotian, Sowmya; Reddy, Jyotshna; Prabha, Divya; Bhat, Sushma; Sahay, Sweta; Mascarenhas, Roshan; Badkillaya, Raghavendra Rao; Nagasampige, Manoj Kumar; Yelnadu, Mohan; Pawar, Harsh; Hebbar, Prashantha; Kashyap, Manoj Kumar

doi:10.1038/s41598-018-30579-3

Download PDF

Article
Open access
Published: 24 August 2018

ESCC ATLAS: A population wide compendium of biomarkers for Esophageal Squamous Cell Carcinoma

Asna Tungekar^1,2^na1,
Sumana Mandarthi^1,3^na1,
Pooja Rajendra Mandaviya^1,2^na1,
Veerendra P. Gadekar^1,2,10^na1,
Ananthajith Tantry^1,4,
Sowmya Kotian^1,2,
Jyotshna Reddy^1,2,
Divya Prabha¹,
Sushma Bhat^1,2,
Sweta Sahay¹,
Roshan Mascarenhas^1,2,9,
Raghavendra Rao Badkillaya^1,11,
Manoj Kumar Nagasampige^1,12,
Mohan Yelnadu^1,4,6,7,
Harsh Pawar⁷,
Prashantha Hebbar^1,2^na2 &
…
Manoj Kumar Kashyap^1,5,8,10^na2

Scientific Reports volume 8, Article number: 12715 (2018) Cite this article

4471 Accesses
21 Citations
5 Altmetric
Metrics details

Subjects

Oesophageal cancer

Abstract

Esophageal cancer (EC) is the eighth most aggressive malignancy and its treatment remains a challenge due to the lack of biomarkers that can facilitate early detection. EC is identified in two major histological forms namely - Adenocarcinoma (EAC) and Squamous cell carcinoma (ESCC), each showing differences in the incidence among populations that are geographically separated. Hence the detection of potential drug target and biomarkers demands a population-centric understanding of the molecular and cellular mechanisms of EC. To provide an adequate impetus to the biomarker discovery for ESCC, which is the most prevalent esophageal cancer worldwide, here we have developed ESCC ATLAS, a manually curated database that integrates genetic, epigenetic, transcriptomic, and proteomic ESCC-related genes from the published literature. It consists of 3475 genes associated to molecular signatures such as, altered transcription (2600), altered translation (560), contain copy number variation/structural variations (233), SNPs (102), altered DNA methylation (82), Histone modifications (16) and miRNA based regulation (261). We provide a user-friendly web interface (http://www.esccatlas.org, freely accessible for academic, non-profit users) that facilitates the exploration and the analysis of genes among different populations. We anticipate it to be a valuable resource for the population specific investigation and biomarker discovery for ESCC.

Whole-genome sequencing of 508 patients identifies key molecular features associated with poor prognosis in esophageal squamous cell carcinoma

Article 12 May 2020

Multi-omic characterization of genome-wide abnormal DNA methylation reveals diagnostic and prognostic markers for esophageal squamous-cell carcinoma

Article Open access 25 February 2022

Dissecting esophageal squamous-cell carcinoma ecosystem by single-cell transcriptomic analysis

Article Open access 06 September 2021

Introduction

Esophageal cancer (EC) is eighth most prevalent cancers worldwide with an estimate of 456,000 new cases in 2012, and the sixth most common cause of death from cancer eliciting 400,000 estimated cases¹. Surprisingly in China alone, the estimated number of new EC cases and EC caused deaths are recorded as 291,238 and 218,957 respectively². EC can be classified as esophageal adenocarcinoma (EAC) and esophageal squamous cell carcinoma (ESCC) according to the type of cells that are involved. EAC mainly occurs in the cells of mucus-secreting glands in the esophagus, whereas ESCC affects the thin cells that line the surface of the esophagus³. The incidence of EAC is predominant in the United States, whereas ESCC exhibits a greater geographical diversity in incidence, mortality and sex ratio especially between Eastern and Asian countries such as Turkey, Iran, Kazakhstan and especially northern and central China. More than half of global ESCC cases are recorded in China and accounts for 90% of the total EC cases alone^2,4,5,6.

Some of the common and well-known risk factors associated with ESCC in Asian countries are tobacco/opium smoking, consumption of alcohol and chewing nass or areca nut often mixed with tobacco. In addition to these, certain regional food habits are also identified as the potential risk factors associated with the cause of ESCC e.g. consumption of beverages such as tea and coffee (linked to amount and temperature), consumption of nitrogenous component rich food, boiled yellow butter, and moldy cheese etc^7,8. Most often these risk factors are strongly associated with sub-populations as they are linked to region-specific lifestyles. For example, although the use of tobacco and alcohol are clearly the major risk factors for developing ESCC, they are not accounted in Northern China (specifically Anyang region) because of the moderate alcohol consumption in this region⁹. Similarly, tobacco and opium smoking are recognized as the main cause for ESCC in Iranian population¹⁰, however, in Linxian, Shanxi province, China the dietary habits are suspected to be the main causal factor instead of smoking¹¹. Several cohort studies have indicated that the obesity is positively associated with EAC, whereas negatively associated with ESCC in the European and USA population^12,13. The increase in body mass index (BMI) is seen as a protective factor against ESCC in a Chinese cohort study¹⁴. This indicates for the existence of a regional difference in the risk factors for ESCC. Nevertheless, to date, there are no concrete studies available that can categorize the risk factors in respect to the regional population groups.

Apart from the lifestyles, genetic variations in the diverse population such as single nucleotide polymorphism (SNP)¹⁵, and structural variation¹⁶, along with epigenetic modifications such as DNA Methylation^17,18,19, and Histone modification (HM)^20,21, are also known to play a major role in ESCC. Due to the epidemiological differences for ESCC between the diversity of a population, there is an increased complexity in the selection of population-specific treatment. Hence, in order to get to the roots of ESCC, it becomes important to understand the population-specific cause of ESCC in the genomic and proteomic level.

Over the last decade, a number of studies are published on ESCC using genomics and proteomics high throughput techniques. Unfortunately, the generated data remain scattered in literature and unavailable as a compendium to the scientific community; An effort to compile this information in past resulted in a manually curated “Dragon database of genes involved in esophageal cancer (DDEC)²². However, DDEC is not updated recently. Focusing mainly on ESCC, here we have developed a new database called ‘ESCC ATLAS’ that catalogs all the genes and related molecular signatures that are involved in ESCC based on an exhaustive literature survey until May 2018. As of now the ESCC ATLAS contains 3475-curated gene from 403 unique publications from China (205), Japan (74), India (16), Korea (6), Iran (5), America (2), South Africa (4), Italy and Germany representing Europe (5) (Numbers in the bracket represents total publications from the corresponding region).

Materials and Methods

Data Collection and Screening

To ensure the highest quality in data collection process, all the entries in the ESCC ATLAS were manually collected based on a systematic search of the published literature in the PubMed/PubMed Central database. The keywords used to search for the relevant articles were designed considering the combination of the terminologies related to ESCC and its associated molecular alterations. The articles were considered eligible only if the reported clinical studies were conducted on human patients and if they included the (i) SNP, (ii) Copy number variation (iii) methylation, (iv) miRNA, (v) histone modifications, (vi) transcriptomic or (vii) proteomic regulation data (together referred as molecular signature hereafter) relevant to the etiology, pathophysiology of ESCC. To maintain the high standards of the collected data, special efforts were taken to catalog only those genes or relevant molecular signatures that were shown to have a significant association with ESCC by recording the reported statistical test values such as the p-value or fold change. Additionally, each entry in ESCC ATLAS was tagged with the cell line information (if reported), sample size, study method type and most importantly the population group name by making use of the reported location of the sample collection centers (hospital) or the reported population from which the study sample were collected. To fetch maximum information possible from each surveyed article, the submitted supplementary data were reviewed and included in ESCC ATLAS. Additionally, references cited in the articles were also considered for the data extraction when found relevant. The schema for the plan of annotation and biocuration has been shown in Fig. 1.

Database Organization and Web Interface

The web application (front and back end) is developed using ExtJS (version 4.3), which facilitates efficient and quick multi-column search. To represent population specific data in the world map view, GoogleMap.js (ExtJS built-in plugin) was implemented. All the data in ESCC ATLAS were stored and managed using MySQL (version 5.0.51 b). The web services were built using Apache (version 2.2.17) and PHP (version 5.2.14) as server side scripting language. The whole system is hosted under Red Hat Enterprise Linux 5 environment.

Population specific ESCC incidence and ESCC ATLAS entries

To get an overview of the worldwide population specific ESCC incidence, we collected the age-standardized rate of ESCC incidence available from GLOBOCAN 2012, and compared the number of collected molecular signatures for different populations in our survey.

Functional enrichment analysis

Gene Ontology (GO) analysis was performed across all the three domains, Biological process, Molecular Function, and Cellular component. The populations analyzed were: Indian, Chinese, Japanese, Iranian, Korean and South African. The GO annotations associated to all 3475 genes from ESCC ATLAS were downloaded using biomaRt Bioconductor package in R (Ensembl release 89)²³. An in-house R script was used to identify the overrepresented GO terms by comparing the proportion of genes annotated to a specific GO term in each population (test gene group) against the total protein-coding genes in human genome (background gene group). The prop.test() function in R was implemented for the comparison, which calculates a chi-squared statistic to test the null hypothesis that the proportion of test and the background set of genes annotated to a specific GO terms are the same. Since multiple comparisons are performed in this analysis, the p-values thus obtained are adjusted to narrow down the chances of false discoveries using the False Discovery Rate method^24,25 implemented in the p.adjust function in R. Additionally, for each of the GO term an enrichment score (ES) is calculated, where ES is, (Total number of annotated test genes/Total number of annotated background genes) * 100.

The overrepresented GO terms among the genes corresponding to each population were then selected based on the p-value cut off of <0.01 and summarized as scatterplots using REVIGO²⁶. The enriched GO terms and their corresponding ES were supplied to the to REVIGO web server (http://revigo.irb.hr/). The option concerning semantic similarity measure was set to SimRel and the database for GO term size was set to Homo sapiens. The REVIGO checks for the redundancy of the GO terms represented by bubbles into a two-dimensional space by applying the multidimensional scaling to a matrix of GO terms semantic similarities.

Protein-protein interactions

To get further insights on the important genes from the GO enrichment analysis, we looked for protein-protein interactions (PPI) networks using the PEPPER (Protein complex Expansion using Protein-Protein intERaction networks) application in Cytoscape²⁷. PEPPER identifies the meaningful pathways/complexes as densely connected sub-networks from seeds (lists of input protein coding genes) using the information from protein interactions available in BioGRID database (in this analysis)²⁸. PEPPER includes a topological and function-based post-processing pipeline for ranking the newly added proteins (expansions) according to their relevance to the seeds. The expansions are scored according to their co-occurrence with the seeds based on their impact on the overall connectivity of the subnetwork. The default parameter settings were used for the PEPPER algorithm. The output PPI network view were manually customized by hiding or showing the proteins/edges that were added by PEPPER algorithm to make it more readable in respect to the seeds. The interaction networks between the expansions are not shown.

Statistical Analysis

The studies took into consideration for annotation purpose were considered only if a given molecule had either 2-fold up or downregulation, and/or differential regulation with p < 0.05. Since, the data included in the study is derived from the previously published studied by the scientist from all around the world, we applied these two-criteria only to avoid any tweaking of the data.

“Ethics approval” and consent to participate

The study does not involved human, animal or cell lines as a material for experimental purpose. The data involved in the study is curated exclusively using published studies available through PubMed or PubMed Central (PMC) or Google search based research articles or sources.

Results

Data Summary

We presented the schema for collection of the data for this study in Fig. 1. We collected a total of 3475 non-redundant ESCC associated genes that fulfills the eligibility criteria to be included in ESCC ATLAS. The total number of published research articles referred for the data extraction for each molecular signature is shown in Dataset 1.

Based on our literature survey and the ESCC incidence data available from GLOBOCAN 2012, we identified that despite of a very high incidence rate of ESCC among Iranian and South African population only few published studies are available focusing on these population that could aid the ESCC investigation and biomarker discovery (Fig. 2).

All the data in ESCC ATLAS is freely accessible via user-friendly web interface at http://www.esccatlas.org. Additionally, ESCC ATLAS also provides an option for the bulk data download as flat files which is useful for other downstream analysis.

Collection of population specific ESCC associated molecular signatures

Genes involved in the transcriptomic and proteomic regulation of ESCC

We found majority of the transcriptomic and proteomic signatures associated to ESCC were uniquely identified for different populations suggesting a population specific ESCC etiology that could be linked to different lifestyles and environmental exposures (Fig. 3A and B).

Transcriptome and proteome landscape of ESCC in global population

A total of 2592 unique transcriptomic signatures (genes) were curated. The expression status of these genes was studied and their distribution in different populations was analyzed. Gene regulation differences between populations with respect to the biological sample used to conduct the study was analyzed using transcriptomic data. It was found that the regulation remained consistent between populations in the same biological material. 17 genes were found where the biological material used differed, however, the regulation status remained consistent between populations. Out of these, only one gene, glutathione peroxidase 3 (GPX3) showed irregularities or difference in regulation between populations in different biological materials. GPX3 transcriptomic data was recorded for Chinese and Indian populations. Its expression was found to be downregulated in cell line samples of Chinese population, whereas in tissue samples of the same population it was found to be upregulated. Proteomic data was also analyzed to check for any inconsistencies with respect to biological material. For GPX3, expression regulation and status was found to be consistent between all population levels in proteomics in same or different biological materials.

An expression of GPX3 was downregulated in ESCC as compared with normal samples^29,30. Furthermore, GPX3 methylation was significantly higher in ESCC as compared to normal tissue samples³¹.

GPX3 was among an 8-gene signature panel that had been used for predicting the status (normal versus tumor) of gastric tissues with an accuracy of >96%. As an outcome of promoter hypermethylation, GPX3 was down regulated in gastric cancer³². Furthermore, an in vitro study an association between GPX3 and lymph node metastasis was found in gastric cancer upon downregulation of GPX3 expression and promoter hypermethylation³³.

In a previous study, GPX3 was methylated in prostate cancer³⁴ and an in vitro overexpression of GPX3 in prostate cancer cell lines could suppress formation of colonies as well as growth of cells in an anchorage-independent manner, and reduce the invasiness of the prostate cancer cells indicates the tumor suppressor activity of GPX3³⁵. In case of breast cancer as well GPX3 was downregulated at mRNA as well as at protein levels in the inflammatory breast cancer as compared with non-IBC. Hypermethylation of GPX3 promoter was observed in breast cancer, but not in normal tissues³⁶.

Population levels for transcriptomic data were found to be the following: Chinese, Japanese, and Indian. A total of 2060 genes in the Indian Population, 977 in the Chinese population, 203 genes in the Japanese population, and 163 ESCC-related genes were associated with the unknown/mixed population. A study of common and unique genes between combinations of different populations showed that the Chinese and Indian populations have the highest number of genes common between them. Only 1 gene (CSK2) was found to be common between all four populations, and was found to be upregulated.

Similarly, genes curated for proteomic data were also compared. Population levels were: Indian, Chinese and Japanese. The Indian population (293) had the highest number of genes curated, followed by Chinese (31) and Japanese (13). An analysis of common and unique genes between the populations revealed that only 3 genes (CDH1, EZR, YBX1) were found common between Indian and Chinese populations.

Among these genes E-cadherin (CDH1) is a tumor suppressor involved in epithelial cell-cell interactions. It’s a calcium-dependent cell adhesion protein involved in tumor invasiveness. CDH1 has been reported to be hypermethylated in previous studies on ESCC^{37,38,39,40,41,42}, and downregulation and/or loss of CDH1 has been documented in ESCC^40,41. Interestingly, there was lack of association between -160C/A SNP of CDH1 and ESCC development⁴³.

Ezrin is a cytoplasmic peripheral membranous protein, which is a member of the ezrin/radixin/moesin (ERM) family of proteins linking plasma membrane to F-actin cytoskeleton. Expression of EZR has been associated with invasiveness and lymph node metastasis of ESCC^44,45. Translocation of EZR protein from plasma membrane to cytoplasm was observed in ESCC cells⁴⁶. The knockdown inhibition of EZR led to decrease in growth, adhesion, and invasiveness of the ESCC cells in vitro and ability to induce tumorigenesis in in vivo conditions⁴⁷.

Y-box binding protein 1 (YB-1 or YBX1) is an RNA and DNA binding factor. YBX1 expression had been observed not only in endothelial cells but in esophageal cancer cells as well. YBX1 gene was upregulated and overexpressed in ESCC as compared with normal samples^48,49. An elevated level of YBX1 protein associated with higher recurrence & lower survival in ESCC patients⁴⁹.

Among another set of genes, we found two metalloproteinases (MMPs) genes (MMP2 and MMP3). These found in between Chinese and Japanese populations. These Zinc dependent MMPs including MMP2, MMP3, and MMP9 along with serine proteases play an important role in ESCC pathogenesis^50,51. An overexpression of MMP2 has been reported in ESCC tissues as compared to adjacent normal epithelia⁵², and MMP3 SNP (MMP3 -1612 5A/6A) polymorphism was significantly associated with susceptibility to ESCC means ESCC subjects bearing 5A allele were more prone of getting ESCC as compared with 6A allele⁵⁰.

There was not even a single found common between the Indian and Japanese populations, and same scenario was observed when we compared Indian, Chinese and Japanese populations.

Upon careful study of derived transcriptomic and proteomic population data, it is found that the highest number of recorded cases was found in the Indian population, followed by the Chinese and finally, the Japanese population. A list of complete and unique genes is provided in the supplementary material. The Venn diagram package from CRAN repository was used to create a graphical representation depicting common and unique gene distribution between different populations at the transcriptomic and proteomic levels (Fig. 4A and B).

SNPs in ESCC

In risk variants survey for ESCC, we found 85 genes harboring one or more risk variants in Chinese, 8 in Iranian, 5 in Indian, 4 in South African, 3 in European, and 1 in Japanese populations but none in Korean or unknown/mixed populations. Examining these population specific genes with that of transcriptome and proteome data, we could determine only 1 of the genes (PADI4) in Chinese (19 in transcriptome, 1 in proteome), but none in Japanese, Indian (4 in transcriptome), South Africa, European and Iranian populations that link with both altered transcriptomic and proteomic status. However, when we studied the transcriptomics and proteomics of global populations irrespective of population specificity in consideration, we could identify 8 genes: GSTP1, PADI4, CCND1, FEN1, ADH1A, CRNN and S100A14 (40 in transcriptome; 11 in proteome) that were linked with both altered transcriptomic and proteomic status among 95 total unique genes harboring one or more risk variants (Supplementary Dataset # 2). Among these genes, PADI4 encode for protein arginine deiminase 4 catalyzes the hydrolytic deimination of arginine residues to produce citrulline and ammonia. Elevation of PADI4 was observed at mRNA as well as at protein levels in ESCC as well as in EAD⁵³. It is an intracellular protein but it had been detected in pIasma samples of the cancer patients as well. In SNPs based study on PADI4, rs2240337 G > A SNP was found to be significantly associated with decreased risk of ESCC⁵⁴. Earlier findings in SNPs study on PADI4 in EC reported rs10437048 and rs41265997 were significantly associated with decreased and increased risk of EC, respectively⁵³.

Methylation in ESCC

We identified 51 genes with altered methylation status for Chinese (hyper: 50, hypo: 1), 22 for Japanese (hyper: 22, hypo: 0), 8 for Korean (hyper: 8, hypo: 0), 2 for Indian (hyper: 2, hypo: 0), and 2 for Iranian (hyper: 2, hypo: 0) populations. Examining these population specific genes (altered methylation) with that of transcriptome and proteome data, we could determine only 3 of the genes (GADD45A, UPK1A, and C2orf40) in Chinese (15 in transcriptome, 3 in proteome), 1 (TIMP3) in Japanese (3 in transcriptome, 1 in proteome), 1 (CLDN4) in Korean (4 in transcriptome, 1 in proteome), but none in Indian and Iranian populations that were linked with altered transcriptomic and proteomic status. However, when examined with transcriptomics and proteomics of global populations without specific populations into consideration, we could identify 16 of them (44 in transcriptome; 24 in proteome) were linked with both altered transcriptomic and proteomic status among 95 total unique methylated genes (hyper: 85; hypo: 4; unknown status: 6) (Dataset # 2).

Histone modification in ESCC

We could find 6 genes with altered histone modifications for Chinese (Acetylation 1; phosphorylation 6, deacetylation 1, gene PTK6 has both Deacetylation, Phosphorylation evidence), 4 for Japanese (Acetylation 3; phosphorylation 1, deacetylation 0), 1 for Korean (Acetylation 0; phosphorylation 1, deacetylation 0), and none for Indian and Iranian populations (no HM studies in these two). Examining these population specific genes (altered HM) with that of transcriptome and proteome data, we could determine only 3 of the genes in Chinese (0 in transcriptome, 3 in proteome), 1 in Japanese (0 in transcriptome, 1 in proteome), 1 in Korean (0 in transcriptome, 1 in proteome), link with altered proteomic status. However, none of the population specific gene could link to transcriptomic changes. Upon examining all the 18 unique HM altered genes (18; Acetylation 6; phosphorylation 11, deacetylation 2) with transcriptomics and proteomics of global populations without specific populations into consideration, we could identify 5 (PTK6, KLF4, MAGEA3, PIK3C2B, WDHD1) of them (5 in transcriptome; 16 in proteome) were linked with both altered transcriptomic and proteomic status (Dataset # 2).

Structural variation in ESCC

We found 92 (amplification: 35, Deletion: 6, LOH: 52) unique genes overlapping/published on the structural variations regions from so far published literature in Chinese, 91 (Amplification: 43, Deletion: 47, loss of heterozygosity i.e. LOH: 1) in Indian, 25 (Amplification: 22, Deletion: 13) in South African and 2 (Amplification: 1, LOH: 1) in Japanese populations. Examining these population specific genes with that of transcriptome and proteome data, we could determine only 4 of the genes (CCND1, CTTN, FAS, CDC25B) in Chinese (60 in transcriptome; 4 in proteome), 3 (20 in transcriptome; 3 in proteome) of the genes (GSN, KRT13, COL12A1) in Indian, but none in SA and Japanese populations, link with both altered transcriptomic and proteomic status. However, when examined with transcriptomics and proteomics of global populations without specific populations into consideration, we could identify 17 (CCND1, CTTN, FAS, CDC25B, KRT13, COL12A1, GSR, IL1RN, PDCD4, TP63, SERPINH1, PSMA6, CFL1, GNPNAT1, GSTP1, PRKCI) (Amplification: 12, Deletion:1, LOH: 4) of them were linked with both altered transcriptomic and proteomic status among 304 total unique genes published on structural variations so far (refer supplementary data).

miRs in ESCC

Each collected miRNA entries in ESCC ATLAS were mapped to their target genes by making use of the miRTarBase (a resource for experimentally validated miRNA targets) and www.microRNA.org (resource for predicted targets using miRanda algorithm). Currently, we have curated a total of 184 unique miRNAs in our database, out of which 61 miRNAs were mapped to their experimentally validated target genes listed in ESCC ATLAS. The miRNA entries were also checked for one to many mappings to their target genes, as a single miRNA can have more than one target gene(s). For example, from our curated list of miRNAs, hsa-miR-107, hsa-miR-326, hsa-miR-375 and hsa-miR-320a have multiple target genes.

A total of 184 miRNAs were studied, some of those were repetitive due to the redundancy of their respective target genes. The number of target genes for these cannot be accurately estimated due to the large number of target genes for a single miRNA species. The populations included in the study included Chinese, Japanese, and Iranian. We identified uniquely, 38 miRNA linked to the Chinese population, 25 in the Japanese Population, 1 in the Iranian and 2 in the unknown/mixed population. Upon comparing miRNA target gene regulation with that of transcriptomic and proteomic regulation data, we found two genes to show consistency in regulation status between miRNA transcriptomic and proteomic data. In the Chinese population, hsa-miR-451a was found to be downregulated. It’s target gene MMP-9 was found to be upregulated in the Chinese population, in both transcriptomic and proteomic datasets. Similarly, another gene FSCN1 had upregulated transcriptomic and proteomic status, is a target gene for hsa-miR-133b, which was found to be downregulated in the Chinese population. This corroborates the miRNA-gene-silencing mechanism, which is reflected in this analysis.

Population-wide GO analysis

Several GO terms (with respect to Biological process, Molecular function and Cellular component) found through analysis, were common/overlapping and unique between populations such as Chinese, Indian, Japanese, South African and unknown/mixed (Fig. 5A,B and C). The list of all overrepresented GO terms and the number of genes associated to them are shown in the Dataset # 3.

Certain over-represented GO terms were found to overlap between populations (Fig. 5), for e.g. the GO terms “negative regulation of apoptosis”(11% of all test genes), “positive regulation of cell proliferation”(10% of all test genes) which are the hallmarks of cancer, were found to be over-represented in the Chinese, Indian and unknown/mixed populations.

The GO term “positive regulation of cell proliferation” was found to exist in unknown/mixed, Chinese and Indian population. Among the test genes, we found that STAT3, CCND1, FGFR2 were of significance when compared to their regulation status in our curated data. FGFR2 was observed as overexpressed in the unknown/mixed population. In the Chinese population, STAT3 is commonly found overexpressed in ESCC and is associated with invasion of ESCCs⁵⁵. Amplification and overexpression of CCND1 was found in ESCC in the North-eastern Chinese population⁵⁶. Lastly, frequent gene amplification of FGFR2 is found in ESCC specimens⁵⁷. This suggests that upregulation of STAT3, CCND1 and FGFR2 derails positive regulation of cell proliferation in ESCC. To get further insight into PPI network for STAT3, CCND1 and FGFR2, we used PEPPER to identify only few direct interactions within our input list of proteins (seeds, purple and green nodes) (Fig. 6). However the proteins (expansions) added by PEPPER algorithm constitutes a complex interaction network that connects together almost all of our input protein coding genes. The predicted network provides a global aspect for PPI of the key protein coding genes that could be involved in ESCC e.g STAT3 (Signal Transducer and Activator of Transcription 3), which is used as the bait protein in PEPPER PPI analysis clearly shows direct interactions between FGFR2 (Fibroblast growth factor receptors) and CCND1 (Cyclin D1). Indeed, oncogenic FGFR amplification is seen as a requirement that results in ectopic activation of the STAT3 transcriptional response⁵⁸. And the constitutive activation of STAT3 and CCND1 overexpression is accounted for the proliferation, migration and invasion in gastric cancer cells⁵⁹. Based on the interaction networks considering the expansion proteins we could observe STAT3 interacts with PML (Promyelocytic Leukemia Protein) and HDACs (Histone Deacetylase proteins). Indeed it is known that the acetylation of STAT3 and its subsequent binding with HDAC1 is involved in control of its nucleocytoplasmic distribution in Human hepatoblastoma HepG2 cells⁶⁰. Similarly, PML is known to modulate interleukin (IL)-6-induced STAT3 activation and hepatoma cell growth by interacting with HDAC3⁶¹. RELA (NFκB) is known to be constitutively active in many cancers where it up-regulates anti-apoptotic and other oncogenic genes and here we observe a direct interaction between STAT3 and RELA. Studies have shown that STAT3 are involved in prolonging RELA nuclear retention through EP300 (acetyltransferase p300-mediated) RELA acetylation, thereby interfering with RELA nuclear export⁶². In summary, based on the PPI network analysis we obtained a global aspect of interaction networks between the candidate genes shortlisted genes from the GO enrichment analysis and other key protein-coding genes that could be involved in ESCC biogenesis.

Similarly, the GO term “negative regulation of apoptosis”, was found associated with the enriched term in the Chinese and the unknown/mixed population. It is known that CASP3 is involved in the apoptosis pathway and certain genetic variants of the gene confer susceptibility to ESCC. CASP3 829A > C (rs4647602) genotype is found to be associated with risk of ESCC. Variant rs4647602 (CC genotype) significantly reduced the transcriptional activity of caspase-3 and it was associated with an increased risk of ESCC in the Chinese population⁵⁷. Furthermore, CASP3 is downregulated in the Chinese population when a comparison was made between normal versus basal cell hyperplasia⁶³, but upregulated in ESCC as compared with adjacent normal tissues⁶⁴.

Discernibly, some GO terms that may be of significance were unique for certain populations. For example, the term “cell differentiation” was uniquely associated with the unknown/mixed population. One of the genes, GRB2 that contributed to enrich the term, is found overexpressed with lymph node metastasis and poor prognosis in ESCC where cellular differentiation could be an influencing factor⁶⁵. Evidently, from our curated data, we find that GRB2 is found to be upregulated in the unknown/mixed population.

Distinctly, the GO term “DNA repair” was found only in the Chinese population with 11% of test genes associated with this term. On comparing the genes for which this term was enriched, we found that some of them showed interesting expression levels in correlation with ESCC and literature. Curated SNP data shows that the polymorphisms found in the genes XRCC1, POLQ, FANCG, FEN1, SMUG1 were directly associated with risk of ESCC in the Chinese population^66,67. These genes were also found in our GO analysis results for the term DNA repair. Expression status of some of these genes namely, XRCC1, FANCG and FEN1 was found to be upregulated in curated gene expression data. Therefore, one may infer a relation between polymorphisms in DNA repair genes that may lead to ESCC development and progression.

In the Indian population, 20% of the test genes were associated with the GO term “oxidation-reduction process”. The genes enriched for this term belong to family of cytochrome p450 enzymes (CYP1A1, CYP24A1, CYP26A1, CYP27B1, CYP2C18, CYP2C9, CYP2E1, CYP2J2, CYP3A4, CYP4B1, CYP4F12, CYP4F8, CYP4X1). Interestingly, all the genes found to be downregulated in the Indian population except CYP24A1, CYP26A1 and CYP27B1. Polymorphisms found in CYP2E1 and CYP1A1 are risk factors for development of ESCC^68,69.

Another interesting observation found in the unknown/mixed population is that of oncogene DJ-1 (PARK7). Although out of 94 test genes, only DJ-1 was associated with the GO term “negative regulation of TRAIL-activated signaling pathway”, it is found in our curated data and upregulated in the unknown/mixed population⁷⁰ (Dataset # 2). The oncogene, DJ-1 inhibits TRAIL-activated apoptosis pathway by blocking pro-caspase 8⁷¹. Incidentally, CASP8 was found to be downregulated in ESCC ATLAS in Chinese population. Overall, DJ1 plays an important role in transformation and ESCC progression, which suggests that DJ1 could be a prognostic marker for ESCC.

Alcohol consumption is widely known to be a risk factor for ESCC. In the Chinese population, GO analysis results showed enrichment for the terms “ethanol oxidation” and “alcohol metabolism”. Six of the test genes were associated with the former term and 3 of the test genes for the latter. Extensive literature review reveals that the alcohol metabolism pathway has two steps - ethanol oxidation, i.e. conversion of ethanol to acetaldehyde; and elimination of acetaldehyde from the body by conversion to acetate and finally to water and carbon dioxide. The first step carried out by involvement of two enzymes - Alcohol dehydrogenases (ADH1B) and cytochrome P450 (CYP2E1) enzymes. The second step is catalyzed by aldehyde dehydrogenase (ALDH2)⁷², it is found that variants of these genes affect the catalytic rate of the reaction in both steps⁷³. An SNP found in the ADH1B gene (rs1229984) that replaces arginine with Histidine at the 48^th position makes the enzyme catalytically faster⁷². The heterozygous (ADH1B Arg/His) and homozygous (ADH1B His/His) variants of the gene possess higher catalytic activity and are faster in converting ethanol to acetaldehyde compared to the wild type (ADH1B Arg/Arg) allele. It is these variants that are associated with risk of ESCC. Acetaldehyde is a known carcinogen. It’s accumulation in the body due to its rapid production by the variant versions of the ADH1B enzyme leads to acetaldehyde carcinogenesis. This SNP is also found curated in our SNP dataset in the Chinese population, along with an SNP found in ALDH2 (rs671). The variant alleles of ALDH2 render it inactive, thereby promoting acetaldehyde accumulation and carcinogenesis in ESCC. With numerous examples of GO based analysis shows that ESCC ATLAS provided in-depth analysis on molecules differentially regulated between ESCC versus normal across different population.

Metabolism related aspects also contribute to malignancies including EC. In recent years, relations have been established between EC types and metabolism associated parameters such as body mass index, and blood pressure. A positive association was found between BMI and EAC, and a negatively association was found with risk of ESCC¹². Obesity acts as a risk factor for BE as well as for EAC⁷⁴. The underlying mechanisms behind association or role of obesity with ECs must be further explored in light of molecules such as leptin, adiponectin, estrogen and obesity related genes⁷⁵. Interestingly adipose tissue has the ability to influences the development of tumor, which is dependent on secretor product adipokines and cytokines secreted by adipocytes and inflammatory cells, respectively⁷⁶. An abundant supply of lipids by the adipocytes in tumor-microenvironment, supports progression and uncontrolled growth of the tumor⁷⁷.

A good number of studies on single nucleotide polymorphisms (SNPs) based study was carried out on obesity related genes to find out an association with EAC, adenocarcinoma of esophagogastric junction, and ESCC, but there was no association found EGJAC or ESCC⁷⁸. Three SNPs in NER pathway particularly the variant alleles of the XPD Lys751Gln, ERCC1 8092C/A and ERCC1 118C/T were associated with an increased risk of EAC⁷⁹.

Overall, we found >600 common genes for Indian and Chinese population. The details of the genes have been provided in Dataset # 4. This large set of genes includes extracellular matrix related genes such as MMP1, MMP3, MPP7, MMP9, MMP10, MMP11, MMP12, MMP13, and MMP14. Some of the interesting genes include DUSP5 (Dual Specificity Phosphatase 5), which codes for DUSP5 protein. DUSP5 was found as downregulated in achalasia (a condition in which lack of relaxation of the lower esophageal sphincter occurs, act as a risk factor for ESCC, and 6% of the total achalasia subjects develop ESCC)⁸⁰. Furthermore, DUSP5 was downregulated in ESCC as compared with adjacent normal in Indian and Chinese studies^30,81,82. In addition to different molecules contributing to ESCC tumorigenesis in Indian and Chinese population, there is life style mainly diet pattern (Chinese and Indian particularly South Indians either prefers to have spicy food and/or consumption of pickled vegetables, and intake of alcohol and tobacco). In addition to these, to establish what are common factors among Chinese and India population, a comparative analysis is required to establish common pre-disposing and risk factors (achalasia, atrophic gastritis⁸³, an infection by cytotoxin-associated gene A (CagA)-positive Helicobacter pylori⁸⁴, injury to the esophagus, consumption of pickled vegetables, Plummer Vinson syndrome, Chaggas-associated mega-esophagus, and a history of certain head and neck cancers)⁸⁵.

A quick search for the genes and their molecular signatures associated to ESCC in different population groups could be easily seen in 3 steps as shown in Fig. 7. Various useful external links are also embedded in the search results, for example, gene IDs, symbols, and gene alias from HUGO Gene Nomenclature Committee (HGNC), human protein reference database (HPRD) and Ensembl databases. The gene function and associated gene ontologies could also easily browsed with the links to GO database. A PubMed link to the reference research article is also made available. Further, upon entering the gene symbol for gene of interest e.g. osteopontin (SPP1) (Fig. 6), details about population specific, mRNA or protein level expression can be seen and additional information about the molecule is possible to explore by hitting external links like HGNC, HPRD or OMIM.

Discussions

The “Omics” driven projects produced enormous amount of data. One of the biggest challenges faced by the scientific community was ‘how to handle and present this gigantic amount of data in a user friendly way’. In order to overcome, a number of cancer specific database developed including ONCOMINE (a cancer microarray database and web-based data-mining platform)⁸⁶, pancreatic cancer^87,88,89, Breast Cancer Database for breast cancer^90,91, Dragon Database of Genes associated with Prostate Cancer⁹², Renal cancer gene database on renal cancer⁹³, HLungDB: an integrated database of human lung cancer research¹⁴, Cervical Cancer Database on cervical cancer⁹⁴, curatedOvarianDat on Clinically annotated data for the ovarian cancer transcriptome⁹⁵, and Osteosarcoma, a database of osteosarcoma-associated genes⁹⁶. Furthermore, there were efforts to make repository to have peptidome (a pool of peptides) detected in cancer-associated biofluids⁹⁷.

In an earlier effort, DDEC was developed where a total number of 529 genes were reported. The database excluded the data derived from high-throughput microarray studies leaving an enormous amount of data not curated for ESCC²². Hence, with an aim of providing a central repository for scientific community in the area of ESCC, we developed ESCC ATLAS, a database system that focused on providing an in-depth resource of gene, miRNA and protein information and their relationships to ESCC overall and in a population specific manner. We made efforts to make ESCC ATLAS as a repository to have biocurated data for ESCC from low as well as high-throughput studies.

In the last 5 yrs, large amounts of data have been collected for making ESCC ATLAS. Information on ESCC data was obtained from the PubMed, PubMed central, google search engine. Genes, miRNAs, proteins, gene promoters, transcription factors, transcription factor-binding sites and the SNPs related to ESCC cancer have been collected and integrated into this system. The data collected in ESCC ATLAS was not only cataloged, but also exploited to gather regional disparity with ESCC research activities and molecular signatures discovered so far. Interestingly, cross-comparison of incidence of ESCC, number of molecular studies and total molecular signature identified between different populations suggests that despite high incidence of ESCC in Iranian and South African population low number of molecular studies lead to low number of molecular signatures identification. The 3475 genes/proteins in ESCC ATLAS are integrated in a way that investigator can rapidly query whether a gene or protein is found in human ESCC, and other detailed ESCC-related information about this gene. User-friendly query interfaces have made all the features of ESCC ATLAS easily accessible. ESCC ATLAS provides a comprehensive resource for human ESCC research. We believe that ESCC ATLAS will be particularly interesting to the life science community and will greatly facilitate cancer biologists’ mission of unraveling the pathogenesis of ESCC.

To extrapolate curated information in ESCC ATLAS, we created navigational links to other resources such as NCBI Entrez gene⁹⁸, HGNC⁹⁹, OMIM¹⁰⁰, HPRD^101,102, Ensemble¹⁰³, KEGG¹⁰⁴, WikiPathways¹⁰⁵, GO¹⁰⁶, miRBase¹⁰⁷, and DGV¹⁰⁸.

Strikingly, evaluation of transcriptome and proteome signatures among all populations suggested that from transcriptome only CSK2 found to be common with upregulation status in Chinese, Japanese, and Indian populations, whereas from proteome, genes CDH1, EZR, and YBX1 were found common between Indian and Chinese populations and genes MMP2, and MMP3 between Chinese and Japanese populations. This clearly exhibits, a significant number of population specific genes regulation involved in the etiology of ESCC between different populations.

Consolidation of GO analysis results with regulation of molecular signatures captured through our curation process, corroborate mechanism that drive progression of ESCC. The GO term “positive regulation of cell proliferation” (enriched from STAT3, CCND1, FGFR2) was found to exist in, Chinese and Indian population. Similarly, the GO term “negative regulation of apoptosis”, (enriched from CASP3) was found associated with Chinese population.

Distinctly, the term “cell differentiation” (GRB2) was uniquely associated with the unknown/mixed population, the term “DNA repair” (enriched from XRCC1, POLQ, FANCG, FEN1, SMUG1) was found only in the Chinese population, the GO term “oxidation-reduction process” (enriched from family of cytochrome p450 enzymes) were associated Indian population, and “negative regulation of TRAIL-activated signaling pathway” (enriched by oncogene DJ-1) in unknown/mixed population. Importantly, alcohol consumption is widely known to be a risk factor for ESCC, GO analysis results showed enrichment for the terms “ethanol oxidation” and “alcohol metabolism” in the Chinese population (found in Iranian as well).

Furthermore, ESCC ATLAS showed that variation between populations for a single molecule/gene are there as e.g. KRT17 gene is upregulated in 7 studies on Chinese (06) and North Indian (01) population, but downregulated in study on North-East Indian population¹⁰⁹. The further collection data in ESCC ATLAS will allow researchers to study and compare the variation between the different populations. The information and link for the genes/proteins altered in ESCC are provided which allows researchers to know whether a particular molecule bears surface expression and secretory in nature to detect it in biological fluids using assays like ELISA assay.

Over the course of time, watching the approaches undertaken in human genomics, now research community is urging for an epoch with radical revision of human genomics¹¹⁰. In nutshell, we have biocurated the literature-derived data from studies on ESCC in ESCC ATLAS covering from DNA, mRNA, miRNA to protein levels. ESCC ATLAS provides scientific community with an easy access to different aspects of information regarding molecules driving ESCC carcinogenesis. In the future, we will incorporate more different kinds of genomics, proteomics, metabolomics, and pharmacogenomics based data in further updates to ESCC ATLAS, so that the database will continue to be an informative and valuable source ESCC associated molecular alterations and serve as a key repository for basic and translational research on ESCC.

Conclusions

In conclusion, here we present ESCC ATLAS, an on-line database through which users can perform search for close to 3500 differentially regulated molecular alterations associated with ESCC tissues or cell lines. ESCC ATLAS is, as far as we know, the largest database on ESCC developed to be a user-friendly web based platform for the investigation of molecules to be queried either from the gene, miRNA or protein perspective. To make sure the relevance of ESCC ATLAS, we plan to update it annually to facilitate inclusion of new publicly available and annotation data. We strongly feel that ESCC ATLAS provides a novel and comprehensive tool for the systematic identification of molecular alterations in ESCC, which could be useful for the discovery of novel anticancer drugs targeting the ESCC, or for better understanding of the ESCC pathogenesis. All together, ESCC ATLAS is the largest database providing a comprehensive view for published data derived from either primary ESCC tissues samples or established ESCC cell lines. The data is provided in an easily available platform for the academic and not-for-profit organizations when exploring the molecular alterations that underlie ESCC. The current version of ESCC ATLAS covers DNA, mRNA, miRNA and protein level alterations. We will continue to review and assess the quality of our information and collection of ESCC-related molecular alterations in the future. For future, one of our goals is to identify the signal transduction pathway events that have significant impact on ESCC tumorigenesis and pathophysiology. In parallel, we will also catalog and include the downstream molecules of the altered signaling pathway in ESCC to fulfill the requirement towards identification of new drug targets, ESCC genes and novel mechanisms. For accelerating the research beyond geographic borders, between basic biomedical and clinical research in esophageal cancer, it is of utmost importance to have upcoming clinical and biological data on ESCC in a single repository like ESCC ATLAS. Hence we will invite researchers in the field of ESCC to deposit their data so that along with them, rest of the world also have access to the recent data on ESCC.

Data Availability

The data in ESCC ATLAS is freely accessible for academic Institutes/Universities and as well as for Not-for-profit organizations at the http://www.esccatlas.org website. All data generated and analyzed during our study are included in this published article and its supplementary information file-Dataset 1,2,3 and 4.

References

Globocan. http://globocan.iarc.fr/Pages/fact_sheets_cancer.aspx (2012).
Zeng, H. et al. Esophageal cancer statistics in China, 2011: Estimates based on 177 cancer registries. Thoracic cancer 7, 232–237, https://doi.org/10.1111/1759-7714.12322 (2016).
Article PubMed Google Scholar
Jemal, A., Center, M. M., DeSantis, C. & Ward, E. M. Global patterns of cancer incidence and mortality rates and trends. Cancer Epidemiol Biomarkers Prev 19, 1893–1907, https://doi.org/10.1158/1055-9965.EPI-10-0437 (2010).
Article PubMed Google Scholar
Gholipour, C., Shalchi, R. A. & Abbasi, M. A histopathological study of esophageal cancer on the western side of the Caspian littoral from 1994 to 2003. Dis Esophagus 21, 322–327, https://doi.org/10.1111/j.1442-2050.2007.00776.x (2008).
Article PubMed CAS Google Scholar
Islami, F. et al. Epidemiologic features of upper gastrointestinal tract cancers in Northeastern Iran. Br J Cancer 90, 1402–1406, https://doi.org/10.1038/sj.bjc.6601737 (2004).
Article PubMed PubMed Central CAS Google Scholar
Tran, G. D. et al. Prospective study of risk factors for esophageal and gastric cancers in the Linxian general population trial cohort in China. Int J Cancer 113, 456–463, https://doi.org/10.1002/ijc.20616 (2005).
Article PubMed CAS Google Scholar
Koca, T. et al. Dietary and demographical risk factors for oesophageal squamous cell carcinoma in the Eastern Anatolian region of Turkey where upper gastrointestinal cancers are endemic. Asian Pac J Cancer Prev 16, 1913–1917 (2015).
Article PubMed Google Scholar
Domper Arnal, M. J., Ferrandez Arenas, A. & Lanas Arbeloa, A. Esophageal cancer: Risk factors, screening and endoscopic treatment in Western and Eastern countries. World journal of gastroenterology 21, 7933–7943, https://doi.org/10.3748/wjg.v21.i26.7933 (2015).
Article PubMed PubMed Central Google Scholar
He, Z. et al. Prevalence and risk factors for esophageal squamous cell cancer and precursor lesions in Anyang, China: a population-based endoscopic survey. Br J Cancer 103, 1085–1088, https://doi.org/10.1038/sj.bjc.6605843 (2010).
Article PubMed PubMed Central CAS Google Scholar
Nasrollahzadeh, D. et al. Opium, tobacco, and alcohol use in relation to oesophageal squamous cell carcinoma in a high-risk area of Iran. Br J Cancer 98, 1857–1863, https://doi.org/10.1038/sj.bjc.6604369 (2008).
Article PubMed PubMed Central CAS Google Scholar
Yu, Y. et al. Retrospective cohort study of risk-factors for esophageal cancer in Linxian, People’s Republic of China. Cancer causes & control: CCC 4, 195–202 (1993).
PubMed Google Scholar
Lindkvist, B. et al. Metabolic risk factors for esophageal squamous cell carcinoma and adenocarcinoma: a prospective study of 580,000 subjects within the Me-Can project. BMC Cancer 14, 103, https://doi.org/10.1186/1471-2407-14-103 (2014).
Article PubMed PubMed Central CAS Google Scholar
Navarro Silvera, S. A. et al. Principal component analysis of dietary and lifestyle patterns in relation to risk of subtypes of esophageal and gastric cancer. Annals of epidemiology 21, 543–550, https://doi.org/10.1016/j.annepidem.2010.11.019 (2011).
Article PubMed Google Scholar
Wang, L. et al. HLungDB: an integrated database of human lung cancer research. Nucleic acids research 38, D665–669, https://doi.org/10.1093/nar/gkp945 (2010).
Article PubMed CAS Google Scholar
Genomes Project, C. et al. A global reference for human genetic variation. Nature 526, 68–74, https://doi.org/10.1038/nature15393 (2015).
Article ADS CAS Google Scholar
Zarrei, M., MacDonald, J. R., Merico, D. & Scherer, S. W. A copy number variation map of the human genome. Nature reviews. Genetics 16, 172–183, https://doi.org/10.1038/nrg3871 (2015).
Article PubMed CAS Google Scholar
Fraser, H. B., Lam, L. L., Neumann, S. M. & Kobor, M. S. Population-specificity of human DNA methylation. Genome biology 13, R8, https://doi.org/10.1186/gb-2012-13-2-r8 (2012).
Article PubMed PubMed Central CAS Google Scholar
Giuliani, C. et al. Epigenetic Variability across Human Populations: A Focus on DNA Methylation Profiles of the KRTCAP3, MAD1L1 and BRSK2 Genes. Genome biology and evolution 8, 2760–2773, https://doi.org/10.1093/gbe/evw186 (2016).
Article PubMed PubMed Central CAS Google Scholar
Heyn, H. et al. DNA methylation contributes to natural human variation. Genome research 23, 1363–1372, https://doi.org/10.1101/gr.154187.112 (2013).
Article PubMed PubMed Central CAS Google Scholar
Waszak, S. M. et al. Population Variation and Genetic Control of Modular Chromatin Architecture in Humans. Cell 162, 1039–1050, https://doi.org/10.1016/j.cell.2015.08.001 (2015).
Article PubMed CAS Google Scholar
Huang, R. S. et al. Population differences in microRNA expression and biological implications. RNA biology 8, 692–701, https://doi.org/10.4161/rna.8.4.16029 (2011).
Article PubMed PubMed Central CAS Google Scholar
Essack, M. et al. DDEC: Dragon database of genes implicated in esophageal cancer. BMC Cancer 9, 219 (2009).
Article PubMed PubMed Central CAS Google Scholar
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nature protocols 4, 1184–1191, https://doi.org/10.1038/nprot.2009.97 (2009).
Article PubMed PubMed Central CAS Google Scholar
Benjamini, Y., Drai, D., Elmer, G., Kafkafi, N. & Golani, I. Controlling the false discovery rate in behavior genetics research. Behavioural brain research 125, 279–284 (2001).
Article PubMed CAS Google Scholar
Benjamini, Y. Y. D The Control of the False Discovery Rate in Multiple Testing under Dependency. The Annals of Statistics 29, 1165–1188 (2001).
Article MathSciNet MATH Google Scholar
Supek, F., Bosnjak, M., Skunca, N. & Smuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PloS one 6, e21800, https://doi.org/10.1371/journal.pone.0021800 (2011).
Article ADS PubMed PubMed Central CAS Google Scholar
Winterhalter, C. et al. Pepper: cytoscape app for protein complex expansion using protein-protein interaction networks. Bioinformatics 30, 3419–3420, https://doi.org/10.1093/bioinformatics/btu517 (2014).
Article PubMed PubMed Central CAS Google Scholar
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2017 update. Nucleic acids research 45, D369–D379, https://doi.org/10.1093/nar/gkw1102 (2017).
Article PubMed CAS Google Scholar
He, Y. et al. Identification of GPX3 epigenetically silenced by CpG methylation in human esophageal squamous cell carcinoma. Digestive diseases and sciences 56, 681–688, https://doi.org/10.1007/s10620-010-1369-0 (2011).
Article PubMed CAS Google Scholar
Kashyap, M. K. et al. Genomewide mRNA profiling of esophageal squamous cell carcinoma for identification of cancer biomarkers. Cancer biology & therapy 8, 36–46 (2009).
Article CAS Google Scholar
Li, X. et al. Identification of a DNA methylome profile of esophageal squamous cell carcinoma and potential plasma epigenetic biomarkers for early diagnosis. PloS one 9, e103162, https://doi.org/10.1371/journal.pone.0103162 (2014).
Article ADS PubMed PubMed Central Google Scholar
Zhang, X. et al. An 8-gene signature, including methylated and down-regulated glutathione peroxidase 3, of gastric cancer. International journal of oncology 36, 405–414 (2010).
PubMed CAS Google Scholar
Peng, D. F. et al. Silencing of glutathione peroxidase 3 through DNA hypermethylation is associated with lymph node metastasis in gastric carcinomas. PloS one 7, e46214, https://doi.org/10.1371/journal.pone.0046214 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Lodygin, D., Epanchintsev, A., Menssen, A., Diebold, J. & Hermeking, H. Functional epigenomics identifies genes frequently silenced in prostate cancer. Cancer Res 65, 4218–4227, https://doi.org/10.1158/0008-5472.CAN-04-4407 (2005).
Article PubMed CAS Google Scholar
Yu, Y. P. et al. Glutathione peroxidase 3, deleted or methylated in prostate cancer, suppresses prostate cancer growth and metastasis. Cancer Res 67, 8043–8050, https://doi.org/10.1158/0008-5472.CAN-07-0648 (2007).
Article PubMed CAS Google Scholar
Mohamed, M. M. et al. Promoter hypermethylation and suppression of glutathione peroxidase 3 are associated with inflammatory breast carcinogenesis. Oxidative medicine and cellular longevity 2014, 787195, https://doi.org/10.1155/2014/787195 (2014).
Article PubMed PubMed Central CAS Google Scholar
Li, B., Wang, B., Niu, L. J., Jiang, L. & Qiu, C. C. Hypermethylation of multiple tumor-related genes associated with DNMT3b up-regulation served as a biomarker for early diagnosis of esophageal squamous cell carcinoma. Epigenetics 6, 307–316 (2011).
Article PubMed PubMed Central CAS Google Scholar
Ling, Z. Q. et al. Hypermethylation-modulated down-regulation of CDH1 expression contributes to the progression of esophageal cancer. International journal of molecular medicine 27, 625–635, https://doi.org/10.3892/ijmm.2011.640 (2011).
Article PubMed CAS Google Scholar
Lee, E. J. et al. CpG island hypermethylation of E-cadherin (CDH1) and integrin alpha4 is associated with recurrence of early stage esophageal squamous cell carcinoma. Int J Cancer 123, 2073–2079, https://doi.org/10.1002/ijc.23598 (2008).
Article PubMed CAS Google Scholar
Takeno, S. et al. E-cadherin expression in patients with esophageal squamous cell carcinoma: promoter hypermethylation, Snail overexpression, and clinicopathologic implications. American journal of clinical pathology 122, 78–84, https://doi.org/10.1309/P2CD-FGU1-U7CL-V5YR (2004).
Article PubMed CAS Google Scholar
Si, H. X. et al. E-cadherin expression is commonly downregulated by CpG island hypermethylation in esophageal carcinoma cells. Cancer letters 173, 71–78 (2001).
Article PubMed CAS Google Scholar
Guo, M. et al. Accumulation of promoter methylation suggests epigenetic progression in squamous cell carcinoma of the esophagus. Clinical cancer research: an official journal of the American Association for Cancer Research 12, 4515–4522, https://doi.org/10.1158/1078-0432.CCR-05-2858 (2006).
Article CAS Google Scholar
Zhang, X. F. et al. Association of CDH1 single nucleotide polymorphisms with susceptibility to esophageal squamous cell carcinomas and gastric cardia carcinomas. Dis Esophagus 21, 21–29, https://doi.org/10.1111/j.1442-2050.2007.00724.x (2008).
Article PubMed Google Scholar
Zhai, J. W., Yang, X. G., Yang, F. S., Hu, J. G. & Hua, W. X. Expression and clinical significance of Ezrin and E-cadherin in esophageal squamous cell carcinoma. Chinese journal of cancer 29, 317–320 (2010).
Article PubMed Google Scholar
Zhai, J. et al. DRP-1, ezrin and E-cadherin expression and the association with esophageal squamous cell carcinoma. Oncology letters 8, 133–138, https://doi.org/10.3892/ol.2014.2114 (2014).
Article ADS PubMed PubMed Central Google Scholar
Zeng, H. et al. Altered expression of ezrin in esophageal squamous cell carcinoma. J Histochem Cytochem 54, 889–896, https://doi.org/10.1369/jhc.5A6881.2006 (2006).
Article PubMed CAS Google Scholar
Xie, J. J. et al. Roles of ezrin in the growth and invasiveness of esophageal squamous carcinoma cells. Int J Cancer 124, 2549–2558, https://doi.org/10.1002/ijc.24216 (2009).
Article PubMed CAS Google Scholar
Sato, T. et al. Esophageal squamous cell carcinomas with distinct invasive depth show different gene expression profiles associated with lymph node metastasis. International journal of oncology 28, 1043–1055 (2006).
PubMed MathSciNet CAS Google Scholar
Li, Y. et al. High expression of Y-box-binding protein-1 is associated with poor survival in resectable esophageal squamous cell carcinoma. Annals of surgical oncology 18, 3370–3376, https://doi.org/10.1245/s10434-011-1725-0 (2011).
Article PubMed Google Scholar
Guan, X. et al. Matrix metalloproteinase 1, 3, and 9 polymorphisms and esophageal squamous cell carcinoma risk. Medical science monitor: international medical journal of experimental and clinical research 20, 2269–2274, https://doi.org/10.12659/MSM.892413 (2014).
Article CAS Google Scholar
Zhang, J. et al. The functional SNP in the matrix metalloproteinase-3 promoter modifies susceptibility and lymphatic metastasis in esophageal squamous cell carcinoma but not in gastric cardiac adenocarcinoma. Carcinogenesis 25, 2519–2524, https://doi.org/10.1093/carcin/bgh269 (2004).
Article PubMed CAS Google Scholar
Augoff, K. et al. Upregulated expression and activation of membraneassociated proteases in esophageal squamous cell carcinoma. Oncol Rep 31, 2820–2826, https://doi.org/10.3892/or.2014.3162 (2014).
Article PubMed CAS Google Scholar
Chang, X. et al. Investigating the pathogenic role of PADI4 in oesophageal cancer. International journal of biological sciences 7, 769–781 (2011).
Article ADS PubMed PubMed Central CAS Google Scholar
Wang, L. et al. PADI4rs2240337 G > A polymorphism is associated with susceptibility of esophageal squamous cell carcinoma in a Chinese population. Oncotarget 8, 93655–93671, https://doi.org/10.18632/oncotarget.20675 (2017).
Article PubMed PubMed Central Google Scholar
Xuan, X. et al. Stat3 promotes invasion of esophageal squamous cell carcinoma through up-regulation of MMP2. Molecular biology reports 42, 907–915, https://doi.org/10.1007/s11033-014-3828-8 (2015).
Article PubMed CAS Google Scholar
Ying, J. et al. Genome-wide screening for genetic alterations in esophageal cancer by aCGH identifies 11q13 amplification oncogenes associated with nodal metastasis. PloS one 7, e39797, https://doi.org/10.1371/journal.pone.0039797 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhang, C. et al. Fibroblast growth factor receptor 2-positive fibroblasts provide a suitable microenvironment for tumor development and progression in esophageal carcinoma. Clinical cancer research: an official journal of the American Association for Cancer Research 15, 4017–4027, https://doi.org/10.1158/1078-0432.CCR-08-2824 (2009).
Article CAS Google Scholar
Dudka, A. A., Sweet, S. M. & Heath, J. K. Signal transducers and activators of transcription-3 binding to the fibroblast growth factor receptor is activated by receptor amplification. Cancer research 70, 3391–3401, https://doi.org/10.1158/0008-5472.CAN-09-3033 (2010).
Article PubMed PubMed Central CAS Google Scholar
Luo, J., Yan, R., He, X. & He, J. Constitutive activation of STAT3 and cyclin D1 overexpression contribute to proliferation, migration and invasion in gastric cancer cells. American journal of translational research 9, 5671–5677 (2017).
PubMed PubMed Central Google Scholar
Ray, S., Lee, C., Hou, T., Boldogh, I. & Brasier, A. R. Requirement of histonedeacetylase1 (HDAC1) in signal transducer and activator of transcription 3 (STAT3) nucleocytoplasmic distribution. Nucleic acids research 36, 4510–4520, https://doi.org/10.1093/nar/gkn419 (2008).
Article PubMed PubMed Central CAS Google Scholar
Kato, M. et al. PML suppresses IL-6-induced STAT3 activation by interfering with STAT3 and HDAC3 interaction. Biochemical and biophysical research communications 461, 366–371, https://doi.org/10.1016/j.bbrc.2015.04.040 (2015).
Article PubMed CAS Google Scholar
Lee, H. et al. Persistently activated Stat3 maintains constitutive NF-kappaB activity in tumors. Cancer cell 15, 283–293, https://doi.org/10.1016/j.ccr.2009.02.015 (2009).
Article PubMed PubMed Central CAS Google Scholar
Zhou, J. et al. Gene expression profiles at different stages of human esophageal squamous cell carcinoma. World journal of gastroenterology 9, 9–15 (2003).
Article PubMed PubMed Central CAS Google Scholar
Hu, Y. C., Lam, K. Y., Law, S., Wong, J. & Srivastava, G. Identification of differentially expressed genes in esophageal squamous cell carcinoma (ESCC) by cDNA expression array: overexpression of Fra-1, Neogenin, Id-1, and CDC25B genes in ESCC. Clinical cancer research: an official journal of the American Association for Cancer Research 7, 2213–2221 (2001).
CAS Google Scholar
Li, L. Y. et al. Overexpression of GRB2 is correlated with lymph node metastasis and poor prognosis in esophageal squamous cell carcinoma. International journal of clinical and experimental pathology 7, 3132–3140 (2014).
PubMed PubMed Central Google Scholar
Li, W. Q. et al. Genetic variants in DNA repair pathway genes and risk of esophageal squamous cell carcinoma and gastric adenocarcinoma in a Chinese population. Carcinogenesis 34, 1536–1542, https://doi.org/10.1093/carcin/bgt094 (2013).
Article PubMed PubMed Central CAS Google Scholar
Liu, R., Yin, L. H. & Pu, Y. P. Reduced expression of human DNA repair genes in esophageal squamous-cell carcinoma in china. Journal of toxicology and environmental health. Part A 70, 956–963, https://doi.org/10.1080/15287390701290725 (2007).
Article PubMed CAS Google Scholar
Bhat, G. A. et al. CYP1A1 and CYP2E1 genotypes and risk of esophageal squamous cell carcinoma in a high-incidence region, Kashmir. Tumour Biol 35, 5323–5330, https://doi.org/10.1007/s13277-014-1694-6 (2014).
Article PubMed CAS Google Scholar
Sepehr, A. et al. Genetic polymorphisms in three Iranian populations with different risks of esophageal cancer, an ecologic comparison. Cancer letters 213, 195–202, https://doi.org/10.1016/j.canlet.2004.05.017 (2004).
Article PubMed CAS Google Scholar
Greenawalt, D. M. et al. Gene expression profiling of esophageal cancer: comparative analysis of Barrett’s esophagus, adenocarcinoma, and squamous cell carcinoma. Int J Cancer 120, 1914–1921, https://doi.org/10.1002/ijc.22501 (2007).
Article PubMed CAS Google Scholar
Fu, K. et al. DJ-1 inhibits TRAIL-induced apoptosis by blocking pro-caspase-8 recruitment to FADD. Oncogene 31, 1311–1322, https://doi.org/10.1038/onc.2011.315 (2012).
Article PubMed CAS Google Scholar
Lee, C. H. et al. Genetic modulation of ADH1B and ALDH2 polymorphisms with regard to alcohol and tobacco consumption for younger aged esophageal squamous cell carcinoma diagnosis. Int J Cancer 125, 1134–1142, https://doi.org/10.1002/ijc.24357 (2009).
Article PubMed CAS Google Scholar
Guo, Y. M. et al. Genetic polymorphisms in cytochrome P4502E1, alcohol and aldehyde dehydrogenases and the risk of esophageal squamous cell carcinoma in Gansu Chinese males. World journal of gastroenterology 14, 1444–1449 (2008).
Article PubMed PubMed Central CAS Google Scholar
Long, E. & Beales, I. L. The role of obesity in oesophageal cancer development. Therapeutic advances in gastroenterology 7, 247–268, https://doi.org/10.1177/1756283X14538689 (2014).
Article PubMed PubMed Central CAS Google Scholar
Lagergren, J. Influence of obesity on the risk of esophageal disorders. Nature reviews. Gastroenterology & hepatology 8, 340–347, https://doi.org/10.1038/nrgastro.2011.73 (2011).
Article Google Scholar
Duggan, C. et al. Association between markers of obesity and progression from Barrett’s esophagus to esophageal adenocarcinoma. Clinical gastroenterology and hepatology: the official clinical practice journal of the American Gastroenterological Association 11, 934–943, https://doi.org/10.1016/j.cgh.2013.02.017 (2013).
Article CAS Google Scholar
Carr, J. S., Zafar, S. F., Saba, N., Khuri, F. R. & El-Rayes, B. F. Risk factors for rising incidence of esophageal and gastric cardia adenocarcinoma. Journal of gastrointestinal cancer 44, 143–151, https://doi.org/10.1007/s12029-013-9480-z (2013).
Article PubMed Google Scholar
Doecke, J. D. et al. Single nucleotide polymorphisms in obesity-related genes and the risk of esophageal cancers. Cancer Epidemiol Biomarkers Prev 17, 1007–1012, https://doi.org/10.1158/1055-9965.EPI-08-0023 (2008).
Article PubMed CAS Google Scholar
Tse, D. et al. Polymorphisms of the NER pathway genes, ERCC1 and XPD are associated with esophageal adenocarcinoma risk. Cancer causes & control: CCC 19, 1077–1083, https://doi.org/10.1007/s10552-008-9171-4 (2008).
Article PubMed Google Scholar
Bonora, E. et al. INPP4B overexpression and c-KIT downregulation in human achalasia. Neurogastroenterology and motility: the official journal of the European Gastrointestinal Motility Society, e13346, https://doi.org/10.1111/nmo.13346 (2018).
Su, H. et al. Global gene expression profiling and validation in esophageal squamous cell carcinoma and its association with clinical phenotypes. Clinical cancer research: an official journal of the American Association for Cancer Research 17, 2955–2966 (2011).
Article CAS Google Scholar
Su, H. et al. Gene expression analysis of esophageal squamous cell carcinoma reveals consistent molecular profiles related to a family history of upper gastrointestinal cancer. Cancer Res 63, 3872–3876 (2003).
PubMed CAS Google Scholar
Almodova Ede, C. et al. Atrophic gastritis: risk factor for esophageal squamous cell carcinoma in a Latin-American population. World journal of gastroenterology 19, 2060–2064, https://doi.org/10.3748/wjg.v19.i13.2060 (2013).
Article PubMed Google Scholar
Ye, W. et al. Helicobacter pylori infection and gastric atrophy: risk of adenocarcinoma and squamous-cell carcinoma of the esophagus and adenocarcinoma of the gastric cardia. Journal of the National Cancer Institute 96, 388–396 (2004).
Article PubMed Google Scholar
McCormack, V. A. et al. Informing etiologic research priorities for squamous cell esophageal cancer in Africa: A review of setting-specific exposures to known and putative risk factors. Int J Cancer 140, 259–271, https://doi.org/10.1002/ijc.30292 (2017).
Article PubMed CAS Google Scholar
Rhodes, D. R. et al. ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia 6, 1–6 (2004).
Article PubMed PubMed Central CAS Google Scholar
Cutts, R. J., Gadaleta, E., Lemoine, N. R. & Chelala, C. Using BioMart as a framework to manage and query pancreatic cancer data. Database: the journal of biological databases and curation 2011, bar024, https://doi.org/10.1093/database/bar024 (2011).
Article PubMed CAS Google Scholar
Harsha, H. C. et al. A compendium of potential biomarkers of pancreatic cancer. PLoS Med 6, e1000046 (2009).
Article PubMed PubMed Central CAS Google Scholar
Thomas, J. K. et al. Pancreatic Cancer Database: an integrative resource for pancreatic cancer. Cancer biology & therapy 15, 963–967, https://doi.org/10.4161/cbt.29188 (2014).
Article CAS Google Scholar
Mohandass, J. et al. BCDB - A database for breast cancer research and information. Bioinformation 5, 1–3 (2010).
Article PubMed PubMed Central Google Scholar
Mosca, E. et al. A multilevel data integration resource for breast cancer study. BMC systems biology 4, 76, https://doi.org/10.1186/1752-0509-4-76 (2010).
Article PubMed PubMed Central CAS Google Scholar
Maqungo, M. et al. DDPC: Dragon Database of Genes associated with Prostate Cancer. Nucleic acids research 39, D980–985, https://doi.org/10.1093/nar/gkq849 (2011).
Article PubMed CAS Google Scholar
Ramana, J. RCDB: Renal Cancer GeneDatabase. BMC research notes 5, 246, https://doi.org/10.1186/1756-0500-5-246 (2012).
Article PubMed PubMed Central CAS Google Scholar
Agarwal, S. M., Raghav, D., Singh, H. & Raghava, G. P. CCDB: a curated database of genes involved in cervix cancer. Nucleic acids research 39, D975–979, https://doi.org/10.1093/nar/gkq1024 (2011).
Article PubMed CAS Google Scholar
Ganzfried, B. F. et al. curatedOvarianData: clinically annotated data for the ovarian cancer transcriptome. Database: the journal of biological databases and curation 2013, bat013, https://doi.org/10.1093/database/bat013 (2013).
Article PubMed CAS Google Scholar
Poos, K. et al. Structuring osteosarcoma knowledge: an osteosarcoma-gene association database based on literature mining and manual annotation. Database: the journal of biological databases and curation 2014, https://doi.org/10.1093/database/bau042 (2014).
Bhalla, S. et al. CancerPDF: A repository of cancer-associated peptidome found in human biofluids. Scientific reports 7, 1511, https://doi.org/10.1038/s41598-017-01633-3 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Kashyap, M. K. et al. SILAC-based quantitative proteomic approach to identify potential biomarkers from the esophageal squamous cell carcinoma secretome. Cancer biology & therapy 10, 796–810 (2010).
Article CAS Google Scholar
Gray, K. A., Yates, B., Seal, R. L., Wright, M. W. & Bruford, E. A. Genenames.org: the HGNC resources in 2015. Nucleic acids research 43, D1079–1085, https://doi.org/10.1093/nar/gku1071 (2015).
Article PubMed CAS Google Scholar
McKusick, V. A. Mendelian Inheritance in Man and its online version, OMIM. American journal of human genetics 80, 588–604, https://doi.org/10.1086/514346 (2007).
Article PubMed PubMed Central CAS Google Scholar
Keshava Prasad, T. S. et al. Human Protein Reference Database–2009 update. Nucleic acids research 37, D767–772 (2009).
Article PubMed CAS Google Scholar
Peri, S. et al. Human protein reference database as a discovery resource for proteomics. Nucleic acids research 32, D497–501, https://doi.org/10.1093/nar/gkh070 (2004).
Article PubMed PubMed Central CAS Google Scholar
Hubbard, T. et al. The Ensembl genome database project. Nucleic acids research 30, 38–41 (2002).
Article PubMed PubMed Central CAS Google Scholar
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic acids research 28, 27–30 (2000).
Article PubMed PubMed Central CAS Google Scholar
Pico, A. R. et al. WikiPathways: pathway editing for the people. PLoS biology 6, e184, https://doi.org/10.1371/journal.pbio.0060184 (2008).
Article PubMed PubMed Central CAS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25, 25–29, https://doi.org/10.1038/75556 (2000).
Article PubMed PubMed Central CAS Google Scholar
Griffiths-Jones, S., Grocock, R. J., van Dongen, S., Bateman, A. & Enright, A. J. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic acids research 34, D140–144, https://doi.org/10.1093/nar/gkj112 (2006).
Article CAS Google Scholar
MacDonald, J. R., Ziman, R., Yuen, R. K., Feuk, L. & Scherer, S. W. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic acids research 42, D986–992, https://doi.org/10.1093/nar/gkt958 (2014).
Article PubMed CAS Google Scholar
Chattopadhyay, I. et al. Gene expression profile of esophageal cancer in North East India by cDNA microarray analysis. World journal of gastroenterology 13, 1438–1444 (2007).
Article PubMed PubMed Central CAS Google Scholar
Popejoy, A. B. & Fullerton, S. M. Genomics is failing on diversity. Nature 538, 161–164, https://doi.org/10.1038/538161a (2016).
Article ADS PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This research was not supported by any funding agencies. Hence, the authors have no support or funding to report.

Author information

Asna Tungekar, Sumana Mandarthi, Pooja Mandaviya and Veerendra P. Gadekar contributed equally.
Prashantha Hebbar and Manoj K. Kashyap jointly supervised this work.

Authors and Affiliations

Mbiomics, Manipal, Karnataka, India
Asna Tungekar, Sumana Mandarthi, Pooja Rajendra Mandaviya, Veerendra P. Gadekar, Ananthajith Tantry, Sowmya Kotian, Jyotshna Reddy, Divya Prabha, Sushma Bhat, Sweta Sahay, Roshan Mascarenhas, Raghavendra Rao Badkillaya, Manoj Kumar Nagasampige, Mohan Yelnadu, Prashantha Hebbar & Manoj Kumar Kashyap
Manipal Life Sciences Center, Manipal University, Manipal, Karnataka, India
Asna Tungekar, Pooja Rajendra Mandaviya, Veerendra P. Gadekar, Sowmya Kotian, Jyotshna Reddy, Sushma Bhat, Roshan Mascarenhas & Prashantha Hebbar
Department of Biochemistry, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
Sumana Mandarthi
Manipal Center for Information Sciences, Manipal University, Manipal, Karnataka, India
Ananthajith Tantry & Mohan Yelnadu
Faculty of Applied Sciences and Biotechnology, Shoolini University of Biotechnology and Management Sciences, Bajhol, Solan, Himachal Pradesh 173229, India
Manoj Kumar Kashyap
Infosys Technologies Ltd, Bangalore, Karnataka, India
Mohan Yelnadu
Faculty of Biology, Technion-Israel Institute of Technology, Haifa, 3200003, Israel
Mohan Yelnadu & Harsh Pawar
School of Life and Allied Health Sciences, Glocal University, Saharanpur, Uttar Pradesh, 247001, India
Manoj Kumar Kashyap
Newcastle University Medicine Malaysia, Johor Bahru, 79200, Malaysia
Roshan Mascarenhas
Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, 1090, Vienna, Austria
Veerendra P. Gadekar & Manoj Kumar Kashyap
Department of Biotechnology, Alva’s college, Moodubidre, Karnataka, India
Raghavendra Rao Badkillaya
Department of Biotechnology, Sikkim Manipal University, Gangtok, Sikkim, 737102, India
Manoj Kumar Nagasampige

Authors

Asna Tungekar
View author publications
You can also search for this author in PubMed Google Scholar
Sumana Mandarthi
View author publications
You can also search for this author in PubMed Google Scholar
Pooja Rajendra Mandaviya
View author publications
You can also search for this author in PubMed Google Scholar
Veerendra P. Gadekar
View author publications
You can also search for this author in PubMed Google Scholar
Ananthajith Tantry
View author publications
You can also search for this author in PubMed Google Scholar
Sowmya Kotian
View author publications
You can also search for this author in PubMed Google Scholar
Jyotshna Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Divya Prabha
View author publications
You can also search for this author in PubMed Google Scholar
Sushma Bhat
View author publications
You can also search for this author in PubMed Google Scholar
Sweta Sahay
View author publications
You can also search for this author in PubMed Google Scholar
Roshan Mascarenhas
View author publications
You can also search for this author in PubMed Google Scholar
Raghavendra Rao Badkillaya
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Kumar Nagasampige
View author publications
You can also search for this author in PubMed Google Scholar
Mohan Yelnadu
View author publications
You can also search for this author in PubMed Google Scholar
Harsh Pawar
View author publications
You can also search for this author in PubMed Google Scholar
Prashantha Hebbar
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Kumar Kashyap
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K.K. and P.H., conceived and guided the research. M.K.K., A.T., S.M., P.M., S.K., J.R., D.P., S.B., S.S. participated in curation. R.M., R.R.B., M.K.N., H.P., M.K.K., validated curation. P.H., A.T., M.K.K. and V.G.P. performed all the data analysis. P.H., A.T. and M.A.Y., developed the front end, database and setup the server. S.M., V.G.P., A.T., and M.K.K. wrote the manuscript.

Corresponding authors

Correspondence to Prashantha Hebbar or Manoj Kumar Kashyap.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tungekar, A., Mandarthi, S., Mandaviya, P. et al. ESCC ATLAS: A population wide compendium of biomarkers for Esophageal Squamous Cell Carcinoma. Sci Rep 8, 12715 (2018). https://doi.org/10.1038/s41598-018-30579-3

Download citation

Received: 05 December 2017
Accepted: 01 August 2018
Published: 24 August 2018
DOI: https://doi.org/10.1038/s41598-018-30579-3

This article is cited by

Exploring the hub genes and mechanisms of Daphne altaica treating esophageal squamous cell carcinoma based on network pharmacology and bioinformatics analysis
- Sendaer Hailati
- Ziruo Talihati
- Wenting Zhou
Journal of Cancer Research and Clinical Oncology (2023)
RNA splicing: a dual-edged sword for hepatocellular carcinoma
- Anjali Kashyap
- Greesham Tripathi
- Manoj Kumar Kashyap
Medical Oncology (2022)
Role of ACE2 receptor and the landscape of treatment options from convalescent plasma therapy to the drug repurposing in COVID-19
- Pravindra Kumar
- Ashok Kumar Sah
- Manoj Kumar Kashyap
Molecular and Cellular Biochemistry (2021)
A DNA methylation-based test for esophageal cancer detection
- Sofia Salta
- Catarina Macedo-Silva
- Carmen Jerónimo
Biomarker Research (2020)
Elevated DHODH expression promotes cell proliferation via stabilizing β-catenin in esophageal squamous cell carcinoma
- Yu Qian
- Xiao Liang
- Yongping Cui
Cell Death & Disease (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.