Transcriptome analysis and prognosis of ALDH isoforms in human cancer

Chang, Peter Mu-Hsin; Chen, Che-Hong; Yeh, Chi-Chun; Lu, Hsueh-Ju; Liu, Tze-Tze; Chen, Ming-Huang; Liu, Chun-Yu; Wu, Alexander T. H.; Yang, Muh-Hwa; Tai, Shyh-Kuan; Mochly-Rosen, Daria; Huang, Chi-Ying F.

doi:10.1038/s41598-018-21123-4

Download PDF

Article
Open access
Published: 09 February 2018

Transcriptome analysis and prognosis of ALDH isoforms in human cancer

Peter Mu-Hsin Chang^1,2,
Che-Hong Chen³,
Chi-Chun Yeh⁴,
Hsueh-Ju Lu^5,6,
Tze-Tze Liu⁷,
Ming-Huang Chen^1,2,
Chun-Yu Liu^1,2,
Alexander T. H. Wu⁸,
Muh-Hwa Yang^1,9,
Shyh-Kuan Tai^2,10,
Daria Mochly-Rosen³ &
…
Chi-Ying F. Huang¹¹

Scientific Reports volume 8, Article number: 2713 (2018) Cite this article

4807 Accesses
22 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Overexpression of ALDH is associated with cancer stem-like features and poor cancer prognosis. High ALDH activity has been observed in cancer stem-like cells. There are a total of 19 human ALDH isoforms, all of which are associated with reducing oxidative stress and protecting cells from damage. However, it is unknown whether all ALDHs are associated with poor cancer prognosis and which ones play a significant role in cancer progression. In this study, we used RNA sequencing data from The Cancer Genome Atlas (TCGA) to evaluate the differential expression of 19 ALDH isoforms in 5 common human cancers. The 19 ALDH genes were analyzed with an integrating meta-analysis of cancer prognosis. Genotyping and next-generation RNA sequencing for 30 pairwise samples of head and neck squamous cell carcinoma were performed and compared with the TCGA cohort. The analysis showed that each ALDH isoform had a specific differential expression pattern, most of which were related to prognosis in human cancer. A lower expression of ALDH2 in the tumor was observed, which was independent from the ALDH2 rs671 SNP variant and the expression of other mitochondria-associated protein coding genes. This study provides new insight into the association between ALDH expression and cancer prognosis.

Comprehensive molecular characterization of mitochondrial genomes in human cancers

Article Open access 05 February 2020

Genomic basis for RNA alterations in cancer

Article Open access 05 February 2020

A distinct class of pan-cancer susceptibility genes revealed by an alternative polyadenylation transcriptome-wide association study

Article Open access 26 February 2024

Introduction

Carcinogenesis is an extremely complicated process that may involve multilevel mutations such as karyotype changes, loss of heterogeneity, DNA copy-number variations, sequence mutations and aberrant mRNA and/or protein expression. Among them, microarray and next-generation RNA sequencing (RNA-seq) have been widely used to identify oncogenic expression on a genome-wide scale because of the strength of simultaneous analysis of thousands of genes, which may help to identify novel biomarkers for treatment response, cancer prognosis and precision medicine¹. High-throughput approaches for transcription-level changes of oncogenes, novel biomarkers and signaling can be identified for cancer phenotypes in different human cancers^2,3,4. However, questions have been raised regarding the reproducibility and reliability of microarray experiments. The main challenge of these microarray studies are because of a small number of samples, inconsistent tissue sample quality from the DNA/RNA extraction and incomprehensive clinical data for analysis. In recent years, The Cancer Genome Atlas (TCGA: https://cancergenome.nih.gov/) program, which includes comprehensive, multi-dimensional maps of the key genomic changes in more than 30 types of cancer, has been used for the cancer studies. The TCGA dataset places an emphasis on the tissue sample quality that was used and has more than 2.5 petabytes of data, including pairwise tumor/normal tissues, from more than 10,000 patients. It has become one of the most powerful and popular tools for genomic studies of human cancers^5,6,7,8.

In human, the multigene ALDH family that consists of 19 different isozymes has been identified due to similar amino acid sequences and functions^9,10. Furthermore, elevated ALDH activity has been used as a cancer stem cell biomarker¹¹. Cancer stem-like features account for the relative aggressiveness of tumors and are potential prognostic indicators for patients with cancer¹². Several recent studies have shown that ALDH1A1 and ALDH3A1 may detoxify cyclophosphamide and result in cancer resistance^13,14. High ALDH1A3 expression has been reported as a poor prognostic marker for breast cancer and cholangiocarcinoma^10,15. However, ALDHs are also well known for metabolizing aldehydes and thus reducing the oxidative stress in cells from damage. For example, ALDH2 has the lowest Michaelis constant for acetaldehyde, which has been classified as a group 1 carcinogen by the International Agency for Research on Cancer¹⁶. Reduction in ALDH2 activity increases acetaldehyde accumulation in the human body, which increases the cancer risk in patients, especially in those that consume alcohol¹⁷. Finally, the exact functions of other ALDH isoforms remain unclear; therefore, a more comprehensive approach for the differential expressions (DEs) and prognosis of all ALDH isoforms in human cancers is warranted.

In silico analysis has been commonly utilized in genomic studies, thus resulting in public microarray or RNA-seq datasets^1,18. These high-throughput bioinformatics tools can provide insight into the biological dynamics and functional validation of candidate genes. The challenge of reproducibility for an individual microarray study may potentially be improved by a systematic approach using standardized methods^19,20. Prognoscan (http://www.abren.net/PrognoScan/) is a bioinformatics tool that contains more than 70 microarray studies from 13 different human cancer types with clinical prognosis²¹. It has been used widely for human cancer research^22,23,24,25 and provides a method to cross-link a group of candidate genes with prognoses in a systematic manner. In this study, we analyzed the DEs of all 19 ALDH isotypes using the TCGA RNA-seq dataset and integrated prognostic evaluations from the Prognoscan microarray meta-analysis. Finally, the 30 pairwise head and neck squamous cell carcinomas (HNSCs) from Taiwanese patients were used to compare the ALDH2 genotype with the DEs and cancer prognosis.

Results

Various differential expressions exist in the 19 ALDH isoforms compared to the TCGA cohort

From the TCGA database, the RNA-seq data were extracted for samples of breast cancer (BRCA) (1097 tumor vs. 114 normal samples), lung adenocarcinoma (LUAD) (515 tumor vs. 59 normal samples), lung squamous cell carcinoma (LUSC) (502 tumor vs. 51 normal samples), esophageal squamous cell carcinoma (ESSC) (82 tumor vs. 8 normal samples) and HNSC (520 tumor vs. 44 normal samples). The DEs for all 19 ALDH tumors vs. normal samples in 5 cancer types are shown in Fig. 1 and Supplemental Fig. 1. Generally, the pairwise comparison (BRCA, 114 pairs; LUAD, 59 pairs; LUSC, 51 pairs; HNSC, 44 pairs) showed a similar trend with case-control comparison, while the pairwise study showed a more significant p-value than the case-control study. We hypothesized that pairwise samples have more specific DEs because individual heterogeneity was minimized. Interestingly, there were several different DEs among the 19 ALDH isoforms. For example, ALDH1A2, ALDH2, ALDH3A2 and ALDH9A1 were downregulated in all tumors among the 5 cancer types (Fig. 1a,b,c and Supplemental Fig. 1a), whereas ALDH1B1, ALDH1L2 and ALDH18A1 were most upregulated in tumor parts (Fig. 1d,e,f). Some tumor type-specific DEs were observed for ALDH1L1, ALDH3B1, ALDH3B2, ALDH4A1 and ALDH7A1 (Fig. 1g,h,i and Supplemental Fig. 1b,c). For validation with other non-TCGA cohorts, we also use the NGS study from Djureinovic et al., both including LUSC and LUAD to compare with normal tissue of lung (GSE81089). In addition, tumor vs. normal microarray profiles for HNSC (GSE6631) and BRCA (GSE25291) have also been downloaded and analyzed. The DEs showed similar trends comparing with TCGA cohort (Supplemental Fig. 2). This result suggested that, at least among these 5 common cancer types, there were various DEs for each individual ALDH isoform.

ALDH differential expression is associated with prognosis in human cancer

We applied all 19 ALDH isoforms into Prognoscan to evaluate the survival differences among high and low expressing subgroups. The microarray studies with corrected p-values < 0.05 were extracted from the querying result of each isozyme and meta-analyses were performed individually. As shown in Figs 2 and 3, various prognostic values were observed among the ALDH isoforms that had lower expressions of ALDH2, ALDH3A1, ALDH5A1 and ALDH6A1 (Fig. 2a–d) but higher expressions of ALDH1B1, ALDH1L2, ALDH3B2 and ALDH16A1 (Fig. 2e–h) in tumors that were associated with poorer OS. Lower expression of ALDH1A1, ALDH1L1, ALDH2, ALDH3A1, ALDH3A2, ALDH3B1, ALDH5A1, ALDH6A1 and ALDH9A1 (Fig. 3a–d, Supplemental Fig. 3a–e) while higher expression of ALDH1A3, ALDH1B1, ALDH1L2, ALDH3B2, ALDH8A1 and ALDH18A1 (Fig. 3e–h, Supplemental Fig. 3f and g) were associated with poorer Progression-free survival (PFS). Interestingly, these prognostic trends were compatible with the DEs of most ALDH isoforms (Supplemental Fig. 4), which implied that ALDH DEs were associated with cancer prognosis.

Lower ALDH2 expression is observed in tumors and is associated with poor cancer prognosis

Since ALDH2 is the most well-known ALDH isozyme for its function to reduce cancer risk²⁶, we were especially interested when we observed that ALDH2 expression in tumors was downregulated and associated with poor cancer prognosis. First, we evaluated the DE for ALDH2 in our VGHTPE cohort. Fifteen tumor samples and 3 normal samples were within the QC criteria. Contrary to the commonly observed EGFR overexpression in HNSC, there were similar ALDH2 downregulations in both case-control (15 tumors vs. 3 normal) and pairwise (3 tumors vs. 3 normal) comparisons, although without a significant p-value, which may be due to the small number of samples (Supplemental Table 1). To further validate the prognostic role of the DE observed for ALDH2 in human cancer, we used KM plotter (http://kmplot.com/analysis/), which collected 10,188 human cancer microarray samples and normalized them together to generate a common high vs. low comparison of the DEs for each evaluated gene for the indicated cancer²⁷. As shown in Fig. 4, lower ALDH2 expression in the tumor was also associated with significantly poorer prognosis in BRCA (RFS for 1973 high vs. 1978 low samples, HR = 0.67, CI = 0.6–0.75, p-value = 0; OS for high 701 low vs. 701 low samples, HR = 0.69; CI = 0.56–0.86, p-value = 0.0008) (Fig. 4a and b) and LUAD (time to first progression [FP] for 231 high vs. 230 low samples, HR = 0.4, CI = 0.29–0.56, p-value = 0; OS for 360 high vs. 360 low samples, HR = 0.47, CI = 0.7–0.6, p-value = 0) (Fig. 4c and d). Because there is no HNSC profile in KM plotter and only one HNSC study in Prognoscan with small case number (N = 28), we used SurvExpress (http://bioinformatica.mty.itesm.mx:8080/Biomatec/SurvivaX.jsp)²⁸ to extract survival from HNSC TCGA cohort. The analysis showed that ALDH2 high expressed patients (N = 142) had significantly better survival than ALDH2 low expressed patients (N = 141) (HR = 1.59, CI = 1.1–2.29, p-value = 0.014) (Fig. 4e and f).

ALDH2 expression in the tumor is independent from ALDH2*2 SNP and other mitochondria-associated proteins

Since the ALDH2 rs671 SNP is specifically common in the Asian and Taiwanese populations¹⁶, we performed genotyping for HNSC patient samples and also compared the RNA expression between the tumor and normal tissues. Interestingly, comparisons of ALDH2 genotype and expression levels in samples derived from our VGHTPE cohort resulted in the identification of a similar trend of lower ALDH2 expression in tumor samples both in the ALDH2 rs671 GG wild type allele and in the GA heterozygous allele when compared to normal tissues (Fig. 5a). Furthermore, since ALDH2 enzyme only exists as an active form in the mitochondrial matrix, we thus compared the DEs of functional coding genes in mitochondrial matrix to see whether down-regulation of ALDH2 in tumor is independent from other mitochondria matrix associating proteins. As shown in Fig. 5b, most mitochondrial matrix protein-coding genes were significantly upregulated in tumors between the 5 different cancers, which was opposite to the ALDH2 DE. In addition, TOM complex accounts for transporting functional proteins into mitochondria and is upregulated in some cancer cells to stabilize anti-apoptotic proteins^29,30. Therefore, we also compared the DEs of TOM complex genes to see whether TOM complex genes are associated with ALDH2 expression. As shown in Fig. 5c, most TOM complex proteins were upregulated in tumors among the 5 cancer types, which is also opposite to the DE of ALDH2.

Discussion

In this study, we used an integrated analysis to evaluate the DEs for all ALDH isotypes as well as their correlation with cancer prognosis. We found that there were different DEs and prognosis among the 19 ALDH subtypes, suggesting that they may have individual functional roles in cancer prognosis. Interestingly, we found that some ALDHs are downregulated in tumors and are also associated with poorer prognosis, especially for ALDH2. This result is inconsistent with the current hypothesis for high ALDH activity in tumors with cancer stem-like features. ALDH2 is regarded as a mitochondrial enzyme with an activated form existing only in the mitochondrial matrix. In addition to acetaldehyde metabolism, it also plays a role in the removal of other reactive aldehydes derived from oxidative stress and lipid peroxidation, such as 4-hydroxy-nonenal and malondialdehyde. In human hepatocellular carcinoma, the downregulation of ALDH2 in the tumor has also been reported³¹. On the other hand, increasing mitochondria-associated gene expression is commonly observed during carcinogenesis or cancer progression^32,33, which was compatible but also opposite to the ALDH2 downregulation in the current study. These results all suggested that downregulation of ALDH2 in tumor may be associated with cancer progression and influence prognosis. For the proof of concept, we used the ALDH2 agonist, Alda-1, which can specifically enhance enzymatic activity both in ALDH2 wild type and mutant form³⁴ to treat cancer cells and observe the responsive phenotypes. The preliminary results showed that Alda-1 inhibited the migration/invasion ability as well as the cell viability in aggressive breast cancer MDA-MB-468 and MDA-MB-231 cells (Supplemental Fig. 5a and b). On the contrary, using siRNA to knockdown ALDH2 in MDA-MB-468 also increased migration (Supplemental Fig. 5e). In addition, both glycolysis and mitochondria respiration of HNSC FaDu cell was downregulated after Alda-1 treatment (Supplemental Fig. 5c and d). These results suggest that ALDH2 activity may be associated with cancer metabolism and influence cancer progression. Therefore, further exploratory experiments to confirm the underlying mechanism are warranted.

The East Asian-specific ALDH2 rs671 SNP has raised attention and has been demonstrated to be a strong genetic factor for increased cancer risk, especially in patients with high alcohol intake¹⁶. ALDH2 rs671 is a SNP resulting in a K487E mutation³⁵. This single amino acid mutation causes a severe functional deficiency of the ALDH2 enzymatic activity which then leads to acetaldehyde accumulation, even after intake of a single alcoholic beverage¹⁷ and is believed to be the underlying cause of increased cancer risks for HNSC³⁶ and ESSC¹⁷. In the current study, because the TCGA cohort represents data for mostly non-Asian subjects, the effects of the ALDH2 rs671 SNP on the DE for ALDH2 could only be analyzed from our own VGHTPE cohort. The results showed similar downregulation of ALDH2 in tumors with the ALDH2 GG wild type allele and the rs671 GA heterozygous allele, suggesting that this may be a general regulation independent from the ALDH2 SNP. Larger data collection from cohorts of Asian patients with cancer is therefore needed for future studies.

Furthermore, some tumor type-specific DEs were also observed. We noticed that more variations existed in HNSC when compared to the other 4 cancer types. HNSC is the most common cancer occurrence among middle-aged males in Taiwan and the sixth most common cancer in the world³⁷. The etiology of HNSC is attributed to the exposure to environmental carcinogens derived from alcoholic beverages, cigarette smoke and betel nut use. Exposure to these environmental carcinogens incurs repeated damage to the upper aerodigestive tract mucosa cells and results in DNA damage, inappropriate modulation of autophagy and and carcinogenesis^38,39,40. The complex interaction of several environmental carcinogens makes the spectrum of the mutations observed in HNSCs very heterogeneous and individualized. From the PCA analyses of both the TCGA database and our VGHTPE cohorts, it is almost impossible to identify a simple differentiating genetic signature based on comparisons between the tumor vs. normal tissues from patients with HNSC (Supplemental Fig. 6). Compared to other cancers with more separated gene expression between the tumor and normal tissues, this phenomenon may account for the controversial result between HNSC and the other 4 cancers.

Lastly, there are some limitations in this study. First, the retrospective cohort from our own databank is small and with an inevitable selection bias. Furthermore, the mitochondrial DNA (mtDNA) was not used for the RNA-seq, which also makes it impossible to evaluate their DEs with the DEs for other nuclear DNA coding mitochondria proteins. Finally, only 5 common cancer types were involved in this pilot study; therefore, any specific variations between cancers should be concluded only after completing a more comprehensive analysis with even more cancer types. For the prognostic survey of more other cancers, a newly developed bioinformatics tool “SurvExpress” (http://bioinformatica.mty.itesm.mx:8080/Biomatec/SurvivaX.jsp), which contains more 20,000 patient samples from 142 datasets, can be used for cross validation and more comprehensive analysis with multiple genes could be achieved in the future²⁸.

In summary, this study provides a new insight into the ALDH family with their DEs in tumors vs. normal tissues as well as their association with cancer prognosis. For the high prevalence of the ALDH2 rs671 SNP, ALDH2 downregulation not only increases cancer risk but also influences cancer prognosis. This study provided the first systemic analysis for the differential expression and prognosis of all 19 human ALDH isoenzymes from publically available datasets, which may be applicable for other functional group of oncogenes, such as HER or VEGF families. Novel ALDH modulators could also be developed in the future, according to the prognostic role of each ALDH isoform as the biomarker. The results may have significant clinical implications and may also raise concerns for public health issues. Further research is therefore needed to focus on the relationship between the ALDH2 SNP, DE and their associated cancer phenotypes.

Methods

Gene expression profiling and differential gene expression analysis from TCGA

Using the TCGA database⁴¹, we extracted expression values of protein-coding genes for the following five types of carcinomas: LUAD, LUSC, BRCA, ESSC and HNSC. The expression values of protein-coding genes for adjacent non-cancerous tissues from surgical specimens were also extracted. The expression value data were extracted from the TCGA data matrix in August 2015 with the following criteria: disease type: (LUSC, LUAD, BRCA and HNSC), data type: RNASeq V2, data level: 3, batch number: all, platform: UNC (IlluminaHiSeq_RNASeqV2). ESSC data was extracted additionally in July 2017. The fold change of expression values between cancerous and normal tissues were expressed as log₂ transformation. The gene-specific read counts were preprocessed with quantile normalization with the R package preprocessCore. The calculated p-values were adjusted to q-values for multiple testing using the Benjamini–Hochberg correction.

In silico analysis of prognosis and ALDH expression in human cancer

PrognoScan is a bioinformatics tool that identifies an optimal threshold with the minimum p-value to separate the “high” and “low” expressing groups for survival difference in the selected genes. To control for type I errors, the p-value was corrected by the standard formula and shown as “corrected p-value”²¹. First, we extracted the microarray studies from only solid tumors with corrected p-values < 0.05, which meant the prognosis of the high vs. low expressing subgroups could be separated significantly. Hazard ratios (HRs) and 95% confidence intervals (CIs) for overall survival (OS) and PFS of each selected microarray study were downloaded from the Prognoscan database. The survival results were pooled with the meta-analysis through the Review Manager software, version 5.3 (Cochrane Collaboration).

Analysis of HNSC clinical samples and data collection

Thirty matched, pairwise tumor/normal human HNSC samples, which were stored in liquid nitrogen immediately after resection, were selected from collections in the Taipei Veteran’s General Hospital (VGHTPE) tissue bank. All participants have signed informed consents before donating their tissue samples into this legal tissue bank. Also available with the 30 matched pairwise samples were confidential clinicopathological data that were used for the genomic study and correlation analysis in this study. DNA was extracted from these samples and dissolved in 1 x TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) for ALDH2 SNP determination. The QC of DNA OD was within 1.8–2.0. The quantified samples were then diluted to 10–20 ng/µl. Total RNA (1–5 µg) with concentrations >200 ng/μl were decontaminated by using DNase I and dissolved in 1 x TE or RNase-free H₂O. The QC of RNA for the RNA integrity number (RIN) was within 8.0 and the OD 260/280 was within 1.8–2.0. The extracted samples were then transferred in dry ice packages to the Yang Ming National University Genomic Center for further genotyping and RNA-seq. The local ethics committee (Taipei Veterans General Hospital, Taiwan, R.O.C.) approved this study (TPEVGH IRB No.: 2015–08–003CC). All experiments were performed in accordance with relevant guidelines and regulations.

Genotyping and whole transcriptome sequencing for HNSC

Genotyping of ALDH2*2 (rs671) was performed by Sequenom MassARRAY technology with iPLEX gold chemistry (Sequenom, San Diego, CA, USA). Briefly, the PCR primers and single-base extension primers were designed using the Assay Design Suite v2.0 software. The genotyping analysis was performed using the iPLEX Gold Reagent Kit (Sequenom) according to the manufacturer’s instruction. PCR followed by single-base primer extension was performed with 10 μg of the DNA sample (10 ng/µl). The extended reaction products were purified by cation-exchange resins and then spotted onto a 384-format SpectroCHIP II array using a MassArray Nanodispenser RS1000. Mass determination was performed on a MassARRAY Compact Analyzer. The resulting spectra were processed and alleles called with the MassARRAY Typer 4.0 (Sequenom) using the default settings. Extraction of RNA from frozen tissue samples was performed using the Qiagen RNeasyMini Kit. The quality of the RNA was assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA); samples with RIN >8 were used for further whole genome RNA-seq. The directional RNA-seq libraries were prepared using TruSeq Stranded mRNA Sample Prep Kit (Illumina). The sequencing libraries were sequenced on the HiSeq. 2500 platform (Illumina, San Diego, CA) by single-end sequencing with 100 bp read lengths to a depth of 28 to 42 million reads for each library. The RNA-seq data was analyzed with the CLC genomics workbench (Qiagen, Hilden, Germany). The quality of the raw read data in FASTQ format was assessed and reads of low quality were trimmed or removed. The adapter sequences were trimmed also. The sequenced reads were aligned to the NCBI_GRCH38 human reference genome and, following the removal of multi-mapping reads, converted to gene-specific read counts for annotated genes in the form of a reads per kilobase of exon model per million mapped reads (RPKM) value. The gene-specific read counts of HNSC samples were preprocessed with quantile normalization with the R package preprocessCore, with the sample procedure as processing TCGA data.

Cell viability assay

MDA-MB-468 and MDA-MB-231 Cells (1 × 10⁴) were seeded onto 24-well plates for 24 h and then treated with indicated concentration of Alda-1 (A kind gift from Dr. Che-Hong Chen, Stanford University) for 72 h. The treated cells were added 0.5 mg/mL MTT (Sigma-Aldrich) to each well and incubated for 3 h at 37 °C. The violet MTT formazan precipitates were subsequently dissolved in 100 μL of DMSO. The absorbance at 570 nm was measured on an UQuant reader.

Migration and invasion assays

The migration and invasion assays were performed in 24-well plate for 12 and 20 hours respectively. MDA-MB-468 cells (5 × 10⁴) in 200 μL of serum free medium were seeded onto upper Cell Culture Insert with 8 μm pores (Greiner Bio One) for migration assay and Matrigel matrix (Corning) coated Cell Culture Insert for invasion assay. The lower chamber contained 900 μL of complete medium. The cells migrated or invaded to the Cell Culture Insert membrane which were fixed with methanol for 10 minutes and stained with 0.005% crystal violet for 1 hour. The numbers of migrated or invaded cells were counted under the microscope from 10 random fields. For silencing ALDH2 assay, MDA-MB-468 were seeded for 24 hours and transfected with control siRNA or siALDH2 using Dharmafect 1 transfection reagent according to the manufacturer’s protocol. (Dharmacon, CO, USA) After 48 hours of transfection, cells were collected and resuspended in serum free medium for migration assay.

Seahorse metabolism assay

2 × 10⁴ FaDu Cells were seeded in XF24 cell culture microplates, with adding 10 μM Alda-1 or not and activated the probe in non CO2 incubator on the first day. Second day, replacing growth medium with assay medium in XF24 cell culture microplates at least for 1 hour at 37 °C before running the assay. Next, oligomycin, FCCP and Rotenone/antimycine A were loaded in sensor cartridge and then sensor cartridge was set in XF24 analyzer to correct the condition. After the correction, began to metabolic determination.

Statistical analysis

Overall pooled hazard ratios (HRs) were analyzed with a fixed effect model. Heterogeneity between microarray studies was investigated using Chi-square tests and the I² index that expresses the percentage variability of the results related to the heterogeneity rather than to the sampling error. Statistical significance of the overall result was expressed with the probability value (p-value) in the “test for overall effect.” The result is regarded as statistically significant if p < 0.05. We compared the expression values of the 19 ALDH isotypes in normal and cancerous tissues and used the Mann-Whitney U test to determine the level of statistical significance for the differences in expression values. To determine the significance of differential gene expression between cancerous and normal samples in case-control comparisons, cancerous and normal samples were treated as independent samples and the two-sample test was used. For the pair-wised comparison, cancer and normal tissue samples from the same patient were treated as dependent samples and the paired difference test was used. We used the expression values of all protein-coding genes to depict the level of similarity between the cancer and normal tissue sample via the principal coordinate analysis (PCA) plot. Pearson correlations were used to approximate distances between samples in the PCA plot. Clinicopathological variables were compared using the Chi-square test or the Fisher’s exact test to differentiate between each other. A p-value of <0.05 was regarded as statistically significant in the 2-sided tests. Kaplan-Meier methods were used to evaluate PFS or OS. Log-rank tests were used for comparisons. All statistical analyses were performed using the SPSS statistical software version 18 (SPSS, Chicago, IL, USA) and R package (version 3.01, http://www.rproject.Org).

Availability of data and materials

The datasets generated and/or analyzed during the current study are available in the TCGA repository (http://cancergenome.nih.gov/), PrognoScan repository (http://www.abren.net/PrognoScan/), KM plotter repository (http://kmplot.com/analysis/index) and SurvExpress repository (http://bioinformatica.mty.itesm.mx).

References

Li, M. H., Fu, S. B. & Xiao, H. S. Genome-wide analysis of microRNA and mRNA expression signatures in cancer. Acta. Pharmacologica Sinica. 36, 1200–1211 (2015).
Article PubMed PubMed Central Google Scholar
Botling, J. et al. Biomarker discovery in non-small cell lung cancer: integrating gene expression profiling, meta-analysis and tissue microarray validation. Clin. Cancer Res. 19, 194–204 (2013).
Article CAS PubMed Google Scholar
Bull, J. H. et al. Identification of potential diagnostic markers of prostate cancer and prostatic intraepithelial neoplasia using cDNA microarray. Br. J. Cancer 84, 1512–1519 (2001).
Article CAS PubMed PubMed Central Google Scholar
Hippo, Y. et al. Global gene expression analysis of gastric cancer by oligonucleotide microarrays. Cancer Res. 62, 233–240 (2002).
CAS PubMed Google Scholar
Cancer Genome Atlas Network. Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature 517, 576–582 (2008).
Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2008).
Cancer Genome Atlas Research Network. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455, 1061–1068 (2008).
Hoadley, K. A. et al. Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin. Cell 158, 929–944 (2014).
Article CAS PubMed PubMed Central Google Scholar
Perez-Miller, S. J. & Hurley, T. D. Coenzyme isomerization is integral to catalysis in aldehyde dehydrogenase. Biochemistry 42, 7100–7109 (2003).
Article CAS PubMed Google Scholar
Marcato, P. et al. Aldehyde dehydrogenase activity of breast cancer stem cells is primarily due to isoform ALDH1A3 and its expression is predictive of metastasis. Stem Cells 29, 32–45 (2011).
Article CAS PubMed Google Scholar
Ma, I. & Allan, A. L. The role of human aldehyde dehydrogenase in normal and cancer stem cells. Stem Cell Rev. 7, 292–306 (2011).
Article CAS PubMed Google Scholar
Cairo, S. et al. Hepatic stem-like phenotype and interplay of Wnt/beta-catenin and Myc signaling in aggressive childhood liver cancer. Cancer Cell 14, 471–484 (2008).
Article CAS PubMed Google Scholar
Magni, M. et al. Induction of cyclophosphamide-resistance by aldehyde-dehydrogenase gene transfer. Blood 87, 1097–103 (1996).
CAS PubMed Google Scholar
Moreb, J. S., Mohuczy, D., Ostmark, B. & Zucali, J. R. RNAi-mediated knockdown of aldehyde dehydrogenase class-1A1 and class-3A1 is specific and reveals that each contributes equally to the resistance against 4-hydroperoxycyclophosphamide. Cancer Chemother. Pharmacol. 59, 127–136 (2007).
Article CAS PubMed Google Scholar
Chen, M. H. et al. ALDH1A3, the major aldehyde dehydrogenase isoform in human cholangiocarcinoma cells, affects prognosis and gemcitabine resistance in cholangiocarcinoma patients. Clin. Cancer Res. 22, 4225–4235 (2016).
Article CAS PubMed Google Scholar
Gross, E. R. et al. A personalized medicine approach for Asian Americans with the aldehyde dehydrogenase 2*2 variant. Annu. Rev. Pharmacol. Toxicol. 55, 107–127 (2015).
Article CAS PubMed Google Scholar
Brooks, P. J., Enoch, M. A., Goldman, D., Li, T. K. & Yokoyama, A. The alcohol flushing response: an unrecognized risk factor for esophageal cancer from alcohol consumption. PLoS. Med. 6, e50, https://doi.org/10.1371/pmed1000050 (2009).
Article PubMed Google Scholar
Diao, C. Y. et al. Screening for metastatic osteosarcoma biomarkers with a DNA microarray. Asian Pac. J. Cancer Prev. 15, 1817–1822 (2014).
Article PubMed Google Scholar
Hayes, D. N. et al. Gene expression profiling reveals reproducible human lung adenocarcinoma subtypes in multiple independent patient cohorts. J. Clin. Oncol. 24, 5079–5090 (2006).
Article CAS PubMed Google Scholar
Kuijjer, M. L., Hogendoorn, P. C. & Cleton-Jansen, A. M. Genome-wide analyses on high-grade osteosarcoma: making sense of a genomically most unstable tumor. Int. J. Cancer 133, 2512–2521 (2013).
CAS PubMed Google Scholar
Mizuno, H., Kitada, K., Nakai, K. & Sarai, A. PrognoScan: a new database for meta-analysis of the prognostic value of genes. BMC. Med. Genomics 2, 18, https://doi.org/10.1186/1755-8794-2-18 (2009).
Article PubMed PubMed Central Google Scholar
Olcina, M. M. et al. H3K9me3 facilitates hypoxia-induced p53-dependent apoptosis through repression of APAK. Oncogene 35, 793–799 (2016).
Article CAS PubMed Google Scholar
Takahashi, Y. et al. The AURKA/TPX2 axis drives colon tumorigenesis cooperatively with MYC. Ann. Oncol. 26, 935–942 (2015).
Article CAS PubMed Google Scholar
Goto, Y. et al. UCHL1 provides diagnostic and antimetastatic strategies due to its deubiquitinating effect on HIF-1α. Nat. Commun. 6, 6153, https://doi.org/10.1038/ncomms7153 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tseng, G. C., Ghosh, D. & Feingold, E. Comprehensive literature review and statistical considerations for microarray meta-analysis. Nucleic Acids Res. 40, 3785–3799 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chang, J. S., Hsiao, J. R. & Chen, C. H. ALDH2 polymorphism and alcohol-related cancers in Asians: a public health perspective. J. Biomed. Sci. 24, 19 (2017).
Article PubMed PubMed Central Google Scholar
Szasz, A. M. et al. Cross-validation of survival associated biomarkers in gastric cancer using transcriptomic data of 1,065 patients. Oncotarget 7, 49322–49333 (2016).
Article PubMed PubMed Central Google Scholar
Aguirre-Gamboa, R. et al. SurvExpress: An online biomarker validation tool and database for cancer gene expression data using survival analysis. PLoS One 8, e74250 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Bellot, G. et al. TOM22, a core component of the mitochondria outer membrane protein translocation pore, is a mitochondrial receptor for the proapoptotic protein Bax. Cell Death Differ. 14, 785–94 (2007).
Article CAS PubMed Google Scholar
Feng, L. et al. ER stress-mediated apoptosis induced by celastrol in cancer cells and important role of glycogen synthase kinase-3β in the signal network. Cell Death Dis. 11, e715 (2013).
Article Google Scholar
Jin, S. et al. ALDH2(E487K) mutation increases protein turnover and promotes murine hepatocarcinogenesis. Proc. Natl. Acad. Sci. USA 112, 9088–9093 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
LaBiche, R. A., Demars, M. & Nicolson, G. L. Transcripts of the mitochondrial gene ND5 are overexpressed in highly metastatic murine large cell lymphoma cells. In Vivo. 6, 317–324 (1992).
CAS PubMed Google Scholar
Glaichenhaus, N., Léopold, P. & Cuzin, F. Increased levels of mitochondrial gene expression in rat fibroblast cells immortalized or transformed by viral and cellular oncogenes. EMBO. J. 5, 1261–1265 (1986).
CAS PubMed PubMed Central Google Scholar
Perez-Miller, S. et al. Alda-1 is an agonist and chemical chaperone for the common human aldehyde dehydrogenase 2 variant. Nat. Struct. Mol. Biol. 17, 159–64 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. H., Ferreira, J. C., Gross, E. R. & Mochly-Rosen, D. Targeting aldehyde dehydrogenase 2: new therapeutic opportunities. Physiol. Rev. 94, 1–34 (2014).
Article PubMed PubMed Central Google Scholar
Tsai, S. T. et al. The interplay between alcohol consumption, oral hygiene, ALDH2 and ADH1B in the risk of head and neck cancer. Int. J. Cancer 135, 2424–2436 (2014).
Article CAS PubMed Google Scholar
Jemal, A. et al. Cancer statistics, 2009. CA: a cancer journal for clinicians 59, 225–249 (2009).
Google Scholar
Hakenewerth, A. M. et al. Effects of polymorphisms in alcohol metabolism and oxidative stress genes on survival from head and neck cancer. Cancer Epidemiol. 37, 479–491 (2013).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. Correlation between superoxide dismutase 1 and 2 polymorphisms and susceptibility to oral squamous cell carcinoma. Exp. Ther. Med. 7, 171–178 (2014).
Article CAS PubMed Google Scholar
Lu, H. H. et al. Areca nut extract induced oxidative stress and upregulated hypoxia inducing factor leading to autophagy in oral cancer cells. Autophagy 6, 725–737 (2010).
Article PubMed Google Scholar
Chin, L., Andersen, J. N. & Futreal, P. A. Cancer genomics: from discovery science to personalized medicine. Nat. Med. 17, 297–303 (2011).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This study was supported by a research grant (105-2627-M-075-001) from the Ministry of Science and Technology, Taiwan to Peter Mu-Hsin Chang and a research grant from the United States NIH-NIAAA (AAA11147) to Daria Mochly-Rosen. The authors acknowledge the High-throughput Genome and Big Data Analysis Core Facility of National Core Facility Program for Biotechnology, Taiwan (MOST 104-2319-B-010-001) for SNP genotyping and RNA-seq. This study is partially supported by Taiwan Clinical Oncology Research Foundation. We thank Dr. Dennis, shin-shian Hsu for the Seahorse metabolism assay.

Author information

Authors and Affiliations

Department of Oncology, Taipei Veterans General Hospital, Taipei, 112, Taiwan
Peter Mu-Hsin Chang, Ming-Huang Chen, Chun-Yu Liu & Muh-Hwa Yang
Faculty of Medicine, National Yang Ming University, Taipei, 112, Taiwan
Peter Mu-Hsin Chang, Ming-Huang Chen, Chun-Yu Liu & Shyh-Kuan Tai
Department of Chemical and Systems Biology, Stanford University, School of Medicine, Stanford, CA, 94305, USA
Che-Hong Chen & Daria Mochly-Rosen
Jin An Clinic, New Taipei City, 24256, Taiwan
Chi-Chun Yeh
Division of Medical Oncology, Department of Internal Medicine, Chung Shan Medical University Hospital, Taichung, Taiwan
Hsueh-Ju Lu
School of Medicine, Chung Shan Medical University, Taichung, Taiwan
Hsueh-Ju Lu
Genome Research Center, National Yang-Ming University, Taipei, 112, Taiwan
Tze-Tze Liu
The Ph.D. Program for Translational Medicine, College of Science and Technology, Taipei Medical University, Taipei, 110, Taiwan
Alexander T. H. Wu
Institute of Clinical Medicine, National Yang Ming University, Taipei, 112, Taiwan
Muh-Hwa Yang
Division of Laryngology-Head and Neck Surgery, Taipei Veterans General Hospital, Taipei, 112, Taiwan
Shyh-Kuan Tai
Institute of Biopharmaceutical Sciences, National Yang Ming University, Taipei, 112, Taiwan
Chi-Ying F. Huang

Authors

Peter Mu-Hsin Chang
View author publications
You can also search for this author in PubMed Google Scholar
Che-Hong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Chun Yeh
View author publications
You can also search for this author in PubMed Google Scholar
Hsueh-Ju Lu
View author publications
You can also search for this author in PubMed Google Scholar
Tze-Tze Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Huang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Alexander T. H. Wu
View author publications
You can also search for this author in PubMed Google Scholar
Muh-Hwa Yang
View author publications
You can also search for this author in PubMed Google Scholar
Shyh-Kuan Tai
View author publications
You can also search for this author in PubMed Google Scholar
Daria Mochly-Rosen
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Ying F. Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.M.H. proposed the hypothesis, collected, analyzed and interpreted all of the data and wrote the manuscript. C.H. helped to establish the S.N.P. sequencing protocol and critically reviewed the manuscript. C.C., T.T. and A.T.H. performed the RNA-seq and transcriptome analyses. H.J., C.Y., M.H.-C., M.H.-Y., and S.K. supported the clinical samples and data. D.M.R. and C.Y. were major contributors to this study. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Daria Mochly-Rosen or Chi-Ying F. Huang.

Ethics declarations

Competing Interests

Daria Mochly-Rosen and Che-Hong Chen hold patents related to compound activator of ALDH2*2.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chang, P.MH., Chen, CH., Yeh, CC. et al. Transcriptome analysis and prognosis of ALDH isoforms in human cancer. Sci Rep 8, 2713 (2018). https://doi.org/10.1038/s41598-018-21123-4

Download citation

Received: 25 May 2017
Accepted: 30 January 2018
Published: 09 February 2018
DOI: https://doi.org/10.1038/s41598-018-21123-4

This article is cited by

Alternative RNA splicing modulates ribosomal composition and determines the spatial phenotype of glioblastoma cells
- Tatyana D. Larionova
- Soniya Bastola
- Marat S. Pavlyukov
Nature Cell Biology (2022)
Uncovering cancer vulnerabilities by machine learning prediction of synthetic lethality
- Salvatore Benfatto
- Özdemirhan Serçin
- Balca R. Mardin
Molecular Cancer (2021)
Endogenous aldehyde accumulation generates genotoxicity and exhaled biomarkers in esophageal adenocarcinoma
- Stefan Antonowicz
- Zsolt Bodai
- George B. Hanna
Nature Communications (2021)
Identification of cancer-type specific expression patterns for active aldehyde dehydrogenase (ALDH) isoforms in ALDEFLUOR assay
- Lei Zhou
- Dandan Sheng
- Suling Liu
Cell Biology and Toxicology (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.