Identifying gene expression patterns associated with drug-specific survival in cancer patients

Neary, Bridget; Zhou, Jie; Qiu, Peng

doi:10.1038/s41598-021-84211-y

Download PDF

Article
Open access
Published: 02 March 2021

Identifying gene expression patterns associated with drug-specific survival in cancer patients

Bridget Neary¹,
Jie Zhou¹ &
Peng Qiu²

Scientific Reports volume 11, Article number: 5004 (2021) Cite this article

5100 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The ability to predict the efficacy of cancer treatments is a longstanding goal of precision medicine that requires improved understanding of molecular interactions with drugs and the discovery of biomarkers of drug response. Identifying genes whose expression influences drug sensitivity can help address both of these needs, elucidating the molecular pathways involved in drug efficacy and providing potential ways to predict new patients’ response to available therapies. In this study, we integrated cancer type, drug treatment, and survival data with RNA-seq gene expression data from The Cancer Genome Atlas to identify genes and gene sets whose expression levels in patient tumor biopsies are associated with drug-specific patient survival using a log-rank test comparing survival of patients with low vs. high expression for each gene. This analysis was successful in identifying thousands of such gene–drug relationships across 20 drugs in 14 cancers, several of which have been previously implicated in the respective drug’s efficacy. We then clustered significant genes based on their expression patterns across patients and defined gene sets that are more robust predictors of patient outcome, many of which were significantly enriched for target genes of one or more transcription factors, indicating several upstream regulatory mechanisms that may be involved in drug efficacy. We identified a large number of genes and gene sets that were potentially useful as transcript-level biomarkers for predicting drug-specific patient survival outcome. Our gene sets were robust predictors of drug-specific survival and our results included both novel and previously reported findings, suggesting that the drug-specific survival marker genes reported herein warrant further investigation for insights into drug mechanisms and for validation as biomarkers to aid cancer therapy decisions.

Gene expression based inference of cancer drug sensitivity

Article Open access 27 September 2022

An integrated approach to biomarker discovery reveals gene signatures highly predictive of cancer progression

Article Open access 04 December 2020

Identification of important genes and drug repurposing based on clinical-centered analysis across human cancers

Article 18 June 2020

Introduction

Cancer has been a major focus in precision medicine because it is a heterogeneous disease with significant variations in therapeutic responses. Improved understanding of a drug’s molecular mechanisms and the relationship between its efficacy and molecular variation among tumors will help inform doctors’ decisions about individual patient treatment options, which will improve both overall patient outcomes and patient quality of life by decreasing the use of ineffective therapies. Thus, precision medicine aims to identify molecular markers in cancers to predict patients’ responses to different therapies and provide molecular insights into drug mechanisms.

Most research identifying molecular biomarkers of drug efficacy in cancer have been in the field of pharmacogenomics, which researches genome-level changes as potential biomarkers of drug response¹. However, in vitro studies indicate that gene expression variation accounts for even more variability in drug sensitivity than genomic changes do and may offer better insight into clinical drug efficacy²; yet, there have been few systematic efforts to identify gene expression patterns that influence tumors’ drug sensitivity. While some in vitro studies have sought to identify the relationship between gene expression and drug response by studying differential gene expression when cells are exposed to a drug or by linking cell line gene expression profiles and drug sensitivity³, most do not consider real patient outcomes. Previous studies incorporating gene expression and patient drug response were limited to specific cancers or drugs⁴, or focused exclusively on genes implicated in drug metabolism⁵.

The Cancer Genome Atlas (TCGA) is a large dataset with multiple types of molecular data from primary tumors before treatment from a range of cancers and corresponding clinical information, including drug exposures and survival data. The RNA-seq dataset from TCGA is an excellent resource for predictive biomarker identification because pre-treatment gene expression provides a snapshot of a tumor’s transcriptional state at diagnosis, when decisions about treatment options are made. Previously, our group manually standardized drug exposure data to identify gene copy number variations related to survival in a drug-specific manner^6,7. Analyzing the gene expression data in combination with these clinical data is a similarly powerful strategy for identification of biomarkers for drug-specific survival.

In this study, we perform drug-specific survival analyses to identify genes and gene sets whose pre-treatment expression levels are associated with therapeutic response. We grouped patients based on cancer type and drug exposure and identified genes where patients with high and low pre-treatment expression of that gene had significant survival differences after exposure to that drug. We then clustered these genes into sets based on frequency of co-expression among patients in that group. We identified thousands of gene–drug relationships, with which we subsequently queried PubMed to identify previous reports linking them. Here, we present the results of our analysis, which show promise as potential transcriptomic biomarkers with predictive value for therapeutic response.

Results

An integrative pipeline for drug-specific survival analysis

To identify drug-specific survival markers based on gene expression, we integrated drug treatment data, survival data, and RNA-seq gene expression data from TCGA. As part of preprocessing, the gene expression values were binarized based on a high/low threshold calculated separately for each of the 60,483 genes across expression values of all samples for which gene expression data were available. We stratified patients by cancer type and drug exposure: for every unique cancer–drug combination, we defined a patient group as all patients with that cancer treated with the given drug. For each group, we used a log-rank test to compare survival outcomes between patients with high vs. low pre-treatment expression of each gene. In this way, we identified genes whose pre-treatment expression was associated with statistically significant survival differences for that cancer–drug patient group.

Next, for each cancer–drug patient group with at least ten significant genes identified, we used a gene clustering algorithm to define sets of these genes that tended to be co-expressed among patients⁸. To test whether each gene set was predictive of drug-specific survival, we used a log-rank test to compare patients in the relevant group expressing a high number of the genes in that set to patients expressing few of the genes. We calculated the threshold number of genes required for the high expression group for each gene set using the percentage of expressed genes in the gene set across patients in that group using the same method used in the binarization step of preprocessing.

To compare our results with current knowledge about molecular interactions with various cancer therapies, we performed gene set enrichment analysis (GSEA) on each identified set of co-expressing genes to look for enrichment of transcription factor (TF) target genes in the set. We also ran a literature search on PubMed programmatically for each gene–drug combination associated with survival identified in the individual gene analysis as well as each drug–TF combination identified in the GSEA of our co-occurring gene sets.

This pipeline is summarized in Fig. 1. More details on the analysis can be found in the “Methods” section.

Individual gene expression predictive of drug-specific survival

TCGA has RNA-seq data for 3533 patients with drug treatment records and survival data. This cohort included 32 cancer types and 284 unique drugs (after drug name standardization). The drug treatment records for these patients consisted of 8836 drug treatment entries, which each included patient information, drug name, time frame of the treatment, etc. After excluding cancer–drug patient groups with fewer than 20 patients, there were 99 groups ranging up to 469 patients. The heat map in Fig. 2 shows the number of patients in each of the 99 cancer–drug groups.

For each cancer–drug patient group, we performed survival analysis on all genes with at least ten low expressors and ten high expressors within the group. We determined significant differential survival using a log-rank test with a 10% false discovery rate (FDR) for the group. Out of 2.2 million cancer–drug–gene combinations tested, we identified 9216 where patients with that cancer who took that drug have significantly different survival rates when stratified by expression of that gene. These occurred across 46 cancer–drug groups, which included 14 cancers and 20 drugs, and we identified 7832 unique genes that were significant in at least one cancer–drug patient group. There were 9212 unique gene–drug interactions identified, with four that were significant in more than one cancer. Table 1 highlights a selected subset of gene–drug interactions we identified, which included the gene–drug interaction that showed the largest difference in drug-specific survival for each of the cancers in our analysis.

Table 1 Top gene–drug interactions across cancers.

Full size table

Our analysis identified many cancer-specific gene–drug interactions, including previously characterized gene–drug interactions as well as ones that are novel and have never been reported in the literature. To gauge the extent of literature support for the identified gene–drug interactions, we queried PubMed for published papers mentioning the drug and gene for each of the 9212 significant gene–drug interactions identified. While most of these gene–drug queries returned no results, indicating that the identified gene–drug interactions have not been previously reported, 531 returned at least one result on PubMed and 158 had three or more papers mentioning the gene and the drug. This strategy identified the gene–drug pairs that are likely to have literature support and helped us confirm multiple examples of our identified gene–drug relationships that have been previously described.

One example of a known gene–drug interaction that our analysis identified is between the gene XRCC2, a key player in the homologous recombination process, and temozolomide, a methylating agent. In our literature search, we found studies showing that lower XRCC2 in cancer cells increases temozolomide efficacy by inhibiting their ability to repair the DNA damage induced by temozolomide^9,10. Our survival analysis showed that lower grade glioma patients with tumors expressing lower levels of XRCC2 prior to treatment have better outcomes on temozolomide (Fig. 3A), potentially because the function of XRCC2 counteracts the drug’s mechanism of action. We also identified a previously reported gene–drug relationship between fluorouracil and TWIST1. Studies have shown that silencing TWIST1 can increase certain cancer cells’ sensitivity to fluorouracil^11,12, which agrees with our findings that, among patients taking fluorouracil for stomach adenocarcinoma, survival outcomes are better for patients with low expression levels of TWIST1 than for those with high TWIST1 expression (Fig. 3B).

We also found examples of genes that interact positively with drugs. For example, studies have shown that antiproliferative BTG1 acts synergistically with paclitaxel in certain cancer cell lines: cells with induced BTG1 overexpression were more sensitive to paclitaxel and exhibited lower post-treatment expression of chemoresistance genes than controls^13,14. This aligns with our results, which show that head and neck cancer patients with higher levels of BTG1 had significantly better survival after taking paclitaxel (Fig. 3C). Additionally, we identified a previously reported relationship between SMAD4 and carboplatin. Mutations in the SMAD4 gene have been linked to resistance of platinum-based drugs like carboplatin^15,16, and our data suggest that head and neck cancer patients on carboplatin stratified by pre-treatment SMAD4 expression have significantly differential survival between the strata (Fig. 3D).

We also identified four gene–drug interactions occurring in multiple cancer types. Figure 4 shows the Kaplan–Meyer survival curves comparing high and low expressors of these genes in two different cancer–drug patient groups. Three of the four interactions occurred in low-grade glioma and glioblastoma, while the fourth occurred between LPP and paclitaxel in breast invasive carcinoma and head and neck squamous cell carcinoma. A previous study in ovarian tumor-bearing mice linked LPP silencing with increased chemosensitivity and improved delivery of paclitaxel to tumor cells, which improved the effectiveness of the drug¹⁷. In contrast, our analysis found worse patient outcomes in patients with low LPP expression; nevertheless, it is encouraging that previous literature has implicated a connection between LPP and paclitaxel.

Table 2 summarizes the total numbers of individual genes identified per cancer–drug group and their literature search results. The full list of identified significant gene–drug interactions can be found in Additional file 1. Given the literature support found for many of the identified gene–drug interactions, the novel and highly significant interactions we identified, such as those highlighted in Table 1, are worth investigating for biological insights and validation as biomarkers of drug efficacy.

Table 2 Summary of survival analyses of individual genes for each group.

Full size table

Gene clusters predictive of drug-specific survival

Clustering the significant genes from each cancer–drug patient group identified 32 different sets of co-expressing genes in eight of the cancer–drug patient groups. For each gene set, we stratified the patients into high and low gene-set expressors based on the percentage of set genes they expressed, and then tested for differential survival between the strata. All identified gene sets showed statistically significant survival differences, and many were more significant than the majority (> 95%) of individual genes in those gene sets. See Additional file 2 for the lists of genes in each gene set.

To elucidate the biological context and meaning of these co-expressing gene sets, we performed gene set enrichment analysis (GSEA) using MSigDB¹⁸. Of the 32 identified gene sets, 21 were significantly enriched for target genes of at least one transcription factor (TF). We then performed a literature search for each TF–drug combination identified in the GSEA. Table 3 summarizes the gene set analysis results.

Table 3 Summary of gene set analysis.

Full size table

Literature searches revealed that many of these TFs have been discussed in the context of the corresponding drug. For example, we identified a set of genes in head and neck cancer patients taking paclitaxel where patients with high set expression have better survival than low expressors. This gene set was significantly enriched for targets of NF-κB, which previous studies found to be related to paclitaxel efficacy^19,20. Another gene set found in head and neck cancer patients taking carboplatin showing survival differences between high and low expressors was enriched for target genes of NRF2, and activation of the NRF2 pathway has been linked to carboplatin resistance^21,22. In addition, we identified a set of co-expressed genes enriched for SRY targets that exhibits differential survival among low-grade glioma patients on temozolomide, and previous studies have shown a link between the SRY pathway and sensitivity to temozolomide^23,24,25. This literature support lends credibility to our analysis strategy and findings and suggests that many other TFs identified in our analysis may also contribute to differences in patient response to specific drugs.

Discussion

Our analysis identified many genes and gene sets whose expression is associated with survival times in various cancer–drug patient groups. With this analysis, we hoped to identify gene–drug interactions that may impact drug efficacy. We found four gene–drug combinations that had a significant association with survival in more than one cancer type. Of these four, three are significant only in closely related cancers: low-grade glioma is a grade II glioma, and glioblastoma multiforme is a grade IV glioma that can arise from a low-grade glioma or develop de novo^26,27. The low number of gene–drug combinations related to survival that transcended cancer type is likely due to the fact that only a small number of drugs are used in multiple cancers, as illustrated in Fig. 2. In addition, given that most of the gene–drug combinations that are significant in multiple cancers occur in the same tissue, it is possible that many of the identified effects are tissue-specific.

The identified sets of co-expressing genes have several advantages over individual genes as predictors of patient response to therapy. As noted earlier, many of these gene sets stratified the patients into groups with larger survival differences than any of the individual genes in those gene sets. This indicates that, compared to individual genes alone, these gene sets can more accurately separate patients into responders and non-responders. In addition, the gene sets exhibiting the strongest associations with survival contain genes that are part of a similar transcriptome profile and stratify the patients similarly. This means that these gene sets could make better biomarkers than individual genes because they are less vulnerable to measurement errors, minor differences in threshold calculations, or patient-to-patient variability in expression of one or a small number of genes.

While TCGA is a rich resource for genomic and integrative analyses, it is not without its limitations. Drilling down to such fine granularity as cancer- and drug-specific patient groups means that several of the groups contain very few patients. It is likely that we may miss relevant genes due to lower statistical power in smaller cancer–drug patient groups. We mitigated this to the extent possible by excluding small cancer–drug patient groups from our analysis, analyzing only the genes with at least ten low expressors and ten high expressors in a given group, and using a generous FDR to define significance; however, our results would benefit from validation with larger datasets for each cancer–drug patient group. Another limitation is that most of the drug exposure data do not include records of the treatment response, which is why our analysis uses patient survival outcome as an indirect measure of drug efficacy. Since treatment schedules for a given drug can vary between patients and some patients received multiple drugs, patient survival outcome is an imperfect surrogate to measure drug efficacy.

We performed literature searches on PubMed to identify previous reports of the gene–drug and TF–drug relationships identified in our analysis. The results of the PubMed search were reported in Tables 2 and 3. Our PubMed search strategy was rudimentary, and it was not feasible to manually confirm a link between the corresponding gene and drug in the large number of papers from the search. Without a manual review of the papers, the quantity of PubMed search results may not directly represent the amount of literature supporting a particular gene–drug pair. This is especially true when the gene name overlaps with English words, author names, or common abbreviations. However, despite these limitations, our success in manually confirming literature support for multiple examples of gene–drug interactions suggests a high likelihood that literature support exists for many of the gene–drug pairs whose corresponding papers we did not review. Additionally, because it is unlikely that a PubMed query would return no results for a gene–drug pair with a previously reported interaction, we can reasonably conclude that a large majority of our identified gene–drug interactions with no PubMed results are novel and have not been previously reported.

Many of the gene–drug interactions we identified were novel and did not have literature support, and many of our gene sets had no obvious unifying biological interpretation. While many of our identified gene–drug interactions are sufficiently promising to warrant further investigation into the biological mechanisms, these findings could be useful as biomarkers even before the underlying biological mechanism is fully understood. The most significant gene–drug interactions, such as the examples shown in Table 1, have high predictive value, and could serve as biomarkers for drug efficacy.

Conclusion

In this analysis, we identified many genes that are associated with drug-specific survival outcomes in various cancers. In addition, we were able to identify sets of co-expressed genes that were, in many cases, more strongly associated with patient survival than any of the individual genes in those gene sets, and therefore had higher potential predictive value.

This analysis successfully identified putative biomarkers for drug response in a range of cancers based on gene expression. Therefore, a future research direction is to replicate this analysis in other omics data types available in TCGA, such as DNA methylation, miRNA expression and protein expression, for further insights into variation in drug response among patients. This analysis can then be extended to integrate multiple omics data types for a multi-omics understanding of the molecular variations predictive of drug-specific survival outcomes.

The interactions we identified in our analysis are promising and warrant further investigation, which could yield valuable biological insights into drug mechanisms and variations in drug response. In addition, many of our findings show promise as potential biomarkers of drug response that could be used clinically to predict whether a patient will do well on a drug. Validating these as biomarkers would help doctors in formulating treatment strategies with the highest chances of success for each patient and would be a measurable step toward improving precision medicine.

Materials and methods

Data acquisition

We acquired TCGA gene expression data and drug treatment data from the Genomic Data Commons (GDC) database using the GDC Data Transfer Tool. We obtained the file manifest for data files via the GDC API and used the GDC Data Transfer Tool to download files. The parameters used when creating the manifest were “return_type: manifest” along with the filters “files.data_type: Gene Expression Quantification” and “analysis.workflow_type: HTSeq—FPKM-UQ” for RNA-seq data and “files.data_type: Clinical Supplement” and “files.data_format: BCR Biotab” for clinical data.

Patient survival and other clinical data were queried through the GDC API for the most current information.

Data preprocessing

Drug names from TCGA were standardized based on a manually curated list created by our group previously⁷. The RNA-seq dataset was obtained as FPKM-UQ values and log-transformed for better distribution. We then calculated a binarization threshold to delineate high vs. low expression values for each gene by adapting the StepMiner method described previously²⁸. Briefly, gene expression values are ordered from lowest to highest and then fitted with a step function that minimizes the mean square error within the two groups. We tested 400 thresholds for each gene, 200 between evenly distributed bins of samples and 200 evenly distributed through the range of the expression values.

Survival analysis

All patients with a given cancer and exposed to a given drug were split into high and low expression groups for each gene, and survival was compared between these two groups using a log-rank test. All cancer–drug–gene combinations were analyzed that had a minimum of ten patients in the low and high expression groups. Log-rank calculations and Kaplan–Meyer curves were generated using the lifelines Python package. Q-values were calculated using the Benjamini–Hochberg procedure to control for multiple hypothesis testing with 10% FDR (performed using the fdrcorrection function in the statsmodels Python package).

Co-occurrence clustering

We adapted a clustering method developed for the analysis of single cell RNA-seq data called co-occurrence clustering to identify sets of co-expressed genes⁸. Briefly, this algorithm constructs a gene–gene graph based on a chi-square pairwise association measure and uses the Louvain algorithm for community detection to identify gene clusters from the graph, then clusters patients similarly based on their expression levels of each gene cluster. This process then iterates for each patient cluster identified. We used this algorithm to identify co-occurring gene sets among the individual genes with significantly differential survival in each cancer–drug patient group. For each gene set identified, we used the percentages of the member genes that were highly expressed for each patient to calculate a binarization threshold to stratify patients into high and low gene-set expression groups and tested for differential survival.

Literature search

Literature searches were conducted using a Python script with the Bio.Entrez package from Biopython. Queries were formulated as the drug name and the gene or TF name separated by “AND” and relevant PMIDs were retrieved using efetch.

TF target gene enrichment analysis

To examine whether the gene sets we identified through co-occurrence clustering of drug-specific survival marker genes were significantly enriched for TF targets, we performed gene set enrichment analysis (GSEA) using the Molecular Signatures Database 7.0 (MSigDB). Specifically, we used the GSEA tool (version 4.0.0) to compute overlaps between the gene sets we identified and the sub-collection of MSigDB gene sets that were known or predicted targets of various transcription factors. In MSigDB, the target gene set of a transcription factor is defined as either genes whose predicted binding site for the given TF is within − 1000 to + 500 bp of the transcription start site or genes with upstream cis-regulatory motifs in the promoter region. A detailed explanation can be found at https://www.gsea-msigdb.org/gsea/msigdb/collection_details.jsp#GTRD. We then identified the TFs whose target genes were significantly enriched in each gene set using a 5% FDR to determine significance.

Data availability

The TCGA dataset analyzed in this study is available in the GDC repository, https://portal.gdc.cancer.gov/repository. The MSigDB gene sets are available at https://www.gsea-msigdb.org.

Abbreviations

TCGA:: The Cancer Genome Atlas
GSEA:: Gene set enrichment analysis
TF:: Transcription factor
FDR:: False discovery rate
GDC:: Genomic Data Commons

References

Lauschke, V. M., Milani, L. & Ingelman-Sundberg, M. Pharmacogenomic biomarkers for improved drug therapy-recent progress and future developments. AAPS J. 20(1), 4 (2017).
Article Google Scholar
Costello, J. C. et al. A community effort to assess and improve drug sensitivity prediction algorithms. Nat. Biotechnol. 32(12), 1202–1212 (2014).
Article CAS Google Scholar
Shee, K., Wells, J. D., Jiang, A. & Miller, T. W. Integrated pan-cancer gene expression and drug sensitivity analysis reveals SLFN11 mRNA as a solid tumor biomarker predictive of sensitivity to DNA-damaging chemotherapy. PLoS ONE 14(11), e0224267 (2019).
Article CAS Google Scholar
Han, Y., Huang, H., Xiao, Z., Zhang, W., Cao, Y., Qu, L., et al. Integrated analysis of gene expression profiles associated with response of platinum/paclitaxel-based treatment in epithelial ovarian cancer. PLoS ONE 7(12), e52745 (2012).
Zimmermann, M.T., Therneau, T.M., Kocher, J.P.A. The impact of pharmacokinetic gene profiles across human cancers. BMC Cancer 18(1), 577 (2018).
Spainhour, J.C.G., Qiu, P. Identification of gene–drug interactions that impact patient survival in TCGA. BMC Bioinform. 17(1), 409 (2016).
Spainhour, J. C. G., Lim, J. & Qiu, P. GDISC: a web portal for integrative analysis of gene–drug interaction for survival in cancer. Bioinformatics 33(9), 1426–1428 (2017).
CAS PubMed PubMed Central Google Scholar
Qiu, P. Embracing the dropouts in single-cell RNA-seq analysis. Nat. Commun. 11(1), 1169 (2020).
Article ADS CAS Google Scholar
Roos, W. P. et al. Brca2/Xrcc2 dependent HR, but not NHEJ, is required for protection against O(6)-methylguanine triggered apoptosis, DSBs and chromosomal aberrations by a process leading to SCEs. DNA Repair. (Amst.). 8(1), 72–86 (2009).
Article CAS Google Scholar
Tsaryk, R., Fabian, K., Thacker, J. & Kaina, B. Xrcc2 deficiency sensitizes cells to apoptosis by MNNG and the alkylating anticancer drugs temozolomide, fotemustine and mafosfamide. Cancer Lett. 239(2), 305–313 (2006).
Article CAS Google Scholar
Przybyla, T., Wesserling, M., Sakowicz-Burkiewicz, M., Maciejewska, I. & Pawelczyk, T. The Level of TWIST1 expression determines the response of colon cancer cells to mitogen-activated protein kinases inhibitors. Saudi J. Gastroenterol. 24(1), 37–45 (2018).
Article Google Scholar
Zhu, D. J. et al. Twist1 is a potential prognostic marker for colorectal cancer and associated with chemoresistance. Am. J. Cancer Res. 5(6), 2000–2011 (2015).
PubMed PubMed Central Google Scholar
Zhao, S. et al. BTG1 might be employed as a biomarker for carcinogenesis and a target for gene therapy in colorectal cancers. Oncotarget. 8(5), 7502–7520 (2017).
Article Google Scholar
Zheng, H. C. et al. BTG1 expression correlates with pathogenesis, aggressive behaviors and prognosis of gastric cancer: A potential target for gene therapy. Oncotarget. 6(23), 19685–19705 (2015).
Article Google Scholar
Crow, J. et al. Exosomes as mediators of platinum resistance in ovarian cancer. Oncotarget. 8(7), 11917–11936 (2017).
Article Google Scholar
Cardona, A. F. et al. Multigene mutation profiling and clinical characteristics of small-cell lung cancer in never-smokers vs heavy smokers (Geno1.3-CLICaP). Front. Oncol. 9, 254 (2019).
Article Google Scholar
Leung, C. S. et al. Cancer-associated fibroblasts regulate endothelial adhesion protein LPP to promote ovarian cancer chemoresistance. J. Clin. Invest. 128(2), 589–606 (2018).
Article Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102(43), 15545–15550 (2005).
Article ADS CAS Google Scholar
Crozier, M. & Porter, L. A. Paclitaxel-induced transcriptional regulation of Fas signaling pathway is antagonized by dexamethasone. Breast Cancer Res. Treat. 154(1), 33–44 (2015).
Article CAS Google Scholar
Mabuchi, S. et al. Inhibition of inhibitor of nuclear factor-kappaB phosphorylation increases the efficacy of paclitaxel in in vitro and in vivo ovarian cancer models. Clin. Cancer Res. 10(22), 7645–7654 (2004).
Article CAS Google Scholar
Konstantinopoulos, P. A. et al. Keap1 mutations and Nrf2 pathway activation in epithelial ovarian cancer. Cancer Res. 71(15), 5081–5089 (2011).
Article CAS Google Scholar
Liby, K. T. Synthetic triterpenoids can protect against toxicity without reducing the efficacy of treatment with Carboplatin and Paclitaxel in experimental lung cancer. Dose Response. 12(1), 136–151 (2014).
Article CAS Google Scholar
Li, B., Zhao, H., Song, J., Wang, F. & Chen, M. LINC00174 down-regulation decreases chemoresistance to temozolomide in human glioma cells by regulating miR-138-5p/SOX9 axis. Hum. Cell. 33(1), 159–174 (2020).
Article CAS Google Scholar
Wang, Z. et al. SOX9-PDK1 axis is essential for glioma stem cell self-renewal and temozolomide resistance. Oncotarget. 9(1), 192–204 (2018).
Article Google Scholar
Xu, X. et al. Association between SOX9 and CA9 in glioma, and its effects on chemosensitivity to TMZ. Int. J. Oncol. 53(1), 189–202 (2018).
CAS PubMed Google Scholar
Ohgaki, H. & Kleihues, P. The definition of primary and secondary glioblastoma. Clin. Cancer Res. 19(4), 764–772 (2013).
Article CAS Google Scholar
Schneider, T., Mawrin, C., Scherlach, C., Skalej, M. & Firsching, R. Gliomas in adults. Dtsch Arztebl Int. 107(45), 799–807; quiz 808 (2010).
PubMed PubMed Central Google Scholar
Sahoo, D., Dill, D. L., Tibshirani, R. & Plevritis, S. K. Extracting binary signals from microarray time-course data. Nucleic Acids Res. 35(11), 3705–3712 (2007).
Article CAS Google Scholar

Download references

Funding

This work was partially supported by funding from the National Science Foundation [CCF1552784 and CCF2007029]; and the Giglio Family Breast Cancer Fund. P.Q. is an ISAC Marylou Ingram Scholar and a Carol Ann and David D. Flanagan Faculty Fellow. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
Bridget Neary & Jie Zhou
Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, USA
Peng Qiu

Authors

Bridget Neary
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Peng Qiu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors were involved in method development. B.N. and J.Z. acquired and cleaned data. B.N. performed the analysis and drafted the manuscript. P.Q. revised the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Peng Qiu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Neary, B., Zhou, J. & Qiu, P. Identifying gene expression patterns associated with drug-specific survival in cancer patients. Sci Rep 11, 5004 (2021). https://doi.org/10.1038/s41598-021-84211-y

Download citation

Received: 13 October 2020
Accepted: 11 February 2021
Published: 02 March 2021
DOI: https://doi.org/10.1038/s41598-021-84211-y

This article is cited by

MetaTron: advancing biomedical annotation empowering relation annotation and collaboration
- Ornella Irrera
- Stefano Marchesin
- Gianmaria Silvello
BMC Bioinformatics (2024)
Integrative analysis of TCGA data identifies miRNAs as drug-specific survival biomarkers
- Shuting Lin
- Jie Zhou
- Peng Qiu
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

An integrative pipeline for drug-specific survival analysis

Individual gene expression predictive of drug-specific survival

Gene clusters predictive of drug-specific survival

Discussion

Conclusion

Materials and methods

Data acquisition

Data preprocessing

Survival analysis

Co-occurrence clustering

Literature search

TF target gene enrichment analysis

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links