Introduction

Ovarian cancer is the deadliest gynecologic malignancy, with 44.6% five-year overall survival (OS) and 1% lifetime mortality risk1. Standard management includes surgical staging and cytoreduction followed by adjuvant intravenous (IV) platinum-taxane chemotherapy2. Compared to treatment with only IV chemotherapy, addition of adjuvant intraperitoneal (IP) chemotherapy increased progression free survival (PFS) and OS among patients with stage III ovarian cancer3,4,5. Gynecologic Oncology Group protocol 172 reported a median PFS and OS increase of 5.5 and 15.9 months, respectively, after IP chemotherapy5. A recent systemic analysis confirmed increased PFS and OS after adjuvant IP chemotherapy6. Survival advantage after adjuvant IP chemotherapy extended beyond 10 years7.

Despite available strategies for preemptive management of side effects of IP chemotherapy, widespread adoption of IP chemotherapy remains limited by toxicities8,9. Prognostic factors including age, histology, and cytoreduction aided selection of patients for IP chemotherapy10. Low BRCA1 protein expression measured by immunohistochemistry of primary tumor specimens was associated with 36 months increased OS among patients treated with IP compared to IV-only adjuvant chemotherapy11. In an effort to identify biomarkers of response to IP chemotherapy among high grade serous ovarian cancer (HGS OvCa) patients, we used data from The Cancer Genome Atlas (TCGA) to discover genes with mRNA expression levels that were associated with PFS and OS after treatment with adjuvant IP or IV chemotherapy.

Methods

Case selection and data collection

Chemotherapy information was from the TCGA drug dataset12. Patients were included in the study based on their chemotherapy exposures. Patients who received adjuvant IP ± IV (n = 90) or only IV (n = 398) chemotherapy were included (Supplementary Dataset 1). Patients who received neoadjuvant chemotherapy (n = 1) or who had no available clinical outcome data (n = 5) were excluded. We used Level 3 mRNA expression data from primary tumor specimens measured by Affymetrix U133A microarray, annotated with the hthgu133a.db package13,14, or by RNA sequencing (RNA-Seq). TCGA previously described specimen collection and assay validation12,13. Clinical and mRNA expression data was downloaded using CGDS-R15,16. Data was public per TCGA polices17.

Survival analysis by chemotherapy exposure

We used the R platform for statistical computing18,19. P-values were two-tailed. Demographic and outcome information were compared using statistical tests indicated with Results. Incomplete clinical data reporting to TCGA was common. Cases with missing data were excluded from statistical comparisons. Cytoreduction was considered optimal if residual disease was ≤10 mm after surgery. PFS and OS data was truncated at 60 months to prevent biasing results by long surviving individuals and to match a restriction time of 60 months set for restricted mean survival (RMS) calculations. Empiric Kaplan-Meier (KM) survival analysis and Cox proportional-hazards regression adjusted for covariates age, surgical stage, histologic grade, cytoreduction status and race were used to compare PFS and OS by chemotherapy route (IP versus IV). For KM analysis of all cases (n = 488), 96 and 42 cases were omitted due to missing OS and PFS data, respectively. For KM analysis of optimally cytoreduced cases (n = 282), 47 and 18 cases were omitted due to missing OS and PFS data, respectively. For multivariate Cox regression of OS and PFS for all cases (n = 488), 178 and 144 cases, respectively, were omitted due to missing data. For multivariate Cox regression of OS and PFS for optimally cytoreduced cases (n = 282), 52 and 22 cases, respectively, were omitted due to missing data.

Discovery and validation of differentially expressed genes

Within each adjuvant chemotherapy group, cases were stratified by PFS time <or ≥12 months. Twelve months was used because cancers that recurred or progressed before 12 months were likely chemoresistant. Patients who undergo surgical staging and six cycles of adjuvant chemotherapy often reach 6 months after completion of chemotherapy, the time at which platinum sensitivity is designated, about 12 months after surgery. A diagram of the exploratory gene analysis and validation steps is shown in Supplementary Table 1. A t-test compared microarray mRNA expression between PFS strata to discover differentially expressed genes. Standard t-testing was used because the results are fully reproducible unlike permutation methods of microarray data analysis, which yield a larger number of differentially expressed genes with a higher number of false positive discoveries. Microarray data was used for exploratory analysis because too few patients had RNA-Seq data for gene discovery. Genes evaluated were listed under KEGG pathways identified by the microarray annotation14,20. Multiple comparison FDR adjusted p-values and fold changes were calculated for differentially expressed genes. As a first validation that differentially expressed genes are associated with differences in survival, Cox regression of PFS and OS by microarray mRNA expression with inclusion of covariates age, surgical stage and histologic grade was performed for all differentially expressed genes. For multivariate Cox regression (microarray) of OS and PFS for IP patients (n = 90), 30 and 18 cases, respectively, were omitted due to missing data. For multivariate Cox regression (microarray) of OS and PFS for IV patients (n = 398), 121 and 90 cases, respectively, were omitted due to missing data. As a second validation, multivariate Cox regression for differentially expressed genes was performed with the RNA-Seq data. For multivariate Cox regression (RNA-Seq) of OS and PFS for IP patients (n = 34), 7 and 2 cases, respectively, were omitted due to missing data. For multivariate Cox regression (RNA-Seq) of OS and PFS for IV patients (n = 187), 34 and 21 cases, respectively, were omitted due to missing data. Genes significantly associated with OS and/or PFS by both validations were considered positive discoveries, creating a stringent validation.

Survival differences between IP versus IV chemotherapy groups subdivided by gene expression

RMS curves were plotted for differentially expressed genes as functions of normalized relative microarray mRNA expression21. RMS times were the areas under the Cox regression adjusted survival curves of OS and PFS using a restriction time of 60 months21,22. For univariate Cox regression of OS and PFS for IP patients (n = 90), 16 and 2 cases, respectively, were omitted due to missing data. For univariate Cox regression of OS and PFS for IV patients (n = 398), 80 and 40 cases, respectively, were omitted due to missing data. Univariate Cox regression was used to calculate RMS times for plotting RMS curves. The p-values for OS and PFS reported on RMS curves in Results are multivariate regression derived p-values after adjustment for age, stage, and grade. Tumor mRNA expression was normalized by ordering TCGA expression values from least to greatest and calculating a continuous relative quantile value from 0 to 1 for each expression value. Normalized relative expression values were used to place cases along the x-axis of RMS curves. RMS curves represent the mean survival times (y-axis values) of patients with up to 60 months clinical follow-up given a particular relative mRNA expression level (the x-axis values). Permutation testing (10,000 permutations) was done to test the significance of the difference in mean OS and PFS between patients treated with IP versus IV chemotherapy given a specified range of gene expression. The null hypothesis was that patients treated with IP chemotherapy have significantly increased OS and PFS at all expression levels of each gene. This null hypothesis is consistent with survival analysis of the clinical data (see Results). Findings from permutation testing with microarray expression data were validated by identical permutation testing using RNA-Seq data. Any gene for which the null hypothesis is rejected by both methods is considered a candidate biomarker that warrants experimental validation.

Results

Patient and chemotherapy information

Table 1 shows patient and disease characteristics for the adjuvant IP and IV chemotherapy groups. Patients who received IP chemotherapy were younger (55.3 versus 59.9 years, p < 0.001) and more often achieved optimal cytoreduction (86.6% versus 70.4%, p = 0.007). Adjuvant chemotherapy administration for the IP and IV groups is shown in Table 2. In the IP group, 89/90 (99%) patients received both IP and IV chemotherapy. Median IV chemotherapy cycles completed was 6 in both groups. In the IP group, 51% of patients completed 6 or more cycles of IP chemotherapy.

Table 1 Comparison of patient and case characteristics between adjuvant chemotherapy groups.
Table 2 Adjuvant chemotherapy administration.

Survival outcomes by chemotherapy route

KM survival curves comparing survival by chemotherapy route are shown in Fig. 1. IP chemotherapy was associated with increased PFS (26.7 versus 16.0 months, HR (95% CI) 0.43 (0.28–0.66), p = 0.0001) and OS (49.6 versus 38.2 months, HR (95% CI) 0.46 (0.25–0.83), p = 0.01) among all (n = 488) cases. Analyzing optimally cytoreduced cases (n = 281) separately, IP chemotherapy was associated with increased PFS (26.5 versus 15.1 months, HR (95% CI) 0.46 (0.29–0.73), p = 0.001) and OS (56.5 versus 39.1 months, HR (95% CI) 0.42 (0.21–0.85), p = 0.017). Age, stage, grade, cytoreduction status, and race were not significant covariates. There were no significant differences in OS or PFS among suboptimally cytoreduced patients (Supplementary Figure 1).

Figure 1: Kaplan-Meier survival curves for patients treated with intraperitoneal (IP) and intravenous (IV) chemotherapy.
figure 1

Log rank is log rank p-value from Kaplan-Meier analysis. Cox PH is multivariate Cox proportional hazards regression p-value for chemotherapy route with adjustment for age, surgical stage, histologic grade, cytoreduction status, and race.

Gene expression analysis by PFS stratification

Supplementary Table 2 shows patient and disease characteristics for chemotherapy groups stratified by PFS < or ≥12 months. Only 9/90 (10%) patients in the IP group had PFS < 12 months. The odds ratio for PFS < 12 months in the IP versus IV group was 0.38, p = 0.011 (Fisher’s Exact test). Thirty-six and three genes were differentially expressed between PFS strata within the IV and IP groups, respectively (Supplementary Dataset 2 and Supplementary Table 3). Differentially expressed genes that were significantly associated with OS and/or PFS by multivariate regression are tabulated with adjusted (FDR) p-values and fold changes for differential expression between PFS strata, as well as with HRs for the associations of microarray or RNA-Seq mRNA expression with PFS and OS (Table 3).

Table 3 Associations of differentially expressed gene tumor mRNA expression levels with survival outcomes.

Associations of gene expression with route-specific chemoresistance

Univariate and multivariate associations of microarray mRNA expression with PFS and OS were used to validate differentially expressed genes (Validation step 1) (Supplementary Tables 3 and 4). Hazards ratios (HRs) of genes significantly associated with OS and/or PFS ranged from 0.39 to 6.35 (Supplementary Table 4). To make HRs comparable, gene expression values were scaled such that adjusted HRs are interpretable as the change in hazard per 1 standard deviation change in gene expression (Table 3). For instance, an adjusted HR of 1.30 (example: ASPA, overall survival) indicates that the risk of death increased 30% for each 1 standard deviation increase in microarray mRNA expression across the range of observed expression values. Since expression data varies over several standard deviations for each gene, survival effects for some genes are large comparing the lower and upper limits of gene expression (see Supplementary RMS curves for visual illustrations). Almost half (19/39, 49%) of the differentially expressed genes were associated with PFS and/or OS by multivariate regression using microarray data (Validation step 1) (Table 3). Only 5/39 (12.8%) of the differentially expressed genes were associated with PFS and/or OS by multivariate regression using RNA-Seq data (Validation step 2) (Supplementary Tables 5 and 3). Increased expression of NCAM2 and TSHR and decreased expression of GCNT1 was associated with decreased PFS and OS after IV chemotherapy (p < 0.05) (Table 3).

Candidate biomarker discovery for patient selection for IP chemotherapy

Since patients treated with IP chemotherapy typically have significantly increased OS and PFS compared to patients who receive IV-only chemotherapy, biomarker evaluation for response to IP chemotherapy may aim to discover primary tumor biomarkers that are associated with worse or no better survival after IP chemotherapy. RMS curves demonstrate PFS and OS by relative gene mRNA expression (Supplementary RMS curves) allowing for evaluation of survival as a function of gene expression across the range of observed gene expression values. Pointwise 95% confidence intervals around each univariate RMS curve allow visual comparison of OS and PFS between IP and IV groups (Supplementary RMS curves). Illustrative RMS curves are shown (Fig. 2). Comparing univariate RMS curves and multivariate RMS times between IP and IV groups evaluates differentially expressed genes as biomarkers for treatment benefit from IP chemotherapy. For instance, according to microarray data, patients whose tumors had low expression of APC2 or high expression of BCAT1 experienced less survival benefit from the addition of IP chemotherapy to their treatment (Fig. 2). Patients whose tumors had high expression of PER1 or TSHR experienced decreased survival compared to patients whose tumors did not have high expression of these genes, with the associations of gene expression and survival outcomes reaching statistical significance in the larger cohort of patients who received IV-only adjuvant chemotherapy (Fig. 2). Table 4 lists differentially expressed genes for which there is evidence to reject the null hypothesis based on microarray data and designate the gene as a candidate biomarker for patient selection for IP chemotherapy (see Methods above). For example, patients with high expression (upper 10th percentile) of FZD5 showed an expected overall survival benefit among the IP chemotherapy group but did not show a significant PFS benefit (no significant difference between IP and IV-only groups) (Table 4). Table 4 also shows the results of validation permutation testing using RNA-Seq data. High tumor expression of LMAN2, FZD4, FZD5, or STT3A was associated with no significant PFS increase after IP compared to IV-only chemotherapy. Low expression of APC2 and high expression of FUT9 was associated with 5.5 and 7.2 months, respectively, decreased OS after IP compared to IV-only chemotherapy (p ≤ 0.007).

Figure 2: Restricted mean survival curves by normalized relative gene expression, shown with pointwise 95% confidence intervals.
figure 2

IP: Intraperitoneal. IV: Intravenous. PFS: Progression free survival. OS: Overall survival. Cox PH: multivariate Cox proportional hazards regression p-value for the association of gene expression with indicated survival times after adjustment for age, surgical stage and histologic grade.

Table 4 Comparisons of mean RMS times between patients treated with IP versus IV chemotherapy selected by relative expression quantile range.

Discussion

Adjuvant IP chemotherapy is associated with increased PFS and OS in TCGA data. A large and increasing number of publications reported survival outcomes of TCGA ovarian cancer patients without adjusting for chemotherapy route as a potential confounder. Reanalysis of previously reported survival outcomes to adjust for chemotherapy route as a potential confounder may be needed given that IP chemotherapy use was common among TCGA HGS OvCa patients and that receiving IP chemotherapy was associated with large and highly significant increases in PFS and OS.

A detailed review of the tumor biology of all candidate biomarkers is beyond the scope of this report. DAVID gene functional classification failed to reveal any clusters among differentially expressed genes23. Three differentially expressed genes, APC2, FZD4, and FZD5, which had mRNA levels that were significantly associated with OS and PFS, are members of the Wnt signaling pathway. The observation that increased expression of APC2 and decreased expression of FZD4 and FZD5 was associated with decreased OS and PFS among patients treated with IV chemotherapy suggests that downregulation of Wnt signaling may play a role in chemoresistance24,25. The Wnt pathway may be targeted in cancer trials25,26. Recently, advanced ovarian cancers with increased membrane β-catenin expression by immunohistochemistry were shown to have decreased PFS and increased platinum-resistance, consistent with our findings27.

Limitations of our study include incomplete clinical data reporting to TCGA, a relatively small number of patients who received IP chemotherapy, and also small numbers of tumors that were analyzed by RNA-Seq, especially among the IP group. During the RNA-Seq validation, many survival associations trended (p = 0.05–0.10) but failed to reach significance due to few cases having RNA-Seq data. Clinical and chemotherapy information reported to TCGA was also not verified for correctness. We could not perform validation in another patient cohort because no other suitable data is available. To our knowledge, TCGA provides the only cohort of ovarian cancer patients treated with IP chemotherapy for which there is currently transcriptome or proteome data. Also, tumor protein expression was not confirmed experimentally by immunohistochemistry of primary tumor specimens. The TCGA reverse phase protein array does not include any of the differentially expressed genes and therefore the TCGA proteomics data could not be used as a potential protein-level validation of our transcription-level findings. In addition, the process of gene discovery by t-testing and gene validation with limited RNA-Seq data is very stringent. Thus, there are likely additional genes that are significantly associated with survival outcomes among this patient cohort. Targeted rather than exploratory analyses of well-known molecular pathways driving cancer development or chemotherapy resistance may lead to discovery of additional genes with similar survival associations after adjusting for chemotherapy exposure.

Strengths of our study include use of a standard and fully reproducible statistical test (the t-test) to compare gene expression levels between groups. We also analyzed gene mRNA expression levels as a continuous variable in all regression models of PFS or OS. Dichotomizing or otherwise arbitrarily subdividing cases by gene expression thresholds is very commonly reported in the literature but is less informative than RMS analysis. RMS curves illustrate the relationships of gene expression and survival. We provide TCGA case identifiers with chemotherapy route to ease reproduction of our findings and reanalysis of previous studies.

Adjuvant IP chemotherapy of TCGA patients was associated with increased PFS and OS similar to randomized trials3,4,5. Our study is the second study, after one other recent report, to demonstrate off-trial survival advantage associated with adjuvant IP chemotherapy use that included some common “modified regimens” that are often individual provider-designed to decrease toxicity28. Our study reports exploration of transcriptome data motivated by a practical, clinically-oriented question: Given the increased morbidity of IP chemotherapy, are there biomarkers for response to IP chemotherapy that may aid selection of patients for IP chemotherapy? We discovered candidate biomarkers that are significantly associated with chemoresistance and decreased survival, or lack of benefit from IP chemotherapy. Our findings generate hypotheses regarding route-specific chemoresistance that may be tested using in vivo models. A targeted clinical assay for efficient measurement of primary tumor gene expression of these candidate biomarkers may be developed. Some gene products may be amenable to measurement in serum or ascites, or by primary tumor immunohistochemistry. Our findings may be corroborated at the level of protein expression by performing immunohistochemical analysis of primary tumor specimens, if made available, from a previous randomized trial of IP versus IV chemotherapy3,4,5. If similar results are obtained, a study of specimens from a previous randomized trial may efficiently provide validation of these candidate biomarkers as clinical biomarkers for patient selection for IP chemotherapy.

Additional Information

How to cite this article: Seagle, B.-L. L. et al. Discovery of candidate tumor biomarkers for treatment with intraperitoneal chemotherapy for ovarian cancer. Sci. Rep. 6, 21591; doi: 10.1038/srep21591 (2016).