A machine learning approach to identify predictive molecular markers for cisplatin chemosensitivity following surgical resection in ovarian cancer

Shannon, Nicholas Brian; Tan, Laura Ling Ying; Tan, Qiu Xuan; Tan, Joey Wee-Shan; Hendrikson, Josephine; Ng, Wai Har; Ng, Gillian; Liu, Ying; Ong, Xing-Yi Sarah; Nadarajah, Ravichandran; Wong, Jolene Si Min; Tan, Grace Hwei Ching; Soo, Khee Chee; Teo, Melissa Ching Ching; Chia, Claramae Shulyn; Ong, Chin-Ann Johnny

doi:10.1038/s41598-021-96072-6

Download PDF

Article
Open access
Published: 19 August 2021

A machine learning approach to identify predictive molecular markers for cisplatin chemosensitivity following surgical resection in ovarian cancer

Nicholas Brian Shannon^1,2,3,
Laura Ling Ying Tan^1,2,3,
Qiu Xuan Tan^1,2,3,
Joey Wee-Shan Tan^1,2,3,
Josephine Hendrikson^1,2,3,
Wai Har Ng^1,2,3,
Gillian Ng^1,2,3,
Ying Liu^1,2,3,
Xing-Yi Sarah Ong^1,2,3,
Ravichandran Nadarajah⁴,
Jolene Si Min Wong^1,2,
Grace Hwei Ching Tan^1,2,
Khee Chee Soo^1,2,5,
Melissa Ching Ching Teo^1,2,5,
Claramae Shulyn Chia^1,2,5 &
…
Chin-Ann Johnny Ong^1,2,3,5,6

Scientific Reports volume 11, Article number: 16829 (2021) Cite this article

2743 Accesses
12 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Ovarian cancer is associated with poor prognosis. Platinum resistance contributes significantly to the high rate of tumour recurrence. We aimed to identify a set of molecular markers for predicting platinum sensitivity. A signature predicting cisplatin sensitivity was generated using the Genomics of Drug Sensitivity in Cancer and The Cancer Genome Atlas databases. Four potential biomarkers (CYTH3, GALNT3, S100A14, and ERI1) were identified and optimized for immunohistochemistry (IHC). Validation was performed on a cohort of patients (n = 50) treated with surgical resection followed by adjuvant carboplatin. Predictive models were established to predict chemosensitivity. The four biomarkers were also assessed for their ability to prognosticate overall survival in three ovarian cancer microarray expression datasets from The Gene Expression Omnibus. The extreme gradient boosting (XGBoost) algorithm was selected for the final model to validate the accuracy in an independent validation dataset (n = 10). CYTH3 and S100A14, followed by nodal stage, were the features with the greatest importance. The four gene signature had comparable prognostication as clinical information for two-year survival. Assessment of tumour biology by means of gene expression can serve as an adjunct for prediction of chemosensitivity and prognostication. Potentially, the assessment of molecular markers alongside clinical information offers a chance to further optimise therapeutic decision making.

Prediction of tumor origin in cancers of unknown primary origin with cytology-based deep learning

Article Open access 16 April 2024

Feasibility of functional precision medicine for guiding treatment of relapsed or refractory pediatric cancers

Article Open access 11 April 2024

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Article 18 April 2024

Introduction

Ovarian cancer is the seventh most common cancer in women, with over 290,000 cases diagnosed in 2018¹. It is also the eighth most common cause of cancer death in women^2,3. Ovarian cancer is often asymptomatic in its early stages, with approximately 60% of women having stage III or IV disease at the time of diagnosis⁴. The prognosis of ovarian cancer is poor, with a 5-year survival rate ranging from 30 to 50%⁵. Peritoneal carcinomatosis is a common feature of advanced stage epithelial ovarian carcinoma.

In the treatment of advanced-stage epithelial ovarian cancer, primary surgical cytoreduction and adjuvant chemotherapy are the preferred initial treatments. Certain patients who are poor surgical candidates due to extensively invasive disease or multiple comorbidities may undergo neoadjuvant chemotherapy. The first-line choice of chemotherapy for patients with advanced epithelial ovarian cancer includes a platinum and taxane combination⁶. Platinum agents include carboplatin and cisplatin.

While initial response rates to platinum-based chemotherapy are high at 70–80%⁷, the majority of patients with distant stage disease often relapse within two years⁸. Factors affecting tumour recurrence include initial disease stage, presence of optimal cytoreduction, CA125 levels and treatment response after chemotherapy.

Chemotherapy resistance contributes significantly to tumour recurrence. The Gynaecologic Oncology Group has stratified patients based on platinum sensitivity⁹, with platinum resistance defined as occurring in those who relapse within six months of platinum-containing therapy. Furthermore, the platinum-free interval, defined as the length of time between the last administration of the platinum agent and the appearance of disease progression, is known to correlate strongly with overall survival (OS) and progression-free survival (PFS)¹⁰. Despite the importance of platinum sensitivity, no predictive markers of platinum chemosensitivity in ovarian cancer have been integrated into routine clinical practice.

By identifying novel markers of chemosensitivity in ovarian cancer, we can improve therapeutic decisions by individualizing therapy for patients. Strategies for patients identified as at risk of developing platinum resistance include the early assessment of disease control and consideration of intraperitoneal or second-line chemotherapy if disease control is suboptimal. Identifying patients at risk of platinum resistance can also guide the selection of immunotherapy drugs in maintenance therapy.

We hypothesize that tumour biology plays a significant role in affecting the tumour response to chemotherapy. Understanding various genomic alterations that alter chemotherapy resistance to platinum agents would allow us to determine predictive markers of platinum resistance in ovarian cancer patients¹¹. We aim to generate a panel of predictive markers of platinum chemosensitivity and to retrospectively validate these markers in tumour samples from patients who have received adjuvant chemotherapy.

Material and methods

The study was approved by the SingHealth Centralised Institutional Review Board (Reference No.: 2015/2479). The study was conducted in compliance with all applicable SingHealth institutional policies and regulations, and the Declaration of Helsinki. Informed consent was obtained from all participants.

Identification of potential predictive molecular markers

Publicly available datasets from the Genomics of Drug Sensitivity in Cancer (GDSC) and The Cancer Genome Atlas (TCGA) databases were used to select candidate genes for validation. The GDSC dataset consists of > 1000 genetically characterised human cell lines treated with a variety of anticancer therapeutics with matched genomic and expression data, allowing the identification of genetic features predictive of sensitivity. 39 ovarian cancer cell lines with cisplatin sensitivity data were identified. Genes were filtered based on area under the receiver operating characteristic (ROC) curve (AUC) scores ≥ 0·8, utilising gene expression to stratify cell lines into resistant vs intermediate/sensitive or sensitive vs intermediate/resistant.

We also used TCGA data, which consists of over 20,000 primary cancer and normal samples across multiple cancer types with extensive omics profiling and matched clinical information. Due to the limited number of patients treated with cisplatin (n = 19) in the ovarian cancer (OV) TCGA dataset, we decided to examine head and neck squamous cell carcinoma (HNSC). For the HNSC dataset, recurrence-free survival data were available for 518 patients. These patients were stratified into those who had been treated upfront with a platinum compound (n = 124) and those who were chemo-naive (n = 335). Patients treated with other chemotherapy agents as first-line or as subsequent therapy were excluded. Genes were filtered based on the AUC score utilising gene expression to stratify patients to early recurrence within two years vs recurrence after two years or no recurrence (platinum-treated samples, n = 36 early recurrence vs 77 late/no recurrence; chemo-naive samples, n = 34 early recurrence vs 233 late/no recurrence), with a median follow-up of two years. As recurrence is only a surrogate measure of chemosensitivity, we filtered genes limited to chemotherapy-treated patients. Genes were selected for further validation if the AUC in the platinum group was greater than the AUC in the chemo-naive group by at least 0·1 (n = 3688).

Study population

Tissue microarrays (TMAs) containing 60 single 1-mm cores representative of ovarian cancer surgical specimens (n = 60) were purchased from TriStar Technology Group, LLC (Washington, DC, USA). These 60 surgical specimens were obtained from various accredited hospitals. Briefly, a trained pathologist identified and annotated the location of the tumour of interest on a haematoxylin and eosin (H&E) stained section of the paraffin block obtained from the hospitals. Using a TMA builder instrument, the core of interest would be punched out from the donor paraffin block and transferred to a pre-determined location on the recipient paraffin block to create the array. The constructed TMA would be lightly heated to fuse the cores with the recipient block. Of the 60 specimens, follow-up data of 59 cases were obtained. Seven of these cases consisted of patients with low-grade ovarian cancer who had undergone cytoreductive surgery and received no subsequent adjuvant therapy. One sample was excluded from the analysis because the TMA core had insufficient tissue for further analysis. The remaining cases (n = 51) consisted of patients with high-grade ovarian cancers who first underwent cytoreductive surgery. They subsequently received intravenous (IV) adjuvant chemotherapy consisting of either a carboplatin-taxol combination or carboplatin. Patients’ response to chemotherapy was assessed based on the Response Evaluation Criteria in Solid Tumours (RECIST)¹². The change in tumour size is an objective indicator for assessing the efficiency of chemotherapy¹³. Response to chemotherapy was classified into progressive or stable disease. Patients were identified as having recurrence when there was progressive disease (PD) or when they had died of the disease or complications due to ovarian cancer. Patients with no recurrence were identified as having no evidence of disease (NED) or stable disease (SD). PD was defined as at least a 20% increase in the sum of the diameters of the target lesions. SD was identified when the sum of the diameters of the target lesions did not increase in size by more than 50% or decrease in size by more than 30%, and no new tumours developed¹². Response was assessed after completion of neoadjuvant chemotherapy at a median duration of 181 days [IQR 151–184] from time of surgery. In the event of progressive disease, the patients were treated with carboplatin as well as a host of other chemotherapeutic agents. Their subsequent response to chemotherapy was also recorded. The clinical characteristics of the patients who received postoperative IV chemotherapy (n = 51) are further detailed in Table 1.

Table 1 Clinicopathological features of the patients (n = 50).

Full size table

Immunohistochemistry

The TMAs were used to assess the expression of various proteins by immunohistochemistry (IHC). The antibody concentrations, optimum IHC staining conditions and antibody sources are described in Supplementary Table S1. IHC staining was performed on a Bond System (Leica Microsystems, Ltd., Milton Keynes, UK) according to the manufacturer’s recommendations. Staining intensity was stratified into 4 groups: negative (score 0), weak (score 1), moderate (score 2), and strong (score 3) for CYTH3, GALNT3 and S100A14 (Supplementary Fig. S1). For ERI1, scoring was binarized into negative (score 0) and positive (score 1), as the staining on the TMA only included samples with negative or weak staining despite the adequate optimisation of ERI1 antibody concentration on preliminary tissue and TMAs. The staining results were determined by 2 independent researchers (LLYT and NBS) blinded to the outcomes. For any discrepancies between the scores, a third researcher (JWST) scored the TMA sample independently to determine the final assigned score.

Gene expression datasets

Three gene expression microarray datasets obtain from the Gene Expression Omnibus (GEO) GSE26193, GSE49997, GSE9891 had both clinical data available and the four genes present on the platform used to profile gene expression. Expression of the genes in these datasets was binarised to high or low based on the median expression and a four-gene model trained on the TMA data used to obtain a prediction score for each sample in the expression dataset. The prediction score was then assessed for prognostication of two-year overall survival. To compare to prognostication with clinical information alone, a leave one out approach was used, with the clinical information (tumour grade and tumour stage) for the other two datasets used to generate a model predicting survival in the third.

Data processing and statistical analysis

The statistical modelling and generation of algorithms were performed using Python 3. Accuracy and ROC curve analysis were used to assess model performance. An AUC with a value of 1 indicates an ideal result, whereas values lower than 0·5 indicate an insignificant result. Generally, a value above 0·7 is satisfactory, and a value above 0·8 is excellent.

Modelling of clinical and molecular data

A machine learning predictive model was subsequently generated to compare the combined clinical and molecular data. This process consisted of 2 aspects, namely, data input and model selection and training, as illustrated in Fig. 3a. As part of the data input, preprocessing was conducted. The protein expression of the above four genes was binarized to 0 or 1 vs 2 + (GALNT3) or 0 vs 1 + (CYTH3, ERI1, and S100A14). The clinical factors assessed were binarized as large (T3 +) vs small (T1-T2) tumours and node positive (N1 +) vs node negative (N0). Metastasis (M0 vs M1) and grade (2 vs 3) were already in a binary format. The data were split into a training set comprising 80% of the patients (n = 40) and a validation subset (n = 10) comprising 20% of the patients.

As a part of the model selection and training, typical (linear and radial support vector machine, logistic regression, K-nearest neighbour, decision tree and random forest) and boosted (AdaBoost, gradient boosting machine, XGBoost) algorithms were trained, and their performance was assessed on the training subset. Each algorithm was used to generate a predictive model. Feature selection was applied to the training dataset. The feature selection process selects the most relevant molecular markers and patient characteristics. Thresholds for each selected feature were chosen. This predictive model’s performance was subsequently assessed on the independent validation data subset. Each model was trained using five-fold cross validation with stratification to ensure consistent class distribution, as well as an internal five-fold cross validation for hyperparameter tuning utilising a 80%/20% split of training data. The learning rate of XGBoost, the final model selected, was set to 0·01, and the maximum depth of a tree was set to 3 to reduce the model complexity. To prevent overfitting, the subsample was set to 0·8. These and other hyperparameters were optimized by grid search and cross validation.

Results

Identification of potential predictive markers

Following a comprehensive investigation of the GDSC dataset, 41 ovarian cancer cell lines were identified. Of these, 39 had cisplatin sensitivity data and were stratified as sensitive (IC₅₀ < 10 μM, n = 13), intermediate (IC₅₀ 10—30 μM, n = 12) or resistant (IC₅₀ > 30 μM, n = 14). A total of 248 genes were identified (n = 204 resistant, n = 31 sensitive, and n = 13 stratifying both).

A TCGA-based filter was then applied to identify genes for which stratification of patients to early recurrence within two years vs recurrence after two years or no recurrence was more apparent in platinum-treated samples (platinum-treated samples, n = 36 early recurrence vs 77 late/no recurrence; chemo-naive samples, n = 34 early recurrence vs 233 late/no recurrence). Applying this filter to the genes identified from GDSC (n = 248) narrowed the gene list to 61 genes.

Further pragmatic filtering was performed to assess the concordance of the GDSC and TCGA data (i.e., if high expression predicted resistance in the GDSC dataset, high expression (upper quartile) should segregate patients to early recurrence in the TCGA dataset (selecting n = 27 of 61 genes)). The final list of genes was ranked by their expression difference between sensitive and resistant cell lines, and the genes were assigned points by rank according to their expression difference, using the AUC score to identify resistant or sensitive cell lines (1st place: two points, 2nd place: one point). The four genes with two points were selected for further analysis (CYTH3, ERI1, GALNT3, and S100A14). The process of selecting gene markers of chemoresistance is illustrated in Fig. 1. All four of these genes demonstrated a correlation between gene expression and cisplatin half maximal inhibitory concentration (IC₅₀) and could potentially be used to predict chemosensitivity to cisplatin, as illustrated in Fig. 2.

A machine learning predictive model was subsequently generated to compare the combined clinical and molecular data, as illustrated in Fig. 3a. Typical (linear and radial support vector machine, logistic regression, K-nearest neighbour, decision tree and random forest) and boosted (AdaBoost, gradient boosting machine, XGBoost) algorithms were trained, and their performance was assessed on the training subset. XGBoost was selected for the final model because it had a higher accuracy than the other algorithms (Fig. 3b).

The extreme gradient boosting (XGBoost) algorithm is a tree learning algorithm that generates a decision tree to predict an outcome variable based on a series of rules concerning other variables arranged in a tree-like structure. The decision tree consists of a series of split points based on the values of the input features with the final node being a leaf giving the specific value of the output variable.

In the decision tree modelling, the model can be interpreted by looking at the importance of features across the model (Fig. 4a). Both pathological and molecular features contributed to model prediction and the features with greatest importance consisted of two of the four genes (CYTH3 and S100A14), followed by N stage. The final XGBoost model tested on the validation dataset had higher performance (Fig. 4b) than logistic regression (Fig. 4c). For the XGB model, the two patients who failed first-line chemotherapy were placed 1st and 4th. For the logistic regression model, the two patients who failed first-line chemotherapy were placed 1st and 6th.

Validation of the four-gene signature

A model consisting of the four genes trained on the TMA dataset to predict response to chemotherapy was then applied to external datasets, assessing performance of the model against the surrogate outcome of two-year survival. AUC for survival at two years was 0·67 (CI 0·55–0·79, n = 107), 0.61 (CI 0·51–0·71, n = 194) and 0·63 (CI 0·56–0·72, n = 260) in each of the three datasets respectively. Prediction using clinical information alone (tumour stage and grade), despite being trained to predict two-year survival was marginally less predictive with AUC of 0·63 (CI 0·51–0·74), 0·62 (CI 0·52–0·72), and 0·58 (CI 0·5–0·66) respectively.

Discussion

There are no established predictors of platinum resistance that are currently utilised in routine clinical practice. Hence, our aim was to identify certain key molecular features that contribute to platinum chemotherapy resistance. This was accomplished by integrating both our molecular and clinical data to generate a predictive model to determine chemotherapy resistance to platinum agents.

We studied four genes identified using the GDSC and TCGA datasets. These genes were profiled in a TMA consisting of samples from stage III and IV ovarian cancer patients who received adjuvant chemotherapy, with response assessed in serial CT imaging using the RECIST criteria. After combining and processing the molecular and clinical data, the XGB model indicated that the features with the greatest importance in predicting chemosensitivity include the 2 markers CYTH3 and S100A14, followed by nodal stage (Fig. 4a). This finding indicates that patients with higher expression levels of CYTH3 and S100A14 and node-positive ovarian cancer have the greatest risk of resistance to platinum chemotherapy. The feature assessment of the model suggests that tumour biology plays a significant role in resistance to platinum-based chemotherapy in ovarian cancer, as gene expression was more important for prediction than clinical information. In addition to traditional pathological factors (such as nodal stage, presence of metastasis and tumour grade), determining the presence of molecular markers (CYTH3 and S100A14) may allow us to more adequately assess a patient’s risk of developing chemoresistance in ovarian cancer. The clinical utility of such prediction is additional as demonstrated by the four-gene signature also being prognostic for two-year overall survival in an additional three independent datasets. Tumours predicted as being chemoresistant should be monitored closely for response and early recurrence and considered for second-line chemotherapy once resistance is confirmed.

The exact mechanism of action of how the four markers studied contribute to chemotherapy resistance remains to be elucidated. The S100A14 protein has a role in regulating p53 protein levels, thereby playing a role in the regulation of cell survival and apoptosis. It also plays a role in the regulation of cell migration by modulating the levels of matrix metalloproteinase-2 (MMP2), a matrix protease under transcriptional control of p53. As such, intracellular S100A14 may promote cell motility and invasiveness by regulating the expression and function of MMP-2. S100A14 is also associated with tumorigenesis in colorectal cancer¹⁴ and breast cancer¹⁵. S100A14 is known to be overexpressed and associated with poor prognosis¹⁶ and higher recurrence rates in optimally debulked ovarian cancers¹⁷. Higher levels of S100A14 expression have also been correlated with resistance to platinum-based chemotherapy¹⁸. CYTH3 is involved in the control of Golgi structure and function, and its upregulation in hepatocellular carcinoma is associated with poor survival¹⁹. ERI1 is an RNA exonuclease involved in the binding to histone mRNA and in the degradation of short interfering RNAs. The roles of CYTH3 and ERI1 in ovarian cancer tumorigenesis have yet to be extensively explored in the literature.

GALNT3 protein is known to be overexpressed in high-grade serous epithelial ovarian cancer tumours and has a role in modulating post-translational modifications and metabolism pathways in ovarian cancer cells²⁰. Its overexpression is correlated with poorer prognosis²¹ in advanced-stage ovarian cancer patients. GALNT3 also plays a role in membrane-associated mucin-1 (MUC1) protein stabilization. A potential mechanism for ovarian tumorigenesis involves the GALNT3-MUC1 pathway promoting cell proliferation and invasion in ovarian tumours.

Platinum resistance is a strong driving factor behind the high relapse rates of ovarian cancer, and the platinum-free interval is an important predictor of patient outcomes. At present, platinum resistance is often only determined after evidence of disease recurrence, which is seen in approximately 80% of patients with advanced stage ovarian cancer²². The management of relapsed disease is heavily guided by whether patients are platinum-sensitive or platinum-resistant. Patients with platinum-sensitive disease are considered for both secondary cytoreductive surgery and platinum-based chemotherapy⁹. Patients with platinum-resistant disease are given either single-agent second-line chemotherapy or a combination with bevacizumab²³. The IHC panel and subsequent usage of the combined model will allow clinicians to better predict patient response to platinum-based chemotherapy and allow the improved individualization of chemotherapy regimens. For patients at high risk of resistance, early assessment and more intensive surveillance after surgery can help in the early detection of tumour recurrence. In terms of the choice of adjuvant therapy, patients with a higher risk of recurrence can be more readily considered for second-line chemotherapy agents beyond the standard platinum and taxane combination.

Several in vitro chemosensitivity assays, including the Chemo-FX assay and the extreme drug resistance (EDR) assay^24,25, have been developed with a similar aim of predicting chemotherapy sensitivity. By determining the in vitro response of tumour cells, these assays aim to offer an individualised choice of chemotherapeutic agents compared to standard chemotherapy²⁶. Chemo-FX is an in vitro cell culture-based drug response marker that assesses the sensitivity of ovarian tumour cells to various chemotherapeutic agents²⁷. After submission of surgical specimens to a designated centre, cell cultures are formed and incubated with a panel of therapeutic drugs. While a multi-centre study has reported improved survival outcomes with the Chemo-FX assay²⁷, it has not been adopted into routine clinical practice^26,28. Reasons include a lack of prospective evaluation compared to current clinical practice and a tendency to recommend treatments that would otherwise be administered empirically²⁸. In the EDR assay, fresh tumour specimens are similarly needed to identify drug resistance in tumour specimens²⁵. However, its clinical relevance is limited²⁹, and it is not associated with improved survival outcomes²⁹.

The main strength of our study is the applicability of routine IHC stains, which can be easily adopted to individualize patient therapy compared to other chemosensitivity assays that utilize a highly specialized commercial service. In the current staging evaluation of patients with ovarian cancer, tumour samples are obtained for routine histological examination, and existing pathology services can easily adopt the IHC panel. The IHC scoring is straightforward and can be performed in the same setting when pathologists review the slides. In vitro assays such as the Chemo-FX assay require sample collection and transport to a specialist laboratory, with costs amounting to thousands of dollars per patient³⁰. Therefore, the routine in vitro testing of drug sensitivity is both more difficult and costly to implement.

Furthermore, as an IHC stain, this panel can potentially be validated in a larger cohort retrospectively in future research. Currently, numerous pathology laboratories contain a tissue repository in which surgical specimens are preserved in formalin-fixed paraffin-embedded tissue blocks³¹ that allow for long-term storage and easy retrieval for research³². Another strength of the study is the direct assessment of chemotherapy response in our validation cohort based on the RECIST guidelines, instead of using surrogate markers of chemotherapy response such as survival or recurrence.

There are certain limitations to our study, the first being a small sample size on which to train and assess the prediction model (n = 40). A larger sample size in future studies would allow us to refine the training algorithm and improve its ability to accurately predict tumour response. Data from existing studies assessing chemotherapy assays can be used to inform sample size calculations with HR ranging from 1·5 to 2·9 depending on outcomes assessed²⁴. A second limitation is with regard to tumour heterogeneity. Each donor sample was obtained from one location in the tumour. Recent studies indicate that a high degree of intratumour heterogeneity³³ exists in advanced-stage ovarian cancer, with varying protein expression levels in various parts of the tumour. To accurately quantify protein expression via IHC, it is important to obtain biopsy samples from at least 3 different intratumour locations. The third limitation is the generalizability of the results, which is limited by the retrospective nature of the study. The donor samples were procured from various accredited hospitals. More clinical characteristics about the patient cohort (e.g., whether R0 resection was achieved) should be obtained.

In conclusion, we highlight how in addition to existing clinicopathological factors, molecular features may play a key role in determining the likelihood of platinum-based chemoresistance. Combined clinical and molecular models have the potential to identify patients at high risk of developing platinum resistance and can be straightforward to implement as part of the routine histopathological assessment of tumour specimens. Identifying such patients would allow appropriate patient management and the selection of adjuvant therapy after initial cytoreductive surgery. For patients identified as being at high risk of developing chemotherapy resistance, future studies can explore the benefit of closer surveillance or the consideration of second-line chemotherapy agents beyond the standard platinum and taxane combination. This would allow us to better individualize the chemotherapy regimen to best target tumour biology and achieve maximum response.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

TCGA data used for gene discovery is available from The TCGA Research Network: https://www.cancer.gov/tcga.

Cancer Genome Atlas Research Network, Integrated genomic analyses of ovarian carcinoma. Nature. . 2011 Jun 29;474(7353):609–15.

GDSC data used for gene discovery is available from The Genomics of Drug Sensitivity in Cancer database (https://www.cancerrxgene.org/).

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. (Nucl. Acids Res.2013 Database issue. PMID:23180760).

Gene expression datasets from the Gene Expression Omnibus (GEO) can be obtained from the website [https://www.ncbi.nlm.nih.gov/geo].

Edgar R, Domrachev M, Lash AE.

Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002 Jan 1;30(1):207–10.

The datasets used and references to the original manuscripts are as follows;

GSE26193.

Kieffer Y, Bonneau C, Popova T, Rouzier R et al. Clinical Interest of Combining Transcriptomic and Genomic Signatures in High-Grade Serous Ovarian Cancer. Front Genet 2020;11:219. PMID: 32256521.

GSE49997

Pils D, Hager G, Tong D, Aust S et al. Validating the impact of a molecular subtype in ovarian cancer on outcomes: a study of the OVCAD Consortium. Cancer Sci 2012 Jul;103(7):1334–41. PMID: 22497737.

GSE9891.

Tothill RW, Tinker AV, George J, Brown R et al. Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome. Clin Cancer Res 2008 Aug 15;14(16):5198–208. PMID: 18698038.

References

Ferlay, J. et al. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. Int. J. Cancer 144, 1941–1953. https://doi.org/10.1002/ijc.31937 (2019).
Article CAS PubMed Google Scholar
Webb, P. M. & Jordan, S. J. Epidemiology of epithelial ovarian cancer. Best Pract. Res. Clin. Obstet. Gynaecol. 41, 3–14. https://doi.org/10.1016/j.bpobgyn.2016.08.006 (2017).
Article PubMed Google Scholar
Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424. https://doi.org/10.3322/caac.21492 (2018).
Article PubMed Google Scholar
Doubeni, C. A., Doubeni, A. R., Myers, A. E. & Doubeni, A. R. Diagnosis and management of ovarian cancer. Am. Fam. Physician 93, 937–944 (2016).
PubMed Google Scholar
Wiseman, M. The second World Cancer Research Fund/American Institute for Cancer Research expert report. Food, nutrition, physical activity, and the prevention of cancer: A global perspective. Proc. Nutr. Soc. 67, 253–256. https://doi.org/10.1017/S002966510800712X (2008).
Article PubMed Google Scholar
Kyrgiou, M., Salanti, G., Pavlidis, N., Paraskevaidis, E. & Ioannidis, J. P. Survival benefits with diverse chemotherapy regimens for ovarian cancer: Meta-analysis of multiple treatments. J. Natl. Cancer Inst. 98, 1655–1663. https://doi.org/10.1093/jnci/djj443 (2006).
Article CAS PubMed Google Scholar
Ledermann, J. A. & Kristeleit, R. S. Optimal treatment for relapsing ovarian cancer. Ann. Oncol. 21(Suppl 7), vii218-222. https://doi.org/10.1093/annonc/mdq377 (2010).
Article PubMed Google Scholar
Ushijima, K. Treatment for recurrent ovarian cancer-at first relapse. J. Oncol. https://doi.org/10.1155/2010/497429 (2010).
Article PubMed Google Scholar
Coleman, R. L. & Sabbatini, P. Medical treatment for relapsed epithelial ovarian, fallopian tubal, or peritoneal cancer: Platinum-sensitive disease. In UpToDate (eds Goff B. & Dizon D. S.) (Waltham, MA, 2021).
Liu, J. & Matulonis, U. A. New advances in ovarian cancer. Oncology (Williston Park) 24, 721–728 (2010).
Google Scholar
Lesnock, J. L. et al. BRCA1 expression and improved survival in ovarian cancer patients treated with intraperitoneal cisplatin and paclitaxel: A Gynecologic Oncology Group Study. Br. J. Cancer 108, 1231–1237. https://doi.org/10.1038/bjc.2013.70 (2013).
Article CAS PubMed PubMed Central Google Scholar
Eisenhauer, E. A. et al. New response evaluation criteria in solid tumours: Revised RECIST guideline (version 11). Eur. J. Cancer 45, 228–247. https://doi.org/10.1016/j.ejca.2008.10.026 (2009).
Article CAS PubMed Google Scholar
He, L. et al. Initial partial response and stable disease according to RECIST indicate similar survival for chemotherapeutical patients with advanced non-small cell lung cancer. BMC Cancer 10, 681. https://doi.org/10.1186/1471-2407-10-681 (2010).
Article PubMed PubMed Central Google Scholar
Wang, H. Y. et al. Expression status of S100A14 and S100A4 correlates with metastatic potential and clinical outcome in colorectal cancer after surgery. Oncol. Rep. 23, 45–52 (2010).
CAS PubMed Google Scholar
Xu, C. S., Chen, H. Y., Lu, C. R. & Liu, Z. H. Expression and regulatory mechanism of S100A14 in breast cancer. Zhonghua Zhong Liu Za Zhi 38, 252–257. https://doi.org/10.3760/cma.j.issn.0253-3766.2016.04.003 (2016).
Article CAS PubMed Google Scholar
Cho, H. et al. The role of S100A14 in epithelial ovarian tumors. Oncotarget 5, 3482–3496. https://doi.org/10.18632/oncotarget.1947 (2014).
Article PubMed PubMed Central Google Scholar
Zhao, H. et al. KCNN4 and S100A14 act as predictors of recurrence in optimally debulked patients with serous ovarian cancer. Oncotarget 7, 43924–43938. https://doi.org/10.18632/oncotarget.9721 (2016).
Article PubMed PubMed Central Google Scholar
Qian, J., Ding, F., Luo, A., Liu, Z. & Cui, Z. Overexpression of S100A14 in human serous ovarian carcinoma. Oncol. Lett. 11, 1113–1119. https://doi.org/10.3892/ol.2015.3984 (2016).
Article CAS PubMed Google Scholar
Fu, Y. et al. Cytohesin-3 is upregulated in hepatocellular carcinoma and contributes to tumor growth and vascular invasion. Int. J. Clin. Exp. Pathol. 7, 2123–2132 (2014).
CAS PubMed PubMed Central Google Scholar
Sheta, R. et al. A metabolic labeling approach for glycoproteomic analysis reveals altered glycoprotein expression upon GALNT3 knockdown in ovarian cancer cells. J. Proteomics 145, 91–102. https://doi.org/10.1016/j.jprot.2016.04.009 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, Z. Q. et al. Role of the polypeptide N-acetylgalactosaminyltransferase 3 in ovarian cancer progression: Possible implications in abnormal mucin O-glycosylation. Oncotarget 5, 544–560. https://doi.org/10.18632/oncotarget.1652 (2014).
Article PubMed PubMed Central Google Scholar
Hanker, L. C. et al. The impact of second to sixth line therapy on survival of relapsed ovarian cancer after primary taxane/platinum-based therapy. Ann. Oncol. 23, 2605–2612. https://doi.org/10.1093/annonc/mds203 (2012).
Article CAS PubMed Google Scholar
Birrer, M. J. & Fujiwara, K. Medical treatment for relapsed epithelial ovarian, fallopian tubal, or peritoneal cancer: Platinum-resistant disease. In UpToDate (eds Goff, B. & Dizon, D. S.) (Waltham, MA, 2016).
Herzog, T. J., Krivak, T. C., Fader, A. N. & Coleman, R. L. Chemosensitivity testing with ChemoFx and overall survival in primary ovarian cancer. Am. J. Obstetrics Gynecol. 203, e61–e68 (2010).
Article Google Scholar
Kern, D. H. & Weisenthal, L. M. Highly specific prediction of antineoplastic drug resistance with an in vitro assay using suprapharmacologic drug exposures. JNCI J. Natl. Cancer Institute 82, 582–588 (1990).
Article CAS Google Scholar
Samson, D. J., Seidenfeld, J., Ziegler, K. & Aronson, N. Chemotherapy sensitivity and resistance assays: A systematic review. J. Clin. Oncol. 22, 3618–3630 (2004).
Article CAS Google Scholar
Richard, S., Wells, A., Connor, J. & Price, F. Use of ChemoFx® for identification of effective treatments in epithelial ovarian cancer. PLoS Curr. 7. https://doi.org/10.1371/currents.eogt.8b0b6fffc7b999b34bc4c8152edbf237 (2015).
Article PubMed PubMed Central Google Scholar
Burstein, H. J. et al. American Society of Clinical Oncology clinical practice guideline update on the use of chemotherapy sensitivity and resistance assays. J. Clin. Oncol. 29, 3328–3330 (2011).
Article Google Scholar
Matsuo, K., Eno, M. L., Im, D. D., Rosenshein, N. B. & Sood, A. K. Clinical relevance of extent of extreme drug resistance in epithelial ovarian carcinoma. Gynecol. Oncol. 116, 61–65. https://doi.org/10.1016/j.ygyno.2009.09.018 (2010).
Article CAS PubMed Google Scholar
Plamadeala, V. et al. A cost-effectiveness analysis of a chemoresponse assay for treatment of patients with recurrent epithelial ovarian cancer. Gynecol. Oncol. 136, 94–98 (2015).
Article Google Scholar
Patel, A. A. et al. Availability and quality of paraffin blocks identified in pathology archives: a multi-institutional study by the Shared Pathology Informatics Network (SPIN). BMC Cancer 7, 37 (2007).
Article Google Scholar
Becich, M. J. The role of the pathologist as tissue refiner and data miner: The impact of functional genomics on the modern pathology laboratory and the critical roles of pathology informatics and bioinformatics. Mol. Diagn. 5, 287–299 (2000).
Article CAS Google Scholar
Kossai, M., Leary, A., Scoazec, J. Y. & Genestie, C. Ovarian cancer: A heterogeneous disease. Pathobiology 85, 41–49. https://doi.org/10.1159/000479006 (2018).
Article PubMed Google Scholar

Download references

Acknowledgements

This initiative is funded by NCCS Cancer Fund and SingHealth Duke-NUS Academic Medical Centre, facilitated by the Joint Office of Academic Medicine (JOAM). It is an initiative of Surgery Academic Clinical Programme (SURG ACP), hosted at National Cancer Centre Singapore. CAJO is supported by the National Research Council Transition Award (NMRC/TA/0061/2017). All authors have read the journal’s authorship agreement and policy on disclosure of potential conflicts of interest.

Author information

Authors and Affiliations

Department of Sarcoma, Peritoneal and Rare Tumours (SPRinT), Division of Surgery and Surgical Oncology, National Cancer Centre Singapore, 11 Hospital Crescent, Singapore, 169610, Singapore
Nicholas Brian Shannon, Laura Ling Ying Tan, Qiu Xuan Tan, Joey Wee-Shan Tan, Josephine Hendrikson, Wai Har Ng, Gillian Ng, Ying Liu, Xing-Yi Sarah Ong, Jolene Si Min Wong, Grace Hwei Ching Tan, Khee Chee Soo, Melissa Ching Ching Teo, Claramae Shulyn Chia & Chin-Ann Johnny Ong
Department of Sarcoma, Peritoneal and Rare Tumours (SPRinT), Division of Surgery and Surgical Oncology, Singapore General Hospital, Outram Road, Singapore, 169608, Singapore
Nicholas Brian Shannon, Laura Ling Ying Tan, Qiu Xuan Tan, Joey Wee-Shan Tan, Josephine Hendrikson, Wai Har Ng, Gillian Ng, Ying Liu, Xing-Yi Sarah Ong, Jolene Si Min Wong, Grace Hwei Ching Tan, Khee Chee Soo, Melissa Ching Ching Teo, Claramae Shulyn Chia & Chin-Ann Johnny Ong
Laboratory of Applied Human Genetics, Division of Medical Sciences, National Cancer Centre Singapore, 11 Hospital Crescent, Singapore, 169610, Singapore
Nicholas Brian Shannon, Laura Ling Ying Tan, Qiu Xuan Tan, Joey Wee-Shan Tan, Josephine Hendrikson, Wai Har Ng, Gillian Ng, Ying Liu, Xing-Yi Sarah Ong & Chin-Ann Johnny Ong
Department of Obstetrics and Gynaecology, Division of Surgery and Surgical Oncology, Singapore General Hospital, Outram Road, Singapore, 169608, Singapore
Ravichandran Nadarajah
SingHealth Duke-NUS Oncology Academic Clinical Program, Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
Khee Chee Soo, Melissa Ching Ching Teo, Claramae Shulyn Chia & Chin-Ann Johnny Ong
Institute of Molecular and Cell Biology, A*STAR Research Entities, 61 Biopolis Drive, Singapore, 138673, Singapore
Chin-Ann Johnny Ong

Authors

Nicholas Brian Shannon
View author publications
You can also search for this author in PubMed Google Scholar
Laura Ling Ying Tan
View author publications
You can also search for this author in PubMed Google Scholar
Qiu Xuan Tan
View author publications
You can also search for this author in PubMed Google Scholar
Joey Wee-Shan Tan
View author publications
You can also search for this author in PubMed Google Scholar
Josephine Hendrikson
View author publications
You can also search for this author in PubMed Google Scholar
Wai Har Ng
View author publications
You can also search for this author in PubMed Google Scholar
Gillian Ng
View author publications
You can also search for this author in PubMed Google Scholar
Ying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xing-Yi Sarah Ong
View author publications
You can also search for this author in PubMed Google Scholar
Ravichandran Nadarajah
View author publications
You can also search for this author in PubMed Google Scholar
Jolene Si Min Wong
View author publications
You can also search for this author in PubMed Google Scholar
Grace Hwei Ching Tan
View author publications
You can also search for this author in PubMed Google Scholar
Khee Chee Soo
View author publications
You can also search for this author in PubMed Google Scholar
Melissa Ching Ching Teo
View author publications
You can also search for this author in PubMed Google Scholar
Claramae Shulyn Chia
View author publications
You can also search for this author in PubMed Google Scholar
Chin-Ann Johnny Ong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.B.S. conceptualized the study. N.B.S. and L.L.Y.T. wrote the manuscript. N.B.S. performed bioinformatics analysis. L.L.Y.T., Q.X.T., J.W.S.T., J.H., W.H.N., G.N., Y.L., and X.Y.S.O. performed the experiments and analysed the results. R.N., J.S.M.W., G.H.C.T., and C.S.C., K.C.S., M.C.C.T., and O.C.A.J. provided critical scientific, clinical, and intellectual inputs. O.C.A.J. reviewed the manuscript critically for publication. All authors reviewed and agreed to submit the manuscript in its current form.

Corresponding author

Correspondence to Chin-Ann Johnny Ong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shannon, N.B., Tan, L.L.Y., Tan, Q.X. et al. A machine learning approach to identify predictive molecular markers for cisplatin chemosensitivity following surgical resection in ovarian cancer. Sci Rep 11, 16829 (2021). https://doi.org/10.1038/s41598-021-96072-6

Download citation

Received: 17 December 2020
Accepted: 31 July 2021
Published: 19 August 2021
DOI: https://doi.org/10.1038/s41598-021-96072-6

This article is cited by

Enhancing precision medicine: a nomogram for predicting platinum resistance in epithelial ovarian cancer
- Ruyue Li
- Zhuo Xiong
- Chunfang Ha
World Journal of Surgical Oncology (2024)
Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms
- Mahendran Botlagunta
- Madhavi Devi Botlagunta
- Mohd Asif Shah
Scientific Reports (2023)
Clinical significance of metabolism-related genes and FAK activity in ovarian high-grade serous carcinoma
- Masakazu Sato
- Sho Sato
- Kosei Hasegawa
BMC Cancer (2022)
A machine learning approach applied to gynecological ultrasound to predict progression-free survival in ovarian cancer patients
- Francesca Arezzo
- Gennaro Cormio
- Vera Loizzi
Archives of Gynecology and Obstetrics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.