The Oncology Biomarker Discovery framework reveals cetuximab and bevacizumab response patterns in metastatic colorectal cancer

Ohnmacht, Alexander J.; Stahler, Arndt; Stintzing, Sebastian; Modest, Dominik P.; Holch, Julian W.; Westphalen, C. Benedikt; Hölzel, Linus; Schübel, Marisa K.; Galhoz, Ana; Farnoud, Ali; Ud-Dean, Minhaz; Vehling-Kaiser, Ursula; Decker, Thomas; Moehler, Markus; Heinig, Matthias; Heinemann, Volker; Menden, Michael P.

doi:10.1038/s41467-023-41011-4

Article
Open access
Published: 04 September 2023

Alexander J. Ohnmacht^1,2^na1,
Arndt Stahler³^na1,
Sebastian Stintzing ORCID: orcid.org/0000-0002-3297-5801^3,4^na1,
Dominik P. Modest ORCID: orcid.org/0000-0002-6853-0599³,
Julian W. Holch^4,5,
C. Benedikt Westphalen ORCID: orcid.org/0000-0002-5310-3754⁵,
Linus Hölzel¹,
Marisa K. Schübel^1,2,
Ana Galhoz^1,2,
Ali Farnoud¹,
Minhaz Ud-Dean¹,
Ursula Vehling-Kaiser⁶,
Thomas Decker⁷,
Markus Moehler⁸,
Matthias Heinig¹,
Volker Heinemann⁵ &
Michael P. Menden ORCID: orcid.org/0000-0003-0267-5792^1,2,9

Nature Communications volume 14, Article number: 5391 (2023)

Abstract

Precision medicine has revolutionised cancer treatments; however, actionable biomarkers remain scarce. To address this, we develop the Oncology Biomarker Discovery (OncoBird) framework for analysing the molecular and biomarker landscape of randomised controlled clinical trials. OncoBird identifies biomarkers based on single genes or mutually exclusive genetic alterations in isolation or in the context of tumour subtypes, and finally, assesses predictive components by their treatment interactions. Here, we utilise the open-label, randomised phase III trial (FIRE-3, AIO KRK-0306) in metastatic colorectal carcinoma patients, who received either cetuximab or bevacizumab in combination with 5-fluorouracil, folinic acid and irinotecan (FOLFIRI). We systematically identify five biomarkers with predictive components, e.g., patients with tumours that carry chr20q amplifications or lack mutually exclusive ERK signalling mutations benefited from cetuximab compared to bevacizumab. In summary, OncoBird characterises the molecular landscape and outlines actionable biomarkers, which generalises to any molecularly characterised randomised controlled trial.

Introduction

Precision medicine aims to tailor therapeutic interventions to specific patient subgroups defined by predictive biomarkers detected in tumours. Accordingly, strategies are required to identify such patient subgroups systematically¹. For performing subgroup analysis and exploratory biomarker discovery, the European Medicines Agency (EMA) has provided specific guidelines². According to these, biological knowledge should underpin subgroup definitions, and subgroup-specific effects in late-stage clinical trials should still be interpreted with caution owing to the exploratory and retrospective nature of the analyses. For this purpose, a large number of computational methods have been proposed and discussed^3,4,5, e.g., tree-based methods using recursive partitioning^6,7,8, virtual twins⁹, outcome weighted methods^10,11, causal forests¹² and metalearners for estimating heterogeneous treatment effects¹³. However, most of these computational methods neglect cancer biology, i.e., exploiting the molecular landscape of a clinical trial and customising models to cancer subtypes and mutational patterns.

Clinical outcomes of patients with metastatic colorectal cancer (mCRC) significantly improved upon the introduction of targeted treatments, including anti-EGFR and anti-VEGF directed monoclonal antibodies such as cetuximab and bevacizumab, respectively¹⁴. Tumours of colorectal cancer patients were shown to exhibit, for instance, either KRAS or NRAS mutations (referred to as RAS mutations) at a rate of about 50%, which tend to occur in a mutually exclusive manner^15,16. These RAS mutations are clinically approved predictive biomarkers of resistance against anti-EGFR directed monoclonal antibodies such as cetuximab¹⁷. Bevacizumab has been reported to improve progression-free survival in first-line mCRC trials¹⁸; however, no comparable biomarker has been identified yet.

In this study, we focused on the open-label randomised phase III clinical trial FIRE-3. Here, patients with KRAS exon 2 wild-type mCRC were randomised to receive either cetuximab or bevacizumab in combination with 5-fluorouracil, leucovorin and irinotecan (FOLFIRI) as a first-line regimen. Several retrospective subgroup analyses revealed potential prognostic and predictive biomarkers based on tumour DNA and clinical characteristics, such as the relevance of the molecular status, i.e., alterations other than KRAS exon 2, such as KRAS exon 3-4, NRAS exon 2-4 and BRAF V600E, or primary tumour sidedness^19,20,21,22,23. For example, targeting EGFR in RAS wild-type mCRC tumours located in the left hemicolon (left-sided) was shown to be beneficial, whilst RAS wild-type tumours located in the right colon (right-sided) were less likely to respond²⁴. Additionally, in the more recent FIRE-4.5 study, it was demonstrated that patients with BRAF V600E mutant tumours may benefit from treatment with a 5-fluorouracil, oxaliplatin, leucovorin and irinotecan (FOLFOXIRI) backbone plus bevacizumab²⁵, whereas these patients lacked benefit from cetuximab^26,27. This hints towards tumour subtype-specific interactions and alternative mechanisms to acquire EGFR inhibitor resistance²⁸.

Previously proposed tumour subtypes in colorectal adenocarcinoma are based on the gene expression-derived consensus molecular subtypes (CMS) and could identify subtypes that reflected distinct tumour biology²⁹. Recently, the prognostic value of CMS has been confirmed in the FIRE-3, CALGB/SWOG 80405 and AGITG MAX clinical trials for FOLFIRI combined with either cetuximab or bevacizumab^21,30,31. In particular, CMS4 patients with RAS wild-type have shown a significantly longer overall survival when treated with cetuximab compared to bevacizumab in metastatic disease²¹. However, the clinical translation of the CMS classification of colorectal cancer is still in its infancy and is further investigated in multiple clinical trials³². These sparse results have illustrated that modelling interactions between somatic alterations and tumour subtypes can yield insights into complex biomarkers and highlight the urgent need for computational frameworks to systematically decipher the molecular landscape, tumour subtypes and biomarkers. Thus, we hypothesised that predictive response biomarkers may be revealed by systematically deconvoluting cancer genetic events and tumour subtypes within a clinical trial.

Here, we present the Oncology Biomarker Discovery (OncoBird) framework, which empowers the systematic identification of actionable biomarkers for clinical trials in oncology. OncoBird is publicly available as a software package at https://github.com/MendenLab/OncoBird and a demo run is available at https://codeocean.com/capsule/9911222/tree/v1. Furthermore, users can run a graphical user interface within a docker container (Supplementary Fig. 1).

The OncoBird workflow is divided into five distinct steps: it systematically (1) investigates the molecular landscape of a clinical trial, i.e., copy number alterations, somatic mutations, mutually exclusive patterns and predefined tumour subtypes; (2) identifies biomarkers within a treatment arm based on genetic alterations, and (3) in relation to the predefined tumour subtypes; consecutively, (4) evaluates their predictive component across treatment arms; and finally, (5) it comprehensively corrects for multiple hypothesis testing and adjusts treatment effects of biomarkers based on resampling methods. To enhance the biological signal, this analysis integrates the molecular and biomarker landscape of cancer clinical trials by customising models to established cancer subtypes and mutational patterns. In essence, OncoBird yields subtype-specific biomarkers with treatment benefits in an interpretable and transparent manner and therefore operates complementary to existing methods. The utility of OncoBird is exemplified by the application to the FIRE-3 clinical trial, generalises to the ADJUVANT clinical trial^33,34,35, and in fact, would generalise to any molecularly characterised randomised controlled trial (RCT) in oncology.

Results

OncoBird is applicable to RCTs accompanied by molecular characteristics, including genetic sequencing panels which yield copy number alterations and somatic driver mutations (Fig. 1a, b). In addition, a second layer of stratification can be supplied in the form of predefined tumour subtypes (Fig. 1a). Then, OncoBird systematically assesses the genetic landscape in the context of tumour subtypes (Fig. 1c) and outlines the biomarker landscape across multiple clinical responses (Fig. 1d), i.e., time-to-event data (overall or progression-free survival; “Methods”), and binary variables capturing treatment success (objective response rate; “Methods”).

**Fig. 1: The Oncology Biomarker Discovery (OncoBird) workflow.**

Here, we leveraged the FIRE-3 RCT, including 752 mCRC patients who were treated with FOLFIRI and either cetuximab or bevacizumab. We defined tumour subtypes based on CMS²¹, and tumour sidedness, i.e., left- or right-sided mCRC. In addition, 373 tumours were genetically characterised, i.e., profiled for the mutational status of 277 frequently altered cancer genes. To reveal the biomarker landscape, we employed the following stratification and modelling strategies (Supplementary Data 1; “Methods”): We first investigated each alteration for stratifying patients by their prognosis within each treatment arm (Fig. 1e). Subsequently, we inspected alterations in tumour subtypes (Fig. 1f), revealing subtype-specific biomarkers. Finally, we tested for treatment interactions to reveal biomarkers with predictive effects (Fig. 1g). Importantly, subtypes and genetic alterations ought to be independent of the treatment assignment. The molecular landscape and individual treatment arm analysis could be applied to any trial design without limitations.

As exemplified by a well-established biomarker of cetuximab response¹⁷, RAS wild-type mCRC patients showed longer overall survival (Fig. 1h; p = 0.0002, HR = 0.53 [0.38–0.73]). Consistent with a previous study³⁶ and more recently defined treatment guidelines for mCRC³⁷, the cetuximab overall survival (OS) benefit for patients with RAS wild-type tumours was conserved in left-sided tumours (Fig. 1i; p = 7.6 × 10⁻⁵, HR = 0.44 [0.29–0.66]). Furthermore, we observed interactions between RAS mutations and the treatment arm in left-sided tumours (p_int = 0.07): Cetuximab remained superior to bevacizumab in RAS wild-type and left-sided tumours (Fig. 1j; p = 0.05, HR = 0.73 [0.52–1.00]) in terms of OS, whilst bevacizumab and cetuximab achieved comparable OS for patients with RAS mutant and left-sided tumours (Supplementary Fig. 2; p = 0.32, HR = 1.22 [0.85–1.75]).

Whilst we particularly focused on the FIRE-3 trial in colorectal cancer, we also demonstrate the generalisability of OncoBird by applying it with the same default biomarker thresholds to the ADJUVANT clinical trial (“Methods”), which explored gefitinib in non-small cell lung cancer (NSCLC)^33,34,35. The ADJUVANT study reported predictive components of five alterations, i.e., TP53 mutations, RB1 alterations and copy number amplifications of NKX2-1, CDK4 and MYC³⁵. Four out of five biomarkers were concordantly identified for disease-free survival with OncoBird (FDR_int < 0.2; Supplementary Data 2; Supplementary Fig. 3–6). In addition, OncoBird suggests that the mutual exclusivity patterns play a role in the biomarker landscape of NSCLC (Supplementary Fig. 3c, d). In more detail, we observed gefitinib benefits in tumours that were characterised by mutations in either TP53, SMAD4 or CDK4 amplifications (p = 0.0002, HR = 0.37 [0.21–0.63]; Supplementary Data 2; Supplementary Figs. 5c and 6a), for which the resampling-based adjustment of the conditional average treatment effect yielded p_adj = 0.001 with HR = 0.32 [0.14–0.86] (Supplementary Data 2; “Methods”). These findings highlight the accessibility, reproducibility and interoperability of OncoBird.

The molecular landscape of the FIRE-3 clinical trial

Leveraging OncoBird, we assessed the genetic landscape of patient tumours in the FIRE-3 clinical trial. In total, 373 tumours were genetically characterised, including 31 frequently altered cancer genes observed in at least 12 patients (Fig. 2a). We observed amplifications in chromosome arm 20q (chr20q) in 74/373 tumours (19.8%), which includes SRC, TOP1, BCL2L1, ZNF217, AURKA, GNAS and ARFRP1 (Fig. 2a). Indeed, chr20q amplifications have been reported to define a distinct subtype of left-sided colon cancers³⁸. In addition, we identified 39 mutually exclusive somatic alterations (gene modules) using the Mutex algorithm (Fig. 2b; “Methods”)³⁹, thus grouping low-frequency but functionally similar somatic events within a signalling pathway. We could confirm that chr20q amplifications were mutually exclusive to somatic mutations in the ERK signalling pathway (KRAS, NRAS or BRAF; p = 0.0002, Fisher’s exact test).
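The mutual exclusivity check described above can be illustrated with a small stand-alone calculation. The sketch below is not OncoBird or Mutex code: it is a one-sided Fisher's exact test written directly from the hypergeometric distribution, and the 2×2 counts are hypothetical, chosen only so that they sum to the 373 characterised tumours with 74 chr20q-amplified cases. Whether the reported p-value is one- or two-sided is not stated in this section, so the use of the lower tail is an assumption.

```python
from math import comb


def fisher_exact_less(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Returns P(X <= a) under the hypergeometric null, i.e. the probability of
    seeing as few or fewer co-occurrences than observed (mutual exclusivity).
    """
    n = a + b + c + d          # all tumours
    row1 = a + b               # tumours with alteration A
    col1 = a + c               # tumours with alteration B
    lowest = max(0, row1 + col1 - n)  # smallest possible overlap
    numerator = sum(
        comb(row1, k) * comb(n - row1, col1 - k)
        for k in range(lowest, a + 1)
    )
    return numerator / comb(n, col1)


# Hypothetical counts (NOT taken from the trial data):
# rows = chr20q amplified / not amplified, columns = ERK-pathway mutant / wild-type
p_value = fisher_exact_less(a=5, b=69, c=150, d=149)
print(f"one-sided p = {p_value:.3g}")
```

A small p-value here indicates fewer co-occurring tumours than expected if the two alterations were independent.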

**Fig. 2: Molecular landscape of the FIRE-3 clinical trial.**

In addition, we analysed 451 gene expression profiles and showed consistency with their derived CMS subtypes (Fig. 2c), whilst the primary tumour side displayed a heterogeneous gene expression pattern (Fig. 2d). Right-sided tumours were particularly enriched in CMS1 tumours (p = 0.009, hypergeometric test; Supplementary Fig. 7) and depleted in CMS2 tumours (p = 0.007, hypergeometric test; Supplementary Fig. 7).

The concordance between right-sided tumours and CMS1 (Fig. 2e) was reflected by genetic alterations that were enriched in both tumour subtypes. Microsatellite instabilities (MSI) and somatic mutations in BRAF and RNF43 were enriched in both CMS1 and right-sided tumours (FDR_mol < 0.05, hypergeometric test). Additionally, mutations in PIK3CA, FAM123B and KRAS were only associated with right-sided tumours (Fig. 2e; FDR_mol < 0.05, hypergeometric test). In contrast, the similarity of left-sided tumours and CMS2 (Fig. 2f) was characterised by mutations in APC, TP53 and chr20q amplifications (SRC, TOP1, BCL2L1, ZNF217), which were all significantly enriched in both left-sided and CMS2 tumours (Fig. 2g, h; FDR_mol < 0.05, hypergeometric test). Somatic mutations in PTEN, ARID1A, ATM, LRP1B, BRCA2 and NF1 did not show a preference for a particular primary tumour side, but were enriched in CMS1 tumours (Fig. 2h), and were associated with an increased tumour mutational burden (p = 0.008, p = 0.002, p = 0.017, p = 0.0001, p = 0.010 and p = 0.051, respectively, Fisher’s exact test).
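The enrichment results above are reported at a false discovery rate threshold (FDR_mol < 0.05). This section does not name the correction procedure, so the following is a minimal sketch assuming the Benjamini-Hochberg method; the function name and the input p-values are hypothetical and serve only to show how raw p-values map to adjusted values.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values from four enrichment tests
raw = [0.01, 0.04, 0.03, 0.005]
fdr = benjamini_hochberg(raw)
significant = [p for p in fdr if p < 0.05]
```

Each adjusted value is the smallest FDR level at which the corresponding test would still be called significant.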

In summary, leveraging OncoBird and investigating patterns of genetic events in tumour subtypes revealed meaningful tumour biology. For example, mutations of either BRAF or KRAS promote ERK signalling and therefore occur in a mutually exclusive manner. BRAF mutations were predominantly found in CMS1, but nevertheless, 27 out of 53 BRAF mutant tumours were distributed among CMS2-4. Therefore, it is of utmost importance to gain an enhanced understanding of the molecular landscape of mCRC prior to the interpretation of biomarkers, which is further empowered by OncoBird.

Genetic biomarkers of cetuximab

First, independent of tumour subtypes, we assessed single genes and mutually exclusive gene modules (Fig. 2a, b) as biomarkers for cetuximab. For this, we leveraged Cox proportional hazards regression and logistic regression models (“Methods”), considering overall survival (OS; Fig. 3a–h), progression-free survival (PFS; Supplementary Fig. 8) and the objective response rate (ORR; Supplementary Fig. 9). We quantified effect sizes by hazard ratios (HR) for survival data and odds ratios (OR) for binary data including 95% confidence intervals (Supplementary Data 3).
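For orientation, the effect sizes named above can be written out in their textbook form. The expressions below are a sketch for a single binary alteration indicator x (1 = altered, 0 = wild-type); any additional covariates used in “Methods” are omitted, and the Wald-type interval is an assumption, since this section does not state how the confidence intervals were derived.

```latex
% Cox proportional hazards model (time-to-event outcomes: OS, PFS)
h(t \mid x) = h_0(t)\,\exp(\beta x), \qquad
\mathrm{HR} = e^{\hat{\beta}}, \qquad
95\%\ \mathrm{CI} = \exp\!\bigl(\hat{\beta} \pm 1.96\,\mathrm{SE}(\hat{\beta})\bigr)

% Logistic regression (binary outcome: objective response, y = 1)
\operatorname{logit} P(y = 1 \mid x) = \alpha + \gamma x, \qquad
\mathrm{OR} = e^{\hat{\gamma}}
```

Under this coding, HR > 1 means a higher hazard (shorter survival) for altered tumours, and OR > 1 means higher odds of response for altered tumours.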

**Fig. 3: Identification of genetic biomarkers for FOLFIRI plus cetuximab or bevacizumab.**

The clinically established resistance biomarkers of cetuximab were recovered, i.e., mutations in RAS (either KRAS or NRAS) were associated with poorer OS in the cetuximab treatment arm (Fig. 1h; p = 0.0002, HR = 1.90 [1.36–2.65], FDR_cet < 0.1). In addition, we confirmed that BRAF mutations are mutually exclusive to RAS mutations (Fig. 2b; p = 0.0008, Fisher’s exact test), and both contributed to a poor OS when treated with cetuximab (Fig. 3b; p = 5.7 × 10⁻⁷, HR = 2.29 [1.65–3.16], FDR_cet < 0.1), which has been consistently observed in an independent cohort⁴⁰.
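The RAS hazard ratio here (mutant versus wild-type) and the one quoted earlier in the Results for RAS wild-type patients (HR = 0.53 [0.38–0.73], same figure panel and p-value) appear to describe the same comparison with the reference group swapped. A short sketch of that relationship follows; the helper name is hypothetical, and the agreement holds only up to the rounding of the published values.

```python
def invert_hazard_ratio(hr, ci_low, ci_high):
    """Swap the reference group of a hazard ratio.

    The point estimate is inverted, and the interval bounds are inverted
    and exchanged (the old upper bound becomes the new lower bound).
    """
    return 1.0 / hr, 1.0 / ci_high, 1.0 / ci_low


# Wild-type vs mutant as quoted earlier: HR = 0.53 [0.38-0.73]
hr, low, high = invert_hazard_ratio(0.53, 0.38, 0.73)
print(f"mutant vs wild-type: HR = {hr:.2f} [{low:.2f}-{high:.2f}]")
```

The result is close to the reported HR = 1.90 [1.36–2.65]; the small differences are what one would expect from inverting values that were already rounded to two decimals.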

Most resistance biomarker modules grouped mutations in KRAS and BRAF (FDR_cet < 0.1). In addition, we found a gene module including mutations in SOX9 and MYC amplifications, for which mutant tumours displayed a worse prognosis based on OS (Fig. 3d, e; p = 0.02, HR = 1.50 [1.07–2.37], FDR_cet < 0.1). By inspecting their oncoprint (Fig. 3f), 27/59 tumours harboured mutations in either SOX9 or MYC and were wild-type in either BRAF, KRAS or NRAS, hinting towards an alternative cetuximab resistance mechanism.

In addition, we found TOP1 amplifications to be a strong predictor of a prolonged OS for treatment with cetuximab (Fig. 3c; p = 0.005, HR = 0.50 [0.30–0.81], FDR_cet < 0.1). In fact, we could identify multiple co-amplifications that showed prognostic value for the cetuximab treatment arm, which are located on chromosome 20q. Among the most predictive amplifications for a longer OS were SRC, TOP1, AURKA and ARFRP1 (Fig. 3g–i; Supplementary Data 3). Consistent trends were observed with SRC amplifications in PFS (p = 0.10, HR = 0.69 [0.44–1.07], median PFS wild-type tumours 9.6 months vs mutants 11.1 months) and ORR (p = 0.18, OR = 0.45 [0.14–1.45], ratio ORR wild-type 0.66 vs mutant tumours 0.83).

Genetic biomarkers of bevacizumab

Analogously to the cetuximab biomarker analysis, for the bevacizumab treatment arm, we also built Cox proportional hazards regression models (“Methods”) applied to OS (Fig. 3j–l; Supplementary Fig. 10) and PFS (Supplementary Fig. 11), and logistic regression models for ORR (Supplementary Fig. 12). For exploring bevacizumab biomarker trends, we employed a lenient threshold of FDR_bev < 0.3, which deviates from the default setting (“Methods”). The mutually exclusive module of KRAS and BRAF mutations showed lower OS (Fig. 3j, k; p = 0.01, HR = 1.50 [1.10–2.04], FDR_bev < 0.3), which is consistent with literature reports^41,42. A better predictor for poor OS was the APC wild-type status for tumours treated with FOLFIRI plus bevacizumab (Fig. 3j, l; p = 0.01, HR = 1.69 [1.14–2.50], FDR_bev < 0.3).

Subtype-specific biomarkers of cetuximab and bevacizumab

The previous analyses focused on genetic biomarkers in isolation, whilst here, we investigated them within the context of tumour subtypes (“Methods”). In FIRE-3, tumour subtypes are defined as either left- or right-sided tumours, or alternatively, classified according to the consensus molecular subtypes, i.e., CMS1-4 (“Methods”)²⁹. Here, we tested stratifications based on each single gene or gene module within tumour subtypes for OS (Fig. 4a, b), PFS (Supplementary Fig. 13) and ORR (Supplementary Fig. 14).

**Fig. 4: Identification of subtype-specific genetic biomarkers for FOLFIRI plus cetuximab or bevacizumab.**

In total, we found 38 subtype-specific biomarkers of cetuximab for OS (FDR_cet < 0.1; “Methods”). In particular, we recovered favourable OS of CMS2 patients treated with cetuximab (Fig. 4a), if their tumours additionally carried chr20q amplifications, i.e., ARFRP1 (Fig. 4c; p = 0.01, HR = 0.32 [0.13–0.77], FDR_cet < 0.1), TOP1 (Supplementary Fig. 15a; p = 0.01, HR = 0.34 [0.15–0.74], FDR_cet < 0.1) and SRC (Supplementary Fig. 15b; p = 0.01, HR = 0.37 [0.17–0.78], FDR_cet < 0.1). Additionally, CMS4 KRAS mutant tumours treated with cetuximab showed worse OS (Fig. 4d; p = 0.002, HR = 2.60 [1.44–4.70], FDR_cet < 0.1) and PFS (Supplementary Fig. 13a, c).

For reporting bevacizumab biomarker trends, we employed a lenient false discovery rate (FDR_bev < 0.3), which deviates from the conservative OncoBird default setting (“Methods”). Tumours with KRAS mutations classified as CMS2 tended to show worse OS when treated with bevacizumab (Fig. 4e; p = 0.004, HR = 2.33 [1.31–4.15], FDR_bev < 0.3). In contrast, KRAS mutated tumours classified as CMS1 tended to show a longer OS compared to wild-type tumours when treated with bevacizumab (Fig. 4f; p = 0.03, HR = 0.33 [0.12–0.93], FDR_bev < 0.3).

Predictive components of biomarkers

For assessing the predictive component of response biomarkers, here, we compared the cetuximab and bevacizumab treatment arms against each other by focusing on interactions between genetic alterations in the context of tumour subtypes (“Methods”). Subsequently, we compared the prognosis of both inhibitors for each subgroup according to the interaction biomarkers, thus assessing potential treatment benefits. In addition, we corrected the conditional average treatment effects in the identified subgroups using resampling methods to obtain multiplicity-adjusted p-values and bias-corrected confidence intervals (“Methods”). The results were summarised for OS (Fig. 5a, b) and PFS (Supplementary Fig. 16), whereas no significant interactions were detected for ORR. In total, we found five putative interactions (Supplementary Data 4; FDR_int < 0.2; “Methods”). For reporting other biomarker trends, we also included summary statistics of 57 subgroups with a lenient threshold of FDR_int < 0.6, which deviates from the default setting (Supplementary Data 3).
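A common way to formalise such a treatment interaction, given here as a sketch rather than as the exact model specified in “Methods”, is a Cox model with a product term between the treatment indicator T (1 = cetuximab, 0 = bevacizumab) and the biomarker indicator M (1 = altered, 0 = wild-type), fitted within a given tumour subtype. The symbols below are introduced for illustration only.

```latex
h(t \mid T, M) = h_0(t)\,\exp\!\bigl(\beta_T T + \beta_M M + \beta_{\mathrm{int}}\, T M\bigr)

% Treatment effect (cetuximab vs bevacizumab) in each biomarker subgroup
\mathrm{HR}_{M=0} = e^{\beta_T}, \qquad
\mathrm{HR}_{M=1} = e^{\beta_T + \beta_{\mathrm{int}}}

% A predictive component corresponds to rejecting
H_0 : \beta_{\mathrm{int}} = 0
```

In this formulation, the biomarker is purely prognostic when only the coefficient of M differs from zero, and it carries a predictive component when the treatment hazard ratio differs between the M = 0 and M = 1 subgroups.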

**Fig. 5: Predictive biomarkers in the context of tumour subtypes.**

For example, we found predictive value of chr20q amplifications in CMS2 tumours treated with FOLFIRI plus cetuximab (Fig. 5a, b), which is evident by the significant interactions of TOP1 (p_int = 0.07, FDR_int < 0.2) and ARFRP1 (p_int = 0.01, FDR_int < 0.2). ARFRP1 amplifications showed the largest predictive component among the chr20q amplifications. Accordingly, we observed longer OS in the cetuximab treatment arm compared to bevacizumab in CMS2 (Fig. 5a, c; ARFRP1: p = 0.003, HR = 0.21 [0.07–0.59], FDR_int < 0.2; Supplementary Data 3). The resampling-based adjusted treatment effect confirmed this observation and yielded a hazard ratio in this subgroup of HR = 0.21 [0.09–0.54] with p_adj = 0.04 (Fig. 5a, c). Previous reports have indicated a prognostic value of chr20q amplifications in colorectal cancer patients^38,43, whilst OncoBird yielded additional evidence that they harbour a predictive component.

Another interaction example was tumours with KRAS mutations that showed CMS-specific responses. In CMS4, patients with KRAS wild-type tumours responded better to cetuximab compared to patients treated with bevacizumab (Fig. 5b, d; KRAS wild types: p = 0.02, HR = 0.57 [0.35–0.93]; p_int = 0.02, FDR_int < 0.2), for which the resampling-based adjusted treatment effect yielded HR = 0.70 [0.25–2.35] with p_adj = 0.14 (Fig. 5b, d). Our results suggest a predictive role of KRAS mutations in CMS4 for cetuximab, which we also identified for PFS (Supplementary Fig. 16c, d). Notably, modules containing alterations in NRAS, BRAF and SRC showed similar statistics since only four, eight and twelve mutant tumours were present in CMS4. Non-significant but numerically longer OS was observed for patients with KRAS mutated tumours classified as CMS4 treated with bevacizumab (Fig. 5e, KRAS mutants: p = 0.24, HR = 0.66 [0.33–1.31]), with a median OS of 28.3 months compared to 18.4 months when treated with cetuximab.

In order to assess the ability of OncoBird to discover the same biomarkers for different datasets, we applied 5-fold cross-validation repeated five times and extracted the ten most significant biomarkers for OS across each of the 25 models (Fig. 6a). Consistent with our previous findings, gene modules containing KRAS mutations for CMS4 were found in 21/25 training sets and chr20q amplifications in CMS2 were reproduced in 22/25 training sets (Fig. 6a).
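The stability analysis above amounts to building 25 training sets and counting how often a biomarker is rediscovered. The sketch below shows only that bookkeeping; it is an illustration with hypothetical function names, the splitting is unstratified (an assumption), and the biomarker discovery step itself is left out.

```python
import random
from collections import Counter


def repeated_kfold_training_sets(n_samples, k=5, repeats=5, seed=0):
    """Return k * repeats lists of training indices (k-fold CV, repeated)."""
    rng = random.Random(seed)
    training_sets = []
    for _ in range(repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        folds = [indices[i::k] for i in range(k)]  # k near-equal folds
        for held_out in range(k):
            train = [j for f, fold in enumerate(folds) if f != held_out for j in fold]
            training_sets.append(sorted(train))
    return training_sets


def selection_frequency(selected_per_model):
    """Count in how many models each biomarker was selected."""
    counts = Counter()
    for selected in selected_per_model:
        counts.update(set(selected))  # count each biomarker once per model
    return counts


training_sets = repeated_kfold_training_sets(373)  # 25 training sets
# Toy example of per-model selections (hypothetical labels)
freq = selection_frequency([["KRAS:CMS4", "chr20q:CMS2"], ["KRAS:CMS4"], ["chr20q:CMS2"]])
```

In a real run, each training set would be passed to the discovery pipeline, the top-ranked biomarkers recorded, and `selection_frequency` applied to the 25 resulting lists.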

**Fig. 6: Stability analysis and benchmark with other methods.**

Benchmarking of methods for subgroup analysis

For benchmarking OncoBird, we compared it to alternative methods that can be used to investigate predictive biomarkers based on overall survival. Together with OncoBird, eight algorithms and implementations were used in order to identify subgroups with differential treatment effects, i.e., virtual twins (VT)⁹, model-based partitioning (MOB)⁸, an outcome-weighting method (OWE)¹¹, causal random forests (CRF)¹², policy learning (POL)⁴⁴, GUIDE⁴⁵ and PRISM⁴⁶ (Supplementary Table 1; “Methods”; Fig. 6b).

For the evaluation, we first derived hazard ratios for cetuximab benefit based on OS in the subgroups according to the predicted biomarkers for all methods across five times 5-fold cross-validation (“Methods”). We also focused on the current treatment guidelines for mCRC, according to which patients should receive cetuximab if their tumours are RAS wild-type and left-sided (std; Fig. 6b)³⁷. While the treatment benefit was not significant for the std-positive subgroup (Fig. 6b, median HR = 0.78, p_cv = 0.129), the methods that found the highest significant benefits were OncoBird (median HR = 0.74, p_cv = 0.046), POL (median HR = 0.81, p_cv = 0.048), MOB (median HR = 0.83, p_cv = 0.048) and OWE (median HR = 0.84, p_cv = 0.049) ordered by the magnitude of the hazard ratio (Fig. 6b).

Next, we leveraged the whole dataset to identify cetuximab sensitivity biomarkers with each method and compared them to the treatment guidelines. On average, 73% of methods identified cetuximab benefit for a patient in the std-positive subgroup, whereas only 33% of methods detected further benefits in the std-negative subgroup (Fig. 6c). 7/8 (88%) methods found mutually exclusive alterations in KRAS, NRAS or BRAF as a predictive biomarker, from which one, two and four methods proposed this marker in conjunction with tumour sidedness, CMS and across all patients, respectively (Supplementary Table 1). Only 2/8 (25%) methods highlighted TOP1 amplifications as a potential biomarker (Supplementary Table 1). This shows that current subgroup analysis methods mostly recover standard clinical practice, whilst only sparsely identifying complementary predictive subgroups, and underlines the unmet need for cancer biology-driven frameworks such as OncoBird.

Ideally, subgroup analysis should reveal subgroups with high treatment effects for refining treatment strategies and recover subgroups in the standard treatment strategy. Therefore, we evaluated the newly proposed subgroup for which standard treatment is not recommended (new-std-negative) for each method. We derived the hazard ratios for cetuximab benefit based on OS for all methods in the new-std-negative subgroups (Fig. 6d). Lower hazard ratios in new-std-negative patients indicate the discovery of off-label subgroups for which cetuximab is currently not recommended (Fig. 6d). Accordingly, OncoBird showed the numerically lowest hazard ratio HR = 0.57 (p = 0.16, N = 29) for the new-std-negative subgroup compared to all other methods (Supplementary Table 1; Fig. 6d).

In summary, many of the computational methods reproduced the clinically established biomarkers, whilst OncoBird empowers advanced biomarker identification by thoroughly integrating biological priors in the form of tumour subtypes. The simplicity of statistical models leveraged in OncoBird further increases interpretability and transparency.

Discussion

We demonstrated that OncoBird has the capabilities to characterise the molecular and biomarker landscape of RCTs. Here exemplified, we captured the established clinical biomarkers of FIRE-3, and proposed five predictive biomarker hypotheses (FDR_int < 0.2). The biomarkers were based on either individual cancer genes or mutually exclusive patterns and exploited these genetic events in the context of well-characterised cancer subtypes. In addition, OncoBird thoroughly corrects for multiple hypothesis testing and includes resampling-based adjustments of treatment effects. In essence, OncoBird systematically investigated the molecular landscape of the FIRE-3 clinical trial, suggested biomarkers based on genetic alterations, performed a data-driven subgroup analysis, and finally, presented the results in an interpretable and intuitive way.

The statistical power of detecting biomarkers depends on the number of screened genes and subtypes, sample sizes and magnitude of treatment effects. For example, subtype-specific analyses reduce patient subgroup sizes, thus limiting the power for detecting interactions. In order to gain statistical power to detect genetic biomarkers with low mutational frequency, OncoBird exploits mutually exclusive modules (“Methods”). Despite the use of resampling-based treatment effect estimation in the found subgroups, hypotheses generated by exploratory tools such as OncoBird ought to be replicated in independent clinical trials. Nonetheless, OncoBird identified promising patient subpopulations within the FIRE-3 and ADJUVANT clinical trials with supported biological interpretation, which indicated refined predictive benefits in cancer subtypes.

A limitation of data-driven subgroup analysis is that these may produce spurious results if not biologically interpretable⁴⁷. To mitigate this risk, we used established tumour subtypes with distinct tumour biology in mCRC, i.e., here, the consensus molecular subtypes (CMS)²⁹ and primary tumour sidedness²¹. Furthermore, the grouping of functionally similar mutually exclusive somatic mutations in the cancer gene sequencing panel reinforced the identification of biological signals.

Somatic mutations may drive tumour subtypes; therefore, we systematically investigated mutational patterns within CMS1-4 and tumour sidedness. We found the majority of BRAF mutations in CMS1 and observed a co-occurrence between CMS2, left-sided tumours and amplifications in chr20q. In particular, CMS2 is characterised by a MYC signalling activation²⁹, which may be co-regulated by activation of the co-amplified AURKA⁴⁸. While we predominantly identified CMS-specific biomarkers, our results suggest that both primary tumour side and CMS subtypes play a major role in the landscape of predictive biomarkers. This highlights the need for OncoBird, an integrated biomarker discovery framework, which integrates the molecular landscape of RCTs with its biomarkers.

Several genes were co-amplified in chr20q, i.e., ARFRP1, TOP1, and SRC; thus, determining the drivers among these biomarker candidates is challenging. Among the prominent chr20q amplifications, TOP1 was previously proposed as a biomarker for irinotecan efficacy in metastatic colorectal cancer^49,50, which is part of the chemotherapeutic backbone of the FIRE-3 trial. Literature suggests that TOP1 abundance is essential for irinotecan-induced DNA double-strand breaks during DNA replication⁵¹. Additionally, TOP1 was identified to regulate EGFR through an endogenous interaction with the transcription factor c-Jun⁵², which supports the hypothesis that TOP1 amplifications may be the actionable biomarker. SRC has been reported to play a role in cancer progression^53,54, whereas for ARFRP1, no functional evidence has been presented yet.

The resulting co-amplifications between these cancer genes complicate the determination of the genetic driver in chr20q. To understand the causality of cancer aetiologies, further efforts require additional treatment regimes. Alternative clinical trials for metastatic colorectal cancer often involve different chemotherapy backbones, i.e., fluorouracil, leucovorin, and oxaliplatin (FOLFOX) or fluorouracil, leucovorin, and irinotecan (FOLFIRI)³⁰. The use of other therapy backbones may unravel the role of ARFRP1, TOP1 and SRC amplifications regarding better efficacy for patients treated with cetuximab. However, discrepancies may arise due to the synergism and antagonism of the different chemotherapy backbones and targeted treatments⁵⁵.

The prognostic potential of APC wild-type tumours for bevacizumab has been previously reported⁵⁶, whereas OncoBird did not yield enough evidence to support this. Indeed, a confounding factor is the enrichment of BRAF mutations in the APC wild-type tumours (p = 1.4 × 10⁻¹⁰, Fisher’s exact test). Specifically, 48% of APC wild-type tumours were BRAF mutated in the bevacizumab treatment arm, whereas in the cetuximab treatment arm, only 29% were BRAF mutated (p = 0.13, Fisher’s exact test). Nevertheless, a correlation between VEGFA expression and the mutational status of APC has been independently observed in primary colorectal tumour samples⁵⁷, suggesting that within APC mutated tumours, anti-VEGF treatment may indeed be beneficial.

RAS/BRAF mutations are known to harbour prognostic value in terms of overall survival^38,43. In addition, we observed that KRAS mutations showed highly CMS-specific responses. In particular, treatment response differed for tumours classified as CMS4 by KRAS status, showing better response for cetuximab in KRAS wild-type and for bevacizumab in KRAS mutated tumours, respectively. CMS4 has been reported to be associated with VEGF pathway activation and is thus associated with angiogenesis²⁹. Thus, patients with tumours resistant towards anti-EGFR treatment may benefit from VEGF inhibition. Further exclusion of BRAF mutations did not elevate the predictive potential of KRAS mutations in CMS4. However, the statistical power is limited by the fact that only six tumours harboured the prognostically unfavourable BRAF V600E mutation in CMS4²⁰.

In summary, OncoBird reproduced clinically established biomarkers and derived five hypotheses of biomarkers with predictive roles for FOLFIRI plus either cetuximab or bevacizumab. Highlighted examples include chr20q amplifications in CMS2 and KRAS mutations in CMS4, which may optimise patient stratification for metastatic colorectal cancer. Leveraging OncoBird for molecular profiling in the FIRE-3 clinical trial offered an expanded perspective on the molecular and biomarker landscape of these patients.

In the future, we anticipate that the analysis of clinical trials will progressively demand molecular patient tumour data, including predefined subtypes, highlighting the urgent need for integrative analysis tools such as OncoBird. Notably, OncoBird was developed for RCT designs and is generalisable to any trial design for which the intention-to-treat population was defined before the treatment randomisation, i.e., the treatment assignment is independent of patient characteristics. Accordingly, OncoBird is applicable to modern clinical trial designs based on master protocols⁵⁸, i.e., basket, umbrella, and platform trials, if control arms are included. In an emerging landscape of predictive molecular biomarkers in cancer, OncoBird may untangle complex dependencies between somatic alterations and tumour subtypes in RCTs. Furthermore, OncoBird is generalisable to any cancer entity, thus ultimately paving the way for the next generation of precision oncology therapies.

Methods

Clinical data of the FIRE-3 clinical trial

FIRE-3 is an open-label, randomised phase III trial comparing first-line treatment with either cetuximab or bevacizumab in combination with 5-fluorouracil, leucovorin and irinotecan (FOLFIRI) in patients with KRAS exon 2 wild-type metastatic colorectal cancer (mCRC). The protocol and rules of conduct were previously published^23,59 (NCT00433927). The trial was conducted in accordance with the Declaration of Helsinki (1996). All translational analyses were approved by the local ethics committee (University of Munich, registry no. 186-15). All patients included in this analysis provided written informed consent. 24% and 34% of the patients had female sex in the FOLFIRI plus cetuximab and bevacizumab arm, respectively. The sex is reported according to the study protocol^23,59, and gender cannot be distinguished retrospectively. The biological sex of patients (i.e., male or female) was assigned by the study doctor of the respective trial centre and reported to the clinical research organisation (CRO). The original intention-to-treat population consisted of 752 patients in total. Primary and secondary endpoints of the FIRE-3 trial, including the median overall survival (OS) and progression-free survival (PFS), were expressed in months and defined as stated in the respective articles^23,59. The objective response rate (ORR) was evaluated by the RECIST 1.0 criteria^23,59.

Next-generation sequencing and genetic alterations in FIRE-3

Primary tumour tissues from 373 patients were molecularly characterised by next-generation sequencing (NGS) with the FoundationOne® panel (Foundation Medicine, Inc., MA, USA; catalogue number not available), which identified somatic mutations and copy number alterations, i.e., deletions and amplifications, of 277 key cancer genes, as well as microsatellite instability (MSI) and tumour mutational burden²⁰. Somatic alterations were delivered in the form of binary matrices that reflect the mutant or wild-type status of a given gene based on single nucleotide variants (SV), copy number amplifications (AMP) and deletions (DEL). MSI is an important prognostic predictor and is enriched in CMS1⁶⁰, which we also observed in our study, with 8 of 10 MSI-H tumours being classified as CMS1. However, MSI-H tumours are less prevalent in metastatic disease (~5%)⁶⁰. Furthermore, only six and four MSI-H tumours were treated with bevacizumab and cetuximab, respectively.

Gene expression profiling in FIRE-3

The genetic characterisation is complemented with gene expression profiles from Xcel® microarrays (Almac Ltd, Belfast, UK; catalogue number: 902016) in a subset of 451 patients. Combining the clinical data and the layers of molecular characterisation yielded 163 and 186 fully characterised patients in the cetuximab and bevacizumab treatment arms, respectively.

Tumour subtypes in FIRE-3

A clinically established subtype for mCRC is its primary tumour sidedness. Left-sided tumours were located in the left hemicolon, i.e., splenic flexure to the rectum. In contrast, right-sided tumours were located in the right colon, i.e., caecum to the transverse colon. In addition, annotations for molecular subtypes of mCRC were obtained from transcriptome data that have been previously used to classify patients into their closest consensus molecular subtype (CMS)^21,29 using the cmsclassifier package with the SSP predictor. In this process, 24 out of 373 patient tumours were not allocated to any CMS because of missing transcriptomics data and were left out of the CMS-specific analysis. The CMS classification was used as a complementary alternative to the primary tumour side and is currently discussed in multiple clinical settings⁶¹.

Oncology Biomarker Discovery workflow

The Oncology Biomarker Discovery (OncoBird) framework applies to RCTs for which patients received either treatment $t\in \{0,1\}$ according to the treatment indicator $T$, had an associated outcome $Y$ and can be classified into $q$ subtypes $\{s_{1},\ldots,s_{q}\}$ according to the subtype variable $S$ (clinical data). Additionally, patient tumours are characterised by $m$ candidate genetic biomarkers $\mathbf{X}=X_{1},\ldots,X_{m}$ with the observed biomarkers for patients $\mathbf{x}=x_{1},\ldots,x_{m}$ (genetic data). The genetic data can be used to group functionally similar genes that can be added to the set of candidate biomarkers. Furthermore, it is possible to add additional binary features to $\mathbf{X}$ such as binarised copy number alterations with appropriate cutoffs or the MSI status of a tumour. Both genetic data (MUT) and clinical data (CLIN) are required inputs to the OncoBird workflow (Supplementary Data 1), which is described in the following sections. All implemented thresholds of OncoBird can be adjusted by the user, thus empowering more lenient or stringent analyses.
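As an illustration of how grouped genes can enter the binary feature set, the following minimal Python sketch derives a module feature from single-gene columns. It assumes that a module counts as altered when at least one of its member genes is altered; this rule, the function name `module_status` and the toy values are illustrative assumptions and are not taken from the OncoBird code.

```python
from typing import Dict, List


def module_status(alterations: Dict[str, List[int]], genes: List[str]) -> List[int]:
    """Return 1 for tumours altered in at least one gene of the module, else 0.

    Assumption for illustration: module status is the logical OR of its genes.
    """
    n_tumours = len(next(iter(alterations.values())))
    return [int(any(alterations[g][i] for g in genes)) for i in range(n_tumours)]


# Toy binary matrix: one entry per gene, one position per tumour (invented values).
alterations = {
    "KRAS": [1, 0, 0, 1, 0],
    "NRAS": [0, 1, 0, 0, 0],
    "BRAF": [0, 0, 1, 0, 0],
}

# Add a joint RAS feature to the candidate biomarkers.
alterations["RAS"] = module_status(alterations, ["KRAS", "NRAS"])
print(alterations["RAS"])  # [1, 1, 0, 1, 0]
```

The derived column has the same shape as the single-gene columns, so it can be tested with the same downstream models.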

Characterising the molecular landscape in clinical trials

OncoBird first examines genetic features $\mathbf{X}$ in tumour subtypes $\{s_{1},\ldots,s_{q}\}$ independent of the treatment and patient response (function GET-MUTATIONS-IN-SUBTYPES in Supplementary Data 1). To examine enrichment or depletion of each genetic feature in tumour subtypes, one-sided hypergeometric tests are performed using the ‘phyper’ R function. Subsequently, the resulting p-values are corrected for multiple hypothesis testing with the Benjamini–Hochberg (BH) method⁶². The FDR cutoff for this analysis step is denoted by FDR_mol and controlled at FDR_mol = 0.05 as our default setting. Our method generalises to any binary tumour characterisation, e.g., the MSI status in FIRE-3. As a default setting, we test genetic features that were mutated in at least ten tumours (n = 10).
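The enrichment step can be sketched in a few lines of standard-library Python. This is not the OncoBird implementation, which relies on R's ‘phyper’; the function names and the toy counts below are hypothetical. The upper tail P(X ≥ k) computed here corresponds to `phyper(k - 1, K, N - K, n, lower.tail = FALSE)` in R.

```python
from math import comb
from typing import List


def hypergeom_upper_tail(k: int, K: int, n: int, N: int) -> float:
    """P(X >= k): enrichment of a feature mutated in K of N tumours
    among the n tumours of one subtype, k of which are mutated."""
    total = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(K, n) + 1))
    return total / comb(N, n)


def hypergeom_lower_tail(k: int, K: int, n: int, N: int) -> float:
    """P(X <= k): the matching one-sided test for depletion."""
    total = sum(comb(K, i) * comb(N - K, n - i) for i in range(0, min(k, K, n) + 1))
    return total / comb(N, n)


def benjamini_hochberg(pvalues: List[float]) -> List[float]:
    """BH-adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest to the smallest p-value to enforce monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy example (invented counts): 5 of 20 tumours mutated, subtype of 8 tumours, 4 mutated.
p_enrich = hypergeom_upper_tail(k=4, K=5, n=8, N=20)
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.5])
significant = [p < 0.05 for p in adjusted]  # FDR_mol = 0.05 as in the default setting
```

In practice the adjustment would run over all feature and subtype combinations tested in this step.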

Identifying mutual exclusivity

For the identification of mutually exclusive modules, we used the Mutex algorithm³⁹ (function GET-MUTATIONS-MODULES in Supplementary Data 1). It leverages a signalling network⁶³ collecting interactions from Pathway Commons⁶⁴, SPIKE⁶⁵ and SignaLink⁶⁶ in order to scan for common downstream effects of combinations of somatic alterations $\mathbf{X}$. The default setting only uses somatic variants that were altered in at least ten tumours (n = 10).

Genetic and subtype-specific biomarkers

OncoBird tests single somatic alterations and previously derived mutually exclusive somatic alterations for differential prognosis in each treatment arm separately (function GET-TREATMENT-SPECIFIC-BIOMARKERS in Supplementary Data 1). The patient outcome $Y\left(T=t,S=s_{k}\right)$ for the treatment arm $T=t$ in subtype $S=s_{k}$ with $k=1,\ldots,q$ may be defined by survival data (OS or PFS) or a binary variable measuring the objective response rate (ORR). Depending on the type of outcome, this is modelled with either Cox proportional hazards regression models or logistic regression models expressed by their linear predictor function $f\left(\mathbf{x},t\right)$. Using this classical approach for subgroup analysis, the treatment-specific regression models in subtypes take the form

$$f\left(\mathbf{x},t\right)=\alpha_{0j}+\alpha_{1j}x_{j}+\sum_{l}C_{l} \tag{1}$$

Cox proportional hazards regression models for survival endpoints were implemented with the ‘coxph’ function from the survival R package, and logistic regression models for binary response variables were implemented using the ‘glm’ function. We test each $\mathbf{x}=x_{1},\ldots,x_{m}$ first across all tumours, and subsequently in tumour subtypes $\{s_{1},\ldots,s_{q}\}$, i.e., CMS or primary sidedness. $\alpha_{1j}$ is the coefficient estimating the contribution of candidate biomarker $j=1,\ldots,m$ for patient outcomes in the context of each treatment arm $T=t$ in the subtype $S=s_{k}$. The predictors $C_{1},\ldots,C_{l}$ include additional prognostic covariates and their coefficients.
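For orientation, the linear predictor enters the two model families through their standard definitions, which are not specific to OncoBird. The symbols $\tau$ (follow-up time) and $h_{0}$ (unspecified baseline hazard) are introduced here for illustration only:

$$h\left(\tau \mid \mathbf{x},t\right)=h_{0}\left(\tau\right)\exp\left\{f\left(\mathbf{x},t\right)\right\},\qquad \log\frac{P\left(Y=1\mid \mathbf{x},t\right)}{1-P\left(Y=1\mid \mathbf{x},t\right)}=f\left(\mathbf{x},t\right)$$

Under these definitions, $\exp\left(\alpha_{1j}\right)$ is the hazard ratio (Cox model) or odds ratio (logistic model) of mutant versus wild-type tumours, and in the Cox model the intercept $\alpha_{0j}$ is absorbed into $h_{0}\left(\tau\right)$.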

The p-value $p_{\alpha_{1j}}$ derived by a Wald test from the coefficient $\alpha_{1j}$ is multiplicity-adjusted for each treatment arm $t$ and across all biomarkers $x_{j}$ with $j=1,\ldots,m$ for either all patients or across subtypes $s_{k}$ with $k=1,\ldots,q$ and yields adjusted p-values $\widetilde{p}_{\alpha_{1j}}$ using the Benjamini–Hochberg (BH) method⁶². The default false discovery rates (FDR) are controlled at $\mathrm{FDR}_{\alpha}=0.1$ for either treatment-specific component $\alpha_{1j}$.

The adjustable default setting of OncoBird is to only perform statistical tests if, for a given candidate biomarker ${x}_{j}$ and tumour subtype ${s}_{k}$, at least n = 10 samples were present in each mutant and wild-type population. Additionally, OncoBird only tested alterations for which its corresponding gene module had at least n tumours redistributed compared to the single gene alteration.

Predictive components of biomarkers

For the subsequent comparison of treatment arms, OncoBird tests for significant statistical interactions between treatment arms and genetic alterations in tumour subtypes (function GET-PREDICTIVE-BIOMARKERS in Supplementary Data 1). For that, we modelled the outcome $Y\left(S={s}_{k}\right)$ in subtype $S={s}_{k}$ with $k=1,\ldots,{q}$ using regression models with interactions between $T$ and ${X}_{j}$ which take the form

$$f\left(\mathbf{x},t\right)=\beta_{0j}+\beta_{1j}x_{j}+\beta_{2j}x_{j}t+\sum_{l}C_{l}, \tag{2}$$

where the coefficients $\beta_{1j}$ and $\beta_{2j}$ estimate the prognostic and predictive component of biomarker $x_{j}$ in subtype $s_{k}$, respectively. The p-value $p_{\beta_{2j}}$ derived with a Wald test from the coefficient $\beta_{2j}$ is multiplicity-adjusted across all $m$ biomarkers for either all patients or across subtypes $s_{k}$ with $k=1,\ldots,q$ and yields BH adjusted p-values $\widetilde{p}_{\beta_{2j}}$. The default FDR is controlled at $\mathrm{FDR}_{\beta}=0.2$ for predictive components. The biomarker $X_{j}$ in subtype $s_{k}$ is a putatively predictive biomarker if $\widetilde{p}_{\alpha_{1j}} < \mathrm{FDR}_{\alpha}$ for either $t$ and $\widetilde{p}_{\beta_{2j}} < \mathrm{FDR}_{\beta}$.

Furthermore, OncoBird only performs statistical tests if for a given genetic alteration ${X}_{j}$ and tumour subtype ${s}_{k}$, at least n = 10 samples were present in each mutant and wild-type population for each treatment arm as default setting.
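The two rules above, the minimum group size and the joint FDR criterion, can be written as small predicates. The sketch below is a plain-Python illustration with hypothetical function names; it assumes the BH-adjusted p-values have already been computed and reads "for either $t$" as "in at least one treatment arm".

```python
from typing import Dict, List


def is_testable(x: List[int], arm: List[int], min_n: int = 10) -> bool:
    """True if mutant (1) and wild-type (0) groups each contain at least
    min_n tumours in both treatment arms (coded 0 and 1)."""
    counts = {(a, s): 0 for a in (0, 1) for s in (0, 1)}
    for status, treatment in zip(x, arm):
        counts[(treatment, status)] += 1
    return all(c >= min_n for c in counts.values())


def is_putatively_predictive(adj_p_alpha: Dict[int, float], adj_p_beta: float,
                             fdr_alpha: float = 0.1, fdr_beta: float = 0.2) -> bool:
    """adj_p_alpha maps each arm t to the BH-adjusted p-value of alpha_1j;
    adj_p_beta is the BH-adjusted p-value of the interaction term beta_2j."""
    arm_specific = any(p < fdr_alpha for p in adj_p_alpha.values())
    return arm_specific and adj_p_beta < fdr_beta


# Toy example (invented data): 20 tumours per arm, 10 mutant and 10 wild type in each.
arm = [0] * 20 + [1] * 20
x = ([1] * 10 + [0] * 10) * 2
print(is_testable(x, arm))                                    # True
print(is_putatively_predictive({0: 0.05, 1: 0.60}, 0.15))     # True
```

Both thresholds are exposed as arguments because the text describes them as adjustable defaults.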

Resampling for correction of conditional average treatment effects

Lastly, we estimate the conditional average treatment effect (CATE) for the found biomarkers (function GET-PREDICTIVE-BIOMARKERS in Supplementary Data 1). For each significant ${X}_{j}$ in ${s}_{k}$, there is one CATE estimate in each found subpopulation with a positive (mutant) biomarker ${x}_{j}=1$ and negative (wild type) biomarker ${x}_{j}=0$. In each population, we estimate the CATE by modelling the outcome $Y$ by

$$f\left(\mathbf{x},t\right)=\gamma_{0}+\gamma_{1}t+\sum_{l}C_{l}, \tag{3}$$

where ${\gamma }_{1}$ estimates the (biased) CATE in terms of either hazard ratios or odds ratios dependent on outcome type in the subgroup defined by biomarker ${x}_{j}$ and subtype ${s}_{k}$. The population with the larger absolute estimate ${\gamma }_{1}$ is used to estimate the subgroups ${A}_{{x}_{j},{s}_{k}}$.

For each found subgroup $A$, we assess the significance of the associated CATE estimate $\gamma_{1}$ and derive the p-value $p_{\gamma_{1}}$ using a Wald test. Furthermore, we perform a multiplicity-adjustment of $p_{\gamma_{1}}$ and derive honest estimates of the CATE.

The p-values are adjusted for multiplicity using a permutation-based approach that takes into account the entire subgroup search strategy³. For that, we permuted the treatment labels U = 1000 times to obtain null datasets without any differential treatment effects. Next, for each null dataset, we select significant subgroups ${A}^{\left(u\right)}$ for the same thresholds and record the treatment effect p-value of the best subgroup ${p}^{\left(u\right)}$ with $u=1,\ldots,U$. The adjusted p-values are then given by

$$\widetilde{p}_{\gamma_{1}}=\frac{1}{U}\sum_{u=1}^{U}I_{\{p^{(u)}\le p_{\gamma_{1}}\}}\left(p^{(u)}\right), \tag{4}$$

that is, the fraction of p-values $p^{(u)}$ that are smaller than or equal to $p_{\gamma_{1}}$, with $I$ denoting the indicator function. Furthermore, we derive an honest estimate of the treatment effect $\gamma_{1}$. Since subgroups $A$ are derived from the same data as the treatment effect estimates, the estimates from the resubstitution $\gamma_{1}\left(A_{x_{j},s_{k}}\right)$ will be biased. In order to derive a bias-corrected estimate $\widetilde{\gamma}_{1}$, we use a previously proposed non-parametric bootstrap approach⁹. For that, we generated B = 500 bootstrapped datasets. For each resampled dataset $b=1,\ldots,B$ we estimate subgroups $\hat{A}_{x_{j},s_{k}}^{(b)}$. The treatment effects can then be either estimated on the b-th resampled dataset $\gamma_{1}^{(b)}\left(A^{(b)}\right)$ or on the original dataset $\gamma_{1}\left(A^{(b)}\right)$. The bias-corrected CATE estimate is then given by

$$\widetilde{\gamma}_{1}=\frac{1}{B}\sum_{b=1}^{B}\left(\gamma_{1}\left(A\right)+\gamma_{1}\left(A^{(b)}\right)-\gamma_{1}^{(b)}\left(A^{(b)}\right)\right). \tag{5}$$

The 95% confidence intervals are constructed by the 0.025 and 0.975 quantiles of the bootstrapped distribution.
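Equations (4) and (5) reduce to simple averages once the resampling results are available. The sketch below is a plain-Python illustration with hypothetical function names and invented numbers. It assumes that the subgroup search has already been rerun on each permuted or bootstrapped dataset and that only the resulting p-values and effect estimates are passed in.

```python
from statistics import mean
from typing import Sequence


def permutation_adjusted_p(p_obs: float, null_best_p: Sequence[float]) -> float:
    """Eq. (4): fraction of best-subgroup p-values from the U permuted
    datasets that are smaller than or equal to the observed p-value."""
    return sum(p <= p_obs for p in null_best_p) / len(null_best_p)


def bias_corrected_cate(gamma_resub: float,
                        gamma_orig_on_boot: Sequence[float],
                        gamma_boot_on_boot: Sequence[float]) -> float:
    """Eq. (5): average over bootstrap replicates b of
    gamma_1(A) + gamma_1(A^(b)) - gamma_1^(b)(A^(b)).

    gamma_resub         : resubstitution estimate gamma_1(A)
    gamma_orig_on_boot  : subgroup A^(b) evaluated on the original data
    gamma_boot_on_boot  : subgroup A^(b) evaluated on the b-th resampled data
    """
    if len(gamma_orig_on_boot) != len(gamma_boot_on_boot):
        raise ValueError("one pair of estimates is needed per bootstrap replicate")
    return mean(gamma_resub + g_orig - g_boot
                for g_orig, g_boot in zip(gamma_orig_on_boot, gamma_boot_on_boot))


# Toy numbers (invented) with U = 4 permutations and B = 2 bootstrap replicates.
p_adj = permutation_adjusted_p(0.01, [0.5, 0.01, 0.2, 0.001])
gamma_corrected = bias_corrected_cate(-0.8, [-0.5, -0.6], [-0.9, -1.0])
print(p_adj)  # 0.5
```

In the toy example the bootstrap estimates are more extreme than the estimates on the original data, so the correction shrinks the resubstitution estimate towards zero.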

OncoBird parameterisation for FIRE-3

We used the function GET-MUTATIONS-IN-SUBTYPES to evaluate the primary tumour side and CMS as tumour subtypes with the default setting FDR_mol < 0.05. In total, we performed 156 and 312 statistical tests for the primary tumour sidedness and CMS, respectively. Using the GET-MUTATIONS-MODULES function with default settings, we analysed 42 genes which yielded 29 mutually exclusive modules. Mutations in KRAS or NRAS are the established clinical biomarkers for anti-EGFR treatment, thus we jointly modelled KRAS and NRAS as RAS mutations resulting in 10 additional modules.

The GET-TREATMENT-SPECIFIC-BIOMARKERS function was used with the number of metastatic sites and the information about a prior tumour resection as added covariates ${C}_{1},{C}_{2}$. With the OncoBird default setting, we performed 816 statistical tests across all readouts $Y$ (OS, PFS and ORR), the cetuximab and bevacizumab treatment arm and tumour subtypes, i.e., CMS1-4, left- and right-sided and across all tumours. FDR cutoffs are employed for each treatment arm separately and are denoted FDR_cet and FDR_bev for the analysis in the cetuximab and bevacizumab treatment arms, respectively. In total, we found 92 significant associations with the default setting FDR_cet/bev < 0.1. The criteria HR < 1 and OR < 1 corresponded to a better prognosis for the mutant tumours compared to the wild-type tumours and vice versa. To consistently report HR < 1 and OR < 1 as beneficial risk reduction, reciprocal values of HRs and ORs were used if wild-type tumours displayed a better prognosis. We represent p-values, hazard/odds ratios with the 95% confidence intervals (CI) in square brackets and the associated FDRs.
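The reciprocal reporting convention can be made explicit with a small helper. The sketch below is illustrative only: the function name is hypothetical, and it relies on the fact that inverting a ratio also inverts and swaps the bounds of its confidence interval.

```python
from typing import Tuple


def report_as_benefit(ratio: float, lower: float, upper: float) -> Tuple[float, float, float, bool]:
    """Return (ratio, lower, upper, flipped) so that the reported HR or OR is <= 1.

    flipped is True when the reciprocal was taken, i.e. when the reference
    group (here: wild type) had the better prognosis.
    """
    if ratio > 1:
        # Taking reciprocals reverses the order of the interval bounds.
        return 1 / ratio, 1 / upper, 1 / lower, True
    return ratio, lower, upper, False


# Toy example (invented values): HR = 2.0 [1.25, 4.0] for mutant vs wild type.
hr, lo, hi, flipped = report_as_benefit(2.0, 1.25, 4.0)
print(hr, lo, hi, flipped)
```

Keeping the `flipped` flag alongside the ratio preserves the information about which group the benefit refers to.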

In FIRE-3, the GET-PREDICTIVE-BIOMARKERS function with default settings resulted in a total of 396 statistical tests across the readouts $Y$ (OS, PFS and ORR) and the tumour subtypes $s_{k}$. FDR cutoffs for the interaction tests across both treatment arms are denoted by FDR_int. We explored 57 associations with FDR_int < 0.6 and FDR_cet/bev < 0.1 (Supplementary Data 3) and further focused on a subset of five biomarkers with default setting FDR_int < 0.2 for OS, i.e., two gene modules and three single genes (Supplementary Data 4). For the cross-validation analysis, a more lenient FDR_int < 0.3 was employed, which deviated from the default setting to account for reduced sample sizes in the training and testing splits. HRs and ORs >1 and <1 corresponded to benefit with cetuximab and bevacizumab, respectively. To report the benefits of cetuximab treatment, the reciprocal values of HRs and ORs were used in the manuscript in order to display treatment benefits consistently with HR < 1 and OR < 1. We reported p-values and hazard/odds ratios with the 95% CIs for the treatment comparison and the p-values and associated FDRs for the interaction tests.

OncoBird parameterisation for ADJUVANT

The ADJUVANT clinical trial in EGFR mutant non-small cell lung cancer (NSCLC) aimed to assess the efficacy of gefitinib versus chemotherapy with vinorelbine and cisplatin (NCT01405079)³⁴. The trial was previously approved by the research ethics boards of Guangdong Provincial People’s Hospital and all other participating hospitals³⁵. Of note, 58% and 59% of patients had female sex in the gefitinib and chemotherapy arm, respectively. The sex was reported according to the study protocol³⁴, and gender cannot be distinguished retrospectively. We used the EGFR subtype, i.e., exon 19 deletion or exon 21 Leu858Arg, and the smoking history as putative tumour subtypes and clinical endpoints were disease-free survival (DFS) and overall survival (OS). We analysed 22 somatic alterations in 171 patients, from which 76 patients were treated with chemotherapy alone, and 95 were treated with gefitinib. For the subsequent analysis, we used the OncoBird default settings. The obtained results (Supplementary Data 2; Supplementary Figs. 3–6) and an associated extensive report can be reproduced in a runnable demo on Code Ocean (https://codeocean.com/capsule/9911222/tree/v1).

Benchmarking of alternative methods with FIRE-3

For benchmarking the biomarker identification, we compared OncoBird to seven competing subgroup analysis algorithms leveraging the overall survival of FIRE-3 (Supplementary Table 1)^8,9,11,12,44,45,46. We formed predictors by concatenating clinical annotations, including information about tumour resection, number of metastatic sites, age, gender, MSI and lung metastatic status. We added single genetic alterations and mutually exclusive modules observed across at least ten patients and in both investigated tumour subtypes, thus mirroring the OncoBird default settings. Furthermore, we investigated interactions between genetic alterations and tumour primary sidedness or CMS as predictors. Subgroups for the method evaluation were formed as the union of the subgroups showing cetuximab benefit according to the identified biomarkers (Supplementary Table 1).

All benchmarked models were 5-fold cross-validated with five repetitions. A univariate Cox proportional hazards model assessed performances leveraging the treatment effect based on OS in the subgroups with predicted benefits according to the found biomarkers. This included the treatment effect across the whole test set and in the subgroup defined by the current treatment guidelines, i.e., left-sided and RAS wild-type tumours³⁷. The significance of the treatment effect in the subgroups of the test set was assessed using a modified t-test for resampled performance metrics⁶⁷, denoted by p_cv.
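A 5-fold cross-validation with five repetitions produces 25 train/test splits. The following Python sketch shows one way to generate such splits. It is an assumption-laden illustration: the function name is hypothetical, the splits are unstratified, and the source does not state how folds were drawn.

```python
import random
from typing import List, Tuple


def repeated_kfold(n: int, k: int = 5, repeats: int = 5,
                   seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Return repeats * k (train_indices, test_indices) pairs over n patients."""
    rng = random.Random(seed)
    splits = []
    for _ in range(repeats):
        indices = list(range(n))
        rng.shuffle(indices)
        # Deal the shuffled indices into k folds of near-equal size.
        folds = [indices[i::k] for i in range(k)]
        for held_out in range(k):
            test = sorted(folds[held_out])
            train = sorted(i for f in range(k) if f != held_out for i in folds[f])
            splits.append((train, test))
    return splits


splits = repeated_kfold(n=23)
print(len(splits))  # 25
```

Each patient appears in exactly one test fold per repetition, so every repetition yields one held-out prediction per patient.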

For comparing computational methods and their predicted biomarkers, the models were fitted on the whole dataset. The parameterisation of these methods followed the suggested default settings unless these conflicted with the use case outlined above. For example, for tree-based methods, the features contained in the resulting tree were used as biomarkers, with a tree depth of 2 and a minimum subgroup size of n = 10. For the implementation of the virtual twins method (VT)⁹, we used the R package randomForestSRC with default parameters and averaged predictions over 10 times repeated 10-fold cross-validation. Subsequently, a regression tree was fitted to the original data. In order to perform model-based recursive partitioning⁸, we used the R package model4you⁶⁸ using an exponential model with default conditional inference tree control parameters. The PRISM method⁴⁶ was implemented in the R package StratifiedMedicine, for which we used Cox proportional hazards regression. We used the implementation of causal survival forests⁶⁹ (CRF) in the R package grf⁷⁰ for estimating conditional treatment effects. The propensity scores were set as constant and the target estimand was set to restricted mean survival time (RMST) with horizon = 100. After model fitting, variable importance scores were extracted, and biomarkers were selected according to predictors with significant linear projections of the conditional average treatment effects (p < 0.05). Next, we employed policy learning (POL)⁴⁴ to find optimal treatment regimens using the R package policytree⁷¹. We used the 50 most important predictors according to the CRF variable importance scores and their treatment effect estimates to produce a decision tree.
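To make the RMST estimand concrete: it is the area under the survival curve up to the chosen horizon. The sketch below computes it from a Kaplan–Meier curve in plain Python. It illustrates the estimand only and is not how grf estimates conditional effects; the function names and the toy data are hypothetical.

```python
from typing import List, Tuple


def kaplan_meier(times: List[float], events: List[int]) -> List[Tuple[float, float]]:
    """Return [(event_time, S(event_time))] for distinct event times, ascending.
    events[i] is 1 for an observed event and 0 for censoring."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            surv *= 1 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


def rmst(times: List[float], events: List[int], horizon: float) -> float:
    """Area under the Kaplan-Meier step function between 0 and horizon."""
    area, prev_t, prev_s = 0.0, 0.0, 1.0
    for t, s in kaplan_meier(times, events):
        if t >= horizon:
            break
        area += prev_s * (t - prev_t)
        prev_t, prev_s = t, s
    return area + prev_s * (horizon - prev_t)


# Toy data (invented): three patients, all with observed events.
print(rmst([2, 4, 6], [1, 1, 1], horizon=10))
```

A treatment effect on this scale is the difference in RMST between arms, which is the quantity a causal survival forest targets conditionally on the predictors.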

The remaining methods were not based on trees. For the outcome weighted method (OWE)¹¹, implemented in the R package personalized⁷², we used a constant propensity score, lasso loss and 10-fold cross-validation. The GUIDE method⁴⁵ was available as a binary executable under https://pages.cs.wisc.edu/~loh/guide.html. We used Cox proportional hazards regression with interaction tests and mean-based trees with pruning. For the SIDES method (R package SIDES)⁷, we used level_control=0 and alpha=0.05.

Statistics and reproducibility

The investigators were not blinded to the randomised treatment allocation during the data collection and outcome assessment. Since the conducted subgroup analysis is retrospective, the sample sizes were not predetermined. No data were excluded from the analysis. Details of the conducted statistical tests are provided in the figure captions, Supplementary Data 2–4 and Source Data. The results of the statistical analysis of the ADJUVANT clinical trial are reproducible from a demo run on Code Ocean (https://codeocean.com/capsule/9911222/tree/v1).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The clinical data summary from the FIRE-3 clinical trial analysed in this study has been deposited in the Pharmnet.bund online platform of the German Federal Ministry of Health (https://portal.dimdi.de/data/ctr/O-0329_01-2-1-B80630-20190731152224.pdf) and was published before¹⁹. The clinical and molecular data are available under restricted access due to data privacy laws. The raw and processed data can be obtained through the corresponding author at volker.heinemann@med.uni-muenchen.de. The data from the results of OncoBird v0.1.0 executed on the FIRE-3 trial are available in Supplementary Data 3 and Source Data. The processed data from the ADJUVANT clinical trial are available on Zenodo^33,35. The data from the results of OncoBird v0.1.0 executed on the ADJUVANT trial are available in Supplementary Data 2, Source Data and on Code Ocean (https://codeocean.com/capsule/9911222/tree/v1). Source data are provided with this paper.

Code availability

Oncology Biomarker Discovery (OncoBird) is publicly available at https://github.com/MendenLab/OncoBird. The repository contains an R package as well as a Shiny application with a graphical user interface in a local docker container (Supplementary Fig. 1). Additionally, a demo run of OncoBird v0.1.0 used for analysis is available on Code Ocean (https://codeocean.com/capsule/9911222/tree/v1).

References

Ting, N., Cappelleri, J. C., Ho, S. & Chen, D.-G. (eds) Design and Analysis of Subgroups with Biopharmaceutical Applications (Springer, 2020).
European Medicines Agency. Guideline on the Investigation of Subgroups in Confirmatory Clinical Trials. Draft. European Medicines Agency/Committee for Medicinal Products for Human Use. EMA/CHMP/539146/2013 (EMA, 2014).
Lipkovich, I., Dmitrienko, A. & D'Agostino Sr, B. R. Tutorial in biostatistics: data-driven subgroup identification and analysis in clinical trials. Stat. Med. 36, 136–196 (2017).
Zhang, Z., Seibold, H., Vettore, M. V., Song, W.-J. & François, V. Subgroup identification in clinical trials: an overview of available methods and their implementations with R. Ann. Transl. Med. 6, 122 (2018).
Loh, W., Cao, L. & Zhou, P. Subgroup identification for precision medicine: a comparative review of 13 methods. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 9, e1326 (2019).
Lipkovich, I., Dmitrienko, A., Denne, J. & Enas, G. Subgroup identification based on differential effect search—a recursive partitioning method for establishing response to treatment in patient subpopulations. Stat. Med. 30, 2601–2621 (2011).
Lipkovich, I. & Dmitrienko, A. Strategies for identifying predictive biomarkers and subgroups with enhanced treatment effect in clinical trials using SIDES. J. Biopharm. Stat. 24, 130–153 (2014).
Seibold, H., Zeileis, A. & Hothorn, T. Model-based recursive partitioning for subgroup analyses. Int. J. Biostat. 12, 45–63 (2016).
Foster, J. C., Taylor, J. M. G. & Ruberg, S. J. Subgroup identification from randomized clinical trial data. Stat. Med. 30, 2867–2880 (2011).
Xu, Y. et al. Regularized outcome weighted subgroup identification for differential treatment effects. Biometrics 71, 645–653 (2015).
Chen, S., Tian, L., Cai, T. & Yu, M. A general statistical framework for subgroup identification and comparative treatment scoring. Biometrics 73, 1199–1209 (2017).
Wager, S. & Athey, S. Estimation and inference of heterogeneous treatment effects using random forests. J. Am. Stat. Assoc. 113, 1228–1242 (2018).
Künzel, S. R., Sekhon, J. S., Bickel, P. J. & Yu, B. Metalearners for estimating heterogeneous treatment effects using machine learning. Proc. Natl Acad. Sci. USA 116, 4156–4165 (2019).
Cremolini, C. et al. First-line chemotherapy for mCRC—a review and evidence-based algorithm. Nat. Rev. Clin. Oncol. 12, 607–619 (2015).
Thomas, R. K. et al. High-throughput oncogene mutation profiling in human cancer. Nat. Genet. 39, 347–351 (2007).
Kawazoe, A. et al. A retrospective observational study of clinicopathological features of KRAS, NRAS, BRAF and PIK3CA mutations in Japanese patients with metastatic colorectal cancer. BMC Cancer 15, 258 (2015).
Van Cutsem, E. et al. Cetuximab and chemotherapy as initial treatment for metastatic colorectal cancer. N. Engl. J. Med. 360, 1408–1417 (2009).
Saltz, L. B. et al. Bevacizumab in combination with oxaliplatin-based chemotherapy as first-line therapy in metastatic colorectal cancer: a randomized phase III study. J. Clin. Oncol. 26, 2013–2019 (2008).
Heinemann, V. et al. FOLFIRI plus cetuximab or bevacizumab for advanced colorectal cancer: final survival and per-protocol analysis of FIRE-3, a randomised clinical trial. Br. J. Cancer 124, 587–594 (2021).
Stahler, A. et al. Single-nucleotide variants, tumour mutational burden and microsatellite instability in patients with metastatic colorectal cancer: next-generation sequencing results of the FIRE-3 trial. Eur. J. Cancer 137, 250–259 (2020).
Stintzing, S. et al. Consensus molecular subgroups (CMS) of colorectal cancer (CRC) and first-line efficacy of FOLFIRI plus cetuximab or bevacizumab in the FIRE3 (AIO KRK-0306) trial. J. Clin. Oncol. 35, 3510–3510 (2017).
Laurent-Puig, P. et al. MiR-31-3p is a predictive biomarker of cetuximab response in FIRE3 clinical trial. Ann. Oncol. 27, vi151 (2016).
Heinemann, V. et al. FOLFIRI plus cetuximab versus FOLFIRI plus bevacizumab as first-line treatment for patients with metastatic colorectal cancer (FIRE-3): a randomised, open-label, phase 3 trial. Lancet Oncol. 15, 1065–1075 (2014).
Duarte, S. et al. Right vs left-sided RAS wild-type metastatic colorectal cancer treated with EGFR inhibitors: prognostic differences. Ann. Oncol. 30, iv53 (2019).
Stintzing, S. et al. Randomized study to investigate FOLFOXIRI plus either bevacizumab or cetuximab as first-line treatment of BRAF V600E-mutant mCRC: the phase-II FIRE-4.5 study (AIO KRK-0116). J. Clin. Oncol. 39, 3502–3502 (2021).
Peeters, M. et al. Massively parallel tumor multigene sequencing to evaluate response to panitumumab in a randomized phase III study of metastatic colorectal cancer. Clin. Cancer Res. 19, 1902–1912 (2013).
Seymour, M. T. et al. Panitumumab and irinotecan versus irinotecan alone for patients with KRAS wild-type, fluorouracil-resistant advanced colorectal cancer (PICCOLO): a prospectively stratified randomised trial. Lancet Oncol. 14, 749–759 (2013).
Dienstmann, R., Salazar, R. & Tabernero, J. Overcoming resistance to anti-EGFR therapy in colorectal cancer. Am. Soc. Clin. Oncol. Educ. Book 35, e149–e156 (2015).
Guinney, J. et al. The consensus molecular subtypes of colorectal cancer. Nat. Med. 21, 1350–1356 (2015).
Lenz, H.-J. et al. Impact of consensus molecular subtype on survival in patients with metastatic colorectal cancer: results from CALGB/SWOG 80405 (Alliance). J. Clin. Oncol. 37, 1876–1885 (2019).
Mooi, J. K. et al. The prognostic impact of consensus molecular subtypes (CMS) and its predictive effects for bevacizumab benefit in metastatic colorectal cancer: molecular analysis of the AGITG MAX clinical trial. Ann. Oncol. 29, 2240–2246 (2018).
Sveen, A., Kopetz, S. & Lothe, R. A. Biomarker-guided therapy for colorectal cancer: strength in complexity. Nat. Rev. Clin. Oncol. 17, 11–32 (2020).
cancer-oncogenomics. cancer-oncogenomics/minerva-adjuvant-nsclc: adjuvant minerva study v1.0.0. Zenodo https://doi.org/10.5281/zenodo.5242512 (2021).
Zhong, W.-Z. et al. Gefitinib versus vinorelbine plus cisplatin as adjuvant treatment for stage II-IIIA (N1-N2) EGFR-mutant NSCLC (ADJUVANT/CTONG1104): a randomised, open-label, phase 3 study. Lancet Oncol. 19, 139–148 (2018).
Liu, S.-Y. et al. Genomic signatures define three subtypes of EGFR-mutant stage II-III non-small-cell lung cancer with distinct adjuvant therapy outcomes. Nat. Commun. 12, 6450 (2021).
Holch, J. W., Ricard, I., Stintzing, S., Modest, D. P. & Heinemann, V. The relevance of primary tumour location in patients with metastatic colorectal cancer: a meta-analysis of first-line clinical trials. Eur. J. Cancer 70, 87–98 (2017).
Chiorean, E. G. et al. Treatment of patients with late-stage colorectal cancer: ASCO Resource-Stratified Guideline. JCO Glob. Oncol. 6, 414–438 (2020).
Ptashkin, R. N. et al. Chromosome 20q amplification defines a subtype of microsatellite stable, left-sided colon cancers with wild-type RAS/RAF and better overall survival. Mol. Cancer Res. 15, 708–713 (2017).
Babur, Ö. et al. Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations. Genome Biol. 16, 45 (2015).
Hsu, H.-C. et al. Mutations of KRAS/NRAS/BRAF predict cetuximab resistance in metastatic colorectal cancer patients. Oncotarget 7, 22257–22270 (2016).
Díaz-Rubio, E. et al. Role of Kras status in patients with metastatic colorectal cancer receiving first-line chemotherapy plus bevacizumab: a TTD group cooperative study. PLoS ONE 7, e47345 (2012).
Modest, D. P. et al. Outcome according to KRAS-, NRAS- and BRAF-mutation as well as KRAS mutation variants: pooled analysis of five randomized trials in metastatic colorectal cancer by the AIO colorectal cancer study group. Ann. Oncol. 27, 1746–1753 (2016).
Zhang, B., Yao, K., Zhou, E., Zhang, L. & Cheng, C. Chr20q amplification defines a distinct molecular subtype of microsatellite stable colorectal cancer. Cancer Res. 81, 1977–1987 (2021).
Athey, S. & Wager, S. Policy learning with observational data. Econometrica 89, 133–161 (2021).
Loh, W.-Y. & Zhou, P. The GUIDE approach to subgroup identification. Design and Analysis of Subgroups with Biopharmaceutical Applications (eds Ting, N. et al.) 147–165 (Springer, 2020).
Jemielita, T. O. & Mehrotra, D. V. PRISM: patient response identifiers for stratified medicine. Preprint at https://arxiv.org/abs/1912.03337 (2019).
Dmitrienko, A., Muysers, C., Fritsch, A. & Lipkovich, I. General guidance on exploratory and confirmatory subgroup analysis in late-stage clinical trials. J. Biopharm. Stat. 26, 71–98 (2016).
Takahashi, Y. et al. The AURKA/TPX2 axis drives colon tumorigenesis cooperatively with MYC. Ann. Oncol. 26, 935–942 (2015).
Nygård, S. B. et al. DNA topoisomerase I gene copy number and mRNA expression assessed as predictive biomarkers for adjuvant irinotecan in stage II/III colon cancer. Clin. Cancer Res. 22, 1621–1631 (2016).
Palshof, J. A. et al. Topoisomerase I copy number alterations as biomarker for irinotecan efficacy in metastatic colorectal cancer. BMC Cancer 17, 48 (2017).
Xu, Y. & Her, C. Inhibition of topoisomerase (DNA) I (TOP1): DNA damage repair and anticancer therapy. Biomolecules 5, 1652–1670 (2015).
Mialon, A. et al. DNA topoisomerase I is a cofactor for c-Jun in the regulation of epidermal growth factor receptor expression and cancer cell proliferation. Mol. Cell. Biol. 25, 5040–5051 (2005).
Chen, J., Elfiky, A., Han, M., Chen, C. & Saif, M. W. The role of Src in colon cancer and its therapeutic implications. Clin. Colorectal Cancer 13, 5–13 (2014).
Koh, H. M. et al. Aurora kinase A is a prognostic marker in colorectal adenocarcinoma. J. Pathol. Transl. Med. 51, 32–39 (2017).
Aderka, D., Stintzing, S. & Heinemann, V. Explaining the unexplainable: discrepancies in results from the CALGB/SWOG 80405 and FIRE-3 studies. Lancet Oncol. 20, e274–e283 (2019).
Wang, C., Ouyang, C., Sandhu, J. S., Kahn, M. & Fakih, M. Wild-type APC and prognosis in metastatic colorectal cancer. J. Clin. Oncol. 38, 223–223 (2020).
Easwaran, V. et al. beta-Catenin regulates vascular endothelial growth factor expression in colon cancer. Cancer Res. 63, 3145–3153 (2003).
Meyer, E. L. et al. The evolution of master protocol clinical trial designs: a systematic literature review. Clin. Ther. 42, 1330–1360 (2020).
Stintzing, S. et al. FOLFIRI plus cetuximab versus FOLFIRI plus bevacizumab for metastatic colorectal cancer (FIRE-3): a post-hoc analysis of tumour dynamics in the final RAS wild-type subgroup of this randomised open-label phase 3 trial. Lancet Oncol. 17, 1426–1434 (2016).
Battaglin, F., Naseem, M., Lenz, H.-J. & Salem, M. E. Microsatellite instability in colorectal cancer: overview of its clinical significance and novel perspectives. Clin. Adv. Hematol. Oncol. 16, 735–745 (2018).
Fontana, E., Eason, K., Cervantes, A., Salazar, R. & Sadanandam, A. Context matters-consensus molecular subtypes of colorectal cancer as biomarkers for clinical trials. Ann. Oncol. 30, 520–527 (2019).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B Stat. Methodol. 57, 289–300 (1995).
Babur, Ö. et al. Pattern search in BioPAX models. Bioinformatics 30, 139–140 (2014).
Cerami, E. G. et al. Pathway Commons, a web resource for biological pathway data. Nucleic Acids Res. 39, D685–D690 (2011).
Paz, A. et al. SPIKE: a database of highly curated human signaling pathways. Nucleic Acids Res. 39, D793–D799 (2011).
Fazekas, D., Koltai, M. & Türei, D. SignaLink 2–a signaling pathway resource with multi-layered regulatory networks. BMC Syst. Biol. 7, 7 (2013).
Bouckaert, R. R. & Frank, E. Evaluating the replicability of significance tests for comparing learning algorithms. Advances in Knowledge Discovery and Data Mining, 3–12 (Springer, 2004).
Seibold, H., Zeileis, A. & Hothorn, T. Model4you: an R package for personalised treatment effect estimation. J. Open Res. Softw. 7, 17 (2019).
Cui, Y., Kosorok, M. R., Sverdrup, E., Wager, S. & Zhu, R. Estimating heterogeneous treatment effects with right-censored data via causal survival forests. J. R. Stat. Soc. Series B Stat. Methodol. 85, 179–211 (2023).
Athey, S., Tibshirani, J. & Wager, S. Generalized random forests. Ann. Stat. 47, 1148–1178 (2019).
Sverdrup, E., Kanodia, A., Zhou, Z., Athey, S. & Wager, S. policytree: policy learning via doubly robust empirical welfare maximization over trees. J. Open Source Softw. 5, 2232 (2020).
Huling, J. D. & Yu, M. Subgroup identification using the personalized package. J. Stat. Softw. 98, 1–60 (2018).

Acknowledgements

This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 950293, M.P.M.). The clinical study received industrial funding from Merck KGaA, Darmstadt, Germany and Pfizer GmbH, Germany. The transcriptome-based microarray for gene expression using Xcel® Array received funding from Almac Ltd, Belfast, UK. The FoundationOne® based sequencing analysis (MSI) received funding from Roche Pharma AG, Grenzach, Germany (grant numbers: n/a, V.H., S.S.).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Alexander J. Ohnmacht, Arndt Stahler, Sebastian Stintzing.

Authors and Affiliations

Computational Health Center, Helmholtz Munich, 85764, Neuherberg, Germany
Alexander J. Ohnmacht, Linus Hölzel, Marisa K. Schübel, Ana Galhoz, Ali Farnoud, Minhaz Ud-Dean, Matthias Heinig & Michael P. Menden
Department of Biology, Ludwig-Maximilians University Munich, 82152, Martinsried, Germany
Alexander J. Ohnmacht, Marisa K. Schübel, Ana Galhoz & Michael P. Menden
Charité Universitätsmedizin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Department of Hematology, Oncology, and Cancer Immunology, Charitéplatz 1, 10117, Berlin, Germany
Arndt Stahler, Sebastian Stintzing & Dominik P. Modest
German Cancer Consortium (DKTK), partner sites Berlin and Munich, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
Sebastian Stintzing & Julian W. Holch
Department of Medicine III and Comprehensive Cancer Center Munich, University Hospital, Ludwig-Maximilians University Munich, 81377, Munich, Germany
Julian W. Holch, C. Benedikt Westphalen & Volker Heinemann
Oncological Practice, 84028, Landshut, Germany
Ursula Vehling-Kaiser
Oncological Practice, 88212, Ravensburg, Germany
Thomas Decker
Department of Medicine I and Research Center for Immunotherapy (FZI), Johannes Gutenberg-University Clinic, 55131, Mainz, Germany
Markus Moehler
Department of Biochemistry and Pharmacology, University of Melbourne, Victoria, 3010, Australia
Michael P. Menden

Contributions

Conceptualisation, M.P.M. and V.H.; Data curation, A.S., S.S., D.P.M., U.V., T.D., M.M. and A.J.O.; Formal analysis, A.J.O.; Methodology, A.J.O., A.S. and M.P.M.; Supervision, V.H. and M.P.M.; Visualisation, A.J.O. and L.H.; Writing original draft, A.J.O., A.S., V.H. and M.P.M.; Writing, review and editing, all authors.

Corresponding authors

Correspondence to Volker Heinemann or Michael P. Menden.

Ethics declarations

Competing interests

A.S. served on advisory boards for BMS and Novocure, received honoraria for talks from Roche, Servier and Taiho Pharmaceuticals and received reimbursement for travel from Roche, Merck KGaA, MSD Sharp & Dohme, Pfizer, Lilly Oncology, and Amgen. V.H., S.S. and D.P.M. received honoraria for talks, advisory boards and travel expenses from Merck KGaA, Amgen, Roche, Pfizer, BMS, MSD, AstraZeneca, Novartis, Terumo, Oncosil, Nordic, Seagen, GSK, Takeda, Servier, Pierre Fabre, Taiho, Lilly Oncology, Sanofi and Bayer Pharmaceuticals. M.P.M. is a former employee at AstraZeneca, academically collaborates with AstraZeneca, GSK and Roche, and receives funding from GSK and Roche. J.W.H. served on an advisory board for Roche, has received honoraria from Roche, and travel support from Novartis. M.M. received honoraria for advisory boards or talks from Amgen, BMS, Roche, Merck KGaA, MSD Sharp & Dohme, Lilly Oncology, Servier, Pierre Fabre, Taiho, Sanofi and Bayer Pharmaceuticals and serves as officer for the European Organisation for Research and Treatment of Cancer (EORTC) and Arbeitsgemeinschaft Internistische Onkologie (AIO). C.B.W. has received honoraria from Amgen, Bayer, Chugai, Celgene, GSK, MSD, Merck, Janssen, Ipsen, Roche, Servier, SIRTeX and Taiho; served on advisory boards for Bayer, BMS, Celgene, Servier, Shire/Baxalta, Rafael Pharmaceuticals, RedHill and Roche; and has received travel support from Bayer, Celgene, RedHill, Roche, Servier and Taiho and research grants (institutional) from Roche. C.B.W. serves as an officer for the European Society of Medical Oncology (ESMO), Deutsche Krebshilfe (DKH) and AIO. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Saskia Wilting and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ohnmacht, A.J., Stahler, A., Stintzing, S. et al. The Oncology Biomarker Discovery framework reveals cetuximab and bevacizumab response patterns in metastatic colorectal cancer. Nat Commun 14, 5391 (2023). https://doi.org/10.1038/s41467-023-41011-4

Received: 04 March 2022
Accepted: 17 August 2023
Published: 04 September 2023
DOI: https://doi.org/10.1038/s41467-023-41011-4
