Bayesian interim analysis for prospective randomized studies: reanalysis of the acute myeloid leukemia HOVON 132 clinical trial

van der Maas, Niek G.; Versluis, Jurjen; Nasserinejad, Kazem; van Rosmalen, Joost; Pabst, Thomas; Maertens, Johan; Breems, Dimitri; Manz, Markus; Cloos, Jacqueline; Ossenkoppele, Gert J.; Floisand, Yngvar; Gradowska, Patrycja; Löwenberg, Bob; Huls, Gerwin; Postmus, Douwe; Pignatti, Francesco; Cornelissen, Jan J.

doi:10.1038/s41408-024-01037-3

Download PDF

Article
Open access
Published: 27 March 2024

Bayesian interim analysis for prospective randomized studies: reanalysis of the acute myeloid leukemia HOVON 132 clinical trial

Niek G. van der Maas ORCID: orcid.org/0009-0006-9751-8122¹^na1,
Jurjen Versluis¹^na1,
Kazem Nasserinejad¹,
Joost van Rosmalen^2,3,
Thomas Pabst^4,5,
Johan Maertens ORCID: orcid.org/0000-0003-4257-5980⁶,
Dimitri Breems⁷,
Markus Manz ORCID: orcid.org/0000-0002-4676-7931^5,8,
Jacqueline Cloos⁹,
Gert J. Ossenkoppele⁹,
Yngvar Floisand¹⁰,
Patrycja Gradowska^1,11,
Bob Löwenberg ORCID: orcid.org/0000-0001-8982-5217¹,
Gerwin Huls¹²,
Douwe Postmus¹³,
Francesco Pignatti¹³ &
…
Jan J. Cornelissen^1,13

Blood Cancer Journal volume 14, Article number: 56 (2024) Cite this article

1101 Accesses
7 Altmetric
Metrics details

Subjects

Abstract

Randomized controlled trials (RCTs) are the gold standard to establish the benefit-risk ratio of novel drugs. However, the evaluation of mature results often takes many years. We hypothesized that the addition of Bayesian inference methods at interim analysis time points might accelerate and enforce the knowledge that such trials may generate. In order to test that hypothesis, we retrospectively applied a Bayesian approach to the HOVON 132 trial, in which 800 newly diagnosed AML patients aged 18 to 65 years were randomly assigned to a “7 + 3” induction with or without lenalidomide. Five years after the first patient was recruited, the trial was negative for its primary endpoint with no difference in event-free survival (EFS) between experimental and control groups (hazard ratio [HR] 0.99, p = 0.96) in the final conventional analysis. We retrospectively simulated interim analyses after the inclusion of 150, 300, 450, and 600 patients using a Bayesian methodology to detect early lack of efficacy signals. The HR for EFS comparing the lenalidomide arm with the control treatment arm was 1.21 (95% CI 0.81–1.69), 1.05 (95% CI 0.86–1.30), 1.00 (95% CI 0.84–1.19), and 1.02 (95% CI 0.87–1.19) at interim analysis 1, 2, 3 and 4, respectively. Complete remission rates were lower in the lenalidomide arm, and early deaths more frequent. A Bayesian approach identified that the probability of a clinically relevant benefit for EFS (HR < 0.76, as assumed in the statistical analysis plan) was very low at the first interim analysis (1.2%, 0.6%, 0.4%, and 0.1%, respectively). Similar observations were made for low probabilities of any benefit regarding CR. Therefore, Bayesian analysis significantly adds to conventional methods applied for interim analysis and may thereby accelerate the performance and completion of phase III trials.

Efficacy of front-line immunochemotherapy for follicular lymphoma: a network meta-analysis of randomized controlled trials

Article Open access 05 January 2022

Two decades of targeted therapies in acute myeloid leukemia

Article 15 February 2021

Cancer patient survival can be parametrized to improve trial precision and reveal time-dependent therapeutic effects

Article Open access 15 February 2022

Introduction

Time elapsing between design, completion of patient accrual, and final outcome analysis of prospective randomized clinical trials (RCT) is generally very long, which hampers the rapid approval of drugs for patients with a high or unmet clinical need. To allow for timely access to new therapies, regulatory authorities have permitted drug development strategies other than RCTs, which have increasingly been used in (conditional) approval by the FDA and EMA [1, 2]. Recent FDA evaluation of the Accelerated Approval track highlighted the importance of enhancing quality and efficiency in drug development tracks using prospective comprehensive strategies in order to expedite therapeutic advancements [3].

Prospective phase III RCTs are still pivotal to evaluate the risk-benefit ratio of experimental therapies compared to a well-balanced control group [4,5,6,7]. The expected benefit of the experimental treatment is often based on data from earlier phase II studies. RCTs may include interim analyses that focus on toxicity or efficacy endpoints to prevent excessive harm for patients or to stop a study early because of early evidence of benefit or futility. However, the conventional, frequentist approach commonly employed to evaluate these endpoints in clinical trials might be limited by implicit prior assumptions, the need for long-term follow-up to observe the number of events required for final evaluation, and conservative interim stopping rules.

Bayesian statistical methods have been proposed as a tool that might meet these limitations, for example to estimate the maximum tolerated dose of a drug in early phase studies. It may allow for an adaptive conduct of trials by incorporating prior knowledge from historical patients with similar disease and treatment characteristics [8,9,10]. While Bayesian inference has been well-established in the design of phase I/II studies [11,12,13,14,15,16,17,18,19,20], its use in prospective phase III RCTs has been more limited [21,22,23,24,25,26,27,28].

This study aims to retrospectively evaluate how external data as prior knowledge can be used to analyze primary and secondary trial endpoints, and to challenge the original study assumptions at successive interim analyses of an RCT. Therefore, we reanalyzed a prospective phase III trial from the Haematology-Oncology for Adults in the Netherlands (HOVON) and the Swiss Group for Clinical Cancer Research (SAKK) cooperative groups in patients with acute myeloid leukemia (AML) which did not meet its primary endpoint [29], and used a dynamic borrowing approach with Bayesian inference to reinforce the control treatment arm with external data from a previous AML trial.

Methods

Study design

In this reanalysis of a randomized phase III clinical trial, the HOVON 132 AML/SAKK 30/13 (HO132) study was used [29]. Four interim analyses were simulated in the prospective conduct of the HO132 trial after the inclusion of 150, 300, 450, and 600 patients for an early benefit-risk assessment (Fig. 1). Outcome data from patients enrolled in the control treatment arm of the preceding prospective HOVON 102 AML/ SAKK 30/09 (HO102) trial [30] were used to reinforce the control treatment arm of the HO132.

Data sources

Data from the HO132 and HO102 trials were used in this reanalysis (Table 1). The HO132 is a phase III RCT which included patients aged 18 to 65 years with newly diagnosed AML between 2014 and 2017 [29]. Patients were randomized between two cycles of standard remission induction therapy with or without lenalidomide. After remission, induction therapy, patients in complete remission (CR) or CR with incomplete hematologic recovery (CRi) received post‐remission treatment with either a third cycle of chemotherapy, high‐dose chemotherapy followed by autologous stem cell transplantation (SCT) or an allogeneic SCT, as described previously [29]. The primary endpoint was event free survival (EFS), with a total of 800 patients with 441 events being considered to detect an hazard ratio (HR) of 0.76 with 82% power and at the 5% significance level, corresponding with an increased EFS of 10% at 3 years by lenalidomide. Upon final analysis in 2019, EFS was not significantly different between patients receiving intensive induction with or without lenalidomide (HR 0.99, p = 0.96) [29]. Additionally, the percentage of patients achieving CR or CRi after two cycles of induction chemotherapy was 82% for the experimental arm and 87% for the control arm (odds ratio [OR] 0.71 p = 0.08). Measurable residual disease (MRD) negativity in patients in CR after the second induction cycle were 78 and 77% (OR 0.92, p = 0.73), respectively. Early mortality rates at 2 months after the start of treatment were also not different (7 and 5%, respectively). The incidence and severity of adverse events were comparable between the arms during both the induction and maintenance phases, with no evident variations in the frequencies of adverse events.

Table 1 Patient outcome and characteristics of studies HO132 & HO102 [29, 30].

Full size table

The preceding HO102 trial randomized patients between intensive induction treatment with or without clofarabine for patients with newly diagnosed AML, aged between 18 to 65 years. Patients included in the control treatment arm received induction treatment similar to the patients in the HO132 control treatment arm. Patient accrual occurred between 2010 to 2013 and the primary endpoint was EFS.

Propensity score matching between patients from the control treatment arms from both the HO102 and HO132 was used to aim for similar patient characteristics. Propensity scores were obtained through logistic regression on age and European LeukemiaNET 2017 risk [31]. After the propensity score was calculated for each individual, HO102 controls were matched 1:1 to HO132 controls using the nearest neighbor matching algorithm [32]. A total of 300 patients were used from the HO102 control treatment arm, which were matched with HO132 control patients in order to maximize the number of external control patients. Subsequently, outcome data of the 300 patients from the HO102 control treatment arm were used for the construction of the Bayesian prior (see next paragraph, Fig. S1–5) at each interim analysis, meaning that primary analyses were based on the HO132 trial data with a reinforced control treatment arm (for more details see supplementary methods). The median follow-up time was 7, 10, 12 and 16 months at each simulated interim analysis, respectively.

Bayesian statistical methods

Bayesian inference is a method of statistical inference using Bayes’ theorem to update a probability distribution of a parameter when new information is obtained. Three key concepts need to be considered including (1) the prior distribution (prior), (2) the likelihood, and (3) the posterior probability. The prior is a probability distribution that represents the prior knowledge before seeing any data. The prior can be based on previously observed data or expert opinion. Non-informative priors can be used when no prior data or expert opinion is available. The likelihood is a function that describes the probability density of the newly observed data. The posterior probability is a probability distribution based on the prior distribution combined with the likelihood of newly observed data. The posterior probability represents the updated belief of an event or hypothesis given the available evidence (Fig. S6). Three Markov Monte Carlo chains were run with 50,000 iterations. Chain convergence was evaluated by quantile plots and the Gelman–Rubin diagnostic. See supplementary methods for more details about dynamic borrowing with the commensurate prior.

Posterior probability distributions of the treatment difference (i.e. risk difference) were calculated for efficacy endpoints including (1) EFS, (2) CR rate after two cycles of chemotherapy, (3) proportion of MRD negative patients in CR after cycle 2, and safety endpoints including (4) rate of early mortality within 2 months after start of treatment, and (5) rate of grade 4–5 adverse events. The probability distributions were summarized to provide point estimates of the probability distribution median and 95% Bayesian credible intervals (95% CI) on the treatment difference and HR between both arms. CR, MRD, adverse events above grade III and early death outcomes were assessed with Bayesian beta-binomial models and the probability of any benefit (treatment difference > 0%) of the lenalidomide treatment versus the control treatment arm was estimated. EFS was evaluated using a Bayesian Weibull survival model and the probability distributions of the HR for EFS were used to estimate the probability of HR < 0.76, which was the assumed effect size for EFS in the HO132 study. Less optimistic treatment effects were also studied, including the probability of EFS HR < 0.87, which corresponded to a 5% increase in EFS at 3 years by lenalidomide. Lastly, the probability of EFS HR < 1 was estimated, which corresponds to any benefit in EFS for the lenalidomide treatment arm.

All Bayesian analyses were performed in R version 4.2.2 with the additional software of JAGS, using the package “rjags” [33, 34]. The R script of the analyses can be found online (https://github.com/niekvandermaas/Bayesian_reanalysis_HO132_paper).

Conventional futility methods

In this study, a conventional group sequential design was retrospectively implemented to monitor early efficacy and futility through four interim analyses at the same time points as for the Bayesian approach with EFS as the primary endpoint. This sequential design was based on the original HO132 statistical analysis plan and would require 883 patients with 488 events to detect a HR of 0.76 with 82% power and a one-sided Type I error of 2.5%. The higher number of patients and events in a group sequential design reflects the penalty of four interim efficacy analyses. Efficacy and futility bounds were derived using a Lan-DeMets O’Brien-Fleming approximation spending function, and the analysis was conducted using EAST statistical software [35].

Results

Benefit-risk assessment at interim analyses

Treatment efficacy: EFS

Lenalidomide treatment was compared with the reinforced control treatment at the four defined interim time-points. The HR for EFS was 1.21 (95% CI 0.81 to 1.69), 1.05 (95% CI 0.86 to 1.30), 1.00 (95% CI 0.84 to 1.19), and 1.02 (95% CI 0.87 to 1.19), at interim analyses 1, 2, 3, and 4, respectively (Fig. 2, Table S1). At interim analyses 1 and 2, the probability of being below the anticipated HR of 0.76 was 1.2% and 0.6%, which probability was 0.4% at interim analyses 3 and 0.1% at interim analysis 4 (Fig. 2, Table S1). The probability for a moderate treatment benefit in EFS (HR < 0.87) at interim analysis 1, 2, 3, and 4 was 5.0%, 6.5%, 9.0%, and 4.4%, respectively (Fig. S7). The probability of any benefit (HR < 1.0) for the lenalidomide treatment arm compared with the control treatment arm was moderate at all interim analyses, with a probability for any benefit of 16.4%, 32.9%, 49.0%, and 41.0%, respectively (Fig. S8).

**Fig. 2: Estimation of the probability for the assumed treatment effect (HR < 0.76) by Bayesian analysis of event free survival comparing lenalidomide treatment versus reinforced control treatment.**

While the lack of treatment efficacy for EFS was already identified early after the first interim analysis using Bayesian inference with 150 patients enrolled, a conventional group sequential design showed that the HR of treatment benefit by lenalidomide crossed the futility boundary at the third interim analysis with 450 patients randomized (Fig. 3). The observed HR for EFS comparing the lenalidomide arm with the control treatment arm was 1.12, 0.92, 0.99, and 1.02 at interim analysis 1, 2, 3, and 4 respectively (Fig. 3).

**Fig. 3: Conventional group sequential design with non-binding two-sided hazard ratio bounds for stopping for efficacy or futility at each interim analysis.**

Treatment efficacy: CR and MRD negativity

In the lenalidomide treatment arm, the median percentage of patients obtaining CR was lower compared with the control treatment arm at interim analysis 1 (82% vs 91% comparing the lenalidomide arm vs control treatment arm; treatment difference: −8.9%; 95% CI −19.9 to 1.0, Fig. 4A, Table S1) and the probability of a higher CR rate in the lenalidomide treatment arm compared with the control treatment arm was 3.9% (Fig. 4A, Table S1). At interim analyses 2 to 4 the median CR proportions resulted in treatment differences of −7.8% (79% vs 87%; 95% CI −16.0 to 0.02), −7.0% (80% vs 87%; 95% CI −13.5 to −0.5), and −9.8% (78% vs 88%; 95% CI −15.6 to −4.1) with a probability of a higher CR rate of 2.8%, 1.7%, and 0.0%, respectively (Fig. 4A, Table S1). A low probability of a higher CR rate in the lenalidomide treatment arm suggests no benefit of this treatment compared with the control treatment arm.

**Fig. 4: Comparison of lenalidomide treatment versus reinforced control treatment by Bayesian analysis of CR and early death rate.**

Data on MRD after two induction cycles in patients who obtained a CR was available in 78.1% of patients, which was equally balanced between treatment arms (Table 1). Patients in CR and assigned to the lenalidomide treatment arm were less often MRD negative compared with the control treatment arm at every interim analysis. At interim analysis 1, 73% of patients in the lenalidomide treatment arm were in MRD negative CR, whereas 83% of patients were MRD negative in the control treatment arm (treatment difference: −10.3%; 95% CI −28.5 to 6.8, Fig. S9, Table S1). The probability for a higher MRD negative CR rate for the lenalidomide treatment arm compared with the control treatment arm was 12.2% at the first interim analysis. At interim analyses 2 to 4, the probability of higher MRD negative rates were 10.1%, 6.2%, and 12.0%, respectively (Fig. S9, Table S1). Similar to CR, a low likelihood of a higher MRD rate in the lenalidomide treatment arm indicates that this treatment does not have any advantage compared with the control treatment arm.

Treatment toxicity: early death and adverse events

Death within the first 60 days of treatment was more frequently observed in the experimental treatment arm compared with the control treatment arm at interim analysis 1 (7% vs 2%; treatment difference: 5.1%; 95% CI −1.3 to 12.8, Fig. 4B, Table S1). The treatment differences at interim analyses 2 to 4 were 3.2% (8% vs 5%; 95% CI −2.0 to 9.1), 2.3% (8% vs 6%; 95% CI −2.2 to 7.2), and 2.0% (8% vs 6%; 95% CI −1.9 to 6.1), respectively. Grade 4–5 adverse events were not different between both arms in any of the interim analyses (Fig. S10, Table S1).

Interim analyses without external data

The impact of the external data (HO102 trial) on the primary outcome analysis of EFS for lenalidomide vs control treatment was determined by performing a Bayesian analysis without external data (Table S2). The HR was 1.09 (95% CI 0.72 to 1.65), 0.95 (95% CI 0.71 to 1.28), 1.00 (95% CI 0.78 to 1.27), and 1.05 (95% CI 0.85 to 1.29) at interim analyses 1 to 4, respectively (Table S2), which corresponds to a similarly low probability of the assumed benefit (HR < 0.76) for EFS of 4.7%, 7.0%, 1.6%, and 0.2%, respectively (Table S2). This suggests that the external data had a relatively limited impact on the Bayesian analysis at interim time points. The probability of a moderate treatment effect (HR < 0.87) for EFS in favor of the lenalidomide treatment arm was 14.8%, 27.5%, 14.5%, and 4.4%, respectively and for any benefit by lenalidomide for EFS was 34.7%, 62.7%, 52.6%, and 33.6%.

Discussion

A prospective RCT is the preferred type of trial to evaluate the benefits and risks of new therapies [4,5,6,7]. Historical reports have highlighted that 71% of RCTs in hemato-oncology resulted in non-significant outcomes or negative findings. This observation might be linked to unrealistically high expectations regarding the treatment effect size [36]. Currently, innovative approaches are being developed that may accelerate and enhance the knowledge arising from prospective studies, including phase II and phase III studies. Bayesian statistical methods have been applied in phase I/II studies but may also be applied in phase III RCTs. Here, we retrospectively simulated four interim analyses within a recent phase III RCT, randomizing patients for lenalidomide in AML [29]. We evaluated outcome parameters during patient accrual using a Bayesian approach and compared it to conventional frequentist statistical methods. At all four Bayesian interim analyses, the likelihood of the expected benefit (HR < 0.76) was very low. In contrast, the frequentist group sequential design declared futility in the third interim analysis. Additionally, our Bayesian analysis of efficacy endpoints, including CR and MRD negative CR, at four interim time points showed a low probability of benefit (<15%) by lenalidomide. Risk assessment showed excess mortality in patients randomized to the lenalidomide arm. Our data indicate that interim analyses in phase III clinical trial using Bayesian inference addressing both the benefits and risks of an experimental drug proved to be highly informative.

Randomized phase III clinical trials are generally designed, taking into account results from earlier phase I/II studies. By virtue of randomization, the experimental treatment can be evaluated to a concurrent control population with little or minimal bias. Bayesian statistics additionally allow for the use of external data in the context of a RCT, next to the randomized control population. It may be done so upon trial completion, but also during patient accrual at specific interim analysis time points. It might allow for a reinforced control treatment arm with an informative prior [8]. External and current data need to be carefully matched for risk factors and eligibility criteria to avoid selection bias as much as possible. Here, a previous trial conducted by HOVON-SAKK was used, that included patients with similar inclusion criteria and control treatment. It enabled a rapid and complete matching procedure by a dynamic borrowing approach, which improved the precision of the posterior probabilities for EFS. In addition, it might be recommended to consider the results with and without added prior knowledge in order to investigate the impact of the external data. Here, a low probability of obtaining an HR < 0.76 for EFS was also observed without the external control treatment data, suggesting that these data did not essentially change the conclusion of the Bayesian analysis at interim time points. The external data added relatively limited value because the lenalidomide treatment arm performed worse than the control treatment arm for multiple efficacy endpoints, including EFS, CR and MRD negative CR, already from an early time point. Although the conclusion of the interim analyses were not different using external data, reinforcing a control arm increases statistical power, which consequently may potentially lead to a reduction in sample size [37]. Furthermore, Bayesian analysis at interim time points may provide broader-based recommendations to an independent safety and monitoring board (DSMB) of an ongoing study, which should preferably remain blinded to the trial team. If an alarming signal at multiple interim analyses time points arises, that may impact the advice to the principal investigator and trial team. However, to implement a Bayesian sequential design in the future, it has been recommended by regulatory authorities to evaluate the operating characteristics such as power and type I error rate, for establishing a futility threshold [38].

A Bayesian approach to interim analysis might have several limitations. First, Bayesian approaches or adaptive designs cannot control for selection bias and residual confounding, highlighting the importance of a control arm in a randomized setting. Second, incorporating overly positive (or negative) prior information may introduce bias impacting the posterior distribution [39]. Third, we assumed that clinical data from the HO102 trial were comparable to the HO132 trial. While there were no large differences in baseline characteristics between the two studies, EFS was significantly different without matching. After matching, EFS at interim analysis was similar illustrating that without addressing changes in the underlying population by e.g. matching methods, external data may introduce bias.

RCTs are often criticized for their limited generalizability due to the selection of patients, such as the exclusion of patients with older age and comorbidities [40, 41]. Real-world data (RWD) contain patient health and healthcare data of patients, predominantly outside the context of clinical trials [42]. Thereby, these data have the potential to provide insights into the benefits and risks of therapeutic interventions in a more generalizable patient population. Similar as done in this study with external data, RWD may be applied in the context of prospective phase III studies, in order to reinforce the control population. Nevertheless, an ongoing challenge is to approximate the quality of RWD and trial data as much as possible [43].

In conclusion, simulations of four interim analyses in the HO132 study showed that the assumed benefit of lenalidomide was unlikely to be achieved, which was already observed after the first interim analysis using Bayesian inference with external data as an informative prior, whereas a conventional evaluation would have a futility conclusion only after the third interim analysis. These results augment conventional futility analyses, highlighting the potential of Bayesian statistical methods to provide earlier and highly informative insights into trial outcomes at interim time points. This methodology might be considered to expedite clinical trial adaptation and enhance efficiency in drug development. External data, such as historical trial data or RWD, may be used to increase the precision of the control treatment arm of clinical trials, but caution must be taken to ensure data comparability and minimize bias.

Data availability

The data that support the findings of this study are available from HOVON but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are, however, available from the authors upon reasonable request and with permission of HOVON.

References

Goring S, Taylor A, Müller K, Li TJJ, Korol EE, Levy AR, et al. Characteristics of non-randomised studies using comparisons with external controls submitted for regulatory approval in the USA and Europe: a systematic review. BMJ Open. 2019;9:e024895.
Article PubMed PubMed Central Google Scholar
Hatswell AJ, Baio G, Berlin JA, Irs A, Freemantle N. Regulatory approval of pharmaceuticals without a randomised controlled study: analysis of EMA and FDA approvals 1999-2014. BMJ Open. 2016;6:e011666.
Article PubMed PubMed Central Google Scholar
Fashoyin-Aje LA, Mehta GU, Beaver JA, Pazdur R. The on- and off-ramps of oncology accelerated approval. N Engl J Med. 2022;387:1439–42.
Article PubMed Google Scholar
Grimes DA, Schulz KF. An overview of clinical research: the lay of the land. Lancet. 2002;359:57–61.
Article PubMed Google Scholar
Moher D, Hopewell S, Schulz KF, Montori V, Gøtzsche PC, Devereaux PJ, et al. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. BMJ. 2010;340:c869.
Article PubMed PubMed Central Google Scholar
Pignatti F, Wilking U, Wilking N, Delgado J, Bergh J. The value of anticancer drugs—a regulatory view. Nat Rev Clin Oncol. 2021;19:207–15.
Article PubMed Google Scholar
Marini BL, Goodman AM, Perissinotti AJ. The essential role of randomised controlled trials. Lancet Haematol. 2023;10:e486–e487.
Article CAS PubMed Google Scholar
van Rosmalen J, Dejardin D, van Norden Y, Löwenberg B, Lesaffre E. Including historical data in the analysis of clinical trials: Is it worth the effort? Stat Methods Med Res. 2018;27:3167–82.
Article PubMed Google Scholar
Hobbs BP, Sargent DJ, Carlin BP. Commensurate priors for incorporating historical information in clinical trials using general and generalized linear models. Bayesian Anal. 2012;7:639–74.
Article PubMed PubMed Central Google Scholar
Hobbs BP, Carlin BP, Mandrekar SJ, Sargent DJ. Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials. Biometrics. 2011;67:1047–56.
Article PubMed PubMed Central Google Scholar
Cheung YK, Chappell R. Sequential designs for phase I clinical trials with late-onset toxicities. Biometrics. 2000;56:1177–82.
Article CAS PubMed Google Scholar
Babb JS, Rogatko A. Patient specific dosing in a cancer phase I clinical trial. Stat Med. 2001;20:2079–90.
Article CAS PubMed Google Scholar
Ji Y, Liu P, Li Y, Bekele BN. A modified toxicity probability interval method for dose-finding trials. Clin Trials. 2010;7:653–63.
Article PubMed PubMed Central Google Scholar
Yuan Y, Hess KR, Hilsenbeck SG, Gilbert MR. Bayesian optimal interval design: a simple and well-performing design for phase I oncology trials. Clin Cancer Res. 2016;22:4291–301.
Article PubMed PubMed Central Google Scholar
Angus DC, Berry S, Lewis RJ, Al-Beidh F, Arabi Y, van Bentum-Puijk W, et al. The REMAP-CAP (randomized embedded multifactorial adaptive platform for community-acquired pneumonia) study. rationale and design. Ann Am Thorac Soc. 2020;17:879–91.
Article PubMed PubMed Central Google Scholar
Schuetze SM, Wathen JK, Lucas DR, Choy E, Samuels BL, Staddon AP, et al. SARC009: Phase 2 study of dasatinib in patients with previously treated, high-grade, advanced sarcoma. Cancer. 2016;122:868–74.
Article CAS PubMed Google Scholar
Hirakawa A, Nishikawa T, Yonemori K, Shibata T, Nakamura K, Ando M, et al. Utility of Bayesian single-arm design in new drug application for rare cancers in Japan: a case study of phase 2 trial for sarcoma. Ther Innov Regul Sci. 2018;52:334–8.
Article PubMed Google Scholar
Kim ES, Herbst RS, Wistuba II, Lee JJ, Blumenschein GR Jr, Tsao A, et al. The BATTLE trial: personalizing therapy for lung cancer. Cancer Discov. 2011;1:44–53.
Article CAS PubMed PubMed Central Google Scholar
Papadimitrakopoulou V, Lee JJ, Wistuba II, Tsao AS, Fossella FV, Kalhor N, et al. The BATTLE-2 study: a biomarker-integrated targeted therapy study in previously treated patients with advanced non-small-cell lung cancer. J Clin Oncol. 2016;34:3638–47.
Article CAS PubMed PubMed Central Google Scholar
Berry DA, Dhadda S, Kanekiyo M, Li D, Swanson CJ, Irizarry M, et al. Lecanemab for patients with early Alzheimer disease: bayesian analysis of a phase 2b dose-finding randomized clinical trial. JAMA Netw Open. 2023;6:e237230.
Article PubMed PubMed Central Google Scholar
Broglio K, Meurer WJ, Durkalski V, Pauls Q, Connor J, Berry D, et al. Comparison of Bayesian vs frequentist adaptive trial design in the stroke hyperglycemia insulin network effort trial. JAMA Netw Open. 2022;5:e2211616.
Article PubMed PubMed Central Google Scholar
Muss HB, Berry DA, Cirrincione CT, Theodoulou M, Mauer AM, Kornblith AB, et al. Adjuvant chemotherapy in older women with early-stage breast cancer. N Engl J Med. 2009;360:2055–65.
Article CAS PubMed PubMed Central Google Scholar
Reis G, Silva EASM, Silva DCM, Thabane L, Milagres AC, Ferreira TS, et al. Effect of early treatment with ivermectin among patients with covid-19. N Engl J Med. 2022;386:1721–31.
Article CAS PubMed Google Scholar
Takahashi T, Yamanaka T, Seto T, Harada H, Nokihara H, Saka H, et al. Prophylactic cranial irradiation versus observation in patients with extensive-disease small-cell lung cancer: a multicentre, randomised, open-label, phase 3 trial. Lancet Oncol. 2017;18:663–71.
Article PubMed Google Scholar
Nogueira RG, Jadhav AP, Haussen DC, Bonafe A, Budzik RF, Bhuva P, et al. Thrombectomy 6 to 24 h after Stroke with a mismatch between deficit and infarct. N Engl J Med. 2018;378:11–21.
Article PubMed Google Scholar
Reardon MJ, Van Mieghem NM, Popma JJ, Kleiman NS, Søndergaard L, Mumtaz M, et al. Surgical or transcatheter aortic-valve replacement in intermediate-risk patients. N Engl J Med. 2017;376:1321–31.
Article PubMed Google Scholar
Shah PL, Slebos D-J, Cardoso PFG, Cetti E, Voelker K, Levine B, et al. Bronchoscopic lung-volume reduction with Exhale airway stents for emphysema (EASE trial): randomised, sham-controlled, multicentre trial. Lancet. 2011;378:997–1005.
Article CAS PubMed Google Scholar
Ferreira D, Vivot A, Diemunsch P, Meyer N. Bayesian analysis from phase III trials was underused and poorly reported: a systematic review. J Clin Epidemiol. 2020;123:107–13.
Article PubMed Google Scholar
Löwenberg B, Pabst T, Maertens J, Gradowska P, Biemond BJ, Spertini O, et al. Addition of lenalidomide to intensive treatment in younger and middle-aged adults with newly diagnosed AML: the HOVON-SAKK-132 trial. Blood Adv. 2021;5:1110–21.
Article PubMed PubMed Central Google Scholar
Löwenberg B, Pabst T, Maertens J, van Norden Y, Biemond BJ, Schouten HC, et al. Therapeutic value of clofarabine in younger and middle-aged (18-65 years) adults with newly diagnosed AML. Blood. 2017;129:1636–45.
Article PubMed Google Scholar
Döhner H, Estey E, Grimwade D, Amadori S, Appelbaum FR, Büchner T, et al. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood. 2017;129:424–47.
Article PubMed PubMed Central Google Scholar
Austin PC. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar Behav Res. 2011;46:399–424.
Article Google Scholar
Plummer M rjags: Bayesian Graphical Models using MCMC R package version 4-12 (2021). https://CRAN.R-project.org/package=rjags.
R Core Team. R: A language and environment for statistical computing (2023). R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
East 6. Statistical software for the design, simulation and monitoring clinical trials. Cambridge MA: Cytel Inc.; 2020.
Google Scholar
Kumar A, Soares H, Djulbegovic B. Are statistically non-significant findings necessarily negative? a review of all phase III randomized controlled trials in hematology conducted by NCI sponsored cooperative groups. Blood. 2005;106:293.
Article Google Scholar
Qi H, Rizopoulos D, van Rosmalen J. Sample size calculation for clinical trials analyzed with the meta-analytic-predictive approach. Res Synth Methods. 2023;14:396–413.
Article PubMed Google Scholar
Center for Drug Evaluation, Research. Adaptive Design Clinical Trials for Drugs and Biologics Guidance for Industry. U.S. Food and Drug Administration. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adaptive-design-clinical-trials-drugs-and-biologics-guidance-industry (Accessed 20 Dec 2022).
Muehlemann N, Zhou T, Mukherjee R, Hossain MI, Roychoudhury S, Russek-Cohen E. A tutorial on modern Bayesian methods in clinical trials. Ther Innov Regul Sci. 2023;57:402–16.
Article PubMed PubMed Central Google Scholar
Lewis JH, Kilgore ML, Goldman DP, Trimble EL, Kaplan R, Montello MJ, et al. Participation of patients 65 years of age or older in cancer clinical trials. J Clin Oncol. 2003;21:1383–9.
Article PubMed Google Scholar
Ruiter R, Burggraaf J, Rissmann R. Under-representation of elderly in clinical trials: an analysis of the initial approval documents in the Food and Drug Administration database. Br J Clin Pharm. 2019;85:838–44.
Article Google Scholar
Makady A, de Boer A, Hillege H, Klungel O, Goettsch W. What is real-world data? a review of definitions based on literature and stakeholder interviews. Value Health. 2017;20:858–65.
Article PubMed Google Scholar
Hermans S, van Norden Y, van Werkhoven E, Dinmohamed A, Huijgens P, Ossenkoppele G, et al. Real-world data as supplementary controls for the prospective randomized hovon-103 trial in intensively treated elderly acute myeloid leukemia patients. Hemasphere. 2023;7:e323641c.

Download references

Acknowledgements

The views expressed in this article are the personal views of the authors and may not be understood or quoted as being made on behalf of or reflecting the position of the agencies and organizations with which the authors are employed or affiliated.

Funding

This research work received support from the Erasmus Medical Center Foundation-Daniel den Hoed (N.v.d.M. and J.V.).

Author information

These authors contributed equally: Niek G. van der Maas, Jurjen Versluis.

Authors and Affiliations

Department of Hematology, Erasmus Medical Center Cancer Institute, Erasmus University Medical Center, Rotterdam, the Netherlands
Niek G. van der Maas, Jurjen Versluis, Kazem Nasserinejad, Patrycja Gradowska, Bob Löwenberg & Jan J. Cornelissen
Department of Biostatistics, Erasmus MC, Rotterdam, the Netherlands
Joost van Rosmalen
Department of Epidemiology, Erasmus MC, Rotterdam, the Netherlands
Joost van Rosmalen
University Hospital, Inselspital, Bern, Switzerland
Thomas Pabst
Swiss Group for Clinical Cancer Research (SAKK), Bern, Switzerland
Thomas Pabst & Markus Manz
University Hospital Gasthuisberg, Leuven, Belgium
Johan Maertens
Ziekenhuis Netwerk Antwerpen, Antwerp, Belgium
Dimitri Breems
University Hospital Zurich, Zurich, Switzerland
Markus Manz
Amsterdam UMC, location VUMC, Cancer Center Amsterdam, Amsterdam, the Netherlands
Jacqueline Cloos & Gert J. Ossenkoppele
Oslo University Hospital, Oslo, Norway
Yngvar Floisand
HOVON Foundation, Rotterdam, the Netherlands
Patrycja Gradowska
University Medical Center, University Groningen, Groningen, the Netherlands
Gerwin Huls
Oncology and Hematology Office, European Medicines Agency, Amsterdam, the Netherlands
Douwe Postmus, Francesco Pignatti & Jan J. Cornelissen

Authors

Niek G. van der Maas
View author publications
You can also search for this author in PubMed Google Scholar
Jurjen Versluis
View author publications
You can also search for this author in PubMed Google Scholar
Kazem Nasserinejad
View author publications
You can also search for this author in PubMed Google Scholar
Joost van Rosmalen
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Pabst
View author publications
You can also search for this author in PubMed Google Scholar
Johan Maertens
View author publications
You can also search for this author in PubMed Google Scholar
Dimitri Breems
View author publications
You can also search for this author in PubMed Google Scholar
Markus Manz
View author publications
You can also search for this author in PubMed Google Scholar
Jacqueline Cloos
View author publications
You can also search for this author in PubMed Google Scholar
Gert J. Ossenkoppele
View author publications
You can also search for this author in PubMed Google Scholar
Yngvar Floisand
View author publications
You can also search for this author in PubMed Google Scholar
Patrycja Gradowska
View author publications
You can also search for this author in PubMed Google Scholar
Bob Löwenberg
View author publications
You can also search for this author in PubMed Google Scholar
Gerwin Huls
View author publications
You can also search for this author in PubMed Google Scholar
Douwe Postmus
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Pignatti
View author publications
You can also search for this author in PubMed Google Scholar
Jan J. Cornelissen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.v.d.M., J.V., F.P., J.C. contributed to the study design. N.v.d.M., K.N., J.v.R. participated in data analysis. K.N. and J.V. have accessed and verified the data. N.v.d.M., J.V., K.N., J.v.R., D.P., F.P. and J.C. contributed to data interpretation, revising the manuscript, and approving this manuscript for submission. All other authors provided study materials or enrolled patients were involved in the collection and assembly of clinical data, revised and approved the manuscript for submission.

Corresponding author

Correspondence to Jan J. Cornelissen.

Ethics declarations

Competing interests

J.C. (institute) received royalties from Navigate and BD Biosciences and received research funding from Takeda, DC-one, Genentech, Janssen, Novartis and Merus.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

van der Maas, N.G., Versluis, J., Nasserinejad, K. et al. Bayesian interim analysis for prospective randomized studies: reanalysis of the acute myeloid leukemia HOVON 132 clinical trial. Blood Cancer J. 14, 56 (2024). https://doi.org/10.1038/s41408-024-01037-3

Download citation

Received: 23 January 2024
Revised: 06 March 2024
Accepted: 08 March 2024
Published: 27 March 2024
DOI: https://doi.org/10.1038/s41408-024-01037-3