Introduction

A proportion of patients (10–40%) with acute myeloid leukemia (AML) fails to achieve a complete remission (CR) after two courses of intensive chemotherapy (IC) and 50-70% of those achieving CR will eventually relapse [1]. While their management is still controversial, the prognosis is quite evenly dismal, independently of the treatment approach, including the option of allogeneic hematopoietic stem cell transplantation (allo-HSCT), which however maintains a beneficial role [2, 3]. Similarly, the fate of patients with refractory or relapsed (R/R) acute lymphoblastic leukemia (ALL) disease still have a poor prognosis in term of overall survival [4] although the approved immunotherapeutic agents lead to very high rate of remission, even at the molecular level [5, 6] Allo-HSCT performs best in first complete remission (CR1) while in case of active disease shows all the limitations mainly due to the high cumulative incidence of relapse CIR) and non-relapse mortality (NRM) [3, 7].

However, different clinical outcomes have been reported in this setting, largely due to the heterogeneity of criteria adopted to define refractoriness and diverseness of treatment. The old ELN criteria [8] defined resistant leukemia after induction as persistence of blasts in patients alive after 7 days from treatment start or later while several studies defined refractory disease as persistence of leukemic blasts after two courses of intensive chemotherapy [2, 9].

Diverseness of treatment includes at least the heterogeneity of conditioning regimen and the choice of the transplant time strategy (upfront transplant versus attempts to induce a pre-transplant CR).

For all these reasons, in 2013 a prospective multicenter study, named GANDALF-01 (GITMO Against Non-responding anD Acute Leukemia Failures for transplant R/R leukemias), including an homogeneous myeloablative dual alkylator-conditioning regimen in patients lacking an HLA identical sibling donor was launched. Due to the poor prognosis of allo-HSCT with active disease in acute leukemias the Italian Transplant Programs were not allowed to proceed to an allotransplant from an unrelated donor (URD) in this setting of patients until the study run, which definitively changed that policy. The antileukemic strategy of the study was based on the need of a homogeneous truly myeloblative conditioning (MAC), being easily adopted by all the Italian centers. The choice of a uniform preparative regimen was justified by the need of limiting the diverseness of treatment within a prospective trial, even across different diseases (AML and ALL). The combination of thiotepa, busulfan, and fludarabine (TBF), initially designed for cord blood transplantation of patients with active disease [10, 11] and then widely used in Italy in the setting of haploidentical transplants [12,13,14,15], foresees the combination of two drugs, busulfan and thiotepa, with an optimal penetration into central nervous system, mimicking the radiation therapy. The exclusion of TBI, which is associated with lower relapse, at least in young adult ALL [16, 17] but not always to an increased survival in other settings [18, 19] was due to the limited availability of TBI in a population of patients needing an urgent transplant.

Materials and methods

Study design and inclusion criteria

A phase II multicenter (supplementary table 1) open-label study on allo-HSCT from URD, CB and family haploidentical donors in patients with active acute leukemia was promoted by GITMO in 2013 with the objective of increasing OS. The primary endpoint was the OS at 2 years of all patients enrolled into the study (either transplanted or not); secondary endpoints were the cumulative incidence of NRM, leukemia relapse (CIR), acute and chronic GVHD, primary engraftment, progression-free survival (PFS). Before the study run in Italy it was allowed to activate an URD search but not to proceed to transplant for patients suffering from acute leukemia with active disease. Accordingly, the patients activating an URD search at diagnosis and then becoming resistant/refractory could not proceed to transplant with active disease as well as those patients for whom an unrelated search was not yet activated early and then became either resistant or refractory. With the approval of the study instead both the former and the latter patients could finally proceed to potentially benefit from transplant option.

Inclusion criteria were adult patients (18–70 years) diagnosed of acute leukemia, with either a chemo-resistant relapse (relapsed patients) or primary induction failure (refractory patients), for whom an URD search (adult and cord blood search) at the Italian Bone Marrow Donor Registry (IBMDR) was activated, with Performance Status ECOG ≤ 3 and a life expectancy not severely limited by concomitant illness. The definition of refractoriness was defined as a failure to achieve a CR after at least one cycle of intensive chemotherapy.

Exclusion criteria were previous allo-HSCT, availability of a matched related donor and any active uncontrolled infection.

The study foresaw a homogenous approach for conditioning regimen while the GVHD prevention strategy was stratified according to donor type. The final choice of the donor was left to the center policy. Anti-infectious strategy was left to the center guidelines and a sub-study with micafungin was proposed to the participating centers as primary fungal prophylaxis (not reported here).

The study was conducted according to the Helsinki declaration, after approval of ethical committees of each participating center. All patients signed a written consent. The study was registered at NCT as NCT01814488 and EUDRACT as 2012-004008-37.

Transplant procedures

All patients enrolled were patients for whom an URD search at the IBMDR was activated at any time, either at diagnosis of acute leukemia or after diagnosis of refractoriness/chemo-resistant relapse. For the patients activating the URD search at diagnosis and then becoming resistant/refractory, the study entry was immediately possible while for those ones not activating the URD search at diagnosis for any reasons and therefore after becoming resistant/refractory, the study entry was possible only after HLA typing and unrelated search activation.

The criteria for unrelated, CB and haploidentical donor search are reported on supplementary Table 2. A MAC regimen including thiotepa (10 mg/kg, total dose), busulfan (9.6 mg/kg), and fludarabine (150 mg/mq) (TBF) was given to all patients, while GVHD prophylaxis was stratified according to donor type: either anti-thymocyte globulin (ATG, 6 mg/kg) or anti-T lymphocyte globulin (ATLG, 30 mg/kg) was added to cyclosporin with either short-term methotrexate (URD) or mycophenolate mofetil (CB transplant); GVHD was instead prevented by post-transplant cyclophosphamide (PT-Cy), tacrolimus and mycophenolate mofetil [20] or ATLG (20 mg/kg), cyclosporin, short-term methotrexate, mycophenolate mofetil, and basiliximab [12] in haploidentical transplants, according to the center policy.

Sample size calculation

Criteria for sample size assessment did not refer to a formal statistical power calculation because of a lack of detailed and focused data into this setting and the absence of alternative curative options. Therefore, by means of this study, GITMO promoted, for the first time in Italy, a study allowing patients with active leukemia to proceed to an allo-HSCT with the aim to collect outcome variables in the widest and most representative cohort of this specific patient population, independently of the type of donor (haploidentical, URD, CB) finally selected. The choice of 80 patients transplanted was based on feasibility reasons following the GITMO estimation on Italian transplant activity pointing to an estimated accrual of 40 patients per year over a 24-month enrollment time.

Statistical analysis

All patients enrolled in the study were included in the intention to transplant (ITT) analysis. This population was called overall population. Specific analyses were done only on the transplant cohort.

Survival outcomes included in the analysis were OS, PFS, NRM, and cumulative incidence of relapse (CIR).

In the overall population, OS and PFS were calculated using as starting times both the time of study recruitment and the time of diagnosis of resistance/refractoriness, which guided the ITT (in the supplementary material as Supplementary Figs. 1 and 2)

In the transplant cohort instead OS was calculated from transplant to death. PFS, NRM, and CIR were calculated from transplant to relapse or death, whichever occurred first.

OS was estimated using the Kaplan–Meier method and the log-rank test was applied to test differences between groups. Follow-up was updated on April 2021. NRM and CIR were estimated using cumulative incidence function, considering relapse and death as a competing event, respectively, and the Gray’s non-parametric test was used to assess group differences. Univariate analysis was performed by fitting Cox models for OS and Fine and Gray models for NRM and CIR. Hazard ratio with 95% confidence intervals were reported. Transplant in the overall population and acute and chronic GvHD in transplant cohort were all considered as time-dependent factors and their unadjusted effect on survival outcomes was tested performing Cox models with time-dependent covariates. Transplant effect was graphically illustrated by Simon–Makuch plot. All reported p values were two-sided. All the analyses were performed with R software (version 4.0).

Results

One hundred and one patients (Fig. 1) activated the URD search at the IBMDR and were enrolled (supplementary table 1). For 90 of them a suitable donor was identified; 87 of the 90 ultimately underwent allo-HSCT with a median time interval from donor search activation to transplant of 106 days (interquartile range: 56–155 days). The median time from diagnosis of refractoriness/resistant relapse to transplant was 91 days (interquartile range: 49–156) while the median time from enrollment to transplant was only 18 days (interquartile range: 9–43); this short delay is justified by the new chance offered by GANDALF trial to proceed to transplant in patients with a donor already found and waiting for the approval of the study to proceed to transplant, that could not be performed before the study approval. In the transplant population, 60.3% of the patients had received ≥2 courses of intensive chemotherapy. Detailed clinical characteristics of the study population were reported in Table 1.

Fig. 1: Consort diagram.
figure 1

IBMDR Italian Bone Marrow Donor Registry, HSCT allogeneic hematopoietic stem cell transplantation, CR complete remission.

Table 1 Characteristics of the patients.

The median times to neutrophil (neutrophils greater than 0.5 × 109/L) and platelet (platelets greater than 20 ×109/L) engraftment were 20 and 25 days while the cumulative incidence was 78% and 71%, respectively, at 100 days.

Overall, with a median follow up of 9 months (range 0.7–93) for the patients enrolled into the study and 74 months (range: 11–93) for the living patients, 89 died: 59 due to disease recurrence while 30 deaths were related to transplant. Among the 87 patients who were transplanted, 76 patients died, 46 because of disease recurrence and 30 died due to NRM. Among these latter, 11 patents died because of infection (10/11 bacterial, 1/11 fungal infection), 5 because of GVHD, 11 of multi-organ failure, 2 after an acute respiratory distress, and one of hyperacute hemolytic event.

The OS of the overall population was 38% and 19% at 1 and 2 years, respectively (Fig. 2a); the OS in the transplant vs no transplant cohort is reported in Fig. 2b; in the transplant cohort OS was 40% and 22% at 1 and 2 years (Fig. 3a), and no significant differences were found according to disease (Fig. 3b), donor type (Fig. 3c) and disease history (refractory vs relapsed) (Fig. 3d) and to CIBMTR score (Fig. 3e). In the refractory cohort, OS shows higher estimates for patients being transplanted after failing only one cycle of chemotherapy, without reaching however the statistical significance (Fig. 3f).

Fig. 2: Overall survival and progression-free survival of the entire study population.
figure 2

Overall survival of the overall population (a); Simon–Makuch survival curves comparing the transplant vs not transplant cohorts (b). Progression-free survival of the overall population (c); Simon-Makuch progression-free survival curves comparing the transplant vs not transplant cohorts (d). OS overall survival, PFS progression-free survival.

Fig. 3: Overall survival and progression-free survival of the transplant cohort.
figure 3

Overall survival of the transplant cohort (a); overall survival according to disease: ALL vs AML (b), to donor type (c), to disease history: relapsed vs refractory (d), and to cIBMTR score (e); overall survival, in the refractory cohort, according to the number of pre-transplant cycles of chemotherapy (f). OS overall survival, ALL acute lymphoblastic leukemia, AML acute Myeloid Leukemia, cIBMTR center of International Bone Marrow Transplantation Registry.

The PFS at two years was 19% and 17% for the overall population and for the transplant cohort (Fig. 2c, d).

At 1 and 2 years after transplant, the CIR was 38% and 49% while the NRM was 30% and 33%, respectively (Fig. 4). OS and NRM were univariately associated with Sorror comorbidity index (Table 2) while CIR was slightly lower in refractory compared to relapsed patients (Table 2 and Supplementary Fig 3). Characteristics of long-term survivors are shown in Table 3.

Fig. 4: Cumulative incidence of relapse and non-relapse mortality of the transplant cohort.
figure 4

A The cumulative incidence of relapse in the transplant cohort; B the non-relapse mortality of the transplant cohort. CIR cumulative incidence of relapse, NRM non-relapse mortality.

Table 2 Univariate analysis on transplanted patients.
Table 3 Characteristics of long-term survivors.

Thirty-seven patients experienced acute GVHD (14 grade I, 13 grade II, 6 grade III, 4 grade IV) while only 10 developed chronic GVHD (4 mild, 4 moderate, 2 severe).

Discussion

The present study shows the long-term clinical outcome of patients with R/R acute leukemias treated with a myeloablative transplant from alternative donors in an intention-to-transplant trial.

The choice of a MAC based on a double alkylating agents schedule, in our prospective study did not improve significantly the leukemia control and the NRM proved remarkably high. In a retrospective Registry study, focused on sibling transplants [21], TBF regimen was associated with 2-yr NRM of 27% versus 16% in the busulfan-fludarabine cohort and lower risk of disease recurrence.

The high NRM in this study (33% at two years) can be attributed to the highly intensive conditioning regimen using thiotepa at the dose of 10 mg/kg and busulfan at 9.6 mg/mg (T2B3F) in a heavily pretreated patients, whose median age at transplant was 54 years. Rodriguez-Arbolì et al. [22]. showed lower NRM after TBF regimen in a patient population (with almost 50% patients being in CR) with an age distribution similar to that reported in the present study. However, lower doses of either busulfan or thiotepa were given to a significant proportion of patients [22]. Reduction of doses were also reported in the haplo-HSCT setting [23], where the reduction of thiotepa dose significantly spared toxicity without loosing the disease control.

The sequential approach [24] was designed for R/R acute leukemias with the aim to guarantee an high antileukemic activity with chemo/radiotherapy soon before conditioning, an immunosuppressive effect allowing engraftment after a RIC, which can limit NRM, and finally preemptive donor lymphocyte infusions to elicit early post-transplant graft-versus-malignancy effect. No robust data are still available to investigate if a MAC approach is better than a sequential-RIC regimen in this setting. A retrospective registry analysis [22] compared 1018 patients younger than 50 years transplanted with active disease after either a MAC regimen (TBI or busulfan) or FLAMSA-RIC (with TBI or with chemotherapy) showing better OS and lower NRM for FLAMSA-chemotherapy. On the contrary, a multicenter retrospective analysis reported by Decroocq [25] and the analysis conducted by the Acute Leukemia Working Party of the EBMT [26] did not find any differences in the transplant outcome between myeloablative conditioning or treosulfan-fludarabine regimen with the sequential FLAMSA-RIC approach.

Several attempts to modify the original FLAMSA-RIC schedule have been proposed to increase the efficacy of transplant in high-risk AML using either clofarabine, busulfan, treosulfan or melphalan [27,28,29] but not definitive data are available at the moment to establish the best combination. The only available prospective controlled study [30] reported the absence of survival advantage after intensification of FLAMSA-busulfan regimen in high-risk AML and myelodysplastic syndromes.

We have to acknowledge several major limitations of this study including the non-randomized design and the sample size which can limit the likelihood to capture the effect of already validated prognostic factors such as cytogenetics, number of courses of pretransplant treatments, disease burden at conditioning and the pre-transplant history of disease [31, 32].

The study we reported here did not include any requirements in term of type and number of pre-transplant cycles of chemotherapy and all patients were transplanted with active disease. Proceeding to allo-HSCT trying or not to obtain a CR with either chemotherapy regimens, or novel drugs or targeted therapies even with subsequent trials, is still an open question, particularly in AML, [1, 33,34,35,36,37,38,39,40,41,42] NCT024615357 ETAL-3-ASAP trial is ongoing) while in ALL the use of immunotherapy before transplant is recommended [5, 6, 43]

Since only 14 patients did not undergo allo-HSCT we were not able to demonstrate the benefit of transplant over the non-transplant cohort However, based on the survival plot reported in Fig. 2b it is temptive to speculate that allo-HSCT may provide a limited beneficial effect on long term survival in these patients is confirmed, even after correcting for the time to transplant bias.

GVHD was not the leading cause of NRM in this setting of patients. Instead, bacterial infections had a leading role in the early post-transplant NRM [44], especially after intensive MAC regimen.

Despite the unsatisfactory results, a major strength of this study is represented by its prospective design in opening the chance to access alternative donor transplant in the R/R setting, the uniform criteria adopted for patients’ selection, the possibility to use any possible alternative donors and the unique conditioning regimen.

The characteristics of the patient population here reported are closer to the real life more than to selective clinical trials, limiting significantly the potential positive selection bias of a non-randomized design. For this reason, although the definition of refractoriness in the study did not match perfectly the criteria proposed by Ferguson et al. [45], it cannot surprise that the patients here enrolled fully mirror the fate of patients not achieving a CR after two courses of chemotherapy (REF2 population elsewhere described [45]).

The crucial factors dictating the poor prognosis of the patients here reported were the use of a very intensive regimen, the truly refractory status and pretreatment burden, and, even more importantly, the absence of a pre-planned strategy to prevent relapse included in the study.

At this stage, we believe that the need for a high-intensity conditioning regimen for transplant in active disease should be integrated with approaches aimed at lowering post-transplant relapse risk such as prophylactic modified or unmodified DLIs, targeted drugs maintenance, immunosuppression modulation [46], as indeed envisaged in the original conceptualization of the sequential approach [47] and prospectively tested in this setting.