Introduction

Adult T-cell leukemia/lymphoma (ATL) is an aggressive T-cell neoplasm caused by human T-lymphotropic virus type 1 [1] and has a poor prognosis [2,3,4]. Intensive chemotherapy and autologous stem cell transplantation have definite limitations [5,6,7,8,9]. Although allogeneic hematopoietic stem cell transplantation (allo-HSCT) may provide long-term remission [10,11,12,13], the 3-year survival rate after this procedure is only 33–36% in Japan [14, 15], and similar outcomes have been reported in Europe [16]. Given that grade I–II acute graft-versus-host disease (GVHD) is associated with better overall survival (OS) [17,18,19], graft-versus-leukemia (GVL) effects induced by donor lymphocytes are important in ATL [20, 21]. Mild to moderate GVHD is therefore likely to be beneficial in ATL; however, severe GVHD is associated with non-relapse mortality [17]. Thus, it is difficult to optimize the balance between GVHD and GVL effects while avoiding infection in patients who undergo allo-HSCT for ATL.

Efforts to reduce GVHD decreased transplant-related morbidity/mortality but increased risk of relapse, which may be attributable to diminished GVL effects [22]. On the other hand, efforts to reduce relapse-related mortality by delivering enhanced high-dose chemotherapy prior to allo-HSCT led to excess deaths due to organ damage, infections, and GVHD [23]. The final goal of allo-HSCT is to avoid relapse, severe GVHD, and other fatal complications. Therefore, GVHD-free, relapse-free survival (GRFS), defined as the absence of grade III–IV acute GVHD, chronic GVHD that requires systemic treatment, relapse, and death, has been proposed as a novel and clinically meaningful endpoint of allo-HSCT [24]. Cord blood transplantation (CBT) reportedly provides favorable GRFS compared with transplantation from other donor sources in some Japanese populations [25]. The median age of ATL patients is higher and the incidence of fatal opportunistic infection in ATL patients is higher than in patients with other hematologic malignancies [26]. Therefore, we hypothesized that GRFS of ATL patients may differ from that of patients with other hematologic malignancies.

As most ATL patients are over 60 years old, many will not have human leukocyte antigen (HLA)-matched healthy siblings of appropriate age to be eligible as donors. Therefore, alternative donor sources are often utilized in ATL. Among several donor sources, CBT is often performed, especially in patients with relapsing or refractory ATL [27]. This is because such cases require rapid transplantations and the availability of cord blood has recently improved in Japan [28]. HLA-haploidentical transplantation (haplo-HSCT) is also available. However, such transplants are associated with transplant-related mortality and limited quality of life (QOL) [29]. OS of ATL patients who underwent CBT between 1996 and 2005 was inferior to that of ATL patients who underwent transplantation from other donor sources [14]. Beneficial outcomes of CBT for chemotherapy-sensitive ATL are reported to be feasible [30, 31], although these findings should be verified in a larger study. Furthermore, CBT results in a low incidence of GVHD [32], which raises concerns about diminished GVL effects and higher relapse rates. Given that the outcome of CBT for other hematologic malignancies improved after 2006 [33], the limitations and advantages of CBT for ATL should be reevaluated.

Here, we conducted a nationwide retrospective study focusing on OS and GRFS of patients with allo-HSCT-treated ATL after 2006 and compared GRFS among patients who underwent transplantation from different donor sources. Our results illustrate several important advantages and disadvantages of transplantation from each donor source, which may help to improve the overall outcome of allo-HSCT for ATL.

Methods

Data collection

Data of 1363 patients with ATL who first underwent allo-HSCT between 2006 and 2015 in Japan were retrospectively analyzed. Allo-HSCT recipient clinical data were collected by the Japan Society for Hematopoietic Cell Transplantation (JSHCT) using the Transplant Registry Unified Management Program (TRUMP) [34, 35]. This analysis included the following clinical characteristics of the patients: age at transplantation, gender, disease status at transplantation, date of transplantation, duration from diagnosis to transplantation, and conditioning regimens. The numbers of HLA antigen mismatches in CBT, related bone marrow transplantation (R-BMT), and related peripheral blood stem cell transplantation (R-PBSCT) were counted with respect to HLA-A, -B, and -DRB1. Haplo-HSCT was defined as transplantation with two to three HLA mismatches in the graft-versus-host direction. The number of HLA allelic mismatches in unrelated bone marrow transplantation (UR-BMT) was counted with respect to HLA-A, -B, -C, and -DRB1. The present study was approved by the data management committees of the JSHCT as well as the Institutional Review Board of the Kyoto University Graduate School of Medicine. All patients provided consent to participate in the present study.

Definition of endpoints

OS was defined as the duration from transplantation to death, and the patients who remained alive at the final follow-up were censored. GRFS was defined as the duration from transplantation until death, relapse, development of grade III–IV acute GVHD, or development of chronic GVHD that required systemic treatment, and the patients without any of these events at the time of the final follow-up were censored. The relapse date of patients who did not achieve complete remission (CR) before and after transplantation was defined as day 0.1. Non-GVHD/relapse mortality (NGRM) was defined as the incidence of deaths without relapse, development of grade III–IV acute GVHD, or development of chronic GVHD that required systemic treatment. Patients were divided into two groups according to the conditioning regimens: myeloablative conditioning (MAC) and reduced intensity conditioning (RIC). MAC and RIC were defined according to the proposals of Giralt et al. [36] and Bacigalupo et al. [37] respectively.

Statistical analysis

Descriptive statistics were used to summarize variables related to the demographics and clinical characteristics of the patients. Groups were compared using Fisher’s exact test as appropriate for categorical variables and the Kruskal–Wallis test for continuous variables. The Mann–Whitney U test was used as a nonparametric test to compare two groups. The probabilities of OS and GRFS were estimated according to the Kaplan–Meier method, and univariable comparisons among the groups were performed using the log-rank test. The Cox proportional hazard model was used for multivariate analysis of OS and GRFS. The cumulative incidence rates of relapse and GVHD were estimated, and death without these events was considered as a competing factor. NGRM was also estimated, and its competing events were relapse, grade III–IV acute GVHD, and systemically-treated chronic GVHD. Results were expressed as hazard ratios and their 95% confidence intervals (CI). All tests were two-sided, and a p value of <0.05 was considered to indicate statistical significance. All statistical analyses were performed using Stata (version 13.0, Stata Corporation) and EZR (Saitama Medical Center, Jichi Medical University), a graphical user interface for R (The R Foundation for Statistical Computing, version 2.3.0) [38]. The shared scripts from the TRUMP data were used in the analyses [39].

Results

Characteristics of the patients

The median patient age was 57 years (range: 20–78 years) and the median observation period of survivors was 3.1 years (range: 0.0–10.5 years). The number of CBTs performed in 2011–2015 (n = 240) was double the number performed in 2006–2010 (n = 119, p < 0.001). The median age of patients who underwent CBT was 59 years, which was higher than that of patients who underwent the other types of transplantation (p < 0.001). The proportion of patients in CR at transplantation was lower among those who underwent haplo-HSCT (25%), CBT (37%), and R-PBSCT (35%) than among those who underwent R-BMT (44%) and UR-BMT (45%; p = 0.023). The duration from diagnosis to transplantation was longer for patients who underwent UR-BMT than for patients who underwent the other types of transplantation (p < 0.001). The percentage of HLA-mismatched transplantations was lower among patients who underwent R-BMT (15%) and higher among patients who underwent CBT (96%; p < 0.001). These characteristics are summarized in Table 1. Details of conditioning regimens are summarized in Supplementary Table 1. Of the patients who received MAC, 55% received cyclophosphamide and total body irradiation, and 13% received busulfan and cyclophosphamide. Of the patients who received RIC, 55% received melphalan-based conditioning and 39% received busulfan-based conditioning.

Table 1 Characteristics of the patients.

OS

First, we compared R-BMT, R-PBSCT, UR-BMT, and CBT. In this analysis, we excluded patients who received haplo-HSCT as there were many fewer than those who received the other transplantation types. One-year OS was worse after CBT (38%, 95% CI: 32–43%) than after R-BMT (49%, 95% CI: 40–58%), R-PBSCT (52%, 95% CI: 45–58%), and UR-BMT (47%, 95% CI: 43–51%; p < 0.001; Fig. 1a). However, among patients in CR at transplantation, 1-year OS after CBT (52%, 95% CI: 43–60%, n = 132) was equivalent to that after R-BMT (55%, 95% CI: 41–68%, n = 51), R-PBSCT (57%, 95% CI: 45–67%, n = 79), and UR-BMT (58%, 95% CI: 52–63%, n = 280; p = 0.15; Fig. 1b). On the other hand, in the non-CR patients, 1-year OS after CBT (28%, 95% CI: 22–34%, n = 223) was inferior to that after R-BMT (45%, 95% CI: 33–57%, n = 63), R-PBSCT (49%, 95% CI: 41–57%, n = 147), and UR-BMT (39%, 95% CI: 34–44%, n = 337; p < 0.001; Fig. 1c).

Fig. 1: OS and cumulative incidence of relapse in ATL patients.
figure 1

Kaplan–Meier curves of OS of all patients who received R-BMT, R-PBSCT, UR-BMT, and CBT (a). OS of patients in CR (b) and non-CR (c) at transplantation. Cumulative incidence of relapse in all patients (d), patients in CR at transplantation (e), and non-CR at transplantation (f).

Relapse

The 1-year cumulative incidence of relapse after CBT was 47% (95% CI: 42–52%), which was higher than that after R-BMT (40%, 95% CI: 31–49%), R-PBSCT (42%, 95% CI: 35–48%), and UR-BMT (35%, 95% CI: 31–39%; p = 0.003; Fig. 1d). Among patients in CR at transplantation, the 1-year cumulative incidence of relapse after CBT (26%, 95% CI: 19–34%) was not significantly different from that after R-BMT (31%, 95% CI: 18–44%), R-PBSCT (31%, 95% CI: 21–42%), and UR-BMT (21%, 95% CI: 16–26%; p = 0.19; Fig. 1e). On the other hand, among the patients in non-CR at transplantation, the 1-year cumulative incidence of relapse after CBT (60%, 95% CI: 53–66%) was higher than that after R-BMT (48%, 95% CI: 35–60%), R-PBSCT (47%, 95% CI: 39–55%), and UR-BMT (46%, 95% CI: 41–52%; p = 0.002; Fig. 1f).

GRFS

One-year GRFS did not significantly differ after CBT (21%, 95% CI: 17–26%), R-BMT (26%, 95% CI: 18–34%), R-PBSCT (22%, 95% CI: 16–27%), and UR-BMT (26%, 95% CI: 22–29%; p = 0.09; Fig. 2a). Among patients in CR at transplantation, GRFS did not differ among the four donor sources (CBT: 28%, 95% CI: 21–36%, R-BMT: 35%, 95% CI: 22–49%, R-PBSCT: 27%, 95% CI: 18–37%, and UR-BMT: 34%, 95% CI: 29–40% at 1 year; p = 0.49; Fig. 2b). Similarly, in the non-CR patients, GRFS did not differ among donor sources at 1 year (CBT: 16%, 95% CI: 12–22%, R-BMT: 19%, 95% CI: 11–30%, R-PBSCT: 18%, 95% CI: 13–25%, and UR-BMT: 19%, 95% CI: 15–23%; p = 0.25; Fig. 2c).

Fig. 2: Comparison of GRFS and first GRFS events among donor sources.
figure 2

Kaplan–Meier curves of GRFS of all patients (a), patients in CR at transplantation (b) and non-CR at transplantation (c). Band graphs showing the percentages of first GRFS events in all patients (d) and patients in CR at transplantation (e).

We also analyzed the first GRFS events in all the patients and those in CR at transplantation. The rate of NGRM after R-PBSCT (15%) was lower than that after the other transplantation types (26%) (p < 0.001), whereas the incidence of chronic GVHD that required systemic treatment after CBT (8%) was lower than that after the other transplantation types (19%, p < 0.001; Fig. 2d). Among patients in CR at transplantation, the proportion relapsing after CBT (29%) was similar to that after the other transplantation types (29%, p = 0.90), but CBT resulted in a higher rate of NGRM (41%) compared with the other transplantation types (28%, p = 0.014; Fig. 2e).

GVHD and NGRM

The 1-year cumulative incidence of grade III–IV acute GVHD after CBT was 11% (95% CI: 8–15%), which was lower than that after R-PBSCT (20%, 95% CI: 15–26%) and tended to be lower than that after R-BMT (17%, 95% CI: 10–24%) and UR-BMT (13%, 95% CI: 11–16%; p = 0.014; Fig. 3a). The percentage of patients with grade II acute GVHD after UR-BMT was 33%, which was higher than that after transplantation from the other donor sources (R-BMT: 23%, R-PBSCT: 24%, and CBT: 20%, p < 0.001). The percentage of patients with chronic GVHD that required systemic therapy at 1 year after CBT was 9% (95% CI: 6–12%), which was lower than that after transplantation from the other sources (R-BMT: 21%, 95% CI: 14–29%, R-PBSCT: 29%, 95% CI: 23–35%, and UR-BMT: 18%, 95% CI: 15–21%, p < 0.001; Fig. 3b). The 1-year NGRM after R-PBSCT was 9% (95% CI: 5–13%), which was lower than that after transplantation from the other sources (CBT: 20%, 95% CI: 16–24%, R-BMT: 12%, 95% CI: 7–19%, and UR-BMT: 16%, 95% CI: 13–19%; p < 0.001; Fig. 3c). Similar differences in the incidence of GVHD and NGRM were detected in both the CR and non-CR patients for each donor source.

Fig. 3: Cumulative incidences of acute/chronic GVHD and non-GVHD/relapse mortality.
figure 3

Cumulative incidences of grade III–IV acute GVHD (a), systemically treated chronic GVHD (b), and non-GVHD/relapse mortality (c) after transplantation from each source.

Causes of NGRM are summarized in Table 2. The proportion of deaths caused by infection was significantly higher after CBT (50%) than after the other transplantation types (29%, p = 0.004). This result was associated with a higher incidence of graft failure after CBT (18%) than after the other transplantation types (7%, p < 0.001).

Table 2 Causes of non-graft-versus-host-disease/relapse deaths.

HLA matching status

Next, we compared the outcomes of HLA-matched transplantation with HLA-mismatched (including HLA-haploidentical) transplantation. OS, relapse, grade III–IV acute GVHD, chronic GVHD that required systemic treatment, NGRM, and GRFS did not significantly differ between patients who underwent HLA-matched BMT and those who underwent HLA-mismatched BMT (data not shown). PBSCT was divided into three groups: HLA 6/6-matched, 5/6-matched, and haploidentical in the graft-versus-host direction. The proportion of CR patients in each group was 35%, 29%, and 30%, respectively. For GVHD prophylaxis, 67% of the patients in the HLA 5/6-matched transplantation group received standard prophylaxis (calcineurin inhibitor plus methotrexate or mycophenolate mofetil), while 24% received anti-thymocyte globulin (ATG) and 4% posttransplant cyclophosphamide (PTCY), in addition to standard prophylaxis. In the haplo-HSCT group, 47% of patients received ATG, 25% PTCY, and 25% corticosteroids, in addition to standard prophylaxis. One-year OS after HLA-haploidentical PBSCT was 39% (95% CI: 23–55%), which tended to be inferior to that after HLA 6/6-matched (53%, 95% CI: 45–60%) and HLA 5/6-matched (49%, 95% CI: 33–62%) PBSCT (p = 0.11; Fig. 4a). GRFS did not significantly differ among these three groups (haploidentical: 22%, 95% CI: 10–36%, 6/6-matched: 20%, 95% CI: 15–26%, and 5/6-matched: 27%, 95% CI: 15–40% at 1 year; p = 0.69; Fig. 4b). There were no significant differences in the cumulative incidence of relapse after HLA 6/6-matched (40%, 95% CI: 32–47% at 1 year), HLA 5/6-matched (49%, 95% CI: 34–63%), and HLA-haploidentical (52%, 95% CI: 34–67%) PBSCT (p = 0.33; Fig. 4c). The cumulative incidences of grade III–IV acute GVHD, systemically-treated chronic GVHD, and NGRM did not differ among the groups (Fig. 4d–f).

Fig. 4: HLA mismatch influences the outcome of PBSCT.
figure 4

Kaplan–Meier curves of OS (a) and GRFS (b) according to the HLA-matched status. Cumulative incidences of relapse (c), grade III–IV acute GVHD (d), systemically treated chronic GVHD (e), and non-GVHD/relapse mortality (f) according to the HLA-matched status.

Univariate and multivariate analysis of OS and GRFS

We performed univariate and multivariate analyses of OS and GRFS. In univariate analysis of OS, male sex, higher age (>50 years), worse performance status (PS >1), non-CR status at transplantation, and CBT were associated with lower OS (Table S2). Multivariate analysis revealed that male sex, higher age, worse PS, MAC, and non-CR status at transplantation were associated with lower OS, but CBT was not (Table S2). In univariate analysis of GRFS, male sex, higher age, worse PS, non-TBI conditioning, non-CR status at transplantation, and HLA mismatch were associated with lower GRFS (Table 3). Multivariate analysis revealed that male sex, higher age, worse PS, longer duration from diagnosis to transplantation (>180 days), and non-CR status at transplantation were associated with lower GRFS, whereas HLA mismatch was not (Table 3). Donor sources were not associated with GRFS, and the era of transplantation (2006–10 vs. 2011–15) was not associated with OS or GRFS (Table 3).

Table 3 Univariate and multivariate analyses of graft-versus-host disease-free, relapse-free survival.

Discussion

The success of allo-HSCT is dependent on suppression of severe complications such as GVHD, relapse, infection, and vital organ failure. However, no one factor can predict long-term survival without ongoing morbidity. GRFS represents ideal recovery from allo-HSCT and is now recognized as an important treatment endpoint. Transplant-treated ATL patients often develop GVHD or other severe complications that lead to limited activity, and thus GRFS is also an important endpoint with respect to QOL. This is the first report of GRFS in ATL, and we have focused on differences in each component associated with GRFS among donor sources. We identified different advantages and disadvantages of each donor source, which provided clues to improve the outcome of allo-HSCT.

We made several important findings regarding CBT. Among patients in CR, OS and relapse rates after CBT were not inferior to those after transplantation from the other sources (Fig. 1b, d). GRFS did not significantly differ between patients who underwent CBT and those who underwent transplantation from the other donor sources (Fig. 2a–c). Whereas CBT was associated with a decrease in severe GVHD (Fig. 3a, b), relapse of CR patients after CBT was equivalent to that after transplantation from the other sources (Fig. 1c). These findings imply that the GVL effects of CBT are substantial for CR patients but may be limited in cases with a high tumor burden. On the other hand, the disadvantage of CBT was a high NGRM rate (Fig. 3c), which may be attributable to fatal opportunistic infections in many cases. This derives from delayed immune reconstitution after CBT, which should be further investigated.

In general, GRFS is shorter following PBSCT than BMT due to the higher incidence of chronic GVHD requiring systemic treatment [25]. In our study, however, the duration of GRFS was equivalent in PBSCT and BMT (Fig. 2a–c), due to the low NGRM rate after PBSCT. As described in Table 2, most of the NGRM derived from infectious complications. It is widely recognized that ATL patients often develop severe opportunistic infections [40]. In addition, infection-related mortality of transplant-treated ATL patients is higher than that of patients with other hematological malignancies [26]. PBSCT enables quick recovery of neutrophils and lymphocytes [41], which reduces severe infection. The benefits of reducing infection-related deaths following PBSCT in ATL may be greater than in other hematological malignances. In other words, quick immune reconstitution following PBSCT may be meaningful in ATL, although it remains to be verified in HLA-mismatched settings.

The cumulative incidence of severe GVHD tended to be higher after PBSCT (Fig. 3a, b); however, the relapse rate was not lower after PBSCT than after transplantation from the other donor sources (Fig. 1d, e). On the other hand, the incidence of grade II acute GVHD was higher and the cumulative incidence of relapse was lower after UR-BMT (Fig. 1d, e). These results suggest that moderate GVHD is associated with potent GVL effects and a reduced relapse rate, while severe GVHD is harmful and has no relapse suppression advantages. Many of the patients with severe GVHD possibly died of GVHD itself, infection, or organ failure before the appearance of GVL effects. These findings are compatible with those of another study, which looked at the impact of GVHD on outcomes after allo-HSCT for ATL [17].

In analysis of all hematological malignancies in Japan, GRFS was worse after HLA-mismatched transplantation than after HLA-matched transplantation. However, in the present study, HLA matching status did not influence GRFS in multivariate analysis (Table 3). Among patients who underwent PBSCT, the incidence of relapse tended to be higher after HLA-mismatched transplantation (Fig. 4c) and the incidence of grade III–IV acute GVHD tended to be lower after HLA 5/6-matched transplantation (Fig. 4d). This might be because patients who underwent HLA-mismatched transplantation received strong GVHD prophylaxis such as ATG, PTCY, and corticosteroids. These results suggest that adjusting the strength of GVHD prophylaxis may improve the outcome of HLA-mismatched transplantation in ATL. The relapse rate was not decreased following HLA-mismatched transplantation; therefore, we could not confirm the HLA-dependent GVL effects in ATL, which should be further explored. On the other hand, the relapse rate tended to be lower after unrelated transplantation (UR-BMT and CBT) than after related transplantation among patients in CR (Fig. 1e). GVL effects may be more potent after unrelated transplantation due to a mechanism other than HLA mismatch, such as mismatch of killer immunoglobulin-like receptor or its ligand polymorphisms [42].

Multivariate analysis revealed that male sex, higher age (>50 years), worse PS (>1), longer duration from diagnosis to transplantation (>180 days), and non-CR status at transplantation were associated with lower GRFS, which is consistent with the survey of GRFS in all hematological malignancies [25] and the report concerning early donor application for ATL patients [43]. Only 5% of the patients received ATG; therefore, a larger amount of data must be analyzed to clarify the association between ATG and GRFS in ATL. The outcome of allo-HSCT for non-CR ATL is still unsatisfactory. Many aspects of ATL therapy including conventional chemotherapy, conditioning regimens, GVHD prophylaxis, and maintenance therapies should be further optimized.

This study has inherent limitations that are common among observational studies. Eligibility for transplantation, as well as the choice of transplantation protocol including the selection of graft source, was determined by the treating physicians at each institution. The confounding effect of some variables, such as disease subtype, could not be evaluated because of missing data. Moreover, the data analyzed did not include information regarding the use of mogamulizumab or lenalidomide, which have both been shown to increase fatal GVHD [44,45,46,47].

The present study revealed that transplant from all donor sources is feasible for those patients in CR and that each source can be selected depending on the clinical characteristics of patients. Great efforts should be made to improve or avoid the shortcomings of alternative donor sources, which should ultimately improve the outcome of ATL.