Introduction

Allogeneic hematopoietic cell transplantation (HCT) is an effective therapy that minimizes the risk of relapse in patients with acute myeloid leukemia (AML), but in exchange for a substantial risk of non-relapse mortality (NRM) [1, 2]. Given the complex and specialized nature of the modality, it is expected that allogeneic HCT outcomes could be affected by treating centers’ experience. This concept, referred to as “the center effect,” has been investigated; however, the results are conflicting [3,4,5,6,7,8,9,10,11,12,13,14,15,16]. When interpreting the existing literature, it is worth noting that most of the previous studies analyzed heterogeneous patient populations with various diseases based on relatively older data.

The situation surrounding allogeneic HCT is changing. Owing to the rapid progress toward an information society, it has become much easier to obtain medical information by means other than one’s own experience, which may act to reduce the center effect. On the other hand, the growing diversification of transplantation procedures may act to increase the center effect. Therefore, it is worthwhile to reevaluate the center effect in this era of major changes in transplantation practice. In this study, we analyzed the center effect using the Japanese nationwide registry data of patients with AML who underwent allogeneic HCT during the past decade from 2010 to 2019 under a uniform healthcare system.

Methods

Study population

All data were collected using the Transplant Registry Unified Management Program sponsored by the Japanese Society for Transplantation and Cellular Therapy and the Japanese Data Center for Hematopoietic Cell Transplantation [17]. This registration program covers >99% of the more than 300 transplantation centers nationwide [18].

Patients aged ≥16 years with AML (except for acute promyelocytic leukemia) were eligible for the analysis if they underwent their first allogeneic HCT during first complete remission (CR), second CR, or non-CR between 2010 and 2019 from a matched related or unrelated donor, a haploidentical related donor, or single-unit umbilical cord blood (UCB) containing a total nucleated cell dose of ≥2.0 × 107/kg of the recipient’s body weight with at least a 4/6 match at the antigen level for human leukocyte antigen (HLA)-A, HLA-B, and HLA-DR [19].

This study was approved by the Institutional Review Board of the Aichi Cancer Center. The patients provided informed consent for the use of their clinical information for research.

Definitions

Center experience was defined based on the number of allogeneic HCTs performed for any indication during the study period, by which centers were first divided into four groups using quartiles. Preliminary analysis showed no significant difference between the second and third quartile groups in terms of overall survival (OS), relapse, or NRM (data not shown); thus, these two groups were combined, leading to the adoption of the following three categories: low-volume (median [range], 65 [1–93] cases per decade), intermediate-volume (median [range], 170 [94–316] cases per decade), and high-volume (median [range], 389 [328–1028] cases per decade) centers for subsequent analyses.

CR was determined when the following conditions were fulfilled: the presence of <5% of blasts in the bone marrow, absence of leukemic blasts in the peripheral blood and extramedullary sites, and recovery of blood counts [20]. Relapse was defined as the loss of CR, whereas NRM was defined as death without relapse. For the analysis of relapse, patients who had never achieved CR after transplantation were categorized as relapse cases at time zero. The Eastern Cooperative Oncology Group performance status [21], HCT-specific comorbidity index (HCT-CI) [22], conditioning intensity [23], and cytogenetic risk [24] were defined based on published criteria. Neutrophil and platelet engraftments were defined as the first of three consecutive measures with neutrophil and platelet counts of ≥0.5 × 109/L and ≥2.0 × 109/L, respectively [18]. Acute and chronic graft-versus-host disease (GVHD) were diagnosed and graded according to the standard criteria [25, 26].

Statistical analysis

The probability of OS was estimated using the Kaplan–Meier estimator, and differences between groups were analyzed using the log-rank test [27]. Multivariate analyses were performed using the Cox proportional hazards model [28]. All covariates listed in Table 1 were included in the final multivariate model considering their potentially confounding effects. Relapse and NRM were mutually competing events, and their incidence was estimated using the cumulative incidence estimator [29]. Engraftment and GVHD were also assessed by accommodating the competing risks. The Fine–Gray model was used for the univariate and multivariate analyses in a competing risk setting [30].

Table 1 Characteristics of patients according to the center volume.

Results

Patients

A total of 7821 patients were eligible for the analysis, with 1833 (23%), 3884 (50%), and 2104 (27%) patients being classified into the low-, intermediate-, and high-volume center groups, respectively. The median follow-up for survivors was 4.3 (interquartile range [IQR], 2.3–6.6) years. Table 1 shows the characteristics of the patients in each center category. Patients who underwent transplantation at high-volume centers were characterized by older age, worse HCT-CI, and a higher frequency of non-CR at the time of transplantation. In addition, there was a notable imbalance regarding donor type; matched related donors accounted for 34%, 24%, and 19% in the low-, intermediate-, and high-volume centers, whereas UCB accounted for 32%, 45%, and 49%, respectively.

OS, relapse, and NRM

Figure 1 shows the OS, relapse, and NRM according to the center category. The 4-year probabilities of OS were 58% (95% confidence interval [CI], 55–61%), 63% (95% CI, 61–65%), and 66% (95% CI, 63–69%) for patients who underwent transplantation during CR at low-, intermediate-, and high-volume centers, respectively (P < 0.001, Fig. 1a). Their respective cumulative incidences of relapse were 23% (95% CI, 21–26%), 18% (95% CI, 17–20%), and 19% (95% CI, 17–22%) (P = 0.016, Fig. 1b), and their respective cumulative incidences of NRM were 22% (95% CI, 19–25%), 21% (95% CI, 19–23%), and 19% (95% CI, 17–22%) (P = 0.333, Fig. 1c). The 4-year probabilities of OS were 22% (95% CI, 19–25%), 29% (95% CI, 26–31%), and 29% (95% CI, 26–31%) for patients who underwent transplantation during non-CR at low-, intermediate-, and high-volume centers, respectively (P = 0.008, Fig. 1d). Their respective cumulative incidences of relapse were 56% (95% CI, 52–59%), 49% (95% CI, 47–52%), and 42% (95% CI, 39–45%) (P < 0.001, Fig. 1e), and their respective cumulative incidences of NRM were 27% (95% CI, 23–30%), 26% (95% CI, 24–28%), and 33% (95% CI, 30–36%) (P < 0.001, Fig. 1f).

Fig. 1: Outcomes according to the center category.
figure 1

Patients who underwent transplantation at low-, intermediate-, and high-volume centers are compared. a Overall survival (OS), (b) cumulative incidence of relapse, and (c) and cumulative incidence of non-relapse mortality (NRM) in patients with acute myeloid leukemia (AML) who underwent allogeneic hematopoietic cell transplantation (HCT) during complete remission (CR). d OS, (e) cumulative incidence of relapse, and (f) cumulative incidence of NRM in patients with AML who underwent allogeneic HCT during non-CR.

The results of the multivariate analysis on OS, relapse, and NRM are shown in Table 2. Using the high-volume centers as a reference, the risk of overall mortality was higher for the low- (hazard ratio [HR], 1.31; 95% CI, 1.20–1.44; P < 0.001) and intermediate-volume centers (HR, 1.11; 95% CI, 1.03–1.20; P = 0.008). Patients who underwent transplantation at low- and intermediate-volume centers showed a higher risk of relapse than those treated at high-volume centers (HR, 1.31; 95% CI, 1.16–1.48; P < 0.001 for the low-volume centers; and HR, 1.18; 95% CI, 1.07–1.31; P = 0.001, for the intermediate-volume centers). On the other hand, no difference was found in the risk of NRM between the low- and high-volume centers (HR, 1.08; 95% CI, 0.94–1.23; P = 0.278) or between the intermediate- and high-volume centers (HR, 0.90; 95% CI, 0.81–1.01; P = 0.070). Compared to the intermediate-volume centers, the low-volume centers were associated with higher risks of overall mortality (HR, 1.18; 95% CI, 1.09–1.28; P < 0.001) and NRM (HR, 1.19; 95% CI, 1.05–1.35; P = 0.005), and a trend toward a higher risk of relapse (HR, 1.10; 95% CI, 0.99–1.23; P = 0.062). Table 3 summarizes adjusted effects of the center volume on overall mortality, relapse, and NRM for each donor group. The center effect was most pronounced in UCB transplantation (UCBT), whereas it was not seen in haploidentical related HCT (Haplo-HCT).

Table 2 Results of the multivariate analysis on overall mortality, relapse, and non-relapse mortality.
Table 3 Adjusted effects of the center volume on overall mortality, relapse, and non-relapse mortality for each donor group.

Engraftment and GVHD

The adjusted effects of the center volume on engraftment and GVHD are shown in Supplementary Table 1. The likelihood of neutrophil engraftment was different only between the low- and high-volume centers (HR, 0.91; 95% CI, 0.85–0.98; P = 0.010); the differences in the likelihood of platelet engraftment were significant only between the intermediate- and high-volume centers (HR, 1.09; 95% CI, 1.02–1.17; P = 0.007). Compared to patients in the high-volume centers, those in the low-volume centers had lower risks of grade 2–4 acute GVHD (HR, 0.80; 95% CI, 0.71–0.90; P < 0.001), grade 3–4 acute GVHD (HR, 0.76; 95% CI, 0.62–0.92; P = 0.006), overall chronic GVHD (HR, 0.74; 95% CI, 0.66–0.84; P < 0.001), and extensive chronic GVHD (HR, 0.85; 95% CI, 0.72–1.00; P = 0.048), and those in the intermediate-volume centers had lower risks of grade 2–4 acute GVHD (HR, 0.91; 95% CI, 0.83–0.99; P = 0.032), overall chronic GVHD (HR, 0.68; 95% CI, 0.61–0.75; P < 0.001), and extensive chronic GVHD (HR, 0.76; 95% CI, 0.66–0.87; P < 0.0001). The difference between patients transplanted at low- and intermediate-volume centers was detected only in terms of grade 2–4 acute GVHD (HR, 0.88; 95% CI, 0.80–0.98; P = 0.019).

UCBT cohort

We next restricted our analysis to 3381 patients who underwent UCBT by dividing centers into four groups based on the number of UCBTs performed during the study period. The second and third quartile groups were combined because no difference was found in OS, relapse, or NRM (data not shown), and three categories of low-volume (n = 801, 24%), intermediate-volume (n = 1628, 48%), and high-volume centers (n = 952, 28%) were generated, with the numbers of UCBT performed during the study period of 10 years ranging 1–34 (median, 21), 35–117 (median, 79), and 121–794 (median, 204) cases, respectively. Consistent with the overall cohort, patients who underwent transplantation at high-volume centers were characterized by older age, worse HCT-CI, and a higher frequency of non-CR at transplantation (Table 4). Figure 2 shows the OS, relapse, and NRM according to the center category, and Table 5 presents the results of the multivariate analysis on OS, relapse, and NRM. The risk of overall mortality was higher for the low-volume centers (HR, 1.58; 95% CI, 1.38–1.82; P < 0.001) and the intermediate-volume centers (HR, 1.28; 95% CI, 1.13–1.44; P < 0.001) than the high-volume centers. Patients who underwent transplantation at low- and intermediate-volume centers had a higher risk of relapse than those treated at high-volume centers (HR, 1.66; 95% CI, 1.37–2.02; P < 0.001 for the low-volume centers; and HR, 1.54; 95% CI, 1.31–1.82; P < 0.001, for the intermediate-volume centers). There was no difference in the risk of NRM between the low- and high-volume centers (HR, 1.11; 95% CI, 0.93–1.34; P = 0.253) or between the intermediate- and high-volume centers (HR, 0.93; 95% CI, 0.80–1.09; P = 0.398). Compared to the intermediate-volume centers, the low-volume centers were associated with a higher risk of overall mortality (HR, 1.24; 95% CI, 1.09–1.40; P < 0.001) and a trend for a higher risk of NRM (HR, 1.19; 95% CI, 1.00–1.42; P = 0.054), but the risk of relapse was not different (HR, 1.08; 95% CI, 0.91–1.27; P = 0.381). The adjusted effects of the center category on engraftment and GVHD are shown in Supplementary Table 2. The high-volume centers had a higher likelihood of neutrophil engraftment than the low- or intermediate-volume centers (P = 0.003 and P = 0.014, respectively), and a higher likelihood of platelet engraftment than the low-volume centers (P = 0.017). As for GVHD, the high-volume centers showed higher risks for grade 2–4 acute GVHD, grade 3–4 acute GVHD, and overall chronic GVHD (P < 0.001 each, compared with the low- or intermediate-volume centers). The risk of extensive chronic GVHD did not differ between the low- and high-volume centers (P = 0.657) or between the intermediate- and high-volume centers (P = 0.336). The difference between patients transplanted at low- versus intermediate-volume centers was significant only in terms of grade 2–4 acute GVHD (HR, 0.78; 95% CI, 0.66–0.93; P = 0.005) and grade 3–4 acute GVHD (HR, 0.63; 95% CI, 0.45–0.87; P = 0.005).

Table 4 Characteristics of patients who underwent UCBT according to the center volume.
Fig. 2: Outcomes of patients undergoing umbilical cord blood transplantation (UCBT) according to the center category.
figure 2

Patients who underwent transplantation at low-, intermediate-, and high-volume centers are compared. a Overall survival (OS), (b) cumulative incidence of relapse, and (c) cumulative incidence of non-relapse mortality (NRM) in patients with acute myeloid leukemia (AML) who underwent UCBT during complete remission (CR). d OS, (e) cumulative incidence of relapse, and (f) cumulative incidence of NRM in patients with AML who underwent UCBT during non-CR.

Table 5 Results of the multivariate analysis on overall mortality, relapse, and non-relapse mortality for patients who underwent UCBT.

Discussion

To date, several studies have investigated the center effect in allogeneic HCT; some have demonstrated the survival advantage for patients who undergo transplantation at high-volume centers [3,4,5, 8, 10,11,12,13, 15], whereas others have not [6, 7, 14]. Moreover, only a fraction of these studies have evaluated the center effect on relapse and NRM, with mixed results [3, 4, 8, 10, 11, 15]. The major limitation in the existing literature is patient heterogeneity in terms of disease and healthcare environment. In addition, most of the published studies are based on relatively old data, raising questions about the applicability of the results to current practice. With these considerations in mind, we evaluated the center effect with a focus on adults with AML who underwent allogeneic HCT during the past decade in Japan where medical care was provided within a uniform healthcare system.

Our study underscores the difference in patient characteristics according to the center volume. Patients transplanted at high-volume centers were older, had more comorbidities, and were more frequently in non-CR at the time of transplantation, indicating that the high-volume centers were more likely to undertake challenging cases. After adjustment for known confounding factors, the risk of overall mortality was lowest for the high-volume centers and highest for the low-volume centers, with the difference in overall mortality being attributed primarily to the difference in relapse rather than NRM. The lack of difference in NRM between the intermediate- and high-volume centers and between the low- and high-volume centers suggests that the transplantation care quality is becoming equalized among centers in recent years. On the other hand, a strong center effect was demonstrated in terms of relapse. As expected, disease-related factors, such as disease status and cytogenetic risk, were greatly associated with the risk of relapse. Thus, the observation that patients who underwent transplantation at high-volume centers included more patients who did so while not in CR merits special attention. In this respect, the difference in the relapse incidence according to the center category was significant in both patients with and without CR at the time of transplantation and the multivariate analysis adjusting for these disease-related factors corroborates the finding of the center effect on relapse.

Determining the exact reason for the lower relapse risk observed in high-volume centers is challenging due to the retrospective nature of the present registry-based study. However, it can be assumed that the availability of advanced diagnostic technologies and novel targeted therapies in these centers might have contributed to the reduction in relapse [31,32,33]. Moreover, higher risk for acute and chronic GVHD in patients who underwent transplantation at high-volume centers suggests that GVHD policies might differ across the centers. It is widely recognized that GVHD is accompanied by the graft-versus-leukemia (GVL) effect, which plays a beneficial role in reducing relapse in patients undergoing allogeneic HCT [34,35,36]. Although the strategy to allow the development of GVHD can increase NRM, it is noteworthy that NRM was not higher in the high-volume centers.

The subgroup analysis by donor source showed that the center effect was most pronounced in patients who underwent UCBT. Even when the center effect for UCBT recipients was analyzed separately based on the number of UCBTs performed during the study period, the results remained unchanged. A recent study showed that the GVL effect was more potent in patients who underwent UCBT than those who underwent allogeneic bone marrow transplantation (BMT) or peripheral blood stem cell transplantation (PBSCT) [37], lending support to the hypothesis that an occurrence of GVHD is more beneficial in UCBT than in BMT/PBSCT. By contrast, NRM was not affected by the center category in our UCBT cohort. UCBT has been performed actively in Japan, accounting for one-third of all allogeneic HCTs [38], and it is conceivable that equalization of the transplantation techniques among centers has been achieved in not only BMT/PBSCT but also UCBT. In our study, the center effect was not seen in Haplo-HCT. This could be explained at least partly by the small sample size; however, another potential explanation could be that the characteristic of Haplo-HCT to suppress GVHD extensively in a straightforward way may have provided little room to modulate the GVL effect. In the secondary analysis of the Blood and Marrow Transplant Clinical Trials Network 1101 study, in which patients were randomized to either UCBT or Haplo-HCT, centers experienced in UCBT (>10 UCBTs per year) had similar outcomes with UCBT and Haplo-HCT, whereas those experienced in Haplo-HCT (>10 Haplo-HCTs but ≤10 UCBTs per year) had worse outcomes with UCBT than Haplo-HCT because of a higher risk of relapse [16], which is in keeping with our findings.

Because of the retrospective nature of the study, our results suffer from inherent biases, such as patient selection, treatment heterogeneity, and the presence of unmeasured factors. In addition, we used the number of allogeneic HCTs as an indicator of center experience, whereas several studies have shown that the prognostic relevance of other center factors, such as patient-per-physician ratio [7], accreditation status [9], and program duration [11], which were not taken into consideration in our study. While acknowledging these potential limitations, our analysis was strengthened by a homogeneous patient population, namely, adults with AML who underwent allogeneic HCT under a uniform healthcare system. In this regard, however, consideration should be given to the fact that our findings were exclusively based on Japanese patient data; thus, it is premature to apply our conclusions to other patient populations.

In summary, this study reveals the presence of the center effect in allogeneic HCT performed during the past decade, 2010–2019, in Japan. The difference in overall mortality according to the center volume resulted primarily from the difference in relapse, and this may be derived from centers’ attitude toward the allowance of GVHD. The weaker effect of center experience on NRM compared with that on relapse suggests the advancement in the transplantation care quality across the country.