Introduction

Myelodysplastic syndrome (MDS) comprises a very heterogeneous group of myeloid neoplasms with varying outcomes. While several therapies have been developed in recent years, allogeneic hematopoietic stem cell transplantation (HCT) remains the only potentially curative treatment for these patients. However, its use is restricted due to the risk of morbidity and mortality [1]. In the past decade, as a result of advances in therapy and supportive care, along with wider donor availability, the number of patients with MDS undergoing HCT has been increasing [2].

Graft-versus-host disease (GVHD) remains one of the main causes of transplant-related morbidity and mortality. One strategy to reduce GVHD is the use of ex vivo T-cell depletion (TCD) of the graft through CD34+ selection. We have previously reported rates of overall survival (OS) of 56.9% at 2 years (95% CI, 48–67.3%) and 49.3% at 5 years (95% CI, 40.4–60.2%) with cumulative incidences of grades II–IV acute GVHD of 9.8% at day 100 in patients with MDS undergoing this approach [3]. This study demonstrated that CD34-selected HCT offers long-term OS and RFS with low rates of acute and chronic GVHD, and without an increased risk of relapse.

Since outcomes for patients with MDS are heterogeneous, individual risk stratification is important in managing patients. Most published scoring systems have focused on prognostic variables measured before HCT. The International Prognostic Scoring System (IPSS) [4] and the revised IPSS (IPSS-R) [5] are commonly used by clinicians to help risk stratification. While these scores are helpful to guide decisions prior to transplant, they are less useful in predicting outcomes in patients who have already undergone HCT. Post-transplant complications are well known and can be easily identified, but no definitive prediction model based on them exits. Robust statistical models of post-transplant complications should guide decisions for each patient with a risk-adapted individualized strategy.

The Francophone Society of Bone Marrow Transplantation and Cellular therapy (SFGM-TC) group has recently validated a scoring system based on post-transplant complications that can estimate the survival probability of patients with MDS who survive more than 100 days after HCT [6]. This post-transplant scoring system (PTSS) is a clinical risk assessment tool based on three independent risk factors: grade of acute GVHD, lack of platelet recovery before day 100, and relapse before day 100. This score can be helpful in decision making for patients with complications after allo-HCT.

In this retrospective study, we calculated the PTSS at different timepoints after transplant in a cohort of patients with MDS who underwent TCD-HCT, in an attempt to validate the prognostic tool using a different transplant approach.

Material and methods

Patients

The analysis included patients aged 18 or older diagnosed with MDS who underwent a first allo-HCT using ex vivo CD34+ cell-selected peripheral blood stem cell (PBSC) transplant. Patients were treated at Memorial Sloan Kettering Cancer Center between January 1, 2008 and May 31, 2018. Patients with MDS classified by WHO category 2008 [7] were included. Disease status at transplant (responder vs no responder) was assessed according to IWG 2006 criteria. HLA matching was established by DNA sequence-specific oligonucleotide typing for HLA-A, -B, -C, -DRB1, and -DQB1 loci. Patients received grafts from HLA-matched (10/10) or single mismatched (9/10) related or unrelated donors. Clinical outcomes, including acute GVHD, engraftment, relapse, and causes of death, were captured per standard clinical practice. Written informed consent for treatment was obtained from all patients and donors. Approval for this retrospective review was obtained from the Institutional Review Board and Privacy Board.

Conditioning regimens and graft source

All patients were treated with myeloablative conditioning regimens, including hyper fractionated total body irradiation 1375 cGy over 4 days, followed by thiotepa 5 mg/kg/day i.v. for 2 days and either fludarabine 25 mg/m2/day i.v. for 5 days, or cyclophosphamide 60 mg/kg/day i.v. for 2 days; or busulfan followed by melphalan 70 mg/m2/day i.v. for 2 days, and fludarabine 25 mg/m2/day i.v. for 5 days. TCD of granulocyte colony-stimulating factor-mobilized PBSCs grafts was performed as described previously [8, 9]. Ex vivo CD34+ selection of hematopoietic progenitor cells was performed using one of two methods as previously described: Isolex 300i Magnetic Cell Separator (Baxter, Deerfield, IL), followed by T cell rosetting with sheep erythrocytes (9 patients), or using the CliniMACS® CD34+ Reagent System (Miltenyi Biotech, Gladbach, Germany). All patients received either equine or rabbit anti-thymocyte globulin (ATG). Patients did not receive any other post-transplant immunosuppressive prophylaxis. All patients received supportive care and prophylaxis against opportunistic infections according to standard guidelines [10].

Post-transplant scoring system (PTSS) and hematopoietic cell transplantation comorbidity index (HCT-CI)

The PTSS score assigns 1 point for grade II acute GVHD, 2 points for grade III/IV acute GVHD, 2 points for lack of platelets recovery before day 100, and 4 points for relapse. GVHD was diagnosed clinically, confirmed pathologically whenever possible, and classified according to standard criteria [11, 12]. Lack of platelets recovery until day 100 was defined as the absence of platelets >20,000/μl or need for transfusion until day 100. Relapse or progression was identified by conventional diagnostic methods. Classification of patients into risk groups was the same as the original study [6]: the point score model ranged from 0 to 8 points, discriminating low- (0), intermediate- (1–3), and high-risk (4–8) patients. PTSS was also analyzed as a two-level variable with patients with scores <1 categorized as low, and patients ≥1 categorized as intermediate/high. The HCT-CI was calculated as originally described [13, 14] and patients were classified in two ways: (1) low (0–1), intermediate (2–3), and high (>3) and (2) low/intermediate (0–3) and high (>3).

Endpoints and statistical analysis

The primary endpoint of the study was OS, defined as time from landmark date (100, 180, or 365 days post HCT) until death from any cause or the last follow-up. Kaplan–Meier curves were used to estimate event-time distribution for OS endpoint. To evaluate the association between PTSS score and OS, a landmark analysis was performed at day 100, 180, and 365 post HCT. Patients who died before the landmark timepoint (death before day 100, 180, and 365) were excluded. To evaluate the association between the HCT-CI and OS, univariate cox proportional hazards models were fit from landmark timepoints at baseline (day 0) and day 100, 180, and 365. For exploratory purposes, multivariate cox proportional hazard models were used to investigate if PTSS score was prognostic after adjusting for the HCT-CI. Univariate and multivariate cox proportional hazard models were fit using each respective landmark timepoint as baseline to obtain hazard ratio (HR) estimates and corresponding confidence intervals (CIs) of OS. Secondary endpoints included cumulative incidence of relapse and cumulative incidence of acute GvHD. Cumulative incidence of relapse was estimated considering non-relapse mortality (NRM) as a competing risk. Cumulative incidence of acute GVHD (≥grade 2) was estimated with death without prior relapse, or relapse considered as competing risk events. Descriptive statistics were used to summarize patient characteristics. All statistical analyses were performed using R version 3.6.2.

Results

Patients’ characteristics

One hundred and nine patients with MDS met the inclusion criteria and constituted the study population. Median age was 61 years (range 20–72), 64 patients (59%) were male, and 89 (82%) had a matched related or unrelated donor. Table 1 describes the baseline characteristics and early post-transplant complications of the cohort.

Table 1 Characteristics of the transplant and post-transplant scoring system complications.

Patients were distributed per the WHO classification 2008: refractory anemia with excess of blasts type 2 (RAEB-2) = 28 (25.7%), refractory anemia with excess of blasts type 1 (RAEB-1) = 34 (31.2%), refractory cytopenia with unilineage dysplasia (RCUD), refractory anemia with ringed sideroblasts (RARS), refractory cytopenia with multilineage dysplasia (RCMD) = 39 (35.8%), MDS with isolated del(5q) 4 (3.7%), and unclassified = 4 (3.7%).

Most patients had received prior hypomethylating agents (decitabine or azacitidine), including six patients treated on a clinical trial of ATRA + decitabine (NCT00382200). Patients with del5q and transfusion-dependent anemia were also treated with lenalidomide, and one patient received ATG. The remaining patients proceeded directly to transplant, as they were classified as high risk by cytogenetics with <10% blasts in bone marrow. Over 70% patients responded to the pre-HCT treatments and most had less than 5% bone marrow blasts pre-HCT, although only 12 patients achieved a complete remission pre-transplant.

Engraftment and graft-versus-host disease

Successful engraftment was observed in 107 patients. Only 2 patients (1.8%) did not recover their platelet count before day 100. The cumulative incidence of grade II–IV acute or late acute GVHD at day 100 and day 180 were 19% (95% CI 12–27%) and 24.84% (95% CI 17–33%), respectively (Fig. 1a). Before the 100-day landmark, 22 patients had developed grade I, 13 had developed grade II, and 3 grade III GVHD. Before the 1-year landmark, 20 patients had developed grade I, 13 grade II, and 11 grade III-IV GVHD. The remaining patients (n = 65) had no GVHD at 1 year.

Fig. 1: Transplant outcomes in patients with MDS.
figure 1

a Cumulative incidence of GVHD grade II–IV. b Cumulative incidences of relapse and NRM.

Relapse/progression and non-relapse mortality

The 3-year cumulative incidence of relapse was 17.1% (95% CI 10.5–24.9%) with a median time to relapse after transplant of 9 months (range, 3–90 months). Among 23 patients with relapse events, 6 were alive at last follow-up (Fig. 1b). Among patients who relapsed during the first year, one relapsed before day 100, 4 before day 180 landmark, and 14 before day 365. The 3-year cumulative incidence of NRM was 20% (95% CI 13–28%). At last follow-up, relapse was the most common cause of death, GVHD was the second most common cause of death, followed closely by infection.

Pre-transplant HCT-CI score and PTSS

The median HCT-CI score was 2. Thirty-six patients (33%) had a low, 40 (37%) an intermediate, and 33 (30%) a high HCT-CI score. Twenty-one patients had an HCT-CI score ≥5 (19.3%). Sixteen of the 17 comorbidities included in the HCT-CI were present in at least one patient, except for renal comorbidity. The most frequent comorbidities included pulmonary impairment (moderate, n = 21; severe, n = 15), prior solid tumor (22 patients, 19.3%), psychiatric disorders (21 patients, 19.3%), and mild hepatic impairment (19 patients, 17.5%). Since the HCT-CI was only assessed at baseline, we also examined the KPS at the landmark timepoints (day 180 and day 365). At day 180, KPS ranged from 20 to 90, with 30 of 102 patients (30%) having a KPS < 70 and 71 patients (70%) having KPS > 70. KPS was missing in one patient. At day 365, KPS ranged from 30 to 100, with 17 of 90 patients (19%) having a KPS < 70 and 73 patients (81%) having KPS > 70.

At day 100, 90 patients (82.6%) had a low risk PTSS score, 18 patients (16.5%) an intermediate risk, and only 1 patient (0.9%) was classified as high risk. At post-transplant day 180, 72 of 102 patients at this time (71.3%) had a PTSS of low risk, 25 patients (24.5%) an intermediate score, and 4 patients (3.9%) a high-risk score. The score was missing in one patient at this timepoint. At the day 365 landmark, 55 of 90 patients still on study were low risk (61.1%), 26 patients (28.9%) were intermediate risk, and 9 (10.0%) were high risk.

Overall survival

With a median follow-up of 59 months (95% CI 56, 66), 1- and 3-year OS in the overall population were 85% (95% CI 79, 92) and 67% (95% CI 58,77), respectively. Seventy patients (64.2%) were alive at last follow-up. No patients died or were lost to follow-up before day 100. Six patients died, one patient was lost to follow-up before the 180-day landmark timepoint, and one patient did not have data available for scoring at day 180. By 365 days, ten additional patients died and two were lost to follow-up. Among the 55 low-risk patients who were included in the 365-day landmark analysis, the median follow-up was 65.6 months (95% CI 58.8–76.2 months) from time of transplant. Eight patients in this group died after day 365. The range of follow-up among those who did not have an event is 16.4–114.7 months from transplant.

OS did not significantly differ between HCT-CI groups at baseline (day 0; log rank p = 0.53; Fig. 2). Similarly, there was no association between HCT-CI and OS at any of the other landmark timepoints (day 100, 180, and 365) (Table 2). Additional analysis using a modified two-group scoring system of low-risk (0–3) and high-risk (>3) HCT-CI groups did not show any difference.

Fig. 2
figure 2

Overall survival in patients with MDS based on HCT-CI subgroups at baseline.

Table 2 Univariate cox proportional hazard models.

Due to limited events, we categorized KPS into two groups (<70 and >70) to limit multivariate models to three variables. There was no association between KPS and OS and day 180 (HR 0.63 [95% CI 0.30–1.32] p value = 0.2), but there was a significant association at day 365 (HR 0.24 [95% CI 0.10–1.57] p value = 0.001). Overall, the strength and direction of associations do not change greatly after adjusting for KPS at these timepoints.

Because of limited number of events in some of the landmark groups, PTSS was analyzed as a two-level variable: low (0), and intermediate–high (≥1). Using the PTSS at day 180, OS was significantly lower in the intermediate/high-risk group (HR 3.25 [95% CI 1.60, 6.60] p value 0.001) (Fig. 3a) compared to the low-risk group. Similarly, when assessed at day 365 intermediate/high-risk group, PTSS was significantly associated with shorter OS (HR 5.42 [95% CI 2.21, 13.3] p value <0.001) compared to the low-risk group (Fig. 3b). PTSS score calculated at day 100 was not associated with survival (HR of intermediate/high-risk group: 1.89 [0.89, 4.01] p value 0.10). In the multivariate analyses, intermediate/high PTSS scores remained significantly associated with shorter OS, after adjusting for comorbidity index (Table 3).

Fig. 3: Overall survival in patients with MDS based on PTSS subgroups.
figure 3

a OS with landmark analysis at day 180. b OS with landmark analysis at day 365.

Table 3 Multivariate cox proportional hazard models.

Discussion

There are limited data regarding post-transplant prognostic evaluation [15,16,17]. The SFGM-TC group proposed and validated the PTSS, showing that post-transplant complications were the only independent risk factors associated with decreased OS, while this was not the case for pre-transplant factors. In our cohort of 109 patients with MDS who received myeloablative CD34-selected HCT, patients with higher PTSS scores at 6 months and 1 year after transplant had a significantly lower OS compared to patients with lower scores, thus confirming the results of the original study. Patients included in the study had similar baseline characteristics to those in the French study in terms of age, gender, WHO category, pre-transplantation therapy, disease status at transplant, and donor and recipient CMV status. In addition to using a different transplant approach, other differences included a higher number of patients with less than 5% blasts in the bone marrow at time of transplant (82%) and higher number of HLA matching in our population.

The PTSS was originally developed to estimate the survival probability of patients with MDS who survive more than 100 days after HCT. However, the day 100 landmark PTSS was not associated with OS in our patient cohort. Possible explanations for these differences include the fact that we observed a lower incidence of acute and chronic GVHD in the TCD group compared to what is seen in unmodified grafts  [18,19,20,21,22,23,24]. We have previously reported cumulative incidences of grades II–IV acute GVHD of 9.8% at day 100 and 15.7% at day 180 in patients with advanced MDS who received a CD34-selected allo-HCT [3]. In contrast in the French study, 38.2% of the patients had a grade II–IV GVHD before day 100, while only 19% had grade II–IV GVHD at day 100 in our cohort.

Furthermore, the use of myeloablative conditioning regimens in recipients of CD34-selected transplants results in a significantly lower incidence of overall relapse and progression. In the French study, 8.9% patients relapsed before day 100 while just one patient (0.9%) relapsed or progressed in our study at the same timepoint. These significant differences in both GVHD and relapse may account for the fact that the day 100 score was not correlated with survival in the TCD group.

Interestingly, the HCT-CI was not predictive of OS in our study, even when we used a modified two-group scoring system of low-risk (0–3) and high-risk (>3) HCT-CI groups, which has been used in previous studies to stratify patients within a small sample size. The HCT-CI score is routinely used in clinical practice and has been extensively validated as a robust predictor of NRM and survival, including in patients receiving CD34+-selected grafts [25, 26]. Both of these prior reports from our center included larger cohorts of patients as well as patients with acute leukemia. While we cannot exclude that further discrimination would be detected in a larger cohort, a potential interpretation is that pre-existing comorbidities are less relevant in terms of OS in MDS patients who underwent TCD transplants.

Robust statistical models of post-transplant complications should guide decisions for each patient but for now, this new score can be helpful in making decisions after transplant. The prompt identification of high-risk PTSS patients should be targeted for interventions as soon as possible to reduce the likelihood of death. For example, strategies that mitigate relapse, such as the use of post-transplant hypomethylating agents, could be used in a risk-stratified manner based on the PTSS.

Similar to other landmark analyses, the PTSS can provide more accurate information on expected outcomes for patients who reach the landmark. Furthermore, the PTSS may also be helpful in making decisions in the setting of potentially life-threatening complications unrelated to transplant. For example, during the COVID-19 pandemic, patients, families, and the medical team are often faced with having to make decisions regarding withholding or withdrawing life support, or using aggressive and/or investigational treatments in patients who developed COVID-19 infection after a bone marrow transplant. The excellent survival in patients with a low PTSS score (estimated 3-year OS of 70%) would argue for aggressive care in this setting.

The PTSS was originally developed in patients with MDS, and we now validate its use in a different transplant approach. However, it would be interesting to examine whether it may be applicable in other diseases such as AML. Although the multivariate model in the original study included several variables including early relapse, the only independent risk factors associated significantly with 3-year OS were the post-transplant complications. In contrast, early relapse in AML is typically associated with dismal survival. As a result, a score that includes relapse as well as post-HCT complications is likely to be of lesser prognostic value in AML. The more chronic nature of MDS may in part explain these likely differences. However, the value or lack of value of the PTSS in diseases other than MDS would need to be established in more formal studies.

Even though our study largely confirms the results of the previous study, there are certain limitations. The main one is the risk of selection or information bias that can be seen on in any retrospective study.

We conclude that, in patients with MDS undergoing TCD-HCT, the PTSS scores at day 180 and day 365 are significantly associated with OS. The lower incidence of acute GVHD in recipients of CD34-selected transplants and the use of myeloablative condition regimens, with lower early relapse rates, likely explain differences with the original finding by the French group that the PTSS was predictive of survival at day 100 in unmodified grafts. Further studies including prospective evaluation are needed to validate these results, as well as the use of these tools in prospective clinical trials that seek to improve outcomes in patients with higher PTSS scores.