Have we been qualifying measurable residual disease correctly?

Feng, Yahui; Qi, Saibing; Liu, Xueou; Zhang, Li; Hu, Yu; Shen, Qiujin; Gong, Xiaowen; Zhang, Wei; Wang, Junxia; Yan, Wen; Wang, Tiantian; Wang, Huijun; Song, Zhen; Zhu, Xiaofan; Gale, Robert Peter; Chen, Junren

doi:10.1038/s41375-023-02026-4

Download PDF

Perspective
Open access
Published: 13 September 2023

ACUTE LYMPHOBLASTIC LEUKEMIA

Have we been qualifying measurable residual disease correctly?

Yahui Feng^1,2^na1,
Saibing Qi^1,2^na1,
Xueou Liu^1,2^na1,
Li Zhang^1,2^na1,
Yu Hu^1,2,
Qiujin Shen^1,2,
Xiaowen Gong^1,2,
Wei Zhang^1,2,
Junxia Wang^1,2,
Wen Yan^1,2,
Tiantian Wang^1,2,
Huijun Wang^1,2,
Zhen Song^1,2,
Xiaofan Zhu ORCID: orcid.org/0000-0002-2572-6495^1,2,
Robert Peter Gale ORCID: orcid.org/0000-0002-9156-1676³ &
…
Junren Chen ORCID: orcid.org/0000-0003-3691-4931^1,2

Leukemia volume 37, pages 2168–2172 (2023)Cite this article

2703 Accesses
5 Citations
1 Altmetric
Metrics details

Subjects

Someone told me that each equation I included in the book would halve the sales. I therefore resolved not to have any equations at all. In the end, however, I did put in one equation, Einstein’s famous equation, E = m c squared. I hope that this will not scare off half of my potential readers.

Stephen Hawking

Introduction

There is considerable interest in tests quantifying remaining leukaemia cells after therapy, termed measurable residual disease (MRD)-tests, to predict therapy outcomes, leukaemia recurrence and consider potential subsequent interventions [1,2,3,4,5,6,7,8,9,10]. Many studies reported a negative MRD-test during or after completing anti-leukaemia therapy independently identifies persons with a low risk of leukaemia relapse compared with those with a positive MRD-test after adjusting for other predictive and prognostic co-variates [5, 11,12,13,14,15,16]. Other studies recommend specific interventions in someone with a positive MRD-test such as a haematopoietic cell transplant or immune therapy such as chimaeric antigen receptor (CAR)-T-cells. Whether such interventions reduce leukaemia relapse risk in someone with a positive MRD-test can only be proved in a randomized controlled trial [8, 17].

Most MRD-tests focus on detecting a leukaemia-related or -specific immune phenotype, cytogenetic and/or molecular abnormality [1, 2, 18,19,20,21,22,23,24,25]. A perfect MRD-test would precisely quantify only leukaemia cells biologically capable of causing leukaemia relapse and likely to do so within a defined interval after accounting for competing causes of therapy-failure [7, 8]. Routine clinical use of MRD-testing requires refinements and standardization/harmonization of assay platforms and result reporting [1, 2, 21,22,23].

There is consensus a flow cytometry-based MRD-test should be reproducible at a limit of detection (LoD) of ≤0.01% leukaemia cells in a blood or bone marrow sample [26]. Based on this reasoning it is proposed a multi-parameter flow cytometry (MPFC)-based MRD-test should only be declared positive if ≥ \(5\times 10{{\mbox{E+5}}}\) cells are analysed and if ≥20 or ≥50 cells are positive [27,28,29,30]. However, this definition is often unmet in clinical practice. For example, modern MRD-directed, risk-stratified approach to treating childhood acute lymphoblastic leukaemia (ALL) requires an MPFC-based MRD-test done in bone marrow aspirate 2–3 weeks after starting induction chemotherapy, a time when collecting > \(5\times 10{{\mbox{E+5}}}\) bone marrow mononuclear cells is difficult [31, 32]. The same limitation operates in adults receiving intensive induction chemotherapy. How should a physician use results of MRD-testing in these settings?

Tyranny of sampling error

Assume in an MPFC-based MRD-test \(N\) cells are analysed out of which \(n\) cells are identified as leukaemia cells. By leukaemia cells we mean cells with immune phenotype of the leukaemia, not necessarily cells able to cause relapse within a defined interval. The conventional way to estimate MRD is \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}=\frac{n}{N}\) [33, 34].

When the true proportion of leukaemia cells (“true MRD”) is < \(\frac{1}{N}\), the standard error of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) has a magnitude even larger than true MRD because of sampling error (Supplementary Methods). Simply put, the \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) test can be very imprecise.

To better appreciate the tyranny of sampling error consider the hypothetical example of a haematologist reviewing the following MRD-test result: \(N=50000\) and \(n=0\). Analysing these few cells is not uncommon in practice for reasons we discussed above. Using the conventional approach to quantifying MRD the haematologist interprets this MRD-test as \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}=\frac{0}{50000}=0 \% .\) In doing so the haematologist fails to appreciate the result of this MRD-test is compatible with a broad range of true MRD values. In reality, the haematologist can only conclude MRD-test result is \(\le\!\!0.006 \%\) with a 5-percent probability true MRD is actually \( > 0.006 \%\).

Using Bayesian reasoning, the worst-case (probability <0.05)^{Footnote 1} scenario estimate of MRD, which we denote as \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\), can be computed using a beta distribution (the formula is “BETA.INV (0.95, \(1+n\), \(1+N-n\))” in Microsoft Excel; Supplementary Methods) [35].

Table 1 displays the extent to which \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) under-estimates true MRD at different values of \(N\) in the worst-case scenario (that is, by how much \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) under-estimates \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\)). Note that when \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is \(\le 0.01 \%\) \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is considerably larger than \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) across a broad range of \(N\) values. Conversely, when \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is \(\ge 0.1 \%\) \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is usually very close to \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) unless the number of analysed cells N is < 10E+5.

Table 1 To what extent \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) under-estimates true MRD at different numbers of analysed cells N in the worst-case scenario.

Full size table

Typically result of an MRD-test is interpreted as positive or negative based on applying a cut-off threshold to \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\). Our analysis of the adverse impact of sampling error (Table 1) suggests any cut-off threshold <0.01% used in \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) would yield unreliable results with many false-negatives. Moreover, when estimating the hazard function of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) for leukaemia relapse risk false-negatives would cause “flattening” of the estimated curve because the contrast between MRD-positives and -negatives is attenuated by contamination of false-negative MRD-test results.

Borrowing lessons from decision science

How to solve this problem when an inaccurate false-negative test result could have adverse clinical consequences? We propose the haematologist should instead rely on \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) rather than \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) to estimate relapse risk.

Our reasoning follows. When interpreting an MRD-test result to predict relapse the haematologist is essentially playing a “chess game against nature”. It’s his/her 1st move to make, declaring the MRD-test result positive or negative. In response the opponent (nature) has two possible moves, causing relapse or not. When \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is larger the haematologist is more likely to later regret if he/she declares the MRD-test result negative, because more plausibly nature would play tricks on the haematologist by causing relapse.

Ranking of people’s test results based on \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) from high to low values minimises the sum of regrets in the worst-case scenario because people whose MRD-test results are more likely to cause regret in case of a negative interpretation are already considered to have a higher risk of relapse. In the language of decision science, \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is a minimax regret approach to quantifying MRD test results according to Leonard Savage’s theory of statistical decision or Herbert Simon’s theory of rational choice under uncertainty [36, 37].

A clinical example

To illustrate using \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) to interpret test results we interrogated data from 883 consecutive children with ALL <16 years (Supplementary Fig. 1; Supplementary Table 1; and Supplementary Methods). The subjects were treated on the Chinese Children’s Cancer Group study ALL-2015 (CCCG-ALL-2015) protocol [32]. 618 (70%) and 265 (30%) of the children were low- and intermediate-risk at diagnosis according to the CCCG-ALL-2015 criteria. MPFC-based MRD-testing was done on bone marrow samples 19 days after starting therapy. Median number of analysed cells (\(N\)) was 4 × 10E+5 (Interquartile Range [IQR], 2.4–5.0 × 10E+5; Range, 3.4 × 10E+3 to 1.0 × 10E+6). 686 (78%) MRD-tests analysed <5 × 10E+5 cells, a threshold stipulated by guideline for good laboratory practice (GLP) [27, 28, 30].

294 (33%) children had \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}} < 0.01 \%\) on day 19, 274 (93%) of whom had zero values (i.e. no leukaemia cell was detected [\(n=0\)]). The remainder (20 [7%]) had 8–24 leukaemia cells detected. Because most children with \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}} < 0.01 \%\) had no leukaemia cells detected in the sample, \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) could not identify relative relapse risk in these children. The C-statistic (the probability of pairwise agreement with relapse time [38]) of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) (0.57) was significantly higher (P <0.001; 2-sided Wilcoxon test on 500 bootstrap samples [39]) compared with C-statistic of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) (0.50). In short, \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) was a better predictor of relapse than \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) when \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) was close to zero (Fig. 1A). In contrast, for the 589 (67%) children who had \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\ge 0.01 \%\) on day 19, C-statistics of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) (0.58) and \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) (0.58) were similar (P = 0.61).

**Fig. 1: Using MRD_{worse_case} in a cohort of children with ALL.**

We estimated non-linear hazard functions of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) and \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) for relapse by fitting restricted cubic spline curves using Markov chain Monte Carlo [40,41,42,43]. Since \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is always larger than \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\), all else being equal, switching from \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) to \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) should induce a right-shift of the hazard function curve. Instead, we observed the hazard function of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) rose more steeply than the hazard function of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) (Fig. 1B). Inaccuracies in MRD-estimation using the conventional approach distorted the critical range of MRD for discriminating low- from high-risks of cumulative incidence of relapse (CIR).

Combining \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) on day 19 with estimated relapse risk at diagnosis further improved risk-stratification of the children whose \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) on day 19 was \( < 0.01 \%\) with a C-statistic of 0.73. This was significantly better than using \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) alone (0.73 vs. 0.57 [P < 0.001; 2-sided Wilcoxon test on 500 bootstrap samples]) or using relapse risk at diagnosis alone (0.73 vs. 0.68 [P < 0.001]; Fig. 1C). 214 children (73%) with \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}} < 0.01 \%\) on day 19 were low-risk at diagnosis and all subsequently received low-intensity therapy. The remainder (80 [27%]) were intermediate-risk at diagnosis and all received high-intensity therapy. Consequently, therapy-intensity did not confound results within each therapy cohort.

Interestingly, point-estimates for relapse at 1.5 years for high- and low-\({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) cohorts were similar and their relapse curves only diverged after 1.5 years (Fig. 1A, C). Because \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) corrected for (probable) under-sampling of leukaemia cells at therapy start this divergence likely resulted from expansion of pre-existing sub-clones during and/or after the end of low-intensity maintenance therapy (54 to 125 weeks) [32].

Is MRD_{worst_case} an index or a metric for MRD?

Index is defined as a number (such as a ratio) derived from a series of observations and used as an indicator or measure. Metric is defined as a standard of measurement. Some may argue \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) is an index for MRD whilst \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}=\frac{n}{N}\) is a metric. The distinction between index and metric is in some measure semantic. Even \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is a statistical construct for estimating likelihood of relapse. \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is what statisticians call a maximum-likelihood estimate, which is not the same as an estimate for the median (i.e. 50th-percentile) value among all the possible values of true MRD conditional on test result (Supplementary Methods). When \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is zero \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) is actually the 0th-percentile (i.e. the lowest possible) value among all the possible values of true MRD conditional on test result! \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\), on the other hand, is the 95th-percentile value among all the possible values of true MRD conditional on test result.

Discussion

In this Perspective we argue the consensus GLP of MRD-testing is sub-optimal in many instances. Under these circumstances \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) test results are sometimes mis-leading. Our analyses of data from a large cohort of childhood ALL indicates the minimax regret approach (\({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\)) improves relapse risk prediction over the current method (\({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\)). \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) corrects for variation in strength of evidence in MRD-tests when predicting leukaemia relapse. Moreover, non-linear modeling of \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) hazard function uncovers the critical range of MRD wherein the risk of leukaemia relapse accelerates. Because the true hazard function curve is steeper and operates at a lower range of MRD than previously realised based on \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}\) it is important to continue developing and using increasingly sensitive (and specific) assays for detecting residual leukaemia cells.

We acknowledge several limitations. Our analyses of the clinical data were retrospective and subject to bias. We focused on MPFC, which enumerates mostly live cells one-by-one and is distinct from other types of assays such as quantitative real time polymerase chain reaction (RT-qPCR) or next generation sequencing (NGS). We also did not analyse false-positive errors in MRD-tests, which are more likely a biological than statistical issue as many or perhaps most false-positives are caused by not knowing which leukaemia cells have the biological ability to cause relapse within an observation interval [44,45,46]. In MPFC some aberrant leukaemia phenotypes may be more confidently identified as positive compared with others. Consequently, further refinement of results of MRD-testing is possible. Also, molecular tests such as NGS may increase accuracy of identifying residual leukaemia cells [8, 47]. However, sampling error remains an inherent limitation for any MRD-test as does the current inability to identify leukaemia cells biologically able to cause relapse regardless of detection technology.

We suggest our proposed metric \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{worst}}}}}}\_{{{{{\rm{case}}}}}}}\) will help haematologists more accurately predict leukaemia relapse. It is possible to further improve accuracy of predicting leukaemia relapse by considering additional data beyond MRD-tests provided confounding predictive and prognostic co-variates are adjusted for and the therapy regimen is considered.

Data availability

Clinical data are available upon reasonable request to the corresponding authors.

Notes

Strictly speaking, the worst possible value of true MRD is always ≈1 even when \({{{{{{\rm{MRD}}}}}}}_{{{{{{\rm{conventional}}}}}}}=0\). (The chance of true MRD ≈1 might be practically zero but the probability of this unlikely event is never zero.) Defining the worst-case scenario estimate as “not likely (probability ≤ 0.05) to exceed this value” is more useful for comparing MRD-test results.

References

Lucio P, Parreira A, van den Beemd MW, van Lochem EG, van Wering ER, Baars E, et al. Flow cytometric analysis of normal B cell differentiation: a frame of reference for the detection of minimal residual disease in precursor-B-ALL. Leukemia. 1999;13:419–27.
Article PubMed CAS Google Scholar
Gabert J, Beillard E, van der Velden VH, Bi W, Grimwade D, Pallisgaard N, et al. Standardization and quality control studies of ‘real-time’ quantitative reverse transcriptase polymerase chain reaction of fusion gene transcripts for residual disease detection in leukemia - a Europe Against Cancer program. Leukemia. 2003;17:2318–57.
Article PubMed CAS Google Scholar
Mullighan CG, Goorha S, Radtke I, Miller CB, Coustan-Smith E, Dalton JD, et al. Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia. Nature. 2007;446:758–64.
Article PubMed CAS Google Scholar
Vardiman JW, Thiele J, Arber DA, Brunning RD, Borowitz MJ, Porwit A, et al. The 2008 revision of the World Health Organization (WHO) classification of myeloid neoplasms and acute leukemia: rationale and important changes. Blood. 2009;114:937–51.
Article PubMed CAS Google Scholar
Conter V, Bartram CR, Valsecchi MG, Schrauder A, Panzer-Grumayer R, Moricke A, et al. Molecular response to treatment redefines all prognostic factors in children and adolescents with B-cell precursor acute lymphoblastic leukemia: results in 3184 patients of the AIEOP-BFM ALL 2000 study. Blood. 2010;115:3206–14.
Article PubMed CAS Google Scholar
Papaemmanuil E, Gerstung M, Bullinger L, Gaidzik VI, Paschka P, Roberts ND, et al. Genomic classification and prognosis in acute myeloid leukemia. N. Engl J Med. 2016;374:2209–21.
Article PubMed PubMed Central CAS Google Scholar
Estey E, Gale RP. How good are we at predicting the fate of someone with acute myeloid leukaemia? Leukemia. 2017;31:1255–8.
Article PubMed CAS Google Scholar
Hourigan CS, Gale RP, Gormley NJ, Ossenkoppele GJ, Walter RB. Measurable residual disease testing in acute myeloid leukaemia. Leukemia. 2017;31:1482–90.
Article PubMed CAS Google Scholar
Ceppi F, Rizzati F, Colombini A, Conter V, Cazzaniga G. Utilizing the prognostic impact of minimal residual disease in treatment decisions for pediatric acute lymphoblastic leukemia. Expert Rev Hematol. 2021;14:795–807.
Article PubMed CAS Google Scholar
Dohner H, Wei AH, Appelbaum FR, Craddock C, DiNardo CD, Dombret H, et al. Diagnosis and management of AML in adults: 2022 recommendations from an international expert panel on behalf of the ELN. Blood. 2022;140:1345–77.
Article PubMed Google Scholar
Terwijn M, van Putten WL, Kelder A, van der Velden VH, Brooimans RA, Pabst T, et al. High prognostic impact of flow cytometric minimal residual disease detection in acute myeloid leukemia: data from the HOVON/SAKK AML 42A study. J Clin Oncol. 2013;31:3889–97.
Article PubMed Google Scholar
Freeman SD, Virgo P, Couzens S, Grimwade D, Russell N, Hills RK, et al. Prognostic relevance of treatment response measured by flow cytometric residual disease detection in older patients with acute myeloid leukemia. J Clin Oncol. 2013;31:4123–31.
Article PubMed Google Scholar
Chen X, Xie H, Wood BL, Walter RB, Pagel JM, Becker PS, et al. Relation of clinical response and minimal residual disease and their prognostic impact on outcome in acute myeloid leukemia. J Clin Oncol. 2015;33:1258–64.
Article PubMed Google Scholar
Othus M, Wood BL, Stirewalt DL, Estey EH, Petersdorf SH, Appelbaum FR, et al. Effect of measurable (‘minimal’) residual disease (MRD) information on prediction of relapse and survival in adult acute myeloid leukemia. Leukemia. 2016;30:2080–3.
Article PubMed PubMed Central CAS Google Scholar
Berry DA, Zhou S, Higley H, Mukundan L, Fu S, Reaman GH, et al. Association of minimal residual disease with clinical outcome in pediatric and adult acute lymphoblastic leukemia: a meta-analysis. JAMA Oncol. 2017;3:e170580.
Article PubMed PubMed Central Google Scholar
Dillon LW, Gui G, Page KM, Ravindra N, Wong ZC, Andrew G, et al. DNA sequencing to detect residual disease in adults with acute myeloid leukemia prior to hematopoietic cell transplant. JAMA. 2023;329:745–55.
Article PubMed PubMed Central CAS Google Scholar
Campbell M, Kiss C, Zimmermann M, Riccheri C, Kowalczyk J, Felice MS, et al. Childhood acute lymphoblastic leukemia: results of the randomized acute lymphoblastic leukemia Intercontinental-Berlin-Frankfurt-Munster 2009 Trial. J Clin Oncol. 2023;41:3499–511.
Article PubMed CAS Google Scholar
Beillard E, Pallisgaard N, van der Velden VH, Bi W, Dee R, van der Schoot E, et al. Evaluation of candidate control genes for diagnosis and residual disease detection in leukemic patients using ‘real-time’ quantitative reverse-transcriptase polymerase chain reaction (RQ-PCR) - a Europe against cancer program. Leukemia. 2003;17:2474–86.
Article PubMed CAS Google Scholar
van der Velden VH, Hochhaus A, Cazzaniga G, Szczepanski T, Gabert J, van Dongen JJ. Detection of minimal residual disease in hematologic malignancies by real-time quantitative PCR: principles, approaches, and laboratory aspects. Leukemia. 2003;17:1013–34.
Article PubMed Google Scholar
Loken MR, Alonzo TA, Pardo L, Gerbing RB, Raimondi SC, Hirsch BA, et al. Residual disease detected by multidimensional flow cytometry signifies high relapse risk in patients with de novo acute myeloid leukemia: a report from Children’s Oncology Group. Blood. 2012;120:1581–8.
Article PubMed PubMed Central Google Scholar
Kalina T, Flores-Montero J, van der Velden VH, Martin-Ayuso M, Bottcher S, Ritgen M, et al. EuroFlow standardization of flow cytometer instrument settings and immunophenotyping protocols. Leukemia. 2012;26:1986–2010.
Article PubMed PubMed Central CAS Google Scholar
van Dongen JJ, Lhermitte L, Bottcher S, Almeida J, van der Velden VH, Flores-Montero J, et al. EuroFlow antibody panels for standardized n-dimensional flow cytometric immunophenotyping of normal, reactive and malignant leukocytes. Leukemia. 2012;26:1908–75.
Article PubMed PubMed Central Google Scholar
Grimwade D, Freeman SD. Defining minimal residual disease in acute myeloid leukemia: which platforms are ready for “prime time”? Blood. 2014;124:3345–55.
Article PubMed CAS Google Scholar
Ladetto M, Bruggemann M, Monitillo L, Ferrero S, Pepin F, Drandi D, et al. Next-generation sequencing and real-time quantitative PCR for minimal residual disease detection in B-cell disorders. Leukemia. 2014;28:1299–307.
Article PubMed CAS Google Scholar
Pulsipher MA, Carlson C, Langholz B, Wall DA, Schultz KR, Bunin N, et al. IgH-V(D)J NGS-MRD measurement pre- and early post-allotransplant defines very low- and very high-risk ALL patients. Blood. 2015;125:3501–8.
Article PubMed PubMed Central CAS Google Scholar
Saygin C, Cannova J, Stock W, Muffly L. Measurable residual disease in acute lymphoblastic leukemia: methods and clinical context in adult patients. Haematologica. 2022;107:2783–93.
Article PubMed PubMed Central CAS Google Scholar
Roschewski M, Stetler-Stevenson M, Yuan C, Mailankody S, Korde N, Landgren O. Minimal residual disease: what are the minimum requirements? J Clin Oncol. 2014;32:475–6.
Article PubMed Google Scholar
Schuurhuis GJ, Heuser M, Freeman S, Bene MC, Buccisano F, Cloos J, et al. Minimal/measurable residual disease in AML: a consensus document from the European LeukemiaNet MRD Working Party. Blood. 2018;131:1275–91.
Article PubMed PubMed Central CAS Google Scholar
Paiva B, Puig N, Cedena MT, Rosinol L, Cordon L, Vidriales MB, et al. Measurable residual disease by next-generation flow cytometry in multiple myeloma. J Clin Oncol. 2020;38:784–92.
Article PubMed CAS Google Scholar
Buccisano F, Palmieri R, Piciocchi A, Arena V, Maurillo L, Del Principe MI, et al. Clinical relevance of an objective flow cytometry approach based on limit of detection and limit of quantification for measurable residual disease assessment in acute myeloid leukemia. A post-hoc analysis of the GIMEMA AML1310 trial. Haematologica. 2022;107:2823–33.
Article PubMed PubMed Central CAS Google Scholar
Jeha S, Pei D, Choi J, Cheng C, Sandlund JT, Coustan-Smith E, et al. Improved CNS control of childhood acute lymphoblastic leukemia without cranial irradiation: St Jude Total Therapy Study 16. J Clin Oncol. 2019;37:3377–91.
Article PubMed PubMed Central CAS Google Scholar
Yang W, Cai J, Shen S, Gao J, Yu J, Hu S, et al. Pulse therapy with vincristine and dexamethasone for childhood acute lymphoblastic leukaemia (CCCG-ALL-2015): an open-label, multicentre, randomised, phase 3, non-inferiority trial. Lancet Oncol. 2021;22:1322–32.
Article PubMed PubMed Central CAS Google Scholar
Theunissen P, Mejstrikova E, Sedek L, van der Sluijs-Gelling AJ, Gaipa G, Bartels M, et al. Standardized flow cytometry for highly sensitive MRD measurements in B-cell acute lymphoblastic leukemia. Blood. 2017;129:347–57.
Article PubMed PubMed Central CAS Google Scholar
Modvig S, Hallbook H, Madsen HO, Siitonen S, Rosthoj S, Tierens A, et al. Value of flow cytometry for MRD-based relapse prediction in B-cell precursor ALL in a multicenter setting. Leukemia. 2021;35:1894–906.
Article PubMed CAS Google Scholar
Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian Data Analysis. 3rd ed. Boca Raton, FL: Chapman and Hall/CRC; 2013.
Simon HA. A behavioral model of rational choice. Q J Econ. 1955;69:99–118.
Article Google Scholar
Savage LJ. The theory of statistical decision. J Am Stat Assoc. 1951;46:55–67.
Article Google Scholar
Pencina MJ, D’Agostino RB Sr. Evaluating discrimination of risk prediction models: the C statistic. JAMA. 2015;314:1063–4.
Article PubMed CAS Google Scholar
Efron B. Bootstrap methods: another look at the jackknife. Ann Stat. 1979;7:1–26.
Article Google Scholar
Kirkpatrick S, Gelatt CD Jr, Vecchi MP. Optimization by simulated annealing. Science. 1983;220:671–80.
Article PubMed CAS Google Scholar
Green PJ, Silverman BW. Nonparametric regression and generalized linear models: a roughness penalty approach. London: Chapman & Hall; 1994.
Gauthier J, Wu QV, Gooley TA. Cubic splines to model relationships between continuous variables and outcomes: a guide for clinicians. Bone Marrow Transpl. 2020;55:675–80.
Article CAS Google Scholar
Chen J, Gale RP, Feng Y, Hu Y, Qi S, Liu X, et al. Are haematopoietic stem cell transplants stem cell transplants, is there a threshold dose of CD34-positive cells and how many are needed for rapid posttransplant granulocyte recovery? Leukemia. 2023. https://doi.org/10.1038/s41375-023-01973-2.
Song J, Mercer D, Hu X, Liu H, Li MM. Common leukemia- and lymphoma-associated genetic aberrations in healthy individuals. J Mol Diagn. 2011;13:213–9.
Article PubMed PubMed Central CAS Google Scholar
Farina M, Rossi G, Bellotti D, Marchina E, Gale RP. Is having clonal cytogenetic abnormalities the same as having leukaemia. Acta Haematol. 2016;135:39–42.
Article PubMed Google Scholar
Young AL, Challen GA, Birmann BM, Druley TE. Clonal haematopoiesis harbouring AML-associated mutations is ubiquitous in healthy adults. Nat Commun. 2016;7:12484.
Article PubMed PubMed Central CAS Google Scholar
Zhang Y, Wang S, Zhang J, Liu C, Li X, Guo W, et al. Elucidating minimal residual disease of paediatric B-cell acute lymphoblastic leukaemia by single-cell analysis. Nat Cell Biol. 2022;24:242–52.
Article PubMed CAS Google Scholar
Gray RJ. A class of K-sample tests for comparing the cumulative incidence of a competing risk. Ann Stat. 1988;16:1141–54.
Article Google Scholar
Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999;94:496–509.
Article Google Scholar

Download references

Acknowledgements

Profs. Nick Cross (University of Southampton), Christopher Hourigan (National Institutes of Health) and Alec Morley (Flinders University) kindly reviewed the typescript. RPG acknowledges support from the National Institute of Health Research (NIHR). JC acknowledges support from the Institute of Hematology, Chinese Academy of Medical Sciences (IHCAMS).

Funding

Supported, in part, by grants from the Ministry of Science and Technology of China (2019YFA0110803; XZ), the Chinese Academy of Medical Sciences (CAMS) Innovation Fund for Medical Sciences (2021-I2M-1-001 and 2022-I2M-2-003; JC), the National Institute of Health Research (NIHR) Biomedical Research Centre (RPG) and the Ministry of Science and Technology of China (84000-51200002; RPG).

Author information

These authors contributed equally: Yahui Feng, Saibing Qi, Xueou Liu, Li Zhang.

Authors and Affiliations

State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin, China
Yahui Feng, Saibing Qi, Xueou Liu, Li Zhang, Yu Hu, Qiujin Shen, Xiaowen Gong, Wei Zhang, Junxia Wang, Wen Yan, Tiantian Wang, Huijun Wang, Zhen Song, Xiaofan Zhu & Junren Chen
Tianjin Institutes of Health Science, Tianjin, China
Yahui Feng, Saibing Qi, Xueou Liu, Li Zhang, Yu Hu, Qiujin Shen, Xiaowen Gong, Wei Zhang, Junxia Wang, Wen Yan, Tiantian Wang, Huijun Wang, Zhen Song, Xiaofan Zhu & Junren Chen
Centre for Haematology, Department of Immunology and Inflammation, Imperial College of Science, Technology and Medicine, London, UK
Robert Peter Gale

Authors

Yahui Feng
View author publications
You can also search for this author in PubMed Google Scholar
Saibing Qi
View author publications
You can also search for this author in PubMed Google Scholar
Xueou Liu
View author publications
You can also search for this author in PubMed Google Scholar
Li Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Hu
View author publications
You can also search for this author in PubMed Google Scholar
Qiujin Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaowen Gong
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Junxia Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wen Yan
View author publications
You can also search for this author in PubMed Google Scholar
Tiantian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Huijun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhen Song
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Robert Peter Gale
View author publications
You can also search for this author in PubMed Google Scholar
Junren Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JC and RPG conceived the study. XZ co-led the CCCG-ALL-2015 study, assisted by LZ and JW. HW led the team that performed MRD-testing. XL and QS compiled and curated the data, assisted by YH, WY, TW and ZS. JC developed the alternative MRD metric. YF, SQ, XL, YH, XG and WZ did the computation and developed the graphs and tables. JC and RPG prepared the typescript. All the authors reviewed the typescript, take responsibility for the content and agreed to submit for publication.

Corresponding authors

Correspondence to Xiaofan Zhu or Junren Chen.

Ethics declarations

Competing interests

RPG is a consultant to Antengene Biotech LLC, Ascentage Pharma Group and NexImmune Inc.; Medical Director, FFF Enterprises Inc.; Board of Directors: Russian Foundation for Cancer Research Support; and Scientific Advisory Boards, Nanexa AB and StemRad Ltd.

Ethics approval

Approved by the Academic Committee (IIT-NI2020001) and Ethics Review Committee (NI2020001-EC-1) of the Institute of Hematology, Chinese Academy of Medical Sciences (IHCAMS). Subjects gave written informed consent consistent with precepts of the Helsinki Declaration.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplement material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Feng, Y., Qi, S., Liu, X. et al. Have we been qualifying measurable residual disease correctly?. Leukemia 37, 2168–2172 (2023). https://doi.org/10.1038/s41375-023-02026-4

Download citation

Received: 21 July 2023
Revised: 30 August 2023
Accepted: 05 September 2023
Published: 13 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1038/s41375-023-02026-4

This article is cited by

Quantifying measurable residual disease correctly
- Alexander A. Morley
Leukemia (2024)
Measurable residual disease (MRD)-testing in haematological and solid cancers
- Junren Chen
- Robert Peter Gale
- Wei Zhang
Leukemia (2024)
Response to comment on Have we been qualifying measurable residual disease correctly?
- Junren Chen
Leukemia (2024)

Have we been qualifying measurable residual disease correctly?

Subjects

Introduction

Tyranny of sampling error

Borrowing lessons from decision science

A clinical example

Is MRD_{worst_case} an index or a metric for MRD?

Discussion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Ethics approval

Additional information

Supplementary information

Supplement material

Rights and permissions

About this article

Cite this article

This article is cited by

Quantifying measurable residual disease correctly

Measurable residual disease (MRD)-testing in haematological and solid cancers

Response to comment on Have we been qualifying measurable residual disease correctly?

Response to comment on Have we been qualifying measurable residual disease correctly?

Quantifying measurable residual disease correctly

Search

Quick links

Subjects

Introduction

Tyranny of sampling error

Borrowing lessons from decision science

A clinical example

Is MRDworst_case an index or a metric for MRD?

Discussion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Ethics approval

Additional information

Supplementary information

Supplement material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Quantifying measurable residual disease correctly

Measurable residual disease (MRD)-testing in haematological and solid cancers

Response to comment on Have we been qualifying measurable residual disease correctly?

Search

Quick links

Is MRD_{worst_case} an index or a metric for MRD?