Deceptive measures of progress in the NHS long-term plan for cancer: case-based vs. population-based measures

Oke, Jason L.; Brown, Sarah Jo; Senger, Chris; Welch, H. Gilbert

doi:10.1038/s41416-023-02308-9

Download PDF

Perspective
Open access
Published: 17 June 2023

Deceptive measures of progress in the NHS long-term plan for cancer: case-based vs. population-based measures

Jason L. Oke ORCID: orcid.org/0000-0003-3467-6677¹,
Sarah Jo Brown²,
Chris Senger² &
…
H. Gilbert Welch³

British Journal of Cancer volume 129, pages 3–7 (2023)Cite this article

2089 Accesses
1 Citations
21 Altmetric
Metrics details

Subjects

Abstract

The NHS Long Term Plan for cancer aims to increase early-stage diagnoses from 50% to 75% and to have 55,000 more people each year survive their cancer for at least 5 years following diagnosis. The targets measures are flawed and could be met without improving outcomes that really matter to patients. The proportion of early-stage diagnoses could increase, while the number of patients presenting at a late-stage remains the same. More patients could survive their cancer for longer, but lead time and overdiagnosis bias make it impossible to know whether anyone had their life prolonged. The target measures should switch from biased case-based measures to unbiased population-based measures that reflect the key objectives in cancer care: reducing late-stage incidence and mortality.

Monitoring the impact of COVID-19 in France on cancer care: a differentiated impact

Article Open access 10 March 2022

The changing landscape of cancer in the USA — opportunities for advancing prevention and treatment

Article 28 May 2020

Conditional crude probabilities of death for English cancer patients

Article Open access 11 October 2019

Introduction

In June 2018, the UK Prime Minister announced a new 5-year funding settlement for The National Health Service (NHS) in return for developing a long-term plan for the service. One of the goals of the NHS Long Term Plan is “to save thousands more lives each year by dramatically improving how we diagnose and treat cancer”. [1] In January 2019, Health secretary Matt Hancock set out two key 2028 targets as a means to achieve this goal:

1.
The proportion of all cancers diagnosed at an early stage would rise from approximately 50% currently to 75%.
2.
55,000 more people each year would survive their cancer for at least 5 years following diagnosis.

These targets would be achieved by implementing a series of initiatives, including an overhaul and expansion of existing cancer screening programmes, the introduction of new tests, mobile lung cancer screening units and significant investment in artificial intelligence (AI) to better target at-risk populations.

While we applaud the goal, the target measures are flawed. While these targets could be achieved through meaningful improvements for patients with cancer, they could also be met without making a single improvement in the outcomes that really matter to patients: a reduced risk of suffering symptoms from cancer or a reduced risk of dying from cancer. Furthermore, the pursuit of these targets could even harm patients directly, by diagnosing and treating cancers that were otherwise not destined to cause problems, and indirectly, by siphoning resources away from more effective health initiatives.

The problem is in the target measures themselves. Both stage distribution and 5-year survival are case-based measures—that is, both use the number of diagnosed cancer cases in the denominator (Table 1). Here we show how both can be deceptive in signalling apparent benefit when none exists. We argue that progress against cancer must be measured using a population-based denominator—specifically, late-stage incidence and mortality.

Table 1 NHS target measures, definitions, problems and alternatives.

Full size table

Cancer paradigms: the traditional view

Diagnosing cancer earlier is a goal sought by individuals, health systems and governments across the world. The rationale is familiar: cancers found at an early stage are apparently more “curable” and require less aggressive treatment—with fewer attendant side effects.

This strategy makes sense under a widely-held model of cancer progression typically attributed to William Stewart Halsted [2]. Halsted argued that cancer progresses in an orderly fashion: it arises at a single location, grows there, and then eventually spreads to other parts of the body (Fig. 1, left-hand panel). Crucially, in terms of early detection, this model posits that cancer metastasis only happens late in the disease, many years after the onset of cancer. Furthermore, this homogeneous model of progression suggests that all cancers, if left untreated, will relentlessly progress to ultimately metastasise and cause death. Under the traditional model, it follows that finding more early-stage cancers is always beneficial.

**Fig. 1: Two models of cancer progression.**

Cancer paradigms: the contemporary view

The traditional model is outdated. It is far too simple to adequately represent the constellation of diseases currently labelled as “cancer” [3]. The contemporary model of cancer progression is necessarily more complex and heterogeneous (Fig. 1, right-hand panel).

In the 1960s and 70s, Bernard Fisher questioned Halsted’s view of orderly cancer progression. He hypothesised that breast cancer could be a systemic disease from the outset: that tumour cells could be disseminated throughout the body by the time of detection [4]. Recent cancer genomic research suggests Fisher’s hypothesis extends beyond breast cancer. In an analysis of 118 biopsies from 23 colorectal cancer patients with distant metastases, dissemination was estimated to occur well before the primary tumour was large enough to be clinically detectable [5]. These aggressive, “born to be bad” cancers would elude any feasible early detection efforts, yet they are the ones most likely to cause death.

Cancers at the opposite extreme of the growth spectrum became apparent with the advent of widespread prostate cancer screening in the United States during 1990s. Some localised prostate cancers grew so slowly that they were not destined to causes symptoms before the patient died from competing risks of death—particularly in older men [6, 7]. Alternatively, some lesions meeting the pathological criteria for cancer may not grow at all. The same phenomena soon became evident in randomised trials of chest X-ray screening for lung cancer [8]. Adding to the complexity were subsequent observations suggesting that some breast [9], thyroid [10] and kidney [11] cancers, in fact, regress. Collectively, the detection of these very slow growing, non-progressive, and regressing cancers became known as overdiagnosis—the diagnosis of a “disease” not otherwise destined to be experienced by the patient.

We are only beginning to learn about the heterogeneity of cancer growth. But it seems likely that this heterogeneity exists within cancer primary sites. In other words, there are some breast, colorectal and lung cancers that are already systemic by the time they are detectable and there are others that are not destined to ever metastasise. Under the contemporary model, it follows that finding more early-stage cancer is not always beneficial—and, in fact, can be harmful.

How stage distribution can be deceptive

“The proportion of all cancers diagnosed at an early stage would rise from approximately 50% currently to 75%. “

The contemporary model acknowledges that some early-stage cancers are not destined to become late-stage cancer. Thus, it is possible to find more early-stage cancers yet have no effect on the number of individuals who first present with late-stage cancer. Nonetheless, the case-based measure of stage-distribution will become apparently more favourable simply by finding more early-stage disease.

Two prominent examples of this phenomenon appear in Fig. 2. The introduction of widespread screening with mammography in the United States during the 1980s led to many more breast cancers being detected at an early stage, while the incidence of late-stage breast cancer remained about the same [12]. Nevertheless, the stage-distribution became apparently more favourable: before screening 55% of breast cancers were diagnosed at an early stage, after screening 75% were diagnosed at an early stage. The reframed statement is arguably more powerful: before screening 45% of breast cancers were diagnosed at a late stage, while after screening 25% were diagnosed at a late stage. Yet both statements are deceptive as there was little change in incidence of late-stage disease.

**Fig. 2: Deceptive stage distributions.**

A similar pattern was recently observed with the promotion of low-dose computed tomography lung cancer screening in Taiwanese women—the majority of whom have never smoked [13]. Many more lung cancers were detected at an early stage, while the incidence of late-stage lung cancers remained stable. Again, the stage distribution became apparently more favourable: before screening 90% of lung cancers were diagnosed at a late stage, while after screening 58% were diagnosed at a late stage. These two examples highlight how a favourable change stage distribution can be deceptive and why a shift in stage distribution does not by itself provide evidence that patients have benefited.

How survival can be deceptive

“55,000 more people each year would survive their cancer for at least 5 years following diagnosis”

Even under the traditional model of cancer progression, it is possible to find cancers earlier yet have no effect on when patients die from their cancer—simply because treatment initiated earlier conferred no advantage over treatment initiated later. Nevertheless, earlier detection biases the case-based measure of survival time. Because survival time is measured from the time of diagnosis, cancer screening will always “start the clock earlier”—thus always lengthen survival times. Whether life is prolonged (that is, death is delayed) is a separate question. In the simplest case—no change in the time of death—survival time will lengthen and signal a benefit when none exists. Yet even if death has been delayed, survival time will exaggerate the apparent effectiveness of screening. Because of this so-called lead time bias [14], higher survival does not necessarily mean that earlier detection has prolonged patient’s lives.

But there is another, potentially larger bias associated with contemporary model of cancer progression: the detection of cancers not destined to cause symptoms or death. The introduction of screening tends to uncover these sub-clinical cancers that have previously gone unnoticed. Overdiagnosis wreaks havoc on survival statistics (Fig. 3).

**Fig. 3: Illustration of how overdiagnosis inflates 5-year survival, while the number of deaths remains unchanged.**

The scale of this problem should not be underestimated. For example, when fee-for-service providers introduced thyroid screening with ultrasonography in South Korea, the incidence of thyroid cancer increased 15 times over a decade. All of the increase consisted of small papillary thyroid cancers—long known to be a common finding at autopsy, but an extremely rare cause of death [15]. More than 40,000 people were diagnosed with the disease in 2011 alone—virtually all of whom survive 5 years or more. In fact, a website promoting Korean medical tourism advertised Korea as the place be treated for thyroid cancer—touting “the highest thyroid cancer survival rate in the world” [16].

There is no evidence anyone benefited from screening, but many were certainly harmed by unneeded surgery and loss of thyroid function. Yet by these actions, South Korea, a country with a smaller population than the UK, very nearly managed to get 55,000 more people each year surviving their cancer for at least 5 years following diagnosis simply by screening for thyroid cancer.

While survival is a perfectly valid measure in a randomised trial of treatment, survival comparisons across time (e.g. 1980 vs today) or place (e.g. UK vs. US) may say more about diagnostic practice than the quality of treatment or the risk of death [17]. In thyroid cancer, for example, 5-year survival is 87% in the UK and 98% in the US [18, 19]. While it is tempting to imagine thyroid cancer treatment must be better in the US, thyroid cancer mortality is actually lower in the UK (2.4 vs 3.0 per million age-standardised to the world population) [20].

Moving forward—population-based measures

The NHS target measures, stage distribution and survival, regularly overstate the value of early cancer detection. The problem with these case-based measures is that early detection efforts influence both the numerator and the denominator, making it impossible to discern whether genuine progress has been made. What is needed is a stable denominator—one unaffected by early detection—the population (Table 1).

Late-stage incidence

Declining late-stage cancer incidence suggests that screening is doing what it is intended to do: advance the time of diagnosis for cancers otherwise destined to present clinically at a late-stage. It is important to emphasise that late-stage incidence only includes patients in whom the cancer is first diagnosed at a late stage; it does not include those in whom cancer is diagnosed at an early stage, but nonetheless progress to a late stage [21]. Cancers destined to clinically present at a late stage represent the most aggressive and deadly cancers. They are the ones we most want to find early, in the hope that treatment initiated earlier will confer some benefit over treatment initiated later.

Declining late-stage incidence may not lead to fewer deaths, however, because treatment initiated earlier is not reliably more effective than treatment initiated later. The UKCTOCS ovarian cancer screening trial, for example, was able to reduce late-stage (Stage IV) incidence by 25%, yet this earlier detection and treatment did not translate into fewer ovarian cancers deaths [22]. The authors explanation for this was that “the cancers shifted to an earlier stage had an intrinsic poor prognosis”—in other words, they were born to be bad. Randomised trials of breast [23] and colon cancer [24] surveillance showed similar results: aggressive surveillance did detect cancer recurrence earlier, yet earlier detection and treatment did not change the risk of death. Thus, while a reduction late-stage incidence is evidence that screening works in terms of advancing the time of diagnosis for the worst cancers, it does not necessarily mean that patients are being helped.

Mortality: all causes vs target cancer

“The risk of death is the risk with which the individual is most concerned”, said Sir Richard Doll 30 years ago, when examining whether progress was being made on cancer [25]. It is still true today: reduced mortality remains the most important measure of progress against cancer.

The language is subtle but unambiguous: it is the risk of death from all causes that concerns patients, not simply the risk of dying from cancer. Averting death from cancer only to succumb to some other cause is not really progress—some have even argued that dying from other causes may be worse [26].

Randomised trials of screening for lung [27], colon [28], and prostate cancer [29] have demonstrated that screening significantly reduced the risk of dying from the target cancer but had no impact on all-cause mortality. The apparent paradox may be the result of both (1) off-target deaths (i.e. deaths that are a consequence of screening and subsequent intervention, yet are not ascribed to the target cancer) and (2) the competing risks of death associated with the ageing soma (i.e. those at a high risk of dying from cancer are also a high risk of dying from other causes) [30]. Patients and NHS policymakers learning that screening “saves lives” might reasonably expect that screening would enhance their longevity (i.e. reduce all-cause mortality). But that may not be the case.

Alternatively, the apparent paradox may be explained more simply: as being the result of the play of chance. All-cause mortality is an insensitive measure for population wide interventions targeting a single cancer (e.g. colon or lung cancer) as deaths from the target cancer are a small component of all deaths. A trial screening for one cancer powered to detect the effect on all deaths would require a Herculean effort—hundreds of thousands of people followed for a decade or more. Thus as the NHS looks to lower the starting age for colon cancer screening (from age 60 years to age 50 years) or expand lung cancer screening by adding mobile units it is reasonable to measure progress in terms of colon or lung cancer mortality. But as the NHS considers interventions intended to address all cancers combined—such as AI to better target at-risk populations and multi-cancer early detection tests (liquid biopsies)—we would argue not only is reduced all-cause mortality the best measure of progress, but also that it is an achievable one, as all cancers combined are a substantial component of all deaths [31].

Conclusion

Death is not the only outcome relevant to early cancer detection, other outcomes matter as well. It is conceivable, for example, that earlier detection might reduce the symptom burden of some cancer patients without extending their life. But it is far more likely that screening produces additional burden for others. First, many healthy people have to be persuaded that they “need” to be tested—too often with scary messages suggesting that people who die from cancer could have avoided the outcome with earlier detection. Then there are the problems caused by abnormal results: the emotional and psychological stress in those falsely alarmed, the routine subsequent testing of those deemed to be at “high risk” because of a detected abnormality, and the toxicity and complications of unneeded treatment in those overdiagnosed.

The conundrum of cancer screening is that while only a few participants can potentially benefit, all can be potentially harmed. Thus, arguments for more screening require that its benefit be sufficiently large to warrant the associated harms and opportunity costs. As we have shown here, surrogate measures of benefit can be deceptive—what is required is evidence that screening, in fact, saves lives. This will be hard to do because the effect being sought is necessarily small. Given the evolving understanding that tumour biology and host response are more relevant to prognosis than the time of diagnosis, we believe it’s time to challenge the assertion that more screening is the best strategy to make progress against cancer.

Data availability

Data sharing not applicable to this article as no data sets were generated or analysed during the current study.

References

Gov.UK. Government announces plans for earlier diagnosis for cancer patients. 2018. https://www.gov.uk/government/news/government-announces-plans-for-earlier-diagnosis-for-cancer-patients.
Hellman S. Karnofsky Memorial Lecture. Natural history of small breast cancers. J Clin Oncol. 1994;12:2229–34.
Article CAS PubMed Google Scholar
Breslow L, Bailar JC, Brown BW, Brown HG, Darity WA, Defendi V, et al. Measurement of progress against cancer. Extramural Committee to Assess Measures of Progress Against Cancer. J Natl Cancer Inst. 1990;82:825–35.
Google Scholar
Travis K. Bernard Fisher reflects on a half-century’s worth of breast cancer research. J Natl Cancer Inst. 2005;97:1636–7.
Article PubMed Google Scholar
Hu Z, Ding J, Ma Z, Sun R, Seoane JA, Scott Shaffer J, et al. Quantitative evidence for early metastatic seeding in colorectal cancer. Nat Genet. 2019;51:1113–22.
Article CAS PubMed PubMed Central Google Scholar
Welch HG, Albertsen PC, Nease RF, Bubolz TA, Wasson JH. Estimating treatment benefits for the elderly: the effect of competing risks. Ann Intern Med. 1996;124:577–84.
Article CAS PubMed Google Scholar
Pashayan N, Powles J, Brown C, Duffy SW. Excess cases of prostate cancer and estimated overdiagnosis associated with PSA testing in East Anglia. Br J Cancer. 2006;95:401–5.
Article CAS PubMed PubMed Central Google Scholar
Black WC. Overdiagnosis: an underrecognized cause of confusion and harm in cancer screening. J Natl Cancer Inst. 2000;92:1280–2.
Article CAS PubMed Google Scholar
Zahl PH, Maehlen J, Welch HG. The natural history of invasive breast cancers detected by screening mammography. Arch Intern Med. 2008;168:2311–6.
Article PubMed Google Scholar
Tuttle RM, Fagin JA, Minkowitz G, Wong RJ, Roman B, Patel S, et al. Natural history and tumor volume kinetics of papillary thyroid cancers during active surveillance. JAMA Otolaryngol Head Neck Surg. 2017;143:1015–20.
Article PubMed PubMed Central Google Scholar
Jewett MA, Mattar K, Basiuk J, Morash CG, Pautler SE, Siemens DR, et al. Active surveillance of small renal masses: progression patterns of early stage kidney cancer. Eur Urol. 2011;60:39–44.
Article PubMed Google Scholar
Bleyer A, Welch HG. Effect of three decades of screening mammography on breast-cancer incidence. N Engl J Med. 2012;367:1998–2005.
Article CAS PubMed Google Scholar
Gao W, Wen CP, Wu A, Welch HG. Association of computed tomographic screening promotion with lung cancer overdiagnosis among Asian women. JAMA Intern Med. 2022;182:283–90.
Article PubMed Google Scholar
Morrison AS. The effects of early treatment, lead time and length bias on the mortality experienced by cases detected by screening. Int J Epidemiol. 1982;11:261–7.
Article CAS PubMed Google Scholar
Vanderlaan WP. The occurrence of carcinoma of the thyroid gland in autopsy material. N Engl J Med. 1947;237:221.
Article CAS PubMed Google Scholar
Welch HG. Cancer screening, overdiagnosis, and regulatory capture. JAMA Intern Med. 2017;177:915–6.
Article PubMed Google Scholar
Oke JL, O’Sullivan JW, Perera R, Nicholson BD. The mapping of cancer incidence and mortality trends in the UK from 1980-2013 reveals a potential for overdiagnosis. Sci Rep. 2018;8:14663.
Article PubMed PubMed Central Google Scholar
CRUK. Thyroid cancer survival statistics. 2023. https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/thyroid-cancer/survival#heading-Zero.
NCI. Thyroid: recent trends in SEER relative survival rates, 2000-2018. Surveillance, Epidemiology, and End Results Program. 2022. https://seer.cancer.gov/statistics-network/explorer/application.html?site=80&data_type=4&graph_type=2&compareBy=sex&chk_sex_1=1&relative_survival_interval=5&race=1&age_range=1&stage=101&advopt_precision=1&advopt_show_ci=on&hdn_view=0&advopt_display=2#graphArea.
WHO. Estimated age-standardized mortality rates (World) in 2020, thyroid, both sexes, all ages. Cancer Today. 2020. https://gco.iarc.fr/today/online-analysis-map?v=2020&mode=population&mode_population=continents&population=900&populations=900&key=asr&sex=0&cancer=32&type=1&statistic=5&prevalence=0&population_group=0&ages_group%5B%5D=0&ages_group%5B%5D=17&nb_items=10&group_cancer=1&include_nmsc=0&include_nmsc_other=0&projection=natural-earth&color_palette=default&map_scale=quantile&map_nb_colors=5&continent=0&show_ranking=0&rotate=%255B10%252C0%255D.
Welch HG, Gorski DH, Albertsen PC. Trends in metastatic breast and prostate cancer — lessons in cancer dynamics. N Engl J Med. 2015;373:1685–7.
Article PubMed Google Scholar
Menon U, Gentry-Maharaj A, Burnell M, Singh N, Ryan A, Karpinskyj C, et al. Ovarian cancer population screening and mortality after long-term follow-up in the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS): a randomised controlled trial. Lancet. 2021;397:2182–93.
Article PubMed PubMed Central Google Scholar
Ghezzi P, Magnanini S, Rinaldini M, Berardi F, Di Biagio G, Testare F, et al. Impact of follow-up testing on survival and health-related quality of life in breast cancer patients: a multicenter randomized controlled trial. JAMA. 1994;271:1587–92.
Article Google Scholar
Primrose JN, Perera R, Gray A, Rose P, Fuller A, Corkhill A, et al. Effect of 3 to 5 years of scheduled CEA and CT follow-up to detect recurrence of colorectal cancer: the FACS randomized clinical trial. JAMA. 2014;311:263–70.
Article CAS PubMed Google Scholar
Doll R. Progress against cancer - are we winning the war. Acta Oncol. 1989;28:611–21.
Article CAS PubMed Google Scholar
Smith, R. Dying of cancer is the best death. BMJ Opinion. 2014. https://blogs.bmj.com/bmj/2014/12/31/richard-smith-dying-of-cancer-is-the-best-death/.
de Koning HJ, van der Aalst CM, de Jong PA, Scholten ET, Nackaerts K, Heuvelmans MA, et al. Reduced lung-cancer mortality with volume CT screening in a randomized trial. N Engl J Med. 2020;382:503–13.
Article PubMed Google Scholar
Shaukat A, Mongin SJ, Geisser MS, Lederle FA, Bond JH, Mandel JS, et al. Long-term mortality after screening for colorectal cancer. N Engl J Med. 2013;369:1106–14.
Article CAS PubMed Google Scholar
Schröder FH, Hugosson J, Roobol MJ, Tammela TL, Zappa M, Nelen V, et al. Screening and prostate cancer mortality: results of the European Randomised Study of Screening for Prostate Cancer (ERSPC) at 13 years of follow-up. Lancet. 2014;384:2027–35.
Article PubMed PubMed Central Google Scholar
DeGregori J, Pharoah P, Sasieni P, Swanton C. Cancer screening, surrogates of survival, and the soma. Cancer Cell. 2020;38:433–7.
Article CAS PubMed Google Scholar
Carr D, Kent DM, Welch HG. All-cause mortality as the primary endpoint for the GRAIL/National Health Service England multi-cancer screening trial. J Med Screen. 2022;29:3–6.
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Nuffield Department of Primary Care Health Sciences, Oxford University, Oxford, England
Jason L. Oke
Manchester, NH, USA
Sarah Jo Brown & Chris Senger
The Center for Surgery and Public Health, Department of Surgery, Brigham and Women’s Hospital, Boston, MA, USA
H. Gilbert Welch

Authors

Jason L. Oke
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Jo Brown
View author publications
You can also search for this author in PubMed Google Scholar
Chris Senger
View author publications
You can also search for this author in PubMed Google Scholar
H. Gilbert Welch
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JLO and HGW conceived the idea and drafted the first version of the manuscript. SJB and CS contributed to revising the final manuscript. JLO is the guarantor.

Corresponding author

Correspondence to Jason L. Oke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

Not applicable.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oke, J.L., Brown, S.J., Senger, C. et al. Deceptive measures of progress in the NHS long-term plan for cancer: case-based vs. population-based measures. Br J Cancer 129, 3–7 (2023). https://doi.org/10.1038/s41416-023-02308-9

Download citation

Received: 06 February 2023
Revised: 09 May 2023
Accepted: 05 June 2023
Published: 17 June 2023
Issue Date: 27 July 2023
DOI: https://doi.org/10.1038/s41416-023-02308-9