Potential utility of risk stratification for multicancer screening with liquid biopsy tests

Kim, Elle S.; Scharpf, Robert B.; Garcia-Closas, Montserrat; Visvanathan, Kala; Velculescu, Victor E.; Chatterjee, Nilanjan

doi:10.1038/s41698-023-00377-w

Download PDF

Brief Communication
Open access
Published: 22 April 2023

Potential utility of risk stratification for multicancer screening with liquid biopsy tests

Elle S. Kim¹,
Robert B. Scharpf^1,2,
Montserrat Garcia-Closas ORCID: orcid.org/0000-0003-1033-2650³,
Kala Visvanathan^2,4,
Victor E. Velculescu² &
…
Nilanjan Chatterjee ORCID: orcid.org/0000-0002-9060-008X^1,2

npj Precision Oncology volume 7, Article number: 39 (2023) Cite this article

2627 Accesses
4 Altmetric
Metrics details

Subjects

Abstract

Our proof-of-concept study reveals the potential of risk stratification by the combined effects of age, polygenic risk scores (PRS), and non-genetic risk factors in increasing the risk-benefit balance of rapidly emerging non-invasive multicancer early detection (MCED) liquid biopsy tests. We develop and validate sex-specific pan-cancer risk scores (PCRSs), defined by the combination of body mass index, smoking, family history of cancers, and cancer-specific polygenic risk scores (PRSs), to predict the absolute risk of developing at least one of the many common cancer types. We demonstrate the added value of PRSs in improving the predictive performance of the risk factors only model and project the positive and negative predictive values for two promising multicancer screening tests across risk strata defined by age and PCRS.

Clinical utility of genomic signatures in young breast cancer patients: a systematic review

Article Open access 25 September 2020

Cynthia Villarreal-Garza, Ana S. Ferrigno, … Hatem A. Azim Jr.

Polygenic risk scores to stratify cancer screening should predict mortality not incidence

Article Open access 30 May 2022

Andrew J. Vickers, Amit Sud, … Richard Houlston

Prostate cancer risk prediction using a polygenic risk score

Article Open access 13 October 2020

Csilla Sipeky, Kirsi M. Talala, … Johanna Schleutker

Developing effective screening tools for early cancer detection has long been a pressing interest due to the poor prognosis and survival associated with advancing cancer stage¹. Identifying individuals at the subclinical or asymptomatic stage provides a unique window of opportunity for early intervention that has been shown to improve survival². In a general population with a relatively low prevalence of cancer, ideally, a screening test needs to be broadly accessible, highly specific, and sensitive. A screening test should be specific to minimize overdiagnosis-related psychological and financial burdens and risks associated with unnecessary follow-up treatments and sensitive to prevent missed or interval cases. For these reasons, to date, the United States Preventive Service Tasks Forces has recommended only a handful of age-based single-cancer screening modalities such as colonoscopy³ (37.1% to 79.4% sensitivity (se) and 86.7% to 97.3% specificity (sp)) for colorectal cancer⁴ mammogram⁵ (86.9% se and 88.9% sp) for breast cancer⁶, low-dose computerized tomography⁷ (59% to 100% se and 26.4 to 99.7% sp) scan for lung cancer⁸, and pap test⁹ (70% to 80% se and 95% sp) for cervical cancer¹⁰.

Existing single-cancer screening tools face several challenges, including lack of adherence to screening recommendations^11,12,13, low positive predictive value (PPV) or high false positives¹⁴, and missed or interval cancer cases¹⁵. Additionally, there are no presently accepted screening tools for many cancers with poor prognoses or high late-stage diagnosis rates for cancer detection in asymptomatic individuals. In this context, multicancer early detection (MCED) liquid biopsy tests using analytes such as cell-free DNA (cfDNA) are gaining traction^{16,17,18,19,20,21,22,23,24,25,26}. Several recent studies have started to explore the feasibility of such approaches for early cancer detection in a limited clinical setting^{18,19,20,21,22,23,24,25,26,27,28}. GRAIL’s Galleri test^21,26 and Thrive’s DETECT-A²⁴ (Detecting cancers Earlier Through Elective Mutation-based blood Collection and Testing) are of note.

MCED has the promise to lower cancer mortality, especially through early detection of cancers for which there is currently no screening available. However, as many recent studies have shown, the issue of low PPV persists as a significant limitation of the newly developed multicancer tests (Supplementary Table 1)^21,24,26,29. The values of PPV for tests are anticipated to be highly influenced by specificity, and for a constant specificity value, by the combination of sensitivity and prevalence. The current MCED tests have high specificity (99% or higher) but typically have low PPV for the general population due to modest sensitivity and low prevalence of cancer in the general population. While sensitivity for some of these tests can be substantially higher for some specific cancers and may be improved further through the incorporation of additional features in cfDNA^19,27,28, the PPV in most settings will still be expected to remain low as the prevalence of individual cancers is even lower. Thus, in the future, a risk-stratified approach is likely to be needed to enhance PPV and the risk-benefit balance of these tests. In the multicancer setting, however, population risk stratification becomes more challenging as one needs to consider risk factors across many cancers. There are also new opportunities due to the emergence of polygenic risk scores (PRSs) from genome-wide association studies (GWASs) across many cancers³⁰. Recently, a study investigated the potential utility of PRSs and other risk factors to build a model for predicting risk for at least one of several cancers to understand the impact of lifestyle modifications on overall cancer risk³⁰. However, the prospects of risk stratification in multicancer screening and the added values of PRS in addition to classical risk factors are yet to be investigated in the context of emerging MCED tests.

We use data from the prospective UK Biobank (UKBB) study and the US population cancer incidence rates to estimate future cancer risk given individuals’ genetic and nongenetic profiles. We identified the top ten incident cancer types for females and males with sufficiently large and publicly available GWAS (see Methods)³¹. Our final analysis involved 133,830 female and 115,207 unrelated male participants of White British ancestry aged 40–73, with 5807 and 5906 incident cancer cases for bladder, breast (female only), colorectum, endometrium, kidney, lung, melanoma, non-Hodgkin’s lymphoma, ovary, pancreas, and prostate (male only), respectively, over the course of follow-up. We used sex-specific Cox proportional hazards models where the baseline hazard is specified as a function of age and assumed the multiplicative effects of the risk factors³² with the outcome as the first cancer incidence of the above-mentioned cancers. PRSs were calculated for eleven (bladder, breast, colorectum, endometrium, kidney, lung, melanoma, non-Hodgkin’s lymphoma, ovary, pancreas, and prostate) cancer types (Supplementary Figs. 1 and 2). We included two major lifestyle-related exposures, namely smoking (status and pack-years of smoking) and body mass index (BMI), known to influence risk across multiple cancers, and family history of breast, colorectal, lung, and prostate cancer in non-adoptive first-degree relatives as risk factors. We then computed pan-cancer risk scores (PCRSs) as the weighted sum of the predictors included in the multicancer Cox model. The performance of the PCRSs was evaluated using a standardized hazard ratio for instantaneous risk and area under the curve (AUC) using up to 5 years of follow-up data. We assume that the probability of an individual carrying an asymptomatic, but screen-detectable cancer is proportional to the risk of incident cancer over a small-time interval (eleven months for DETECT-A and one year for Galleri). We use the Bayes theorem to determine expected PPVs and NPVs for DETECT-A and Galleri across the PCRS percentiles and ages based on reported diagnostic accuracies of these tests.

As anticipated, PCRSs were strongly associated with the risk of developing at least one cancer during the follow-up of the UKBB study in both females (HR: 1.39 per 1 SD, 95% CI: 1.33–1.45) and males (HR: 1.43 per 1 SD, 95% CI: 1.37–1.49) (Supplementary Table 2). We observed a strong degree of multicancer risk stratification by the combined effects of age, cancer-specific polygenic risk scores, and conventional risk factors shared across multiple cancer types (Figs. 1, 2, and Supplementary Figs. 3–6). Comparison of the 1-year and 10-year trajectories across various ages and risk strata for the PRS only model (Female AUC: 0.58, Male AUC: 0.59), risk factors only model (Female AUC: 0.55, Male AUC: 0.57), and the combined model (PCRS model; Female AUC: 0.60, Male AUC: 0.62), further demonstrates the added value of cancer-specific PRSs as covariates to improve multicancer risk stratification and predictive model performance (Supplementary Table 2 and Supplementary Figs. 3–6). The combined (i.e., PCRS) model showed a mean 1-year overall absolute cancer risk of 3.58% for the high-risk females aged 75 (top 10 percentile) and 0.77% for the low-risk females of the same age (bottom 10 percentile)—close to a 4.6-fold increase—whereas the risk factors only model showed a lower level of overall cancer risk stratification between the high-risk group and low-risk group, with 1-year absolute risks sitting at 2.36% and 1.12%, respectively, approximately corresponding to a 2.1-fold increase (Supplementary Fig. 3a–c).

**Fig. 1: Estimated 1-year absolute risk and the projected PPV and NPV of the Galleri test with an overall sensitivity of 51.5% at 99.5% specificity for females and males.**

**Fig. 2: Comparison of the estimated absolute risk and the projected PPV and NPV of the DETECT-A test for three separate models (PCRS, Risk factors only, and PRS only).**

Further, the projected PPVs of the MCED tests (Galleri with a sensitivity of 51.5% at a specificity of 99.5% and DETECT-A with a sensitivity of 27.1% at 98.9% specificity) varied substantially by the level of the underlying risk of the population strata and the diagnostic accuracies of the liquid biopsy test in question (Figs. 1c, d, 2d–f). For example, 75-year-old females in the 90–95th PCRS percentile (AR: 2.27%) will have a 2.6-fold increased 1-year risk compared to the same-aged female in the 5–10th PCRS percentile (AR: 0.89%) (Fig. 1a). This corresponds to a PPV value of 70.3% and 48.2% for the Galleri test, translating to a 22.1% PPV difference for Galleri (Fig. 1c). NPV across all risk percentiles was reasonably high for both tests across all strata (Fig. 1e, f).

As the first step to integrating these new multicancer screening tests in the clinical setting, one can also consider a scenario in which a fixed threshold for PPV is employed as a metric to recommend early multicancer screening in the asymptomatic stage. The eligibility will strongly vary by both age and PCRS percentile. For example, at a threshold of 40% PPV, females could be eligible for the Galleri test as early as age 50 and in the 95th PCRS risk percentile and above (Fig. 1c). However, for DETECT-A, females would be eligible for screening starting at age 61 and in the highest PCRS risk percentile (Fig. 2d). Raising the PPV threshold to 60%, for DETECT-A, none of the females would achieve the required PPV even at the oldest age and highest risk groups (Fig. 2d). With Galleri’s high sensitivity and specificity, females will reach the desired threshold starting at age 56 and in the highest PCRS percentile (Fig. 1c).

Our analysis has several limitations. We assumed that the reported sensitivity and specificity of a test like DETECT-A and Galleri would be applicable across all age, sex, and risk groups. Given the increase in observed sequence alterations in cfDNA resulting from clonal hematopoiesis in older individuals^33,34,35,36, improvements in cfDNA analyses will be needed to overcome these challenges, potentially through the use of mutation-agnostic methods^19,20,36. Additional empirical data are needed to explore the potential heterogeneity of the diagnostic accuracy of MCED tests by age and other risk factors. In our multicancer risk prediction models, we included a limited set of risk factors, namely two major lifestyle-related factors that influence the risk of multiple cancers, family history of the most common cancers, and PRSs for each cancer type. We further assumed a proportional hazard model for all cancer risks, assuming multiplicative effects of age and all the other risk factors. Additional efforts are needed to build and validate more refined multicancer risk models in prospective cohort studies by including additional risk factors and interaction effects. Further, models^37,38 that incorporate extensive family history information and carrier status for rare high-penetrant mutations would be important for individuals in high-risk families with strong clustering of related cancers. Excluding variants with minor allele frequency <0.01 is another limitation of our study.

We did not account for all cancer types in our multicancer model. To build a more robust and complete multicancer risk prediction model, cancer-specific risk models should first be developed and then combined to generate the risk of composite outcomes, like that of any cancer among several. The use of site-specific models will also allow PPV calculations to take into consideration underlying variations in the diagnostic accuracy of the MCED tests across different cancer types. Finally, our model-building effort was restricted to the participants of White British ancestry in the UKBB due to the limited sample size of other ancestry groups in this study, and also the lack of well-validated PRS in non-European ancestry populations. Large studies of diverse populations are urgently needed to study the accuracy of MCED tests and to build and validate robust multicancer risk prediction models across different racial and ethnic groups.

In summary, we conducted a first-of-a-kind study highlighting the potential for a risk-based approach in multicancer screening. We observed the added value of PRS in improving the degree of risk stratification for composite cancer outcomes compared to the model defined by age, family history, smoking, and BMI. In the context of population-level early cancer screening, the addition of PRS could allow the detection of high-risk individuals even in the absence of conventional risk factors. In the future, well-powered empirical studies are needed in diverse populations to prospectively evaluate the utility of the multicancer liquid biopsy tests for their use in personalized early cancer detection.

Methods

Based on the report from the 2013–2017 United States Cancer Statistics (USCS) database, we identified the top ten malignant incident cancer types for females and males, after excluding non-melanoma skin cancer³¹. First, we surveyed the NHGRI-EBI Catalog of Published Genome-Wide Association Studies (GWAS Catalog)³⁹ and the Polygenic Risk Score (PGS) Catalog⁴⁰ to select the largest European ancestry-based GWAS as of May 2020 for each cancer type. We additionally browsed PubMed⁴¹ for large cancer-specific GWASs that were not included in the GWAS Catalog or PGS Catalog. For breast and colorectal cancer, we searched for prior European sample-based large-scale polygenic risk score (PRS) studies as of July 2020 and selected studies reporting the best-performing PRS (Supplementary Data). We did not consider pleiotropic GWAS. We filtered to cancer types with at least ten independent genome-wide significant SNPs after LD clumping at a genome-wide significant (GWS) p-value, 5E-8, threshold. Ultimately, eleven cancer types (bladder, breast, colorectum, endometrium, kidney, lung, melanoma, Non-Hodgkin’s lymphoma, ovary, pancreas, and prostate) were included in our analysis. For the full list of source literature and GWAS summary statistics included in our analysis, see Supplementary Data.

UK Biobank (UKBB) is a prospective epidemiological cohort study with over 500,000 participants^42,43,44. Individuals aged 40–69 at baseline were recruited across the United Kingdom (UK) from 2006–2010^42,43,44. A wide range of genotypic and phenotypic information, including personal medical and family history and lifestyle data, were collected at enrollment^42,43,44. UKBB data is regularly updated by completing follow-up questionnaires, linkage to national cancer and mortality registries, and hospital inpatient electronic medical records systems^42,43,44. With linkage to the national cancer registry data, cancer diagnosis date and type (coded based on International Classification of Disease 10 (ICD-10)) were available for participants diagnosed with cancer^42,43,44. For our analysis, we used ICD-10 codes for cancer classification (see Supplementary Table 4).

We then filtered to unrelated UKBB participants of White British ancestry with imputed genotype data. We excluded individuals who were lost to follow-up, with genetic sex and self-reported sex mismatch, those with any cancer diagnosis prior to baseline assessment (prevalent cancers), and participants with missing data in any one of the classical risk factors (BMI, smoking status, pack years of smoking, and family history of cancer in non-adoptive first-degree relatives). In UKBB, family history of all cancers is not available. UK Biobank only reports family history of the top three cancer incident types for females (breast, bowel, and lung) and males (breast, bowel, and prostate). These quality control procedures resulted in a study population involving 133,830 females and 115,207 males.

After determining the source literature (Supplementary Data) for each cancer type, we reviewed the manuscript and any relevant additional resources. We extracted all autosomal SNPs from each cancer GWAS along with their summary statistics such as RSIDs, observed effect size estimates (OR or beta), effective (or risk) allele, risk allele frequency (RAF), and p-value. We excluded variants with minor allele frequency (MAF) < 0.01 and ambiguous SNPs (A/T or G/C allele) with MAF > 0.40. We filtered to variants with a MAF difference of less than 0.10 relative to the UK Biobank data. We removed variants with allele mismatches that could not be resolved by strand or dosage flips and/or SNPs with complete information mismatch, based on RSID, chromosome number, and position, to the European 1000 Genome reference panel⁴⁵ or the UK Biobank data. We filtered to variants with an information score ≥0.90 based on the UK Biobank imputed genotype data. Finally, we used the fixed threshold approach to calculate PRS for each cancer. Using Plink⁴⁶, we performed LD clumping at a p-value threshold of 5E-8, r² of 0.1, and 1000 kb window with the European 1000 Genome reference panel⁴⁵ as the reference panel to remove SNPs in linkage disequilibrium within each cancer type.

Then, PRS for UK Biobank participants was computed using PRSice2⁴⁷.

The formula used for PRS calculation in PRSice2:

$PRS_j = \mathop {\sum }\limits_i^{} \beta _iSNP_{ij}$ where $PRS_j$ is the PRS for the jth individual, β_i is the observed effect size estimate for the ith SNP, and $SNP_{ij}$ is the dosage information for the effective allele of the ith SNP for the jth individual. We standardized each PRS to have unit variance and zero mean.

We developed a sex-specific pan-cancer risk prediction model to estimate the risk of developing at least one cancer over the course of follow-up. The multicancer model included eleven cancer types (bladder, breast [Female only], colorectum, endometrium [Female only], kidney, lung, melanoma, Non-Hodgkin’s lymphoma, ovary [Female only], pancreas, and prostate [Male only]). Data were split into 2/3 training set and 1/3 of test set—independent validation datasets used for model performance evaluation and subsequent analysis.

Cox proportional hazard regression (Cox) model³² was fitted to the training set with the outcome as an incidence of any first cancer included in the analysis. The models specified a baseline hazard as a function of age and assumed multiplicative effects of the risk factors³²:

$$\lambda \left( {t|{{{\boldsymbol{z}}}}} \right) = \lambda _0(t)\exp \left( {\beta _1z_1 + \beta _2z_2 + \ldots + \beta _nz_n} \right)$$

t: time-to-event; time to any first cancer incidence, censoring age, or death age

$\lambda _0(t)$: baseline hazard function

z = (z_1, z_{2, …,} z_n): set of covariates (risk factors) included in the Cox model

β = (β₁, β₂, …, β_n): set of coefficients (log hazard ratios) for the predictors

Polygenic risk scores for each cancer (Supplementary Figs. 1 and 2), family history of cancer (breast, colorectum, lung, and prostate) in any first-degree relatives (nonadopted), body mass index, and pack-years of smoking were included as predictors in the model. We also adjusted for the first ten principal components. Also, as UKBB is a left-truncated and right-censored cohort, we used age as the timescale for the Cox model—that is, participants enter the model at recruitment age and exit at cancer incidence age, censoring age, or death age–whichever occurs first. We used the censoring date for the cancer registry data provided by UKBB⁴⁸. In the underlying analysis of the UK Biobank data using the Cox proportional hazard model, the “event” is defined as the occurrence of any of these cancers, and the “time-to-event” is the time to first onset of any of these cancers. Thus, if an individual has multiple cancers, e.g., lung cancer first and then prostate, the individual is censored at the onset of the lung cancer. Further, if an individual first develops cancer of a type other than the ones included in our list, then they are censored at the first onset of those cancer types. Further, deaths from non-cancer causes were also treated as censoring events. Thus, the underlying hazard ratio parameters of the model can be interpreted as the instantaneous risk of developing at least one among the set of selected cancers, given a person was free of all cancers up to that time point.

Additionally, recognizing the concerns with the imputation of clinical/epidemiologic data, we conducted a complete-case analysis for the paper. A total of ~19% of subjects were removed who have missing data in any of the risk factors. Pack-years of smoking had the highest amount of missing data (~16%) missing, but all other individual variables had a small missing rate (<5%). For demonstrating the risk-stratification ability of models, a complete-case analysis is more desirable as imputation and model averaging will cause a diminishing of risk-stratification compared to the full potential of the model. In other words, our goal is to demonstrate the risk-stratification ability of the models for a population in which the underlying risk factors could be fully observed. From that point of view, a complete-case analysis is more desirable.

We computed pan-cancer risk scores (PCRS) or cancer-specific risk scores for all UKBB participants as the weighted sum of the predictors, with weights for each predictor as the estimated log hazard ratio (HRs) from the fitted Cox model. Then, in the test set, we assessed the discriminatory accuracy of the pan-cancer risk score (PCRS) or the cancer-specific risk score (for individual cancer models) using Harrel’s concordance index (C-statistic) and area under the curve (AUC) at five years of follow-up.

We used iCARE (Individualized Coherent Absolute Risk Estimation)⁴⁹ to estimate absolute risk. Detailed methodology for absolute risk model building is described in Choudhury et al. 2020⁴⁹. Briefly, risk estimates for each individual in the test set were obtained by feeding age-specific cancer incidence rates by 1-year strata, log HR parameters from the Cox model, and the reference dataset into the model. We used 2016 cancer incidence rates in white individuals of the SEER*Stat database⁵⁰. Site-specific cancer incidence rates were obtained and then added to get the overall incidence rates for any cancer included in our study. Cancer incidence rates for a given age and sex were determined by the following year’s cancer incidence rates. For instance, in our study, cancer incidence rates for females aged 50–51 will correspond to SEER*Stat’s cancer incidence rates for females aged 51–52. This is to account for the fact that the DETECT-A test was performed at study enrollment, and the female participants were followed up over the course of 12 months. DETECT-A and Galleri will both be used to detect cancers early, prior to conventional diagnosis. The reference dataset was obtained by simulating 10,000 samples representative of the underlying UKBB population using the normal distribution with PCRS or cancer-specific risk score mean and standard deviation.

DETECT-A study reported an overall sensitivity of 27.1% at 98.9% specificity and an empirical PPV value of 19.4% (95% CI: 13.1–27.1%)²⁴. We wanted to select a time window for absolute risk estimation so that the PPV for females aged 65–75 is equal to the point estimate of 19.4% reported in the DETECT-A study²⁴. We varied the time window by one month around one year and calculated the weighted average PPV for females aged 65–75 based on the UKBB PCRS distribution and age distribution as reported by the US Census Bureau⁵¹. We found that a time window of 11 months provided the best match for the overall PPV for the 65–75 group to the empirically determined PPV value of 19.4%. Thus, subsequently, we calculated PPV and NPV for different age and PCRS risk groups based on underlying 11-month absolute risk.

Galleri reported an overall sensitivity of 51.5% at 99.5% specificity. For Galleri, we used a time window of 1-year^21,26. For DETECT-A, we omit the calculation of projected PPVs and NPVs for males as it does not include prostate cancer (highest incident cancer for males) as one of the detectable cancer types⁵⁰.

Given the absolute risk estimate, x, the positive predictive value and negative predictive value of the multicancer liquid biopsy test can be calculated using the formula below:

$$Se = sensitivity;Sp = specificity$$

$$PPV(x) = \frac{{Se \times p(x)}}{{Se \times p(x) + \left( {1 - Sp} \right) \times \left( {1 - p(x)} \right)}}$$

$$NPV(x) = \frac{{Sp \times (1 - p(x))}}{{\left( {1 - Se} \right) \times p(x) + Sp \times \left( {1 - p(x)} \right)}}$$

The absolute risk estimate can be written as a function of age and risk factors. We assumed that the sensitivity and specificity of the multicancer liquid biopsy test do not depend on the underlying risk factors, and we used the value of these as reported from the DETECT-A and Galleri study (Supplementary Table 1)^24,26.

This study was conducted under UK Biobank Application Number 17712 (PI: Dr. Nilanjan Chatterjee). The study analyzes existing UK Biobank data and does not involve new human research participants. UK Biobank was approved by the North West Multi-center Research Ethics Committee (https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

UK Biobank data are available through an application to the UK Biobank Access Management System (AMS), https://www.ukbiobank.ac.uk/enable-your-research/register. GWAS summary statistics of the single-nucleotide polymorphisms (SNPs) used for polygenic risk score (PRS) construction for each trait is available in the Supplementary Data section with relevant source literature.

Code availability

This project was developed using R version 4.2.2. and the codes used for the project are available from Github at https://github.com/eswk-im/PCRSanalysis-.git.

References

Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2017. Ca. Cancer J. Clin. 67, 7–30 (2017).
Article PubMed Google Scholar
Hiom, S. C. Diagnosing cancer earlier: reviewing the evidence for improving cancer survival. Br. J. Cancer 112, S1–S5 (2015).
Article PubMed PubMed Central Google Scholar
Mankaney, G., Sutton, R. A. & Burke, C. A. Colorectal cancer screening: choosing the right test. Cleve. Clin. J. Med. 86, 385–392 (2019).
Article PubMed Google Scholar
US Preventive Services Task Force. et al. Screening for colorectal cancer: US preventive services task force recommendation statement. JAMA 315, 2564 (2016).
Article Google Scholar
Lehman, C. D. et al. National performance benchmarks for modern screening digital mammography: update from the breast cancer surveillance consortium. Radiology 283, 49–58 (2017).
Article PubMed Google Scholar
Nelson, H. D. et al. Screening for Breast Cancer: A Systematic Review to Update the 2009 U.S. Preventive Services Task Force Recommendation. (Agency for Healthcare Research and Quality (US), 2016).
Jonas, D. E. et al. Screening for lung cancer with low-dose computed tomography: updated evidence report and systematic review for the US preventive services task force. JAMA 325, 971 (2021).
Article PubMed Google Scholar
Humphrey, L. et al. Screening for Lung Cancer: Systematic Review to Update the U.S. Preventive Services Task Force Recommendation. (Agency for Healthcare Research and Quality (US), 2013).
PDQ Screening and Prevention Editorial Board. Cervical Cancer Screening (PDQ®): Health Professional Version. in PDQ Cancer Information Summaries (National Cancer Institute (US), 2002).
Melnikow, J. et al. Screening for cervical cancer with high-risk human papillomavirus testing: updated evidence report and systematic review for the US preventive services task force. JAMA 320, 687–705 (2018).
Article PubMed Google Scholar
Brown, M. L. et al. Challenges in meeting Healthy People 2020 objectives for cancer-related preventive services, National Health Interview Survey, 2008 and 2010. Prev. Chronic Dis. 11, E29 (2014).
Article PubMed PubMed Central Google Scholar
Lopez-Olivo, M. A. et al. Patient adherence to screening for lung cancer in the US: a systematic review and meta-analysis. JAMA Netw. Open 3, e2025102 (2020).
Article PubMed PubMed Central Google Scholar
Vijan, S. Adherence to colorectal cancer screening: a randomized clinical trial of competing strategies. Arch. Intern. Med. 172, 575 (2012).
Article PubMed PubMed Central Google Scholar
Nelson, H. D. et al. Harms of breast cancer screening: systematic review to update the 2009 U.S. preventive services task force recommendation. Ann. Intern. Med. 164, 256–267 (2016).
Article PubMed Google Scholar
Irvin, V. L. et al. Comparison of mortality among participants of women’s health initiative trials with screening-detected breast cancers vs interval breast cancers. JAMA Netw. Open 3, e207227 (2020).
Article PubMed PubMed Central Google Scholar
Mattox, A. K. et al. Applications of liquid biopsies for cancer. Sci. Transl. Med. 11, eaay1984 (2019).
Article PubMed Google Scholar
Cescon, D. W., Bratman, S. V., Chan, S. M. & Siu, L. L. Circulating tumor DNA and liquid biopsy in oncology. Nat. Cancer 1, 276–290 (2020).
Article CAS PubMed Google Scholar
Cohen, J. D. et al. Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science 359, 926–930 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cristiano, S. et al. Genome-wide cell-free DNA fragmentation in patients with cancer. Nature 570, 385–389 (2019).
Article CAS PubMed PubMed Central Google Scholar
Mouliere, F. et al. Enhanced detection of circulating tumor DNA by fragment size analysis. Sci. Transl. Med. 10, eaat4921 (2018).
Article PubMed PubMed Central Google Scholar
Liu, M. C. et al. Sensitive and specific multi-cancer detection and localization using methylation signatures in cell-free DNA. Ann. Oncol. 31, 745–759 (2020).
Article CAS PubMed Google Scholar
Jiang, T., Ren, S. & Zhou, C. Multi-cancer blood testing combined with PET-CT: road for hope to screen for cancer and guide intervention. Signal Transduct. Target. Ther. 5, 95 (2020).
Article PubMed PubMed Central Google Scholar
Chen, X. et al. Non-invasive early detection of cancer four years before conventional diagnosis using a blood test. Nat. Commun. 11, 3475 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lennon, A. M. et al. Feasibility of blood testing combined with PET-CT to screen for cancer and guide intervention. Science 369, eabb9601 (2020).
Article CAS PubMed PubMed Central Google Scholar
Phallen, J. et al. Direct detection of early-stage cancers using circulating tumor DNA. Sci. Transl. Med. 9, eaan2415 (2017).
Article PubMed PubMed Central Google Scholar
Klein, E. A. et al. Clinical validation of a targeted methylation-based multi-cancer early detection test using an independent validation set. Ann. Oncol. J. Eur. Soc. Med. Oncol. 32, 1167–1177 (2021).
Article CAS Google Scholar
Foda, Z. H. et al. Detecting liver cancer using cell-free DNA fragmentomes. Cancer Discov. 13, 616–631 (2023).
Article CAS PubMed Google Scholar
Mathios, D. et al. Detection and characterization of lung cancer using cell-free DNA fragmentomes. Nat. Commun. 12, 5060 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ueberroth, B. E., Marks, L. A., Borad, M. J. & Agrwal, N. Multicancer early detection panels (MCEDs) in the primary care setting. Am. J. Med. 135, e145–e149 (2022).
Article PubMed Google Scholar
Zhu, M. et al. Genetic risk for overall cancer and the benefit of adherence to a healthy lifestyle. Cancer Res. 81, 4618–4627 (2021).
Article CAS PubMed Google Scholar
U.S. Cancer Statistics Working Group. U.S. Cancer Statistics Data Visualizations Tool, based on 2019 submission data (1999-2017): U.S. Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute; www.cdc.gov/cancer/dataviz, released in June 2020.
Cox, D. R. Regression models and life-tables. J. R. Stat. Soc. Ser. B Methodol. 34, 187–202 (1972).
Google Scholar
Xie, M. et al. Age-related mutations associated with clonal hematopoietic expansion and malignancies. Nat. Med. 20, 1472–1478 (2014).
Article CAS PubMed PubMed Central Google Scholar
McKerrell, T. et al. Leukemia-associated somatic mutations drive distinct patterns of age-related clonal hemopoiesis. Cell Rep. 10, 1239–1245 (2015).
Article CAS PubMed PubMed Central Google Scholar
Genovese, G. et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl. J. Med. 371, 2477–2487 (2014).
Article PubMed PubMed Central Google Scholar
Leal, A. et al. White blood cell and cell-free DNA analyses for detection of residual disease in gastric cancer. Nat. Commun. 11, 525 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, S. et al. Prediction of germline mutations and cancer risk in the Lynch syndrome. JAMA 296, 1479–1487 (2006).
Article CAS PubMed PubMed Central Google Scholar
Lee, A. et al. BOADICEA: a comprehensive breast cancer risk prediction model incorporating genetic and nongenetic risk factors. Genet. Med. J. Am. Coll. Med. Genet. 21, 1708–1718 (2019).
Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Article CAS PubMed Google Scholar
Lambert, S. A. et al. The Polygenic Score Catalog: an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425 (2021).
Article CAS PubMed Google Scholar
National Center for Biotechnology Information. National Library of Medicine. PubMed. https://pubmed.ncbi.nlm.nih.gov (2021).
Ollier, W., Sprosen, T. & Peakman, T. UK Biobank: from concept to reality. Pharmacogenomics 6, 639–646 (2005).
Article PubMed Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
1000 Genomes Project Consortium. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Choi, S. W. & O’Reilly, P. F. PRSice-2: Polygenic Risk Score software for biobank-scale data. GigaScience 8, giz082 (2019).
Article PubMed PubMed Central Google Scholar
UK Biobank. Data providers and dates of data availability. https://biobank.ndph.ox.ac.uk/ukb/exinfo.cgi?src=Data_providers_and_dates (2021).
Pal Choudhury, P. et al. iCARE: An R package to build, validate and apply absolute risk models. PLoS One 15, e0228198 (2020).
Article CAS PubMed PubMed Central Google Scholar
Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database: Incidence - SEER 21 Regs Limited-Field Research Data + Hurricane Katrina Impacted Louisiana Cases, Nov 2018 Sub (2000-2016) <Katrina/Rita Population Adjustment> - Linked To County Attributes - Total U.S., 1969–2017 Counties, National Cancer Institute, DCCPS, Surveillance Research Program, released April 2019, based on the November 2018 submission.
Bureau, U. C. Age and Sex Composition in the United States: 2019. The United States Census Bureau. https://www.census.gov/data/tables/2019/demo/age-and-sex/2019-age-sex-composition.html (2021).

Download references

Acknowledgements

This research has been conducted using the UK Biobank Resource under Application Number 6 17712. This work was supported in part by was supported by grants from the National Human Genome Research Institute [1 R01 HG010480-01] and the National Cancer Institute [1 1U01CA249866-01, CA121113, CA233259], Dr. Miriam and Sheldon G. Adelson Medical Research Foundation, the Gray Foundation, the Commonwealth Foundation, and the Cole Foundation. R.B.S. is a founder and consultant of Delfi Diagnostics. We thank members of our laboratories and Dr. Bert Vogelstein (Ludwig Center at Johns Hopkins University School of Medicine) for his comments on the manuscript.

Author information

Authors and Affiliations

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
Elle S. Kim, Robert B. Scharpf & Nilanjan Chatterjee
Department of Oncology, The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
Robert B. Scharpf, Kala Visvanathan, Victor E. Velculescu & Nilanjan Chatterjee
Division of Cancer Epidemiology and Genetics, National Cancer Institute of Health, 9609 Medical Center Drive 7E-342, Rockville, MD, 20850, USA
Montserrat Garcia-Closas
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
Kala Visvanathan

Authors

Elle S. Kim
View author publications
You can also search for this author in PubMed Google Scholar
Robert B. Scharpf
View author publications
You can also search for this author in PubMed Google Scholar
Montserrat Garcia-Closas
View author publications
You can also search for this author in PubMed Google Scholar
Kala Visvanathan
View author publications
You can also search for this author in PubMed Google Scholar
Victor E. Velculescu
View author publications
You can also search for this author in PubMed Google Scholar
Nilanjan Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.S.K. and N.C. conceived the study design and drafted the manuscripts. E.S.K. did all the data analyses under the supervision of N.C. All other co-authors reviewed and approved the final version of the paper.

Corresponding author

Correspondence to Nilanjan Chatterjee.

Ethics declarations

Competing interests

K.V. currently receives research funding from Cepheid and collaborates with Optra Health. R.B.S. is a co-founder of and holds equity in Delfi Diagnostics. He also serves as a consultant. V.E.V. is a founder of Delfi Diagnostics, serves on the Board of Directors and as an officer for this organization, and owns Delfi Diagnostics stock, which is subject to certain restrictions under university policy. Additionally, Johns Hopkins University owns equity in Delfi Diagnostics. V.E.V. divested his equity in Personal Genome Diagnostics (PGDx) to LabCorp in February 2022. V.E.V. is an inventor on patent applications submitted by Johns Hopkins University related to cancer genomic analyses and cell-free DNA for cancer detection that have been licensed to one or more entities, including Delfi Diagnostics, LabCorp, Qiagen, Sysmex, Agios, Genzyme, Esoterix, Ventana and ManaT Bio. Under the terms of these license agreements, the University and inventors are entitled to fees and royalty distributions. V.E.V. is an advisor to Viron Therapeutics and Epitope. These arrangements have been reviewed and approved by the Johns Hopkins University in accordance with its conflict-of-interest policies. These arrangements have been reviewed and approved by Johns Hopkins University in accordance with its conflict-of-interest policies. The remaining authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data

Supplementary Information

REPORTING SUMMARY

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, E.S., Scharpf, R.B., Garcia-Closas, M. et al. Potential utility of risk stratification for multicancer screening with liquid biopsy tests. npj Precis. Onc. 7, 39 (2023). https://doi.org/10.1038/s41698-023-00377-w

Download citation

Received: 15 December 2022
Accepted: 30 March 2023
Published: 22 April 2023
DOI: https://doi.org/10.1038/s41698-023-00377-w