The potential of polygenic scores to improve cost and efficiency of clinical trials

Fahed, Akl C.; Philippakis, Anthony A.; Khera, Amit V.

doi:10.1038/s41467-022-30675-z

Comment
Open access
Published: 25 May 2022

Nature Communications volume 13, Article number: 2922 (2022)

Polygenic scores can identify individuals with high disease risk based on inborn DNA variation. We explore their potential to enrich clinical trials by identifying individuals based on higher risk of disease (‘prognostic enrichment’), or increased probability of benefit (‘predictive enrichment’).

Clinical trials typically study rates of disease in participants randomized to a placebo or a given intervention, serving two primary purposes—first, to provide ‘gold standard’ evidence of efficacy and safety needed to obtain regulatory approval; and second, to demonstrate adequate benefit to convince clinicians and payers to use the drug within clinical practice. Because such trials for common diseases often require tens of thousands of participants followed for several years, the typical cost is $350 million, out of reach for all but the largest pharmaceutical companies or governmental agencies¹.

One important approach to increase clinical trial efficiency is to selectively enroll participants based on clinical or molecular characteristics². Guidance from the U.S. Food and Drug Administration outlines two distinct conceptual approaches for enrichment. The first, termed ‘prognostic enrichment,’ aims to increase statistical power, and thus decrease sample size and cost, by increasing the proportion of patients likely to demonstrate disease onset or progression. Taking COVID-19 vaccine trials as an example, Moderna and other sponsors selectively enrolled participants in areas where the virus was rapidly spreading to more quickly demonstrate benefit³. For a new cholesterol-lowering therapy designed to prevent heart attack and stroke, the pivotal trial enrolled only those with preexisting cardiovascular disease based on data that the event rates in these individuals are much higher⁴. The second, termed ‘predictive enrichment,’ aims to enroll participants who are more likely to derive an outsized benefit from the trial intervention. The demonstration that patients whose lung cancers carried specific gain-of-function mutations in the epidermal growth factor receptor responded to an inhibitor of that receptor’s signaling, while those without such mutations did not, inspired a new era in oncologic development in which predictive enrichment using molecular profiling has substantially reduced development cost and duration²,⁵.

Despite the frequent use of enrichment strategies, clinical trials still often fail to achieve their aim of allowing the intervention to gain regulatory approval and adoption in clinical practice. These (costly) failures are particularly common when low event rates preclude the preferred trial design or the existing standard of care is already good, thus making the demonstration of a meaningful improvement with a new drug more challenging. For conditions such as Alzheimer’s dementia, enrollment of participants late in the disease process, which aims to increase event rates via prognostic enrichment, is often cited as a potential reason for the failures that have occurred even when the therapeutic target is believed to be pathophysiologically sound, as was the case for an antibody designed to clear amyloid plaques from the brain⁶,⁷. In cardiovascular disease, a powerful cholesterol-lowering medicine reduced the frequency of clinical events from 11.8% to 10.8% compared to placebo, achieving its primary endpoint with a compelling degree of statistical confidence (p = 0.004), but this effect size was deemed inadequate to justify pursuing its commercialization⁸.

Given these challenges in clinical trial design and execution, are genetic enrichment strategies using ‘polygenic scores’ worthwhile to consider?

The traditional approach to genetic risk stratification has focused on identifying the small subset of the population with rare monogenic mutations that substantially increase risk via disruption of a specific biologic pathway. More recently, polygenic scores, which instead consider the cumulative impact of many common DNA variants scattered across the genome, have gained traction as a promising approach with relevance for much larger subsets of the population. Although polygenic scores were initially proposed for applications in plant and animal breeding, newer-generation scores have considerable predictive capacity across a range of important common diseases⁹,¹⁰,¹¹. This stratification allows for the identification of individuals (as early as birth) whose inborn DNA variation places them on a markedly accelerated trajectory of disease onset. For coronary artery disease, we demonstrated that up to 8% of the population inherits triple the normal risk based on genetic variation alone, and these high-risk individuals cannot be reliably identified with traditional risk factors or family history¹⁰.
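
To make the idea of a cumulative score concrete: a polygenic score is commonly computed as a weighted sum of risk-allele counts across variants. The sketch below is illustrative only. The variant identifiers, weights, and five-person cohort are hypothetical and are not taken from any published score, and real scores typically span thousands to millions of variants.

```python
# Hypothetical per-variant weights (for example, log odds ratios from a
# genome-wide association study). IDs and values are made up for illustration.
WEIGHTS = {"rsA": 0.12, "rsB": -0.05, "rsC": 0.30, "rsD": 0.08}


def polygenic_score(dosages: dict[str, float]) -> float:
    """Weighted sum of risk-allele dosages (0, 1, or 2 copies per variant)."""
    return sum(weight * dosages.get(variant, 0.0)
               for variant, weight in WEIGHTS.items())


def high_risk_ids(scores: dict[str, float], fraction: float = 0.2) -> set[str]:
    """IDs of the top `fraction` of the cohort when ranked by score."""
    n_top = max(1, round(len(scores) * fraction))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return set(ranked[:n_top])


# Hypothetical cohort: person ID -> allele dosage at each variant.
cohort = {
    "p1": {"rsA": 2, "rsB": 0, "rsC": 2, "rsD": 1},
    "p2": {"rsA": 0, "rsB": 2, "rsC": 0, "rsD": 0},
    "p3": {"rsA": 1, "rsB": 1, "rsC": 1, "rsD": 1},
    "p4": {"rsA": 0, "rsB": 0, "rsC": 1, "rsD": 0},
    "p5": {"rsA": 1, "rsB": 0, "rsC": 0, "rsD": 2},
}

scores = {person: polygenic_score(d) for person, d in cohort.items()}
top_quintile = high_risk_ids(scores, fraction=0.2)
print(scores)
print("Top quintile:", top_quintile)
```

With these made-up inputs, a single individual falls in the top 20% of the score distribution. In practice the threshold would be set against a large reference population rather than the trial cohort itself.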

Post hoc analyses of clinical trials involving cholesterol-lowering therapies for cardiovascular disease have suggested that polygenic scores hold promise as a powerful enrichment strategy. Among healthy individuals randomized to statin or placebo to prevent cardiovascular disease, those with the highest polygenic score demonstrated the greatest benefit¹²,¹³. This benefit was related to both prognostic enrichment (the rate of developing heart disease in the placebo group was 19.6% for those in the top quintile of the score versus 12.9% in all others) and predictive enrichment, where a 44% relative risk reduction was noted for those with a high score versus only 24% in the remainder of the participants¹³.
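
The two forms of enrichment compound when expressed as absolute benefit. The arithmetic below takes the placebo event rates and relative risk reductions quoted above and derives the absolute risk reduction and number needed to treat in each stratum. These derived figures are a back-of-envelope illustration, not values reported in the cited analysis.

```python
from math import ceil


def absolute_risk_reduction(placebo_rate: float, relative_risk_reduction: float) -> float:
    """Absolute drop in event rate implied by a relative risk reduction."""
    return placebo_rate * relative_risk_reduction


def number_needed_to_treat(arr: float) -> int:
    """People treated over the follow-up period to prevent one event (rounded up)."""
    return ceil(1 / arr)


# (placebo event rate, relative risk reduction) as quoted in the text.
strata = {
    "top quintile of score": (0.196, 0.44),
    "all other participants": (0.129, 0.24),
}

results = {}
for name, (placebo_rate, rrr) in strata.items():
    arr = absolute_risk_reduction(placebo_rate, rrr)
    results[name] = (arr, number_needed_to_treat(arr))
    print(f"{name}: ARR = {arr:.1%}, NNT = {results[name][1]}")
```

On these inputs, the high-score group has an absolute risk reduction of about 8.6 percentage points versus about 3.1 in the remainder, which corresponds to roughly 12 versus 33 people treated over the trial's follow-up to prevent one event.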

This observation from statin trials was later extended to two trials focused on preventing a second cardiovascular event in those with existing disease using powerful (and expensive) new injectable medications, where those with the highest polygenic score again derived the greatest benefit due to both prognostic and predictive enrichment¹⁴,¹⁵. This analysis suggests that, had it been possible to predict this enrichment in advance, the trials could have successfully demonstrated benefit with substantially fewer participants (Fig. 1). In this specific case, we estimate that a trial that enrolled only those participants in the top quintile of the polygenic score might have required only 2360 participants (a greater than 90% reduction from the 27,564 studied) and demonstrated a 31% relative risk reduction as compared to the 20% observed in the overall trial population. For a drug class that faced post-approval access challenges, initial commercialization for a subset of the population who derived greater benefit may have enhanced clinical uptake, perceived cost-effectiveness, and overall public health impact.

**Fig. 1: Power and sample size estimation using prognostic or predictive model for polygenic score enrichment.**
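
The sample-size effect of enrichment can be illustrated with the standard normal-approximation formula for comparing two proportions. This is a simplified sketch and not the authors' method: it treats the outcome as a binary event over a fixed follow-up rather than as time-to-event, and the placebo event rates (10% overall, 15% in the enriched group) are assumptions chosen for illustration. Only the 20% and 31% relative risk reductions come from the text, so the output is not expected to reproduce the 2360-participant estimate.

```python
from math import ceil
from statistics import NormalDist


def participants_per_arm(placebo_rate: float, relative_risk_reduction: float,
                         alpha: float = 0.05, power: float = 0.9) -> int:
    """Per-arm sample size for a two-sided test comparing two proportions."""
    treated_rate = placebo_rate * (1 - relative_risk_reduction)
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided significance
    z_beta = NormalDist().inv_cdf(power)           # desired power
    variance = (placebo_rate * (1 - placebo_rate)
                + treated_rate * (1 - treated_rate))
    effect = placebo_rate - treated_rate
    return ceil((z_alpha + z_beta) ** 2 * variance / effect ** 2)


# Event rates are hypothetical; the relative risk reductions are from the text.
n_unselected = participants_per_arm(placebo_rate=0.10, relative_risk_reduction=0.20)
n_prognostic = participants_per_arm(placebo_rate=0.15, relative_risk_reduction=0.20)
n_both = participants_per_arm(placebo_rate=0.15, relative_risk_reduction=0.31)

print("Unselected population:", 2 * n_unselected, "participants")
print("Prognostic enrichment only:", 2 * n_prognostic, "participants")
print("Prognostic + predictive enrichment:", 2 * n_both, "participants")
```

Under these assumptions, the higher event rate (prognostic enrichment) and the larger relative effect (predictive enrichment) each shrink the required enrollment, and varying the two arguments separately shows how much each contributes.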

We believe that polygenic risk estimation will play an important role in the future of clinical medicine, enabling targeted screening or prevention strategies to overcome inherited predisposition, and warrants consideration as an enrichment strategy for clinical trials as well. Although requiring a genetic test as an inclusion criterion for a given trial creates a potential hurdle to recruitment, such testing requirements have become common within clinical medicine for use cases ranging from targeted cancer therapies and drugs for cystic fibrosis that work only among those with a given genetic mutation in the CFTR gene to a potent fish oil formulation that is approved only for those with high circulating triglyceride levels²,¹⁶,¹⁷. Compelling use cases might include primary prevention trials where traditional approaches would require a clinical trial that is intractable owing to the very large sample size and long follow-up that would be necessary to show benefit. Beyond conditions such as Alzheimer’s dementia discussed above, an additional public health need relates to nonalcoholic fatty liver disease, which affects up to 20% of the world’s population and is the leading risk factor for liver cirrhosis or cancer. Trials for this condition have been challenging to conduct because only a small fraction of afflicted individuals progress to more advanced disease in a given year¹⁸. We and others have recently developed polygenic scores for this condition, laying the scientific foundation for a new generation of trials that incorporate genetic enrichment strategies¹⁹.

Alongside the considerable (and warranted) enthusiasm for the use of polygenic scores to meaningfully enhance clinical development, several potential barriers warrant discussion. First, the predictive capacity of a polygenic score is limited by heritability (proportion of risk explained by common DNA variants) and scores may not have adequate ability to stratify risk for some conditions²⁰. Second, although in principle polygenic scores can be assessed for less than US$100, few patients currently have access to them and few healthcare systems offer them clinically, posing a logistical challenge for trial enrollment or medication prescribing. Third, current polygenic scores are typically associated with increased risk across all ancestries, but with an effect size that is highest in those of European ancestry (primarily due to lack of adequate training data in other groups)²¹,²². Fourth, most scores developed to date are based on case-control datasets for a given disease; additional work is needed to determine whether the genetic basis of disease progression meaningfully differs from disease onset and whether ‘pathway-specific’ scores may provide more reliable predictive enrichment²³,²⁴. Fifth, an approach that integrates polygenic risk with additional rare genetic or non-genetic factors such as clinical characteristics or biomarker concentrations is likely to outperform strategies based on a polygenic score alone, but few such algorithms have been developed to date²⁵,²⁶. Sixth, the regulatory guidelines surrounding polygenic score use in clinical development have not been fully articulated, and scores are likely to evolve over time due to a lack of accepted standards to evaluate performance and reproducibility, which increases the risk to a sponsor of obtaining an approved drug label tied to a given score. Seventh, most investigations of polygenic scores in clinical trials are post hoc analyses, and prospective implementation may still face logistical and scientific challenges that would need to be solved.

These barriers notwithstanding, the high cost of clinical trials has emerged as arguably the single biggest obstacle to the development of innovations that may well have substantial public health benefit, and strategies that could meaningfully alter this landscape warrant serious consideration²⁷. As observed in trials of cholesterol-lowering therapies, polygenic scores hold the potential to enable substantial predictive or prognostic enrichment and could have a deep impact on enabling a new era in clinical development.

References

1. Moore, T. J., Zhang, H., Anderson, G. & Alexander, G. C. Estimated costs of pivotal trials for novel therapeutic agents approved by the US Food and Drug Administration, 2015–2016. JAMA Intern. Med. 178, 1451–1457 (2018).
2. U.S. Food and Drug Administration. Enrichment strategies for clinical trials to support approval of human drugs and biological products. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/enrichment-strategies-clinical-trials-support-approval-human-drugs-and-biological-products (2019).
3. Baden, L. R. et al. Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine. N. Engl. J. Med. 384, 403–416 (2021).
4. Sabatine, M. S. et al. Evolocumab and clinical outcomes in patients with cardiovascular disease. N. Engl. J. Med. 376, 1713–1722 (2017).
5. Lynch, T. J. et al. Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N. Engl. J. Med. 350, 2129–2139 (2004).
6. McDade, E. & Bateman, R. J. Stop Alzheimer’s before it starts. Nature 547, 153 (2017).
7. Honig, L. S. et al. Trial of solanezumab for mild dementia due to Alzheimer’s disease. N. Engl. J. Med. 378, 321–330 (2018).
8. Bowman, L. et al. Effects of anacetrapib in patients with atherosclerotic vascular disease. N. Engl. J. Med. 377, 1217–1227 (2017).
9. Lande, R. & Thompson, R. Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124, 743–756 (1990).
10. Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
11. Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. https://doi.org/10.1038/s41576-018-0018-x (2018).
12. Mega, J. L. et al. Genetic risk, coronary heart disease events, and the clinical benefit of statin therapy: An analysis of primary and secondary prevention trials. Lancet 385, 2264–2271 (2015).
13. Natarajan, P. et al. Polygenic risk score identifies subgroup with higher burden of atherosclerosis and greater relative benefit from statin therapy in the primary prevention setting. Circulation 135, 2091–2101 (2017).
14. Marston, N. A. et al. Predicting benefit from evolocumab therapy in patients with atherosclerotic disease using a genetic risk score. Circulation 141, 616–623 (2020).
15. Damask, A. et al. Patients with high genome-wide polygenic risk scores for coronary artery disease may receive greater clinical benefit from alirocumab treatment in the ODYSSEY OUTCOMES trial. Circulation 141, 624–636 (2020).
16. Collins, F. S. Realizing the dream of molecularly targeted therapies for cystic fibrosis. N. Engl. J. Med. 381, 1863–1865 (2019).
17. Bhatt, D. L. et al. Cardiovascular risk reduction with icosapent ethyl for hypertriglyceridemia. N. Engl. J. Med. 380, 11–22 (2019).
18. Loomba, R., Friedman, S. L. & Shulman, G. I. Mechanisms and disease consequences of nonalcoholic fatty liver disease. Cell 184, 2537–2564 (2021).
19. Haas, M. E. et al. Machine learning enables new insights into genetic contributions to liver fat accumulation. Cell Genom. https://doi.org/10.1016/j.xgen.2021.100066 (2021).
20. Zhang, Y., Qi, G., Park, J. H. & Chatterjee, N. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits. Nat. Genet. 50, 1318–1326 (2018).
21. Fahed, A. C. et al. Transethnic transferability of a genome-wide polygenic score for coronary artery disease. Circ. Genomic Precis. Med. https://doi.org/10.1161/CIRCGEN.120.003092 (2021).
22. Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019).
23. Liu, G. et al. Genome-wide survival study identifies a novel synaptic locus and polygenic score for cognitive progression in Parkinson’s disease. Nat. Genet. 53, 787–793 (2021).
24. McCarthy, M. I. Painting a new picture of personalised medicine for diabetes. Diabetologia 60, 793–799 (2017).
25. Fahed, A. C. et al. Polygenic background modifies penetrance of monogenic variants for tier 1 genomic conditions. Nat. Commun. 11, 3635 (2020).
26. Lee, A. et al. BOADICEA: A comprehensive breast cancer risk prediction model incorporating genetic and nongenetic risk factors. Genet. Med. 21, 1708–1718 (2019).
27. Moscicki, R. A. & Tandon, P. K. Drug-development challenges for small biopharmaceutical companies. N. Engl. J. Med. 376, 469–474 (2017).
28. Sabatine, M. S. et al. Rationale and design of the further cardiovascular outcomes research with PCSK9 inhibition in subjects with elevated risk trial. Am. Heart J. 173, 94–101 (2016).

Acknowledgements

Funding support was provided by grants 1K08HG010155 and 1U01HG011719 from the National Human Genome Research Institute (A.V.K.), a Hassenfeld Scholar Award from Massachusetts General Hospital (A.V.K.), a Merkin Institute Fellowship from the Broad Institute of MIT and Harvard (A.V.K.), a sponsored research agreement from Bayer (A.A.P.), the Eric and Wendy Schmidt Center (A.A.P.), and a sponsored research agreement from IBM Research (A.V.K. & A.A.P.).

Author information

Authors and Affiliations

Division of Cardiology and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Akl C. Fahed & Amit V. Khera
Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Akl C. Fahed & Amit V. Khera
Department of Medicine, Harvard Medical School, Boston, MA, USA
Akl C. Fahed & Amit V. Khera
Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Anthony A. Philippakis
Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
Anthony A. Philippakis
Verve Therapeutics, Cambridge, MA, USA
Amit V. Khera

Authors

Akl C. Fahed
Anthony A. Philippakis
Amit V. Khera

Contributions

A.C.F., A.A.P., and A.V.K. jointly drafted the manuscript and critically revised the manuscript for intellectual content.

Corresponding author

Correspondence to Amit V. Khera.

Ethics declarations

Competing interests

A.C.F. is a consultant to and holds equity in Goodpath. A.A.P. is employed as a Venture Partner at GV, a subsidiary of Alphabet Corporation. A.V.K. is an employee of and holds equity in Verve Therapeutics; has served as a scientific advisor to Amgen, Maze Therapeutics, Navitor Pharmaceuticals, Sarepta Therapeutics, Novartis, Silence Therapeutics, Korro Bio, Veritas International, Color Health, Third Rock Ventures, Illumina, Foresite Labs, and Columbia University (NIH); received speaking fees from Illumina, MedGenome, Amgen, and the Novartis Institute for Biomedical Research; received a sponsored research agreement from IBM Research; and is listed as a co-inventor on a patent application for use of imaging data in assessing body fat distribution and associated cardiometabolic risk.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Fahed, A.C., Philippakis, A.A. & Khera, A.V. The potential of polygenic scores to improve cost and efficiency of clinical trials. Nat Commun 13, 2922 (2022). https://doi.org/10.1038/s41467-022-30675-z

Received: 16 August 2021
Accepted: 09 May 2022
Published: 25 May 2022
DOI: https://doi.org/10.1038/s41467-022-30675-z
