Failed trials for central nervous system disorders do not necessarily invalidate preclinical models and drug targets

Bespalov, Anton; Steckler, Thomas; Altevogt, Bruce; Koustova, Elena; Skolnick, Phil; Deaver, Daniel; Millan, Mark J.; Bastlund, Jesper F.; Doller, Dario; Witkin, Jeffrey; Moser, Paul; O'Donnell, Patricio; Ebert, Ulrich; Geyer, Mark A.; Prinssen, Eric; Ballard, Theresa; Macleod, Malcolm

doi:10.1038/nrd.2016.88

Download PDF

Correspondence
Published: 17 June 2016

Failed trials for central nervous system disorders do not necessarily invalidate preclinical models and drug targets

Anton Bespalov^nAff14,
Thomas Steckler¹,
Bruce Altevogt²,
Elena Koustova³,
Phil Skolnick³,
Daniel Deaver⁴,
Mark J. Millan⁵,
Jesper F. Bastlund⁶,
Dario Doller^nAff15,
Jeffrey Witkin⁷,
Paul Moser^nAff16,
Patricio O'Donnell⁸,
Ulrich Ebert⁹,
Mark A. Geyer¹⁰,
Eric Prinssen¹¹,
Theresa Ballard¹¹ &
…
Malcolm Macleod¹²

Nature Reviews Drug Discovery volume 15, page 516 (2016)Cite this article

6988 Accesses
51 Citations
12 Altmetric
Metrics details

Subjects

A recent article identified five key technical determinants that make substantial contributions to the outcome of drug R&D projects (Lessons learned from the fate of AstraZeneca's drug pipeline: a five-dimensional framework. Nat. Rev. Drug Discov. 13, 419–431 (2014))¹. Careful consideration of such determinants might be particularly valuable in the fields of neurology and psychiatry, in which successful drug development has declined precipitously over the past decade. This decline has largely been fuelled by a high failure rate in the translation of preclinical efficacy findings, caused by multiple factors (see Supplementary information S1 (table)), including limited training and poor protocol design, inadequate animal models, insufficiently validated therapeutic targets and problems with data handling and reporting.

Here, we focus on three factors that can be addressed immediately in order to re-evaluate the therapeutic potential of older drugs and targets and to increase the probability of success for future preclinical-to-clinical translation: data robustness, data generalizability and target engagement data, a factor that was also highlighted in the recent article¹. We argue that the many failed clinical trials in neuropsychiatry do not necessarily invalidate the potential of a drug target or an animal model. Rather, these failures indicate a need for improved experimental designs and a robust translational strategy to better inform compound and dose selection for clinical trials. We conclude that many of the drugs and targets in neuropsychiatry that have been discarded because of negative clinical trial outcomes may deserve re-evaluation using contemporary knowledge, methodology and tools.

Robustness

The problem of robustness in preclinical data is best illustrated by an example from research in amyotrophic lateral sclerosis (ALS), a severe progressive neurodegenerative disease. There is currently one approved medication for ALS, riluzole, which has only modest effects on survival. Numerous other drug candidates have reported efficacy in a superoxide dismutase 1 (SOD1) mouse model of ALS (one of the common animal models for this disease), but none of these candidates produced an efficacy signal in clinical trials. The ALS Therapy Development Institute later rigorously retested more than 100 of those molecules in the SOD1 model (using adequate statistical power, treatment groups matched for litter and gender, blinding, uniform end point criteria, tracking of non-ALS deaths and quantitative analysis of transgene copy number prior to assigning mice to a study), and they were unable to replicate any of the previously reported preclinical efficacy findings². In this context, the lack of clinical efficacy is not surprising.

Similar problems related to deficiencies in experimental design (such as inadequate blinding and randomization) had previously been observed in studies with animal models of stroke and multiple sclerosis³. The potential impact of such experimental design problems can be assessed retrospectively for drugs that have already been approved or abandoned, and steps can be taken to improve the robustness of experiments for drugs in development (see Supplementary information S2 (table)).

A hallmark of the scientific method is the replication of findings both within and between laboratories. However, such replication is limited by cost, human resources, time and bioethical considerations. Additionally, “there is an almost irresistible pressure to stop when the result is about what one expects it to be,” according to Terry Quinn⁴. However, the more novel the findings of an experiment appear, the less likely they are to be true⁵, especially in the context of poorly designed and underpowered studies. This problem reflects the pressure to publish novel findings in high-impact journals before being scooped by a rival laboratory or funding runs out. The conventional value attached to such publications for career advancement and future funding often outweighs the efforts required to rigorously challenge the novel findings; thus, verification in an independent laboratory is unlikely.

As the costs of clinical studies are so much higher than those of preclinical development, one might assume that pharmaceutical companies would conduct robust replications of key findings. This is indeed the case during lead optimization, candidate selection, testing different administration routes and the use of primary preclinical disease models. However, this is far less common for studies in the more complex disease models that are used in late stages of development and are potentially more relevant for predicting clinical efficacy.

In general, the rigorousness with which preclinical data is obtained — and the resulting robustness of the data — is quite low; few studies report randomization, blinding, sample size calculations or attrition.

Generalizability

Every laboratory has a unique combination of protocols, suppliers of tools and reagents, source of animals, and animal husbandry characteristics. As drugs are used in highly heterogeneous patient populations, efficacy observed in a single lab is more likely to be successfully translated when similar findings can also be obtained under different conditions in other laboratories. There is empirical evidence to support this assumption: the broader the range of circumstances and laboratory environments in which preclinical efficacy can be demonstrated, the higher the likelihood of detecting efficacy signals in clinical studies⁶. A recent European Union-funded initiative (the Multicentre Preclinical Animal Research Team (MultiPART)) established web-based platforms for multicentre animal studies, and the National Institute of Ageing supports an Interventions Testing Program that seeks to validate the efficacy of treatments for ageing across several test sites with adequately powered, rigorous experiments using genetically heterogeneous mice of both sexes.

Generalizability of preclinical data is not only an issue concerned with laboratory conditions and animal strains, age and sex. For example, there is a remarkable paucity of studies employing chronic or subchronic drug administration. Given that even a second dose of most drugs can alter the biological milieu (for example, tolerance, sensitization or receptor regulation), chronic dosing studies are important for generating the best predictions of effects in patients. It is largely unknown whether a compound's efficacy has been confirmed in properly designed studies with chronic administration in animals before initiation of a clinical trial⁷. This information is particularly important for those indications for which preclinical models do not require repeated administration to detect drug efficacy, while the treatment duration in clinical trials can range from weeks to several months, depending on the indication.

Target engagement

In the context of hypothesis-driven drug research, any observation of clinical efficacy is serendipitous if the molecule does not engage its biological targets at the dose tested. Various modelling tools can be used to assess target engagement, and direct target occupancy assays including positron emission tomography (PET) are increasingly available. The aim is to demonstrate that, at relevant doses, the drug is present in the same compartment as the target and in appropriate free concentrations to bind to it. PET studies demonstrate receptor occupancy but not targeted downstream effects; such approaches may not be appropriate for novel drugs working through allosteric or non-competitive molecular mechanisms, and for some targets PET tracers are not yet available.

Scientists at Pfizer considered evidence of exposure at the site of action, target binding and expression of functional pharmacological activity for 44 Phase II programmes across several therapeutic areas⁸. In 43% of cases, it was reported that the target mechanism had not been adequately tested owing to the lack of evidence of target engagement. Similarly, AstraZeneca reported that 40% of efficacy failures in Phase II projects could be attributed to a lack of clear target linkage to a disease or validated animal model, and 29% could be attributed to a lack of data establishing tissue exposure¹.

Even the use of cerebrospinal fluid concentrations to guide dosing may be misleading for intracellular targets or for compounds that are actively transported across the blood–brain barrier. Similar considerations may apply to antibody therapeutics. Key parameters required for brain penetration have been discussed⁹, and the failure of AZD8529 — a positive allosteric modulator of metabotropic glutamate receptor 2 (mGluR2) — in a Phase II study in schizophrenia has been attributed, in part, to unreliable target engagement¹.

We have used the Thomson Reuters Cortellis database to identify drug development projects in schizophrenia between 1994 and 2014; we could not find evidence of biomarker-driven dose selection for 80% of 72 novel drugs subjected to Phase II clinical proof-of-concept studies (Fig. 1).

**Figure 1: Analysis of the use of biomarkers in the development of novel treatments for schizophrenia.**

Path forward

Efforts to improve the robustness of preclinical data will lead to better study designs. Strengthening of publication policies and open access to data (including negative data) is an additional key way to improve data reliability and transparency. The Preclinical Data Forum was established with support from the European College of Neuropsychopharmacology (ECNP)¹⁰ and is developing an online platform to enable scientists to exchange unpublished data in a pre-competitive manner and to share knowledge on the use of tool compounds. This platform should facilitate disclosure of large amounts of pre-competitive information and should be paralleled by the development of consensus approaches to data robustness and demonstrating generalizability.

There may be compounds that have failed in clinical proof-of-concept studies owing to poor target engagement or that might be more appropriate for other diseases sharing similar mechanisms. The ECNP medicines chest provides a list of pharmacological tools no longer under development and can be used to obtain further clinical information on a particular target. Such target revalidation efforts should ideally occur in a pre-competitive space and may involve the development of new business models.

Finally, improved training on appropriate study designs will ensure that preclinical research is conducted to the highest standards, with appropriate protocol design informed by statistical and power analyses similar to those processes now standard for clinical trials.

Summary and conclusions

Herein, we challenge the widely held view that the high failure rate in neuropsychiatry trials invalidates both the drug targets chosen and the preclinical models used. We argue instead that the scientific community should attach greater importance to issues of data robustness, data generalizability and target engagement when designing preclinical studies. We do not seek to detract attention from the fundamental need to strengthen our scientific understanding of disease mechanisms, improve clinical testing strategies, and develop better disease models. Our premise is that an increase in robustness, generalizability and evidence of target engagement will increase the probability of successful translation of preclinical findings into Phase II efficacy. We are optimistic that the adoption of these approaches will enhance our ability to bring improved and much needed medicines to patients.

References

Cook, D. et al. Lessons learned from the fate of AstraZeneca's drug pipeline: a five-dimensional framework. Nat. Rev. Drug Discov. 13, 419–431 (2014).
Article CAS Google Scholar
Scott, S. et al. Design, power, and interpretation of studies in the standard murine model of ALS. Amyotroph. Lateral Scler. 9, 4–15 (2008).
Article CAS Google Scholar
van der Worp, H. B. et al. Can animal models of disease reliably inform human studies? PLoS Med. 7, e1000245 (2010).
Article Google Scholar
Quinn, T. Don't stop the quest to measure Big G. Nature 505, 455 (2014).
Article CAS Google Scholar
Ioannidis, J. P. Why most published research findings are false. PLoS Med. 2, e124 (2005).
Article Google Scholar
Richter, S. H. et al. Systematic variation improves reproducibility of animal experiments. Nat. Methods 7, 167–168 (2010).
Article CAS Google Scholar
Bespalov, A. et al. Drug tolerance: a known unknown in translational neuroscience. Trends Pharmacol. Sci, http://dx.doi.org/10.1016/j.tips.2016.01.008 (2016).
Morgan, P. et al. Can the flow of medicines be improved? Fundamental pharmacokinetic and pharmacological principles toward improving Phase II survival. Drug Discov. Today 17, 419–424 (2012).
Article CAS Google Scholar
Di, L. et al. Demystifying brain penetration in central nervous system drug discovery. J. Med. Chem. 56, 2–12 (2013).
Article CAS Google Scholar
Steckler, T. et al. The preclinical data forum network: A new ECNP initiative to improve data quality and robustness for (preclinical) neuroscience. Eur. Neuropsychopharmacol. 25, 1803–1807 (2015).
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank Martien Kas (Utrecht University), Michael Decker (AbbVie), Lynn Butler-David (Exciva), Martin Weber (Genentech), Jurgen Gottowik (Roche) and Katja Brose (Cell Press) for stimulating discussion and helpful comments.

Author information

Anton Bespalov
Present address: Neuroscience Research, AbbVie, 6706 Ludwigshafen, Germany. Present address: Partnership for Assessment and Accreditation of Scientific Practice, Am Aukopf 14/1, D-69118 Heidelberg, Germany and Institute of Pharmacology, Pavlov Medical University, 197022 St Petersburg, Russia.,
Dario Doller
Present address: Discovery Chemistry & DMPK; Lundbeck Research USA, Paramus, New Jersey 07652, USA. Present address: Concert Pharmaceuticals, Inc., 99 Hayden Avenue, Lexington, Massachusetts 02421, USA.,
Paul Moser
Present address: Pierre Fabre Research Institute, 81106 Castres, France. Present address: BIAL-Portela & Ca S.A., Avenida da Siderurgia Nacional, 4745–457 São Mamede do Coronado, Portugal.,

Authors and Affiliations

Janssen Research and Development, B-2340, Beerse, Belgium
Thomas Steckler
Global Policy and International Public Affairs, Pfizer, New York, 10017, New York, USA
Bruce Altevogt
National Institute on Drug Abuse, National Institutes of Health, Bethesda, 20892, Maryland, USA
Elena Koustova & Phil Skolnick
Non-Clinical Research and Development, Alkermes, Waltham, 02451, Massachusetts, USA
Daniel Deaver
Institut de Recherche Servier, 78290, Croissy sur Seine, France
Mark J. Millan
Neuroscience Research, H. Lundbeck A/S, Copenhagen, 2500, Valby, Denmark
Jesper F. Bastlund
Neuroscience Discovery Research, Lilly Research Labs, Eli Lilly and Company, Indianapolis, 46285, Indiana, USA
Jeffrey Witkin
Neuroscience and Pain Research Unit, Pfizer, Cambridge, 02139, Massachusetts, USA
Patricio O'Donnell
Boehringer Ingelheim Pharma, 55218, Ingelheim am Rhein, Germany
Ulrich Ebert
University of California San Diego, La Jolla, 92093, California, USA
Mark A. Geyer
Roche Pharma Research and Early Development, Neuroscience, Ophthalmology and Rare Diseases, Roche Innovation Center Basel, CH-4070, Basel, Switzerland
Eric Prinssen & Theresa Ballard
Edinburgh University, Old College, South Bridge, EH8 9YL, Edinburgh, UK
Malcolm Macleod

Authors

Anton Bespalov
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Steckler
View author publications
You can also search for this author in PubMed Google Scholar
Bruce Altevogt
View author publications
You can also search for this author in PubMed Google Scholar
Elena Koustova
View author publications
You can also search for this author in PubMed Google Scholar
Phil Skolnick
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Deaver
View author publications
You can also search for this author in PubMed Google Scholar
Mark J. Millan
View author publications
You can also search for this author in PubMed Google Scholar
Jesper F. Bastlund
View author publications
You can also search for this author in PubMed Google Scholar
Dario Doller
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Witkin
View author publications
You can also search for this author in PubMed Google Scholar
Paul Moser
View author publications
You can also search for this author in PubMed Google Scholar
Patricio O'Donnell
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Ebert
View author publications
You can also search for this author in PubMed Google Scholar
Mark A. Geyer
View author publications
You can also search for this author in PubMed Google Scholar
Eric Prinssen
View author publications
You can also search for this author in PubMed Google Scholar
Theresa Ballard
View author publications
You can also search for this author in PubMed Google Scholar
Malcolm Macleod
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anton Bespalov.

Ethics declarations

Competing interests

At the time the manuscript was prepared and submitted, A.B. was an employee and shareholder of AbbVie, D.Do. was an employee and shareholder of Lundbeck, and P.M. was an employee and shareholder of Pierre Fabre. T.S. is an employee and shareholder of Janssen. D.De. is an employee and shareholder of Alkermes. J.B. is an employee and shareholder of Lundbeck. J.W. is an employee and shareholder of Eli Lilly. M.M is an employee of the Institut de Recherche Servier. E.P. and T.B. are employees and shareholders of Roche. B.A. and P.O'D. are employees and shareholders of Pfizer. U.E. is an employee of Boehringer-Ingelheim. In the past three years, M.G. has received consulting compensation from Abbott, Dart, Lundbeck, Neurocrine, Omeros, Otsuka, and Sunovion, and he holds an equity interest in San Diego Instruments. He also has research grant support from the National Institute on Drug Abuse (NIDA), the National Institute of Mental Health (NIMH), and the US Veteran's Administration (VISN 22 Mental Illness Research, Education, and Clinical Center. M.M., E.K. and P.S. have no competing interests.

PowerPoint slides

PowerPoint slide for Fig. 1

Supplementary information

Supplementary information S1 (table)

Possible causes of unsuccessful outcomes of clinical trials with novel agents (PDF 140 kb)

Supplementary information S2 (table)

Observations that make reported pharmacological findings appear not robust (PDF 134 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bespalov, A., Steckler, T., Altevogt, B. et al. Failed trials for central nervous system disorders do not necessarily invalidate preclinical models and drug targets. Nat Rev Drug Discov 15, 516 (2016). https://doi.org/10.1038/nrd.2016.88

Download citation

Published: 17 June 2016
Issue Date: July 2016
DOI: https://doi.org/10.1038/nrd.2016.88

This article is cited by

From data deluge to publomics: How AI can transform animal research
- Benjamin V. Ineichen
- Marianna Rosso
- Malcolm R. Macleod
Lab Animal (2023)
A paradigm shift in translational psychiatry through rodent neuroethology
- Yair Shemesh
- Alon Chen
Molecular Psychiatry (2023)
Engineering a 3D functional human peripheral nerve in vitro using the Nerve-on-a-Chip platform
- Anup D. Sharma
- Laurie McCoy
- Michael J. Moore
Scientific Reports (2019)
Psychoactive drug exposure during breastfeeding: a critical need for preclinical behavioral testing
- Irving Zucker
Psychopharmacology (2018)

Failed trials for central nervous system disorders do not necessarily invalidate preclinical models and drug targets

Subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Related links

FURTHER INFORMATION

PowerPoint slides

PowerPoint slide for Fig. 1

Supplementary information

Supplementary information S1 (table)

Supplementary information S2 (table)

Rights and permissions

About this article

Cite this article

This article is cited by

From data deluge to publomics: How AI can transform animal research

A paradigm shift in translational psychiatry through rodent neuroethology

Engineering a 3D functional human peripheral nerve in vitro using the Nerve-on-a-Chip platform

Psychoactive drug exposure during breastfeeding: a critical need for preclinical behavioral testing

Search

Quick links

Subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Related links

FURTHER INFORMATION

PowerPoint slides

PowerPoint slide for Fig. 1

Supplementary information

Supplementary information S1 (table)

Supplementary information S2 (table)

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

From data deluge to publomics: How AI can transform animal research

A paradigm shift in translational psychiatry through rodent neuroethology

Engineering a 3D functional human peripheral nerve in vitro using the Nerve-on-a-Chip platform

Psychoactive drug exposure during breastfeeding: a critical need for preclinical behavioral testing

Search

Quick links