A combined pre-clinical meta-analysis and randomized confirmatory trial approach to improve data validity for therapeutic target validation

Kleikers, Pamela WM.; Hooijmans, Carlijn; Göb, Eva; Langhauser, Friederike; Rewell, Sarah SJ.; Radermacher, Kim; Ritskes-Hoitinga, Merel; Howells, David W.; Kleinschnitz, Christoph; HHW Schmidt, Harald

doi:10.1038/srep13428

Download PDF

Article
Open access
Published: 27 August 2015

A combined pre-clinical meta-analysis and randomized confirmatory trial approach to improve data validity for therapeutic target validation

Pamela WM. Kleikers¹^na1^na2,
Carlijn Hooijmans²^na1^na2,
Eva Göb³^na1^na2,
Friederike Langhauser³^na1^na2,
Sarah SJ. Rewell⁴^na1^na2,
Kim Radermacher¹^na1^na2,
Merel Ritskes-Hoitinga²^na1^na2,
David W. Howells⁴^na1^na2,
Christoph Kleinschnitz³^na1^na2 &
…
Harald HHW Schmidt¹^na1^na2

Scientific Reports volume 5, Article number: 13428 (2015) Cite this article

4038 Accesses
28 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Biomedical research suffers from a dramatically poor translational success. For example, in ischemic stroke, a condition with a high medical need, over a thousand experimental drug targets were unsuccessful. Here, we adopt methods from clinical research for a late-stage pre-clinical meta-analysis (MA) and randomized confirmatory trial (pRCT) approach. A profound body of literature suggests NOX2 to be a major therapeutic target in stroke. Systematic review and MA of all available NOX2^-/y studies revealed a positive publication bias and lack of statistical power to detect a relevant reduction in infarct size. A fully powered multi-center pRCT rejects NOX2 as a target to improve neurofunctional outcomes or achieve a translationally relevant infarct size reduction. Thus stringent statistical thresholds, reporting negative data and a MA-pRCT approach can ensure biomedical data validity and overcome risks of bias.

Prevalence and incidence of stroke in Latin America and the Caribbean: a systematic review and meta-analysis

Article Open access 26 April 2023

The role of early-phase trials and real-world evidence in drug development

Article 15 February 2024

Evidence and reporting standards in N-of-1 medical studies: a systematic review

Article Open access 18 July 2023

Introduction

Medical innovation is impacted by major quality and reproducibility issues of basic science data^1,2,3 that then impact on further clinical development^4,5. These issues are on top of a general notion that animal models may be less predictive than we thought^6,7. It has been shown that this is partly a consequence of suboptimal study design⁸ and analysis⁹ and poor reporting^10,11,12.

In stroke and its preclinical research, the situation is particularly alarming^13,14. Stroke is the third cause of death and number one reason for chronic disability. Despite this high medical need only one drug (recombinant tissue plasminogen activator) is registered to treat ischemic stroke; however, it has limited efficacy¹⁵ and over 30 contraindications, so that 85% of all stroke patients remain without any acute drug treatment. This single “successful” stroke drug development stands in sharp contrast to 1,026 failed experimental stroke targets^14,16, despite several roundtable recommendations to improve the quality of preclinical stroke research^17,18 including advanced study designs¹⁹. Consequently, industry scientists have left target discovery and drug development for stroke almost completely non-investigated. Recently, this has led to proposals that preclinical research needs to improve in quality by adopting elements of clinical research, including multi-center studies, randomization, blinding and a priori power calculation for relevant outcomes^17,20,21.

With respect to relevant pre-clinical outcomes, a shift is necessary from surrogates such as infarct size to measurement of neurological function; with respect to the therapeutic approach, patients need a shift from vascular re-canalization to adding on neuroprotection²². Oxidative stress or the occurrence of reactive oxygen species (ROS) in increased amounts, in unphysiological places or with unphysiological chemistry, has been suggested to play a major role in neurodegeneration upon ischemic stroke^23,24,25. Even under conditions of ischemia²⁶, ROS have both protective and deleterious effects²⁷, which explains why global anti-oxidant therapy has failed^28,29,30. A more promising approach is to target oxidative stress in a manner that leaves essential, physiological ROS formation untouched and inhibits only disease-relevant enzymatic sources^30,31,32. As such a source of ROS, NADPH oxidases (NOX) stand out as they represent the only enzyme family that has ROS formation as its only known function. Most prominently NOX2³³ has been suggested to be a prime target in stroke³⁴, whilst our own previous observations²⁴ profoundly disagreed. Nine other publications (using a total of 159 animals), however, investigated NOX2^-/y mice to conclude a major contribution of this enzyme to infarct size by up to 60%^{23,35,36,37,38,39,40,41,42}. As a point of concern, NOX2 is known to play a major role in innate immunity and its deletion causes severe immune deficiencies, in particular with common comorbidities such as diabetes mellitus⁴³. Because of these risks, the benefit of inhibiting NOX2 in stroke would need to be validated beyond doubt before entering a discovery program or clinical trials.

Adoption of large-scale collaborative research has been suggested as a means to improve the success rate and validity of such pre-clinical target validation. Clearly this is not feasible for every exploratory pre-clinical trial. However, in order to make a valid target statement, we here learn from the successes of clinical research and implement for the first time a recent suggestion²¹ to conduct pharmacological target validation research as pre-clinical randomized confirmatory trials (pRCTs). We also combine this with a systematic review (SR) and meta-analysis (MA) that is then re-run after the pRCT. The outcomes of the SR-MA-pRCT-MA approach have implications for stroke research and late-stage preclinical biomedical research in general.

Methods

Animals

NOX2 deficient mice (NOX2 KO, stock #002365) from C57Bl/6J background and corresponding age-matched C57Bl/6J control mice (stock #000664) with an SPF health status were purchased from Jackson Laboratories (Bar Harbor, ME, USA). In a previous study²⁴ we already tested young male (6–8 weeks, 20–25 grams) mice. We therefore extended our inclusion criteria by using also female (8–10 weeks, 18–21 grams) and older (18–20 weeks, 26–31 grams) mice. All experiments were approved by the local animal ethics committees of Maastricht (DEC 2011-106) and Würzburg (69/08). Animals were socially housed in IVC cages under controlled conditions (22 °C, 55–65% humidity, 12 h light-dark cycle, in type II IVC macrolon cages up to 3 mice in Würzburg, up to 4 males and 5 females in Maastricht; type III, up to 10 in Würzburg) and were allowed free access to water and standard laboratory chow (Maastricht, R/M-H, ssniff, Soest, Germany; Würzburg, Altromin standard diet, Altromin Spezialfutter GmbH & Co. KG, Lage, Germany).

Systematic review

The present review was based on published results of animal studies on the role of NADPH oxidase 1 and/or 2 in experimental ischemic stroke. PubMed and EMBASE were searched for original papers and conference abstracts concerning the effects of NADPH oxidase 1 and/or 2 on experimental stroke until October 23, 2013. The search strategy involved the following 3 search components: ischemic stroke, NADPH oxidase 1 and/or 2 and animals (for the complete search strategy, see Supplementary Table 1). For detecting animal studies, search filters developed by SYRCLE were used^44,45. No language restriction was used. Our search strategy identified 562 records in PubMed and 812 records in EMBASE. After removal of duplicates, a total of 1089 records were screened a first time based on title and abstract, excluding non-in-vivo papers, papers not using ischemic stroke and papers using an unspecific inhibitor of NOX or combining an inhibitor with other therapies. 25 articles were included for full text screening, of which 22 addressed NOX2 and 4 articles addressed NOX1. Two independent researchers (PWMK and SSJR) screened all titles and abstracts for the inclusion criteria. Studies were included if they 1) investigated the role of NADPH oxidase 1 and/or 2 on the infarct size and neurological scoring after experimental focal ischemic stroke using either genetic or specific pharmacological inhibition of these NOX isoforms; 2) were performed in animals in vivo; 3) resulted in an original full paper or conference abstract which presented unique data. Papers were excluded when unspecific NOX inhibitors such as apocynin^46,47 were used or when NOX inhibition was combined with other drugs/therapies. The in– and exclusion criteria and methods of analysis were specified in advance and documented in a protocol.

After full text assessment, thirteen papers for the NOX2 study (using in total 162 WT and 171 KO mice) and four papers for the NOX1 study (using in total 69 WT and 141 KO mice) and 16 rats (8 with and 8 without siRNA) were included (Supplementary Fig. 1) for qualitative and quantitative analyses^{23,24,36,37,38,39,40,41,42,48,49,50,51,52,53}.

Study characteristics

From the included studies, bibliographic data such as authors, year of publication, journal of publication and language were registered. Study characteristics concerning study design were extracted and summarized in Supplementary Table 2: species, strain (including genetic KO), gender, age and weight of the animals used; type of anesthesia; method and duration of ischemia; duration of reperfusion (timing of outcome measurements), type of inhibitor used; method of culling; method of infarct size measurement and neurological outcome assessment; (reason for) dropouts and mortality. All studies except one, used NOX KO mice as experimental animals, with one study using both genders, the rest only males. One study used rats treated with siRNA. All studies used the middle cerebral artery (MCA) occlusion model, sixteen studies occluded transiently allowing reperfusion afterwards and two studies occluded the MCA permanently (one study used both permanent and transient ischemia). The duration of the ischemia and the reperfusion varied greatly among studies (5 to 120 minutes). Three different methods of infarct size measurement and three different scoring systems for the neurological assessment were used. All retrieved data sets could be taken into account for the meta-analysis regarding infarct size. Neurological scoring was measured in seven out of thirteen NOX2 papers and three out of four NOX1 studies.

Assessment of methodological quality and risk of bias

Study quality and risk of bias in the included studies was independently assessed by two reviewers (PWMK and CH), using a predefined 9-point rating system (based on⁵⁴) (see Supplementary Table 3 and legend for details). Seven items were assessed to study risk of bias. A “yes” judgment indicates a low risk of bias; a “no” judgment indicates high risk of bias; the judgment was “unclear” if insufficient details had been reported to assess the risk of bias properly. The possible presence of selection bias (items 1, 2 and 4) detection bias (items 6, 7 and 8) performance bias (item 5) and attrition bias (items 9) were judged. Because of poor reporting of essential details in animal studies, we also included two reporting items: we assessed whether it had been reported if the experiments were randomized or blinded at any level (item 1 and 3). Disagreements were solved by discussion.

Meta-analysis

Infarct size and neurological outcome were included in the meta-analysis. Data were extracted if raw data or group averages, standard deviation (SD) or standard error (SE) and number of animals per group (n) were reported, or could be recalculated. All authors were contacted to contribute their original data to the meta-analysis. From two publications, no response from authors was obtained. For one publication, authors could not recollect their original data. In these cases, data were extracted from the text, or if presented only graphically, measured using a universal on-screen digitizer (Universal Desktop Ruler). CAMARADES consortium (http://www.dcn.ed.ac.uk/camarades//default.htm) suggests the use of normalized mean difference (NMD), which requires correcting for sham values. However, of all thirteen included NOX2 papers, only three reported the use of shams, but not for all groups and all outcome parameters^35,38,42. None of the others mention any sham animals. Therefore, we applied the standardized mean difference (SMD) which is also regularly used in clinical meta-analyses⁵⁵ for both the outcome measure ‘infarct size’ and ‘neurological score’. An SMD expresses the difference between the groups relative to the standard deviation. Calculation of mean differences^56,57,58 was not possible because of the heterogeneity in study designs (i.e. species) and the variety of scales used to determine the outcomes. The studies of NOX1 and NOX2 were analyzed separately. In case different measures of neurobehavioral outcomes were reported from the same cohort of animals we pooled the individual effect sizes and used this pooled estimate in the overall meta-analysis. Despite anticipated heterogeneity, the individual SMDs were pooled whenever possible (starting from two studies or more) to obtain an overall SMD and 95% confidence interval.

To account for anticipated heterogeneity, we used the random effects model in which some heterogeneity beyond sampling errors is allowed. In order to assess the robustness of our findings and in an attempt to explain observed study heterogeneity, we performed a sensitivity analysis and we investigated the effects of excluding the study with permanent ischemia. Meta-analysis was performed using Comprehensive Meta Analysis (CMA version 2.0). Forest plots were used to display the mean overall effect sizes. Data are expressed as SMD with 95% confidence intervals. For the outcome measure infarct size, we assessed the possibility of publication bias by visually evaluating the possible asymmetry in funnel plots⁵⁹. Using the trim and fill analyses an adjusted intervention effect was calculated⁶⁰.

pRCT

The preclinical Randomized, Confirmatory (and blinded) animal Trial was performed in parallel at Maastricht University (The Netherlands) and at the University of Würzburg (Germany). All animals studies were done in accordance with the approved national animal experimental guidelines and were approved by the local animal ethics committees. The objective of the study was to compare the extent of neurological damage after stroke in mice with or without NOX2 gene deletion. At each study site, surgery and follow-up measurements were performed blinded and animals were operated randomly according to an online randomization tool (www.randomizer.org). For a power of 80%, based on a minimal effect on infarct size of −40% and an SD of 30%, the required animal numbers were at least n = 10 per study arm. Transient middle cerebral artery occlusion (tMCAO) was performed with an intraluminal filament method as described by Kleinschnitz et al.²⁴. After 60 minutes of ischemia, the filament was withdrawn and reperfusion established. Twenty-four hours after induction of the ischemia, mice were scored for neurological and motor function. Infarct size was determined using 2, 3, 5-triphenyltetrazolium chloride (TTC) staining. For a more detailed description, see the Supplementary Information Methods.

Power analysis

We conducted a post hoc analysis of power in all earlier published studies in Nox2^-/y mice and stroke. Twelve studies and our own new data were analyzed for their power to detect a difference of 40% in infarct size. This threshold of 40% difference was based on post-hoc analysis of failed clinical trials where preclinical studies showed a 30–40% difference^29,61,62. Power was calculated using Russ Lenth’s power software, an alpha of 0.05, an effect of 0.4 and a pooled variance [(Lenth, R.V 2006-9, java Applets for Power and Sample Size [Computer Software], Retrieved 02-17-2014, from http://www.stat.uiowa.edu/~rlenth/Power)]. The pooled variance was calculated for 4 different groups according to ischemic and reperfusion time: a) long reperfusion time (72 h), b) short reperfusion time (24 h) after short ischemia (30 min), c) short reperfusion time (24 h) after medium ischemia (60-75 min) and d) short reperfusion time (24 h) after long ischemia (90-120 min). From each individual study, the coefficient of variation was calculated for both KO and WT values. Taking into account all these values, the pooled variance for each group was calculated according to formula (1) here with n the size of the group and CV the coefficient of variation (SD/Mean) of the group.

Statistics

Infarct volume data are expressed as mean ± SEM. Statistical differences between mean values were determined by Student’s two-tailed t test, using the GraphPad Prism 5.0 software package. Neurological scores were expressed as median. For discrete variables (behavior and motor function scores), the Mann-Whitney U-test was used. A value of P < 0.05 was considered to be statistically significant. Power calculations were performed by using Russ Lenth’s power and sample size software.

Results

Systematic review and meta-analysis suggest a role of NOX2 in stroke

In preparation for a risk of bias and statistical power analysis on pre-clinical animal studies on NOX2 in experimental stroke, we first conducted a systematic review⁶³ (for search strategy see Supplementary Table 1, Supplementary Fig. 1) followed by a meta-analysis (for study characteristics see Supplementary Table 2)^{23,24,35,36,37,38,39,40,41,42,53,64,65}. Heterogeneity was relatively high (I² = 73%) as expected given the large variations in study designs and methodological quality (Supplementary Table 2, Fig. 1). All studies on NOX2 in stroke were conducted in mice. One study created a permanent occlusion⁴⁸ and all studies but six^{23,37,39,41,53,65} studied neurological outcome 24 h after ischemia. Indeed, we found that the reported infarct size for NOX2 KO vs. NOX2 WT infarct size was significantly smaller (Fig. 2a). NOX2 KO mice also showed a small but significant decreased neurological score compared to WT mice (Fig. 2b), which implicates an improved neurological function (SMD −0.67 [−1,17; −0,16]; n = 9, p = 0.010). Calculation of mean differences^56,57,58 was not possible because of the heterogeneity in study designs (i.e. species) and the variety of scales used to determine the outcomes (see Methods).

Most studies on NOX2 in stroke were insufficiently powered

We then analyzed whether those studies that were included in the systematic review were sufficiently powered to detect an effect size of 40%, which is considered a minimum to be subsequently translationally relevant (see Methods). Since especially in small studies the observed variance is not a precise estimate of the true variance, we computed pooled variances. For four different groups (short ischemia time, medium ischemia time, long ischemia time and long reperfusion time), we found these calculated pooled variances to be 0.44, 0.60, 0.29 and 0.43, respectively. With these and the number of animals described in each study, we calculated the power to detect a difference of at least 40% (Table 1). Most notably, none of the studies reached a sufficient level of power (1-ß ≥ 0.08), with the exception of our own earlier study (1-ß = 0.95), which had shown no effect for NOX2²⁴.

Table 1 Power analysis of experimental stroke studies validation the role of NOX2 in Nox2-/y and WT mice.

Full size table

Poor reporting and risk of bias in studies on NOX2 in stroke

To assess the quality of the included papers, we conducted a risk of bias analysis. However, as a result of poor reporting in most animal studies, we also assessed a few reporting criteria (item 1 and 3). Figure 1 shows this risk of bias results for the NOX2 studies. Only a single study reported randomization. Also, in only 15% of the studies it was clear that at baseline groups were similar with respect to age, weight and supplier. None of the papers mentioned whether or not they housed the animals randomly and used a random order to assess the outcomes. Only four papers reported blinding at any level, for two of these four studies, both the treatment and assessment, for the other two only the outcome assessment was blinded. Overall, the risk of bias analysis showed that reporting essential details of the animal stroke studies is poor and there seems to be a substantial risk of bias.

Publication bias leads to an overestimation of the NOX2 effect size in stroke

In addition to insufficient power, publication bias has been shown to influence preclinical study results. To identify a possible publication bias for the outcome ‘infarct size’, we created a funnel plot^10,60. Figure 3 shows that it is likely that several small studies reporting larger infarcts in NOX2 KO mice are missing from the literature. This may indicate an overestimation of the overall effect (SMD −0,66 [−1,20; −0,14]) as calculated in the meta- analysis and questions the significance of NOX2. Moreover, reporting of essential details of the animal stroke studies was poor and there was a substantial risk of bias (Fig. 1). Only 31% of the studies reported on blinding of the outcome measures. In addition, just 15% of the studies described whether they randomized the allocation of the animals to the various groups. None of the papers described the method for randomization. Blinding and randomization are key quality measures of experimental design of intervention studies and are known to cause bias⁵⁴.

A randomized, confirmatory, blinded and fully powered pre-clinical trial excludes a relevant role for NOX2 in stroke

The literature on NOX2 in stroke fulfilled all criteria to justify conducting the first pre-clinical, randomized, confirmatory trial, powered for a minimally relevant effect of 40% reduction of infarct volume, with the aim to provide reliable target validation data. A priori sample size calculation showed that at least 10 animals would be necessary in each study arm, which we exceeded with n = 41 WT and n = 51 NOX2 KO mice. Importantly, 24 h after transient middle cerebral artery occlusion (tMCAO), neither infarct distribution (Fig. 4A), infarct size (Fig. 4B), nor neurofunctional parameters such as the Bederson score (Fig. 4C) or the Grip test (Fig. 4D) were significantly different between NOX2 KO (n = 51) and WT (n = 41) mice. If one was to examine the possible difference in infarct size of −10% in NOX2 (for which our study was not powered), a much larger study with n = 202 animals per study arm would be needed (based on a power of 1 − β = 0.08 and an α of 0, 05, a standard deviation of 30%, an effect of 10% and an average reported acute mortality rate of 30%). The use of 404 animals to clarify such a small and translationally insignificant effect would be ethically non-justifiable.

With respect to stroke and NOX, gender-specific effects have been reported⁶⁶. In our study, a sub-group analysis showed in male NOX2 KO mice a larger than average infarct size reduction of 25% (see Fig. 4B, second data set), which reached significance (p = 0.04) but was underpowered (1 − β = 0.68). In none of the subgroups did we find any translationally relevant improvements, neither in neurological behavior or motor function (see Fig. 4C,D).

Our revised meta-analysis suggests an even lower effect size, no neurological improvement and persisting publication bias

To test whether these new findings would affect our above meta-analysis (see Fig. 2, left data set) we re-ran the extended data set (see Fig. 2, right data set). Still, NOX2 appeared to significantly decrease the infarct size (Fig. 2a SMD −1.15 [−1.67; −0.63]; n = 20; p = 0.000). However, the effect was now smaller and even after including our new study results, a publication bias still appeared to overestimate our overall effect estimation (Fig. 5). Importantly and independently of infarct size, the effect of NOX2 KO on the neurological score was no longer significant (SMD −0.37 [−0.79; 0.06]; n = 12; p = 0.094). Thus even if a small effect on the surrogate infarct size would ever be shown with sufficient power, we can predict it will not translate into any significant neurological outcome improvement. Clearly, this conclusion is a definitive counter-argument against any further clinical development of this target.

Discussion

Here we provide a feasible solution to a major problem of current biomedical research, the irreproducibility of pre-clinical results leading subsequently to translational failures at the path to the clinic. We show that adopting our SR-MA-pRCT-MA approach to a more large-scale, collaborative way of research will improve research quality and enhance the validity of pre-clinical decision-making on therapeutic targets. Undoubtedly, it also challenges current funding and career reward systems which value rather individual than team achievements. We chose to examine stroke as an example as this is an area of extreme medical need with probably one of the lowest translational success rates in biomedicine. However, we feel confident that what we show applies most likely to many other fields and many other claimed pharmacological targets.

A profound body of pre-clinical literature seemed to suggest that NOX2 is a therapeutic target in stroke whilst some data had argued clearly against this. We hypothesized that this may be a case of insufficient pre-clinical validity on either side, which would qualify to conduct a pRCT. After contacting every group that had published in this field on this pharmacological target we conducted a systematic review. Indeed our subsequent meta-analysis suggested that a small effect of NOX2 on infarct size and neurological score might exist but that there was also a significant publication bias. In the present case, complete reporting of data would have shifted the true effect size towards a lower or no role of NOX2 in stroke. Reporting of essential experimental parameters was also rather poor. Failure in reporting these details is known to skew the interpretation of study results and subsequent translation into clinical benefits. Another criterion that was fulfilled in all studies that reported significant effects was the lack of statistical power to detect a relevant infarct size reduction of 40%⁶², which is a soft target as this will still not ensure clinical benefit^67,68,69,70. The true threshold is likely to be even higher, but in the absence of any successful translation of pre-clinical stroke research in the past 20 years¹⁴, this cannot be determined²¹.

Having established that the literature on NOX2 in stroke fulfilled all typical criteria for insufficient pre-clinical target validation (publication bias, poor methodological quality and lack of power), we here present the first SR-MA-pRCT-MA approach to validate a single intervention or, in this case, target validation. Based on its outcome data, a NOX2 KO does not improve neurological outcome and has an effect on infarct size that was too small to be determined by a trial powered for 40% reduction. Conducting another pRCT using >400 animals would be required to determine whether indeed a 10% infarct size reduction occurs; this however would be considered translationally irrelevant (see above examples and threshold) and thus unethical.

Moving to the SR-MA-pRCT-MA approach as the new quality standard, at least for pharmacological target validation, will in all likelihood exceed individual laboratories’ capacity. Thus conducting such studies in a more collaborative manner, meaning multi-center trials, seems to be the logical way forward. In fact, for other targets, including very late antigen 4 (VLA-4)⁷¹ and transient receptor potential cation channel, subfamily M, member 2 (TRPM2; PMID: 25236871)⁷², such trials are currently under way already. A website has been launched (www.p3pt.de) as an invitation to the community to provide their position on pRCTs and potential suggestions how those should be organized and performed²¹.

Importantly, implications reach further. The studies that we have analyzed were conducted in Korea, Germany, USA, Australia and the Netherlands. Animal ethics regulations differ, but at least for the European Community it can be said that in recent years there has been a massive push towards ‘The Three Rs’, reduction, replacement and refinement⁷³. However, reducing the number of animals below the limits of statistical power will lead to underpowered and in the end meaningless pre-clinical data sets. Whilst we strongly support the goal to achieve pre-clinical evidence with the least amount of animal sacrifice, the use and reporting of a power calculation is essential to the proper conduct of confirmatory animal studies. Other study formats, e.g. in earlier pre-clinical stages are important as well; however, they should describe themselves as exploratory and not make statements on target validity.

Finally, funding agencies and journals may need to adapt. Funding and career incentives typically reward individuals, whereas pRCTs require team approaches. Journals will need to equally accept for publication well-conducted (e.g. statistically powered) negative findings, so that the literature is truly representative of the science. To ensure that such findings are accessible, even if not submitted for publication, a rather far-reaching but effective measure would be to require pre-registration of animal experiments, whether they were conducted as pRCTs or just pilots, similar to requirements for clinical trials. Registration would be a pre-requisite for ethics approval and include the obligation to subsequently enter the data into a publicly available database. This will reduce the efforts to conduct MAs as an interim surrogate for pRCTs. However, conducting fully powered pRCTs, including detailed reporting and subsequent MA is clearly the way forward.

Additional Information

How to cite this article: Kleikers, P.W.M. et al. A combined pre-clinical meta-analysis and randomized confirmatory trial approach to improve data validity for therapeutic target validation. Sci. Rep. 5, 13428; doi: 10.1038/srep13428 (2015).

References

Prinz, F., Schlange, T. & Asadullah, K. Believe it or not: how much can we rely on published data on potential drug targets? Nat Rev Drug Discov 10, 712–712 (2011).
Article CAS Google Scholar
Anonymous. Facilitating reproducibility. Nat. Chem. Biol. 9, 345–345 (2013).
Ioannidis, J. P. A. Why most published research findings are false. PLoS Med. 2, e124 (2005).
Article Google Scholar
Mullard, A. Reliability of ‘new drug target’ claims called into question. Nat Rev Drug Discov 10, 643–644 (2011).
Article CAS Google Scholar
Arrowsmith, J. Trial watch: Phase II failures: 2008–2010. Nat Rev Drug Discov 10, 328–329 (2011).
Article CAS Google Scholar
Seok, J. et al. Genomic responses in mouse models poorly mimic human inflammatory diseases. Proc Natl Acad Sci USA 110, 3507–3512 (2013).
Article CAS ADS Google Scholar
van der Worp, H. B. et al. Can animal models of disease reliably inform human studies? PLoS Med. 7, e1000245 (2010).
Article Google Scholar
Dirnagl, U. & Fisher, M. International, multicenter randomized preclinical trials in translational stroke research: it’s time to act. J Cereb Blood Flow Metab 32, 933–935 (2012).
Article Google Scholar
Sterne, J. A. & Davey Smith, G. Sifting the evidence-what’s wrong with significance tests? BMJ 322, 226–231 (2001).
Article CAS Google Scholar
Sena, E. S., van der Worp, H. B., Bath, P. M. W., Howells, D. W. & Macleod, M. R. Publication bias in reports of animal stroke studies leads to major overstatement of efficacy. PLoS Biol. 8, e1000344 (2010).
Article Google Scholar
Kilkenny, C. et al. Survey of the quality of experimental design, statistical analysis and reporting of research using animals. PLoS ONE 4, e7824 (2009).
Article ADS Google Scholar
Eisen, J. A., Ganley, E. & MacCallum, C. J. Open science and reporting animal studies: who’s accountable? PLoS Biol. 12, e1001757 (2014).
Article Google Scholar
Philip, M., Benatar, M., Fisher, M. & Savitz, S. I. Methodological quality of animal studies of neuroprotective agents currently in phase II/III acute ischemic stroke trials. Stroke 40, 577–581 (2009).
Article CAS Google Scholar
O’Collins, V. E. et al. 1,026 experimental treatments in acute stroke. Ann. Neurol. 59, 467–477 (2006).
Article Google Scholar
Maiser, S. J. et al. Intravenous recombinant tissue plasminogen activator administered after 3 h following onset of ischaemic stroke: a metaanalysis. Int J Stroke 6, 25–32 (2011).
Article Google Scholar
Radermacher, K. A. et al. The 1027th target candidate in stroke: Will NADPH oxidase hold up? Exp Transl Stroke Med 4, 11 (2012).
Article CAS Google Scholar
Fisher, M. et al. Update of the stroke therapy academic industry roundtable preclinical recommendations. Stroke 40, 2244–2250 (2009).
Article Google Scholar
Stroke Therapy Academic Industry Roundtable (STAIR). Recommendations for standards regarding preclinical neuroprotective and restorative drug development. Stroke 30, 2752–2758 (1999).
O’Collins, V. E. et al. Preclinical drug evaluation for combination therapy in acute stroke using systematic review, meta-analysis and subsequent experimental testing. J Cereb Blood Flow Metab 31, 962–975 (2011).
Article Google Scholar
Dirnagl, U. Bench to bedside: the quest for quality in experimental stroke research. J Cereb Blood Flow Metab 26, 1465–1478 (2006).
Article Google Scholar
Boltze, J., Ayata, C., Wagner, D.-C. & Plesnila, N. Preclinical phase III trials in translational stroke research: call for collective design of framework and guidelines. Stroke 45, 357–357 (2014).
Article Google Scholar
Tymianski, M. Novel approaches to neuroprotection trials in acute ischemic stroke. Stroke 44, 2942–2950 (2013).
Article Google Scholar
Walder, C. E. et al. Ischemic stroke injury is reduced in mice lacking a functional NADPH oxidase. Stroke 28, 2252–2258 (1997).
Article CAS Google Scholar
Kleinschnitz, C. et al. Post-stroke inhibition of induced NADPH oxidase type 4 prevents oxidative stress and neurodegeneration. PLoS Biol. 8, (2010). 10.1371/journal.pbio.1000479.
Chen, H. et al. Oxidative stress in ischemic brain damage: mechanisms of cell death and potential molecular targets for neuroprotection. Antioxid Redox Signal 14, 1505–1517 (2011).
Article CAS Google Scholar
Schroder, K. et al. Nox4 is a protective reactive oxygen species generating vascular NADPH oxidase. Circ Res 110, 1217–1225 (2012).
Article Google Scholar
Schmidt, H. H. H. W., Wingler, K., Kleinschnitz, C. & Dusting, G. NOX4 is a Janus-faced reactive oxygen species generating NADPH oxidase. Circ Res 111, e15–6– author reply e17–8 (2012).
Article Google Scholar
Shuaib, A. et al. NXY-059 for the treatment of acute ischemic stroke. N. Engl. J. Med. 357, 562–571 (2007).
Article CAS Google Scholar
Macleod, M. R. et al. Evidence for the efficacy of NXY-059 in experimental focal cerebral ischaemia is confounded by study quality. Stroke 39, 2824–2829 (2008).
Article Google Scholar
Radermacher, K. A. et al. Neuroprotection after stroke by targeting NOX4 as a source of oxidative stress. Antioxidants and Redox Signaling 18, 1418–1427 (2013).
Article CAS Google Scholar
Sedeek, M. et al. Renoprotective effects of a novel Nox1/4 inhibitor in a mouse model of type 2 diabetes. Clin. Sci. (2012). 10.1042/CS20120330.
Di Marco, E. et al. Pharmacological inhibition of NOX reduces atherosclerotic lesions, vascular ROS and immune-inflammatory responses in diabetic Apoe(-/-) mice. Diabetologia 57, 633–642 (2014).
Article CAS Google Scholar
Weissmann, N. et al. Activation of TRPC6 channels is essential for lung ischaemia-reperfusion induced oedema in mice. Nat Commun 3, 649 (2012).
Article ADS Google Scholar
Drummond, G. R., Selemidis, S., Griendling, K. K. & Sobey, C. G. Combating oxidative stress in vascular disease: NADPH oxidases as therapeutic targets. Nat Rev Drug Discov 10, 453–471 (2011).
Article CAS Google Scholar
De Silva, T. M., Brait, V. H., Drummond, G. R., Sobey, C. G. & Miller, A. A. Nox2 oxidase activity accounts for the oxidative stress and vasomotor dysfunction in mouse cerebral arteries following ischemic stroke. PLoS ONE 6, e28393 (2011).
Article ADS Google Scholar
Tang, X. N., Zheng, Z., Giffard, R. G. & Yenari, M. A. Significance of marrow-derived nicotinamide adenine dinucleotide phosphate oxidase in experimental ischemic stroke. Ann. Neurol. 70, 606–615 (2011).
Article CAS Google Scholar
Kahles, T. et al. NADPH oxidase plays a central role in blood-brain barrier damage in experimental stroke. Stroke 38, 3000–3006 (2007).
Article CAS Google Scholar
Brait, V. H. et al. Mechanisms contributing to cerebral infarct size after stroke: gender, reperfusion, T lymphocytes and Nox2-derived superoxide. J Cereb Blood Flow Metab 30, 1306–1317 (2010).
Article CAS Google Scholar
Kunz, A., Anrather, J., Zhou, P., Orio, M. & Iadecola, C. Cyclooxygenase-2 does not contribute to postischemic production of reactive oxygen species. J Cereb Blood Flow Metab 27, 545–551 (2007).
Article CAS Google Scholar
Chen, H., Song, Y. S. & Chan, P. H. Inhibition of NADPH oxidase is neuroprotective after ischemia-reperfusion. J Cereb Blood Flow Metab 29, 1262–1272 (2009).
Article CAS Google Scholar
Chen, H., Kim, G. S., Okami, N., Narasimhan, P. & Chan, P. H. NADPH oxidase is involved in post-ischemic brain inflammation. Neurobiol Dis 42, 341–348 (2011).
Article CAS Google Scholar
Wang, Z. et al. NOX2 deficiency ameliorates cerebral injury through reduction of complexin II-mediated glutamate excitotoxicity in experimental stroke. Free Radic Biol Med 65, 942–951 (2013).
Article CAS Google Scholar
Gray, S. P. et al. NADPH oxidase 1 plays a key role in diabetes mellitus-accelerated atherosclerosis. Circulation 127, 1888–1902 (2013).
Article CAS Google Scholar
Hooijmans, C. R., Tillema, A., Leenaars, M. & Ritskes-Hoitinga, M. Enhancing search efficiency by means of a search filter for finding all studies on animal experimentation in PubMed. Lab. Anim. 44, 170–175 (2010).
Article CAS Google Scholar
de Vries, R. B. M., Hooijmans, C. R., Tillema, A., Leenaars, M. & Ritskes-Hoitinga, M. A search filter for increasing the retrieval of animal studies in Embase. Lab. Anim. 45, 268–270 (2011).
Article CAS Google Scholar
Wind, S. et al. Comparative pharmacology of chemically distinct NADPH oxidase inhibitors. Br J Pharmacol 161, 885–898 (2010).
Article CAS Google Scholar
Williams, H. C. & Griendling, K. K. NADPH oxidase inhibitors: new antihypertensive agents? J. Cardiovasc. Pharmacol. 50, 9–16 (2007).
Article CAS Google Scholar
Kim, H. A. et al. Brain infarct volume after permanent focal ischemia is not dependent on Nox2 expression. Brain Res. 1483, 105–111 (2012).
Article CAS Google Scholar
Liu, W., Chen, Q., Liu, J. & Liu, K. J. Normobaric hyperoxia protects the blood brain barrier through inhibiting Nox2 containing NADPH oxidase in ischemic stroke. Med Gas Res 1, 22 (2011).
Article CAS Google Scholar
Jackman, K. A., Miller, A. A., Drummond, G. R. & Sobey, C. G. Importance of NOX1 for angiotensin II-induced cerebrovascular superoxide production and cortical infarct volume following ischemic stroke. Brain Res. 1286, 215–220 (2009).
Article CAS Google Scholar
Kahles, T. et al. NADPH oxidase Nox1 contributes to ischemic injury in experimental stroke in mice. Neurobiol Dis 40, 185–192 (2010).
Article CAS Google Scholar
Choi, D.-H. et al. in Abstracts of the World Stroke Congress. October 13–16, 2010. Seoul, Republic of Korea (2010).
McCann, S., Dusting, G. & Roulston, C. in Abstracts of the th International Symposium on Neuroprotection and Neurorepair. Rostock, Germany. October 1–4, 2010 (2010).
Hooijmans, C. R. et al. SYRCLE’s risk of bias tool for animal studies. BMC Med Res Methodol 14, 43 (2014).
Article Google Scholar
Higgins JPT, Green S (editors) Cochrane Handbook for systematic reviews of interventions. Version 5.1.0 [updated March 2011]. Chapter 9, section 9.2.3.2 The Cochrane Collaboration, 2011. Available from www.cochrane-handbook.org (Date of access: 11-05-2015).
Vesterinen, H. M. et al. Meta-analysis of data from animal studies: a practical guide. J. Neurosci. Methods 221, 92–102 (2014).
Article CAS Google Scholar
Tsilidis, K. K. et al. Evaluation of excess significance bias in animal studies of neurological diseases. PLoS Biol. 11, e1001609 (2013).
Article CAS Google Scholar
Sena, E. S., Currie, G. L., McCann, S. K., Macleod, M. R. & Howells, D. W. Systematic reviews and meta-analysis of preclinical studies: why perform them and how to appraise them critically. J Cereb Blood Flow Metab 34, 737–742 (2014).
Article Google Scholar
Egger, M., Davey Smith, G., Schneider, M. & Minder, C. Bias in meta-analysis detected by a simple, graphical test. BMJ 315, 629–634 (1997).
Article CAS Google Scholar
Duval, S. & Tweedie, R. Trim and fill: A simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56, 455–463 (2000).
Article CAS Google Scholar
Crossley, N. A. et al. Empirical evidence of bias in the design of experimental stroke studies: a metaepidemiologic approach. Stroke 39, 929–934 (2008).
Article Google Scholar
Minnerup, J. et al. Meta-analysis of the efficacy of granulocyte-colony stimulating factor in animal models of focal cerebral ischemia. Stroke 39, 1855–1861 (2008).
Article CAS Google Scholar
Hooijmans, C. R. & Ritskes-Hoitinga, M. Progress in using systematic reviews of animal studies to improve translational research. PLoS Med. 10, e1001482 (2013).
Article CAS Google Scholar
Kim, H. A. et al. Brain infarct volume after permanent focal ischemia is not dependent on Nox2 expression. Brain Res. 1483, 105–111 (2012).
Article CAS Google Scholar
Liu, W. et al. Normobaric hyperoxia inhibits NADPH oxidase-mediated matrix metalloproteinase-9 induction in cerebral microvessels in experimental stroke. J. Neurochem. 107, 1196–1205 (2008).
Article CAS Google Scholar
Miller, A. A., Drummond, G. R., Mast, A. E., Schmidt, H. H. & Sobey, C. G. Effect of gender on NADPH-oxidase activity, expression and function in the cerebral circulation: role of estrogen. Stroke 38, 2142–2149 (2007).
Article CAS ADS Google Scholar
Ringelstein, E. B. et al. Granulocyte colony-stimulating factor in patients with acute ischemic stroke: results of the AX200 for Ischemic Stroke trial. Stroke 44, 2681–2687 (2013).
Article CAS Google Scholar
Wagner, D.-C. et al. Allometric dose retranslation unveiled substantial immunological side effects of granulocyte colony-stimulating factor after stroke. Stroke 45, 623–626 (2014).
Article CAS Google Scholar
Pfeffer, M. A. et al. A trial of darbepoetin alfa in type 2 diabetes and chronic kidney disease. N. Engl. J. Med. 361, 2019–2032 (2009).
Article Google Scholar
Skali, H. et al. Stroke in patients with type 2 diabetes mellitus, chronic kidney disease and anemia treated with Darbepoetin Alfa: the trial to reduce cardiovascular events with Aranesp therapy (TREAT) experience. Circulation 124, 2903–2908 (2011).
Article CAS Google Scholar
Schäbitz, W.-R. & Dirnagl, U. Are we ready to translate T-cell transmigration in stroke? Stroke 45, 1610–1611 (2014).
Article Google Scholar
Gelderblom, M. et al. Transient receptor potential melastatin subfamily member 2 cation channel regulates detrimental immune cell invasion in ischemic stroke. Stroke 11, 3395–3402 (2014).
Article Google Scholar
Balls, M. et al. The three Rs: the way forward: the report and recommendations of ECVAM Workshop 11. Altern Lab Anim 23, 838–866 (1995).
PubMed Google Scholar

Download references

Acknowledgements

We are greatly thankful to all colleagues and fellow-researchers who were willing to share their data sets with us. We also thank Dr. Emily Sena from CAMARADES for most helpful discussions. We also gratefully acknowledge Dr. de Haan from Radboud University Nijmegen for statistical advice and Helma van Essen and Jacques Debets, Maastricht University, for their expert technical assistance in the in vivo experiments. Financial disclosure: This work was supported by a Marie-Curie International Reintegration Grant (FP7-RG 268235 - Radical Pharmacology), an ERC Advanced Investigator Grant (294683 - RADMED), by the Nederlandse Hersenstichting, by the NHMRC Australia (awarded to HHHWS) and by the Deutsche Forschungsgemeinschaft (awarded to CK).

Author information

Kleikers Pamela WM., Hooijmans Carlijn and Göb Eva contributed equally to this work.
Kleinschnitz Christoph and HHW Schmidt Harald jointly supervised this work.

Authors and Affiliations

Department of Pharmacology, CARIM, Faculty of Health, Medicine and Life Sciences, Maastricht University, The Netherlands
Pamela WM. Kleikers, Kim Radermacher & Harald HHW Schmidt
SYRCLE at Central Animal Laboratory, Radboud University Medical Centre, Nijmegen, The Netherlands
Carlijn Hooijmans & Merel Ritskes-Hoitinga
Neurologische Klinik und Poliklinik der Universitätsklinik Würzburg, Würzburg, Germany
Eva Göb, Friederike Langhauser & Christoph Kleinschnitz
Florey Institute of Neuroscience and Mental Health, Austin Health, Melbourne, Victoria, Australia
Sarah SJ. Rewell & David W. Howells

Authors

Pamela WM. Kleikers
View author publications
You can also search for this author in PubMed Google Scholar
Carlijn Hooijmans
View author publications
You can also search for this author in PubMed Google Scholar
Eva Göb
View author publications
You can also search for this author in PubMed Google Scholar
Friederike Langhauser
View author publications
You can also search for this author in PubMed Google Scholar
Sarah SJ. Rewell
View author publications
You can also search for this author in PubMed Google Scholar
Kim Radermacher
View author publications
You can also search for this author in PubMed Google Scholar
Merel Ritskes-Hoitinga
View author publications
You can also search for this author in PubMed Google Scholar
David W. Howells
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Kleinschnitz
View author publications
You can also search for this author in PubMed Google Scholar
Harald HHW Schmidt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.H., M.R.H., C.K. and H.H.H.W.S. designed research, P.W.M.K., C.H., E.G. and F.L. performed research, C.H. and D.W.H. contributed new reagents or analytic tools, P.W.M.K., C.H., E.G., F.L., S.S.J.R., K.R. and H.H.H.W.S. analyzed data, P.W.M.K., C.H., D.W.H., C.K. and H.H.H.W.S. wrote the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Kleikers, P., Hooijmans, C., Göb, E. et al. A combined pre-clinical meta-analysis and randomized confirmatory trial approach to improve data validity for therapeutic target validation. Sci Rep 5, 13428 (2015). https://doi.org/10.1038/srep13428

Download citation

Received: 18 February 2015
Accepted: 27 July 2015
Published: 27 August 2015
DOI: https://doi.org/10.1038/srep13428

This article is cited by

Preclinical Stroke Research and Translational Failure: A Bird’s Eye View on Preventable Variables
- Devendra Singh
- Himika Wasan
- K. H. Reeta
Cellular and Molecular Neurobiology (2022)
A diseasome cluster-based drug repurposing of soluble guanylate cyclase activators from smooth muscle relaxation to direct neuroprotection
- Friederike Langhauser
- Ana I. Casas
- Harald H. H. W. Schmidt
npj Systems Biology and Applications (2018)
Regulatory T Cells in Post-stroke Immune Homeostasis
- Arthur Liesz
- Christoph Kleinschnitz
Translational Stroke Research (2016)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.