Value of preclinical systematic reviews and meta-analyses in pediatric research

Romantsik, Olga; Bank, Matthias; Menon, Julia M. L.; Malhotra, Atul; Bruschettini, Matteo

doi:10.1038/s41390-024-03197-1

Download PDF

Review Article
Open access
Published: 13 April 2024

Value of preclinical systematic reviews and meta-analyses in pediatric research

Olga Romantsik ORCID: orcid.org/0000-0003-1414-9386¹,
Matthias Bank²,
Julia M. L. Menon³,
Atul Malhotra^4,5,6^na1 &
…
Matteo Bruschettini¹^na1

Pediatric Research (2024)Cite this article

257 Accesses
7 Altmetric
Metrics details

Abstract

Similar to systematic reviews (SRs) in clinical fields, preclinical SRs address a specific research area, furnishing information on current knowledge, possible gaps, and potential methodological flaws of study design, conduct, and report. One of the main goals of preclinical SRs is to identify aspiring treatment strategies and evaluate if currently available data is solid enough to translate to clinical trials or highlight the gaps, thus justifying the need for new studies. It is imperative to rigorously follow the methodological standards that are widely available. These include registration of the protocol and adherence to guidelines for assessing the risk of bias, study quality, and certainty of evidence. A special consideration should be made for pediatric SRs, clinical and preclinical, due to the unique characteristics of this age group. These include rationale for intervention and comparison of primary and secondary outcomes. Outcomes measured should acknowledge age-related physiological changes and maturational processes of different organ systems. It is crucial to choose the age of the animals appropriately and its possible correspondence for specific pediatric age groups. The findings of well-conducted SRs of preclinical studies have the potential to provide a reliable evidence synthesis to guide the design of future preclinical and clinical studies.

Impact

This narrative review highlights the importance of rigorous design, conduct and reporting of preclinical primary studies and systematic reviews.
A special consideration should be made for pediatric systematic reviews of preclinical studies, due to the unique characteristics of this age group.

A methodological review with meta-epidemiological analysis of preclinical systematic reviews with meta-analyses

Article Open access 21 November 2022

How is Big Data reshaping preclinical aging research?

Article 28 November 2023

The “completely randomised” and the “randomised block” are the only experimental designs suitable for widespread use in pre-clinical research

Article Open access 16 October 2020

Introduction

It is extremely challenging to keep up to date with medical literature due to the high publication rate and data overload. More than 4.6 million papers are available on PubMed for different medical conditions in children (birth - 18 years). The rate of publications is increasing exponentially, starting with low numbers of publications for many decades and reaching more than 130000 publications in 2022 (Fig. 1). Electronic scoping searches of PubMed were performed on October 17, 2023 (Supplementary material). This trend presents in many medical branches^1,2 implies a great challenge for healthcare professionals to keep updated with the progress in their specific fields. Consequently, one relies on reviews (both narrative and systematic) to get an overview of the specific topic.

**Fig. 1: Number of studies published in pediatrics and neonatology over the years (range 1938–2022).**

In this paper, we focus on preclinical systematic reviews (SRs) which pertain to pediatric medicine. We start with an overview of reviews including clinical SRs, unique preclinical SRs features, and those that apply especially to pediatric medicine. Emphasis on the quality of SRs is reiterated.

SRs and meta-analyses systematically scrutinize available literature on the topic and evaluate the study limitations of the reported data, following a detailed protocol. SRs are based on robust methodology, starting with a well-defined review question, which steers the criteria for a comprehensive literature search and the precise inclusion and exclusion criteria.³ Thus, the reader gets the opportunity to reproduce the search and understand the selection of the papers. In addition, two people independently conduct the screening, extract relevant outcome data, and evaluate the risk of bias and certainty of the evidence, thus minimizing the risk of introducing an error in the process. The meta-analysis, where possible, allows to calculate the effect size for each outcome. Furthermore, the SR points out the possible presence of publication bias. SRs even report the characteristics of ongoing studies on the specific topic. By doing so, it is easier for readers to follow up on the latest developments in the field. With this premise in mind, SRs and meta-analyses are valuable tools, providing a systematic assessment of the specific questions, highlighting the knowledge gaps, and addressing independently the quality of published science, thereby raising awareness of research waste caused by studies of mediocre quality.

Despite this exponential growth of SRs in clinical medicine, they are less common in preclinical medicine (in vitro studies, animal studies, and ex-vivo studies).⁴ The first SR of animal studies was published by Omarini et al. in 1992.⁵ Their SR regarded the placental perfusion in seven different animal species either in situ or in vitro. Freedman et al. published the first meta-analysis of animal studies on the effects of dietary fat consumption on mammary tumor development in 1994.⁶ Since then, approximately 3000 SRs in animal studies have been published, and approximately one-third of those included a meta-analysis.⁴ Hunniford et al. reported in their epidemiological study conducted in 2015–2018 that approximately 54% of all preclinical SRs focused on pharmacological interventions and 46% on non-pharmacological interventions, mainly cell therapies, and surgery.⁷

Is there a need for preclinical SRs?

Similar to SRs in the clinical field, preclinical SRs address a very specific research area and describe the current knowledge, possible gaps, and flaws of the study design, conduct, and reporting of each included study.^8,9 By analyzing available data, SRs may prevent the duplication of experiments and thereby reduce research waste and unethical use of animals. Since launching the methodology for preclinical SRs, Radboud University in the Netherlands could reduce the use of research animals by 35% at their institution, and by 15% in the whole country.^10,11 Additionally, they raised awareness of possible methodological flaws and biases, ideally resulting in improved study design, conduct, and report. Menon et al. demonstrated in their mixed case study that the conduct of preclinical SRs changed the mentality of the surveyed scientists on the quality of animal research, resulting in higher quality and transparency of the following work of the same preclinical researchers. It led to a desire to diffuse this knowledge within their research teams and advocate for the broader education of the scientists.²

Given the heterogeneity of preclinical research and multiple animal species used to model different health-related conditions, SRs may be extremely valuable in choosing the most appropriate animal model^12,13 and outcome measures. SRs anticipate the information if the obtained evidence is sufficient to move the research question into the clinic or illuminate the gaps thereof justifying the need for new studies.^10,14,15,16 One could argue that preclinical SRs may act as the bridge between the preclinical and clinical scientific world.

Quality assessment in clinical SRs

SRs are often valuable evidence sources for clinical guidelines, drug regulation processes, and decision-making tasks for physicians and policymakers, which require high quality.^9,17 This is why SRs must follow rigorous and detailed guidelines for the summarized evidence to be reproducible and trustworthy. In the clinical field of healthcare, two international organizations Cochrane (formerly Cochrane Collaboration; https://www.cochrane.org) and JBI (formerly Joanna Briggs Institute, https://jbi.global/) provide the criteria and methodological standards for assessment of current evidence, periodically updating their methods based on the reflection of the new information and ever-changing needs. Importantly, they use the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) system to assess the certainty of the presented evidence.¹⁸ The GRADE working group, which consists of various healthcare professionals, methodologists, guidelines developers, healthcare researchers, and economists, developed and implemented “a common, transparent and sensible approach to grade the quality of evidence and strengths of recommendations in healthcare”.¹⁸ Well-defined protocols and checklists have to be followed, involving a multi-step, peer-review process.^19,20

Despite being the largest database for SRs in clinical medicine, the Cochrane Database of Systematic Reviews accounts only for 7% of published SRs.²¹ To date, the World Health Organization demands Cochrane standards to summarize the evidence for the development of their clinical practice guidelines.²² It has been reported that the quality of Cochrane reviews is superior to non-Cochrane reviews.^9,23,24 For example, Kolaski et al. assessed the quality of SRs of interventions for children with cerebral palsy using the Measurement Tool to Assess Systematic Reviews-2 (AMSTAR-2).^24,25 Eighty-three SRs were included in their analysis, four of these were Cochrane reviews. The only reviews that were approved by the AMSTAR-2 tool²⁵ were published within Cochrane, the remaining SRs were deficient for critical and non-critical domains of AMSTAR-2 evaluation. This implies that recommendations on the treatment of children with cerebral palsy are based on critically low quality of evidence.²⁴ One of the critical items in the AMSTAR-2 quality assessment tool is the publication of the SR protocol before conducting SR.²⁵ Protocol registration increases the transparency and quality of the research and diminishes the risk of duplication, and potential misconduct.²⁶ It thus appears to result in higher quality methodology.^27,28,29 Protocol registration is mandatory for Cochrane reviews³ but rarely for non-Cochrane SRs, depending on journal requirements. To overcome this problem the Prospective Register of Systematic Reviews (PROSPERO, https://www.crd.york.ac.uk/prospero/) was launched in 2011.^29,30 It is a free open database for the registration of protocols for SRs associated with health care. Differently to clinical trials on humans where the registration of protocol is obligatory,³¹ there is no such requirement for SRs. Indeed, according to a recent study by van der Braak et al., only 38% of SRs on interventions published between January 2020 and January 2021 had a preregistered/ published protocol.²⁹ This percentage is increasing compared to 5.6% in 2013 (no protocols for SRs were found before 2013), and 31.6% in 2018.³²

A tool to assess the risk of bias within SRs is the Risk of Bias in Systematic Reviews (ROBIS).³³ Differently from the AMSTAR-2 that is applied for the intervention SRs,³⁴ ROBIS may be applied for intervention, diagnostic, etiology, and prognostic SRs.³³ The two tools are related and have several overlapping domains, however, they are not interchangeable. Both tools pinpoint the methodological quality (the prevention of systematic errors by study design, conduction, analysis, interpretation, and publication) and risk of bias (whether the results of the study are affected by the drawbacks in design, conduction, and analysis).³⁵ Both AMSTAR-2 and ROBIS demonstrated good inter-rater reliability,^{24,25,33,35,36} being not superior to each other.^36,37 Indeed, in the overview of SRs on complementary and alternative medicine therapies for infantile colic, inter-rater reliability was 0.6 for AMSTAR-2 and 0.63 for ROBIS.³⁶ It is pivotal though to train the authors for AMSTAR-2 and ROBIS to understand the differences between these two methods, and to make the conscious choice of which one to use to follow methodological rigor.

While AMSTAR-2 and ROBIS are crucial tools to assess SR conduct, the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) is the guideline for the reporting of SR.^38,39 The updated version of the PRISMA guideline includes 27 items within seven sections (title, abstract, introduction, methods, results, discussion, and other information).³⁸ The acknowledgment of PRISMA guidelines is beneficial already in the planning phase of a SR to ensure that all required items are covered and the appropriate methodological choices are made.^9,38 Following the PRISMA guidelines allows the authors to generate a complete and transparent reporting of their SR.

Importantly, the PRISMA checklist determines how completely each of the seven sections of SR is reported, but does not ascertain the quality of conduct and performance. Likewise, AMSTAR-2 and ROBIS are tools to assess the conduct of SR but they do not replace the methodological guidance. It has been shown that adherence to the PRISMA checklist does not guarantee achieving AMSTAR-2 and/or ROBIS standards.^40,41,42 In a quality assessment study on the timing of complementary feeding for early childhood allergy prevention, it has been demonstrated that only two SRs out of 12 fulfilled all PRISMA 2009 checklist items.³⁹ However, both these SRs were assessed to have low and critical low quality assessed by the AMSTAR-2 tool; one of them had a low risk of bias, and the other one high risk of bias assessed by the ROBIS tool.⁴² Therefore, the implementation of AMSTAR-2 and ROBIS for the evaluation of SR conduct and PRISMA for the comprehensiveness of reporting is recommended (Table 1).

Table 1 Available tools for clinical and preclinical SRs at planning, conduct and report stage.

Full size table

Preclinical and clinical SRs: similarities and differences

Preclinical studies aim to understand the pathophysiological processes of the diseases, explore and discover potential treatment strategies, and test the safety and efficacy of new drugs before the initiation of clinical trials.^14,43 However, the attention to the methodological quality of primary animal studies is still unsatisfactory. Thus, preclinical SRs often have their focus on possible areas in improvement of study design, conduct, and report.

Within pediatrics, the impact of the findings of the SRs on antenatal steroids provides an excellent example. High certainty evidence shows that the administration of antenatal steroids in case of risk of preterm delivery reduces neonatal mortality.⁴⁴ It is useful to look at similarities and discrepancies between the findings of preclinical⁴⁵ and clinical⁴⁶ SRs of antenatal steroids on long-term outcomes. The preclinical SR on antenatal steroids included 64 studies performed mainly in rodents.⁴⁵ The number of primary studies included in this preclinical SR⁴⁵ is twice as high as in clinical SR on antenatal steroids.⁴⁶ However, it is not possible to calculate the total number of animals in this preclinical SR, due to unclear reporting of the sample size in the primary studies (personal communication with Dr. van der Merwe). This is not the case for the clinical SR where the total number of children is reported (1.25 million).⁴⁶ The authors of the two SRs could not perform the subgroup analysis based on sex due to a lack of data in the primary studies.^45,46 The mortality rate was also underreported in the primary studies included in the preclinical SR (personal communication with Dr. van der Merwe). The lack of information on sex, mortality rate, and how many animals were used at the entry in the primary study raises several ethical questions regarding the completeness of the reporting. The outcomes were measured on term-born animals: animals had mature organ systems and physiology, leading to a further relevant question as a translation of the data into the clinical field. In the clinical study setting betamethasone was the most used antenatal steroid (in 77% of the included studies),⁴⁶ whereas in animal studies dexamethasone was used in 81% of the included studies.⁴⁵ Moreover, only 28% of the animal studies used clinically equivalent doses of steroids.⁴⁵ Two-thirds of studies in animals used multiple courses of steroids⁴⁵ while in clinical studies 1/3 of included studies reported a single dose of antenatal steroids.⁴⁶ Such divergence in the different steroids (betamethasone or dexamethasone), dosage, and administration regimen used between preclinical and clinical studies is problematic. Of note, the authors of the preclinical SR did not perform a meta-analysis of outcome data due to differences in outcome definition, animal model, the dosage of steroids, single/ multiple courses, age of the animal at assessment, and methods of outcome measurement.⁴⁵

Quality assurance of preclinical primary studies

To improve the reporting of primary preclinical studies the ARRIVE (Animal Research: Reporting of In Vivo Experiments) guidelines were developed in 2010. The purpose of the ARRIVE guidelines is to increase the quality, reporting, transparency, and reproducibility of primary animal studies.⁴⁷ Endorsement of these guidelines was applied by several journals. Yet, no marked progress was noted by the ARRIVE working group in 2020: randomization was reported by 30–40% of publications, blinding only by 20% of publications, sample size calculation, and basic animal characteristics below 10% of publications.⁴⁷ The authors of the guidelines address two possible reasons for the limited adherence to the guidelines: scarce awareness of the weight of incomplete reporting, and to which extent the journal staff is committed to fulfilling the guidelines.⁴⁷ To defeat the issue of compliance with the ARRIVE guidelines, the ARRIVE working group revised, updated, and reorganized the first version introducing a more user-friendly adaptation of ARRIVE 2.0 guidelines.⁴⁷ They consist of two sets: the “ARRIVE Essential 10” and the “ARRIVE Recommended Set”. The former provides the fundamental requirements for the reliability of the manuscript: study design, sample size, measures to reduce subjective bias, outcome measures, statistical methods, animals, experimental procedures, and results.⁴⁷ The “ARRIVE Recommended Set” invites to provide detailed information on animal husbandry and care, protocol registration, ethical disclosure, and declaration of interests.⁴⁷

One of the possible solutions to these problems is protocol registration, or preregistration.^47,48,49 Although widely accepted and used in clinical trials, it is still extremely uncommon in preclinical research. ARRIVE 2.0 guidelines strongly recommend the registration of protocol.⁴⁷ Preregistration of protocol results in reporting on detailed study design, randomization, blinding, primary outcome measure, and planned analysis, which reduces risks of questionable research practices like HARKing (Hypothesizing After the Results are Known⁵⁰ or cherry picking (report of advantageous results with occulting the unfavorable results).⁵¹ Registration of the primary studies’ protocols is a simple and free procedure, which might be performed in registers such as https://preclinicaltrials.eu/ or https://www.animalstudyregistry.org/.

The consultation with ARRIVE 2.0 guidelines at the protocol stage of the study enhances the chances of higher quality and addresses the potential biases. If the primary outcomes are accurately pre-specified in a-priori published protocol, the obtained data, independently whether it is positive, negative, or neutral, is more reliable.^{15,16,47,48,49,52,53} Moreover, it can minimize the risk of outcome switching based on results, thus the research remains hypothesis-driven and not result-driven.⁴⁹

Additionally, PREPARE (Planning Research and Experimental Procedures on Animals: Recommendations for Excellence) guidelines, available at https://norecopa.no/prepare, may be used for individual animal studies.⁵⁴ This guideline consists of the following parts: formulation of the study, dialog between scientists and the animal facility, and quality control of the various components of the study.⁵⁴

Recognizing the problem within preclinical research, Nature Publishing Group changed the editorial policy creating a 10-item checklist for manuscript revision that addresses whether certain measurements were applied to assure randomization, blinding, sample size calculation, data analysis, and publication bias, in May 2013.⁵⁵ The follow-up study revealed an improvement in reporting risk of bias by 16.4% in Nature group journals compared to the other types of journals (no change detected).⁵⁵ This indicates that change in acceptance to follow higher study conduct and report standards may be, although slow, possible.

Quality assessment in preclinical SRs

The recognition of the above-mentioned problems led to the development of detailed guidance via a free-of-charge online platform (The Systematic Review & Meta-analysis Facility (SyRF), https://syrf.org.uk/) on how to perform a high-quality SRs in animals.^{15,48,56,57,58,59,60,61} The Collaborative Approach to Meta-Analysis and Review of Animal Data from Experimental Studies (CAMARADES) and Systematic Review Center for Laboratory Animal Experimentation (SYRCLE) have been probably the largest groups, providing methodological assistance (both in means of tools, educational courses, and practical assistance if needed) for the evaluation of animal studies using systematic review approaches.^48,53 Despite the recent integration of the SYRCLE group into CAMARADES it is still possible to use the SYRCLE tool to assess risk of bias. CAMARADES approaches quality score checklists. It seems that there might be a poor understanding of the differences between these two tools.¹ The CAMARADES checklist evaluates the reporting by answering pre-specified questions (yes/no, maximum 10) regarding the appropriateness of the animal model, randomization, blinding, sample size calculation, temperature control, compliance with regulatory committees, and statements of conflict of interest.⁵³ The SYRCLE’s risk of bias tool uses reporting to look into the risk of bias. It is based on the Cochrane risk of bias assessment tool (a translation from clinical to preclinical SRs research) and it contains 10 bias items (selection, performance, detection, attrition, reporting, and other biases) with possible answers low/high/unclear.⁴⁸ The overview of these two tools is provided in Table 2. Despite the availability of quality and risk of bias assessment tools for the past 20 years, approximately only 45% of SRs in animals have some kind of quality assessment, and around 17% of SRs include both quality assessment and meta-analysis.⁴ The trend of quality assessment in animal studies appears to be promising since the first animal SR publication in 1992,⁵ increasing to 36% in 2010 and 45% in 2019.⁴ Russell et al. reported that only 5% of the SRs published the quality assessment based on the CAMARADES checklist and risk of bias based on SYRCLE.¹

Table 2 CAMARADES quality assessment checklist and SYRCLE risk of bias items.

Full size table

As a result of the adoption and use of SRs in the preclinical field, several common issues became obvious. Lack of randomization and blinding in preclinical studies results in an overestimation of the detected size effect, leading to erroneous and misleading interpretations.^{8,53,62,63,64} Sample size calculation is scarcely reported and the majority of animal studies on intervention have very few animals per group (e.g. 6–8) resulting in poor statistical power.^1,8,52,56 On top of that, small studies may result in a greater effect compared to large studies (both in preclinical and clinical settings) due to the heterogeneity.^65,66,67 They are more subjected to selection, attrition, and publication biases, resulting in false-positive intervention effects (both in clinical and preclinical research).^24,65,66,67 Publication bias, e.g. the studies with positive results are most likely to be published, may lead to overestimation of the intervention effect and potentially duplication of the research due to missing reporting of negative results.^49,68,69 Mueller et al. reported that just half of the animal SRs assessed publication bias.⁶⁹

Likewise for the primary preclinical studies the registration of protocol for SR is crucial. Indeed, it is one of the domains in the SYRCLE tool on selective outcome reporting.⁴⁸

There are different databases available for animal SR protocol registration: PROSPERO (https://www.crd.york.ac.uk/prospero/), Research Registry (Research Registry - Registry of Systematic Reviews/Meta-Analyses, https://www.researchregistry.com/register-now/register-your-systematic-review), INPLASY (International Platform of Registered Systematic Review and Meta-analysis Protocols, https://inplasy.com/). PROSPERO is free of charge, accepts only SRs with a clear benefit for human health, and allows version tracking. Research Registry and INPLASY are available by payment and both accept SRs, INPLASY accepts additional scoping reviews. All three databases provide a unique identifying number and the data submission section is possible only for the Research Registry.⁷⁰ Two other registers accept all study designs: OSF Registries (OSF preregistration, https://www.cos.io/initiatives/prereg) and protocols.io (https://www.protocols.io/). Both are free of charge and provide version tracking and a DOI.⁷⁰ The result submission is not possible, but OSF Registers provide the link to the OSF projects where the data can be presented.⁷⁰ PROSPERO seems to be the most used with more than 100000 SR protocols registered.⁷⁰ For veterinary SRs specifically, another dedicated register is available, VetSRev (https://vetsrev.nottingham.ac.uk/).

A database for animal SRs was developed by Langendam et al. in 2021, and it is freely available online (https://data.mendeley.com).⁴ The purposes of this database are: “(1) avoid duplication of effort and, thus, reduce research waste, (2) facilitate researchers in easily identifying all systematic reviews on a specific topic, (3) aid in the creation of evidence maps, (4) serve as a resource for further analysis to advance the methodology in evidence synthesis of animal-based research”.⁴ The database contains all SRs in animals since the first publication in 1992. Another question that may be addressed with the help of this database is a translation of animal studies in humans.⁴

In conclusion, education and familiarization with available methodological tools, ARRIVE 2.0, PRISMA, CAMARADES, and SYRCLE (Fig. 2), is warranted to shift the research from the chancing significance to higher quality and thereby reduce research waste, unethical animal use, and ultimately unnecessary clinical studies.

**Fig. 2: Infographic: ”Important steps to mitigate biases in systematic review and meta-analysis of preclinical studies”.**

Challenges of SRs in children

Several characteristics should be considered when dealing with the pediatric population and thereby some requirements may differ from the adult population. The research in the pediatric field has been deficient.^71,72 Many currently used treatments in children are extrapolated from adult efficacy and safety data and may result in over- or under-treatment.^71,73,74 The rapidly developing physiology in children (neonates, infants, children, and adolescents) results in differences in pharmacology and psychology^74,75,76,77 emphasizing the importance of the research questions to be age-specific. Evidence could be summarized and presented in recommendations for the relevant age group. However, the first problem arises from the definition of newborns, infants, children, adolescents, and adults across the studies.^78,79,80 Indeed, the age of the sample population was not defined in approximately 1/3 of Cochrane reviews in children.⁷⁸ This confusion in the definition of “child” and other age-related terms is present among the databases, leading to possible flaws in electronic search.⁸⁰ Moreover, there is inconsistency in reporting the age of the study population in titles.⁸¹ To overcome this problem, specific filters for electronic search for primary studies in the pediatric population were developed.^82,83 Kastner et al. demonstrated that the combination of MESH terms and keywords for the MEDLINE database resulted in high sensitivity and specificity (for clinical pediatric studies – 98% and 81.2% and for neonatology – 95.3 and 83.6%, respectively).⁸²

Another aspect to keep in mind in pediatric SRs is the consideration of age-related physiological changes in defining the interventions. The rationale of the specific intervention should be defined for a distinct age group specifying age-related dosage, route of administration, duration of therapy, bioavailability, and other pharmacokinetics and pharmacodynamics aspects.⁷⁷ For example, the total body water content (%) in the full-term newborn is approximately 80%, and decreases over the first year of life to approximately 60%, reaching the adult level.⁸⁴ Thus, the water-soluble drugs have a higher volume of distribution in neonates compared to older infants and adults. This results in the need for dose adjustment. In addition to pharmacokinetic aspects, even pharmacodynamics may be influenced by developing physiology and produce different responses on intervention. A good illustration of that is the data on selective serotonin re-uptake inhibitors (SSRI) that are used for treatments of depressive and anxiety disorders. Differently from the adult population, SSRIs increase the risk for suicidal behavior, aggression, and akathisia in children and adolescents.⁸⁵

Not only justifying the intervention but also clarifying the comparison is crucial in pediatric SRs. Commonly used comparisons in pediatric trials are “standard care” which may happen to be an off-label drug or placebo. When the “standard care” is an off-label drug there is the underlying problem of insufficient safety and effectiveness data, which raises the ethical dilemma of protecting children from research risks against not approved therapies.⁸⁶ When a placebo is used as a comparison one should have in mind that it may have a higher response rate in children and adolescents compared to adults.^85,87 This may ultimately introduce the underestimation of the placebo effect and overestimation of the drug effect if the drug efficacy is extrapolated from adult trials.⁸⁸

Likewise in adult SRs, the outcomes should be determined at the planning stage of the review. Considering the age-related maturational process, some benefits and harms of the intervention may appear later in life. Acknowledging this may influence which study designs and outcomes should be included in SR. For instance, a recent scoping review on attention deficit hyperactivity disorder (ADHD) brings up the limitations of the evidence on long-term outcomes of the intervention in children and adolescents with ADHD.⁸⁹ It suggests the overdiagnosis and overtreatment in children and adolescents with ADHD, highlighting the gaps in evidence regarding the long-term benefits and harms of diagnosis and treatment of children with milder symptoms.⁸⁹ Such “gaps in evidence” may be the outcome of the interest of SR even though they are not identified in the primary studies. The primary outcomes of the SR must be specified a priori to avoid the outcome reporting bias.⁹⁰

A special consideration should be made for perinatal/neonatal medicine due to the unique characteristics of this population compared to older patients. Additionally, the outcome measures are different based on gestational age at birth and related to prematurity itself, such as intraventricular hemorrhage, chronic lung disease, retinopathy of prematurity, and necrotizing enterocolitis. Additionally, because of the immaturity of several organ systems and critical illness in the neonatal period, there are long-term consequences on development. An overview of the challenges in children and juvenile animals is presented in Table 3.

Table 3 Challenges of SRs in children and juvenile animals.

Full size table

Considering the above-mentioned differences in the pediatric population and to increase the quality, completeness, and transparency of pediatric SRs, the proposal for the development of an extension of PRISMA guidelines, PRISMA-Protocol Children (PRISMA-PC) and PRISMA-Children (reporting), was made.^72,80,91 The last update on protocol status was in May 2023 and it is expected that the PRISMA-C statement will be published in Q4 2023 (https://lab.research.sickkids.ca/enrich/reporting-standards/prisma-c-prisma-pc/).

Challenges of SRs in juvenile animals

Similarly to human children, the age of the animals is one of the major determinating factors, given the continuous development and related to changes in body composition and physiology. Recently, van der Laan and colleagues demonstrated a wide variety of the ages of animals at the start of the pharmacological compounds (a total of 15 different compounds were used).⁹² For example, a compound was given at postnatal day (PND) 28 in juvenile rats while the planned pediatric age for the study was neonatal meaning that rats at PND 28 were too old (neonatal period in rats is considered to be up to PND10).^92,93,94 The authors identified that in four studies a compound was started at PND 7 while the targeted pediatric age started at six years (postnatal weeks 3-6 in rats), indicating that the drug of investigation was given way too early.⁹² One of these early started compounds for the treatment of ADHD led to the development of novel behavioral effects (increased agitation, tenseness, aggressiveness, followed by decreased activity), suggesting that this drug affects the neurodevelopmental processes during brain development, resulting in changes in neuropharmacological response and altered behavior later in life.⁹² Sometimes animals may not be juvenile but the model might mimic a pediatric condition. Examples include models of hyperoxia models of bronchopulmonary dysplasia in rodent studies.⁹⁵ It is therefore pivotal to acknowledge the developmental stage of animals in relation to the children´s treatment period, which was violated in some of the above studies, compromising the reliability of the results. Thus, animal age should be rigorously justified when dealing with juvenile animal studies.

In pharmacological studies in juvenile animals, even the use of excipients should be considered. The formulations of the drugs, including excipients, are commonly the same as in adults. However, due to the different development stages of the organs of young animals, some of the excipients may be harmful. For example, the use of propylene glycol as an excipient resulted in the mortality of mice.⁹⁶ This means that excipient toxicity may influence the outcomes and the reliability of the results.

Another important aspect to highlight is the number of animals used in longitudinal pharmacological juvenile studies by introducing animals at different phases of the study, sometimes using more than one animal species (even though EMA regulatory guidelines indicate that one species is sufficient), and the willingness to cover multiple outcomes (complicated design) resulting in a high number of animals.^92,94 Differently from the human studies, the mortality rate is rarely reported as well as the sex of animals. Therefore, the sample size calculation, appropriate study design, and scientific justification of animal model choice should be designed adequately.

Once studies have been included in a SR, caution is needed to ascertain whether pooling studies in the same meta-analysis, in separate meta-analyses, or narratively. For instance, studies conducted in different species or strains might lead to substantial heterogeneity, due to different responses to the same intervention. In addition, different injury models may also cause concern about the appropriateness of combining those studies in the same analysis.

Consequently, juvenile animal SRs need to deal with the weaknesses of primary studies. They have, however, the potential to reveal and highlight these weaknesses in terms of design, conduct, and reporting, as well as the lack of understanding of age relevance and its relation to specific pediatric populations, and in some cases lack of species-specific knowledge of biology and physiology. The overview of the challenges in children and juvenile animals is presented in Table 3.

Finally, GRADE guidance is needed to assess the certainty of the evidence of animal studies following well-defined criteria. For example, the GRADE domains´ indirectness and dissemination bias present different challenges than those in clinical studies.

Translational value of preclinical research

Several methodological, conductive, and reporting problems of animal studies have been clearly shown by critically summarizing the available data in SRs. Therefore, the translational value of animal studies has been questioned.^{52,62,97,98,99,100,101} In the recent scoping review by Leenaars et al. of 121 reviews and “umbrella”-studies with meta-analysis on translational value of animal studies was demonstrated that the concordance rate was between 0 and 100%.⁹⁹

Two major categories were suggested to explain the discordance between animal and human data.^97,99 The first one may be attributed to methodological weakness and biased reporting of animal studies. Acknowledging ARRIVE 2.0, CAMARADES and SYRCLE guidelines ideally may result in high-quality animal studies and major trustworthiness of the results. For individual animal studies adherence to PREPARE (Planning Research and Experimental Procedures on Animals: Recommendations for Excellence) guidelines (https://norecopa.no/prepare) is recommended.⁵⁴ The meticulous planning will increase the likelihood of implementation of the 3Rs principles (replacement, reduction, refinement).¹⁰²

The second category of translational failure is based on the differences between the species and is difficult to address.¹⁰³ It has never been scientifically proven that animals are predictable for human outcomes.¹⁰⁴ Both animals and humans have sophisticated physiology and biology resulting in unpredictability of various degrees.¹⁰⁵ Since the publication of the 3Rs principles,¹⁰² the focus of animal research was mainly on ethical aspects and regulations rather the scientific validity. Preclinical SRs highlighted flaws in the internal and external validity of animal studies. Consequently, it has been suggested to try to replace animal studies with new experimental techniques and methods based on human biology whenever possible.¹⁰⁰ These approaches include sophisticated in-vitro cell models (organoids, organs-on-a-chip), computer-based models, and artificial intelligence.¹⁰⁰ The modification of original 3Rs¹⁰² has been proposed: replacement (over reduction and refinement), research, and relevance (to humans rather than non-human animals).¹⁰⁰

Currently, the research regarding these two viewpoints continues in parallel.⁹⁹ Despite some inevitable difficulties, it is fundamental to improve animal study design, conduct, and report. Several methodological tools are available and the education of the research is a cornerstone to change the mentality, ultimately resulting in the transparency, study quality, and availability of the raw data for the research community.

Conclusions

SRs of preclinical studies aim to assess the benefits and harms of a specific intervention. It is imperative for SRs to adhere to methodological standards, which are freely available. These include registration of the protocol, implementation of the guidelines for assessing the risk of bias, quality of the studies, and certainty of the evidence. The findings of well-conducted SRs of preclinical studies have the potential to provide a reliable evidence synthesis to guide the design of future preclinical and clinical studies.

Data availability

Data sharing does not apply to this article as no datasets were generated or analyzed during the current study.

References

Russell, A. A. M., Sutherland, B. A., Landowski, L. M., Macleod, M. & Howells, D. W. What has preclinical systematic review ever done for us? BMJ Open Sci. 6, e100219 (2022).
Article PubMed PubMed Central Google Scholar
Menon, J. M. L., Ritskes-Hoitinga, M., Pound, P. & van Oort, E. The impact of conducting preclinical systematic reviews on researchers and their research: a mixed method case study. PLoS ONE 16, e0260619 (2021).
Article CAS PubMed PubMed Central Google Scholar
Higgins, J. P. T. et al. Cochrane Handbook for Systematic Reviews of Interventions Version 6.4 (Updated August 2023) (2023).
Langendam, M. W. et al. Developing a database of systematic reviews of animal studies. Regul. Toxicol. Pharm. 123, 104940 (2021).
Article CAS Google Scholar
Omarini, D., Pistotti, V. & Bonati, M. Placental perfusion. an overview of the literature. J. Pharm. Toxicol. Methods 28, 61–66 (1992).
Article CAS Google Scholar
Freedman, L. S. Meta-analysis of animal experiments on dietary fat intake and mammary tumours. Stat. Med. 13, 709–718 (1994).
Article CAS PubMed Google Scholar
Hunniford, V. T. et al. Epidemiology and reporting characteristics of preclinical systematic reviews. PLoS Biol. 19, e3001177 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hirst, J. A. et al. The need for randomization in animal trials: an overview of systematic reviews. PLoS ONE 9, e98856 (2014).
Article PubMed PubMed Central Google Scholar
Kolaski, K., Logan, L. R. & Ioannidis, J. P. A. Guidance to best tools and practices for systematic reviews. Acta Anaesthesiol. Scand. 67, 1148–1177 (2023).
Article PubMed Google Scholar
Pound, P. & Ritskes-Hoitinga, M. Can prospective systematic reviews of animal studies improve clinical translation? J. Transl. Med. 18, 15 (2020).
Article PubMed PubMed Central Google Scholar
Cochrane‐Reward Prizes for Reducing Waste: 2017 Winners., http://www.cochrane.org/news/cochrane‐reward‐prizes‐reducing‐waste‐2017‐winners (2017).
Veening-Griffioen, D. H. et al. Are some animal models more equal than others? a case study on the translational value of animal models of efficacy for Alzheimer’s disease. Eur. J. Pharm. 859, 172524 (2019).
Article CAS Google Scholar
Veening-Griffioen, D. H. et al. Tradition, not science, is the basis of animal model selection in translational and applied research. ALTEX 38, 49–62 (2021).
PubMed Google Scholar
Hooijmans, C. R. & Ritskes-Hoitinga, M. Progress in using systematic reviews of animal studies to improve translational research. PLoS Med. 10, e1001482 (2013).
Article CAS PubMed PubMed Central Google Scholar
de Vries, R. B. et al. The usefulness of systematic reviews of animal experiments for the design of preclinical and clinical studies. ILAR J. 55, 427–437 (2014).
Article PubMed PubMed Central Google Scholar
Ritskes-Hoitinga, M. & Wever, K. Improving the conduct, reporting, and appraisal of animal research. BMJ 360, j4935 (2018).
Article PubMed Google Scholar
Thomas, J. et al. Machine learning reduced workload with minimal risk of missing studies: development and evaluation of a randomized controlled trial classifier for Cochrane reviews. J. Clin. Epidemiol. 133, 140–151 (2021).
Article PubMed PubMed Central Google Scholar
Guyatt, G. et al. Grade guidelines: 1. Introduction-grade evidence profiles and summary of findings tables. J. Clin. Epidemiol. 64, 383–394 (2011).
Article PubMed Google Scholar
Higgins, J. P. T. et al. Methodological Expectations of Cochrane Intervention Reviews., https://community.cochrane.org/mecir-manual/key-points-and-introduction (2022).
Cumpston M. C. J. Chapter Ii: Planning a Cochrane Review. In: Higgins J. et al. Editors. Cochrane Handbook for Systematic Reviews of Interventions., https://training.cochrane.org/handbook (2022).
Hoffmann, F. et al. Nearly 80 systematic reviews were published each day: observational study on trends in epidemiology and reporting over the years 2000–2019. J. Clin. Epidemiol. 138, 1–11 (2021).
Article PubMed Google Scholar
World Health Organization. Who Handbook for Guideline Development, 2nd Ed., https://www.who.int/publications/i/item/9789241548960 (2014).
Lorenz, R. C. et al. Amstar 2 overall confidence rating: lacking discriminating capacity or requirement of high methodological quality? J. Clin. Epidemiol. 119, 142–144 (2020).
Article PubMed Google Scholar
Kolaski, K., Romeiser Logan, L., Goss, K. D. & Butler, C. Quality appraisal of systematic reviews of interventions for children with cerebral palsy reveals critically low confidence. Dev. Med. Child Neurol. 63, 1316–1326 (2021).
Article PubMed Google Scholar
Shea, B. J. et al. Amstar 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ 358, j4008 (2017).
Article PubMed PubMed Central Google Scholar
Munafo, M. R., Hollands, G. J. & Marteau, T. M. Open science prevents mindless science. BMJ 363, k4309 (2018).
Article PubMed PubMed Central Google Scholar
Ge, L. et al. Association between prospective registration and overall reporting and methodological quality of systematic reviews: a meta-epidemiological study. J. Clin. Epidemiol. 93, 45–55 (2018).
Article PubMed Google Scholar
Dos Santos, M. B. F., Agostini, B. A., Bassani, R., Pereira, G. K. R. & Sarkis-Onofre, R. Protocol registration improves reporting quality of systematic reviews in dentistry. BMC Med Res Methodol. 20, 57 (2020).
Article PubMed PubMed Central Google Scholar
van der Braak, K. et al. The score after 10 years of registration of systematic review protocols. Syst. Rev. 11, 191 (2022).
Article PubMed PubMed Central Google Scholar
Booth, A. et al. The nuts and bolts of Prospero: an international prospective register of systematic reviews. Syst. Rev. 1, 2 (2012).
Article PubMed PubMed Central Google Scholar
De Angelis, C. et al. Clinical trial registration: a statement from the international committee of medical journal editors. Arterioscler Thromb. Vasc. Biol. 25, 873–874 (2005).
Article PubMed Google Scholar
Rombey, T., Doni, K., Hoffmann, F., Pieper, D. & Allers, K. More systematic reviews were registered in Prospero each year, but few records’ status was up-to-date. J. Clin. Epidemiol. 117, 60–67 (2020).
Article PubMed Google Scholar
Whiting, P. et al. Robis: a new tool to assess risk of bias in systematic reviews was developed. J. Clin. Epidemiol. 69, 225–234 (2016).
Article PubMed PubMed Central Google Scholar
Puljak, L. et al. Amstar 2 Is only partially applicable to systematic reviews of non-intervention studies: a meta-research study. J. Clin. Epidemiol. 163, 11–20 (2023).
Article PubMed Google Scholar
Banzi, R. et al. Quality assessment versus risk of bias in systematic reviews: Amstar and Robis had similar reliability but differed in their construct and applicability. J. Clin. Epidemiol. 99, 24–32 (2018).
Article PubMed Google Scholar
Perry, R., Whitmarsh, A., Leach, V. & Davies, P. A comparison of two assessment tools used in overviews of systematic reviews: robis versus Amstar-2. Syst. Rev. 10, 273 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pieper, D., Puljak, L., Gonzalez-Lorenzo, M. & Minozzi, S. Minor differences were found between Amstar 2 and Robis in the assessment of systematic reviews including both randomized and nonrandomized studies. J. Clin. Epidemiol. 108, 26–33 (2019).
Article PubMed Google Scholar
Page, M. J. et al. The Prisma 2020 statement: an updated guideline for reporting systematic reviews. Syst. Rev. 10, 89 (2021).
Article PubMed PubMed Central Google Scholar
Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G. & Group, P. Preferred reporting items for systematic reviews and meta-analyses: the prisma statement. Open Med. 3, e123–e130 (2009).
PubMed PubMed Central Google Scholar
Koster, T. M., Wetterslev, J., Gluud, C., Keus, F. & van der Horst, I. C. C. Systematic overview and critical appraisal of meta-analyses of interventions in intensive care medicine. Acta Anaesthesiol. Scand. 62, 1041–1049 (2018).
Article Google Scholar
Javidan, A. et al. Completeness of reporting in systematic reviews and meta-analyses in vascular surgery. J. Vasc. Surg. 78, 1550–1558.e1552 (2023).
Article PubMed Google Scholar
Matterne, U. et al. Quality of systematic reviews on timing of complementary feeding for early childhood allergy prevention. BMC Med. Res. Methodol. 23, 80 (2023).
Article PubMed PubMed Central Google Scholar
Sandercock, P. & Roberts, I. Systematic reviews of animal experiments. Lancet 360, 586 (2002).
Article PubMed Google Scholar
McGoldrick, E., Stewart, F., Parker, R. & Dalziel, S. R. Antenatal corticosteroids for accelerating fetal lung maturation for women at risk of preterm birth. Cochrane Database Syst. Rev. 12, CD004454 (2020).
PubMed Google Scholar
van der Merwe, J. L., Sacco, A., Toelen, J. & Deprest, J. Long-term neuropathological and/or neurobehavioral effects of antenatal corticosteroid therapy in animal models: a systematic review. Pediatr. Res 87, 1157–1170 (2020).
Article PubMed Google Scholar
Ninan, K., Liyanage, S. K., Murphy, K. E., Asztalos, E. V. & McDonald, S. D. Evaluation of long-term outcomes associated with preterm exposure to antenatal corticosteroids: a systematic review and meta-analysis. JAMA Pediatr. 176, e220483 (2022).
Article PubMed PubMed Central Google Scholar
Percie du Sert, N. et al. The arrive guidelines 2.0: updated guidelines for reporting animal research. Br. J. Pharm. 177, 3617–3624 (2020).
Article CAS Google Scholar
Hooijmans, C. R. et al. Syrcle’s risk of bias tool for animal studies. BMC Med. Res. Methodol. 14, 43 (2014).
Article PubMed PubMed Central Google Scholar
Munafo, M. R. et al. A manifesto for reproducible science. Nat. Hum. Behav. 1, 0021 (2017).
Article PubMed PubMed Central Google Scholar
Kerr, N. L. Harking: hypothesizing after the results are known. Pers. Soc. Psychol. Rev. 2, 196–217 (1998).
Article CAS PubMed Google Scholar
Murphy, K. R. & Aguinis, H. Harking: how badly can cherry-picking and question trolling produce bias in published results? J. Bus. Psychol. 34, 1–17 (2019).
Article Google Scholar
Ritskes-Hoitinga, M. & Pound, P. The role of systematic reviews in identifying the limitations of preclinical animal research, 2000–2022: Part 2. J. R. Soc. Med. 115, 231–235 (2022).
Article PubMed Google Scholar
Macleod, M. R., O’Collins, T., Howells, D. W. & Donnan, G. A. Pooling of animal experimental data reveals influence of study design and publication bias. Stroke 35, 1203–1208 (2004).
Article PubMed Google Scholar
Smith, A. J., Clutton, R. E., Lilley, E., Hansen, K. E. A. & Brattelid, T. Prepare: guidelines for planning animal research and testing. Lab Anim. 52, 135–141 (2018).
Article CAS PubMed Google Scholar
NPQIP Group. Did a change in nature journals’ editorial policy for life sciences research improve reporting? BMJ Open Sci. 3, e000035 (2019).
Google Scholar
Sena, E. S., Currie, G. L., McCann, S. K., Macleod, M. R. & Howells, D. W. Systematic reviews and meta-analysis of preclinical studies: why perform them and how to appraise them critically. J. Cereb. Blood Flow. Metab. 34, 737–742 (2014).
Article PubMed PubMed Central Google Scholar
Hooijmans, C. R., Tillema, A., Leenaars, M. & Ritskes-Hoitinga, M. Enhancing search efficiency by means of a search filter for finding all studies on animal experimentation in PubMed. Lab. Anim. 44, 170–175 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hooijmans, C. R., IntHout, J., Ritskes-Hoitinga, M. & Rovers, M. M. Meta-analyses of animal studies: an introduction of a valuable instrument to further improve healthcare. ILAR J. 55, 418–426 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hooijmans, C. R. et al. Facilitating healthcare decisions by assessing the certainty in the evidence from preclinical animal studies. PLoS ONE 13, e0187271 (2018).
Article PubMed PubMed Central Google Scholar
Hooijmans, C. R. et al. Assessment of key characteristics, methodology, and effect size measures used in meta-analysis of human-health-related animal studies. Res. Synth. Methods 13, 790–806 (2022).
Article PubMed PubMed Central Google Scholar
Soliman, N., Rice, A. S. C. & Vollert, J. A practical guide to preclinical systematic review and meta-analysis. Pain 161, 1949–1954 (2020).
Article PubMed Google Scholar
Perel, P. et al. Comparison of treatment effects between animal experiments and clinical trials: systematic review. BMJ 334, 197 (2007).
Article CAS PubMed Google Scholar
Pound, P. et al. Where is the evidence that animal research benefits humans? BMJ 328, 514–517 (2004).
Article PubMed PubMed Central Google Scholar
Silverblatt, J. A. et al. Therapies to limit myocardial injury in animal models of myocarditis: a systematic review and meta-analysis. Basic Res. Cardiol. 114, 48 (2019).
Article PubMed PubMed Central Google Scholar
Tsilidis, K. K. et al. Evaluation of excess significance bias in animal studies of neurological diseases. PLoS Biol. 11, e1001609 (2013).
Article CAS PubMed PubMed Central Google Scholar
IntHout, J., Ioannidis, J. P., Borm, G. F. & Goeman, J. J. Small studies are more heterogeneous than large ones: a meta-meta-analysis. J. Clin. Epidemiol. 68, 860–869 (2015).
Article PubMed Google Scholar
Stanley, T. D., Doucouliagos, H. & Ioannidis, J. P. A. Beyond random effects: when small-study findings are more heterogeneous. Adv. Methods Pr. Psychol. Sci. 5, 1–11 (2022).
Google Scholar
Fanelli, D. Negative results are disappearing from most disciplines and countries. Scientometrics 90, 891–904 (2011).
Article Google Scholar
Mueller, K. F. et al. Dissemination bias in systematic reviews of animal research: a systematic review. PLoS ONE 9, e116016 (2014).
Article PubMed PubMed Central Google Scholar
Pieper, D. & Rombey, T. Where to prospectively register a systematic review. Syst. Rev. 11, 8 (2022).
Article PubMed PubMed Central Google Scholar
Martinez-Castaldi, C., Silverstein, M. & Bauchner, H. Child versus adult research: the gap in high-quality study design. Pediatrics 122, 52–57 (2008).
Article PubMed Google Scholar
Farid-Kapadia, M., Joachim, K. C., Balasingham, C., Clyburne-Sherin, A. & Offringa, M. Are child-centric aspects in newborn and child health systematic review and meta-analysis protocols and reports adequately reported?-Two systematic reviews. Syst. Rev. 6, 31 (2017).
Article PubMed PubMed Central Google Scholar
Contopoulos-Ioannidis, D. G., Baltogianni, M. S. & Ioannidis, J. P. Comparative effectiveness of medical interventions in adults versus children. J. Pediatr. 157, 322–330.e17 (2010).
Article PubMed Google Scholar
Lathyris, D., Panagiotou, O. A., Baltogianni, M., Ioannidis, J. P. & Contopoulos-Ioannidis, D. G. Safety of medical interventions in children versus adults. Pediatrics 133, e666–e673 (2014).
Article PubMed Google Scholar
Klassen, T. P., Hartling, L., Craig, J. C. & Offringa, M. Children are not just small adults: the urgent need for high-quality trial evidence in children. PLoS Med. 5, e172 (2008).
Article PubMed PubMed Central Google Scholar
Ginsberg, G. et al. Evaluation of child/adult pharmacokinetic differences from a database derived from the therapeutic drug literature. Toxicol. Sci. 66, 185–200 (2002).
Article CAS PubMed Google Scholar
Smits, A. et al. Current knowledge, challenges and innovations in developmental pharmacology: a combined conect4children expert group and European society for developmental, perinatal and paediatric pharmacology white paper. Br. J. Clin. Pharm. 88, 4965–4984 (2022).
Article Google Scholar
Cramer, K. et al. Children in reviews: methodological issues in child-relevant evidence syntheses. BMC Pediatr. 5, 38 (2005).
Article PubMed PubMed Central Google Scholar
Contopoulos-Ioannidis, D. G. et al. Empirical evaluation of age groups and age-subgroup analyses in pediatric randomized trials and pediatric meta-analyses. Pediatrics 129, S161–S184 (2012).
Article PubMed Google Scholar
Farid-Kapadia, M. et al. Do systematic reviews on pediatric topics need special methodological considerations? BMC Pediatr. 17, 57 (2017).
Article PubMed PubMed Central Google Scholar
Moher, D., Soeken, K., Sampson, M., Ben-Porat, L. & Berman, B. Assessing the quality of reports of systematic reviews in pediatric complementary and alternative medicine. BMC Pediatr. 2, 3 (2002).
Article PubMed PubMed Central Google Scholar
Kastner, M., Wilczynski, N. L., Walker-Dilks, C., McKibbon, K. A. & Haynes, B. Age-specific search strategies for medline. J. Med. Int. Res. 8, e25 (2006).
Google Scholar
Leclercq, E., Leeflang, M. M., van Dalen, E. C. & Kremer, L. C. Validation of search filters for identifying pediatric studies in Pubmed. J. Pediatr. 162, 629–634.e2 (2013).
Article PubMed Google Scholar
Friis-Hansen, B. Body composition during growth. In vivo measurements and biochemical data correlated to differential anatomical growth. Pediatrics 47, 264 (1971).
CAS Google Scholar
Sharma, T., Guski, L. S., Freund, N. & Gotzsche, P. C. Suicidality and aggression during antidepressant treatment: systematic review and meta-analyses based on clinical study reports. BMJ 352, i65 (2016).
Article PubMed PubMed Central Google Scholar
Amin, S. B., McDermott, M. P. & Shamoo, A. E. Clinical trials of drugs used off-label in neonates: ethical issues and alternative study designs. Acc. Res. 15, 168–187 (2008).
Article Google Scholar
Rheims, S., Cucherat, M., Arzimanoglou, A. & Ryvlin, P. Greater response to placebo in children than in adults: a systematic review and meta-analysis in drug-resistant partial epilepsy. PLoS Med. 5, e166 (2008).
Article PubMed PubMed Central Google Scholar
Weimer, K. et al. Placebo effects in children: a review. Pediatr. Res. 74, 96–102 (2013).
Article PubMed Google Scholar
Kazda, L. et al. Overdiagnosis of attention-deficit/hyperactivity disorder in children and adolescents: a systematic scoping review. JAMA Netw. Open 4, e215335 (2021).
Article PubMed PubMed Central Google Scholar
Kirkham, J. J., Altman, D. G. & Williamson, P. R. Bias due to changes in specified outcomes during the systematic review process. PLoS ONE 5, e9810 (2010).
Article PubMed PubMed Central Google Scholar
Kapadia, M. Z. et al. Prisma-Children (C) and Prisma-Protocol for Children (P-C) extensions: a study protocol for the development of guidelines for the conduct and reporting of systematic reviews and meta-analyses of newborn and child health research. BMJ Open 6, e010270 (2016).
Article PubMed PubMed Central Google Scholar
van der Laan, J. W. et al. Evaluation of juvenile animal studies for pediatric cns-targeted compounds: a regulatory perspective. Int J. Toxicol. 38, 456–475 (2019).
Article PubMed Google Scholar
Semple, B. D., Blomgren, K., Gimlin, K., Ferriero, D. M. & Noble-Haeusslein, L. J. Brain development in rodents and humans: identifying benchmarks of maturation and vulnerability to injury across species. Prog. Neurobiol. 106-107, 1–16 (2013).
Article PubMed Google Scholar
Kim, N. N., Parker, R. M., Weinbauer, G. F., Remick, A. K. & Steinbach, T. Points to consider in designing and conducting juvenile toxicology studies. Int J. Toxicol. 36, 325–339 (2017).
Article CAS PubMed Google Scholar
Hilgendorff, A., Reiss, I., Ehrhardt, H., Eickelberg, O. & Alvira, C. M. Chronic lung disease in the preterm infant. lessons learned from animal models. Am. J. Respir. Cell Mol. Biol. 50, 233–245 (2014).
Article PubMed PubMed Central Google Scholar
Lau, K., Swiney, B. S., Reeves, N., Noguchi, K. K. & Farber, N. B. Propylene glycol produces excessive apoptosis in the developing mouse brain, alone and in combination with phenobarbital. Pediatr. Res. 71, 54–62 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ioannidis, J. P. Extrapolating from animals to humans. Sci. Transl. Med. 4, 151ps115 (2012).
Article Google Scholar
Pound, P. & Bracken, M. B. Is animal research sufficiently evidence based to be a cornerstone of biomedical research? BMJ 348, g3387 (2014).
Article PubMed Google Scholar
Leenaars, C. H. C. et al. Animal to human translation: a systematic scoping review of reported concordance rates. J. Transl. Med. 17, 223 (2019).
Article PubMed PubMed Central Google Scholar
Herrmann, K., Pistollato, F. & Stephens, M. L. Beyond the 3rs: expanding the use of human-relevant replacement methods in biomedical research. ALTEX 36, 343–352 (2019).
Article PubMed Google Scholar
Ritskes-Hoitinga, M. et al. Improving translation by identifying evidence for more human-relevant preclinical strategies. Animals 10, 1170 (2020).
Article PubMed PubMed Central Google Scholar
Russell, W. M. S. & Burch, R. L. The Principles of Humane Experimental Technique. (Methuen, 1959).
Pound, P. & Ritskes-Hoitinga, M. Is it possible to overcome issues of external validity in preclinical animal research? Why most animal models are bound to fail. J. Transl. Med 16, 304 (2018).
Article PubMed PubMed Central Google Scholar
Shanks, N., Greek, R. & Greek, J. Are animal models predictive for humans? Philos. Ethics Humanit Med. 4, 2 (2009).
Article PubMed PubMed Central Google Scholar
Greek, R. & Shanks, N. Complex systems, evolution, and animal models. Stud. Hist. Philos. Biol. Biomed. Sci. 42, 542–544 (2011).
Article PubMed Google Scholar

Download references

Acknowledgements

Dr. van der Merwe for providing additional information on their SR on “Long-term neuropathological and/or neurobehavioral effects of antenatal corticosteroid therapy in animal models: a systematic review”.

Funding

No financial assistance was received in support of the study. AM is supported by funding from the National Health and Medical Research Council of Australia. Open access funding provided by Lund University.

Author information

These authors contributed equally: Atul Malhotra, Matteo Bruschettini.

Authors and Affiliations

Department of Clinical Sciences Lund, Division of Pediatrics, Lund University, Skåne University Hospital, Lund, 21185, Sweden
Olga Romantsik & Matteo Bruschettini
Library and ICT, Faculty of Medicine, Lund University, Lund, Sweden
Matthias Bank
Preclinicaltrials.eu, Netherlands Heart Institute, Utrecht, The Netherlands
Julia M. L. Menon
Department of Pediatrics, Monash University, Melbourne, Australia
Atul Malhotra
Monash Newborn, Monash Children’s Hospital, Melbourne, Australia
Atul Malhotra
The Ritchie Centre, Hudson Institute of Medical Research, Melbourne, Australia
Atul Malhotra

Authors

Olga Romantsik
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Bank
View author publications
You can also search for this author in PubMed Google Scholar
Julia M. L. Menon
View author publications
You can also search for this author in PubMed Google Scholar
Atul Malhotra
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Bruschettini
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Each author made a substantial contribution to conception and design of the review, drafting the article or revising it critically for important intellectual content; and gave final approval of the version to be published.

Corresponding author

Correspondence to Olga Romantsik.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Romantsik, O., Bank, M., Menon, J.M.L. et al. Value of preclinical systematic reviews and meta-analyses in pediatric research. Pediatr Res (2024). https://doi.org/10.1038/s41390-024-03197-1

Download citation

Received: 13 January 2024
Revised: 15 March 2024
Accepted: 23 March 2024
Published: 13 April 2024
DOI: https://doi.org/10.1038/s41390-024-03197-1