Introduction

Disruptive behavior problems (DBs) in children and adolescents are one of the most common reasons for referral to mental health care facilities1. They are a primary component of serious child and adolescent psychopathologies2, and are characterized by behavioral manifestations that typically share difficulties in modulation of aggression, self-control, and impulsivity, and also result in threats to the safety of others and disrupt social norms3,4. Oppositional, aggressive, and hyperactive behaviors present in early childhood often predict negative mental health outcomes later in life, ranging from school failure to substance abuse and criminality5,6.

For complex disorders with limited existing treatment options and interventions with relatively lower efficacy/effectiveness, clinicians and researchers have attempted to identify subgroups of individuals based on clusters of specific symptoms or other clinical and/or socio-environmental factors7. The utility of identifying subgroups is unfortunately limited by the methods used, such as using a categorical diagnostic approach8. Because individuals with the same categorical diagnosis display a wide range of symptoms, groups characterized in this manner are very heterogeneous9. Furthermore, youth with multiple comorbid disorders are more common in child and adolescent psychiatry10.

Recent clinical research has shifted interest to shared transdiagnostic psychopathologies, in accordance with the dimensional model for common psychiatric disorders11. Prior studies have demonstrated that DBs are associated with almost all forms of psychopathology in children and adolescents and are broadly present alongside both internalizing and externalizing symptoms12,13,14. Therefore, DBs may be excellent candidates as trans-diagnostic markers for studying the effectiveness of interventions, as well as guide ongoing and future child and adolescent psychiatric research and clinical practice15.

Use of a transdiagnostic method is further supported by the fact that there may be a common underlying pathophysiological mechanism of DBs in youths across various psychiatric diagnoses. Impairments in emotion regulation and inhibitory control16,17,18,19 may be key predictors of DBs20,21. According to the dysregulation hypothesis, DBs can be induced by the failure of top-down modulation of neural areas implicated in emotional responding16. This model possibly implicates the mesolimbic dopamine system, including projections from the ventral tegmental area to the nucleus accumbens and ventral striatum, and eventually to the prefrontal cortex22,23,24. Additionally, previous research has suggested that DBs in various psychiatric diagnoses result from dysfunction in the serotonergic and dopaminergic systems in the prefrontal cortex25,26,27.

Regarding psychopharmacological interventions, a number of agents have been studied for treatment of DBs in children and adolescents. Prior clinical trials investigated the efficacy of stimulants28,29,30,31, atomoxetine28,32,33,34, atypical antipsychotics30,35,36,37,38,39,40, mood stabilizers39,41, and anticonvulsants42,43 for treatment of DBs in youths with various psychiatric diagnoses (i.e., Attention Deficit Hyperactivity Disorder (ADHD), Conduct Disorder (CD), Oppositional Defiant Disorder (ODD), Disruptive Mood Dysregulation Disorder (DMDD), Major Depressive Disorders (MDD), Anxiety Disorders, Autism Spectrum Disorder (ASD), and tic disorder). Most of the previous studies, however, focused on the primary symptoms of each categorical diagnoses, for example, inattention in ADHD, or global improvement, and reported on DBs as co-occurring symptoms. Additionally, several systematic reviews reported an overview of the efficacy and safety of common medications for DBs44,45,46.

The previous studies leave a general gap in knowledge on the relative efficacy of each agent in the treatment of DBs, especially in regard to the different mechanisms of action of medications and their potential impact on the neurobiology of DBs. These studies did not use an objective appraisal of the evidence with a thorough methodological approach47, and many of them compared only two interventions (i.e., placebo vs pharmacological intervention), in which a conventional pair-wise meta-analysis may be conducted48.

We have instead applied network meta-analysis (NMA)49. Unlike traditional meta-analyses, which only allow for direct comparisons of interventions using the pooled data from clinical trials with similar treatment arms, NMA enables the estimation of relative effects of interventions without direct comparison50. Specifically, NMA is a method for comparison among multiple treatments simultaneously in a single analysis by integrating direct and indirect data from randomized controlled trials in a network. NMA has become a focus of clinical research since it may help examine the comparative efficacy of several treatments often utilized in clinical practice simultaneously49.

To this end, we systematically reviewed randomized trials (55 in total) of various psychopharmacological agents targeting the symptoms that represent DBs (i.e., aggression, hostility, impulsivity, conduct problems, and oppositional defiant problems) and applied NMA to determine their relative efficacy. We included the previous studies of psychotropic agents across medication classes (stimulants, anticonvulsants, second-generation antipsychotics, selective serotonin reuptake inhibitors (SSRIs), aripiprazole, tricyclic antidepressants, non-stimulant ADHD medication, lithium, and dasotraline), which contain valid measures for DBs even though the treatment of DBs may not have been a primary outcome.

Methods

Study protocol registration

This study was written in compliance with the recommendation of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement51. The study protocol was registered with the International Prospective Register of Systematic Reviews (PROSPERO) in February 2022 (CRD42021256959). Supplementary Table 1 reported our results based on the updated PRISMA checklist for NMA.

Search strategy and study selection

We applied a comprehensive search strategy in literature databases. Six electronic databases (PubMed, Cochrane Central Register of Controlled Trials (CENTRAL), EMBASE, Web of Science, PsycINFO, and ProQuest Dissertations) were used for literature search and review for articles published up to February 2022 with the following search terms: (disruptive behavior OR conduct problem OR oppositional problem OR defiant problem OR aggression OR hostility OR impulsivity) AND (children OR adolescent OR youth) AND (randomized OR random OR randomly OR randomization OR randomization OR RCT OR RCTS) AND (pharmacological treatment OR pharmacological therapy OR pharmacological intervention OR drug OR medication OR medication treatment OR norepinephrine-dopamine reuptake inhibitors (methylphenidate stimulants, amphetamine stimulants, OR bupropion) OR dopamine receptor antagonists (aripiprazole, haloperidol, ziprasidone, OR ecopipam) OR GABA receptor agonists (divalproex sodium, valproate sodium, divalproex, diazepam, zolpidem, OR carbamazepine) OR selective norepinephrine reuptake inhibitors (atomoxetine) OR serotonin-norepinephrine inhibitors (duloxetine, desvenlafaxine, nortriptyline, clomipramine, imipramine, OR venlafaxine) OR serotonin-dopamine antagonists (risperidone, lurasidone, clozapine, OR quetiapine) OR selective serotonin reuptake inhibitors (sertraline, paroxetine, escitalopram, fluvoxamine, trazodone, OR fluoxetine) OR Serotonin-norepinephrine-dopamine reuptake inhibitor (dasotraline) OR lithium. After the initial search and review, a manual search was performed by reviewing the reference lists of all identified publications and reviewing similar articles suggested by the meta-analysis and systematic review articles relevant to this topic52. To supplement incomplete reports in the original papers, relevant authors were contacted.

Selection criteria and full-text screening

The review process identified randomized clinical trials (RCTs) assessing the efficacy of psychotropic medications on disruptive behavior. All RCTs of pharmacological treatment reporting measures of symptoms that may represent underlying disruptive behavior were considered for inclusion. Based on our literature review3,53, the following symptoms were selected because they may represent underlying DB: (1) aggression/hostility, (2) oppositional/defiant problem, (3) conduct problem and (4) impulsivity. For the measurements that were used for each symptom, see Table 2.

The following exclusion criteria were applied: (1) abstract-only articles, case reports and case series (2) overlapping data set (3) animal studies (4) studies which did not report the mean or standard deviation in the post-treatment measurements (5) studies investigating comparison of efficacy between psychotropic medications with similar enough mechanism of drug action (i.e., risperidone vs. quetiapine vs. clozapine or dextroamphetamine vs. methylphenidate vs. bupropion) (6) studies of combination of more than one psychotropic medication, as it is difficult to distinguish the efficacy of psychotropic medication from that of combined effect (7) studies using unreliable measurement and studies with unclear data which included non-peer-reviewed publication. There were no limitations on the language, year of publication, country, gender, or ethnicity of the patients. Through the predefined eligibility criteria, two independent reviewers first screened and chose the title and abstract, then conducted full-text screening of included articles. Any disagreement between the reviewers was decided by an independent experienced literature reviewer.

Data extraction

A standardized template file was created based on a pilot extraction with the two most relevant references. Two researchers separately extracted the data using this template file. Extracted data included: title, authors, publication year, and participants’ characteristics including age, gender, sample size and diagnosis. The data for the dosages of each psychotropic medication, methods of administration, trial duration, measurements for symptoms representing underlying DB, and pre- and post-treatment scores for symptoms were also extracted. The data extracted was reviewed by the two independent researchers and disagreements were resolved through discussion to reach a consensus.

Risk of bias assessment

Two reviewers independently evaluated the validity of each RCT through “The Cochrane Risk of Bias Tool for randomized controlled trials”54. The tool evaluates RCT using a set of domains of bias including random sequence generation, allocation concealment, selective reporting, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, and other sources of bias. Each study domain was assigned a rating of low, high, or unclear risk. Any discrepancies were resolved by a discussion among reviewers or guidance from an independent experienced literature reviewer.

Summary measure

Statistical analysis

As pre-specified in the study protocol, we analyzed psychotropic medications according to their primary mechanisms of action. This led to categorization of the listed psychotropic medications as follows: (1) Norepinephrine and dopamine reuptake inhibitors (stimulants), (2) GABA receptor agonists (anticonvulsants), (3) serotonin-dopamine antagonists (second-generation antipsychotics), (4) selective serotonin reuptake inhibitors (SSRIs), (5) dopamine receptor modulator (aripiprazole), (6) Serotonin-Norepinephrine Reuptake inhibitors (Tricyclic antidepressants, TCA), (7) selective norepinephrine reuptake inhibitors (non-stimulant ADHD medication such as guanfacine and clonidine), (8) serotonin-norepinephrine-dopamine reuptake inhibitor (dasotraline) and (9) lithium. The list of medications in each category is provided in Table 1. After the categorization, NMA was performed to compare the efficacy of different psychotropic medications with a frequent random-effects model, which preserves randomized treatment comparisons within trials using the net-meta R package version 8.0 (available at: http://CRAN.R-project.org/package=netmeta)55. International Society for Pharmacoeconomics and Outcome Research (ISPOR) recommend the NMA for comparing efficacy between different treatment modalities56. NMA enables simultaneous comparisons between all treatment arms across the studies in a single analysis by combining both direct and indirect comparisons57.

Table 1 The list of medications in each category in our study.

Heterogeneity is estimated by the I2 and Q statistics, which measure the percentage of total variation in point estimates among studies that is attributable to heterogeneity55,58. The I2 statistic describes the percentage of variation that is not attributable to chance. A value of I2 from 0 to 50% was considered unlikely to be important, 50–75% as moderate heterogeneity, and 75–90% as considerable heterogeneity59. Also, the Q statistics, which are a statistic for inconsistency, are calculated using the weighted sum of squared differences across studies, which represents the variation of treatment effect between direct and indirect comparisons at the meta-analytic level60. Significant heterogeneity was examined to be present when the p for heterogeneity was < 0.05 in the result of Q statistics.

To assess inconsistency in the network, a net-heat plot was used61. The net-heat plot is helpful to examine direct comparisons that might be likely sources of important inconsistency in the network. In that graph, the larger the gray box, the more important that treatment comparison is relative to another treatment comparison and the colored backgrounds indicate the degree of inconsistency in the network61. Additionally, the node-splitting method was used for assessment of inconsistency in the forest plots of NMA62. Publication bias was assessed by adjusted funnel plots. If data were sufficient, an Egger’s test of the intercept with centralized effect size and SE was further conducted63.

Since we assumed that the included studies differed with respect to clinical and other factors, we applied random-effects models instead of fixed-effect models. To rank the treatments based on their efficacy for each outcome, P-scores, a statistical parameter in frequentist NMA and ranging from 0 to 1, were calculated using the surface under the cumulative ranking curve (SUCRA)64. A higher P-score expresses the probability that the treatment would be better than the competing treatments65.

To interpret frequentist NMA, we checked the confidence interval to find any significant difference between the treatment arm and placebo. Then, we ordered the treatment using P-score.

Results

Search results

Our initial search yielded 1678 papers relative to our research terms. Title/abstract screening excluded 1552 irrelevant papers while a further 78 papers were excluded following full-text screening due to unavailability of data extraction, presence of adjunctive treatment, etc. Seven additional papers were included from the manual search. In total, 55 RCTs (comprising 5684 patients) were included for the NMA (Fig. 1). In addition, there were four studies comparing individual agent arms against each other (stimulant vs. non-stimulant ADHD medication, second-generation antipsychotic vs. dopamine receptor modulator, and second-generation antipsychotic vs. anticonvulsant) (Table 2, Fig. 2A). For the list of the articles included, see Table 2.

Figure 1
figure 1

Flow diagram of study selection.

Table 2 Studies measuring the symptoms of disruptive behavior problem.
Figure 2
figure 2

The result of NMA for disruptive symptoms. (A) The network graph representing treatment arms included in the network for disruptive symptoms, the thickness of the lines shows the number of studies. (B) Random effect model forest plot for comparison of each treatment arm vs. placebo. (C) Ranking of medications for disruptive symptoms using SUCRA values. (D) Comparison of the included agent groups: standardized mean differences (95% CI). Each cell represents the effect of the column-defining agent group compared to the row-defining agent group. DRA Dopamine receptor antagonist/aripiprazole, NDRI Norepinephrine–dopamine reuptake inhibitor/stimulant, SDA Serotonin dopamine antagonist/second-generation antipsychotics, sNRI Selective norepinephrine reuptake inhibitors/non-stimulant ADHD medication, SNDRI Serotonin-norepinephrine-dopamine reuptake inhibitor, SNRI Serotonin-norepinephrine reuptake inhibitors/tricyclic antidepressants, SSRI selective serotonin reuptake inhibitor.

Study characteristics

The characteristics of included studies are summarized in Table 1. Studies were performed and published between 1990 and 2021. Across studies, a total of 15 distinct psychotropic medications were represented. (1) one stimulant (Norepinephrine–dopamine reuptake inhibitor, NDRI): methylphenidate; (2) one non-stimulant ADHD medication (Selective norepinephrine reuptake inhibitors, SNRI): atomoxetine; (3) three second-generation antipsychotics (Serotonin dopamine antagonist, SDA): risperidone olanzapine and quetiapine (4) one dopamine receptor modulator (Dopamine receptor antagonist, DRA): aripiprazole (5) two SSRIs selective serotonin reuptake inhibitor): sertraline and fluoxetine; (6) three anticonvulsants (GABA receptor agonists, GRA): valproate sodium, divalproex, and carbamazepine (7) two TCAs (Serotonin-Norepinephrine Reuptake inhibitors, SNRI): desipramine and imipramine (8) one Serotonin-norepinephrine-dopamine reuptake inhibitor (Serotonin-norepinephrine-dopamine reuptake inhibitor, SNDRI): dasotraline (9) lithium.

A total of 9 diagnoses were included in the RCTs on disruptive behavior ADHD (n = 3988, 70.16%), disruptive behavior disorders (DBD) (n = 714, 12.56%), pervasive developmental disorder (n = 274, 4.82%), bipolar disorder (BD) (n = 217, 3.82%), ASD (n = 215, 3.78%), CD (n = 147, 2.59%), obsessive–compulsive disorder (n = 92, 1.62%), MDD (n = 31, 0.54%), and ODD (n = 6, 0.11%).

In total, 3490 and 2194 children and adolescents with psychiatric disorders were randomly assigned to active psychotropic medications and placebo, respectively. The age range of subjects in the RCT’s was 2–20 years (RCT’s mean = 7.03). Sixteen studies did not provide specific information on age. Subjects in both sets of RCT’s were predominantly male (83.75%) with 23 studies not reporting sex.

The mean duration of trials was 57.44 days (SD = 46.29) for medication treatment. There was one study that did not provide the duration.

Network meta-analysis results for disruptive behavior

The relative efficacy of each agent group compared with placebo was conducted for disruptive behavior symptoms (Fig. 2b). The network consisted of 55 studies comparing nine agent groups versus placebo, which were stimulant (NDRI), non-stimulant ADHD medication (sNRI), second-generation antipsychotic (SDA), lithium, dopamine receptor modulator (DRA), SSRI, anticonvulsant (GRA), TCA (SNRI) and SNDRI.

Four agent groups were significantly more efficacious than placebo when data was combined in the NMA: second-generation antipsychotic with a Standardized Mean Differences (SMD) of 0.668 (95% CI 0.537–0.800), stimulant with 0.633 (95% CI 0.513–0.752), dopamine receptor modulator with 0.402 (95% CI 0.046–0.757) and non-stimulant ADHD medication with 0.386 (95% CI 0.280–0.492) (Fig. 2b). The effect sizes from traditional meta-analysis are presented in the supplementary data (Fig. S6).

Second-generation antipsychotics (SDA) had the highest probability for being the most efficacious agents for disruptive behavior symptoms (P score = 0.9356) followed by stimulants (P score = 0.8906), then dopamine receptor modulators (P score = 0.5932), while placebo ranked as the least efficacious in reducing the symptoms of disruptive behavior (P score = 0.0811) (Fig. 2c).

The Fig. 2d shows detailed results of pairwise meta-analyses. Even though the second-generation antipsychotics and stimulants showed the highest efficacies for disruptive behavior symptoms when compared to placebo, the direct and indirect pairwise comparisons among medications revealed they weren’t significantly more efficacious than various agent groups including TCA, lithium, SSRI and dopamine receptor modulator. There were significant differences between second-generation antipsychotic and anticonvulsant with an SMD of 0.574 (95% CI 0.216–0.933), between the second-generation antipsychotic and SNDRI with 0.460 (0.114–0.806), and between the second-generation antipsychotic and non-stimulant ADHD medication with 0.282 (0.114–0.451). Stimulant also showed significant differences in its efficacy when compared with anticonvulsant (SMD 0.539 95% CI 0.164–0.914), SNDRI (SMD 0.425, 95% CI 0.083–0.766) and non-stimulant ADHD medication (SMD 0.247, 95% CI 0.095–0.399).

Quality assessment

We applied rigorous methods of quality assessment to provide evidence of the integrity of the NMA results. Supplementary Figs. S1 and S2 presented the methodological features examined for each trial and a summary result of the judged risk of bias across studies based on the Cochrane risk of bias tool. 22(40%) trials were assessed as a high risk of bias, 34 (61.81%) trials as unclear risk, and 12 (21.81%) as low risk of bias in disruptive symptoms.

Figure 2a show the network of eligible comparisons for efficacy in disruptive symptoms. All agent groups had at least one placebo-controlled trial. In network quality analysis for disruptive symptoms, we did not find evidence for heterogeneity (Q = 54.74, p = 0.1768, tau2 = 0.0086, I2 = 16). Also, there was no evidence of significant inconsistency between the direct and indirect estimates (i.e., all p-values were above 0.05) of disruptive symptoms (Tables S1, S2). The node-splitting method was also used to evaluate consistency between direct and indirect estimates (Fig. S3). For illustrating the consistency pattern that existed in each comparison, net heat plots were also formed to detect hot spots (red squares) indicating greater inconsistency among comparisons (Fig. S4). There was no significant inconsistency between direct and indirect evidence for most comparisons, as demonstrated by both the node splitting approach (P-value > 0.05) and net heat plots. Therefore, we concluded that the consistency model is valid in our NMA.

Potential publication bias in NMA was assessed through the funnel plots produced (Fig. S5). The result of the funnel plots showed no significant asymmetry pattern, and most studies were normally distributed in the funnel plot. Therefore, we determined that there was no significant publication bias in our research.

Discussion

To our knowledge, this is the first NMA of Randomized Clinical Trials (RCTs) to date exploring the individual efficacy of pharmacological treatments on disruptive behavior problems (DBs) in youths with various psychiatric disorders. The efficacy of 9 psychotropic medications on treatment of symptoms representing underlying DBs varied widely. NMA revealed that, for reducing disruptive symptoms, second-generation antipsychotics, stimulant, and non-stimulant ADHD medications were significantly more efficacious than placebo, and second-generation antipsychotics were the most efficacious. For disruptive symptoms, there is a consistent finding that psychotropic medications with higher dopaminergic receptor affinity, including methylphenidate and risperidone, showed significant efficacy in reducing these symptoms compared to the other psychotropic agents30,46,66.

For reducing disruptive symptoms, the effect size of only second-generation antipsychotics (SMD = 0.668) was large enough to be compatible with treatment of other primary/secondary symptoms of, for example, schizophrenia67 or autistic spectrum disorder68,69 in youths. Stimulants (SMD = 0.633) also showed the efficacy for reducing disruptive symptoms, but the effect size was relatively small or similar compared to the impact on main symptoms of ADHD (SMD over 0.8 and 0.6 for hyperactivity and inattention)70,71,72. The effect sizes of the other agent groups were small to medium (SMD values of 0.3–0.5) for disruptive symptoms, and generally lower than the effect size found in previous meta-analyses for these agents in reducing primary symptoms of ADHD (non-stimulant ADHD medication: SMD over 0.7 and 0.5 for hyperactivity and inattention)70,71,72, Tourette’s disorder (dopamine receptor modulator: SMD over 0.5 for tic)25,73, and depression (SSRI: SMD over 0.5 for depression)48,74. Therefore, while our results show that stimulants, dopamine receptor modulators, SSRIs, and non-stimulant ADHD medications may all decrease disruptive symptoms and may have neural level impacts on the pathophysiology of disruptive behavior in youths with various psychiatric disorders, the magnitude of their effect might be lower than that found in treatment of primary symptoms of specific disorders.

The fact that psychotropic medications with dopaminergic affinity (i.e., second-generation antipsychotics, stimulants, and dopamine receptor modulators) were significantly efficacious for reducing disruptive behavior problems indicates that while the exact nature of dopamine’s relationship to disruptive behavior is not completely understood, dysfunction in the dopaminergic system itself may play a significant role in youths’ response to these medications (e.g. reducing affective aggression)30,75,76. Consideration should be given to the fact that although stimulants and second-generation antipsychotics both impact the dopaminergic system, their specific mechanisms of action are quite different (i.e., methylphenidate is a dopamine agonist, but risperidone and olanzapine are D2 receptor and 5-HT2a antagonists).

A simplistic view of dopamine in mental disorders and their treatment has been that Dopamine levels are either too high or too low in specific brain regions. This may not, however, be sufficient to provide an explanation of the nature of dopaminergic dysfunction in various clinical presentations77,78,79,80,81. One potential explanation is that while stimulants and second-generation antipsychotics seem to have different mechanisms of action, they both result in augmenting tonic dopamine levels and reducing exaggerated phasic responses79,81. Secondly, it is also possible that not only the stimulants, but also the second-generation antipsychotics may directly affect top-down modulation by facilitating dopaminergic neurotransmission in the prefrontal cortex, even if they primarily affect the limbic system. It has been shown that acute treatment with second-generation antipsychotics, especially risperidone and olanzapine, increases the extracellular levels of dopamine in the prefrontal cortex (one of the main areas implicated in top-down modulation) as well as the nucleus accumbens82,83. The increased dopamine level in the nucleus accumbens induced by second-generation antipsychotics may mainly be due to the blockade of presynaptic D2 receptors84, since D2/3 receptor affinities of antipsychotics is associated with their preferential effects on dopamine levels there82. Since most prefrontal dopaminergic neurons do not possess D2 auto receptors, the increase dopamine release in the prefrontal cortex induced by second-generation antipsychotics appears to involve additional receptors85. Serotonergic pathways might also influence the regulation of extracellular dopamine in the prefrontal cortex86. Local activation of 5-HT2A receptors in the prefrontal cortex influences dopamine neuron activity in the ventral tegmental area and dopamine release in the mesocortical pathway87. The combined effects on D2 and 5-HT2A antagonism induced by second-generation antipsychotics (not just by reducing dopaminergic action) may have a more complex mechanism of reducing disruptive behavior in youths.

Clinically there has been concern that the effect of second-generation antipsychotics on disruptive symptoms is merely a product of their sedating effect88, due to their anti-histaminergic and anti-cholinergic actions89, however, it is also possible that these systems may play a critical role in disruptive behavior symptoms90,91. The fact that both stimulants and second-generation antipsychotics can exercise their impact on the neural areas of bottom-up emotional processing in addition to top-down cognitive control92, indicates that the symptoms of disruptive behavior may be driven by more complex mechanisms, which requires further future study.

In sum, the present NMA demonstrated that stimulants (i.e., methylphenidate) and second-generation antipsychotics, specifically risperidone, have large effect sizes in the treatment of DBs in children and youths, while mood stabilizers, SNRIs, α-2 agonists and antidepressants showed low to medium effect sizes. These results suggest that those agents with a higher affinity for dopamine receptors may be useful in the management of DBs, a result consistent with the hypothesis that DBs are due to deficits in the top-down inhibitory processes in the mesolimbic dopamine system.

There are a few caveats to offer. While this review examined various psychotropic medications for the treatment of symptoms of disruptive behavior, it did not take into account administration method, dosage or duration of treatment, safety or tolerability issues, or the age, gender and race/ethnicity of participants. These were not explored due to lack of reported information.

Secondly, we excluded studies that combined two or more psychotropic medications or psychotropic medication with other modalities of treatment (behavioral intervention/therapy) due to limitations of statistical analysis. We also did not examine studies that implemented neurobiological assessment methods to evaluate the neural level changes in the underlying pathophysiology of disruptive behavior (such as neuroimaging or genetics) to reduce measurement heterogeneity. Future study is warranted on these as well.

Third, some agent groups had small sample sizes, such as lithium (n = 27) and TCAs (SNRI) (n = 48). Previous work suggests that treatment efficacy would be overestimated in studies with small sample sizes93, therefore, it might be difficult to accurately assess the treatment efficacy of lithium and TCAs for DBs.

Fourth, because the symptom indicators for DBs applied in this study included symptoms in the context of externalizing disorders (i.e., ADHD, CD, ODD, DBD), there is a disparity in sample size between externalizing and internalizing disorders. The generalization of our findings to the full spectrum of pediatric psychiatric symptoms is therefore limited by the inability to interpret findings in symptoms manifested in internalizing disorders (i.e., depressive disorders, anxiety disorders, obsessive–compulsive and related disorders, trauma and stress-related disorders, and dissociative disorders). DBs in Autism Spectrum Disorder are a significant clinical issue, but these may or may not share the mechanisms of DBs in the externalizing disorders94,95. Thus, although in many cases stimulants and antipsychotic medications are being used for DBs in Autism Spectrum Disorders94, our current finding may not apply to those cases, and future study is necessary. In addition, disruptive behavior problems in other psychiatric diagnoses (such as BD and psychotic disorder) in children and adolescents may benefit from a separate and comprehensive study with similar design as this, since the underlying neurobiological mechanism of disruptive behavior problem in those diagnoses may significantly differ from the ones included in the current study96,97.

Lastly, including studies with various scales to measure DB symptoms may potentially limit interpretability. The analysis methodology we used (NMA), however, allows us to pool data from different trials with similar treatment arms, to measure relative effects of interventions98. In addition, all of the studies that were included in the NMA implemented the scales to measure similar if not identical disruptive behavior elements, such as oppositional/defiance, overt aggression, and hyperactivity/impulsivity (see Table 2)99. Previous studies demonstrated high levels of overlap and strong correlations of these symptoms with each other100.

Despite these limitations, our study was able to identify the efficacy of various psychotropic medications for treating the various symptoms of disruptive behavior, and speculates that this efficacy may be related to the mechanism of action of medications at the neural level. In particular, psychotropic medications with dopaminergic affinity, especially the second-generation antipsychotics, were particularly efficacious in treating the symptoms of disruptive behavior.

Clinical implication and conclusion

Based on this review, several pharmacotherapies such as second-generation antipsychotics and stimulants may have potential efficacy for the treatment of disruptive behavior symptoms in the psychiatric diagnoses included in our study. This may be a first step toward refining and individualizing treatment options for youth with externalizing disorders, as well as eventually improving our understanding of the underlying neurobiological mechanisms of disruptive behavior in these diagnoses. Future study may consider incorporating a trans-diagnostic concept into designing RCTs targeting DBs.