Introduction

The diagnosis of beta-lactam allergy without allergic workup for confirmation is a public health problem which is mostly observed among children. Approximately 2–5% of the pediatric population have self-reported beta-lactam allergy.1 However, most children with suspected beta-lactam allergy are likely to be overlabeled.2,3 Misinterpretation of beta-lactam allergy could increase antibiotic resistance and healthcare costs due to the use of broad-spectrum antibiotics.4,5

The drug challenge test is considered the gold standard to diagnose drug allergy.1,2 Although skin tests are commonly performed before the drug challenge, the diagnostic accuracy of skin tests is considered insufficient; in particular, they have low sensitivity.6,7 A previous meta-analysis published in 2015 determined that children with suspected beta-lactam allergy were truly allergic if either the skin or drug challenge test was positive with an immediate reaction.3 The meta-analysis focused only on immediate reactions and included only four original studies, which reported prevalence rates between 0.9% and 14.8%. Since 2015, an increasing number of studies have assessed the true-positive rate for subgroups with a history of immediate or nonimmediate reactions8,9 using a direct drug challenge without performing a prior skin test.10,11,12

To the best of our knowledge, there has been no systematic review and meta-analysis that has examined the child prevalence of true beta-lactam allergy in suspected cases based on the results of drug challenges. Neither safety nor the predictive negative value of the drug challenge has been thoroughly evaluated in previous systematic reviews and meta-analyses. We, therefore, performed a systematic review and meta-analysis to assess: (a) the prevalence of positive results from drug challenge test for children with suspected beta-lactam allergy; (b) the prevalence of serious adverse events in response to a drug challenge test; and (c) the prevalence of positive results from a second drug challenge result among children with negative results from a first drug challenge.

Methods

Protocol and registration

We conducted a systematic review and meta-analysis in accordance with the Preferred Reporting Item for Systematic Review and Meta-analysis guidelines (PRISMA 2020).13 We developed a detailed protocol and registered our study with the International Prospective Register of Systematic Reviews (PROSPERO: CRD42021240535).

Article inclusion criteria

We included all cohort studies. We did not restrict publication date and status (full publications, conference abstracts, and unpublished data), or languages. We excluded case reports, case series, case–control studies, and reviews. Study participants were children under the age of 18 with suspected beta-lactam allergy who underwent a drug challenge test. We excluded studies in which the participants were mostly adults, however, included studies involving both adults and children where data on children could be extracted independently. We accepted other definitions of children and differing procedures for administering the drug challenge test. We allowed the determination of results used in the original studies. We also excluded studies with (i) overlapping participants, (ii) unclear information on the suspected drug, or (iii) unclear information on the challenge test.

Primary outcomes

The primary outcomes were (a) the prevalence of positive results from a drug challenge test for children with suspected beta-lactam allergy; (b) the prevalence of serious adverse events in response to a drug challenge test; and (c) the prevalence of positive results from a second drug challenge among children with negative results from a first drug challenge.

We defined the prevalence of positive results in the first challenge as the number of positive first drug challenge results per total number of first drug challenge tests. We accepted the criteria adopted by the original studies for the results of the drug challenge test.

The prevalence of serious adverse events was defined as the number of serious adverse events per total number of first drug challenge tests. We accepted the definition adopted by the original studies for serious adverse events. If the authors did not assess the severity of adverse events, we defined serious adverse events as immediate reactions with anaphylaxis or requiring administration of epinephrine or delayed reactions such as Stevens–Johnson syndrome or toxic epidermal necrolysis.

The prevalence of positive results in the second challenge after a first negative result was defined as the number of positive second drug challenge results per total number of second drug challenge tests. Children who had a negative first drug challenge result underwent a second drug challenge test. Those who were positive were defined as having a positive second drug challenge result. We did not set a criterion for the interval between the first and second challenges. If a second drug challenge test was not performed, children who used the suspect drug and developed allergy-like symptoms after a first negative result were also classified as having a positive second drug challenge result.

Search strategy

We searched PubMed MEDLINE, Dialog EMBASE, the Cochrane Central Register of Controlled Trials, and the World Health Organization International Clinical Trials Platform Search Portal (ICTRP), and ClinicalTrials.gov from inception to March 5, 2021. We did not restrict to language or date of publication. Supplementary Table S1 shows the details of search terms used for each database. We also confirmed the references included in the previous systematic reviews,2,3 clinical guidelines,1,14 or position papers,15,16 and studies included in the present review.

Study selection and data extraction

Two of the three review authors (YK, NT, and HT) screened the titles and abstracts of studies according to the inclusion and exclusion criteria and then obtained and independently assessed the full text for eligibility. If we were not sure whether a study met the inclusion criteria, we emailed the authors of original study and requested additional information. We used Rayyan software17 for study selection. Two of the three review authors (YK, NT, and HT) independently extracted data from the studies. Any disagreements between the two reviewers were resolved through discussion or with a third reviewer if required.

Quality assessment

Two of the three review authors (YK, NT, and HT) independently assessed the risk of bias for each study by using the Joanna Briggs Institute prevalence critical appraisal tool.18 This tool consists of nine domains. Each domain is evaluated as Yes, No, Unclear, or Not Applicable. The quality score for each study was defined as the percentage of Yes answers obtained for the total number of domains evaluated. Quality scores were graded as low quality if the percentage was <50%, moderate quality if it was 50–80%, and high quality if it was >80%.19,20 Any conflict between the two reviewers were resolved through discussion or with a third reviewer if required.

Data synthesis and statistical analysis

All statistical analyses were performed using R software (version 4.0.3).21 For continuous variables, data were expressed as median (interquartile range (IQR)). To estimate the pooled prevalence, we synthesized the data with the “meta” and “metafor” packages for R and combined proportions with 95% confidence intervals (CIs) and 95% prediction intervals. We used an inverse-variance-weighted random-effects model with the DerSimonian–Laird estimator to estimate the between-study variance and normal approximation intervals based on summary measures to calculate CIs for individual study results. To stabilize variances, we transformed the data with the Freeman–Tukey double arcsine transformation. The random-effects model (DerSimonian–Laird method) rather than fixed-effect model was selected at the time of the study protocol because of the expected heterogeneity of the included studies.

We visually evaluated heterogeneity using forest plots. We also calculated and analyzed I2 statistics (I2 values of 0–40% = might not be important; 30–60% = moderate heterogeneity; 50–90% = substantial heterogeneity; 75–100% = considerable heterogeneity). To calculate the I2 statistics, we applied the Cochrane χ2 test (Q-test). We also calculated τ2 statistics for the prediction intervals.22

To assess the reporting bias, we searched clinical trial registry systems (ICTRP and ClinicalTrials.gov) to identify completed but unpublished studies. We assessed potential publication bias by visual inspection of a funnel plot and also performed the Egger test. Funnel plots are used to assess the presence of potential publication bias in meta-analyses. However, these are likely to be inaccurate in the meta-analyses of prevalence studies with low proportions of outcomes.23 Therefore, we did not create funnel plots for meta-analyses that included <20 studies24,25 or studies with similar sample sizes.

Subgroup analyses

We considered the following subgroups for the prevalence of positive results in the first challenge: children who had undergone the drug challenge with or without a preliminary test, including skin prick test, intradermal test, or specific IgE quantification; children with a history of an immediate or nonimmediate reaction; children with a suspected penicillin class allergy or cephalosporin allergy; children who underwent a direct drug challenge test with a history of nonimmediate reaction. We also considered the following ad hoc subgroups for the prevalence of positive results in the second challenge after a first negative result: children who were evaluated with a second drug challenge test or with questionnaires.

Results

Literature search results

Figure 1 displays a PRISMA 2020 flow chart detailing the process of study selection, including exclusion criteria. We screened 4112 records, after the removal of duplicates, and assessed 347 full texts. Eighty-four studies reporting a prevalence estimate were synthesized in the meta-analysis. In the clinical trial registry system, 49 records were screened, and 3 records were included. We found one unpublished study.

Fig. 1: Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) diagram for literature search and study selection.
figure 1

A list of the excluded studies with reasons is shown in Supplemental Table S4.

Study characteristics

Table 1 and Supplemental Table S2 show the characteristics of the 85 studies. Of them, 55 studies were published in or after 2015.

Table 1 Study characteristics.

Results of synthesis

Prevalence of positive results in the first challenge

A total of 12,846 participants were included from 78 studies. The methodological quality of the 78 studies was moderate (Supplemental Table S3A). The reason for the moderate quality was inappropriate sampling processes. The prevalence from the included studies was 0.052 (0.028–0.077). The pooled prevalence was 0.049 (95% CI, 0.041–0.057; I2 = 71%; Fig. 2). Visual inspection of funnel plot asymmetry did not indicate the presence of publication bias (Supplemental Fig. S1A), and no significant publication bias was found by the Egger test (p = 0.85). The results were consistent across all the pre-defined subgroups (Supplemental Figs. S2–S4). The pooled prevalence among children with drug challenge test with or without preliminary test was 0.054 (95% CI, 0.040–0.070; I2 = 77%) from 28 studies and 0.043 (95% CI, 0.032–0.054; I2 = 70%) from 44 studies, respectively (Supplemental Fig. S2). The pooled prevalence among children with a history of immediate or nonimmediate reactions was 0.043 (95% CI, 0.014–0.081; I2 = 51%) from 12 studies and 0.062 (95% CI, 0.048–0.079; I2 = 73%) from 23 studies, respectively (Supplemental Fig. S3). Of the 12 studies on children with a history of immediate reactions, 8 had a preliminary test, 2 did not, and 2 were unclear. In contrast, on the 23 studies of children with a history of nonimmediate reactions, 10 had preliminary testing, 11 did not, and 2 were unclear. Nine studies assessed children with a history of nonimmediate reactions using a direct drug challenge. Of these, eight studies had been published in or after 2015. The results were consistent, with a pooled prevalence of 0.073 (95% CI, 0.047–0.104; I2 = 83%; Fig. 3).

Fig. 2: Pooled prevalence of positive results for the first challenge.
figure 2

An inverse-variance-weighted random-effect meta-analysis was used. The size of the point estimate (square data markers) is proportional to the weight of the study. Horizontal lines indicate 95% confidence intervals (CIs) of the estimate. The diamond data marker represents the overall estimator based on included studies.

Fig. 3: Pooled prevalence of positive results for the first challenge (direct challenge for children with a history of nonimmediate reaction).
figure 3

An inverse-variance-weighted random-effect meta-analysis was used. The size of the point estimate (square data markers) is proportional to the weight of the study. Horizontal lines indicate 95% confidence intervals (CIs) of the estimate. The diamond data marker represents the overall estimator based on included studies.

Prevalence of serious adverse events

A total of 11,083 participants were included from 62 studies. The methodological quality of the 62 studies was moderate (Supplemental Table S3B). The reason for the moderate quality was inappropriate sampling processes. Of the 11,083 participants, only 13 experienced serious adverse events. All cases developed anaphylaxis and/or required adrenaline administration. The prevalence from the included studies was 0.000 (0.000–0.000). The pooled prevalence was 0.00 (95% CI, 0.00–0.00; I2 = 0.0%; Fig. 4). Of the nine studies that assessed children with a history of nonimmediate reactions using a direct drug challenge, seven reported prior serious adverse reactions. However, of these, including 2598 participants, none had serious adverse reactions to the drug challenge. Visual inspection of the funnel plot asymmetry indicated the presence of publication bias (Supplemental Fig. S1B), and this was confirmed by significant results on the Egger test (p < 0.001).

Fig. 4: Pooled prevalence of serious adverse events.
figure 4

An inverse-variance-weighted random-effect meta-analysis was used. The size of the point estimate (square data markers) is proportional to the weight of the study. Horizontal lines indicate 95% confidence intervals (CIs) of the estimate. The diamond data marker represents the overall estimator based on included studies.

Prevalence of positive results in the second challenge after a negative result in the first challenge

A total of 1361 participants were included from 18 studies. The methodological quality of the 18 studies was low (Supplemental Table S3C). Three studies assessed the results of drug challenge tests, and the remaining 15 studies confirmed the outcomes of re-exposure to the suspected drug with questionnaires. The prevalence from the included studies was 0.033 (0.019–0.062). Overall, the pooled prevalence was 0.028 (95% CI, 0.016–0.043; I2 = 38%; Fig. 5). The pooled prevalence from the three studies with the drug challenge test was 0.017 (95% CI, 0.005–0.035; I2 = 0.0%). The pooled prevalence from the 15 studies that used questionnaires was 0.033 (95% CI, 0.018–0.052; I2 = 41%; Supplemental Fig. S5).

Fig. 5: Pooled prevalence of positive results for the second challenge after a first negative result.
figure 5

An inverse-variance-weighted random-effect meta-analysis was used. The size of the point estimate (square data markers) is proportional to the weight of the study. Horizontal lines indicate 95% confidence intervals (CIs) of the estimate. The diamond data marker represents the overall estimator based on included studies.

Discussion

In this systematic review and meta-analysis, approximately 95% of children with a suspected beta-lactam allergy were not truly allergic to the drug based on the drug challenge test result. Few of the children who underwent the drug challenge test had serious adverse events. The false-negative rate for the drug challenge test was also very low.

The prevalence of beta-lactam allergy among children appears to be overestimated. The tolerance rate found was comparable to that found by a recent meta-analysis of adult populations.26 Over-labeling of beta-lactam allergy can be attributed to the infection process caused by interaction between the virus and drug-induced immune activation in susceptible individuals.27 A study screening for viral infections by polymerase chain reaction suggested that the viral infection was suspected to be the trigger for the initial rash in most children with negative drug challenge results.7 Other possible causes include misdiagnosis of a side effect and the disappearance of a long-term IgE-mediated allergy caused by refraining from the culprit drug use.28 Therefore, if there are no contraindications, a drug challenge should be routinely performed for children with suspected beta-lactam allergies. This is important for future treatments because inpatients suspected with a beta-lactam allergy are likely to receive broad-spectrum antibiotics for the treatment of acute illness to avoid recommended beta-lactams.4

The prevalence of positive direct drug challenge results for children with a history of mild nonimmediate reactions was about 7%, which is roughly the same as the overall percentage. Additionally, none had serious adverse reactions. Because beta-lactam allergy in children is more often a mild nonimmediate reaction,29 establishing a testing protocol for re-diagnosing children with a history of mild nonimmediate reactions is an important challenge. Furthermore, skin tests are commonly conducted before a drug challenge.30 However, since 2015, there has been an increasing number of reports suggesting that direct drug challenge tests can be safely performed without preliminary skin tests in children with nonimmediate and/or mild symptoms of beta-lactam allergy.10,11,12,31 Additionally, there remain concerns regarding the low diagnostic accuracy of skin tests performed for children with suspected nonimmediate allergic reactions.32,33 Recently published clinical guidelines1,14 and position papers15,16 have recommended direct challenge testing for children with a history of mild nonimmediate reactions, and the results of the present study support this strategy.

In our subgroup analyses, caution should be exercised in comparing the results of studies on children with a history of immediate reactions with those on children with a history of nonimmediate reactions. In the present study, both had comparable percentages of positive results, but a higher percentage of studies for children with a history of immediate reactions was conducted for those with a negative preliminary test. There were two studies in which direct challenge tests were performed on children with a history of immediate reactions.8,34 In both studies, those with a history of severe immediate reactions were excluded, and the positive challenge test rates were reported to be 0% (0/38)8 and 7.2% (5/69),34 with no children receiving adrenaline or steroids. For children with a history of immediate reactions, a drug challenge test should be performed cautiously, with a thorough evaluation of the severity of past allergic reactions and additional preliminary testing if necessary.

The present study indicated that the drug challenge test seems to be a very safe procedure in children, with serious adverse events being rare. The incidence of anaphylaxis, fatal or otherwise, in patients receiving beta-lactams for the treatment of infections is also rare, with estimated rates of 0.015–0.004% and 0.0015–0.002%, respectively.1,35 Clinicians should not refrain from the drug challenge test for fear of serious adverse events. Of necessity, challenge testing should be avoided in children with a history of delayed reaction with organ dysfunction or delayed severe cutaneous reaction including skin desquamation, purpura, mucosal lesions, drug reactions with eosinophilia and systemic symptoms, Stevens–Johnson syndrome/toxic epidermal necrolysis, or acute generalized exanthematous pustulosis.14

Based on the low false-negative rate, repeat testing for confirmation may not be required following a negative drug challenge result.36 Because many of the original studies included in this meta-analysis used questionnaires to determine whether allergy-like symptoms occurred in children who were re-administered a culprit drug, the true false-negative rate may be even lower than the point estimate obtained in this meta-analysis. A previous study reported that many children with a negative result from a first drug challenge test who developed symptoms on re-administration tested negative on a second drug challenge.37 If the drug challenge test is negative, IgE-mediated allergy can be largely ruled out. However, the children will experience other benign skin rashes with the same incidence as that of the general population.38 Even if mild skin symptoms appear when re-administering a culprit drug after confirming a negative result in the drug challenge test, it does not necessarily denote an allergic reaction, and the drug may not need to be uniformly restricted.

To our knowledge, this is the most recent and comprehensive systematic review and meta-analysis to examine the true-positive rate in children with suspected beta-lactam allergy based on the results of drug challenge tests. We included 20 times more original studies than the previous meta-analysis.3 The large number of original studies allowed investigation of subgroup differences and reporting biases, increasing the robustness of our results.

There were some limitations to this study. First, the background of participants who underwent the drug challenge varied between studies. Although our subgroup analyses showed that true-positive rates for almost all subgroups were comparable, the certainty that children truly have the allergy may also vary. Second, the drug challenge testing protocols followed in studies differed in dose step, dose interval, and test length. The optimal drug challenge protocol for children with suspected beta-lactam allergy is still controversial.39,40 Third, the research settings differed between studies: they included an emergency department,41 outpatient pediatric allergy clinics,42,43 and a drug allergy center.44 These differences in challenge testing protocols and research settings might have affected the results. Fourth, there were no studies examining false-positives resulting from the drug challenge test. However, if there is a history of suspected beta-lactam allergy and the drug challenge test is positive, false-positives are less likely to occur and will have less impact on the results.

In conclusion, the prevalence of children with suspected beta-lactam allergy who are truly positive on the drug challenge test was very low. Serious adverse events in response to the drug challenge were rare. The false-negative rate for drug challenge tests was also very low. Physicians should consider performing a drug challenge test after or without skin tests according to the severity of past allergic reactions, if not contraindicated, such as a suspected history of severe allergic reactions, to prevent the misdiagnosis of beta-lactam allergy.