Introduction

Kawasaki disease (KD) is a systemic vasculitis of childhood. According to a recent epidemiological study in Shanghai, China, the average annual incidence rate of KD was 50.5 per 100,000 children.1 Coronary artery abnormalities (CAAs) secondary to KD represent the major contributors to morbidity and mortality in both the acute stage and long term. Although intravenous immunoglobulin (IVIG) plus aspirin is the mainstay of KD treatment, approximately 5% of patients still developed CAAs. In a study of long-term outcomes during an average follow-up period of 14.6 years, Holve et al.2 observed that, of the 546 KD patients, 12.8% had CAAs in the acute stage, and moreover 5% developed persistent CAAs at least 1 year after the KD illness.

In the past decade, several epidemiological surveys and meta-analyses have found some risk factors possibly related to CAAs. A recent retrospective study encompassing 1016 KD patients from China identified that male, albumin, erythrocyte sedimentation rate (ESR), Mycoplasma pneumoniae infection, IVIG started after the tenth day of illness, and IVIG non-responders were predictive of CAAs.3 Based on the data from 92 studies encompassing 8330 KD patients, a meta-analysis by Chen et al.4 indicated that platelet count (PLT), neutrophils count, platelet hematocrit, platelet distribution width, mean platelet volume, ESR, cardiac troponin I (cTnI), endothelin-1, albumin, and hemoglobin may be the potential candidates for CAA predictors. To date, about 10 scoring systems have been formulated to predict CAA risks. However, the criteria in these scoring systems are inconsistent with the findings of previous studies completely and have significant variations in predictive efficacy among different populations. Several risk factors, such as age, C-reactive protein (CRP), and fever duration are commonly included in multiple scoring systems, whereas their associations with CAAs are seldom confirmed by previous studies. Recently, Kobayashi score was applied to predict CAA risks in non-Japanese populations; the sensitivities were 50, 60, and 15% in Germany,5 British,6 and Polish7 respectively, far lower than that in Japanese (90.6%).8 On the other hand, Fabi et al.9 used Kobayashi, Egami, and Formosa score to evaluate CAA risks in Italian simultaneously and found that Formosa score had the highest predictive efficacy with a sensitivity of 72.0% and a specificity of 45.5%, followed by Egami and Kobayashi scores.

In China, CAAs tend to have an increased occurrence relative to the other Asian countries. Several previous epidemiological surveys covering 2 decades indicated that the CAA morbidities were 15.9% in Shanghai,1 20.6% in Beijing,10 and 24.1% in Hangzhou.11 Given this background, we hypothesize that Chinese KD patients may possess several unique risk factors of CAAs, and the predictive efficacy of multiple scoring systems in Chinese patients are still to be further studied.

Methods

Subjects

The present retrospective study included a total of 203 KD children between July 2015 and March 2019 in our center (Fig. 1). There was no fatal case during the observational period. According to the 2017 American Heart Association (AHA) guidelines,12 the diagnostic criteria for complete KD comprise the presence of ≥5 days of fever and ≥4 of the following 5 major features: (1) bilateral conjunctival injection without exudates; (2) changes in the oral mucosa, such as erythema and cracking lips, erythema of the pharynx, strawberry tongue; (3) changes in extremities, such as redness and swelling in the acute phase, periungual desquamation in subacute phase; (4) polymorphous exanthema; and (5) cervical lymphadenopathy (≥1.5 cm in diameter), usually unilateral. Patients with fever for ≥5 days and ≥2 of the major features with compatible laboratory or echocardiographic findings can be diagnosed as incomplete KD, if no other disease processes could explain the illness. The first day of the illness is defined as the first day of fever. All patients received the standard therapy for KD, including a single infusion of IVIG (2 g/kg) and aspirin (30–50 mg/kg/day). IVIG non-responsiveness is defined as persistent or recrudescent fever ≥36 h after completion of the initial IVIG infusion. In this circumstance, a second dose of IVIG would be given alone or combined with corticosteroids simultaneously. Delayed diagnosis of KD is defined as days of initial IVIG treatment >10 days. Echocardiography was performed in the acute episode of KD. Two echocardiographic evaluative criteria of CAAs are adopted in the present study. One proposed by the Japanese Ministry of Health (JMH) classifies coronary arteries as abnormality if the internal lumen diameter is >3.0 mm in children <5 years of age or >4.0 mm in children ≥5 years of age, if the internal diameter of a segment measures ≥1.5 times that of an adjacent segment, or if the coronary artery lumen is clearly irregular. The other defines CAAs on the basis of a coronary artery Z score (standard deviation units from the mean, normalized for body surface area) ≥2.5. In the present study, CAAs could be identified if any patient fulfilled either of the above two criteria.

Fig. 1: Patient selection.
figure 1

CAA coronary artery abnormality, KD Kawasaki disease.

Data collection

The investigation conforms to the principles outlined in the Declaration of Helsinki. Approval for this research was acquired from our medical ethic committee (No. LLSC20150009) and the informed consent was obtained from the parents of each patient. Demographic, clinical characteristics, electrocardiography, and echocardiography were collected. Laboratory testing within 24 h pre-IVIG therapy included white blood cell counts with differential, hemoglobin, hematocrit (Hct), PLT, alanine aminotransferase, aspartate aminotransferase, total bilirubin, direct bilirubin, indirect bilirubin, γ-glutamyltransferase, lactate dehydrogenase, cTnI, creatine kinase MB (CK-MB), albumin, sodium, CRP, ESR, procalcitonin (PCT), interleukin-6, and tumor necrosis factor-α.

Scoring systems for CAAs

Eight scoring systems were adopted in the present study. The Harada score (Japan),13 Egami score (Japan),14 Kobayashi score (Japan),8 Sato score (Japan),15 San Diego score (the United States),16 Formosa score (China),17 Tang score (China),18 and Hua score (China)11 are shown in Table 1. Besides predicting IVIG-nonresponsive KD, these scoring systems also identified CAA risks in their KD patients simultaneously. Their sensitivity and specificity for CAAs are also depicted in Table 1.

Table 1 Eight scoring systems to predict CAAs in the present study.

Statistical analysis

Continuous data were expressed as mean ± SD or median (interquartile range). Categorical variables were presented as count with percentage. Data were analyzed using two independent samples t test or Mann–Whitney U test for continuous variables and Chi-squared test or Fisher’s exact test for categorical variables. A p < 0.05 was considered significant. Risk factors for CAAs selected by univariate analysis were considered for multivariable analysis. Results are expressed as an odds ratio with a 95% confidence interval (CI). Goodness of fit of the regression model was tested with Hosmer–Lemeshow test, with p > 0.05 considered to indicate lack of deviation between the predictive model and the sample data. To facilitate clinical application, continuous variables were turned into dichotomous variables. Cut-off values were chosen based on the receiver operating characteristic (ROC) curve of each variable, which were dependent on the maximum value of Youden index according to the sensitivity and specificity and were justified medically. The score point of each predictor was assigned based on the value of logistic coefficient. ROC curve and the area under the curve (AUC) were used to assess the predictive efficacy, sensibility, and specificity of these scoring systems. Statistical analyses were performed using the statistical package for social studies SPSS ver 16.0.

Results

Patient characteristics

The demographic features of KD patients are shown in Table 2. In the present study, 203 patients with KD were included with a mean age of 35.15 ± 28.95 months and a range from 3 months to 13 years. Of the 203 KD patients, 167 (82.27%) were younger than 5 years, 62 (30.54%) were younger than 1 year, and 16 (7.88%) were younger than 6 months. Males outnumbered females by 1.28:1.

Table 2 Demographic, clinical and laboratory characteristics of KD patients with and without CAAs.

Clinical classification

All 5 major features for KD were met in 59 cases (29.06%), 4 features in 80 (39.40%), 3 features in 38 (18.72%), and 2 features in 20 (9.85%). Therefore, 139 patients (68.47%) had complete KD, including 83 males and 56 females with the mean age of 36.46 ± 29.25 months; 64 patients (31.53%) had incomplete KD, including 31 males and 33 females with the mean age of 32.31 ± 28.33 months. The mean age and male/female ratio were almost identical in the 2 groups (p > 0.05).

IVIG therapy

Among the 203 KD patients, 11 (7.75%) of them were identified as IVIG nonresponders and had persistent fever for about 71.00 ± 12.27 h after IVIG therapy, including 4 males and 7 females with the mean age of 33.73 ± 27.46 months. In contrast, 192 patients were identified as IVIG responders, including 110 males and 82 females with the mean age of 35.23 ± 29.10 months. The mean age and male/female ratio were no significant differences between IVIG responders and IVIG nonresponders (p > 0.05).

Cardiovascular findings

Based on the 2017 AHA guidelines, 17 (8.37%) KD patients were identified as CAAs (left coronary artery: 3.16 ± 0.57 mm; right coronary artery: 3.34 ± 1.35 mm), including 11 males and 6 females with the mean age of 38.47 ± 33.78 months. In contrast, 186 (91.63%) patients had normal coronary artery (left coronary artery: 2.07 ± 0.37 mm, right coronary artery: 1.99 ± 0.32 mm), including 103 males and 83 females with the mean age of 34.85 ± 28.56 months.

By echocardiography, the other abnormalities included tricuspid regurgitation (112 cases, 55.17%), mitral regurgitation (77 cases, 37.93%), aortic regurgitation (1 cases, 0.49%), and pericardial effusion (14 cases, 6.90%). By electrocardiography, abnormal changes were observed in 11 patients (5.41%) who dominantly suffered from ST-T changes (7 patients), prolonged PR (2 patients), ventricular premature beat (1 patient), and pre-excitation syndrome (1 patient). In addition, 33 patients had acute myocarditis, featured by elevated CK-MB (37.38 ± 13.03 μ/L from 32 cases) or cTnI (0.53 ng/mL from 1 case).

The establishment of logistic regression model for CAA predictors

The clinical and laboratory characteristics of KD patients are shown in Table 2. For clinical characteristics, significant differences were found in days of illness at initial treatment and redness and swelling of extremities between KD patients with and without CAAs; for laboratory data, percentage of monocytes (MO%), CRP, and PCT was significantly elevated, whereas Hct was lower between KD patients with and without CAAs (p < 0.05). Therefore, the above six variables were subsequently included in the multivariable logistic regression analysis. Days of illness at initial treatment, redness and swelling of extremities, Hct, MO%, and PCT were independent predictors of CAAs (p < 0.05). The Hosmer–Lemeshow statistic was not significant (p > 0.05), and the AUC value of this regression model was 0.770 (95% CI 0.590–0.950).

Assessment of the newly established scoring system for CAAs

Further analyses for the logistic regression model are listed in Table 3. The odds ratio (95% CI) values for days of illness at initial treatment ≥7, redness and swelling of extremities, Hct ≤33%, MO% ≥8.89%, and PCT ≥0.5 ng/mL were 1.588 (1.123–2.245), 1.733 (1.446–6.736), 1.165 (1.034–1.360), 1.347 (1.004–1.808) and 1.208 (1.056–1.367), respectively. Based on the logistic coefficient of significant predictors, two point was assigned for days of illness at initial treatment ≥7 and redness and swelling of extremities, and one point was assigned for Hct ≤33%, MO% ≥8.89%, and PCT ≥0.5 ng/mL. After identifying the occurrence of CAAs and overall numbers of patients for individual scores, a scoring system was established, dependent on the sum of points in each patient. For simplicity, 2 risk strata were identified: low risk, with scores of 0–4, encompassing 82.27% of patients; and high risk, with scores of ≥5, 17.73% of patients. The AUC value of the newly established scoring system was 0.685 (95% CI 0.526–0.844), with a sensitivity of 41.18% and a specificity of 84.41%.

Table 3 Multivariable predictors of CAAs using the newly established scoring system.

Comparisons of the newly established scoring system with the others

Comparisons of the newly established scoring system with the others are shown in Fig. 2. The newly established scoring system contained five independent predictors: days of illness at initial treatment, redness and swelling of extremities, Hct, MO%, and PCT. Among them, days of illness at initial treatment ≥7 and PCT were unique for the newly established scoring system; in addition, MO% was included in Hua score11; redness and swelling of extremities were included in Tang score18; and Hct was included in Harada score,13 simultaneously.

Fig. 2: Comparisons of the newly established scoring system with the others.
figure 2

Alb albumin, ALT alanine aminotransferase, AST aspartate aminotransferase, CAA coronary artery abnormality, CRP C-reactive protein, GGT γ-glutamyltransferase, Hct hematocrit, IL-6 interleukin-6, IVIG intravenous immunoglobulin, MO%, percentage of monocytes, N% percentage of neutrophils, PLT platelet count, WBC white blood cell count.

CAA risks of our patients were evaluated by the newly established scoring system and eight other scoring systems (Table 4 and Fig. 3). The newly established scoring system had a sensitivity of 41.18% and a specificity of 84.41%; similarly, based on our patients, the sensitivity of 8 published scoring systems ranged from 5.88 to 52.94% and the specificity ranged from 41.94 to 94.62% for predicting CAAs. The newly established scoring system had a positive predictive value of 19.44% and a negative predictive value of 94.01%, similar to 8 other scoring systems (positive predictive values: 4.76–18.75%, negative predictive values: 89.66–93.57%). The AUC value of newly established scoring system was 0.685, higher than Harada (0.452),13 Egami (0.479),14 Kobayashi (0.526),8 Sato (0.476),15 San Diego (0.522),16 Formosa (0.473),17 and Tang score (0.458),18 whereas only lower than Hua score (0.722).11

Table 4 Comparisons of the newly established scoring system with the others.
Fig. 3: ROC curves of multiple scoring systems for predicting CAAs.
figure 3

Thick solid line respresents ROC curve of the newly established scoring system. Thin solid lines respresent ROC curves of  other published scoring systems.

Discussion

Cardiovascular manifestations are prominent during the acute KD episode and a major contributor to the long-term prognosis. CAAs are the most important cardiovascular complications. In the past two decades, several previous epidemiological surveys from mainland China indicated that the overall trend in CAA morbidity appeared to be on the rise from 15.9 to 24.1%.1,10,11 Taking noncoronary cardiac abnormalities (NCAs) into account, data from multiple centers revealed that mitral regurgitation was the most common (27%), followed by pericardial effusion (15%), cardiac arrhythmia (11%), and myocarditis (9.9%).19,20 In the present study, 8.37% of KD patients developed CAAs and 65.02% had NCAs in the acute stage; more specifically, valvular regurgitation was the most prevalent type of NCAs, followed by myocarditis, pericardial effusion, and cardiac arrhythmia. It should be pointed out that despite higher incidence of NCAs and KD in children aged >5 years, the present study was totally irrelevant to multisystem inflammatory syndrome in children (MIS-C) triggered by coronavirus disease 2019 (COVID-19). First, the present study between July 2015 and March 2019 was before the date of the first official report of COVID-19 and none of patients were infected by COVID-19. Second, according to the latest American College of Rheumatology Clinical Guidance for MIS-C in children,21 the incidence of MIS-C is lower in patients of East Asian descent than of African, Afro-Caribbean, and Hispanic descent. Third, patients with MIS-C have more prominent gastrointestinal and neurologic symptoms and present more frequently in a state of shock. However, in our patients, gastrointestinal and neurologic system involvements were rare. Fourth, in the present study, we analyzed that the higher incidence of myocarditis may be attributed to larger variations in classification criteria, genetic background, and pediatricians’ awareness.22

In the present study, a new scoring system for CAAs was established based on the data of Chinese population; it contained 5 variables and cut-off values of each variable are as follows: days of illness at initial treatment ≥7, redness and swelling of extremities, Hct ≤33%, MO% ≥8.89%, and PCT ≥0.5 ng/mL. Compared with the other published scoring systems, days of illness at initial treatment ≥7 and PCT were unique predictors of CAAs. PCT is an acute-phase reactant and displays a quick response to bacterial infection. In a recent study from Chongqing, China, Si et al.23 noted that PCT underwent a 12-fold increase in KD patients compared with the healthy controls. More persuasively, a multivariable logistic regression model established by Yoshikawa et al.24 revealed that PCT had a sensitivity of 67% and a specificity of 55% at a cut-off value of >0.5 ng/mL to predict CAAs and possessed the highest odds ratio compared with the other classical inflammatory mediators. However, in a retrospective study, Dominguez et al.25 found that elevated PCT was not related to CAA development, providing controversial evidence for the role of PCT in predicting CAAs. Therefore, further studies of larger sample size and long-term follow-up will be necessary to consolidate our findings.

As we all know, days of illness at initial treatment ≤10 can dramatically decrease CAA risks in KD patients.12 In the present study, more rigorously, days of illness at initial treatment <7 were identified as an optimal cut-off value for IVIG administration. In a retrospective study from Wenzhou, China, Qiu et al.26 recruited 930 KD patients with a mean day at initial treatment of 6.99 and found that CAA risks increased 18% per 1 day delay of IVIG administration at 1 month after illness onset. In 2018, a national epidemiological survey from Japan also highly recommended initial IVIG administration within 7 days of illness for preventing CAAs.27 However, in Egami,14 Kobayashi,8 and San Diego scores,16 days of illness at initial treatment ≤4 were regarded as a strong predictor of CAAs. Because days of illness at initial treatment in all patients ranged from 5 to 11 days, it was unlikely to identify whether IVIG treatment before the fourth day of illness was an independent predictor in the present study. Therefore, based on the current evidence from our center and the others, we hypothesize that the time option of IVIG administration should be in parallel with peak time of systemic inflammation for the purpose of reducing CAA risks; in contrast, either earlier or later IVIG administration may increase CAA risks.

The sign of redness and swelling in extremities was the only major clinical feature included in our scoring system and also included in Tang score.18 According to the 2017 AHA guidelines,12 in the presence of ≥4 major clinical features, particularly when redness and swelling of the hands and feet are present, the diagnosis of KD may be made with only 4 days of fever. Both redness/swelling and periungual desquamation are the common extremity changes in KD; the former often occurs within 2 weeks of illness, while the latter usually begins thereafter. A multivariable analysis by Wang et al.28 indicated that KD patients without periungual desquamation were more likely to develop CAAs. On the other hand, Michie et al.29 followed up 259 KD patients for >4 years and found that recurrent desquamation was significantly less frequent in CAA patients. In spite of limited information, pediatricians should pay more attention to the potential contribution of extremity changes to CAA risks in KD patients.

Although the detailed pathogenesis of KD is still unclear, monocyte/macrophage activation, cytokine cascade, and vascular endothelial cell injury have been proven to be the three key steps of CAAs. Infiltration by monocytes/macrophages is ubiquitous in vascular pathology. Consistently, MO% was screened as a candidate predictor of CAAs in both the present and Hua score.11 Furukawa et al.30 analyzed the medical records of 27 KD patients and detected that MO% was subjected to a 0.81-fold increase in cases with CAAs compared with their counterparts. In 2018, a multivariable logistic model by Hua et al.11 showed that KD patients with MO% ≥5.9% were 1.36 times likely to develop CAAs as compared with their counterparts.

Anemia occurs in 87.28% of KD patients and resolves with resolution of systemic inflammation.31 Bone marrow infiltration by infective agents, alteration of iron metabolism, and reduction in erythropoiesis and erythrocyte survival are speculated to play a combined role in the pathogenic connection between KD and anemia. In the present study, a lower Hct was identified as a powerful predictor of CAAs and also included in Harada score.13 In an American retrospective study, Ghelani et al.32 reported that Hct significantly decreased in KD patients with CAAs (29.5 ± 4.7%) compared to those with normal coronary artery (32.0 ± 3.4%). In addition, the 15th Japanese epidemiological survey on KD indicated that patients with Hct ≤32.5% had a 0.45-fold increase in CAA risks as compared with their counterparts.33 As we all know, KD predominantly affects children <5 years of age. Although several studies have shown an underlying correlation between anemia and CAAs, interpreting hematology analyses in children is challenging and hematology reference intervals for separate age groups exhibits significant variations due to the extensive changes in hematopoiesis that accompany physiological development. A further study after adjustment for age is warranted to more accurately elucidate the predictive role of anemia in CAAs.

To date, at least three different scoring systems for predicting CAAs have been published in Chinese population.11,17,18 Apart from establishing a new scoring system, the present study undertook a systemic comparison of these Chinese scoring systems for the first time and revealed that the AUC value of newly established scoring system was 0.685, very similar to Hua score.11 Because of different identification criteria and analytical methodologies, there is still some heterogeneity among these Chinese scoring systems. First, the JMH criteria was used in Hua11 and Tang18 scores to identify CAAs, while Z score was used in Formosa score.17 Second, the occurrence of incomplete KD was 58.8% in the study of Hua et al.,11 significantly higher than the findings of Tang et al. (29.7%)18 and ours (31.5%). We speculate that the other febrile diseases known to mimic KD may not be excluded completely in the study of Hua et al.11 Third, variables with p < 0.1 in the univariate analysis were entered into the multivariable model in Formosa score,17 while variables with p < 0.05 were selected in the present study.

Conclusions

In the present study, 8.37% of KD patients were identified as having CAAs. Using a multivariable logistic regression analysis, a new scoring system for CAAs was established, including five independent predictors (days of illness at initial treatment, redness and swelling of extremities, Hct, MO%, and PCT). The AUC value of the newly established scoring system was 0.685, with a sensitivity of 41.18% and a specificity of 84.41%. Compared with 8 other published scoring systems, days of illness at initial treatment ≥7 and PCT were unique predictors of CAAs in our study. The AUC value of newly established scoring system was higher than Harada, Egami, Kobayashi, Sato, San Diego, Formosa, and Tang scores, whereas only lower than Hua score. Moreover, the present study also undertook a systemic comparison of three Chinese scoring systems for the first time and revealed that the AUC value of newly established scoring system was very similar to Hua score. Taking into account different identification criteria and analytical methodologies, there is still some heterogeneity among these Chinese scoring systems. Further multicenter collaborative studies should be advocated to consolidate our preliminary findings prior to widespread application in clinical practice.