Introduction

Kawasaki disease (KD) is an acute systemic vasculitis that primarily affects small and medium arteries of children under the age of 5 years. Children with KD who develop coronary artery aneurysms (CAA) are more likely to contract acquired heart disease. With the progress of diagnosis and treatment of KD, the risk of coronary artery damage has been reduced, nonetheless, about 8% of patients still have cardiovascular complications.1

Coronary artery lesions can be divided into mild dilation and coronary aneurysms. According to the diameter of dilated coronary arteries, coronary aneurysms can be classified into small, medium, and large (or giant) aneurysms. Studies have shown that almost all small CAA (z scores of 2.5–5) can regress in the acute phase, while most medium and giant CAA without prompt intervention may persist and lead to myocardial infarction, blood vessel rupture, and even sudden death.2,3,4 Among them, giant CAA had the worst prognosis without significant change in maximum diameter after 1 year.5 Due to the unknown etiology, CAA in KD is hard to predict. Therefore, to decrease morbidity and mortality in children with KD, it is necessary to identify the risk factors that contribute to the susceptibility to mild to giant-size CAA development. There are a few studies identifying risk factors for the development of medium and giant coronary artery CAA but without risk prediction models. As a result, the present study analyzed the clinical data of children with KD to identify the risk factors for medium and giant CAA and to develop valuable risk prediction models.

Methods

Study design and population

A retrospective study was conducted on patients diagnosed with KD between January 2015 and December 2020 at Shenzhen Children’s Hospital. These data (development dataset) were used to develop a predictive model (n = 1331, medium or giant CAA = 63, giant CAA = 27). From January 2021 to December 2021, an additional 193 patients were treated and studied prospectively to test the accuracy of prediction (n = 193, medium or giant CAA = 17, giant CAA = 7). These data represented the validation dataset.

KD can be diagnosed as complete or incomplete based on the KD Research Committee’s 6th revised edition in Japan in 2020.6 Patients were excluded if they had received intravenous immunoglobulin (IVIG) or glucocorticoid treatment before admission, or had infectious diseases were excluded.

All patients were initially treated with IVIG 2 g/kg as a single infusion, given over 10–12 h, following American Heart Association Guidelines. Patients with persistent or recrudescent fever (≥38.0 °C) for at least 36 h but no longer than 7 days after receiving the first IVIG infusion were considered IVIG resistance.7 There were 1524 children enrolled in this study, including 956 boys and 568 girls, ranging in age from 2 to 116 months. All patients with KD who did not develop medium or giant CAA were compared with those who did. Shenzhen Children Hospital’s Ethics Committee approved this study.

Data collection

The medical records were reviewed for demographics, clinical, laboratory, and treatment data. Demographics data includes sex, age (in months), clinical data includes diagnosis of incomplete KD, duration of fever before IVIG, total duration of fever, treatment data including whether receiving steroid therapy in initial therapy, and laboratory data before IVIG treatment include high-sensitivity C-reactive protein (CRP), neutrophils ratio, platelet count, white blood cell, hemoglobin, procalcitonin, erythrocyte sedimentation rate, alanine transaminase, aspartate aminotransferase, total bilirubin, albumin, serum sodium, and ferroprotein.

By echocardiography during hospitalization, we measured the maximum coronary arterial internal diameter of the right coronary artery, left main coronary artery, left anterior descending artery, and left circumflex coronary artery. With the help of a model based on data from a large cohort of Chinese healthy children, the maximum internal diameter of each artery in the coronary arteries was converted to a z-score corrected by the body surface area.8 In addition, we followed up the regression rate of medium and giant CAA 1 year after diagnosis. Medium CAA was defined as a z-score of ≥5–<10, or an absolute dimension of ≥4–<8 mm, and giant CAA as a z-score of ≥10, or an absolute dimension of ≥8 mm.1 CAA was considered to have regressed when the enlarged coronary arteries had normal z score or dilation only (z-score of <2.5) as visualized by echocardiogram.

Statistics analysis

Statistical analyses were conducted using Stata MP 17.0. Categorical variables were described by frequency and constituent ratio, and chi-square tests were used to compare differences between groups. A median with a quartile range is reported for continuous variables, and the difference between the two groups is compared using an independent sample T-test. To identify independent risk factors and develop risk prediction models, logistic regression analysis was used with a forward stepwise method.

The regression model was tested for fitness with the Hosmer–Lemeshow test. To evaluate the model’s capacity, the area under the receiver operating characteristic (ROC) curve (AUC) was calculated, and the cutoff values were determined by the Youden index derived from the calculated sensitivity and specificity. Ten folds cross-validation test was adopted in this study to validate the model’s prediction accuracy. The data from January 2021 to December 2021 were used for external verification. Additionally, we compared AUCs between logistic regression and scoring models. P < 0.05 was considered significantly different.

We assigned points to each risk predictor to develop an easy-to-use scoring model. Scores for each predictor in the scoring system were calculated based on the corresponding model coefficients and were re-scaled between 0 to 10 points using the following steps. First, the highest original score for each predictor was determined. A score for each predictor was calculated by multiplying its coefficients with its maximum value. We assigned a maximum score of 10 points to the prediction with the highest possible score, and we rescaled all other predictions using this same scaling factor. Based on the highest and lowest re-scaled scores for each predictor, the increase in the re-scaled score per one unit of the predictor value was calculated to obtain the equations for calculating an individual’s total score. To define the threshold for the scoring system, the model thresholds were transformed into scores. The starting value for each model was then calculated based on the minimum value of each predictor (or the maximum value of predictors with negative coefficients) and the model intercept. The original score at the threshold for the medium and giant CAA model was then calculated and distinguished between the high and low risk of CAA. Supplementary Appendixes A and B contain information about these transformations as well as a worked-out example.

Results

Baseline characteristics

This study included 1331 children (826 males and 505 females) who were hospitalized for KD. The median age was 22 months (ranging from 2 months to 9.7 years). 141 cases (10.6%) were diagnosed with incomplete KD. The median days of fever before IVIG was 6 days (ranged from 3 days to 24 days) and 258 patients (19.4%) received initial treatment after seven days of illness. 92 patients (6.9%) who met the indication of the initial steroid therapy received IVIG combined with systemic glucocorticoids in the first treatment after being diagnosed with Kawasaki disease. The total duration of fever ranged from 3 days to 26 days. Among them, 108 (8.1%) were defined as IVIG non-responders, and 63 (4.7%) were complicated by medium or giant coronary aneurysms (36 medium coronary aneurysms and 27 giant coronary aneurysms). Of the 63 patients with medium or giant coronary aneurysms, 27(42.9%) were irresponsive to the first dose of IVIG, and 5 cases (7.9%) were co-administered with IVIG and systemic glucocorticoids. In comparison, 13(48.1%) out of 27 patients with giant coronary aneurysms were irresponsive to the first dose of IVIG, and 4 cases (14.8%) were co-administered with IVIG and systemic glucocorticoids. The regression rates 1 year after diagnosis were only 9.7% for medium CAA (5 patients were lost to follow-up) and none of giant CAA (1 patient was lost to follow-up) regressed to normal z score or dilation only after 1 year.

Comparison of clinical characteristics between the medium-giant CAA and non-medium-giant CAA groups

Males made up 50 (79.4%) of the group with medium or giant CAA and 776 (61.2%) of the group without it (P < 0.05). In the medium or giant CAA group, the age ranged from 2 to 116 months, with a median age of 27 months, whereas in the non-medium or giant CAA group, the age ranged from 2 to 107 months, with a median age of 22 months (P > 0.05). Duration of fever before IVIG ranged from 5 to 13 days, with a median duration of 7 days, in the medium or giant CAA group, whereas in the non-medium or giant CAA group, the duration ranged from 5 to 7 days, with a median duration of 6 days. The prevalence of incomplete KD and IVIG resistance, the days of fever before IVIG and the total duration of fever differed substantially (P < 0.05) between the two groups (Table 1). In addition, the initial therapy with IVIG and systemic glucocorticoids in the medium or giant CAA group and non-medium or giant CAA group were respectively in 6.9% and 7.9% of patients, which showed no statistical significance (p > 0.05).

Table 1 Comparison of clinical characteristics between the medium or giant CAA and non-medium or giant CAA groups.

In comparison to the non-medium and giant CAA group for laboratory parameters, platelet count and erythrocyte sedimentation rate were significantly higher (P < 0.05) in the medium or giant CAA group. The levels of hemoglobin, albumin and total bilirubin levels in the medium or giant CAA group were significantly lower than in the non-medium or giant CAA group (P < 0.05) (Table 1).

Comparison of clinical characteristics between the giant CAA and non-giant CAA groups

There were respectively 805 (61.7%) males and 21 (77.8%) males in the non-giant CAA group and giant CAA group. In the giant CAA group, the age varied from 3 to 116 months, with a median age of 30 months, whereas in the non-giant CAA group, the age ranged from 2 to 107 months, with a median age of 22 months. Duration of fever before IVIG ranged from 5 to 14 days, with a median duration of 8 days, in the giant CAA group, whereas in the non-giant CAA group, the duration ranged from 5 to 7 days, with a median duration of 6 days. Significant differences (P < 0.05) were demonstrated in the frequency of IVIG resistance, the days of fever before IVIG, and the total duration of fever between the two groups (Table 2). Moreover, 14.8% of patients received IVIG combined with systemic glucocorticoids in their initial therapy in the giant CAA group, while 6.7% were in the non-giant CAA group, which were no statistical significance.

Table 2 Comparison of clinical characteristics between the giant CAA and non-giant CAA groups.

As for laboratory parameters, the levels of CRP, platelet count, erythrocyte sedimentation rate, and in the giant CAA group were significantly higher than those in the non-giant CAA group (P < 0.05). The levels of hemoglobin and albumin in the giant CAA group were significantly lower than in the non-giant CAA group (P < 0.05) (Table 2).

Risk prediction model for medium or giant coronary aneurysms

Twenty clinical and laboratory factors (male, age, incomplete KD, duration of fever before IVIG, total duration of fever, IVIG resistance, initial therapy combined with glucocorticoids, CRP, neutrophils ratio, platelet count, white blood cell, hemoglobin, procalcitonin, erythrocyte sedimentation rate, alanine transaminase, aspartate aminotransferase, total bilirubin, albumin, serum sodium, ferroprotein) were analyzed using logistic regression. Male, duration of fever, IVIG resistance, platelet count, and albumin were independent predictors of medium or giant CAA (Table 3). The Hosmer–Lemeshow value was 0.706, which showed that the observed event rate and the model did not deviate significantly. Cut-off values of 0.0398 in the logistic regression model resulted in a sensitivity of 86.89% and a specificity of 81.74%, respectively. This regression model’s area under the ROC curve was 0.913 (95% confidence interval (CI) 0.877, 0.950) (Fig. 1a).

Table 3 Independent factors identified by multiple logistic regression analysis for predicting medium or giant CAA.
Fig. 1: Receiver operating characteristic (ROC) curve of the logistic regression model and the scoring system for medium or giant CAA.
figure 1

In the logistic regression model and the scoring system, cut-off values of −0.0398 and 8.92 points yielded sensitivities of 86.89% and 86.89%, and specificities of 81.74% and 81.65%, respectively. The area under the curve (AUC) was 0.91 both in the logistic regression model and the scoring system. The comparison between the model’s and score’s AUCs showed no statistical difference (P = 0.43) (a); ROC curve of the logistic regression model and the scoring system for giant CAA. In the logistic regression model and the scoring system, cut-off values of −0.0289 and 15.34 points both yielded sensitivities of 92.59% and specificities of 87.93%, respectively. The AUC was 0.95 both in the logistic regression model and the scoring system. The comparison between the model’s and score’s AUCs showed no statistical difference (P = 0.82) (b).

The point for each risk predictor was assigned and a scoring model was obtained by the following calculation formula:

Medium or giant CAA score = Sex * 1.3182 + (Age − 2) * 0.0317 + (Total duration of fever − 3) * 0.2609 + IVIG resistance * 2.7315 + (PLT − 45) * 0.0088 + (ALB − 45.60) * (−0.1596)

Note: Sex: male = 1, female = 0; Age, months; Total duration of fever, days; IVIG resistance: Yes = 1, No = 0; PLT: platelet count, 109/L; ALB: albumin, g/L.

Using a cut-off of 8.97 points with this risk prediction score, KD patients with medium and giant CAA were with 86.89% sensitivity and 81.57% specificity. The AUC of this scoring system was 0.914 (95% CI 0.877, 0.950) (Fig. 1a). AUCs generated from the model and the scoring system for determining the risk of medium or giant CAA showed no significant difference (p > 0.05).

The 10-fold internal cross-validation was used to assess model performance, and discrimination was good (cross-validation (CV) AUC 0.904, 95% CI 0.863, 0.944). With a CV calibration-in-the-large (CITL) of 0.001 and a CV slope of 0.93, this definition was accurately calibrated to the reference standard (Fig. 2a). To assess the model’s and the scoring system’s clinical effectiveness, decision curve analysis (DCA) was used (Fig. 3a).

Fig. 2: Calibration plot of the model predicting CAA.
figure 2

a Medium or giant CAA; b Giant CAA.

Fig. 3: Decision curve analysis on the model predicting CAA.
figure 3

a The model and the scoring system for assessing the risk of medium or giant CAA; b The model and the scoring system for assessing the risk of giant CAA.

Risk prediction model for giant coronary aneurysm

Twenty variables in Table 2 were evaluated by multivariate logistic regression. The results showed that age, male, duration of fever, IVIG resistance, platelet count, hemoglobin, and erythrocyte sedimentation rate were independent risk predictors of the giant coronary aneurysm (Table 4). The Hosmer–Lemeshow statistic for this regression model was 0.945, and the AUC was 0.948 (95% CI 0.927, 0.969) (Fig. 1b). Cut-off values of 0.0289 in the logistic regression model resulted in a sensitivity of 92.59% and a specificity of 87.93%.

Table 4 Independent factors identified by multiple logistic regression analysis for predicting giant CAA.

A scoring model was obtained by the following calculation formula:

Giant CAA score = Sex * 2.0699 + (Age − 2) * 0.0553 + (Total duration of fever − 3) * 0.3110 + IVIG resistance * 4.2436 + (PLT − 45) * 0.0088 + (HGB − 146) * (−0.0838) + ESR * 0.0672

Note: Sex: male = 1, female = 0; age, months; Total duration of fever, days; IVIG resistance: Yes = 1, No = 0; PLT: platelet count, 109/L; HGB: hemoglobin, g/L; ESR: erythrocyte sedimentation rate, mm/h.

Using a cut-off of 18.74 points with this risk prediction score, KD patients with giant CAA were with 92.59% sensitivity and 87.93% specificity. The AUC of this scoring system was 0.948 (95% CI 0.927, 0.969) (Fig. 1b). The AUCs generated from the model and the scoring system for determining the risk of giant CAA were not significantly different (p > 0.05).

Discrimination was good (CVAUC 0.931, 95% CI 0.903, 0.959) by the 10-fold internal cross-validation. With a CV CITL of −0.03 and a CV slope of 0.84, this definition was accurately calibrated to the reference standard (Fig. 2b). To assess the model’s and the scoring system’s clinical effectiveness, decision curve analysis (DCA) was used (Fig. 3b).

Validation dataset

The validation dataset was utilized to evaluate how accurately the risk model identified medium and giant CAA. In the prospective data (from January 2021 to December 2021), there were medium or giant CAA accounted for 17 of 193 KD patients (8.8%) (Table 1). In the validation dataset, the AUC of the medium or giant CAA model was 0.906 (95% CI 0.813, 0.999). Sensitivity and specificity were 82.35% and 89.60%, respectively. In addition, giant CAA accounted for 7 of 193 KD patients in the validation dataset (3.6%) (Table 2). In the validation dataset, the AUC of the giant CAA model was 0.937 (95% CI: 0.852, 1.000). The sensitivity and specificity were, respectively, 83.33% and 91.06%.

Discussion

Several scoring models for predicting the development of CAA have been established previously. However, these models have significant variations in predictive efficacy among different populations. Recently, some Japanese risk scores were applied to predict CAA risks in non-Japanese populations, but they did not accurately predict poor outcomes in non-Japanese populations.9,10,11 Therefore, we attempted to develop a new risk models to predict CAA in the Chinese population.

In this training and validation study, two risk prediction models for medium or giant coronary aneurysm and giant coronary aneurysm were developed. In addition, this study showed that male, age, total duration of fever, IVIG resistance, platelet count, and albumin were independent predictors for medium or giant coronary aneurysms. A logistic regression model with six indicators was created to predict KD in children with medium or giant CAA, and the cut-off of −0.0398 produced sensitivity and specificity values of 86.89% and 81.74%, respectively. The cut-off score of 8.92 points was established for use in clinical practice, and it produced 86.89% sensitivity and 81.65% specificity.

Independent risk factors of giant CAA were similar to medium or giant CAA risk factors. Among these factors, male, age, total duration of fever, IVIG resistance, platelet count, hemoglobin, and erythrocyte sedimentation rate were independent predictors of giant CAA. The cut-off of −0.0398 yielded sensitivity and specificity values of 86.79% and 81.74%, respectively, in a logistic regression model with six factors to predict KD in children with medium or giant CAA. 86.89% sensitivity and 81.65% specificity were achieved by the cut-off score of 8.92 points, which was created for use in clinical practice.

CAA is a serious complication of KD. Compared with small CAA, most medium and giant CAA persist which is an important cause of mortality in children with KD. To the best of our knowledge, this is the first study to search for risk factors and establish a prediction model for the development of medium-giant CAA in the Chinese population using z-scores and absolute inner diameter values. Mary Beth et al.12 analyzed data from 903 KD patients and indicated that left anterior descending or right coronary artery Z score ≥2.0, age <6 months, Asian ethnicity, and CRP ≥ 13 mg/dL may be the potential candidates for CAA predictors and constructed a risk prediction model for CAA of which AUC was 0.82. Liu et al.10 selected 203 KD patients to develop a scoring system comprised of five different predictors: days of disease at initial therapy ≥7, redness and edema of extremities, hematocrit ≤33%, proportion of monocytes ≥8.89%, and procalcitonin ≥0.5 ng/mL. The newly constructed scoring system has an AUC of 0.685, with a sensitivity of 41.18% and a specificity of 84.41%. To date, a few grading methods have been developed to estimate CAA hazards. These scoring methods, however, were not based on large sample sizes and had lower sensitivity and specificity than the risk prediction model produced in our study.

The prevalence of giant CAA was 2.2% in our study, which is higher than 0.1% reported in studies conducted by the nationwide KD survey in Japan during 2017–2018.13 This difference may be because we have information based on both the absolute dimension of coronary and Z-score evaluation. The adoption of the Z-score criterion may result in more CAA detection.14,15 On the other hand, the high incidence of giant CAA may also be related to the delay in receiving initial IVIG in our study. Nearly 20% of patients in our study received initial treatment after 7 days of illness, while a recent survey showed that nearly 90% of patients in Japan received initial IVIG therapy within 7 days of illness.16 Some studies showed that compared with conventional therapy (within 7 days), late IVIG therapy may have a higher risk for coronary aneurysms.10,16

Previous research on long-term coronary outcomes in KD patients with CAA found that larger-sized coronary lesions were less likely to regress.3,5,17,18 Advani et al.3 found that CAA with a maximum diameter of ≥6.0 mm or Z ≥ 7.5 at 2 months after KD persisted at 15 years after KD. In our study, the regression rate of medium and giant CAA was only 5.3% at 1 year after the diagnosis of KD. Therefore, identifying potential risk factors for the development of larger-sized coronary lesions plays an important role in the treatment of KD.

In our study, 8.6% of patients received glucocorticoids in their initial therapy. In China, the use of steroids in the initial therapy of KD is suitable for Kawasaki disease shock syndrome (KDSS), KD with macrophage activation syndrome (MAS), and patients at high risk for nonresponse to IVIG defined by scoring systems including Kobayashi score or risk scores for predicting nonresponse to IVIG assigned by various hospitals. For patients who are IVIG resistance, it is recommended that receiving retreatment with high dose of IVIG, giving corticosteroids, or using infliximab (IFX). Chinese clinicians usually apply methylprednisolone 2 mg/ (kg·d) and decreased the dose gradually until CRP is normal, or intravenous high-dose methylprednisolone 10–30 mg/(kg·d) for 3–5 days, followed by oral prednisone 2 mg/ (kg·d), and then progressively discontinued. The total course of treatment for corticosteroids will be more than 2 weeks.19

Each of the variables in our model has also been shown to be a CAA risk factor.20,21,22,23 Male constituted a higher risk for medium to large CAA and other studies had also found that patients with large CAA were almost exclusively male, suggesting that sex may have a key influence in coronary artery disease in KD.20,22,24,25 Younger age may influence IVIG response. Despite prompt identification and treatment, age at illness onset of fewer than 6 months is associated with a negative result.21,26,27 As a result, one of the elements in the Egami and Kobayashi score is the age at diagnosis.28,29 Delayed diagnosis, along with incomplete clinical manifestation, has been indicated to be the major contributor to the development of CAA.22 In the current study, although incomplete KD and duration of fever before IVIG were not the independent risk factors for CAA, our data suggested that they were associated with medium and giant CAA.

A prolonged fever raises the risk of medium and giant coronary aneurysms. As a consequence, the objective of therapy during the acute phase of KD is to shorten fever duration, reduce systemic inflammation, and avoid vascular damage.30

In the present study, the incidence of IVIG-resistant KD was 8.1%, which is similar to previously reported data.31,32,33 Numerous studies have shown that timely IVIG therapy reduces the development and severity of coronary vasculitis. As with previous studies, patients who did not response to IVIG therapy in this study were about 10 times more likely to develop medium and giant CAA than those who were sensitive to IVIG.

Platelet count, hemoglobin, albumin, and erythrocyte sedimentation rate could be easily obtained from the blood test and are widely proven to be useful prognostic indicators of KD. Although the precise mechanism of thrombocytosis is uncertain, it has been hypothesized that an acute inflammatory response mediated by high thrombopoietin levels may cause thrombocytosis.34 Previous research found that anemia of inflammation induced by impaired erythropoiesis and reduced iron availability in critically sick individuals was an independent predictor of higher mortality.35,36 Lower hemoglobin levels may reflect that greater underlying inflammation with a more intense systemic vasculitis and more profound myocardial involvement. Thus, lower hemoglobin levels in the acute period of KD may be a significant risk factor for CAA. Hypoalbuminemia was often reported in individuals with KD, which may have occurred mostly from enhanced microvascular permeability during the acute phase.37 In the present study, Lower albumin levels have been associated with the development of medium or giant CAA formation, which may be indicative of more inflammation and vascular leakage in coronary aneurysms. An elevated erythrocyte sedimentation rate was found in almost all children with KD, which was an important variable to predict IVIG resistance in previous studies,38,39 and we found erythrocyte sedimentation rate was a useful predictor of giant CAA, highlighting its role in exuberant inflammation in KD.

The most important complication associated with KD is the formation of medium and giant coronary aneurysms, which is the most significant issue affecting the prognosis. As a result, the primary treatment objective for KD patients is to avoid the development of medium and giant CAA. This study constructed two risk prediction models to predict KD patients at high risk of complicating with medium and giant CAA, especially for patients who did not respond to IVIG or had a long duration of fever. We recommend that such patients use this predictive formula for prediction and they may require more rigorous treatments and more careful follow-up of echocardiography during acute KD. Further long-term studies to assess the risks of developing medium and giant CAA with KD are needed.

Limitations

There are some limitations to our study. First, this study was done at a single Chinese tertiary medical center, which might have resulted in selection bias. Second, this was a retrospective investigation, and thus, it is difficult to assess certain data except for the patient’s information in the medical record. Third, the evaluation of echocardiogram measurements was not centralized and standardized. Forth, coronary abnormalities on initial echocardiograms were not analyzed. Finally, other treatments like aspirin were not taken into consideration, which was potential confounders in the current study.

Conclusions

Two risk prediction models to predict the occurrence of medium or giant CAA and predict giant CAA were developed and validated in the present study. Moreover, we found that male, age, total duration of fever, IVIG resistance, platelet count, and albumin were independently associated with medium or giant CAA formation. The scoring approach employed in this study to distinguish between groups with a low and high risk of CAA may be a useful tool for guiding more efficient initial therapy and echocardiographic follow-up. Future larger studies with a more diverse population in China are needed to test this risk score and to assess accuracy and generalizability.