Explainable machine-learning predictions for complications after pediatric congenital heart surgery

Zeng, Xian; Hu, Yaoqin; Shu, Liqi; Li, Jianhua; Duan, Huilong; Shu, Qiang; Li, Haomin

doi:10.1038/s41598-021-96721-w

Download PDF

Article
Open access
Published: 26 August 2021

Explainable machine-learning predictions for complications after pediatric congenital heart surgery

Xian Zeng^1,2,
Yaoqin Hu¹,
Liqi Shu³,
Jianhua Li¹,
Huilong Duan²,
Qiang Shu¹ &
…
Haomin Li¹

Scientific Reports volume 11, Article number: 17244 (2021) Cite this article

2290 Accesses
19 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The quality of treatment and prognosis after pediatric congenital heart surgery remains unsatisfactory. A reliable prediction model for postoperative complications of congenital heart surgery patients is essential to enable prompt initiation of therapy and improve the quality of prognosis. Here, we develop an interpretable machine-learning-based model that integrates patient demographics, surgery-specific features and intraoperative blood pressure data for accurately predicting complications after pediatric congenital heart surgery. We used blood pressure variability and the k-means algorithm combined with a smoothed formulation of dynamic time wrapping to extract features from time-series data. In addition, SHAP framework was used to provide explanations of the prediction. Our model achieved the best performance both in binary and multi-label classification compared with other consensus-based risk models. In addition, this explainable model explains why a prediction was made to help improve the clinical understanding of complication risk and generate actionable knowledge in practice. The combination of model performance and interpretability is easy for clinicians to trust and provide insight into how they should respond before the condition worsens after pediatric congenital heart surgery.

Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks

Article Open access 21 January 2023

Features derived from blood pressure and intracranial pressure predict elevated intracranial pressure events in critically ill children

Article Open access 12 December 2022

A retrospective study of mortality for perioperative cardiac arrests toward a personalized treatment

Article Open access 12 August 2022

Introduction

Congenital heart disease is the most common form of major birth defect, affecting approximately 8 in 1000 live births worldwide¹. Extraordinary advances in cardiovascular diagnostics and cardiothoracic surgery have increased the survival of newborns with congenital heart disease². Nevertheless, the quality of treatment and prognosis after congenital heart surgery remains unsatisfactory, especially when complex surgery is performed^3,4. Postoperative complications in congenital heart surgery have been inconsistently reported but have important contributions to mortality, hospital stay, cost and quality of life^5,6,7. Heart centers with the best outcomes might not report fewer complications but rather have systems in place to recognize and correct complications before deleterious outcomes ensue⁶. In these cases, the early detection of deterioration after congenital heart surgery enables a prompt initiation of therapy, which may result in reduced impairment and earlier rehabilitation. Several scoring systems, such as the Risk Adjustment for Congenital Heart Surgery (RACHS-1) category⁸, the Aristotle Basic Complexity (ABC) score⁹, the European Association for Cardiothoracic Surgery and the Society of Thoracic Surgeons (STS-EACTS) mortality score¹⁰, and the STS-EACTS morbidity score¹¹, have been developed and used to adjust the risk of in-hospital morbidity and mortality in the community. However, all these consensus-based risk models only focus on the procedure themselves and cannot be adjusted for specific patient characteristics such as lower weight¹² and longer cardiopulmonary bypass (CPB)¹³, which were associated with worse outcomes after congenital heart surgery.

With the development of electronic health record (EHR) systems, abundant, complex, high-dimensional, and heterogeneous data are being captured during surgery and daily care. Researches using EHR data have shown that weight^12,14, perioperative blood transfusions¹⁵, CPB^13,16, and preoperative ejection fraction¹⁷ were associated with the risk of postoperative complications and mortality after congenital heart surgery. A machine learning-based predictive model¹⁸ has recently been used to identify independent risk factors and predict complications after congenital heart surgery. However, several gaps remain to be addressed. First, quantifying the effect of these risk factors in both a specific patient and a population in clinical practice is less explored. Second, highly intensive vital signs data during surgery were not fully utilized. Perioperative blood pressure control has been adopted as a significant clinical focus in congenital heart surgery. The previous model only used static features to predict postoperative complications and did not leverage intraoperative blood pressure data. Third, the previous model focused only on predicting whether patients had postoperative complications and did not address what kind of complications patients could experience. Clinicians might take different interventions for different complications, so it is of more clinical significance to predict the specific complications that patients will experience. Fourth, although machine learning model provides good prediction accuracy, its application in an actual clinical setting is limited because the prediction is difficult to interpret. Interpretable methods explain why a certain prediction was made for a patient, that is, a specific characteristic that led to the prediction.

We aimed to develop and internally validate a machine learning model to predict the risk of complications and what kind of complications patients could experience using patient demographics, surgery-specific features, and intraoperative blood pressure data, all of which are routinely collected as part of medical records. In addition, to gain insight into the specific factors that contribute most to the model predictions, we used the feature attribution framework of SHAP (SHapley Additive exPlanations). The schematic of data processing and workflow of our proposed model was shown in Fig. 1. We believe the combination of model performance and interpretability is an important step forwards that enables the prediction of postoperative complications prediction to be more widely used in practice.

Results

Population characteristics

A total of 1964 patients with a median age of 11 months (IQR 4–26) were included in the final analyses, of which 582 (34.4%) patients developed postoperative complications, 134 (6.8%) patients developed cardiac complications, 131 (6.7%) patients developed rhythm complications, 432 (22.0%) patients developed lung complications, 90 (4.6%) patients developed infectious complications, and 155 patients developed other complications. Patient characteristics used in the final prediction model were shown in Table 1. The univariate analysis revealed that patients with postoperative complications were more likely to be boys, had lighter weight, shorter height, and younger age. Lower blood oxygen saturation levels before and after surgery were also associated with postoperative complications. Moreover, a longer surgical time, CPB time, and aortic cross-clamping time were associated with complications. The trajectory of blood pressure change during surgery and blood pressure variability of different phases were also associated with postoperative complications.

Table 1 Characteristics of cardiac patients stratified by postoperative complications (PC).

Full size table

Data-driven clusters group blood pressure time-series data

When patients were ordered by their blood pressure cluster, the block-like structure of the similarity matrix becomes evident (Fig. 2a). Black lines along the diagonal marked blocks of patients grouped into the same cluster and similarities between patients in the marked blocks have some differences from those outside the mark blocks. We also compared the composition of each cluster in terms of whether the patients experienced postoperative complications, risk categories of operation, or primary diagnoses (Fig. 2b). Notably, there were significant differences in these components of clusters, and the subjects belonged to clusters with higher rates of complications harbored more complex surgery. Clusters with higher rates of complications were also composed primarily of patients with high levels of patent ductus arteriosus, coarctation of aorta, and type 1 total anomalous pulmonary venous connection, which have a higher risk than other diagnoses. In addition, we compared surgical time, CPB time, and aortic cross-clamping time between clusters and found statistically significant differences between clusters for these times (Fig. 2c). The mean and 95% confidence interval of the blood pressure readings during the surgery between different clusters were shown in Supplementary Fig. S3.

Performance of the complication prediction model

To test the potential of our model to aid postoperative complication prediction we evaluated the performance of our proposed model and four consensus-based risk models using receiver operating characteristic curves and other evaluation metrics (Fig. 3, Table 2, Supplementary Table S2). We found that for both the binary and multi-label classification tasks, the predictions made by our model are considerably more accurate than the predictions made by consensus-based risk models. The STS-EACTS morbidity score performed relatively well in both types of classification tasks compared with other risk models. Cardiac complication prediction has a higher AUC of 0.946, whereas lung complication prediction has a lower AUC of 0.785 (Fig. 3b–f, Supplementary Table S2).

Table 2 Experimental results of binary classification and multi-label classification on the test set.

Full size table

Inspection of model features

In Fig. 4a,b, we list the top 15 features by mean absolute SHAP value for both the binary and multi-label classification, and different colored circles in Fig. 4b represent the feature importance of each category in multi-label classification. The top 15 features importance of each complication category are respectively shown in Supplementary Fig. S4. The relationship between feature value and SHAP value in binary classification is illustrated in more detail for the features in Fig. 4c,d, with further examples in Supplementary Fig. S5. When removing the top 15 features in turn, the removal of CPB time noticeably decreased model performance in the binary classification, as is also observed in the analysis based on SHAP values. While these analyses show the overall effect of the features, SHAP values can also be inspected for individual predictions to identify the influential features (see Fig. 4e). All these effects explain why the model predicted a specific risk and thus allow appropriate interventions before deleterious outcomes ensue.

Discussion

We developed an efficient machine-learning-based model that comprehensively integrated patient- and surgery-specific static features and intraoperative time-series features to predict postoperative complications before they occur. Based on a comparison with existing consensus-based risk models, our model achieves superior performance both in binary and multi-label classification. Different from the traditional black-box model, we used the XGBoost model and SHAP framework which take advantage of artificial intelligence to process complex and high-dimensional features and identify the quantitative association between factors and prediction result to explain the prediction at different levels. In addition, we introduced blood pressure variability and k-means algorithm combined with soft-DTW to preprocessing intraoperative blood pressure. Such an approach can improve the interpretability of intraoperative time-series features, and we know which phase of fluctuation in surgery is more likely to lead to complications. This combination of model performance and interpretability allows physicians to receive the best predictions while also gaining insight into why those predictions were made.

The risk profiles learned by our model are clinically relevant. First, accumulating evidence has demonstrated the prolonged duration of CPB as a risk factor for neurologic, respiratory, infective, and renal complications. However, CPB time is frequently dichotomized at heterogeneous time points or the association between duration and risk of complication was not well characterized. Yamauchi and colleagues identified CPB times > 5 h as a risk factor of postoperative acute kidney injury¹⁹. Agarwal and colleagues reported that a longer CPB time was significantly associated with a great number of cardiac and extracardiac complications¹³. In our study, CPB time is also the most important variable as is observed in the analysis based on SHAP values and selection of variables guided by model performance. We provide a more useful perspective by considering the quantitative effect of continuous CPB time and its relationship with complications (Fig. 4c). The actionable knowledge such as control the CPB time under 80 min or 160 min will relatively control the risk of postoperative complications at different levels can be generated from these explainable plots. Second, clinical surgeons can now quantify risks of postoperative complications adjusted for other factors to the younger, those who are low weight and more susceptible to the environment. The exact relationships described in Fig. 4 and Supplementary Fig. S5 clearly show the patterns and threshold points for the risk. Third, different diagnoses are associated with different risk levels of postoperative complications. For example, patent ductus arteriosus or tetralogy of fallot patients may be more critically ill as they have more postoperative complications when compared with patients with other defects. When considering the standardization of care to reduce unwanted clinical deterioration, these data suggest that resources need to be differentially deployed to address differential rates of complications.

To the best of our knowledge, before, during, and after CPB are 3 distinctly different phases in cardiac surgery, and changes in blood pressure at these 3 phases may also have different effects on complications²⁰. Due to the lack of our data on patients’ start and end times of CPB, we attempted to explore the impact of changes in blood pressure at different phases of surgery distinguished by changes in temperature²¹. Naturally, changes in blood pressure before hypothermia had a minimal effect on the risk of complications when compared with intra- and post- hypothermia (Fig. 4, Supplementary Fig. S4). Interestingly, we found that the smaller average slope of systolic blood pressure was associated with an increased risk of postoperative complications in both the univariable analysis and prediction model (Table 1, Supplementary Fig. S5). This finding stands in contrast to the common belief that rapid fluctuations in intraoperative arterial blood pressure are deleterious and that clinicians should strive to maintain ‘railroad track’ hemodynamics²². One possible interpretation may be that patients with shorter surgical times quickly have steep changes in blood pressure readings because the trends in blood pressure readings are all from normal to lower and finally back to normal. For the analysis of postoperative outcomes, the complexity of cardiac surgery was considered an important risk factor in other studies¹³. The length of surgical time indirectly reflects the complexity of the surgery.

When using this model to generate early warnings before complications occur, it is important to understand the balance between recall (the sensitivity) and precision. Given that the ratio of positive to negative in our data is unbalanced, especially for specific complication types, we adjust one weight parameter in the model which can control the balance of positive and negative weights. However, in the multi-label classification, infectious and rhythm complications achieved the worst F1 score (Supplementary Table S2) when compared with other types of complications. One possible interpretation may be that the ratio of positive and negative for this type of complication is too unbalanced and adjusting the weight parameter to improve recall will result in a significant reduction in precision. In addition, the correlation between different types of complications and features is not the same, the current features maybe not the strongest predictor of infectious and rhythm complications (Supplementary Fig. S4).

The field of medicine is full of data science challenges that have the potential to fundamentally affect the way medicine is practiced. More and more data-driven predictions of patient prognosis are being proposed. However, black-box models which did not provide any explanations about why make this prediction, are difficult for physicians to trust. The ability to establish which features contributed to a prediction ensures that this technology remains interpretable to its clinical users. Using SHAP values, we see that the model provided quantitative insight into the exact changes in risk caused by changes in the features of certain patients. In addition, the interpretable prediction made by our model is easy for physicians to trust and provide insight into how they should respond before the condition worsens.

Even though our model gets a better performance when compared with other consensus-based risk models, it should still be considered as an initial attempt. In the multi-label predictions, considering the low number of some complication cases, we classified complications into five complication classes rather than predicting specific types of complications. To enhance the clinical availability, future attempts can focus on predicting specific types of complications and identifying features that led to this risk. Another future enhancement would be the integration of abundant preoperative data, such as detailed laboratory results of patients into the prediction model. More high-fidelity intraoperative data such as heart rate, End-tidal CO₂, and respiratory rate could include in the prediction model, thus potentially leading to more accurate predictions.

There are some possible limitations in this study. We only used relatively few data (n = 1964) from a single center to train and validate the model; thus, multicenter data will be used to train and validate the model. While we believe that a specific model trained on data from a single center will give a more specific prediction for a single center. This approach can be used to train specific models for data from multi-centers when considering hospitals and surgeons as features. In addition, we also performed a time-based split to divide the dataset into separate training and test sets (the experimental results of binary classification and multi-label classification on the time-based test set is listed in Supplementary Table S3). In shortly, the performance of proposed model is better than the risk adjustment models especially gave a much higher recall. Compared with randomly dividing the training sets and test sets, the time-based split prediction results have decreased slightly. One possible interpretation may be that the treatment of complex defects has greatly improved with the rapid development of surgical and interventional treatment, so there may be some differences in the characteristics of the dataset from year to year. Another limitation is the low frequency at which blood pressure was obtained, specifically one measurement every 5 min to 10 min during the surgery. A narrower time interval would have been desirable and possibly more illuminating.

Conclusions

In summary, with a novel interpretable machine learning algorithm, we can predict whether a patient has the complication after congenital heart surgery and what kind of complications will occur and explain the specific patient characteristics that led to this prediction. This prediction model achieved higher accuracy and sensitivity compared to risk adjustment models. We believe the combination of model performance and interpretability could provide useful information for physicians and can be used as part of clinical decision making.

Methods

Study design and population

A total of 2858 pediatric patients who underwent congenital heart surgery between December 2015 and December 2018 at the Children’s Hospital of Zhejiang University School of Medicine were enrolled in the present analysis. Exclusion criteria included patients who died during the surgery, patients who lacked intraoperative anesthesia records, or patients who underwent surgery without CPB, for which the selection process of eligible participants is shown in Supplementary Fig. S1. Thus, the dataset from the remaining 1964 patients were included in the present analyses. This retrospective study was performed according to relevant guidelines and approved by the institutional review board of the Children’s Hospital, Zhejiang University School of Medicine with a waiver of informed consent (2018_IRB_078).

Data collection and pre-processing

The following data elements were requested: gender, age, height, and weight of patients; diagnoses and types of procedures; surgical time, CPB time and aortic cross-clamping time; surgical access route; preoperative and postoperative oxygen saturation; intraoperative anesthetic record data; and postoperative complications.

The most challenging part of the data preprocessing is the time-series vital signs data during surgery with different lengths which cannot be directly used to construct the prediction model. The evidence-based literature supporting temperature management in cardiac surgery suggests that mild (32–35 °C), moderate (28–32 °C), or deep hypothermic (< 28 °C) is used to protect the brain and other vital organs during cardiopulmonary bypass²¹. Firstly, we divided surgery into three phases according to the changes in temperature, namely, the pre- (normal temperature—35 °C), intra- (< 35 °C), and post- (35 °C—normal temperature) hypothermic periods. Blood pressure variability including the coefficient of variation and slope was used to measure blood pressure fluctuations of different phases of surgery (Fig. 1). The coefficient of variation was defined as the standard deviation divided by the mean of each blood pressure sequence. In addition, the average changes (the slope) were also calculated as follows:

$${\text{Slope}} = \frac{1}{{N - 1}}\sum\limits_{{k = 1}}^{{N - 1}} {\frac{{|BP_{{k + 1}} - BP_{k} |}}{{t_{{k + 1}} - t_{k} }}}.$$

To further capture the dynamic temporal pattern of blood pressure during surgery, we used a k-means algorithm to cluster the pattern of blood pressure changes in distinct trajectories (Fig. 1). In time-series analyses, the smoothed formulation of dynamic time warping (soft-DTW) was used to measure the similarity between two temporal sequences, which may vary in length and speed²³. To perform clustering of the blood pressure, we constructed a matrix R whose elements R_i,j equal the blood pressure trajectory similarity calculated by soft-DTW between patient i and patient j. Next, we performed k-means clustering on the similarity matrix R and the number of clusters was determined by maximizing the average silhouette coefficient and minimizing the within clusters sum of squares (a more detailed description of determining the optimal number of clusters is illustrated in Supplementary Fig. S2). The application of k-means clustering was applied after data splitting when training prediction model. For each patient in the test set, we computed the similarities between data points and all centroids and assigned each data point to the closest cluster. Collectively, the extracted items including blood pressure variability and clustered trajectories combined with patient characteristics were summarized into 45 features (detail shown in Table 1), which were subsequently used to construct the machine learning prediction model.

The missing values were imputed using multivariate imputation via chained equations package in R²⁴. It is a practical approach to generating imputations based on a set of imputation models, one for each variable with missing values. We used the random forest to fit regression trees of the data and imputed each missing value as the prediction based on trees. Class imbalance is also a problem in this study since the number of patients with postoperative complications is relatively small in compassion with the number without complications in some scenarios. It is important to properly adjust your metrics and methods to adjust for your goals²⁵. In this study, we used the scale_pos_weight hyperparameter in XGBoost which is designed to tune the behavior of the algorithm for imbalanced classification problems. It has the effect of weighing the balance of positive examples, relative to negative examples when boosting decision trees.

Postoperative complication labels

The label of whether the patient had any complications after surgery and what kind of complications occurred was collected by clinicians based on the review of medical records. Based on more than 30 defined complications (detailed definition of the types of complications is listed in Supplementary Table S1), we classified complications into five complication classes: lung complication, cardiac complication, rhythm complication, infectious complication, and other complications²⁶. Cardiac complication indicates that a complication symptom appeared in the heart except for arrhythmia, such as cardiac dysfunction resulting in low cardiac output, pulmonary hypertension, and so on. Rhythm complication indicates that any cardiac rhythm other than normal sinus rhythm. Infectious complication is defined as the successful invasion and growth of organisms in the tissues of the host such as sepsis, urinary tract infection, and wound infection. Other complications indicate that the symptoms of complications in other organs apart from the lung and heart such as thrombosis, liver dysfunction, ascites, and so on. It is worth mentioning that a patient can experience multiple postoperative complications. In this study, we defined two tasks, binary classification and multi-label classification, to predict whether the corresponding patient has complications and what kind of complications.

Statistical analysis

The patients were categorized according to whether they had experienced postoperative complications. Categorical variables were presented as counts and percentages, and continuous variables as median with interquartile range (IQR) as 25th and 75th percentiles. The Chi-square test was used to compare categorical variables of patients with and without this outcome, and the continuous variables were compared using the Mann–Whitney U test. Bonferroni’s correction was used to control the family-wise error rate when multiple comparisons were performed. All tests were two-sided, and statistical significance was set at P-value < 0.05 for all analyses. Data analyses were performed using the published package in the Python (version 3.7) programming environments.

Model development and evaluation

As the collected features may have a variety of nonlinear interactions, we used XGBoost, a scalable tree boosting system, to link input features with postoperative complications. It implements machine learning algorithms under the gradient boosting framework and provides a parallel tree boosting that solves many data science problems in a fast and accurate way²⁷. To understand how single features relate to the model output we used SHAP (Shapley Additive exPlanations) values, which are suited for complex models such as neural networks and gradient-boosting machines²⁸. The impact of each feature on the model is represented using Shapley values, which are from the game theory and provide a theoretically justified method for allocation of a coalition’s output among the members of the coalition²⁸.

To ensure stability and extrapolation of machine learning model, we randomly divided the dataset into separate training (n = 1375) and test sets (n = 589) at a ratio of 7:3. We used fivefold cross-validation on the training set to tune hyperparameters for each classification and evaluated the final performance using the independent test set. The optimal model parameters were determined in a random search of 500 different combinations of hyperparameters of XGBoost. For the final binary classification model, we used learning rate as 0.01, gradient boosted trees as 292, maximum tree depth as 3, and minimum child weight of any branch in the trees as 5. For the final multi-label classification model, these parameters respectively were 0.02, 140, 5, and 4.

The accuracy, area under the receiver operating characteristic curve (AUC), recall, and F1 score were the metrics used to evaluate binary classification performance. The accuracy, micro-recall, micro-F1 score, and macro-AUC were the metrics used to evaluate multi-label classification performance. The F1 score is a measure of test data accuracy, which is a weighted average between precision and recall. The micro average calculates metrics globally by counting the total true positives, false negatives, and false positives; while the macro average calculates metrics for each label and finds their unweighted mean. We compared the performance of our prediction model with four risk adjustment models mentioned above in the binary and multi-label classification. For patients undergoing multiple procedures, the procedure with the highest level was scored. We assessed the RACHS-1 category, the ABC score, the STS-EACTS mortality and morbidity score as a predictor of postoperative complications by using the univariable logistic regression respectively.

Ethics declarations

This study was approved at 2018-09-19 by the Institutional Review Board/Ethics Committee of the Children’s Hospital, Zhejiang University School of Medicine (2018_IRB_078). Written informed consent was waived by the Institutional Review Board/Ethics Committee, as the utilization of anonymized retrospective data does not require patient consent under the local legislation.

Data availability

Data collected for this study are highly sensitive, and if reasonably requested, data supporting the findings of this study can be obtained from the corresponding author on reasonable request.

References

Bernier, P. L., Stefanescu, A., Samoukovic, G. & Tchervenkov, C. I. The challenge of congenital heart disease worldwide: Epidemiologic and demographic facts. Semin. Thorac. Cardiovasc. Surg. Pediatr. Cardiol. Surg. Annu. 13, 26–34 (2010).
Article Google Scholar
Van Der Linde, D. et al. Birth prevalence of congenital heart disease worldwide: A systematic review and meta-analysis. J. Am. Coll. Cardiol. 58, 2241–2247 (2011).
Article Google Scholar
Jacobs, J. P. et al. The society of thoracic surgeons congenital heart surgery database: 2016 update on outcomes and quality. Ann. Thorac. Surg. 101, 850–862 (2016).
Article Google Scholar
Triedman, J. K. & Newburger, J. W. Trends in congenital heart disease. Circulation 133, 2716–2733 (2016).
Article Google Scholar
Benavidez, O. J., Gauvreau, K., Nido, P. D., Bacha, E. & Jenkins, K. J. Complications and risk factors for mortality during congenital heart surgery admissions. Ann. Thorac. Surg. 84, 147–155 (2007).
Article Google Scholar
Pasquali, S. K. et al. Evaluation of failure to rescue as a quality metric in pediatric heart surgery: An analysis of the STS congenital heart surgery database. Ann. Thorac. Surg. 94, 573–580 (2012).
Article Google Scholar
Kansy, A., Tobota, Z., Maruszewski, P. & Maruszewski, B. Analysis of 14,843 neonatal congenital heart surgical procedures in the European Association for cardiothoracic surgery congenital database. Ann. Thorac. Surg. 89, 1255–1259 (2010).
Article Google Scholar
Jenkins, K. J. et al. Consensus-based method for risk adjustment for surgery for congenital heart disease. J. Thorac. Cardiovasc. Surg. 123, 110–118 (2002).
Article Google Scholar
Lacour-Gayet, F. et al. The Aristotle score: A complexity-adjusted method to evaluate surgical results. Eur. J. Cardio-thorac. Surg. 25, 911–924 (2004).
Article CAS Google Scholar
O’Brien, S. M. et al. An empirically based tool for analyzing mortality associated with congenital heart surgery. J. Thorac. Cardiovasc. Surg. 138, 1139–1153 (2009).
Article Google Scholar
Jacobs, M. L. et al. An empirically based tool for analyzing morbidity associated with operations for congenital heart disease. J. Thorac. Cardiovasc. Surg. 145, 1046–1057 (2013).
Article Google Scholar
Kalfa, D. et al. Outcomes of cardiac surgery in patients weighing < 2.5 kg: Affect of patient-dependent and -independent variables. J. Thorac. Cardiovasc. Surg. 148, 2499–2506 (2014).
Article Google Scholar
Agarwal, H. S., Wolfram, K. B., Saville, B. R., Donahue, B. S. & Bichell, D. P. Postoperative complications and association with outcomes in pediatric cardiac surgery. J. Thorac. Cardiovasc. Surg. 148, 609–616 (2014).
Article Google Scholar
Alsoufi, B. et al. Low-weight infants are at increased mortality risk after palliative or corrective cardiac surgery. J. Thorac. Cardiovasc. Surg. 148, 2508–2514 (2014).
Article Google Scholar
Iyengar, A. et al. Association of complications with blood transfusions in pediatric cardiac surgery patients. Ann. Thorac. Surg. 96, 910–916 (2013).
Article Google Scholar
Salis, S. et al. Cardiopulmonary bypass duration is an independent predictor of morbidity and mortality after cardiac surgery. J. Cardiothorac. Vasc. Anesth. 22, 814–822 (2008).
Article Google Scholar
Pieri, M. et al. Outcome of cardiac surgery in patients with low preoperative ejection fraction. BMC Anesthesiol. 16, 1–10 (2016).
Article Google Scholar
Zeng, X. et al. Prediction of complications after paediatric cardiac surgery. Eur. J. Cardio-thorac. Surg. 57, 350–358 (2020).
Google Scholar
Yamauchi, T. et al. Risk index for postoperative acute kidney injury after valvular surgery using. Ann. Thorac. Surg. 104, 868–875 (2017).
Article Google Scholar
Jinadasa, S. P. et al. Blood pressure coefficient of variation and its association with cardiac surgical outcomes. Anesth. Analg. 127, 832–839 (2018).
Article Google Scholar
Engelman, R. et al. The Society of Thoracic Surgeons, The Society of Cardiovascular Anesthesiologists, and The American Society of ExtraCorporeal Technology: Clinical practice guidelines for cardiopulmonary bypass—Temperature management during cardiopulmonary bypass. Ann. Thorac. Surg. 100, 748–757 (2015).
Article Google Scholar
Levin, M. A. et al. Intraoperative arterial blood pressure lability is associated with improved 30 day survival. Br. J. Anaesth. 115, 716–726 (2015).
Article CAS Google Scholar
Cuturi, M. & Blondel, M. Soft-DTW: A differentiable loss function for time-series. ICML 2017, 1483–1505 (2017).
Google Scholar
van Buuren, S. & Groothuis-Oudshoorn, K. Mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011).
Article Google Scholar
Vluymans, S. Learning from imbalanced data. Stud. Comput. Intell. 807, 81–110 (2019).
Article Google Scholar
No authors listed. Part IV—The dictionary of definitions of complications associated with the treatment of patients with congenital cardiac disease. Cardiol. Young 18, 282–530 (2008).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. KDD 2016, 785–794 (2016).
Google Scholar
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. Nips 2017, 4765–4774 (2017).
Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (81871456) and the Chinese State Key Project of Research and Development Plan (2016YFC0901905). We also acknowledge the support of the Children’s Hospital of Zhejiang University School of Medicine (Zhejiang, China) for supplying the anonymized clinical data.

Author information

Authors and Affiliations

The Children’s Hospital of Zhejiang University School of Medicine and National Clinical Research Center for Child Health, Hangzhou, China
Xian Zeng, Yaoqin Hu, Jianhua Li, Qiang Shu & Haomin Li
The College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
Xian Zeng & Huilong Duan
Department of Neurology, Rhode Island Hospital, Brown University, Providence, USA
Liqi Shu

Authors

Xian Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Yaoqin Hu
View author publications
You can also search for this author in PubMed Google Scholar
Liqi Shu
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Li
View author publications
You can also search for this author in PubMed Google Scholar
Huilong Duan
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Shu
View author publications
You can also search for this author in PubMed Google Scholar
Haomin Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Z., S.L. and H.L. wrote the manuscript. H.L., Q.S. and X.Z. designed the research. H.Y. and J.L. collected the original EMR data. X.Z., H.L. and Q.S. performed the research. X.Z., H.L. and H.D. analyzed the data.

Corresponding authors

Correspondence to Qiang Shu or Haomin Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zeng, X., Hu, Y., Shu, L. et al. Explainable machine-learning predictions for complications after pediatric congenital heart surgery. Sci Rep 11, 17244 (2021). https://doi.org/10.1038/s41598-021-96721-w

Download citation

Received: 08 February 2021
Accepted: 12 August 2021
Published: 26 August 2021
DOI: https://doi.org/10.1038/s41598-021-96721-w

This article is cited by

Clinical assistant decision-making model of tuberculosis based on electronic health records
- Mengying Wang
- Cuixia Lee
- Cheng Yang
BioData Mining (2023)
Emerging infectious disease surveillance using a hierarchical diagnosis model and the Knox algorithm
- Mengying Wang
- Bingqing Yang
- Cheng Yang
Scientific Reports (2023)
Survey on Explainable AI: From Approaches, Limitations and Applications Aspects
- Wenli Yang
- Yuchen Wei
- Byeong Kang
Human-Centric Intelligent Systems (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks

Features derived from blood pressure and intracranial pressure predict elevated intracranial pressure events in critically ill children

A retrospective study of mortality for perioperative cardiac arrests toward a personalized treatment

Introduction

Results

Population characteristics

Data-driven clusters group blood pressure time-series data

Performance of the complication prediction model

Inspection of model features

Discussion

Conclusions

Methods

Study design and population

Data collection and pre-processing

Postoperative complication labels

Statistical analysis

Model development and evaluation

Ethics declarations

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Clinical assistant decision-making model of tuberculosis based on electronic health records

Emerging infectious disease surveillance using a hierarchical diagnosis model and the Knox algorithm

Survey on Explainable AI: From Approaches, Limitations and Applications Aspects

Comments

Search

Quick links