Interpretable machine learning for early neurological deterioration prediction in atrial fibrillation-related stroke

Kim, Seong-Hwan; Jeon, Eun-Tae; Yu, Sungwook; Oh, Kyungmi; Kim, Chi Kyung; Song, Tae-Jin; Kim, Yong-Jae; Heo, Sung Hyuk; Park, Kwang-Yeol; Kim, Jeong-Min; Park, Jong-Ho; Choi, Jay Chol; Park, Man-Seok; Kim, Joon-Tae; Choi, Kang-Ho; Hwang, Yang Ha; Kim, Bum Joon; Chung, Jong-Won; Bang, Oh Young; Kim, Gyeongmoon; Seo, Woo-Keun; Jung, Jin-Man

doi:10.1038/s41598-021-99920-7

Download PDF

Article
Open access
Published: 18 October 2021

Interpretable machine learning for early neurological deterioration prediction in atrial fibrillation-related stroke

Seong-Hwan Kim¹^na1,
Eun-Tae Jeon¹^na1,
Sungwook Yu²,
Kyungmi Oh³,
Chi Kyung Kim³,
Tae-Jin Song⁴,
Yong-Jae Kim⁵,
Sung Hyuk Heo⁶,
Kwang-Yeol Park⁷,
Jeong-Min Kim⁷,
Jong-Ho Park⁸,
Jay Chol Choi⁹,
Man-Seok Park¹⁰,
Joon-Tae Kim¹⁰,
Kang-Ho Choi¹¹,
Yang Ha Hwang¹²,
Bum Joon Kim¹³,
Jong-Won Chung¹⁴,
Oh Young Bang¹⁴,
Gyeongmoon Kim¹⁴,
Woo-Keun Seo¹⁴ &
…
Jin-Man Jung^1,15

Scientific Reports volume 11, Article number: 20610 (2021) Cite this article

3966 Accesses
15 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We aimed to develop a novel prediction model for early neurological deterioration (END) based on an interpretable machine learning (ML) algorithm for atrial fibrillation (AF)-related stroke and to evaluate the prediction accuracy and feature importance of ML models. Data from multicenter prospective stroke registries in South Korea were collected. After stepwise data preprocessing, we utilized logistic regression, support vector machine, extreme gradient boosting, light gradient boosting machine (LightGBM), and multilayer perceptron models. We used the Shapley additive explanation (SHAP) method to evaluate feature importance. Of the 3,213 stroke patients, the 2,363 who had arrived at the hospital within 24 h of symptom onset and had available information regarding END were included. Of these, 318 (13.5%) had END. The LightGBM model showed the highest area under the receiver operating characteristic curve (0.772; 95% confidence interval, 0.715–0.829). The feature importance analysis revealed that fasting glucose level and the National Institute of Health Stroke Scale score were the most influential factors. Among ML algorithms, the LightGBM model was particularly useful for predicting END, as it revealed new and diverse predictors. Additionally, the effects of the features on the predictive power of the model were individualized using the SHAP method.

A prehospital diagnostic algorithm for strokes using machine learning: a prospective observational study

Article Open access 15 October 2021

Multilayer perceptron-based prediction of stroke mimics in prehospital triage

Article Open access 26 October 2022

Predicting mortality in brain stroke patients using neural networks: outcomes analysis in a longitudinal study

Article Open access 28 October 2023

Introduction

Early neurological deterioration (END) is a sudden worsening of neurological symptoms during the acute period of stroke. END leads to devastating clinical outcomes despite marked advances in acute stroke management over the past several years. The incidence of END is considerably high, ranging from 5 to 40%, and is associated with a poor 3-month clinical prognosis and high mortality^1,2. The standard treatment strategy for END has not been established, and an accurate prediction of END is unavailable in clinical practice owing to its complexity and heterogeneity. In addition, there has been no consensus on the definition. Therefore, various inclusion criteria and study designs have been used, with some studies preferring to define END according to specific stroke subtypes (e.g., cardioembolism), making each predictor and recent nomograms difficult to use in real-world clinical practice^3,4,5,6. Those obstacles make it difficult to design a prospective early detection and early interventional study. Accurate prediction of END is of paramount importance not only for the prognostication but also to motivate prospective, early interventional studies to prevent or restore END in patients with stroke.

Of the etiologies attributed to cardioembolic stroke, atrial fibrillation (AF) is one of the predictors of END^7,8. Several markers, including clinical, radiological, and laboratory findings, have been associated with END in AF-related stroke^9,10,11. However, in those studies, using a single marker had limited predictive power, since the diverse biomarkers and imaging markers relevant to END in AF-related stroke were not considered at the same time.

Continuous advancements in machine learning (ML) algorithms have led to their wide application in the medical field, as numerous variables and massive data can be included and analyzed. In contrast to transitional statistical models, ML models are compatible with predicting complex clinical events that can be affected by diverse situations and conditions. Nevertheless, the clinical application of ML models has been limited owing to the ‘black box problem’ of interpretability and explanation¹². Therefore, it is essential that ML models be interpretable to the current medical fields¹³. The Shapley additive explanations (SHAP) method is a novel, cutting-edge method designed to aid in clinical interpretation and intuitive understanding of feature importance by providing visualizations of the relationship between each feature and the associated predictive power¹⁴. Therefore, the aim of our study was to develop an interpretable ML model that could predict END using the feature importance technique in AF-related stroke using a real-world multicenter cohort database.

Methods

Study design and participants

The dataset from this study can be provided by the corresponding author upon reasonable request.

This study was based on the Korean Atrial Fibrillation Evaluation Registry in Ischemic Stroke Patients (K-ATTENTION), a real-world cohort composed of prospective stroke registries from 11 tertiary centers in South Korea. K-ATTENTION focused on characteristics, oral anticoagulant use, and outcomes in AF-related stroke patients¹⁵. Between January 2013 and December 2015, patients who were admitted to one of the participating centers within 7 days of stroke onset were enrolled. Detailed information regarding management and follow-up of the included patients has been provided previously¹⁵. In our study, only those who arrived at the hospital within 24 h of symptom onset and had information regarding END were included. Using the internet-based clinical recording system, we acquired the following patient information from each center: demographic characteristics, vascular risk factors, brain imaging results, laboratory findings, pre-admission medication histories, stroke severity on admission (according to the National Institutes of Health Stroke Scale [NIHSS] score), and functional status (modified Rankin score [mRS]). Additional information on variable acquisition and evaluation is provided in Supplemental Table I and Supplemental Methods I. This study followed the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) reporting guidelines¹⁶. The institutional review boards of Korea University Ansan Hospital (2016AS0051), Korea university Anam hospital, Korea University Guro Hospital, Ewha University College of Medicine, Eunpyeong St. Mary’s Hospital, Kyung Hee University College of Medicine, Chung-Ang University Hospital, Hanyang University Myongji Hospital, Jeju National University, Chonnam National University Hospital, Chonnam National University Hwasun Hospital, Kyungpook National University Hospital, Asan Medical Center, and Samsung Medical Center approved the study. The need for informed consent was waived by the ethics committee of all participating centers due to the retrospective design of the study using anonymous and de-identified information.

Definition of END as the main outcome

END was defined as an increase of at least 2 points in the total NIHSS score and at least 1 point on the level of consciousness or motor item score within 72 h of arrival at the hospital¹⁷.

Data splitting and preprocessing

Binary variables with less than 80% missing values and multinomial and numeric variables with less than 60% missing values were included to generate the available dataset¹⁸. In the first step, 25% of the dataset was randomly separated according to END stratification and used only in the final evaluation of model performance as a test set. The remaining 75% of the dataset was used as a training set for hyperparameter determination and training processes using leave-one-out cross-validation. In this data splitting process, we used the stratified random sampling method, stratifying institution sites to reduce the multi-site correction problem. Isolation forest and multivariate imputation by chained equations were used for outlier detection and imputation. Details of the methods are provided in Supplemental Methods II.

Feature selection and feature importance analysis

Recursive feature elimination¹⁹ was used to select the top-k ranked features that contributed to the overall model performance of the area under the receiver operating characteristic curve (AUROC). In this feature selection analysis, we also included institution site variable to evaluate the multi-site correction issue. Since the purpose of this study was to evaluate a predictive model based on variables that can be obtained at the time of admission to the hospital, variables that cannot be evaluated at the initial time point were excluded. To measure and rank the contribution of each variable, we obtained mean absolute SHAP values¹⁴ with a gradient boosted tree-based model, light gradient boosting machine (LightGBM)²⁰, which can deal natively with categorical features²¹ using leave-one-out cross-validation. The positive SHAP value for each variable indicated that the variable contributed positively to the model’s positive prediction, and vice versa. We performed an additional stepwise process to prevent underestimation of the relative importance of features due to multicollinearity, the details of which are described in Supplemental Methods III.

Modeling

We selected and tested one conventional statistical model, logistic regression²², as a baseline comparator, and four popular ML models: support vector machine²³, extreme gradient boosting²⁴ (XGBoost), Light GBM, and multilayer perceptron²⁵ (MLP) with a basic architecture. Detailed instructions for the applied models are provided in Supplemental Methods IV. All the processes were implemented in Python 3.8.2 with TensorFlow-GPU 2.4.0²⁶ and scikit-learn 0.22.1²⁷ libraries.

Primary outcome and evaluation criteria

AUROC was chosen as a primary evaluation metric for model performance, and all cross-validation and early stopping strategies in the modeling process were performed to maximize the AUROC score. The models were evaluated for the frequency of confident answers and errors, with a threshold of 0.50.

Statistical analysis

Categorical variables are presented as number (percentage), and continuous variables are presented as mean ± standard deviation or median (interquartile range), as appropriate. A simple comparison was performed using the χ² test for categorical variables and the Kruskal–Wallis test for continuous variables. Data analyses were performed using IBM SPSS version 20 software (IBM Corp. Armonk, NY, USA). The AUROC, with a 95% confidence interval (CI), was calculated using the Delong method and a CI that spanned 0.50 or more was not considered statistically different from a random performance²⁸. To evaluate the calibration error of the models, the Brier score, which is the mean squared difference between the predicted probability and the actual outcome, was calculated²⁹, with a lower score indicating better probabilistic prediction accuracy. In addition, the area under the precision-recall curve, accuracy, precision, recall, and F1 score were calculated as secondary outcome metrics. We also calculated the sensitivity, specificity, and precision values for various thresholds. The significance level was set at p< 0.05 and Bonferroni correction was used for multiple comparisons of the AUROC between models.

Results

Comparisons of baseline characteristics

Figure 1 shows the patient flow chart. A total of 2,363 patients were included in this study, of whom 318 (13.5%) had END. Comparisons of baseline clinical characteristics and MRI variables are listed in Supplemental Tables II and III.

Missing value imputation

The binary variables with missing values over 80% and multinomial and numeric variables with missing values over 60% were excluded from the model construction dataset according to the missing data imputation strategy described in a previous study³⁰. The variables were as follows: all Holter monitoring parameters, smoking pack-years, alcohol consumption, duration of PR and P-axis wave on electrocardiogram, susceptibility vessel sign (SVS) size, urine albumin, serum free fatty acid level, brain natriuretic peptide (BNP), N-terminal pro-BNP, and troponin T. The remaining missing values were imputed, with non-categorical missing values imputed using the multivariate imputation by chained equations imputation method, and categorical missing values were replaced with a single constant of -1. Details concerning the number of missing values for each variable are listed in Supplemental Table IV.

Model performances

A flow diagram of the ML model development process is presented in Supplemental Figure I. The performance of each model is shown in Table 1, and the receiver operating characteristic curve and precision-recall curve are shown in Fig. 2. LightGBM had the highest AUROC value (0.772 [0.715–0.829]); however, there was no significant difference between the ML models. Light GBM and MLP had significantly higher AUROC values than logistic regression (p = 0.003 and 0.002, respectively). At various discrimination thresholds, the sensitivity, specificity, and precision of the model were calculated, and our model showed relatively superior performance for specificity.

Table 1 Comparison of model performance.

Full size table

Identification of important features

From the recursive feature elimination, a total of 23 features were selected as important features. The SHAP feature importance matrix plots show important features according to the degree of contribution (bar plot, Fig. 3A) and the overall correlation and directionality between features and the SHAP value (violin plot, Fig. 3B) during model construction. Among them, fasting glucose levels and initial NIHSS score contributed the most to the model. The next highest-ranking features were the initial mRS and initial glucose level. All other features contributed less to the model. It was confirmed that the institute site variable had not significantly affected the model. In addition, most of the continuous variables, such as fasting glucose, initial NIHSS score, initial mRS, initial glucose, QRS axis, alkaline phosphatase, homocysteine, fibrin degradation product, initial diastolic blood pressure, D-dimer, hematocrit, total cholesterol, and T axis tended to be positively correlated with END. Activated partial thromboplastin time, aspartate aminotransferase, total bilirubin, and low-density lipoprotein (LDL) cholesterol showed complex patterns with mixed positive and negative trends. LA diameter and uric acid levels showed a negative correlation.

SHAP values corresponding to changes in the four representative features are presented in partial SHAP dependence plots (Fig. 4), and other representative feature plots are listed in Supplemental Figure II. The fasting glucose level and initial NIHSS score showed a positive correlation with the sigmoid or double sigmoid curve. The LA diameter declined negatively. LDL cholesterol was associated with a U-shaped trend line that initially showed a declining tendency followed by a reversed, increasing trend. The cut-off value for each variable that could predict the positive and/or negative probability of END occurrence is marked on each graph.

Lateralization of ischemic lesions, concomitant intracranial atherosclerosis, SVS signs, and hemorrhagic transformation were included as categorical variables. The presence of concomitant intracranial atherosclerosis, SVS signs, and symptomatic ICH among hemorrhagic transformations is likely to related to END occurrence. Posterior circulation lesions were unlikely to develop END (Supplemental Figure II).

In addition, we acquired information about the importance and contribution of each patient according to the specific features selected during modeling. Representative cases are summarized in Supplemental Figure III.

Discussion

In this study, we first demonstrated that integrated ML algorithms can be applied to predict END in AF-related stroke cases. Among the ML models investigated, LightGBM had the best performance, with an AUROC value of 0.772. This is a novel method with efficient computational power and wide scalability for processing categorical, multidimensional, and incredibly large datasets²⁰, which makes it a suitable ML model in clinical settings. In addition, this model was implemented using SHAP, which can visualize the level of contribution and directionality of specific input features using the entire dataset as well as individual patient information.

The highest contributing feature in our study was the fasting glucose level, followed by the initial NIHSS score. These variables have been consistently reported as risk factors for END in all-type as well as AF-related stroke cases^2,10,11. A possible explanation is that the impairment of glucose control causes vascular endothelial dysfunction³¹, post-ischemic inflammatory response, and neuroprotective heat-shock chaperone gene attenuation³², which could exacerbate post-stroke brain damage through increasing lactate production and leading to the breakdown of the blood–brain barrier, development of brain edema and hemorrhagic transformation, and enlargement of infarct volume¹⁸. The initial neurological functional deficits represented using the NIHSS score and mRS were also known to be prone with symptomatic intracranial hemorrhage, malignant edema or stroke-related infection³³, which are important causes of END². In fact, symptomatic cerebral hemorrhage of the hemorrhagic transformation subtype was positively associated with END in this study. In addition, homocysteine, which is related to vascular endothelial dysfunction³, and fibrin degradation product and D-dimer, which are important hematologic markers related to the coagulation system and thrombosis, were important features similar to previous studies^34,35,36. Other features were SVS presence implying large-size infarction; specific ischemic lesion location limited to anterior or posterior circulation³⁷; cardiac electrophysiological, and echocardiographic markers such as QRS axis, T axis and left atrium diameter; alkaline phosphatase^38,39 as surrogate markers of atherosclerosis, systemic inflammation, malnutrition, or metabolic syndrome; and the burden of atherosclerosis, such as concurrent intracranial atherosclerosis^37,40. Among cholesterol lipoproteins, total cholesterol and LDL were included as important features in this study, which have been previously reported as important predictors⁴¹.

Interestingly, the clinical implication of cut-off values in selected features may be applicable to real-world clinical practice. With regard to initial stroke severity measured using the NIHSS, cut-off values in the SHAP partial dependence plot were presented according to the effect direction of END prediction, suggesting that patients with severe stroke (NIHSS ≥ 15) tended to develop END, thus emphasizing that awareness and close medical attention are necessary for these patients, and patients with mild to moderate stroke (NIHSS ≤ 6) have a lower chance of developing END. Some cut-off values were statistically significantly similar to the clinical values. Indeed, the cut-off value for fasting glucose predicting END in our study was 116 mg/dL, which corresponds to the current diagnostic criteria for diabetes mellitus (≥ 126 mg/dL)⁴².

The SHAP and its corresponding graphs, which were used to evaluate the effect of continuous variables on the prediction of END, were characterized by four patterns. First, a positive correlation with or without a sigmoid or double sigmoid shape was observed. The initial glucose level, fasting glucose level, initial NIHSS score, homocysteine, D-dimer, fibrin degradation product, initial diastolic blood pressure, total cholesterol, QRS-axis, and T-axis corresponded to this pattern. Most of these variables have been reported as predictors of END in previous studies². Second, a U-shaped or J-shaped pattern with both cut-off values was observed for aspartate aminotransferase, alkaline phosphatase, total bilirubin, and LDL cholesterol. The lower cut-off value of each feature may have been associated with poor nutritional status and over the upper cut-off value may imply comorbid conditions including liver disease and hyperlipidemia. However, it is not possible to investigate the underlying pathomechanisms of these phenomena in this study. Third, the following had a negative correlation with END, with a reverse S or J shape: LA diameter and uric acid. In particular, the negative association between LA diameter and END is not consistent with the positive correlation found in a previous report⁴³. However, more accurate parameters, such as the LA volume index, have recently been identified as important predictors. Considerable imputation (21.1%) could lead to incorrect directions and biased results. Finally, a bizarre pattern with multidirectionality was observed in the activated partial thromboplastin time.

One strength of our study is that our interpretable ML model was constructed using many variables, including demographics and laboratory, radiological, and echocardiographic findings, all of which can be obtained upon arrival at the hospital. Additionally, an interpretable and explainable ML model was created to promote the use of applications for making clinical decisions. Our study demonstrates the potential of interpretable ML methods to predict END and individualize such predictions. Previous studies have focused on each risk factor individually and its pathophysiological interpretation, but there has been a shortage of clinical use of a large combination of variables once^3,4,5. Moreover, no standardized risk stratification scheme for predicting END has been available until now. Therefore, our ML model has the advantage of being able to predict END using diverse variables extracted from real-world clinical situations upon arrival at the hospital.

Our study has some limitations. First, the implementation and evaluation of the model were difficult to generalize because of the lack of external validation. Although this study is based on a multicenter dataset, it is difficult to clearly evaluate the exportability of the model if external validation is not carried out, particularly considering that Light GBM is prone to an overfitting problem. However, to the best of our knowledge, this is the first ML study based on a multicenter and nationwide dataset reflecting various environments across centers. This could partially contribute to the generalizability and representativeness of our ML model because our model could be generally applicable in various external conditions. Nonetheless, further verification is required through well-designed prospective clinical studies and external validation in the future. Second, since this was a registry-based study with a retrospective design, the ML model’s performance is not sufficient to be an absolute criterion for clinical use. It is necessary to develop a more accurate prediction model, and discover novel biomarkers, especially using neuroimaging with more advanced analysis methodology, for a deeper understanding of the pathophysiology, in parallel. Third, a considerable amount of data was missing because of the multicenter retrospective nature of the study. Although imputation of missing data was performed using the ML technique, the results may be biased and contradict previous findings. In particular, it seemed to occur with some elements (such as left atrial size) that were less important. In addition, laboratory and imaging protocols in each center were not concretely established before data collection. Additionally, Holter and electrocardiography parameters were not standardized; therefore, many variables were excluded.

In conclusion, ML algorithms, using the LightGBM model in particular, can be used to predict END in AF-related stroke cases. New and diverse predictors for END were revealed through this ML model, suggesting that the pathophysiology of END development could be a complex mechanism. Further verification through prospective clinical studies is required.

References

Siegler, J. E. & Martin-Schild, S. Early neurological deterioration (END) after stroke: the END depends on the definition. Int. J. Stroke 6(3), 211–212. https://doi.org/10.1111/j.1747-4949.2011.00596.x (2011).
Article PubMed Google Scholar
Seners, P., Turc, G., Oppenheim, C. & Baron, J. C. Incidence, causes and predictors of neurological deterioration occurring within 24 h following acute ischaemic stroke: A systematic review with pathophysiological implications. J. Neurol. Neurosurg. Psychiatry 86(1), 87–94. https://doi.org/10.1136/jnnp-2014-308327 (2015).
Article PubMed Google Scholar
Kwon, H. M., Lee, Y. S., Bae, H. J. & Kang, D. W. Homocysteine as a predictor of early neurological deterioration in acute ischemic stroke. Stroke 45(3), 871–873. https://doi.org/10.1161/STROKEAHA.113.004099 (2014).
Article CAS PubMed Google Scholar
Sun, W. et al. Asymmetrical cortical vessel sign on susceptibility-weighted imaging: A novel imaging marker for early neurological deterioration and unfavorable prognosis. Eur. J. Neurol. 21(11), 1411–1418. https://doi.org/10.1111/ene.12510 (2014).
Article CAS PubMed Google Scholar
Seo, W. K. et al. C-reactive protein is a predictor of early neurologic deterioration in acute ischemic stroke. J. Stroke Cerebrovasc. Dis. 21(3), 181–186. https://doi.org/10.1016/j.jstrokecerebrovasdis.2010.06.002 (2012).
Article PubMed Google Scholar
Gong, P. et al. A novel nomogram to predict early neurological deterioration in patients with acute ischaemic stroke. Eur. J. Neurol. 27(10), 1996–2005. https://doi.org/10.1111/ene.14333 (2020).
Article CAS PubMed Google Scholar
Kwan, J. & Hand, P. Early neurological deterioration in acute stroke: Clinical characteristics and impact on outcome. QJM 99(9), 625–633. https://doi.org/10.1093/qjmed/hcl082 (2006).
Article CAS PubMed Google Scholar
Thanvi, B., Treadwell, S. & Robinson, T. Early neurological deterioration in acute ischaemic stroke: Predictors, mechanisms and management. Postgrad. Med. J. 84(994), 412–417. https://doi.org/10.1136/pgmj.2007.066118 (2008).
Article CAS PubMed Google Scholar
Hong, H. J. et al. Early neurological outcomes according to CHADS2 score in stroke patients with non-valvular atrial fibrillation. Eur. J. Neurol. 19(2), 284–290. https://doi.org/10.1111/j.1468-1331.2011.03518.x (2012).
Article CAS PubMed Google Scholar
Kim, J. S. et al. Pre-stroke glycemic control is associated with early neurologic deterioration in acute atrial fibrillation-related ischemic stroke. eNeurologicalSci 8, 17–21. https://doi.org/10.1016/j.ensci.2017.06.005 (2017).
Article PubMed PubMed Central Google Scholar
Duan, Z. et al. Relationship between high-sensitivity C-reactive protein and early neurological deterioration in stroke patients with and without atrial fibrillation. Heart Lung 49(2), 193–197. https://doi.org/10.1016/j.hrtlng.2019.10.009 (2020).
Article PubMed Google Scholar
Yu, M. K. et al. Visible machine learning for biomedicine. Cell 173(7), 1562–1565. https://doi.org/10.1016/j.cell.2018.05.056 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tjoa, E. & Guan, C. A survey on explainable artificial intelligence (XAI): Toward medical XAI. IEEE Trans. Neural Netw. Learn. Syst. 1, 1–21. https://doi.org/10.1109/TNNLS.2020.3027314 (2020).
Article Google Scholar
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9 (2020).
Article PubMed PubMed Central Google Scholar
Jung, J. M. et al. Long-term outcomes of real-world Korean patients with atrial-fibrillation-related stroke and severely decreased ejection fraction. J. Clin. Neurol. 15(4), 545–554. https://doi.org/10.3988/jcn.2019.15.4.545 (2019).
Article PubMed PubMed Central Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. BMJ 350, g7594. https://doi.org/10.1002/bjs.9736 (2015).
Article PubMed Google Scholar
Nam, K. W. et al. D-dimer as a predictor of early neurologic deterioration in cryptogenic stroke with active cancer. Eur. J. Neurol. 24(1), 205–211. https://doi.org/10.1111/ene.13184 (2017).
Article PubMed Google Scholar
McBride, D. W. et al. Acute hyperglycemia is associated with immediate brain swelling and hemorrhagic transformation after middle cerebral artery occlusion in rats. Acta Neurochir. Suppl. 121, 237–241. https://doi.org/10.1007/978-3-319-18497-5_42 (2016).
Article PubMed Google Scholar
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46(1/3), 389–422. https://doi.org/10.1023/A:1012487302797 (2002).
Article MATH Google Scholar
Ke, G., et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Paper presented at: Advances in Neural Information Processing Systems 3146–3154 (2017)
Hyland, S. L. et al. Early prediction of circulatory failure in the intensive care unit using machine learning. Nat. Med. 26(3), 364–373. https://doi.org/10.1038/s41591-020-0789-4 (2020).
Article CAS PubMed Google Scholar
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R. & Lin, C.-J. Liblinear: A library for large linear classification. JMLR 9, 1871–1874 (2008).
MATH Google Scholar
Chang, C.-C. & Lin, C.-J. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 1–27. https://doi.org/10.1145/1961189.1961199 (2011).
Article Google Scholar
Chen, T. & Guestrin, C. Xgboost: A Scalable Tree Boosting System. Paper presented at: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining 785–794 (2016)
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521(7553), 436–444. https://doi.org/10.1038/nature14539 (2015).
Article ADS CAS PubMed Google Scholar
Abadi, M., et al. Tensorflow: A system for large-scale machine learning. Paper presented at: 12th USENIX symposium on operating systems design and implementation (OSDI 16) 265–283 (2016)
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. JMLR 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44(3), 837–845. https://doi.org/10.2307/2531595 (1988).
Article CAS PubMed MATH Google Scholar
Rufibach, K. Use of Brier score to assess binary predictions. J. Clin. Epidemiol. 63(8), 938–9; author reply 939 (2010). https://doi.org/10.1016/j.jclinepi.2009.11.009
Li, X., et al. Integrated machine learning approaches for predicting ischemic stroke and thromboembolism in atrial fibrillation. AMIA Annu. Symp. Proc. 2016, 799–807 (2016).
Jamwal, S. & Sharma, S. Vascular endothelium dysfunction: A conservative target in metabolic disorders. Inflam. Res. 67(5), 391–405. https://doi.org/10.1007/s00011-018-1129-8 (2018).
Article CAS Google Scholar
Tureyen, K., Bowen, K., Liang, J., Dempsey, R. J. & Vemuganti, R. Exacerbated brain damage, edema and inflammation in type-2 diabetic mice subjected to focal ischemia. J. Neurochem. 116(4), 499–507. https://doi.org/10.1111/j.1471-4159.2010.07127.x (2011).
Article CAS PubMed PubMed Central Google Scholar
Elkind, M. S., Boehme, A. K., Smith, C. J., Meisel, A. & Buckwalter, M. S. Infection as a stroke risk factor and determinant of outcome after stroke. Stroke 51, 3156–3168. https://doi.org/10.1161/STROKEAHA.120.030429 (2020).
Article CAS PubMed PubMed Central Google Scholar
Barber, M., Langhorne, P., Rumley, A., Lowe, G. D. & Stott, D. J. Hemostatic function and progressing ischemic stroke: D-dimer predicts early clinical progression. Stroke 35(6), 1421–1425. https://doi.org/10.1161/01.STR.0000126890.63512.41 (2004).
Article PubMed Google Scholar
Martin, A. J. & Price, C. I. A systematic review and meta-analysis of molecular biomarkers associated with early neurological deterioration following acute stroke. Cerebrovasc. Dis. 46(5–6), 230–241. https://doi.org/10.1159/000495572 (2018).
Article CAS PubMed Google Scholar
Donkel, S. J., Benaddi, B., Dippel, D. W. J., Ten Cate, H. & de Maat, M. P. M. Prognostic hemostasis biomarkers in acute ischemic stroke. Arterioscler. Thromb. Vasc. Biol. 39(3), 360–372. https://doi.org/10.1161/ATVBAHA.118.312102 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kim, J. T. et al. MRI findings may predict early neurologic deterioration in acute minor stroke or transient ischemic attack due to intracranial atherosclerosis. Eur. Neurol. 64(2), 95–100. https://doi.org/10.1159/000315138 (2010).
Article PubMed Google Scholar
Kim, J. et al. Serum alkaline phosphatase and phosphate in cerebral atherosclerosis and functional outcomes after cerebral infarction. Stroke 44(12), 3547–3549. https://doi.org/10.1161/STROKEAHA.113.002959 (2013).
Article CAS PubMed Google Scholar
Uehara, T., Yoshida, K., Terasawa, H., Shimizu, H. & Kita, Y. Increased serum alkaline phosphatase and early neurological deterioration in patients with atherothrombotic brain infarction attributable to intracranial atherosclerosis. eNeurologicalSci 20, 1053. https://doi.org/10.1016/j.ensci.2020.100253 (2020).
Article Google Scholar
Lee, S. J. & Lee, D. G. Distribution of atherosclerotic stenosis determining early neurologic deterioration in acute ischemic stroke. PLoS ONE 12(9), e0185314. https://doi.org/10.1371/journal.pone.0185314 (2017).
Article CAS PubMed PubMed Central Google Scholar
Geng, H. H. et al. Early neurological deterioration during the acute phase as a predictor of long-term outcome after first-ever ischemic stroke. Med. (Baltim.) 96(51), e9068. https://doi.org/10.1097/MD.0000000000009068 (2017).
Article Google Scholar
American Diabetes Association. 2. Classification and diagnosis of diabetes: Standards of medical care in Diabetes-2020. Diabetes Care 43(Suppl 1), S14–S31 (2020). https://doi.org/10.2337/dc20-S002.
Samai, A. A., Albright, K., Navalkele, D., Alemayehu, C. & Martin-Schild, S. Left atrial enlargement is associated with neuroworsening and worse short-term outcomes in acute ischemic stroke. Stroke 50, AWP272-AWP272, abstract WP272: (2019).

Download references

Funding

The authors disclose receipt of the following financial support for the research, authorship, and publication of this article: the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (NRF-2020R1C1C1009294) and Korea University Grant. The funders had no role in the study design; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Author information

These authors contributed equally: Seong-Hwan Kim and Eun-Tae Jeon.

Authors and Affiliations

Department of Neurology, Korea University Ansan Hospital, Korea University College of Medicine, Gojan 1-Dong, Danwon-Gu, Ansan-Si, Gyeonggi-Do, 15355, South Korea
Seong-Hwan Kim, Eun-Tae Jeon & Jin-Man Jung
Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, South Korea
Sungwook Yu
Department of Neurology, Korea University Guro Hospital, Korea University College of Medicine, Seoul, South Korea
Kyungmi Oh & Chi Kyung Kim
Department of Neurology, Seoul Hospital, Ewha University College of Medicine, Seoul, South Korea
Tae-Jin Song
Department of Neurology, Eunpyeong St. Mary’s Hospital, The Catholic University of Korea, Seoul, Korea
Yong-Jae Kim
Department of Neurology, Kyung Hee University College of Medicine, Seoul, South Korea
Sung Hyuk Heo
Department of Neurology, Chung-Ang University College of Medicine, Chung-Ang University Hospital, Seoul, South Korea
Kwang-Yeol Park & Jeong-Min Kim
Department of Neurology, Hanyang University Myongji Hospital Seoul, Seoul, South Korea
Jong-Ho Park
Department of Neurology, Jeju National University, Jeju, South Korea
Jay Chol Choi
Department of Neurology, Chonnam National University Hospital, Chonnam, South Korea
Man-Seok Park & Joon-Tae Kim
Department of Neurology, Chonnam National University Hwasun Hospital, Hwasun, South Korea
Kang-Ho Choi
Department of Neurology, Kyungpook National University Hospital, Dae-gu, South Korea
Yang Ha Hwang
Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, South Korea
Bum Joon Kim
Department of Neurology and Stroke Center, Samsung Medical Center, 81, Irwon-Ro, Gangnam-Gu, Seoul, 06351, South Korea
Jong-Won Chung, Oh Young Bang, Gyeongmoon Kim & Woo-Keun Seo
Korea University Zebrafish Translational Medical Research Center, Ansan, South Korea
Jin-Man Jung

Authors

Seong-Hwan Kim
View author publications
You can also search for this author in PubMed Google Scholar
Eun-Tae Jeon
View author publications
You can also search for this author in PubMed Google Scholar
Sungwook Yu
View author publications
You can also search for this author in PubMed Google Scholar
Kyungmi Oh
View author publications
You can also search for this author in PubMed Google Scholar
Chi Kyung Kim
View author publications
You can also search for this author in PubMed Google Scholar
Tae-Jin Song
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Jae Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sung Hyuk Heo
View author publications
You can also search for this author in PubMed Google Scholar
Kwang-Yeol Park
View author publications
You can also search for this author in PubMed Google Scholar
Jeong-Min Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Ho Park
View author publications
You can also search for this author in PubMed Google Scholar
Jay Chol Choi
View author publications
You can also search for this author in PubMed Google Scholar
Man-Seok Park
View author publications
You can also search for this author in PubMed Google Scholar
Joon-Tae Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kang-Ho Choi
View author publications
You can also search for this author in PubMed Google Scholar
Yang Ha Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Bum Joon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Won Chung
View author publications
You can also search for this author in PubMed Google Scholar
Oh Young Bang
View author publications
You can also search for this author in PubMed Google Scholar
Gyeongmoon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Woo-Keun Seo
View author publications
You can also search for this author in PubMed Google Scholar
Jin-Man Jung
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.K.S. and J.M.J. conceived and designed the research. S.H.K. contributed to the drafting of the manuscript, statistical analysis, and interpretation of data. E.T.J. contributed to the machine learning model construction, interpretation of data and drafting of the manuscript. S.W.Y., K.M.O., C.K.K., T.J.S., T.J.K., S.H.H., K.Y.P., J.M.K., J.H.P., J.C.C., M.S.P., J.T.K., K.H.C., YHH, BJK, JWC, OYB, and GMK were involved in the acquisition of data and take responsibility for the integrity of data. All authors approved the final manuscript.

Corresponding authors

Correspondence to Woo-Keun Seo or Jin-Man Jung.

Ethics declarations

Competing interests

The authors declare the following potential conflicts of interest with respect to the research, authorship, and publication of this article: J-M Jung has received lecture honoraria from Pfizer, Sanofi-Aventis, Ostuka, Dong-A, and Hanmi Pharmaceutical Co., Ltd; consulting fees from Daewoong Pharmaceutical Co., Ltd. WK Seo received honoraria for lectures from Pfizer, Sanofi-Aventis, Otsuka Korea, Dong-A Pharmaceutical Co., Ltd., Beyer, Daewoong Pharmaceutical Co. Ltd., Daiichi Sankyo Korea Co., Ltd., and Boryung Pharmaceutical Co., Ltd.; a study grant from Daiichi Sankyo Korea Co., Ltd.; and consulting fees from OBELAB Inc. All other authors have no coompeting interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, SH., Jeon, ET., Yu, S. et al. Interpretable machine learning for early neurological deterioration prediction in atrial fibrillation-related stroke. Sci Rep 11, 20610 (2021). https://doi.org/10.1038/s41598-021-99920-7

Download citation

Received: 21 April 2021
Accepted: 23 September 2021
Published: 18 October 2021
DOI: https://doi.org/10.1038/s41598-021-99920-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.