Identifying potential (re)hemorrhage among sporadic cerebral cavernous malformations using machine learning

Li, Xiaopeng; Jones, Peng; Zhao, Mei

doi:10.1038/s41598-024-61851-4

Download PDF

Article
Open access
Published: 14 May 2024

Identifying potential (re)hemorrhage among sporadic cerebral cavernous malformations using machine learning

Xiaopeng Li¹^na1,
Peng Jones²^na1 &
Mei Zhao³

Scientific Reports volume 14, Article number: 11022 (2024) Cite this article

215 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The (re)hemorrhage in patients with sporadic cerebral cavernous malformations (CCM) was the primary aim for CCM management. However, accurately identifying the potential (re)hemorrhage among sporadic CCM patients in advance remains a challenge. This study aims to develop machine learning models to detect potential (re)hemorrhage in sporadic CCM patients. This study was based on a dataset of 731 sporadic CCM patients in open data platform Dryad. Sporadic CCM patients were followed up 5 years from January 2003 to December 2018. Support vector machine (SVM), stacked generalization, and extreme gradient boosting (XGBoost) were used to construct models. The performance of models was evaluated by area under receiver operating characteristic curves (AUROC), area under the precision-recall curve (PR-AUC) and other metrics. A total of 517 patients with sporadic CCM were included (330 female [63.8%], mean [SD] age at diagnosis, 42.1 [15.5] years). 76 (re)hemorrhage (14.7%) occurred during follow-up. Among 3 machine learning models, XGBoost model yielded the highest mean (SD) AUROC (0.87 [0.06]) in cross-validation. The top 4 features of XGBoost model were ranked with SHAP (SHapley Additive exPlanations). All-Elements XGBoost model achieved an AUROCs of 0.84 and PR-AUC of 0.49 in testing set, with a sensitivity of 0.86 and a specificity of 0.76. Importantly, 4-Elements XGBoost model developed using top 4 features got a AUROCs of 0.83 and PR-AUC of 0.40, a sensitivity of 0.79, and a specificity of 0.72 in testing set. Two machine learning-based models achieved accurate performance in identifying potential (re)hemorrhages within 5 years in sporadic CCM patients. These models may provide insights for clinical decision-making.

Screening and diagnosis of cardiovascular disease using artificial intelligence-enabled cardiac magnetic resonance imaging

Article Open access 13 May 2024

AI-enabled electrocardiography alert intervention and all-cause mortality: a pragmatic randomized clinical trial

Article 29 April 2024

Prediction of tumor origin in cancers of unknown primary origin with cytology-based deep learning

Article Open access 16 April 2024

Introduction

Cerebral cavernous malformations (CCM), mostly caused by loss-of-function of mutations genes¹, are vascular lesions of the brain with a risk of causing intracerebral hemorrhage (ICH)^1,2,3. CCM show a familial or sporadic form⁴, and also could be detected after radiation therapy⁵, almost 20% of CCM found with multiple locations^6,7. These CCM-related ICH mainly caused headaches, seizures, impaired consciousness, and focal neurological deficits⁸. A meta-analysis with 7 patient cohorts demonstrated that a 5-year ICH risk for CCM was 15.8% using reported standards⁹. As the most feared complication, symptomatic (re)hemorrhage is the primary aim for CCM management⁴, especially repetitive hemorrhage leading to being disabled and fatal^10,11. Most previous studies focused on identifying risk factors of ICH among CCM patients^7,12,13,14. A report with a dataset containing 731 CCM patients followed up from 2003 to 2018 based on Cox proportional hazards model showed that prior ICH and brainstem localization were associated with a higher risk of (re)-hemorrhage¹⁵. Using this large and invaluable dataset, machine learning models also could be constructed to detect potential (re)-hemorrhage with several clinical records of CCM patients. Identifying the potential (re)bleeding in advance among CCM patients and initiating prompt treatment, such as surgical resection, conservative treatment^1,4 or long-term antithrombotic therapy use^16,17,18,19, is essential for CCM management.

However, the established machine learning model for detecting potential (re)hemorrhage among CCM patients is still lacking. The prediction models based on machine learning algorithms showed robust performance in various areas including medical events^20,21,22,23. Thus, we suppose that machine learning algorithms might make it possible to yield accurate prediction models, even providing limited medical information about CCM patients.

The present study aimed to develop and validate prediction models that could distinguish sporadic CCM patients of potential (re)hemorrhage from those without risk of (re)hemorrhage within 5 years. Here, we report machine learning models with comparatively high predictability for identifying potential (re)hemorrhage within 5 years, which may provide insights for clinical decision-making for the treatment of sporadic CCM patients.

Methods

Participants

This study included a dataset of 731 sporadic CCM patients in the data platform Dryad, the collection of which was approved by university institutional review and written consent was acquired from all patients^15,24. These consecutively admitted patients were prospectively followed up 5 years from January 1, 2003, to December 31, 2018¹⁵. Hemorrhage during registration and occurrence of (re)hemorrhage in follow-up were evaluated by reported standards, and 64% completeness of follow-up with a high censoring rate was due to surgical treatment¹⁵.

Study design and feature selection

We selected 517 sporadic CCM patients and 12 features in this dataset, which include: age at diagnosis, sex, supratentorial CCM, CCM at brain stem, CCM at infratentorial nonbrain stem, CCM volume, developmental venous anomaly (DVA), hypercholesterolemia, hypertension, diabetes, prior ICH, (re)hemorrhage during follow-up within 5 years. For patients with prior ICH, CCM volume was measured via the sum of CCM lesion and hemorrhage lesion¹⁵. Patients during follow-up without (re)hemorrhage receiving surgical treatment and those with missing information about surgery information in follow-up were excluded. Missing values in features of hypercholesterolemia, diabetes, and hypertension were imputed using multiple imputation by chained equations (MICE) with the aid of python module miceforest²⁵.

Machine learning algorithm and dealing with imbalanced data

Support vector machine (SVM), one robust supervised machine learning method, is used for analyzing datasets for classification and regression^26,27. Extreme gradient boosting (XGBoost) is an ensemble learning algorithm based on decision trees²⁸, showing accurate performance in the medical field^29,30. Stacked generalization, often termed as stacking, super learning, or stacked regression^31,32, combines multiple base classifiers with a final classifier aiming at reducing biases. Stacking is a common method to ensemble various algorithms into a powerful learner³¹. For our stacking model, we implemented decision trees³³, random forests, gradient boosted decision trees (GBDT)³⁴, SVM, multi-layer perceptron³⁵, and k nearest neighbors³⁶ as the base estimators, and logistic regression as the final estimator.

Among all included 517 sporadic CCM patients, 76 patients occurred (re)hemorrhage during follow-up, yielding imbalance. Dealing with imbalanced data for machine learning algorithms is challenging in academia and industry³⁷. Random under-sampling has been adopted to reduce the majority class^20,21, to aid the algorithm in identifying the minority class. In the training and validation cohort, we applied random under-sampling to reduce the size of sporadic CCM patients without (re)hemorrhage.

Model development and feature importance

The dataset was randomly split into the training and validation cohort (80%) and the testing cohort (20%). The prediction models were built with the aid of the efficient tool scikit-learn (version 1.0.2) and other modules (pandas, numpy, matplotlib). The hyperparameters were tuned to maximize the area under the receiver operating characteristic curves (AUROC) with the aid of GridSearchCV in the training and validation cohort. We trained three models using three repeats of five-fold stratified cross-validation. Models performance, including precision, recall, and F-score, was also calculated in the process of cross-validation. The search space for hyperparameters and the chosen values for all models are shown in Supplementary Table S1. Other parameters were set as default values.

We explored the importance ranking of features based on the XGBoost model interpreted by SHAP (SHapley Additive exPlanations). The 4-Elements model was built using the top 4 features (CCM volume, prior ICH, CCM at brain stem, age at diagnosis).

Model performance in testing cohort

To validate the performance of the XGBoost models, we calculated the AUROC and the area under the precision recall curve (PR-AUC). The evaluation metrics, including sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio, and negative likelihood ratio, were also computed.

Statistical analysis

The normally distributed continuous variable was analyzed via a two-sided t-test, whereas the Mann–Whitney test was conducted for nonnormally distributed continuous variables. Categorical data were performed via χ² test including continuity correction in case of low frequencies. Statistical significance was set at p < 0.05 (two-sided). Statistical analyses were conducted with the use of SAS software, version 9.4 (SAS Institute Inc).

Ethical considerations

The dataset of sporadic CCM patients is sourced from the open data platform Dryad. Standard protocol and registrations of patients were approved by the institutional review board of Duisburg-Essen University (review board identification 14-5751-BO and 19-8662-BO)¹⁵. The written consent was also acquired from all patients¹⁵. All procedures of this study involving human participants were in accordance with the ethical guidelines of the declaration of Helsinki.

Results

Baseline characteristics

A total of 517 sporadic CCM patients were included in this study cohort (330 female patients [63.8%], mean [SD] age at diagnosis, 42.1 [15.5] years), among whom 76 patients (14.7%) experienced (re)hemorrhage during 5-year follow-up. The dataset was randomly assigned to the training set and the testing set. The baseline features of the two groups are shown in Table 1. The flow diagram of the modeling has been illustrated in Fig. 1.

Table 1 Baseline characteristics of the patients cohort.

Full size table

Comparison of model's performance in cross-validation

3 prediction models were developed using 11 features of sporadic CCM patients. For the evaluation of metrics of models, sporadic CCM patients who occurred (re)hemorrhage during follow-up were treated as true positives whilst those without risk of bleeding were considered to be true negatives. ROC curves and the performance of three prediction models resulting from three repeats of five-fold stratified cross-validation are shown (Fig. 2). Among these algorithms, the XGBoost model achieved the highest mean (SD) AUROC of 0.87 [0.06] with a recall of 0.78 and a precision of 0.79. Therefore, we selected XGBoost algorithm to build prediction models.

Feature Importance analysis and development of 4-Elements model

To shed light on the feature importance, Shapley values based on the XGBoost model were calculated. The feature importance ranking, determined by the sum of the Shapley value magnitudes, is illustrated in Fig. 3, each point with color on behalf of the feature value of one patient.

For easy usage of prediction models for clinicians, the 4-Elements model based on the top 4 features (CCM volume, prior ICH, CCM at brain stem, age at diagnosis) using XGBoost was built. It should be noted that, for those sporadic CCM patients with prior ICH, CCM volume was measured as the sum of CCM lesion and hemorrhage lesion. The ROC curves of 4-Elements model in cross-validation are demonstrated in Supplementary Fig. S1.

Performance of all-Elements model and 4-Elements model on testing cohort

Figure 4 shows ROC curves and PR curves of all-Elements model and 4-Elements model for testing cohort. The all-Elements model generated AUROCs of 0.84, whereas this value for 4-Elements model was 0.83. The all-Elements model and 4-Elements model demonstrated a PR-AUC of 0.49 and 0.40, respectively.

Table 2 demonstrates the representative performance of developed models. The sensitivity and specificity of all-Elements model based on XGBoost were 0.86 and 0.76. 4-Elements model achieved a sensitivity of 0.79 and a specificity of 0.72.

Table 2 The performance of all-Elements model and 4-Elements model on testing cohort.

Full size table

Discussion

To the best of our knowledge, the all-Elements model and 4-Elements model are the first developed machine-learning based models for detecting the potential (re)hemorrhage among sporadic CCM patients, especially the readily used 4-Elements model. The present developed all-Elements model using XGBoost algorithm achieved an AUROC of 0.84, with a sensitivity of 0.86 and a specificity of 0.76, demonstrating a comparatively accurate performance in identifying the potential (re)hemorrhage among CCM patients within 5 years. Importantly, the 4-Elements model yielded accurate performance as well in detecting the potential (re)hemorrhage, with an AUROC of 0.83, a sensitivity of 0.79, and a specificity of 0.72.

Compared to SVM model and stacking model, the developed XGBoost model yields higher AUROC in predicting the potential (re)hemorrhage using 11 clinical records of CCM patients. Shapley values have been adopted to interpret feature importance^20,22,29,38 and feature importance based on the XGBoost model was ranked with the aid of Shapley values. For easy and ready usage in clinical practice, previous studies have built machine learning models using only several top features^30,39,40. For our XGBoost model, the top 4 features are CCM volume, presence of ICH, CCM at brain stem, and age at diagnosis, with which we try to build 4-Elements model. Researchers identified prior hemorrhage as a major risk factor for subsequent hemorrhage^7,9,41,42,43. Localized in deep regions of the brain, brainstem CCM and thalamic CCM took up approximately one-third⁴⁴ and it was found that CCM lesions at the brainstem increased hemorrhage rate^{12,45,46,47,48}.

Abundant evidence links age with the risk of (re)hemorrhage among CCM patients. Based on 242 patients with brainstem CCM, Li et al. found that the interval of rehemorrhage-free was significantly shorter in patients aged 50 years or older⁴⁹ and subsequent studies also showed that patients aged 55 years or older were associated with hemorrhage⁵⁰. However, the finding related to the role of age in (re)hemorrhage of CCM patients is not consistent. Young age (< 40 years or < 45 years) was suggested to be associated with CM hemorrhage^13,51. In contrast with the above conclusion, several studies also demonstrated that age was not associated with subsequent symptomatic hemorrhage among CCM patients^47,52. From the sight based on machine learning, we identified age at diagnosis of CCM as a risk factor for (re)hemorrhage.

Although decades of surgical excision for CCM patients, surgical treatment remains controversial⁴. Neurosurgical excision of CCM is executed to prevent symptomatic ICH and the risk of CCM resection includes death or nonfatal stroke^4,53. To prevent potential hemorrhage, surgical excision could be considered in asymptomatic CCM patients in noneloquent areas⁵⁴. CMs located in proximity to the ventricular system or easily accessible solitary CMs in non-eloquent areas may be in need of neurosurgical treatment⁵⁵. Surgery for CCMs at critical supratentorial areas caused significant mainly transient morbidity, and these could be recovered over time⁵⁶. Performing surgery in a subacute phase 2–4 weeks after bleeding is suggested for CCM patients⁵⁵.

It is worth noting that several findings concluded CCM size was not a risk factor for the hemorrhage rate^{47,51,52,57,58}. However, one study based on anatomical location also found that CCM with volume (≥ 1 cm³) at infratentorial cavernous lesions was associated with a high risk of CM rupture whereas that at supratentorial cavernous lesions did not show any relating sign¹³. It is clearly shown that all the top 4 features interpreted by Shapley values have been suggested to be associated with CCM hemorrhage, which may ensure the accurate performance of our 4-Elements model. Interestingly, CCM volume in our prediction model has been demonstrated as the top 1 feature for distinguishing potential (re)hemorrhage of CCM patients from those without (re)hemorrhage risk. We suppose the underlying mechanism may be that machine learning algorithms do not view solitary features and complex relationships between features significantly influencing the resulting classification may be constructed²² in the process of building a model. CCM volume in this study was measured by the sum of CCM lesion and hemorrhage lesion in the case of CCM patients with prior ICH¹⁵.

Although we impute missing values in features of hypercholesterolemia, hypertension and diabetes, these features weakly affect the output of XGBoost model by viewing Shapley values. Further, features used to build the 4-Elements model do not contain missing values.

Surgical resection is a definitive cure for selected CCM patients though remains conflicting due to substantial operative risks^3,4. Antithrombotic therapy use in a long-term lowered the risk of ICH in CCM patients¹⁹. CCM patients labeled by the two machine learning models as potential (re)hemorrhage should, therefore, be considered by clinicians as requiring prompt treatment. Further, CCM patients who are predicted by two models without risk of potential (re)hemorrhage could avoid unnecessary treatment. Our all-Elements model and 4-Elements model fill the gap that the potential (re)hemorrhage CCM patients within 5 years among CCM patients could be recognized in advance.

This study has inherent limitations. 517 sporadic CCM patients were included in this study, and large datasets may need to further validate the all-Elements model and 4-Elements model. Moreover, collecting sufficient clinical records of sporadic CCM patients may facilitate select important features greatly influencing the output of the model.

In conclusion, we developed all-Elements XGBoost model and 4-Elements XGBoost model for identifying potential (re)hemorrhage within 5 years among sporadic CCM patients, both achieving comparatively accurate performance. Importantly, the 4-Elements model is convenient for clinical usage. The two models will aid clinical decision-making, such as initiating prompt treatment for the potential (re)hemorrhage or avoiding unnecessary treatment for those without (re)hemorrhage risk. We are limited by the size of institutions to collect follow-up data of CCM patients for external validation, and we also could not find a dataset of CCM patients in the open platform. Further validating the all-Elements model and 4-Elements model with large datasets of sporadic CCM patients is necessary.

Data availability

The data and code supporting this study is available from the corresponding author upon reasonable request.

References

Chohan, M. O. et al. Emerging pharmacologic targets in cerebral cavernous malformation and potential strategies to alter the natural history of a difficult disease: A review. JAMA Neurol. 76(4), 492–500. https://doi.org/10.1001/jamaneurol.2018.3634 (2019).
Article PubMed Google Scholar
Taslimi, S., Modabbernia, A., Amin-Hanjani, S., Barker, F. G. 2nd. & Macdonald, R. L. Natural history of cavernous malformation: Systematic review and meta-analysis of 25 studies. Neurology 86(21), 1984–1991. https://doi.org/10.1212/wnl.0000000000002701 (2016).
Article PubMed PubMed Central Google Scholar
Chen, B. et al. Modifiable cardiovascular risk factors in patients with sporadic cerebral cavernous malformations: Obesity matters. Stroke 52(4), 1259–1264. https://doi.org/10.1161/strokeaha.120.031569 (2021).
Article CAS PubMed Google Scholar
Akers, A. et al. Synopsis of guidelines for the clinical management of cerebral cavernous malformations: Consensus recommendations based on systematic literature review by the angioma alliance scientific advisory board clinical experts panel. Neurosurgery 80(5), 665–680. https://doi.org/10.1093/neuros/nyx091 (2017).
Article PubMed PubMed Central Google Scholar
Gastelum, E. et al. Rates and characteristics of radiographically detected intracerebral cavernous malformations after cranial radiation therapy in pediatric cancer patients. J. Child. Neurol. 30(7), 842–849. https://doi.org/10.1177/0883073814544364 (2015).
Article PubMed Google Scholar
Al-Shahi Salman, R. et al. Untreated clinical course of cerebral cavernous malformations: A prospective, population-based cohort study. Lancet Neurol. 11(3), 217–224. https://doi.org/10.1016/s1474-4422(12)70004-2 (2012).
Article PubMed Google Scholar
Flemming, K. D., Link, M. J., Christianson, T. J. & Brown, R. D. Jr. Prospective hemorrhage risk of intracerebral cavernous malformations. Neurology 78(9), 632–636. https://doi.org/10.1212/WNL.0b013e318248de9b (2012).
Article CAS PubMed Google Scholar
Al-Shahi Salman, R., Berg, M. J., Morrison, L. & Awad, I. A. Hemorrhage from cavernous malformations of the brain: definition and reporting standards Angioma Alliance Scientific Advisory Board. Stroke 39(12), 3222–3230. https://doi.org/10.1161/strokeaha.108.515544 (2008).
Article PubMed Google Scholar
Horne, M. A. et al. Clinical course of untreated cerebral cavernous malformations: A meta-analysis of individual patient data. Lancet Neurol. 15(2), 166–173. https://doi.org/10.1016/s1474-4422(15)00303-8 (2016).
Article PubMed PubMed Central Google Scholar
Arauz, A. et al. Rebleeding and outcome in patients with symptomatic brain stem cavernomas. Cerebrovasc. Dis. 43(5–6), 283–289. https://doi.org/10.1159/000463392 (2017).
Article PubMed Google Scholar
Dammann, P. et al. Solitary sporadic cerebral cavernous malformations: Risk factors of first or recurrent symptomatic hemorrhage and associated functional impairment. World Neurosurg. 91, 73–80. https://doi.org/10.1016/j.wneu.2016.03.080 (2016).
Article PubMed Google Scholar
Labauge, P., Brunereau, L., Laberge, S. & Houtteville, J. P. Prospective follow-up of 33 asymptomatic patients with familial cerebral cavernous malformations. Neurology 57(10), 1825–1828. https://doi.org/10.1212/wnl.57.10.1825 (2001).
Article CAS PubMed Google Scholar
Kashefiolasl, S. et al. A benchmark approach to hemorrhage risk management of cavernous malformations. Neurology 90(10), e856–e863. https://doi.org/10.1212/wnl.0000000000005066 (2018).
Article PubMed Google Scholar
Abla, A. A. et al. Cavernous malformations of the brainstem presenting in childhood: surgical experience in 40 patients. Neurosurgery 67(6), 1589–1598. https://doi.org/10.1227/NEU.0b013e3181f8d1b2 (2010).
Article PubMed Google Scholar
Chen, B. et al. Hemorrhage from cerebral cavernous malformations: The role of associated developmental venous anomalies. Neurology 95(1), e89–e96. https://doi.org/10.1212/wnl.0000000000009730 (2020).
Article CAS PubMed Google Scholar
Bervini, D., Jaeggi, C., Mordasini, P., Schucht, P. & Raabe, A. Antithrombotic medication and bleeding risk in patients with cerebral cavernous malformations: a cohort study. J. Neurosurg. 1, 1–9. https://doi.org/10.3171/2018.1.Jns172547 (2018).
Article Google Scholar
Flemming, K. D., Link, M. J., Christianson, T. J. & Brown, R. D. Jr. Use of antithrombotic agents in patients with intracerebral cavernous malformations. J. Neurosurg. 118(1), 43–46. https://doi.org/10.3171/2012.8.Jns112050 (2013).
Article PubMed Google Scholar
Schneble, H. M. et al. Antithrombotic therapy and bleeding risk in a prospective cohort study of patients with cerebral cavernous malformations. Stroke 43(12), 3196–3199. https://doi.org/10.1161/strokeaha.112.668533 (2012).
Article CAS PubMed Google Scholar
Zuurbier, S. M. et al. Long-term antithrombotic therapy and risk of intracranial haemorrhage from cerebral cavernous malformations: A population-based cohort study, systematic review, and meta-analysis. Lancet Neurol. 18(10), 935–941. https://doi.org/10.1016/s1474-4422(19)30231-5 (2019).
Article PubMed PubMed Central Google Scholar
Laqueur, H. S., Smirniotis, C., McCort, C. & Wintemute, G. J. Machine learning analysis of handgun transactions to predict firearm suicide risk. JAMA Netw. Open 5(7), e2221041. https://doi.org/10.1001/jamanetworkopen.2022.21041 (2022).
Article PubMed PubMed Central Google Scholar
Ogata, S. et al. Heatstroke predictions by machine learning, weather information, and an all-population registry for 12-hour heatstroke alerts. Nat. Commun. 12(1), 4575. https://doi.org/10.1038/s41467-021-24823-0 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Pfaff, E. R. et al. Identifying who has long COVID in the USA: A machine learning approach using N3C data. Lancet Digit. Health 4(7), e532–e541. https://doi.org/10.1016/s2589-7500(22)00048-6 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dayan, I. et al. Federated learning for predicting clinical outcomes in patients with COVID-19. Nat. Med. 27(10), 1735–1743. https://doi.org/10.1038/s41591-021-01506-3 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chen, B. et al. Hemorrhage from cerebral cavernous malformations: The role of associated developmental venous anomalies. Dryad https://doi.org/10.1111/ene.15574 (2021).
Article Google Scholar
Wilson, S. Miceforest. AnotherSamWilson/miceforest (2020). https://miceforest.readthedocs.io/_/downloads/en/latest/pdf/.
Cortes, C. & Vapnik, V. N. Support-vector networks. Mach. Learn. 20, 273–297. https://doi.org/10.1007/BF00994018 (1995).
Article Google Scholar
Boser, B. E., Guyon, I. M., & Vapnik, V. N. A training algorithm for optimal margin classifiers. In Proceedings of the fifth annual workshop on Computational learning theory: 1992; pp. 144–152 (1992). https://doi.org/10.1145/130385.130401.
Chen, T., & Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining: 2016, pp. 785–794 (2016). https://doi.org/10.1145/2939672.2939785.
Zachariah, F. J., Rossi, L. A., Roberts, L. M. & Bosserman, L. D. Prospective comparison of medical oncologists and a machine learning model to predict 3-month mortality in patients with metastatic solid tumors. JAMA Netw. Open 5(5), e2214514. https://doi.org/10.1001/jamanetworkopen.2022.14514 (2022).
Article PubMed PubMed Central Google Scholar
Abe, D. et al. A prehospital triage system to detect traumatic intracranial hemorrhage using machine learning algorithms. JAMA Netw. Open 5(6), e2216393. https://doi.org/10.1001/jamanetworkopen.2022.16393 (2022).
Article PubMed PubMed Central Google Scholar
Kurz, C. F., Maier, W. & Rink, C. A greedy stacking algorithm for model ensembling and domain weighting. BMC Res. Notes 13(1), 70. https://doi.org/10.1186/s13104-020-4931-7 (2020).
Article PubMed PubMed Central Google Scholar
Wolpert, D. H. Stacked generalization. Neural Netw. 5(2), 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1 (1992).
Article Google Scholar
Breiman, L. Classification and regression trees (Routledge, 2017). https://doi.org/10.1201/9781315139470.
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 1, 1189–1232. https://doi.org/10.1214/aos/1013203451 (2001).
Article MathSciNet Google Scholar
Hastie, T., Tibshirani, R., Friedman, J. H., & Friedman, J. H. The elements of statistical learning: Data mining, inference, and prediction, vol. 2 (Springer, 2009).
Zhang, Z. Introduction to machine learning: k-nearest neighbors. Ann. Transl. Med. 4(11), 1. https://doi.org/10.21037/atm.2016.03.37 (2016).
Article ADS CAS Google Scholar
He, H. & Garcia, E. A. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284. https://doi.org/10.1109/TKDE.2008.239 (2009).
Article Google Scholar
Thorsen-Meyer, H. C. et al. Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: A retrospective study of high-frequency data in electronic patient records. Lancet Digit. Health 2(4), e179–e191. https://doi.org/10.1016/s2589-7500(20)30018-2 (2020).
Article PubMed Google Scholar
Astner-Rohracher, A. et al. Development and validation of the 5-SENSE score to predict focality of the seizure-onset zone as assessed by stereoelectroencephalography. JAMA Neurol. 79(1), 70–79. https://doi.org/10.1001/jamaneurol.2021.4405 (2022).
Article PubMed Google Scholar
Liang, W. et al. Early triage of critically ill COVID-19 patients using deep learning. Nat. Commun. 11(1), 3543. https://doi.org/10.1038/s41467-020-17280-8 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Gross, B. A., Du, R., Orbach, D. B., Scott, R. M. & Smith, E. R. The natural history of cerebral cavernous malformations in children. J. Neurosurg. Pediatr. 17(2), 123–128. https://doi.org/10.3171/2015.2.Peds14541 (2016).
Article PubMed Google Scholar
Tian, K. B. et al. Clinical course of untreated thalamic cavernous malformations: Hemorrhage risk and neurological outcomes. J. Neurosurg. 127(3), 480–491. https://doi.org/10.3171/2016.8.Jns16934 (2017).
Article PubMed Google Scholar
Jeon, J. S. et al. A risk factor analysis of prospective symptomatic haemorrhage in adult patients with cerebral cavernous malformation. J. Neurol. Neurosurg. Psychiatry 85(12), 1366–1370. https://doi.org/10.1136/jnnp-2013-306844 (2014).
Article PubMed Google Scholar
Ding, D., Starke, R. M., Crowley, R. W. & Liu, K. C. Surgical approaches for symptomatic cerebral cavernous malformations of the thalamus and brainstem. J. Cerebrovasc. Endovasc. Neurosurg. 19(1), 19–35. https://doi.org/10.7461/jcen.2017.19.1.19 (2017).
Article PubMed PubMed Central Google Scholar
Cantu, C. et al. Predictive factors for intracerebral hemorrhage in patients with cavernous angiomas. Neurol. Res. 27(3), 314–318. https://doi.org/10.1179/016164105x39914 (2005).
Article PubMed Google Scholar
Eisner, W. et al. The mapping and continuous monitoring of the intrinsic motor nuclei during brain stem surgery. Neurosurgery 37(2), 255–265. https://doi.org/10.1227/00006123-199508000-00010 (1995).
Article CAS PubMed Google Scholar
Porter, P. J., Willinsky, R. A., Harper, W. & Wallace, M. C. Cerebral cavernous malformations: natural history and prognosis after clinical deterioration with or without hemorrhage. J. Neurosurg. 87(2), 190–197. https://doi.org/10.3171/jns.1997.87.2.0190 (1997).
Article CAS PubMed Google Scholar
Gross, B. A., Batjer, H. H., Awad, I. A. & Bendok, B. R. Brainstem cavernous malformations. Neurosurgery 64(5), E805–E818. https://doi.org/10.1227/01.Neu.0000343668.44288.18 (2009).
Article PubMed Google Scholar
Li, D. et al. Hemorrhage risk, surgical management, and functional outcome of brainstem cavernous malformations. J. Neurosurg. 119(4), 996–1008. https://doi.org/10.3171/2013.7.Jns13462 (2013).
Article PubMed Google Scholar
Kong, L. et al. Five-year symptomatic hemorrhage risk of untreated brainstem cavernous malformations in a prospective cohort. Neurosurg. Rev. 45(4), 2961–2973. https://doi.org/10.1007/s10143-022-01815-2 (2022).
Article PubMed Google Scholar
Aiba, T. et al. Natural history of intracranial cavernous malformations. J. Neurosurg. 83(1), 56–59. https://doi.org/10.3171/jns.1995.83.1.0056 (1995).
Article CAS PubMed Google Scholar
Kondziolka, D., Lunsford, L. D. & Kestle, J. R. The natural history of cerebral cavernous malformations. J. Neurosurg. 83(5), 820–824. https://doi.org/10.3171/jns.1995.83.5.0820 (1995).
Article CAS PubMed Google Scholar
Harris, L., Poorthuis, M. H. F., Grover, P., Kitchen, N. & Al-Shahi Salman, R. Surgery for cerebral cavernous malformations: A systematic review and meta-analysis. Neurosurg. Rev. 45(1), 231–241. https://doi.org/10.1007/s10143-021-01591-5 (2022).
Article PubMed Google Scholar
Kuroedov, D., Cunha, B., Pamplona, J., Castillo, M. & Ramalho, J. Cerebral cavernous malformations: Typical and atypical imaging characteristics. J. Neuroimaging 33(2), 202–217. https://doi.org/10.1111/jon.13072 (2023).
Article PubMed Google Scholar
Vercelli, G. G. et al. Natural history, clinical, and surgical management of cavernous malformations. Methods Mol. Biol. 2152, 35–46. https://doi.org/10.1007/978-1-0716-0640-7_3 (2020).
Article CAS PubMed Google Scholar
Pasqualin, A., Meneghelli, P., Giammarusti, A. & Turazzi, S. Results of surgery for cavernomas in critical supratentorial areas. Acta Neurochir. Suppl. 119, 117–123. https://doi.org/10.1007/978-3-319-02411-0_20 (2014).
Article CAS PubMed Google Scholar
Moriarity, J. L. et al. The natural history of cavernous malformations: A prospective study of 68 patients. Neurosurgery 44(6), 1166–1171. https://doi.org/10.1097/00006123-199906000-00003 (1999).
Article CAS PubMed Google Scholar
Robinson, J. R., Awad, I. A. & Little, J. R. Natural history of the cavernous angioma. J. Neurosurg. 75(5), 709–714. https://doi.org/10.3171/jns.1991.75.5.0709 (1991).
Article CAS PubMed Google Scholar

Download references

Funding

This study was supported by the Kaifeng Science and Technology Development Plan Project (Grant No. 2003042).

Author information

These authors contributed equally: Xiaopeng Li and Peng Jones.

Authors and Affiliations

Department of Neurology, The First Affiliated Hospital of Henan University, Kaifeng, China
Xiaopeng Li
Independent Researcher, Xinyang, Henan, China
Peng Jones
Department of Neurology, The First Affiliated Hospital of Nanchang University, No. 17 Yongwai Street, Nanchang, 330006, Jiangxi, China
Mei Zhao

Authors

Xiaopeng Li
View author publications
You can also search for this author in PubMed Google Scholar
Peng Jones
View author publications
You can also search for this author in PubMed Google Scholar
Mei Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Concept and design: MZ. Acquisition, analysis, or interpretation of data: all authors. Drafting of the manuscript: MZ. Critical revision of the manuscript: PJ and XL. Statistical analysis: all authors. Supervision: MZ. All authors approved the final manuscript.

Corresponding author

Correspondence to Mei Zhao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Jones, P. & Zhao, M. Identifying potential (re)hemorrhage among sporadic cerebral cavernous malformations using machine learning. Sci Rep 14, 11022 (2024). https://doi.org/10.1038/s41598-024-61851-4

Download citation

Received: 08 June 2023
Accepted: 10 May 2024
Published: 14 May 2024
DOI: https://doi.org/10.1038/s41598-024-61851-4

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.