Introduction

Transient ischemic attack (TIA) and acute ischemic stroke (AIS) are both characterized by a sudden reduction in blood flow, leading to temporary or permanent loss of neurological function1. TIA is defined as a transient episode of neurologic dysfunction due to focal brain, spinal cord, or retinal ischemia, without acute infarction2. Recent studies have shown a strong correlation between TIA and the subsequent development of AIS3,4. Approximately 20% of TIA patients experience an AIS within three months of the initial TIA event, with the highest risk occurring within the first 48 h5. Over the long term, TIA patients face a 10-year stroke risk of 19% and a combined 10-year risk of stroke, myocardial infarction, and vascular death at 43%6. TIA and AIS share several common risk factors, such as hypertension, diabetes mellitus, hyperlipidemia, and atrial fibrillation7. Among these, hypertension is the most prevalent risk factor for both conditions8. Elderly patients with hypertension who experience TIA symptoms, such as sudden weakness or numbness in the face, arms, or legs; confusion; difficulty speaking; vision problems; dizziness; and severe headache, are at an increased risk of developing AIS in the days and weeks following the TIA event9. The unpredictability of progression from TIA to AIS not only imposes a considerable burden on the healthcare system but also significantly impacts the mental well-being and daily activities of elderly hypertensive patients. Given these risks and the urgent need to identify predictive factors, establishing an effective risk prediction model for AIS following a TIA event in elderly hypertensive patients is crucial. However, the current literature on AIS prediction predominantly focuses on broader patient populations10,11,12, often overlooking the unique characteristics and risk profiles of this specific group.

A peripheral routine blood test (RBT) is the most commonly performed clinical test and provides a comprehensive evaluation of various blood components and characteristics, offering valuable insight into an individual's hematological profile and overall health status13,14. The direct measurements obtained from the RBT are known as primary hematological indicators (PHIs), while derived hematological indicators (DHIs) are calculated from PHIs using various mathematical formulas. Together, these indicators are collectively referred to as primary and derived hematological indicators (PDHIs). Supplementary Material 1 provides a comprehensive list of all PDHIs measured using the Sysmex XE-5000 Hematology Analyzer, including their full names, corresponding abbreviations, and the formulas used to calculate the DHIs. Numerous studies have indicated the critical role of PDHIs in the development and progression of hypertension, TIA, and AIS15,16, and there is substantial evidence of common alterations in PDHIs across these three vascular-origin diseases. For instance, an elevation in neutrophils and a decrease in lymphocytes17,18,19, as well as consistent changes in hematocrit20,21,22 and red cell distribution width (RDW)14,23,24, have been reported in all three conditions. These findings suggest that numerous shared hematological alterations exist in the internal environment of patients with these vascular diseases and may hold the key to predicting the risk of AIS in elderly hypertensive patients who have experienced a TIA.
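For illustration, one widely used DHI examined later in this work, the systemic inflammation response index (SIRI), is conventionally derived from three PHIs (neutrophil, monocyte, and lymphocyte counts). The minimal sketch below shows such a derivation; the function name and example values are illustrative assumptions and are not taken from Supplementary Material 1.

```python
def siri(neutrophils: float, monocytes: float, lymphocytes: float) -> float:
    """Systemic inflammation response index, a DHI derived from three PHIs.

    Conventional definition: SIRI = neutrophils x monocytes / lymphocytes,
    with all counts expressed in 10^9 cells/L.
    """
    return neutrophils * monocytes / lymphocytes


# Example: neutrophils 4.2, monocytes 0.5, lymphocytes 1.8 (x 10^9 cells/L)
print(round(siri(4.2, 0.5, 1.8), 2))  # 1.17
```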

Despite the recognized importance of these PDHIs, a comprehensive and systematic study investigating their predictive power and associated risk factors for AIS following a TIA in elderly hypertensive patients is lacking. The complexities inherent in PDHIs, such as nonlinear relationships25 and multicollinearity26, necessitate advanced data science methodologies to unlock their predictive potential and unravel the associated risk factors. To address these challenges, this paper adopts a robust analytic strategy: we first apply the searching for uncorrelated list of variables (SULOV)-recursive method, which is suited to nonlinear data, to select relevant variables while minimizing redundancy. We then construct an extreme gradient boosting (XGBoost) model, known for its efficacy in handling multicollinearity and capturing complex interactions, fine-tune it through an exhaustive hyperparameter optimization process, and further calibrate it to enhance predictive accuracy. This comprehensive approach aims to build a reliable three-year AIS risk prediction model for elderly hypertensive TIA patients that harnesses the full spectrum of PDHIs. The model's interpretability and sensitivity analyses are designed to identify the key factors that drive the progression from TIA to AIS in this high-risk group. Our research aims to enhance early prediction and intervention for AIS, potentially improving management and outcomes for elderly hypertensive patients post-TIA.

Methods

Cohort selection and variables definition

Our study extracted data from the Hospital Information System (HIS) and included 32,643 elderly patients with a history of hypertension who were consecutively admitted to Tianjin Huanhu Hospital's emergency department and received a primary diagnosis of TIA between July 2015 and December 2019. Follow-ups were conducted through a remote follow-up system combining bulk mobile messaging and WeChat, supplemented by phone calls when necessary, to determine whether patients experienced cerebral infarction within three years post-TIA. Outcomes were ascertained with a binary question: "Have you, or the patient, been diagnosed with cerebral infarction, confirmed by a neurologist's cranial CT or MRI scan, within three years following the TIA diagnosis at our hospital?" If direct contact failed, we reached out to relatives using the contact information provided during the initial hospital visit. This methodology ensured precise identification of AIS incidents within the follow-up window. We excluded patients based on the following criteria: (1) patients diagnosed with chronic cerebral infarction or other cerebrovascular diseases based on cranial CT or MRI scan reports in the EMR system; (2) patients admitted to the emergency department without a completed electronic medical record, precluding the extraction of the RBT report, past medical history, and alcohol and tobacco use data; (3) patients whose admission routine blood tests showed white blood cell (WBC), red blood cell (RBC), or platelet (PLT) counts outside the normal range (WBC: 4–10 × 10⁹ cells/L, RBC: 4–6 × 10¹² cells/L, PLT: 150–450 × 10⁹ cells/L), a criterion set to mitigate the impact of infections and hematological diseases on the PDHIs; (4) patients admitted to the emergency department who did not provide contact information; and (5) patients lost to follow-up. Supplemental Fig. 1 shows the flowchart of patient selection. Our research incorporated 28 PHIs, eight DHIs, and eight categorical demographic and lifestyle variables, including gender, age (categorized into three groups: 60–69, 70–79, and 80+ years), drinking history, and past medical history. All PDHIs (n = 36) used in this study are continuous variables obtained from the first blood draw after admission. A comprehensive list of these PDHIs is provided in Supplementary Material 1.

Figure 1

Model calibration. The results of model calibration for XGB-PDHIs model (a), XGB-Mixed model (b) and XGB-All model (c).

All data used in this study were extracted from the hospital's business system with risk-minimization measures to ensure data security. Private information, such as patient names, ID numbers, and addresses, was hidden, and data usage complied with the hospital ethics committee's provisions on exemption from informed consent. Informed consent was obtained from all subjects or their legal guardian(s). The study was approved by the Huanhu Ethics Committee (No. 2021060) and has been registered in the Chinese Clinical Trial Registry (https://www.chictr.org.cn/login.aspx?referurl=%2flistbycreater.aspx), registration number ChiCTR2100054189. All methods were performed in accordance with the Declaration of Helsinki for human research.

Handling collinearity and variable selection

Considering the substantial internal correlations among PDHIs, we employed the SULOV-recursive method27. The SULOV (Searching for Uncorrelated List Of Variables) algorithm is an adaptation of the Minimum-Redundancy-Maximum-Relevance (MRMR) method designed to handle multicollinearity. It identifies pairs of highly correlated variables, assesses their relevance to the target using the mutual information score, and excludes the less informative variable of each pair, leaving a set of variables with maximum informational value and minimal mutual correlation. The algorithm then deploys XGBoost iteratively to pinpoint the most predictive variables, conducting multiple training-validation cycles and collating top features from varying data subsets. This procedure concludes by discarding redundant variables, providing a streamlined and effective set of predictors for the subsequent modeling stages. For the demographic and lifestyle variables, we applied integer encoding, and a Cramer's V correlation matrix was used to further reduce multicollinearity among these categorical variables. For both continuous and categorical variables, we set the correlation threshold at 0.3.
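The sketch below is a simplified, hypothetical re-implementation of these core ideas (mutual-information-guided removal of correlated variables, a recursive XGBoost ranking step, and a Cramer's V check for categorical variables), not the cited implementation27. It assumes a pandas DataFrame `X` of continuous PDHIs, a binary outcome `y`, and illustrative parameter choices.

```python
import numpy as np
import pandas as pd
from scipy.stats import chi2_contingency
from sklearn.feature_selection import mutual_info_classif
from sklearn.model_selection import StratifiedKFold
from xgboost import XGBClassifier


def sulov_select(X: pd.DataFrame, y: pd.Series, corr_threshold: float = 0.3) -> list:
    """SULOV step: within each highly correlated pair, drop the variable with the
    lower mutual information score against the target."""
    mi = pd.Series(mutual_info_classif(X, y, random_state=0), index=X.columns)
    corr = X.corr().abs()
    dropped: set = set()
    cols = list(X.columns)
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            if a in dropped or b in dropped:
                continue
            if corr.loc[a, b] > corr_threshold:
                dropped.add(a if mi[a] < mi[b] else b)  # keep the more informative one
    return [c for c in cols if c not in dropped]


def recursive_xgb_rank(X: pd.DataFrame, y: pd.Series, n_splits: int = 5, top_k: int = 10) -> pd.Series:
    """Recursive step: tally how often each variable appears among the top-k
    XGBoost importances across repeated training/validation cycles."""
    votes = pd.Series(0, index=X.columns, dtype=int)
    for tr_idx, _ in StratifiedKFold(n_splits, shuffle=True, random_state=0).split(X, y):
        booster = XGBClassifier(n_estimators=200, max_depth=4, eval_metric="logloss")
        booster.fit(X.iloc[tr_idx], y.iloc[tr_idx])
        top = pd.Series(booster.feature_importances_, index=X.columns).nlargest(top_k).index
        votes[top] += 1
    return votes.sort_values(ascending=False)


def cramers_v(a: pd.Series, b: pd.Series) -> float:
    """Cramer's V association between two categorical variables."""
    table = pd.crosstab(a, b)
    chi2 = chi2_contingency(table)[0]
    n = table.to_numpy().sum()
    return float(np.sqrt(chi2 / (n * (min(table.shape) - 1))))
```

In this study's setting, `sulov_select` would be applied to the 36 continuous PDHIs and `cramers_v` pairwise to the integer-encoded categorical variables, both against the 0.3 threshold stated above.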

Non-linearity assessment and modeling workflow

To assess potential non-linearity between the finally selected PDHIs and the outcome variable, we employed the Box-Tidwell test. This test examines the linearity of each continuous predictor with respect to the logit of the outcome by adding an interaction term between the predictor and its natural logarithm to the model. This step is crucial because it informs the choice of an appropriate predictive model. A significant interaction term (p ≤ 0.05) signifies the presence of non-linearity.
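A minimal sketch of this check using statsmodels is shown below. It assumes a DataFrame `X` of the selected PDHIs and a binary outcome `y`; the single-predictor formulation and the small positive shift for non-positive values are illustrative assumptions, not necessarily the exact procedure used in this study.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm


def box_tidwell_pvalues(X: pd.DataFrame, y: pd.Series) -> pd.Series:
    """For each predictor x, fit logit(y) ~ x + x*ln(x); a significant p-value on
    the x*ln(x) term suggests the predictor is not linear in the logit."""
    pvals = {}
    for col in X.columns:
        x = X[col].astype(float)
        if (x <= 0).any():                 # ln() requires strictly positive values
            x = x - x.min() + 1e-6
        design = pd.DataFrame({col: x, f"{col}_lnx": x * np.log(x)})
        fit = sm.Logit(y, sm.add_constant(design)).fit(disp=0)
        pvals[col] = fit.pvalues[f"{col}_lnx"]
    return pd.Series(pvals, name="box_tidwell_p")
```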

Supplemental Fig. 2 illustrates the overall workflow of model fitting and testing. To accommodate the combined linear and non-linear characteristics of our data, we employed XGBoost as our principal model. This choice was based not only on the preliminary screening results from our training and validation sets, in which XGBoost outperformed 15 different machine learning algorithms, but also on its well-documented suitability for medical tabular data28,29,30. Three XGBoost models were constructed: the first used only the selected PDHIs as input; the second incorporated both the selected PDHIs and categorical variables; and the third included all variables without feature selection.

After applying the Robust Scaler to continuous variables and label encoding to categorical variables, we tuned the hyperparameters of each XGBoost model using the Tree-structured Parzen Estimator (TPE) method within the Optuna framework28. Model training incorporated ten-fold cross-validation. Within each fold, class imbalance was addressed according to the model's input types: the Synthetic Minority Over-sampling Technique (SMOTE) was applied to the training subset for the model using only PDHIs31, and SMOTENC for the models including both PDHIs and categorical variables32. This resampling was restricted to the nine folds used for training in each cross-validation iteration; the remaining fold, serving as the validation fold, was left untouched by SMOTE or SMOTENC, preserving its original class distribution.

After hyperparameter tuning, we performed model calibration on the initially separated validation set using isotonic regression and sigmoid calibration. The optimal approach for each model was determined by comparing the uncalibrated model with these two methods and selecting the one yielding the lowest Brier score. To evaluate the three calibrated models, we again used ten-fold cross-validation on the training set with the appropriate class imbalance adjustments; this strengthened the models' ability to detect the minority class and prevented accuracy from being overestimated due to imbalanced class distributions. For the final evaluation on the test set, we applied no class imbalance processing, thereby avoiding data leakage and ensuring that performance reflected a realistic prediction scenario with the original class distribution. In summary, three calibrated XGBoost models with different input variables were developed in parallel, each undergoing hyperparameter tuning with cross-validation on the training set and calibration on an independent validation set. To compare the models, we applied McNemar's test with Benjamini–Hochberg correction33,34 to both the validation and test sets; McNemar's test is a statistical method for comparing the predictive capabilities of already fitted classifiers.
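The sketch below outlines this tuning-and-calibration pipeline for the PDHIs-only model, assuming NumPy arrays `X_train`/`y_train` and a held-out validation set `X_val`/`y_val`. The search space, the number of trials, and the decision to refit the final model without resampling are illustrative assumptions rather than the exact settings used in this study.

```python
import numpy as np
import optuna
from imblearn.over_sampling import SMOTE
from sklearn.calibration import CalibratedClassifierCV
from sklearn.metrics import balanced_accuracy_score, brier_score_loss
from sklearn.model_selection import StratifiedKFold
from xgboost import XGBClassifier


def objective(trial, X_train, y_train):
    """TPE-searched hyperparameters, scored by mean balanced accuracy over a
    10-fold CV in which SMOTE is applied to the nine training folds only."""
    params = {
        "n_estimators": trial.suggest_int("n_estimators", 100, 800),
        "max_depth": trial.suggest_int("max_depth", 3, 8),
        "learning_rate": trial.suggest_float("learning_rate", 1e-3, 0.3, log=True),
        "subsample": trial.suggest_float("subsample", 0.6, 1.0),
        "colsample_bytree": trial.suggest_float("colsample_bytree", 0.6, 1.0),
    }
    scores = []
    cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
    for tr_idx, va_idx in cv.split(X_train, y_train):
        X_res, y_res = SMOTE(random_state=0).fit_resample(X_train[tr_idx], y_train[tr_idx])
        model = XGBClassifier(**params, eval_metric="logloss")
        model.fit(X_res, y_res)
        # the held-out fold keeps its original class distribution
        scores.append(balanced_accuracy_score(y_train[va_idx], model.predict(X_train[va_idx])))
    return float(np.mean(scores))


def tune_and_calibrate(X_train, y_train, X_val, y_val):
    study = optuna.create_study(direction="maximize",
                                sampler=optuna.samplers.TPESampler(seed=0))
    study.optimize(lambda t: objective(t, X_train, y_train), n_trials=50)
    base = XGBClassifier(**study.best_params, eval_metric="logloss").fit(X_train, y_train)

    # compare the uncalibrated model against sigmoid and isotonic calibration
    candidates = {"uncalibrated": base}
    for method in ("sigmoid", "isotonic"):
        cal = CalibratedClassifierCV(base, method=method, cv="prefit")
        candidates[method] = cal.fit(X_val, y_val)
    briers = {name: brier_score_loss(y_val, m.predict_proba(X_val)[:, 1])
              for name, m in candidates.items()}
    return candidates[min(briers, key=briers.get)], briers
```

For the XGB-Mixed and XGB-All models, SMOTENC would replace SMOTE in the resampling step so that the categorical columns are handled appropriately.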

Figure 2

Risk factor analysis. Individual sensitivity analysis for SIRI (a), IG_p (b), HCT (c), RDW_CV (d), EOS (e), PLT (f), and BAS_P (g). Global sensitivity analysis for these predictive factors (h). Global SHAP value plot demonstrating the overall effect ranking (i).

Multi-tiered approach for predictive factor analysis

In this study, we adopted a multi-tiered approach to predictive factor analysis. In the individual sensitivity analysis, we systematically varied the value of each selected PDHI within its observed range and evaluated how these changes influenced the model's predictions for specific patients. In the global sensitivity analysis, we randomly shuffled the values of each PDHI across the entire dataset, disrupting its original correlation with the target variable; this enabled us to evaluate the independent contribution of each PDHI to the model's predictive performance. Following the sensitivity analyses, we applied the SHAP (SHapley Additive exPlanations) methodology to rank risk factors according to their importance35. The ranking is derived from each feature's SHAP value, which quantifies both the direct (main-effect) and interaction contributions of each PDHI to the predicted outcome; SHAP values capture a feature's average contribution to the prediction across all possible coalitions of features. Finally, we examined interaction effects among the risk factors using SHAP interaction values, which uncovered the pairs of risk factors that interact most strongly and shed light on the complex interdependencies among the PDHIs.
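A compact sketch of these three analyses for a fitted XGBoost model is given below, assuming a NumPy feature matrix `X`, labels `y`, and a fitted `model`. The perturbation grid size and the use of balanced accuracy as the global-sensitivity statistic are assumptions for illustration; the paper does not specify these details here.

```python
import numpy as np
import shap
from sklearn.metrics import balanced_accuracy_score


def individual_sensitivity(model, X, feature_idx, patient_idx, n_grid=50):
    """Vary one PDHI across its observed range for a single patient and record
    the predicted AIS probability at each grid point."""
    grid = np.linspace(X[:, feature_idx].min(), X[:, feature_idx].max(), n_grid)
    rows = np.repeat(X[patient_idx:patient_idx + 1], n_grid, axis=0)
    rows[:, feature_idx] = grid
    return grid, model.predict_proba(rows)[:, 1]


def global_sensitivity(model, X, y, feature_idx, seed=0):
    """Shuffle one PDHI across the whole dataset and report the resulting change
    in performance (balanced accuracy used here as one plausible statistic)."""
    rng = np.random.default_rng(seed)
    X_perm = X.copy()
    X_perm[:, feature_idx] = rng.permutation(X_perm[:, feature_idx])
    return (balanced_accuracy_score(y, model.predict(X)) -
            balanced_accuracy_score(y, model.predict(X_perm)))


def shap_ranking(model, X):
    """Mean absolute SHAP value per feature, used to rank overall importance."""
    explainer = shap.TreeExplainer(model)
    return np.abs(explainer.shap_values(X)).mean(axis=0)
```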

Sample size and statistical analysis

We performed a power analysis to determine the sample sizes of our training, validation, and test sets using the R package 'pmsampsize', which computes the minimum sample size required to develop a multivariable prediction model. We specified an anticipated AUC of 0.9 and used the expected outcome prevalence to approximate the Cox-Snell R-squared, following the methodology proposed by Riley et al.36. For the dataset with 44 input variables, the minimum required sample size is 989 cases; for the dataset with 14 input variables, 315 cases; and for the dataset with 7 input variables, 303 cases. The sizes of our training, validation, and test sets substantially exceed these thresholds, indicating a reduced risk of overfitting and ensuring precise estimation of the key parameters in the prediction models. This substantial sample size provides a robust foundation for the development and validation of our models.

Continuous variables are reported as medians with interquartile ranges (IQRs) and categorical variables as percentages. Statistical comparisons were performed using the Kruskal-Wallis and chi-squared tests, with P ≤ 0.05 considered statistically significant. Given the imbalanced nature of the data and our practical experience, balanced accuracy was employed as the primary optimization metric for ranking model performance. Additional evaluation metrics were also reported; detailed descriptions are provided in Supplementary Material 2. All analyses were implemented in Python 3.8.13 with XGBoost 1.6.1, scikit-learn 1.1.1, and SHAP 0.41.0, running on Ubuntu 20.04.

Results

Data split, variables selection and nonlinear detection

Our study cohort consisted of 11,056 elderly patients diagnosed with TIA, with a median age of 68 years (IQR 64–73) and a male-to-female ratio of 5451:5605. All patients had a history of hypertension. Using a random shuffle strategy, the cohort was split into training (n = 5527), validation (n = 2212), and test (n = 3317) datasets at a ratio of 5:2:3. The proportions of positive outcomes were 28.2% in the training set, 26.8% in the validation set, and 27.8% in the test set. Descriptive statistics for the variables across these datasets are provided in Supplementary Table 1. A pairwise Pearson correlation analysis was performed on the 36 PDHIs in the training set (Supplementary Table 2). We found 12 pairs of PDHIs with absolute correlation coefficients greater than 0.9, and 48 pairs with coefficients greater than 0.7, indicating multicollinearity in the PDHI data. Application of the SULOV algorithm effectively reduced this multicollinearity, identifying seven key indicators (SIRI, HCT, RDW_CV, PLT, IG_p, BAS_p and EOS) with mutual correlation coefficients below 0.3. Similarly, using the Cramer's V correlation matrix for categorical variables, we identified seven significant factors (smoking status, alcohol consumption, diabetes, heart disease, respiratory disorders, gender, and age), each with pairwise correlation coefficients under 0.3.
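For reference, the counts of strongly correlated pairs reported above can be obtained with a few lines of code; `pdhi_train` is a hypothetical DataFrame holding the 36 PDHIs of the training set.

```python
import numpy as np
import pandas as pd


def count_correlated_pairs(pdhi: pd.DataFrame, thresholds=(0.9, 0.7)) -> dict:
    """Count PDHI pairs whose absolute Pearson correlation exceeds each threshold."""
    corr = pdhi.corr(method="pearson").abs()
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))  # unique pairs only
    return {t: int((upper > t).sum().sum()) for t in thresholds}


# Applied to the training-set PDHIs, this should reproduce the pair counts above,
# e.g. count_correlated_pairs(pdhi_train) -> {0.9: 12, 0.7: 48}
```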

Based on the results of the Box-Tidwell test, we observed that in the training set, the predictors 'HCT', 'RDW_CV', 'PLT', and 'SIRI' showed p-values less than 0.05, indicating non-linear relationships with the outcome. Conversely, 'IG_p' (p = 0.168), 'BAS_p' (p = 0.413), and 'EOS' (p = 0.375) had p-values greater than 0.05, suggesting linear relationships. Given the presence of both linear and non-linear relationships among the variables, we opted for the versatile XGBoost algorithm for our modeling, following an initial screening of 15 machine learning algorithms (Supplemental Fig. 3).

Figure 3

SHAP interaction values plots. Utilizing SHAP interaction values, we visualized the interactive effects between SIRI and other predictive factors. The x-axis represents the values of SIRI after robust scaling. The color gradient in the plot, from green to red, indicates the increasing values of other predictive factors (a BAS_p, b RDW_CV, c PLT, d HCT, e EOS, f IG_p) post robust scaling. The y-axis shows the calculated SHAP interaction values between SIRI and these predictive factors, reflecting the impact of their interactions on the model's prediction for each sample.

Model fitting and performance evaluation

We employed three models to thoroughly assess how the variables fit the outcome: XGBoost with only the selected PDHIs (XGB-PDHIs); XGBoost with both the selected PDHIs and categorical variables (XGB-Mixed); and XGBoost incorporating all variables without feature selection (XGB-All). The optimal hyperparameters determined for each model after tuning are listed in Supplementary Table 3. The probability calibration results are depicted in Fig. 1. For all three XGBoost models, the Brier scores were higher after calibration; hence, the uncalibrated versions were selected for further evaluation. Table 1 presents the performance metrics of the optimized models as evaluated through tenfold cross-validation on the training set, together with their final evaluation on the test set. The slightly lower metrics on the test set, compared with the training-set cross-validation results, indicate that our models maintain good generalization. This suggests that the models learned the underlying patterns in the data without overfitting to the training set, supporting their applicability to real-world scenarios.

Table 1 Model performance assessment through cross-validation on training set and independent evaluation on test set.

To compare the classification abilities of the three final fitted models, we employed McNemar's test in conjunction with the Benjamini-Hochberg (BH) correction. From our results (Table 2), we found no significant differences in the predictive capabilities of the three models on the different data sets, even though they yielded different McNemar test statistics. This lack of statistical distinction suggests that, on our dataset, the predictive performances of the three models are effectively indistinguishable. Moreover, the input data for the XGB-PDHIs model consist solely of objectively measured continuous variables that can be obtained from a single routine blood test, making it highly suitable for clinical application. Considering both its performance and its simplicity, we chose the XGB-PDHIs model for in-depth interpretation.
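The model comparison can be sketched as follows, assuming predicted labels for the three fitted models on a given data set; the function and variable names are illustrative assumptions.

```python
import numpy as np
from statsmodels.stats.contingency_tables import mcnemar
from statsmodels.stats.multitest import multipletests


def pairwise_mcnemar(y_true, preds, alpha=0.05):
    """Pairwise McNemar tests between fitted classifiers, with Benjamini-Hochberg
    correction of the p-values. `preds` maps model name -> predicted label array."""
    names = list(preds)
    pairs, pvals = [], []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            correct_a = preds[a] == y_true
            correct_b = preds[b] == y_true
            # 2x2 table of agreement/disagreement in prediction correctness
            table = [[np.sum(correct_a & correct_b), np.sum(correct_a & ~correct_b)],
                     [np.sum(~correct_a & correct_b), np.sum(~correct_a & ~correct_b)]]
            pairs.append((a, b))
            pvals.append(mcnemar(table, exact=False, correction=True).pvalue)
    reject, p_adj, _, _ = multipletests(pvals, alpha=alpha, method="fdr_bh")
    return {pair: (p, r) for pair, p, r in zip(pairs, p_adj, reject)}


# Example usage on the test set:
# pairwise_mcnemar(y_test, {"XGB-PDHIs": p1, "XGB-Mixed": p2, "XGB-All": p3})
```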

Table 2 Table of McNemar's test results.

Risk factor analysis

Through the individual sensitivity analysis, we observed that modifying each selected PDHI within its observed range uniquely influenced the model's predictions for specific patients (Fig. 2a–g), highlighting the distinct impact of each risk factor on the predicted outcome. For instance, as the value of SIRI increased, the probability of predicting a positive outcome for samples that were originally negative also increased. RDW_CV displayed a notable trend: as its value increased, originally negative samples initially saw an increased probability of being predicted as positive, followed by a decrease. The trends for the other indicators were more complex, with the probability changes for individual samples varying in direction and magnitude, likely reflecting intricate interactions among the predictors.

Our global sensitivity analysis revealed that among the independent predictive factors, SIRI exerted the most significant influence on the predictive outcome, with a value of 0.117 (Fig. 2h). This value represents the degree of change in the model's predicted outcome when SIRI values are shuffled, thereby disrupting their correlation with the target variable. The second most influential factor was HCT, with a value of 0.108. All other examined factors exhibited values less than 0.08, indicating a lesser degree of influence on the prediction outcome. This suggests that, in the context of forecasting acute ischemic stroke occurrence in elderly hypertensive patients with TIA, the impact of a single PDHI appears relatively limited. In parallel with the global sensitivity analysis, we employed SHAP values for a comprehensive feature importance analysis (Fig. 2i). The results revealed that the top five contributors to the model, in order, were: SIRI, RDW_CV, BAS_p, HCT, and PLT. Apart from SIRI, the overall contribution rankings of factors in the model differed from those obtained in the global sensitivity analysis. These analyses highlight the intricate interplay of selected PDHIs in determining the outcome variable.

Finally, we sought to elucidate potential interaction effects within our XGB-PDHIs model by conducting a pairwise analysis of all selected PDHIs using SHAP interaction values. This analysis, conducted at the individual level, revealed complex interactions between different pairs of PDHIs. For illustration, we visualized the interaction effects involving SIRI (Fig. 3). Positive SHAP interaction values imply that the joint presence of two features increases the predicted risk of elderly hypertensive TIA patients subsequently developing AIS, whereas negative SHAP interaction values indicate that their combination reduces the predicted risk. Figure 3 shows that SIRI has pronounced non-linear interactions with each of the other selected PDHIs. For instance, Fig. 3a displays how different combinations of SIRI and BAS_p values shape their interaction as captured by the XGB-PDHIs model: as SIRI values increase, the direction and strength of the interaction with BAS_p vary across SIRI ranges. Initially, the positive interaction is enhanced when BAS_p values are low, then strengthens further at high BAS_p values; a stronger negative interaction subsequently emerges while BAS_p values remain high, and increased negative interaction follows when BAS_p values are low again. Overall, the interaction between these two variables transitions from positive to negative enhancement. In Fig. 3b, within the same range of SIRI values, the effect of RDW_CV on the interaction is dichotomous: higher RDW_CV values are associated with a strong positive interaction, while lower RDW_CV values correlate with a strong negative interaction; the pattern then reverses, with high RDW_CV values showing a strong negative interaction and low RDW_CV values a strong positive interaction. Similar trends are observed for the other variables interacting with SIRI, indicating a complex pattern of interactions among the components of the XGB-PDHIs model. This complexity underscores the interdependent and regulatory nature of hematological indicators within the body's internal environment.
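A Fig. 3-style panel can be reproduced from SHAP interaction values with matplotlib. In the sketch below, `model`, `X_scaled`, and `feature_names` are assumed to be the fitted XGB-PDHIs model, the robust-scaled PDHI matrix, and the corresponding column names; plotting the symmetric sum of the two off-diagonal interaction entries is an illustrative choice, not necessarily the exact plotting convention used for Fig. 3.

```python
import matplotlib.pyplot as plt
import shap


def plot_siri_interaction(model, X_scaled, feature_names, partner="BAS_p"):
    """Scatter the SHAP interaction values between SIRI and a partner PDHI
    against robust-scaled SIRI, colored by the partner's scaled value."""
    inter = shap.TreeExplainer(model).shap_interaction_values(X_scaled)
    i, j = feature_names.index("SIRI"), feature_names.index(partner)
    plt.scatter(X_scaled[:, i],                   # robust-scaled SIRI on the x-axis
                inter[:, i, j] + inter[:, j, i],  # symmetric pairwise interaction term
                c=X_scaled[:, j], cmap="RdYlGn_r", s=10)
    plt.xlabel("SIRI (robust scaled)")
    plt.ylabel(f"SHAP interaction value: SIRI x {partner}")
    plt.colorbar(label=f"{partner} (robust scaled)")
    plt.show()
```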

Discussion

A vast array of studies has employed machine learning and statistical methods for AIS prediction. However, most of these studies focus on the prognosis of AIS, while research specifically aimed at predicting the incidence of AIS is less common37,38,39. Studies of AIS incidence risk frequently treat AIS as a uniform condition or introduce a single stratifying factor, such as hypertension or diabetes, to forecast AIS occurrence11,12. Research that incorporates multiple stratifying factors to identify specific populations, such as forecasting in elderly diabetic patients or in hypertensive patients with coronary artery disease, remains relatively uncommon40,41. This scarcity can largely be attributed to the challenge of gathering large samples for populations defined by numerous restrictive criteria. Furthermore, when multiple criteria are used to define a study population, the interactions among variables often become more intricate, and traditional statistical models frequently fall short in analyzing them accurately, limiting our understanding of AIS risk factors in these targeted cohorts. Our study overcomes these issues by extracting data from the HIS of a national-level neuro-specialty hospital, ensuring a substantial sample size, and by employing the XGBoost model to fully exploit the non-linear interactions between input variables42,43. Innovatively, we predicted the occurrence of AIS within three years in a patient cohort defined by three stratifying factors: elderly age, TIA, and hypertension. Each of these is a key factor for AIS incidence4,9,44, and elderly hypertensive patients with TIA are undoubtedly a high-risk group in need of predictive assessment for AIS. We opted for the simplest model, comprising only seven PDHIs ('SIRI', 'HCT', 'RDW_CV', 'PLT', 'BAS_p', 'IG_p', and 'EOS'), given its performance comparable to that of more complex models. This decision balanced predictive accuracy with practicality for clinical application, ensuring both efficacy and ease of use for future research and deployment.

Machine learning significantly enhances stroke prediction accuracy by focusing on pivotal risk factors and utilizing extensive healthcare datasets45. Recent reviews have identified several commonly used ML algorithms in cerebrovascular risk assessment, such as support vector machines, artificial neural networks, linear and logistic regression, and tree-based methods like random forests and gradient tree boosting45,46,47. Because no models have been specifically designed to predict AIS in elderly hypertensive patients with TIA, we screened 15 models incorporating these algorithms, and XGBoost emerged as the top performer. Its advanced tree-building and regularization techniques provide nuanced pattern recognition and help mitigate overfitting, rendering it particularly adept at predicting AIS within specific patient demographics48. Ruixuan Huang et al., using data from the Chinese Longitudinal Healthy Longevity Survey (CLHLS) and class imbalance techniques similar to ours, constructed multifactorial stroke prediction models for the elderly; the performance of these models was as follows: logistic regression (recall: 0.75, specificity: 0.68, AUC: 0.72), SVM (recall: 0.70, specificity: 0.72, AUC: 0.71), and random forest (recall: 0.62, specificity: 0.79, AUC: 0.71)49. Yuexin Qiu et al. compared multiple tree-based models after hyperparameter tuning in a large study of 46,240 participants, finding the best performance with random forest (sensitivity: 0.778, specificity: 0.913, AUC: 0.924) and XGBoost (sensitivity: 0.776, specificity: 0.916, AUC: 0.924)50. Chuan Hong et al., using neural networks and random survival forests on data from diverse large-scale studies in Western populations, fitted models for subgroups based on race, sex, and age, with the highest AUC of 0.75 for neural networks and 0.73 for random survival forests51. Our XGB-PDHIs model (sensitivity: 0.869, specificity: 0.936, AUC: 0.970) not only surpasses the performance of the above-mentioned specific-cohort models but is also precisely tailored to a more narrowly defined high-risk population: elderly hypertensive patients with TIA. Its input variables, derived from easily accessible clinical laboratory data, enhance its practicality and suitability for clinical application.

Our analysis prominently identifies SIRI as the most significant predictive factor, a finding consistent across the global sensitivity and feature importance analyses that reaffirms its pivotal role in our model. SIRI, indicative of systemic immune-inflammation, is calculated from neutrophil, monocyte, and lymphocyte counts and reflects the balance between inflammatory and immune responses52. Parameters such as HCT, RDW_CV, and PLT, linked to the erythrocyte and platelet series, have been widely acknowledged in numerous studies for their association with AIS development and progression24,53,54. These factors, relating to the blood's oxygen-carrying capacity, erythrocyte size variability, and clotting potential, are fundamentally connected to AIS via pathways such as inflammation, oxidative stress, endothelial dysfunction, hemostatic balance, and regulation of coagulation20,24,53,54,55. Although BAS_p, IG_p, and EOS have been less explored in AIS, their potential to provide unique predictive insights cannot be overlooked. One study indicates that elevated eosinophil cationic protein, a marker of eosinophil activity and degranulation, is associated with an increased incidence of AIS56. IG has been recommended as a new indicator of systemic inflammation with potential to predict AIS risk57, and BAS has been reported as a successful input variable in machine learning models for predicting AIS58. Notably, apart from SIRI's consistent top ranking, the order of the other indicators differs between the global sensitivity analysis and the SHAP-based feature importance analysis. Whereas global sensitivity analysis evaluates the impact of perturbing individual inputs on predictions, SHAP values capture both the direct and interaction effects of features on model outputs. This differential ranking highlights the complex nature of vascular mechanisms in the pathogenesis of AIS, where each predictor's biological significance may vary depending on its interactions with other factors59. Using SHAP interaction value plots, our study has uncovered, for the first time, the intricate and non-linear interplay among these hematological indicators (SIRI, HCT, RDW_CV, PLT, IG_p, BAS_p and EOS) in elderly hypertensive patients with TIA. These interactions exhibit considerable complexity and varying trends across individuals depending on the values of the different hematological indicators, underscoring the need for personalized AIS risk prediction in this demographic. Our XGB-PDHIs model emerges as a promising tool for such individualized predictions.

Advantages and limitations

Our study introduces a precise XGBoost model, meticulously developed to predict progression to AIS within three years in elderly hypertensive patients with TIA. The model follows a rigorous workflow and focuses on key PDHIs, and we conducted an in-depth analysis of the non-linear interactions between these PDHIs, elucidating their collective impact at the individual level on AIS risk assessment in this demographic. Our study also has some limitations. First, our findings were derived from a single-center dataset, which may limit their generalizability; multi-center studies with diverse patient cohorts would help validate and refine our predictive model. Second, our analysis centered primarily on pairwise interactions among variables; the investigation of higher-order interactions involving more than two factors, as well as the establishment of thresholds for interaction effects, remains unexplored and is a key area for future research. Third, although our XGBoost model shows promising results, machine learning offers possibilities for further improvement; future research could explore alternative models and reassess feature importance to potentially enhance our findings. Last, we recognize the potential influence of additional factors, such as nutritional, socioeconomic, and psychosocial elements, on the onset of AIS; integrating these factors could improve predictive accuracy and offer a more comprehensive understanding of AIS risk in elderly hypertensive patients with TIA.

Conclusion

We developed an optimized XGBoost model using selected PDHIs (XGB-PDHIs) that performed competitively against more complex models incorporating a wider range of variables, indicating that XGB-PDHIs captures the key variation necessary for accurate three-year AIS prediction in elderly hypertensive patients with TIA. Through model interpretability analysis and SHAP interaction value plots, our study revealed the importance of non-linear interactions among SIRI, HCT, RDW_CV, PLT, BAS_p, IG_p, and EOS in assessing AIS risk within this demographic. The XGB-PDHIs model, notable for its robust performance and practicality, provides a valuable contribution to AIS risk prediction by enabling more targeted screening and personalized risk assessment. Future work should focus on validating these findings in larger, multicenter studies and further investigating the interaction mechanisms linking key PDHIs to AIS risk.