Machine learning predictive model for aspiration screening in hospitalized patients with acute stroke

Park, Dougho; Son, Seok Il; Kim, Min Sol; Kim, Tae Yeon; Choi, Jun Hwa; Lee, Sang-Eok; Hong, Daeyoung; Kim, Mun-Chul

doi:10.1038/s41598-023-34999-8

Download PDF

Article
Open access
Published: 15 May 2023

Machine learning predictive model for aspiration screening in hospitalized patients with acute stroke

Dougho Park^1,2,
Seok Il Son³,
Min Sol Kim³,
Tae Yeon Kim⁴,
Jun Hwa Choi⁵,
Sang-Eok Lee²,
Daeyoung Hong⁶ &
…
Mun-Chul Kim⁶

Scientific Reports volume 13, Article number: 7835 (2023) Cite this article

1078 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Dysphagia is a fatal condition after acute stroke. We established machine learning (ML) models for screening aspiration in patients with acute stroke. This retrospective study enrolled patients with acute stroke admitted to a cerebrovascular specialty hospital between January 2016 and June 2022. A videofluoroscopic swallowing study (VFSS) confirmed aspiration. We evaluated the Gugging Swallowing Screen (GUSS), an early assessment tool for dysphagia, in all patients and compared its predictive value with ML models. Following ML algorithms were applied: regularized logistic regressions (ridge, lasso, and elastic net), random forest, extreme gradient boosting, support vector machines, k-nearest neighbors, and naïve Bayes. We finally analyzed data from 3408 patients, and 448 of them had aspiration on VFSS. The GUSS showed an area under the receiver operating characteristics curve (AUROC) of 0.79 (0.77–0.81). The ridge regression model was the best model among all ML models, with an AUROC of 0.81 (0.76–0.86), an F1 measure of 0.45. Regularized logistic regression models exhibited higher sensitivity (0.66–0.72) than the GUSS (0.64). Feature importance analyses revealed that the modified Rankin scale was the most important feature of ML performance. The proposed ML prediction models are valid and practical for screening aspiration in patients with acute stroke.

Prehospital stroke-scale machine-learning model predicts the need for surgical intervention

Article Open access 05 June 2023

Yoichi Yoshida, Yosuke Hayashi, … Taka-aki Nakada

A prehospital diagnostic algorithm for strokes using machine learning: a prospective observational study

Article Open access 15 October 2021

Yosuke Hayashi, Tadanaga Shimada, … Taka-aki Nakada

Development of postoperative delirium prediction models in patients undergoing cardiovascular surgery using machine learning algorithms

Article Open access 30 November 2023

Chie Nagata, Masahiro Hata, … Takayoshi Ueno

Introduction

Dysphagia is a common comorbidity after acute stroke¹, occurring in more than half of the stroke survivors². Moreover, dysphagia associated with acute stroke causes aspiration in many cases, which can result in severe complications such as aspiration pneumonia, dehydration, and malnutrition³. Post-stroke pneumonia occurs in about 15% of patients with acute stroke and is a fatal conditions with a 30-day mortality rate of up to 30%^4,5. Furthermore, it has been reported that up to 40% of patients with acute stroke are at risk of malnutrition, which is linked to pressure ulcers, increased dependency, prolonged institutionalization, and high mortality rates^6,7. Moreover, aspiration increases the burden of the initial medical treatment, and active rehabilitation and return to society are inevitably delayed. As a result, the patient’s long-term prognosis is adversely affected due to dysphagia and aspiration, which in turn causes a vicious cycle that deteriorates the patient’s quality of life⁸.

Therefore, early screening for dysphagia and aspiration develops an appropriate feeding strategy⁹. The Gugging Swallowing Screen (GUSS), introduced by Trapl et al.¹⁰, is one of the most widely used dysphagia screening tools in the clinical field and has undergone the most validation testing^11,12. The GUSS comprises direct and indirect evaluations. The indirect test assesses dry saliva swallowing, level of consciousness, and the ability to cough. The direct test evaluates signs of swallowing difficulties, such as delayed swallowing, coughing, drooling, and voice changes after food intake¹³. Although the GUSS has the advantage of being relatively easy to perform at the bedside, its direct evaluation is invasive and carries aspiration pneumonia risk in patients with acute stroke. In addition, the examiner should be sufficiently trained to ensure the reliability of the results. Further, the direct swallowing test sometimes has a high false-positive rate, leading to unnecessary referrals for further testing¹⁴. Finally, accurate evaluation is limited in patients with cognitive impairment or communication difficulties¹⁵. There are questionnaires such as the dysphagia handicap index and eating assessment tool-10 to screen dysphagia early in hospitalization^16,17; they are widely used because of their easy-to-perform and non-invasive advantages and have been translated into various languages and passed through many validation tests. However, they also have a fundamental disadvantage because they can be applied only to a limited patient group. Moreover, some screening tools not only lack standardization in terms of the timing and frequency of examination but also have not been adequately validated in diverse patient populations or settings¹⁸. For these reasons, studies reporting on the predictive power of dysphagia screening tools were often heterogenous and did not provide precisely estimated results¹⁹.

The videofluoroscopic swallowing study (VFSS) has been widely used as a confirmatory test to diagnose aspiration²⁰; despite the advantage of accurately determining whether aspiration is achieved through visualization of all stages of swallowing, it is still invasive and has the disadvantage of being exposed to radiation²¹. In addition, the examination C-arm device is required, and the test is possible in a state where the patient can maintain an appropriate posture²². VFSS also needs suitable space and schedule for these two limitations. Therefore, it can be considered that the VFSS is inappropriate for screening patients with acute stroke.

Machine learning (ML) algorithms are good at performing regression and classification by learning from tubular data²³. Some parts can be a bit confusing when you come across the term “learning,” but it would be correct to say that ML algorithms perform calculations rather than learning. The generally used method in current ML-related medical research has been supervised learning, which compares the real-world output that comes out through a human expert’s decision with the result calculated by ML algorithms based on the same input data^24,25. Based on electrical health records (EHRs), ML models have been widely proposed for the diagnosis, treatment, and prognosis of diseases^26,27. In particular, ML models for the early detection of diseases have been presented in various fields, such as coronary heart disease²⁸, aortic dissections²⁹, depression³⁰, and Alzheimer’s disease³¹, and the results have demonstrated that ML prediction models are comparable to existing screening tools. In addition, ML algorithms have the advantage of using EHR to develop predictive models relatively readily and efficiently, even using a dataset consisting of a large sample with numerous variables. However, despite the rapid expansion of medical research applying ML algorithms, to the best of our knowledge, no reports have presented an ML-based model for screening aspiration in patients with acute stroke. Although Jauk et al.³² introduced the ML-based dysphagia prediction model, which showed acceptable prediction performance in the geriatric cohort, no study targeted patients with acute stroke using ML-based aspiration prediction models.

This study aimed to establish ML prediction models to screen for aspiration, confirmed by VFSS, particularly applicable for patients with acute stroke. According to the screening tool’s purpose, the initial information obtained before the VFSS was used as potential predictors and compared the performance of ML models with the predictive power of the GUSS as a traditional screening tool. To explain the causality of variables, we used both the stepwise logistic regression model and the feature importance analysis of ML models. Ultimately, this study investigated whether ML prediction models enabled early and accurate aspiration screening after acute stroke and could be a reliable alternative to traditional aspiration screening.

Methods

Study population and ethical statements

This retrospective study utilized EHRs of patients hospitalized with acute stroke between January 2016 and June 2022 at a single cerebrovascular specialty hospital. Acute stroke was defined as hospitalization within seven days of new-onset stroke, and patients hospitalized with International Classification of Diseases-10 codes of I60–I63 were selected. The following exclusion criteria were applied in this study: (1) discharged before completion of the VFSS, (2) VFSS failed because of poor cooperation, (3) unspecified stroke type or unclear diagnosis, (4) missing values or lack of clinical information (more than 20%), (5) mortality during the hospitalization period, (6) head and neck cancers, (7) neuromuscular diseases, and (8) underwent prior radiation therapy (Fig. 1). The institutional review board of Pohang Stroke and Spine Hospital reviewed and approved the study design (PSSH0475-202201-HR-001-01). All data was anonymized, excluding patients’ resident and hospital registration numbers and detailed home addresses. Then the dataset was exported to the authorized researcher for this study. Informed consent was waived owing to the study’s retrospective nature by the institutional review board of Pohang Stroke and Spine Hospital. This study was conducted in compliance with the Declaration of Helsinki and the International Conference on Harmonization–Good Clinical Practice Guidelines.

The rationale for selecting the potential predictors

We applied the following criteria in extracting potential predictors. First, personal and clinical information available within a short time after admission was defined as potential predictors to develop an ML prediction model for screening purposes. Second, as much as possible, the variables identified as risk factors for post-stroke dysphagia or pneumonia reported in previous studies were included. Thirdly, the variable had to be reliably evaluated and readily extracted from EHR, and the missing value should not exceed 20% of the total.

Age and sex have been known risk factors for stroke and stroke-related pneumonia³³. Meanwhile, smoking, obesity, and comorbidities such as hypertension, diabetes, dyslipidemia, and previous cerebrovascular lesions, which act as vascular risk factors, are also significant variables that increase the risk of aspiration after stroke^34,35. Symptoms related to motor impairment, such as impaired physical morbidity and dysarthria, are also known to increase the risk of stroke-related aspiration³⁶. Among stroke-related factors, it has been known that the higher frequency of dysphagia was associated with brain stem lesions, hemorrhagic stroke, and stroke severity³⁷. Additionally, malnutrition is a complication of dysphagia and increases the risk of stroke-related aspiration³⁸.

Basic patient information, including age, sex, and socioeconomic status, was identified first based on these previous reports. Then, to confirm the nutritional state and general condition of the patient with acute stroke, mental status, vital signs, and laboratory findings at admission were investigated. In addition, patient or guardian interviews, medical records, and medication history were used to identify the patient’s comorbidities. As stroke-related factors, stroke type and territories were identified. Further, the modified Rankin scale, Morse Fall scale, facial asymmetry, and aphasia were checked to identify the motor and functional impairments. Detailed definitions of each variable are presented in Supplementary Table S1.

Patients with acute stroke underwent GUSS examinations when consultation with early rehabilitation was received; this mainly occurred before the VFSS. Skilled occupational therapists performed the GUSS. The highest score on the GUSS is 20; the higher score means less severe swallowing difficulties.

Videofluoroscopic swallowing study and outcome definition

We used a ZEN-5000 C-arm fluoroscope for the VFSS (Genoray Inc., Seongnam, Korea). The patient maintained an upright sitting posture in a chair or wheelchair, and postural support was provided if the patient could not sit upright. As a contrast agent, 230% barium liquid was diluted to approximately 35% in free water. Food forms consisted of solid, semi-solid, and liquid (2 ml, 5 ml, and 90 ml, respectively). Three examiners from a multidisciplinary team performed the VFSS and on-site interpretations. The team consisted of rehabilitation medicine specialists, occupational therapists, and a speech-language therapist. The next day, the same team reviewed the video recording again for an accurate interpretation. Interpretations were primarily based on the patient’s sagittal view images. We defined aspiration, the target outcome of this study, as the detection of one or more swallowing with a Penetration-Aspiration scale score of 6–8 on the VFSS during hospitalization³⁹.

Statistical analysis

Statistical analyses were performed using R software version 4.2.3 (R Core Team, R Foundation for Statistical Computing, Vienna, Austria). Continuous variables were tested for normality using the Shapiro–Wilk test and are expressed as median (interquartile range). The Wilcoxon rank-sum test was then applied for comparative analysis between the two groups. Categorical variables are expressed as frequency (proportion). The chi-squared (trend) test was used for comparative analysis between the two groups. The area under the receiver operating characteristic curve (AUROC) was analyzed using the “Epi” package in the R software to determine the predictive value of the GUSS for aspiration⁴⁰. We established a stepwise logistic regression model using the backward elimination method to interpret the adjusted odds ratio (aOR) for predicting aspiration. During the stepwise elimination of covariates, the model fitness was assessed using the Akaike information criterion. Multicollinearity between variables was confirmed using the variation inflation factor, with sqrt (variation inflation factor) > 2 as the threshold. We defined statistical significance as a P-value less than 0.05.

Machine learning

Data pre-processing and model establishing

We used the “caret” package of the R software for the ML modeling process⁴¹. Before ML modeling, the data were pre-processed. First, we identified variables with near-zero-variance and removed them. Then, the threshold was set at a correlation coefficient > 0.7 to check for multicollinearity between continuous variables. Continuous variables were then subjected to centering and scaling. Categorical variables underwent one-hot encoding and were transformed into dummy variables. We also detected and removed variables and individuals with more than 20% missing values. Then, we imputed remained missing values while applying a multivariate imputation via the method of the chained equation⁴².

We randomly allocated the entire data into 75% of the training set and 25% of the test set for ML prediction. A synthetic minority oversampling technique was applied to balance the target classes of the training dataset. We utilized the following ML algorithms to generate the prediction model: regularized logistic regression (RLRs)–ridge, lasso, and elastic net–and ensemble algorithms such as random forest (RF) and extreme gradient boosting (XGB). We also utilized classic ML classifiers such as support vector machines (SVM), k-nearest neighbors (KNN), and naïve Bayes (NB). We performed five-fold cross-validation with 50 repeats for an optimal training model. In addition, we used a random or grid search for hyperparameter tuning. We provide tuned hyperparameters and their searching method for each model in Supplementary Table S2. The AUROC, F1 score, sensitivity, and specificity were used as metrics to measure the performance of the ML models (Fig. 2). The entire code for the machine learning process is available in the Online Supplementary Content S1.

Regularized logistic regressions

Some classical algorithms have the advantage of being fast and easy to apply, but the biggest problem is the possibility of overfitting⁴³. Overfitting is defined as when the data has many features, and the hypothesis function fits nicely on the training data. However, it fails to generalize the validation data, a common problem when doing ML modeling⁴⁴. Logistic regression is based on a linear model commonly used in medical statistics, and the more features it has, the more vulnerable it is to overfitting. RLR proceeds in a way that minimizes overfitting through regularization; it overcomes overfitting with a non-sparse solution (L2 regularization, Ridge) or sparse solution method (L1 regularization, Lasso) for high-order variables in a linear equation while maximizing predictive power⁴⁵. Meanwhile, the elastic net method performs regularization by the hybrid method of Ridge and Lasso⁴⁶.

Ensemble algorithms

Ensemble learning is a technique for deriving more accurate results by creating multiple classifiers and combining the predictions. This method helps more accurate prediction by combining several weaker models instead of one robust model, and bagging and boosting types are the most representative^47,48.

RF is a representative ensemble algorithm that uses a bagging method based on a decision tree⁴⁹. It is an algorithm that improves predictive power while solving the overfitting problem that inevitably occurs as the number of branches in the decision tree increases⁵⁰. This algorithm allows duplication of data division during the bagging process, and through this, a unique dataset can be continuously formed⁵¹.

The boosting method differs from bagging in that several classifiers perform learning sequentially, and predictions are performed while weighting the next classifier⁵². One representative boosting module is XGB, which provides optimized custom options by providing parallel processing techniques and various hyperparameter settings⁵³. Therefore, it solves the problem of slow process and overfitting of the boosting method with sequential features in general and shows high predictability simultaneously⁵⁴.

Other classic classifiers

SVM is a classical ML algorithm that creates a virtual vector space; then, it finds the margins that separate each group and recognizes patterns based on such boundaries⁵⁵. SVM performs the classification task by maximizing the distance between the margins that classify the two groups⁵⁶. SVM can be used not only for linear classification but also for non-linear classification through kernelization. On the other hand, SVM is not suited for datasets with a lot of noise⁵⁷.

KNN works by finding the nearest neighbors based on the distance between data points and predicting the label of new data by referring to the labels of those neighbors⁵⁸. KNN measures the distance between all data points each time, so the computation cost is high. However, it has the advantage of obtaining simple and good classification performance when the data is relatively small⁵⁹.

Bayes' theorem is a formula that calculates conditional probability, which is the probability of an event occurring, given that another event has already happened⁶⁰. Based on Bayes' theorem, NB calculates the probability that the input data belongs to each class. Like other classic classifiers, NB has the advantage of fast model learning and efficient data processing⁶¹. On the other hand, NB is calculated based on the assumption that each feature is independent. Therefore, accuracy may be low if some features' independence assumption is unsuitable⁶².

Results

Baseline characteristics

A total of 3408 hospitalized patients with acute stroke were included for analysis. Among them, 448 patients presented with aspiration on VFSS during hospitalization. The results of the baseline characteristics and comparison analyses between the aspiration and non-aspiration groups are presented in Table 1. The aspiration group was significantly older than the non-aspiration group (73.0 [63.0–79.0] vs. 67.0 [58.0–77.0] years old; p < 0.001). Furthermore, the ratio of males, medical aid, previous cerebrovascular accidents, and diabetes were significantly higher in the aspiration group (p = 0.002, p < 0.001, p < 0.001, and p = 0.036, respectively). In addition, the ratio of dyslipidemia was significantly lower in the aspiration group (p = 0.013). Among the stroke-related features, the aspiration group had a significantly higher rate of hemorrhagic stroke (25.9% vs. 18.5%; p < 0.001), initially altered mental status (29.5% vs. 6.3%; p < 0.001), aphasia (18.1% vs. 5.5%; p < 0.001), and facial asymmetry (63.4% vs. 40.0%; p < 0.001) than the non-aspiration group. Additionally, the aspiration group showed a significantly higher rate of patients admitted via the emergency department (91.1% vs. 86.9%; p = 0.016) and more severe functional deterioration–higher modified Rankin scale and Morse Fall scale (3.0 [2.0–4.0] vs. 2.0 [1.0–3.0]; p < 0.001 and 35.0 [35.0–50.0] vs. 35.0 [20.0–35.0]; p < 0.001, respectively). Finally, days from admission to the initial VFSS study were significantly longer in the aspiration group than in the non-aspiration group (5.0 [3.0–8.0] vs. 2.0 [1.0–5.0] days; p < 0.001).

Table 1 Baseline characteristics.

Full size table

Comparisons of initial laboratory findings between the two groups are presented in Supplementary Table S3. The aspiration group showed a significantly lower albumin level, hemoglobin, platelet, total cholesterol, and triglyceride (p < 0.001, p = 0.012, p = 0.025, p = 0.011, and p < 0.001, respectively). Furthermore, the random glucose level was significantly higher in the aspiration group (p = 0.046).

Aspiration screening with the GUSS

The GUSS score was significantly lower in the aspiration group (9.0 [7.0–14.0]) than in the non-aspiration group (20.0 [13.0–20.0]) (p < 0.001) (Table 1). When evaluating the predictive value of the GUSS for aspiration, the AUROC was 0.79 (0.77–0.81), and the cut-off score was 14.5. Based on the cut-off value, the F1 measure was 0.39, the sensitivity was 0.64, and the specificity was 0.83 (Table 2).

Table 2 Prediction performance.

Full size table

Machine learning models

We provide the number of samples after random allocation and target class balancing for each ML model in Supplementary Table S4. The predictive values and confusion matrix for each model are provided in Table 2 and Supplementary Table S5, respectively. Overall, the RLRs, RF, XGB, and NB algorithms showed AUROC values similar to that of the GUSS. Among the applied ML algorithms, ridge regression showed the highest AUROC (0.81 [0.76–0.86]) and F1 measure (0.45). The elastic net regression had the highest sensitivity (0.72), higher than that of the GUSS (0.64). The RF, XGB, SVM, and NB models showed low sensitivity and high specificity.

Most ML algorithms identified the modified Rankin scale as the most important variable for their performance. For RLRs, mental status, facial asymmetry, stroke territory, and sex were highly important features for the prediction. Meanwhile, days to the VFSS study were also relatively crucial for other ML algorithms’ prediction performance. The entire list of the top-five most important variables for each model is shown in Fig. 3.

The stepwise logistic regression model

The final logistic regression model and covariates are provided in Table 3. Higher age (aOR, 1.03; 95% confidence interval [CI] 1.01–1.04; p < 0.001), male sex (aOR, 2.19; 95% CI 1.71–2.81; p < 0.001), days to initial VFSS (aOR, 1.02; 95% CI 1.01–1.04; p = 0.002), posterior circulation stroke (aOR, 1.59; 95% CI 1.21–2.09; p = 0.001), altered mental status (aOR, 2.61; 95% CI 1.89–3.60; p < 0.001), aphasia (aOR, 1.95; 95% CI 1.35–2.81; p < 0.001), higher modified Rankin scale score (aOR, 1.63; 95% CI 1.47–1.80; p < 0.001), previous cerebrovascular accidents (aOR, 1.48; 95% CI 1.13–1.94; p = 0.004), and higher systolic blood pressure (aOR, 1.11; 95% CI 1.05–1.18; p < 0.001) were significantly associated with a higher risk of aspiration. In contrast, facial symmetry (aOR, 0.49; 95% CI 0.39–0.62; p < 0.001), higher body mass index (aOR, 0.96; 95% CI 0.93–1.00; p = 0.039), left side lesion (aOR, 0.75; 95% CI 0.59–0.96; p = 0.023), and higher diastolic blood pressure (aOR, 0.87; 95% CI 0.79–0.96; p = 0.007) were significantly associated with a lower risk of aspiration.

Table 3 Final logistic regression model for identifying risk factors of aspiration after acute stroke.

Full size table

Discussion

In patients hospitalized with acute stroke, we compared the predictive value of aspiration with the GUSS, an existing dysphagia screening tool. We proposed ML models based on the patients’ initial information to enable early screening. Among applied ML algorithm predictors, RLRs showed valid prediction performances and were not inferior to GUSS. This study provides a significant contribution to the field because it is the first study to develop ML models to screen aspiration in patients with acute stroke with a relatively large sample compared to related previous studies. In addition, our study demonstrated that a new aspiration screening tool could be developed by utilizing scattered and various clinical information from hospitalized patients with acute stroke. Therefore our ML models have the potential to minimize the time and efforts of medical staff for screening dysphagia after acute stroke in the clinical field and enable an efficient decision-making process, ultimately improving patient outcomes.

Early dysphagia assessment in patients with acute stroke is critical and essential for establishing a dietary and fluid intake strategy¹⁸. Screening tools for dysphagia can identify swallowing difficulties before confirmatory studies, such as a VFSS or fiberoptic endoscopic evaluation of swallowing, which require a separate space and scheduling. These screening tools have been found to reduce the rate of aspiration pneumonia and unnecessary tube feeding or nil per os period in patients with acute stroke¹². However, although traditional screening tools can be applied to most patients in an awake and alert state, an accurate evaluation is impossible if there is a cognitive decline or the patient cannot obey instructions because of aphasia. In particular, questionnaires such as the dysphagia handicap index and eating assessment tool-10 are relatively more restricted by the limitations mentioned above, although they have the advantage of being non-invasive, unlike the GUSS^16,17. Consequently, a significant limitation of these traditional screening tools is their limited ability to detect dysphagia in patients who may be highly likely at risk for aspiration conditions after acute stroke.

In this study, we demonstrated that the limitations of these existing screening tools could be overcome through ML prediction models, especially in unstable patients with acute stroke. In particular, compared to the GUSS, ML-based models demonstrated a vital advantage; they could predict aspiration with similar performance without requiring an invasive direct swallowing test. Moreover, another advantage is that aspiration screening can be performed much more readily and efficiently based on the initial clinical information obtained from patients with acute stroke⁶³. Unlike traditional screening tools relying on subjective assessments by human experts, ML-based models also have the potential to screen for aspiration more objective and standardized manner by utilizing various features. Furthermore, ML models have the evolutionary potential to continuously learn and adapt to new data, leading to further improvement in their predictive performance over time. Overall, using an ML-based aspiration screening tool potentially contribute to improving patient outcomes and reducing costs.

Our newly developed ML-based screening tool showed valid performance compared to previous dysphagia screening tools. Unfortunately, few studies have attempted to predict precisely aspiration, not overall dysphagia, after acute stroke. Kim et al.¹³ identified the predictive values for aspiration on the GUSS and dysphagia handicap index in a single-center prospective study, with AUROCs of 0.77 and 0.79, respectively; our ML models’ prediction performances were not inferior to their screening tools. Warnecke et al.¹¹ conducted a study to predict aspiration using the GUSS in 100 patients with acute stroke, with an AUROC confirmed as 0.76. Meanwhile, Edmiaston et al.¹⁴ introduced a bedside stroke dysphagia screen named Barnes-Jewish Hospital-Stroke Dysphagia Screen. In their study with 225 patients with acute stroke, sensitivity and specificity for detecting aspiration were 95% and 50%, respectively, showing relatively higher false-positive rates. Leder et al.⁶⁴ reported clinical predictors such as dysphonia, dysarthria, abnormal gag reflex, abnormal volitional cough, cough after swallowing, and voice change after swallowing for post-stroke aspiration. Their model’s sensitivity and specificity were 80% and 30% for predicting aspiration, respectively, similar to Edmiaston et al.’s. Meanwhile, RLR models in our study showed relatively balanced sensitivity and specificity compared to other models through regularization. Consequently, the ML-based aspiration prediction models presented in this study showed similar or slightly better predictive values than the previous results of dysphagia screening tools.

This study is significant in providing clinical clues while analyzing both ML and stepwise logistic regression models; it comprehensively examined related predictors and presented their serial importance. Our results showed that functional level was a significant predictor of aspiration. The modified Rankin scale provides a functional evaluation of stroke severity⁶⁵. Days to initial VFSS also showed high feature importance; we inferred that this feature was an indirect indicator of medical complications or functional level, demonstrating that the patient could sit upright for the VFSS. In a previous study, Henke et al.⁶⁶ demonstrated stroke severity as a reliable and straightforward predictor, consistent with our findings. As reported in previous studies, facial asymmetry was also a significant predictor of aspiration^67,68.

Some clinical findings showed notable results. This study showed that the dyslipidemia rate was higher in the non-aspiration group. In a previous study, Scheitz et al.⁶⁹ reported that statin users’ risk of post-stroke pneumonia was reduced; the results of this study supported their findings, which might be related to the intravascular anti-inflammatory effect of statin⁷⁰. However, in both groups, the frequency of dyslipidemia was less than 10%; therefore, it needs to be cautious for this interpretation. Meanwhile, in the logistic regression model of this study, it was confirmed that the higher the systolic blood pressure and the lower the diastolic blood pressure, the higher the risk of aspiration. These results were inferred because extraordinarily high or low blood pressure on admission was associated with worse stroke severity⁷¹. However, clinical findings such as older age, male, previous cerebrovascular disease, stroke in the posterior circulation, and altered mental status were also factors related to stroke severity. They were associated with a significantly high aspiration risk in this study, consistent with previous studies’ results^72,73.

We designated the AUROC and F1 measures as metrics because of an imbalance in the dependent variable. A linear model slightly outperformed the ensembled algorithms and other classical classifiers utilized in our study. The best model in terms of AUROC was ridge regression. Meanwhile, the elastic net regression method, which combines ridge and lasso regularization for the linear model⁷⁴, showed the highest sensitivity among other ML models. However, the ensembled algorithms, such as RF and XGB, showed low sensitivity with very high specificity. Thus, the ability to discriminate negative cases was high, somewhat inconsistent with the original purpose of screening aspiration. These results demonstrated that approaches using ML sometimes easily over-rely on some features, resulting in overfitting, which leads to non-generalizable results⁷⁵. We also infer from these results that regularization methods more effectively reduced overfitting than ensemble algorithms in our dataset⁷⁶. Appropriate regularization techniques are crucial in deriving generalizable and applicable results from ML models.

The study has several limitations. First, it was a single-center, retrospective study. Due to the study's retrospective nature, some variables related to dysphagia, such as sensory change of throat, were not included as covariates because they could not be consistently evaluated in patients with acute stroke. Moreover, the generalizability of our results is not verified and requires further validation through multicenter studies. Second, we attempted to create an ML model that can be widely applied to all patients with acute stroke. However, stroke is a broad-spectrum disease entity. If subgroup analyses may yield better model performance. Thus, future studies should establish more specific prediction models for certain stroke patients. These models can be helpful for patients with mild stroke to avoid unnecessary radiation exposure as well as enable quick decisions to prevent aspiration pneumonia in patients with severe stroke before a VFSS study. Third, we could not present longitudinal findings regarding the prediction of long-term outcomes; this limitation was primarily related to the hospital setting and rehabilitation treatment delivery system of South Korea. We observed that many patients were transferred to rehabilitation or convalescent hospitals after acute care, and some were not reliably followed long-term. Finally, we could only compare predictive values between ML algorithms and GUSS among several existing dysphagia screening tools. Future studies should verify the validity of our proposed ML models for other screening tools.

In conclusion, this study demonstrated that an ML-based screening model was not inferior to the GUSS in predicting aspiration in hospitalized patients with acute stroke. The RLRs showed better performance among the evaluated ML algorithms. Our findings suggest that ML prediction models can be efficient and straightforward, reducing the time and efforts of medical staff for dysphagia screening in patients with acute stroke. Furthermore, ML prediction models are objective and can overcome the limitations of previous dysphagia screening tools. However, additional validation is required, and specific ML models for each subgroup based on stroke severity and subtype are necessary for clinical applications.

Data availability

All data generated or analysed during this study are included in its supplementary information files.

References

Martino, R. et al. Dysphagia after stroke: Incidence, diagnosis, and pulmonary complications. Stroke 36, 2756–2763. https://doi.org/10.1161/01.STR.0000190056.76543.eb (2005).
Article PubMed Google Scholar
Gonzalez-Fernandez, M., Ottenstein, L., Atanelov, L. & Christian, A. B. Dysphagia after Stroke: an overview. Curr. Phys. Med. Rehabil. Rep. 1, 187–196. https://doi.org/10.1007/s40141-013-0017-y (2013).
Article PubMed PubMed Central Google Scholar
Lundy, D. S. et al. Aspiration: cause and implications. Otolaryngol. Head Neck Surg. 120, 474–478. https://doi.org/10.1053/hn.1999.v120.a91765 (1999).
Article CAS PubMed Google Scholar
Kishore, A. K. et al. How is pneumonia diagnosed in clinical stroke research? A systematic review and meta-analysis. Stroke 46, 1202–1209. https://doi.org/10.1161/STROKEAHA.114.007843 (2015).
Article PubMed Google Scholar
Katzan, I. L., Cebul, R. D., Husak, S. H., Dawson, N. V. & Baker, D. W. The effect of pneumonia on mortality among patients hospitalized for acute stroke. Neurology 60, 620–625. https://doi.org/10.1212/01.wnl.0000046586.38284.60 (2003).
Article CAS PubMed Google Scholar
Sabbouh, T. & Torbey, M. T. Malnutrition in stroke patients: Risk factors, assessment, and management. Neurocrit. Care 29, 374–384. https://doi.org/10.1007/s12028-017-0436-1 (2018).
Article PubMed PubMed Central Google Scholar
Smithard, D. G., Smeeton, N. C. & Wolfe, C. D. Long-term outcome after stroke: Does dysphagia matter?. Age Ageing 36, 90–94. https://doi.org/10.1093/ageing/afl149 (2007).
Article CAS PubMed Google Scholar
Gustafsson, B. & Tibbling, L. Dysphagia, an unrecognized handicap. Dysphagia 6, 193–199. https://doi.org/10.1007/BF02493525 (1991).
Article CAS PubMed Google Scholar
Etges, C. L., Scheeren, B., Gomes, E. & Barbosa Lde, R. Screening tools for dysphagia: A systematic review. Codas 26, 343–349. https://doi.org/10.1590/2317-1782/20142014057 (2014).
Trapl, M. et al. Dysphagia bedside screening for acute-stroke patients. Stroke 38, 2948–2952. https://doi.org/10.1161/strokeaha.107.483933 (2007).
Article PubMed Google Scholar
Warnecke, T. et al. Aspiration and dysphagia screening in acute stroke—the Gugging Swallowing Screen revisited. Eur. J. Neurol. 24, 594–601. https://doi.org/10.1111/ene.13251 (2017).
Article CAS PubMed Google Scholar
Benfield, J. K., Everton, L. F., Bath, P. M. & England, T. J. Accuracy and clinical utility of comprehensive dysphagia screening assessments in acute stroke: A systematic review and meta-analysis. J. Clin. Nurs. 29, 1527–1538. https://doi.org/10.1111/jocn.15192 (2020).
Article PubMed Google Scholar
Cao, Y. et al. A linkage representation of the human hand skeletal system using CT hand scan images. Appl. Sci. 11, 5857. https://doi.org/10.3390/app11135857 (2021).
Article CAS Google Scholar
Edmiaston, J., Connor, L. T., Steger-May, K. & Ford, A. L. A simple bedside stroke dysphagia screen, validated against videofluoroscopy, detects dysphagia and aspiration with high sensitivity. J. Stroke Cerebrovasc. Dis. 23, 712–716. https://doi.org/10.1016/j.jstrokecerebrovasdis.2013.06.030 (2014).
Article PubMed Google Scholar
Park, K. D., Kim, T. H. & Lee, S. H. The Gugging Swallowing Screen in dysphagia screening for patients with stroke: A systematic review. Int. J. Nurs. Stud. 107. https://doi.org/10.1016/j.ijnurstu.2020.103588 (2020).
Belafsky, P. C. et al. Validity and reliability of the Eating Assessment Tool (EAT-10). Ann. Otol. Rhinol. Laryngol. 117, 919–924. https://doi.org/10.1177/000348940811701210 (2008).
Article PubMed Google Scholar
Silbergleit, A. K., Schultz, L., Jacobson, B. H., Beardsley, T. & Johnson, A. F. The Dysphagia handicap index: Development and validation. Dysphagia 27, 46–52. https://doi.org/10.1007/s00455-011-9336-2 (2012).
Article PubMed Google Scholar
Poorjavad, M. & Jalaie, S. Systemic review on highly qualified screening tests for swallowing disorders following stroke: Validity and reliability issues. J. Res. Med. Sci. 19, 776–785 (2014).
PubMed PubMed Central Google Scholar
Boaden, E. et al. Screening for aspiration risk associated with dysphagia in acute stroke. Cochrane Database Syst. Rev. 10, CD012679. https://doi.org/10.1002/14651858.CD012679.pub2 (2021).
Giraldo-Cadavid, L. F. et al. Accuracy of endoscopic and videofluoroscopic evaluations of swallowing for oropharyngeal dysphagia. Laryngoscope 127, 2002–2010. https://doi.org/10.1002/lary.26419 (2017).
Article PubMed Google Scholar
Pikus, L. et al. Videofluoroscopic studies of swallowing dysfunction and the relative risk of pneumonia. AJR Am. J. Roentgenol. 180, 1613–1616. https://doi.org/10.2214/ajr.180.6.1801613 (2003).
Article PubMed Google Scholar
Palmer, J. B., Kuhlemeier, K. V., Tippett, D. C. & Lynch, C. A protocol for the videofluorographic swallowing study. Dysphagia 8, 209–214. https://doi.org/10.1007/BF01354540 (1993).
Article CAS PubMed Google Scholar
Kersting, K. Machine learning and artificial intelligence: Two fellow travelers on the quest for intelligent behavior in machines. Front. Big Data 1, 6. https://doi.org/10.3389/fdata.2018.00006 (2018).
Article PubMed PubMed Central Google Scholar
Handelman, G. S. et al. eDoctor: Machine learning and the future of medicine. J. Intern. Med. 284, 603–619. https://doi.org/10.1111/joim.12822 (2018).
Article CAS PubMed Google Scholar
Sidey-Gibbons, J. A. M. & Sidey-Gibbons, C. J. Machine learning in medicine: A practical introduction. BMC Med. Res. Methodol. 19, 64. https://doi.org/10.1186/s12874-019-0681-4 (2019).
Article PubMed PubMed Central Google Scholar
Schwartz, J. T. et al. Applications of machine learning using electronic medical records in spine surgery. Neurospine 16, 643–653. https://doi.org/10.14245/ns.1938386.193 (2019).
Article PubMed PubMed Central Google Scholar
Maarseveen, T. D. et al. Machine learning electronic health record identification of patients with rheumatoid arthritis: Algorithm pipeline development and validation study. JMIR Med. Inf. 8. https://doi.org/10.2196/23930 (2020).
Zhang, J. et al. Ensemble machine learning approach for screening of coronary heart disease based on echocardiography and risk factors. BMC Med. Inform. Decis. Mak. 21, 187. https://doi.org/10.1186/s12911-021-01535-5 (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, L. et al. An early aortic dissection screening model and applied research based on ensemble learning. Ann. Transl. Med. 8, 1578. https://doi.org/10.21037/atm-20-1475 (2020).
Article CAS PubMed PubMed Central Google Scholar
Souza Filho, E. M. et al. Can machine learning be useful as a screening tool for depression in primary care? J. Psychiatr. Res. 132, 1–6. https://doi.org/10.1016/j.jpsychires.2020.09.025 (2021).
Carpenter, K. A. & Huang, X. Machine learning-based virtual screening and its applications to Alzheimer’s drug discovery: A review. Curr. Pharm. Des. 24, 3347–3358. https://doi.org/10.2174/1381612824666180607124038 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jauk, S. et al. Evaluation of a machine learning-based dysphagia prediction tool in clinical routine: A prospective observational cohort study. Dysphagia, 1–9. https://doi.org/10.1007/s00455-022-10548-9 (2023).
Sui, R. & Zhang, L. Risk factors of stroke-associated pneumonia in Chinese patients. Neurol. Res. 33, 508–513. https://doi.org/10.1179/016164111X13007856084205 (2011).
Article PubMed Google Scholar
Ishigami, K. et al. Association of severe hypertension with pneumonia in elderly patients with acute ischemic stroke. Hypertens Res. 35, 648–653. https://doi.org/10.1038/hr.2012.7 (2012).
Article PubMed PubMed Central Google Scholar
Grossmann, I. et al. Stroke and pneumonia: Mechanisms, risk factors, management, and prevention. Cureus 13, e19912. https://doi.org/10.7759/cureus.19912 (2021).
Oliveira, A. R. d. S. et al. Clinical factors predicting risk for aspiration and respiratory aspiration among patients with Stroke. Revista Latino-Americana de Enfermagem 23, 216–224. https://doi.org/10.1590/0104-1169.0197.2545 (2015).
Kumar, S. et al. Recovery of swallowing after dysphagic stroke: An analysis of prognostic factors. J. Stroke Cerebrovasc. Dis. 23, 56–62. https://doi.org/10.1016/j.jstrokecerebrovasdis.2012.09.005 (2014).
Article PubMed Google Scholar
Matsumura, T., Mitani, Y., Oki, Y., Fujimoto, Y. & Ishikawa, A. Risk factors for the onset of aspiration pneumonia among stroke patients in the recovery stage. Nihon Ronen Igakkai Zasshi 51, 364–368. https://doi.org/10.3143/geriatrics.51.364 (2014).
Article PubMed Google Scholar
Rosenbek, J. C., Robbins, J. A., Roecker, E. B., Coyle, J. L. & Wood, J. L. A penetration-aspiration scale. Dysphagia 11, 93–98. https://doi.org/10.1007/BF00417897 (1996).
Article CAS PubMed Google Scholar
Carstensen, B., Plummer, M., Laara, E. & Hills, M. Epi: A Package for Statistical Analysis in Epidemiology. R package version 2.44. (2021).
Kuhn, M. caret: Classification and Regression Training. R package version 6. 0–90. (2021).
Zhang, Z. Multiple imputation with multivariate imputation by chained equation (MICE) package. Ann. Transl. Med. 4, 30. https://doi.org/10.3978/j.issn.2305-5839.2015.12.63 (2016).
Article PubMed PubMed Central Google Scholar
Demsar, J. & Zupan, B. Hands-on training about overfitting. PLoS Comput. Biol. 17, e1008671. https://doi.org/10.1371/journal.pcbi.1008671 (2021).
Bartlett, P. L., Long, P. M., Lugosi, G. & Tsigler, A. Benign overfitting in linear regression. Proc Natl Acad Sci U S A 117, 30063–30070. https://doi.org/10.1073/pnas.1907378117 (2020).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Sheridan, R. P., Wang, W. M., Liaw, A., Ma, J. & Gifford, E. M. Extreme gradient boosting as a method for quantitative structure-activity relationships. J. Chem. Inf. Model. 56, 2353–2360. https://doi.org/10.1021/acs.jcim.6b00591 (2016).
Article CAS PubMed Google Scholar
Munch, M. M., Peeters, C. F. W., Van Der Vaart, A. W. & Van De Wiel, M. A. Adaptive group-regularized logistic elastic net regression. Biostatistics 22, 723–737. https://doi.org/10.1093/biostatistics/kxz062 (2021).
Article MathSciNet PubMed Google Scholar
Dong, X., Yu, Z., Cao, W., Shi, Y. & Ma, Q. A survey on ensemble learning. Front. Comp. Sci. 14, 241–258. https://doi.org/10.1007/s11704-019-8208-z (2019).
Article Google Scholar
Jafarzadeh, H., Mahdianpari, M., Gill, E., Mohammadimanesh, F. & Homayouni, S. Bagging and boosting ensemble classifiers for classification of multispectral, hyperspectral and PolSAR data: A comparative evaluation. Remote Sens. 13. https://doi.org/10.3390/rs13214405 (2021).
Campos, R., Canuto, S., Salles, T., de Sá, C. C. A. & Gonçalves, M. A. in Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval 105–114 (2017).
Yang, L. et al. Study of cardiovascular disease prediction model based on random forest in eastern China. Sci. Rep. 10, 5245. https://doi.org/10.1038/s41598-020-62133-5 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Altman, N. & Krzywinski, M. Ensemble methods: Bagging and random forests. Nat. Methods 14, 933–934. https://doi.org/10.1038/nmeth.4438 (2017).
Article CAS Google Scholar
Chen, T. & Guestrin, C. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (2016).
Liu, K., Chen, W. & Lin, H. XG-PseU: an eXtreme Gradient Boosting based method for identifying pseudouridine sites. Mol. Genet. Genomics 295, 13–21. https://doi.org/10.1007/s00438-019-01600-9 (2020).
Article CAS PubMed Google Scholar
Budholiya, K., Shrivastava, S. K. & Sharma, V. An optimized XGBoost based diagnostic system for effective prediction of heart disease. J. King Saud Univ. Comput. Inf. Sci. 34, 4514–4523. https://doi.org/10.1016/j.jksuci.2020.10.013 (2022).
Article Google Scholar
Noble, W. S. What is a support vector machine?. Nat. Biotechnol. 24, 1565–1567. https://doi.org/10.1038/nbt1206-1565 (2006).
Article CAS PubMed Google Scholar
Ben-Hur, A. & Weston, J. A user’s guide to support vector machines. Methods Mol. Biol. 609, 223–239. https://doi.org/10.1007/978-1-60327-241-4_13 (2010).
Article CAS PubMed Google Scholar
Kafai, M. & Eshghi, K. CROification: Accurate kernel classification with the efficiency of sparse linear SVM. IEEE Trans. Pattern Anal. Mach. Intell. 41, 34–48. https://doi.org/10.1109/TPAMI.2017.2785313 (2019).
Article PubMed Google Scholar
Zhang, Z. Introduction to machine learning: k-nearest neighbors. Ann. Transl. Med. 4, 218. https://doi.org/10.21037/atm.2016.03.37 (2016).
Article PubMed PubMed Central Google Scholar
Uddin, S., Haque, I., Lu, H., Moni, M. A. & Gide, E. Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction. Sci. Rep. 12, 6256. https://doi.org/10.1038/s41598-022-10358-x (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Webb, M. P. K. & Sidebotham, D. Bayes’ formula: A powerful but counterintuitive tool for medical decision-making. BJA Educ. 20, 208–213. https://doi.org/10.1016/j.bjae.2020.03.002 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ahmed, M. S., Shahjaman, M., Rana, M. M. & Mollah, M. N. H. Robustification of naive bayes classifier and its application for microarray gene expression data analysis. Biomed. Res. Int. 2017, 3020627. https://doi.org/10.1155/2017/3020627 (2017).
Article PubMed PubMed Central Google Scholar
Wei, W., Visweswaran, S. & Cooper, G. F. The application of naive Bayes model averaging to predict Alzheimer’s disease from genome-wide data. J. Am. Med. Inform. Assoc. 18, 370–375. https://doi.org/10.1136/amiajnl-2011-000101 (2011).
Article PubMed PubMed Central Google Scholar
Nwanosike, E. M., Conway, B. R., Merchant, H. A. & Hasan, S. S. Potential applications and performance of machine learning techniques and algorithms in clinical practice: A systematic review. Int. J. Med. Inform. 159, 104679. https://doi.org/10.1016/j.ijmedinf.2021.104679 (2021).
Leder, S. B. & Espinosa, J. F. Aspiration risk after acute stroke: comparison of clinical examination and fiberoptic endoscopic evaluation of swallowing. Dysphagia 17, 214–218. https://doi.org/10.1007/s00455-002-0054-7 (2002).
Article PubMed Google Scholar
Broderick, J. P., Adeoye, O. & Elm, J. Evolution of the modified rankin scale and its use in future stroke trials. Stroke 48, 2007–2012. https://doi.org/10.1161/STROKEAHA.117.017866 (2017).
Article PubMed PubMed Central Google Scholar
Henke, C., Foerch, C. & Lapa, S. Early screening parameters for dysphagia in acute ischemic stroke. Cerebrovasc. Dis. 44, 285–290. https://doi.org/10.1159/000480123 (2017).
Article PubMed Google Scholar
Wang, B. J., Carter, F. L. & Altman, K. W. Relationship between Dysarthria and Oral-Oropharyngeal Dysphagia: The present evidence. Ear, Nose Throat J. https://doi.org/10.1177/0145561320951647 (2020).
Bahia, M. M., Mourão, L. F. & Chun, R. Y. S. Dysarthria as a predictor of dysphagia following stroke. NeuroRehabilitation 38, 155–162. https://doi.org/10.3233/nre-161305 (2016).
Article PubMed Google Scholar
Scheitz, J. F., Endres, M., Heuschmann, P. U., Audebert, H. J. & Nolte, C. H. Reduced risk of poststroke pneumonia in thrombolyzed stroke patients with continued statin treatment. Int. J. Stroke 10, 61–66. https://doi.org/10.1111/j.1747-4949.2012.00864.x (2015).
Article PubMed Google Scholar
Ridker, P. M. et al. Rosuvastatin to prevent vascular events in men and women with elevated C-reactive protein. N. Engl. J. Med. 359, 2195–2207. https://doi.org/10.1056/NEJMoa0807646 (2008).
Article CAS PubMed Google Scholar
Liu, C. H. et al. Initial blood pressure is associated with stroke severity and is predictive of admission cost and one-year outcome in different stroke subtypes: A SRICHS registry study. BMC Neurol 16, 27. https://doi.org/10.1186/s12883-016-0546-y (2016).
Article CAS PubMed PubMed Central Google Scholar
Appelros, P., Nydevik, I., Seiger, A. & Terent, A. Predictors of severe stroke: influence of preexisting dementia and cardiac disorders. Stroke 33, 2357–2362. https://doi.org/10.1161/01.str.0000030318.99727.fa (2002).
Article PubMed Google Scholar
Appelros, P., Nydevik, I. & Viitanen, M. Poor outcome after first-ever stroke: predictors for death, dependency, and recurrent stroke within the first year. Stroke 34, 122–126. https://doi.org/10.1161/01.str.0000047852.05842.3c (2003).
Article PubMed Google Scholar
Xu, Q. F., Ding, X. H., Jiang, C. X., Yu, K. M. & Shi, L. An elastic-net penalized expectile regression with applications. J. Appl. Stat. 48, 2205–2230. https://doi.org/10.1080/02664763.2020.1787355 (2021).
Article MathSciNet CAS PubMed MATH Google Scholar
Badillo, S. et al. An introduction to machine learning. Clin. Pharmacol. Ther. 107, 871–885. https://doi.org/10.1002/cpt.1796 (2020).
Article PubMed PubMed Central Google Scholar
Park, D. & Kim, I. Application of machine learning in the field of intraoperative neurophysiological monitoring: A narrative review. Appl. Sci. 12, 1. https://doi.org/10.3390/app12157943 (2022).
Article CAS Google Scholar

Download references

Acknowledgements

The authors appreciate our hospital’s occupational therapists, Mr. Seong Hun Son, Ms. Ye Ji Son, and Ms. Su Min Lim, for their support of this project.

Author information

Authors and Affiliations

Department of Medical Science and Engineering, School of Convergence Science and Technology, Pohang University of Science and Technology, Pohang, Republic of Korea
Dougho Park
Department of Rehabilitation Medicine, Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
Dougho Park & Sang-Eok Lee
Occupational Therapy Department of Rehabilitation Center, Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
Seok Il Son & Min Sol Kim
Speech-Language Therapy Department of Rehabilitation Center, Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
Tae Yeon Kim
Department of Quality Improvement, Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
Jun Hwa Choi
Department of Neurosurgery, Pohang Stroke and Spine Hospital, Pohang, Republic of Korea
Daeyoung Hong & Mun-Chul Kim

Authors

Dougho Park
View author publications
You can also search for this author in PubMed Google Scholar
Seok Il Son
View author publications
You can also search for this author in PubMed Google Scholar
Min Sol Kim
View author publications
You can also search for this author in PubMed Google Scholar
Tae Yeon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jun Hwa Choi
View author publications
You can also search for this author in PubMed Google Scholar
Sang-Eok Lee
View author publications
You can also search for this author in PubMed Google Scholar
Daeyoung Hong
View author publications
You can also search for this author in PubMed Google Scholar
Mun-Chul Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.P. designed the study. S.-E.L., S.I.S., M.S.K., T.Y.K., and J.H.C. and performed data acquisition and investigation. D.P. performed the first analysis and validation. D.P. wrote the first draft of the manuscript. D.P., D.H, and M.-C.K. wrote the revised version of the manuscript. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to Dougho Park.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Park, D., Son, S.I., Kim, M.S. et al. Machine learning predictive model for aspiration screening in hospitalized patients with acute stroke. Sci Rep 13, 7835 (2023). https://doi.org/10.1038/s41598-023-34999-8

Download citation

Received: 19 December 2022
Accepted: 11 May 2023
Published: 15 May 2023
DOI: https://doi.org/10.1038/s41598-023-34999-8

This article is cited by

Comprehensive Analysis of the SUMO-related Signature: Implication for Diagnosis, Prognosis, and Immune Therapeutic Approaches in Cervical Cancer
- Xing Zhang
- Jian Cao
- Shizhi Wang
Biochemical Genetics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Prehospital stroke-scale machine-learning model predicts the need for surgical intervention

A prehospital diagnostic algorithm for strokes using machine learning: a prospective observational study

Development of postoperative delirium prediction models in patients undergoing cardiovascular surgery using machine learning algorithms

Introduction

Methods

Study population and ethical statements

The rationale for selecting the potential predictors

Videofluoroscopic swallowing study and outcome definition

Statistical analysis

Machine learning

Data pre-processing and model establishing

Regularized logistic regressions

Ensemble algorithms

Other classic classifiers

Results

Baseline characteristics

Aspiration screening with the GUSS

Machine learning models

The stepwise logistic regression model

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comprehensive Analysis of the SUMO-related Signature: Implication for Diagnosis, Prognosis, and Immune Therapeutic Approaches in Cervical Cancer

Comments

Search

Quick links