Application of machine learning in the diagnosis of vestibular disease

Anh, Do Tram; Takakura, Hiromasa; Asai, Masatsugu; Ueda, Naoko; Shojaku, Hideo

doi:10.1038/s41598-022-24979-9

Article
Open access
Published: 02 December 2022

Scientific Reports volume 12, Article number: 20805 (2022)

Abstract

Machine learning is considered a potential aid to support human decision making in disease prediction. In this study, we determined the utility of various machine learning algorithms in classifying peripheral vestibular (PV) and non-PV diseases based on the results of equilibrium function tests. A total of 1009 patients who had undergone our standardized neuro-otological examinations were recruited. We applied five supervised machine learning algorithms (random forest, adaboost, gradient boosting, support vector machine, and logistic regression). After preprocessing the data, optimizing the hyperparameters using GridSearchCV, and performing a final evaluation on the test set using scikit-learn, we evaluated the predictive capability using various performance metrics, namely, accuracy, F1-score, area under the receiver operating characteristic curve, precision, recall, and Matthews correlation coefficient (MCC). All five machine learning algorithms yielded satisfactory results; the accuracy of the algorithms ranged from 76 to 79%, with the support vector machine classifier having the highest accuracy. In cases where the predictions of the five models were consistent, the accuracy of the PV diagnostic results was improved to 83%, whereas it increased to 85% for the non-PV diagnostic results. Future research should increase the number of patients and optimize the classification methods to obtain the highest diagnostic accuracy.

Introduction

Dizziness and vertigo are symptoms that frequently induce patients to visit a physician. However, determining the cause of these symptoms is complicated because of the wide range of diseases with which they are associated. Peripheral vestibular (PV) system dysfunction is one of the most common etiologies of vertigo¹, causing conditions such as benign paroxysmal positional vertigo, vestibular neuritis, and Meniere’s disease². The diagnostic criteria for PV disease are mainly based on the patient’s history and a systematic clinical examination of the vestibular, oculomotor, and cerebellar systems according to the clinically oriented diagnostic criteria of the Bárány Society³.

In our department, various equilibrium examinations, including the caloric test and the optokinetic nystagmus test, have been used as diagnostic methods. In addition, imaging tests such as brain magnetic resonance imaging (MRI), brain computed tomography (CT), temporal bone CT, and cervical vascular sonography are performed as needed in almost all patients. Emergency patients with vertigo/dizziness are not included among the subjects who undergo the examinations mentioned above. Our equilibrium examinations are performed on outpatients who visit our department, including patients who are referred from other hospitals and patients who are admitted to our department due to vertigo/dizziness. We also examine some patients after admission to various other departments in our hospital, such as the emergency, neurology, and neurosurgery departments. These patients often have difficult medical conditions resulting from missed diagnoses, ineffective treatment, and long-term persistence of symptoms. Therefore, accurately diagnosing these patients and differentiating those with PV disease from the others are important tasks for us.

Machine learning (ML) has been developing rapidly in recent years and is being used in many aspects of medicine, especially radiology, robotic surgical systems, and disease diagnosis⁴,⁵,⁶. Since the 1980s, computer-based algorithms in medicine, called “expert systems”, have been used to simulate the steps and decision-making processes in the specific field of otolaryngology⁷,⁸. Recently, ML has been studied as a useful software method to assist medical decision making for vestibular dysfunction⁹,¹⁰,¹¹,¹²,¹³. These studies show that ML is becoming a potential solution to help physicians most effectively access and use large amounts of information to make accurate diagnoses. However, the patients we must diagnose in our daily practice vary in time from disease onset and in their symptom severity. The most important capability that we desire an ML system to have is the ability to distinguish between PV disease and non-PV disease in patients who are difficult to diagnose. This result influences the choice of both treatment and the next test to be performed.

The purpose of the present study was to evaluate the ML models created with various learning algorithms for binary classification between PV disease and non-PV disease using the datasets generated from our equilibrium examinations. Furthermore, we devised a method to optimize the performance of the ML model.

Materials and methods

This study was approved by the Ethics Committee of the Toyama University Hospital, Toyama, Japan (Approval Number: R2019003). All methods were performed in accordance with the relevant guidelines and regulations. In this study, informed consent was obtained by publishing an opt-out document on the website, based on the Ethical Guidelines for Medical and Health Research Involving Human Subjects established by the Ministry of Health, Labour and Welfare of Japan.

Subjects

The data of 1009 patients who underwent equilibrium examinations in our department (the Department of Otolaryngology, Head and Neck Surgery, University of Toyama) in the 10-year span from 2009 to 2019 were retrieved. The number of patients with PV disease was 497, and the number with non-PV disease was 512 (611 males and 398 females overall; mean age, 55.6 years). PV disease and non-PV disease were diagnosed according to the International Classification of Vestibular Disorders of the Bárány Society¹⁴ and the guidelines of the Japan Society for Equilibrium Research¹⁵ (Table 1). For example, small acoustic neuromas corresponding to Koos¹⁶ grade I or II were classified in the PV disease group. Patients who were confirmed to have unilateral PV dysfunction but could not be diagnosed as having an established clinical entity were classified in the PV disease group and considered to have an inner ear disorder. Patients who were evaluated to have normal PV function and in whom central nervous system disease was ruled out by neurological examinations and brain MRI/magnetic resonance angiography (MRA) or brain CT were classified in the non-PV disease group and considered to have dizziness syndromes of unclear etiology. However, even if brain MRI/MRA and brain CT did not show any abnormalities, patients who showed normal vestibular function but showed abnormalities in the optokinetic nystagmus test and eye tracking test were classified in the non-PV disease group and considered to have a central balance disorder. These patients often showed downbeat nystagmus, failure of fixation suppression, and abnormal eye movement. Although the cause of persistent postural-perceptual dizziness may be rooted in the PV system, these symptoms are thought to be modified by other factors. For this reason, patients with these symptoms were classified in the non-PV disease group.

Table 1 Demographic data and clinical diagnosis of patients.

All patients underwent our standardized neuro-otological examinations, listed as number (No.) 1 to No. 16 in Table 2. These 16 examinations yielded a total of 44 features, which could be divided into two types: continuous and categorical. Continuous features with numerical values were used as they were, and categorical features were coded as integers from 0 to 3. Examinations No. 1 to No. 14 were performed as routine examinations, and examinations No. 15 and No. 16 were added as needed. In the caloric test (No. 6), we injected air currents at 24 °C and 50 °C (6 L/min) into each ear canal for 60 s with the patient’s eyes closed. The maximal slow-phase velocity (MSPV) of the caloric nystagmus was recorded after each irrigation, and the canal paresis percentage (CP%) and directional preponderance percentage (DP%) were calculated from the MSPV according to Jongkees’ formula¹⁷. In our department, if the CP is ≥ 20%, the ear with the lower response is assumed to have unilateral vestibular hypofunction, indicating an abnormal caloric reflex. Bilateral vestibular hypofunction is defined as an MSPV of < 6°/s in each ear after caloric stimulation¹⁸. The failure of fixation suppression test (No. 7) started at 80 s after the beginning of the air current and continued for 10 s. The patient, with both eyes open, stared at an optotype¹⁹,²⁰. The pendular sinusoidal rotation test (No. 8) was performed with rotation of the chair at 0.1 Hz, amplitude 240°, maximum velocity of 75.4°/s, with the patient’s eyes closed²¹. In the eye tracking test (No. 9), the patient gazed at and pursued an optotype lamp (viewing angle 20 degrees, frequency 0.3 Hz) that moved left and right²². In the optokinetic nystagmus test (No. 10), 12 striations were projected onto a hemispherical drum. The striations began to rotate in the clockwise (CW) direction at 1°/s and accelerated until a velocity of 100°/s was reached. Next, the striations began to rotate in the counterclockwise (CCW) direction²³. Two neuro-otology specialists (M.A. and H.S.) certified by the Japan Society for Equilibrium Research evaluated the waveforms from electronystagmography (ENG) by visual inspection and diagnosed all ENG findings. Stabilometry (No. 11) was performed according to the Japanese standard²⁴. The Mann test (No. 12) was performed during tandem standing for 30 s with the eyes open and 30 s with the eyes closed, and then the positions of the front and back legs were reversed²⁵. In the Fukuda stepping test (No. 13), the patient stood upright with eyes closed and arms extended forward and took 50 steps²⁶,²⁷. In the Schellong test (No. 14), blood pressure was measured twice in a recumbent position and 3 additional times: immediately after standing and 5 and 10 min later²⁸. The galvanic body-sway test (GBST) (No. 15) evaluates the body-sway response induced by 0.2 mA and 0.4 mA electrical stimulation applied to the retroauricular area. Bipolar rectangular current stimulation lasting for 3 s was repeated 10 times, alternating between the left and right, as the patient stood on the stabilometer with his or her feet close together²⁹. The stimulus conditions of the cervical vestibular evoked myogenic potential (cVEMP) test (No. 16) were a click sound of 0.1 ms, a frequency of 5 Hz, and a sound pressure level of 105 dB. Two hundred reaction waveforms were summed³⁰.
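
The text cites Jongkees' formula for CP% and DP% without writing it out. The sketch below uses the commonly quoted form of that formula and applies the CP ≥ 20% rule described above. The function names, the sign conventions, and the MSPV values are assumptions made for illustration; the exact variant used in the study is not stated in the text.

```python
def jongkees(rw, rc, lw, lc):
    """Return (CP%, DP%) from four caloric MSPV magnitudes in deg/s.

    rw/rc: right ear, warm/cool stimulus; lw/lc: left ear, warm/cool stimulus.
    Sign conventions here are an assumption for this sketch:
    CP% > 0 means the left ear responded less than the right ear,
    DP% > 0 means right-beating nystagmus predominates.
    """
    total = rw + rc + lw + lc
    if total == 0:
        raise ValueError("no caloric response recorded in either ear")
    cp = ((rw + rc) - (lw + lc)) / total * 100.0
    # Warm stimulation beats toward the stimulated ear, cool stimulation away,
    # so right-beating responses are RW and LC.
    dp = ((rw + lc) - (lw + rc)) / total * 100.0
    return cp, dp


def weaker_ear(cp_percent, threshold=20.0):
    """Apply the CP >= 20% rule from the text; None means no asymmetry."""
    if abs(cp_percent) < threshold:
        return None
    return "left" if cp_percent > 0 else "right"


# Hypothetical MSPV values (deg/s), not taken from the study.
cp, dp = jongkees(rw=30.0, rc=20.0, lw=10.0, lc=10.0)
print(f"CP = {cp:.1f}%, DP = {dp:.1f}%, weaker ear: {weaker_ear(cp)}")
```

With these made-up inputs the right ear contributes 50 of the 70°/s total response, so the left ear is flagged as the hypofunctional side.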

Table 2 Equilibrium examinations and each feature’s name.

Steps in the machine learning classification method

In the present research, we used supervised ML to perform classification, which aims to predict the categories of new observations based on a training set of data whose categories are known³¹. The program was created on Google Colaboratory using Python version (v) 3.7.12, scikit-learn³² v1.0.2, NumPy v1.21.5, SciPy v1.4.1, Pandas v1.3.5, and Matplotlib v3.2.2. Five well-known algorithms, random forest (RF), adaboost (AB), gradient boosting (GB), support vector machine (SVM), and logistic regression (LR), were adopted. These algorithms have been used in a large number of treatises and specialized books based on an established theory³³,³⁴,³⁵,³⁶,³⁷,³⁸,³⁹. The steps in classification are as follows.

Import the data

From the results of the 1009 patients, we created a CSV data file consisting of 44 features and target categories (PV = 0, non-PV = 1). After the CSV data were imported into the program, they were preprocessed to ensure the accuracy of future predictions⁴⁰.

Split the data

The preprocessed dataset was randomly divided into 75% training data (n = 756) and 25% testing data (n = 253), as shown in Fig. 1. The randomness of splitting for training and testing data was controlled via the "random_state" parameter in scikit-learn.
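
A split of this kind can be sketched with scikit-learn's `train_test_split`. The feature matrix below is random synthetic data standing in for the 44-feature dataset, which is not public, and the text does not say whether the split was stratified by class, so none is applied here.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the study data: 1009 patients x 44 features.
rng = np.random.default_rng(0)
n_patients, n_features = 1009, 44
X = rng.normal(size=(n_patients, n_features))
y = rng.integers(0, 2, size=n_patients)  # 0 = PV, 1 = non-PV (random labels)

# 75% training / 25% testing; random_state fixes the shuffle so the
# same partition is obtained on every run.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)
print(X_train.shape, X_test.shape)
```

Changing `random_state` produces a different partition of the same sizes, which is how repeated runs with different splits can be generated.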

ML and predictions

ML was performed to create the best model using the training data. In the learning process, various parameters in the algorithm are automatically adjusted. However, some parameters need to be determined by a human to achieve the best prediction⁴¹. These variables, known as hyperparameters, can be set using GridSearchCV in scikit-learn³². By using GridSearchCV for each model, we could select the best hyperparameters and create the best model with each of the 5 algorithms. Thereafter, the best models were applied to the test data, as shown in Fig. 1, to create the final evaluation output. In the following description, the models obtained under the condition of random_state = 0 are presented as the best models. Furthermore, we performed 10 replicates of the whole process, from splitting the data to applying the new best model to the test data, by changing the random state, and calculated average values such as accuracy.
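
The tuning step can be sketched as a loop over the five algorithms with one `GridSearchCV` per model. Everything specific below is an assumption for illustration: the synthetic data, the small hyperparameter grids, the 5-fold cross-validation, and the standardization step placed in front of SVM and LR. The text does not report the grids or preprocessing actually used.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (AdaBoostClassifier, GradientBoostingClassifier,
                              RandomForestClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Small synthetic binary problem standing in for the clinical dataset.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Estimator plus an illustrative (hypothetical) hyperparameter grid per model.
candidates = {
    "RF": (RandomForestClassifier(random_state=0),
           {"n_estimators": [50, 100], "max_depth": [3, None]}),
    "AB": (AdaBoostClassifier(random_state=0),
           {"n_estimators": [25, 50]}),
    "GB": (GradientBoostingClassifier(random_state=0),
           {"learning_rate": [0.05, 0.1]}),
    "SVM": (make_pipeline(StandardScaler(), SVC()),
            {"svc__C": [0.1, 1, 10]}),
    "LR": (make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
           {"logisticregression__C": [0.1, 1, 10]}),
}

best_models = {}
for name, (estimator, grid) in candidates.items():
    # Cross-validated search on the training data only; the test set is
    # touched once, for the final score.
    search = GridSearchCV(estimator, grid, cv=5, scoring="accuracy")
    search.fit(X_train, y_train)
    best_models[name] = search.best_estimator_
    test_acc = search.score(X_test, y_test)
    print(name, search.best_params_, f"test accuracy = {test_acc:.2f}")
```

Wrapping this block in an outer loop over several `random_state` values and averaging the test scores would correspond to the repeated-split procedure described above.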

Test measures

In the binary classification, one of the two predicted groups was called the negative group (N), and the other was called the positive group (P). We defined PV disease as (N) and non-PV disease as (P). The confusion matrix is commonly used to evaluate the diagnostic ability of classifiers. In Table 3, the basic framework of the confusion matrix³² displays the number of predictions by each model in each of four categories: TP (true positive), FP (false positive), FN (false negative), and TN (true negative).

Table 3 Basic framework of the confusion matrix.

The six test measures used for evaluating the predictive performance of ML are as follows: accuracy, precision, recall (also known as sensitivity), area under the receiver operating characteristic curve (AUC-ROC), F1-score, and Matthews correlation coefficient (MCC). The first five measures are displayed as numerical values ranging from 0 to 1, whereas MCC is displayed as numerical values from −1 to 1. The greater each value is, the higher the predictive performance.
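
For reference, the measures listed above can be written out from their standard definitions. The sketch below computes accuracy, precision, recall, F1-score, and MCC from the four confusion-matrix counts, and AUC-ROC from classifier scores as the probability that a randomly chosen positive case is scored above a randomly chosen negative one. The counts and scores are toy values, not results from the study, and the functions assume that no denominator is zero.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Five confusion-matrix measures (assumes no zero denominators)."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # also called sensitivity
    f1 = 2 * precision * recall / (precision + recall)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "mcc": mcc}


def auc_roc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    wins = sum((p > n) + 0.5 * (p == n)
               for p in scores_pos for n in scores_neg)
    return wins / (len(scores_pos) * len(scores_neg))


# Toy counts and scores for illustration only.
metrics = binary_metrics(tp=40, fp=10, fn=10, tn=40)
auc = auc_roc(scores_pos=[0.9, 0.8, 0.4], scores_neg=[0.7, 0.3, 0.4])
print({k: round(v, 3) for k, v in metrics.items()}, round(auc, 3))
```

Unlike the other four confusion-matrix measures, MCC uses all four cells symmetrically, which is why it can fall below zero when predictions are worse than chance.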

Statistical analysis

The Mann‒Whitney U test was used for statistical evaluation of age, precision, recall, and F1-score between PV and non-PV. The Chi-square test was used for statistical evaluation of gender proportion. BellCurve for Excel v3.21 (Social Survey Research Information Co., Ltd., Japan) was used for the analysis, and P < 0.05 was considered statistically significant.

Results

We created five models of classifiers for binary classification using a training set (n = 756) and applied them to a test set (n = 253), which was composed of PV (n = 123) and non-PV (n = 130) data. The predictive performance of the best models and the average data after ten different iterations are summarized for each of the five algorithms in Tables 4 and 5 with the six evaluation measures. Among the five models, the best single-trial performance metrics were as follows: 79% for SVM in accuracy, 0.87 for LR in AUC-ROC, and 0.57 for SVM in MCC (Table 4). By contrast, when the results were averaged across ten iterations, LR was the top performer on all three metrics, with an accuracy of 77%, an AUC-ROC of 0.85, and an MCC of 0.54 (Table 5). Although LR and SVM showed better predictive performance than the other models, there was no remarkable difference among the five models. The AUC-ROC is one of the most commonly used metrics in evaluating the performance of binary classifiers. Based on a comparison of ROC curves among the five ML models, as shown in Fig. 2, all five algorithms achieved high AUC-ROC values from 0.85 to 0.87. All models were similar not only in their high AUC-ROC values but also in the shape of their ROC curves, indicating that all of the classifiers yielded consistently good results.

Table 4 The best performance of different machine learning algorithms.

Table 5 The average performance of different machine learning algorithms after ten iterations.

Table 4 also shows the best model results by diagnostic category. The best precision values were 0.78 by SVM for PV and 0.80 by SVM and LR for non-PV. The best recall values were 0.80 by LR for PV and 0.78 by GB and SVM for non-PV. The best F1-scores were 0.78 by SVM and LR for PV and 0.79 by SVM for non-PV. Setting aside the differences between individual models, the average precision across the five models was higher for non-PV than for PV, at 0.79 and 0.76, respectively (P < 0.05). The average recall was 0.78 for PV and 0.76 for non-PV (no significant difference). The average F1-scores were 0.77 for PV and 0.78 for non-PV (no significant difference). In contrast, when the average results of each model were calculated after ten different iterations (Table 5), LR performed best on all metrics: precision (0.78 for PV and 0.76 for non-PV), recall (0.75 for PV and 0.79 for non-PV), and F1-score (0.76 for PV and 0.77 for non-PV). Although these values were slightly lower than the best model results, no significant difference was observed.

An index showing how much each of the features contributed to the prediction of a given model can be obtained from the “feature_importances_” attribute in scikit-learn. The “feature_importances_” ranking indicates which features may be most relevant or least relevant to the research objective. The RF method is the most common method in feature importance selection and rankings⁴². In our research, the feature importance of RF, AB, and GB was ranked based on the selected frequency of a variable as a decision node of decision trees. We used all of these classifiers to rank the importance of variables according to their discriminative performance. Out of the forty-four features, the importance of each of the top ten selected features is presented in rank order in Fig. 3. Each feature was scored with a numerical value ranging between 0 and 1, where 0 means “not used at all” and 1 means “perfectly predicts the target”; the higher the value, the more important the variable was. Among the features for evaluating vestibular function, the features of the caloric test (Caloric_CP, Caloric_CP%) were ranked highest in all three models. This confirms that CP in the caloric test is a parameter that plays an important role in classifying PV versus non-PV disease. As for the features related to the stabilometry test, the Romberg ratio of sway length (Romberg_Length) was included in the top 10 features in the AB model, ranking as high as Caloric_CP. Other features, such as Envelop Area_Op, Envelop Area_Cl, Sway Length_Op, Sway Length_Cl, and Romberg_Area, were also present in the top rankings for the three models. Among the features for assessing cerebellar and brainstem function, two features of the optokinetic nystagmus test (OKN_CW, OKN_CCW) were included in the RF model. The features of the eye tracking test (ETT), Schellong test (Schellong), and pendular sinusoidal rotation test (PSRT_R, PSRT_L) were also present in the top rankings of the three models.
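
A top-ten ranking of this kind can be sketched as follows for the RF case. The data are synthetic and the feature names are placeholders, not the study's 44 features. In scikit-learn's tree ensembles the attribute is spelled `feature_importances_` (with a trailing underscore) and, per the library documentation, holds impurity-based importances normalized to sum to 1.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic 44-feature problem; names are hypothetical placeholders.
X, y = make_classification(
    n_samples=500, n_features=44, n_informative=8, random_state=0
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]

model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)
importances = model.feature_importances_

# Indices of the ten largest importances, most important first.
top10 = np.argsort(importances)[::-1][:10]
for rank, idx in enumerate(top10, start=1):
    print(f"{rank:2d}. {feature_names[idx]:<12} {importances[idx]:.3f}")
```

The same attribute is available on fitted `AdaBoostClassifier` and `GradientBoostingClassifier` objects, so the loop can be repeated per model to compare rankings.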

All five models in Table 4 were applied to evaluate the accuracy of PV versus non-PV classification in the 25% of data (253 cases) that formed the test set (Fig. 4). The numbers of models with matching predictions are shown in the six columns, which are labeled PV 0 to PV 5 and non-PV 0 to non-PV 5. In the graph, PV 5/non-PV 0 means that all five classifiers predicted PV and no model predicted non-PV. PV 0/non-PV 5 means that all five models predicted non-PV and no model predicted PV. The other labels indicate that the predictions were different depending on the model. In the first column, among the 104 patients predicted to have PV by all five models, 86 patients truly had PV and 18 patients had non-PV, which is equivalent to 83% accuracy. Similarly, in the last column, among the 100 patients predicted to have non-PV by all five models, 85 patients truly had non-PV and 15 patients had PV, which is equivalent to 85% accuracy. These percentages were higher than the accuracy of the models individually (Table 4).
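
The agreement analysis can be expressed as a short routine: keep only the cases where every model predicts the same class, then compute the fraction of those cases that are correct, separately for each predicted class. The function name and the toy predictions below are illustrative assumptions; only the final two ratios use counts reported in the text.

```python
def unanimous_accuracy(predictions, truth):
    """Accuracy among cases where all models agree, per predicted label.

    predictions: one list of labels per model (0 = PV, 1 = non-PV).
    truth: list of true labels in the same patient order.
    Returns {label: (n_unanimous_cases, accuracy or None)}.
    """
    stats = {0: [0, 0], 1: [0, 0]}  # label -> [n_unanimous, n_correct]
    for i, true_label in enumerate(truth):
        votes = {model_preds[i] for model_preds in predictions}
        if len(votes) == 1:  # every model predicted the same class
            label = votes.pop()
            stats[label][0] += 1
            stats[label][1] += int(label == true_label)
    return {label: (n, correct / n if n else None)
            for label, (n, correct) in stats.items()}


# Toy example: five models, six patients (made-up labels).
truth = [0, 0, 1, 1, 0, 1]
predictions = [
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1, 1],
    [0, 0, 1, 0, 1, 1],
    [0, 0, 1, 1, 1, 1],
    [0, 1, 1, 1, 1, 1],
]
result = unanimous_accuracy(predictions, truth)
print(result)

# Arithmetic check of the percentages reported in the text.
pv_rate = 86 / 104      # unanimous PV predictions that were truly PV
non_pv_rate = 85 / 100  # unanimous non-PV predictions that were truly non-PV
print(round(pv_rate * 100), round(non_pv_rate * 100))
```

Note that these rates apply only to the unanimous subset (204 of the 253 test cases in the text), so they are not directly comparable to whole-test-set accuracy.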

Discussion

In the present study, we evaluated the ability of ML models created from five algorithms to discriminate between PV disease and non-PV disease. These five algorithms (RF, AB, GB, SVM, and LR) are commonly used, and the results suggest their potential for supporting the diagnosis of vestibular disease. Furthermore, our approach of combining all five ML classifier models was expected to improve on the prediction performance of each model individually.

All five models presented relatively good results after tuning the algorithms and choosing the best parameters using GridSearchCV. Among those, SVM among the five best models and LR in the averaged results seemed to be superior to the other models, with accuracy values of 79% and 77%, respectively. Joutsijoki et al.⁴³ applied thirteen classification methods to oto-neurological disease classification and showed that the half-against-half (HAH) architecture with SVM achieved the best accuracy of 76.9% compared to the other classification methods, including LR, with an accuracy only slightly above 60%. Masankaran et al.⁴⁴ used four classifier models (RF, SVM, k-nearest neighbor, and naïve Bayes) with the Dizziness Handicap Inventory questionnaire to distinguish benign paroxysmal positional vertigo types with a best accuracy of 73.91%. Priesol et al.¹¹ applied five classifier models (decision tree [DT], RF, LR, AB, and SVM) and reported an overall accuracy of 76%. Compared to these reports, the performance of our best classifier had a higher accuracy of 79%. To further improve performance, we devised a method by combining all five models in the prediction data (Fig. 4). As a result, when the predictions of the five models matched in PV, the correct answer rate was 83%, and when they matched in non-PV, the correct answer rate was 85%. This result was superior to the accuracy of SVM and LR individually. However, when PV and non-PV predictions were presented simultaneously, the accuracies of SVM and LR were superior. Therefore, the combination of SVM and LR together with our new ML approach has the potential to diagnose PV disease and distinguish it from non-PV disease.

For otolaryngologists, it is important to reliably detect PV disease in patients with chaotic symptoms of vertigo/dizziness. However, the non-PV group included various diseases of cerebral etiology, such as brain tumor, brain infarction, spinocerebellar degeneration, vertebrobasilar insufficiency, and others for which a delayed diagnosis might lead to life-threatening consequences. Thus, ML should have a high predictive ability not only for PV diseases but also for non-PV diseases. This balance of predictive performance can be evaluated using precision, recall, and the F1-score. The F1-score is a measure that can comprehensively evaluate precision and recall. As shown in the best model results in Table 4, the precision average of the five models was better in non-PV than in PV. However, the F1-score averages of the five models were 0.77 for PV and 0.78 for non-PV. This result means that our models function well for both groups. Furthermore, the F1-scores of SVM were the best, with 0.78 for PV and 0.79 for non-PV. Thus, SVM appears to be a useful classifier for discriminating between the two disease groups.

Our dataset was established based on the clinical data of patients who were diagnosed by our 16 different types of equilibrium function tests, whereas previous studies usually used the most commonly performed vestibular tests, such as the caloric test and vestibulo-ocular reflex derived from the rotation test¹¹, or used head impulse, gaze-evoked nystagmus, or a test of skew for differentiation of vestibular stroke and peripheral acute vestibular syndrome⁹. In Fig. 3, features related to the caloric test were the most important features, but the optokinetic nystagmus test, eye tracking test, Schellong test, pendular sinusoidal rotation test, and stabilometry also ranked in the top 10. Thus, the combination of multiple kinds of equilibrium examinations might help to increase the variety of features and improve the quality of the training dataset for ML. However, not all features in our dataset have equal importance. Determining which features yield the most predictive power is another crucial step in the model-building process.

This study has some important limitations, including the characteristics and size of the dataset and optimization of the models. In this study, ML was used to classify PV disease and non-PV disease, which include a wide range of diseases. Further studies using synthetic models in the classification of PV disease and a particular disease are needed to improve the diagnostic ability of ML. In addition, the number of study subjects was relatively modest, and other ML algorithms using advanced analytics techniques will be necessary to enhance the results. Moreover, the proportion of males and females was not taken into account when forming the training and test sets. Also, since we were unable to perform external validation, whether our ML model for the diagnosis of vestibular diseases is reproducible and generalizable has not been assessed. These issues should be considered in future studies. Furthermore, an extensive testing battery such as the one presented here is not suited to clinical decision making for acutely dizzy patients in an emergency setting or in an outpatient center without examination equipment. Finally, even though ML can assist in making good predictions, it does not completely replace the physician. Especially with some diseases, which require patient‒physician interaction and critical thinking, the physician needs to make the final diagnosis.

Conclusion

Diagnosis in neuro-otology is mainly deductive and based on the results of various vestibular function tests, which are difficult for otolaryngologists because of the experience requirements and the time-consuming nature of the tests. The current algorithm shows the effectiveness of using five ML models as an adjunct to distinguish between PV and non-PV diseases. The adoption of ML algorithms in clinical practice might free up physician time and enhance the accuracy and efficiency of the diagnosis and treatment of patients with PV disease.

Data availability

The datasets used and/or analyzed during the current study cannot be shared publicly so as to maintain the privacy of the individuals who participated in the study. The data will be shared on reasonable request to the corresponding author.

References

Labuguen, R. H. Initial evaluation of vertigo. Am. Fam. Physician 73, 244–251 (2006).
PubMed Google Scholar
Stern, S. D. C., Cifu, A. S. & Altkorn, D. Dizziness. In Symptom to Diagnosis: An Evidence-Based Guide 3rd edn (ed. Stern, S. D. C.) (McGraw-Hill Education, 2014).
Google Scholar
Strupp, M., Feil, K. & Zwergal, A. Diagnosis and differential diagnosis of peripheral and central vestibular disorders. Laryngorhinootologie 100, 176–183 (2021).
PubMed Google Scholar
Mayo, R. C. & Leung, J. Artificial intelligence and deep learning: Radiology’s next frontier?. Clin. Imaging 49, 87–88 (2018).
Article PubMed Google Scholar
Egert, M., Steward, J. E. & Sundaram, C. P. Machine learning and artificial intelligence in surgical fields. Indian J. Surg. Oncol. 11, 573–577 (2020).
Article PubMed PubMed Central Google Scholar
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380, 1347–1358 (2019).
Article PubMed Google Scholar
Gavilán, C., Gallego, J. & Gavilán, J. “Carnisel”: An expert system for vestibular diagnosis. Acta Otolaryngol. 110, 161–167 (1990).
Article PubMed Google Scholar
Viikki, K., Kentala, E., Juhola, M. & Pyykkö, I. Decision tree induction in the diagnosis of otoneurological diseases. Med. Inform. Internet Med. 24, 277–289 (1999).
Article CAS PubMed Google Scholar
Ahmadi, S. A. et al. Modern machine-learning can support diagnostic differentiation of central and peripheral acute vestibular disorders. J. Neurol. 267, 143–152 (2020).
Article PubMed PubMed Central Google Scholar
Kamogashira, T. et al. Prediction of vestibular dysfunction by applying machine learning algorithms to postural instability. Front. Neurol. 11, 5–12 (2020).
Article Google Scholar
Priesol, A. J., Cao, M., Brodley, C. E. & Lewis, R. F. Clinical vestibular testing assessed with machine-learning algorithms. JAMA Otolaryngol. Head Neck Surg. 141, 364–372 (2015).
Article PubMed Google Scholar
Juhola, M. On machine learning classification of otoneurological data. Stud. Health Technol. Inform. 136, 211–216 (2008).
PubMed Google Scholar
Walther, L. E. et al. Die Anwendung künstlicher neuronaler netze bei der auswertung posturografischer messungen [the use of artificial neural networks in evaluation of posturographic data]. Interpretation J. Bible Theol. 1, 211–217 (2011).
Google Scholar
Bisdorff, A., von Brevern, M., Lempert, T. & Newman-Toker, D. E. Classification of vestibular symptoms: Towards an international classification of vestibular disorders. J. Vestib. Res. 19, 1–13 (2009).
Article PubMed Google Scholar
Japan Society for Equilibrium Research. https://www.memai.jp/guide/.
Erickson, N. J. et al. Koos classification of vestibular schwannomas: A reliability study. Neurosurgery 85, 409–414 (2019).
Article PubMed Google Scholar
Jongkees, L. B. W., Maas, J. P. M. & Philipszoon, A. J. Clinical nystagmography: A detailed study of electro-nystagmography in 341 patients with vertigo. Pract. Otorhinolaryngol. 24, 65–93 (1962).
CAS Google Scholar
Strupp, M. et al. Bilateral vestibulopathy: Diagnostic criteria consensus document of the Classification Committee of the Bárány Society. J. Vestib. Res. 27, 177–189 (2017).
Article PubMed PubMed Central Google Scholar
Kato, I. et al. Caloric pattern test with special reference to failure of fixation-suppression. Acta Otolaryngol. 88, 97–104 (1979).
Article CAS PubMed Google Scholar
Kato, I., Nakamura, T., Koike, Y. & Watanabe, Y. Computer analysis of fixation-suppression of caloric nystagmus. ORL J. Otorhinolaryngol. Relat. Spec. 44, 277–287 (1982).
Article CAS PubMed Google Scholar
Mizukoshi, K., Kobayashi, H., Ohashi, N. & Watanabe, Y. Quantitative analysis of the human visual vestibulo-ocular reflex in sinusoidal rotation. Acta Otolaryngol. Suppl. 393, 58–64 (1983).
Article CAS PubMed Google Scholar
Ohashi, N., Watanabe, Y., Kobayashi, H. & Mizukoshi, K. Quantitative comparison between saccadic and ataxic pursuits. Acta Otolaryngol. 101, 200–206 (1986).
Article CAS PubMed Google Scholar
Watanabe, Y., Ohashi, N., Ohmura, A., Itoh, M. & Mizukoshi, K. Gain of slow-phase velocity of optokinetic nystagmus. Auris Nasus Larynx 13, S63–S68 (1986).
Article PubMed Google Scholar
Yamamoto, M. et al. Japanese standard for clinical stabilometry assessment: Current status and future directions. Auris Nasus Larynx 45, 201–206 (2018).
Article PubMed Google Scholar
Ito, S., Odahara, S., Hiraki, M. & Idate, M. Evaluation of imbalance of the vestibulo-spinal reflex by “the circular walking test”. Acta Otolaryngol. Suppl. 115, 124–126 (1995).
Article Google Scholar
Fukuda, T. The stepping test: Two phases of the labyrinthine reflex. Acta Otolaryngol. 50, 95–108 (1959).
Cohen, H. S. A review on screening tests for vestibular disorders. J. Neurophysiol. 122, 81–92 (2019).
Fanciulli, A., Campese, N. & Wenning, G. K. The Schellong test: Detecting orthostatic blood pressure and heart rate changes in German-speaking countries. Clin. Auton. Res. 29, 363–366 (2019).
Watanabe, Y. et al. Retro-labyrinthine disorders detected by galvanic body sway responses in routine equilibrium examinations. Acta Otolaryngol. Suppl. 108, 343–348 (1989).
Shojaku, H., Takemori, S. & Watanabe, Y. Vestibular evoked myogenic potentials. Equilib. Res. 59, 186–192 (2000).
Handelman, G. S. et al. eDoctor: Machine learning and the future of medicine. J. Intern. Med. 284, 603–619 (2018).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).
Hu, W., Hu, W. & Maybank, S. AdaBoost-based algorithm for network intrusion detection. IEEE Trans. Syst. Man. Cybern. B 38, 577–583 (2008).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Yu, W., Liu, T., Valdez, R., Gwinn, M. & Khoury, M. J. Application of support vector machine modeling for prediction of common diseases: The case of diabetes and pre-diabetes. BMC Med. Inform. Decis. Mak. 10, 16 (2010).
Hosmer, D. & Lemeshow, S. Applied Logistic Regression 3rd edn. (Wiley, 2004).
Colombet, I., Jaulent, M. C., Degoulet, P. & Chatellier, G. Logistic regression model: An assessment of variability of predictions. Stud. Health Technol. Inform. 84, 1314–1318 (2001).
Ngiam, K. Y. & Khor, I. W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 20, e262–e273 (2019).
Müller, A. C. & Guido, S. Model evaluation and improvement. In Introduction to Machine Learning with Python 1st edn (ed. Müller, A. C.) 262–263 (O’Reilly Media, 2017).
Genuer, R., Poggi, J. M. & Tuleau-Malot, C. Variable selection using random forests. Pattern Recognit. Lett. 31, 2225–2236 (2010).
Joutsijoki, H., Varpa, K., Iltanen, K. & Juhola, M. Machine learning approach to an otoneurological classification problem. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2013(2013), 1294–1297 (2013).
Masankaran, L., Viyanon, W. & Mahasittiwat, V. Classification of benign paroxysmal positioning vertigo types from Dizziness Handicap Inventory using machine learning techniques. in 2018 International Conference on Intelligent Informatics and Biomedical Sciences, ICIIBMS, 209–214 (2018).

Acknowledgements

This work was supported by a grant from the Ministry of Health, Labor and Welfare of Japan (20FC1048). We express our sincere appreciation to all members of the Department of Otolaryngology, University of Toyama, for their assistance and cooperation.

Author information

These authors contributed equally: Do Tram Anh and Hiromasa Takakura.

Authors and Affiliations

Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine, Academic Assembly, University of Toyama, 2630 Sugitani, Toyama City, Toyama Prefecture, 930-0194, Japan
Do Tram Anh, Hiromasa Takakura, Masatsugu Asai, Naoko Ueda & Hideo Shojaku

Contributions

D.T.A.: study design, patient recruitment, data acquisition, analysis, and manuscript writing. H.T.: study design, manuscript revision, and supervision. M.A.: study design, writing machine learning programs, and manuscript revision. N.U.: patient recruitment. H.S.: grant application, study design, and manuscript revision. All authors contributed to the article and read and approved the submitted version.

Corresponding author

Correspondence to Masatsugu Asai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Anh, D.T., Takakura, H., Asai, M. et al. Application of machine learning in the diagnosis of vestibular disease. Sci Rep 12, 20805 (2022). https://doi.org/10.1038/s41598-022-24979-9

Received: 04 June 2022
Accepted: 23 November 2022
Published: 02 December 2022
DOI: https://doi.org/10.1038/s41598-022-24979-9
