Introduction

Dizziness and vertigo are common symptoms, affecting approximately 20–30% of the population1,2. Vertigo specifically refers to the perception of motion when no actual motion is occurring relative to the Earth's gravity3. Some experts in the field of dizziness differentiate vertigo, a symptom caused by disorders of the vestibular system, from general dizziness4. One prominent vestibular disorder is Benign Paroxysmal Positional Vertigo (BPPV), which accounts for a significant proportion of vertigo cases, being diagnosed in approximately 17–47% of patients5. BPPV is characterized by episodes of vertigo triggered by specific head movements, such as turning in bed, bending over, or looking up6. The underlying cause of BPPV is the displacement of tiny calcium carbonate crystals, known as otoliths, which normally sense gravity in the utricles, into the semicircular canals. This displacement can occur due to various factors, including illness or the natural aging process. BPPV can be further classified into two subtypes: canalithiasis and cupulolithiasis. In canalithiasis, the otoliths move freely within the semicircular canals, whereas in cupulolithiasis, the less common subtype, the otoliths adhere to the cupula7,8. Depending on the affected semicircular canal, BPPV is categorized as posterior canal BPPV, lateral canal BPPV, or anterior canal BPPV, with posterior canal BPPV being the most frequently observed subtype9.

BPPV is a multifactorial disease that can arise from various etiologies, although aging and certain illnesses are most commonly linked to it. Aging is a significant contributor, accounting for approximately 40% of BPPV cases4. Additionally, traumatic events, such as head injuries or accidents, have been associated with BPPV, contributing to around 20% of cases. Inflammation within the inner ear, resulting from conditions like vestibular neuritis or labyrinthitis, can also trigger BPPV in approximately 10–15% of individuals. Moreover, certain medical conditions and comorbidities have been implicated as potential risk factors. For instance, vitamin D deficiency has been identified as a possible contributing factor, with studies suggesting an association in about 25% of BPPV cases5. Other underlying factors, including Ménière's disease, vestibular migraine, and autoimmune disorders, may contribute to a smaller percentage of cases. It is important to note that while these percentages provide a general overview, the exact contribution of each etiology varies among individuals, and BPPV often arises from a combination of factors. Understanding the diverse etiologies of BPPV is crucial for effective diagnosis, management, and treatment strategies.

BPPV is diagnosed by an examiner changing the posture of the examinee, which elicits the vestibulo-ocular reflex. The method of diagnosis depends on the location of the misplaced otoliths. Posterior canal BPPV is diagnosed using the Dix–Hallpike maneuver, first introduced in 195210. In this maneuver, the examinee initially sits on the bed; the examiner then turns the examinee's head 45 degrees to the left or right and quickly lays the examinee down to provoke dizziness. In patients with positive results, torsional nystagmus occurs when the head is lowered. Lateral canal BPPV is diagnosed using the supine roll maneuver11: with the examinee in the supine position, the examiner quickly turns the examinee's head, and horizontal nystagmus occurs in positive patients.

Nystagmus is an involuntary periodic eye movement characterized by a slow phase, in which the eye drifts slowly in one direction, and a fast phase, in which it quickly returns to its original position12. BPPV nystagmus is caused by improper stimulation of the semicircular canal receptor hair cells as the head position changes13. Nystagmus occurs in various directions depending on the location of the misplaced otoliths and includes horizontal, vertical, and torsional nystagmus. The diagnostic methods used to observe nystagmus include electro-oculography, video-nystagmography, and scleral search coil technology14,15. Among them, video-nystagmography utilizing an infrared camera has been widely used in recent years because it is non-invasive and does not cause pain to the examinee16. However, video-nystagmography has limitations, such as lack of dimension, goggle slippage, and manual evaluation of data17. Therefore, only a specialist with professional knowledge and training in vertigo can accurately diagnose BPPV using video-nystagmography data obtained through positional tests. However, the number of specialists is insufficient to accommodate all vertigo patients, necessitating automated diagnosis of video-nystagmography data.

Owing to recent developments in image processing technology and machine learning, several studies have attempted to detect and diagnose nystagmus using video-nystagmography images. Lim et al.18 extracted the pupil trajectory and iris pattern from videos in which 10 types of tests were performed and obtained amplitudes in three directions to distinguish eight types of BPPV with a deep learning model. Slama et al.19 extracted pupil trajectories from caloric and kinetic test videos and derived various features to diagnose vestibular neuritis using support vector machines. Reinhardt et al.20 developed an algorithm that detects the eye in webcam images using a cascade classifier to obtain eye trajectories and determine when nystagmus occurs. Zhang et al.21 used a two-stage deep learning model that discarded invalid frames of video-nystagmography videos and detected when torsional nystagmus occurred. However, most of these studies used the Hough transform or classical machine learning to find the trajectory of nystagmus, with deep learning used only for diagnosis. Because the amplitude, speed, and direction of nystagmus are important in BPPV diagnosis, these methods are limited: the waveform of nystagmus must be tracked accurately to measure its characteristics.

Pupil detection for eye tracking and gaze estimation has been studied for many years, with growing attention focused on commercial eye trackers. Early histogram-based algorithms were developed22,23. Algorithms such as Starburst24, ExCuSe25, ElSe26, and PuRe27 apply ellipse fitting after detecting the edge of the pupil. However, these algorithms do not detect the pupil reliably in noisy environments. With the development of convolutional neural networks (CNNs), deep learning has recently been applied actively in computer vision. Accordingly, eye-tracking algorithms based on neural networks have emerged, such as PupilNet28, DeepEye29, and DeepVOG30. These approaches focus on finding the pupil location or segmenting the pupil itself with a CNN. EllSeg31 achieved higher performance than previous algorithms by segmenting the pupil and iris regions simultaneously with a CNN and approximating ellipses via representation maps. However, the datasets used for training are limited to commercially available eye-tracker images. Moreover, given the tendency of CNNs to overfit their training data, and because eye-tracker images and video-nystagmography images come from significantly different environments and conditions, models trained on one are not directly transferable to the other. Therefore, a CNN-based tracking algorithm must be trained on video-nystagmography data.

When performing video-nystagmography, slippage occurs because of the heavy weight of the device or quick changes in posture during the diagnosis32,33,34, causing motion artifacts in the pupil trajectory. As device slippage occurs frequently, significant efforts have been made to prevent slipping at the diagnostic stage35,36. To the best of our knowledge, however, removing motion artifacts from the perspective of signal processing has not been studied.

In this paper, a deep learning-based nystagmus extraction system optimized for video-nystagmography (ANyEye) is proposed as a first study toward automating BPPV diagnosis. ANyEye consists of two parts: eye tracking and slippage detection (Fig. 1). In the eye-tracking process, the pupil is segmented by a CNN model to track the exact pupil trajectory in noisy real-world data obtained during diagnosis, and a compensation algorithm corrects the position to obtain a more precise center. This process is optimized for dark-field video-nystagmography data, whose environment differs from that of typical open-space commercial eye-tracker data. In addition, a slippage detection algorithm based on a two-stage moving average was designed to remove motion artifacts caused by the slippage of video-nystagmography devices.

Figure 1. Overview of the ANyEye framework.

Methods

Data acquisition

Data resource

This study was approved by the Institutional Ethics Committee of Yonsei University Wonju College of Medicine, Wonju, Korea (No. CR319082), and informed consent was obtained from all subjects. All methods were performed in accordance with the principles of the Declaration of Helsinki. The dataset used in the experiment comprised video-nystagmography infrared videos of 46 posterior semicircular canal BPPV patients and nine lateral semicircular canal BPPV patients, acquired retrospectively at Yonsei University Wonju Severance Hospital. The video-nystagmography goggles (Easy-Eyes, SLMED, Seoul, Korea) weighed 330 g, recorded at a resolution of 640 × 480 px, and supported 30 frames per second. The screen consists of a section with infrared camera images showing the right and left eyes and a section showing the overall examination environment. Each patient video recorded the results of the spontaneous nystagmus test, Dix–Hallpike test, supine roll test, head-shaking test, and bow-and-lean test.

Positional test dataset

Two datasets were used for the experiments. The first comprised video-nystagmography videos with positional test labels from 48 posterior semicircular canal BPPV patients and four lateral semicircular canal BPPV patients. A total of 66 tests were conducted, including repeated tests on the same patient. For each test, approximately 10 s of the section in which nystagmus was to be checked was manually edited and indicated by otolaryngologists, yielding 165 video clips from the original data.

The ground truths of the pupil region for the deep learning model were generated by four researchers using a self-constructed labeling application. Annotators were required to specify the boundary of the pupil with at least ten points. These data points were then used to approximate an ellipse using the RANSAC algorithm37, and the approximated ellipse was displayed on the screen for confirmation. If the annotator determined that the displayed ellipse matched the pupil, the program automatically generated an image designating the inside of the ellipse as the pupil. Frames were selected using two rules: (1) the first 15 frames and (2) one frame at intervals of 20 frames. A total of 8284 frames were obtained, and the data were divided in a ratio of 7:1:2 into training, validation, and test sets for cross-validation (Table 1). Examples of the data and ground truths are shown in Fig. 2.
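To make the labeling step concrete, the following is a minimal sketch of RANSAC ellipse fitting over annotated boundary points, assuming OpenCV's cv2.fitEllipse as the per-sample estimator and an approximate radial residual as the inlier test; the labeling application's actual implementation37 may differ in detail.

```python
import numpy as np
import cv2

def ransac_ellipse(points, n_iter=200, tol_px=2.0, rng=None):
    """RANSAC ellipse fit to >= 10 annotated pupil-boundary points.

    cv2.fitEllipse needs at least 5 points, so 5 points are sampled per
    iteration and the candidate with the most inliers is kept.
    """
    rng = rng or np.random.default_rng(0)
    pts = np.asarray(points, dtype=np.float32)
    best, best_inliers = None, -1
    for _ in range(n_iter):
        sample = pts[rng.choice(len(pts), 5, replace=False)]
        try:
            (cx, cy), (a, b), theta = cv2.fitEllipse(sample)
        except cv2.error:
            continue  # degenerate sample
        if min(a, b) < 1e-3:
            continue
        t = np.deg2rad(theta)
        # Rotate points into the ellipse frame
        dx, dy = pts[:, 0] - cx, pts[:, 1] - cy
        u = dx * np.cos(t) + dy * np.sin(t)
        v = -dx * np.sin(t) + dy * np.cos(t)
        r = np.hypot(u / (a / 2), v / (b / 2))  # 1.0 exactly on the ellipse
        # Approximate pixel distance via the scaled radial residual
        inliers = np.sum(np.abs(r - 1.0) * min(a, b) / 2 < tol_px)
        if inliers > best_inliers:
            best, best_inliers = ((cx, cy), (a, b), theta), inliers
    return best
```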

Table 1. Dataset for cross-validation.
Figure 2. Examples of the acquired dataset.

Slippage dataset

The data used for evaluating the slippage detection algorithm comprised eight video-nystagmography infrared videos. The tests were performed by an otolaryngologist on eight patients with lateral semicircular canal BPPV. All patients showed geotropic nystagmus; seven of the eight subjects showed left-direction nystagmus, and one showed right-direction nystagmus. The acquired data were inspected to analyze slippage-induced motion artifacts, with the sections where slippage occurred indicated by two otolaryngologists.

Pupil segmentation

The proposed ANyEye algorithm first recognizes the pupil using a CNN-based approach. The backbone model was trained on the positional test dataset to adapt to the features of the video-nystagmography data. All inputs were resized to 256 × 256 pixels and normalized to a mean of 0.5 with a standard deviation of 0.5. In the training phase, random rotation in the range of −10° to 10°, random scaling with a ratio of 0.8–1.2, and random shifts with a ratio of 0–0.5 were applied to the input images. The total loss \({L}_{total}\) was calculated by adding the binary cross-entropy loss \({L}_{BCE}\) and the Dice loss \({L}_{dice}\) using the following equations:

$$L_{BCE} = - \frac{1}{N}\sum_{i = 1}^{N} \left( y_{i} \cdot \log x_{i} + \left( 1 - y_{i} \right) \cdot \log \left( 1 - x_{i} \right) \right)$$
(1)
$$L_{dice} = 1 - \frac{2\left| X \cap Y \right|}{\left| X \right| + \left| Y \right|}$$
(2)
$$L_{total} = L_{BCE} + L_{dice}$$
(3)

where \(X\) is the predicted output, \(Y\) is the ground truth, and \({x}_{i}\in X,{y}_{i}\in Y, i=1, \dots , N\). The Adam optimizer38 was used with a learning rate of 0.001 over 500 epochs with early stopping at a patience of 100. The best model was selected based on the validation-set loss. A batch size of 16 was selected considering the memory of our GPU (NVIDIA GeForce RTX 3090). The CNN models were implemented with the PyTorch framework39 in Python 3.6.8.
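For illustration, the combined loss of Eqs. (1)–(3) can be implemented as follows. This is a minimal PyTorch sketch assuming raw logits as network output and a binary ground-truth mask of the same shape; it is not the exact training code used here.

```python
import torch
import torch.nn as nn

class BCEDiceLoss(nn.Module):
    """L_total = L_BCE + L_dice, following Eqs. (1)-(3)."""

    def __init__(self, eps=1e-6):
        super().__init__()
        self.bce = nn.BCEWithLogitsLoss()  # numerically stable BCE on logits
        self.eps = eps

    def forward(self, logits, target):
        bce = self.bce(logits, target)
        prob = torch.sigmoid(logits)
        # Per-sample Dice over the channel and spatial dimensions
        inter = (prob * target).sum(dim=(1, 2, 3))
        denom = prob.sum(dim=(1, 2, 3)) + target.sum(dim=(1, 2, 3))
        dice_loss = 1.0 - (2.0 * inter + self.eps) / (denom + self.eps)
        return bce + dice_loss.mean()
```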

Compensation algorithm

In the output of the segmentation model, the pupil may not be fully estimated; therefore, additional steps are required. As the pupil is elliptical and its size is similar throughout one video, we designed an algorithm based on this idea (Fig. 3). The algorithm first compares the area of the pupil region estimated by the segmentation model with the areas in the previous i frames and determines whether the size of the selected region is within a certain range. If the area is similar to that in the previous frames, an ellipse is approximated from the selected region40, and the ellipse center is defined as the pupil location. Shadows at the edge of the frame have a brightness similar to that of the pupil and may confuse the segmentation model. To prevent such misjudgments, the algorithm stores past positions and compares the distance moved since the previous frame with the major axes of the ellipses from the previous j frames. If the distance is abnormally long, the frame is marked invalid and removed from the data. The pupil coordinates of the removed frames are estimated using linear interpolation. We empirically set both i and j to 5.
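A minimal sketch of one step of this compensation procedure is given below, assuming OpenCV for connected components and ellipse fitting; the area tolerance `area_tol` is an illustrative parameter, not a value from the paper.

```python
import numpy as np
import cv2

def compensate_frame(mask, history, i=5, j=5, area_tol=0.3):
    """Validate one segmentation output and return the pupil center.

    `mask` is the binary segmentation of the current frame; `history`
    holds dicts with the accepted area, center, and major axis of
    previous valid frames. Returns None for invalid frames, whose
    coordinates are later filled by linear interpolation.
    """
    n, labels, stats, _ = cv2.connectedComponentsWithStats(mask.astype(np.uint8))
    if n < 2:
        return None  # no pupil candidate found
    idx = 1 + int(np.argmax(stats[1:, cv2.CC_STAT_AREA]))
    area = stats[idx, cv2.CC_STAT_AREA]

    # Area check against the previous i frames
    recent = [h["area"] for h in history[-i:]]
    if recent and abs(area - np.mean(recent)) > area_tol * np.mean(recent):
        return None

    # Ellipse fit on the candidate region; its center is the pupil location
    pts = np.argwhere(labels == idx)[:, ::-1].astype(np.float32)  # (x, y)
    if len(pts) < 5:
        return None
    (cx, cy), (major, minor), _ = cv2.fitEllipse(pts)

    # Distance check against the major axes of the previous j frames
    if history:
        px, py = history[-1]["center"]
        if np.hypot(cx - px, cy - py) > max(h["major"] for h in history[-j:]):
            return None  # abnormally long jump, e.g., shadow misdetection

    history.append({"area": area, "center": (cx, cy), "major": major})
    return (cx, cy)
```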

Figure 3. Process followed by the tracking algorithm.

Slippage detection

Previous nystagmus detection algorithms based on conventional signal processing methods had difficulty separating motion artifacts from nystagmus patterns, resulting in significant detection errors. In ANyEye, an additional moving-average-based algorithm was constructed to eliminate motion artifacts caused by the movement of the goggles or eyes. The causes of noise include voluntary factors, such as movement due to the weight of the device and changes in the subject's gaze, and involuntary factors, such as the examiner adjusting the position of the device or its camera. Among these, noise that changes the position of the eye on the screen was defined as slippage. Two types of slippage occur in eye-movement signals: fast slippage and slow slippage. Fast slippage tends not to include nystagmus because the position changes quickly over a short time, whereas slow slippage often includes nystagmus because the position changes slowly over a long period. Slow slippage was mainly caused by the device sliding under its own weight while the subject was still, and fast slippage was mainly caused by shaking because the device was not fixed to the subject's head during rapid posture changes in the positional tests. To remove these two types of slippage, the algorithm generates two moving averages from the original signal, i.e., the short- and long-window moving averages, using the following equation:

$$MA(n) = \frac{1}{w + 1}\sum_{m = n - w/2}^{n + w/2} x(m), \quad w = window \cdot fps$$
(4)

where \(MA(n)\) is the moving-average signal, \(n\) is the frame index, \(x(n)\) is the eye-movement signal, \(window\) is the window length in seconds, and \(fps\) is the frame rate of the video. Slow slippage was removed by subtracting the long-window moving average from the original signal. Next, fast slippage was detected using the short-window moving average. After calculating the short-window moving averages of the x- and y-axis data, the velocity magnitude was calculated using the following equation:

$$v_{total} = \sqrt {v_{x}^{2} + v_{y}^{2} }$$
(5)

where \({v}_{x}\) and \({v}_{y}\) are the velocities along the x- and y-axes, respectively. A threshold was used to find the sections that moved faster than a specific speed. When the gap between selected sections was 0.7 s or less, the speed was judged to have slowed only momentarily during the slippage, and the sections were merged; sections of 0.3 s or less were judged not to be slippage and were excluded. The fast-slippage sections were then removed from the slow-slippage-removed signal using linear interpolation. A flowchart of the slippage detection algorithm is shown in Fig. 4.
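The two-stage procedure can be sketched as follows, assuming a pupil trajectory sampled at a constant frame rate; the velocity threshold `v_thresh` is an illustrative placeholder, since its value is not specified here.

```python
import numpy as np

def moving_average(sig, window_s, fps):
    """Centered moving average of Eq. (4) with w = window_s * fps."""
    w = int(window_s * fps)
    return np.convolve(sig, np.ones(w + 1) / (w + 1), mode="same")

def merge_and_filter(mask, fps, merge_s=0.7, min_s=0.3):
    """Merge detections separated by short gaps; drop very short ones."""
    idx = np.flatnonzero(mask)
    out = np.zeros_like(mask)
    if idx.size == 0:
        return out
    runs = np.split(idx, np.flatnonzero(np.diff(idx) > 1) + 1)
    merged = [runs[0]]
    for run in runs[1:]:
        if run[0] - merged[-1][-1] <= merge_s * fps:
            merged[-1] = np.arange(merged[-1][0], run[-1] + 1)  # bridge gap
        else:
            merged.append(run)
    for run in merged:
        if run.size > min_s * fps:  # keep only sufficiently long sections
            out[run[0]:run[-1] + 1] = True
    return out

def remove_slippage(x, y, fps=30, long_win=4.0, short_win=1.0, v_thresh=5.0):
    """Remove slow slippage, then detect and interpolate fast slippage."""
    # Stage 1: subtract the long-window moving average (slow slippage)
    x_s = x - moving_average(x, long_win, fps)
    y_s = y - moving_average(y, long_win, fps)

    # Stage 2: velocity magnitude (Eq. 5) on short-window-smoothed data
    vx = np.gradient(moving_average(x, short_win, fps))
    vy = np.gradient(moving_average(y, short_win, fps))
    fast = merge_and_filter(np.sqrt(vx ** 2 + vy ** 2) > v_thresh, fps)

    # Replace fast-slippage samples by linear interpolation
    t = np.arange(len(x))
    return (np.interp(t, t[~fast], x_s[~fast]),
            np.interp(t, t[~fast], y_s[~fast]))
```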

Figure 4. Process followed by the slippage detection algorithm.

Results

Segmentation performance

The test set outputs of the CNN models were compared to evaluate segmentation performance. Each backbone model was trained with the positional test dataset to adapt to the features of the video-nystagmography data. The architectures and specific configurations of the CNN networks used in this paper are detailed in the “Supplementary Materials” (Figure S1, Tables S1 and S2). Under the early stopping rule, the U-Net model at 95 epochs, the U-Net++ L2 model at 217 epochs, the U-Net++ L3 model at 231 epochs, and the U-Net++ L4 model at 269 epochs were chosen as the best models. The performance of the four pupil segmentation methods was evaluated using five metrics based on the ground truths, and the mean inference time for one batch was measured (Table 2). The Dice coefficient measures the similarity between two samples. The receiver operating characteristic (ROC) curve plots the true positive rate against the false positive rate, and the area under the ROC curve (AUROC) indicates the performance of a classifier.
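As an illustration, both metrics can be computed per frame from the predicted probability map and the binary ground truth; a minimal sketch using scikit-learn for the AUROC is shown below.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def dice_coefficient(pred_mask, gt_mask):
    """Dice = 2|X ∩ Y| / (|X| + |Y|) between two binary masks."""
    inter = np.logical_and(pred_mask, gt_mask).sum()
    return 2.0 * inter / (pred_mask.sum() + gt_mask.sum())

def pixel_auroc(prob_map, gt_mask):
    """AUROC over per-pixel probabilities vs. the binary ground truth."""
    return roc_auc_score(gt_mask.ravel().astype(int), prob_map.ravel())
```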

Table 2. Performance comparison of segmentation methods. Values highlighted in bold represent the best performance achieved.

Tracking evaluations

The detection rates of the conventional methods and the proposed method were compared on 34 test set videos. To evaluate the performance of the eye-tracking algorithm, the detection error \({\mathcal{E}}_{d}\) was defined as the L2 norm between the estimated pupil center \({p}_{i}\) and the ground truth \({q}_{i}\):

$${\mathcal{E}}_{d} \left( {p_{i} , q_{i} } \right) = \left\| {p_{i} - q_{i} } \right\|_{2}$$
(6)

The detection rate, i.e., the ratio of the number of samples with a detection error \({\mathcal{E}}_{d}\) below a threshold \({t}_{d}\) to the total number of samples, was then calculated. Detection rates are typically compared at \({t}_{d}\) = 5 pixels; thus, the algorithms are compared based on the detection rate at a 5-pixel error.
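A minimal sketch of this evaluation, assuming (N, 2) arrays of estimated and ground-truth pupil centers in pixels:

```python
import numpy as np

def detection_rate(pred, gt, t_d=5.0):
    """Fraction of frames whose detection error (Eq. 6) is below t_d px."""
    err = np.linalg.norm(pred - gt, axis=1)  # per-frame L2 norm
    return float(np.mean(err < t_d))
```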

Figure 5 shows the detection rates of ANyEye, EllSeg, PuRe, ElSe, and ExCuSe as the error threshold \({t}_{d}\) is varied from 0 to 10. The statistics of the detection errors and the detection rates up to an error of five pixels for ANyEye, EllSeg, PuRe, ElSe, and ExCuSe are shown in Table 3. ANyEye achieved the highest detection rate among the compared methods, with a five-pixel-error detection rate of 91.26%. Figure 6 shows the distribution of detection errors for the five eye-tracking methods, including ANyEye. Figure 7 shows the tracking results marked on examples from the test set. A sample input and output of the ANyEye framework are provided in Figure S2 in the “Supplementary Materials,” accompanied by a link to the Python code for evaluation.

Figure 5. Comparison of detection rates of EllSeg, PuRe, ElSe, ExCuSe, and ANyEye.

Table 3. Detection error analysis of eye-tracking methods on test set videos.
Figure 6. Strip plot of detection errors in the test set using EllSeg, PuRe, ElSe, ExCuSe, and ANyEye.

Figure 7. Examples of tracking algorithm results. The green circle indicates the ground truth of the input with a 5-px radius. Results of EllSeg and ANyEye are cross-marked in red and yellow, respectively.

Slippage detection results

The parameters of the slippage detection algorithm were selected empirically: 4 s for the long window and 1 s for the short window. Algorithm performance was verified by comparing the pupil trajectory signal with the output of the slippage detection algorithm (Fig. 8). In the algorithm output, the nystagmus waveform was maintained while motion artifacts due to gaze changes or slippage were removed. Next, the fast slippage detected by the algorithm was compared with the slippage indicated by the experts (Fig. 9).

Figure 8. Example of a slippage detection algorithm result. The gray region indicates fast slippage detected by the slippage detection algorithm.

Figure 9. Comparison between detected fast slippages and labeled slippages. The gray region in the slippage detection plot indicates fast slippage detected by the algorithm. The gray region in the labeled plot indicates slippages labeled by the experts.

Discussion

Nystagmus in BPPV can be observed in infrared video-nystagmography images. However, diagnosing BPPV requires expert knowledge: it is difficult for untrained non-experts because nystagmus appears with small amplitude and over a short period. In the future, deep learning-based or otherwise automated nystagmus analysis programs are expected to be developed to address the shortage of specialists and the rapidly increasing number of patients with dizziness. To analyze nystagmus accurately, the center of the pupil must be located accurately. However, several factors interfere with this task: the pupil may be hard to distinguish because of eyelashes or dark makeup, and the apparent pupil center may shift because of device slippage, making it difficult to automatically classify the nystagmus patterns appearing in video-nystagmography data. In this study, a CNN was introduced to segment the pupil and obtain the exact eye trajectory, with CNN shortcomings addressed through a compensation algorithm. In addition, a moving-average-based slippage detection algorithm was developed to remove motion artifacts caused by frequent slippage of the device during diagnosis.

The proposed algorithm, ANyEye, first estimates the shape of the pupil from video-nystagmography videos using a CNN-based segmentation model. Four segmentation models were tested: U-Net, U-Net++ L2, U-Net++ L3, and U-Net++ L4. The U-Net architecture was selected based on AUROC among the five metrics because robustness in the pupil segmentation process was the most important requirement for eye tracking.

A compensation algorithm was designed to determine the exact center of the pupil from the region estimated by the segmentation model. After selecting the connected component with the largest area in the segmentation result, its size and location were compared with those in previous frames to determine whether the frame was valid. As the pupil is oval, an ellipse-fitting algorithm was applied to find a more accurate center. The eye-tracking performance of ANyEye was evaluated by calculating the detection rate of the tracking results and compared with both CNN-based and iterative eye-tracking methods from related studies. The detection rates up to an error of five pixels of EllSeg, PuRe, ElSe, and ExCuSe were all lower than that of ANyEye, which reached 91.26%. The detection rate of PuRe was relatively high when the error threshold was below 1 px, but ANyEye significantly outperformed the other methods above that threshold. Analyzing the error distribution of each method on the test set, the three iterative eye-tracking algorithms had wide error distributions, while most results of the CNN-based algorithms had errors of less than 50 pixels. ANyEye also had the smallest standard deviation among the eye-tracking algorithms.

In this study, a slippage detection algorithm was applied to the pupil trajectory to remove slippage-induced motion artifacts in video-nystagmography. Because the nystagmus of lateral semicircular canal BPPV appears in the horizontal direction41, the x-axis signal was used to assess the performance of the algorithm. Experimental results revealed that slow slippage was removed, the nystagmus waveform was maintained, and fast-slippage-induced motion artifacts were removed. Thus, sections likely to be confused with nystagmus were effectively eliminated. The sections identified by the slippage detection algorithm included motion artifacts with large position changes and did not include sections where nystagmus occurred. Compared with the slippage sections marked by the experts, ANyEye selected a more precise range, excluding the sections of slow slippage that could include nystagmus.

Conclusion

In this study, we proposed ANyEye, a system comprising an eye-tracking algorithm and a moving-average-based slippage detection algorithm for automating BPPV diagnosis. ANyEye outperformed both learning-based and non-learning-based algorithms. Fast and slow slippage were found in the pupil trajectory data obtained from the video-nystagmography dataset, and the optimal parameters for removing both types of slippage were found and applied in the slippage detection algorithm.

As this study used only one video-nystagmography device, the generalizability of the results is limited; additional training using data from various devices is therefore needed. In addition, there is a risk that the slippage detection algorithm cannot effectively remove motion artifacts that are short relative to the algorithm's window parameters, and that it distorts nystagmus waveforms with long periods. Moreover, noise in the pupil trajectory generated by factors other than slippage, such as voluntary eye movements or body movements of the patient during positional changes, needs to be considered. To preserve nystagmus waveforms of various cycles and eliminate all types of noise, the characteristics of the nystagmus waveform and of the noise appearing in video-nystagmography devices must be analyzed in more detail, and adaptive algorithms that adjust to the situation need to be studied. Since our research has not yet been validated with external data, we plan to conduct external validation in future work. Additionally, the pupil trajectories obtained from video-nystagmography videos with the eye-tracking algorithm developed in this study will be used to classify the types of BPPV. Furthermore, the slippage detection algorithm is expected to minimize slippage-induced errors during the development of diagnostic assistance algorithms.