Driver drowsiness estimation using EEG signals with a dynamical encoder–decoder modeling framework

Arefnezhad, Sadegh; Hamet, James; Eichberger, Arno; Frühwirth, Matthias; Ischebeck, Anja; Koglbauer, Ioana Victoria; Moser, Maximilian; Yousefi, Ali

doi:10.1038/s41598-022-05810-x

Download PDF

Article
Open access
Published: 16 February 2022

Driver drowsiness estimation using EEG signals with a dynamical encoder–decoder modeling framework

Sadegh Arefnezhad¹,
James Hamet²^nAff7,
Arno Eichberger¹,
Matthias Frühwirth³,
Anja Ischebeck⁴,
Ioana Victoria Koglbauer⁵,
Maximilian Moser^3,8 &
…
Ali Yousefi^2,6

Scientific Reports volume 12, Article number: 2650 (2022) Cite this article

10k Accesses
27 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Drowsiness is a leading cause of accidents on the road as it negatively affects the driver’s ability to safely operate a vehicle. Neural activity recorded by EEG electrodes is a widely used physiological correlate of driver drowsiness. This paper presents a novel dynamical modeling solution to estimate the instantaneous level of the driver drowsiness using EEG signals, where the PERcentage of eyelid CLOSure (PERCLOS) is employed as the ground truth of driver drowsiness. Applying our proposed modeling framework, we find neural features present in EEG data that encode PERCLOS. In the decoding phase, we use a Bayesian filtering solution to estimate the PERCLOS level over time. A data set that comprises 18 driving tests, conducted by 13 drivers, has been used to investigate the performance of the proposed framework. The modeling performance in estimation of PERCLOS provides robust and repeatable results in tests with manual and automated driving modes by an average RMSE of 0.117 (at a PERCLOS range of 0 to 1) and average High Probability Density percentage of 62.5%. We further hypothesized that there are biomarkers that encode the PERCLOS across different driving tests and participants. Using this solution, we identified possible biomarkers such as Theta and Delta powers. Results show that about 73% and 66% of the Theta and Delta powers which are selected as biomarkers are increasing as PERCLOS grows during the driving test. We argue that the proposed method is a robust and reliable solution to estimate drowsiness in real-time which opens the door in utilizing EEG-based measures in driver drowsiness detection systems.

Robust anticipation of continuous steering actions from electroencephalographic data during simulated driving

Article Open access 03 December 2021

Electrophysiological frequency domain analysis of driver passive fatigue under automated driving conditions

Article Open access 13 October 2021

Utilizing EEG and fNIRS for the detection of sleep-deprivation-induced fatigue and its inhibition using colored light stimulation

Article Open access 20 April 2023

Introduction

Recent studies have shown that drowsiness is one of the major factors of road accidents that causes a large number of fatalities and monetary losses^1,2,3,4. National Highway Traffic Safety Administration (NHTSA) announces that about 1.9% of total driving fatalities in 2019 (697 fatalities) were caused by drowsy drivers⁵. In another report, NHTSA estimated that in 2017, 91,000 police-reported crashes involved drowsy drivers that led to approximately 50,000 traffic injuries and 800 fatalities⁶. An assessment of the American Automobile Association (AAA) found that about 24% of drivers revealed been extremely drowsy while driving, at least once in the last month⁷. Furthermore, 14.5% of the drivers in the USA have been involved in at least one drowsiness-related traffic collision, according to a study carried out in 2008⁸. Some studies also showed that the level of drowsiness in automated driving is significantly higher than in manual driving^10,11,12. Given all this evidence, the estimation of driver fatigue is essential for road safety and also future intelligent transportation systems require a vigilant driver for take-over requests from automated vehicles failing to perform safely.

Generally, three types of data have been used in the literature to design driver drowsiness detection systems: (1) vehicle-based^13,14, (2) vision-based^15,16, and (3) physiological data^17,18. The literature suggests that physiological data such as EEG may be more appropriate than other systems to detect the onset of driver drowsiness^19,20 specifically because vehicle-based and vision-based systems can be too late in warning the driver in the early stages of drowsiness, when there might still be time to prevent the accident. Critical signs of drowsiness such as yawning and head-nodding often appear before lateral displacement of the car and other non-physiological signs. Vision-based systems, while convenient, also suffer from robustness limitations in different light conditions and their performance can be significantly degraded when the drivers wear glasses or sunglasses^21,22. Furthermore, data privacy can also be another issue for vision-based drowsiness detection systems which should be more studied in future research works.

Neural activities collected using EEG electrodes are widely exploited to classify and predict the different levels of driver drowsiness. There are many methods to produce these classifications and predictions, including a range of useful EEG layouts and machine learning techniques. For example, in designing a driver drowsiness detection system, Ma et al.²³ used the Principal Component Analysis (PCA) technique and a deep neural network to extract features and predict instances of drowsiness using EEG data. Another study used features such as Higuchi and Petrosian fractal dimensions, and the logarithm of energy extracted from EEG as inputs to a Neural Network (NN) which is trained to classify the driver drowsiness²⁴. Chen et al. used a similar NN-based classifier with different non-linear neural features extracted from subbands of the EEG signals using wavelet transformation²⁵. There is also research suggesting single EEG channel recordings from the T7 electrode in the temporal lobe have predictive power to detect driver drowsiness. Wavelet transform has been used to extract features from this channel and classical classifiers have obtained satisfying performance for drowsiness classification²⁶. Bajaj et al.²⁷ proposed another EEG feature extraction methodology based on tunable Q-factor wavelet transformation. The extracted features were then interpreted by classifiers such as support vector machines and K-nearest neighbors to classify the driver’s vigilance as alert or drowsy. Yeo et al.²⁸ also proposed a method based on a support vector machine classifier trained by several extracted frequency-domain features from EEG sub-bands. Independent Component Analysis (ICA) has been exploited in²⁹ to extract the EEG sources where ICA models are designed for each alertness and drowsiness state. Results showed that ICA models are negatively and positively correlated with reaction speeds in the alertness and drowsiness states, respectively. The power spectrum of EEG sub-bands has been analyzed in³⁰ and results showed Alpha and Theta band powers increase significantly during transition from alert to drowsy state. Budak et al.³¹ also proposed the ensemble majority voting of three deep networks that were trained using different EEG features to classify the vigilance state into two classes: awake and drowsy. On the contrary, a support vector machine-based posterior probabilistic model was proposed in³² that used the power of Theta, Alpha, and Beta sub-bands of EEG data and transformed the drowsiness level to any value between 0 and 1.

Aforementioned solutions show the promising classification of driver drowsiness using EEG-based neural features. For a system to be useful in predicting drowsiness before a subject is drowsy and accident prevention is possible, we can improve on this work by providing information on how the level of drowsiness changes as a function of neural activities. In other words, these existing methods are unable to detect the early stages of the drivers’ drowsiness, when drivers can be warned early enough to prevent impaired driving. We seek, with this paper, to solve this issue by developing a framework that characterizes the distribution of neural activities as a function of driver drowsiness. A scientifically established measure to assess driver drowsiness is the PERcentage of Eyelid CLOSure (PERCLOS)^33,34. Therefore, we use this variable as the ground truth of the driver drowsiness and our proposed modeling framework predicts PERCLOS as a function of neural features. To resolve the issue of previous methods, a generative model for drowsiness tracking is proposed in this paper that provides a moment-to-moment assessment of PERCLOS. This method provides a posterior distribution of PERCLOS. Therefore, we can build other metrics like drowsiness level at a specific time or over a period as a function of the PERCLOS posterior distribution estimate. In other words, this method makes it possible to predict the trajectory of PERCLOS in the next multiple seconds which is an important factor to prevent accidents or create a timely countermeasure.

Alongside developing a real-time modeling solution to estimate driver drowsiness, we are interested in identifying neural biomarkers of drowsiness which may be useful to others studying drowsiness and needing reliable biomarkers. In the development of our modeling solution, we expand on the dynamical neural encoder-decoder modeling framework which has been successfully utilized in other applications such as extracting multi-dimensional auditory and visual stimulus-response correlations³⁵, decoding neural recordings to predict speech³⁶, reconstructing natural images using Bayesian decoder³⁷, and decoding hidden cognitive states³⁸.

In the extension of the dynamical encoder-decoder modeling framework in estimating PERCLOS, we provide a new model to characterize the temporal dynamics of PERCLOS. Using neural encoder models, we search for a subset of neural features encoding PERCLOS. We finally demonstrate how the state process for the PERCLOS and neural encoder models can be combined to estimate PERCLOS in real-time.

Data collection and study procedure

Apparatus

This study was carried out in a fixed-base driving simulator called Automated Driving Simulator of Graz (ADSG) at the Graz University of Technology (TU Graz), which is based on a full production vehicle. Visual cues are simulated by eight large LCDs, placed around the windshield and the left and right side windows, and one in the rear section of the car. Acoustic cues are simulated by a stereo sound system and several shakers, providing engine sound, background noise, and vibrations. The vehicle has an automatic gearbox, and drivers can control the car using a force feedback steering wheel and pedals. The realism of the simulator was validated with driving tests in previous projects³⁹. Automated driving functions are implemented for longitudinal (by employing the adaptive cruise control) and lateral vehicle control (by employing lane-keeping assist). The driver information was limited to speed and indicator information, using a tablet PC. The driver can operate adaptive cruise control and lane-keeping assist systems with a touch screen located on the right side of the dashboard. For the present study, EEG channels are collected using an g.Nautilus Research® EEG cap (https://www.gtec.at/) and driver’s head position, eyelid movement, pupil diameter, and gaze direction are also measured with an infrared-based eye-tracking system called SmartEye® (https://smarteye.se/). Figure 1 shows four different views of a driver when he was performing the test.

Driving tests procedure

In this study, drivers participated in two different driving modes: ‘manual’ and ‘automated’. In the automated mode, lane-keeping and cruise control systems adjust the vehicle’s lateral position and longitudinal speed in the test track, respectively. Drivers conducted a 30-minute highway driving test in each of these modes in two states including ‘rested’ and ‘fatigued’. In the rested state test, drivers were asked to stick to a full night’s sleep routine before the test and not diverge from their usual circadian cycle. For carrying out the fatigued state test, there were two choices. One choice was to stay awake for at least 16 hours continuously before starting the test procedure (resulting in a drive after at least 17 to 18 hours of wakefulness) and to take the test at their usual bedtime. Another choice was a sleep restriction of at least 50% (max. 4 hours of sleep) the night before the test. Overall, 92 drivers, balanced in age and gender participated in the four different driving tests described above. More information about the testing procedure can be found in our previous work¹⁴.

In this study, a data subsample of 18 driving sessions from 13 drivers (5 females and 8 males; age $44.5\mp 18.8$ years) that have various levels of PERCLOS and acceptable eyelid data quality have been selected to estimate the level of drowsiness in drivers. The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics Committee of Medical University of Graz (Code 30-409 ex 17/18, approved on 03.08.2018). Informed consent was obtained from all participants before the experiments. EEG signals were collected using gel electrodes, with a sampling frequency of 500 Hz, and 24-bit resolution. Eight electrodes have been used to collect EEG signals including Cz, Fz, T7, T8, C3, C4, PO7, and PO8. To capture eye movements, two EOG electrodes were attached above and below the right eye. The EOG signal was calculated as the difference between the two EOG electrodes data. The positions of the EEG electrodes along with the ground electrode (GND) placed in the AFZ electrode are marked by red and yellow circles in Fig. 2, respectively.

Methodology

Calculation of the actual PERCLOS

PERCLOS is a measure of drowsiness that is defined as the proportion of time in a minute that eyes are at least 80 percent closed³³. For higher PERCLOS values, where the eyes are mostly closed for longer periods of time than for lower PERCLOS values, subjects exhibit strong correlation with a common sign of drowsiness in driving which is lane deviation in the road^33,40. To calculate this measure, a one-minute sliding window with a 30 seconds overlap between every two consecutive windows has been applied to the eyelid signal. The PERCLOS of four different driving modes of the same driver is shown in Figure 3. In this Figure, PERCLOS increases up to 0.9 in the Fatigued-Automated test and goes up to 1 (completely closed) in the Fatigued-Manual test. This range of PERCLOS suggests that the driver is extremely drowsy in the fatigued mode tests while PERCLOS is barely higher than 0.3 in the rested tests.

Preprocessing of the EEG channels

EEG signals are contaminated by different noise sources including eye movements, eye blinks, and muscle activity. Different methods have been proposed by researchers to preprocess the EEG signals. For example, Empirical Mode Decomposition (EMD) was used in^41,42 that decomposes the EEG signals into a specified number of mode functions that can characterize both of neural activities and muscle artefacts. Cleaned signals were obtained by using the mode functions which present the neural activities. Wavelet-based denoising methods have also been applied in previous studies^43,44,45. In those methods, Discrete Wavelet Transform (DWT) is applied to the EEG signals to decompose them to their wavelet coefficients. A threshold filter is applied then to remove the wavelet coefficients which represent the noise and artefacts. Finally, the cleaned EEG signals are reconstructed using the remaining coefficients. Independent Component Analysis (ICA) is also another method that has been widely used to preprocess the EEG channels^46,47,48. ICA decomposes the raw EEG signals into independent components, where the source of each component can be identified using its scalp topography. The denoised signals are reconstructed by removing those components that are not originated from the brain lobes^49,50. In this paper, we applied the ICA method via the EEGLAB toolbox⁵¹ to preprocess the EEG data. This toolbox provides some visual information using scalp topography that makes it easier than other methods to remove noisy components when the clean data are retained in the reconstructed channels. Supplementary Information presents more details about using this toolbox for EEG preprocessing.

PERCLOS neural encoder model

In this paper, driver drowsiness is considered as hidden cognitive state that cannot be measured or known directly. According to the literature^33,52,53, we can assume that PERCLOS represents the levels of driver’s vigilance and therefore we might estimate driver vigilance by directly estimating the PERCLOS with an encoder-decoder model that uses EEG features. The flowchart of the proposed framework is presented in Fig. 4. In this subsection, the elements of our proposed encoder-decoder model are discussed. Before building the encoding model, the driving tests are randomly separated into two sets: training set, and test set. Three driving tests with the IDs of 6, 9, and 15 make the test set and the data of the other fifteen tests are used as the training set. The encoder model is designed by using only the training set. The encoder model consists of: (1) a dynamical model to characterize how PERCLOS evolves over time as a state variable independent of EEG data, and (2) an observation model which characterizes how PERCLOS is encoded in the neural features. Each of these models has a separate set of learnable parameters that are estimated using linear regression.

We assume that PERCLOS is a stochastic process characterized by a positive random variable, with a range of 0 to 1. PERCLOS temporal dynamic over-time is defined by the state transition process shown in Eq. (1). This Equation shows our prior knowledge of how PERCLOS changes over-time without knowing the neural activity. As this Equation shows, PERCLOS in every time index only depends on its value in the previous time index and it is independent of the EEG features. We assumed that this equation is valid for the calculated PERCLOS values for all drivers and the PERCLOS of training set tests are concatenated to each other to identify its unknown parameters.

$$\begin{aligned} {X}_{i} = 0.5 (1+\tanh (a{X}_{i-1}+b+\epsilon _{i-1}))\; ; i = 1, 2, ..., K \end{aligned}$$

(1)

In this Equation, ${X}= [x^{1}, x^{2}, ..., x^{Ntr}]$ is the $1 \times K$ vector of PERCLOS made by horizontally concatenating of the PERCLOS values of the training tests ($x^{1}$ to $x^{Ntr}$) and Ntr is the number of driving tests in the training set which here is 15. $\{a,b\}\in \mathbb {R}$ are free unknown parameters and $\epsilon$ is a zero-mean Gaussian noise with the unknown variance of $\sigma _{\epsilon }^{2}$; $\epsilon _{i} \sim \mathcal {N}(0,\,\sigma _{\epsilon }^{2})$. The identification of these unknown parameters is described in the subsection of ”Estimation of state transition process’ parameters”.

We assume that $Y^{d} = [y^{1,d},y^{2,d},...,y^{C,d}]$ is the $L \times C$ matrix of EEG features extracted from the d-th driving test of the training set ($d = 1, 2, ..,Ntr$), where C is the number of EEG features and L is the length of extracted feature and length of PERCLOS vector in every driving test. We also assume that every EEG feature is independent of other features given PERCLOS. Therefore, the conditional distribution of each feature extracted from every driving test, $y^{c,d}$, given the corresponding PERCLOS values of the driver, $x^{d}$, is presented by Eq. (2).

$$\begin{aligned} y_{k}^{c,d}|x_{k}^{d}\sim f(x_{k}^{d};\theta ^{c,d})\; ; c = 1,2,...,C\; ; d= 1, 2, ..,Ntr\; ; k = 1,2,...,L \end{aligned}$$

(2)

where f defines the conditional distribution and $\theta ^{c,d}$ is the set of parameters for the c-th EEG feature of d-th driving test. Equations (1) and (2) define our dynamical encoder model, characterizing how changes in EEG features over time encode PERCLOS progression in every driving test. In our modeling of the EEG features, we assumed that the conditional distribution of each feature given PERCLOS follows a normal distribution. The mean of the distribution is defined as a linear function of the PERCLOS and the standard deviation of the distribution is assumed to be constant for every EEG feature. This distribution is defined by Eq. (3).

$$\begin{aligned} y_{k}^{c,d}|x_{k}^{d}\sim \mathcal {N}(\alpha ^{c,d}x_{k}^{d}+\beta ^{c,d},\,\sigma _{c,d}^{2})\; ; c = 1,2,...,C\; ; d= 1, 2, ..,Ntr\; ; k = 1,2,...,L \end{aligned}$$

(3)

where $\alpha ^{c,d}$, $\beta ^{c,d}$ and $\sigma _{c,d}^{2}$ are the unknown slope and intercept parameters and the unknown variance of observed noise for the c-th EEG feature of d-th driving test, respectively. Therefore, every EEG feature in every driving test has a specific set of parameters given the PERCLOS vector of the corresponding driving test. The Subsection of ”Estimation of observation equation’s parameters” describes the identification of these unknown parameters.

PERCLOS decoder model

The parameter learning of the method which is based on the training data is performed in the encoding step. The learning process is explained in the “Model Identification” section. However, in the decoder step, the learnt parameters are used in the structure of the Bayesian filtering to estimate the PERCLOS. In other words, the model parameters are not fixed or pre-known and they are estimated using the training data in the encoder step and the same estimated values are used for the decoding step.

Given the encoder model with estimated parameters, we can use Bayesian filtering to estimate PERCLOS from neural data. This filter provides the best estimation of the PERCLOS, which is our dynamic state, given current and previous values of EEG features through its posterior distribution. The Bayesian filter is a recursive technique that can be conducted by calculating two equations per each time index: one-step perdition and update⁵⁴. Bayesian filter benefits from two processes: state-transition process and observation processes that we have defined these processes in Equations (1) and (3), respectively. Figure 5 demonstrates the general structure of the Bayesian filtering that has three main steps: (1) Chapman-Kolmogorov equation that calculates the one-step prediction of the state, (2) Likelihood function that calculates the likelihood of possible values of PERCLOS given the observed neural feature, and (3) The Bayes’ rule to update the one-step prediction based on the current EEG features. This step updates the filter and provides a posterior distribution of the state (PERCLOS) given the measured observations (EEG features).

Model identification

In the previous subsections, we described how the encoder model is defined. We also described the decoder or PERCLOS estimation using neural data. For the decoder step, we assumed that the encoder and the state transition model parameters are known; here, we describe how those parameters can be identified.

Estimation of state transition process’ parameters

The state transition process defined in Equation (1) can be rewritten by Equation (4). Now, the equation becomes a linear function of a and b when PERCLOS values are known.

$$\begin{aligned} a{X}_{i-1}+b+\epsilon _{i-1} = tanh^{-1}(2{X}_{i}-1)\; ; i = 1, 2, ..., K \end{aligned}$$

(4)

where a, b, and noise variance $\sigma _{\epsilon }^{2}$ are estimated using Least Square (LS) technique. In the LS, it is assumed that $h_{i}=tanh^{-1}(2{X}_{i}-1)$ is the input for the regression problem:

$$\begin{aligned} a{X}_{i-1}+b+\epsilon _{i-1} = h_{i}\; ; i = 1, 2, ..., K \end{aligned}$$

(5)

The Residual Sum of Square (RSS) error, as presented in Equation (6), is minimized to obtain the parameters of a and b:

$$\begin{aligned} RSS_{X} = \sum _{i=1}^{K} \epsilon _{i-1}^2 = \sum _{i=1}^{K} \left( h_{i}-a{X}_{i-1}-b\right) ^2, \end{aligned}$$

(6)

In order to minimize $RSS_{X}$, the derivatives of the $RSS_{X}$ w.r.t two parameters of a and b are set to be zero:

$$\begin{aligned} \frac{\partial RSS_{X}}{\partial a} = 0\; ; \frac{\partial RSS_{X}}{\partial b} = 0 \end{aligned}$$

(7)

After applying the Eq. (7) and simplification of the results, the parameters of a and b are calculated as presented in Eq. (8)⁵⁵.

$$\begin{aligned} a = \frac{\sum _{i=1}^{K} \left( X_{i-1}- \mu _{X}\right) \left( h_{i}-\mu _{h}\right) }{\sum _{i=1}^{K}\left( X_{i-1}-\mu _{X}\right) ^2}\; ; b = \mu _{h}-a\mu _{X}, \end{aligned}$$

(8)

where $\mu _{X}= \frac{1}{K}\sum _{i=1}^{K} X_{i}$ and $\mu _{h} = \frac{1}{K}\sum _{i=1}^{K} h_{i}$ are the PERCLOS mean and input mean (see Equation 5), respectively. The noise variance is also calculated as the variance of $\epsilon _{i-1} = h_{i}-aX_{i-1}-b$ $(i = 1, 2, ..., K)$.

Estimation of observation equation’s parameters

We use linear regression to identify the parameters of the observation equation per EEG feature in every driving test $(\alpha ^{c,d}$, $\beta ^{c,d}$ and $\sigma _{c,d}^{2})$. Linear regression minimizes the root mean square error between estimated and actual EEG features given PERCLOS data. In order to estimate these parameters, Eq. (3) is rewritten as Eq. (9).

$$\begin{aligned} y_{k}^{c,d} = \alpha ^{c,d}x_{k}^{d}+\beta ^{c,d}+v_{k}^{c,d}\; ; c = 1,2,...,C\; ; d= 1, 2, ..,Ntr\; ; k = 1,2,...,L \end{aligned}$$

(9)

where $v_{k}^{c,d} \sim \mathcal {N}(0,\,\sigma _{c,d}^{2})$ is the Gaussian noise of linear regression of Eq. (9) which is calculated per EEG feature in every driving test. In order to estimate these parameters, the RSS error ($RSS_{y}^{c,d}$) is calculated for each EEG feature in every driving test by using the Eq. (10).

$$\begin{aligned} RSS_{y}^{c,d} = \sum _{k=1}^{L} \left( y_{k}^{c,d} -\alpha ^{c,d}x_{k}^{d}-\beta ^{c,d}\right) ^2\; ; c = 1,2,...,C\; ; d= 1, 2, ..,Ntr\; ; k = 1,2,...,L \end{aligned}$$

(10)

After minimization of $RSS_{y}^{c,d}$, the unknown parameters of $\alpha ^{c,d}$ and $\beta ^{c,d}$ are determined using Equation (11)⁵⁵.

$$\begin{aligned} \alpha ^{c,d} = \frac{\sum _{k=1}^{L} \left( x_{k-1}^{d} - \mu _{x}^{d}\right) \left( y_{k}^{c,d}-\mu _{y}^{c,d}\right) }{\sum _{k=1}^{L}\left( x_{k-1}^{d}-\mu _{x}^{d}\right) ^2}; \; \beta ^{c,d} = \mu _{y}^{c,d}-a\mu _{x}^{d} \; ; c = 1,2,...,C\; ; d= 1, 2, ..,Ntr\; ; k = 1,2,...,L \end{aligned}$$

(11)

where $\mu _{x}^{d} = \frac{1}{L}\sum _{k=1}^{L}x_{k}^{d}$ and $\mu _{y}^{c,d} = \frac{1}{L}\sum _{k=1}^{L}{y}_{k}^{c,d}$ and the variance of the error for every EEG feature in every driving test is calculated as the variance of $v_{k}^{c,d} = {y}_{k}^{c,d}-\alpha ^{c,d}x_{k}^{d}-\beta ^{c,d}$.

We also applied some transformations such as logarithm and exponential functions to the EEG features to check if these transformations can improve the regression or not. The autocorrelation of residual errors and R2 coefficient are also calculated to evaluate the goodness of fit for every EEG feature.

Observation model selection

Though all the neural features can be used in the decoding step, a more practical approach would use only a subset of features that shows strong encoding properties. This process helps to build a more robust and generalizable decoder model by excluding those features which lack reliable and consistent predictive power. With the independence assumption of the neural features, we can check the statistical significance of encoding power of each feature by examining the value of $\alpha ^{c,d}$ described in Eq. (3).

Our null hypothesis is that $\alpha ^{c,d}$ (the slope parameter in Eq. (4)) is zero. Therefore, a t-test per each neural feature has been applied to check whether the null hypothesis can be rejected or not. The p-value of 0.05 has been set as a threshold and only those neural features that their corresponding $\alpha ^{c,d}$ comes with strong evidence to fall in the alternate hypothesis, have been selected. This subset of features is then used in the decoding step. Therefore, in the decoding step, only a subset of neural features will be picked, whose statistical significance is in a favor of being included in the encoder model feature set.

Application of the methodology

In this section, we first discuss how neural features are extracted from EEG signals; we then use the encoder-decoder pipeline to predict the PERCLOS. Preprocessed EEG data of each channel is first decomposed into four sub-bands using band-pass filtering: delta (0.5–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), and beta (12–30 Hz)^56,57. Then, statistical features of these sub-bands such as their spectral power have been employed in different applications to reduce the dimensionality of the EEG data while significant information is retained during feature extraction^58,59. These features help to investigate the changes in the EEG data in an interpretable way when the driver drowsiness level is fluctuating during the driving test. Here, fifty features are extracted from each one of the eight EEG channels and one EOG channel that results in 450 neural features for each driving test. The extracted features are listed in Supplementary, Table S1. The same sliding time window that is used to calculate the PERCLOS (1-minute length with a 30-second overlap between two adjacent time windows) has also been applied for EEG feature extraction. All parts of the proposed method have been programmed in MATLAB R2021a. Moreover, the EEGLAB toolbox (v14.1.2) was used in MATLAB to preprocess the EEG data using the ICA technique.

To find the state transition process parameters, we concatenated PERCLOS data across all users (resulted in approximately 540 minutes of driving) and use the LS approach to estimate a, b, and $\sigma _{\epsilon }^{2}$ parameters. In this work, it has been assumed that all users have a reasonably similar state transition process. Therefore, the same estimated parameters of the state transition process (Eq. 1) are used in the Bayesian filtering to estimate PERCLOS. Table 1 represents the estimated parameters for PERCLOS dynamical model defined by Eq. (1). The dynamics of the estimated PERCLOS are adjustable by changing these parameters. For instance, if each of the a and b parameters approaches the positive infinity, the limit of PERCLOS is one (its maximum value). On the other hand, the limit of the PERCLOS is zero (its minimum value) if each one of these parameters approaches negative infinity. Figure 6 shows the PERCLOS residual error of the outputted result from the LS method using the estimated parameters of the state transition equation. As Fig. 6 shows, the absolute value of the residual error in some parts is about two to three times larger than other parts. After checking the actual PERCLOS values, we realized that these parts are either associated with very high (approximately one) or very low (approximately zero) actual PERCLOS values that show the states of completely alert and extremely drowsy situations, respectively. However, we are aimed to model the dynamic transition between these two states (completely alert and extremely drowsy). The root mean squares error (RMSE) between actual PERCLOS and modeled PERCLOS is 0.061. This result suggests that the proposed state transition process (Equation 1) can reasonably capture the PERCLOS dynamics with acceptable performance.

Table 1 State transition process model parameters. These parameters are estimated using LS method and they are assumed to be constant for all of the driving tests. In other words, state transition process of PERCLOS (Equation 1) has the same parameters in different driving tests.

Full size table

In the encoder model, a subset of EEG features has been selected using the model selection approach. Note that the decoder model combines two sources of information at different temporal scales in the prediction of PERCLOS: (1) long-term information that is carried by the state process Eq. (1), and (2) instantaneous information carried by neural activity about PERCLOS (Eq. 3). These two sources of information are combined through Bayesian Filtering in the estimation of PERCLOS.

Using the proposed modeling framework, we build a user-specific encoder and decoder model of PERCLOS. we only assumed that PERCLOS temporal dynamics across users share the same characteristics. Given this model, it is possible that a neural feature might be positively correlated with PERCLOS in one participant and negatively correlated in another one. Whilst this might provide a more accurate prediction given the specificity of the model to a specific user, we can search for possible neural biomarkers which are showing consistent encoding properties across participants. We can benefit from the encoding step in the search for possible biomarkers, those that are representing PERCLOS changes consistently across users.

Results

Results of biomarker identification

This subsection explains the results of searching the EEG features to find biomarkers of drowsiness. We searched across all EEG features to identify strong correlations to PERCLOS based on their slope ($\alpha ^{c,d}$ in Equation 9). According to the results of the encoder model, 28 highly performant EEG features have been identified that generalize for all 18 driving tests. This means that regardless of the user, these features are significantly realted to the PERCLOS values recorded during the driving test. Therefore, they have the potential to be biomarkers of drowsiness, with highest performance when considered together. These features are presented in Table 2 that include skewness of Alpha of all EEG channels and one EOG channel (9 features), Delta power of all EEG channels and one EOG channel (9 features), Theta power of all EEG channels except Cz and P08 (7 features), and Hjorth mobility of Delta of T8, P08, and EOG channels (3 features).

Table 2 EEG features that are consistently significant (p-value $< 0.05$) across all the 18 studied driving tests. Overall, 28 features are selected by the encoder regardless of driving tests to encode the PERCLOS dynamics. These features include skewness of Alpha (all EEG channels), Delta power (all EEG channels), Theta power (all EEG channels except Cz and P08), Hjorth mobility of Delta (T8, P08, and EOG channels).

Full size table

Figure 7 also shows the regression coefficient between every biomarker and PERCLOS in all driving tests. As this Figure shows, the average consistent Delta and Theta powers are positively correlated with PERCLOS in all EEG channels. This result is in accordance with established studies that report increases in Theta and Delta powers as indicators of drowsiness^4,58,60. The skewness of Alpha in all EEG channels except T8 and P07 is also positively correlated with PERCLOS while Hjorth Mobility of Delta in T8 and P08 are negatively and in EOG channels positively correlated with PERCLOS. Therefore, the proposed framework establishes biomarkers that have consistent relationships with PERCLOS. These neural features could therefore be extracted from EEG signals to estimate the drowsiness independently of the drivers and driving conditions. Figure 8 shows the sign of the correlation coefficient of biomarkers and PERCLOS in the driving tests. According to this Figure, about 73% and 66% of the Theta and Delta powers which were selected as biomarkers are positively correlated with PERCLOS, respectively. On the other hand, only 48.8% and 44.4% of the skewness of Alpha and Hjorth mobility of Delta are positively correlated with PERCLOS, respectively. Therefore, discovered biomarkers make a “push-pull mechanism” to estimate the driver drowsiness. In this mechanism, one group of biomarkers that includes Theta and Delta powers are increasing with increasing the level of drowsiness (pushing part of the mechanism), whereas another group of biomarkers that consists of skewness of Alpha and Hjorth mobility of Delta are decreasing with increasing the drowsiness level (pulling part of the mechanism). This interaction between these two parts of the mechanism suggests that considering these biomarkers together provides the best performance for estimating PERCLOS and driver drowsiness and obtains a satisfying estimation of driver drowsiness associated with the PERCLOS data.

Results of the decoder model for PERCLOS estimation

In this subsection, we discuss the modeling results of our proposed encoder-decoder framework in the estimation of PERCLOS. The data set has been randomly separated into two data sets: train and test. The training set contains 15 tests where three driving tests with ID = 6, 9, and 15 have been selected to make a test dataset. The test dataset has not been involved for selecting the biomarkers. Neural biomarkers (see Table 2) are found using training dataset and are employed to estimate the PERCLOS in the test dataset. In Fig. 9, we show the decoding results of the test dataset. These results suggest that the proposed framework reasonably traces the drowsiness level presented by the actual PERCLOS data. Figure 9 also presents the upper and lower bounds of the 95% confidence interval of the Bayesian estimation. These bounds are utilized to calculate the High Probability Density (HPD) percentage⁶¹. The HPD presents the percentage of the data samples per driving test where the actual PERCLOS falls in the 95% confidence interval of the estimated one.

In order to investigate the PERCLOS estimation accuracy of the frameworks, the RMSE and HPD percentage metrics for PERCLOS estimation are provided in Fig. 10 for all driving tests. This Figure shows that the average RMSE and average HPD percentage are 0.117 and 62.5%, respectively. Moreover, to study the performance of the method in the different levels of drowsiness, PERCLOS is separated into four intervals: 0-0.25, 0.25-0.5, 0.5-0.75, and 0.75-1. The average RMSE and HPD percentage of each one of these intervals during all of the driving tests are presented in Fig. 11. According to this Figure, this average HPD percentage and RMSE are increasing and decreasing as PERCLOS grows, respectively. Therefore, this model obtains better performance in the higher actual PERCLOS (moderate and extreme levels of driver drowsiness), which in practice is more important to detect the driver drowsiness than states with low PERCLOS values.

Discussion

Different biosignal-based methodologies have been proposed in the literature to estimate the drivers’ workload and their cognitive states^62,63,64. For example, electrodermal activity was recorded in⁶⁵ with simultaneous collection of subjective rating of mental workload during driving tests where drivers were asked to perform a time-production task. Results showed that electrodermal activity increases together with subjective ratings in more complex driving scenarios that would indicate higher levels of driving workloads. Argyle et al.⁶⁶ also investigated the relationship between fatigue as a cognitive state and heart rate, breathing rate, and hemodynamic response in the prefrontal cortex as physiological responses. According to the results, fatigue significantly influences physiological responses.

In Brain-Computer Interface (BCI) applications^67,68 and clinical neuroscience^69,70, researchers are interested to find neural biomarkers. To find biomarkers, the data of different subjects are usually combined while the individual differences between subjects are ignored. Because of the present individual differences, it can be challenging to find neural biomarkers which are scalable from one person to another one across the group. Most of the similar previous studies used EEG data as inputs to discriminative models to classify levels of drivers’ drowsiness. For example, Li et al.³² developed a method for drowsiness estimation using powers of Theta, Alpha, and Beta subbands of EEG data while the ground truth for drowsiness was derived by a combination of PERCLOS and Number of Adjustment (NOA) of the steering wheel⁷¹ during the test. Considering this ground truth, three classes were defined: (1) alert (PERCLOS$<8\%$ and NOA$>26$), (2) early warning (8%$\le$PERCLOS$<12\%$ and $9<$NOA$\le$26), and (3) drowsy (PERCLOS$\ge$12% and NOA$\le$9). Finally, a support vector machine was used as classifier and according to the results, this method provided classification accuracies of 91.25%, 83.78%, and 91.92% for alert, early warning, and drowsy classes, respectively. A Convolutional Neural Network (CNN) was also applied to the EEG data for drowsiness detection in⁷². In that study, the Alpha-Theta waves (5–9 Hz) of two occipital (O1 and O2) electrodes and two temporal (T7 and T8) electrodes were used as inputs to a Convolutional Neural Network (CNN) network while data augmentation was also used to reduce the risk of over-fitting. Results demonstrated that this method achieved the binary classification accuracy of 90% for driver drowsiness classification. Detection of more levels of drowsiness was also studied in some previous works. For example, the classification of driver drowsiness into five different classes was performed in⁷³ using EEG channels and by applying a combination of CNN and Bidirectional Long-Short-Term-Memory (Bi-LSTM) network. In that study, the CNN extracted the features from EEG data and Bi-LSTM derived the long-term dependencies between extracted features. According to the results, this method achieved an average classification accuracy of 69% for five different levels of driver drowsiness.

The main advantage of our proposed method over the previous method is its capability for finding neural biomarkers that consistently encode the drowsiness dynamics in different drivers independent of their characteristics (e.g. age and gender) and driving conditions (manual or automated). Moreover, our method estimates the real-time estimation of drowsiness by providing the estimation for the posterior distribution of PERCLOS. This posterior distribution can also be used to predict the drowsiness level in the next time indices.

In this paper, we proposed a new modeling framework using neural activities to provide an instantaneous estimation of the PERCLOS as a widely used estimation of driver drowsiness. The PERCLOS is being considered as a robust correlate of driver drowsiness which is widely studied to assess driver’s performance in the different vigilance states. Our proposed framework is derived from extensive work in the neuroscience domain where the question was finding the relationship between cognitive state and neural correlates^36,74,75. The framework has two steps: encoder and decoder. When each of these steps is built through a sequential process, we come up with a dynamical estimation of PERCLOS as a function of the selected number of neural features. One of the advantages of this method compared to previously developed methods is that we are providing a posterior distribution of PERCLOS at every time point which is a fairly complete measure of PERCLOS. Through this measure, we can build other metrics which can be used to assess the trajectory of a driver’s drowsiness and anticipate the time that the driver can be in a dangerous level of driver drowsiness or even decide about whether the driver drowsiness level is above a specific predefined level or not (a predefined threshold might be used to trigger appropriate action or warning). Another advantage of this model is the real-time estimation of driver drowsiness that can reduce the risk of accidents caused by drowsy driving.

Given the preprocessing and encoding strategy of our model, the decoding step for drowsiness estimation requires low computational effort and can be performed in real-time as data is collected. In the preprocessing step, artifacts that might have undesirable effects on the system’s performance are rejected from EEG data, and in the encoding step, the EEG features that can encode the PERCLOS (biomarkers) are discovered across participants. Consequently, only 28 features out of the initial 450 features are used in the decoding step. Therefore, the decoding step is computationally inexpensive and we can have a real-time estimation of PERCLOS. Compared to previously used deep learning methods^76,77,78, our proposed method also needs a lower computational cost in the training phase to find out the neural biomarkers. This advantage is obtained by applying the prior knowledge of the real PERCLOS dynamics regardless of EEG features, and also the discovered relationship between EEG features and PERCLOS in the encoder step.

The most important element of our research is the principal approach to find neural biomarkers for driver drowsiness which has not yet been extensively studied in previous works as those works were classifying driver drowsiness as a black-box model^79,80,81. In this study, we are providing more details about the relationship between neural activities and PERCLOS in an interpretable manner for the benefit of other researchers in our domain. For instance, we found that the Theta power of the C4 EEG channel is a biomarker of drowsiness which increased with higher PERCLOS values in 15 out of 18 driving tests. On the other hand, the Alpha power of the PO8 EEG channel is another example of obtained biomarkers that decreased in 13 out of 18 tests for higher PERCLOS values.

Although we added new utilities to this domain, more research should be conducted to enhance the performance of this method. Some of the challenges that need to be addressed are as follows:

1.
Producing a personalized model is possible by including more EEG channels but there is a trade-off between the utility of the device and the number of EEG channels. We think that rather than increasing the number of EEG channels, more physiological information such as ECG and heart rate variability data^17,82 can be utilized to enhance the model performance with greater feature independence and robustness.
2.
Although the proposed method provides us a solution to find neural biomarkers, we have only studied the fluctuation of nodal frames while more advanced techniques are studying network global dynamics^83,84. Employing other features that are presenting the network global dynamics of brain activities like coherence, correlation, and mutual information between different EEG channels^85,86,87 might improve the performance of the framework. It should be noted that the proposed framework is flexible enough to incorporate those features into our model but the question is which one of them are informative to estimate the driver drowsiness. The proposed framework can also be helpful to select the informative features.
3.
This paper discussed the estimation or decoding capability of the proposed framework but another important application of this method is its prediction capability where we can predict what will be the level of drowsiness based on the current and previous neural activities. This capability requires to use a more accurate state transition process that is tuned for every individual driver.
4.
In this method, the same estimated parameters in the encoder step were used in the Bayesian filtering of the decoder step to estimate the drowsiness. However, these parameters can also be estimated online in a recursive way when new data are available in every time index. Reinforcement learning can also be used as the method for parameter updating when the driving performance (e.g. reaction time to a traffic event) is used a reference data to provide the award/punishment mechanism.

Conclusions

One of the requirements of upcoming automated cars is monitoring the driver’s states since he/she is responsible for controlling the car in case of system failure of automated cars. Drowsiness is one of the drivers’ mental states that can significantly degrade driving performance and increase reaction time in critical situations where an accident may be avoided. Therefore, we concentrated on driver drowsiness prediction using EEG signals which were used in previous studies to detect the early stages of drowsiness^19,20. In order to accomplish this goal, an encoding-decoding framework based on EEG signals was presented to estimate PERCLOS which is a widely used indirect measure of driver drowsiness. This framework is composed of two main steps: encoder and decoder steps. In the encoder step, the relationship between every EEG feature and PERCLOS of the driving test was assumed to be linear with unknown parameters which were estimated using the least-square method. Moreover, the state transition process of PERCLOS regardless of EEG data was defined which has another set of unknown parameters. The least-square approach was also used to estimate this set of parameters.

The training set of EEG data was used to find out a set of EEG features that are significantly correlated with PERCLOS progression for every driving test. To select these features for every driving test, a t-test with a threshold of 0.05 was used to check the magnitude of the slope parameter of every EEG feature in linear regression given the PERCLOS vector. In the decoder step, only EEG features that were selected in the encoder step were used as inputs to a Bayesian filtering to estimate PERCLOS values in real-time. Moreover, the selected feature set for different driving tests was searched to find EEG biomarkers that encode PERCLOS regardless of the driver and driving condition. Overall, 450 features were extracted from EEG data and according to the results, 28 EEG biomarkers were are discovered out of all features. These 28 biomarkers (instead of all 450 features) were used in the decoder step to estimate PERCLOS.

Results of the decoder step show that the proposed method estimates the PERCLOS values with an average RMSE of 0.117 and an average HPD percentage of 62.5% over all driving tests. Therefore, this method not only estimates the drowsiness in real-time but also provides some EEG biomarkers that encode drowsiness. These biomarkers alleviate the required computational power to estimate the onset of the driver’s drowsiness using EEG data independent of driver-specific factors.

References

Arefnezhad, S., Samiee, S., Eichberger, A. & Nahvi, A. Driver drowsiness detection based on steering wheel data applying adaptive neuro-fuzzy feature selection. Sensors 19, 943. https://doi.org/10.3390/s19040943 (2019).
Article ADS PubMed Central Google Scholar
Zhang, G., Yau, K. K. W., Zhang, X. & Li, Y. Traffic accidents involving fatigue driving and their extent of casualties. Accident Anal. Prevent. 87, 34–42. https://doi.org/10.1016/j.aap.2015.10.033 (2016).
Article Google Scholar
Bener, A., Yildirim, E., Özkan, T. & Lajunen, T. Driver sleepiness, fatigue, careless behavior and risk of motor vehicle crash and injury: Population based case and control study. J. Traffic Transp. Eng. Eng. Ed. 4, 496–502. https://doi.org/10.1016/j.jtte.2017.07.005 (2017).
Article Google Scholar
Nguyen, T., Ahn, S., Jang, H., Jun, S. C. & Kim, J. G. Utilization of a combined EEG/NIRS system to predict driver drowsiness. Sci. Rep. 7, 43933. https://doi.org/10.1038/srep43933 (2017).
Article ADS PubMed PubMed Central Google Scholar
National Center for Statistics and Analysis. Overview of motor vehicle crashes in 2019: Traffic safety facts research note. report no. dot hs 813 060. https://crashstats.nhtsa.dot.gov/Api/Public/Publication/813060.
National Highway Traffic Safety Administration. Traffic safety facts 2015. https://crashstats.nhtsa.dot.gov/Api/Public/Publication/812384.
AAA Foundation for Traffic Safety. 2019 traffic safety culture index (2019). https://aaafoundation.org/wp-content/uploads/2020/06/2019-Traffic-Safety-Culture-Index.pdf
Vanlaar, W., Simpson, H., Mayhew, D. & Robertson, R. Fatigued and drowsy driving: A survey of attitudes, opinions and behaviors. J. Saf. Res. 39, 303–309. https://doi.org/10.1016/j.jsr.2007.12.007 (2008).
Article Google Scholar
Kundinger, T., Riener, A., Sofra, N. & Weigl, K. IUI 2020. In Driver drowsiness in automated and manual driving (eds Paternò, F., Oliver, N., Conati, C., Spano, L.D. & Tintarev, N.) 369–379 (Association for Computing Machinery, 2020). https://doi.org/10.1145/3377325.3377506.
Chapter Google Scholar
Schmidt, J., Braunagel, C., Stolzmann, W. & Karrer-Gauss, K. Driver drowsiness and behavior detection in prolonged conditionally automated drives. In 2016 IEEE Intelligent Vehicles Symposium (IV 2016), 400–405, https://doi.org/10.1109/IVS.2016.7535417 (IEEE, Piscataway, NJ, 2016).
Schömig, N., Hargutt, V., Neukum, A., Petermann-Stock, I. & Othersen, I. The interaction between highly automated driving and the development of drowsiness. Proc. Manuf. 3, 6652–6659. https://doi.org/10.1016/j.promfg.2015.11.005 (2015).
Article Google Scholar
Kaufmann, C., Frühwirth, M., Messerschmidt, D., Moser, M., Eichberger, A., & Arefnezhad, S. Driving and tiredness: Results of the behaviour observation of a simulator study with special focus on automated driving. Trans. Tranp. Sci. 11(2), 51–63 https://doi.org/10.5507/tots.2020.011 (2020).
Article Google Scholar
McDonald, Anthony D., Lee, John D., Schwarz, Chris & Brown, Timothy L. Steering in a random forest: Ensemble learning for detecting drowsiness-related lane departures. Human Factors 56, 986–998. https://doi.org/10.1177/0018720813515272 (2014).
Article PubMed Google Scholar
Arefnezhad, S. et al. Applying deep neural networks for multi-level classification of driver drowsiness using vehicle-based measures. Expert Syst. Appl. 162, 113778. https://doi.org/10.1016/j.eswa.2020.113778 (2020).
Article Google Scholar
Maior, C. B. S., Moura, M. J. D. C., Santana, J. M. M. & Lins, I. D. Real-time classification for autonomous drowsiness detection using eye aspect ratio. Expert. Syst. Appl. 158, 113505. https://doi.org/10.1016/j.eswa.2020.113505 (2020).
Article Google Scholar
Nojiri, N., Kong, X., Meng, L. & Shimakawa, H. Discussion on machine learning and deep learning based makeup considered eye status recognition for driver drowsiness. Proc. Comput. Sci. 147, 264–270. https://doi.org/10.1016/j.procs.2019.01.252 (2019).
Article Google Scholar
Fujiwara, K. et al. Heart rate variability-based driver drowsiness detection and its validation with EEG. IEEE Trans. Biomed. Eng. 66, 1769–1778. https://doi.org/10.1109/TBME.2018.2879346 (2019).
Article PubMed Google Scholar
Akin, M., Kurt, M. B., Sezgin, N. & Bayram, M. Estimating vigilance level by using EEG and EMG signals. Neural Comput. Appl. 17, 227–236. https://doi.org/10.1007/s00521-007-0117-7 (2008).
Article Google Scholar
Lee, B.-G., Lee, B.-L. & Chung, W.-Y. Mobile healthcare for automatic driving sleep-onset detection using wavelet-based eeg and respiration signals. Sensors (Basel, Switzerland) 14, 17915–17936. https://doi.org/10.3390/s141017915 (2014).
Article ADS Google Scholar
Zhang, Z. et al. Automatic sleep onset detection using single eeg sensor. In 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2265–2268, https://doi.org/10.1109/EMBC.2014.6944071 (IEEE, 8/26/2014 - 8/30/2014).
Weng, C.-H., Lai, Y.-H. & Lai, S.-H. Driver drowsiness detection via a hierarchical temporal deep belief network. In Chen, C.-S., Lu, J. & Ma, K.-K. (eds.) Computer vision – ACCV 2016 Workshops, vol. 10118 of LNCS sublibrary. SL 6, Image processing, computer vision, pattern recognition, and graphics, 117–133, https://doi.org/10.1007/978-3-319-54526-4_9 (Springer, Cham, Switzerland, 2017).
Shih, T.-H. & Hsu, C.-T. Mstn: Multistage spatial-temporal network for driver drowsiness detection. In Chen, C.-S., Lu, J. & Ma, K.-K. (eds.) Computer vision – ACCV 2016 Workshops, vol. 10118 of LNCS sublibrary. SL 6, Image processing, computer vision, pattern recognition, and graphics, 146–153, https://doi.org/10.1007/978-3-319-54526-4_11 (Springer, 2017).
Ma, Y. et al. Driving drowsiness detection with eeg using a modified hierarchical extreme learning machine algorithm with particle swarm optimization: A pilot study. Electronics 9(5), 775. https://doi.org/10.3390/electronics9050775 (2020).
Article Google Scholar
Mardi, Z., Ashtiani, S. N. M. & Mikaili, M. EEG-based drowsiness detection for safe driving using chaotic features and statistical tests. J. Med. Signals Sens. 1, 130–137 (2011).
Article Google Scholar
Chen, L.-I., Zhao, Y., Zhang, J. & Zou, J.-Z. Automatic detection of alertness/drowsiness from physiological signals using wavelet-based nonlinear features and machine learning. Expert Syst. Appl. 42, 7344–7355. https://doi.org/10.1016/j.eswa.2015.05.028 (2015).
Article Google Scholar
Venkata-Phanikrishna, B. & Suchismitha, C. Automatic classification methods for detecting drowsiness using wavelet packet transform extracted time-domain features from single-channel EEG signal. J. Neurosci. Methods 347, 108927. https://doi.org/10.1016/j.jneumeth.2020.108927 (2021).
Article Google Scholar
Bajaj, V., Taran, S., Khare, S. K. & Sengur, A. Feature extraction method for classification of alertness and drowsiness states EEG signals. Appl. Acoust. 163, 107224. https://doi.org/10.1016/j.apacoust.2020.107224 (2020).
Article Google Scholar
Yeo, M. V., Li, X., Shen, K. & Wilder-Smith, E. P. Can SVM be used for automatic EEG detection of drowsiness during car driving?. Saf. Sci. 47, 115–124. https://doi.org/10.1016/j.ssci.2008.01.007 (2009).
Article Google Scholar
Hsu, S.-H. & Jung, T.-P. Monitoring alert and drowsy states by modeling EEG source nonstationarity. J Neural Eng. 14, 056012. https://doi.org/10.1088/1741-2552/aa7a25 (2017).
Article PubMed Google Scholar
Awais, M., Badruddin, N. & Drieberg, M. Driver drowsiness detection using eeg power spectrum analysis. In IEEE Region 10 symposium, 2014, 244–247, https://doi.org/10.1109/TENCONSpring.2014.6863035 (IEEE, Piscataway, NJ, 2014).
Budak, U., Bajaj, V., Akbulut, Y., Atila, O. & Sengur, A. An effective hybrid model for EEG-based drowsiness detection. IEEE Sens. J. 19, 7624–7631. https://doi.org/10.1109/JSEN.2019.2917850 (2019).
Article ADS Google Scholar
Li, G., Lee, B.-L. & Chung, W.-Y. Smartwatch-based wearable EEG system for driver drowsiness detection. IEEE Sens. J. 15, 7169–7180. https://doi.org/10.1109/JSEN.2015.2473679 (2015).
Article ADS Google Scholar
Dinges, D. & Perclos, R. G. a valid psychophysiological measure of alertness as assessed by psychomotor vigilance. report no. FHWA-MCRT-98-006. https://doi.org/10.21949/1502740 (1998).
Hanowski, R., Bowman, D., Alden, A., Wierwille, W. & Carroll, R. Perclos+: Development of a robust field measure of driver drowsiness. In 15th World Congress on Intelligent Transport Systems and ITS America's 2008 Annual Meeting, 2008, New York NY, United States.
Dmochowski, Jacek P., Ki, Jason J., DeGuzman, Paul, Sajda, Paul & Parra, Lucas C. Extracting multidimensional stimulus-response correlations using hybrid encoding-decoding of neural activity. NeuroImage 180, 134–146. https://doi.org/10.1016/j.neuroimage.2017.05.037 (2018).
Article PubMed Google Scholar
Holdgraf, C. R. et al. Encoding and decoding models in cognitive electrophysiology. Front. Syst. Neurosci. 11, 61. https://doi.org/10.3389/fnsys.2017.00061 (2017).
Article PubMed PubMed Central Google Scholar
Naselaris, T., Prenger, R. J., Kay, K. N., Oliver, M. & Gallant, J. L. Bayesian reconstruction of natural images from human brain activity. Neuron 63, 902–915. https://doi.org/10.1016/j.neuron.2009.09.006 (2009).
Article CAS PubMed PubMed Central Google Scholar
Yousefi, A. et al. Decoding hidden cognitive states from behavior and physiology using a bayesian approach. Neural Comput. 31, 1751–1788. https://doi.org/10.1162/neco_a_01196 (2019).
Article MathSciNet PubMed MATH Google Scholar
Schinko, C. et al. Building a Driving Simulator with Parallax Barrier Displays. In Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - GRAPP, Rome, Italy, https://doi.org/10.5220/0005711302810289 (2016).
Devos, H. et al. Real-time assessment of daytime sleepiness in drivers with multiple sclerosis. Mult. Scler. Related Disord. 47, 102607. https://doi.org/10.1016/j.msard.2020.102607 (2021).
Article Google Scholar
Zhang, D.-X., Wu, X.-P. & Guo, X.-j. The eeg signal preprocessing based on empirical mode decomposition. In 2008 2nd International Conference on Bioinformatics and Biomedical Engineering, 2131–2134, https://doi.org/10.1109/ICBBE.2008.862 (IEEE, 5/16/2008 - 5/18/2008).
Muñoz-Gutiérrez, P. A., Giraldo, E., Bueno-López, M. & Molinas, M. Localization of active brain sources from EEG signals using empirical mode decomposition: A comparative study. Front. Integr. Neurosci. 12, 55. https://doi.org/10.3389/fnint.2018.00055 (2018).
Article PubMed PubMed Central Google Scholar
Estrada, E., Nazeran, H., Sierra, G., Ebrahimi, F. & Setarehdan, S. K. Wavelet-based eeg denoising for automatic sleep stage classification. In CONIELECOMP 2011, 21st International Conference on Electrical Communications and Computers, 295–298, https://doi.org/10.1109/CONIELECOMP.2011.5749325 (IEEE, 2/28/2011 - 3/2/2011).
Mamun, M., Al-Kadi, M. & Marufuzzaman, M. Effectiveness of wavelet denoising on electroencephalogram signals. J. Appl. Res. Technol. 11, 156–160. https://doi.org/10.1016/S1665-6423(13)71524-4 (2013).
Article Google Scholar
Turnip, A. & Pardede, J. Artefacts removal of EEG signals with wavelet denoising. MATEC Web Conf. 135, 00058. https://doi.org/10.1051/matecconf/201713500058 (2017).
Article Google Scholar
Hyvärinen, A. & Oja, E. Independent component analysis: Algorithms and applications. Neural Netw. 13, 411–430. https://doi.org/10.1016/S0893-6080(00)00026-5 (2000).
Article PubMed Google Scholar
Yosrita, E., Nur Aziza, R., Farah Ningrum, R. & Muhammad, G. Denoising of eeg signal based on word imagination using ica for artifact and noise removal on unspoken speech. Indones. J. Electr. Eng. Comput. Sci. 22, 83. https://doi.org/10.11591/ijeecs.v22.i1.pp83-88 (2021).
Article Google Scholar
Albera, L. et al. Ica-based EEG denoising: A comparative analysis of fifteen methods. Bull. Polish Acad. Sci. Tech. Sci. 60, 407–418. https://doi.org/10.2478/v10175-012-0052-3 (2012).
Article Google Scholar
Vigario, R., Sarela, J., Jousmiki, V., Hamalainen, M. & Oja, E. Independent component approach to the analysis of EEG and MEG recordings. IEEE Trans. Biomed. Eng. 47, 589–593. https://doi.org/10.1109/10.841330 (2000).
Article CAS PubMed Google Scholar
Winkler, I., Haufe, S. & Tangermann, M. Automatic classification of artifactual ICA-components for artifact removal in EEG signals. Behav. Brain Funct. BBF 7, 30. https://doi.org/10.1186/1744-9081-7-30 (2011).
Article PubMed Google Scholar
Delorme, A. & Makeig, S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. https://doi.org/10.1016/j.jneumeth.2003.10.009 (2004).
Article PubMed Google Scholar
Junaedi, S. & Akbar, H. Driver drowsiness detection based on face feature and PERCLOS. J. Phys. Conf. Ser. 1090, 012037. https://doi.org/10.1088/1742-6596/1090/1/012037 (2018).
Article Google Scholar
Sommer, D. & Golz, M. Evaluation of PERCLOS based current fatigue monitoring technologies. In 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, 4456–4459, https://doi.org/10.1109/IEMBS.2010.5625960 (IEEE, 8/31/2010–9/4/2010).
Särkkä, S. Bayesian filtering and smoothing Vol. 3, https://doi.org/10.1017/CBO9781139344203 (Cambridge University Press, 2013).
Book Google Scholar
James, G., Witten, D., Hastie, T. & Tibshirani, R. Linear regression. In James, G., Witten, D., Hastie, T. & Tibshirani, R. (eds.) An Introduction to Statistical Learning, vol. 103 of Springer Texts in Statistics, 59–126, https://doi.org/10.1007/978-1-4614-7138-7_3 (Springer 2013).
Sikdar, Debdeep, Roy, Rinku & Mahadevappa, Manjunatha. Epilepsy and seizure characterisation by multifractal analysis of EEG subbands. Biomed. Signal Process. Control 41, 264–270. https://doi.org/10.1016/j.bspc.2017.12.006 (2018).
Article Google Scholar
Hamad, A., Houssein, E. H., Hassanien, A. E. & Fahmy, A. A. Feature extraction of epilepsy eeg using discrete wavelet transform. In 2016 12th International Computer Engineering Conference (ICENCO), 190–195, https://doi.org/10.1109/ICENCO.2016.7856467 (IEEE, 12/28/2016 - 12/29/2016).
Belakhdar, I., Kaaniche, W., Djemal, R. & Ouni, B. Single-channel-based automatic drowsiness detection architecture with a reduced number of EEG features. Microprocess. Microsyst. 58, 13–23. https://doi.org/10.1016/j.micpro.2018.02.004 (2018).
Article Google Scholar
Amin, H. U., Mumtaz, W., Subhani, A. R., Saad, M. N. M. & Malik, A. S. Classification of EEG signals based on pattern recognition approach. Front. Comput. Neurosci. 11, 103. https://doi.org/10.3389/fncom.2017.00103 (2017).
Article PubMed PubMed Central Google Scholar
Yu, S. et al. Support vector machine based detection of drowsiness using minimum EEG features. In 2013 International Conference on Social Computing, 827–835, https://doi.org/10.1109/SocialCom.2013.124 (2013).
Srivastava, R. C. Estimation of probability density function based on random number of observations with applications. Int. Stat. Rev. Revue Internationale de Statistique 41, 77. https://doi.org/10.2307/1402789 (1973).
Article MATH Google Scholar
Zhang, Y., Owechko, Y. & Zhang, J. Driver cognitive workload estimation: A data-driven perspective. In ITSC 642–647, 2004. https://doi.org/10.1109/ITSC.2004.1398976 (2004) ((IEEE, Piscataway, N.J).
Xing, Y., Lv, C., Cao, D., Wang, H. & Zhao, Y. Driver workload estimation using a novel hybrid method of error reduction ratio causality and support vector machine. Measurement 114, 390–397. https://doi.org/10.1016/j.measurement.2017.10.002 (2018).
Article ADS Google Scholar
Faure, V., Lobjois, R. & Benguigui, N. The effects of driving environment complexity and dual tasking on drivers’ mental workload and eye blink behavior. Transp. Res. Part F Traffic Psychol. Behav. 40, 78–90. https://doi.org/10.1016/j.trf.2016.04.007 (2016).
Article Google Scholar
Baldauf, D., Burgard, E. & Wittmann, M. Time perception as a workload measure in simulated car driving. Appl. Erg. 40, 929–935. https://doi.org/10.1016/j.apergo.2009.01.004 (2009).
Article Google Scholar
Argyle, E. M., Marinescu, A., Wilson, M. L., Lawson, G. & Sharples, S. Physiological indicators of task demand, fatigue, and cognition in future digital manufacturing environments. Int. J. Human-Comput. Stud. 145, 102522. https://doi.org/10.1016/j.ijhcs.2020.102522 (2021).
Article Google Scholar
Mane, R. et al. Prognostic and monitory EEG-biomarkers for BCI upper-limb stroke rehabilitation. IEEE Trans. Neural Syst. Rehabil. Eng. 27, 1654–1664. https://doi.org/10.1109/TNSRE.2019.2924742 (2019).
Article PubMed Google Scholar
Burelo, K. A neuromorphic brain-computer interface for real-time detection of a new biomarker for epilepsy surgery. In Guger, C., Allison, B. Z. & Gunduz, A. (eds.) Brain-Computer Interface Research, SpringerBriefs in Electrical and Computer Engineering, 111–117, https://doi.org/10.1007/978-3-030-79287-9_12 (Springer, 2021).
Engel, J. & Pitkänen, A. Biomarkers for epileptogenesis and its treatment. Neuropharmacology 167, 107735. https://doi.org/10.1016/j.neuropharm.2019.107735 (2020).
Article CAS PubMed Google Scholar
Serrallach, B. et al. Neural Biomarkers for Dyslexia, ADHD, and ADD in the Auditory Cortex of Children. Front. Neurosci. 10, 324. https://doi.org/10.3389/fnins.2016.00324 (2016).
Article PubMed PubMed Central Google Scholar
Lee, B.-L., Lee, B. G., Li, G., & Chung, W.-Y. (eds.). Wearable Driver Drowsiness Detection System Based on Smartwatch, vol. 15 (2014).
Chaabene, S., Bouaziz, B., Boudaya, A., Hökelmann, A., Ammar, A. & Chaari, L. Convolutional neural network for drowsiness detection using eeg signals. Sensors (Basel, Switzerland) 21, 1734, https://doi.org/10.3390/s21051734 (2021).
Jeong, J.-H., Yu, B.-W., Lee, D.-H. & Lee, S.-W. Classification of drowsiness levels based on a deep spatio-temporal convolutional bidirectional ISTM network using electroencephalography signals. Brain Sci. 9, 348. https://doi.org/10.3390/brainsci9120348 (2019).
Article PubMed Central Google Scholar
Rezaei, M. R., Hadjinicolaou, A. E., Cash, S. S., Eden, U. T. & Yousefi, A. Bayesian decoder models with a discriminative observation process. bioRxivhttps://doi.org/10.1101/2020.07.11.198564 (2020).
Sabbagh, M. A., Moulson, M. C. & Harkness, K. L. Neural correlates of mental state decoding in human adults: An event-related potential study. J. Cognit. Neurosci. 16, 415–426. https://doi.org/10.1162/089892904322926755 (2004).
Article Google Scholar
Rundo, F. et al. An innovative deep learning algorithm for drowsiness detection from EEG signal. Computation 7, 13. https://doi.org/10.3390/computation7010013 (2019).
Article Google Scholar
Chaabene, S. et al. Convolutional neural network for drowsiness detection using EEG signals. Sensors 21, 1734. https://doi.org/10.3390/s21051734 (2021).
Article ADS PubMed PubMed Central Google Scholar
Zhu, M. et al. Vehicle driver drowsiness detection method using wearable EEG based on convolution neural network. Neural Comput. Appl. 33, 1–16. https://doi.org/10.1007/s00521-021-06038-y (2021).
Article Google Scholar
Hu, J. Comparison of different features and classifiers for driver fatigue detection based on a single eeg channel. Comput. Math. Methods Med. https://doi.org/10.1155/2017/5109530 (2017).
Article PubMed PubMed Central Google Scholar
Zeng, H. et al. EEG classification of driver mental states by deep learning. Cognit. Neurodyn. 12, 597–606. https://doi.org/10.1007/s11571-018-9496-y (2018).
Article Google Scholar
Chai, R. et al. Improving EEG-based driver fatigue classification using sparse-deep belief networks. Front. Neurosci. 11, 103. https://doi.org/10.3389/fnins.2017.00103 (2017).
Article PubMed PubMed Central Google Scholar
Vicente, J., Laguna, P., Bartra, A. & Bailón, R. Drowsiness detection using heart rate variability. Med. Biol. Eng. Comput. 54, 927–937. https://doi.org/10.1007/s11517-015-1448-7 (2016).
Article PubMed Google Scholar
Cabral, J., Kringelbach, M. L. & Deco, G. Exploring the network dynamics underlying brain activity during rest. Progress Neurobiol. 114, 102–131. https://doi.org/10.1016/j.pneurobio.2013.12.005 (2014).
Article Google Scholar
Leopold, D. A., Murayama, Y. & Logothetis, N. K. Very slow activity fluctuations in monkey visual cortex: implications for functional brain imaging. Cerebral Cortex (New York, N.Y. : 1991) 13, 422–433, https://doi.org/10.1093/cercor/13.4.422 (2003).
Salvador, R., Suckling, J., Schwarzbauer, C. & Bullmore, E. Undirected graphs of frequency-dependent functional connectivity in whole brain networks. Philos. Trans. R. Soc. London Ser. B Biol. Sci. 360, 937–946, https://doi.org/10.1098/rstb.2005.1645 (2005).
Bowyer, S. M. Coherence a measure of the brain networks: Past and present. Neuropsychiatr. Electrophysiol. https://doi.org/10.1186/s40810-015-0015-7 (2016).
Article Google Scholar
Guevara, M. A. & Corsi-Cabrera, M. EEG coherence or EEG correlation?. Int. J. Psychophysiol. 23, 145–153. https://doi.org/10.1016/S0167-8760(96)00038-4 (1996).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This study is based on data from the WACHSens project. The WACHsens project was carried out by Human Research Institut für Gesundheitstechnologie und Präventionsforschung GmbH, Graz University of Technology, AVL Powertrain UK Limited, and Factum apptec ventures GmbH and was co-funded by the Austrian Research Promotion Agency (FFG) via the program of mobility of the future (grant no. 860875). The authors are indebted to all drivers who participated in the experiment, to the experimenters, and the many people who helped to set up the tests. The authors would also like to thank Peter Brunner, PhD for introducing the Neurable team to its Austrian collaborators. This paper is supported by TU Graz Open Access Publishing Fund.

Author information

James Hamet
Present address: Vistim Labs Company, Salt Lake City, UT, 84103, USA

Authors and Affiliations

Institute of Automotive Engineering, Graz University of Technology, 8010, Graz, Austria
Sadegh Arefnezhad & Arno Eichberger
Neurable Company, Boston, MA, 02108, USA
James Hamet & Ali Yousefi
Human Research Institute, Weiz, 8160, Austria
Matthias Frühwirth & Maximilian Moser
Institute of Psychology, University of Graz, 8010, Graz, Austria
Anja Ischebeck
Institute of Engineering and Business Informatics, Graz University of Technology, Graz, 8010, Austria
Ioana Victoria Koglbauer
Department of Computer Science Worcester Polytechnic Institute, 100 Institute Road, MA, 01609, Worcester, USA
Ali Yousefi
Chair of Department of Physiology, Medical University of Graz, 8036, Graz, Austria
Maximilian Moser

Authors

Sadegh Arefnezhad
View author publications
You can also search for this author in PubMed Google Scholar
James Hamet
View author publications
You can also search for this author in PubMed Google Scholar
Arno Eichberger
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Frühwirth
View author publications
You can also search for this author in PubMed Google Scholar
Anja Ischebeck
View author publications
You can also search for this author in PubMed Google Scholar
Ioana Victoria Koglbauer
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Moser
View author publications
You can also search for this author in PubMed Google Scholar
Ali Yousefi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.A. wrote the first draft of the manuscript and programmed the framework. J.H. prepared the first results of the framework. A.E. supervised the WACHSens project to collect the dataset. M.F., I.V.K. and M.M. designed the driving tests’ procedure. A.I. helped to preprocess the EEG channels. A.Y. developed the main idea of the proposed framework. All of the authors edited the submitted version of the manuscript.

Corresponding author

Correspondence to Sadegh Arefnezhad.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Arefnezhad, S., Hamet, J., Eichberger, A. et al. Driver drowsiness estimation using EEG signals with a dynamical encoder–decoder modeling framework. Sci Rep 12, 2650 (2022). https://doi.org/10.1038/s41598-022-05810-x

Download citation

Received: 03 June 2021
Accepted: 14 January 2022
Published: 16 February 2022
DOI: https://doi.org/10.1038/s41598-022-05810-x

This article is cited by

Studying Drowsiness Detection Performance While Driving Through Scalable Machine Learning Models Using Electroencephalography
- José Manuel Hidalgo Rogel
- Enrique Tomás Martínez Beltrán
- Alberto Huertas Celdrán
Cognitive Computation (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.