Introduction

Extreme events are rare, abnormal deviations of a system’s behavior from its typical state1,2,3. From the statistical perspective, extreme events occupy the tails of a probability distribution. In the absence of extreme events, the probability distribution typically has a Gaussian shape with exponential tails. In the presence of extreme events, these tails become heavy and follow a power law with a fixed exponent, \(\sim p^{-\alpha }\), \(\alpha >0\). Extreme events (severe rains, economic crises, tsunamis) occasionally occur in real-world systems and harm human life. Therefore, an important task is to understand the mechanisms by which these events develop in order to predict their occurrence in the future4,5,6,7.

In a time series, extreme events correspond to outliers. An outlier is a general term in statistics and data science for a data point that differs significantly from other observations. Outliers can occur by chance in any data, but they may also indicate that a system has a heavy-tailed distribution. Outliers reduce the explanatory power of traditional data-driven modeling methods; therefore, an essential part of data analysis is detecting outliers and removing them from the data8.

Recently, scientists have started considering spontaneous synchronous activity in neuronal populations as extreme events, with epileptic seizures as the most prominent example3,9,10,11. In previous studies, we found signs of extreme behavior in epileptic electroencephalograms (EEG). First, we revealed that the onsets of seizures in rats demonstrate on-off intermittency dynamics, similar to extreme events12,13,14. Then, we reported heavy tails in the probability distributions of the spectral power of epileptic EEG15. Finally, we considered the EEG of patients with epilepsy and revealed that the probability distribution of the spectral power in the 1–5 Hz frequency range also has a heavy tail, manifesting extreme behavior16. Thus, epileptic seizures in humans are extreme events that cause outliers in the time series of the 1–5 Hz EEG spectral power. The connection between extreme events and outliers opens the way for applying a vast repertory of outlier detection techniques to seizure detection.

According to the International League Against Epilepsy and the International Federation of Clinical Neurophysiology, there is a need for automated seizure detection to decrease seizure-related morbidity and mortality and to provide objective seizure identification and quantification17. The authors of that report concentrated on wearable devices for seizure detection on an outpatient basis. In contrast, we focus on seizure identification in hospital settings, which also plays a crucial role in epilepsy diagnostics and treatment. Treatment strategies usually start with analyzing brain activity during seizures to reveal their type and onset mechanisms. This approach requires collecting data for a representative number of events, which is only possible during continuous EEG monitoring over several days. Based on 248 patients, Friedman and Hirsch concluded that it is common to require more than three days in an epilepsy monitoring unit to record and diagnose the nature of paroxysmal episodes, and not rare to require more than a week18. A large part of the subsequent analysis is searching for seizures in these recordings. An experienced specialist spends hours reviewing the data of a single patient and needs assistance from automated seizure detection systems. While fully automated diagnostics remain a matter of discussion, a working solution is partial automation, known as a clinical decision support system (CDSS)19. In a CDSS, the computer analyses the data and provides recommendations, and the medical expert makes the final decision.

A promising approach to seizure detection is machine learning (ML)20. For ML, seizure detection comes down to classifying EEG data into two classes, seizures (class 1) and non-seizures (class 2)21,22. The classification requires ML algorithms to be trained on training data. This approach, supervised classification, suffers from class imbalance and overfitting. The class imbalance comes from the rare nature of seizures and requires artificially balancing the number of class 1 and class 2 examples in the training data. A promising way to manage the imbalance is constructing a feature space that yields a large distance between the classes. Modern deep learning algorithms handle this well but suffer from overfitting. Overfitting implies that the algorithm performs satisfactorily on the training data but fails on the data of other patients. Addressing this issue relies on selecting a subspace of features containing seizure biomarkers common to most patients. This reasoning leads us to the problem of feature interpretability, which often arises in machine learning.

We propose using an outlier detection ML technique, the one-class SVM, to overcome these issues. First, this method incorporates the rare nature of seizures, hence handling the class imbalance problem. Second, it is unsupervised and can be trained and applied on the data of a single patient. Finally, we feed the algorithm with an interpretable feature (EEG power in the 1–5 Hz frequency range) that demonstrates extreme behavior during seizures. We suppose that interpretability gives us prior knowledge about the ability of the algorithm to handle the data of a given patient, hence allowing us to assess its credibility.

Methods

Dataset

The experimental dataset under study was provided by the National Medical and Surgical Center named after N. I. Pirogov of the Russian Healthcare Ministry (Moscow, Russia). All medical procedures were performed in the Center in accordance with the Helsinki Declaration and the Center’s medical regulations and were approved by the local ethics committee. All patients provided written informed consent before participation. The dataset includes anonymized long-term monitoring data of patients in the Department of Neurology and Clinical Neurophysiology between 2017 and 2019. The data were collected during routine medical procedures aimed at registration of epileptic activity and verification of epileptogenic zones for further clinical treatment. Continuous EEG and video monitoring during everyday activity, including sleep and wakefulness, was performed for these patients. During the monitoring, patients kept a regular daily routine with occasional physiological trials (such as photostimulation and hyperventilation) that are standard for this type of research23,24. The length of recording varied from 8 to 84 h, depending on the patient’s condition and the number of epileptiform activity episodes needed for a proper diagnosis. Each patient had from one to five epileptic seizures during the monitoring. None of the seizures was triggered by photostimulation or hyperventilation, i.e., all epileptic seizures were spontaneous. In this work, we use a dataset containing the data of 83 patients diagnosed with focal epilepsy. Epileptic foci were found in frontal, temporal, or parietal areas of the left, right, or both hemispheres, so the dataset was not homogeneous in terms of epileptic focus localization.

Data acquisition and preprocessing

A “Micromed” encephalograph (Micromed S.p.A., Italy) was used for EEG recording. EEG signals were recorded from 25 channels according to the international “10–20” system, with a ground electrode placed on the forehead and reference electrodes placed at the ears, at a sampling rate of 128 Hz. A video monitoring system was used to monitor the patients’ states for easier data analysis and segmentation. The experimental EEG data and video recordings were examined and interpreted by an experienced neurophysiologist. This analysis resulted in a marking that includes information on episodes of epileptic seizures and physiological tests.

EEG signals are known to be highly susceptible to various external and internal noises, especially during prolonged recording25. In clinical monitoring, external noises usually come from hardware-related problems (poor contact of EEG electrodes, loose wires), the power network, cellphone calls, etc. Internal noises (or physiological artifacts) originate from physiological processes such as heartbeat, blinking, or breathing26. The basic way to deal with noises and artifacts in EEG is data filtration: a high-pass filter is used to suppress low-frequency components (stray effects, cardiac rhythm, breathing artifacts), while a low-pass filter deals with high-frequency activity, commonly associated with muscle artifacts. In our study, we used a band-pass filter with cutoff frequencies of 1 Hz and 60 Hz and a 50-Hz notch filter.

We considered the frequency band 1–30 Hz, which is commonly regarded as an effective frequency range of epileptic EEG. However, some undesired activity (e.g., eye-movement artifacts) can interfere with the EEG activity in this range. We used the standard procedure based on an independent component analysis (ICA) to remove these artifacts27.

Time-frequency analysis

We performed a time-frequency analysis of EEG signals using the continuous wavelet transform (CWT) with the Morlet mother wavelet function28. CWT has proven itself a powerful instrument for the analysis of complex nonstationary signals in systems of different nature, including biological ones29. We considered the wavelet power (WP):

$$\begin{aligned} W_n(f,t)=|w_n(f,t)|, \end{aligned}$$
(1)

where \(n = 1, 2, \ldots, N\) is the EEG channel index (\(N = 25\) for the considered dataset), f and t are the frequency and the time point, and \(w_n(f,t)\) are the coefficients of the CWT. WP is one of the common CWT-based characteristics used to describe the time-frequency structure of a signal30.
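To make Eq. (1) concrete, the following minimal sketch (an illustration, not the code used in the study) estimates WP with the PyWavelets implementation of the complex Morlet CWT; the wavelet parameters encoded in the name "cmor1.5-1.0" and the 0.5-Hz frequency step are assumptions.

```python
import numpy as np
import pywt

FS = 128.0                         # sampling rate of the dataset, Hz
FREQS = np.arange(1.0, 30.5, 0.5)  # 1-30 Hz grid (step size is an assumption)
WAVELET = "cmor1.5-1.0"            # complex Morlet; parameters are assumptions

def wavelet_power(eeg, fs=FS, freqs=FREQS, wavelet=WAVELET):
    """Eq. (1): W_n(f, t) = |w_n(f, t)| for a (n_channels, n_samples) EEG array."""
    # Convert the target frequencies to CWT scales: scale = f_center * fs / f
    scales = pywt.central_frequency(wavelet) * fs / freqs
    wp = np.empty((eeg.shape[0], len(freqs), eeg.shape[1]))
    for n, channel in enumerate(eeg):
        coefs, _ = pywt.cwt(channel, scales, wavelet, sampling_period=1.0 / fs)
        wp[n] = np.abs(coefs)      # modulus of the CWT coefficients
    return wp, freqs
```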

CWT can be demanding in terms of computational costs: applying CWT in the frequency range of interest to several days of multichannel EEG data, even at a low sampling rate, generates an immense amount of WP data. Such a big dataset is difficult to compute, store, and analyze, so approaches to reduce the data complexity are required.

The first step is averaging WP over the frequency band of interest. Initially, WP was calculated in the 1–30 Hz frequency range, as it is acceptable for both normal and epileptic EEG31. Earlier, in our paper15, we applied extreme event theory to analyze electrocorticogram (ECoG) recordings of WAG/Rij rats with a genetic predisposition to absence epilepsy. We reported that absence seizures induced a drastic increase of WP in the 6–8 Hz frequency range, and that, after averaging WP over the 6–8 Hz range, the WP distribution is best fitted with the Weibull function. The WP time series demonstrated extreme-event-related properties in this frequency range, while there were no manifestations of extreme behavior for other frequencies. We conducted similar research in our paper16 for epilepsy in human patients and showed that WP averaged over the 2–5 Hz frequency range demonstrates extreme-like behavior during epileptic seizures. So, we chose \(F \in [2;5]\) Hz as the frequency band of interest in the present study.

The second step is averaging WP over the 25 EEG channels. This step can be justified by the features of epilepsy: in generalized seizures, activity arises suddenly all over the brain, and all EEG signals are highly correlated32. In focal seizures, only a few EEG channels near the focus demonstrate pronounced epileptic activity; however, these channels stand out in terms of EEG signal amplitude and frequency structure, so even after averaging over the channels, the WPs for normal and pathological activity differ drastically.

Thus, we calculated the WP averaged over the frequency band \(F \in [2;5]\) Hz and over the \(N=25\) EEG channels (AWP) as:

$$\begin{aligned} E(t)=\frac{1}{N}\sum _{n=1}^{N} \frac{1}{\Delta F} \int \limits _{f \in F} W_n(f,t)df, \end{aligned}$$
(2)

where \(\Delta F=3\) Hz is the width of the frequency band \(F \in [2;5]\) Hz.
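A minimal sketch of Eq. (2), assuming a WP array and frequency grid of the form produced above; the integral over F is approximated by a mean over the uniform frequency grid.

```python
import numpy as np

def awp(wp, freqs, band=(2.0, 5.0)):
    """Eq. (2): average WP over the 2-5 Hz band and over the N EEG channels.
    wp: (n_channels, n_freqs, n_samples) array; freqs: (n_freqs,) array in Hz."""
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    band_mean = wp[:, in_band, :].mean(axis=1)  # (1/dF) * integral over f in F, discretized
    return band_mean.mean(axis=0)               # (1/N) * sum over channels -> E(t)
```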

We additionally applied “downsampling” to the AWP to decrease the complexity of the data. We divided each EEG recording into 60-s intervals \(T_m\), where \(m = 1, 2, \ldots, M\), \(M = L//60\), L is the length of the EEG recording in seconds, and “//” stands for integer division. The choice of this interval length is justified by the average duration of an epileptic seizure, from 30 to 120 s33. AWP values were calculated for each time interval m and averaged over the whole length of the interval to obtain the downsampled AWP (DAWP):

$$\begin{aligned} e_m=\frac{1}{\Delta T} \int _{t \in T_m} E(t)dt, \end{aligned}$$
(3)

where \(\Delta T\) is the length of each interval \(T_m\) (\(\Delta T = 60\) s).
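A sketch of the downsampling step in Eq. (3); the trailing samples left over by the integer division are discarded, as implied by \(M = L//60\).

```python
import numpy as np

def dawp(E, fs=128.0, interval_s=60):
    """Eq. (3): average E(t) over consecutive 60-s intervals T_m."""
    samples_per_interval = int(interval_s * fs)
    M = len(E) // samples_per_interval                 # number of whole 60-s intervals
    trimmed = E[:M * samples_per_interval]
    return trimmed.reshape(M, samples_per_interval).mean(axis=1)
```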

Probability density function

To construct a probability density function (PDF), we normalized E(t) and \(e_m\) by subtracting the corresponding global minimum: \(E(t)^{norm} = E(t) - E_{min}\), \(e_m^{norm} = e_m - e_{min}\). For \(E(t)^{norm}\), we additionally extracted all local maxima. According to the Fisher–Tippett–Gnedenko theorem34, if the obtained PDF is described by a Gumbel, Fréchet, or Weibull distribution, then the process is characterized by extreme event properties. Our previous studies on epilepsy in WAG/Rij rats15 and human patients16 suggest that the Weibull distribution is the most suitable for this task. Thus, in the present work, the PDFs of the EEG data were fitted by the Weibull distribution35:

$$\begin{aligned} f(x,c)=c\left( \frac{x-l}{s} \right) ^{c-1}\exp \left( -\left( \frac{x-l}{s} \right) ^c \right) , \end{aligned}$$
(4)

where c is the shape parameter of the Weibull law (\(x>0\), \(c>0\)), l is the localization parameter that shifts the distribution with fixed shape parameter c along the WP axis, and s is the scale parameter responsible for the expansion/compression of the initial distribution.

In our previous papers15,16, we showed that the experimental PDFs for normal and epileptic activity are fitted by Weibull distributions with drastically different parameters, so it can be problematic to properly fit the whole dataset with a single Weibull distribution. Figure 1 illustrates this issue, showing the PDF of \(E(t)^{norm}\) for one subject as the blue histogram and the fitted Weibull distribution as the red line. It is clear from Fig. 1A that the tail of the Weibull distribution is poorly fitted in this case. In the present study, we are mainly interested in epileptic extreme events, which obviously lie within the tail of the distribution. With this in mind, we applied additional preprocessing before fitting the experimental data with the Weibull distribution. We have shown in Refs.15,36 that the cross point between the normal and extreme behavior distributions can be found using the Pickands–Balkema–de Haan theorem and assessed empirically. In the present study, we propose another approach for separating the “normal” and “extreme” distributions. We calculated the 95th percentile of the analyzed data and divided the whole dataset into two subsets: above the 95th percentile (shown as the blue histogram in Fig. 1A) and below the 95th percentile (shown as the dark blue histogram in Fig. 1A). We suggest that the 5% of data above the 95th percentile should mainly consist of extreme events, while the remaining 95% below the 95th percentile should contain the normal data. We selected the above-95th-percentile subset and fitted it with the Weibull distribution; Fig. 1B shows that the fitting becomes more accurate.
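The percentile-based separation and the tail fit can be sketched as follows (a minimal illustration, assuming a one-dimensional array of normalized AWP values):

```python
import numpy as np
from scipy import stats

def fit_tail_weibull(e_norm, percentile=95):
    """Split the normalized data at the 95th percentile and fit the upper subset
    with the Weibull distribution of Eq. (4) by maximum likelihood."""
    threshold = np.percentile(e_norm, percentile)
    tail = e_norm[e_norm > threshold]            # "extreme" subset (top 5% of values)
    c, loc, scale = stats.weibull_min.fit(tail)  # estimates of c, l, and s
    return c, loc, scale, threshold
```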

Figure 1

PDF of \(E(t)^{norm}\) for one subject: (A) the whole dataset and (B) the above-95th-percentile subset; the dark blue histogram shows the below-95th-percentile subset, the blue histogram the above-95th-percentile subset; the Weibull approximations are shown as red lines.

Following Ref.15, we used the chi-square goodness-of-fit test to verify that the Weibull distribution is appropriate for fitting the studied experimental data. The chi-square goodness-of-fit test determines whether a data sample comes from a specified probability distribution, with parameters estimated from the data. The test groups the data into bins, calculates the observed and expected counts for those bins, and computes the chi-square test statistic:

$$\begin{aligned} \chi ^2=\sum _{h=1}^H (O_h-E_h)^2/E_h, \end{aligned}$$
(5)

where \(O_h\) are the observed counts, \(E_h\) are the expected counts based on the hypothesized distribution, and \(h=1,2,\ldots,H\) indexes the bins of the PDF.

To fit the experimental PDFs with the Weibull distribution and perform the goodness-of-fit test, we used the NumPy and SciPy libraries for Python, namely, the statistical functions of the scipy.stats module. The function stats.weibull_min.fit was applied to the PDF data; it returns maximum likelihood estimates of the c, l, and s parameters of the Weibull distribution. The function numpy.percentile was used to calculate the 95th percentile, and stats.chisquare was used to perform the one-way chi-square test.
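A sketch of the goodness-of-fit step using the functions named above; the number of histogram bins is an assumption.

```python
import numpy as np
from scipy import stats

def weibull_chi_square(data, c, loc, scale, bins=20):
    """Eq. (5): chi-square statistic comparing observed bin counts with the counts
    expected under the fitted Weibull distribution."""
    observed, edges = np.histogram(data, bins=bins)
    expected = len(data) * np.diff(stats.weibull_min.cdf(edges, c, loc=loc, scale=scale))
    expected *= observed.sum() / expected.sum()   # stats.chisquare needs matching totals
    return stats.chisquare(observed, expected)    # returns (statistic, p-value)
```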

Machine-learning algorithm

One of the most common unsupervised ML algorithms is the one-class SVM37,38,39. In this work, we address the aforementioned issues by proposing a new approach that uses a “non-black-box” unsupervised ML algorithm, the one-class SVM, fed with features extracted using fundamental knowledge about human seizures. We used an SVM with a radial basis function kernel and a standardized predictor. The Iterative Single Data Algorithm (ISDA)40 was used for classifier optimization.
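For illustration, a minimal scikit-learn sketch of such a classifier is given below. It is only an approximation of the setup described here: scikit-learn’s OneClassSVM uses an SMO-type solver rather than ISDA, and the expected outlier fraction is expressed through the nu parameter.

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM

# One-class SVM with an RBF kernel applied to the standardized DAWP feature.
# nu approximates the expected proportion of outliers (0.5% here; see the threshold below).
one_class_svm = make_pipeline(
    StandardScaler(),
    OneClassSVM(kernel="rbf", nu=0.005),
)
```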

The overall framework of the ML algorithm is illustrated in Fig. 2, while Fig. 3 depicts examples of data obtained at certain steps of the algorithm. The first step of the algorithm takes raw EEG data as input and performs preprocessing. This includes filtration and ICA-based artifact removal, as well as segmentation of the EEG data into 60-s intervals; the result of the preprocessing step is shown in Fig. 3A. The preprocessed data then serve as input for the feature extraction step. The features for the SVM algorithm were extracted according to the results of the time-frequency analysis of human epileptic EEG. During the feature extraction procedure, based on expert-level domain knowledge, the following actions were performed:

  1. CWT was applied to the EEG data intervals, and WP was computed in the 1–30 Hz frequency range;

  2. WP was averaged over:

     (a) the 2–5 Hz frequency band of interest;

     (b) the 25 EEG channels;

     (c) the length of each time interval (60 s).

Figure 3B illustrates the result of feature extraction.

Figure 2

The framework of the ML algorithm. Rectangular frames depict the steps of data analysis; oval frames correspond to the input/output data for each step.

Figure 3

Example of data obtained at various steps of the algorithm: (A) EEG data (25 channels) with 3 epileptic seizures (shown as red frames); (B) result of WP averaging over the 2–5 Hz frequency range, 25 channels, and 60-s time intervals; (C) result of ML training and testing with correctly (TP) and falsely (FP) detected events (shown as red and grey circles, respectively).

The obtained features were used as input for the SVM training step. At this step, we trained multiple SVM models with different hyperparameters. Two hyperparameters of the algorithm might affect its performance:

  • The type of learning is the strategy used for training and validation of the algorithm. We considered two approaches:

    • k-fold cross-validation (CV): the dataset is randomly permuted and split into k groups, or folds; then, for each k, the kth group is taken as the test data set, while the remaining \(k-1\) groups comprise the training data set41. In our case, we used the data of one subject as the dataset and chose \(k=10\).

    • “Leave-one-out cross-validation” (LOO): a configuration of k-fold cross-validation where k is set to the number of examples in the dataset42. In our case, we set k to the number of subjects, i.e., we used the data of all subjects except one for training and then verified the algorithm on the data of the excluded subject.

  • The threshold is the expected proportion of outliers in the training data. The SVM algorithm trains the bias term such that 100\(\times\)Threshold% of the observations in the training data have negative scores and are considered “outliers”. The minimum achievable threshold value is determined by the dataset properties and the SVM classifier parameters. By applying the threshold, we consider only the fraction of events with the highest AWP values. We assume that the outliers corresponding to extreme events, i.e., epileptic seizures, should be the ones with the highest AWP values. Thus, with the application of the threshold, we aim to separate epileptic seizure outliers from those caused by noises and artifacts. We tested eight values of the threshold: 10, 5, 2.5, 1, 0.5, 0.25, 0.1, and 0.05% (see the sketch after this list).
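A minimal sketch of the within-subject 10-fold CV scheme with a sweep over the threshold values, again using scikit-learn’s OneClassSVM as a stand-in; the nu parameter only approximates the threshold defined above.

```python
import numpy as np
from sklearn.model_selection import KFold
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM

THRESHOLDS = [0.10, 0.05, 0.025, 0.01, 0.005, 0.0025, 0.001, 0.0005]  # 10% ... 0.05%

def cv_outlier_flags(dawp, k=10, nu=0.005):
    """Train on k-1 folds of one subject's DAWP and flag outliers on the held-out fold."""
    x = np.asarray(dawp).reshape(-1, 1)
    flags = np.zeros(len(x), dtype=bool)
    for train_idx, test_idx in KFold(n_splits=k, shuffle=True, random_state=0).split(x):
        scaler = StandardScaler().fit(x[train_idx])
        svm = OneClassSVM(kernel="rbf", nu=nu).fit(scaler.transform(x[train_idx]))
        flags[test_idx] = svm.predict(scaler.transform(x[test_idx])) == -1  # -1 marks outliers
    return flags
```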

After this step, we obtained a set of SVM models with different hyperparameters. We used this set of SVM models as the input for the next step, where we aimed to find the most efficient model and corresponding optimal hyperparameters. To evaluate the efficiency of the ML algorithm, we used several characteristics derived from a confusion matrix43:

  • True positive (TP): correctly identified events, i.e. identified as epileptic activity by both ML method and expert.

  • False positive (FP): falsely identified events, i.e. identified as epileptic activity by the ML method but not by an expert.

  • False negative (FN): missed events, i.e. identified as epileptic activity by an expert but not by the ML method.

Each event was represented by one or several 60-s intervals \(T_m\) (when the algorithm identifies/misses several consecutive intervals, they are treated as one event). Examples of these events are shown in Fig. 3C.

Using these characteristics, we evaluated the efficiency of our algorithm in terms of two measures: sensitivity and precision. In diagnostics, sensitivity is a measure of how well a test identifies true positives, or, in our case, the probability of a positive test given that the patient has epilepsy. Precision is the probability that subjects with a positive screening test truly have the disease. To reflect sensitivity, we calculated the true positive rate (TPR) as:

$$\begin{aligned} TPR = \frac{TP}{TP+FN} \times 100\%. \end{aligned}$$
(6)

Precision or the Positive Predictive Value (PPV) is commonly defined as:

$$\begin{aligned} PPV = \frac{TP}{TP+FP} \times 100\%. \end{aligned}$$
(7)

The dependence of TPR and PPV on the threshold value is usually quite simple for classifiers: TPR decreases and PPV increases as the threshold value decreases. Thus, the choice of the optimal threshold value is dictated by a balance between acceptable TPR and PPV. However, the effect of the type of learning is not so obvious and requires statistical analysis.

We performed group-level statistics for TPR and PPV. Normality was tested with the Kolmogorov–Smirnov test; since the samples’ distributions were not normal, we used the Wilcoxon signed-rank test, followed by the Holm correction for multiple comparisons. The statistical analysis was performed using the SciPy and Statsmodels packages for Python.
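A sketch of this group-level comparison with SciPy and Statsmodels; the inputs are assumed to be arrays of per-subject TPR and PPV values for the two types of learning.

```python
from scipy import stats
from statsmodels.stats.multitest import multipletests

def compare_learning_types(tpr_cv, tpr_loo, ppv_cv, ppv_loo):
    """Paired Wilcoxon signed-rank tests (CV vs. LOO) for TPR and PPV with Holm correction.
    Normality of the samples can be checked beforehand with scipy.stats.kstest."""
    p_values = [stats.wilcoxon(tpr_cv, tpr_loo).pvalue,
                stats.wilcoxon(ppv_cv, ppv_loo).pvalue]
    reject, p_corrected, _, _ = multipletests(p_values, method="holm")
    return dict(zip(("TPR", "PPV"), p_corrected)), reject
```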

After finding the optimal SVM model, we used it to analyze the clinical data and obtain the final result: a marking of the time intervals that correspond to epileptic seizures (see Fig. 3C).

Results

To verify the presence of extreme behavior in the analyzed data, we fitted the AWP data of all 83 subjects with the Weibull distribution, as described in the Methods section. Figure 4 shows the PDF of \(E(t)^{norm}\) for all subjects as the blue histogram and the fitted Weibull distribution as the red line. According to the Pickands–Balkema–de Haan theorem44,45, the elongated tail of the experimental distribution fitted by a “heavy-tailed” Weibull distribution (shape parameter \(c \sim 0.6\)) is a sign of extreme behavior (see Fig. 4). This finding also agrees well with our other paper15. We conclude that our data demonstrate extreme behavior and thus can be subjected to techniques aimed at the detection of “outliers”, such as the one-class SVM.

Figure 4

PDF of \(E(t)^{norm}\) for all subjects (histogram) and fitted Weibull distribution (red line).

We applied the developed SVM to the data and calculated TPR and PPV. First, we calculated TPR and PPV for different threshold values. The results are shown in Table 1: we calculated the TPR and PPV values averaged over all 83 subjects for each threshold value (10, 5, 2.5, 1, 0.5, 0.25, 0.1, 0.05%) and each type of learning (CV and LOO). For CV, TPR decreases and PPV increases as the threshold value decreases, as expected. However, the increase of PPV beyond the 0.5% threshold value becomes less noticeable, while TPR is still high at this value. For LOO, TPR also decreases with decreasing threshold value, but the TPR values are generally lower than in the case of CV. Thus, we focused on the TPR and PPV for CV and selected the threshold value of 0.5% for the SVM classifier.

Table 1 Results of data analysis with SVM classifier: threshold influence.

As mentioned above, the mean TPR is noticeably higher for CV than for LOO, so a comparison of these two types of learning via a statistical test is required. With the chosen threshold, we calculated the TPR and PPV values for the two types of learning; the results are shown in Table 2. We found a significant main effect of the type of learning on TPR but, surprisingly, not on PPV. Learning and testing with CV resulted in \(TPR = 76.97 \pm 4.40\) (mean ± standard error (SE)), which was significantly (\(p = 3.18 \times 10^{-7}\)) higher than for LOO, \(TPR=39.96 \pm 5.20\). However, there was no significant effect (\(p=0.12\)) for PPV: \(PPV=12.70 \pm 1.47\) for CV vs. \(PPV=9.30 \pm 1.72\) for LOO. Thus, we chose CV as the more appropriate type of learning and considered these TPR and PPV values as the final result of the SVM algorithm. The TPR value is acceptable for a simple unsupervised classifier and can possibly be improved by further tuning of the classifier’s hyperparameters. The PPV value is comparatively low, but one should take into account the heavy skewness of the EEG data and the considerable length of the recordings. Overall, the results of the proposed classifier are not outstanding and cannot compete with some supervised classifiers in terms of sensitivity or precision. However, the primary goal of this SVM classifier was to demonstrate that the link between epilepsy and extreme behavior opens up opportunities for novel approaches in EEG data analysis.

Table 2 Results of data analysis with SVM classifier: type of learning influence.

In most cases (76 out of 83 subjects), TPR takes one of two opposite values: 100% (60 subjects) or 0% (16 subjects). On the one hand, this may be caused by the small number of seizures in each subject, usually only one, so \(TPR=100\%\) when this seizure is detected and \(TPR=0\%\) when it is not. On the other hand, there are some instances where multiple seizures were all detected or all missed; this may suggest some unexpected features in the data that make the difference. To study this possibility, we formed and analyzed two datasets: subjects with \(100\%\, TPR\) (shown in red in Table 2) and subjects with \(0\% \,TPR\) (shown in blue).

We hypothesized that extreme behavior differs between these two datasets. To check this hypothesis, we performed an analysis similar to that presented in Fig. 4, but this time we considered DAWP. We constructed the PDF individually for each subject and fitted it with the Weibull distribution. We then analyzed the Weibull distributions for the two datasets separately and calculated the mean ± SE for each dataset. Figure 5A demonstrates the PDFs for \(100\%\, TPR\) (red histogram) and \(0\% \,TPR\) (blue histogram). The fitted Weibull distributions are shown as solid lines and shaded areas (mean ± SE): red for \(100\% \,TPR\) and blue for \(0\% \,TPR\). Figure 5B shows the marking with TPs (red) and FNs (blue) obtained during classification. The PDF for the \(100\%\, TPR\) group possesses a more evident “heavy tail”, and we speculate that it corresponds to more pronounced extreme behavior. To check this, we performed a statistical analysis and compared the characteristics of the fitted Weibull distributions for the \(100\% \,TPR\) and \(0\% \,TPR\) groups. The analysis showed that the shape parameter c was significantly (\(p<0.005\)) higher for the \(100\% \,TPR\) dataset, which explains the “extreme” tail of the Weibull distribution. Additionally, one can see that the TPs are distributed in the tail, while many FNs lie in the \(<0\) area, which suggests that they did not make it into the above-95th-percentile subset.

Figure 5

(A) PDF of \(e_m^{norm}\) for \(100\%\, TPR\) (red histogram) and \(0\%\, TPR\) (blue histogram) and the group Weibull approximations (mean ± SE) for the \(100\%\, TPR\) dataset (red) and the \(0\%\, TPR\) dataset (blue); (B) marking for TPs (red) and FNs (blue).

Discussion

We propose using an outlier detection ML technique, the one-class SVM, to detect epileptic seizures. Testing our approach on 83 patients, we obtained 77% sensitivity and 12% precision. In 60 patients, sensitivity reached 100%. The low precision indicates that 88% of detections reflect suspicious non-seizure activity, which becomes a subject of manual sorting by a medical expert. Due to the rare nature of seizures, their number hardly surpasses 10, resulting in an overall number of detections below 100. Thus, running the seizure detection algorithm reduces the expert’s workload by up to 95%.

Unlike traditional classification algorithms, outlier detection incorporates the rare nature of seizures, hence handling the class imbalance problem. When classifying EEG data into two classes, “seizure” and “non-seizure”, one finds that the distribution of examples across these classes is heavily biased. Most classification algorithms assume an equal number of examples for each class; therefore, they demonstrate poor predictive performance for the minority class46. Popular methods to handle imbalanced data rely on oversampling of the minority class or undersampling of the majority class47. These methods (known as external methods) aim at balancing the data and require advanced preprocessing procedures48. An alternative approach is designing a new classification algorithm to tackle the bias produced by the imbalanced data (internal methods). A working solution is one-class classifiers, which fit models using only a single class of data49. The one-class SVM37,38,39 is an example of a one-class classifier used for outlier detection50, including the detection of extreme events51,52.

Unlike supervised algorithms, outlier detection methods can train and operate on the data of a single patient, hence handling overfitting. The majority of existing seizure detection methods rely on supervised ML algorithms53 and face a high possibility of overfitting. EEG data demonstrate high variability between patients and even between different recordings of the same patient. Thus, there is a risk that the model fits specific features of a particular subject and fails on the data of other subjects. Addressing this issue requires forming highly representative training data, which is a challenging task in epileptology. Unsupervised algorithms use unlabeled data and perform clusterization21. For seizure detection, they cluster EEG data into two clusters: “seizure” and “normal activity”54. For the one-class SVM, we tested how its performance depends on the training data and found better performance when it classifies the data of the same subject rather than that of others. These results provide evidence that there are subject-specific epilepsy-related characteristics of EEG. Thus, we theorize that these individual features are more important for the classifier than the common ones derived from a dataset of multiple subjects.

Table 3 Results of data analysis with SVM classifier: workload reduction.

We feed the algorithm with an interpretable feature (EEG power in the 2–5 Hz frequency range) that demonstrates extreme behavior during seizures. Direct application of an ML algorithm to raw EEG data barely produces a reasonable output. Thus, it requires informative input features, which are identified through a feature selection procedure. An informative feature space may be constructed from the data using deep-learning (DL) methods. DL sometimes achieves outstanding performance but utilizes features that lack interpretation. In supervised algorithms, this increases the risk of overfitting, since the model fits the data in a data-specific feature space. An alternative way is constructing features manually using expert-level knowledge of the dataset. For multichannel EEG data, a high-dimensional feature space can be derived by time-frequency decomposition55,56,57. Many researchers have proposed time-domain features, e.g., line length, frequency, and energy58,59. The repeatability, regularity (periodicity), synchronicity, and amplitude variation of EEG are also major time-domain features differentiating seizures from normal activity60. The most understandable way is to use features whose significance is supported by the results of EEG research. For epilepsy, a bulk of studies report EEG biomarkers distinguishing seizures from normal EEG61,62. In this study, we use our previous results proving the extreme behavior of the 2–5 Hz power during seizures15,16.

We suppose that interpretability gives us prior knowledge about the ability of the algorithm to handle the data of the current patient, hence allowing us to assess its credibility. Comparing the data of the subjects from the 100% TPR group with the others, we observe distinct probability distributions. In the 100% TPR group, the probability distribution has a heavy tail, indicating the presence of extreme events. For the rest of the subjects, the probability of extreme events is smaller and the tail is close to exponential. The detailed analysis reveals that the tail in the 100% TPR group almost solely consists of the detected seizures. For the rest of the subjects, the actual seizures belong to the 95% of data below the percentile threshold (see Fig. 5B).

Finally, we estimate the possible practical effect of using our algorithm in a decision-support system. The analysis of the whole dataset with a total duration of \(t_{base}\) can now be substituted with the analysis of a subset with a duration of \(t_{reduc}\). This subset includes all episodes marked as seizures by the algorithm: true positives (TPs) with a total duration of \(t_{TP}\) and false positives (FPs) with a total duration of \(t_{FP}\), so the duration of the subset is \(t_{reduc}=t_{TP}+t_{FP}\). We assume that the time required for the expert to process the data is proportional to the size of the dataset. Thus, the workload reduction percentage is \(P_{reduc}=100\%\,(t_{base}-t_{reduc})/t_{base}\).
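As a simple illustration of this estimate (the numbers below are hypothetical and not taken from the dataset):

```python
def workload_reduction(t_base, t_tp, t_fp):
    """P_reduc = 100% * (t_base - (t_TP + t_FP)) / t_base, all durations in seconds."""
    return 100.0 * (t_base - (t_tp + t_fp)) / t_base

# Hypothetical example: an 8-h recording with 20 flagged 60-s intervals in total
print(workload_reduction(8 * 3600, 5 * 60, 15 * 60))   # -> 95.8 (% of the data skipped)
```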

For the 100% TPR group, we found that the workload reduction percentage was \(P_{reduc}=95.84 \pm 0.46\) (mean ± SE) (see Table 3). Thus, the expert will spend \(\sim \,95\%\) less time detecting seizures, so the patient will be provided with a confirmed diagnosis earlier and will get the necessary treatment sooner. Moreover, the time reduction may prevent the fatigue of the expert, which arises during the analysis of prolonged EEG recordings and increases the probability of misdiagnosis.

The principal limitation of the present study is that it did not aim to develop an optimal algorithm with characteristics (sensitivity and precision) superior to other seizure detection methods or to the results of a medical expert. Here, we demonstrate the concept of applying outlier detection methods to the seizure detection problem, with extreme event theory supporting the choice of this class of methods for the considered problem. This makes the developed approach interpretable, and one should consider the proposed method as the basis of a medical decision support system. It is important to note that the developed method can greatly facilitate the work of the expert, reducing the duration of the analyzed dataset by up to 95%. Also, note that we chose the one-class SVM because this classifier is a traditional algorithm for outlier detection. Of course, the SVM can be replaced by other, better methods. Conducting such research is intriguing and is planned for our future studies.

Conclusions

Based on extreme event theory, we propose using an outlier detection algorithm, the one-class SVM, to detect epileptic seizures in the human EEG using the 2–5 Hz power as a feature. In most patients, the probability distribution of the 2–5 Hz EEG power was heavy-tailed, demonstrating the presence of extreme events. For them, the one-class SVM achieved 100% sensitivity and 12% precision. The low precision indicates that 88% of detections reflect suspicious non-seizure activity, which becomes a subject of manual sorting by an expert. Due to the rare nature of seizures, their number hardly surpasses 10, resulting in an overall number of detections below 100. Thus, running the seizure detection algorithm reduces the expert’s workload by up to 95%. For the rest of the subjects, the probability distribution barely displayed manifestations of extreme behavior, and the SVM demonstrated a poor ability to detect seizures. Finally, we showed that the one-class SVM used only a single subject’s data for training and was therefore stable against between-subject variability. Our results demonstrate an effective convergence between extreme value theory, a physical concept, and outlier detection algorithms, a machine learning concept, toward solving a meaningful task in medicine.