Asynchronous c-VEP communication tools—efficiency comparison of low-target, multi-target and dictionary-assisted BCI spellers

Gembler, Felix W.; Benda, Mihaly; Rezeika, Aya; Stawicki, Piotr R.; Volosyak, Ivan

doi:10.1038/s41598-020-74143-4

Download PDF

Article
Open access
Published: 13 October 2020

Asynchronous c-VEP communication tools—efficiency comparison of low-target, multi-target and dictionary-assisted BCI spellers

Felix W. Gembler¹,
Mihaly Benda¹,
Aya Rezeika¹,
Piotr R. Stawicki¹ &
…
Ivan Volosyak¹

Scientific Reports volume 10, Article number: 17064 (2020) Cite this article

1809 Accesses
13 Citations
Metrics details

Subjects

Abstract

Keyboards and smartphones allow users to express their thoughts freely via manual control. Hands-free communication can be realized with brain–computer interfaces (BCIs) based on code-modulated visual evoked potentials (c-VEPs). Various variations of such spellers have been developed: Low-target systems, multi-target systems and systems with dictionary support. In general, it is not clear which kinds of systems are optimal in terms of reliability, speed, cognitive load, and visual load. The presented study investigates the feasibility of different speller variations. 58 users tested a 4-target speller and a 32-target speller with and without dictionary functionality. For classification, multiple individualized spatial filters were generated via canonical correlation analysis (CCA). We used an asynchronous implementation allowing non-control state, thus aiming for high accuracy rather than speed. All users were able to control the tested spellers. Interestingly, no significant differences in accuracy were found: 94.4%, 95.5% and 94.0% for 4-target spelling, 32-target spelling, and dictionary-assisted 32-target spelling. The mean ITRs were highest for the 32-target interface: 45.2, 96.9 and 88.9 bit/min. The output speed in characters per minute, was highest in dictionary-assisted spelling: 8.2, 19.5 and 31.6 characters/min. According to questionnaire results, 86% of the participants preferred the 32-target speller over the 4-target speller.

Asynchronous non-invasive high-speed BCI speller with robust non-control state detection

Article Open access 04 June 2019

Optimising non-invasive brain-computer interface systems for free communication between naïve human participants

Article Open access 10 December 2019

Multi-frequency steady-state visual evoked potential dataset

Article Open access 04 January 2024

Introduction

Brain–computer interfaces (BCIs) hold the potential to aid people with severe clinical disorders in their daily life as they allow hands-free control and communication. These systems translate the BCI users’ brain activity, usually acquired non-invasively via electroencephalography (EEG), into control commands for external devices¹. For example, BCIs may serve as communication tools for people who cannot use typical manual input devices.

Various BCI communication applications (typically referred to as spellers) have been realized over the last years. They have been categorized according to the analysed brain signal (e. g. event-related potentials or sensorymotor rhythms), the graphical user interface (GUI) design (multi-step versus single-step), the selection interval mechanism (synchronous versus asynchronous) and the usage of additional features (e. g. word completion methods)².

Brain signals used for BCI control also include visual evoked potentials (VEPs) which have been studied since the 1970s³. Nowadays, two VEP approaches are predominantly used in BCI research: the frequency-modulated VEPs (f-VEPs)^4,5,6,7 and the code-modulated VEPs (c-VEPs)^8,9,10,11. In spellers based on VEPs, several stimuli classes, each flickering with a unique pattern, represent control commands; for example, for selecting letters of a virtual keyboard. The BCI classifies which target the user is looking at by interpreting the brain signals in real-time. For the c-VEP paradigm (used in the presented study), the flickering patterns are modulated with different time lags of a binary code sequence; EEG templates for each stimulus class need to be generated from data collected in a recording session.

BCI spellers can employ a low-target (multi-step) or a multi-target (single-step) graphical user interface (GUI) design. In low-target interfaces, several selections are needed to choose the desired character. A low number of stimulus classes are sufficient; spellers with only four or five different flickering patterns are quite common^{4,12,13,14,15}. While low-target spellers allow high classification accuracies, the overall spelling speed is limited, as several selection intervals (typically consisting of stimulation intervals and flicker-free intervals for gaze-shifting, where users can shift their gaze to the next target) are required for letter selections.

Multi-target spellers, on the other hand, employ a single-step GUI design and typically resemble a QWERTY-style keyboard layout^16,17,18,19 or use an alpha-numeric letter arrangement^7,10,11,20. These interfaces usually use 28¹⁹ to 55¹⁷ stimulus targets to present the English alphabet consisting of 26 letters and sometimes additional characters such as numbers or punctuation marks. According to recent publications, the highest spelling speeds were achieved with an alpha-numeric 40-target interface developed by Chen et al.⁷, who reported an average information transfer rate (ITR) of 267 bit/min employing 0.5 s stimulation intervals and 0.5 s gaze-shifting intervals. Employing even shorter stimulation time windows of 0.3 s, Nakanishi et al.²¹ reported average ITRs of 325 bit/min (cue-guided selection task) and 199 bit/min (copy-spelling task).

While multi-target spellers allow faster speeds, they may cause more eye fatigue than low-target spellers, as was observed in SSVEP studies (e. g.²²). More importantly, due to the high number of targets, which need to be distiguished by the system, these systems tend to be less precise. Bin et al.²³ tested both a 16-target and a 32-target c-VEP system and observed that doubling the number of targets caused an accuracy drop from 92% to 85%. Moreover, so-called BCI illiteracy cases, where users were not able to achieve sufficient control over multi-target systems, have been reported repeatedly^11,19,24. For example, Renton et al.¹⁹ reported that almost half of 38 participants did not achieve sufficient accuracies for reliable free communication with a 28-target f-VEP speller employing 1.5 s stimulation intervals (i. e. <80% accuracy in their preliminary assessment).

Many researchers focus on improving classification accuracy to reduce BCI illiteracy. The classical classification method involving canonical correlation analysis (CCA)^5,25 has been improved several times. Chen et al.²⁰ suggested CCA classification based on filter banks for the f-VEP paradigm. Their method decomposes the original data by applying several band-pass filters. The authors tested several different decomposition designs: equally spaced, harmonic and overlapping sub-bands, and observed that the latter yielded the highest accuracy. Recently, Monidini et al.²⁶ investigated the number of correlations coefficients considered for CCA classification and found a significant improvement in classification accuracy if more than one coefficients (as in the conventional approach) were considered.

An approach to improve both the classification accuracy and the overall system usability is a dynamic classification window paradigm. The stimulation intervals can either be determined by the system (synchronous spellers^7,11,23) or involve on-line classification scores based on real-time EEG data, which are compared to threshold values (asynchronous spellers^12,17,27). The latter approach reduces unintended selections (often referred to as the Midas touch problem²⁸) and significantly increases accuracies in practical spelling scenarios. While synchronous applications with small stimulation intervals are often tested to demonstrate high ITRs, they may overestimate the true communication speed achievable in a realistic setting. On the other hand, although generally slower, asynchronous applications may achieve a more naturalistic communication and are better suited for naive users.

A way to improve spelling efficiency for asynchronous spellers is word completion features, which allow users to spell words with fewer selections. While several word prediction features have been implemented for other BCI paradigms such as event-related potentials^29,30, they are rarely used in BCIs based on VEPs. The few VEP spellers with dictionary support fall in the category of asynchronous low-target systems: Volosyak et al.³¹ presented a dictionary feature for an asynchronous multi-step f-VEP system (the so-called Bremen-BCI speller), where a drop-down list containing six dictionary suggestions was employed; more recently, we presented an asynchronous multi-step 8-target c-VEP system offering word suggestions based on an n-gram word prediction model³². For multi-target systems, these kinds of features may be beneficial as well.

To investigate what kind of speller (multi-step, single-step, dictionary-assisted) is ideal in terms of reliability, speed, cognitive load, and visual load, we tested a 4-target system (low-target, multi-step) and a 32-target (multi-target, single-step) system using the c-VEP paradigm. The latter system also offered dictionary suggestions. A large subject group (58 participants) went through different spelling tasks: letter-by-letter spelling tasks to investigate the effect of the numbers of targets and dictionary-assisted spelling to investigate the efficiency of the word prediction feature.

For signal classification, we used a new ensemble approach employing multiple spatial filters based on CCA correlation coefficients. Moreover, as naturally occurring EEG activity (e. g. alpha activity when closing the eyes) may lead to false classifications, the original EEG data were decomposed into alpha-band (8–12 Hz), beta-band (approx. 12–30 Hz), and gamma-band (>30 Hz) related activities. Weights for this filter bank design were determined individually based on the training data. The idea behind the approach was to enhance the separation between natural brain activity and stimuli induced responses. For example, for some users, natural alpha activity (associated with tiredness) may interfere with cVEP detection. Due to the individual weights the impact of alpha activity on classification can be reduced in such cases.

In summary, the overall aims of the study were the following:

Confirming our previous results that all subjects are able to use c-VEP-based BCIs,
Comparing low-target and multi-target BCIs in terms of user-friendliness, accuracy and speed,
Evaluating the efficiency of dictionary features for asynchronous multi-target BCIs,
Evaluating the proposed classification model based on individualized filter bank design and multiple spatial filters.

Next to the typical performance metrics employed in BCI research, we assessed subjective opinions after training and spelling phases using multiple questionnaires.

Results

The overall performance of the spellers was assessed via classification accuracy, selection time (including 1 s gaze shift), ITR¹, and output characters per minute (OCM)²⁹ (see Methods section). The results of the off-line cross-validation and the on-line spelling experiments for the 4-target and 32-target system are presented in the following.

On-line spelling phase

Three spelling tasks were performed: letter-by-letter spelling with the 4-target GUI, letter-by-letter spelling with the 32-target GUI (in both cases the same pangram was spelled), and dictionary-aided spelling with the 32-target GUI (different real world English sentences were spelled). All 58 participants successfully completed the three spelling tasks (BCI literacy rate 100%); Figure 1 displays the individual results. In the following, mean (M) scores and standard deviations (SDs) are listed. Paired two-tailed Students t-tests were used to investigate differences in performance between the 32- and 4-target spellers, and between the dictionary-supported spelling and standard letter-by-letter spelling.

Regarding the differences between 32 and 4 targets in on-line performance, the letter-by-letter spelling tasks were performed with high accuracies for the 32-target system ($M=95.5\%$, $SD=3.6$) and for the 4-target system ($M=94.4\%$, $SD=4.3$) system. The difference in accuracy was not significant ($t=1.42$, $p=0.16$). The mean selection time was longer for the 32-target system ($M=3.04$ s, $SD=1.01$) than for the 4-target system ($M=2.31$ s, $SD=0.80$). This difference was significant $t=5.60$, $p<0.0001$. The mean ITR was higher for the 32-target system ($M=96.9$ bit/min, $SD=24.9$) than for the 4-target system ($M=45.2$ bit/min, $SD=10.9$). This difference was significant $t=17.2$, $p<0.0001$. Similarly, the mean OCM in letter-by-letter spelling was significantly higher ($t=19.1$, $p<0.0001$) for the 32-target system ($M=19.5$ characters/min, $SD=5.0$) than for the 4-target system ($M=8.2$ characters/min, $SD=2.0$).

To investigate the differences between dictionary-supported spelling and standard letter-by-letter spelling, the on-line performances of the respective tasks with the 32-target system were evaluated. High mean accuracies in letter-by-letter spelling ($M=95.5\%$, $SD=3.6$) and dictionary-supported spelling ($M=94.0\%$, $SD=6.0$) were achieved. The difference in accuracy was not significant ($t=1.9$, $p=0.06$). The mean selection time was slightly shorter in letter-by-letter spelling ($M=3.04$ s, $SD=1.01$) than in dictionary-supported spelling ($M=3.16$ s, $SD=0.87$). This difference was also not significant $t=1.32$, $p=0.19$. The mean ITR was slightly higher in letter-by-letter spelling ($M=96.9$ bit/min, $SD=24.9$) than in dictionary-supported spelling ($M=88.9$ bit/min, $SD=20.8$). This difference was significant $t=2.6$, $p=0.01$. On the other hand, due to the dictionary suggestions, the mean OCM was significantly higher ($t=9.1$, $p<0.0001$) in dictionary-supported spelling ($M=31.7$ characters/min, $SD=10.5$) than in letter-by-letter spelling ($M=19.5$ characters/min, $SD=5.0$).

Additional exploratory analysis was conducted to investigate differences between male and female participants. Welch’s two sample t-tests were conducted for the accuracies reached with the 4-target and 32-target letter-by-letter tasks. For the 4-target speller, the difference between female ($M=95.6\%$, $SD=2.6$) and male ($M=93.3\%$, $SD=5.3$) participants was significant ($t=2.1$, $p=0.04$). In the same way, for the 32-target speller, the difference between female ($M=96.6\%, SD=2.5$) and male ($M=94.3\%$, $SD=4.2$) participants was significant ($t=2.5$, $p=0.02$).

Questionnaire results

After off-line recording sessions and spelling sessions, participants went through questionnaires (for more details refer to the experimental procedure in the Methods sections).

A series of two-tailed Wilcoxon signed-rank tests were conducted to examine differences regarding the subjective user impressions between the 4-target and the 32-target speller. In the following, median scores, interquartile ranges (IQRs) and ranges are listed.

Figure 2a shows the post-training questionnaire results related to the flickering. For the Likert item exhausting (7)/relaxing (1), the median score for the 4-target speller was 2 (IQR 2, range 1–5) compared to 3 (IQR 2, range 1–6) for the 32-target speller. For the Likert item annoying (7)/comfortable (1), the median score for the 4-target speller was 2 (IQR 2, range 1–6) compared to 2 (IQR 3, range 1–7) for the 32-target speller. In both cases, answers for the 4-target speller were shifted more towards the positive statement, i.e. the lower number ($p<0.001$).

Figure 2b shows the post-spelling questionnaire results related to the flickering. For the Likert item exhausting (7)/relaxing (1), the median score for the 4-target speller was 3 (IQR 2, range 1–7) compared to 2 (IQR 2, range 1–7) for the 32-target speller. For the Likert item annoying (7)/comfortable (1), the median score for the 4-target speller was 3 (IQR 2, range 1–7) compared to 2 (IQR 2, range 1–6) for the 32-target speller. Now, in both cases, answers for the 32-target speller were shifted more towards the positive statement ($p<0.001$).

Figure 2c shows the post-spelling questionnaire results related to the GUI. For the Likert item confusing (7)/clear (1), the median score for the 4-target speller was 1 (IQR 1, range 1–5) compared to 1 (IQR 1, range 1–5) for the 32-target speller. The question did not result in a significant difference ($p=0.57$). For the Likert item boring (7)/interesting (1), the median score for the 4-target speller was 1 (IQR 1, range 1–7) compared to 1 (IQR 1, range 1–6) for the 32-target speller. For this Item, the answers for the 32-target speller were shifted more towards the positive statement ($p=0.001$).

Figure 2d shows the post-spelling questionnaire results related to the performance. For the Likert item slow (7)/fast (1), the median score for the 4-target speller was 3 (IQR 1, range 1–6) compared to 2 (IQR 2, range 1–5) for the 32-target speller. The answers for the 32-target speller were shifted more towards the positive statement ($p<0.001$). Finally, when asked, which of the two systems they preferred, 50 participants (86.2%) voted for the 32-target speller.

Off-line results

The recorded training data was used for exploratory analysis via a 4-fold stratified cross-validation³³. The results were then averaged across folds.

Figure 3 shows the accuracies and ITRs for time windows up to 1 s. As expected, the accuracies are considerably higher for the 4-target system, while ITRs are considerably higher for the 32-target system.

To further compare 4-target and 32-target systems, the effect of the number of electrodes on BCI performance was analysed. Figure 4 shows classification accuracies for the 4-target and the 32-target speller for different channel montages around the visual cortex. The classification accuracy decreases with the reduction of electrodes. The drop in accuracy from 16 to 8 electrodes is much larger than that from 32 to 16 electrodes.

The effectiveness of the proposed classification (filter bank approach and adaptive weight mechanism) was assessed. Figure 5 compares the off-line accuracies of the standard c-VEP classification method (i. e. without filter bank decomposition) and the proposed method. The median accuracies were generally higher with the suggested methods. However, for the 4-target speller, only low classification time windows yielded considerable differences.

Discussion

For practical BCI applications classification accuracy, communication speed, and robust non-control state are essential. The latter point is particularly crucial, as in true communication, users do not always intend to enter commands for certain time periods. In asynchronous implementations, output commands are only produced if the user intends to do so.

One aim of the study was to explore the efficiency of the asynchronous dictionary-supported multi-target c-VEP system. We used a dynamic time window mechanism employing a threshold-based classification approach. For the user, this means the flicker intervals changed dynamically, the flicker-free gaze-shifting phases were set to 1 s in this study. Various other studies employ shorter gaze-shifting phases of 0.5 or 0.75 s. In preliminary tests, we found that such short intervals may reduce classification accuracy, especially for users who are unfamiliar with the system. This is inline with remarks by Chen et al., who used 1 s gaze shifting windows to increase accuracy in some cases.

Several other asynchronous VEP spellers have been developed previously^4,17,32,34: For example, Cecotti et al.¹² achieved 37.6 bit/min and 5.5 characters/min with 8 participants testing a 4-target menu-based f-VEP speller. Volosyak et al.⁴ achieved 61.7 bit/min and about 10 characters/min with 7 participants testing a 5-target letter grid layout employing the f-VEP paradigm. In a previous study³², we tested an 8-target layout with 18 participants using n-gram dictionary functionality and achieved an ITR of 57.8 bit/min and 18.4 characters/min with different English sentences. Nagel et al.¹⁷ reported 109.1 bit/min and 16.1 characters/min with 10 participants who used a 55-target German QWERTZ-layout, spelling 3 times the German phrase “Asynchron BCI” (case sensitive). In terms of ITR, these results were among the fastest reported for asynchronous applications. For the 32-target speller used in this study, slightly lower ITRs of about 90 bit/min were achieved; but due to the dictionary integration, the average output character speed increased to 31.6 characters/min on the average (up to 60 characters/min) in dictionary-assisted spelling. Notably, despite the high number of participants and the complexity of the spelling tasks, this is the highest character output efficiency reported with asynchronous applications until now. We would like to point out that the reported OCM values are highly dependent on the complexity of the sentence tasks. For simple sentences, dictionary suggestions are more helpful resulting in higher OCM scores. For example, S20 and S21 both achieved an ITR of roughly 100 bit/min; the OCMs, however, were quite different (S20 achieved 24.2 spelling “LIBERTY CONSISTS IN DOING WHAT ONE DESIRES” and S21 achieved 38.2 characters/min spelling “I WILL TRY TO MAKE IT RIGHT THIS TIME”). The ITR in dictionary-assisted spelling was slightly lower than in the pangram spelling task (88.9 bit/min versus 96.9 bit/min). The reduced ITR can be attributed to additional search phases and increased mental load. The fact that accuracy remained high (no significant difference) demonstrates the robustness of the asynchronous selection paradigm. It should be highlighted that subjects used the GUI for the first time. The dictionary function was not used optimal by the participants. In some cases, useful suggestions were overseen and participants continued to spell letter by letter (we did not consider this as a false classification). With more experience with the dictionary functionality and letter arrangement, performance may increase. As dictionary integrations and auto-correction methods improve further, usability and efficiency will also increase further.

Another aim of the study was the comparison of low-target and multi-target c-VEP BCI control. The 32-target speller outperformed the 4-target speller significantly in terms of ITR and OCM in the letter-by-letter spelling task. According to the off-line analysis, accuracies of the 4-target speller were considerably higher than the accuracies of the 32-target speller (see Figure 3). Interestingly, however, accuracies in the letter-by-letter spelling task did not differ significantly. While, according to the questionnaire, most participants preferred the 32-target speller, some participants noted that the visual stimulation was overwhelming, especially during the training phase.

Although in this study, a large subject group was tested, it does not reflect the general population due to the low mean age. Previous studies suggest that elderly users achieve lower ITRs^35,36. Disabled users also tend to achieve lower ITRs³⁶. Although successful tests with patients using multi-target systems have been reported⁸, a lower number of targets may be the better option in terms of classification accuracy^22,23,37. For example, Carvalho et al.³⁷ tested SSVEP systems using two, four, and six class layouts with two stroke patients and eight healthy participants. They observed a negative correlation between the number of targets and accuracy reporting 97%, 77%, and 57% for a two, four, and six class interface, respectively. It should be noted that the target size and the number of trials for training differed for the two layouts. Especially the latter difference was significant (24 trials versus 128 trials), which makes a comparison difficult. The off-line accuracies, which are (as expected) better for the 4-target system suggests that much less trials are needed for low-target systems.

The exploratory analysis supports a trend that female participants achieve better accuracies than male participants as observed in several other studies^13,38,39: Also in the current study, for both the 4-target and the 32-target speller, female users achieved significantly higher accuracies than male users in letter-by-letter spelling.

According to the post-training questionnaires related to the stimulus presentation, participants rated the training for the 32-target speller more exhausting and annoying than for the 4-target speller. This is likely because a much higher number of trials were recorded for the 32-target speller, which made the training much longer. Several methods to reduce or eliminate the training time have been proposed for the f-VEP paradigm: Yuan et al.⁶ generated EEG templates from a large data set from various subjects and transferred it to a new subject. Nakanishi et al.⁴⁰ explored the usage of individual templates in several sessions. Similar approaches could be realized for the c-VEP paradigm.

According to the post-spelling questionnaire related to the stimulus presentation and overall BCI performance, participants rated the 4-target speller as more exhausting, more annoying and slower. These scores reflect the on-line results, which likely impacted the subjective impression. In general, only a few participants rated the flickering sensation during the spelling tasks as exhausting (16% and 5% for the 4-target and 32-target speller) or annoying (14% and 5%). Still, the flickering sensation can be reduced by employing more subtle stimulus patterns^41,42 or higher flickering rates^43,44. However, a decrease in performance may be expected as a consequence.

Next to the flickering sensation, another important issue regarding usability is the EEG setup. Using a low number of EEG electrodes/signal channels simplifies the electrode montage. Unfortunately, especially for poor performers, a higher number of EEG channels seems to be required to ensure adequate speed and accuracy. According to our off-line analysis, using 16 instead of 8 electrodes yielded a considerable increase in accuracy for both the 4-target and the 16-target speller. The extension from 16 to 32 electrodes, on the other hand, yielded only a minor increase.

Lastly, another aim of the study was to evaluate the classification model based on individualized filter bank design and multiple correlations. According to the off-line analysis, the methods yielded a substantially higher accuracy, especially for small classification windows. In this study, the filter bank design was based on alpha, beta, and gamma-band related brain activities (i. e. three sub-band components). The weights for the sub-band components were determined individually using the training data. For several participants, the filter bank approach resulted in substantially higher off-line accuracy values in comparison to the standard method. One explanation for this could be that subjects with high alpha activity may yield higher accuracies because the alpha-band activity is less dominant in classification (because of a smaller weight). Chen et al.²⁰ have introduced the filter bank approach for the f-VEP paradigm and reported maximal accuracies when using seven sub-band components. For the c-VEP paradigm, a higher number of sub-band decompositions may be applicable as well. It should be noted that additional sub-band components increase the computational complexity significantly. Moreover, to increase the robustness of the non-control state of the asynchronous selection approach, the implementation of pseudo targets which do not trigger a command selection may be applicable. For the f-VEP paradigm, this approach has already been implemented^35,45, where additional classes, e. g. averages between neighbouring frequencies, were considered during classification to increase overall system robustness. For the c-VEP paradigm, unused bit-shifts could be employed as pseudo targets.

The study explored usability and efficiency of asynchronous BCI speller variations. The dictionary-supported multi-target system yielded higher accuracies than expected and achieved high character output speeds due to the used word suggestion module. While small improvements in terms of EEG-based classification algorithms are still expected and needed, much greater improvements can be made with respect to GUI efficiency and user-friendliness. We encourage researchers to put a greater focus on user-centered features which are currently lacking behind.

Methods

This section describes the hardware and software setup used in this study; furthermore, details about the subjects and the procedure are provided. The experiment was performed in accordance with the Declaration of Helsinki and approved by the ethical committee of the University Duisburg-Essen, Germany. All participants gave written informed consent before participation and information needed for the analysis were stored anonymously.

Participants

In total, 58 (29 males, 29 females) able-bodied subjects with mean age of 24.4 years, standard deviation 3.6 years participated. All of them were recruited among students of the Rhine-Waal University of Applied Sciences. They had normal or corrected-to-normal vision, little to no prior experience with BCIs and no experience with the spellers tested. The experiments took approximately 60 minutes. All participants received a financial reward for their participation.

Hardware

The used computer (Dell Precision 3630 Tower) with operating system Microsoft Windows 10 Education, running on an Intel processor (Intel Core i7-8700K, @3.70 GHz) equipped with 16 GB RAM, and an NVIDIA graphics card (GeForce GTX 1080). The BCI GUI was displayed on a liquid crystal display screen (Acer Predator XB252Q, 1920 $\times $ 1080 pixel, 240 Hz refresh rate).

We used two synchronized EEG amplifiers (g.USBamp, Guger Technologies, Graz, Austria) connected to 32 passive Ag/AgCl signal electrodes according to the international 10/10 system of electrode placement⁴⁶: FCz, C3, C4, CP5, CP3, CP1, CPz, CP2, CP4, CP6, P7, P5, P3, P1, Pz, P2, P4, P6, P8, PO9, PO7, PO3, POz, PO4, PO8, O10, O1, Oz, O2, O9, Iz, O10. The reference electrode was placed at Cz and the ground electrode at AFz. Abrasive electrode gel was applied between the electrodes and the scalp to bring impedances below 5 k$\Omega $. The amplifier setup was the following: a band-pass filter from 2 to 100 Hz and a notch filter around 50 Hz were applied; the sampling frequency, $F_s$, was set to 600 Hz.

Spellers

Figure 6 shows the user interfaces of the 4-target and the 32-target system. In both spellers, audio (the selected command was voiced) and visual feedback (the selected box increased in size for a short period of time) were provided. Progress bars reflected the current state of the classifier output.

The 4-target speller^13,35 required three steps to select a letter. Four targets (230 $\times $ 230 pixel) represented the menu options, allowing the selection of 27 characters (26 letters and one underscore/space character) and a correction option. In the first step, the characters were presented into three groups of nine characters each (“A-I”, “J-R”, “S-_”) and the correction option (“$\leftarrow $”) allowed the deletion of the previously selected character. In the second step, the characters of the selected groups were presented in groups of three characters each, and in the third step, individually. In the second and the third step, the correction option (“$\leftarrow $”) allowed the user to go back to the previous step.

To spell the letter “B”, the user had to select the group “A–I” in the first step. The letters were then divided into the sub-groups “A–C”, “D–F”, and “G–I”. After selecting the group “A–C” in the second step, the individual letters “A”, “B” and “C” were presented, and the desired letter “B” could be selected.

The 32-target speller required one step for a selection. The 32 targets (150 $\times $ 150 pixel) represented 28 characters (26 letters, underscore and full stop character), 3 dictionary suggestions, and 1 correction option. The correction option (“$\leftarrow $”) enabled the user to undo the previous selection.

The dictionary suggestions of the 32-target speller were updated after each selection according to an n-gram prediction model (as used in our previously developed 8-target speller^27,47). In general, an n-gram model suggests a next item $x_{i}$ for a given sequence of n items by considering the probabilities $P(x_i|x_{i-(n-1)},\ldots ,x_{i-1})$. The 32-target interface used a bi-gram ($n=2$), where each item $x_i$ represented a word. The word suggestions were updated according to the previously spelled word.

The prediction model was implemented using a frequency list and a bi-gram list from the Leipzig Corpora Collection⁴⁸, which were based on approximately 1 million English sentences. After each selection, the suggestions were retrieved via structured query language (SQL). We used the database software SQLite to embed the dictionary functionality into our BCI software (written in C++).

To reduce the number of saccades in free communication, the three updated dictionary suggestions (selectable via the corresponding targets in the bottom row) were also displayed as information at the top of a selected target box during the gaze-shifting phase. Thus, the user did not need to move the gaze to the dictionary targets in the bottom row to check if the suggestions were useful. After the gaze-shifting phase, this additional information was removed from the previously selected target.

Experimental procedure

The study consisted of a session with the 4-target and another session with the 32-target speller. The order of sessions was randomly permuted to reduce the effects of learning and fatigue on the results. Each session consisted of a training phase and a spelling phase. After each phase, a short questionnaire was conducted.

During training, several trials of EEG data were recorded which were used to generate templates for individual c-VEP targets. The training was divided into $n_b$ blocks, where each target of the interface was fixated on for a 2.1 s trial (two full stimulation cycles of the code pattern). For the 32-target speller, $n_b=4$ training blocks were recorded ($N=4\cdot 32 = 128$ trials) and for the 4-target speller, $n_b=6$ training blocks were recorded ($N=6\cdot 4=24$ trials).

The participants initiated the training phase by pressing the space bar. The target they needed to gaze at was highlighted by a green frame. Targets were highlighted from upper left to right and top to bottom. In between trials, the flickering paused for 1 s, and in between blocks, the users could rest.

In the spelling phase a brief familiarization run, where participants learned the functionality of the speller layout was performed. For this, participants went through the copy-spelling tasks “BCI” and “BRAIN”. Thereafter, the pangram “THE QUICK BROWN FOX JUMPS OVER THE LAZY DOG” was spelled (letter-by-letter spelling). Participants were told to spell the phrase letter-by-letter; selections of dictionary suggestions were still possible and treated as false selection. Occurring errors needed to be corrected using the undo functionality. For the 32-target speller, an additional spelling task was performed: Participants typed sentences of varying complexity ($M=36.0$ characters, $SD=5.5$) using the dictionary functionality of the interface (dictionary-supported spelling). Table 1 shows the sentences used for this task.

Table 1 Sentences used in the on-line spelling phase with the 32-target speller. Provided are subject number with gender (M/F) and age in years, and the sentence used in the dictionary-assisted spelling task. Each participant had a different spelling task.

Full size table

Questionnaires

For both systems, the questionnaires consisted of several questions to assess the subjective impression regarding user-friendliness and efficiency. Two questions were answered after the training phase (post-training questionnaire) and five questions were asked after the spelling task (post-spelling questionnaire). Participants answered on a 7-point Likert scale, where 2 corresponded to complete agreement with a statement, and 7 corresponded to complete agreement with the opposing statement. In this regard, the opposing terms were exhausting versus relaxing, and annoying versus comfortable in the post-training questionnaire, and, in addition, confusing versus clear, boring versus interesting, slow versus fast in the post-spelling questionnaire.

Stimulus design

Stimulus presentation was realized with circularly shifted 63-bit m-sequences, which have been used in many c-VEP systems^10,11,23. The stimuli altered between the binary states ’black’ (the background colour, represented by ’0’) and ’white’ (represented by ’1’). The stimulus update rate was set to 60 Hz (a quarter of the monitor refresh rate). The duration of one stimulus cycle was therefore $63/60=1.05\,s$.

The initial code sequence used for the upper left BCI target was defined as

$$\begin{aligned} c_1=101011001101110110100100111000101111001010001100001000001111110. \end{aligned}$$

The remaining stimuli $c_k$, $k=2,\ldots ,K$, were circularly left shifted versions of $c_1$; in this respect, we employed left shifts of $k \cdot 4$ bit and $k \cdot 2$ bit for 4-target and the 32-target speller, respectively.

Stimulus presentation and data acquisition were synchronized via separated timers (one in the stimulus acquisition thread and another one in the stimulus presentation thread)³². Time stamps were accessed via system_clock::now from the std::chrono library (the accuracy of the function is hardware dependent).

Spatial filter design and template generation

We designed spatial filters by conducting CCA on the data collected during the training sessions²³. Given two multi-dimensional variables $\mathbf{X} \in {\mathbb {R}}^{m_1\times n}$ and $\mathbf{Y} \in {\mathbb {R}}^{m_2 \times n}$, CCA identifies weights $\mathbf{a} \in {\mathbb {R}}^{m_1}$ and $\mathbf{b} \in {\mathbb {R}}^{m_2}$ that maximize the correlation, $\rho $, between the so called canonical variates $\mathbf{x} =\mathbf{X} ^T \mathbf{a} $ and $\mathbf{y} =\mathbf{Y} ^T\mathbf{b} $ by solving

$$\begin{aligned} {\rho }=\max _\mathbf{a ,\mathbf{b} } \frac{E[\mathbf{a} ^T \mathbf{X} \mathbf{Y} ^T \mathbf{b} ]}{\sqrt{E[\mathbf{a} ^T \mathbf{X} \mathbf{X} ^T \mathbf{a} ] \, E[\mathbf{b} ^T \mathbf{Y} \mathbf{Y} ^T \mathbf{b} ]}}, \end{aligned}$$

(1)

where E denotes the expectation operator. The correlation value $\rho $ that solves (1) is the first and also called maximal canonical correlation.

Typically, in VEP research, only this first canonical correlation is used for classification or for the design of spatial filters. However, due to the noisiness of the EEG, information may be distributed over several coefficients⁵. Recently, Mondini et al.²⁶ showed that considering multiple correlations can improve signal classification.

CCA identifies further correlations as follows: Determining weights $\mathbf{a} _2$,$\mathbf{b} _2$ maximizing (1) subject to the restriction that the resulting pair of canonical variates is uncorrelated with the first pair yields the second canonical correlation, $\rho _{2}$. These steps can be repeated several times. In general, the number of canonical correlations is equal to the number of rows of the smaller variable. In this respect, CCA yields $m=\min \{m_1,m_2\}$ canonical correlations $\rho _{1}(=\rho ),\rho _{2},\ldots , \rho _{m}$ (sorted from highest to lowest), and the corresponding weight pairs $\mathbf{a} _i,\mathbf{b} _i$, $i=1,\ldots ,m$.

Here, multiple weights were used as well. Each training trial was stored in an $m\times n$ matrix, where m denotes the number of electrode channels (here, $m=32$) and n denotes the number of sample points (here, two 1.05 s stimulus cycles, $n=1.05 \cdot F_s\cdot 2=1260$). Initially, the N recorded trials were circularly shifted to match the phase of the first trial (which corresponded to a bit-shift of 0). The shifted trials $\mathbf{Z} _i\in {\mathbb {R}}^{m\times n}$, $i=1,\ldots ,N$, were then averaged, yielding

$$\begin{aligned} \bar{\mathbf{Z }}=\sum _{i=1}^N \frac{1}{N}\mathbf{Z} _i. \end{aligned}$$

(2)

From this matrix, templates $\mathbf{X} _i \in {\mathbb {R}}^{m\times n}$, $i=1\ldots K$ for each target class were constructed by circularly shifting, $\bar{\mathbf{Z }}$ according to the bit-shift of the underlying code sequence $c_i$ (see²³).

Two $m\times N\cdot n$ matrices were constructed to design CCA-based spatial filters,

$$\begin{aligned} \tilde{\mathbf{X }}=[\mathbf{Z} _1 \mathbf{Z} _2 \ldots \mathbf{Z} _{N}]\quad \text{ and }\quad \tilde{\mathbf{Y }}=[\underbrace{\bar{\mathbf{Z }}\bar{\mathbf{Z }}\ldots \bar{\mathbf{Z }}}_{\begin{array}{c} N \end{array}}], \end{aligned}$$

(3)

applying (1), yields the weight sets $\tilde{\mathbf{a }}_i, \tilde{\mathbf{b }}_i$, $i=1,\ldots ,m$. A subset of the former (the first s weights) were used as spatial filters, $\mathbf{w} _i=\tilde{\mathbf{a }}_i$, $i=1,\ldots ,s$. For the presented experiment, the number of considered canonical variates was $s=4$.

Asynchronous target identification

Every 0.05 s, the classification thread processed received EEG data blocks (stored as $m\times n_a$-matrix, where $n_a=F_s\cdot 0.05=30$). These data blocks were accumulated in a data buffer $\mathbf{Y} \in {\mathbb {R}}^{m\times n_y}$, and compared to reference signals $\mathbf{R} _i \in {\mathbb {R}}^{m\times n_y}$, $i=1,\ldots ,K$, which were constructed as sub-matrices of the templates $\mathbf{X} _i$, containing only the first $n_y$ columns.

Classification was performed for time windows higher or equal to 0.25 s ($n_y\ge 150$) . Correlation values $\lambda _k$, between reference signals and data buffer were calculated as

$$\begin{aligned} {\lambda }_k={\rho }\; \left( \begin{bmatrix} \mathbf{Y} ^T \mathbf{w} _1\\ \vdots \\ \mathbf{Y} ^T \mathbf{w} _s \end{bmatrix}, \begin{bmatrix} \mathbf{X _k}^T \mathbf{w} _1\\ \vdots \\ \mathbf{X _k}^T \mathbf{w} _s \end{bmatrix} \right) , \quad k=1,\ldots ,K. \end{aligned}$$

(4)

The classification candidate index C was determined as

$$\begin{aligned} C= \mathop {\hbox {arg max}}\limits _{k=1,\ldots , K} {\lambda }_k \,. \end{aligned}$$

(5)

The BCI output associated with C was only produced if the distance between the highest and second-highest correlation surpassed a threshold value, $\beta $. Otherwise, further samples were collected. The classification window of length $n_y$ extended incrementally as long as $n_y<n$. When $n_y=n$, the first n/2 columns of Y were shuffled out. For this window mechanism the samples per data block (here, $n_a=30$) were selected as divider of the cycle length in samples (here, $n/2=630$).

Whenever the threshold criterion was satisfied, the associated BCI output was produced, the data buffer $\mathbf{Y} $ was cleared, and a gaze-shifting period of 1 s followed, where data collection and flickering paused. We used $\beta =0.15$ and $\beta =0.1$ for the 4-target speller and 32-target speller, respectively. These values were determined based on preliminary test runs allowing low time classification windows and high accuracies for the presented layouts. This asynchronous approach was used during on-line tasks; the off-line evaluation in the result section was based on a synchronous approach (i. e. $\beta =0$).

Individualized filter bank design

The filter banks were generated with an 8th order Butterworth band-pass filter. The three sub-bands used in this study were defined by the following lower and upper cut-off frequencies:

1.
the sub-band between 8 and 60 Hz (covering the alpha, beta and gamma-bands),
2.
the sub-band between 12 and 60 Hz (covering the beta and gamma bands),
3.
the sub-band between 30 and 60 Hz (covering the gamma band).

For each sub-band, a separate set of spatial filters $\mathbf{w} ^{(l)}_i$ and templates $\mathbf{X} _i^{(l)}$ were determined as described before.

For filter bank classification, correlations were calculated for each sub-band independently using (4), which yielded a set of coefficients ${\tilde{\lambda }}^{(j)}_k$, $j=1,2,3$; $k=1,\ldots ,K$. For target identification, the following individualized combination was considered

$$\begin{aligned} \lambda _k= a_1{\tilde{\lambda }}_k^{(1)}+a_2{\tilde{\lambda }}_k^{(2)}+a_3{\tilde{\lambda }}_k^{(3)} , \quad k=1,\ldots ,K. \end{aligned}$$

(6)

The weights $a_j$, were set to $a_j=\rho ^{(j)}/(\rho ^{(1)}+\rho ^{(2)}+\rho ^{(3)})$, $j=1,2,3$, where $\rho ^{(j)}$ refers for the maximal correlation coefficients obtained via CCA (1) for the respective sub-band decomposition of the matrices in (3). Finally, the class label C was again obtained with (5).

Performance metrics

The classification accuracy, ITR¹, and OCM²⁹ were used to investigate BCI performance.

The classification accuracy, p, is calculated as the number of correctly classified selections divided by the total number of selections.

The ITR in bit/min, $B_m$, is calculated as

$$\begin{aligned} B_m=\frac{\log _2 K+p\log _2 p+(1-p)\log _2\left( \frac{1-p}{K-1}\right) }{t/60}, \end{aligned}$$

(7)

where K denotes the number of classes, and t denotes the average selection interval (in s). The number of classes was $K=4$ for the 4-target speller and $K=32$ for the 32-target speller. It should be noted, that for the 4-target speller, K can be determined by the number of selections in each step (i. e. the number of targets) or by the total number of possible selections (i. e. the number of output characters). In this study, the first option was used, which is applicable to measure the performance with respect to the classification methods (as the classification is performed in each step). An ITR calculation tool can be found at https://bci-lab.hochschule-rhein-waal.de/en/itr.html.

To evaluate speller efficiency, the OCM may be better suited than the ITR. The OCM score is calculated by dividing the number of spelled characters by the spelling time in min (required to complete the entire spelling task). This metric assumes that the user corrects all errors. The metric is applicable to measure the performance with respect to the application efficiency.

References

Wolpaw, J. R., Birbaumer, N., McFarland, D. J., Pfurtscheller, G. & Vaughan, T. M. Brain–computer interfaces for communication and control. Clin. Neurophysiol. 113, 767–791. https://doi.org/10.1016/S1388-2457(02)00057-3 (2002).
Article PubMed Google Scholar
Rezeika, A. et al. Brain–computer interface spellers: A review. Brain Sci.https://doi.org/10.3390/brainsci8040057 (2018).
Article PubMed PubMed Central Google Scholar
Vidal, J. J. Real-time detection of brain events in EEG. Proc. IEEE 65, 633–641 (1977).
Article Google Scholar
Volosyak, I. SSVEP-based Bremen–BCI interface—boosting information transfer rates. J. Neural Eng. 8, 036020. https://doi.org/10.1088/1741-2560/8/3/036020 (2011).
Article ADS PubMed Google Scholar
Lin, Z., Zhang, C., Wu, W. & Gao, X. Frequency recognition based on canonical correlation analysis for SSVEP-based BCIs. IEEE Trans. Biomed. Eng. 54, 1172–1176. https://doi.org/10.1109/TBME.2006.889197 (2007).
Article PubMed Google Scholar
Yuan, P., Chen, X., Wang, Y., Gao, X. & Gao, S. Enhancing performances of SSVEP-based brain–computer interfaces via exploiting inter-subject information. J. Neural Eng. 12, 046006. https://doi.org/10.1088/1741-2560/12/4/046006 (2015).
Article ADS PubMed Google Scholar
Chen, X. et al. High-speed spelling with a noninvasive brain–computer interface. Proc. Nat. Acad. Sci. 112, E6058–E6067. https://doi.org/10.1073/pnas.1508080112 (2015).
Article CAS PubMed Google Scholar
Sutter, E. E. The brain response interface: Communication through visually-induced electrical brain responses. J. Microcomput. Appl. 15, 31–45. https://doi.org/10.1016/0745-7138(92)90045-7 (1992).
Article Google Scholar
Bin, G., Gao, X., Wang, Y., Hong, B. & Gao, S. VEP-based brain-computer interfaces: Time, frequency, and code modulations [research frontier]. IEEE Comput. Intell. Mag. 4, 22–26. https://doi.org/10.1109/MCI.2009.934562 (2009).
Article Google Scholar
Wittevrongel, B., Van Wolputte, E. & Van Hulle, M. M. Code-modulated visual evoked potentials using fast stimulus presentation and spatiotemporal beamformer decoding. Sci. Rep.https://doi.org/10.1038/s41598-017-15373-x (2017).
Article PubMed PubMed Central Google Scholar
Spüler, M., Rosenstiel, W. & Bogdan, M. Online adaptation of a c-VEP brain–computer interface(BCI) based on error-related potentials and unsupervised learning. PLoS ONE 7, e51077. https://doi.org/10.1371/journal.pone.0051077 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Cecotti, H. A self-paced and calibration-less SSVEP-based brain–computer interface speller. IEEE Trans. Neural Syst. Rehabil. Eng. 18, 127–133. https://doi.org/10.1109/TNSRE.2009.2039594 (2010).
Article PubMed Google Scholar
Gembler, F., Stawicki, P. & Volosyak, I. Autonomous parameter adjustment for SSVEP-based BCIs with a novel BCI wizard. Front. Neurosci. https://doi.org/10.3389/fnins.2015.00474 (2015).
Article PubMed PubMed Central Google Scholar
Volosyak, I., Rezeika, A., Benda, M., Gembler, F. & Stawicki, P. Towards solving of the illiteracy phenomenon for vep-based brain-computer interfaces. Biomed. Physi. Eng. Express (in press).
Brennan, C. et al. Performance of a steady state visual evoked potential and eye gaze hybrid brain-computer interface on participants with and without a brain injury. IEEE Trans. Hum. Mach. Syst. (in press).
Hwang, H.-J. et al. Development of an SSVEP-based BCI spelling system adopting a QWERTY-style LED keyboard. J. Neurosci. Methods 208, 59–65. https://doi.org/10.1016/S1388-2457(02)00057-30 (2012).
Article PubMed Google Scholar
Nagel, S. & Spüler, M. Asynchronous non-invasive high-speed BCI speller with robust non-control state detection. Sci. Rep. 9, 8269. https://doi.org/10.1038/s41598-019-44645-x (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Gembler, F., Benda, M., Saboor, A. & Volosyak, I. A multi-target c-VEP-based BCI speller utilizing n-gram word prediction and filter bank classification. In 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), 2719–2724, https://doi.org/10.1109/SMC.2019.8914235 (2019).
Renton, A. I., Mattingley, J. B. & Painter, D. R. Optimising non-invasive brain–computer interface systems for free communication between naïve human participants. Sci. Rep. https://doi.org/10.1038/s41598-019-55166-y (2019).
Article PubMed PubMed Central Google Scholar
Chen, X., Wang, Y., Gao, S., Jung, T.-P. & Gao, X. Filter bank canonical correlation analysis for implementing a high-speed SSVEP-based brain–computer interface. J. Neural Eng. 12, 046008 (2015).
Article ADS Google Scholar
Nakanishi, M. et al. Enhancing detection of ssveps for a high-speed brain speller using task-related component analysis. IEEE Trans. Biomed. Eng. 65, 104–112 (2017).
Article Google Scholar
Gembler, F., Stawicki, P. & Volosyak, I. Exploring the possibilities and limitations of multitarget SSVEP-based BCI applications. In Engineering in Medicine and Biology Society (EMBC), 2016 IEEE 38th Annual International Conference of the the IEEE Engineering in Medicine and Biology Society (EMBC), 1488–1491, https://doi.org/10.1109/EMBC.2016.7590991 (Orlando, FL, USA, 2016).
Bin, G. et al. A high-speed BCI based on code modulation VEP. J. Neural Eng. 8, 025015. https://doi.org/10.1088/1741-2560/8/2/025015 (2011).
Article ADS PubMed Google Scholar
Gembler, F., Stawicki, P. & Volosyak, I. Suitable Number of Visual Stimuli for SSVEP-Based BCI Spelling Applications. In Rojas, I., Joya, G. & Catala, A. (eds.) Advances in Computational Intelligence: 14th International Work-Conference on Artificial Neural Networks, IWANN 2017, Cadiz, Spain, June 14-16, 2017, Proceedings, Part II, 441–452, https://doi.org/10.1007/978-3-319-59147-6_38 (Springer International Publishing, Cham, 2017).
Hotelling, H. Relations between two sets of variates. Biometrika 28, 321–377. https://doi.org/10.1016/S1388-2457(02)00057-33 (1936).
Article MATH Google Scholar
Mondini, V., Mangia, A. L., Talevi, L. & Cappello, A. Sinc-windowing and multiple correlation coefficients improve SSVEP recognition based on canonical correlation analysis. Comput. Intell. Neurosci. 1–11, 2018. https://doi.org/10.1155/2018/4278782 (2018).
Article Google Scholar
Gembler, F., Stawicki, P., Saboor, A. & Volosyak, I. Dynamic time window mechanism for time synchronous VEP-based BCIs—Performance evaluation with a dictionary-supported BCI speller employing SSVEP and c-VEP. PLoS ONE 14, e0218177. https://doi.org/10.1016/S1388-2457(02)00057-35 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stawicki, P., Gembler, F., Rezeika, A. & Volosyak, I. A novel hybrid mental spelling application based on eye tracking and SSVEP-based BCI. Brain Sci. 7, 35. https://doi.org/10.3390/brainsci7040035 (2017).
Article PubMed Central Google Scholar
Ryan, D. B. et al. Predictive spelling with a P300-based brain–computer interface: Increasing the rate of communication. Int. J. Hum.-Comput. Interact. 27, 69–84. https://doi.org/10.1080/10447318.2011.535754 (2010).
Article Google Scholar
Kaufmann, T., Völker, S., Gunesch, L. & Kübler, A. Spelling is just a click away—a user-centered brain–computer interface including auto-calibration and predictive text entry. Front. Neurosci. https://doi.org/10.3389/fnins.2012.00072 (2012).
Article PubMed PubMed Central Google Scholar
Volosyak, I., Moor, A. & Gräser, A. A Dictionary-Driven SSVEP Speller with a Modified Graphical User Interface. In Cabestany, J., Rojas, I. & Joya, G. (eds.) Advances in Computational Intelligence, vol. 6691, 353–361, https://doi.org/10.1007/978-3-642-21501-8_44 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2011).
Gembler, F. & Volosyak, I. A novel dictionary-driven mental spelling application based on code-modulated visual evoked potentials. Computers https://doi.org/10.3390/computers8020033 (2019).
Article Google Scholar
Kohavi, R. A Study of Cross-validation and Bootstrap for Accuracy Estimation and Model Selection. In Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 2, IJCAI’95, 1137–1143 (Morgan Kaufmann Publishers Inc., Montreal, Quebec, Canada, 1995).
Suefusa, K. & Tanaka, T. Asynchronous brain-computer interfacing based on mixed-coded visual stimuli. IEEE Trans. Biomed. Eng. 65, 2119–2129. https://doi.org/10.1109/TBME.2017.2785412 (2018).
Article PubMed Google Scholar
Volosyak, I., Gembler, F. & Stawicki, P. Age-related differences in SSVEP-based BCI performance. Neurocomputing 250, 57–64. https://doi.org/10.1016/j.neucom.2016.08.121 (2017).
Article Google Scholar
Hsu, H.-T. et al. Evaluate the feasibility of using frontal SSVEP to implement an SSVEP-based BCI in young, elderly and ALS groups. IEEE Trans. Neural Syst. Rehabil. Eng. 24, 603–615. https://doi.org/10.1109/TNSRE.2015.2496184 (2016).
Article PubMed Google Scholar
Carvalho, S. N. et al. Effect of the combination of different numbers of flickering frequencies in an SSVEP-BCI for healthy volunteers and stroke patients. In 2015 7th International IEEE/EMBS Conference on Neural Engineering (NER), 78–81, https://doi.org/10.1109/NER.2015.7146564 (Montpellier, France, 2015).
Allison, B. et al. BCI demographics: How many (and what kinds of) people can use an SSVEP BCI?. IEEE Trans. Neural Syst. Rehabil. Eng. 18, 107–116. https://doi.org/10.1109/TNSRE.2009.2039495 (2010).
Article PubMed Google Scholar
Allison, B. Z. et al. Towards an independent brain–computer interface using steady state visual evoked potentials. Clin. Neurophysiol. 119, 399–408. https://doi.org/10.1088/1741-2560/8/3/0360202 (2008).
Article PubMed PubMed Central Google Scholar
Nakanishi, M., Wang, Y. & Jung, T.-P. Session-to-Session Transfer in Detecting Steady-State Visual Evoked Potentials with Individual Training Data. In Schmorrow, D. D. & Fidopiastis, C. M. (eds.) Foundations of Augmented Cognition: Neuroergonomics and Operational Neuroscience, 253–260 (Springer International Publishing, Cham, 2016).
Shirzhiyan, Z. et al. Introducing chaotic codes for the modulation of code modulated visual evoked potentials (c-VEP) in normal adults for visual fatigue reduction. PLoS ONE 14, e0213197. https://doi.org/10.1088/1741-2560/8/3/0360203 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gembler, F. W., Rezeika, A., Benda, M. & Volosyak, I. Five shades of grey: Exploring quintary m-sequences for more user-friendly c-vep-based bcis. Computational Intelligence and Neuroscience2020, (2020).
Başaklar, T., Tuncel, Y. & Ider, Y. Z. Effects of high stimulus presentation rate on EEG template characteristics and performance of c-VEP based BCIs. Biomed. Phys. Eng. Exp. 5, 035023. https://doi.org/10.1088/2057-1976/ab0cee (2019).
Article Google Scholar
Gembler, F. et al. Effects of Monitor Refresh Rates on c-VEP BCIs. In Ham, J., Spagnolli, A., Blankertz, B., Gamberini, L. & Jacucci, G. (eds.) Symbiotic Interaction. Symbiotic 2017. Lecture Notes in Computer Science, Vol 10727., 53–62, https://doi.org/10.1007/978-3-319-91593-7_6 (Springer, Cham, 2018).
Pan, J., Gao, X., Duan, F., Yan, Z. & Gao, S. Enhancing the classification accuracy of steady-state visual evoked potential-based brain–computer interfaces using phase constrained canonical correlation analysis. J. Neural Eng. 8, 036027. https://doi.org/10.1088/1741-2560/8/3/0360205 (2011).
Article ADS PubMed Google Scholar
Oostenveld, R. & Praamstra, P. The five percent electrode system for high-resolution EEG and ERP measurements. Clin. Neurophysiol. 112, 713–719. https://doi.org/10.1088/1741-2560/8/3/0360206 (2001).
Article CAS PubMed Google Scholar
Gembler, F. et al. A Dictionary Driven Mental Typewriter Based on Code-Modulated Visual Evoked Potentials (cVEP). In 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 619–624, https://doi.org/10.1109/SMC.2018.00114 (Miyazaki, Japan, 2018).
Eckart, T. & Quasthoff, U. Statistical Corpus and Language Comparison on Comparable Corpora. In Sharoff, S., Rapp, R., Zweigenbaum, P. & Fung, P. (eds.) Building and Using Comparable Corpora, 151–165, https://doi.org/10.1007/978-3-642-20128-8_8 (Springer, Berlin, 2013).

Download references

Acknowledgements

This study was supported by the European Fund for Regional Development (EFRD or EFRE in German) under Grant IT-1-2-001. The authors thank the participants of this research study and the student assistants for their help in conducting of the experiments. Special thanks goes to Abdul Saboor for his support with the programming.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Rhine-Waal University of Applied Sciences, Technology and Bionics, 47533, Kleve, Germany
Felix W. Gembler, Mihaly Benda, Aya Rezeika, Piotr R. Stawicki & Ivan Volosyak

Authors

Felix W. Gembler
View author publications
You can also search for this author in PubMed Google Scholar
Mihaly Benda
View author publications
You can also search for this author in PubMed Google Scholar
Aya Rezeika
View author publications
You can also search for this author in PubMed Google Scholar
Piotr R. Stawicki
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Volosyak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.G., M.B., A.R., P.S., and I.V. conceived the experiment(s), F.G., M.B., A.R., P.S. conducted the experiment(s), F.G., M.B, and A.R. analysed the results. All authors were involved in writing of the manuscript.

Corresponding author

Correspondence to Ivan Volosyak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gembler, F.W., Benda, M., Rezeika, A. et al. Asynchronous c-VEP communication tools—efficiency comparison of low-target, multi-target and dictionary-assisted BCI spellers. Sci Rep 10, 17064 (2020). https://doi.org/10.1038/s41598-020-74143-4

Download citation

Received: 01 April 2020
Accepted: 28 September 2020
Published: 13 October 2020
DOI: https://doi.org/10.1038/s41598-020-74143-4

This article is cited by

Blink-To-Live eye-based communication system for users with speech impairments
- Mohamed Ezzat
- Mohamed Maged
- Sara El-Metwally
Scientific Reports (2023)
Riemannian geometry-based transfer learning for reducing training time in c-VEP BCIs
- Jiahui Ying
- Qingguo Wei
- Xichen Zhou
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.