Identifying autism spectrum disorder symptoms using response and gaze behavior during the Go/NoGo game CatChicken

Putra, Prasetia Utama; Shima, Keisuke; Alvarez, Sergio A.; Shimatani, Koji

doi:10.1038/s41598-021-01050-7

Download PDF

Article
Open access
Published: 10 November 2021

Identifying autism spectrum disorder symptoms using response and gaze behavior during the Go/NoGo game CatChicken

Prasetia Utama Putra¹,
Keisuke Shima²,
Sergio A. Alvarez³ &
…
Koji Shimatani⁴

Scientific Reports volume 11, Article number: 22012 (2021) Cite this article

2433 Accesses
6 Citations
11 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 08 December 2021

This article has been updated

Abstract

Previous studies have found that Autism Spectrum Disorder (ASD) children scored lower during a Go/No-Go task and faced difficulty focusing their gaze on the speaker’s face during a conversation. To date, however, there has not been an adequate study examining children’s response and gaze during the Go/No-Go task to distinguish ASD from typical children. We investigated typical and ASD children’s gaze modulation when they played a version of the Go/No-Go game. The proposed system represents the Go and the No-Go stimuli as chicken and cat characters, respectively. It tracks children’s gaze using an eye tracker mounted on the monitor. Statistically significant between-group differences in spatial and auto-regressive temporal gaze-related features for 21 ASD and 31 typical children suggest that ASD children had more unstable gaze modulation during the test. Using the features that differ significantly as inputs, the AdaBoost meta-learning algorithm attained an accuracy rate of 88.6% in differentiating the ASD subjects from the typical ones.

Atypical processing pattern of gaze cues in dynamic situations in autism spectrum disorders

Article Open access 08 March 2022

Social attention and social-emotional modulation of attention in Angelman syndrome: an eye-tracking study

Article Open access 28 February 2023

Large scale validation of an early-age eye-tracking biomarker of an autism spectrum disorder subtype

Article Open access 11 March 2022

Introduction

People often misinterpret invisible disorder symptoms in their children, such as inattentiveness and difficulty communicating with other people, as willful misconduct or misbehavior. The prevalence of clinical disorders, however, is high. In Japan, the prevalence of Autism Spectrum Disorder (ASD) symptoms among children has been estimated to be between 1.9 and $9.3\%$ based on parent and teacher reports ¹; in the USA, about 1 of 54 children was diagnosed with ASD in 2020 ², while in 2016, a study found that 9.41% of children had Attention Deficit Hyperactivity Disorder (ADHD) symptoms³.

Since the conventional diagnosis method requires comprehensive tests that are time-consuming, many studies have proposed to automatically distinguish disordered children from typical ones by utilizing machine learning with biosignals such as EEG⁴ or fMRI⁵. Those methods extracted features from children’s brain activity to identify disorder symptoms.

In contrast, psychiatry studies recognize disorder symptoms by employing behavioral tests, e.g., Go/NoGo^6,7,8 and visual attention tests. Previous studies have discovered a significant difference between ASD and typical children during a Go/NoGo task. The task requires a subject to react to the Go stimulus and inhibit their reaction to the NoGo stimulus⁸. The stimuli can be represented by visual objects with different colors and shapes^6,8 or by sounds with different frequencies⁷. The task evaluates the subjects by measuring their percentages of correct responses and omission errors and their average response time and its variability. Children with ASD tend to perform worse⁹ and have high response time variability¹⁰ than typical children during the task, which may be caused by variability in neural activations¹¹.

Moreover, studies on ASD children’s gaze behavior have observed that the ASD group was slower to adjust their gaze to the stimulus position during eye-tracking measurement of joint attention¹², and faced difficulty in modulating their gaze during face-to-face conversation¹³. Previous works have found that temporal features of gaze are more informative than global measurements in differentiating ASD from typically developing children. Swanson and Siller¹² have found that ASD and typical children allocated the same amount of time to key areas but their duration of the first fixation to the target differed. Likewise, studies of gaze-shift¹⁴ and gaze-to-stimulus-distance¹⁵ have signified that gaze behavior of ASD and typical children differed significantly in the spatio-temporal aspect.

Other researchers extend those works by employing machine learning, and children’s behavior features to identify ASD symptoms^16,17. They asked participants to participate in face-to-face conversation¹⁸ or to complete visual tasks such as viewing a sequence of face images¹⁶ or identifying directional cues¹⁷. Then they utilized spatial features extracted from children’s eye movement distribution to recognize ASD symptoms in children.

This study aims to investigate the response and gaze behavior of children during the Go/NoGo task and to utilize features extracted from those measurements to identify ASD symptoms that suggest difficulty in inhibiting action and point of view¹⁹. Contrary to previous works on ASD subjects’ gaze behavior, which have focused on global summary measures related to the gaze and stimulus positions, this study examined the intrinsic spatio-temporal structure of the gaze trajectories in greater detail by employing entropy-based and autoregressive features.

Using the CatChicken game²⁰, we measured 21 ASD (10 with and 11 without ADHD) and 31 typical children’s response and gaze modulation; the use of a standardized task minimizes the bias that often occurs in face-to-face conversation. Spatial and gaze-adjustment features were extracted to represent each child’s response, performance during the game, and gaze behavior. Statistical comparisons between typical and ASD disorder children were performed using Student t and Mann-Whitney U tests²¹. Additional details of the statistical methodology appear in the Methods section. The AdaBoost algorithm was employed²² to distinguish the features of ASD disorder children from those of typical children. Experiments employing spatial features, gaze-adjustment features, and a combination of them were conducted to identify differentiating features. Classification performance of the model was evaluated with accuracy, Matthews Correlation Coefficient (MCC)²³, and Area Under the Curve (AUC)²⁴ metrics and validated using three-fold cross-validation.

Results

Spatial features

Statistical analysis demonstrated a significant difference (by both Student t and Mann-Whitney U tests) between typical and ASD groups for eight spatial features ($n = 52$, $p < 0.01$ corrected by Benjamini-Hochberg at the level 0.05; see Supplementary Materials for further detail): variance of fixation time, average and entropy of gaze acceleration, spectral entropy of gaze-to-object-distance, sample entropy of gaze distance, gaze angle, gaze-to-obj-distance, and velocity. For all such variables, the mean of the second group (ASD) was larger than that of the first group (typical); the corresponding effect sizes were large²⁵ ($|d| > 0.8$), except for average acceleration, for which a moderately large effect size ($d = -0.763$) was observed.

In contrast, although a medium effect size ($|d| > 0.5$) was observed for Go positive and negative percentages, response-time variance (RT-var), and gaze-acceleration standard deviation, the differences between typical and ASD groups for those variables were insignificant. Nevertheless, within-group mean values indicated that ASD children more often responded incorrectly with higher response-time variance than the typical subjects.

A comparison between typical and ASD children without ADHD yielded similar results. The results, however, also suggested a significant difference in Go positive and negative percentage between those groups ($n = 42$, $p < 0.008$). Typical children responded correctly towards the Go stimulus more often than ASD participants without ADHD ($d > 0.99$). A significant difference in RT-var (Mann-Whitney $p = 0.012$) was observed between those groups.

Different results were observed in the statistical comparison between typical and ASD subjects with ADHD. An insignificant difference ($n = 41$, $p > 0.02$) was observed for spectral entropy of gaze-to-object-distance. The Student t-test suggested a significant difference ($p < 0.012$) of gaze acceleration variance between those groups. The statistical tests also showed that ASD children without ADHD did not differ from the children with ADHD ($n = 21$, $p > 0.09$). However, the latter had a lower gaze-fixation time than the former (moderate effect size, $d = 0.726$).

Similarly to the results of comparison between typical and ASD groups described above, statistical analysis by ANOVA revealed that typical and ASD participants with and without ADHD differed significantly in the same eight features($n = 52$, $p < 0.02$). The results also showed a significant difference in gaze acceleration variance ($p = 0.003$) and in the average and variance of gaze distance and velocity ($p <= 0.025$).

Gaze-adjustment features

A significant difference ($n = 208$, $p < 0.023$) between the groups was observed in the mean values of $\alpha$, $\theta _1$ and $\theta _2$ by the Mann-Whitney U test. In contrast, both the Student t-test and effect size ($|d| < 0.2$) suggested that ASD children’s gaze-adjustment features did not differ from the typical ones.

Separating gaze-adjustment features according to response types (Go-positive, Go-negative, NoGo-positive, and NoGo-negative) yielded statistically significant differences ($n = 52$, $p < 0.023$) between typical and ASD children in all auto-regressive coefficients by the Mann-Whitney U test, as well as greater effect size (mean |d| > 0.4). The t-test results also signified that ASD gaze modulation differed when they responded incorrectly to the Go stimulus and correctly to the NoGo stimulus ($p < 0.007$).

Furthermore, extrapolation of the gaze-to-obj distance in time using the average values of the autoregressive coefficients suggested that separating the features (Fig. 1C–J) produced a more obvious difference between the groups than mixing them (Fig. 1A,B). Typical children adjusted their gaze to the stimulus position faster when they responded correctly to the Go and NoGo characters and when they reacted incorrectly to the latter stimulus (Fig. 1C,G,I); the velocity of their extrapolated gaze-adjustment (Fig. 1D,H,J) was ±0.0014 faster compared to the ASD children (the velocity of extrapolated gaze-adjustment was computed by averaging the negative of the first derivative of the extrapolated gaze-to-obj distance over time). Nevertheless, typical children modulated their gaze in a similar way to the ASD subjects when they missed the Go stimulus (Fig. 1E,F).

The Student t-test between the typical and ASD children without ADHD suggested those groups differed when they responded correctly towards the Go and NoGo stimuli ($n = 52$, $p <= 0.004$). Comparison results of typical children to ASD children with ADHD indicated that the former responded differently from the latter during Go-negative and NoGo-positive ($n = 52$, $p <= 0.017$). The results also demonstrated that ASD children with and without ADHD did not differ significantly. Moreover, the ANOVA test showed a significant difference ($n = 52$, $p < 0.04$) among those three groups for gaze-adjustment features when the subjects responded correctly to the NoGo stimulus; the difference was insignificant in the other conditions.

The extrapolation results of ASD children with and without ADHD symptoms (Fig. 2A–J) suggested that the former tended to adjust their gaze to the stimulus position slightly faster than the latter, with respective extrapolation gaze-adjustment velocities of 0.0153 and 0.0156, respectively. Both groups had lower gaze-modulation speed compared to typical participants, whose average velocity was 0.0164.

Classification

Classification results (Table 1) showed that when using only spatial features, the accuracy of the AdaBoost model was 6.1% lower than when utilizing gaze-adjustment features. A significant increase in the model’s recognition rate occurred when separating the gaze-adjustment features based on response types (response-type-gaze features). The classification rate was 17.1% higher than when employing gaze-adjustment features.

Table 1 The AdaBoost algorithm obtained high performance when employing gaze-related features.

Full size table

Using both response-type-gaze and significant spatial features, the model obtained an insignificant increase in its accuracy rate, which was 0.3% higher than when using the former features alone. Combining the gaze features with significant spatial and game performance features, however, decreased the accuracy rate by 4.1%.

The MCC score agreed with the accuracy results: combining response-type-gaze features and significant spatial features yielded a 0.01 higher MCC score than using only response-type-gaze features. Even though the increase was insignificant, the high MCC score indicated that the model’s prediction results strongly correlated with the ground-truth labels, and the model could reliably recognize both the typical and ASD children. Besides, the AdaBoost obtained AUC scores higher than 0.85 when utilizing response-type-gaze features and when combining them with significant spatial and performance features. This suggested that high performance could be expected from the algorithm when employing those features²⁶.

The confusion matrix (Table 2), however, shows that the model more frequently misclassified typical subjects as ASD (false-positive) than it misclassified ASD subjects as typical (false-negative). Visualization of the features through star plots (Fig. 3) reveals higher mean values and variability in the misclassified typical subjects’ features than the correctly-classified subjects’. On the other hand, the features of misclassified ASD subjects show lower mean values and variability.

Table 2 False-positive rate of the AdaBoost algorithm was higher than false-negative rate.

Full size table

In differentiating the typical group from ASD children with and without ADHD using gaze-adjustment and significant spatial features (Table 3), the AdaBoost algorithm achieved a 17.5% lower accuracy rate than when classifying typical and ASD populations. Also, combining the gaze-adjustment features with significant features selected by ANOVA resulted in a 30.9% lower recognition rate. The MCC scores and the confusion matrix results (Table 4) suggested that the model had poor performance in recognizing ASD children with and without ADHD symptoms.

Table 3 Although the AdaBoost algorithm achieved a competitive accuracy rate, its low MCC score indicated high false positive and negative.

Full size table

Table 4 Misclassification of ASD populations were higher than that of typical subjects.

Full size table

Discussion

This study evaluated whether features extracted from response and gaze behavior during Go/NoGo task can be used to identify ASD symptoms in children. We utilized the CatChicken game²⁰ to measure the response and gaze modulation of 21 ASD and 31 typical children. During the game, the children should respond to the chicken character (Go stimulus) by pressing a space bar but should inhibit their action towards the cat character (NoGo stimulus).

The game outputs four variables: response types and times, and the stimulus and gaze locations over time. Statistical analyses using Student t and Mann-Whitney U tests were performed on spatial and gaze-adjustment features extracted from those variables.

As we expected, we found a significant difference in gaze modulation between ASD and typical children. Previous studies found that ASD children’s gaze movement differed significantly from typical children’s in terms of variability of the gaze pattern²⁷, the fixation time spent on the stimulus¹³, and duration of the first fixation to the target¹². Our results suggest lower accuracy and greater randomness of the ASD subjects’ visual tracking of the target: the relative gaze-to-object difference was less steady over time than for typical subjects, and predictability of ASD subjects’ gaze was lower as measured by sample entropy of both distance and angle. A greater irregularity of gaze distance and angle may indicate that ASD children over-interpreted the information of a given stimulus, thereby causing more unintentional viewing behavior²⁸. The higher value of ASD children’s gaze-to-object entropy suggested less structured tracking in a spatial sense, while a greater value of sample entropy value demonstrated lowered predictability of the gaze-to-object difference as a function of time. Likewise, greater spectral entropy indicated less structure of the frequency content of ASD subjects’ gaze signals.

Compared to typical subjects, we observed that ASD subjects without ADHD symptoms tended to perform worse while the children with ADHD had less structured gaze modulation. Nevertheless, the results demonstrated that the game performance of ASD subjects without ADHD did not differ significantly from that of the subjects with ADHD symptoms. We found that the ASD children without ADHD symptoms tended to fixate their gaze on the stimulus position longer than the children with ADHD symptoms.

Second, statistical analysis using the Mann-Whitney U test demonstrated significant differences in the gaze-adjustment features among the groups. The extrapolation results show that children with ASD symptoms, on average, adjusted their gaze more slowly to the stimulus location than their typical peers. When ASD children reacted incorrectly towards the stimulus, their extrapolation results tended to be slower at the beginning; these results were also observed when comparing the ASD group with ADHD to the group without it. The results, however, did not signify that typical subjects’ gaze movement was faster, as an insignificant difference in gaze velocity was observed between the groups.

In contrast, gaze trajectory area and the entropy of gaze distribution showed no significant differences between typical and ASD populations. While our previous work²⁰ found greater dispersion of gaze movement in ASD children, the results of the present paper suggest that global measures of gaze behavior are similar in the two groups. The discrepancy may be due to the greater size of the sample available for the present paper. Swanson and Siller¹² also observed that total gaze allocation of ASD and typical children did not differ but their temporal gaze movement (duration of the first fixation to the target) did. Our present findings and theirs provide compelling evidence that the gaze behavior of ASD children may differ from the typically developing children in the temporal aspect; the difference was more pronounced between typical and ASD children with ADHD symptoms.

Another major finding of this study was that the performance of the game and response time of typical and ASD children did not differ significantly. Even though we observed greater Go and NoGo negative percentages and higher response time variance in ASD children, statistical analyses demonstrated an insignificant difference between the groups. The results contradict previous works that observed greater RT variability of go-response⁹ and higher omission error¹⁰ in ASD population than their typical peers. One interpretation of these findings is that the insignificant difference of RT occurred because this work computed RT of both Go and NoGo trials; Lee et.al²⁹ found similar RT for ASD and typical subjects. Outlier removal in the pre-processing step of the present work might affect the statistical results of game performance and RT variability, as well. Nevertheless, we observed that the ASD group without ADHD had a lower Go-positive score and higher response time variance than typical subjects.

Lastly, our classification results suggest the promising performance of more detailed spatio-temporal features extracted from children’s gaze during the Go/NoGo task. Compared to previous works utilizing global features^16,17, our model yielded competitive results. Nevertheless, the accuracy rate of our model was lower than that of the previous work utilizing features from visual fixation and session length¹⁸. The discrepancy might be affected by different features, experiment protocol, and subjects used in the previous study. Furthermore, our classification results show that using responses and gaze features together produced a higher recognition rate in differentiating ASD from typical children than using either type of information alone, as indicated by high accuracy rate, MCC, and AUC scores. The results, however, revealed that a promising performance could not be achieved to identify ADHD symptoms in ASD children. The results might be affected by unbalanced labels: the training data of each fold comprised 58.8% typical, 20.6% ASD with ADHD, and 20.6% ASD without ADHD.

Two limitations of this work are the relatively small sample size and the limited number of features. Also, since this work only measured response behavior by calculating response time and game performance, which represented the execution stage of response, the difference between groups in the preparation stage of response is unclear. Future studies should measure both the preparation and execution stages of participants’ responses. It would be of interest to consider subjects across a broader age range to enable capturing a greater variety of behaviors. This study involved 22 ASD and 35 typical children with a narrow age range and found that among these subjects only one gaze modulation existed: all subjects adjusted their gaze to the stimulus position. In contrast, the results of our previous work²⁰ suggested that older subjects were of two types by viewing behavior: ones who adjusted their gaze (55.9% of total subjects) and ones who concentrated on the middle of the screen (44.1% of total subjects).

Conclusion

This study examined the difference in gaze behavior and response features of ASD and typical children during the Go/NoGo task. Contrary to our hypothesis, the experimental results of this paper showed higher performance in differentiating ASD from typical children using gaze behavior alone, as compared with a combination of gaze behavior features with features extracted from participants’ responses. Even though the use of the features showed promising performance in identifying ASD symptoms, it yielded poor performance in identifying ADHD symptoms in ASD children.

Methods

CatChicken game

The CatChicken²⁰ game was utilized to measure children’s response and gaze movement during a Go/NoGo task. The Go/NoGo task was used to measure a person’s inhibitory control; a subject should respond to the Go stimulus but inhibit their action towards the NoGo stimulus⁸. The game represented the Go and NoGo stimuli as “Chicken” and “Cat” characters, respectively. A stimulus appeared randomly in one of nine locations for a fixed duration of time (Fig. 4). The interval between two consecutive stimuli was set by configuring the minimum and maximum waiting-time values.

The system outputted the user’s response types and time, and stimulus and eye locations on the monitor (Fig. 5). A user responded to the stimulus by pressing the spacebar. The system categorized a subject’s response as one of four types: Go-positive if the subject responded to the Go character; Go-negative if they missed it; NoGo-positive if they inhibited their action in response to the NoGo character; NoGo-negative if they reacted to it. Different audio feedback was given when the subject responded correctly and incorrectly towards the stimulus. The system was equipped with a Tobii 4C eye tracker that recorded the user’s eye position on the monitor continuously. The eye tracker sampling rate was 90 Hz (interlaced), and its operating distance was 50 cm to 95 cm. The stimulus and eye locations on the monitor were normalized to the unit interval [0, 1] by dividing the pixel coordinates by the window’s coordinate length.

Participants

Participants involved 22 autism spectrum disorder children (16 male and 6 female) and 35 typical children (24 male and 11 female) with an average age of five years from two local schools in Japan (Table 5). All ASD subjects attended special education school and had been diagnosed by clinicians; 10 (+ one suspected) ASD children also had attention deficit symptoms, and seven of them were identified as having hyperactivity as well. Both ASD and typical children did not have any physical disorder. One ASD child (male) and four typical children (1 male and 3 female) were excluded because their data were corrupted. Therefore, this study only processed 21 ASD and 31 typical children’s data.

Table 5 Differences for age and Development Quotient (DQ) were insignificant ($p > 0.05$).

Full size table

During the experiment, the subjects were seated in front of a notebook equipped with an eye tracker and web camera (Fig. 6). They responded to the stimulus by pressing the spacebar on the keyboard. The proportion of the Go and NoGo stimuli was uniform and the order of appearance was set in advance; their appearance time was 700 ms; the minimum and maximum of the waiting period were 700 and 1000 ms.

Before starting the experiment, the eye tracker was calibrated and an instructor explained the game and its rules to the subject; the instructor asked the subjects to respond immediately when the stimulus appeared. All subjects participated in a one-minute training session before taking a four-minute evaluation.

Ethics statement and consent

Before participating in the experiment, informed consent was obtained from teachers and parents on behalf of the children. The study was approved by the Research Ethics Committee of the Prefectural University of Hiroshima (letter no: 15MH070) and was conducted in accordance with the amended Declaration of Helsinki.

Features

Figure 7 shows the pipeline for extracting spatial and gaze-adjustment features from response and gaze data. Before extracting the features, preprocessing was performed to eliminate noise and redundant data.

The responses whose RT was less than a threshold were considered as outliers; 6.6% of typical and 7.3% of ASD data were removed. The threshold was the RT’s median absolute deviation³⁰ (104.75 ms) multiplied by a constant scale factor of the normal distribution (1.4826): 1.4826 $\times$ 104.75 = 155.30 ms. The data were down-sampled from 144 to 72 Hz to remove redundancy; a Savitzky-Golay filter³¹ ($n = 5$ and $poly = 2$) was used to perform smoothing to prevent artifacts during numerical differentiation.