Introduction

Autism spectrum disorder (ASD) is a pervasive neurodevelopmental disability marked by communicative, social, and behavioral impairments1. Self-injurious behavior (SIB), including head banging and self-hitting2, is reported in roughly half of people with ASD3,4,5. SIB is a leading cause of hospitalization among people with ASD and can cause physical damage such as lacerations and contusions2,6. These behaviors can be repetitive or rhythmic, though presentations vary widely2. Applied behavior analysis thus recommends that caregivers perform a functional assessment (FA) to determine potential triggers of SIB7,8,9. To complete an FA, clinicians or trained caregivers observe and record details about events preceding, during, and following SIB9. The accuracy of an FA can suffer, though, if caregivers are not adequately trained or if events are recalled only after they occurred10,11. Observations can also differ between caregivers and clinicians, and can be challenging to track consistently due to other stressors and contextual influences on behavior8,10,12,13,14. FAs require detailed observations and extensive note-taking on complex data, making the process time-consuming15. Further, the behavior of interest may not occur during the observation period of an FA. The lack of SIB during an FA may be due to the time window of assessment or the absence of triggers in a specific environment16, which in turn may necessitate a repeated FA and add to the required completion time. Thus, traditional manual methods are often inefficient10,14,15,16, and as such do not support the widespread need for care across contexts.

An accurate SIB tracking system might overcome these challenges if the system could identify triggers and inform and evaluate management. Sensing technology has the potential to comprehensively, objectively, and accurately track movement for people with SIB, as supported by previous research on behavioral monitoring for non-SIB behaviors in ASD17,18,19. Nonwearable and wearable technologies, such as embedded camera systems or accelerometers in everyday items (e.g., cellphones), could record data continuously for SIB monitoring without requiring high levels of caregiver or clinician compliance19,20. Wearable accelerometers address limitations of nonwearable technology, such as restricted field of view and privacy concerns21,22, and were selected for the current study to reflect caregiver preferences from our previous work23. Caregivers in that work indicated a need for data collection methods applicable in school and at home, and they suggested that children with SIB would accept wearable technology if noninvasive, comfortable, and discreet attachment methods were possible. Accelerometers have also been shown to provide sufficient data to detect repetitive motions among individuals with ASD, with 80–97% accuracy using wrist and/or back sensors24,25, though their use for SIB detection has not been previously explored.

In conjunction with wearable technology, SIB monitoring requires effective modeling. Earlier findings support the feasibility of tracking behaviors in ASD, specifically stereotypical motor movements (SMM) such as hand-flapping or rocking, which may relate to SIB and be similarly repetitive and rhythmic2. Machine learning classifiers applied to accelerometry data—including decision trees26, neural networks27,28, and support vector machines7—detected SMM with accuracies up to 99%. However, there is very limited extant evidence for SIB classification. Previous work on SIB detection, to our knowledge, is limited to two studies that either created classifiers from trained actors imitating aggressive behaviors18 or focused on SMM with one example that could be considered SIB29. The former study extended models that were trained on imitated movement to one child with SIB, and found that classification with individual accelerometry data yielded accuracies on the order of 60–70%18. Classifiers might have performed better if trained on natural data (versus simulated SIB), and their generalizability is unknown when used on more than one participant with more than one behavior. One study also examined aggression towards others among youth with ASD30. Naturally-collected episodes of aggression were classified with high accuracy using physiological and movement sensors (area under the curve: AUC = 71–80 for individuals; AUC = 69 for group performance), though SIB was not included in the activities of interest30. Importantly, sensory aversions prevalent in SIB31 may preclude the physiological sensors that require skin contact, which were used in Ozdenizci et al.30, so other sensor and classification methods may be preferable for our application.

Classification models in earlier studies were typically specific to each participant, with training and testing completed on each individual18,29. When group-level models were employed, accuracy levels tended to decrease, for example from 80% for individual models to 69% for group-level models in Ozdenizci et al.30. Additionally, machine-learning-based classification methods used in earlier studies (e.g., SVMs or neural networks) can have low interpretability, and other more accessible models, such as regression, should also be explored30. Interpretable models could provide information about predictors of SIB onset, which would be particularly relevant for clinicians and caregivers seeking to manage this behavior (see Cantin-Garside et al.23, Dunlap et al.12 and Williams et al.9 for further discussion on the need to capture triggers of SIB). Multilevel regression models with varying intercepts and slopes could account for the variability among individual diagnoses of ASD and in presentations of behavior32,33, though such a model has yet to be applied to SIB.

In our previous study, we examined featureless classification methods to detect the presence of SIB and to classify the type of SIB among individuals and groups of individuals with ASD34. Using data from wearable accelerometers as input, accuracy was up to 99.1% for individual models and up to 94.6% for models specific to SIB type. However, the detection models that incorporated all participant data had substantially poorer performance (accuracy = 48.8%), likely due to inter- and intra-individual variability in SIB and activity levels. The current study is a secondary analysis of the data described in Cantin-Garside et al.34, and addresses this limitation using a multilevel modeling technique that incorporates motor variability features.

Including additional features, such as metrics of movement variability, may optimize system performance. In general, variability can be described using linear measures already included in SMM classification27, such as the standard deviation, as well as using nonlinear dynamics measures, such as entropy35. Linear measures consider variability in systems to stem fundamentally from noise, and utilize statistical dispersion measures such as the standard deviation to quantify the variability in time-domain signals36. Nonlinear measures, on the other hand, are based in chaos theory and dynamical systems analysis and consider the evolution of changes in a system over time: entropic and fractal measures, for example, can quantify the temporal evolution of movement36. Dynamical systems theory suggests that human movement changes and evolves over time, as governed by a deterministic process37, and dynamical systems analyses separate variability in a movement process into chaotic vs. deterministic components36,38.

Prior work classifying SMM in ASD has used time- and frequency-domain features25, or has focused on relatively simple measures of variability such as the standard deviation and variance, with the latter yielding frequent false positives27. Nonlinear movement variability features could improve classification model performance by capturing the underlying variability in SIB movements39, even if the SIB changes between episodes. Dynamical systems theory may be relevant to SIB, since nonlinear components were found in the temporal patterns of SIB, though these patterns differed within and between individuals40. More complex temporal patterns also emerged in the presence of SIB40, and recent work suggests that movements become increasingly complex as a child with ASD transitions to an episode of SIB41. This complexity can be captured by measuring nonlinear variability in the movements of individuals with ASD and SIB, as further explained below in the description of nonlinear motor variability metrics.

Prior work also has found that nonlinear measures, such as entropy, are indicative of diagnostic status when applied to motor control in ASD. For example, children with ASD had decreased dynamical complexity during quiet stance compared to typically-developing children42,43, although people with stereotypy showed greater linear variability (standard deviation) during postural sway43. Given that they can distinguish between neurotypical and pathological movements, these methods could capture changes in pathology within an individual (e.g., detecting health changes such as early signs of aggression). Variability has also been associated with other pathological behaviors35,36,44 and the progression of health conditions45,46, and thus could reflect changing risk of SIB in ASD as well. To our knowledge, though, only one study employed a nonlinear approach (recurrence quantification analysis, described below) to classify motion in ASD, and found that the additional nonlinear features of movement variability improved classification accuracy by 5–9%39. Although the analysis in Großekathöfer et al.39 was performed on SMM, their results could generalize to SIB, which is similarly repetitive and rhythmic.

In summary, SIB is one of the most dangerous behaviors in ASD, and a monitoring system could address the limitations of traditional tracking methods. Predictive modeling with features capturing nonlinear motor variability has the potential to provide superior performance relative to the more traditional methods used in related ASD research. However, sensor-based behavioral monitoring for SIB remains largely unexplored. A long-term goal of our research is to develop a real-time SIB monitoring system that can collect continuous movement data, alert the caregiver before SIB onset, and assist in management methods (e.g., redirecting the individual with SIB towards a different task). To this end, we aimed in the present study to develop an interpretable and generalizable model to classify a variety of behaviors among a range of participants, specifically by:

  1. Utilizing dynamical systems theory to extract measures of nonlinear motor variability as features in an SIB prediction model

  2. Building a multilevel logistic regression model with variable intercepts and slopes to account for inter-individual variability

Materials and methods

Participants

Data used here were obtained in Cantin-Garside et al.34 and are briefly summarized below, with additional information in Supplementary Material. Children with SIB and ASD were recruited through the university-affiliated psychology clinic and through the authors’ networks. Caregivers were pre-screened to confirm inclusion criteria: (1) children aged 5–14 years, reflecting heightened aggression in childhood47,48; (2) diagnosis of ASD; (3) SIB episodes > 3/hour, to ensure multiple episodes during the 1–3 h sessions; (4) fluency in English; and (5) home within driving distance of the noted Center. Note that the last inclusion criterion required non-representative convenience sampling. All adult participants provided informed consent, and qualifying children (> 7 years of age and of sufficient developmental level) provided assent before any data collection. Caregivers provided informed consent for their children who were younger than 18 years of age. The Virginia Tech Institutional Review Board (IRB) reviewed and approved all experimental procedures. All experimental methods were completed in accordance with relevant guidelines and regulations.

Eleven participants (5–14 years, M = 9.5, SD = 3.0) and their caregivers completed the study. Sessions lasted 35–147 min, providing more than 1000 min of data and > 200 episodes of SIB. Ten of the 11 participants exhibited SIB (participants 1–4 and 6–11, denoted as “P#”) with 18 different types (Table 1). All participants wore the wrist sensor, but the limited sensor configurations of P1, P8, and P9 precluded the use of other sensors in the group-level model; to include all participants in one group-level model, only the wrist sensor was considered. In contrast, data from 2 to 6 tri-axial accelerometers (Table 1) were used in individual-level models.

Table 1 Participant identifier, type of SIB shown during the session, total duration (seconds), and sensors worn.

Study overview

After obtaining consent, the lead author, or the caregiver guided by the author, secured sensors on the child where tolerated. Demographic information was obtained, including potential SIB triggers identified by the caregiver. A trained clinical psychology doctoral candidate confirmed ASD diagnosis using standard tools (i.e., ADOS-2)49. The examiner was research reliable in administration and scoring of the ADOS-2, and was supervised by a licensed clinical psychologist who was also reliable (SWW). Subsequently, movement sensors (see “Instrumentation”), video cameras, and 2–3 observing researchers monitored each child during free-play. Researchers instructed the caregivers to respond to SIB as if at home. If sufficient SIB episodes failed to occur during free play, caregivers had the option to prompt SIB in a controlled fashion with a commonly-used procedure (Standardized Observation Analogue Procedure, SOAP)50. The session ended when either: (a) > 3 episodes of SIB were observed, or (b) participants or researchers stopped the session to prevent escalating behavior. At the end of the session, participants were compensated for their travel and time, and the children were presented with a selected toy.

Instrumentation

Tri-axial accelerometers (ActiGraph GT9X Link, www.actigraphcorp.com) were used to track participant movement (sampling frequency = 60 Hz) throughout the session. Earlier work found that these particular sensors were both reliable and accurate when used with children and adolescents51 for tracking movement among both pathological and healthy populations52. A maximum of six sensors were placed on the wrists, waist, and ankles, or in pockets, as accepted by the participant. Sensor choice and placement reflected prior research that found high reliability and high accuracy when classifying activities with movement sensors on either the wrist or torso25,52,53. Ankle sensors were also included as potentially necessary to capture lower-body injurious behaviors18. Three GoPro cameras and an overhead camera recorded videos for each child as “ground truth”.

Data processing and analysis

Sensor data were exported into MATLAB (R2018a, MathWorks), which was used for data analysis and modeling (on an Intel dual-core, 2.9 GHz CPU). Accelerometer data were labeled as non-SIB events (0) or SIB (1) using the ground truth video data and annotations from in-session observations. Before the session began, members of our research team discussed the SIB that parents described during pre-screening. Behaviors were defined by watching the children individually and captured by terms provided by their caregivers (e.g., eye-gouging). Members of our team also discussed behaviors observed in sessions, both during and after the session, for consensus building. Behavioral definitions were further clarified prior to data labeling. Multiple researchers annotated and discussed the video data before labeling raw accelerometry files (see Fig. 1 for an overview of the modeling process, and Cantin-Garside et al.34 for details on consensus-building for SIB labels).

Figure 1
figure 1

Overview of the data analysis and modeling process.

Raw sensor data were filtered using a 4th order, low-pass, recursive Butterworth filter, with a cutoff frequency of 20 Hz. Filtered data were used to obtain time- and frequency-domain features. Based on prior work54,55, raw data were used for extracting features of nonlinear motor variability (see “Derivation of nonlinear metrics of motor variability”). For continuous analysis of discrete data, all data were segmented into 2-s sliding windows with a 1-s overlap7,27. This short time window was used to minimize delays, which was considered important for real-time monitoring and reflects the potential for relatively short “bursts” of SIB.
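The filtering and segmentation steps above can be sketched as follows. The study used MATLAB; this is an illustrative Python equivalent using SciPy, and a zero-phase filter is assumed here since the filtering direction is not specified:

```python
import numpy as np
from scipy.signal import butter, filtfilt

FS = 60      # ActiGraph sampling frequency (Hz)
CUTOFF = 20  # low-pass cutoff frequency (Hz)

def lowpass(signal, fs=FS, cutoff=CUTOFF, order=4):
    """4th-order low-pass Butterworth filter (zero-phase assumed)."""
    b, a = butter(order, cutoff / (fs / 2), btype="low")
    return filtfilt(b, a, signal)

def sliding_windows(signal, fs=FS, win_s=2, step_s=1):
    """Segment a 1-D signal into 2-s windows with 1-s overlap (rows)."""
    win, step = win_s * fs, step_s * fs
    n = (len(signal) - win) // step + 1
    return np.stack([signal[i * step : i * step + win] for i in range(n)])

# 10 s of synthetic acceleration data yields nine overlapping 2-s windows
x = np.sin(np.linspace(0, 40 * np.pi, 10 * FS))
windows = sliding_windows(lowpass(x))
```

The 1-s step means each window shares half its samples with the previous one, trading some redundancy for a 1-s labeling resolution.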

Feature extraction

Three sets of features were extracted: (1) features in the time-domain; (2) features in the frequency-domain; and (3) nonlinear metrics based on chaos theory and dynamical systems. As discussed above, the use of time- and frequency-domain features is supported by prior findings on classifying SMM7,18,22,27, with nonlinear motor variability features included to capture the dynamical complexity of motion and to improve classifier performance56,57. Table 2 lists the features extracted for each channel. The presence (1) or absence (0) of a prompt to instigate SIB (from SOAP) was also included initially during feature selection (see “Feature selection”); all caregivers except the caregiver of Participant 4 opted to use SOAP at least once during their session.

Table 2 Time, frequency, and nonlinear motor variability features.

Derivation of nonlinear metrics of motor variability

Nonlinear metrics of motor variability were extracted by first reconstructing the phase space of the raw sensor data36,58. Phase space (also termed state space) graphically represents the possible states of a dynamical system, and its reconstruction involves creating M copies of the original time series \(x\), where M is the embedding dimension, using a time delay \((\tau )\)36,59. Time delay was determined using two methods: (1) the first minimum of the average mutual information function60,61; and (2) the delay at which the time series autocorrelation fell below \({e}^{-1}\)56,62. Time delay was determined using both methods separately for SIB and non-SIB events (see36,56,59 for additional details). The selected \(\tau\) was the value at which both methods converged, and was similar for both SIB and non-SIB (\(\tau =5\)). The resulting embedding dimension (M) was 4, derived from the global false nearest neighbor analysis method61,63. Both \(\tau\) and M values here are similar to prior work on the nonlinear variability of human motion61. State space was reconstructed as embedding vectors \(X\left(t\right)\) in the form:

$$X\left(t\right)=\left[x\left(t\right),x\left(t+\tau \right),x\left(t+2\tau \right),\dots , x\left(t+\left(M-1\right)\tau \right)\right]$$
(1)

such that \(t=1,\dots ,N-\left(M-1\right)\). Delay reconstruction was used to create the phase space, which produced consistent results in other work61. The parameters described below were then calculated using the reconstructed phase space.
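As a concrete sketch, the delay reconstruction of Eq. (1) with the selected parameters (\(\tau =5\), M = 4) can be implemented as follows (illustrative Python; function and variable names are ours):

```python
import numpy as np

def delay_embed(x, M=4, tau=5):
    """Build embedding vectors X(t) = [x(t), x(t+tau), ..., x(t+(M-1)tau)]
    as rows, for t = 1, ..., N-(M-1)*tau (Eq. 1)."""
    n_vec = len(x) - (M - 1) * tau
    return np.stack([x[i : i + n_vec] for i in range(0, M * tau, tau)], axis=1)

x = np.arange(100.0)   # toy series with N = 100
X = delay_embed(x)     # tau = 5 and M = 4, as selected in this study
# X holds 100 - (4-1)*5 = 85 embedding vectors of dimension 4;
# the first row is [x(0), x(5), x(10), x(15)]
```

Each row of `X` is one point of the reconstructed trajectory, and all nonlinear metrics below operate on this matrix.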

Entropy

Sample entropy (SaEn) was used to quantify the complexity of the acceleration signals, with low values indicating low complexity56,64,65. SaEn was calculated using the following steps:

  1. Compute \({C}_{t}^{M}\left(r\right)=\left\{\text{number of } X\left(i\right) \text{ such that } {\left|\left|X\left(t\right)-X\left(i\right)\right|\right|}_{\infty }\le r\right\}\), where C is the probability that the vector X(i) is within the tolerance threshold \(r=0.2\times \mathrm{SD}\left(x\right)\) of X(t), M is the embedding dimension, and \(t\ne i\)

  2. Find \({\phi }^{M}\left(r\right)=\sum_{t=1}^{N-M+1}\frac{{C}_{t}^{M}\left(r\right)}{N-M+1}\), the average of \({C}_{t}^{M}\left(r\right)\), where N is the number of data points in the signal

  3. Calculate \(SaEn=-\mathrm{ln}\frac{{\phi }^{M+1}\left(r\right)}{{\phi }^{M}\left(r\right)}\)
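The three steps above can be sketched directly (our own illustrative Python, not the study's code; matches are counted under the Chebyshev norm and self-matches are excluded, per step 1):

```python
import numpy as np

def sample_entropy(x, M=2, r_factor=0.2):
    """SaEn via the steps above: count template matches within
    r = 0.2*SD(x) at lengths M and M+1, then take -ln of the ratio."""
    x = np.asarray(x, dtype=float)
    r = r_factor * x.std()

    def match_count(m):
        n = len(x) - m + 1
        T = np.stack([x[i : i + m] for i in range(n)])
        # pairwise Chebyshev (infinity-norm) distances between templates
        d = np.max(np.abs(T[:, None, :] - T[None, :, :]), axis=2)
        return (d <= r).sum() - n   # drop self-matches (t != i)

    return -np.log(match_count(M + 1) / match_count(M))

se_regular = sample_entropy(np.sin(np.linspace(0, 20 * np.pi, 400)))
se_noisy = sample_entropy(np.random.default_rng(0).standard_normal(400))
# a periodic signal has lower complexity (lower SaEn) than noise
```

The short 2-s windows used here keep the pairwise distance computation tractable; a small embedding length (M = 2) is a common default and is assumed for this sketch.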

Cross-sample entropy

Cross-sample entropy has not been explored in models classifying ASD motion, but it was computed here between each pair of sensor channels and included in our feature set. Cross-sample entropy parallels SaEn in its estimation, but examines the difference between one data stream and another data stream56,66. Lower values imply similarity and synchronicity between the two data streams64.
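A minimal sketch of cross-sample entropy between two channels (our own illustrative implementation, mirroring the SaEn steps but matching templates of one stream against templates of the other):

```python
import numpy as np

def cross_sample_entropy(x, y, M=2, r_factor=0.2):
    """Cross-SampEn: like SaEn, but templates of x are compared
    against templates of y; lower values imply synchrony."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    r = r_factor * np.concatenate([x, y]).std()

    def match_count(m):
        tx = np.stack([x[i : i + m] for i in range(len(x) - m + 1)])
        ty = np.stack([y[i : i + m] for i in range(len(y) - m + 1)])
        d = np.max(np.abs(tx[:, None, :] - ty[None, :, :]), axis=2)
        return (d <= r).sum()

    return -np.log(match_count(M + 1) / match_count(M))

t = np.linspace(0, 20 * np.pi, 400)
synced = cross_sample_entropy(np.sin(t), np.sin(t))      # identical streams
unsynced = cross_sample_entropy(
    np.sin(t), np.random.default_rng(1).standard_normal(400))
```

Identical streams yield a low value, while an unrelated pair yields a higher one, consistent with the interpretation above.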

Recurrence quantification analysis

Recurrence quantification analysis (RQA) was performed with a MATLAB toolbox67,68 to evaluate phase space predictability and intermittency56,69. An RQA map is first constructed from a distance matrix comparison. The distance matrix \((\mathrm{DM})\) consists of elements (\({\mathrm{DM}}_{ij}\)) that are the Euclidean distances (\({\mathrm{DM}}_{ij}=d[X\left(i\right),X\left(j\right)]\)) between embedding vectors \(X\left(i\right)\) and \(X\left(j\right)\)56. Each \({\mathrm{DM}}_{ij}\) element is then compared against a threshold determined by recurring dynamical trajectories, with elements = 1 for \({\mathrm{DM}}_{ij}\) < threshold, indicating recurrent points where the trajectory has returned near a previous location, and = 0 otherwise39,56. The threshold was selected to keep the percentage of recurrent points within 0.1–2% of the total elements56. RQA can be evaluated using several measures56,69,70, with the following selected as reliable for human subject research56,66,69,70,71:

  1. Recurrence—regularity of the time series, as the percentage of recurrent points

  2. Determinism—percentage of consecutive, diagonally-aligned recurrent points, indicating signal periodicity and predictability; this relates to the inverse of the largest positive Lyapunov exponent, because longer diagonals imply deterministic versus chaotic movements

  3. Laminarity—percentage of vertically-aligned recurrent points, indicating signal stability (similar to determinism)

  4. Divergence—inverse of the maximum diagonal line segment, related to the maximal Lyapunov exponent

  5. Maximum diagonal length—proportional to the inverse of the maximal Lyapunov exponent, indicating the longest duration of periodicity

  6. Trapping time—mean vertical line length, indicating the duration of the trapped state and reflecting signal constancy
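A compact sketch of the recurrence matrix and two of the measures above (recurrence and determinism) is given below. This is our own illustrative Python, not the MATLAB toolbox used in the study; an embedding helper is included so the block is self-contained, and the threshold targets a fixed recurrence rate consistent with the 0.1–2% criterion:

```python
import numpy as np

def embed(x, M=4, tau=5):
    n = len(x) - (M - 1) * tau
    return np.stack([x[i : i + n] for i in range(0, M * tau, tau)], axis=1)

def recurrence_matrix(X, rate=0.02):
    """Threshold pairwise Euclidean distances so ~2% of points recur."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    return (d <= np.quantile(d, rate)).astype(int)

def recurrence_and_determinism(R, lmin=2):
    """Recurrence rate and determinism (share of recurrent points on
    diagonal lines of length >= lmin), excluding the main diagonal."""
    n = len(R)
    R = R * (1 - np.eye(n, dtype=int))
    rec = R.sum() / (n * n - n)
    diag_pts = in_lines = 0
    for k in range(1, n):
        for diag in (np.diag(R, k), np.diag(R, -k)):
            diag_pts += diag.sum()
            # lengths of runs of consecutive recurrent points on this diagonal
            runs = np.diff(np.flatnonzero(np.diff(np.r_[0, diag, 0])))[::2]
            in_lines += runs[runs >= lmin].sum()
    det = in_lines / diag_pts if diag_pts else 0.0
    return rec, det

sine = np.sin(np.linspace(0, 8 * np.pi, 200))
noise = np.random.default_rng(2).standard_normal(200)
_, det_sine = recurrence_and_determinism(recurrence_matrix(embed(sine)))
_, det_noise = recurrence_and_determinism(recurrence_matrix(embed(noise)))
# a periodic signal revisits its trajectory along diagonals: higher determinism
```

The remaining measures (laminarity, divergence, maximum diagonal length, trapping time) follow the same pattern of scanning diagonal or vertical line structures in the recurrence matrix.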

Detrended fluctuation analysis

Detrended fluctuation analysis (DFA) was used to quantify the persistence of SIB movements. DFA exponents (α) were calculated for every time segment to assess long-range correlations36,72: α > 0.5 indicates persistence (time series deviations tend to continue in the same direction), and α < 0.5 indicates anti-persistence (deviations tend to continue in the opposite direction)72. DFA has been used in analyses of motor control for ASD, with evidence of long-range correlations (persistence) during a drawing task73. DFA has also been applied to capture the predictability of a movement, specifically when walking72,74. Persistence typically degrades in pathology: the underlying long-range correlations are altered in disordered movement, compared to consistent correlations in healthy movement75. This finding holds irrespective of whether the outward appearance of pathological behavior is more restricted or more chaotic75.
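An illustrative DFA sketch (our own minimal implementation: integrate the mean-centred signal, linearly detrend within windows at several scales, and take α as the log–log slope of fluctuation versus window size):

```python
import numpy as np

def dfa_alpha(x, scales=(4, 8, 16, 32, 64)):
    """DFA exponent alpha: slope of log F(s) vs. log s, where F(s) is
    the RMS residual of linear fits within non-overlapping windows."""
    y = np.cumsum(x - np.mean(x))   # integrated profile
    F = []
    for s in scales:
        segs = y[: (len(y) // s) * s].reshape(-1, s)
        t = np.arange(s)
        # residuals after a linear fit within each window of size s
        res = [seg - np.polyval(np.polyfit(t, seg, 1), t) for seg in segs]
        F.append(np.sqrt(np.mean(np.square(res))))
    return np.polyfit(np.log(scales), np.log(F), 1)[0]

rng = np.random.default_rng(3)
white = rng.standard_normal(1000)   # uncorrelated noise: alpha near 0.5
walk = np.cumsum(white)             # strongly persistent signal: alpha near 1.5
a_white, a_walk = dfa_alpha(white), dfa_alpha(walk)
```

The two reference signals bracket the α = 0.5 boundary described above, which is what makes the exponent interpretable as a persistence measure.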

Feature selection

The least absolute shrinkage and selection operator (lasso) method was used to address multicollinearity, to remove redundant features, and to determine the sparsest model when considering all 96 features and all sensors76. This method directly selects variables that most contribute to the model. Principal component analysis (PCA) was then used for dimensional reduction, by finding the optimal combination of the 59 selected variables (see Supplementary Materials, Table S2 for further information on variable selection and loadings)77. This feature selection approach is capable of characterizing data despite high variability between participants. PCA output was subsequently used as input to a multilevel logistic regression model (MLR).
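The lasso-then-PCA pipeline can be sketched with scikit-learn on synthetic stand-in data (the study's actual features, lasso formulation, and MATLAB implementation differ; the feature matrix, labels, and penalty strength below are our own illustrative choices):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# synthetic stand-in: 500 windows x 96 features, binary SIB labels
rng = np.random.default_rng(4)
X = rng.standard_normal((500, 96))
y = (X[:, 0] + 0.5 * X[:, 1] + 0.3 * rng.standard_normal(500) > 0).astype(int)

# an L1-penalised (lasso-style) fit zeroes out redundant/collinear features
Xs = StandardScaler().fit_transform(X)
lasso = LogisticRegression(penalty="l1", solver="liblinear", C=0.1).fit(Xs, y)
kept = np.flatnonzero(lasso.coef_[0])   # indices of surviving features

# PCA on the surviving features for dimensional reduction;
# the resulting components would feed the multilevel regression model
pca = PCA(n_components=min(12, len(kept))).fit(Xs[:, kept])
pcs = pca.transform(Xs[:, kept])
```

Only the informative features (columns 0 and 1 here) and a handful of spurious ones survive the penalty, illustrating how lasso yields the sparsest model before PCA combines what remains.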

Regression modeling

A multilevel logistic regression (MLR) model was created with variable slopes and intercepts, which were used to account for high inter-subject variability32, a concern particularly relevant for ASD. Further, this model borrows strength from data across individuals when classifying an episode of SIB, improving the overall performance for the entire group. This aspect of the model is particularly relevant because SIB episodes can be sparse, depending on the individual and the type of SIB. Specifically, the model can be written as:

$${Y}_{i}=logi{t}^{-1}\left({\alpha }_{j\left[i\right]}+{\sum }_{k}{\beta }_{j\left[i\right],k}{X}_{k}\right), \quad i=0,1,\dots ,I;\; j=1,2,\dots ,J;\; k=1,2,\dots ,K$$
(2)

where i indexes over events, j[i] is the index of a subject who exhibits event i, k indexes over the features \(X\), and \({Y}_{i}=1\) is the outcome variable if the event is SIB vs. 0 otherwise. Intercepts \({\alpha }_{j\left[i\right]}\) and feature slopes \({\beta }_{j\left[i\right], k}\) are variable for each subject, and both can be modeled linearly as:

$${\alpha }_{j}\sim N\left({U}_{j}{\gamma }_{j}^{\alpha }, {\sigma }_{\alpha }^{2}\right); {\beta }_{j,k}\sim N\left({V}_{j,k}{\gamma }_{j,k}^{\beta }, {\sigma }_{\beta k }^{2}\right)$$
(3)

where \({U}_{j}\) and \({V}_{j,k}\) are potential features at the individual level, with corresponding linear coefficients \({\gamma }_{j}^{\alpha }\) and \({\gamma }_{j,k}^{\beta }\), and modeling error variances \({\sigma }_{\alpha }^{2}\) and \({\sigma }_{\beta k}^{2}\).
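The prediction step of Eq. (2) can be made concrete with a small numeric sketch (the parameter values below are hypothetical; in the study they are estimated from the data):

```python
import numpy as np

def inv_logit(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict_sib(alpha, beta, X, subj):
    """P(Y_i = 1) per Eq. (2): subject-specific intercept alpha[j[i]]
    plus the dot product of subject-specific slopes and event features."""
    z = alpha[subj] + np.einsum("ik,ik->i", beta[subj], X)
    return inv_logit(z)

# two subjects (j = 0, 1) and two features, with hypothetical parameters
alpha = np.array([-1.0, 0.5])                 # variable intercepts
beta = np.array([[1.0, 0.0], [0.2, -0.8]])    # variable slopes
X = np.array([[2.0, 1.0], [2.0, 1.0]])        # identical features, two events
subj = np.array([0, 1])                       # each event's subject index
p = predict_sib(alpha, beta, X, subj)
# identical feature values yield different SIB probabilities per subject,
# which is exactly what the varying intercepts and slopes provide
```

Here the first event gets z = -1 + 2 = 1 and the second z = 0.5 + 0.4 - 0.8 = 0.1, so the same features map to different probabilities for different subjects.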

Evaluation

Data were balanced and randomly selected following an 8:2 training/validation:testing ratio, so as to build a robust model and to test the built model78. SIB events are relatively rare compared to non-SIB events, which leads to a skewed distribution, as found in prior work with ASD-related behaviors7,39,79. SIBs here lasted about two seconds at minimum, though more subtle movements, such as picking, lasted longer (~10 to 90 s). SIB and non-SIB data were balanced as in other work to address this skewness28,39,79,80,81. Balanced data were used for training, and tenfold cross-validation was used18,26,30. This validation method was implemented to reflect the likely use cases in SIB interventions, including training and validating a model using data from each unique individual. SIB management is highly specific to the individual and the demonstrated behaviors, thus requiring representative data. Movement classification for SIB must therefore reflect the need for highly customizable tracking methods and account for heterogeneity. Two datasets were used for testing model generalization: (1) balanced data; and (2) natural, unbalanced data reflecting the ratio of SIB:non-SIB in the complete dataset. These testing methods were used to examine the potential of a model pre-trained on controlled data for application to natural, unbalanced datasets. All data were randomly selected from across the duration of a given session, and observations were assumed to reflect the entire dataset81.
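The balancing and 8:2 split can be sketched as follows (illustrative Python; random undersampling of the majority class is assumed here, as the study does not detail the balancing mechanism):

```python
import numpy as np

def balance_and_split(y, train_frac=0.8, seed=0):
    """Undersample non-SIB (0) windows to a 1:1 ratio with SIB (1)
    windows, then split indices 8:2 into training/validation and test."""
    rng = np.random.default_rng(seed)
    pos, neg = np.flatnonzero(y == 1), np.flatnonzero(y == 0)
    keep_neg = rng.choice(neg, size=len(pos), replace=False)
    idx = rng.permutation(np.concatenate([pos, keep_neg]))
    n_train = int(train_frac * len(idx))
    return idx[:n_train], idx[n_train:]

# skewed toy labels: 20 SIB windows vs. 180 non-SIB windows
y = np.array([1] * 20 + [0] * 180)
train_idx, test_idx = balance_and_split(y)
# the balanced pool has 40 windows: 32 for training/validation, 8 for testing
```

Testing on the natural, unbalanced label distribution then simply means evaluating on all remaining windows rather than on a subsampled set.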

Outcome measures were calculated for each model (MLR, and the models described below), with classification performance (accuracy, specificity, precision, recall, and F-score) calculated for each classification method82. Training and testing time were also computed for all developed models to assess the potential for application to real-time monitoring.
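The outcome measures above follow directly from confusion-matrix counts; a minimal sketch (our own helper, not the study's code):

```python
import numpy as np

def classification_metrics(y_true, y_pred):
    """Accuracy, specificity, precision, recall, and F-score from
    binary labels (1 = SIB, 0 = non-SIB)."""
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    accuracy = (tp + tn) / len(y_true)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_score = 2 * precision * recall / (precision + recall)
    return accuracy, specificity, precision, recall, f_score

y_true = np.array([1, 1, 1, 0, 0, 0, 0, 0])
y_pred = np.array([1, 1, 0, 0, 0, 0, 1, 0])
acc, spec, prec, rec, f1 = classification_metrics(y_true, y_pred)
```

Note that with unbalanced test data, precision and F-score are pulled down by even a small false-positive rate, which is why these metrics drop sharply in the unbalanced results reported below.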

Model comparisons

Additional group-level models were trained, validated, and tested, for the purpose of comparison with the MLR model (“MLR – variable intercepts and slopes”). These additional models were of five different types:

  1. Logistic regression (LR) with a variable intercept only (“LR—variable intercept”). This was used to compare the MLR with a less complex model, while still accounting for participant variability.

  2. LR without variable slopes or intercepts (“LR—no variable terms”). This model was included to compare the MLR with a model that does not consider participant-level variation.

  3. Two-way interaction model (stepwise LR), with included terms determined by BIC (“LR—stepwise”). This model was included to compare the MLR with higher-order, nonlinear models, and to evaluate the effect on accuracy of including terms with lower interpretability but potentially higher predictive power.

  4. Participant-level LR models (“LR-ind”), one for each of the 10 participants who exhibited SIB. These models were included to compare the group-level MLR with highly specific modeling that may have low generalizability, yet high accuracy.

  5. Several machine learning models: k-nearest neighbors (“kNN”), with k = 11 selected through optimization; support vector machines (“SVM”); and decision trees (“DT”). These three model types were included to compare the MLR with previously-employed methods demonstrating strong individual-level (though not group-level) performance in other ASD applications and in our previous featureless work (Cantin-Garside et al.34), and to compare the MLR with “black-box” models having lower interpretability but typically high predictive power.

Results

Dimensional reduction

Fifty-nine variables were selected via lasso and were then input to PCA for the group-level MLR model; these variables included both linear features and nonlinear motor variability features from each sensor channel. The lasso results, though, excluded the prompt variable. Means and standard deviations of select nonlinear motor variability features are provided in Supplementary Materials, Table S1. PCA generated 12 principal components (PCs). Coefficients and the explained variance of each PC are provided in Supplementary Materials, Table S2, and a summary of PCs and top loading variables is provided below in Table 3. PC1 explained 23% of the variance, with loadings primarily from frequency-based measures and measures capturing sudden or sharp movements (e.g., jerk, peak). Measures of the Z channel (vertical) loaded primarily on PC2, with coefficients > 0.3 for mean absolute value and RMS. Nonlinear motor variability metrics had coefficients up to ~0.6 in some PCs. Nonlinear metrics from RQA had coefficients > 0.4 on PC6 (Z channel), > 0.3 on PC8 (X channel), and > 0.4 on PC9 (Y channel), while SaEn had coefficients > 0.3 on PC10 and cross-sample entropy (ZY) had coefficients > 0.3 on PC12. The components listed above with nonlinear variable loadings > 0.3 accounted for 11.1% of the total variance (3.6, 3.0, 2.6, 2.5, and 1.9%, respectively). Together, the 12 components explained 65.6% of the group data variance.

Table 3 Summary of each PC, with top-loading features and total explained variance per PC.

Table 4 summarizes results using the MLR. PC1 and PC12 were both significant features when considered across participants, though not when varying with participant level. The intercept, along with PCs 2, 5, 6, 7 and 9, were only significant features when considering participant levels, and not when fixed. PCs 3, 8, 10, and 11 were significant features both when fixed and when randomly varying with participant level. PC4 was not significant in the model, either when fixed or when varying with participant level.

Table 4 Multi-level logistic regression parameter values for the group-level model including all 10 participants.

Classifier performance

Tables 5 and 6 respectively summarize results regarding training time, accuracy, specificity, precision, recall, F-scores, and adjusted R2 for validation and testing of group-level models. Training times for MLR, stepwise LR, and cubic SVM were longer than those of other classifiers by 2–4 orders of magnitude (10⁻² vs. 10⁻⁶ s/observation), though this difference was not reflected in testing times (all times within 10⁻⁶–10⁻⁵ s/prediction). MLR had high accuracy (74.7%) and F-score (0.752) in validation, which decreased minimally when testing with balanced data (73.2% and 0.733). Accuracy and F-score decreased with unbalanced test data for MLR (69.1% and 0.184). Specificity, precision, and recall were all highest for MLR in validation (~0.73 to 0.77) and testing (~0.73 for all three measures). Adjusted R2 was highest for MLR (0.502) and lowest for LR without variable intercepts/slopes (0.106). When considering participant levels with only a variable intercept, versus both variable intercepts and slopes, most performance metrics decreased by ~2 to 5%. LR without variable intercepts/slopes had the lowest accuracy (64.0%) in validation, dropping to 47.0% and 56.1% for balanced and unbalanced data, respectively. Linear SVM had the lowest specificity and precision (0.599 and 0.631), while LR without variable intercepts/slopes had the lowest recall (0.663), though this trend did not extend to testing results. Stepwise LR had the lowest specificity for both balanced and unbalanced test data (0.526 and 0.552, respectively). LR without variable intercepts/slopes had the lowest precision, recall, and F-score for balanced test data (0.455, 0.304, and 0.365, respectively). The kNN classifier had the lowest precision, recall, and F-score for unbalanced test data (0.064, 0.591, and 0.116, respectively), while LR without variable intercepts/slopes had the lowest F-score for validation (0.648) and balanced test data (0.365).

Table 5 Validation results for group-level classifiers.
Table 6 Test results at the group level.

Tables 7 and 8 show the classifier performance for validating and testing individual models, respectively. Times were on the order of 10⁻⁵–10⁻³ s/observation for training and 10⁻⁴–10⁻³ s/prediction for testing. Validation accuracy was higher overall (70.7–97.9%) than for group-level models (64.0–74.7%). Testing accuracy ranged widely, from 50.0–83.3% for balanced data to 27.2–95.7% for unbalanced data. Note that the unbalanced dataset, relative to the balanced dataset, did not lead to a substantial decrement in accuracy from validation to test set because substantially more behaviors are labeled non-SIB than SIB, and hence the non-SIB label is easier to predict. Specificity ranged from 0.648–1 for validation data, and from 0.500–1 for both balanced and unbalanced test data. Precision was between 0.692–1 for validation data, 0–0.818 for balanced test data, and 0–0.5 for unbalanced test data. Recall ranged from 0.746–1 for validation data and 0–1 for both balanced and unbalanced test data. F-scores ranged from 0.718–0.979 for validation data, 0–0.857 for balanced test data, and 0–0.667 for unbalanced data.

Table 7 Validation results for individual participants.
Table 8 Test results for individual participants.

Discussion

We developed an interpretable model to identify diverse types of SIB among a range of participants. Traditional time- and frequency-domain features were used, along with features capturing nonlinear motor variability, as input to a multi-level logistic regression model capable of detecting SIB at the group level, with selected components from dimension reduction explaining > 65% of the data variance. The lasso method did not select the prompt variable for this group-level model (recall that this prompt represented the presence or absence of caregiver actions that were targeted at instigating SIB), indicating that this model explains the presence of SIB beyond an identified SIB trigger. This finding is consistent with a prior report40 that temporal patterns in SIB occur independent of behavioral or environmental influences (or “triggers”). Nonlinear motor variability features (e.g., from RQA and entropy) loaded on PCs that accounted for > 10% of the explained variance in the dataset. DFA features had moderate loadings on PCs, and these features were a novel addition to modeling for ASD.
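The model structure described above can be sketched in equation form. This is a reconstruction under assumed notation (observation i of participant j, with x_ijk denoting the k-th principal-component score), not the authors' stated formula:

```latex
\operatorname{logit} P(\mathrm{SIB}_{ij}=1)
  = (\beta_0 + b_{0j}) + \sum_{k}(\beta_k + b_{kj})\,x_{ijk},
\qquad b_{0j} \sim \mathcal{N}(0,\sigma_0^2),\quad b_{kj} \sim \mathcal{N}(0,\sigma_k^2)
```

Here the β terms are fixed effects shared across participants, while the b_j terms are the participant-level (variable) intercept and slopes; setting all b terms to zero recovers LR without variable intercepts/slopes, the baseline compared below.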

Descriptive statistics of metrics of nonlinear motor variability from pooled SIB versus non-SIB events across participants showed little difference between the behavioral classes (Supplementary Materials, Table S1), which may explain the poor performance of a general group-level model without participant levels. However, upon examining one of the most severe behaviors (head banging) in one participant, nonlinear motor variability of SIB differed from non-SIB events (Supplementary Materials, Table S3). For example, DFA exponents for both non-SIB and SIB events in the Y and Z axes were slightly anti-persistent (< 0.5), indicating changes evolving in different directions over time. Though exponents remained < 0.5, there was a slight increase in DFA exponents for the Y and Z axes for SIB events compared to non-SIB, indicating more persistence in SIB events. Differences among other nonlinear metrics were evident for head banging in P9. Recurrence rate, for example, decreased for head banging in this individual compared to non-SIB events, indicating less regularity in SIB data. This finding opposes the common perception that repetitive behaviors are “regular”. There was a slight decrease in sample entropy for SIB in the Y and Z axes compared to non-SIB events, suggesting lower levels of complexity; however, cross-sample entropy increased during SIB, indicating higher levels of complexity between two channels of data. These findings may indicate that SIB occurs due to over/understimulation to seek system stability83 (“less” or “more” complexity)84 (see Mazefsky et al.83 for a review of emotional regulation in ASD, and Stergiou and Decker84 for a review of nonlinear dynamics and pathology).
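The DFA exponents discussed above can be estimated directly from a windowed, single-axis acceleration signal. A minimal sketch, assuming uniform sampling and linear (first-order) detrending; the window scales are illustrative and not those used in the study:

```python
import numpy as np

def dfa_exponent(x, scales=(16, 32, 64, 128, 256)):
    """Estimate the detrended fluctuation analysis (DFA) scaling exponent.
    alpha ~ 0.5: uncorrelated noise; alpha < 0.5: anti-persistence
    (changes tend to reverse direction); alpha > 0.5: persistence."""
    x = np.asarray(x, dtype=float)
    y = np.cumsum(x - x.mean())            # integrated, mean-centered profile
    flucts = []
    for s in scales:
        n_win = len(y) // s
        rms = []
        for w in range(n_win):
            seg = y[w * s:(w + 1) * s]
            t = np.arange(s)
            coef = np.polyfit(t, seg, 1)   # linear detrend within the window
            rms.append(np.sqrt(np.mean((seg - np.polyval(coef, t)) ** 2)))
        flucts.append(np.mean(rms))
    # slope of log F(s) vs. log s is the DFA exponent alpha
    alpha, _ = np.polyfit(np.log(scales), np.log(flucts), 1)
    return alpha

rng = np.random.default_rng(0)
alpha_white = dfa_exponent(rng.standard_normal(4096))  # expect alpha near 0.5
```

Applied to SIB versus non-SIB windows, shifts of this exponent toward or away from 0.5 correspond to the persistence differences reported in Table S3.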

Together, these results suggest that underlying nonlinear trends exist in the movements occurring during SIB. However, classic time- and frequency-domain features had the highest loadings on the first PC. These loadings indicate that jerk and FFT peaks are the strongest features of SIB, although nonlinear trends appear to differ between SIB and non-SIB. Consistent outcomes in prior work that used time- and frequency-domain features to accurately classify SMM7,18,22,27 suggest that such features should be the first considered when creating a model to predict SIB. Further considerations for SIB modeling include variable intercepts and slopes. LR without variable intercepts or slopes performed worse than LR with a variable intercept, which in turn performed worse than MLR with both a variable intercept and slopes. MLR with both a variable intercept and slopes outperformed all other classifiers, including commonly used machine learning algorithms implemented in other work7,26. These results imply that inter-individual variability also contributed to dataset variance.
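Why participant-level terms help can be illustrated with a toy example. The sketch below uses synthetic data (not the study data) in which two hypothetical participants have different SIB "thresholds" on one feature; adding a participant indicator approximates a variable intercept and resolves labels that a pooled model must leave conflicting:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic illustration: participant 0 exhibits SIB when the feature
# exceeds 0, participant 1 only when it exceeds 2 (inter-individual
# variability in baseline).
x = np.tile(np.linspace(-1.0, 3.0, 200), 2)
pid = np.repeat([0, 1], 200)                      # participant ID
y = np.where(pid == 0, x > 0.0, x > 2.0).astype(int)

# Pooled model: one decision boundary for everyone.
pooled = LogisticRegression(C=10.0, max_iter=1000).fit(x.reshape(-1, 1), y)
acc_pooled = pooled.score(x.reshape(-1, 1), y)

# Crude "variable intercept": participant ID shifts the boundary per person.
X_vi = np.column_stack([x, pid])
vi = LogisticRegression(C=10.0, max_iter=1000).fit(X_vi, y)
acc_vi = vi.score(X_vi, y)
```

The pooled model cannot place a single threshold that satisfies both participants in the overlapping feature range, whereas the participant-shifted model can; a true multi-level model generalizes this by also letting slopes vary and by shrinking participant effects toward the group mean.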

This study is the first, to our knowledge, to incorporate nonlinear variability in addition to traditional time–frequency metrics to explain the variance in SIB movement data. Our findings suggest that movements in SIB can be described as a dynamical system with long-term deviations, which is consistent with prior evidence that stereotypical motor movements in ASD can be accurately detected using nonlinear features from RQA39. Similarly, our feature extraction revealed higher loadings for RQA features when compared to other nonlinear factors, and the associated principal components were significant in our MLR model. Our findings, along with those of prior work39, indicate that RQA metrics could be critical in detecting repetitive and rhythmic motor movements (such as stereotypical motor movements and SIB) in ASD.

Further, PCs with nonlinear variable loadings were significant in the MLR model. PC6 and PC9, with loadings primarily from RQA metrics from the Z and Y axes, respectively, were significant in the model only when randomly varying with participant level; these metrics were not sufficient to classify SIB unless including variable slopes, which indicates that nonlinear movement aspects are specific to each individual. Time- and frequency-domain features that loaded on PC1 (frequency components, jerk, and peak/minimum) were significant features in MLR when independent from participant levels, indicating that these variables could be predictive of SIB without considering individual variability; the only nonlinear metric of motor variability to which this finding applied was PC12 (cross-sample entropy). Other PCs with loadings from metrics of nonlinear motor variability (RQA for X on PC8, sample entropy on PC10) were significant only when considered as either a fixed or a variable effect, implying that features such as sample entropy vary consistently between participants while still explaining individual-level variability. Nonlinear measures were significant features of SIB when they varied with participants, which supports prior evidence that nonlinear components of movement are specific to individuals85.

We believe the current group-level model is the first to achieve accuracy of ~ 75% when identifying SIB among a diverse group of behaviors and participants. Previous research on classifying other repetitive motor movements7,22,24,25,26,27,28,29,39 and aggression18,30 has evaluated specific models trained and/or tested only on individual participants, and performance dropped from 80% with individual models to 69% when applied to the group of participants30. Though a similar decrease was also evident here, percent accuracy was ~ 6% higher than earlier group-level results. The increased performance we found may be due, at least in part, to the use of feature selection and dimensional reduction methods, along with the multi-level properties of our model that accounted for inter-individual variability. Also, a larger sample of participants was included here, compared to earlier modeling reports of ASD behaviors, with samples ranging from one to six7,17,22,25,26,27,29. Further, our participant pool encompassed 18 different behaviors across children 5–14 years of age, suggesting potential generality to a wider sample of children with ASD and SIB. Note that the 18 types of SIB mentioned in the study are a reflection of our study cohort and were an attempt to subtype SIB. We were not seeking to identify underlying subclasses of SIB, but instead to classify behaviors (as demonstrated naturally by our study cohort) automatically using technology (sensors and machine learning), rather than manually with traditional observation methods.

As in prior work30, regression showed promising results here compared to other classifiers; however, those earlier authors focused on aggression towards others, whereas our study applied regression to SIB data. MLR here had higher accuracy, specificity, precision, and recall compared to several commonly used machine learning algorithms. These machine learning algorithms also detected SIB with high accuracy in our previous study using featureless data, though accuracy decreased greatly at the group level (see Cantin-Garside et al.34 for further details). Multi-level regression with both variable slopes and intercept may be preferred for group data with variable behaviors that could be specific to an individual, and it may also be easier to interpret than other machine learning algorithms. MLR had classifier performance superior to LR without a varying intercept/slope, further emphasizing the potential importance of individualizing models for participants with ASD and SIB. MLR was inferior only to several highly specific participant models, which may not generalize beyond the participant.

The widely varying performance metrics across participants, however, could account for the unexplained variance in MLR. Several participants had either near perfect detection (e.g., P1) or quite poor detection (P8 or P9) in testing. This wide range could have resulted from the inconsistent amount of SIB data included (P9 had the shortest session of all participants), the different types of SIB, or the variable sensor configurations between participants. Of note, MLR only included the one sensor worn in common by all participants: the wrist sensor. This single wrist sensor may not be sufficient for all SIB types, such as head banging or kicking, and therefore might have led to decreased performance measures in the group model. Yet, despite having only one sensor to incorporate in the group model, MLR still showed superior performance to all other tested classifiers. Group-level classifiers may be more practical (i.e., efficient and generalizable) to implement in real-world applications, and the current results are promising for automatically identifying diverse SIB with minimally-invasive technology.

Limitations and future work

Outcomes here provide initial groundwork toward a group-level classifier for SIB, yet further research is needed in several respects. Such work should include expanded data to improve classifier performance and to extend the model to more individuals with ASD, given the highly heterogeneous ASD diagnoses and the lifelong pervasiveness of ASD and SIB. SIB presentation can be extremely variable, including in its duration, and thus the current study is limited in size and scale (though more extensive than comparable existing studies). SIB data here may not have been sufficient for some participants, such as P8, when SIB episodes are few and/or short (< 100 s). There would thus be value in monitoring SIB across several days (longitudinal recordings) to capture additional episodes across different contexts, as well as expanding the study with a larger sample. Data from other episodes may also help increase explanatory power for the MLR, though accuracy is perhaps more critical for online detection of SIB. Although recall (i.e., sensitivity) and accuracy remained relatively high when testing MLR with unbalanced, natural data, precision decreased. This decrease in precision indicates that a "quasi-balancing" may be required when implementing classifiers in real-world settings. At present, this technology would be most useful in settings where individuals frequently exhibit SIB (e.g., school), as there would be a greater need for support in these settings and a more balanced SIB:non-SIB ratio. A classifier could be deployed when caregivers cannot maintain both tracking and behavior management due to the high frequency and/or intensity of behaviors. It may be possible to improve MLR by weighting terms based on the frequency of the behaviors, as well as based on caregiver perception of imminent danger. Other methods of improvement could include nonlinear terms with variable slopes, though this could decrease interpretability of the model.
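One simple implementation of such frequency-based weighting is inverse-frequency class weights (the same heuristic as scikit-learn's class_weight='balanced'). A sketch with hypothetical class counts:

```python
from collections import Counter

def inverse_frequency_weights(labels):
    """Per-class weights inversely proportional to class frequency:
    weight(c) = n_samples / (n_classes * count(c)). Rare SIB windows are
    up-weighted relative to abundant non-SIB windows, one option for the
    'quasi-balancing' of unbalanced natural data."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * cnt) for c, cnt in counts.items()}

# Hypothetical session: 1 SIB window per 19 non-SIB windows.
w = inverse_frequency_weights(["non-SIB"] * 95 + ["SIB"] * 5)
# SIB weight = 100 / (2 * 5) = 10.0; non-SIB weight ~ 0.53
```

These weights could enter the MLR likelihood as observation weights, so that misclassifying a rare SIB window costs as much in training as misclassifying many non-SIB windows.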
Additional levels could also be incorporated into the model, such as age, SIB type, frequency, or intensity. The definition of frequency, or the rate of SIB, can vary depending on the type of behavior and the observer (e.g., counting each hit versus counting each set of hits); thus, frequency was not used to describe behaviors. Instead, behavior was classified by the presence/absence of SIB in each time window rather than by a defined frequency. Similarly, intensity was not quantified objectively, though doing so would be a valuable contribution for future iterations of sensing technology for ASD. Operational definitions of SIB frequency and associated intensity, though, would be necessary to establish ground truth for inter-rater reliability. Further, additional analysis of the effects of observation time and duration on classifier performance could support decisions about data requirements for training and testing data.

Our work supports the presence of nonlinear motor variability within SIB. However, several features of nonlinear motor variability (DFA) loaded only modestly (< ~ 0.1) on PCs, which may be due to the young ages of the participant pool. The long-range correlations quantified by DFA only develop in gait during late childhood86, so such correlations may not yet be evident in SIB movements when the participants are young. Age could be incorporated as a covariate in future work, which may show differences in feature importance, such as in DFA, among age groups. Older participants could show more explicit anti-persistence or persistence in pathological movement, which could provide additional evidence of nonlinear motor variability in SIB. Dynamic movement signatures of individuals with ASD could provide information to detect pathology, such as movements involved in SIB, before typical diagnostic measures85, and could explain the pattern of SIB onset. These individual movement signatures might also reveal trends about intentions that underlie SIB movements, such as whether the motion is goal-directed or spontaneous85, or the etiology of ASD, through mapping movement characteristics to underlying mechanisms of movement87. With additional information about ASD movement signatures, variability components (quantified with metrics such as RQA and entropy) could be the basis for an intervention to promote self-awareness and intentional movements in ASD88. Specifically, if SIB is a deterministic process, nonlinear motor variability metrics could capture the convergence or divergence of the repeated motions that may indicate the onset of SIB from sensor data. If values from such metrics surpass a certain threshold, the child would be considered at risk for starting an SIB.
Stimuli (e.g., visual or auditory signals) could provide feedback to alert the child and divert the child’s attention to alternative coping mechanisms (e.g., a breathing app, squeezing a sensory object, or feeling a certain texture).

In future work, we plan to use the current findings to build more sophisticated hierarchical models. One useful addition might be including Bayesian priors for individual-specific information. If such models improve accuracy while retaining interpretability, they could be used to determine the necessity of intervention at the earliest indicator of an event. Specifically, the predicted probability score from logistic regression can serve as a criterion for caregiver interjection, by setting a pre-determined threshold (e.g., caregivers should interject if the probability of SIB > 0.8). A monitoring system could also include real-time estimation of variable parameters (intercept and slope) for each individual with ASD and SIB. Parameter coefficients can be estimated by the empirical Bayes approach, which allows the mean value of the prior distribution to equal the mean of coefficients from the training data. Using new data in real-time, the posterior distribution can then be recomputed and updated for that participant. Continuously adapting features could both improve current models and address evolving behavior when tracking SIB. Alternatively, autoregressive models could be employed to account for the temporal dynamics of SIB. As in Rad et al.28, such an approach could address the common challenge of class imbalances in detecting SIB. However, applying more advanced models, such as an autoregressive model, would likely detract from the interpretability of the SIB prediction, and such a tradeoff would need to be considered carefully when developing a monitoring system.
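The empirical Bayes updating described above can be sketched, for a single participant-level coefficient, as a conjugate normal-normal update. All numerical values below are hypothetical; the prior mean stands in for the mean coefficient across training participants:

```python
import numpy as np

def posterior_update(prior_mean, prior_var, obs, obs_var):
    """Conjugate normal-normal update for one coefficient. The prior is
    centered empirically on the training-data mean (empirical Bayes);
    each new batch of per-participant estimates shrinks the posterior
    toward that individual's own behavior."""
    obs = np.asarray(obs, dtype=float)
    n = len(obs)
    post_prec = 1.0 / prior_var + n / obs_var          # precisions add
    post_mean = (prior_mean / prior_var + obs.sum() / obs_var) / post_prec
    return post_mean, 1.0 / post_prec

# Prior centered on a hypothetical group-level slope; three new
# per-participant estimates pull the posterior toward ~1.0 while
# shrinking its variance.
mu, v = posterior_update(prior_mean=0.4, prior_var=0.25,
                         obs=[1.1, 0.9, 1.0], obs_var=0.1)
```

Recomputing this update as data stream in yields the continuously adapting participant-level parameters envisioned for a real-time monitoring system; the resulting predicted probability can then be compared against a pre-determined interjection threshold (e.g., P(SIB) > 0.8).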

Conclusions

This work provides a framework for, and initial results obtained from, interpretable SIB classification at the group-level, particularly through introducing new features with variable slopes and intercepts in a multi-level classifier. A new application of nonlinear metrics to movement in SIB was employed, specifically to develop a group-level classification model. We found that both linear and nonlinear measures of motor variability and time/frequency-domain features, paired with feature selection and dimensional reduction, explained > 65% of the variance found in SIB movement data, and classified diverse SIBs among a group of 10 participants with ~ 75% accuracy. Our results are promising in terms of the feasibility of developing a continuous monitoring system for SIB that can be applied to different types of behaviors and a range of individuals. This work serves as a proof of concept for the utility of technology to track SIB in ASD, which is necessary to apply this work to future Phase 1 prevention efforts. Future work should continue to build on these results, with added consideration of prior distributions for adaptive modeling.