At-home wearables and machine learning sensitively capture disease progression in amyotrophic lateral sclerosis

Gupta, Anoopum S.; Patel, Siddharth; Premasiri, Alan; Vieira, Fernando

doi:10.1038/s41467-023-40917-3

Download PDF

Article
Open access
Published: 21 August 2023

At-home wearables and machine learning sensitively capture disease progression in amyotrophic lateral sclerosis

Nature Communications volume 14, Article number: 5080 (2023) Cite this article

4800 Accesses
4 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Amyotrophic lateral sclerosis causes degeneration of motor neurons, resulting in progressive muscle weakness and impairment in motor function. Promising drug development efforts have accelerated in amyotrophic lateral sclerosis, but are constrained by a lack of objective, sensitive, and accessible outcome measures. Here we investigate the use of wearable sensors, worn on four limbs at home during natural behavior, to quantify motor function and disease progression in 376 individuals with amyotrophic lateral sclerosis. We use an analysis approach that automatically detects and characterizes submovements from passively collected accelerometer data and produces a machine-learned severity score for each limb that is independent of clinical ratings. We show that this approach produces scores that progress faster than the gold standard Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (−0.86 ± 0.70 SD/year versus −0.73 ± 0.74 SD/year), resulting in smaller clinical trial sample size estimates (N = 76 versus N = 121). This method offers an ecologically valid and scalable measure for potential use in amyotrophic lateral sclerosis trials and clinical care.

A machine-learning based objective measure for ALS disease severity

Article Open access 08 April 2022

Enabling precision rehabilitation interventions using wearable sensors and machine learning to track motor recovery

Article Open access 21 September 2020

Wearable full-body motion tracking of activities of daily living predicts disease trajectory in Duchenne muscular dystrophy

Article Open access 19 January 2023

Introduction

Novel therapeutic modalities are now aimed at proximal disease mechanisms in amyotrophic lateral sclerosis (ALS) and other neurodegenerative diseases^1,2. One major barrier to the successful and efficient development of disease-modifying therapies for neurodegenerative disorders is a lack of objective clinical outcome measures that account for disease heterogeneity and can sensitively quantify disease progression over the duration of a clinical trial^3,4,5. The standard tool for assessing disease severity in ALS clinical trials and clinical care is a semi-quantitative rating scale (ALS Functional Rating Scale-Revised^6,7 or ALSFRS-R) that uses multiple choice questions to evaluate several behavioral functions (e.g., walking, handwriting, speech, swallowing). The assessment is most often completed by clinicians specializing in ALS^7,8, however recent studies have shown high correlation between clinician-performed ALSFRS-R and at-home, patient-performed ALSFRS-R⁹. Clinician or patient-performed ALSFRS-R is a useful assessment of global motor function; however, it is subjective, categorical, and is only performed intermittently over time, which limits its sensitivity for measuring disease change and contributes to the need for relatively large and expensive trials^10,11. This is a particular challenge in rare diseases and results in pressure to include relatively homogenous cohorts with faster rates of disease progression, which restricts participation of some individuals and may not be representative of the entire ALS population¹².

There is a great opportunity to reduce the size and cost of ALS trials, increase the population of individuals who can participate, and accelerate the evaluation of promising therapeutics through the development of new categories of sensitive quantitative motor outcome measures^13,14,15. Quantitative motor outcome measures may be task-based (i.e., measuring behavior during the performance of a specific task) or task-free, where an individual’s natural behavior is measured passively and continuously at home. There has been recent development of several task-based approaches to quantify speech and limb function in ALS using scalable technologies at home^9,16,17,18 and only a single report of a task-free approach in ALS using a waist-worn accelerometer¹⁹. Task-based measures, however, have some of the same limitations as rating scales in that they are based on a relatively small number of data samples and cannot easily account for diurnal and day-to-day variability, they rely on the participant’s ability and motivation to perform the task, and they are susceptible to learning and placebo effects.

Task-free assessment approaches which passively and continuously measure natural behavior at home have the potential to overcome these limitations and be transformative by making reliable and sensitive measures available at scale. Furthermore, they have the potential to produce measures that more closely reflect the day-to-day function of the individual by measuring the individual’s own selection of behaviors. However, the information obtained by the tool must be interpretable and meaningful to support its use in clinical trials or clinical care.

Here we demonstrate that a submovement-focused analysis of triaxial accelerometer data^20,21, recorded from wrist and ankle sensors worn by hundreds of individuals with ALS at home during natural behavior, produces interpretable and robust measures of motor function and disease progression. We develop a machine learning approach to train a model that is sensitive to disease change by utilizing the information for how individuals’ sensor-based movement patterns change over time, rather than being constrained by existing clinical assessments such as ALSFRS-R. We show that the model’s severity estimates and longitudinal trajectories are reliable and consistent with ALSFRS-R, but are more sensitive than the clinical scale for measuring change over time. Thus, we demonstrate that objective, sensitive, and scalable measures of motor function and disease change can be obtained from passive analysis of everyday behavior using inexpensive wearable sensors.

Results

Overview of the dataset

We analyzed accelerometer data from wrist and ankle-worn sensors collected as part of the Precision Medicine Program launched by the ALS Therapy Development Institute (ALS TDI) in 2014 (see Methods). Individuals were asked to wear a sensor on each wrist and ankle continuously for a week each month. Participants also performed a sequence of 5 limb-based exercises on alternating days, lasting a total of approximately 5 min. Participants were instructed that sensors must be worn during the brief exercises, but to also wear the sensors as much as possible throughout the week without further specifying periods of wear time. An analysis of accelerometer data collected only during the task-based assessments was previously reported¹⁷. Here, we analyze the entirety of accelerometer data collected at home as individuals performed their typical daily routine without any constraints. Participant clinical and demographic data are shown in Fig. 1A, including the median ALSFRS-R at the study start and study end. 95% of participants lived in the United States (41 states represented), 3% lived in Canada, and 1% lived in the United Kingdom. 93.5% of participants were White, 2% Hispanic, 2% Asian, 1% Black, <1% Middle Eastern, and <1% Polish. 15% of ALS participants had a family history of ALS. The dataset filtering steps are described in Fig. 1B. Cross-sectional analysis included 4637 sessions from 402 unique participants (376 ALS, 26 controls) with at least 24 h of recorded accelerometer data, pooled only from days with at least 3 h of data, from all four limbs (Fig. 1B). The 24 h session minimum for daytime data was chosen based on prior work demonstrating high reliability of daytime data across the first three and last three days in a week^20,21. Longitudinal analysis was conducted using data from participants with at least three data collection sessions spanning a minimum of 0.75 years (188 ALS and 6 control participants). Participants had a median of 9 days per session with at least 3 h/day of data and averaged 8.9 h/day of wear time for the wrist sensors and 7.4 h/day for the ankle sensors. The duration of daily sensor wear time (averaged across the four sensors) decreased over the course of the study from an average of 9.1 h/day (first session) to 6.5 h/day (last session). To understand the burden of wearing the four sensors periodically over a 0.75-year period (relative to at-home self-report of ALSFRS-R), we identified the subset of the 402 individuals with adequate cross-sectional data who did not wear the sensors for at least 0.75 years but continued to perform ALSFRS-R self-report for 90 days or more after the last time they wore sensors. This consisted of 39 participants or ~10% of the cohort who continued performing ALSFRS-R but stopped wearing the sensors.

**Fig. 1: Overview of population and dataset.**

Submovement, activity bout, activity index, and spectral movement features (85 total) were extracted from each session as previously described^20,21 (Fig. 1C, Supplementary Table 1). Briefly, continuous triaxial accelerometer data was processed to identify activity bouts (short periods of continuous movement), which were projected onto a 2D plane using principal component analysis to identify the primary and secondary directions of motion^20,21. The acceleration time series was converted to a velocity time series via integration. Submovements (i.e., typically bell-shaped velocity-time curves flanked by zero velocity crossings, see Fig. 1C) were then identified in the primary and secondary directions of motion and grouped into long and short-duration submovements. Single feature analysis was performed on a subset of 24 key submovement (SM) features of interest. These included SM distance, peak velocity, and peak acceleration (8 features each). Mean and standard deviation were computed for short-duration and long-duration SMs in the primary and secondary directions of planar movement resulting in 8 features for each measurement type.

Overview of the pairwise comparisons severity estimation model

The task was to train a machine learning model that could combine information across the 85 movement features, previously shown to strongly reflect motor function in pediatric and adult ataxias^20,21, to produce an ALS-specific composite measure that was sensitive to disease progression. The standard machine learning approach is to train a regression model to predict the clinical scale score (e.g., ALSFRS-R). However, the sensitivity of the model is then constrained by the sensitivity of the scale. In the “pairwise model” approach, the model is trained to learn the steepest direction of disease change in feature space based on longitudinal data, without using clinician or patient-reported information. This approach, described in Fig. 2, can be applied to any disease that progresses over time. The model produces a score in which lower values represent increased impairment (as in ALSFRS-R) and there is no lower or upper bound on the value of the score, although scores in the current population ranged from −11.3 to 9.6. In addition to the pairwise model, linear regression models with L1-regularization²² were trained to predict ALSFRS-R total, ALSFRS-R gross motor subscore, and ALSFRS-R fine motor subscore, and were evaluated using five-fold cross-validation.

**Fig. 2: Overview of the pairwise comparisons model.**

Cross-sectional properties of ankle and wrist sensor data

Individual right ankle SM features, including SM distance, velocity, and acceleration were significantly correlated with ALSFRS-R total (r = 0.31–0.58), demonstrated high test–retest reliability (ICC = 0.71–0.93), and were significantly different between ALS and control participants (Table 1). All submovement features were positively correlated with ALSFRS-R, indicating that submovement distances, peak velocities, and peak accelerations were smaller and less variable in individuals with more severe disease. Similarly, right wrist submovement (SM) features were positively correlated with ALSFRS-R total (r = 0.31–0.48) and were significantly different between ALS and control participants (Table 2). Long-duration wrist submovements showed high test–retest reliability (ICC = 0.86–0.91), whereas short-duration submovements had moderate test–retest reliability (ICC = 0.55–0.83). Ankle submovement features were more strongly correlated with the ALSFRS-R gross motor subscore (r = 0.40–0.68) than with the ALSFRS-R fine motor subscore (r = 0.16–0.51) and were only weakly correlated with respiratory and bulbar subscores. Conversely, wrist submovement features correlated more strongly with the ALSFRS-R fine motor subscore (r = 0.40–0.60) compared with ALSFRS-R gross motor subscore (r = 0.19–0.32), and also only weakly correlated with the respiratory and bulbar subscores. Both ankle and wrist submovements demonstrated good agreement between right and left limbs, however ankle right/left agreement (r = 0.81–0.97) was stronger than wrist right/left agreement (r = 0.65–0.82).

Table 1 Cross-sectional properties of ankle submovement models and features

Full size table

Table 2 Cross-sectional properties of wrist submovement models and features

Full size table

Machine learning models trained to learn a composite severity score based on right ankle movement features, correlated well with ALSFRS-R total and ALSFRS-R gross motor subscore (r = 0.66-0.77), had high test–retest reliability (ICC = 0.88–0.92), distinguished between ALS participants and controls, and demonstrated strong right/left limb agreement (r = 0.91-0.95, see Table 1). For the right wrist, composite severity scores correlated well with ALSFRS-R total and ALSFRS-R fine motor subscore (r = 0.65–0.72), had high test–retest reliability (ICC = 0.84–0.90), distinguished between ALS participants and controls, and demonstrated strong right/left limb agreement (r = 0.82–0.86, see Table 2). Composite scores correlated more strongly with ALSFRS-R for male participants compared to female participants, however test–retest reliability and right/left limb agreement was similar for both groups (Supplementary Tables 2–5). The ankle and wrist pairwise models had the highest test–retest reliability among the machine learning models and were the focus of longitudinal analysis. To understand which individual features were the most salient in the ankle and wrist pairwise models, we identified features that were in the top five (out of 85) in feature importance for all 5 cross-validation folds. For the right ankle pairwise model, the features included SM peak velocity (mean, PC2 direction, long duration SM group) and SM distance (mean, PC2 direction, long duration SMs). For the right wrist pairwise model the most salient features were SM peak velocity (mean, PC2 direction, long duration SMs) and SM peak velocity (mean, PC2 direction, short duration SMs). Although SM peak velocity was strongly represented in the models, the properties of peak velocity at an individual feature level (e.g., relationships with ALSFRS-R, test–retest reliability) were comparable to SM acceleration and distance features and showed weaker relationships with ALSFRS-R compared to the pairwise models (see Tables 1, 2).

Longitudinal properties of ankle and wrist sensor data

The rate of change over time for each sensor-based composite score and ALSFRS-R score was modeled using linear regression, with the slope of the best fit line determining the rate of change²³. To compare the rate of change of different scores, each with a different range of values, each score was standardized (subtracting the mean and dividing by the standard deviation) and expressed as a z-score. Rate of change for each score was reported as z-score per year or equivalently as standard deviations (SD) per year.

The rate of change of the pairwise model composite score was computed for each limb. Rate of change was highly consistent across right and left ankles (r = 0.87) and right and left wrists (r = 0.80, Fig. 3A). There was lesser agreement (r = 0.52–0.56) between each upper and lower limb pair (e.g., right ankle versus right wrist). Individual-level trajectories demonstrated examples in which all four limbs progressed similarly over time (Fig. 3B), the lower limb pair had similar trajectories but differed from the upper limbs (Fig. 3C), and where the trajectory of one or two limbs deviated from the others (Fig. 3D).

**Fig. 3: Longitudinal data from each limb.**

There was also congruence between lower limb pairwise model trajectories and ALSFRS-R gross motor subscore trajectories and between upper limb and ALSFRS-R fine motor subscore trajectories (Fig. 3B–D). The population-level agreement between the right ankle pairwise model rate of change and ALSFRS-R gross motor rate of change (r = 0.73, p = 1.5 × 10⁻³³) was stronger than the agreement with ALSFRS-R fine motor (r = 0.56, p = 1.4 × 10⁻¹⁷), and the right wrist pairwise model rate of change showed stronger agreement with ALSFRS-R fine motor (r = 0.73, p = 1.1 × 10⁻³³) compared to ALSFRS-R gross motor rate of change (r = 0.60, p = 4.2 × 10⁻²⁰).

Next, for each participant, the pairwise model rate of change was combined over the four limbs by either taking the average rate of change or the maximum rate of change. When taking the average of the four limbs, the pairwise model rate of change had strong agreement with ALSFRS-R total rate of change (r = 0.71), gross motor subscore rate of change (r = 0.75), and fine motor subscore rate of change (r = 0.68, Fig. 4A), and weak agreement with respiratory and bulbar subscores (r = 0.38 and r = 0.45, respectively). Similarly, when taking the limb with the maximum rate of change, the pairwise model had strong agreement with ALSFRS-R (r = 0.69), gross motor subscore (r = 0.75), and fine motor subscore (r = 0.69, Fig. 4B), and weak agreement with respiratory and bulbar subscores (r = 0.34 and r = 0.43, respectively). The sensor-based pairwise model, which was trained to estimate disease severity without knowledge of ALSFRS-R scores, had strong rate-of-change agreement with the regression model trained to estimate ALSFRS-R total score, regardless of whether the average of the four limbs or the limb with the fastest progression rate was used (r = 0.92 for both, Fig. 4A, B). Thus, averaging or taking the maximum rate of change across the four limbs produced equally robust and consistent measures of disease progression.

**Fig. 4: Rate of change comparison between sensor-based models and ALSFRS-R.**

When taking the maximum rate of change, points shift downward with respect to the y = x line (Fig. 4A versus 4B) indicating increased sensitivity of the sensor-based model to disease change in comparison with ALSFRS-R total. Using the maximum rate of change, the pairwise model had a progression rate of −0.86 ± 0.70 (mean ± standard deviation) SD/year and the regression model had a progression rate of −0.86 ± 0.74 SD/year. Both the pairwise and regression models progressed faster over time than ALSFRS-R total (−0.73 ± 0.74 SD/year, p = 0.007 and p = 0.017, respectively; Fig. 4C). Female and male participants had nearly identical pairwise model progression rates (−0.86 ± 0.69 SD/year and −0.87 ± 0.70, respectively). Hypothetical clinical trial sample size estimates were smallest for the pairwise model (N = 76), followed by the regression model (N = 86), and ALSFRS-R (N = 121). Pairwise model and the regression model scores did not progress for control participants and were significantly different between ALS and control participants (p = 0.0004 and p = 0.0006, respectively; Fig. 4C). When using the mean rate of change of all four limbs, the pairwise model and ALSFRS-R total score were not significantly different (−0.56 ± 0.51 SD/year versus −0.73 ± 0.74 SD/year, p = 0.12) and ALSFRS-R total score was more sensitive than the regression model (−0.73 ± 0.74 SD/year versus −0.54 ± 0.56 SD/year, p = 0.037). Hypothetical clinical trial sample size estimates were again smallest for the pairwise model (N = 101), due to lower population variance in rate of change, followed by ALSFRS-R (N = 121), and the regression model (N = 126).

Discussion

We have shown that data from inexpensive sensors worn on limbs at home during natural behavior can produce reliable, sensitive, and interpretable measures of gross and fine motor function in individuals with ALS. Ankle movement features derived from accelerometer data were highly consistent across right and left ankles and were in agreement with gross motor function as assessed on ALSFRS-R, both in terms of cross-sectional severity and in terms of rate of change over time. Similarly, wrist movement features were highly consistent across right and left wrists and were in agreement with fine motor function on ALSFRS-R. Although there was strong right-left limb agreement at a population level, arm-leg agreement showed only moderate agreement, and some individuals were observed to have different rates of progression for each limb. Taking the score of the limb with the maximum progression rate produced a motor outcome measure that was consistent with but more sensitive than the current primary outcome measure in most ALS trials (ALSFRS-R), resulting in smaller hypothetical clinical trial sample size estimates.

The analysis approach for quantifying motor function in ALS centered on the extraction and characterization of motor primitives called submovements during natural behavior, which was previously developed for quantifying motor function in ataxia-telangiectasia²⁰ and adult cerebellar ataxias²¹. There is evidence that motor control is achieved by combining submovements to compose complex voluntary motor behaviors^24,25,26,27 and that submovements change in a consistent manner with the state of the motor system. In various contexts, such as infant development²⁸, aging²⁹, stroke recovery³⁰, and ataxia^31,32, submovements extracted from specific motor tasks reflect changes in motor function. During natural at-home behavior, ankle submovement distance, peak velocity, and peak acceleration are smaller in adults with spinocerebellar ataxias and multiple system atrophy compared to controls and become progressively smaller and less variable as self-reported function decreases and ataxia severity increases²¹. The submovement analysis approach contrasts with a prior analysis of task-free, at-home measurement in 42 individuals with ALS using waist-worn accelerometers, which quantified overall activity levels (e.g., activity count, percent of day active)¹⁹. Although overall motor activity is a pertinent outcome in ALS, it is reliant on full-day sensor wear and is likely more susceptible to day-to-day changes in behavioral context (e.g., travel, systemic illness, sleep quality), requiring careful consideration of reliability.

Based on our literature review, limb submovement features have not been previously studied in ALS. Several studies, however, have investigated the relationship between muscle strength (a direct cause of motor impairment in ALS³³) and submovement characteristics. In a heterogeneous population of individuals with motor impairments (e.g., spinal cord injury, cerebral palsy, stroke), participants were asked to perform a computer-based pointing task and a mechanical dynamometer was used to measure grip strength and pinch strength³⁴. The authors found that the number of submovements per pointing movement was negatively correlated with grip strength (the movement was composed of smaller submovements as grip strength decreased) and that the velocity of movement was directly proportional to grip strength³⁴. In another study of individuals with hemiparesis secondary to stroke, it was found that peak arm reaching velocity was influenced most by shoulder, elbow, and wrist flexor and extensor muscle strength (58% of variance explained), measured using a hand-held dynamometer³⁵. In a study of individuals without motor disability, submovement organization was examined as participants tracked a small or large dot on a screen with a pen placed on a digitizer tablet, while simultaneously recording activity from muscles in the neck and upper extremity using surface electrodes³⁶. When tracking the smaller target, extensor and flexor muscles of the forearm activated more strongly, and submovements were found to have increased peak velocities³⁶.

These studies support that there is a robust relationship between muscle strength and submovement features, in particular peak velocity. Consistent with these studies, we found that wrist and ankle submovements from individuals with ALS had smaller velocities, accelerations, and distances traversed. Submovement peak velocity was the only highly selected feature in both the right ankle and the right wrist pairwise models, demonstrating its importance for measuring disease progression ALS. This supports a model in which muscle weakness and decreased muscle activation caused by motor neuron pathology gives rise to slower and smaller submovements during everyday limb movement. Further supporting this model, are the parallels in left-right symmetry observed in the present study with the left-right symmetry observed in large studies of hand-held dynamometry (HHD)²³ and Accurate Test of Limb Isometric Strength (ATLIS)³⁷ in ALS. Individual arm and leg muscles were found to correlate strongly with the identical muscles on the contralateral side, both in terms of cross-sectional strength measurements (r = 0.65–0.90) and also in terms of rate of change over time (r = 0.43–0.82)²³. We observed similar side-to-side cross-sectional and rate of change symmetry in individual submovement features (cross-sectional r = 0.65–0.97) and composite models (cross-sectional r = 0.82–0.95; rate of change r = 0.80–0.87). The high degree of correlation between right and left limbs and the observation that handedness and footedness can change over time in individuals with ALS, motivated our designation of limbs as right and left rather than dominant and nondominant. Interestingly, side-to-side cross-sectional symmetry of the leg was stronger than side-to-side symmetry of the arm here and in the HHD study. This may have implications for how ALS disease pathology spreads and highlights a potential future application of this technology in characterizing phenotypic spread across limbs in a continuous and granular fashion, for example in presymptomatic gene carriers. This also supports that submovement characteristics may be a suitable proxy for muscle strength in ALS, and offers an advantage over HHD and ATLIS of being able to measure strength continuously over multiple days, during the individual’s own selection of behaviors, and without relying on participant effort or evaluator training and strength. Thus, it may produce more reliable, ecologically valid, and scalable measures of muscle strength and motor function. It may also apply to other neurological conditions that affect muscle strength. A future study that collects HHD and/or ATLIS measurements along with submovements from accelerometer data would help clarify the relationship between strength and submovements in ALS.

As discussed above, strong side-to-side correlations of ankle and wrist submovement features and composite models were observed. This is consistent with previously reported strength measurements in ALS^23,37, but also highlights the robustness of the submovement measures that are generated independently from each limb’s movement during natural behavior at home. Ankle submovement measures correlated strongly with ALSFRS-R gross motor subscore (both cross-sectional scores and rate of change) and wrist submovement measures correlated strongly with ALSFRS-R fine motor subscores. We found high test–retest reliability of the sensor-based features and composite models. Finally, two machine learning models trained based on different information (pairwise model trained on longitudinal change; regression model trained on ALSFRS-R) generated composite scores that had strong agreement in the rate of disease progression (r = 0.92). These properties support that sensor-derived submovements obtained during natural behavior provide highly robust measures of disease severity for each limb. Since each limb can be reliably and independently measured, these data support the use of the fastest progressing limb’s rate of progression in order to obtain a personalized overall measure that is more sensitive for measuring disease change than ALSFRS-R and which may be more responsive to therapeutic intervention. However, the choice of if and how to combine severity measures from each limb can be determined based on the clinical application as well as on the individual’s prior clinical trajectory (for example in a run-in period prior to intervention in a clinical trial). To achieve maximal sensitivity for disease change, these data support collecting movement information from all four limbs. Given the high reliability of the sensor-based measures and since each limb is analyzed independently, it is not necessary to wear all four sensors simultaneously. An alternative design could be to wear one sensor at a time and rotate its location on the body in one-week intervals. Thus, each limb is still measured continuously for one week each month.

Two different supervised machine-learning approaches were used to create composite measures of overall motor impairment for each limb based on the collection of sensor-based movement features. One used the traditional approach of training a regression model to predict severity as measured by ALSFRS-R. The other approach learned the trajectory of disease progression (in feature space) from the longitudinal data and computed how far the individual had moved along that trajectory without ever having access to rating scale data (i.e., pairwise model). Despite the very different training approaches, both models were highly consistent in their estimates of progression rate (r = 0.92) and were similarly consistent with ALSFRS-R total’s progression rate (r = 0.69 and r = 0.71). The pairwise model was highlighted in analysis for three main reasons: (1) it had higher reliability than the regression models, (2) the consistency with ALSFRS-R in cross-section and in the rate of change was striking given that it had no chance to “overfit” to the clinical score, and (3) the pairwise modeling approach may be useful for other diseases where the existing clinical rating scale is less sensitive for capturing disease change. Furthermore, the pairwise modeling approach can be extended in a number of ways, for example by filtering comparisons, changing the type of classifier used, and aggregating data across multiple disease populations.

The large and longitudinal dataset generated by the ALS TDI Precision Medicine Program, consisting of 376 individuals with ALS who wore four sensors for multiple hours and days at home and with 188 participants who wore the four sensors longitudinally over a minimum of 0.75 years (median of 15 times over 1.5 years), supports the feasibility of the at-home passive data collection approach from both a patient and clinical operations perspective. Notably, although the Actigraph GT3X device was used in the current study, different devices were used in prior studies in ataxias^20,21, and the analytic approach presented here can likely be applied to any wearable sensor that captures triaxial accelerometer data at a minimum of 30 Hz, including consumer-grade sensors.

There were some limitations to the study. There was a relatively small number of controls included in the study and the controls were not age matched. However, the size and characteristics of the control sample do not affect the main conclusions of the study. There was heterogeneity in the number of hours each participant wore the sensors at home. This was mitigated in part by only including days in which sensors were worn for at least 3 h. We anticipate higher reliability estimates of all sensor-based measures if participants are explicitly asked to wear the sensor throughout the entire day with exception of bathing (and night if possible). Finally, as expected, the severity estimates based on limb movement did not correlate well with bulbar and respiratory function. These functions are represented in ALSFRS-R and other digital strategies (e.g., video-based analysis of facial movement or speech analysis^17,38,39,40) are needed to quantify these important motor domains in ALS.

In summary, we have shown that a submovement-based analysis of natural behavior at home using wearable sensors produces interpretable, reliable, sensitive, and ecologically valid measures of gross and fine motor function in ALS. This technology has properties that support its use as an outcome measure in ALS clinical trials with the potential to reduce the cost and size of future trials. The use of inexpensive sensors, worn at home with minimal instruction and no eligibility limitations, could increase access to clinical trials and support virtual clinical trials in ALS. It may also support the routine clinical care of individuals with ALS by providing clinicians and patients with an objective and reliable motor assessment that can be passively obtained at home with a relatively low burden and cost.

Methods

This research study was conducted in accordance with the ethical principles posited in the Declaration of Helsinki - Ethical Principles for Medical Research Involving Human Subjects. Protocol approval was provided by the institutional review board (ADVARRA CIRBI). Every participant consented to participate in this research by signing an IRB approved informed consent form. There was no participant compensation in this study. Gender of participants was determined based on self-report and was not explicitly considered in the study design.

Wearable sensor data processing and feature types

Continuous triaxial accelerometer data collected at 30 Hz was obtained from Actigraph GT3X devices (one for each limb). The cost of a single sensor ranged from $234-433 over the course of the study. Participants received a different sensor at each time point in the study. Any repeat use of a device by a participant would have been coincidental. In prior work, each participant’s wearable sensor data were manually partitioned into day and night segments based on changes in each participant’s daily activity level represented in the accelerometer data^20,21,41. However, given the large size of this dataset, day segments were automatically partitioned to include data collected between 7:21 am and 11:27 pm, the pooled mean estimates of sleep offset and sleep onset in the oldest age group (15–18 year old’s) studied in Galland et al.⁴², while accounting for each individual’s time zone. Visual inspection of random samples of 24-h periods of accelerometer data from multiple participants demonstrated that these times produced reasonable day-night segmentations. Data analysis focused on daytime segments. Gravity and high-frequency noise were removed from the acceleration time series using a sixth-order Butterworth filter with cutoff frequencies of 0.1 and 20 Hz^20,21,41,43.

Several classes of features were extracted from daytime ankle and wrist sensor data as in prior work^20,21. These included total power in the 0.1-5 Hz frequency range and features based on the distribution of activity intensity computed in 1-second time bins. Features were also extracted from “activity bouts” and from submovements. Supplementary Table 1 provides a description of the 85 features extracted from ankle and wrist sensor data. Based on prior work, single feature analysis was performed on a subset of 24 submovement features of interest as described in the main text.

Severity estimation models

Supervised machine learning approaches were used to create composite severity scores that aggregate over the 85 movement features. Separate models were trained for each limb. The pairwise comparison approach is described in Fig. 2 and the main text. To ensure that the pairwise model did not inadvertently learn longitudinal changes resulting from changes in device settings, comparisons were only allowed between sessions that had the same critical firmware version (where raw data were collected in an identical way). Five-fold cross-validation was used: for each fold comparisons from 80% of ALS participants were used to train a classification model and the model weights were applied to data from the held-out 20% of participants to generate severity scores for each session. Additionally, we trained linear regression models with L1 regularization (i.e., lasso regression)²² to predict ALSFRS-R total, ALSFRS-R gross motor subscore (ankle sensor data only), and ALSFRS-R fine motor subscore (wrist sensor data only). Five-fold cross-validation was also used to evaluate the performance of the regression models. For both the pairwise models and the regression models, each feature was z-score transformed prior to model training such that feature value ranges and model weights were comparable. Pearson correlation coefficient was used to measure performance, with each model compared with ALSFRS-R.

Statistical analyses

Statistical analyses were completed in MATLAB version R2022a (Mathworks, Natick, MA). In longitudinal data analysis, each participant’s progression rate for a given measure was determined by fitting a linear regression model to the individual’s longitudinal data for the measure and using the slope of the curve to represent a progression over time²³. The mean and standard deviation of the slope for each measure were computed across all ALS participants. For hypothetical clinical trial sample size estimates, we used a one-sample model for a continuous outcome⁴⁴ as described in Rutkove et al.¹⁶ with the same model parameters: 90% power to detect a 30% mean change in progression rate, with two-sided P values and a significance level of 0.05.

The non-parametric Mann–Whitney U test was used to determine individual feature differences between disease and control groups and Cohen’s d was used to measure effect size. The Mann–Whitney U test was also used to determine differences in the rate of change between different assessments. The Benjamini–Hochberg method was used to adjust for multiple comparisons and corrected p-values are reported⁴⁵. Corrected p-values <0.05 were considered significant. Single-measure intraclass correlation coefficients (ICCs) were used to determine the test–retest reliability of features and composite scores. To evaluate the reliability of sensor-based features, features were computed from data recorded in first half of the days in the session (e.g., days 1–4) and the second half of the days in the session (e.g., days 5–8), separately, and ICCs were computed using a 2-way mixed effects model⁴⁶. Pearson correlation coefficients and p-values were used to evaluate the relationship between sensor-based features and ALSFRS-R. As above, the Benjamini–Hochberg method was used to adjust for multiple comparisons⁴⁵.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The GTX3 accelerometer data and associated ALSFRS-R data are available upon request because file sizes necessitate coordinated data transfer. Access can be obtained by visiting https://www.als.net/arc/data-commons/ and requesting the dataset by submitting accession code 06162023. Source data are provided with this paper.

Code availability

The code to train the pairwise model is available in the github repository https://github.com/neuropheno-org/Pairwise_model_code.

References

Amado, D. A. & Davidson, B. L. Gene therapy for ALS: A review. Mol. Ther. 29, 3345–3358 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ly, C. V. & Miller, T. M. Emerging antisense oligonucleotide and viral therapies for amyotrophic lateral sclerosis. Curr. Opin. Neurol. 31, 648–654 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cudkowicz, M. E. et al. Toward more efficient clinical trials for amyotrophic lateral sclerosis. Amyotroph. Lateral Scler. 11, 259–265 (2010).
Article CAS PubMed Google Scholar
Nicholson, K. A., Cudkowicz, M. E. & Berry, J. D. Clinical trial designs in amyotrophic lateral sclerosis: does one design fit all? Neurotherapeutics 12, 376–383 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kiernan, M. C. et al. Improving clinical trial outcomes in amyotrophic lateral sclerosis. Nat. Rev. Neurol. 17, 104–118 (2021).
Article PubMed Google Scholar
Cedarbaum, J. M. & Stambler, N. Performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale (ALSFRS) in multicenter clinical trials. J. Neurol. Sci. 152, S1–S9 (1997).
Article PubMed Google Scholar
Cedarbaum, J. M. et al. The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. BDNF ALS Study Group (Phase III). J. Neurol. Sci. 169, 13–21 (1999).
Article CAS PubMed Google Scholar
Atassi, N. et al. Analysis of start-up, retention, and adherence in ALS clinical trials. Neurology 81, 1350–1355 (2013).
Article PubMed PubMed Central Google Scholar
Berry, J. D. et al. Design and results of a smartphone-based digital phenotyping study to quantify ALS progression. Ann. Clin. Transl. Neurol. 6, 873–881 (2019).
Article PubMed PubMed Central Google Scholar
Bakers, J. N. E. et al. Using the ALSFRS-R in multicentre clinical trials for amyotrophic lateral sclerosis: potential limitations in current standard operating procedures. Amyotroph. Lateral Scler. Frontotemporal Degener. 23, 500–507 (2022).
Article PubMed Google Scholar
Fournier, C. N. Considerations for amyotrophic lateral sclerosis (ALS) clinical trial design. Neurotherapeutics 19, 1180–1192 (2022).
Article PubMed PubMed Central Google Scholar
van Eijk, R. P. A. et al. Innovating clinical trials for amyotrophic lateral sclerosis: challenging the established order. Neurology 97, 528–536 (2021).
Article PubMed PubMed Central Google Scholar
Dorsey, E. R., Venuto, C., Venkataraman, V., Harris, D. A. & Kieburtz, K. Novel methods and technologies for 21st-century clinical trials a review. JAMA Neurol. 72, 582–588 (2015).
Article PubMed PubMed Central Google Scholar
Stroud, C., Onnela, J.-P. & Manji, H. Harnessing digital technology to predict, diagnose, monitor, and develop treatments for brain disorders. npj Digital Med. 2, 3–6 (2019).
Article Google Scholar
Gupta, A. S. Digital phenotyping in clinical neurology. Semin. Neurol. 42, 48–59 (2022).
Article PubMed PubMed Central Google Scholar
Rutkove, S. B. et al. Improved ALS clinical trials through frequent at-home self-assessment: a proof of concept study. Ann. Clin. Transl. Neurol. 7, 1148–1157 (2020).
Article PubMed PubMed Central Google Scholar
Vieira, F. G. et al. A machine-learning based objective measure for ALS disease severity. NPJ Digit Med. 5, 45 (2022).
Article PubMed PubMed Central Google Scholar
Beswick, E. et al. A systematic review of digital technology to evaluate motor function and disease progression in motor neuron disease. J. Neurol. 269, 6254–6268 (2022).
Article PubMed PubMed Central Google Scholar
van Eijk, R. P. A. et al. Accelerometry for remote monitoring of physical activity in amyotrophic lateral sclerosis: a longitudinal cohort study. J. Neurol. 266, 2387–2395 (2019).
Article PubMed PubMed Central Google Scholar
Gupta, A. S., Luddy, A. C., Khan, N. C., Reiling, S. & Thornton, J. K. Real-life wrist movement patterns capture motor impairment in individuals with Ataxia-Telangiectasia. Cerebellum https://doi.org/10.1007/s12311-022-01385-5. (2022).
Eklund, N. M. et al. Real-life ankle submovements and computer mouse use reflect patient-reported function in adult ataxias. Brain Commun. 5, fcad064 (2023).
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. 58, 267–288 (1996).
MathSciNet MATH Google Scholar
Shefner, J. M. et al. Quantitative strength testing in ALS clinical trials. Neurology 87, 617–624 (2016).
PubMed PubMed Central Google Scholar
Woodworth, R. S. Accuracy of voluntary movement. Psychol. Rev.: Monogr. Suppl. 3, i (1899).
Viviani, P. Do units of motor action really exist? Exp. Brain Res. Ser. 15, 201–216 (1986).
Google Scholar
Flash, T. & Hochner, B. Motor primitives in vertebrates and invertebrates. Curr. Opin. Neurobiol. 15, 660–666 (2005).
Article CAS PubMed Google Scholar
Hogan, N. & Sternad, D. Dynamic primitives of motor behavior. Biol. Cybern. 106, 727–739 (2012).
Article MathSciNet PubMed PubMed Central Google Scholar
von Hofsten, C. Structuring of early reaching movements: a longitudinal study. J. Mot. Behav. 23, 280–292 (1991).
Article Google Scholar
Walker, N., Philbin, D. A. & Fisk, A. D. Age-related differences in movement control: adjusting submovement structure to optimize performance. J. Gerontol. B: Psychol. Sci. Soc. Sci. 52B, P40–P53 (1997).
Article CAS PubMed Google Scholar
Rohrer, B. et al. Submovements grow larger, fewer, and more blended during stroke recovery. Mot. Control 8, 472–483 (2004).
Article Google Scholar
Oubre, B. et al. Decomposition of reaching movements enables detection and measurement of ataxia. Cerebellum https://doi.org/10.1007/s12311-021-01247-6 (2021).
Lee, J. et al. Analysis of gait sub-movements to estimate ataxia severity using ankle inertial data. IEEE Trans. Biomed. Eng. https://doi.org/10.1109/TBME.2022.3142504 (2022).
Sobue, G. et al. Degenerating compartment and functioning compartment of motor neurons in ALS: possible process of motor neuron loss. Neurology 33, 654–657 (1983).
Article CAS PubMed Google Scholar
Biswas, P. & Langdon, P. Developing multimodal adaptation algorithm for mobility impaired users by evaluating their hand strength. Int. J. Hum.–Computer Interact. 28, 576–596 (2012).
Article Google Scholar
Zackowski, K. M., Dromerick, A. W., Sahrmann, S. A., Thach, W. T. & Bastian, A. J. How do strength, sensation, spasticity and joint individuation relate to the reaching deficits of people with chronic hemiparesis? Brain 127, 1035–1046 (2004).
Article CAS PubMed Google Scholar
Huysmans, M. A., Hoozemans, M. J. M., van der Beek, A. J., de Looze, M. P. & van Dieën, J. H. Submovement organization, pen pressure, and muscle activity are modulated to precision demands in 2D tracking. J. Mot. Behav. 44, 379–388 (2012).
Article PubMed Google Scholar
Rushton, D. J., Andres, P. L., Allred, P., Baloh, R. H. & Svendsen, C. N. Patients with ALS show highly correlated progression rates in left and right limb muscles. Neurology 89, 196–206 (2017).
Article PubMed PubMed Central Google Scholar
Stegmann, G. M. Early detection and tracking of bulbar changes in ALS via frequent and remote speech analysis. npj Digital Medicine 3, 132 (2020).
Bandini, A. et al. Kinematic features of jaw and lips distinguish symptomatic from presymptomatic stages of bulbar decline in amyotrophic lateral sclerosis. J. Speech Lang. Hear. Res. 61, 1118–1129 (2018).
Article PubMed PubMed Central Google Scholar
Eshghi, M. et al. Rate of speech decline in individuals with amyotrophic lateral sclerosis. Sci. Rep. 12, 15713 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Khan, N. C., Pandey, V., Gajos, K. Z. & Gupta, A. S. Free-living motor activity monitoring in Ataxia-Telangiectasia. Cerebellum https://doi.org/10.1007/s12311-021-01306-y (2021).
Galland, B. C. et al. Establishing normal values for pediatric nighttime sleep measured by actigraphy: a systematic review and meta-analysis. Sleep 41 (2018).
Bouten, C. V., Koekkoek, K. T., Verduin, M., Kodde, R. & Janssen, J. D. A triaxial accelerometer and portable data processing unit for the assessment of daily physical activity. IEEE Trans. Biomed. Eng. 44, 136–147 (1997).
Article CAS PubMed Google Scholar
Ryan, T. P. Sample Size Determination and Power (John Wiley & Sons, 2013).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. 57, 289–300 (1995).
MathSciNet MATH Google Scholar
Shrout, P. E. & Fleiss, J. L. Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86, 420–428 (1979).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank James Berry and Katherine Burke for helpful discussions. We also thank the community of people with ALS who contributed data to these studies. The study was supported in part by NIH R01 NS117826. (A.S.G.).

Author information

Authors and Affiliations

Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Anoopum S. Gupta & Siddharth Patel
ALS Therapy Development Institute, Watertown, MA, USA
Alan Premasiri & Fernando Vieira

Authors

Anoopum S. Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Siddharth Patel
View author publications
You can also search for this author in PubMed Google Scholar
Alan Premasiri
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Vieira
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.V. conceived of the translational research program that resulted in the data collected and analyzed in this manuscript. A.S.G., S.P., A.P., and F.V. conceived of the study objectives. A.P. and F.V. contributed to data collection efforts. S.P. performed ingestion and preprocessing of the dataset. A.S.G. performed analysis of the dataset. A.S.G., S.P., A.P., and F.V. contributed to the interpretation of the results. A.S.G. took the lead in writing the manuscript. All authors provided critical feedback and helped shape the research, analysis, and manuscript.

Corresponding author

Correspondence to Anoopum S. Gupta.

Ethics declarations

Competing interests

For the methods for extracting and characterizing submovements from wearable sensor data, a PCT (US2022/081374) was filed on December 12, 2022, titled “System and method for clinical disorder assessment”. An earlier US Provisional Application (Serial No. 63/288,619) was filed on December 12, 2021. The patent applicant is the institution (Massachusetts General Hospital) and inventor is Anoopum Gupta. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jeremy Shefner, Ruben van Eijk and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gupta, A.S., Patel, S., Premasiri, A. et al. At-home wearables and machine learning sensitively capture disease progression in amyotrophic lateral sclerosis. Nat Commun 14, 5080 (2023). https://doi.org/10.1038/s41467-023-40917-3

Download citation

Received: 14 March 2023
Accepted: 04 August 2023
Published: 21 August 2023
DOI: https://doi.org/10.1038/s41467-023-40917-3

This article is cited by

Citizen data sovereignty is key to wearables and wellness data reuse for the common good
- Stephen Gilbert
- Katie Baca-Motes
- Dirk Brockmann
npj Digital Medicine (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.