The detection of age groups by dynamic gait outcomes using machine learning approaches

Zhou, Yuhan; Romijnders, Robbin; Hansen, Clint; Campen, Jos van; Maetzler, Walter; Hortobágyi, Tibor; Lamoth, Claudine J. C.

doi:10.1038/s41598-020-61423-2

Download PDF

Article
Open access
Published: 10 March 2020

The detection of age groups by dynamic gait outcomes using machine learning approaches

Yuhan Zhou ORCID: orcid.org/0000-0002-3091-1425¹,
Robbin Romijnders²,
Clint Hansen ORCID: orcid.org/0000-0003-4813-3868²,
Jos van Campen³,
Walter Maetzler²,
Tibor Hortobágyi¹ &
…
Claudine J. C. Lamoth¹

Scientific Reports volume 10, Article number: 4426 (2020) Cite this article

5501 Accesses
32 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Prevalence of gait impairments increases with age and is associated with mobility decline, fall risk and loss of independence. For geriatric patients, the risk of having gait disorders is even higher. Consequently, gait assessment in the clinics has become increasingly important. The purpose of the present study was to classify healthy young-middle aged, older adults and geriatric patients based on dynamic gait outcomes. Classification performance of three supervised machine learning methods was compared. From trunk 3D-accelerations of 239 subjects obtained during walking, 23 dynamic gait outcomes were calculated. Kernel Principal Component Analysis (KPCA) was applied for dimensionality reduction of the data for Support Vector Machine (SVM) classification. Random Forest (RF) and Artificial Neural Network (ANN) were applied to the 23 gait outcomes without prior data reduction. Classification accuracy of SVM was 89%, RF accuracy was 73%, and ANN accuracy was 90%. Gait outcomes that significantly contributed to classification included: Root Mean Square (Anterior-Posterior, Vertical), Cross Entropy (Medio-Lateral, Vertical), Lyapunov Exponent (Vertical), step regularity (Vertical) and gait speed. ANN is preferable due to the automated data reduction and significant gait outcome identification. For clinicians, these gait outcomes could be used for diagnosing subjects with mobility disabilities, fall risk and to monitor interventions.

Selecting Clinically Relevant Gait Characteristics for Classification of Early Parkinson’s Disease: A Comprehensive Machine Learning Approach

Article Open access 21 November 2019

Prediction of fall risk among community-dwelling older adults using a wearable system

Article Open access 25 October 2021

XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes

Article Open access 09 June 2021

Introduction

Over the last decades, medical and technical developments have extended human lifespan. However, with the increasing number of adults in society, there is a parallel increase in the number of people with serious impairments of mobility, gait, and postural control¹. Natural aging comes hand in hand with mobility decline and impairments in gait and postural control. When the level of decline in physical and cognitive functions exceeds the degree of decline expected due to the natural aging process, we speak of a geriatric condition. Typical geriatric patients are characterized by co-morbidities such as sarcopenia, cognitive impairment, osteoporosis, weight loss, and frailty^2,3.

Gait disorders are common in older adults; prevalence increases with age and is associated with increased fall risk, mobility decline, and loss of independence⁴. For geriatric patients, the risk of having gait disorders with an increased fall incidence is even higher⁵. Consequently, objective gait assessment in the clinics has become increasingly important for the diagnosis of motor impairments and the assessment of mobility decline and fall risk⁶, as well as for the monitoring of the efficacy of interventions designed to improve mobility⁷. The most often used gait parameter for disability is gait speed. After age 60, gait speed slows by 16% per decade⁸. In geriatric patients, a gait speed below 1.0 m/s signifies an additional clinical or sub-clinical impairment, such as mobility decline, frailty, recurrent falling, loss of independence and institutionalization⁹. Complementary to gait speed, aging impacts the spatial-temporal characteristics of gait, e.g., walking with a shorter step length, larger step width and increased step time or variability of these parameters^4,10. However, gait speed may be insensitive and unselective to accurately classify different age and patient groups with specific mobility disabilities.

Advances in technology, in particular with respect to small, light wearable sensors like inertial measurement units (IMU), have considerably aided the practice of clinical gait analysis. Wearable sensors like accelerometer sensors offer new opportunities for clinicians and researchers to record gait over a longer time and allow the application of methods that quantify how gait evolves over time, e.g., the dynamics of gait^5,11. In addition to gait speed and gait speed related parameters like stride length or stride time, a variety of measures can be derived from these accelerometer signals, that characterize the dynamics of gait through metrics such as, regularity, synchronization, variability, local stability, predictability, smoothness and symmetry^12,13. These gait outcomes characterize the quality of gait and can be considered complementary to each other. However, not all of these gait outcomes are independent of each other (e.g., gait speed and stride time; regularity and symmetry) and may interact in a non-linear fashion¹⁴. To analyze multidimensional gait data, specific mathematical approaches are required to define and extract the most informative features of the data and extract parameters that are characteristic for a certain (clinical) population.

The use of machine learning for human gait analysis is nowadays widely explored¹⁵. Machine learning methods can identify redundancies in a dataset and extract the most informative features of the data by creating new and uncorrelated variables that characterize the original data. Besides, these methods can process high dimensional, non-linear data structures, and based on the learned/trained models, they have the potential to estimate the gait status of new patients¹⁶. Principal Component Analysis (PCA) has been commonly used to extract significant information from a large number of variables^17,18. PCA preserves the variability and multivariate features while decreasing dimensionality to make the data analysis more tractable. PCA creates a set of orthogonal bases that capture the directions of maximum variance for the original dataset, and the uncorrected expansion coefficients in the new dataset¹⁸. However, gait outcomes are not only interrelated with each other but also interact in a complex nonlinear manner¹⁹. Alternatively, kernel PCA (KPCA) can extract higher-order relations among gait outcomes. The kernel function can employ PCA in high dimensional space but ignores the effect on the non-linear structure²⁰. Wu et al. showed that KPCA efficiently reduced 23 non-linear gait variables to 17 gait variables, and consequently increased the Support Vector Machine (SVM) classification accuracy from 85% (SVM classification with PCA) to 91%²¹.

Previous studies have also successfully employed machine learning methods to identify gait abnormality in different populations¹⁵. For instance, Artificial Neural Network (ANN) and SVM are the two most popular machine learning methods in gait analysis²². Begg et al. applied ANN with linear, polynomial and Radial Basis Function (RBF) kernels to age-classify 30 young and 28 older subjects based on their gait, with a classification accuracy of 75%²³. In line with this result, SVM classified differences in spatial-temporal, kinematic and kinetic gait variables from 12 young subjects and 12 older subjects due to aging with a 91.7% accuracy²⁴. In addition to ANN and SVM, various machine learning methods have been successfully employed for the classification of different patient populations based on gait analysis. K-nearest neighbors (KNN) classification method identified different gait pattern of patients with Cerebral Palsy and Multiple Sclerosis from healthy adults with a classification accuracy of 85%²⁵, and of patients with hemiplegia, Parkinson’s disease and back pain with a classification accuracy of 90–98%. However, a limitation of KNN is that it is an instance-based learning method, implying that it only uses the training data for classification but does not learn from it. Similar classification results were obtained from decision tree and Naive Bayes methods²⁶. The Random Forest (RF) was used to identify patients with Parkinson’s disease with time-domain and frequency-domain gait features to obtain 98.04% accuracy²⁷. In recent studies, several machine learning methods have been employed for classifying fallers and non-fallers with a functional test (such as Timed Up and Go) and questionnaire data to obtained high accuracy of 89.4%²⁸.

In sum, these studies support the fact that machine learning methods can be successfully employed for clinical gait analysis to identify differences in gait performance due to pathology, using various types of gait variables. However, in order to be useful for clinical applications, several requirements and constraints need to be considered. In clinical gait analysis, usually the number of variables obtained is high, whereas the number of subjects is relatively low. This may result in an excessively complicated machine learning model with poor predicting performance (overfitting)²². With a limited number of subjects, the best choice might be SVM and RF. The effect of a limited number of subjects (data set) is minimized because the classification of SVM depends on the support vectors and the slack variables (not the entire data set) and on the non-linear variables’ distance, to distinguish different groups²⁹.

However, the black box problem of SVM implies that before classification, significant features should be detected using, for instance (kernel) PCA³⁰. Alternatively, RF can be employed as it is not very sensitive to small data size and is based on decision trees, in which every subject can be repeatedly classified³¹. Nevertheless, RF disregard the intact interactions within and between trees, which might negatively impact the classification performance³². Although the black box problem also exists in the hidden layers of ANN³⁰, the activation functions such as the tangents hyperbolic can properly analyze the complex interactions among the gait variables to improve the classification performance³³. A recent study used deep learning to explain gait patterns based on kinematic and kinetic variables.

Due to no recent study investigated aging effect on gait based on dynamic gait outcomes through more quantitative ways, the aims of the present study are two-folded; Firstly, based on an existing dataset 3D-accelerometer signal of healthy young, middle aged older adults and geriatric patients, we evaluated if different groups can be classified based on dynamic gait outcomes. Dynamic gait variables that quantify the quality of gait over time were used as input for the classification of healthy young-middle aged adults, healthy older adults, and geriatric patients. Secondly, we compared the performance of three machine learning models, KPCA in combination with SVM, RF and ANN that can be used for clinical gait analysis.

Results

Gait outcome identification and classification with KPCA in combination with SVM

The radial basis function and polynomial function were used in KPCA and SVM, however, no differences were found in KPCA and SVM results between the two kernels functions. In the end, the RBF kernel function was employed in the KPCA and the SVM model.

From the KPCA applied to the original data set of 239 subjects, the first five principal components (PC) captured 97% information of the original 23 gait variables.

The different weights of eigenvectors represent the contributions of the gait outcomes on the five PCs (Fig. 1(a)). Gait outcomes achieved weights > = 0.4 were considered significant to the model. PC1 reflected most gait outcomes related to step regularity, step symmetry and amplitude variability (RMS), whereas stability, synchronicity of movement directions, and smoothness were captured by PC2 to PC5.

The extracted PCs of the KPCA were used as the input of the SVM machine learning classifier. To validate the SVM model and decrease the risk of overfitting, LOOCV was used to split the dataset into a training set and a test set for SVM. Figure 1(b) shows the SVM classification (confusion matrix) results for the three groups. The overall classification accuracy is 89.5%. Of the 57 subjects in the healthy young-middle aged group, 4 of them were misclassified and assigned to the healthy older group and 4 were assigned to the geriatric patient group. Of the 55 subjects in the healthy older group, 41 of them were successfully classified into the healthy older group, one was assigned to the young-middle aged group and 13 were assigned to the geriatric group. The 127 geriatric patients were correctly classified with the exception of 3 geriatric patients who were assigned to the healthy older group (Fig. 1(b)).

Gait outcome identification and classification with random forest

Figure 2(a) shows the classification results matrix of the RF method. The RF classification accuracy was 73.6%. Of the 57 subjects in the healthy young-middle aged group, 8 of them were assigned to the healthy older group and 7 were assigned to the geriatric patient group. As is shown in Fig. 2(a), the classification accuracy was worse for the healthy older group. That is 14 healthy older adults were assigned to the young-middle aged group and 24 were assigned to the geriatric group. Finally, 10 of the 117 geriatric patients were misclassified, 6 as healthy young-middle aged and 4 were assigned to the healthy older group.

The gait outcomes that contributed most to the RF classification are presented in Fig. 2(b). Seven of them have larger weights than others (>6), these were the Root Mean Square in AP, ML and V, gait speed, step regularity V, Cross Entropy MLV and Lyapunov exponent V.

Gait outcome identification and classification with artificial neural network

The ANN model obtained the best classification performance with one hidden layer, including three units. The overall classification accuracy was 90.4%. Figure 3(a) shows the classification results matrix of the ANN. 2 of 57 healthy young-middle aged subjects were assigned to the healthy older group and 3 were assigned to the geriatric patient group. Similar to the RF, the classification of the healthy older groups was worse, of the 55 subjects 13 were assigned to the geriatric group. Of 127 geriatric patients 122 patients were correctly classified, only one patient was classified as young-middle aged adult and 4 patients were assigned to the healthy older group. According to the ANN classification results, Fig. 3(b) shows 23 gait outcomes in terms of their weight of the ANN classification. The weight of each gait parameter was calculated from the overall layers. In Fig. 3(b), it is shown that 8 gait outcomes contributed much more to the age-based classification than the others. The 8 gait outcomes (weights > 40) were the Root Means Square AP, V, Cross Entropy APV, MLV, step regularity V, Lyapunov exponent V, stride regularity V and The Index of Harmonicity V.

Evaluation of the machine learning classification approaches

The overall classification performances of SVM, RF and ANN were evaluated by the ROC curve. The AUC of SVM, RF and ANN is 0.91, 0.86 and 0.86, respectively. Figure 4 shows the ROC curve for each machine learning classification model.

For SVM, the sensitivity for these three groups is 86% (healthy Y/M), 75% (healthy old), 98% (geriatric) respectively, and the specificity is 99%, 96%, 85%, respectively.

For RF classification, the sensitivity and specificity for the young-middle aged group are 74% and 87% respectively. The sensitivity and specificity for the healthy elderly group are 31% and 93% respectively. The classification from RF in the geriatric patients without CI has the sensitivity and specificity of 92% and 66% respectively.

For the ANN classification, the sensitivity in these three groups is 91%, 76%, 96% respectively, and the specificity is 99%, 91%, 85%, respectively.

Summary and statistical analysis of the machine learning classification

Table 1 below shows the accuracy and the AUC with a confidence interval (CI) for each model.

Table 1 The accuracy and the Area Under the Curve (AUC) with confidence intervals (CI) for each model.

Full size table

Table 2 shows the sensitivity and specificity with a confidence interval (CI) for each model.

Table 2 Sensitivity and Specificity with confidence intervals (CI) of each group for each model.

Full size table

The abbreviations in Table 2 are: Kernel Principal Component Analysis (KPCA), Support Vector Machine (SVM), Random Forest (RF), Artificial Neural Network (ANN), Healthy young-middle age adults (Healthy Y/M).

Table 3 The demographics of three age groups.

Full size table

Figure 5 shows that there is no significant difference between the three machine learning models’ sensitivity and specificity. This conclusion supports our statement that the classification performance among the three models was similar and valid. However, our statement that ANN has the best classification performance is not supported by this comparison.

Figure 5 (a,b) shows the data with a p-value for the comparison of three machine learning models for sensitivity and specificity, respectively.

Discussion

The purpose of the study was to identify gait characteristics that could classify young-middle age, older and geriatric adults. To that aim three supervised machine learning approaches were compared in terms of their classification performance, and their ability to identify the gait characteristics of interest.

Overall, KPCA in combination with SVM, RF and ANN, had satisfactory classification accuracy for the three groups (89%, 73.6% and 90%). In addition to accuracy values, AUC values were 0.91, 0.86 and 0.86 for SVM, RF and ANN, respectively.

Based on our results we conclude that machine learning methods (SVM, RF and ANN) when applied to dynamic gait outcomes have the potential to distinguish between groups. The dynamic gait outcomes that were important for classifying the three groups were related to gait synchronization (Cross Entropy between medial-vertical and vertical acceleration), regularity (vertical direction), stability (maximal Lyapunov exponent of the vertical acceleration) and pace (gait speed and the variability of the accelerations (RMS) in anterior-posterior and vertical direction). This result is in line with several previous studies, for instance, healthy older adults and geriatric patients have a gait that is more unstable³⁴, more variable and fewer regular³⁵ while the slower pace is also the symptoms to distinguish healthy aging and abnormal aging³⁶.

The combination of gait characteristics used in the present study had a high specificity ranging from 87% to 99% for healthy adults for SVM, RF and ANN, while the specificity to detect the geriatric patient was lower (66–85%). The sensitivity for healthy older subjects (<76%) for SVM, RF and ANN were worse compared to the patient group (>90%). As shown in Fig. 4(b), the misclassification of healthy older subjects, concerned mostly an assignment to the geriatric patient group and vice versa. The lower sensitivity for the healthy older group could be due to the fact that aging is a continuum, with enormous heterogeneity between subjects in particularly at an older age and in geriatric patients³⁷.

The three machine learning approaches showed different classification accuracy, specificity and sensitivity results since they differ in terms of how the algorithms handle non-linearities and interactions between gait outcomes.

Our findings for the KPCA in combination with SVM approach is in line with previous research showing classification accuracy of 91% when comparing young and older adults based on 36 gait spatial-temporal and kinematic gait variables²¹. An advantage of using SVM is that it does not require a large number of subjects (data) because hyperplanes for classification are based on the support vectors and the slack variables²⁹. However, all data are configured to a high dimensional space, changing the structure and making it hard to explain which gait outcomes contribute to the classification results^22,30.

Random Forest (RF) builds multiple decision trees randomly in parallel, considering the correlations of features in every single tree until there is a classification result. Similar to SVM, RF can provide limited datasets because it categorizes the subject by multiple decision trees³¹, the samples in the dataset can be repeatedly selected to be classified in these decision trees³¹. RF is more visible than SVM and ANN, and can output the significant features for the small size of clinical data³². The Layer-Wise Relevance Propagation technique in combination with deep learning was used to address the black box problem³⁸. However, deep learning usually requires a large amount of data and more suitable for raw data from accelerometer signals. The size of the data and the structure dynamic gait outcomes in this study are not proper for deep learning.

Irrespective of the heterogeneous population, ANN, showed only five misclassifications in the healthy young-middle age group, two were assigned to the healthy older group and three to the patient group showing an impressive performance for this group in particular. In contrast to both SVM and RF, ANN can analyze the complex structure among variables, by using various activation functions (e.g., Tanh, Sigmoid) even though, similar to SVM the variable interactions are not visible^30,33. Although a large data set is required to find the optimal activation function and avoid overfitting²², ANN has the capacity to adapt for the limited dataset with suitable activation functions which may need to be adjusted depending on the type of data^33,39, to build a small neural network.

While KPCA reduces the data dimensionality as a step toward a subsequent SVM, ANN can automatically reduce the data dimensionality and identify gait outcomes with high weights. RF has good visibility to show significant gait outcomes similar to ANN, however, RF showed lower classification accuracy of healthy older and geriatric patients than ANN. From the results in Fig. 5 we can see there is no significant statistical difference between the three methods. However, ANN has an important advantage, for clinical applications because it does not only have good classification performance, but also those gait features that contributed to the classification can be identified. In contrast, SVM suffers from the “black box problem” which can only be solved by applying the first KPCA. Therefore, we suggest that ANN is more preferable to the current gait data set.

With regard to the selection of machine learning methods, what needs to be taken into account is that the specific selection of machine learning methods depends on the type of data as well as the sample size. If the machine learning method is too simple to model the data, the high bias and low variance model will underfit the classification results. In contrast, if the machine learning method is too complicated for the data structure, the low bias high variance model will overfit the classification results²². In clinical data analysis, machine learning methods have the advantage that no prior clinical/gait feature selection is necessary as the features can be automatically selected and used for classification. Yet, to be of clinical relevance, it is important that the results of the machine learning parameters are translated into meaningful clinical knowledge despite the complex interactions among the variables leading to the classification^22,30.

For the clinician, machine learning methods will not replace but can support and assist human clinical decision-making. For instance, the identified gait outcomes could be used for new patients, for diagnosing fall risk, to monitor the interventions in patients’ daily lives and to optimize the efficacy of specific rehabilitation protocols²². For instance, the specific gait outcomes identified by the models can be measured in new patients, to identify the at-risk gait early on, diagnose the potential disorders and to finally determine the patients with a high risk of falling or mobility decline¹⁶.

To improve the classification of patient groups (e.g., geriatric patients) with numerous co-morbidities in the future, additional variables should be added to the model to improve its clinical value. The three domains, body structure and functions, activity and participation, of the International Classification of Functioning Disability and Health (ICF) provide a framework for including additional variables in the classification model. Future studies should focus on applying pattern recognition methods to identify gait abnormalities in different patient populations based on broader types of parameters reflecting the different domains of the ICF e.g. the Fall Efficacy Scale (FES_I), the Charlson Comorbidity index and further cognition tests^5,40.

In summary, the present study identified gait characteristics (gait features in synchronization, regularity, stability, variability and pace) that distinguish healthy young-middle aged adults, healthy old adults, and geriatric patients without cognitive impairment using three different classification models. In the future, the gait outcomes identified in this study could be used in the clinics to diagnose patients and monitor interventions in patients. Overall, classifier performance was good, although KPCA in combination with SVM (best AUC) and ANN (best performance) performed slightly better than RF. However, the automated data reduction, classification accuracy and identification of gait outcomes make ANN superior for classifying different age groups based on dynamic gait outcomes. The most difficult group to classify were the healthy older adults, due to the heterogeneity of their gait and the fact that a part of this population might be in a preclinical phase towards geriatric symptoms. Incorporating objective and subjective measures at the different levels of the ICF model in the future, could improve the classification of clinical gait analysis even further, thereby adding to its clinical value.

Methods

Data description

Data from different studies^{11,41,42,43,44}were pooled to create the present accelerometer dataset including 239 participants in three sub-groups: the young-middle aged group (18–65), the healthy older group (>65), and a group of geriatric patients without cognitive impairment (CI) (Table 3). Data from the geriatric patients were obtained between 2009 and 2018^41,42,44. The young-middle aged and healthy older participants were recruited from a population that didn’t visit the geriatric clinic, by means of advertising in local papers, community centers, and by word of mouth. Data of geriatric patients were obtained from an existing database of a geriatric day clinic in a Hospital. These were patients that visited a geriatric day clinic based on a medical referral by a general practitioner. These patients underwent extensive screening for physical, psychological, and cognitive functions. Criteria for excluding patients from these studies were: (1) inability to walk for three minutes without a walking aid, (2) neurological disorders (e.g., Parkinson’s disease, history of stroke), (3) severe mobility disability caused by pain and/or orthopedic conditions, and (4) the inability to speak and understand the Dutch language. Data of healthy young-middle aged and older adults were obtained from previous studies^11,43. Procedures followed were in accordance with the Declaration of Helsinki 2000 and all studies were approved by the Medical Ethics Committee of the MC Slotervaart (geriatric patients) or by the ethical committee of the Centre of Human Movement Science Groningen of the University Medical Centre Groningen. All participants of who participated in the studies signed an informed consent form. All subjects followed the same walking test protocols. Subjects walked for three minutes at a comfortable walking speed without aid. During walking trunk accelerations were measured either using the iPod Touch G4 (iOS 6; Apple Inc.), which has a built-in tri-axial acceleration sensor, or using a stand-alone accelerometer unit, the DynaPort hybrid unit (McRoberts BV, The Hague, the Netherlands)⁴⁵.

From trunk acceleration signals, anterior-posterior (AP), medio-lateral (ML) and vertical (V) direction gait outcomes related to the quality of gait, were calculated using custom-made software in MATLAB (version 2014b; The MathWorks Inc.). From the signals, 23 dynamic gait variables were calculated quantifying pace, predictability, regularity, stability, synchronization and smoothness (for a detailed explanation see^11,41). In brief, gait speed (GaitSpeed) was calculated dividing distance walked (m) by time (s). The Root Mean Square (RMS) is a measure for the variability of the amplitude of accelerations. The Index of Harmonicity (IH) is a measure of gait smoothness. Values range between 0–1, and an IH of 1 reflects a perfectly smooth gait. Multiscale Entropy (msEn) quantifies the predictability at different time scales, testing the complexity of the signal. A value of 0 reflects a completely predictable gait parameter. The Cross-sample Entropy (CrEn) quantifies the degree of synchronization between AP and ML, AP and V, and ML and V accelerations. A value of 0 reflects perfect synchronization between acceleration signals. The maximal Lyapunov exponent (LyaP) represents the local stability of trunk acceleration patterns, as calculated by the Wolff algorithm. Higher values indicate greater sensitivity to local perturbations. The unbiased auto-correlation function of the acceleration signal in AP and V directions was used to calculated gait step or stride regularity (Step/StrideReg) and symmetry (Symm). The signal was phase shifted with a window approximating average step and stride time. Perfectly regular steps or strides are reflected by a value of one. The difference between step and stride regularity showed the gait symmetry, zero representing a perfect symmetric gait. Finally, frequency variability (FreqVar) reflects the relative fluctuations in step frequency. More details of all gait variables with three groups were shown in supplementary information on Table 4.

Gait outcomes standardization

Overall, Fig. 6 shows the procedures of the data analysis in the present study. The inter-relationships between the calculated gait outcomes could provide a better understanding of how the process of aging impacts gait. However, before using machine learning approaches to classify age groups based on these gait dynamic outcomes, standardization is needed, since all gait outcomes are calculated at different scales, for example, gait speed in meter/second, IH in scale 0–1. Machine learning algorithms are sensitive to the scales of variables; therefore, the Z-scores method was used to standardize the data.

Gait outcomes extraction

To reduce the dimensionality of calculated outcomes while preserving the informative and variability properties and improving classification performance the KPCA was employed. KPCA produces orthogonal bases to capture the directions of maximum variance and the uncorrelated expansion coefficients²¹. Using the KPCA approach, the non-linear input data was mapped to a high dimensional space by different kernel functions, such as linear, polynomial and Gaussian radial basis function (RBF). Then the formal PCA was employed in this new feature space. The formal KPCA algorithm is: the non-linear gait outcomes in the original space are mapped to a high dimensional space Ƒ through kernel functions: ${x}_{i}\to \varnothing ({x}_{i})$ and two inputs ${x}_{i}$ and ${x}_{j}$, which represent two gait outcomes as the examples in original space⁴⁶.

In the present study, two types of kernel functions were taken into account; the first is the polynomial (Poly), d is the degree of Poly kernel:

$$\begin{array}{c}{\rm{K}}({x}_{i},{x}_{j})={(({x}_{i}\cdot {x}_{j})+1)}^{d}\end{array}$$

(1)

The second kernel function is the RBF, $\sigma $ is the width of the RBF:

$$\begin{array}{c}{\rm{K}}({x}_{i},\,{x}_{j})=\exp -\frac{{||{x}_{i}-{x}_{j}||}^{2}}{2{\sigma }^{2}}\end{array}$$

(2)

The data centered by the following equation:

$$\begin{array}{c}\hat{K}({x}_{i},\,{x}_{j})=K-{1}_{N}K+{1}_{N}K{1}_{N}-K{1}_{N}\end{array}$$

(3)

The new gait outcomes in the new high dimensional space are:

$$\begin{array}{rcl}{y}_{i} & = & \varnothing {({x}_{i})}^{\top }{v}_{j}=\mathop{\sum }\limits_{i=1}^{n}{\alpha }_{ij}\varnothing {(x)}^{\top }\varnothing ({x}_{k})\\ & = & \mathop{\sum }\limits_{i=1}^{n}{\alpha }_{j{\rm{i}}}K(x,{x}_{j}),j=1\ldots d\end{array}$$

(4)

In the present study, the entire dataset was used for feature selection because KPCA is an unsupervised process. The components including more than 90% values of the sum of the eigenvalues, were selected as the principal components (PCs). These PCs include almost all information from the original dataset but now with low dimensionality. The distribution of the eigenvectors on each PC shows the contributions of each original gait outcomes to a PC.

Cross-validation

The number of subjects in the dataset is relatively low for machine learning approaches. To avoid overfitting²², a cross-validation method was used to split the dataset into subsets for training the model, adjusting the model’s parameters and evaluating the classification performance. In this study, the robust cross-validation method “leave one out method” (LOOCV) was applied because it does not randomly partition the data but every subject is used to test the model to reduce the bias. One subset was used to test the performance and k-1 subsets were used to train the machine learning model. Then for each k (equals number of subjects) subset the classification performance was evaluated²⁵.

Classification

The machine learning methods Support Vector Machine (SVM), Random Forest (RF) and Artificial neural network (ANN) were used in the present study to classify the groups based on gait outcomes. The optimal hyper-parameters are based on the overall classification performance.

Support vector machine (SVM)

SVM was used as a classifier to predict subjects’ groups based on their gait performance. In SVM, referring to equation (1) and (2), the two kernel functions map the original data to a high dimensional feature space by finding hyperplanes for different classes, and to maximize the margin between different classes⁴⁷. The output from KPCA was used as the input of SVM classification. The hyperplane for different age scales is defined as:

$$\begin{array}{c}\varnothing (x)=sign({\sum }^{}\begin{array}{c}{\beta }_{i}{y}_{i}K({x}_{i},{x}_{j})+b\end{array})\end{array}$$

(5)

$K({x}_{i}\,,\,{x}_{j})$ is a kernel function, b is the bias of the training data, β is the coefficient of the separating hyperplane. Then the distance between each class and the hyperplane is maximized:

$$\begin{array}{c}\min \,W(\beta )=-\,{\beta }^{{\top }}I+\frac{1}{2}{\beta }^{{\top }}H\beta \end{array}$$

(6)

subject to ${\beta }^{{\top }}\,y=0\,$, finally, the new data is classified into a given class:

$$\begin{array}{c}H={y}_{i}{y}_{j}K({x}_{i}\,,\,{x}_{j}),i,j=1\ldots M\end{array}$$

(7)

All these labeled subjects were classified by SVM with Poly as in (1) and RBF as in (2) kernel functions.

Random forest

The Random Forest (RF) method builds various decision trees and merges them to obtain the optimal classification performance. A decision tree suggests the best option to classify the subjects into the three groups. RF combines several decision trees. The subjects were repeatedly classified by each tree. In the end, RF selected the best classification result including the importance of each gait parameter. From the training set ${\{({x}_{i},{y}_{i})\}}_{i=1}^{n}$ (includes n rows ${x}_{i}$ and n columns ${y}_{i}$), a set of m trees were built with individual weight functions ${W}_{j}$ in the individual tree leaf $j$, the prediction $\hat{y}$ of the new testing set $x{\prime} $ is⁴⁸:

$$\begin{array}{rcl}\hat{y} & = & \frac{1}{m}\mathop{\sum }\limits_{j=1}^{m}\mathop{\sum }\limits_{i=1}^{n}{W}_{j}({x}_{i},x{\prime} ){y}_{i}\\ & = & \mathop{\sum }\limits_{i=1}^{n}\frac{1}{m}\mathop{\sum }\limits_{j=1}^{m}{W}_{j}({x}_{i},x{\prime} ){y}_{i}{y}_{i}\end{array}$$

(8)

The dataset was split into a training and testing set, based on the above described LOOCV method. The original 23 gait outcomes served as M input variables, and m variables (m≪M) were randomly selected to build m decision trees for RF. The value of m remains unchanged during forest growth. Each tree is grown to the most significant extent possible.

Artificial neural network (ANN)

The multi-layer perceptron ANN consists of input, output, and one hidden layer with some units. In the ANN system, the artificial neurons are interconnected and communicate with each other. Each connection is weighted by previous learning events and the weight between artificial neurons is adjusted as learning progresses. In the end, the input subjects will be classified into a group through the optimal connection way⁴⁹.

For the components in the ANN, there are neurons, connections, weights, biases, propagation functions, and learning rules. A neuron with label j (here, the label j is healthy young-middle aged, healthy older and geriatric patient without cognitive impairment) receiving an input ${p}_{j}(t)\,\,$consists of the following components: the neuron’s state activation variable ${a}_{j}(t)$, depending on the time parameter t, an activation function $f$ that computes the new activation at a given time t + 1 from ${a}_{j}(t)$, ${\theta }_{j}$ is a fixed threshold, and the net input ${p}_{j}(t)\,\,$giving rise to the relation:

$$\begin{array}{c}{a}_{t}(t+1)=f({a}_{j}(t),{p}_{j}(t),{\theta }_{j})\end{array}$$

(9)

and then the output is computed by the function:

$$\begin{array}{c}{O}_{j}(t)={f}_{out}({a}_{j}(t))\end{array}$$

(10)

ANN consists of connections, each connection transferring the output of a neuron i to the input of a neuron j, each connection is assigned a weight ${W}_{ij}$. And a bias ${b}_{0j}\,\,$was added to the total weighted sum of inputs to shift the activation function. To compute the input to the neuron j from the given three age-based groups, the equation shown below adds the bias value:

$$\begin{array}{c}{p}_{j}(t)=\sum _{i}{O}_{i}(t){W}_{ij}+{b}_{0j}\end{array}$$

(11)

Then the learning process was constructed to modifying the weights and thresholds of the variables within the network.

In the present study, the 23 gait variables were the neurons in the input layer. This model has one hidden layer with three units. There are three output neurons, one for each group, healthy young-middle age adult group, healthy older adult group and geriatric patients’ group. The activation function of ANN is the “Rectified Linear Unit (ReLU).

In this study, ANN classification performance is based on LOOCV, k-1 sets of training and 1 set of tests. This process was repeated k times (k = 239). The best hyper-parameters of ANN were decided from the 239 LOOCV iterations of model training and testing, for example, after using LOOCV to train and test ANN, three units in a hidden layer showed the best performance of classification.

Evaluation of classification

The accuracy, sensitivity and specificity were calculated to evaluate the performance of the three machine learning classifiers to identify gait for the groups. Sensitivity represents the proportion of those subjects that are assigned to the correct group (true positive rate), and specificity represents those subjects that are rejected to assign to the incorrect group (true negative rate). In the receiver operating characteristic (ROC) curve plot, the y-axis represents the sensitivity and the x-axis represents the 1- specificity. The area under the ROC curve (AUC) provides an overall evaluation of the classification. The baseline of AUC is 0.5, and the perfect machine learning classification model has the AUC = 1.

To statistically test differences between the three machine learning models for sensitivity and specificity we applied a Mann-Whitney U test. The different classes were three models: KPCA in combination with SVM: class 0; RF: class 1; ANN: class 2. Then the three sensitivity values and specificity values across the three groups were regarded as two variables in the model.

Data availability

The data of all 239 participants and 23 gait variables are not publicly available due to Institute Review Board related matters, but available from the principal investigator Claudine Lamoth (c.j.c.Lamoth@umcg.nl) upon reasonable request.

References

Brown, C. J. & Flood, K. L. Mobility Limitation in the Older Patient. JAMA 310, 1168 (2013).
Article CAS Google Scholar
Marengoni, A. et al. Aging with multimorbidity: A systematic review of the literature. Ageing Res. Rev. 10, 430–439 (2011).
Article Google Scholar
Inouye, S. K. et al. Geriatric Syndromes: Clinical, Research, and Policy Implications of a Core Geriatric Concept. J. Am. Geriatr. Soc. 55, 780–791 (2007).
Article Google Scholar
Aboutorabi, A., Arazpour, M., Bahramizadeh, M., Hutchins, S. W. & Fadayevatan, R. The effect of aging on gait parameters in able-bodied older subjects: a literature review. Aging Clin. Exp. Res. 28, 393–405 (2016).
Article Google Scholar
Kikkert, L. H. J. et al. Gait dynamics to optimize fall risk assessment in geriatric patients admitted to an outpatient diagnostic clinic. PLoS One 12, e0178615 (2017).
Article Google Scholar
Barth, J. et al. Biometric and mobile gait analysis for early diagnosis and therapy monitoring in Parkinson’s disease. In 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society 868–871 (IEEE). https://doi.org/10.1109/IEMBS.2011.6090226 (2011)
Shull, P. B., Jirattigalachote, W., Hunt, M. A., Cutkosky, M. R. & Delp, S. L. Quantified self and human movement: A review on the clinical impact of wearable sensing and feedback for gait analysis and intervention. Gait Posture 40, 11–19 (2014).
Article Google Scholar
Abellan van Kan, G. et al. Gait speed at usual pace as a predictor of adverse outcomes in community-dwelling older people an International Academy on Nutrition and Aging (IANA) Task Force. J. Nutr. Health Aging 13, 881–9 (2009).
Article CAS Google Scholar
Peel, N. M., Kuys, S. S. & Klein, K. Gait Speed as a Measure in Geriatric Assessment in Clinical Settings: A Systematic Review. Journals Gerontol. Ser. A 68, 39–46 (2013).
Article Google Scholar
Hollman, J. H., McDade, E. M. & Petersen, R. C. Normative spatiotemporal gait parameters in older adults. Gait Posture 34, 111–118 (2011).
Article Google Scholar
Kosse, N. M., Vuillerme, N., Hortobágyi, T. & Lamoth, C. J. Multiple gait parameters derived from iPod accelerometry predict age-related gait changes. Gait Posture 46, 112–117 (2016).
Article Google Scholar
Hamacher, D., Singh, N. B., Van Dieën, J. H., Heller, M. O. & Taylor, W. R. Kinematic measures for assessing gait stability in elderly individuals: a systematic review. J. R. Soc. Interface 8, 1682–1698 (2011).
Article CAS Google Scholar
Kavanagh, J. J. & Menz, H. B. Accelerometry: A technique for quantifying movement patterns during walking. Gait Posture 28, 1–15 (2008).
Article Google Scholar
Beauchet, O. et al. Walking speed-related changes in stride time variability: effects of decreased speed. J. Neuroeng. Rehabil. 6, 32 (2009).
Article Google Scholar
Figueiredo, J., Santos, C. P. & Moreno, J. C. Automatic recognition of gait patterns in human motor disorders using machine learning: A review. Med. Eng. Phys. 53, 1–12 (2018).
Article Google Scholar
Phinyomark, A., Petri, G., Ibáñez-Marcelo, E., Osis, S. T. & Ferber, R. Analysis of Big Data in Gait Biomechanics: Current Trends and Future Directions. J. Med. Biol. Eng. 38, 244–260 (2018).
Article Google Scholar
Reid, S. M., Graham, R. B. & Costigan, P. A. Differentiation of young and older adult stair climbing gait using principal component analysis. Gait Posture 31, 197–203 (2010).
Article Google Scholar
Daffertshofer, A., Lamoth, C. J. C., Meijer, O. G. & Beek, P. J. PCA in studying coordination and variability: a tutorial. Clin. Biomech. 19, 415–428 (2004).
Article Google Scholar
Quach, L. et al. The nonlinear relationship between gait speed and falls: the Maintenance of Balance, Independent Living, Intellect, and Zest in the Elderly of Boston Study. J. Am. Geriatr. Soc. 59, 1069–73 (2011).
Article Google Scholar
Rosipal, R., Girolami, M., Trejo, L. J. & Cichocki, A. Kernel PCA for Feature Extraction and De-Noising in Nonlinear Regression. Neural Comput. Appl. 10, 231–243 (2001).
Article Google Scholar
Wu, J., Wang, J. & Liu, L. Feature extraction via KPCA for classification of gait patterns. Hum. Mov. Sci. 26, 393–411 (2007).
Article Google Scholar
Halilaj, E. et al. Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities. J. Biomech. 81, 1–11 (2018).
Article Google Scholar
Begg, R. K., Palaniswami, M. & Owen, B. Support Vector Machines for Automated Gait Classification. IEEE Trans. Biomed. Eng. 52, 828–838 (2005).
Article Google Scholar
Begg, R. & Kamruzzaman, J. A machine learning approach for automated recognition of movement patterns using basic, kinetic and kinematic gait data. J. Biomech. 38, 401–408 (2005).
Article CAS Google Scholar
Alaqtash, M. et al. Automatic classification of pathological gait patterns using ground reaction forces and machine learning algorithms. in 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society 453–457 (IEEE). https://doi.org/10.1109/IEMBS.2011.6090063 (2011)
Pogorelc, B., Bosnić, Z. & Gams, M. Automatic recognition of gait-related health problems in the elderly using machine learning. Multimed. Tools Appl. 58, 333–354 (2012).
Article Google Scholar
Açıcı, K. et al. A Random Forest Method to Detect Parkinson’s Disease via Gait Analysis. in 609–619 (Springer, Cham). https://doi.org/10.1007/978-3-319-65172-9_51 (2017)
Chapter Google Scholar
Qiu, H., Rehman, R. Z. U., Yu, X. & Xiong, S. Application of Wearable Inertial Sensors and A New Test Battery for Distinguishing Retrospective Fallers from Non-fallers among Community-dwelling Older People. Sci. Rep. 8 (2018).
Matykiewicz, P. & Pestian, J. Effect of small sample size on text categorization with support vector machines. in Proceedings of the 2012 workshop on biomedical natural language processing 193–201 (Association for Computational Linguistics (2012).
Dinov, I. D. Black Box Machine-Learning Methods: Neural Networks and Support Vector Machines. in Data Science and Predictive Analytics 383–422 (Springer International Publishing. https://doi.org/10.1007/978-3-319-72347-1_11 (2018)
Chapter Google Scholar
Loh, W.-Y. Variable Selection for Classification and Regression in Large p, Small n Problems. In Probability Approximations and Beyond (eds. Andrew Barbour, Hock Peng Chan, D. S.) 133–157 (Springer New York (2012).
Poggi, R. G.-J.-M. & Tuleau, C. Random Forests: some methodological insights. arXiv preprint arXiv 32 (2008).
Bircanoglu, C. & Arica, N. A comparison of activation functions in artificial neural networks. in 2018 26th Signal Processing and Communications Applications Conference (SIU) 1–4 (IEEE). https://doi.org/10.1109/SIU.2018.8404724 (2018)
Mehdizadeh, S. The largest Lyapunov exponent of gait in young and elderly individuals: A systematic review. Gait Posture 60, 241–250 (2018).
Article Google Scholar
Bautmans, I., Jansen, B., Van Keymolen, B. & Mets, T. Reliability and clinical correlates of 3D-accelerometry based gait analysis outcomes according to age and fall-risk. Gait Posture 33, 366–372 (2011).
Article Google Scholar
Studenski, S. et al. Gait Speed and Survival in Older Adults. JAMA 305, 50 (2011).
Article CAS Google Scholar
Franceschi, C. et al. The Continuum of Aging and Age-Related Diseases: Common Mechanisms but Different Rates. Front. Med. 5, 61 (2018).
Article Google Scholar
Horst, F., Lapuschkin, S., Samek, W., Müller, K. R. & Schöllhorn, W. I. Explaining the unique nature of individual gait patterns with deep learning. Sci. Rep. 9 (2019).
Pasini, A. Artificial neural networks for small dataset analysis. J. Thorac. Dis. 7, 953–60 (2015).
PubMed PubMed Central Google Scholar
van Schooten, K. S. et al. Ambulatory Fall-Risk Assessment: Amount and Quality of Daily-Life Gait Predict Falls in Older Adults. Journals Gerontol. Ser. A Biol. Sci. Med. Sci. 70, 608–615 (2015).
Article Google Scholar
Kikkert, L. H. J. et al. Gait characteristics and their discriminative power in geriatric patients with and without cognitive impairment. J. Neuroeng. Rehabil. 14, 84 (2017).
Article Google Scholar
Lamoth, C. J. et al. Gait stability and variability measures show effects of impaired cognition and dual tasking in frail people. J. Neuroeng. Rehabil. 8, 2 (2011).
Article Google Scholar
IJmker, T. & Lamoth, C. J. C. Gait and cognition: The relationship between gait stability and variability with executive function in persons with and without dementia. Gait Posture 35, 126–130 (2012).
Article Google Scholar
de Groot, M. H. et al. The Association of Medication-Use and Frailty-Related Factors with Gait Performance in Older Patients. PLoS One 11, e0149888 (2016).
Article Google Scholar
Kosse, N. M., Caljouw, S., Vervoort, D., Vuillerme, N. & Lamoth, C. J. C. Validity and Reliability of Gait and Postural Control Analysis Using the Tri-axial Accelerometer of the iPod Touch. Ann. Biomed. Eng. 43, 1935–1946 (2015).
Article Google Scholar
Schölkopf, B., Smola, A. & Müller, K.-R. Nonlinear Component Analysis as a Kernel Eigenvalue Problem. Neural Comput. 10, 1299–1319 (1998).
Article Google Scholar
Cristianini, N. & Shawe-Taylor, J. Support vector machines. in An introduction to support vector machines: and other kernel-based learning methods 93–124 (Cambridge University Press (2000).
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Bishop, C. M. Neural Networks. in Pattern Recognition and Machine Learning (eds. Jordan, M., Kleinberg, J. & Scholkopf, B.) 225–284 (Springer (2006).

Download references

Author information

Authors and Affiliations

Center for Human Movement Sciences, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
Yuhan Zhou, Tibor Hortobágyi & Claudine J. C. Lamoth
Department of Neurology, University Hospital Schleswig-Holstein, Christian-Albrechts-Universität Kiel, Kiel, Germany
Robbin Romijnders, Clint Hansen & Walter Maetzler
Department of Geriatric Medicine, OLVG hospital, Amsterdam, The Netherlands
Jos van Campen

Authors

Yuhan Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Robbin Romijnders
View author publications
You can also search for this author in PubMed Google Scholar
Clint Hansen
View author publications
You can also search for this author in PubMed Google Scholar
Jos van Campen
View author publications
You can also search for this author in PubMed Google Scholar
Walter Maetzler
View author publications
You can also search for this author in PubMed Google Scholar
Tibor Hortobágyi
View author publications
You can also search for this author in PubMed Google Scholar
Claudine J. C. Lamoth
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.Z. and C.J.C.L. designed the study, analyzed the data and wrote the paper under supervision of C.J.C.L. Under supervision of J.V.C. collected the data from participants. R.R., C.H., W.M. and T.H. revised the draft critically. All authors reviewed the manuscript.

Corresponding author

Correspondence to Yuhan Zhou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, Y., Romijnders, R., Hansen, C. et al. The detection of age groups by dynamic gait outcomes using machine learning approaches. Sci Rep 10, 4426 (2020). https://doi.org/10.1038/s41598-020-61423-2

Download citation

Received: 06 November 2019
Accepted: 26 February 2020
Published: 10 March 2020
DOI: https://doi.org/10.1038/s41598-020-61423-2

This article is cited by

Random forest algorithms to classify frailty and falling history in seniors using plantar pressure measurement insoles: a large-scale feasibility study
- Emi Anzai
- Dian Ren
- Yuji Ohta
BMC Geriatrics (2022)
A machine-vision-based frailty index for mice
- Leinani E. Hession
- Gautam S. Sabnis
- Vivek Kumar
Nature Aging (2022)
Anticholinergic and Sedative Medications and Dynamic Gait Parameters in Older Patients
- Hans Wouters
- Jos P. Van Campen
- Claudine J. C. Lamoth
Drugs & Aging (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Selecting Clinically Relevant Gait Characteristics for Classification of Early Parkinson’s Disease: A Comprehensive Machine Learning Approach

Prediction of fall risk among community-dwelling older adults using a wearable system

XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes

Introduction

Results

Gait outcome identification and classification with KPCA in combination with SVM

Gait outcome identification and classification with random forest

Gait outcome identification and classification with artificial neural network

Evaluation of the machine learning classification approaches

Summary and statistical analysis of the machine learning classification

Discussion

Methods

Data description

Gait outcomes standardization

Gait outcomes extraction

Cross-validation

Classification

Support vector machine (SVM)

Random forest

Artificial neural network (ANN)

Evaluation of classification

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Random forest algorithms to classify frailty and falling history in seniors using plantar pressure measurement insoles: a large-scale feasibility study

A machine-vision-based frailty index for mice

Anticholinergic and Sedative Medications and Dynamic Gait Parameters in Older Patients

Comments

Search

Quick links