A comparison of machine learning models’ accuracy in predicting lower-limb joints’ kinematics, kinetics, and muscle forces from wearable sensors

Moghadam, Shima Mohammadi; Yeung, Ted; Choisne, Julie

doi:10.1038/s41598-023-31906-z

Download PDF

Article
Open access
Published: 28 March 2023

A comparison of machine learning models’ accuracy in predicting lower-limb joints’ kinematics, kinetics, and muscle forces from wearable sensors

Shima Mohammadi Moghadam¹,
Ted Yeung¹ &
Julie Choisne¹

Scientific Reports volume 13, Article number: 5046 (2023) Cite this article

3940 Accesses
18 Citations
Metrics details

Subjects

Abstract

A combination of wearable sensors’ data and Machine Learning (ML) techniques has been used in many studies to predict specific joint angles and moments. The aim of this study was to compare the performance of four different non-linear regression ML models to estimate lower-limb joints’ kinematics, kinetics, and muscle forces using Inertial Measurement Units (IMUs) and electromyographys’ (EMGs) data. Seventeen healthy volunteers (9F, 28 ± 5 years) were asked to walk over-ground for a minimum of 16 trials. For each trial, marker trajectories and three force-plates data were recorded to calculate pelvis, hip, knee, and ankle kinematics and kinetics, and muscle forces (the targets), as well as 7 IMUs and 16 EMGs. The features from sensors’ data were extracted using the Tsfresh python package and fed into 4 ML models; Convolutional Neural Networks (CNN), Random Forest (RF), Support Vector Machine, and Multivariate Adaptive Regression Spline for targets’ prediction. The RF and CNN models outperformed the other ML models by providing lower prediction errors in all intended targets with a lower computational cost. This study suggested that a combination of wearable sensors’ data with an RF or a CNN model is a promising tool to overcome the limitations of traditional optical motion capture for 3D gait analysis.

Algorithm based on one monocular video delivers highly valid and reliable gait parameters

Article Open access 07 July 2021

Lower limb kinematic, kinetic, and EMG data from young healthy humans during walking at controlled speeds

Article Open access 12 April 2021

A wearable motion capture device able to detect dynamic motion of human limbs

Article Open access 05 November 2020

Introduction

Three-dimensional gait analysis (3DGA) provides quantitative information on the locomotion system and lower-limb functionality level during gait. 3DGA is an effective way to monitor changes in gait and is commonly used in hospitals and gait clinics. However, due to the optical motion capture (OMC) system cost and the time needed for pre- and post-processing of the data, gait clinics are sparse, and the waiting time to get assessed can become quite high. Moreover, OMC systems and force plates need to be set up in a controlled environment, such as a lab or a clinic, which has been shown to affect human gait^1,2.

With the emergence of lightweight and inexpensive wearable sensors, collecting human gait data outside the clinic has been made feasible. Inertial Measurement Unit (IMU) and Electromyography (EMG) are two types of wearable sensors that are becoming widely used in 3DGA. IMUs are made of a single electronics module combining three accelerometers and three gyroscopes which respectively collect linear acceleration and angular velocity in 3 dimensions³. EMG electrodes are placed on the person’s skin at the muscle’s belly location and indirectly measure the electrical signals transmitted by motor neurons that cause muscles to contract^4,5. Although wearable sensors are very promising in motion analysis, barriers exist to their widespread clinical adaptation. First, the integration of acceleration data to determine the IMUs’ position and orientation causes numerical drift errors over time⁶. Second, scaling the EMG signal to the patient’s maximum voluntary muscle contraction to determine muscle activation is not always feasible in patients with impaired muscle forces, such as stroke patients or children with Cerebral Palsy^7,8.

To overcome challenges associated with wearable sensors’ data processing limitations, regression-based machine learning (ML) techniques can be used. ML models can establish a direct relationship between wearable sensors’ data and intended targets; such as joint kinematics, joint kinetics, and muscle forces in this study. Training an ML model would enable us to predict targets for either (1) a specific patient at a different time point/session (intra-subject model) or (2) additional unseen patients (inter-subject model). Furthermore, data-driven models will enable joint kinetics calculations without ground reaction force data (from force plates) and eliminate the need for expensive motion capture equipment. It is worth mentioning that for the intra-subject model, one session of data collection in a lab with an OMC system would be required to build a specific ML model for each participant. After one session of data collection in the lab (OMC + IMUs + EMGs), the IMU and EMG sensors’ data can be collected during rehabilitation and outside the clinic without the need for an OMC system to enable clinicians to quantify the patient’s recovery progress until the end of the treatment/procedure.

Neural Networks (NN), Random Forest (RF), and Support Vector Machines (SVM) are powerful ML algorithms that can be used for regression even when non-linearity exists inside the targets. In recent years, a few research groups have implemented these algorithms to estimate gait time series from wearable sensors. Some have looked at joint kinematics^{9,10,11,12,13,14,15,16,17,18,19,20,21,22}, others at joint kinetics^12,23,24,25 but also gait parameters such as stride length, velocity, and toe clearance^{26,27,28,29,30}. To date, NNs are the most used ML model to predict joint kinematics and kinetics from IMUs^{10,16,18,20,23,25,27,28,31,32}. Most of the mentioned studies^{10,18,23,25,31,32} used classic feedforward neural networks and achieved correlation coefficients higher than 0.86. However, it has been shown that convolutional neural networks (CNN) outperform classic NN models in gait time-series prediction, especially for joint kinematics^33,34. To the best of our knowledge, there is only one research³⁵ in which neural networks are implemented to predict muscle activations from EMG data to estimate joint kinetics in a forward dynamics model. Unfortunately, they did not determine muscle forces based on the predicted muscle activations. Bolam et al.³⁶ developed an RF model to predict maximum knee flexion angle and provided a reliable workflow to remotely monitor post-operative progress in knee arthroplasty patients. Estimation of hip, knee, and ankle joint angles in the sagittal plane (but not joint kinetics and muscle forces) was performed in another study¹⁷ using five ML algorithms (multiple linear regression, RF, SVM, back propagation neural network and eXtreme gradient boosting). SVM models have been used mainly to predict gait parameters (stride length and width, stride time, and foot clearance)^26,29,30 rather than the prediction of joint kinematics and kinetics. Multivariate Adaptive Regression Splines (MARS) is another powerful ML method that is an extension of linear models and automatically models non-linearities and interactions between variables. It seems that MARS has not been investigated to predict joint kinematics, kinetics, and muscle forces during 3D gait analysis yet.

Although the performance of different ML models has been investigated in some studies^{10,11,12,13,14,15,16,17,18,19,23,24,25,26,27,28,29,30,31,32,35,36}, there is a lack of consensus on which ML algorithm is the most accurate for predicting joint kinematics, kinetics, and muscle forces. Most studies focused on specific joint angles or joint moments of the lower limbs in one plane. Furthermore, only a few studies used automatic feature extraction and selection to train each ML model^36,37,38.

Therefore, the aim of this study was two-fold: (1) Extract features automatically and determine the most important features for the estimation of each target and (2) Compare the performance of four non-linear regression ML models (CNN, RF, SVM, and MARS) to estimate pelvis, hip, knee, and ankle joint angles, moments, and muscle forces in both intra-subject and inter-subject examinations. To this end, we employed a python package called Tsfresh³⁹ to extract features from EMGs and IMUs data and developed each ML model by using the most important features. Finally, each ML model’s performance was evaluated based on its prediction accuracy and computational time.

Methodology

The workflow to develop the ML models is represented in Fig. 1. The procedure of data collection, calculating targets from marker trajectories and ground reaction forces, extracting features from sensors’ data, and building ML models are explained in detail in the next paragraphs.

Data collection

Seventeen healthy adults (9F, 28 ± 5 years, 1.70 ± 0.08 m, 66 ± 10 kg) with no recently reported injuries voluntarily participated in this study. Inclusion criteria were adults aged 18 years and older, and exclusion criteria were previous lower limb surgery, joint pain, osteoarthritis, or any other form of arthritis that would alter gait and any injury to the lower limbs in the past six months prior to the data collection. Each participant signed an informed consent form prior to collecting data in accordance with the World Medical Association Declaration of Helsinki (1964, last updated in 2013) and was approved by the University of Auckland (New Zealand) human participant ethics committee (reference number 019911).

Participants were assessed in one session with at least one static, one squat, one squat jump, one heel raise, and sixteen over-ground walking trials with their self-selected speed. Each participant completed about ten gait cycles in each trial but only one gait cycle was used per trial in this study (the gait cycle that occurred over the force plates to calculate joint moments). The gait cycles used in the analysis were defined as the period of time from one heel strike of one foot to the next heel strike of the same foot. The steps in which the participant’s feet were outside the force plates were removed. After this initial data cleaning step, a different number of gait cycles remained for each participant (min = 8 and max = 24). The gait cycle duration for each participant varied based on their self-selected walking speed and step length. The minimum and maximum time for gait cycles were 0.75 and 1.25 s, respectively. In each trial, marker trajectories from a 12-camera optical motion capture system (Vicon Motion Systems Ltd., UK), ground reaction forces from three gound embedded force plates (Bertec, Columbus, Ohio), EMG (Mini-Wave, Italy), and IMUs (Vicon IMeasureU Ltd., NZ) were recorded. Twenty-seven reflective markers were placed on participants, as shown in Fig. 2, to determine the three-dimensional position and orientation of each body segment. Sixteen EMG surface electrodes were used to record lower limb muscles' activity on both legs (Gluteus maximus, Rectus femoris, Vastus lateralis, Biceps femoris, Semimembranosus, Medial gastrocnemius, Soleus, and Tibialis anterior). Three-dimensional acceleration and angular velocity were recorded from 7 IMUs attached to each segment of the lower limbs (one on the pelvis, one on each foot, shank, and thigh). With the exception of marker trajectories data, which was captured at a sampling frequency of 200 Hz, all data were recorded at 1 kHz. For each participant, maximum voluntary contraction (MVC) of lower limb muscles were also collected. During the MVC collections, the lab operator held the participant’s leg in a fixed position and asked them to move their leg with maximum effort to activate the group of muscles of interest. For each participant, the IMUs were taken off and repositioned in the middle of the session to account for the effect of small displacement in the IMUs’ position. Therefore half of the data captured was collected before and the other half after that extra step of removing and reattaching the IMUs on the participant’s skin. These trials were used randomly for training and testing for the intra-subject examination.

Data post-processing

All captured data was synchronized, and marker trajectories were reconstructed through Vicon Nexus software (Version 2.12). The MOtoNMS⁴⁰ (Matlab Motion data elaboration toolbox for neuromusculoskeletal applications) was used to filter 3D marker positions and force plate data (Butterworth 4th order, 10 Hz low pass filter) and rotate them according to the OpenSim coordinate system (X is perpendicular to the frontal plane pointing forward, Y is perpendicular to the transverse plane pointing upward, and Z is perpendicular to the sagittal plane pointing to the right). MOtoNMS was also used to process the EMG recording to determine muscles’ activations; (1) a zero-lag band-pass filter (4th order Butterworth 30–300 Hz), (2) full-wave rectification, and (3) a low pass filter (4th order Butterworth 4–10 Hz). Finally, the signals were normalized to the maximum value of EMG recorded during the MVC trials to scale underlying muscles' excitations as a number between 0 and 1.

An OpenSim model (gait2392)⁴¹ was scaled using the MAP-client workflow⁴² based on marker data using Principal Component Analysis to build a personalized musculoskeletal model for each participant. Pelvis, hips, knees, and ankles kinematics and kinetics were computed using the OpenSim inverse kinematics (IK) and inverse dynamics (ID) tools, respectively (version 3.3). The Calibrated EMG-Informed Neuromusculoskeletal Modelling (CEINMS)⁴³ toolbox was used to estimate muscle forces. To calibrate musculotendon units (MTUs), we used three walking, one heel raising, and one squat trial in the CEINMS calibration step to adjust musculotendon parameters like tendon slack length, optimal fiber length, and strength coefficient. The objective function for calibration was defined by minimizing the differences between the joint moments estimated by the EMG-driven model and those derived from inverse dynamics during multiple calibration trials. Once calibration was completed, all MTUs activation and forces were predicted using the EMG-assisted approach (hybrid mode) of CEINMS. Finally, IMU, EMG, joint angles, joint moments, and muscle forces data were down-sampled to 100 Hz to decrease the computational cost of feature extraction and ML models construction. Joint moments were normalized to each participant’s body weight.

Models’ development

The development process of each ML model to predict the targets (joint angles, joint moments, and muscle forces) from wearable sensors’ data is explained below. To increase the predictive power and facilitate the ML process, we identified all features from raw IMU (acceleration and angular velocity in three directions) and EMG data by using an open-source python package called Tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests)³⁹. In order to prepare the IMU and EMG data for Tsfresh, we put them into sequences of consecutive and overlapping windows, where a window is shifted across the data points to create smaller segments of time series signals per target value. In this study, the window size was one second, as recommended by Banos et al.⁴⁴.

Feature extraction and selection

The minimum and maximum number of gait cycles for participants were 8 and 24, respectively. The length of gait cycles was also different (between 0.75 and 1.25 s) based on participants’ self-selected walking speed and step length, as mentioned in 2.1. However, the total number of data points was 37,579 for all participants. Each participant’s data was split into training (70% of gait cycles) and testing (the remaining 30% of gait cycles) sets. All participants’ training data (26,298 data points) were used for feature extraction and selection procedure. Tsfresh extracted 788 features from each channel of IMU and EMG data. A total of 58 channels were available from all IMUs and EMGs; 42 channels from seven IMUs (each IMU had six components: triaxial gyroscope and triaxial acceleration data), and 16 channels of EMGs. From these 58 channels, 45,704 features were extracted. In order to increase the prediction power of ML models, we eliminated irrelevant features that were not providing information to predict our targets. Removing unnecessary features will also decrease computational cost and time, which is crucial for real-world applications. First, for each target, a primary selection step took place to retain features with non-zero variance (31,487 features remained). Then extra feature selection procedures were performed to find the most important features related to each target by removing non-significant features in predicting target values. Then, the rank of each feature was determined based on Gini Importance⁴⁵ for predicting each target using an RF regressor. Finally, the top ten features related to each target were selected. All top features were put together to build a super feature set, including 500 features (10 for each of the 50 targets). The final feature set was further reduced by removing repeated features to avoid redundancy, resulting in a total of 441 features.

Non-linear regression ML models

The most important features extracted by Tsfresh were used as inputs to the ML models (CNN, RF, SVM, and MARS) to predict the targets (joint kinematics, joint kinetics, and muscle forces) in over-ground walking. All ML models were multi-output, which predicted all targets simultaneously. Scikit-learn (a python library for ML) was used to set up the CNN, RF, and SVM models, while the MARS model was built using the py-earth python library. To optimize the performance of the models, the hyperparameters were tuned using the following approach. The data from all participants were divided into two sets: a training and validation set (80% of the data) and a testing set (20% of the data). We used a five-fold cross-validation on the first set to determine a combination of parameters that resulted in the lowest error. It involved splitting the data into five equally sized subsets, or "folds." The model was then trained on four of the folds and evaluated on the fifth fold. This process is repeated five times, with each fold being used as the evaluation set once. Finally, the testing set was used to evaluate the final performance of the model. To perform the hyperparameter search, the GridSearchCV method was utilized, which searches over the hyperparameters defined in the parameter grid. This approach ensures that the model's hyperparameters are optimized while minimizing the risk of overfitting, ultimately leading to a robust and accurate model. The tunned hyperparameters found for each ML are described below.

CNNs are a specialized type of NN model that has shown remarkable performance in various tasks, including gait time-series prediction. This study used a multi-output CNN model with five hidden layers to estimate joint kinematics, joint kinetics, and muscle forces. First, we used the StandardScaler function from the sklearn library for scaling features to ensure all variables are in the same range (between zero and one). It was also necessary to scale targets as we used a multi-output CNN model. Targets were scaled back to their original scale using the same scaler after predictions. Then, the model's architecture was defined with an input layer size of 441. Then two convolutional layers were added, each followed by a max pooling layer. Both convolutional layers had 256 filters with a kernel size of three and a “relu” activation function. The max-pooling layers had a pool size of two. These layers helped reduce the data's dimensionality and identify the most prominent features of the previous feature map. After the max-pooling layers, the data was flattened and passed through the output layer, which was a dense layer with a linear activation function. The number of units in the output layer was equal to the number of targets (50). The 'Adam' solver (with a learning rate of 0.01), a stochastic gradient-based optimizer, was used for weight optimization, and the loss function was “mean squared error”. The EarlyStopping function was used to monitor the validation loss and stop the training if the loss did not improve after five epochs. The batch size was set to 32, and the model was trained for a maximum of 100 epochs. Supplementary Fig. 1 represents the loss versus the number of epochs for training and validation. In this model, the optimal activation function (among 'relu', 'sigmoid', and 'tanh') in hidden layers, the optimizer (among 'adam', 'rmsprop', and 'sgd') and its learning rate (among ‘0.1’, ‘0.01’, and ‘0.001’), and the number of neurons (among 64, 128, and 256) in each convolutional layer were found through grid search.

RF is a flexible and easy to use ML model for regression. RF builds forest (ensemble of decision trees) trained with the bootstrap aggregating (bagging) method and outputs the average of prediction of individual decision trees⁴⁶. The RF model's tunned hyperparameters in this study were the number of trees (among: 100, 200, 300, 400, and 500), the maximum number of randomly selected variables in each tree (among 'auto', 'sqrt', and 'log2'), and the maximum depth of each tree (among 15, 20, 25, and 30). Based on the grid search, the final combination of hyperparameters that provided the lowest error was as follows: 500 for the number of trees (Increasing the number of trees can improve the model's performance, but it may also increase the computational complexity and training time which might not be ideal for using the model in real applications), ‘sqrt’ for the number of randomly selected features which means the root square of the number of inputs (in this study, this number was equal to 21 as the number of input variables was 441), and the maximum depth of 25 for each tree.

Another powerful supervised learning model for non-linear regression is SVM⁴⁷. In this model, a threshold (ε) is set by the user to control the maximum allowable error for the regression setting. When there is non-linearity in the dataset, a kernel function is used to map the input feature vectors to a higher dimensional feature space. As SVMs are sensitive to the scale of features, we performed feature scaling to improve this model’s performance. In this study, after hyperparameters tunning, we set \(\varepsilon =\) 0.01 (among 0.001, 0.01, 0.1, and 1), cost parameter \(C=\) 10 (among 0.1, 1, 10, and 100), and radial basis function ‘rbf’ as the kernel function (among 'linear', 'rbf', and 'sigmoid').

The last model developed in this study was a MARS which is well suited for high-dimensional problems. MARS is an extension of linear models by modeling non-linearities in target values. This model aggregates a set of simple linear functions' results to perform well in predicting any kind of target vector. MARS algorithm automatically discovers the number and type of basis functions to use. We set the number of input variables considered by each piecewise linear function (max_degree) to two (among 1, 2, and 3). The maximum number of basis functions (max_term) was 100 (among 100, 200, and 300).

Performance evaluation

Intra-subject examination

To investigate each ML model’s performance for predicting the targets for the same participant, the intra-subject examination was performed. In the intra-subject examination, we used 70% of one participant’s gait data to train the ML model, and we tested the model on the remaining 30% of the same participant’s data. This examination was done for all participants (creating 17 different models, 1 per participant).

Inter-subject examination

The inter-subject examination evaluated the ML models’ performance to predict targets for an unseen participant. Leave-one-out (LOO) cross-validation was performed to investigate ML models’ generalizability. A LOO analysis consists of splitting the training (N-1 participants) and testing (1 participant) dataset N times, with N = number of participants. Therefore we created 17 training/testing combinations of partcipants data to build 17 ML models. Each time 16 participants’ data were used for training the ML model, and the model was tested on the remaining participant’s data.

Performance metrics

To compare the performance of the ML models in the testing datasets, the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and coefficient of determination (\({R}^{2}\)) between the computed and predicted targets were calculated for each gait cycle and each participant for both intra and inter-subject examinations. In order to make a better interpretation of muscle forces errors for each muscle, we reported NRMSE (RMSE normalized to the range of data). The reported RMSEs, NRMSEs, and MAEs are the average of cross-validation for all participants. The \({R}^{2}\) values are presented in percentages and calculated for each target using predicted data from all participants to have a single value.

To identify the most computationally efficient model and investigate the effect of feature selection on the ML model’s performance, an additional examination was performed. All models were trained (on 16 randomly selected participants’ data) and tested (on the remaining participant’s data) twice. The first time, we used all features with non-zero variance (31,487 features), while the second time, we only utilized the selected features (441 features). For both cases, the prediction accuracy and computational time were recorded. This examination enabled us to identify the model that can be effectively applied in a practical setting across various systems and clinics.

Results

The most important features selected to predict each target are available in the Supplementary Table S1. The top features were extracted from the IMU signals for kinematics and kinetics prediction. In contrast, most of the top features were extracted from EMG signals for muscle forces prediction. According to Supplementary Table S2, the highest and lowest computational times for testing the models were related to SVM and CNN models respectively. The testing time for the CNN and RF models using only the selected features (441 features) took less than a second, while the testing time for the SVM was 945 s. Using all non-zero variance features (31,487 features) to train and test the models increased computational time, especially for SVM and MARS models. While the prediction accuracy improved slightly for the CNN model, the performance of other models worsened by using all non-zero variance features instead of the selected features (Supplementary Table S3).

The following results represent the performance of each ML model, CNN (pink), RF (blue), SVM (yellow), and MARS (green), for the prediction of intended targets (joint kinematics, joint kinetics, and muscle forces).

Joint kinematics

Although CNN, RF, and SVM models' performance were in the same range for most joint angles, the RF model provided the lowest RMSEs in all joints and planes of motion compared to other models for the intra-subject examinations (Fig. 3). The most accurate estimations for the inter-subject examination were provided by the RF and CNN models. The CNN model outputted the lowest prediction errors in pelvic tilt, hip rotation, and ankle inversion/eversion angles. The best performance for knee flexion/extension angle was related to the SVM model, and for the rest of the joint angles, RF provided the lowest error. The highest RMSE was related to MARS predictions in most joint angles in both intra and inter-subject examinations. The lowest joint angles RMSE in the RF model were related to pelvic obliquity (RMSE = \(0.74^\circ\) for intra-subject examination and \(2.95^\circ\) for inter-subject examinations), and the highest error was found for ankle inversion/eversion (RMSE = \(2.58^\circ\) for intra-subject examination and \(8.32^\circ\) for inter-subject examinations). The same trend can be seen by investigating other evaluation metrics, such as MAE and the \({R}^{2}\) values presented in the Supplementary Table S4.

Examples of the hip, knee, and ankle sagittal plane ROM during a gait cycle for the best participants (based on RF results) for the intra and inter-subject examinations are displayed in Fig. 4. The worst participant's results and other joint kinematics are shown in Supplementary Figs. S2 and S3 for one gait cycle. As can be seen, the RF model (dashed blue line) provided the best predictions by following OpenSim inverse kinematics output (solid grey line) better than other models.

Joint kinetics

The RF model consistently provided lower RMSE than other ML models in all joints moments’ predictions, analogously to the kinematics results (Fig. 5). The MARS model produced the maximum RMSE in most joints for the intra-subject examinations, with the worst predictions provided the by SVM and MARS models for the inter-subject examination. RF model’s RMSE in joint kinetics prediction ranged between 0.023 Nm/kg (hip rotation moment) and 0.191 Nm/kg (pelvic tilt moment) in the intra-subject examination. The minimum and maximum joint kinetics RMSE were 0.047 Nm/kg (hip rotation moment) and 0.269 Nm/kg (pelvic tilt moment) in the inter-subject examination for the RF model. MAE and \({R}^{2}\) values are presented in Supplementary Table S5 for joint kinetics predictions by all models.

The best participant predictions for all ML models for one gait cycle for ankle, knee, and hip moments in the sagittal plane are displayed in Fig. 6. The worst participant predictions and other joints kinetics are presented in Supplementary Figs. S4 and S5 for a gait cycle. The RF model (dashed blue line) provided better predictions of the OpenSim inverse dynamics outputs (solid grey line) compared to the other models.

Muscle forces

To predict muscle forces, the RF and CNN models displayed the lowest NRMSE between CEINMS outputs and prediction output from ML models, while the highest NRMSEs came from the MARS model (Fig. 7). In the inter-subject examination, the CNN model outperformed the RF model for the tibialis anterior and gastrocnemius muscles prediction. The maximum average NRMSE value occurred when predicting the semitendinosus muscle force (NRMSE of 14.1%) for the intra-subject examination and biceps femoris short head muscle (NRMSE of 36.2%) in the inter-subject examination. The average RMSE for the biceps femoris long head muscle force prediction was the lowest among all muscles for all ML models in both intra-subject (NRMSE of 2.6%) and inter-subject (NRMSE of 4.5%) examinations. MAE and \({R}^{2}\) values between models’ predictions and CEINMS output are presented in Supplementary Table S6 for the muscle forces predictions.

Muscle forces predictions by all models can be seen in Fig. 8 for the soleus (ankle plantar flexor muscle), semitendinosus (knee flexor muscle), and rectus femoris (hip flexor muscle) across one gait cycle for the best participants in each examination. Figures demonstrating the worst participant's results and other muscle forces are available in Supplementary Figs. S6 and S7. The best match between the ML models’ prediction and CEINMS output came from the RF model for both intra-subject and inter-subject examinations.

Discussion

This study aimed to compare the performance of ML models for the prediction of important lower-limb gait time series (joint kinematics, joint kinetics, and muscle forces) from wearable sensors’ data with the aid of automatic feature extraction. The first objective of this study was to extract features automatically and determine the most important features for the estimation of each target. To extract all possible features from raw EMG and IMU data, we used a python package called Tsfresh³⁹. Tsfresh’s ability to extract a high number of features and determine their significance makes it more suitable than manual feature extraction methods. Furthermore, the most important features that are essential for predicting a particular target might be neglected when extracted manually⁴⁸. As a result, the top features to predict joint kinematics and kinetics were extracted from the IMU data, and the top features for most of the muscle forces were extracted from EMG data. These results were predictable, as the joint angles are closely related to the angular velocity (gyroscope data), joint moments are associated with linear acceleration (accelerometers data) and angular velocity, and EMG data are correlated with muscle activation and, therefore, muscle forces. Interestingly, the most important features for some targets appeared unrelated. For example, we found that a feature extracted from the z-axis of gyro data from the thigh sensor was the most informative feature for predicting hip flexion/extension angle and hip abduction/adduction. It may suggest that the sensors' axes may not be perfectly aligned with the joint axes of rotation in the body.

Despite the parallelization of the extraction and selection tools in tsfresh, the memory consumption of parallel calculations can be high. Tasks with a high number of processes may be limited to machines with low memory. Therefore, reducing the number of features enable the use of our workflow on any system. Although feature selection can extend the time required to train models, it can significantly reduce the time needed for model’s inference. Our experiments showed that including all non-zero variance features could increase testing time for all models. This is especially important for real-world clinical applications, where efficient models with lower computational costs are essential. However, our findings also suggest that including all non-zero variance features does not necessarily improve model performance. In fact, using all features can actually worsen the performance of the RF, SVM, and MARS models by including many unrelated features. While the use of all non-zero variance features slightly improved the performance of the CNN model, the improvements were not substantial. Therefore, careful feature selection is important for developing accurate and efficient models. The second objective of this study was to compare the performance of four non-linear regression ML models (CNN, SVM, RF, and MARS) to estimate pelvis, hip, knee, and ankle joint angles, moments, and muscle forces in both intra-subject and inter-subject examinations. The ML models’ performance were compared based on their resulting RMSE, MAE, and R² against the OpenSim and CEINMS output (used here as ground truth). The computed OpenSim joint angles and moments waveforms found in this study were similar to the literature^49,50. Muscle forces computation were validated by comparing them to the experimental EMG recordings.

We found that the RF and CNN models performed best in predicting joint kinematics and muscle forces for the intra and inter-subject examinations, as they provided the lowest prediction errors and computational time to be trained and tested. The SVM model also provided prediction errors in the range of the RF and CNN models in some joint angles and muscle forces; however, its high inferring time makes it inappropriate for some applications. The RF model provided the best joint moments’ prediction results in the intra and inter-subject examinations. Based on the figures representing the models’ predictions for one gait cycle, the RF models provided smoother outputs, in addition to having lower prediction errors compared to other models. The RF algorithm is less prone to overfitting than other models⁴⁶, which might explain its higher performance. Moreover, RF is a tree-based model and naturally ranks features by how well they improve the model’s performance and only uses the most important features to build trees. The good performance of the CNN models is due to their ability to automatically recognize relevant features, learn spatially correlated features, and create hierarchical representations of the input data. The lowest inferring time was related to the CNN model (0.23 s), making it suitable for real-time prediction by leveraging the power of parallel processing. Regardless of the ML model’s type, the level of prediction accuracy decreased (lower R² and higher RMSE and MAE) for the inter-subject examination (when the training dataset did not include any of the testing subjects' trials). This can be partially explained by the fact that individuals' joint motion characteristics are distinct⁵¹. By including more participants' data for training the ML models, better predictions would be expected. Most outliers in Figs. 3 and 5 are related to two specific participants for whom the models provided poor estimations. This might be because of their particular walking patterns compared to other participants in inter-subject examination or walking unnaturally with diverse gait patterns in different trials in the intra-subject examination due to lab’s constraints.

To avoid the limitations of the traditional OMC systems, like the need for expensive equipment in a controlled environment and time-consuming data processing, previous studies have developed different algorithms to estimate joint kinematics from IMUs^{51,52,53,54,55,56,57,58,59,60,61,62}. One of these algorithms used filtering approaches to cope with IMU sensor noise and integration drift^{51,59,60,61,62}. While these algorithms succeeded in reproducing a similar joint angle waveform, the offset between IMU results and OMC systems is considered relatively high compared to our results. The RMSE ranging from \(5^\circ\) to \(10.14^\circ\) in the hip joint angle in the sagittal plane was previously reported^{52,53,55,56,57,58,60,62}, while the present study achieved an RMSE of \(1.38^\circ\) and \(4.79^\circ\) for intra and inter-subject examinations, respectively. Our model produced lower RMSEs in knee joint flexion/extension (\(1.85^\circ\) and \(5.46^\circ\) for intra and inter-subject examinations, respectively), compared to other studies with reported RMSE between \(4.1^\circ\) to \(11.22^\circ\)^{52,53,55,56,57,58,62}. The accuracy of our model for ankle joint dorsi/plantarflexion angle prediction (2.14 \(^\circ\) and \(6.52^\circ\) for intra and inter-subject examinations) was comparable to previous studies with an RMSE of \(1.9^\circ\) to \(9.75^\circ\)^{52,53,55,57,62}. Other research groups achieved good accuracy by combining wearable sensors’ data with ML techniques for joint kinematics prediction^{11,12,13,14,15,16,17,19}. The better performance of this approach (IMUs + ML model) provided low estimation errors in previous studies, especially in the intra-subject examinations with an RMSE ranging from \(1.72^\circ\) to \(3.58^\circ\) in hip flexion/extension^11,14,15, from \(2.21^\circ\) to \(3.96^\circ\) in knee flexion/extension^11,12,14,15 and from \(1.81^\circ\) to \(3.58^\circ\) in ankle dorsi/plantarflexion angle^11,12,14,15. The performance of our RF model in the intra-subject examination was better than previous studies in the hip (\(1.38^\circ\)) and knee (\(1.85^\circ\)) and was in the range of these studies for ankle angle (\(2.14^\circ\)) in the sagittal plane. The higher prediction error for inter-subject examination is provided in some of the previous studies with hip flexion/extension angle RMSE ranging from \(5.37^\circ\) to \(8.85^\circ\)^14,21,22 (4.79\(^\circ\) in the present study) and knee flexion/extension angle RMSE of \(5.6^\circ\) to \(7.41^\circ\)^14,22 (\(5.46^\circ\) in the present study). However, the prediction error for the ankle dorsi/plantar flexion angle was lower in the previous studies with RMSE of \(4.6^\circ\) to \(5.5^\circ\)^14,21,22 (\(6.53^\circ\) in the present study). In another study¹⁹, only the average RMSE of \(7^\circ\) for all joint angles is reported, which is a higher prediction error compared to our results (\(5.31^\circ\)). Ren et al.¹⁷ developed five different ML models to predict hip, knee, and ankle joint angles in the sagittal plane and achieved MAE = \(4.6^\circ\), \(7.38^\circ\), and \(4.74^\circ\), respectively, by using the RF model. They illustrated that the RF model outperforms other ML models (SVR, NN, multiple linear regression (MLR), and eXtreme gradient boosting (XGboost)) for joint kinematics prediction. The RF model in the current study carried out lower error than their model in hip and knee angles prediction by having MAE = \(4.2^\circ\) and \(4.59^\circ\), respectively, while we had higher errors than Ren et al.¹⁷ in predicting ankle dorsi/plantar flexion angle (MAE of \(5.28^\circ\)) for inter-subject examination. Long short-term memory neural network models were used in a recent study²⁰ to estimate hip and knee joint angles in all planes of motion. The authors developed their models by using both measured and synthetic IMU data. When they used measured IMU data to train their models, RMSEs of \(7.2^\circ\) for hip flexion/extension, \(2.1^\circ\) for hip adduction/abduction, \(4.2^\circ\) for hip rotation, and \(2.9^\circ\) for knee flexion/extension angles were achieved. Their model outperformed our RF and CNN models in all of their targets except for hip flexion/extension (\(4.79^\circ\) in our study). However, they didn’t perform any kind of cross validation to investigate the generalizability of their model. By changing training and testing datasets, different results may be found. Accurate results were achieved in another study¹⁶, in which 70 participants’ data were used to examine the model. They reported the MAE of 3.73, 5.41, and 3.58 for hip, knee, and ankle angles in the sagittal plane, respectively. However, we had more accurate estimation for knee flexion/extension angle (MAE = 4.59\(^\circ\)). While most of the previous studies concentrated on joint range of motion in the sagittal plane, our study additionally included the pelvis in all planes, hip int/ext rotation and abd/add and ankle inv/eversion.

Fewer studies were conducted to investigate the performance of different ML models for joint kinetics estimation^12,23,24,25 compared to joint kinematics. All previous studies would focus on specific lower-limb joint kinetics (e.g. knee and ankle moments in the sagittal plane¹², knee adduction/abduction moment²³, medial and lateral knee contact forces²⁴, knee flexion/extension and adduction/abduction moments²⁵), while the present study investigated the prediction accuracy for the pelvis (in three planes of motion), hip (in three planes of motion), knee (in the sagittal plane) and ankle (in sagittal and frontal planes) joint moments during gait. The RF model presented in our study achieved an RMSE of 0.066 Nm/kg for ankle moment prediction in the intra-subject examination, which is more accurate compared to previous studies with an RMSE of 0.119 Nm/kg¹². For the knee flexion/extension moment, we had lower accuracy in intra-subject examination compared to other studies (RMSE of 0.089 versus RMSE ranging from 0.042 to 0.068 Nm/kg^12,23). However, our model outperformed another study in inter-subject examination (RMSE of 0.187 versus 0.27 Nm/kg)²⁵. Higher prediction errors (BWBH%: Nm/bodyweight.bodyheight) compared to our results are reported²¹ for hip (1.78 BWBH%), knee (1.28 BWBH%), and ankle joint moments (1.39 BWBH%) in the sagittal plane. While we achieved %BWBH of 1.23 for the hip, 1.16 for the knee, and 1.08 for ankle joint moments.

To the best of our knowledge, there is no other study using wearable sensors’ data to estimate muscle forces. Ardestani et al.⁶³ used an NN model to estimate muscle activations from EMG signals. They used muscle activations in a forward dynamic model to estimate lower-limb joint moments, but unfortunately, they didn’t report any prediction error for muscle activation or forces. In another study⁶⁴, a Gaussian Mixture Regressor was employed to estimate muscle kinematics (fiber elongations and moment arms) and muscle activations from IMUs. They reported NRMSE lower than 30% of muscle activation for all muscles. In the present study, the lowest muscle forces prediction errors were associated with the biceps femoris long head muscle (NRMSE of 2.6% in intra and 4.5% inter-subject examinations). The highest NRMSEs were related to the semitendinosus muscle (NRMSE of 14.1%) in the intra-subject examination and biceps femoris short head muscle (NRMSE of 36.2%) in the inter-subject examination. A higher offset between actual and predicted values for muscle forces prediction compared to joint kinematics and kinetics can be seen in the figures depicting actual and predicted values. The lower accuracy in the estimation of muscle forces compared to other targets (joint kinematics and kinetics) was predictable. Data showed different muscle recruitment during walking between individuals and even between trials for the same participant, leading to less consistency in muscle forces across the population. Overall, compared to previous research, we predicted more targets at the same time with a multi-output RF model and achieved prediction errors within the range of what is reported in the literature.

Despite the number of participants (17 total), our RF model resulted in low prediction errors (comparable to the literature) in joints kinematics and kinetics estimation. We will investigate if increasing the number of participants to include a variety of gait profiles and self-selected speed provides more accurate estimations, specifically for muscle forces prediction. One limitation of the current study is the use of a multi-output RF model to predict many targets at the same time. A multi-output model helps us to improve the management of a high number of targets and allows us to decrease computational cost and monitor all intended targets simultaneously in real time. However, it may result in lower prediction accuracy by feeding many unrelated features to the model for some targets. The differences between the performance of a multi-output and a single-output model for the prediction of specific gait time series should be explored in a future study. Using separated single-output ML models would be more efficient in case we want to monitor a specific target.

The personalised musculoskeletal model for each participant was built using the gait 2392 OpenSim model, which lacks the degrees of freedom on the knee adduction/abduction, knee rotation, and ankle int/ext rotation, which does not allow us to study other planes of motion at the knee and ankle. The other limitation of this study was using an RF regressor to determine the rank of each feature. This can be a bias in favor of the RF model when comparing it with other ML models. However, we’ve shown that using selected features instead of all features is more efficient in the case of computational time, and its’ effect on the prediction accuracy of the CNN model is negligible. A further point to highlight is that we cannot guarantee that estimations using data from other labs will be as accurate as our own; this is largely due to differences in the equipment and sensors used. However, incorporating data from multiple labs into the training data set for our models can improve the models’ generalizability.

Although the findings of this study are very promising to benefit the community, more research is required to investigate the optimum number of IMUs needed to achieve these results. Looking back at the top features, it appears that some IMU data are not needed for 3D gait analysis. The optimal number and combination of IMUs can eliminate the need for seven sensors reducing data processing time and sensor cost. Reducing the number of sensors on the subjects’ bodies will also facilitate workflow implementation in the real world.

Conclusion

This study showed that a combination of wearable sensors and ML techniques is an accurate and promising approach for improving traditional methods of gait time-series prediction. We also demonstrated that the higher performance of the RF and CNN models compared to other ML models make them more appropriate for predicting the lower limb’s joint kinematics, kinetics, and muscle forces, especially in the intra-subject prediction. Successful implementation of an intra-subject model enables us to remotely monitor changes in patients’ gait outside the clinic. While by having a precise inter-subject model, a gait analysis will be possible where an optical motion capture system is not available.

Data availability

The post-processed data (joint kinematics, joint kinetics, and muscle forces) along with the raw IMU and EMG data used in this study to build machine learning models are available on the open-source platform SimTK.org (https://simtk.org/projects/ml_sensors).

References

Renggli, D. et al. Wearable inertial measurement units for assessing gait in real-world environments. J. Front. Physiol. 11, 90. https://doi.org/10.3389/fphys.2020.00090 (2020).
Article Google Scholar
Takayanagi, N. et al. Relationship between daily and in-laboratory gait speed among healthy community-dwelling older adults. J. Sci. Rep. 9, 1–6. https://doi.org/10.1038/s41598-019-39695-0 (2019).
Article CAS Google Scholar
Castillo, P., Lozano, R. & Dzul, A.E. Sensors, modems and microcontrollers for UAVs. In: Modelling and Control of Mini-Flying Machines. Advances in Industrial Control. (Springer, London, 2005) https://doi.org/10.1007/1-84628-179-2_9.
Basmajian, J. V. Muscles alive. Their functions revealed by electromyography. J. Acad. Med. 37, 802 (1962).
Google Scholar
Sartori, M., Lloyd, D. G., Besier, T., Fernandez, J. & Farina, D. Electromyography-driven modeling for simulating subject-specific movement at the neuromusculoskeletal level. J. Surf. Electromyogr. Physiol. Eng. Appl. 78, 247–272 (2016).
Article Google Scholar
Iosa, M., Picerno, P., Paolucci, S. & Morone, G. Wearable inertial sensors for human movement analysis. J. Expert Rev. Med. Devices 13, 641–659. https://doi.org/10.1080/17434440.2016.1198694 (2016).
Article CAS Google Scholar
De Luca, C. J. The use of surface electromyography in biomechanics. J. Appl. Biomech. 13, 135–163. https://doi.org/10.1123/jab.13.2.135 (1997).
Article Google Scholar
Sartori, M., Farina, D. & Lloyd, D. G. Hybrid neuromusculoskeletal modeling to best track joint moments using a balance between muscle excitations derived from electromyograms and optimization. J. Biomech. 47, 3613–3621. https://doi.org/10.1016/j.jbiomech.2014.10.009 (2014).
Article PubMed Google Scholar
Argent, R., Drummond, S., Remus, A., O’Reilly, M. & Caulfield, B. Evaluating the use of machine learning in the assessment of joint angle using a single inertial sensor. J. Rehabil. Assist. Technol. Eng. 6, 2055668319868544. https://doi.org/10.1177/2055668319868544 (2019).
Article PubMed PubMed Central Google Scholar
Błażkiewicz, M. & Wit, A. Artificial neural network simulation of lower limb joint angles in normal and impaired human gait. J. Acta Bioeng. Biomech. https://doi.org/10.5277/ABB-01129-2018-02 (2018).
Article Google Scholar
Chen, J., Zhang, X., Cheng, Y. & Xi, N. Surface EMG based continuous estimation of human lower limb joint angles by using deep belief networks. J. Biomed. Signal Process. Control. 40, 335–342. https://doi.org/10.1016/j.bspc.2017.10.002 (2018).
Article Google Scholar
Dey, S., Yoshida, T., Ernst, M., Schmalz, T., & Schilling, A.F. A random forest approach for continuous prediction of joint angles and moments during walking: An implication for controlling active knee-ankle prostheses/orthoses. In 2019 IEEE International conference on Cyborg and bionic systems (CBS). IEEE (2019).
Farmer, S., Silver-Thorn, B., Voglewede, P. & Beardsley, S. A. Within-socket myoelectric prediction of continuous ankle kinematics for control of a powered transtibial prosthesis. J. Neural Eng. 11, 056027. https://doi.org/10.1088/1741-2560/11/5/056027 (2014).
Article ADS PubMed Google Scholar
Findlow, A., Goulermas, J., Nester, C., Howard, D. & Kenney, L. Predicting lower limb joint kinematics using wearable motion sensors. J. Gait Posture 28, 120–126. https://doi.org/10.1016/j.gaitpost.2007.11.001 (2008).
Article CAS Google Scholar
Goulermas, J., Howard, D., Nester, C., Jones, R., & Ren, L., Regression techniques for the prediction of lower limb kinematics. J. (2005).
Luu, T. P., Low, K., Qu, X., Lim, H. & Hoon, K. An individual-specific gait pattern prediction model based on generalized regression neural networks. J. Gait Posture 39, 443–448. https://doi.org/10.1016/j.gaitpost.2013.08.028 (2014).
Article Google Scholar
Ren, S. et al. Personalized gait trajectory generation based on anthropometric features using random forest. J. Ambient Intell. Humaniz. Comput. https://doi.org/10.1007/s12652-019-01390-3 (2019).
Article Google Scholar
Sivakumar, S., Gopalai, A. A., Lim, K. H. & Gouwanda, D. Artificial neural network based ankle joint angle estimation using instrumented foot insoles. J. Biomed. Signal Process. Control 54, 101614. https://doi.org/10.1016/j.bspc.2019.101614 (2019).
Article Google Scholar
Wouda, F. J., Giuberti, M., Bellusci, G. & Veltink, P. H. Estimation of full-body poses using only five inertial sensors: An eager or lazy learning approach?. J. Sens. 16, 2138. https://doi.org/10.3390/s16122138 (2016).
Article ADS Google Scholar
Sharifi Renani, M., Eustace, A. M., Myers, C. A. & Clary, C. W. The use of synthetic imu signals in the training of deep learning models significantly improves the accuracy of joint kinematic predictions. J. Sens. 21, 5876 (2021).
Article ADS Google Scholar
Dorschky, E. et al. CNN-based estimation of sagittal plane walking and running biomechanics from measured and simulated inertial sensor data. J. Front. Bioeng. Biotechnol. 8, 604 (2020).
Article Google Scholar
Gholami, M., Napier, C. & Menon, C. Estimating lower extremity running gait kinematics with a single accelerometer: A deep learning approach. J. Sens. 20, 2939 (2020).
Article ADS Google Scholar
Aljaaf, A.J., Hussain, A.J., Fergus, P., Przybyla, A., & Barton, G.J. Evaluation of machine learning methods to predict knee loading from the movement of body segments. In 2016 International Joint Conference On Neural Networks (IJCNN). IEEE (2016).
Giarmatzis, G., Zacharaki, E. I. & Moustakas, K. Real-time prediction of joint forces by motion capture and machine learning. J. Sens. 20, 6933. https://doi.org/10.3390/s20236933 (2020).
Article ADS Google Scholar
Stetter, B. J., Krafft, F. C., Ringhof, S., Stein, T. & Sell, S. A machine learning and wearable sensor based approach to estimate external knee flexion and adduction moments during various locomotion tasks. J. Front. Bioeng. Biotechnol. https://doi.org/10.3389/fbioe.2020.00009 (2020).
Article Google Scholar
Lai, D.T., Shilton, A., Charry, E., Begg, R., & Palaniswami, M. A machine learning approach to k-step look-ahead prediction of gait variables from acceleration data. In 2009 annual international conference of the IEEE engineering in medicine and biology society. IEEE (2009).
Luu, T.P., Lim, H.B., Hoon, K.H., Qu, X., & Low, K. Subject-specific gait parameters prediction for robotic gait rehabilitation via generalized regression neural network. In 2011 IEEE International Conference On Robotics And Biomimetics. IEEE (2011).
Sandhu, K., Kamboj, V.K. Role of artificial neural network for prediction of gait parameters and patterns, In AI techniques for reliability prediction for electronic components, IGI Global. 124–135 (2020).
Santhiranayagam, B.K., Lai, D., Shilton, A., Begg, R., & Palaniswami, M. Regression models for estimating gait parameters using inertial sensors. In 2011 Seventh International Conference On Intelligent Sensors, Sensor Networks And Information Processing. IEEE (2011).
Zhang, H., Guo, Y. & Zanotto, D. Accurate ambulatory gait analysis in walking and running using machine learning models. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 191–202. https://doi.org/10.1109/TNSRE.2019.2958679 (2019).
Article PubMed Google Scholar
Ferreira, J. P., Vieira, A., Ferreira, P., Crisostomo, M. & Coimbra, A. P. Human knee joint walking pattern generation using computational intelligence techniques. J. Neural Comput. Appl. 30, 1701–1713. https://doi.org/10.1007/s00521-018-3458-5 (2018).
Article Google Scholar
Mundt, M. et al. Prediction of lower limb joint angles and moments during gait using artificial neural networks. J. Med. Biol. Eng. Comput. 58, 211–225. https://doi.org/10.1007/s11517-019-02061-3 (2020).
Article Google Scholar
Mundt, M. et al. A comparison of three neural network approaches for estimating joint angles and moments from inertial measurement units. J. Sens. 21, 4535 (2021).
Article ADS Google Scholar
Sharifi Renani, M. et al. Deep learning in gait parameter prediction for oa and tka patients wearing imu sensors. J. Sens. 20, 5553 (2020).
Article ADS Google Scholar
Wang, L. & Buchanan, T. S. Prediction of joint moments using a neural network model of muscle activations from EMG signals. IEEE Trans. Neural Syst. Rehabil. Eng. 10, 30–37. https://doi.org/10.1109/TNSRE.2002.1021584 (2002).
Article PubMed Google Scholar
Bolam, S. M. et al. Remote patient monitoring with wearable sensors following knee arthroplasty. J. Sens. 21, 5143. https://doi.org/10.3390/s21155143 (2021).
Article ADS Google Scholar
Dindorf, C., Teufl, W., Taetz, B., Bleser, G. & Fröhlich, M. Interpretability of input representations for gait classification in patients after total hip arthroplasty. J. Sensors 20, 4385. https://doi.org/10.3390/s20164385 (2020).
Article ADS Google Scholar
Gholami, M., Ejupi, A., Rezaei, A., Ferrone, A., & Menon, C. Estimation of knee joint angle using a fabric-based strain sensor and machine learning: A preliminary investigation. In 2018 7th IEEE International Conference On Biomedical Robotics And Biomechatronics (Biorob). IEEE (2018).
Christ, M., Braun, N., Neuffer, J. & Kempa-Liehr, A. W. Time series feature extraction on basis of scalable hypothesis tests (tsfresh–a python package). J. Neurocomput. 307, 72–77. https://doi.org/10.1016/j.neucom.2018.03.067 (2018).
Article Google Scholar
Mantoan, A. et al. MOtoNMS: A MATLAB toolbox to process motion data for neuromusculoskeletal modeling and simulation. J. Source Code Biol. Med. 10, 1–14. https://doi.org/10.1186/s13029-015-0044-4 (2015).
Article Google Scholar
Delp, S. L. et al. OpenSim: Open-source software to create and analyze dynamic simulations of movement. J. IEEE Trans. Biomed. Eng. 54, 1940–1950. https://doi.org/10.1109/TBME.2007.901024 (2007).
Article Google Scholar
Zhang, J., et al. (2014) The MAP client: User-friendly musculoskeletal modelling workflows. In International Symposium On Biomedical Simulation. Springer.
Pizzolato, C. et al. CEINMS: A toolbox to investigate the influence of different neural control solutions on the prediction of muscle excitation and joint moments during dynamic motor tasks. J. Biomech. 48, 3929–3936. https://doi.org/10.1016/j.jbiomech.2015.09.021 (2015).
Article PubMed PubMed Central Google Scholar
Banos, O., Galvez, J.-M., Damas, M., Pomares, H. & Rojas, I. Window size impact in human activity recognition. J. Sens. 14, 6474–6499. https://doi.org/10.3390/s140406474 (2014).
Article ADS Google Scholar
Nembrini, S., König, I. R. & Wright, M. N. The revival of the Gini importance. J. Bioinform. 34, 3711–3718. https://doi.org/10.1093/bioinformatics/bty373 (2018).
Article CAS Google Scholar
Breiman, L. Random forests. J. Mach. Learn. 45, 5–32. https://doi.org/10.1023/a:1010933404324 (2001).
Article MATH Google Scholar
Drucker, H., Burges, C. J., Kaufman, L., Smola, A. & Vapnik, V. Support vector regression machines. J. Adv. Neural Inform. Process. Syst. 9, 155–161 (1997).
Google Scholar
Dehzangi, O., Taherisadr, M. & ChangalVala, R. IMU-based gait recognition using convolutional neural networks and multi-sensor fusion. J. Sens. 17, 2735. https://doi.org/10.3390/s17122735 (2017).
Article ADS Google Scholar
Fukuchi, C. A., Fukuchi, R. K. & Duarte, M. A public dataset of overground and treadmill walking kinematics and kinetics in healthy individuals. J. PeerJ 6, e4640 (2018).
Article Google Scholar
Yu, P. et al. Morphology-related foot function analysis: Implications for jumping and running. J. Appl. Sci. 9, 3236 (2019).
Article Google Scholar
Horst, F., Lapuschkin, S., Samek, W., Müller, K.-R. & Schöllhorn, W. I. Explaining the unique nature of individual gait patterns with deep learning. J. Sci. Rep. 9, 1–13. https://doi.org/10.1038/s41598-019-38748-8 (2019).
Article CAS Google Scholar
Dorschky, E., Nitschke, M., Seifer, A.-K., van den Bogert, A. J. & Eskofier, B. M. Estimation of gait kinematics and kinetics from inertial sensor data using optimal control of musculoskeletal models. J. Biomech. 95, 109278. https://doi.org/10.1016/j.jbiomech.2019.07.022 (2019).
Article PubMed Google Scholar
Karatsidis, A., et al., Predicting kinetics using musculoskeletal modeling and inertial motion capture. arXiv preprint arXiv:1801.01668. https://doi.org/10.48550/arXiv.1801.01668 (2018).
Moon, K. S., Lee, S. Q., Ozturk, Y., Gaidhani, A. & Cox, J. A. Identification of gait motion patterns using wearable inertial sensor network. J. Sens. 19, 5024. https://doi.org/10.3390/s19225024 (2019).
Article ADS Google Scholar
Nüesch, C., Roos, E., Pagenstert, G. & Mündermann, A. Measuring joint kinematics of treadmill walking and running: Comparison between an inertial sensor based system and a camera-based system. J. Biomech. 57, 32–38 (2017).
Article PubMed Google Scholar
Ohtaki, Y., Sagawa, K. & Inooka, H. A method for gait analysis in a daily living environment by body-mounted instruments. JSME Int. J. Series C Mech. Syst. Mach. Elem. Manuf. 44, 1125–1132. https://doi.org/10.1016/j.jbiomech.2017.03.015 (2001).
Article ADS Google Scholar
Tadano, S., Takeda, R. & Miyagawa, H. Three dimensional gait analysis using wearable acceleration and gyro sensors based on quaternion calculations. J. Sens. 13, 9321–9343 (2013).
Article ADS Google Scholar
Takeda, R., Tadano, S., Natorigawa, A., Todoh, M. & Yoshinari, S. Gait posture estimation using wearable acceleration and gyro sensors. J. Biomech. 42, 2486–2494. https://doi.org/10.3390/s130709321 (2009).
Article PubMed Google Scholar
Cikajlo, I., Matjačić, Z. & Bajd, T. Efficient FES triggering applying Kalman filter during sensory supported treadmill walking. J. Med. Eng. Technol. 32, 133–144. https://doi.org/10.1080/03091900601029627 (2008).
Article CAS PubMed Google Scholar
Dong, L., Wu, J. & Bao, X. A Hybrid HMM/Kalman filter for tracking hip angle in gait cycle. J. IEICE Trans. Inform. Syst. 89, 2319–2323. https://doi.org/10.1093/ietisy/e89-d.7.2319 (2006).
Article ADS Google Scholar
Sabatini, A. M. Quaternion-based extended Kalman filter for determining orientation by inertial and magnetic sensing. J. IEEE Trans. Biomed. Eng. 53, 1346–1356. https://doi.org/10.1109/TBME.2006.875664 (2006).
Article Google Scholar
Agostini, V., Gastaldi, L., Rosso, V., Knaflitz, M. & Tadano, S. A wearable magneto-inertial system for gait analysis (H-Gait): Validation on normal weight and overweight/obese young healthy adults. J. Sens. 17, 2406. https://doi.org/10.3390/s17102406 (2017).
Article ADS Google Scholar
Ardestani, M. M. et al. Human lower extremity joint moment prediction: A wavelet neural network approach. J. Expert Syst. Appl. 41, 4422–4433. https://doi.org/10.1016/j.eswa.2013.11.003 (2014).
Article Google Scholar
Cimolato, A., et al. Hybrid machine learning-neuromusculoskeletal modeling for control of lower limb prosthetics. In 2020 8th IEEE RAS/EMBS International Conference For Biomedical Robotics And Biomechatronics (BioRob). IEEE (2020).

Download references

Acknowledgements

We would like to thank the participants for contributing to this study. We also want to acknowledge the Health Research Council NZ, the Aotearoa foundation and the Science for Technological Innovation NZ for funding this study.

Author information

Authors and Affiliations

Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand
Shima Mohammadi Moghadam, Ted Yeung & Julie Choisne

Authors

Shima Mohammadi Moghadam
View author publications
You can also search for this author in PubMed Google Scholar
Ted Yeung
View author publications
You can also search for this author in PubMed Google Scholar
Julie Choisne
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.Y. and J.C. conceived and designed the study. S.M.M. developed and analysed ML models and prepared the original draft. T.Y. collected participants’ data. J.C. and T.Y. supervised the study and performed funding acquisition. S.M.M. and J.C. wrote and revised the manuscript. All authors reviewed and approved the manuscript prior to submission.

Corresponding author

Correspondence to Julie Choisne.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moghadam, S.M., Yeung, T. & Choisne, J. A comparison of machine learning models’ accuracy in predicting lower-limb joints’ kinematics, kinetics, and muscle forces from wearable sensors. Sci Rep 13, 5046 (2023). https://doi.org/10.1038/s41598-023-31906-z

Download citation

Received: 20 September 2022
Accepted: 20 March 2023
Published: 28 March 2023
DOI: https://doi.org/10.1038/s41598-023-31906-z

This article is cited by

On the prediction of tibiofemoral contact forces for healthy individuals and osteoarthritis patients during gait: a comparative study of regression methods
- Felipe Arruda Moura
- Alexandre R. M. Pelegrinelli
- Ricardo da Silva Torres
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.