Background & Summary

Physical rehabilitation requires continuous monitoring to achieve a personalized exercise program that is constantly adapted to the patient’s individual needs. Such monitoring has two main advantages. First, it provides the medical team with information that can be used to quantify the improvement achieved with a specific physical therapy program. Second, it can be used to adapt online training programs to the individual needs of the patient while reducing the need to attend the clinic, thereby reducing the cost of healthcare. To establish such monitoring, several tele-health strategies have been proposed. Research on optical and inertial sensing devices, together with recent advances in deep learning, has given rise to different technologies that can be used for human body tracking. The use of traditional video, acquired with a single camera, and inertial measurement units (IMUs), combined with the rapid advances in data science, forms an ideal scenario for blending both methods to optimize tele-rehabilitation programs.

Whilst the quantitative assessment of patient movement is typically carried out in a laboratory with accurate high-end systems1, the assessment of patients outside the laboratory can provide more informative measurements of their functional ability in activities of daily living2. The lab-based method for quantifying human body movement consists of a multi-optoelectronic configuration of several infrared cameras (Vicon, Qualisys, OptiTrack), which are used for precise tracking of joint and bone markers previously placed by an expert on the patient’s body. However, the use of these systems in telemedicine approaches is unfeasible for several reasons, such as cost, size, configuration time, and complexity of use. It has been shown that rehabilitation in a natural environment is more effective for motor restoration than clinic-based programs3. Consequently, the use of technology in the patient’s natural environment for quantitative movement tracking is crucial to meet the needs of a home-based rehabilitation program. Recent advances in computer vision and wearable devices have shed some light on the possibilities of performing recognition of daily life activities and kinematic evaluations in the wild using more feasible and minimally invasive solutions that can be integrated into home-based rehabilitation approaches. In recent years, two approaches have gained attention in the scientific community: (1) single-camera systems using consumer depth cameras such as Microsoft Azure Kinect® or 2D conventional cameras, and (2) wearable sensors using IMUs.

First, consumer depth cameras have been widely used for patient interaction in virtual reality telerehabilitation systems4, showing satisfactory performance in monitoring basic human poses. In addition, recent human pose estimators using deep neural networks can infer a simplified skeletal model of the human body even from 2D videos, despite the presence of cluttered backgrounds. The evolution of computer vision techniques already available in OpenPose5 or DeepLabCut6, among others, will eventually revolutionize the assessment of patients in their natural environment. Remarkably, they have been successfully used in neurological disorders to estimate gait parameters7,8,9 and describe trunk deficits10.

Second, inertial sensors offer a promising alternative to gold-standard movement acquisition tools. Some commercial systems, such as Xsens Awinda, have been shown to provide joint angle measurements with a technological error under 5° RMSE with respect to multi-optoelectronic systems11,12, which are still considered the gold standard today. The use of this kind of sensor in the medical field has been thoroughly explored in an extensive number of scientific publications13,14, proving their usability and user acceptance. Additionally, advanced signal processing tools have enabled the instrumentation of standard clinical tests that provide relevant data on balance15 and risk of falls13,16.

Although single-camera and inertial sensing solutions are promising, they are not without limitations. Human pose estimators using only one camera do not capture subtle movements, and their joint position estimates are not yet accurate enough for 3D kinematics, due to the inherent limitations of 2D video analysis and self-occlusions of body parts5,6. On the other hand, although inertial sensors are beneficial for tracking the 3D rotation of each body segment with high accuracy, IMU-to-segment calibration and drift challenges introduce complexity and errors in the acquisition12.

A search among publicly available datasets on human body movement showed that there are databases with a human activity recognition focus and others with a purely biomechanical focus. Notably, the former do not use tools to reconstruct movement tracking17,18,19,20, and the latter use high-end laboratory equipment and thus do not include data that could feasibly be collected in the natural environment21,22,23,24,25. Therefore, there is a need for datasets that include movements resembling daily-life activities, acquired with technologies that can be used in the wild, to pave the way toward more efficient telemedicine settings that are able to recognize the patients’ activity and track their movements in their natural environment.

Here, we propose the VIDIMU dataset26, which includes 54 healthy young adults recorded with video, 16 of them simultaneously with IMUs, while performing daily life activities. The novelty of the dataset is threefold: (i) the clinical relevance of the chosen movements, as these are included in typical functional assessment scales and physical rehabilitation programs; (ii) the acquisition of multimodal data using very affordable equipment, namely a commodity webcam and custom IMU sensors; (iii) the use of open-source state-of-the-art tools for processing and synchronization of the raw data.

Altogether, our dataset aims to enable 3D body pose tracking from video and 3D motion reconstruction in musculoskeletal models from inertial data during daily-life movements. It is anticipated to contribute to advancements in various scientific domains, including human body tracking, movement forecasting and recognition, and gross motor movement assessment, among others. This valuable resource has the potential to drive the development of affordable and dependable solutions for patient monitoring in their natural environments.

Methods

Overview of VIDIMU

The VIDIMU dataset26 includes 54 healthy young adults who were recorded on video; a subgroup of 16 subjects was simultaneously recorded using IMUs. For each subject, 13 activities were registered using a low-resolution video camera and five inertial measurement units (IMUs). Inertial sensors were placed on the lower or the upper limbs of the subject, depending on whether the activity involved movement of the lower or the upper body. Video recordings were post-processed using the state-of-the-art pose estimator BodyTrack (included in Maxine-AR-SDK27) to provide a sequence of joint positions for each movement. This estimator was chosen for our dataset instead of other approaches such as OpenPose5 because it can infer the 3D position (x, y, z) of the joints from a single-camera video. Raw IMU recordings were post-processed to compute joint angles by inverse kinematics with OpenSim28 (see Fig. 1). In addition, for recordings including simultaneous acquisition of video and IMU data, these signals were used for data file synchronization.

Fig. 1
figure 1

Examples of data files included in the dataset for upper (left) and lower body activities (right): a raw video file, a pose estimation from video, and a 3D motion reconstruction from inertial data. For upper-body movements (left), the subjects wear 5 IMUs on the upper limbs. For lower-body movements (right), the subjects wear 5 IMUs on the lower limbs. Individuals in the figures provided consent for their images to be published.

Subjects and ethical requirements

The data were recorded from 54 healthy adult subjects recruited among students (36 males, 18 females; 46 right-handed, 8 left-handed; age 25.0 ± 5.4 years). Before data acquisition, each subject received both a written and an oral explanation of the experiment and signed an informed consent form allowing their video and IMU data records to be published. The study was approved by the Institutional Review Board (or Ethics Committee) of “CEIm ÁREA DE SALUD VALLADOLID ESTE” (Valladolid, Spain), under protocol code PI 21-2341. The ethics approval allowed the data (both video and IMU data records) to be published under an open license. Dataset acquisition took place in the facilities of the Higher School of Telecommunications Engineering of the University of Valladolid from June 2022 to January 2023. Subjects not wearing IMUs wore a face mask because their data were collected right after the COVID-19 pandemic; subjects wearing IMUs were recorded later and did not wear a mask.

A battery of lower (Fig. 2) and upper (Fig. 3) limb activities typically used to evaluate motor deficits and the success of rehabilitation programs29,30,31,32 was selected. Among the lower limb activities, we included ‘walk forward (A01)’, ‘walk backward (A02)’, ‘walk along a line (A03)’ and ‘sit to stand (A04)’. Walking in different directions is one of the main goals of many rehabilitation programs, as it forms the basis of mobility and has several positive physiological effects. The ‘sit-to-stand’ activity was intended to mimic the transfer from a seated to a standing position, and it is also key for mobility and for gaining strength in the lower limbs.

Fig. 2
figure 2

Lower limb activities in the VIDIMU dataset26. Individuals in the figures provided consent for their images to be published.

Fig. 3
figure 3

Upper limb activities in the VIDIMU dataset26. Individuals in the figures provided consent for their images to be published.

Among the upper limb activities, we included unimanual and bimanual tasks, to cover the different aspects of an upper limb rehabilitation program. As the upper limb is involved in movements ranging from very simple to very complex, we attempted to cover a variety of complexities, starting from simply ‘move a bottle from side to side (right (A05) and left (A06) hand)’, continuing to functional movements like ‘drink from a bottle (right (A07) and left (A08) hand)’ and ‘reach up for a bottle from a high position (mimicking a shelf; right (A11) and left (A12) hand)’. More complex and bimanual movements included ‘assemble and disassemble a 6-piece LEGO tower (A09)’, ‘throw up a ball and catch it (A10)’ and ‘tear a sheet of paper into 4 pieces, make a ball and throw it (A13)’. These more complex activities are often used in rehabilitation programs to increase the leisure component of the exercises and motivate the patients.

Acquisition setup

The acquisition of the movements was performed using a commodity webcam (Microsoft™ LifeCam Studio for Business) and 5 affordable custom-designed IMU sensors33. Video was captured at 30 fps and 640 × 480 pixel resolution. IMU data were acquired wirelessly at 50 Hz using the 2.4 GHz frequency band. The sensors collected quaternion data and were set according to a right-handed ENU (East-North-Up) coordinate system. Before the sensors were worn by the subject, they were arranged in parallel on a table and a heading reset was performed, so that the local X axis of each sensor faced towards the camera frontal plane. Figure 4a shows the IMU reference coordinate system used during IMU acquisition and Fig. 4b shows the sensor locations for upper and lower body acquisitions.
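The heading reset was performed on the sensors themselves before they were worn. Purely to illustrate the underlying idea, the following sketch removes a common yaw (heading) offset from an ENU quaternion using SciPy; the function name and the (w, x, y, z) ordering assumption are ours, and this code is not part of the acquisition software.

```python
# Illustrative heading (yaw) reset on an ENU quaternion, assuming (w, x, y, z) ordering.
import numpy as np
from scipy.spatial.transform import Rotation as R

def heading_reset(q_wxyz):
    """Remove the yaw component of an orientation, keeping pitch and roll."""
    rot = R.from_quat(np.roll(q_wxyz, -1))        # (w, x, y, z) -> SciPy's (x, y, z, w)
    yaw = rot.as_euler("ZYX", degrees=True)[0]    # heading about the vertical (Up) axis
    reset = R.from_euler("Z", -yaw, degrees=True) * rot
    return np.roll(reset.as_quat(), 1)            # back to (w, x, y, z)

# Example: a pure 45-degree yaw collapses to (approximately) the identity quaternion.
q = np.roll(R.from_euler("Z", 45, degrees=True).as_quat(), 1)
print(np.round(heading_reset(q), 6))              # ~[1, 0, 0, 0]
```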

Fig. 4
figure 4

IMU sensor reference system and sensor placement. Individual in the figure provided consent for his image to be published.

The IMUs were placed on the subject’s limbs and trunk with Velcro straps. For upper body activities, the sensors were positioned on the back following an imaginary line connecting both posterior axillary folds (around T5-T7), on the lateral middle part of each upper arm, and on the posterior part of each wrist (Fig. 4b, left picture). For lower body activities, the sensors were positioned on the lower back (around L3-L5), on the lateral middle part of each thigh, and on the lateral cranial part of each lower leg (Fig. 4b, right picture). For data records including IMU data, the subject adopted a neutral pose (N-pose) before starting the movement and the instantaneous orientations of the sensors were recorded. The orientation of the sensors in this position is registered in the dataset as frame 0 and is used to perform the IMU-to-segment calibration. A detailed description of the mathematical procedure followed can be found in a previous study33. In addition, the VIDIMU dataset26 includes a video file of the subject while adopting the N-pose for each activity, and a file with the estimated joint locations during this pose.
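The exact IMU-to-segment calibration procedure is described in ref. 33. As a simplified illustration of how the frame-0 orientations can be used, the sketch below re-expresses a sensor orientation relative to its N-pose orientation; this is a generic approach under our own assumptions, not necessarily the one used to build the dataset.

```python
# Simplified illustration: express a sensor orientation relative to its N-pose
# (frame 0) orientation. Quaternions are assumed ordered (w, x, y, z).
import numpy as np
from scipy.spatial.transform import Rotation as R

def relative_to_npose(q_npose_wxyz, q_t_wxyz):
    """Rotation from the N-pose orientation to the orientation at time t."""
    q0 = R.from_quat(np.roll(q_npose_wxyz, -1))   # reorder to SciPy's (x, y, z, w)
    qt = R.from_quat(np.roll(q_t_wxyz, -1))
    return q0.inv() * qt

# Example: a sensor rotated 30 degrees about its N-pose Z axis.
q0 = np.array([1.0, 0.0, 0.0, 0.0])                             # identity at frame 0
qt = np.roll(R.from_euler("Z", 30, degrees=True).as_quat(), 1)  # back to (w, x, y, z)
print(relative_to_npose(q0, qt).as_euler("ZYX", degrees=True))  # ~[30, 0, 0]
```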

Acquisition protocol

The VIDIMU dataset26 includes data files of the 13 activities shown in Figs. 2, 3. Table 1 summarizes complementary information about the activities conducted by the subjects, including the number of repetitions and the oral instructions given to the participant. The instructions for the subjects to adopt the N-pose were: “adopt a standing position, facing frontally towards the camera, with the arms outstretched along the body, and with the palms of the hands facing inwards”. It is important to note that in activities A01 and A02 the subject moved perpendicular to the camera plane, so that gait movements were captured from the sagittal plane. In activity A03 the subjects walked along a line oriented 20 degrees with respect to the camera plane. In activities A04 to A13, the subject was turned to their left approximately 45° with respect to the frontal camera plane. This configuration has significant implications for the accuracy of the body pose detector from video: although it avoids self-occlusions of the body, it introduces a side effect on the measurement of joint angles.

Table 1 From left to right, the information displayed includes the activity ID, activity description, number of repetitions requested from participants, and oral instructions given to the subjects before initiating the activity.

Signal processing

The VIDIMU dataset26 provides both the raw data and the pre-processed data. The pre-processing steps include estimating joint positions from the video data and estimating joint angles from the IMU data. For those subjects captured both with video and IMUs, the outputs of these processing steps were also employed to further reprocess and synchronize the IMU and video files. A detailed description of these procedures follows.

The raw video data captured with the webcam were used to estimate 3D (x, y, z) absolute body joint positions in mm, using BodyTrack from Maxine-AR-SDK27. The plain-text output of BodyTrack was redirected to a text file (.out). The dataset26 includes the same information in more convenient comma-separated values files (.csv).

The raw quaternion data captured with the IMUs (.raw) were used for the inverse kinematic computation of the joint angles using OpenSim28. To this end, the OpenSim full-body model from Rajagopal et al.34 was edited. The model was modified to adopt a neutral pose and the constraints of the different joints were set according to Table 2.

Table 2 Joint angle constraints in the OpenSim model.

OpenSim’s IMU Placer tool was used to orient the IMUs on the model according to the initial orientation of the real sensors during N-pose calibration. The rest of the data records in each activity were used to compute inverse kinematics (IK). The weights for IK processing were configured by down-weighting distal IMUs, which improves the accuracy of the kinematic estimates and reduces drift35. Table 3 summarizes the ideal orientation of the sensors during N-pose calibration and the weight employed during IK.
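For readers who want to reproduce a similar workflow programmatically, the following sketch outlines an OpenSense-style calibration-plus-IK sequence using the OpenSim 4.x Python bindings. The file names and the sensor-to-OpenSim rotation are placeholders, per-sensor weights (Table 3) would be configured through the tool’s orientation weight settings (not shown), and this is not the exact script used to generate the dataset.

```python
# Sketch of an OpenSense-style workflow with the OpenSim 4.x Python bindings.
# File names and the sensor-to-OpenSim rotation below are placeholders.
import math
import opensim as osim

# 1) IMU placement: register the N-pose (frame 0) orientations on the model.
placer = osim.IMUPlacer()
placer.set_model_file("rajagopal_neutral_pose.osim")           # edited full-body model
placer.set_orientation_file_for_calibration("S40_A01_T01_npose.sto")
placer.set_sensor_to_opensim_rotations(osim.Vec3(-math.pi / 2, 0, 0))
placer.run(False)                                               # False: no visualization
placer.getCalibratedModel().printToXML("calibrated_model.osim")

# 2) Inverse kinematics on the full orientation recording.
ik = osim.IMUInverseKinematicsTool()
ik.set_model_file("calibrated_model.osim")
ik.set_orientation_file("S40_A01_T01.sto")                      # quaternions in .sto format
ik.set_results_directory("ik_results")                          # writes ik_*.mot and error .sto
ik.run(False)
```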

Table 3 Ideal expected orientation of IMU sensors during N-pose calibration and weight assigned to the sensor during IK estimation.

Data synchronization

For those subjects and activities in which video and IMU data records were acquired simultaneously, a step for data synchronization was applied.

Firstly, for each trial the joint positions extracted from video were employed to compute the angle of a joint of interest (see Table 4). This angle was computed as the angle between two consecutive 3D body segments, which were determined from BodyTrack’s estimated positions of the joints listed in the third column of the table. The criterion for choosing the joint of interest was to select the one for which the estimation of body segments could be expected, beforehand, to be more reliable according to the position of the subject and the direction of movement with respect to the camera. Joint angles for flexion-extension of the knee, the elbow, and the arm were chosen.
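As an illustration of this computation, the following sketch derives a knee flexion-extension-type angle as the angle between the thigh and shank segments defined by three BodyTrack joint positions; the variable names and example coordinates are ours.

```python
# Joint angle as the angle between two consecutive 3D body segments, computed
# from three joint positions (here: hip, knee, ankle) estimated by BodyTrack.
import numpy as np

def segment_angle(p_proximal, p_joint, p_distal):
    """Angle (degrees) at p_joint between segments p_joint->p_proximal and p_joint->p_distal."""
    u = np.asarray(p_proximal, dtype=float) - np.asarray(p_joint, dtype=float)
    v = np.asarray(p_distal, dtype=float) - np.asarray(p_joint, dtype=float)
    cos_a = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return np.degrees(np.arccos(np.clip(cos_a, -1.0, 1.0)))

# Example with made-up positions in mm (180 degrees would be a fully extended knee).
hip, knee, ankle = [0, 900, 0], [30, 500, 20], [10, 80, 150]
print(round(segment_angle(hip, knee, ankle), 1))
```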

Table 4 Joint angles in the sagittal plane used for data synchronization.

Secondly, the joint angle signals computed from video and from the IMUs were synchronized for each data record. This process included the following steps: subsampling the IMU joint angles from 50 Hz to 30 Hz, smoothing the video signals with a 5-sample moving average filter, and shifting one signal over the other until the RMSE over the first 180 signal samples, which correspond to the first 6 seconds of the activity, was minimized. A detailed view of each step is visualized in Fig. 5.
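The sketch below illustrates these three steps on two one-dimensional joint angle signals. It is a simplified re-implementation under our own assumptions (linear interpolation for resampling, a symmetric moving average, a bounded integer shift search), not the exact code used to build the dataset.

```python
# Illustrative synchronization: resample IMU angles from 50 Hz to 30 Hz, smooth the
# video-derived angles with a 5-sample moving average, and search for the integer
# shift that minimizes the RMSE over the first 180 samples (6 s at 30 Hz).
import numpy as np

def resample_to_30hz(signal_50hz):
    t_src = np.arange(len(signal_50hz)) / 50.0
    t_dst = np.arange(0.0, t_src[-1], 1.0 / 30.0)
    return np.interp(t_dst, t_src, signal_50hz)

def moving_average(signal, window=5):
    return np.convolve(signal, np.ones(window) / window, mode="same")

def best_shift(imu_30hz, video_30hz, max_shift=30, n=180):
    """Signed shift (samples) of the video signal w.r.t. the IMU signal minimizing RMSE."""
    imu = imu_30hz - np.mean(imu_30hz[:n])          # mean removal, as in Fig. 5 panel D
    vid = video_30hz - np.mean(video_30hz[:n])
    best, best_rmse = 0, np.inf
    for s in range(-max_shift, max_shift + 1):
        a = imu[max(0, -s):max(0, -s) + n]          # s < 0: cut samples from the IMU signal
        b = vid[max(0, s):max(0, s) + n]            # s > 0: cut samples from the video signal
        m = min(len(a), len(b))
        rmse = np.sqrt(np.mean((a[:m] - b[:m]) ** 2))
        if rmse < best_rmse:
            best, best_rmse = s, rmse
    return best, best_rmse
```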

Fig. 5
figure 5

Example results of synchronization processing by minimizing RMSE between IMU (red) and video (blue) sources of data. The X axis represents the signal sample, and the Y axis represents the joint angle. For a given activity (A01 in the figure), we show the reconstructed angle with IMU data (A) and with video data (B). Panel C shows the first 180 samples (6 seconds) of the same signals after median-filtered smoothing. Panel D shows the effect of mean removal, so that the absolute ranges of motion of joint angles estimated with both sources of data can be better compared. Finally, panel E represents the optimal shifting of one of the signals, so that the Root Mean Squared Error (RMSE) is minimized. The number of samples required to shift the video or IMU signals to the left is indicated in brackets on top of the subplot in panel E: e.g. “cut imu:0, cut vid:3” would mean that the video-derived signal needs to be shifted 3 samples to the left to be synchronized with the IMU-derived signal.

Following this strategy, synchronized versions of the text files containing video (.csv) and IMU (.raw, .mot) information were generated. The VIDIMU dataset26 includes those files in a specific subfolder (/dataset/videoandimusync). In addition, plots equivalent to those in Fig. 5, but for every subject and activity, are also available as dataset files in a specific subfolder (/analysis/videoandimusync).

Data Records

The VIDIMU dataset26 is stored in Zenodo. Human body movement data record files are named according to the pattern S##_A&&_T$$.@@@, for subject (S), activity (A) and trial (T), where the ## digits refer to the subject number, && to the activity, $$ to the recorded trial, and @@@ to the file extension (e.g. S40_A01_T01.raw, S40_A01_T01.mp4, S40_A01_T01.csv). Only one trial per subject and activity is available in the dataset. This single file includes all the movement repetitions required according to the protocol (see Table 1). The reason a trial identifier can differ from T01 is that some trials were discarded during acquisition due to, e.g., incorrect calibration, incorrect movements, or sensor or video errors. The following body measurements of the 16 subjects recorded in video and wearing IMU sensors are provided in the “bodyMeasurements.csv” file: height (cm), weight (kg), shoulder height (cm), shoulder width (cm), elbow span (cm), wrist span (cm), arm span (cm), hip height (cm), hip width (cm), knee height (cm), ankle height (cm), foot length (cm). For the inverse kinematics process to be executed, OpenSim requires the quaternion information in the .raw files to be translated to storage (.sto) files with a tabular text format, which are also included as part of the dataset26. The inverse kinematics process in OpenSim generates, for every subject and record, a motion (.mot) file that includes the prefix ‘ik_’ (e.g. ik_S40_A01_T01.mot), and an orientation errors .sto file that includes the same prefix and the suffix ‘_orientationErrors’ (e.g. ik_S40_A01_T01_orientationErrors.sto).
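As a convenience, the naming pattern above can be decoded with a small regular expression. This sketch is ours and only covers the base record names; derived files add prefixes such as ‘ik_’ and suffixes such as ‘_orientationErrors’.

```python
# Decode a VIDIMU record file name such as "S40_A01_T01.raw" into its components.
import re

RECORD_NAME = re.compile(r"^S(?P<subject>\d{2})_A(?P<activity>\d{2})_T(?P<trial>\d{2})\.(?P<ext>\w+)$")

def parse_record_name(filename):
    match = RECORD_NAME.match(filename)
    if match is None:
        raise ValueError(f"Not a VIDIMU record file name: {filename}")
    return {key: (value if key == "ext" else int(value)) for key, value in match.groupdict().items()}

print(parse_record_name("S40_A01_T01.csv"))
# -> {'subject': 40, 'activity': 1, 'trial': 1, 'ext': 'csv'}
```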

A general overview of the dataset folder hierarchy and data file formats is included in Table 5. Subjects S03 and S04 revoked their informed consent for publishing their videos and data. Subjects S43 and S45 were removed because of technical issues detected during IMU data collection, and S48 was removed because of significant errors during body pose detection with BodyTrack caused by the poor stability of the camera focus. The related files have not been included in the dataset.

Table 5 Overview of dataset’s folder organization.

Technical Validation

The technical validation aimed at verifying that the acquired data were representative of movement in real-life conditions and that subjects and activities were correctly indexed. More specifically, for the video data it was checked that consistent joint angles were correctly inferred from the 3D joints estimated with BodyTrack; and for the IMU data it was checked that inverse kinematics generated consistent motions in a musculoskeletal model using OpenSim.

Detailed checking of the video data was done following several steps. First, by ensuring the integrity of the files containing the BodyTrack output. Second, by estimating joint angles and plotting them. Next, by having a biomechanics expert perform a visual comparison of those graphs and the inferred skeleton in the BodyTrack video output. Last, visual inspection of the coherence of the plotted signals for each activity across different subjects was used to detect possible labelling errors. Figure 6 shows examples of estimated joint angle plots for lower-body activity A01 and upper-body activity A10. Joint angle estimation was performed with the code accompanying the dataset, and a median filter was used to remove peaks in the signals before plotting. The dataset includes a folder with equivalent plots for every activity and subject (folders: /analysis/videonly/vangles and /analysis/videoandimus/vangles). It is important to note that when subject tracking errors occur, BodyTrack applies a default value to the 3D position of the joints. In the plots, this implies that the angle of the joints takes a constant value (e.g. 90 degrees).
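Because such tracking failures appear as constant default values, they are straightforward to flag automatically. The sketch below is an illustrative check only, with a default value and tolerance chosen by us.

```python
# Flag samples where a video-derived joint angle is pinned at a constant default value,
# which indicates that BodyTrack fell back to default joint positions for those frames.
import numpy as np

def flag_default_frames(angle_deg, default_value=90.0, tol=1e-3):
    """Boolean mask of samples whose angle equals the default value (within tol)."""
    angle_deg = np.asarray(angle_deg, dtype=float)
    return np.abs(angle_deg - default_value) < tol

angles = np.array([85.2, 88.9, 90.0, 90.0, 90.0, 92.4])
print(flag_default_frames(angles))   # [False False  True  True  True False]
```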

Fig. 6
figure 6

Examples of estimated joint angles inferred from 3D joint positions for activity A01 and activity A10. From left to right: right shoulder, left shoulder, right elbow, left elbow, right knee, left knee. Equivalent plots for every subject and activity are included as dataset files.

The IMU data underwent comprehensive verification. First, we ensured the integrity of the files containing the raw IMU quaternion data. Second, we plotted the data to verify that the data collected by all the wirelessly connected sensors were complete (e.g. Fig. 7; more in folder /analysis/videoandimus/quats). Next, we applied inverse kinematics using OpenSim, generated motion files (.mot), and plotted the joint angles estimated through inverse kinematics (e.g. Fig. 8; more in folder /analysis/videoandimus/iangles), which were verified by an expert and compared to the movements registered on video. Lastly, the generated joint graphs were visually compared with the video and IMU signals in the synchronization plots (e.g. Fig. 5), and the reconstructed movements were inspected in .mot files using OpenSim (e.g. Fig. 9). Motion files (.mot) for every subject and activity are included as dataset files (folder: /dataset/videoandimus).

Fig. 7
figure 7

Examples of raw quaternion data collected for lower body activity A02 and upper body activity A05. For lower body activities, from left to right: quaternion data from IMU sensors placed on the hips, right upper leg, right lower leg, left upper leg and left lower leg. For upper body activities, from left to right: quaternion data from IMU sensors placed on the back, right upper arm, right lower arm, left upper arm and left lower arm. Equivalent plots for every subject and activity are included as dataset files.

Fig. 8
figure 8

Examples of estimated joint angles computed through inverse kinematics from raw IMU data, for lower body activity A04 and upper body activity A07. Equivalent plots for every subject and activity, including additional joint angles, are included as dataset files. For lower body activities, these files include joint angles for: pelvis_tilt, pelvis_list, pelvis_rotation, hip_flexion_r, hip_adduction_r, hip_rotation_r, knee_angle_r, hip_flexion_l, hip_adduction_l, hip_rotation_l, knee_angle_l. For upper body activities, they include joint angles for: lumbar_extension, lumbar_bending, lumbar_rotation, arm_flex_r, arm_add_r, arm_rot_r, elbow_flex_r, pro_sup_r, arm_flex_l, elbow_flex_l, pro_sup_l.

Fig. 9
figure 9

Reconstruction of movements using inverse kinematics in OpenSim for subject S40. Left: lower body activities A01, A02, A03, and A04. Right: upper body activities A05, A09, A10, A13. Motion files (.mot) for every subject and activity are included as dataset files.

Usage Notes

Specific considerations on the content of the different data file types in the VIDIMU dataset26 follow:

  • .raw files: these files include the original raw quaternion (w, x, y, z) data acquired with the custom IMUs. The first 5 data lines of the file contain the orientation of the IMUs while the subject was adopting the N-pose. The first column indicates the body location of the IMU sensor: qsHIPS stands for lower back, qsRUL for right upper leg, qsRLL for right lower leg, qsLUL for left upper leg, qsLLL for left lower leg, qsBACK for upper back, qsRUA for right upper arm, qsRLA for right lower arm, qsLUA for left upper arm, and qsLLA for left lower arm.

  • .sto files: these files follow the tabular format required by OpenSim to store time series data. The dataset includes .sto files containing the same information as the .raw files, and also .sto files generated by OpenSim after the inverse kinematics computation, containing the orientation errors.

  • .mot files: these files include 3D joint angles computed in OpenSim. The first data line of the text file contains the orientation of the IMUs while the subject was adopting the N-pose. These files include the following estimated joint angles in OpenSim: pelvis_tilt, pelvis_list, pelvis_rotation, pelvis_tx, pelvis_ty, pelvis_tz, hip_flexion_r, hip_adduction_r, hip_rotation_r, knee_angle_r, knee_angle_r_beta, ankle_angle_r, subtalar_angle_r, mtp_angle_r, hip_flexion_l, hip_adduction_l, hip_rotation_l, knee_angle_l, knee_angle_l_beta, ankle_angle_l, subtalar_angle_l, mtp_angle_l, lumbar_extension, lumbar_bending, lumbar_rotation, arm_flex_r, arm_add_r, arm_rot_r, elbow_flex_r, pro_sup_r, wrist_flex_r, wrist_dev_r, arm_flex_l, arm_add_l, arm_rot_l, elbow_flex_l, pro_sup_l, wrist_flex_l, wrist_dev_l. Loading several motion files at once onto a previously loaded model in OpenSim is faster by dragging and dropping the .mot files onto the toolbar of the application.

  • .csv files: include the 3D coordinates (x, y, z) in mm estimated by BodyTrack for the following body parts: pelvis, left hip, right hip, torso, left knee, right knee, neck, left ankle, right ankle, left big toe, right big toe, left small toe, right small toe, left heel, right heel, nose, left eye, right eye, left ear, right ear, left shoulder, right shoulder, left elbow, right elbow, left wrist, right wrist, left pinky knuckle, right pinky knuckle, left middle tip, right middle tip, left index knuckle, right index knuckle, left thumb tip, right thumb tip. A minimal loading sketch for the .csv and .mot files is given after this list.
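The following sketch shows one way to load the tabular formats above in Python. The paths are placeholders, column names are not reproduced here, and this code is not part of the dataset tooling.

```python
# Minimal loading sketch for the .csv (BodyTrack joint positions) and .mot
# (OpenSim joint angles) files. Paths below are placeholders.
import pandas as pd

def load_bodytrack_csv(path):
    """Load BodyTrack 3D joint positions (x, y, z in mm); inspect df.columns for the joint names."""
    return pd.read_csv(path)

def load_opensim_mot(path):
    """Load an OpenSim .mot file: skip the header block, which ends with an 'endheader' line."""
    with open(path) as f:
        header_end = next(i for i, line in enumerate(f) if line.strip().lower() == "endheader")
    return pd.read_csv(path, sep=r"\s+", skiprows=header_end + 1)

# positions = load_bodytrack_csv("dataset/videoandimus/S40_A01_T01.csv")
# angles = load_opensim_mot("dataset/videoandimus/ik_S40_A01_T01.mot")
```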

Given the multimodal approach of the dataset, the thorough acquisition protocol followed, and the details explained in the technical validation section, the authors consider that the VIDIMU dataset26 is useful for a wide range of applications. However, it should also be noted that it has certain limitations that must be considered prior to its use, specifically if the purpose is to pursue clinically applicable solutions. The main limitation in this regard is the lack of ground-truth kinematic information from a gold-standard optoelectronic system. Another limitation of the dataset is that the raw IMU data only include the quaternion information that was collected during acquisition. Although the accelerometer, gyroscope and magnetometer data were individually acquired by every single sensor, only the quaternion data (computed internally) were wirelessly sent to a computer. The reason for this was to reduce the data load over the 2.4 GHz communication, ensuring more reliable sensor synchronization. The data fusion algorithm that is applied internally on every BNO080 sensor was configured following the manufacturer’s recommendations. Among the possible configurations, dynamic data acquisition using the rotation vector configuration was chosen.