A myoelectric digital twin for fast and realistic modelling in deep learning

Maksymenko, Kostiantyn; Clarke, Alexander Kenneth; Mendez Guerra, Irene; Deslauriers-Gauthier, Samuel; Farina, Dario

doi:10.1038/s41467-023-37238-w


Article
Open access
Published: 23 March 2023


Nature Communications volume 14, Article number: 1600 (2023)


Abstract

Muscle electrophysiology has emerged as a powerful tool to drive human-machine interfaces, with many recent applications outside the traditional clinical domains, such as robotics and virtual reality. However, more sophisticated, functional, and robust decoding algorithms are required to meet the fine control requirements of these applications. Deep learning has shown high potential in meeting these demands, but requires a large amount of high-quality annotated data, which is expensive and time-consuming to acquire. Data augmentation using simulations, a strategy applied in other deep learning applications, has never been attempted in electromyography due to the absence of computationally efficient models. We introduce the concept of a Myoelectric Digital Twin: a highly realistic and fast computational model tailored for the training of deep learning algorithms. It enables the simulation of arbitrarily large and perfectly annotated datasets of realistic electromyography signals, allowing new approaches to muscular signal decoding and accelerating the development of human-machine interfaces.


Introduction

Biosignals have been classically used for studying the underlying physiology, for clinical diagnostics, and for monitoring. More recently, they have also been used for interfacing humans with external devices. For example, signals measured at the surface of the skin from skeletal muscle electrical activity, i.e. surface electromyography (sEMG), are used for the control of bionic limbs¹. In this application, the recorded electrical signals are converted into motion commands using machine learning^2,3,4. In recent years, with the development of deep-learning-based methods as well as wearable and cost-effective recording devices, there has been increased interest in using muscular signals as a basis for human-machine interfaces^5,6. The potential applications go well beyond the traditional clinical domains of prostheses and orthoses and range from robotic control to gaming and virtual reality⁷.

To be useful in real-world applications, particularly for the mass market, EMG-based muscle-computer interfaces need to be highly intuitive and functional. They also need to work for the vast majority of subjects with little or no user-specific calibration, regardless of their anatomy and physiology. Traditional machine learning techniques might work well for the relatively simple task of hand gesture classification, but they require training on a specific subject and potential regular recalibrations^1,7. This is a major bottleneck for mass market applications, where obtaining subject-specific data is not possible. Deep learning methods could tackle these problems⁸, but large, annotated, and high-quality datasets are required. To consider deep learning, training data needs to be recorded for different subjects, at different times, with high variability in electrode configurations and experimental paradigms. In addition, it is challenging and in some cases impossible to properly describe the underlying physiological or neural parameters (e.g. individual muscle forces, fibre physiological parameters, motor neuron impulse timings), which are crucial for the correct annotation of data samples. As a result, acquiring experimental EMG data in sufficient quantity and quality is not only expensive and time-consuming, but in many cases not possible.

Data augmentation via simulation is an alternative approach to lengthy data acquisitions, and indeed, augmentation techniques have been recently introduced for electrophysiological signals^9,10,11,12. However, most of these augmentation methods use black-box models, which aim to capture essential features of the signal without relating them to the underlying physiology¹³. Thus, the ground truth for most of the crucial parameters is still unknown, greatly limiting the potential use cases of such approaches. More sophisticated biophysical modelling methods are based on solving so-called forward equations (e.g., Poisson equation in the electrostatics case). However, this type of biophysical modelling has not been considered in the context of data augmentation for machine learning approaches. Indeed, state-of-the-art models are either not sufficiently realistic or not computationally efficient to produce suitable training data. For example, in the case of describing the generation of EMG signals, analytical models based on simple geometries of the tissues^{14,15,16,17,18} provide simulations that reflect the broad characteristics of the signals, but cannot be used to reproduce specific experimental conditions due to the overly simplified anatomy. The more realistic models of EMG generation based on numerical solutions of the Poisson equation with generic volume conductor shapes^19,20 are currently limited by their prohibitive computational time.

Here, we describe an EMG simulation method, based on the numerical solution of the forward equations, that is suitable for deep learning data augmentation. It produces highly realistic EMG recordings, provides access to all underlying physiological parameters, and is significantly more computationally efficient than the state-of-the-art methods. Although there is no benchmark to evaluate and compare the exact performance of different simulation methods, our model takes minutes (or even seconds) to perform simulations that take hours with the state-of-the-art methods^20,21. We achieved this gain in computational efficiency not by simply optimizing the code or using parallel computing, but by exploiting the mathematical and structural properties of the model and reformulating them theoretically. Our results show that it is possible to simulate EMG signals for anatomically accurate conductor geometries and multiple muscles with tens of thousands of muscle fibres in a few seconds. This realistic and fast model allows, for the first time, the generation of synthetic EMG data in sufficient quantity and quality to train deep learning algorithms. Figure 1a illustrates a general pipeline of using the Myoelectric Digital Twin to train EMG processing AI. At the training stage, the user can define a wide variety of simulation parameters. The software generates a corresponding EMG dataset coupled with all the metadata that can potentially be used for annotation. The annotated data is then used to train a specific AI. This trained AI can then be used to process real EMG measurements and translate them into commands for real-world applications.

Note that our model and the proposed pipeline are quite general and do not depend on a specific neural network architecture or the final application. As a concrete example, Fig. 1a shows an AI which is trained to reconstruct individual muscle forces from an EMG signal. In this case, the exact muscle forces provided as input to the simulation are used to annotate the simulated data. Notice that training such an algorithm with experimental EMG data would be impossible, because the ground truth of exact muscle forces is unavailable in real measurements. Once trained on synthetic data, the AI can be used on real subjects without calibration or fine-tuning. When a person makes some complex hand movements, the corresponding EMG signal is processed by the AI and the muscle activations are reconstructed in real time. This information can then be used to interface with different devices with high precision and gesture complexity.

This particular application is just an example. The same approach can be used for different tasks: categorical gesture classification, denoising, fatigue detection, and so on. Our approach is also agnostic to the exact neural network architecture. As an application scenario and a proof of concept, in this work, we demonstrate the use of this model for data augmentation by pre-training neural networks that decompose EMG into the underlying neural activity sent from the spinal cord to muscles²².

Note that, because simulating large EMG datasets was not possible before, the effect of using such data for AI training is not yet well studied. This includes both the potential advantages of synthetic EMG data and the potential biases that it can introduce into training. Our tool, however, opens the door to new research topics in this domain.

Our model is the only realistic and computationally efficient simulator targeted at AI training and approaching the concept of a Myoelectric Digital Twin. It allows the generation of arbitrarily large datasets of realistic and personalized EMG signals, with high data variability and with a perfect annotation of diverse hidden parameters. As a result, our model may allow breakthrough approaches in AI-based EMG signal processing and decoding.

Results

Software structure and user interface

Our Myoelectric Digital Twin is cloud-based software with a Python API, which gives users a simple yet flexible way to define various simulation parameters and to control the simulation pipeline. Users can define the subject’s anatomy (by providing surfaces of the muscles, bones, fat and skin), tissue conductivities, the electrode configuration (electrode locations, sizes, shapes), individual fibre properties (location of neuromuscular junctions, tendon lengths, action potential propagation velocities), motor unit and recruitment model parameters, and the activation of each muscle.

Our software is not limited to individual muscle activations. Instead, it allows users to define arbitrary activation (% of MVC as a function of time) for each muscle of the model (e.g. a forearm) simultaneously. The corresponding EMG signal is a superposition of contributions of each individual muscle. Figure 1b shows a schematic representation of the user’s input, main simulation modules and their interaction. Each step of the simulation pipeline depends on the output of the previous module and on a specific subset of the input parameters. This architecture, dictated by the mathematical properties of the model, allows efficient management of pre-computed data (i.e. changing fibre properties does not require recomputing the forward solver). Moreover, users can provide their own data as an input for each module instead of using automated tools of our software, if necessary (see Supplementary Movie for a video example of a simulation script).
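
The staged architecture can be illustrated with a minimal caching sketch. It is not the API of the software: the function names (`forward_solution`, `fibre_templates`, `simulate`), their arguments, and the placeholder arithmetic are all hypothetical. The only point is that each stage is cached on its own subset of inputs, so changing a fibre property reuses the expensive forward stage.

```python
from functools import lru_cache

# Counters that record how often each (hypothetical) stage actually runs.
calls = {"forward": 0, "fibres": 0}

@lru_cache(maxsize=None)
def forward_solution(anatomy, electrodes):
    """Stand-in for the expensive volume conductor stage."""
    calls["forward"] += 1
    # Placeholder gains, one per electrode; a real solver would go here.
    return tuple(1.0 / (1 + i) for i in range(len(electrodes)))

@lru_cache(maxsize=None)
def fibre_templates(anatomy, electrodes, velocity_m_s):
    """Stand-in for the fibre stage, which reuses the cached forward stage."""
    calls["fibres"] += 1
    gains = forward_solution(anatomy, electrodes)
    return tuple(g * velocity_m_s for g in gains)

def simulate(anatomy, electrodes, velocity_m_s, activation):
    """Scale each electrode template by the activation profile over time."""
    templates = fibre_templates(anatomy, electrodes, velocity_m_s)
    return [[a * t for a in activation] for t in templates]

electrodes = ("e1", "e2", "e3")
activation = (0.0, 0.5, 1.0)  # fraction of MVC over three time steps
first = simulate("forearm", electrodes, 4.0, activation)
second = simulate("forearm", electrodes, 3.5, activation)  # fibre property changed
print(calls)
```

In this toy run the second call changes only the propagation velocity, so the fibre stage runs again while the forward stage is served from the cache.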

Biophysics

To allow the efficient simulation of a large quantity of highly realistic EMG recordings, we have developed a novel approach to solve the forward problem of the volume conductor in electrostatic conditions. Our approach is based on a hierarchical and flexible decomposition of the EMG simulation pipeline, which allows the reuse and optimisation of individual steps.

In this section, we give a general overview of the model and advantages that it provides. The details and all mathematical equations related to the model development are described in the ‘Methods’.

First, a realistic anatomy, described by bone, muscle, skin, and electrode surfaces, is discretized into a tetrahedral volume mesh. A conductivity tensor, anisotropic for muscles and isotropic elsewhere, is associated with each tetrahedron of the volume. Unlike the state-of-the-art approaches, which solve the quasi-static Maxwell’s equations for each fibre source and for each time instant, we solve them for a set of unit point sources located at each vertex of the mesh associated with the muscle tetrahedra, which are referred to as basis sources. This computation does not depend on the time variable, nor on the fibres and motor unit geometry and their physiological properties. Therefore, changing these parameters does not require recomputing the forward solutions.

Moreover, due to a rewriting of the equations involved using the so-called adjoint method, the solution is obtained by solving as many systems of equations as there are electrodes, rather than basis sources. Because the number of electrodes (≈10²) is typically much lower than the number of basis sources (≈10⁵), computational performance is substantially improved.
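
The saving from the adjoint formulation can be checked on a toy problem. The sketch below is an illustration under stated assumptions, not the solver of the paper: the "volume conductor" is a 12-node resistor chain with a small grounding term (added so the matrix is invertible), and `solve`, `chain_stiffness`, `source_nodes` and `electrode_nodes` are hypothetical names. The potential at electrode node e due to a unit source at node s is the (e, s) entry of the inverse system matrix, which can be obtained either by one solve per source or by one solve per electrode.

```python
def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def unit(n, k):
    """Unit vector with a one at index k."""
    v = [0.0] * n
    v[k] = 1.0
    return v

def chain_stiffness(n, ground=0.1):
    """Stiffness matrix of a 1D resistor chain, grounded at every node."""
    a = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        a[i][i] += 1.0
        a[i + 1][i + 1] += 1.0
        a[i][i + 1] -= 1.0
        a[i + 1][i] -= 1.0
    for i in range(n):
        a[i][i] += ground
    return a

n = 12
A = chain_stiffness(n)
source_nodes = list(range(2, 10))  # 8 toy basis sources
electrode_nodes = [0, 11]          # 2 toy electrodes

# Direct approach: one linear solve per basis source (8 solves).
direct = [[0.0] * len(source_nodes) for _ in electrode_nodes]
for j, s in enumerate(source_nodes):
    phi = solve(A, unit(n, s))
    for i, e in enumerate(electrode_nodes):
        direct[i][j] = phi[e]

# Adjoint approach: one solve with the transposed matrix per electrode (2 solves).
A_T = [list(col) for col in zip(*A)]
adjoint = []
for e in electrode_nodes:
    y = solve(A_T, unit(n, e))
    adjoint.append([y[s] for s in source_nodes])

max_diff = max(abs(d - a) for dr, ar in zip(direct, adjoint) for d, a in zip(dr, ar))
print(max_diff)
```

Both routes fill the same electrode-by-source table, but the adjoint route needs 2 solves instead of 8; with roughly 10² electrodes and 10⁵ basis sources the ratio is correspondingly larger.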

Second, using the same muscle surfaces used to describe the volume conductor, individual fibre geometries can be automatically generated, if this data is not available from other sources (e.g. from diffusion magnetic resonance imaging). Moreover, the fibres are grouped into motor units (MUs) following the state-of-the-art models for MU physiology. This step does not depend on the forward computations, and thus altering the related parameters and producing new simulations is highly efficient.

Third, the current source density propagating along the fibres is generated using a realistic intracellular action potential model. The contribution of individual fibres to the EMG recordings is obtained by discretizing each fibre into a set of points, integrating the current source density along its length, and projecting onto the sensor locations using the solutions for the basis sources computed in the first step. This approach effectively decouples the number of fibres and their discretization from the conductor model, allowing the simulator to handle tens of thousands of fibres per muscle. Again, changing the fibre parameters (end-plate location, action potential propagation velocity, tendon length, etc.) does not require recomputing the other blocks of the simulation.
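
A minimal sketch of this third step is given below, under loudly simplified assumptions: the fibre is a straight line, the travelling source (`csd_profile`) is the second derivative of a Gaussian rather than the intracellular action potential model used in the paper, and the electrode weighting (`lead_field`) is a hypothetical inverse-distance stand-in for the precomputed forward solutions. It only shows the structure of the computation: discretize the fibre, weight the source density at each point, and sum along the length.

```python
import math

def csd_profile(u_mm, width_mm=2.0):
    """Placeholder travelling source: second derivative of a Gaussian."""
    w2 = width_mm ** 2
    return (u_mm ** 2 / w2 - 1.0) / w2 * math.exp(-u_mm ** 2 / (2.0 * w2))

def lead_field(z_mm, electrode_z_mm, depth_mm=10.0):
    """Hypothetical electrode weighting that decays with distance."""
    return 1.0 / math.hypot(z_mm - electrode_z_mm, depth_mm)

def fibre_signal(electrode_z_mm, times_ms, length_mm=100.0, nmj_mm=50.0,
                 velocity_mm_ms=4.0, n_points=400):
    """Integrate the weighted source density along a discretized fibre."""
    dz = length_mm / n_points
    z = [(k + 0.5) * dz for k in range(n_points)]
    weights = [lead_field(zk, electrode_z_mm) for zk in z]
    signal = []
    for t in times_ms:
        travelled = velocity_mm_ms * t
        total = 0.0
        for zk, w in zip(z, weights):
            # Two wavefronts leave the neuromuscular junction in opposite directions.
            u = abs(zk - nmj_mm) - travelled
            total += w * csd_profile(u) * dz
        signal.append(total)
    return signal

def differential(z_a_mm, z_b_mm, times_ms):
    """Bipolar channel: difference between two electrode signals."""
    a = fibre_signal(z_a_mm, times_ms)
    b = fibre_signal(z_b_mm, times_ms)
    return [x - y for x, y in zip(a, b)]

times_ms = [0.25 * k for k in range(49)]           # 0 to 12 ms
across_nmj = differential(45.0, 55.0, times_ms)    # electrodes straddle the NMJ
off_nmj = differential(70.0, 80.0, times_ms)       # electrodes on one side
print(max(abs(v) for v in across_nmj), max(abs(v) for v in off_nmj))
```

In this symmetric toy setup, a bipolar channel whose electrodes straddle the neuromuscular junction cancels, while a channel on one side of it records a propagating component.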

Fourth, given a muscle activation profile, we use the size principle to recruit MUs and their associated fibres. This allows a simple and easily interpretable input to the simulation that can be used to simulate EMG recordings associated with specific muscle contractions and their movements.
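
Size-principle recruitment can be sketched as follows. The exponential threshold distribution and the linear rate coding are illustrative choices in the spirit of classical motor unit pool models, and every parameter value (`max_threshold`, `spread`, the rates and gain) is an assumption rather than a setting taken from the paper. Units are indexed from smallest to largest, and a unit becomes active once the excitation exceeds its threshold.

```python
def recruitment_thresholds(n_units, max_threshold=0.6, spread=30.0):
    """Thresholds (fraction of MVC) growing exponentially with unit index."""
    return [
        max_threshold * (spread ** (i / (n_units - 1)) - 1.0) / (spread - 1.0)
        for i in range(n_units)
    ]

def discharge_rates(excitation, thresholds, min_rate_hz=8.0,
                    peak_rate_hz=35.0, gain_hz=40.0):
    """Rate of each unit: zero below threshold, then linear up to a peak rate."""
    rates = []
    for thr in thresholds:
        if excitation > thr:
            rates.append(min(peak_rate_hz, min_rate_hz + gain_hz * (excitation - thr)))
        else:
            rates.append(0.0)
    return rates

thresholds = recruitment_thresholds(200)
ramp = [0.0, 0.1, 0.3, 0.5, 1.0]  # excitation as a fraction of MVC
active_counts = [
    sum(1 for r in discharge_rates(level, thresholds) if r > 0.0)
    for level in ramp
]
print(active_counts)
```

With these choices, both the number of active units and their discharge rates grow with excitation, the two mechanisms behind the amplitude increase described in the Results.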

As a result, our model is the first that allows the generation of highly realistic and arbitrarily large (because of its computational performance) datasets of simulated EMG signals that can be used for AI training.

The simulator reproduces analytical solutions

To produce realistic EMG data, the simulator leverages a flexible representation of the underlying anatomy and physiology. This flexibility does not only allow the use of realistic and personalized models, but also permits reproducing simple conductor geometry used in analytical solutions. We compare our numerical solution with its analytical counterpart for a cylindrical volume conductor geometry²³. Taking into account the complexity of the model and the number of approximations used in computing the numerical solution, it is important to show that it reproduces the analytical solution with low error. The normalized mean square error between the two solutions depended on the depth of the fibre and varied between 3% (1-mm depth from the muscle surface) and 5% (11-mm depth). Figure 2 illustrates the analytical and numerical solutions for a fibre depth of 1 mm from the muscle surface. Because of the low error, the two waveforms are almost indistinguishable. It is important to note that the two volume conductor models in this validation are not identical. The theoretical/analytical solution is computed for an infinitely long cylinder (repeated periodically when discretized), while the numerical solution uses a cylinder of a large (sufficiently longer than the fibre and the electrode array), yet finite length. Increasing the length of the cylinder did not significantly alter the error.
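
For reference, one common definition of the normalized mean square error is sketched below. The section does not state which normalisation was used, so dividing by the energy of the analytical (reference) waveform is an assumption, and `nmse_percent` is a hypothetical helper name.

```python
def nmse_percent(reference, estimate):
    """Squared error between two waveforms, as a percentage of reference energy."""
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    reference_energy = sum(r ** 2 for r in reference)
    return 100.0 * error_energy / reference_energy

analytical = [0.0, 1.0, -2.0, 1.5, 0.0]
numerical = [1.1 * v for v in analytical]  # toy waveform with a 10% amplitude error
print(nmse_percent(analytical, numerical))
```

Under this definition a uniform 10% amplitude error corresponds to an error of about 1%.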

**Fig. 2: Comparison of the numerical and analytical solutions.**

The simulator generates realistic EMG data

To evaluate the performance of the simulator at multiple scales, we started by simulating EMG signals associated to a single fibre activation inside the brachioradialis muscle. The signal recorded by an array of 16 rectangular electrodes (15 differential channels) when a single fibre was active is shown in Fig. 3a. The volume conductor model is based on an anatomically accurate forearm geometry, which includes all the muscles, bones, fat and skin tissues.

**Fig. 3: Simulation examples at multiple activation scales.**

Different distinctive features are present in the simulated signal that are also observed in experimental EMG signals²⁴. In particular, electrodes of channel 4 are located on different sides of the neuromuscular junction (NMJ) and thus the respective signals cancel each other out. Channels 7–11 present propagating EMG components resulting from the fibre action potentials (AP) propagating from the NMJ to the tendons. Channels 2–6, as well as channels 12–15, contain non-propagating sEMG components, which are due to the AP generation at the NMJ and its extinction at the tendon (end-of-fibre effect), respectively.

A further example is a simulation of an excitation of a single muscle, illustrated in Fig. 3b. A simple excitation drive for the Brachioradialis muscle is simulated as gradually increasing from 0% to 100% of the maximum voluntary contraction and smoothly decreasing back to 0%. As described in the section ‘Computational performance’, 50,000 muscle fibres were realistically distributed into 200 motor units over the muscle volume and recruited according to the size principle²⁵. The signal was simulated for 8 circular bipolar electrodes located around the forearm. In this example, the volume conductor effect becomes particularly visible with electrodes nearer to the active muscle having higher signal amplitudes. Notice that the electrodes record different signal waveforms as the muscle units are located at varying distances from the electrodes, weighting their contribution to the observed EMG signals. We also observe an increase of the signal amplitude with muscle excitation, an important feature of experimental EMG signals, which is a consequence of progressive motor unit recruitment and of an increase in the discharge rates of the active motor units.

Finally, we simulated sEMG signals from multiple muscle excitations, corresponding to the active wrist flexion and extension and passive wrist abduction against gravity. We used a simple muscle excitation model for three groups of muscles (flexors, extensors and abductors). More details about the experimental design are presented in the section ‘Details of realistic simulation examples’. Figure 3c and Fig. 3d clearly show the qualitative similarities in signal characteristics between experimental and simulated data. Our model was able to reproduce the different signal patterns during both flexion and extension. Besides the different activation across the electrodes during flexion and extension, the effect of wrist abduction is also visible in both datasets: channels 2, 3 and 7, 8 present a small signal activity during the whole duration of the simulation, and not only during flexion/extension peaks. Similar activity can also be seen in experimental data, with channels 2 and 7 being the most active.

Figure 4a shows the comparison between the experimental and simulated signals for hand flexion and extension using root mean square (RMS) values per electrode. This is a pattern recognition feature which is commonly used in EMG. Most of the simulation parameters were fixed at default values; only the amplitudes of muscle activations were roughly chosen to match the real measurements (without specific optimization). The simulated data reproduce the RMS properties of the experimental signals well. Yet the match is not perfect, especially for the wrist flexion. This is not surprising, because the model was not personalized and the simulation parameters were not optimized.
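
The RMS feature is simple to compute per electrode. The sketch below assumes the recording is stored as one list of samples per channel; the name `rms_per_channel` and the toy data are illustrative only.

```python
import math

def rms_per_channel(emg):
    """Root mean square of each channel; emg is a list of per-channel sample lists."""
    return [math.sqrt(sum(x * x for x in channel) / len(channel)) for channel in emg]

toy_emg = [
    [3.0, -3.0, 3.0, -3.0],  # active channel
    [0.0, 0.0, 0.0, 0.0],    # silent channel
]
print(rms_per_channel(toy_emg))
```

Comparing the resulting vector across electrodes for experimental and simulated recordings gives the kind of spatial pattern shown in Fig. 4a.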

In addition to the analysis in the time domain, simulated data were compared against the experimental data in the frequency domain. Figure 4b illustrates an example of the measured and simulated single-channel sEMG. It has to be noted that the spectral characteristics of a signal strongly depend on multiple simulation parameters. In this example, we ran several hundred simulations by varying the simulation parameters in a realistic range and selected the set of parameters leading to the minimal spectral difference. This approach, which is a simple version of inverse modelling, was possible because of the high computational speed of the simulations.

The simulator is fast

The computational performance of an EMG signal simulation depends on the model properties and the particular experimental setup. Consequently, there is no benchmark to evaluate and compare the performance of different simulation methods. The computational time magnitude of the state-of-the-art methods is, in the best cases, in the order of hours for a single simulation (with a fixed set of model parameter values, ≈50,000 fibres, 5 electrodes)^20,21.

By exploiting the mathematical properties of the forward equations and the model architecture described in the previous sections, we were able to achieve a computational performance of the order of minutes per simulation. Moreover, because changing most of the simulation parameters in our model does not require recomputing the whole model, the computational time of new simulations is reduced to the order of seconds if the volume conductor remains constant.

As a result, it becomes practically possible to simulate arbitrarily large datasets of highly realistic EMG signals with high variability in the simulation parameters. Details on the computational time in several conditions are provided in Methods (section ‘Computational performance’).

The proposed model is also highly scalable for multiprocessing, and the current computational time can be further reduced by several orders of magnitude by implementing parallel computation on CPU and GPU.

Realistic and fast EMG simulations open unique perspectives for deep learning

Here, we show a potential use of high volumes of simulated surface EMG data for deep learning, utilising the proposed model to generate data which can be used to pretrain neural networks. This methodology is used in other deep learning domains, such as the use of the ImageNet image database to pretrain object classifiers prior to adaptation to specific applications²⁶. The selected application was that of a time series classifier, which took as an input unwhitened high-density surface EMG (HD-sEMG) signals recorded from the dominant wrist of nine participants and detected discrete MU activation events (Fig. 5a). Participants performed an isometric contraction at a constant force of 15% of the maximal voluntary contraction, as measured by a force cell and fed back to the user on a computer monitor. Only four seconds of data was used to optimise each network. The objective was to examine whether a neural network that was pre-trained using simulated sEMG and then fine-tuned on individual experimental data outperformed a neural network that used only a random initialisation, potentially compensating for low volumes of training data.

**Fig. 5: Results on the high-density EMG decomposition.**

Specifically, the myoelectric digital twin simulation (Fig. 5b) was used to generate a large library of MUAP templates, which were then used to pretrain a gated recurrent unit (GRU) neural network architecture. GRU-based neural networks were selected because they have been shown to be effective at sEMG decomposition in past studies²⁷, but pre-training is applicable to any neural network design, such as convolutional neural networks. The architecture of the GRU network was selected through grid search hyperparameter optimisation, and consisted of a single GRU cell with a hidden dimension of 1024. This was used to convert 130-channel HD-sEMG to a 1024-channel featurised signal, of which 20 samples were taken into a densely-connected linear layer for final prediction of MU activity for each time step.

The network was pre-trained using 320 simulation-generated MUAP templates. Training a classification network with an output dimension of 320 is very unstable when the target data is highly sparse and class-imbalanced, so instead a multitask framework was used. The task of training a classifier on 320 MUAPs was subdivided into 64 individual tasks, each predicting 5 MUAPs. The parameter values of the GRU featurisation layer were used and updated simultaneously for all 64 tasks, but each task used its own final linear layer with an output dimension of 5. In this way, the GRU layer could be pre-trained to find an efficient featurisation, which could then be used as a starting point for fine-tuning on the final prediction task with experimental wrist sEMG signals (see ‘Details of deep learning experiment’ for details).
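
The bookkeeping of the multitask split can be sketched without a deep learning framework. The code below is only an illustration: grouping MUAPs into tasks by contiguous index is an assumption, the feature vector is a toy stand-in for the GRU output (the paper uses a hidden dimension of 1024 and 20 samples per prediction), and `split_targets` and `multitask_forward` are hypothetical names.

```python
N_MUAPS = 320
N_TASKS = 64
PER_TASK = N_MUAPS // N_TASKS  # 5 MUAPs per task

def split_targets(target):
    """Split one 320-element activity vector into 64 five-element task targets."""
    return [target[start:start + PER_TASK] for start in range(0, N_MUAPS, PER_TASK)]

def multitask_forward(features, heads):
    """Apply every task-specific linear head to the same shared features."""
    return [
        [sum(w * f for w, f in zip(row, features)) for row in head]
        for head in heads
    ]

feature_dim = 8  # toy size standing in for the shared featurisation
features = [0.1 * k for k in range(feature_dim)]

# One weight matrix (5 x feature_dim) per task; arbitrary toy values.
heads = [
    [[1.0 if (t + m + k) % 3 == 0 else 0.0 for k in range(feature_dim)]
     for m in range(PER_TASK)]
    for t in range(N_TASKS)
]

logits = multitask_forward(features, heads)

# A time step where only MUAP 7 is active lands in task 1, slot 2.
target = [1 if m == 7 else 0 for m in range(N_MUAPS)]
per_task_targets = split_targets(target)
task, slot = divmod(7, PER_TASK)
print(len(logits), len(logits[0]), task, slot)
```

Each head sees only a 5-way target, while gradients from all 64 heads would flow into the shared featurisation during pre-training.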

The simulation pre-trained network outperformed random initialisation in decomposition accuracy when compared to the original decomposition as measured by the rate of agreement (RoA) metric calculated on a one second segment of data²⁸ (Fig. 5c). The RoA metric efficiently incorporates false positive and false negative into a single metric of accuracy relevant to the decomposition of discrete events in time series data. The median (IQR) RoA of the pre-trained network was 93.8% (84.8 to 100.0), compared to 82.4% (71.6 to 100.0) in the random initialisation network, with a median difference of 8.1 (Hodges-Lehmann estimator, 95% CI 3.4 to 13.3 using method of Walsh averages), significant according to the two-tailed Wilcoxon signed-rank test (p = 0.00006, Z = 4.0). Of the 39 decoded motor units, 22 had improved RoAs with pre-training and one had a worse RoA, with the remaining 16 showing no change, generally because the initial RoA was already 100% without pre-training (these are large-amplitude MUAPs that are simple to detect). The pre-trained network had a much lower variance in the accuracy of predictions on the test sets than random initialisation, quickly optimising to a model effective for generalisation to new signals.
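
The rate of agreement is commonly defined as the number of discharges found in both spike trains divided by the total number of distinct discharges, i.e. common / (common + missed + spurious). The sketch below implements that definition with a greedy match over sorted discharge times; the matching tolerance of one sample and the function name are assumptions, since this section does not specify them.

```python
def rate_of_agreement(reference, predicted, tolerance=1):
    """RoA in percent between two lists of discharge times (in samples)."""
    ref, pred = sorted(reference), sorted(predicted)
    i = j = common = 0
    while i < len(ref) and j < len(pred):
        if abs(ref[i] - pred[j]) <= tolerance:
            common += 1  # discharge present in both trains
            i += 1
            j += 1
        elif ref[i] < pred[j]:
            i += 1       # reference discharge with no match (false negative)
        else:
            j += 1       # predicted discharge with no match (false positive)
    missed = len(ref) - common
    spurious = len(pred) - common
    total = common + missed + spurious
    return 100.0 * common / total if total else 100.0

reference = [10, 50, 90, 130]
predicted = [11, 50, 95, 130, 170]
print(rate_of_agreement(reference, predicted))
```

In the toy example three discharges agree, one is missed and two are spurious, so both kinds of error lower the same score.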

When units were subdivided into those from female (18 units) and male (21 units) subjects, both subsamples also showed a significant difference. For female subjects the median difference between trained and random initialisation was 9.1 (Hodges-Lehmann estimator, 95% CI 0.0 to 20.3 using method of Walsh averages) and significant (two-tailed Wilcoxon signed-rank, p = 0.00694, Z = 2.7). For male subjects the median difference between trained and random initialisation was 5.7 (Hodges-Lehmann estimator, 95% CI 1.6 to 9.7 using method of Walsh averages) and significant (two-tailed Wilcoxon signed-rank, p = 0.00064, Z = 3.4).
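
The one-sample Hodges-Lehmann estimator used for the paired differences is the median of all Walsh averages, i.e. the pairwise means (d_i + d_j)/2 with i ≤ j. A minimal sketch follows; the input values are toy numbers, not data from the study.

```python
from statistics import median

def hodges_lehmann(differences):
    """Median of the Walsh averages (d_i + d_j) / 2 over all pairs with i <= j."""
    n = len(differences)
    walsh = [
        (differences[i] + differences[j]) / 2
        for i in range(n)
        for j in range(i, n)
    ]
    return median(walsh)

toy_differences = [0, 2, 10]  # e.g. per-unit RoA gains in percentage points
print(hodges_lehmann(toy_differences))
```

Because the estimator is computed on paired differences, it need not equal the difference between the two group medians, which is why the reported 8.1 differs from 93.8 minus 82.4.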

Discussion

We have proposed an efficient computational approach to highly realistic surface EMG modelling. The method generates EMG signals for anatomically accurate volume conductors and realistic numbers of muscle fibres, within a computational time compatible with real-time signal generation. The proposed model is the only available EMG simulator with a realistic description of the volume conductor that is optimized for such computational efficiency. The main value of the model is that it opens perspectives for using simulated sEMG for data augmentation in the deep learning framework, something that was never done before using state-of-the-art simulation methods.

The computational efficiency in the volume conductor solution has been recognized as an important component of EMG modelling, and some attempts to decrease the computational time in EMG simulations have been described. For example, the approaches developed by Dimitrov and Dimitrova²⁹ and Farina et al.^23,30 substantially decreased the computational time in analytical EMG modelling for simple volume conductor geometries. These models provide simulations which reflect the broad characteristics of EMG signals, but cannot be anatomically accurate because of the restrictions on the volume conductor and fibre source geometry. Realistic models using numerical solutions have also been recently proposed. The most complete and efficient previous model was proposed by Pereira Botelho et al.²⁰. These authors used an anatomically accurate model to simulate EMG signals generated during index finger flexion and abduction. They gained computational speed by using the principle of reciprocity. In fact, one part of our calculations also includes the adjoint method, which is an algebraic representation of this principle. By using reciprocity, Pereira Botelho et al.²⁰ reported a computational time of 1 h for simulating the activation of nearly 15,500 fibres for 5 electrodes. This time, however, remains impractical for simulating arbitrarily large datasets for a variety of parameter values. The model we proposed in this paper significantly surpasses the computational efficiency reported in ref. ²⁰. We achieved it by efficiently exploiting mathematical properties of the forward equations, in particular by introducing the concept of basis points and by separating model parameters and variables into independent computational blocks. Our approach not only reduces the computational time for a full simulation, but also allows us to scale the solution, so that new solutions for the same volume conductor can be obtained without recomputing the volume conductor transformation. In this way, the generation of EMG signals within the same volume conductor, but varying all other simulation parameters, can be performed in even shorter time. Complex EMG signals from tens of thousands of muscle fibres located in multiple muscles can be generated (and regenerated with different parameter values) in a computational time of the order of seconds.

Some limitations remain in the current state of the presented model. It does not include some sources of variability that are present in experimental EMG signals and strongly impact their processing and analysis. For example, the model does not include advanced noise and artefact descriptions, biomechanical modelling of the musculoskeletal system, or non-stationary volume conductor properties and fibre geometry.

Also, the automated modelling of muscle forces (or % MVC) for specific movements is out of the scope of this work. We recognize, however, the value of such a tool, and we see two potential ways to approach this task. First, by integrating our myoelectrical model with biomechanical modelling of the musculoskeletal system. This would allow users to define a movement by the dynamics of degrees of freedom (e.g. of a hand). Then, inverse dynamics could be used to estimate the corresponding muscle forces and % MVC, that can be used as an input to our current model. Another possibility is to estimate individual muscle activity from real EMG data. This would require an inverse problem solving algorithm. As a matter of fact, our Myoelectric Digital Twin is, practically, the only way to obtain labelled data for training such inverse algorithms, because the exact muscle forces of real subjects are not, in general, accessible.

While these aspects are beyond the scope of this paper, they are relevant features to include in future developments.

The advances presented in this work, together with the proposed future developments, naturally lead to the concept of a Myoelectric Digital Twin—a realistic, personalized, computationally-efficient model which generates EMG data in a quality and quantity sufficient not only to augment but to replace real data, with utility for AI training in the various real-world applications. Here, we have illustrated the potential of this approach by augmenting training data for deep neural networks, with the aim of identifying the discharge times of spinal motor neurons from surface EMG signal. By using the simulator to augment training (through a pre-training procedure), we showed an increase in the performance of the decomposition network when applied to experimental data, demonstrating a highly relevant use of the proposed approach for decreasing the need for experimental training data in human-machine interfacing applications.

Methods

Forward problem

The fibre extracellular potentials that are measured by EMG electrodes are generated by transmembrane currents. The properties of bioelectric currents and potential fields can be determined from solutions of Maxwell’s equations, taking into account the electrical properties of biological tissues. Because of the relatively low frequencies of signal sources of biological origin, the quasi-static assumption can be applied^31,32, so that the electric potential and the primary current sources are related by the following Poisson equation^31,33,34 with Neumann boundary conditions:

$$\left\{\begin{array}{ll}\nabla \cdot (\sigma \nabla \phi )=-I & \text{in}\ \Omega \\ \sigma \dfrac{\partial \phi }{\partial \mathbf{n}}=\sigma \nabla \phi \cdot \mathbf{n}=0 & \text{on}\ \partial \Omega \end{array}\right.$$

(1)

where $\Omega \subset {{\mathbb{R}}}^{3}$ is a volume conductor domain of interest, ∂Ω its boundary with outward pointing normal unit vector n, ϕ(r)[V] is the electric potential, I(r)[A/m³] is the current source density (CSD), σ(r)[S/m] is a conductivity tensor. The second line of the equation (boundary condition) reflects the assumption that no current flows out of the domain of interest. In the context of EMG modelling, this implies that there is no current flow between the skin and air. The current source density I(r) is interpreted as the volume density of current entering or leaving the extracellular medium at position r ∈ Ω. A negative CSD corresponds to current leaving the extracellular medium (due to the fibre transmembrane currents) and is thus conventionally called a sink. Likewise, current entering the extracellular medium is called a source^35,36.

Equation (1) cannot be solved analytically for general volume conductor geometries, but several numerical methods can be used to approximate its solution. Here, we use the finite element method (FEM)³⁷, which discretizes the volume conductor Ω as a tetrahedral mesh Ω_t. Given this mesh, we use the Galerkin method to project the potential ϕ onto the space of piecewise affine functions defined on Ω_t. Figure 6a and Fig. 6b illustrate an example of a realistic forearm model and corresponding discretized volume mesh, respectively.

This discretization process converts the continuous operator problem of Eq. (1) to a finite system of linear equations:

$$A{{{{{{{\bf{v}}}}}}}}={{{{{{{\bf{b}}}}}}}}$$

(2)

where A is a symmetric and sparse n_v × n_v matrix, n_v is the number of mesh vertices, ${{{{{{{\bf{v}}}}}}}}\in {{\mathbb{R}}}^{{n}_{v}}$ is a vector of potential values at mesh nodes, and ${{{{{{{\bf{b}}}}}}}}\in {{\mathbb{R}}}^{{n}_{v}}$ is a vector containing source information. Because the electric potential is defined up to a constant, the matrix A always has a one-dimensional null space. To obtain a unique solution to the system of Eq. (2), we constrain potentials v to have a zero sum.
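As an illustration of the zero-sum constraint, the following minimal sketch solves a singular system whose null space is the constant vector. It is not the implementation used in this work: the text does not state how the constraint is imposed, so the Lagrange-multiplier formulation, the function name `solve_zero_sum` and the toy 5-node chain matrix are all assumptions made for the example.

```python
import numpy as np

def solve_zero_sum(A, b):
    """Solve A v = b for a singular A whose null space is the constant vector,
    picking the solution with sum(v) == 0 (assumed Lagrange-multiplier form)."""
    n = A.shape[0]
    ones = np.ones((n, 1))
    # Augmented (n+1) x (n+1) system: [[A, 1], [1^T, 0]] [v; mu] = [b; 0]
    M = np.block([[A, ones], [ones.T, np.zeros((1, 1))]])
    rhs = np.concatenate([b, [0.0]])
    return np.linalg.solve(M, rhs)[:n]

# Toy stand-in for the FEM matrix: graph Laplacian of a 5-node chain
n = 5
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
A[0, 0] = A[-1, -1] = 1.0

# Balanced unit source and sink at the two ends of the chain
b = np.zeros(n)
b[0], b[-1] = 1.0, -1.0

v = solve_zero_sum(A, b)
```

For realistic meshes a dense solve like this one is not practical; the sketch only shows that the constraint removes the ambiguity in the potential.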

In the context of EMG, we are not interested in finding electric potentials everywhere in the conductor, but only at the electrode locations. Let S be a selection matrix with a shape n_e × n_v which only selects the values at EMG electrode locations (n_e is the number of electrodes). Each row of S can be designed to select a single point location or to integrate over an area (e.g. the electrode-skin interface) depending on the location and number of its non-zero elements. Also, let b(r) correspond to a point source at location r. The resulting EMG signal is thus given by:

$${{{{{{{{\bf{v}}}}}}}}}_{{{{{{{{\bf{point}}}}}}}}}({{{{{{{\bf{r}}}}}}}})=S{A}^{-1}{{{{{{{\bf{b}}}}}}}}({{{{{{{\bf{r}}}}}}}}).$$

(3)

Let us analyse in more detail the structure of A and b from Eq. (2). Let {wⁱ(r), i = 1…n_v} be a set of n_v P¹ (piecewise linear) basis functions over the tetrahedral mesh Ω_t. Note that wⁱ is 1 at the i-th vertex of the mesh, is 0 at all other vertices and is linear on all tetrahedra adjacent to the i-th vertex. In this case, A and b have the following structure:

$$\begin{aligned}A_{ij} &=\int_{\Omega_t}\sigma (\mathbf{r})\,\nabla w^{i}(\mathbf{r})\cdot \nabla w^{j}(\mathbf{r})\,d\mathbf{r},\\ b_{i} &=\int_{\Omega_t}I(\mathbf{r})\,w^{i}(\mathbf{r})\,d\mathbf{r}.\end{aligned}$$

First, let us notice that A is symmetric and, in general, a very large matrix which can be stored only because it is sparse. Indeed, the functions wⁱ have a compact support and their pairwise scalar product is non-zero only for neighbour functions. Since the pseudo-inverse (or the inverse) of a sparse matrix is usually not a sparse matrix, it is impractical to compute it because of the amount of memory needed to store it. Thus, iterative methods are typically used to solve the system of Eq. (2) for every given b.

Consider the case of $I({{{{{{{\bf{r}}}}}}}})=\delta ({{{{{{{\bf{r}}}}}}}}-\bar{{{{{{{{\bf{r}}}}}}}}})$ which corresponds to a unit point current source at a location $\bar{{{{{{{{\bf{r}}}}}}}}}$. Without loss of generality, we assume that this source is inside a tetrahedron formed by the vertices i₁, …, i₄ of the mesh. In this case, we obtain:

$$b_{i}=\begin{cases}\lambda_{j}, & \text{if } i=i_{j} \text{ for some } j\in \{1,\ldots,4\},\\ 0, & \text{otherwise,}\end{cases}$$

where {λ_j, j = 1, …, 4} are the barycentric coordinates of the point $\bar{{{{{{{{\bf{r}}}}}}}}}$ inside the tetrahedron {i₁, …, i₄}. Applying this expression to Eq. (3), we obtain:

$${{{{{{{{\bf{v}}}}}}}}}_{{{{{{{{\bf{point}}}}}}}}}(\bar{{{{{{{{\bf{r}}}}}}}}})=S{A}^{-1}{{{{{{{\bf{b}}}}}}}}(\bar{{{{{{{{\bf{r}}}}}}}}})=S{A}^{-1}\overline{B}{{{{{{{\boldsymbol{\lambda }}}}}}}}.$$

where $\bar{B}$ is an n_v × 4 matrix with ${\overline{B}}_{{i}_{j},j}=1$ for j = 1, …, 4, and 0 otherwise. This implies that the solution of the system of Eq. (2) for any unit point source can be computed as a barycentric (weighted) sum of the solutions for sources placed at the vertices of the corresponding tetrahedron. Therefore, it is sufficient to compute solutions of Eq. (2) for basis sources located on mesh vertices to be able to evaluate a solution for any point inside this mesh efficiently. Let n_s be the number of such basis sources. For the most general case, when the source can be located anywhere inside the mesh and n_s = n_v, let B be an n_v × n_s identity matrix. The objective is to compute basis solutions:

$${V}_{{{{{{{{\rm{basis}}}}}}}}}=S{A}^{-1}B$$

(4)

where V_basis is an n_e × n_s matrix, whose columns contain the solutions of Eq. (2) for a unit point source located at the corresponding mesh vertex. Hence, the potential for any source location r is given by:

$${{{{{{{{\bf{v}}}}}}}}}_{{{{{{{{\bf{point}}}}}}}}}({{{{{{{\bf{r}}}}}}}})={V}_{{{{{{{{\rm{basis}}}}}}}}}{{{{{{{\boldsymbol{\lambda }}}}}}}}({{{{{{{\bf{r}}}}}}}})$$

(5)

where ${{{{{{{\boldsymbol{\lambda }}}}}}}}({{{{{{{\bf{r}}}}}}}})\in {{\mathbb{R}}}^{{n}_{s}}$ is a vector, whose four non-zero elements contain the barycentric coordinates of a point r inside a corresponding tetrahedron. Note, that one may restrict potential sources to be located inside specific subdomains of the whole mesh (which is the case for EMG). In this case, n_s corresponds to the number of vertices of these subdomains, and the matrix B is a submatrix of the identity matrix.
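Equation (5) can be illustrated with a short sketch that computes the barycentric coordinates of a point inside one tetrahedron and combines the four corresponding columns of V_basis. The function names, the unit tetrahedron and the random V_basis are hypothetical choices for the example; locating the tetrahedron that contains a given point is assumed to be done elsewhere.

```python
import numpy as np

def barycentric_coordinates(tet, p):
    """tet: (4, 3) array of vertex coordinates, p: (3,) point.
    Returns the 4 barycentric weights of p (they sum to 1)."""
    T = (tet[:3] - tet[3]).T               # columns are v_k - v_4, k = 1..3
    lam = np.linalg.solve(T, p - tet[3])   # first three weights
    return np.append(lam, 1.0 - lam.sum())

def point_potential(V_basis, tet_vertex_ids, lam):
    """Eq. (5) restricted to the four non-zero entries of lambda."""
    return V_basis[:, tet_vertex_ids] @ lam

# Hypothetical data: unit tetrahedron, 2 electrodes, 6 basis vertices
tet = np.array([[0.0, 0.0, 0.0],
                [1.0, 0.0, 0.0],
                [0.0, 1.0, 0.0],
                [0.0, 0.0, 1.0]])
tet_vertex_ids = [1, 3, 4, 5]              # mesh indices of the 4 vertices
V_basis = np.random.default_rng(0).standard_normal((2, 6))

lam = barycentric_coordinates(tet, np.array([0.25, 0.25, 0.25]))
v_point = point_potential(V_basis, tet_vertex_ids, lam)
```

At the centroid all four weights are equal, so the result is the mean of the four basis columns; at a vertex the result reduces to the basis solution of that vertex.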

The most straightforward way to compute V_basis from Eq. (4) is to solve a problem of the form Ax = b_i for each column of the matrix B. It would thus require solving n_s systems of linear equations. For realistic conductor geometries, which have a large number of vertices, solving a single system may take up to a few minutes and solving n_s systems quickly becomes impractical. Therefore, we propose the use of the adjoint method³⁸, which requires solving n_e systems only. In the context of EMG, the number of electrodes is usually significantly smaller than the number of vertices in the muscle subdomain meshes, i.e. n_e << n_s. Let us define K = SA⁻¹, which is a matrix of size n_e × n_v. Because A is symmetric, and the inverse of a symmetric matrix is also symmetric, we can write K^T = A⁻¹S^T. Then, K can be found by solving the system:

$$A{K}^{T}={S}^{T}.$$

(6)

The matrix S^T has n_e columns and, thus, only n_e linear systems need to be solved to find K. The basis solutions can then be found as:

$${V}_{{{{{{{{\rm{basis}}}}}}}}}=KB.$$

(7)
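The equivalence between the direct computation of Eq. (4) and the adjoint route of Eqs. (6) and (7) can be checked numerically on a small example. The sketch below is illustrative only: a random dense symmetric positive-definite matrix stands in for the sparse FEM matrix A, a dense solver replaces the iterative one, and the electrode and source node indices are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
n_v, n_e, n_s = 60, 3, 25

# Toy symmetric positive-definite stand-in for the FEM matrix A
M = rng.standard_normal((n_v, n_v))
A = M @ M.T + n_v * np.eye(n_v)

# Selection matrix S: each row picks one (hypothetical) electrode node
electrode_nodes = [2, 17, 40]
S = np.zeros((n_e, n_v))
S[np.arange(n_e), electrode_nodes] = 1.0

# B: columns of the identity for the vertices allowed to carry sources
source_nodes = rng.choice(n_v, size=n_s, replace=False)
B = np.eye(n_v)[:, source_nodes]

# Direct route, Eq. (4): one linear solve per basis source (n_s solves)
V_direct = S @ np.linalg.solve(A, B)

# Adjoint route, Eqs. (6)-(7): one linear solve per electrode (n_e solves)
K = np.linalg.solve(A, S.T).T
V_adjoint = K @ B
```

Because B is a submatrix of the identity, the product KB amounts to selecting the columns of K that correspond to the source vertices.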

EMG signal of a single fibre activation

The action potential generated by the flow of ionic currents across the muscle fibre membrane is the source of excitation. For a given intracellular action potential (IAP) model V_m(z), the transmembrane current source per unit length is proportional to the second derivative of V_m(z), where z is a fibre arc length measured in mm. A general description of the current source density travelling at velocity v along the fibre with the origin at the neuromuscular junction at location z₀ is^28,30,39:

$$\begin{aligned}I(z,t)=\sigma_{\rm{in}}\pi r^{2}\cdot \frac{\partial }{\partial z}\Big[&\psi (z-z_{0}-vt)\,w_{L_{1}}\left(z-z_{0}-\frac{L_{1}}{2}\right)\\ &-\psi (-z+z_{0}-vt)\,w_{L_{2}}\left(z-z_{0}+\frac{L_{2}}{2}\right)\Big]\end{aligned}$$

(8)

where z ∈ [0, L] is a location along the fibre of length L, $\psi (z)=\frac{d}{dz}{V}_{m}(-z)$, L₁ and L₂ are the semi-lengths of the fibre from the end-plate to the right and to the left tendon, respectively, σ_in is the intracellular conductivity, and r is the fibre radius. We have chosen w_L to be a Tukey window, as proposed in ref. ²¹. The intracellular action potentials (IAP) ${V}_{m}\left[\frac{mV}{mm}\right]$ can be mathematically described in the space domain as proposed in ref. ⁴⁰:

$${V}_{m}(z)=96{z}^{3}{e}^{-z}-90.$$
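For reference, the IAP expression above and the function ψ defined from it can be evaluated as in the following sketch. The function names are ours, and the expression is assumed to apply for z ≥ 0 (z in mm), which the text does not state explicitly.

```python
import math

def iap(z):
    """V_m(z) = 96 z^3 exp(-z) - 90, with z in mm (assumed valid for z >= 0)."""
    return 96.0 * z**3 * math.exp(-z) - 90.0

def iap_derivative(z):
    """Analytical derivative dV_m/dz = 96 z^2 (3 - z) exp(-z)."""
    return 96.0 * z**2 * (3.0 - z) * math.exp(-z)

def psi(z):
    """psi(z) = d/dz [V_m(-z)] = -V_m'(-z), by the chain rule."""
    return -iap_derivative(-z)

resting_value = iap(0.0)   # value of the expression at z = 0
peak_value = iap(3.0)      # the derivative vanishes at z = 3 mm
```

The derivative changes sign at z = 3 mm, so the expression rises from −90 at z = 0 to a peak of about 39 at that position before decaying back.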

Let r(z) be a fibre geometry parametrized with respect to the fibre arc length z. Combining the transfer function of a point source in Eq. (3) with the fibre’s current density in Eq. (8), we obtain the equation for the EMG signal resulting from a single fibre activation:

$${{{{{{{{\bf{v}}}}}}}}}_{{{{{{{{\bf{fibre}}}}}}}}}(t)=\int{{{{{{{{\bf{v}}}}}}}}}_{{{{{{{{\bf{point}}}}}}}}}({{{{{{{\bf{r}}}}}}}}(z))\,I\,(z,t)dz.$$

(9)

This integral can be efficiently approximated by discretizing the fibre geometry into sufficiently dense spatial samples ${\{{{{{{{{\bf{r}}}}}}}}({z}_{i})\}}_{i}$ and assuming that v_point(r(z)) is piecewise constant around these points. If we also rewrite Eq. (8) in a shorter form as $I(z,\,t)={\sigma }_{{{{{{{{\rm{in}}}}}}}}}\pi {r}^{2}\cdot \frac{\partial }{\partial z}F(z,\,t)$, Eq. (9) becomes:

$$\begin{aligned}\mathbf{v}_{\mathbf{fibre}}(t) &\approx \sum_{i}\mathbf{v}_{\mathbf{point}}(\mathbf{r}(z_{i}))\int_{z_{i}-\Delta_{i}}^{z_{i}+\Delta_{i}}I(z,t)\,dz\\ &=\sum_{i}\mathbf{v}_{\mathbf{point}}(\mathbf{r}(z_{i}))\int_{z_{i}-\Delta_{i}}^{z_{i}+\Delta_{i}}\sigma_{\rm{in}}\pi r^{2}\cdot \frac{\partial }{\partial z}F(z,t)\,dz\\ &=\sigma_{\rm{in}}\pi r^{2}\sum_{i}\mathbf{v}_{\mathbf{point}}(\mathbf{r}(z_{i}))\left(F(z_{i}+\Delta_{i},t)-F(z_{i}-\Delta_{i},t)\right).\end{aligned}$$

(10)
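The last line of Eq. (10) is a single matrix product between the pre-computed transfer functions and the differences of F at the interval edges. The sketch below shows this structure under stated assumptions: `toy_F` is a hypothetical smooth travelling front used in place of the bracketed term of Eq. (8), and the conductivity and radius are set to unit placeholder values.

```python
import numpy as np

def fibre_emg(v_point, F, z, half_widths, t, sigma_in, radius):
    """Last line of Eq. (10).
    v_point: (n_e, n_z) transfer functions at the fibre samples z_i,
    F: callable F(z, t) that broadcasts over arrays,
    returns an (n_e, n_t) array of single-fibre potentials."""
    z = np.asarray(z, dtype=float)[:, None]
    d = np.asarray(half_widths, dtype=float)[:, None]
    t = np.asarray(t, dtype=float)[None, :]
    dF = F(z + d, t) - F(z - d, t)          # shape (n_z, n_t)
    return sigma_in * np.pi * radius**2 * (v_point @ dF)

def toy_F(z, t, velocity=4.0):
    """Hypothetical travelling front, NOT the bracketed term of Eq. (8)."""
    return np.tanh(z - velocity * t)

z = np.arange(0.5, 10.0, 1.0)       # 10 sample centres, contiguous intervals
half = np.full(z.size, 0.5)         # half-widths Delta_i
t = np.linspace(0.0, 2.0, 5)
v_point = np.ones((2, z.size))      # constant transfer function, 2 electrodes

emg = fibre_emg(v_point, toy_F, z, half, t, sigma_in=1.0, radius=1.0)
```

With a constant transfer function and contiguous intervals the sum telescopes to the difference of F at the two fibre ends, which gives a simple sanity check of the discretization.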

Note that v_point(r(z_i)) can be efficiently computed from Eq. (5). Moreover, once v_point(r(z_i)) are computed for all given fibres, we can change the parameters of the current source density (action potential waveform shape, propagation velocity, location of neuromuscular junction) and compute the corresponding EMG signal with Eq. (10) at the cost of a matrix multiplication only.

Geometrical and physiological modelling of motor units

The motor unit action potential (MUAP) is the summation of the single fibre action potentials (APs) of the muscle fibres in the MU. Different types of MUs can be modelled^41,42. Our approach consists in generating fibre and motor unit distributions inside a unit circle, and then projecting them into an arbitrary 3D muscle geometry (Fig. 6d), using methods similar to those described in ref. ⁴³. This provides a high level of control for the fibre and MU distribution parameters independently of a particular muscle geometry. A common way to simulate fibres and MUs is to start by defining MU positions, sizes and territories, and then simulate fibres inside these MUs according to their parameters^44,45. We, however, propose another approach. First, we simulate uniformly distributed fibres inside a unit circle. Then, MU centres and their circular territories are generated and, finally, we associate each fibre to an MU. A fibre is associated to one of the MUs that contain it inside their territory, with a probability proportional to the MU density (Fig. 6c). This approach has two main advantages. First, it guarantees (by construction) a uniform fibre distribution inside a circular muscle cross-section. Second, once fibres are generated and projected into a muscle geometry, different MU distributions can be generated very quickly, without regenerating fibres and recomputing transfer functions v_point(r(z_i)) for their nodes.
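The fibre-first procedure can be sketched as follows. Several details are assumptions made for the example and are not specified in the text: the MU density is taken as the target number of fibres divided by the territory area, a fibre that falls outside every territory is given to the nearest MU centre, and all numerical values are arbitrary.

```python
import math
import random

def sample_unit_disc(n, rng):
    """Uniform points in the unit disc (radius = sqrt(u) keeps area density flat)."""
    points = []
    for _ in range(n):
        r = math.sqrt(rng.random())
        theta = 2.0 * math.pi * rng.random()
        points.append((r * math.cos(theta), r * math.sin(theta)))
    return points

def assign_fibres(fibres, centres, radii, sizes, rng):
    """Pick, for each fibre, one of the MUs whose territory contains it,
    with probability proportional to an assumed density size / area."""
    densities = [s / (math.pi * r * r) for s, r in zip(sizes, radii)]
    labels = []
    for x, y in fibres:
        dists = [math.hypot(x - cx, y - cy) for cx, cy in centres]
        candidates = [k for k, d in enumerate(dists) if d <= radii[k]]
        if not candidates:
            # Assumed fallback: nearest MU centre when no territory covers the fibre
            candidates = [min(range(len(centres)), key=dists.__getitem__)]
        weights = [densities[k] for k in candidates]
        labels.append(rng.choices(candidates, weights=weights, k=1)[0])
    return labels

rng = random.Random(0)
fibres = sample_unit_disc(2000, rng)
centres = sample_unit_disc(10, rng)
radii = [0.30 + 0.04 * k for k in range(10)]   # hypothetical territory radii
sizes = [20 * (k + 1) for k in range(10)]      # hypothetical target MU sizes
labels = assign_fibres(fibres, centres, radii, sizes, rng)
```

Because the fibres are generated once, calling `assign_fibres` again with different MU parameters yields a new MU distribution without touching the fibre positions.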

MU recruitment model

During muscle contraction, the MUs are recruited according to the size principle²⁵. This can be simulated by associating a threshold of excitation to each MU, as described for example by Fuglevand et al.⁴⁶. Linear or non-linear rate coding models can be used^46,47,48.

The excitation rate as a function of time for each muscle is converted into the firing rates of the active MUs. Inter-discharge intervals are then generated with variability of the discharges around the mean firing interval⁴⁹.
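A simplified recruitment and rate-coding scheme in the spirit of the cited models can be sketched as below. It is not the exact model used here: the exponential spacing of thresholds, the recruitment range of 30, the linear rate coding and the Gaussian inter-discharge variability with a coefficient of variation of 0.2 are assumptions, while the 8 to 35 Hz firing range and the 75% MVC full-recruitment level are taken from the single-muscle example described later in the Methods.

```python
import random

def recruitment_thresholds(n_mu, max_threshold=0.75, recruitment_range=30.0):
    """Exponentially spaced thresholds; the last MU is recruited at max_threshold."""
    return [max_threshold * recruitment_range ** (i / (n_mu - 1) - 1.0)
            for i in range(n_mu)]

def firing_rate(excitation, threshold, min_rate=8.0, max_rate=35.0):
    """Assumed linear rate coding: min_rate at threshold, max_rate at 100% excitation."""
    if excitation < threshold:
        return 0.0
    frac = (excitation - threshold) / (1.0 - threshold)
    return min_rate + (max_rate - min_rate) * frac

def discharge_times(rate_hz, duration_s, cov=0.2, rng=None):
    """Discharge times with Gaussian inter-discharge intervals around 1 / rate."""
    if rate_hz <= 0.0:
        return []
    rng = rng or random.Random(0)
    mean_isi = 1.0 / rate_hz
    t, times = 0.0, []
    while True:
        # Clip to keep intervals strictly positive
        t += max(rng.gauss(mean_isi, cov * mean_isi), 0.1 * mean_isi)
        if t >= duration_s:
            return times
        times.append(t)

thresholds = recruitment_thresholds(200)
rates_at_half = [firing_rate(0.5, th) for th in thresholds]
spikes = discharge_times(20.0, 10.0, rng=random.Random(1))
```

Small MUs receive the lowest thresholds, so they are recruited first as the excitation increases, consistent with the size principle.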

Implementation remarks

The implementation of the main steps presented in the previous sections can be summarized as follows. Once the matrices S, A and B are computed, the matrix K is determined using Eq. (6) by solving n_e linear systems. Then, Eq. (7) is used to find the solutions for n_s basis points, which is a fast matrix multiplication operation. For any given point source location r, we compute its barycentric coordinates in the associated tetrahedron and apply Eq. (5) to get the values of the electrical potential at the electrode locations. Finally, for a given fibre geometry, the single fibre action potential as recorded by the EMG electrodes is computed using Eq. (9), in practice through its discretized form, Eq. (10).

The results presented in this study are obtained using a Python implementation of the proposed strategy. Assembling the matrix A and solving the system (6) is delegated to the FEniCS computing platform^50,51. The forearm geometry that is here representatively used as a conductor model is taken from the website of BodyParts3D, The Database Center for Life Science (http://lifesciencedb.jp/bp3d/). The volume mesh is generated from the surface meshes of the forearm tissues using the CGAL C++ library⁵².

Computational performance

In this section, we report the computational time of the proposed model for a specific simulation case. The exact computational time values strongly depend on the implementation, experiment design, model parameters etc. The order of magnitude, however, stays the same. Note, that no multiprocessing tools were used in these computations. Each step, however, is highly scalable and can be efficiently distributed between parallel processes, which would significantly increase the performance. Computations for each muscle and fibre are independent and can be performed in parallel. Parallel computing would also apply to the electrodes in the general basis points computation.

The reduction of the time complexity allowed by our model can be generally expressed in the following way. When the forward equations are solved naively, i.e. for each fibre and each time sample separately, the complexity is of the order of the product of the corresponding parameters, O(∏_i n_i), where n_i are respectively the number of mesh vertices, electrodes, fibres, time samples, etc. Our method decomposes this original product into a sum of smaller products of parameter subsets $O({\sum }_{j}{\prod }_{i\in {I}_{j}}{n}_{i})$. Each such sub-product corresponds to a specific step in the simulation pipeline. For example, computing the forward solver is dominated by solving Eq. (6) with the conjugate gradient method, which only depends on the number of electrodes and mesh vertices. Not only does this decomposition reduce the overall time complexity, it also allows pre-computed data to be reused for a new simulation if only a subset of parameters is changed.

For the purpose of demonstration, we simulated a 1-min-long, 100% maximum voluntary contraction (MVC) excitation of the Brachioradialis muscle with 50,000 individual fibres and 200 motor units. The mesh of the volume conductor contained 2.1M vertices, which formed 13M tetrahedra. 16 rectangular and 16 circular electrodes were included in the model. The sampling frequency of the simulated signals was 2000 Hz. Table 1 shows the computational time for each of the main steps in this simulation.

Table 1 Computational performance of each of the main steps of a raw EMG simulation


An important property of our model is that each step depends only on the data produced by the previous steps. This property can be exploited to change some simulation parameters without recomputing every step of the simulation. For example, it is not necessary to recompute solutions for the fibre basis points if the fibre geometry and conductor model stay the same and only the parameters related to the fibre properties (AP velocity, end-plate location, tendon sizes, etc.), MU distribution or recruitment model are modified. In this example, the total simulation time for this new set of parameters will only be approximately 30 s + 0.8 s + 2.6 s = 33.4 s.
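The dependency structure of the pipeline can be illustrated with a small sketch that lists, for a set of changed parameters, which steps have to be recomputed. The stage and parameter identifiers are hypothetical names chosen to mirror the description of the steps in this section; they are not taken from the simulation software.

```python
# Ordered pipeline stages and the parameters that first enter at each stage
# (identifiers are illustrative, not those of the actual implementation).
STAGES = [
    ("general_basis_points",
     {"arm_geometry", "electrode_geometry", "tissue_conductivities"}),
    ("fibre_basis_points", {"fibre_geometry"}),
    ("fibre_emg_responses",
     {"iap_shape", "ap_velocity", "tendon_sizes", "endplate_location",
      "fibre_diameter", "intracellular_conductivity", "sampling_frequency"}),
    ("mu_action_potentials", {"mu_distribution"}),
    ("semg_synthesis", {"excitation_drive", "recruitment_model"}),
]

def stages_to_rerun(changed):
    """Return the stages to recompute: the first stage that uses a changed
    parameter and every stage after it (each stage consumes earlier outputs)."""
    changed = set(changed)
    for k, (_, params) in enumerate(STAGES):
        if params & changed:
            return [name for name, _ in STAGES[k:]]
    return []

rerun_after_velocity_change = stages_to_rerun({"ap_velocity"})
rerun_after_recruitment_change = stages_to_rerun({"recruitment_model"})
```

Changing only the AP velocity leaves the two basis-point stages untouched, which corresponds to the case where only the three last steps contribute to the simulation time.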

A brief description of the main parameters required at each step follows. The full arm and electrode geometry as well as the tissue conductivities define the computation of general basis points. To compute the fibre basis point solutions, the 3D geometry of the fibres is required. Computing the fibre EMG responses requires the shape of the intracellular AP waveforms, AP propagation velocity, sizes of tendon and active fibre parts, neuromuscular junction location, fibre diameter and intracellular conductivity, and sampling frequency. To compute the MU action potentials, the MU distribution in the muscle, i.e. the association of fibres to each motor unit, needs to be defined. In the proposed model, once the number of MUs, their sizes and territory areas are selected, the MU distribution is randomly generated. Finally, to synthesize the sEMG signal, the muscle excitation drives and recruitment model parameters (motor unit recruitment thresholds and firing rates) are required.

Comparison with the cylindrical analytical solution

First, we compared our numerical solution with its analytical counterpart for a simple volume conductor geometry²³. We used a four layer cylindrical model with layers corresponding to bone (r = 0.7 cm), muscle (r = 2 cm), fat (r = 2.3 cm) and skin (r = 2.4 cm) surfaces. 16 point electrodes were simulated on the skin surface directly above a fibre. The fibre was located at varying depths into the muscle tissue, in the range of 1–11 mm. Differential sEMG signals were simulated using the analytical and numerical solutions of the forward problem.

Details of realistic simulation examples

For the single muscle excitation example, 50k muscle fibres were generated inside the muscle and distributed within 200 motor units. The size of MUs varied exponentially from 11 to 1150 fibres. The areas of MU territories varied from 10% to 50% of the muscle cross-sectional area. The muscle excitation drive was decomposed into MU impulse trains according to the size principle. In this example, the firing rate for each MU ranged from 8 Hz to 35 Hz and all MUs were recruited when an excitation level of 75% MVC was reached.

For the multiple muscles experiment, the flexor group included the Palmaris longus, Flexor carpi ulnaris (ulnar head), Flexor carpi ulnaris (humeral head), and Flexor carpi radialis muscles. The extensor group included the Extensor digitorum, Extensor carpi ulnaris, Extensor carpi radialis brevis, and Extensor carpi radialis longus muscles. During a wrist flexion, the muscles of the flexor group reached an excitation level of 50% MVC. During extension, the extensor group was activated with the same excitation level. Moreover, a small but constant excitation (7% MVC) of the abduction muscle group was added to simulate the wrist resistance against gravity. The abduction muscle group included the Flexor carpi radialis, Extensor carpi radialis brevis, and Extensor carpi radialis longus muscles. For each muscle, a number of muscle fibres between 32k and 78k was simulated, depending on the muscle cross-sectional area. Muscle fibres were distributed within motor units, whose number varied from 150 to 300 per muscle.

Details of deep learning experiment

To evaluate the effect of using the simulation-pretrained network, an experimentally collected high-density surface electromyography (HD-sEMG) signal dataset was used, originally created to test low-force human-computer interaction with wrist-wearable interfaces⁷. The experimental protocol was designed in agreement with the Declaration of Helsinki and was approved by the Imperial College London ethics committee (JRCO: 18IC4685). A balanced gender distribution (as self-reported) was prioritised during participants’ recruitment. Nine participants (4 females, 5 males, ages: 23–31) took part in the study after signing informed consent forms. Participants were not compensated for their involvement in the experiments. The participants performed 5-s isometric contractions of their dominant-hand index finger at 15% of maximal force, with sEMG activity measured using two flexible 5 × 13 electrode grids with 8-mm spacing placed on the circumference of the wrist, immediately proximal to the ulnar head. The HD-sEMG signal was acquired using a Quattrocento amplifier (OT Bioelettronica) at 2048 Hz sampling frequency, while force profiles were sampled with a Phidget load cell at 10 Hz. A Matlab 2019b (The MathWorks, Inc) program was used to synchronise both modalities. The HD-sEMG signal was then decomposed into motor neuron activity using convolutive blind source separation⁵³. For the purpose of training and testing the supervised decomposition pipeline, motor neuron activity was accepted if it was present for at least 80% of the contraction window. For each participant the HD-sEMG signal and accompanying decomposed motor neuron activity (as a sparse binary matrix) were then split into a 4-s training window and a 1-s testing window.

A gated recurrent unit (GRU) neural network was used as the deep learning model due to previous studies showing good performance with this data type²⁷. After hyperparameter optimisation by grid search, a minimally-parameterised model was found to perform optimally, likely due to the short length of the training data available. Input HD-sEMG signal was first encoded by a single layer GRU with a hidden dimension of 1024 in length⁵⁴. To make a time instant prediction a densely-connected linear layer with sigmoid activation function took as an input a moving 20 sample-wide window from the GRU output, centred on the time instant of interest. Predicted activity was converted to spike timestamps using a two-class K-means clustering algorithm. Binary cross entropy was used as the loss function and Adam with weight decay used as the optimising algorithm⁵⁵.
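The conversion of predicted activity into spike timestamps by two-class K-means can be illustrated for a single motor neuron with the sketch below. It is a plain one-dimensional implementation written for this example (centroids initialised at the minimum and maximum values); the actual clustering library and any post-processing of consecutive samples are not described in the text and are not reproduced here.

```python
def two_class_kmeans_spikes(activity, n_iter=50):
    """Cluster scalar network outputs into 'rest' and 'spike' classes and
    return the sample indices assigned to the high-valued cluster."""
    lo, hi = min(activity), max(activity)
    if lo == hi:                       # no contrast: nothing to separate
        return []
    for _ in range(n_iter):
        high = [a for a in activity if abs(a - hi) < abs(a - lo)]
        low = [a for a in activity if abs(a - hi) >= abs(a - lo)]
        new_lo, new_hi = sum(low) / len(low), sum(high) / len(high)
        if (new_lo, new_hi) == (lo, hi):
            break                      # centroids stopped moving
        lo, hi = new_lo, new_hi
    return [i for i, a in enumerate(activity) if abs(a - hi) < abs(a - lo)]

# Hypothetical sigmoid outputs for 8 consecutive time instants
activity = [0.02, 0.01, 0.93, 0.05, 0.03, 0.88, 0.04, 0.01]
spike_samples = two_class_kmeans_spikes(activity)
spike_times_s = [i / 2048.0 for i in spike_samples]   # 2048 Hz sampling
```

Clustering avoids fixing a detection threshold by hand, since the boundary between the two classes adapts to the output range of each motor neuron.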

To improve model generalisation an early-stopping framework was used, based on 10% of the training data retained as a validation set. Training, validation and test data were z-score standardised using the mean and standard deviation calculated from the training set. During training the input signal was augmented with noise of standard normal distribution. To account for the high sparsity of the output matrix, samples containing motor neuron activity were artificially oversampled, with each input batch of 512 time instants containing at least 20% motor neuron activation. All machine learning was implemented using the PyTorch library in Python. Final performance was assessed using the rate of agreement metric (RoA).
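Two of the steps above, standardisation with training-set statistics and the rate of agreement, can be sketched as follows. The RoA formula is not given in this section; the sketch assumes the definition commonly used in the decomposition literature, i.e. the number of discharges common to both spike trains divided by the sum of common and unmatched discharges, with a hypothetical matching tolerance of one sample.

```python
def zscore_params(train):
    """Mean and (population) standard deviation of the training samples."""
    n = len(train)
    mean = sum(train) / n
    std = (sum((x - mean) ** 2 for x in train) / n) ** 0.5
    return mean, std

def standardise(x, mean, std):
    """Apply the training-set statistics to any split (train, validation, test)."""
    return [(v - mean) / std for v in x]

def rate_of_agreement(reference, predicted, tolerance=1):
    """Assumed definition: common / (common + only_reference + only_predicted),
    where discharges match if they are within `tolerance` samples."""
    unmatched = sorted(predicted)
    common = 0
    for r in sorted(reference):
        hit = next((p for p in unmatched if abs(p - r) <= tolerance), None)
        if hit is not None:
            unmatched.remove(hit)      # each prediction can match only once
            common += 1
    total = len(reference) + len(unmatched)   # common + only_ref + only_pred
    return common / total if total else float("nan")

mean, std = zscore_params([1.0, 2.0, 3.0, 4.0, 5.0])
test_standardised = standardise([3.0, 5.0], mean, std)
roa = rate_of_agreement([10, 50, 90], [11, 50, 130])
```

Using the training statistics for all splits avoids leaking information from the test window into the preprocessing.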

The optimised architecture of the GRU network was used for pre-training, which was conducted using multitask learning in a hard parameter sharing paradigm⁵⁶. Four digital twins were created for simulation using different model parameters (tissue conductivities, MU distribution, fibre properties, etc.), with the generated motor unit action potential (MUAP) templates from flexor digitorum profundus and superficialis used to create 64 sets, each containing 5 MUAPs. Each set was used to generate windows of signal with a range of MUAP superpositions (Fig. 7a). In signal windows with motor neuron activity a MUAP template was placed in the centre of the window, before being additively superimposed with a random number of MUAP templates from other motor units at random time offsets. In windows without activity no template was placed in the centre of the window. During multitask learning training, the same GRU layer (and parameters) was shared between the 64 recordings, but each recording had its own output layer, operating on a 20 sample-wide window as in the experimental recordings (Fig. 7b). In this way the GRU layer was trained to act as a more general feature extractor, while the individual linear output layers made class predictions specific to each recording. Training again used noise augmentation, binary cross-entropy and Adam with weight decay.

**Fig. 7: Neural network training pipeline.**

To use the simulation-pre-trained network on the experimental data, the GRU parameters from the pre-trained network were used, while the linear output layer used a normal random initialisation. This was then compared to a normal random initialisation of both the GRU and output layer. In both instances the network was trained using the methodology specified above, with the only difference being whether the GRU layer was simulation-pre-trained or not.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Surface geometry data of muscles, bones, subcutaneous tissue and skin used for arm modelling were taken from BodyParts3D, The Database Center for Life Science (http://lifesciencedb.jp/bp3d/). The simulated MUAP data that were used to train the neural network in the deep learning experiment have been deposited in the Science Data Bank database under https://doi.org/10.57760/sciencedb.07548 with a CC BY-NC-SA 4.0 licence. The raw experimental data that were used in the deep learning experiment have also been deposited in the Science Data Bank database under https://doi.org/10.57760/sciencedb.07586 with a CC BY-NC 4.0 licence.

Code availability

The code of the EMG simulation software is proprietary and is not publicly available. To request access to the simulation software, please contact kostiantyn.maksymenko@neurodec.ai.

References

Farina, D. et al. Toward higher-performance bionic limbs for wider clinical use. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-021-00732-x (2021).
Farina, D. et al. The extraction of neural information from the surface EMG for the control of upper-limb prostheses: Emerging avenues and challenges. IEEE Trans. Neural Syst. Rehabilitation Eng. 22, 797–809 (2014).
Farina, D. et al. Man/machine interface based on the discharge timings of spinal motor neurons after targeted muscle reinnervation. Nat. Biomed. Eng. 1, 0025 (2017).
Zhuang, K. Z. et al. Shared human-robot proportional control of a dexterous myoelectric prosthesis. Nat. Mach. Intell. 1, 400–411 (2019).
Geng, W. et al. Gesture recognition by instantaneous surface EMG images. Sci. Rep. 6, 36571 (2016).
Guo, W. et al. Long exposure convolutional memory network for accurate estimation of finger kinematics from surface electromyographic signals. J. Neural Eng. 18, 026027 (2021).
Mendez Guerra, I., Barsakcioglu, D. Y., Vujaklija, I., Wetmore, D. Z. & Farina, D. Far-field electric potentials provide access to the output from the spinal cord from wrist-mounted sensors. J. Neural Eng. 19, 026031 (2022).
Holobar, A. & Farina, D. Noninvasive neural interfacing with wearable muscle sensors: combining convolutive blind source separation methods and deep learning techniques for neural decoding. IEEE Signal Process. Mag. 38, 103–118 (2021).
Bird, J. J., Pritchard, M., Fratini, A., Ekart, A. & Faria, D. R. Synthetic biological signals machine-generated by GPT-2 improve the classification of EEG and EMG through data augmentation. IEEE Robot. Autom. Lett. 6, 3498–3504 (2021).
Tsinganos, P., Cornelis, B., Cornelis, J., Jansen, B. & Skodras, A. Data augmentation of surface electromyography for hand gesture recognition. Sensors 20, 4892 (2020).
Wang, F., Zhong, S.-h., Peng, J., Jiang, J. & Liu, Y. in MultiMedia Modeling (eds. Schoeffmann, K. et al.) 82–93 (Springer International Publishing, 2018).
Zanini, R. A. & Colombini, E. L. Parkinson’s disease EMG data augmentation and simulation with DCGANs and style transfer. Sensors 20, 2605 (2020).
Wen, S. et al. Rapid adaptation of brain-computer interfaces to new neuronal ensembles or participants via generative modelling. Nat. Biomed. Eng. https://www.nature.com/articles/s41551-021-00811-z#citeas (2021).
Gootzen, T. H. J. M., Stegeman, D. F. & van Oosterom, A. Finite limb dimensions and finite muscle length in a model for the generation of electromyographic signals. Electroencephalogr. Clin. Neurophysiol. Evoked Potentials 81, 152–162 (1991).
Fuglevand, A. J., Winter, D. A., Patla, A. E. & Stashuk, D. Detection of motor unit action potentials with surface electrodes: influence of electrode size and spacing. Biol. Cybern. 67, 143–153 (1992).
Stegeman, D. F. & Linssen, W. H. Muscle fiber action potential changes and surface EMG: a simulation study. J. Electromyogr. Kinesiol. 2, 130–140 (1992).
Yue, G., Fuglevand, A. J., Nordstrom, M. A. & Enoka, R. M. Limitations of the surface electromyography technique for estimating motor unit synchronization. Biol. Cybern. 73, 223–233 (1995).
Roeleveld, K., Blok, J. H., Stegeman, D. F. & Oosterom, A. V. Volume conduction models for surface emg; confrontation with measurements. J. Electromyogr. Kinesiol. 7, 221–232 (1997).
Schneider, J., Silny, J. & Rau, G. Influence of tissue inhomogeneities on noninvasive muscle fiber conduction velocity measurements—investigated by physical and numerical modeling. IEEE Trans. Biomed. Eng. 38, 851–860 (1991).
Botelho, D. P., Curran, K. & Lowery, M. M. Anatomically accurate model of EMG during index finger flexion and abduction derived from diffusion tensor imaging. PLoS Comput. Biol. 15, 1–24 (2019).
Carriou, V., Boudaoud, S., Laforet, J. & Ayachi, F. S. Fast generation model of high density surface EMG signals in a cylindrical conductor volume. Comput. Biol. Med. 74, 54–68 (2016).
Vecchio, A. D. D. et al. Spinal motoneurons of the human newborn are highly synchronized during leg movements. Sci. Adv. 6, eabc3916 (2020).
Farina, D., Mesin, L., Martina, S. & Merletti, R. A surface EMG generation model with multilayer cylindrical description of the volume conductor. IEEE Trans. Biomed. Eng. 51, 415–426 (2004).
Merletti, R. & Muceli, S. Tutorial. Surface EMG detection in space and time: best practices. J. Electromyogr. Kinesiol. 49, 102363 (2019).
Henneman, E. Relation between size of neurons and their susceptibility to discharge. Science 126, 1345–1347 (1957).
Girshick, R., Donahue, J., Darrell, T. & Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 580–587 (2014).
Clarke, A. K. et al. Deep learning for robust decomposition of high-density surface EMG signals. IEEE Trans. Biomed. Eng. 68, 526–534 (2021).
Merletti, R. & Farina, D. Surface Electromyography: Physiology, Engineering, and Applications (John Wiley & Sons, Ltd, 2016).
Dimitrov, G. V. & Dimitrova, N. A. Precise and fast calculation of the motor unit potentials detected by a point and rectangular plate electrode. Med. Eng. Phys. 20, 374–381 (1998).
Farina, D. & Merletti, R. A novel approach for precise simulation of the EMG signal detected by surface electrodes. IEEE Trans. Biomed. Eng. 48, 637–646 (2001).
Plonsey, R. Action potential sources and their volume conductor fields. Proc. IEEE 65, 601–611 (1977).
Plonsey, R. & Heppner, D. B. Considerations of quasi-stationarity in electrophysiological systems. Bull. Math. Biophys. 29, 657–664 (1967).
Heringa, A., Stegeman, D. F., Uijen, G. J. & de Weerd, J. P. Solution methods of electrical field problems in physiology. IEEE Trans. Biomed. Eng. BME-29, 34–42 (1982).
Farina, D., Mesin, L. & Martina, S. Advances in surface electromyographic signal simulation with analytical and numerical descriptions of the volume conductor. Med. Biol. Eng. Comput. 42, 467 (2004).
Nicholson, C. & Freeman, J. A. Theory of current source density analysis and determination of conductivity tensor for anuran cerebellum. J. Neurophysiol. 38, 356–368 (1975).
Pettersen, K. H., Lindén, H., Dale, A. M. & Einevoll, G. T. Extracellular Spikes and Current-source Density 92–135 (Cambridge University Press, 2010).
Knabner, P. & Angermann, L. The Finite Element Method for the Poisson Equation 46–91 (Springer New York, 2003).
Vallaghé, S., Papadopoulo, T. & Clerc, M. The adjoint method for general EEG and MEG sensor-based lead field equations. Phys. Med. Biol. 54, 135–147 (2008).
Plonsey, R. The active fiber in a volume conductor. IEEE Trans. Biomed. Eng. BME-21, 371–381 (1974).
Rosenfalck, P. Intra- and extracellular potential fields of active nerve and muscle fibres. A physico-mathematical analysis of different models. Acta Physiol. Scand. Suppl. 321, 1–168 (1969).
Burke, R. E., Levine, D. N., Tsairis, P. & Zajac, F. E. Physiological types and histochemical profiles in motor units of the cat gastrocnemius. J. Physiol. 234, 723–748 (1973).
Schiaffino, S. & Reggiani, C. Fiber types in mammalian skeletal muscles. Physiol. Rev. 91, 1447–1531 (2011).
Modenese, L. & Kohout, J. Automated generation of three-dimensional complex muscle geometries for use in personalised musculoskeletal models. Ann. Biomed. Eng. 48, 1793–1804 (2020).
Keenan, K. G., Farina, D., Merletti, R. & Enoka, R. M. Influence of motor unit properties on the size of the simulated evoked surface EMG potential. Exp. Brain Res. 169, 37–49 (2006).
Carriou, V., Laforet, J., Boudaoud, S. & Al Harrach, M. Realistic motor unit placement in a cylindrical HD-sEMG generation model. in 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 1704–1707 (IEEE, 2016).
Fuglevand, A. J., Winter, D. A. & Patla, A. E. Models of recruitment and rate coding organization in motor-unit pools. J. Neurophysiol. 70, 2470–2488 (1993).
Ayachi, F. S., Boudaoud, S. & Marque, C. K. Evaluation of muscle force classification using shape analysis of the sEMG probability density function: a simulation study. Med. Biol. Eng. Comput. 52, 673–684 (2014).
De Luca, C. J. & Hostage, E. C. Relationship between firing rate and recruitment threshold of motoneurons in voluntary isometric contractions. J. Neurophysiol. 104, 1034–1046 (2010).
Arabadzhiev, T. I., Dimitrov, V. G., Dimitrova, N. A. & Dimitrov, G. V. Influence of motor unit synchronization on amplitude characteristics of surface and intramuscularly recorded EMG signals. Eur. J. Appl. Physiol. 108, 227 (2010).
Logg, A., Mardal, K. A. & Wells, G. N. Automated Solution of Differential Equations by the Finite Element Method Vol. 84 of Lecture Notes in Computational Science and Engineering (Springer, 2012).
Alnæs, M. et al. The FEniCS Project Version 1.5. Arch. Numer. Softw. 3, 9–23 (2015).
The CGAL Project. CGAL User and Reference Manual, 5.2.1 edn. https://doc.cgal.org/5.2.1/Manual/packages.html (CGAL Editorial Board, 2021).
Negro, F., Muceli, S., Castronovo, A. M., Holobar, A. & Farina, D. Multi-channel intramuscular and surface EMG decomposition by convolutive blind source separation. J. Neural Eng. 13, 026027 (2016).
Cho, K. et al. Learning phrase representations using RNN encoder–decoder for statistical machine translation. in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) 1724–1734 (Association for Computational Linguistics, 2014).
Loshchilov, I. & Hutter, F. Decoupled weight decay regularization. in International Conference on Learning Representations (ICLR 2019) (2019).
Baxter, J. A Bayesian information theoretic model of learning to learn via multiple task sampling. Mach. Learn. 28, 7–39 (1997).

Acknowledgements

For this study, D.F. was sponsored by the European Research Council (ERC) under the Synergy Grant Natural BionicS (810346) and the EPSRC Transformative Healthcare for 2050 project NISNEM Technology (EP/T020970/1). A.C. and I.M.G. are sponsored by the Engineering and Physical Sciences Research Council (EPSRC) - Centre for Doctoral Training in Neurotechnology for Life and Health and Meta.

Author information

Authors and Affiliations

Neurodec, Sophia Antipolis, France
Kostiantyn Maksymenko & Samuel Deslauriers-Gauthier
Department of Bioengineering, Imperial College London, London, UK
Alexander Kenneth Clarke, Irene Mendez Guerra & Dario Farina
Centre Inria d’Université Côte d’Azur, Valbonne, France
Samuel Deslauriers-Gauthier

Contributions

K.M., S.D.G. and D.F. conceptualized the study. K.M. and S.D.G. developed the software implementation of the simulator. K.M., A.C., I.M.G., S.D.G. and D.F. performed the experimental measurements and conceptualized the data analysis. A.C. and I.M.G. performed the data analysis. K.M., S.D.G. and D.F. prepared the first draft of the manuscript. All authors edited the manuscript for important scientific content and all approved the final version.

Corresponding authors

Correspondence to Kostiantyn Maksymenko or Dario Farina.

Ethics declarations

Competing interests

K.M. and S.D.G. are founders of the company Neurodec which specializes in EMG simulation and analysis. The remaining authors declare no competing interests.

Ethical statement

The experimental protocol was designed in agreement with the Declaration of Helsinki and was approved by Imperial College London ethics committee (JRCO: 18IC4685). All participants signed informed consent forms.

Peer review

Peer review information

Nature Communications thanks Aditya Shekhar Nittala, Thomas Stieglitz and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Description of Additional Supplementary Files

Supplementary Video 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Maksymenko, K., Clarke, A.K., Mendez Guerra, I. et al. A myoelectric digital twin for fast and realistic modelling in deep learning. Nat Commun 14, 1600 (2023). https://doi.org/10.1038/s41467-023-37238-w

Received: 21 June 2022
Accepted: 08 March 2023
Published: 23 March 2023
DOI: https://doi.org/10.1038/s41467-023-37238-w
