Prospective errors determine motor learning

Takiyama, Ken; Hirashima, Masaya; Nozaki, Daichi

doi:10.1038/ncomms6925

Download PDF

Article
Open access
Published: 30 January 2015

Prospective errors determine motor learning

Ken Takiyama¹,
Masaya Hirashima² &
Daichi Nozaki³

Nature Communications volume 6, Article number: 5925 (2015) Cite this article

9158 Accesses
44 Citations
14 Altmetric
Metrics details

Subjects

Abstract

Diverse features of motor learning have been reported by numerous studies, but no single theoretical framework concurrently accounts for these features. Here, we propose a model for motor learning to explain these features in a unified way by extending a motor primitive framework. The model assumes that the recruitment pattern of motor primitives is determined by the predicted movement error of an upcoming movement (prospective error). To validate this idea, we perform a behavioural experiment to examine the model’s novel prediction: after experiencing an environment in which the movement error is more easily predictable, subsequent motor learning should become faster. The experimental results support our prediction, suggesting that the prospective error might be encoded in the motor primitives. Furthermore, we demonstrate that this model has a strong explanatory power to reproduce a wide variety of motor-learning-related phenomena that have been separately explained by different computational models.

Reinforcement learning establishes a minimal metacognitive process to monitor and control motor learning performance

Article Open access 08 July 2023

Remembrance of things practiced with fast and slow learning in cortical and subcortical pathways

Article Open access 23 December 2020

Comparison of online, offline, and hybrid hypotheses of motor sequence learning using a quantitative model that incorporate reactive inhibition

Article Open access 26 February 2024

Introduction

Diverse features of motor learning have been reported by numerous experiments, but no single theoretical framework concurrently accounts for all of these features. For example, after learning in a novel visuomotor environment followed by a washout phase, the learning speed in the relearning phase is faster than that in the initial learning phase. This acceleration of motor learning has been explained by the incorporation of fast and slow components into the motor-learning process¹. However, it remains unclear how such a multi-learning-rate model can be extended to explain the decrement of learning speed with increased uncertainty of feedback information. Although a standard Kalman filter^2,3,4 successfully explains this uncertainty effect, it cannot explain how motor memory can be formed and maintained even when the environment randomly varies from trial to trial (structural learning)^5,6,7. Several models have been proposed to explain structural learning by assuming that subjects have already acquired a priori knowledge regarding the tendency of environmental variation^8,9. However, to our knowledge, few computational models can explain structural learning without any a priori knowledge. Thus, a single framework that can explain such a wide variety of phenomena is currently unavailable.

Here we propose a novel model for motor learning to explain a wide variety of phenomena in a unified way by extending a theoretical framework of motor primitives^{10,11,12,13,14,15}. In the original framework, activities of motor primitives determine motor commands, and an appropriate set of motor primitives is recruited according to the various features of the desired movement, such as planned movement direction^10,11. This framework successfully reproduces the basic pattern of trial-dependent changes in the movement error and how motor learning is generalized when the kinematics (for example, movement direction) change.

However, the manner in which the activities of motor primitives are determined remains controversial. In contrast to the conventional idea that the desired movement direction determines the activities of motor primitives^10,11,12, a recent study suggested the possible involvement of the executed movement in determining these activities¹³. The model we propose in the present study assumes that the predicted movement error of an upcoming movement, termed the prospective error (PE), also contributes to determining the activities of the primitives. This assumption is based on two components: (1) a theoretical consideration regarding the formation and maintenance of motor memory from a randomly changing environment, and (2) recent neurophysiological findings^16,17 showing that some motor-related neurons encode the PE rather than the desired or executed movements.

In the present study, first, we analytically reveal that the activities of motor primitives need to be determined based on the PE such that the motor memory can be formed and maintained in a randomly changing environment. Second, to validate the idea of incorporating the PE into motor learning, we experimentally demonstrate a novel motor-learning phenomenon that can be predicted by our model: after experiencing an environment in which the movement error is more easily predictable, subsequent motor learning should become faster. Finally, using a computer simulation, we show that our model can account for several different and seemingly unrelated phenomena in motor learning, such as structural learning^5,6,7, modulation of the learning rate because of uncertainty of error feedback^3,4, savings after short and long washout trials^18,19,20, anterograde interference^21,22 and spontaneous recovery^1,23,24. Although different conventional models have separately explained these phenomena, our model is unique in that it can explain them within a single framework.

Results

General framework

The present study used a task involving reaching towards a single target in a horizontal plane (Fig. 1a). The goal of the task was to move a cursor to the target accurately in a situation where an executed movement is perturbed by a change in the environment, p, for example, the external force generated by a manipulandum²⁵ (Fig. 1b) or visuomotor transformation²⁶ (Fig. 1c). The motor command, x, to compensate for a perturbation, p, is modelled by the summation of the activities of the motor primitives as x=WA^T, where W=(W₁, ..., W_N), N is the total number of motor primitives, W_i represents how the ith primitive contributes to the production of the motor command, A=(A₁, ..., A_N), and A_i is the activity of the ith primitive (we propose that this be determined depending on the PE (details are provided in the section Prospective error)). The movement error at the t-th trial can thus be expressed as . To minimize the squared movement error, W is modified as

where λ is the forgetting rate and η is the learning rate, indicating that the more activated the ith primitive, the more the W_i is modified to minimize the squared movement error (the stronger the motor memory is formed in the ith primitive). Similarly, if the ith primitive is not activated at the t-th trial, W_i is not modified (the motor memory embedded in the ith primitive can be kept).

Theoretical considerations in randomly changing environments

First, we analytically considered the problem of what characteristics of the movement the primitives need to encode. We focused on the problem of how a motor memory can be formed within a randomly changing environment. Recent works have illustrated the ability of the motor system to form motor memories from randomly changing environments: the experience of a randomly changing visuomotor rotation increased the speed of the subsequent learning to a constant visuomotor rotation (structural learning)^5,6,7.

In our model described above, when the perturbation randomly changes from trial to trial, the ensemble average for W_t, W_t+1, W_t+2, ... across all possible realizations converges to

after many trials, where E[·] represents the ensemble average taken across different simulation runs (see the Theoretical analysis section in Methods for a detailed analysis). When the perturbation randomly changes around 0, E[p]=0. If p and A are independent, then the weighting parameter W=0. This indicates that motor primitives can form and maintain motor memory in a randomly varying environment only when A_t encodes the information of p_t.

Prospective error

Notably, A_t cannot directly encode p_t, because the information for p_t is only available after motor execution. A possible solution is to assume that the motor-learning system predicts a factor (factors) that contains the information of p_t. Because the goal of motor learning is to minimize movement error, the motor-learning system uses a movement error, e_t, as a learning signal. Here, we assumed that e_t is used not only as a learning signal but also as a signal for predicting the PE (Fig. 1a), which should contain the information regarding the perturbation. Recent neurophysiological studies have suggested that some neurons actually encode the PE, or the movement error predicted to be observed in the near future for online movement control^16,17.

Specifically, we assume that the PE is predicted from both the PE and the observed movement error in the previous (t−1)-th trial:

where α is a parameter that determines the degree of update based on the difference between the PE and the observed movement error. This update rule is rational when movement error shows trial-to-trial variability, as previously reported in an experimental study²⁷, and movement error is observed with a sensory noise (detailed descriptions are given in the Update rule of PE section in Methods).

We also assume that the primitives encode the PE following a Gaussian: (ê_t is the PE), where the scaling parameter σ_i=σ is independent of i and μ_iε(−180°, 180°) is randomly sampled from a uniform distribution. The ith primitive is maximally activated when the PE is equivalent to its preferred PE μ_i. A summarized procedure for the computer simulation is provided in the Summary of computer simulations section in Methods.

Numerical simulation in randomly changing environments

Here, we try to observe the behaviour of motor learning under a stochastically changing environment. Our model predicts that learning speed in the test phase can be increased when the perturbation randomly varies in every two or three trials during the training trials (groups 2 and 3) (Fig. 2a,d). In contrast, learning speed was not facilitated when the perturbation randomly varied in every trial during the training trials (group 1). In group 2 (or 3), two (or three) consecutive identical perturbations make it more reliable to predict the movement error, and the primitives encoding the PE gradually acquire the knowledge to compensate for the same movement error (for example, primitives for 30° PE learn the 30° perturbation) (Fig. 2b and red dotted line in Fig. 2c). In the test phase, the motor memory embedded in the primitives for the positive PE is reactivated, which leads to an increase in learning speed. In contrast, when the perturbation changes from trial to trial (group 1), the PE does not have information regarding the perturbation because it was completely unpredictable (Fig. 2e and green dotted line in Fig. 2f), resulting in the failure of motor memory formation.

Behavioural experiment

It should be noted that the difference among groups 1, 2 and 3 described above is a novel prediction that has never been predicted nor tested. Therefore, we performed a behavioural experiment to validate this prediction. Notably, this prediction contrasts with a conventional Bayesian framework because, according to this framework, a more uncertain random perturbation is associated with faster learning in a subsequent adaptation to a constant perturbation³. In the present experiment, subjects moved a manipulandum to control a cursor on a horizontal screen towards a forward target. In training trials, the cursor’s movement direction randomly rotated either in every trial (group 1), in every two trials (group 2) or in every three trials (group 3) by a certain amount sampled from a set of rotations (−45°, −30°, −15°, 0°, 15°, 30° and 45°) (Fig. 3a,b). Hand movements during the training trials were always constrained along a straight line from the starting position to the target by the manipulandum (that is, force channel trial) (Fig. 3b), which allowed us to differentiate the predictions of our model from those of conventional models, as described below. After the training phase, subjects experienced a constant amount of visuomotor rotation (±30°) in test trials without the force channel. The training and test trials were interleaved with washout trials to rule out the possible effect of cursor movements in the last training trial on the learning speed in the test trials. Although this experimental setting was slightly different from the conditions we simulated in Fig. 2, the predictions of our model were invariant: learning speed in test trials was predicted to be faster in groups 2 and 3 than in group 1 (Fig. 3c; in these simulations, x_t in training trials was always set to 0 with assuming force channel trials).

We used the force channel trials as training trials because they were useful to clarify the differences between our model and other conventional models. Although the force channel trials seem unnatural for an experimental setting, subjects can generate forces to compensate for the observed movement error (Fig. 4a). Because the force channel trials made identical target and hand-movement directions throughout all of the training trials, the same primitives were always activated according to the ideas from conventional models^10,11,12,13. Because the average value of the movement error experienced by these primitives across many trials would be 0, the conventional models predict that no adaptation should occur. As several recent studies have suggested, motor adaptation could be influenced by reward^28,29,30. In our experiment, however, the reward was likely to be almost identical among groups 1, 2 and 3 (the success rate was 1/7 in all the groups), suggesting no reward-associated difference in motor adaptation among the three groups. In contrast, because the PE was easily predicted in groups 2 and 3 compared with group 1, our model predicted that subjects in groups 2 and 3 would show faster adaptation during the test phase than those in group 1 (Fig. 3c).

**Figure 4: Results of our behavioural experiment.**

The experimental results supported this prediction: in test trials, subjects in groups 2 (12 subjects: 6 for +30° rotation, 6 for −30° rotation) and 3 (12 subjects: 6 for +30° rotation, 6 for −30° rotation) demonstrated faster adaptation than those in group 1 (12 subjects: 6 for +30° rotation, 6 for −30° rotation), and subjects in group 3 demonstrated faster adaptation than those in group 2 (Fig. 4b). We fit an exponential function e_t=a exp(−bt)+c to the bootstrapped data and estimated the learning speed b. The mean value of learning speed b was 0.1410 for group 1, 0.2845 for group 2 and 0.3037 for group 3 (Fig. 4c). Because these differences were significant (P<0.0001, randomization test), subjects in groups 2 and 3 were considered to adapt to visuomotor rotation faster than those in group 1, which was consistent with our model’s prediction.

Furthermore, we fit our model to the data from group 1 and tried to predict the data from groups 2 and 3 (details are provided in the Fitting our model to data from our experiment section in Methods). When we fit our model to the forces in force channel trials and the movement angles in test trials, R² was 0.9950 and 0.8638, respectively (Fig. 4a,b). The movement angles in the test phase of groups 2 and 3 could be predicted with R²=0.7967 and R²=0.7968 (Fig. 4b).

In addition, when our model was used to fit the data sets from previous studies, the resulting R² was higher than 0.8240 (Fig. 5, details are provided in the Fitting our model to data sets from previous studies section in Methods). These studies investigated phenomena seemingly unrelated to structural learning and our behavioural experiment, such as uncertainty effects³¹ or error size effects on error modification³², which were separately reproduced by different computational models, but our PE-based model could be fit to the data sets. Thus, we expect that the PE-based model will reproduce diverse features of motor learning in a unified manner.

**Figure 5: Model fitting to data in crcns.org.**

Reproduction of other phenomena

Here, we demonstrate that our PE-based model can also reproduce diverse phenomena that have previously been explained by different models. We used the best-fit parameters for group 1 in the numerical simulations described below.

Effect of uncertainty on learning speed

Motor learning is hindered when the observed movement error includes uncertainty. For instance, motor-learning speed decreases when the end-point hand position is blurred^3,4. In addition, increased blurring of the end-point position (higher uncertainty) is associated with slower learning speed. To explain this effect of uncertainty, previous studies used a Kalman filter^3,4. Because the uncertainty in the observation of the movement decreases the Kalman gain and learning rate, the framework using a Kalman filter can explain how the uncertainty of the observation adversely influences the motor-learning speed.

Our model also reproduced the detrimental influence of the uncertainty of the error feedback on motor-learning performance (Fig. 6). The influence of the uncertainty can be interpreted based on a recursive equation of motor command (see the Recursive equation of motor command section in Methods for a detailed analysis):

The learning rate is modulated by an inner product A(ê_t) A^T(ê_t+1). The inner product is maximal when ê_t+1=ê_t and minimal when ê_t+1 is completely different from ê_t; great inaccuracy of the prediction of the PE (that is, greater uncertainty of error feedback) is associated with reduced modulation of the learning rate.

Savings

Savings is a phenomenon in which the adaptation to the second exposure is faster than that to the first exposure, although a washout is experienced after the first exposure^1,19,23.

Figure 7a,d indicates the result of a simulation of an experiment in which subjects experience a 30°-visuomotor rotation (initial learning) followed by a −30°-visuomotor rotation (opposite learning) and then are exposed again to a 30°-visuomotor rotation (relearning). The −30°-exposure appears to eliminate motor memory, but the adaptation was faster in the relearning phase than in the initial learning, indicating that our model reproduced the savings. Notably, in contrast to previous models that adopt processes with multiple time constants (that is, slow and fast^1,2,20), our model did not explicitly consider the presence of slow and fast states.

In our model, at the beginning of the initial learning phase, the motor primitives with preferred PEs close to 30° are activated (Fig. 7b) and the weighting parameters of these primitives are modified to decrease the movement error of the 30° rotation (Fig. 7c). However, as the adaptation proceeds, the movement error and the PE decrease, and as a result, different primitives are gradually involved in the decrement of the movement error (Fig. 7b). Because the motor primitives activated at the beginning of the initial learning phase are no longer activated during the latter half of the initial learning phase nor in the opposite learning phase, the weighting parameters of those primitives remain unchanged. Thus, when a 30°-perturbation was re-imposed in the relearning phase, the primitives maintaining the memory are reactivated, which contributes to accelerating adaptation to the 30°-perturbation relative to the initial learning phase.

Previous studies^19,20 have also noted that even the two-state model comprising fast and slow processes, which was developed to explain the savings, cannot explain the experimental result that savings still exist even after a sufficient number of washout trials following the initial learning phase. As shown in Fig. 7e, even with a sufficiently long washout phase, our model can still account for the savings effect when the forgetting rate is close to 1.

Anterograde interference

Anterograde interference is a phenomenon in which the adaptation to a novel environment (for example, clockwise visuomotor rotation) interferes with the subsequent adaptation to another novel environment (for example, counter-clockwise visuomotor rotation)^22,23.

Figures 8a, d demonstrate the results of a simulation in which the subjects experienced a 30°-visuomotor rotation (initial learning) followed by a −30°-visuomotor rotation (opposite learning). Adaptation was slower in the opposite learning phase than in the initial learning phase, indicating that our model reproduced anterograde interference. The motor primitives whose preferred PEs were close to 0° were activated in the latter part of the initial and opposite learning phases (Fig. 8b). The weighting parameters of these primitives were modified to reduce the positive movement error in the initial learning phase, but the content of the motor memory of these primitives needed to be reversed for the opposite learning phase (Fig. 8c). This reversal may increase the number of trials needed for the adaptation in the opposite learning phase. In fact, a longer initial learning phase was associated with slower adaptation in the opposite learning phase (Fig. 8e).

Spontaneous recovery

Motor memory is not easily eliminated once it is formed. After a sufficient amount of force-field training, a short exposure to the opposing force field appears to reverse the motor output (that is, the motor memory content). However, during the forgetting process of the motor memory, the motor memory for the originally trained force field can be spontaneously recovered¹. This phenomenon is called spontaneous recovery^1,23,24.

Figure 9a indicates the result of a simulation in which the subjects experienced a 30°-visuomotor rotation (initial learning phase) followed by a brief period of a −30°-visuomotor rotation (opposite learning phase) and finally a series of error-clamp trials in which the movement error was constrained to 0 (error-clamp trials). At the end of the opposite learning phase, the motor memory for the 30°-visuomotor rotation appeared to be completely eliminated, but the motor memory re-emerged during the error-clamp trials, indicating that our model successfully reproduced spontaneous recovery.

A sufficient amount of initial training trials resulted in a PE of almost 0, and almost all of the motor primitives involved in compensating for the 30°-visuomotor rotation had preferred PEs that were close to 0 (Fig. 9c). However, during the subsequent opposite learning phase, the number of training trials was small and the adaptation was accomplished while the PE did not converge to 0. Thus, the motor primitives involved in the opposite learning phase had PEs that were different from 0, indicating that the motor memory formed in the initial learning phase was not overwritten (Fig. 9d). In the error-clamp trials, the PE gradually approached 0, which reactivated the motor memory embedded in the motor primitives involved in the initial learning phase, leading to a spontaneous recovery of the motor memory.

Discussion

We propose a novel motor-learning model based on motor primitives. Our model assumes that each primitive is activated by a PE, based on both theoretical consideration of how motor memory can be formed and maintained in a randomly varying environment and previous neurophysiological findings showing that some neurons encode a PE for online movement control^16,17. To validate our model, we confirmed its novel prediction that motor-learning speed in response to a constant amount of perturbation is increased after experiencing the same movement errors in two or three consecutive trials. This phenomenon cannot be predicted by conventional computational models, assuming that the recruitment of the motor primitives is determined only by the planned movement direction^10,11,12, by Bayesian framework³ nor by reinforcement learning based on ‘reward’^28,29,30. In addition, this facilitatory effect cannot be explained by a previous model where an update of the motor command depended on the executed movement directions¹³, because the hand-movement direction in our experiment was kept identical to the target direction using the force channel. Although it is possibile that the update of the motor command depends on the cursor movement directions (see Discussion in Gonzalez-Castro et al.¹³), this framework cannot solely explain why a blurred end-point position decreases the learning rate; if movement error is linearly processed, the ensemble-averaged movement errors are the same between blurred and non-blurred conditions, E[e_t+ξ_t]=E[e_t], where ξ_t denotes uncertainty. In contrast, our behavioural experiment validated our novel prediction (Fig. 4).

Our model also has strong power to explain a wide variety of other motor-learning-related phenomena^{1,2,3,4,5,6,7,8,19,20,22,23}. Although different models have been conventionally proposed to explain different types of phenomena, our model can explain these phenomena in a unified manner (that is, in a single model with the same parameters) (Figs 2 and 6, 7, 8, 9).

To account for phenomena such as savings, anterograde interference and spontaneous recovery, recent computational studies have proposed that a motor memory has multiple time constants (that is, fast and slow processes^1,2,20,22,33). Conversely, our model does not explicitly assume the presence of fast and slow motor-learning processes. Nevertheless, our model was able to account for these motor-learning phenomena, in addition to other types of phenomena that multiple timescale models cannot explain, such as structural learning or the change in learning rates due to uncertainty.

The explanatory power of our model is derived from the determination of the recruitment pattern of motor primitives based on the trial-by-trial variation of the PE. When the movement error is positive in consecutive trials, the PE is also predicted to be positive, and this positive PE activates a group of motor primitives responsible for compensating for the positive movement error. In these trials, a group of motor primitives responsible for compensating for a negative movement error remains inactivated and maintains the motor memory compensating for a negative movement error (Figs 7a and 9a). In contrast, a group of motor primitives for a near-zero PE is activated in the latter part of the learning phase independent of whether the movement error is positive or negative (Fig. 8a). Therefore, the motor primitives for a large PE are recruited in a task-dependent manner, but only at the beginning of the learning phase, whereas those for a small PE are recruited in a task-independent manner, but only in the latter part of the learning phase. The PE-dependent recruitment pattern of motor primitives explains why our model can reproduce savings, anterograde interference and spontaneous recovery. Furthermore, simulated relearning curves in Fig. 7d can be observed in an experiment in which subjects can use cognitive strategy to correct errors³⁴. Our model indicates that cognitive strategy can be partly explained from a mechanistic viewpoint.

Similarly, this recruitment feature can also explain why the trial-dependent characteristics of the perturbation influence the learning rate. When the perturbation changes from trial to trial, the PE also randomly fluctuates, activating different sets of motor primitives, which lead to a lower learning rate because the formation of the motor memory is distributed across a large portion of motor primitives. Conversely, when the perturbations are more predictable, such as when identical perturbations are repeated in consecutive trials, the PE can be more reliably predicted. This predictability of the PE activates the same sets of motor primitives, and thus the formation of the motor memory is concentrated in a small portion of motor primitives, leading to a higher learning rate. These results suggest a novel interpretation for how the brain processes movement-error information; the movement error is used both for motor learning and for determining which primitives are recruited for that motor learning.

It is well known that when a visuomotor rotation is abruptly imposed, the amount of motor-command correction in the subsequent trial is not proportional to the amount of rotation; rather, it decreases with the amount of rotation³². This phenomenon was previously explained by a Bayesian framework³² in which a larger the visuomotor rotation was associated with a larger difference between the planned cursor movement direction and the executed hand-movement direction, resulting in a decreased learning rate. However, when the amount of visuomotor rotation is gradually increased, such a reduction in the learning rate is not observed³⁵. The different adaptation behaviours between abrupt and gradual applications of visuomotor rotation can also be explained by our model framework. In the case of gradual visuomotor rotation, the movement error is very small and the PE is reliably predictable. Thus, the same group of motor primitives is always recruited, indicating that the learning rate is not affected by the difference between planned and executed movement directions. By contrast, abrupt visuomotor rotation results in greater movement error and the PE changes considerably, leading to a decrease in the learning rate.

We have theoretically shown that motor primitives should encode the information of p_t. In our model framework, however, we assumed that the motor primitives encode the prediction of e_t rather than the prediction of p_t itself, because e_t contains some information regarding p_t. Interestingly, a model in which the PE determines A_t has stronger explanatory power than a model in which the predicted p_t determines A_t (Fig. 10).

**Figure 10: Comparison of the prospective error model and perturbation prediction model.**

We also assumed that the PE is updated based on a simple linear updating equation with a constant α (equation (3)), but other candidates can be considered. An example is the Kalman filter³⁶, in which α can be modulated in each trial by uncertainty. In addition, ê_t can be updated based not only on ê_t−1, but also ê_t−2, ê_t−3 or a longer history of ê. Although a simple linear update of the PE is sufficient to reproduce many simulated phenomena in this study, we expect that the Kalman filter and a longer history will have stronger explanatory power than equation (3). Further study is needed to investigate how the PE is updated.

Our model was confirmed by an experiment involving only a 10-cm (ballistic) reaching movement. Thus, the current aspects may or may not be applicable to more general movements such as longer reaching movements and three-dimensional reaching. Future studies will be necessary to answer this problem, but we believe that the present ideas are also applicable to those movements, considering that the aspects of motor learning revealed by previous studies using the same experimental set-up have been confirmed for the other movements such as saccadic adaptation³⁷ and locomotion³⁸.

Furthermore, for simplicity, this study addressed with reaching movements towards a single target. However, we need to expand our model into one that can account for movement towards multiple targets. Adaptation effects in a reaching movement towards a single training target are generalized to movements towards other spatially distributed targets^10,11. The degree of generalization depends on the angular difference between the trained and tested target directions. To explain this generalization effect, one possible idea is to extend from a univariate function A_i(ê_t) to a bivariate function A_i(d_t, ê_t), where d_t is a target direction. There are several candidates for these extensions. For example, the PE and desired movement direction could be either additively integrated, that is, A_i(d_t, ê_t)=f_i(d_t)+g_i(ê_t) (f(·) and g(·) are functions), or multiplicatively integrated, that is, A_i,t(d_t, ê_t)=f_i(d_t)g_i(ê_t). Although recent studies support the multiplicative interaction as a strong candidate for the integration of multiple variables^14,15, this idea needs to be validated by conducting additional experiments.

Methods

Theoretical analysis

The averaged update rule across all possible realizations can be written as

After many trials, E[W_t+1] and E[W_t] converge to W, and we obtain equation (2). If p_t is independent of A_t and E[p_t]=0, E[p_tA_t]=E[p_t]E[A_t]=0 and because E[W₀] is 0. Thus, motor primitives can form and maintain motor memory in a randomly varying environment when A_t is correlated to p_t.

Update rule of PE

Prospective error is a predicted movement error based on the current prediction and the prediction error between the current prediction and the observed movement error. When the observed movement error is e_t and the true (noiseless) movement error is g_t, the observation process can be written as e_t=g_t+ξ_t, where ξ_t is the observation noise (sensory noise). Here, we assume a Gaussian noise whose mean is 0 and variance is as the observation noise. Recent studies reported that, even when there is no perturbation, movement error shows trial-to-trial variability²⁷. If the variability of movement error is available in our motor system (that is, our motor system can utilize a generative model of movement error g_t+1=g_t+ζ_t (ζ_t is a Gaussian noise whose mean is 0 and variance is )), our motor system can optimally predict the movement error in the next trial following

to minimize the variance of prediction error. Equation (3) is thus an optimal update of the PE when . Notably, this update rule is equivalent to a Kalman filter³⁶, but we did not assume any update of and for simplicity (see Discussion).

Recursive equation of motor command

We can derive the recursive equation of motor command (state-space representation of motor learning) when movement error decreases gradually. In this case, A(ê_t+1)=A(ê_t+α(e_t−ê_t))≃A(ê_t)+α A′(ê_t)(e_t−ê_t), where A′ is the derivative of A. When A_i is a Gaussian, multiplying the update equation of W_t (equation (1)) by A^T(ê_t+1) yields

where the learning rate is modulated by the inner product A(ê_t)A^T(ê_t+1). The inner product can be further calculated as , where N→∞ and με(−∞,∞) are assumed. The recursive equation can be rewritten as:

where is and both the forgetting and learning rate are modulated by (e_t−ê_t)². Therefore, a more predictable PE is associated with higher forgetting and learning rates (slower forgetting and faster learning).

Summary of computer simulations

By setting ê₀=e₀=0 and W₀=0, our simulation consisted of the following four steps:

Fitting our model to data from our experiment

Our model has four parameters: a forgetting rate λ, a learning rate η, an update rate of PE α and a width of motor primitives σ. First, assuming W_t=0 and ê_t=0, we determined α and σ by fitting the amount of error modification (equation (8)) to the data in training trials of group 1 (Fig. 4a, R²=0.9950), because f_t+1 is uncorrelated to e_t only in group 1. The assumptions, W_t=0 and ê_t=0, can be assumed only in data from group 1, because the average error in training trials of group 1 is 0 as a result of completely random cursor movements. Because the data were related to generated force and our model focused on movement direction, we scaled the equation, mf_t+1+n to fit for the data (m and n were best-fit parameters). This fitting yielded the best-fit σ/α=0.3586 × (360/2π), that is, we could not separate α and σ based on this data fitting. Next, we searched the best-fit λ, η, α and σ for the learning curve for group 1 in test trials, resulting in λ=0.9586, η=2.3913, α=0.8 (we searched the best α by setting α=0, 0.1, 0.2, ..., 0.9, or 1.0) and σ=0.2868 × (360/2π). Notably, we fit all of the parameters to the data from group 1 (R²=0.8638). However, our model can also predict the data from groups 2 and 3 (R²=0.7967 and R²=0.7968).

Fitting our model to data sets from previous studies

We fit our model to conventional data in ( http://crcns.org): data from Körding and Wolpert³¹, Wei and Körding³² and Thoroughman and Taylor³⁹. Parameters σ and α were set to the best-fit parameters for our experimental data, σ/α=0.3586 × (360/2π) and α=0.8. The best-fit forgetting and learning rates λ and η were identified for each data set.

Data from Körding and Wolpert

When error feedback includes uncertainty, the learning rate in our model is modulated by (equation (8)). If this factor is averaged across all of the possible uncertainty values, ξ_t, simple calculations yield ; therefore, the amount of error modification is . We scaled this equation, , to fit the data of Körding and Wolpert³¹, assuming that ê_t=0 (this assumption is correct because the averaged error across all of the trials was almost zero), σ_G=(18°, 30°, 36° and 60°) in the σ₀, σ_M, σ_L and σ_∞ conditions, respectively (Fig. 5a). Because our model focused on movement direction and their data focused on movement deviation, this scaling was necessary. R² was 0.9315, 0.9448, 0.9823 and 0.9786 for data of σ₀, σ_M, σ_L and σ_∞, respectively.

Data from Wei and Körding

We calculated the relationship between motor command at the (t+1)-th trial, x(t+1) and perturbation at the t-th trial, p(t), when the perturbation in each trial was randomly sampled from p=(−45°, −30°, −15°, 0°, 15°, 30°, 45°). This simulation was conducted for 30 simulation runs and 210 trials in each simulation run (the weight parameter W was reset to 0 at the beginning of each simulation run). When we compared the scaled motor commands mx(t+1)+n to the data of Wei and Körding³² (Fig. 5b), R² was 0.8947.

Data from Thoroughman and Taylor

Data from Thoroughman and Taylor³⁹ were related to adaptation to a curl force field with 16 targets. Because we did not consider multiple targets in our model (see Discussion), we fit our model to their data after moving average filtering. The size of the filter was 16 and weight was uniform, that is, the filtered error at the t-th trial ē_t was , where e_t represents movement error without the filtering. This filter can be expected to minimize the effect of the generalization of learning effects across different target directions. Figure 5c shows the filtered error. We scaled the movement error in our model, me(t)+n, to fit to their data. R² was 0.8240.

Perturbation prediction model

We theoretically proved that A_t should encode the information for perturbation p_t. Here, we assumed a perturbation prediction model in which A_t is determined by , where is a predicted perturbation and updated by . We compared the PE model and the perturbation prediction model based on numerical simulations of spontaneous recovery (Fig. 10). Because we are not sure how the subjects predicted p_t in error-clamp trials, was forcibly set to 0 or −30 (perturbation just before the error-clamp trials).

Behavioural experiment

Thirty-six healthy, right-handed volunteers (22 males, 14 females, aged 18–38 years) participated in this study and were paid for their time. The participants were pseudo-randomly assigned to one of the six experimental groups, group 1 CW, group 1 CCW, group 2 CW, group 2 CCW, group 3 CW or group 3 CCW, where CW indicates clockwise rotation (−30° rotation) and CCW indicates counter-clockwise rotation (30° rotation). The numbers of females and males were the same in group 1 CCW and in group 2 CCW (three males and three females) and among group 1 CW, group 2 CW, group 3 CW and group 3 CCW (four males and two females). The subjects had no cognitive or motor disorders and were naïve to the concept of visuomotor rotation and the purpose of the experiment. All participants were clearly informed of the experimental procedures in accordance with the Declaration of Helsinki and provided written informed consent before the experiment began. All procedures were approved by the ethics committee of the Graduate School of Education at the University of Tokyo.

Participants were asked to make pointing movements with their right arm while holding the handle of the manipulandum (Phantom 1.5 HF; Geomagic, Rock Hill, SC, USA). The handle position was displayed as a white cursor (a 6-mm circle) on a black background on a horizontal screen located above their hand. The movement of the handle was constrained to a virtual horizontal plane (10 cm below the screen) that was implemented by a simulated spring (1.0 kN m⁻¹) and dumper (0.1 N per (m s⁻¹)). A brace was used to reduce unwanted wrist movement. Upper trunk motion was constrained by a harness. Before each trial, participants were required to hold the cursor at its starting position (a 10-mm circle). After a 2-s holding time, a grey target (a 10-mm circle) appeared. After an additional randomly selected holding time (250–350 ms), the target colour changed to purple, signalling the participant to initiate a pointing movement. Subjects were required to move the handle with a peak velocity of 470±45 mm s⁻¹ (the target velocity was calculated using the minimum-jerk theory with a movement amplitude of 10 cm and a duration of 0.4 s). A warning message appeared on the screen if the movement velocity of the handle rose above (‘fast’) or fell below (‘slow’) this threshold value. Subjects were also required to move the handle with an amplitude of 10 cm. When the movement amplitude was 10 cm, the sound of an explosion was produced. At the end of each trial, the handle was automatically moved back to the starting position by the manipulandum.

In training trials (force channel trials), we used the ‘error-clamp’ method^1,40,41. During error-clamped trials, the trajectory of the handle was constrained to a straight line towards the target by a virtual ‘channel’ in which any motion perpendicular to the target direction was constrained by a one-dimensional spring (2.5 kN m⁻¹) and damper (25 N/(m/s)).

Manipulandum motion data were recorded at a sampling rate of 500 Hz. Motion data were low-pass filtered using a fourth-order Butterworth filter with a 10-Hz cutoff. Movement onset time was defined as the first time point during which hand-movement velocity first exceeded 10% of its peak value for at least 50 ms.

For the second trial of the test trials with visuomotor rotation, one of the 12 subjects in group 2 showed an outlying behaviour. The mean movement angle in group 2 at the trial μ was 27.6944, the s.d. σ was 11.6704 and the movement angle of this subject in this trial was 62.8017, which is larger than μ+3σ. Thus, we eliminated this outlying data point from our analysis. Notably, this elimination of the outlier did not affect our results at all.

To determine whether learning speed was different among groups 1 (CCW and CW), 2 (CCW and CW) and 3 (CCW and CW), we conducted a bootstrap sampling and a randomization test. For bootstrap sampling, the learning speed was sampled 3,000 times in each group, and we calculated the mean value of the 3,000 sampled learning speeds. To determine whether the mean values of each group were significantly different, randomization tests were conducted. In each randomization test, the bootstrap-sampled learning speeds in groups 1 and 2 (1 and 3, or 2 and 3) were intermingled and randomly divided into two groups. We calculated the difference in the mean values of each randomized group and counted how many times this difference was larger than the difference of the mean learning speed (0.1410 for group 1, 0.2845 for group 2 and 0.3037 for group 3) to calculate P-values for the randomization tests.

Additional information

How to cite this article: Takiyama, K. et al. Prospective errors determine motor learning. Nat. Commun. 6:5925 doi: 10.1038/ncomms6925 (2015).

References

Smith, M. A., Ghazizadeh, A. & Shadmehr, R. Interacting adaptive processes with different timescales underlie short-term motor learning. PLoS Biol. 4, e179 (2006).
Article Google Scholar
Körding, K. P., Tenenbaum, J. B. & Shadmehr, R. The dynamics of memory as a consequence of optimal adaptation to a changing body. Nat. Neurosci. 10, 779–786 (2007).
Article Google Scholar
Burge, J., Ernst, M. O. & Banks, M. S. The statistical determinants of adaptation rate in human reaching. J. Vis. 8, 20 (2008).
Article Google Scholar
Wei, K. & Köding, K. P. Uncertainty of feedback and state estimation determines the speed of motor adaptation. Front. Comput. Neurosci. 4, 11 (2010).
PubMed PubMed Central Google Scholar
Braun, D. A., Aertsen, A., Wolpert, D. M. & Mehring, C. Motor task variation induces structural learning. Curr. Biol. 19, 352–357 (2009).
Article CAS Google Scholar
Turnham, E. J. A., Braun, D. A. & Wolpert, D. M. Facilitation of learning induced by both random and gradual visuomotor task variation. J. Neurophysiol. 107, 1111–1122 (2012).
Article Google Scholar
Kobak, D. & Mehring, C. Adaptation paths to novel motor tasks are shaped by prior structure learning. J. Neurosci. 32, 9898–9908 (2012).
Article CAS Google Scholar
Braun, D. A., Aertsen, A., Wolpert, D. M. & Mehring, C. Learning optimal adaptation strategies in unpredictable motor tasks. J. Neurosci. 29, 6472–6478 (2009).
Article CAS Google Scholar
Braun, D. A., Waldert, S., Aertsen, A., Wolpert, D. M. & Mehring, C. Structure learning in a sensorimotor association task. PLoS ONE 5, e8973 (2010).
Article ADS Google Scholar
Thoroughman, K. A. & Shadmehr, R. Learning of action through adaptive combination of motor primitives. Nature 407, 742–747 (2000).
Article CAS ADS Google Scholar
Donchin, O., Francis, J. T. & Shadmehr, R. Quantifying generalization from trial-by-trial behavior of adaptive systems that learn with basis functions: theory and experiments in human motor control. J. Neurosci. 23, 9032–9045 (2003).
Article CAS Google Scholar
Tanaka, H., Sejnowski, T. J. & Krakauer, J. W. Adaptation to visuomotor rotation through interaction between posterior parietal and motor cortical areas. J. Neurophysiol. 102, 2921–2932 (2009).
Article Google Scholar
Gonzalez Castro, L. N., Monsen, C. B. & Smith, M. A. The binding of learning to action in motor adaptation. PLoS Comput. Biol. 7, e1002052 (2011).
Article ADS Google Scholar
Yokoi, A., Hirashima, M. & Nozaki, D. Gain field encoding of the kinematics of both arms in the internal model enables flexible bimanual action. J. Neurosci. 31, 17058–17068 (2011).
Article CAS Google Scholar
Brayanov, J. B., Press, D. Z. & Smith, M. A. Motor memory is encoded as a gain-field combination of intrinsic and extrinsic action representations. J. Neurosci. 32, 14951–14965 (2012).
Article CAS Google Scholar
Ferrera, V. P. & Barborica, A. Internally generated error signals in monkey frontal eye field during an inferred motion task. J. Neurosci. 30, 11612–11623 (2010).
Article CAS Google Scholar
Popa, L. S., Hewitt, A. L. & Ebner, T. J. Predictive and feedback performance errors are signaled in the simple spike discharge of individual Purkinje cells. J. Neurosci. 32, 15345–15358 (2012).
Article CAS Google Scholar
Krakauer, J. W., Ghilardi, M.-F. & Ghez, C. Independent learning of internal models for kinematic and dynamic control of reaching. Nat. Neurosci. 2, 1026–1031 (1999).
Article CAS Google Scholar
Zarahn, E., Weston, G. D., Liang, J., Mazzoni, P. & Krakauer, J. W. Explaining savings for visuomotor adaptation: linear time-invariant state-space models are not sufficient. J. Neurophysiol. 100, 2537–2548 (2008).
Article Google Scholar
Berniker, M. & Körding, K. P. Estimating the relevance of world disturbances to explain savings, interference and long-term motor adaptation effects. PLoS Comput. Biol. 7, e1002210 (2011).
Article MathSciNet CAS ADS Google Scholar
Krakauer, J. W., Ghez, C. & Ghilardi, M. F. Adaptation to visuomotor transformations: consolidation, interference, and forgetting. J. Neurosci. 25, 473–478 (2005).
Article CAS Google Scholar
Sing, G. C. & Smith, M. A. Reduction in learning rates associated with anterograde interference results from interactions between different timescales in motor adaptation. PLoS Comput. Biol. 6, e1000893 (2010).
Article ADS Google Scholar
Kojima, Y., Iwamoto, Y. & Yoshida, K. Memory of learning facilitates saccadic adaptation in the monkey. J. Neurosci. 24, 7531–7539 (2004).
Article CAS Google Scholar
Stollhoff, N., Menzel, R. & Eisenhardt, D. Spontaneous recovery from extinction depends on the reconsolidation of the acquisition memory in an appetitive learning paradigm in the honeybee (Apis mellifera). J. Neurosci. 25, 4485–4492 (2005).
Article CAS Google Scholar
Shadmehr, R. & Mussa-Ivaldi, F. A. Adaptive representation of dynamics during learning of a motor task. J. Neurosci. 14, 3208–3224 (1994).
Article CAS Google Scholar
Krakauer, J. W., Pine, Z. M., Ghilardi, M. F. & Ghez, C. Learning of visuomotor transformations for vectorial planning of reaching trajectories. J. Neurosci. 20, 8916–8924 (2000).
Article CAS Google Scholar
van Beers, R. J. Motor learning is optimally tuned to the properties of motor noise. Neuron 63, 406–417 (2009).
Article CAS Google Scholar
Huang, V. S., Haith, A., Mazzoni, P. & Krakauer, J. W. Rethinking motor learning and savings in adaptation paradigms: model-free memory for successful actions combines with internal models. Neuron 70, 787–801 (2011).
Article CAS Google Scholar
Izawa, J. & Shadmehr, R. Learning from sensory and reward prediction errors during motor adaptation. PLoS Comput. Biol. 7, e1002012 (2011).
Article CAS ADS Google Scholar
Pekny, S. E., Criscimagna-Hemminger, S. E. & Shadmehr, R. Protection and expression of human motor memories. J. Neurosci. 31, 13829–13839 (2011).
Article CAS Google Scholar
Körding, K. & Wolpert, D. Bayesian integration in sensorimotor learning. Nature 427, 244–247 (2004).
Article ADS Google Scholar
Wei, K. & Körding, K. Relevance of error: what drives motor adaptation? J. Neurophysiol. 101, 655–664 (2008).
Article Google Scholar
Lee, J. Y. & Schweighofer, N. Dual adaptation supports a parallel architecture of motor memory. J. Neurosci. 29, 10396–10404 (2009).
Article CAS Google Scholar
Fernandez-Ruiz, J., Wong, W., Armstrong, I. T. & Flanagan, J. R. Relation between reaction time and reach errors during visuomotor adaptation. Behav. Brain Res. 219, 8–14 (2011).
Article Google Scholar
Honda, T., Hirashima, M. & Nozaki, D. Adaptation to visual feedback delay influences visuomotor learning. PLoS ONE 7, e37900 (2012).
Article CAS ADS Google Scholar
Wolpert, D. M., Ghahramani, Z. & Jordan, M. An internal model for sensorimotor integration. Science 269, 1880–1882 (1995).
Article CAS ADS Google Scholar
Ethier, V. & Zee, D. S. Shadmehr, Spontaneous recovery of motor memory during saccade adaptation. J. Neurophysiol. 99, 2577–2583 (2008).
Article Google Scholar
Mawase, F., Shmuelof, L., Bar-Haim, S. & Karniel, A. Savings in locomotor adaptation explained by changes in learning parameters following initial adaptation. J. Neurophysiol. 111, 1444–1454 (2014).
Article Google Scholar
Thoroughman, K. & Taylor, J. Rapid reshaping of human motor generalization. J. Neurosci. 25, 8948–8953 (2005).
Article CAS Google Scholar
Sing, G. C., Joiner, W. M., Nanayakkara, T., Brayanov, J. B. & Smith, M. A. Primitives for motor adaptation reflect correlated neural tuning to position and velocity. Neuron 64, 575–589 (2009).
Article CAS Google Scholar
Scheidt, R. A., Reinkensmeyer, D. J., Conditt, M. A., Zev Rymer, W. & Mussa-Ivaldi, F. A. Persistence of motor adaptation during constrained, multi-joint, arm movements. J. Neurophysiol. 84, 853–862 (2000).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by a Grant-in-Aid for Japan Society for the Promotion of Science Fellows (13J06713), a Grant-in-Aid for Scientific Research on Innovative Areas (26120723) to K.T., the Funding Program for Next-Generation World-Leading Researchers (LS034) and a Grant-in-Aid for Scientific Research (A) (26242062) to D.N. We thank K. Abe, S. Maehiro-Monai and A. Sugiura for their assistance.

Author information

Authors and Affiliations

Brain Science Institute, Tamagawa University, Machida-shi, Tokyo, 194-8610, Japan
Ken Takiyama
Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka University, Suita, 565-0871, Osaka, Japan
Masaya Hirashima
Graduate School of Education, The University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
Daichi Nozaki

Authors

Ken Takiyama
View author publications
You can also search for this author in PubMed Google Scholar
Masaya Hirashima
View author publications
You can also search for this author in PubMed Google Scholar
Daichi Nozaki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.T. and M.H. designed and performed the experiments. K.T. performed the analyses and wrote the manuscript. D.N. oversaw the experiments, analyses and writing.

Corresponding author

Correspondence to Ken Takiyama.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Takiyama, K., Hirashima, M. & Nozaki, D. Prospective errors determine motor learning. Nat Commun 6, 5925 (2015). https://doi.org/10.1038/ncomms6925

Download citation

Received: 27 October 2014
Accepted: 21 November 2014
Published: 30 January 2015
DOI: https://doi.org/10.1038/ncomms6925

This article is cited by

Recognition capability of one’s own skilled movement is dissociated from acquisition of motor skill memory
- Nobuaki Mizuguchi
- Shohei Tsuchimoto
- Kazuyuki Kanosue
Scientific Reports (2021)
Optimizing motor decision-making through competition with opponents
- Keiji Ota
- Mamoru Tanae
- Ken Takiyama
Scientific Reports (2020)
Larger, but not better, motor adaptation ability inherent in medicated Parkinson’s disease patients revealed by a smart-device-based study
- Ken Takiyama
- Takeshi Sakurada
- Taiki Komatsu
Scientific Reports (2020)
Speed-dependent and mode-dependent modulations of spatiotemporal modules in human locomotion extracted via tensor decomposition
- Ken Takiyama
- Hikaru Yokoyama
- Kimitaka Nakazawa
Scientific Reports (2020)
A data-driven approach to decompose motion data into task-relevant and task-irrelevant components in categorical outcome
- Daisuke Furuki
- Ken Takiyama
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

General framework

Theoretical considerations in randomly changing environments

Prospective error

Numerical simulation in randomly changing environments

Behavioural experiment

Reproduction of other phenomena

Effect of uncertainty on learning speed

Savings

Anterograde interference

Spontaneous recovery

Discussion

Methods

Theoretical analysis

Update rule of PE

Recursive equation of motor command

Summary of computer simulations

Fitting our model to data from our experiment

Fitting our model to data sets from previous studies

Data from Körding and Wolpert

Data from Wei and Körding

Data from Thoroughman and Taylor

Perturbation prediction model

Behavioural experiment

Additional information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links