Literature DB >> 36040962

What is the nature of motor adaptation to dynamic perturbations?

Etienne Moullet¹, Agnès Roby-Brami¹, Emmanuel Guigon¹.

Abstract

When human participants repeatedly encounter a velocity-dependent force field that distorts their movement trajectories, they adapt their motor behavior to recover straight trajectories. Computational models suggest that adaptation to a force field occurs at the action selection level through changes in the mapping between goals and actions. The quantitative prediction from these models indicates that early perturbed trajectories before adaptation and late unperturbed trajectories after adaptation should have opposite curvature, i.e. one being a mirror image of the other. We tested these predictions in a human adaptation experiment and we found that the expected mirror organization was either absent or much weaker than predicted by the models. These results are incompatible with adaptation occurring at the action selection level but compatible with adaptation occurring at the goal selection level, as if adaptation corresponds to aiming toward spatially remapped targets.

Entities: Chemical

Mesh：

Year: 2022 PMID： 36040962 PMCID： PMC9467354 DOI： 10.1371/journal.pcbi.1010470

Source DB: PubMed Journal: PLoS Comput Biol ISSN： 1553-734X Impact factor: 4.779

Introduction

Motor behavior is both highly stable and widely flexible [1-3]. On the one hand, a large repertoire of skilled, efficient behaviors (e.g. speech production, handwriting, gait, …) is maintained for decades, often robust in the face of injury, aging, disease or brain damage. On the other hand, a few movements performed in a novel sensorimotor environment (e.g. wearing prismatic glasses, holding a visco-elastic manipulandum, …) or in some altered physiological state (e.g. muscular fatigue, pain, …) can induce lasting changes in motor performance [4-7]. A proper balance between stability and flexibility is necessary so that (1) ingrained skills remain sensitive to steady and persistent changes in the environment, the body and the nervous system but are not disproportionately influenced by temporary, incidental events; and (2) new skills can develop at any time. How then is skilled movement organized in response to these contrasting priorities? Motor learning and skill acquisition are generally understood from two distinct viewpoints [3]. The first view holds that learning occurs at the action selection (control) level, and modifies the mapping between the intended goals and those actions inclined to achieve these goals (Fig 1, purple). For instance, in the typical laboratory example of adaptation to a velocity-dependent force field (dynamic perturbation; [4]), learning has been described either as a compensation process, i.e. mapping is learned between states and compensatory forces opposite to the applied forces ([4]; Fig 1B, left and center; see also Fig 1C, left and center for the case of a visuomotor rotation), or as a reoptimization process, i.e. mapping is learned between goals and optimal forces to achieve the goals in the presence of the applied forces [8]. According to the second view, learning occurs at the goal selection level and modifies the mapping between intended and actual goals irrespective of how to achieve these goals (Fig 1B, right). For instance, adaptation to a visuomotor rotation of the visual display (kinematic perturbation) results from a redirection process, i.e. a remapping between target and movement vectors ([9]; Fig 1C, right). Although the latter learning process appears more flexible and frugal than the former, it is unclear whether it can account for adaptation to dynamic perturbations, i.e. when new patterns of force need to be learned.

Fig 1

Goal selection vs action selection.

Goal selection vs action selection.

A. The motor system contains: (1) a process, called action selection (AS; purple), which translates a current goal (e.g. a target to reach) into the proper displacement of the current effector (e.g. the arm) toward the goal (the target); (2) a process, called goal selection (GS; orange), which provides the current goal for a given task. B. Schematic of adaptation to a force field perturbation (only the early phase of movement is described). The small circle is the starting position, the double circle the goal position, the black arrow the planned displacement, and the gray arrow the actual (or observed) displacement. (left) For a planned displacement toward the goal position, the force field (black leftward arrows) induces an initial actual displacement in the direction of the perturbation. (center) Adaptation at the AS level consists in keeping the same goal position and applying compensatory forces (purple rightward arrows). (right) Adaptation at the GS level consists in re-aiming toward a new goal position (orange double circle). C. Schematic of adaptation to a rotation of the visual display. (left) For a planned displacement toward the goal position, the rotation induces an initial actual displacement in the direction of the rotation. (center) Adaptation at the AS level consists in keeping the same goal position and applying compensatory rotation (purple rightward arrow). (right) Adaptation at the GS level consists in re-aiming toward a new goal position (orange double circle). Models based on compensation or reoptimization are well formulated models that can be used to make predictions on adaptation to dynamic perturbations (velocity-dependent force fields; [4,8]). In particular, the shape of after-effect trajectories, i.e. late trajectories in the absence of the force field after adaptation, should incorporate a "negative image" of the forces induced by the applied force field, a reflection which mirrors before-effect trajectories, i.e. early trajectories in the presence of the force field before adaptation (this is exactly the case for the compensation model; [4]). The shape of before-effect trajectories has been thoroughly documented. They are initially curved "away" from the baseline (unperturbed) trajectory with a late ensuing correction toward the target [4]. We have not identified any study that quantitatively documents the shape of after-effect trajectories. Yet qualitative observations on published figures suggest that after-effect trajectories do not obey the predicted mirror organization (Fig 2 in [10]; Fig 4 in [11]; Fig 1 in [12]; Fig 2 in [13]; Fig 2 in [14]; Fig 1B in [15]). In fact after-effect trajectories seem to resemble "kinematic" trajectories, i.e. trajectories observed during visuomotor rotation or target jump tasks rather than "dynamic" trajectories observed during force field tasks (examples of contrast between kinematic and dynamic trajectories in Fig 2 in [16]; Fig 6 in [17]). They might thus be compatible with a redirection process, as if adaptation corresponded to aiming toward spatially remapped targets. The main goal of this study is to clarify the nature of before-effect and after-effect trajectories during a force field adaptation task in order to assess the pertinence of the compensation/reoptimization process as a basis for motor adaptation. A secondary goal is to promote the redirection process as a promising candidate for motor adaptation.

Results

We designed a force field adaptation experiment with a large number of trials and a small fraction of catch trials (unexpected addition or removal of the force field) to obtain "pure" before-effect and after-effect trajectories uncontaminated by ongoing learning processes [18]. Twenty two participants were asked to make fast, planar, forward arm reaching movements from a start position to a target position located 0.1-m away in the presence of a null field or a perpendicular clockwise (CW) or counterclockwise (CCW) velocity-dependent force field (Fig 2A and 2B). The participants performed four blocks of trials (Fig 2C) and we identified baseline, before-effect, adapted, and after-effect trajectories (see ). For data analysis, all trajectories were displayed with a CW deviation, i.e. for a CCW perturbation, a vertical symmetry was applied to the trajectories. A trajectory was described by (1) the angle (counted positive in the CCW direction) of its tangent relative to the target direction (Fig 2D); (2) the time derivative of the trajectory angle (see for details).

Fig 2

Description of the experiment.

A. Experimental setup. (left) Top view. The small open circle is the start position and the large open circle the target position. The black circle is the robot handle. The elongated open rectangle is a top view of a monitor. (right) Front view. The start position, target position, and visual feedback of hand position (black circle) are shown on the monitor. The black rectangle is the robot handle. The scales are not respected. B. Simulated velocity-dependent force field. A minimum-jerk velocity profile with a 0.3 m/s peak was multiplied by a 5 N/m force field. Vertical scale: 0.01 m. Horizontal scale: 1 N. C. Experimental protocol. The force field level (null or CW) is indicated by the horizontal black (baseline block), gray (before-effect and adaptation blocks) or green (adapted and after-effect blocks) thick line segments. The vertical line segments indicate catch trials: unexpected CW force field in the before-effect block (red); unexpected null force field in the adaptation block (gray) and in the after-effect block (blue). Only the colored trials (black: baseline; red: before-effect; green: adapted; blue: after-effect) were analyzed. D. Graphical definition of the trajectory angle. At one point along the trajectory (open square), the trajectory angle is the angle between start position/target position direction (dashed line) and the tangent to the trajectory (thick line).

Description of the experiment.

Predictions

The compensation model [4] makes immediate predictions on the shape of before-effect and after-effect trajectories and corresponding velocity profiles (S1 Fig). These prediction will not be further considered: they are robust but lack pertinence as the compensation model is not a general model of motor control (see ). In order to build precise predictions for the reoptimization model, we proceeded in the following way. We considered one participant (P7) and analyzed detailed characteristics of her motor behavior (Fig 3). We calculated the mean baseline and before-effect trajectories (Fig 3A) and velocity profiles (Fig 3B). For each single trial (e.g. a baseline trial; Fig 3C), we calculated a discrete measure of the frequency content (peak frequency; number of minima+number of maxima/duration/2) of velocity, acceleration and jerk traces. We plotted the peak frequency of all trials for the two types of trial (Fig 3G). These results show that the smoothness of mean trajectories and velocity profiles (Fig 3A and 3B) is an artifact of averaging widely nonsmooth and variable single trials (Fig 3C and 3G). Although these observations are not surprising [19,20], they cannot be explained by models that produce temporally invariant smooth movements [21-23]. To circumvent this difficulty, we considered a model which explains the frequency content of movements (Fig 3C and 3G) by the pursuit of intermediate goals (via-points) updated at ~8 Hz (see ; [20,24]. We searched for a series of via-points S and model parameters that account for experimental paths and velocity profiles of baseline and before-effect trajectories (Fig 3D and 3E; for a parametric study of the model, see below). The series S contained three intermediate via-points (squares; Fig 3D) at 32, 64 and 96% of the distance to the target in the direction of the target, and the target itself (circle; Fig 3D). Note that we did not search for the "best fit", as all single trials were different (Fig 3C and 3G). Note also that the intensity of the modeled force field (φ) was lower than that of the experimental field (see ). The amplitude and frequency contents of the resulting movement were consistent with the experimental data (Fig 3F and 3G). At this stage, the proposed model is appropriate for trajectory formation and online motor control during perturbations and further accounts for many characteristics of motor behavior [24]. We can now obtain proper predictions for the reoptimization model (Fig 4). The adapted trajectory was not a straight path but an overcompensation (green; Fig 4A) which is consistent with [8]. Its velocity profile was close to the baseline velocity (green vs black; Fig 4B). The after-effect trajectory had the expected mirror organization relative to the before-effect trajectory (blue vs red; Fig 4A) and a velocity profile which resembled the before-effect profile (blue vs red; Fig 4B). The mirror effect is quantitatively described in Fig 4C and 4D. The trajectory angles had opposite monotonic trends for before-effect and after-effect trajectories over the first ~0.6 s (blue vs red; Fig 4C) with corresponding changes in the sign of the derivatives (blue vs red; Fig 4D). In the following, we will focus on the early part of the trajectories (0.4 s; dotted boxes in Fig 4C and 4D; 4E and 4F) since trajectory averaging for experimental data may produce unreliable results for the late part of the trajectory. Two quantitative observations are relevant: (1) the angle derivative of the before-effect trajectory became positive at 0.29 s (vertical red dashed line; Fig 4F). This result is consistent with experimental data in P7 and across all the participants (S2 Fig); (2) the angle derivative of the after-effect trajectory became negative at 0.31 s (vertical blue dashed line; Fig 4F) which means that the derivative is negative 22.5% of the time during the first 0.4 s. For comparison with experimental data, we will use this number rather than the time of change in sign which might not be well defined in the data (e.g. due to multiple changes in sign).

Fig 3

Model adjustment based on data of participant P7.

A. Mean baseline (black; 17 trials) and before-effect (red; 19 trials) trajectories for P7. Scale: 0.02 m. B. Mean velocity profiles of baseline and before-effect trajectories for P7. C. Velocity (scale 0.1 m/s), acceleration (scale 2 m/s2) and jerk (scale 30 m/s3) profiles for a single baseline trial (P7). Time scale: 0.1 s. D. Simulated baseline (plain black) and before-effect (plain red) trajectories compared to experimental trajectories (dashed; data from A). Squares are via-points for the simulated trajectories. Same scale as in A. E. Simulated velocity profiles. F. Velocity (same as black in E), acceleration and jerk profiles for the simulated baseline trajectory. The profiles have been truncated to match the duration of the trial in C. Same scales as in C. G. Peak frequency for velocity (orange), acceleration (light green) and jerk (light blue) profiles for individual trials (small dots). Large circles correspond to the trial in C. Large squares correspond to the simulated trial in F. Thick lines are mean values and boxes indicate 25–75 percentiles.

Fig 4

Model predictions.

Model adjustment based on data of participant P7.

Model predictions.

A. Simulated adapted (green) and after-effect (blue) trajectories corresponding to simulated baseline (black) and before-effect (red) trajectories shown in Fig 3D and reproduced here with thin lines. Scale: 0.02 m. B. Simulated velocity profiles. C. Trajectory angle for A. D. Trajectory angle derivative for A. E. Zoom on trajectory angle (dotted box in C). F. Zoom on trajectory angle derivative (dotted box in D). Vertical dashed lines indicate the time of change in derivative sign. On the one hand, the expected positive sign of the derivative of the after-effect trajectory angle would add support to the reoptimization model. On the other hand, a null or negative derivative would contradict the reoptimization model.

Two participants

Results for participant P7 shown in Fig 5 (same format as in Fig 4) followed the typical pattern observed in force-field adaptation experiments [4,8]: 1. The mean baseline trajectory was straight (black; Fig 5A); 2. The mean before-effect trajectory deviated in the direction of the perturbation with a late hook-like correction (red; Fig 5A); 3. The mean adapted trajectory was straighter than the mean before-effect trajectory but not as straight as the baseline trajectory (green; Fig 5A); 4. The mean after-effect trajectory was deviated in the direction opposite to the perturbation (blue; Fig 5A); 5. The velocity profiles had a large initial peak followed by one or more smaller peaks (Fig 5B); 6. Single trial trajectories were variable but consistent with the mean trajectory (Fig 5C); 7. Lateral deviation decayed exponentially across trials (R2 = 0.69; Fig 5D).

Fig 5

Data of participant P7.

Data of participant P7.

A. Mean trajectories. Scale: 0.02 m. B. Mean velocity profiles. C. Single trials (17, 19, 107, 34 trials, from left to right). Same scale as in A. D. Changes in lateral deviation (maximum of trajectory deviation from the start position/target position line) with training. The data were taken from C (red and green). Fitting an exponential decay is shown. E. Mean trajectory angle over the first 0.4 s with the 95% confidence interval. Inset: mean trajectory angle over the first 0.8 s; the box indicates the 0–0.4 s window. Scale: 0.1 s, 60 deg. F. Same as E for mean trajectory angle derivative. Scale: 0.1 s, 200 deg/s. G. p-value of a test ≠0 vs = 0 for trajectory angle derivative in F. The dotted line indicates 0.05. H. Bayes factor for the test ≠0 vs = 0. The dotted lines delimitate regions of interpretation of Bayes factors. To test the reoptimization model, we analyzed the time course of the mean trajectory angle (Fig 5E) and trajectory angle derivative (Fig 5F). As expected, the mean angle of the before-effect trajectory decreased until ~0.3 s and then increased (red; Fig 5E and 5F and inset). The mean angle of the after-effect trajectory was initially approximately constant and then decreased (blue; Fig 5E and 5F and inset). To assess the statistical significance of this observation, we performed a t-test on the sign of the angle derivative (H0: = 0 vs H1: ≠ 0; N = 34 trials) at each timestep. The corresponding p-value was >0.05 for the first 0.1 s (Fig 5G), which indicates that we cannot reject the hypothesis that the angle derivative is zero. The p-value was <0.05 after 0.1 s (Fig 5G), meaning that the angle derivative was significantly different from zero and negative. We calculated the Bayes factor bf10 for H1 vs H0 which indicated that the data were 1 to 5 times more likely under H0 than under H1 when p>0.05 (Fig 5H). A different behavior was observed for participant P5 (Fig 6). The mean before-effect and after-effect trajectories were symmetrically organized (Fig 6A and 6C and 6D). A statistical analysis indicated that the angle derivative of the after-effect trajectory was non-zero and positive between ~0.1 and ~0.3 s following movement onset (p-value <0.05, Fig 6E; bf10>3, Fig 6G). Although the behavior of this participant matches some predictions of the reoptimization model, the experimental path and velocity profile of the after-effect trajectory were different from the predicted path and velocity profile (Figs 6A and 6B vs 4A and 4B).

Fig 6

Data of participant P5.

Same organization as Fig 5 with C, D, E, F corresponding to E, F, G, H.

Data of participant P5.

Same organization as Fig 5 with C, D, E, F corresponding to E, F, G, H.

All participants

For each participant, we calculated the percentage of time over the first 0.4 s during which the angle derivative of the after-effect trajectory was statistically null or negative using both p-values and Bayes factors (see Fig 5G and 5H). The reoptimization model predicts that this percentage should be around 22.5 (Fig 4F). The experimental percentage was different from the predicted percentage for all the participants (Fig 7A and 7B). The behavior of ten participants (blue bars; Fig 7) was similar to the behavior of P7 (see Figs 5 and S3). The behavior of eight participants (dark blue bars; Fig 7) was similar to the behavior of P5 (see Fig 6). The quantitative results for these participants are shown exhaustively in S4 Fig. It can be observed that the behavior of these participants is rather homogeneous and differs qualitatively and quantitatively from the predicted behavior in terms of path and velocity profile. The two remaining participants (light blue bars; Fig 7) failed to improve their behavior with training (S5 Fig).

Fig 7

All participants.

A. Percentage of time over the first 0.4 s during which the angle derivative of the after-effect trajectory was statistically null or negative using t-test p-values. Dashed white line represents model prediction. Color code for the participants: blue, behavior incompatible with the reoptimization model; dark blue, behavior partially compatible with the reoptimization model; light blue, behavior with no effect of training. B. Same as A using Bayes factor.

All participants.

Redirection model

We simulated adaptation through redirection using an ad-hoc series of via-points S′ to obtain an adapted trajectory (green; Fig 8A, left) which resembles a real adapted trajectory (green; Fig 5C). We generated small variations around these via-points to obtain an ensemble of adapted trajectories (green; Fig 8A, right). The corresponding after-effect trajectories (blue; Fig 8A) resembled real after-effect trajectories (blue; Fig 5C). The velocity profiles, the trajectory angles and the trajectory angle derivatives were consistent (Fig 8B and 8C and 8D). As expected, we observed an absence of mirror effect, the angle derivative of after-effect trajectories being positive < 10% of the time in the first 0.4 s (Fig 8D).

Fig 8

Simulations of the redirection model.

A. (left) Adapted (green) and after-effect (blue) trajectories corresponding to a series of via-points (squares). (right) Multiple adapted and after-effect trajectories corresponding to variations of the series of via-points. B. Corresponding velocity profiles. C. Corresponding trajectory angles. D. Corresponding trajectory angle derivatives.

Simulations of the redirection model.

Parametric study of the reoptimization model

The conclusions of this study are highly dependent on the predictions of the reoptimization model. As the model contains parameters, it is important to understand the influence of these parameters on the proposed predictions. We explored the role of 3 parameters: the feedback delay Δ, the noise ratio σ/σ of motor to sensory noise variance used in the state estimator, and the muscle gain gsh/gel. The first two parameters modulate how sensory information participates in state estimation. The third parameter calibrates the contribution of shoulder and elbow torques to coordination. The trajectories, velocity profiles, trajectory angles and trajectory angle derivatives were consistent across variations of these parameters (S6, S7 and S8 Figs). The mirror organization between before-effect and after-effect trajectories was robustly observed. We note that the time at which the angle derivative of the before-effect trajectory becomes negative (Fig 4F) varied with the feedback delay (S6D Fig) and the torque ratio (S8D Fig). A set of parameters () specify the boundary conditions at the via-points, i.e. whether position, velocity, activation and excitation are forced to take specified values. The predictions were built with and (constraints only on position). The role of these parameters is illustrated in S9 Fig. Although they have little influence on acceleration (S9A Fig), they have a clear effect on jerk, with a higher level of jerk whenever the constraints are not applied exclusively to position (S9B Fig). Only the lower level of jerk (constraints on position) is consistent with experimental data (Fig 4C and 4D).

Discussion

Classical computational models of motor adaptation assume that learning occurs at the action selection level [4,8]. We derived predictions for these models which show that after-effect trajectories following adaptation to a velocity-dependent force field are close to a mirror of before-effect trajectories (Figs 4 and S1). Experimental data collected in twenty-two participants did not follow these predictions (Figs 5, 6, 7 and S3 and S4). We discuss implications and limitations of these observations. To open the discussion, we note that we have worked from the modeling prediction that a mirror organization should be observed between before-effect and after-effect trajectories following adaptation to a force field. Yet, at least in the case of adaptation to a visuomotor rotation, the strength and the shape of the after-effects may depend not only on the nature of the perturbation but also on the adaptation protocol, e.g. the presence of error-clamp trials [25]. It is unclear how we can account for this fact in the framework of reoptimization models. We also note that the present study is not concerned with how adaptation occurs on a trial-by-trial basis and with issues related to feedforward and feedback corrections [26]. Hundreds of force field adaptation studies have been performed since the seminal study of Shadmehr and Mussa-Ivaldi (1994) [4], but none of them have quantitatively documented properties of after-effect trajectories. Although many published figures could informally be used to gain qualitative information on before-effect and after-effect trajectories and their differences (to mention a few: Figs 3 and 7 in [27]; Fig 13 in [4]; see for other references), they are not sufficient to draw firm conclusions. The lack of a specific interest for after-effect trajectories might be related to the prevalent view in computational motor control that adaptation results from changes at the control level, and that properties of after-effect trajectories are a direct by-product of these changes [4,8]. In this framework, an after-effect trajectory reflects a kind of compensation that attempts to negate the forces induced by the applied force field and thus inheres to the properties of the force field itself. As velocity along the trajectory increases, the compensation force increases and the after-effect trajectory curves away from the baseline trajectory, thus depicting a mirror image of the before-effect trajectory. Both the compensation and the reoptimization models [4,8] obey to this premise (Figs 4 and S1). Nonetheless, the data presented in this study are incompatible with these models. The expected mirror organization was completely absent in 11/22 participants, and present but far smaller than expected in the remaining participants (Fig 7). Yet, we did not average data across the participants, provided single participant analyses (S3 and S4 Figs), and made the raw data available to give a chance to any possible interpretation. At this stage, it is interesting to consider the implication of adaptation at the action selection level. It would mean that each new adaptation requires the building of a dedicated control policy which inherits from general motor abilities (e.g. when we hold a manipulandum in a force field task, we do not need to relearn motor coordination from scratch), but remains insulated from general and specific skills (e.g. it does not interfere with our ability to walk or to play the piano). The corresponding motor architecture would come with a heavy computational burden to build, maintain, update, share and exploit each learned ability in each specific context. A solution based on the storage of multiple controllers has been proposed [28], but does not address the associated computational burden. Furthermore, its proposed implementation through the huge computational power of the cerebellum is probably incompatible with the predominant sensory nature of cerebellar processing [29,30]. Besides these computational issues, interlimb transfer of force field adaptation [31,32] and adaptation by mere observation [33] are also inconsistent with adaptation at the action selection level. The study of de Rugy et al (2012) which is often taken to argue against optimal motor control models, for once, would be consistent with our view [34]. They showed that human participants failed to reoptimize their muscle recruitment patterns following (virtual) changes in muscle actions. They interpreted their results by the existence of "habitual" coordination patterns that are unaffected by selective modifications of the peripheral apparatus. A further interpretation could be that there is no available mechanism for adaptation at the action selection level. A last, indirect argument is related to the contribution of the primary motor cortex (M1) to motor adaptation. Since there is strong evidence that M1 participates to low-level (e.g. muscular) aspects of motor control [35-37], a likely hypothesis is that M1 neurons are involved in processes subserving adaptation to dynamic perturbations (e.g. force field). Yet, Perich and Miller (2017) have shown that the directional selectivity of M1 neurons was modified by the application of a force field but remained unchanged during the course of adaptation [38]. Their results suggest that adaptation occurs upstream of M1 and is transmitted to M1 which is responsible for motor execution. An alternative view is that adaptation to a novel motor environment relies on changes at the goal selection level (redirection model), i.e. aiming toward appropriately chosen successive spatial goals (e.g. via-points) would mimic adaptation and after-effects in a force field (Fig 8). In the proposed scenario, the "memory" of the perturbation is not a continuous mapping between state and force but a discrete set of via-points. This scenario can be reproduced in a simulation by a multiple-step target jump protocol involving several intermediate via-points and the actual target where the via-points have been chosen by hand to obtain an adapted trajectory which is close to the baseline trajectory (Fig 8). Our data are not incompatible with this view. Yet, they cannot be said to support it since the proposed adaptation mechanism remains incompletely specified: there is no structured approach to select or learn to select proper via-points. Interestingly, the redirection model is versatile enough to account for the whole dataset. The absence of a mirror effect in some participants and its presence in others can be explained by a specific configuration of via-points (Fig 8A and 8B). Whether it is possible to find the location of the via-points for a given task remains an open question. It is tempting to assume that variability should be minimal at the via-points as in the experiment of Todorov and Jordan (2002) [23] in which the via-points are indicated to the participants (their Fig 3). Unfortunately, kinematic variability is so large (e.g. Fig 3G) that we cannot expect anything precise from an analysis of variability, suggesting that in the absence of a constrained trajectory, the via-points may be themselves be subject to variability from trial to trial. The proposed mechanism should not be linked to the kind of explicit, cognitive strategy that can be used to compensate for a visuomotor rotation simply by changing the aiming direction [39]. Here, the proper choice of via-points is conceived as the outcome of a learning process. What drives the learning process is not specified, but could possibly be cast in a cost/benefit framework (e.g. effort vs accuracy). We have no information on the explicit or implicit nature of the mechanism. Yet, in post-experiment interviews, we noted that several participants believed that the after-effect deviations were due to a force field and not to their own behavior, which suggests that they probably had little conscious control over their behavior once adapted to the perturbation. A possible concern is the seemingly ad hoc nature of the proposed scenario. However this scenario is derived from a consistent theoretical construct which accounts for the production of fast and slow movements, the distinction between discrete and rhythmic movements, the ubiquity of isochronous behaviors, the existence of scaling laws, power laws and speed-accuracy tradeoffs [24]. Thus motor control would involve a unique, general-purpose, task-independent action selection mechanism (controller) and each task would have its own representation defined as a series of successive intermediate goals updated at a fixed frequency and pursued at a fixed horizon. In this framework, a skilled movement is not defined by the operation of a dedicated, "skilled" controller, but the use of a dedicated, "skilled" task representation. Consider the following example. It is probable that none of the readers of this article have the tennis skills of a top ranked tennis player. Yet, for the most part, they would not be clumsy in activities of daily living, probably have their own motor skills, and should sometimes be able to produce a magnificent backhand worthy of a good tennis player. Accordingly, the difference between a novice and an expert would not be found at a control level, e.g. a difference in mastering coordination, but at the level of task representation, i.e. how successive goals are consistently set to properly elicit and guide actions. Our view of motor adaptation can effortlessly be cast in this framework. Interestingly, the computational burden associated with the storage of multiple controllers is significantly alleviated with the storage of multiple task representations. Task representations are discrete sets which are much more frugal in neural resources than continuous mappings. Furthermore, they can be scaled spatially and shared between effectors, accounting for motor equivalence. Issues related to the stability and flexibility of skills appear much less enigmatic when skills are conceived as task representations rather than controllers. The main limitation of this study is its strong reliance on computational modeling. Our conclusions are based on the divergence between experimental data and predictions of the compensation/reoptimization models. So it is fundamental to check that the proposed predictions are both robust and realistic. As far as robustness is concerned, there is no difficulty with the compensation model which is well-formulated and easy to simulate. However, this model has little general relevance for motor control as it does not provide solutions to central problems such as trajectory formation and coordination [23]. For this reason, we have not pursued comparisons with this model. The reoptimization model is based on optimal feedback control [23] and has been updated here to account for proper online feedback control [24]. It generates movements with realistic trajectories, velocity profiles, and amplitude and frequency contents (Fig 3). We have shown that its predictions are robust to parameter changes (S6, S7 and S8 Figs). An unsettled and interesting issue is related to the intensity of the applied force field. The predictions were obtained with a 2-Ns/m force field as compared to the 10-Ns/m field of the experiment. In the model, for a given perturbation intensity, the size of the lateral deviation of the before-effect trajectory is determined by the interplay between the operation of the state estimator and the dynamics of the arm. Two observations can be made. First, changes in parameters of the state estimator (feedback delay, noise ratio) can reduce the impact of the force field. Yet even a fine tuning of these parameters would not lead to a realistic trajectory deviation for a 10-Ns/m field. Second, parameters of the dynamics have also an influence on the response to perturbations. For instance, we have assessed the influence of the torque ratio (S8 Fig). This parameter reflects the relative efficiency of shoulder and elbow muscles, but its value is not easy to set as it depends on the physiological cross-sectional area, the innervation ratio, the moment arm and the modulation of force production by firing rate and recruitment in pools of motoneurons of each muscle. Furthermore, we cannot play freely with this parameter as it has a strong impact on the timing of the movement (S8D Fig). Other parameters of the dynamics cannot be modified as they pertain to intrinsic characteristics of the arm. We propose two ideas to obtain quantitatively more realistic deviations with respect to the intensity of the perturbation. The first idea is to use a more realistic dynamics for the modeled arm. For simplicity, we considered the control of a planar two-link arm. Yet the participants were free to use all available degrees of freedom from the trunk to the wrist. The corresponding kinematic chain would likely offer a larger inertial resistance to perturbations. The second idea is in fact an extension of the first one and invokes impedance to account for resistance to perturbations, i.e. not only inertia, but also viscosity and stiffness, could contribute to the resistance [40,41]. In the simulations, we used a long feedback delay (0.12 s) to clearly indicate that any kind of instantaneous, short-latency and medium-latency visco-elastic contributions of muscles and tendons remained unmodeled. A model of these contributions is feasible for perturbations about a static posture [42] but remains elusive for perturbations during ongoing movements. Note that the very efficient elastic feedback along the desired trajectory used in the compensation model cannot be included in the reoptimization model or the redirection model due to the absence of a desired trajectory.

Materials and methods

Computational modeling

We simulate displacements of a planar two-link arm whose dynamics are given by where θ = [θsh,θel] are the shoulder and elbow angles, M(θ) the inertia matrix, the vector of velocity-dependent torques, τ the control torque produced by actuators and τ the torque due to external forces applied on the arm. We define and where msh and mel are the link masses, lsh and lel the link lengths, Ish and Iel the moments of inertia, ssh and sel the distances from the joint center to the center of mass, and d3 = Iel. Displacements are perturbed by a velocity-dependent force field producing a force proportional to the velocity along the movement direction ψ (direction is measured relative to initial hand position and 0 is rightward) and perpendicular to this direction. The force field is described by where φ is the force level (φ>0 for a counterclockwise perturbation, φ<0 for a clockwise perturbation) and R(ψ) the rotation matrix of angle ψ. The perturbation torque is where J(θ) is the Jacobian matrix of the kinematics Parameters are: msh = 1.4 kg, mel = 1.1 kg, lsh = 0.3 m, lel = 0.33 m, ssh = 0.11 m, sel = 0.16 m, Ish = 0.025 kg m2, Iel = 0.045 kg m2. In all the simulations, the initial arm configuration is [45°, 90°], movement amplitude is 0.1 m, movement direction is ψ = 90°, and force (field) level is φ = 2 Ns/m. Four conditions are considered: baseline, in the absence of the force field; before-effect, in the presence of the force field before adaptation; adapted, in the presence of the force field after adaptation; after-effect, in the absence of the force field after adaptation.

Compensation model

The compensation model is taken from [4]. The principle is the following. First we derive a desired 1-s spatial trajectory for a 0.1-m forward displacement based on a 0.5-s 0.1-m long minimum-jerk trajectory [21] followed by a 0.5-s stationary posture. Second we use the arm inverse kinematics to obtain the desired angular trajectory θ*(t), and the arm inverse dynamics (Eq 1) to calculate the joint torques which produce the desired angular trajectory. Third we obtain actual angular trajectories using where τ is a compensation torque built by adaptation, and B a feedback gain along the desired trajectory ( Nm/rad, where is the 2×2 identity matrix). The four conditions are: baseline, ; before-effect, and τ = 0; adapted, and ; after-effect, and .

Reoptimization model

The reoptimization model is an extension of the model described in [8]. The control torque is derived from a control input u = [ush, uel] according to where i = {sh, el}, α is muscle activation, ε muscle excitation, g muscle gain and v the muscle time constant (linear second-order muscle model; [43]). We define a state vector and rewrite the dynamics (Eqs 1 and 3) as for the unperturbed dynamics or for the perturbed dynamics, where ndyn is additive noise on the dynamics. We formulate an optimal feedback control problem for this dynamics as a search for a control policy u(t) to reach a goal while minimizing the cost where F• is either F0 or F to indicate whether optimization applies to the unperturbed or the perturbed dynamics, and TH is the planning horizon [24]. In [8], optimization runs on a fixed duration (0.5 s) and thus cannot be used to simulate before-effect and after-effect conditions which require flexible time to produce online movement corrections. Control with a planning horizon offers an efficient solution to time flexibility as at any time and in any changing situation due to a perturbation there always remains the duration of a planning horizon to reach designated goals [24]. The initial boundary condition is given by , where is the estimated value of X(t) provided by an optimal state estimator using forward modeling and delayed sensory feedback with delay Δ [24,44]. The state estimator is given by where is the dynamics for estimation which is either F0 or F (see below), is a 4×8 observation matrix ( is the 4×4 identity and the 4×4 null matrix), indicating that only the position and velocity are observed, K(t) the Kalman gain and where nobs is additive observation noise. The Kalman gain is given by where and where δ is the integration timestep, Ω the covariance matrix of observation (sensory) noise nobs (4-dimensional, zero-mean, Gaussian random vector) and Ω the covariance matrix of dynamic (motor) noise ndyn (8-dimensional, zero-mean, Gaussian random vector). We take and where σ and σ are the variance of sensory and motor noise, respectively, and diag[] indicates the diagonal matrix with listed values on the diagonal. The state estimator is formulated to be optimal taking into account the feedback delay as explained in the Supplementary Notes of [23]. To control movement duration, the goal X# is updated every TG within a series of successive intermediate goals (via-points) S = {X0, X1,⋯,X} with X = X*, i.e. X# = X0 at t = 0, X# = X1 at time t = TG,⋯,X# = X at time t = nTG, where X* is the final goal of the movement [20,24]. The four conditions are: baseline: ; before-effect: (the trajectory is planned based on the unperturbed dynamics but executed against a perturbation), (the estimator is unaware of the perturbation); adapted, and (the trajectory is planned based on the perturbed dynamics and executed against a perturbation), (the estimator is tuned to the perturbed dynamics); after-effect, (the trajectory is planned based on the perturbed dynamics but executed in the absence of the perturbation), (the estimator remains tuned to the perturbed dynamics). The same series of via-points S is used in all the conditions. The fact that the estimator becomes adapted to the perturbed dynamics is consistent with experimental observations [45,46]. Parameters are: v = 0.05 s, gsh = 2, gel = 1, TH = 0.28 s, TG = 0.13 s, Δ = 0.12 s, δ = 0.01 s, σ = 1, σ = 1. The final goal state is X* = [60.7°, 60°, 0,0,0,0,0,0], i.e. the final shoulder and elbow angles corresponding to a 0.1-m forward displacement, zero final velocity, activation and excitation. The redirection model is taken from [24] and customized to the current formulation. The baseline and before-effect conditions are the same as for the reoptimization model. In the adapted and after-effect conditions, the cost function is , i.e. the controller is unaware of the perturbation, but the series of via-points S used in the baseline and before-effect conditions is replaced by a new series of via-points S′ which defines adaptation. Like the controller, the estimator remains unaware of the perturbation ().

Numerical solution

The reoptimization and redirection models are simulated numerically using the iLQR method proposed by Li and Todorov (2004) [47]. For this, we reformulate the optimal control problem defined by Eq 4 and the final boundary constraint X# as a "regulator" problem with a cost function including both the control cost and the final boundary constraint as a task cost. Parameters w (for the control), (for via-points) and (for the final goal) are necessary to weight the different terms of the cost function. The parameters are: . This means that only the position is constrained at the intermediate goals. Note that the models are formulated in a stochastic setting (noise on the dynamics and the observation) but are simulated without noise. There is no particular reason to add noise in the simulations.

Experiment

Ethics statement

The experiment was approved by Comité d’Ethique de La Recherche at Sorbonne Université (CER-2021-112). Participants signed a consent form prior to participating in the experiment and in accordance with the ethical guidelines of Sorbonne Université and in accordance with the Declaration of Helsinki.

Participants

Twenty-two volunteers (20–30 yr old, 8 female) participated in the behavioral experiment. According to the Edinburgh Protocol of handedness [48], 18 were right-handed, 2 left-handed and 2 ambidextrous. They had no known neurological disorders and normal or corrected to normal vision and they were uninformed as to the purpose of the experiment.

Apparatus

Participants were seated on a chair and used their dominant hand (their most comfortable hand for ambidextrous participants) to move the handle of a robotic arm programmed to constrain the displacement of the hand in a horizontal plane and apply force perturbations. Task instructions, feedback information, and continuous visual feedback of hand displacement were provided on a monitor placed vertically in front of the participant. The flow of the task was controlled by a personal computer running Windows 7 (Microsoft Corporation, USA). The 3D position of the robot was recorded at 1000 Hz and stored on the computer for offline processing and analysis using custom written Matlab scripts (Mathworks, Natick, MA, USA).

Experimental procedure

The participants were asked to make forward reaching movements from a start position to a target position located 0.1-m away using visual information displayed on the monitor (start position: 0.6-cm diameter white circle; target position: 1-cm diameter white circle; moving cursor: 0.3-cm diameter black circle). To start a trial, the participants placed the cursor at the start position and began to move when ready. Once the cursor stopped inside the target circle (cartesian velocity < 0.01 m/s), feedback was given regarding desired movement velocity. The circle appeared blue if the movement was deemed too slow (peak velocity along target direction < 0.25 m/s) or red if deemed too fast (peak velocity > 0.35 m/s). No specific constraint was applied to movement accuracy other than the displacement of the cursor to the target circle. The return movement was unconstrained except for the need to stop inside the start circle (cartesian velocity < 0.01 m/s) to start the next trial. On some trials, a velocity-dependent force field was applied during the forward displacement as defined by Eq 2 with ψ = 90° and φ = ±10 Ns/m. The force field was CCW (φ>0) for half of the participants. The participants performed four blocks of trials: block 1 (20 trials, 100% vs 0% of null field vs force field), block 2 (200 trials, 90% vs 10%), block 3 (100 trials, 5% vs 95%), block 4 (min 150 trials, max 400 trials, 10% vs 90%). The last block involved many trials to maximize the number of recordings of after-effect trajectories. Yet the participants were offered the possibility to stop the experiment after 150 trials if they felt exhausted or bored. A pause was proposed between each block. Participants were given the following instructions: "Perform forward reaching movements to the target according to the required speed, as indicated by the color code (blue, green, red). You may return to the starting position at your own pace. Make a brief pause in the target and at the starting position and avoid rhythmic back and forth movements. Sometimes the robot may perturb your movement. Whenever it happens, continue to obey to the task instructions". At the start of recording, the participants were already familiar with the robot as they performed unrelated preliminary trials of force and position measurements. The robot was transparent and easy to manipulate.

Data processing and analysis

Raw data were used to obtain the planar trajectory of the hand for each trial. A symmetry relative to the start position/target position axis was applied to the trajectories of participants receiving a CCW perturbation. Velocity, acceleration and jerk were calculated numerically from the two-sample difference of the position, velocity and acceleration signals, respectively. Position, velocity, acceleration and jerk were filtered with a fourth-order Butterworth low-pass filter with a cutoff at 10 Hz. Valid trials were detected by a peak velocity along target direction between 0.25 and 0.35 m/s in the forward part of the movement. For each valid trial, the forward trajectory was extracted by detection of movement onset and offset with a velocity threshold of 0.01 m/s and two time-varying quantities were calculated: (1) the angle (counted positive in the CCW direction) of the tangent to the trajectory relative to the line between the start position and the target position; (2) the time derivative of this angle which is closely related to the curvature of the trajectory. The valid trials were divided into four categories: baseline (trials of block 1), before-effect (perturbed trials of block 2), adapted (perturbed trials of block 4), and after-effect (unperturbed trials of block 4). For each category, mean trajectory, mean angle and mean angle derivative were calculated over the trials. The rationale for the choice of the filter cutoff frequency is the following. A power spectrum analysis was performed on the unfiltered timeseries using a specific method for short-duration timeseries [49]. The results are shown in S10 Fig for velocity, acceleration and jerk pooled across trials and participants, separately for each category (baseline, before-effect, adapted, after-effect). Much of the power was below 10 Hz.

Statistical analysis

A classical Student’s t-test was used to assess the sign of the trajectory angle derivative (H0: = 0 vs H1: ≠ 0). A p-value < 0.05 was taken to support H1. A p-value > 0.05 indicated that we could not reject H0. To assess the status of H0 vs H1 in the latter case, we calculated the Bayes factor bf10 which is the ratio between the likelihood of the data under H1 and H0 [50]. Bayes factors were interpreted according to the following table: 13: substantial. The Bayes factors were calculated with the Matlab toolbox FieldTrip (https://www.fieldtriptoolbox.org/; [51]).

Predictions of the compensation model.

A. Simulated trajectories. B. Simulated velocity profiles. (PDF) Click here for additional data file.

Time at which the angle derivative of the before-effect trajectory became positive.

A. All participants with mean value (thick line) and 25–75 percentiles (box). B. Data of participant P7 and mean of all the participants. The black dashed line is the model prediction. (PDF) Click here for additional data file.

Participants whose behavior is incompatible with the reoptimization model.

Same format as in Fig 5. For bf10, the dotted lines correspond, from bottom to top, to substantial =, anecdotal =, anecdotal≠, and substantial≠. (PDF) Click here for additional data file.

Participants whose behavior is partially compatible with the reoptimization model.

Same format as S3 Fig. (PDF) Click here for additional data file.

Two participants that failed to improve their behavior with training.

Same format as Fig 5. (PDF) Click here for additional data file.

Parametric study of the model: influence of feedback delay.

A. Before-effect (red) and after-effect (blue) trajectories. Feedback delay: 0, 0.05, 0.12, 0.15 s; light to dark color. B. Velocity profile. C. Trajectory angle. D. Angle derivative. (PDF) Click here for additional data file.

Parametric study of the model: influence of noise ratio.

Same format as S6 Fig. Noise ratio σ/σ (motor/sensory): 0.1, 1, 10, 100; light to dark color. (PDF) Click here for additional data file.

Parametric study of the model: influence of torque ratio.

Same format as S6 Fig. Muscle gain ratio gsh/gel (shoulder/elbow): 1, 2, 5, 10; light to dark color. (PDF) Click here for additional data file.

Parametric study of the model: influence of boundary conditions.

A. Mean and 25–75 percentiles of positive acceleration peaks for baseline (black) and before-effect (red) trajectories for different boundary conditions at via-points: p: only position; pv: position and velocity; pva: position, velocity and activation; pvae: position, velocity, activation and excitation. B. Same as A for jerk. (PDF) Click here for additional data file.

Power spectrum analysis.

A. Power spectrum density (arbitrary unit) of velocity average across trials and participants, for baseline (black), before-effect (red), adapted (green) and after-effect (green) trials. B. Same as A for acceleration. C. Same as A for jerk. (PDF) Click here for additional data file. 20 Jun 2022 Dear Dr. Guigon, Thank you very much for submitting your manuscript "What is the nature of motor adaptation to dynamic perturbations?" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. The reviewers were divided in their assessment of the manuscript; I opted to give you the benefit of the doubt. In order to fully address the issues raised, you will need to defend the adequacy of the data to test your hypothesis, and to more comprehensively evaluate alternative hypotheses. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts. Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Samuel J. Gershman Deputy Editor PLOS Computational Biology *********************** Reviewer's Responses to Questions Comments to the Authors: Reviewer #1: The authors describe a study aimed at understanding the nature of force field adaptation in reaching movements in humans. They combined empirical studies of FF reaching adaptation with an unusual data analysis technique to distinguish between a model of adaptation that involves learning novel patterns of forces, from a model that instead involves remapping the location of the endpoint target. I find the central premise/rationale of the study to be highly problematic. The authors try to draw a distinction between FF learning involving learning a new mapping between states and compensatory forces, and a second view that involves modifying the mapping between intended and actual goals 'irrespective of how to achieve these goals' and they use visuomotor rotation as an example of the latter. (Introduction, second paragraph). It's important to note that in visuomotor adaptation, if participants learn to aim to a different target location, that new actual movement direction does in fact involve new patterns of forces as well. Thus I don't see how one can in principle distinguish between these two models, at least not without some empirical method of disentangling the movement goal from the underlying forces. The authors state (Introduction) that they have not found any study that quantitatively documents the shape of the after-effect trajectories. I think Bhushan & Shadmehr (1999) have done this, no? - Bhushan N (1998) A Computational Approach to Human Adaptive Motor Control Shadmehr R, ed. Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.392.2697&rep=rep1&type=pdf. - Bhushan N, Shadmehr R (1999) Computational nature of human adaptive control during learning of reaching movements in force fields. Biol Cybern 81:39–60 Available at: http://dx.doi.org/10.1007/s004220050543. The authors state (Introduction) that after-effect trajectories seem to resemble "kinematic" trajectories rather than "dynamic" trajectories. I have no idea what this means, which is problematic since this is part of the stated premise/rationale of the study. The authors use a rather unusual method of frequency analysis of kinematics (velocity, acceleration and jerk) in order to distinguish between the competing models described above. I think this is highly problematic, namely using kinematics alone to try to distinguish between models that learn to reach to targets in a force field. It would seem to me the only way to test different models of FF adaptation would be to use models that actually produce forces, e.g. a dynamic model of a two-link planar arm, with for example a hill-type muscle model, and put that model in a simulated FF, and test the two models of control described in the Introduction. In fact, using an explicit computational model as described above would enable the reader to fully understand what exactly these competing models are, because in such a computational model one would have to make them perfectly explicit in order to actually run the models. This would be my major suggestion for the authors—use a mechanistic model of arm movement that actually generates forces to test competing models of control/learning. Reviewer #2: I would like to disclose my identity as Jonathan Tsay. Moullet and colleagues asked, “what is the nature of motor adaptation to dynamic perturbations”. They took a fine-grain approach to analyze after-effect trajectories following force-field perturbations and noticed that the data are not consistent with the predictions of a compensatory mechanism in which a mapping was learned between the goal and the optimal forces to achieve the goals in the presence of the applied forces (e.g., after-effects are not a mirror reflection of before-effects). Instead, the after-effects are consistent with a redirection process where participants (implicitly) aim towards a series of spatially remapped targets (i.e., via-points). The fine-grain analysis of after-effect trajectories is rigorous, but the current manuscript seems to miss several important components (see below): First, the current version of the manuscript is difficult to follow, and the theoretical differences/underpinnings of the models can be clearer. E.g., which level of behavior do these models try to explain with regards to feedback and feedforward corrections? Some work has suggested that feedforward corrections during late adaptation/aftereffects is a time-shifted version of the feedback corrections during before-effects (Albert & Shadmehr, 2016), whereas others have challenged this idea, positing that feedforward and feedback appear to be more independent (https://drive.google.com/file/d/1Nlhl59X1U8ELzKLTicW1VQSNSCYIRcHd/view). The current manuscript does not address this critical issue. The authors bring up that the models differ in how they alter different stages of movement (e.g., action selection, goal selection) – however, this terminology can be used differently across papers. For example, some refer action selection as where people explicitly aim (Kim et al., 2020; Krakauer et al., 2019), but in this paper, goal selection seems to correspond to where people explicitly aim. Clarifying how these terms are used in this paper would help readers follow the logic/differences among these models (a schematic may help). Second, missing model comparisons. The authors have focused on whether different models can qualitatively predict the after-effect trajectories. However, have the authors addressed the differences in model complexity (e.g., number of parameters, or number of assumptions of these models)? The “winning” model (redirection model) includes the notion of via-points, each of which make up the entire trajectory. Do the number of via-points change the model complexity, and therefore, make the re-direction model less parsimonious? An analysis of AIC and BIC may be useful to arbitrate among these models. Third, missing tests of alternative hypotheses. The current conclusions are largely based on one observation, the trajectory of the aftereffect data. While the re-direction model does a decent job capturing this behavior (but perhaps not parsimoniously, see Point 2), the data may also be consistent with one or more of the following: a) A model of Proprioceptive re-alignment (or PReMo; see Feature 6 of (Tsay et al., 2021)): participants following forcefield perturbation may experience a shift in their sensed hand position towards the direction of the force-field (Ostry et al., 2010). PReMo provides a theoretical account for the feedforward corrections (although feedback corrections are not considered and thought to arise from another learning system). b) The presence of cognitive re-aiming strategies: The authors note in the Discussion that the re-direction model does not arise from a change in explicit re-aiming in response to a force-field. Can the authors rule out this explanation? There have been a study that shows explicit re-aiming in response to forcefields (Schween et al., 2020). Aftereffects may also arise from explicit re-aiming if instructions are ambiguous (e.g., “reach directly to the target” could either refer to brining the hand or the invisible cursor to the target; if referring to the invisible cursor, participants may still explicitly re-aim away from the original target during the washout aftereffect period). c) Before-effects time-shifted to form after-effects: Can the compensatory model be salvaged if a gain (or time shift) parameter was included? That is, can the authors experimentally or logically rule out the notion that after-effects are just a time-shifted version of the before-effects (i.e., people are selecting the same goal, that is, the original target), rather than a re-direction towards another target? d) while the re-direction is a viable hypothesis, the critical idea (i.e., people are redirecting their movements to spatially re-mapped targets) was not directly tested (currently, only inferred via model comparison). Is there a way to probe where people are aiming, or the location of these via-points directly? If it’s explicit, then the authors could consider using an aiming wheel + aim report method. If these via points are implicit, does the re-direction model generate any qualitatively different predictions with regards to how aftereffects generalize to local/global targets, compared to the compensation hypothesis? References: Albert, S. T., & Shadmehr, R. (2016). The neural feedback response to error as a teaching signal for the motor learning system. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience, 36(17), 4832–4845. Kim, H. E., Avraham, G., & Ivry, R. B. (2020). The Psychology of Reaching: Action Selection, Movement Implementation, and Sensorimotor Learning. Annual Review of Psychology. https://doi.org/10.1146/annurev-psych-010419-051053 Krakauer, J., Hadjiosif, A. M., Xu, J., Wong, A. L., & Haith, A. M. (2019). Motor Learning. Comprehensive Physiology, 9(2), 613–663. Ostry, D. J., Darainy, M., Mattar, A. A. G., Wong, J., & Gribble, P. L. (2010). Somatosensory plasticity and motor learning. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience, 30(15), 5384–5393. Schween, R., McDougle, S. D., Hegele, M., & Taylor, J. A. (2020). Assessing explicit strategies in force field adaptation. Journal of Neurophysiology, 123(4), 1552–1565. Tsay, J. S., Kim, H. E., Haith, A. M., & Ivry, R. B. (2021). Proprioceptive Re-alignment drives Implicit Sensorimotor Adaptation. In bioRxiv. https://doi.org/10.1101/2021.12.21.473747 Reviewer #3: This study examined whether motor adaptation to a dynamic (force) perturbation reflects changes at the action selection level vs at the goal selection level. The authors performed a strict examination of arm trajectories as subjects made reaching movements with one arm to a visuomotor target. A velocity-dependent force field disrupted the reaching movements. The "change in action selection" model predicts mirror-symmetric shapes for the arm trajectories early in adaptation and the after-effect trajectories. This was not found, however. Instead, the afer-effect trajectories suggest that the process of adaptation occurs instead at the goal selection level. By selecting a new goal, the motor system has the option to generate arbitrary trajectories, which in some cases might be more efficient. I have no problems with the manuscript. The topic is important as it sheds light on how much the motor system controls trajectory details--more than was expected. The approach is sound, the results are clear, and the figures are great. The results are interesting and they will help advance our understanding of human gait control. I have one comment regarding a premise of the study's hypothesis: p. 4, par. 2. "after-effect trajectories, ... should incorporate a negative image of the forces induced by the applied force field..." A 2015 study by Krakauer's group showed that the amount of after-effect in rotation learning can be influenced, and even entirely cancelled, by the number of trials performed at asymptote. The interpretation was that reward-related components of motor adaptation. Minor suggestions: Minor changes are needed concerning several instances of awkward word choice or use of prepositions. There are many phrases, throughout the manuscript, that are either grammatically incorrect or that vary from typical English usage. I give a few examples below, but this list is not complete. It would be best to have the manuscript revised by a reader with fluent mastery of English. Examples: p. 2, par. 2: delete "against reality." p. 2, par. 2: Change "The results suggest to change our mind..." to something like "The results change our view of motor adaptation." p. 3, par. 1: change "cooperation" with something like "balance"; change "mandatory" to "necessary."; change "outrageously" to "disproportionately."; delete "upon request." ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: None Reviewer #2: Yes Reviewer #3: No: ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Jonathan Tsay Reviewer #3: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at . Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols 22 Jul 2022 Submitted filename: revPLoS.pdf Click here for additional data file. 4 Aug 2022 Dear Dr. Guigon, We are pleased to inform you that your manuscript 'What is the nature of motor adaptation to dynamic perturbations?' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Samuel J. Gershman Deputy Editor PLOS Computational Biology *********************************************************** Reviewer's Responses to Questions Comments to the Authors: Reviewer #2: The authors have sufficiently addressed my comments. That being said, while I appreciate that the central premise of the paper was to reject the re-optimization model, their results - as the authors noted themselves -- rely heavily on modeling without further experimental validation. I encourage the authors to at a minimum discuss concretely how they/the field should test the redirection model experimentally. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: Yes: Jonathan S. Tsay 24 Aug 2022 PCOMPBIOL-D-22-00735R1 What is the nature of motor adaptation to dynamic perturbations? Dear Dr Guigon, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Zsofia Freund PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

47 in total