Introduction

The ability to volitionally modulate the activity of neurons has been demonstrated in both human subjects and animals1. This skill is fundamental to the operation of brain–machine interfaces (BMIs), in which external devices, such as computer cursors or robotic arms, are controlled by the production of specific patterns of neural activity. Motor BMIs specifically rely on the modulation of motor cortical neural activity to control external devices2,3,4, while also relying on higher-level cognitive processes to integrate sensory information, plan and initiate actions, and monitor visual feedback of the effector5. These systems hold great potential for rehabilitation and restoration of lost motor function4,6, but the underlying neural processes involved in controlling a motor BMI are still unclear.

Controlling a BMI shares characteristics with both overt motor control and abstract cognitive tasks7,8,9. BMI control requires users to produce specific patterns of neural activity to manipulate their physical environment without physical movement and often without proprioceptive feedback. Thus, BMI control may rely on networks underlying both overt motor control and abstract cognitive tasks. While both BMI control and overt motor control rely on the direct modulation of the motor cortex, the mapping between neural activity and output is distinct between these types of control because a specific subpopulation of neurons is selected for BMI control. The prefrontal cortex and dorsal striatum are important for both motor control10,11,12,13 and for learning abstract associations14,15,16,17.

The primary motor cortex (M1), the dorsolateral prefrontal cortex (DLPFC) and the caudate nucleus of the striatum (Cd) are extensively interconnected with one another, as well as with other sensory and higher-level associational areas involved in goal-directed behavior18,19,20,21,22. Cortico-striatal plasticity is necessary for motor control23 and cortico-striatal and cortico-cortical interactions are involved in abstract skill learning18,24,25, suggesting that these interactions may play an important role in BMI control.

Successful control of a BMI depends on multiple cognitive processes that are employed differentially across events within the task. For example, in a center-out BMI task, the go cue may be more dependent on action selection and initiation, while control at target acquisition may depend more on error correction and fine movement control. Therefore, it is possible that different regions of interest (ROIs), such as motor cortex, prefrontal cortex, or striatum, are more or less involved at different points within a trial. For instance, the prefrontal cortex may be more involved at the go cue when action planning and selection are critical13,18,26 whereas the primary motor cortex may be more involved at target acquisition when movement execution and fine effector control are more important27. Thus, it is important to investigate neural activity in each ROI at multiple task events to better understand its role during manual and BMI control.

Previous studies that have used BMIs to dissect cognitive processes have primarily focused on neural activity in BMI-selected neurons, motor cortical neurons whose activity is directly input to the BMI28. A few studies have investigated the role of neural activity from non-BMI neurons within the motor cortex29,30, highlighting that BMI control involves large-scale networks that extend beyond BMI-selected neurons. However, it is becoming apparent that extending these investigations to brain areas beyond the motor cortex will be essential for gaining a more complete understanding of BMI control31. Previous studies have shown the involvement of cortico-striatal networks during a BMI task in rodents32,33,34 and of the prefrontal cortex in BMI skill acquisition in humans8,35,36. However, due to experimental constraints, these studies relied on simplified, one-dimensional BMI tasks, making it difficult to gain insight into the distributed network activity that may be specific to more complex and cognitively demanding two-dimensional (2-D) BMI control. To overcome these limitations, we successfully trained non-human primates (NHPs) to perform a 2-D BMI center-out task with recordings from a semi-chronic microdrive. These microdrives have previously been used to study large-scale networks37,38, but never in the context of BMI control. To our knowledge, this is the first time recordings from a semi-chronic microdrive have been used to drive a BMI in NHPs, allowing us to obtain simultaneous cortical and subcortical recordings to make direct comparisons of activity in these regions and investigate their interactions during BMI control.

Here, we use this unique experimental opportunity to investigate the modulation of and interactions between M1, DLPFC, and Cd activity during a motor BMI task and compare this activity to that during manual (overt motor) control. We perform these analyses at both the go cue and at target acquisition, allowing us to compare the role of each ROI across two task events that involve distinct cognitive processes. We demonstrate that DLPFC holds the most information for distinguishing BMI and manual control at the go cue, while M1 is most informative at target acquisition. Additionally, we show that directed information flow is present from DLPFC → M1 in both BMI and manual control and from Cd → M1 during BMI control. These directed interactions were present at both the go cue and at target acquisition. Ultimately, our findings confirm the involvement of distributed cortical and subcortical networks during BMI control and identify similar but distinct network activity during manual control.

Results

To explore whether there are BMI-specific neural representations in M1, DLPFC, and Cd, we simultaneously recorded from these regions during a BMI control task, a manual control task, and a baseline rest period. Two rhesus macaques (Monkey H and Monkey Y) were implanted with a custom-fit large-scale semi-chronic microdrive array on the left hemisphere (Fig. 1a). Single- and multi-unit recordings from the motor cortex were used as input to a BMI decoder, while local field potentials (LFPs) were simultaneously recorded from the three ROIs: M1, DLPFC, and Cd. Each day, the animals performed a two-dimensional, self-initiated, center-out task, in which they were instructed to move a cursor from a center target to one of eight pseudo-randomly instructed peripheral targets for a juice reward. A successful center-out trial required a brief hold at the center target, moving to the peripheral target within a specified time, and a brief hold at the target (Fig. 1b). First, they performed the task under manual control (Fig. 1c), followed by a four-minute baseline period (Fig. 1d). Then, they performed the same center-out task under BMI control (Fig. 1e,f).

Figure 1

Experimental set-up and task performance. (a) A 3-D model of a custom-fit, large-scale, semi-chronic microdrive from Gray Matter Research used for simultaneous recordings of neural activity from ROIs at different depths. (b) Timeline of the center-out task (see “Methods” for details). (c) Two monkeys completed a 2-D, self-initiated, center-out movement task under manual control. (d) During a baseline period, neural activity was recorded in the absence of visual cues, with the subject’s arm locked in a fixed position. (e) During BMI control, the subject’s arm was locked in a fixed position while they modulated neural activity to control a cursor in the same 2-D, self-initiated, center-out task. (f) Example BMI control trajectories from early learning (right) and late learning (left) to each target from Monkey Y. (g) Fraction of self-initiated trials that were successful in BMI control (teal) and manual control (yellow) for Monkey H (left) and Monkey Y (right). (h) Time from go cue to target acquisition in BMI control (teal) and manual control (yellow) for Monkey H (left) and Monkey Y (right). Shading represents the standard error of the mean across trials within a day.

Successful BMI control with a semi-chronic array

Both monkeys successfully learned to perform the eight-target, center-out task using a cursor under BMI control. Task performance improved across days, despite using different direct units and decoders each day (see “Methods” for details). Although only one monkey significantly increased the fraction of successfully completed trials (Fig. 1g; Linear regression, Monkey H: R² = 0.003, p = 0.852, Monkey Y: R² = 0.593, p = 2.78e−5), both monkeys learned to produce faster, straighter cursor trajectories, resulting in a decreased target acquisition time over days (Fig. 1h; Linear regression, Monkey H: R² = 0.035, p = 3.20e−27, Monkey Y: R² = 0.093, p = 4.67e−141). This improved performance demonstrates that the animals were able to learn to control a BMI using neural activity recorded from a semi-chronic microdrive.
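The trend tests reported above are linear regressions of a performance measure against recording day. The following is a minimal sketch of such a fit, assuming an ordinary least-squares line and synthetic data; the function name `linear_trend` and all numbers are illustrative and do not come from the recordings. It also shows why a trial-level fit can have a small R² together with a clearly negative slope: trial-to-trial variability dominates the variance even when the trend across days is reliable.

```python
import numpy as np

def linear_trend(x, y):
    """Ordinary least-squares fit y ~ slope * x + intercept.

    Returns (slope, intercept, r_squared).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)
    residuals = y - (slope * x + intercept)
    ss_res = np.sum(residuals ** 2)          # unexplained variance
    ss_tot = np.sum((y - y.mean()) ** 2)     # total variance
    return slope, intercept, 1.0 - ss_res / ss_tot

# Synthetic example (assumed values): acquisition time shrinking over 12 days,
# with 50 noisy trials per day.
rng = np.random.default_rng(0)
days = np.repeat(np.arange(1, 13), 50)
acq_time = 3.0 - 0.05 * days + rng.normal(0.0, 0.8, days.size)
slope, intercept, r2 = linear_trend(days, acq_time)
```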

Spectral power from M1, DLPFC, and Cd distinguishes BMI control, manual control, and baseline

To understand whether activity from M1, DLPFC, and Cd differs between BMI control, manual control, and baseline, we compared the spectral power in 5 distinct frequency bands across these three conditions (Supplementary Figs. 1, 2, see “Methods” for details). Using the LFP recorded from each ROI, we computed the mean power across electrode channels in each frequency band. For BMI and manual control, power was computed in the 500 ms window following the go cue and the 500 ms window preceding target acquisition on successful trials. The same calculations were performed in random non-overlapping 500 ms windows during the baseline period. The most notable differences occurred between baseline and the two control tasks in the theta and beta frequency bands (Fig. 2). At the go cue, theta power at baseline was significantly lower than during both manual and BMI control across all ROIs, while beta power at baseline was significantly higher than during either of the two control tasks (Fig. 2a). At target acquisition, beta power at baseline remained significantly higher than during the two control tasks in M1 but was less consistent at other ROIs and frequency bands (Fig. 2b). While these differences provide insight into how spectral power differs between the baseline rest period and the two control tasks at different task events, differences in spectral power within individual frequency bands between the two types of control were less consistent across animals.
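The band-power features described above can be sketched as follows. This is an illustration under stated assumptions rather than the analysis pipeline itself: the band edges in `BANDS`, the 1 kHz sampling rate, the plain FFT periodogram, and the per-channel normalization to total power are all assumptions made for the example (the actual definitions are given in “Methods”).

```python
import numpy as np

# Hypothetical band edges in Hz; the five bands used in the study are defined in Methods.
BANDS = {"delta": (1, 4), "theta": (4, 8), "alpha": (8, 13),
         "beta": (13, 30), "gamma": (30, 100)}

def band_power(window, fs, bands=BANDS):
    """Relative power per band for one LFP window of shape (channels, samples)."""
    window = np.asarray(window, dtype=float)
    window = window - window.mean(axis=1, keepdims=True)    # remove DC offset per channel
    freqs = np.fft.rfftfreq(window.shape[1], d=1.0 / fs)
    psd = np.abs(np.fft.rfft(window, axis=1)) ** 2           # unnormalized periodogram
    lo_all = min(lo for lo, _ in bands.values())
    hi_all = max(hi for _, hi in bands.values())
    total = psd[:, (freqs >= lo_all) & (freqs < hi_all)].sum(axis=1)
    out = {}
    for name, (lo, hi) in bands.items():
        in_band = (freqs >= lo) & (freqs < hi)
        # Normalize each channel to its total power, then average across channels.
        out[name] = float(np.mean(psd[:, in_band].sum(axis=1) / total))
    return out

# Synthetic 500 ms window: 4 channels sharing a 20 Hz oscillation plus noise.
fs = 1000
t = np.arange(500) / fs
rng = np.random.default_rng(0)
window = np.sin(2 * np.pi * 20 * t) + 0.1 * rng.normal(size=(4, 500))
power = band_power(window, fs)
```

With a 500 ms window the frequency resolution is 2 Hz, so the narrowest bands contain only one or two frequency bins; a multitaper or Welch estimate would trade resolution for lower variance.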

Figure 2

Theta and beta power in M1, DLPFC, and Cd is significantly different between baseline rest period and manual or BMI control. (a) Average theta (top) and beta (bottom) power normalized to total power in each ROI during baseline (gray), manual (yellow), and BMI (teal) at the go cue for Monkey H (left) and Monkey Y (right). Error bars represent the standard error of the mean across days. (b) Same as (a), but for power features obtained at target acquisition rather than at the go cue. Normalized spectral power estimates significantly differing between task types after Bonferroni correction for multiple comparisons are indicated with an asterisk (Supplementary Tables 1, 2, Bonferroni corrected p-value < 0.05).

To determine whether combinations of spectral power from multiple frequency bands across our ROIs yield more distinct representations between task types, we trained quadratic discriminant analysis (QDA) classifiers to distinguish between these task types (Fig. 3a). Although only theta and beta power showed consistent significant differences across task types, incorporating power from all frequency bands as input features yielded a higher classification accuracy than including any individual feature for each individual ROI, so all power features were included for each ROI in subsequent analyses (Supplementary Fig. 3). We compared neural activity recorded during BMI control to activity recorded during both manual control and baseline, allowing us to identify task-related activity that is specific to BMI control (Fig. 3a). At both the go cue and target acquisition, classifiers using power features from all ROIs, M1 + DLPFC + Cd, either yielded the highest classification accuracy or were not significantly different from the highest accuracy (Fig. 3b,c, Supplementary Fig. 4a,b). At target acquisition, classifiers that incorporated M1 power features (M1 + DLPFC + Cd, M1 + DLPFC, M1 + Cd, and M1 only) yielded significantly higher classification accuracy than classifiers without M1 power features (DLPFC + Cd, DLPFC only, and Cd only) (Fig. 3c, Supplementary Fig. 4b). With BMI performance improving across days (Fig. 1g,h), we considered whether these results varied across different stages of BMI learning. We repeated these analyses within early (first third of recording days; Monkey H: n = first 4 days, Monkey Y: n = first 7 days) and late learning periods (last third of recording days; Monkey H: n = last 4 days, Monkey Y: n = last 7 days). Both the go cue and target acquisition results were similar across BMI learning stages (Supplementary Fig. 5). 
Overall, these results indicate that it is possible to distinguish between BMI control, manual control, and baseline using spectral power features from M1, DLPFC, and Cd. Furthermore, the increase in classification accuracy resulting from the inclusion of M1 activity indicates that there are distinct neural representations of the three task types in this region at target acquisition.
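The classification analysis above can be sketched as a quadratic discriminant classifier scored with tenfold cross-validation. The version below is a hand-written NumPy illustration on synthetic features, not the study's implementation: the function names, the small ridge added to each covariance, the unstratified fold split, and the simulated 15-dimensional feature vectors (5 bands × 3 ROIs) are all assumptions.

```python
import numpy as np

def qda_fit(X, y):
    """Per-class mean, inverse covariance, log-determinant and prior."""
    params = {}
    for c in np.unique(y):
        Xc = X[y == c]
        cov = np.cov(Xc, rowvar=False) + 1e-6 * np.eye(X.shape[1])  # ridge for stability
        params[int(c)] = (Xc.mean(axis=0), np.linalg.inv(cov),
                          np.linalg.slogdet(cov)[1], len(Xc) / len(X))
    return params

def qda_predict(params, X):
    """Assign each row of X to the class with the highest Gaussian log-posterior."""
    classes = np.array(list(params))
    scores = []
    for c in classes:
        mu, prec, logdet, prior = params[int(c)]
        d = X - mu
        maha = np.einsum("ij,jk,ik->i", d, prec, d)   # squared Mahalanobis distance
        scores.append(-0.5 * (maha + logdet) + np.log(prior))
    return classes[np.argmax(np.array(scores), axis=0)]

def cv_accuracy(X, y, n_folds=10, seed=0):
    """Mean accuracy over n_folds shuffled, non-overlapping test folds."""
    order = np.random.default_rng(seed).permutation(len(y))
    folds = np.array_split(order, n_folds)
    accs = []
    for k in range(n_folds):
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        pred = qda_predict(qda_fit(X[train], y[train]), X[folds[k]])
        accs.append(np.mean(pred == y[folds[k]]))
    return float(np.mean(accs))

# Synthetic data: 3 task types (0 = baseline, 1 = manual, 2 = BMI), 100 trials each.
rng = np.random.default_rng(1)
y = np.repeat([0, 1, 2], 100)
X = rng.normal(size=(300, 15)) + 1.5 * y[:, None]
acc = cv_accuracy(X, y)   # chance level for three balanced classes is 1/3
```

Restricting the columns of `X` to the features of one ROI, or of a pair of ROIs, gives the per-combination accuracies that the comparison above is based on.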

Figure 3

Spectral power in M1, DLPFC, and Cd distinguishes between BMI control, manual control, and baseline at the go cue and at target acquisition. (a) LFP was decomposed into 5 distinct frequency bands. The average power in each of these bands in each ROI was used as input to the classifiers. 3-class task-type QDA classifiers were trained to distinguish between BMI control, manual control, and baseline. (b) Mean tenfold cross-validated classification accuracy across days for a 3-class task-type QDA classifier using all frequency bands from individual ROIs (M1 in blue, DLPFC in green, Cd in pink) or combinations of ROIs (purple) at the go cue for Monkey H (left) and Monkey Y (right). Classifiers are presented in order of ascending accuracy. Chance accuracy shown as a dashed line. Error bars represent the standard error of the mean across days. (c) Same as (b), but for power features obtained at target acquisition rather than at the go cue. (d) Most misclassifications occurred between BMI control and manual control. Confusion matrix for the model including all frequency bands from all ROIs at the go cue. Numbers and shading correspond to the fraction of correct predictions within a class for Monkey H (left) and Monkey Y (right). (e) Same as (d), but for power features obtained at target acquisition rather than at the go cue.

To further explore the differences in the neural representations of BMI control, manual control, and baseline in M1, DLPFC, and Cd, we computed confusion matrices for the classifiers trained using power features from all ROIs (Fig. 3d,e). Here, we assessed which types of misclassification were most common. If the majority of misclassifications occurred between BMI control and baseline, this would imply that these two classes are more similar to one another and distinct from manual control. In that case, we would infer that the differences in neural representations primarily correspond to the presence or absence of large, physical arm movements. On the other hand, a majority of misclassifications occurring between BMI control and manual control could indicate that differences in neural representations were largely task-related. Thus, quantifying the 3-class task-type QDA misclassifications allowed us to identify whether the model performed better at separating movement-related conditions or task-related conditions. At both the go cue and target acquisition, we found that the primary misclassifications occurred between BMI control and manual control, demonstrating that neural activity during the two tasks is more similar to one another than it is to activity during baseline (Fig. 3d,e). This suggests that the primary distinction in neural representations was due to the presence or absence of a task, rather than the presence or absence of movement.
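The misclassification comparison described above amounts to reading off-diagonal entries of a row-normalized confusion matrix. A minimal sketch follows, using made-up labels (0 = baseline, 1 = manual, 2 = BMI) purely for illustration; the helper name `confusion_fractions` is hypothetical.

```python
import numpy as np

def confusion_fractions(y_true, y_pred, n_classes):
    """Row-normalized confusion matrix: entry [i, j] is the fraction of
    class-i trials that were predicted as class j."""
    counts = np.zeros((n_classes, n_classes))
    for t, p in zip(y_true, y_pred):
        counts[t, p] += 1
    return counts / counts.sum(axis=1, keepdims=True)

# Made-up labels: 0 = baseline, 1 = manual control, 2 = BMI control.
y_true = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 0, 0, 1, 1, 2, 1, 2, 1, 2, 2]
cm = confusion_fractions(y_true, y_pred, 3)

# Confusion between the two control tasks versus confusion between BMI and baseline.
task_confusion = cm[1, 2] + cm[2, 1]
movement_confusion = cm[0, 2] + cm[2, 0]
```

In this toy example all errors fall between manual and BMI control, which is the pattern interpreted above as a task-related rather than movement-related distinction. The function assumes every class occurs at least once in `y_true`.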

DLPFC and M1 best distinguish between control-type at go cue and target acquisition, respectively

To gain further insight into the differences between neural representations of BMI control and manual control, we used a 2-class task-type QDA model to distinguish between BMI control and manual control only (Fig. 4a). At the go cue, using power features from DLPFC resulted in significantly higher classifier performance than using power features from M1 or Cd for Monkey H (Fig. 4b,c, Supplementary Fig. 6a). For Monkey Y, classifiers using power features from DLPFC had better performance than those using features from M1 or Cd, but there was only a significant improvement from Cd to DLPFC (Fig. 4b,c, Supplementary Fig. 6a). Additionally, the classification accuracy of models using power features from either DLPFC + Cd or M1 + DLPFC was not significantly different from that of models using power features from all ROIs for either subject. At target acquisition, models using power features from M1 resulted in significantly higher classifier performance than the classifiers using power features from DLPFC or Cd for Monkey H and from Cd for Monkey Y (Fig. 4d,e, Supplementary Fig. 6b). Additionally, the classification accuracy of models using power features from either M1 + DLPFC or M1 + Cd was not significantly different from that of models using power features from all ROIs for either subject. Furthermore, both the go cue and target acquisition results were similar across BMI learning stages (Supplementary Fig. 7). These results suggest that most of the information for distinguishing between BMI and manual control at the go cue is present in DLPFC, while most of the information at target acquisition is present in M1. This difference may indicate that cognitive mechanisms known to rely more heavily on DLPFC or M1, such as action-planning or movement-execution, are more involved at the go cue and at target acquisition, respectively.

Figure 4

DLPFC activity best distinguishes between BMI and manual control at the go cue, while M1 activity best distinguishes between BMI and manual control at target acquisition. (a) LFP was decomposed into 5 distinct frequency bands. The average power in each of these bands in each ROI was used as input to the classifiers. 2-class task-type QDA classifiers were trained to distinguish between BMI control and manual control only. (b) Mean tenfold cross-validated classification accuracy across days for a 2-class task-type QDA classifier trained to distinguish between BMI and manual control using all frequency bands from individual ROIs (M1 in blue, DLPFC in green, Cd in pink) or combinations of ROIs (purple) at the go cue for Monkey H (left) and Monkey Y (right). Classifiers are presented in order of ascending accuracy. Chance accuracy shown as a dashed line. Error bars represent the standard error of the mean across days. (c) Mean classification accuracy for each combination of ROIs at the go cue. (d,e) Same as (b,c), but for power features obtained at target acquisition rather than at the go cue.

Though DLPFC and M1 hold key information for distinguishing BMI and manual control at the go cue and at target acquisition, respectively, models using power features from all individual ROIs at both task events yielded above-chance classification accuracy (Fig. 4b and d). This indicates that each individual ROI holds important information for the 2-class task-type model. Additionally, models with features from 2 or 3 ROIs combined consistently yielded the highest classification accuracy (Supplementary Fig. 6). This increase in accuracy obtained by combining information across ROIs suggests that additional ROIs contribute distinct information.

To extend the comparison of neural representations during BMI and manual control in these ROIs, we introduced an 8-class target-direction Linear Discriminant Analysis (LDA) classifier to distinguish between the eight different target-directions in both manual control and BMI control scenarios (Fig. 5a). Similar to previous analyses, power from all frequency bands during the 500 ms after the go cue and before target acquisition from different ROIs was used as input to the models. To ensure a fair comparison between models trained on data from the two control types, an equal number of trials per target-direction under BMI and manual control were incorporated into their respective classifiers. Additionally, the data from the first 5 days were excluded from each model to ensure that there was a sufficient number of successful trials to each target (see “Methods” for details). Under manual control, accuracies of classifiers at the go cue using power features from M1 only were significantly greater than those using power features from DLPFC only and Cd only for both animals (Fig. 5b,c, Supplementary Fig. 8a). Under BMI control, accuracies of classifiers using M1 were significantly greater than models using Cd only for both animals (Fig. 5b,c, Supplementary Fig. 8a). Furthermore, models using power features from all 3 ROIs (M1 + DLPFC + Cd), M1 + DLPFC, and M1 + Cd out-performed the models using DLPFC only and Cd only under both manual and BMI control, suggesting that the addition of M1 power features boosts prediction of target-direction at the go cue during both modes of control (Fig. 5b,c, Supplementary Fig. 8a). While classification accuracies of models including M1 power features tended to be higher under manual control than under BMI control at the go cue, only M1 + Cd and M1 + DLPFC + Cd were significantly greater for both animals (Fig. 5c, Supplementary Table 3a). 
Conversely, at target acquisition, accuracies from nearly all models using power features from manual control were significantly greater than the models using the same power features from BMI control (Fig. 5e, Supplementary Table 3b). Under manual control, the classifiers containing power features from M1 + DLPFC and M1 + DLPFC + Cd performed the best at target acquisition (Fig. 5d,e). Under BMI control, there were no significant differences between any of the combinations of ROIs at target acquisition and very few models achieved above-chance accuracies (Supplementary Fig. 8b). In contrast, under manual control, all but one model (Cd only for Monkey Y) significantly predicted target-direction at target acquisition (Fig. 5d,e).
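The trial-matching step mentioned above (equal numbers of trials per target-direction under BMI and manual control) can be sketched as a random subsampling without replacement. This is one plausible reading, not the documented procedure: the function `balanced_indices`, the choice to match counts per target between the two conditions (rather than also across targets), and the synthetic target labels are assumptions.

```python
import numpy as np

def balanced_indices(targets_a, targets_b, n_targets=8, seed=0):
    """Pick, for every target, the same number of trials from both conditions.

    targets_a / targets_b hold the target index (0..n_targets-1) of each
    successful trial in the two conditions. Returns sorted trial indices to keep.
    """
    rng = np.random.default_rng(seed)
    keep_a, keep_b = [], []
    for t in range(n_targets):
        idx_a = np.flatnonzero(targets_a == t)
        idx_b = np.flatnonzero(targets_b == t)
        n = min(len(idx_a), len(idx_b))          # limited by the scarcer condition
        keep_a.extend(rng.choice(idx_a, size=n, replace=False))
        keep_b.extend(rng.choice(idx_b, size=n, replace=False))
    return np.sort(np.array(keep_a, dtype=int)), np.sort(np.array(keep_b, dtype=int))

# Synthetic session: many manual trials, fewer successful BMI trials.
rng = np.random.default_rng(2)
manual_targets = rng.integers(0, 8, size=400)
bmi_targets = rng.integers(0, 8, size=150)
keep_manual, keep_bmi = balanced_indices(manual_targets, bmi_targets)

chance_accuracy = 1.0 / 8   # chance level for eight equally likely targets
```

After matching, each condition's classifier sees the same number of examples per target, so a difference in accuracy between BMI and manual control cannot be attributed to the amount of training data alone.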

Figure 5

M1 activity best distinguishes between target-directions at the go cue for both BMI and manual control. At target acquisition, DLPFC + M1 activity best predicts target-direction under manual control, while all ROIs predict target-direction near chance accuracy under BMI control. (a) LFP was decomposed into 5 distinct frequency bands. The average power in each of these bands in each ROI was used as input to the classifiers. 8-class target-direction LDA classifiers were trained to distinguish between target-directions under BMI control and manual control separately. (b) Mean tenfold cross-validated classification accuracies for each combination of ROIs at the go cue under BMI control (teal) and manual control (yellow) for Monkey H (left) and Monkey Y (right). (c) Classification accuracies for the 8-class target-direction LDA classifiers for BMI control and manual control at the go cue. Classifiers are presented in order of ascending accuracy for manual control. Chance accuracy shown as a dashed line. Error bars represent the standard error of the mean across days. Classification accuracies significantly differing between BMI and manual control after Bonferroni correction for multiple comparisons are indicated with an asterisk (Bonferroni corrected p-value < 0.05; see Supplementary Table 3 for exact values). (d,e) Same as (b,c), but for power features obtained at target acquisition rather than at the go cue.

Overall, DLPFC appears to best distinguish between BMI and manual control at the go cue, while target-direction at the go cue is best predicted by M1 within each mode of control. Thus, the differences in neural activity within DLPFC that differentiate BMI and manual control could pertain to factors other than the encoding of target-direction. At target acquisition, M1 best distinguishes between modes of control. M1 and DLPFC seemed to play a large role in predicting target-direction under manual control, while classifiers predicting the target-direction from LFP power features under BMI control performed near chance accuracy.

Information flow to M1 during both BMI and manual control

To investigate how these regions interact with one another during BMI and manual control, we evaluated the directed functional connectivity, or effective connectivity, using Granger causality. Granger causality is a statistical test for determining whether one time-series is useful in predicting another, and therefore provides insight into the direction of information flow between these ROIs. For successful trials under BMI and manual control, we calculated Granger causality in the 500 ms LFP segments following the go cue and those preceding target acquisition. The same calculations were also performed on random non-overlapping 500 ms windows from the baseline period. At both task events, we found significantly above-chance effective connectivity in every ROI → ROI interaction direction during all task types: BMI control, manual control, and baseline (see “Methods” for details). This indicates that every ROI sends information to every other ROI. Because all ROIs exhibit significant reciprocal interactions, even during baseline, we isolated task-specific directed information flow by calculating a normalized Granger causality that compared interactions during BMI and manual control to interactions during baseline. A normalized Granger causality value greater than zero indicates an increase from baseline, while a value less than zero indicates a decrease from baseline. This normalized value provides insight into which interactions are task-specific and accounts for variability in baseline information flow across days. We first analyzed this normalized metric of directed information flow during BMI control. We observed an increase in normalized Granger causality in the DLPFC → M1 and Cd → M1 directions and a decrease in the M1 → Cd direction in both monkeys at both the go cue and target acquisition (Fig. 6a,b). 
Additionally, there was an increase in the Cd → DLPFC direction for Monkey H and a decrease in the DLPFC → Cd direction for Monkey Y at the go cue (see Supplementary Table 4). These results suggest information flow from DLPFC → M1 and from Cd → M1 during both task events, as well as information flow from Cd → DLPFC at the go cue. Next, we determined the change in directed information flow from baseline during manual control. We observed a similar increase in normalized Granger causality from DLPFC → M1 and from Cd → M1 in both monkeys at both task events (Fig. 6a,b). However, unlike during BMI control, DLPFC ↔ Cd interactions were inconsistent across animals. While there were significant differences between BMI and manual control normalized Granger causality within a subset of interaction directions, these differences were inconsistent across monkeys and mainly differed in amplitude rather than direction (Fig. 6a,b, see Supplementary Table 5). Overall, these results suggest information flow from DLPFC → M1 and Cd → M1 is present during both BMI and manual control.
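To make the Granger causality analysis above concrete, the sketch below estimates a time-domain, bivariate Granger causality as the log ratio of residual variances from two autoregressive fits, then expresses a task value relative to baseline. Several things here are assumptions for illustration only: the model order of 5, the bivariate (pairwise) formulation, the relative-change form of `normalize_to_baseline`, and the simulated signals. The estimator and normalization actually used are described in “Methods”.

```python
import numpy as np

def lagged(x, order):
    """Columns x[t-1], ..., x[t-order] for t = order .. len(x)-1."""
    return np.column_stack([x[order - k: len(x) - k] for k in range(1, order + 1)])

def granger(source, target, order=5):
    """Granger causality source -> target: ln(restricted / full residual variance)."""
    y = target[order:]
    own = np.hstack([np.ones((len(y), 1)), lagged(target, order)])   # target's own past
    full = np.hstack([own, lagged(source, order)])                   # plus source's past
    res_r = y - own @ np.linalg.lstsq(own, y, rcond=None)[0]
    res_f = y - full @ np.linalg.lstsq(full, y, rcond=None)[0]
    return float(np.log(np.var(res_r) / np.var(res_f)))

def normalize_to_baseline(task_gc, baseline_gc):
    """Assumed normalization: relative change from baseline (> 0 means an increase)."""
    return (task_gc - baseline_gc) / baseline_gc

def simulate(coupling, n, rng):
    """Toy pair of signals in which x drives y with a one-sample delay."""
    x = rng.normal(size=n)
    y = 0.5 * rng.normal(size=n)
    y[1:] += coupling * x[:-1]
    return x, y

rng = np.random.default_rng(3)
x_task, y_task = simulate(0.8, 500, rng)     # stronger x -> y coupling during the "task"
x_base, y_base = simulate(0.4, 500, rng)     # weaker coupling at "baseline"

gc_xy = granger(x_task, y_task)              # driven direction
gc_yx = granger(y_task, x_task)              # reverse direction, expected near zero
norm_xy = normalize_to_baseline(gc_xy, granger(x_base, y_base))
```

Because the restricted model is nested in the full model, the estimate is non-negative up to rounding, which is one reason raw values can be above chance in every direction and a comparison to baseline is needed to isolate task-specific changes.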

Figure 6

Net flow of information between M1, DLPFC, and Cd relative to baseline is similar across BMI and manual control. (a) Normalized Granger causality estimates at the go cue during BMI (teal) and manual (yellow) control. Normalized Granger causality greater than zero indicates an increase from baseline, whereas less than zero indicates a decrease from baseline. Normalized Granger causality estimates significantly differing between BMI and manual control are indicated with an asterisk (Bonferroni corrected p-value < 0.05; see Supplementary Table 5 for exact values). Error bars represent the standard error of the mean across days. (b) Same as (a) but for target acquisition, rather than go cue. (c) Schematic depicting the calculation of net normalized Granger causality. The difference in normalized Granger causality for reciprocal interactions was used to estimate net flow of information relative to baseline. (d) Arrows represent significantly directed net normalized Granger causality (Bonferroni corrected p-value < 0.05) at the go cue during BMI control (see Supplementary Table 6 for all values of net normalized Granger causality). The absence of an arrow indicates that the net normalized Granger causality was not significantly directed. (e) Same as (d), but for target acquisition during BMI control. (f) Net normalized Granger causality at the go cue during manual control. (g) Same as (f), but for target acquisition during manual control.

For both BMI and manual control, most reciprocal interactions underwent opposite changes from baseline. To consolidate these results, we calculated the net normalized Granger causality by taking the difference between each pair of reciprocal directed interactions (Fig. 6c). This metric, which represents the overall predominant direction of task-specific information flow, provides a clear representation of the network between our ROIs. We refer to information flow between two ROIs as significantly directed if we found the difference in normalized Granger causality to be significantly different from zero. During BMI control, we found that information was significantly directed from DLPFC → M1 and Cd → M1 for both monkeys at both task events (Fig. 6d,e, Supplementary Table 6a,b). At the go cue, information was also significantly directed from Cd → DLPFC (Fig. 6d, Supplementary Table 6a). Just as with our control-type classification results, we assessed whether this directed network was present across BMI learning stages and found no differences (Supplementary Fig. 9). Under manual control, we found similar directions of net change in information flow from baseline. The net normalized Granger causality was significantly directed from DLPFC → M1 for both monkeys across both task events (Supplementary Table 6a,b), similar to BMI control (Fig. 6f,g). The interaction from Cd → M1 was significantly directed for Monkey Y, but was not significantly directed for Monkey H. Interactions between DLPFC and Cd were significantly directed from DLPFC → Cd in Monkey H, rather than Cd → DLPFC as observed during BMI control, but were not significant for Monkey Y. Within each monkey, these results were consistent across task events. Overall, there were many similarities in the net direction of information flow across both BMI and manual control. 
Under both control types, the net direction of Granger causality increased from DLPFC → M1 relative to baseline at both the go cue and target acquisition. However, the net direction of information flow between Cd and M1 and between DLPFC and Cd was not consistent across monkeys during manual control. Therefore, DLPFC may play an upstream role for both BMI and manual control, while the upstream role of Cd may be clearer for BMI control.
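The net metric described above is the difference between the two reciprocal normalized Granger causality values for each pair of ROIs. A minimal sketch follows; the matrix layout (rows are sources, columns are targets), the helper name `net_flow`, and the numbers are illustrative assumptions, not values from the paper.

```python
import numpy as np

ROIS = ["M1", "DLPFC", "Cd"]

def net_flow(norm_gc):
    """Net normalized Granger causality.

    norm_gc[..., i, j] is the normalized value for ROIS[i] -> ROIS[j].
    The result is antisymmetric: net[..., i, j] > 0 means the predominant
    task-specific direction is ROIS[i] -> ROIS[j].
    """
    norm_gc = np.asarray(norm_gc, dtype=float)
    return norm_gc - np.swapaxes(norm_gc, -1, -2)

# Illustrative values only: rows are sources, columns are targets.
norm_gc = np.array([[0.0, -0.1, -0.3],    # M1    -> M1, DLPFC, Cd
                    [0.4,  0.0,  0.1],    # DLPFC -> M1, DLPFC, Cd
                    [0.5,  0.2,  0.0]])   # Cd    -> M1, DLPFC, Cd
net = net_flow(norm_gc)

dlpfc_to_m1 = net[ROIS.index("DLPFC"), ROIS.index("M1")]   # 0.4 - (-0.1)
cd_to_m1 = net[ROIS.index("Cd"), ROIS.index("M1")]         # 0.5 - (-0.3)
```

Stacking one such matrix per recording day along a leading axis gives a distribution of net values per ROI pair, which is what a test against zero would then be applied to.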

Our results demonstrate the presence of distinct neural representations of BMI and manual control in M1, DLPFC, and Cd. More specifically, we found that neural activity from DLPFC and M1 best distinguishes between control types at the go cue and target acquisition, respectively. Further, we identified net directed information flow from DLPFC → M1 in both BMI and manual control, and from Cd → M1 in BMI control. Together, these results suggest that DLPFC plays an important upstream role in both BMI and manual control tasks, with especially distinct neural representations during a period in the task that requires action planning, selection, and initiation.

Discussion

In this study, single- and multi-unit motor cortical recordings from a semi-chronic microdrive array were used to successfully control a BMI. We leveraged simultaneous LFP recordings from M1, DLPFC, and Cd to investigate differences in distributed activity within cortical and subcortical networks during motor BMI control and during overt motor (manual) control. Our results demonstrate the presence of distinct neural representations of BMI and manual control in M1, DLPFC, and Cd. Notably, we found that neural activity from DLPFC and M1 best distinguishes control types at the go cue and at target acquisition, respectively. Further, we identified net directed information flow from DLPFC → M1 in both BMI and manual control and from Cd → M1 in BMI control. Together, these results suggest that DLPFC plays an important upstream role in both BMI and manual control tasks, with especially distinct neural representations during a period in the task that requires action planning, selection, and initiation. In contrast, M1 is a net receiver of information and is better at distinguishing control types during a period in the task that requires finer effector control. Overall, this work provides evidence of coordinated network activity between M1, DLPFC, and Cd during BMI control that is similar to, yet distinct from, that during manual control.

We compared spectral power during baseline, BMI control, and manual control and found that theta power in M1, DLPFC, and Cd was significantly higher during the control tasks than during baseline at the go cue. In contrast, there were no significant distinctions in theta power at target acquisition. Previous work has demonstrated the involvement of theta activity in goal-directed information processing39,40, with increased theta activity in the prefrontal cortex observed at decision points where choice-relevant information is under consideration. In the center-out task, the go cue can be thought of as a decision point, where subjects must choose a cursor trajectory, while the target acquisition period involves less choice-relevant information processing. Therefore, this observed increase in theta power across all ROIs at the go cue may be related to trajectory selection and planning. In contrast to theta power, beta power in M1, DLPFC, and Cd at the go cue was significantly lower during the control tasks than at baseline. At target acquisition, only M1 beta power remained distinct between baseline and the two control tasks. Beta power in the prefrontal cortex has been implicated in attention and task-specific rule representation41,42. Beta power in the motor cortex and striatum can be modulated by task-relevant cues, movement preparation, and movement execution43,44,45,46. Therefore, beta power within each of our ROIs may play an important role in maintaining task-relevant information, with DLPFC and Cd beta power modulated more specifically during trajectory planning at the go cue. These differences in oscillatory activity in the theta and beta frequency bands between baseline and the two control tasks suggest that this activity, which has previously been observed during planning, movement, and other goal-directed tasks, may be utilized with similar functionality in both BMI and manual control.

We found distinct BMI and manual neural representations in M1, DLPFC, and Cd at both task events when comparing combinations of frequency bands. We also found that control-type classification accuracy generally increased with the addition of neural activity from a second or third ROI, suggesting that each ROI holds some non-overlapping information for differentiating between BMI and manual control. One major difference between our BMI and manual control tasks is the lack of proprioceptive feedback in the BMI task. Many models of how the motor cortex controls movement incorporate sensory feedback as a relevant factor47. One potential explanation for observed differences in M1 activity is the difference in sensory feedback between the two control types. The differences between control types in M1 LFP may also be related to changes in the properties of individual M1 neuron activity. Distinct changes in neural activity between BMI and manual control have been previously observed in single- and multi-unit motor cortical activity, in the form of changes in preferred direction and modulation depth29,48,49,50,51. The differences we observed between BMI control and manual control in M1 LFP may reflect or drive these distinct single- and multi-unit properties across control types. DLPFC activity also successfully differentiated between BMI and manual control and is largely involved in action planning, selection, and initiation13,18,26. Thus, the distinct neural representations in DLPFC activity may reflect differences between action planning, selection, and initiation between BMI and manual control. Furthermore, we also found that DLPFC activity was better than other ROIs at distinguishing control types at the go cue, a portion of the task that may rely more heavily on action planning, selection, and initiation, further supporting the hypothesis that the differences in DLPFC activity between BMI and manual control are related to differences in these cognitive processes. 
Interestingly, while DLPFC was best at distinguishing between control types at the go cue, M1 was best at distinguishing between target-directions, suggesting that DLPFC may have a role at the go cue that is distinct from target-direction encoding. The prefrontal cortex has also been implicated in rule representations and in task- and rule-switching18, as has the basal ganglia52,53. In our experiment, proficient motor BMI and manual control could be viewed as two rules dictating how the subjects must perform the center-out task. Thus, the distinct neural representations in DLPFC and Cd may be involved in differentiating between these two control types as distinct rules.

Behavioral performance is another major difference between BMI and manual control, with BMI behavioral performance significantly changing across days while manual performance remains more stereotyped and consistent. These differences in consistency across trials may explain why predicting target-direction at target acquisition under BMI control performed worse than under manual control. This also may suggest that BMI performance was more stereotyped at the go cue, where there were fewer significant differences between the modes of control, than at target acquisition. However, despite behavioral performance changes in the BMI control task over days, our control-type classification results did not differ across learning stages, implying that the distinctions in neural representations of control-type do not reflect skill learning and changes in performance. In contrast, previous literature has shown decreases in DLPFC activity and increases in striatal activity across motor learning54. Additionally, decreased PFC activity has been observed across learning of a 1-D BMI task in humans8,35 as well as an increase in cortico-striatal coherence across learning of a 1-D BMI task in rodents32. However, the 2-D, center-out BMI task used in this study is more challenging than both an overtrained overt motor control task and a 1-D BMI control task. Because the task used in this study required the animal to hold a center target in order to successfully initiate a trial, even trial initiation required some amount of proficient control from each NHP. Thus, any truly initial learning that would need to occur prior to this baseline level of proficiency would not have been captured by our experimental design. 
Although our results were consistent across early and late learning stages, it is important to note that even at their most proficient BMI control, both subjects performed the task more slowly under BMI control and never achieved the same target acquisition time as manual control. Thus, even proficient BMI control may involve greater online error correction than manual control. Previous studies have discussed the role of the striatum in error-based motor learning and encoding error in movement observation55,56,57,58. The distinctions we observe in Cd may represent this difference in online error correction across control types. However, the exact role of Cd in error correction is still debated and comparing the role of other regions implicated in error prediction, such as the cerebellum, is necessary59. While further research is necessary to understand which cognitive processes are relevant to each brain region in BMI control, our results expand on prior knowledge of distinct neural representations between BMI and manual control in M1 and also show distinctions in the neural activity of distributed brain regions beyond M1 including in DLPFC and Cd.

To investigate how these regions interact with one another during both BMI and manual control, we quantified directed functional connectivity using Granger causality. Overall, we observed a similar direction of net information flow across ROIs between BMI and manual control despite M1 spikes being used as direct input to the BMI decoder, which may alter the neural dynamics within M130,60 leading to possible changes in LFP synchronization across interconnected brain regions. Our results suggest that these potential changes in neural dynamics during BMI control do not greatly reshape the feedback loops between M1, PFC, and Cd that already exist under overt motor control. Instead, we observed net directed information flow from DLPFC → M1 during both BMI and manual control. Previous studies have identified structural and functional connections between prefrontal and motor cortices, with the prefrontal cortex sending information to the motor cortex regarding both motor regulatory functions13,61 and goal-directed behaviors25. Our results support this upstream information processing model during overt motor control and suggest that a similar model is present during BMI control. The direction of net information flow was consistent across the go cue and target acquisition, suggesting that the information-processing network is stable despite shifts in primary cognitive demands throughout an individual trial and may be representative of a more general state of goal-directed behavior.

We also observed net directed information flow from Cd → M1 during BMI control. Much of the previous literature on cortico-striatal connections has focused on motor execution directed from M1 to the striatum due to the existence of direct structural connections between them20. However, recent studies in mice have confirmed the presence of a cortico-striatal thalamo-cortical feedback loop that is relevant in motor control, suggesting that indirect modulation of the motor cortex by the striatum also plays an important role in motor control22,62,63,64. Additionally, a recent study in humans suggests that basal ganglia may contribute to the regulation of sensorimotor cortical regions65. While the bottom-up indirect feedback from the striatum to the motor cortex has not received as much attention, particularly in NHPs, our results suggest that this directed functional connection may be an important area of future study in BMI control.

Altogether, this study leverages simultaneous cortical and subcortical recordings during the same task under BMI control and under overt motor control to identify control-specific neural representations within M1, DLPFC, and Cd and to uncover an information-processing network from DLPFC and Cd to M1 during BMI control. Our results contribute to a growing body of work elucidating the neural mechanisms underlying BMI control, improving our understanding of BMIs as a scientific tool and as a therapeutic device. This study provides insight into the cortical and subcortical circuits supporting both motor and high-level cognitive functions associated with BMI control.

Methods

Surgery

Two rhesus macaques were implanted with recording chambers. Chamber positions were calculated based on images obtained from 3 T magnetic resonance imaging (MRI) (Siemens Medical Solutions, Malvern, PA) scans of each subject’s brain. Regions of interest, including primary motor cortex (M1), dorsolateral prefrontal cortex (DLPFC), and caudate (Cd), were manually traced in 3D Slicer66 using the Paxinos primate atlas as a reference67. The resulting neuroanatomical models were used to decide on stereotaxic coordinates for implantation. All procedures were conducted in compliance with the National Institutes of Health (NIH) Guide for the Care and Use of Laboratory Animals and were approved by the University of California at Berkeley Institutional Animal Care and Use Committee.

Large-scale semi-chronic microdrive

We used custom large-scale semi-chronic microdrive arrays for recording in both animals (Gray Matter Research, MT). These arrays allowed us to simultaneously record from regions of interest at different depths using independently-moveable single microelectrodes (n = 124). Electrodes consisted of both glass-coated Tungsten electrodes (Alpha Omega, Nof HaGalil (Nazareth Illit), Israel) and Platinum-Iridium electrodes (MicroProbes for Life Sciences, Gaithersburg, MD). Throughout implantation and recovery, electrodes were stored inside of the microdrive. The impedance of the electrodes was monitored using a TDT NanoZ (Tucker-Davis Technologies, Alachua, Florida) while advancing them into the brain. First, electrodes were lowered out of the microdrive until they penetrated the dura. While most electrodes successfully entered the brain (Monkey H: 79/124; Monkey Y: 64/124), a subset broke upon penetrating the dura (Monkey H: 45/124; Monkey Y: 60/124). These recording channels were excluded from all analyses. Electrodes that successfully entered the brain were advanced until their respective neural targets were reached and unit activity was detected. The estimated target depth of each electrode was calculated using the neuroanatomical models built in 3D Slicer. Ultimately, we successfully lowered a subset of electrodes into each region of interest (Monkey H: M1 = 27, PMd = 10, DLPFC = 22, Cd = 8; Monkey Y: M1 = 12, PMd = 12, DLPFC = 16, Cd = 8).

Intracortical Recording

Neural data were recorded using the OmniPlex Neural Recording Data Acquisition System (Plexon Inc, Dallas, TX). Single- and multi-unit activity was sorted prior to beginning recording sessions using an online sorting application (Sort Client, Plexon Inc, Dallas, TX). Wideband activity was recorded at 5 kHz. LFP activity was obtained by low-pass filtering at 250 Hz, notch filtering at 60 Hz and 120 Hz, and down-sampling to 1 kHz. LFP activity was common median referenced by first z-scoring activity within each channel (\({x}_{i}\), where \(i \in \{1, 2, \ldots, n\}\)) by subtracting the mean and dividing by the standard deviation in each recording session (\(s\)):

$${Z}_{i}^{s} = \frac{{x}_{i}^{s} - {\mu}_{i}^{s}}{{\sigma}_{i}^{s}}$$

and then subtracting the median value across all channels at each time point (\(t\)):

$${Z'}_{i}^{t} = {Z}_{i}^{t} - \mathrm{median}(\{{Z}_{1}^{t}, {Z}_{2}^{t}, \ldots, {Z}_{n}^{t}\})$$

After subtracting the median to remove common signals, likely low frequency noise, across channels, the LFP activity was multiplied by the standard deviation and the mean was added to restore the LFP to its original scaling.

$${x'}_{i}^{s} = ({Z'}_{i}^{s} \cdot {\sigma}_{i}^{s}) + {\mu}_{i}^{s}$$

Common median referencing was used instead of common average referencing to avoid influence from outliers. The common median referenced LFP activity was then z-scored using the mean and standard deviation within channel across all recording sessions within a day (\(1, 2, \ldots, S\)).

$${Z''}_{i}^{s} = \frac{{x'}_{i}^{s} - {\mu'}_{i}^{1, 2, \ldots, S}}{{\sigma'}_{i}^{1, 2, \ldots, S}}$$
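The referencing steps above can be illustrated with a minimal NumPy sketch. This is not the original analysis code: the function names, the (channels × samples) array layout, and the synthetic data are assumptions made for illustration only.

```python
import numpy as np

def common_median_reference(lfp):
    """lfp: (n_channels, n_samples) array from one recording session (assumed layout)."""
    mu = lfp.mean(axis=1, keepdims=True)            # per-channel mean within the session
    sigma = lfp.std(axis=1, keepdims=True)          # per-channel standard deviation
    z = (lfp - mu) / sigma                          # z-score within channel
    z_ref = z - np.median(z, axis=0, keepdims=True) # remove the across-channel median at each time point
    return z_ref * sigma + mu                       # restore the original scaling

def zscore_across_sessions(sessions):
    """sessions: list of (n_channels, n_samples_s) arrays recorded on the same day."""
    stacked = np.concatenate(sessions, axis=1)
    mu = stacked.mean(axis=1, keepdims=True)
    sigma = stacked.std(axis=1, keepdims=True)
    return [(x - mu) / sigma for x in sessions]

# Synthetic example: 8 channels sharing a large common signal (illustrative values only).
rng = np.random.default_rng(0)
common = rng.normal(size=(1, 2000))
sessions = [rng.normal(size=(8, 2000)) + 5.0 * common for _ in range(2)]
referenced = [common_median_reference(x) for x in sessions]
normalized = zscore_across_sessions(referenced)
```

In this toy example the shared component dominates the raw channels and is largely removed after referencing, while the final z-scoring puts each channel on a common scale across the sessions of a day.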

Center-out task

Subjects performed a self-paced, center-out, reaching task to eight targets. Trials were initiated by moving a cursor to the central target. A successful trial required a short hold at the center, moving to the peripheral target within a specified time, and a brief hold at the target. Successful trials resulted in a juice reward; failed trials were repeated up to 10 times before a new target was presented. Target directions were presented in a pseudo-randomized order.

Subjects were first overtrained in the center-out task performed with arm movements before starting BMI. In this manual control (MC) version of the task, the subject’s arm moved in a KINARM exoskeleton (BKIN Technologies, Kingston, ON, Canada) that restricted movements to the horizontal plane. Neural activity recorded during MC was used to train a BMI decoder. Using this BMI decoder, the animals performed the same task under BMI control (BC). During BC, the animals’ arms were restricted in a fixed position within the exoskeleton and the animals were required to move the cursor to the target by modulation of motor cortex activity.

Brain–machine interface

Subjects learned to control a two-dimensional BMI cursor in real-time using a fixed velocity Kalman Filter (KF) decoder68,69,70. The KF assumes two linear models:

$${x}_{t + 1} = A{x}_{t} + {w}_{t}$$
$${y}_{t} = C{x}_{t} + {q}_{t}$$

where \({x}_{t}\) and \({y}_{t}\) are the cursor state and neural activity at time \(t\), respectively. The first equation represents the state-transition model, which describes the state of the cursor over time. It is specified by the state-transition matrix \(A\) and additive Gaussian noise term \({w}_{t}\). The second equation represents the observation model and describes the relationship between neural activity and cursor state. It is parameterized by the observation matrix \(C\) and additive Gaussian noise \({q}_{t}\). Neural activity was input as a vector of spike counts in 100 ms bins from the selected direct units.
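For readers unfamiliar with the decoder, the following is a minimal sketch of a standard Kalman filter predict/update recursion for these two linear models. It is not the decoder used in the experiments: the covariance names `W` and `Q` (for the two noise terms), the state dimension, the number of units, and the Gaussian stand-in for binned spike counts are all assumptions chosen for illustration.

```python
import numpy as np

def kalman_step(x_prev, P_prev, y, A, W, C, Q):
    """One predict/update cycle; W and Q are the covariances of the two noise terms."""
    x_pred = A @ x_prev                       # predict state from the state-transition model
    P_pred = A @ P_prev @ A.T + W             # predicted state covariance
    S = C @ P_pred @ C.T + Q                  # innovation covariance
    K = P_pred @ C.T @ np.linalg.inv(S)       # Kalman gain
    x_new = x_pred + K @ (y - C @ x_pred)     # correct the prediction with the observed activity
    P_new = (np.eye(len(x_prev)) - K @ C) @ P_pred
    return x_new, P_new

# Synthetic example: a 2-D state observed through 15 simulated units (illustrative values only).
rng = np.random.default_rng(0)
n_states, n_units, n_bins = 2, 15, 500
A = 0.95 * np.eye(n_states)
W = 0.1 * np.eye(n_states)
C = rng.normal(size=(n_units, n_states))
Q = np.eye(n_units)

true_states = np.zeros((n_bins, n_states))
observations = np.zeros((n_bins, n_units))
for t in range(1, n_bins):
    true_states[t] = A @ true_states[t - 1] + rng.multivariate_normal(np.zeros(n_states), W)
    observations[t] = C @ true_states[t] + rng.multivariate_normal(np.zeros(n_units), Q)

x_hat, P = np.zeros(n_states), np.eye(n_states)
decoded = np.zeros_like(true_states)
for t in range(1, n_bins):
    x_hat, P = kalman_step(x_hat, P, observations[t], A, W, C, Q)
    decoded[t] = x_hat
```

In a closed-loop setting the decoded state would drive the cursor at every bin; here the decoded trajectory is simply compared against the simulated one.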

Decoder parameters were initialized from neural and cursor kinematic data collected during the MC version of the task at the beginning of each recording day. Maximum likelihood estimation methods were used to fit initial parameters. Neural data from 10 to 20 single- and multi-units recorded from motor cortex were selected for BMI control each day (i.e., direct units). The population of direct units was highly overlapping from day to day, but there was some variability as units dropped out or new ones appeared. For Monkey Y, closed-loop decoder adaptation (CLDA) was performed using the SmoothBatch algorithm before fixing the BMI decoder. This algorithm uses knowledge of task goals (i.e., reaching targets) to infer a subject’s intent. The intended kinematics and observed neural activity during closed-loop BMI were used to re-estimate KF parameters. The SmoothBatch algorithm71,72,73 re-estimates the observation model of the KF (matrices C and Q), and updates were constrained to enforce smoothness. CLDA was typically run for 2–5 min to provide the subject with adequate performance to allow successful reaches to all targets. For Monkey H, the initial decoder trained from MC data was used.
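As a rough illustration of the parameter fitting described above, the sketch below computes the maximum likelihood (least-squares) estimate of the observation model from paired kinematic and neural data, followed by a smoothed parameter update. The weighted-average form of the update and the value of `alpha` are assumptions made here to convey the idea of smoothness-constrained updates; the exact SmoothBatch rule and its settings should be taken from the cited references.

```python
import numpy as np

def fit_observation_model(X, Y):
    """X: (n_states, T) kinematics; Y: (n_units, T) binned activity. Returns ML estimates of C and Q."""
    C = Y @ X.T @ np.linalg.pinv(X @ X.T)     # least-squares regression of activity on state
    resid = Y - C @ X
    Q = resid @ resid.T / X.shape[1]          # residual covariance
    return C, Q

def smooth_update(old, batch, alpha=0.9):
    """Blend previous parameters with a new batch estimate (hypothetical smoothing weight alpha)."""
    return alpha * old + (1 - alpha) * batch

# Synthetic example with a known observation matrix (illustrative values only).
rng = np.random.default_rng(1)
n_states, n_units, n_bins = 2, 15, 5000
C_true = rng.normal(size=(n_units, n_states))
X = rng.normal(size=(n_states, n_bins))
Y = C_true @ X + rng.normal(size=(n_units, n_bins))

C_hat, Q_hat = fit_observation_model(X, Y)
C_next = smooth_update(np.zeros_like(C_hat), C_hat, alpha=0.9)
```

A larger `alpha` makes the decoder change more slowly between batches, which is the qualitative behavior a smoothness constraint is meant to provide.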

Neural data analysis

All analyses were performed in Python with custom-written routines utilizing publicly available software packages including scipy74, numpy75, sklearn76, and statsmodels77. Unless otherwise specified, analyses involving activity from DLPFC and Cd were performed on the average LFP signal across all channels within DLPFC and Cd, respectively. Analyses involving M1 were performed on the average LFP signal across all channels that were used to record single- or multi-unit activity that was input to the BMI decoder.

Task-type classifiers

In order to understand which of our ROIs were most predictive of control type, we trained classifiers to discriminate between task types using power from different ROIs and frequency bands as input features. We used quadratic discriminant analysis (QDA) models using singular value decomposition (SVD) from the sklearn toolbox in Python. We chose QDA as our classification model because, unlike linear classifiers, it does not assume equal variances across classes, which was essential for our 3-class task-type input data.

Power in the theta (4–8 Hz), alpha (8–13 Hz), beta (13–35 Hz), gamma (35–75 Hz), and high-gamma (75–150 Hz) frequency bands was calculated using Welch’s method on segments of LFP recorded during each control type. For BMI and manual control, power was computed in the 500 ms window following the go cue and in the 500 ms window preceding target acquisition on successful trials. Power estimates were averaged across all channels in each ROI prior to being used as input features. While no individual channels were notable outliers within each ROI, we chose to use the channel-average in our analyses in order to prevent unequal representation of ROIs with more channels than others (Supplementary Fig. 10). The same calculations were also performed in random non-overlapping 500 ms windows during the baseline period. To avoid the negative effects of imbalanced data between classes, the number of trials selected for each task type was matched to the minimum number of trials per individual task type each day by selecting a random subset of trials from the larger class to match that minimum. The corresponding number of windows were selected from the baseline period as well.
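A minimal sketch of the band-power feature extraction is given below, using `scipy.signal.welch` on a single 500 ms window sampled at 1 kHz. The segment length passed to Welch's method, the default Hann window, and summing the spectral density within each band are assumptions; the text does not specify these settings.

```python
import numpy as np
from scipy.signal import welch

FS = 1000  # Hz, LFP sampling rate after down-sampling
BANDS = {"theta": (4, 8), "alpha": (8, 13), "beta": (13, 35),
         "gamma": (35, 75), "highgamma": (75, 150)}

def band_powers(segment, fs=FS):
    """segment: (n_channels, n_samples) LFP window. Returns channel-averaged power per band."""
    # One Welch segment spanning the whole window (assumed setting): 2 Hz resolution for 500 samples.
    freqs, psd = welch(segment, fs=fs, nperseg=segment.shape[-1], axis=-1)
    df = freqs[1] - freqs[0]
    features = {}
    for name, (lo, hi) in BANDS.items():
        mask = (freqs >= lo) & (freqs < hi)
        per_channel = psd[:, mask].sum(axis=-1) * df   # approximate integral of the PSD over the band
        features[name] = per_channel.mean()            # average across channels in the ROI
    return features

# Synthetic example: 4 channels carrying a 20 Hz oscillation plus noise (illustrative values only).
rng = np.random.default_rng(0)
t = np.arange(500) / FS
segment = np.sin(2 * np.pi * 20 * t)[None, :] + 0.5 * rng.normal(size=(4, 500))
features = band_powers(segment)
strongest_band = max(features, key=features.get)
```

Each window then contributes five features per ROI, matching the feature counts described for the classifiers.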

Results for each model were validated using a tenfold cross-validation within each recording day. For each round of cross-validation, QDA classifiers were trained on 90% of data on each day (3-class: Monkey H: 151–696 trials, Monkey Y: 224–699 trials; 2-class: Monkey H: 100–464 trials, Monkey Y: 149–466 trials) and tested on 10% of data on each day (3-class: Monkey H: 16–77 trials, Monkey Y: 24–77 trials; 2-class: Monkey H: 11–51 trials, Monkey Y: 16–51 trials). To validate whether classification accuracy results were significantly above chance, we used a paired t-test to compare each result to a chance accuracy. For each frequency band and ROI combination, chance accuracy was calculated by randomly shuffling class labels and rerunning the QDA classification (see above for ranges of training and test data set sizes) 1000 times for each day.
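The cross-validation and label-shuffling procedure can be sketched with scikit-learn as follows. This is an illustrative sketch rather than the original pipeline: the stratified fold assignment, the random seeds, the synthetic features, and the reduced number of shuffles (20 instead of 1000, to keep the example fast) are assumptions.

```python
import numpy as np
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis
from sklearn.model_selection import StratifiedKFold, cross_val_score

def cv_accuracy(X, y, seed=0):
    """Mean tenfold cross-validated accuracy of a QDA classifier."""
    cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=seed)
    return cross_val_score(QuadraticDiscriminantAnalysis(), X, y, cv=cv).mean()

def chance_accuracy(X, y, n_shuffles=1000, seed=0):
    """Accuracy distribution obtained after randomly shuffling the class labels."""
    rng = np.random.default_rng(seed)
    return np.array([cv_accuracy(X, rng.permutation(y), seed) for _ in range(n_shuffles)])

# Synthetic 3-class data with unequal class variances (illustrative values only).
rng = np.random.default_rng(0)
n_per_class, n_features = 100, 5
X = np.vstack([rng.normal(loc=1.5 * k, scale=1.0 + 0.5 * k, size=(n_per_class, n_features))
               for k in range(3)])
y = np.repeat(np.arange(3), n_per_class)

accuracy = cv_accuracy(X, y)
chance = chance_accuracy(X, y, n_shuffles=20)
```

Repeating this per recording day yields paired observed and chance accuracies, which can then be compared across days with a paired t-test such as `scipy.stats.ttest_rel`.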

To determine whether particular frequency bands were useful for distinguishing between task types, we computed the QDA classification accuracy using each individual frequency band from each ROI as well as using all frequency bands from each ROI as input (Supplementary Fig. 3). We determined that all frequency bands in all ROIs could predict the task type significantly above chance and that the classifier trained on power features from all 5 frequency bands outperformed the classifiers using individual bands, so all subsequent analyses use classifiers trained on all 5 frequency bands from each ROI. This resulted in 5 features per ROI included in a QDA classifier, with models ranging from 5 to 15 features in total depending on the input ROI combination.

Target-direction classifiers

To further compare neural representations under BMI and manual control, we also trained classifiers to discriminate between the eight target-directions in the task under BMI control and under manual control separately. As in our task-type classifiers, we used power from different ROIs and frequency bands as input features (same as described above). We used linear discriminant analysis (LDA) models using SVD from the sklearn toolbox in Python. Rather than employing QDA, as we did for the task-type classifiers, we used LDA models for target-direction prediction because the training data for these classifiers contained one-eighth of the number of samples per class, which could result in overfitting with QDA models. Variances across target-directions were equal, therefore satisfying LDA’s equal variances assumption.

Data from the first 5 recording days were excluded from these analyses to ensure that there were enough successful trials per target-direction under BMI control to train a classifier without overfitting. Because there were an unequal number of successful trials under manual control and BMI control, a random selection of an equal number of BMI and manual trials per target-direction per day was selected for input into the classifiers. This ensured that all targets and control types were equally represented, allowing for a fair comparison of accuracies between the two modes of control. Results for each model were validated using a tenfold cross-validation within each included recording day. For each round of cross-validation, LDA classifiers were trained on 90% of data (8-class: Monkey H: 165–230 trials, Monkey Y: 136–230 trials) and tested on the remaining 10% of data (8-class: Monkey H: 18–25 trials, Monkey Y: 15–25 trials). To validate whether classification accuracy results were significantly above chance, we used a paired t-test to compare each result to a chance accuracy. For each frequency band and ROI combination, chance accuracy was calculated by randomly shuffling class labels and rerunning the LDA classification (see above for ranges of training and test data set sizes) 1000 times for each day.
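The trial matching and eight-class classification described above might look like the following sketch. The helper `balance_trials`, the joint control-type/direction label, the trial counts, and the synthetic features are hypothetical and serve only to illustrate the procedure.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import StratifiedKFold, cross_val_score

def balance_trials(labels, rng):
    """Return sorted trial indices with an equal number of trials for every label."""
    classes, counts = np.unique(labels, return_counts=True)
    n_min = counts.min()
    idx = [rng.choice(np.flatnonzero(labels == c), size=n_min, replace=False) for c in classes]
    return np.sort(np.concatenate(idx))

def cv_accuracy(X, y, seed=0):
    """Mean tenfold cross-validated accuracy of an LDA classifier with the SVD solver."""
    cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=seed)
    return cross_val_score(LinearDiscriminantAnalysis(solver="svd"), X, y, cv=cv).mean()

# Synthetic data: 2 control types x 8 directions with unequal trial counts (illustrative only).
rng = np.random.default_rng(0)
angles = np.arange(8) * np.pi / 4
rows, control, direction = [], [], []
for c, (lo, hi) in enumerate([(25, 30), (35, 45)]):   # 0 = BMI, 1 = manual (hypothetical counts)
    for d, theta in enumerate(angles):
        n = int(rng.integers(lo, hi))
        mean = np.zeros(5)
        mean[:2] = 2 * np.array([np.cos(theta), np.sin(theta)])
        rows.append(rng.normal(loc=mean, size=(n, 5)))
        control += [c] * n
        direction += [d] * n
X, control, direction = np.vstack(rows), np.array(control), np.array(direction)

# Match trial counts across every (control type, direction) pair, then classify per control type.
keep = balance_trials(control * 8 + direction, rng)
X, control, direction = X[keep], control[keep], direction[keep]
accuracy = {name: cv_accuracy(X[control == c], direction[control == c])
            for c, name in enumerate(["BMI", "manual"])}
```

Balancing on the joint label guarantees that both control types contribute the same number of trials to every target direction, so the two accuracies are compared on equal footing.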

Granger causality

Granger causality was used to estimate the directional functional connectivity among all pairs of regions of interest. Granger causality relies on an autoregressive (AR) modeling framework, in which future values of a time series are modeled as a weighted combination of past values of the time series. The quality of an AR model is assessed by quantifying the variance of the model’s residuals. If the variance of the AR model’s residuals is reduced by the inclusion of past measurements from a second time series, then the second time series is said to Granger-cause or G-cause the first78,79. Applying this logic, we obtained Granger causality estimates using simultaneously recorded LFP signals. We compared the Bayesian information criterion (BIC) of AR models of different orders, p, for each combination of LFP signals obtained from all pairs of regions during each trial from all recording sessions. We selected p = 13 to minimize BIC in the average case, allowing the model order to be sufficiently long to capture the data structure without over-parameterization.
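To make the residual-variance logic concrete, the sketch below estimates time-domain Granger causality as the log ratio of residual variances from a restricted AR model (own past only) and a full model (own past plus the other signal's past), fit by ordinary least squares. The function names, the least-squares fitting, the particular BIC expression, and the simulated signals are assumptions; the original analysis may have used a library implementation.

```python
import numpy as np

def lagged(x, p):
    """Design columns x[t-1], ..., x[t-p] for t = p, ..., len(x) - 1."""
    return np.column_stack([x[p - k: len(x) - k] for k in range(1, p + 1)])

def residual_variance(target, predictors):
    """Variance of ordinary least-squares residuals (with an intercept)."""
    design = np.column_stack([np.ones(len(target)), predictors])
    beta, *_ = np.linalg.lstsq(design, target, rcond=None)
    return np.var(target - design @ beta)

def granger(x, y, p):
    """G_{Y->X}: how much the past of y reduces the residual variance of an AR model of x."""
    target = x[p:]
    restricted = residual_variance(target, lagged(x, p))
    full = residual_variance(target, np.column_stack([lagged(x, p), lagged(y, p)]))
    return np.log(restricted / full)

def bic(x, y, p):
    """BIC of the full bivariate model for x at order p (one common form; an assumption)."""
    n = len(x) - p
    var = residual_variance(x[p:], np.column_stack([lagged(x, p), lagged(y, p)]))
    return n * np.log(var) + (2 * p + 1) * np.log(n)

# Synthetic 500-sample signals in which y drives x at a 2-sample lag (illustrative only).
rng = np.random.default_rng(0)
n = 500
y = rng.normal(size=n)
x = np.zeros(n)
for t in range(2, n):
    x[t] = 0.5 * x[t - 1] + 0.8 * y[t - 2] + rng.normal()

g_y_to_x = granger(x, y, p=13)
g_x_to_y = granger(y, x, p=13)
bic_by_order = {p: bic(x, y, p) for p in (2, 5, 13)}
```

In this simulation the estimate in the driving direction is clearly larger than in the reverse direction, where only a small positive bias from the extra regressors remains.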

For successful trials under BMI and manual control, we calculated Granger causality in the 500 ms average LFP segments following the go cue and those preceding target acquisition. The same calculations were also performed on random non-overlapping 500 ms average LFP segments from the baseline period. To obtain an estimate of spurious interactions, we calculated Granger causality between the average LFP during baseline and the average LFP during BMI control. Because these signals were recorded at separate times in different tasks, any interactions that occur between regions would be artifactual. True interactions between regions within baseline and BMI control were considered significant if they were statistically different from this null distribution. The average Granger causality value across BMI control trials within each day was determined and compared to that of the null distribution. A null distribution for manual trials was calculated and compared using the same protocol.

We compared ROI → ROI Granger causality (\({G}_{Y\to X}\)) in each interaction direction to the null distribution via an unpaired t-test and found that every interaction was significantly greater than the null distribution during all task types: BMI control, manual control, and baseline. In order to isolate task-relevant interactions, we computed the normalized Granger causality for BMI and manual control as follows:

$$(\mathrm{norm.}\ G)_{Y\to X,\, BMI} = \frac{{G}_{Y\to X,\, BMI} - {G}_{Y\to X,\, Baseline}}{{G}_{Y\to X,\, Baseline}}$$
$$(\mathrm{norm.}\ G)_{Y\to X,\, Manual} = \frac{{G}_{Y\to X,\, Manual} - {G}_{Y\to X,\, Baseline}}{{G}_{Y\to X,\, Baseline}}$$

Here, a normalized Granger causality greater than zero indicates an increase from baseline, whereas a normalized Granger causality less than zero indicates a decrease from baseline. To further assess the direction of information flow, we also calculated the net normalized Granger causality by taking the difference in normalized Granger causality between each pair of reciprocal ROI → ROI interactions.
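The normalization and net-direction computation amount to simple arithmetic, sketched below together with a one-sample t-test of the net value against zero. The per-day Granger values are made-up numbers for illustration, and the sign convention (first direction minus its reciprocal) is an assumption.

```python
import numpy as np
from scipy import stats

def normalized_gc(g_task, g_baseline):
    """Fractional change in Granger causality relative to baseline."""
    return (g_task - g_baseline) / g_baseline

def net_normalized_gc(norm_forward, norm_reverse):
    """Difference between reciprocal directions; positive values favor the forward direction."""
    return norm_forward - norm_reverse

# Hypothetical per-day Granger causality values (one entry per recording day).
baseline = np.array([0.20, 0.20, 0.20, 0.20])
dlpfc_to_m1_bmi = np.array([0.30, 0.33, 0.27, 0.31])
m1_to_dlpfc_bmi = np.array([0.18, 0.17, 0.19, 0.16])

norm_forward = normalized_gc(dlpfc_to_m1_bmi, baseline)   # increase from baseline
norm_reverse = normalized_gc(m1_to_dlpfc_bmi, baseline)   # decrease from baseline
net = net_normalized_gc(norm_forward, norm_reverse)

# Information flow is called significantly directed if the net value differs from zero across days.
result = stats.ttest_1samp(net, 0.0)
```

With these illustrative numbers the forward direction increases from baseline while the reverse direction decreases, so the net value is positive on every day.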

Quantification and statistical analyses

All analyses were performed within a single recording day and error is depicted across days. For analyses comparing two distributions or comparing a single distribution to a value, two-sample or one-sample t-tests were used, respectively. Bonferroni correction was used post hoc to correct for multiple comparisons. Significance is reported after correction for multiple comparisons.
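As a brief illustration of this testing scheme, the sketch below runs one-sample and two-sample t-tests with SciPy and applies a Bonferroni correction by multiplying each p-value by the number of comparisons. The data, the number of comparisons, and the chance level are hypothetical.

```python
import numpy as np
from scipy import stats

def bonferroni(p_values):
    """Bonferroni-corrected p-values: multiply by the number of comparisons, capped at 1."""
    p = np.asarray(p_values, dtype=float)
    return np.minimum(p * p.size, 1.0)

# Hypothetical per-day accuracies for 3 comparisons across 12 recording days.
rng = np.random.default_rng(0)
daily_accuracy = rng.normal(0.8, 0.05, size=(3, 12))
chance = 0.5

# One-sample tests: each distribution against a single value.
p_raw = np.array([stats.ttest_1samp(row, chance).pvalue for row in daily_accuracy])
p_corrected = bonferroni(p_raw)

# Two-sample test: comparing two distributions.
two_sample = stats.ttest_ind(daily_accuracy[0], daily_accuracy[1])
```

Significance would then be judged on `p_corrected` rather than on the uncorrected values.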

Ethical approval

All procedures were conducted in compliance with the NIH Guide for the Care and Use of Laboratory Animals and reporting in the manuscript follows the recommendations in the ARRIVE guidelines. All procedures were approved by the University of California at Berkeley Institutional Animal Care and Use Committee under protocol ID AUP-2014-09-6720-2.