This section reviews the key elements needed to construct a computational neurorehabilitation model, which are A) a quantitative description of the sensorimotor activity that the patient experiences; B) a computational model of the plasticity mediating recovery; and C) a quantitative description of the patient’s behavioral outcomes. To provide a specific context for the discussion, we again concentrate on strokes affecting motor control of the upper extremity, as much of the initial work in computational neurorehabilitation is being done in this area.
Modeling activity-dependent plasticity requires a quantitative description of activity that stimulates plasticity. Historically, sensorimotor activity during neurorehabilitation has been characterized in research studies and clinical practice primarily by the amount of time spent in assigned therapy sessions [
9,
27]. It is also possible to simulate training sessions, in order to derive theoretical inputs for models, as has been done for most initial models described below. However, one reason that computational neurorehabilitation models have the potential to soon become much more elaborate and powerful is that researchers are beginning to quantify more precisely the sensorimotor activity that a patient experiences. There has been increased interest in quantitative, observational studies, and in new sensing technologies, including robotics and wearable sensors.
Examples of the type of data available from robotic and sensor-based therapy devices include the number of movements made and the trajectories achieved while making these movements. Other key variables relate to kinetics, such as the forces applied by the robot to the patient [
56,
57] or by the patient to the robot [
58], or amount of positive and negative work done on the patient during therapy with the device [
59,
60]. Isolated sensors can also quantify the physical interaction forces and motions that therapists apply during hands-on therapy [
61]. Such biomechanical measurements can be combined with measures of Electromyography (EMG) to generate estimates of muscle activity during training, and, increasingly, brain imaging techniques, including Electroencephalography (EEG) [
62], Near-Infrared Spectroscopy (NIRS) [
63], and functional Magnetic Resonance Imaging (fMRI) [
64], to provide insight into brain activity during training.
Wrist accelerometry is typically used to detect the amount of time spent moving the arm using a thresholding approach [
73]. If sensors are worn on both arms, the amount of bimanual activity can be quantified, and the activity of the stroke-impaired arm can be compared to that of the less affected arm [
74]. Indeed most human motor activity seems to be bimanual in nature [
74,
75], which has implications for how computational neurorehabilitation models should be structured. New wearable sensing approaches are making it possible to non-obtrusively quantify finger and hand activity as well as gross arm movement during daily life [
76,
77].
Innards: modeling activity-dependent plasticity
Given a quantitative description of sensorimotor activity during stroke recovery, a computational neurorehabilitation model uses this description to drive a mathematical model of activity-dependent plasticity mechanisms. Here, we briefly overview two types of activity-dependent plasticity that will play a key role in computational neurorehabilitation models – one related to spontaneous biological recovery, and one related to motor learning. For reviews see [
78‐
80]. Note that for ease of presentation we speak of plasticity and learning rules as if they were independent form the model structure, but for most models they will not be. The model will need to consider how the necessary anatomical and functional structures support learning and plasticity, regardless of the abstraction level of the model.
Soon after stroke, abnormal cortical patterns of excitation and inhibition occur both near [
84‐
87]) and far from the lesion [
88]. Homeostatic plasticity, which is ubiquitous in the brain, acts to maintain desired firing rates and patterns [
81]. After a lesion, because of the loss of interneuronal connections, the activities of neurons neighboring the lesions, or neurons previously connected to neurons within the lesion, are affected. Homeostatic plasticity may be crucial for network recovery, as measured by re-establishment of lesion-affected inputs [
89]. In addition, sensorimotor activity might modulate this homeostatic plasticity, which is of importance for computational neurorehabilitation models, as it is one example of how sensorimotor activity appears to modulate spontaneous recovery [
6,
31,
89,
90].
LTP, LTD and neural structural plasticity such as dendritic and axonal sprouting, are also modulated by sensorimotor activity, and also change as a function of time. Following stroke, some features of brain function revert to those seen at an early stage of development, with the subsequent process of “recovery recapitulating ontogeny” [
91], but there is also a distinct, age-related pattern of gene expression, a “recovery transcriptome” [
92]. Genetic changes in the perilesional area allow for a window of increased plasticity that makes it easier for the perilesional neurons to modify existing connections and form new ones in response to sensorimotor activity [
81]. Increased LTP may also potentially lead to maladaptive plasticity and poor cortical reorganization if existing inputs are further strengthened at the expense of the reemergence of weak afferent synapses [
89]. In summary, underlying mechanisms assumed to contribute to the non-linear time course of recovery of movement in the first 3 months after stroke presumably reflect the interaction between a period of heightened plasticity mechanisms, occurring in a limited time window, and sensorimotor activity [
81,
93‐
95]. A practical implication is that, when new patterns of movement that are a consequence of specific combinations of muscle weakness (e.g. increased trunk flexion and abduction of the shoulder during reaching) are attempted repeatedly during this period of heightened plasticity, they may become the new ‘norm’ – hence patients get stuck in local minima. Further, use of the less-impaired arm may subvert the heightened plasticity of the stroke-affected hemisphere, preventing it from improving the paretic arm function [
14].
There are as yet few computational models of spontaneous biological recovery, much less of the interaction between spontaneous biological recovery and sensorimotor activity. Computational models of the effects of stroke to date have primarily focused on the network effects of deleting cells or of altered connectivity. For example, one early model used a difference-of-Gaussians connectivity pattern to explain rapid changes in the size of cellular receptive fields after stroke lesions [
96,
97]. Other models have studied interhemispheric effects of lesions [
23,
24], and used connectome data to model brain regions as graphical network nodes, evaluating the effects of node deletion on network dynamics [
98‐
100]. One of the first models to study the effects of network changes on movement kinematics evaluated the effect of lesion size on post-stroke reach variability using cortical cells that were tuned to preferred reach directions, but did not simulate plastic processes after lesion [
101]. A recent model studied the interaction between homeostatic plasticity and Hebbian-plasticity after stroke in the somatosensory cortex, and suggests that after a lesion, a delay preceding rehabilitation would allow a return of homeostatically-determined desired firing in cells neighboring the lesions, and thus may allow a faster network recovery in the rehabilitation training compared to no delay [
89]. It will be increasingly important to compare models that incorporate spontaneous biological recovery mechanisms to ones that do not, to determine how modeling these phenomena improves explanatory power. New analytical approaches to examine structural and functional connectivity within well-defined macroscopic brain networks, as briefly reviewed in Section II C below, will increasingly play a role, and will integrate plasticity rules with the necessary anatomical and functional structures.
Unsupervised learning is related to the concept of use-dependent learning, which refers to the phenomenon that the motor system can modify its performance through pure repetition of movements, without external feedback as to the success or failure of the movement [
103,
111]. Several initial models of network dynamics after stroke incorporate unsupervised learning (see
review [
25]). Unsupervised learning, together with homeostatic plasticity, likely plays a role in map and neural reorganization post-stroke, and presumably in decreasing movement variability and thereby improving functional performance [
89,
112].
Adaptation studies have shown that humans interact with novel environments by minimizing error (e) relative to the planned movement, and effort (u) [
118,
119]. This can be modeled as the minimization of the cost function.
$$ \mathrm{V}={\upalpha \mathrm{e}}^2+{\upbeta \mathrm{u}}^2,\upalpha, \upbeta >0. $$
(1)
A key recent result is that a simple neural algorithm, which is a “sunken-v”, muscle-specific activation update law that relates the error experienced in muscle coordinates to the change in in muscle activation on the next movement trial, implements this minimization, while simultaneously shaping arm impedance to the task at hand [
118‐
120]. Another factor involved in movement generation is that subjects tend to minimize time to complete an action, which stands in tradeoff with the required effort [
121].
Whereas behavioral observations suggest that at least two learning processes are involved in adaptation, it is unclear how many distinct memories the brain actually updates. In addition, it is unclear whether these putative multiple motor memories reside within a single neural system that contains a distribution of possible timescales, or in qualitatively distinguishable neural systems. A recent study addressed these issues using a model-based fMRI approach [
127]. The behavioral data of subjects adapting to two opposing visuo-motor perturbations were first used to derive a large number of possible memory “states”, each with different dynamics, which were then correlated with neural activities. Regional specificity to timescales were identified. In particular, the activity in inferior parietal region and in the anterior-medial cerebellum was associated with memories for intermediate and long timescales, respectively. A sparse singular value decomposition analysis of variability in specificities to timescales over the brain identified four components, two fast, one middle, and one slow, each associated with different brain networks. Then, a multivariate decoding analysis showed that activity patterns in the anterior-medial cerebellum progressively represented the two rotations. These results thus support the existence of brain regions associated with multiple timescales in adaptation and a role of the cerebellum in storing multiple internal models.
Note that these multiple-time constant models assume error-based learning mechanisms. A recent summary of behavioral evidence concluded that while there are at least two components of motor adaptation in response to perturbations, they cannot be fully characterized by first order processes driven by error. For example, the slow process is implicit and learns form errors, while the fast process is explicit and is sensitive to success and failure, among other key differences [
128]. The evidence for reward-based and use-dependent mechanisms in motor adaptation suggest they operate at multiple time constants as well, and are likely to be of more relevance to restitution rather than compensation [
129‐
131].
Such state-space models can account for short-term motor adaptation as well as multiple task learning and the contextual interference effect in post-stroke individuals [
132,
133]. At least one initial computational neurorehabilitation model successfully used state-space models inspired by supervised learning data [
56]. In addition, robotically amplifying errors can help stroke patients eliminate steady state directional reaching errors [
116]. Further, training with amplified errors may increase arm movement recovery after chronic stroke [
113]. The beneficial effects of sensory augmentation may be due to the larger error available to the brain for perception and for learning. A recent study however suggests that augmenting errors can decrease motivation in a way that persists beyond the experience of the augmented errors [
134], and motivation plays a key role in neurorehabilitation [
112].
Note that both supervised and reinforcement learning likely operate simultaneously as both error and reward feedback are often available [
137]. For instance for fast reaching to targets by unimpaired subjects, it has been shown that different time constants of learning, and forgetting, may be associated with supervised and reinforcement learning [
130]. In rehabilitation therapy, receiving error feedback from a therapist can be rewarding and reinforces behavior. In the absence of external feedback, the learner still has access to intrinsic feedback and this can strongly promote self-learning. Thus, what is presumed to be unsupervised learning can be instead reinforcement learning driven by self-generated feedback. Also, it is the self-generated feedback that the patient needs to rely on when returning to his or her home environment. Accordingly, whenever external feedback is provided, it is important not to become too dependent on this source of augmented information by gradually weaning the learner from external feedback during practice, i.e. to learn to rely on self-generated feedback [
137,
138].
Humans do not always appear to minimize error or maximize future rewards, however. In some instances, humans tend to perform a motor task by using the same strategy as they had used in previous trials, even if they had previously experienced a strategy using much less effort [
139‐
142]. This suggests that rather than attempting global minimization of effort, the sensorimotor system might rather repeat a strategy that it knows will achieve the goal, a finding with implications for modeling use of compensatory movements by stroke patients.
Generalization refers to the concept that training on one task can improve performance on other tasks. Patterns of generalization are complex, in that generalization has been found to be limited in some conditions [
143], but rather broad in others [
54]. For example, after training to reach in one direction with a planar robotic perturbation, there is little transfer to other directions [
144], but relatively broad generalization across certain arm postures [
145]. The concept that motor generalization is rather limited has helped drive a strong focus on task-specific training after stroke [
146]. However, a key qualifier of this concept is that the organization of practice may determine how much generalization occurs. If one trains one specific task or task variant, there may be little transfer. However if many task variants are practiced, transfer will likely be larger. This is called the variability of practice hypothesis [
147], and has clear relevance for computational neurorehabilitation models.
Finally, motor learning not only involves building new action patterns but also suppressing or modulating pre-existing patterns or synergies. This is clear in bimanual skill learning in which the learner gradually overcomes the effect of pre-existing preferred coordination modes (such as in-phase and anti-phase patterns) that are part of the intrinsic dynamics of the system in order to acquire new coordination modes (such as less intrinsic, relative phase patterns) [
75]. Like any motor learner, individuals with a stroke may be more constrained by preferred coordination modes and/or by basic synergies that need to be overcome to develop skill. Similarly, previously acquired coordination modes can hamper the acquisition of new coordination modes, a phenomenon called negative transfer [
148]. Further, neural damage itself may fundamentally constrain the solutions possible for motor learning.
Outputs: functional outcomes and kinematics
Currently, in rehabilitation, behavior is usually described with relatively coarse scales, which often sum scores of performance on many tasks, and are typically taken at widely spaced time points. Ideally, computational neurorehabilitation models will bridge the causal link between network plasticity and behavioral changes, which will require higher resolution measurements of behavior at many repeated time points.
Note that data sets that evaluate outcomes differ from the data sets discussed in Section I A in that they quantify how much and how well a patient can move, rather than the total amount and features of rehabilitation training activity. There can be overlap between the two data sets, however, in that measurements made during training can be used to assess movement outcomes, and measurements made of movement during daily life can be used to quantify both training inputs (inasmuch as daily movement serves a training function), as well as serve as a way to quantify outcomes. For instance, kinematic measurements from a robotic rehabilitation device obtained during the course of robotic therapy have been shown to predict standard functional outcomes, without the need for dedicated assessment procedures [
149].
Higher resolution outcomes data are becoming available through detailed kinematic studies of upper extremity movement in stroke recovery. In one study that serves as an example, patients with active proximal and distal limb movement within the first 2 weeks after stroke participating in the VECTORS trial were studied with kinematics and electromyography, identifying deficits in movement accuracy, reduced muscle efficiency, delayed muscle onsets, and a reduced ability to modulate muscle activity [
150,
151]. Within the first 3 months after stroke, muscle onset times and percentage of muscle capabilities were similar to a neurologically-intact control group, but deficits in the ability to modulate muscle activity remained [
151], including an inability to efficiently open and close the fingers on a target object [
152,
153]. No computational neurorehabilitation models have yet to our knowledge attempted to model these outcomes.
Other longitudinal movement data from multiple labs around the world are accumulating [
7,
154,
155]. For example, a recent kinematic study with intensive repeated measurements in the first months post stroke used principal components analysis to show that individuals with stroke learn to dissociate shoulder and elbow movements mainly in the early phase post-stroke, but do not achieve fully dissociated movements even at 26 weeks [
7]. Likewise, recovery in smoothness in reaching and hand aperture was mainly predicted by progress of time alone and almost plateaued within the first 8 weeks post stroke [
156]. Again, no models that incorporate plasticity mechanisms have yet attempted to model these findings.
Ideally, motion capture data sets would include the effect of different interventions. For example, there is an ongoing debate on the issue of whether recovery of functional movement is best achieved through restitution (such as reaching with normal kinematics) or compensation (such as using the less affected extremity or leaning forward with the trunk) [
4,
157,
158]. At the present time, there are only small amounts of movement data collected pre- and post-intervention to address this issue. In a pilot trial of intensive, progressive, task-specific upper extremity training for people with stroke [
41], kinematic and kinetic movement data were examined pre- and post-intervention to examine how movement changed [
159]. The results suggest that recovery of function via restitution versus compensation is not an all-or-none phenomenon, but varies within and across individuals. All patients demonstrated improvements in function on clinical scales. In contrast, some movement variables in some subjects indicated restitution of normal movement patterns, while other variables in the same or different subjects indicated the adoption of compensatory movement patterns [
159].
Just as wearable sensors will drive computational neurorehabilitation models with data from self-training of the arm during home exercise or daily life, they will provide the descriptors of movement recovery that the models seeks to predict. Such sensors will provide data at a much finer temporal resolution than previous clinical data, which typically are obtained only at baseline, post-intervention, and at one or two follow-ups. This fact, along with the fact that the sensors provide kinematic data, will facilitate simulation of neural networks controlling movement recovery. Such technology-based measurements are also being found to map well to clinical outcome scales [
149,
160‐
162]. Thus, these measures have validity in terms of established clinical measures, while enhancing the interpretation of these measures, facilitating more fine-grain modeling, and developing new measures. Further, sensor-based measures may catch improvements or differences in behavior when clinical assessment suffers from lack of resolution or floor/ceiling effects, e.g. [
163]. Again, we are at a propitious time for computational neurorehabilitation because of the rapid rise of new wearable sensing technologies, the data from which can be used to quantify functional outcomes important to patients in clinicians.