- 1Department of Biomedical Engineering, Duke University, Durham, NC, United States
- 2School of Electrical and Information Engineering, Tianjin University, Tianjin, China
- 3School of Automation and Electrical Engineering, Tianjin University of Technology and Education, Tianjin, China
Clinically deployed deep brain stimulation (DBS) for the treatment of Parkinson’s disease operates in an open loop with fixed stimulation parameters, and this may result in high energy consumption and suboptimal therapy. The objective of this manuscript is to establish, through simulation in a computational model, a closed-loop control system that can automatically adjust the stimulation parameters to recover normal activity in model neurons. Exaggerated beta band activity is recognized as a hallmark of Parkinson’s disease and beta band activity in model neurons of the globus pallidus internus (GPi) was used as the feedback signal to control DBS of the GPi. Traditional proportional controller and proportional-integral controller were not effective in eliminating the error between the target level of beta power and the beta power under Parkinsonian conditions. To overcome the difficulties in tuning the controller parameters and improve tracking performance in the case of changes in the plant, a supervisory control algorithm was implemented by introducing a Radial Basis Function (RBF) network to build the inverse model of the plant. Simulation results show the successful tracking of target beta power in the presence of changes in Parkinsonian state as well as during dynamic changes in the target level of beta power. Our computational study suggests the feasibility of the RBF network-driven supervisory control algorithm for real-time modulation of DBS parameters for the treatment of Parkinson’s disease.
Introduction
Parkinson’s disease (PD) is a progressive neurodegenerative disorder resulting from death of dopaminergic neurons in the substantia nigra (Titcombe et al., 2001; Novikova et al., 2006; Bras et al., 2008; Jankovic, 2008; de Paor and Lowery, 2009). Deep brain stimulation (DBS), that delivers high-frequency electrical pulses via an implanted pulse generator to focal targets in the basal ganglia (BG) including the subthalamic nucleus (STN), the globus pallidus internus (GPi), or the ventrolateral thalamus (Vim), is a widely used therapy for treating PD when drug therapy such as the administration of levodopa no longer provides adequate control of symptoms (Haeri et al., 2005; Kiss et al., 2007; Vidailhet et al., 2007; Mehta and Sethi, 2009; Follett et al., 2010; Santaniello et al., 2011). Present open-loop DBS delivers invariant stimulation with parameters selected manually based solely on previous empirical evidence. Pre-programmed stimulation is applied regardless of changes in the patient’s clinical symptoms or underlying physiological activity, and open-loop DBS is limited in terms of efficacy, side effects and efficiency (Modolo et al., 2012; Popovych and Tass, 2012; Priori et al., 2012).
Optimization of stimulation parameters according to the individual and time-varying needs of patients is necessary to improve the treatment of PD (Androulidakis et al., 2008; Steiner et al., 2017). Several studies suggested that closed-loop DBS is an effective approach to improve therapeutic efficacy while limiting side effects and prolonging battery life (Doshi et al., 2003; Rosin et al., 2011; Little et al., 2013; Wu et al., 2015). Inspired by successful clinical use of closed-loop stimulation based on ECoG recordings in the treatment of epilepsy, this approach was initially adopted for DBS parameter modulation where the stimulation signal was switched on when beta oscillatory power exceeded a pre-set threshold in a primate model of PD (Little et al., 2013) and was subsequently extended to a dual threshold algorithm (Velisar et al., 2019). The design of closed-loop DBS, which uses a feedback signal and real-time adjustment of stimulation parameters, is considered the next frontier in the field of neuromodulation (Pizzolato and Mandat, 2012; Broccard et al., 2014; Hebb et al., 2014; Arlotti et al., 2016a; Swann et al., 2018).
A range of challenges are associated with closed-loop DBS including detectable control signals that are stable and robust in the long term (Little and Brown, 2012; Hoang et al., 2017; Steiner et al., 2017), understanding the relationship between patient states and brain control signals (Buzsáki et al., 2012), closed-loop control algorithms for automatic adjustment of stimulation parameters (Pirini et al., 2009; Guo and Rubin, 2011; Gorzelic et al., 2013), and comparisons of open-loop versus closed-loop DBS and clarification of their underlying mechanisms (Parastarfeizabadi and Kouzani, 2017). Therefore, the objective of this manuscript is to develop a computational model-based closed-loop scheme to adjust automatically the stimulation parameters for suppressing abnormal oscillatory activity in the BG. Local field potential (LFP) signals directly recorded from the DBS electrode appear to be a promising source of feedback signals (Mazzoni et al., 2015), and beta-band oscillations in the LFP are related to bradykinesia and rigidity in persons with PD (Beudel et al., 2017; Rosa et al., 2017; Deffains and Bergman, 2019; Lofredi et al., 2019; Montgomery, 2020). Here, a biophysically-based computational network model serves as the plant for the design of closed-loop control systems, from which LFP signals are obtained to simulate clinically detectable and recordable signals.
The highly nonlinear dynamics of the cortex-basal ganglia-thalamus network make the selection of controller parameters a substantial challenge (Kumaravelu et al., 2016). For traditional proportional-integral-derivative (PID) control, it is difficult to select appropriate controller gains, and the dependence on the precise mathematical model of the plant means that control accuracy cannot be guaranteed (Su et al., 2019). Neural network control has several potential advantages in this application. First, the capacity of neural network controllers to represent arbitrary functions avoids the complex mathematical analysis required for traditional adaptive control theory. In addition to modeling the complex and non-linear plant, neural networks can also act as the controller and continuously adjust the internal connection weights according to learning rules to minimize a given performance index. Thus, a supervisory control method based on radial basis function (RBF) neural networks was developed in this manuscript. In section “Materials and Methods,” we introduce the feedback signal selected for closing the loop of DBS and detail the design of the closed-loop control system. The control effects of a traditional controller and the intelligent supervisory controller are analyzed and compared in section “Results,” and the results are discussed in section “Discussion.” The proposed algorithm adaptively produced effective stimulation signals in response to changes in the state (plant) and the reference (target) signal.
Materials and Methods
Cortical-Basal Ganglia-Thalamus Network Model
A biophysically-based model of the cortex-basal ganglia-thalamus network (Kumaravelu et al., 2016), modified from the original Rubin-Terman model (Rubin and Terman, 2004), was adopted as a platform to develop and evaluate the controllers. The model included representations of neurons in cortex (CTX), striatum (STR) [sum of direct striatum (dSTR) and indirect striatum (idSTR)], STN, globus pallidus (GP) (sum of externa part GPe and interna part GPi) and thalamus (TH), and all the nuclei were interconnected through either excitatory or inhibitory synaptic connections to form a network. Each nucleus contained 10 single-compartment model neurons (Figure 1).
Figure 1. Cortical-basal ganglia-thalamus network model. Gray blocks represent the collections of cortical (CTX) neurons and basal ganglia (BG) neurons, respectively. Model schematic shows connections within the network, where black lines denote inhibitory connections and orange lines denote excitatory connections. Here, the direct pathway (eCTX → dSTR → GPi → TH → eCTX), the indirect pathway (eCTX → idSTR → GPe → GPi → TH → eCTX) and the hyper-direct pathway (eCTX → STN → GPi → TH → eCTX) are depicted. As well, excitatory-inhibitory coupling exists between STN and GPe. Excitatory eCTX and inhibitory iCTX neurons also receive synaptic connections from each other. The numbers in parentheses on the arrows indicate the synaptic conductance (mS/cm2) and transmission delay (ms), respectively.
The neurons of the STN, GP, TH, and STR were modeled using Hodgkin-Huxley (HH) type equations:
where C represented the membrane capacitance and was set to 1μF/cm2 for all cell models, vi(i ∈ {STN,GP,TH,STR}) represented the transmembrane potential of the corresponding model neuron and was expressed in mV. INa,IK,andIl were the sodium current, potassium current and non-specific leak current, IL,IT,andICa were, respectively, the L-type, T-type, and high-threshold calcium current, and Ia,Im,ICak,andIAHP were, respectively, A-type, outward M-type potassium, calcium-dependent and after-threshold potassium current. Parameters and equations of the ionic currents are provided in Table 1. Therein, a, b, c, d1, d2, h, m, n, p, q, and r were activation or inactivation variables, and the gating kinetics took the form
Table 1. Equations and parameters for subthalamic nucleus (STN), globus pallidus (GP), TH, and striatum (STR) model neurons.
where X represented one of a, b, c, d1, d2, h, m, n, p, q, or r. Steady-state gating variables were calculated using
where wX and σX were the half voltage and slope, respectively. Gating kinetics for STR took the form
Isyn represented the sum of synaptic currents, with each projection from presynaptic neuron α to postsynaptic neuron β(α,β ∈ {CTX,STR,STN,GP,TH}) given by Iα→β = gα,β×(vβ−Esyn)×S, where gα,β described the maximal synaptic conductance, and Esyn represented the reversal potential (uniformly set as −85 mV). An alpha synapse S was used to model the synaptic dynamics
where td was the synaptic transmission delay, and τrepresented the time constant of 5 ms. Further, bias currents Iapp_i(i ∈ {GPe,GPi,TH}) represented other synaptic inputs that were not described explicitly in this model.
The dynamics of the CTX neurons were described based on the model developed by Izhikevich (2003)
where vCTX represented the transmembrane potential and uCTX was the recovery variable. The time scale of the recovery variable uCTX was chosen as a = 0.02 and a = 0.1, respectively, for excitatory-CTX (eCTX) and inhibitory-CTX (iCTX) neurons. If the transmembrane potential of CTX neuron exceeded 30 mV, then vCTX was set to a resting potential equal to -65 mV and uCTX was set to uCTX + d (for eCTX, d = 8; for iCTX, d = 2). IinterCTX represented the reciprocal synaptic current from iCTX neurons to eCTX neurons or that from eCTX neurons to iCTX neurons, and ITH→CTX was the synaptic input from TH.
The Parkinsonian state was simulated by the adjustment of model parameters implemented using a parkinsonism variable, pd, where pd = 0 and pd = 1 were defined as the healthy state and full Parkinsonian state, respectively. The M-type potassium current in striatal neurons was reducedgm = 2.6−0.9×pd, cortico-striatal coupling strength was decreased gCTX,STR = 0.07−0.044×pd and coupling strength between GPe neurons was increasedgGPe,GPe = 0.0125 + 0.0375×pd. To quantify the difference between the healthy and Parkinsonian states, changes of firing rates and firing patterns of model neurons were analyzed. It was considered that one neuron produced a spike or action potential once its transmembrane potential was greater than the threshold Vthre = −20mV, with the time of crossing the threshold defined as the firing time. The average firing rates were calculated based upon the firing time during the entire simulation period. In addition, spike synchrony that characterizes the dynamic patterns in each population of model neurons was measured. Defining vi(t) as the membrane potential time course of the jth neuron from a population of n neurons, then we could average over the population . The variance of membrane potential vi(t) and the variance of the averaged membrane potential V(t) were expressed as and , respectively referred to the average value of the variables within the time of T), and the level of synchrony was calculated according to the following equation,
where χ was normalized between 0 and 1, with χ = 0 indicating neurons within one population fire out of sync and χ = 1 indicating neurons within that population discharge synchronously.
Control Problem Description
Parkinson’s disease is characterized by diverse changes in neuronal activity, and single neuron action potentials, electrocorticograms, LFPs, and electroencephalograms have been considered as feedback control signals for closed-loop DBS (Hoang et al., 2017). The LFP generated by model GPi neurons was adopted as the feedback signal for closed loop control. A simple average of transmembrane potentials was adopted to calculate the LFP of the modeled population due to its ability to capture subthreshold activity and thereby reflect oscillatory phenomena (Pettersen et al., 2012; Mazzoni et al., 2015). Expression of the GPi LFP was given as
Here, n = 10 represented the total number of GPi neurons and vGPi_i corresponded to the transmembrane potential of the ith GPi neuron. The power within particular frequency bands of the LFP signal was determined from power spectra using the Chronux neural signal analysis package [length of moving window 1 s, step size 0.1 s and tapers in the form of [3 5] (3 is the time-bandwidth product and 5 is the number of tapers to be used)], and the beta band power was defined as the total power over 13–30 Hz.
Our goal was to design an adaptive closed-loop controller to adjust automatically the stimulation signals delivered to the model neurons of the GPi based on the beta LFP activity calculated from the model GPi neurons as the feedback control signal. We defined the stimulation signal as Isti and delivered it directly to each GPi neuron, and the resulting transmembrane potential was expressed as
Isti was constructed by using the controller output u(t) to construct a variable frequency pulse train stimulation signal [amplitude of 300μA/cm2, pulse duration of 0.3 ms and period of 1,000/u(t) ms (frequency of u(t))]. After the end of each stimulation period, we recalculated the beta power of the LFP signal and repeated the above steps to update continuously the optimal controller output. The transmembrane potentials of model neurons in that other nuclei were unaffected by direct electrical stimulation but their activity was influenced during stimulation via either excitatory or inhibitory synaptic connections.
The classical error-based PID control law has the form
where e(t) = yd(t)−y(t)(yd(t) and y(t) represented the beta power in LFP signals of the GPi from healthy control and controlled Parkinsonian states, respectively). The performance of the PID controller depends greatly on selecting the appropriate gains, and this can be a time-consuming manual process. As the LFP was a highly dynamic variable subject to large changes, differential action might amplify noise interference. Therefore, the differential term was omitted, and both proportional (P) and proportional-integral (PI) controllers were designed to minimize the error between the desired and measured beta band power in the LFP.
A stable self-tuning controller was designed using a dynamic RBF network. Figure 2 is a block diagram of the RBF supervisory control system and a schematic diagram of the RBF network. The neural network-based controller acted as a feedforward controller, by building an inverse model of the controlled plant. The input layer, hidden layer and output layer determined the structure of the RBF neural network. x = [x1,x2,…,xi,…,xn]T (n was the number of input layer nodes) represented the network input vector and h = [h1,h2,…,hj,…,hm]T (m was the number of hidden layer nodes) represented the hidden layer output. Each hidden layer node had a central value cj, the Euclidean distance of which to network input xi was described as ||xi−cj||. As well, each hidden layer node was an arithmetic element with activation function given by
Figure 2. Block diagram of improved supervisory algorithm under the guidance of radial basis function (RBF) network. Stimulation signal u(t) is applied to GPi model neurons and the simulated beta power y(t) is obtained from LFP of the GPi. The stimulation signal u(t) is determined by the joint action of the P controller output up(t) and RBF network controller output urbf(t).
The center vector c = [c1,c2,…,cm] and the width vector b = [b1,b2,…,bm]T determined the influence of the Gaussian function, where the width of the Gaussian basis function directly influenced the mapping capability to network input, while the center value correlated with its sensitivity to network input.
If we set the weight vector as w = [w1,w2,…,wm]T, then the network output can be obtained as
The structure of the RBF network was selected as 1−nm−1, that is, containing 1 input layer node, nm hidden layer nodes and 1 output layer node. The number of hidden layer nodes was set to 11, network weights were initially set to random values between 0 and 1, and parameters of Gaussian function were set as c = [−2,−1,0,1,2]T, b = [5,5,5,5,5]T. The weights of the RBF controller were continuously adjusted on-line to make the feedback error e(t) approach zero, which equated to up(t) approaching zero. Consequently, the RBF controller gradually occupied the leading position and even replaced the function of the P/PI controller. The RBF network error index was designed in the form of , to lead up(t)(e(t)) to converge to 0. Considering that the total controller output was the sum of the traditional P/PI controller and the adaptive RBF controller u(t) = urbf(t) + up(t), the error index can be written as
According to the gradient descent method, the network weights were adjusted as follows
Further, applying the gradient descent method to the adjustment of c and b will optimize effective learning by the RBF network, thus we had
where η ∈ (0,1) represented the learning rate η = 0.30and α ∈ (0,1) represented the momentum factor α = 0.05.
A quantitative index of the control effect was defined as the root mean square error between the controlled output and the reference signal,
where N represented the sampling point of the feedback signal.
Results
The biophysically-based cortical-Basal-thalamus network model was used to test the effectiveness of closed-loop DBS. Performance of the RBF network-based supervisory algorithm was evaluated by considering changes in the state of the plant intended to represent dynamics including cycling of medication and progress of the disease, as well as dynamic changes in the reference (target) signal.
Firing Rates and Firing Patterns of Model BG Neurons
The transmembrane potentials of model GPe, GPi, and STN neurons in the cortical-Basal-thalamus network model are displayed in Figure 3. The Parkinsonian condition resulted in changes in both the rate and pattern of model neuron activity. As a result of excitation via the indirect pathway and hyper-direct pathway together with inhibition via the direct pathway, firing rates in the Parkinsonian condition increased in STN and GPi model neurons and decreased in GPe model neurons (Figure 4A), consistent with previous experimental studies (Kita and Kita, 2011). Moreover, increased spike synchrony was observed across all nuclei in the Parkinsonian state compared to the healthy state (Figure 4B). These indexes were analyzed using one-way analysis of variance (ANOVA), which revealed a significant difference in firing rate (STN: F = 901 and p < 0.001, GPe: F = 48 and p < 0.001, and GPi: F = 184 and p < 0.001) and synchrony index (STN: F = 239 and p < 0.001, GPe: F = 1554 and p < 0.001, and GPi: F = 62 and p < 0.001) between healthy state and Parkinsonian state. The model thus exhibited features of the pathophysiological neural activity occurring in PD and was a suitable testbed to develop and analyze closed-loop control strategies.
Figure 3. Transmembrane potential as a function of time of model neurons in STN, GPe, and GPi. The blue traces depict model neuron activity in the healthy condition while orange traces depict model neuron activity in the Parkinsonian condition.
Figure 4. Characterization of model neuron activity in healthy and Parkinsonian conditions. (A) Firing rates (mean ± standard error) for model STN, GPe, and GPi neurons in healthy (blue) and Parkinsonian (orange) conditions. Values are averaged across three runs for each nucleus. (B) Spike synchrony (mean ± standard error) for model STN, GPe, and GPi neurons under healthy (blue) and Parkinsonian (orange) conditions. All model neurons exhibit increases in spike synchrony in the Parkinsonian state as compared to the healthy state. (*** represented a significant difference, p < 0.001).
The GPi, which is clinically accessible for both recording of LFPs and delivery of DBS, was selected as the source of the feedback signal for closed loop DBS (Figures 5A,B). Compared to the healthy state, where the LFP exhibited little power in the beta band, the LFP in the Parkinsonian condition exhibited oscillatory activity, generating a significant peak in the power spectrum (Figure 5C). The LFP signals were filtered within the beta band to extract differences between healthy and Parkinsonian states, and the filtered GPi LFP activity in the healthy state (Figure 5D) served as a reference to guide the modulation of Parkinsonian neural activity by DBS, thus constituting a closed-loop system to suppress exaggerated beta oscillatory activity (Figure 5E).
Figure 5. Local field potential (LFP) activity from model neurons in the GPi. In panels (A,B), blue trace depicts the LFP in the healthy state while the orange trace depicts the LFP in the Parkinsonian state. Panel (C) illustrates the power spectral density of the GPi LFP across 10 trails to quantify the corresponding oscillatory activity, where shaded error region represents standard errors. Panels (D,E) depict the band-pass filtered (13–30 Hz) LFP activity in the GPi.
Limitation of Traditional P and PI Controllers
We first quantified the relationship between the stimulation frequency and the beta power (Figure 6), where f = 0 is equivalent to the Parkinsonian state without DBS. Low frequency stimulation actually increased beta power, and the beta power was progressively reduced as the stimulation frequency was increased higher than 50 Hz. A beta power of 120 dB was calculated from the healthy state and set as the desired state, and the range of stimulation frequencies was between 5 and 200 Hz. The modulation of DBS was reformulated as a train of monophasic pulses with an initial stimulation frequency of 5 Hz, and the updated stimulation frequency was generated based upon the error between the actual and target beta power in the LFP from the GPi model neurons.
Figure 6. Relationship between the deep brain stimulation pulse repetition frequency and the beta power in the LFP from the GPi (mean ± standard error for 50 trials).
The performance of the P controller and PI controller in regulating the beta band LFP in the GPi with GPi DBS were compared (Figures 7, 8), where the blue dotted lines represented the healthy state and the orange traces represented the controlled PD state. Small kp appeared to make no difference to the suppression of beta band activity, while large kp caused strong oscillations between effective and ineffective suppression of beta band activity (Figure 7). The performance following addition of the integral controller with different combinations of kp and ki was assessed (Figure 8). The control of beta power in the GPi LFP was strongly dependent on the selection of kp and ki, where effective values (kp = 0.5,ki = 0.5) promoted the suppression of high beta power while ineffective values (kp = 0.1,ki = 0.01) did not contribute to improvement of the PD state. The search of proportional and integral gains through trial-and-error produced fluctuations in performance and created uncertainty about the effectiveness of closed-loop DBS, especially in the face of changes in the properties of the plant.
Figure 7. Control effect of P controller with beta power as the feedback signal. Blue dotted lines represent the desired GPi beta power that recorded from healthy state, and orange traces represent feedback GPi beta power from the controlled state. Black solid lines denote the beginning of stimulation. (A) kp = 0.01, (B) kp = 0.1, (C) kp = 1, (D) kp = 10.
Figure 8. Control effect of PI controller with beta power as the feedback signal. Blue dotted lines represent the desired GPi beta power that recorded from healthy state, and orange traces represent feedback GPi beta power from the controlled state. Black solid lines denote the beginning of stimulation. (A) kp = 0.1,ki = 0.01, (B) kp = 0.1,ki = 0.5, (C) kp = 0.1,ki = 2, (D) kp = 0.5,ki = 0.01, (E) kp = 0.5,ki = 0.5, (F) kp = 0.5,ki = 2.
A substantial improvement in control performance was achieved by the supervisory algorithm, where suppression of the exaggerated beta power present in the Parkinsonian condition was achieved within 1 s (Figure 9A). The weights of the RBF network were adjusted in real-time, in response to the update of the beta power of the GPi (Figure 9B). As the beta power was gradually suppressed, the RBF network took over the leading role that the P/PI controller played in the initial control stage (Figure 9C). Ultimately, the DBS pulse repetition frequency was calculated as shown in Figure 9D, and the DBS signal is depicted in Figure 9E. Changes in beta power between the healthy state, open-loop 130 Hz DBS, P controller, PI controller and adaptive DBS were compared by calculating the root mean square error between the controlled output signal and the reference signal during the last 4 s of simulation. The averaged RMSE were 18.64, 112.41, 39.38, and 23.89, respectively, In cases of open-loop 130 Hz DBS, P control (kp = 0.1), PI control (kp = 0.5,ki = 0.5) and combined P and RBF control, indicating that the improved supervisory algorithm drive the beta power to the target setting with a higher accuracy.
Figure 9. Feedback control using the RBF controller with beta power as the control signal (kp = 0.1). Panel (A) depicts the dynamic process of the controller reducing beta power in the GPi, panel (B) shows the evolution of real-time updated weights of the RBF network, (C) plots the trend of P controller and RBF controller, respectively, (D) generates the DBS pulse repetition frequency. Panel (E) is the stimulation signal, a series of 0.3 ms duration 300μA/cm2 amplitude pulses with the instantaneous frequency determined by the controller.
Evaluation of Robustness of the Supervisory Algorithm
The closed-loop RBF network-based supervisory control algorithm achieved effective tracking of the target beta power, but several challenges required further consideration. First, Parkinson’s disease is a chronic and progressive disease in which the patient’s condition gradually worsens over time, and individual variation should be considered in model-based evaluations. Second, the reference waveform was the LFP signal from the healthy network model. Although this signal carries abundant information, it does not represent the variety of different disease states, for example during cycling of medication, and this potentially limits the generalizability of the control system.
To evaluate further the performance of the proposed closed-loop algorithm in the face of changes in the Parkinsonian state (Figure 10), the parkinsonism variable pd was randomly generated from 0 to 1. Controlled beta power gradually converged to the desired healthy signal after 1 s, demonstrating the adaptive capability of the RBF network across disease states. In addition, beta power exhibits dynamic changes, especially prior to and during movement, and thus tracking of time-varying beta power may be required to promote desired movement behavior. In the face of a time varying reference beta power signal switching at 1 Hz (Figure 11), the controlled beta power still followed the dynamic reference signal, albeit with substantial overshoot.
Figure 10. Robustness analysis of the RBF controller in the presence of dynamic changes in the Parkinsonian state (kp = 0.1). (A) Dynamic change of Parkinsonian state is characterized by the parameter, pd. The RBF-controller modulated beta power during dynamic changes is depicted in the bottom panel. Panel (B) shows the evolution of real-time updated weights of the RBF network, (C) plots the trend of P controller and RBF controller, and (D) generates the DBS pulse repetition frequency. Here, the stimulation amplitude is set to 300μA/cm2 and the pulse duration is set to 0.3 ms.
Figure 11. Robustness analysis of the RBF controller during dynamic changes in the reference beta power (kp = 0.1). (A) Dynamic change of the reference beta power is characterized by the blue dotted line. The RBF-controller modulated beta power is depicted in the bottom panel. Panel (B) shows the evolution of real-time updated weights of the RBF network, (C) plots the trend of P controller and RBF controller, and (D) generates the DBS pulse repetition frequency. Here, the stimulation amplitude is set to 300μA/cm2 and the pulse duration is set to 0.3 ms.
Discussion
This manuscript proposed an improved supervisory control algorithm for adaptively adjusting the stimulation signal to improve DBS control of the Parkinsonian state. Myriad control algorithms have been applied to the design of closed-loop DBS system, for example, on-off control, dual-threshold control, delayed feedback control, PID control, fuzzy control, and model predictive control (Little et al., 2013; Liu et al., 2015, 2016; Popovych et al., 2017; Su et al., 2019; Velisar et al., 2019). For on-off control and dual-threshold control, a stimulus was triggered by the control signal exceeding or falling below a threshold. Energy consumption was reduced as compared to traditional open-loop stimulation, but selection of optimal stimulation parameters during the DBS-on stage still needed to be addressed. The design of delayed feedback controller or PID controller depended strongly on the selection of controller gains and delay time constant, and the performance was dependent on the plant. Thus, such controllers may exhibit limited adaptability for individual variations due to, for example, changes in medication status or active versus inactive state. More advanced control algorithms, for example fuzzy control and model predictive control, have been developed for modulation of DBS parameters. However, the robustness of the control algorithm was improved at the cost of using a non-standard signal–the unmodulated controller output was applied directly to the stimulated targets, and this may be difficult to implement with an implanted pulse generator. In addition, Gao et al. (2020) proposed a deep reinforcement learning-based approach to construct an adaptive DBS framework. The reinforcement signal provided by the environment was an evaluation of the quality of the action that the agent produced. It should be noted that the external environment yielded evaluations (reward or punishment) rather than correct answers to the output of the learning system, and the performance of the learning system was improved by reinforcing the actions that were rewarded. Neural networks have the capacity to approximate arbitrary complex nonlinear systems, and an RBF network was adopted in this manuscript for selecting the appropriate pulse repetition frequency of DBS. The proposed RBF-based algorithm constituted supervised learning that provided a corresponding target output for each input. Through the feedback structure, the stability and robustness can be guaranteed, and the precision and adaptability were improved.
A biophysiologically feasible cortex-basal ganglia-thalamus computational network model that represents the Parkinsonian state in 6-OHDA lesioned rats was used as the plant and to calculate LFP signals and the effects of DBS. DBS of the STN or the GPi are currently the most common and effective surgical targets for the treatment of PD, but there does not appear to be one superior target. Several studies compared the efficacy of stimulating STN versus GPi and that both STN-DBS and GPi-DBS are equally effective in improving motor dysfunction (Honey et al., 2017; Zhang et al., 2021). STN-DBS contributes to more significant medication reduction and is favorable to decrease energy consumption due to the smaller stimulating region, but STN-DBS appears to increase the incidence of psychiatric complication. If medication reduction is not a major concern, GPi-DBS has the advantage of direct dyskinesia suppression. LFP signals from the GPi, which can be directly obtained through DBS leads in clinical application (Stanslaski et al., 2012; Arlotti et al., 2016b; Parastarfeizabadi and Kouzani, 2019) and carry abundant potential information from synchronous neural activity were extracted and processed as the feedback signal for closing the loop (Little and Brown, 2012; Priori et al., 2012; Hebb et al., 2014; Arlotti et al., 2016a; Sinclair et al., 2018). The design of the closed-loop control system followed a traditional strategy. For such a highly nonlinear and complex plant, the selection of optimal proportional gains was challenging, and simulation results illustrated that the P controller did not achieve effective tracking of the reference signal. The RBF neural network exhibits both self-learning and self-adaptation and was the foundation for constructing an intelligent control system. The improved supervisory control algorithm with the RBF network controller showed satisfactory tracking performance and was able to regulate the beta oscillatory power across dynamic changes in the plant and the reference signal.
The proposed algorithm has several potential advantages for clinical implementation. First, although the closed-loop control algorithm was designed based on a biophysical model of the cortex-basal ganglia-thalamus network, precise parameters (e.g., synaptic conductance, reversal potentials) and network structure (e.g., synaptic connectivity) were not necessary since the RBF network builds an inverse representation based on input output information. Second, DBS stimulation signals were delivered through and LFP recordings were obtained from same implanted electrode, thereby avoiding the requirement of additional external sensors. A limitation of this simulation study is the setting of the desired tracking signal, and the variable dynamics of the cortical-Basal-thalamus network were not fully considered. Further exploration combined with the selection of biological markers that relate to specific symptoms and states remains an important challenge. Further, understanding the relationship between stimulation parameter changes and changes in specific patient symptoms, including the time course of such changes, is crucial to improving clinical treatment. For example, data-driven input-output model identification might be a promising solution for quantifying responsiveness to specific stimulation signals.
Data Availability Statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author Contributions
YZ was responsible for manuscript development, concept, and study design, and wrote the manuscript. WG conceived the project, and supervised the study design and manuscript development. JW collaborated in manuscript development and concept. HL and CL collaborated in methodology design and implementation. All authors read and approved the final manuscript.
Funding
This work was supported in part by grant R37 NS040894 from the US National Institutes of Health.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Androulidakis, A. G., Brücke, C., Kempf, F., Kupsch, A., Aziz, T., Ashkan, K., et al. (2008). Amplitude modulation of oscillatory activity in the subthalamic nucleus during movement. Eur. J. Neurosci. 27, 1277–1284. doi: 10.1111/j.1460-9568.2008.06085.x
Arlotti, M., Rosa, M., Marceglia, S., Barbieri, S., and Priori, A. (2016a). The adaptive deep brain stimulation challenge. Parkinsonism Relat. Disord. 28, 12–17. doi: 10.1016/j.parkreldis.2016.03.020
Arlotti, M., Rossi, L., Rosa, M., Marceglia, S., and Priori, A. (2016b). An external portable device for adaptive deep brain stimulation (aDBS) clinical research in advanced Parkinson’s Disease. Med. Eng. Phys. 38, 498–505. doi: 10.1016/j.medengphy.2016.02.007
Beudel, M., Oswal, A., Jha, A., Foltynie, T., Zrinzo, L., Hariz, M., et al. (2017). Oscillatory beta power correlates with Akinesia-rigidity in the Parkinsonian subthalamic nucleus. Mov. Disord. 32, 174–175. doi: 10.1002/mds.26860
Bras, J., Singleton, A., Cookson, M. R., and Hardy, J. (2008). Emerging pathways in genetic Parkinson’s disease: potential role of ceramide metabolism in Lewy body disease. FEBS J. 275, 5767–5773. doi: 10.1111/j.1742-4658.2008.06709.x
Broccard, F., Mullen, T., Chi, Y., Peterson, D., Iversen, J., Arnold, M., et al. (2014). Closed-loop brain-machine-body interfaces for noninvasive rehabilitation of movement disorders. Ann. Biomed. Eng. 42, 1573–1593. doi: 10.1007/s10439-014-1032-6
Buzsáki, G., Anastassiou, C. A., and Koch, C. (2012). The origin of extracellular fields and currents–EEG, ECoG, LFP and spikes. Nat. Rev. Neurosci. 13, 407–420. doi: 10.1038/nrn3241
de Paor, A. M., and Lowery, M. M. (2009). Analysis of the mechanism of action of deep brain stimulation using the concepts of dither injection and the equivalent nonlinearity. IEEE Trans. Biomed. Eng. 56, 2717–2720. doi: 10.1109/TBME.2009.2019962
Deffains, M., and Bergman, H. (2019). Parkinsonism-related β oscillations in the primate basal ganglia networks–recent advances and clinical implications. Parkinsonism Relat. Disord. 59, 2–8. doi: 10.1016/j.parkreldis.2018.12.015
Doshi, P. K., Chhaya, N. A., and Bhatt, M. A. (2003). Bilateral subthalamic nucleus stimulation for Parkinson’s disease. Neurol. India 51, 43–48.
Follett, K. A., Weaver, F. M., Stern, M., Hur, K., Harris, C. L., Luo, P., et al. (2010). Pallidal versus subthalamic deep-brain stimulation for Parkinson’s Disease. N. Engl. J. Med. 362, 2077–2091. doi: 10.1056/NEJMoa0907083
Gao, Q., Naumann, M., Jovanov, I., Lesi, V., Kamaravelu, K., Grill, W. M., et al. (2020). “Model-based design of closed loop deep brain stimulation controller using reinforcement learning,” in Proceedings of the 2020 ACM/IEEE 11th International Conference on Cyber-Physical Systems (ICCPS), (Sydney, NSW: IEEE), 108–118. doi: 10.1109/ICCPS48487.2020.00018
Gorzelic, P., Schiff, S., and Sinha, A. (2013). Model-based rational feedback controller design for closed-loop deep brain stimulation of Parkinson’s disease. J. Neural Eng. 10:026016. doi: 10.1088/1741-2560/10/2/026016
Guo, Y., and Rubin, J. E. (2011). Multi-site stimulation of subthalamic nucleus diminishes thalamocortical relay errors in a biophysical network model. Neural Netw. 24, 602–616. doi: 10.1016/j.neunet.2011.03.010
Haeri, M., Sarbaz, Y., and Gharibzadeh, S. (2005). Modeling the Parkinson’s tremor and its treatments. J. Theor. Biol. 236, 311–322. doi: 10.1016/j.jtbi.2005.03.014
Hebb, A., Zhang, J., Mahoor, M., Tsiokos, C., Matlack, C., Chizeck, H., et al. (2014). Creating the Feedback Loop. Neurosurg. Clin. North Am. 25, 187–204. doi: 10.1016/j.nec.2013.08.006
Hoang, K., Cassar, I., Grill, W., and Turner, D. (2017). Biomarkers and stimulation algorithms for adaptive brain stimulation. Front. Neurosci. 11:564. doi: 10.3389/fnins.2017.00564
Honey, C. R., Hamani, C., Kalia, S. K., Sankar, T., Picillo, M., Munhoz, R. P., et al. (2017). Deep brain stimulation target selection for Parkinson’s disease. Can. J. Neurol. Sci. 44, 3–8. doi: 10.1017/cjn.2016.22
Izhikevich, E. M. (2003). Simple model of spiking neurons. IEEE Trans. Neural Netw. 14, 1569–1572. doi: 10.1109/TNN.2003.820440
Jankovic, J. (2008). Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79, 368–376. doi: 10.1136/jnnp.2007.131045
Kiss, Z. H. T., Doig-Beyaert, K., Eliasziw, M., Tsui, J., Haffenden, A., Suchowersky, O., et al. (2007). The Canadian multicentre study of deep brain stimulation for cervical dystonia. Brain 130, 2879–2886. doi: 10.1093/brain/awm229
Kita, H., and Kita, T. (2011). Cortical stimulation evokes abnormal responses in the dopamine-depleted rat basal ganglia. J. Neurosci. 31, 10311–10322. doi: 10.1523/JNEUROSCI.0915-11.2011
Kumaravelu, K., Brocker, D. T., and Grill, W. M. (2016). A biophysical model of the cortex-basal ganglia-thalamus network in the 6-OHDA lesioned rat model of Parkinson’s disease. J. Comput. Neurosci. 40, 207–229. doi: 10.1007/s10827-016-0593-9
Little, S., and Brown, P. (2012). What brain signals are suitable for feedback control of deep brain stimulation in Parkinson’s disease? Ann. N. Y. Acad. Sci. 1265, 9–24. doi: 10.1111/j.1749-6632.2012.06650.x
Little, S., Pogosyan, A., Neal, S., Zavala, B., Zrinzo, L., Hariz, M., et al. (2013). Adaptive deep brain stimulation in advanced Parkinson disease. Ann. Neurol. 74, 449–457. doi: 10.1002/ana.23951
Liu, C., Wang, J., deng, B., Wei, X., Yu, H.-T., and Li, H.-Y. (2015). Variable universe fuzzy closed-loop control of tremor predominant Parkinsonian state based on parameter estimation. Neurocomputing 151, 1507–1518. doi: 10.1016/j.neucom.2014.10.028
Liu, C., Wang, J., Li, H., Lu, M., deng, B., Yu, H., et al. (2016). Closed-loop modulation of the pathological disorders of the basal Ganglia Network. IEEE Trans. Neural Netw. Learn. Syst. 28, 371–382. doi: 10.1109/TNNLS.2015.2508599
Lofredi, R., Tan, H., Neumann, W.-J., Yeh, C. H., Schneider, G.-H., Kühn, A., et al. (2019). Beta bursts during continuous movements accompany the velocity decrement in Parkinson’s disease patients. Neurobiol. Dis. 127, 462–471. doi: 10.1016/j.nbd.2019.03.013
Mazzoni, A., Lindén, H., Cuntz, H., Lansner, A., Panzeri, S., and Einevoll, G. (2015). Computing the Local Field Potential (LFP) from Integrate-And-Fire Network Models. PLoS Comput. Biol. 11:e1004584. doi: 10.1371/journal.pcbi.1004584
Mehta, S. H., and Sethi, K. D. (2009). Bilateral deep brain stimulation versus best medical therapy for patients with advanced Parkinson’s disease. Curr. Neurol. Neurosci. Rep. 9, 266–267. doi: 10.1007/s11910-009-0039-0
Modolo, J., Beuter, A., Thomas, A. W., and Legros, A. (2012). Using “Smart Stimulators” to treat Parkinson’s disease: re-engineering neurostimulation devices. Front. Comput. Neurosci. 6:69. doi: 10.3389/fncom.2012.00069
Montgomery, E. (2020). Debugging adaptive deep brain stimulation for Parkinson’s disease. Mov. Disord. 35, 1891–1891. doi: 10.1002/mds.28235
Novikova, L., Garris, B. L., Garris, D. R., and Lau, Y.-S. (2006). Early signs of neuronal apoptosis in the substantia nigra pars compacta of the progressive neurodegenerative mouse 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine/probenecid model of Parkinson’s disease. Neuroscience 140, 67–76. doi: 10.1016/j.neuroscience.2006.02.007
Parastarfeizabadi, M., and Kouzani, A. (2017). Advances in closed-loop deep brain stimulation devices. J. Neuroeng. Rehabil. 14:79. doi: 10.1186/s12984-017-0295-1
Parastarfeizabadi, M., and Kouzani, A. (2019). A miniature dual-biomarker-based sensing and conditioning device for closed-loop DBS. IEEE J. Transl. Eng. Health Med. 7:2000308. doi: 10.1109/JTEHM.2019.2937776
Pettersen, K., Linden, H., Dale, A., and Einevoll, G. (2012). “Extracellular spikes and CSD,” in Handbook of Neural Activity Measurement, eds R. Brette and A. Destexhe (Cambridge: Cambridge University Press), 92–135. doi: 10.1017/CBO9780511979958.004
Pirini, M., Rocchi, L., Sensi, M., and Chiari, L. (2009). A computational modelling approach to investigate different targets in deep brain stimulation for Parkinson’s disease. J. Comput. Neurosci. 26, 91–107. doi: 10.1007/s10827-008-0100-z
Pizzolato, G., and Mandat, T. (2012). Deep brain stimulation for movement disorders. Front. Integr. Neurosci. 6:2. doi: 10.3389/fnint.2012.00002
Popovych, O. V., Lysyansky, B., and Tass, P. A. (2017). Closed-loop deep brain stimulation by pulsatile delayed feedback with increased gap between pulse phases. Sci. Rep. 7:1033. doi: 10.1038/s41598-017-01067-x
Popovych, O. V., and Tass, P. A. (2012). Desynchronizing electrical and sensory coordinated reset neuromodulation. Front. Hum. Neurosci. 6:58. doi: 10.3389/fnhum.2012.00058
Priori, A., Foffani, G., Rossi, L., and Marceglia, S. (2012). Adaptive deep brain stimulation (aDBS) controlled by local field potential oscillations. Exp. Neurol. 245, 77–86. doi: 10.1016/j.expneurol.2012.09.013
Rosa, M., Arlotti, M., Marceglia, S., Cogiamanian, F., Ardolino, G., Di Fonzo, A., et al. (2017). Adaptive deep brain stimulation controls levodopa-induced side effects in Parkinsonian patients: DBS controls levodopa-induced side effects. Mov. Disord. 32, 628–629. doi: 10.1002/mds.26953
Rosin, B., Slovik, M., Mitelman, R., Rivlin-Etzion, M., Haber, S. N., Israel, Z., et al. (2011). Closed-loop deep brain stimulation is superior in ameliorating Parkinsonism. Neuron 72, 370–384. doi: 10.1016/j.neuron.2011.08.023
Rubin, J. E., and Terman, D. (2004). High frequency stimulation of the subthalamic nucleus eliminates pathological thalamic rhythmicity in a computational model. J. Comput. Neurosci. 16, 211–235. doi: 10.1023/B:JCNS.0000025686.47117.67
Santaniello, S., Fiengo, G., Glielmo, L., and Grill, W. M. (2011). Closed-loop control of deep brain stimulation: a simulation study. IEEE Trans. Neural Syst. Rehabil. Eng. 19, 15–24. doi: 10.1109/TNSRE.2010.2081377
Sinclair, N. C., Mcdermott, H. J., Bulluss, K. J., Fallon, J. B., Perera, T., Xu, S. S., et al. (2018). Subthalamic nucleus deep brain stimulation evokes resonant neural activity. Ann. Neurol. 83, 1027–1031. doi: 10.1002/ana.25234
Stanslaski, S., Afshar, P., Cong, P., Giftakis, J., Stypulkowski, P., Carlson, D., et al. (2012). Design and validation of a fully implantable, chronic, closed-loop neuromodulation device with concurrent sensing and stimulation. IEEE Trans. Neural Syst. Rehabil. Eng. 20, 410–421. doi: 10.1109/tnsre.2012.2183617
Steiner, L. A., Neumann, W.-J., Staub-Bartelt, F., Herz, D. M., Tan, H., Pogosyan, A., et al. (2017). Subthalamic beta dynamics mirror Parkinsonian bradykinesia months after neurostimulator implantation. Mov. Disord. 32, 1183–1190. doi: 10.1002/mds.27068
Su, F., Kumaravelu, K., Wang, J., and Grill, W. M. (2019). Model-based evaluation of closed-loop deep brain stimulation controller to adapt to dynamic changes in reference signal. Front. Neurosci. 13:956. doi: 10.3389/fnins.2019.00956
Swann, N. C., de Hemptinne, C., Thompson, M. C., Miocinovic, S., Miller, A. M., Gilron, R., et al. (2018). Adaptive deep brain stimulation for Parkinson’s disease using motor cortex sensing. J. Neural Eng. 15:046006. doi: 10.1088/1741-2552/aabc9b
Titcombe, M. S., Glass, L., Guehl, D., and Beuter, A. (2001). Dynamics of Parkinsonian tremor during deep brain stimulation. Chaos 11, 766–773. doi: 10.1063/1.1408257
Velisar, A., Syrkin-Nikolau, J., Blumenfeld, Z., Trager, M. H., Afzal, M. F., Prabhakar, V., et al. (2019). Dual threshold neural closed loop deep brain stimulation in Parkinson disease patients. Brain Stimul. 12, 868–876. doi: 10.1016/j.brs.2019.02.020
Vidailhet, M., Vercueil, L., Houeto, J.-L., Krystkowiak, P., Lagrange, C., Yelnik, J., et al. (2007). Bilateral, pallidal, deep-brain stimulation in primary generalised dystonia: a prospective 3 year follow-up study. Lancet Neurol. 6, 223–229. doi: 10.1016/S1474-4422(07)70035-2
Wu, H., Ghekiere, H., Beeckmans, D., Tambuyzer, T., van Kuyck, K., Aerts, J.-M., et al. (2015). Conceptualization and validation of an open-source closed-loop deep brain stimulation system in rat. Sci. Rep. 5:9921. doi: 10.1038/srep09921
Keywords: Parkinson’s disease, feedback signal, beta power, RBF neural network, supervisory control algorithm
Citation: Zhu Y, Wang J, Li H, Liu C and Grill WM (2021) Adaptive Parameter Modulation of Deep Brain Stimulation Based on Improved Supervisory Algorithm. Front. Neurosci. 15:750806. doi: 10.3389/fnins.2021.750806
Received: 31 July 2021; Accepted: 20 August 2021;
Published: 16 September 2021.
Edited by:
John D. Rolston, The University of Utah, United StatesReviewed by:
Ryoma Morigaki, Tokushima University, JapanFei Su, Shandong Agricultural University, China
Copyright © 2021 Zhu, Wang, Li, Liu and Grill. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Warren M. Grill, warren.grill@duke.edu; orcid.org/0000-0001-5240-6588