HYPOTHESIS AND THEORY article

Front. Neurosci., 21 October 2014
Sec. Decision Neuroscience
This article is part of the Research Topic Toward a Unified View of the Speed-Accuracy Trade-Off: Behaviour, Neurophysiology and Modelling

Neural dynamics implement a flexible decision bound with a fixed firing rate for choice: a model-based hypothesis

  • 1Department of Biomedical and Molecular Sciences, Queen's University, Kingston, ON, Canada
  • 2Department of Systems Science/National Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China

Decisions are faster and less accurate when conditions favor speed, and are slower and more accurate when they favor accuracy. This speed-accuracy trade-off (SAT) can be explained by the principles of bounded integration, where noisy evidence is integrated until it reaches a bound. Higher bounds reduce the impact of noise by increasing integration times, supporting higher accuracy (vice versa for speed). These computations are hypothesized to be implemented by feedback inhibition between neural populations selective for the decision alternatives, each of which corresponds to an attractor in the space of network states. Since decision-correlated neural activity typically reaches a fixed rate at the time of commitment to a choice, it has been hypothesized that the neural implementation of the bound is fixed, and that the SAT is supported by a common input to the populations integrating evidence. According to this hypothesis, a stronger common input reduces the difference between a baseline firing rate and a threshold rate for enacting a choice. In simulations of a two-choice decision task, we use a reduced version of a biophysically-based network model (Wong and Wang, 2006) to show that a common input can control the SAT, but that changes to the threshold-baseline difference are epiphenomenal. Rather, the SAT is controlled by changes to network dynamics. A stronger common input decreases the model's effective time constant of integration and changes the shape of the attractor landscape, so the initial state is in a more error-prone position. Thus, a stronger common input reduces decision time and lowers accuracy. The change in dynamics also renders firing rates higher under speed conditions at the time that an ideal observer can make a decision from network activity. The difference between this rate and the baseline rate is actually greater under speed conditions than accuracy conditions, suggesting that the bound is not implemented by firing rates per se.

1. Introduction

In decision making experiments, subjects make faster, less accurate decisions when conditions favor speed, and make slower, more accurate decisions when conditions favor accuracy (e.g., Bogacz et al., 2010a; Heitz and Schall, 2012). These data describe the speed-accuracy trade-off (SAT) and can be explained by the principles of bounded integration. According to these principles, noisy evidence for the alternatives of a decision is integrated until the running total for one of the alternatives reaches a criterion level. The running total is referred to as a decision variable and the criterion is referred to as the bound. A higher bound allows evidence to be integrated for longer, increasing the percentage of correct decisions. A lower bound has the opposite effect. These abstract models have been invaluable in characterizing the computations underlying decisions and the SAT (see Smith and Ratcliff, 2004; Ratcliff and McKoon, 2008; Bogacz et al., 2010b).
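As an illustration of the bounded integration principle described above, the following minimal sketch simulates a one-dimensional noisy accumulator with symmetric bounds. It is not the model used in this article: the function name, drift, noise level and bound values are hypothetical and chosen only to show that a higher bound lengthens decision times and raises accuracy.

```python
import numpy as np


def simulate_bounded_integration(drift, bound, n_trials=2000, dt=0.005,
                                 t_max=10.0, noise_sd=1.0, seed=0):
    """Integrate noisy evidence until the running total reaches +bound or -bound.

    Returns (accuracy, mean decision time in seconds) over trials that
    reached a bound before t_max. A positive drift means the upper bound
    is the correct choice. All parameter values are illustrative.
    """
    rng = np.random.default_rng(seed)
    n_steps = int(round(t_max / dt))
    # Evidence increments: deterministic drift plus Gaussian noise.
    steps = drift * dt + noise_sd * np.sqrt(dt) * rng.standard_normal((n_trials, n_steps))
    paths = np.cumsum(steps, axis=1)           # the decision variable on each trial
    crossed = np.abs(paths) >= bound           # True wherever the variable is beyond a bound
    rows = np.flatnonzero(crossed.any(axis=1)) # trials that reached a bound
    first = crossed[rows].argmax(axis=1)       # index of the first crossing
    accuracy = np.mean(paths[rows, first] > 0)
    mean_time = np.mean((first + 1) * dt)
    return accuracy, mean_time


if __name__ == "__main__":
    for bound in (0.5, 1.5):
        acc, t = simulate_bounded_integration(drift=0.5, bound=bound)
        print(f"bound={bound}: accuracy={acc:.2f}, mean decision time={t:.2f} s")
```

Lowering the bound in this sketch plays the role of a speed condition and raising it plays the role of an accuracy condition.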

The computations characterized by bounded integration models are hypothesized to be implemented by competitive interactions between neural populations selective for the alternatives of a decision (Usher and McClelland, 2001; Wang, 2002; Machens et al., 2005; Bogacz et al., 2006; Wong and Wang, 2006; Standage et al., 2011; You and Wang, 2013). According to this widely held hypothesis, temporal integration and competitive interactions are supported by recurrent excitation and feedback inhibition respectively, where each population implements a decision variable and a choice is made when the aggregate firing rate of one of the populations reaches a threshold. This hypothesis is supported by electrophysiological recordings from several cortical areas in non-human primates performing decision tasks, where the spike rates of neurons responsive to the chosen alternative (target-in neurons) increase over several hundreds of milliseconds prior to the animal's choice, and the spike rates of neurons unresponsive to the chosen alternative (target-out neurons) are much lower (e.g., Roitman and Shadlen, 2002; Thomas and Pare, 2007; Bollimunta and Ditterich, 2011; Ding and Gold, 2012).

Under several task paradigms, target-in activity of putative integrator neurons has been shown to reach an approximately fixed rate at the time of commitment to a choice (the choice threshold), regardless of the speed or accuracy of decisions (Hanes and Schall, 1996; Shadlen and Newsome, 2001; Roitman and Shadlen, 2002; Churchland et al., 2008; Purcell et al., 2010; Ding and Gold, 2012). These data have been interpreted as indicating that the neural implementation of the bound is fixed across conditions emphasizing speed over accuracy or vice versa (see Bogacz et al., 2010b). Under the assumption of linear integration, adjusting the starting point of a decision variable is equivalent to adjusting the bound, so it has been hypothesized that subjects trade speed and accuracy by adjusting the “baseline” rate of integrator populations, i.e., the activity on which a decision variable builds (see Bogacz et al., 2010b). According to this hypothesis, the SAT is controlled by a cognitive signal projecting uniformly to all integrator populations, where a stronger (weaker) signal favors speed (accuracy) by decreasing (increasing) the difference between the choice threshold and baseline activity (the threshold-baseline difference). We refer to this possibility as the threshold-baseline hypothesis (a.k.a. the changing-baseline hypothesis, Bogacz et al., 2010b). Several recent neuroimaging (Forstmann et al., 2008; Ivanoff et al., 2008; van Veen et al., 2008; Wenzlaff et al., 2011) and electrophysiological (Heitz and Schall, 2012; Hanks et al., 2014) studies have provided evidence for such a signal, reporting higher baseline (pre-stimulus) activity in decision-correlated cortical areas under speed conditions than accuracy and/or neutral conditions.

Here, we present an alternative hypothesis that does not assume linear integration. As above, we assume that a cognitive signal controls the SAT by projecting uniformly to integrator populations, but the underlying mechanism is grounded in the framework of attractor dynamics (e.g., Machens et al., 2005; Bogacz et al., 2006; Wong and Wang, 2006; Standage et al., 2011; You and Wang, 2013; see Wang, 2008, 2012 for review). According to this framework, integration times are determined by the non-linear dynamics of decision circuitry, where stronger and weaker dynamics furnish shorter and longer integration times respectively (Wong and Wang, 2006; Standage et al., 2011). The SAT can therefore be accomplished by any mechanism that modulates the strength of dynamics within and between neural populations selective for the decision alternatives (see Standage et al., 2014). Spatially non-selective excitation provides just such a mechanism (Salinas and Abbott, 1996), where a stronger (weaker) signal corresponds to speed (accuracy) conditions (Furman and Wang, 2008; Roxin and Ledberg, 2008). Of course, this input also entails higher (lower) baseline activity under speed (accuracy) conditions. In attractor network models, higher (lower) baseline activity will indeed decrease (increase) the threshold-baseline difference, but this decrease (increase) is epiphenomenal. The SAT is supported by the resulting changes to network dynamics.

Below, we use a neurally-derived model (Wong and Wang, 2006) to demonstrate that adjusting the strength of spatially non-selective excitation can control the SAT (Furman and Wang, 2008; Roxin and Ledberg, 2008). We demonstrate that this signal raises (lowers) the baseline activity of integrator populations, consistent with higher (lower) baseline activity under speed (accuracy, neutral) conditions in SAT experiments (Forstmann et al., 2008; Ivanoff et al., 2008; van Veen et al., 2008; Wenzlaff et al., 2011; Heitz and Schall, 2012; Hanks et al., 2014). We use a fixed choice threshold in the model, so the spatially non-selective signal decreases (increases) the threshold-baseline difference under speed (accuracy) conditions, relative to a neutral condition. We demonstrate that the threshold-baseline difference cannot account for the SAT in the model, since raising (lowering) the threshold to compensate for the higher (lower) baseline activity under the speed (accuracy) condition does not “untrade” speed and accuracy, i.e., reinstating the threshold-baseline difference of the neutral condition does not recover the neutral behavior of the model. Using dynamic systems analysis, we show that a higher (lower) baseline decreases (increases) the effective time constant of integration of the network under speed (accuracy) conditions, accounting for the SAT in a manner consistent with a flexible bound, while also changing the shape of the decision space so as to further decrease (increase) accuracy. Finally, we show that decision-selective firing rates in the model are actually higher (lower) under speed (accuracy) conditions at the time at which an ideal observer can discriminate between the rates of the integrator populations; as is the difference between these rates and the baseline rate (the discrimination-baseline difference). 
Thus, the discrimination-baseline difference increases under speed conditions and decreases under accuracy conditions, opposite to the principles of the threshold-baseline hypothesis. Our analysis explains these observations.

Our simulations show that under the framework of attractor dynamics, there is no discrepancy between a flexible bound and a fixed choice threshold. The bound—or the difference between the bound and the starting point of a decision variable—is a computational device for controlling the duration of evidence accumulation in abstract models. It can be implemented by the effective time constant of integration of decision circuitry, with corresponding changes to the decision space. This space and its time evolution are emergent properties of network dynamics and are qualitatively different than the synaptic current required to elicit choice behavior.

2. A Common Input to Integrators Controls the SAT in an Attractor Model, but not by the Threshold-Baseline Difference

In their seminal study, Wong and Wang (2006) used analytic methods to reduce a biophysically-based cortical network model (Wang, 2002) to a 2-variable system, tractable for analysis (depicted in Figure 1A). They showed that each of the populations selective for the decision alternatives corresponds to a stable state in the space of possible states of network activity, i.e., each population corresponds to an attractor (Figures 1B,C). The attractors are separated by an unstable “saddle” steady state with two manifolds: a stable manifold that draws the network toward the saddle point, and an unstable manifold that repels it toward one of the stable attractors (Figure 1C). They further calculated the time constants of these two manifolds, showing that the dynamics in the vicinity of the saddle support integration times much longer than the time constants of decay of contributing biophysical processes, such as those of neurons and synapses.

Figure 1. (A) The reduced model by Wong and Wang (2006), approximating a biophysically-based cortical network model (left of the thick arrow) with a 2-variable system (right). The thick arrow depicts the derivation of the latter from the former. The large oval on the left depicts a network of cortical pyramidal neurons. Inside the oval, the three open circles depict the target and distractor populations with selective input T and D respectively, and a population unresponsive to the evidence for either alternative. Looping arcs depict recurrent synapses, which are stronger within each selective population (thicker arcs). All pyramidal neurons excite a common inhibitory pool, which uniformly inhibits all pyramidal neurons. Excitatory and inhibitory synapses are depicted by arrows and closed circles respectively, small black dots depict individual neurons, and BG refers to background input. (B) Cartoon depiction of an attractor “energy landscape” for 2-choice decisions, where the energy decreases over time. An unstable steady state (high energy) separates two stable attractors (low energy), corresponding to the target and distractor stimuli. Conceptually, a ball placed between the two attractors will eventually roll one way or the other, depicted by the dashed arrows. The ball enters an attractor basin sooner (later) under speed (accuracy) conditions because the dynamics evolve more quickly (slowly). Below the cartoon, the firing rates of target (blue) and distractor (red) neural populations are plotted over time during two decision trials, corresponding to the ball rolling into the target attractor basin (left) and the distractor attractor basin (right). (C) Decision space for two choices. Stable (solid) and unstable (dashed) manifolds of the saddle point (intersection of the manifolds, see text). The system moves toward this state along the stable manifold and is repelled along the unstable manifold. 
The firing rates of the winning populations in the two decision trials in (B) are plotted against each other, superimposed on the decision space, along with two noise-free trajectories (gray) with initial conditions inside each attractor basin. On each trial, the network state moves along the stable manifold before being repelled toward an attractor.

We used Wong and Wang's (2006) model in simulations of a 2-choice random dot motion (RDM) task (Supplementary Material Section 1). We ran 1000 trials for each motion coherence c ∈ {0, 1, 2, 4, 8, 16, 32}%, where the motion stimulus was provided for 5s following a 2.5s pre-stimulus interval. We refer to the integrator population receiving the stronger (weaker) stimulus as the target (distractor) population. We modeled speed and accuracy conditions by increasing and decreasing a uniform input to the two populations respectively, relative to a neutral condition. To this end, we adjusted the mean background current I0, capturing the total input current from upstream neurons other than those encoding motion stimuli. This current therefore subsumes the hypothesized cognitive signal controlling the SAT. Because the model's parameter values and corresponding dynamics are rigorously described by Wong and Wang (2006), we used the same parameter values here (excepting I0 and its corresponding standard deviation, see Supplementary Material Section 1).
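The role of the common input I0 can be sketched with a noise-free version of the reduced two-variable system. This is a rough illustration under stated assumptions, not the simulation code used for the results reported here: the parameter values below are recalled from Wong and Wang (2006) and should be checked against that paper, the noise term is omitted entirely, and the function names, pre-stimulus duration and integration step are hypothetical.

```python
import numpy as np

# Assumed parameter values, recalled from Wong and Wang (2006); verify before use.
A, B, D = 270.0, 108.0, 0.154      # f-I curve parameters (Hz/nA, Hz, s)
GAMMA, TAU_S = 0.641, 0.1          # gating rate factor, gating time constant (s)
J_SELF, J_CROSS = 0.2609, 0.0497   # effective self-excitation, cross-inhibition (nA)
J_EXT, MU0 = 5.2e-4, 30.0          # stimulus gain (nA/Hz), stimulus strength (Hz)


def firing_rate(x):
    """Population firing rate (Hz) for total synaptic current x (nA)."""
    y = A * x - B
    return y / (1.0 - np.exp(-D * y))  # undefined at exactly y == 0


def run_trial(i0, coherence, t_pre=1.0, t_stim=2.0, dt=0.0005):
    """Noise-free Euler integration of the two gating variables.

    i0 is the common background current in nA (e.g. 0.321 for 321 pA) and
    coherence is a fraction (0.04 for 4%). Returns (times, rates), where
    rates has shape (n_steps, 2) with columns (target, distractor) and
    time 0 marks stimulus onset.
    """
    n_pre, n_stim = int(round(t_pre / dt)), int(round(t_stim / dt))
    s = np.array([0.1, 0.1])  # hypothetical initial gating values
    stim = J_EXT * MU0 * np.array([1.0 + coherence, 1.0 - coherence])
    rates = np.empty((n_pre + n_stim, 2))
    for k in range(n_pre + n_stim):
        drive = stim if k >= n_pre else 0.0
        # Each population excites itself and inhibits the other.
        x = J_SELF * s - J_CROSS * s[::-1] + i0 + drive
        rates[k] = firing_rate(x)
        s = s + dt * (-s / TAU_S + (1.0 - s) * GAMMA * rates[k])
    times = (np.arange(n_pre + n_stim) - n_pre) * dt
    return times, rates


if __name__ == "__main__":
    for label, i0 in (("accuracy", 0.316), ("neutral", 0.321), ("speed", 0.325)):
        times, rates = run_trial(i0, coherence=0.32)
        print(label, "baseline rate (Hz):", rates[times < 0][-1, 0])
```

Because noise is left out, this sketch cannot produce errors or trial-to-trial variability; it only shows how a change in I0 shifts the pre-stimulus rate and the subsequent divergence of the two populations.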

Unsurprisingly, the spatially non-selective current I0 produced higher and lower pre-stimulus (baseline) firing rates under speed and accuracy conditions respectively, compared to the neutral condition. Baseline rates can be seen to the left of the vertical line in Figure 2A for an example coherence value (c = 4%, see Figure caption). The resulting SAT can be seen in Figures 2B,C, where the psychometric curve is shifted to the right and left under speed and accuracy conditions respectively; and for correct and error trials, mean decision times are shorter and longer respectively. Thus, Figure 2 shows that by raising and lowering baseline activity, uniform input to both integrator populations controls the SAT. At first glance, these results appear to support the threshold-baseline hypothesis.

Figure 2. Trading speed and accuracy as a function of spatially non-selective input I0. Simulated neural activity (A) and resulting psychometric (B) and chronometric (C) curves for neutral (I0 = 321pA, medium gray), speed (I0 = 325pA, black) and accuracy (I0 = 316pA, light gray) conditions. (A) Trial-averaged firing rates for coherence c = 4%. For each condition, the upper and lower curves show the mean rate over all correct trials for the target and distractor populations respectively. The vertical line at 0ms indicates the time of simulated motion onset. To the left of this line, pre-stimulus/baseline firing rates are higher (lower) under speed (accuracy) conditions compared to the neutral condition. Thus, the threshold-baseline difference is smaller (larger) under speed (accuracy) conditions. The solid horizontal line shows the “default” choice threshold θ = 15Hz used by Wong and Wang (2006). The dashed horizontal lines depict other possible thresholds. (B) The percentage of correct trials as a function of coherence. The data are fitted with a Weibull function for each condition. Error bars show standard error. The solid vertical line indicates coherence c = 4%, corresponding to the firing rates in (A). The dotted lines indicate the coherence value at 75% accuracy (see Figure 3A). (C) Mean decision times over coherence for correct (solid) and error (dashed) trials for each condition. Error bars show standard error. The vertical line indicates coherence c = 4%, corresponding to the firing rates in (A).
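The Weibull fit and the 75% accuracy coherence mentioned in the caption above can be sketched as follows. The exact parameterization used for the fits is not stated here, so the common two-alternative form below, with chance performance at 50%, is an assumption, as are the function and parameter names.

```python
import numpy as np


def weibull_accuracy(coherence, alpha, beta):
    """Assumed Weibull psychometric function for a 2-choice task.

    Returns the proportion correct, rising from 0.5 (chance) toward 1.
    alpha sets the coherence scale and beta the slope.
    """
    coherence = np.asarray(coherence, dtype=float)
    return 1.0 - 0.5 * np.exp(-(coherence / alpha) ** beta)


def coherence_at_75(alpha, beta):
    """Coherence at which the assumed Weibull function equals 0.75.

    Solving 1 - 0.5 * exp(-(c / alpha) ** beta) = 0.75 gives
    c = alpha * ln(2) ** (1 / beta).
    """
    return alpha * np.log(2.0) ** (1.0 / beta)


if __name__ == "__main__":
    alpha, beta = 5.0, 1.2  # hypothetical fitted values
    c75 = coherence_at_75(alpha, beta)
    print("coherence at 75% correct:", c75)
    print("accuracy at that coherence:", weibull_accuracy(c75, alpha, beta))
```

Under this parameterization, a rightward shift of the psychometric curve corresponds to a larger alpha and therefore a larger coherence at 75% accuracy.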

However, the threshold-baseline hypothesis dictates that the speed and accuracy of decisions are determined by the threshold-baseline difference. According to this hypothesis, a fixed threshold-baseline difference will produce uniform decision making performance, regardless of the rate of baseline activity. The threshold-baseline hypothesis therefore requires that any changes to the speed or accuracy of decisions resulting from a change in baseline activity (with a fixed threshold) can be “reversed” by an equal change to the threshold. We therefore increased the threshold under the speed condition by the difference between baseline activity under speed and neutral conditions (Δns, the mean difference over the last 1000 ms of the pre-stimulus interval), and we decreased the threshold under the accuracy condition by the difference between baseline activity under neutral and accuracy conditions (Δna). These adjustments to the threshold did not recover the psychometric and chronometric curves produced under the neutral condition, i.e., the black and light gray curves in Figures 2B,C do not overlay the medium gray curves. Denoting the threshold used by Wong and Wang (2006) as θ (vertical line in Figure 3), increasing (decreasing) θ by Δns (Δna) under the speed (accuracy) condition has almost no effect on performance. The same is true for any value of the choice threshold above θ. For thresholds below θ, the effect of these adjustments increases with decreasing threshold, but the psychometric (Figure 3A) and chronometric (Figures 3B,C) curves under speed and accuracy conditions do not come close to overlaying the neutral curves. 
For the lowest value of the threshold, there is a moderate effect on the psychometric curves (the difference between the solid and dotted curves for speed and accuracy conditions), but such a low threshold does not allow a firing-rate excursion, so this moderate effect can only be achieved if the model deviates from the neural data on which the threshold-baseline hypothesis is founded, i.e., a fixed rate of target-in activity that is much higher than target-out activity at the time of commitment to a choice (e.g., Shadlen and Newsome, 2001; Roitman and Shadlen, 2002; Thomas and Pare, 2007; Purcell et al., 2010; Bollimunta and Ditterich, 2011; Ding and Gold, 2012). See the Discussion for other issues with such a low threshold. The psychometric and chronometric curves break down for thresholds lower than those in the figure. Note that Figures 3B,C show results for coherence values of c = 1% and c = 32% respectively. Values in between these extremes yield the same qualitative result. These results demonstrate that the threshold-baseline hypothesis does not account for the SAT under the principles of the attractor framework.
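The threshold adjustment described above, which reinstates the neutral threshold-baseline difference, can be sketched in a few lines. The function name, the representation of baseline activity as sampled firing-rate traces, and the sampling step are assumptions made for illustration; only the 1000 ms averaging window and θ = 15 Hz come from the text.

```python
import numpy as np


def adjusted_threshold(theta, baseline_condition, baseline_neutral, dt, window=1.0):
    """Shift the choice threshold by the mean baseline difference.

    baseline_condition and baseline_neutral are pre-stimulus firing-rate
    traces (Hz) sampled every dt seconds. The difference is averaged over
    the last `window` seconds. For a speed condition the shift is positive
    (theta + delta_ns); for an accuracy condition it is negative
    (theta - delta_na).
    """
    n = int(round(window / dt))
    condition = np.asarray(baseline_condition, dtype=float)[-n:]
    neutral = np.asarray(baseline_neutral, dtype=float)[-n:]
    return theta + np.mean(condition - neutral)


if __name__ == "__main__":
    dt = 0.001
    # Hypothetical flat baseline traces over a 2.5 s pre-stimulus interval.
    neutral = np.full(2500, 2.0)
    speed = np.full(2500, 2.5)
    accuracy = np.full(2500, 1.5)
    print("speed threshold:", adjusted_threshold(15.0, speed, neutral, dt))
    print("accuracy threshold:", adjusted_threshold(15.0, accuracy, neutral, dt))
```

With these adjusted thresholds, the threshold-baseline difference is the same in all three conditions, which is the situation in which the threshold-baseline hypothesis predicts identical performance.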

Figure 3. (A) The value of simulated motion coherence at which accuracy equals 75% for a range of choice thresholds under speed (black), neutral (medium gray) and accuracy (light gray) conditions (see dashed lines in Figure 2B). Dotted curves show results for simulations in which the threshold was raised (speed condition) and lowered (accuracy condition) by Δns and Δna respectively (see text). Solid curves show results for the corresponding unadjusted threshold. Adjusting the choice threshold makes little difference to accuracy, i.e., the solid and dotted curves are similar for speed and accuracy conditions, and do not overlay the neutral curve. (B,C) Mean decision times over all trials for coherence c = 1% (B) and c = 32% (C). Conventions are the same as in (A). Adjusting the threshold by Δns and Δna makes little difference to decision times, regardless of the threshold chosen. The solid vertical line in each panel indicates the threshold used by Wong and Wang (2006).

3. The SAT is Controlled by Network Dynamics

Returning to Figure 2A, the mean firing rates following motion onset (to the right of the vertical line) point to the mechanism by which the spatially non-selective input I0 controls the SAT in the model. The rate of increase of target activity is higher and lower under speed and accuracy conditions respectively, relative to the neutral condition. The different rates of increase reflect the dynamics furnished by the different values of I0 under speed, accuracy and neutral conditions. As shown by Wong and Wang (2006), the dynamics in the vicinity of the saddle point determine the length of time the network can integrate evidence, which can be approximated by the time constant of the unstable manifold (the effective time constant of integration, Supplementary Material Section 2). Wong and Wang (2006) calculated this time constant for several values of the strength of recurrent excitation, showing the consequent changes to the speed and accuracy of decisions (see their Figure 11). Figure 4A shows these calculations for our changes to I0. Under speed and accuracy conditions, higher and lower values of I0 furnish shorter and longer time constants respectively, relative to the neutral condition. Here, it is worth noting that the effective time constant behaves in exactly the same way as the bound of bounded integration models, decreasing (increasing) integration time under speed (accuracy) conditions (Figure 4A). Additionally, the shape of the attractor landscape changes with I0. Figures 4B–D show that for a given task difficulty (c = 4% in the figure), higher values of I0 push the stable manifold toward the midline at low rates below the saddle point. Since the network approaches the saddle from below (Figure 1C) and since errors occur when noise pushes the state of the network over the stable manifold (Wong and Wang, 2006), this re-positioning of the stable manifold further lowers (raises) accuracy under speed (accuracy) conditions. 
This mechanism is evident in Figures 4B–D, in which the solid circle in each panel shows the mean initial state of the network (immediately prior to the onset of evidence). With increasing I0, the stable manifold moves toward this initial state, which becomes increasingly precarious. Thus, a common input to integrators controls the rate of baseline activity, but the SAT does not result from the consequent changes to the threshold-baseline difference. The SAT results from the changes to network dynamics.
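The notion of an effective time constant of integration, taken as the time constant of the unstable manifold of the saddle, can be illustrated with a linearized toy system. The Jacobian below is hypothetical and is not derived from the reduced model; it only shows how the unstable eigenvalue of a saddle yields a time constant, and how stronger mutual inhibition shortens it.

```python
import numpy as np


def unstable_time_constant(jacobian):
    """Time constant (1 / eigenvalue) of the unstable direction of a saddle.

    Expects the Jacobian of a 2-variable system evaluated at a saddle
    point, i.e. exactly one eigenvalue with positive real part.
    """
    eigenvalues = np.linalg.eigvals(np.asarray(jacobian, dtype=float))
    positive = eigenvalues.real[eigenvalues.real > 0]
    if positive.size != 1:
        raise ValueError("expected exactly one unstable eigenvalue (a saddle)")
    return 1.0 / positive[0]


def toy_jacobian(leak, inhibition):
    """Hypothetical linearization of two mutually inhibiting populations.

    Eigenvalues are -leak - inhibition (common mode, stable) and
    -leak + inhibition (difference mode, unstable when inhibition > leak).
    """
    return np.array([[-leak, -inhibition],
                     [-inhibition, -leak]])


if __name__ == "__main__":
    for inhibition in (12.0, 15.0):
        tau = unstable_time_constant(toy_jacobian(leak=10.0, inhibition=inhibition))
        print(f"inhibition={inhibition}: unstable time constant = {tau:.2f} s")
```

In this toy system the stable direction is the common mode of the two populations and the unstable direction is their difference, loosely mirroring the stable and unstable manifolds of the saddle in Figure 1C.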

Figure 4. (A) The time constant of the unstable manifold of the saddle point (see Figure 1) for speed (black), neutral (medium gray) and accuracy (light gray) conditions, as a function of coherence. The time constant determines the time over which the system is repelled from the saddle toward an attractor corresponding to the target or the distractor (Figure 1C; T and D in B–D). (B–D) Stable (solid) and unstable (dashed) manifolds of the saddle for the accuracy (B, I0 = 316pA), neutral (C, I0 = 321pA) and speed (D, I0 = 325pA) conditions for coherence c = 4%. At low rates below the saddle, the stable manifold is pushed closer to the midline with increasing I0, while rates at the saddle increase. Solid circles show the initial state of the network. Insets show close-ups of the stable manifold and the midline at frequencies ≤5 Hz.

Increasing I0 not only re-positions the stable manifold, but also re-positions the saddle point, so that both populations fire at higher rates (Figures 4B–D). This change in position of the saddle dictates that firing rates will be higher when the network begins its descent into an attractor basin under speed conditions. In other words, firing rates will be higher when decision-selective rates separate from those of the competing population. To confirm this effect, we used signal detection theory to determine when an ideal observer can discriminate target activity from distractor activity in the model under speed, accuracy and neutral conditions (Supplementary Material Section 3). Signal detection theory is commonly used to estimate the time of target selection from neural data (Thompson et al., 1996; Cohen et al., 2009) and assumes that a downstream circuit makes decisions by discriminating the activity of neural populations selective for the alternatives (see Standage and Pare, 2011). Firing rates at the time of discrimination were higher under speed conditions and lower under accuracy conditions (Figure 5).
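The ideal-observer analysis can be sketched with a receiver operating characteristic (ROC) computation across trials. The procedure actually used is given in Supplementary Material Section 3 and is not reproduced here, so the details below are assumptions: the function names, the trials-by-time array layout, and the 0.75 criterion on the ROC area.

```python
import numpy as np


def roc_area(target, distractor):
    """Area under the ROC curve for two samples of firing rates.

    Equals the probability that a randomly drawn target rate exceeds a
    randomly drawn distractor rate, counting ties as one half.
    """
    target = np.asarray(target, dtype=float)
    distractor = np.asarray(distractor, dtype=float)
    greater = np.sum(target[:, None] > distractor[None, :])
    ties = np.sum(target[:, None] == distractor[None, :])
    return (greater + 0.5 * ties) / (target.size * distractor.size)


def discrimination_time(target_rates, distractor_rates, times, criterion=0.75):
    """First time at which the ROC area reaches the criterion.

    target_rates and distractor_rates have shape (n_trials, n_times).
    Returns None if the criterion is never reached. The criterion value
    is an assumed convention, not taken from the article.
    """
    target_rates = np.asarray(target_rates, dtype=float)
    distractor_rates = np.asarray(distractor_rates, dtype=float)
    for k, t in enumerate(times):
        if roc_area(target_rates[:, k], distractor_rates[:, k]) >= criterion:
            return t
    return None


if __name__ == "__main__":
    # Hypothetical rates (Hz) for two trials at three time points (ms).
    target = [[1, 2, 5], [1, 3, 6]]
    distractor = [[1, 2, 1], [1, 3, 2]]
    print(discrimination_time(target, distractor, times=[0, 10, 20]))
```

Given a discrimination time, the target rate at that time minus the baseline rate gives a quantity analogous to the discrimination-baseline difference discussed in this section.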

Figure 5. The mean firing rate of the target population at the time at which an ideal observer can discriminate target activity from distractor activity, calculated across all positive coherence values. The rate is higher (lower) under speed (accuracy) conditions.

Next, we subtracted the baseline rate under speed, accuracy and neutral conditions from the corresponding rate at discrimination time (the discrimination-baseline difference). The discrimination-baseline difference was larger under speed conditions and smaller under accuracy conditions. Because decisions are over when the firing rates separate, the rate at this time approximates a “decision threshold,” as opposed to the choice threshold (see the Discussion). To summarize: the difference between this decision threshold and baseline activity is larger under speed conditions and smaller under accuracy conditions in the model. Thus, stronger (weaker) non-selective input under speed (accuracy) conditions modulates decision-selective firing rates in a manner opposite to the principles of the threshold-baseline hypothesis. We confirmed these findings with an alternative method, in which decision times (and correctness) were determined by the last intersection of target and distractor activity on each trial, i.e., decisions were made when target and distractor activity separated for the final time. The mean rate at the time of separation was higher (lower) under speed (accuracy) conditions, as was the difference between this rate and the baseline rate (not shown). Importantly, our analysis in this section makes two predictions for electrophysiological studies of the SAT: (1) target-in and target-out data will separate at higher (lower) rates under speed (accuracy) conditions, and (2) the discrimination-baseline difference will be larger (smaller) under speed (accuracy) conditions.

4. Discussion and Conclusions

We have demonstrated that spatially non-selective excitation can control the SAT in an attractor model (Figures 2B,C), as shown previously (Furman and Wang, 2008; Roxin and Ledberg, 2008). The non-selective input increases and decreases baseline activity under speed and accuracy conditions respectively (Figure 2A), which unavoidably decreases and increases the difference between baseline activity and a fixed choice threshold. The threshold-baseline difference, however, does not control the SAT in the model (Figure 3). Rather, an increase (decrease) in non-selective input increases (decreases) the strength of network dynamics, which decreases (increases) the effective time constant of integration (Figure 4A) and renders the initial state of the network closer to (farther from) the stable manifold of the saddle, the crossing of which results in errors (Figures 4B–D).

Our findings are consistent with the hypothesis that a cognitive signal controls the SAT by adjusting a uniform input to integrator populations (see Bogacz et al., 2010b; Standage et al., 2014). This hypothesis is supported by neuroimaging (Forstmann et al., 2008; Ivanoff et al., 2008; van Veen et al., 2008; Wenzlaff et al., 2011) and electrophysiological (Heitz and Schall, 2012; Hanks et al., 2014) data from SAT tasks, where pre-stimulus activation has been shown to be higher (lower) under speed (accuracy, neutral) conditions. Like the threshold-baseline hypothesis, our results are consistent with these data. Our results conflict with the threshold-baseline hypothesis because the changes in network dynamics engendered by a uniform input dwarf the corresponding changes to the threshold-baseline difference. A related reason is that the choice threshold is qualitatively different than the bound of bounded integration models. The rate of target-in activity at the time of commitment to a choice has been shown to be considerably higher than the rate at which this activity separates from target-out activity (see e.g., Shadlen and Newsome, 2001; Roitman and Shadlen, 2002; Bollimunta and Ditterich, 2011; Ding and Gold, 2012). Under the framework of attractor dynamics, this excursion of target-in activity corresponds to the repulsion of a decision network from the saddle along its unstable manifold. Thus, these neural data suggest that the choice threshold is much higher than the saddle. As such, changes to the choice threshold will not influence decision accuracy over a broad range, unlike the bound of bounded integration models. This effect is clear in Figure 2A. As noted in Section 3, the rate at which target and distractor activity separates can be thought of as a “decision threshold,” but our simulations predict that this rate is not fixed across speed and accuracy conditions. 
Indeed, we predict that it changes (Figure 5) in a manner opposite to a flexible bound (e.g., Ratcliff and McKoon, 2008; Bogacz et al., 2010a). Our findings therefore suggest that the bound is not implemented in terms of firing rates per se. In this regard, the astute reader may have noticed our use of the term “choice threshold” when referring to decision-selective firing rates at the time of commitment to a choice, as opposed to the more conventional “decision threshold.” We believe the latter term is misleading in this context.

There are potential advantages to choice thresholds being higher than decision thresholds. For example, a high choice threshold alleviates the need for fine tuning (Roxin and Ledberg, 2008). Furthermore, the difference between the choice threshold and a decision threshold provides a buffer between decisions and their enactment. This buffer may confer advantages to decision makers. For instance, a high choice threshold gives an upstream decision variable the opportunity to suppress its competitors, that is, the choice is not made until the “winning” integrator population is firing at a high rate and the losing populations are firing at much lower rates. Thresholds are hypothesized to be implemented by networks with very strong dynamics (Simen, 2012), which are poorly suited to decision making (Standage and Pare, 2011), i.e., they implement an all-or-none response to a critical level of input. If the respective rates of the choice threshold and the decision threshold were similar (a small buffer), then the difference between the decision variables would be smaller when the largest one reaches the choice threshold, increasing the possibility that the thresholding circuit would inadvertently choose the wrong decision variable. Simultaneous electrophysiological recordings from decision circuitry and thresholding circuitry would be informative in this regard. It seems unlikely that target-in activity in one structure would coincide with target-out activity in the other, even infrequently. Another possibility is that thresholding circuitry implements an ideal observer of integrator circuitry, where back-projections from the former to the latter account for the excursion of decision-selective activity prior to choice selection (see Simen, 2012). Under this scenario, bidirectionally-coupled decision circuits would collectively implement both integration and choice, a compelling possibility that warrants further investigation.

Another perspective on the difficulties of equating the difference between the bound and the starting point of a decision variable with the threshold-baseline difference relates to levels of abstraction in models of brain function (Marr, 1982; Trappenberg, 2010). From this perspective, bounded integration models can be considered algorithms that characterize the computations underlying decisions. They have been (and continue to be) invaluable for our understanding of decision processing and the SAT, but it is not necessary to attribute direct biological correlates to each of their parameters. Qualitatively, the effective time constant of integration under speed and accuracy conditions changes in the same manner as the bound (Figure 4A) and therefore provides a plausible neural implementation of this abstract term, but the corresponding changes to the attractor basins show that this interpretation may be overly simplistic (Figures 4B–D). Note that we do not suggest the twain shall never meet. Far from it: formal equivalence has been shown between different classes of (linear) bounded integration models and the (non-linear) biophysically-based model on which our simulations are based (Bogacz et al., 2006). The constraints under which these models are equivalent define the relationship between decision models at these two levels of abstraction, allowing the systematic consideration of one class in terms of the other. Where earlier work has largely considered the commonalities between classes of model, e.g., the range of parameters under which non-linear, feedback-inhibition models are well-approximated by linear integration models (Usher and McClelland, 2001; Bogacz et al., 2006), we have focused on their differences. In this sense, we have shown what is lost in translation in relation to the SAT, suggesting that caution is warranted when interpreting neural data in terms of models that are purposefully simplified.
Note that earlier discussions of the threshold-baseline hypothesis have made it clear that changes to the bound and the starting point of a decision variable are not equivalent in all abstract models (Bogacz et al., 2010b). For more extensive treatment of the constraints of the threshold-baseline hypothesis in relation to implementation-level models, see Marshall et al. (2012).
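For reference, the algorithmic-level account of the SAT discussed above can be made concrete with the simplest bounded integration model. The sketch below assumes a pure drift-diffusion process with an unbiased starting point and symmetric bounds, and uses the standard closed-form expressions for its accuracy and mean decision time. The function names and parameter values are hypothetical, and nothing here describes the attractor network simulated in this article.

```python
import math

def ddm_accuracy(drift, bound, noise_sd=1.0):
    """Probability of hitting the correct bound (+bound) before the wrong one.

    Assumes a pure drift-diffusion process starting midway between bounds at
    +bound and -bound, with positive drift toward the correct bound.
    """
    return 1.0 / (1.0 + math.exp(-2.0 * drift * bound / noise_sd ** 2))

def ddm_mean_decision_time(drift, bound, noise_sd=1.0):
    """Mean time to reach either bound under the same assumptions (drift > 0)."""
    return (bound / drift) * math.tanh(drift * bound / noise_sd ** 2)

if __name__ == "__main__":
    drift = 1.0  # hypothetical evidence strength
    for label, bound in (("speed", 0.5), ("accuracy", 1.5)):  # hypothetical bounds
        acc = ddm_accuracy(drift, bound)
        mean_dt = ddm_mean_decision_time(drift, bound)
        print(f"{label}: bound={bound}, accuracy={acc:.3f}, mean DT={mean_dt:.3f}")
```

Raising the bound increases both quantities, which is the trade-off in its abstract form. The single `bound` parameter here has no firing-rate interpretation by itself, which is the point made above about levels of abstraction.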

It is possible that a different kind of threshold-baseline difference could account for the SAT. If the baseline rate of thresholding circuitry were increased (decreased) under speed (accuracy) conditions, then lower rates of integrator activity would be sufficient to elicit choice behavior, i.e., to drive the relevant motor circuitry (see Standage et al., 2014 for review). As such, a cognitive signal controlling the SAT could bypass integrator populations. However, the rates of integrator populations at the time of commitment to a choice would be lower under speed conditions and higher under accuracy conditions, which conflicts with recent electrophysiological recordings from putative integrator neurons showing the opposite profile of activity (Heitz and Schall, 2012). Notably, these data also show higher (lower) baseline rates and a higher (lower) rate of increase under speed (accuracy) conditions, suggesting that speed and accuracy conditions do modulate integrator neurons. These findings are qualitatively reproduced by our simulations (Figure 2A).
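The arithmetic behind this alternative scheme can be written out in a few lines. This is a minimal sketch, assuming an idealized all-or-none thresholding unit that fires once the sum of its baseline drive and weighted integrator input reaches a critical level. The function name, the linear summation, and every number are hypothetical and are not taken from any model or data set cited here.

```python
def integrator_rate_needed(critical_drive, baseline_drive, weight=1.0):
    """Integrator rate at which an idealized thresholding unit would fire.

    The unit is assumed to fire when
        baseline_drive + weight * integrator_rate >= critical_drive.
    All quantities are hypothetical and in arbitrary rate units.
    """
    return max(0.0, (critical_drive - baseline_drive) / weight)

CRITICAL_DRIVE = 60.0  # hypothetical critical input to the thresholding unit
BASELINES = {"accuracy": 5.0, "speed": 20.0}  # hypothetical baseline drives

if __name__ == "__main__":
    for condition, baseline in BASELINES.items():
        rate = integrator_rate_needed(CRITICAL_DRIVE, baseline)
        print(f"{condition}: integrator rate needed = {rate}")
```

Under these assumptions a higher baseline in the thresholding unit lowers the integrator rate required for a choice, so integrator rates at commitment would be lower under speed conditions. That is the prediction that conflicts with the recordings of Heitz and Schall (2012).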

Finally, we do not suggest that single-circuit attractor models provide a complete picture of decision making. For example, these models produce slower mean decision times on error trials than correct trials because the network state has to cross the unstable manifold (Wong and Wang, 2006; Standage et al., 2011), but error trials are faster than correct trials under some task paradigms (see Smith and Ratcliff, 2004). Such shortcomings point to the need for coupled-circuit models (e.g., Lo and Wang, 2006; Standage et al., 2013). The recent surge in neuroimaging studies of decision making and the SAT represents an important direction in this regard, identifying contributing brain regions and pointing to their respective roles in decision processing (Forstmann et al., 2008; Ivanoff et al., 2008; van Veen et al., 2008; Forstmann et al., 2010; van Maanen et al., 2011; Wenzlaff et al., 2011; Green et al., 2012; Ho et al., 2012). Guided by these data, models of distributed decision circuitry are an exciting direction in decision neuroscience (Frank, 2006; Lo and Wang, 2006; Bogacz and Gurney, 2007). Simulations of the bidirectional coupling between circuits supporting evidence integration and choice may be highly informative about the relationship between decision bounds and choice thresholds.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Da-Hui Wang was supported by NSFC under Grant No. 31271169. Dominic Standage and Gunnar Blohm were supported by CFI (Canada), ORF (Canada) and NSERC (Canada).

Supplementary Material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fnins.2014.00318/abstract

References

Bogacz, R., Brown, E., Moehlis, J., Holmes, P., and Cohen, J. D. (2006). The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol. Rev. 113, 700–765. doi: 10.1037/0033-295X.113.4.700

Bogacz, R., and Gurney, K. (2007). The basal ganglia and cortex implement optimal decision making between alternative actions. Neural Comput. 19, 442–477. doi: 10.1162/neco.2007.19.2.442

Bogacz, R., Hu, P. T., Holmes, P. J., and Cohen, J. D. (2010a). Do humans produce the speed-accuracy tradeoff that maximizes reward rate? Q. J. Exp. Psychol. 63, 863–891. doi: 10.1080/17470210903091643

Bogacz, R., Wagenmakers, E.-J., Forstmann, B. U., and Nieuwenhuis, S. (2010b). The neural basis of the speed-accuracy tradeoff. Trends Neurosci. 33, 10–16. doi: 10.1016/j.tins.2009.09.002

Bollimunta, A., and Ditterich, J. (2011). Local computation of decision-relevant net sensory evidence in parietal cortex. Cereb. Cortex 22, 903–917. doi: 10.1093/cercor/bhr165

Churchland, A. K., Kiani, R., and Shadlen, M. N. (2008). Decision-making with multiple alternatives. Nat. Neurosci. 11, 693–702. doi: 10.1038/nn.2123

Cohen, J. Y., Crowder, E. A., Heitz, R. P., Subraveti, C. R., Thompson, K. G., Woodman, G. F., et al. (2009). Cooperation and competition among frontal eye field neurons during visual target selection. J. Neurosci. 30, 3227–3238. doi: 10.1523/JNEUROSCI.4600-09.2010

Ding, L., and Gold, J. I. (2012). Neural correlates of perceptual decision making before, during, and after decision commitment in monkey frontal eye field. Cereb. Cortex 22, 1052–1067. doi: 10.1093/cercor/bhr178

Forstmann, B. U., Anwander, A., Schafer, A., Neumann, J., Brown, S., Wagenmakers, E.-J., et al. (2010). Cortico-striatal connections predict control over speed and accuracy in perceptual decision making. Proc. Natl. Acad. Sci. U.S.A. 107, 15916–15920. doi: 10.1073/pnas.1004932107

Forstmann, B. U., Dutilh, G., Brown, S., Neumann, J., von Cramon, D. Y., Ridderinkhof, K. R., et al. (2008). Striatum and pre-SMA facilitate decision-making under time pressure. Proc. Natl. Acad. Sci. U.S.A. 105, 17538–17542. doi: 10.1073/pnas.0805903105

Frank, M. J. (2006). Hold your horses: a dynamic computational role for the subthalamic nucleus in decision making. Neural Netw. 19, 1120–1136. doi: 10.1016/j.neunet.2006.03.006

Furman, M., and Wang, X.-J. (2008). Similarity effect and optimal control of multiple-choice decision making. Neuron 60, 1153–1168. doi: 10.1016/j.neuron.2008.12.003

Green, N., Biele, G. P., and Heekeren, H. R. (2012). Changes in neural connectivity underlie decision threshold modulation for reward maximization. J. Neurosci. 32, 14942–14950. doi: 10.1523/JNEUROSCI.0573-12.2012

Hanes, D. P., and Schall, J. D. (1996). Neural control of voluntary movement initiation. Science 274, 427–430. doi: 10.1126/science.274.5286.427

Hanks, T., Kiani, R., and Shadlen, M. N. (2014). A neural mechanism of speed-accuracy tradeoff in macaque area LIP. eLife 3:e02260. doi: 10.7554/eLife.02260

Heitz, R. P., and Schall, J. D. (2012). Neural mechanisms of speed-accuracy tradeoff. Neuron 76, 616–628. doi: 10.1016/j.neuron.2012.08.030

Ho, T., Brown, S., van Maanen, L., Forstmann, B. U., Wagenmakers, E.-J., and Serences, J. T. (2012). The optimality of sensory processing during the speed-accuracy tradeoff. J. Neurosci. 32, 7992–8003. doi: 10.1523/JNEUROSCI.0340-12.2012

Ivanoff, J., Branning, P., and Marois, R. (2008). fMRI evidence for a dual process account of the speed-accuracy tradeoff in decision-making. PLoS ONE 3, 1–14. doi: 10.1371/journal.pone.0002635

Lo, C.-C., and Wang, X.-J. (2006). Cortico-basal ganglia circuit mechanism for a decision threshold in reaction time tasks. Nat. Neurosci. 9, 956–963. doi: 10.1038/nn1722

Machens, C. K., Romo, R., and Brody, C. D. (2005). Flexible control of mutual inhibition: a neural model of two-interval discrimination. Science 307, 1121–1124. doi: 10.1126/science.1104171

Marr, D. (1982). Vision. New York, NY: W. H. Freeman and Company.

Marshall, J. A. R., Bogacz, R., and Gilchrist, I. D. (2012). Consistent implementation of decisions in the brain. PLoS ONE 7:e43443. doi: 10.1371/journal.pone.0043443

Purcell, B. A., Heitz, R. P., Cohen, J. Y., Schall, J. D., Logan, G. D., and Palmeri, T. J. (2010). Neurally constrained modeling of perceptual decision making. Psychol. Rev. 117, 1113–1143. doi: 10.1037/a0020311

Ratcliff, R., and McKoon, G. (2008). The diffusion decision model: theory and data for two-choice decision tasks. Neural Comput. 20, 873–922. doi: 10.1162/neco.2008.12-06-420

Roitman, J. D., and Shadlen, M. N. (2002). Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J. Neurosci. 22, 9475–9489.

Roxin, A., and Ledberg, A. (2008). Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation. PLoS Comput. Biol. 4:e1000046. doi: 10.1371/journal.pcbi.1000046

Salinas, E., and Abbott, L. F. (1996). A model of multiplicative neural responses in parietal cortex. Proc. Natl. Acad. Sci. U.S.A. 93, 11956–11961. doi: 10.1073/pnas.93.21.11956

Shadlen, M. N., and Newsome, W. T. (2001). Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J. Neurophysiol. 86, 1916–1936.

Simen, P. (2012). Evidence accumulator or decision threshold - which cortical mechanism are we observing? Front. Psychol. 3:183. doi: 10.3389/fpsyg.2012.00183

Smith, P. L., and Ratcliff, R. (2004). Psychology and neurobiology of simple decisions. Trends Neurosci. 27, 161–168. doi: 10.1016/j.tins.2004.01.006

Standage, D., Blohm, G., and Dorris, M. C. (2014). On the neural implementation of the speed-accuracy trade-off. Front. Neurosci. 8:236. doi: 10.3389/fnins.2014.00236

Standage, D., and Pare, M. (2011). Persistent storage capability impairs decision making in a biophysical network model. Neural Netw. 24, 1062–1073. doi: 10.1016/j.neunet.2011.05.004

Standage, D., You, H., Wang, D.-H., and Dorris, M. C. (2011). Gain modulation by an urgency signal controls the speed-accuracy trade-off in a network model of a cortical decision circuit. Front. Comput. Neurosci. 5:7. doi: 10.3389/fncom.2011.00007

Standage, D., You, H., Wang, D.-H., and Dorris, M. C. (2013). Trading speed and accuracy by coding time: a coupled-circuit cortical model. PLoS Comput. Biol. 9:e1003021. doi: 10.1371/journal.pcbi.1003021

Thomas, N. W. D., and Pare, M. (2007). Temporal processing of saccade targets in parietal cortex area LIP during visual search. J. Neurophysiol. 97, 942–947. doi: 10.1152/jn.00413.2006

Thompson, K., Hanes, D., Bichot, N., and Schall, J. (1996). Perceptual and motor processing stages identified in the activity of macaque frontal eye field. J. Neurophysiol. 76, 440–455.

Trappenberg, T. P. (2010). Fundamentals of Computational Neuroscience. New York, NY: Oxford University Press.

Usher, M., and McClelland, J. L. (2001). On the time course of perceptual choice: the leaky competing accumulator model. Psychol. Rev. 108, 550–592. doi: 10.1037/0033-295X.108.3.550

van Maanen, L., Brown, S. D., Eichele, T., Wagenmakers, E.-J., Ho, T., Serences, J., et al. (2011). Neural correlates of trial-to-trial fluctuations in response caution. J. Neurosci. 31, 17488–17495. doi: 10.1523/JNEUROSCI.2924-11.2011

van Veen, V., Krug, M. K., and Carter, C. S. (2008). The neural and computational basis of controlled speed-accuracy tradeoff during task performance. J. Cogn. Neurosci. 20, 1952–1965. doi: 10.1162/jocn.2008.20146

Wang, X.-J. (2002). Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36, 955–968. doi: 10.1016/S0896-6273(02)01092-9

Wang, X.-J. (2008). Decision making in recurrent neuronal circuits. Neuron 60, 215–234. doi: 10.1016/j.neuron.2008.09.034

Wang, X.-J. (2012). Neural dynamics and circuit mechanisms for decision-making. Curr. Opin. Neurobiol. 22, 1039–1046. doi: 10.1016/j.conb.2012.08.006

Wenzlaff, H., Bauer, M., Maess, B., and Heekeren, H. R. (2011). Neural characterization of the speed-accuracy tradeoff in a perceptual decision-making task. J. Neurosci. 31, 1256–1266. doi: 10.1523/JNEUROSCI.4000-10.2011

Wong, K.-F., and Wang, X.-J. (2006). A recurrent network mechanism of time integration in perceptual decisions. J. Neurosci. 26, 1314–1328. doi: 10.1523/JNEUROSCI.3733-05.2006

You, H., and Wang, D.-H. (2013). Dynamics of multiple-choice decision making. Neural Comput. 25, 2108–2145. doi: 10.1162/NECO_a_00473

Keywords: speed-accuracy trade-off, neural dynamics, bounded integration, decision threshold, threshold-baseline difference

Citation: Standage D, Wang D-H and Blohm G (2014) Neural dynamics implement a flexible decision bound with a fixed firing rate for choice: a model-based hypothesis. Front. Neurosci. 8:318. doi: 10.3389/fnins.2014.00318

Received: 28 February 2014; Accepted: 19 September 2014;
Published online: 21 October 2014.

Edited by:

Patrick Simen, Oberlin College, USA

Reviewed by:

Samuel Franklin Feng, Princeton University, USA
Nicholas Cain, Allen Institute for Brain Science, USA

Copyright © 2014 Standage, Wang and Blohm. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dominic Standage, Department of Biomedical and Molecular Sciences, Queen's University, Botterell Hall, Room 230, Kingston, ON K7L 3N6, Canada e-mail: standage@queensu.ca
