Skip to main content

ORIGINAL RESEARCH article

Front. Psychiatry, 03 September 2018
Sec. Mood Disorders
This article is part of the Research Topic Cognition in Mood Disorders View all 17 articles

Haste or Speed? Alterations in the Impact of Incentive Cues on Task Performance in Remitted and Depressed Patients With Bipolar Disorder

\r\nHenry W. Chase*Henry W. Chase1*Jay C. FournierJay C. Fournier1Haris AslamHaris Aslam1Richelle StifflerRichelle Stiffler1Jorge R. AlmeidaJorge R. Almeida2Barbara J. SahakianBarbara J. Sahakian3Mary L. PhillipsMary L. Phillips1
  • 1Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
  • 2Department of Psychiatry, Dell Medical School, University of Texas at Austin, Austin, TX, United States
  • 3Department of Psychiatry, University of Cambridge, Cambridge, United Kingdom

A variety of evidence suggests that bipolar disorder is associated with disruptions of reward related processes, although the properties, and scope of these changes are not well understood. In the present study, we aimed to address this question by examining performance of patients with bipolar disorder (30 depressed bipolar; 35 euthymic bipolar) on a motivated choice reaction time task. We compared performance with a group of healthy control individuals (n = 44) and a group of patients with unipolar depression (n = 41), who were matched on several demographic variables. The task consists of an “odd-one-out” discrimination, in the presence of a cue signaling the probability of reward on a given trial (10, 50, or 90%) given a sufficiently fast response. All groups showed similar reaction time (RT) performance, and similar shortening of RT following the presentation of a reward predictive cue. However, compared to healthy individuals, the euthymic bipolar group showed a relative increase in commission errors during the high reward compared to low condition. Further correlational analysis revealed that in the healthy control and unipolar depression groups, participants tended either to shorten RTs for the high rather than low reward cue a relatively large amount with an increase in error rate, or to shorten RTs to a lesser extent but without increasing errors to the same degree. By contrast, reward-related speeding and reward-related increase in errors were less well coupled in the bipolar groups, significantly so in the BPD group. These findings suggest that although RT performance on the present task is relatively well matched, there may be a specific failure of individuals with bipolar disorder to calibrate RT speed and accuracy in a strategic way in the presence of reward-related stimuli.

Introduction

Bipolar disorder (BD) is characterized by modified behavioral and neural responsiveness to reward (13). Although individuals with BD report a heightened desire to attain happiness (4), patients' lives are typically characterized by instability and low levels of (eudaimonic) well-being (5). One potential explanation for this discrepancy is that there is an imbalance in the neural systems controlling the pursuit of reward. Responsiveness to reward in BD may better reflect the arousing or activating properties of reinforcement-related stimuli rather than enhanced hedonic responses (6), akin to a non-specific invigoration of behavior elicited by reward-related stimuli (7). In general, invigoration is likely to be adaptive in a reward-rich environment, but there may also be deleterious consequences [e.g., (8)] which may be relevant for BD.

Alterations in the performance of BD on reinforcement learning paradigms have been reported (9, 10), as well as altered decision making (11), but these studies vary in their emphasis on risk taking, blunted reward sensitivity and cognitive flexibility. In the present study, we employed the Cued Reinforcement Reaction Time (CRRT) task, a motivated reaction time task which has been employed in studies of 5-HT (serotonin) manipulations (12) and patients (13, 14). The reinforcement contingencies on the task should prompt the participant to shorten their reaction times on high reinforcement trials compared to the low trials, so primarily the task is used as an index of the motivational impact of cues on a simple cognitive task. In a previous investigation with the CRRT (12), manipulation of central nervous system 5-HT using acute tryptophan depletion (ATD) reduced reinforcement related speeding in a group of healthy individuals. In addition, the pattern of errors changed, with a relative reduction of errors in the high reinforcement condition.

With regard to BD, we considered two hypotheses. First, there may be generally inflated urgency across all rewarding contexts, or a preference for speed over accuracy. Second, there may be an impairment in the contextual calibration of urgency. It is often assumed that increasing impulsivity is broadly maladaptive [e.g., (15)]. However, there is also a concept of “functional” impulsivity (16), in which rapid if inaccurate decisions are necessary in order to obtain reward and are thereby adaptive. Thus, impaired calibration could lead to hasty and inaccurate decision making in reward-sparse environments when accuracy is needed, but also excessively conservative decision making in reward-rich environments. We performed a detailed analysis of task performance to investigate these possibilities.

We recruited four groups of participants (bipolar euthymic, bipolar depressed, major depressive disorder (MDD), and healthy controls), and hypothesized that the normal pattern of reward-related speeding would be altered in individuals with bipolar disorder. Note that these predictions are focused on the notion that BD patients (euthymic or depressed) would show altered impulsivity that would be detected on the CRRT paradigm. This design allows the impact of mood state to be largely differentiated from a BD-related trait abnormality. In a previous study with unipolar depressed patients (14), we observed an overall intact pattern of responding on the paradigm (i.e., similar reward-related speeding to controls), and even enhanced performance on some metrics. We also sought to determine whether a similar finding could be identified in the present data set. Our analytical strategy incorporated conventional reaction time (RT) and error difference scores. To provide further support to our conclusions, we supplemented this analysis with a simple reinforcement learning-based approach, instantiated within a general linear model (GLM).

Methods

Participants

Thirty currently depressed adults with bipolar disorder (BPD) type I, 35 currently Euthymic individuals with bipolar disorder (BPE) type I, and 41 currently depressed adults with MDD participated in the study. All BPE/BPD/MDD participants were diagnosed according to DSM-IV criteria using the Structured Clinical Interview for DSM-IV-Research-Version [SCID-P: (17)]. All BPD and MDD participants were in a Major Depressive Episode, as determined by SCID-P criteria, at the time of a functional magnetic resonance imaging (fMRI) scan which occurred ~1 week prior to the behavioral testing session at which the present data was collected. Data obtained from this scanning session have been previously reported (6, 1820). Current mood state was confirmed by having a Hamilton Rating Scale for Depression (HRSD-25) ≥17 (21) and a Young Mania Rating Scale [YMRS: (22)] score ≤10 on the day of the test (unless it occurred < 4 days after the scanning session), although three BPD and five MDD individuals meeting SCID-P criteria for depressive episode had a HRSD-25 score between 11 and 17 on the test day, while three BPE had scores of 17 or over. All participants also completed the Spielberger State Anxiety Inventory on the test day (23). Prevalence of lifetime comorbid anxiety and substance use disorders are reported in Table 1. Importantly, all BPE/BPD/MDD participants were free from alcohol/substance abuse or dependence for a minimum of 3 months prior to the study (range: 4–235 months). Forty-four healthy adult control participants (HC) with no previous personal or family history of psychiatric illness in first-degree relatives participated in the study. All HC participants were also free of previous or current alcohol/illicit substance abuse. All participants were right-handed and native English speaking. The study protocol was approved by the University of Pittsburgh Institutional Review Board. After complete description of the study to the participants, written informed consent was obtained.

TABLE 1
www.frontiersin.org

Table 1. Demographic and clinical information for participants in the four groups.

Further exclusion criteria for all participants included history of head injury, systemic medical illness, cognitive impairment, premorbid IQ estimate < 85 (as derived from the National Adult Reading Test: NART), schizophrenia/schizo-affective disorder and rapid cycling disorder. All included participants showed <21 errors per condition/block, and < 8 time-outs. The final participant numbers per group listed above and in Table 1 do not include a further three individuals with BPE and two with MDD who were excluded due to poor task performance (>9 time outs). A high number of timeouts might have been reflective of a failure to understand the instructions fully, and would have resulted in a reduction in the number of trials in which performance feedback was provided. Timeouts also reduce the amount of data available for modeling within the GLM. Two further participants (one BPE and one HC) were excluded on the basis of a relatively poor GLM fit (log likelihood <1100/Z score of the residual variance relative to overall mean Z>7; all included participants log likelihood = 1150–1350), which was generally reflective of abnormal performance (outliers) on one or more metric.

Procedure

The paradigm was the same as that employed by Cools et al. (12). Participants performed two short practice blocks (20 trials) of a task, based on the circles task of Duncan et al. (24), in which they had to select the odd-one-out of three stimuli, within 2,000 ms. The mean and standard deviation of the participant's reaction time on the second practice block were recorded, and used as the “reward threshold” for the next stage of the task. Participants then performed two longer blocks (96 trials each) of a similar task, which contained a reinforcement component via the potential to win points. Points rewards were available on some trials, and were dependent on subjects' accuracy and speed, as well as the presence of rectangular cues which surrounded the area where the circles were presented, and were presented before the circles were presented. One cue predicted reward availability on 90% of trials (high probability cue), the second on 50% (medium probability cue), and the third on 10% of trials (low probability cue). Colors used for these cues were always red, blue and yellow, and subjects were randomly assigned to one of two mappings between cue and reward probability. Of the 96 trials in each block, 32 trials were performed with each cue type. There were 12 different arrays of three circles, and these were counterbalanced within cues such that there were no cue repetitions, and only two response repetitions occurred. If rewards were available on a given trial and the subject responded both correctly and faster than their reward threshold, the subject would receive 100 points, a green smiling face and a flourish sound. If the same were true, but the subject had responded slower than their reward threshold, they would receive 1 point, a green smiling face and a high frequency tone. Finally, if the subject was incorrect on trials where reward was available, they would receive negative feedback (a red frowning face and a low frequency tone, but no loss of points). If reward was not available on a given trial, no feedback was provided.

Data Analysis: Basic Analyses

First, raw RT data were transformed using a reciprocal transform [see (25)]. For basic analyses, mean reciprocal RTs from correct trials were calculated for each cue (high/medium/low) and block (first/second) separately. Repeated-measures ANOVA was then conducted for the resulting variables, including effects of block and cue as a within-participant factors, and the effect of group as a between-participant factor. Following previous studies (26), we also computed a measure of RT speeding, contrasting the high vs. low reward probability cue following Z transformation of the raw RT values (with reference to the mean and standard deviation for all cues within a block). We focused on block 2 only, as the reward contingencies were more likely to have been learned by that stage. All parametric analyses were confirmed by including years of education, age and gender as covariates. In all cases, findings were similar and these analyses are not reported. Following the suggestion of a reviewer, we also performed an additional analysis involving number of manic episodes in the bipolar groups alone (BPE/BPD).

Due to their non-normal distribution, error scores (per block/cue) were analyzed using non-parametric tests (Kruskal–Wallis test). Similar difference scores reflecting the effect of high vs. low probability cues for block 2 were calculated to compare with RT speeding metrics described above. Speed/accuracy relationships were investigated in two ways: first, with a simple Spearman's Rho correlation; second, using ordinal logistic regression. For the latter analysis, six error categories (−3 to +2 errors for high vs. low cue trials, block 2) containing roughly similar numbers of individuals were created to reduce model complexity.

Data Analysis: General Linear Model

As a follow up analysis, reciprocal RTs from the whole task (both blocks) were fit within a single general linear model [GLM: (25, 27)] separately for each participant. The goal of this modeling was broadly to demonstrate that the data are consistent with a Reinforcement Learning (RL) model, and that similar findings could be obtained using different modeling approaches. Our initial analyses suggested that the data did not provide strong constraints over multiple possible free parameters—specifically, the design did not allow unique identification of these parameters. We therefore set them either on the basis of prior work or to reflect a nominal magnitude ranking. These arbitrary decisions were justified insofar as they were unlikely to have a substantial impact on the overall pattern of data—at least with respect to group differences. To correct for autocorrelated properties of the timeseries, an AR(3) ARIMA model [see (25)] was fit using MATLAB (regARIMA function), and fixed effects at the subject level were thereby obtained. The model contained the following components as independent (predictor) variables: the first 5 trials per block [see (25)]; error trials; post error trials; a linear trend to correct for non-specific improvement on the task (trial number); and reward expectancy generated from a reinforcement learning model. The latter component was most relevant to our focus, and was derived from the following equation:

Q(cue,t)Q(cue,t)+alpha*(outcome(t)-Q(cuet))

The learning rate (alpha) was set to 0.2 for win or no-win outcomes [see (28)], but for punishment trials (signaled errors) it was set to 0.3 to reflect an increased salience of this condition. For 100 point wins, the outcome value was set to 3, and for 1 point wins it was set to 1. Trials on which responses were too slow were excluded from the analysis. We focused on the beta parameters associated with the Q-value regressor for further analysis, examining whether there were group differences using a combination of one-way ANOVA and a Bayesian test of the null hypothesis (29), and also recapitulated the ordinal logistic regression analysis described above using the Q-value beta value instead of the RT speeding measure.

Effect of Medication

We computed: (1) medication load, an index that reflects the number and dose of different medications, as in our previous neuroimaging studies on bipolar disorder [e.g., (30) and 2] identified medication status (taking vs. not taking each of five main psychotropic medication subclasses: mood stabilizers/antipsychotics/antidepressant/anxiolytics/dopaminergic-antidepressants, e.g., Bupropion: see Table 1). Medication load calculation was based on a binary categorization of low and high dose groupings of antidepressants and mood stabilizers; a binary categorization of antipsychotics relative to mean effective daily dose of chlorpromazine hydrochloride; and a binary categorization of benzodiazepine dose relative to the recommended daily dose of each type of benzodiazepines. If a participant was not taking a given class of the medication, they would receive a score of 0, a low dose would receive a score of 1 and a high dose would receive a score of 2. Scores for each class of medication were added together to produce the final medication load score. Details of the patients' medications are presented in Supplementary Table 1. We tested whether the speed/accuracy relationships with the bipolar groups described within the ordinal logistic regression model were significant when medication (both main effect and medication by RT speeding interaction) was concurrently modeled.

Results

Demographic Variables

The four groups were well matched for age, gender, and NART-estimated IQ (see Table 1). The groups differed on years of education, due to the BPD group showing a lower number of years of education.

Reaction Times and Overall Performance

Analysis of variance (ANOVA) conducted on the reciprocal RTs revealed a significant main effect of block [F(1, 146) = 48.38, p < 0.001], a significant main effect of cue [F(2.00, 291.22) = 10.94, p < 0.001] and a significant block by cue interaction [F(1.99, 290.98) = 15.73, p < 0.001]. The main effect of group was not significant [F(3, 146) <1], neither were any significant interaction effects involving group (p's > 0.26). The robust effect of cue and cue by block interaction supported the hypothesis that RTs are sensitive to task contingencies. The presence of learning was further supported by significant high vs. low reward (z-transformed) RT differences in all four groups on the second block (t's > 2.71, p's < 0.011: see Figure 1). Raw RT and error scores are presented in Table 2.

FIGURE 1
www.frontiersin.org

Figure 1. Reward-related speeding (high vs. low reward cue) on the second block in all four groups. Higher scores on the Y axis indicate faster responding for high vs. low reward cues. Units are 1/ms; error bars reflect standard error of the mean; asterisks reflect the mean value being significantly greater than zero (p's < 0.05).

TABLE 2
www.frontiersin.org

Table 2. Raw RT (ms units) and error scores (mean/standard deviation) for each group, for each block (1 and 2), and cue (high, mid, and low reward probability).

In terms of the number of points won overall, a significant main effect of block was observed [F(1, 146) = 42.31, p < 0.001] which reflected a general increase in points scored in block 2, but no main effect of group [F(3, 146) = 1.34, p = 0.26]. A marginal block by group interaction was also observed [F(3, 146) = 2.68, p = 0.049]. Games-Howell non-parametric post hoc tests on the changing score from block 1 to block 2 revealed that healthy controls showed a relatively smaller increase in the number of points across blocks, and this was significantly lower than the BPD group (p = 0.014). No other significant findings were observed.

Errors

Overall, all three groups showed similar rates of overall commission errors (χ2 < 1.2, p's > 0.7 across both blocks and overall). Similar findings were seen with overall omission (“too late”) errors (p's > 0.45). Following the RT analysis by focusing on relative errors for the high vs. low reward cue in the second block, Kruskal–Wallis tests revealed a main effect of group (χ2 = 9.26, df = 3, p = 0.026). The only post-hoc test reaching significance when corrected for the number of possible group (n = 6) comparisons was the difference between control and BPE individuals (Z = 2.76, p = 0.006), with BPE individuals making relatively more errors in the high reward than the low reward condition than the controls in the second block (see Figure 2). The BPE group also tended to make relatively more errors on high vs. low reward trials in the second block than the BPD group (p = 0.027) and the MDD group (p = 0.090). Looking at differential error rates within the groups, the BPE group was the only group to show a significant difference between the block 2 stimuli in terms of error rates, showing significantly higher error rates to the high compared to low reward stimuli (Z = 2.49, p = 0.013). The three other groups showed similar error rates on block 2 across stimuli. Finally, a difference between the groups on high vs. low error rate was not seen on the first block (χ2 = 3.77, df = 3, p = 0.29). We also investigated the impact of the number of prior manic episodes on the reward-related error rate within the BPE/BPD groups. A significant relationship was observed (ρ = −0.28, p = 0.027, n = 65), with relatively greater errors on the high reward trials being observed in patients who had experienced greater numbers of manic episodes.

FIGURE 2
www.frontiersin.org

Figure 2. Reward-related error rate (high vs. low reward cue) on the second block in all four groups. Higher scores on the Y axis indicate more accurate responding for high vs. low reward cues. Error bars reflect standard error of the mean; asterisks reflect the mean value being significantly greater than zero (BPE), or the BPE vs. control difference being significant (p's < 0.05).

Relationship of Speed and Accuracy

As described above, the strongest metric of reward-related speeding was high vs. low reward RT difference on the second block. If participants are differentially balancing speed and accuracy, we might expect that, across individuals, shorter RTs in the high reward condition would come at the cost of a relatively increased error rate. Across all individuals, this hypothesis was supported (ρ = 0.31, p < 0.001), and strongly in the HC (ρ = 0.43, p = 0.003) and MDD (ρ = 0.50, p = 0.001) individually. In the BPE group, the relationship was significant if numerically smaller (ρ = 0.37, p = 0.031), and was not present in the BPD group (ρ = −0.27, p = 0.15: see Figure 3).

FIGURE 3
www.frontiersin.org

Figure 3. Trade-off between increases in speed and accuracy for the high vs. low cue. (A) Between subject relationship between reward-related speed and accuracy improvements across all four groups. Asterisks reflect correlations being significantly different from zero (p's < 0.05). (B) Plot of the individual subject data for the healthy control group alone, revealing the trade-off between speed and accuracy improvements. Note that a large proportion of individuals are in the upper right-hand quadrant, suggesting overall reward-related improvements.

To model this finding more formally, we constructed an ordinal logistic regression model in which we aimed to predict the (high-low, block 2) errors from the amount of speeding differentially across the groups. The model confirmed a large main effect of high vs. low RT speeding (Wald = 13.00, p < 0.001). In addition, BPE showed main effect (Wald = 5.50, p = 0.019) (i.e., overall difference in high vs. low errors), but the BPE*RT speeding interaction term was not significant (p = 0.35). By contrast, the BPD group showed no significant main effect (p = 0.29) but significant interaction term (Wald = 9.089, p = 0.003).

General Linear Model

The Q value of the cue, as derived from the RL model, was related to RT as evidenced by a beta statistic that was significantly different from zero across all individuals [t(149) = 6.32, p < 0.001]: trials where the cue value was higher had shorter RTs. Value-related RT speeding was correlated with block 2 high vs. low RT difference across all participants (r = −0.49, p < 0.001). No main effect of group was seen on this variable [F(3, 146) = 1.68, p = 0.17]. Moreover, a Bayesian test of null hypothesis of currently depressed individuals (i.e., all MDD/BPD participants) vs. currently non-depressed individuals (all HC/BPE participants) revealed strong evidence for the null hypothesis of no difference between the groups (Bayes Factor = 12.91). Finally, of the three significant findings in the ordinal logistic regression model described in section Relationship of Speed and Accuracy, the main effect of value-related speeding (Wald = 4.91, p = 0.027), main effect of BPE (Wald = 3.41, p = 0.065) and BDE*value-related speeding (Wald = 5.88, p = 0.015), all at least trended in the same direction as previously, while no further significant findings were observed.

Medication Effects

Including the presence of antipsychotic medications or mood stabilizers in the ordinal logistic regression model described in section General Linear Model reduced the effect of the BPE main effect (p = 0.079 or p = 0.2 respectively), while mood stabilizers themselves had a trend level effect (p = 0.063). A clearer influence was seen with overall medication load, which itself had a significant main effect (Wald = 4.23, p = 0.040) and load by speeding interaction (Wald = 4.58, p = 0.032). In this model, the BPE main effect was also reduced to marginal (p = 0.058). In all of these models, the significance of the BPD by speeding interaction was unaffected. Importantly, when the effect of antipsychotic medication, mood stabilizers, or medication load were examined in the BPE group alone, no significant differences in relative error rate were observed (all p's>0.47). In other words, differential medication status within BPE participants was associated with similar patterns of reward-related error rates.

Discussion

In the present study, we evaluated four groups of participants - euthymic and depressed patients with bipolar disorder, unipolar depressed individuals and healthy controls—on a motivated reaction time paradigm. Across all individuals, significant, and similar RT speeding was observed across all four groups. An analysis using RT differences was confirmed with simple reinforcement learning (RL) model instantiated within a GLM. Second, different pattern of cue-related errors was observed in the BPE group: BPE individuals showed higher rates of errors in the high vs. low reward condition on block 2, whereas no other group showed this effect. Complementing this finding, greater numbers of manic episodes in the bipolar groups also predicted higher error rates in the high vs. low reward condition, suggesting that there may be dimensions of illness severity or chronicity which may generalize across the bipolar groups.

Accounting for differences in RT clarified the relative error rate finding further, as there were substantial individual differences in RT/errors. Controls and MDD groups showed strong negative relationships between high vs. low error difference scores, and high vs. low RT difference scores. This suggests that individuals adopt different strategies: on one end of the spectrum, they might reduce RTs dramatically for high reward cues at the cost of increased errors; at the other end, they might reduce RTs slightly and show relatively low error rates following the high reward cue. By contrast, both of the BD groups showed a complication of this predicted relationship: the BPE group showed an overall increase in errors in the high vs. low reward cue, while the BPD group showed a decoupling of high vs. low reward error rate and high vs. low RT.

Comparison With Previous Findings

Similar findings to the present were obtained by Mueller et al. (31): youth with pediatric bipolar disorder showed a higher error rate under the incentive condition of an anti-saccade task than healthy controls, while similar performance between the groups was seen on the no incentive condition, and on a pro-saccade task. This type of finding is compatible with our observations, insofar as the task incentives were associated with impaired performance in the bipolar group.

Existing data on the speed/accuracy trade off (SAT) in major depression are somewhat complex. Although MDD patients might be expected to favor accuracy over speed, Dillon et al. (32) demonstrated, using drift diffusion modeling of a flanker task, that a slower executive control process is offset by a slower prepotent bias. As a result, the SAT seen in MDD patients was roughly similar to that seen in controls. We were unable to replicate the finding of enhanced performance in MDD patients that we had previously reported (14). The present study cannot be considered an exact replication attempt, because the age of the participants was different between the samples (participants in the present sample were around ~10 years younger), and different medications had been prescribed. Age may be a relevant dimension for further investigation because reanalysis of our previous findings suggested that age was positively related to overall errors and reaction time variability in the healthy control group (r's = 0.44–0.50) but not in the patient group (r's = −0.08: unpublished data). As the age range of the present sample is only partially overlapping with the previous sample, it is difficult to perform a direct replication of this finding here. Overall, a unifying conclusion of both datasets, which would be directly testable, is that MDD and/or antidepressant medications may prevent decline in overall RT performance in individuals >45 years old.

Nevertheless, the present findings strengthen an important conclusion of the previous study: namely, that reward-predictive cues can exert similar speeding effects on motivated behavior in depressed and healthy individuals. The present study was adequately powered to detect effects of similar magnitude to those reported in previous studies of psychopathology with this task [d = 0.77–1.29: (13, 14)]. These findings contribute to a growing literature describing areas of intact reward processing in MDD (33), and provide contrast with paradigms which appear to be more reliably sensitive (34, 35). Although details of the experimental design within the reinforcement learning framework may be relevant to understanding these discrepancies, it may also be that theoretical accounts of motivation that extend beyond a focus on the arrangement of stimuli, responses requirements and reinforcement contingencies may be insightful in determining why some paradigms are sensitive to differences related to depression and some are not (36).

Neurocomputational Basis

If the spirit of the impulsivity construct is one of sub-optimal decision making, then considering impulsivity in terms of the SAT might emphasize a failure to optimize RT in order to maximize utility, rather than simple premature or hasty responding. But whether or not healthy individuals show an optimal trade-off of speed and accuracy is also debated (37), and can be considered in terms of the drift diffusion model. This provides a framework to allow a derivation of optimal performance on decision paradigms across different levels of sensorimotor signal to noise ratio [SNR—(37)]. In addition, Manohar et al. (38) demonstrated that reward can provide a general improvement in performance rather than biasing toward speed or accuracy. In our data, there was a strong negative correlation between the degree of RT speeding and enhanced error rates. For a given SNR, this might be explained by proposing that individuals differ considerably regarding the relative utility of reward and punishments available on the task. For the HC and MDD groups, the pattern of findings follows what might be expected if participants showed some variation in how the response threshold parameter is set (i.e., more or less liberally) in order to promote reward rate or reduce punishment rate. It would also be necessary to propose an increase in SNR in the high reward condition [see (38)], as performance was generally better (faster and as accurate) here than in the low reward conditions. By contrast, the bipolar participants did not conform to this pattern. Both bipolar groups shortened their RTs overall in the high reward condition, suggesting a loosening of the decision threshold, but individual differences in this shortening were not always accompanied by predictable changes in error rate. It may be that some bipolar participants showed a decrease in SNR during the high reward condition. In combination with looser decision thresholds, a decrease in SNR might lead to increased error rates and shorter reaction times—the pattern seen in the BPE group. A more subtle effect of reducing SNR in the high condition might simply be to reduce the coupling between error rates and RT, which was the finding seen in the BPD group. In other words, the same type of explanation (i.e., a relative decrease in SNR in high reward conditions) might account for both the observed main effect (BPE) or the group by speeding interaction (BPD). High reward cues in the bipolar groups may affect behavior by a number of different mechanisms, including reduced selective attention, or the engagement of competing but irrelevant responses. Either possibility may appear to decrease SNR in the high reward condition.

Clearly then, our data do not support the idea that there is a simple change in response threshold in mood disorders, but previous studies which manipulated serotoninergic neurotransmission using acute tryptophan depletion (ATD) can be largely explained in this way (12, 26). Specifically, ATD was associated with a reduction in reward-related speeding, particularly in highly impulsive individuals (12) and individuals carrying the ss allele of the 5-HT transporter gene (26). In both of these studies, errors broadly tended to follow what would be expected by RT speeding: for example, ATD was associated with overall reduced error rates in ss individuals (26), and fewer errors on high compared to low reward conditions (12).

Limitations

Medication was a confound in the present study, and the presence of psychiatric control groups (e.g., MDD) did not allow us to correct for this completely: the effect seen in the BPE group (increased errors in the high vs. low condition) did not survive correction for medication load, although the effect seen in the BPD group (error by RT interaction) did. However, as the BPE group showed similar error rates if medicated by antipsychotics, mood stabilizers or higher overall doses than if they were not medicated in this way, it seems most likely that the reduction in significance relates to confounding and a consequent loss of statistical power rather than a particular effect of medication on performance. Finally, the findings of the present study do not support the contention that antidepressants improve performance which was an interpretation of our previous work (14), but it remains possible that the effect of anti-depressants may be age dependent, as described above.

One limitation of the paradigm is that it is broadly reward-focused. Recent studies have manipulated reward and punishment independently within the context of compatible motivated RT paradigms (27, 39). Future studies could usefully examine the generalization of these finding across different experimental contingencies. One straightforward manipulation would be to add another block (or more) of trials: this would allow more precise measurement of asymptotic performance, particularly of error rates which are low and thus may be difficult to estimate accurately. Finally, future studies could also explore drift diffusion modeling to verify some of our conclusions, although some more substantial alterations to the experimental design may be necessary to constrain the number of potential free parameters (40).

Summary

In summary, we emphasize two primary contributions of the present work. First, the findings confirm the presence of reward-related speeding within mood disorders: this finding is in line with our previous work, but does not support the hypothesis of motivational impairment within depressed individuals. Second, we provide evidence for an alteration in a reward-related trade-off between speed and accuracy within individuals with BD. Our favored interpretation of this finding is that individuals with BD may show a decrease in sensorimotor SNR under high reward expectation, as opposed to individuals with MDD and HC who show increased SNR. Together, the findings suggest a novel avenue for research into impulsivity in mood disorders.

Author Contributions

HC was involved in the framing of the study, data analysis, and write up of the manuscript. JF was involved in data analysis and editing of the manuscript. HA and RS was involved in data collection and management. JA was involved in study design, organization, and data collection. BS was involved in study design and editing of the manuscript. MP was involved in study design, organization, framing of the study, and write up of the manuscript.

Funding

This study was supported by National Institute of Mental Health (NIMH) grant MH076971 (to MP).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2018.00396/full#supplementary-material

References

1. Muhtadie L, Johnson SL, Carver CS, Gotlib IH, Ketter TA. A profile approach to impulsivity in bipolar disorder: the key role of strong emotions. Acta Psychiatr Scand. (2014) 129:100–8. doi: 10.1111/acps.12136

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Phillips ML, Swartz HA. A critical appraisal of neuroimaging studies of bipolar disorder: toward a new conceptualization of underlying neural circuitry and a road map for future research. Am J Psychiatry (2014) 171:829–43. doi: 10.1176/appi.ajp.2014.13081008

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Nusslock R, Alloy LB. Reward processing and mood-related symptoms: an RDoC and translational neuroscience perspective. J Affect Disord. (2017) 216:3–16. doi: 10.1016/j.jad.2017.02.001

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Ford BQ, Mauss IB, Gruber J. Valuing happiness is associated with bipolar disorder. Emotion (2015) 15:211–22. doi: 10.1037/emo0000048

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Cooke RG, Robb JC, Young LT, Joffe RT. Well-being and functioning in patients with bipolar disorder assessed using the MOS 20-ITEM short form (SF-20). J Affect Disord. (1996) 39:93–7. doi: 10.1016/0165-0327(96)00016-X

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Chase HW, Nusslock R, Almeida JR, Forbes EE, Labarbara EJ, Phillips ML. Dissociable patterns of abnormal frontal cortical activation during anticipation of an uncertain reward or loss in bipolar versus major depression. Bipolar Disord. (2013) 15(8):839–54. doi: 10.1111/bdi.12132

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Balleine BW, Killcross S. Parallel incentive processing: an integrated view of amygdala function. Trends Neurosci. (2006) 29:272–9. doi: 10.1016/j.tins.2006.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Chib VS, Shimojo S, O'doherty JP. The effects of incentive framing on performance decrements for large monetary outcomes: behavioral and neural mechanisms. J Neurosci. (2014) 34:14833–44. doi: 10.1523/JNEUROSCI.1491-14.2014

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Pizzagalli DA, Goetz E, Ostacher M, Iosifescu DV, Perlis RH. Euthymic patients with bipolar disorder show decreased reward learning in a probabilistic reward task. Biol Psychiatry (2008) 64:162–8. doi: 10.1016/j.biopsych.2007.12.001

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Roiser JP, Cannon DM, Gandhi SK, Taylor Tavares J, Erickson K, Wood S, et al. Hot and cold cognition in unmedicated depressed subjects with bipolar disorder. Bipolar Disord. (2009) 11:178–89. doi: 10.1111/j.1399-5618.2009.00669.x

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Adida M, Jollant F, Clark L, Besnier N, Guillaume S, Kaladjian A, et al. Trait-related decision-making impairment in the three phases of bipolar disorder. Biol Psychiatry (2011) 70:357–65. doi: 10.1016/j.biopsych.2011.01.018

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Cools R, Blackwell A, Clark L, Menzies L, Cox S, Robbins TW. Tryptophan depletion disrupts the motivational guidance of goal-directed behavior as a function of trait impulsivity. Neuropsychopharmacology (2005) 30:1362–73. doi: 10.1038/sj.npp.1300704

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Murray GK, Clark L, Corlett PR, Blackwell AD, Cools R, Jones PB, et al. Incentive motivation in first-episode psychosis: a behavioural study. BMC Psychiatry (2008) 8:34. doi: 10.1186/1471-244X-8-34

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Chase HW, Michael A, Bullmore ET, Sahakian BJ, Robbins TW. Paradoxical enhancement of choice reaction time performance in patients with major depression. J Psychopharmacol. (2010) 24:471–9. doi: 10.1177/0269881109104883

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Evenden JL. Varieties of impulsivity. Psychopharmacology (1999) 146:348–61. doi: 10.1007/PL00005481

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Dickman SJ. Functional and dysfunctional impulsivity: personality and cognitive correlates. J Pers Soc Psychol. (1990) 58:95–102. doi: 10.1037/0022-3514.58.1.95

PubMed Abstract | CrossRef Full Text | Google Scholar

17. First MB, Spitzer RL, Gibbon M, Williams JB. Structured Clinical Interview for DSM-IV Axis I Disorders-Patient Edition (SCID-I/P, Version 2.0). New York, NY: Biometrics Research Department, New York State Psychiatric Institute (1995).

18. Nusslock R, Almeida JR, Forbes EE, Versace A, Frank E, Labarbara EJ, et al. Waiting to win: elevated striatal and orbitofrontal cortical activity during reward anticipation in euthymic bipolar disorder adults. Bipolar Disord. (2012) 14:249–60. doi: 10.1111/j.1399-5618.2012.01012.x

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Fournier JC, Chase HW, Almeida J, Phillips ML. Model specification and the reliability of fMRI results: implications for longitudinal neuroimaging studies in psychiatry. PLoS ONE (2014) 9:e0105169. doi: 10.1371/journal.pone.0105169

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Fournier JC, Chase HW, Almeida J, Phillips ML. Within- and between-session changes in neural activity during emotion processing in unipolar and bipolar depression. Biol Psychiatry Cogn Neurosci Neuroimaging (2016) 1:518–27. doi: 10.1016/j.bpsc.2016.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry (1960) 23:56–62. doi: 10.1136/jnnp.23.1.56

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Young RC, Biggs JT, Ziegler VE, Meyer DA. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry (1978) 133:429–35. doi: 10.1192/bjp.133.5.429

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Spielberger CD, Gorsuch RL, Lushene R. State-Trait Anxiety Invenstory Test Manual Form Y. Palo Alto, CA: Consulting Psychological Press (1983).

Google Scholar

24. Duncan J, Seitz RJ, Kolodny J, Bor D, Herzog H, Ahmed A, et al. A neural basis for general intelligence. Science (2000) 289:457–60. doi: 10.1126/science.289.5478.457

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Harris CM, Waddington J, Biscione V, Manzi S. Manual choice reaction times in the rate-domain. Front Hum Neurosci. (2014) 8:418. doi: 10.3389/fnhum.2014.00418

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Roiser JP, Blackwell AD, Cools R, Clark L, Rubinsztein DC, Robbins TW, et al. Serotonin transporter polymorphism mediates vulnerability to loss of incentive motivation following acute tryptophan depletion. Neuropsychopharmacology (2006) 31:2264–72. doi: 10.1038/sj.npp.1301084

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Griffiths B, Beierholm UR. Opposing effects of reward and punishment on human vigor. Sci Rep. (2017) 7:42287. doi: 10.1038/srep42287

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Kumar P, Waiter G, Ahearn T, Milders M, Reid I, Steele JD. Abnormal temporal difference reward-learning signals in major depression. Brain (2008) 131:2084–93. doi: 10.1093/brain/awn136

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Gallistel CR. The importance of proving the null. Psychol Rev. (2009) 116:439–53. doi: 10.1037/a0015251

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Versace A, Almeida JR, Hassel S, Walsh ND, Novelli M, Klein CR, et al. Elevated left and reduced right orbitomedial prefrontal fractional anisotropy in adults with bipolar disorder revealed by tract-based spatial statistics. Arch Gen Psychiatry (2008) 65:1041–52. doi: 10.1001/archpsyc.65.9.1041

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Mueller SC, Ng P, Temple V, Hardin MG, Pine DS, Leibenluft E, et al. Perturbed reward processing in pediatric bipolar disorder: an antisaccade study. J Psychopharmacol. (2010) 24:1779–84. doi: 10.1177/0269881109353462

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Dillon DG, Wiecki T, Pechtel P, Webb C, Goer F, Murray L, et al. A computational analysis of flanker interference in depression. Psychol Med. (2015) 45:2333–44. doi: 10.1017/S0033291715000276

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Robinson OJ, Chase HW. Learning and choice in mood disorders: searching for the computational parameters of Anhedonia. Comput Psychiatr. (2017) 1:208–33. doi: 10.1162/CPSY_a_00009

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Treadway MT, Bossaller NA, Shelton RC, Zald DH. Effort-based decision-making in major depressive disorder: a translational model of motivational anhedonia. J Abnorm Psychol. (2012) 121:553–8. doi: 10.1037/a0028813

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Huys QJ, Pizzagalli DA, Bogdan R, Dayan P. Mapping anhedonia onto reinforcement learning: a behavioural meta-analysis. Biol Mood Anxiety Disord. (2013) 3:12. doi: 10.1186/2045-5380-3-12

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Ryan RM, Deci EL. On happiness and human potentials: a review of research on hedonic and eudaimonic well-being. Annu Rev Psychol. (2001) 52:141–66. doi: 10.1146/annurev.psych.52.1.141

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Bogacz R, Hu PT, Holmes PJ, Cohen JD. Do humans produce the speed-accuracy trade-off that maximizes reward rate? Q J Exp Psychol. (2010) 63:863–91. doi: 10.1080/17470210903091643

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Manohar SG, Chong TT, Apps MA, Batla A, Stamelou M, Jarman PR, et al. Reward pays the cost of noise reduction in motor and cognitive control. Curr Biol. (2015) 25:1707–16. doi: 10.1016/j.cub.2015.05.038

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Crockett MJ, Clark L, Robbins TW. Reconciling the role of serotonin in behavioral inhibition and aversion: acute tryptophan depletion abolishes punishment-induced inhibition in humans. J Neurosci. (2009) 29:11993–9. doi: 10.1523/JNEUROSCI.2513-09.2009

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Winkel J, Hawkins GE, Ivry RB, Brown SD, Cools R, Forstmann BU. Focal striatum lesions impair cautiousness in humans. Cortex (2016) 85:37–45. doi: 10.1016/j.cortex.2016.09.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: bipolar disorder, reinforcement (psychology), reaction time, motivation, major depressive disorder

Citation: Chase HW, Fournier JC, Aslam H, Stiffler R, Almeida JR, Sahakian BJ and Phillips ML (2018) Haste or Speed? Alterations in the Impact of Incentive Cues on Task Performance in Remitted and Depressed Patients With Bipolar Disorder. Front. Psychiatry 9:396. doi: 10.3389/fpsyt.2018.00396

Received: 30 April 2018; Accepted: 07 August 2018;
Published: 03 September 2018.

Edited by:

Catherine Harmer, Warneford Hospital, United Kingdom

Reviewed by:

Mirko Manchia, Università Degli Studi di Cagliari, Italy
Peter Gallagher, Newcastle University, United Kingdom

Copyright © 2018 Chase, Fournier, Aslam, Stiffler, Almeida, Sahakian and Phillips. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Henry W. Chase, Y2hhc2VoQHVwbWMuZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.