AUTHOR=Liebenow Brittany , Jones Rachel , DiMarco Emily , Trattner Jonathan D. , Humphries Joseph , Sands L. Paul , Spry Kasey P. , Johnson Christina K. , Farkas Evelyn B. , Jiang Angela , Kishida Kenneth T. TITLE=Computational reinforcement learning, reward (and punishment), and dopamine in psychiatric disorders JOURNAL=Frontiers in Psychiatry VOLUME=13 YEAR=2022 URL=https://www.frontiersin.org/journals/psychiatry/articles/10.3389/fpsyt.2022.886297 DOI=10.3389/fpsyt.2022.886297 ISSN=1664-0640 ABSTRACT=

In the DSM-5, psychiatric diagnoses are made based on self-reported symptoms and clinician-identified signs. Though helpful in choosing potential interventions based on the available regimens, this conceptualization of psychiatric diseases can limit basic science investigation into their underlying causes. The reward prediction error (RPE) hypothesis of dopamine neuron function posits that phasic dopamine signals encode the difference between the rewards a person expects and experiences. The computational framework from which this hypothesis was derived, temporal difference reinforcement learning (TDRL), is largely focused on reward processing rather than punishment learning. Many psychiatric disorders are characterized by aberrant behaviors, expectations, reward processing, and hypothesized dopaminergic signaling, but also characterized by suffering and the inability to change one's behavior despite negative consequences. In this review, we provide an overview of the RPE theory of phasic dopamine neuron activity and review the gains that have been made through the use of computational reinforcement learning theory as a framework for understanding changes in reward processing. The relative dearth of explicit accounts of punishment learning in computational reinforcement learning theory and its application in neuroscience is highlighted as a significant gap in current computational psychiatric research. Four disorders comprise the main focus of this review: two disorders of traditionally hypothesized hyperdopaminergic function, addiction and schizophrenia, followed by two disorders of traditionally hypothesized hypodopaminergic function, depression and post-traumatic stress disorder (PTSD). Insights gained from a reward processing based reinforcement learning framework about underlying dopaminergic mechanisms and the role of punishment learning (when available) are explored in each disorder. Concluding remarks focus on the future directions required to characterize neuropsychiatric disorders with a hypothesized cause of underlying dopaminergic transmission.