The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Comput. Neurosci.
Volume 18 - 2024 |
doi: 10.3389/fncom.2024.1466364
This article is part of the Research Topic Health Data Science and AI in Neuroscience & Psychology View all articles
Simulated synapse loss induces depression-like behaviors in deep reinforcement learning
Provisionally accepted- 1 Mount Royal University, Calgary, Canada
- 2 University of Lethbridge, Lethbridge, Alberta, Canada
Deep Reinforcement Learning is a branch of artificial intelligence that uses artificial neural networks to model reward-based learning as it occurs in biological agents. Here we modify a Deep Reinforcement Learning approach by imposing a suppressive effect on the connections between neurons in the artificial network -simulating the effect of dendritic spine loss as observed in major depressive disorder (MDD). Surprisingly, this simulated spine loss is sufficient to induce a variety of MDD-like behaviors in the artificially intelligent agent, including anhedonia, increased temporal discounting, avoidance, and an altered exploration/exploitation balance. Furthermore, simulating alternative and longstanding reward-processing-centric conceptions of MDD (dysfunction of the dopamine system, altered reward discounting, context-dependent learning rates, increased exploration) does not produce the same range of MDD-like behaviors. These results support a conceptual model of MDD as a reduction of brain connectivity (and thus information-processing capacity) rather than an imbalance in monoamines -though the computational model suggests a possible explanation for the dysfunction of dopamine systems in MDD. Reversing the spine-loss effect in our computational MDD model can lead to rescue of rewarding behavior under some conditions. This supports the search for treatments that increase plasticity and synaptogenesis, and the model suggests some implications for their effective administration.
Keywords: Major Depressive Disorder, reinforcement learning, neuroplasticity, Monoamine hypothesis, psychedelics, reward prediction error
Received: 17 Jul 2024; Accepted: 23 Oct 2024.
Copyright: © 2024 Chalmers, Duarte, Al-Hejji, Devoe, Gruber and McDonald. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Eric Chalmers, Mount Royal University, Calgary, Canada
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.