- 1Department of Neurology, BG University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany
- 2Collaborative Research Centre 874 "Integration and Representation of Sensory Processes", Ruhr-University Bochum, Bochum, Germany
- 3Guangdong Key Laboratory of Mental Health and Cognitive Science, Ministry of Education Key Laboratory of Brain Cognition and Educational Science, School of Psychology, Center for Studies of Psychological Application, South China Normal University, Guangzhou, China
- 4Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States
- 5McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, United States
- 6Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, United States
- 7Department of Neuroscience, Tufts University School of Medicine, Boston, MA, United States
Humans and other animals can maintain constant payoffs in an uncertain environment by steadily re-evaluating and flexibly adjusting current strategy, which largely depends on the interactions between the prefrontal cortex (PFC) and mediodorsal thalamus (MD). While the ventromedial PFC (vmPFC) represents the level of uncertainty (i.e., prior belief about external states), it remains unclear how the brain recruits the PFC-MD network to re-evaluate decision strategy based on the uncertainty. Here, we leverage non-linear dynamic causal modeling on fMRI data to test how prior belief-dependent activity in vmPFC gates the information flow in the PFC-MD network when individuals switch their decision strategy. We show that the prior belief-related responses in vmPFC had a modulatory influence on the connections from dorsolateral PFC (dlPFC) to both, lateral orbitofrontal (lOFC) and MD. Bayesian parameter averaging revealed that only the connection from the dlPFC to lOFC surpassed the significant threshold, which indicates that the weaker the prior belief, the less was the inhibitory influence of the vmPFC on the strength of effective connections from dlPFC to lOFC. These findings suggest that the vmPFC acts as a gatekeeper for the recruitment of processing resources to re-evaluate the decision strategy in situations of high uncertainty.
Introduction
The prefrontal cortex (PFC) consists of several regions that are thought to play an important role in flexible decision-making. The dorsolateral PFC (dlPFC) is assumed to support executive functions (Jones and Graff-Radford, 2021), whereas the orbitofrontal cortex (OFC) appears to be involved in the flexible adaptation of behavior (Schoenbaum et al., 2021; Wang et al., 2023). The ventromedial PFC (vmPFC) was shown to be associated with the estimation of the value and saliency of sensory events and thereby guides value-based decision-making (Dundon et al., 2021). Interactions between these regions are thought to implement a variety of functions relevant to the flexibility by which cognitive resources are deployed. Interaction between medial PFC and the anterior cingulate cortex, for instance, are thought to contribute to the updating of beliefs about higher-order contextual associations (Botvinick et al., 1999, 2001; Sarafyazd and Jazayeri, 2019), whereas the interplay between OFC and vmPFC appears important for the prediction of value-based behavioral changes (Howard et al., 2016).
Recent studies in animals (Halassa and Kastner, 2017; Saal et al., 2017; Schmitt et al., 2017; Mukherjee et al., 2021) and humans (Hwang et al., 2017; Wen et al., 2021) revealed compelling evidence that cognitive flexibility also depends on interactions between distinct PFC subregions and the mediodorsal nucleus of the thalamus (MD). These studies have provided a complementary perspective on thalamic function challenging the classical notion of the thalamus as a sensory relay (Sherman, 2016). Combining hierarchical Bayesian modeling with fMRI in humans, we recently unveiled distinct prefrontal connections targeting the MD in relation to the participant’s prior belief during associative learning (Wang and Pleger, 2020). The surprise about an unexpected outcome lowered the prior belief about the sample-target association and hence triggered a switch of the decision strategy through modulations of connections between MD and lateral OFC (Wang and Pleger, 2020). These findings are supported by neuronal recordings obtained from mice. When mice decided between different sets of learned cues that directed attention to either visual or auditory targets, responses from the medial PFC reflected both, the individual cue as well as its importance as a task-rule (Rikhye et al., 2018a). The MD, on the other hand, appeared to facilitate switching between cueing contexts by supporting or suppressing task-associated representations in the PFC (Schmitt et al., 2017; Rikhye et al., 2018a; Mukherjee et al., 2021). Importantly, clinical studies also showed that focal lesions in associative thalamic structures can lead to psychosis, particularly if interactions with prefrontal cortical regions become dysfunctional (Schmitt and Halassa, 2017; Mukherjee and Halassa, 2022; Anticevic and Halassa, 2023). Together, these findings from mice and humans emphasize crucial prefrontal-MD computations necessary for learning stimulus-incentive associations.
We recently refined a previously developed computational prefrontal-MD model, inspired by cell recordings obtained from mice (Rikhye et al., 2018a), and trained it on human empirical data to test whether the re-evaluation and adjustment of the decision strategy in both species follow the same computational principles (Hummos et al., 2022). We found that the MD learned abstract representations of its cortical inputs through biologically plausible Hebbian learning rules. Direct feedback from MD to prefrontal cortex supported switching between behavioral strategies, while lateral OFC (lOFC) constantly accumulated evidence for a strategy switch based on rapid Bayesian estimation. Following these computational rules, our human fMRI results revealed that lOFC directed its outputs to MD, rendering MD as the brain site which dynamically integrates crucial inputs relevant for forming the behavioral strategy. These abstract MD representations and their ability to reorganize prefrontal computations describes an efficient way how the brain utilizes the MD to integrate inputs from other brain regions and to dynamically select between competing behavioral strategies (Hummos et al., 2022).
Another prefrontal region, the vmPFC, was shown to encode the beliefs about outcome values, which represents an intermediate signal required for efficient prefrontal-MD computations underpinning the re-evaluation of decision strategy. In our prefrontal-MD model (Hummos et al., 2022), we directed the vmPFC output on the executive dlPFC, as the most relevant receiver of value-related information, but this dlPFC-vmPFC interaction was not directly supported by empirical evidence. Using bilinear Dynamic Causal Modeling (DCM), we could not directly model the vmPFC as an additional hub in the prefrontal-MD network since the vmPFC was not among the regions involved in decision switches. In the present study, we therefore applied non-linear DCM, which allowed us to capture the non-linear history of prior synaptic activity (Stephan et al., 2008), and hence the modulatory (second-order) effects of prior belief-dependent responses in vmPFC on the gain of activity within the prefrontal-MD network. We re-analyzed our previously collected fMRI dataset and combined Bayesian hierarchical modeling with non-linear DCM on fMRI data to test how prior belief-related activity in the vmPFC tunes functional couplings in the dlPFC-lOFC-MD network during the adjustment of the decision strategy.
Materials and methods
Participants and associative learning task
The analyses in this study were based on the previously collected dataset from 28 healthy human participants (mean age ± SD: 25.3 ± 3.9 years, only male participants). The study was approved by the local ethics committee of the Ruhr-University Bochum. All participants gave written informed consent prior to participation. Demographics and the experimental design were described in more detail elsewhere (Supplementary Figure S1; Wang et al., 2020; Wang and Pleger, 2020; Hummos et al., 2022).
In each trial, participants first received one out of two tactile cues for 500 ms to the tip of their right index finger using an MRI-compatible Braille piezo stimulator with 8-pins (2 × 4 array) (Metec, Stuttgart, Germany). Subsequently, they had to predict (within 1,300 ms) whether the following tactile stimulus (i.e., target) will show the same pattern (e.g., 4 upper pins lifted) or the alternative pattern (4 lower pins lifted) by pressing one of two buttons (LumiTouch keypads, Photon Control) with the index or middle finger of the left hand. After an interval of 500-1500 ms (jitter), the target was presented for 500 ms which indicated whether the preceding prediction was correct or incorrect. A variable delay of 1,500–3000 ms separated trials. The predictability of the target stimulus was modulated by the strength of the sample–target contingency over time [i.e., strongly predictive blocks (90 and 10%), moderately predictive (70 and 30%), and non-predictive (50%) blocks; either 30 or 40 trials per block to avoid predictability of the block onsets]. The order of blocks was pseudorandomized and fixed across participants to ensure inter-subject comparability of the learning process. The fMRI experiment consisted of 350 trials in total, which were split into three runs, each lasting about 10 min.
In order to examine the flexibility in decision-making, we tested the adjustment of the decision strategy, i.e., whether the statistical property of the environment (sample stimulus matches or mismatches target stimulus) has been changed or not. To this end, we separated the trials across all sample–target associations into two conditions: (1) Switching condition (122 ± 10 trials): trials with decision switches across two successive trials, and (2) Staying condition (211 ± 17 trials): trials without decision switches across two trials.
Estimating the prior belief with the Hierarchical Gaussian Filter
Human behavioral data were applied to a three-level Hierarchical Gaussian Filter (HGF) model using the HGF toolbox (v5.2) as implemented in TAPAS (Translational Algorithms for Psychiatry-Advancing Science, https://www.tnu.ethz.ch/en/software/tapas, Frässle et al., 2021), to calculate the individual trial-wise prior belief about external states at different levels. The first level of the HGF represents a sequence of inputs about the environmental states (i.e., whether the sample stimulus matches the target stimulus or not), the second level represents the sample-target contingency (i.e., the conditional probability, in logit space, of the target stimulus given the sample cue), and the third level represents the log-volatility of the environment. Each of these hidden states is assumed to evolve as a Gaussian random walk, such that its variance depends on the state at the next higher level (Mathys et al., 2011).
In the HGF, at any level i of the hierarchy, the prior belief about the external state on trial k ( ) is evolved from the posterior belief of the previous trial (k-1)
The posterior belief on trial t ( ) is updated based on the prediction error from the level below ( ) weighted by the precision of prediction ( ):
The precision of prediction is updated with every trial and can be regarded as equivalent to a dynamic learning rate in reward learning models, as follows:
where ω is a free parameter of the perceptual model in HGF, which determines the step size between consecutive time steps. The prediction error , which drives learning at the second level of our HGF model, is defined as the difference between the actual outcome and its estimated probability before the outcome:
FMRI data processing
FMRI data were collected on a Philips Achieva 3.0 T X-series scanner using a 32-channel head coil. For functional imaging, we used a T2*- weighted echoplanar imaging (EPI) sequence (voxel size, 2 mm × 2 mm ×3 mm; field of view, 224 mm× 224 mm; interslice gap, 0.6 mm; TR = 2,800 ms; TE = 36 ms) to acquire 36 transaxial slices covering the whole brain. Pre- and post-processing of the fMRI data was done using the Statistical Parametric Mapping software SPM12 (Wellcome Department of Imaging Neuroscience, University College London, London, UK1) implemented in MATLAB R2022a (MathWorks). All fMRI images were first applied to slice time correction, spatial realignment, and normalization to the MNI template using the unified segmentation approach (Ashburner and Friston, 2005). Finally, normalized images were spatially smoothed using a Gaussian filter with a full-width half-maximum kernel of 6 mm. Data were high pass filtered at 1/128 Hz. For each participant, we conducted a first level general linear model (GLM). Events were time-locked to the onset of the presentation of the cue stimulus using stick functions and split into two regressors, one for Staying (no strategy switches) and the other one for Switching trials. For each of these two regressors, two parametric modulators of prior belief were defined. The first parametric modulator was prior belief about the sample-target contingency ( ). The second modulator was prior belief about the volatility ( ), orthogonalized with respect to . Notably, the sign of the prior belief depends on the sample-target contingency due to the arbitrarily chosen coding of a binary input (i.e., trials in which the sample matched the target were coded as 1, samples that mismatched the target were coded as 0). Given that the current study’s focus was on the effect of different level of predictabilities rather than different contingencies, we employed the unsigned prior belief (i.e., absolute value) as parametric modulators for the GLM. Invalid trials (i.e., missing or late responses) were modeled separately. Furthermore, six head motion parameters, as estimated during the realignment procedure, were added as regressors of no interest to minimize false-positive activations due to task-correlated motion.
Using the GLM, we investigated prefrontal and thalamic responses when switching the decision strategy using the contrast ‘Switching > Staying’. The detailed data analysis for this comparison and corresponding results have been reported in our recent paper (Hummos et al., 2022). In current study, we primarily analyzed the main effect of parametric modulation by prior belief (both and ) for all trials using the GLM. The respective t-contrast images of the modulatory effect for each subject were applied to the group-level one-sample t test (p < 0.05, family-wise error (FWE) corrected for the whole brain).
Non-linear dynamic causal models
In our previous study (Hummos et al., 2022), the fMRI GLM analyses revealed two prefrontal regions, i.e., right dlPFC and lOFC, together with the MD thalamus, which were all significantly modulated by decision switches (Figure 1A). The Bayesian parameter averaging (BPA) across participants revealed that connections from lOFC to MD, from dlPFC to lOFC, as well as between dlPFC and MD in both directions were all significantly strengthened by strategy switches (Figure 1B). In the current study, we show that the vmPFC reflects the value of the prediction, i.e., the prior belief about the sample-target contingency (see results section). Based on these findings, we next questioned how increased BOLD responses in the vmPFC, which were positively related to stronger prior belief, modulated the connection strength of the prefrontal-MD network, thus enhancing its computational efficiency and facilitating strategy switches. To test this, we constructed a non-linear DCM including right dlPFC, right lOFC, right MD and vmPFC, and compared several alternative vmPFC modulations. For each brain region, subject-specific time series were extracted from the nearest local maximum within a sphere with a radius of 8 mm centered on each node’s group maximum. The first Eigenvariate was extracted across all voxels surviving p = 0.05, uncorrected, within a 4 mm sphere centered on the individual peak voxel. The resulting BOLD time series were adjusted for effects of no interest (e.g., invalid trials, and movement parameters).
Figure 1. The fMRI activity and dynamic causal modeling (DCM). (A) Strategy switches (Switching > Staying) entailed significant BOLD activity in right dlPFC, right lOFC and right MD (Hummos et al., 2022). (B) The bilinear DCM revealed that four connections (orange), i.e., from lOFC to MD, from dlPFC to lOFC, as well as between dlPFC and MD in both directions were significantly strengthened by strategy switches (Switching > Staying) (Hummos et al., 2022). The black connections indicate endogenous connections between brain areas. (C) Upper panel: The prior beliefs over the course of the experiment. The red shaded area indicates the standard error of the mean (SEM) of prior beliefs over time. Lower panel: Prior belief-related vmPFC activity is projected on sagittal and axial MRI brain slices (p < 0.05, whole brain-FWE corrected). (D) The non-linear DCM, which we used to test how prior belief-related activity in the vmPFC (blue) gated the information flow in the dlPFC-lOFC-MD network. (dlPFC–dorsolateral prefrontal cortex; lOFC–lateral orbitofrontal cortex; MD–thalamic mediodorsal nucleus; vmPFC-ventromedial prefrontal cortex).
The basic architecture of the model, shown in Figure 1B and in Hummos et al. (2022), included the driving sensory (i.e., tactile) input directed to the dlPFC, as well as the four connections that were all significantly strengthened by strategy switches (i.e., dlPFC to lOFC, lOFC to MD, as well as between dlPFC and MD in both directions). With the non-linear DCM we extended this model by the modulation of these connections through the activity in the vmPFC, which was driven by the trial-by-trial prior belief derived from the HGF model (Figure 1D). We specified the model space with the modulatory influence of the vmPFC on different connections. More specifically, we tested the modulatory influence of prior belief related vmPFC on two out of the four significant connections within the dlPFC-lOFC-MD network, resulting in six models that were compared to each other (Supplementary Figure S2A). The fixed-effects Bayesian model selection (BMS) was used to assess the most likely model among the six competing models. Parameters of the winning model were then summarized by Bayesian parameter averaging (BPA), which computes a joint posterior density for the entire group by combining the individual posterior densities. A posterior probability criterion of 90% was considered to reflect significant modulatory effect on the connections.
Results
The participants’ behavioral performance during the associative learning task has been presented in more detail in our previous studies (Wang et al., 2020; Wang and Pleger, 2020; Hummos et al., 2022). In brief, we found a significant effect of learning [one-way ANOVA: F(4,27) = 256, p < 0.001] and post-hoc paired t-tests revealed that participants made significantly more correct predictions in learning blocks with high predictability (i.e., 90%/10%) than in blocks with low predictability [i.e., 70%/30%, t(1,27) = 20.75, p < 0.001, Bonferroni- corrected] or unpredictability [i.e., 50%, t(1,27) = 24.70, p < 0.001, Bonferroni-corrected]. In addition, we found that the participants had a lower prior belief and required more time to respond (p < 0.001) in trials when they changed their decision strategy (Switching), compared to the trials where the decision strategy did not change (Staying).
In order to test whether participants’ learning behavior could be rather explained by hierarchical learning (i.e., Bayesian HGF model), which updates the learning rate dynamically, or by a fixed ‘ideal’ learning rate as assumed by the reinforcement learning algorithm (Rescorla-Wagner, RW), we compared the HGF with RW model using the random-effect Bayesian model selection (BMS). The BMS result revealed that HGF was the winning model with a posterior model probability of 90% (posterior probability, 0.90; exceedance probability, 1.00).
To assess the parametric modulation by the prior beliefs, both derived from HGF model were included as modulatory parameters in the GLM of fMRI data. We found that responses from the vmPFC reflected the prior belief about the sample-target contingency [ , Figure 1C, x = −6, y = 62, z = −4, t(31) = 6.69, p < 0.05, FWE whole-brain corrected]. Besides the vmPFC, also the left precuneus (x = −10, y = −56, z = 26) and middle cingulate cortex (x = 2, y = −12, z = 38) represented significant prior belief-related activity (p < 0.05, FWE whole-brain corrected). The analysis of parametric effects related to prior belief about volatility ( ) did not reveal any significant brain regions (p > 0.05, FWE whole-brain corrected).
As shown in our recent study (Hummos et al., 2022), the Bayesian model comparison across different plausible bilinear dlPFC-lOFC-MD models revealed that the adjustment of the decision strategy (Switching) significantly modulated connections from lOFC to MD, from dlPFC to lOFC, as well as between dlPFC and MD in both directions (Figure 1B). In the present study, using non-linear DCM, we tested how prior belief related response from vmPFC modulated the connection strength in the dlPFC-lOFC-MD network (Figure 1D). More specifically, we tested which two of the four projections were directly modulated by vmPFC activity. This resulted in six competing models that were further evaluated with Bayesian model selection (BMS) (Supplementary Figure S2A). BMS revealed that the prior belief associated activity in vmPFC had a modulatory influence on the projections originating in the dlPFC and targeting both, lOFC and MD (Supplementary Figure S2B; Figure 2A). The posterior probability for the winning model was 0.48, surpassing the posterior probabilities of the other tested models which ranged from 0.01 to 0.25 (Supplementary Figure S2B). This winning non-linear model assumed a direct effect of prior belief on vmPFC whose activity then mediated the gain of the dlPFC→lOFC and dlPFC→MD connections. We next applied Bayesian parameter averaging (BPA), which computes a joint posterior density for the entire sample and found that the prior belief-dependent vmPFC activity significantly influenced the dlPFC→lOFC connection (posterior probability = 0.94, Figure 2B). The modulations of the dlPFC→MD connection showed a trend into the same direction, but the posterior probabilities failed to surpass the 90% threshold (posterior probability = 0.81, Figure 2B). The modulatory effect was generally inhibitory, suggesting that the weaker the activity in vmPFC, and the lower the prior belief (i.e., high task uncertainty), the stronger was the connection strength from dlPFC to lOFC.
Figure 2. The modulatory effect on the dlPFC-lOFC-MD network by prior belief-associated vmPFC activity. (A) The Bayesian Model Selection across the possible six models revealed the winning model in which prior belief-dependent vmPFC activity specifically influenced the connections originating in the dlPFC and targeting both, lOFC and MD. The black connections indicate endogenous connections between brain areas. (B) Bayesian parameter averaging revealed that the prior belief-dependent vmPFC activity significantly inhibited the dlPFC→lOFC connection (posterior probability = 0.94, solid red line) suggesting that the higher the activity in vmPFC, and the stronger prior belief, the weaker was the connection strength from dlPFC to lOFC. The negative numbers shown next to the projections and above the posterior probabilities index the inhibitory inputs per second (Hz). The dlPFC→MD connection showed a trend into the same direction, but posterior probabilities failed to surpass the 90% threshold (posterior probability = 0.81, dotted red line).
Discussion
In this study, we investigated how prior belief-related activity in the vmPFC gates the information flow in the dlPFC-lOFC-MD network underpinning the re-evaluation of decision strategy during associative learning. We showed that the strength of connections from dlPFC to lOFC fluctuated from trial to trial in relation to the level of task uncertainty, i.e., the prior beliefs about the sample-target contingency, signaled by inhibitory inputs from the vmPFC. The weaker the activity in vmPFC, and the stronger the task uncertainty (i.e., low prior belief), the stronger was the strength of effective connection from executive dlPFC to lOFC. These findings suggest that in situations with high task uncertainty, there is an increase in lOFC responses to inputs from dlPFC, which provides direct empirical evidence for the key role of prior belief-dependent synaptic plasticity in driving the re-evaluation of the decision strategy during flexible decision-making.
Representations of sensory stimuli are modulated by internal states and beliefs about the world, which are crucial for flexibly adjusting perception and behavior in humans and animals (Snyder et al., 2015; Akrami et al., 2018). The prior beliefs have been shown to warp neural representations in the frontal cortex, which allows the mapping of sensory inputs to motor outputs and to incorporate prior statistics in accordance with Bayesian inference (Sohn et al., 2019). These results uncover a simple and general principle whereby prior beliefs exert their influence on behavior by sculpting cortical latent dynamics. The failure of beliefs about the environmental state forms the prediction error and updates expectations for upcoming stimuli and associated rewards. In a previous study, we investigated how humans apply probabilistic computations, following Bayesian rules, to infer on joint brain representations of prior belief and decision switches (Wang et al., 2020). We found, that during such switches prior belief specifically modulated connectivity among the anterior insular cortex (AIC), the premotor cortex (PMd), and the inferior parietal lobule (IPL). On a trial-by-trial basis, prior belief weakened connectivity between AIC and IPL when the sensory stimulus was expected, whereas it strengthened connectivity between AIC and PMd when the stimulus was unexpected (Wang et al., 2020). AIC has been shown to act as a core hub modulating the interaction of bodily, attentional, and anticipatory sensory signals (Sridharan et al., 2008; Craig, 2009; Allen et al., 2016). Our results furnish a picture in which AIC, in conjunction with other brain regions, contributes not only to the coordination of expectation and sensory inputs, but also to the integration of priors and prediction outcomes for updating beliefs specifically supporting strategy switches during associative learning (Ferrari et al., 2022). The present findings extend the scope of belief-related brain functions by the vmPFC. According to our non-linear DCM, vmPFC plays a key role in exerting modulatory influences on prefrontal interactions, which does not directly reflect prior belief such as the aforementioned AIC-network (Allen et al., 2016), but which critically depends on prior belief-dependent information from the vmPFC to flexibly guide strategy switches.
It has been shown that vmPFC plays a major role in sensory integration to achieve abstract and conceptual interpretation of the environment (Petrides and Pandya, 2007). The vmPFC represents signals more suited for subject-centered, internally driven motivational processes, whereas lOFC encodes signals for evaluating environment-centered, externally driven motivational processes (Bouret and Richmond, 2010). Functional interactions between vmPFC and lOFC in humans are thought to implement important functions relevant to cognitive flexibility and the prediction of value-based behavioral changes (Howard et al., 2016). In primates, the dlPFC has been shown to be anatomically (Mackey and Petrides, 2010; Saleem et al., 2014) and functionally (Kahnt et al., 2012) connected to the lOFC, and also in rodents, mPFC terminals - i.e., homolog of the primate dlPFC - were detected in the lateral and ventrolateral OFC (Dalley et al., 2004). Rodent mPFC projection neurons furthermore target the thalamus MD to regulate adaptive control to flexibly optimize behavioral responses in goal-directed behavior and receive an MD innervation (Hoover and Vertes, 2007; Carlén, 2017). Together, these anatomical evidence from rodents and non-human primates provide a solid anatomical basis for our winning non-linear DCM model, in which the responses of lOFC to the presynaptic input from dlPFC depend on the history of inputs that they receive from vmPFC. Inconsistencies in the nomenclature and anatomical boundaries of PFC areas have made it difficult to compare data and interpret findings across species, especially between primates and rodents. In addition, we cannot exclude that the modulatory effect on the PFC interactions is implemented through an intermediate region, for example the amygdala or the striatum, given the fact that both receive many cortical inputs from prefrontal areas (Middleton and Strick, 2002; Chang and Grace, 2018).
The weaker the prior belief related activity in vmPFC, the stronger is the connection strength of the dlPFC projections, suggesting that the re-evaluation of decision strategy under high task uncertainty requires further processing resources controlled by vmPFC. This gating mechanism agrees well with the theoretical accounts of the free-energy principle (Friston, 2009). Accordingly, perception optimizes predictions by minimizing free energy with respect to perceptual inference, memory, attention and salience (Friston, 2010). Based on these principles, the high prior belief about the environmental state in our study may have weakened the strength of the dlPFC-output projections to minimize the engagement of additional cognitive resources and hence to lower the energy costs for the decision process. The process that alters synaptic strengths with time constants in the range of milliseconds to minutes, i.e., the so-called “short-term synaptic plasticity” (STP), is proposed to be the underlying neurobiological mechanisms for this non-linear modulatory effect, including NMDA-controlled rapid trafficking of AMPA receptors (Diering and Huganir, 2018), synaptic depression/facilitation (Zucker and Regehr, 2002) or “early long-term potentiation (LTP)” (Frey and Morris, 1998). Notably, we found that the prior belief-related responses in the vmPFC significantly gate outputs from dlPFC to lOFC, but from dlPFC to MD only at trend level. The thalamic circuits have been shown to play a role in differentially using contextual information to reconfigure functional connectivity within and across cortical areas in a task-dependent manner for perceptual inference and behavioral flexibility (Nakajima and Halassa, 2017; Rikhye et al., 2018b; Halassa and Sherman, 2019; Mukherjee et al., 2020). However, based on our results, the roles of PFC-MD interactions in switching the behavioral strategies are relatively less affected by the prior beliefs about the environment, but more important for the processing of decision strategy (i.e., the selection of Stay or Switch). The relationship between thalamic circuits and the prior beliefs should be further investigated in the future studies.
There are limitations that should be considered when interpreting the current findings. First, single unit recording in mice are well suited to assess how distinct neuronal populations in subregions of the PFC and MD nuclei respond to specific task-related cues and contexts (Rikhye et al., 2018a). FMRI recordings instead represent comparably crude measures of neural activity capturing hemodynamic responses only from large neuronal populations. These topographic inaccuracies, especially for signals originating in small cortical and subcortical structures, represent a general limitation of fMRI. Second, we did not split the MD into its subnuclei when we constructed the network models. A recently developed MRI-based atlas provides masks for four of these subregions (the medial (MDm), central (MDc), dorsal (MDd), and lateral (MDl)), created with connectivity-based methods applied to the high-resolution data from the Human Connectome Project (Li et al., 2022). In the current study with lower functional MRI resolution and signal-to-noise ratio, it remained difficult to reliably capture these subregions. Therefore, we considered the MD as a single region of interest. Considering these topographic inaccuracies, the trend level effect we found for the modulatory influence on the dlPFC-MD connection should be interpreted with caution. Third, although we focused on thalamocortical connections, other regions of the broader PFC-MD network, including the basal ganglia or the amygdala, could have also been involved in guiding cognitive flexibility. For example, the cortical-basal ganglia-thalamocortical subnetworks have been shown to be involved in impaired cognitive flexibility in patients with obsessive-compulsive disorder (Kim et al., 2022). Given the above limitations, it will be one of the major challenges for future decision-making research to extend the here proposed PFC-MD network by other key regions in the PFC, basal ganglia, and other subcortical structures, through combining various imaging techniques and cross-species approaches (e.g., Nakajima et al., 2019a,b).
Data and code availability
The data and code that were applied to assess the findings of this study are available from the corresponding author upon reasonable request.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics committee of the Ruhr-University Bochum. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
BW: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. SD: Conceptualization, Investigation, Methodology, Writing – review & editing. AH: Conceptualization, Investigation, Methodology, Writing – review & editing. MH: Conceptualization, Investigation, Methodology, Writing – review & editing. BP: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – review & editing.
Funding
This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation): Project number 122679504 – SFB 874 ‘Integration and Representation of Sensory Processes’ (to BW and BP), the DFG grant with the project number PL602/6-1 ‘Prefrontal-thalamic control of cognitive flexibility – from mice to humans’ (to BP) and National Natural Science Foundation of China: Project number 32200867 (to BW). We acknowledge support by the Open Access Publication Funds of the Ruhr-Universität Bochum.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2023.1278096/full#supplementary-material
Footnotes
References
Akrami, A., Kopec, C. D., Diamond, M. E., and Brody, C. D. (2018). Posterior parietal cortex represents sensory history and mediates its effects on behaviour. Nature 554, 368–372. doi: 10.1038/nature25510
Allen, M., Fardo, F., Dietz, M. J., Hillebrandt, H., Friston, K. J., Rees, G., et al. (2016). Anterior insula coordinates hierarchical processing of tactile mismatch responses. NeuroImage 127, 34–43. doi: 10.1016/j.neuroimage.2015.11.030
Anticevic, A., and Halassa, M. M. (2023). The thalamus in psychosis spectrum disorder. Front. Neurosci. 17:1163600. doi: 10.3389/fnins.2023.1163600
Ashburner, J., and Friston, K. J. (2005). Unified segmentation. NeuroImage 26, 839–851. doi: 10.1016/j.neuroimage.2005.02.018
Botvinick, M. M., Carter, C. S., Braver, T. S., Barch, D. M., and Cohen, J. D. (2001). Conflict monitoring and cognitive control. Psychol. Rev. 108, 624–652. doi: 10.1037/0033-295X.108.3.624
Botvinick, M., Nystrom, L. E., Fissell, K., Carter, C. S., and Cohen, J. D. (1999). Conflict monitoring versus selection-for-action in anterior cingulate cortex. Nature 402, 179–181. doi: 10.1038/46035
Bouret, S., and Richmond, B. J. (2010). Ventromedial and orbital prefrontal neurons differentially encode internally and externally driven motivational values in monkeys. J. Neurosci. 30, 8591–8601. doi: 10.1523/JNEUROSCI.0049-10.2010
Carlén, M. (2017). What constitutes the prefrontal cortex? Science 358, 478–482. doi: 10.1126/science.aan8868
Chang, C.-H., and Grace, A. A. (2018). Inhibitory modulation of orbitofrontal cortex on medial prefrontal cortex-amygdala information flow. Cereb. Cortex 28, 1–8. doi: 10.1093/cercor/bhw342
Craig, A. D. (2009). How do you feel - now? The anterior insula and human awareness. Nat. Rev. Neurosci. 10, 59–70. doi: 10.1038/nrn2555
Dalley, J. W., Cardinal, R. N., and Robbins, T. W. (2004). Prefrontal executive and cognitive functions in rodents: neural and neurochemical substrates. Neurosci. Biobehav. Rev. 28, 771–784. doi: 10.1016/j.neubiorev.2004.09.006
Diering, G. H., and Huganir, R. L. (2018). The AMPA receptor code of synaptic plasticity. Neuron 100, 314–329. doi: 10.1016/j.neuron.2018.10.018
Dundon, N. M., Shapiro, A. D., Babenko, V., Okafor, G. N., and Grafton, S. T. (2021). Ventromedial prefrontal cortex activity and sympathetic Allostasis during value-based ambivalence. Front. Behav. Neurosci. 15:615796. doi: 10.3389/fnbeh.2021.615796
Ferrari, A., Richter, D., and de Lange, F. P. (2022). Updating contextual sensory expectations for adaptive behavior. J. Neurosci. 42, 8855–8869. doi: 10.1523/JNEUROSCI.1107-22.2022
Frässle, S., Aponte, E. A., Bollmann, S., Brodersen, K. H., Do, C. T., Harrison, O. K., et al. (2021). TAPAS: an open-source software package for translational Neuromodeling and computational psychiatry. Front. Psych. 12:680811. doi: 10.3389/fpsyt.2021.680811
Frey, U., and Morris, R. G. (1998). Synaptic tagging: implications for late maintenance of hippocampal long-term potentiation. Trends Neurosci. 21, 181–188. doi: 10.1016/s0166-2236(97)01189-2
Friston, K. (2009). The free-energy principle: a rough guide to the brain? Trends Cogn. Sci. 13, 293–301. doi: 10.1016/j.tics.2009.04.005
Friston, K. (2010). The free-energy principle: A unified brain theory? Nat. Rev. Neurosci. 11, 127–138. doi: 10.1038/nrn2787
Halassa, M. M., and Kastner, S. (2017). Thalamic functions in distributed cognitive control. Nat. Neurosci. 20, 1669–1679. doi: 10.1038/s41593-017-0020-1
Halassa, M. M., and Sherman, S. M. (2019). Thalamocortical circuit motifs: a general framework. Neuron 103, 762–770. doi: 10.1016/j.neuron.2019.06.005
Hoover, W. B., and Vertes, R. P. (2007). Anatomical analysis of afferent projections to the medial prefrontal cortex in the rat. Brain Struct. Funct. 212, 149–179. doi: 10.1007/s00429-007-0150-4
Howard, J. D., Kahnt, T., and Gottfried, J. A. (2016). Converging prefrontal pathways support associative and perceptual features of conditioned stimuli. Nat. Commun. 7:11546. doi: 10.1038/ncomms11546
Hummos, A., Wang, B. A., Drammis, S., Halassa, M. M., and Pleger, B. (2022). Thalamic regulation of frontal interactions in human cognitive flexibility. PLoS Comput. Biol. 18:e1010500. doi: 10.1371/journal.pcbi.1010500
Hwang, K., Bertolero, M. A., Liu, W. B., and D’Esposito, M. (2017). The human thalamus is an integrative hub for functional brain networks. J. Neurosci. 37, 5594–5607. doi: 10.1523/JNEUROSCI.0067-17.2017
Jones, D. T., and Graff-Radford, J. (2021). Executive Dysfunction and the Prefrontal Cortex. Continuum (Minneap Minn) 27, 1586–1601. doi: 10.1212/CON.0000000000001009
Kahnt, T., Chang, L. J., Park, S. Q., Heinzle, J., and Haynes, J.-D. (2012). Connectivity-based parcellation of the human orbitofrontal cortex. J. Neurosci. 32, 6240–6250. doi: 10.1523/JNEUROSCI.0257-12.2012
Kim, T., Kim, M., Jung, W. H., Kwak, Y., Bin Moon, S.-Y., Kyungjin Lho, S., et al. (2022). Unbalanced fronto-pallidal neurocircuit underlying set shifting in obsessive-compulsive disorder. Brain J. Neurol. 145, 979–990. doi: 10.1093/brain/awab483
Li, K., Fan, L., Cui, Y., Wei, X., He, Y., Yang, J., et al. (2022). The human mediodorsal thalamus: organization, connectivity, and function. NeuroImage 249:118876. doi: 10.1016/j.neuroimage.2022.118876
Mackey, S., and Petrides, M. (2010). Quantitative demonstration of comparable architectonic areas within the ventromedial and lateral orbital frontal cortex in the human and the macaque monkey brains. Eur. J. Neurosci. 32, 1940–1950. doi: 10.1111/j.1460-9568.2010.07465.x
Mathys, C., Daunizeau, J., Friston, K., and Stephan, K. (2011). A Bayesian foundation for individual learning under uncertainty. Front. Hum. Neurosci. 5, 1–20. doi: 10.3389/fnhum.2011.00039
Middleton, F. A., and Strick, P. L. (2002). Basal-ganglia “projections” to the prefrontal cortex of the primate. Cerebral Cortex 12, 926–935. doi: 10.1093/cercor/12.9.926
Mukherjee, A., Bajwa, N., Lam, N. H., Porrero, C., Clasca, F., and Halassa, M. M. (2020). Variation of connectivity across exemplar sensory and associative thalamocortical loops in the mouse. eLife 9:e62554. doi: 10.7554/eLife.62554
Mukherjee, A., and Halassa, M. M. (2022). The associative thalamus: A switchboard for cortical operations and a promising target for schizophrenia. Neuroscientist :107385842211128. doi: 10.1177/10738584221112861
Mukherjee, A., Lam, N. H., Wimmer, R. D., and Halassa, M. M. (2021). Thalamic circuits for independent control of prefrontal signal and noise. Nature 600, 100–104. doi: 10.1038/s41586-021-04056-3
Nakajima, M., and Halassa, M. M. (2017). Thalamic control of functional cortical connectivity. Curr. Opin. Neurobiol. 44, 127–131. doi: 10.1016/j.conb.2017.04.001
Nakajima, M., Schmitt, L. I., Feng, G., and Halassa, M. M. (2019a). Combinatorial targeting of distributed forebrain networks reverses noise hypersensitivity in a model of autism Spectrum disorder. Neuron 104, 488–500.e11. doi: 10.1016/j.neuron.2019.09.040
Nakajima, M., Schmitt, L. I., and Halassa, M. M. (2019b). Prefrontal cortex regulates sensory filtering through a basal ganglia-to-thalamus pathway. Neuron 103, 445–458.e10. doi: 10.1016/j.neuron.2019.05.026
Petrides, M., and Pandya, D. N. (2007). Efferent association pathways from the rostral prefrontal cortex in the macaque monkey. J. Neurosci. 27, 11573–11586. doi: 10.1523/JNEUROSCI.2419-07.2007
Rikhye, R. V., Gilra, A., and Halassa, M. M. (2018a). Thalamic regulation of switching between cortical representations enables cognitive flexibility. Nat. Neurosci. 21, 1753–1763. doi: 10.1038/s41593-018-0269-z
Rikhye, R. V., Wimmer, R. D., and Halassa, M. M. (2018b). Toward an integrative theory of thalamic function. Annu. Rev. Neurosci. 41, 163–183. doi: 10.1146/annurev-neuro-080317-062144
Saal, H. P., Delhaye, B. P., Rayhaun, B. C., and Bensmaia, S. J. (2017). Simulating tactile signals from the whole hand with millisecond precision. Proc. Natl. Acad. Sci. 114, E5693–E5702. doi: 10.1073/pnas.1704856114
Saleem, K. S., Miller, B., and Price, J. L. (2014). Subdivisions and connectional networks of the lateral prefrontal cortex in the macaque monkey. J. Comp. Neurol. 522, 1641–1690. doi: 10.1002/cne.23498
Sarafyazd, M., and Jazayeri, M. (2019). Hierarchical reasoning by neural circuits in the frontal cortex. Science 364:eaav8911. doi: 10.1126/science.aav8911
Schmitt, L. I., and Halassa, M. M. (2017). Interrogating the mouse thalamus to correct human neurodevelopmental disorders. Mol. Psychiatry 22, 183–191. doi: 10.1038/mp.2016.183
Schmitt, L. I., Wimmer, R. D., Nakajima, M., Happ, M., Mofakham, S., and Halassa, M. M. (2017). Thalamic amplification of cortical connectivity sustains attentional control. Nature 545, 219–223. doi: 10.1038/nature22073
Schoenbaum, G., Khamassi, M., Pessiglione, M., Gottfried, J. A., and Murray, E. A. (2021). The magical orbitofrontal cortex. Behav. Neurosci. 135:108. doi: 10.1037/bne0000470
Sherman, S. M. (2016). Thalamus plays a central role in ongoing cortical functioning. Nat. Neurosci. 19, 533–541. doi: 10.1038/nn.4269
Snyder, J. S., Schwiedrzik, C. M., Vitela, A. D., and Melloni, L. (2015). How previous experience shapes perception in different sensory modalities. Front. Hum. Neurosci. 9:594. doi: 10.3389/fnhum.2015.00594
Sohn, H., Narain, D., Meirhaeghe, N., and Jazayeri, M. (2019). Bayesian computation through cortical latent dynamics. Neuron 103, 934–947.e5. doi: 10.1016/j.neuron.2019.06.012
Sridharan, D., Levitin, D. J., and Menon, V. (2008). A critical role for the right fronto-insular cortex in switching between central-executive and default-mode networks. Proc. Natl. Acad. Sci. U. S. A. 105, 12569–12574. doi: 10.1073/pnas.0800005105
Stephan, K. E., Kasper, L., Harrison, L. M., Daunizeau, J., den Ouden, H. E. M., Breakspear, M., et al. (2008). Nonlinear dynamic causal models for fMRI. NeuroImage 42, 649–662. doi: 10.1016/j.neuroimage.2008.04.262
Wang, B. A., and Pleger, B. (2020). Confidence in decision-making during probabilistic tactile learning related to distinct Thalamo-prefrontal pathways. Cereb. Cortex 30, 4677–4688. doi: 10.1093/cercor/bhaa073
Wang, B. A., Schlaffke, L., and Pleger, B. (2020). Modulations of insular projections by prior belief mediate the precision of prediction error during tactile learning. J. Neurosci. 40, 3827–3837. doi: 10.1523/JNEUROSCI.2904-19.2020
Wang, B. A., Veismann, M., Banerjee, A., and Pleger, B. (2023). Human orbitofrontal cortex signals decision outcomes to sensory cortex during behavioral adaptations. Nat. Commun. 14:3552. doi: 10.1038/s41467-023-38671-7
Wen, X., Li, W., Liu, Y., Liu, Z., Zhao, P., Zhu, Z., et al. (2021). Exploring communication between the thalamus and cognitive control-related functional networks in the cerebral cortex. Cogn. Affect. Behav. Neurosci. 21, 656–677. doi: 10.3758/s13415-021-00892-y
Keywords: cognitive flexibility, prefrontal cortex (PFC), mediodorsal thalamus (MD), non-linear dynamic causal modeling, uncertainty, prior belief
Citation: Wang BA, Drammis S, Hummos A, Halassa MM and Pleger B (2023) Modulation of prefrontal couplings by prior belief-related responses in ventromedial prefrontal cortex. Front. Neurosci. 17:1278096. doi: 10.3389/fnins.2023.1278096
Edited by:
Jan B. Engelmann, University of Amsterdam, NetherlandsReviewed by:
Ambra Ferrari, Max Planck Institute for Psycholinguistics, NetherlandsRuiwang Huang, South China Normal University, China
Copyright © 2023 Wang, Drammis, Hummos, Halassa and Pleger. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bin A. Wang, bin.a.wang@outlook.com