- 1Program in Mathematical, Computational, and Systems Biology, University of California, Irvine, Irvine, CA, United States
- 2Department of Physiology and Biophysics, University of California, Irvine, Irvine, CA, United States
- 3Department of Neurobiology and Behavior, University of California, Irvine, Irvine, CA, United States
- 4Department of Biomedical Engineering, University of California, Irvine, Irvine, CA, United States
- 5Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, United States
- 6Center for the Neurobiology of Learning and Memory, University of California, Irvine, Irvine, CA, United States
Decades of research have revealed the remarkable complexity of the midbrain dopamine (DA) system, which comprises cells principally located in the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc). Neither homogenous nor serving a singular function, the midbrain DA system is instead composed of distinct cell populations that (1) receive different sets of inputs, (2) project to separate forebrain sites, and (3) are characterized by unique transcriptional and physiological signatures. To appreciate how these differences relate to circuit function, we first need to understand the anatomical connectivity of unique DA pathways and how this connectivity relates to DA-dependent motivated behavior. We and others have provided detailed maps of the input-output relationships of several subpopulations of midbrain DA cells and explored the roles of these different cell populations in directing behavioral output. In this study, we analyze VTA inputs and outputs as a high dimensional dataset (10 outputs, 22 inputs), deploying computational techniques well-suited to finding interpretable patterns in such data. In addition to reinforcing our previous conclusion that the connectivity in the VTA is dependent on spatial organization, our analysis also uncovered a set of inputs elevated onto each projection-defined VTADA cell type. For example, VTADA→NAcLat cells receive preferential innervation from inputs in the basal ganglia, while VTADA→Amygdala cells preferentially receive inputs from populations sending a distributed input across the VTA, which happen to be regions associated with the brain’s stress circuitry. In addition, VTADA→NAcMed cells receive ventromedially biased inputs including from the preoptic area, ventral pallidum, and laterodorsal tegmentum, while VTADA→mPFC cells are defined by dominant inputs from the habenula and dorsal raphe. We also go on to show that the biased input logic to the VTADA cells can be recapitulated using projection architecture in the ventral midbrain, reinforcing our finding that most input differences identified using rabies-based (RABV) circuit mapping reflect projection archetypes within the VTA.
Introduction
The VTA plays a central role in a variety of both adaptive and pathological motivated behaviors, principally through cells that release the neurotransmitter DA (Morales and Margolis, 2017). These cells direct motivated behaviors by release of DA into downstream brain structures such as the nucleus accumbens (NAc), dorsal striatum (DStr), and medial prefrontal cortex (mPFC) (Beier et al., 2015). Activation of DA cells as a population is highly reinforcing, as animals will robustly self-administer stimulation of DA neurons (Olds and Milner, 1954). DA cells have also been implicated in reward-prediction error (RPE), or the difference between the received and anticipated value of an outcome (Schultz, 1998). While much of the data fit the RPE model, some do not. For example, an aversive stressful experience or a painful stimulus such as a foot pinch triggers DA release into forebrain structures (Navratilova et al., 2015). One recent study suggested that physiological DA release in the NAc only relates to outcomes predicted by RPE within a limited number of scenarios and instead broadly signals perceived salience (Kutlu et al., 2021). Other studies pointed to the existence of subsets of DA cells that not only project to different forebrain sites, but also have unique transcriptional, electrophysiological, and response properties to various stimuli (Lammel et al., 2008, 2011; Kim et al., 2016). We now know that the VTA is comprised of heterogenous cell types: DA cells comprise roughly 50% of VTA cells in the rat, fewer than the >70% previously estimated (Margolis et al., 2006); another ∼40% of cells in the VTA are GABAergic. Many of these GABAergic cells inhibit VTADA neurons, and their activation has the opposite effect of DA cell stimulation (Bouarab et al., 2019). In addition to locally inhibiting DA cells, VTAGABA neurons also project to a variety of forebrain sites, including the NAc and lateral habenula (LHb). Many VTAGABA cells can also co-transmit glutamate (Root et al., 2014). Additionally, many NAc-projecting midbrain DA cells co-transmit glutamate, and some can also synthesize and transmit GABA through a non-canonical pathway (Tritsch et al., 2012; Kim et al., 2015). This complexity makes it difficult to definitively disentangle the roles that various cells play in adaptive and maladaptive behaviors.
To date, DA cells have typically been differentiated based on output site. For example, Lammel et al. (2008) injected fluorescent microspheres into different forebrain sites and showed that the DA cells in the midbrain that took up the microspheres were largely distinct, as these cell populations differed in their expression of dopamine transporter, DAT, and in their electrophysiological properties. They later showed that these cells were differentially modulated by experience, as the synapses onto some cells and not others were modulated by either a cocaine (rewarding) or formalin (aversive) experience (Lammel et al., 2011). These results suggested that these cells are integrated into separate circuits that are differentially involved in either reward or aversion learning. The same investigators then showed that VTADA cells projecting to the NAc preferentially received inputs from the laterodorsal tegmentum (LDT) and signaled reward, whereas VTADA cells projecting to the mPFC preferentially received inputs from the LHb and signaled aversion (Lammel et al., 2012). These studies provided a simplified framework through which VTADA neurons could encode both reward and aversion-related signals through separate forebrain projections. Subsequent studies have largely supported this framework, with some modifications. We, therefore, wanted to explore the global anatomical organization of these cells and examine how connectivity logic may help to explain the roles different DA cells play in behavior. As midbrain DA cells have been shown to receive direct monosynaptic inputs from over 100 anatomically defined brain regions (Watabe-Uchida et al., 2012), our goal has been to create comprehensive input-output connectivity maps of discrete DA populations to compare the inputs and outputs of these cells.
To unambiguously define input-output relationships of midbrain DA cells, we developed an intersectional viral-genetic method to tag cells defined by both gene expression and output site, termed cell type-specific Tracing the Relationship of Inputs and Outputs (cTRIO) (Beier et al., 2015; Schwarz et al., 2015). In our initial study, we characterized the input-output relationships of VTADA cells projecting to the nucleus accumbens (NAcMed and NAcLat), medial prefrontal cortex (mPFC), and Amygdala (Beier et al., 2015). cTRIO revealed separate sub-circuits centered on midbrain DA cells that had biased inputs and discrete outputs. We then performed a more detailed characterization of the connectivity relationships of these populations (Beier et al., 2019), finding that the spatial location of starter cells in the VTA was the main determinant of the inputs that each population received while the neurotransmitters that the cells released did not strongly influence input patterns. However, relating the center of mass (COM) of “starter” neurons that initiate RABV tracing to input fraction using a simple linear regression only explained significant variance for about half of the input sites examined, suggesting that this level of analysis was not sufficient to explain the full complexity of input patterning to the VTA. Quantitative techniques have been adopted in other fields to reveal patterns in high dimensional data. In this study we aim to introduce such techniques to neural circuit mapping. We revisit previously published datasets describing the inputs and outputs of VTADA cells and find new patterns and rules underlying their connectivity.
Results
VTADA Neurons Segregate by Projection Condition With Characteristic Output Patterns
Lammel et al. (2008) first used retrobead injections into different forebrain regions in the mouse to show that DA cells projecting to different forebrain sites were physically located in different domains of the VTA or SNc. These results suggested that DA cells largely project to one forebrain site and not others. Recently, we used a more sensitive method that enabled brain-wide analysis of the entire axonal arbor of each DA cell subpopulation to show that each cell population in fact does send collaterals to other brain sites, but that the collateralization patterns are largely unique for each subpopulation, and thus the overall projection pattern of each population is largely distinct (Beier et al., 2015, 2019). We also were the first to perform brain-wide input mapping analysis from projection-defined DA populations in the VTA and the adjacent SNc (Beier et al., 2015, 2019; Lerner et al., 2015; Menegas et al., 2015). In contrast to the largely discrete output patterns of these cells, we and others observed that midbrain DA cells receive quantitatively similar inputs from most brain regions, with several biases in the contributions of these inputs onto defined DA cell types. These input biases between conditions may influence the differential role these cells play in subsequent behavioral output, for example in reinforcement behavior (Beier et al., 2015). Given that we have collected whole-brain quantitative datasets of the inputs and outputs of VTADA→NAcMed, VTADA→NAcLat, VTADA→mPFC, and VTADA→Amygdala cells, we wanted to perform a more in-depth analysis to identify factors that differentiated the inputs and outputs of different DA cell types. We previously performed hierarchical clustering on bootstrapped data and demonstrated that VTADA cells projecting to NAcMed, NAcLat, mPFC, or Amygdala clustered separately based on their output projections to 10 forebrain sites (Beier et al., 2019), indicating that their global output patterns were distinct. We also demonstrated the existence of four groups of output sites with high levels of covariance in our dataset, suggesting that each set of output regions may be preferentially targeted by one DA cell population. However, we did not rigorously identify how these conditions differed and which output sites most contributed to differentiating the projection pattern of each DA cell population.
To explore this dataset in greater detail, we first used Principal Component Analysis (PCA) to dimensionally reduce the output data (Figures 1A,B). The output data consist of 18 brain samples from 4 different output-defined conditions (n = 5 for NAcMed and mPFC; n = 4 for NAcLat and Amygdala). Each sample has 10 measurements, one for each of the output regions quantified. PCA is a linear dimensionality reduction technique that finds a lower dimensional representation of the data that maximizes variance for each principal component (PC). The first PC is a linear combination of the feature space that leads to the highest degree of variance in the data. Each component after makes the same optimization with the remaining dimensions. We found that three components are sufficient to explain ∼70% of the variance in the output data, indicating that these data have a relatively simple structure (Figure 1C). PC1 separates VTADA→NAcLat cells, PC2 separates VTADA→NAcMed cells, PC3 separates VTADA→Amygdala cells, and a combination of PC2 and PC3 separates VTADA→mPFC cells (Figures 1D,E). Thus, three PCs were sufficient to separate each condition.
Figure 1. VTADA outputs are organized by four core projections. (A) Schematic for axonal arborization experiments. Viral injections were performed in DAT-Cre mice to label collaterals to VTADA neurons projecting to a specified target. (B) Collaterals of VTA projections to the NAcMed, NAcLat, mPFC, and Amygdala were quantified in 10 brain regions across 18 mice. The NAcLat and its major collaterals are highlighted. (C) Cumulative explained variance from each principal component. (D) Brains are plotted in PCA space for the 1st and 2nd components, colored by projection. (E) Brains are plotted in PCA space for the 2nd and 3rd components, colored by projection. (F) Heatmap of each output region’s contribution to the first three principal components. (G) Brains are plotted in UMAP space, colored by projection. (H) Output regions are plotted in UMAP space, embedded with respect to z-scores across mouse brains. Clusters represent outputs with similar patterns of variation across the cohort.
Next, we wanted to explore how each output region contributed to each PC. For example, PC1, which separated VTADA→NAcLat cells, is driven by NAcLat, nucleus accumbens core (NAcCore), dorsomedial striatum (DMS), and dorsolateral striatum (DLS; Figure 1F). The finding that the NAcLat as an output site helps to differentiate VTADA→NAcLat cells is consistent with the biased projections of each midbrain DA cell population. Additionally, the contribution of other regions in the striatum (except for NAcMed) is consistent with the overall arborization pattern of these cells (Beier et al., 2015). This cell population had the most distinct overall arborization pattern and thus positive weights of these four regions were sufficient to differentiate it. PC2, which separates VTADA→NAcMed cells, is primarily made up of the NAcMed, with smaller contributions from the NAcCore and negative contributions from the mPFC, bed nucleus of the stria terminalis (BNST), and central amygdala (CeA). These negative contributions mean that VTADA→NAcMed cells do not prominently project to the mPFC, BNST, or CeA. Lastly, PC3, which separated VTADA→Amygdala cells, is made up of positive contributions from the ventral pallidum (VP), BNST, and CeA, and negative contributions from the mPFC and septum. The overall arborization patterns of NAcMed-, mPFC-, and Amygdala-projecting VTADA cells are more similar to one another than to NAcLat-projecting VTADA cells (Beier et al., 2019); thus in PC2, the negative contributions from the mPFC, CeA, and BNST differentiate NAcMed-projectors from Amygdala- and mPFC-projectors, and in PC3, the negative contributions from the mPFC and septum, which are the brain regions most enhanced in the output targets of VTADA→mPFC cells, differentiate VTADA→mPFC and VTADA→Amygdala cells.
While PCA is useful due to its interpretability, Uniform Manifold Approximation and Projection (UMAP) is better optimized for finding clusters in high dimensional data. Indeed, we find it is much more effective at clustering conditions by output site (Figure 1G; McInnes et al., 2018). As UMAP uses non-linear transformations to achieve clustering, it does not provide us the same detailed information about which output regions are differentiating these clusters. However, we can compute the transpose of the output data and take the z-score to look at how output scores per region vary across samples. Z-scoring normalizes the data such that high and low count regions that have the same variance will have similar values. We used UMAP on these z-scores and found two clusters of output sites with similar variance (Figure 1H). The bottom left cluster contains the four regions that show up in PC1: NAcLat, NAcCore, DMS, and DLS. These data provide confirmation that these four regions vary as a module across these four conditions and serve as a common set of brain sites targeted by the same cell population (VTADA→NAcLat) whereas the other three cell populations share more overlap in their overall projection patterns. This visualization serves as a complement to previous analysis of these data, where hierarchical clustering of the output regions’ covariances found the same organization, highlighting both the robustness of this result and these methods.
To ensure these results were not biased by outputs to the injected projection sites, we removed the projection sites from the output counts and performed the same analysis as before on just the collaterals. We largely see the same clustering behaviors as before (Supplementary Figure 1). The main difference is that the VTADA→NAcMed and VTADA→mPFC brains are harder to separate (Supplementary Figures 1B,C,E). Previously, PC2–now PC3–separated these two conditions the strongest (Figure 1D). This principal component previously had large contributions from three of the projection targets, so it is not surprising that the differences between these conditions are weakened along with the principal component (Figure 1F and Supplementary Figure 1D). Altogether, this analysis confirms that the clustering of projection conditions does not completely depend on including the main projection targets.
VTADA Neuron Inputs Do Not Cluster as Cleanly by Projection Site
We and others used intersectional viral-genetic methods to map global inputs to output-defined DA cells (Beier et al., 2015; Lerner et al., 2015; Menegas et al., 2015). While the exact relationships of inputs and outputs varied slightly between different studies, the common finding was that different DA cell populations largely shared common input patterns, with some quantitative differences. We more recently performed a comprehensive mapping of input-output relationships of different cell types in the VTA and reported that (1) the spatial location of cells in the VTA explained a significant amount of variation between conditions for about half of the input sites, (2) cell type did not explain much variation in the inputs between cell populations, and (3) the projection site explained about as much input variation as did spatial position of starter cells in the VTA (Beier et al., 2019). To account for neurons that co-release multiple neurotransmitters, for example glutamate and dopamine, we included the percentage of starter cell immunostaining for tyrosine hydroxylase (TH), a marker of DA neurons, in our linear regression analysis and found it had very little predictive value compared to spatial location (Beier et al., 2019). These observations suggested that the quantitative contribution of inputs a given population of cells receives depends heavily on the physical location of the starter cells in the brain, but not on the identify of what neurotransmitters (e.g., DA, GABA, glutamate) these starter cells release.
Here we used PCA and UMAP to dimensionally reduce and explore patterns in the input data. These data consist of 76 brains with counts across 22 input regions (Beier et al., 2019). These brains cover a variety of cTRIO and TRIO conditions as well as non-output-defined tracing, resulting in a mix of output and cell-type specifications (Figures 2A,B). A PCA analysis of these data found that three components explained only about 40% of the variance (Figure 2C). This is rather low compared to the output data, even considering the difference in dimensionality, and implies that this dataset is more complex. In the PCA embedding, cell types defined by Cre expression (DAT-Cre, GAD2-Cre, vGluT2-Cre, no Cre) mix together but cells projecting to a common output target do show some organization (Figures 2D,E and Supplementary Figure 2). For example, VTA→NAcLat cells have more positive values in the 1st PC and more negative values in the 2nd PC (Figure 2E). These coordinates reflect higher contributions from brain regions in the basal ganglia which include the NAc, dorsal striatum (DStr), and global pallidus external segment (GPe), as well as lower contributions from the VP and preoptic area (PO) (Figure 2F). Notably, the non-output-defined condition is most similar to the VTA→NAcLat condition, which is expected given that VTA→NAcLat cells comprise the majority of cells in the VTA (Beier et al., 2015).
Figure 2. Clusters of VTA inputs revealed by dimensional reduction. (A) Schematic for RABV input labeling experiments. DAT-, GAD-, vGlut2-Cre, and non-Cre-expressing mice were used to identify specific (or non-specific) VTA cell types. Injections of CAV were used to define output sites. (B) Input labeling experiments provided maps of inputs to VTA cells for a combination of different cell-type and projection specifications. Cohort includes 76 brains and 22 input regions counted. (C) Cumulative explained variance from each principal component. (D) Brains are plotted in PCA space for the 1st and 2nd components, colored by cell type. (E) Brains are plotted in PCA space for the 1st and 2nd components, colored by projection. (F) Heatmap of each input region’s contribution to the first five principal components. (G) Brains are plotted in UMAP space, colored by cell type. (H) Brains are plotted in UMAP space, colored by projection. (I) Input regions are plotted in UMAP space, embedded with respect to z-scores across mouse brains. Clusters represent inputs with similar patterns of variation across the cohort.
We next used UMAP to look for any additional clustering behavior between the inputs mapped in different brains in order to assess the similarities and differences between conditions (Figures 2G,H). When defining conditions by Cre expression (DAT-Cre, GAD2-Cre, vGluT2-Cre, no Cre), there are some local neighborhoods within the same conditions, but none are very well-separated into clusters. However, when defining conditions based on output site, the VTA→NAcLat conditions segregate relatively well (Figure 2H). These results are consistent with our previously published analysis (Beier et al., 2019). We then performed a UMAP analysis on the input region z-scores to identify regions with similar variation across conditions (Figure 2I). We found one cluster (cluster 1) made up of inputs from the NAc, DStr, GPe, and cortex. Almost all these regions follow a pattern of contributing positively to the 1st PC and negatively to the 2nd PC (Figure 2F). Thus, these regions provide a stronger fractional innervation to VTA→NAcLat cells than other VTA cells, as observed previously (Beier et al., 2015, 2019). In addition to cluster 1, we observed two other clusters of inputs; one included the CeA, parabrachial nucleus (PBN), zona incerta (ZI), entopeduncular nucleus (EP), and deep cerebellar nuclei (DCN; cluster 2), while the other included all the other regions: VP, PO, LDT, BNST, dorsal raphe (DR), medial habenula (MHb), lateral habenula (LHb), paraventricular nucleus of the hypothalamus (PVH), extended amygdala (EAM), lateral hypothalamus (LH), and septum (cluster 3). These clusters were not readily apparent in our previous analyses of our RABV tracing data, suggesting that there may be additional organization in the input patterns that we had overlooked previously.
Lateral or Medial Biases of Starter Cells Accounts for Some but Not All VTADA Input-Output Variation
We previously analyzed the spatial influence of starter neurons in the VTA on the fractional contribution of inputs by using a linear regression test with the medial-lateral and dorsal-ventral coordinates of the starter cell center of mass (COM) (Beier et al., 2019). Since the cells were counted on coronal slices, we do not have nearly as good resolution for the anterior-posterior axis as the ML and DV axes, and for the most part we focus our analyses on these axes. We observed that the medial-lateral coordinate of the COM explained a significant level of variance for about one half of the brain regions across conditions, about the same contribution as the output site and significantly more than the Cre line used to mark starter cells. These results suggested that many inputs to the VTA are biased along the medial/lateral axis in their projections to the VTA, and that the location of the starter cells, as defined by a single point in space, was significantly linked to the fraction of inputs from various brain regions those cells received.
To further explore the spatial organization of VTA inputs, we plotted each sample according to the starter cell COM and colored them according to their PC values, as calculated in Figure 2 (Figures 3A–E). PC1 has an increasing spatial gradient from the medial to the lateral VTA (Figure 3B). This principal component in general is made up of input populations that project more laterally in the VTA, or to the adjacent SNc/substantia nigra pars reticulata (SNr) (Oh et al., 2014; Beier et al., 2019). This result agrees with the previous finding that the medial-lateral coordinate is related to the fractional contribution from about one half of the input sites examined (Beier et al., 2019). Furthermore, we can compare this spatial organization with the location of VTA→NAcLat starter cells (Figure 3F). The VTA→NAcLat cells are biased toward the lateral side of the VTA, same as the +PC1 cell populations. As PC1 captures the most variation across the data, this means that the primary axis of variation in VTA inputs is whether or not the inputs are biased onto VTA→NAcLat cells, and hence whether the starter cells are located laterally within the VTA or not. PC2 has a mild spatial gradient that increases in the dorsal direction (Figure 3C). PC3, on the other hand, does not have much of a clear spatial bias in the medial-lateral or dorsal-ventral axes (Figure 3D). Rather, starter cell populations with +PC3 span the VTA across the two axes, suggesting that a lack of clear spatial bias in the VTA characterizes this PC. A linear regression analysis confirmed these observations: PC1 was found to have a significant slope in the lateral direction and PC2 in the dorsal direction, while other slopes were not found to be significant after correction for multiple comparisons (Table 1).
Figure 3. Spatial location of targeted cells in the VTA influences both inputs and outputs. (A) Context for coronal slice of VTA used in analysis is shown. (B) Brains are plotted by starter cell center of mass (COM), colored by PC1 value. (C) Brains are plotted by starter cell COM, colored by PC2 value. (D) Brains are plotted by starter cell COM, colored by PC3 value. (E) Brains are plotted by starter cell COM, colored by PC4 value. (F) Brains are plotted by starter cell COM, colored by projection specification. Ellipsoids are drawn for each condition and have radii of five standard deviations for both dorsoventral and lateromedial axes. (G) Z-scores of average input and output counts for each projection condition. Inputs are marked with a green down arrow and outputs with a red up arrow. All inputs and outputs quantified are shown. Inputs and outputs are sorted according to the projection in which they receive the highest z-score. Samples include inputs to and outputs from DA cells only. p-values are listed in Table 2.
Our analysis with PCA and UMAP separated VTA→NAcLat cells by inputs, but largely failed to differentiate VTA→NAcMed, VTA→mPFC, or VTA→Amygdala cells from one another. To explore the input-output features most specific to each VTA cell type, we stitched together the average input and output counts for each region. We then took the z-score of these values to see how enriched or diminished connections are for that region compared to the other conditions. For each projection condition, we found a unique set of enriched inputs and outputs (Figure 3G and Supplementary Figure 3). Many of these were found significant, even when corrected for multiple comparisons (Table 2). We observed some evidence for reciprocal connectivity: for example, inputs from NAcLat are enriched onto VTA→NAcLat cells, and inputs from the Amygdala and BNST are enriched onto VTA→Amygdala cells that collateralize principally to the BNST, both of which were found to be significant. However, this was not equally clear for all populations, as the NAcMed input was approximately equal onto VTADA→NAcLat and VTADA→NAcMed cells, and we did not observe a preference for cortical inputs onto VTADA→mPFC cells (Beier et al., 2019), suggesting that while some reciprocal connections may exist in the VTA, they may not be universal for all brain regions (Figure 3G).
The input and output sites enriched onto VTADA→NAcLat cells consist of those previously identified (Beier et al., 2015, 2019) and shown in Figures 1, 2. However, we also found a number of brain sites enriched as inputs to or outputs from VTADA→Amygdala cells that we did not previously identify. These outputs include preferential projections to the Amygdala and BNST, as previously described (Beier et al., 2019), as well as inputs from the CeA, PBN, ZI, PVH, BNST, EAM, DCN, LH, and MHb. Many of these brain regions, including the CeA, PBN, PVH, BNST, and EAM, are in the extended amygdala and are principally involved in stress and anxiety-related behaviors (Bernard and Besson, 1988; Han et al., 2015; Chou et al., 2018; Zhou et al., 2018; Chiang et al., 2019). These same regions are also the strongest positive contributors to PC3 (Figure 2F). Furthermore, the location of starter cell COM with a +PC3 (Figure 3D) most closely mirrored the distribution of VTA→Amygdala cells, which are distributed broadly throughout the VTA with a centroid in approximately the middle of the structure (Figure 3F). These visualizations therefore provide further evidence of the spatial organization of inputs on the VTA that we reported previously, and they also suggest the existence of subpopulations of VTA cells that receive preferential inputs from key brain regions involved in the brain’s stress response.
To explore how starter cell COM and RABV input cells distinguish the various projection conditions, we trained logistic regression models to predict each condition. Logistic regression can be used for multiclass classification, in which a logistic regression model is trained for each condition, and the condition with the highest probability is assigned to a given observation. We used the first five principal components as features representing the inputs to the VTA, to reduce overfitting our dataset and to simplify the model to increase the model’s interpretability. We trained models on the principal components and the starter cell COMs separately, and on both combined. Unsurprisingly, projection conditions already grouped together in the PCA plots were well-predicted by the principal components, for example the VTA→NAcLat and VTA→Amygdala cell populations (Table 3). Additionally, projection conditions that appeared to have a spatial bias achieved higher scores when predicted by COMs, for example the VTA→NAcLat and VTA→mPFC. VTA→NAcMed was predicted greater than chance across each individual set of features. It also ends up with one of the highest prediction scores when both PCs and COMs are considered. This result suggests that a combination of input features and spatial location is needed to encode the identity of this population. Logistic regression models are highly interpretable, as each feature is assigned a coefficient which models the increased or decreased likelihood of a label given a higher or lower value of the feature. These coefficients largely recapitulate observations we have already made. For example, PC1 is useful for predicting VTA→NAcLat, PC3 is useful for predicting VTA→Amygdala (Supplementary Figure 4A), and the medial-lateral coordinate is useful for predicting VTA→NAcLat and VTA→mPFC populations (Supplementary Figure 4B). In the model incorporating both PCs and COMs, we found that a combination of PC1 with the dorsal and anterior coordinates can predict the VTA→NAcMed condition (Supplementary Figure 4C). These analyses imply that while the most striking aspect of VTA input connectivity is the presence of spatial gradients, there may be some interesting connectivity relationships that are not uniquely delineated by a medial-lateral or dorsal-ventral gradient.
Table 3. Logistic regression scores predicting projection conditions from starter cell location and principal components.
Spatial Analysis of Allen Mouse Connectivity Atlas Data Finds Archetypal Projection Patterns to the Ventral Tegmental Area
Using publicly available data from the Allen Mouse Brain Connectivity Atlas, we had previously investigated the spatial organization of projections to the VTA. We had found that the relative projection ratio across some inputs varied across the lateral-medial axis and that was related to the relative ratio of inputs received by different VTADA cell populations, linking the density of projections from a given input site to RABV-labeled inputs (Beier et al., 2019). However, this analysis was done with a limited set of brain regions, focused only on the medial-lateral gradient along the VTA, and only explored the link between input density and DA neurons in the VTA. Here we wanted to explore this question with a broader perspective and assess the relationship between projections throughout the ventral midbrain from each of the input sites that we quantified in our previous studies. We wanted to assess globally how closely spatial projection patterns throughout the ventral midbrain relate to RABV input mapping datasets.
For each input region, we selected three experiments from the Allen Mouse Brain Connectivity Atlas and took the average projection into the ventral midbrain. The NAcLat was excluded as an input site, as the Atlas does not contain injections into this site. We also used injections in the infralimbic/prelimbic (IL/PL) and orbitofrontal cortex (Orb) to represent two distinct regions of the anterior cortex. We then mapped these projections onto a coronal slice of the ventral midbrain to facilitate visualization. We used an extended spatial domain that allowed us to assess projections within the VTA as well as to adjacent structures. As before, we used PCA to reduce the dimensions of this space. The first principal component is a weighted combination of the projections from the 22 input sites that maximizes variance across the ventral midbrain window. This weighted combination can then be visualized in the original spatial dimensions. By comparing the PC projection patterns with the region contributions to the PCs (Figure 4A), we can see what the archetypal projection patterns are and how input region projections are similar or dissimilar. For example, we computed and plotted the archetypal projection of four regions that provide preferential inputs onto VTA→NAcLat cells: The NAcMed, NAcCore, DStr, and GPe. This archetype shows a projection to the lateral VTA, where the VTA→NAcLat cells are located, as expected (Figure 4B).
Figure 4. PCA of data from the Allen Mouse Brain Connectivity Atlas reveals projection archetypes across different input groups. (A) Schematic of analysis. Sample projections to the VTA were pulled from the Allen Mouse Connectivity Atlas for 22 input regions. Pixel values representing projection density were pulled from the coronal slice for each region to generate a table of 2,058 pixel coordinates x 22 regions. PCA found a linear combination of input regions that maximizes variation across the pixels. Each pixel is then visualized on a coronal slice of the VTA with its PC value, revealing the most common projection portrait. (B) Brain regions that preferentially provide inputs to VTA→NAcLat cells were selected as a test case to see if they might have a common projection pattern. Sample projections from these regions to the VTA are shown from the Allen data, along with their average projection portrait. (C) Heatmap of each input region’s contribution to the first four principal components. (D) Positive and negative contributing regions to each principal component are summed according to their sign in order to generate the principal component projection portrait. (E–L) Example projection portraits are shown for the major positive and negative contributing regions for each of the first four principal components. The projection archetypes corresponding with these principal components are shown in panels (F,H,J,L).
PC1 includes projections that relatively uniformly innervate the entire VTA, with little bias (Figures 4C–E). This marks the +PC1 pixels, and thus we would expect the regions with positive contributions to this PC to have higher projections over this space. Some example input sites with this pattern include the PO, BNST, EAM, PVH, and LH (Figure 4C). Interestingly, these regions all fall within cluster 3 of our RABV data (Figure 2I) and have inputs that are enriched onto VTA→Amygdala cells (Figure 3G). Another characteristic of PC1 is that its negative values are ventral and lateral to the VTA. We therefore expect -PC1 pixels to have higher projections from the -PC1 regions and lower projections from the +PC1 regions. The DStr and GPe both do not project much to the VTA directly, but they do have strong projections lateral to the VTA (Figure 4E). Likewise, the +PC1 regions – PO, BNST, EAM, PVH, and LH – tend not to project at all to this area lateral and ventral to the VTA, but rather project broadly throughout the VTA. Thus, the primary axis of variation that PC1 seems to capture contains the regions that are projecting with little bias to the VTA, and those that are projecting lateral and ventral to the VTA (Figures 4D,F). This projection primarily innervates VTA→Amygdala cells that are distributed throughout the VTA (Figure 3F).
+PC2 receives the strongest weights from the septum, LDT, PO, MHb, and LHb, while -PC2 is composed primarily of the VP, GPe, CeA, PBN, and DCN (Figure 4C). This PC appears to have a medial/ventral bias, as the brain regions with the strongest weights project primarily to the medial and ventral portion of the VTA, while the GPe, CeA, PBN, and DCN all project laterally/dorsally (Figures 4G–H). Given that the VTA→NAcMed cells are located the furthest in the ventromedial portion of the VTA (Figure 3F), we would expect that VTADA→NAcMed cells receive preferential input from these brain regions. Indeed, the PO, LHb, and LDT preferentially connect to VTADA→NAcMed cells, while the septum connects approximately equally to VTA→NAcMed and VTA→NAcLat cells (Figure 3G). Notably, -PC2 receives a relatively strong negative weight from the DR (Figure 4C).
+PC3 is primarily composed of inputs from the basal ganglia (NAcMed, NAcCore, DStr, GPe), and the two cortical regions, IL/PL and Orb, while –PC3 is composed primarily of the EP, ZI, and DCN (Figure 4C). +PC3 corresponds to inputs that project ventrolateral to the VTA (Figures 4I,J), and primarily innervate VTA→NAcLat cells (Figures 2H, 3G). The brain regions that contribute to -PC3 project dorsal to the VTA.
+PC4 has strong contributions from the LHb, MHb, and DR, while -PC4 is primarily composed of the septum, PVH, and ZI (Figure 4C). Of the positive contributors, the LHb and MHb are also present in +PC2 as they broadly project to the medial VTA, which also is where VTADA→NAcMed cells are located. In contrast, the DR contributes mostly to +PC4 and +PC1. This combination of PCs describes the DR’s broad projection to the dorsal VTA with a strong bias to the dorsomedial VTA. +PC4’s archetypal projection is also to the dorsomedial VTA (Figures 4K,L), a region that most prominently includes VTADA→mPFC neurons (Figure 3F). Accordingly, the DR preferentially innervates VTADA→mPFC cells (Figure 3G) and thus is the main input brain region that differentiates VTADA→NAcMed from VTADA→mPFC cells.
These data demonstrate that the first four PCs using data from the Allen Mouse Brain Connectivity Atlas correspond to the input biases of the four VTA populations that we examined here. Therefore, our conclusion is that we can recapitulate the principal differences in inputs to different cell populations in the VTA solely by identifying the archetypal projections into the VTA using open-source data from the Allen Institute.
Patterns of Input Innervation Are Conserved Between RABV Mapping and Allen Projection Data
Our analysis of the Allen’s projection data suggests that we can recapitulate the variance in RABV mapping experiments by decomposing the Allen’s projection data into principal components. As we previously mentioned, UMAP is better optimized for identifying the relationship between variables in high-dimensional space. We therefore wanted to assess the relationship between input sites to the VTA, defined either through their covariance in our RABV mapping data or spatial similarity in Allen projection data. We demonstrated earlier that the input sites in RABV mapping experiments segregate into three clusters (Figures 2H, 5A). As UMAP embeddings can be somewhat stochastic because they rely on initial seeding conditions, we computed the distance between points relative to the maximum distance between any two points in each embedding, over 20 embeddings, then averaged across all embeddings (Figures 5A–D). In both cases, we identified three clusters of brain regions. Cluster 1 contained perfect correspondence between RABV and Allen datasets, and included regions in the frontal cortex (either anterior cortex or both the IL/PL and Orb), NAcMed, NAcCore, DStr, and GPe (the NAcLat was not included in the Allen dataset). While clusters 2 and 3 in the RABV and Allen datasets did not perfectly align, they did have similar structures. RABV cluster 2 included the CeA, EP, ZI, PBN, and DCN. These regions also clustered together in the Allen data, but were joined by the VP, EAM, LHb, MHb, and DR that split from cluster 3. The remainder of the brain regions (septum, BNST, PO, PVH, LH, LDT) are in cluster 3 for both datasets. Notably, the distance between clusters 2 and 3 in the Allen data is much smaller than to cluster 1 and thus, Allen clusters 2 and 3 have a more similar projection profile to each other than to cluster 1. Overall, we observed substantial similarity between RABV and Allen datasets, suggesting that the covariance in input labeling using RABV mapping can be largely attributed to differences in axonal innervation from input sites and thus, the information can be gleaned through parsing open-source projection datasets. To demonstrate that these associations were not attributed to chance, we scrambled the association of the COM with fraction of inputs labeled in the RABV dataset, or the order of z-scores for pixel intensity for each input site. UMAP was unable to identify clusters or significant levels of co-variance in either scrambled dataset (Figures 5E–H), demonstrating that the high covariance between selected input brain sites is highly significant and similar between both RABV and Allen datasets.
Figure 5. UMAP dimensional reduction of RABV and Allen data reveal common clusters of VTA inputs. (A) Input regions are plotted in UMAP space, embedded with respect to z-scores from the RABV input mapping data. Clusters represent inputs with similar patterns of variation across the cohort. (B) Input regions are plotted in UMAP space, embedded with respect to z-scores across pixels in the Allen data. (C) Heatmap of pairwise distances (averaged across 20 UMAP embeddings) for the RABV input data. Regions are grouped according to hierarchical clusters. Clusters are highlighted to match the clusters above in the UMAP plot; they are also annotated according to which principal components to which the regions contribute. Regions are grouped to line up with the Allen clusters. (D) Heatmap of pairwise distances (averaged across 20 UMAP embeddings) for the Allen input data. (E–H) Same plots as panels (A–D), but UMAP was run on scrambled data. For each region, z-score values were scrambled across mouse brains for RABV data, or pixel coordinate for the Allen data.
Discussion
Our detailed observations of input and output datasets of VTA cells revealed several interesting findings. The largest contributor to variance in our input tracing dataset is the medial-lateral gradient in the VTA, which differentiates the VTADA→NAcLat cells from the other three subpopulations. The VTADA→NAcLat cells also had the most distinct collateralization pattern of the four VTADA subpopulations studied. These results confirm our previous analyses (Beier et al., 2015, 2019). However, here we were able to further differentiate the VTADA projections to the NAcMed, mPFC, and Amygdala by inputs as well as outputs with an integrated spatial analysis of several high dimensional datasets. By exploring the z-scores of input counts in different brain regions, we found that the PO, LHb, VP, and LDT inputs were elevated for VTADA→NAcMed cells, DR and EP inputs are elevated for VTADA→mPFC cells, and CeA, PBN, ZI, PVH, BNST, EAM, DCN, LH, and MHb inputs are elevated for VTADA→Amygdala cells (Figure 6). The z-score normalization allowed us to find elevations in inputs and outputs whose fractional counts were smaller in magnitude than other regions. Logistic regression models demonstrated how RABV inputs and starter cell location contributed to differentiating these conditions. Investigation of the projection patterns of inputs to the VTA revealed that VTA input populations can be differentiated into several projection archetypes—projections to the VTA broadly, projections to regions around but not including the VTA, and projections to subdomains of the VTA. Lastly, we showed that the patterns of these projection archetypes mirror input differences to VTA subpopulations. These data together demonstrate that the location of different DA cell populations determines the quantitative contribution from different inputs and, thus, the signals that these cells receive.
Figure 6. Summary of findings. (A) All inputs and outputs we mapped in our experiments are shown, grouped in their clusters from UMAP analysis. (B–E) For each projection, elevated inputs and outputs are shown (according to Figure 3G). (F) Inputs or outputs significantly elevated in any projection condition are shown grouped by projection. p-values are corrected for multiple comparisons using a Bonferroni correction.
Comprehensive Quantitative Analysis Enables Differentiation of Four VTADA Cell Populations
Our goal in this study was to identify input and output factors that differentiate VTADA neurons. Previous studies have shown that subpopulations of VTADA cells differ in their forebrain projections, electrophysiological properties, and behavioral functions (Lammel et al., 2008, 2011; Kim et al., 2016). Comprehensive input-output mapping studies from us and several other groups suggested that DA cell populations received inputs from the same brain regions in quantitatively similar proportions, with some biases. Of note, we previously found that VTADA cells projecting to the NAcLat received more inputs from the striatum and globus pallidus external segment than the other VTADA cells that we examined (Beier et al., 2015, 2019). This is likely because the VTADA→NAcLat cells are located the most laterally within the VTA, and most of the basal ganglia inputs project most strongly to the adjacent SNr (Beier et al., 2015, 2019). While our previous analyses comparing the fractional contribution from 22 input sites to 4 different VTADA cell populations were able to differentiate VTADA→NAcLat cells, VTADA cells projecting to the NAcMed, mPFC, or Amygdala appeared highly similar. Here, by exploring the z-scored input and output data, we identified sets of inputs and outputs elevated for each cell type.
First, we observed that some VTADA cell types may be preferentially reciprocally connected, including the predominant VTADA→NAcLat subpopulation. While this was not the case for all our observed cell populations, as VTADA→mPFC cells received fewer mPFC inputs than did VTADA→NAcLat cells (Beier et al., 2019), it does suggest that the hypothesis of reciprocal connectivity cannot entirely be discarded. A model of reciprocal connectivity was proposed long ago (Swanson, 1982; Alheid and Heimer, 1988; Zahm, 2006; Yetnikoff et al., 2014), but a recent viral-genetic mapping study failed to find evidence for this reciprocal connectivity in the VTA (Menegas et al., 2015). By comparing the average percent of inputs arising from individual identified brain regions across animals, we also failed to observe statistically significant evidence of reciprocity (Beier et al., 2015, 2019). However, our z-scored analysis gave better visualizations of the lower fractional inputs, supporting the possibility that some preferential reciprocal connectivity may exist in the VTA. This observation argues that a detailed and higher powered investigation into reciprocal connections in RABV mapping datasets may be necessary to reveal the true connectivity relationships in the brain. It is also possible that reciprocal connections may be more present in certain structures and projections than others. It is however noteworthy that in order for these analyses to achieve significance, comparatively larger datasets like ours may be needed, whereas the majority of RABV mapping studies use only a handful of animals (typically 6 or fewer) per condition.
Second, input regions that are integrated into common circuits and have been implicated in common behavioral functions tend to provide preferential innervation onto one particular VTADA cell type. For example, striatal and globus pallidus inputs that comprise key components of the basal ganglia preferentially provide input to VTADA→NAcLat cells that project back into the striatum. We also found that several regions in the extended amygdala that have been implicated in stress-related behaviors preferentially provide input onto VTADA→Amygdala cells. Several studies have been published in the past few years about the role of VTADA→Amygdala cells in reward and aversion learning, fear learning, as well as anxiety (Lutas et al., 2019; Lin et al., 2020; Tang et al., 2020). The CeA, PBN, ZI, PVH, BNST, and EAM all play key roles in aversion and anxiety behaviors (Bernard and Besson, 1988; Han et al., 2015; Chou et al., 2018; Zhou et al., 2018; Chiang et al., 2019), and interestingly, all contain neurons that express CRF, a neuropeptide that modulates DA cells in the midbrain and DA responses in downstream structures (Ungless et al., 2003; Wanat et al., 2008; Lemos et al., 2012). While each of these brain regions participates in behaviors other than fear learning and anxiety, it is interesting that each of these regions, which are distributed throughout the brain, has a similar projection pattern in the VTA. This suggests that these regions may work in concert to facilitate behavioral outcomes associated with stress and aversion/fear learning through VTADA→Amygdala cells. The preferential inputs from basal ganglia regions to VTADA→NAcLat cells and stress-related inputs to VTADA→Amygdala cells is likely due to the fact that inputs with common functions form particular projection archetypes. This means that inputs with a similar function may share a set of factors that govern their connectivity, an idea that we explore further below.
Third, variance in our input and output data can be explained by differences in the location of starter cells within the VTA. The input regions that provided preferential innervation to particular VTADA cell populations preferentially innervated regions of the VTA that matched the spatial location or distribution of the corresponding VTADA cell type. These results reinforce our previous conclusion that organization within the VTA is largely spatial, with cell type providing little influence on the inputs that those cells receive (Beier et al., 2019). However, they also highlight that additional dimensions of spatial pattern exist within the VTA beyond the medial-lateral gradient that we identified earlier and that these patterns underlie differences in inputs that each cell population receives. For example, we found that while VTADA→NAcMed and VTADA→mPFC cells were both located medially in the VTA, inputs that were ventrally biased in the medial VTA preferred VTADA→NAcMed cells, and inputs that were dorsally biased preferred VTADA→mPFC cells. These results indicate that these spatial preferences matched the relative ventral or dorsal bias of these VTADA subpopulations, respectively (Figure 3). We also found that VTADA→Amygdala cells had the broadest medial-lateral distribution and were located the most centrally in the VTA. Inputs to these cells also lacked clear medial-lateral biases. Altogether, our conclusions in this study are entirely consistent with our previous conclusions, while also extending them by identifying more subtle differences in the location of DA cells within the VTA as well as the location of input projections throughout the VTA.
Specificity of RABV Transmission and Implications for Rules Governing Connectivity
As we noted above, comparing the averages between the percentage of inputs received from different brain regions across animals was sufficient only to reveal the largest differences between conditions. In our dataset, this was sufficient to differentiate VTADA→NAcLat cells from the rest, but insufficient to parse apart VTADA→NAcMed, VTADA→mPFC, and VTADA→Amygdala cells from one another. Notably, the method of comparing averages across animals is the standard method of analysis of RABV mapping datasets. Beyond being the simplest approach to analyzing these data, most mapping datasets likely contain too few samples to effectively perform PCA or UMAP analyses of their data. This is likely because RABV mapping experiments are labor intensive and typically not performed on the scale that ours was. In the case of our 76-brain dataset, it took years of viral generation, mouse breeding, stereotaxic injection, brain sectioning, imaging, and manual quantification to obtain it. That it is currently a one-of-a-kind dataset has provided a unique opportunity to explore connectivity within the VTA as well as assess the merit of different analyses of RABV mapping datasets.
It is also worth assessing what RABV mapping studies can tell us and what they cannot. The prevailing viewpoint among those who use RABV circuit mapping is that RABV transmits between neurons in a synapse-specific fashion. We have argued that the evidence for synaptic-exclusive transmission of RABV is weak (Beier, 2019, 2021; Rogers and Beier, 2021). The fact that the results from RABV mapping experiments such as we conducted in the VTA can be largely recapitulated only from anterograde mapping experiments such as those from the Allen Brain Institute, notably ones that do not differentiate axons of passage from axons that functionally innervate cells in the VTA, could be an additional argument that RABV can spread non-specifically. However, we previously performed an experiment in the VTA that showed that RABV transmission from one cell to another is quite different from direct injection of RABV (Beier et al., 2019). This result was also seen in a similar set of experiments carried out in the DMS (Wall et al., 2013). That a quantitatively different set of inputs was obtained from tracing experiments utilizing different modes of RABV administration provides a strong argument that one-step RABV mapping is not equivalent to directly administering RABV into the brain.
Our observation thus is that RABV mapping does not reveal cell type-specific connectivity, as defined by spatially intermingled cells defined by neurochemical identity. In assessing the implications of this finding, it is worthwhile to consider our state of knowledge regarding spatial patterning and mechanisms that govern connectivity between neurons in the brain. Spatial patterning within the brain during development has been extensively studied, and the roles of families of patterning molecules such as ephrins, netrins, slits, and semaphorins have been well documented (Yu and Bargmann, 2001; Bashaw and Klein, 2010). Other surface proteins such as Teneurins, Tolls, DIPs, and Dprs may play roles in regulating connectivity at the cellular level (Hong et al., 2012; Ward et al., 2015; Barish et al., 2018). However, our understanding of the exact roles that these surface proteins play in dictating whether or not two neurons form connections, particularly in the rodent brain, is limited. It is important to note that we do not know the biases that RABV may have for spread to particular cell types in the brain, and it is possible that these biases are similar for all cell types and outweigh any actual differences in connectivity. Advances in RABV mapping technology, for example the development of a genetically barcoded RABV, may enable the exploration of the role that classes of surface proteins may play in defining connections between neurons (Saunders et al., 2021). However, it is also possible that the lack of cell type-specific connectivity revealed by RABV may be biologically meaningful. Such random connectivity patterns would then have implications for how connections at both the macro and micro-scales influence circuit output and animal behavior.
Future Directions
We and others have extensively mapped inputs and outputs of cells in the ventral midbrain and have detailed the role of spatial location in determining input patterns between different cell types (Beier et al., 2015, 2019; Lerner et al., 2015; Menegas et al., 2015). Our analysis in this study extends our previous observations. One next step is to determine if this finding applies to brain regions outside of the VTA. The observation that spatially intermingled cell populations tend to receive inputs from the same brain regions in quantitatively similar proportions supports the hypothesis that spatial location is the major determinant of global input patterns, at least as measured by one-step RABV mapping. However, the sources of spatial patterning of inputs and projection archetypes remain unknown. That brain regions sharing a common behavioral role have a similar projection pattern throughout the ventral midbrain suggests that these regions likely follow similar rules of patterning in the ventral midbrain, and this patterning in turn guides their preferential connectivity into particular cell types within the ventral midbrain. The identification of patterning molecules expressed during development and synapse formation through single cell RNA sequencing, for example, would help to elucidate what molecular pathways dictate projection patterns. It would also be interesting to test how ubiquitous this phenomenon of projection archetypes is throughout the brain and if it relates to projection-defined cells in a similar way as in the VTA. If so, the definition of projection archetypes during development along with spatial localization of projection-defined cell types may be one important generator of specificity in circuit connectivity in the brain.
Materials and Methods
RABV Input and Axonal Arborization Output Tracing
Input and output mapping from VTA cells was described previously (Beier et al., 2015, 2019). Briefly, DAT-Cre, GAD2-Cre, vGluT2-Cre, and wild type C57Bl/6 mice were obtained and housed with 12 hour light/dark cycles and food and water ad libitum (Beier et al., 2019). Viral vectors were prepared as previously described (Schwarz et al., 2015). For TRIO experiments, CAV-Cre was injected into an output site, and Cre-dependent AAVs expressing the avian TVA protein as well as the rabies glycoprotein, RABV-G, were injected into the VTA. Two weeks later, EnvA-pseudotyped rabies virus (RABV) was injected into the VTA. These TRIO experiments thus labeled the inputs to VTA neurons with a specified output. We also performed cell-type specific TRIO (cTRIO) experiments. This included injecting a CAV-FLExloxP-Flp into a target output site and Flp-dependent AAVs expressing TVA and RABV-G into the VTA, and EnvA-pseudotyped RABV 2 weeks later. These cTRIO experiments labeled inputs to VTA neurons of a specific cell-type with a specified output. Rabies labeling experiments were also performed to cover conditions without an output target specified.
Axonal arborization experiments labeled the axons of VTA neurons with projections to a specified target. We performed similar CAV and AAV injections to the above, but rather than TVA and RABV-G we expressed a membrane-targeted GFP in targeted cells. This allowed us to view the entire axonal arbor of these cells. After 2 months, animals were perfused with PBS and 4% formaldehyde. For inputs, cells were counted manually using preselected regions. For both inputs and outputs, data were normalized by the total counts in each brain, accounting for differing levels of viral infection. Detailed protocols for input tracing and axon arborization can be found in previous publications (Beier et al., 2015, 2019).
Region Selection
Regions were selected for RABV input and axonal arborization output tracing according to previous publications (Beier et al., 2015, 2019). Notably, for VTA inputs we subdivided the global pallidus into the global pallidus external (GPe) and the entopeduncular nucleus (EP), the rodent equivalent of the GPi. For outputs, we subdivided the dorsal striatum into the dorsal lateral striatum (DLS) and dorsal medial striatum (DMS). Since the DLS does not substantially project to the VTA, and since the divide between the DMS and DLS is somewhat arbitrary, we did not subdivide the DStr for inputs. Here and previously we binned the anterior cortex into a single region. We previously subdivided the cortex into its composite regions, but did not find biased projections onto VTA cells according to cell type or projection (Beier et al., 2019). We did explore some substructures in the Allen Mouse Brain Connectivity Atlas analysis, including the orbital cortex, and the combined infralimbic and prelimbic cortical regions. For the amygdalar regions, we analyzed the central amygdala as an input site. For the projection site, we targeted the CeA, but we were not confident that our injections were completely restricted to this site, and hence we call these amygdala-projecting cells. It is likely that our VTA injections did not substantially induce DA cells located in the retrorubal field (RRF), where some have detected projections to amygdalar structures (Zahm et al., 2011).
Groupings of brain regions are listed below, in alphabetical order:
CeA–central amygdala lateral, medial, and capsular nuclei
Cortex–anterior cingulate cortex (ACC); infralimbic cortex (IL); insular cortex (Ins); motor cortex (MO; anterior portion); orbital cortex (Orb); prelimbic cortex (PL); somatosensory cortex (SS, anterior portion). This is the same composite structure as called the anterior cortex in Beier et al. (2015, 2019).
DR–as defined in Weissbourd et al. (2014).
EAM–anterior amygdaloid area, basomedial amygdala, anterior cortical amygdaloid nucleus, cortex-amygdala transition zone
LDT–laterodorsal tegmental area, dorsomedial tegmental area, dorsal tegmental nucleus, Barrington’s nucleus, ventral tegmental nucleus, subpeduncular tegmental nucleus
PO–medial preoptic area, lateral preoptic area, lateral anterior hypothalamic area, anterior hypothalamic area, striohypothalamic nucleus
Septum–triangular septal nucleus, lateral septum, dorsal fornix, septofimbrial nucleus, medial septum, septohypothalamic nucleus, septohippocampal nucleus, lambdoid septal zone
VP–interstitial nucleus of posterior limb of anterior commissure (IPAC), substantia innominata, horizontal diagonal band, nucleus of the vertical diagonal band
Abbreviations for brain regions made throughout the paper are listed below, in alphabetical order:
BNST–bed nucleus of the stria terminalis
CeA–central amygdala
DCN–deep cerebellar nucleus
DR–dorsal raphe
DStr–dorsal striatum
EAM–extended amygdala
EP–entopeduncular nucleus (GPi)
GPe–globus pallidus (GPe)
LDT–laterodorsal tegmentum
LH–lateral hypothalamus
LHb–lateral habenula
MHb–medial habenula
NAcCore- nucleus accumbens, core
NAcMed–nucleus accumbens, medial shell
NAcLat–nucleus accumbens, lateral shell
PBN–parabrachial nucleus
PO–pre-optic area
PVH–paraventricular hypothalamus
VP–ventral pallidum
VTA–ventral tegmental area
ZI–zona incerta
Dimensional Reduction of Output and RABV Input Data
Principal Component Analysis (PCA) was used to dimensionally reduce both axon arborization output and RABV input data. PCA is a linear dimensional reduction technique that finds the maximal axes of variation through a dataset. Once a PCA embedding is found, each principal component can be unpacked to find out what linear combination of features (output sites or input sites), or weights, comprise it. Input and output counts per brain region were converted to fraction data to account for variation in total number of cells across brains. Fraction data were scaled so that variations in larger regions do not provide oversized contributions to PCA, compared to smaller regions. Analyses were performed in Python using Scikit-learn’s PCA implementation (Pedregosa et al., 2011).
Uniform Manifold Approximation and Projection (UMAP) was used as a non-linear dimensional reduction technique on output and input data. UMAP is better optimized for finding local and global structures in high dimensional data than PCA, but it is far less interpretable. Analyses were performed using the official UMAP library (McInnes et al., 2018). The fractional counts data were z-scored to compare variation in output and input sites across regions with different magnitudes of counts. Z-scored data were dimensionally reduced with UMAP to find clusters of output and input sites with similar patterns of variation. UMAP parameters were tuned manually to optimize stability of clusters.
Regression Analysis of RABV Input Data
Linear regression was used to quantify the relationship of starter cell COM with the RABV input principal components. Slopes returned from the analysis reflect to what degree lateral and dorsal location increase, decrease, or have no effect on principal components. p-values give the probability of these slopes being 0 given the observed data. The statsmodels Python library was used to train these models and examine the slopes (Seabold and Perktold, 2010).
Logistic regression was used to classify the different projection conditions based on the RABV starter cell COMs and the RABV input principal components. To build a model for multiclass classification, we trained a separate logistic regression model to classify each projection condition. When evaluated against a given brain, the model prediction with the highest probability was used. Logistic regression coefficients represent the increased or decreased likelihood of the model prediction given a higher or lower value of a given feature. For example, a positive coefficient for Feature A means the model prediction increases in likelihood for higher values of Feature A and decreases for lower values. A negative coefficient has the opposite relationship; the model prediction increases in likelihood for lower values and decreases for higher values. The higher magnitude of the coefficient, the higher the importance of that feature on the prediction. The Scikit-learn implementation of logistic regression in Python was used for our analysis (Pedregosa et al., 2011).
Principal Component Analysis of Allen Mouse Brain Connectivity Data
For each of the input regions considered in the RABV experiments, we manually selected corresponding samples from the Allen Mouse Brain Connectivity Data. Experiments were selected based on whether or not the experiment contained labeled projections to the ventral midbrain. NAcLat was not included as an input site, as there were no samples that contained injections that were specific to NAcLat that also projected to the ventral midbrain. Cortex was subdivided into the orbital area and the combined infralimbic and prelimbic areas since our original quantification of RABV inputs included a broad spatial domain not encompassed by any single set of injections. The ID and hyperlink of each sample selected is provided in Supplementary Table 1. For each input region, the sample projections to the VTA were averaged together. Projections were sliced into a 42 pixel x 49 pixel rectangle to capture the largest coronal section of the VTA along with some of the surrounding area. PCA was used to find linear combinations of input regions that maximize variation across the pixels of this rectangle. PC values for each pixel were visualized on the original rectangular space to see how this variation is organized spatially within and around the VTA. These spatial projection “archetypes,” revealed by each principal component’s visualization, were compared to the primary regions that comprise them. Allen samples were accessed using the allensdk python library,1 and PCA was performed using Scikit-learn (Pedregosa et al., 2011).
Allen and RABV Input Clustering Comparison
Clustering of input regions was compared between the RABV input data and the Allen Mouse Brain Connectivity Data (Figures 5C,D,G,H). Z-scoring was performed as before on the input data, capturing variation for each input across the samples. Z-scored data were dimensionally reduced with UMAP to find clusters of inputs with similar variations in each dataset. To account for variability in embeddings, we ran these embeddings 20 times for each dataset and took the average relative pairwise distance between each region. These pairwise distances were computed relative to the maximum distance between any two points in each embedding. Regions were hierarchically clustered based on this distance matrix and compared across datasets.
In order to assess how much clustering we might expect in a random dataset with a similar distribution, we shuffled both RABV and Allen datasets. The z-scored input values were shuffled independently for each region across samples. This eliminated any association between input values for each sample across regions. The clustering comparison analysis was repeated as above.
Data Availability Statement
All code and data used to complete the analysis and generate the figures in this paper are publicly available in a repository on GitHub: https://github.com/pderdeyn/vtada-network.
Ethics Statement
The animal study was reviewed and approved by the University of California IACUC and the Stanford University IACUC.
Author Contributions
PD performed computational analyses. MH made figures. KB organized the results and led manuscript writing and submission. All authors contributed to writing the manuscript.
Funding
This work was funded by NIH T32-GM136624 and NSF GRFP DGE-1839285 to PD, NIH DP2-AG067666, R00-D041445, R01-DA054374, TRDRP T31KT1437, and T31IP1426, One Mind OM-5596678, Alzheimer’s Association AARG-NTF-20-685694, New Vision Research CCAD2020-002, and ADPA APDA-5589562 to KB, and NIH T32-GM008620 and TRDRP T31DT1729 to MH.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We would like to thank Katrina Bartas for sharing code used to access the allensdk.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fncir.2021.799688/full#supplementary-material
Supplementary Figure 1 | VTADA outputs without the four targeted projection sites. (A) Cumulative explained variance from each principal component. (B) Samples are plotted in PCA space for the 1st and 2nd components, colored by projection. (C) Samples are plotted in PCA space for the 1st and 3rd components, colored by projection. (D) Heatmap of each output region’s contribution to the first three principal components. (E) Brains are plotted in UMAP space, colored by projection. (F) Output regions are plotted in UMAP space, embedded with respect to z-scores across mouse brains. Clusters represent outputs with similar patterns of variation across the cohort.
Supplementary Figure 2 | VTA input dimensional reduction without non-Cre and projection-undefined conditions. (A) Brains are plotted in PCA space for the 1st and 2nd components, colored by cell type. (B) Brains are plotted in PCA space for the 1st and 2nd components, colored by projection. (C) Brains are plotted in UMAP space, colored by cell type. (D) Brains are plotted in UMAP space, colored by projection.
Supplementary Figure 3 | Input and output z-scores stitched together for all cell types. (A) Z-scores of average input and output counts for each projection condition. Inputs are marked with a green down arrow and outputs with a red up arrow. Regions are sorted according to the projection in which they receive the highest rank.
Supplementary Figure 4 | Revealing RABV input PCs and starter cell locations that predict projection conditions. (A) Logistic regression coefficients for five principal components. Positive coefficients predict this condition when the feature is higher. Negative coefficients predict this condition when the feature is lower. Model scores are provided in Table 3. (B) Logistic regression coefficients for starter cell location. (C) Logistic regression coefficients for five principal components and starter cell location.
Supplementary Figure 5 | Projection portraits for all inputs from the Allen Brain Connectivity Atlas. All regions are in the same order as index from Figure 4C.
Footnotes
References
Alheid, G. F., and Heimer, L. (1988). New perspectives in basal forebrain organization of special relevance for neuropsychiatric disorders: the striatopallidal, amygdaloid, and corticopetal components of substantia innominata. Neuroscience 27, 1–39. doi: 10.1016/0306-4522(88)90217-5
Barish, S., Nuss, S., Strunilin, I., Bao, S., Mukherjee, S., Jones, C. D., et al. (2018). Combinations of DIPs and Dprs control organization of olfactory receptor neuron terminals in Drosophila. PLoS Genetics 14:e1007560. doi: 10.1371/journal.pgen.1007560
Bashaw, G. J., and Klein, R. (2010). Signaling from axon guidance receptors. Cold Spring Harb. Perspect. Biol. 2, 1–16.
Beier, K. T. (2019). Hitchhiking on the neuronal highway?: mechanisms of transsynaptic specificity. J. Chem. Neuroanat. 99, 9–17. doi: 10.1016/j.jchemneu.2019.05.001
Beier, K. T. (2021). The Serendipity of viral trans-neuronal specificity: more than meets the eye. Front. Cell. Neurosci. 15:720807. doi: 10.3389/fncel.2021.720807
Beier, K. T., Gao, X. J., Xie, S., DeLoach, K. E., Malenka, R. C., Luo, L., et al. (2019). Topological organization of ventral tegmental area connectivity revealed by viral-genetic dissection of input-output relations. Cell Rep. 26, 159–167.e6. doi: 10.1016/j.celrep.2018.12.040
Beier, K. T., Steinberg, E. E., DeLoach, K. E., Xie, S., Miyamichi, K., Schwarz, L., et al. (2015). Circuit architecture of VTA dopamine neurons revealed by systematic input-output mapping. Cell 162, 622–634. doi: 10.1016/j.cell.2015.07.015
Bernard, J., and Besson, J. (1988). Convergence of nociceptive information on the parabrachio-amygdala neurons in the rat. C R. Acad. Sci. III 307, 841–847.
Bouarab, C., Thompson, B., and Polter, A. M. (2019). VTA GABA neurons at the interface of stress and reward. Front. Neural Circuits 13:78. doi: 10.3389/fncir.2019.00078
Chiang, M. C., Bowen, A., Schier, L. A., Tupone, D., Uddin, O., and Heinricher, M. M. (2019). Parabrachial complex: a hub for pain and aversion. J. Neurosci. 39, 8225–8230. doi: 10.1523/jneurosci.1162-19.2019
Chou, X. L., Wang, X., Zhang, Z.-G., Shen, L., Zingg, B., Huang, J., et al. (2018). Inhibitory gain modulation of defense behaviors by zona incerta. Nat. Commun. 9:1151. doi: 10.1038/s41467-018-03581-6
Han, S., Soleiman, M., Soden, M., Zweifel, L., and Palmiter, R. D. (2015). Elucidating an affective pain circuit that creates a threat memory. Cell 162, 363–374. doi: 10.1016/j.cell.2015.05.057
Hong, W., Mosca, T. J., and Luo, L. (2012). Teneurins instruct synaptic partner matching in an olfactory map. Nature 484, 201–207.
Kim, C. K., Yang, S. J., Pichamoorthy, N., Young, N. P., Kauvar, I., Jennings, J. H., et al. (2016). Simultaneous fast measurement of circuit dynamics at multiple sites across the mammalian brain. Nat. Methods 13, 325–328. doi: 10.1038/nmeth.3770
Kim, J. I., Ganesan, S., Luo, S. X., Wu, Y. W., Park, E., Huang, E. J., et al. (2015). Aldehyde dehydrogenase 1a1 mediates a GABA synthesis pathway in midbrain dopaminergic neurons. Science 350, 102–106. doi: 10.1126/science.aac4690
Kutlu, M. G., Zachry, J. E., Melugin, P. R., Cajigas, S. A., Chevee, M. F., Kelley, S. J., et al. (2021). Dopamine release in the nucleus accumbens core signals perceived saliency. Curr. Biol. 31, 4748–4761.e8. doi: 10.1016/j.cub.2021.08.052
Lammel, S., Hetzel, A., Häckel, O., Jones, I., Liss, B., and Roeper, J. (2008). Unique properties of mesoprefrontal neurons within a dual mesocorticolimbic dopamine system. Neuron 57, 760–773. doi: 10.1016/j.neuron.2008.01.022
Lammel, S., Ion, D. I., Roeper, J., and Malenka, R. C. (2011). Report projection-specific modulation of dopamine neuron synapses by aversive and rewarding stimuli. Neuron 70, 855–862. doi: 10.1016/j.neuron.2011.03.025
Lammel, S., Lim, B. K., Ran, C., Huang, K. W., Betley, M. J., Tye, K. M., et al. (2012). Input-specific control of reward and aversion in the ventral tegmental area. Nature 491, 212–217. doi: 10.1038/nature11527
Lemos, J. C., Wanat, M. J., Smith, J. S., Reyes, B. A. S., Hollon, N. G., Van Bockstaele, E. G., et al. (2012). Severe stress switches CRF action in the nucleus accumbens from appetitive to aversive. Nature 490, 402–406. doi: 10.1038/nature11436
Lerner, T. N., Shilyansky, C., Davidson, T. J., Evans, K. E., Beier, K. T., Zalocusky, K. A., et al. (2015). Intact-Brain analyses reveal distinct information carried by SNc dopamine subcircuits. Cell 162, 635–647. doi: 10.1016/j.cell.2015.07.014
Lin, R., Liang, J., Wang, R., Yan, T., Zhou, Y., Liu, Y., et al. (2020). The raphe dopamine system controls the expression of incentive memory. Neuron 106, 498–514.e8.
Lutas, A., Kucukdereli, H., Alturkistani, O., Carty, C., Sugden, A. U., Fernando, K., et al. (2019). State-specific gating of salient cues by midbrain dopaminergic input to basal amygdala. Nat. Neurosci. 22, 1820–1833. doi: 10.1038/s41593-019-0506-500
Margolis, E. B., Lock, H., Hjelmstad, G. O., and Fields, H. L. (2006). The ventral tegmental area revisited: is there an electrophysiological marker for dopaminergic neurons? J. Physiol. 577, 907–924.
McInnes, L., Healy, J., Saul, N., and Großberger, L. (2018). UMAP: uniform manifold approximation and projection. J. Open Source Softw. 3:861.
Menegas, W., Bergan, J. F., Ogawa, S. K., Isogai, Y., Umadevi Venkataraju, K., Osten, P., et al. (2015). Dopamine neurons projecting to the posterior striatum form an anatomically distinct subclass. eLife 4:e10032. doi: 10.7554/eLife.10032
Morales, M., and Margolis, E. B. (2017). Ventral tegmental area: cellular heterogeneity, connectivity and behaviour. Nat. Rev. Neurosci. 18, 73–85.
Navratilova, E., Atcherley, C. W., and Porreca, F. (2015). Brain circuits encoding reward from pain relief. Trends Neurosci. 38, 741–750.
Oh, S. W., Harris, J. A., Ng, L., Winslow, B., Cain, N., Mihalas, S., et al. (2014). A mesoscale connectome of the mouse brain. Nature 508, 207–214.
Olds, J., and Milner, P. (1954). Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain. J. Comp. Physiol. Psychol. 47, 419–427. doi: 10.1037/h0058775
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830. doi: 10.1080/13696998.2019.1666854
Rogers, A., and Beier, K. T. (2021). Can transsynaptic viral strategies be used to reveal functional aspects of neural circuitry? J. Neurosci. Methods 348:109005. doi: 10.1016/j.jneumeth.2020.109005
Root, D. H., Mejias-Aponte, C. A., Zhang, S., Wang, H. L., Hoffman, A. F., Lupica, C. R., et al. (2014). Single rodent mesohabenular axons release glutamate and GABA. Nat. Neurosci. 17, 1543–1551.
Saunders, A., Huang, K. W., Vondrak, C., Hughes, C., Smolyar, K., Sen, H., et al. (2021). Ascertaining cells’ synaptic connections and RNA expression simultaneously with massively barcoded rabies virus libraries. bioRxiv [preprint] doi: 10.1101/2021.09.06.459177
Schultz, W. (1998). Predictive reward signal of dopamine neurons. J. Neurophysiol. 80, 1–27. doi: 10.1152/jn.1998.80.1.1
Schwarz, L. A., Miyamichi, K., Gao, X., Beier, K., Charles Weissbourd, B., Deloach, K. E., et al. (2015). Viral-genetic tracing of the input–output organization of a central noradrenaline circuit. Nature 524, 88–92. doi: 10.1038/nature14600
Seabold, S., and Perktold, J. (2010). “Statsmodels: econometric and statistical modeling with python,” in Proceedings of the 9th Python in Science Conference (Austin, TX), 92–96. doi: 10.25080/majora-92bf1922-011
Swanson, L. W. (1982). The projections of the ventral tegmental area and adjacent regions: a combined fluorescent retrograde tracer and immunofluorescence study in the rat. Brain Res. Bull. 9, 321–353. doi: 10.1016/0361-9230(82)90145-9
Tang, W., Kochubey, O., Kintscher, M., and Schneggenburger, R. (2020). A VTA to basal amygdala dopamine projection contributes to signal salient somatosensory events during fear learning. J. Neurosci. 40, 3969–3980. doi: 10.1523/jneurosci.1796-19.2020
Tritsch, N. X., Ding, J. B., and Sabatini, B. L. (2012). Dopaminergic neurons inhibit striatal output through non-canonical release of GABA. Nature 490, 262–266. doi: 10.1038/nature11466
Ungless, M. A., Singh, V., Crowder, T. L., Yaka, R., Ron, D., and Bonci, A. (2003). Corticotropin-releasing factor requires CRF binding protein to potentiate NMDA receptors via CRF receptor 2 in dopamine neurons. Neuron 39, 401–407. doi: 10.1016/s0896-6273(03)00461-6
Wall, N. R., De La Parra, M., Callaway, E. M., and Kreitzer, A. C. (2013). Differential innervation of direct- and indirect-pathway striatal projection neurons. Neuron 79, 347–360. doi: 10.1016/j.neuron.2013.05.014
Wanat, M. J., Hopf, F. W., Stuber, G. D., Phillips, P. E. M., and Bonci, A. (2008). Corticotropin-releasing factor increases mouse ventral tegmental area dopamine neuron firing through a protein kinase C-dependent enhancement of Ih. J. Physiol. 586, 2157–2170. doi: 10.1113/jphysiol.2007.150078
Ward, A., Hong, W., Favaloro, V., and Luo, L. (2015). Toll receptors instruct axon and dendrite targeting and participate in synaptic partner matching in a drosophila olfactory circuit. Neuron 85, 1013–1028. doi: 10.1016/j.neuron.2015.02.003
Watabe-Uchida, M., Zhu, L., Ogawa, S. K., Vamanrao, A., and Uchida, N. (2012). Whole-Brain mapping of direct inputs to midbrain dopamine neurons. Neuron 74, 858–873. doi: 10.1016/j.neuron.2012.03.017
Weissbourd, B., Ren, J., DeLoach, K. E., Guenthner, C. J., Miyamichi, K., Luo, L., et al. (2014). Presynaptic partners of dorsal raphe serotonergic and GABAergic neurons. Neuron 83, 645–662. doi: 10.1016/j.neuron.2014.06.024
Yetnikoff, L., Lavezzi, H. N., Reichard, R. A., and Zahm, D. S. (2014). An update on the connections of the ventral mesencephalic dopaminergic complex. Neuroscience 282, 23–48. doi: 10.1016/j.neuroscience.2014.04.010
Yu, T. W., and Bargmann, C. I. (2001). Dynamic regulation of axon guidance. Nat. Neurosci. 4, 1169–1176. doi: 10.1038/nn748
Zahm, D. S. (2006). The evolving theory of basal forebrain functional - anatomical ‘macrosystems’. Neurosci. Biobehav. Rev. 30, 148–172. doi: 10.1016/j.neubiorev.2005.06.003
Zahm, D. S., Cheng, A. Y., Lee, T. J., Ghobadi, C. W., Schwartz, Z. M., Geisler, S., et al. (2011). Inputs to the midbrain dopaminergic complex in the rat, with emphasis on extended amygdala-recipient sectors. J. Comp. Neurol. 519, 3159–3188. doi: 10.1002/cne.22670
Keywords: VTA (ventral tegmental area), rabies, circuit mapping, dopamine, inputs and outputs, high dimension datasets, spatial patterning
Citation: Derdeyn P, Hui M, Macchia D and Beier KT (2022) Uncovering the Connectivity Logic of the Ventral Tegmental Area. Front. Neural Circuits 15:799688. doi: 10.3389/fncir.2021.799688
Received: 21 October 2021; Accepted: 14 December 2021;
Published: 28 January 2022.
Edited by:
Talia Newcombe Lerner, Northwestern University, United StatesReviewed by:
Mitsuko Watabe-Uchida, Harvard University, United StatesDavid Bortz, University of Pittsburgh, United States
Copyright © 2022 Derdeyn, Hui, Macchia and Beier. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kevin T. Beier, a2JlaWVyQHVjaS5lZHU=