- Department of Biotechnology, Indian Institute of Technology Madras, Chennai, India
Grid cells and place cells are believed to be cellular substrates for the spatial navigation functions of hippocampus as experimental animals physically navigated in 2D and 3D spaces. However, a recent saccade study on head fixated monkey has also reported grid-like representations on saccadic trajectory while the animal scanned the images on a computer screen. We present two computational models that explain the formation of grid patterns on saccadic trajectory formed on the novel Images. The first model named Saccade Velocity Driven Oscillatory Network -Direct PCA (SVDON—DPCA) explains how grid patterns can be generated on saccadic space using Principal Component Analysis (PCA) like learning rule. The model adopts a hierarchical architecture. We extend this to a network model viz. Saccade Velocity Driven Oscillatory Network—Network PCA (SVDON-NPCA) where the direct PCA stage is replaced by a neural network that can implement PCA using a neurally plausible algorithm. This gives the leverage to study the formation of grid cells at a network level. Saccade trajectory for both models is generated based on an attention model which attends to the salient location by computing the saliency maps of the images. Both models capture the spatial characteristics of grid cells such as grid scale variation on the dorso-ventral axis of Medial Entorhinal cortex. Adding one more layer of LAHN over the SVDON-NPCA model predicts the Place cells in saccadic space, which are yet to be discovered experimentally. To the best of our knowledge, this is the first attempt to model grid cells and place cells from saccade trajectory.
Introduction
A map that aids (Andersen et al., 2009) spatial navigation of an animal was believed to be represented in the hippocampal-entorhinal complex (O'Keefe and Dostrovsky, 1971; Taube et al., 1990a,b; Rolls, 1999; Solstad et al., 2008). Grid cells reported in the dorso-caudal medial entorhinal cortex (MEC), fire periodically such that the firing fields of the neuron form a hexagonal grid-like structure in the physical space in which the animal navigates. There is a general consensus that grid cells code for the distance of movement and hence they have been assigned the function of path integration which is essential for spatial navigation (Hafting et al., 2005). There are other spatial cells, fewer in number, like the place cells, border cells, view cells, speed cells etc., that code for one or other aspect of the ambient space (O'Keefe and Dostrovsky, 1971; Taube et al., 1990a,b; Rolls, 1999; Franzius et al., 2007; Solstad et al., 2008; Kropff et al., 2015). The aforementioned neurons are thought to collectively form an internal map of the external space in which the animal navigates.
Killian et al. (2012) reported hexagonal grid-like representations in the MEC of monkeys during mere visual exploration of a scene, even when the animal was not performing active navigation in the external space. Recordings were taken from neurons in Entorhinal Cortex (EC) and hippocampus of three head fixed monkeys, performing a free-viewing visual recognition task, the visual preferential looking task (VPLT; Jutras et al., 2009; Jutras and Buffalo, 2010). Monkeys were shown a sequence of novel images on a computer screen. The displayed images consisted of diverse themes like art, animals, landscape, and people. These static images were scanned by the monkey using a dynamic sequence of fixation. Neurons in MEC emitted action potentials on the multiple fixation points, as the monkey scanned the images; the firing field resembled the canonical grid cells in navigation with distinct hexagonal firing fields (Hafting et al., 2005; Killian et al., 2012). The grid representations generated by the saccadic movements resembled those of spatial navigation in many respects. Gridness scores of saccade grids were comparable with those of the navigation grids. Saccade grids also exhibited theta modulation in its activity. The gradient of the grid scale along the dorso-ventral axis of MEC was reported in the case of saccade grid too and Local Field Potentials (LFPs) showed theta band oscillations (Killian et al., 2012).
Apart from the grid representations on saccade trajectory, experimental studies reported neurons coding for the direction of saccade movement viz. saccade direction (SD) cells (Killian et al., 2015). These cells are analogous to head direction cells, corresponding to spatial navigation, reported in the rat postsubicular region (Taube et al., 1990a,b). SD cells were reported from the posterior EC of two monkeys performing a visual recognition memory task (Manns et al., 2000; Jutras and Buffalo, 2010). During the tasks, the monkeys were allowed to freely scan the complex visual images. These neurons were reported to be preferentially active when the eye movement was made in a particular direction. SD cells showed a gradient in their tuning width such that with the increase in distance from rhinal sulcus, the width of tuning of individual neuron to preferred saccade direction also increased.
There exists a large corpus of literature on the computational models of the grid representation during active navigation. Models of grid cells generally fall into two categories: oscillatory interface models (OI) and attractor network models. Proposed by O'Keefe and Recce (1993), spatial periodicity in OI models arises as a result of the interference between velocity-controlled dendritic and constant somatic oscillations (Burgess et al., 2007) or from purely velocity-driven oscillators (Zilli and Hasselmo, 2010; Burgess and O'Keefe, 2011). In the case of neural attractor model, spatial periodicity arises due to the intrinsic symmetry of the attractor network (Fuhs and Touretzky, 2006; Burak and Fiete, 2009). A hybrid approach has also been used wherein these two methods were combined to explain spatial periodicity (Bush and Burgess, 2014). However, the aforementioned models are based on a biologically unrealistic assumption such as 60° phase difference in the head direction inputs of the oscillatory interference model, or the assumption of the weight connectivity of the attractor network having special symmetry conditions (Mhatre et al., 2012).
The proposed model for the neural representations on the saccade trajectory is built on the principles derived from a recent model that used multisensory modalities to explain the formation of spatial representations during active navigation (Soman et al., 2018a). It was a hybrid neural model that used both oscillatory and rate coded dynamics. The model captured the empirically reported spatial cell representations and the influence of multiple sensory modalities on such representations. We take the general principle of this model and currently adapt it to explain the grid cell representations in saccade trajectory.
We present Saccade Velocity Driven Oscillatory Network (SVDON) model that captures the empirically reported neural representations on saccade trajectories and also makes novel predictions on saccade representation. The input image presented to the SVDON is passed through four stages viz: saccade generation, saccade direction encoding, path integration, and unsupervised neural network stage which are explained in detail in the methods section.
Methods
In this Section, we present two versions of Saccade Velocity Driven Oscillatory Network (SVDON) model (Shown in Figure 1): SVDON Direct PCA (SVDON-DPCA) and SVDON-Network PCA (SVDON-NPCA). Both the models capture the responses of grid cells to saccadic trajectories. SVDON-DPCA model consists of a Saccade Generating stage (SG), Saccade Direction encoding layer (SD), Path Integration layer (PI), and Spatial Cell layer (SC). SVDON-NPCA model shares a similar architecture except SD layer and SC layer, where it uses the self-organizing map (SOM) and Lateral Anti-Hebbian Network -Spatial cell layer (LAHN-SC) as the output layer. SVDON-NPCA is a network extension of SVDON-DPCA.
Figure 1. Model Architecture of SVDON: The model consists of a saccade generation stage, Saccade Direction Encoding stage, Path Integration stage and the output SC layer. (No copyright permission is required).
SVDON-DPCA Architecture
The information flow in the model can be described as follows. The images to be scanned are given as input to the SG stage of the SVDON model to produce saccade trajectory. Velocity vectors are computed from the generated saccade trajectory. These velocity vectors are further passed on to the SD layer, where each neuron encodes for saccadic direction. Responses from the SD layer are passed on to the PI layer via one-to-one connection. Each neuron in the PI layer is a phase oscillator that receives SD response as its input. This further encodes the saccade position information along that direction component as the phase of the respective oscillator. The PI layer projects to the output SC layer which exhibits grid-like pattern by extracting the principal components of the oscillatory response. Each stage of the model is described below.
Saccade Generation (SG) Stage
The model used for saccade generation is a bottom-up model of attention that is based on locating the single most salient location on the saliency map. Given the input color image, different feature maps are produced by applying linear filters to a specific stimulus property like color, orientation, or intensity. The feature maps are then combined to give three Conspicuity Maps and finally, a saliency map is computed for the Conspicuity maps (Walther and Koch, 2006). A winner-take-all (WTA) mechanism finds the coordinates of the most salient location after scanning the saliency map. Inhibition of return (IOR) of a circular shape with fixed radius is applied around the attended location in the saliency map. Subsequent iteration of the WTA network attends to the locations in decreasing order of saliency. The model is verified in several human psychophysical experiments (Itti, 2005; Peters et al., 2005).
Saliency-Based Bottom-up Attention Model
The input image I is first sub-sampled into a Gaussian pyramid. The Gaussian pyramid is created by convolution of input image I with a set of Gaussian filters and subsampling with a decimation factor of 2 to generate a sequence of reduced resolution images. This process is repeated and a total of 9 different scales are created σ = [0, .., 8] (level: 0 corresponds to the original input image; Walther and Koch, 2006). At level σ, the resolution of the image is 1/2σ of the original image. For level eight i.e., σ = 8, the resolution equals to 1/256th of the input image I and (1/256)2 of the total no of pixels of the input image.
The intensity map MI is computed by adding the r (red), g (green), b (blue) values of the color image (Walther and Koch, 2006).
Intensity Pyramid MI(σ) is created by repeating the same operation at different levels.
Using the Image Pyramid, blue-yellow (BY), and red-green (RG) opponency maps are created at every level (Walther and Koch, 2006).
Orientation maps Mθ are obtained from intensity maps by convolving the various levels of Intensity pyramids with Gabor filters (Walther and Koch, 2006):
Multiscale feature extraction is done by across scale subtraction Θ between two maps levels c and s in these pyramids. Across scale subtraction Θ, is defined as interpolation to the finer scale, followed by point-to-point subtraction between maps. In other words, it is the difference between fine and coarse scale features of an image. Using many different values for c and s provides truly multiscale feature extraction (Walther and Koch, 2006).
where
Ɲ (·) is a non-linear iterative operator, which promotes local completion among neighborhood salient locations. At each iteration step, self-excitation and neighbor-induced inhibition is implemented with a “difference of Gaussians” filter and then followed by rectification.
Using across scale addition ⊕ features maps are then summed over then normalized again.
Three conspicuity maps of general features are created: one for intensity, one for color and one for orientation (Walther and Koch, 2006).
Then, a single saliency map is created by combining all the three conspicuity maps (Walther and Koch, 2006).
Within the saliency map, different locations compete for saliency. The most salient location is selected for attention. Inhibition of Return (IOR) is applied to the selected area for some time within a given radius. In the second iteration, the remaining locations compete for saliency and the second most salient location is selected. Thus, a saccadic scan path is created on the image in order of decreasing saliency (Walther and Koch, 2006).
In the simulation, to match with the experimental paradigm, we used 36 novel images wherein each image is presented twice, for 10 s each to produce the saccade trajectory (Killian et al., 2012). Two sample figures with trajectories superimposed on them are shown in Figures 2A,B. Image Source: Caltech-256 Object Category Dataset (Griffin et al., 2007).
Figure 2. Two sample images used (A1,A2) and overlapped trajectory (Yellow) generated on them (B1–B2) by bottom-up model of attention. Sample images are taken from: Caltech-256 Object Category Dataset. (No copyright permission is required).
Saccade Direction Layer (SD) Layer
Saccade trajectory generated from the SG stage is passed to the saccade direction layer (SD). SD layer encodes the animal's current saccade direction as given in Equation (11). The response of ith cell of SD layer is computed as the animal's current saccade direction projection on the ith preferred direction given as.
θ, θi are the current direction and the preferred direction of ith SD cell, respectively.
Path Integration (PI) Layer
SD layer connects to PI layer via one-to-one connections. The response of the ith PI cell is given as,
β is a spatial scaling parameter, A = Amplitude of oscillations. s is the speed of the Saccade. fo is the base frequency of the PI neuron. The ith PI neuron is then thresholded by the following equation.
where, H is Heaviside function and εηthe threshold value.
Power of oscillation is given in decibel as:
Output Layer (SC)
PI values project to SC layer via the weight stage (W–PC). Weights (WPC) from PI layer to SC layer are computed by performing Principal Component Analysis (PCA) over η Thr. PCA was done by extracting the top few eigenvectors of the covariance matrix of the . The response of the ith neuron in the SC layer is computed as:
where, H is Heaviside function.
N is the number of PI neurons, εSC is the threshold value.
The top few components of the computed principal component (PC) will be shown to reveal a variety of spatial cell-like responses including grid cells (Figure 3). Spatially periodic firing emerges due to the inherent periodicity in the PC weights. Hexagonal grid-like activity is shown by the neurons whose peaks are separated by ≈ 60° (PC = 6).
Figure 3. Spatial representations from SC layer: (A–F) Firing fields (Left): Blue is the trajectory of the Saccade and red dots are the spike locations; Firing Rate map (Middle): red is peak rate and blue is no firing; Autocorrelation map (Right) of SC layer neuron in SVDON-PCA model.
SVDON-NPCA Architecture
SVDON-NPCA model has a similar architecture to SVDON-DPCA. Here a neural network implementation of PCA is used instead of direct PCA, we replaced 1D SD layer with a Self-Organizing Map (SOM) where a two-dimensional layer of neuron is used to represent saccade direction. Lateral Anti-Hebbian Network (LAHN) is the network implementation of PCA (Foldiak, 1989) which is used to extract the optimal features from the input data by variance maximization principle. The changes made in this model permit us the leverage to study grid cells from network perspective.
Below we explain the SOM architecture of SD layer and the LAHN layer in detail.
Saccade Direction Layer
Like in SVDON-DPCA, here also saccade velocity vectors are passed on to the SOM in the next layer to obtain a direction map. SOM neuron response is given as:
ψT = [sin(θ), cos(θ)] where θ is current direction of navigation, is given as input given to SOM.
W = Normalized afferent weight matrix of SOM.
Lateral Anti Hebbian Network (LAHN) Layer
LAHN is an afferent Hebbian and lateral anti-Hebbian unsupervised neural network, which extracts the variance feature from the input.
The network is described as
q is the weight of the afferent connection
χ is input PI value
m is the input dimension
n is the number of LAHN neurons, ξ is the network response
Hebbian rule is used to update afferent connection and a anti-Hebbian rule is used to update lateral connection as described below
Where ηL and ηF lateral and forward learning rate, respectively. After training network weights of LAHN network converges to subspace of principal components of input vector.
Gridness Measure
The hexagonal gridness measure is quantified using Hexagonal Gridness score (HSG) on firing fields of each neuron. HGS is computed using Equations 20, 21 (Hafting et al., 2005).
r is an autocorrelation map, λ (x, y) is firing rate at (x,y) location of the rate map, M is the total no of pixels in the rate map, τx and τy correspond to x and y coordinates with a spatial lag
HGS stands for Hexagonal Gridness Score;
r° is the autocorrelation map rotated by θ degree;
cor(·) stands for correlation function;
min(·) function returns the minimum of its two arguments.
Results
SVDON-DPCA
SC neuron activity is mapped onto saccadic trajectory. Figure 3 shows the firing field, firing rate map and autocorrelation map of the six SC layer neuron receiving the first six principle components.
Hexagonal Firing field is shown by the neuron which received the sixth principal component (Figure 4). A neuron is considered to be canonical if it had a HGS>0 (Hafting et al., 2005).
Figure 4. Spatial representations from sixth SC layer neuron: Firing field (A): Blue is the trajectory of the Saccade and red dots are the firing locations; Firing map (B): red is peak rate and blue is no firing; Autocorrelation map (C) of the sixth SC layer neuron in SVDON- PCA model.
Oscillation Power and β Modulation
Necessity of oscillations to produce grid patterns was contested by varying the oscillatory power in the PI layer. Power is a function of amplitude of oscillations (Equation 14). Hence by changing the amplitude variable (Equation 12), we were able to change the power of the oscillations. By reducing oscillation power there was a loss of Grid Field formation, but grid field reemerged as the oscillation power was restored (Figure 5A).
Figure 5. Oscillation power modulation (A): loss of grid field formation on reducing oscillation power (A1); grid field reemerged as the oscillation power was restored (A2). β modulation (B): loss of grid field when β = 0 (B1), grid field reemerged when β = 0.2.
We also analyzed the criticality of the modulation of oscillations in the PI layer by varying the β parameter (Equation 12). Similar loss of grid field is seen with β modulation (Figure 5B).
SVDON-NPCA
LAHN (SC) network of the model shows the spatially periodic firing in Figure 6. The firing fields of LAHN (SC) neurons have more heterogeneity compared to PCA.
Figure 6. Spatial representations of three different neurons (A–C) from LAHN (SC) layer. Left Column is firing field of the neurons (A1–C1); Middle Column is the firing rate of the neurons (A2–C2); and the Right column is autocorrelation map of the neurons (A3–C3).
Spatial Characteristics of Grid Cells
Grid Scale variation across the dorso-ventral axis of MEC has been demonstrated in experimental studies of rodent navigation (Brun et al., 2008; Stensola et al., 2012). A similar gradient was observed in the case of saccadic trajectories also (Killian et al., 2012). To capture this in the model, we varied the β parameter as shown in Figure 7A. This variation is shown in Figure 7B1 contrasted with the experiment results in Figure 7B2. Grid scale was quantified by computing the distances between the six inner hexagonal vertices from the central peak in the autocorrelation map, minimum of these values represents the grid scale. (Burn, Solstad et al., 2008).
Figure 7. Effect of modulation factor β. (A) Grid Scale gradient is captured by varying spatial scale parameter β. (B) Comparison between the model and the experiment. (B1) Grid scale variation in the model by varying β (Grid scale is averaged for 10 trajectories). (B2) Empirically observed grid scale variation at different locations of medial to dorsal rhinal sulcus axis (Killian et al., 2012).
Predicting Place Cells Activity
SVDON-NPCA model is capable of exhibiting place cell like activity on saccadic space when a second LAHN (place cells) layer is added after the first LAHN (SC Layer in Figure 1). Experimental Studies have shown that the number of neurons (Akdogan et al., 2011) in rat CA1 region is about 90 percent of EC (considering only layer 2 and layer 3 of MEC as they form major afferent synapse with CA1). Accordingly, a similar ratio of neurons is kept in LAHN (SC) and LAHN (PC) layers. The output the LAHN (SC) is passed on to LAHN(PC) layer and the activity of the LAHN(PC) layer is observed, LAHN(PC) neurons showed a highly localized firing activity similar to that of place cells. To qualify a neuron as a place cell, the number of peaks in autocorrelation map is examined. A cell is characterized as a place cells if the number of peaks in autocorrelation is one (Soman et al., 2018b) due to its localized firing field and lack of spatial periodicity. LAHN (PC) layer also predicted spatial cells that showed spatial periodicity as shown in Figure 8.
Figure 8. LAHN (PC) layer activity of six different neurons. (A1–F1) firing field (blue is the trajectory of the Saccade and red dots are the spike locations) and (A2–F2) firing maps (red is peak rate and blue is no firing) of 6 different neurons in LAHN(PC). Characterization of LAHN (PC) layer showing the % of cells type vs. the number of neurons in layer. Here PC is Place cells, GC is Grid Cells and NC is non-spatial cells.
Place Activity on Single Image
In the simulations described in the previous sections, the trajectories were obtained from a large number of images and grid and place cell responses are generated from that combined trajectory. In this section, we generate a trajectory from single images and superimpose the place cells generated from that trajectory back on the original image. The objective is to see if the grid and place cells obtained from the trajectory correspond to salient features/objects in the image. To produce saccade trajectory, we presented a single image for 125 s to the saccade trajectory generating model. The trajectory is then used to train the model of Figure 1 with the added module of LAHN (PC). Figure 9 shows results from two images. Place cells obtained from these images are indeed localized on salient objects in the image such as face of the person (Figure 9A3) or the bat of the batsman (Figure 9B3). Image source : ImageNet: A large scale hierarchical Image Database (Deng et al., 2009).
Figure 9. Spatial cell response to a single image. (A1,B1) Images given as the input. (A2,B2) are the outputs of LAHN(SC) and (A3,B3) are the outputs of LAHN(PC). Red dots are the firing locations on the image (Images source is ImageNet: A Large-Scale Hierarchical Image Database).
Discussion
We present two models: SVDON-DPCA and SVDON-NDPCA to capture the saccadic representation based on the input saccadic trajectory formed on a series of images. In the SVDON-DPCA, we have shown the formation of hexagonal grid cell periodicity using Direct PCA. The model is simple and transparent and gives an insight into the origins of the grid cell spatial periodicity. In SVDON—NPCA, we used LAHN layer instead of direct PCA to produce hexagonal grid cells. This substitution is made since LAHN is based on a biologically more plausible learning mechanisms viz. lateral anti-Hebbian and afferent Hebbian learning, than the PCA. LAHN weight vectors have been shown to converge to the principal component subspace (Foldiak, 1989). Such a connectivity pattern is critical for a self-organization process because the excitatory Hebbian connections to a neuron could essentially correlate its activity to the input features and the lateral inhibitory connections could ensure competition among the ensemble of neurons to extract out diverse features of the input. This sort of connectivity pattern is biologically plausible and is consistent with the empirically reported GABArgic interneuron connections between the stellate cells in the superficial layer of the medial entorhinal cortex (Couey et al., 2013). In addition to this, anti-Hebbian network has been previously shown to encode the input data optimally by minimizing a representation error/multidimensional scaling cost function (Pehlevan et al., 2015). Hence the model gives insight to the self-organization among the grid/quasi-grid units and the relevance of such a connectivity pattern for optimal spatial representation.
The primate visual system scanning a complex visual scene seems to employ a serial search strategy. In primates, object identification and spatial analysis of the image is achieved by a series of rapid saccadic eye movements. Saccades occur reflexively whenever the eyes are open and also can be elicited voluntarily (Liversedge and Findlay, 2000). Different visual locations compete for activity and the strongest response draws the visual attention. These are called visually salient locations (Slllito et al., 1995; Sillito and Jones, 1996; Levitt and Lund, 1997). The bottom-up model used in our architecture for saccade generation is based on a similar approach that generates a two-dimensional saliency map of the visual environment. Experimental evidence has shown the existence of neural maps in the pulvinar, the superior colliculus, and the intraparietal sulcus which encode for the saliency for visual stimulus (Robinson and Petersen, 1992; Gottlieb et al., 1998; Colby and Goldberg, 1999; Rockland et al., 1999). The results from the models discussed above are similar to the grid cells that have been reported in the rat and bat during locomotion (Hafting et al., 2005; Yartsev et al., 2011). These results imply that ideas of spatial representation for navigation also apply to complex visual scene analysis because these results show that visual exploration of space can give rise to representations for that space even without performing active navigation over the corresponding physical space.
The results produced by our model are consistent with the experimental literature. The variation in the gradient of the grid scale along the dorso-ventral axis of the entorhinal cortex is reported in the experimental literature (Brun et al., 2008; Stensola et al., 2012). It is shown that the grid scale varied from low to high value with the distance from the rhinal sulcus (Killian et al., 2012), which is consistent with a dorsal-ventral gradient in rodents and bats for navigation (Hafting et al., 2005; Yartsev et al., 2011). To incorporate this in our model, we varied the parameter β in Path Integration layer. β determines the modulation factor for the path integration neuron. Even though the model captures gradient in the grid scale by varying the β parameter, it does not explain the modular formation of grid cells along the dorso-ventral axis of MEC where, in each module, grid cells with the same grid scale and grid orientation and different grid phases occur and the grid scale varies across the modules in a geometric progression fashion with a scale ratio of √2. Here, the grid scale can be fitted to any ratio by varying the β parameter accordingly.
From the model results it is understood that oscillations are critical for the grid cell generation. Oscillations introduce the first spatial periodicity by encoding the position information in their respective phase. This periodicity is further transformed to grid-like representations in the higher layer. It was empirically shown in rats that abolition of theta activity in the MEC causes the grid representations to fade out (Giocomo et al., 2007). In the model, we tested the same by decreasing the oscillatory power of the path integration neurons and found a corresponding disruption in the grid representations (Figure 5A). However, we would like to pose this oscillation and grid cell phenomenon as a prediction from the model since this phenomenon has not been reported yet in saccade studies. Further analysis also showed the criticality of modulation in oscillation for the grid formation. Modulation is set to off by making β set to zero. No grid fields are observed in that condition. When β is set to a non-zero value, grid fields start to appear (Figure 5B).
Place cell like activity is predicted by the model on the scaddic space upon adding an extra layer of LAHN(PC) on top of LAHN(SC). Although place cell like activity have not been experimentally reported yet. On giving saccadic trajectory of single images as a input to the model, LAHN(PC) neurons fired on naturally significant locations on image like the face and the bat of the batsman shown in Figure 9. These predictions are consistent with the recent observation that navigation in physical space can be just one of the many roles played by place cells, grid cells and other hippocampal spatial cells.
Taken together, these models computationally try to explain the generation of grid cell representations in the entorhinal cortex based on the saccade trajectory generated during visual exploration of a natural scene. They also predicts the place cells like activity on saccadic space. The grid field generated by the SVDON—DPCA does not have the central firing field which we consider as the limitation of this model, this limitation is overcome by the second model viz. SVDON—NPCA. In the future work, we would like to extend this model by including visual and locomotor input along with the saccadic input and search for the possible existence of joint representations arising out of the spatial navigation of the physical space and saccadic exploration of the image space. Virtual Reality (VR) environments offer a convenient setting for conducting such simulation studies.
Author Contributions
AC contributed to performing simulations and analysis. KS contributed to simulation design and manuscript preparation. VSC contributed to simulation design and manuscript preparation.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Akdogan, I., Unal, N., and Adiguzel, E. (2011). Estimation of the number of neurons in the hippocampus of rats with penicillin induced epilepsy. Image Analysis Stereology 21:117. doi: 10.5566/ias.v21.p117-120
Andersen, P., Morris, R., Amaral, D., Bliss, T., and Keefe, O.'J. (2009). The Hippocampus Book. Oxford: Oxford University Press.
Brun, V. H., Solstad, T., Kjelstrup, K. B., Fyhn, M., Witter, M. P., Moser, E. I., et al. (2008). Progressive increase in grid scale from dorsal to ventral medial entorhinal cortex. Hippocampus 18, 1200–1212. doi: 10.1002/hipo.20504
Burak, Y., and Fiete, I. R. (2009). Accurate path integration in continuous attractor network models of grid cells. PLoS Comput. Biol. 5:2. doi: 10.1371/journal.pcbi.1000291
Burgess, N., Barry, C., and O'Keefe, J. (2007). An oscillatory interference model of grid cell firing. Hippocampus 17, 801–812. doi: 10.1002/hipo.20327
Burgess, N., and O'Keefe, J. (2011). Models of place and grid cell firing and theta rhythmicity. Curr. Opin. Neurobiol. 21, 734–744. doi: 10.1016/j.conb.2011.07.002
Bush, D., and Burgess, N. (2014). A hybrid oscillatory interference/continuous attractor network model of grid cell firing. J. Neurosci. 34, 5065–5079. doi: 10.1523/JNEUROSCI.4017-13.2014
Colby, C. L., and Goldberg, M. E. (1999). Space and attention in parietal cortex. Annu. Rev. Neurosci. 22, 319–349. doi: 10.1146/annurev.neuro.22.1.319
Couey, J. J., Witoelar, A., Zhang, S.-J., Zheng, K., Ye, J., Dunn, B., et al. (2013). Recurrent inhibitory circuitry as a mechanism for grid formation. Nat. Neurosci. 16, 318–324. doi: 10.1038/nn.3310
Deng, J., Dong, W., Socher, R., Li, L.-J., Kai, L.i, and Li Fei-Fei (2009). “ImageNet: a large-scale hierarchical image database,” in 2009 IEEE Conference on Computer Vision and Pattern Recognition (Miami, FL: IEEE), 248–255.
Foldiak, P. (1989). “Adaptive network for optimal linear feature extraction,” in Neural Networks, 1989. IJCNN., International Joint Conference On, Vol. 1 (Washington, DC), 401–405.
Franzius, M., Sprekeler, H., and Wiskott, L. (2007). Slowness and sparseness lead to place, head-direction, and spatial-view cells. PLoS Comput. Biol. 3, 1605–1622. doi: 10.1371/journal.pcbi.0030166
Fuhs, M. C., and Touretzky, D. S. (2006). A spin glass model of path integration in rat medial entorhinal cortex. J. Neurosci. 26, 4266–4276. doi: 10.1523/JNEUROSCI.4353-05.2006
Giocomo, L. M., Zilli, E. A., Fransén, E., and Hasselmo, M. E. (2007). Temporal frequency of subthreshold oscillations scales with entorhinal grid cell field spacing. Science 315, 1719–1722. doi: 10.1126/science.1139207
Gottlieb, J. P., Kusunoki, M., and Goldberg, M. E. (1998). The representation of visual salience in monkey parietal cortex. Nature 391, 481–484. doi: 10.1038/35135
Griffin, G., Holub, A., and Perona, P. (2007). Caltech-256 Object Category Dataset. Available online at: https://authors.library.caltech.edu/7694/
Hafting, T., Fyhn, M., Molden, S., Moser, M., and Moser, E. I. (2005). Microstructure of a spatial map in the entorhinal cortex. Nature 436, 801–806. doi: 10.1038/nature03721
Itti, L. (2005). Quantifying the contribution of low-level saliency to human eye movements in dynamic scenes. Visual Cogn. 12, 1093–1123. doi: 10.1080/13506280444000661
Jutras, M. J., and Buffalo, E. A. (2010). Recognition memory signals in the macaque hippocampus. Proc. Natl. Acad. Sci. 107, 401–406. doi: 10.1073/pnas.0908378107
Jutras, M. J., Fries, P., and Buffalo, E. A. (2009). Gamma-band synchronization in the macaque hippocampus and memory formation. J. Neurosci. 29, 12521–12531. doi: 10.1523/JNEUROSCI.0640-09.2009
Killian, N. J., Jutras, M. J., and Buffalo, E. A. (2012). A map of visual space in the primate entorhinal cortex. Nature 491, 761–764. doi: 10.1038/nature11587
Killian, N. J., Potter, S. M., and Buffalo, E. A. (2015). Saccade direction encoding in the primate entorhinal cortex during visual exploration. Proc. Natl. Acad. Sci. 112,15743-15748. doi: 10.1073/pnas.1417059112
Kropff, E., Carmichael, J. E., Moser, M. B., and Moser, E. I. (2015). Speed cells in the medial entorhinal cortex. Nature 523, 419–424. doi: 10.1038/nature14622
Levitt, J. B., and Lund, J. S. (1997). Contrast dependence of contextual effects in primate visual cortex. Nature 387, 73–76. doi: 10.1038/387073a0
Liversedge, S. P., and Findlay, J. M. (2000). Saccadic eye movements and cognition. Trends Cogn. Sci. 4, 6–14. doi: 10.1016/S1364-6613(99)01418-7
Manns, J. R., Stark, C. E. L., and Squire, L. R. (2000). The visual paired-comparison task as a measure of declarative memory. Proc. Natl. Acad. Sci. 97, 12375–12379. doi: 10.1073/pnas.220398097
Mhatre, H., Gorchetchnikov, A., and Grossberg, S. (2012). Grid cell hexagonal patterns formed by fast self-organized learning within entorhinal cortex. Hippocampus 22, 320–334. doi: 10.1002/hipo.20901
O'Keefe, J., and Dostrovsky, J. (1971). The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. Brain Res. 34, 171–175. doi: 10.1016/0006-8993(71)90358-1
O'Keefe, J., and Recce, M. L. (1993). Phase relationship between hippocampal place units and the EEG theta rhythm. Hippocampus 3, 317–330. doi: 10.1002/hipo.450030307
Pehlevan, C., Hu, T., and Chklovskii, D. B. (2015). A hebbian/anti-hebbian neural network for linear subspace learning: a derivation from multidimensional scaling of streaming data. arXiv arXiv:1503.00669v1. doi: 10.1162/NECO_a_00745
Peters, R. J., Iyer, A., Itti, L., and Koch, C. (2005). Components of bottom-up gaze allocation in natural images. Vision Res. 45, 2397–2416. doi: 10.1016/j.visres.2005.03.019
Robinson, D. L., and Petersen, S. E. (1992). The pulvinar and visual salience. Trends Neurosci. 15, 127–132. doi: 10.1016/0166-2236(92)90354-B
Rockland, K. S., Andresen, J., Cowie, R. J., and Robinson, D. L. (1999). Single axon analysis of pulvinocortical connections to several visual areas in the macaque. J. Compar. Neurol. 406, 221–250. doi: 10.1002/(SICI)1096-9861(19990405)406:2<221::AID-CNE7>3.0.CO;2-K
Rolls, E. T. (1999). Spatial view cells and the representation of place in the primate hippocampus. Hippocampus 9, 467–480. doi: 10.1002/(SICI)1098-1063(1999)9:4<467::AID-HIPO13>3.0.CO;2-F
Sillito, A. M., and Jones, H. E. (1996). Context-dependent interactions and visual processing in V1. J. Physiol. Paris 90, 205–210.
Slllito, A. M., Grieve, K. L., Jones, H. E., Cudeiro, J., and Davls, J. (1995). Visual cortical mechanisms detecting focal orientation discontinuities. Nature 378, 492–496. doi: 10.1038/378492a0
Solstad, T., Solstad, T., Boccara, C. N., Boccara, C. N., Kropff, E., Kropff, E., et al. (2008). Representation of geometric borders in the entorhinal cortex. Science 322, 1865–1868. doi: 10.1126/science.1166466
Soman, K., Muralidharan, V., and Chakravarthy, V. S. (2018a). A Model of multisensory integration and its influence on hippocampal spatial cell responses. IEEE T. Cogn. Dev. Syst. 10, 637–646. doi: 10.1109/TCDS.2017.2752369
Soman, K., Muralidharan, V., and Chakravarthy, V. S. (2018b). A unified hierarchical oscillatory network model of head direction cells, spatially periodic cells, and place cells. Eur. J. Neurosci. 47, 1266–1281. doi: 10.1111/ejn.13918
Stensola, H., Stensola, T., Solstad, T., Frøland, K., Moser, M. B., and Moser, E. I. (2012). The entorhinal grid map is discretized. Nature 492, 72–78. doi: 10.1038/nature11649
Taube, J. S., Muller, R. U., and Ranck, J. B. (1990a). Head-direction cells recorded from the postsubiculum in freely moving rats. I. description and quantitative analysis. J. Neurosci. 10, 420–435.
Taube, J. S., Muller, R. U., and Ranck, J. J. B. (1990b). Head-direction cells recorded from the postsubiculum in freely moving rats. II. effects of environmental manipulations. J. Neurosci. 10, 436–447. doi: 10.1212/01.wnl.0000299117.48935.2e
Walther, D., and Koch, C. (2006). Modeling attention to salient proto-objects. Neural Netw. 19, 1395–1407. doi: 10.1016/j.neunet.2006.10.001
Yartsev, M. M., Witter, M. P., and Ulanovsky, N. (2011). Grid cells without theta oscillations in the entorhinal cortex of bats. Nature 479, 103–107. doi: 10.1038/nature10583
Keywords: saccades, grid cells, salience map, hippocampus, principal component analysis-PCA, oscillator
Citation: Chauhan A, Soman K and Chakravarthy VS (2019) Saccade Velocity Driven Oscillatory Network Model of Grid Cells. Front. Comput. Neurosci. 12:107. doi: 10.3389/fncom.2018.00107
Received: 23 July 2018; Accepted: 13 December 2018;
Published: 10 January 2019.
Edited by:
Daya Shankar Gupta, Camden County College, United StatesReviewed by:
Alessandro Treves, Scuola Internazionale Superiore di Studi Avanzati (SISSA), ItalyJian K. Liu, University of Leicester, United Kingdom
Copyright © 2019 Chauhan, Soman and Chakravarthy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: V. Srinivasa Chakravarthy, c2NoYWtyYUBpaXRtLmFjLmlu