Abstract
The classical receptive field (CRF) of a spiking visual neuron is defined as the region in the visual field that can generate spikes when stimulated by a visual stimulus. Many visual neurons also have an extra-classical receptive field (ECRF) that surrounds the CRF. The presence of a stimulus in the ECRF does not generate spikes but rather modulates the response to a stimulus in the neuron's CRF. Neurons in the primate Middle Temporal (MT) area, which is a motion specialist region, can have directionally antagonistic or facilitatory surrounds. The surround's effect switches between directionally antagonistic or facilitatory based on the characteristics of the stimulus, with antagonistic effects when there are directional discontinuities but facilitatory effects when there is directional coherence. Here, we present a computational model of neurons in area MT that replicates this observation and uses computational building blocks that correlate with observed cell types in the visual pathways to explain the mechanism of this modulatory effect. The model shows that the categorization of MT neurons based on the effect of their surround depends on the input stimulus rather than being a property of the neurons. Also, in agreement with neurophysiological findings, the ECRFs of the modeled MT neurons alter their center-surround interactions depending on image contrast.
Introduction
The classical receptive field (CRF) of a spiking visual neuron is defined as the region in the visual field within which the presentation of a visual stimulus can generate spikes in that neuron. Many visual neurons also have an extra-classical receptive field (ECRF) that surrounds the CRF. The presence of the stimulus in the ECRF cannot generate a response by itself but can have an excitatory or inhibitory effect on the neuron's response (Barlow and Levick, 1965). For direction-selective neurons, the directional tuning of the surround is mainly antagonistic, which results in a reduction of the activity of the neuron when the motion in the surround is in the preferred direction of the center; this is called an antagonistic surround. However, in some neurons, the surround can instead have a facilitatory effect, which reinforces the activity of the neuron when the motion in the surround is in the preferred direction of the center; this is a facilitatory surround (Barlow and Levick, 1965).
It is commonly reported that most of the neurons in the primate middle temporal (MT) area, which is a motion specialist region, have directionally antagonistic surrounds (Albright, 1984). However, Huang et al. (2007) showed that the dominance of the antagonistic surround in the literature is due to the characteristics of the stimuli that have been commonly used.
Here, we present a model in which the ECRFs of MT neurons are adaptive to the changes in the input stimulus. The surrounds have an antagonistic effect when there is a discontinuity in the input stimulus. In our definition, any inconsistency in the characteristics of the stimulus are regarded as discontinuities, including changes in luminance or shape. We show in our model that the level of the antagonism driven by the surround increases with increases in stimulus contrast. The antagonistic effect of the surround assists the MT neurons to detect discontinuities in the input stimulus and segregates stimuli that are moving in different directions. The antagonistic surrounds of the MT neurons switch to having facilitatory effects when there is coherency in the input stimulus, which facilitates the propagation of integrative signals.
Existing neurophysiological findings support the concept of stimulus-dependent center-surround characteristics (Thiele, 2007). To demonstrate alterations in surround modulation that depend on the input stimulus, Huang et al. (2007) evaluated the activity of MT neurons in response to two different stimuli and observed adaptive changes of the surround effect. According to their observations, the surround represents integrative features in response to moving contours to overcome ambiguous motion information resulting from the aperture problem. The aperture problem refers to the generation of ambiguous motion information by neurons with small receptive fields that are only able to measure the motion component in the direction orthogonal to the long edge of a bar stimulus (Born and Bradley, 2005). However, the surround regions of Huang et al. (2007) MT neurons show antagonistic features in response to moving dot fields that carry unambiguous motion signals. These experiments demonstrate switching of antagonistic surrounds to integrative surrounds when there is uncertainty in the CRF (Huang et al., 2007). The source of motion uncertainty could be the aperture problem, low luminance contrast, or the presence of noise. It is possible to make similar conclusions based on the results of experiments performed by Pack et al. (2005), who observed an enlargement in the spatial summation of the receptive fields of MT neurons with decreasing stimulus contrast. The increases in the sizes of the receptive fields are equivalent to the characteristics of the integrative surround. Therefore, a reduction in the contrast of the stimulus, which contributes to the uncertainty in the center of the receptive field, results in strengthening of the integrative features of the surround compared to the case at high contrasts (Pack et al., 2005).
Huang et al. (2008) also showed that the modulatory effects of MT neurons depend on the response magnitudes of the MT neurons to different stimuli. Their experiments showed that a stimulus that drives low amplitude responses produces surround integration while a stimulus that drives high amplitude responses produces surround antagonism and, therefore, image segmentation.
Pack et al. (2005) investigated the effects of changes in stimulus contrast on the suppressive surrounds of the neurons in area MT. They found a reduction in the suppressive influence of the surround in some MT neurons at low contrasts. The responses of neurons continuously increase with increases in the size of the stimulus, without saturating at low contrasts. For high contrasts, responses increase with increasing diameter until an optimum is reached, and then fall as the diameter expands further. Therefore, the responses of the neurons to stimuli with large diameters at high contrasts are significantly lower than the responses of the same neurons to the same large-diameter stimuli at low contrasts (Pack et al., 2005).
The results of psychophysical experiments performed by Tadin et al. (2003) can be also explained by the possibility of the facilitation of MT responses at low contrasts with larger stimuli. They proposed that the transition from suppressive surround to integrative surround occurs at around 5% contrast (Tadin et al., 2003). In their psychophysical experiments, they measured the duration threshold, which is the time that human observers need to recognize the accurate direction of motion. They observed an increase in duration threshold with increases in contrast and vice versa. They also revealed an increase in duration threshold as the size of the moving stimulus increased when embedded in high contrast noise, which decreased the visibility of the stimulus (Tadin et al., 2003).
In agreement with these neurophysiological and psychophysical findings, the MT neurons in our model have stimulus-dependent modulatory effects. The surrounds of the MT neurons have an inhibitory effect in the case of discontinuities in the stimulus or a high level of the contrast and a facilitatory effect where there is a coherency in the stimulus. Our proposed model explains the circuitry of this modulatory effect of the surround in response to the properties of the input stimulus.
Methods
There is a clear distinction between two different categories of neural models, functional and mechanistic models (Kay, 2018). Functional models account for the functional relationship between input, namely the stimulus, and output, which are the responses of the neurons. However, mechanistic models go further and not only predict the response of the neurons but also include the mechanism that associates the relationship between the stimulus and the responses of the neurons (Kay, 2018). The proposed model here is a mechanistic model that describes the relationship between different types of neurons in V1 and MT by explaining the specific role of every subtype of the modeled elements in the estimation of the correct direction of motion, which accords to the existing neurophysiological data.
The model is a two-stage process with initial visual information extracted by neurons that model the responses of V1 cells and then transmit this information to the neurons in MT for integration of local motion signals and segregation of overlapping stimuli that are moving in different directions (Figure 1). There are models of three types of neurons in V1 that each have distinctive features and different roles in visual information processing: standard complex V1 neurons, end-stopped V1 neurons, and V1 neurons with ECRFs, which have surrounds that are sensitive to the luminance of the stimulus and suppress center responses (Zarei Eskikand et al., 2018).
Figure 1
Standard V1 neurons determine the borders of the stimulus (edges), end-stopped neurons respond only to the unambiguous motion information of the end-points of the stimulus, and ECRF V1 neurons transmit initial form information to the next area to overcome the illusion formed at the crossing junctions of overlapping stimuli. The initial motion and form information are prosessed by the MT neurons in the next stage (Zarei Eskikand et al., 2018, 2019). A summary of different types of neurons in the model is presented in Table 1. The size of the receptive fields in Table 1 considers only the center of the receptive fields of the neurons and does not include the surround of the receptive fields. In contrast to the previous computational models by Zarei Eskikand et al. (2018, 2019), where there are two different types of MT neurons—integration and segmentation neurons—the model presented here introduces only a single type of MT neuron that has adaptive center-surround interactions depending on the characteristics of the input stimulus. Therefore, the categorization of MT neurons into segmentation and integration neurons depends on the input stimulus rather than being a property of the neuron. The modulatory surrounds of the MT neurons, which accord with existing neurophysiological data, improve the flexibility of the modeled neurons in response to changes in stimuli. The observation that this change in center-surround interaction depends on contrast suggests that the strategy of the visual system is to integrate motion signals when they are weak, thereby increasing sensitivity to areas of the image with low contrast (Pack et al., 2005).
Table 1
| Type | Definition/Role | Direction selective | Orientation selective | Contrast sensitive | Size of the RF | |
|---|---|---|---|---|---|---|
| V1 | Complex V1 neurons | Extract initial motion information. Determine the borders of the stimulus. | YES | YES | NO | 1° |
| End-stopped neurons | Respond only to the motion information of the end-points of the stimulus to overcome aperture problem. | YES | YES | NO | 1° | |
| ECRF V1 neurons | Extract initial form information. Differentiate the intrinsic terminators from extrinsic terminators in the case of overlapping stimuli. | NO | YES | YES | 1° | |
| MT | MT neurons | Integration of local motion signals to estimate the direction of motion and segregation of overlapping stimuli that are moving in different directions. | YES | YES | YES | 7° |
Different types of the neurons in the model and their functions, and receptive field (RF) sizes.
This proposed relationship between neurons in the model creates the potential to easily extend the model to cover other areas of the visual cortex and explain many of the neurophysiological findings in relation to the motion of different stimuli.
V1 Neurons
Initial motion information is extracted by standard complex V1 neurons that have small receptive fields. The model of standard complex V1 neurons is based on the motion energy filter, which is a modified version of the Reichardt motion detector (Adelson and Bergen, 1985; Van Santen and Sperling, 1985). The receptive fields of these neurons are spatiotemporal filters. The spatial filter of these motion detectors is modeled by a Gabor function and the temporal filter is a multi-stage low pass filter modeled by
where τg is the time constant of the filter and n takes values of 6 and 9, simulating the delay between two different temporal filters to compute the motion of the stimulus. The neurons are selective to eight different directions (Zarei Eskikand et al., 2016). The value of the parameters of the model are listed in Table 2.
Table 2
| Description | Parameter | Value |
|---|---|---|
| Connection strength of input to the end-stopped neurons | 2 | |
| Connection strength of inhibitory connections on end-stopped neurons | 3 | |
| Connection strength of complex V1 neurons to MT neurons | 0.5 | |
| Connection strength of end-stopped V1 neurons to MT neurons | 1 | |
| Connection strength of center-surround V1 neurons to component MT neurons | 2 | |
| Connection strength of excitatory connections to MT neurons | 0.1 | |
| Connection strength of inter-directional inhibitory connections | 1 | |
| Connection strength of long-range inhibitory connections | 1 | |
| Spatial extent of the inhibitory connections defined by the number of the neighboring neurons | ϕ | 6 |
| Number of neurons at each location-selective to different directions | N | 8 |
| Constant value for the threshold on the activity of complex V1 neurons | ρcx | 0.13 |
| Spatial extent of the surround of MT neurons | M | 49 |
| The threshold on the changes in the contrast level | cr | 0.2 |
| The slope of changes in the level of suppression with contrast | a | −10 |
| The slope of changes in the level of suppression with the activity level of the neurons with different direction selectivity | b | 2.4 |
| The limitation constant on the level of the surround suppression | k | 2 |
| Constant value for the threshold on the activity of MT neurons | Tmt | 0.001 |
| Constant value for the threshold on the activity of end-stopped neurons | ρes | 0.3 |
| Decay rate of the activity of MT neurons | τmt | 0.01 |
| Decay rate of the activity end-stopped neurons | τes | 0.01 |
| Simulation time step | Δt | 0.01 |
| Time constant of the temporal filter | τg | 0.01 |
| Time delay of inhibition between MT neurons | Tin | 0.4 |
| Spatial frequency | f | 1.1 |
| Standard deviation of horizontal spatial Gaussian filter | σx | 0.5 |
| Standard deviation of vertical spatial Gaussian filter | σy | 0.5 |
| Standard deviation of center portion of horizontal spatial Gaussian filter | σx_c | 0.35 |
| Standard deviation of center portion of vertical spatial Gaussian filter | σy_c | 0.4 |
| Standard deviation of surround portion of horizontal spatial Gaussian filter | σx_s | 0.4 |
| Standard deviation of surround portion of vertical spatial Gaussian filter | σy_s | 0.5 |
| Strength of center portion of spatial Gaussian filter | Ac | 1 |
| Strength of surround portion of spatial Gaussian filter | As | 0.72 |
The constant parameters used in the model, their values, and their units.
Standard complex V1 neurons detect only local motion signals because of their small receptive fields, which results in the aperture problem. However, the neurons at the terminators (corners) of the stimulus provide unambiguous motion signals because the corners are two-dimensional. To suppress the ambiguous activity of standard complex V1 neurons along the long-edges, a model of end-stopped V1 neurons is essential, which respond only to the terminators of the stimulus (Hubel and Wiesel, 1965; Pack et al., 2003). The activity of end-stopped V1 neurons is modeled using inhibitory interconnections between neurons. This inhibition is effective only if the neighboring neurons have activity above a threshold level (Tsui et al., 2010). The activity of these neurons is modeled by
where is the activity of an end-stopped neuron selective to direction θ (eight different directions: 0°, 45°, 90°, …) located at the coordinate (x,y), is the activity of the standard complex neuron in the same location and direction, τes is a decay rate, and and are constant gains. The activities of the standard complex V1 neurons are the outputs of the motion energy filters, which are computed from the correlation of the input image with the spatiotemporal filters. The parameter Γx,y,θ is the inhibition that the end-stopped neuron receives from standard complex V1 neurons when the activity of the neighboring standard complex neurons is above the threshold, ,
where μi,j is the inhibitory connectivity matrix that extends across a patch of 8x8 neighboring neurons and has a discretized Gaussian shape (Zarei Eskikand et al., 2016). These lateral inhibitory connections between standard complex V1 neurons (in our model) result in the end-stopping feature. In our proposed scheme, there are no interactions from the neurons that have end-stopping features upon those that do not have this feature (complex V1 neurons).
The activities of end-stopped neurons at the intrinsic terminators, the actual end-points of the stimulus, carry unambiguous motion information, while the direction perception of the motion by end-stopped neurons at the extrinsic terminators, formed when objects overlap, conflicts with the global movement of the stimulus.
Apart from the standard complex V1 neurons and end-stopped neurons, there is another set of V1 neurons in our model (ECRF neurons) that play a significant role in estimating the accurate direction of movement of the objects when they are overlapped. These modeled neurons have a high level of activity at the intrinsic terminators and their activities are strongly inhibited at the extrinsic terminators because of the stimulation of their suppressive surrounds by the border of the stimulus. To consider the suppressive surround of the neurons, the receptive field is modeled as a difference of Gaussians (DoG),
where (xo, yo) are oriented coordinates with orientation o at spatial location (x, y). These oriented coordinates result in the orientation selectivity of the receptive fields of these neurons. The parameters and are the standard deviations of the centers of the receptive fields, and are standard deviations of the surround, and AC and AS are constants.
The defined model of the receptive field of the neurons is applied on the stimulus to compute initial form information. The activity levels of standard complex V1 neurons are gated by the activity of ECRF neurons such that the resulting standard complex V1 outputs have a high level of motion information only when ECRF neurons are active, which occurs at the intrinsic terminators (Zarei Eskikand et al., 2018).
MT Neurons
The MT neurons in the model are adaptive to changes in the contrast of the stimulus as the center-surround interaction of MT neurons depends on the contrast of the input stimulus. The neurons have an integrative surround in response to input motion coherence when the motion of the center is in the same direction as the motion of the surround. However, this integrative surround can switch to an antagonistic surround when there is a discontinuity in the input stimulus that results in the suppression of its activity.
MT neurons receive excitatory input from standard V1 and end-stopped neurons. The gain of the excitatory input of the end-stopped neurons is higher than the activity of the complex V1 neurons to help deal with the aperture problem. The activity of MT neurons is also enhanced by the received input as the result of the interaction of form and motion information, which is computed by the activity of standard complex V1 neurons, gated by ECRF neurons. This excitatory input is essential to strengthen the influence of the intrinsic terminators compared to the extrinsic terminators. MT neurons receive excitatory connections from their neighboring neurons to propagate the unambiguous activity from the terminators along the whole of the object. The level of the excitatory connection, λx,y,θ, is gated by the summation of the activity of ECRF neurons, , over different orientations, and it is computed as
where is the activity level of the MT neuron selective to direction θ at location (x, y). There is an inter-directional inhibition between MT neurons at the same location, γx,y,θ computed as
where is the activity of the MT neuron selective to direction ϕ. Introducing the inhibitory connections between neurons selective to different directions results in winner-take-all operation in the model. Apart from this inter-directional inhibition, there is another set of inhibitory connections between neighboring neurons selective to different directions. These inhibitory connections assist MT neurons in the propagation of activity to other regions by suppressing the activity of the neurons with different direction selectivities. These interactions are modeled by
where ϕ determines the spatial extent of these inhibitory connections.
The last type of interconnection between neurons is defined by the interaction with each neuron's receptive field surround. The integrative surround of the MT neurons promotes the propagation of activity of MT neurons, while the antagonistic surround prevents the propagation of activity of MT neurons; i.e., where there is a discontinuity in the input stimulus resulting from the border of the stimulus or an overlap with another object. The level of coherency is determined using the form information provided by the activity of ECRF, which neurons that are sensitive to the luminance of the stimulus. The level of this contrast-dependent input (Figure 2) is computed as
where S() is a sigmoid function,
Λx,y indicates the level of the changes in the contrast of the stimulus, and △x,y,θ is the discontinuity in the motion information occurring when there is another object moving in a different direction in the neuron's surround. The contrast level, Λx,y, is computed by measuring the alterations in the levels of intensity of the neighboring pixels of the input image and normalizing this value by
where Ix,y is the intensity of the input image at location (x, y) and M is the spatial extent of the surround area of the MT neuron. △x,y,θ is obtained as the result of the weighted summation of activities of complex V1 neurons selective to different directions and normalizing this value between 0 and 1,
where M is spatial extent of the surround area of the neuron. The effectiveness weight of the activity of complex V1 neurons selective to the direction other than θ depends on the angular difference of the selective direction with θ.
Figure 2
The term, α, in equation (8) indicates the level of coherency in the input stimulus when the changes in the contrast level are below the threshold and is given by
The term, β, in equation (8) is the level of coherency of motion information across neurons selective to different directions, which is computed by
Other parameters in equation (8) have constant values: cr indicates the minimum of the contrast that triggers the suppressive surround, k limits the value of the level of surround suppression to saturate at 1, and a and b determine the slope of the changes in the level of suppression with contrast and the activity level of neurons with different direction selectivities.
The term, , within the sigmoid function represents the increase in the level of surround suppression where there is a high level of variation in the intensity of the input image. The term, bΔx,y,θ, refers to an increase in the suppression level of the surround where the neighboring neurons selective to other directions are active. The term, α.β, models the integrative aspect of the surround where there is coherency in the input image and the neighboring neurons selective to other directions have a low level of activity while there is activity in the center. The surround of the neuron has a suppressive or integrative effect only when the center is activated by motion information. Therefore, the level of the surround effect is gated by the activity of the MT neuron at the center, described by
The general influence of the surround suppression when there is motion in different directions upon the strength of the MT neuron activities, Δx,y,θ, is a shift from 0 to 1, as shown in Figure 3A. Figure 3B shows the value of the surround suppression when the contrast level is changed at location (x, y), Λx,y, from 0 to 1 when there is no motion detected at directions other than θ.
Figure 3
The general temporal dynamic behavior of component MT neurons is defined by
where λx,y,θ is the excitation from neighboring MT neurons, κx,y,θ is excitatory input from standard V1 neurons gated by the activity of ECRF neurons, γx,y,θ is inter-directional inhibition, ξx,y,θ is long-range inhibition, and χx,y,θ is the effect of the surround on these neurons that depends on the coherency of the stimulus. These neurons receive excitatory input from V1 standard complex and end-stopped neurons, and , respectively. Finally, h() is a piece-wise linear saturation function that keeps the level of activity within a specified range (between 0 and 1),
Table 3 summarizes the description and the role of each component in the equation 15.
Table 3
| Component | Description | Contribution |
|---|---|---|
| Input from standard complex V1 neurons | Provides initial motion information | |
| Input from the end-stopped V1 neurons | Deal with the aperture problem | |
| Excitation from neighboring MT neurons | Propagation of the activity of the MT neurons | |
| Interaction of ECRF neurons with standard complex V1 neurons | Discriminates the extrinsic terminators from the intrinsic terminators | |
| Inter-directional inhibition | Implements of the winner-take-all operation | |
| Long range inhibition | Assists in propagation of the activity of the MT neurons | |
| Decay term for the activity of the neurons | Low-pass filtering due to membrane dynamics | |
| χx,y,θ | The effect of the surround of the MT neurons | Modulatory effect of the surround |
Description of the different components of equation 15.
There is explicit evidence of the existence of end-stopped V1 neurons (Hubel and Wiesel, 1965; Pack et al., 2003; Tsui et al., 2010). According to the findings of Movshon and Newsome (1996) these neurons project to area MT, which we have incorporated into our model. There is also evidence for the existence of the orientation selective V1 neurons that are contrast sensitive and have suppressive surrounds (Sceniak et al., 2001). These neurons are included as ECRF neurons in our model and their role in motion detection of the moving stimuli is described. There are also neurophysiological experiments showing the center-surround interactions of the MT neurons and the input that they receive from special complex V1 neurons. The modulatory surround effects of the MT neurons are shown in the neurophysiological experiments by Huang et al. (2007, 2008). However, the circuitry of this adaptive modulatory effect is unknown. A possible mechanism for this adaptive modulatory effect of the center-surround interaction in MT is proposed in our model (Table 4).
Table 4
| Existence of different subtypes of neurons/ connections/ circuitry of the mechanisms/ phenomena | Known | Selected references |
|---|---|---|
| Direction selective standard complex V1 neurons | ✓ | Hubel and Wiesel (1962); Dreher (1972); Movshon (1975); Movshon et al. (1978a,b) |
| End-stopped V1 neurons | ✓ | Hubel and Wiesel (1965); Sceniak et al. (2001); Pack et al. (2003); Tsui et al. (2010) |
| Orientation selective V1 neurons with suppressive surround (ECRF neurons) | ✓ | Cavanaugh et al. (2002) |
| Component and pattern selective MT neurons | ✓ | Adelson and Movshon (1982); Albright (1984); Rodman and Albright (1989); Livingstone et al. (2001); Born and Bradley (2005) |
| Difference in the temporal dynamics of the component and pattern MT neurons | ✓ | Smith et al. (2005, 2009) |
| Projection of the complex V1 neurons to MT area | ✓ | Maunsell and van Essen (1983); Movshon and Newsome (1996) |
| Projection of the end-stopped V1 neurons to MT area | ✓ | Movshon and Newsome (1996); Sceniak et al. (2001) |
| Projection of the ECRF neurons to MT area | Hypothesized in the model | |
| Suppressive effect of the surround in V1 | ✓ | Hubel and Wiesel (1968); Cavanaugh et al. (2002) |
| Center-surround interaction of MT neurons | ✓ | Allman et al. (1985); Raiguel et al. (1995); Albright and Stoner (2002); Born and Bradley (2005) |
| Adaptive modulatory effect of the surround | ✓ | Huang et al. (2007, 2008) |
| Circuitry of the modulatory effect of the surround | Hypothesized in the model | |
| Contrast dependency of the pattern selectivity of the MT neurons | ✓ | Kumbhani et al. (2008) |
| Contrast dependency of the suppressive effect of the surround in MT neurons | ✓ | Pack et al. (2005) |
| The activity of V1 neurons is gated by the activity of the ECRF neurons | Hypothesized in the model | |
| The level of the excitatory connections between MT neurons depends on ECRF V1 neurons | Hypothesized in the model |
A summary of the known neurophysiological findings and unknown features that are hypothesized in the model.
Results
Initial motion signals are extracted by standard complex neurons in V1. The small receptive fields of these neurons result in ambiguous motion information. The activities of these neurons in response to two overlapping moving bars with the same contrast are shown in Figure 4A. The neurons selective to directions perpendicular to the edges of the bars have high levels of activity at these locations because of the aperture problem. The activities of neurons at the intrinsic terminators express accurate estimations of the directions of movement of the bars. The neurons selective to the upward direction also display high levels of activity at the extrinsic terminators formed at the crossing points of the bars.
Figure 4
The existence of end-stopped neurons is essential for MT neurons to differentiate unambiguous motion information of the terminators from the ambiguous motion information that arises from the aperture problem. Figure 4B shows the activities of the end-stopped neurons in response to the two overlapped moving bars. Among the neurons selective to the rightward direction, those at the intrinsic terminators of the bar that is moving to the right, have high levels of activity. Similarly, among the neurons selective to the leftward direction, the end-points of the bar that is moving to the left, have high levels of activity. Apart from the intrinsic terminators with unambiguous motion information, end-stopped neurons selective to upward directions have higher levels of activity at the extrinsic terminators, which represent upward local motion of the crossing junction of the overlapping bars.
To discriminate the motion signals of the intrinsic from the extrinsic terminators, the third set of luminance sensitive V1 neurons with suppressive surrounds play a significant role. The activities of these neurons are shown in Figure 4C. They are strongly suppressed at the extrinsic terminators where the inhibitory surrounds of the neurons are more stimulated compared to the intrinsic terminators. Therefore, excitatory connections from these neurons assist the modeled MT neurons to differentiate the unambiguous motion signals of the intrinsic terminators from the local motion signals of the extrinsic terminators.
The activities of the three sets of V1 neurons are transmitted to the neurons in MT. The surrounds of the MT neurons have an excitatory effect when there is motion coherency. The excitatory effects of the surround assist MT neurons to propagate motion signals. The surrounds of the neurons have an inhibitory effect at the discontinuities where the contrast level is high, which assist MT neurons to segregate the motion signals of different moving objects. The disambiguation of the motion signals by MT neurons takes 30 ms. Figure 4D shows the activities of MT neurons. MT neurons represent the correct direction of the motion of the overlapping bars. The MT neurons selective to the rightward direction have high levels of activity along the edges of the bar moving to the right and the MT neurons selective to the leftward direction have high levels of activity along the edges of the bar that is moving to the left. Without the contrast adaptive property of the MT neurons, the model requires two sets of MT neurons, named integration and segmentation MT neurons, for the correct estimation of motion. Introducing the contrast adaptive feature of the MT neurons, which also accords with existing neurophysiological findings, creates a unified model of MT neurons.
The spatial dynamics of the surround effect of the MT neurons selective to the leftward direction is shown in Figure 5. The surround regions of the neurons are suppressive at the sharp edges of the bar, which is moving to the left. The surrounds of the neurons selective to the leftward direction are not activated along the edge of the bar, which is moving to the right. The inhibitory effect of the surround switches to the excitatory effect along the bar to assist MT neurons in propagating the motion information from the intrinsic terminators.
Figure 5
To investigate the pattern motion selectivity of MT neurons, we examine the response of the model to a plaid pattern, which is obtained by occluding the intrinsic terminators of the overlapping moving bars. The responses of the MT neurons to the pattern or component motion of the stimulus are highly dependent on the connection strengths of the received inputs computed by standard V1 and ECRF neurons. The activities of the MT neurons with strong connections associated with the interaction of ECRF and standard V1 neurons are shown in Figure 6B.
Figure 6
The interaction with the ECRF neurons prevents the propagation of motion information from the extrinsic terminators along the whole of the stimulus. Therefore, MT neurons respond to the component motion information that they receive from standard V1 neurons and their response does not change over time, in contrast to the pattern MT neurons. The MT neurons respond to the pattern motion of the stimulus when there is a weak connection from the ECRF neurons. The activities of the neurons responding to the pattern motion of the stimuli are shown in Figure 6A.
The motion information of the extrinsic terminators is not suppressed when there is a weak connection from ECRF neurons to the pattern MT neurons. Therefore, the activities of the neurons in response to the local motion of the extrinsic terminators propagate to the other regions. Pattern MT neurons represent the component motion of the stimulus at the beginning and they respond to the pattern motion after a delay. This is because of the time required for the propagation of the motion information from extrinsic terminators.
Figure 7A shows the activities of pattern MT neurons responding to the pattern motion in the case of a difference in the contrast of the bars. The MT neurons selective to the upward direction have a high level of activity in response to the upward pattern motion. Figure 7B shows the activities of component MT neurons in response to the pattern motion. The neurons are able to detect the individual component motions and, as the contrasts of the bars are different, they are also able to estimate which bar is moving in front.
Figure 7
Discussion
We developed a model of MT neurons that adapts its properties depending on the input stimulus. The activities of these neurons are influenced by their ECRFs, which have directionally antagonistic or facilitatory effects depending on the properties of the input stimulus. For preferred direction motion, the surround regions are suppressive if there are discontinuities in the visual field or the contrast levels of the stimulus are high. The amount of suppression increases with increases in contrast. The surrounds are facilitatory when there is coherency in the directions of motion across the visual field, which assists the modeled MT neurons in the propagation of motion signals. Our model explains the circuitry of this modulatory effect of surrounds and provides insights into the mechanism by which different surround effects arise depending on the properties of the input stimulus.
The modulatory effect of the surround of MT neurons has been reported in several neurophysiological findings. For example, the neurophysiological experiments by Huang et al. (2007) showed that integration-MT neurons switch to segmentation-MT neurons depending on the ambiguity in the motion information of the stimulus. According to their findings, MT neurons are able to overcome the aperture problem by being integrative but can also code segmentation when stimulated by random dot patterns (Huang et al., 2007, 2008). The psychophysical studies by Tadin et al. (2003) also provide some insights on the possibility of the contrast dependency of the surround. The adaptive change of the relationship between the center and surround regions of MT neurons with contrast has also been shown by other studies. The experiments by Pack et al. (2005) showed a reduction in the suppression level as stimulus contrast was decreased.
The surrounds of the MT neurons in our model are contrast adaptive. Therefore, they have the characteristics of integration MT neurons when there is coherency in the input stimulus to facilitate the propagation of motion signals from bar terminators to other regions. The surrounds of the neurons have a suppressive effect in the case of high contrast at motion discontinuities. The pattern or component selectivity of the MT neurons depends on the input that they receive from the interaction of the ECRF neurons and standard complex V1 neurons. The strong connections from these neurons provide form information to assist MT neurons in suppressing the effects of the extrinsic terminators and respond to the component motions of the stimuli. Weak connections from these neurons result in pattern motion selectivity of the MT neurons. The pattern selectivity of the MT neurons is highly dependent on the contrast of the stimulus. The pattern motion selectivity index drops significantly when the contrast of the overlapping bars differs. The difference in the contrast of the overlapping bars results in the formation of illusionary depth when it appears that one of the bars is sliding over the other bar, which results in the dominance of component over pattern motion selectivity. The experiments by Kumbhani et al. (2008) supports the effect of the contrast on the pattern selectivity of the neurons. The results of their experiments show that the pattern selectivity index of the MT neurons drops significantly if the contrast of one of the gratings is reduced (Kumbhani et al., 2008).
In accordance with these neurophysiological findings, MT neurons are not categorized into integration or segmentation neurons in our proposed model. They represent the integrative or segregated features of the MT neurons depending on the characteristics of the input stimulus. Therefore, the activities of the MT neurons are influenced by their adaptive surround, which changes over time and space depending on the stimuli. Also, our model suggests that MT neurons are not grouped into different types of pattern or component MT neurons. Their level of pattern selectivity is only determined by the strength of the connections from ECRF neurons. Therefore, they are selective to the component motions of the stimuli when they receive strong input from ECRF neurons, and they represent the feature of the pattern selective MT neurons when the strength of the connection from the form processing input is weak or there is no connection. This theory on the pattern motion selectivity of the MT neurons contrasts with the modeling works by Simoncelli and Heeger (1998) and Rust et al. (2006). Simoncelli and Heeger (1998) suggested a linear-nonlinear model, which was further developed by Rust et al. (2006). The non-linear normalization in these models suppresses the unambiguous motion signals resulting from the aperture problem, which is similar to the role of end-stopping neurons in our model. The mechanism for pattern motion detection in the linear-nonlinear model is based on the hierarchical relationship between component and pattern MT neurons. Our model suggests an entirely different mechanism for pattern motion selectivity of MT neurons, which can explain the spatial and temporal limits on pattern motion selectivity of MT neurons observed in the experiments by Majaj et al. (2007) and Kumbhani et al. (2015). However, the linear-nonlinear model is not capable of explaining these features of pattern MT neurons. Due to the simulation of the propagation of activity, our model can replicate the temporal dynamics of component and pattern MT neurons observed by Smith et al. (2005, 2009), which shows a temporal delay in the pattern motion detection of the MT neurons. The linear-nonlinear model lacks these properties of the MT neurons (Simoncelli and Heeger, 1998; Rust et al., 2006).
Statements
Data availability statement
All datasets generated for this study are included in the article/supplementary material.
Author contributions
PZ, TK, MI, AB, and DG: conceptualization, methodology, validation, and writing–review and editing. TK, MI, AB, and DG: funding acquisition and supervision. DG: project administration and resources. PZ: investigation, software, visualization, formal analysis, and writing–original draft. All authors contributed to the article and approved the submitted version.
Funding
This research was supported by the Australian Research Council through Discovery Grants [DE120102210 and DP140104533], the ARC Center of Excellence for Integrative Brain Function [CE140100007] and Victorian Life Sciences Computation Initiative (VLSCI) grant number VR0138 on its Peak Computing Facility (Melbourne Bioinformatics) at the University of Melbourne, an initiative of the Victorian Government.
Acknowledgments
PZ acknowledges a postgraduate scholarship from the National Information and Communication Technology Australia (NICTA- http://www.nicta.com.au/). NICTA was funded by the Australian Government as represented by the Department of Broadband, Communications and the Digital Economy (http://www.communications.gov.au/) and the Australian Research Council through the ICT Center of Excellence program.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
1
AdelsonE. H.BergenJ. R. (1985). Spatiotemporal energy models for the perception of motion. J. Optical Soc. Am. A2, 284–299. 10.1364/JOSAA.2.000284
2
AdelsonE. H.MovshonJ. A. (1982). Phenomenal coherence of moving visual patterns. Nature300, 523–525. 10.1038/300523a0
3
AlbrightT. D. (1984). Direction and orientation selectivity of neurons in visual area MT of the macaque. J. Neurophysiol.52, 1106–1130. 10.1152/jn.1984.52.6.1106
4
AlbrightT. D.StonerG. R. (2002). Contextual influences on visual processing. Annual Rev. Neurosci.25, 339–379. 10.1146/annurev.neuro.25.112701.142900
5
AllmanJ.MiezinF.McGuinnessE. (1985). Direction- and velocity-specific responses from beyond the classical receptive field in the middle temporal visual area (MT). Perception14, 105–126. 10.1068/p140105
6
BarlowH. B.LevickW. R. (1965). The mechanism of directionally selective units in rabbit's retina. J. Physiol.178, 477–504. 10.1113/jphysiol.1965.sp007638
7
BornR. T.BradleyD. C. (2005). Structure and function of visual area MT. Annual Rev. Neurosci.28, 157–189. 10.1146/annurev.neuro.26.041002.131052
8
CavanaughJ. R.BairW.MovshonJ. A. (2002). Selectivity and spatial distribution of signals from the receptive field surround in macaque V1 neurons. J. Neurophysiol.88, 2547–2556. 10.1152/jn.00693.2001
9
DreherB. (1972). Hypercomplex cells in the cat's striate cortex. Invest. Ophthalmol. Visual Sci. 11, 355–356.
10
HuangX.AlbrightT. D.StonerG. R. (2007). Adaptive surround modulation in cortical area MT. Neuron53, 761–770. 10.1016/j.neuron.2007.01.032
11
HuangX.AlbrightT. D.StonerG. R. (2008). Stimulus dependency and mechanisms of surround modulation in cortical area MT. J. Neurosci.28, 13889–13906. 10.1523/JNEUROSCI.1946-08.2008
12
HubelD. H.WieselT. N. (1962). Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J. Physiol.160, 106–154. 10.1113/jphysiol.1962.sp006837
13
HubelD. H.WieselT. N. (1965). Receptive fields and functional architecture in two non-striate visual areas (18 and 19) of the cat. J. Neurophysiol.28, 229–289. 10.1152/jn.1965.28.2.229
14
HubelD. H.WieselT. N. (1968). Receptive fields and functional architecture of monkey striate cortex. J. Physiol.195, 215–243. 10.1113/jphysiol.1968.sp008455
15
KayK. N. (2018). Principles for models of neural information processing. NeuroImage180, 101–109. 10.1016/j.neuroimage.2017.08.016
16
KumbhaniR.SaberG.MajajN.TailbyC.MovshonJ. (2008). Contrast affects pattern direction selectivity in macaque MT neurons. SFN Annual Meeting, Washington, DC.
17
KumbhaniR. D.El-ShamaylehY.MovshonJ. A. (2015). Temporal and spatial limits of pattern motion sensitivity in macaque MT neurons. J. Neurophysiol.7, 1977–1988. 10.1152/jn.00597.2014
18
LivingstoneM. S.PackC. C.BornR. T. (2001). Two-dimensional substructure of MT receptive fields. Neuron30, 781–793. 10.1016/S0896-6273(01)00313-0
19
MajajN. J.CarandiniM.MovshonJ. A. (2007). Motion integration by neurons in macaque MT is local, not global. J. Neurosci.2, 366–370. 10.1523/JNEUROSCI.3183-06.2007
20
MaunsellJ. H.van EssenD. C. (1983). The connections of the middle temporal visual area (MT) and their relationship to a cortical hierarchy in the macaque monkey. J. Neurosci.3, 2563–2586. 10.1523/JNEUROSCI.03-12-02563.1983
21
MovshonJ. A. (1975). The velocity tuning of single units in cat striate cortex. J. Physiol.249, 445–468. 10.1113/jphysiol.1975.sp011025
22
MovshonJ. A.NewsomeW. T. (1996). Visual response properties of striate cortical neurons projecting to area MT in macaque monkeys. J. Neurosci.16, 7733–7741. 10.1523/JNEUROSCI.16-23-07733.1996
23
MovshonJ. A.ThompsonI. D.TolhurstD. J. (1978a). Receptive field organization of complex cells in the cat's striate cortex. J. Physiol.283, 79–99. 10.1113/jphysiol.1978.sp012489
24
MovshonJ. A.ThompsonI. D.TolhurstD. J. (1978b). Spatial and temporal contrast sensitivity of neurones in areas 17 and 18 of the cat's visual cortex. J. Physiol.283, 101–120. 10.1113/jphysiol.1978.sp012490
25
PackC. C.HunterJ. N.BornR. T. (2005). Contrast dependence of suppressive influences in cortical area MT of alert macaque. J. Neurophysiol.93, 1809–1815. 10.1152/jn.00629.2004
26
PackC. C.LivingstoneM. S.DuffyK. R.BornR. T. (2003). End-stopping and the aperture problem: Two-dimensional motion signals in macaque V1. Neuron39, 671–680. 10.1016/S0896-6273(03)00439-2
27
RaiguelS.HulleM.XiaoD.MarcarV.OrbanG. (1995). Shape and spatial distribution of receptive fields and antagonistic motion surrounds in the middle temporal area (V5) of the macaque. Eur. J. Neurosci.7, 2064–2082. 10.1111/j.1460-9568.1995.tb00629.x
28
RodmanH.AlbrightT. (1989). Single-unit analysis of pattern-motion selective properties in the middle temporal visual area (MT). Exp. Brain Res.75, 53–64. 10.1007/BF00248530
29
RustN. C.ManteV.SimoncelliE. P.MovshonJ. A. (2006). How MT cells analyze the motion of visual patterns. Nat. Neurosci.11, 1421–1431. 10.1038/nn1786
30
SceniakM. P.HawkenM. J.ShapleyR. (2001). Visual spatial characterization of macaque V1 neurons. J. Neurophysiol.85, 1873–1887. 10.1152/jn.2001.85.5.1873
31
SimoncelliE. P.HeegerD. J. (1998). A model of neuronal responses in visual area MT. Vision Res.5, 743–761. 10.1016/S0042-6989(97)00183-1
32
SmithM. A.MajajN.MovshonJ. A. (2009). Dynamics of pattern motion computation. Dynamics of Visual Motion Processing. Springer. 55–72. 10.1007/978-1-4419-0781-3_3
33
SmithM. A.MajajN. J.MovshonJ. A. (2005). Dynamics of motion signaling by neurons in macaque area MT. Nat. Neurosci.8, 220–228. 10.1038/nn1382
34
TadinD.LappinJ. S.GilroyL. A.BlakeR. (2003). Perceptual consequences of centre–surround antagonism in visual motion processing. Nature424, 312–315. 10.1038/nature01800
35
ThieleA. (2007). Reconstructing the world: switching from segmentation to integration allows neurons in area MT to make “sense” of the visual scene. Neuron53, 623–625. 10.1016/j.neuron.2007.02.008
36
TsuiJ. M.HunterJ. N.BornR. T.PackC. C. (2010). The role of V1 surround suppression in MT motion integration. J. Neurophysiol.103, 3123–3138. 10.1152/jn.00654.2009
37
Van SantenJ. P.SperlingG. (1985). Elaborated reichardt detectors. J. Optical Soc. Am. A2, 300–321. 10.1364/JOSAA.2.000300
38
Zarei EskikandP.KamenevaT.BurkittA. N.GraydenD. B.IbbotsonM. R. (2019). Pattern motion processing by MT neurons. Front. Neural Circuits13:43. 10.3389/fncir.2019.00043
39
Zarei EskikandP.KamenevaT.IbbotsonM. R.BurkittA. N.GraydenD. B. (2016). A possible role for end-stopped V1 neurons in the perception of motion: a computational model. PLoS ONE11:e0164813. 10.1371/journal.pone.0164813
40
Zarei EskikandP.KamenevaT.IbbotsonM. R.BurkittA. N.GraydenD. B. (2018). A biologically-based computational model of visual cortex that overcomes the X-junction illusion. Neural Netw. 102, 10–20. 10.1016/j.neunet.2018.02.008
Summary
Keywords
vision, neural model, motion perception, middle temporal (MT), adaptive surround modulation
Citation
Zarei Eskikand P, Kameneva T, Burkitt AN, Grayden DB and Ibbotson MR (2020) Adaptive Surround Modulation of MT Neurons: A Computational Model. Front. Neural Circuits 14:529345. doi: 10.3389/fncir.2020.529345
Received
24 January 2020
Accepted
22 September 2020
Published
26 October 2020
Volume
14 - 2020
Edited by
Edward S. Ruthazer, McGill University, Canada
Reviewed by
Yong-Jie Li, University of Electronic Science and Technology of China, China; Ayako Wendy Ishikawa, Keio University, Japan; Dario L. Ringach, University of California, Los Angeles, United States
Updates
Copyright
© 2020 Zarei Eskikand, Kameneva, Burkitt, Grayden and Ibbotson.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Parvin Zarei Eskikand pzarei@unimelb.edu.au
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.