Distinct mechanisms subserve location- and object-based visual attention

Chou, Wei-Lun; Yeh, Su-Ling; Chen, Chien-Chung

doi:10.3389/fpsyg.2014.00456

ORIGINAL RESEARCH article

Front. Psychol., 21 May 2014

Sec. Perception Science

Volume 5 - 2014 | https://doi.org/10.3389/fpsyg.2014.00456

This article is part of the Research TopicUsing Noise to Characterize VisionView all 13 articles

Distinct mechanisms subserve location- and object-based visual attention

Wei-Lun Chou^1,2

Su-Ling Yeh^1,3*

Chien-Chung Chen^1,3

¹Department of Psychology, National Taiwan University, Taipei, Taiwan
²Department of Psychology, Fo Guang University, Yilan, Taiwan
³Neurobiology and Cognitive Science Center, National Taiwan University, Taipei, Taiwan

Visual attention can be allocated to either a location or an object, named location- or object-based attention, respectively. Despite the burgeoning evidence in support of the existence of two kinds of attention, little is known about their underlying mechanisms in terms of whether they are achieved by enhancing signal strength or excluding external noises. We adopted the noise-masking paradigm in conjunction with the double-rectangle method to probe the mechanisms of location-based attention and object-based attention. Two rectangles were shown, and one end of one rectangle was cued, followed by the target appearing at (a) the cued location; (b) the uncued end of the cued rectangle; and (c) the equal-distant end of the uncued rectangle. Observers were required to detect the target that was superimposed at different levels of noise contrast. We explored how attention affects performance by assessing the threshold versus external noise contrast (TvC) functions and fitted them with a divisive inhibition model. Results show that location-based attention – lower threshold at cued location than at uncued location – was observed at all noise levels, a signature of signal enhancement. However, object-based attention – lower threshold at the uncued end of the cued than at the uncued rectangle – was found only in high-noise conditions, a signature of noise exclusion. Findings here shed a new insight into the current theories of object-based attention.

Our visual world is full of information; however, not all can be selected for further processing due to limited capacity. Mechanisms of attention are thus employed to prioritize the processing of particular information. Past studies have shown that visual attention can be allocated either to a spatial location or to an object, called location-based attention or object-based attention, respectively (Posner, 1980; Duncan, 1984; Tipper et al., 1991; Egly et al., 1994; Gibson and Egeth, 1994; Brawn and Snowden, 2000).

In a seminal work, Egly et al. (1994) used a double-rectangle display to demonstrate both location-based attention and object-based attention. They presented two outlined rectangles, with one end of one rectangle brightened as a cue to indicate the possible location of a target. The target was a small solid square, shown subsequently within one end of a rectangle. Location-based attention was indicated by the spatial-cueing effect: reaction times (RTs) were shorter when the target appeared at the cued location than the uncued location. Object-based attention was indicated by the same-object advantage: RTs were shorter when the target appeared at the uncued end of the cued rectangle than at the uncued rectangle, with an equal cue-to-target distance between the two. Concurring with Egly et al. (1994), a series of studies using various stimuli and tasks have demonstrated the spatial-cueing effect and the same-object advantage (Moore et al., 1998; Abrams and Law, 2000; Lamy and Tsal, 2000; Moore and Fulton, 2005; Brown et al., 2006; Matsukura and Vecera, 2006; Shomstein and Behrmann, 2008).

The spatial-cueing effect has been explained by the movement of attention from one location to another in visual space. On valid trials, a shift of attention can be initiated to the expected target location before the target appears, thereby producing an RT or accuracy benefit (Posner, 1980). On the two kinds of invalid trials, however, a shift of attention would be initiated to a location on the wrong site of the display from the actual target location. This would produce an RT or accuracy cost because attention would need to be realigned with the correct target location after the target’s appearance.

The same-object advantage has been explained mainly by two competing theories. The spreading hypothesis states that when attention is cued to a location within an object, attention will spread automatically from the cued location to the whole object (e.g., Davis and Driver, 1997; Kasai and Kondo, 1997; Richard et al., 2008). Such spread of attention explains the participants’ better visual performance when the target was shown on the cued object than on the uncued object. Since the attentional modulation is triggered by a location cue and spreads to the whole object, the same-object advantage should be an instance of location-based attention. That is, the underlying mechanism of object-based attention is the same as that of location-based attention. In addition, it is shown that improvement of visual performance in a location-based attention task can be due to (a) the participant being more sensitive to a target at the cued location than that at the uncued one; and/or (b) the participant being less influenced by irrelevant visual information (Lu and Dosher, 1998). Hence, these two factors should be able to account for object-based attention as well, if it shares the same mechanism as location-based attention.

On the other hand, the prioritization hypothesis (Shomstein and Yantis, 2002) suggests that object-based attention reflects a specific attentional prioritization strategy rather than the modulation of an early sensory enhancement extending from the location-based attention. That is, the prioritization hypothesis does not take any position regarding the similarity of the mechanisms between location- and object-based attention. At best, it would predict different mechanisms for the exogenous spatial-cueing effect and the strategically object-based scanning strategy. Therefore, the same-object advantage cannot be explained by a change in early sensory mechanisms.

Here, we are interested in the mechanisms that subserve location- and object-based attention, especially whether the mechanisms underlying these two types of attention are the same. Notice that previous investigations adopting the double-rectangle method generally used RT measurement with a single level of task difficulty (Egly et al., 1994; Moore et al., 1998; Abrams and Law, 2000; Lamy and Tsal, 2000; Moore and Fulton, 2005; Brown et al., 2006; Shomstein and Behrmann, 2008). RT measurement may reflect processing speed, response bias, or a combination of the two (Ratcliff, 1978), making it hard to infer the underlying mechanisms. In addition, while an estimation of response variability is important to evaluate certain theories of location-based attention (Lu and Dosher, 1998), it is difficult to separate measurement error from the experimental procedure and the variability of the internal responses in the RT measurement.

We used a noise-masking paradigm (Nagaraja, 1964; Legge et al., 1987; Pelli, 1991; Lu and Dosher, 1998) that can evaluate the variability in the response of the visual system in the double-rectangle display to probe the mechanism(s) of location-based attention and object-based attention. In a typical noise-masking paradigm, the task of the observer is to detect a pre-designated target that is superimposed on a patch of white noise. In the context of our experiment, the target was a periodic pattern defined by a Gabor function, which is a product of a sine wave and a Gaussian envelope, while the noise was a random modulation of luminance. The intensity of the noise mask was defined by contrast, or the theoretical half range of the luminance modulation defined by a uniform distribution divided by the mean luminance. By systematically measuring the target threshold at different external noise levels, we can measure the threshold versus external noise contrast (TvC) functions. With an appropriate model, this information allows an estimation of the response properties and variability of the target detection mechanisms, thus providing a more comprehensive estimation of various perceptual mechanisms (Nagaraja, 1964; Legge et al., 1987; Pelli, 1991; Lu and Dosher, 1998; Chen and Tyler, 2001; Wu and Chen, 2010).

By taking advantage of the double-rectangle method, we evaluated the TvC functions of attended and unattended location/object within a single paradigm. In a two-alternative intervals choice task (Figure 1), participants were asked to detect a Gabor target that was superimposed on a noise pattern. The displays, if not stated otherwise, consisted of two vertical rectangles that were presented on each side of fixation. The four ends of the rectangles were where the cue (or target) was likely to occur. The target could occur at one of the three possible locations: the cued location (valid), the uncued location but on the cued object (same-object), or an equidistant location on the uncued object (different-object). Then, we measured the TvC functions for all the different conditions so that we can compare location-based attention and object-based attention and infer their mechanisms directly. If their mechanisms are identical, they should show the same kind of shift in the TvC functions.

FIGURE 1

FIGURE 1. Schematic overview of a typical valid trial with the target showing in interval 1. The task was to detect the target (a Gabor patch) superimposed on different levels of noise (mask) contrast in a two-alternative forced-choice paradigm. In each interval, a cue was flashed first for 16 ms, followed by a 64 ms blank, and then a stimulus presentation (either target-plus-noise mask or noise mask alone). Two intervals were separated by a 600 ms blank. The rectangles and the fixation point were always on-screen.

Materials and Methods

Ethics Statement

The use of human participants was approved by the IRB of National Taiwan University Hospital and followed the guideline of Helsinki Declaration. The written informed consent was obtained from each participant.

Apparatus

Two ViewSonic (15′′) CRT monitors, each driven by a Radeon 7200 graphic board, were used to present the stimuli. The graphic board provided 10-bit digital-to-analog converter depth and was controlled by a Macintosh computer. A beam splitter was used to combine lights from the two CRT monitors. The target was presented on one monitor and the cue and the external noise patch (mask) on the other. This two-monitor setup had the advantage that the contrast of the target could be controlled independently while keeping the context (the cue and the mask) identical in two intervals of a trial. At a viewing distance of 128 cm, the resolution on a 640 × 480 pixels monitor was 60 pixels per degree. The refresh rate of the monitors was 66 Hz. The viewing field was 10.7° × 8° (horizontal × vertical), and the mean luminance of the displays was 74.9 cd/m². The LightMouse photometer (Tyler and McBride, 1997) was used to measure the full-detailed input-output intensity function of the monitors, and this information was then used to compute linear lookup table settings so as to linearize the output within 0.2%.

Stimuli and Display

Figure 1 illustrates the stimuli and sequence of events for a trial. The displays are comprised of a pair of adjacent vertical rectangles. The fixation was a small dot. Each rectangle (1.63° × 4.88°, with a stroke width 0.13°) was centered 3° from fixation. The cue and the target were vertical Gabor patches defined by the following equation:

G (x, y, c, u_{x}, u_{y}) = L + L * c * \cos (2 π f x) * \exp (- \frac{{(x - u_{x})}^{2}}{2 σ^{2}}) * \exp (- \frac{{(y - u_{y})}^{2}}{2 σ^{2}})'

where L was the mean luminance, c was the contrast ranging from 0 to 1, f was the spatial frequency, σ was the scale parameter of the Gaussian envelope, u_x was the horizontal displacement, and u_y was the vertical displacement. Both Gabor patches had a spatial frequency (f) of 1.3 cycles/deg and a scale parameter (σ) of 0.3536°. The contrast of the cue (c) was -6 dB or 50%. For each external noise frame the pixel gray-levels were sampled from a Gaussian distribution.

Procedure

A two-alternative forced-choice paradigm was used to measure the threshold of the target (Figure 1). The cue was presented at one of four possible locations in each interval. After that, the target was presented at one of the three possible locations: (1) the cued location (valid trials), (2) the uncued end within the cued object (same-object trials), or (3) the uncued end within the uncued object (different-object trials) in one of the intervals.

A fixation display (a central fixation point and two outline rectangles) was presented first, followed by a 16-ms cue display, then a 64-ms fixation display, and finally a 96-ms target display (a target and four mask patches). The stimulus onset asynchrony between the cue and the target was 80 ms, the inter-stimulus-interval within a trial was 600 ms, and the inter-trial-interval was 800 ms. At the beginning of each trial an audio tone was presented as a signal to start. Correct and incorrect responses were followed by auditory feedbacks.

Each block of seven external noise levels (-∞,-26, -22, -18, -14, -10, -6 dB) were presented in random order, and each block contained the three attention conditions (valid, same-object, and different-object). The threshold was defined at 75% correct response level, measured by the PSI threshold-seeking algorithm (Kontsevich and Tyler, 1999). For each threshold measurement, two practice trials preceded 40 formal trials. Within a single block, four thresholds were measured in an interleaved way – two for the valid condition, one for the same-object and one for the different-object conditions, making the total number of valid trials (84 trials) twice as many as that of the same-object or different-object trials (42 trials). That is, the cue validity for predicting the target location was 50%. The sequence of trials was pseudo-randomized. The TvC function of the valid condition is the average of two threshold measurements. Each data point reported was an average of four to eight repeated measures. The task was to indicate which interval contained the target by pressing a corresponding key. Participants were told that the two outline rectangles were task-irrelevant, and they were well informed about the cue-target relationship.

Participants

Three participants with normal or corrected-to-normal visual acuity were tested. RY and TH were naïve as to the purposes of this study and WL was one of the authors.

Results

Figure 2 shows the result averaged across three participants. The blue circles and solid curve denote the TvC function for the valid condition; red squares and dash curve, the same-object condition; and green triangles and dash-dot curve, the different-object condition. To account for the individual difference in overall sensitivity to the target, we scaled each threshold by that measured at zero noise contrast of the valid condition of the corresponding participant before averaging. When there was no noise mask, the threshold for the valid condition was lower than that for both invalid conditions. The difference was 2 dB [t(2) = 3.46, p = 0.037 < 0.05] between the valid cue and both the invalid conditions. Such difference between the valid and invalid conditions remained as the mask increased. Thus, the TvC functions of the invalid conditions look like a vertically shifted copy of the valid condition on log–log coordinates. Such general facilitation on target detection suggests that the effect of the valid cue was to increase the sensitivity to the target (Cohn and Lasley, 1974; Lu and Dosher, 1998; Zenger et al., 2000; Pestilli and Carrasco, 2005; Chen and Tyler, 2010).

FIGURE 2

FIGURE 2. Target threshold versus noise contrast functions. Each data point represents the average of the normalized threshold from three observers. The blue circles and the solid curve denote the TvC function for the valid condition; the red squares and the dashed curve, the same-object condition; the green triangles and the dash-dot curve, the different-object condition. The smooth curves are fits of the model discussed in the text. The error bars are the estimated one standard error of normalized individual difference.

The target detection thresholds were not influenced by the low contrast noise mask for all attention conditions. As a result, all TvC functions were flat at low noise contrasts. When the noise contrast reached a critical value, the threshold began to increase with noise contrast. Here, whether or not the cue and the target were within the boundary of an object had an effect. The threshold increment for the different-object condition started at a lower noise contrast than that for the same-object condition. As a result, the TvC function for the different-object condition showed a leftward shift from the TvC function for the same-object condition. This suggests that the noise effect on target detection in the same-object condition is different from that in the different-object condition.

Our result cannot be explained by an inter-hemispherical effect. In a control condition, we used horizontal rectangles as the objects. We measured the target threshold at noise level -∞ and -6 dB. There was no statistical significant difference [t(11) = -1.1, p = 0.30] in target threshold between the vertical and the horizontal object configurations, averaged across all conditions and observers.

Model

We fitted the TvC functions by a version of the divisive inhibition model (Ross and Speed, 1991; Wilson and Humanski, 1993; Foley, 1994; Teo and Heeger, 1994; Watson and Solomon, 1997; Snowden and Hammett, 1998; Chen and Foley, 2004) modified to account for the noise-masking experiment (Lu and Dosher, 1998; Goris et al., 2008; Chen and Tyler, 2010). This model integrates features from the divisive inhibition models for pattern detection and discrimination (Foley, 1994; Chen and Foley, 2004) and conventional models for noise masking (e.g., Lu and Dosher, 1998). Chen and Tyler (2010) used a similar model to account for the cueing effect in a noise-masking paradigm. Figure 3 shows a diagram of this model. There are several stages in this model. The first stage is a band of linear filters operating on the input images. The excitation of a linear filter is then half-wave rectified, raised to a power and scaled by a divisive inhibition input to form the response of the target detector. The decision variable is the ratio of the response of the target detector and the noise from different sources.

FIGURE 3

FIGURE 3. Diagram of the model used to fit the data. See text for details.

Each mechanism j contains a linear operator within a spatial sensitivity profile f_j(x,y). The excitation of this linear operator to the i-th image component g_i(x,y) is specified as:

E_{{i j}^{'}} = \sum_{x} \sum_{y} f_{j} (x, y) g_{i} (x, y) (1)

where the linear filter f_j(x,y) is defined by a Gabor function (see “Materials and Methods”). Suppose that the image component g_i(x,y) has a contrast C_i. Summing over x and y, Eq. (1) can be simplified to

E_{{j i}^{'}} = {S e}_{j i} C_{i} (1^{'})

where Se_ji is a constant defining the excitatory sensitivity of the mechanism to the stimulus (j = t for the target and j = m for the mask). Detailed derivation of Eq. (1)’ from Eq. (1) has been discussed elsewhere (Chen and Tyler, 1999; Chen et al., 2000).

The excitation of the linear operator is half-wave rectified (Foley, 1994; Teo and Heeger, 1994; Foley and Chen, 1999) to produce the rectified excitation E_ji

E_{j i} = \max (E_{{j i}^{'}}, 0) (2)

where max denotes the operation of choosing the greater of the two numbers.

The total excitation of the j-th mechanism E_j is the sum of excitations produced by all image components. The response of the j-th detector is then E_j, raised by a power p and divided by a divisive inhibition term I_j plus an additive constant z. That is,

R_{j} = E_{j}^{p} / (I_{j} + z) (3)

where I_j is the summation of a non-linear combination of the excitations of all relevant mechanisms. This divisive inhibition term I_j can be represented as

I_{j} = \sum_{i} {({S i}_{j, i} C_{i})}^{q} (4)

where Si_j,i is the weight of the contribution from each component to the inhibition term.

The contribution of a detector to the visual performance is limited by the noise. We consider two sources of noise in this model: the internal noise inherent in the system, and the external noise provided by the noise patterns. The variability produced by the internal noise, $σ_{a}^{2}$ , is a constant for all detectors in the model. The variability produced by the external noise, $σ_{e}^{2}$ , is proportional to the square of the contrast noise mask; that is,

σ_{e}^{2} = w_{m} C_{m}^{2} (5)

where w_m is a scalar constant that determines the amount of contribution of the noise mask to the variance of the response. Pooling the effects of these two noise sources, the variance of the response distribution in each detector is

σ_{r}^{2} = (σ_{a}^{2} + σ_{e}^{2}) (6)

In the context of our experiment, the observer compared the response to the stimuli in both intervals at the three possible target locations. The observer can detect the target if the difference between the response to the target + mask, R_j_, _t+_m, and that to the mask alone, R_j_,_m, is greater in at least one channel than is the limitation imposed by the noise. In practice, we need to consider only the mechanism that produces the greatest response difference between the target + mask and the mask alone conditions. Thus, we can drop the subscript j for this study. That is, the decision variable d′ is,

d^{'} = (R_{m + t} - R_{m}) / {(2 σ_{r}^{2})}^{1 / 2} (7)

The threshold is defined when d′ reaches unity.

Table 1 shows the parameter of the model. To reduce the mathematical redundancy in the model, we fixed the sensitivity to the target, Se_t, for the valid cue condition to be 100 and the size of the internal noise, $σ_{a}^{2}$ to be 1. As shown in the Results section, the TvC functions for the invalid conditions are vertically shifted copies of the valid condition on log–log coordinates. As shown in Figure 4A, such vertical shift of TvC functions can be achieved by changing the sensitivity to the target, Se_t. Hence, our data suggest that the sensitivity to the target to be different for the valid and invalid cue conditions. This result is consistent with the models proposed by Reynolds and Heeger (2009), which suggested that spatial attention can operate in the early visual areas by affecting the attention field, and by Lu and Dosher (1998), which suggested that spatial cue enhances the target signal.

TABLE 1

TABLE 1. The estimated parameters of the model.

FIGURE 4

FIGURE 4. Performance signatures in threshold contrast versus external noise contrast (TvC) functions. (A) If the TvC functions are a vertically shifted copy of each other, that is, the same target would have different thresholds in the attended and unattended conditions, this suggests that the participant has a different sensitivity to the target in the two conditions. Hence, the effect of attention is to enhance the sensitivity [Se in Eq. (1)’] to the target in our model. (B) Suppose that the TvC functions for the attended condition is a rightward-shifted copy of the unattended condition. It means that the same external noise level can have different effects on target detection in the attended and unattended conditions. This suggests that attention allows the participants to exclude noise in the stimuli more easily. This corresponds to a reduction of the contribution from the external noise [w_m in Eq. (5)] in our model. (C) If the TvC functions showed a vertical shift at low noise contrast, but merged at high contrasts, the effect of attention is to reduce the internal noise. This corresponds to a reduction of the internal noise parameter [ $σ_{a}^{2}$ in Eq. (6)] in our model.

The TvC function for the different-object condition shifted to the left from that of the same-object condition. Such horizontal shift can be implemented a change in the relative contribution of the external noise w_m (Figure 4B). Thus, our result suggests that the contribution of the external noise to the response variance, w_m, is different in the same-object and the different-object conditions. Notice that in the valid condition, the target and the cue were also presented within the boundary of the same object. Therefore, we constrained all parameters to be the same across conditions except for sensitivity to the target, Se_t, and the contribution of the external noise, w_m. This model fits the data well; the root of mean squared error (RMSE) was 0.27. This model explains 98.61% of all variance in the averaged data.

To further validate our interpretation of the data, we tried various constraints to the model. If we constrained the sensitivity to the target, Se_t, to be the same for all conditions, the sum of squared error (SSE) of the model increased significantly [F(1,12) = 73.82, p < 0.0001] even when we took the number of free parameters into account. Similarly, constraining the contribution of the external noise, w_m, to be the same for both invalid conditions significantly increased the SSE [F(1,12) = 16.63, p < 0.05]. Therefore, the change of sensitivity to the target is necessary to explain the spatial-cueing effect while the change of the contribution of the external noise is necessary to explain the same-object advantage.

Lu and Dosher (1998) suggested a mechanism of internal noise reduction for attention. That is, the effect of the cue is to reduce the effect of the additive noise in the system. In our model, this can be implemented by changing the value of the internal noise parameter σ_a. As shown in Figure 4C, such change in parameter value will cause TvC function to shift vertically in the low noise contrasts. However, the TvC function would merge together at high contrasts. We did not find such a trend in our data. Hence, our result cannot be explained by a reduction of additive internal noise. We also found that more free parameters in the model never produced a significant improvement of goodness-of-fit. Thus, no extra factors are necessary to explain our results.

Discussion

The current study systematically probed the target threshold improvement by location- and object-based attention with different noise levels using the double-rectangle method, and the results suggest that location- and object-based attention involve different mechanisms. Location-based attention operates by enhancing signal strength, whereas object-based attention operates by excluding external noise. This study is the first to demonstrate the discrepancy in the TvC functions of location- and object-based attention within a single task.

In previous studies, location- and object-based attention were examined separately by the noise-masking paradigm. Location-based attention was observed in both no-noise and high-noise conditions (Dosher and Lu, 2000; Lu and Dosher, 2000), consistent with our results. However, Han et al. (2003) found that object-based attention was also observed in both no-noise and high-noise conditions, inconsistent with our findings here. Notice that Han et al. (2003) compared the performances of tasks that required participants to attend to only one object versus two spatially separated objects. Object-based attention was indexed by higher accuracy of reporting two attributes belonging to a single object than different objects, and it was shown in both no- and high-contrast noise conditions in Han et al.’s (2003) study. It is reasonable to argue that their participants may have changed their attentional window – like a zoom lens (Eriksen and Yeh, 1985) – from “wide” in the two-object condition to “small” in the single-object condition. Accordingly, the differences between the two-object and single-object conditions not only are the number of attended objects but also the size of spatial attention (Davis et al., 2000).

This argument is supported by Liu et al. (2009) with a design identical to Han et al.’s (2003). The magnitude of the same-object advantage was modulated by the required precision of judgments: the higher the task precision, the larger the difference in performance between the two-object and the single-object conditions (Liu et al., 2009). Assuming that attentional window is wide in the two-object condition, the density of attentional resource should be low due to the reciprocal relationship between size and density of attentional distribution (Eriksen and St. James, 1986; LaBerge and Brown, 1989). The low-precision task that requires less resources can be performed equally well with less attentional resource in the two-object condition as opposed to the one-object condition – leading to reduced or no same-object advantage. The critical comparison in their study – two-object and single-object conditions – may not reflect object-based attention but rather a change in the window size of spatial attention. Indeed, the modulation pattern of “object-based” attention in Han et al.’s (2003) study is similar to the modulation pattern of location-based attention (Dosher and Lu, 2000; Lu and Dosher, 2000): both can be observed in no-noise and high-noise conditions. However, the double-rectangle method compares the same-object and different-object conditions based on an equal cue-to-target distance between the two conditions. Using the double-rectangle method, we rule out the confounding of location-based attention in the current study and find that object-based attention is observed only in high-noise conditions, indicating that external noise exclusion plays a critical role in object-based attention.

The qualitative difference between the intrinsic mechanisms of location-based and object-based attention suggests that object-based attention is not an outcome of the spreading from the location-based attention, which is a finding arguing against the well-accepted spreading hypothesis (e.g., Davis and Driver, 1997; Kasai and Kondo, 1997; Richard et al., 2008). Instead, we suggest that object-based attention reflects a qualitatively different kind of attentional orienting that is independent of location-based attention, rather than the modulation of an early sensory enhancement extending from location-based attention. This argument is also against the prioritization hypothesis proposed by Shomstein and Yantis (2002), who claimed that object-based attention reflected strategic prioritization regardless of location-based effect and that neither was it due to object-based perceptual enhancement. However, using the noise-masking paradigm, we provide evidence for the underlying mechanism of object-based attention. The current finding of the leftward-shifted copies of the TvC functions in the same-object and different-object conditions suggests that the underlying mechanism of object-based attention is to exclude external noise, an evidence of object-based perceptual enhancement.

In our experiment, the target may appear in one of the three possible locations. As a result, the participant would experience a greater uncertainty in the invalid conditions, in which the participant needed to monitor three locations, than in the valid condition, in which the participant needed to monitor just one location. Hence, one may argue that perhaps our result can be explained by uncertainty reduction (Pelli, 1985; Tyler and Chen, 2000; Chen and Tyler, 2010). Our result did show a lower threshold in the valid condition than in the invalid conditions, and in turn a vertical shift of TvC functions that is consistent with uncertainty reduction. The three-fold increase in uncertainty from the valid to the invalid cued conditions, according to Tyler and Chen (2000), translated to a 2.5 dB threshold increment. This is slightly larger than the threshold difference between the valid and the invalid cue conditions in our data (2.2 dB). Furthermore, in our experiment, there were only two location-based cueing conditions (valid and invalid). The uncertainty effect, mathematically, as discussed in the Section “Model,” can be absorbed by a change of the sensitivity parameter, Se. Thus, for practical reasons, we can consider the reduction of uncertainty as a cause of sensitivity change that accounts for the spatial cueing effect. However, uncertainty cannot explain the same-object advantage in our result. For instance, the TvC functions for the same-object and the different-object conditions were different even though the uncertainty in these two conditions was identical.

Conclusion

The current study measured the thresholds in different levels of task difficulty and revealed the underlying mechanisms of location-based and object-based attention – which are difficult to evaluate from conventional RT measurements – and sheds a new light to current theories of object-based attention. Here, we overturn two widely accepted theories that object-based attention is due to the “spread” or “prioritization” of attention. In addition to revealing the underlying mechanisms of location- and object-based attention, the current finding fills the gap between previous physiological (Fink et al., 1997; He et al., 2004; Wager et al., 2004; He et al., 2008) and behavioral evidence (Shomstein and Yantis, 2004; List and Robertson, 2007; Chou and Yeh, 2008, 2011; Matsukura and Vecera, 2009) that have demonstrated the discrepancy in location-based and object-based attention by providing important convergent evidence from a novel aspect using the noise masking paradigm to the double-rectangle method.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This research was conducted as part of the first author’s Ph.D. dissertation project (Chou, 2010). Preliminary versions were presented at the 2009 European Conference on Visual Perception annual meeting (Chou et al., 2009). This study was supported by NSC-102-2410-H-431-008 and NSC-102-2420-H-431-001-MY2 to Wei-Lun Chou, by NSC101-2410-H-002-083-MY3 to Su-Ling Yeh, and by NSC 96-2413-H-002-006-MY3 to Chien-Chung Chen.

References

Abrams, R. A., and Law, M. B. (2000). Object-based visual attention with endogenous orienting. Percept. Psychophys. 62, 818–833. doi: 10.3758/BF03206925

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Brown, J. M., Breitmeyer, B. G., Leighty, K. A., and Denney, H. I. (2006). The path of visual attention. Acta Psychol. 121, 199–209. doi: 10.1016/j.actpsy.2004.10.020

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Brawn, P. T., and Snowden, R. J. (2000). Attention to overlapping objects: detection and discrimination of luminance changes. J. Exp. Psychol. Hum. Percept. Perform. 26, 342–358. doi: 10.1037/0096-1523.26.1.342

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, C. C., and Foley, J. M. (2004). Pattern detection: interactions between oriented and concentric patterns. Vision Res. 44, 915–924. doi: 10.1016/j.visres.2003.11.017

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, C. C., Foley, J. M., and Brainard, D. H. (2000). Detection of chromoluminance patterns on chromoluminance pedestals II: model. Vision Res. 40, 789–803. doi: 10.1016/S0042-6989(99)00228-X

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, C. C., and Tyler, C. W. (1999). Spatial pattern summation is phase-insensitive in the fovea but not in the periphery. Spat. Vis. 12, 267–286. doi: 10.1163/156856899X00166

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, C. C., and Tyler, C. W. (2001). Lateral sensitivity modulation explains the flanker effect in contrast discrimination. Proc. R. Soc. Lond. B Biol. Sci. 268, 509–516. doi: 10.1098/rspb.2000.1387

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, C. C., and Tyler, C. W. (2010). Symmetry: modeling the effects of masking noise, axial cueing and salience. PLoS ONE 5:e9840. doi: 10.1371/journal.pone.0009840

CrossRef Full Text

Chou, W.-L. (2010). Distinct Mechanisms of Location-Based and Object-Based Visual Attention. Unpublished doctoral dissertation, National Taiwan University, Taiwan.

Chou, W.-L., Chen, C. C., and Yeh, S.-L. (2009). Mechanisms of location- and object-based attention as revealed by the external noise paradigm. Perception 38(Suppl.), 1–194.

Chou, W.-L., and Yeh, S.-L. (2008). Location-and object-based inhibition of return are affected by different kinds of working memory. Quart. J. Exp. Psychol. 61, 1761–1768. doi: 10.1080/17470210802194308

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chou, W-L., and Yeh, S-L. (2011). Subliminal spatial cues capture attention and strengthen between-object link. Conscious. Cogn. 20, 1265–1271. doi: 10.1016/j.concog.2011.03.007

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Cohn, T. E., and Lasley, D. J. (1974). Detectability of a luminance increment: effect of spatial uncertainty. J. Opt. Soc. Am. 64, 1715–1719. doi: 10.1364/JOSA.64.001715

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Duncan, J. (1984). Selective attention and the organization of visual information. J. Exp. Psychol. Gen. 113, 501–517. doi: 10.1037/0096-3445.113.4.501

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Davis, G., and Driver, J. (1997). Spreading of visual attention to modally versus amodally completed regions. Psychol. Sci. 8, 275–281. doi: 10.1111/j.1467-9280.1997.tb00438.x

CrossRef Full Text

Davis, G., Driver, J., Pavani, F., and Shepherd, A. (2000). Reappraising the apparent costs of attending to two separate visual objects. Vision Res. 40, 1323–1332. doi: 10.1016/S0042-6989(99)00189-3

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Dosher, B. A., and Lu, Z.-L. (2000). Mechanisms of perceptual attention in precuing of location. Vision Res. 40, 1269–1292. doi: 10.1016/S0042-6989(00)00019-5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Egly, R., Driver, J., and Rafal, R. D. (1994). Shifting visual attention between objects and locations: evidence from normal and parietal lesion subjects. J. Exp. Psychol. Gen. 123, 161–176. doi: 10.1037/0096-3445.123.2.161

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Eriksen, C. W., and St. James, J. D. (1986). Visual attention within and around the field of local attention: a zoom lens model. Percept. Psychophys. 40, 225–240. doi: 10.3758/BF03211502

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Eriksen, C. W., and Yeh, Y. Y. (1985). Allocation of attention in the visual field. J. Exp. Psychol. Hum. Percept. Perform. 11, 583–597. doi: 10.1037/0096-1523.11.5.583

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Foley, J. M. (1994). Human luminance pattern-vision mechanisms: masking experiments require a new model. J. Opt. Soc. Am. 11, 1710–1719. doi: 10.1364/JOSAA.11.001710

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Foley, J. M., and Chen, C. C. (1999). Pattern detection in the presence of maskers that differ in spatial phase and temporal offset: threshold measurements and a model. Vision Res. 39, 3855–3872. doi: 10.1016/S0042-6989(99)00104-2

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Fink, G. R., Dolan, R. J., Halligan, P. W., Marshall, J. C., and Frith, C. D. (1997). Space-based and object-based visual attention: shared and specific neural domains. Brain 120, 2013–2028. doi: 10.1093/brain/120.11.2013

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gibson, B. S., and Egeth, H. (1994). Inhibition of return to object-based and environment-based locations. Percept. Psychophys. 55, 323–339. doi: 10.3758/BF03207603

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Goris, R. L., Wagemans, J., and Wichmann, F. A. (2008). Modeling contrast discrimination data suggest both the pedestal effect and stochastic resonance to be caused by the same mechanism. J. Vis. 8, 17.1–17.21. doi: 10.1167/8.15.17

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Han, S. M., Dosher, B. A., and Lu, Z.-L. (2003). Object attention revisited: boundary conditions and mechanisms. Psychol. Sci. 14, 598–604. doi: 10.1046/j.0956-7976.2003.psci_1471.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

He, X., Fan, S., Zhou, K., and Chen, L. (2004). Cue validity and object-based attention. J. Cogn. Neurosci. 16, 1085–1097. doi: 10.1162/0898929041502689

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

He, X., Humphreys, G., Fan, S., Chen, L., and Han, S. (2008). Differentiating spatial and object-based effects on attention: an event-related brain potential study with peripheral cueing. Brain Res. 1245, 116–125. doi: 10.1016/j.brainres.2008.09.092

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kasai, T., and Kondo, M. (1997). Electrophysiological correlates of attention-spreading in visual grouping. Neuroreport 18, 93–98. doi: 10.1097/WNR.0b013e328011b8c9

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kontsevich, L. L., and Tyler, C. W. (1999). Bayesian adaptive estimation of psychometric slope and threshold. Vision Res. 39, 2729–2737. doi: 10.1016/S0042-6989(98)00285-5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

LaBerge, D., and Brown, V. (1989). Theory of attentional operations in shape identification. Psychol. Rev. 96, 101–124. doi: 10.1037/0033-295X.96.1.101

CrossRef Full Text

Lu, Z.-L., and Dosher, B. A. (1998). External noise distinguishes mechanisms of attention. Vis. Res. 38, 1183–1198. doi: 10.1016/S0042-6989(97)00273-3

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lu, Z.-L., and Dosher, B. A. (2000). Spatial attention: different mechanisms for central and peripheral temporal precues? J. Exp. Psychol. Hum. Percept. Perform. 26, 1534–1548. doi: 10.1037/0096-1523.26.5.1534

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Liu, S.-H., Dosher, B, A., and Lu, Z.-L. (2009). The role of judgment frames and task precision in object attention: reduced template sharpness limits dual-object performance. Vision Res. 49, 1336–1351. doi: 10.1016/j.visres.2008.07.025

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Legge, G. E., Kersten, D., and Burgess, A. E. (1987). Contrast discrimination in noise. J. Opt. Soc. Am. 4, 391–404. doi: 10.1364/JOSAA.4.000391

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

List, A., and Robertson, L. C. (2007). Inhibition of return and object-based attentional selection. J. Exp. Psychol. Hum. Percept. Perform. 33, 1322–1334. doi: 10.1037/0096-1523.33.6.1322

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lamy, D., and Tsal, Y. (2000). Object features, object locations, and object files: which does selective attention activate and when? J. Exp. Psychol. Hum. Percept. Perform. 26, 1387–1400. doi: 10.1037/0096-1523.26.4.1387

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Moore, C. M., and Fulton, C. (2005). The spread of attention to hidden portions of occluded surfaces. Psychon. Bull. Rev. 12, 301–306. doi: 10.3758/BF03196376

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Matsukura, M., and Vecera, S. P. (2006). The return of object-based attention: selection of multiple-region objects. Percept. Psychophys. 68, 1163–1175. doi: 10.3758/BF03193718

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Matsukura, M., and Vecera, S. P. (2009). Interference between object-based attention and object-based memory. Psychon. Bull. Rev. 16, 529–536. doi: 10.3758/PBR.16.3.529

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Moore, C. M., Yantis, S., and Vaughan, B. (1998). Object-based visual selection: evidence from perceptual completion. Psychol. Sci. 9, 104–110. doi: 10.1111/1467-9280.00019

CrossRef Full Text

Nagaraja, N. S. (1964). Effect of luminance noise on contrast thresholds. J. Opt. Soc. Am. 54, 950–955. doi: 10.1364/JOSA.54.000950

CrossRef Full Text

Pelli, D. G. (1985). Uncertainty explains many aspects of visual contrast detection and discrimination. J. Opt. Soc. Am. 2, 1508–1532. doi: 10.1364/JOSAA.2.001508

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pelli, D. G. (1991). “Noise in the visual system may be early,” in Computational Models of Visual Processing, eds M. Landy and J. A. Movshon (Cambridge: MIT Press), 147–152.

Pestilli, F., and Carrasco, M. (2005). Attention enhances contrast sensitivity at cued and impairs it at uncued locations. Vision Res. 45, 1867–1875. doi: 10.1016/j.visres.2005.01.019

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Posner, M. I. (1980). Orienting of attention. Quart. J. Exp. Psychol. 32, 3–25. doi: 10.1080/00335558008248231

CrossRef Full Text

Ratcliff, R. (1978). A theory of memory retrieval. Psychol. Rev. 85, 59–108. doi: 10.1037/0033-295X.85.2.59

CrossRef Full Text

Reynolds, J. H., and Heeger, D. J. (2009). The normalization model of attention. Neuron 61, 168–185. doi: 10.1016/j.neuron.2009.01.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Richard, A. M., Lee, H., and Vecera, S. P. (2008). Attentional spreading in object-based attention. J. Exp. Psychol. Hum. Percept. Perform. 34, 842–852. doi: 10.1037/0096-1523.34.4.842

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ross, J., and Speed, H. D. (1991). Contrast adaptation and contrast masking in human vision. Proc. Biol. Sci. 246, 61–70. doi: 10.1098/rspb.1991.0125

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Shomstein, S., and Behrmann, M. (2008). Object-based attention: strength of object representation and attentional guidance. Percept. Psychophys. 70, 132–144. doi: 10.3758/PP.70.1.132

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Shomstein, S., and Yantis, S. (2002). Object-based attention: sensory modulation or priority setting? Percept. Psychophys. 64, 41–51. doi: 10.3758/BF03194556

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Shomstein, S., and Yantis, S. (2004). Configural and contextual prioritization in object-based attention. Psychon. Bull. Rev. 11, 247–253. doi: 10.3758/BF03196566

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Snowden, R. J., and Hammett, S. T. (1998). The effects of surround contrast on contrast thresholds, perceived contrast and contrast discrimination. Vision Res. 38, 1935–1945. doi: 10.1016/S0042-6989(97)00379-9

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tipper, S. P., Driver, J., and Weaver, B. (1991). Object-centred inhibition of return of visual attention. Quart. J. Exp. Psychol. 43A, 289–298. doi: 10.1080/14640749108400971

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Teo, P. C., and Heeger, D. K. (1994). Perceptual image distortion. SPIE Proc. 2179, 127–141. doi: 10.1117/12.172664

CrossRef Full Text

Tyler, C. W., and Chen, C. C. (2000). Signal detection theory in the 2AFC paradigm: attention, channel uncertainty and probability summation. Vision Res. 40, 3121–3144. doi: 10.1016/S0042-6989(00)00157-7

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tyler, C. W., and McBride, B. (1997). The Morphonome image psychophysics software and a calibrator for Macintosh systems. Spat. Vis. 10, 479. doi: 10.1163/156856897X00410

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wager, T. D., Jonides, J., and Reading, S. (2004). Neuroimaging studies of shifting attention: a meta-analysis. Neuroimage 22, 1679–1693. doi: 10.1016/j.neuroimage.2004.03.052

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Watson, A. B., and Solomon, J. A. (1997). Model of visual contrast gain control and pattern masking. J. Opt. Soc. Am. 4, 2379–2391. doi: 10.1364/JOSAA.14.002379

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wilson, H. R., and Humanski, R. (1993). Spatial frequency adaptation and contrast gain control. Vision Res. 33, 1133–1149. doi: 10.1016/0042-6989(93)90248-U

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wu, C. C., and Chen, C. C. (2010). Distinguishing lateral interaction from uncertainty reduction in collinear flanker effect on contrast discrimination. J. Vis. 10, 1–14. doi: 10.1167/10.3.8

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Zenger, B., Braun, J., and Koch, C. (2000). Attentional effects on contrast detection in the presence of surround masks. Vision Res. 40, 3717–3724. doi: 10.1016/S0042-6989(00)00218-2

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keywords: attention mechanisms, location-based attention, object-based attention, threshold versus external noise contrast (TvC) function, noise-masking paradigm, divisive inhibition model

Citation: Chou W-L, Yeh S-L and Chen C-C (2014) Distinct mechanisms subserve location- and object-based visual attention. Front. Psychol. 5:456. doi: 10.3389/fpsyg.2014.00456

Received: 20 January 2014; Accepted: 28 April 2014;
Published online: 21 May 2014.

Edited by:

Jocelyn Faubert, Université de Montréal, Canada

Reviewed by:

Zhong-Lin Lu, University of Southern California, USA
Jason M. Gold, Indiana University Bloomington, USA

Copyright © 2014 Chou, Yeh and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Su-Ling Yeh, Department of Psychology, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan e-mail:c3VsaW5nQG50dS5lZHUudHc=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.