- 1Center for Biomedical Technology, Technical University of Madrid, Pozuelo de Alarcón, Madrid, Spain
- 2Laboratory of Neuroscience and Cognitive Technology, Center for Technologies in Robotics and Mechatronics Component, Innolpolis University, Innopolis, Russia
- 3Department of Automation, Control and Mechatronics, Saratov State Medical University, Saratov, Russia
In this study, voluntary and involuntary visual attention focused on different interpretations of a bistable image, were investigated using magnetoencephalography (MEG). A Necker cube with sinusoidally modulated pixels' intensity in the front and rear faces with frequencies 6.67 Hz (60/9) and 8.57 Hz (60/7), respectively, was presented to 12 healthy volunteers, who interpreted the cube as either left- or right-oriented. The tags of these frequencies and their second harmonics were identified in the average Fourier spectra of the MEG data recorded from the visual cortex. In the first part of the experiment, the subjects were asked to voluntarily control their attention by interpreting the cube orientation as either being on the left or right. Accordingly, we observed the dominance of the corresponding spectral component, and voluntary attention performance was measured. In the second part of the experiment, the subjects were asked to focus their gaze on a red marker at the center of the cube image without putting forth effort in its interpretation. The alternation of the dominant spectral energies at the second harmonics of the stimulation frequencies was treated as changes in the cube orientation. Based on the results of the first experimental stage and using a wavelet analysis, we developed a method which allowed us to identify the currently perceived cube orientation. Finally, we characterized involuntary attention using the distribution of dominance times when focusing attention on one of the cube orientations, which was related to voluntary attention performance and brain noise. In particular, we confirmed our hypothesis that higher attention performance is associated with stronger brain noise.
1. Introduction
Wilhelm Wundt was the first to suggest, in as early as in 1897, that two forms of attention exist: voluntary and involuntary (Wundt, 1897). There is already more than a justifiable number of terms used in the community that overlap with these two forms of attention, such as endogenous vs. exogenous attention, automatic vs. controlled attention, and pull vs. push attention (Prinzmetal et al., 2005). According to Prinzmetal and his colleagues, voluntary and involuntary attention have different functions and are controlled by distinct mechanisms (Prinzmetal et al., 2005). They supposed that voluntary attention affects perceptual attention and would affect both accuracy and reaction time (RT) experiments, whereas involuntary attention deals with the response-selection decision and is manifested only in RT experiments. To study these differences, Posner et al. (1978), Posner (1978), Posner et al. (1980), and Posner (1980) developed a spatial cuing task. In their method, the subjects were asked to detect or identify a peripheral stimulus. The participants were pre-cued to a possible location of the stimulus beforehand; in valid trials, the cue indicated the target location, whereas in the case of invalid trials, the cue indicated a non-target location. Since the participants were not allowed to move their eyes to the cued location, the observed differences in performance between valid and invalid trials reflected differences in attention which were independent of fixation. Later, Jonides (1980) and Jonides (1983) used this paradigm to study the difference between voluntary and involuntary attention by altering the “validity” of the cuing information. If the total number of valid trials for the correct stimulus location is as low as that for a random distribution in which no useful bias for the target location is provided, only involuntary attention would be involved in seeing the peripheral stimulus. On the other hand, in the presence of a high number of valid trials in which correct cuing information for the target location is available, both voluntary and involuntary attention would be engaged.
In 2005, Prinzmetal et al. (2005) introduced the idea of channel enhancement and channel selection in order to show how the two kinds of attention manifest. Channel enhancement is a process driven by voluntary attention that causes the visual system to gather more information from the attended stimulus than from the unattended stimulus specified by the informative cues. It changes the perceptual representation so that the observers have a clearer view of the stimulus they are attending to Prinzmetal et al. (1997a), Prinzmetal et al. (1997b), and Prinzmetal et al. (1998). Other researchers also confirmed that attention to the biasing cue improves the perceived contrast of both attended and unattended stimuli (Carrasco et al., 2004; Luck, 2004; Treue, 2004).
There is a general consensus that the Stroop effect alters the response selection, but not perceptual representation (Virzi and Egeth, 1985; Baldo et al., 1998). For example, when shown the word written in red and asked the color, it would lead to a competition in the response selection that delays the response, but no alteration in the perceived color would be observed. Similarly, involuntary attention would affect RT, but not detection accuracy. Conveniently, several researchers reported that involuntary attention to a stimulus only affects the response selection (Ooi and He, 1999; Mitchell et al., 2004; Hancock and Andrews, 2007).
It should be noted that there is a precedence for accuracy and RT studies to produce opposing effects (Santee and Egeth, 1982; Mordkoff and Egeth, 1993; Moore and Egeth, 1998). In particular, Santee and Egeth (1982) considered the redundant target paradigm, in which a target letter is repeated on a display. They found that the repeating target speeds up the reaction (Eriksen and Eriksen, 1974, 1979; Eriksen and Schultz, 1979) but reduces the accuracy (Bjork and Murray, 1977; Santee and Egeth, 1980). This phenomenon is known as the flanker effect. The correct selection of recording channels should also alter the detection accuracy in the target location which is being attended. Furthermore, it may also improve RT as information is presumably gathered faster in the cued than in the uncued location. Moreover, channel selection deals with decision making when determining the correct target location or response selection, and only affects RT experiments.
In this paper, we study voluntary and involuntary attention using multistable perception (Leopold and Logothetis, 1999), a phenomenon where the same stimulus can be perceived in more than one way. With regard to degrees of freedom, the simplest form of multistable perception is bistable perception: when two different interpretations of the same stimulus are possible. As a result of extensive research on this topic over the last two decades, many descriptive models were developed (Moreno et al., 2007; Shpiro et al., 2009; Huguet et al., 2014; Dotov et al., 2019; Meilikhov and Farzetdinova, 2019; Chholak et al., 2020a). The switches between alternate percepts were suggested to be driven by either stochastic processes in the brain (Moreno et al., 2007; Pisarchik et al., 2014) due to random neurophysiological activity and neuronal adaptation (Huguet et al., 2014; Dotov et al., 2019), which is defined as slow destabilization of currently dominant perception after being active for a prolonged time, or due to both noise and adaptation (Shpiro et al., 2009; Huguet et al., 2014; Chholak et al., 2020a). Each percept competes with another rival state, while the dominant active state tends to suppress alternative perception. Several researchers also studied visual attention modulation in the striate and extrastriate visual cortex (Hillyard and Anllo-Vento, 1998; Mangun et al., 1998; Brefczynski and De Yoe, 1999; Ghandi et al., 1999; McAdams and Maunsell, 1999; Reynolds and Desimone, 1999, 2003; Reynolds et al., 2000; Treue, 2000; Martinez-Trujillo and Treue, 2002; Saenz et al., 2002). Whether the interstate suppression comes before binocular confluence, such as in the primary visual cortex or the lateral geniculate nucleus (Blake, 1989; Lehky and Blake, 1991; Tong and Engel, 2001), or after binocular confluence (Logothetis et al., 1996; Andrews and Purves, 1997) was a matter of numerous debates. The latter assumes that there is a competition between high-level stimulus representations in visual neurons (Ooi and He, 1999; Hancock and Andrews, 2007).
Similarly, the phenomenon of visual attention is based on the competition of one object among a variety of other competing alternatives for enhanced perceptual representation as in voluntary attention. This leads to the suggestion that bistable perception and attention may be related processes (Helmholtz, 1962; Leopold and Logothetis, 1999). Previous studies on this topic were performed using evoked responses that consisted of numerous relatively short trials as opposed to a single long trial. The present work, on the contrary, is aimed to characterize voluntary and involuntary attention using visual responses from relatively long (120-s) trials. Furthermore, involuntary attention was only found in RT experiments under the evoked response regime. Instead of the evoked response, we use long entrained visual signals that can vary in phase and hence are unlocked in time with the start of stimulation. The corresponding brain response is termed as visual induced field (VIF), in contrast to the traditionally used visual evoked field (VEF). In the first part of our experiment we study controlled (voluntary) attention, when subjects are asked to fix their attention on one of two possible Necker cube orientations, whereas in the second part we investigate involuntary attention when subjects do not try to interpret (control) the cube orientation. We measure the subject's attentional ability in the first part and use the gained insight in the second part to estimate involuntary attention based on the method of wavelet energies. Finally, we characterize involuntary attention using dominance time distribution and study its relation to voluntary attention performance and brain noise.
2. Materials and Methods
2.1. Experimental Setup
Magnetoencephalographic (MEG) data were recorded with a whole-head Vectorview MEG system (Elekta AB, Stockholm, Sweden) with 306 channels (102 magnetometers and 204 planar gradiometers), which were placed inside a magnetically shielded room (Vacuum Schmelze GmbH, Hanau, Germany) at the Laboratory of Cognitive and Computational Neuroscience, Center for Biomedical Technology, Technical University of Madrid, Spain. Fastrak digitizers (Polhemus, Colchester, Vermont) were used to obtain a three-dimensional head shape using approximately 300 points on the scalp of each subject. Additionally, three fiducial points (nasion, left and right pre-auricular) were acquired for co-registration purposes. A vertical electrooculogram was placed to capture eye blinks. A single empty room recording lasting more than 2 min was performed on each day of the experiment (Day-1: 4 subjects; Day-2: 5 subjects; Day-3: 3 subjects). Data were sampled at 1,000 Hz with an on-line anti-alias bandpass filter between 0.1 and 330 Hz.
2.2. Participants
Twelve1 healthy subjects (aged 17–64 years, six males and six females) with normal or corrected-to-normal vision participated in the experimental study. All subjects provided written informed consent before the commencement of the experiment. The experimental studies were performed in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the Technical University of Madrid.
2.3. Visual Stimulus
The visual stimulus was a gray Necker cube image on a gray background drawn on a computer monitor with a 60-Hz refresh rate and subsequently projected by a digital light processing projector onto a translucent screen located 150 cm away from the subject (Figure 1). The pixels' brightness on the left- and right-cube front faces was modulated by sinusoidal signals with 6.67-Hz (60/9) and 8.57-Hz (60/7) frequencies, respectively. The modulation depth was 100% with respect to the medium gray-scale level of the pixels' brightness (127 in an 8-bit format), i.e., the image brightness varied from black (0) to gray (127). The sinusoidal shape and modulation frequencies were chosen in preliminary experiments where different signal shapes (sinusoidal, rectangular and triangular) and different flicker frequencies which are integral fractions of the 60-Hz frame rate (i.e., 60/2, 60/3, 60/4, 60/5, 60/6, 60/7, 60/8, 60/9, 60/10, and 60/12) were explored. The selected frequencies produced the best tags in the brain response (Pisarchik et al., 2019a).
Figure 1. Presented visual stimulus. Necker cube with flickering left and right faces at 6.67 and 8.57 Hz, respectively. The subjects were asked to fix their gaze on the central red dot.
To perform the experiment, including the presentation of the visual stimulus, we designed a special algorithm using the Cogent Graphics MATLAB Toolbox publicly available on GitHub®. Data acquisition was made by the software provided with the Elekta-Neuromag system, while the time-stamps corresponding to the onset of visual stimulus presentation were marked on-line using a parallel port. The MATLAB code also included these event markers via parallel port connection. For more details, see the section on code and data sharing.
2.4. Experimental Procedure
The subjects were seated in a comfortable reclining chair with their legs straight and arms resting on an armrest in front or on their laps. The participants were asked to remove any metallic items above their waist like jewelry, belts, and brassieres, along with their shoes prior to the experiment. The experiment began with the recording of a 2-min background activity while the subject was focusing on a red dot located in the middle of a stationary (non-flickering) cube image. This MEG trial acted as a background reference in further analyses.
The entire experiment included two stages: voluntary control of the perceived cube orientation and involuntary spontaneous switching between two cube orientations. During the first stage, after a 30-s rest and an instructional visual message, a flickering Necker cube with two frequencies was presented 24 times on the screen (5 s each with a 5-s interval gap in-between). For the first 12 trials, 9 out of 12 participants were asked to interpret the cube as left-oriented. After a 30-s rest and an instructional visual message, the participants were requested to interpret the next 12 cubes as right-oriented. For 3 subjects, we reversed the order of voluntary perception by asking them to interpret the first 12 cubes as right-oriented and the next 12 cubes as left-oriented. This concluded the first experimental stage.
The second part of the experiment started with the same Necker cube presentation but now for 120 s. At this stage, the subjects were instructed not to fix their attention on any particular cube orientation. In all of the experiments, the participants were asked to focus only on the red dot at the center of the image. This was done to ensure that changes in the cube orientation were not caused by eye movements, i.e., at the retinal level, but instead by visual neurons at a higher level.
2.5. Visual Induced Field (VIF)
In this paper, we introduce a new measure of visual brain response which is defined as the average brain activity in the visual cortex for longer than transient-evoked-response time durations that are not phase-locked to the stimulus. The traditionally used visual evoked field (VEF) is time-locked to the stimulus and thus one averages visual brain activity across trials. Since we observe intermediate phase-slips in experiments with longer (e.g., 120-s) durations at unpredictable time moments, averaging across multiple trials is not possible. Although this response is caused by the stimuli, the time moments of its start and intermediate phase-slips are not fixed. Therefore, we aptly named it visual induced response (VIR) opposed to visual evoked response (VER). It should be noted that VER at such time scales, averaged over multiple trials, is called steady-state visual evoked response (SSVER).
The brain was mapped using a mesh of 15,004 points representing cortical sources. There are multiple combinations in which these numerous brain sources can produce the observed magnetic activity recorded by 306 MEG channels. This so-called inverse problem is ill-posed and can only be solved by using additional assumptions about the neuronal system such as minimization of the total energy of the system. The depth-dependent sensitivity and spatial resolution were normalized using the standardized low-resolution electromagnetic tomography (sLORETA) method.
After aligning a standard anatomical magnetic resonance imaging (MRI) provided in Brainstorm (Tadel et al., 2011) with the fiducial points, the cranial anatomy was warped to fit the acquired points on the scalp using the Polhemus device with an error margin of 2%. We used the Brodmann atlas in Brainstorm to find cortical sources associated with visual areas V1 and V2 on the modeled cortical mesh (1,227 points). The response of these visual sources was then averaged to obtain VIF for each trial.
2.6. Spectral Analysis
Morlet-based wavelets constructed from a mother wavelet with a 1-Hz central frequency and a 12-s full width at half maximum (FWHM) were utilized to obtain wavelet power time series at the second harmonics of the flicker frequencies. The second harmonic frequencies were fine-tuned based on the power spectrum of the VIF signals for each subject.
2.7. Wavelet Analysis
The time-frequency analysis is based on the continuous wavelet transform (Pavlov et al., 2012; Hramov et al., 2015)
where “*” denotes the complex conjugate and X(t) is the analyzed MEG signal. The complex-valued Morlet-wavelet is chosen as the mother wavelet
with ω0 = 2πf0 being the central frequency of the Morlet wavelets and . The wavelet powers W(f1, t) and W(f2, t) given by Equation (1) were evaluated at the tagging frequencies f1 = 13.33 Hz and f2 = 17.14 Hz (second harmonics of the flicker frequencies), respectively. Since the frequency response decays with increasing frequency as a 1/f rule, the wavelet energy is normalized to the corresponding modulation period (1/f1, 2). Hence, the wavelet time series are multiplied to their defining frequencies to get
and the difference between the spectral energies at f1 and f2 is then calculated as
and normalized to its maximum absolute value as
In our analysis, we averaged E1 and E2 over time and over all trials separately for the left-oriented ( and ) and for the right-oriented ( and ) cube interpretations. The power spectra averaged over all participants are shown in Figure 2.
Figure 2. Average power spectra. Fourier spectra of VIF averaged over all subjects during perception of non-flickering (background) (blue), left-oriented (red), and right-oriented (green) cubes.
The evolution of the normalized energy difference in Equation (5) for typical 5-s trials corresponding to the left and right cube orientations for one of the subjects is shown in Figure 3.
Figure 3. Spectral difference. Time series of normalized energy difference (Equation 5) for single trials corresponding to voluntary left-oriented (left) and right-oriented (right) cube perception.
The differences between the wavelet energies at f1 and f2 corresponding to the left-oriented and right-oriented cube perceptions () signify the bias in spectral reflection of left orientation in comparison to right orientation such that D1 should be higher and D2 should be lower. The difference between D1 and D2 defines the performance index μ as
The performance μ characterizes the ability of the subject to voluntarily attend to the foretold cube orientation. Similar to the voluntary case, normalized energy difference time series for both frequencies were evaluated from VIF for involuntary perception. However, unlike the voluntary case, the trial duration was increased to 120 s.
2.8. Marking Perception States
To determine the moment of switching between two different cube orientations, we propose a method based on wavelet power time series. In our approach, calculated by Equation (5) is screened for significant changes above a threshold equal to its standard deviation δ:
The active state is determined as left-oriented (Switch = 1) if and as right-oriented (Switch = 0) if . The algorithm is resilient to insignificant perturbations and sticks to the previous state for . Typical switches in perception between the two cube orientations are illustrated in Figure 4.
Figure 4. Difference in spectral energies. Time series with marked switches between involuntary left-oriented and right-oriented cube perception. The right panel shows an enlarged part of the left graph with marked resting times T1 and T2.
2.9. Event-Related Coherence
In order to localize the brain sources during the second part of the experiment, we calculated event-related coherence (ERC).
To reduce the computational load, we first stripped the 120-s trials into forty 3-s trials for both the second part of the experiment and the background recordings. For each of the stripped trials for both experimental conditions, the magnitude-squared coherence between the estimated source time series (15,004 signals) and a sinusoidal signal at either of the tagging frequencies, i.e., f1 or f2, was evaluated. After averaging the coherence values over all forty trials, the difference between average coherence during the second part of the experiment and background was calculated and termed as ERC. The ERC was thus computed over all 15,004 brain sources to generate heat maps for source localization. For a detailed description of the method, see (Chholak et al., 2020b).
The ERC maps were evaluated at both tagging frequencies, f1 and f2, and then averaged to give the final source localization map.
3. Results and Discussion
3.1. Experiment-1: Voluntary Control of Perceived Cube Orientation
We observe that for the left-oriented cube interpretation the spectral energy is higher at f1 than at f2, whereas for the right-oriented cube the opposite situation occurs. This can be seen in Figure 2, where we plot the power spectra averaged over all subjects during trials with the left-oriented cube, right-oriented cube, and stationary cube (background) without flickering.
Hence, we expect dominance of the left orientation over the right orientation, calculated as the difference between the spectral powers corresponding to two different cube orientations, at f1 (or D1) to be positive and higher than at f2 (or D2), which should be negative and lower than D1. Furthermore, the difference between D1 and D2 would signify the performance in subject's voluntary attention (μ) to tend to perceive both cube orientations, because the reason for perceiving the contrast between the attended and unattended stimuli is voluntary attention.
Figure 3 shows typical times series of the spectral power difference for the left- and right-face frequencies during voluntary attention on the left- and right-cube orientations. In Table 1, we present the dominance of the left orientation over the right orientation for both frequencies and voluntary attention performance μ calculated by Equation (6). While the spectral difference D1 is marginally positive, D2 is largely negative. One can see that μ is always positive.
Table 1. Dominance of left-oriented (D1) over right-oriented (D2) cube interpretations at f1 and f2, respectively, and voluntary attention performance (μ).
As discussed in Introduction, the influence of attention on contrast sensitivity is well-documented by various experiments. The remaining question is whether the enhancement in contrast is due to an increase in the dominance of an attended stimulus (Chong et al., 2005) or a decrease in the dominance of an unattended stimulus (Carrasco et al., 2004; Hancock and Andrews, 2007). Many studies claim that attention enhances perceptual sensitivity (Prinzmetal et al., 1997a, 1998; Lu and Dosher, 1998; Carrasco et al., 2000; Cameron et al., 2002). In this regard, two prominent models were proposed. One of them implies that attention improves the quality of neural response to the stimulus (signal enhancement) (Carrasco et al., 2000, 2002; Cameron et al., 2002), while the other suggests that attention reduces the response to an unattended stimulus (external noise reduction) (Lu and Dosher, 1998; Baldassi and Burr, 2000). In their pioneering work, Carrasco et al. (2004) demonstrated with a clever set of psychophysical experiments on a large number of subjects that attention enhances the strength of the perceived stimulus by reducing the impact of unattended stimuli.
Our findings also support the attentional mechanism of external noise reduction as opposed to signal enhancement. When the subject is voluntarily attending to the left-oriented cube in comparison to the right-oriented cube, the dominance of the attended stimulus frequency f1 does not increase as much as there is a decrease in the unattended stimulus frequency f2. Thus, the enhanced contrast of the attended to the unattended stimulus due to voluntary attention is caused by a decrease in the unattended stimulus dominance.
However, it is noteworthy that Carrasco et al. (2000), Cameron et al. (2002), and Carrasco et al. (2002) worked with the paradigm of transient attention which was infused using visual cues and lasted for up to a maximum of only 250 ms. In our study, the subjects were asked to maintain their attention during the entire period of 5-s trials. Therefore, the above mechanism is not only true for transient responses but also for sustained long-term responses.
3.2. Experiment-2: Involuntary Switches Between Different Perceptual States
When the subjects spontaneously switch their attention to either of the cube orientations, the VIF spectral content exhibits narrow peaks at tagging frequencies f1 and f2 and sum flicker frequencies (f1 + f2)/2 (Figure 5). This can be explained by the fact that during perception of either of the cube orientations, the central square at the intersection of both orientations is flickering at the superposition frequency, and is consequently attended during the perception of either orientation.
Figure 5. Involuntary attention. (A) Average power spectrum of VIF for all subjects during involuntary switching between the two cube orientations. (B,C) Probability distributions of dominance times for left (T1) and right (T2) cube orientations.
The average values of dominance times for both orientations are similar (Ta1 = 4.097 ms, Ta2 = 5.124 ms), but curiously, the most probable or modal dominance time for the left orientation (Tm1 = 2.275 s) is much higher than for the right orientation (Tm2 = 0.424 s). This seems to suggest a bias in the perception of the two cube orientations, i.e., the same stimulation excites the left orientation more easily and frequently than the right orientation.
Perception selection can be affected by spatial, ocular, or feature-based mechanisms. In our study, both cube perceptions were shown to both eyes and so the interocular competition did not affect orientation selection. Since the subject's eyes were fixated to the central red dot from which both cube skeletons were symmetrically located, we can also rule out spatial selection. Lastly, the features of both orientations were identical and came out on the screen together abruptly without any smooth transitions between them. Hence, we can also rule out feature-based mechanisms.
Another possible reason for the preference of the left-cube orientation can be that in our everyday lives, we see the left-oriented cube more often and hence the perceptual stability of the left-cube orientation is higher (Chholak et al., 2020a). This form of attention in perceptual selection that does not depend upon ocular, spatial, or feature-based mechanisms but solely on the representational object it corresponds to, is called object-based attention and has shown to determine dominance in bistable perception (Mitchell et al., 2004).
In addition, we localized brain sources averaging ERC maps at f1 and f2 frequencies. Figure 6 shows the localized brain activity in the visual cortex of one of the subjects (Subject-B). Interestingly, we observed a comparatively stronger activation in the right hemisphere, which corresponds to the left visual field. These results fall in line with the preference of the left-cube orientation.
Figure 6. Typical source localization map using average event-related coherence. Event-related coherence for both stimulation frequencies are calculated and averaged to reveal brain sources active during the spontaneous switching between the visual perceptual states. The sources are localized in the visual cortex.
In the left panel of Figure 7, we plot the average modal dominance time Tm0 = (Tm1 + Tm2)/2 vs. voluntary attention performance μ. As noted, only 10 out of 12 subjects participated in the second part of the experiment with an additional defaulter. Interestingly, higher attention performance leads to shorter dominance time. This is in accordance with our hypothesis that higher attention requires a larger neuronal network to process information and make a decision, this in turn increases neural noise since a larger number of synapses and neurons are involved (Pisarchik et al., 2019b). Finally, stronger brain noise causes more frequent switching between perceptual states or more frequent response selection and hence shorter dominance times.
Figure 7. Relation of dominance time with attention performance and brain noise. (A) Dominance time vs. attention performance with a linear fit (root mean squared error: 0.168; F-statistics: 5.7; p-value: 0.0484). (B) Dominance time vs. brain noise with a linear fit (root mean squared error: 0.147; F-statistics: 8.95; p-value: 0.0242.
To check this hypothesis, we estimated brain noise using the methodology based on phase synchronization (Boccaletti et al., 2018) as in the experiment described in our recent paper (Pisarchik et al., 2019a). In a separate set of experiments with only a single face of the cube flickering, we measured kurtosis of the probability distributions of the phase difference between the second harmonic of the flickering signal (f1) and VIF in the occipital cortex. In the right panel of Figure 7, we plot the average modal dominance time vs. brain noise (in units of inverse kurtosis). Not only do the two curves follow a similar downward trend, but subjects with higher voluntary attention also have higher noise roughly. It is important to note that brain noise was measured in a different way than in the experiment described in this paper. Nonetheless, a subject with higher (Subject-B) or lower voluntary attention capabilities (Subject-I) can be assumed to have paid a similar level of attention during the subsequent brain noise measurement experiment. As expected, these values anticorrelate, which confirms our hypothesis that higher attention performance is associated with stronger brain noise because a larger neural network is involved in information processing. This result is consistent with the Bialek and DeWeese theory (Bialek and DeWeese, 1995), who predicted that “the brain always finds the statistically optimal interpretation of the incoming sense data.”
4. Conclusion
In this paper, we have proposed novel approaches for estimating attention performance and classification of bistable perception states, based on wavelet transformation of neurophysiological brain activity. This allowed us to assess subjects in their ability to voluntarily attend to a given object and ignore the competing distractions. Owing to its non-invasive nature and relatively short conduction time, it can be used as a screening test for attentive subjects, much like IQ tests, but with much shorter conduction times.
With regard to possible applications, the developed algorithm for bistable state classification can be useful for designing new non-invasive real-time brain-computer interfaces, due to its fast computation and relative simplicity in comparison to the very heavy machine learning classification methods that require humongous computational times and larger data.
This perspective research direction requires further development. One of the possible improvements would be the combination of different methods for studying visual attention, e.g., visual-evoked spread spectrum analysis (Lalor et al., 2007) or blind source separation techniques (Tang et al., 2002). In particular, the latter method is a modification of the independent component analysis allowing to collect MEG data during cognitive tasks. Since this method does require good head stabilization, combining second-order blind identification with SSVEP would be straightforward. Filtering out the driving frequencies might allow segregation of the signal coming from different parts of the brain.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The studies involving human participants were reviewed and approved by Ethics Committee of the Technical University of Madrid. The patients/participants provided their written informed consent to participate in this study.
Author Contributions
All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.
Funding
The experimental study was supported by the Spanish Ministry of Economy and Competitiveness under Project SAF2016-80240. The data analysis was supported by the Russian Science Foundation (Grant No. 19-12-00050). This manuscript has been released as a preprint at BioRxiv (Chholak et al., 2020c).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
1. ^Subject-G and Subject-A did not participate in the involuntary attention experiment due to personal reasons. Additionally, there was a defaulter in each involuntary (Subject-K) and voluntary (Subject-J) attention experiments.
References
Andrews, T. J., and Purves, D. (1997). Similarities in normal and binocularly rivalrous viewing. Proc. Natl. Acad. Sci. U.S.A. 94, 9905–9908. doi: 10.1073/pnas.94.18.9905
Baldassi, S., and Burr, D. C. (2000). Feature-based integration of orientation signals in visual search. Vis. Res. 40, 1293–1300. doi: 10.1016/S0042-6989(00)00029-8
Baldo, J. V., Shimamura, A. P., and Prinzmetal, W. (1998). Mapping symbols to response modalities: Interference effects on Stroop-like tasks. Percept. Psychophys. 60, 427–437. doi: 10.3758/BF03206864
Bialek, W., and DeWeese, M. (1995). Random switching and optimal processing in the perception of ambiguous signals. Phys. Rev. Lett. 74, 3077–3080. doi: 10.1103/PhysRevLett.74.3077
Bjork, E. L., and Murray, J. T. (1977). On the nature of input channels in visual processing. Psychol. Rev. 84, 472–484. doi: 10.1037/0033-295X.84.5.472
Blake, R. (1989). A neural theory of binocular rivalry. Psychol. Rev. 96, 145–167. doi: 10.1037/0033-295X.96.1.145
Boccaletti, S., Pisarchik, A. N., del Genio, C. I., and Aman, A. (2018). Synchronization: From Coupled Systems to Complex Networks. Cambridge: Cambridge University Press. doi: 10.1017/9781107297111
Brefczynski, J. A., and De Yoe, E. A. (1999). A physiological correlate of the ‘spotlight’ of visual attention. Nat. Neurosci. 2, 370–374. doi: 10.1038/7280
Cameron, E. L., Tai, J., and Carrasco, M. (2002). Covert attention affects the psychometric function of contrast sensitivity. Vis. Res. 42, 949–967. doi: 10.1016/S0042-6989(02)00039-1
Carrasco, M., Ling, S., and Read, S. (2004). Attention alters appearance. Nat. Neurosci. 7, 308–313. doi: 10.1038/nn1194
Carrasco, M., Penpeci-Talgar, C., and Eckstein, M. (2000). Spatial covert attention increases contrast sensitivity along the CSF: support for signal enhancement. Vis. Res. 40, 1203–1215. doi: 10.1016/S0042-6989(00)00024-9
Carrasco, M., Williams, P., and Yeshurun, Y. (2002). Covert attention increases spatial resolution with or without masks: support for signal enhancement. J. Vis. 2, 467–479. doi: 10.1167/2.6.4
Chholak, P., Hramov, A. E., and Pisarchik, A. N. (2020a). An advanced perception model combining brain noise and adaptation. Nonlinear Dyn. 100, 3695–3709. doi: 10.1007/s11071-020-05741-0
Chholak, P., Kurkin, S., Hramov, A., and Pisarchik, A. (2020b). Event-Related Coherence in Visual Cortex and Brain Noise: a MEG study. Appl. Sci.
Chholak, P., Maksimenko, V. A., Hramov, A. E., and Pisarchik, A. N. (2020c). Estimating voluntary and involuntary attention in bistable visual perception: a MEG study. bioRxiv. doi: 10.1101/2020.02.18.953653
Chong, S. C., Tadin, D., and Blake, R. (2005). Endogenous attention prolongs dominance durations in binocular rivalry. J. Vis. 5, 1004–1012. doi: 10.1167/5.11.6
Dotov, D. G., Turvey, M. T., and Frank, T. D. (2019). Embodied gestalts: unstable visual phenomena become stable when they are stimuli for competitive action selection. Atten. Percept. Psychol. 82, 2330–2342. doi: 10.3758/s13414-019-01868-4
Eriksen, B. A., and Eriksen, C. W. (1974). Effects of noise letters upon the identification of a target letter in a nonsearch task. Percept. Psychophys. 16, 143–149. doi: 10.3758/BF03203267
Eriksen, C. W., and Eriksen, B. A. (1979). Target redundancy in visual search: do repetitions of the target within the display impair processing? Percept. Psychophys. 26, 195–205. doi: 10.3758/BF03199869
Eriksen, C. W., and Schultz, D. W. (1979). Information processing in visual search: a continuous flow conception and experimental results. Percept. Psychophys. 25, 249–263. doi: 10.3758/BF03198804
Ghandi, S. P., Heeger, D. J., and Boynton, G. M. (1999). Spatial attention affects brain activity in human primary visual cortex. Proc. Natl. Acad. Sci. U.S.A. 96, 3314–3319. doi: 10.1073/pnas.96.6.3314
Hancock, S., and Andrews, T. J. (2007). The role of voluntary and involuntary attention in selecting perceptual dominance during binocular rivalry. Perception 36, 288–298. doi: 10.1068/p5494
Helmholtz, H. (1962). Helmholtz's Treatise on Physiological Optics. New York, NY: Dover Publications;.
Hillyard, S. A., and Anllo-Vento, L. (1998). Event-related brain potentials in the study of visual selective attention. Proc. Natl. Acad. Sci. U.S.A. 95, 781–787. doi: 10.1073/pnas.95.3.781
Hramov, A. E., Koronovskii, A. A., Makarov, V. A., Pavlov, A. N., and Sitnikova, E. (2015). Wavelets in Neuroscience. New York, NY; Heidelberg: Springer. doi: 10.1007/978-3-662-43850-3
Huguet, G., Rinzel, J., and Hupe, J. M. (2014). Noise and adaptation in multistable perception: noise drives when to switch, adaptation determines percept choice. Perception 14:19. doi: 10.1167/14.3.19
Jonides, J. (1980). Towards a model of the mind's eye's movement. Can. J. Psychol. 34, 103–112. doi: 10.1037/h0081031
Jonides, J. (1983). Further toward a model of the Mind's eye's movement. Bull. Psychon. Soc. 21, 247–250. doi: 10.3758/BF03334699
Lalor, E., Kelly, S., Pearlmutter, B. A., Reilly, R., and Foxe, J. (2007). Isolating endogenous visuo-spatial attentional effects using the novel visual-evoked spread spectrum analysis (VESPA) technique. Eur. J. Neurosci. 26, 3536–3542. doi: 10.1111/j.1460-9568.2007.05968.x
Lehky, S. R., and Blake, R. (1991). Organization of binocular pathways: modeling and data related to rivalry. Neural Comput. 3, 44–53. doi: 10.1162/neco.1991.3.1.44
Leopold, D. A., and Logothetis, N. K. (1999). Multistable phenomena: changing views in perception. Trends Cogn. Sci. 3, 254–264. doi: 10.1016/S1364-6613(99)01332-7
Logothetis, N. K., Leopold, D. A., and Sheinberg, D. L. (1996). What is rivalling during binocular rivalry? Nature 380, 621–624. doi: 10.1038/380621a0
Lu, Z. L., and Dosher, B. A. (1998). External noise distinguishes attention mechanisms. Vis. Res. 38, 1183–1198. doi: 10.1016/S0042-6989(97)00273-3
Luck, S. J. (2004). Understanding awareness: one step closer. Nat. Neurosci. 7, 208–209. doi: 10.1038/nn0304-208
Mangun, G. R., Buonocore, M. H., Girelli, M., and Jha, A. P. (1998). ERP and fMRI measures of visual spatial selective attention. Hum. Brain Mapp. 6, 383–389. doi: 10.1002/(SICI)1097-0193(1998)6:5/6<383::AID-HBM10>3.0.CO;2-Z
Martinez-Trujillo, J. C., and Treue, S. (2002). Attentional modulation strength in cortical area MT depends on stimulus contrast. Neuron 35, 365–370. doi: 10.1016/S0896-6273(02)00778-X
McAdams, C. J., and Maunsell, J. H. R. (1999). Effects of attention on orientation-tuning functions of single neurons in macaque cortical area V4. J. Neurosci. 19, 431–441. doi: 10.1523/JNEUROSCI.19-01-00431.1999
Meilikhov, E. Z., and Farzetdinova, R. M. (2019). Bistable perception of ambiguous images: simple Arrhenius model. Cogn Neurodyn. 13, 613–621. doi: 10.1007/s11571-019-09554-9
Mitchell, J. F., Stoner, G. R., and Reynolds, J. H. (2004). Object-based attention determines dominance in binocular rivalry. Nature 429, 410–413. doi: 10.1038/nature02584
Moore, C. M., and Egeth, H. (1998). How does feature-based attention affect visual processing? J. Exp. Psychol. 24, 1296–1310. doi: 10.1037/0096-1523.24.4.1296
Mordkoff, J. T., and Egeth, H. E. (1993). Response time and accuracy revisited: converging support for the interactive race model. J. Exp. Psychol. 19, 981–991. doi: 10.1037/0096-1523.19.5.981
Moreno-Bote, R., Rinzel, J., and Rubin, N. (2007). Noise-induced alternations in an attractor network model of perceptual bistability. J. Neurophysiol. 98, 1125–1139. doi: 10.1152/jn.00116.2007
Ooi, T. L., and He, Z. J. (1999). Binocular rivalry and visual awareness: the role of attention. Perception 28, 551–574. doi: 10.1068/p2923
Pavlov, A. N., Hramov, A. E., Koronovskii, A. A., Sitnikova, E. Y., Makarov, V. A., and Ovchinnikov, A. A. (2012). Wavelet analysis in neurodynamics. Phys. Uspekhi. 55, 845–875. doi: 10.3367/UFNe.0182.201209a.0905
Pisarchik, A. N., Chholak, P., and Hramov, A. E. (2019a). Brain noise estimation from MEG response to flickering visual stimulation. Chaos Soliton Fract. X. 1:100005. doi: 10.1016/j.csfx.2019.100005
Pisarchik, A. N., Jaimes-Reátegui, R., Magallón-García, C. D. A., and Castillo-Morales, C. O. (2014). Critical slowing down and noise-induced intermittency in bistable perception. Biol. Cybern. 108, 397–404. doi: 10.1007/s00422-014-0607-5
Pisarchik, A. N., Maksimenko, V. A., Andreev, A. V., Makarov, V. V., Zhuravlev, M. O., Frolov, N. S., et al. (2019b). Coherent resonance in the distributed cortical network during sensory information. Sci Rep. 9:18325. doi: 10.1038/s41598-019-54577-1
Posner, M. I. (1980). Orienting of attention. Q. J. Exp. Psychol. 32, 3–25. doi: 10.1080/00335558008248231
Posner, M. I., Nissen, M. J., and Ogden, W. C. (1978). “Attended and unattended processing modes: the role of set for spatial location,” in Modes of Perceiving and Processing Information, H. L. Pick, E. J. Saltzman (Hillsdale, NJ: Erlbaum), 137–157.
Posner, M. I., Snyder, C. R., and Davidson, B. J. (1980). Attention and the detection of signals. J. Exp. Psychol. 109, 160–174. doi: 10.1037/0096-3445.109.2.160
Prinzmetal, W., Amiri, H., Allen, K., and Edwards, T. (1998). Phenomenology of attention: 1. Color, location, orientation, and spatial frequency. J. Exp. Psychol. 24, 261–282. doi: 10.1037/0096-1523.24.1.261
Prinzmetal, W., McCool, C., and Park, S. (2005). Attention: reaction time and accuracy reveal different mechanisms. J. Exp. Psychol. 134, 73–92. doi: 10.1037/0096-3445.134.1.73
Prinzmetal, W., Nwachuku, I., Bodanski, L., Blumenfeld, L., and Shimizu, N. (1997a). The phenomenology of attention. 2. Brightness and contrast. Conscious. Cogn. 6, 372–412. doi: 10.1006/ccog.1997.0313
Prinzmetal, W., and Wilson, A. (1997b). The effect of attention on phenomenal length. Perception 26, 193–205. doi: 10.1068/p260193
Reynolds, J. H., and Desimone, R. (1999). The role of neural mechanisms of attention in solving the binding problem. Neuron 24, 19–29. doi: 10.1016/S0896-6273(00)80819-3
Reynolds, J. H., and Desimone, R. (2003). Interacting roles of attention and visual salience in V4. Neuron 37, 853–863. doi: 10.1016/S0896-6273(03)00097-7
Reynolds, J. H., Pasternak, T., and Desimone, R. (2000). Attention increases sensitivity of V4 neurons. Neuron 26, 703–714. doi: 10.1016/S0896-6273(00)81206-4
Saenz, M., Buracas, G. T., and Boynton, G. M. (2002). Global effects of feature-based attention in human visual cortex. Nat. Neurosci. 5, 631–632. doi: 10.1038/nn876
Santee, J. L., and Egeth, H. E. (1980). Interference in letter identification: A test of feature-specific inhibition. Percept. Psychophys. 27, 321–330. doi: 10.3758/BF03206121
Santee, J. L., and Egeth, H. E. (1982). Do reaction time and accuracy measure the same aspects of letter recognition? J. Exp. Psychol. Hum. Percept. 8, 489–501. doi: 10.1037/0096-1523.8.4.489
Shpiro, A., Moreno-Bote, R., Rubin, N., and Rinzel, J. (2009). Balance between noise and adaptation in competition models of perceptual bistability. J. Comput. Neurosci. 27, 37–54. doi: 10.1007/s10827-008-0125-3
Tadel, F. F., Baillet, S., Mosher, J. C., Pantazis, D., and Leahy, R. M. (2011). Brainstorm: a user-friendly application for MEG/EEG analysis. Comput. Intel Neurosci. 2011:879716. doi: 10.1155/2011/879716
Tang, A. C., Pearlmutter, B. A., Malaszenko, N. A., Phung, D. B., and Reeb, B. C. (2002). Independent components of magnetoencephalography: localization. Neural Comput. 14, 1827–1858. doi: 10.1162/089976602760128036
Tong, F., and Engel, S. A. (2001). Interocular rivalry revealed in the human cortical blind-spot representation. Nature 411, 195–199. doi: 10.1038/35075583
Treue, S. (2000). Neural correlates of attention in primate visual cortex. Trends Neurosci. 24, 295–300. doi: 10.1016/S0166-2236(00)01814-2
Treue, S. (2004). Perceptual enhancement of contrast by attention. Trends Cogn. Sci. 8, 435–437. doi: 10.1016/j.tics.2004.08.001
Virzi, R. A., and Egeth, H. E. (1985). Toward a translational model of Stroop interference. Mem. Cogn. 13, 304–319. doi: 10.3758/BF03202499
Keywords: magnetoencephalography (MEG), attention, visual perception, brain noise, wavelet analysis (WA)
Citation: Chholak P, Maksimenko VA, Hramov AE and Pisarchik AN (2020) Voluntary and Involuntary Attention in Bistable Visual Perception: A MEG Study. Front. Hum. Neurosci. 14:597895. doi: 10.3389/fnhum.2020.597895
Received: 22 August 2020; Accepted: 20 November 2020;
Published: 22 December 2020.
Edited by:
Giuseppe Giglia, University of Palermo, ItalyReviewed by:
Barak A. Pearlmutter, Maynooth University, IrelandDaniel Zaldivar, National Institutes of Health (NIH), United States
Copyright © 2020 Chholak, Maksimenko, Hramov and Pisarchik. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Parth Chholak, parth.chholak@ctb.upm.es