Skip to main content

ORIGINAL RESEARCH article

Front. Neurosci. , 11 February 2025

Sec. Auditory Cognitive Neuroscience

Volume 19 - 2025 | https://doi.org/10.3389/fnins.2025.1536688

Behavioral representations within the endogenous dual attentional pathways during audiovisual integration processing

Zhongtian Guan&#x;Zhongtian Guan1Mingli Yan&#x;Mingli Yan2Miao HeMiao He1Yubo LiuYubo Liu1Zhixi Zhang
Zhixi Zhang3*Chunlin Li
Chunlin Li1*
  • 1School of Biomedical Engineering, Capital Medical University, Beijing, China
  • 2Institute of Large-Scale Scientific Facility, Beihang University, Hangzhou, Zhejiang, China
  • 3Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China

Introduction: Audiovisual integration processes are modulated by top-down attention and may involve different neural mechanisms under spatial, temporal, and neutral properties, which are externally manifested by subjects adopting different cognitive strategies. Composite behavioral indicators are able to assess the balance between speed and accuracy when subjects perform a task, thus further revealing behavioral representations of cognitive tasks. However, the cognitive strategies and neural mechanisms involved in audiovisual integration under endogenous attentional modulation are unclear, and in particular, the functional relationship between the dorsal and ventral pathways still needs to be thoroughly investigated.

Methods: We design only auditory (A), only visual (V), and audiovisual attention (VA) tasks based on the classical Posner paradigm with spatial, temporal and neutral cues for behavioral indicators, brain activation, and their correlations.

Results: Our results showed significant differences in behavioral performance between tasks, with weaker performance on the dual-channel task than on the single-channel task. The brain showed consistent activation in the frontal eye field, inferior parietal lobule, supplementary motor area, superior temporal gyrus, middle occipital gyrus and cuneus. The dorsal pathway is mainly associated with spatial processing and executive control, while the ventral pathway is involved in object recognition and semantic processing. In correlation analyses, proportions of correct responses (PC) showed a wider range of results. Spatio-temporal attention mobilized more cognitive resources than neutral attention in the audiovisual task and influenced execution strategies. Inverse efficiency score (IES) revealed endogenous attentional modulation of energy expenditure in the dual-channel task, whereas rate-correct score (RCS) revealed inter-task differences in correct response efficiency. Linear integrated speed–accuracy score (LISAS) and balanced integration score (BIS) showed different speed-accuracy balance in our task.

Conclusion: Our findings emphasize the synergy of functions and the dynamic synergy of cognitive processes in dorsal and ventral attentional pathways, which contribute to the management of cognitive flexibility and efficiency.

1 Introduction

Visual and auditory information are the two main sources of external information for humans, accounting for about 90 percent of the total (Treichler, 1967). The phenomenon of audio-visual integration exists everywhere in people’s lives, such as the integration of sound and subtitles when watching TV, the integration of speech and oral information when communicating with others, and the effective integration of information from the visual and auditory channels into a unified and coherent perceptual (Tang et al., 2016; Parker and Robinson, 2018). Audiovisual integration is able to integrate ambiguous information to make better inferences about external connections (Ernst and Bülthoff, 2004; Noppeney, 2021) and thus plays an important role in the human brain’s understanding of external information.

Audiovisual integration is influenced by many factors, including spatial (Nidiffer et al., 2016; Spence and Frings, 2020), temporal (Stevenson and Wallace, 2013), and attentional (Donohue et al., 2015; Mole, 2020) factors. Attention has top-down or bottom-up control over multisensory integration (Meyer et al., 2018), and endogenous, goal-driven attention, mediated by the dorsal attention network, is particularly effective in enhancing audiovisual integration under attended conditions (Talsma et al., 2007; Talsma et al., 2010). Whereas exogenous attention, often controlled by the ventral attention network, tends to override endogenous cues, resulting in reflexive spatial localization (Tang et al., 2016). These attentional processes guide the orientation of attention, with studies showing that the frontoparietal network, including both dorsal (for spatial attention) and ventral (for stimulus-driven attention) pathways attention (Landry et al., 2024), proposed by Posner (1980) in 1980, demonstrating that when cues produce short intervals between the cue and the target (< 250 ms), prior exogenous cues facilitated the subject’s response to the next target, but may also produce Inhibition of return (Van der Stoep et al., 2017). An fMRI study (Yan et al., 2015) measured cortical areas activated in the visual and auditory domains in a cue-target spatial attention paradigm. The results suggest that interactions between multisensory inputs can lead to enhancement or inhibition of cortical responses with top-down spatial attention. Thus audiovisual integration and attention are both important mechanisms, capable of recognizing the external world by enhancing sensory perception (Spence, 2010), and their interaction is particularly important (Talsma et al., 2010; Tang et al., 2016).

In the field of experimental psychology, it is common to use mean response time (RT) and proportions of correct responses (PC) to ideally quantify subject behavioral performance, however, it does not take into account the subject’s tendency to answer the questions, whether their strategy is to be more precise or faster at completing the task, which brings us to the speed-accuracy trade-off (SAT) (Luce, 1991; Heitz, 2014). SAT suggests that intra-or inter-subject trade-offs are often unpredictable (Gueugneau et al., 2017), For example, some researchers have focused on the ‘posterior response delay’ that occurs when subjects make a mistake (Thura et al., 2017). Thus, RT or PC are often very vague concepts, and by integrating them we can further focus on behavioral changes in one direction or another (Liesefeld et al., 2015). Inverse Efficiency Score (IES), Rate-Correct Score (RCS), Linear Integrated Speed-Accuracy Score (LISAS) and Balanced Integration Score (BIS) are important composite behavioral indicators for evaluating behavior in the field of psychology (Liesefeld and Janczyk, 2019).

Functional connectivity has been shown to correlate with transient mental states (Quinn et al., 2018), and the extent of its influence on task performance can also be established (Sadaghiani et al., 2015). However, this approach only reveals fixed intrinsic connectivity and behavioral properties at rest-state. In contrast, task-state fMRI allows for a more accurate characterization of behavioral states in a given cognitive direction and establishes a more direct link to behavioral contributions through local neural activity. Despite this, it remains unclear how different types of attention affect the behavioral strategies for audiovisual integration, particularly the shifting focus between accuracy and speed. Furthermore, how brain regions within the dorsal and ventral attention pathways fine-tune their functions and correlate with behavioral performance still requires further exploration.

Therefore, our study aims to establish the link between brain activity and behavioral representations of audiovisual integration under endogenous attentional conditioning. By combining basic and composite behavioral indicators, the study applies task-state fMRI to design three tasks—auditory, visual, and audiovisual—under three types of endogenous attention: spatial, temporal, and neutral, based on the classical Posner experimental paradigm. Behavioral and fMRI data were analyzed to explore differences in behavioral indicators and activated brain regions across the A, V, and VA tasks under spatial, temporal, and neutral attention conditions. The goal is to uncover the functional characteristics of the dorsal and ventral attention pathways in audiovisual integration and to analyze the key brain regions associated with behavioral indicators in the attention network, clarifying their regulatory mechanisms.

2 Materials and methods

2.1 Participants

52 young people aged 18–27 years with normal vision, hearing and cognitive abilities were recruited for this study, with the following inclusion criteria: right-handed, normal vision or corrected vision, normal hearing, and no history of brain injury; no history or diagnosis of psychiatric disorders; no use of psychotropic medications; and no contraindications to MRI (e.g., claustrophobia and metal implants). Considering the completeness of data collection, 49 of the subjects (mean age 22.7 years, 20 males, 29 females) were included in the analysis, including complete behavioral data and task-state fMRI data, which were collected from the radiology department of Beijing Youan Hospital affiliated with Capital Medical University. The study was approved by the Ethics Committee of Capital Medical University, and each subject was informed of the experimental details and signed a written informed consent form prior to the study, and received a certain amount of transport compensation.

2.2 Experimental paradigm

The cue and target stimulus were designed according to the classical Posner experimental paradigm (Posner et al., 1980), where a cue stimulus and a target stimulus are included in a trial. The visual display comprised a central cueing stimulus (1° eccentricity) and the bilateral white peripheral boxes (7° eccentricity) in which the visual target stimulus (“x”) appeared (Coull and Nobre, 1998). The additional auditory target stimulus was white noise from 20 to 20,000 Hz. Auditory and visual signals were generated in the experiment via Presentation 22.0.1 Spatial cue (S), temporal cue (T), and neutral cue (N) were included in the cued stimuli for this experiment (Li et al., 2012). Specifically, the spatial cue was designed as the highlighted half of the diamond in the middle of the screen, pointing to the left or right, to prompt participants to the possible location of the next target stimulus presented. The temporal cue was the small and large highlighted concentric circle in the middle of the screen, to call attention to the length of the gap between the cue and the target stimulus. The small circle stood for the interval of 300 ms, while the large circle stood for the interval of 1,500 ms. To remind participants to pay attention, all lines in the central screen were designed to be highlighted as a neutral cue.

The paradigm design is shown in Figure 1, with subgraph A showing that each task consists of 9 blocks, each of which in turn consists of 10 trials. There were peaceful black screen intervals between blocks in a random interval of 14 s, 16 s, 18 s, and 20s. All trials in each block had the same type of cue stimuli. Three groups of blocks are arranged in a cyclic sequence three times, and in order to eliminate the effect of the order of S, T and N: S, T, N, T, N, S, N, S, T. The temporal design of each trial is shown in Figures 1BD. Each trial was 4 s and was arranged by fixation point background, cue stimulus, background, target stimulus, and final background. The duration of the fixation point background was 1,500 ms, the cue duration was 100 ms, the interval between the cue and the target was either 300 ms or 1,500 ms, the target stimulus presentation was 50 ms, and the final background was either 2050 ms or 850 ms as a supplement to the total 4-s duration of the trial.

Figure 1
www.frontiersin.org

Figure 1. Task experimentation paradigm, exemplified by the VA task. (A) 9 blocks were acquired for each task, and the order of the blocks was S, T, N, T, N, S, N, S, T. S denotes spatial cues, T denotes temporal cues, and N denotes neutral cues. A random baseline gap of 14, 16, 18, and 20 s exists between every two blocks. (B) Presentation of each trial over time under spatial cues. (C) Presentation of each trial over time under temporal cues. (D) Presentation of each trial over time under neutral cues.

The A task gave signals to the auditory channel only; the V task gave signals to the visual channel only; and the VA task gave signals to both the visual and auditory channels. The experiment required subjects to draw attention to a cue stimulus when it appeared and subsequently make a button response when the target stimulus appeared, i.e., in the A task, subjects were required to press the button that indicated the side direction of sound appearance. In the V task, subjects were required to press the button that indicated the side direction of the appearance of the diagonal cross line. In the VA task, subjects were required to press the button that indicated the simultaneous appearance of the side direction response button for both sound and visual signals.

Participants were informed of all relevant details before the experiment and completed a pre-experiment session to familiarize themselves with the task paradigm. During the main experiment, they lay in the MRI scanner equipped with MRI-compatible audiovisual equipment, including earphones connected via audio cables to transmit auditory signals and eyepiece screens with interchangeable lenses connected to video cables to deliver visual stimuli. Participants held a custom response device in their right hand, with the index finger assigned to the left button and the ring finger to the right button, connected to an external computer outside the magnetic field. Synchronization was achieved via a trigger mechanism. Prior to each task, participant comfort was assessed and adjusted as necessary using MRI-compatible audiovisual communication equipment.

2.3 MRI data collection

The fMRI data were acquired using a 3.0 T MRI scanner (GE SIGNA Pioneer) with a 32-channel head coil. Functional images were collected with the following scanning parameters: repetition time (TR) of 2,000 ms, echo time (TE) of 30 ms, flip angle (FA) of 90°, slice thickness of 3.5 mm with no inter-slice gap, field of view (FOV) of 24 cm × 24 cm, matrix size of 64 × 64, across 36 slices. Additionally, T1-weighted high-resolution structural images covering the entire brain were obtained with a TR of 7.032 ms, TE of 3.016 ms, FA of 12°, number of excitations (NEX) of 1.0, FOV of 24 cm × 24 cm, matrix size of 256 × 256, slice thickness of 1.0 mm without inter-slice gap, yielding 192 slices. For participants with larger head sizes, adjustments were made as necessary.

2.4 The composite behavioral indicators

In the field of experimental psychology, Mean RT and PC are commonly used by researchers to quantify subject task performance. However, with the change of task mode, a single indicator cannot fully reflect the individual’s behavioral characteristics and cognitive state, so the composite performance indicators were introduced to comprehensively assess the subjects’ behavioral performance:

1. Inverse efficiency score (IES)

The IES (Bruyer and Brysbaert, 2011) is one of the most commonly used metrics to unify speed and accuracy and is defined as the average reaction time for a correct response divided by the percentage correct in Equation (1):

IESi,j=RTi,j¯PCi,j    (1)

where i denotes the subject and j denotes the task, RTi,j¯ indicates the mean RT for the i subject to respond correctly to the j task or condition; PCi,j denotes the correctness of the i subject under the j task. The average energy expended by subjects during the trial was characterized by the IES; the IES gradually increased with increasing cognitive difficulty.

1. Rate-correct score (RCS)

The RCS (Woltz and Was, 2006) is defined as the number of correct responses per unit time and Equation (2) is:

RCSi,j=NCi,jk=1ni,jRTi,j,k    (2)

where i denotes the subject and j denotes the task, NCi,j denotes the number of times the i subject responded correctly in the j task or condition, and the denominator is the sum of the RT of all trials of subject i in task j, the sum of the RT of all the ni,j trials of the i subject in the j task or condition. The RCS characterizes the production of correct response efficiency; the RCS decreases as cognitive difficulty increases.

1. Linear Integrated Speed-Accuracy Score (LISAS).

The LISAS (Vandierendonck, 2017) is based on a general linear model that linearly combines the mean RT with the correctness rate, which is formulated in Equation (3) as follows:

LISASi,j=RTi,j¯+SRTi,jSPEi,jPEi,j    (3)

where i denotes the subject and j denotes the task, RTi,j¯ denotes the mean RT of the i subject responding correctly in the j task or condition; PEi,j denotes the proportion error (PE) of subject i in condition j, with a value equal to 1-PC. SRTi,j and SPEi,j are the sample standard deviations for all trials of subject i under task j. The denominator is n, not the biased estimate n-1, calculated asSPE=PE1PE. Although both IES and RCS combine RT and PC into one metric and are easy to interpret (i.e., average energy consumed and efficiency of correct response), both are nonlinear, whereas LISAS provides a linear metric. And the larger the LISAS, the weaker the characterization of task switching capability.

1. Balanced integration score (BIS)

The BIS (Vandierendonck, 2018) is defined as the standardized correct rate minus the standardized mean RT, with the following Equation (4):

BISi,j=zPCi,jzRTi,j¯zxi,j=xi,jx¯Sx    (4)

where i denotes the subject and j denotes the task, where PCi, j denotes the rate of correctness of the i subject under the j task, and RTi,j¯ denotes the mean RT for the i subject to respond correctly under the j task or condition. zxi,j denotes the result after standardization of xi,j. Performance strengths and weaknesses, or the difficulty of conditions are characterized by BIS. As cognitive difficulty increases, BIS decreases (Posner et al., 1980).

As subjects completed the three tasks A, V, and VA, the computer automatically recorded the reaction time and correct situation for each trial. Based on the reaction time and correctness, the composite behavioral indicators under the three tasks of A, V and VA under S, T and N cues were calculated, respectively.

2.5 Data analysis

For behavioral data, two-way repeated measures ANOVA with three levels of each factor was used to compare the differences between the three tasks VA, V, and A under the three attentional cues at the p < 0.05 statistical level. The Geisser–Greenhouse method was applied for the correction to account for violations of sphericity, and Tukey’s test was used for post-hoc comparisons to control for multiple comparisons. This statistical approach allowed for the examination of the effects of task type and attentional cues on behavioral performance. For task-state fMRI data, SPM122 based on the MATLAB 2018a platform was used to perform preprocessing. Specifically, removing the first 6 time points to eliminate the effects of artefacts arising from magnetic field instability; temporal layer correction to remove the effects of temporal aberrations on the images; performing head movement correction to remove subjects with head motion translations greater than 3 mm and rotations greater than 3°; spatial normalization to resample to 3 × 3 × 3 mm3; and a 6 × 6 × 6 mm3 smoothing operation. With our sufficiently large number of subjects (49 available), low inter-subject variability, and high sample robustness, a full width at half maximum (FWHM) of 6 mm was effective in balancing spatial smoothing and signal fidelity and retaining sufficiently detailed information. In addition, the experimental design required observation of smaller brain regions contained in the bilateral attentional pathway, so a relatively small FWHM could reduce the loss of excessive smoothing (Mikl et al., 2008). The SPM toolkit was used to calculate the contrast of the two factors. Differences in brain activation for the three tasks A, V, and VA under the stimuli of S, T, and N were obtained separately. AlphaSim correction was used for multiple comparisons to achieve the correction thresholds correspond to a corrected p < 0.05 determined by the Monte Carlo simulation (cluster connectivity criterion rmm = 5) by a combination of a voxel-wise level of p < 0.01, with a minimum clusters >38 voxels with the program AlphaSim in AFNI (Ward, 2000). To further investigate the relationship between brain region activation levels and behavioral representations, Person correlation analyses of activation β-values for the three tasks VA, V, and A under each of the three attentional cues were performed with behavioral metrics using RESTplus (v1.303) which corrected also by AlphaSim in AFNI. To visualize the statistical results, Xjview10.0 was used for brain region labelling.

3 Results

3.1 Behavioral results

Calculated values for 6 indicators RT, PC, IES, RCS, LISAS and BIS are presented in Figure 2 and Table 1 as mean ± standard deviation according to the three tasks A, V and VA under S, T and N cues, respectively. A within-subjects group statistical analysis was performed using GraphPad prism 9.4.1 (Geisser–Greenhouse method for the correction, Tukey test, p < 0.05). In order to explore differences in behavioral performance between the VA and A, and VA and V tasks, the study was described separately in terms of spatial, temporal and neutral cues.

Figure 2
www.frontiersin.org

Figure 2. Results for 6 behavioral indicators. (A) RT behavioral results. (B) PC behavioral results. (C) IES behavioral results. (D) RCS behavioral results. (E) LISAS behavioral results. (F) BIS behavioral results. *Denotes p < 0.05, ** denotes p < 0.01, ***denotes p < 0.001, ****denotes p < 0.0001.

Table 1
www.frontiersin.org

Table 1. Means and standard deviations of behavioral indicators.

3.2 Differences in brain region activation

To compare the brain activation differences under the three tasks A, V and VA, the simple effects of audiovisual factors were analyzed under the three cues S, T and N, respectively. The locations of the activation difference brain regions are shown in Figure 3. Under spatial cues, the activated brain areas were mainly located in the Frontal Eye Field (FEF), Inferior Parietal Lobe (IPL), Supplementary motor area (SMA), Superior Temporal Gyrus (STG), Middle Occipital Gyrus (MOG), Cuneus (CUN). Activated brain regions were largely consistent with spatial cues under temporal and neutral cues.

Figure 3
www.frontiersin.org

Figure 3. Brain activation maps of the simple effects of audiovisual factors under spatial, temporal and neutral cues.

3.3 Correlation analysis of task activation with behavioral indicators

In order to investigate the correlation between task activation and behavioral indicators, Pearson correlation analyses were performed between the activation β-values of the three tasks, A, V and VA, and 6 behavioral indicators from spatial, temporal and neutral cues, respectively. The results are shown in Table 2. The brain regions of the A, V and VA tasks are shown in Figure 4.

Table 2
www.frontiersin.org

Table 2. Correlations between brain activation and behavioral indicators in three tasks with spatial cues.

Figure 4
www.frontiersin.org

Figure 4. Correlations between brain activation and behavioral indicators in three tasks with spatial cues. (A) A task. (B) V task. (C) VA task. SFG, Superior frontal gyrus; LING, Lingual gyrus; PHG, Parahippocampal gyrus; ANG, Angular gyrus; SPG, Superior parietal gyrus; ITG, Inferior temporal gyrus; IOG, Inferior occipital gyrus; TPOsup, Temporal pole: superior temporal gyrus; OLF, Olfactory cortex; PreCG, Precentral gyrus; SMG, Supramarginal gyrus; MOG, Middle occipital gyrus; ORBsup, Superior frontal gyrus, orbital part; PoCG, Postcentral gyrus; MTG, Middle temporal gyrus; MFG, Middle frontal gyrus. L, left hemisphere; R, right hemisphere. RT, Reaction Time; PC, Proportions of Correct Responses; IES, Inverse Efficiency Score; RCS, Rate-Correct Score; LISAS, Linear Integrated Speed-Accuracy Score; BIS, Balanced Integration Score.

The results of the correlation between brain activation and behavioral indices for the three tasks under the temporal cues are shown in Table 3. The brain regions for the A, V, and VA tasks are shown in Figure 5.

Table 3
www.frontiersin.org

Table 3. Correlations between brain activation and behavioral indicators in three tasks with temporal cues.

Figure 5
www.frontiersin.org

Figure 5. Correlations between brain activation and behavioral indicators in three tasks with temporal cues. (A) A task. (B) V task. (C) VA task. STG, Superior temporal gyrus; ITG, Inferior temporal gyrus; TPOsup, Temporal pole: superior temporal gyrus; IFGtriang, Inferior frontal gyrus, triangular part; PCUN, Precuneus; CUN, Cuneus; MOG, Middle occipital gyrus; IPL, Inferior parietal lobule; PoCG, Postcentral gyrus; FFG, Fusiform gyrus; SPG, Superior parietal gyrus; ORBinf, Inferior frontal gyrus, orbital part; SOG, Superior occipital gyrus; IOG, Inferior occipital gyrus; HIP, Hippocampus; PHG, Parahippocampal gyrus; PreCG, Precentral gyrus; ORBmid, Middle frontal gyrus, orbital part; LING, Lingual gyrus; CAL, Calcarine fissure and surrounding cortex. L, left hemisphere; R, right hemisphere. RT, Reaction Time; PC, Proportions of Correct Responses; IES, Inverse Efficiency Score; RCS, Rate-Correct Score; LISAS, Linear Integrated Speed-Accuracy Score; BIS, Balanced Integration Score.

The results of the correlation between brain activation and behavioral indices for the three tasks under neutral cues are shown in Table 4. The brain regions for the A, V, and VA tasks are shown in Figure 6 and Table 5.

Table 4
www.frontiersin.org

Table 4. Correlations between brain activation and behavioral indicators in three tasks with neutral cues.

Figure 6
www.frontiersin.org

Figure 6. Correlations between brain activation and behavioral indicators in three tasks with neutral cues. (A) A task. (B) V task. (C) VA task. SOG, Superior occipital gyrus; CUN, Cuneus; PCUN, Precuneus; SFGmed, Superior frontal gyrus, medial; ACG, Anterior cingulate and paracingulate gyri; THA, Thalamus; PoCG, Postcentral gyrus; PHG, Parahippocampal gyrus; ITG, Inferior temporal gyrus; MTG, Middle temporal gyrus; STG, Superior temporal gyrus; CAL, Calcarine fissure and surrounding cortex; IPL, Inferior parietal lobule; FFG, Fusiform gyrus; ANG, Angular gyrus; SFG, Superior frontal gyrus; LING, Lingual gyrus. L, left hemisphere; R, right hemisphere. RT, Reaction Time; PC, Proportions of Correct Responses; IES, Inverse Efficiency Score; RCS, Rate-Correct Score; LISAS, Linear Integrated Speed-Accuracy Score; BIS, Balanced Integration Score.

Table 5
www.frontiersin.org

Table 5. Summary of significant correlation analyses for three tasks with three attentional cues.

4 Discussion

In our study, we examined the relationship between brain activation and behavioral representations during audiovisual integration under endogenous attention, specifically focusing on how different types of endogenous attention—spatial, temporal, and neutral—affect behavioral strategies and modulate brain activity along the dorsal and ventral attention pathways. The auditory, visual, and audiovisual tasks based on the classical Posner experimental paradigm were designed. Behavioral data and task-state fMRI data were analyzed to explore differences in activation levels across subjects for the A, V, and VA tasks under different attention conditions. Additionally, we examined whether there were variations in brain region activation, particularly within the dorsal and ventral attention pathways, and explored correlations between activation levels and behavioral representations in each task. This approach highlights the focus on how endogenous attention modulates behavior and neural activity, emphasizing the functional roles of the dorsal and ventral attention pathways in audiovisual integration.

The Visual spatial cue (an arrow), first developed by Posner (1980), providing a spatial hint of the location of an upcoming target stimulus (Ristic and Kingstone, 2006), which are widely used to study the top-down spatial attention. We used 20% ineffective spatial cues to increase visual sensitivity (Solomon, 2004) to enhance attention-directing effects. We used circles as visual temporal cues that provide short or long-time intervals between the cue and the target stimulus, which would guide top-down time-related neural correlations. A study (Olk, 2014) showed that at around 300 ms, the study indicated the presence of a strong cueing effect, whereas greater than 1,500 ms, the cueing effect declined and a clear pattern of inhibition of return emerged. Thus temporal contextual differences in our experiments will affect the allocation of attention, revealing the importance of time-related implicit factors in attention (Olson and Chun, 2001). Neutral cues, on the other hand, are used as nonspecific processes to reveal purely attentional processes. The study demonstrated that we found extensive activation of FEF and IPL in brain activations for all three types of tasks, confirming that the dorsolateral frontoparietal (dFPN) network regulates top-down attention (Ptak et al., 2017) in visual and auditory targets with similarity in neural activation (Smith et al., 2010), suggesting that our experimental design successfully guided the attentional processes. We conducted a practice/training course with all subjects prior to the MR scan, which was conducted to ensure that subjects were familiar with the details of the tasks, and the correctness rate for all three tasks was above 95%, suggesting that the complexity and difficulty of the tasks were balanced. According to the analysis of the behavioral data on the process of audiovisual integration under endogenous attention, the results of the pairwise comparisons of the A, V, and VA tasks showed that there were significant differences in the psychological behavioral indicators including RT, IES, RCS, LISAS, and BIS, and that there was a significant difference in VA versus A and VA versus V for PC. Expressed in the VA task subjects responded more slowly and less correctly, expended more energy on average, responded less efficiently correctly, were less able to switch tasks, and were less able to make judgements about the difficulty of the task. Overall behavioral performance on the VA task was weaker. The classical MLE model assumes that visual and auditory signals of the same origin are integrated into a unified representation and weighted for sensory reliability, but sensory weights are subject to top-down modulation (Vercillo and Gori, 2015). The VA task is more complex than the single-channel tasks, as the brain must attend to information from both the visual and auditory channels, and then make a judgment about whether the stimuli are from the same side before issuing a response. This process is more complex and time-consuming, with sensory weights adjusted due to the spatial inconsistency that may arise in the VA task (Ernst and Banks, 2002; Rohe and Noppeney, 2018). This selective control reduces the gains from multiple senses, and being stimulated on different sides creates challenges in allocating attentional resources (Vatakis et al., 2007). Our findings highlight how the VA task requires different behavioral strategies, particularly in balancing speed and accuracy. The lower values in composite behavioral indicators like LISAS and BIS suggest that the increased complexity of integrating auditory and visual information leads to greater cognitive load, requiring more attention. The differences in these behavioral indicators emphasize the challenge of managing the trade-off between speed and accuracy in the VA task.

Differences in activated brain regions for the three tasks A, V, and VA under spatial, temporal, and neutral cues showed consistency in neural-level characterization, including the frontal eye field, inferior parietal lobule., supplementary motor area, superior temporal gyrus, middle occipital gyrus and cuneus brain regions. The frontal eye field, inferior parietal lobule and supplementary motor area belong to the dorsal frontoparietal network, an endogenous attentional task-activated brain region (Tang et al., 2016). These regions are involved in the regulation of spatial attention and task-related processing. The frontal eye field, inferior parietal lobule and superior temporal gyrus belonging to the association cortex, which processes information from visual & auditory channels and is responsible for audiovisual semantic integration (Li et al., 2016). The middle occipital gyrus and cuneus are primarily associated with visual information processing and belong to the primary visual cortex. These regions’ activation patterns highlight how attention networks, particularly the dorsal pathway, help allocate cognitive resources during task processing and facilitate the integration of audiovisual information.

In a correlation analysis of activation levels with behavioral indicators (Table 5), we used two basic behavioral metrics and four composite metrics and found that brain activity in multiple brain regions was behaviorally relevant and overlapped with task-related activation in three tasks, A, V, and VA, under spatial, temporal, and neutral cues. These overlapping brain regions suggest a direct link between their activity patterns during task performance and behavioral performance. The cuneus, lingual gyrus, and middle occipital gyrus are associated with visual information processing, and the inferior temporal gyrus and middle temporal gyrus are associated with visual information processing and belong to more advanced regions of visual processing. The angular gyrus and supramarginal gyrus are engaged in auditory information processing, with the angular gyrus also playing a role in decision-making by directing attention to relevant reward-related information in the visual environment (Studer et al., 2014). The fusiform gyrus, part of the multimodal association cortex, integrates sensory inputs from both visual and auditory channels. The frontal eye field, inferior parietal lobule, and superior parietal gyrus belong to the dorsal frontoparietal network and are involved in spatial attention and task-related focus (Chica et al., 2013), playing a crucial role in directing attentional resources. We found correlations with PC and BIS in the overlapping left Inferior parietal lobule, suggesting that IPL relies on accuracy in auditory tasks guided by visual endogenous attention, while top-down integration is enhanced in visual or audiovisual to balance overall task performance. The ventral attention network, including regions like the angular gyrus and superior temporal gyrus, is more stimulus-driven, helping the brain shift attention to relevant stimuli. We found in the overlapping bilateral Superior temporal gyrus that participation in endogenous attentional guidance in a single-channel task improves task completion by modulating information processing in a single modality to accomplish bottom-up allocation of attention. In addition, the hippocampus, parahippocampal gyrus, and temporal pole are associated with memory storage (Wagner et al., 2020; Setton et al., 2022), but also contribute to spatial perceptual learning and recognition of landmark objects (Jeffery, 2018; Sun et al., 2021), reflecting broader attentional modulation. These findings highlight how both dorsal and ventral attention pathways are engaged in managing cognitive load and balancing the integration of visual and auditory information.

In RT and PC correlations with brain activation we found broader significance results, with PC being more broadly significant than RT, suggesting that better PCs (higher grades) require more mobilization of perceptual resources than better RTs (faster speeds) (Helfrich et al., 2019; Hunter, 2021). In terms of RT (faster), we only observed a negative correlation of brain region activation accompanied by RT in neutral attention, which does suggest that increased information processing leads to slower reaction times when more brain activation intervenes, which may be related to the deliberate decision-making performed by the subjects (average task PC > 95%) (Vallesi et al., 2021; Schneider et al., 2022). Unlike single-channel stimuli, we observed differences in the correlation between RT and PC for spatial & temporal attention in the VA task, where increased brain activated areas resulted in increased RT and decreased PC, which is the opposite of neutral attention, and is manifested in areas dominated by sensory integration and visuo-spatial processing, suggesting that increased top-down spatio-temporal modulation mobilizes additional cognitive resources and thus affects the task performance strategy (Vidaud-Laperrière et al., 2022; Carmona et al., 2024). Improvements in task performance require greater mobilization of perceptual resources, while increased task complexity under different attentional modalities may alter an individual’s execution strategy, thereby affecting the relationship between reaction time and performance scores. Spatial and temporal attention are modulated, and individuals not only mobilize more cognitive resources, but may also respond to the challenges of the task by adjusting their task strategies.

In the IES we did not obtain a significance correlation in the A task, and in the V task we obtained an increase in spatial and neutral attention for the average energy consumed by the system over trials (Townsend, 1983) in the visual processing area. This exacerbates the energetic deepening of visual space in spatial attention, which differs from the monitoring function of visual targets in neutral attention, and follows the same trend as in VA, as evidenced by increased activation of visually and spatially relevant brain regions consuming energy in spatial and temporal attention, and energy savings in higher cognitive regions in neutral attention. In correlation with the RCS then characterized the efficiency of correct response trials, both of which negatively correlated with brain activation in the V task, as demonstrated by the processing of basic visual sensory information in spatial attention and the processing of higher cognition in temporal & neutral attention. This indicates that under spatial attention, the processing of basic visual information is associated with higher cognitive processing under temporal and neutral attention, suggesting that there are differences in top-down attentional modulation of cognitive strategies (Hahn et al., 2006; Li et al., 2012). Both LISAS and BIS are balanced for speed-accuracy, and the correlation differences between these two sets of metrics reveal differences between linear and nonlinear processing patterns in balancing cognitive resources in different brain regions. Our results reveal that the introduction of spatio-temporal attentional modulation in the A task increases visuo-spatial orienting processing would show accuracy, whereas in neutral attention it affects behavioral performance. This is the same in both indicators, whereas the LISAS and BIS show differential processing patterns after the addition of visual stimuli (V&VA task) and are completely reversed in the VA task, suggesting a dynamic modulation of the top-down attentional orientated processing audiovisual integration (Todd and Manaligod, 2018; Zeraati et al., 2023). Different attentional modalities not only modulate the brain’s workload, but also influence the efficiency of task performance, especially the balance between visuo-spatial and higher cognitive processing. Furthermore, audiovisual integration performance was affected by the modulation of top-down attention, suggesting that the brain optimizes task speed and accuracy through flexible resource allocation in the face of multimodal information.

5 Conclusion

Three audiovisual tasks based on the Posner experimental paradigm were designed to calculate behavioral indicators, brain region activation differences, and correlations between brain activation and behavioral representations under spatial, temporal, and neutral cues, respectively, for each task. Significant differences in behavioral performance were found across tasks, with the dual-channel task performing weaker than the single-channel task. Consistent and widespread brain activation was demonstrated in the frontal eye field, inferior parietal lobule, supplementary motor area, superior temporal gyrus, middle occipital gyrus and cuneus brain regions. In the correlations between behavioral indicators and brain activation, PC had broader differential results. In audiovisual tasks, spatio-temporal attentional modulation mobilizes additional cognitive resources and influences executive strategies compared to neutral attention. IES reveals endogenous attentional modulation of mean energy expenditure in dual-channel tasks, and the RCS reveals between-task differences in the efficiency of responding correctly. LISAS and BIS show different patterns of speed-accuracy balance in audiovisual tasks, indicating dynamic modulation of cognitive processes by attention networks. Dual attention pathways are involved in managing cognitive load, balancing the integration of visual and auditory information, and influencing task performance strategies.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The study was approved by the Ethics Committee of Capital Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

ZG: Conceptualization, Writing – original draft, Writing – review & editing. MY: Conceptualization, Writing – original draft, Writing – review & editing. MH: Writing – review & editing. YL: Writing – review & editing. ZZ: Writing – review & editing. CL: Conceptualization, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Natural Science Foundation of China (No. 62171300).

Acknowledgments

We express our gratitude to all those who provided assistance and thank every participant for their active cooperation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

References

Bruyer, R., and Brysbaert, M. (2011). Combining speed and accuracy in cognitive psychology: is the inverse efficiency score (IES) a better dependent variable than the mean reaction time (RT) and the percentage of errors (PE)? Psychologica Belgica 51, 5–13. doi: 10.5334/pb-51-1-5

Crossref Full Text | Google Scholar

Carmona, I., Rodriguez-Rodriguez, J., Alvarez, D., and Noguera, C. (2024). Inhibition and working memory capacity modulate the mental space-time association. Psychon. Bull. Rev. 31, 2634–2645. doi: 10.3758/s13423-024-02497-1

PubMed Abstract | Crossref Full Text | Google Scholar

Chica, A. B., Bartolomeo, P., and Lupiáñez, J. (2013). Two cognitive and neural systems for endogenous and exogenous spatial attention. Behav. Brain Res. 237, 107–123. doi: 10.1016/j.bbr.2012.09.027

PubMed Abstract | Crossref Full Text | Google Scholar

Coull, J. T., and Nobre, A. C. (1998). Where and when to pay attention: the neural systems for directing attention to spatial locations and to time intervals as revealed by both PET and fMRI. J. Neurosci. 18, 7426–7435. doi: 10.1523/JNEUROSCI.18-18-07426.1998

PubMed Abstract | Crossref Full Text | Google Scholar

Donohue, S. E., Green, J. J., and Woldorff, M. G. (2015). The effects of attention on the temporal integration of multisensory stimuli. Front. Integr. Neurosci. 9:32. doi: 10.3389/fnint.2015.00032

PubMed Abstract | Crossref Full Text | Google Scholar

Ernst, M. O., and Banks, M. S. (2002). Humans integrate visual and haptic information in a statistically optimal fashion. Nature 415, 429–433. doi: 10.1038/415429a

PubMed Abstract | Crossref Full Text | Google Scholar

Ernst, M. O., and Bülthoff, H. H. (2004). Merging the senses into a robust percept. Trends Cogn. Sci. 8, 162–169. doi: 10.1016/j.tics.2004.02.002

PubMed Abstract | Crossref Full Text | Google Scholar

Gueugneau, N., Pozzo, T., Darlot, C., and Papaxanthis, C. (2017). Daily modulation of the speed–accuracy trade-off. Neuroscience 356, 142–150. doi: 10.1016/j.neuroscience.2017.04.043

PubMed Abstract | Crossref Full Text | Google Scholar

Hahn, B., Ross, T. J., and Stein, E. A. (2006). Neuroanatomical dissociation between bottom–up and top–down processes of visuospatial selective attention. NeuroImage 32, 842–853. doi: 10.1016/j.neuroimage.2006.04.177

PubMed Abstract | Crossref Full Text | Google Scholar

Heitz, R. P. (2014). The speed-accuracy tradeoff: history, physiology, methodology, and behavior. Front. Neurosci. 8:150. doi: 10.3389/fnins.2014.00150

PubMed Abstract | Crossref Full Text | Google Scholar

Helfrich, R. F., Breska, A., and Knight, R. T. (2019). Neural entrainment and network resonance in support of top-down guided attention. Curr. Opin. Psychol. 29, 82–89. doi: 10.1016/j.copsyc.2018.12.016

PubMed Abstract | Crossref Full Text | Google Scholar

Hunter, C. R. (2021). Dual-task accuracy and response time index effects of spoken sentence predictability and cognitive load on listening effort. Trends Hearing 25:23312165211018092. doi: 10.1177/23312165211018092

PubMed Abstract | Crossref Full Text | Google Scholar

Jeffery, K. J. (2018). The hippocampus: from memory, to map, to memory map. Trends Neurosci. 41, 64–66. doi: 10.1016/j.tins.2017.12.004

PubMed Abstract | Crossref Full Text | Google Scholar

Landry, M., da Silva Castanheira, J., Raz, A., Baillet, S., and Sackur, J. (2024). A lateralized alpha-band marker of the interference of exogenous attention over endogenous attention. Cereb. Cortex 34:bhad457. doi: 10.1093/cercor/bhad457

PubMed Abstract | Crossref Full Text | Google Scholar

Li, C., Chen, K., Han, H., Chui, D., and Wu, J. (2012). An fMRI study of the neural systems involved in visually cued auditory top-down spatial and temporal attention. PLoS One 7:e49948. doi: 10.1371/journal.pone.0049948

PubMed Abstract | Crossref Full Text | Google Scholar

Li, Y., Wang, F., Huang, B., Yang, W., Yu, T., and Talsma, D. (2016). The modulatory effect of semantic familiarity on the audiovisual integration of face-name pairs. Hum. Brain Mapp. 37, 4333–4348. doi: 10.1002/hbm.23312

PubMed Abstract | Crossref Full Text | Google Scholar

Liesefeld, H. R., Fu, X., and Zimmer, H. D. (2015). Fast and careless or careful and slow? Apparent holistic processing in mental rotation is explained by speed-accuracy trade-offs. J. Exp. Psychol. Learn. Mem. Cogn. 41, 1140–1151. doi: 10.1037/xlm0000081

PubMed Abstract | Crossref Full Text | Google Scholar

Liesefeld, H. R., and Janczyk, M. (2019). Combining speed and accuracy to control for speed-accuracy trade-offs (?). Behav. Res. Methods 51, 40–60. doi: 10.3758/s13428-018-1076-x

PubMed Abstract | Crossref Full Text | Google Scholar

Luce, R. D. (1991). Response times: Their role in inferring elementary mental organization. Oxford: Oxford University Press.

Google Scholar

Meyer, K. N., Du, F., Parks, E., and Hopfinger, J. B. (2018). Exogenous vs. endogenous attention: shifting the balance of fronto-parietal activity. Neuropsychologia 111, 307–316. doi: 10.1016/j.neuropsychologia.2018.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

Mikl, M., Mareček, R., Hluštík, P., Pavlicová, M., Drastich, A., Chlebus, P., et al. (2008). Effects of spatial smoothing on fMRI group inferences. Magn. Reson. Imaging 26, 490–503. doi: 10.1016/j.mri.2007.08.006

PubMed Abstract | Crossref Full Text | Google Scholar

Mole, C. (2020). The role of attention in multisensory integration. Multisens. Res. 34, 337–349. doi: 10.1163/22134808-bja10025

PubMed Abstract | Crossref Full Text | Google Scholar

Nidiffer, A. R., Stevenson, R. A., Fister, J. K., Barnett, Z. P., and Wallace, M. T. (2016). Interactions between space and effectiveness in human multisensory performance. Neuropsychologia 88, 83–91. doi: 10.1016/j.neuropsychologia.2016.01.031

PubMed Abstract | Crossref Full Text | Google Scholar

Noppeney, U. (2021). Perceptual inference, learning, and attention in a multisensory world. Annu. Rev. Neurosci. 44, 449–473. doi: 10.1146/annurev-neuro-100120-085519

PubMed Abstract | Crossref Full Text | Google Scholar

Olk, B. (2014). Effects of spatial, temporal and spatiotemporal cueing are alike when attention is directed voluntarily. Exp. Brain Res. 232, 3623–3633. doi: 10.1007/s00221-014-4033-7

PubMed Abstract | Crossref Full Text | Google Scholar

Olson, I. R., and Chun, M. M. (2001). Temporal contextual cuing of visual attention. J. Exp. Psychol. Learn. Mem. Cogn. 27, 1299–1313. doi: 10.1037/0278-7393.27.5.1299

PubMed Abstract | Crossref Full Text | Google Scholar

Parker, J. L., and Robinson, C. W. (2018). Changes in multisensory integration across the life span. Psychol. Aging 33, 545–558. doi: 10.1037/pag0000244

PubMed Abstract | Crossref Full Text | Google Scholar

Posner, M. I. (1980). Orienting of attention. Q. J. Exp. Psychol. 32, 3–25. doi: 10.1080/00335558008248231

PubMed Abstract | Crossref Full Text | Google Scholar

Posner, M. I., Snyder, C. R., and Davidson, B. J. (1980). Attention and the detection of signals. J. Exp. Psychol. Gen. 109, 160–174. doi: 10.1037/0096-3445.109.2.160

Crossref Full Text | Google Scholar

Ptak, R., Schnider, A., and Fellrath, J. (2017). The dorsal Frontoparietal network: a Core system for emulated action. Trends Cogn. Sci. 21, 589–599. doi: 10.1016/j.tics.2017.05.002

PubMed Abstract | Crossref Full Text | Google Scholar

Quinn, A. J., Vidaurre, D., Abeysuriya, R., Becker, R., Nobre, A. C., and Woolrich, M. W. (2018). Task-evoked dynamic network analysis through hidden Markov modeling. Front. Neurosci. 12:603. doi: 10.3389/fnins.2018.00603

PubMed Abstract | Crossref Full Text | Google Scholar

Ristic, J., and Kingstone, A. (2006). Attention to arrows: pointing to a new direction. Q. J. Exp. Psychol. 59, 1921–1930. doi: 10.1080/17470210500416367

PubMed Abstract | Crossref Full Text | Google Scholar

Rohe, T., and Noppeney, U. (2018). Reliability-weighted integration of audiovisual signals can be modulated by top-down attention. eneuro 5, ENEURO.0315–ENEU17.2018. doi: 10.1523/ENEURO.0315-17.2018

PubMed Abstract | Crossref Full Text | Google Scholar

Sadaghiani, S., Poline, J.-B., Kleinschmidt, A., and D’Esposito, M. (2015). Ongoing dynamics in large-scale functional connectivity predict perception. Proc. Natl. Acad. Sci. 112, 8463–8468. doi: 10.1073/pnas.1420687112

PubMed Abstract | Crossref Full Text | Google Scholar

Schneider, D., Herbst, S. K., Klatt, L. I., and Wöstmann, M. (2022). Target enhancement or distractor suppression? Functionally distinct alpha oscillations form the basis of attention. Eur. J. Neurosci. 55, 3256–3265. doi: 10.1111/ejn.15309

PubMed Abstract | Crossref Full Text | Google Scholar

Setton, R., Mwilambwe-Tshilobo, L., Sheldon, S., Turner, G. R., and Spreng, R. N. (2022). Hippocampus and temporal pole functional connectivity is associated with age and individual differences in autobiographical memory. Proc. Natl. Acad. Sci. 119:e2203039119. doi: 10.1073/pnas.2203039119

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, D. V., Davis, B., Niu, K., Healy, E. W., Bonilha, L., Fridriksson, J., et al. (2010). Spatial attention evokes similar activation patterns for visual and auditory stimuli. J. Cogn. Neurosci. 22, 347–361. doi: 10.1162/jocn.2009.21241

Crossref Full Text | Google Scholar

Solomon, J. A. (2004). The effect of spatial cues on visual sensitivity. Vis. Res. 44, 1209–1216. doi: 10.1016/j.visres.2003.12.003

Crossref Full Text | Google Scholar

Spence, C. (2010). Crossmodal spatial attention. Ann. N. Y. Acad. Sci. 1191, 182–200. doi: 10.1111/j.1749-6632.2010.05440.x

PubMed Abstract | Crossref Full Text | Google Scholar

Spence, C., and Frings, C. (2020). Multisensory feature integration in (and out) of the focus of spatial attention. Atten. Percept. Psychophys. 82, 363–376. doi: 10.3758/s13414-019-01813-5

PubMed Abstract | Crossref Full Text | Google Scholar

Stevenson, R. A., and Wallace, M. T. (2013). Multisensory temporal integration: task and stimulus dependencies. Exp. Brain Res. 227, 249–261. doi: 10.1007/s00221-013-3507-3

PubMed Abstract | Crossref Full Text | Google Scholar

Studer, B., Cen, D., and Walsh, V. (2014). The angular gyrus and visuospatial attention in decision-making under risk. NeuroImage 103, 75–80. doi: 10.1016/j.neuroimage.2014.09.003

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, L., Frank, S. M., Epstein, R. A., and Peter, U. T. (2021). The parahippocampal place area and hippocampus encode the spatial significance of landmark objects. NeuroImage 236:118081. doi: 10.1016/j.neuroimage.2021.118081

PubMed Abstract | Crossref Full Text | Google Scholar

Talsma, D., Doty, T. J., and Woldorff, M. G. (2007). Selective attention and audiovisual integration: is attending to both modalities a prerequisite for early integration? Cereb. Cortex 17, 679–690. doi: 10.1093/cercor/bhk016

PubMed Abstract | Crossref Full Text | Google Scholar

Talsma, D., Senkowski, D., Soto-Faraco, S., and Woldorff, M. G. (2010). The multifaceted interplay between attention and multisensory integration. Trends Cogn. Sci. 14, 400–410. doi: 10.1016/j.tics.2010.06.008

PubMed Abstract | Crossref Full Text | Google Scholar

Tang, X., Wu, J., and Shen, Y. (2016). The interactions of multisensory integration with endogenous and exogenous attention. Neurosci. Biobehav. Rev. 61, 208–224. doi: 10.1016/j.neubiorev.2015.11.002

PubMed Abstract | Crossref Full Text | Google Scholar

Thura, D., Guberman, G., and Cisek, P. (2017). Trial-to-trial adjustments of speed-accuracy trade-offs in premotor and primary motor cortex. J. Neurophysiol. 117, 665–683. doi: 10.1152/jn.00726.2016

Crossref Full Text | Google Scholar

Todd, R. M., and Manaligod, M. G. (2018). Implicit guidance of attention: the priority state space framework. Cortex 102, 121–138. doi: 10.1016/j.cortex.2017.08.001

PubMed Abstract | Crossref Full Text | Google Scholar

Townsend, J. (1983). Stochastic modeling of elementary psychological processes. Champaign, IL: University of Illinois Press.

Google Scholar

Treichler, D. (1967). Are you missing the boat in training aids. Film AV Commun. 1, 14–16.

Google Scholar

Vallesi, A., Tronelli, V., Lomi, F., and Pezzetta, R. (2021). Age differences in sustained attention tasks: a meta-analysis. Psychon. Bull. Rev. 28, 1755–1775. doi: 10.3758/s13423-021-01908-x

Crossref Full Text | Google Scholar

Van der Stoep, N., Van der Stigchel, S., Nijboer, T., and Spence, C. (2017). Visually induced inhibition of return affects the integration of auditory and visual information. Perception 46, 6–17. doi: 10.1177/0301006616661934

PubMed Abstract | Crossref Full Text | Google Scholar

Vandierendonck, A. (2017). A comparison of methods to combine speed and accuracy measures of performance: a rejoinder on the binning procedure. Behav. Res. Methods 49, 653–673. doi: 10.3758/s13428-016-0721-5

PubMed Abstract | Crossref Full Text | Google Scholar

Vandierendonck, A. (2018). Further tests of the utility of integrated speed-accuracy measures in task switching. J. Cogn. 1:8. doi: 10.5334/joc.6

PubMed Abstract | Crossref Full Text | Google Scholar

Vatakis, A., Bayliss, L., Zampini, M., and Spence, C. (2007). The influence of synchronous audiovisual distractors on audiovisual temporal order judgments. Percept. Psychophys. 69, 298–309. doi: 10.3758/BF03193751

PubMed Abstract | Crossref Full Text | Google Scholar

Vercillo, T., and Gori, M. (2015). Attention to sound improves auditory reliability in audio-tactile spatial optimal integration. Front. Integr. Neurosci. 9:34. doi: 10.3389/fnint.2015.00034

PubMed Abstract | Crossref Full Text | Google Scholar

Vidaud-Laperrière, K., Brunel, L., Syssau-Vaccarella, A., and Charras, P. (2022). Exploring spatiotemporal interactions: on the superiority of time over space. Atten. Percept. Psychophys. 84, 2582–2595. doi: 10.3758/s13414-022-02546-8

PubMed Abstract | Crossref Full Text | Google Scholar

Wagner, K., Gau, K., Metternich, B., Geiger, M. J., Wendling, A. S., Kadish, N. E., et al. (2020). Effects of hippocampus-sparing resections in the temporal lobe: hippocampal atrophy is associated with a decline in memory performance. Epilepsia 61, 725–734. doi: 10.1111/epi.16473

PubMed Abstract | Crossref Full Text | Google Scholar

Ward, B. D. (2000). Simultaneous inference for fMRI data. In AFNI 3dDeconvolve Documentation. Medical College of Wisconsin. Available at: http://afni.nimh.nih.gov/pub/dist/doc/manual/AlphaSim.pdf

Google Scholar

Woltz, D. J., and Was, C. A. (2006). Availability of related long-term memory during and after attention focus in working memory. Mem. Cogn. 34, 668–684. doi: 10.3758/BF03193587

PubMed Abstract | Crossref Full Text | Google Scholar

Yan, T., Geng, Y., Wu, J., and Li, C. (2015). Interactions between multisensory inputs with voluntary spatial attention: an fMRI study. Neuroreport 26, 605–612. doi: 10.1097/WNR.0000000000000368

PubMed Abstract | Crossref Full Text | Google Scholar

Zeraati, R., Shi, Y.-L., Steinmetz, N. A., Gieselmann, M. A., Thiele, A., Moore, T., et al. (2023). Intrinsic timescales in the visual cortex change with selective attention and reflect spatial connectivity. Nat. Commun. 14:1858. doi: 10.1038/s41467-023-37613-7

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: fMRI, endogenous attention pathway, audiovisual integration, Posner cueing paradigm, behavioral representations

Citation: Guan Z, Yan M, He M, Liu Y, Zhang Z and Li C (2025) Behavioral representations within the endogenous dual attentional pathways during audiovisual integration processing. Front. Neurosci. 19:1536688. doi: 10.3389/fnins.2025.1536688

Received: 05 December 2024; Accepted: 27 January 2025;
Published: 11 February 2025.

Edited by:

Claude Alain, Rotman Research Institute (RRI), Canada

Reviewed by:

Yang Xi, Changchun University of Science and Technology, China
Zhilin Zhang, Chinese Academy of Sciences (CAS), China

Copyright © 2025 Guan, Yan, He, Liu, Zhang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhixi Zhang, emhhbmd6aGl4aUBhaXJjYXMuYWMuY24=; Chunlin Li, bGljaHVubGluMTk4MUAxNjMuY29t

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more