- 1State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
- 2Beijing Advanced Innovation Center for Future Education, Beijing Normal University, Beijing, China
- 3Siegler Center for Innovative Learning, Beijing Normal University, Beijing, China
- 4The Second High School Attached to Beijing Normal University, Beijing, China
Brain-behavior correlations are commonly used to explore the associations between the brain and human behavior in cognitive neuroscience studies. There are many critics of the correlation approach, however. Most problems associated with correlation approaches originate in the weak statistical power of traditional correlation procedures (i.e., the mean-wise interindividual brain-behavior correlation). This paper proposes a new correlation procedure, the item-wise interindividual brain-behavior correlation, which enhances statistical power via testing the significance of small correlation coefficients from trials against zero rather than simply pursuing the highest correlation coefficient. The item-wise and mean-wise correlation were compared in simulations and an fMRI experiment on mathematical problem-solving. Simulations show that the item-wise correlation relative to the mean-wise correlation results in higher t-values when signal-to-noise ratio is equal to or larger than 6%. Item-wise correlation displayed more voxels with significant brain-behavior correlation than did mean-wise correlation. Analyses with item-wise (rather than mean-wise) correlation showed significant brain-behavior correlation at the threshold of p < 0.05 corrected. Cross validation showed that odd- and even-ordered trials have greater stability in terms of the item-wise correlation (r = 0.918) than the mean-wise correlation (r = 0.686). The simulations and example analyses altogether demonstrate the effectiveness of the proposed correlation procedure for task neuroimaging studies.
Introduction
Neuroimaging researchers often use brain-behavior correlations to explore the association between the brain and human behavior. There are many critics of this approach, however. Problems seemingly inherent to correlation analysis include lack of correction for multiple comparisons (Kriegeskorte and Mur, 2012; Rousselet and Pernet, 2012) and confounding factors in the correlation (Lazic, 2010). Indeed, false correlations have been published in many journals (Vul et al., 2009a,b; Lazic, 2010; Rousselet and Pernet, 2012; Vul and Pashler, 2012). There have been initial attempts to address these problems (Wilcox, 2005; Pernet et al., 2012; Rousselet and Pernet, 2012), but much work remains to be done. The weak statistical power of the routine mean-based interindividual brain-behavior correlation approach is particularly problematic. Previous fMRI analyses of individual differences typically have very little power to detect all but the most powerful correlational effects (Yarkoni, 2009; Yarkoni and Braver, 2010).
This paper introduces a new correlation approach designed for enhanced statistical power. Traditional correlation involves first separately computing the mean values of all items (i.e., trials) on behavioral and brain responses for a participant, then performing correlation across the participants. The traditional correlation typically performed on mean values is referred to in this paper as “mean-wise correlation.” For example, consider 28 participants each of whom finishes 60 trials in an fMRI scanner. Following the procedure for traditional correlation, there are 28 mean values (e.g., reaction time, RT) and 28 brain maps with each voxel of each brain map involving the mean of brain activation from the 60 trials for all participants. The behavioral data (e.g., RT) correlate with the brain activation of each voxel across the 28 participants.
The new correlation approach is a two-step process. A correlation across participants is first performed on the behavioral measure and brain response for an item (i.e., a trial), then the r-values are tested via t-test or analysis of variance (ANOVA). The proposed correlation approach is performed on an item, so it is referred to here as the “item-wise correlation.” If the item-wise correlation is applied to the example above, there are 60 r-values which can be tested with a one-sample t-test against 0. The proposed analysis can be performed on a weak correlation due to the abundance of noise for a given item. The t-test or ANOVA can greatly enhance the statistical power in this scenario, possibly due to the r-values having removed much of the noise.
The traditional mean-wise and novel item-wise interindividual brain-behavior correlations are introduced below. Simulations based on the different signal-to-noise ratios of these two correlation approaches are then described to contrast their respective statistical power. Our fMRI study on mathematical problem-solving is discussed, as well as the application of item-wise and mean-wise correlations to reflect brain-behavior correlations. Cross-validation analysis further demonstrates the statistical stability of item-wise and mean-wise correlations. To conclude the paper, the interindividual correlation is also compared against the intraindividual correlation. For the example mentioned above (28 participants who finish 60 trials in an fMRI scanner), the interindividual brain-behavior correlation is computed first as the correlation between the behavioral score and brain activation for each item across all 28 participants, then a random effect test is run on all the 60 correlation coefficients against zero. The intraindividual correlation involves first computing the correlation between the behavioral score and brain activation for each participant across all 60 trials, followed by a random effect test on the 28 correlation coefficients against zero.
Item-Based and Mean-Based Interindividual Brain-Behavior Correlations
Most previous research on this subject center around mean-based interindividual brain-behavior correlation, in which the behavioral response and brain response averaged from each participant are correlated across participants. The Pearson correlation is commonly applied to correlation analysis, though outliers can easily expand or dilute the correlations (e.g., Vul et al., 2009a,b; Pernet et al., 2012). The t-test formula for the Pearson correlation in this context is as follows:
Cohen provided rules of thumb to characterize effect sizes as small, medium, or large; in the social sciences, r = 0.10 generally means small, 0.30 means medium, and 0.50 means a large effect size (Cohen, 1988, 1992) while r-values larger than 0.5 are extremely rare in most areas of psychology (Yarkoni and Braver, 2010). The typical effects across broad domains of psychology and medicine range from 0.1 to 0.3 (Meyer et al., 2001). Previous fMRI studies on emotion, personality, and social cognition have shown high brain-behavior correlation values (Vul et al., 2009a).
Magnetic resonance images or functional MRI studies are typically conducted with 30 participants. According to Formula (1), the t-value for the large effect size (r = 0.50) in 30 cases is still small – only t(28) = 3.06 with a corresponding p-value of 0.0048. The statistical power is not appropriate under a lenient but acceptable primary threshold (p < 0.001) in neuroimaging data analysis (Woo et al., 2014; Carter et al., 2016).
For any effect size (e.g., r = 0.50), the sample size can be determined according to the p-value. We determined the sample size for our threshold in three steps. First, the t-values were calculated for different sample sizes according to Formula (1); r was set to 0.50. Second, the Excel TDIST function was used to generate a table for the Student’s T-Distribution based on the t-values and different sample sizes (e.g., from 1 to 100). Finally, the sample size was determined according to the p-value in the table. For p < 0.001, the number of participants for r = 0.50 should be at least 40: t(38) = 3.56, p = 0.0009. For the threshold of p < 0.05 corrected, 0.05 should be divided by the number of brain voxels in the 3D whole-brain image by Bonferroni correction method.
The 3D whole-brain volume was set to 61 × 73 × 61 with reference to previously published fMRI studies. The number of brain voxels was 53,462. The p-value was determined to be 9.35 × 10-7 after correction and the sample size was 86 as per the Student’s T-Distribution table. That is, t(84) = 5.29, p = 9.32 × 10-7. It is difficult to obtain an ideal statistical result without a large number of participants. The item-wise interindividual brain-behavior correlation process involves pursuing high statistical power and statistical stability though the correlation coefficients may be very small. This type of correlation is performed at the item level; an “item” is a trial or a problem. When 28 participants finish 60 mathematical word problems in an fMRI scanner, there are 60 items or trials. First, the brain response of each item is modeled similarly to the first-level analysis in the item-wise brain activation analysis introduced by Bedny et al. (2007). Next, the correlation (e.g., Pearson approach) is computed for each item across participants. Finally, the r-values from all items are tested against a fixed number (e.g., 0) or compared to one another. The formula for the typical single-sample t-test is as follows:
where the t-value depends on sample size (i.e., the number of items), mean of r-values (X) and standard deviation of r-values (SD). μ0is 0 when examining whether brain and behavior are associated. The sample size has similar effect on Formulas (1) and (2). The r-value may be small for each item in the item-wise correlation; the t-value in Formula (2) could be very large when SD is very small. Thus, the item-wise correlation relative to the mean-wise correlation exacts greater statistical power by relying on possibly weak but stable correlations at the item level.
The one-sample sign test can be applied if the r-values for all items are not normally distributed. The r-values are first transformed into 0 if they are below the fixed number (e.g., 0), and into 1 if they are above the fixed number. A binomial distribution test can then be applied to compute the z score:
where n is the sample size (i.e., number of items), K is the number of the r-values below the fixed number, and P is the probability of r-value above the fixed number (e.g., P = 0.5).
Simulations for Item-Wise and Mean-Wise Correlations
We compared the two correlation approaches by simulation, as discussed below. All simulations were performed in MATLAB (R2014a, The MathWorks, Inc.). The simulations were first performed on random data (i.e., without any signal) with standard normal distribution. The mean is around 0 and SD is around 1.
The simulations were first performed on random values for two variables, then subjected to Pearson correlation. The sample size ranged from 2 to 1,200 with a step of 1. There were 1,000 samples for each sample size. The mean r-values for the 1,000 samples of each sample size in item- and mean-wise correlation are reported in Figure 1A. The data are random, so the mean r-value is very close to 0 in the item- and mean-wise correlations. The mean r-value for item-wise correlation is -0.00001; for mean-wise correlation it is 0.00014. Item-wise correlation relative to mean-wise correlation leads to smaller absolute r-values for 1,000 samples at each sample size (from 2 to 1,200), generally p < 2.95 × 10-125 (Figure 1B). The item-wise correlation also results in smaller absolute maximum or minimum r-values (Figure 1C). The t-values for each of the three types of r-values does not differ between the two correlation approaches.
FIGURE 1. Simulations of item-wise and mean-wise interindividual brain-behavior correlations on same random data (i.e., without any signal). (A) Mean r-values for 1,000 samples of each size in item- and mean-wise correlation. (B) Item-wise correlation relative to mean-wise correlation with smaller absolute r-values for 1,000 samples at each size. (C) Item-wise correlation also results in smaller absolute maximum or minimum r-value.
We performed another simulation on data involving signals. The signal-to-noise ratio in this simulation ranged from 0.01 to 1 with a step of 0.01. Again, there were 1,000 samples for each ratio. The sample size was assumed as 100 participants who finished 100 items. The x(100,100), y(100,100), and s(100,100) is represented a 100-by-100 array of pseudorandom values. The s(100,100) value was multiplied by the signal-to-noise ratio from 1% to 100%, resulting in signal (S). The signal was added into x(100, 100) and y(100, 100) as follows:
Thus, X represents one type of score (e.g., RT) of 100 participants finishing 100 items and Y represents another type of score (e.g., brain activation) of 100 participants finishing 100 items.
The r-values for item-wise correlation are smaller than the r-values for mean-wise correlation when the ratio is less than 40% (p < 0.05). No differences were observed beyond this value between the two correlations (Figure 2A). For ratios less than 5% (p > 0.05), the t-values we calculated do not differ between item-wise and mean-wise correlation. When the ratio is 6%, significance emerges at p < 0.05. As the ratio increases, the significance becomes more salient. When the ratio is 100%, the mean t-value for item-wise correlation is 10 times that for mean-wise correlation (Figure 2B).
FIGURE 2. Simulations of item-wise and mean-wise interindividual behavior-brain correlations on same data with signal-to-noise ratios from 1 to 100% (1,000 samples for each ratio). (A,C,E) r-Values for item-wise and mean-wise correlation. (B,D,F) t-Values for item-wise and mean-wise correlation.
The number of participants and the number of items always differ in a study. The simulations for fewer participants and more items (e.g., 20 participants and 100 items) and for more participants and fewer items (e.g., 100 participants and 20 items) are also presented in Figures 2C–F. The simulations resulted in similar patterns to those in the previous simulation (100 participants and 100 items).
The simulations effectively prove the validity of item-wise interindividual behavior-and-brain correlation analysis. We next directly compared the two approaches in a neuroimaging experiment on mathematical problem-solving. We sought to verify that the item-wise correlation reveals neural markers of cognition with sufficient statistical power.
Example Description and Data Preprocessing
Participants
Twenty-eight healthy, right-handed university students (11 male, 17 female) were recruited from Beijing Normal University, China. The average age of the subjects was 21.5 years-old within a range from 18.7 to 26.6 years old at the time of this study. They self-reported normal or corrected-to-normal eyesight, normal hearing, and lack of neurological or psychiatric illness. Subjects had no brain abnormality on their T1-weighted high-resolution magnetic resonance images (MRI) as determined by a neuroradiologist. Informed written consent was obtained from each subject after the procedures were fully explained. The experiment was approved by the State Key Laboratory of Cognitive Neuroscience and Learning at Beijing Normal University. Participants were compensated 100 RMB for their time.
Materials
Sixty mathematical word problems were presented to the participants; participants were required to compute the exact answer to each problem. The problems involved addition, subtraction, multiplication, or division and required either one or two steps to complete. For example: “A sister has 28 pencils, her brother has 10 pencils. How many pencils does she need to give to her brother so that they each have an equal amount?” One potential solution is to calculate how many more pencils the sister has than her brother (18 pencils) and then divide that difference in half (9 pencils). All the problems were presented randomly and self-paced during 60-s blocks.
Apparatus and Imaging Parameters
We conducted functional MRI (fMRI) analyses on a Siemens (Munich, Germany) 3T Trio scanner using an 8-channel phase array head coil. Participants laid supine in the scanner with their heads immobilized. A single shot, T2∗-weighted gradient-echo echo planar imaging (EPI) sequence was used for the fMRI scans with slice thickness of 6 mm and no gap between slices, in-plane resolution of 3.75 × 3.75 mm, and TR/TE = 3000 ms/30 ms. The field of view was 240 × 240 mm and the acquisition matrix was 64 × 64. Thirty contiguous axial slices parallel to AC-PC were acquired. Three-hundred and sixty-two images were acquired with a total scan time of 724 s in a single run. High-resolution T1-weighted anatomical images were also acquired for each participant (3D, gradient-echo pulse-sequence, TR/TE = 25 ms/6 ms, FOV = 220 mm × 220 mm, 89–92 contiguous slices, matrix = 220 × 220, thickness = 2 mm).
fMRI Scanning Procedures
The participants were given a practice session outside of the scanner prior to the fMRI analysis. In the scanner, mathematical problems were projected onto the center of a translucent screen and viewed by the participants through a mirror attached to the head coil. The stimuli were presented in black against a white background. The visual angle of each problem was less than 3° in both horizontal and vertical directions.
There was one run for the mathematical problem-solving task and a total of 16 blocks per run: eight blocks for mathematical problem-solving and eight blocks of visual fixation each lasting 30 s. The participants were encouraged to respond as quickly and accurately as possible by using their left or right index finger to press a button. After the participant responded, a new problem was presented after a blank of 1,000 ms. If the participants failed to respond within 20 s, the problem would disappear and a new one was presented. If there were 8 s or less left in a block, instead of presenting a new question, a fixation sign (“+”) was presented for the remainder of the block.
Neuroimaging Data Preprocessing
Neuroimage preprocessing was performed using SPM12 (Statistical Parametric Mapping, Welcome Department of Cognitive Neurology, London1). The functional data set acquired from the experiment consisted of 362 image volumes. Functional images were realigned to the first volume in the scanning session using affine transformations. A mean functional image volume was constructed for each participant from the realigned image volumes. This mean image volume was then used to determine the parameters for spatial normalization; these parameters were then applied to the corresponding functional image volumes for each participant. Spatial smoothing was performed on the normalized functional images using a Gaussian kernel 8 mm full-width at half-maximum (FWHM).
For the block-wise whole-brain analysis, statistical analyses were conducted on the smoothed data using a boxcar design with a canonical hemodynamic response function (HRF). A high-pass filter (186 s) was applied in order to remove low frequency effects and a low-pass filter (4 s) to remove the high frequency noise. The global temporal trend was removed. Contrasts of interests were calculated for each individual (below) and subjected to random effects analyses at the subject-based group level using one-sample t-tests.
For all contrasts of brain activation, we used the thresholds from the most lenient p < 0.05 uncorrected to p < 0.005, p < 0.001, to the most stringent p < 0.05 corrected with a minimum cluster size of 10 voxels.
Behavioral and Neuroimaging Results
Behavioral Results
The 28 participants responded to a total of 1,367 trials (math word problems). About 13.7% of the trials, or 187 of them, were responded to incorrectly. Of the incorrect responses, seven trials involved no keystrokes within the maximum RT of 20,000 ms. The average time for the trials with correct responses (2,726) was 8,064 ms, and the average time for incorrect responses (4,904) was 7,915 ms. There was no significant difference between the two types of trial responses: F(1,1365) = 0.372, p = 0.542, = 0.000. Only the trials with correct responses were entered into the RT correlation analysis.
Brain Activations
The whole-brain activation for mathematical problem-solving is shown in Figure 3. We identified four types of thresholds, 0.05 uncorrected, 0.005 uncorrected, 0.001 uncorrected, and 0.05 corrected, each with more than 10 voxels. Whole-brain activation was observed from two starting points: block-wise analysis and item-wise analysis. The block-wise analysis is the traditional analysis based on block design. The item-wise analysis involves computing the brain activation for each item (i.e., a trial or a math word problem), obtaining a brain map for each participant, and finally, conducting a random effect test. We found that both approaches have similar activation patterns regardless of positive or negative activation. Positive activation is typically located at the bilateral occipital, parietal, and frontal cortex. Negative activation is located at the default mode network including the angular gyrus, anterior temporal cortex, precuneus, and medial prefrontal cortex.
FIGURE 3. Brain activation for mathematical problem-solving from block-wise subject analysis and item-wise subject analysis. Four thresholds: 0.05 uncorrected, 0.005 uncorrected, 0.001 uncorrected, and 0.05 corrected level each with voxel size > 10, voxel in 3 mm × 3 mm × 3 mm. First row shows positive activation; second row shows negative activation in both block-wise subject analysis and item-wise subject analysis.
Item-Wise and Mean-Wise Correlation for Mathematical Problem-Solving
We used a Lilliefors test to determine whether our data (β value) satisfies normal distribution. The Lilliefors test is a normality test based on the Kolmogorov–Smirnov test (Lilliefors, 1969). We tested the activation (β value) of each voxel for all participants to find that 82.2% voxels satisfy normal distribution. The RT of each trial for all participants was also tested: 88.3% trials satisfy normal distribution. Therefore, the Pearson correlation method is applicable.
For the fMRI study wherein 28 participants finished 60 trials, the mean-wise correlation was computed on the 28 mean values of behavioral measure (i.e., RT) to obtain 28 brain activation maps with each voxel of each map involving the mean of brain activation based on the 60 trials. This process results in a t map where each brain voxel has a t-value.
In the item-wise correlation, we first computed the correlation between the RT of 28 participants and their brain maps for an item (resulting in 60 r-values for each brain voxel) and conducted a one-sample t-test on the 60 r-values. This analysis also results in a t map. The t maps for item-wise correlation and mean-wise correlation between brain activation (i.e., β value) and RT are shown in Figure 4, again with four thresholds: 0.05 uncorrected, 0.005 uncorrected, 0.001 uncorrected, and 0.05 corrected. Table 1 reports the brain regions with positive and negative brain-behavior correlations based on item-wise correlation analysis via Pearson method (Figure 4).
FIGURE 4. Significant clusters of item-wise and mean-wise interindividual behavior and brain correlation with voxel size > 10 (voxel in 3 mm × 3 mm × 3 mm).
TABLE 1. Brain regions with positive and negative brain-and-behavior correlations based on item-wise correlation at p < 0.05 corrected (voxel size > 10).
We observed positive correlations mostly at the SMA, right middle temporal gyrus, bilateral amygdala, medial superior frontal gyrus, bilateral fusiform gyrus, and bilateral superior temporal gyrus under the threshold p < 0.05 corrected. Negative correlations are generally located at the bilateral thalamus, bilateral cerebellum, bilateral inferior temporal gyrus extending to the inferior occipital gyrus and fusiform, bilateral middle frontal gyrus, left inferior frontal gyrus, and right postcentral gyrus under the threshold p < 0.05 corrected.
Table 2 shows the brain regions with positive and negative brain-behavior correlations based on traditional mean-wise correlation analysis (Figure 4). We found positive correlations mostly at the bilateral postcentral gyrus, right precentral gyrus, bilateral superior temporal gyrus, middle cingulum gyrus, supplementary motor area, and right fusiform gyrus under the threshold p < 0.05 uncorrected. Negative correlations were observed at the left precentral gyrus, left inferior parietal lobule, left superior occipital gyrus, and left inferior occipital gyrus extending to the inferior temporal gyrus under the threshold p < 0.05 uncorrected.
TABLE 2. Brain regions with positive and negative brain-and-behavior correlations based on mean-wise correlation at p < 0.05 uncorrected (voxel size > 10).
There were several brain regions that showed similar significant correlations both item-wise and mean-wise. The brain regions containing positive correlations include the bilateral precuneus, bilateral superior frontal gyrus, right postcentral gyrus, and left middle frontal gyrus; the same brain regions that showed negative correlations include the left precentral gyrus, left inferior parietal lobule, and left inferior temporal gyrus.
The relations of r-values in the two correlations are shown in Figure 5; the brain maps for the same r-values are shown in Figure 5A. We found that the r-value from item-wise correlation is generally smaller than that from mean-wise correlation (Figure 5B). The correlation between the r-values from the item-wise correlation and the r-values from the mean-wise correlation across all voxels in the whole brain results in a value of 0.81 (Figure 5C). We also found that 92.7% voxels (β value) positively or negatively correlated with RT under the threshold of p < 0.05 uncorrected by mean-wise correlation were observable via item-wise correlation analysis. Only 19.8% voxels (β value) positively or negatively correlated with RT under the threshold of p < 0.05 uncorrected by item-wise correlation analysis were observable via mean-wise correlation. That is, 80.2% of the voxels with significant correlation by item-wise correlation analysis were not observable by mean-wise correlation analysis (Figure 4). This suggests that item-wise correlation analysis reveals more significant correlations than mean-wise.
FIGURE 5. Relations of r-values for item-wise and mean-wise brain-behavior interindividual correlation and their correlation. (A) r-Value maps for item-wise and mean-wise correlation. (B) Relation of r-values at voxel level for the two correlation. Top: voxels ordered by coordinates. Middle: voxels ordered by item-wise r-values. Bottom: voxels ordered by mean-wise r-values. (C) Scatter map of r-values for two correlations.
Cross Validation
We ran a cross validation to assess the consistency of all the items contributing to the association between brain and behavior. The items were split into two halves according to their parity: odd-ordered trials and even-ordered trials. The item-wise correlation led to high consistency between the two trial halves (Figure 6). The same brain regions activated in the two correlations (odd- and even-ordered trials) is marked in yellow in the figures below. Table 3 reports the same brain regions with positive or negative brain-behavior correlation for odd- and even-ordered trials. The clusters of activation in the odd-ordered trials with p < 0.05 corrected were repeated in the clusters of activation in even-ordered trials.
FIGURE 6. Cross validation analysis on odd- and even-ordered trials (p < 0.05 uncorrected for both item-wise and mean-wise correlations).
TABLE 3. Same brain regions with positive or negative brain-behavior correlations for odd- and even-ordered trials (p < 0.05 uncorrected, voxel size > 10).
The AAL atlas is used here to show the statistical stability at brain region level. The correlation of activated voxel quantities in 116 regions of the AAL template is 0.918 and the mean-wise correlation is only 0.686 (Figure 7). The p-value is higher and r-value is lower in the mean-wise correlation than the item-wise, which suggests that mean-wise correlation analysis is less stable.
FIGURE 7. Number of voxels for odd- and even-ordered trials in AAL template (116 brain regions) for item-wise and mean-wise correlations (p < 0.05 uncorrected for both item-wise and mean-wise correlations).
Relation Between Interindividual and Intraindividual Approaches
Interindividual brain-behavior correlation tends to reflect the variations associated with individuals, but intraindividual brain-behavior correlation tends to reflect the variations associated with items. Both inter- and intraindividual correlations reflect variations in brain activation related to cognitive processing. An example for each approach is provided in Introduction section.
In our mathematical problem-solving experiment, we first performed intraindividual brain-behavior correlation for each individual across items/trials, then combined the correlation brain maps with single-sample t-tests against 0 (Figure 8). The intraindividual map and interindividual map was correlated with 0.306 according to the t-value at the voxel level for mathematical problem-solving. According to Cohen’s rules of thumb (1988, 1992), r in 0.306 is “medium” level; only 10% of the variation can be explained. The result suggests that the two correlations are independent.
FIGURE 8. Interindividual (top) and intraindividual (bottom) correlations in mathematical problem-solving (p < 0.001 uncorrected for both interindividual and intraindividual correlations).
We found that interindividual correlation is altogether more practical than intraindividual correlation. The ideal experimental design involves keeping the trials homogeneous, which greatly reduces the brain-behavior correlation. Conversely, recruiting participants across the widest possible range creates better sample distribution and more meaningful across-subject correlation.
Relation Between Brain Activation and Correlation
To determine the proportion of voxels with significant correlations in the positively or negatively activated brain voxels (see section “Brain Activations”), we set the most lenient threshold (p < .05 uncorrected) to cover the widest possible range of activated and associated brain regions. The relation is shown in Figures 9, 10.
FIGURE 9. Item-wise correlation in brain activation map (p < 0.05 uncorrected for both correlation and activation analysis, number of activated voxels exceeds 10).
FIGURE 10. Mean-wise correlation in brain activation map (p < 0.05 uncorrected for both correlation and activation analysis, number of activated voxels exceeds 10).
For the relation between brain activation and item-wise correlation, about 25.8% of brain voxels with the positive correlation were found in the positively activated brain and 51.3% in the negatively activated brain. About 76.3% of brain voxels with negative correlation were observed in the positively activated brain, but only 7.4% in the negatively activated brain. (Figure 9). For the relation between brain activation and mean-wise correlation, about 8.0% voxels that showed positive correlation were displayed in the positively activated brain and 74.5% in the negatively activated brain. About 96.9% voxels that showed negative correlation were observed in the positively activated brain, and no voxel was observed in the negatively activated brain (Figure 10).
In addition, our item-wise correlation analysis showed that the correlation of some voxels meets the threshold of p < .05 corrected. The relation between brain activation and the item-wise correlation was computed with the threshold of p < .05 corrected. Up to 33.9% of the brain voxels with positive correlation were observed in the positively activated brain and 27.0% in the negatively activated brain. Up to 98.8% of brain voxels with negative correlation were found in the positively activated brain, but no voxels were observed in the negatively activated brain.
According to the results of item-wise and mean-wise correlation analyses, negative correlation between brain activation and RT is generally found in the brain regions with positive activation (see section “Brain Activations,” “whole-brain activation analysis”). Positive correlation can be found in both positively and negatively activation brains.
Discussion
The goal of this study was to secure a novel, simple correlation procedure for brain-behavior association analysis across individuals and ultimately to shed light on how the brain processes information, as well as how individuals differ from one another in regards to cognition.
Traditional mean-wise individual brain-behavior correlation involves the pursuit of the highest possible correlation. Behavioral measurement techniques are limited, so the correlation cannot be especially high. Item-wise individual brain-behavior correlation, conversely, centers around the pursuit of weak but stable correlation coefficients. The weak correlation for an item arises due to an abundance of noise.
The statistical power in simulations on data-containing signals is enormously enhanced by this technique. Comparison between the two correlation approaches on any given example (e.g., mathematical problem-solving) also shows that only the item-wise correlation can meet the gold standard for activation detection. Cross validation also reveals stable correlation patterns under the item-wise correlation approach. The removal of noise during the correlation analysis of each item is why the item-wise correlation is so successful.
The linear regression model can be borrowed to demonstrate the working principle of item-wise correlation. The residual error (𝜀i) can be filtered out in the model. The regression coefficient in a simple linear regression can directly be transformed into the correlation coefficient r = β∗SDx/SDy. The mean-wise correction on random data (without any signal) results in larger absolute r-value, which indicates more false alarm error (type I error).
Enhanced statistical power lends strength to the application of correction for multiple testing in correlation analysis, which is otherwise a serious problem (Rousselet and Pernet, 2012). The present study was conducted to test the effectiveness of item-wise interindividual brain-behavior correlation for task fMRI studies. An fMRI experiment on mathematical problem-solving was run and analyzed with a typical item-wise correlation process. Task fMRI studies actually are a subset of neuroimaging studies – the question of whether item-wise correlation applies to other types of neuroimaging studies (e.g., on data collected outside the scanner) remains to be answered. We would assert that item-wise correlation analysis is indeed valid for studies with behavioral data collected outside the scanner. If there is a one-to-one corresponding relation between trials outside the scanner and inside (i.e., data can be paired), the item-wise analysis applies across the board. Behavioral data was collected outside an fMRI scanner (e.g., reaction time for 28 trials) and brain activation was collected as the participants completed 60 trials inside the fMRI scanner. There were no one-to-one corresponding relations between the sets of trials. The data (e.g., RT) for each trial collected outside the scanner can also correlate the brain activations of each trial inside the scanner. Average data of all trials for each participant collected outside the scanner can also correlate with the brain activations in each trial.
In a resting fMRI study, behavioral data is collected outside the scanner or collected in a secondary session inside the scanner. The behavioral data for each trial can be correlated with brain signals in the resting state. The resting state can also be separated into different time windows, e.g., with 15 to 30 TRs per window (Handwerker et al., 2012; Hutchison et al., 2013; Leonardi et al., 2013; Allen et al., 2014). The average data of all trials for each participant can be correlated with the brain signal for each time window in the resting state.
It should be noted that the proposed model (i.e., item-wise individual brain-behavior correlation) is actually a random-effects model on the items, and a fixed-effects model on subjects. The effect from statistical test can be generalized to new items from the same subjects. On the other hand, it is hard to make a prediction for new subjects that would be appended to the cohort.
Conclusion
The paper introduced a novel brain-behavior correlation method with markedly enhanced statistical power. The proposed method may effectively resolve problems inherent to traditional brain-behavior correlation techniques. Cross validation demonstrated the reliability of the item-wise interindividual correlation, which was also applied to a block-design neuroimaging study. The analysis technique discussed here could be extended to event-related design in future neuroimaging studies, as it can be used to conveniently and accurately model brain activation across multiple trials.
Ethics Statement
This study was carried out in accordance with the recommendations of guidelines from the Institutional Review Borad of BNU Imaging Center for Brain Research, State Key Laboratory of Cognitive Neuroscience And Learning. The protocol was approved by the Institutional Review Borad of BNU Imaging Center for Brain Research, State Key Laboratory of Cognitive Neuroscience and Learning.
Author Contributions
XZ proposed research questions and hypotheses, performed the data analyses, and wrote the manuscript. ML performed the data analyses and wrote the manuscript. HZ, LL, and JC performed the data analyses.
Funding
This research was supported by three grants from the Natural Science Foundation of China (31671151, 31600896, and 31521063), by a grant from the Advanced Innovation Center for Future Education (27900-110631111), and by a grant from Guizhou Provincial Engineering Research Center for Livelihood Big Data Comprehensive Application and Guiyang Berkeley Big Data Institute (GBIC2018001).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
References
Allen, E. A., Damaraju, E., Plis, S. M., Erhardt, E. B., Eichele, T., and Calhoun, V. D. (2014). Tracking whole-brain connectivity dynamics in the resting state. Cereb. Cortex 24, 663–676. doi: 10.1093/cercor/bhs352
Bedny, M., Aguirre, G. K., and Thompson-Schill, S. L. (2007). Item analysis in functional magnetic resonance imaging. Neuroimage 35, 1093–1102. doi: 10.1016/j.neuroimage.2007.01.039
Carter, C. S., Lesh, T. A., and Barch, D. M. (2016). Thresholds, power, and sample sizes in clinical neuroimaging. Biol. Psychiat. 1, 99–100. doi: 10.1016/j.bpsc.2016.01.005
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. J. Am. Stat. Assoc. 84, 19–74. doi: 10.1016/B978-0-12-179060-8.50012-8
Handwerker, D. A., Roopchansingh, V., Gonzalez-Castillo, J., and Bandettini, P. A. (2012). Periodic changes in fMRI connectivity. Neuroimage 63, 1712–1719. doi: 10.1016/j.neuroimage.2012.06.078
Hutchison, R. M., Womelsdorf, T., Gati, J. S., Everling, S., and Menon, R. S. (2013). Resting-state networks show dynamic functional connectivity in awake humans and anesthetized macaques. Hum. Brain Mapp. 34, 2154–2177. doi: 10.1002/hbm.22058
Kriegeskorte, N., and Mur, M. (2012). Inverse MDS: inferring dissimilarity structure from multiple item arrangements. Front. Psychol. 3:245. doi: 10.3389/fpsyg.2012.00245
Lazic, S. E. (2010). Relating hippocampal neurogenesis to behavior: the dangers of ignoring confounding variables. Neurobiol. Aging 31, 2169–2171; discussion 2172-2165. doi: 10.1016/j.neurobiolaging.2010.04.037
Leonardi, N., Richiardi, J., Gschwind, M., Simioni, S., Annoni, J. M., Schluep, M., et al. (2013). Principal components of functional connectivity: a new approach to study dynamic brain connectivity during rest. Neuroimage 83, 937–950. doi: 10.1016/j.neuroimage.2013.07.019
Lilliefors, H. W. (1969). On the Kolmogorov-Smirnov test for the exponential distribution with mean unknown. J. Am. Stat. Assoc. 64, 387–389. doi: 10.1080/01621459.1969.10500983
Meyer, G. J., Finn, S. E., Eyde, L. D., Kay, G. G., Moreland, K. L., Dies, R. R., et al. (2001). Psychological testing and psychological assessment. A review of evidence and issues. Am. Psychol. 56, 128–165. doi: 10.1037//OOO3-O66X.56.2.128
Pernet, C. R., Wilcox, R., and Rousselet, G. A. (2012). Robust correlation analyses: false positive and power validation using a new open source matlab toolbox. Front. Psychol. 3:606. doi: 10.3389/fpsyg.2012.00606
Rousselet, G. A., and Pernet, C. R. (2012). Improving standards in brain-behavior correlation analyses. Front. Hum. Neurosci. 6:119. doi: 10.3389/fnhum.2012.00119
Vul, E., Harris, C., Winkielman, P., and Pashler, H. (2009a). Puzzlingly high correlations in fMRI studies of emotion, personality, and social cognition. Perspect. Psychol. Sci. 4, 274–290. doi: 10.1111/j.1745-6924.2009.01125.x
Vul, E., Harris, C., Winkielman, P., and Pashler, H. (2009b). Reply to comments on“puzzlingly high correlations in fMRI studies of emotion, personality, and social cognition”. Perspect. Psychol. Sci. 4, 319–324. doi: 10.1111/j.1745-6924.2009.01132.x
Vul, E., and Pashler, H. (2012). Voodoo and circularity errors. Neuroimage 62, 945–948. doi: 10.1016/j.neuroimage.2012.01.027
Wilcox, R. R. (2005). Introduction to Robust Estimation and Hypothesis Testing. New York, NY: Elsevier Academic Press.
Woo, C. W., Krishnan, A., and Wager, T. D. (2014). Cluster-extent based thresholding in fMRI analyses: pitfalls and recommendations. Neuroimage 91, 412–419. doi: 10.1016/j.neuroimage.2013.12.058
Yarkoni, T. (2009). Big correlations in little studies: inflated fmri correlations reflect low statistical power: commentary on vul et al. Perspect. Psychol. Sci. 4, 294–298. doi: 10.1111/j.1745-6924.2009.01127.x
Yarkoni, T., and Braver, T. S. (2010). “Cognitive neuroscience approaches to individual differences in working memory and executive control: conceptual and methodological issues,” in Handbook of Individual Differences in Cognition, eds A. Gruszka, G. Matthews, and B. Szymura (New York, NY: Springer), 87–107.
Keywords: brain-behavior correlation, item-wise interindividual brain-behavior correlation, functional magnetic resonance imaging, cognitive neuroscience, mathematical problem-solving
Citation: Zhou X, Li M, Zhou H, Li L and Cui J (2018) Item-Wise Interindividual Brain-Behavior Correlation in Task Neuroimaging Analysis. Front. Neurosci. 12:817. doi: 10.3389/fnins.2018.00817
Received: 07 March 2018; Accepted: 19 October 2018;
Published: 06 November 2018.
Edited by:
Bertrand Thirion, Institut National de Recherche en Informatique et en Automatique (INRIA), FranceReviewed by:
Stavros I. Dimitriadis, Cardiff University School of Medicine, United KingdomVeena A. Nair, University of Wisconsin–Madison, United States
Copyright © 2018 Zhou, Li, Zhou, Li and Cui. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xinlin Zhou, emhvdV94aW5saW5AYm51LmVkdS5jbg==
†These authors have contributed equally to this work