Is increased activation in the fusiform face area to Greebles a result of appropriate expertise training or caused by Greebles' face likeness?

Liu, Kuo; Chen, Chiu-Yueh; Wang, Le-Si; Jo, Hanshin; Kung, Chun-Chia

doi:10.3389/fnins.2023.1224721

ORIGINAL RESEARCH article

Front. Neurosci., 17 October 2023

Sec. Perception Science

Volume 17 - 2023 | https://doi.org/10.3389/fnins.2023.1224721

Is increased activation in the fusiform face area to Greebles a result of appropriate expertise training or caused by Greebles' face likeness?

Kuo Liu^1,2

Chiu-Yueh Chen^2,3

Le-Si Wang⁴

Hanshin Jo^2,5

Chun-Chia Kung^2,6^*

¹School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, China
²Department of Psychology, National Cheng Kung University, Tainan, Taiwan
³Brain & Cognition, Leuven Brain Institute, KU Leuven, Leuven, Belgium
⁴Institute of Creative Industries Design, National Cheng Kung University, Tainan, Taiwan
⁵Institute of Medical Informatics, National Cheng Kung University, Tainan, Taiwan
⁶Mind Research and Imaging (MRI) Center, National Cheng Kung University, Tainan, Taiwan

Background: In 2011, Brants et al. trained eight individuals to become Greeble experts and found neuronal inversion effects [NIEs; i.e., higher fusiform face area (FFA) activity for upright, rather than inverted Greebles]. These effects were also found for faces, both before and after training. By claiming to have replicated the seminal Greeble training study by Gauthier and colleagues in 1999, Brants et al. interpreted these results as participants viewing Greebles as faces throughout training, contrary to the original argument of subjects becoming Greeble experts only after training. However, Brants et al.'s claim presents two issues. First, their behavioral training results did not replicate those of Gauthier and Tarr conducted in 1997 and 1998, raising concerns of whether the right training regime had been adopted. Second, both a literature review and meta-analysis of NIEs in the FFA suggest its impotency as an index of the face(-like) processing.

Objectives: To empirically evaluate these issues, the present study compared two documented training paradigms Gauthier and colleagues in 1997 and 1998, and compared their impact on the brain.

Methods: Sixteen NCKU undergraduate and graduate students (nine girls) were recruited. Sixty Greeble exemplars were categorized by two genders, five families, and six individual levels. The participants were randomly divided into two groups (one for Greeble classification at all three levels and the other for gender- and individual-level training). Several fMRI tasks were administered at various time points, specifically, before training (1st), during training (2nd), and typically no <24 h after reaching expertise criterion (3rd).

Results: The ROI analysis results showed significant increases in the FFA for Greebles, and a clear neural “adaptation,” both only in the Gauthier97 group and only after training, reflecting clear modulation of extensive experiences following an “appropriate” training regime. In both groups, no clear NIEs for faces nor Greebles were found, which was also in line with the review of extant studies bearing this comparison.

Conclusion: Collectively, these results invalidate the assumptions behind Brants et al.'s findings.

Introduction

Face recognition is an enduring and intriguing research topic in psychology and visual neuroscience. This is partly because of its importance to our social lives and species survival (Rivolta, 2014) and partly because it showcases the intricate interplay between “nature” and “nurture” (Sugita, 2008; Arcaro et al., 2017). Since the first uses of PET and fMRI in research, studies in face recognition have identified several related processing regions (Sergent et al., 1992; Haxby et al., 1996). However, it was not until 1997, when the term fusiform face area (FFA) was coined (Kanwisher et al., 1997), that investigation of this area began in earnest. Since then, at least two opposing views (i.e., face specificity and perceptual expertise) have surfaced and continue to be discussed to this day (Gauthier and Tarr, 1997; Kanwisher, 2000, 2017; Tarr and Gauthier, 2000). As claimed by supporters of the domain-specific hypothesis, neural processing in the FFA is specific to faces (or face-like objects), less recruited by other objects, and more likely to be inherited by genetic endowment (Grimaldi et al., 2016). Conversely, proponents of the perceptual expertise hypothesis assert that experience plays a vital role in face-processing mechanisms. Faces are something most humans see every day from birth, making it incredibly likely that life-long experience with faces underpins our species' superb recognition capability. If this is true, the FFA should not only be activated in response to faces but also in response to “expert” object categories, such as car experts viewing cars or radiologists viewing X-ray images (Gauthier et al., 2000a).

To test and verify the expertise hypothesis, Gauthier and Tarr (1997) created Greebles—an artificial object set, for which they trained participants to recognize at various levels (i.e., sex, family, and individual). Participants were familiarized with various Greebles and fMRI-scanned before and after 8–10 training sessions. In one seminal fMRI study (Gauthier et al., 1999), three lines of evidence jointly supported the expertise hypothesis: specifically, (a) the decreased response times (RTs) for verification tasks with Greebles, to the point of statistical insignificance between RTs for identifying Greebles at the family level and those at the individual level; (b) the “surfacing” of the FFA by the “Faces vs. Objects” contrast before training, and by both “Faces vs. Objects” and the “Greebles vs. Objects” contrasts after training, suggesting that training “drove” Greeble selectivity in the FFA; and (c) comparison of activation levels before and after training found a neural inversion effect (NIE) closely associated with training (i.e., the sum of t-values in the right FFA for upright vs. inverted faces decreased, while for Greebles concurrently increased, and the gap of summed t-values between faces and Greebles significantly interacted). Along with studies of natural experts (Gauthier et al., 2000a; Xu, 2005), the perceptual expertise hypothesis has received decent support (Bukach et al., 2006). However, upon follow-up, some heated exchanges ensued (Grill-Spector et al., 2004; McKone and Kanwisher, 2005; Gauthier and Bukach, 2007; McKone et al., 2007; Op de Beeck and Baker, 2010a,b).

Among objections to the expertise hypothesis of the FFA, the most relevant was a study by Brants et al. (2011), who claimed that the FFA's increased response to Greebles was due to their resemblance to faces. Brants et al. claimed that, by replicating Gauthier's classic training paradigm (Gauthier and Tarr, 1997), they found NIEs in the FFA, as assessed by significantly higher blood-oxygen-level dependent (BOLD) activity for upright than for inverted stimuli (either faces or Greebles), both before and after training. In addition, Brants et al. found that either encouraging or discouraging subjects from seeing Greebles as “living individuals” or “objects” did not matter; all subjects reported perceiving Greebles as “face-like” after training. Based on the post-training interview and the NIE observed for faces and Greebles, both before and after training, Brants et al. concluded that it was Greebles' perceived face likeness, rather than acquired expertise, that drove the NIE, the presumed indicator of face selectivity for the FFA.

Upon closer inspection, however, four inconsistencies (two behavioral and two fMRI) between Gauthier et al. (1999) and Brants et al. (2011) could be identified. First, as the basic premise of a successful replication is an almost identical, or at least comparable, behavioral result (typically a prerequisite for the later fMRI findings), Brants et al. (2011) reported mean RTs for accurate verification trials of ~1,000 ms. This was nearly double the RTs (~500 ms) reported in the original study (figure 3 of Gauthier and Tarr, 1997, p. 1677). Although Brants et al. (2011), p. 3951, stated that “All aspects of the procedure were modeled after Gauthier and Tarr (1997), except the manipulation of inducing face likeness for half of the participants,” they nevertheless also stated “ten Greebles remained unknown throughout the training” (p. 3951), which happened neither in the study of Gauthier and Tarr (1997) nor in their study in 1999 (Gauthier et al., 1999). Such a “half-trained, half-untrained” arrangement of Greebles more resembles the training procedure of Gauthier et al. (1998), where the verification RTs for the two trained categories (e.g., sex and individual) differed significantly (ibid, figure 4B, p. 2407), and the mean verification RT of ~1,050 ms (averaging the former mean of ~800 ms and the later mean of ~1,300 ms) was also more consistent with those reported by Brants et al. (2011, figure 3b, p. 3953). These observations and reasoning suggest that Brants et al. (2011) adopted a different or variant of the training paradigm (i.e., Gauthier et al., 1998) than those originally used in Gauthier and Tarr (1997) and Gauthier et al. (1999). However, it would still be necessary to empirically compare the effects of different expertise training in the FFA/inferior occipitotemporal cortex to draw more precise conclusions. Second, Brants et al. (2011) used the “statistical equivalence between mean verification RTs between family and individual level” (Gauthier and Tarr, 2002) as the sole criterion for subjects to reach perceptual expertise. However, according to Tanaka and Gauthier (1997), two other implicit criteria also need to be met to jointly define “perceptual expertise,” which are as follows: (a) that the RTs between Greeble verifications at the family and those at the individual levels are supposed to be large and/or significantly different, with the latter usually longer; and (b) the RT differences would monotonically decrease to the point of near convergence or statistical insignificance. With only one criterion (i.e., statistical insignificance, as depicted by three arrows; sessions 1, 2, and 8, respectively; see Brants et al. (2011, figure 3b, p. 3953), it is unimaginable that novices could behave like experts at the beginning and at the end of, but not in between, the training sessions.

Third, while Gauthier et al. (1999) and Brants et al. (2011) reported behavioral training results and comparisons of inversion effects in the FFA for faces and Greebles, the former provided additional contrast images of category-selective regions. As shown in Gauthier et al. (1999, figure 4, p. 571), before training, the FFA could only be identified in the “Face vs. Object” contrast. After training, however, contrasts between “Face vs. Object” and “Greeble vs. Object” revealed the same FFA. If, according to Brants et al. (2011), participants interpreted Greebles as faces throughout training, further convincing support would be to similarly show FFA activations for “Face/Greeble vs. Object” contrast before and after training. While these were not seen in Brants et al. (2011), a counterargument to the “Greebles look like faces” claim could be that since most of us do sometimes consider Greebles “face-like” before (or without any) training, the fact that “Greeble vs. Object” contrast revealed FFA only after, but not before, training, as revealed in Gauthier et al. (1999), strongly suggest the necessity of sufficient training to shift processing from the basic to subordinate level (Gauthier et al., 1997, 2000b) as the crucial factor driving increased FFA activity for Greebles, not by Greebles' face-resemblance alone. Fourth, as both Gauthier et al. (1999) and Brants et al. (2011) observed NIEs in the FFA as the indispensable evidence for their respective claim, the reliability of NIE in the FFA as an index of face processing was, a bit surprisingly, hardly examined. Furthermore, the above two studies adopted different dependent measures: summed t-values (Gauthier et al., 1999) and average percent signal changes, or PSC (Brants et al., 2011), across FFA voxels. As behavioral inversion effects may not be face-specific (Valentine, 1988; Rossion and Gauthier, 2002), our extant literature search by keywords such as “FFA” and “face inversion” rendered 18 published articles (shown in Table 1). Not only did these articles show surprisingly inconsistent NIE effects at FFA but further funnel plots also suggested that the NIE in the FFA was never unanimously one-sided (larger for upright orientation). In light of the apparent unreliability of NIEs as an indicator of face-specific processing, it may be worthwhile to seek an additional neuronal measure of face/expertise processing.

TABLE 1

Table 1. Summary literature of the neural inversion effect of the face in FFA.

With these four comments, Gauthier and Tarr (1997), Gauthier et al. (1998, 1999), and Brants et al. (2011), the purpose of the present study was to revisit the two Greeble training regimes and to compare their differential interactions with the FFA. To show that FFA selectivity is not driven by Greebles' face likeness (but more likely by its subordinate-level processing), we chose asymmetric Greebles to minimize the effect of symmetry. If all putative Greeble selectivity in the FFA could still be observed for the asymmetric version, stimulus symmetry (i.e., a potential factor behind face likeness) becomes less important in shaping the selectivity of the FFA. Additionally, in the current experiment, both the FFA and lateral object complex (LOC) would be independently localized (c.f., Brants et al., 2011). Other experimental procedures, including the Greebles-Objects-Faces (GFO) tasks and the face/Greeble inversions manipulation, were kept as closely matched with those in both Gauthier et al. (1999) and Brants et al. (2011).

The domain-specific hypothesis predicts that Greeble training does not affect FFA but could be on another object-selective area (e.g., LOC). In contrast, the perceptual expertise hypothesis predicts that the “Greeble vs. Object” contrast should reveal the FFA only after training. We predicted that different training regimes would also yield different response profiles and different Greeble selectivities in the FFA accordingly. Regarding NIEs at the FFA, we are equivocal about the direction of NIE in the FFAs since the funnel plot suggests that positive, negative, or no different NIEs are all possible. Finally, because the expertise hypothesis predicts that if the FFA becomes similarly responsive to faces and Greebles after appropriate training, we would expect neural adaptation (Grill-Spector et al., 1998; Grill-Spector and Malach, 2001) or competition in the FFA: for example, reduced FFA activities for faces right after Greebles, but only after training, and only under the right training regime.

Materials and methods

Behavioral experiment

Participants

To justify the choice of subject number with the appropriate statistical power, we ran G^*Power with the estimated RT data: one-tailed between-group t-test, with effect size = 2 (RT = 2:1 in pre- vs. post-training), alpha = 0.05, power = 0.96, resulting in N = 7 in each group (N = 14), enough to show a reliable difference. We, therefore, recruited 16 NCKU undergraduate and graduate students (nine girls), all with normal or corrected-to-normal vision. After signing the informed consent approved by the NCKU Research Ethics Committee (REC case number 103-266), participants were randomly assigned into two separate training paradigms (n = 8 in each), which included three fMRI scans (before, during/halfway, and after training) and 10 sessions of behavioral training in between, lasting ~2 weeks. At the end of the training, all participants received a cash payment of ~NT$ 6,000 (or ~US$ 200).

Materials

The 60 Greeble exemplars (Figure 1) are categorized by two genders, five families, and six individual levels (also see Gauthier and Tarr, 1997, figure 1, p. 1674). Gender is defined by Greebles' upward or downward “appendages” (i.e., protrusions on the top and main body), the family by the five different body shapes, and the individual by the unique combination of body shape and the appendages. In addition, Figure 1 shows the meaningless syllables assigned to these asymmetric Greebles as their respective gender, family, and individual name, with all the initial letters different for the convenience of responses in the later naming task.

FIGURE 1

Figure 1. Thirty asymmetric Greebles were used in the present study, with each column representing a family (a total of 5 with family names above); each family contains six individuals each (names in between the two trained views), which are presented equally throughout the training process. There are three differences between Gauthier97 and Gauthier98: First, Gauthier97 has a family level to learn, whereas Gauthier did not; Second, Gauthier97 had 10 individual names to learn, while Gauthier98 had 20. Third, Gauthier98 added an unnamed label in the naming and verification task. All Greeble stimuli were downloaded from https://sites.google.com/andrew.cmu.edu/tarrlab/stimuli.

Both training regimes were performed in MATLAB 2010a on an iMac with a resolution of 1,024^*768. All Greeble models were presented in purple shade (Figure 1). The size of the presented images was ~6.5 cm high and 3.25 cm wide (nearly identical to Brants et al., 2011). Every participant saw Greebles in the center of the screen, which was placed at a distance of 60 cm, subtending an ~6.2° × 3.1° visual angle.

Procedure

The 16 participants were randomly divided into two groups (i.e., training regimes). The first group followed the training procedure of Gauthier and Tarr (1997) and Gauthier et al. (1999) (shortened to Gauthier97 hereafter), and the other followed those by Gauthier et al. (1998) and Brants et al. (2011) (shortened to Gauthier98 hereafter). As stated in Gauthier et al. (1998, p. 2403), there were three explicit procedural differences between the Gauthier97 and Gauthier98 groups. First, Gauthier97 participants were trained to classify Greebles at all three levels (gender, family, and individual), whereas Gauthier98 only included gender- and individual-level training. Second, Gauthier97 trained participants to discriminate 10 different Greebles (two from each gender and five from each family), as opposed to 20 in the Gauthier98 protocol. Finally, Gauthier97 had participants to perform only the verification task during training, while Gauthier98 required participants to alternate between the verification and naming tasks during most training sessions. Another outstanding but less clearly stated difference was the treatment of unnamed Greebles during training (also see Figure 1 of the present study). Of the 60 Greebles (or 30, if picking only one gender per family), only the chosen 10 were trained at the individual level in the Gauthier97 protocol, leaving the remaining 20 Greebles for the training at the gender and family level, respectively (10 for each). In contrast, in the Gauthier98 protocol, other than the 20 trained Greebles, the remaining 10 unnamed ones were mixed with 20 trained ones during training at both gender and individual levels, creating additional responses in both naming (pressing the letter “u” for “unnamed”) and verification (pressing spacebar for “null” responses) tasks. We believe, as we will also explain later in Figure 11, that these are the primary reasons driving the RT differences in the verification task between Gauthier98 and Gauthier97. Except for these differences, all other aspects of the training regimes, including the trial structure (e.g., time intervals, stimuli addition order (two per day for the first five training days, then level off the second 5 days), and a “beep warning sound after incorrect trials”), mimicked as closely as described in both Gauthiers' protocols. All 16 participants (eight in each training regime) completed 10 training sessions (~2 weeks), each lasting ~40 min. At the end of each naming/verification task block, mean accuracy and mean RT of accurate trials were shown on the screen. Finally, at the beginning and end of the training, we also asked each participant questions like “What do you think the Greebles look like?” and “Were the Greebles ever face-like to you at any time during training?”

For data analysis, accuracy and RTs for correct responses across each training group of participants (n = 8) were averaged. Throughout the study, we used the same definition of expertise criterion (Kanwisher et al., 1997; Gauthier et al., 2000a): namely, the statistical insignificance between correct RTs at the individual vs. the family (Gauthier97)/gender (Gauthier98) level in the verification task.

fMRI

Several fMRI tasks were administered at various time points: specifically, before training (1st), during training (2nd), and typically no <24 h after reaching the expertise criterion (3rd).

Greeble-Face-Object task

Present in both Gauthier et al. (1999) and Brants et al. (2011), the GFO task is necessary for both examination of training effects and cross-study comparisons. It requires subjects to passively view two runs (322 s; 161 volumes each) of 12 image blocks (each category repeated four times, 20 images per block, and stimulus-onset asynchrony of 1 s), sandwiched between 6-s fixation periods (first fixation lasting 10 s for dummy scan), in both before- and after-training fMRIs. Unique in the current protocol, to assess the relative interference or neural adaptation (e.g., face stimuli presented after the Greeble block), each stimulus category was with equal probability before and after the other two categories, so that the four face blocks would be twice after Greebles and twice after objects, with the same being true for both Greebles and objects.

NIE task

Also adopted from both Gauthier et al. (1999) and Brants et al. (2011, see Figure 2, p. 3952 for illustration), each NIE run contained four blocks of sequential matching trials (eight per block), where each trial consisted of fixation (800 ms), followed by consecutive presentations of stimulus (1 s for each image, separated by a 200 s mask in between). Four categories (Greebles/faces) by orientation (upright/inverted) combinations were block-randomized in each run, and fMRI data from five runs were collected both before and after the training session. Behavioral reaction time and accuracy inside the scanner were also recorded.

FIGURE 2

Figure 2. An example of GFO (Greeble, Face, Object) task used in fMRI scan. Stimuli were randomly presented for 1 s each during a 20 s block, while fixation blocks (6 s) in between each stimulus block. Gauthier used this for localizer but here different localizer task was used.

FFA localizer task

To independently assess the differential effects of training on GFO activities, we additionally ran an independent FFA localizer (from Peelen and Downing, 2005) for each participant. Participants performed a one-back identity judgment for faces, bodies, scrambled objects, and Greebles (arranged in +ABCD+, a pseudo-random fashion) for two runs. In addition to FFA, an object-selective region (i.e., the LOC) was circumscribed for companion analyses.

Scan parameters and analysis

The fMRI data were acquired from a GE750 3T scanner housed in the NCKU MRI Center, with a 32-channel head coil. Parameters were set to TR = 2,000 ms, TE = 33 ms, 64 × 64 matrix (so that FOV = 19.2 × 19.2 cm), 3 mm slice thickness (without gaps, making the voxel size = 3 × 3 × 3 mm³), and 40 axial slices, covering the whole brain.

Data were analyzed by BrainVoyager QX (BVQX v2.6) and NeuroElf 1.0 (http://neuroelf.net) under MATLAB 2018a. Functional data preprocessing included slice-time correction and motion correction. The resulting functional (T2^*) images were coregistered (with the normalized T1 image in the Talairach space) and then transformed into 3D volume time course (VTC) data, which was then entered into a generalized linear model (GLM) and contrast analysis. Cross-session alignments of the anatomical images were performed for the first session (before training) so that all subsequent volumes could be cross-compared. For the ROI analysis, the two-run localizer data first entered the GLM. Then, the “Faces vs. Objects” contrast was applied, with the FFA defined individually and exclusively in the right hemisphere (FFA was located in the Talairach space of X: 40–50; Y: −45 to −55; Z: −10 to −20, with a corrected threshold of p < 0.5). The summed t-values and FFA-averaged PSC were extracted and compared with the earlier findings. Finally, for the multi-voxel pattern analysis in the searchlight fashion, the Princeton MVPA toolbox (https://pni.princeton.edu/pni-software-tools/mvpa-toolbox) and the CoSMoMVPA (https://www.cosmomvpa.org/) were implemented (Oosterhof et al., 2016), with the default setup, and averaged individual classification accuracy (commonly across any two conditions) maps into one-tailed t-maps (see uploaded results in NeuroVault).

Results

Behavioral results

Figure 3 summarizes the mean accuracy (left panel) and RTs (right panel) by session results of the verification tasks in the two Greebles training regimes (Gauthier97 on top). For both Gauthier97 and Gauthier98, the gender accuracy was all around 90% after the first session [F_(1,14) = 0.145, p = 0.71, η² = 0.01]. As for the accuracy at the family and the individual levels, participants all improved with training [comparing two groups' individual-level accuracy between the first and the last(tenth) session, F_(1,30) = 108.35, p <0.01, η² = 0.78]. These accuracy results (Figures 3A, C) suggest that all participants could accurately identify Greebles on the designated (gender, family, or individual) level at the end of training.

FIGURE 3

Figure 3. (A) Mean accuracy of Gauthier97 group (n = 8) for the three types of category level (gender, family, and individual levels) of the verification task; (B) Mean response time (in milliseconds) of Gauthier97 on the same verification task. The response time of three category levels was statistically insignificant from Day 4. (C) Accuracy of Gautheir98 group (n = 7) for the two types of category level (gender and individual levels). (D) Mean response time (in milliseconds) of Gauthier98 in the verification task. Note that the individual-level RT combined both named and unnamed Greebles and the similarity between our (D) and figure 3B (p. 3953) of Brants et al. (2011).

In terms of RT, although in Figure 3B (Gauthier97) the mean response times in the individual level were longer than those in the family and the gender levels [the 1st ~5th session: F_(2,21) = 3.55, p = 0.047, η² = 0.253], with training the RT difference became closer and finally converged from the sixth all the way to the 10th session [the last 5 F_(2,21) = 0.07, p = 0.929–0.932, η² = 0.0066–0.0069]. According to the canonical definition of Greeble expertise (Gauthier and Tarr, 1997; Gauthier et al., 1997) (usually focusing on the statistical insignificance between verification RTs of family and individual levels), our Gauthier97 subjects have become experts since the 6th session and kept practicing for 4 more sessions.

In contrast with Figure 3B, the Gauthier98 RT results (Figure 3D) were quite different: not only that the mean RT was about 1,000 ms (compared to the mean final RT in Figure 3B, around 500 ms) but there also seemed no trend of convergence throughout Gauthier98 training. Strikingly, Figure 3D results were similar to the training results reported by Brants et al. (2011, figure 3b, p. 3953). Inspecting their reported procedure: “Participants were trained following the procedure used by Gauthier et al. (1997).” (ibid, p. 3951), we found the match between Brants et al.'s figure 3b (p. 3953) and figure 3d (plus the mismatch with the figure 3b) of the current study, lending support to the idea that Brants et al. (2011) misquoted the Gauthier98 as Gauthier97 training paradigm. Their following methodological descriptions “The participants learned the family labels and individual names for five Greebles in the first session and then learned five more Greebles in the following three sessions. Ten Greebles remained unknown throughout the train, which made the tasks more difficult” (ibid., 3951) further verifying our suspicion (of their mis-adoption).

Two follow-up observations remain. First, since our Gauthier98 results mimicked those of Brants et al. (2011), how about the replications between our Gauthier98 results and those from the original Gauthier et al. (1998), especially the gender and named/unnamed individual-level training RTs? For this comparison (figure 4b, ibid, p. 2407; and the similar plot of the current study, shown in the lower right corner of Figure 4), one sees a slightly different picture: While the individual-named Greebles RTs gradually converged with the gender RTs in both studies [similar to what was found in Gauthier et al. (1998)], our individual-unnamed Greeble RT, while also not converged with the mean gender RT, was gradually decreased with training [unlike what was shown in figure 4b of Gauthier et al. (1998), where the unnamed RT was constantly high]. Further comparison between our individual training RT plots (Figure 4), alongside the similar figure 5 of Gauthier et al. (1998, p. 2408), revealed that one likely explanation was the individual variability inherent in the small samples of both the original (N = 12) and the current replicated Gauthier98 (N = 8) study, respectively. Overall, both studies shared the converging finding between the gender and the individual-named RTs but no convergence between the gender and individual-unnamed RTs [figure 4 of Gauthier et al. (1998), and lower right of Figure 4 in this study]. This inability to converge between the two (here individual-combined and gender) levels, a signature of perceptual expertise (Tanaka and Gauthier, 1997), renders the Gauthier98 training regime relatively unfit for the behavioral training of Greeble experts.

FIGURE 4

Figure 4. Performance in the verification task throughout training for each expert participant in the Gauthier98 group. While the individual-named Greebles RTs gradually converged with the gender RTs, our individual-unnamed Greebles also gradually decreased, but not converge, with training. One potential explanation was the individual variability inherent in the small samples. Overall, this inability to converge between the two levels renders the Gauthier98 training regime unfit for the behavioral training of Greeble experts.

Second, there was an expertise-reaching RT difference between the two training regimes: In Gauthier97, it was ~500 ms at the end of expertise training, whereas in Gauthier98, the end RT hovered ~1,000 ms. Such a two-fold RT difference (500 ms in Gauthier97 vs. 1,000 ms in Gauthier98) likely resulted from the additional processing steps in Gauthier98. Figure 11 (in discussion) lists two of the possible stages of such putative decisions Greeble trainees faced during the training processes: one upon the label presentation and another upon Greeble presentation. With the additive factors logic (Sternberg, 1969), the 500-ms gap between the two training regimes could be reasonably explained.

In sum, the converging RT results with Gauthier97 training, as shown in Figure 5, in contrast to the no-converging results with the Gauthier98 paradigm, suggest that only the Gauthier97 group, and only after training, were the participants successfully achieving the criterion of perceptual expertise. We, therefore, concluded that the expertise training paradigm of Gauthier97 was more robust and, therefore, a more “appropriate” training regime (than that of Gauthier98, or Brants et al., 2011) in generating Greeble experts.

FIGURE 5

Figure 5. Performance in the verification task throughout training for each expert participant in the Gauthier97 group. Convergence happened among the trends of the mean response time of gender, family, and individual name. Only in the Gauthier97 group, and only after training, which reflects modulation of the expertise category following the “appropriate” training regime.

fMRI results: ROI (FFA) analyses

To distinguish different accounts of the FFA activities, namely the perceptual expertise vs. the face likeness, three proposed ROI (FFA) analyses were tested here, all between Gauthier97 and Gauthier98 training regimes: (a). the comparisons among upright faces, objects, and Greebles, between before and after training; (b). the neuronal inversion effect (NIE): the upright vs. inverted Greebles and faces, across before, during, and after training; and (c). the neural adaptation effects for both faces and Greebles, again before vs. after training. The perceptual expertise (PE) hypothesis of the FFA predicts that (a) FFA responses to Greebles would increase significantly from the before to after training, but no such effect for faces or objects, and only be so in the appropriate (e.g., Gauthier97) training paradigm; and (c) a significant increase in Faces after Greebles (e.g., neural adaptation) only after training, and under the appropriate, or Gauthier97, paradigm, as well. In contrast, the FFA face-specificity hypothesis, according to Brants et al. (2011), would predict: (b) an NIE for faces both before and after training, but not for asymmetric Greebles (given their much less, if not zero, face likeness), given the premise that NIE being a consistent and reliable index of face-related processing. This assumption will be examined in the funnel plot of the extant literature.

ROI (FFA) analysis

The results of FFA activities for faces, objects, and Greebles before and after training, as well as for both Gauthier97 and Gauthier98, are shown in Figure 6. For the “Faces vs. Objects” comparison, the main effects were found for both before [F_(1,110) = 301.29, p < 0.01] and after training [F_(1,110) = 195.64, p < 0.01]. No training effect (after vs. before training) was found for faces in the FFA [F_(1,110) = 0.1353, p = 0.71]. Similarly, the comparison of “Greebles vs. Objects” yielded a main effect both before [F_(1,110) = 6.17, p = 0.014] and after training [F_(1,110) = 22.97, p < 0.01]. More importantly, in the Gauthier97 paradigm, there was a significant difference for Greeble activations before vs. after training [F_(1,110) = 7.31, p < 0.01], indicating the Greeble training effect on the FFA (see Figure 6 left).

FIGURE 6

Figure 6. Percent signal changes for faces, objects, and Greebles in the FFA. Error bars were standard errors. Shown on the left by Gauthier97 training results, right by Gauthier98. N = 8 in each training group. As shown, there was a significant training effect for Greebles in the Gauthier97, but not in the Gauthier98, paradigm. * means p < 0.05, and ** means p < 0.01, the 95% confidence interval.

In contrast, as shown in Figure 6 right, other than the similarly significant “Faces vs. Objects” main effects both before [F_(1,110) = 26.06, p < 0.01] and after training [F_(1,110) = 21.12, p < 0.01], Gauthier98 results showed no “Greebles vs. Objects” main effects across training (both p > 0.05). Because the current training regimes strictly adhered to those in both Gauthier et al. (1999) and Brants et al. (2011), with the only exception the choice of asymmetric Greebles, these results were more in line with the perceptual expertise account.

Neural inversion effect in the FFA

The behavioral performance of all participants in the sequential matching task during the three fMRI sessions was shown in Figure 7. Both groups demonstrated improvements after five sessions of training [before vs. after training: Gauthier97: F_(1,78) = 7.1324, p < 0.01; Gauthier98: F_(1,78) = 16.68405, p < 0.01], different from the results reported by Brants et al. (2011).

FIGURE 7

Figure 7. Mean correct rates of the identity-matching task, either upright or inverted faces or Greebles, across three fMRI scan sessions. Error bars represent standard errors. ** means p < 0.01, the 95% confidence interval.

The mean Percent Signal Change (PSC) difference between the upright and the inverted faces, or Greebles, was defined as the neural inversion effect, or NIE. For faces, no significant NIEs were observed in either Gauthier97 or 98 (Figures 8A, C), across all three scan sessions (before, during, and after training). Out of the six possible face FIEs (three for each training group), only one (the during training of Gauthier97) was significant [F_(1,126) = 7.83, p = 0.0059]. As for Greebles, none of the six NIEs was significant (see Figures 8B, D). These results sharply contrast with Brants et al. (2011), who reported significant FIEs for faces and Greebles, both before and after training (four out of four). In other words, while the present study only found one significant NIE out of 12 possible tests (8.3%), Brants et al. got four out of four (or 100%) significant NIEs!

FIGURE 8

Figure 8. (A) FFA activities (percent signal change, or PSC) to the upright vs. inverted faces; (B) the same FFA neural inversion effect (or NIE) for Greebles; both (A, B) were from Gauthier97 paradigm; (C) FFA NIE for faces; (D) for Greebles; (C, D) were from Gauthier98 paradigm; (E, F) the summed t-values of NIE. T-values were summed and averaged across all FFA voxels corresponding to the upright vs. inverted stimulus version (E) for Gauther97, and (F) for Gauthier98. For all graphs, error bars denote standard errors. * means p < 0.05, the 95% confidence interval.

Although the PSC is now more pervasively used in the fMRI papers, the summed t-values, originally reported in Gauthier et al. (1999), might be another possible index to verify NIE. This method sums the t-values from each participant's FFA voxels and then averages across upright minus inverted conditions (and across subjects). As each participant's FFA contained different numbers of voxels, our summed t-values for faces and Greebles, again across three sessions, showed no significant interactions (though with trends of increase for Greebles NIE, in Gauthier97; see Figures 8E, F).

Meta-analysis of NIE

One untested assumption in supporting the NIE as the index of the “face specificity of FFA” account is its reliability: To be a strong indicator of face-related processing, NIE has to be consistently identified in relevant literature whenever upright and inverted faces are compared. To find out, we performed a search with the keywords “FFA” and “neural inversion effect,” which yielded 18 fMRI articles matching the criteria of “fMRI studies containing FFA NIE results.” As shown in Table 1, seven articles identified positively significant NIEs, two articles found negatively significant NIEs (i.e., FFA activities for inverted faces higher than those for upright faces), and the remaining 10 studies (nine plus the present study) showed no significant NIE for faces at the FFA.

A funnel plot of these 19 studies (14 with reported stats) is also shown in Figure 9. Five studies reporting non-significant results were given estimated t-values and 95% confidence intervals. With these results, one can see that the NIEs in the FFA did suffer certain degrees of publication bias: the slightly more no difference (Grimaldi et al., 2016) than positively (Kanwisher, 2000) and negatively (Sugita, 2008) significant NIE in the FFA in the literature, seriously undermining its reliability assumption that if NIE is a face-related index, most if not all, published studies should yield unanimously positive and significant NIE in the FFA for faces.

FIGURE 9

Figure 9. Funnel plot of reported t-statistics from Table 1. The positively significant NIEs were colored in green (seven studies), no significant NIEs in yellow (reported; five studies) and orange (estimated from confidence intervals; also five studies), and negative NIEs in blue (two studies). As shown, the almost 1:1 (10: 9) ratio between significant (including both positive and negative NIEs) and not-significant (including both reported, light-yellowed dots and unreported/estimated, dark-yellowed dots, NIEs) studies support not just the existence of publication bias (i.e., only significant t-values were reported), but also the unreliability of NIE as the index of face selectivity (~ = majority of studies with NIE in the FFA should be positively significant).

Finally, the last issue with Brants et al.'s reasoning is that when faces show reliable NIE (which is not the case, as it turns out), finding NIE (as they demonstrated in Brants et al., 2011) would reversely imply that the stimulus should be faces (or face-like). Such reasoning is exactly reverse inference (Poldrack, 2006), a situation that neuroscientists should try to circumvent. The Bayes theorem could estimate the probability of p(faces | NIE) = p(faces)^*p(NIE | faces)/p(NIE). With the help of the neurosynth database, which contains = 896 (papers with the term “faces”)/14,371 (total number of published papers in the neurosynth database, as of Jul 10, 2023) ^* 7/20 (the probability that was acquired by the meta-analysis below)/1/3 (NIE could be either positive, no difference, or negative) = 0.06, or around 1 in 20 studies. These results once again suggest the fallibility of equating NIE as face(like) processing.

Neural adaptation effect in the FFA

Another index of successful training would be to see whether the beholders' FFA responses for faces, after becoming Greeble experts, will become weaker (aka. adapted) following the presentation of Greebles, compared to the control condition of the before-training session (where Greebles are not face-like to FFA yet) and also compared to the inappropriate training condition (e.g., Gauthier97 vs. Gauthier98). Other than the two abovementioned controlled comparisons, there is one more specific prediction: the neural adaptation would only happen to the faces in the “Faces after Greebles” condition (not the other way around, i.e., “Greebles after Faces,” because Greebles would always be adapted following faces presented to the FFA, which is constantly face-selective), compared to the controlled “Faces after Objects” condition. Figure 10 shows t-test comparison results for both the before vs. after training sessions and the Gauthier97 and Gauthier98 regimes.

FIGURE 10

Figure 10. Neural adaptation effect in the FFA. The comparisons of percent signal change for “Faces after Greebles” vs. “Faces after Objects” in both Gauthier97 (N = 8) vs. Gauthier98 (N = 8) training regime were taken as the evidence for the support of the perceptual expertise hypothesis of FFA. Error bars represent standard errors. * means p < 0.05, the 95% confidence interval.

For the Gauthier97 group, there was a significant difference in the “Faces after Greebles” condition [t₍₇₎ = 2.01, p = 0.04], in line with the prediction that training-induced Greeble expertise recruiting face-like responses in the FFA, thereby making the subsequent faces “adapted.” In contrast, the “Faces after Objects,” plus neither “Faces after Greebles” nor “Faces after Objects” comparisons in the Gauthier98 group, showed any adaptation effects in the FFA. Together, these four comparison results strongly agree with the predictions made by the perceptual expertise account of the FFA.

fMRI results: MVPA classification performances

In light of the growing prevalence of multi-voxel pattern analyses, we also compared the category classification performances between Gauthier97 and Gauthier98, and between before vs. after training, using both ROI (e.g., FFA) and the whole-brain searchlight analysis fashion. The reason why multi-voxel pattern analyses were reported here was that in addition to the planned univariate analyses between upright and inverted faces and Greebles (and between training regimes), multivariate fMRI analyses have also received popularity and provided additional insights into the neural representation under investigation. For the current study, it may be of interest to check whether the “Greebles vs. Objects” contrast has become easier to classify (and “Faces vs. Greebles” harder to classify) after training, since Greeble training has increased FFA's sensitivity to Greebles, making Greebles more face-like (or less object-like), and probably also only in the more appropriate Gauthier97 training regime.

As shown in Table 2, the results of accuracy, recall, precision, F1-score and area under the ROC curve are in general agreement. Therefore, the recall/precision/F1-score results were aggregated for paired t-comparisons. The results show that there is a significant difference in recall (t₍₇₎ = −2.758, p = 0.028; Cohen's d = 0.98) and F1-score (t₍₇₎ = −2.749, p = 0.029; Cohen's d = 0.97) for Gauthier97 group before vs. after training, and the precision with marginal significance (t₍₇₎ = −2.086, p = 0.075). The increase in recall in the Gauthier97 group indicates that as individuals become Greeble experts, their FFA sensitivity to Greeble (e.g., true positives) has increased. Additionally, since the F1 score is related to both recall and precision, the change in F1 differences signifies that with individual Greeble training, the FFA performs better in classifying faces and Greebles.

TABLE 2

Table 2. ROI MVPA analysis for 2 (Gautiher97 vs. Gauthier98) ^*2 (before_ vs. after training) ^*2 (“Greeble vs. face” and “Greeble vs. object”) conditions (n = 8 in each condition).

In line with our univariate analysis results, the increased Greeble sensitivity (indexed by the recall score) and increased Greeble-Face separability (by the F1-score), only in the Gauthier97 paradigm, once again partially support the clear modulation of appropriate training on subjects' representations. By partially, we mean that the evidence was not indexed in classification accuracy, which may be due to the variability inherent in various dependent measures. It may also be that the choice of to-be-compared stimuli was relevant since both studies showed higher classification accuracies in chess experts' FFA on chess boards (Bilalić et al., 2011), and for radiologists' FFA on chess X-rayed pictures (Bilalić et al., 2014), the control stimuli were all scene pictures. Future analysis could compare the effects of training between Greebles to various control stimuli (i.e., the scene condition in our Peelen localizer scan). In addition, the increased F1-score may also be compatible with the interaction between faces and Greebles in the FFA, reflected by the adaptation effect of FFA for faces after Greebles (see Figure 10) after training. Taken together, we can summarize that the naive “before training, Greebles are object-like; after training, Greebles are face-like” hypothesis may not be directly applicable to multivariate analysis, as evidenced by our current MVPA analysis results of increased face vs. Greeble sensitivity.

Discussion

The present study revisited two assumptions behind Brants et al. (2011) JOCN article, one of the seminal studies in the prolonged discussions for vs. against the perceptual expertise of FFA and one widely known debate in modern cognitive neuroscience. As one of the oft-cited stimuli in the field of perception neuroscience, Greebles was also seen in many textbooks of vision research (Wolfe, 2009; Ward, 2015). The claim that the training-induced FFA activity increases were primarily due to Greebles' face likeness, instead of the original putative expertise training, indeed raises concern about the premises behind these claims: that (1) Brants et al. (2011) should have also replicated the Greeble training effects, at least behaviorally; and (2) that the neural inversion effect, or NIE at the FFA, should be a reliable index of human face processing. But as the current study has demonstrated, the first premise was unsubstantiated that by differentiating the Greeble training paradigm as the Gauthier97 vs. Gauthier98 version, the session-wise RT of the verification task between the two paradigms (Figure 3) was drastically different, suggesting that Brants et al. (2011) adopted the Gauthier98 paradigm (by their highly similar RT patterns). In contrast, the original Greeble training study (i.e., Gauthier et al., 1999) adopted the Gauthier97 version. Further individual RT analysis in Gauthier98 (e.g., Figure 5) showed that the large variability among participants was the main reason behind the inability to converge between RTs for trained and untrained Greebles, the main difference between the original Gauthier98 training results and those of the current study.

In addition, the current study further provided three pieces of the fMRI evidence that are consistent with the experience modulation account of the FFA: (a) FFA activities to Greebles after training were significantly larger than that of the before training condition, only under the appropriate (i.e., Gauthier97) training condition; (b) the lack of NIE effect in the FFA, the second premise for the Greebles are face-like hypothesis, plus the inconsistency of NIE effects at FFA in the extant literature (seven positives, two negatives, and 10 no activation differences between upright and inverted faces) further wrecked the assumption of the Brants et al. (2011) claim (that NIE in the FFA is the reliable index of human face processing); and (c) most importantly, the FFA adaptation effect, specifically for faces after Greebles, but not for faces after objects, and also not vice versa (i.e., Greebles after faces); under the appropriate (i.e., Gauthier97) regime, and only after (not before) training. Altogether, this combined evidence, including the clear RT difference by training paradigm and the close similarity between that of Gauthier98 (Figure 3D) and Brants et al. (2011) on the behavioral side and the three fMRI univariate results (FFA training effect, no clear NIE in the FFA, and FFA adaptation effect) on the neuronal side, is jointly in favor of the perceptual experience account and incongruent with the Greebles-like-face account, put forward by the Brants et al. (2011) study.

The results of ROI MVPA and searchlight analysis on various combinations of categories, training sessions, and regimes, as revealed in both Figure 11 and https://neurovault.org/collections/13893/, were not in line with those by univariate analyses, nor by either account. Compared with the extant evidence supporting the pattern difference between objects of expertise (e.g., higher classification accuracy for chess boards in chess experts), the control stimuli were scene pictures (Bilalić, 2016), whereas, in the current study, the control stimuli were everyday objects. Future studies will be needed to evaluate better the effect of control conditions in affecting MVPA outcomes. In addition, while there are studies that combined both multivariate and univariate analyses (Bilalić, 2016), other studies have highlighted the importance of “targeted tests of the informational content and/or dimensionality of activation patterns,” as they are “critical for drawing strong conclusions about the representational codes that are indicated by significant MVPA results” (Farah, 2000). In the current study, the FFA activation differences, as shown in Figures 6, 10, seem to be consistent with the MVPA searchlight results in the 11th neurovault map while also cannot exclude the before training variabilities among trainees in either group (e.g., the 13th and 14th neurovault maps). In short, while univariate analysis results are more supportive of the experience modulation account than those provided by multivariate analyses, it may not be easily attributable to any or a few dimensions that are (or are not) discoverable or impacted by Greeble training. Future analysis should empirically evaluate more testable hypotheses explaining the different results yielded by univariate and multivariate analyses.

FIGURE 11

Figure 11. (A) Proposed task flow of the verification task in the Gauthier97 training paradigm. (B, C) The same flowchart for the Gauthier98 training paradigm, for named and unnamed trials, respectively. The proposed steps in each condition (representing the putative stages of processing) proportionally correspond to the average response times at the end of the training: Gauthier97: 500 ms, Gauthier98 named trials: 800, and unnamed trials: 1,200 ms. These differences may explain why Gauthier97 is the relatively appropriate paradigm for Greeble training effects.

The RT differences near the end of training between the Gauthier97 and Gauthier98 paradigms were ~500 and 1,000 ms, respectively. As depicted in Figure 11, participants in the Gauthier98 training may have undergone more processing complications, which rendered the decision times of judging whether the image matched the label proportionally longer. The more uncertainty during the decision processes (or more proposed steps), the longer (800 and 1,200 ms for the named and unnamed Greeble versions in Gauthier98) the average reaction times at the end of training. These putative processes not only help explain the observed reaction time (RT) differences among different conditions but also underlie why Gauthier97 may be a relatively appropriate paradigm for the fMRI effect of Greeble training.

At the outset of this Greeble training study, the authors struggled with the choice of the symmetry (e.g., symmetrical or asymmetric) of Greeble stimuli versions. Our final choice of using asymmetric Greebles was based on the following concern: the observed effects of FFA activities on Greebles, faces, or objects, if found as predicted, might still be attributed to the adage of “Greebles look like faces” (Farah, 2000), even though Greebles are the same face-like to both experts and novices (but FFA only respond significantly to the former after expertise acquisition). If a set of totally non-face-like stimuli, such as the symmetric Greebles of the present study, could be trained to drive FFA significantly after extensive training to enable automatic processing at the subordinate level, then the face likeness may be less of an issue. Despite this, it may still be argued that the current study has not replicated the results of Gauthier et al. (1999) and Brants et al. (2011) because of the different Greeble versions. Our replies are as follows: (a) our Gauthier98 behavioral results were similarly comparable to those in Brants et al. (2011), providing evidence that Brants et al. may have adopted this sub-optimal training paradigm, different from those in Gauthier et al. (1999) and the Gauthier97 of the current study. The similar behavioral response patterns [c.f. Figures 3A, C to figure 3 of Gauthier97, as well as the Figures 3B, D to figure 4 of Gauthier98, and Brants et al. (2011), Figure 3], the dissimilarity among behavioral results from each paradigm, plus that the lack of literature support for the effect of stimulus asymmetry in short-term training provide converging evidence against the existence of stimulus symmetry in affecting short-term training; (b) in terms of featural selectivity, FFA has been shown to prefer stimulus symmetry (Caldara et al., 2006), properties strongly associated with faces (Caldara and Seghier, 2009), but the effect (or trainability) of symmetry is yet to be quantified, thereby the effect of stimulus symmetry onto the training effect awaits further assessment; and (c) currently there have been four studies comparing the training effects of both symmetric (Gauthier et al., 1999; Brants et al., 2011) and asymmetric Greebles (Kung et al., 2007 and the current study), while the effect of task (e.g., 1-back identity vs. passive viewing) has been shown to affect the observed training effect in the FFA (i.e., the 1-back task, due to its increased task demand, rendered the before-training FFA activities for Greebles relatively higher, and therefore obscured the later space of increase for Greebles activities with training). Whether such a cross-symmetry comparison could be made is still one of the potential research questions for future study.

The current study, though targeted at one specific article (Brants et al., 2011), was among the literature discussing the functional role of FFA, including three well-known positions (throughout 1997–2010): face specificity, perceptual expertise, and face network approach. Indeed, the first two positions have been exchanged for decades about their relative merits, while the 3rd position, the face network approach, also received extensive support but with less heated debates. For example, the famous Haxby's (Haxby et al., 2001) study initiated the 1st definition of multi-voxel pattern analysis (MVPA), namely the correlations between patterns of multiple voxels within the chosen ROI(s), functionally or anatomically defined. This approach, now contained in packages like CoSMoMVPA, sidelines reviewer 1's point that in the early stage of the fMRI development, the univariate GLM contrasts and then the subject-specific ROIs do have limitations (e.g., univariate only) and biases (e.g., Friston's critique of functional localizers, Friston et al., 2006), but it was precisely these methodologies that made the heated exchanges between the face specificity and perceptual expertise positions possible—without such procedural adherence, the comparison would be between apples (Kanwisher vs. Gauthier/Tarr, etc.) and oranges (Haxby) because they are not comparable on the same methodological background. On the one hand, we (the authors) would always love, and are kept amazed by, the methodological innovations that shed new light on the fMRI field: MVPA, hyperalignment (Haxby et al., 2020), and DNN/CNN (O'Toole and Castillo, 2021), to name just a few recent breakthroughs. These innovations hardly share a common focus. Rather, they diverge to enlighten us with the multifaceted complexities of the human brain. On the other hand, the previous exchanges between perceptual expertise and face specificity, although fruitless to some, sharpen researchers' minds on how the purported analysis methods (e.g., ROI overlap comparison, NIE, ROI-behavior correlations), with their background assumption checks, fare on any given proposal. These latter practices still have philosophical and reasoning merits, despite their 20 years of age (being “90s”). Both are quintessential learning ingredients for cognitive neuroscientists.

Concerning future research opportunities, one of the possible extensions could be to incorporate the lasting effects of training into the design. One general observation for Greeble experts, soon after their last after-training fMRI scan, was how soon they quickly forgot the associated Greeble shapes and names (though there were exceptions). Future training experiments, if adopted, could further investigate the extinction effects and how soon they could be quickly recovered (and their associated brain mechanisms). Additional fMRI analysis methods, such as functional connectivities (O'Reilly et al., 2012), representational similarity analysis (Kriegeskorte et al., 2008), or even deep neural network approaches, could all be viable options. Finally, the effect of FFA adaptation, except its usefulness as a companion index of expertise acquisition, could also be tested in natural experts (such as bird or car experts), further extending its reliability and validity as an alternate, or even the primary, index of perceptual experience.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.

Ethics statement

The studies involving humans were approved by NCKU Research Ethics Committee. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

C-CK conceived the experiment. C-YC developed the code. KL carried out the experiments. KL, C-YC, HJ, and L-SW analyzed the data. KL, HJ, and L-SW prepared the figures. KL, L-SW, and C-CK discussed, revised, and completed the draft. All authors reviewed and approved the final manuscript.

Funding

This study was financially sponsored by the Ministry of Science and Technology of Taiwan (MOST-104-2410-H-006-022 for C-CK), the Humanistic-Social Benchmarking Project from the Ministry Of Education to the NCKU School of Social Sciences, and the Higher Education Sprout Project, Ministry of Education to the Headquarters of University Advancement at NCKU.

Acknowledgments

The authors would like to thank the NCKU MRI Center for the support and consultation provided; and Tzu-Hsuan Wen for her assistance in uploading the codes, results, and statistical maps onto https://osf.io/5tm9v/.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2023.1224721/full#supplementary-material

References

Aguirre, G. K., Singh, R., and D'Esposito, M. (1999). Stimulus inversion and the responses of face and object-sensitive cortical areas. NeuroReport 10, 189. doi: 10.1097/00001756-199901180-00036

PubMed Abstract | CrossRef Full Text | Google Scholar

Arcaro, M. J., Schade, P. F., Vincent, J. L., Ponce, C. R., and Livingstone, M. S. (2017). Seeing faces is necessary for face-domain formation. Nat. Neurosci. 20, 1404–1412. doi: 10.1038/nn.4635

CrossRef Full Text | Google Scholar

Bilalić, M. (2016). Revisiting the role of the fusiform face area in expertise. J. Cogn. Neurosci. 28, 1345–1357. doi: 10.1162/jocn_a_00974

PubMed Abstract | CrossRef Full Text | Google Scholar

Bilalić, M., Grottenthaler, T., Nägele, T., and Lindig, T. (2014). The faces in radiological images: fusiform face area supports radiological expertise. Cereb. Cortex 26, 1004–1014. doi: 10.1093/cercor/bhu272

PubMed Abstract | CrossRef Full Text | Google Scholar

Bilalić, M., Langner, R., Ulrich, R., and Grodd, W. (2011). many faces of expertise: fusiform face area in chess experts and novices. J. Neurosci. 31, 10206–10214. doi: 10.1523/JNEUROSCI.5727-10.2011

PubMed Abstract | CrossRef Full Text | Google Scholar

Bookheimer, S. Y., Wang, A. T., Scott, A., Sigman, M., and Dapretto, M. (2008). Frontal contributions to face processing differences in autism: evidence from fMRI of inverted face processing. J. Int. Neuropsychol. Soc. 14, 922–932. doi: 10.1017/S135561770808140X

PubMed Abstract | CrossRef Full Text | Google Scholar

Brants, M., Wagemans, J., and Op de Beeck, H. P. (2011). Activation of fusiform face area by Greebles is related to face similarity but not expertise. J. Cogn. Neurosci. 23, 3949–3958. doi: 10.1162/jocn_a_00072

PubMed Abstract | CrossRef Full Text | Google Scholar

Bukach, C. M., Gauthier, I., and Tarr, M. J. (2006). Beyond faces and modularity: the power of an expertise framework. Trends Cogn. Sci. 10, 159–166. doi: 10.1016/j.tics.2006.02.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Caldara, R., and Seghier, M. L. (2009). The Fusiform Face Area responds automatically to statistical regularities optimal for face categorization. Hum. Brain Mapp. 30, 1615–1625. doi: 10.1002/hbm.20626

PubMed Abstract | CrossRef Full Text | Google Scholar

Caldara, R., Seghier, M. L., Rossion, B., Lazeyras, F., Michel, C., Hauert, C.-A., et al. (2006). The fusiform face area is tuned for curvilinear patterns with more high-contrasted elements in the upper part. NeuroImage 31, 313–319. doi: 10.1016/j.neuroimage.2005.12.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Chou, K. P., Prasad, M., Yang, J., Su, S.-Y., Tao, X., Saxena, A., et al. (2021). A robust real-time facial alignment system with facial landmarks detection and rectification for multimedia applications. Multimed. Tools Appl. 80, 16635–16657. doi: 10.1007/s11042-020-09216-7

CrossRef Full Text | Google Scholar

Epstein, R. A., Higgins, J. S., Parker, W., Aguirre, G. K., and Cooperman, S. (2006). Cortical correlates of face and scene inversion: a comparison. Neuropsychologia 44, 1145–1158. doi: 10.1016/j.neuropsychologia.2005.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Farah, M. J. (2000). Interview with Martha Farah. J. Cogn. Neurosci. 12, 360–363. doi: 10.1162/089892900562057

PubMed Abstract | CrossRef Full Text | Google Scholar

Friston, K. J., Rotshtein, P., Geng, J. J., Sterzer, P., and Henson, R. N. A. (2006). critique of functional localisers. Neuroimage 30, 1077–1087. doi: 10.1016/j.neuroimage.2005.08.012

CrossRef Full Text | Google Scholar

Gauthier, I., Anderson, A. W., Tarr, M. J., Skudlarski, P., and Gore, J. C. (1997). Levels of categorization in visual recognition studied using functional magnetic resonance imaging. Curr Biol. 7, 645–651. doi: 10.1016/S0960-9822(06)00291-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., and Bukach, C. (2007). Should we reject the expertise hypothesis? Cognition 103, 322–330. doi: 10.1016/j.cognition.2006.05.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., Skudlarski, P., Gore, J. C., and Anderson, A. W. (2000a). Expertise for cars and birds recruits brain areas involved in face recognition. Nat. Neurosci. 3, 191–197. doi: 10.1038/72140

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., and Tarr, M. J. (1997). Becoming a “Greeble” expert: exploring mechanisms for face recognition. Vis. Res. 37, 1673–1682. doi: 10.1016/S0042-6989(96)00286-6

CrossRef Full Text | Google Scholar

Gauthier, I., and Tarr, M. J. (2002). Unraveling mechanisms for expert object recognition: bridging brain activity and behavior. J. Exp. Psychol. Hum. Percept. Perform. 28, 431–446. doi: 10.1037/0096-1523.28.2.431

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., Tarr, M. J., Anderson, A. W., Skudlarski, P., and Gore, J. C. (1999). Activation of the middle fusiform 'face area' increases with expertise in recognizing novel objects. Nat. Neurosci. 2, 568–573. doi: 10.1038/9224

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., Tarr, M. J., Moylan, J., Skudlarski, P., Gore, J. C., Anderson, A. W., et al. (2000b). The fusiform “face area” is part of a network that processes faces at the individual level. J. Cogn. Neurosci. 12, 495–504. doi: 10.1162/089892900562165

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, I., Williams, P., Tarr, M. J., and Tanaka, J. (1998). Training ‘greeble' experts: a framework for studying expert object recognition processes. Vis. Res. 38, 2401–2428. doi: 10.1016/S0042-6989(97)00442-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Grill-Spector, K., Knouf, N., and Kanwisher, N. (2004). The fusiform face area subserves face perception, not generic within-category identification. Nat. Neurosci. 7, 555–562. doi: 10.1038/nn1224

PubMed Abstract | CrossRef Full Text | Google Scholar

Grill-Spector, K., Kushnir, T., Edelman, S., Itzchak, Y., and Malach, R. (1998). Cue-invariant activation in object-related areas of the human occipital lobe. Neuron. 21, 191–202. doi: 10.1016/S0896-6273(00)80526-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Grill-Spector, K., and Malach, R. (2001). fMR-adaptation: a tool for studying the functional properties of human cortical neurons. Acta Psychol. 107, 293–321. doi: 10.1016/S0001-6918(01)00019-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Grimaldi, P., Saleem, K. S., and Tsao, D. (2016). Anatomical connections of the functionally-defined “face patches” in the macaque monkey. Neuron 90, 1325–1342. doi: 10.1016/j.neuron.2016.05.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Grotheer, M., and Kovács, G. (2014). Repetition probability effects depend on prior experiences. J. Neurosci. 34, 6640–6646. doi: 10.1523/JNEUROSCI.5326-13.2014

PubMed Abstract | CrossRef Full Text | Google Scholar

Haxby, J. V., Gobbini, M. I., Furey, M. L., Ishai, A., Schouten, J. L., Pietrini, P., et al. (2001). Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science 293, 2425–2430. doi: 10.1126/science.1063736

PubMed Abstract | CrossRef Full Text | Google Scholar

Haxby, J. V., Guntupalli, J. S., Nastase, S. A., and Feilong, M. (2020). Hyperalignment: modeling shared information encoded in idiosyncratic cortical topographies. eLife 9, e56601. doi: 10.7554/eLife.56601

PubMed Abstract | CrossRef Full Text | Google Scholar

Haxby, J. V., Ungerleider, L. G., Clark, V. P., Schouten, J. L., Hoffman, E. A., Martin, A., et al. (1999). The effect of face inversion on activity in human neural systems for face and object perception. Neuron 22, 189–199. doi: 10.1016/S0896-6273(00)80690-X

PubMed Abstract | CrossRef Full Text | Google Scholar

Haxby, J. V., Ungerleider, L. G., Horwitz, B., Maisog, J. M., Rapoport, S. I., Grady, C. L., et al. (1996). Face encoding and recognition in the human brain. Proc. Natl. Acad. Sci. U.S.A. 93, 922–927. doi: 10.1073/pnas.93.2.922

PubMed Abstract | CrossRef Full Text | Google Scholar

James, T. W., Arcurio, L. R., and Gold, J. M. (2013). Inversion effects in face-selective cortex with combinations of face parts. J. Cogn. Neurosci. 25, 455–464. doi: 10.1162/jocn_a_00312

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanwisher, N. (2000). Domain specificity in face perception. Nat. Neurosci. 3, 759–763. doi: 10.1038/77664

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanwisher, N. (2017). The quest for the FFA and where it led. J. Neurosci. 37, 1056–1061. doi: 10.1523/JNEUROSCI.1706-16.2016

CrossRef Full Text | Google Scholar

Kanwisher, N., McDermott, J., and Chun, M. M. (1997). The fusiform face area: a module in human extrastriate cortex specialized for face perception. J. Neurosci. 17, 4302–1431. doi: 10.1523/JNEUROSCI.17-11-04302.1997

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanwisher, N., Tong, F., and Nakayama, K. (1998). The effect of face inversion on the human fusiform face area. Cognition 68, B1–B11. doi: 10.1016/S0010-0277(98)00035-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Kriegeskorte, N., Mur, M., and Bandettini, P. (2008). Representational similarity analysis - connecting the branches of systems neuroscience. Front. Syst. Neurosci. 2, 4. doi: 10.3389/neuro.06.004.2008

PubMed Abstract | CrossRef Full Text | Google Scholar

Kung, C.-C., Peissig, J. J., and Tarr, M. J. (2007). Is region-of-interest overlap comparison a reliable measure of category specificity? J. Cogn. Neurosci. 19, 2019–2034. doi: 10.1162/jocn.2007.19.12.2019

PubMed Abstract | CrossRef Full Text | Google Scholar

Matsuyoshi, D., Morita, T., Kochiyama, T., Tanabe, H. C., Sadato, N., Kakigi, R., et al. (2015). Dissociable cortical pathways for qualitative and quantitative mechanisms in the face inversion effect. J. Neurosci. 35, 4268–4279. doi: 10.1523/JNEUROSCI.3960-14.2015

PubMed Abstract | CrossRef Full Text | Google Scholar

McKone, E., and Kanwisher, N. (2005). Does the Human Brain Process Objects of Expertise Like Faces? A Review of the Evidence. Cambridge, MA: MIT Press. doi: 10.7551/mitpress/3136.003.0024

CrossRef Full Text | Google Scholar

McKone, E., Kanwisher, N., and Duchaine, B. C. (2007). Can generic expertise explain special processing for faces? Trends Cogn. Sci. 11, 8–15. doi: 10.1016/j.tics.2006.11.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Oosterhof N. N. Connolly A. C. Haxby J. V. CoSMoMVPA: multi-modal multivariate pattern analysis of neuroimaging data in Matlab/GNU Octave. Front Neuroinform. (2016) 10, 27. doi: 10.3389/fninf.2016.00027

CrossRef Full Text | Google Scholar

Op de Beeck, H. P., and Baker, C. I. (2010a). The neural basis of visual object learning. Trends Cogn. Sci. 14, 22–30. doi: 10.1016/j.tics.2009.11.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Op de Beeck, H. P., and Baker, C. I. (2010b). Informativeness and learning: response to Gauthier and colleagues. Trends Cogn. Sci. 14, 236–237. doi: 10.1016/j.tics.2010.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

O'Reilly, J. X., Woolrich, M. W., Behrens, T. E. J., Smith, S. M., and Johansen-Berg, H. (2012). Tools of the trade: psychophysiological interactions and functional connectivity. Soc. Cogn. Affect. Neurosci. 7, 604–609. doi: 10.1093/scan/nss055

PubMed Abstract | CrossRef Full Text | Google Scholar

O'Toole, A. J., and Castillo, C. D. (2021). Face recognition by humans and machines: three fundamental advances from deep learning. Annu. Rev. Vis. Sci. 7, 543–570. doi: 10.1146/annurev-vision-093019-111701

PubMed Abstract | CrossRef Full Text | Google Scholar

Passarotti, A. M., Smith, J., DeLano, M., and Huang, J. (2007). Developmental differences in the neural bases of the face inversion effect show progressive tuning of face-selective regions to the upright orientation. NeuroImage 34, 1708–1722. doi: 10.1016/j.neuroimage.2006.07.045

PubMed Abstract | CrossRef Full Text | Google Scholar

Peelen, M. V., and Downing, P. E. (2005). Selectivity for the human body in the fusiform gyrus. J. Neurophysiol. 93, 603–608. doi: 10.1152/jn.00513.2004

PubMed Abstract | CrossRef Full Text | Google Scholar

Poldrack, R. A. (2006). Can cognitive processes be inferred from neuroimaging data? Trends Cogn. Sci. 10, 59–63. doi: 10.1016/j.tics.2005.12.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Rhodes, G., Michie, P. T., Hughes, M. E., and Byatt, G. (2009). The fusiform face area and occipital face area show sensitivity to spatial relations in faces. Eur J Neurosci. 30, 721–733. doi: 10.1111/j.1460-9568.2009.06861.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Rivolta, D. (2014). Prosopagnosia: When all Faces Look the Same. Berlin: Springer-Verlag. XII, 95. doi: 10.1007/978-3-642-40784-0

CrossRef Full Text | Google Scholar

Rossion, B., and Gauthier, I. (2002). How does the brain process upright and inverted faces? Behav. Cogn. Neurosci. Rev. 1, 63–75. doi: 10.1177/1534582302001001004

PubMed Abstract | CrossRef Full Text | Google Scholar

Sergent, J., Ohta, S., and MacDonald, B. (1992). Functional neuroanatomy of face and object processing. A positron emission tomography study. Brain 115(Pt 1), 15–36. doi: 10.1093/brain/115.1.15

PubMed Abstract | CrossRef Full Text | Google Scholar

Sternberg, S. (1969). Memory-scanning: mental processes revealed by reaction-time experiments. Am. Sci. 57, 421–457.

PubMed Abstract | Google Scholar

Strother, L., Mathuranath, P. S., Aldcroft, A., Lavell, C., Goodale, M. A., Vilis, T., et al. (2011). Face inversion reduces the persistence of global form and its neural correlates. PLOS ONE 6, e18705. doi: 10.1371/journal.pone.0018705

PubMed Abstract | CrossRef Full Text | Google Scholar

Sugita, Y. (2008). Face perception in monkeys reared with no exposure to faces. PNAS 105, 394–398. doi: 10.1073/pnas.0706079105

PubMed Abstract | CrossRef Full Text | Google Scholar

Tanaka, J., and Gauthier, I. (1997). “Expertise in object and face recognition. perceptual learning,” in The Psychology of Learning And motivation, Vol. 36, eds R. L. Goldstone, D. L. Medin, and P. G. Schyns (San Diego, CA: Academic Press), 83–125. doi: 10.1016/S0079-7421(08)60282-0

CrossRef Full Text | Google Scholar

Tarr, M. J., and Gauthier, I. F. F. A. (2000). a flexible fusiform area for subordinate-level visual processing automatized by expertise. Nat. Neurosci. 3, 764–769. doi: 10.1038/77666

PubMed Abstract | CrossRef Full Text | Google Scholar

Valentine, T. (1988). Upside-down faces: a review of the effect of inversion upon face recognition. Br. J. Psychol. 79, 471–491. doi: 10.1111/j.2044-8295.1988.tb02747.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Ward, J. (2015). The Student's Guide to Cognitive Neuroscience. London: Psychology Press, 549. doi: 10.4324/9781315742397

CrossRef Full Text | Google Scholar

Wolfe, J. M. (2009). Sensation and Perception. Sunderland, MA: Sinauer Associates Incorporated, 460.

Google Scholar

Xu, Y. (2005). Revisiting the role of the fusiform face area in visual expertise. Cereb. Cortex 15, 1234–1242. doi: 10.1093/cercor/bhi006

PubMed Abstract | CrossRef Full Text | Google Scholar

Yovel, G., and Kanwisher, N. (2004). Face perception: domain specific, not process specific. Neuron 44, 889–898. doi: 10.1016/S0896-6273(04)00728-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Yovel, G., and Kanwisher, N. (2005). The neural basis of the behavioral face-inversion effect. Curr Biol. 15, 2256–2262. doi: 10.1016/j.cub.2005.10.072

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: fusiform face area (FFA), Greeble training, ROI analysis, adaptation effect, neural inversion effect (NIE), multi-voxel pattern analysis (MVPA)

Citation: Liu K, Chen C-Y, Wang L-S, Jo H and Kung C-C (2023) Is increased activation in the fusiform face area to Greebles a result of appropriate expertise training or caused by Greebles' face likeness? Front. Neurosci. 17:1224721. doi: 10.3389/fnins.2023.1224721

Received: 19 May 2023; Accepted: 20 September 2023;
Published: 17 October 2023.

Edited by:

Ernest Greene, University of Southern California, United States

Reviewed by:

Stephen José Hanson, Rutgers, The State University of New Jersey, United States
Giulio Contemori, University of Padua, Italy

Copyright © 2023 Liu, Chen, Wang, Jo and Kung. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chun-Chia Kung, Y2NrdW5nQGt1bmdsYWItbmNrdXBzeS5vcmc=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.