Gender Biases in the Accuracy of Facial Judgments: Facial Attractiveness and Perceived Socioeconomic Status

Qi, Yue; Ying, Jia

doi:10.3389/fpsyg.2022.884888

ORIGINAL RESEARCH article

Front. Psychol., 31 May 2022

Sec. Cognition

Volume 13 - 2022 | https://doi.org/10.3389/fpsyg.2022.884888

This article is part of the Research Topic: From Facial Attractiveness Toward A Broader Aesthetics Perception

Yue Qi¹

Jia Ying²*

¹Department of Psychology, Renmin University of China, Beijing, China
²Graduate School of Education, University of Pennsylvania, Philadelphia, PA, United States

Many studies demonstrate that people form their first impression of a stranger based on facial appearance, and these impressions influence their subsequent decisions and behaviors. However, much less research has examined the factors that moderate the accuracy of first impressions based on a photo of a face. The present study included three experiments to explore gender differences in the accuracy of impressions based on faces. The results showed that people judge facial attractiveness more accurately for female faces than for male faces, whereas they judge wealth more accurately for male faces than for female faces. Interestingly, although we did not find a significant correlation between confidence ratings and the accuracy of wealth ratings, we found a significant moderate correlation between confidence ratings and the accuracy of attractiveness ratings when female participants rated male faces. To our knowledge, the present study is the first to reveal gender biases in the accuracy of impression judgments based on facial appearance. These findings imply a significant influence of traditional gender roles on accurate facial judgments.

Introduction

When interacting with a stranger, people may form their first impression based on limited available information (e.g., facial appearance), and these judgments can subsequently and indirectly influence social decision making (Qi et al., 2018, 2021; Li et al., 2021). Many studies demonstrate that facial attractiveness has an impact on various social decisions, such as friendship and mating choices (Thornhill and Gangestad, 1999), monetary decision-making (Pandey and Zayas, 2021), and hiring (Luxen and Van De Vijver, 2006). People judge facial attractiveness based on common aesthetic or affective attributes of different genders (Rhodes, 2006). According to the owner hypothesis, facial attractiveness is a stable characteristic of the face’s owner (Chen et al., 1997; Little and Perrett, 2002). Researchers have explored some facial features that affect facial attractiveness judgments, such as averageness (Komori et al., 2009), symmetry (Baudouin and Tiberghien, 2004), sexual dimorphism (Perrett et al., 1998; Russell, 2003), and vitality (Zheng and Zhou, 2021). The observer hypothesis argues for the importance of the beholder in facial attractiveness perception and emphasizes the characteristics of the observer, such as the observer’s age (Little et al., 2010), personality (Welling et al., 2009), and sociocultural factors (Little et al., 2002). Attractiveness can be a sign of health, and highly attractive faces can induce positive and pleasant emotional experiences (Rhodes, 2006; Zhang et al., 2021), which are rewarding to individuals (Aharon et al., 2001). Previous studies have found that the reward value of facial attractiveness can be influenced by the gender of the perceiver (Cloutier et al., 2008; Levy et al., 2008).

Facial gender is another factor that influences attractiveness processing. Mitrovic et al. (2018) found that both males and females looked longer at female faces, especially attractive female faces. This is in accordance with the “female beauty captures the mind” hypothesis (Maner et al., 2003). From an evolutionary perspective, males and females emphasize different characteristics of potential mates. Males pay more attention to characteristics related to reproductive potential, such as physical attractiveness, while females pay more attention to characteristics that signal resource acquisition, such as status and dominance (Buunk et al., 2002). Furthermore, attractive female faces capture more behavioral attention (Slater et al., 1998; Maner et al., 2003), bring more rewards (Collins and Missing, 2003; Colwell, 2007; Wang et al., 2015), and elicit greater neural activation (Zhang et al., 2012; Ru et al., 2017). In other words, attractive female faces capture more attention and are more visible than attractive male faces.

To date, many studies have discussed the accuracy of facial judgments (e.g., Todorov et al., 2015; Walker and Vetter, 2016). However, much less research has examined the factors that moderate the accuracy of first impressions when viewing a photo of a face (Alaei and Rule, 2016). Previous studies have investigated self-other agreement on traits in face-to-face contexts and found that extroversion and openness can be accurately judged (e.g., Borkenau et al., 2009; Back et al., 2010; Moritz and Roberts, 2018). However, neuroticism is the least accurately judged trait in online contexts (Gosling et al., 2007; Back et al., 2010). These findings can be explained by the trait visibility effect (Funder and Dobroth, 1987); that is, the more relevant and frequent the behaviors the trait elicits, the more accurate the judgments that are made will be (Watson et al., 2000), because perceivers can acquire more valid cues to judge the trait.

Moreover, previous studies have revealed that the longer people know each other, the more accurately they rate each other’s traits. Compared with strangers who observed behaviors for only a few minutes, acquaintances predicted behavior better and were more consistent with their reports of observed behavior (Biesanz et al., 2007). For example, married couples have higher self-other agreements on most affectivities and personalities than friendship dyads or dating couples do (Watson et al., 2000). Increased acquaintanceship is accompanied by more trait-relevant messages; thus, perceivers can make more accurate judgments of the target (Funder, 1995; Funder et al., 1995). Considering that facial attractiveness carries additional significance for women (Luxen and Van De Vijver, 2006), people may be more accustomed to evaluating women’s attractiveness in everyday life. Thus, we expected gender bias in the accuracy of attractiveness judgment from faces.

The present study was designed to explore the influence of gender factors on the accuracy of people’s judgments of facial attractiveness. In the review of Tsankova and Tair (2021), the accuracy of first impressions refers to “the correspondence between the subjective perception of the interaction partners and some more objective criterion (e.g., Funder and West, 1993; Brauer and Proyer, 2020).” Thus, previous research commonly uses the term “accuracy” to illustrate the agreement between actual cooperative behaviors or self-reported personality and perceived personality from others (e.g., Funder, 1995; Borkenau et al., 2009; Chan et al., 2010; Todorov et al., 2015; Alaei and Rule, 2016). However, with regard to attributes without objective criteria, such as self-reported stress (Little et al., 2011), researchers employ self-other agreement or distinctive self-other agreement (Human et al., 2013) to measure facial judgment accuracy. According to the above definitions of accuracy, in the current research, the accuracy of facial attractiveness was calculated by self-other agreement.

According to the trait visibility effect (Funder and Dobroth, 1987; Watson et al., 2000) and the acquaintanceship effect (Funder, 1995), we hypothesized that people tend to give more accurate ratings of the facial attractiveness of female faces than of male faces. These gender differences arise because across many cultures, a woman’s attractiveness is important (Li et al., 2002; Shackelford et al., 2005), whereas a man’s status and resources are more crucial than his attractiveness (Buss and Schmitt, 1993; Sprecher et al., 1994). Therefore, in Studies 1 and 2, we explored the gender differences in judgment accuracy and metaperception accuracy on facial attractiveness. Study 3 was designed to investigate the cognitive mechanism of these gender biases. Participants were asked to give their ratings on the perceived wealth of the person depicted in each photo in Study 3. The accuracy of perceived economic status in Study 3 was calculated by the correspondence between the participants’ subjective perception of faces and the actual wealth ranking group.

Study 1

This experiment was designed to explore the influence of the perceiver’s gender on accuracy in judging facial attractiveness. Considering that facial attractiveness carries additional significance for women (Luxen and Van De Vijver, 2006), people may be more accustomed to evaluating women’s attractiveness in everyday life, which motivates women to pay more attention than men to their attractiveness. Thus, we hypothesized that (1) people tend to give more accurate ratings of facial attractiveness for female faces than for male faces and (2) women tend to assess people’s facial attractiveness more accurately than men.

Methods

Participants

A total of 90 students participated in Study 1 for payment, including 41 males (M_age = 24.32, SD_age = 3.66) and 49 females (M_age = 24.86, SD_age = 4.68). This study was approved by the internal review board of the Department of Psychology, Renmin University of China. Each participant signed an informed consent form and received monetary compensation for his or her time.

Stimuli

Another 119 undergraduate students (58 male and 61 female, age range 18–25 years) were recruited to have frontal shoulder-up pictures taken with a digital camera in front of a white background for use as stimuli. Before the photos were taken, the students removed all accessories except glasses (which were kept only if a student could not complete the task without them). We asked the students to maintain a natural (emotionally neutral) expression. After the photos were taken, they were asked to rate their own attractiveness in the eyes of others of the same gender and of a different gender using a 9-point scale ranging from 1 (not attractive at all) to 9 (extremely attractive). We did not find a difference between their self-ratings of attractiveness in the eyes of others of the same gender and others of a different gender, t(236) = −0.42, p = 0.678. All photographed participants consented to the use of their photos for our research purposes, including showing their pictures to other participants. All the faces were adjusted to the same size, 295 × 295 pixels.

Apparatus and Procedure

The experiment was conducted on a computer with E-Prime 2.0. Participants were told to use their gut feeling to rate the attractiveness of each face photo. In a typical trial of the study, a fixation point was presented for 500 ms, and then a face photo was shown with a 9-point rating scale below it. Participants were asked to give their rating on the attractiveness of each face photo from 1 (not attractive at all) to 9 (extremely attractive). The experiment contained two blocks with a total of 238 trials, and each face photo was presented once in each block. The order of the photos was random. Participants started with a practice block of 8 trials to familiarize themselves with the task. Between the two blocks, the participants were allowed to take a break and started the next block on their own when they felt ready (see Figure 1). Because each participant rated the same target twice during the study, we used the mean of the two ratings for each face as the other-rating of that face.

Figure 1. The order of events in a typical trial of Studies 1 and 2.

In the review of Devos et al. (2013), self-other agreement is a relative phenomenon that refers to a degree of discrepancy between self-ratings and other-ratings. In previous research, self-other agreement was operationalized as the absolute difference of self and other ratings (Atwater and Yammarino, 1992, 1997; Bernieri et al., 1994; Lee and Carpenter, 2018; Kim et al., 2019) in addition to correlation (Borkenau and Liebler, 1993; Rogers et al., 2018). In the present research, we standardized the ratings of attractiveness for each face by subtracting other-ratings from self-ratings. Specifically, when a participant rated the face of someone of the same gender, the other-rating of attractiveness for this face was subtracted from the target’s self-rating in the eyes of others of the same gender; when the participant and the target differed in gender, the self-rating in the eyes of others of a different gender was used instead. Thus, the standardized rating scores, which index rating accuracy, ranged from −8 to 8, with higher scores indicating that participants rated the target’s attractiveness lower than the target’s self-ratings. The absolute value indicates the size of the difference between self-rating scores and other-rating scores; a higher absolute value indicates a larger discrepancy and therefore lower rating accuracy. The sign indicates whether participants underestimated (positive values) or overestimated (negative values) facial attractiveness compared to self-ratings. All subsequent analyses were based on the standardized data.
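The scoring rule described above can be illustrated with a minimal sketch. This is not the authors' analysis script: the function names, the dictionary keys, and the example ratings are hypothetical, and the sketch assumes that the other-rating is the mean of one participant's two ratings of a face.

```python
from statistics import mean

def pick_self_rating(target, rater_gender):
    """Select the target's self-rating that matches the rater's gender.

    `target` is a hypothetical record holding the target's gender and the two
    self-ratings (in the eyes of same-gender and different-gender others).
    """
    if rater_gender == target["gender"]:
        return target["self_same_gender"]
    return target["self_other_gender"]

def signed_score(target, rater_gender, block_ratings):
    """Self-rating minus other-rating; the possible range is -8 to 8.

    Assumption: the other-rating is the mean of the rater's two block ratings.
    Positive values mean the rater scored the face lower than the target did.
    """
    other_rating = mean(block_ratings)
    return pick_self_rating(target, rater_gender) - other_rating

# Hypothetical data: a female target rated by a male participant in two blocks.
target = {"gender": "female", "self_same_gender": 6, "self_other_gender": 7}
score = signed_score(target, "male", [4, 5])
print(score)       # 2.5 (rater underestimated relative to the self-rating)
print(abs(score))  # 2.5 (size of the self-other discrepancy)
```

Under this reading, a smaller discrepancy corresponds to higher self-other agreement, which is how lower mean scores are interpreted in the Results.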

Results

Mean standardized ratings were submitted to a 2 (participant’s gender: male, female) × 2 (facial gender: male, female) mixed-design ANOVA with facial gender as a within-subject factor and participant’s gender as a between-subject factor (Figure 2). The main effect of facial gender was significant, F(1, 88) = 62.07, p < 0.001, η_p² = 0.414, indicating that female faces (2.14 ± 0.14) were judged more accurately than male faces (2.55 ± 0.13). The main effect of participant’s gender was not significant, F(1, 88) = 0.50, p = 0.483, η_p² = 0.006, indicating that the influence of the participant’s gender on judgment accuracy was relatively limited. More importantly, there was a significant interaction between the participant’s gender and facial gender, F(1, 88) = 6.87, p = 0.010, η_p² = 0.072. Male participants rated female faces (2.30 ± 0.21) more accurately than male faces (2.57 ± 0.19), p = 0.001. Female participants also rated female faces (1.97 ± 0.19) more accurately than male faces (2.52 ± 0.17), p < 0.001. The positive mean scores indicated that, compared to the self-ratings of the targets, the participants’ ratings tended to underestimate the targets’ attractiveness. All participants showed higher rating accuracy in judging the attractiveness of female faces. In addition, for male faces, male participants and female participants had similar rating accuracy (2.57 ± 0.19 vs. 2.52 ± 0.17, p = 0.845), and for female faces, male participants and female participants also had similar rating accuracy (2.30 ± 0.21 vs. 1.97 ± 0.19, p = 0.255).
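As a consistency check on the reported effect sizes, partial eta squared can be recovered from each F value and its degrees of freedom using the standard identity η_p² = (F × df_effect) / (F × df_effect + df_error). The sketch below applies it to the three effects reported above; the function name and the dictionary layout are illustrative only.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)

# (F value, reported partial eta squared) for each effect, all with df = (1, 88).
reported = {
    "facial gender": (62.07, 0.414),
    "participant gender": (0.50, 0.006),
    "interaction": (6.87, 0.072),
}

for name, (f_value, eta_reported) in reported.items():
    eta = partial_eta_squared(f_value, 1, 88)
    print(f"{name}: computed {eta:.3f}, reported {eta_reported:.3f}")
```

All three computed values round to the reported ones, so the F statistics and effect sizes are mutually consistent.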

Figure 2. Rating accuracy of facial attractiveness in Study 1. Standardized rating scores were computed by subtracting other-ratings from self-ratings of facial attractiveness. Error bars represent 1 S.E. of the means.

Study 2

Study 1 found that participants tend to rate the attractiveness of female faces more accurately than that of male faces, which confirms hypothesis 1. Study 2 was designed to replicate these findings. Moreover, to explore whether participants were aware of their rating accuracy, we added a confidence-rating task to the experiment and calculated the correlation between confidence rating and rating accuracy.