Abstract
Perceived trustworthiness is one of the most important facial traits in social interaction. To elucidate how facial trustworthiness is assessed by others and its relationship to other facial traits would have significant theoretical and practical implications. Prior studies have shown that perceived attractiveness and typicality of a face may contribute to trustworthiness judgments; i.e., trustworthy faces are always the typical and attractive ones. Here, by conducting judgments of facial traits (i.e., trustworthiness, attractiveness, and typicality) on the same set of faces, we revealed a more profound relationship among these facial traits. First, we found that trustworthiness judgments did not always peak at the average face, in contrast to previous research. Second, trustworthiness exhibited a nonlinear relationship with attractiveness and typicality: Men relied more on typicality when judging a face as untrustworthy or neutral, whereas women relied more on typicality when judging a face as untrustworthy but more on attractiveness when judging a face as trustworthy. Third, women and men may utilize different traits to evaluate face trustworthiness: The relationship between trustworthiness and typicality judgments was closer in men than in women, whereas women counted on face attractiveness more than men did to evaluate face trustworthiness. These findings demonstrate that judging the trustworthiness of a face is a more complex process than previously thought, which may lead to a better understanding of the mechanisms underlying highly flexible and sophisticated social interactions in humans.
Introduction
Faces convey a wealth of information (e.g., identity and personality traits) central to daily life. When meeting someone, “gut feelings” or first impressions are often rapidly and automatically formed based on facial appearance and then used to make inferences about personality traits, like trustworthiness and attractiveness (Zebrowitz, 2004; Todorov et al., 2005; van’t Wout and Sanfey, 2008; Antonakis and Dalgas, 2009). Such impressions influence a diverse range of critical social outcomes, from mate choice to sentencing decisions (Todorov et al., 2005). Therefore, it is important to understand what underlies face-based personality trait judgments.
Trustworthiness is important for judgments of a variety of personality traits. For instance, when multiple personality traits are represented in a 2D face evaluation space, judgments of trustworthiness account for over half of the variance in face-based social judgments and approximate the general evaluation of face valence (Oosterhof and Todorov, 2008). Prior studies have found that trustworthiness judgments exhibit high agreement among children and adults (Ma et al., 2016), indicating that certain common facial properties may play important roles in trustworthiness judgments. Identifying trustworthiness and its relationship to other face properties is critical for understanding the origin, evolution, and functional significance of trustworthiness.
Many studies have demonstrated that perceived attractiveness may contribute to trustworthiness judgments across races and cultures (Oosterhof and Todorov, 2008; Xu et al., 2012; Todorov et al., 2013). The “beauty is good” stereotype provides a possible explanation for this relationship (Langlois et al., 2000; Lemay et al., 2010): Attractive individuals represent health and fitness, as well as positive personality traits, such as honesty (Rhodes et al., 2007; Zebrowitz et al., 2011). Honesty is also an important component of trustworthiness (Langlois et al., 2000). Moreover, it has been found that facial textural properties (e.g., skin smoothness) directly affect physical attractiveness and indirectly affect trustworthiness (Tsankova and Kappas, 2016). In addition, for mate choice, physical attractiveness may trump judgments of personality traits, including trustworthiness (Fugère et al., 2017). Trustworthiness can be judged after as little as 50 ms of exposure to a neutral face (Todorov et al., 2009) and more quickly than many other personality traits (e.g., likeability, competence, and aggressiveness; Willis and Todorov, 2006), but not more quickly than attractiveness (Olson and Marshuetz, 2005). A recent study showed that attractiveness judgments precede trustworthiness judgments, with lower detection thresholds and shorter decision latencies (Gutierrez-Garcia et al., 2019). Moreover, prior research has found that the judgment-based event-related potential component occurs earlier for attractiveness than for trustworthiness (Calvo et al., 2018). Therefore, individuals may use attractiveness as an easily accessible proxy for judgments of trustworthiness. However, it remains unclear how and to what extent people employ attractiveness cues when judging trustworthiness from faces.
Trustworthiness judgments can also be influenced by face typicality, an important determinant of face recognition (Todorov et al., 2015; Dotsch et al., 2017; Sofer et al., 2017). Face typicality originates from Valentine’s norm-based face space model (Valentine, 1991), where all faces are represented as vectors originating from a typical, or average, face. The further a face is from the typical face, the more distinctive it is and the more easily it can be recognized. Trustworthiness judgments have been found to decrease with distance of computer-generated (morphed) faces from the typical face (Sofer et al., 2015) and to peak around the typical face of one’s own race (Sofer et al., 2017). The same shape of relationship has also been identified between face typicality and attractiveness (Langlois and Roggman, 1990; Rhodes, 2006). However, it has also been shown that the relationship between typicality and attractiveness dissociates to some degree. For example, Perrett and his colleagues found that the average (typical) face composed of a set of female faces is less attractive than an average composed specifically of attractive faces from the same set (Perrett et al., 1994). That is, the average face is attractive but may not be optimally attractive. As trustworthiness judgments are highly correlated with attractiveness judgments, one may wonder whether careful experimental design would reveal such a dissociation in the relationship between face trustworthiness and typicality. Moreover, as both face attractiveness and typicality are important factors in face trustworthiness, it is important to value them in trustworthiness judgments.
Previous studies have shown gender differences in various social behaviors (Derks et al., 2014; Mattarozzi et al., 2015; Lemmers-Jansen et al., 2019). Yet, surprisingly little research has been conducted on gender differences in face-based trustworthiness judgments. One study found that women tend to judge trustworthy faces significantly more trustworthy than men do, although no such differences were found for untrustworthy or neutral faces (Mattarozzi et al., 2015). In addition, women appear to respond more accurately (more in line with experimenter-defined dimensions) than men in trustworthiness judgment tasks (Dzhelyova et al., 2012). These findings tend to indicate that women and men may make face trustworthiness judgments by utilizing different traits, which remain unclear.
In the present study, we explored how trustworthiness judgments rely on attractiveness and typicality cues and whether these relationships are affected by the gender of the rater. In detail, we hypothesized that (1) perceived trustworthiness might not always peak at the average face, like attractiveness; (2) perceived trustworthiness might exhibit a nonlinear relationship with attractiveness and typicality; and (3) there might be a gender-related influence on the above-mentioned relationship.
Materials and Methods
Participants
In total, 119 adults participated in the present study, including (1) 40 participants [19–27 years, mean (M) ± standard deviation (SD) = 22.61 ± 2.67 years] in the face trustworthiness judgment task; (2) 39 participants (19–26 years, M = 21.87 ± 2.02 years) in the face attractiveness judgment task; and (3) 40 participants (19–28 years, M = 22.28 ± 2.36 years) in the facial typicality judgment task. Details on the study participants are shown in Table 1. Participants reported no abnormal neurological history, had normal or corrected-to-normal vision, and were right-handed. All participants provided written informed consent prior to the experiment and were compensated for their participation. All procedures were approved by the Institutional Review Board (2017-IRB-001).
Table 1
| Trait | Women | Men | ||||
|---|---|---|---|---|---|---|
| N | Age range (years) | M ± SD (years) | N | Age range (years) | M ± SD (years) | |
| Trustworthiness | 20 | 19–27 | 22.71 ± 2.87 | 20 | 19–27 | 22.50 ± 2.52 |
| Attractiveness | 20 | 19–25 | 21.00 ± 1.49 | 19 | 19–26 | 22.89 ± 2.14 |
| Typicality | 21 | 19–26 | 21.29 ± 2.12 | 19 | 19–28 | 23.37 ± 2.17 |
Demographic details of the participants.
All participants completed one of three tasks on the same set of face images. Given this between-subjects design, the sample size was determined on the basis of the desired power (0.80), alpha level (0.05), effect size (0.33), and number of groups (three in the main analysis). Using G*Power 3.1 (Faul et al., 2007), the minimum required sample size was calculated as 90.
Stimuli
Black-and-white photographs of young adults were selected from the CAS-PEAL-R1 Face Database (Gao et al., 2008) and SCUT-FBP5500 Database (Liang et al., 2018). Only faces from a frontal view and categorized as neutral faces were selected and utilized. Existing studies have found that women and men make similar trustworthiness and attractiveness judgments on male faces but not on female faces (Foos and Clark, 2011; Mattarozzi et al., 2015). Therefore, we used male faces only. One hundred faces were used to create an average (typical) face, with 20 of the hundred used in the main task and 15 in the practice. None of these faces were familiar to the participants. An average male composite (the typical face) was created using a standard procedure in Abrosoft FantaMorph v5 (Abrosoft Co., Beijing, China; Figure 1A). A set of landmark points (n = 194) was placed on each face to generate the average locations of each landmark. For faces used in the practice and main task, blemishes and hair on the forehead were removed using Adobe Photoshop (Adobe Systems, San Jose, CA, United States). The averaging process can make the composite face more symmetrical than the individual faces and thus affect judgments of facial traits. Therefore, to parse out the effect of averageness from the effect of symmetry, all individual faces were made perfectly symmetrical by averaging each face with its mirror image. All individual faces were standardized on the interocular distance of the average composite face.
Figure 1
For each face used in the main task, we constructed a continuum of faces (n = 14) by manipulating faces’ distance from the typical (average) face (Distance from the Typical face, DFT; Sofer et al., 2015) along individual dimensions from −175 to 175% in 25% steps (Figure 1B). The transformation between the corresponding landmark points (n = 194) of two faces is linear. All faces [n = 20 (original face) × 14 (level) = 280] had the texture of the average face. Each image was cropped tightly to the widest part of the face (thus removing the ears). All stimulus faces were presented against a uniform black background on a Dell P2217H, 22-inch monitor (1920 × 1080 pixels). When viewed from about 70 cm, images of faces subtended a visual angle of approximately 7.67° (H, ranging from 6.77° to 8.56°) × 6.62° (W, ranging from 5.62° to 7.61°).
Procedure
Experiments were conducted with MATLAB 2016a (MathWorks, Natick, MA) and PsychToolbox1 (Golarai et al., 2015). Participants were seated in a comfortable chair in a sound-attenuated room and randomly assigned to one of the three judgment tasks. To familiarize participants with their task, experiments started with a practice session, in which 15 faces were presented in random order. Participants were asked to rely on their “gut feeling” to rate faces on trustworthiness, attractiveness, or typicality using a seven-point Likert scale ranging from one (very untrustworthy/very unattractive/very typical) to seven (very trustworthy/very attractive/very distinct; Figure 2). They were encouraged to use the full range of the scale. Participants completed each trial at their own pace by clicking on the labeled buttons below each face. Once participants made the choice, a mask was presented for 500 ms to avoid the potential effect of visual adaptation on facial evaluation. In the main task, participants first previewed all 281 faces, which were presented for 800 ms each, followed by a mask for 500 ms. Then, after a three-minute break, they rated the 281 faces presented. To avoid participant fatigue, two rest intervals were implemented during testing. All faces were presented in a pseudo-randomized order (each particular face was constrained not to appear more than once per 20 trials; any particular level of faces constrained not to appear more than once per 14 trials) to eliminate the effect of familiarity on judgments of face traits.
Figure 2
Results
Agreement in Trait Judgments
Inter-rater agreement for the three kinds of trait judgments (i.e., trustworthiness, attractiveness, and typicality) was very high, with Cronbach’s alphas ranging from 0.98 to 0.99.
To further examine inter-rater agreement, we calculated Pearson’s correlations between each participant’s ratings and mean ratings across remaining individuals in their respective judgment groups (Kramer et al., 2018). Then, correlation coefficients were Fisher Z-transformed for normalization and tested against 0 with a one-sample t-test. We found that normalized correlation coefficients were significantly greater than zero (p < 0.001; Table 2), indicating that face-based trait judgments are highly reliable across participants. To explore the differences in judgment type and gender, we also performed a 3 (judgment type: trustworthiness, attractiveness, and typicality) × 2 (gender: women and men) two-way ANOVA followed up with post-hoc tests. The main effect of judgment type was significant [F(2,113) = 7.99, p < 0.001, partial η2 = 0.12]. Post-hoc (Bonferroni corrected) tests demonstrated no significant differences between agreements in attractiveness and typicality judgments, but both were significantly higher than that of trustworthiness judgments (compared with attractiveness: p = 0.002; compared with typicality: p = 0.003). These results suggest that trustworthiness may be a relatively complicated trait compared with attractiveness and typicality. We also found a significant main effect of gender [F(1,113) = 13.98, p < 0.001, partial η2 = 0.11], indicating higher agreement in judgments of facial traits among women than men. No interaction between judgment type and gender was found [F(2,113) = 0.05, p = 0.95, partial η2 = 0.001].
Table 2
| Trustworthiness | Attractiveness | Typicality | |
|---|---|---|---|
| All | 0.98 ± 0.24∗∗∗ | 1.14 ± 0.19∗∗∗ | 1.13 ± 0.19∗∗∗ |
| Women | 1.03 ± 0.24∗∗∗ | 1.21 ± 0.12∗∗∗ | 1.20 ± 0.16∗∗∗ |
| Men | 0.90 ± 0.23∗∗∗ | 1.05 ± 0.20∗∗∗ | 1.04 ± 0.19∗∗∗ |
Descriptive statistics on agreements (Z-value) in trait judgments (M ± SD).
p < 0.001.
Influence of DFT on Trait Judgments
We next explored the influence of DFT on participants’ judgments of trustworthiness, attractiveness, and typicality.
To identify the predicted DFT where facial trait judgments reached a maximum, Gaussian curves were fitted to mean judgments for all tested faces (Group All; DeBruine et al., 2007). For ease of presentation, ratings of face typicality were reversed so that the higher the rating, the more typical the face. For all three kinds of judgments, the Gaussian curve fits were very good, with R2 values of 0.93–0.95. Moreover, all three judgments peaked around the average face (trustworthiness: 3.64%; attractiveness: 7.41%; typicality: 2.80%; Figure 3). Thus, our results suggest that the average face is typical/trustworthy/attractive in general, as found in prior research (Langlois and Roggman, 1990; Todorov et al., 2015).
Figure 3
Previous studies have shown that attractive faces are not always average, whereas trustworthy faces are almost always average (DeBruine et al., 2007; Sofer et al., 2015). To re-examine these findings, we chose the most/least attractive original face based on mean participant ratings (most attractive rating = 4.87; least attractive rating = 2.08). Figure 4A shows the mean attractiveness judgments of the most/least attractive face continuum (called Group Most and Group Least, respectively) as a function of DFT. The predicted DFT for the peak of mean attractiveness judgments for Group Most and Group Least was distant from the average face (Group Most, peak = 21.64%; Group Least, peak = −21.54%). To confirm this impression, we fitted Gaussian curves to each participant’s data and calculated the predicted DFT for the peak of attractiveness judgment for each participant. A one-sample t-test against 0 was then conducted. Results showed that the predicted DFT for the peak of attractiveness judgments was indeed greater than 0% in Group Most (p < 0.001) and less than 0% in Group Least (p < 0.001). These results are consistent with previous findings that perceived attractiveness may not always peak at the average face (Sofer et al., 2015), confirming the reliability and effectiveness of the present method.
Figure 4
We then conducted similar analyses to examine whether trustworthiness would always peak around the average face. We chose the most/least trustworthy original face based on mean participant ratings (most trustworthy rating = 4.93; least trustworthy rating = 2.03). Figure 4B shows the mean trustworthiness judgments of the most/least trustworthy face continuum (called Group Most and Group Least, respectively) as a function of DFT. The predicted DFT for the peak of mean trustworthiness judgment for Group Most was distant from the average face (30.84%), whereas the peak for Group Least was close to the average face (−4.07%). Results of Gaussian curve fitting in each participant showed that the predicted DFT for the peak of trustworthiness judgments was indeed greater than 0% in Group Most (p < 0.001) but not different from 0% in Group Least (p = 0.456) indicating that, as attractiveness, perceived trustworthiness does not always peak at the average face.
To validate our method, we performed the same procedures for typicality judgments. We found that the predicted DFT for the peak of mean typicality judgments was indeed close to the average face in both Group Most (peak = 0.64%, p = 0.747) and Group Least (peak = 3.21%, p = 0.277; Figure 4C).
No significant differences were found when repeating the same analysis over data split based on the gender of the participants (Table 3).
Table 3
| Trait | All | Women | Men | |
|---|---|---|---|---|
| Attractiveness | Group All | 8.39 ± 14.31 | 4.66 ± 5.380 | 12.32 ± 10.88 |
| Group Most | 23.24 ± 22.69 | 37.84 ± 15.38 | 28.83 ± 27.34 | |
| Group Least | −21.28 ± 14.31 | −22.78 ± 9.860 | −19.70 ± 18.02 | |
| Trustworthiness | Group All | 2.96 ± 10.36 | 3.56 ± 4.800 | 2.36 ± 14.02 |
| Group Most | 20.99 ± 22.38 | 25.23 ± 14.59 | 16.75 ± 27.87 | |
| Group Least | −2.19 ± 18.41 | 0.52 ± 20.92 | −4.9 ± 15.58 | |
| Typicality | Group All | 2.75 ± 7.270 | 1.24 ± 4.890 | 3.9 ± 9.070 |
| Group Most | −0.91 ± 16.89 | 1.33 ± 10.40 | −1.7 ± 22.34 | |
| Group Least | −1.96 ± 11.26 | −3.93 ± 16.84 | −0.96 ± 16.76 |
Descriptive statistics of the predicted DFT for peaks in trait judgments based on Gaussian curve fitting in each participant’s data (M ± SD).
As both the peak trustworthy and attractive faces are not always the average face, we wondered which trait was more related to face typicality. We fitted Gaussian curves to each original face continuum and calculated the corresponding predicted DFT for peaks of trustworthiness and attractiveness judgments. To model the relationship between mean trustworthiness/attractiveness ratings of original faces (Table 4) and the predicted DFT for peaks of mean trustworthiness/attractiveness judgments in corresponding continua, we fitted linear equations to the observed data (Figure 5). We conducted the same procedure for typicality judgments as a control. If one face-based trait judgment is mainly related to a face’s DFT, then for every face continuum this trait judgment should always peak at the average face and no significant correlation should be found between original faces’ ratings and the peaks of this trait judgment. Moreover, linear fitting slopes reflect the effect size of the DFT on trait judgments; i.e., the bigger the slope, the smaller the effect size.
Table 4
| Attractiveness | Trustworthiness | Typicality |
|---|---|---|
| 2.74 ± 1.14 | 3.23 ± 1.37 | 4.18 ± 1.38 |
| 4.49 ± 1.59 | 4.93 ± 1.38 | 5.40 ± 1.19 |
| 4.59 ± 1.83 | 3.63 ± 1.37 | 4.35 ± 1.42 |
| 3.82 ± 1.52 | 4.73 ± 1.41 | 4.95 ± 1.13 |
| 4.87 ± 1.26 | 4.53 ± 1.45 | 5.35 ± 1.23 |
| 3.59 ± 1.70 | 2.40 ± 1.24 | 3.23 ± 1.39 |
| 3.56 ± 1.50 | 3.58 ± 1.39 | 4.58 ± 1.34 |
| 2.08 ± 0.96 | 3.00 ± 1.26 | 3.40 ± 1.24 |
| 2.74 ± 1.70 | 2.03 ± 1.03 | 3.13 ± 1.11 |
| 4.03 ± 1.14 | 4.20 ± 1.47 | 5.20 ± 1.34 |
| 2.54 ± 1.21 | 2.43 ± 0.98 | 3.63 ± 1.31 |
| 3.77 ± 1.29 | 4.65 ± 1.27 | 5.05 ± 1.26 |
| 4.13 ± 1.59 | 4.48 ± 1.34 | 5.30 ± 1.14 |
| 4.00 ± 1.15 | 4.65 ± 1.41 | 5.55 ± 1.11 |
| 3.31 ± 1.06 | 3.78 ± 1.21 | 4.65 ± 1.29 |
| 3.13 ± 1.13 | 3.15 ± 1.33 | 4.25 ± 1.43 |
| 3.44 ± 1.47 | 3.68 ± 1.47 | 4.20 ± 1.30 |
| 4.21 ± 1.45 | 4.28 ± 1.34 | 5.40 ± 1.28 |
| 2.46 ± 1.10 | 3.38 ± 1.08 | 3.83 ± 1.34 |
| 3.38 ± 1.29 | 4.13 ± 1.30 | 4.88 ± 1.20 |
Descriptive statistics on trait judgments of each original face (M ± SD).
Figure 5
We did not find significant correlations for typicality judgments (r = 0.41, p = 0.075, R2 = 0.17, slope = 5.54); however, trustworthiness and attractiveness judgments exhibited significant correlations (for trustworthiness: r = 0.65, p = 0.002, R2 = 0.42, slope = 9.94; for attractiveness: r = 0.63, p = 0.003, R2 = 0.40, slope = 13.98). We utilized the bootstrap method (n = 1,000), resampling the participants to simulate the distributions of slopes if the experiments were repeated with different subjects. We found that the slope for trustworthiness judgments was significantly lower than that for attractiveness judgments (p = 0.037), suggesting that the relationship between trustworthiness and typicality may be closer than that between attractiveness and typicality.
To explore the effect of gender on relationships among trait judgments, we performed similar analyses for women and men separately. We found that the slope difference between attractiveness and trustworthiness judgments was significant in men but not in women (men, p = 0.029; women, p = 0.345; Table 5), indicating that there are gender differences in the role of typicality in face evaluation.
Table 5
| Trait | Women | Men |
|---|---|---|
| Attractiveness | 11.15 ± 1.55 | 15.48 ± 3.18* |
| Trustworthiness | 10.57 ± 1.15### | 8.65 ± 1.39 |
| Typicality | 4.96 ± 0.99^^^ | 6.53 ± 1.59^^^ |
Slopes of fitted lines for the relationship between mean trait ratings of original faces and the predicted DFT for peaks of mean trait judgments in corresponding continua by gender of the participants (M ± SD).
p < 0.05 between attractiveness and trustworthiness judgments.
p < 0.001 between trustworthiness and typicality judgments.
p < 0.001 between typicality and attractiveness judgments.
Principal Component Analyses on Trait Judgments
We also conducted principal component analyses (PCAs) on these trait judgments as described in the previous study (Oosterhof and Todorov, 2008), with mean ratings of the three trait judgments for all tested faces (n = 281). Based on the contribution to the variance of trait judgments, we choose the first two components. The first principal component (PC) accounted for 97.19% of variance, while the second PC accounted for 1.85% of variance. According to the previous study, the first component could be interpreted as the valence (trustworthiness) evaluation, while the second component could be interpreted as the dominance evaluation (Oosterhof and Todorov, 2008). For illustration purposes, we constructed a space based on the first two PCs with the locations of trait judgments represented by their correlations with these two PCs (Oosterhof and Todorov, 2008). In this space, the location of trustworthiness judgments was closer to that of typicality judgments than to that of attractiveness judgments (Figure 6A). These findings suggest that face typicality may be a more important determinant of perceived trustworthiness than attractiveness.
Figure 6
We conducted similar analyses for the two genders of participants separately. Principal component analyses showed that the relationship between perceived trustworthiness and typicality was closer in men than in women (Figures 6B,C). These results indicate that women may count more on perceived attractiveness than men to make facial trustworthiness judgments, whereas men might count more on perceived typicality than women to make trustworthiness judgments.
Regression Models of Trustworthiness Judgments
To compare the roles of typicality and attractiveness in trustworthiness judgments, we conducted three regression models by implementing: (1) typicality (Model T); (2) attractiveness (Model A); and (3) typicality and attractiveness combined (Model C). We utilized the bootstrap method (n = 1,000), resampling the faces to simulate the distributions of performances (adjusted R2) of models if the experiments were repeated with different faces. We found that the performance of Model T was better than that of Model A (p < 0.001; Figure 7), indicating typicality may be more important for trustworthiness than attractiveness.
Figure 7
We conducted similar analyses for the two genders of participants separately and compared the models within and between genders. Using the bootstrap method (n = 1,000), we found that the performance of Model A was better in women than in men (p = 0.008), whereas the performance of Model T was better in men than in women (p = 0.033; Figure 7).
When analyzing influence of DFT on trait judgments, we found that the predicted DFT for the peak of trustworthiness judgments was not close to the average face in Group Most but was close in Group Least, whereas the predicted DFT for the peak of attractiveness judgments was not close to the average face in both Group Most and Group Least, indicating a potential nonlinear relationship between perceived trustworthiness and typicality/ attractiveness. To explore this possibility, we divided all tested faces into three sub-groups based on the mean ratings of face trustworthiness: Group Untrustworthy, faces with a mean score of < 3; Group Trustworthy, faces with a mean score of ≥ 5; and Group Neutral, all remaining faces. To exclude the potential effects of different numbers of faces across sub-groups on regression analyses, we randomly chose the same number of faces from each sub-group (n = 61) to perform the above regression analyses.
Figure 8A shows the performance of the three models for the different trustworthiness sub-groups. The statistical analysis method is the same as Figure 7. We utilized the bootstrap method (n = 1,000), resampling the faces to simulate the distributions of performances (adjusted R2) of Models if the experiments were repeated with different faces. We found that the performance of Model C in Group Trustworthy was significantly lower than the one in Group Untrustworthy (p = 0.002), and the latter one was significantly lower than the one in Group Neutral (p = 0.024), suggesting that people may need additional cues besides attractiveness and typicality to make non-neutral (untrustworthy and especially trustworthy) judgments of facial trustworthiness. The performance of Model T across three sub-groups showed the similar pattern as the one of Model C: Group Trustworthy vs. Group Untrustworthy, p < 0.001; Group Untrustworthy vs. Group Neutral, p = 0.059. However, the performance of Model A across three sub-groups showed a different pattern from those of Models C and T: The performance of Model A in Group Trustworthy was similar to the one in Group Untrustworthy (p = 0.479), and both of them were significantly lower than the one in Group Neutral (Group Untrustworthy: p = 0.002; Group Trustworthy: p < 0.001). Moreover, the performance of Model A was significantly lower than those for Model T in Group Untrustworthy and Neutral (Group Untrustworthy: p < 0.001; Group Neutral: p = 0.011). However, this phenomenon was reversed in Group Trustworthy (p = 0.015), indicating that people may rely more on attractiveness to judge trustworthy faces but rely more on typicality to make trustworthiness judgments of untrustworthy or neutral faces.
Figure 8
We next performed the above-mentioned analyses in women and men separately (Figures 8B,C). In Group Untrustworthy and Trustworthy but not in Group Neutral, adjusted R2 values of Model A were significantly or marginally higher in women than in men (Group Untrustworthy, p = 0.017; Group Trustworthy, p = 0.055). In Group Neutral but not in Group Untrustworthy and Trustworthy, the performance of Model T was significantly higher in men than in women (p = 0.005). These results indicate that women may count more on perceived attractiveness than men to trust or distrust faces, whereas men might count more on perceived typicality than women to make trustworthiness judgments of neutral faces.
Discussion
In the present study, we investigated the relationship between trustworthiness and attractiveness/typicality judgments. Our results showed that trustworthy faces were not always typical. Moreover, we found that trustworthiness exhibited a complex nonlinear relationship with perceived attractiveness and typicality: Trustworthiness judgments relied more on attractiveness when judging trustworthy faces but relied more on typicality when judging neutral and untrustworthy faces. Furthermore, our study showed that there were gender differences in trustworthiness judgments. Below, we discuss the significance of these findings for understanding the underlying mechanisms of facial trait evaluation.
Trustworthy Faces, Like Attractive Faces, Are Not Always Average
By determining the predicted DFT for peaks of trustworthiness judgments in different face continua (all faces, the most attractive face, and the most trustworthy face), we found that trustworthiness judgments in general (e.g., Group All) peaked around the average face. These findings are consistent with previous studies, which have reported that the relationship between face DFT and trustworthiness represents an inverted u-shape whenever natural faces, computer-generated faces, or a continuum of faces that vary on a typicality-attractiveness dimension are evaluated (Sofer et al., 2015, 2017; Todorov et al., 2015). However, it should be noted that in the most trustworthy face continuum (or the so-called typicality-trustworthiness dimension), which to the best of our knowledge has not been studied previously, trustworthiness judgments peaked away from the average face and toward the most trustworthy face. That is, an average face is trustworthy, but trustworthy faces are not always average. These results echo those found in the present study and previous research on judgments of attractiveness, which is a personality trait highly related to trustworthiness: Attractiveness judgments peak away from the average face and toward the most/least attractive face in the most/least attractive face continuum, respectively (DeBruine et al., 2007; Sofer et al., 2015). Our study compensates for the lack of trustworthiness judgments in the typicality-trustworthiness dimension and indicates that face typicality is important for trustworthiness judgments but to a smaller degree than previously thought (Sofer et al., 2015; Todorov et al., 2015).
Both trustworthiness and attractiveness judgments are important components of face evaluation. To further explore the role of face typicality in face evaluation, we compared values of face typicality for trustworthiness and attractiveness judgments. Our results showed that the trustworthiness/attractiveness ratings of each original face affected the predicted DFT of the trustworthiness/attractiveness judgment peaks in the corresponding face continuum, with a larger slope for attractiveness judgments than for trustworthiness ones. These findings indicate that the value of face typicality may be greater for trustworthiness judgments than for attractiveness judgments. These results are in line with locations of face trustworthiness and attractiveness in the 2D space of face evaluation defined by valence and dominance evaluation of faces, in which trustworthiness is closer to the valence dimension than attractiveness (Oosterhof and Todorov, 2008). Thus, the value of face typicality for one trait judgments may be associated with this trait judgment’s relationship with valence evaluation of faces: The closer the trait judgment to valence evaluation of faces, the larger the face typicality value. Our findings indicate a close link between face evaluation (especially valence evaluation) and face recognition, adding important information to the current body of knowledge regarding the role of typicality in face processing.
Nonlinear Relationships of Face Attractiveness and Typicality With Perceived Trustworthiness
To explore and differentiate the roles of typicality and attractiveness in trustworthiness judgments, we constructed principal component analysis and multiple regression models. Our results showed that, in general, both face typicality and attractiveness predicted trustworthiness well, with better performance of typicality than attractiveness (Figures 6, 7). However, when we divided all tested faces into three sub-groups based on the mean ratings of face trustworthiness (Figure 8), we found that neither typicality nor attractiveness predicted trustworthiness when judging trustworthy faces as well as they did when judging neutral faces, indicating that additional information besides typicality and attractiveness is required in order to trust people based on faces. We also found that people relied on perceived attractiveness more than typicality when giving high trustworthiness ratings. Attractiveness may carry additional social information that cannot be explained by typicality, as reflected in the present work and prior studies that attractive faces are not always average (typical) faces (Valentine, 1991; DeBruine et al., 2007). Several facial traits have been proposed to influence both attractiveness and trustworthiness judgments, such as emotional expressions and feminine/masculine facial cues (Oosterhof and Todorov, 2008; Hu et al., 2018). These facial traits may contribute more to judge trustworthy faces than neutral faces. Though emotionally neutral faces were used in the present study, they may still show subtle cues of emotion (e.g., anger and happiness) that may contribute to perceived trustworthiness (Oosterhof and Todorov, 2008; Todorov and Engell, 2008; Lischke et al., 2018). Further studies are needed to explore the potential factors that are not clued by attractiveness and typicality but may affect judgments of trustworthy faces.
Gender Differences in Trustworthiness Judgments
We also investigated potential differences in how people of different genders judge trustworthiness, which have been rarely studied. Our results showed that men relied more on typicality when judging a face as untrustworthy or neutral, whereas women relied more on typicality when judging a face as untrustworthy but more on attractiveness when judging a face as trustworthy. Moreover, we found that the relationship between trustworthiness and typicality judgments was closer in men than in women when judging neutral faces, whereas women counted on face attractiveness more than men did to evaluate face trustworthiness when judging trustworthy (and likely untrustworthy) faces.
Our data indicate that women and men may utilize different traits to evaluate face trustworthiness. Previous studies have shown the influence of gender on a broad range of social behaviors, including trust (Derks et al., 2014; Mattarozzi et al., 2015). It has been suggested that men are more trusting than women (Derks et al., 2014), though other research has cast doubt on gender differences in trust (Kanagaretnam et al., 2009). However, the role of gender of rater in facial trustworthiness judgments has been rarely studied. One previous study found that women are faster and more accurate in the perception of facial trustworthiness than men (Dzhelyova et al., 2012). Moreover, adaptation to facial trustworthiness occurs in women but not in men (Wincenciak et al., 2013). Here, we found that gender differences appeared as differential patterns in trustworthiness judgments under different judgment conditions. Since only male faces were used in the present study, we are unable to disentangle whether such gender differences are caused by different strategies of evaluating face trustworthiness between women and men or between same-gender and opposite-gender judgments. Moreover, previous studies have found that female faces are perceived to be more trustworthy than the male ones (Dong et al., 2018). Future studies could also incorporate female face stimuli and a cross-gender design to better understand gender effects on facial trait judgments.
Taken together, our study’s findings delineated the relationship between face trustworthiness and typicality/attractiveness in detail and found significant gender effects on trustworthiness judgments. Our findings demonstrated that to judge the trustworthiness of a face is a more complex process than previously thought, which may lead to a better understanding of the mechanisms underlying highly flexible and sophisticated social interactions in humans.
Statements
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by the Ethics Committee of the Institutional Review Board of Institute of Biophysics, Chinese Academy of Sciences (CAS). The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
NLi and NLiu designed the experiments and drafted the manuscript. NLi performed the experiments and analyzed the data. NLiu supervised the data analysis. All the authors approved the final manuscript for submission.
Funding
The project was supported by the Beijing Municipal Science & Technology Commission (Z181100001518002), CAS (QYZDB-SSW-SMC033), and the National Natural Science Foundation of China (31730039).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
References
1
AntonakisJ.DalgasO. (2009). Predicting elections: child’s play!Science323:1183. doi: 10.1126/science.1167748
2
CalvoM. G.Gutierrez-GarciaA.BeltranD. (2018). Neural time course and brain sources of facial attractiveness vs. trustworthiness judgment. Cogn. Affect. Behav. Neurosci.18, 1233–1247. doi: 10.3758/s13415-018-0634-0
3
DeBruineL. M.JonesB. C.UngerL.LittleA. C.FeinbergD. R. (2007). Dissociating averageness and attractiveness: attractive faces are not always average. J. Exp. Psychol. Hum. Percept. Perform.33, 1420–1430. doi: 10.1037/0096-1523.33.6.1420
4
DerksJ.LeeN. C.KrabbendamL. (2014). Adolescent trust and trustworthiness: role of gender and social value orientation. J. Adolesc.37, 1379–1386. doi: 10.1016/j.adolescence.2014.09.014
5
DongY.LiuY.JiaY.LiY.LiC. (2018). Effects of facial expression and facial gender on judgment of trustworthiness: the modulating effect of cooperative and competitive settings. Front. Psychol.9:2022. doi: 10.3389/fpsyg.2018.02022
6
DotschR.HassinR. R.TodorovA. (2017). Statistical learning shapes face evaluation. Nat. Hum. Behav.1, 1–6. doi: 10.1038/s41562-016-0001
7
DzhelyovaM.PerrettD. I.JentzschI. (2012). Temporal dynamics of trustworthiness perception. Brain Res.1435, 81–90. doi: 10.1016/j.brainres.2011.11.043
8
FaulF.ErdfelderE.LangA. G.BuchnerA. (2007). G*power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods39, 175–191. doi: 10.3758/BF03193146
9
FoosP. W.ClarkM. C. (2011). Adult age and gender differences in perceptions of facial attractiveness: beauty is in the eye of the older beholder. J. Genet. Psychol.172, 162–175. doi: 10.1080/00221325.2010.526154
10
FugèreM. A.ChabotC.DoucetteK.CousinsA. J. (2017). The importance of physical attractiveness to the mate choices of women and their mothers. Evol. Psychol. Sci.3, 243–252. doi: 10.1007/s40806-017-0092-x
11
GaoW.CaoB.ShanS. G.ChenX. L.ZhouD. L.ZhangX. H.et al. (2008). The CAS-PEAL large-scale Chinese face database and baseline evaluations. IEEE Trans. Syst. Man Cybern. A-Syst. Hum.38, 149–161. doi: 10.1109/TSMCA.2007.909557
12
GolaraiG.GhahremaniD. G.EberhardtJ. L.GabrieliJ. D. (2015). Distinct representations of configural and part information across multiple face-selective regions of the human brain. Front. Psychol.6:1710. doi: 10.3389/fpsyg.2015.01710
13
Gutierrez-GarciaA.BeltranD.CalvoM. G. (2019). Facial attractiveness impressions precede trustworthiness inferences: lower detection thresholds and faster decision latencies. Cognit. Emot.33, 378–385. doi: 10.1080/02699931.2018.1444583
14
HuY.AbbasiN. U. H.ZhangY.ChenH. (2018). The effect of target sex, sexual dimorphism, and facial attractiveness on perceptions of target attractiveness and trustworthiness. Front. Psychol.9:942. doi: 10.3389/fpsyg.2018.00942
15
KanagaretnamK.MestelmanS.NainarK.ShehataM. (2009). The impact of social value orientation and risk attitudes on trust and reciprocity. J. Econ. Psychol.30, 368–380. doi: 10.1016/j.joep.2008.12.003
16
KramerR. S. S.MilevaM.RitchieK. L. (2018). Inter-rater agreement in trait judgements from faces. PLoS One13:e0202655. doi: 10.1371/journal.pone.0202655
17
LangloisJ. H.KalakanisL.RubensteinA. J.LarsonA.HallamM.SmootM. (2000). Maxims or myths of beauty? A meta-analytic and theoretical review. Psychol. Bull.126, 390–423. doi: 10.1037/0033-2909.126.3.390
18
LangloisJ. H.RoggmanL. A. (1990). Attractive faces are only average. Psychol. Sci.1, 115–121.
19
LemayE. P.Jr.ClarkM. S.GreenbergA. (2010). What is beautiful is good because what is beautiful is desired: physical attractiveness stereotyping as projection of interpersonal goals. Personal. Soc. Psychol. Bull.36, 339–353. doi: 10.1177/0146167209359700
20
Lemmers-JansenI. L. J.FettA. J.ShergillS. S.van KesterenM. T. R.KrabbendamL. (2019). Girls-boys: an investigation of gender differences in the behavioral and neural mechanisms of trust and reciprocity in adolescence. Front. Hum. Neurosci.13:257. doi: 10.3389/fnhum.2019.00257
21
LiangL.LinL.JinL.XieD.LiM. (2018). “SCUT-FBP5500: a diverse benchmark dataset for multi-paradigm facial beauty prediction.” in 2018 24th International Conference on Pattern Recognition (ICPR); August 20-24, 2018; IEEE.
22
LischkeA.JungeM.HammA. O.WeymarM. (2018). Enhanced processing of untrustworthiness in natural faces with neutral expressions. Emotion18, 181–189. doi: 10.1037/emo0000318
23
MaF.XuF.LuoX. (2016). Children’s facial trustworthiness judgments: agreement and relationship with facial attractiveness. Front. Psychol.7:499. doi: 10.3389/fpsyg.2016.00499
24
MattarozziK.TodorovA.MarzocchiM.VicariA.RussoP. M. (2015). Effects of gender and personality on first impression. PLoS One10:e0135529. doi: 10.1371/journal.pone.0135529
25
OlsonI. R.MarshuetzC. (2005). Facial attractiveness is appraised in a glance. Emotion5, 498–502. doi: 10.1037/1528-3542.5.4.498
26
OosterhofN. N.TodorovA. (2008). The functional basis of face evaluation. Proc. Natl. Acad. Sci. U. S. A.105, 11087–11092. doi: 10.1073/pnas.0805664105
27
PerrettD. I.MayK. A.YoshikawaS. (1994). Facial shape and judgements of female attractiveness. Nature368, 239–242. doi: 10.1038/368239a0
28
RhodesG. (2006). The evolutionary psychology of facial beauty. Annu. Rev. Psychol.57, 199–226. doi: 10.1146/annurev.psych.57.102904.190208
29
RhodesG.YoshikawaS.PalermoR.SimmonsL. W.PetersM.LeeK.et al. (2007). Perceived health contributes to the attractiveness of facial symmetry, averageness, and sexual dimorphism. Perception36, 1244–1252. doi: 10.1068/p5712
30
SoferC.DotschR.OikawaM.OikawaH.WigboldusD. H. J.TodorovA. (2017). For your local eyes only: culture-specific face typicality influences perceptions of trustworthiness. Perception46, 914–928. doi: 10.1177/0301006617691786
31
SoferC.DotschR.WigboldusD. H.TodorovA. (2015). What is typical is good: the influence of face typicality on perceived trustworthiness. Psychol. Sci.26, 39–47. doi: 10.1177/0956797614554955
32
TodorovA.DotschR.PorterJ. M.OosterhofN. N.FalvelloV. B. (2013). Validation of data-driven computational models of social perception of faces. Emotion13, 724–738. doi: 10.1037/a0032335
33
TodorovA.EngellA. D. (2008). The role of the amygdala in implicit evaluation of emotionally neutral faces. Soc. Cogn. Affect. Neurosci.3, 303–312. doi: 10.1093/scan/nsn033
34
TodorovA.MandisodzaA. N.GorenA.HallC. C. (2005). Inferences of competence from faces predict election outcomes. Science308, 1623–1626. doi: 10.1126/science.1110589
35
TodorovA.OlivolaC. Y.DotschR.Mende-SiedleckiP. (2015). Social attributions from faces: determinants, consequences, accuracy, and functional significance. Annu. Rev. Psychol.66, 519–545. doi: 10.1146/annurev-psych-113011-143831
36
TodorovA.PakrashiM.OosterhofN. N. (2009). Evaluating faces on trustworthiness after minimal time exposure. Soc. Cogn.27, 813–833. doi: 10.1521/soco.2009.27.6.813
37
TsankovaE.KappasA. (2016). Facial skin smoothness as an indicator of perceived trustworthiness and related traits. Perception45, 400–408. doi: 10.1177/0301006615616748
38
ValentineT. (1991). A unified account of the effects of distinctiveness, inversion, and race in face recognition. Q. J. Exp. Psychol. Sect. A43, 161–204. doi: 10.1080/14640749108400966
39
van’t WoutM.SanfeyA. G. (2008). Friend or foe: the effect of implicit trustworthiness judgments in social decision-making. Cognition108, 796–803. doi: 10.1016/j.cognition.2008.07.002
40
WillisJ.TodorovA. (2006). First impressions: making up your mind after a 100-ms exposure to a face. Psychol. Sci.17, 592–598. doi: 10.1111/j.1467-9280.2006.01750.x
41
WincenciakJ.DzhelyovaM.PerrettD. I.BarracloughN. E. (2013). Adaptation to facial trustworthiness is different in female and male observers. Vis. Res.87, 30–34. doi: 10.1016/j.visres.2013.05.007
42
XuF.WuD. C.ToriyamaR.MaF. L.ItakuraS.LeeK. (2012). Similarities and differences in Chinese and Caucasian adults’ use of facial cues for trustworthiness judgments. PLoS One7:e53387. doi: 10.1371/journal.pone.0053387
43
ZebrowitzL. A. (2004). The origin of first impressions. J. Cult. Evol. Psychol.2, 93–108. doi: 10.1556/JCEP.2.2004.1-2.6
44
ZebrowitzL. A.WangR.BronstadP. M.EisenbergD.UndurragaE.Reyes-GarcíaV.et al. (2011). First impressions from faces among U.S. and culturally isolated Tsimane’ people in the Bolivian rainforest. J. Cross-Cult. Psychol.43, 119–134. doi: 10.1177/0022022111411386
Summary
Keywords
face perception, trustworthiness, attractiveness, typicality, gender differences
Citation
Li N and Liu N (2021) The Nonlinear and Gender-Related Relationships of Face Attractiveness and Typicality With Perceived Trustworthiness. Front. Psychol. 12:656084. doi: 10.3389/fpsyg.2021.656084
Received
20 January 2021
Accepted
18 June 2021
Published
14 July 2021
Volume
12 - 2021
Edited by
Marco Tommasi, University of Studies G. d’Annunzio Chieti and Pescara, Italy
Reviewed by
Elena Tsankova, Institute for Population and Human Studies (BAS), Bulgaria; Junchen Shang, Liaoning Normal University, China
Updates
Copyright
© 2021 Li and Liu.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ning Liu, ningliu.brain@gmail.com
This article was submitted to Personality and Social Psychology, a section of the journal Frontiers in Psychology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.