ORIGINAL RESEARCH article

Front. Psychol., 18 January 2021

Sec. Psychology of Language

Volume 11 - 2020 | https://doi.org/10.3389/fpsyg.2020.619222

Emotivity in the Voice: Prosodic, Lexical, and Cultural Appraisal of Complaining Speech

  • School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada

Abstract

Emotive speech is a social act in which a speaker displays emotional signals with a specific intention; in the case of third-party complaints, this intention is to elicit empathy in the listener. The present study assessed how the emotivity of complaints was perceived in various conditions. Participants listened to short statements describing painful or neutral situations, spoken with a complaining or neutral prosody, and evaluated how complaining the speaker sounded. In addition to manipulating features of the message, social-affiliative factors which could influence complaint perception were varied by adopting a cross-cultural design: participants were either QuƩbƩcois (French Canadian) or French and listened to utterances expressed by both cultural groups. The presence of a complaining tone of voice had the largest effect on participant evaluations, while the nature of statements had a significant, but smaller influence. Marginal effects of culture on explicit evaluation of complaints were found. A multiple mediation analysis suggested that mean fundamental frequency was the main prosodic signal that participants relied on to detect complaints, though most of the prosody effect could not be linearly explained by acoustic parameters. These results highlight a tacit agreement between speaker and listener: what characterizes a complaint is how it is said (i.e., the tone of voice), more than what it is about or who produces it. More generally, the study emphasizes the central importance of prosody in expressive speech acts such as complaints, which are designed to strengthen social bonds and supportive responses in interactive behavior. This intentional and interpersonal aspect in the communication of emotions needs to be further considered in research on affect and communication.

Introduction

While the role of the voice in social interaction has been receiving growing interest over the last decades, literature on the topic has been scattered across research fields. On the one hand, experimental psychology has been focusing on affective speech and emotions (Frick, 1985; Scherer, 2003; Juslin, 2013); on the other hand, intentionality, speech acts, and attitudes have been mostly addressed by pragmatics and theoretical linguistics (Searle, 1965; Grice, 1989; Wichmann, 2002; Culpeper and Terkourafi, 2017). A large part of our daily social interactions is inherently emotive, relying on the attitudinal, intentional use of emotional signals (Caffi and Janney, 1994). These interactions, involving both speaker and listener in a complex collaborative timeline, remain poorly understood. The nature and components of emotive interactions can be investigated through an intersectional approach, embedding social and affective psychology methods into the theoretical pragmatics framework of emotivity through the Emotions As Social Information (EASI) model (Van Kleef, 2009). Focusing on the case of complaints, the present study examines how emotivity is conveyed through speech, and how affective signals in the voice are processed in different social and cultural contexts.

Complaints are intentional verbal expressions of social pain, distress, or displeasure (Boxer, 1993; Drew, 1998; Laforest, 2002), and are usually divided in two categories. Direct complaints are addressed directly to the source of the issue, with the purpose of terminating or solving the issue (Trenchs, 1994; Laforest, 2002). The present study focuses on indirect or third-party complaints, which are addressed to a third party usually unrelated to the issue (Drew, 1998; Edwards, 2005). Third-party complaints are non-instrumental in nature (Alicke et al., 1992); they do not aim to solve the problem they address, but have a more indirect function of promoting social affiliation though affectivity and empathy (Drew and Walker, 2009; Ogden, 2010). In what follows, the term complaint will be used to refer exclusively to third-party complaints.

The social importance of complaints is implied by their frequency; it is said that individuals complain more than four times a day on average (Alicke et al., 1992). While many types of speech acts can lead to a strengthening of social bonds, complaints appear to directly serve this purpose; this is accomplished through long, interactive sequences in which the complainer negotiates the affiliation of their listener (Drew and Walker, 2009; Selting, 2010). Complaints are usually defined by tightly bounded topics with a clear beginning and end (Drew, 1998), often used as ice-breakers or conversation openers (Boxer, 1993; Kowalski, 2002). Complainers may open with an initial complaint to probe the affiliative response of their listener, which will determine the course of the negotiation (Traverso, 2009). Ultimately, it is the listener who chooses whether or not to collaborate and affiliate with the speaker (Edwards, 2005; Selting, 2010).

Beyond describing a negative situation, a core function of complaints is to provide evidence of how the speaker feels about the situation (Drew, 1998). In order to gain affiliation, a complaint should allow the listener to share the affective state of the speaker and empathize with them (AcuƱa-Ferreira, 2002; Edwards, 2005). Since most complaints describe a past event involving felt pain or distress, it is unlikely that the speaker is fully experiencing these emotions as they complain; rather, these expressions may be viewed as instances of ā€œreconstructed affectā€ (Selting, 2010). Complaining is thus an emotive or expressive speech act (Scarantino, 2017), a type of social performance in which the speaker intentionally displays affective markers to achieve interactive goals (Caffi and Janney, 1994; AcuƱa-Ferreira, 2002). These markers are the negotiating products of a complaint, informing the listener of the complainer’s emotions (Edwards, 2005) and sharing these emotions through mood contagion (Kowalski, 2002). The affective component of a complaint is usually more important than the object of the complaint itself, from which the interaction can drift off while remaining a complaining collaboration (Edwards, 2005; Traverso, 2009).

The Emotions as Social Information (EASI) model (Van Kleef, 2009) provides a useful framework for investigating the perceptual and social dimensions of complaining speech in greater depth. The EASI model emphasizes that affective displays are more than biological symptoms and can be used to influence others by triggering inferential processes and affective reactions. For complaints to succeed (i.e., promote social affiliation and strengthen bonds), complainers and listeners need to effectively display, perceive and respond to communicative signals of affect and emotivity, which are frequently marked through a complainer’s voice, or speech prosody. Here, we refer to prosody as suprasegmental acoustic features of speech - pitch, loudness, voice quality, rhythm - that speakers modulate, intentionally or not, to express meanings, emotions, and attitudes in their voice (Pell, 2001; Scherer, 2003). The manner in which prosody is used in complaining interactions and its impact on listeners has seldom been explored.

According to Brunswikian lens models of speech, the emotions of speakers are encoded by a constellation of acoustic cues that are then decoded by listeners into emotional representations (Brunswik, 1956; Grandjean et al., 2006; Laukka et al., 2016). A number of studies have reported that vocal expressions of basic emotion (e.g., anger, sadness, happiness) show specific patterns of pitch, loudness, rhythm, and voice quality that yield successful recognition of these emotions by listeners (see Juslin and Laukka, 2003 and Scherer and BƤnziger, 2004 for reviews). However, vocal changes are not always symptoms of the speaker’s internal emotional state; for example, prosody can be intentionally used as an expressive device to elicit empathy in the listener, allowing interactants to experience (Aziz-Zadeh et al., 2010; Rodero, 2011) and understand (Regenbogen et al., 2012; Ong et al., 2018) the speaker’s feelings. This combination of affective and inferential processing of prosody provides the speaker with important emotional influence and bolsters supportive behavior from the listener, with potential social benefits for both parties (Van Kleef, 2009; Pell and Kotz, 2020).

It has been reported that prosodic features of complaints signal increased affectivity through elevated mean fundamental frequency and frequency variability, syllable elongation, and emphatic accentuations (AcuƱa-Ferreira, 2002; Ogden, 2010; Selting, 2010; Rao, 2013). In emotional contexts, these acoustic changes are often associated with negative and high arousal emotions, like anger, sadness, surprise and indignation (Drew, 1998; Selting, 2010). Complaints may also be viewed as expressions of pain and suffering, which are associated with specific forms of vocal expression (Lerner et al., 2016; Raine et al., 2019). The present study is based on a large set of complaining utterances that display many of the acoustic and emotional properties described above, as well as voice quality patterns that resemble expressions of simulated pain (Mauchand and Pell, n.d. a, under review; Raine et al., 2019).

While the role of prosody in communicating the emotive involvement of complainers is heavily suggested, most of the literature on complaints comes from the pragmatics field, based largely on descriptive and qualitative analyses of conversations (Boxer, 1993; Drew, 1998; AcuƱa-Ferreira, 2002; Edwards, 2005; Traverso, 2009; Ogden, 2010; Selting, 2010; Rao, 2013). No experimental investigation has been conducted to establish how prosody affects the perception of complaints, especially with respect to other lexical or contextual cues that complainers often provide. As mentioned above, the emotive involvement of the speaker is often more important than the object of the complaint, meaning that even innocuous topics can be the focus of valid complaints (Boxer, 1993). Still, the preference of complainers to provide specific descriptions (Alicke et al., 1992), expletives (Drew, 1998), and extreme-case formulation (Selting, 2010) suggest that complaining emotive interactions depend on both linguistic and paralinguistic cues, albeit in an unclear manner.

The integration of prosodic and verbal affective signals and their combined impacts on social perception can be complex. The relative effects of cues in each channel may vary at different stages of perception, processing, and evaluation (Paulmann and Kotz, 2008; Pell et al., 2011; Meconi et al., 2018), and likely depend heavily on task demands (Regenbogen et al., 2012) and the emotional salience of cues (Wambacq and Jerger, 2004). In expressive speech acts, the role of prosody is traditionally described as an indirect, illocutionary force that can only convey meaning with the appropriate verbal statement (Grice, 1989; Wichmann, 2000). However, recent studies suggest that prosody alone can reveal the intentions of a speaker in a powerful manner (Hellbernd and Sammler, 2016; Caballero et al., 2018; Truesdale and Pell, 2018). For example, in motivating and persuasive speech, prosody can ā€œtagā€ verbal information as important and increase the persuasiveness of a speaker even when the verbal information is not credible (Zougkou et al., 2017; Van Zant and Berger, 2020). Prosody is thus an important emotive and persuasive device in low-involvement communicative situations (Gelinas-Chebat and Chebat, 1992), which is often the case of third-party complaints (Alicke et al., 1992; Boxer, 1993).

The use of affect as social information further depends on a number of social-relational factors, such as cultural display rules, familiarity, or group biases (Van Kleef, 2009). Indeed, if the traditional view of emotions as genuine biological responses could imply a universal consistency in their expression (Frick, 1985; Ekman et al., 1987), describing affective displays as social tools implies investigating how social and cultural contexts affect their usage (Van Kleef, 2009; Scarantino, 2017). Several studies already suggest that despite a basic universality, emotional communication can be affected by cultural in-group advantages (Elfenbein and Ambady, 2002), depend on cultural proximity (Laukka et al., 2016) and seem to mainly affect positive rather than negative emotions (Sauter et al., 2010; Scherer et al., 2011; see Laukka and Elfenbein, 2020 for a review). Often, out-group accent perception does not impede how well emotions are recognized but does affect perceived intensity, empathic arousal or physiological responses from listeners (Soto and Levenson, 2009; Mac et al., 2010; Thierry et al., 2015). Beyond emotions, a speaker’s accent is a marker of identity: the information (or lack thereof) that it carries is known to interfere with speech processing (Floccia et al., 2006; Sumner and Samuel, 2009), create biases and stereotypes (Kuiper, 2005; Lev-Ari and Keysar, 2010; Heblich et al., 2015), and affect the appraisal of diverse pragmatic cues (Yuan et al., 2019; Jiang et al., 2020). Cultural factors may thus affect numerous stages of production, perception, and interpretation of emotive speech.

Complaining appears to be a convention rooted in a number of cultures. Be it the French se plaindre (Traverso, 2009), the Australian whinge (Edwards, 2005), the German Jammern (Winchatz, 2016), or the Israeli kiturim (Katriel, 2013), many societies have defined complaining as a cultural custom, each with their own specificities and social implications. These potential cultural specificities raise the question of what constitutes a complaint across cultures. Yet, few studies have directly investigated the cross-cultural aspect of complaints. An investigation by Rao (2013) reported that Mexican Spanish complaints showed intonational variation typical of European Spanish complaints, but in a more accentuated manner. Similarly, Mauchand and Pell, n.d. b, under review reported that Canadian French (QuƩbƩcois) and European French complaints show strong acoustic similarities but sometimes differ in the weight given to certain prosodic cues and the emotional representations they convey. Parallel work on direct complaints also show some pragmatic differences between native and non-native complaints (Trenchs, 1994; Kraft and Geluykens, 2002). Beyond the definition of complaints, these acoustic differences could affect cross-cultural understanding of complaining speech, individuals being potentially more sensitive to emotive prosodic signals from their own group. To date, work which sheds light on these questions has not been undertaken.

The goal of the present study was to give insight on how third-party complaints are perceived from affective prosody and other cues that mark a speaker’s ā€œcomplaining intentions,ā€ using the Emotions as Social Information (EASI) model as a general reference (Van Kleef, 2009). Furthermore, we explored the role of social-relational variables in this context by studying two francophone groups: French (i.e., European French) and QuĆ©bĆ©cois (i.e., French Canadian). While mutually intelligible, these two groups have different cultural backgrounds and distinct accents, thus allowing the isolation of cultural group (dis)advantages in the processing of complaints in the absence of language barriers. French and QuĆ©bĆ©cois participants listened to pre-validated utterances that varied in prosody, verbal content, and speaker accent, and evaluated ā€œhow complainingā€ each utterance sounded. The study also investigated the relationship between encoding and decoding processes by analyzing how the perception of expressive speech acts, such as complaints, is driven by particular acoustic features of vocal affect signals. It was predicted that a speaker’s tone of voice would be the main marker of a complaining intention, especially when verbal cues did not convey high emotive involvement, i.e., when speakers complained about innocuous rather than explicitly pain-related topics. The detection of complaints was expected to depend on how the speakers produced emotive signals, especially through modulation of voice pitch and other emotion-related cues, which are likely to mediate the effect of complaining prosody on participant’s evaluations. Finally, it was predicted that social-relational factors would influence complaint perception: participants were expected to discriminate complaints from neutral utterances better for speakers of their own cultural group, potentially because of underlying biases and/or specific display rules associated with complaining speech.

Materials and Methods

Participants

Power analyses for mixed models (Judd et al., 2016) were performed to determine the required number of participants. Large effects of prosody and verbal content were reported in previous studies with similar procedures (Caballero et al., 2018; Mauchand et al., 2020). Due to the large number of stimuli (n = 320), less than 25 participants were required to attain power over 99% for these effects. The effect of culture, if present, would be smaller based on previous cross-cultural studies that have used recognition tasks (Elfenbein and Ambady, 2002; Liu et al., 2015; Laukka et al., 2016; Jiang et al., 2018). Based on an effect size of 0.3 with intercepts and slopes variances of 0.1, a minimum sample size of 57 participants would be required to achieve power of 90% for this variable.

In total, 31 French and 27 QuĆ©bĆ©cois participants, aged 18–35, with no hearing or neurological impairment were recruited in the MontrĆ©al area. French participants were born in France, had lived in France until at least 18, and had arrived less than 3 years ago in MontrĆ©al (for study or work). QuĆ©bĆ©cois participants were born and lived in QuĆ©bec (a French-speaking province in Canada) until age 18 and had never lived in France or another francophone country. All participants spoke French as their mother tongue.

Data about participants’ personality and cultural attitudes were collected through a number of tests and questionnaires (see Mauchand and Pell, n.d. a, under review for a full report on these measures). Accent-based implicit biases were measured through a modified Implicit Association Test (Greenwald et al., 1998) consisting of Pleasant and Unpleasant words presented together with French and QuĆ©bĆ©cois neutral utterances (Mauchand and Pell, n.d. a, under review). Explicit attitudes toward French and QuĆ©bĆ©cois populations were probed through a questionnaire based on the Stereotype Content Model (Fiske et al., 2002), composed of 20 questions about the perceived Warmth and Competence of each community. Finally, empathic abilities were assessed through the Perspective-Taking and Empathic Concern subscales of the French version of the Interpersonal Reactivity Index (Gilet et al., 2013).

Materials

Materials were created and validated in a previous study focusing on the acoustic dimensions of speech complaints (Mauchand and Pell, n.d. b, under review). Stimuli were short spoken utterances describing a past event, constructed in the form of token sets (each composed of 4 unique utterances). A token set was built around a root sentence that was manipulated in two ways. First, we modified the verbal content by modifying the last word of the statement, to refer to a neutral event, e.g., ā€œIl a dit que j’étais sorti/He said I was outsideā€ (Control condition) or a socially painful event for the speaker, e.g., ā€œIl a dit que j’étais stupide/He said I was stupidā€ (Pain condition). The list of sentences, together with their English translation, can be found in the Appendix. For each type of statement, we then manipulated the form of prosodic expression: speakers uttered each sentence in a manner as if simply reporting the event (Neutral condition) or as if complaining to a friend (Complaint condition). One token set was thus composed of 4 utterances with different Statement/Prosody combinations: Control/Neutral, Control/Complaint, Pain/Neutral, Pain/Complaint. Forty-two token sets were thereby created.

Initially, 672 utterances were produced by 4 French and 4 QuƩbƩcois speakers (2 males and 2 females in each group) in order to modulate accent/sociocultural features of the stimuli. Recordings were digitally captured in a sound-attenuated chamber with a high-quality head-mounted microphone onto a Tascam recorder (sampling rate of 44.1 kHz, 16-bit, mono, wav format). They were then edited in Praat (Boersma and van Heuven, 2001) into short.wav audio files and normalized to a peak intensity of 70 dB.

A short validation study was conducted to ensure the quality of the recordings and to select a subset of the stimuli for the current study. Ten French (5 males, 5 females, age: M = 21.1, sd = 3.8) and 9 QuƩbƩcois (3 males, 6 females, age: M = 23.00, sd = 2.78) participants listened to all utterances from their own group (n = 336) and evaluated: (1) whether an utterance was a complaint (yes/no); and (2) if it was a complaint, its intensity of expression on a 5-point scale. Results of the validation task are displayed in Table 1. Pain/Complaint utterances were almost unanimously considered complaints with high intensity ratings, while Control/Neutral utterances were very rarely considered complaints. Results for Pain/Neutral and Control/Pain utterances suggest that prosody had a larger impact than statement type on the perception of complaints.

TABLE 1

Utterance/Unselected utterances
Selected utterances
prosody type(n = 352)(n = 320)
Proportion of YES answers to ā€œis the person complaining?ā€Intensity ratinga (1 to 5)Proportion of YES answers to ā€œis the person complaining?ā€Intensity ratinga (1 to 5)
QuƩbƩcois
Control/Neutral0.26 (0.27)1.80 (0.87)0.21 (0.20)1.89 (0.99)
Control/Complaint0.81 (0.21)2.94 (0.91)0.81 (0.19)2.89 (0.70)
Pain/Neutral0.60 (0.21)1.74 (0.63)0.64 (0.21)1.69 (0.63)
Pain/Complaint0.96 (0.07)3.15 (0.84)0.98 (0.05)3.34 (0.57)
French
Control/Neutral0.23 (0.16)1.53 (0.54)0.25 (0.15)1.43 (0.45)
Control/Complaint0.86 (0.13)3.45 (0.78)0.87 (0.13)3.37 (0.62)
Pain/Neutral0.57 (0.16)1.64 (0.38)0.53 (0.18)1.67 (0.43)
Pain/Complaint0.95 (0.06)3.47 (0.60)0.94 (0.08)3.46 (0.52)

Results of the validation/selection task, by speaker group (mean + standard deviation).

aNote that the rating is only made when answering YES to the previous question.

For the present study, a subset of utterances was selected to minimize the repetition of sentences in the experiment, to remove potential outliers, and to ensure that stimuli were representative of the speakers’ intentions (complaining vs. neutral) according to listeners from their own group. For each speaker, a token set was selected if there was enough consensus that the Control/Neutral utterance was NOT a complaint and that the Pain/Complaint utterance was indeed a complaint with high intensity ratings. To avoid selection bias on the prosody/statement effects, results for ā€œincongruentā€ utterances were not taken into account for the selection. Moreover, each speaker had a ā€œmirrorā€ speaker (of the same sex) in the other cultural group that uttered exactly the same token sets, such that each token set was present exactly once in each group. This selection process did not affect the overall perceptual quality of the stimulus set, as scores for selected and unselected items remained close. In total, there were 2 Accents Ɨ 4 Speakers Ɨ 10 Token Sets Ɨ 2 Prosodies Ɨ 2 Statements = 320 selected utterances.

Acoustic measures for each of the 320 selected utterances were collected using the Geneva Minimalistic Acoustic Parameter Set/GeMAPS (Eyben et al., 2016) package from the publicly available openSMILE toolkit (Eyben et al., 2010). The GeMAPS constitutes a reliable standardized baseline set of affect-related acoustic measures (for more details on the computation and implementation of the measures, see Eyben et al., 2016). A full acoustic analysis of all 672 stimuli is presented in a previous study (Mauchand and Pell, n.d. a, under review). The present study focuses on measures of pitch, voice quality, and rhythm known to be perceptually relevant in complaint production (AcuƱa-Ferreira, 2002; Rao, 2013) and other related modes of emotional expression (Laukka et al., 2016; Raine et al., 2019). Note that since the volume of stimuli was normalized for perception, intensity-related acoustic measures could not be reliably extracted for consideration in the present study. The following acoustic measures were computed as a mean measure over the full duration of each utterance:

  • •

    F0, the fundamental frequency, indexing pitch on a logarithmic semitone scale. Considering the importance of pitch in complaints, both the mean (F0 M) and the rescaled standard deviation (F0SD) over the utterance were computed.

  • •

    Jitter, indexing aperiodicity (instability) of the F0 signal – voice ā€œcreakinessā€

  • •

    Shimmer, the difference of the peak amplitudes of consecutive F0 periods, indexing voice roughness in dB

  • •

    Harmonics-to-Noise Ratio (HNR), indexing the relative amount of additive noise in the voice

  • •

    F1, first formant center frequency in Hertz

  • •

    Utterance duration and final word duration in seconds (computed on Praat).

These measures are summarized in Table 2.

TABLE 2

French
QuƩbƩcois
NeutralComplaintNeutralComplaint
F0 M28.43 (4.43)34.13 (3.57)28.93 (5.23)34.28 (6.14)
F0 SD0.14 (0.06)0.15 (0.08)0.17 (0.09)0.20 (0.08)
HNR6.64 (2.4)9.22 (1.66)6.40 (2.59)7.83 (2.94)
Jitter0.05 (0.02)0.04 (0.02)0.06 (0.03)0.05 (0.03)
Shimmer1.31 (0.42)1.06 (0.35)1.35 (0.44)1.18 (0.34)
F1535.98 (87.65)554.86 (74.77)534.44 (61.28)559.84 (60.59)
Duration1.16 (0.26)1.30 (0.30)1.42 (0.29)1.46 (0.39)
Final word duration0.38 (0.12)0.45 (0.14)0.47 (0.15)0.56 (0.17)

Summary of acoustic measures for the selected stimuli for each speaker group (Mean + standard deviation).

F0 M, mean fundamental frequency; F0 SD, rescaled standard deviation of the fundamental frequency; HNR, Harmonics-to-Noise Ratio; F1, first formant center frequency.

Procedure

Each participant was presented all 320 selected stimuli in a fully randomized order using Cedrus Superlab 5 software. The stimuli were divided in 8 blocks of 40 utterances, with a self-monitored break between each block. After presentation of an utterance, participants answered the question ā€œĆ€ quel point cette personne est-elle en train de se plaindre?ā€ (How much is this person complaining?) on a 7-point Likert scale ranging from Pas du tout (Not at all) to ƉnormĆ©ment (Very much) by pressing a button on a response box. No time limit was set. Participants were not given any indication or strategy on how to form their answer and were told that there was no right or wrong answer. The whole experiment lasted a little more than 1 h.

Results

Main Model

Participant’s ratings were analyzed through a Linear Mixed Effect Model using the R package lme4 (Bates et al., 2015). T-tests and p-values were computed with Satterthwaite’s approximation using lmerTest package (Kuznetsova et al., 2017). The model was built with the participant’s Response (0–6) as the response variable, and Participant Culture (French/QuĆ©bĆ©cois), Speaker Accent (French/QuĆ©bĆ©cois), Statement (Control/Pain), and Prosody (Neutral/Complaint) as predictors. All 2- and 3-way interactions were also entered as predictor terms. Participant and Speaker/TokenSet were added as random intercepts: TokenSet was nested within Speaker, such that Speaker was one random intercept and the interaction between Speaker and TokenSet was another random intercept, thus accounting for the variability of speakers and the variability of token sets within each speaker. Additionally, Culture, Statement and Prosody were added as uncorrelated by-Speaker/TokenSet slopes, and Accent, Statement and Prosody were added as uncorrelated by-Participant slopes.

The model accounted for 70% of the variance in the data (r2 = 0.70). The model revealed a significant effect of Content (β = 0.91, se = 0.13, t(19.13) = 7.04, p < 0.001), suggesting that when speakers provided linguistic evidence of a painful situation (Pain vs. Control statement), ratings increased by almost 1 point on the scale. A larger effect of Prosody was observed (β = 2.38, se = 0.21, t(12.82) = 11.38, p < 0.001); statements expressed in a complaining versus neutral tone tended to increase ratings by more than 2 points. Speaker accent was associated with a marginal, yet noticeable effect, as statements produced in the Québécois accent tended to be rated stronger exemplars of complaints than those produced in the French accent (β = 0.68, se = 0.30, t(6.20) = 2.27, p = 0.063). This trend was informed by another marginally significant pattern in the data, representing a 3-way interaction of Participant Culture, Speaker Accent and Prosody (β = 0.35, se = 0.18, t(13.71) = 1.98, p = 0.068). The effects of Prosody (Complaint > Neutral) on complaint perception tended to be greater when Québécois participants were listening to the Québécois accent. No other term showed a significant effect (ps > 0.1). Results are summarized in Table 3; Content and Prosody effects are detailed in Figure 1.

TABLE 3

French participants
QuƩbƩcois participants
AccentStatementProsodyM (SD)M (SD)
FrenchControlNeutral0.92 (0.66)0.89 (0.63)
Complaint3.28 (0.85)3.27 (1.03)
PainNeutral1.94 (0.95)1.70 (1.10)
Complaint3.96 (0.74)3.91 (0.79)
QuƩbƩcoisControlNeutral1.32 (0.72)1.22 (0.77)
Complaint3.88 (0.85)4.22 (1.07)
PainNeutral2.72 (0.85)2.37 (1.01)
Complaint4.76 (0.62)4.90 (0.62)

Mean rating of ā€œhow much the speaker is complainingā€ by French and QuĆ©bĆ©cois listeners, according to the speaker’s accent, prosody, and the type of statement (0–6 scale).

FIGURE 1

Follow-up analyses were run to further investigate the relative effects of lexical and prosodic manipulations on the participant’s responses. Looking at the model’s random slopes reveals important variance in these two predictors (0.82 for the slope of Prosody by Participant, 0.45 for the slope of Statement by Participant). A large negative correlation between the two slopes was found (r = āˆ’0.52), indicating that participants with greater Prosody coefficients tended to have smaller Statement coefficients (see Figure 2). Note that a possible outlier showing extreme coefficients can be seen on Figure 2 but removing this participant from the analysis did not affect results. Correlations were then calculated between the predicted random effects of Prosody and Content and IRI scores, revealing a medium correlation between a participant’s predicted Prosody effect and their score on the Perspective Taking scale (r = 0.21), but not the Empathic Concern scale (r = 0.06). This pattern was mirrored in correlations with the predicted Statement effect, although to a much lesser extent (PT scale: r = āˆ’0.13; EC scale: r = āˆ’0.01). These results suggest that participants who were more sensitive to complaining prosody (especially those with greater perspective-taking skills) relied less on the actual statements.

FIGURE 2

Analyses also probed the effect of speaker accent and assessed whether this effect could be driven by more specific social-relational factors such as cultural attitudes. A measure of implicit cultural bias was derived from a customized version of the IAT, designed to measure implicit attitudes held by French and QuĆ©bĆ©cois toward speakers of each group (Greenwald et al., 1998; Mauchand and Pell, n.d. a, under review) Based on the IAT D-score, the predicted random effects for each participant showed no particular relationship with implicit biases toward speakers of each cultural group (r = 0.04) nor with any of the Stereotype Content Model scores for either group (French Warmth: r = āˆ’0.05; French Competence: r = 0.04; QuĆ©bĆ©cois Warmth: r = 0.10; QuĆ©bĆ©cois Competence: r = 0.07). This suggests that accent effects were not strongly driven by implicit or explicit cultural biases.

Mediation of Prosody Effects by Acoustic Parameters

To determine how the effect of prosody on complaint perception relates to specific acoustic properties of the voice, a regression-based mediation analysis with multiple mediators was run following Vanderweele and Vansteelandt (2014). This method accounts for potential relationships between mediators and prevents any effect overlap and redundancies of running several individual mediation analyses. Acoustic parameters described in the Methods section were selected as mediators (see Table 2). The measures from each utterance were standardized by subtracting the mean and dividing by the standard deviation of all utterances. The mediation analysis was thus performed with Prosody as the treatment variable, Response as the outcome variable, and the eight acoustic parameters as mediators.

First, to assess how the treatment variable Prosody affected the mediators, eight linear regressions were run, each with a mediator as the response variable and Prosody as the predictor. Then, to evaluate the effects of the treatment and mediators on the outcome, a multiple linear regression was run with Response as the response variable and the treatment (Prosody) and all eight mediators as predictors. The direct effect of Prosody is given by its coefficient in the latter regression model; the indirect effect of Prosody through a given mediator is given by the product of the effect of Prosody on this mediator and the effect of the mediator on the Response; the total indirect effect of prosody is given by the sum of all such mediated effects.

The speaker’s mode of prosodic expression had significant effects on each mediator: compared to neutral statements, complaints showed increased F0M (β = 0.49, se < 0.01, t = 76.79, p < 0.001), increased F0SD (β = 0.11, se < 0.01, t = 15.39, p < 0.001), reduced shimmer (β = āˆ’0.26, se < 0.01, t = āˆ’37.16, p < 0.001), reduced jitter (β = āˆ’0.06, se < 0.01, t = āˆ’8.39, p < 0.001), longer utterance duration ((β = 0.09, se < 0.01, t = 17.91, p < 0.001), and final word duration (β = 0.25, se < 0.01, t = 34.56, p < 0.001), increased HNR (β = 0.38, se < 0.01, t = 55.22, p < 0.001), and increased F1 (β = 0.15, se < 0.01, t = 21.09, p < 0.001). In turn, participant’s Response/ratings were positively affected by F0M (β = 1.62, se = 0.05, t = 29.52, p < 0.001), Jitter (β = 0.25, se = 0.03, t = 7.49, p < 0.001), and utterance duration (β = 0.20, se = 0.03, t = 6.51, p < 0.001), and negatively affected by Shimmer (β = āˆ’0.16, se = 0.04, t = āˆ’4.22, p < 0.001), HNR (β = āˆ’0.78, se = 0.06, t = āˆ’12.77, p < 0.001), and F1 (β = āˆ’0.19, se = 0.03, t = āˆ’6.20, p < 0.001. No effect of F0SD (β < 0.06, se = 0.03, t = 1.67, p = 0.094) or final word duration (β < āˆ’0.02, se = 0.03, t = āˆ’0.81, p = 0.420) were found. As shown in Figure 3, F0M was by far the greatest mediator of Prosody on Response (β = 0.79), followed by shimmer (β = 0.04) and utterance duration (β = 0.02). Meanwhile, the mediations of HNR (β = āˆ’0.29), F1 (β = āˆ’0.03), and jitter (β = āˆ’0.02) were negative. Most of the Prosody effect was not linearly mediated by acoustic measures, as the total indirect effect of Prosody (β = 0.54) accounted for much less of the total effect (β = 2.38). The mediation model is illustrated in Figure 3.

FIGURE 3

Discussion

Our results provide experimental evidence supporting the literature on complaints, emotive communication, and vocal affect. As elaborated below, they emphasize the important role of prosody in conveying emotive information in communication and its relationship to other message-level (e.g., lexical) and social-relational (e.g., cultural) dimensions of social interaction.

Affective Prosody, Effective Complaint

The core of the study measured how listeners evaluate the complaining nature of utterances in different situations combining several factors. The manipulation of the speaker’s prosody was revealed to have the largest effect on listener’s evaluations; everything else controlled for, switching from a ā€œneutralā€ to a complaining tone of voice led to a marked increase in whether statements were judged to be a complaint. This finding parallels the ability to recognize basic emotions and evaluate speaker arousal from vocal expressions (Scherer, 2003; Grandjean et al., 2006), extending this ability to the general perception of a speaker’s emotivity in discourse. Through the speaker’s intention to foreground speaker affect (Arndt and Janney, 1991; Caffi and Janney, 1994), our results show that complaints can be discriminated from vocal signals without requiring complex attitudinal inferences from situational context (Wichmann, 2000). This exemplifies the use of emotional expressions as a social tool, providing listeners with affective information that allows them to make inferences and to (voluntarily) share the speaker’s emotive state (Van Kleef, 2009; Scarantino, 2017). Here, prosody appears to be the main device in the collaborative treatment of affectivity (Drew, 1998; Selting, 2010), constituting a relatively direct and effective way for listeners to assess a complainer’s subjective state (AcuƱa-Ferreira, 2002; Edwards, 2005). It can be said that detecting the emotivity of the speaker is a crucial first step in complaining interactions; by allowing listeners to recognize complaints, prosody is likely to play a key role in facilitating the affiliative and empathic response of listeners (Boxer, 1993; Traverso, 2009).

Prosody was not the only way speakers could influence listener’s evaluation of complaints. Utterances that described an explicitly painful situation were perceived as stronger exemplars of complaints than statements which did not. Affective words and sentences are known to affect a listener’s perception of emotions in speech (Pell et al., 2011; Regenbogen et al., 2012; Rigoulot et al., 2020). However, this effect did not interact with prosody and was small enough that control statements spoken in a complaining tone were perceived as more complaining than pain-related sentences in a neutral tone. This confirms an important characteristic of complaints: how we complain is more important than what we complain about, and one can virtually complain about anything (Alicke et al., 1992; Boxer, 1993). Still, the description of a past situation that would typically be associated with (social) pain can facilitate the perception of an utterance as a complaint; this factor is likely to play a role in how complaining interactions unfold in spontaneous interactions.

Interestingly, the perceptual weight given to the statement seemed to be greater when prosody was less efficient; listeners who were less sensitive to prosodic signals could presumably compensate by relying on the more tangible, explicit nature of verbal information (Zougkou et al., 2017). The relative weighting of prosodic and linguistic information can be partially explained by listener’s empathic abilities; individuals with heightened perspective-taking skills (or cognitive empathy), relied more on prosody and less on the verbal statement. In contrast, participants with greater empathic concern (or affective empathy) did not show such associations. These results are congruent with the nature of the task, which required understanding the speaker’s intention; in this context, the interpretation of the displayed affect would have been driven by inferential rather than affective processes (Van Kleef, 2009). Future research using other designs such as self-ratings or physiological measures (de Vignemont and Singer, 2006; Lang et al., 2011; Kanske et al., 2015) could further distinguish affective from inferential processes in empathy and assess how listeners actually share a complainer’s affective state from prosody.

From Acoustic Signals to Emotive Representations

The manipulation of prosody in our study allowed us to determine to what extent these cues are instrumental for listeners to recognize the speaker’s intent to complain; however, it does not explain which acoustic cues drive these judgments and how they do it. Prosody researchers who have adopted a Brunswikian approach have stressed that while emotion encoding and decoding have been widely covered by the literature in a separate manner, investigations that combine both processes are lacking (Juslin and Laukka, 2003; Scherer, 2003; Grandjean et al., 2006). Acoustic analyses of the present stimuli had revealed a number of parameters that speakers seem to manipulate in order to convey their complaints (Mauchand and Pell, n.d. b, under review). In particular, increased mean F0 and F0 variability, decreased shimmer, increased Harmonics-to-Noise ratio, and lengthened final word were widely used acoustic strategies to communicate complaints. The multiple mediation analysis performed here assessed if and how these parameters were actually used by listeners in their evaluations.

Results of the mediation analysis suggest that mean F0 was by far the most important acoustic parameter in mediating the effect of Prosody; complainers increased their mean pitch, which was perceived as more complaining by listeners. Fundamental frequency is known to be the most directly accessible marker of affect for listeners, and is modulated in both a discrete and continuous manner to express basic emotions (Frick, 1985; Grandjean et al., 2006; Eyben et al., 2016) and attitudes (Jiang and Pell, 2017; Caballero et al., 2018; Mauchand et al., 2018; Truesdale and Pell, 2018). Increased F0 mean also marks non-aggressivity and is central to affiliative behaviors as described by the Frequency code (Ohala, 1984; Gussenhoven, 2004), which could explain its central importance in the production and perception of complaints.

Differences in voice quality also showed notable patterns in mediating the effect of prosody on complaint recognition. Compared to neutral speech, complaints displayed reduced shimmer, increased HNR, and to a lesser extent reduced jitter, indicating that speakers employed a less rough, less creaky and less noisy voice when they were complaining. Evidence of increased voice control (Latoszek et al., 2018) while complaining is also characteristic of simulated but not natural pain (Lautenbacher et al., 2017; Raine et al., 2019). Interestingly, HNR and Jitter negatively mediated the participant’s response, suggesting that listeners may perceive that complaints are not genuine but reconstructed displays of affect (Selting, 2010). This impression may also explain why even complaints with pain-related statements were rarely evaluated using the highest points on the scale. In addition, reduced shimmer was associated with a slight increase in complaint ratings, possibly due to the importance of this acoustic marker in detecting sadness (Juslin and Laukka, 2003). Increased F0SD and Final World Duration, which were associated with complaining prosody, did not significantly affect listener’s judgments in the current study. It should be borne in mind that complaints occur in complex interactions, and the role of prosodic features may not be limited to signaling an emotive intent. Dynamic variations in pitch and rhythm, which mark the emphatic structure of speech (among others), could instead help to coordinate the upcoming interaction and indicate how the collaborative treatment of affect should proceed (Selting, 1994; Szczepek Reed, 2011). Also, the fact that effort-related parameters, such as higher F1 and larger F0 variation (Traunmüller and Eriksson, 2000), had little or even negative effects on the perception of complaints reaffirms that successful complaints are conveyed through affiliative signals (as per the Frequency code), rather than effort-derived meanings (as per the Effort code) (Ohala, 1984; Gussenhoven, 2004).

It is important to note that while a portion of the prosody effect on complaint perception was mediated by specific forms of acoustic change, a large part of the effect remains unexplained in the model. As our selected acoustic parameters cover many of the core acoustic features of utterances (except loudness), it is unlikely that entering more parameters as mediators would significantly increase the proportion of the mediated effect. Instead, it appears that the transformation of acoustic signals into an emotive representation is not a linear process that can be fully decomposed. In the context of our task, it is likely that the apparent contrast between neutral and complaining prosody allowed a discrete categorization of the two utterance types; the relative salience of certain parameters (such as pitch or vocal noise) could then further modulate the perception of utterances within each category.

Social-Relational Factors in Emotive Communication

While evaluations of complaints relied mainly on prosodic and lexical information, the cultural manipulation of this experiment had a marginal, but still noteworthy, impact on perceptual judgments. Overall, QuƩbƩcois speakers were rated as producing stronger (i.e., more prototypical) complaints than French speakers, and there was a strong trend for QuƩbƩcois listeners to recognize complaining prosody better when produced by other QuƩbƩcois speakers.

In a previous study (Mauchand and Pell, n.d. b, under review), differences between French and QuƩbƩcois complaints were reported at both the acoustic and perceptual level, motivating our continued interest in how socio-cultural variables influence complaint perception. In that study, we found that QuƩbƩcois speakers, when complaining, used greater pitch variability and distinct rhythmic patterns than French speakers and were perceived as angrier and more surprised (as opposed to sad for the French speakers, Mauchand and Pell, n.d. b, under review). Of key interest, QuƩbƩcois speakers used a harsher voice quality than French speakers when producing complaints (reduced HNR). Here, the mediation analysis revealed that HNR reduced the intensity of the perceived complaints; the harsher vocal quality of QuƩbƩcois speakers might thus have facilitated the detection of complaints by certain listeners. This facilitation was enhanced at the in-group level, as QuƩbƩcois listeners seemed more attuned to prosodic contrasts produced by other QuƩbƩcois speakers. This finding suggests the existence of cultural display rules and in-group advantages in emotive speech communication as is the case for the expression of emotions (Elfenbein and Ambady, 2002). However, the absence of a similar in-group advantage for the French group suggests this effect might depend on the interplay of individual, cultural and contextual factors. For example, the exposure of our French participants to the QuƩbƩcois culture in this study could have reduced potential in-group advantages for that group. However, French participants were very recent immigrants in QuƩbec, and most of them reported having very few QuƩbƩcois people in their social and professional circles. On the other hand, QuƩbƩcois participants reported having more French acquaintances, and are frequently exposed to French-accented speech from an early age (Kircher, 2012). Thus, the asymmetry in cultural effects may alternatively be due to a lack of sensitivity of French participants to the more expressive QuƩbƩcois complaining style.

While the decoding of emotive cues in the voice may be enhanced for certain in-group interactions, this facility does not seem to originate from cultural bias or prejudice. No relationship was found between the effect of accent and either implicit or explicit biases toward either group, even though such biases exist between French and QuĆ©bĆ©cois communities (Auger and Valdman, 1999; Kircher, 2012; Mauchand and Pell, n.d. a, under review). While stereotypes and prejudice do affect neural activity (Quadflieg and Macrae, 2011; Jiang et al., 2018) and affective empathy (Xu et al., 2009; Contreras-Huerta et al., 2014), they often don’t impede speech comprehension and affect recognition (Gill, 1994; Lev-Ari and Keysar, 2010; Thierry et al., 2015). Thus, accent effects may instead arise from processing issues and/or differential use of prosodic signals. Even then, the potential impact of accent cues were minimal when compared to the efficacy of both speaker groups to convey a complaining intention through prosody. These results thus reveal a strong consistency of speakers in intentionally using emotions as social signals and of listeners to infer their intentions in the case of complaints. This inference process can be subtly modulated by social-relational factors, such as the culturally normative usage of certain prosodic cues (Elfenbein and Ambady, 2002; Laukka et al., 2016; Scherer, 2003; Van Kleef, 2009). Other factors not taken into account here may also play an important role in natural complaint perception: here, the absence of context, visual cues, or a true indication of the social proximity between speaker and listener might explain why evaluations of complaints rarely reached the end of the scale. Sex/gender is also often mentioned as an important factor in complaining (AcuƱa-Ferreira, 2002; Selting, 2010); anecdotally, speaker sex was tentatively added as a parameter in our model, but did not show any significant effect (although this could be due to the small number of male/female speakers in our experiment). Future studies should investigate how a wider range of these social factors influence inferential and affective processes underlying emotive speech communication.

Conclusion

As the first perceptual investigation of complaining speech, the present study reaffirms the central role of prosody as a social device to foreground the emotive state of the speaker. The effective production and appraisal of emotive features in the voice denote a tacit understanding between speaker and listener on how complaints are performed, which depends on the capacity of listeners to detect these signals and collaborate with the social goals of the speaker (i.e., to commiserate and co-complain). Listeners also use linguistic evidence describing the nature and/or antecedents of a complaint when evaluating these speech acts, although these cues may be less diagnostic than prosodic contrasts for determining when a speaker intends to complain (and seek social affiliation and support). As such, complaints can be qualified as acts of manipulation without deception, similar to other emotive acts like persuasion, motivation or charismatic speech: intentional displays of emotion that regulate the dispositional affect of listeners and promote social affiliation. This metapragmatic understanding of human affect, central to speaker/listener relationships, needs to be systematically considered in future investigations of speech, attitudes, and emotions (Pell and Kotz, 2020). Including social-relational factors, such as cultural relationships, is crucial to advance perspectives in this literature; future work should investigate how more distant cultures communicate complaints and other types of emotive meanings. Experimental approaches that include empathic assessments, neurophysiological measures, or which study group interactions would also produce valuable evidence to build on theoretical frameworks describing emotive communication, affect, and prosody.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. Data and materials can be found on the OpenScience Framework (Foster and Deardorff, 2017; https://osf.io/9az68/?view_only=9a5529fc1a0645aa9b17f6589a79e848).

Ethics statement

The studies involving human participants were reviewed and approved by the Faculty of Medicine and Health Sciences Institutional Review Board (McGill IRB). The patients/participants provided their written informed consent to participate in this study.

Author contributions

MM designed the experiment, collected, analyzed and interpreted data, and drafted and developed the manuscript. MP supervised the experiment and provided critical revisions of the manuscript. Both authors contributed to the article and approved the submitted version.

Funding

This research was supported by an Insight Grant to MP from the Social Sciences and Humanities Research Council of Canada (G245592 SSHRC 435-2017-0885) as well as scholarships to MM from McGill Faculty of Medicine, the Integrated Program in Neuroscience at McGill, and the Quebec Bio-Imaging Network (QBIN/RBIQ).

Acknowledgments

The authors would like to thank Emmanuelle Beauvais-Lacasse for her help in recruiting and testing participants.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    AcuƱa-FerreiraV. (2002). Gendered emotive displays in complaint discourse.Estud. Sociolingüística3139–172.

  • 2

    AlickeM. D.BraunJ. C.GlorJ. E.KlotzM. L.MageeJ.SederholmH.et al (1992). Complaining behavior in social interactions.Pers. Soc. Psychol. Bull.18286–295.

  • 3

    ArndtH.JanneyR. W. (1991). Verbal, prosodic, and kinesic emotive contrasts in speech.J. Pragmat.15521–549. 10.1016/0378-2166(91)90110-J

  • 4

    AugerJ.ValdmanA. (1999). Letting french students hear the diverse voices of francophony.Mod. Lang. J.83403–412. 10.1111/0026-7902.00030

  • 5

    Aziz-ZadehL.ShengT.GheytanchiA.AlemanA. (2010). Common premotor regions for the perception and production of prosody and correlations with empathy and prosodic ability.PLoS One5:e8759. 10.1371/journal.pone.0008759

  • 6

    BatesD.MachlerM.BolkerB. M.WalkerS. C. (2015). Fitting linear mixed-effects models using Lme4.J. Stat. Softw.671–48. 10.18637/jss.v067.i01

  • 7

    BoersmaP.van HeuvenV. (2001). Speak and UnSpeak with praat.Glot Int.5341–347.

  • 8

    BoxerD. (1993). Social distance and speech behavior: the case of indirect complaints.J. Pragmat.19103–125. 10.1016/0378-2166(93)90084-3

  • 9

    BrunswikE. (1956). Perception and the Representative Design of Psychological Experiments.Berkeley, CA: Univ of California Press.

  • 10

    CaballeroJ. A.VergisN.JiangX.PellM. D. (2018). The sound of Im/Politeness.Speech Commun.10239–53. 10.1016/j.specom.2018.06.004

  • 11

    CaffiC.JanneyR. W. (1994). Toward a pragmatics of emotive communication.J. Pragmat.22325–373. 10.1016/0378-2166(94)90115-5

  • 12

    Contreras-HuertaL. S.HielscherE.SherwellC. S.RensN.CunningtonR. (2014). Intergroup relationships do not reduce racial bias in empathic neural responses to pain.Neuropsychologia64263–270. 10.1016/j.neuropsychologia.2014.09.045

  • 13

    CulpeperJ.TerkourafiM. (2017). ā€œPragmatic approaches (Im)politeness,ā€ in The Palgrave Handbook of Linguistic (Im)Politeness, edsCulpeperJ.HaughM.KĆ”dĆ”rD. (London: Palgrave Macmillan), 11–39. 10.1057/978-1-137-37508-7_2

  • 14

    de VignemontF.SingerT. (2006). The empathic brain: how, when and why?Trends Cogn. Sci.10435–441. 10.1016/j.tics.2006.08.008

  • 15

    DrewP. (1998). Complaints about transgressions and misconduct.Res. Lang. Soc. Interact.31295–325. 10.1080/08351813.1998.9683595

  • 16

    DrewP.WalkerT. (2009). Going too far: complaining, escalating and disaffiliation.J. Pragmat.412400–2414. 10.1016/j.pragma.2008.09.046

  • 17

    EdwardsD. (2005). Moaning, whinging and laughing: the subjective side of complaints.Discourse Stud.75–29. 10.1177/1461445605048765

  • 18

    EkmanP.FriesenW. V.O’SullivanM.ChanA.Diacoyanni-TarlatzisI.HeiderK.et al (1987). Universals and cultural differences in the judgments of facial expressions of emotion.J. Pers. Soc. Psychol.53712–717. 10.1037/0022-3514.53.4.712

  • 19

    ElfenbeinH. A.AmbadyN. (2002). On the universality and cultural specificity of emotion recognition: a meta-analysis.Psychol. Bull.128203–235. 10.1037/0033-2909.128.2.203

  • 20

    EybenF.SchererK. R.SchullerW.SundbergJ.AndrE.BussoC.et al (2016). The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing.IEEE Trans. Affect. Comput.7190–202. 10.1109/TAFFC.2015.2457417

  • 21

    EybenF.WƶllmerM.SchullerB. (2010). ā€œOpenSMILE – the munich versatile and fast open-source audio feature extractor,ā€ in MM’10 – Proceedings of the ACM Multimedia 2010 International Conference (New York, NY: ACM Press), 1459–1462. 10.1145/1873951.1874246

  • 22

    FiskeS. T.CuddyA. J. C.GlickP.XuJ. (2002). A model of (Often Mixed) stereotype content: competence and warmth respectively follow from perceived status and competition.J. Pers. Soc. Psychol.82878–902. 10.1037//0022-3514.82.6.878

  • 23

    FlocciaC.GoslinJ.GirardF.KonopczynskiG. (2006). Does a regional accent perturb speech processing?J. Exp. Psychol. Hum. Percept. Perform.321276–1293. 10.1037/0096-1523.32.5.1276

  • 24

    FosterE. D.DeardorffA. (2017). Open science framework (OSF).J. Med. Libr. Assoc.105203–206. 10.5195/JMLA.2017.88

  • 25

    FrickR. W. (1985). Communicating emotion: the role of prosodic features.Psychol. Bull.97412–429. 10.1037/0033-2909.97.3.412

  • 26

    Gelinas-ChebatC.ChebatJ.-C. (1992). Effects of two voice characteristics on the attitudes toward advertising messages.J. Soc. Psychol.132447–459. 10.1080/00224545.1992.9924724

  • 27

    GiletA.-L.MellaN.StuderJ.GrühnD. (2013). Assessing dispositional empathy in adults: a French validation of the interpersonal reactivity index (IRI).Can. J. Behav. Sci.4542–48. 10.1037/a0030425

  • 28

    GillM. M. (1994). Accent and stereotypes: their effect on perceptions of teachers and lecture comprehension.J. Appl. Commun. Res.22348–361. 10.1080/00909889409365409

  • 29

    GrandjeanD.BƤnzigerT.SchererK. R. (2006). Intonation as an interface between language and affect.Prog. Brain Res.156235–247. 10.1016/S0079-6123(06)56012-1

  • 30

    GreenwaldA. G.McGheeD. E.SchwartzJ. L. K. (1998). Measuring individual differences in implicit cognition: the implicit association test.J. Pers. Soc. Psychol.741464–1480. 10.1037/0022-3514.74.6.1464

  • 31

    GriceH. P. (1989). ā€œLogic and conversation,ā€ in Studies in the Way of Words, ed.GriceP. (Cambridge, MA: Harvard University Press), 1–13. 10.1093/acprof:oso/9780195133004.003.0001

  • 32

    GussenhovenC. (2004). The Phonology of Tone and Intonation.Cambridge: Cambridge University Press, 10.1017/CBO9780511616983

  • 33

    HeblichS.LameliA.RienerG. (2015). The effect of perceived regional accents on individual economic behavior: a lab experiment on linguistic performance, cognitive ratings and economic decisions.PLoS One10:e0124732. 10.1371/journal.pone.0113475

  • 34

    HellberndN.SammlerD. (2016). Prosody conveys Speaker’s intentions: acoustic cues for speech act perception.J. Mem. Lang.8870–86. 10.1016/j.jml.2016.01.001

  • 35

    JiangX.Gossack-KeenanK.PellM. D. (2020). To believe or not to believe? How voice and accent information in speech alter listener impressions of trust.Q. J. Exp. Psychol.7355–79. 10.1177/1747021819865833

  • 36

    JiangX.PellM. D. (2017). The sound of confidence and doubt.Speech Commun.88106–126. 10.1016/j.specom.2017.01.011

  • 37

    JiangX.SanfordR.PellM. D. (2018). Neural architecture underlying person perception from in-group and out-group voices.NeuroImage181582–597. 10.1016/J.NEUROIMAGE.2018.07.042

  • 38

    JuddC. M.WestfallJ.KennyD. A. (2016). Experiments with more than one random factor: designs, analytic models, and statistical power.Annu. Rev. Psychol.68601–625. 10.1146/annurev-psych-122414-033702

  • 39

    JuslinP. N. (2013). ā€œVocal affect expression: problems and promises,ā€ in Evolution of Emotional Communication, edsAltenmüllerE.SchmidtS.ZimmermannE. (Oxford: Oxford University Press), 252–274. 10.1093/acprof:oso/9780199583560.003.0016

  • 40

    JuslinP. N.LaukkaP. (2003). Communication of emotions in vocal expression and music performance: different channels, same code?Psychol. Bull.129770–814. 10.1037/0033-2909.129.5.770

  • 41

    KanskeP.BƶcklerA.TrautweinF. M.SingerT. (2015). Dissecting the social brain: introducing the EmpaToM to reveal distinct neural networks and brain-behavior relations for empathy and theory of mind.NeuroImage1226–19. 10.1016/j.neuroimage.2015.07.082

  • 42

    KatrielT. (2013). ā€œGriping’ as a verbal ritual in some israeli discourse,ā€ in Cultural Communication and Intercultural Contact, ed.CarbaughD. (New York, NY: Psychology Press), 10.4324/9780203812259

  • 43

    KircherR. (2012). How pluricentric is the french language? An investigation of attitudes towards quebec french compared to European French.J. French Lang. Stud.22345–370. 10.1017/S0959269512000014

  • 44

    KowalskiR. M. (2002). Whining, griping, and complaining: positivity in the negativity.J. Clin. Psychol.581023–1035. 10.1002/jclp.10095

  • 45

    KraftB.GeluykensR. (2002). Complaining in French L1 and L2: a cross-linguistic investigation.Eurosla Yearb.2227–242. 10.1075/eurosla.2.14kra

  • 46

    KuiperL. (2005). Perception is reality: parisian and provencal perceptions of regional varieties of French1.J. Sociolinguistics928–52. 10.1111/j.1360-6441.2005.00280.x

  • 47

    KuznetsovaA.BrockhoffP. B.ChristensenR. H. B. (2017). LmerTest package: tests in linear mixed effects models.J. Stat. Softw.821–26. 10.18637/jss.v082.i13

  • 48

    LaforestM. (2002). Scenes of family life: complaining in everyday conversation.J. Pragmat.341595–1620. 10.1016/S0378-2166(02)00077-2

  • 49

    LangS.YuT.MarklA.MüllerF.KotchoubeyB. (2011). ā€œHearing others’ pain: neural activity related to empathy.Cogn. Affect. Behav. Neurosci.11386–395. 10.3758/s13415-011-0035-0

  • 50

    LatoszekB. B. vDe BodtM.GerritsE.MarynY. (2018). The exploration of an objective model for roughness with several acoustic markers.J. Voice32149–161. 10.1016/j.jvoice.2017.04.017

  • 51

    LaukkaP.ElfenbeinH. A. (2020). Cross-cultural emotion recognition and in-group advantage in vocal expression: a meta-analysis.Emot. Rev.1–9. 10.1177/1754073919897295

  • 52

    LaukkaP.ElfenbeinH. A.ThingujamN. S.RockstuhlT.IrakiF. K.ChuiW.et al (2016). The expression and recognition of emotions in the voice across five nations: a lens model analysis based on acoustic features.J. Pers. Soc. Psychol.111686–705. 10.1037/pspi0000066

  • 53

    LautenbacherS.Salinas-RannebergM.NiebuhrO.KunzM. (2017). Phonetic characteristics of vocalizations during pain.Pain Rep.2:e597. 10.1097/PR9.0000000000000597

  • 54

    LernerA.CohenA.AvigalM.OshratY.BlochA.ZeiligG. (2016). ā€œSpeech prosody as a biosignal for physical pain detection,ā€ in Proceedings of the Conf Proc 8th Speech Prosody, Boston, MA, 420–424. 10.21437/SpeechProsody.2016-86

  • 55

    Lev-AriS.KeysarB. (2010). Why don’t we believe non-native speakers? The influence of accent on credibility.J. Exp. Soc. Psychol.461093–1096. 10.1016/j.jesp.2010.05.025

  • 56

    LiuP.RigoulotS.PellM. D.BernsteinL. E.TanakaA. (2015). Cultural differences in on-line sensitivity to emotional voices: comparing east and west.Front. Hum. Neurosci.9:311. 10.3389/fnhum.2015.00311

  • 57

    MacD.-K.AubergƩV.RilliardA.CastelliE. (2010). Cross-Cultural Perception of Vietnamese Audio-Visual Prosodic Attitudes.Chicago, IL: Speech Prosody.

  • 58

    MauchandM.VergisN.PellM. D. (2018). Ironic Tones of Voices.ā€ In Proceedings of the 9th International Conference on Speech Prosody 2018 (Poznan: Speech Prosody), 443–447. 10.21437/SpeechProsody.2018-90

  • 59

    MauchandM.VergisN.PellM. D. (2020). Irony, prosody, and social impressions of affective stance.Discourse Process.57141–157. 10.1080/0163853X.2019.1581588

  • 60

    MeconiF.DoroM.LomorielloA. S.MastrellaG.SessaP. (2018). Neural measures of the role of affective prosody in empathy for pain.Sci. Rep.8:291. 10.1038/s41598-017-18552-y

  • 61

    OgdenR. (2010). Prosody in Interaction.Amsterdam: John Benjamins Pub. Co, 81–104.

  • 62

    OhalaJ. J. (1984). An ethological perspective on common cross-language utilization of Fo of Voice1.Phonetica411–16. 10.1159/000261706

  • 63

    OngD. C.ZakiJ.GoodmanN. D. (2018). Computational models of emotion inference in theory of mind: a review and roadmap.Topics Cogn. Sci.11338–357. 10.1111/tops.12371

  • 64

    PaulmannS.KotzS. A. (2008). An ERP investigation on the temporal dynamics of emotional prosody and emotional semantics in pseudo- and lexical-sentence context.Brain Lang.10559–69. 10.1016/j.bandl.2007.11.005

  • 65

    PellM. D. (2001). Influence of emotion and focus location on prosody in matched statements and questions.J. Acoust. Soc. Am.1091668–1680. 10.1121/1.1352088

  • 66

    PellM. D.JaywantA.MonettaL.KotzS. a (2011). Emotional speech processing: disentangling the effects of prosody and semantic cues.Cogn. Emot.25834–853. 10.1080/02699931.2010.516915

  • 67

    PellM. D.KotzS. A. (2020). The next frontier: prosody research gets interpersonal.Emot. Rev.

  • 68

    QuadfliegS.MacraeC. N. (2011). Stereotypes and stereotyping: what’s the brain got to do with it?Eur. Rev. Soc. Psychol.22215–273. 10.1080/10463283.2011.627998

  • 69

    RaineJ.PisanskiK.SimnerJ.RebyD. (2019). Vocal communication of simulated pain.Bioacoustics28404–426. 10.1080/09524622.2018.1463295

  • 70

    RaoR. (2013). Intonational variation in third party complaints in spanish.J. Speech Sci.3141–168.

  • 71

    RegenbogenC.SchneiderD. A.FinkelmeyerA.KohnN.DerntlB.KellermannT.et al (2012). The differential contribution of facial expressions, prosody, and speech content to empathy.Cogn. Emot.26995–1014. 10.1080/02699931.2011.631296

  • 72

    RigoulotS.JiangX.VergisN.PellM. D. (2020). Neurophysiological correlates of sexually evocative speech.Biol. Psychol.154:107909. 10.1016/j.biopsycho.2020.107909

  • 73

    RoderoE. (2011). Intonation and emotion: influence of pitch levels and contour type on creating emotions.J. Voice25e25–e34. 10.1016/J.JVOICE.2010.02.002

  • 74

    SauterD. A.EisnerF.EkmanP.ScottS. K. (2010). Cross-cultural recognition of basic emotions through nonverbal emotional vocalizations.Proc. Natl. Acad. Sci. U.S.A.1072408–2412. 10.1073/pnas.0908239106

  • 75

    ScarantinoA. (2017). How to do things with emotional expressions: the theory of affective pragmatics.Psychol. Inquiry28165–185. 10.1080/1047840X.2017.1328951

  • 76

    SchererK. R. (2003). Vocal communication of emotion: a review of research paradigms.Speech Commun.40227–256. 10.1016/S0167-6393(02)00084-5

  • 77

    SchererK. R.BƤnzigerT. (2004). ā€œEmotional expression in prosody: a review and an agenda for future research,ā€ in Speech Prosody 2004. Proceedings of the 2nd International Conference on Speech Prosody, Nara, 359–366.

  • 78

    SchererK. R.Clark-PolnerE.MortillaroM. (2011). In the eye of the beholder? Universality and cultural specificity in the expression and perception of emotion.Int. J. Psychol.46401–435. 10.1080/00207594.2011.626049

  • 79

    SearleJ. R. (1965). What is a speech act.Perspectives in the Philosophy of Language: A Concise Anthology2000253–268.

  • 80

    SeltingM. (1994). Emphatic speech style?: with special focus on the prosodic signalling of heightened emotive involvement in conservation.J. Pragmat.22375–408. 10.1016/0378-2166(94)90116-3

  • 81

    SeltingM. (2010). Affectivity in conversational storytelling: an analysis of displays of anger or indignation in complaint stories.Pragmatics20229–277. 10.1075/prag.20.2.06sel

  • 82

    SotoJ. A.LevensonR. W. (2009). Emotion recognition across cultures: the influence of ethnicity on empathic accuracy and physiological linkage.Emotion9874–884. 10.1037/a0017399

  • 83

    SumnerM.SamuelA. G. (2009). The effect of experience on the perception and representation of dialect variants.J. Mem. Lang.60487–501. 10.1016/j.jml.2009.01.001

  • 84

    Szczepek ReedB. (2011). Beyond the particular: prosody and the coordination of actions.Lang. Speech5513–34. 10.1177/0023830911428871

  • 85

    ThierryG.PaulmannS.JonczykR.HatzidakiA.BausC.CostaA. (2015). The way you say it, the way i feel it: emotional word processing in accented speech.Front. Psychol.6:351. 10.3389/fpsyg.2015.00351

  • 86

    TraunmüllerH.ErikssonA. (2000). Acoustic effects of variation in vocal effort by men, women, and children.J. Acoust. Soc. Am.1073438–3451. 10.1121/1.429414

  • 87

    TraversoV. (2009). The dilemmas of third-party complaints in conversation between friends.J. Pragmat.412385–2399. 10.1016/j.pragma.2008.09.047

  • 88

    TrenchsM. (1994). Complaining in catalan, complaining in english: a comparative study of native and EFL speakers.Rev. Esp. Lingüíst. Apl.10271–288.

  • 89

    TruesdaleD. M.PellM. D. (2018). The sound of passion and indifference.Speech Commun.99124–134. 10.1016/J.SPECOM.2018.03.007

  • 90

    Van KleefG. A. (2009). How emotions regulate social life.Curr. Dir. Psychol. Sci.18184–188. 10.1111/j.1467-8721.2009.01633.x

  • 91

    Van ZantA. B.BergerJ. (2020). How the voice persuades.J. Pers. Soc. Psychol.118661–682. 10.1037/pspi0000193

  • 92

    VanderweeleT. J.VansteelandtS. (2014). Mediation analysis with multiple mediators.Epidemiol. Method295–115. 10.1515/em-2012-0010

  • 93

    WambacqI. J. A.JergerJ. F. (2004). Processing of affective prosody and lexical-semantics in spoken utterances as differentiated by event-related potentials.Cogn. Brain Res.20427–437. 10.1016/J.COGBRAINRES.2004.03.015

  • 94

    WichmannA. (2000). The Attitudinal Effects of Prosody, and How They Relate to Emotion.Newcastle: ITRW on Speech and Emotion, 143–148.

  • 95

    WichmannA. (2002). ā€œAttitudinal intonation and the inferential process. Speech Prosody 2002,ā€ in Proceedings of the 1st International Conference on Speech Prosody, (Aix-en-Provence: Speech Prosody), 11–16.

  • 96

    WinchatzM. R. (2016). ā€œJammern [Whining] as a German way of speaking,ā€ in The Handbook of Communication in Cross-Cultural Perspective, ed.CarbaughD. (New York, NY: Routledge), 10.4324/9781315709321

  • 97

    XuX.ZuoX.WangX.HanS. (2009). Do you feel my pain? racial group membership modulates empathic neural responses.J. Neurosci.298525–8529. 10.1523/JNEUROSCI.2418-09.2009

  • 98

    YuanY. C.LiaoW.BazarovaN. N. (2019). Judging expertise through communication styles in intercultural collaboration.Manage. Commun. Q.33238–271. 10.1177/0893318918824674

  • 99

    ZougkouK.WeinsteinN.PaulmannS. (2017). ERP correlates of motivating voices: quality of motivation and time-course matters.Soc. Cogn. Affect. Neurosci.121687–1700. 10.1093/scan/nsx064

Appendix

TABLE A1

Experimental Stimuli
English translation
PainControlPainControl
Ils ont tout fait sans moiIls ont tout fait sans boireThey did everything without meThey did everything without drinking
Ils sont partis sans moiIls sont partis sans barqueThey left without meThey left without a boat
Ils m’ont demandĆ© de partirIls m’ont demandĆ© de resterThey asked me to leaveThey asked me to stay
Ils refusĆ© de venir chez moiIls ont refusĆ© venir chez MarcThey decided not to come to my placeThey decided not to come to Marc’s
Ils ont refusĆ© m’inviterIls ont refusĆ© de mĆ©langerThey refused to invite meThey refused to stir
Ils ont dĆ©cidĆ© de pas m’inviterIls ont dĆ©cidĆ© de pas mĆ©langerThey decided not to invite meThey decided not to stir
Ils ont dƩcidƩ de pas jouer avec moiIls ont dƩcidƩ de pas jouer avec MarcThey decided not to play with meThey decided not to play with Marc
Ils ont dĆ©cidĆ© d’y aller sans moiIls ont dĆ©cidĆ© d’y aller sans masquesThe decided to go without meThey decided to go without masks
Il m’a choisi en dernierIl m’a choisi en deuxiĆ©meHe chose me lastHe chose me in second
Elle veut que personne m’aimeElle veut que personne marcheShe wants no one to like meShe wants no one to walk
Il me parle jamaisIl me parle jeudiHe never talks to meHe talks to me thursday
Elle a profitĆ© de moiElle a profitĆ© de l’offreShe took advantage of meShe took advantage of the sale
Il m’a fait pleurerIl m’a fait parlerHe made me cryHe made me talk
Elle m’a rĆ©pondu mĆ©chammentElle m’a rĆ©pondu normalementHe answered me harshlyHe answered me normally
Elle a dit qu’elle m’aimait pasElle a dit qu’il m’aimait bienShe said he didn’t like meShe said he appreciated me
Il veut vraiment pas de moiIl veut vraiment pas de masqueHe really doesn’t want meHe really doesn’t want a mask
Elle continue de m’ignorerElle continue de mesurerShe keeps ignoring meShe keeps measuring
Il a dit que j’étais stupideIl a dit que j’étais sortiHe said I was stupidHe said I went out
Elle a dit que j’étais saleElle a dit que j’étais jeuneShe said I was dirtyShe told me I was young
Il a dit que j’étais gros(se)Il a dit que j’étais grandHe said I was fatHe said I was tall
Elle pense que je suis mauvaiseElle pense que je suis bronzƩShe thinks I am badShe thinks I am tan
Ils pensent que je suis mƩchant(e)Ils pensent que je suis belgeThey think I am meanThey think I am belgian
Elle pense que je suis peureux (se)Elle pense que je suis parti(e)Shee thinks I am scaredShe thinks I am gone
Il pense que je suis faibleIl pense que je suis fierHe thinks I am weakHe thinks I am proud
Ils font des blagues sur moiIls font des blagues sur MarsThey made jokes about meThey make jokes about Mars
Ils font des blagues sur mon poidsIls font des blagues sur MontrƩalThey make jokes about my weightThey make jokes about Montreal
Elle dƩteste mon idƩeElle dƩteste mƩlangerShe hates my ideaShe hates stirring
Il m’a fait passer pour un fouIl m’a fait passer pour un frĆ©reHe made me look like madmanHe made me look like a brother
Elle m’a donnĆ© une claqueElle m’a donnĆ© une glaceShee gave me a slapShe gave me an ice cream
Il m’a donnĆ© un coup de piedIl m’a donnĆ© un coup de mainHe gave me a kickHe gave me help
Il m’empĆŖche de m’amuserIl m’empĆŖche de glisserHe prevents me from having funHe rpevents me from slipping
Elle m’empĆŖche de dormirElle m’empĆŖche de tomberShe prevent me from sleepingShe prevents me from falling
Il continue de me mentirIl continue de me montrerHe keeps lying to meHe keeps showing me
Elle continue de m’insulterElle continue de mĆ©langerShe keeps insulting meShe keeps stirring
Il a marchƩ sur mon chienIl a marchƩ sur mon cheminHe stepped on my dogHe stepped on my path
Elle a marchƩ sur ma mainElle a marchƩ sur ma routeShe stepped on my handShe stepped on my road
Il a frappƩ ma jambeIl a frappƩ ma balleHe kicked my legHe kicked my ball
Elle a pris ma placeElle a pris ma mainShe took my spotShe took my hand
Il m’a fait tomberIl m’a fait comprendreHe made me fallHe amde me understand
Elle veut me faire raterElle veut me faire resterShe wants to make me failShe wants to make me stay
Il essaye de m’énerverIl essaye de mesurerHe’s trying to annoy meHe’s trying to measure

Sentences constructed for the experiment with English translations. Only the bolded final words differed between the Pain and Control version of one sentence root.

Summary

Keywords

pragmatics, cross-cultural, vocal affect, emotive involvement, complaint

Citation

Mauchand M and Pell MD (2021) Emotivity in the Voice: Prosodic, Lexical, and Cultural Appraisal of Complaining Speech. Front. Psychol. 11:619222. doi: 10.3389/fpsyg.2020.619222

Received

19 October 2020

Accepted

21 December 2020

Published

18 January 2021

Volume

11 - 2020

Edited by

Oliver Niebuhr, University of Southern Denmark, Denmark

Reviewed by

Albert Rilliard, UPR3251 Laboratoire d’Informatique pour la MĆ©canique et les Sciences de l’IngĆ©nieur (LIMSI), France; Joan Ma, Queen Margaret University, United Kingdom

Updates

Copyright

*Correspondence: Maƫl Mauchand,

This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics