“Motherese” Prosody in Fetal-Directed Speech: An Exploratory Study Using Automatic Social Signal Processing

Parlato-Oliveira, Erika; Saint-Georges, Catherine; Cohen, David; Pellerin, Hugues; Pereira, Isabella Marques; Fouillet, Catherine; Chetouani, Mohamed; Dommergues, Marc; Viaux-Savelon, Sylvie

doi:10.3389/fpsyg.2021.646170

ORIGINAL RESEARCH article

Front. Psychol. , 09 March 2021

Sec. Developmental Psychology

Volume 12 - 2021 | https://doi.org/10.3389/fpsyg.2021.646170

This article is part of the Research Topic Are There Different Types of Child-Directed Speech? Dynamic Variations According to Individual and Contextual Factors View all 11 articles

“Motherese” Prosody in Fetal-Directed Speech: An Exploratory Study Using Automatic Social Signal Processing

$\nErika Parlato-Oliveira,,$ Erika Parlato-Oliveira^1,2,3

Catherine Saint-Georges^3,4

David Cohen^3,4

Hugues Pellerin^3,4

Isabella Marques Pereira¹

Catherine Fouillet⁴

Mohamed Chetouani³

Marc Dommergues⁵

Sylvie Viaux-Savelon^2,4^*

¹School of Medicine, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
²CRPMS, Université de Paris, Paris, France
³Institut des Systèmes Intelligents et de Robotiques (ISIR), équipe Perception, Interaction, et Robotiques Sociales (PIRoS), Sorbonne Université, Paris, France
⁴Service de Psychiatrie de l'Enfant et de l'Adolescent, APHP Pitié Salpêtrière, Sorbonne Université, Paris, France
⁵Service de Gynécologie Obstétrique, APHP Pitié Salpêtrière, Sorbonne Université, Paris, France

Introduction: Motherese, or emotional infant directed speech (IDS), is the specific form of speech used by parents to address their infants. The prosody of IDS has affective properties, expresses caregiver involvement, is a marker of caregiver-infant interaction quality. IDS prosodic characteristics can be detected with automatic analysis. We aimed to explore whether pregnant women “speak” to their unborn baby, whether they use motherese while speaking and whether anxio-depressive or obstetrical status impacts speaking to the fetus.

Participants and Methods: We conducted an observational study of pregnant women with gestational ages from 26 to 38 weeks. Women were recruited in a university hospital department of obstetrics. Thirty-five women agreed to participate in the study, and 26 audio records were exploitable. We collected obstetrical and sociodemographic data, pregnancy outcomes, anxiety and depressive status using the Covy and Raskin Scales, and life events using the Sensations During Pregnancy and Life Event Questionnaire. Each participant was left alone with an audio recorder with a recommendation to feel free to speak to her fetus as she would have done at home. The recording was stopped after 3 min. Audio recordings were analyzed by two methods: psycholinguist experts' annotation and computational objective automatic analyses.

Results: Most mothers (89%) reported speaking to their fetuses. We found a correlation between maternal first perceptions of fetal movements and the start of mother's speaking to fetus. Motherese prosody was detected with both annotation and automatic analysis with a significant correlation between the two methods. In this exploratory study, motherese use was not associated with maternal anxiodepressive or obstetrical status. However, the more future mothers were depressed, the less they spoke with their fetuses during the recording.

Conclusion: Fetal directed speech (FDS) can be detected during pregnancy, and it contains a period of prosody that shares the same characteristics of motherese that can be described as prenatal motherese or emotional fetal-directed speech (e-FDS). This means that pregnant women start using motherese much earlier than expected. FDS seems to be correlated with maternal first perceptions of fetal movements and depression scores. However, more research is needed to confirm these exploratory results.

Introduction

Infant-directed speech (IDS) or motherese is a specific register, which includes peculiar prosodic characteristics, that parents or caregivers often use when speaking to infants (Fernald and Simon, 1984; Fisher and Tokura, 1995; Spinelli et al., 2017). The use of motherese progressively increases as the baby grows and then usually decreases and disappears when the child becomes able to communicate verbally (Saint-Georges et al., 2013). IDS has been studied extensively across a number of interactive situations and contexts, especially by researchers interested in understanding language acquisition. IDS is also a marker of the parent infant interaction quality. Motherese characteristics have been shown to be linked with emotional prosody characteristics (Trainor et al., 2000). Behavioral studies have shown that infants prefer and respond better to motherese than to regular prosody, typical of adult directed speech (Fernald and Kuhl, 1987; Dupoux and Mehler, 1990; Saint-Georges et al., 2013; Outters et al., 2020). IDS has affective properties, expresses parental involvement, and contributes to regulating caregiver-infant interactions (Cohen et al., 2013). Thus, IDS is part of an interactive loop that may play an important role in infants' cognitive and social development (Saint-Georges et al., 2013). Experimental data suggest that very young infants in their first month of life (Cooper and Aslin, 1990; Cooper, 1993) or in their first week (Ramus, 1999) and even neonates (Saito et al., 2007) are sensitive to this prosody.

The in utero period has been less explored. A recent study (Bartha-Doering et al., 2019) suggested that neural discrimination of speech begins in utero. Some reports also show that future mothers speak to their fetus (DeCasper et al., 1994). In addition, parents observing their fetuses during ultrasound prenatal screening were shown to present mirroring movement activities (Ammaniti et al., 2010). These studies suggest that motherese may already be present in the prenatal period and may be associated with mother involvement and emotional tone regarding prenatal interactions. However, motherese has not been clearly demonstrated during the prenatal period.

With the development of automatized methods of social signal processing, machine-learning methods can now detect the acoustic characteristics of emotional speech, such as motherese, in the human voice. It can distinguish motherese sequences from adult-directed speech (Mahdhaoui et al., 2011). Traditionally, the design of computerized classifiers aims to capture supra-segmental features like pitch (fundamental frequency), duration, energy of vocalizations as well as global dynamics of spectrum (Williams and Stevens, 1972; Sherer, 1986; Chetouani et al., 2014). Mel frequency cepstral coefficients (MFCC) capture short-term dynamics of spectrum and are termed segmental features. The motherese detection algorithm system exploits the combination of two classifiers, segmental and supra-segmental, that are weighted and fused to reach best classification rates (Mahdhaoui et al., 2011). Previous works have shown that these methods can contribute to exploring parent-infant interactions in video/audio recordings in natural or experimental settings (Cohen et al., 2013; Weisman et al., 2016; Bourvis et al., 2018). Moreover, these automatic analyses of motherese have contributed to state the universality of the emotional prosodic characteristics of IDS across languages (Parlato et al., 2020).

Here, we describe an exploratory observational study based on interviews and audio recording of volunteer pregnant women with the following aims: (1.1) to determine whether pregnant women speak to their unborn baby, (1.2) if so, to determine, if they speak using motherese prosody or not, with two methods (1.3) with prosody analyzed by clinical experts, and (1.4) using computational analysis of speech with machine learning method. In addition, (2) we will assess if prenatal stress, obstetrical or fetal complications, and future mother emotional state would influence the quantity and characteristics of mother's prosody (Watson et al., 2002; Viaux-Savelon et al., 2012).

Methods

Participants and Ethics

From September 2013 to January 2014, we proposed to pregnant women attending the prenatal clinic of the Pitié-Salpêtrière University Hospital in Paris, France, to participate in a survey on maternal speech. They received oral and written information explaining that their participation would require filling out self-questionnaires, answering questions related to their emotional status, and being audiotaped when speaking to their future baby. The study was approved by the local Ethical Committee (CPPIDF6) under the number n°09012014. All participants gave a written consent. The inclusion criteria were mothers aged 18 or above, with a gestational age of 26–38 weeks, and able to understand the protocol. Indeed, during this period of pregnancy, future mothers are less concerned by the fetus viability. They begin preparing their meeting and relationship with the future child with more dreams and more detailed representations about the future baby (Ammaniti et al., 2000). Provided women were fluent in French, they could be included even if French was not their native language. Exclusion criteria were mental disorders and absence of health coverage according to the French ethical rules that require that studies be carried out only on people with health coverage. The mental disorder information were extracted from the obstetric record. Mental disorders were considered to be present if the pregnant woman was cared or treated for mental disorder by specialist before the pregnancy.

Data Collection

Clinical Data Collection

We collected several clinical variables. Social and demographic characteristics included age, parity, marital status, native language, education level, and occupation. We also assessed life events using the Sensations During Pregnancy and Life Event Questionnaire (Tordjman et al., 2004). Fetal-oriented interaction variables included gestational age when the pregnant women reported spontaneous moment of speaking to their infant to come and the gestational age they started perceiving fetal movement. To assess maternal anxiety-depression status, we used two specific scales. Maternal anxiety was assessed using the Covi anxiety scale, a questionnaire completed by the investigator (EP), based on clinical assessment. This score ranges from 0 (no anxiety) to 12 (high anxiety), with a threshold of six defining clinically relevant anxiety (Covi, 1986). Depression was assessed using the RASKIN score based on clinical assessment. This score ranges from 0 (no depression) to 12 (high depression), with a threshold of six defining clinically relevant depression (Raskin and Crook, 1976). In case of clinically relevant depression or anxiety, we planned that the investigator would warn the doctor or midwife in charge of the patient to organize adequate follow-up.

Finally, we retrospectively recorded medical history and obstetrical outcomes after birth based on obstetrical and neonatal records by professionals who were blinded to the audio analysis. Variables are listed in Table 1 (pregnancy and delivery sections). Breastfeeding initiation was collected as some studies have pointed out that stress events during pregnancy may influence breastfeeding initiation or duration (Evers et al., 1998; Figueiredo et al., 2013).

TABLE 1

Table 1. Description of study participants N = 35.

Audio Data Collection and Analyses

After the questionnaires were completed, the investigator invited the participant to sit in a quiet room, independent of the prenatal clinic suite. The participant was left alone with a recorder (Zoom recorder AT170 PRO-Sony) lying on a nearby table. She was asked to feel free to speak or not to her baby, as she would have done at home. The mother was taken to a quiet room with a comfortable chair and invited to speak with her fetus, if only she wanted to. The interviewer would turn on the recorder and leave the room, so as not to intimidate the mother and allow the environment to be as close as possible to the mother's usual situation with her fetus. The recording was stopped after 3 min. Audio recordings were analyzed at two levels: (i) maternal vocalization characteristics (low-level features) and (ii) affective speech analysis (high-level audio features).

Maternal Vocalization Characteristic

During a dialogue or a monolog, vocalization can be characterized by a series of quantitative parameters that allow describing the features and their dynamics. For our survey, we adapted this method to the monolog uttered by the mother, making the hypothesis that what we recorded was the equivalent of a dialogue between the mother and her unborn child. We first segmented and annotated the mothers' vocalization based on the Weisman et al. (2016) method. Two experts (EP and IR), one linguist and one speech therapist, listened to every 3-min recording. Using the ELAN EUDICO Linguistic Annotator (Institut Max-Planck, Nimègue, Nederland), they worked together to split them into segments of vocalization defined as continuous streams of speech with <150 ms of silence. Then, they labeled each segment as vocalization, laughing, singing, crying or other sounds. The maternal vocalizations consisted in the mother's recorded sound production directed or not to the fetus. For examples: “I'm very tired,” “I can't wait to see you,” “My baby, your room is ready, we await you with love,” “I am afraid about childbirth.” Maternal vocalization, maternal pause, and silence were extracted using an automated algorithm (for details see Bourvis et al., 2018). It calculated the duration of each segment and the amount of pause time within each 3-min recording, corresponding to the sum of silences >150 ms between two segments. Thus, we obtained the maternal vocalization mean duration, the vocalization number during the 3-min window, the maternal pause mean duration and the vocalization ratio of time during the 3 min.

Affective Speech Analysis

Each speech segment labeled “vocalization” by the experts was extracted as a digital audio sample, stored and submitted to affective speech analysis based on high-level audio features. The goal of this analysis was to categorize each vocalization as “motherese,” based on the presence of the emotional component of IDS, vs. “non-motherese,” which refers to prosody more typical of adult directed speech. This was achieved by two methods: expert evaluation by listening to the audio samples of the segments labeled “vocalization” and computational automatic assessment of the same digital samples.

For the manual qualitative annotation, the two experts (EPO and IMP) worked independently to assess the presence of motherese characteristics in each vocalization segment. Interrater agreement between the two independent raters was calculated on the whole sample of vocalizations and was found to equal 80%. In case of disagreement, they listened again together to the remaining segments with no agreement (20%) and reached a consensus. This method allowed us to obtain a unique manual label for each vocalization segment.

Automated labeling for motherese or non-motherese was performed using an ad hoc algorithm developed in the ISIR (Institut des Systèmes Intelligents et de Robotiques) laboratory in Paris. This motherese classifier, based on machine learning methods, uses both segmental (mel-frequency cepstrum coefficients, MFCCs) and suprasegmental (e.g., statistics with regard to fundamental frequency, energy, and duration) acoustic characteristics of speech and SVM (support vector machine) classifiers. The algorithm classifier was trained on a data set of both motherese and non-motherese. It can distinguish emotional sequences of motherese from normal speech (Mahdhaoui et al., 2011). In previous studies, it was able to identify motherese during early interaction in both experimental (Weisman et al., 2016; Bourvis et al., 2018) and natural settings (Cohen et al., 2013), in both mothers and fathers (Cohen et al., 2013; Weisman et al., 2016; Parlato et al., 2020), in various languages (Parlato et al., 2020), and in parents speaking to infants with later psychopathology (e.g., autism, Cohen et al., 2013).

Both motherese detection methods created two subclass labels of maternal vocalization: “motherese” labeled Emotional Fetal-Directed Speech (e-FDS) vs. “non-motherese” (non-e-FDS). Three variables were derived: e-FDS ratio during the 3 min, non-e-FDS ratio during the 3 min, and e-FDS/vocalization ratio (duration of “motherese” vocalization/duration of maternal vocalization).

Statistical Analysis

Statistical analyses were performed using R Software, Version 2.12.2. For all tests, the level of significance alpha was fixed at 5%. Given the sample size and the exploratory nature of the study, we used univariate analysis only. Quantitative variables were presented as the mean, standard deviation, and range. Qualitative variables were presented as frequencies.

First, we explored the correlation between maternal first perception of fetal movements and first vocalizations to the fetus.

We then successively estimated the relationship between:

(i) Variable “Vocalization ratio of time during the 3 min” and the following variables: “depression score” (score Raskin), “anxiety score” (score Covy), “life events” and “fetal risk”.

(ii) Variable ratio Emotional-Fetal Directed Speech (e-FDS)/vocalization according to psycholinguist expert and the following variables: “depression score” (score Raskin), “anxiety score” (score Covy), “life event” and “fetal risk”.

The relationship between two continuous variables was either tested using Pearson r or Spearman rho, depending on the validity of the assumptions. The relationship between a continuous and a binary variable (fetal risk) was either tested using the Welch t-test or Wilcoxon rank sum test, depending on the validity of the assumptions. Finally, we estimated the agreement between our experts' measures and the algorithm's measures using ICC (single random raters, ICC2) and calculated the 95% confidence interval (R psych package).

Results

Flow Chart

One hundred forty-five pregnant women were considered eligible from September 2013 to January 2014. Thirty-five of them agreed to participate in the study and were enrolled. Four women participated in the clinical part of the study but did not record audio data; five audio records were not exploitable because of technical difficulties. Thus, 26 audio records were used for analysis. The most frequent reason declared by the invited mothers to decline the participation to the research was the lack of available time, considering that the interview needed at least 1 h and 30 min. Indeed in order to avoid displacements, we proposed the study to mothers who were already present at the Hospital for a pregnancy consultation.

Description of the Sample (N = 35)

Table 1 summarizes the study sample in terms of socio-demographics, life events and pregnancy risk factors, delivery, psychopathology and fetal-oriented interaction variables. Mothers presented a high number of life events and significant obstetrical (60%) and medical (65.7%) history: endocrinologic conditions (N = 7), multisystemic pathologies (N = 3), neurologic disorders (N = 4), psychiatric disorders (N = 2), uterine anomaly (N = 2), social precariousness (N = 2), and cardiac anomaly (N = 1). Regarding fetal risk, we found trisomy 21 risk (N = 3), intrauterine growth retardation (N = 2), premature delivery threats (N = 2), cardiovascular anomalies (N = 1), drug exposure (N = 1), and prior history of neonatal death (N = 1). This could be related to recruitment inside a free public university hospital in a maternity unit specializing in complex cases.

Nevertheless, our sample presented a low mean level of anxiety and depression scores and a high percentage of breastfeeding compared to the French general population (68.1–70.5% in the general population with 59% exclusive breastfeeding https://drees.solidarites-sante.gouv.fr/IMG/pdf/dt68-sources_et_methodes.pdf; Kersuzan et al., 2014).

Analysis of Fetal-Oriented Interaction Variables (N = 35)

Mothers declared to start speaking or vocalizing to their fetus on average at 3.63 (±1.64) months during pregnancy. Additionally, they started perceiving fetal movement on average at 2.83 (±1.89) months during pregnancy. We found a significant correlation between speaking to the fetus and perceiving fetal movements (Figure 2, right).

Regarding audio analyses, only 26 mothers were included because of technical issues (see Figure 1 flow chart). For low-level audio analysis (quantitative speech analysis), a total of 856 vocalization segments (mean vocalization number = 32.92) were detected. The duration of the vocalizations ranged from 0 to 3.95 s during the 3-min audio record (Table 2). The vocalization ratio (vocalization time during the 3 min) ranged from 0 to 65%. Indeed, two mothers did not speak during the 3-min recording.

FIGURE 1

Figure 1. Flow chart of recruitment.

TABLE 2

Table 2. Maternal vocalization characteristics during the experiment (N = 26).

Regarding high-level audio analysis (qualitative affective speech analysis), two complementary methods were performed: a qualitative manual annotation of maternal vocalization to the fetus by two expert psycholinguists and an automatic classification. In both clinical expert and automatized classifications, we found that pregnant women when speaking to their fetus (FDS) used sometimes a specific prosody that usually characterized motherese (or emotional IDS), which we called emotional fetal directed speech (e-FDS). We called the FDS without motherese characteristics “non-e-FDS.” The automatic classification yielded a mean e-FDS ratio during the 3 min of 0.12, whereas the expert classification found a mean e-FDS ratio during the 3 min of 0.14 (Table 2). Figure 2 (left) shows the strong and significant correlation between expert and automatic classification on e-FDS recognition. We also calculated the intraclass correlation (ICC) between the “two” raters (the expert and the algorithm) and found a good and very significant ICC (ICC = 0.79 (95% CI: 0.59–0.90), p < 0.001).

FIGURE 2

Figure 2. Correlations between expert and automatic classification of emotional fetal directed speech (e-FDS, left) and between speaking to fetus and perceiving fetal movements (right).

Correlation of Maternal Audio Data With Maternofetal Characteristics and Anxiety Depression Status (N = 26)

Given the limited sample size, we used only exploratory univariate analysis to address whether some stress or psychopathological variables could influence the ability to produce e-FDS. We found no association between speaking to the fetus (whether prosody had characteristics of e-FDS or not) and being a fetus at risk during pregnancy (correlation ratio = 0.36 (±0.2) and 0.33 (±0.2), respectively, t-test, p = 0.69).

Figure 3 shows the correlation between the speaking-to-fetus ratio and the Covy anxiety score, Raskin depression score and number of maternal stressful events. As shown, we found no correlation with the Covy anxiety score, a tendential negative correlation with the number of maternal stressful events, and a significant negative correlation with the Raskin depression score (ρ = −0.4, p = 0.046), meaning that the more the future mothers were depressed during pregnancy, the less they spoke to their fetuses during the experiment.

FIGURE 3

Figure 3. Correlations between speaking to fetus and Raskin depression score (upper left), Covy anxiety ratio (upper right) and number of mother stressful life events.

We performed the same analyses using only the e-FDS/vocalization ratio. None of the variables modulated the e-FDS/vocalization ratio, meaning that the impact of the Raskin depression score was on speaking to the fetus as a whole whether the pregnant women had e-FDS prosody or not. However, there was a trend toward a negative correlation between the e-FDS/vocalization ratio and the number of maternal stressful events (rho = −0.421, p = 0.072).

Discussion

Can We Define Emotional Fetal Directed Speech (e-FDS)?

To answer this question, we proposed to address two different issues: (1) Is the mother speaking to the fetus during the experiment truly oriented toward the fetus? And (2) does FDS include some sequences that share the same prosodic characteristics of postnatal IDS (motherese)?

To address this first question (Is the mother speaking to the fetus during the experiment truly oriented toward the fetus?), we explored whether the pregnant women reported spontaneous moments of speaking and vocalization with their infant to come. For the mothers who reported doing so (n = 26), mothers started speaking or vocalizing with their fetus on average at 3.63 months during pregnancy (Table 1). Additionally, they started perceiving fetal movement on average at 2.83 months during pregnancy. This means that they could feel physically the existence of their fetus before they reported speaking to their fetus. Given the significant correlation between speaking to the fetus and perceiving fetal movement gestational ages (Figure 2, right), we can hypothesize that speaking to the fetus was indeed oriented toward the fetus. This result supports the hypothesis of a preliminary dialogue between future mothers and their fetus, as shown when mothers observed fetal movements during ultrasound scans that were interpreted by mothers as a response or solicitation from the fetus. Mirroring movements were seen as motor turn taking (Ammaniti et al., 2010). We believe that the current results on e-FDS are in the same vein and support the idea that prenatal development influences maternal infant attachment (Ammaniti et al., 2014; Feldman, 2016; Malm et al., 2016) and maternal representations of her future child (Viaux-Savelon et al., 2012, 2020).

Regarding the second question (do pregnant mothers sometimes use a motherese prosody (or here e-FDS) when speaking to their fetus?), our results show that futures mothers can use motherese prosody in their fetal-directed speech. The “manual” study of acoustic components of the voice takes a very long time and only allows the study of very short voice segments. The use of an automatic classifier allows extensive study of all vocalizations based on their acoustic characteristics and open perspectives for larger studies, and the machine learning classifier remains blind to the experiment or context. In this study, in addition to the expert “manual” categorization, the presence of e-FDS is confirmed by automatic measures that are strongly objective. Indeed, we found a strong and significant correlation between expert and automatic classification on e-FDS recognition (ρ = 0.87 p < 0.01) and a good and very significant ICC between expert and algorithm [ICC = 0.79 (95% CI: 0.59–0.90), p < 0.001]. This methodology of motherese detection using an algorithm has already shown robustness, as we have been able to distinguish motherese in early interaction with children with pathological outcome (Cohen et al., 2013), with both mothers and fathers (Weisman et al., 2016), and in five different languages (Parlato et al., 2020). Here, automatic annotation was useful to confirm that the prosody used during FDS shared the same characteristics of motherese. Manual and automatic labeling comparison realizes a validation of the two methods.

Does Maternal Anxiety Depression Status Influence the Quality and Quantity of Fetal Directed Speech?

As expected, despite the limited sample size, the results show that the more the future mothers were depressed during pregnancy, the less they spoke to their fetuses during the experiment (Figure 3). We also found a tendency for a significant negative correlation between stressful life events and the speaking-to-fetus ratio (p = 0.69). These results are contingent with previous studies that have shown the impact of maternal prenatal states. Prenatal stress, particularly concerning prenatal diagnosis, increases the level of anxiety, disrupts the emotional investment of the parents toward the fetus (Watson et al., 2002; Petersen and Jahn, 2008; Kaasen et al., 2010) and disrupts parent-infant interactions after birth (Viaux-Savelon et al., 2012). In addition, pregnant women with depressive and anxiety symptoms talk and sing less to their fetuses (Hernandez-Reif et al., 2018).

Regarding fetal-directed speech quality (e-FDS or fetus-directed motherese), we found only a trend toward a negative correlation between the e-FDS/speaking to fetus ratio and the number of maternal stressful events. Thus, a high number of stressful events may reduce mothers' affective involvement with their future infant. We know that depressed mothers of young infants are not only less likely to speak with them (Herrera et al., 2004) but also more likely to display a reduced prosody of motherese with them (Bettes, 1988; Kaplan et al., 2001). Moreover, even when depressed mothers produce motherese, their infants fail to learn in response to their own-mother infant directed speech, despite normal competence (Kaplan et al., 1999, 2002). In our study with fetuses, we found that depressed mothers speak less to their fetus, but we did not find a correlation between depression score and e-FDS ratio. However, we cannot exclude that motherese quality could be poorer and less able to prepare language acquisition. As suggested in a recent study (Bartha-Doering et al., 2019), neural discrimination of speech could begin in utero. So we could expect that depression during the end of pregnancy may have repercussions on the first steps of language acquisition. However, given the small size of our sample and the exploratory nature of the study, we cannot conclude, and further studies with larger samples would be helpful.

Anxiety and depression status and a high level of stressful life events influence at least the quantity of fetal directed speech. Therefore, the quantity of fetal directed speech may be a sign to consider when detecting depression during pregnancy. Indeed, supporting these mothers in their investment toward the fetus and the future infant is compulsory for the prevention of later psychopathology (Mazzeschi et al., 2015; Røhder et al., 2020).

Finally, we found no significant association between speaking to the fetus (whether prosody had characteristics of e-FDS or not) and having a fetus at risk during pregnancy. This was not our hypothesis. However, the mothers' and fetuses' medical and obstetrical history of our population is very heterogeneous in this small sample size, and all risks may not be similar. In addition, the gestational age of the stressful event could also influence the impact on maternal representations and involvement. Again, a larger study would be necessary to better explore these factors with a comparison group according to the type of stress factor (e.g., mother complication/fetal complication/others) (Viaux-Savelon et al., 2012, 2020; Pisoni et al., 2016; Cuijlits et al., 2019).

Study Limitations

As noted above, our sample was scarce (N = 26) and did not permit us to draw conclusions regarding the effects of various complex factors, such as maternal anxiety depression status or fetus risk. As many women declined participation in the study, we must discuss whether future mothers who agreed to participate could be more susceptible to speaking to their fetus than future mothers who declined participation. This study is only exploratory and used an experimental context. We also need to confirm that speaking to fetus also occurs spontaneously in more ecological contexts (e.g., at home). This might be achievable with automatic recording using portable devices for example. Additionally, we did not perform multivariate models to explore how relevant variables are robustly correlated or not. Nevertheless, it is important to note that one mother who declared before the audio recording she did not usually speak to her fetus actually spoke a lot to her during the recording. This may suggest that speaking to her fetus may be a widespread phenomenon.

Conclusion

Fetal directed speech (FDS) can be detected during pregnancy, and it contains a period of prosody that shares the same characteristics of motherese that can be described as prenatal motherese or emotional fetus-directed speech (e-FDS). This means that pregnant women start using motherese much earlier than expected. FDS seems to be correlated with maternal first perceptions of fetal movements and depression scores. Although this study was exploratory, our results show that the more future mothers were depressed, the less they spoke to their fetuses during pregnancy. Therefore, the quantity of fetal directed speech may represent a useful sign for clinicians to detect prenatal depression and maternal involvement during pregnancy. However, more research (e.g., larger sample; prospective design with several timeline measures during pregnancy) is needed to confirm these exploratory results. Automatic audio detection and social signal processing should enable larger studies that explore prenatal emotional involvement with future infants.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by the local Ethical Committee (CPPIDF6) under the number n°09012014. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

DC, SV-S, MD, and MC designed the study. CF, MD, EP-O, IP, and SV-S recruited the participants and assessed both obstetrical and psychological data. EP-O, SV-S, and CF performed the experiments. EP-O and IP assessed motherese prosody. CS-G, DC, and MC performed the automatic signal processing. DC and HP performed the statistical analysis. EP-O, DC, MD, and SV-S wrote the first draft of the manuscript. All authors contributed to the final version of the manuscript.

Funding

This study was supported by the CAPES fund of Ministry of Education of Brazil and the Centre d'Activité et de Recherche en Psychiatrie Infanto-Juvénile (CARPIJ).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Ammaniti, A., Mazzoni, S., and Menozzi, F. (2010). Ecographia in gravidanza: studio della co-genitorialità/ultrasound during pregnancy: a study on co-parenting. Infanz. Adolesc. 9, 151–157. doi: 10.1710/535.6404

CrossRef Full Text | Google Scholar

Ammaniti, M., Tambellini, R., and Perucchini, P. (2000). De la grossesse à la période post-accouchement: instabilité et évolution des représentations maternelles. Devenir 12, 57–74.

Google Scholar

Ammaniti, M., Trentini, C., Menozzi, F., and Tambelli, R. (2014). “Transition to parenthood: studies of intersubjectivity in mothers and fathers,” in Early Parenting and Prevention of Disorder: Psychoanalytic Research at Interdisciplinary Frontiers, eds R. Emde and M. L. Bohleber (London: Routledge: Karnac Books), 129–164.

Google Scholar

Bartha-Doering, L., Alexopoulos, J., Giordano, V., Stelzer, L., Kainz, T., Benavides-Varela, S., et al. (2019). Absence of neural speech discrimination in preterm infants at term-equivalent age. Dev. Cogn. Neurosi. 39:100679. doi: 10.1016/j.dcn.2019.100679

PubMed Abstract | CrossRef Full Text | Google Scholar

Bettes, B. A. (1988). Maternal depression and motherese: temporal and intonational features. Child Dev. 59, 1089–1096. doi: 10.2307/1130275

PubMed Abstract | CrossRef Full Text | Google Scholar

Bourvis, N., Singer, M., Saint Georges, C., Bodeau, N., Chetouani, M., Cohen, D., et al. (2018). Pre-linguistic infants employ complex communicative loops to engage mothers in social exchanges and repair interaction ruptures. R. Soc. Open Sci. 5:e170274. doi: 10.1098/rsos.170274

PubMed Abstract | CrossRef Full Text | Google Scholar

Chetouani, M., Mahdhaoui, A., and Ringeval, F. (2014). Time-scale feature extractions for emotional speech characterization. Cogn. Comput. 1, 194–201. doi: 10.1007/s12559-009-9016-9

CrossRef Full Text | Google Scholar

Cohen, D., Cassel, R., Saint-Georges, C., Mahdhaoui, A., Laznik, M. C., Apicella, F., et al. (2013). Do parentese prosody and fathers' commitment facilitate social interaction in infants who later develop autism? PLoS ONE 8:e61402. doi: 10.1371/journal.pone.0061402

CrossRef Full Text | Google Scholar

Cooper, R. P. (1993). The effect of prosody on young infants' speech perception. Adv. Infancy Res. 8, 137–167.

Google Scholar

Cooper, R. P., and Aslin, R. N. (1990). Preference for infant-directed speech in the first month after birth. Child Dev. 61, 1584–1595. doi: 10.2307/1130766

PubMed Abstract | CrossRef Full Text | Google Scholar

Covi, L. (1986). New concepts and treatments for anxiety. Md. Med. J. 35:821.

Google Scholar

Cuijlits, I., van de Wetering, A. P., Endendijk, J. J., van Baar, A. L., Potharst, E. S., and Pop, V. J. M. (2019). Risk and protective factors for pre- and postnatal bonding. Infant Ment. Health J. 40, 768–785. doi: 10.1002/imhj.21811

PubMed Abstract | CrossRef Full Text | Google Scholar

DeCasper, A. J., Lecanuet, J. P., Busnel, M. C., Granier-Deferre, C., and Maugeais, R. (1994). Fetal reactions to recurrent maternal speech. Infant Behav. Dev. 17:159164. doi: 10.1016/0163-6383(94)90051-5

CrossRef Full Text

Dupoux, E., and Mehler, J. (1990). Naître Humain. Paris: Odile Jacob.

Google Scholar

Evers, S., Doran, L., and Schellenberg, K. (1998). Influences on breastfeeding rates in low income communities in Ontario. Can. J. Public Health 89, 203–207. doi: 10.1007/BF03404475

PubMed Abstract | CrossRef Full Text | Google Scholar

Feldman, R. (2016). The neurobiology of mammalian parenting and the biosocial context of human caregiving. Horm. Behav. 77, 3–17. doi: 10.1016/j.yhbeh.2015.10.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Fernald, A., and Kuhl, P. K. (1987). Acoustic determinants of infant preference for motherese speech. Infant Behav. Dev. 10, 279–293. doi: 10.1016/0163-6383(87)90017-8

CrossRef Full Text | Google Scholar

Fernald, A., and Simon, T. (1984). Expanded intonation contours in mothers' speech to newborns. Dev. Psychol. 20, 104–113. doi: 10.1037/0012-1649.20.1.104

CrossRef Full Text | Google Scholar

Figueiredo, B., Dias, C., Brandão, S., Canário, C., and Nunes-Costa, R. (2013). Breastfeeding and postpartum depression: state of the art review. J. Pediatr. 89, 332–338. doi: 10.1016/j.jped.2012.12.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Fisher, C., and Tokura, H. (1995). The given-new contract in speech to infants. J. Mem. Lang. 34, 287–310. doi: 10.1006/jmla.1995.1013

CrossRef Full Text | Google Scholar

Hernandez-Reif, M., Kendrick, A., and Avery, D. M. (2018). Pregnant women with depressive and anxiety symptoms read, talk, and sing less to their fetuses. J. Affect. Disord. 15, 532–537. doi: 10.1016/j.jad.2017.12.108

PubMed Abstract | CrossRef Full Text | Google Scholar

Herrera, E., Reissland, N., and Shepherd, J. (2004). Maternal touch and maternal child-directed speech: effects of depressed mood in the postnatal period. J. Affect. Disord. 81, 29–39. doi: 10.1016/j.jad.2003.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaasen, A., Helbig, A., Malt, U. F., Naes, T., Skari, H., and Haugen, G. (2010). Acute maternal social dysfunction, health perception, and psychological distress after ultrasonographic detection of a fetal structural anomaly. BJOG 117, 1127–1138. doi: 10.1111/j.1471-0528.2010.02622.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaplan, P. S., Bachorowski, J.-A., Smoski, M. J., and Zinser, M. (2001). Role of clinical diagnosis and medication use in effects of maternal depression on infant-directed speech. Infancy 2, 537–548. doi: 10.1207/S15327078IN0204_08

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaplan, P. S., Bachorowski, J.-A., and Zarlengo-Strouse, P. (1999). Child-directed speech produced by mothers with symptoms of depression fails to promote associative learning in 4-month-old infants. Child Dev. 70, 560–570. doi: 10.1111/1467-8624.00041

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaplan, P. S., Bachorowski, J. A., Smoski, M. J., and Hudenko, W. J. (2002). Infants of depressed mothers, although competent learners, fail to learn in response to their own mothers' infant-directed speech. Psychol. Sci. 13, 268–271. doi: 10.1111/1467-9280.00449

PubMed Abstract | CrossRef Full Text | Google Scholar

Kersuzan, C., Gojard, S., Tichit, C., Thierry, X., Wagner, S., Nicklaus, S., et al. (2014). Prévalence de l'allaitement à la maternité selon les caractéristiques des parents et les conditions de l'accouchement. Résultats de l'Enquête Elfe maternité, France metropolitaine, 2011. Bull. Epidémiol. Hebd. 27, 440–449. Available online at: http://www.invs.sante.fr/neh/2014/27/2014_27_1.html

Google Scholar

Mahdhaoui, A., Chetouani, M., Cassel, R. S., Saint-Georges, C., Parlato, E., Laznik, M. C., et al. (2011). Computerized home video detection for motherese may help to study impaired interaction between infants who become autistic and their parents. Int. J. Methods Psychiatr. Res. 20, e6–e18. doi: 10.1002/mpr.332

PubMed Abstract | CrossRef Full Text | Google Scholar

Malm, M. C., Hildingsson, I., Rubertsson, C., Rådestad, I., and Lindgren, H. (2016). Prenatal attachment and its association with foetal movement during pregnancy—a population based survey. Women Birth 29, 482–486. doi: 10.1016/j.wombi.2016.04.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Mazzeschi, C., Pazzagli, C., Radi, G., Raspa, V., and Buratta, L. (2015). Antecedents of maternal parenting stress: the role of attachment style, prenatal attachment, and dyadic adjustment in first-time mothers. Front. Psychol. 6:1443. doi: 10.3389/fpsyg.2015.01443

PubMed Abstract | CrossRef Full Text | Google Scholar

Outters, V., Schreiner, M., Behne, T., and Mani, N. (2020). Maternal input and infants' response to infant-directed speech. Infancy 25, 478–499. doi: 10.31219/osf.io/mw9es

PubMed Abstract | CrossRef Full Text | Google Scholar

Parlato, E., Chetouani, M., Cadic, J. M., Viaux, S., Xavier, J., Ouss, L., et al. (2020). The emotional component of Infant directed-speech: a cross-cultural study using machine learning. Neuropsychiatr. Enfance. Adolesc. 68, 106–113. doi: 10.1016/j.neurenf.2019.10.004

CrossRef Full Text | Google Scholar

Petersen, J., and Jahn, A. (2008). Suspicious findings in antenatal care and their implications from the mothers' perspective: a prospective study in Germany. Birth 35, 41–49. doi: 10.1111/j.1523-536X.2007.00210.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Pisoni, C., Garofoli, F., Tzialla, C., Orcesi, S., Spinillo, A., Politi, P., et al. (2016). Complexity of parental prenatal attachment during pregnancy at risk for preterm delivery. J. Matern. Fetal Neonatal Med. 29, 771–776. doi: 10.3109/14767058.2015.1017813

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramus, F. (1999). “La discrimination des langues par la prosodie: modélisation linguistique et études comportementales,” in De la caractérisation à l'identification des langues, Actes de la 1^ère journé d'étude sur l'identification automatique des langues, ed F. Pellegrino (Lyon: Editions de l'Institut des Sciences de l'Homme), 186–201.

Google Scholar

Raskin, A., and Crook, T. (1976). Sensitivity of rating scales completed by psychiatrists, nurses and patients to antidepressant drug effects. J. Psychiatr. Res. 13, 31–41. doi: 10.1016/0022-3956(76)90007-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Røhder, K., Væver, M. S., Aarestrup, A. K., Jacobsen, R. K., Smith-Nielsen, J., and Schiøtz, M. L. (2020). Maternal-fetal bonding among pregnant women at psychosocial risk: the roles of adult attachment style, prenatal parental reflective functioning, and depressive symptoms. PLoS ONE 15:e0239208. doi: 10.1371/journal.pone.0239208

PubMed Abstract | CrossRef Full Text | Google Scholar

Saint-Georges, C., Chetouani, M., Cassel, R., Apicella, F., Mahdhaoui, A., Muratori, F., et al. (2013). Motherese in interaction: at the cross-road of emotion and cognition? (a systematic review). PLoS ONE 8:e78103. doi: 10.1371/journal.pone.0078103

PubMed Abstract | CrossRef Full Text | Google Scholar

Saito, Y., Aoyama, S., Kondo, T., Fukumoto, R., Konishi, N., Nakamura, K., et al. (2007). Frontal cerebral blood flow change associated with infant-directed speech. Arch. Dis. Child. Fetal Neonatal 92, F113–F116. doi: 10.1136/adc.2006.097949

PubMed Abstract | CrossRef Full Text | Google Scholar

Sherer, K. R. (1986). Vocal affect expression: a review and a model for future research. Psychol. Bull. 99, 143–165. doi: 10.1037/0033-2909.99.2.143

PubMed Abstract | CrossRef Full Text | Google Scholar

Spinelli, M., Fasolo, M., and Mesman, J. (2017). Does prosody make the difference? a meta-analysis on relations between prosodic aspects of infant-directed speech and infant outcomes. Dev. Rev. 44, 1–18. doi: 10.1016/j.dr.2016.12.001

CrossRef Full Text | Google Scholar

Tordjman, S., Zenasni, F., and Granier-Deferre, C. (2004). Presentation and Validation of the Sensations During Pregnancy and Life Events Questionnaire. Aix-en-Provence: International Society for Developmental Psychobiology.

Trainor, L. J., Austin, C. M., and Desjardins, R. N. (2000). Is infant-directed speech prosody a result of the vocal expression of emotion? Psychol. Sci. 11, 188–195. doi: 10.1111/1467-9280.00240

PubMed Abstract | CrossRef Full Text | Google Scholar

Viaux-Savelon, S., Decherf, M., Bodeau, N., Ville, Y., Marey, Y., Cohen, C., et al. (2020). Foetal chromosomal microarray genetic screening for minor ultrasound anomalies affects maternal representations and emotional state: an exploratory study. Devenir 32, 105–177. doi: 10.3917/dev.202.0105

CrossRef Full Text | Google Scholar

Viaux-Savelon, S., Dommergues, M., Rosenblum, O., Bodeau, N., Aidane, E., Philippon, O., et al. (2012). Soft markers detected at fetal scan induce perturbation of early mother-infant interactions. PLoS ONE 7:e30935. doi: 10.1371/journal.pone.0030935

CrossRef Full Text

Watson, M., Hall, S., Langford, K., and Marteau, T. (2002). Psychological impact of the detection of soft markers on routine ultrasound scanning: a pilot study investigating the modifying role of information. Prenat. Diagn. 22, 569–575. doi: 10.1002/pd.373

PubMed Abstract | CrossRef Full Text | Google Scholar

Weisman, O., Chetouani, M., Saint-Georges, C., Bourvis, N., Delaherche, E., Zagoory-Sharon, O., et al. (2016). Dynamics of non-verbal vocalizations and hormones during father-infant interaction. IEEE Trans. Affect. Comput. 7, 337–345. doi: 10.1109/TAFFC.2015.2478468

CrossRef Full Text | Google Scholar

Williams, C. E., and Stevens, K. N. (1972). Emotions and speech: some acoustic correlates. J. Acoust. Soc. Am. 52, 1238–1250. doi: 10.1121/1.1913238

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: motherese, prenatal, mother-fetus interaction, fetal-directed speech, machine learning, social signal processing

Citation: Parlato-Oliveira E, Saint-Georges C, Cohen D, Pellerin H, Pereira IM, Fouillet C, Chetouani M, Dommergues M and Viaux-Savelon S (2021) “Motherese” Prosody in Fetal-Directed Speech: An Exploratory Study Using Automatic Social Signal Processing. Front. Psychol. 12:646170. doi: 10.3389/fpsyg.2021.646170

Received: 25 December 2020; Accepted: 15 February 2021;
Published: 09 March 2021.

Edited by:

Chiara Suttora, University of Bologna, Italy

Reviewed by:

Paola Zanchi, University of Milano-Bicocca, Italy
Eon-Suk Ko, Chosun University, South Korea

Copyright © 2021 Parlato-Oliveira, Saint-Georges, Cohen, Pellerin, Pereira, Fouillet, Chetouani, Dommergues and Viaux-Savelon. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sylvie Viaux-Savelon, c3lsdmllLnZpYXV4QGljbG91ZC5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

“Motherese” Prosody in Fetal-Directed Speech: An Exploratory Study Using Automatic Social Signal Processing

Introduction

Methods

Participants and Ethics

Data Collection

Clinical Data Collection

Audio Data Collection and Analyses

Maternal Vocalization Characteristic

Affective Speech Analysis

Statistical Analysis

Results

Flow Chart

Description of the Sample (N = 35)

Analysis of Fetal-Oriented Interaction Variables (N = 35)

Correlation of Maternal Audio Data With Maternofetal Characteristics and Anxiety Depression Status (N = 26)

Discussion

Can We Define Emotional Fetal Directed Speech (e-FDS)?

Does Maternal Anxiety Depression Status Influence the Quality and Quantity of Fetal Directed Speech?

Study Limitations

Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good