- Department of Spanish, Linguistics, and Theory of Literature (Linguistics), Faculty of Philology, University of Seville, Seville, Spain
Introduction
Language is probably the most distinctive of human traits. We are endowed with a species-specific capacity for generating complex arrays of symbols that encode equally complex thoughts, which can be in turn shared with others to fulfill different purposes: informing, manipulating, socializing, amusing, and many others. Language is usually externalized via speech, although deaf people use gestures for conveying linguistic meanings. How language evolved has been a serious concern for many disciplines, from linguistics to anthropology to human history. Specifically, it is not clear when we started to use sounds for sharing our linguistic thoughts. Because apes rely on gestures for voluntary information exchanges (Graham et al., 2022; Hobaiter et al., 2022), it has been argued that language evolved “from hand to mouth” (Corballis, 2002) as a result of some brain rewiring (Brown et al., 2021), also because the primate vocal tract is essentially speech-ready (Fitch et al., 2016). Nonetheless, comparative and paleoanthropological research suggests that some minor anatomical modifications in our speech organs, seemingly associated to the evolution of human-like languages, have occurred in the human lineage (Lieberman and McCarthy, 1975; Blasi et al., 2019; Dediu et al., 2021; Nishimura et al., 2022). In turn, we can expect that some of the changes experienced by our vocal system resulted from genetic and epigenetic changes. This possibility is supported by the existence of several genetic conditions impacting on our speech abilities (Sataloff, 1995; Newbury and Monaco, 2010; Morgan, 2013), but also by the finding that the regions that are differentially methylated in modern humans compared to Neanderthals and Denisovans are enriched in genes associated with face and vocal tract anatomy (Gokhman et al., 2020). Nonetheless, we still lack a good understanding of how and why selected genetic and epigenetic changes impacted on specific speech features.
Genetics of the human pitch vocal system
Gisladottir et al. (2023) have recently uncovered several common variants in the gene ABCC9 that are associated with higher voice pitch. In addition, these variants are associated with reduced expression of ABCC9 in the adrenal glands and greater pulse pressure, implicating hormonal and cardiovascular systems. Finally, according to Open Target Genetics (https://genetics.opentargets.org/), the same variants are associated with the expression of the gene in the dorsolateral prefrontal cortex in the Common Mind dataset. This brain region has experienced molecular, cellular, and structural changes in the human lineage (Falk, 2014; Ma et al., 2022), including changes in the expression pattern of FOXP2 (Ma et al., 2022), a gene linked to speech abilities (Morgan et al., 2016). Interestingly as well, most of the common variants associated with higher voice pitch uncovered by Gisladottir et al. (2023) are derived compared to apes. These authors also identified a fixed change in the coding region of the gene that occurred after our split from great apes, but before the split between modern humans and Neanderthals/Denisovans. Lastly, another region of the gene is among the set of accelerated regions in humans compared to other primates (Bi et al., 2023). All this evidence suggests that the ABCC9 gene might have been subject to positive selection in our lineage, with some potential impact, specifically, on our speech abilities, the control of stress, and aspects of brain function.
Although the distinctive attributes of human speech mostly depend on its segmental components (as vowels or consonants), suprasegmental features are also important. These include pitch, which is the perceptive quality of sound frequency. Tonal languages use changes in pitch levels and contours to convey different lexical and grammatical meanings. More generally, pitch is one key prosodic cue, which is used for marking the main structural components of an utterance (like phrases or clauses) or sentence types (like affirmative vs. interrogative). Gisladottir et al. (2023) finding is a valuable contribution to the ongoing efforts for decoding the genetics and the evolution of the human vocal system, as it provides a direct link between one specific component of human speech and one specific gene. This commentary paper is aimed to hypothesize about one possible rationale for the selection events on this gene, as part of the evolution of human language and the human species, more generally. Nonetheless, other alternative explanations for the observed genetic variations and their relationship to voice pitch are conceivable, given the multiple functions performed by ABCC9, particularly at the brain level (see Nelson et al., 2015 for a review).
A rationale for the genetic changes in ABCC9 impacting on the human pitch vocal system
Interestingly enough, the gene highlighted by Gisladottir et al. (2023) is positively selected in tamed foxes (Trut et al., 2009), which have been used as an animal model of domestication processes (Trut et al., 2009; but see Lord et al., 2020 for a critical view). Domestication usually gives rise to a distinctive set of changes in different parts of the body, the so-called “domestication syndrome” (Wilkins et al., 2014; see Sánchez-Villagra et al., 2016 for some criticism). These include modifications in the skull, the facial area, and the vocal tract (Riede and Fitch, 1999; Kruska, 2005; Trut et al., 2009; Zeder, 2012; Wilkins et al., 2014). Also, changes in vocalization patterns are commonly observed in domesticated animals, typically resulting in more complex and varied vocalizations (Corbett, 2004; Okanoya, 2017). Specifically, domestic animals can produce vocalizations with a higher pitch frequency compared to their wild conspecifics, as observed in cats (Nicastro, 2004; Yeon et al., 2011). It has been argued that these coordinated modifications of the body of domesticates result, at least in part, from changes in the hypothalamic-pituitary adrenal (HPA) axis in response to selection for tameness and increased tolerance to humans, which is usually the first step in any domestication process (Wilkins et al., 2014). Significantly, the ABCC9 variants uncovered by Gisladottir et al. (2023) are also associated, as noted, with the expression of the gene in the adrenal glands. Overall, this evidence suggests that ABCC9 might be part of the genetic architecture of domestication.
All this might be relevant for human evolution, and particularly, the evolution of speech, if one considers some accounts of the human history according to which our species experienced a process similar to animal domestication. This view is commonly referred to as the human self-domestication (HSD) hypothesis (Hare, 2017; Wrangham, 2019). In brief, it is argued, diverse external factors (living in harsher climatic conditions, the colonization of new territories, the advent of co-parenting) might have promoted a selection toward increased prosocial behavior in the hominin group and ultimately, have triggered the physiological mechanisms underlying domestication events, as sketched above, including changes in the HPA axis. In turn, this is expected to have resulted in a constellation of body, cognitive, and behavioral changes in humans that parallel domestication features in other species. In fact, the external factors that seemingly favored prosocial behaviors in humans can be expected to trigger self-domestication features in any species, as observed in bonobos (Hare et al., 2012) or elephants (Raviv et al., 2023). In the case of humans, the traits resulting from our self-domestication include modifications in the skull and the facial area that have been characterized as an increased “feminization” of inherited hominin features. These changes ultimately entail a reduction of sexual dimorphic traits and a potentiation of neotenic (i.e., childish) features. Interestingly, signs of this “feminization” appear variably during our history, with a peak during the Upper Paleolithic (Cieri et al., 2014), when modern languages, endowed with more complex phonologies and demanding more sophisticated speech abilities, have been claimed to have emerged (Benítez-Burraco and Progovac, 2020). Signs of “feminization,” and ultimately of HSD, also appear variably in present-day populations, arguably in response to differences in social conditions and behavior, including the status of women in society (Gleeson and Kushnick, 2018). A higher pitch voice could be considered a more feminine trait and accordingly, a less threatening, more self-domesticated phenotype, contrary to a lower pitch, associated to male rudeness (Aung and Puts, 2020). Levinson (2022) has discussed these high-pitched vocalizations in relation to “cuteness selection.” In essence, this process involves the generalization of mother–infant interaction patterns to all adults and ultimately, the spread of the sort of interactions that promote the acquisition of culturally transmitted systems like human languages (see Benítez-Burraco and Kempe, 2018 for discussion). It happens that mother-infant interactions also feature a higher pitch in other social species, like bottlenose dolphins (Sayigh et al., 2023). More generally, changes in vocal behavior have been observed in other primates claimed to have gone through a self-domestication process, including bonobos (Gruber and Clay, 2016), and marmoset monkeys, in which increased affiliative vocal behaviors correlate with the size of facial fur patches, a hallmark of self-domestication (Ghazanfar et al., 2020).
In our view, and summarizing the discussion above, HSD might have accelerated changes in ABCC9 due to its role in the human vocal repertoire and other characteristics relevant for the HSD phenotype. More specifically, we wish to argue that HSD could have resulted in selected changes in ABCC9 that favored higher pitch vocal sounds, which were later selected, or co-opted, as part of a general trend toward increased cuteness. Ultimately, these changes might have contributed to the potentiation of cultural mechanisms (e.g., bonding systems) that promote the complexification and diversification of languages. This possibility is supported by the phenotypic profile of clinical conditions resulting from mutations in ABCC9 in present-day populations. One of these conditions is Cantú Syndrome (OMIM#239850), which presents with an attenuation of most of the traits impacted by (self-)domestication, including macrocephaly, increased body size, hypertrichosis, coarse facial features, and significantly, a hoarse voice which is suggestive of a lower pitch (Grange et al., 2014). With regards to the cognitive profile and social functioning abilities of the affected people, they show symptoms of autism spectrum disorder (ASD), particularly, at younger ages (Roessler et al., 2021). ASD has been argued to entail reduced features of HSD too (Benítez-Burraco et al., 2016). Needless to say, because of the pleiotropic nature of genes, one could imagine other explanations for the selection of the genetic variants attested by Gisladottir et al. (2023). For properly testing the hypothesis sketched in this commentary, further in silico and in vitro analyses are needed, including a survey of these variants in a larger number of domestic species. More specifically, generating animal models bearing some of the most promising variants could enable to uncover the true mechanism accounting for the changes observed in (and hypothesized for) voice pitch.
Discussion
To conclude, the research by Gisladottir et al. (2023) opens a promising avenue of research that could result in the formulation of more robust bridging hypotheses between the genetic changes occurred in the course of human evolution, the changes experienced by our body and behavior, and the emergence of human distinctive traits, specifically, our idiosyncratic speech abilities.
Author contributions
AB-B conceived the paper, reviewed the available literature, analyzed the data, and wrote the paper.
Funding
This research was supported by grant PID2020-114516GB-I00 funded by MCIN/AEI/10.13039/501100011033 (to AB-B).
Acknowledgments
The author wishes to thank Rósa Gisladottir for her feedback during the writing of this manuscript.
Conflict of interest
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Aung, T., and Puts, D. (2020). Voice pitch: a window into the communication of social power. Curr. Opin Psychol. 33, 154–161. doi: 10.1016/j.copsyc.2019.07.028
Benítez-Burraco, A., and Kempe, V. (2018). The emergence of modern languages: Has human self-domestication optimized language transmission? Front. Psychol. 9, 551. doi: 10.3389/fpsyg.2018.00551
Benítez-Burraco, A., Lattanzi, W., and Murphy, E. (2016). Language impairments in ASD resulting from a failed domestication of the human brain. Front. Neurosci. 10, 373. doi: 10.3389/fnins.2016.00373
Benítez-Burraco, A., and Progovac, L. (2020). A four-stage model for language evolution under the effects of human self-domestication. Lang. Commun. 73, 1–17 doi: 10.1016/j.langcom.2020.03.002
Bi, X., Zhou, L., Zhang, J. J., Feng, S., Hu, M., Cooper, D. N., et al. (2023). Lineage-specific accelerated sequences underlying primate evolution. Sci. Adv. 9, eadc9507. doi: 10.1126/sciadv.adc9507
Blasi, D. E., Moran, S., Moisik, S. R., Widmer, P., Dediu, D., and Bickel, B. (2019). Human sound systems are shaped by post-Neolithic changes in bite configuration. Science 363, eaav3218. doi: 10.1126/science.aav3218
Brown, S., Yuan, Y., and Belyk, M. (2021). Evolution of the speech-ready brain: The voice/jaw connection in the human motor cortex. J. Compar. Neurol. 529, 1018–1028. doi: 10.1002/cne.24997
Cieri, R., Churchill, S., Franciscus, R., Tan, J., and Hare, B. (2014). Craniofacial feminization, social tolerance, and the origins of behavioral modernity. Curr. Anthropol. 55, 419–443. doi: 10.1086/677209
Corballis, M. C. (2002). From Hand to Mouth: The Origins of Language. Princeton: Princeton University Press. doi: 10.1515/9780691221731
Corbett, L. (2004). “Dingo,” in Canids: Foxes, Wolves, Jackals and Dogs, eds C. Sillero-Zubiri, M. Hoffmann, and D. W. Macdonald (International Union for Conservation of Nature and Natural Resources).
Dediu, D., Moisik, S. R., Baetsen, W. A., Bosman, A. M., and Waters-Rist, A. L. (2021). The vocal tract as a time machine: inferences about past speech and language from the anatomy of the speech organs. Philos. Trans. R. Soc. London B Biol. Sci. 376, 20200192. doi: 10.1098/rstb.2020.0192
Falk, D. (2014). Interpreting sulci on hominin endocasts: old hypotheses and new findings. Front. Hum. Neurosci. 8, 134. doi: 10.3389/fnhum.2014.00134
Fitch, W. T., de Boer, B., Mathur, N., and Ghazanfar, A. A. (2016). Monkey vocal tracts are speech-ready. Sci. Adv. 2, e1600723. doi: 10.1126/sciadv.1600723
Ghazanfar, A. A., Kelly, L. M., Takahashi, D. Y., Winters, S., Terrett, R., and Higham, J. P. (2020). Domestication phenotype linked to vocal behavior in marmoset monkeys. Curr. Biol. 30, 5026–5032.e3. doi: 10.1016/j.cub.2020.09.049
Gisladottir, R. S., Helgason, A., Halldorsson, B. V., Helgason, H., Borsky, M., Chien, Y. R., et al. (2023). Sequence variants affecting voice pitch in humans. Sci. Adv. 9, eabq2969. doi: 10.1126/sciadv.abq2969
Gleeson, B. T., and Kushnick, G. (2018). Female status, food security, and stature sexual dimorphism: Testing mate choice as a mechanism in human self-domestication. Am. J. Phys. Anthropol. 167, 458–469. doi: 10.1002/ajpa.23642
Gokhman, D., Nissim-Rafinia, M., Agranat-Tamir, L., Housman, G., García-Pérez, R., Lizano, E., et al. (2020). Differential DNA methylation of vocal and facial anatomy genes in modern humans. Nat. Commun. 11, 1189. doi: 10.1038/s41467-020-15020-6
Graham, K. E., Badihi, G., Safryghin, A., Grund, C., and Hobaiter, C. (2022). A socio-ecological perspective on the gestural communication of great ape species, individuals, and social units. Ethol. Ecol. Evolut. 34, 235–259. doi: 10.1080/03949370.2021.1988722
Grange, D. K., Nichols, C. G., and Singh, G. K. (2014). “Cantú Syndrome,” in GeneReviews®, ed. M. P. Adam (Seattle: University of Washington).
Gruber, T., and Clay, Z. (2016). A comparison between bonobos and chimpanzees: a review and update. Evolut. Anthropol. 25, 239–252. doi: 10.1002/evan.21501
Hare, B. (2017). Survival of the friendliest: homo sapiens evolved via selection for prosociality. Ann. Rev. Psychol. 68, 155–186. doi: 10.1146/annurev-psych-010416-044201
Hare, B., Wobber, V., and Wrangham, R. (2012). The self-domestication hypothesis: Evolution of bonobo psychology is due to selection against aggression. Animal Behav. 83, 573–585. doi: 10.1016/j.anbehav.2011.12.007
Hobaiter, C., Graham, K. E., and Byrne, R. W. (2022). Are ape gestures like words? Outstanding issues in detecting similarities and differences between human language and ape gesture. Philos. Trans. Royal Soc. Lond. B Biol. Sci. 377, 20210301. doi: 10.1098/rstb.2021.0301
Kruska, D. C. (2005). On the evolutionary significance of encephalization in some eutherian mammals: effects of adaptive radiation, domestication, and feralization. Brain Behav. Evo. 65, 73–108. doi: 10.1159/000082979
Levinson, S. C. (2022). The interaction engine: cuteness selection and the evolution of the interactional base for language. Philos. Trans. R. Soc. B 377, 20210108. doi: 10.1098/rstb.2021.0108
Lieberman, P., and McCarthy, R. C. (1975). “The evolution of speech and language,” in The role of speech in language (Cambridge, MA: MIT Press), 83–106.
Lord, K. A., Larson, G., Coppinger, R. P., and Karlsson, E. K. (2020). The history of farm foxes undermines the animal domestication syndrome. Trends Ecol. Evol. 35, 125–136. doi: 10.1016/j.tree.2019.10.011
Ma, S., Skarica, M., Li, Q., Xu, C., Risgaard, R. D., Tebbenkamp, A. T. N., et al. (2022). Molecular and cellular evolution of the primate dorsolateral prefrontal cortex. Science 377, eabo7257. doi: 10.1126/science.abo7257
Morgan, A. (2013). Speech-language pathology insights into genetics and neuroscience: beyond surface behaviour. Int. J. Speech-Language Pathol. 15, 245–254. doi: 10.3109/17549507.2013.777786
Morgan, A., Fisher, S. E., Scheffer, I., and Hildebrand, M. (2016). “FOXP2-related speech and language disorder,” in GeneReviews®, ed. M. P. Adam (Seattle: University of Washington).
Nelson, P. T., Jicha, G. A., Wang, W. X., Ighodaro, E., Artiushin, S., Nichols, C. G., et al. (2015). ABCC9/SUR2 in the brain: implications for hippocampal sclerosis of aging and a potential therapeutic target. Age. Res. Rev. 24, 111–125. doi: 10.1016/j.arr.2015.07.007
Newbury, D. F., and Monaco, A. P. (2010). Genetic advances in the study of speech and language disorders. Neuron 68, 309–320. doi: 10.1016/j.neuron.2010.10.001
Nicastro, N. (2004). Perceptual and acoustic evidence for species-level differences in meow vocalizations by domestic cats (Felis catus) and African wild cats (Felis silvestris lybica). J. Compar. Psychol. 118, 287–296. doi: 10.1037/0735-7036.118.3.287
Nishimura, T., Tokuda, I. T., Miyachi, S., Dunn, J. C., Herbst, C. T., Ishimura, K., et al. (2022). Evolutionary loss of complexity in human vocal anatomy as an adaptation for speech. Science 377, 760–763. doi: 10.1126/science.abm1574
Okanoya, K. (2017). Sexual communication and domestication may give rise to the signal complexity necessary for the emergence of language: an indication from songbird studies. Psychon. Bull. Rev. 24, 106–110. doi: 10.3758/s13423-016-1165-8
Raviv, L., Jacobson, S. L., Plotnik, J. M., Bowman, J., Lynch, V., and Benítez-Burraco, A. (2023). Elephants as an animal model for self-domestication. Proc. Natl. Acad. Sci. USA. 120, e2208607120. doi: 10.1073/pnas.2208607120
Riede, T., and Fitch, T. (1999). Vocal tract length and acoustics of vocalization in the domestic dog (Canis familiaris). J. Exper. Biol. 202, 2859–2867. doi: 10.1242/jeb.202.20.2859
Roessler, H. I., van der Heuvel, L. M., Shields, K., Guilliams, K. P., Knoers, N. V. A. M., van Haaften, G., et al. (2021). Behavioral and cognitive functioning in individuals with Cantú syndrome. Am. J. Med. Genet. Part A 185, 2434–2444. doi: 10.1002/ajmg.a.62348
Sánchez-Villagra, M. R., Geiger, M., and Schneider, R. A. (2016). The taming of the neural crest: a developmental perspective on the origins of morphological covariation in domesticated mammals. R. Soc. Open Sci. 3, 160107. doi: 10.1098/rsos.160107
Sataloff, R. T. (1995). Genetics of the voice. J. Voice. 9, 16–19. doi: 10.1016/S0892-1997(05)80218-8
Sayigh, L. S., El Haddad, N., Tyack, P. L., Janik, V. M., Wells, R. S., and Jensen, F. H. (2023). Bottlenose dolphin mothers modify signature whistles in the presence of their own calves. Proc. Natl. Acad. Sci. USA. 120, e2300262120. doi: 10.1073/pnas.2300262120
Trut, L., Oskina, I., and Kharlamova, A. (2009). Animal evolution during domestication: the domesticated fox as a model. BioEssays. 31, 349–360. doi: 10.1002/bies.200800070
Wilkins, A. S., Wrangham, R. W., and Fitch, W. T. (2014). The “domestication syndrome” in mammals: a unified explanation based on neural crest cell behavior and genetics. Genetics 197, 795–808. doi: 10.1534/genetics.114.165423
Wrangham, R. (2019). The Goodness Paradox: The Strange Relationship Between Virtue and Violence in Human Evolution. New York: Vintage.
Yeon, S. C., Kim, Y. K., Park, S. J., Lee, S. S., Lee, S. Y., Suh, E. H., et al. (2011). Differences between vocalization evoked by social stimuli in feral cats and house cats. Behav. Proc. 87, 183–189. doi: 10.1016/j.beproc.2011.03.003
Keywords: pitch, self-domestication, speech, ABCC9, vocal behavior
Citation: Benítez-Burraco A (2023) Toward an evolutionary account of the changes in the human pitch vocal system. Front. Psychol. 14:1249727. doi: 10.3389/fpsyg.2023.1249727
Received: 30 June 2023; Accepted: 06 October 2023;
Published: 19 October 2023.
Edited by:
Wei Chen, Shaoxing University, ChinaReviewed by:
Jakub Polák, Charles University, CzechiaCopyright © 2023 Benítez-Burraco. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Antonio Benítez-Burraco, abenitez8@us.es
†ORCID: Antonio Benítez-Burraco orcid.org/0000-0003-4574-5666