Skip to main content

HYPOTHESIS AND THEORY article

Front. Neurosci., 16 March 2021
Sec. Auditory Cognitive Neuroscience
This article is part of the Research Topic What is Musical Creativity? Interdisciplinary Dialogues and Approaches View all 13 articles

The Role of Canalization and Plasticity in the Evolution of Musical Creativity

  • Department of Musicology, Adam Mickiewicz University in Poznań, Poznań, Poland

Creativity is defined as the ability to generate something new and valuable. From a biological point of view this can be seen as an adaptation in response to environmental challenges. Although music is such a diverse phenomenon, all people possess a set of abilities that are claimed to be the products of biological evolution, which allow us to produce and listen to music according to both universal and culture-specific rules. On the one hand, musical creativity is restricted by the tacit rules that reflect the developmental interplay between genetic, epigenetic and cultural information. On the other hand, musical innovations seem to be desirable elements present in every musical culture which suggests some biological importance. If our musical activity is driven by biological needs, then it is important for us to understand the function of musical creativity in satisfying those needs, and also how human beings have become so creative in the domain of music. The aim of this paper is to propose that musical creativity has become an indispensable part of the gene-culture coevolution of our musicality. It is suggested that the two main forces of canalization and plasticity have been crucial in this process. Canalization is an evolutionary process in which phenotypes take relatively constant forms regardless of environmental and genetic perturbations. Plasticity is defined as the ability of a phenotype to generate an adaptive response to environmental challenges. It is proposed that human musicality is composed of evolutionary innovations generated by the gradual canalization of developmental pathways leading to musical behavior. Within this process, the unstable cultural environment serves as the selective pressure for musical creativity. It is hypothesized that the connections between cortical and subcortical areas, which constitute cortico-subcortical circuits involved in music processing, are the products of canalization, whereas plasticity is achieved by the means of neurological variability. This variability is present both at the level of an individual structure’s enlargement in response to practicing (e.g., the planum temporale) and within the involvement of neurological structures that are not music-specific (e.g., the default mode network) in music processing.

Introduction

Music has been observed in all human cultures and has accompanied our ancestors since the dawn of our species (Morley, 2013), or perhaps even earlier as part of ancient hominin behavior (Mithen, 2006). Therefore, musicality as a set of abilities that enable music production and recognition (Fitch, 2015), is often claimed to be a product of biological evolution (Roederer, 1984; Hagen and Bryant, 2003; Peretz, 2006; Hagen and Hammerstein, 2009; Mithen, 2009). As music has its roots in the vocal behavior of our ancestors (Bannan, 2012; Morley, 2013), the evolution of musical creativity cannot be explained without tracing the beginnings of hominin vocal communication. Singing, the vocal form of musical activity, belongs together with crying, laughter and speech, as a group of characteristic vocalizations of Homo sapiens, and is often compared with songbirds’ songs (Merker, 2005; Fitch and Jarvis, 2013; Rothenberg et al., 2014) or the songs of other species (Geissmann, 2000; Payne, 2000). Human and songbird songs are good examples of ritual culture because the structures of their songs are the objects of imitation by conspecifics (Merker, 2009). In order to achieve this trait, the evolution of vocal learning among songbirds and hominins was necessary. Another important similarity between human and songbird songs is that both humans and songbirds modify their songs so that new versions become an object of cultural transmission. This means that apart from an urge to imitate a song, individuals are also prone to invent at least some new structural elements. This task necessitates both the knowledge of rules that govern a particular song style and creativity, as not all possible sound variants can be recognized as a song and therefore appreciated by conspecifics. From this point of view, musical creativity, similar to songbird vocal creativity, should not only have the ability to generate something novel and original in the domain of music, but also have something valuable (Merker, 2006). The invention of any new musical piece must be a result of the “creative thinking process” which in music can be viewed as consisting of three stages: (i) product intentions, (ii) the thinking process, and (iii) the creative product (Webster, 1990, 2002). While the product intentions phase is strictly related to the kind of urge that drives a creator to musical activity (e.g., composition, performance, improvisation) the thinking process is based on divergent and convergent thoughts (Webster, 2002). Divergent thoughts involve the imagination of musical elements such as melodic or rhythmic motives (Webster, 2002) which can occur involuntarily (Copeland, 2019). In contrast, convergent thoughts seem to be analytical and lead to aesthetic decisions (Webster, 2002). Only after this is the final creative product achievable. Since the result of an effective creative process in music is usually positively assessed by a social environment, being creative gives an advantage to creative individuals. Therefore, musical creativity can be viewed as a biological adaptation. As every creative act operates on the existing cultural information, musical creativity can also be understood as “an adaptive process of knowledge acquisition” (Reybrouck, 2006). However, while creativity in general refers to the ability that allows us to use all possible cognitive resources, the creative process in the domain of song is restricted by the scope of sound features which characterize conspecific songs. Therefore, in contrast to general creativity that is often understood as being free from any limitations, the evolutionary beginnings of creative abilities in the domain of song seem to be tightly connected to a domain-specific form of communication.

The fact that humans are so constrained by music-specific features (Harwood, 1976; Nettl, 2000; Brown and Jordania, 2011; Savage et al., 2015; Mehr et al., 2019) and at the same time so creative while inventing the variations of a sound sequence, suggests that human musicality must be based on at least two antithetical predispositions. One impels humans to be as precise as possible in copying musical distinctive sound features. This copying capacity is possible thanks to a human ability unique among primates to vocally learn (Janik and Slater, 1997; Merker, 2005, 2012), especially the sounds of speech and singing (Bannan, 2008). The other predisposition allows us to be inventive enough to create sound sequences that are ear-catching for the other members of a social group. Without this ability, no variations of music, and thus any cultural evolution of music, would be possible (Savage, 2019). By taking into account the contrastive character of the aforementioned abilities, a multistage evolutionary scenario must be considered to explain the origin of musical creativity. After all, it is hard to imagine how such a complex variability of human songs could have evolved as a result of one accidental event. Instead, the appearance of musical creativity as we know it today must have been evolving gradually in response to many evolutionary pressures. On the one hand, these pressures must have led to the appearance of music-specific perceptive, functional and behavioral biases (Mehr et al., 2019). On the other hand, these biases must not have been strong enough to restrict the scope of hominins’ vocal repertoire to a closed set of calls. The result is Homo sapien musical culture which can be characterized by the influence of at least four levels of constrains: (1) inherited perceptive and behavioral biases which influence the existence of musical universals; (2) enculturated (culturally inherited) biases which consists of implicitly learned elements of a musical system such as culture-specific pitch intervals and rhythm ratios; (3) limitations of creativity which are related to the efficiency of the brain, restricted for instance, by the capacity of working memory; (4) social selective pressures which act as feedback able to modify former constraints in the long run. The existence of all these constraints indicates that gene-culture coevolution (Lumsden and Wilson, 1980) could have been an evolutionary mechanism leading to the origins of human musicality (Podlipniak, 2015, 2017; Killin, 2016, 2017; Patel, 2018; Savage et al., 2020). It is worth mentioning, that all these aforesaid constraints act simultaneously and must have been present at all stages of our gene-culture coevolution, which has led to the emergence of human musical culture. Therefore, the evolution of musical creativity, being an inseparable part of this process, must have been influenced, at least, by some of the same evolutionary forces that have shaped the evolution of human musicality.

This paper takes a theoretical and naturalistic approach in order to explain the evolutionary roots of musical creativity. Since musical creativity is based on innovative thinking in the domain of sound communication, the presented framework is focused only on the relatively recent evolutionary history of the Homo lineage in which vocal learning started to play an important role, rather than depicting the whole evolutionary origin of human musicality. We begin with the idea of gene-culture coevolution and its aforementioned two main evolutionary mechanisms—plasticity and canalization. The next section is devoted to the conditions that influenced the evolution of musical creativity, in which the point of departure for the coevolutionary pathways leading to emergence of human musicality is demonstrated, including details concerning the possible interplay between canalized and innovative factors in the domain of pitch and rhythm. In the following section, the view of musical syntax as a canalized framework for musical creativity is discussed. In the conclusion the importance of musical creativity for the evolution of human musicality is emphasized, and possible future research that can contribute to the evolutionary explanation of the origin of musical creativity is suggested.

Plasticity and Canalization as the Main Mechanisms of Gene-Culture Coevolution

In general, gene-culture coevolution is a circular process in which genetic information influences culture whilst also simultaneously changing in response to culture that acts as a selective environment (Lumsden and Wilson, 1982). The influence of genetic information on culture can be achieved by different mechanisms related to either somatic characteristics or behavioral proclivities. Wilson (Wilson, 1978) described this influence metaphorically by saying that “the genes hold culture on a leash” (p. 167). These mechanisms restrict the type of cultural information to be exchanged (e.g., the dominance of visual and acoustic cues as the means of communication in primates) but they also allow a species to create a species-specific cultural niche (e.g., agriculture). However, apart from being under such constraints many animals are able to socially learn (Laland, 2017) which makes them flexible at overcoming new challenges that can appear in a fast-changing environment. Such flexibility is especially useful in a complex and often fast changing cultural environment that characterizes human culture. Nonetheless, not all cultural traits change equally fast. When a particular cultural environment is stable enough, i.e., it lasts throughout many generations; it can become a source of selection. As a consequence, such a stable cultural environment can lead to the appearance of traits that are under genetic control. The clearest example of gene-culture coevolution is the evolution of lactase persistence (Gerbault et al., 2011). It is assumed that the domestication of animals by humans created a pastoral cultural niche which became a selective environment for the appearance of lactose tolerance among adults about 8,000 years ago (Leonardi et al., 2012). As a result, a population dominated by individuals characterized by lactose tolerance has been prone to sustain and elaborate the practice of dairying. In this example, a cultural invention—farming—became a starting point for a chain of interdependent coevolutionary events. Similar coevolutionary mechanisms have been proposed for the evolution of many human traits such as handedness, mating preference (Laland, 2008), language (Dor and Jablonka, 2000, 2001; Bickerton, 2010; Deacon, 2010) and music (Podlipniak, 2017; Patel, 2018; Savage et al., 2020). Music in gene-culture coevolution was initially invented by hominins and by having an adaptive function, music abilities have been preferred by natural selection which has strengthened and accelerated the evolution of human musicality. According to Patel for example, the invention of music had unexpected adaptive effect—social bonding, which was the cause of the coevolutionary chain of circumstances (Patel, 2018). In another coevolutionary scenario of music evolution (Podlipniak, 2017), the social bonding function was an attribute of music right from the beginning. In all these scenarios, however, genes and culture has been permanently interacting.

Plasticity

The necessary condition for gene-culture coevolution is the ability of phenotypes to modify behavior in response to the environment. This kind of response is often called behavioral plasticity (Dor and Jablonka, 2010; Mery and Burns, 2010), a form of developmental (or phenotypic) plasticity (Pigliucci, 2001; Fusco and Minelli, 2010) which, along with genetic variation, is the property of all living organisms (West-Eberhard, 2005). Thanks to plasticity every phenotype can be flexible to some extent, which allows it to adapt in response to environmental changes during its lifetime. For behavioral plasticity to occur a phenotype has to learn. Although behavioral plasticity and learning are often associated with neuronal plasticity, both can be achieved by other mechanisms too (Jablonka and Lamb, 2005; Mery and Burns, 2010). It must be emphasized, however, that developmental, including behavioral, plasticity being the terms that refer to the functional explanation of the adaptation to the environment (the survival value of a trait), belongs to Tinbergen’s ultimate level of explanation (Tinbergen, 1963) which focuses on the phylogeny and function of a phenotypic trait (Fitch, 2015). In contrast, neuronal plasticity as a physiological mechanism that allows the brain to modify the preexisting neuronal connections in response to changes in afferent inputs or efferent requirements (Pascual-Leone et al., 2005) represents Tinbergen’s proximal level of explanation (Tinbergen, 1963), which concerns the mechanistic and developmental elucidations of how a phenotypic trait works and develops in ontogeny (Fitch, 2015). Therefore, although behavioral plasticity can be achieved by the means of neuronal plasticity, only the former refers to the gene-culture coevolutionary explanations of the appearance of musical creativity.

From this perspective plasticity is a crucial process that enables cultural change. It seems that plasticity has played a crucial role in the evolution of humans, since our culture is cumulative (Tomasello, 1999), which means that cultural traits such as behaviors, technologies and ideas are not only learned and transmitted from generation to generation but also modified and improved so that new elements are often added to the existing ones. An increasing number of innovations and modifications lead to the appearance of a rich cultural environment which undergoes its own evolution. The process of cultural evolution resembles to some extent genetic evolution (Creanza et al., 2017), since in both cases the gradual process of transmission and modification of traits by means of natural selection is observed. This means that all cultural traits have to compete to reproduce (cultural transmission). However, in contrast to genetic evolution in which variations of traits are achieved by means of mutations and recombination, at the level of cultural evolution plasticity is a source of diversity. Moreover, without plasticity the interaction between cultural and genetic evolution would be impossible. After all, thanks to plasticity the first Anatolian farmers were able to invent a dairy based diet which started the gene-culture coevolution of the aforementioned lactase persistence.

Canalization

The concept of “canalization” was originally proposed by Waddington (1942) and refers to “the ability of a genotype to produce relatively constant phenotypes regardless of environmental and genetic variabilities” (Takahashi, 2019, p. 14; cf. also Dor and Jablonka, 2010, p. 138; Loison, 2019, p. 6). Importantly, such constancy concerns not only somatic but also behavioral traits. Canalization can be achieved by different means including cultural and genetic control over a given trait. A good example of an evolutionary mechanism related to gene-culture coevolution in which canalization plays a crucial role is the Baldwin effect (Baldwin, 1896a,b; Sznajder et al., 2012). In the first step of the Baldwinian mode of evolution, organisms adapt to the environmental challenges by means of phenotypic adaptation. In the case of culturally achieved phenotypic adaptation, adaptive learning plays a crucial role. Thanks to genetic variability of a population, sooner or later a new genotype appears and endues an individual with an instinct allowing the replacement of phenotypic adaptation with genetic adaptation (Dor and Jablonka, 2000). If this learning is costly, i.e., it needs a lot of energy and/or is time-consuming, such an individual is favored by natural selection and in the long run the whole population becomes dominated by individuals endowed with this instinct. In other words, in these circumstances a trait that is initially achieved by plastic response to environmental pressure is followed by a genetic change which takes control over the previously culturally achieved trait. The ability to develop this trait, independent of whether it is achieved by learning or genetic control, represents an example of canalization. Different circumstances can promote either plasticity or canalization. While plasticity is an effective mechanism for adaptation to a fast-changing environment, canalization can be viewed as a safety-valve based on evolutionary “memory.” As such, canalization can be understood as “the buffering of environmental and genetic < < noise> >” (Dor and Jablonka, 2010, p. 135).

The Evolution of the Musical Mind and Musical Creativity

The processing of music is a complex task which consists of many sequential and simultaneous computations that reflects different adaptive mechanisms of a different evolutionary age. Only a small part of these processes can be viewed as music-specific. The vast majority of these mechanisms fulfill more general functions related to audition, including auditory scene analysis such as the detection and recognition of sound sources (Bregman, 1990), and various forms of acoustic communication (Zimmermann et al., 2013; Ackermann et al., 2014; Scheumann et al., 2014), a lot of which humans share with other animal species. Although many of these non-music-specific abilities are necessary for music perception, they are not sufficient to experience music in its whole structural and meaningful form. After all, only Homo sapiens among all living primates is able to vocalize syntactically organized and culturally transmitted sound patterns based on rhythm and pitches. Of course, this does not mean that human musicality evolved from scratch. On the contrary, many preadaptations among our ancestral primates must have existed to allow natural selection to design music-like behavioral traits in some evolutionary lineages of primates. A good example of such a behavioral trait can be found in gibbon songs (Geissmann, 2000; Merker, 2000). This opens the possibility that even the common ancestor of great and lesser apes was endowed with the abilities to allow some form of singing (Jordania, 2014). However, taking into account that gibbon songs are under strong genetic control rather than being vocally learned calls (Brockelman and Schilling, 1984), it is reasonable to assume that the evolution of culturally flexible musical behavior among hominins was influenced by additional factors. This means that human-specific musical brain equipment had to have evolved relatively recently after the split between the common ancestor of chimpanzees and hominins. This equipment must have allowed our ancestors to use sounds as culturally heritable units. This ability let our ancestral hominins to partially free their vocalizations from affective calls (Ackermann et al., 2014). The unchained vocal calls became based on discrete units which became the units of cultural inheritance (Savage, 2019). Moreover, the sequences of these units, being culturally transmitted information, started to be the objects of modification. This is the evolutionary stage where plasticity and creativity started to act in the domain of vocal communication.

Environmental Sounds, Speech, and Music

Although vision is the dominant sense among primates (in humans for instance the processing of visual signal involves a remarkably bigger amount of neural tissue than the processing of auditory stimuli; Deutsch, 2019), hearing is still an important source of information about the environment for humans and the same must have been true for hominins. Therefore, it is not surprising that auditory information serves as an additional clue to navigate in the world (Horowitz, 2012). However, apart from inferring information about the environment, primates also use sound to transmit signals concerning their intentions (Hauser and Konishi, 1999; Hauser, 2000). Since primates are able to auditory learn, i.e., to associate sounds with referential meaning (Wright et al., 1990) and chimpanzees even modify and use sounds as sound symbols (Watson et al., 2015), it is reasonable to assume that hominins were able to use sounds intentionally as a medium of communication. It has been proposed that the increasing role of sociality among hominins that belong to the lineage leading to Homo sapiens, created an evolutionary pressure for the development of intentional vocalizations, i.e., speech and singing (Dunbar, 1996). The conscious intentionality of vocal expressions imposes a set of new properties on the production and processing of sound. First of all, the coding of meaning in vocally produced sounds necessitates the vocal control over the acoustic features of vocalized sounds. Secondly, the acoustic features of sound must be linked together with meaningful mental categories. Thirdly, these mental categories must be processed separately in order to avoid confusion in the process of decoding information. Fourthly, auditory perception should be biased in favor of acoustic features that are crucial for the conspecific intentional vocalizations. Fifthly, the intentional vocalizations should be ductile enough to code new meaningful elements. Finally, these elements should be prone to be learnable, i.e., to be culturally inherited. All these properties characterize the production and processing of both speech and singing.

Human vocal learning is especially efficient as far as the speech and singing features are concerned (Jackendoff and Lerdahl, 2006). The mental categories of words in speech and musical pitches and rhythms in music are mapped into conceptual meaning (Bickerton, 2009) and preconceptual emotional (Podlipniak, 2020) and kinetic impressions (Grahn and Rowe, 2009, 2013; Levitin et al., 2018), respectively. Although neuroimaging studies show that certain brain areas are involved in the processing of both language and music (Fedorenko et al., 2009; Kunert et al., 2015) there are also neural structures that are activated when only one of these two phenomena is processed (Rogalsky et al., 2011; Perruchet and Poulin-Charronnat, 2013; Norman-Haignere et al., 2015). This dissociation indicates the existence of language and music specific networks. The perceptive preference for the specific features of speech and music is observed in children right after birth (Fassbender, 1996; McMullen and Saffran, 2004; Perani et al., 2010; Brandt et al., 2012). Finally, both language and music represent a recursive open system (Merker, 2002; Pinker and Jackendoff, 2005), which means that humans are able to produce an enormous number of new and original musical and speech phrases. Importantly, in contrast to many songbird species, humans are prone to learn and modify these phrases during their whole life. However, some elements of speech and music such as phonemes and pitch intervals seem to be easier to learn in childhood whereas some, such as new words or new musical phrases, are equally easy to learn during adulthood (Trainor, 2005; White et al., 2013; Li et al., 2014; Friedmann and Rusou, 2015; Birdsong, 2018). This suggests that learning plasticity is a complex ability that consists of selective constraints which act differently at different stages of development.

Rhythm, Movement, Pitch, and Vocal Learning

The music specific elements of musical code are discrete rhythm measures and pitch intervals. Since both these musical features have been recognized as musical universals (Brown and Jordania, 2011; Savage et al., 2015; Trehub, 2015; Mehr et al., 2019) it is reasonable to assume that music communication is based on sounds interpreted by human minds in term of discrete pitches and rhythms. Moreover, rhythm measures and pitch intervals are crucial for musical syntax. However, discrete pitches and rhythms can exist independently as in music based solely on rhythm, as well as in non-metrical music, e.g., in some oral traditions of Gregorian chanting, respectively. This indicates that the processing of musical pitch and the processing of musical rhythm are in fact separate abilities. Therefore, human musicality cannot be treated as a monolithic entity or as a product of one evolutionary episode. Indeed, many scholars have proposed the multistage scenarios of music’s origins (Mithen, 2006). In the majority of these scenarios, musical rhythm is usually indicated as evolutionarily more ancient than musical pitch.

Musical rhythm is organized in a framework based on the sensation of periodicity known as musical pulse. Additionally, this framework is usually organized into hierarchical beat patterns called meters (London, 2012). This means that our perception of musical rhythm is based on precise mental expectations concerned with when sounds may occur (Huron, 2006). The evolutionary roots of musical rhythm have often been searched for in the coupling between auditory signals and movement. The main hint for such a claim is that humans spontaneously synchronize their movements with musical beat, the perception of which is based on the sensation of musical pulse. It is also known that the processing of isochronous musical stimuli involves the activity of motor brain areas including cortico-striatal loops (Geiser et al., 2012). This can additionally suggest that our perception of musical meter is at least partly based on motor experience (Repp, 2007). Therefore, rather than auditory, music is an auditory-motor phenomenon which along with its time dependence, makes music experience and creativity unique (Bashwiner, 2018) among other human sound expressions. The ability of auditory-motor synchronization affects not only one individual, but also the social level of behavior by allowing people to move together in synchrony in response to music. It has been also proposed that apart from movement synchronization, collective listening to music leads to the alignment of brain states (Bharucha et al., 2011).

Although the human ability to synchronize movements with beat seems to be exceptional among primates, there are studies which indicate that motor-beat synchronization restricted to 600ms periodicity can be achieved by chimpanzees (Hattori et al., 2013, 2015). Chimpanzees are also prone to rhythmic swaying in response to auditory stimuli and the beat rate of these stimuli influences the periodicity of chimps swaying in a bipedal position (Hattori and Tomonaga, 2020). These observations suggest that primitive auditory-motor coupling was present in the last common ancestor of Homo sapiens and chimpanzees. This coupling could have probably been one of the ancient preadapations for human musicality. However, while auditory-motor synchronization among chimpanzees is restricted to a narrow scope of tempi (Hattori et al., 2013), humans are able to spontaneously synchronize with isochronous auditory stimuli in a multi-timescale (Parncutt, 1994; Toiviainen et al., 2010; Patel and Iversen, 2014). The widening of a beat extraction timescale is a good example of a change that enables plasticity in the domain of beat perception and production. Therefore, while the first hominins were most probably unable to use sounds in different timescales, the broadening of beat sensitivity among the next generations of hominins opened the window of creativity in the domain of rhythm. Although the change of tempo does not seem to be a very creative task for humans today, it must have been a great achievement for hominins. The extension of a beat extraction timescale was also a necessary condition for the development of rhythm measures quantification. As a result of this, the perception of rhythm measures in terms of simple integer ratios is nowadays a cross-cultural universal feature of human music perception (Jacoby and McDermott, 2017). However, having many rhythm measures available, hominins developed a medium of communication susceptible to cultural change and inventions. Therefore, on the one hand the abilities to align movements with beat and to quantify rhythm measures in terms of simple integer ratios are the examples of canalized skills. At the neuronal level the ability of beat extraction is based on cortico-striatal loops, especially on connections between the supplementary motor area, the premotor cortex, auditory cortex and the putamen (Grahn and Rowe, 2009). This suggests that the evolution of hominins’ auditory-motor synchronization has been achieved by the development of these cortico-subcortical connections. On the other hand, the possibility of sensing different periodicities as musical pulse along with flexibility in the use of rhythm measures became an area of behavioral plasticity, due to the fact that both the striatum and the neocortex belong to the major sites of synaptic plasticity in the brain enabling learning (Pascual-Leone et al., 2005; Surmeier et al., 2009; Perrin and Venance, 2019). Both the ability of beat extraction and the sense of different periodicities most probably became the first capacities which enabled divergent musical thought composed of rhythmic kernels of musical thinking (Webster, 2002). Note however, that neither all canalized rhythmical skills, nor the whole scope of plasticity in the domain of musical rhythm appeared at the same time in the evolution. In order to allow hominins to evolve into a rhythmically creative species like Homo sapiens, a chain of successive behavioral changes had to take place. The first step in this process had to be a gradual extension of beat extraction timescale. It is possible that this step could have been achieved previously by means of behavioral plasticity. However, the scope of creativity in the domain of beat extraction must have been constrained by the restricted range of possible periodicities. This range must have gradually broadened until the appearance of anatomically modern humans in whom it reaches the span of 300–900 ms (Toiviainen et al., 2010). Importantly, beat extraction is prone to enculturation which additionally restricts creativity (London, 2012). Even today people from different cultures are biased to sense different pulse of the same music sequence depending on their cultural background (Agawu, 1987). Nonetheless, this kind of culturally induced constraint can be overcome by a creative individual thanks to often conscious convergent thinking that interplays with divergent thinking (Webster, 2002).

Another supposed area of the interplay between plasticity and canalization in the evolution of music creativity is the domain of musical pitch. Pitch in music is usually perceived as a sequence of discrete sensations representing pitch intervals (Krumhansl, 1990; Rakowski, 1999, 2009); thus, creativity in this domain may consist of composing subsequent pitches in a melody and sometimes in changing the size of pitch intervals. In contrast to the experience of musical rhythm which is based on the predictions about sound timing that are independent of the spectral characteristic of the sounds, musical pitch is sensed only if the perceived sound contains harmonics. The crucial acoustic parameter that influences the sensation of pitch is fundamental frequency (F0). Simply speaking, while musical rhythm tells us “when” sounds occur, musical pitch answers the question about the content of musical sequence. Not only humans use harmonic sounds as a medium of coding their intentions. Many other species, including primates, communicate using sound frequency modulation (Hauser and Konishi, 1999; Hauser, 2000; Horowitz, 2012). Continuous changes of sound frequency are parts of the affective calls of nonhuman primates (Zimmermann et al., 2013; Scheumann et al., 2014). This indicates that hominins also had to use pitch in their calls. However, a leap forward in the evolution of musical creativity was the appearance of digital (discrete) elements of vocalizations. In contrast to affective calls, the features of which are present also in human affective prosody in speech, the digital forms of human vocalizations (articulate speech and singing) are subjects to greater volitional control which gives more space for creativity.

Although not only pitch is used in a discrete form in human vocal expressions, as in the case of vowels and consonants in speech (Jackendoff, 2009), harmonic sounds must have been pivotal in the evolution of both music and speech. The appearance of digital vocal communication was possible thanks to vocal learning—the ability that allows us to vocally reproduce the acoustic parameters of the sounds that we hear (Janik and Slater, 1997; Merker, 2012). The evolution of vocal learning is also related to increased cortico-striatal connectivity (Jarvis, 2007; Fitch and Jarvis, 2013). Although humans are not equally good at vocal learning of all sounds, we are especially predisposed to imitate the distinctive elements of speech and singing. In other words, our cognitive system is biased in favor of speech formats and F0 of sounds in terms of their perception as well as production. Yet the volitional vocal control of F0 was the most important evolutionary change (Bannan, 2012) that initiated the creative use of discrete pitch. Only having the ability to sustain F0 of sung sounds and to master the size of vocalized pitch intervals allowed hominins to be inventive in producing original sequences of discrete pitches. Therefore, while the vocal learning biases became the canalized roots of music and speech development, the establishment of the basic discrete units of speech and music—e.g., phonemes of a particular language and discrete pitches of culture-specific musical system—represents the scope of plasticity. However, it is possible that at the ancient stages of hominins’ vocalizations, the choice of spectral cues (F0 or speech formants) as the distinctive features of discrete vocalization units was also established by means of behavioral plasticity. It is well known that practicing music can lead to the enlargement of and the induction of neuroplasticity in cortical areas involved in the processing of pitch information such as the planum temporale (Meyer et al., 2012; Bashwiner et al., 2016) as well as to the increased connectivity between areas involved in music processing such as the auditory, the sensorimotor, and the prefrontal cortices (Klein et al., 2016). Such neuronal mechanisms enable behavioral plasticity and could have been present to some extent in early musical hominins allowing them to create discrete vocalizations. The fact that there is a correspondence between the number of vowels in language and the number of pitches in musical scales, as well as a relationship between vowel formants and F0 in song based on meaningless syllables (Fenk-Oczlon, 2017) can support such a scenario. Only after a persistent use of F0 as a culturally learned distinctive clue for the recognition of song units did the perceptive bias in favor of musical pitch appear. From a cognitive point of view, the perceptive biases in favor of musical pitch and regularity in musical rhythm consist of an active search for relevant information ignoring irrelevant acoustic features. These biases restrict musical creativity in a similar way to phonemic constraints in language (Merker, 2006). The appearance of musical pitch extended the scope of mental categories that constitute the units of musical thoughts present in Webster’s divergent thinking phase (Webster, 2002).

Musical Syntax as a Canalized Framework for Musical Creativity

The perceptive biases that constrained musical creativity in the domain of rhythm and pitch structure are not restricted solely to the active search for the distinctive features of musical discrete units. Music similar to language is a complex communicative tool, the structure of which is governed by syntactic rules (Lerdahl and Jackendoff, 1983). These rules mean that musical structure is perceived in terms of two types of hierarchy—rhythm hierarchy based on meter (London, 2012) and pitch hierarchy based on pitch centricity (Krumhansl and Cuddy, 2010). From this point of view music represents a generative and recursive system known as the Humboldt system (Merker, 2002), which allows people to create an enormous number of socially appreciated music sequences. This enormity of such sequences does not mean, however, that every sequence of sound can be recognized as correct. Quite the opposite, there is at least an equally big set of sound sequences that are unacceptable by a social group as faultless musical pieces. This means that even a musically untrained listener is able to recognize syntactical faults in musical sequences such as an out-of-key note in a tonal sequence (Tillmann et al., 2000; Gorzelańczyk et al., 2017). Such a tacit recognition of faults resembles the feelings that accompany native speakers when they listen to grammatical errors in their mother tongues. The main difference between music and language in this respect is that word hierarchies (grammatical relations) are conceptual. Although musical syntax, in contrast to language, is not related to propositional semantics (Lerdahl, 2013) the structural hierarchies in music are easily recognizable by means of emotional clues. It is assumed that the emotional response to sound sequences occurs as a result of fulfilling or not the predictions based on the implicit statistical learning of sound distribution in the musical environment.

The fact that musical syntax is learned implicitly and spontaneously in childhood suggests the existence of developmental predispositions related to this task. It has been hypothesized that the ability to recognize pitch center (Podlipniak, 2016), as well as to organize musical pitch in a syntactic way (Podlipniak, 2020), evolved by the means of the Baldwin effect, which means that after cultural invention of pitch hierarchy it was overtaken by genetic control. Regardless of whether it is true or not, the fact that tonal organization of music is prevalent across musical cultures (Mehr et al., 2019) strengthens the view that human proclivity to implicitly learn the rules of pitch distribution in music is a result of the process of canalization. After all, the prevalence of tonality among musical cultures across the world, which are enormously diversified in respect of other musical traits, indicates that pitch syntax must have become a stable musical feature. It has been either culturally transmitted from an ancient common ancestral culture or it develops independently thanks to some genetic proclivities. The same reason can explain the universality of hierarchical organization of musical rhythm. As a result, musical syntax usually consists of pitch and rhythm hierarchies. Therefore, both explanations are consistent with the claim that the use of a pitch-rhythm framework for musical syntax is an example of a canalized behavioral trait of Homo sapiens. To some extent musical syntax is like the grammar of a mother tongue. As the tacit knowledge of grammar restricts the possible word flexion and order in sentences, the musical syntax puts constraints on musical expressions. In other words, musical syntax answers the question when and what kind of pitch is acceptable. However, there is not only one answer to this question. Moreover, the permanently changing social environment means that the aesthetic preferences are also changing. As a result, different tunes may bring the house down in different times. Having many possible melodic variants that are congruent with the syntactic rules of a particular music style, and that can simultaneously satisfy the aesthetic preferences of a given social group opens space for creativity. This kind of creativity is often called “combinational” (Boden, 2004, p. 3) since the innovation represents a new combination of sounds within a framework of the present syntactic rules. Sometimes, however, an individual is able to create sound sequences that surpass the existing rules and to implement new ones. This kind of creativity is called “exploratory” (Boden, 2004, p. 4) as it explores formerly unknown rules of organization. It is possible that the neurological structures that are not music-specific, such as the default mode network, play some role in such innovative thinking. In both cases, however, the creative individuals must be familiar with the aforesaid implicit knowledge which means that they are constrained by the existing implicitly learned rules. After all, changing the rules necessitates the knowledge of what has changed. The new rules, if socially accepted, become implicitly learned by the next generations of listeners until new innovative rules are created and socially accepted. This endless process is a good example of cultural evolution which runs faster than gene-culture coevolution. However, in the long run, even in a fast-changing culture, some features can become canalized. Canalization opens the coevolutionary concatenation of events. The example of a ubiquitous syntactic pitch-rhythm framework of music suggests that human musicality has become a subject of canalization. However, although musical creativity, being a force that leads to musical change, acts against canalization it is at the same time a product of it. Without the canalized perceptual traits such as rhythm measures, pitch intervals, etc. musical creativity would be devoid of a well-defined space.

Conclusion

According to the presented view, musical creativity did not only appear in the course of hominin evolution but also became a driving force of the gene-culture coevolution of human musicality. Being an inseparable part of a cultural plastic change, creativity in the domain of pitch and rhythm gave diversity which became the subjects of cultural selection. Only such circumstances opened space for the canalization of pitch intervals and rhythm measures as discrete musical units that dominated human musicality for millennia. Moreover, since the default experience of musical pitch and rhythm is preconceptual, their appearance most probably preceded the evolution of the conceptual mind. This suggests that creativity in the domain of musical pitch and rhythm represents a domain specific ability in contrast to creativity that operates in the conceptual mind. Of course, it does not exclude the possibility that people can be creative in music using conceptual resources. This is especially possible as part of Webster’s aforementioned creative thinking process at the level of convergent thoughts (Webster, 2002). After all, professional musicians are able to learn pitch intervals and rhythm schemes as precise defined concepts and categorically perceived entities. This task necessitates, however, strenuous and time-consuming learning, suggesting the crucial role of phenotypic plasticity in this process. Moreover, musical phrases can be the source of associations that can be used in creative composing which can be consciously incorporated into analytical thinking leading to aesthetic decisions. Nevertheless, music-specific creativity operates mainly in the realm of tonal music, whereas general creativity can be especially desirable in “sound arts” and other kinds of music that abandon rhythm and pitch syntaxes.

The proposed view that musical creativity is linked to gene-culture coevolution emphasizes the important role of individual invention in the process of generating cultural variability. After all, an increasing number of song variants achieved by individuals’ creativity extends the scope of a cultural environment to be selected. Independent of this variability the social success of creative individuals could facilitate natural selection of those individuals whose creativity meets the social requirements. As a result, over a long period of time human musicality appears to be a permanently changing capacity influenced by both social aesthetic trends and our predispositions to be creative in the domain of music. This implies that musical creativity is a part of human nature which means that musical creative thoughts should be achievable by the majority of people. Although the proposed evolutionary scenario is speculative there are some possible areas of study which can explore its implications. The search for genetic predispositions of musicality is the most promising scope of research in this respect. Another area research that can help to answer the question about the role of plasticity and canalization in evolution musical creativity is the neuroscience of music. The recognition of the limits of neural plasticity related to the processing of musical pitch and rhythm could shed some light on the specificity of creativity in the domain of music.

Data Availability Statement

The original contributions presented in the study are included in the article further inquiries can be directed to the corresponding author.

Author Contributions

The author confirms being the sole contributor of this work and has approved it for publication.

Funding

Funding from Adam Mickiewicz University in Poznań supported the open access publication fee.

Conflict of Interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

I would like to thank the reviewers for their critical and helpful suggestions and comments. I would also like to thank Peter Kośmider-Jones for proofreading and language consultation.

References

Ackermann, H., Hage, S. R., and Ziegler, W. (2014). Brain mechanisms of acoustic communication in humans and nonhuman primates: An evolutionary perspective. Behav. Brain Sci. 37, 529–546. doi: 10.1017/S0140525X13003099

PubMed Abstract | CrossRef Full Text | Google Scholar

Agawu, V. K. (1987). The rhythmic structure of West African music. J. Musicol. 5, 400–418. doi: 10.2307/763699

CrossRef Full Text | Google Scholar

Baldwin, J. M. (1896a). A new factor in evolution. Am. Nat. 30, 441–451. doi: 10.1086/276408

CrossRef Full Text | Google Scholar

Baldwin, J. M. (1896b). A new factor in evolution (Continued). Am. Nat. 30, 536–553. doi: 10.1086/276428

CrossRef Full Text | Google Scholar

Bannan, N. (2008). Language out of music: the four dimensions of vocal learning. Aust. J. Anthropol. 19, 272–293. doi: 10.1111/j.1835-9310.2008.tb00354.x

CrossRef Full Text | Google Scholar

Bannan, N. (2012). “Harmony and its role in human evolution,” in Music, Language, and Human Evolution, ed. N. Bannan (Oxford: Oxford University Press), 288–340. doi: 10.1093/acprof:osobl/9780199227341.003.0012

CrossRef Full Text | Google Scholar

Bashwiner, D. M. (2018). “The neuroscience of musical creativity,” in The Cambridge Handbook of the Neuroscience of Creativity, eds R. E. Jung and O. Vartanian (New York, NY: Cambridge University Press), 495–516. doi: 10.1017/9781316556238.029

CrossRef Full Text | Google Scholar

Bashwiner, D. M., Wertz, C. J., Flores, R. A., and Jung, R. E. (2016). Musical creativity “Revealed” in brain structure: interplay between motor, default mode, and limbic networks. Sci. Rep. 6:20482. doi: 10.1038/srep20482

PubMed Abstract | CrossRef Full Text | Google Scholar

Bharucha, J. J., Curtis, M., and Paroo, K. (2011). “Musical communication as alignment of brain states,” in Language and Music as Cognitive Systems, eds P. Rebuschat, M. Rohrmeier, J. A. Hawkins, and I. Cross (Oxford, New York: Oxford University Press), 139–155. doi: 10.1093/acprof:oso/9780199553426.003.0016

CrossRef Full Text | Google Scholar

Bickerton, D. (2009). “Syntax for non-syntacticians A brief primer,” in Biological Foundations and Origin of Syntax, eds D. Bickerton and E. Szathmáry (Cambridge: The MIT Press), 3–13. doi: 10.7551/mitpress/9780262013567.001.0001

CrossRef Full Text | Google Scholar

Bickerton, D. (2010). Adam’s Tongue: How Humans Made Language, How Language Made Humans. New York, NY: Hill and Wang.

Google Scholar

Birdsong, D. (2018). Plasticity, variability and age in second language acquisition and bilingualism. Front. Psychol. 9:81. doi: 10.3389/fpsyg.2018.00081

PubMed Abstract | CrossRef Full Text | Google Scholar

Boden, M. A. (2004). The Creative Mind: Myths and Mechanisms, 2 Edn. New York, NY: Routledge.

Google Scholar

Brandt, A., Gebrian, M., and Slevc, L. R. (2012). Music and early language acquisition. Front. Psychol. 3:327. doi: 10.3389/fpsyg.2012.00327

PubMed Abstract | CrossRef Full Text | Google Scholar

Bregman, A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. London: The MIT Press.

Google Scholar

Brockelman, W. Y., and Schilling, D. (1984). Inheritance of stereotyped gibbon calls. Nature 312, 634–636. doi: 10.1038/312634a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Brown, S., and Jordania, J. (2011). Universals in the world’s musics. Psychol. Music 41, 229–248. doi: 10.1177/0305735611425896

CrossRef Full Text | Google Scholar

Copeland, N. E. (2019). InMI and its potential originality – musical creativity in composers’ minds. Interdiscip. Stud. Musicol. 19, 41–52. doi: 10.14746/ism.2019.19.3

CrossRef Full Text | Google Scholar

Creanza, N., Kolodny, O., and Feldman, M. W. (2017). Cultural evolutionary theory: How culture evolves and why it matters. Proc. Natl. Acad. Sci. U.S.A. 114, 7782–7789. doi: 10.1073/pnas.1620732114

PubMed Abstract | CrossRef Full Text | Google Scholar

Deacon, T. W. (2010). A role for relaxed selection in the evolution of the language capacity. Proc. Natl. Acad. Sci. U.S.A. 107, 9000–9006. doi: 10.1073/pnas.0914624107

PubMed Abstract | CrossRef Full Text | Google Scholar

Deutsch, D. (2019). Musical Illusions and Phantom Words: How Music and Speech Unlock Mysteries of the Brain. New York, NY: Oxford University Press, doi: 10.1093/oso/9780190206833.001.0001

CrossRef Full Text | Google Scholar

Dor, D., and Jablonka, E. (2000). From cultural selection to genetic selection: a framework for the evolution of language. Selection 1, 33–56. doi: 10.1556/Select.1.2000.1-3.5

CrossRef Full Text | Google Scholar

Dor, D., and Jablonka, E. (2001). “How language changed the genes: toward an explicit account of the evolution of language,” in New Essays on the Origin of Language, eds J. Trabant and S. Ward (Berlin: De Gruyter Mouton), 149–176. doi: 10.1515/9783110849080.149

CrossRef Full Text | Google Scholar

Dor, D., and Jablonka, E. (2010). “Plasticity and canalization in the evolution of linguistic communication: an evolutionary developmental approach,” in The Evolution of Human Language, eds R. K. Larson, V. Deprez, and H. Yamakido (Cambridge: Cambridge University Press), 135–147. doi: 10.1017/CBO9780511817755.010

CrossRef Full Text | Google Scholar

Dunbar, R. I. M. (1996). Grooming, Gossip, and the Evolution of Language. London: Harvard University Press.

Google Scholar

Fassbender, C. (1996). “Infants’ auditory sensitivity towards acoustic parameters of speech and music,” in Musical Beginnings: Origins and Development of Musical Competence, eds I. Deliège and J. Sloboda (Oxford: Oxford University Press), 56–87. doi: 10.1093/acprof:oso/9780198523321.003.0003

CrossRef Full Text | Google Scholar

Fedorenko, E., Patel, A., Casasanto, D., Winawer, J., and Gibson, E. (2009). Structural integration in language and music: evidence for a shared system. Mem. Cognit. 37, 1–9. doi: 10.3758/MC.37.1.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Fenk-Oczlon, G. (2017). What vowels can tell us about the evolution of music. Front. Psychol. 8:1581. doi: 10.3389/fpsyg.2017.01581

PubMed Abstract | CrossRef Full Text | Google Scholar

Fitch, W. T. (2015). Four principles of bio-musicology. Philos. Trans. R. Soc. London B Biol. Sci. 370, 20140091. doi: 10.1098/rstb.2014.0091

PubMed Abstract | CrossRef Full Text | Google Scholar

Fitch, W. T., and Jarvis, E. D. (2013). “Birdsong and other animal models for human speech, song, and vocal learning,” in Language, Music and the Brain, ed. M. A. Arbib (Cambridge: The MIT Press), 499–539.

Google Scholar

Friedmann, N., and Rusou, D. (2015). Critical period for first language: the crucial role of language input during the first year of life. Curr. Opin. Neurobiol. 35, 27–34. doi: 10.1016/j.conb.2015.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Fusco, G., and Minelli, A. (2010). Phenotypic plasticity in development and evolution: facts and concepts. Introduction. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 365, 547–556. doi: 10.1098/rstb.2009.0267

PubMed Abstract | CrossRef Full Text | Google Scholar

Geiser, E., Notter, M., and Gabrieli, J. D. E. (2012). A corticostriatal neural system enhances auditory perception through temporal context processing. J. Neurosci. 32, 6177–6182. doi: 10.1523/JNEUROSCI.5153-11.2012

PubMed Abstract | CrossRef Full Text | Google Scholar

Geissmann, T. (2000). “Gibbon songs and human music from an evolutionary perspective,” in The Origins of Music, eds N. L. Wallin, B. Merker, and S. Brown (London: The MIT Press), 103–123.

Google Scholar

Gerbault, P., Liebert, A., Itan, Y., Powell, A., Currat, M., Burger, J., et al. (2011). Evolution of lactase persistence: an example of human niche construction. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 366, 863–877. doi: 10.1098/rstb.2010.0268

PubMed Abstract | CrossRef Full Text | Google Scholar

Gorzelańczyk, E. J., Podlipniak, P., Walecki, P., Karpiński, M., and Tarnowska, E. (2017). Pitch syntax violations are linked to greater skin conductance changes, relative to Timbral violations – the predictive role of the reward system in perspective of cortico–subcortical loops. Front. Psychol. 8:586. doi: 10.3389/fpsyg.2017.00586

PubMed Abstract | CrossRef Full Text | Google Scholar

Grahn, J. A., and Rowe, J. B. (2009). Feeling the beat: premotor and striatal interactions in musicians and nonmusicians during beat perception. J. Neurosci. 29, 7540–7548. doi: 10.1523/JNEUROSCI.2018-08.2009

PubMed Abstract | CrossRef Full Text | Google Scholar

Grahn, J. A., and Rowe, J. B. (2013). Finding and feeling the musical beat: striatal dissociations between detection and prediction of regularity. Cereb. Cortex 23, 913–921. doi: 10.1093/cercor/bhs083

PubMed Abstract | CrossRef Full Text | Google Scholar

Hagen, E. H., and Bryant, G. A. (2003). Music and dance as a coalition signaling system. Hum. Nat. 14, 21–51. doi: 10.1007/s12110-003-1015-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Hagen, E. H., and Hammerstein, P. (2009). Did Neanderthals and other early humans sing? Seeking the biological roots of music in the territorial advertisements of primates, lions, hyenas, and wolves. Music. Sci. 13, 291–320. doi: 10.1177/1029864909013002131

CrossRef Full Text | Google Scholar

Harwood, D. L. (1976). Universals in music: a perspective from cognitive psychology. Ethnomusicology 20, 521–533. doi: 10.2307/851047

CrossRef Full Text | Google Scholar

Hattori, Y., and Tomonaga, M. (2020). Rhythmic swaying induced by sound in chimpanzees (Pan troglodytes). Proc. Natl. Acad. Sci. U.S.A. 117, 936–942. doi: 10.1073/pnas.1910318116

PubMed Abstract | CrossRef Full Text | Google Scholar

Hattori, Y., Tomonaga, M., and Matsuzawa, T. (2013). Spontaneous synchronized tapping to an auditory rhythm in a chimpanzee. Sci. Rep. 3:1566. doi: 10.1038/srep01566

PubMed Abstract | CrossRef Full Text | Google Scholar

Hattori, Y., Tomonaga, M., and Matsuzawa, T. (2015). Distractor effect of auditory rhythms on self-paced tapping in chimpanzees and humans. PLoS One 10:e0130682. doi: 10.1371/journal.pone.0130682

PubMed Abstract | CrossRef Full Text | Google Scholar

Hauser, M. D. (2000). “The sound and the fury: primate vocalizations as reflections of emotion and thought,” in The origins of music, eds N. L. Wallin, S. Brown, and B. Merker (Cambridge, MA: The MIT Press), 77–102.

Google Scholar

Hauser, M. D., and Konishi, M. (1999). The Design of Animal Communication. London: The MIT Press.

Google Scholar

Horowitz, S. S. (2012). The Universal Sense: How Hearing Shapes the Mind. New York, NY: Bloomsbury.

Google Scholar

Huron, D. B. (2006). Sweet Anticipation: Music and The Psychology of Expectation. London: The MIT Press.

Google Scholar

Jablonka, E., and Lamb, M. J. (2005). Evolution in Four Dimensions: Genetic, Epigenetic, Behavioral, and Symbolic Variation in the History of Life. London: MIT Press.

Google Scholar

Jackendoff, R. (2009). Parallels and nonparallels between language and music. Music Percept. 26, 195–204. doi: 10.1525/mp.2009.26.3.195

PubMed Abstract | CrossRef Full Text | Google Scholar

Jackendoff, R., and Lerdahl, F. (2006). The capacity for music: what is it, and what’s special about it? Cognition 100, 33–72. doi: 10.1016/j.cognition.2005.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Jacoby, N., and McDermott, J. H. (2017). Integer ratio priors on musical rhythm revealed cross-culturally by iterated reproduction. Curr. Biol. 27, 359–370. doi: 10.1016/j.cub.2016.12.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Janik, V. M., and Slater, P. J. B. (1997). Vocal learning in mammals. Adv. Stud. Behav. 26, 59–99. doi: 10.1016/S0065-3454(08)60377-0

CrossRef Full Text | Google Scholar

Jarvis, E. D. (2007). Neural systems for vocal learning in birds and humans: a synopsis. J. Ornithol. 148, 35–44. doi: 10.1007/s10336-007-0243-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Jordania, J. (2014). Tigers, Lions and Humans: History of Rivalry, Conflict, Reverence and Love. Tibilisi: Logos.

Google Scholar

Killin, A. (2016). Rethinking music’s status as adaptation versus technology: a niche construction perspective. Ethnomusicol. Forum 25, 1–24. doi: 10.1080/17411912.2016.1159141

CrossRef Full Text | Google Scholar

Killin, A. (2017). Plio-pleistocene foundations of hominin musicality: coevolution of cognition, sociality, and music. Biol. Theory 12, 222–235. doi: 10.1007/s13752-017-0274-6

CrossRef Full Text | Google Scholar

Klein, C., Liem, F., Hänggi, J., Elmer, S., and Jäncke, L. (2016). The “silent” imprint of musical training. Hum. Brain Mapp. 37, 536–546. doi: 10.1002/hbm.23045

PubMed Abstract | CrossRef Full Text | Google Scholar

Krumhansl, C. L. (1990). Cognitive Foundations of Musical Pitch. New York: Oxford University Press, doi: 10.1093/acprof:oso/9780195148367.001.0001

CrossRef Full Text | Google Scholar

Krumhansl, C. L., and Cuddy, L. L. (2010). “A theory of tonal hierarchies in music,” in Music Perception, eds M. R. Jones, R. R. Fay, and A. N. Popper (London: Springer), 51–87. doi: 10.1007/978-1-4419-6114-3_3

CrossRef Full Text | Google Scholar

Kunert, R., Willems, R. M., Casasanto, D., Patel, A. D., and Hagoort, P. (2015). Music and language syntax interact in Broca’s area: an fMRI study. PLoS One 10:e141069. doi: 10.1371/journal.pone.0141069

PubMed Abstract | CrossRef Full Text | Google Scholar

Laland, K. N. (2008). Exploring gene-culture interactions: Insights from handedness, sexual selection and niche-construction case studies. Philos. Trans. R. Soc. B Biol. Sci. 363, 3577–3589. doi: 10.1098/rstb.2008.0132

PubMed Abstract | CrossRef Full Text | Google Scholar

Laland, K. N. (2017). Darwin’s Unfinished Symphony: How Culture Made the Human Mind. Princeton, NJ: Princeton University Press.

Google Scholar

Leonardi, M., Gerbault, P., Thomas, M. G., and Burger, J. (2012). The evolution of lactase persistence in Europe. A synthesis of archaeological and genetic evidence. Int. Dairy J. 22, 88–97. doi: 10.1016/j.idairyj.2011.10.010

CrossRef Full Text | Google Scholar

Lerdahl, F. (2013). “Musical syntax and its relation to linguistic syntax,” in Language, Music, and the Brain, ed. M. A. Arbib (London: The MIT Press), 257–272. doi: 10.7551/mitpress/9780262018104.003.0010

CrossRef Full Text | Google Scholar

Lerdahl, F., and Jackendoff, R. (1983). A Generative Theory of Tonal Music. Cambridge: The MIT Press.

Google Scholar

Levitin, D. J., Grahn, J. A., and London, J. (2018). The psychology of music: rhythm and movement. Annu. Rev. Psychol. 69, 51–75. doi: 10.1146/annurev-psych-122216-011740

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, P., Legault, J., and Litcofsky, K. A. (2014). Neuroplasticity as a function of second language learning: Anatomical changes in the human brain. Cortex 58, 301–324. doi: 10.1016/j.cortex.2014.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Loison, L. (2019). Canalization and genetic assimilation: Reassessing the radicality of the Waddingtonian concept of inheritance of acquired characters. Semin. Cell Dev. Biol. 88, 4–13. doi: 10.1016/j.semcdb.2018.05.009

PubMed Abstract | CrossRef Full Text | Google Scholar

London, J. (2012). Three things linguists need to know about rhythm and time in music. Empir. Musicol. Rev. 7, 5–11.

Google Scholar

Lumsden, C. J., and Wilson, E. O. (1980). Translation of epigenetic rules of individual behavior into ethnographic patterns. Proc. Natl. Acad. Sci. U.S.A. 77, 4382–4386. doi: 10.1073/pnas.77.7.4382

PubMed Abstract | CrossRef Full Text | Google Scholar

Lumsden, C. J., and Wilson, E. O. (1982). Précis of genes, mind, and culture. Behav. Brain Sci. 5, 1–37. doi: 10.1142/5786

CrossRef Full Text | Google Scholar

McMullen, E., and Saffran, J. R. (2004). Music and language: a developmental comparison. Music Percept. 21, 289–311. doi: 10.1525/mp.2004.21.3.289

PubMed Abstract | CrossRef Full Text | Google Scholar

Mehr, S. A., Singh, M., Knox, D., Ketter, D. M., Pickens-Jones, D., Atwood, S., et al. (2019). Universality and diversity in human song. Science 366:eaax0868. doi: 10.1126/science.aax0868

PubMed Abstract | CrossRef Full Text | Google Scholar

Merker, B. (2000). “Synchronous chorusing and human origins,” in The Origins of Music, eds N. L. Wallin, B. Merker, and S. Brown (Cambridge, MA: The MIT Press), 315–327.

Google Scholar

Merker, B. (2002). Music: the missing Humboldt system. Music. Sci. 6, 3–21. doi: 10.1177/102986490200600101

CrossRef Full Text | Google Scholar

Merker, B. (2005). The conformal motive in birdsong, music, and language: an introduction. Ann. N. Y. Acad. Sci. 1060, 17–28. doi: 10.1196/annals.1360.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Merker, B. (2006). “Layered constraints on the multiple creativities of music,” in Musical Creativity. Multidisciplinary Research in Theory and Practice, eds I. Deliège and G. A. Wiggins (Hove, NY: Psychology Press of the Taylor & Francis Group), 25–41.

Google Scholar

Merker, B. (2009). “Ritual foundations of human uniqueness,” in Communicative musicality. Exploring the Basis of Human Companionship, eds C. Malloch and S. Trevarthen (Oxford, New York: Oxford University Press), 45–59.

Google Scholar

Merker, B. (2012). “The vocal learning constellation,” in Music, Language, and Human Evolution, ed. N. Bannan (London: Oxford University Press), 215–260. doi: 10.1093/acprof:osobl/9780199227341.003.0009

CrossRef Full Text | Google Scholar

Mery, F., and Burns, J. G. (2010). Behavioural plasticity: an interaction between evolution and experience. Evol. Ecol. 24, 571–583. doi: 10.1007/s10682-009-9336-y

CrossRef Full Text | Google Scholar

Meyer, M., Elmer, S., and Jäncke, L. (2012). Musical expertise induces neuroplasticity of the planum temporale. Ann. N. Y. Acad. Sci. 1252, 116–123. doi: 10.1111/j.1749-6632.2012.06450.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Mithen, S. J. (2006). The Singing Neanderthals: The Origins of Music, Language, Mind, and Body. Cambridge: Harvard University Press.

Google Scholar

Mithen, S. J. (2009). The music instinct: the evolutionary basis of musicality. Ann. N. Y. Acad. Sci. 1169, 3–12. doi: 10.1111/j.1749-6632.2009.04590.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Morley, I. (2013). The Prehistory of Music: Human Evolution, Archaeology, and the Origins of Musicality. New York: Oxford University Press.

Google Scholar

Nettl, B. (2000). “An ethnomusicologist contemplates universals in musical sound and musical culture,” in The Origins of Music, eds N. L. Wallin, B. Merker, and S. Brown (London: The MIT Press), 463–472.

Google Scholar

Norman-Haignere, S., Kanwisher, N. G., and McDermott, J. H. (2015). Distinct cortical pathways for music and speech revealed by hypothesis-free voxel decomposition. Neuron 88, 1281–1296. doi: 10.1016/j.neuron.2015.11.035

PubMed Abstract | CrossRef Full Text | Google Scholar

Parncutt, R. (1994). A perceptual model of pulse salience and metrical accent in musical rhythms. Music Percept. Interdiscip. J. 11, 409–464. doi: 10.2307/40285633

CrossRef Full Text | Google Scholar

Pascual-Leone, A., Amedi, A., Fregni, F., and Merabet, L. B. (2005). The plastic humen brain cortex. Annu. Rev. Neurosci. 28, 377–401. doi: 10.1146/annurev.neuro.27.070203.144216

PubMed Abstract | CrossRef Full Text | Google Scholar

Patel, A. D. (2018). “Music as a transformative technology of the mind: an update,” in The Origins of Musicality, ed. H. Honing (Cambridge, MA: The MIT Press), 113–126. doi: 10.7551/mitpress/10636.003.0009

CrossRef Full Text | Google Scholar

Patel, A. D., and Iversen, J. R. (2014). The evolutionary neuroscience of musical beat perception: the Action Simulation for Auditory Prediction (ASAP) hypothesis. Front. Syst. Neurosci. 8:57. doi: 10.3389/fnsys.2014.00057

PubMed Abstract | CrossRef Full Text | Google Scholar

Payne, K. (2000). “The progressively changing songs of humpback whales: a window on the creative process in a wild animal,” in The Origins of Music, eds N. L. Wallin, B. Merker, and S. Brown (London: MIT Press), 135–150.

Google Scholar

Perani, D., Saccuman, M. C., Scifo, P., Spada, D., Andreolli, G., Rovelli, R., et al. (2010). Functional specializations for music processing in the human newborn brain. Proc. Natl. Acad. Sci. U.S.A. 107, 4758–4763. doi: 10.1073/pnas.0909074107

PubMed Abstract | CrossRef Full Text | Google Scholar

Peretz, I. (2006). The nature of music from a biological perspective. Cognition 100, 1–32. doi: 10.1016/j.cognition.2005.11.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Perrin, E., and Venance, L. (2019). Bridging the gap between striatal plasticity and learning. Curr. Opin. Neurobiol. 54, 104–112. doi: 10.1016/j.conb.2018.09.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Perruchet, P., and Poulin-Charronnat, B. (2013). Challenging prior evidence for a shared syntactic processor for language and music. Psychon. Bull. Rev. 20, 310–317. doi: 10.3758/s13423-012-0344-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Pigliucci, M. (2001). “Characters and environments,” in The Character Concept in Evolutionary Biology, ed. G. P. Wagner (San Diego, CA: Academic Press), 363–388. doi: 10.1016/B978-012730055-9/50028-8

CrossRef Full Text | Google Scholar

Pinker, S., and Jackendoff, R. (2005). The faculty of language: what’s special about it? Cognition 95, 201–236. doi: 10.1016/j.cognition.2004.08.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Podlipniak, P. (2015). “The origin of music and the Baldwin effect,” in Proceedings of Ninth Triennial Conference of the European Society for the Cognitive Sciences of Music, eds J. Ginsborg, A. Lamont, and S. Bramley (Manchester: Royal Northern College of Music), 671–677.

Google Scholar

Podlipniak, P. (2016). The evolutionary origin of pitch centre recognition. Psychol. Music 44, 527–543. doi: 10.1177/0305735615577249

CrossRef Full Text | Google Scholar

Podlipniak, P. (2017). The role of the Baldwin effect in the evolution of human musicality. Front. Neurosci. 11:542. doi: 10.3389/fnins.2017.00542

PubMed Abstract | CrossRef Full Text | Google Scholar

Podlipniak, P. (2020). Pitch syntax as an evolutionary prelingual innovation. Music. Sci. doi: 10.1177/1029864920941551 [Epub ahead of print],

CrossRef Full Text | Google Scholar

Rakowski, A. (1999). Perceptual dimensions of pitch and their appearance in the phonological system of music. Music. Sci. 3, 23–39. doi: 10.1177/102986499900300102

CrossRef Full Text | Google Scholar

Rakowski, A. (2009). The domain of pitch in music. Arch. Acoust. 34, 429–443.

Google Scholar

Repp, B. H. (2007). Hearing a melody in different ways: Multistability of metrical interpretation, reflected in rate limits of sensorimotor synchronization. Cognition 102, 434–454. doi: 10.1016/j.cognition.2006.02.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Reybrouck, M. (2006). “Musical creativity between symbolic modelling and perceptual constraints. The role of adaptive behaviour and epistemic autonomy,” in Musical Creativity. Multidisciplinary Research in Theory and Practice, eds I. Deliège and G. A. Wiggins (Hove: Psychology Press of the Taylor & Francis Group), 43–59.

Google Scholar

Roederer, J. G. (1984). The search for a survival value of music. Music Percept. Interdiscip. J. 1, 350–356. doi: 10.2307/40285265

CrossRef Full Text | Google Scholar

Rogalsky, C., Rong, F., Saberi, K., and Hickok, G. (2011). Functional anatomy of language and music perception: temporal and structural factors investigated using functional magnetic resonance imaging. J. Neurosci. 31, 3843–3852. doi: 10.1523/JNEUROSCI.4515-10.2011

PubMed Abstract | CrossRef Full Text | Google Scholar

Rothenberg, D., Roeske, T. C., Voss, H. U., Naguib, M., and Tchernichovski, O. (2014). Investigation of musicality in birdsong. Hear. Res. 308, 71–83. doi: 10.1016/j.heares.2013.08.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Savage, P. E. (2019). Cultural evolution of music. Palgrave Commun. 5, 1–12. doi: 10.1057/s41599-019-0221-1

CrossRef Full Text | Google Scholar

Savage, P. E., Brown, S., Sakai, E., and Currie, T. E. (2015). Statistical universals reveal the structures and functions of human music. Proc. Natl. Acad. Sci. U.S.A. 112, 8987–8992. doi: 10.1073/pnas.1414495112

PubMed Abstract | CrossRef Full Text | Google Scholar

Savage, P. E., Loui, P., Tarr, B., Schachner, A., Glowacki, L., Mithen, S., et al. (2020). Music as a coevolved system for social bonding. Behav. Brain Sci. doi: 10.1017/S0140525X20000333 [Epub ahead of print],

CrossRef Full Text | PubMed Abstract | Google Scholar

Scheumann, M., Hasting, A. S., Kotz, S. A., and Zimmermann, E. (2014). The voice of emotion across species: how do human listeners recognize animals’ affective states? PLoS One 9:e91192. doi: 10.1371/journal.pone.0091192

PubMed Abstract | CrossRef Full Text | Google Scholar

Surmeier, D. J., Plotkin, J., and Shen, W. (2009). Dopamine and synaptic plasticity in dorsal striatal circuits controlling action selection. Curr. Opin. Neurobiol. 19, 621–628. doi: 10.1016/j.conb.2009.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Sznajder, B., Sabelis, M. W., and Egas, M. (2012). How adaptive learning affects evolution: reviewing theory on the baldwin effect. Evol. Biol. 39, 301–310. doi: 10.1007/s11692-011-9155-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Takahashi, K. H. (2019). Multiple modes of canalization: links between genetic, environmental canalizations and developmental stability, and their trait-specificity. Semin. Cell Dev. Biol. 88, 14–20. doi: 10.1016/j.semcdb.2018.05.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Tillmann, B., Bharucha, J. J., and Bigand, E. (2000). Implicit learning of tonality: a self-organizing approach. Psychol. Rev. 107, 885–913. doi: 10.1037/0033-295X.107.4.885

PubMed Abstract | CrossRef Full Text | Google Scholar

Tinbergen, N. (1963). On aims and methods of Ethology. Z. Tierpsychol. 20, 410–433. doi: 10.1111/j.1439-0310.1963.tb01161.x

CrossRef Full Text | Google Scholar

Toiviainen, P., Luck, G., and Thompson, M. R. (2010). Embodied meter: hierarchical eigenmodes in music-induced movement. Music Percept. 28, 59–70. doi: 10.1525/mp.2010.28.1.59

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomasello, M. (1999). The Cultural Origins of Human Cognition. Harvard: Harvard University Press.

Google Scholar

Trainor, L. J. (2005). Are there critical periods for musical development? Dev. Psychobiol. 46, 262–278. doi: 10.1002/dev.20059

PubMed Abstract | CrossRef Full Text | Google Scholar

Trehub, S. E. (2015). Cross-cultural convergence of musical features. Proc. Natl. Acad. Sci. U.S.A. 112, 8809–8810. doi: 10.1073/pnas.1510724112

PubMed Abstract | CrossRef Full Text | Google Scholar

Waddington, C. H. (1942). Canalization of development and the inheritance of acquired characters. Nature 150, 563–565. doi: 10.1038/150563a0

CrossRef Full Text | Google Scholar

Watson, S. K., Townsend, S. W., Schel, A. M., Wilke, C., Wallace, E. K., Cheng, L., et al. (2015). Vocal learning in the functionally referential food grunts of chimpanzees. Curr. Biol. 25, 495–499. doi: 10.1016/j.cub.2014.12.032

PubMed Abstract | CrossRef Full Text | Google Scholar

Webster, P. R. (1990). Creativity as creative thinking. Music Educ. J. 76, 22–28. doi: 10.2307/3401073

CrossRef Full Text | Google Scholar

Webster, P. R. (2002). “Creative thinking in music: advancing a model,” in Creativity and Music Education, eds T. Sullivan and L. Willingham (Toronto, ON: Canadian Music Educators’ Association), 16–34.

Google Scholar

West-Eberhard, M. J. (2005). Developmental plasticity and the origin of species differences. Proc. Natl. Acad. Sci. U.S.A. 102, 6543–6549. doi: 10.1073/pnas.0501844102

PubMed Abstract | CrossRef Full Text | Google Scholar

White, E., Hutka, S., Williams, L., and Moreno, S. (2013). Learning, neural plasticity and sensitive periods: implications for language acquisition, music training and transfer across the lifespan. Front. Syst. Neurosci. 7:90. doi: 10.3389/fnsys.2013.00090

PubMed Abstract | CrossRef Full Text | Google Scholar

Wilson, E. O. (1978). On Human Nature. Cambridge, MA: Harvard University Press.

Google Scholar

Wright, A. A., Shyan, M. R., and Jitsumori, M. (1990). Auditory same/different concept learning by monkeys. Anim. Learn. Behav. 18, 287–294. doi: 10.3758/BF03205288

CrossRef Full Text | Google Scholar

Zimmermann, E., Leliveld, L., and Schehka, S. (2013). “Toward the evolutionary roots of affective prosody in human acoustic communication: A comparative approach to mammalian voices,” in Evolution of Emotional Communication: from Sounds in Nonhuman Mammals to Speech and Music in Man, eds E. Altenmüller, S. Schmidt, and E. Zimmermann (Oxford: Oxford University Press), 116–132.

Google Scholar

Keywords: canalization, plasticity, gene-culture coevolution, musical creativity, musical syntax, cortico-subcortical loops, premotor cortex

Citation: Podlipniak P (2021) The Role of Canalization and Plasticity in the Evolution of Musical Creativity. Front. Neurosci. 15:607887. doi: 10.3389/fnins.2021.607887

Received: 18 September 2020; Accepted: 24 February 2021;
Published: 16 March 2021.

Edited by:

David Michael Bashwiner, University of New Mexico, United States

Reviewed by:

Joseph Jordania, University of Melbourne, Australia
Charilaos Lavranos, Ionian University, Greece

Copyright © 2021 Podlipniak. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Piotr Podlipniak, cG9kbGlwQGFtdS5lZHUucGw=; cG9kbGlwQHBvY3p0YS5vbmV0LnBs orcid.org/0000-0002-4326-559X

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.