- 1Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- 2Department of Psychology, Durham University, Durham, United Kingdom
Inner speech emerges in early childhood, in parallel with the maturation of the dorsal language stream. To date, the developmental relations between these two processes have not been examined. We review evidence that the dorsal language stream has a role in supporting the psychological phenomenon of inner speech, before considering pediatric studies of the dorsal stream’s anatomical development and evidence for its emerging functional roles. We examine possible causal accounts of the relations between these two developmental processes and consider their implications for phylogenetic theories about the evolution of inner speech and the accounts of the ontogenetic relations between language and cognition.
Development of Inner Speech
Inner speech – the experience of speaking silently in one’s head – is an enigmatic everyday phenomenon. It has been suggested to play an important role in psychological processes as diverse as memory, cognition, emotional regulation, auditory verbal hallucinations, and even consciousness and self-reflection (Alderson-Day and Fernyhough, 2015). Various domains of scholarship, including philosophy, psychology, and neuroscience, have seen renewed interest in inner speech, where it is seen as providing a context for exploring questions about the relationship between language and thought, the boundary between typical and atypical experience, and the emergence and maintenance of self-regulation (Fernyhough, 2016).
The origins of modern interest in inner speech can be traced to the Russian developmental psychologist, Vygotsky, who proposed that it develops through the gradual internalization of linguistic interactions that have been shaped by social interaction. Vygotsky argued that infants begin life embedded in social exchanges which, with the emergence of language, become linguistically mediated. In time, words that had previously been used to regulate the behavior of others are “turned back on the self” to regulate the child’s own behavior. In the preschool and early school years, such self-directed speech is mainly overt and audible, constituting a developmental stage known as private speech. With further development, these overt dialogues with the self become internalized so that they are entirely covert and inaudible, marking the development of inner speech.
Research in the last few decades has largely confirmed Vygotsky’s view of the development and functions of private and inner speech (Winsler et al., 2009). In particular, empirical studies have supported Vygotsky’s insight that private speech peaks in the preschool and early school years (between 4 and 7 years of age) and gradually reduces in frequency in middle childhood (Winsler et al., 2009). Although studying inner speech in childhood is fraught with difficulty, there is a consensus that this pattern corresponds to the emergence of fully internalized inner speech as private speech “goes underground” (Vygotsky, 1987), and the findings suggest that children begin to understand the concept of inner speech in the preschool and middle school years (Flavell et al., 1993, 1997, 2001; Fernyhough, 2009). Furthermore, there has been a growing recognition that overt self-directed speech (or private speech) continues to have important psychological functions into adulthood (Duncan and Tarulli, 2009). Fernyhough (2004) has proposed that adults can move flexibly between inner and overt private speech.
Studies of the various linguistic parameters in inner speech have so far focused on adult inner speech. Oppenheim and Dell (2008) have suggested that inner speech is phonetically impoverished in comparison to overt speech because inner speech lacks some of the phonetic components present in overt speech or because the internal monitoring system fails to detect the full range of phonetic features of the produced inner speech. However, others have shown that phonetics is fully specified in inner speech. For example, Corcoran (1966) has shown that readers automatically access phonetics in inner speech during silent reading. Özdemir et al. (2007) reported that the “uniqueness point”, the place in the sequence of the word’s phonemes at which it deviates from every other word in the language, influenced phoneme monitoring in inner speech suggesting that inner speech is specified to the same level as overt speech. Slevc and Ferreira (2006) documented a phonemic similarity effect in inner speech, again suggesting phonemic representation in inner speech. An fMRI study showed that manipulation of phonetic variables affects activation in phonological regions, even during a covert condition (Kell et al., 2017). Lastly, people’s ability to detect verbal transformations in inner speech (Sato et al., 2004) also suggests that the phonological representation is highly specified in inner speech. Others found that inner speech monitoring is influenced by lexical bias, suggesting that it is specified at the lexical level (Nooteboom, 2005; Geva and Warburton, 2019). While Slevc and Ferreira (2006) showed that monitoring of inner speech is not subject to the semantic similarity effect, this should not be simply interpreted as inner speech lacking semantic information. Rather, it might be that semantic information is not used for the task of monitoring errors. Lastly, recent studies have suggested that inner speech also carries prosodic information (Breen and Clifton, 2011; Filik and Barber, 2011; Geva and Warburton, 2019). However, it has been argued that information about prosody can be accessed by speakers even before inner speech is evoked (Coltheart et al., 1993; Rastle and Coltheart, 2000), and studies of tip-of-the-tongue somewhat support this argument (reviewed in Geva and Warburton, 2019).
Drawing on ideas of Vygotsky (1987), Fernyhough (2004) has suggested that inner speech can vary between fully specified expanded inner speech to a highly condensed form, with these variations reflecting levels of specification of syntax, semantics, and phonology. Expanded inner speech bears fully specified linguistic information and is similar to overt speech, while condensed inner speech lacks phonology (and all linguistic levels that follow, such as prosody and articulation) and full syntactic structure, and its semantics may be different to that of overt speech, such as being more idiosyncratic and personal in nature. Fernyhough (2004) further suggests that the transition from expanded to condensed inner speech is part of a developmental process and that adults can move flexibly between different forms of inner speech and overt private speech as conditions and task demands change.
Neural Correlates of Inner Speech
With advances in neuroscientific methodology, attention has turned to the neural correlates of self-directed speech, although to date, this has mostly focused on inner speech in adults (Perrone-Bertolotti et al., 2014; Geva, 2018). Recent studies of inner speech function in adults with brain damage have shown that, for some patients, inner speech can be preserved while there is marked impairment in overt speech. More interestingly, other individuals can have preserved overt speech, but at the same time a salient impairment in inner speech (Geva et al., 2011a; Langland-Hassan et al., 2015; Stark et al., 2017). This dissociation suggests that somewhat distinct neural mechanisms support each type of speech. Although inner speech is (in the Vygotskian view) seen as developing out of overt speech, the process of internalization involves various types of semantic and syntactic transformation (Vygotsky, 1987) which make plausible the involvement of distinct neural substrates.
In the last 40 years, hundreds of functional imaging studies have examined the neural correlates of inner speech. These studies have used diverse tasks ranging from silent word repetition (Shuster and Lemieux, 2005; Pei et al., 2011), verb generation (Frings et al., 2006), stem completion (Rosen et al., 2000), and rhyme judgment (Paulesu et al., 1993; Pugh et al., 1996; Lurito et al., 2000; Poldrack et al., 2001; Owen et al., 2004; Hoeft et al., 2007) to silent reading (Bookheimer et al., 1995). Converging evidence from these studies of task-dependent inner speech points to the involvement of the left inferior frontal gyrus (IFG), and the left angular (AG) and supramarginal gyri (SMG) in the production and processing of inner speech (reviewed in Geva, 2018). These areas are connected via the dorsal language stream (Hickok and Poeppel, 2007; Saur et al., 2008), suggesting that it is involved in inner speech processing (Geva et al., 2011c; Rijntjes et al., 2012).
Spontaneous inner speech has only been scarcely studied, but findings so far support those from studies of task-dependent inner speech. A study by Doucet et al. (2012) found higher levels of spontaneous inner speech to be associated with increase in spontaneous fluctuations of activity (tested using resting state fMRI) in a fronto-parietal network, which includes the IFG, temporo-parietal junction, and superior temporal regions. In accordance with this result, it was shown that during resting state (while participants lie inside the scanner without performing any task and without exposure to any specific external stimulus), significant bursts of activation can be recorded in bilateral auditory cortex, which might be related to spontaneous occurrences of inner speech (Hunter et al., 2006). A detailed study of a single participant experiencing spontaneous inner speech in the scanner showed activation in left IFG and superior temporal sulcus (STS) as well as superior and middle temporal gyri during inner speech compared with rest. Left IFG activation was also present when comparing inner speech to other inner experiences (Kühn et al., 2014). In the only fMRI study that has directly compared spontaneous and elicited inner speech, a Region of Interest (ROI) analysis was used to contrast inner speech elicited by a task with occurrences of spontaneous inner speech. The results showed distinct patterns of activation associated with the two speech types, with left IFG activating in elicited, but not in spontaneous inner speech (Hurlburt et al., 2016). The implications of this finding are that it should not be assumed that activations associated with task-based inner speech reflect those found when inner speech arises spontaneously.
Buchsbaum and D’Esposito (2008) suggested that area Spt (Sylvian parietal temporal area, which is located within the Sylvian fissure at the parietal-temporal boundary), is the key area along the dorsal language stream that acts as an interface between the auditory-phonological system and the motor system. This function would implicate it in inner speech production in adults and would point to its potential as a starting point for exploring the neural substrates of inner speech in childhood. In the next sections, we present the current knowledge of dorsal stream anatomy and then discuss its development during childhood, as well as what is known about its function in pediatric populations.
Dorsal Language Stream Anatomy
The dorsal language stream has been studied for more than a century, beginning with the seminal work of Dejerine (1895) and Wernicke (1874). It is specified in the classical Wernicke-Lichtheim-Geschwind anatomical model, where it is suggested that Broca’s area is connected to Wernicke’s area in the posterior temporal cortex via the arcuate fasciculus (AF). Advances in neuroimaging allowed further anatomical characterization of the dorsal language stream. In the past, connections between various areas in the human brain were mainly studied post-mortem. Today, the preferred methodology for defining anatomical white matter connections in vivo is diffusion tensor imaging (DTI). DTI images quantify the level and direction of the movement of water molecules in a tissue. As water molecules behave differently in different types of tissue, DTI can reliably distinguish between cell bodies (gray matter), tracts (white matter), and cerebrospinal fluid (CSF) (Pierpaoli et al., 1996; Pierpaoli and Basser, 1996; Basser and Pierpaoli, 1998). In recent years, DTI studies have refined, altered, and expanded upon the classical Wernicke-Lichtheim-Geschwind anatomical model of the language system (Hagoort, 2014). For terms related to DTI methodology, see Box – DTI Glossary. For a review of the use of DTI in language studies, see Geva et al. (2011b).
BOX. DTI Glossary (adapted from Geva et al., 2011b).
Diffusion Tensor Imaging (DTI) – An MRI technique which is sensitive to the microscopic motion of water molecules in a tissue.
Diffusion tensor images are based on measurements of the movement of molecules:
Isotropic movement is a completely random movement which occurs in the absence of any restriction. This movement is equal in every direction and it is a characteristic of the movement of water molecules in neuronal cells (gray matter) and the cerebrospinal fluid (CSF).
Anisotropic movement is movement which occurs in the presence of physical restriction and is therefore larger in one direction. As axons restrict the movement of molecules parallel to the trajectory of the axon, the movement in the white matter is more anisotropic.
Eigenvector is the direction of movement of the water molecules (the diffusivity), while eigenvalue is the value of the diffusivity along the direction of the associated eigenvector. The tensor represents the overall movement of the water molecules, derived by averaging the strength of movement along the x, y, and z axes.
DTI studies commonly report the following parameters:
Fractional Anisotropy (FA) – A function of the eigenvalues, normalized to be between 0 (movement is completely unrestricted) and 1 (movement is restricted towards one direction), representing how similar the diffusivity values are in the different directions.
Axial Diffusivity (AD) – The value of the main (largest) eigenvalue. Also reffered to as Longitudinal Diffusivity.
Radial Diffusivity (RD) – The average of the two smaller eigenvalues. Also reffered to as Transverse Diffusivity.
Mean Diffusivity (MD) – The average of the three eigenvalues. This value describes the average distance traveled within a specific voxel.
Apparent Diffusion Coefficient (ADC) – The diffusion coefficient along a particular direction. In the context of DTI, MD and ADC are often used interchangeably.
Catani et al. (2005) suggested that in addition to the direct AF pathway between posterior temporal and inferior frontal regions (termed by Catani and colleagues the long segment), there are two other tracts: the anterior segment, which connects the posterior IFG with the inferior parietal lobe; and the posterior segment, which connects the inferior parietal lobe with the posterior temporal gyrus (see Figure 1). Later studies confirmed these findings in both adults (Parker et al., 2005; Frey et al., 2008) and children (Eluvathingal et al., 2007; Tak et al., 2016). These three segments are also referred to as the fronto-temporal (FT) segment (the long segment); fronto-parietal (FP) segment (the anterior segment), and temporo-parietal (TP) segment (the posterior segment) (Eluvathingal et al., 2007; see Table 1). In addition, imaging studies have suggested that a separate tract, the superior longitudinal fasciculus (SLF), also forms part of the dorsal language stream (Frey et al., 2008; Saur et al., 2008). The SLF can be divided into three components, of which only SLF III forms part of the dorsal language stream, connecting parietal area 40 (SMG), the ventral parts of peri-central Brodmann Areas (BA) 43, 2, 4, and 6, and BA 44 (pars opercularis) (Makris et al., 2005). SLF III differs from the long segment of the AF, which in its posterior part reaches the SMG (BA 40), posterior superior temporal gyrus (pSTG; BA 22), and the temporo-occipital region (BA 37). Lastly, it has been suggested that the dorsal pathway can be divided into two sections according to their frontal termination point: dorsal pathway I includes AF/SLF fibers which terminate at the premotor cortex, while dorsal pathway II includes AF/SLF fibers which terminate in the IFG BA 44 (Friederici, 2011, 2012). For details, see Table 1.
Figure 1. Tractography reconstruction of the three segments of the dorsal language pathway: the fronto-temporal (FT) segment/long segment (red); the fronto-parietal (FP) segment /anterior segment (green); and the temporo-parietal (TP) segment/posterior segment (yellow). The figure is adapted from Catani et al. (2005), and is being used with the permission of John Wiley and Sons.
Table 1. Descriptions of the subcomponents of the dorsal language pathway, according to different studies.
Based on these anatomical definitions, the most likely tracts to support inner speech, within the dorsal language stream, are either the fronto-temporal or fronto-parietal segments. However, note that the exact anatomical end points of the various tracts are not agreed upon (see Martino et al., 2013 for an excellent discussion regarding the differences between various anatomical studies). In addition, in many imaging studies, these tracts are not distinguished, due to the methodological limitations of DTI, and are referred to as simply the dorsal stream or AF/SLF (Friederici, 2009).
In addition to the dorsal language stream, the human language system is supported by a ventral language stream (Hickok and Poeppel, 2007; Weiller et al., 2009), which mostly runs medially to the temporal lobe. This pathway connects occipital and temporal areas with frontal regions. It includes the inferior fronto-occipital fascicle (IFOF), which connects the occipital lobe, parietal lobe, and the posterior temporal cortex with the frontal lobe. In addition, the inferior longitudinal fascicle (ILF) connects the posterior occipito-temporal region and the temporal pole. Lastly, the uncinate fasciculus (UF) connects the anterior temporal cortex to inferior frontal areas (reviewed in Duffau, 2016).
Pediatric Studies of the Dorsal Language Stream
Anatomical Studies
The field of developmental cognitive neuroscience has seen a recent increase in interest in the role of the dorsal language stream in both typical development (Tak et al., 2016) and language and speech disorders (Morgan et al., 2016). In a pioneering study of its kind, full-term newborns were scanned within the first 3 days of life. DTI images showed that dorsal pathway I, which terminates in the premotor cortex, is already fully present at birth, while dorsal pathway II, which connects to the IFG, was undetectable (Perani et al., 2011). Similarly, in a study of language pathways among 6- to 22-week-old infants, it was shown that all language tracts were detectable at this age (both ventral and dorsal), although the AF showed the highest variability, terminating in the pre-central gyrus in most cases, and not reaching the IFG (Dubois et al., 2016). Among 0- to 54-month-olds, the SLF was found to be the least developed tract in the newborns, when compared to projection, callosal, brainstem, limbic, and other association fibers, and in fact, it could not be delineated before the age of 12 months (Hermoye et al., 2006). A study which included participants ranging the entire age span from neonates to adults showed that the SLF was difficult to identify in neonates and that it was significantly smaller in infants up to the age of 1 year. However, it could easily be identified in late childhood (6–10 years) (Zhang et al., 2007). While data from these studies converge to suggest that the dorsal language stream, or at least its portion which terminates in the IFG, is under-developed at birth, the explanation for this finding varies. Most authors interpret their findings as reflecting genuine anatomical difference between infants/children and adults (Hermoye et al., 2006; Zhang et al., 2007; Perani et al., 2011). However, Dubois et al. (2016) argue that the difference can be attributed to methodological issues, as studies of infants do not take into account the differences between the dorsal and ventral bundles in adults. Interestingly, post-mortem dissections of fetal human brains at 19–20 weeks gestational age showed that some of the ventral pathway, but not the dorsal one, is already present at this gestational age. In the ventral pathway, the external capsule (which contains the ILF and IFOF) was not clearly visible, but the UF was clearly identified. In healthy neonates, both the ILF and IFOF were identified, though they were not developed enough to reveal their projection to the frontal, temporal, and occipital lobes using DTI (Huang et al., 2006). The SLF was also not visible in the fetus, and it could also not be identified in the neonate. The temporal projection of the SLF was only clearly identifiable in the DTI scans of 5- to 6-year-olds (Huang et al., 2006). Hence, the finding of an existing ventral, but not dorsal, pathway in the fetus, suggests that the under-developed presentation of the dorsal pathway in DTI studies of infancy and childhood might be a genuine anatomical finding, rather than a methodological artifact. However, as the cause of death of the fetuses in the study by Huang et al. is not reported, and as fetal brains are rarely obtained without damage, these results should be interpreted with caution.
Further studies included school-aged children as well. Brauer et al. (2013) expanded on Perani et al.’s (2011) findings, showing that 7-year-olds already have both dorsal pathways I and II in place, similarly to adults, therefore obtaining very similar results to those obtained by Zhang et al. (2007). However, fractional anisotropy (FA) values, a commonly used DTI parameter (see Box – DTI Glossary), were still lower for 7-year-olds, compared to adults. Significant correlations between age and diffusivity parameters were found among cohorts of various age ranges [6- to 17-year-olds, examining the three segments of the AF (Eluvathingal et al., 2007); 4- to 17-year-olds, examining white matter integrity in the area of the AF (Paus et al., 1999); 5- to 18-year-olds, examining the AF (Schmithorst et al., 2002)]. Eluvathingal et al. (2007) distinguish between patterns of maturation based on different diffusivity parameters: The AF fronto-parietal segment showed a significant increase in FA with age, accompanied by significant decreases in mean, transverse, and axial diffusivity, suggesting increases in myelination. The authors suggest that this tract undergoes development mainly at the tested age range (6–17 years of age). The fronto-temporal and temporo-parietal segments of the AF showed significant age-related decreases in mean, transverse, and axial diffusivity measures that were not accompanied by significant increase in FA, which, according to the authors, suggest that much of the tracts’ maturation occurred before the age of 6 (Eluvathingal et al., 2007). A more recent DTI study of the maturation of the dorsal language pathway examined typically developing children in five age groups: 0–2, 3–5, 6–8, 9–11, and 12–14 years. It was found that the posterior segment developed first and actually showed an almost complete maturation already in the youngest age groups. This was followed by the anterior segment, which showed maturation in the middle age groups (around 6–8 years). Finally, the direct segment was suggested to mature only in the early teen years (Tak et al., 2016). Skeide et al. (2016) examined three age groups, similar to the middle ones of Tak et al. (2016), 3–4, 6–7 and 9–10-year-olds, as well as a group of adults. They showed a gradual and steady increase in FA of the dorsal pathway between the four age groups. While data in these studies suggest that the AF reaches maturation around the early school age years, non-linear relations were not statistically evaluated, and it is therefore difficult to determine at which age the maturation plateaus, signifying the age in which the language tracts reach an adult level of development. In addition, some of these studies did not include a group of adults, for comparing the level of maturation of the white matter tracts.
A few studies directly evaluated the age of maturation of various white matter tracts. Maturation was defined as the age at which diffusivity parameters reach a plateau. A longitudinal study which scanned children (aged 5–17) three times over a period of 3 years found increase in FA for both the AF and ILF, the latter forming part of the ventral language stream. However, the slopes were not dependent on initial age of testing, suggesting that the rate of change is equivalent across this age range (Yeatman et al., 2012). Studying participants aged 6–30 years Lebel et al. (2008) suggested that the AF reaches full maturation between the teen years and early 20s. A study of 7- to 68-year-olds found similar results, showing that all three segments of the dorsal language stream (anterior, posterior, and direct) reach full maturation around age 20–30 (Hasan et al., 2010). The authors further suggest that developmental studies should evaluate maturation of anatomical brain structures using non-linear relations.
In summary, there is an agreement in the literature that the ventral language pathway is already detectable at birth (Perani et al., 2011; Tak et al., 2016) and matures faster than the dorsal language pathway (Brauer et al., 2013; Dubois et al., 2016; Tak et al., 2016). In addition, by late childhood, children’s dorsal pathway has similar anatomical structure to that of adults, although full maturation (as reflected in diffusivity parameters) is only achieved in the late teens or even early 20s (see Figure 2).
Figure 2. Tractography reconstruction of the left hemispheric language pathways, as they appear in the newborn (left) and adult (right) human brain. Tracts include: dorsal pathway I (AF/SLF fibers terminating at the premotor cortex) in yellow; dorsal pathway II (AF/SLF fibers terminating in the IFG BA 44) in blue; ventral pathway in green. The figure is adapted from Perani et al. (2011) and is being used with the permission of the authors and the National Academy of Sciences.
The Functional Role of the Dorsal Pathway During Development
We have argued that the dorsal language stream supports the development and maintenance of inner speech. Much research has been done on the role of the dorsal language stream in language processing. Here, we ask whether some of the more well-established functions of this pathway have overlaps with inner speech and try to establish how it can support various and potentially distinct functions at the same time.
Two influential models of language development and processing assign specific functions to the dorsal language stream. The first describes language processing in general, suggesting that acoustic speech signals which are processed in posterior brain regions are transferred through the dorsal language stream to the frontal lobe, where they are converted into articulatory representations (Hickok and Poeppel, 2007). This process is essential for language acquisition, as infants and children learn to produce heard words (Hickok and Poeppel, 2007). Later in adulthood, this processing stream can be used for repetition (Saur et al., 2008; Kümmerer et al., 2013). However, based on the anatomical findings showing that the dorsal stream is under-developed in early childhood, developmental studies of the two language pathways suggested that, in early childhood, language development is actually dependent on the ventral pathway, not the dorsal one, while the dorsal pathway only subserves higher language functions which develop later (Brauer et al., 2013; Skeide et al., 2016).
Reconciling this apparent contradiction, Friederici (2009) suggested that language acquisition is dependent on dorsal pathway I, which terminates in the premotor cortex and develops early, while higher language functions depend on dorsal pathway II, which develops later and terminates more anteriorly in the IFG. This suggestion is supported by studies of adults learning an artificial language. In one study, a significant correlation was found between performance on an artificial language learning task and the integrity of the left long segment, which connects auditory and motor regions. No correlation was found between language learning and the integrity of any of the other language tracts examined (the anterior segment, the posterior segment, or the IFOF) (Lopez-Barroso et al., 2013). Another study demonstrated that performance on an artificial language learning task was reduced when participants’ subvocal rehearsal was blocked (using articulatory suppression), compared to a condition of no suppression, therefore allowing rehearsal. Additionally, task performance correlated with the integrity of the fibers running through the extreme capsule/external capsule, only when subvocal rehearsal was suppressed. The authors suggest that in adults, language learning without subvocal rehearsal is associated with the ventral pathway (Lopez-Barroso et al., 2011). Together, these studies suggest that the association between adult language learning and the dorsal pathway is mediated by inner speech, a suggestion that supports our hypothesis.
A second influential and extensively studied model describes the process of adult reading. According to the Dual-Route model (Paap et al., 1987; Paap and Noel, 1991; Coltheart et al., 1993; Rastle and Coltheart, 2000), word reading can be achieved through one of two routes. The first is a lexical route, dedicated to reading frequent regular, as well as irregular, words by means of whole word recognition. The second is the sublexical route, which supports the reading of new words and non-words, by utilizing direct grapheme to phoneme translation (but see connectionist models, for example Seidenberg and McClelland, 1989). It has been suggested that the lexical and sublexical routes are supported by the ventral and dorsal systems, respectively (Schlaggar and McCandliss, 2007). However, the dorsal portion relevant for reading was found to be the temporo-parietal segment (Pugh et al., 2000; Schlaggar and McCandliss, 2007; Vandermosten et al., 2012). Later studies have extended this model, adding the frontal segments (fronto-temporal and fronto-parietal) (Vanderauwera et al., 2017), showing their association with phonological awareness (reviewed in Vandermosten et al., 2012). Among a group of children aged 7–11, higher phonological awareness (the ability to parse the word into syllables and phonemes and manipulate these phonemes to make up new words) was associated with lower FA in the left AF, over and above age. The correlation was specific to the tract and task (compared with word reading, verbal short-term memory, and repetition tasks). The negative correlation is interpreted as experience-based successful pruning (Yeatman et al., 2011). A longitudinal study of 5-year-old pre-readers found similar results: children were tested at the start and end of their last nursery year, and it was found that better phonological awareness (end phoneme and rhyme identification tasks) was a significant predictor of FA in the left dorsal fronto-temporal segment, over and above naming and letter identification. This correlation was not found for the temporo-parietal segment (Vandermosten et al., 2015). Paralleling the early internalization of overt speech, studies have shown that during reading acquisition, children slowly switch from overt to covert reading (Kragler, 1995; Prior and Welling, 2001). However, studies have yet to test whether this transition is associated with anatomical developments of the ventral or the dorsal routes of language.
Studies of word learning and repetition emphasize a specific functional directionality of the dorsal language pathway, in which processing of input phonological data in posterior regions precedes retrieval of articulatory information in frontal regions, therefore suggesting that information propagates from posterior temporal to anterior frontal regions (Friederici, 2009; Agosta et al., 2010). Direct cortical stimulation of posterior language areas (SMG, middle and posterior STG and the adjacent middle temporal gyrus; MTG) of awake adults resulted in evoked potentials in anterior language areas (Broca’s area or adjacent regions), supporting the idea of processing progressing from posterior to anterior regions. However, in addition, stimulation of anterior regions also resulted in evoked potentials in all posterior regions tested (Matsumoto et al., 2004). A similar study using direct cortical stimulation in adult patients also showed bidirectional connectivity between pSTG and IFG (David et al., 2013), further suggesting that the connection is direct, and also providing evidence that propagation of information is faster from posterior to anterior regions, compared to the other direction. Koubeissi et al. (2012) also highlighted the bidirectionality of the connection, by showing that some patients have evoked response in posterior regions after stimulation of anterior regions, while others show the opposite response. Lastly, a neuro-computational model of the dorsal language stream also suggested a bidirectional transfer of information in this route (Schomers et al., 2017).
In summary, adult patient studies show that information propagates along both anterior and posterior directions within the human dorsal language pathway, and hence, one should be cautious in assuming posterior-to-anterior direction. Most developmental studies have so far focused on those language functions which are supported by unidirectional propagation of information in the dorsal route from posterior to anterior parts. We suggest that some reciprocal fibers in this pathway which send information in the other direction might be essential for inner speech development.
The Dorsal Language Stream in Atypical Development
Some studies suggest a reduced use of inner speech among individuals with autistic spectrum disorder (ASD) (reviewed in Alderson-Day and Fernyhough, 2015). The reduction in inner speech use in some, but not all tasks, might be explained by the difference between dialogic and monologic thinking, with the former having its roots in communication with others, and the latter rooted in communication with the self (Fernyhough, 1996). Accordingly, it is expected that dialogic inner speech will be more affected among individuals with ASD (Alderson-Day and Fernyhough, 2015), a hypothesis that is confirmed in one study (Williams et al., 2012). A comprehensive review of DTI studies of ASD showed that people with ASD have white matter abnormalities across the brain, including in the AF/SLF, but not exclusively (Travers et al., 2012). In addition, correlations between diffusivity parameters and behavioral measurements have been inconsistent (Travers et al., 2012). A single study suggested that inner speech develops more slowly among children with specific language impairments (SLI), compared to typically developing children (Lidstone et al., 2012), but no neural correlates were studied. To the best of our knowledge, no other studies have examined inner speech in atypical pediatric populations. In cases where inner speech has been studied in atypical development, findings regarding white matter abnormalities are inconsistent, and associations with behavioral measurements vary greatly. However, this area of research offers an opportunity to further our understanding of the normal and abnormal development of inner speech and its neural correlates. We suggest that future studies of inner speech developmental abnormalities also examine whether behavioral performance correlates with dorsal stream anatomical integrity.
Dorsal Language Stream – Inner Speech Hypothesis
By combining findings from different disciplines, we have presented evidence that the maturation of the dorsal language stream, especially the fronto-temporal and fronto-parietal segments, during childhood occurs in parallel with the development of inner speech. We therefore suggest that there is a link between these neuro-anatomical and psychological developments. This suggestion is based on findings from three separate lines of research. First, inner speech emerges around the early school years; second, the FT and FP segments of the AF/SLF mature around the same time; and third, adult studies suggest the involvement of those dorsal pathway segments in inner speech processing.
In addition, there is also more specific experimental evidence to support this hypothesis: firstly, studies suggested that language learning in adults is mediated by subvocal rehearsal and is correlated with the integrity of the dorsal tracts (Lopez-Barroso et al., 2011, 2013); and secondly, children’s performance on phonological awareness tasks, often requiring inner speech, is correlated with dorsal pathway development (Yeatman et al., 2011; Vandermosten et al., 2012, 2015).
Evidence for the parallel emergence of the neural pathway of the dorsal stream and the psychological process of inner speech should not, however, be interpreted uncritically as evidence for causation in any particular direction. The development of language is, of course, not solely influenced by maturation of brain structures. Large variability in both brain maturation and language abilities among individuals is partly due to environmental exposure (Kidd et al., 2018). It is well established that environment induces brain changes, especially during childhood (Sale, 2018). It is also known that induced white matter changes can be documented in animals in vivo (Sale, 2018) and in humans using DTI (Scholz et al., 2009). For example, in the area of language development, it has been shown that following 100 h of training program, poor readers showed changes in diffusivity parameters, suggesting increased myelination. Moreover, these changes occurred in the same frontal region where the children with poor reading ability showed lower FA than children with normal reading abilities. Lastly, changes were specific to the group which underwent the remediation program (Keller and Just, 2009). Together, these studies suggest that observed changes in brain maturation can be environmentally induced.
It would therefore be a mistake to assume that the emergence of inner speech is only developmentally constrained by dorsal pathway maturation. Following Vygotsky, Luria argued for bidirectional causation between biological maturation and sociocultural experience, fitting with the view that the internalization of social exchanges creates a new functional system of inner speech (Luria, 1965; Fernyhough, 2010). This view is in keeping with similar views of developmental interplay between interaction with the environment and biological maturation in the human brain (Gómez-Robles et al., 2015).
Lastly, we do not intend to minimize the role of the ventral language stream in inner speech development. Tasks requiring internal content analysis, as is the case in most occurrences of natural inner speech, probably rely on an interaction between the dorsal and the ventral streams (Rijntjes et al., 2012). However, as the ventral stream is already highly developed at birth, it is the maturation of the dorsal stream that presents the main constraint on inner speech development during childhood. Further research on the interplay between the ventral and the dorsal language streams may pay dividends for our understanding of functionally relevant distinctions between forms of inner speech, such as the distinction that can be made between subvocal rehearsal and planning (Alderson-Day and Fernyhough, 2015).
A Comment on Inner Speech and the Origin of Language
Understanding the neurodevelopment of inner speech could be significant for current discussions about the origin of language in human evolution. There are contentious debates on whether language evolved as mechanism for symbolic thought (using inner speech) (Everaert et al., 2015, 2017) or as means of communication (Pinker and Jackendoff, 2005; Corballis, 2017). Jackendoff (1996) and others (Rijntjes et al., 2012) have discussed the importance of inner speech in human evolution, suggesting that the development of inner speech supported more complex and abstract thought. However, Pinker and Jackendoff (2005) emphasize that, in their view, language evolved initially as means of communication, and that inner speech is a “by-product”: a later evolutionary development which is a result of internalizing external speech, which in turn supports more complex thinking. Here, we extend this hypothesis to suggest that this evolutionary development is related specifically to anatomical changes in the dorsal language stream.
Comparative studies have found some substantial differences between dorsal stream tracts in humans, monkeys, and apes, suggesting an evolutionary change affecting these tracts. The human SLF III (the fronto-parietal segment) is similar to that of rhesus monkeys (Thiebaut de Schotten et al., 2012) and macaques (Croxson et al., 2005). The long segment of the AF, on the other hand, shows intra-species variations. In macaques (Rilling et al., 2008) and rhesus monkeys (Petrides and Pandya, 2009; Thiebaut de Schotten et al., 2012), AF connectivity in both anterior and posterior sites is limited. In these monkey species, the AF does not reach the middle or inferior temporal gyri in the posterior end and has less widespread connectivity in the anterior end. In chimpanzee, both parietal and frontal connectivities are wider than in the macaque; however, it is still not as developed as in humans (Rilling et al., 2008).
Additionally, in the macaque (Rilling et al., 2008) and rhesus monkey (Thiebaut de Schotten et al., 2012), the ventral pathway is substantially more developed than the dorsal pathway, as is the case in human infants (see section “Anatomical Studies”). The monkey ventral pathway resembles the human one in its anatomy (Croxson et al., 2005; Thiebaut de Schotten et al., 2012). In chimpanzees, the opposite is found: the dorsal pathway is more developed than the ventral one, as is the case in adult humans (Rilling et al., 2008).
Using neurocomputational modeling, Schomers et al. (2017) demonstrated that intra-species anatomical differences along the dorsal pathway are associated with functional differences. They suggest that compared with the monkey, the human anatomy of the dorsal pathway gives rise to stronger and longer-lasting neural activations, as well as parallel, rather than serial, activation (Schomers et al., 2017). They further suggest that the activity in the human model but not in the monkey model “can be viewed as reflecting (subvocal) articulation” (Schomers et al., 2017, p. 3051).
In summary, comparative studies show that monkeys and even chimpanzees have substantially less developed AF, compared with humans. It has already been suggested that changes in the dorsal tracts were the key element in human language evolution (Aboitiz and García, 2009; Friederici, 2009; Aboitiz, 2012). Aboitiz and colleagues further argue that these changes gave rise to inner speech and its associated function: phonological working memory (Aboitiz and García, 2009; Aboitiz, 2012). If early humans had under-developed AF, and if highly developed AF is the neural substrate for inner speech production (as we argue here), then, one might suggest that early humans had no, or at least limited, inner speech. In the absence of inner speech, language would have been initially used as means of communication (Pinker and Jackendoff, 2005; Corballis, 2017) rather than as mechanism for symbolic thought (Everaert et al., 2015, 2017).
Another line of evidence connecting inner speech with language evolution comes from genetic studies. The FOXP2 gene has long been associated with speech and language in humans (Lai et al., 2001; Vargha-Khadem et al., 2005), and later, it has been argued that both FOXP2 and its target genes have undergone adaptive protein evolution during human evolution (Enard et al., 2002; Zhang et al., 2002). The FOXP2 gene was first identified in the KE family, whose affected members have a mutation in this gene, and they suffer from speech and language deficits (Lai et al., 2001). A later study has shown that those affected individuals suffer from phonological loop impairments, even when the task requires only inner speech, with no overt recitation (Schulze et al., 2018). Others have shown an association between FOXP2 mutations and auditory hallucinations in schizophrenia (Sanjuan et al., 2006; Tolosa et al., 2010). Building on these findings, Crespi et al. (2017) have studied more than 800 healthy individuals, finding an association between a specific variant of the gene and inner speech scores (based on self-rating). Together, these studies link inner speech to one of the main genes implicated in the evolution of language, putting inner speech as a main component in the evolution of language as a whole (Crespi et al., 2017).
Lastly, we do not argue that the ontogeny (of inner speech) recapitulates its phylogeny. That is, the anatomical changes in the language pathways that occur during embryonic development and early childhood are somewhat different from those that came about in the course of evolution. The bidirectional causal view that we have espoused here is in keeping with the finding that human infants are born without a fully matured dorsal pathway. It is the development of this neural system, in parallel with human infants’ socially and linguistically patterned experience, that makes the emergence of inner speech possible.
Conclusion
The anatomy of the arcuate fasciculus was described more than 200 years ago, and its role in language processing has been discussed extensively (Catani and Mesulam, 2008). Together with subcomponents of the SLF, it forms the dorsal language stream. Neurodevelopmental studies have shown that humans are born with a dorsal language stream which is not fully developed and that it slowly matures throughout early childhood. Based on the temporal co-occurrence of dorsal stream maturation and the emergence of inner speech in children, as well as findings from studies of language development and adult language processing, we have suggested that the maturation of the dorsal language stream is closely linked to inner speech development. Studies of the neural mechanisms associated with inner speech in children are scarce. However, recent methodological advances in the study of neuro-development (Satterthwaite et al., 2014) and brain networks (Bassett and Bullmore, 2017) on the one hand, and inner speech (Geva and Warburton, 2019) on the other hand, can all contribute to our ability to make progress in this area. By linking findings from different disciplines, studies on the neural mechanisms of inner speech development can further our understanding of the role of inner speech and bridge the gap between research into language, cognition, development, and evolution.
Author Contributions
SG initiated the article and conducted the literature review. Both authors drafted parts of the manuscript and critically revised the work for intellectual content. Both authors approved the submitted version.
Funding
CF is supported by the Wellcome Trust grant WT108720.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Aboitiz, F. (2012). Gestures, vocalizations, and memory in language origins. Front. Evol. Neurosci. 4:2. doi: 10.3389/fnevo.2012.00002
Aboitiz, F., and García, R. (2009). Merging of phonological and gestural circuits in early language evolution. Rev. Neurosci. 20, 71–84. doi: 10.1515/REVNEURO.2009.20.1.71
Agosta, F., Henry, R. G., Migliaccio, R., Neuhaus, J., Miller, B. L., Dronkers, N. F., et al. (2010). Language networks in semantic dementia. Brain 133, 286–299. doi: 10.1093/brain/awp233
Alderson-Day, B., and Fernyhough, C. (2015). Inner speech: development, cognitive functions, phenomenology, and neurobiology. Psychol. Bull. 141, 931–965. doi: 10.1037/bul0000021
Basser, P. J., and Pierpaoli, C. (1998). A simplified method to measure the diffusion tensor from seven MR images. Magn. Reson. Med. 39, 928–934.
Bassett, D. S., and Bullmore, E. T. (2017). Small-world brain networks revisited. Neuroscience 23, 499–516. doi: 10.1177/1073858416667720
Bookheimer, S. Y., Zeffiro, T. A., Blaxton, T., Gaillard, W., and Theodore, W. (1995). Regional cerebral blood flow during object naming and word reading. Hum. Brain Mapp. 3, 93–106. doi: 10.1002/hbm.460030206
Brauer, J., Anwander, A., Perani, D., and Friederici, A. D. (2013). Dorsal and ventral pathways in language development. Brain Lang. 127, 289–295. doi: 10.1016/j.bandl.2013.03.001
Breen, M., and Clifton, C. (2011). Stress matters: effects of anticipated lexical stress on silent reading. J. Mem. Lang. 64, 153–170. doi: 10.1016/j.jml.2010.11.001
Buchsbaum, B. R., and D’Esposito, M. (2008). The search for the phonological store: from loop to convolution. J. Cogn. Neurosci. 20, 762–778. doi: 10.1162/jocn.2008.20501
Catani, M., Jones, D. K., and Ffytche, D. H. (2005). Perisylvian language networks of the human brain. Ann. Neurol. 57, 8–16. doi: 10.1002/ana.20319
Catani, M., and Mesulam, M. (2008). The arcuate fasciculus and the disconnection theme in language and aphasia: history and current state. Cortex 44, 953–961. doi: 10.1016/j.cortex.2008.04.002
Coltheart, M., Curtis, B., Atkins, P., and Haller, M. (1993). Models of reading aloud - dual-route and parallel-distributed-processing approaches. Psychol. Rev. 100, 589–608. doi: 10.1037/0033-295X.100.4.589
Corballis, M. C. (2017). Language evolution: a changing perspective. Trends Cogn. Sci. 21, 229–236. doi: 10.1016/j.tics.2017.01.013
Corcoran, D. W. J. (1966). An acoustic factor in letter cancellation. Nature 210, 658. doi: 10.1038/210658a0
Crespi, B., Read, S., and Hurd, P. (2017). Segregating polymorphisms of FOXP2 are associated with measures of inner speech, speech fluency and strength of handedness in a healthy population. Brain Lang. 173, 33–40. doi: 10.1016/j.bandl.2017.06.002
Croxson, P. L., Johansen-Berg, H., Behrens, T. E. J., Robson, M. D., Pinsk, M. A., Gross, C. G., et al. (2005). Quantitative investigation of connections of the prefrontal cortex in the human and macaque using probabilistic diffusion tractography. J. Neurosci. 25, 8854–8866. doi: 10.1523/JNEUROSCI.1311-05.2005
David, O., Job, A.-S., De Palma, L., Hoffmann, D., Minotti, L., and Kahane, P. (2013). Probabilistic functional tractography of the human cortex. NeuroImage 80, 307–317. doi: 10.1016/j.neuroimage.2013.05.075
Doucet, G., Naveau, M., Petit, L., Zago, L., Crivello, F., Jobard, G., et al. (2012). Patterns of hemodynamic low-frequency oscillations in the brain are modulated by the nature of free thought during rest. NeuroImage 59, 3194–3200. doi: 10.1016/j.neuroimage.2011.11.059
Dubois, J., Poupon, C., Thirion, B., Simonnet, H., Kulikova, S., Leroy, F., et al. (2016). Exploring the early organization and maturation of linguistic pathways in the human infant brain. Cereb. Cortex 26, 2283–2298. doi: 10.1093/cercor/bhv082
Duffau, H. (2016). “White matter pathways in the human” in Neurobiology of language. eds. G. Hickok and S. L. Small (Amsterdam: Elsevier), 129–137.
Duncan, R., and Tarulli, D. (2009). “On the persistence of private speech: empirical and theoretical considerations” in Private speech, executive functioning, and the development of verbal self-regulation. eds. A. Winsler, C. Fernyhough and I. Montero (Cambridge: Cambridge University Press), 176–187.
Eluvathingal, T. J., Hasan, K. M., Kramer, L., Fletcher, J. M., and Ewing-Cobbs, L. (2007). Quantitative diffusion tensor tractography of association and projection fibers in normally developing children and adolescents. Cereb. Cortex 17, 2760–2768. doi: 10.1093/cercor/bhm003
Enard, W., Przeworski, M., Fisher, S. E., Lai, C. S. L., Wiebe, V., Kitano, T., et al. (2002). Molecular evolution of FOXP2, a gene involved in speech and language. Nature 418, 869–872. doi: 10.1038/nature01025
Everaert, M. B. H., Huybregts, M. A. C., Berwick, R. C., Chomsky, N., Tattersall, I., Moro, A., et al. (2017). What is language and how could it have evolved? Trends Cogn. Sci. 21, 569–571. doi: 10.1016/j.tics.2017.05.007
Everaert, M. B. H., Huybregts, M. A. C., Chomsky, N., Berwick, R. C., and Bolhuis, J. J. (2015). Structures, not strings: linguistics as part of the cognitive sciences. Trends Cogn. Sci. 19, 729–743. doi: 10.1016/j.tics.2015.09.008
Fernyhough, C. (1996). The dialogic mind: a dialogic approach to the higher mental functions. New Ideas Psychol. 14, 47–62. doi: 10.1016/0732-118X(95)00024-B
Fernyhough, C. (2004). Alien voices and inner dialogue: towards a developmental account of auditory verbal hallucinations. New Ideas Psychol. 22, 49–68. doi: 10.1016/J.NEWIDEAPSYCH.2004.09.001
Fernyhough, C. (2009). What can we say about the inner experience of the young child? Behav. Brain Sci. 32:143. doi: 10.1017/S0140525X09000612
Fernyhough, C. (2010). “Vygotsky, Luria, and the social brain” in Self- and social-regulation: Exploring the relations between social interaction, social cognition, and the development of executive functions. eds. B. Sokol, U. Müller, J. Carpendale, A. Young, and G. Iarocci (Oxford, UK: Oxford University Press), 56–79.
Fernyhough, C. (2016). The voices within: The history and science of how we talk to ourselves. London: Profile Books.
Filik, R., and Barber, E. (2011). Inner speech during silent reading reflects the reader’s regional accent. PLoS One 6:e25782. doi: 10.1371/journal.pone.0025782
Flavell, J. H., Flavell, E. R., and Green, F. L. (2001). Development of children’s understanding of connections between thinking and feeling. Psychol. Sci. 12, 430–432. doi: 10.1111/1467-9280.00379
Flavell, J. H., Green, F. L., and Flavell, E. R. (1993). Children’s understanding of the stream of consciousness. Child Dev. 64, 387–398. doi: 10.2307/1131257
Flavell, J. H., Green, F. L., Flavell, E. R., and Grossman, J. B. (1997). The development of children’s knowledge about inner speech. Child Dev. 68, 39–47. doi: 10.2307/1131923
Frey, S., Campbell, J. S. W., Pike, G. B., and Petrides, M. (2008). Dissociating the human language pathways with high angular resolution diffusion fiber tractography. J. Neurosci. 28, 11435–11444. doi: 10.1523/JNEUROSCI.2388-08.2008
Friederici, A. D. (2009). Pathways to language: fiber tracts in the human brain. Trends Cogn. Sci. 13, 175–181. doi: 10.1016/j.tics.2009.01.001
Friederici, A. D. (2011). The brain basis of language processing: from structure to function. Physiol. Rev. 91, 1357–1392. doi: 10.1152/physrev.00006.2011
Friederici, A. D. (2012). Language development and the ontogeny of the dorsal pathway. Front. Evol. Neurosci. 4:3. doi: 10.3389/fnevo.2012.00003
Frings, M., Dimitrova, A., Schorn, C. F., Elles, H. G., Hein-Kropp, C., Gizewski, E. R., et al. (2006). Cerebellar involvement in verb generation: an fMRI study. Neurosci. Lett. 409, 19–23. doi: 10.1016/j.neulet.2006.08.058
Geva, S. (2018). “Inner speech and mental imagery: a neuroscientific perspective” in Inner speech: New voices. eds. P. Langland-Hassan and A. Vicente (Oxford, UK: Oxford University Press).
Geva, S., Bennett, S., Warburton, E. A., and Patterson, K. (2011a). Discrepancy between inner and overt speech: implications for post stroke aphasia and normal language processing. Aphasiology 25, 323–343. doi: 10.1080/02687038.2010.511236
Geva, S., Correia, M., and Warburton, E. A. (2011b). Diffusion tensor imaging in the study of language and aphasia. Aphasiology 25, 543–558. doi: 10.1080/02687038.2010.534803
Geva, S., Jones, P. S., Crinion, J. T., Price, C. J., Baron, J. C., and Warburton, E. A. (2011c). The neural correlates of inner speech defined by voxel-based lesion-symptom mapping. Brain 134, 3071–3082. doi: 10.1093/brain/awr232
Geva, S., and Warburton, E. A. (2019). A test battery for inner speech functions. Arch. Clin. Neuropsychol. 34, 97–113. doi: 10.1093/arclin/acy018
Gómez-Robles, A., Hopkins, W. D., Schapiro, S. J., and Sherwood, C. C. (2015). Relaxed genetic control of cortical organization in human brains compared with chimpanzees. Proc. Natl. Acad. Sci. USA 112, 14799–14804. doi: 10.1073/pnas.1512646112
Hagoort, P. (2014). Nodes and networks in the neural architecture for language: Broca’s region and beyond. Curr. Opin. Neurobiol. 28, 136–141. doi: 10.1016/J.CONB.2014.07.013
Hasan, K. M., Kamali, A., Abid, H., Kramer, L. A., Fletcher, J. M., and Ewing-Cobbs, L. (2010). Quantification of the spatiotemporal microstructural organization of the human brain association, projection and commissural pathways across the lifespan using diffusion tensor tractography. Brain Struct. Funct. 214, 361–373. doi: 10.1007/s00429-009-0238-0
Hermoye, L., Saint-Martin, C., Cosnard, G., Lee, S.-K., Kim, J., Nassogne, M.-C., et al. (2006). Pediatric diffusion tensor imaging: normal database and observation of the white matter maturation in early childhood. NeuroImage 29, 493–504. doi: 10.1016/j.neuroimage.2005.08.017
Hickok, G., and Poeppel, D. (2007). The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393–402. doi: 10.1038/nrn2113
Hoeft, F., Meyler, A., Hernandez, A., Juel, C., Taylor-Hill, H., Martindale, J. L., et al. (2007). Functional and morphometric brain dissociation between dyslexia and reading ability. Proc. Natl. Acad. Sci. USA 104, 4234–4239. doi: 10.1073/pnas.0609399104
Huang, H., Zhang, J., Wakana, S., Zhang, W., Ren, T., Richards, L. J., et al. (2006). White and gray matter development in human fetal, newborn and pediatric brains. NeuroImage 33, 27–38. doi: 10.1016/j.neuroimage.2006.06.009
Hunter, M. D., Eickhoff, S. B., Miller, T. W. R., Farrow, T. F. D., Wilkinson, I. D., and Woodruff, P. W. R. (2006). Neural activity in speech-sensitive auditory cortex during silence. Proc. Natl. Acad. Sci. USA 103, 189–194. doi: 10.1073/pnas.0506268103
Hurlburt, R. T., Alderson-Day, B., Kühn, S., and Fernyhough, C. (2016). Exploring the ecological validity of thinking on demand: neural correlates of elicited vs. spontaneously occurring inner speech. PLoS One 11:e0147932. doi: 10.1371/journal.pone.0147932
Jackendoff, R. (1996). How language helps us think. Pragmat. Cogn. 4, 1–34. doi: 10.1075/pc.4.1.03jac
Kell, C. A., Darquea, M., Behrens, M., Cordani, L., Keller, C., and Fuchs, S. (2017). Phonetic detail and lateralization of reading-related inner speech and of auditory and somatosensory feedback processing during overt reading. Hum. Brain Mapp. 38, 493–508. doi: 10.1002/hbm.23398
Keller, T. A., and Just, M. A. (2009). Altering cortical connectivity: remediation-induced changes in the white matter of poor readers. Neuron 64, 624–631. doi: 10.1016/j.neuron.2009.10.018
Kidd, E., Donnelly, S., and Christiansen, M. H. (2018). Individual differences in language acquisition and processing. Trends Cogn. Sci. 22, 154–169. doi: 10.1016/j.tics.2017.11.006
Koubeissi, M. Z., Lesser, R. P., Sinai, A., Gaillard, W. D., Franaszczuk, P. J., and Crone, N. E. (2012). Connectivity between perisylvian and bilateral basal temporal cortices. Cereb. Cortex 22, 918–925. doi: 10.1093/cercor/bhr163
Kragler, S. (1995). The transition from oral to silent reading. Read. Psychol. 16, 395–408. doi: 10.1080/0270271950160402
Kühn, S., Fernyhough, C., Alderson-Day, B., and Hurlburt, R. T. (2014). Inner experience in the scanner: can high fidelity apprehensions of inner experience be integrated with fMRI? Front. Psychol. 5:1393. doi: 10.3389/fpsyg.2014.01393
Kümmerer, D., Hartwigsen, G., Kellmeyer, P., Glauche, V., Mader, I., Klöppel, S., et al. (2013). Damage to ventral and dorsal language pathways in acute aphasia. Brain 136, 619–629. doi: 10.1093/brain/aws354
Lai, C. S. L., Fisher, S. E., Hurst, J. A., Vargha-Khadem, F., and Monaco, A. P. (2001). A forkhead-domain gene is mutated in a severe speech and language disorder. Nature 413, 519–523. doi: 10.1038/35097076
Langland-Hassan, P., Faries, F. R., Richardson, M. J., and Dietz, A. (2015). Inner speech deficits in people with aphasia. Front. Psychol. 6. doi: 10.3389/fpsyg.2015.00528
Lebel, C., Walker, L., Leemans, A., Phillips, L., and Beaulieu, C. (2008). Microstructural maturation of the human brain from childhood to adulthood. NeuroImage 40, 1044–1055. doi: 10.1016/j.neuroimage.2007.12.053
Lidstone, J. S. M., Meins, E., and Fernyhough, C. (2012). Verbal mediation of cognition in children with specific language impairment. Dev. Psychopathol. 24, 651–660. doi: 10.1017/S0954579412000223
Lopez-Barroso, D., Catani, M., Ripolles, P., Dell’Acqua, F., Rodriguez-Fornells, A., and de Diego-Balaguer, R. (2013). Word learning is mediated by the left arcuate fasciculus. Proc. Natl. Acad. Sci. USA 110, 13168–13173. doi: 10.1073/pnas.1301696110
Lopez-Barroso, D., De Diego-Balaguer, R., Cunillera, T., Camara, E., Münte, T. F., and Rodriguez-Fornells, A. (2011). Language learning under working memory constraints correlates with microstructural differences in the ventral language pathway. Cereb. Cortex 21, 2742–2750. doi: 10.1093/cercor/bhr064
Luria, A. R. (1965). L.S. Vygotsky and the problem of localization of functions. Neuropsychologia 3, 387–392. doi: 10.1016/0028-3932(65)90012-6
Lurito, J. T., Kareken, D. A., Lowe, M. J., Chen, S. H. A., and Mathews, V. P. (2000). Comparison of rhyming and word generation with FMRI. Hum. Brain Mapp. 10, 99–106. doi: 10.1002/1097-0193(200007)10:3<99::aid-hbm10>3.0.co;2-q
Makris, N., Kennedy, D. N., McInerney, S., Sorensen, A. G., Wang, R., Caviness, V. S., et al. (2005). Segmentation of subcomponents within the superior longitudinal fascicle in humans: a quantitative, in vivo, DT-MRI study. Cereb. Cortex 15, 854–869. doi: 10.1093/cercor/bhh186
Martino, J., De Witt Hamer, P. C., Berger, M. S., Lawton, M. T., Arnold, C. M., de Lucas, E. M., et al. (2013). Analysis of the subcomponents and cortical terminations of the perisylvian superior longitudinal fasciculus: a fiber dissection and DTI tractography study. Brain Struct. Funct. 218, 105–121. doi: 10.1007/s00429-012-0386-5
Matsumoto, R., Nair, D. R., LaPresto, E., Najm, I., Bingaman, W., Shibasaki, H., et al. (2004). Functional connectivity in the human language system: a cortico-cortical evoked potential study. Brain 127, 2316–2330. doi: 10.1093/brain/awh246
Morgan, A., Bonthrone, A., and Liegeois, F. J. (2016). Brain basis of childhood speech and language disorders: are we closer to clinically meaningful MRI markers? Curr. Opin. Pediatr. 28, 725–730. doi: 10.1097/MOP.0000000000000420
Nooteboom, S. G. (2005). Lexical bias revisited: detecting, rejecting and repairing speech errors in inner speech. Speech Comm. 47, 43–58. doi: 10.1016/j.specom.2005.02.003
Oppenheim, G. M., and Dell, G. S. (2008). Inner speech slips exhibit lexical bias, but not the phonemic similarity effect. Cognition 106, 528–537. doi: 10.1016/j.cognition.2007.02.006
Owen, W. J., Borowsky, R., and Sarty, G. E. (2004). FMRI of two measures of phonological processing in visual word recognition: ecological validity matters. Brain Lang. 90, 40–46. doi: 10.1016/s0093-934x(03)00418-8
Özdemir, R., Roelofs, A., and Levelt, W. J. M. (2007). Perceptual uniqueness point effects in monitoring internal speech. Cognition 105, 457–465. doi: 10.1016/j.cognition.2006.10.006
Paap, K. R., McDonald, J. E., Schvaneveldt, R. W., and Noel, R. W. (1987). “Frequency and pronounceability in visually presented naming and lexical-decision tasks” in Attention and performance XII. ed. M. Coltheart (Hillsdale, NJ: Lawrence Erlbaum Associates, Publishers), 221–244.
Paap, K. R., and Noel, R. W. (1991). Dual-route models of print to sound–still a good horse race. Psychol. Res. 53, 13–24. doi: 10.1007/BF00867328
Parker, G. J. M., Luzzi, S., Alexander, D. C., Wheeler-Kingshott, C. A. M., Clecarelli, O., and Ralph, M. A. L. (2005). Lateralization of ventral and dorsal auditory-language pathways in the human brain. NeuroImage 24, 656–666. doi: 10.1016/j.neuroimage.2004.08.047
Paulesu, E., Frith, C. D., and Frackowiak, R. S. J. (1993). The neural correlates of the verbal component of working memory. Nature 362, 342–345. doi: 10.1038/362342a0
Paus, T., Zijdenbos, A., Worsley, K., Collins, D. L., Blumenthal, J., Giedd, J. N., et al. (1999). Structural maturation of neural pathways in children and adolescents: in vivo study. Science 283, 1908–1911. doi: 10.1126/SCIENCE.283.5409.1908
Pei, X., Leuthardt, E. C., Gaona, C. M., Brunner, P., Wolpaw, J. R., and Schalk, G. (2011). Spatiotemporal dynamics of electrocorticographic high gamma activity during overt and covert word repetition. NeuroImage 54, 2960–2972. doi: 10.1016/j.neuroimage.2010.10.029
Perani, D., Saccuman, M. C., Scifo, P., Anwander, A., Spada, D., Baldoli, C., et al. (2011). Neural language networks at birth. Proc. Natl. Acad. Sci. USA 108, 16056–16061. doi: 10.1073/pnas.1102991108
Perrone-Bertolotti, M., Rapin, L., Lachaux, J. P., Baciu, M., and Lœvenbruck, H. (2014). What is that little voice inside my head? Inner speech phenomenology, its role in cognitive performance, and its relation to self-monitoring. Behav. Brain Res. 261, 220–239. doi: 10.1016/j.bbr.2013.12.034
Petrides, M., and Pandya, D. N. (2009). Distinct parietal and temporal pathways to the homologues of Broca’s area in the monkey. PLoS Biol. 7:16. doi: 10.1371/journal.pbio.1000170
Pierpaoli, C., and Basser, P. J. (1996). Toward a quantitative assessment of diffusion anisotropy. Magn. Reson. Med. 36, 893–906.
Pierpaoli, C., Jezzard, P., Basser, P. J., Barnett, A., and DiChiro, G. (1996). Diffusion tensor MR imaging of the human brain. Radiology 201, 637–648.
Pinker, S., and Jackendoff, R. (2005). The faculty of language: what’s special about it? Cognition 95, 201–236. doi: 10.1016/J.COGNITION.2004.08.004
Poldrack, R. A., Temple, E., Protopapas, A., Nagarajan, S., Tallal, P., Merzenich, M., et al. (2001). Relations between the neural bases of dynamic auditory processing and phonological processing: evidence from fMRI. J. Cogn. Neurosci. 13, 687–697. doi: 10.1162/089892901750363235
Prior, S. M., and Welling, K. A. (2001). Read in your head: a Vygotskian analysis of the transition from oral to silent reading. Read. Psychol. 22, 1–15. doi: 10.1080/02702710151130172
Pugh, K. R., Mencl, W. E., Jenner, A. R., Katz, L., Frost, S. J., Lee, J. R., et al. (2000). Functional neuroimaging studies of reading and reading disability (developmental dyslexia). Ment. Retard. Dev. Disabil. Res. Rev. 6, 207–213. doi: 10.1002/1098-2779(2000)6:3<207::AID-MRDD8>3.0.CO;2-P
Pugh, K. R., Shaywitz, B. A., Shaywitz, S. E., Constable, R. T., Skudlarski, P., Fulbright, R. K., et al. (1996). Cerebral organization of component processes in reading. Brain 119, 1221–1238.
Rastle, K., and Coltheart, M. (2000). Lexical and nonlexical print-to-sound translation of disyllabic words and nonwords. J. Mem. Lang. 42, 342–364. doi: 10.1006/jmla.1999.2687
Rijntjes, M., Weiller, C., Bormann, T., and Musso, M. (2012). The dual loop model: its relation to language and other modalities. Front. Evol. Neurosci. 4:9. doi: 10.3389/fnevo.2012.00009
Rilling, J. K., Glasser, M. F., Preuss, T. M., Ma, X., Zhao, T., Hu, X., et al. (2008). The evolution of the arcuate fasciculus revealed with comparative DTI. Nat. Neurosci. 11, 426–428. doi: 10.1038/nn2072
Rosen, H. J., Ojemann, J. G., Ollinger, J. M., and Petersen, S. E. (2000). Comparison of brain activation during word retrieval done silently and aloud using fMRI. Brain Cogn. 42, 201–217. doi: 10.1006/brcg.1999.1100
Sale, A. (2018). A systematic look at environmental modulation and its impact in brain development. Trends Neurosci. 41, 4–17. doi: 10.1016/j.tins.2017.10.004
Sanjuan, J., Tolosa, A., Gonzalez, J. C., Aguilar, E. J., Perez-Tur, J., Najera, C., et al. (2006). Association between FOXP2 polymorphisms and schizophrenia with auditory hallucinations. Psychiatr. Genet. 16, 67–72. doi: 10.1097/01.ypg.0000185029.35558.bb
Sato, M., Baciu, M., Loevenbruck, H., Schwartz, J. L., Cathiard, M. A., Segebarth, C., et al. (2004). Multistable representation of speech forms: a functional MRI study of verbal transformations. NeuroImage 23, 1143–1151. doi: 10.1016/j.neuroimage.2004.07.055
Satterthwaite, T. D., Elliott, M. A., Ruparel, K., Loughead, J., Prabhakaran, K., Calkins, M. E., et al. (2014). Neuroimaging of the Philadelphia neurodevelopmental cohort. NeuroImage 86, 544–553. doi: 10.1016/j.neuroimage.2013.07.064
Saur, D., Kreher, B. W., Schnell, S., Kummerer, D., Kellmeyer, P., Vry, M. S., et al. (2008). Ventral and dorsal pathways for language. Proc. Natl. Acad. Sci. USA 105, 18035–18040. doi: 10.1073/pnas.0805234105
Schlaggar, B. L., and McCandliss, B. D. (2007). Development of neural systems for reading. Annu. Rev. Neurosci. 30, 475–503. doi: 10.1146/annurev.neuro.28.061604.135645
Schmithorst, V. J., Wilke, M., Dardzinski, B. J., and Holland, S. K. (2002). Correlation of white matter diffusivity and anisotropy with age during childhood and adolescence: a cross-sectional diffusion-tensor MR imaging study. Radiology 222, 212–218. doi: 10.1148/radiol.2221010626
Scholz, J., Klein, M. C., Behrens, T. E. J., and Johansen-Berg, H. (2009). Training induces changes in white-matter architecture. Nat. Neurosci. 12, 1370–1371. doi: 10.1038/nn.2412
Schomers, M. R., Garagnani, M., and Pulvermüller, F. (2017). Neurocomputational consequences of evolutionary connectivity changes in perisylvian language cortex. J. Neurosci. 37, 3045–3055. doi: 10.1523/JNEUROSCI.2693-16.2017
Schulze, K., Vargha-Khadem, F., and Mishkin, M. (2018). Phonological working memory and FOXP2. Neuropsychologia 108, 147–152. doi: 10.1016/j.neuropsychologia.2017.11.027
Seidenberg, M. S., and McClelland, J. L. (1989). A distributed, developmental model of word recognition and naming. Psychol. Rev. 96, 523–568. doi: 10.1037/0033-295X.96.4.523
Shuster, L. I., and Lemieux, S. K. (2005). An fMRI investigation of covertly and overtly produced mono- and multisyllabic words. Brain Lang. 93, 20–31. doi: 10.1016/j.bandl.2004.07.007
Skeide, M. A., Brauer, J., and Friederici, A. D. (2016). Brain functional and structural predictors of language performance. Cereb. Cortex 26, 2127–2139. doi: 10.1093/cercor/bhv042
Slevc, L. R., and Ferreira, V. S. (2006). Halting in single word production: a test of the perceptual loop theory of speech monitoring. J. Mem. Lang. 54, 515–540. doi: 10.1016/j.jml.2005.11.002
Stark, B. C., Geva, S., and Warburton, E. A. (2017). Inner speech’s relationship with overt speech in poststroke aphasia. J. Speech Lang. Hear. Res. 60, 2406–2415. doi: 10.1044/2017_JSLHR-S-16-0270
Tak, H. J., Kim, J. H., and Son, S. M. (2016). Developmental process of the arcuate fasciculus from infancy to adolescence: a diffusion tensor imaging study. Neural Regen. Res. 11, 937–943. doi: 10.4103/1673-5374.184492
Thiebaut de Schotten, M., Dell’Acqua, F., Valabregue, R., and Catani, M. (2012). Monkey to human comparative anatomy of the frontal lobe association tracts. Cortex 48, 82–96. doi: 10.1016/j.cortex.2011.10.001
Tolosa, A., Sanjuán, J., Dagnall, A. M., Moltó, M. D., Herrero, N., and de Frutos, R. (2010). FOXP2 gene and language impairment in schizophrenia: association and epigenetic studies. BMC Med. Genet. 11:114. doi: 10.1186/1471-2350-11-114
Travers, B. G., Adluru, N., Ennis, C., Tromp, D. P. M., Destiche, D., Doran, S., et al. (2012). Diffusion tensor imaging in autism spectrum disorder: a review. Autism Res. 5, 289–313. doi: 10.1002/aur.1243
Vanderauwera, J., Wouters, J., Vandermosten, M., and Ghesquière, P. (2017). Early dynamics of white matter deficits in children developing dyslexia. Dev. Cogn. Neurosci. 27, 69–77. doi: 10.1016/j.dcn.2017.08.003
Vandermosten, M., Boets, B., Wouters, J., and Ghesquière, P. (2012). A qualitative and quantitative review of diffusion tensor imaging studies in reading and dyslexia. Neurosci. Biobehav. Rev. 36, 1532–1552. doi: 10.1016/j.neubiorev.2012.04.002
Vandermosten, M., Vanderauwera, J., Theys, C., De Vos, A., Vanvooren, S., Sunaert, S., et al. (2015). A DTI tractography study in pre-readers at risk for dyslexia. Dev. Cogn. Neurosci. 14, 8–15. doi: 10.1016/j.dcn.2015.05.006
Vargha-Khadem, F., Gadian, D. G., Copp, A., and Mishkin, M. (2005). FOXP2 and the neuroanatomy of speech and language. Nat. Rev. Neurosci. 6, 131–138. doi: 10.1038/nrn1605
Weiller, C., Musso, M., Rijntjes, M., and Saur, D. (2009). Please don’t underestimate the ventral pathway in language. Trends Cogn. Sci. 13, 369–370. doi: 10.1016/j.tics.2009.06.007
Williams, D. M., Bowler, D. M., and Jarrold, C. (2012). Inner speech is used to mediate short-term memory, but not planning, among intellectually high-functioning adults with autism spectrum disorder. Dev. Psychopathol. 24, 225–239. doi: 10.1017/S0954579411000794
Winsler, A., Fernyhough, C., and Montero, I. (2009). Private speech, executive functioning, and the development of verbal self-regulation. Cambridge, UK: Cambridge University Press.
Yeatman, J. D., Dougherty, R. F., Ben-Shachar, M., and Wandell, B. A. (2012). Development of white matter and reading skills. Proc. Natl. Acad. Sci. USA 109, E3045–E3053. doi: 10.1073/pnas.1206792109
Yeatman, J. D., Dougherty, R. F., Rykhlevskaia, E., Sherbondy, A. J., Deutsch, G. K., Wandell, B. A., et al. (2011). Anatomical properties of the arcuate fasciculus predict phonological and reading skills in children. J. Cogn. Neurosci. 23, 3304–3317. doi: 10.1162/jocn_a_00061
Zhang, J., Evans, A., Hermoye, L., Lee, S.-K., Wakana, S., Zhang, W., et al. (2007). Evidence of slow maturation of the superior longitudinal fasciculus in early childhood by diffusion tensor imaging. NeuroImage 38, 239–247. doi: 10.1016/j.neuroimage.2007.07.033
Keywords: neural developmental mechanism, dorsal language pathway, ventral language pathway, arcuate fasciculus, superior longitudinal fasciculus
Citation: Geva S and Fernyhough C (2019) A Penny for Your Thoughts: Children’s Inner Speech and Its Neuro-Development. Front. Psychol. 10:1708. doi: 10.3389/fpsyg.2019.01708
Edited by:
Thomas M. Brinthaupt, Middle Tennessee State University, United StatesReviewed by:
Emily M. Elliott, Louisiana State University, United StatesCyrille Magne, Middle Tennessee State University, United States
Copyright © 2019 Geva and Fernyhough. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sharon Geva, cy5nZXZhQHVjbC5hYy51aw==