- Department of Psychology, Smith College, Northampton, MA, United States
Introduction: The role of language in false belief reasoning has been much debated for twenty-five years or more, especially the relative contributions of general language development, complement syntax, vocabulary, and executive function. However, the empirical studies so far have fallen short, in that they generally have too few participants for adequate statistical modeling; they do not include control variables; or they are cross-sectional rather than longitudinal, making inferences about causal direction much more tenuous.
Methods: The present study considers the role of these different variables in the development of false belief reasoning over several months of testing, with 258 children aged three to five years. The children are also from under-resourced communities, broadening the populations that generally contribute such data.
Results: A cross-sectional and a longitudinal regression analysis reveals the contribution of each variable to the children’s success on the false belief measures. Finally, a structural equation model tests the relative contribution of the different potential factors over time, how they interact, and change. The model is an excellent fit to the data. Inhibitory control, complement comprehension and vocabulary all have effects on false belief reasoning at the first time point (T1). However, at T3, the major proximal contribution is the child’s comprehension of complements, though the longitudinal pathways of vocabulary and inhibitory control also pave the way.
Discussion: Our data confirm the specific contribution of complement syntax but also makes clear, as do training studies, that a certain amount of preparedness in vocabulary and in executive function skills is also necessary.
1 Introduction
1.1 The course of early theory of mind
Theory of mind, defined as the understanding that others’ mental states may be different from one’s own, has been studied exhaustively for four decades now, and there remain many debates about the influences on the developmental path children take. The general developmental course itself is not disputed, though the extent of cross-cultural variation is still not established. The stepwise development defined by Wellman and Liu (2004) in terms of definable and well-matched experimental tasks is generally accepted, though there is still uncertainty about the causal necessity of each stage. The first step in their developmental sequence, at around age 18 months, is understanding others’ desires, that is, recognizing that two people may want different things than the child does (Repacholi and Gopnik, 1997). Following this, the child comes to understand that people will act according to their beliefs, even if different from the child’s. Understanding that “seeing leads to knowing,” or knowledge access, is slightly later. Realizing that others can have false beliefs, when the child has evidence to the contrary, comes in around age four or five years. Finally, understanding “hidden emotions” is found to be a still later development (Wellman et al., 2011). However, the sequence of middle steps may be different for Chinese children, where understanding different beliefs may follow knowledge access (Wellman et al., 2006; Yu and Wellman, 2024). Still further theory of mind skills, such as second order false belief reasoning (Miller, 2009) and understanding sarcasm and irony (Filippova and Astington, 2010; Winner, 1997) do not develop until early school age.
Digging deeper, it is clear there are also important developments before age two. In the first year of life, children are sensitive to others’ intentions and goals, and differentiate between intentional and unintentional actions (Woodward, 2009). When learning their first language, children make use of shared attention with a caregiver to establish the referents of early words (Baldwin, 1995; Tomasello, 1999). Furthermore, infants in the second year of life can track which objects another person has seen before, and will show that person toys that they have not seen, or point out to them where an object was hidden (Tomasello, 1999).
Why has false belief reasoning occupied center stage in the literature about theory of mind? Philosophers and psychologists have considered the achievement of false belief understanding to be a watershed not only in early child development (Dennett, 1978; Wellman et al., 2001) but also in assessing the theory of mind capabilities of other species (Call and Tomasello, 2008; Premack and Woodruff, 1978) and recently, of A.I. (Strachan et al., 2024) In a false-belief task, you must hold in memory alternative perspectives within a given context (e.g., where a target object actually is and where a character thinks it is). You then must also inhibit the prepotent response of selecting a perspective that is congruent with your own (i.e., selecting where you know the target object to be) (Leslie et al., 2005). To understand another’s false beliefs is to entertain ideas inconsistent with the reality you know, and to reason from that representation about either the cause or the consequence of that belief. Westra and Nagel (2021) refer to this as non-factive reasoning, in contrast to the factive reasoning that is entailed in recognizing what someone knows. In recognizing knowledge, you pay attention to whether the person has perceptual access to the thing or event in question, and since you share the perception, there is no contradiction in perspective. We know that quite young children are active observers of attention and perceptual access, but do not seem attentive to mistaken beliefs.
Since the work of Wellman and Liu (2004), there has been an explosion of research on infant theory of mind, with some researchers claiming that given the right tasks, infants can be shown to recognize that another character has a false belief (Baillargeon et al., 2010; Scott and Baillargeon, 2017). The theory of mind tasks used with infants are called implicit tasks, because no behavioral decision is required from the infant, that is, there is no step of response selection. The first type of task uses the length of gaze as the measure. Infants in the second year of life or even younger have been shown to gaze for a more protracted time at events in which a human character acts in a way contrary to expectation, namely, a way that is not in keeping with the belief they should have formed (Onishi and Baillargeon, 2005). For example, in a version of an unseen displacement task, they are surprised when the character being observed goes to a location where an object really is, when the character did not see it move there. In a second task type, infants look expectantly at a location where a character should go, based on where that character falsely believes something to be (Southgate et al., 2007). In a third design, very young children come to the assistance of another individual specifically if that person was not witness to how something works (Buttelmann et al., 2009). Clearly, success at a young age on implicit tasks calls into question the necessity of language as an aid to reasoning.
The age gap between these tasks and explicit false belief mastery around age four requires explanation. On the account provided by several researchers (e.g., Baillargeon et al., 2010), the gap is a result of the task demands of explicit tasks, namely, response selection sets the bar too high for infants. On this account infants have implicit understanding of others’ beliefs, as rich as that of older children or adults (Baillargeon et al., 2018). However, others ask: Are the infants in these studies acting on the basis of a belief attribution, or something simpler (Southgate, 2013)? The most reductive explanation is that the infant is responding to some accidental but correlated feature of the set-up (Heyes, 2014). Some have found a lack of connection between the looking tasks and the helping tasks (e.g., Poulin-Dubois and Yott, 2018), arguing against abstract mental states underlying both. Other theorists suggest that the child is responsive to a behavioral rule, such as “people go to where they last saw something” (Perner and Ruffman, 2005). Southgate and Vernetti (2014) suggested that infants may be able to follow an agent’s point of view at ages as young as 6 months, but the difference is that they do not yet contrast it with their own. In a widely-cited account, Apperly and Butterfill (2009) proposed that infants may “register” another’s belief rather than represent it. Registration allows infants to trace and track another’s belief but in a limited way that betrays it as different from mature representation, for example, if the belief being tracked has relationships to other mental states, or involves subtle properties not easily captured by vision. As one example, registration may be restricted to tracking expected location rather than how a person construes an object (see also Low, 2010). On Apperly and Butterfill’s two-systems account, the registration that infants use gives way to real representation in older children.
Unfortunately, these fascinating ideas remain in doubt due to a replicability crisis. Many failures to replicate the findings have been reported across developmental labs, and the difficulty is that failures do not get published (Kulke and Rakoczy, 2018; Kulke et al., 2018; Rakoczy, 2012; but see Baillargeon et al., 2018). Brooks and Meltzoff (2015) found continuity in a small group of nineteen children from the first implicit understanding in anticipatory looking in infancy at 10.5 months through to explicit false belief development at age 4.5 years. However, implicit theory of mind tasks used beyond infancy do not show a consistent relationship to explicit tasks with the same preschool children (Low, 2010; Grosse Wiesmann et al., 2017). Nevertheless, the putative gap between infant and four-year-old theory of mind needs explanation and a popular suggestion is that the child needs to develop a mature executive function before success on explicit false belief tasks, because of the cognitive demands of the task. We will refer to this as the executive function hypothesis.
1.2 Executive functions and theory of mind
Executive function development is on a time course that has some correspondence with theory of mind changes, with some significant maturation happening around four or five years of age. Two aspects in particular have been highlighted as potential determinants of the change in false belief reasoning development in previous research. One is working memory (Brandt et al., 2023; Carlson et al., 2002), and the second is inhibitory control (Carlson et al., 2015; Carlson and Moses, 2001; Carlson et al., 2002). In an explicit false-belief task, it is necessary to hold alternatives in memory (e.g., what an object actually is and what someone thinks it is). The child also has to inhibit the alternative that is in keeping with their own beliefs, and to resist that lure of reality. Inhibitory control would also seem to be prerequisite for the step of inhibiting the reality response. In fact, in an important meta-analysis of previous work on executive function and false belief reasoning, Devine and Hughes (2014) report that inhibitory control is a reliable predictor of false belief understanding in preschoolers. Nevertheless, is it important to consider the role of executive function in conjunction with the role of another significant skill developing over the preschool years, namely language.
1.3 Language and theory of mind
Alternative proposals have highlighted the role of language in the process of theory of mind development (de Villiers and de Villiers, 2009), and some theorists argue that language is an alternative to bridge the gap from infant implicit understanding to explicit false belief reasoning (Apperly and Butterfill, 2009). Studies have found that the child’s own language appears to be a key factor in mastery of explicit false belief understanding in young children (Astington and Baird, 2005; San Juan and Astington, 2012). However, is it learning mental state vocabulary (Shatz, 1994), engaging in rich discourse (Harris et al., 2005; Nelson, 2005), or acquiring grammatical structures that contribute to general reasoning (Farrar et al., 2017; Ruffman et al., 2002). In particular, are specific syntactic achievements necessary (de Villiers and de Villiers, 2000, 2014)?
1.3.1 Sentential complements
Explicit false belief reasoning requires minimally two processes:
a. The representation of alternatives, one of which is false.
b. The ability to inhibit a prepotent response in response selection.
The representation of false alternatives is much more easily done with language than with images or even words (Nordmeyer and de Villiers, 2019). Neither images nor words can be false, as they are not propositions in themselves. Of course, one can form different images representing different perspectives, but not one that is false, or negative, without something extra.
“Consider, for example, negation. It’s easy to tell somebody that it’s not going to rain. Try drawing them a picture of it’s not going to rain … Think about trying to draw a picture of “there’s not a giraffe standing beside me” (Fodor, 1994).
This is trivially easy to do, unless one needs to recognize what it is a picture of. Language itself has rich ways of representing not only different perspectives, for example through deixis (de Villiers, 2018), but it can also represent negation and other logical terms that images do not. The unique part about certain embedded sentences, or sentential complements, is that they can represent a false proposition inside a true sentence.
de Villiers (2007) has argued that complements constitute an example of language-as-cognitive tool that has special utility in representing the states of others’ minds. The complement in (1) is distinct from the adjunct clause in (2), because the embedded proposition in (1) can be false:
1. Miriam said that she baked the bread.
2. Miriam relaxed after she baked the bread.
The complement structure only occurs under communication and mental state verbs. Thus, complements can express mistakes and lies, and with mental state verbs like think or believe, they can describe false beliefs. Possible worlds can be described in which those propositions could be true, namely, worlds in the mind of the sentence subject. Philosophers call these propositional attitudes, in which sentences express false propositions as belonging to another’s mind or perspective. While it is true that specific vocabulary words exist to name the state of a false belief, on its own a verb such as “deluded” (3) does not capture the content of the delusion, and the contents of propositional attitudes (4) matter in predicting what the subject will do, or in explaining what they have done.
1. Miriam’s friend was deluded
2. Miriam’s friend thought that Miriam baked the bread.
The properties that sentential complements have for capturing mental states are discussed extensively in the literature on propositional attitudes (Davidson, 1984; Richard, 1990; Segal, 1998).
The special characteristic of complements is their contrast of two perspectives, making them uniquely suited to the representation of propositional attitudes such as belief. There are multiple more subtle differences among complements in English and other languages, representing a network of precise decisions in the learning process for children to master (de Villiers and Roeper, 2016). One difference is whether the embedded clause is tensed: non-finite clauses have neither tense, nor an independent truth-value (5):
1. Anna wanted him to play in the garden.
The phrase “him to play in the garden” is not something one can evaluate for truth, unlike the final clause in a tensed complement (6):
1. Anna thought he was playing in the garden.
Children master nonfinite complements (7) before they master tensed, finite complements (8), and do not make “reality” mistakes with the former, even when there is a contrast in what happened (de Villiers et al., 2012),
1. Anna told him to play in the garden, but he went to sleep instead.
What did Anna tell him to do? – Three-year-olds: play in the garden.
1. Anna said he was playing in the garden, but he went to sleep instead.
What did Anna say he was doing? – Three-year-olds: sleeping.
De Villiers (1995) argued that much of this grammar is established with verbs of communication, as they share many of the same linguistic distinctions with mental verbs, but the difference is that communication events are overt and not covert as with hidden mental states. The child can understand the reference of a verb like “say,” and has undoubtedly heard household arguments about events of speech in which truth can be checked:
1. You said you fed the dog but you did not!
Think of the mindreading needed instead to produce:
1. You thought you fed the dog but you did not!
At the end of this acquisition process, the child understands the syntax of finite, non-factive, complements. However, the fine-grained distinctions in meaning among mental verbs-which number in the hundreds-still take time and experience in the world, a process beyond syntax itself, and into discourse pragmatics.
But how could acquiring the structure of complements play a role in establishing the very concept of false belief? It should surely be otherwise, namely that first the child understands false beliefs and second, learns to encode them in complements. Then one would see a correlation between the two achievements, but the lines of causation would be the other way round. Further, many researchers argue in favor of a bidirectional relationship between ToM and language. On the one hand, the child’s growing understanding of people’s actions, goals, and desires surely provides some conceptual grounding for the meaning involved. Mental states are unobservable entities, so hearing a new word like “think” or “forget” will prompt children to consider what these terms might be labeling, and to conceptualize the distinctions among them (Pyers and Senghas, 2009; Low, 2010). Importantly though, the appearance of these verbs in sentence frames with complements sets limits on their possible meaning (Papafragou, 2001). Could sentential complement structures like (1) have a particular enabling function for false belief reasoning? This particular debate has resulted in much attention, both theoretically (Hinzen, 2007; de Villiers, 2007) and empirically (Boeg Thomsen et al., 2021; Brandt et al., 2023; Farrar et al., 2017; Cheung et al., 2004).
1.3.2 Empirical studies
We know that children begin using verbs such as think and know from an early age (Bartsch and Wellman, 1995; Diessel and Tomasello, 2001; Shatz, 1994; Shatz et al., 1983) but their first uses may be less like expressions of propositional attitudes than like stereotyped forms, often self-referent, with narrow functions. The forms in (11) and (12) do not capture the contrast between truth and reality, or contrasts across minds, unlike the form in (1).
1. I do not know (used as an escape from questioning).
2. I think it’s a dog (I think used as “maybe”).
Examples like (12) led Diessel and Tomasello (2001) to argue that children treat the high-frequency string “I think” more like an adverb, without explicit mental-state reference. As a result, hearing examples like (12) with first-person subjects might not help children to recognize false-belief understanding. The very first expressions of third person propositional attitudes seem to emerge around 3 or 3.5 years in spontaneous speech, and occur more rarely (Bartsch and Wellman, 1995). However, in experimental settings when children are tested about understanding the forms, consistent difficulty is revealed. For instance, de Villiers (1999) arranged scenarios as in (13) in which characters made statements that were either lies or mistakes, such as:
1. The woman said she found her slipper. But look, it was really a mouse.
What did the woman say she found?
Three-year-olds consistently answer “mouse,” even though the answer is provided in the sentence and one can argue that no “mind reading” is necessary in the situation. Answering correctly entails understanding the discourse and reconstructing what the question refers to, which is only possible if the syntax of complements is mastered. Four-and five-year-olds answer “slipper,” as do adults. A longitudinal study of three-and four-year-olds by de Villiers and Pyers (2002) and a very large cross-sectional study of 1,000 children aged four to ten in the standardization of the DELV language assessment test (Seymour et al., 2003) exposed the time course and uniformity of this development (de Villiers et al., 2003).
The finding of correlation between complements and performance on explicit false belief reasoning tasks has been documented now in several different languages: English (e.g., de Villiers and Pyers, 2002; Brandt et al., 2023; Boeg Thomsen et al., 2021) German (Perner et al., 2003; Grosse Wiesmann et al., 2017); Danish (Knuppel et al., 2007), Mandarin (Mo et al., 2014; Guo et al., 2022; Brandt et al., 2023; Li and Leung, 2024), and ASL (Schick et al., 2007). Most but not all of the studies exploring the connection have used the comprehension test originating in the work of de Villiers (1995). In a variant, Brandt et al. (2016) evaluated children’s comprehension of complements using first versus third person subjects, and found that only performance with third-person complements correlated with 4-year-olds’ false belief performance. Aksu-Koç et al. (2005) found that production of complements in Turkish predicted false belief reasoning better than the production of evidential markers did. In Dutch, de Mulder et al. avoided the truth contrasts of the complement comprehension task and asked the children simply to report what different people reported happening, when the truth of the matter was unknown. Success on that task proved unrelated to false belief reasoning. Yet in Brandt et al., in English, truth contrasts were also side-stepped: the child had to recall what was said in a complement across an intervening sentence that was merely a distractor, but did not deny its truth. In contrast to the results in de Mulder et al., performance on this task was well correlated with false beliefs. In the work in German by Grosse Wiesmann et al. (2017), the child’s imitation of complements was used instead, because imitation can betray whether children understand the crucial difference in placement of the complementizer in German. The children’s skill at accurate imitation of the complements was connected to their false belief reasoning. Hence, the basic result seems to survive most variations in the task used (but see Brandt et al. (2023) on complement production).
The effects are less clear in children learning Cantonese (Cheung et al., 2004; Cheung et al., 2009; Tardif et al., 2007), a language in which the surface markers of complementation are virtually non-existent and there is no wh-question movement. Tardif et al. (2007) reported a large longitudinal study of children learning Cantonese in Hong Kong, and though she found significant correlations between complement comprehension on the de Villiers and Pyers (2002) “memory for complements” task and false belief understanding, the children overall were surprisingly poor at the complement comprehension test, even at age six. The complements did not seem to be prerequisite for false belief reasoning in Cantonese, a finding echoed in a recent longitudinal path analysis by Siu and Cheung (2022). Yet in closely-related Mandarin, Guo et al. (2022) present robust evidence of a correlation between complements and false belief reasoning in both typical children and children with autism [see also Li and Leung, 2024 (this issue)].
Brandt et al. (2023) explored the CHILDES data on parent–child conversations in Mandarin and found that the parents, unlike their English counterparts, do not show the frequent use of “I think” as an epistemic adverb. In their experimental study, Brandt et al. (2023) found that discriminating the use of the non-factive verb falsely think from know complements was related to false belief understanding, regardless of the subject being first or third person. English speaking children, however, were affected by the person of the verb in a parallel study. Following a suggestion by Tardif and Wellman (2000) on the relative lack of discourse on mental states from Chinese parents to children, Mo et al. (2014) trained children on complements using verbs of communication. They found the training paid off in children’s improvement on false belief reasoning.
When children are language delayed, either by DLD (Developmental Language Disorder), by autism spectrum disorders, or by deafness, the path of development can be stretched out in a way that allows researchers to see the order of mastery (Durrleman et al. 2016). The case of language-delayed deaf children is especially relevant here, as these children do not usually have other associated developmental problems or neurodivergent minds, but may just have restricted access to the primary input. Children who are born to hearing parents and do not have the benefit of exposure to signed languages often struggle to learn oral language, and are delayed several years as a result. Schick et al. (2007) and Tager-Flusberg and Joseph (2005) studied these children’s language and false belief development, and report delays in false belief understanding concomitant with their language delay. Both vocabulary and complement comprehension predicted their false belief understanding, even though the false belief tasks were designed to be as minimally verbal as possible. That is, the language required for the task was ruled out as a variable. Since the linguistic skill is needed equally for non-or low-verbal false belief tasks, it suggests that language is needed to develop the false belief reasoning, not just to understand the classic tasks. In an extreme case in Nicaragua, researchers for several decades have studied the invention of a sign language NSL (Nicaraguan Sign Language) by deaf children sent away to special boarding schools. The first generation of such signers had relatively incomplete grammars, without sophisticated grammatical ways to indicate other people’s perspectives. The deficiency was then remedied in successive generations of learners. The developmental inversion is that older learners of NSL can be less competent than younger learners. Pyers (2004) and Pyers and Senghas (2009) found that those signers who could express propositional contents under mental state verbs were able to do false belief reasoning (in a nonverbal test), but those signers who did not, failed them even as adults. Incomplete language development hinders theory of mind development even in adult populations.
1.3.3 Explanatory accounts
Despite the empirical data primarily in support of complements as a factor in false belief development, the explanation is still very much debated. Sentential complements represent one example of complex language, possibly the apex of syntactic success in the preschool years. Perhaps general language development (usually indexed by vocabulary) or general syntactic skill are the real predictors of false belief, and complements are just representative of that stage of acquisition (Ruffman, et al., 2002). Call this the general language hypothesis. Some have argued that complements reflect the kind of rich discourse that encourages false belief reasoning, that is, the content of the theory about minds is exposed by such conversation (Harris et al., 2005; Nelson, 2005; Tompkins et al., 2018). This can be called the discourse content hypothesis. Alternatively, developing complement structures might scaffold the kind of careful reasoning that the scenarios entail, allowing the child to entertain multiple perspectives on the world and successfully choose among them (for a rich discussion of these and other alternatives, see San Juan and Astington, 2017; de Villiers, 2021). This we call the complements facilitate reasoning hypothesis.
Empirical results that might allow differentiation of these alternatives are ambivalent, as not all the studies included both general language and complementation measures in the same investigation. For example, Low (2010) found that understanding sentential complements predicted standard false belief tasks in a cross-sectional sample of English-speaking children, once age, nonverbal ability and implicit false belief scores were controlled, but no alternative indices of language were included. Meta-analyses provide some help with this question, but the results are somewhat mixed. In a meta-analysis in Milligan et al. (2007) considered all the language measures used by studies to that date. Various different language measures, including syntax (29%), general language (27%), semantics (23%), and receptive vocabulary (12%) made significant contributions to false belief understanding. However, memory for complements had the strongest effect size in relation to false belief reasoning, with an effect size of 44%, though such studies were less common. Farrar et al. (2017) provide an updated meta-analysis of those studies that contained data on the role of complements and general language as predictors of false belief reasoning. In 10 of the 18 studies (55%) that compared both, the general language hypothesis was supported over and above the specific role of complements. These studies used a wide variety of measures to assess “general language ability,” including receptive vocabulary and different measures of syntax development. However, six of these ten negative studies were for Cantonese and Korean. Mental state verbs differ in these Asian languages compared to English, in that the distinction between true and false beliefs is carried lexically in the verb (see Tardif et al., 2007). Nevertheless, the majority of these studies tested complements with communication verbs (except for Cheung, 2006; study 2). Thus Farrar et al. argue that even these cross-linguistic studies on Asian languages can be used to evaluate the relative contribution of complementation versus general language.
Since then, more recent work has used a variety of measures to distinguish the contribution of complements versus more general language, usually taking out the contribution of these skills and then testing whether complements explain added variance. Omit Brandt et al. (2023), and Boeg Thomsen et al. (2021) both contained other language measures, and after taking out the contribution of vocabulary and other syntax, found a significant additional contribution of complements measures to false belief understanding in keeping with the complements facilitate reasoning hypothesis. An exception mentioned is De Mulder et al. (2019) in a Dutch study, using a complement task that avoided truth contrasts altogether. That task made no contribution to false belief reasoning relative to other language measures such as vocabulary and general syntax. Hence the contribution made by the truth contrasts of non-factive complements is still a live issue (see also de Villiers et al., 2012).
Longitudinal studies are rarer, but they can help identify the direction of influence between the variables, as well as control for the level of initial false belief reasoning. It would be natural to propose that children at three or four years do not yet have the conceptual resources to consider others’ perspectives and mental worlds, leading to errors with false complements as a result of their failures to understand others’ false beliefs. De Villiers and Pyers (2002) studied a small group (N = 28) of children over a year in preschool, and tested them at four points on a battery of theory of mind and language tests. Though they had begun the study expecting that false belief understanding might be necessary for comprehending complementation, the reverse turned out to be the case. When children acquired an understanding of sentential complements, then they began to systematically pass false belief tasks. As a control, de Villiers and Pyers took a language sample and looked at the children’s general command of syntax using the IPSYN (Index of Productive Syntax, Scarborough, 1990), an index of the variety and complexity of sentence use. This measure was not as well linked to false belief mastery.
Boeg Thomsen et al. (2021) did a small longitudinal study with 45 English speaking children designed specifically to try to tease apart the potential contribution of two aspects of complement-clause acquisition. As discussed, their complement comprehension task varied from the more common one by not asking complex questions, and by entailing a distractor clause that did not contrast truth. They tested proficiency with the perspective-marking syntactic structure itself, and in a separate task, understanding of the specific mental verbs used, namely know versus think. As additional language tests, they included memory span, a test of vocabulary, and receptive grammar, and epistemic modals. Furthermore, they avoided complement structures in the false belief task altogether making this one of the most stringent tests to date. They found a robust effect linking third person complements to false belief reasoning when controlling for the rest, as well as an additional effect of mental verb understanding. These studies also are in keeping with the complements facilitate reasoning hypothesis.
1.4 Executive function and complements
Given two potential explanations of the changes in theory of mind reasoning in the later preschool years, is it possible to decide between the complements facilitate reasoning hypothesis and the executive function hypothesis? Some have argued that the complement comprehension task (de Villiers and Pyers, 2002) might in itself be an inhibitory control task, in that to answer the critical question, the reality response must be suppressed (Boeg Thomsen et al., 2021). Unfortunately, it is not often that language and executive function are pitted against one another in the same study. In the cross-sectional study by Jenkins and Astington (1996), there was no longer any effect of working memory (a component of EF) in their 3-and 4-year olds’ false belief understanding when controlling for language. Similarly, in Hughes (1998), taking out verbal skills reduced the effects of EF (working memory and inhibitory control) on false belief reasoning in their sample. However, those studies did not consider complements. Low (2010) found that both complement comprehension and executive functioning predicted explicit false belief reasoning in preschoolers, but he could not separate the effects of each of those variables in regression analyses.
Studying children with deafness and delayed language can provide more perspective here. In de Villiers and de Villiers (2012) the oral deaf children were on par with their hearing peers on executive function tasks, but they showed significant delays in false belief tasks even when the language demands of the task were minimized. They report a dissociation of deception and false belief tasks, in that the deaf children were equivalent to their hearing peers on deception games. Furthermore, language, and in particular complement syntax, proved to be the best predictors of false belief reasoning; however, executive function skills, especially inhibitory control, were the best predictors of deception.
A longitudinal study by Farrant et al. (2012) was rich enough to explore the relative contributions of vocabulary, general language, executive functioning and complements to false belief reasoning in English-speaking preschool children. Farrant et al. added to the model the variable of mind-mindedness (Meins et al., 2002), predicting that variation in maternal input about mental states (via perhaps the discourse content hypothesis) would predict children’s ability on sentence complements, which would then predict false belief understanding. Their sample included 91 typically-developing Australian children studied twice across a year. Importantly, the effects of variation in maternal mental talk were completely mediated by the children’s own competence at sentential complements, which predicted their belief ability. Cognitive flexibility was a further predictor, and the direction of effect was that sentential complements predicted this executive function index rather than vice versa. However, the Farrant study had a relatively small sample size for the number of variables, and they did not use structural equation modeling for the longitudinal portion of their study. The recent study by Boeg Thomsen et al. (2021) also tested the effects of inhibitory control as well as several other language variables in their longitudinal work, but there is a limit on the power of linear regressions with so few participants (45) and so many test variables. The regressions revealed a bidirectional effect of complements and false belief reasoning across time.
In sum, the existing studies are typically too small and contain too many variables for a sufficiently powerful statistical technique such as structural equation modeling that can differentiate direction of effects in a longitudinal study.
1.5 The current study
In the current study we had the opportunity to test the different theoretical models of false belief development on a large sample of low-income children studied over the course of several years on a research grant attached to a preschool curricular intervention study (Lonigan et al., 2015). In our project the children received a large battery of language, executive function, and theory of mind measures, and these were repeated several times over the course of the study, making this an ideal group to test competing models. Lonigan et al. (2015) is a report on the curriculum intervention, using some measures that overlapped with the current study but it does not include the measures of complex language, executive function or theory of mind used here.
The intervention study itself was a large-scale cluster-randomized investigation of the effects of an integrated literacy-and math-focused preschool curriculum. The basic curriculum incorporated central elements of the Literacy Express Curriculum (Lonigan et al., 2005) and Pre-K Mathematics (Klein et al., 2002), two preschool curricula rated as effective by the US Department of Education’s What Works Clearinghouse. One hundred and ten center-based preschools serving low-income communities in the Houston, TX and Tallahassee, FL areas were randomly assigned to one of three conditions: 1. the base curriculum with added explicit socio-emotional instruction [the Promoting Alternative Strategies Thinking curriculum (PATHS) (Domitrovich et al., 2007)]; 2. the base literacy and math curriculum and general classroom and behavior management instruction for the teachers, but with no explicit lessons and activities targeting socio-emotional skills (called the Implicit Socio-Emotional Condition); or 3. a “business-as-usual” control condition. There were significant positive impacts of the two versions of the curriculum on language, phonological awareness, math, and socioemotional outcomes. There were no added benefits to academic or socioemotional outcomes for the children receiving explicit socioemotional instruction (Lonigan et al., 2015). For this reason, in the current research we combine the first two conditions into the Intervention Condition and compare that to the Control Condition to explore the effects of the intervention on false belief reasoning in our sample of children.
Our study is unique in using a longitudinal design with a very large N (258 preschoolers), allowing for powerful statistical analyses to separate the effects of different possible predictors of the development of false belief reasoning. Both the familiar verbal false belief reasoning tasks (unseen object displacement and unexpected contents) and low verbal tasks (thought-bubble picture narratives) were employed, with enough false belief questions (14) to provide considerable variance in the dependent variable to tease apart effects of different independent variables. Most importantly, longitudinal measures of inhibitory control and several relevant measures of language acquisition were taken. The language measures included expressive vocabulary, morphosyntax, and complement comprehension and there were enough items in each measure to provide sufficient variance to separate the effects, if any, of each aspect of language. Prior studies of the effects of children’s language acquisition on their false belief reasoning have often had too few subjects for reliable statistical analyses, have taken too few measures of false belief reasoning, have not pitted executive functioning against language, or have not separated the effects of different aspects of language.
1.6 Predictions
Based on our review of past work, we make the following predictions:
1. Curricular intervention that enriches socio-emotional understanding is expected to improve false belief reasoning.
2. Inhibitory control will predict false belief reasoning concurrently but not across time when other variables such as language are controlled.
3. Language measures will predict false belief understanding both concurrently and over time.
4. Complement comprehension will be a significant predictor of false belief reasoning over and above other control variables, including more general language measures
5. Complement comprehension will be correlated with inhibitory control, but complements will be a more important proximal predictor of false belief reasoning compared to inhibitory control.
2 Method
2.1 Participants
The participants were 258 children from subsidized1 preschools in Texas and Florida. 90.4% of the children came from low-income families and were eligible for free lunches. Tables 1 and 2 give the sample statistics on gender and ethnicity. They were recruited as a part of a curriculum intervention project funded by the National Institutes of Health, the School Readiness Research Consortium. All of the children were assessed twice on a battery of socio-emotional, cognitive, quantitative, language and pre-reading tasks during the preschool year. The first time was in September at the beginning of the school year before the curriculum intervention (Time 1) and the second time was in April/May at the end of the school year, after the intervention (Time 3). The children ranged from 3.3 to 5.3 years old (Mean = 4.58, sd. = 0.34) at Time 1 and from 3.8 to 5.9 years old (Mean = 5.11, sd. = 0.34) at Time 3.
Out of 760 preschoolers who completed Time 3 of the NIH curriculum project, the 258 children in the present study included only those children who completed all of the verbal and low verbal false belief tasks as well as all of the relevant tests of inhibitory control and language acquisition at both Time 1 and Time 3. We thus avoided the issue of imputation of missing data. Given the intensity of the testing schedule across four projects, some loss of data was inevitable, especially as these were low-income day care centers with considerable changeover and parents who moved a lot. However, there is no evidence that the children that we excluded for missing tests (170) constituted a different population than those who completed all the measures (258). We checked this by comparing the two groups on the vocabulary measure, which was present for 100% of the sample, and also age. The groups did not differ statistically on either measure. Children who were not monolingual speakers of English were also excluded. Parents had to return a questionnaire on home language use to determine if the child was monolingual in English (183 of these were missing). Children who were reported by their primary caregiver as having more than 10% Spanish input and/or use were excluded from the present study to eliminate possible effects of bilingualism. There were 149 such exclusions.
2.2 Procedure
The children were tested individually on the full assessment battery in three one-hour testing sessions. The testing took place over several days, contingent on the availability of particular children in the center on a given day, and because the overall test battery including this study was very large. Tests were given in a fixed order across the children in each of these sessions.
2.2.1 Measures for the present study
Background measures
Nonverbal IQ
Children’s nonverbal IQ was assessed using the Pattern Analysis subtest of the Stanford-Binet IQ test, which measured children’s capacity to recognize abstract visual patterns and to solve pattern matching problems. This test was carried out at Time 2, midway between Time 1 and Time 3.
Verbal memory
Verbal memory was assessed with the Word Span subtest on the Comprehensive Test of Phonological Processing (CTOPP) (Wagner et al., 1999). The children were asked to repeat a string of familiar English words said quickly by the tester (e.g., “fish, bed, dog”) in the same order as they had been produced. The strings of words increased in length until the child failed to repeat them accurately. This test was also administered at Time 2.
False belief reasoning
Verbal FB reasoning tasks
The unexpected contents task is a standard task about whether children can remember their earlier false belief about the expected contents of a familiar container after knowing the true content, and whether they can predict others’ false belief about the container’s contents (Perner et al., 1987). Each participant was shown a box (e.g., a crayon box) and the tester asked them what they believed to be in the box. After the child’s answer, the tester opened the box and showed that it actually contained an unexpected item (e.g., a spoon). The child was then asked what they thought was inside the box before it was opened, and what their friend or a toy character would think before it was opened. Each child was tested on two different containers at T1 and another two containers at T3, so they could not just remember what was in the container from T1 to T3. Our prior research over some twenty years had shown the four containers with unexpected contents to produce essentially equivalent responses from preschoolers.
The unseen object displacement task is a modification of the Sally-Anne test (Wimmer and Perner, 1983), which is the standard task measuring whether the children can predict a character’s behavior after successfully reasoning about their false belief (de Villiers and de Villiers, 2012). Children were told two different picture-supported stories, each involving two characters. One character (e.g., a boy) put an object (e.g., a basketball) at one place and left the room. The other character, without deceptive intent, put the object somewhere else. The children were then asked where the boy would first look for the basketball when he came back into the room and why he would look there for the ball. Different stories were used at T1 and T3. Prior research had shown that the four stories produced very similar responses from preschoolers.
Children’s performance received a score of either 1 (for a correct response) or 0 (for an incorrect response) for each verbal false belief question across the Unexpected Contents and Unseen Displacement tasks. This produced a total verbal false belief score out of 8 for each participant.
Low-verbal FB tasks
Both 2-picture sequences (Woolfe et al., 2002) and 4-picture sequences (de Villiers and de Villiers, 2012) were employed. The low-verbal tasks were tested together. Three training items were completed before the administration of low-verbal tasks to make sure the children understood the nature of thought-bubbles as representing the contents of what a character was thinking. Children who failed the training would not be given the low-verbal tasks.
The 2-picture sequence procedure (Woolfe et al., 2002) contained three false belief trials and two true belief trial. The children were shown pictures in which a character was thinking something but the thought bubble was empty, and were asked to choose from three pictured objects what they thought should go into the bubble. An example of a false belief trial showed a picture of a boy fishing with a bending rod. The end of the line and the hook were covered by a flap. The tester instructed the child to lift the flap and a boot was revealed to the child but not the character in the picture. Then the tester pointed to the thought bubble on the boy’s head and asked “What goes it here?.” The child would then choose among three pictures: a fish (false belief), a boot (reality) and a bird (distraction). An example of a true belief trial would be identical to the false belief trials except it would reveal a fish rather than a boot.
The 4-picture-sequence procedure (de Villiers and de Villiers, 2012) included three false belief trials and one true belief trial. In these picture sequences, one of the two characters replaced the contents of a familiar container (e.g., playdough) with an unusual object (e.g., worms). A second character either observed or did not observe this replacement. The children were then asked what should be put into the thought bubble above the second character’s head before s/he opened the container (e.g., a picture of playdough or a picture of worms). The picture was drawn in a way such that it was reasonable for the character to want to find either object that could be in the container.
For each false belief trial, the child received a score of 1 (for a correct answer) or 0 (for an incorrect answer). The total score out of 6 false belief trials was then recorded as the child’s low-verbal false belief score.
Language
Vocabulary
Children’s achieved vocabulary in English was assessed with the Expressive One Word Picture Vocabulary Test (EOWPVT-R; Brownell, 2000) at both T1 and T3. They were asked to name the objects, actions, or concepts shown in colored pictures. In all our analyses we used the children’s raw scores on the test, the number of words correctly produced.
Morphosyntax
As an additional measure of the children’s general language development, we used their raw scores on the morphosyntax items of the Risk Subtest of the Diagnostic Evaluation of Language Variation – Screening Test (DELV-ST; Seymour et al., 2003). This test was used because it assesses African-American children’s acquisition of morphosyntax that is neutral with respect to differences between African American English (AAE) and Mainstream American English (MAE). We did not wish to use standardized tests of MAE that are biased against children speaking other dialects of English, since a high proportion of the children in our study were African American (See Table 1), many of them AAE speakers.
False complement comprehension
Children’s ability to process syntactic structures that differentiate what a person said from reality was tested using the task developed by de Villiers and Pyers (2002). The False Complement Comprehension task contains 8 trials testing whether the children can hold the complement structure associated with the verb “say” in memory and successfully reproduce it. In each trial, a story was told along with pictures about one character saying something which was different from reality. For example, “The woman said there was a bug in her cereal. But look, it was just a raisin!” At the end of the story the tester asked “What did the woman say was in her cereal?” For each trial, the child received a score of either 1 (passed) or 0 (failed). The total score out of 8 was then recorded.
Inhibitory control
Bird and dragon task
The Bird and Dragon Task is a simplified version of “Simon Says” (Reed et al., 1984; Kochanska et al., 1996). In this task children need to selectively inhibit commanded actions. In 10 familiarization trials, the tester first asked the children to imitate self-directed action (e.g., “Touch your ears”). Then the tester introduced two puppets: a “nice bird” and a “naughty dragon.” The instruction was to follow the bird’s commands but not the dragon’s. Children’s performance on each dragon command received a score of 1 (performed a full movement), 2 (performed a wrong movement), 3 (performed a partial movement), or 4 (did not move), and the mean of scores on 7 dragon commands was recorded as the “inhibition score.”
Knock-tap task
The Knock-Tap Task required children to be able to switch from imitating hand actions to doing the opposite action (Korkman et al., 1998). First the children were asked to imitate the examiner by either knocking with a closed fist or tapping with an open palm on a box for 8 trials. Then for 8 pseudorandom opposite trials the children had to tap when the examiner knocked and knock when the examiner tapped. Thus, in this task the children had to inhibit the prepotent response of imitating the tester’s hand action, the response that had just been primed. Percentage of correct responses over 8 opposite trials was recorded for each child.
3 Results
3.1 Background measures
Our analysis of the factors contributing to false belief reasoning in the children began with two regression analyses. The first looks at the cross-sectional predictors at T3, taking out the level of FB at T1. The second takes advantage of the longitudinal data to look also at predictors from T1 to T3. Finally, we construct a structural equation model to look at the interactive effect of the predictors. But first an outcome score, a composite of false belief measures was required. Children’s scores on the four false belief tasks were significantly intercorrelated (see Table 3), so a composite Total FB score was created by adding together the responses to all of the FB questions. Scores varied from 0 to 14 correct out of 14.
Although we intended to create a composite executive function score, a composite inhibitory control score was not calculated from the Bear-Dragon and Knock-Tap Test because the partial correlations between performances on the two measures were low (r = 0.133, p = 0.033 at T1; r = 0.114, p = 0.069 at T3; partial correlations controlling for Age). The Bear-Dragon test was selected as the predictor because it had the strongest bivariate correlation with False Belief.
The children showed significant longitudinal growth over the 6 months between T1 and T3 in their false belief reasoning, in each of the Inhibitory Control tasks, and in the three Language measures (see Table 4).
Table 4. Longitudinal growth in false belief reasoning, inhibitory control, and language measures between Time 1 (early in the preschool year) and Time 3 (late in the preschool year).
There were significant effects of both Gender and Curriculum Intervention on the children’s total FB score at T3 (Gender: F (N = 132) FB mean = 9.53/14, M (N = 126) FB mean = 8.40/14; t(256) = 3.03, p = 0.003) (Intervention: INT (N = 215) FB mean = 9.25/14, CONTROL (N = 43) FB mean = 7.63/14; t(256) = 3.27, p = 0.001). Therefore, both Gender and Intervention were entered as background predictor variables in the subsequent analyses of the relationship between inhibitory control and language development and the children’s FB reasoning toward the end of preschool. Predictors of Total FB reasoning scores at T3 were examined using both linear hierarchical multiple regressions and structural equation modeling.
3.2 Cross-sectional predictors of false belief reasoning at time 3: regression 1
First, a cross-sectional multiple regression was performed relating T3 raw scores on the Bear-Dragon Game and the three Language measures to the children’s T3 Total FB scores. The Language measures were kept separate to investigate the independent effects of Morphosyntax, Vocabulary, and Complements on FB reasoning, although the regression also showed the combined effects of the three on the variance in total FB score at T3. Predictors were entered in four blocks: first, Background Variables [Age at T3, Gender (1 = Female, 0 = Male), and Intervention (1 = Intervention Group, 0 = Control Group)]; second, the children’s T1 FB scores; third, performance on the Bear-Dragon task; and finally, the three Language measures (see Table 5).
Table 5. Cross-sectional hierarchical linear regression predicting T3 False Belief from Background variables (age, gender and intervention), T1 false belief scores, inhibitory control, and language measures.
The Background variables accounted for a significant proportion of the variance in T3 FB scores (R2 = 0.157, p = 0.000) and each of the three variables was a significant contributor (see Table 5). Children’s FB scores at T1 were major predictors of their FB reasoning at T3, accounting for an additional 24.7% of the variance (p < 0.001). Performance on the Bear-Dragon Game at T3 was also a significant predictor of later FB reasoning (∆R2 = 0.009, p = 0.050), although the additional variance accounted for by this measure was small. Finally, the T3 Language measures added substantially to the prediction of T3 Total FB scores (∆R2 = 0.127, p = 0.000). Of the Language measures, both Expressive Vocabulary and Complement Comprehension were significant independent predictors of T3 FB (Vocabulary: B = 0.204, t = 3.86, p < 0.001; Complements: B = 0.306, t = 6.60, p < 0.001), but Morphosyntax was not in itself a significant predictor.
3.3 Longitudinal predictors of false belief reasoning at time 3: regression 2
The second regression considered the possible longitudinal effects of the additional measures taken at T1, not just the continuity in FB scores from T1 to T3. For the longitudinal regression analysis between T1 and T3, the predictor variables were also entered in four blocks (Table 6). First, the three background variables were entered: Age at T3, Gender, and Intervention. Next, the children’s T1 FB scores were entered. Then scores on the Bear-Dragon Inhibitory Control game at T1 were entered. Finally, the three Language measures from T1 were entered as a block: DELV-ST dialect neutral Morphosyntax, Expressive Vocabulary, and Complement Comprehension. Thus, the first two blocks of this regression analysis were the same as the cross-sectional T3 analysis, but longitudinal Inhibitory Control and Language predictors were entered into this regression.
Table 6. Longitudinal hierarchical linear regression predicting T3 false belief from background variables (age, gender and intervention), T1 false belief scores, inhibitory control, and language measures.
The results for Block 3 and Block 4 of the longitudinal regression were very similar to those from the cross-sectional analysis: performance on the T1 Bear-Dragon game added significantly to the variance in T3 FB scores accounted for by the variables in Blocks 1 and 2 (∆R2 = 0.030, p = 0.000) and Expressive Vocabulary and Complement Comprehension at T1 were each independent predictors of later T3 FB reasoning (Vocabulary: B = 0.261, t = 4.65,p < 0.001; Complements: B = 0.138, t = 2.55, p < 0.011). The Language measures as a group accounted for 7.1% of the variance in FB scores (p < 0.0001). Morphosyntax scores on the DELV-ST did not add any predictiveness to the Language measures.
3.4 Structural equation model
The power of structural equation methods lies in their ability to model complex interactions and direction of effects, especially in longitudinal data. A structural equation model was fitted to the data from T1 and T3 using the same predictor variables and outcome measures as the regressions (see Figure 1). The model shows both concurrent effects of Inhibitory Control and Language measures at T1 and T3 on FB scores at T1 and T3, and longitudinal effects of the T1 measures on T3 scores. The final model is an excellent fit to the data, with fit indices >0.95 and RMSEA and SRMSR values <0.07.
Figure 1. Concurrent and longitudinal structural equation model of the relationships between inhibitory control (Bear-Dragon game), the three language measures, and total false belief scores at Tl and T3. Effects of gender and curriculum intervention are also part of the model. Raw scores were entered for the Bear-Dragon game, DELV-ST Morphosyntax, complement comprehension, and vocabulary at Tl and T3. The figure gives standardized parameter estimates for the hypothesized longitudinal SEM. All reported estimates are the maximum likelihood standardized point estimates. χ2 (21, N = 258) = 38.008, p = 0.013; Comparative Fit Index = 0.978; Tucker-Lewis Index = 0.954; Root Mean Square Error of Approximation (RMSEA) = 0.056; Standardized Root Mean Square Residual (SRMSR) = 0.068. All error covariances were calculated but are not shown.
The SEM complements the findings of the two regression analyses. Inhibitory Control (Bear-Dragon) at T1 has significant direct effects on FB at T1 (Standardized Parameter (SP) = 0.12, p < 0.001) and longitudinally at T3 (SP = 0.111, p < 0.05). Bear-Dragon scores at T3 did not add to the fit of the model and standardized parameter estimates between that measure and FB at T3 were not statistically significant, so that variable was excluded from the final model.
Of the Language measures, Vocabulary and Complement Comprehension have stronger effects than Morphosyntax. Morphosyntax scores on the DELV-ST at T1 are significantly related to FB at T1 (SP = 0.158, p < 0.001), but not to FB at T3. T3 Morphosyntax scores did not have significant effects on T3 FB, so they are excluded from the model.
Complement Comprehension scores have direct effects on FB at T1 (SP = 0.355, p < 0.001), but not longitudinally on FB at T3. Instead, they have strong indirect effects on T3 FB through their effects on T3 Complement Comprehension (SP = 0.475, p < 0.001). T3 Complement Comprehension is significantly related to FB at T3 (SP = 0.269, p < 0.001).
Expressive Vocabulary at T1 has significant direct effects on both T1 (SP = 0.163, p < 0.001) and T3 FB (SP = 0.179, p < 0.001), and T1 Vocabulary has a significant longitudinal effect on T3 Vocabulary (SP = 0.730, p < 0.001); but T3 Vocabulary is not significantly related to T3 FB in this model. The best fit directional pathway between Inhibitory Control at T1 and Complement Comprehension at T1 goes from Complements to Inhibitory Control (SP = 0.329, p < 0.001).
4 Discussion
The data analyses allow separation of the various influences on false belief reasoning over time, taking into account the child’s initial level of success. Of course, these results do not rule out the possibility that different developmental paths might occur in other cultures and other languages, which is why a broader net of research must still be cast. This is the first study of a large sample of poorly-resourced children in the US, the majority of whom were African American or Hispanic.
Our first prediction concerned the effects of the intervention on false belief reasoning. Since the interventions were randomized, all the groups contained an equivalent mix of children from low-income households with parents of low educational achievement. The larger group of researchers had an interest in curricular improvements that could be implemented in low-income day centers that could pay off in early schooling, and as such, were not focused on either syntax nor theory of mind (Lonigan et al., 2015). However, the intensive exposure to books, exercises and materials that focused on emotion regulation undoubtedly enriched the children’s exposure to language about the mind, and it paid off in the strong effects of the intervention versus business-as-usual uncovered in the analyses. These results cannot distinguish between the various hypotheses, but are at least compatible with the discourse content hypothesis, namely, that the children’s development of theory of mind is enriched by more mind-talk (Cutting and Dunn, 1999; Meins et al., 2002). They reinforce and refine the Lonigan et al. (2015) results in that they demonstrate an effect of teacher-delivered, richer day care curricula on theory of mind development in the least-resourced children.
The finding that the intervention impacted theory of mind outcomes for the children was not previously reported, as the published report from the longitudinal study looked only at emotion understanding and parental reports of social development (Lonigan et al., 2015). Including all the groups may have increased the variance in achievements of the sample. The regression already accounted for the variance due to intervention, age and gender, and the language variables then entered were additional sources of influence.
Like some other studies, we also found an effect of gender on false belief reasoning, a fact that is not fully explained. Some prior studies with preschool-aged children have found girls to show slightly better performance on emotion understanding and false belief tasks (Banerjee, 1997; Charman and Clements, 2002), and some found differences in later theory of mind skills (Calero et al., 2013; Kuhnert et al., 2017). However, several others have found no difference (Devine and Hughes, 2012; Mathieson and Banerjee, 2011; Walker, 2005). A gender effect has sometimes been attributed to the differential discourse of parents by the gender of their child, with girls receiving a higher volume of talk about people and relationships, hence mental states (Charman and Clements, 2002). The other possibility is that it reflects the finding that girls outpace boys in language development, though that difference has usually been found to be limited to the early years. In the current data, the girls showed small but statistically significant differences in each of the language skills relative to the boys.
Turning to the theoretical significance of the findings, one can see reflections here of all the major theoretical proposals of factors that influence development. The results of the SEM turned out to be more subtle than our initial predictions. First, it is evident that the child’s executive functioning, indexed here by inhibitory control, exerts differential influence across ages. Whether measured at T1 or at T3, inhibitory control has a small but significant effect on false belief reasoning at T3 as predicted. However, the SEM model makes clear that the variance in inhibitory control at T1 has its significant influence by affecting false beliefs at T1. By T3, inhibitory control makes no difference, primarily because the variance at T3 is so limited: the majority get full points on the measure. Recall also that our measure was limited to a single task, because of the lack of correlation between the two inhibitory control tasks with which we began. Surprisingly, inhibitory control was not a contributor to complement comprehension: the SEM makes clear that the direction of influence is the reverse. The complement comprehension test has been seen as potentially a kind of inhibitory control task, but that is not evident here. Farrant et al. (2012) found the same direction of effect in their SEM model with typical children, namely, that sentential complements predicted executive function rather than vice versa.
A picture is beginning to emerge of how the different factors contribute to the two processes involved in explicit false belief reasoning. Inhibitory control would seem to be needed for the second step, of suppressing the prepotent response. If the representation of alternatives is appropriate then inhibitory control can assist, but if the representation is inadequate then inhibitory control will not help. The representation must be built first, and language, specifically complementation, is the ideal structure. A child could have developed inhibitory control for other decisions but still fail on both complements and on false beliefs, as in the case of the oral deaf children in de Villiers and de Villiers (2012). A child could theoretically have complements but not inhibitory control, perhaps a child with ADHD, though this has not been explored. The interpretation is partly supported by the results of a recent training study (Boeg Thomsen et al., 2024) that demonstrates that preschool children benefited most from complement training when they had better executive function skills to begin with.
Second, there is support here for the general language hypothesis such as vocabulary development or general morphosyntax, as predicted. The child’s vocabulary at T1 matters for false belief understanding both at T1 and at T3, but like inhibitory control, the effect of T3 vocabulary on false belief understanding at T3 disappears, and a similar pattern occurs with morphosyntax, despite there being plenty of variance in both still at T3. A major effect at both time points is complement comprehension. What matters at T3 is complement comprehension at T3. That is, the proximal effect by the final stage is the children’s complement comprehension measured then. The other influences at an earlier point affect the earlier stages of false belief understanding comprehension. The average false belief understanding score at T1 was 5.46, well below chance for 14 items (p = 0.001), but by T3, it was 10.52 (above chance, p = 0.001) with a distribution that shows more equal proportions of passers and failers. It is therefore possible that vocabulary, morphosyntax and inhibitory control each make a difference in getting any points at all on the false belief task, but once that is underway, it is complement comprehension that matters. Of course, the present study is still limited by the range of language measures used, and it remains possible that other complex language could play a role in enhancing false belief reasoning (Schroeder et al., 2021; Charnavel et al., 2024).
The importance of these findings is that they reveal how different studies, with small and slightly different age samples, can come up with different results in which executive function, vocabulary, or complement syntax might make different contributions statistically. Furthermore, enriched talk about the mind via specially prepared curricula or books may also be effective in promoting growth. Only with longitudinal data of this degree of richness can a full developmental story emerge. However, though the age span covered here is quite broad (3.3 to 5.9), the longitudinal time interval was still relatively short as the children averaged 4.6 at T1 and 5.1 at T3.
The result that complements predict false belief reasoning confirms the predictions and is compatible with the data from a number of training studies that have set out to test whether false complements can be trained, and if so, whether that linguistic training impacts false belief reasoning. An excellent review of existing training studies is provided in Boeg Thomsen et al. (2024). Though some studies do not conclusively demonstrate a specific effect of complementation (Hale and Tager-Flusberg, 2003; Lohmann and Tomasello, 2003), others provide convincing evidence that specific training on complements has positive effects on later false belief reasoning (Mo et al., 2014; Durrleman et al., 2019; Durrleman and de Villiers, 2024; Boeg Thomsen et al., 2024). Training studies, since they constitute intervention in the causal chain, can be considered the ultimate experimental test of the causal hypothesis, unlike even the best of longitudinal studies that do not train. But training studies do not settle the question of whether this is the normal path of development. Together with reliable longitudinal data, the case is more robust.
Several theorists have maintained that language may be prerequisite for, or an important determinant of, explicit but not implicit theory of mind (Apperly and Butterfill, 2009; Low, 2010). Low and Watts (2013) suggested a signature limit (Apperly and Butterfill, 2009) on the implicit theory of mind system. The proposal is that the implicit system may be capable of tracking only a subset of mental states, such as a protagonist’s beliefs about the location of an object, but not beliefs about the object’s properties or identity. Results of a study by Fizke et al. (2017) support this proposal. Perhaps language is more influential, or necessary, for beliefs about object identity. In unseen displacement the child must anticipate where a person will look, and that could be done by a system that recognizes another’s goals or intentions, but not necessarily beliefs. However, object identity tasks require representing how the other person conceives of an object, and that is a more demanding representation (Rakoczy, 2017). More precise work is needed to test this distinction.
The overall findings of the study are compatible with the claim that sentential complements play a significant role in explicit false belief reasoning development, separate from general morphosyntax, vocabulary and executive function. Many questions remain about the scope of the finding cross linguistically, as so few languages have so far been investigated and the range of mental state expressions is diverse, as are the techniques for assessing them. Questions also remain about the role of implicit theory of mind as an additional component of influence on the variance in children’s achievement. More research is needed on whether there is real continuity of skills across the whole span from infancy to higher order belief at age six or seven, and what effects there might be of the type of belief engaged in the task.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Smith College Institutional Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
JV: Conceptualization, Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review & editing. PV: Conceptualization, Data curation, Formal analysis, Funding acquisition, Methodology, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. The research was supported by a Program grant from the National Institutes of Child Health and Development (NICHD) (PO1 HD048497). The authors received funding from NIH for conducting the research.
Acknowledgments
We gratefully acknowledge the work on structural equation modeling carried out by two students of ours at the time they were at Smith College: Meng Chen, now in a doctoral program at Penn. State University, and Rachel Yan, now in a doctoral program at Michigan University. We also must thank the hard work of everyone in the School Readiness Research Consortium who collected the data: C. J. Lonigan, B. M. Phillips, and J. Clancy-Menchetti (Florida State University); S. Landry, P. Swank, M. Assel, and H. Taylor (University of Texas in Houston).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
1. ^In the US, Early Education and Care (EEC) provides subsidies to families in need to help cover the cost of child care. Child care programs accepting these subsidies use EEC’s Child Care Financial Assistance (CCFA) system for documenting family eligibility, tracking child attendance, and submitting requests for reimbursement.
References
Aksu-Koç, A., Aydın, Ç., Avcı, G., Sefer, N., and Yaşa, Y. (2005). The relation between mental verbs and theory of mind performance: Evidence from Turkish children. Berlin: In IASCL Convention.
Apperly, I. A., and Butterfill, S. A. (2009). Do humans have two systems to track beliefs and belief-like states? Psychol. Rev. 116, 953–970. doi: 10.1037/a0016923
Astington, J. A., and Baird, J. (2005). Why language matters for theory of mind. Oxford: Oxford University Press.
Baillargeon, R., Buttelmann, D., and Southgate, V. (2018). Invited commentary: interpreting failed replications of early false-belief findings: methodological and theoretical considerations. Cogn. Dev. 46, 112–124. doi: 10.1016/j.cogdev.2018.06.001
Baillargeon, R., Scott, R. M., and He, Z. (2010). False-belief understanding in infants. Trends Cogn. Sci. 14, 110–118. doi: 10.1016/j.tics.2009.12.006
Baldwin, D. (1995). “Understanding the link between joint attention and language” in Joint attention: Its origins and role in development. eds. C. Moore and P. J. Dunham (Hove: Psychology Press).
Banerjee, M. (1997). Hidden emotions: pre-schoolers' knowledge of appearance-reality and emotion display rules. Soc. Cogn. 15, 107–132. doi: 10.1521/soco.1997.15.2.107
Bartsch, K., and Wellman, H. M. (1995). Children talk about the mind. Oxford: Oxford University Press.
Boeg Thomsen, D., Kandemirci, B., Theakston, A., and Brandt, S. (2024). Do complement clauses with first-or third-person perspective support false-belief reasoning? A training study with English-speaking 3-year-olds. Dev. Psychol. doi: 10.1037/dev0001808
Boeg Thomsen, D., Theakston, A., Kandemirci, B., and Brandt, S. (2021). Do complement clauses really support false-belief resoning? A longitudinal study with English-speaking 2-to 3-year-olds. Dev. Psychol. 57, 1210–1227. doi: 10.1037/dev0001012
Brandt, S., Buttelmann, D., Lieven, E., and Tomasello, M. (2016). Children’s understanding of first and third person perspectives in complement clauses and false belief tasks. J. Exp. Child Psychol. 151, 131–143. doi: 10.1016/j.jecp.2016.03.004
Brandt, S., Hargreaves, S., and Theakston, A. (2023). Putting complement clauses into context: testing the effects of story context, false-belief understanding, and syntactic form on children’s and adults’ comprehension and production of complement clauses. Cogn. Sci. 47:e13311. doi: 10.1111/cogs.13311
Brandt, S., Li, H., and Chan, A. (2023). What makes a complement false? Looking at the effects of verbal semantics and perspective in mandarin children’s interpretation of complement-clause constructions and their false-belief understanding. Cogn. Ling. 99–132. doi: 10.1515/cog-2021-0108
Brooks, R., and Meltzoff, A. N. (2015). Connecting the dots from infancy to childhood: a longitudinal study connecting gaze following, language, and explicit theory of mind. J. Exp. Child Psychol. 130, 67–78. doi: 10.1016/j.jecp.2014.09.010
Brownell, R. (2000). Expressive one-word picture vocabulary test. 3rd Edn. Novato, CA: Academic Therapy.
Buttelmann, D., Carpenter, M., and Tomasello, M. (2009). Eighteen-month-old infants show false belief understanding in an active helping paradigm. Cognition 112, 337–342. doi: 10.1016/j.cognition.2009.05.006
Calero, C. I., Salles, A., Semelman, M., and Sigman, M. (2013). Age and gender dependent development of theory of mind in 6-to 8-years old children. Front. Hum. Neurosci. 7:281. doi: 10.3389/fnhum.2013.00281
Call, J., and Tomasello, M. (2008). Does the chimpanzee have a theory of mind? 30 years later. Trends Cogn. Sci. 12, 187–192. doi: 10.1016/j.tics.2008.02.010
Carlson, S. M., Claxton, L. J., and Moses, L. J. (2015). The relation between executive function and theory of mind is more than skin deep. J. Cogn. Dev. 16, 186–197. doi: 10.1080/15248372.2013.824883
Carlson, S. M., and Moses, L. J. (2001). Individual differences in inhibitory control and children’s theory of mind. Child Dev. 72, 1032–1053. doi: 10.1111/1467-8624.00333
Carlson, S. M., Moses, L. J., and Breton, C. (2002). How specific is the relation between executive function and theory of mind? Contributions of inhibitory control and working memory. Infant Child Dev. 11, 73–92. doi: 10.1002/icd.298
Charman, T., and Clements, W. (2002). Is there a gender difference in false belief development? Soc. Dev. 11, 1–10. doi: 10.1111/1467-9507.00183
Charnavel, I., de Villiers, J. G., and Durrleman, S. (2024) Taking perspective in motives and purposes: The case of adjunct clauses and theory of mind. Paper in symposium, complex language and theory of mind IASCL. July 18th, 2024.
Cheung, H. (2006). False belief and language comprehension in Cantonese-speaking children. J. Exp. Child Psychol. 95, 79–98. doi: 10.1016/j.jecp.2006.05.002
Cheung, H., Hsuan-Chih, C., Creed, N., Ng, L., Ping Wang, S. P., and Mo, L. (2004). Relative roles of general and complementation language in theory-of-mind development: evidence from Cantonese and English. Child Dev. 75, 1155–1170. doi: 10.1111/j.1467-8624.2004.00731.x
Cheung, H., Hsuan-Chih, C., and Yeung, W. (2009). Relations between mental verb and false-belief understanding in Cantonese-speaking children. J. Exp. Child. Psych. 104, 141–155. doi: 10.1016/j.jecp.2009.05.004
Cutting, A. L., and Dunn, J. (1999). Theory of mind, emotion understanding, language, and family background: individual differences and interrelations. Child Dev. 70, 853–865. doi: 10.1111/1467-8624.00061
Davidson, D. (1984). Thought and talk.” inquiries into truth and interpretation. Oxford: Clarendon Press, 155–170.
De Mulder, H. N. M., Wijnen, F., and Coopmans, P. H. A. (2019). Interrelationships between theory of mind and language development: a longitudinal study of Dutch-speaking kindergartners. Cogn. Dev. 51, 67–82. doi: 10.1016/j.cogdev.2019.03.006
de Villiers, J. G. (1995). “Questioning minds and answering machines” in Proceedings of the Boston University conference on language development. eds. D. M. Laughlin and S. McEwen (Somerville, MA: Cascadilla Press).
de Villiers, J. G. (1999). “On acquiring the structural representations for false complements” in UMOP: New perspectives on language acquisition. ed. B. Hollebrandse (Amherst, MA: University of Massachusetts).
de Villiers, J. G. (2007). The interface of language and theory of mind. Lingua 117, 1858–1878. doi: 10.1016/j.lingua.2006.11.006
de Villiers, J. G. (2018). “Perspectives on truth: the case of language and false belief reasoning” in Semantics in language acquisition. eds. K. Syrett and S. Arunachalam (Amsterdam: John Benjamins), 222–245.
de Villiers, J. G. (2021). With language in mind. Lang. Learn. Dev. 17, 71–95. doi: 10.1080/15475441.2020.1820338
de Villiers, P. A., Burns, F., and Pearson, B. (2003). “The role of language in theory of mind development of language-impaired children: complementing theories” in Proceeding of the 27th annual Boston university conference on language development. eds. B. Beachley, A. Brown, and F. Conlin, vol. 1 (Somerville, MA: Cascadilla Press).
de Villiers, J. G., and de Villiers, P. A. (2000). “Linguistic determinism and the understanding of false beliefs” in Children’s reasoning and the mind. eds. P. Mitchell and K. Riggs (Hove: Psychology Press), 191–228.
de Villiers, J. G., and de Villiers, P. A. (2009). “Complements enable representation of the contents of false beliefs: the evolution of a theory of mind” in Language acquisition. ed. S. Foster (New York: Palgrave McMillian), 169–195.
de Villiers, P. A., and de Villiers, J. G. (2012). Deception dissociates from false belief reasoning in deaf children: implications for the implicit versus explicit theory of mind distinction. Br. J. Dev. Psychol. 30, 188–209. doi: 10.1111/j.2044-835X.2011.02072.x
de Villiers, J. G., and de Villiers, P. A. (2014). The role of language in theory of mind development. Top. Lang. Disord. 34, 313–328. doi: 10.1097/tld.0000000000000037
de Villiers, J. G., Harrington, E., Gadilauskas, E., and Roeper, T. (2012). “Tense and truth in children's question answering” in Proceedings of the Boston University conference on language development. eds. A. Biller, E. Chung, and A. Kimball (Somerville, MA: Cascadilla Press).
de Villiers, J. G., and Pyers, J. E. (2002). Complement to cognition: a longitudinal study of the relationship between complex syntax and false-belief-understanding. Cogn. Dev. 17, 1037–1060. doi: 10.1016/S0885-2014(02)00073-4
de Villiers, J. G., and Roeper, T. (2016). “The acquisition of complements” in Oxford handbook of developmental linguistics. eds. W. Snyder and J. Lidz (Oxford: Oxford University Press), 279–309.
Dennett, D. C. (1978). Beliefs about beliefs. Behav. Brain Sci. 1, 568–570. doi: 10.1017/S0140525X00076664
Devine, R. T., and Hughes, C. (2012). Silent films and strange stories: theory of mind, gender, and social experiences in middle childhood. Child Dev. 84, 989–1003. doi: 10.1111/cdev.12017
Devine, R. T., and Hughes, C. (2014). Relations between false belief understanding and executive function in early childhood: a meta-analysis. Child Dev. 85, 1777–1794. doi: 10.1111/cdev.12237
Diessel, H., and Tomasello, M. (2001). The acquisition of finite complement clauses in English: a usage-based approach to the development of grammatical constructions. Cogn. Linguist. 12, 97–141. doi: 10.1515/cogl.12.2.97
Domitrovich, C. E., Cortes, R., and Greenberg, M. T. (2007). Improving young children’s social and emotional competence: a randomized trial of the preschool PATHS curriculum. J. Prim. Prev. 28, 67–91. doi: 10.1007/s10935-007-0081-0
Durrleman, S., Burnel, M., de Villiers, J. G., Thommen, E., Yan, R., and Delage, H. (2019). The impact of grammar on mentalizing: a training study including children with autism spectrum disorder and developmental language disorder. Front. Psychol. 10, 1–15. doi: 10.3389/fpsyg.2019.02478
Durrleman, S., Burnel, M., Thommen, E., Foudon, N., Sonié, S., Reboul, A., et al. (2016). The language-cognition interface in ASD: complement sentences and false belief reasoning. Res. Autism Spectrum Disord. 21, 109–120. doi: 10.1016/j.rasd.2015.10.003
Durrleman, S., and de Villiers, J. G. (2024). Complements and false belief reasoning: An ecological training study in children with and without autism. Paper presented in symposium: “Complex language and theory of mind. Prague: International Association of Child Language.
Farrar, M. J., Benigno, J. P., Tompkins, V., and Gage, N. A. (2017). Are there different pathways to explicit false belief understanding? General language and complementation in typical and atypical children. Cogn. Dev. 43, 49–66. doi: 10.1016/j.cogdev.2017.02.005
Filippova, E., and Astington, J. W. (2010). Children’s understanding of social-cognitive and social-communicative aspects of discourse irony. Child Dev. 81, 913–928. doi: 10.1111/j.1467-8624.2010.01442.x
Fizke, E., Butterfill, S., van de Loo, L., Reindl, E., and Rakoczy, H. (2017). Are there signature limits in early theory of mind? J. Exp. Child Psychol. 162, 209–224. doi: 10.1016/j.jecp.2017.05.005
Fodor, J. A. (1994). The human language series: Part 1: Discovering the human language. Director. G. Searchinger [video/DVD].
Farrant, B. M., Maybery, M. T., and Fletcher, J. (2012). Language, cognitive flexibility, and explicit false belief understanding: Longitudinal analysis in typical development and specific language impairment. Child Dev, 83, 223–235. doi: 10.1111/j.1467-8624.2011.01681.x
Grosse Wiesmann, C., Friederich, A. D., Singer, T., and Steinbeis, N. (2017). Implicit and explicit false belief development in preschool children. Dev. Sci. 20, 1–15. doi: 10.1111/desc.12445
Guo, Q., Pan, Q., Liu, Q., Wang, T., Cao, S., Lin, Y., et al. (2022). Relationship between different types of complement syntax and false belief in mandarin-speaking children with autism spectrum disorder and typically developing children. Front. Psychol. 13:1045227. doi: 10.3389/fpsyg.2022.1045227
Hale, C. M., and Tager-Flusberg, H. (2003). The influence of language on theory of mind: a training study. Dev. Sci. 6, 346–359. doi: 10.1111/1467-7687.00289
Harris, P. L., de Rosnay, M., and Pons, F. (2005). Language and children’s understanding of mental states. Curr. Directions Psychol. Sci. 14, 69–73. doi: 10.1111/j.0963-7214.2005.00337.x
Heyes, C. (2014). False belief in infancy: a fresh look. Develop. Sci. 17, 647–659. doi: 10.1111/desc.12148
Hughes, C. (1998). Executive function in preschoolers: links with theory of mind and verbal ability. Br. J. Exp. Psychol. 16, 233–253. doi: 10.1111/j.2044-835X.1998.tb00921.x
Jenkins, J. M., and Astington, J. W. (1996). Cognitive factors and family structure associated with theory of mind development in young children. Dev. Psychol. 32, 70–78. doi: 10.1037/0012-1649.32.1.70
Klein, A., Starkey, P., and Ramirez, A. (2002). Pre-K mathematics curriculum. Glendale, IL: Scott Foresman.
Knuppel, A., Steensgaard, R., and Jensen de Lopez, K. (2007). Mental state talk by Danish preschool children. Nordlyd 34, 110–130. doi: 10.7557/12.125
Kochanska, G., Murray, K., Jacques, T. Y., Koenig, A. L., and Vandegeest, K. A. (1996). Inhibitory control in young children and its role in emerging internalization. Child Dev. 67, 490–507. doi: 10.2307/1131828
Korkman, M., Kirk, U., and Kemp, S. (1998). NESPY: A developmental neuropsychological assessment. San Antonio, TX: The Psychological Corporation.
Kuhnert, R. L., Begeer, S., Fink, E., and de Rosnay, M. (2017). Gender-differentiated effects of theory of mind, emotion understanding, and social preference on prosocial behavior development: a longitudinal study. J. Exp. Child Psychol. 154, 13–27. doi: 10.1016/j.jecp.2016.10.001
Kulke, L., and Rakoczy, H. (2018). Implicit theory of mind–an overview of current replications and non-replications. Data Brief 16, 101–104. doi: 10.1016/j.dib.2017.11.016
Kulke, L., von Duhn, B., Schneider, D., and Rakoczy, H. (2018). Is implicit theory of mind a real and robust phenomenon? Results from a systematic replication study. Psychol. Sci. 29, 888–900. doi: 10.1177/0956797617747090
Leslie, A. M., German, T. P., and Polizzi, P. (2005). Belief-desire reasoning as a process of selection. Cogn. Psychol. 50, 45–85. doi: 10.1016/j.cogpsych.2004.06.002
Li, H., and Leung, M. (2024). The roles of language and executive function in mandarin-speaking children’s theory of mind development. Front. Psychol. 15:1354207. doi: 10.3389/fpsyg.2024.1354207
Lohmann, H., and Tomasello, M. (2003). The role of language in the development of false belief understanding: a training study. Child Dev. 74, 1130–1144. doi: 10.1111/1467-8624.00597
Lonigan, C. J., Clancy-Menchetti, J., Phillips, B. M., McDowell, K., and Farver, J. M. (2005). Literacy express: A preschool curriculum. Tallahassee, FL: Literacy Express.
Lonigan, C. J., Phillips, B. M., Clancy, J. L., Landry, S. H., Swank, P. R., Assel, M., et al. (2015). Impacts of a comprehensive school readiness curriculum for preschool children at risk for educational difficulties. Child Dev. 86, 1773–1793. doi: 10.1111/cdev.12460
Low, J. (2010). Preschoolers’ implicit and explicit false-belief understanding: relations with complex syntactical mastery. Child Dev. 81, 597–615. doi: 10.1111/j.1467-8624.2009.01418.x
Low, J., and Watts, J. (2013). Attributing false beliefs about object identity reveals a signature blind spot in humans’ efficient mind-reading system. Psychol. Sci. 24, 305–311. doi: 10.1177/0956797612451469
Mathieson, K., and Banerjee, R. (2011). Peer play, emotion understanding, and socio moral explanation: the role of gender. Br. J. Dev. Psychol. 29, 188–196. doi: 10.1111/j.2044-835X.2010.02020.x
Meins, E., Fernyhough, C., Wainwright, R., Das Gupta, M., Fradley, E., and Tuckey, M. (2002). Maternal mind-mindedness and attachment security as predictors of theory of mind understanding. Child Dev. 73, 1715–1726. doi: 10.1111/1467-8624.00501
Miller, S. A. (2009). Children’s understanding of second-order mental states. Psychol. Bull. 135, 749–773. doi: 10.1037/a0016854
Milligan, K., Astington, J. W., and Dack, L. A. (2007). Language and theory of mind: meta-analysis of the relation between language ability and false-belief understanding. Child Dev. 78, 622–646. doi: 10.1111/j.1467-8624.2007.01018.x
Mo, S., Su, Y., Sabbagh, M. A., and Xiu, J. (2014). Sentential complements and false belief understanding in Chinese mandarin-speaking preschoolers: a training study. Cogn. Dev. 29, 50–61. doi: 10.1016/j.cogdev.2013.11.001
Nelson, K. (2005). “Language pathways into the community of minds” in Why language matters for theory of mind. eds. J. W. Astington and J. A. Baird (Oxford: Oxford University Press), 26–49.
Nordmeyer, A. E., and de Villiers, J. G. (2019). To be or not to be: examining the role of language in a concept of negation. In Proceedings of the Annual Meeting of the Cognitive Science Society.
Onishi, K. H., and Baillargeon, R. (2005). Do 15-month-old infants understand false beliefs? Science 308, 255–258. doi: 10.1126/science.1107621
Papafragou, A. (2001). “Linking early linguistic and conceptual capacities: the role of theory of mind” in Conceptual structure, discourse and language IV. eds. A. Cienki, B. Luka, and M. Smith (Stanford, CA: CSLI Publications).
Perner, J., Leekam, S. R., and Wimmer, H. (1987). Three-year-olds' difficulty with false belief: the case for a conceptual deficit. Br. J. Dev. Psychol. 5, 125–137. doi: 10.1111/j.2044-835X.1987.tb01048.x
Perner, J., and Ruffman, T. (2005). Infants' insight into the mind: how deep? Science 308, 214–216. doi: 10.1126/science.1111656
Perner, J., Sprung, M., Zauner, P., and Haider, H. (2003). Want that is understood well before say that, think that, and false belief: a test of de Villiers’ linguistic determinism on German-speaking children. Child Dev. 74, 179–188. doi: 10.1111/1467-8624.t01-1-00529
Poulin-Dubois, D., and Yott, J. (2018). Probing the depth of infants’ theory of mind: disunity in performance across paradigms. Dev. Sci. 21:e12600. doi: 10.1111/desc.12600
Premack, D., and Woodruff, G. (1978). Does the chimpanzee have a theory of mind? Behav. Brain Sci. 1, 515–526. doi: 10.1017/S0140525X00076512
Pyers, J. E. (2004). The relationship between language and false-belief understanding: Evidence from learners of an emerging sign language in Nicaragua. Unpublished doctoral dissertation. Berkeley: University of California.
Pyers, J. E., and Senghas, A. (2009). Language promotes false-belief understanding. Psychol. Sci. 20, 805–812. doi: 10.1111/j.1467-9280.2009.02377.x
Rakoczy, H. (2012). Do infants have a theory of mind? Br. J. Dev. Psychol. 30, 59–74. doi: 10.1111/j.2044-835X.2011.02061.x
Rakoczy, H. (2017). In defense of a developmental dogma: children acquire propositional attitude folk psychology around age 4. Synthese 194, 689–707. doi: 10.1007/s11229-015-0860-8
Reed, M. A., Pien, D. L., and Rothbart, M. K. (1984). Inhibitory self-control in preschool children. Merrill-Palmer Q. 30, 131–147.
Repacholi, B. M., and Gopnik, A. (1997). Early reasoning about desires: evidence from 14-and 18-month-olds. Dev. Psychol. 33, 12–21. doi: 10.1037/0012-1649.33.1.12
Richard, M. (1990). Propositional attitudes: An essay on thoughts and how we ascribe them. Cambridge, U.K.: Cambridge University Press.
Ruffman, T., Slade, L., and Crowe, E. (2002). The relation between children’s and mothers’ mental state language and theory-of-mind understanding. Child Dev. 73, 734–751. doi: 10.1111/1467-8624.00435
San Juan, V., and Astington, J. W. (2012). Bridging the gap between implicit and explicit understanding: how language development promotes the processing and representation of false belief. Brit. J. Dev. Psychol. 30, 105–122. doi: 10.1111/j.2044-835X.2011.02051.x
San Juan, V., and Astington, J. W. (2017). Does language matter for implicit theory of mind? The effects of epistemic verb training on implicit and explicit false-belief understanding. Cogn. Dev. 41, 19–32. doi: 10.1016/j.cogdev.2016.12.003
Scarborough, H. S. (1990). Index of productive syntax. Appl. Psycholing 11, 1–22. doi: 10.1017/S0142716400008262
Schick, B., de Villiers, P. A., de Villiers, J. G., and Hoffmeister, R. (2007). Language and theory of mind: a study of deaf children. Child Dev. 78, 376–396. doi: 10.1111/j.1467-8624.2007.01004.x
Schroeder, K., Durrleman, S., Çokal, D., Delgado, A. S., Marin, A. M., and Hinzen, W. (2021). Relations between intensionality, theory of mind and complex syntax in autism spectrum conditions and typical development. Cogn. Dev. 59:101071. doi: 10.1016/j.cogdev.2021.101071
Scott, R. M., and Baillargeon, R. (2017). Early false-belief understanding. Trends Cogn. Sci. 21, 237–249. doi: 10.1016/j.tics.2017.01.012
Segal, G. (1998). “Representing representations” in Language and thought. eds. P. Carruthers and J. Boucher (Cambridge: Cambridge University Press), 146–161.
Seymour, H., Roeper, T., and de Villiers, J. G. (2003). Diagnostic evaluation of language variation – screening test (DELV-ST). San Antonio, TX: Harcourt.
Shatz, M. (1994). “Theory of mind and the development of social-linguistic intelligence in early childhood” in Children’s early understanding of mind: Origins and development. eds. C. Lewis and P. Mitchell (Hillsdale, NJ: Erlbaum).
Shatz, M., Wellman, H. M., and Silber, S. (1983). The acquisition of mental verbs: a systematic investigation of first references to mental state. Cognition 14, 301–321. doi: 10.1016/0010-0277(83)90008-2
Siu, C. T. S., and Cheung, H. (2022). A longitudinal reciprocal relation between theory of mind and language. Cogn. Dev. 62:101176. doi: 10.1016/j.cogdev.2022.101176
Southgate, V. (2013). “Early manifestations of mindreading” in Understanding other minds: Perspectives from developmental social neuroscience. eds. S. Baron-Cohen, M. Lombardo, and H. Tager-Flusberg (Oxford: Oxford University Press), 3–18.
Southgate, V., Senju, A., and Csibra, G. (2007). Action anticipation through attribution of false belief by 2-year-olds. Psychol. Sci. 18, 587–592. doi: 10.1111/j.1467-9280.2007.01944.x
Southgate, V., and Vernetti, A. (2014). Belief-based action prediction in preverbal infants. Cognition 130, 1–10. doi: 10.1016/j.cognition.2013.08.008
Strachan, J. W., Albergo, D., Borghini, G., Pansardi, O., Scaliti, E., Gupta, S., et al. (2024). Testing theory of mind in large language models and humans. Nat. Hum. Behav. 8, 1285–1295. doi: 10.1038/s41562-024-01882-z
Tager-Flusberg, H., and Joseph, R. M. (2005). “How language facilitates the acquisition of false-belief understanding in children with autism” in Why language matters for theory of mind. eds. J. Astington and J. Baird (Oxford, U.K: Oxford University Press), 298–318.
Tardif, T., and Wellman, H. M. (2000). Acquisition of mental state language in mandarin-and Cantonese-speaking children. Dev. Psychol. 36, 25–43. doi: 10.1037/0012-1649.36.1.25
Tardif, T., Wing-Chee, S. C., and Kaciroti, N. (2007). Language and false belief: evidence for general, not specific, effects in Cantonese-speaking preschoolers. Dev. Psychol. 43, 318–340. doi: 10.1037/0012-1649.43.2.318
Tomasello, M. (1999). The cultural origins of human cognition. Cambridge, MA: Harvard University Press.
Tompkins, V., Benigno, J., Kiger Lee, B., and Wright, B. (2018). The relation between parents’ mental state talk and children’s social understanding: a meta-analysis. Soc. Dev. 27, 223–246. doi: 10.1111/sode.12280
Wagner, R. K., Torgesen, J. K., and Rashotte, C. A. (1999). Comprehensive test of phonological processing (CTOPP). Austin, TX: Pro-Ed.
Walker, S. (2005). Gender differences in the relationship between young children's peer-related social competence and individual differences in theory of mind. J. Genet. Psychol. 166, 297–312. doi: 10.3200/GNTP.166.3.297-312
Wellman, H. M., Cross, D., and Watson, J. (2001). Meta-analysis of theory-of-mind development: the truth about false belief. Child Dev. 72, 655–684. doi: 10.1111/1467-8624.00304
Wellman, H. M., Fang, F., Liu, D., Zhu, L., and Liu, G. (2006). Scaling theory-of-mind understandings in Chinese children. Psychol. Sci. 17, 1075–1081. doi: 10.1111/j.1467-9280.2006.01830.x
Wellman, H. M., Fang, F., and Peterson, C. (2011). Sequential progressions in a theory-of-mind scale: longitudinal perspectives. Child Dev. 82, 780–792. doi: 10.1111/j.1467-8624.2011.01583.x
Wellman, H. M., and Liu, D. (2004). Scaling of theory-of-mind tasks. Child Dev. 75, 523–541. doi: 10.1111/j.1467-8624.2004.00691.x
Westra, E., and Nagel, J. (2021). Mindreading in conversation. Cognition 210:104618. doi: 10.1016/j.cognition.2021.104618
Wimmer, H., and Perner, J. (1983). Beliefs about beliefs: representation and constraining function of wrong beliefs in young children’s understanding of deception. Cognition 13, 103–128. doi: 10.1016/0010-0277(83)90004-5
Winner, E. (1997). The point of words: Children’s understanding of metaphor and irony. Cambridge, MA: Harvard University Press.
Woodward, A. L. (2009). Infants’ grasp of others’ intentions. Curr. Dir. Psychol. Sci. 18, 53–57. doi: 10.1111/j.1467-8721.2009.01605.x
Woolfe, T., Want, S. C., and Siegal, M. (2002). Signposts to development: theory of mind in deaf children. Child Dev. 73, 768–778. doi: 10.1111/1467-8624.00437
Keywords: theory-of-mind, complements, structural equation, language, inhibitory control, longitudinal
Citation: de Villiers JG and de Villiers P (2025) The contributions of language and inhibitory control to false belief reasoning over time. Front. Psychol. 15:1455941. doi: 10.3389/fpsyg.2024.1455941
Edited by:
Douglas Frye, University of Pennsylvania, United StatesReviewed by:
Gary Morgan, Fundació per a la Universitat Oberta de Catalunya, SpainElisabet Serrat, University of Girona, Spain
Ira Andrew Noveck, Centre National de la Recherche Scientifique (CNRS), France
Kevin John Riggs, University of Hull, United Kingdom
Copyright © 2025 de Villiers and de Villiers. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jill G. de Villiers, amRldmlsbGlAc21pdGguZWR1
†These authors have contributed equally to this work and share first authorship