- Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy, Ludwig-Maximilian-University of Munich, Munich, Germany
Math difficulties (MD) manifest across various domain-specific and domain-general abilities. However, the existing cognitive profile of MD is incomplete and thus not applicable in typical settings such as schools or clinics. So far, no review has applied inclusion criteria according to DSM or ICD, summarized domain-specific abilities or examined the validity of response time scores for MD identification. Based upon stringent clinical criteria, the current meta-analysis included 34 studies which compared cognitive performances of a group with MD (n = 680) and a group without MD (n = 1565). Criteria according to DSM and ICD were applied to identify MD (percentile rank ≤ 16, age range 8–12 years, no comorbidities/low IQ). Effect sizes for 22 abilities were estimated and separated by their level and type of scoring (AC = accuracy, RT = response time). A cognitive profile of MD was identified, characterized by distinct weaknesses in: (a) computation (calculation [AC], fact retrieval [AC]), (b) number sense (quantity processing [AC], quantity-number linking [RT], numerical relations [AC]), and (c) visual-spatial short-term storage [AC]. No particular strength was found. Severity of MD, group differences in reading performance and IQ did not significantly moderate the results. Further analyses revealed that (a) effects are larger when dealing with numbers or number words than with quantities, (b) MD is not accompanied by any weakness in abilities typically assigned to reading, and (c) weaknesses in visual-spatial short-term storage emphasize the notion that number and space are interlinked. The need for high-quality studies investigating domain-general abilities is discussed.
Introduction
About 3–6% of all children experience severe difficulties in mathematics despite having normal intelligence and access to adequate education (Shalev, 2007; Moll et al., 2014; Fortes et al., 2016). If not identified and treated at an early stage, math difficulties (MD) can persistently affect academic functioning (Shalev et al., 2005; Morgan et al., 2009) and increase the risk of mental health problems (Willcutt et al., 2013; Endlich et al., 2014; Devine et al., 2018).
Given these possible consequences, it is essential to identify MD correctly (Ritchie and Bates, 2013; Lewis and Fisher, 2016). Although different diagnostic approaches exist (e.g., response to intervention), in most cases MD is diagnosed when performance in cognitive abilities related to MD is below average. Usually DSM-5 or ICD-10 (soon ICD-11) is used for this kind of MD identification. However, both offer only a short description of cognitive abilities which are affected by MD. In fact, a valid cognitive profile of MD based on clinical criteria in accordance with DSM or ICD is still missing (Geary, 2010; Pham and Riviere, 2015; Träff et al., 2017).
What is known is that MD manifest across several mathematical (e.g., fact retrieval) and also non-mathematical (e.g., working memory) abilities, which are also referred to as domain-specific abilities and domain-general abilities, respectively (Henik et al., 2011). While prior reviews have reported a huge variety of strengths and weaknesses in those abilities (e.g., Cowan and Powell, 2014; Karagiannakis et al., 2014; Shin and Bryant, 2015; Peng et al., 2018), they have not applied inclusion criteria to identify MD which are in accordance with diagnostic procedures given by DSM and ICD. In most cases, inclusion criteria were too liberal (e.g., high cut-off value, age range too broad) or group differences between people with and without MD were not controlled for (e.g., comorbid reading difficulties). In addition, important parts to complete the overall cognitive profile of MD have never been systematically reviewed yet. Most importantly, no meta-analysis has summarized domain-specific abilities or has systematically considered different levels of abilities. Especially number sense, which refers to the basic processing of magnitudes and numbers, consists of multiple sub-abilities (Butterworth et al., 2011; Kaufmann and von Aster, 2012; Hirsch et al., 2018). Although DSM and ICD suggest to measure number sense for MD identification, it is unclear how each sub-ability (e.g., quantity processing) differs between people with and without MD. Also, most studies have focused only on abilities which were measured by using accuracy scores (e.g., number of solved items) while response time scores are clearly under researched and not discussed in terms of their validity to differentiate between people with and without MD (e.g., Hanich et al., 2001; Mammarella et al., 2013).
Therefore, the aim of this review is to fill these research gaps and to define a most comprehensive cognitive profile of MD based on clinical criteria given by DSM and ICD. For this reason, we applied strict inclusion criteria, differentiated several ability levels and compared various scoring types to make the most precise statements about strengths and weaknesses in MD.
The manuscript is structured as follows: In the first part of the introduction we summarize the results of prior reviews, while in the second part we relate these results to the aims of our analysis. After describing the methods and results of our study we discuss how and why our cognitive profile of MD differs from existing results and how it is related to existing cognitive theories and neurobiological studies about MD.
Prior Reviews and Meta-Analysis
Domain-Specific Abilities
According to DSM and ICD MD is described by weaknesses in computation, math reasoning and number sense (American Psychiatric Association, 2013; World Health Organization, 2018). These weaknesses are also referred to as domain-specific weaknesses, since they are all directly related to mathematics (Henik et al., 2011).
Computation and Math Reasoning
Computation means to solve arithmetical problems. Depending on the type of arithmetical problem, computation can be divided into two sub-abilities: Fact retrieval and calculation. Fact retrieval is used when simple arithmetical problems (e.g., multiplication tables) are solved by retrieval of facts from long-term memory. Calculation is used when dealing with more complex and multi-step problems (e.g., 352 + 943), which require an overall understanding of basic arithmetic operations (Busch et al., 2015). Both calculation and fact retrieval are considered as the hallmark symptoms of MD, as deficits in these abilities are most prevalent (Jordan and Hanich, 2003). However, that is not the case with math reasoning and number sense which still lack a consistent definition. As a consequence, both abilities are measured in various ways using different tasks, which in turn, lead to different strengths and weaknesses in people with MD (Berch, 2005; Gersten et al., 2005).
Math reasoning is understood as the application of mathematical knowledge to solve unfamiliar problems (Lindquist et al., 2017). Although several tasks to measure math reasoning exist, no clear differentiation in terms of sub-abilities can be made. Based on the description of math reasoning above, two groups of tasks can be separated. The first group consists of tasks which require participants to solve rather complex problems by using their full mathematical knowledge (Kaufmann and von Aster, 2012; Casey et al., 2015). A typical task for this group is word problems, which is often used to measure math reasoning and also leads to large deficits in people with MD (Yip et al., 2020). In addition, tasks which involve the interpretation of data (e.g., tables and charts) or geometry have been applied too (Proctor, 2012; Tolar et al., 2016). The second group includes tasks which focus more on reasoning than on knowledge. For example, Zhang et al. (2017) used number series (i.e., find a pattern in a given sequence of numbers by applying basic arithmetic operations) in addition to word problems to measure math reasoning. However, number series tasks correlate strongly with fluid reasoning (i.e., non-verbal IQ) so that deficits in math reasoning could be moderated more by general IQ than by MD status (Floyd et al., 2003; Benson et al., 2016). But no studies exist which compare both type of tasks in people with MD.
Number Sense
Certainly, the most difficult domain-specific ability to pinpoint and to measure is number sense. Number sense in general refers to difficulties in processing magnitudes and numbers (Butterworth et al., 2011). However, several sub-abilities have been assigned to number sense in the past (for an overview: Kaufmann and von Aster, 2012; Hirsch et al., 2018). These sub-abilities can be differentiated in terms of what type of information is processed and what type of cognitive task is demanded as follows: (1) knowing numerals (e.g., counting aloud, transcoding), (2) processing quantities (e.g., non-symbolic comparison), (3) linking quantities to numerals/numbers (e.g., subitizing and dot enumeration), and (4) relating numbers (e.g., symbolic comparison, number line). This sequence (from numerals and quantities to numbers) is also in line with various numerical development models (Krajewski and Schneider, 2009; Michalczyk et al., 2013; Simanowski and Krajewski, 2019). Another approach would be to separate between processing non-symbolic numerosities (e.g., set of dots), symbolic numbers (e.g., digits) and the mapping between both (Kolkman et al., 2013; Huijsmans et al., 2020). Although longitudinal studies have shown that number sense in kindergarten is strongly predictive of math performance in school, people with MD show different strengths and weaknesses depending on which number sense sub-ability is measured (Jordan et al., 2009; Desoete et al., 2012; Geary et al., 2012). Currently only one meta-analysis examines differences between people with and without MD in number sense, which is also the only published meta-analysis for domain-specific abilities in general. Based on 19 studies, Schwenk et al. (2017) reported a more severe weakness in processing of numbers (symbolic comparison) than of quantities (non-symbolic comparison). Effect size for quantity processing was 0.24, while the effect size for number processing was 0.75. This result is in line with a meta-analysis by Schneider et al. (2017) who reported a higher correlation of math performance with number than quantity processing in people without MD. Regarding the simultaneously processing of quantities and numbers, results are still ambiguous. Overall, people with MD do not always show a consistent weakness in dot enumeration or subitizing tasks (Landerl, 2013; Szûcs et al., 2013; Skagerlund and Träff, 2014). Especially for subitizing there are still a lot of unsolved questions, for example, whether subitizing even belongs to mathematics or whether it is rather a general cognitive mechanism to process different kinds of magnitudes (e.g., space and time) (Anobile et al., 2019). However, a robust weakness seems to be prevalent in people with MD when dealing with numerals, especially in transcoding (Moura et al., 2015; Iglesias-Sarmiento and Deaño, 2016; Raddatz et al., 2017).
Domain-General Abilities
Domain-general abilities are part of the overall cognitive functioning and are therefore not strictly limited to mathematics (Geary et al., 2017; Silver et al., 2020). Nevertheless, several studies and multiple meta-analyses have tried to relate certain strengths and weaknesses in domain-general abilities to MD (Henik et al., 2011; Fias et al., 2013; Watson and Gable, 2013). This approach is also supported by fMRI studies. Here, a neuronal network of brain areas linked to domain-specific as well as domain-general abilities was identified when doing arithmetic tasks [for an overview: Kucian (2016), Peters and de Smedt (2018)].
Short-Term Working Memory
Most reviews about distinct strengths and weaknesses in domain-general abilities focused on short-term working memory, which compromises of four sub-abilities: auditory and visual-spatial short-term storage, working memory capacity and attentional control (Schneider and McGrew, 2018). Auditory or visual-spatial short-term storage refer to the ability to temporarily store verbal or visual-spatial information, respectively (Lehnert and Zimmer, 2006). Regarding auditory short-term storage, most reviews reported a small weakness in people with MD (Swanson and Jerman, 2006; David, 2012). Only one review by Johnson et al. (2010) found a larger difference between people with and without MD that corresponded to a medium effect size of 0.60. By comparison, effects in visual-spatial short-term storage were in total only analyzed by 2 reviews, which reported both a medium effect size of 0.60 (David, 2012; Peng et al., 2018). In recent years, several studies examined differences between both sub-abilities in people with MD. Since they found larger effects in visual-spatial than in auditory short-term storage (Landerl et al., 2009; Szûcs et al., 2013; Kroesbergen and van Dijk, 2015; Menon, 2016), these studies suggested a general weakness of people with MD in processing visual-spatial information.
Working memory capacity, another sub-abilities of short-term working memory, is the ability to store and process information simultaneously (Swanson, 2012). It is usually measured by tasks in which participants are required to recall a list of items (e.g., letters and digits) in reverse order, or by tasks in which participants have to answer a set of questions while simultaneously memorizing the last word of each question in the given order (i.e., complex span). Regardless of the task, the performance of people with MD was usually lower compared to people without MD and most reviews reported medium effect sizes in working memory capacity. Johnson et al. (2010) estimated a large effect of 0.91 when verbal items had to be memorized while the effect was small for visual items. This difference between type of items was not confirmed by Swanson et al. (2009), who reported medium effect sizes for both type of items. However, since no review reported none or small effect sizes, a general weakness of people with MD in working memory capacity is assumed (Peng and Fuchs, 2014; Attout and Majerus, 2015; Mammarella et al., 2018).
The last sub-ability of short-term working memory is attentional control (or executive functions). Attentional control is understood as the ability to monitor, adapt, and regulate cognitive performance in reaction to changing task settings (van der Sluis et al., 2004). In contrast to the other sub-abilities of short-term working memory, attentional control can be further divided into 3 sub-abilities: Inhibition (i.e., deliberately inhibit a prepotent response), shifting (i.e., shift between tasks), and updating (i.e., update task-relevant information in memory) (Miyake et al., 2000). For people with MD, findings are too ambiguous to draw conclusions yet. Across all sub-abilities, a review by Johnson et al. (2010) reported a small difference between people with and without MD which does not correspond to a distinct weakness. In contrast, Peng et al. (2018) found a large difference between both groups. Peng et al. (2018) also analyzed sub-abilities and estimated a small effect size of 0.37 in inhibition while effect sizes in shifting and updating were 0.75 and 0.76, respectively. No further review about strengths and weaknesses of people with MD in attentional control exists.
Other Abilities
Other abilities which were summarized by reviews about MD are processing speed, phonological processing, visual processing and fluid reasoning. However, most of them were not analyzed by more than 2 reviews and findings are mixed. Regarding processing speed, it is the ability to encode information quickly and to perform simple cognitive tasks based on this information fast (Conway et al., 2002). Several sub-abilities can be separated for processing speed (Salthouse, 2000), however, it is mainly perceptual speed which is at its core (Schneider and McGrew, 2018). Perceptual speed basically means to compare simple visual stimuli for differences and similarities very quickly and is measured by visual matching or coding tasks (Ackerman and Beier, 2007). For people with MD, reviews by Johnson et al. (2010) and Peng et al. (2018) reported small and medium effects in favor of people without MD, respectively. However, Johnson et al. (2010) also included studies which measured processing speed by applying rapid naming tasks. Since these tasks require participants to rapidly retrieve the names of well-known stimuli (e.g., letters and numbers) from memory, they are actually measuring retrieval fluency (Koponen et al., 2020). As a consequence, only the review by Peng et al. (2018) remains relevant, which reported a weakness for people with MD in processing speed.
Retrieval fluency, on the other hand, was analyzed by 3 reviews. While Swanson and Jerman (2006) and Peng et al. (2018) reported a weakness corresponding to a medium effect size, Swanson et al. (2009) on the other hand only estimated a small effect size of 0.39 in a follow-up review of his previous one. Retrieval fluency itself is considered as a sub-ability of phonological processing which is described as the ability to “use phonological information (…) in processing written and oral language” (Wagner and Torgesen, 1987). Another sub-ability of phonological processing is phonetic coding (or phonological awareness) in which tasks phonemes, syllables, or onset-rimes have to be manipulated (Treiman and Zukowski, 1996). Only a review by Peng et al. (2018) summarized differences between people with and without MD and reported a distinct weakness in people with MD with a large effect size of 1.31. Since phonological processing is strongly related to writing and especially reading (Wimmer et al., 1991), the weakness found by Peng et al. (2018) can also be compared with reviews analyzing the overall reading and writing ability in people with MD. However, two reviews by Swanson and Jerman (2006) and Swanson et al. (2009) only partially confirm this weakness in phonological processing. While both reviews included the same tasks to measure phonological processing, Swanson and Jerman (2006) reported a small effect size of 0.3 while Swanson et al. (2009) found a large effect size of 1.03.
Besides phonological processing, there is also visual processing, which is the overall ability to perceive, discriminate, manipulate and recall non-linguistic images (Schneider and McGrew, 2018). It is distinct from perceptual speed in a way that it’s not about the fast encoding of simple visual stimuli but the accurate processing of complex visual or visuospatial stimuli. A vast array of sub-abilities have been described in the past (for an overview: Hegarty and Waller, 2005). For people with MD, Peng et al. (2018) reported a small effect size in mental rotation, visuospatial perception and spatial visualization. However, the overall number of studies examining differences in people with and without MD is small and no conclusions regarding strengths and weaknesses in MD can be drawn yet.
The last ability which is commonly summarized by reviews is fluid reasoning. Fluid reasoning is a special case since it is considered as a first-order factor of intelligence on which all other cognitive abilities discussed so far load on as second-order factors (Schneider and McGrew, 2018). Findings by reviews regarding differences between people with and without MD in fluid reasoning are very mixed. They vary from small differences for visual items (Swanson and Jerman, 2006), to medium and large differences for verbal items (Swanson et al., 2009), to large differences regardless of item type (Johnson et al., 2010). Reported differences are in favor of those without MD but given the range of differences it is unclear whether MD is associated with a distinct weakness in fluid reasoning.
To sum it up: Based on published reviews and studies, MD is usually accompanied by domain-specific weaknesses in calculation, fact retrieval and math reasoning; and by domain-general weaknesses in working memory capacity and most likely in visual-spatial short-term storage. For auditory short-term storage the difference between people with and without MD is too small to be considered as a weakness. For all the other domain-specific and domain-general abilities, findings are too ambiguous to draw conclusions.
Contributions of Present Meta-Analysis Over and Above Existing Ones
Despite the existing evidence given by published meta-analyses, the cognitive profile of MD remains incomplete and not applicable in typical settings. Specifically, we identified three key issues which our meta-analysis addresses. First, DSM or ICD are mostly used to diagnose MD. However, no review so far has applied inclusion criteria similar to diagnostic criteria stated by DSM and ICD. Second, domain-specific abilities have never been systematically summarized yet although they are most important for MD identification. Instead, domain-general abilities were the focus of meta-analyses in the past. And third, it is unclear if strengths or weaknesses in abilities are similar if different scoring types are used (e.g., accuracy and response time). So far, meta-analyses focused mainly on accuracy scores and it is unclear, if response time scores can be used for MD identification.
Inclusion Criteria According to DSM and ICD
Regarding the first issue, DSM and ICD define clear criteria to identify MD. That is, MD manifests during the first years of formal schooling and is therefore mostly diagnosed within this period. Overall math performance is below average for age. To test for low math performance, clinical interviews and standardized math achievement tests are used. For tests, performance should be at least 1 standard deviation below the population mean (i.e., percentile rank ≤ 16). Low math performance is not attributable to other causes [e.g., intellectual disabilities, neurological disorders, attention deficit hyperactivity disorder (ADHD)]. Also, to diagnose isolated MD, other learning disabilities, especially reading disabilities (RD), need to be excluded based on the same criteria stated above.
So far, no review applied these inclusion criteria. Regarding the first criterion, onset and age range, only Schwenk et al. (2017) defined an acceptable age range of 6–14 years of age. Johnson et al. (2010) and Peng and Fuchs (2014) included studies with adolescents and applied an upper age limit of 18 and 20, respectively. In reviews by Swanson et al. (2009); David (2012), and Peng et al. (2018) the age range is unclear. For the second criterion, cut-off value, no review is an accordance with the DSM or ICD. The most conservative criterion is used by Schwenk et al. (2017) who only included studies which applied a cut-off of percentile rank 25 or lower. In comparison, Johnson et al. (2010); Peng and Fuchs (2014) and Peng et al. (2018) used a liberal cut-off of percentile rank 35 while reviews by Swanson and Jerman (2006); Swanson et al. (2009), and David (2012) do not mention any cut-off or accepted teacher ratings as criteria instead. For the last criterion, other reasons and comorbidities, only Peng and Fuchs (2014); Schwenk et al. (2017), and Peng et al. (2018) used an IQ criterion of at least 80. However, studies testing students in educational settings do not always apply an additional IQ criterion since an IQ at or greater than 70 is often required to attend regular schools in most countries. Regarding ADHD, it is hardly excluded in studies about MD and therefore not applied as an exclusion criterion in any review about MD either. RD was excluded in multiple reviews (Swanson and Jerman, 2006; Swanson et al., 2009; David, 2012; Peng and Fuchs, 2014; Peng et al., 2018). However, it is often unclear if the inclusion criteria for RD were analogous to those for MD.
That means, no review was completely in accordance with DSM and ICD and it remains unclear if the published strengths and weaknesses in domain-specific and domain-general abilities can be used for MD identification. In particular, strict or liberal cut-offs seem to lead to different cognitive profiles (Murphy et al., 2007; Swanson et al., 2018; Busch et al., 2019). Especially weaknesses in number sense and visual-spatial short-term storage were more pronounced when stricter cut-offs were applied. Also, 20–40% of people with MD also suffer from RD (Moll et al., 2019). Several studies have shown that RD is accompanied by distinctive domain-general weaknesses in phonological processing (Cirino et al., 2015), auditory short-term storage (Mähler and Schuchardt, 2016) and retrieval fluency (Willcutt et al., 2013), which are also present in comorbid cases of MD and RD. Therefore, a proper exclusion of RD is necessary to define a cognitive profile of MD.
Summary of Domain-Specific Abilities
Regarding the second issue, prior studies have shown various strengths and weaknesses in domain-specific abilities. While weaknesses in mathematical core abilities like calculation or fact retrieval are mandatory for MD, findings regarding math reasoning and number sense are mixed. Especially number sense can be measured in various ways and consists of different sub-abilities. If those sub-abilities differ in their effect sizes, like they do in number sense, and if no differentiation is made between sub-abilities, conclusions about the overall ability can be seriously biased and can lead to an incorrect MD identification. In addition, number sense is described as a precursor ability to later math abilities like computation (Butterworth et al., 2011; Träff et al., 2020). That means, its development must be considered when MD is identified during the first years of formal schooling. It cannot be ruled out that basic number sense sub-abilities are already developed, even in people with MD, so that no differences between people with and without MD can be found. For example, Fazio et al. (2014) and Schneider et al. (2017) reported low correlations between quantity processing and math performance. Similarly, Schwenk et al. (2017) reported larger differences between people with and without MD in number processing, which is more advanced, than in quantity processing. Since no meta-analysis has systematically summarized number sense sub-abilities yet, the average performance of people with MD during the first years of formal schooling is unknown.
Differentiation Between Type of Scoring
Regarding the third issue, abilities can be measured by accuracy or response time scores. So far, only a review by Schwenk et al. (2017) explicitly specified type of scores and summarized scores solely based on response time. For all other reviews it is unclear, what type of scores were included which in turn can seriously affect the size of the reported effects. While most abilities are, per definition, clearly based upon certain type of scores (e.g., processing speed), it is not so obvious for others (e.g., fact retrieval and attentional control). From a statistical perspective, response time scores are usually not normally distributed like accuracy scores. Instead, they follow an asymmetric ex-Gaussian distribution which affects the robustness of most statistical tests (Marmolejo-Ramos et al., 2015). To overcome this problem, data based on response time scores is sometimes transformed in such a way that it is more normally distributed (Lachaud and Renaud, 2011; Speelman and McGann, 2013). However, those procedures vary between studies and skewed distributions often remain skewed even after transformation (Lo and Andrews, 2015). Therefore, mixing accuracy and response time scores also means to mix two different types of data with unpredictable consequences for the overall effect. Thus, it is better to separate between accuracy and response time scores and to control for different data distributions between both scores while simultaneously assume rather similar distributions within. In addition, tasks measuring accuracy or response time scores can also include additional time constraints when there is a time limit or participants are required to work “as fast as possible.” Several studies have reported slower processing speed in people with MD (Proctor, 2012; Niileksela and Reynolds, 2014). For those with MD, additional time constraints can complicate tasks thus negatively affecting their performance compared to people without MD. However, studies about MD rarely analyze response time scores (e.g., Hanich et al., 2001; Mammarella et al., 2013). Consequently, it is unclear if and how response time scores can be used for MD identification and whether the effect sizes are similar regardless of scoring type.
Aims
As described, people with MD show various strengths and weaknesses in domain-specific as well as domain-general abilities. However, the cognitive profile of MD is still incomplete and previous meta-analysis failed to apply stringent inclusion criteria. The aim of this study was to summarize the domain-specific and domain-general strengths and weaknesses in MD by using diagnostic procedures in accordance with the DSM and ICD. A systematic review and meta-analysis was performed, including studies which compared people with and without MD on several levels of abilities and scoring types.
Method
Search Strategies
We searched for studies which compared a group with MD with a typically developing group (TD) on any cognitive measure. The literature search was conducted in February 2021 using the following databases: PsycINFO, MEDLINE, ERIC, ProQuest, PSYNDEX and MathEduc. ProQuest and PSYCINFO were also used to find relevant dissertations and master’s theses. In addition, we searched the following published reviews about MD for relevant citations: Geary (2004, 2010, 2011), Swanson and Jerman (2006), Swanson et al. (2009), Johnson et al. (2010), Raghubar et al. (2010), David (2012), Peng and Fuchs (2014), Shin and Bryant (2015), Peng et al. (2016), Peng et al. (2018), and Vanbinst and de Smedt (2016).
Depending on the database, we searched with English and/or German search terms using title, abstract and keywords as search fields. To find studies comparing people with and without MD, we combined keywords for MD with keywords for difference and group membership using AND (see Supplementary Material).
Figure 1 provides a flow chart of the literature search according to Moher et al. (2009). The literature search initially provided 3039 studies. After removal of duplicates and application of every inclusion criteria, the number of studies was narrowed down to 35. The quality of those studies was assessed by using the Appraisal Tool For Cross-Sectional Studies (AXIS) (Downes et al., 2016), which confirmed proper study quality for all studies. Three studies were based on the same data set (de Weerdt et al., 2013a,b; Desoete and de Weerdt, 2013), which we therefore treated as a single study. One study (Schleifer and Landerl, 2011) consisted of three sub-studies with different samples, which we considered as 3 different studies. The remaining total pool of 34 studies was divided into 12 different data sets based on scoring and level of every outcome (see Section “Coding Procedures”). For every data set we applied risk of bias analysis and excluded outcomes with a very small number of studies (see Section “Statistical Methods”). Depending on the data set, 1–4 studies were excluded. For data set RT - TC (i.e., response time scores without time constraints) only 3 studies could be identified in total. After risk of bias analysis, no study remained, thus we could not perform any meta-analysis for data set RT - TC.
Figure 1. Flow chart of literature search. AC, accuracy; RT, response time; TC, time constraints; HA, high-level ability; MA, medium-level ability; LA, low-level ability.
Inclusion Criteria
The MD and TD groups each had at least two persons. All included studies were published either in English or German. Because of the language skills of both authors, no other languages could be considered. Publication year of all studies was not before 1992 (publication year of ICD-10).
The MD was defined as a percentile rank at or below 16 in a standardized math test, a lag of at least 18 chronological or 15 instructional months in math (i.e., at least 1.5 years or 1.5 grades), or an existing diagnosis of MD (DSM- IV/5: 315.1; ICD-10: F81.2). Although other criteria and cut-offs exist to diagnose MD (Möller et al., 2012; Kaufmann et al., 2013), we chose PR ≤ 16 (i.e., one standard deviation) since it is recommended as cut-off in DSM-5.
All participants of our included studies received regular education (i.e., no special education) and were between 8 and 12 years old. In case years of age were not reported, participants had to attend 2nd to 6th grade. According to DSM and ICD, MD manifests during the first years of formal schooling and is mostly diagnosed during primary education (Nelson and Powell, 2018). Although very severe math difficulties can already be identified at a very young age (Stock et al., 2010), math performance of students in 1st grade still varies too much for MD identification (Kohn et al., 2013; Koponen et al., 2018). For this reason, we restricted the age range to the typical age range in which MD is usually identified.
For every study, MD and TD groups were matched for age or grade and for gender to control for age-moderated and gender-moderated differences between both groups, respectively.
To control for other causes for low math performance, only studies about MD were included in which, first, math performance was not primarily associated with a genetic or neurological disorder like fragile X syndrome (Murphy, 2009), Turner syndrome (Baker and Reiss, 2016), 22q11.2 deletion syndrome (de Smedt et al., 2009; Brankaer et al., 2016), neurofibromatosis type I (Orraca-Castillo et al., 2014), cerebral palsy (van Rooijen et al., 2015), epilepsy (van Iterson et al., 2015). Second, MD was not primarily associated with low birth weight or premature birth (Taylor et al., 2009; Jaekel and Wolke, 2014). And third, studies controlled for intellectual disability which was defined in accordance with DSM and ICD as an IQ lower than 70. Since an IQ of 70 is relatively low, studies reporting an “average IQ” as their selection criteria were included too.
Because 20–40% of people with MD also show low reading performance (Dirks et al., 2008; Landerl and Moll, 2010; Fischbach et al., 2013; Willcutt et al., 2013; Moll et al., 2014), we included only studies which controlled for reading difficulties (RD). Analogous to MD, RD was defined as a percentile rank above 16 in a standardized reading test, a lag of less than 18 chronological or 15 instructional months in reading (i.e., less than 1.5 years or 1.5 grades), or no existing diagnosis of RD (DSM-IV/5: 315.0; ICD-10: F81.0; ICD-11: 6A03.1).
As a final inclusion criterion, all studies had to report enough data to compute effect sizes based on the standardized mean difference between both groups.
Coding Procedures
For the descriptive variables, we coded study, sample, and diagnostic characteristics. Study characteristics included (a) year of publication and (b) type of publication. Sample characteristics of MD and TD group were (a) sample size, (b) mean age in months, and (c) percentage of male participants. Diagnostic characteristics contained (a) measures of IQ, reading and math performance; (b) criteria used to exclude intellectual disabilities and RD, and criteria used to diagnose MD; and (c) if ADHD was excluded or not.
For the outcome variables, we used an exploratory approach according to Brown et al. (2003). That meant instead of defining a coding scheme for study outcomes in advance we derived relevant coding categories for the outcomes from the included studies themselves. This approach reduced the risk of excluding important data a priori by not-coding relevant outcomes and was therefore most suitable to derive a comprehensive cognitive profile of MD. Furthermore, precision of effect size estimations increased since we could consider dependent correlations between outcomes which could have otherwise been ignored (Riley, 2009).
We started this exploratory coding approach by separating between the scoring and level of every outcome of every study. For scoring, we coded for each outcome if accuracy (AC; e.g., number of solved items) or response time (RT; e.g., mean response time for solved items) was measured and if any time constraints were involved (TC; e.g., time limit, instructed to “work as quickly as possible”). This resulted in 4 different categories for scoring: Accuracy without time constraints (AC − TC) and with time constraints (AC + TC) as well as response time without time constraints (RC − TC) and with time constraints (RT + TC).
For level, we coded every outcome on the following three hierarchical levels of abilities in accordance to their description in the Introduction: High-level ability (HA), medium-level ability (MA), and low-level ability (LA). First, we assigned study outcomes to the same low-level abilities when they shared a similar operationalization (e.g., counting forward from 1 to 10 and counting backward from 8 to 2 was defined as low-level ability number word knowledge). Second, we assigned similar low-level abilities to its respective medium-level abilities (e.g., low-level abilities number word knowledge and quantity processing were assigned to medium-level ability number sense). Third, we assigned medium-level abilities to its corresponding high-level abilities (e.g., medium-level abilities number sense and computing were assigned to high-level ability mathematics). For better understanding, high-level, medium-level and low-level ability can also be referred to as first-order, second-order and third-order factor, respectively. If we could not assign a study outcome to a single low-level ability (e.g., index scores which comprised several low-level abilities) this outcome was only assigned to its respective medium-level or high-level ability (e.g., a single outcome comprising tasks measuring quantity processing and numerical relations was only assigned to medium-level ability number sense and high-level ability mathematics). Taken together, a single outcome was coded on one of four different categories for scoring (AC − TC, RT − TC, AC + TC, and RT + TC) and on three different levels (HA, MA, and LA) which resulted in 4 × 3 independent data sets of coded outcomes. See Supplementary Table 1 for the final coding scheme.
Based upon the set of included studies the first author and a student assistant developed the coding scheme by scanning all reported study outcomes as described above. In case of disagreements, the relevant study outcomes were discussed until a consensus was reached. After the coding scheme was finalized, all descriptive and outcome variables were coded. The first 10 studies were double-coded by the first author and the student assistant. Inter-rater agreement varied between 88 and 97% using Cohen’s kappa (Cohen, 1960) for nominal and intraclass correlation coefficients (Hallgren, 2012) for ordinal or higher levels of measurement, respectively. Lower agreement rates were due to incomplete descriptions of diagnostic cut-off criteria, tests, and measures provided by the studies.
In case of missing data, we contacted the study’s authors. If they didn’t provide the data, effect sizes and standard deviations were computed by transforming reported data using formulas provided by Borenstein (2009). If transforming was not possible, we differentiated between descriptive and outcome data. In case of missing descriptive data, studies were excluded from further analysis if the missing data was related to the inclusion criteria of the meta-analysis. This was done to ensure a clean data set strictly according to our criteria. For missing outcome data, studies were excluded if there was a high risk of a selective outcome reporting bias (Pigott et al., 2013) according to AXIS. Serious selective outcome reporting indicated that a study was strongly biased toward certain conclusions. This also made it unclear, if the methods (esp. statistical procedures) were properly conducted and if we could trust the outcomes which were reported. For this reason, we excluded those studies instead of trying to address this issue by using risk of bias analysis. However, no study had to be excluded because of serious selective outcome reporting.
We assessed the quality of each study by using the Appraisal Tool For Cross-Sectional Studies (AXIS) (Downes et al., 2016) which contains 20 yes/no questions evaluating the quality of reporting, study design, and possibility of bias of cross-sectional studies (e.g., “Was the target population clearly defined?”, Were the limitations of the study discussed?”). The final judgment about study quality is determined by an overall subjective assessment of every item since no numerical rating scale is provided.
Initially we coded 9 high-level abilities, 19 medium-level abilities and 13 low-level abilities. After we separated the overall data set into the corresponding data sets for accuracy and response time with or without time constraints and after we excluded outcomes with a very small number of studies only 5 high-level abilities, 9 medium-level abilities and 8 low-level abilities remained which were included in the meta-analysis. For domain-specific abilities most outcomes fulfilled our criterion and were reported by at least 3 studies with at least 2 different first authors (see Section “Statistical Methods”). However, for domain-general abilities we had to exclude most of the coded outcomes. We did not find more than 2 studies for high-level abilities processing speed, visual processing, learning efficiency, and comprehension knowledge. As a result, our analysis about domain-general abilities only covered short-term working memory (16 studies), reaction and decision speed (3 studies) and fluid reasoning (3 studies).
Statistical Methods
All analyses were conducted using R (R Core Team, 2020) and R package robumeta (Fisher et al., 2017). We used the standardized mean difference between the MD and TD group as effect size measure. For this reason Hedges’ g (Hedges, 1981) was calculated which corrects Cohen’s d (Cohen, 1969) for a small sample bias.
Before conducting the meta-analysis, we applied several measures to reduce the risk of bias in individual studies (e.g., selective reporting) and across studies (e.g., publication bias). Studies were excluded if AXIS indicated a low study quality and a high risk of reporting bias. To reduce the influence of outliers within each data set we applied a 90% Winsorizing to these effect sizes (Dixon, 1960). That is, effect sizes below the 5th or above the 95th percentile were set to the 5th or 95th percentile, respectively.
Outcomes were excluded, if they were not reported by at least 3 studies which in addition were published by at least 2 different first authors. The validity of a meta-analysis depends more on the expected heterogeneity and quality of the studies included than the total number of included studies (Valentine et al., 2010). Since our study sample was well defined because of stringent inclusion criteria (e.g., small age range and clinical diagnostic criteria) and since each study was critically appraised by using a checklist, we expected rather similar effect sizes and smaller standard errors for outcomes. For this reason, we accepted a smaller number of studies per outcome as sufficient. We specifically settled on 3 studies as the minimum number of studies per outcome in the unlikely event that the effect sizes of 2 studies were very contrasting so that the effect size of the third study could give us the right direction of the overall average effect size. The additional criterion of at least 2 different first authors for each outcome was necessary to reduce the risk of authorship bias (Moulin and Amaral, 2020) since the number of researchers focusing on MD is small.
Regarding the analysis, most studies provided more than one outcome and multiple effect sizes for each outcome. To account for statistical dependencies of multiple outcomes and effect sizes from the same studies (Jackson et al., 2011; Moeyaert et al., 2017) we performed a robust variance estimation method based on a random effects model (RVE; Raudenbush, 2009; Hedges et al., 2010; Tipton, 2015). Although we used strict inclusion criteria, we applied a random effects model because we still expected effects of the same outcomes to vary across studies. The RVE method corrects the standard error of the average effect size estimate by taking into account the correlations between effect sizes from the same sample (i.e., same study). For this, an estimation of the mean correlation between all pairs of effect sizes within a study is needed. We chose p = 0.70 which is a rather conservative estimate to reduce the risk of a type I error. In addition, we performed sensitivity analysis with various estimates of p to examine the robustness of our results. But, as Hedges et al. (2010) already pointed out, the estimate of the mean correlation has actually no considerable effect on the standard error estimates.
After the analysis, we conducted several meta-regressions to assess risk of bias across studies. For every outcome we applied the same criterion regarding minimum number of studies as for the overall analysis. That is, a meta-regression was only performed if the respective moderator was reported by at least 3 studies which were published by at least 2 different first authors. To examine publication bias, we applied Egger’s regression test (Egger et al., 1997) and Funnel plot tests (Sterne and Egger, 2001) which examine the influence of the standard errors and the sample size on the estimated average effect sizes, respectively. Both tests, when significant, indicate the presence of a small-study effect which means that too many studies with small sample sizes and large effect sizes are in the data set. However, it should be noted that both tests are prone to false positive results, especially when heterogeneity in the data is high (Rodgers and Pustejovsky, 2020). We created no funnel or forest plots since both type of plots did not account for statistically dependent effect sizes thus making any interpretation misleading (Doleman et al., 2020). To control for biases based on the characteristics of the TD and MD group (e.g., selection bias) we analyzed the moderating effect of severity of MD (i.e., difference in math performance between groups) as well as differences in IQ and reading performance between groups.
Results
Study Characteristics
In total and across all data sets 34 studies were included, comprising 320 effect sizes. The total MD sample consisted of 680 unique people (43.6% male) with a mean age of 117.3 months and the total TD sample covered 1565 unique people (48.4% male) with a mean age of 117.5 months. Regarding IQ, 85.3% of all studies measured non-verbal IQ or a combination of non-verbal and verbal IQ. Cut-off used to exclude low IQ ranged from 80 to 90. Regarding reading performance, 88% of all studies measured reading fluency or a combination of reading fluency and reading comprehension. Most studies (85.3%) applied percentile ranks to exclude RD, which ranged from percentile rank 16 (the pre-defined minimum) to 40. Only 5.9% or 8.8% of all studies used delays in months or an existing diagnosis of RD as exclusion criteria, respectively. Regarding math performance, 97.1% of all studies measured computation or a combination of computation with number sense and/or math reasoning. Similar to reading performance, most studies applied percentile ranks to define MD, which ranged from percentile rank 5–16 (the pre-defined maximum). For MD, 20.6% or 8.8% of all studies used delays in months or an existing diagnosis of MD as inclusion criteria, respectively. Regarding ADHD, less than half of all included studies (47.1%) controlled for ADHD within their samples. Descriptive information and assignment of studies to each data set are listed in Table 1.
Estimated Effects for High-Level Abilities, Medium-Level Abilities, and Low-Level Abilities
The size of the effect is interpreted according to Cohen (1988) (i.e., none or not relevant: g < 0.2; small: 0.2 ≤ g < 0.5; medium: 0.5 ≤ g < 0.8; large: g ≥ 0.8). In addition, only medium to large effect sizes were considered as a particular strength or weakness. Positive effect sizes reflect better scores in favor of the TD group (e.g., higher accuracy and faster response time) and vice versa. We had to control for type I error when determining statistical significance since degrees of freedom were small for some abilities. In line with Tanner-Smith et al. (2016), p < 0.05 was used when degrees of freedom were greater than or equal 4 and p < 0.01 when degrees of freedom were less than 4.
Table 2 lists the estimated effects for high-level abilities, medium-level abilities, and low-level abilities for data sets AC − TC and AC + TC, and RT + RT. To facilitate comparison of scoring and level of outcomes, not all statistics have been reported in Table 2 (e.g., confidence intervals). An extensive list of all statistics is provided by Supplementary Table 2. For a better overview of the results, the estimated effects are shown in Figure 2.
Figure 2. Estimated effects for domain-specific and domain-general abilities. AC, accuracy; RT, response time; TC, time constraints; STS, short-term memory; font formatting reflects ability level (high-level ability, bold and uppercase; medium-level ability, bold and italic; low-level ability, no formatting); size of circle reflects size of effect (i.e., the larger the circle the larger the effect in favor of TD); color of circle reflects type of effect (green = large, yellow = medium, red = small, black = none); *significant effect sizes.
All included outcomes were in favor of the TD group. However, for medium-level abilities and low-level abilities the number of studies was small and p < 0.01 was mostly used as significance level. As a consequence, most outcomes were not statistically significant.
Regarding domain-specific abilities, the MD group showed significant weaknesses in high-level ability mathematics regardless of data set. However, weaknesses in accuracy without any time constraints were usually large compared to medium weaknesses in accuracy with time constraints or response time with time constraints. Especially low-level ability calculation and medium-level ability math reasoning were more affected in people with MD compared to low-level ability fact retrieval. For medium-level ability number sense, results were mixed and dependent on the data set. Effect sizes for low-level ability numerical relations (i.e., mostly symbolic comparison tasks) were large in all data sets, but only significant in data set AC - TC. This was interesting compared to low-level ability quantity processing (i.e., mostly non-symbolic comparison tasks). Here, effect sizes were medium and significant in data set AC + TC while only a small and non-significant effect size could be found for data set RT + TC. Also, low-level ability quantity-number linking (e.g., subitizing and dot enumeration) was larger and significant for data set RT + TC compared to data set AC + TC. Low-level ability number word knowledge (e.g., counting) was only reported for data set AC − TC and a large effect size was found.
For high-level ability reading and writing and their corresponding medium and low-level abilities effect sizes were either small or not-relevant and not-significant. Also, for low-level ability naming facility, which included RAN tasks with numbers as stimuli, we did not find a weakness for people with MD. However, small effect sizes for reading and writing were to be expected since we controlled for RD in our sample of studies.
Regarding domain-general abilities, only some effect sizes for high-level ability short-term working memory were significant. In particular, effect sizes for medium-level abilities visual-spatial short-term storage and working memory were medium compared to small effect sizes for medium-level ability auditory short-term storage. We could not find significant effects for medium-level ability attentional control. The only difference was that effect sizes for data set RT + TC were larger compared to data set AC + TC. However, this was moderated more by the tasks than by MD status since tasks measuring attentional control (e.g., stroop task and flanker task) are usually designed around response time differences.
For high-level ability reaction and decision speed, people with MD showed no significant differences compared to people without MD. Also, no significant difference was found for high-level ability fluid reasoning which was also because all studies controlled for low IQ within their samples.
Risk of Bias Analysis
Results of risk of bias analysis are shown in Supplementary Table 3. For some outcomes the number of studies was too small to perform moderator analysis. To determine statistical significance, we used the same procedure as described above (i.e., p < 0.01 for df < 4; p < 0.05 for df ≥ 4). We only identified publication bias when outcomes were measured using time constraints (AC + TC and RT + TC). Egger’s test indicated a publication bias in data set AC + TC and in data set RT + TC for high-level ability mathematics and medium-level ability number sense, respectively. Otherwise, no other moderating effect for any ability could be identified. That is, sample size (i.e., funnel plot test), differences in IQ or reading performance and severity of MD did not significantly moderate the estimated effects in any data set.
Discussion
This meta-analysis compared people with and without MD regarding their domain-specific and domain-general abilities. Outcomes differed based on their type of scoring (accuracy or response time with/without time constraints) and their level (high-level ability, medium-level ability, and low-level ability). A cognitive profile of MD based on criteria given by DSM and ICD was found, which is characterized by significant weaknesses in the in the following abilities: High-level abilities mathematics (AC − TC, AC + TC, and RT + TC), short-term working memory (AC − TC); medium-level abilities computation (AC + TC), number sense (AC − TC, AC + TC, and RT + TC), visual-spatial short-term storage (AC − TC), working memory capacity (AC − TC); and low-level abilities fact retrieval (AC + TC), quantity processing (AC + TC), quantity-number linking (RT + TC), and numerical relations (AC − TC). Based on the included studies, no particular strength in people with MD compared to those without MD could be found. Severity of MD, group differences in reading performance and IQ did not significantly moderated the results.
Domain-Specific Abilities: Various Weaknesses Across Outcome Levels and Scoring
Regarding domain-specific abilities, this was the first meta-analysis to even summarize abilities in this area. As expected, MD affects all mathematical abilities which is in line with previous research defining MD as a heterogeneous disorder (Kaufmann et al., 2013; Karagiannakis et al., 2014).
Computation and Math Reasoning
For computation, results are mostly consistent with previous studies reporting severe weaknesses in people with MD (Busch et al., 2015). Interestingly, we only found a small effect size for response time scores based on 3 studies. Kucian (2005) is a brain-imaging studies in which 6th grade students had to solve arithmetic facts by choosing between two different answers. Rotem and Henik (2013) applied a similar paradigm wherein arithmetic facts (e.g., 3 × 6 = 18) were presented to 6th grade students and they had to decide if the total equation was right or wrong. The sample in Raddatz et al. (2017) was younger and consisted of students from 2nd to 4th grade which had to solve simple and more complex calculation tasks. Although all 3 studies differ in their study design and sample, these methodological differences do not seem sufficient for us to explain why people with MD answer nearly as fast as people without MD (RT + TC) but simultaneously struggle to find the correct answer when investigating accuracy scores (AC + TC). One way out of this dilemma is to look at how response time scores are usually being analyzed. That is, only response times of correct answer are considered for further analyses, which was also the case in these 3 studies. Applied to our overall review, this means that people with MD are as fast as people without MD in calculation and especially fact retrieval when they know the results. “Knowing the results” basically means, that people with MD can retrieve facts from memory as easily and as fast as people without MD provided they have memorized those facts beforehand. This hypothesis is supported by the fact that we did not find any weakness in retrieval fluency and especially naming facility (i.e., rapid naming), most likely because we excluded reading disabilities in our sample (Moll et al., 2019). Nevertheless, further studies examining response time scores in fact retrieval tasks are needed to provide a final answer. Also, the large effect sizes for calculation was not significant (p = 0.022) when using p < 0.01. Only 5 studies applied an additional calculation task, after they identified people with or without MD based (partially) on calculation tasks. Since weaknesses in calculation are a core element of MD, we can only strongly assume this effect size to become significant with more studies.
One of the largest effect sizes overall was found for math reasoning. As mentioned earlier, math reasoning can be operationalized in various ways (e.g., word problems and number series) and it is not clear yet what math reasoning is actually about and how it is related to fluid reasoning. Unfortunately, we could not solve this riddle. Only 3 studies could be included in our meta-analysis, which did not allow for any further differentiation of sub-abilities. Also, all 3 studies did not report IQ scores and hence no meta-regression was possible to check how math reasoning is related to fluid reasoning.
Number Sense
Regarding number sense, this meta-analysis clearly underlines the essential role of number sense for MD in general (Butterworth et al., 2011) while also gives important hints regarding sub-abilities and scoring. Most interesting, people with MD showed large weaknesses in numerical relations regardless of scoring while weaknesses in quantity processing and quantity-number linking were smaller and inconsistent across types of scoring. In development models about mathematics, at first children learn to process non-symbolic quantities (e.g., ••• > •) and to link non-symbolic quantities to respective symbolic quantities (i.e., ••• = 3 = “three”). The subsequent understanding of relations between those numbers (e.g., 6 > 4) is seen as the final step to process numbers correctly (Träff et al., 2020). While there is an ongoing debate on how representations of non-symbolic and symbolic quantities are connected to one another, quantity processing undeniably plays an important part in early math development (Kuhn et al., 2016). Nevertheless, its correlation with later math abilities has proven to be rather low (Fazio et al., 2014). Considering that our sample consisted of 2nd to 6th grade students, we assume that our participants with and without MD were already too matured in their overall math development (McCaskey et al., 2018) so that weaknesses in more basic number sense abilities like quantity processing and quantity-number linking were less prevalent. From an empirical perspective, our results are in line with several studies showing that weaknesses in numerical relations are more severe and robust in people with MD than weaknesses in other number sense abilities (de Smedt et al., 2013; Sasanguie et al., 2013; Vanbinst and de Smedt, 2016; Schneider et al., 2017; Schwenk et al., 2017). The other sub-ability of number sense for which we identified a large but non-significant weakness in people with MD is number word knowledge. However, 3 of the 4 included studies which reported data on number word knowledge were based on a German-speaking sample. The German number-word system is not structured along the place value of numbers. That means, the order of tens and units in German number words is inverted compared to its Arabic counterparts (Klein et al., 2013). For example, number 23 is written and spoken as “three-and-twenty” instead of “twenty-three” like in English. Besides German, this so called “inversion property” is also common in other languages like Arabic or Dutch and particularly challenging for people with MD (Moeller et al., 2015; van der Ven et al., 2017). Although weaknesses in number word knowledge for people with MD have also been reported in languages without inversion property (Moura et al., 2013), we cannot completely exclude the possibility of a language bias for the large weakness found in our meta-analysis.
Domain-General Abilities: Lack of Studies or Lack of Findings
Our lack of results regarding domain-general abilities was surprising regarding the huge body of research published in recent years about the important role of domain-general abilities for math development (Taub et al., 2008; Henik et al., 2011; Fias et al., 2013). This is because we could not consider a lot of studies since they did not meet our inclusion criteria. Since we excluded RD, we did not find substantial weaknesses in those domain-general abilities typically associated with reading difficulties and comorbid math and reading difficulties: Auditory short-term storage, retrieval fluency and phonetic coding (Peng et al., 2017; Moll et al., 2019). Also, differences in reading performance between the MD and TD group did not significantly moderate effect sizes in any of the included abilities. While this cannot be considered as a particular strength compared to people without MD, it is important to point out that people with MD perform about as well as people without MD on these domain-general abilities. As a consequence, our results contradict those of other reviews which reported medium to large weaknesses and did not exclude RD or failed to mention the exact criteria (Swanson and Jerman, 2006; Swanson et al., 2009; Johnson et al., 2010; Peng et al., 2018). Those reviews also applied liberal criteria regarding cut-offs for MD or age range, so we are not able to pinpoint the exact reason for the different results. For phonetic coding we can provide an alternative explanation for the small effect size we found in our study. This ability is associated with the development of number word knowledge and knowledge about basic arithmetic facts (de Smedt and Boets, 2010; Pollack and Ashby, 2018). Only later are facts retrieved directly from memory and people rely less on phonetic coding. Like quantity processing an age-bias is possible, as our sample of participants could have been too old to have a more severe weakness in phonetic coding. Also, Moll et al. (2019) give hints as to why we even found effects, albeit very small ones, even though RD was excluded in our study. That is, domain-specific ability computation relies stronger on language abilities than number sense and is therefore stronger associated with reading abilities. Especially, retrieving arithmetic facts from memory depends more on the processing of verbal information compared to symbolic or non-symbolic magnitude comparisons. As a result, comorbidity rates between MD and RD are four times higher when MD identification is solely based on computation than on number sense. In our meta-analysis, 97.1% of all included studies applied computation tasks to identify MD. Although we excluded severe reading difficulties, we cannot rule out the possibility that our sample of people with MD still had minor problems in reading because of the tests used to identify MD. That being said, our findings are still in line with multiple studies comparing weaknesses in MD, RD, and MD + RD on these domain-general abilities (Schuchardt et al., 2008; Willburger et al., 2008; Cirino et al., 2015; Moll et al., 2015).
Short-Term Working Memory
For working memory capacity our results are in accordance with multiple reviews reporting medium weaknesses in people with MD despite different inclusion criteria (e.g., Swanson and Jerman, 2006; Johnson et al., 2010; David, 2012; Peng and Fuchs, 2014; Peng et al., 2018). However, medium weaknesses in working memory capacity were also reported in people with RD and MD + RD using various stimuli and tasks (de Weerdt et al., 2013b; Willcutt et al., 2013; Cirino et al., 2015; Mähler and Schuchardt, 2016). Therefore, a general weakness in working memory capacity in people with learning difficulties seems apparent. For this reason, we do not consider the significant weaknesses in working memory capacity to be distinct enough for MD identification. We also found a medium weakness in visual-spatial short-term storage which was particularly interesting compared to the small weakness in corresponding auditory short-term storage. In recent years, several studies found weaknesses in people with MD when processing visual or visuospatial information (Landerl et al., 2009; Szûcs et al., 2013; Skagerlund and Träff, 2014). Hence, a link between number and space was assumed (Huber et al., 2015; Wong, 2017) which was transformed into a General Magnitude Deficit theory (Lourenco et al., 2016; Lourenco and Bonny, 2017; Tobia et al., 2018). According to this, numerical (e.g., dot pattern) and non-numerical magnitudes (e.g., length and time) are based upon similar neurocognitive mechanism. As a result, weaknesses in this system will lead to difficulties processing quantities correctly, which in turn make the understanding of numbers and of number relations more problematic. However, a falsification of this theory is still pending and several authors have reported contradictory results, especially for people with MD (Mussolin et al., 2011; de Visscher et al., 2018). While a General Magnitude Deficit theory seems too linear and too broad to account for complex neurodevelopmental effects and the interconnectivity between multiple brain regions (Skagerlund et al., 2016; McCaskey et al., 2017; Kucian et al., 2018), our meta-analysis gives at least further evidence that weaknesses in number and space do occur simultaneously in MD. Nevertheless, we cannot corroborate this observation with weaknesses in other visual-laden abilities like perceptual speed or visual processing since no studies based on our inclusion criteria were found.
The final ability in our study which belonged to short-term working memory is attentional control. Here, the number of included studies was small and all effect sizes were not significant. Based on the results, weaknesses for people with MD are more severe when scores are based on response time instead of accuracy which is in line with several studies (e.g., Peng et al., 2012; Wang et al., 2012). Most studies measured low-level ability inhibition, hence its effect size was similar to medium-level ability attentional control and therefore medium. We did not find enough data for other low-level abilities like shifting and updating. An explanation for the weakness in inhibition is provided by Geary et al. (2000). They suggested that people with MD have difficulties in fact retrieval because they cannot inhibit similar but nonetheless incorrect results. For example, 3 × 4 can trigger the result 8 (i.e., 2 × 4), 9 (i.e., 3 × 3), or 16 (i.e., 4 × 4), which distract from the correct result 12. However, weaknesses in attentional control and more specifically inhibition are also common in people with ADHD (Barkley, 1997). ADHD was not excluded in our sample since most studies did not control for it. Thus only 3 out of 7 studies reporting data on inhibition excluded ADHD. The findings of our meta-analysis do not support Peng et al. (2018), who reported only a small weakness for inhibition and found instead a large one for updating and shifting in people with MD. Two explanations for this difference in findings between both reviews emerge. First, it is unclear if effect sizes in Peng et al. (2018) are based on accuracy, response time, or both. Since this meta-analysis found smaller effect sizes in accuracy and larger ones in response time the type of score seems to affect the severity of deficits. Second, deficits in inhibition, updating and shifting have also been found for people with isolated RD and comorbid MD+RD (van der Sluis et al., 2004; Booth et al., 2010). Especially shifting and updating seem to be more related to RD than inhibition (Peng et al., 2013; Moura et al., 2014). Since Peng et al. (2018) did not report their criteria used to exclude RD it is unclear if their findings were corroborated by reading difficulties.
Other Abilities
The last two high-level abilities for which studies reported data are reaction and decision speed and fluid reasoning. We did not discuss fluid reasoning much in the introduction and not mentioned reaction and decision speed at all because reported data on both abilities are more a by-product than the main focus of its respective studies. Reaction and decision speed means to react and decide quickly to the onset of a simple stimulus (e.g., press a button when hearing a sound). In studies about MD it is mostly measured as a part of an extensive test battery. For fluid reasoning, most studies controlled for low IQ which in turn affects the overall effect. Also, it is unclear if the respective IQ test was also used to control for low IQ which would result in higher average IQs and smaller variances in the MD and TD group. For both abilities the number of studies reporting data is small. We found a small and non-significant weakness for reaction and decision speed which supports findings by Szûcs et al. (2013) and Raddatz et al. (2017) in that very simple non-numerical tasks do not differ well between people with and without MD. The average effect size for fluid reasoning was medium in our meta-analysis. Considering the methodological uncertainties with IQ scores and the fact that other reviews reported mixed findings for fluid reasoning, we refrain from comparing our results with theirs and from deriving any conclusion about the role of IQ for MD.
Limitations
First, the number of studies for each outcome varied considerably and dropped with increasing specificity of outcomes (e.g., from high-level to low-level ability). While meta-analyses can be performed with no more than 2 studies, results are usually more robust and less prone to publication bias when more studies are included (Lin, 2018). Especially when random effects and a certain degree of heterogeneity between and within studies are expected, effect sizes based on the data of few studies need to be interpreted cautiously (Valentine et al., 2010). To solve this problem, we applied meta-regressions to examine the influence of moderating variables. However, also those measures are accompanied with various issues (e.g., low power for small sample size, no random assignment of studies to moderators) and therefore only give hints on possible data problems (Oxman and Guyatt, 1992; Walker et al., 2008; Wood and Eagly, 2009). Most importantly, we have therefore used strict inclusion criteria in accordance with DSM and ICD to derive valid results. On the one hand, those criteria were responsible for our small sample of studies. On the other hand, they led to a representative sample of MD without other corrupting conditions (comorbidities) or issues (broad age range and high cut-off value) usually accompanied with more liberal criteria. Nevertheless, Egger’s test also indicated a publication bias in data set AC + TC and in data set RT + TC for high-level ability mathematics and medium-level ability number sense, respectively. Although this does not fundamentally contradict our findings, a small-study effect for those abilities is possible.
Second, across all data sets, heterogeneity I2 (Higgins and Thompson, 2002) was substantial and on average 57.5% (AC − TC: 53.6%, AC + TC: 52.6%, RT + TC: 66.4%). Although descriptive information of all included studies indicated a homogenous data set, there is inconsistency in the data which was neither accounted for by our statistical model nor further explained by any moderator we applied. Also, heterogeneity was larger for response time scores than for accuracy scores. From a methodological perspective, small samples tend to increase heterogeneity (IntHout et al., 2015). Also type of scoring could have affected heterogeneity. Data set RT + TC had the highest heterogeneity, the smallest sample size and studies varied considerably in how they measured response time scores (e.g., total time and time for correct answers), detected outliers, or transformed data (e.g., log transformation). Those variations may have affected effect sizes, hence led to greater inconsistencies between studies in data set RT + TC. From a theoretical perspective, Kaufmann et al. (2013) defined heterogeneity as a feature of MD. According to them, people with MD have a core weakness in domain-specific abilities which can be accompanied by weaknesses in domain-general abilities as well. However, the type and severity of all those weaknesses varies depending on the neurofunctional and behavioral development of each individual person with MD. As a result, differences between people with and without MD are less moderated by study design. Instead, heterogeneous manifestations of MD exist within every MD sample due to interindividual neurodevelopmental differences (Zhang et al., 2017; McCaskey et al., 2018). This perspective is also in accordance with a vast research body about different subtypes of MD (Landerl et al., 2004, 2009; Gold et al., 2013; Bartelet et al., 2014; Cirino et al., 2015; Shin and Bryant, 2015; Träff et al., 2017). However, such research questions can only be answered by using controlled studies. While we do not deny the possibility of different subtypes of MD to explain the overall heterogeneity, we can neither test nor control for it. The scope of this meta-analysis was to summarize strengths and weaknesses of MD based on clinical criteria.
Third, ADHD was not deliberately excluded by us. About 10–20% of people with MD also have ADHD or show symptoms thereof (Gross-Tsur et al., 1996; Fortes et al., 2016). Kuhn et al. (2016) have shown that the profile of people with MD + ADHD is a combination of the distinct weaknesses of people with MD or ADHD. Only 16 out of 34 studies of our sample controlled for ADHD. And those who did relied mostly on existing information instead of applying any additional measures. As a result, studies including comorbid cases of MD + ADHD are possible and effect sizes for ADHD-specific outcomes like attentional control were interpreted carefully.
Fourth, studies that did not control for reading difficulties were excluded rather than being coded and used as moderator. We refrained from using a subgroup analysis to compare effect sizes of studies that excluded RD to those that did not for two reasons. First, for studies that did not exclude RD we could not make any assumptions about the distribution of reading performance within each sample. Especially if reading performance was not measured people with RD could have been in the sample thus co-founding the subgroup analysis. Second, subgroup analyses have been criticized for their low statistical power because categories with a different amount of studies and varying study quality are being compared (Oxman and Guyatt, 1992; Walker et al., 2008). While meta-analyses are based on a systematic literature following clear inclusion criteria, subgroup analyses are only observational (i.e., cross-sectional studies) because neither are the studies randomly assigned to each group nor are any moderating variables being controlled for (Higgins and Green, 2011). While this problem also applies to meta-regressions, it is especially pronounced for subgroup analyses. Since this meta-analysis had already to deal with low sample sizes because of our conservative inclusion criteria, we decided that the statistical power of any subgroup analysis would have been too low to allow any conclusions. This also applies to ADHD for which we did not perform a subgroup analysis either.
Fifth, we did not find any particular strength for the MD group based on the effect size. There were many small effect sizes which we refrained from classifying as a particular “strength” or “non-weakness.” By using the terms “strength” and “weakness” we tried to emphasize our exploratory coding approach. Since we did not restrict our coding scheme to certain abilities, we could not rule out that a medium to large effects sizes in favor of the MD group in any ability was theoretically possible.
Implications and Conclusion
We identified a cognitive profile of domain-specific and domain-general deficits of MD which is based on criteria according to the DSM and ICD. Whereas DSM and ICD only describe in a very general way which abilities are affected in MD, we found a distinct set of well operationalized abilities which evidently differ best between groups of children (8–12 years old) with and without MD. These were: Calculation (AC − TC), fact retrieval (AC + TC), quantity processing (AC + TC), quantity-number linking (RT + TC), numerical relations (AC − TC), and visual-spatial short-term storage (AC − TC). This profile helps experts working with DSM or ICD to revise their general diagnostic procedures und treatment plans. Also, for clinical guidelines about diagnostic and treatment of MD, our profile serves as high quality evidence-based information derived by a systematic literature and meta-analysis. However, we also want to point out that this profile is based on average group differences and therefore cannot account for the heterogenous development of MD in individuals.
Our meta-analysis was the first in many aspects. We used stringent inclusion criteria according to DSM and ICD, summarized domain-specific abilities in a systematic way, applied a multi-level coding scheme and differentiated scoring types. While this approach was very laborious and needed many adjustments, it resulted in a comprehensive profile about MD which has various implications for further research. Most importantly, when controlling for reading difficulties MD is not accompanied by substantial weaknesses in those domain-general abilities which are typically assigned to reading (i.e., phonetic coding, naming facility, and auditory short-term storage). Also, visual-spatial short-term storage is more affected in MD compared to auditory short-term storage which emphasizes the notion that number and space are interlinked and overall important for math development. In the past, multiple fMRI studies have reported neuronal activity in areas allocated to domain-general abilities when doing arithmetic and even suggested a neuronal network of domain-specific and domain-general abilities (Kucian, 2016; Peters and de Smedt, 2018). If and how this applies to MD is still unknown. Unfortunately, also our meta-analysis could not derive a distinct pattern of strengths and weaknesses in domain-general abilities since most studies about domain-general abilities did not meet our stringent inclusion criteria. A pressing need for high quality studies investigating those abilities is obvious.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://osf.io/sbfcz/.
Author Contributions
SH and GS-K contributed to conception and design of the study. SH performed the literature search, coding, statistical analysis, and wrote the first draft of the manuscript. GS-K revised first draft of the manuscript. Both authors contributed to the article and approved the submitted version.
Funding
This work was funded by Deutsche Gesellschaft für Kinder- und Jugendpsychiatrie, Psychosomatik und Psychotherapie and by Bundesverband Legasthenie & Dyskalkulie.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2022.842391/full#supplementary-material
References
Ackerman, P. L., and Beier, M. E. (2007). Further explorations of perceptual speed abilities in the context of assessment methods, cognitive abilities, and individual differences during skill acquisition. J. Exp. Psychol. Appl. 13, 249–272. doi: 10.1037/1076-898x.13.4.249
American Psychiatric Association (2013). Diagnostic and Statistical Manual of Mental Disorders, 5th Edn. Virginia: American Psychiatric Association, doi: 10.1176/appi.books.9780890425596
Anobile, G., Arrighi, R., and Burr, D. C. (2019). Simultaneous and sequential subitizing are separate systems, and neither predicts math abilities. J. Exp. Child Psychol. 178, 86–103. doi: 10.1016/j.jecp.2018.09.017
Ashkenazi, S., Mark-Zigdon, N., and Henik, A. (2009). Numerical distance effect in developmental dyscalculia. Cogn. Dev. 24, 387–400. doi: 10.1016/j.cogdev.2009.09.006
Ashkenazi, S., Mark-Zigdon, N., and Henik, A. (2013). Do subitizing deficits in developmental dyscalculia involve pattern recognition weakness? Dev. Sci. 16, 35–46. doi: 10.1111/j.1467-7687.2012.01190.x
Attout, L., and Majerus, S. (2015). Working memory deficits in developmental dyscalculia: the importance of serial order. Child Neuropsychol. 21, 432–450. doi: 10.1080/09297049.2014.922170
Baker, J. M., and Reiss, A. L. (2016). A meta-analysis of math performance in Turner syndrome. Dev. Med. Child Neurol. 58, 123–130. doi: 10.1111/dmcn.12961
Barkley, R. A. (1997). Behavioral inhibition, sustained attention, and executive functions: constructing a unifying theory of ADHD. Psychol. Bull. 121, 65–94. doi: 10.1037/0033-2909.121.1.65
Bartelet, D., Ansari, D., Vaessen, A., and Blomert, L. (2014). Cognitive subtypes of mathematics learning difficulties in primary education. Res. Dev. Disabil. 35, 657–670. doi: 10.1016/j.ridd.2013.12.010
Benson, N. F., Kranzler, J. H., and Floyd, R. G. (2016). Examining the integrity of measurement of cognitive abilities in the prediction of achievement: comparisons and contrasts across variables from higher-order and bifactor models. J. Sch. Psychol. 58, 1–19. doi: 10.1016/j.jsp.2016.06.001
Berch, D. B. (2005). Making sense of number sense: implications for children with mathematical disabilities. J. Learn. Disabil. 38, 333–339. doi: 10.1177/00222194050380040901
Booth, J. N., Boyle, J. M. E., and Kelly, S. W. (2010). Do tasks make a difference? Accounting for heterogeneity of performance of children with reading difficulties on tasks of executive function: findings from a meta-analysis. Br. J. Dev. Psychol. 28, 133–176. doi: 10.1348/026151009X485432
Borenstein, M. (2009). “Effect sizes for continuous data,” in The Handbook of Research Synthesis and Meta-Analysis, 2nd Edn, eds H. Cooper, L. V. Hedges, and J. C. Valentine (New York, NY: Russell Sage Foundation), 221–235. doi: 10.3760/cma.j.issn.0412-4081.2019.07.009
Brankaer, C., Ghesquière, P., de Wel, A., Swillen, A., and de Smedt, B. (2016). Numerical magnitude processing impairments in genetic syndromes: a cross-syndrome comparison of Turner and 22q11.2 deletion syndromes. Dev. Sci. 20:e12458. doi: 10.1111/desc.12458
Brown, S. A., Upchurch, S. L., and Acton, G. J. (2003). A framework for developing a coding scheme for meta-analysis. West. J. Nurs. Res. 25, 205–222. doi: 10.1177/0193945902250038
Busch, J., Schmidt, C., and Grube, D. (2015). Arithmetic fact retrieval: are there differences between children with developmental dyscalculia and those with mathematical difficulties? Z. Psychol. 223, 110–119. doi: 10.1027/2151-2604/a000209
Busch, J., Schmidt, C., Studte, S., and Grube, D. (2019). Kognitive merkmale rechenschwacher kinder in abhängigkeit vom cut-off kriterium [do the cognitive profiles of children with mathematical difficulties vary as a function of the used cut-off criterion?]. Lernen Und Lernstörungen 8, 167–178. doi: 10.1024/2235-0977/a000258
Butterworth, B., Varm, S., and Laurillard, D. (2011). Dyscalculia: from brain to education. Science 332, 1049–1053. doi: 10.1126/science.1201536
Casey, B. M., Pezaris, E., Fineman, B., Pollock, A., Demers, L., and Dearing, E. (2015). A longitudinal analysis of early spatial skills compared to arithmetic and verbal skills as predictors of fifth-grade girls’ math reasoning. Learn. Individ. Differ. 40, 90–100. doi: 10.1016/j.lindif.2015.03.028
Censabella, S., and Noël, M. P. (2008). The inhibition capacities of children with mathematical disabilities. Child Neuropsychol. 14, 1–20. doi: 10.1080/09297040601052318
Cirino, P. T., Fuchs, L. S., Elias, J. T., Powell, S. R., and Schumacher, R. F. (2015). Cognitive and mathematical profiles for different forms of learning difficulties. J. Learn. Disabil. 48, 156–175. doi: 10.1177/0022219413494239
Cohen, J. (1960). A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20, 37–46. doi: 10.1177/001316446002000104
Cohen, J. (1969). Statistical Power Analysis for the Behavioral Sciences. Cambridge, MA: Academic Press.
Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences, 2nd Edn. New York, NY: Lawrence Erlbaum Associates, doi: 10.4324/9780203771587
Compton, D. L., Fuchs, L. S., Fuchs, D., Lambert, W., and Hamlett, C. (2012). The cognitive and academic profiles of reading and mathematics learning disabilities. J. Learn. Disabil. 45, 79–95. doi: 10.1177/0022219410393012
Conway, A. R. A., Cowan, N., Bunting, M. F., Therriault, D. J., and Minkoff, S. R. B. (2002). A latent variable analysis of working memory capacity, short-term memory capacity, processing speed, and general fluid intelligence. Intelligence 30, 163–183. doi: 10.1016/S0160-2896(01)00096-4
Cowan, R., and Powell, D. (2014). The contributions of domain-general and numerical factors to third-grade arithmetic skills and mathematical learning disability. J. Educ. Psychol. 106, 214–229. doi: 10.1037/a0034097
David, C. V. (2012). Working memory deficits in math learning difficulties: a meta-analysis. Int. J. Dev. Disabil. 58, 67–84. doi: 10.1179/2047387711Y.0000000007
de Smedt, B., and Boets, B. (2010). Phonological processing and arithmetic fact retrieval: evidence from developmental dyslexia. Neuropsychologia 48, 3973–3981. doi: 10.1016/j.neuropsychologia.2010.10.018
de Smedt, B., Noël, M.-P., Gilmore, C., and Ansari, D. (2013). How do symbolic and non-symbolic numerical magnitude processing skills relate to individual differences in children’s mathematical skills? A review of evidence from brain and behavior. Trends Neurosci. Educ. 2, 48–55. doi: 10.1016/j.tine.2013.06.001
de Smedt, B., Swillen, A., Verschaffel, L., and Ghesquiere, P. (2009). Mathematical learning disabilities in children with 22q11.2 deletion syndrome: a review. Dev. Disabil. Res. Rev. 15, 4–10. doi: 10.1002/ddrr.44
de Visscher, A., Noël, M.-P., Pesenti, M., and Dormal, V. (2018). Developmental dyscalculia in adults: beyond numerical magnitude impairment. J. Learn. Disabil. 51, 600–611. doi: 10.1177/0022219417732338
de Weerdt, F., Desoete, A., and Roeyers, H. (2013a). Behavioral inhibition in children with learning disabilities. Res. Dev. Disabil. 34, 1998–2007. doi: 10.1016/j.ridd.2013.02.020
de Weerdt, F., Desoete, A., and Roeyers, H. (2013b). Working memory in children with reading disabilities and/or mathematical disabilities. J. Learn. Disabil. 46, 461–472. doi: 10.1177/0022219412455238
Desoete, A., Ceulemans, A., de Weerdt, F., and Pieters, S. (2012). Can we predict mathematical learning disabilities from symbolic and non-symbolic comparison tasks in kindergarten? Findings from a longitudinal study. Br. J. Educ. Psychol. 82, 64–81. doi: 10.1348/2044-8279.002002
Desoete, A., and de Weerdt, F. (2013). Can executive functions help to understand children with mathematical learning disorders and to improve Instruction? Learn. Disabil. 11, 27–39.
Devine, A., Hill, F., Carey, E., and Szûcs, D. (2018). Cognitive and emotional math problems largely dissociate: prevalence of developmental dyscalculia and mathematics anxiety. J. Educ. Psychol. 110, 431–444. doi: 10.1037/edu0000222
Dirks, E., Spyer, G., van Lieshout, E. C. D. M., and de Sonneville, L. (2008). Prevalence of combined reading and arithmetic disabilities. J. Learn. Disabil. 41, 460–473. doi: 10.1177/0022219408321128
Dixon, W. J. (1960). Simplified estimation from censored normal samples. Ann. Math. Stat. 31, 385–391. doi: 10.1214/aoms/1177705900
Doleman, B., Freeman, S. C., Lund, J. N., Williams, J. P., and Sutton, A. J. (2020). Funnel plots may show asymmetry in the absence of publication bias with continuous outcomes dependent on baseline risk: presentation of a new publication bias test. Res. Synth. Methods 11, 522–534. doi: 10.1002/jrsm.1414
Downes, M. J., Brennan, M. L., Williams, H. C., and Dean, R. S. (2016). Development of a critical appraisal tool to assess the quality of cross-sectional studies (AXIS). BMJ Open 6:e011458. doi: 10.1136/bmjopen-2016-011458
Egger, M., Smith, G. D., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. Br. Med. J. 315, 629–634. doi: 10.1136/bmj.315.7109.629
Endlich, D., Dummert, F., Schneider, W., and Schwenck, C. (2014). Behavior problems in children with isolated and combined learning deficiencies. Kindheit Und Entwicklung 23, 61–69. doi: 10.1026/0942-5403/a000128
Evans, M. (2008). Underlying Cognitive Processes in Reading, Math, and Comorbid Reading and Math Learning Disabilities. Available online at: http://hdl.handle.net/10012/3535 (accessed February 15, 2021).
Fazio, L. K., Bailey, D. H., Thompson, C. A., and Siegler, R. S. (2014). Relations of different types of numerical magnitude representations to each other and to mathematics achievement. J. Exp. Child Psychol. 123, 53–72. doi: 10.1016/j.jecp.2014.01.013
Fias, W., Menon, V., and Szûcs, D. (2013). Multiple components of developmental dyscalculia. Trends Neurosci. Educ. 2, 43–47. doi: 10.1016/j.tine.2013.06.006
Fischbach, A., Schuchardt, K., Brandenburg, J., Klesczewski, J., Balke-Melcher, C., Schmidt, C., et al. (2013). Prävalenz von lernschwächen und lernstörungen: zur bedeutung der diagnosekriterien [prevalence of poor learners and children with learning disorders: investigating the role of diagnostic criteria]. Lernen Und Lernstörungen 2, 65–76. doi: 10.1024/2235-0977/a000035
Fisher, Z., Tipton, E., and Zhipeng, H. (2017). Robumeta: Robust Variance Meta-Regression. (R Package Version 2.0).
Floyd, R. G., Evans, J. J., and McGrew, K. S. (2003). Relations between measures of Cattell-Horn-Carroll (CHC) cognitive abilities and mathematics achievement across the school-age years. Psychol. Sch. 40, 155–171. doi: 10.1002/pits.10083
Fortes, I. S., Paula, C. S., Oliveira, M. C., Bordin, I. A., de Jesus Mari, J., and Rohde, L. A. (2016). A cross-sectional study to assess the prevalence of DSM-5 specific learning disorders in representative school samples from the second to sixth grade in Brazil. Eur. Child Adolesc. Psychiatry 25, 195–207. doi: 10.1007/s00787-015-0708-2
Geary, D. C. (2004). Mathematics and learning disabilities. J. Learn. Disabil. 37, 4–15. doi: 10.1177/00222194040370010201
Geary, D. C. (2010). Mathematical disabilities: reflections on cognitive, neuropsychological, and genetic components. Learn. Individ. Differ. 20, 130–133. doi: 10.1016/j.lindif.2009.10.008
Geary, D. C. (2011). Consequences, characteristics, and causes of mathematical learning disabilities and persistent low achievement in mathematics. J. Dev. Behav. Pediatr. 32, 250–263. doi: 10.1097/DBP.0b013e318209edef
Geary, D. C., Hamson, C. O., and Hoard, M. K. (2000). Numerical and arithmetical cognition: a longitudinal study of process and concept deficits in children with learning disability. J. Exp. Child Psychol. 77, 236–263. doi: 10.1006/jecp.2000.2561
Geary, D. C., Hoard, M. K., Nugent, L., and Bailey, D. H. (2012). Mathematical cognition deficits in children with learning disabilities and persistent low achievement: a five-year prospective study. J. Educ. Psychol. 104, 206–223. doi: 10.1037/a0025398
Geary, D. C., Nicholas, A., Li, Y., and Sun, J. (2017). Developmental change in the influence of domain-general abilities and domain-specific knowledge on mathematics achievement: an eight-year longitudinal study. J. Educ. Psychol. 109, 680–693. doi: 10.1037/edu0000159
Gersten, R., Jordan, N. C., and Flojo, J. R. (2005). Early identification and interventions for students with mathematics difficulties. J. Learn. Disabil. 38, 293–304. doi: 10.1177/00222194050380040301
Gold, A. B., Ewing-Cobbs, L., Cirino, P. T., Fuchs, L. S., Stuebing, K. K., and Fletcher, J. M. (2013). Cognitive and behavioral attention in children with math difficulties. Child Neuropsychol. 19, 420–437. doi: 10.1080/09297049.2012.690371
Gross-Tsur, V., Manor, O., and Shalev, R. S. (1996). Developmental dyscalculia: prevalence and demographic features. Dev. Med. Child Neurol. 38, 25–33. doi: 10.1111/j.1469-8749.1996.tb15029.x
Hallgren, K. A. (2012). Computing inter-rater reliability for observational data: an overview and tutorial. Tutor. Quant. Methods Psychol. 8, 23–34. doi: 10.20982/tqmp.08.1.p023
Hanich, L. B., Jordan, N. C., Kaplan, D., and Dick, J. (2001). Performance across different areas of mathematical cognition in children with learning difficulties. J. Educ. Psychol. 93, 615–626. doi: 10.1037/0022-0663.93.3.615
Hedges, L. V. (1981). Distribution theory for glass’s estimator of effect size and related estimators. J. Educ. Stat. 6, 107–128. doi: 10.2307/1164588
Hedges, L. V., Tipton, E., and Johnson, M. C. (2010). Robust variance estimation in meta-regression with dependent effect size estimates. Res. Synth. Methods 1, 39–65. doi: 10.1002/jrsm.5
Hegarty, M., and Waller, D. A. (2005). “Individual differences in spatial abilities,” in The Cambridge Handbook of Visuospatial Thinking, eds A. Miyake and P. Shah (Cambridge, MA: Cambridge University Press), 121–169. doi: 10.1017/CBO9780511610448.005
Heine, A., Wissmann, J., Tamm, S., De Smedt, B., Schneider, M., Stern, E., et al. (2013). An electrophysiological investigation of non-symbolic magnitude processing: numerical distance effects in children with and without mathematical learning disabilities. Cortex 49, 2162–2177. doi: 10.1016/j.cortex.2012.11.009
Henik, A., Rubinsten, O., and Ashkenazi, S. (2011). The ‘where’ and ‘what’ in developmental dyscalculia. Clin. Neuropsychol. 25, 989–1008. doi: 10.1080/13854046.2011.599820
Higgins, J. P. T., and Green, S. (eds) (2011). Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0 [Updated March 2011]. Available online at: http://handbook.cochrane.org (accessed July 5, 2021).
Higgins, J. P. T., and Thompson, S. G. (2002). Quantifying heterogeneity in a meta-analysis. Stat. Med. 21, 1539–1558. doi: 10.1002/sim.1186
Hirsch, S., Lambert, K., Coppens, K., and Moeller, K. (2018). Basic numerical competences in large-scale assessment data: structure and long-term relevance. J. Exp. Child Psychol. 167, 32–48. doi: 10.1016/j.jecp.2017.09.015
Huber, S., Sury, D., Moeller, K., Rubinsten, O., and Nuerk, H. C. (2015). A general number-to-space mapping deficit in developmental dyscalculia. Res. Dev. Disabil. 4, 32–42. doi: 10.1016/j.ridd.2015.06.003
Huijsmans, M. D. E., Kleemans, T., van der Ven, S. H. G., and Kroesbergen, E. H. (2020). The relevance of subtyping children with mathematical learning disabilities. Res. Dev. Disabil. 104:103704. doi: 10.1016/j.ridd.2020.103704
Iglesias-Sarmiento, V., and Deaño, M. (2016). Arithmetical difficulties and low arithmetic achievement: analysis of the underlying cognitive functioning. Span. J. Psychol. 19:e36. doi: 10.1017/sjp.2016.40
IntHout, J., Ioannidis, J. P. A., Borm, G. F., and Goeman, J. J. (2015). Small studies are more heterogeneous than large ones: a meta-meta-analysis. J. Clin. Epidemiol. 68, 860–869. doi: 10.1016/j.jclinepi.2015.03.017
Jackson, D., Riley, R., and White, I. R. (2011). Multivariate meta-analysis: potential and promise. Stat. Med. 30, 2481–2498. doi: 10.1002/sim.4172
Jaekel, J., and Wolke, D. (2014). Preterm birth and dyscalculia. J. Pediatr. 164, 1327–1332. doi: 10.1016/j.jpeds.2014.01.069
Johnson, E. S., Humphrey, M., Mellard, D. F., Woods, K., and Swanson, H. L. (2010). Cognitive processing deficits and students with specific learning disabilities: a selective meta-analysis of the literature. Learn. Disabil. Q. 33, 3–18. doi: 10.1177/073194871003300101
Jordan, N. C., and Hanich, L. B. (2003). Characteristics of children with moderate mathematics deficiencies: a longitudinal perspective. Learn. Disabil. Res. Practice 18, 213–221. doi: 10.1111/1540-5826.00076
Jordan, N. C., Kaplan, D., Ramineni, C., and Locuniak, M. N. (2009). Early math matters: kindergarten number competence and later mathematics outcomes. Dev. Psychol. 45, 850–867. doi: 10.1037/a0014939
Karagiannakis, G. N., Baccaglini-Frank, A. E., and Papadatos, Y. (2014). Mathematical learning difficulties subtypes classification. Front. Hum. Neurosci. 8:57. doi: 10.3389/fnhum.2014.00057
Karakonstantaki, E. S., Simos, P. G., Michalis, V., and Micheloyannis, S. (2018). Assessment and conceptual remediation of basic calculation skills in elementary school students. Br. J. Dev. Psychol. 36, 78–97. doi: 10.1111/bjdp.12214
Kaufmann, L., Handl, P., and Thöny, B. (2003). Evaluation of a numeracy intervention program focusing on basic numerical knowledge and conceptual knowledge: a pilot study. J. Learn. Disabil. 36, 564–573. doi: 10.1177/00222194030360060701
Kaufmann, L., Mazzocco, M. M., Dowker, A., von Aster, M., Göbel, S. M., Grabner, R. H., et al. (2013). Dyscalculia from a developmental and differential perspective. Front. Psychol. 4:516. doi: 10.3389/fpsyg.2013.00516
Kaufmann, L., and von Aster, M. (2012). The diagnosis and management of dyscalculia. Dtsch. Arztebl. Int. 109, 767–778. doi: 10.3238/arztebl.2012.0767
Klein, E., Bahnmueller, J., Mann, A., Pixner, S., Kaufmann, L., Nuerk, H. C., et al. (2013). Language influences on numerical development-Inversion effects on multi-digit number processing. Front. Psychol. 4:480. doi: 10.3389/fpsyg.2013.00480
Kohn, J., Wyschkon, A., Ballaschk, K., Ihle, W., and Esser, G. (2013). Verlauf von umschriebenen entwicklungsstörungen: eine 30-monats-follow-up-studie [long-term course of dyslexia, dyscalculia, and expressive language disorder: a 30-month follow-up study]. Lernen Und Lernstörungen 2, 77–89. doi: 10.1024/2235-0977/a000032
Kolkman, M. E., Kroesbergen, E. H., and Leseman, P. P. M. (2013). Early numerical development and the role of nonsymbolic and symbolic skills. Learn. Instruction 25, 95–103.
Koponen, T., Aro, M., Poikkeus, A. M., Niemi, P., Lerkkanen, M. K., Ahonen, T., et al. (2018). Comorbid fluency difficulties in reading and math: longitudinal stability across early grades. Except. Child. 84, 298–311. doi: 10.1177/0014402918756269
Koponen, T., Eklund, K., Heikkila, R., Salminen, J., Fuchs, L., Fuchs, D., et al. (2020). Cognitive correlates of the covariance in reading and arithmetic fluency: importance of serial retrieval fluency. Child Dev. 91, 1063–1080. doi: 10.1111/cdev.13287
Krajewski, K., and Schneider, W. (2009). Early development of quantity to number-word linkage as a precursor of mathematical school achievement and mathematical difficulties: findings from a four-year longitudinal study. Learn. Instruction 19, 513–526. doi: 10.1016/j.learninstruc.2008.10.002
Kroesbergen, E. H., and van Dijk, M. (2015). Working memory and number sense as predictors of mathematical (dis-)ability. Z. Psychol. 223, 102–109. doi: 10.1027/2151-2604/a000208
Kucian, K. (2005). Development of Cerebral Representations for Numbers in Normally Achieving Children and Children With Developmental Dyscalculia ProQuest Dissertations & Theses Global. (UMI No. 305368810). Yellow Springs, OH: Antioch University.
Kucian, K. (2016). “Developmental dyscalculia and the brain,” in Development of Mathematical Cognition: Neural Substrates and Genetic Influences, Vol. 2, eds D. B. Berch, D. C. Geary, and K. Mann Koepke (Cambridge, MA: Elsevier Academic Press), 165–193. doi: 10.1016/B978-0-12-801871-2.00007-1
Kucian, K., McCaskey, U., von Aster, M., and O’Gorman Tuura, R. (2018). Development of a possible general magnitude system for number and space. Front. Psychol. 9:2221. doi: 10.3389/fpsyg.2018.02221
Kuhn, J.-T., Ise, E., Raddatz, J., Schwenk, C., and Dobel, C. (2016). Basic numerical processing, calculation, and working memory in children with dyscalculia and/or ADHD symptoms. Z. Kinder Jugendpsychiatr. Psychother. 44, 365–375. doi: 10.1024/1422-4917/a000450
Lachaud, C. M., and Renaud, O. (2011). A tutorial for analyzing human reaction times: how to filter data, manage missing values, and choose a statistical model. Appl. Psycholinguist. 32, 389–416. doi: 10.1017/s0142716410000457
Landerl, K. (2013). Development of numerical processing in children with typical and dyscalculic arithmetic skills - a longitudinal study. Front. Psychol. 4:459. doi: 10.3389/fpsyg.2013.00459
Landerl, K., Bevan, A., and Butterworth, B. (2004). Developmental dyscalculia and basic numerical capacities: a study of 8?9-year-old students. Cognition 93, 99–125. doi: 10.1016/j.cognition.2003.11.004
Landerl, K., Fussenegger, B., Moll, K., and Willburger, E. (2009). Dyslexia and dyscalculia: two learning disorders with different cognitive profiles. J. Exp. Child Psychol. 103, 309–324. doi: 10.1016/j.jecp.2009.03.006
Landerl, K., and Moll, K. (2010). Comorbidity of learning disorders: prevalence and familial transmission. J. Child Psychol. Psychiatry 51, 287–294. doi: 10.1111/j.1469-7610.2009.02164.x
Lehnert, G., and Zimmer, H. D. (2006). Auditory and visual spatial working memory. Mem. Cogn. 34, 1080–1090. doi: 10.3758/BF03193254
Lewis, K. E., and Fisher, M. B. (2016). Taking stock of 40 years of research on mathematical learning disability: methodological issues and future directions. J. Res. Math. Educ. 47, 338–371. doi: 10.5951/jresematheduc.47.4.0338
Lin, L. F. (2018). Bias caused by sampling error in meta-analysis with small sample sizes. PLoS One 13:e0204056. doi: 10.1371/journal.pone.0204056
Lindquist, M., Philpot, R., Mullis, I. V. S., and Cotter, K. E. (2017). “TIMSS 2019 mathematics framework,” in TIMSS 2019 Assessment Frameworks, eds I. V. S. Mullis and M. O. Martin (Amsterdam: International Association for the Evaluation of Educational Achievement), 11–26.
Lo, S., and Andrews, S. (2015). To transform or not to transform: using generalized linear mixed models to analyse reaction time data. Front. Psychol. 6:1171. doi: 10.3389/fpsyg.2015.01171
Lourenco, S. F., Ayzenberg, V., and Lyu, J. (2016). A general magnitude system in human adults: evidence from a subliminal priming paradigm. Cortex 81, 93–103. doi: 10.1016/j.cortex.2016.04.013
Lourenco, S. F., and Bonny, J. W. (2017). Representations of numerical and non-numerical magnitude both contribute to mathematical competence in children. Dev. Sci. 20:e12418. doi: 10.1111/desc.12418
Mähler, C., and Schuchardt, K. (2016). Working memory in children with specific learning disorders and/or attention deficits. Learn. Individ. Differ. 49, 341–347. doi: 10.1016/j.lindif.2016.05.007
Mammarella, I. C., Caviola, S., Cornoldi, C., and Lucangeli, D. (2013). Mental additions and verbal-domain interference in children with developmental dyscalculia. Res. Dev. Disabil. 34, 2845–2855. doi: 10.1016/j.ridd.2013.05.044
Mammarella, I. C., Caviola, S., Giofrè, D., and Szûcs, D. (2018). The underlying structure of visuospatial working memory in children with mathematical learning disability. Br. J. Dev. Psychol. 36, 220–235. doi: 10.1111/bjdp.12202
Marmolejo-Ramos, F., Cousineau, D., Benites, L., and Maehara, R. (2015). On the efficacy of procedures to normalize Ex-Gaussian distributions. Front. Psychol. 5:1548. doi: 10.3389/fpsyg.2014.01548
McCaskey, U., von Aster, M., Maurer, U., Martin, E., O’Gorman Tuura, R., and Kucian, K. (2018). Longitudinal brain development of numerical skills in typically developing children and children with developmental dyscalculia. Front. Hum. Neurosci. 11:629. doi: 10.3389/fnhum.2017.00629
McCaskey, U., von Aster, M., O’Gorman Tuura, R., and Kucian, K. (2017). Adolescents with developmental dyscalculia do not have a generalized magnitude deficit – processing of discrete and continuous magnitudes. Front. Hum. Neurosci. 11:102. doi: 10.3389/fnhum.2017.00102
Mejias, S., Mussolin, C., Rousselle, L., Gregoire, J., and Noël, M. P. (2012). Numerical and nonnumerical estimation in children with and without mathematical learning disabilities. Child Neuropsychol. 18, 550–575. doi: 10.1080/09297049.2011.625355
Menon, V. (2016). Working memory in children’s math learning and its disruption in dyscalculia. Curr. Opin. Behav. Sci. 10, 125–132. doi: 10.1016/j.cobeha.2016.05.014
Michalczyk, K., Krajewski, K., Preβler, A.-L., and Hasselhorn, M. (2013). The relationships between quantity-number competencies, working memory, and phonological awareness in 5- and 6-year-olds. Br. J. Dev. Psychol. 31, 408–424. doi: 10.1111/bjdp.12016
Miles, J., and Stelmack, R. M. (1994). Learning disability subtypes and the effects of auditory and visual priming on visual event-related potentials to words. J. Clin. Exp. Neuropsychol. 16, 43–64. doi: 10.1080/01688639408402616
Miyake, A., Friedman, N. P., Emerson, M. J., Witzki, A. H., and Howerter, A. (2000). The unity and diversity of executive functions and their contributions to complex ‘frontal lobe’ tasks: a latent variable analysis. Cogn. Psychol. 41, 49–100. doi: 10.1006/cogp.1999.0734
Moeller, K., Zuber, J., Olsen, N., Nuerk, H.-C., and Willmes, K. (2015). Intransparent German number words complicate transcoding - a translingual comparison with Japanese. Front. Psychol. 6:740. doi: 10.3389/fpsyg.2015.00740
Moeyaert, M., Ugille, M., Natasha Beretvas, S., Ferron, J., Bunuan, R., and van den Noortgate, W. (2017). Methods for dealing with multiple outcomes in meta-analysis: a comparison between averaging effect sizes, robust variance estimation and multilevel meta-analysis. Int. J. Soc. Res. Methodol. 20, 559–572. doi: 10.1080/13645579.2016.1252189
Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G., and The Prisma Group. (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 6:e1000097. doi: 10.1371/journal.pmed.1000097
Moll, K., Göbel, S. M., and Snowling, M. J. (2015). Basic number processing in children with specific learning disorders: comorbidity of reading and mathematics disorders. Child Neuropsychol. 21, 399–417. doi: 10.1080/09297049.2014.899570
Moll, K., Kunze, S., Neuhoff, N., Bruder, J., and Schulte-Körne, G. (2014). Specific learning disorder: prevalence and gender differences. PLoS One 9:e103537. doi: 10.1371/journal.pone.0103537
Moll, K., Landerl, K., Snowling, M. J., and Schulte-Körne, G. (2019). Understanding comorbidity of learning disorders: task-dependent estimates of prevalence. J. Child Psychol. Psychiatry 60, 286–294. doi: 10.1111/jcpp.12965
Möller, K., Fischer, U., Cress, U., and Nuerk, H.-C. (2012). “Diagnostics and intervention in developmental dyscalculia: current issues and novel perspectives,” in Reading, Writing, Mathematics and the Developing Brain: Listening to Many Voices, eds Z. Breznitz, O. Rubinsten, V. J. Molfese, and D. L. Molfese (Berlin: Springer), 233–275. doi: 10.1007/978-94-007-4086-0_14
Morgan, P. L., Farkas, G., and Wu, Q. (2009). Five-year growth trajectories of kindergarten children with learning difficulities in mathematics. J. Learn. Disabil. 42, 306–321. doi: 10.1177/0022219408331037
Morsanyi, K., Devine, A., Nobes, A., and Szûcs, D. (2013). The link between logic, mathematics and imagination: evidence from children with developmental dyscalculia and mathematically gifted children. Dev. Sci. 16, 542–553. doi: 10.1111/desc.12048
Morsanyi, K., van Bers, B. M. C. W., O’Connor, P. A., and McCormack, T. (2018). Developmental dyscalculia is characterized by order processing deficits: evidence from numerical and non-numerical ordering tasks. Dev. Neuropsychol. 43, 595–621. doi: 10.1080/87565641.2018.1502294
Moulin, T. C., and Amaral, O. B. (2020). Using collaboration networks to identify authorship dependence in meta-analysis results Res. Synth. Methods 11, 655–668. doi: 10.1002/jrsm.1430
Moura, O., Simões, M. R., and Pereira, M. (2014). Executive functioning in children with developmental dyslexia. Clin. Neuropsychol. 28, S20–S41. doi: 10.1080/13854046.2014.964326
Moura, R., Lopes-Silva, J. B., Vieira, L. R., Paiva, G. M., de Almeida Prado, A. C., Wood, G., et al. (2015). From ‘five’ to 5 for 5 minutes: arabic number transcoding as a short, specific, and sensitive screening tool for mathematics learning difficulties. Arch. Clin. Neuropsychol. 30, 88–98. doi: 10.1093/arclin/acu071
Moura, R., Wood, G., Pinheiro-Chagas, P., Lonnemann, J., Krinzinger, H., Willmes, K., et al. (2013). Transcoding abilities in typical and atypical mathematics achievers: the role of working memory and procedural and lexical competencies. J. Exp. Child Psychol. 116, 707–727. doi: 10.1016/j.jecp.2013.07.008
Murphy, M. M. (2009). A review of mathematical disabilities in children with fragile X snydrome. Dev. Disabil. Res. Rev. 15, 21–27. doi: 10.1002/ddrr.49
Murphy, M. M., Mazzocco, M. M. M., Hanich, L. B., and Early, M. C. (2007). Cognitive characteristics of children with mathematics learning disability (MLD) vary as a function of the cutoff criterion used to define MLD. J. Learn. Disabil. 40, 458–478. doi: 10.1177/00222194070400050901
Mussolin, C., de Volder, A., Grandin, C., Schlogel, X., Nassogne, M. C., and Noel, M. P. (2010a). Neural correlates of symbolic number comparison in developmental dyscalculia. J. Cogn. Neurosci. 22, 860–874. doi: 10.1162/jocn.2009.21237
Mussolin, C., Martin, R., and Schiltz, C. (2011). Relationships between number and space processing in adults with and without dyscalculia. Acta Psychol. 138, 193–203. doi: 10.1016/j.actpsy.2011.06.004
Mussolin, C., Mejias, S., and Noël, M. P. (2010b). Symbolic and nonsymbolic number comparison in children with and without dyscalculia. Cognition 115, 10–25. doi: 10.1016/j.cognition.2009.10.006
Nelson, G., and Powell, S. R. (2018). A systematic review of longitudinal studies of mathematics difficulty. J. Learn. Disabil. 51, 523–539. doi: 10.1177/0022219417714773
Niileksela, C. R., and Reynolds, M. R. (2014). Global, broad, or specific cognitive differences? Using a MIMIC model to examine differences in CHC abilities in children with learning disabilities. J. Learn. Disabil. 47, 224–236. doi: 10.1177/0022219412453172
Orraca-Castillo, M., Estevez-Perez, N., and Reigosa-Crespo, V. (2014). Neurocognitive profiles of learning disabled children with neurofibromatosis type 1. Front. Hum. Neurosci. 8:386. doi: 10.3389/fnhum.2014.00386
Oxman, A. D., and Guyatt, G. H. (1992). A consumer’s guide to subgroup analyses. Ann. Intern. Med. 116, 78–84. doi: 10.7326/0003-4819-116-1-78
Peng, P., Congying, S., Beilei, L., and Sha, T. (2012). Phonological storage and executive function deficits in children with mathematics difficulties. J. Exp. Child Psychol. 112, 452–466. doi: 10.1016/j.jecp.2012.04.004
Peng, P., Cuicui, W., and Jessica, N. (2018). Understanding the cognition related to mathematics difficulties: a meta-analysis on the cognitive deficit profiles and the bottleneck theory. Rev. Educ. Res. 88, 434–476. doi: 10.3102/0034654317753350
Peng, P., and Fuchs, D. (2014). A meta-analysis of working memory deficits in children with learning difficulties: is there a aifference between verbal domain and numerical domain? J. Learn. Disabil. 49, 3–20. doi: 10.1177/0022219414521667
Peng, P., Namkung, J., Barnes, M., and Sun, C. (2016). A meta-analysis of mathematics and working memory: moderating effects of working memory domain, type of mathematics skill, and sample characteristics. J. Educ. Psychol. 108, 455–473. doi: 10.1037/edu0000079
Peng, P., Tao, S., and Li, B. L. (2013). The deficit profile of working memory, inhibition, and updating in Chinese children with reading difficulties. Learn. Individ. Differ. 25, 111–117. doi: 10.1016/j.lindif.2013.01.012
Peng, P., Wang, C., Tao, S., and Sun, C. (2017). The deficit profiles of Chinese children with reading difficulties: a meta-analysis. Educ. Psychol. Rev. 29, 513–564. doi: 10.1007/s10648-016-9366-2
Peters, L., and de Smedt, B. (2018). Arithmetic in the developing brain: a review of brain imaging studies. Dev. Cogn. Neurosci. 30, 265–279. doi: 10.1016/j.dcn.2017.05.002
Pham, A. V., and Riviere, A. (2015). Specific learning disorders and ADHD: current issues in diagnosis across clinical and educational settings [Article]. Curr. Psychiatry Rep. 17:38. doi: 10.1007/s11920-015-0584-y
Pigott, T. D., Valentine, J. C., Polanin, J. R., Williams, R. T., and Canada, D. D. (2013). Outcome-reporting bias in education research. Educ. Res. 42, 424–432. doi: 10.3102/0013189x13507104
Pollack, C., and Ashby, N. C. (2018). Where arithmetic and phonology meet: the meta-analytic convergence of arithmetic and phonological processing in the brain. Dev. Cogn. Neurosci. 30, 251–264. doi: 10.1016/j.dcn.2017.05.003
Proctor, B. (2012). Relationships between Cattell–Horn–Carroll (CHC) cognitive abilities and math achievement within a sample of college students with learning disabilities. J. Learn. Disabil. 45, 278–287. doi: 10.1177/0022219410392049
R Core Team (2020). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Available online at: https://www.R-project.org/ (accessed May 19, 2021).
Raddatz, J., Kuhn, J.-T., Holling, H., Moll, K., and Dobel, C. (2017). Comorbidity of arithmetic and reading disorder: basic number processing and calculation in children with learning impairments. J. Learn. Disabil. 50, 298–308. doi: 10.1177/0022219415620899
Raghubar, K. P., Barnes, M. A., and Hecht, S. A. (2010). Working memory and mathematics: a review of developmental, individual difference, and cognitive approaches. Learn. Individ. Differ. 20, 110–122. doi: 10.1016/j.lindif.2009.10.005
Raudenbush, S. W. (2009). “Analyzing effect sizes: random-effects models,” in The Handbook of Research Synthesis and Meta-Analysis, 2nd Edn, eds H. Cooper, L. V. Hedges, and J. C. Valentine (New York, NY: Russell Sage Foundation), 295–315.
Reikerås, E. K. L. (2006). Performance in solving arithmetic problems: a comparison of children with different levels of achievement in mathematics and reading. Eur. J. Spec. Needs Educ. 21, 233–250. doi: 10.1080/08856250600810633
Riley, R. D. (2009). Multivariate meta-analysis: the effect of ignoring within-study correlation [Article]. J. R. Stat. Soc. A 172, 789–811. doi: 10.1111/j.1467-985X.2008.00593.x
Ritchie, S. J., and Bates, T. C. (2013). Enduring links from childhood mathematics and reading achievement to adult socioeconomic status. Psychol. Sci. 24, 1301–1308. doi: 10.1177/0956797612466268
Rodgers, M. A., and Pustejovsky, J. E. (2020). Evaluating meta-analytic methods to detect selective reporting in the presence of dependent effect sizes. Psychol. Methods Online ahead of print., doi: 10.1037/met0000300,
Rotem, A., and Henik, A. (2013). The development of product parity sensitivity in children with mathematics learning disability and in typical achievers. Res. Dev. Disabil. 34, 831–839. doi: 10.1016/j.ridd.2012.11.001
Rousselle, L., and Noël, M. P. (2008). Mental arithmetic in children with mathematics learning disabilities: the adaptive use of approximate calculation in an addition verification task. J. Learn. Disabil. 41, 498–513. doi: 10.1177/0022219408315638
Salthouse, T. A. (2000). Aging and measures of processing speed. Biol. Psychol. 54, 35–54. doi: 10.1016/S0301-0511(00)00052-1
Sasanguie, D., Göbel, S. M., Moll, K., Smets, K., and Reynvoet, B. (2013). Approximate number sense, symbolic number processing, or number–space mappings: what underlies mathematics achievement? J. Exp. Child Psychol. 114, 418–431. doi: 10.1016/j.jecp.2012.10.012
Schleifer, P., and Landerl, K. (2011). Subitizing and counting in typical and atypical development. Dev. Sci. 14, 280–291. doi: 10.1111/j.1467-7687.2010.00976.x
Schneider, M., Beeres, K., Coban, L., Merz, S., Susan Schmidt, S., Stricker, J., et al. (2017). Associations of non-symbolic and symbolic numerical magnitude processing with mathematical competence: a meta-analysis. Dev. Sci. 20, 1–16. doi: 10.1111/desc.12372
Schneider, W. J., and McGrew, K. S. (2018). “The cattell-horn-carroll theory of cognitive abilities,” in Contemporary Intellectual Assessment: Theories, Tests and Issues, 4th Edn, eds D. P. Flanagan and E. M. McDonough (New York, NY: The Guilford Press), 73–163.
Schuchardt, K., and Mähler, C. (2010). Do subgroups of children with mathematical disabilities differ in their working memory, basic arithmetical knowledge, and numeric competences? Z. Entwicklungspsychol. Pädagog. Psychol. 42, 217–225. doi: 10.1026/0049-8637/a000022
Schuchardt, K., Mähler, C., and Hasselhorn, M. (2008). Working memory deficits in children with specific learning disorders. J. Learn. Disabil. 41, 514–523. doi: 10.1177/0022219408317856
Schwenk, C., Sasanguie, D., Kuhn, J.-T., Kempe, S., Doebler, P., and Holling, H. (2017). (Non-)symbolic magnitude processing in children with mathematical difficulties: a meta-analysis. Res. Dev. Disabil. 64, 152–167. doi: 10.1016/j.ridd.2017.03.003
Shalev, R. S. (2007). “Prevalence of developmental dyscalculia,” in Why is Math so Hard for Some Children? The Nature and Origins of Mathematical Learning Difficulties and Disabilities, eds D. B. Berch and M. M. M. Mazzocco (Baltimore, MD: Paul H Brookes Publishing), 49–60.
Shalev, R. S., Manor, O., and Gross-Tsur, V. (2005). Developmental dyscalculia: a prospective six-year follow-up. Dev. Med. Child Neurol. 47, 121–125. doi: 10.1017/S0012162205000216
Shin, M., and Bryant, D. P. (2015). A synthesis of mathematical and cognitive performances of students with mathematics learning disabilities. J. Learn. Disabil. 48, 96–112. doi: 10.1177/0022219413508324
Silver, A. M., Elliott, L., Imbeah, A., and Libertus, M. E. (2020). Understanding the unique contributions of home numeracy, inhibitory control, the approximate number system, and spontaneous focusing on number for children’s math abilities. Math. Think. Learn. 22, 296–311. doi: 10.1080/10986065.2020.1818469
Simanowski, S., and Krajewski, K. (2019). Specific preschool executive functions predict unique aspects of mathematics development: a 3-year longitudinal study. Child Dev. 90, 544–561. doi: 10.1111/cdev.12909
Skagerlund, K., Karlsson, T., and Träff, U. (2016). Magnitude processing in the brain: an fMRI study of time, space, and numerosity as a shared cortical system. Front. Hum. Neurosci. 10:500. doi: 10.3389/fnhum.2016.00500
Skagerlund, K., and Träff, U. (2014). Development of magnitude processing in children with developmental dyscalculia: space, time, and number. Front. Psychol. 5:675. doi: 10.3389/fpsyg.2014.00675
Speelman, C. P., and McGann, M. (2013). How mean is the mean? Front. Psychol. 4:451. doi: 10.3389/fpsyg.2013.00451
Sterne, J. A. C., and Egger, M. (2001). Funnel plots for detecting bias in meta-analysis: guidelines on choice of axis. J. Clin. Epidemiol. 54, 1046–1055. doi: 10.1016/S0895-4356(01)00377-8
Stock, P., Desoete, A., and Roeyers, H. (2010). Detecting children with arithmetic disabilities from kindergarten: evidence from a 3-year longitudinal study on the role of preparatory arithmetic abilities. J. Learn. Disabil. 43, 250–268. doi: 10.1177/0022219409345011
Swanson, H. L. (2012). Cognitive profile of adolescents with math disabilities: are the profiles different from those with reading disabilities? Child Neuropsychol. 18, 125–143. doi: 10.1080/09297049.2011.589377
Swanson, H. L., and Jerman, O. (2006). Math disabilities: a selective meta-analysis of the literature. Rev. Educ. Res. 76, 249–274. doi: 10.3102/00346543076002249
Swanson, H. L., Jerman, O., and Zheng, X. (2009). Math disabilities and reading disabilities: can they be separated? J. Psychoeduc. Assess. 27, 175–196. doi: 10.1177/0734282908330578
Swanson, H. L., Olide, A. F., and Kong, J. E. (2018). Latent class analysis of children with math difficulties and/or math learning disabilities: are there cognitive differences? J. Educ. Psychol. 110, 931–951. doi: 10.1037/edu0000252
Szûcs, D., Devine, A., Soltesz, F., Nobes, A., and Gabriel, F. (2013). Developmental dyscalculia is related to visuo-spatial memory and inhibition impairment. Cortex 49, 2674–2688. doi: 10.1016/j.cortex.2013.06.007
Tanner-Smith, E. E., Tipton, E., and Polanin, J. R. (2016). Handling complex meta-analytic data structures using robust variance estimates: a tutorial in R. J. Dev. Life Course Criminol. 2, 85–112. doi: 10.1007/s40865-016-0026-5
Taub, G. E., Keith, T. Z., Floyd, R. G., and McGrew, K. S. (2008). Effects of general and broad cognitive abilities on mathematics achievement. Sch. Psychol. Q. 23, 187–198. doi: 10.1037/1045-3830.23.2.187
Taylor, H. G., Espy, K. A., and Anderson, P. J. (2009). Mathematics deficiencies in children with very low birth weight or very preterm birth. Dev. Disabil. Res. Rev. 15, 52–59. doi: 10.1002/ddrr.51
Tipton, E. (2015). Small sample adjustments for robust variance estimation with meta-regression. Psychol. Methods 20, 375–393. doi: 10.1037/met0000011
Tobia, V., Rinaldi, L., and Marzocchi, G. M. (2018). Time processing impairments in preschoolers at risk of developing difficulties in mathematics. Dev. Sci. 21:e12526. doi: 10.1111/desc.12526
Tolar, T. D., Fuchs, L., Fletcher, J. M., Fuchs, D., and Hamlett, C. L. (2016). Cognitive profiles of mathematical problem solving learning disability for different definitions of disability. J. Learn. Disabil. 49, 240–256. doi: 10.1177/0022219414538520
Träff, U., Olsson, L., Östergren, R., and Skagerlund, K. (2017). Heterogeneity of developmental dyscalculia: cases with different deficit profiles. Front. Psychol. 7:2000. doi: 10.3389/fpsyg.2016.02000
Träff, U., Olsson, L., Skagerlund, K., and Östergren, R. (2020). Kindergarten domain-specific and domain-general cognitive precursors of hierarchical mathematical development: a longitudinal study. J. Educ. Psychol. 112, 93–109. doi: 10.1037/edu0000369
Treiman, R., and Zukowski, A. (1996). Children’s sensitivity to syllables, onsets, rimes, and phonemes. J. Exp. Child Psychol. 61, 193–215. doi: 10.1006/jecp.1996.0014
Valentine, J. C., Pigott, T. D., and Rothstein, H. R. (2010). How many studies do you need? A primer on statistical power for meta-analysis. J. Educ. Behav. Stat. 35, 215–247. doi: 10.3102/1076998609346961
van der Sluis, S., de Jong, P. F., and van der Leij, A. (2004). Inhibition and shifting in children with learning deficits in arithmetic and reading. Journal of Experimental Child Psychology 87, 239–266. doi: 10.1016/j.jecp.2003.12.002
van der Sluis, S., de Jong, P. F., and van der Leij, A. (2005). Working memory in Dutch children with reading- and arithmetic-related LD. J. Learn. Disabil. 38, 207–221. doi: 10.1177/00222194050380030301
van der Ven, S. H. G., Klaiber, H. D., and van der Maas, H. L. J. (2017). Four and twenty blackbirds: how transcoding ability mediates the relationship between visuospatial working memory and math in a language with inversion. Educ. Psychol. 37, 487–505. doi: 10.1080/01443410.2016.1150421
van Iterson, L., de Jong, P. F., and Zijlstra, B. J. H. (2015). Pediatric epilepsy and comorbid reading disorders, math disorders, or autism spectrum disorders: impact of epilepsy on cognitive patterns. Epilepsy Behav. 44, 159–168. doi: 10.1016/j.yebeh.2015.02.007
van Rooijen, M., Verhoeven, L., and Steenbergen, B. (2015). From numeracy to arithmetic: precursors of arithmetic performance in children with cerebral palsy from 6 till 8 years of age. Res. Dev. Disabil. 4, 49–57. doi: 10.1016/j.ridd.2015.07.001
Vanbinst, K., and de Smedt, B. (2016). Individual differences in children’s mathematics achievement: the roles of symbolic numerical magnitude processing and domain-general cognitive functions. Prog. Brain Res. 227, 105–130. doi: 10.1016/bs.pbr.2016.04.001
Wagner, R. K., and Torgesen, J. K. (1987). The nature of phonological processing and its causal role in the acquisition of reading skills. Psychol. Bull. 101, 192–212. doi: 10.1037/0033-2909.101.2.192
Walker, E., Hernandez, A. V., and Kattan, M. W. (2008). Meta-analysis: its strengths and limitations. Cleve. Clin. J. Med. 75, 431–439. doi: 10.3949/ccjm.75.6.431
Wang, L.-C., Tasi, H.-J., and Yang, H.-M. (2012). Cognitive inhibition in students with and without dyslexia and dyscalculia. Res. Dev. Disabil. 33, 1453–1461. doi: 10.1016/j.ridd.2012.03.019
Watson, S. M. R., and Gable, R. A. (2013). Unraveling the complex nature of mathematics learning disability: implications for research and practice. Learn. Disabil. Q. 36, 178–187. doi: 10.1177/0731948712461489
Willburger, E., Fussenegger, B., Moll, K., Wood, G., and Landerl, K. (2008). Naming speed in dyslexia and dyscalculia. Learn. Individ. Differ. 18, 224–236. doi: 10.1016/j.lindif.2008.01.003
Willcutt, E. G., Petrill, S. A., Wu, S., Boada, R., DeFries, J. C., Olson, R. K., et al. (2013). Comorbidity between reading disability and math disability: concurrent psychopathology, functional impairment, and neuropsychological functioning. J. Learn. Disabil. 46, 500–516. doi: 10.1177/0022219413477476
Wimmer, H., Landerl, K., Linortner, R., and Hummer, P. (1991). The relationship of phonemic awareness to reading acquisition - more consequence than predcondition but still important. Cognition 40, 219–249. doi: 10.1016/0010-0277(91)90026-z
Wong, W. I. (2017). The space-math link in preschool boys and girls: importance of mental transformation, targeting accuracy, and spatial anxiety. Br. J. Dev. Psychol. 35, 249–266. doi: 10.1111/bjdp.12161
Wood, W., and Eagly, A. H. (2009). “Advantages of certainty and uncertainty,” in The Handbook of Research Synthesis and Meta-Analysis, 2nd Edn, eds H. Cooper, L. V. Hedges, and J. C. Valentine (New York, NY: Russell Sage Foundation), 455–472.
World Health Organization (2018). International Statistical Classification of Diseases and Related Health Problems (11th Revision). Available online at: https://icd.who.int/browse11/l-m/en. (accessed May 20, 2021).
Yip, E. S.-K., Wong, T. T.-Y., Cheung, S.-H., and Chan, K. K.-W. (2020). Do children with mathematics learning disability in Hong Kong perceive word problems differently? Learn. Instruction 68:101352. doi: 10.1016/j.learninstruc.2020.101352
Keywords: math difficulties, dyscalculia, math disabilities, domain-general abilities, domain-specific abilities, meta-analysis, systematic review
Citation: Haberstroh S and Schulte-Körne G (2022) The Cognitive Profile of Math Difficulties: A Meta-Analysis Based on Clinical Criteria. Front. Psychol. 13:842391. doi: 10.3389/fpsyg.2022.842391
Received: 23 December 2021; Accepted: 16 February 2022;
Published: 11 March 2022.
Edited by:
Evelyn Kroesbergen, Radboud University Nijmegen, NetherlandsReviewed by:
Pierluigi Zoccolotti, Sapienza University of Rome, ItalyIlona Friso-van den Bos, University of Twente, Netherlands
Copyright © 2022 Haberstroh and Schulte-Körne. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Gerd Schulte-Körne, gerd.schulte-koerne@med.uni-muenchen.de