- 1Department of Education and Social Work, University of Luxembourg, Esch-Belval, Luxembourg
- 2University of Teacher Education, Vaud, Switzerland
- 3EQUALE Research Unit, University of Liège, Liège, Belgium
- 4Laboratoire 2LPN, University of Lorraine, Nancy, France
The few studies that have analyzed the factorial structure of early number skills have mainly used confirmatory factor analysis (CFA) and have yielded inconsistent results, since early numeracy is considered to be unidimensional, multidimensional or even underpinned by a general factor. Recently, the bifactor exploratory structural equation modeling (bifactor-ESEM)—which has been proposed as a way to overcome the shortcomings of both the CFA and the exploratory structural equation modeling (ESEM)—proved to be valuable to account for the multidimensionality and the hierarchical nature of several psychological constructs. The present study is the first to investigate the dimensionality of early number skills measurement through the application of the bifactor-ESEM framework. Using data from 644 prekindergarten and kindergarten children (4 to 6 years old), several competing models were contrasted: the one-factor CFA model; the independent cluster model (ICM-CFA); the exploratory structural equation modeling (ESEM); and their bifactor counterpart (bifactor-CFA and bifactor-ESEM, respectively). Results indicated acceptable fit indexes for the one-factor CFA and the ICM-CFA models and excellent fit for the others. Among these, the bifactor-ESEM with one general factor and three specific factors (Counting, Relations, Arithmetic) not only showed the best model fit, but also the best coherent factor loadings structure and full measurement invariance across gender. The bifactor-ESEM appears relevant to help disentangle and account for general and specific factors of early numerical ability. While early numerical ability appears to be mainly underpinned by a general factor whose exact nature still has to be determined, this study highlights that specific latent dimensions with substantive value also exist. Identifying these specific facets is important in order to increase quality of early numerical ability measurement, predictive validity, and for practical implications.
Introduction
Early Numeracy and the Importance of Its Dimensionality
Early number skills refer to a set of elementary competencies comprising counting, number relations and basic arithmetic operations. These skills have repeatedly been shown to predict more complex mathematical skills as well as general school achievement (Clements and Sarama, 2007, 2011; Duncan et al., 2007; Jordan et al., 2009, 2010). For example, Watts et al. (2014) found that preschool mathematics ability was a significant predictor of math skills up to the age of 15 after accounting for cognitive skills and family background characteristics. It has even been highlighted that early number skills are more strongly related to later math achievement than are early literacy skills to later reading scores, and that early number skills are the strongest predictor of later general school success (Duncan et al., 2007).
Given their importance, early number skills have become the focus of growing interest during these past two decades. Many experts and national commissions have emphasized the necessity of focusing upon early mathematics education (Starkey et al., 2004; National Council of Teachers of Mathematics, 2006; Clements and Sarama, 2007; Ginsburg et al., 2008; National Mathematics Advisory Panel, 2008; National Research Council, 2009; Frye et al., 2013). Indeed, enhancing the early number skills of pupils at (pre)kindergarten can assist children entering with very varied number knowledge, and can help with the process of placing at-risk children onto an appropriate learning trajectory (Ginsburg et al., 2008; Jordan et al., 2010; Scalise et al., 2017). However, despite the growing attention they have received recently, early number skills continue to be studied far less than early reading skills (Mazzocco and Claessens, 2020).
In this context, one major question is to precisely understand the dimensionality of early number skills. Knowing how early numeracy is structured not only allows researchers to assess them more properly, but also to develop more appropriate early intervention strategies. In addition, an accurate identification of the early number skills structure is important to study their relationships with other variables. For example, Brunner (2008) showed that the understanding of the relationships between cognitive abilities and students' characteristics depends strongly on the structural conception of applied cognitive abilities. More precisely, he compared two models of mathematical ability and showed that the correlation between mathematical ability and students' socioeconomic status (SES) was considerably different depending on the model considered. He concluded that in terms of implications for policy makers, results might lead to a decision to invest in specific math interventions with low SES children or might motivate interventions targeting a general factor as well as more domain-specific dimensions (i.e., programs seeking to foster reasoning across education domains, specific mathematical abilities). Thus, without a comprehensive understanding of the early numeracy dimensionality, researchers continue to base their work on unvalidated conceptual frameworks, and risk using a misspecified measurement model which might lead researchers to misleading results and conclusions (Brunner, 2008; Dierendonck et al., 2019; Cimino et al., 2020; Zhang et al., 2020).
In the domain of early literacy, the identification of the set of competencies that support reading acquisition has yielded more effective literacy instruction (Juel and Minden-Cupp, 2000; National Early Literacy Panel, 2008). Regarding numeracy, it is still not clear to date whether specific early numeracy sub-skills exist or are only different means of assessing a general early numeracy construct (Purpura and Lonigan, 2013; Milburn et al., 2019). Consequently, a central question is whether early numeracy skills should be fostered and assessed in a broad perspective, as an overall competence assuming unidimensionality, or whether specific aspects of early numeracy should be considered, assuming multidimensionality.
Current Knowledge About Early Number Skills Structure
To date, the structure of early number skills has not been the focus of much attention. Interestingly, two of the most important official documents—reports by the National Research Council (2009) and the National Council of Teachers of Mathematics (2006)—do not assume the same number of numeracy sub-dimensions (“Number,” “Relations,” and “Operations” for the NRC and “Numbers” and “Operations” for the NCTM), suggesting a tripartite and a bipartite model of early numeracy, respectively. The few empirical studies that have analyzed the dimensionality of early number skills (see Appendix 1 for a detailed overview) have mainly investigated whether an a priori model was confirmed through confirmatory factor analysis (CFA), in particular, through the one-factor model (OF) for unidimensionality or the independent cluster model (ICM) for multidimensionality. As depicted in Figure 1, the one-factor model (OF) considers only one general ability or dimension. Individual differences are supposed to be due to individual differences on a single common latent factor. The first-order factor model (ICM-CFA) assumes that the measured ability is composed of several first-order specific facets F1, F2 and F3 which may be correlated, but are depicted by independent factors. Although CFA is the most commonly used approach to model construct-relevant multidimensionality, it is often criticized for its very restrictive independent cluster model assumption, which requires that each item to be defined by only one content domain. Under this overly restrictive assumption, factor correlations tend to be inflated (Asparouhov and Muthén, 2009; Morin et al., 2020).
Figure 1. Simplified graphical representations of five alternative models of early numeracy: (A) the one-factor model (OF), (B) the independent cluster model (ICM-CFA), (C) the exploratory structural equation model (ESEM), (D) the bifactor-CFA model (Bifactor-CFA), and (E) the bifactor exploratory structural equation model (Bifactor-ESEM).
The dimensionality studies of early number skills yielded inconsistent results. In studies that did not consider the hypothesis of a general factor underlying early numeracy, the number of first-order factors varies from 2 to 5. This variation is partly explained by the fact that the researchers used different measurement tools. In the only two studies we are aware of, that consider the hypothesis of a general factor through a bifactor-CFA model (see below), the number of specific factors ranges from 2 (Mou et al., 2021) to 7 (Ryoo et al., 2015).
How to Capture the Potential Multidimensionality and Hierarchically Nature of Early Numeracy?
Bifactor-CFA
In order to take into account the shared communality of each item, the bifactor-CFA (see Figure 1) has been proposed (Reise, 2012). The construct under consideration is assumed to be independently and directly influenced by a general factor and by several specific facets. All factors are supposed to be orthogonal, and the model divides the total observed covariance of the items into a general component (G) which underlies all items, and specific facets (S1, S2, S3…), thus explaining the residual covariance that is not explained by the general factor (Figure 1). The bifactor-CFA better represents the multidimensionality of a construct by taking into account its hierarchical nature, and it is increasingly popular in the literature. As far as we know, only two studies of early number skills have tested a bifactor-CFA representation of their data (see Appendix 1 for details). Mou et al. (2021) focused on set-to-number and number-to-set tasks (Wynn, 1992; Le Corre et al., 2006). They have compared a single-factor model, a 2-factor ICM-CFA and a bifactor-CFA model with a general factor and two specific dimensions. They concluded that set-to-number and number-to-set tasks have substantial overlap but do not appear to be conceptually interchangeable, as the bifactor model showed the best fit to the data. Ryoo et al. (2015) conducted several CFAs (one-factor model, ICM-CFA model, second-order model1 and bifactor-CFA model) across four time points and observed inconsistent results. Best fitting models were, respectively, a 7-factor bifactor-CFA for first and second time points, a 7-factor ICM-CFA model for the third time point and an 8-factor ICM-CFA model for the fourth time point. They did not test models with <7 factors. They concluded that when the shared variance between all items is kept under control, several specific factors appear to explain the residual covariance, which suggests that early number skills seem to have a hierarchical structure.
Bifactor-CFA however raises some measurement and theoretical concerns (Morin et al., 2016a; Bonifay et al., 2017; Sellbom and Tellegen, 2019). In particular, the bifactor-CFA neglects the possibility that items may have cross-loadings on the non-target specific factors. Such unmodeled cross-loadings tend to result in inflated G-factor loadings (Murray and Johnson, 2013; Morin et al., 2016a). In order to take the item cross-loadings into consideration, Asparouhov and Muthén (2009) proposed the exploratory structural equation modeling (ESEM) framework as an alternative to CFA models.
ESEM and Bifactor-ESEM
ESEM allows for the integration of exploratory factor analysis (EFA) within structural equation modeling (SEM). ESEM with target rotation relies on similarly defined factors as in ICM-CFA, but allows for cross-loadings to be freely estimated between items and non-target factors. Relative to ICM-CFA, ESEM estimates loadings for indicators on all factors, and provides more precision in the estimation of factors and more accurate estimates of factor correlations, which results in better discriminant validity (Asparouhov and Muthén, 2009; Morin et al., 2016a,b). Moreover, when estimated with target rotation, ESEM can be used for purely confirmatory purposes (Asparouhov and Muthén, 2009).
To take into account the advantages of the ESEM framework, as well as the hierarchical nature of some constructs, the bifactor exploratory structural equation modeling (bifactor-ESEM) has recently been proposed by some authors (Jennrich and Bentler, 2011; Morin et al., 2016a,b) (Figure 1). Unlike CFA, this new modeling takes the relations of non-target constructs and items into account, and unlike ESEM, it allows the coexistence of both a latent general construct and specific subdomains. By overcoming the CFA and ESEM shortcomings, the bifactor-ESEM might be the most comprehensive and flexible model, able to describe the complex psychological characteristics with most accuracy (Gu et al., 2020).
Since this pioneering work, the bifactor-ESEM proved to be very interesting in the way it accounts for the dimensionality of several psychological constructs (Fadda et al., 2017; Sánchez-Oliva et al., 2017; Gu et al., 2020). However, neither ESEM nor bifactor-ESEM has already been used to study the dimensionality of early number skills. Now, on the one hand, the structure of this domain should be studied by allowing cross-loadings on non-target specific factors, in order to prevent the G-factor loadings inflation to inflate G-factor loadings (Asparouhov and Muthén, 2009). On the other hand, the high intercorrelations between the first-order factors found in early numeracy (i.e., Purpura and Lonigan, 2013)—which is higher than in literacy for example (Storch and Whitehurst, 2002; Thomas et al., 2018)—might mask a more general factor describing variation in responses across all the observed indicators (Betts et al., 2011; Ryoo et al., 2015). Importantly, while bifactor models tend to show superior goodness of fit in model comparison studies (Bonifay et al., 2017), model comparisons need to be anchored both in theory and in a detailed examination of parameter estimates, as suggested by Morin et al. (2020).
The Current Study
To our knowledge, no study applied the bifactor-ESEM framework in the field of early numeracy. Given the strong correlations between subdimensions within that domain and the relevance of the bifactor-ESEM framework applied in other areas, the current study aims to investigate the multidimensionality and hierarchical nature of early skills by assuming that a bifactor-ESEM representation is theoretically supported and would better account for numeracy data than rival measurement models. We suggest that our study could serve as a foundation for future studies aiming to examine the dimensionality of early numeracy by using the bifactor-ESEM framework.
Materials and Methods
Ethics
The study was reviewed and approved by the Ethics Review Panel of the University of Luxembourg in charge of study coordination. The legal guardian of the pupils provided written informed consent to participate in this study.
Participants and Procedure
The present article is based on pretest data collected during a quasi-experimental study conducted in four education systems in continental Europe (Belgium, France, Luxembourg and Switzerland), and aimed at measuring the efficacy of a 12 week numerical games intervention with pupils aged from 4 to 6 years (for details and results, see de Chambrier et al., 2021). There were 23 participating schools (46 classrooms) with pupils from mixed socio-economic backgrounds. While 724 children took part in the study, 80 children were not considered, either because they faced major developmental challenges (trisomy/autism), because they were newly arrived in the country and did not speak the school language, or because they were not present on the day of the pretest. The remaining sample of 644 children can be described as follows: mean age of 61.5 months (sd = 7.1) at pretest time, 47.5% girls, 41.6% attending pre-kindergarten, 27.6% from Belgium, 23.8% from France, 28.9% from Luxembourg, and 19.7% from Switzerland.
Measure
Early numerical ability of children was measured through a test inspired by the TEMA-3 (Ginsburg and Baroody, 2003) and the TEDI-MATH (Van Nieuwenhoven et al., 2001). The TEMA-3 provides a comprehensive assessment of mathematical ability for children aged 3 to 9 years, while the TEDI-MATH enables the diagnosis of mathematical learning disabilities of children aged 5 to 8. These two validated instruments were selected for their psychometrical quality shown in the French-speaking (for the TEDI-MATH) and in the English-speaking (for the TEMA-3) contexts. Relatively to the original tests, the subtests included in the instrument used in the current study were mainly a function of the age of our participants (4 to 6 year old). Overall, the subtests of the French test that were intended to children of our participants' age group (from 5 to 6 years old) were retained and were completed by subtests intended to younger children (from 4 to 5 years old) selected and translated from the English test. Content and descriptive statistics of the 41 items2 across the three theoretical sub-categories (counting, number relations, arithmetic) are reported in Table 1. It has to be noted that some items (marked with an asterisk) were only administered if students were successful in previous items.
Data Analysis
To test our hypothesis, we followed the procedure recommended by Morin et al. (2016a,b). These authors suggest making an initial assessment of the first source of construct-relevant multidimensionality due to the fallible nature of items that include at least some degree of association with non-target dimensions (i.e., cross-loadings). This corresponded to comparing ICM-CFA and ESEM models with one (Aunio et al., 2004), two (National Research Council, 2009; Aunio and Niemivirta, 2010) or three (National Council of Teachers of Mathematics, 2006; Purpura and Lonigan, 2013; Aunio et al., 2019; Milburn et al., 2019) sub-dimensions. ESEM analyses were conducted using a confirmatory approach to rotation (i.e., target rotation), which allows for the a priori specification of indicators of each factor, and for the free estimation of cross-loadings targeted to be as close to zero as possible (Asparouhov and Muthén, 2009). Subsequently, Morin et al. (2016a,b) recommend assessing the second source of construct-relevant multidimensionality due to the potential hierarchical nature of the construct being investigated. This meant comparing the bifactor-counterpart (bifactor-CFA or bifactor-ESEM) of the best fitting first-order solution (for more details, see below). According to this procedure, the bifactor-ESEM solution would have to be retained if the ESEM solution better accounts for the data than the CFA model, and if the bifactor-ESEM solution better fits the data than the first-order ESEM solution.
All analyses were conducted with Mplus 8.4 (Muthen and Muthen, 2012–2019) on the 40 items. Full information robust maximum likelihood (FIML) estimation (Enders, 2010) was used to handle the missing data at item level. As the test included binary coded items except the first one (1_1), the weighted least squares mean and variance adjusted (WLSMV) estimator was used for the factorial analyses. In order to take into account the hierarchical nature of the data (students are nested in classroom), we applied the Mplus design-based adjustment implemented by the TYPE=COMPLEX function (Asparouhov, 2005). For estimating model fit, we followed typical interpretation guidelines (Hu and Bentler, 1999; Marsh et al., 2004) by using several common fit indices: the chi-square statistic (although this statistic is highly sensitive to sample size), the root mean square error of approximation (RMSEA) with its 90% confidence interval, the comparative fit index (CFI), the Tucker-Lewis index (TLI) and the standardized root mean square residual (SRMR). RMSEA must be below 0.06 for an excellent model fit and below 0.08 for an acceptable fit. CFI and TLI must be above 0.95 for an excellent fit and above 0.90 for an acceptable fit. The acceptable range of SRMR is between 0 and 0.08.
Complementary to model fit considerations and as mentioned earlier, Morin et al. (2016a,b) suggested to analyze parameter estimates by starting with a comparison between the ICM-CFA and ESEM solutions. If factors are well-defined by strong target factor loadings in the ESEM solution, the focus has then to be put on the factor correlations matrix. If a discrepant pattern of factor correlations is observed between ICM-CFA and ESEM, the latter model should be retained. Otherwise, parsimony principle argues for retaining the ICM-CFA model. Once the optimal first-order solution has been retained, then Morin et al. (2016a,b) suggested comparing this solution with its bifactor counterpart (respectively, the bifactor-CFA or the bifactor-ESEM). The bifactor representation has to be retained if general and specific dimensions are well-defined and, in the case of an ESEM framework, if cross-loadings decrease in the bifactor-ESEM solution compared to the ESEM solution. For all models, model based composite reliability coefficient calculated from the model standardized parameter estimated as McDonald' (1970) omega (ω) coefficient was reported. According to Perreira et al. (2018), minimal level of acceptability of omega reliability coefficients are 0.60 for measures corresponding to first-order models and 0.50 for measures related to bifactor models.
Measurement invariance across gender (girls/boys) was tested following the procedure described by Toth-Kiraly and Neff (2021) based on Millsap (2011). Nevertheless, as our data includes only binary items, except one, test of the weak invariance level was impossible. More precisely, we successively tested invariance on the configural, strong, strict, latent covariance-variance, and latent means levels. As noted by Brown (2015) and Little (2013), in the context of invariance testing, the chi-square difference test is too sensitive to sample size and to trivial fluctuations and differences. Therefore, alternative fit measures (ΔRMSEA, ΔCFI, and ΔTLI) were used to compare the models. The guidelines suggested by Chen (2007) were followed in comparing nested invariance models (ΔRMSEA ≤ 0.015, ΔCFI ≤ 0.010). TLI change analysis with guidelines similar to CFI was also conducted for purposes of parsimony, as recommended by Marsh et al. (2009).
As the literature does not agree on the number of dimensions, factorial solutions with one, two and three dimensions were compared. With reference to models depicted in Figure 1 and following suggestions made by Morin et al. (2016a,b), eight alternative models were compared: the one-factor model (Model 1), four ICM-CFA models with 2 to 3 factors (Models 2 to 5), the ESEM model (Model 6) related to the best-fitting ICM-CFA model, and the bifactor counterpart of the best-fitting ICM-CFA and ESEM model, respectively, the bifactor-CFA (Model 7) and the bifactor-ESEM (Model 8). Model 2 tested the theoretical two-factor structure with counting/number relations as the first sub-dimension and arithmetic as the second sub-dimension. Model 3 tested the theoretical two-factor structure with counting as the first sub-dimension and number relations/arithmetic as the second sub-dimension. Model 4 tested the theoretical two-factor structure with number relations as the first sub-dimension and counting/arithmetic as the second sub-dimension. Model 5 tested the theoretical three-factor structure with counting as the first sub-dimension, number relations as the second sub-dimension, and arithmetic as the third sub-dimension.
Results
Alternative Representations of Early Numeracy
Estimation procedures for all CFA and ESEM solutions converged properly. Fit indices of the rival models of early numeracy are reported in Table 2. Models 1 to 5 showed acceptable but not excellent fit indexes, as CFI and TLI were below 0.95. SRMR for these models was above 0.08. Among ICM-CFA solutions, Model 5 (with 3 correlated factors) was retained due to its superior fit and contrasted with alternative models. Models 6 and 7 showed excellent empirical fit for all indices, except for SRMR (0.081 and 0.083, respectively). The best fit indices were observed for the bifactor-ESEM solution (Model 8: CFI = 0.977; TLI = 0.972; RMSEA = 0.022; SRMR = 0.070), demonstrating a higher level of fit to the data than the ESEM Model 6 (ΔCFI = +0.013; ΔTLI = +0.014; ΔRMSEA = −0.005) and the bifactor-CFA Model 7 (ΔCFI = +0.010; ΔTLI = +0.009; ΔRMSEA = −0.004). As model parameter estimates have to be analyzed before any decision, factor correlations for the ICM-CFA Model 5 (|r| = 0.778 to 0.899, M|r| = 0.856) and the ESEM Model 6 (|r| = 0.161 to 0.539, M|r| = 0.344) are reported in Table 3. A higher adequacy of the ESEM representation is suggested since factor correlations were considerably reduced between CFA and ESEM solutions. Standardized parameter estimates for Model 5 to 8 are reported in Table 4, but we only compare below the ESEM representation with its bifactor counterpart, as they showed their superior adequacy over CFA models.
Table 3. Standardized factor correlations for the ICM-CFA solutions with 3 first-order factors (above the diagonal) and the corresponding ESEM solutions (below the diagonal).
In the bifactor-ESEM solution, the loadings on the general dimension revealed a well-defined G-factor (|λ| =0.207 to 0.803; M|λ| = 0.603; ω = 0.958). Loadings on the G-factor were high for items associated with counting/enumeration (|λ| = 0.461 to 0.803; M|λ| = 0.674) and lower for items assessing number relations (|λ| = 0.419 to 0.729, M|λ| = 0.611) and arithmetic (|λ| = 0.207 to 0.752, M|λ| = 0.509). The specific counting dimension was well-defined (|λ| = 0.137 to 0.538; M|λ| = 0.370; ω = 0.799) with 15 items on 16 showing a statistically significant loading, but four counting items (1_4 to 1_7) showed negative loading over 0.250 on the specific relations dimension. The specific relations dimension was relatively poorly defined (|λ| = 0.021 to 0.594; M|λ| = 0.237; ω = 0.505) with only 3 items on 11 showing a statistically significant loading. Here again, several items targeting the specific relations dimension (6_1, 7_1, 7_3, 8_1, 8_2, 8_4) showed significant cross-loadings over 0.250 on the specific arithmetic dimension. These unexpected significant cross-loadings regarding the specific counting and the specific relations dimensions suggested that some items of the test were maybe measuring other very specific counting and relations facets of early number skills (see the discussion section). The specific arithmetic dimension was not better defined (|λ| = 0.004 to 0.567; M|λ| = 0.223; ω = 0.456) with 7 of the 13 target items showing a statistically significant saturation and reasonable omega just below the 0.50 satisfactory minimal level suggested by Perreira et al. (2018). In this case, only one item (9_4) showed a statistically significant loading over 0.250 on a non-target specific dimension. Here, the fact that the factor loading of several target items are not significant on the specific arithmetic dimension and the absence of significant cross-loadings reflects that the variance included in these items were mainly used in defining the G-factor. Comparison between the factor loadings of the ESEM solution and the bifactor-ESEM solution showed that cross loadings tend to be significantly less numerous and smaller in the bifactor-ESEM solution, suggesting the presence of an unmodeled general dimension reflected by the numerous and higher cross loadings in the ESEM solution. More precisely, there were 57 statistically significant cross loadings in the ESEM solution while only 29 statistically significant cross loadings remained in the bifactor-ESEM solution. For all these reasons, we considered that the bifactor-ESEM solution (Model 8) was the best representation of our early numeracy data. The explained common variance (ECV) for this model was 0.75, meaning that the general factor explained 75% of the common variance extracted with 25% of the common variance spread across specific factors.
Measurement Invariance
To verify the extent to which the bifactor-ESEM representation of early numeracy was the same for boys and girls, we conducted five tests of measurement invariance across gender groups (Models G1 to G5). Goodness-of-fit results associated with all the models are reported in Table 5. Model G1 (configural invariance) with no invariance constraints provided excellent fit to the data (CFI = 0.985, TLI = 0.981, RMSEA = 0.020). This result suggests that the structure was the same across gender groups. Then, Model G2 put equality constraints on factor loadings and thresholds. Differences between fit indices of Models G1 and G2 were negligeable (ΔCFI = −0.001, ΔTLI = +0.001, ΔRMSEA = 0.000), supporting strong invariance across gender groups. In Model G3, equality constraints were put on item uniquenesses. Differences between fit indices of Models G2 and G3 were negligeable (ΔCFI = +0.002, ΔTLI = +0.003, ΔRMSEA = −0.002), supporting strict invariance across gender groups. When equality constraints were placed on the latent variance-covariance matrix (Model G4), model fit did not decrease substantially between Models G3 and G4 (ΔCFI = +0.002, ΔTLI = +0.003, ΔRMSEA = −0.001), supporting latent variance-covariance invariance across gender groups. Finally, when latent means were constrained to be equal across gender groups (Model G5), differences between fit indices of Models G4 and G5 were again negligeable (ΔCFI = +0.001, ΔTLI = +0.001, ΔRMSEA = −0.001), supporting latent means invariance.
Table 5. Measurement invariance across gender for the bifactor-ESEM representation of early numeracy with one global factor and three specific facets (Model 8).
Discussion
This study applied the integrative psychometric framework developed by Morin et al. (2016a) in order to investigate sources of construct-relevant multidimensionality of early numeracy and to test the superiority of the bifactor-ESEM of early numeracy. As stated by Gu et al. (2020), by overcoming the shortcomings of both the CFA and the ESEM models, the bifactor-ESEM is theoretically the most comprehensive and flexible model, able to describe more accurately the complex psychological characteristics.
In a first step, ICM-CFA and ESEM models with one, two or three sub-dimensions were compared. The ESEM solution provided a better fit to the data and lower factor correlations when compared to the ICM-CFA solution. In a second step, the ESEM model was compared to its bifactor counterpart (bifactor-ESEM). Among all models tested, the bifactor-ESEM solution with one general dimension and three specific facets (specific counting, specific relations, specific arithmetic) showed the best empirical fit with factors moderately well-defined, with less and lower significant cross loadings. A closer look to the loadings and cross-loadings on the specific counting and the specific relations factors lead to consider that four counting items (asking to count not starting from one) and four relations items (with perceptive bias) could be very specific and maybe represent independent specific counting and relations factors.
Data and analyses supported our hypothesis that early number skills could be underpinned by a latent common general factor; when the shared variance between the items is taken into account, specific numerical facets remain. The bifactor nature of early number skills had already been suggested by Ryoo et al. (2015) but using a bifactor-CFA model. However, the bifactor-CFA neglects the possibility that items may have cross-loadings on the non-target specific factors, which might result in inflated G-factor loadings (Murray and Johnson, 2013; Morin et al., 2016a). In contrast, the current study showed empirical support for the bifactor-ESEM solution as a comprehensive model of early numeracy. The explained common variance showed that the general factor explained 75% of the common variance extracted while 25% of the common variance was spread across the three specific factors. This suggests the existence of a general factor underlying all items of the test while taking into consideration the items' cross-loadings, which provides more accurate estimates of factor correlations and thereby better discriminant validity (Asparouhov and Muthén, 2009; Morin et al., 2016a). In practical terms, it means that factor scores (computed from the parameter estimates of the bifactor-ESEM solution) should be preferred to classical scale scores (computed by sum or average of items scores) which are unable to take cross-loadings into account and to adequately disaggregate the variance attributed to the general and the specific factors. Thus, the current study highlighted that early number skills—at least the ones that were measured here—might be considered as a multidimensional hierarchical construct that is underpinned by a single common factor and has valuable specific facets.
Specific Facets
In the current study, when keeping under control the shared variance among items, several specific numerical facets emerged. In other words, the specific factors explain the shared variance among items that remains after controlling the effect of the general factor. This is of particular importance because the interpretation of specific factors in bifactor-ESEM differs from the typical interpretation of correlated factors in ICM-CFA, as they account for very different parts of the observed variance among items. With regards to this important difference, it is nevertheless interesting to note that our ICM-CFA, ESEM and bifactor-ESEM models are rather in line with the tripartite model of early numeracy supported by the NRC (Numbering, Relations and Arithmetic) than with the bipartite model supported by the NCTM (Numbers and Operations), since our data suggest three correlated factors (in ICM-CFA and ESEM) or one general dimension and three specific facets (in bifactor-ESEM) of early number skills. The counting factor (or specific counting facet) includes most of the verbal counting and object counting items, and is therefore quite similar to the Numbering dimension of the NRC, defined as the children's knowledge of the rules and processes of the counting sequence and the ability to obtain quantity in a flexible manner (Purpura and Lonigan, 2013). However, while the NRC Numbering dimension explicitly includes counting forward and backward from numbers other than one, our four items that required the children to count from numbers other than one seem to have a particular behavior. In other studies, in which items requiring counting forward from numbers other than one were included (Aunio and Niemivirta, 2010; Purpura and Lonigan, 2013), factor solutions with more than three dimensions were not explored. Thus, our results suggest than it could be interesting to distinguish more than three factors, considering that the ability to count from numbers other than one might constitute an ability per se. This factor might correspond to the breakable level of verbal counting mastery, as described by Fuson (1988). This level of verbal counting mastery is necessary to count from numbers other than one, while counting up to and enumerating collections can be done within an unbreakable list level of verbal string mastery. Therefore, such counting tasks might constitute a second specific counting dimension which could correspond to a higher level of counting skills than the first one. More studies are needed to investigate this hypothesis.
Our data support a specific Relations dimension, but again, several items appeared having a particular behavior. These items were involving a perceptual bias: the conservation of number task, and the three items in which children had to compare two collections with a perceptual trap. Perceptual bias occurs when the visuospatial appearance makes children think there are more elements in a first collection than in a second one, when in fact there are the same number in both collections or there are more elements in the second. This perceptual bias typically occurs when items of one of the collections are more dispersed. In order to succeed in this kind of tasks, children must inhibit the inappropriate “length-equals-number heuristic” (Bjorklund and Harnishfeger, 1990; Houdé et al., 2011), or more generally the visuospatial heuristic according to which “bigger is more.” It has been found that children are hardly capable of resisting these before the age of 7 years-old, leading to a failure in grasping the number principle per se. This can be explained by the fact that in many daily life situations, numerosity and occupied space are strongly linked, as more objects usually occupy more space. Houdé et al. (2011), by comparing the brain activation in children who were conservers (9–10 years old) and non-conservers (5–6 years old) in the Piagetian sense, found that while both groups of children processed the Piagetian task as a quantitative number task, activation in cerebral regions underpinning executive control, spatial working memory and episodic memory retrieval were necessary to resist the perceptual bias in order to succeed the task. Thus, it seems that the ability to refer to the cardinal property of numbers beyond the misleading perceptual characteristics of the items (Gelman and Gallistel, 1978) corresponds to a specific dimension of early number skills. The perceptual appearance being very influential at the age of our participants, these tasks could also assess an ability that relies on “a large-scale executive brain network” (Houdé et al., 2011; p. 344) rather than on typically numerical skill. Again, further studies are needed in this direction.
Finally, the last specific factor that emerged in the bifactor-ESEM solution is relatively well-defined by the different items planned to assess children's arithmetical skills. This one is quite similar to the NRC Arithmetic and to the NCTM Operation dimensions, defined as the understanding of the ways in which groups are composed and decomposed by differentiating sets and subsets (Purpura and Lonigan, 2013). Interestingly, in the studies analyzing the factorial structure of number skills that included arithmetic items, such a dimension was always extracted (Aunio et al., 2006; Aunio and Niemivirta, 2010; Purpura and Lonigan, 2013; Ryoo et al., 2015). Our study extends this finding through the bifactor-ESEM framework by showing that when keeping under control the common general factor that underlines early number skills, arithmetic problem responses are psychometrically distinct from other responses. However, the items loading on this specific dimension were more or less high. It should be specified that the items belonging to this specific facet were considerably different from each other. Indeed, the arithmetical tasks included in the test required the children to solve number combinations with concrete elements, to solve story problems and to decompose numbers. Thus, it could be that with more arithmetical subtests and items than in the current study, more specific arithmetical facets would appear beyond the control of the general factor.
General Factor
The current results also indicate that a considerable part of variance was shared across all items of the test used, suggesting the existence of a general factor underlying early number skills. But what this latent common factor is still needs to be identified. This remains difficult since latent common factors can be domain-general abilities, domain-specific abilities, or even a mix of them. Since we did not measure general cognitive abilities in the current study, we can only formulate some hypotheses comparing our numerical data to the literature. In the case of early number skills, the domain-general factors that could be candidates for the latent general factor identified are the covariates that have been found to be associated to early number skills, such as general intelligence (Dickerson Mayes et al., 2009; Krajewski and Schneider, 2009; Green et al., 2018), working memory (Bull et al., 2008; Alloway and Alloway, 2009), linguistic skills (Kleemans et al., 2011; Raghubar and Barnes, 2017), executive functions (Espy et al., 2004; Kroesbergen et al., 2009), and visuo-spatial ability (Kyttälä et al., 2003). By looking at the items that loaded the more and the less on the general factor and by knowing from literature by which factors these numerical abilities are strongly underpinned, some suppositions can be made about what this general factor might most likely correspond to. The items that loaded the most on the general factor were verbal counting, counting small quantities, comparing/ordering few and small quantities, adding few concrete elements and solving simple problems (type change involving a small addition). On the opposite, the items that loaded the less on the general factor were adding higher quantities of concrete elements (for instance 7 and 5), complex arithmetic tasks such as number decomposition, difficult verbal problems (type change involving a subtraction or type combine), counting higher collections (12 or more) and conservation of numbers. Because complex arithmetic tasks and word problems in particular are known to be strongly underpinned by working memory (Geary et al., 2004; Träff, 2007; Zheng et al., 2011), language skills (Van Rinsveld et al., 2015) and general intelligence (Hornung et al., 2014), these factors do not appear to be the best candidates for the general factor highlighted here. The exclusion of language skills as a potential general factor is also supported by the fact that such skills were found not to predict non-linguistic arithmetic (LeFevre et al., 2010), items that highly loaded on the general factor in the current study. The observation that conservation of numbers did not strongly load on the general factor gives reason to think that executive functions are not either a good candidate for the general factor. Indeed, the ability to solve this task has been found to strongly rely on the executive function of inhibitory control (Houdé et al., 2011, see above). A general cognitive factor that might correspond to the current general factor would be visuo-spatial ability, since both non-linguistic arithmetic (LeFevre et al., 2010) and counting knowledge (Kyttälä et al., 2003)—which strongly loaded on it here—were previously found to be related to visuospatial skills.
The latent common factor found in the current study could also be a more domain-specific ability. Interestingly, Mou et al. (2021), who showed the superiority of a bifactor-CFA by measuring simultaneously numerical and more general abilities, highlighted that only one third of their general latent factor variance was explained by the combination of general non-numerical dimensions, such as cognitive and linguistic abilities and age. This suggested that the general factor they found was mainly numerical. From the literature, a domain-specific factor that could be a candidate for the latent general factor found here is number sense. It refers to elementary intuitions about quantity, such as the rapid and accurate perception of small numerosities, the ability to compare roughly larger numerical magnitudes, to count, and to comprehend simple arithmetic operations, and is considered as an essential tool to develop higher order mathematical skills (Case et al., 1992; Geary, 1995; Dehaene, 1997, 2001; Berch, 2005; Gersten et al., 2005; Jordan et al., 2010; Chu et al., 2015; Sowinski et al., 2015). In the current study, the fact that the items loading the more on the general factor were very fundamental skills gives credit to the possibility that number sense corresponds to this general factor.
The last hypothesis is that the general factor would only be a psychometric artifact due to the positive manifold of correlations observed between cognitive test tasks (van der Maas et al., 2006, 2011; Borsboom, 2017; Fried et al., 2017). According to van Bork et al. (2017), “the existence of a general factor is not tested against the data because any dataset that features a positive manifold will necessarily support a general factor model, whether or not a general factor underlies the data” (p. 764). This is the main argument of the precited authors who developed the relatively recent Network Analysis approach, somehow rejecting the latent variable approach (for a critique of this position, see Guyon et al., 2017). The proponents of Network Analysis offer an alternative explanation for the positive intercorrelations between tasks scores, based on the biological concept of mutualism (van der Maas et al., 2006). This concept postulates that the positive manifold emerges purely by mutually beneficial relationships between cognitive processes and its developmental dynamic. In concrete words, cognitive abilities (the nodes) are conceptualized as being more or less directly related to each other (the edges) without postulating any latent factor. Recently, Kan et al. (2019) and Kan et al. (2020) have shown the superior fit of a network model of intelligence compared to a higher-order g model and a bifactor-CFA model. As Network Analysis is not implemented in MPlus and currently not available with binary variables within the reference R package Psychonetrics (Epskamp, 2020a,b), we did not have the opportunity to compare the ICM-CFA and ESEM models with a Network Analysis model of early numeracy. Anyway, this would not have provided a definitive answer to the question of which statistical model best represents reality, as the aim in the social sciences is to obtain the best theoretically and practically useful representation of what is under investigation. Importantly, authors of the Network Analysis approach have never compared network models with ESEM and bifactor-ESEM models. Our results provide evidence that both sources of construct-relevant psychometric multidimensionality were present in the early numeracy test used, supporting the use of ESEM and suggesting the appropriateness of the bifactor-ESEM, with regards to traditional ICM-CFA models. ICM-CFA and ESEM solutions differed indeed in their factor correlations with much lower factor correlations for ESEM than for ICM-CFA. Including cross-loadings in ESEM decreases the amount of shared variance among items (i.e., the positive manifold), suggesting that ESEM resulted in a better differentiation between early numeracy factors than ICM-CFA. Moreover, the existence of significant and substantial non-target cross-loadings in ESEM supported the need for a bifactor-ESEM analysis. Examination of the bifactor-ESEM solution revealed that many fewer items were presenting meaningful and substantial non-target cross-loadings compared to ESEM solutions. On this basis, and because factor loadings on the general factor are large, and correlations between the specific factors scores are non-significant within the bifactor-ESEM solution (see Morin et al., 2020), we would support that the general factor underlined in this study is not a psychometric artifact, and not better or worse than alternative equivalents models (MacCallum et al., 1993; Williams, 2012).
Limitations, Conclusion, and Further Prospects
There are several limitations to this study. First, the sample of children was a convenience sample. Even if we tried to enroll schools with socio-economically diverse background, the sample is not representative, and results cannot be generalized. Secondly, within-participants variations were not examined across time. It would be interesting to know if the bifactor-ESEM structure of early number skills changes over time or not. Third, pupils in our study were assessed through a French-speaking test. Further studies should confirm if the bifactor-ESEM representation of early numeracy is still adequate when English-speaking tests are used.
The present study highlighted the relevance of the bifactor-ESEM framework when it comes to better identifying the dimensionality of early number skills. Indeed, it was found from the current data that when keeping under control the high intercorrelations within that domain, three specific numerical dimensions remained. These specific facets of early numeracy, which are different in nature compared to correlated factors in ICM-CFA or ESEM, have a substantive value. However, the specific facets highlighted here should not obscure the fact that early numeracy appears to be principally a unidimensional construct.
Determining the structure of early numeracy skills is crucial to assessing them properly, as well as teaching them efficiently, developing appropriate early intervention strategies for at-risk children, and studying their relations with other variables. For instance, if the structure of early numeracy that stood out in the current experiment is confirmed by further studies, this suggests that when seeking to assess the number skills of young children, subtests measuring both the general latent factor and the three highlighted specific dimensions should minimally be taken into consideration. The refined investigation of early numeracy structure conducted here could also lead to more targeted teaching activities for all pupils and to reinforcement activities for pupils identified as at risk of school failure, as it has been done in the domain of early literacy (Juel and Minden-Cupp, 2000). Concretely, teaching materials could be developed to enhance pupils' skills on the general factor and the specific dimensions. Also, as pointed by Brunner (2008), the relationships between math abilities and other variables strongly depends on the structural conception of math skills. For instance, the relationships between transversal variables such as socioeconomic status or general intelligence could be different with the general or the specific math dimensions highlighted in the current study. This could lead to identify some math abilities that are more dependent on children's cognitive characteristics or socioeconomic heritage, and others that could be more dependent, for example, on the instruction received.
In order to further investigate the dimensionality of early numeracy and in particular to better identify what the current general factor could be, a bifactor-ESEM analysis could be conducted on additional early number skills tasks as well as on tasks measuring the more general candidates. For example, number sense is typically assessed by the speed with which individuals make magnitude comparisons, or by their ability to place numbers on a number line (Berch, 2005; Siegler and Ramani, 2009), skills that were not assessed here. Including such tasks in a further study as well as general abilities such as non-verbal intelligence, working memory, etc., and observing how the numerical items would behave on the general factor, would allow to identify more reliably what this general factor could correspond to. Further studies could also address the predictive validity of the bifactor-ESEM solution of early numeracy. Demonstrating it would strengthen the relevance and the added value of such a model.
In the future, it also seems to us that to diminish sources of confusion in the understanding of the early number skills structure, harmonization efforts should be made regarding the selected tasks and their headings. For example, while several authors included arithmetic tasks in their studies (Aunio and Niemivirta, 2010; Purpura and Lonigan, 2013; Ryoo et al., 2015), other researchers did not (Cirino, 2011; Hirsch et al., 2018). This can, of course, have an incidence on the number of factors that stand out. The number of tasks among each type of skills is also very different from study to study, which is known to affect the factor structure (Purpura and Lonigan, 2013). For example, Lee et al. (2012) administered a 1 min number facts task as a measure of arithmetic ability, while Purpura and Lonigan's (2013) included eight different arithmetic tasks comprising 38 items in total. An additional source of confusion in the factors identified might come from the use—or not—of Arabic numerals. Indeed, some authors used several tasks involving digits (i.e., Cirino, 2011; Purpura and Lonigan, 2013; Ryoo et al., 2015) while others did not (Aunio and Niemivirta, 2010) or only a few (Hirsch et al., 2018). Actually, the difference between symbolic and non-symbolic skills has been found to be a major distinction (Cirino, 2011). Since tasks involving Arabic numerals are generally considered as falling within the relations dimension (National Research Council, 2009; Cirino, 2011; Purpura and Lonigan, 2013), the interlacing between this dimension and Arabic numerals might, maybe improperly, strengthen the incidence of this dimension. Finally, the headings of the subtests are also sometimes confusing. For example, while the counting tasks used by the majority of the authors consist of counting forward or backward, or in enumerating collections and are considered as belonging to the Number dimension, the Hirsch et al. (2018) “counting task” requires the children to draw a line below the nth bell among a line of bells, and this appeared as an eponymous factor per se. This task is therefore very similar to the subtest called “ordinality” in Purpura and Lonigan's (2013) study (identify the nth picture in a line) that was part of their relations dimension.
In sum, a promising avenue to further study the early number skills dimensionality is to apply the bifactor-ESEM framework on data including candidates for the general factor, as well as by taking into account as comprehensively as possible specific facets that might stand out. Precisely understanding what the important general factor underlying early number skills is, and the exact specific dimensions that remain after having kept under control the high intercorrelations within that domain, would constitute significant progress in the study of early numeracy. This analysis could be interestingly contrasted with a network analysis in order to discard the psychometric artifact hypothesis that several authors are supporting concerning general latent factor models.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The study was reviewed and approved by the Ethics Review Panel of the University of Luxembourg in charge of study coordination. The legal guardian of the pupils provided written informed consent to participate in this study.
Author Contributions
CD and AFC wrote the manuscript. All authors contributed to the conception of the study, the manuscript revision, and approved the submitted version.
Funding
The present study was mainly supported by a grant from the University of Luxembourg (grant R-STR-3051-00-Z, 2015). The University of Teacher Education from State of Vaud, through the Lasalé Lab (Laboratoire sur l'accrochage scolaire et les alliances éducatives), supported the data collection in Switzerland. In Belgium, the data collection was supported by a grant delivered by the Sectorial Credits for Research in Human Sciences of the University of Liège. In France, the Ecole supérieure du professorat et de l'éducation de Lorraine supported the data collection.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.680124/full#supplementary-material
Footnotes
1. ^In a second-order (or higher-order) model, paths are specified from a second-order factor to the first-order factors, which in turns have paths leading to observed indicators. Second-order model relies on stringent proportionality constraints which are nevertheless unlikely to hold in practice (for details, see Canivez, 2016; Morin et al., 2016a).
2. ^An Item Response Theory (IRT) model including difficulty and discrimination parameters (Birnbaum, 1968) was used to estimate pupil competence and item parameters of the test (for details, see Masked Reference). As an alternative to the classical theory (or true score theory) where a total test score is calculated simply by adding correct answers (but with measurement error), IRT modeling estimates the probability that a student will give a correct answer to a specific item, depending both on the student ability and the specific items characteristics, leading to a weighted score (for more details about IRT, see van der Linden, 2017). The analyses were conducted using Conquest 2.0 software, which generated a Warm's mean likelihood estimation (WLE) for each pupil. On 41 items, one (7_2) was a posteriori discarded from the analyses because of bad item fit statistics.
References
Alloway, T. P., and Alloway, R. G. (2009). Investigating the predictive roles of working memory and IQ in academic attainment. J. Exp. Child Psychol. 106, 20–29. doi: 10.1016/j.jecp.2009.11.003
Asparouhov, T. (2005). Sampling weights in latent variable modeling. Struct. Equat. Model. 12, 411–434. doi: 10.1207/s15328007sem1203_4
Asparouhov, T., and Muthén, B. (2009). Exploratory structural equation modeling. Struct. Equat. Model. 16, 397–438. doi: 10.1080/10705510903008204
Aunio, P., Ee, J., Lim, S. E. A., Hautamäki, J., and Van Luit, J. (2004). Young children's number sense in Finland, Hong Kong, and Singapore. Int. J. Early Years Educ. 12, 195–216. doi: 10.1080/0966976042000268681
Aunio, P., Hautamäki, J., Heiskari, P., and Van Luit, J. E. H. (2006). The early numeracy test in Finnish: Children's norms. Scandinavian J. Psychology. 47, 369–378. doi: 10.1111/j.1467-9450.2006.00538.x
Aunio, P., Korhonen, J., Ragpot, L., Törmänen, M., Mononen, R., and Henning, E. (2019). Multi-factorial approach to early numeracy – the effects of cognitive skills, language factors and kindergarten attendance on early numeracy performance of South African first graders. Int. J. Educ. Res. 97, 65–76. doi: 10.1016/j.ijer.2019.06.011
Aunio, P., and Niemivirta, M. (2010). Predicting children's mathematical performance in grade one by early numeracy. Learn. Individ. Differ. 20, 427–435. doi: 10.1016/j.lindif.2010.06.003
Berch, D. B. (2005). Making sense of number sense: implications for children with mathematical disabilities. J. Learn. Disbil. 38, 333–339. doi: 10.1177/00222194050380040901
Betts, J., Pickart, M., and Heistad, D. (2011). Investigating early literacy and numeracy: exploring the utility of the bifactor model. Sch. Psychol. Q. 26, 97–107. doi: 10.1037/a0022987
Birnbaum, A. (1968). “Some latent trait models and their use in inferring an examinee's ability,” in Statistical Theories of Mental Test Scores, eds F. M. Lord and M. R. Novick (Reading, MA: Addison-Wesley), 397–479.
Bjorklund, D. F., and Harnishfeger, K. K. (1990). The resources construct in cognitive development: diverse sources of evidence and a theory of inefficient inhibition. Dev. Rev. 10, 48–71. doi: 10.1016/0273-2297(90)90004-N
Bonifay, W., Lane, S. P., and Reise, S. P. (2017). Three concerns with applying a bifactor model as a structure of psychopathology. Clin. Psychol. Sci. 5, 184–186. doi: 10.1177/2167702616657069
Borsboom, D. (2017). A network theory of mental disorders. World Psychiatry 16, 5–13. doi: 10.1002/wps.20375
Brown, T. A. (2015). Confirmatory Factor Analysis for Applied Research. New York, NY: The Guilford Press.
Brunner, M. (2008). No g in education? Learn. Individ. Differ. 18, 152–165. doi: 10.1016/j.lindif.2007.08.005
Bull, R., Espy, K. A., and Wiebe, S. A. (2008). Short-term memory, working memory, and executive functioning in preschoolers: longitudinal predictors of mathematical achievement at age 7 years. Dev. Neuropsychol. 33, 205–228. doi: 10.1080/87565640801982312
Canivez, G. L. (2016). “Bifactor modeling in construct validation of multifactored tests: Implications for multidimensionality and test interpretation,” in Principles and Methods of Test Construction: Standards and Recent Advancements, eds K. Schweizer and C. DiStefano (Gottingen: Hogrefe), 247–271.
Case, L. P., Harris, K. R., and Graham, S. (1992). Improving the mathematical problem-solving skills of students with learning disabilities: self-regulated strategy development. J. Spec. Educ. 26, 1–19. doi: 10.1177/002246699202600101
Chen, F. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement. Struct. Equat. Model. 14, 464–504. doi: 10.1080/10705510701301834
Chu, F. C., vanMarle, K., and Geary, D. C. (2015). Early numerical foundations of young children's mathematical development. J. Exp. Child Psychol. 132, 205–212. doi: 10.1016/j.jecp.2015.01.006
Cimino, A. N., Killian, M. O., Von Ende, A. K., and Segal, E. A. (2020). Measurement models in social work research: a data-based illustration of four confirmatory factor models and their conceptual application. Br. J. Soc. Work 50, 282–301. doi: 10.1093/bjsw/bcz164
Cirino, P. T. (2011). The interrelationships of mathematical precursors in kindergarten. J. Exp. Child Psychol. 4, 713–733. doi: 10.1016/j.jecp.2010.11.004
Clements, D. H., and Sarama, J. (2007). Effects of a preschool mathematics curriculum: Summative research on the Building Blocks project. J. Res. Math. Educ. 38, 136–163.
Clements, D. H., and Sarama, J. (2011). Early childhood mathematics intervention. Science 333, 968–970. doi: 10.1126/science.1204537
de Chambrier, A.-F., Baye, A., Tinnes-Vigne, M., Tazouti, Y., Vlassis, J., Poncelet, D., et al. (2021). Enhancing children?s numerical skills through a play-based intervention at kindergarten and at home: a quasi-experimental study. Early Childhood Research Quarterly. 54, 164–178. https://doi.org/10.1016/j.ecresq.2020.09.003
Dehaene, S. (1997). The Number Sense: How the Mind Creates Mathematics. New York, NY: Oxford University Press.
Dickerson Mayes, S., Calhoun, S. L., Bixler, E. O., and Zimmerman, D. N. (2009). IQ and neuropsychological predictors of academic achievement. Learn. Individ. Differ. 19, 238–241. doi: 10.1016/j.lindif.2008.09.001
Dierendonck, C., Milmeister, P., Kerger, S., and Poncelet, D. (2019). Examining the measure of student engagement in the classroom using the bifactor model: increased validity when predicting misconduct at school. Int. J. Behav. Dev. 24, 279–286. doi: 10.1177/0165025419876360
Duncan, G. J., Dowsett, C. J., Claessens, A., Magnuson, K., Huston, A. C., Klebanov, P., et al. (2007). School readiness and later achievement. Dev. Psychol. 43, 1428–1446. doi: 10.1037/0012-1649.43.6.1428
Epskamp, S. (2020a). Psychometric network models from time-series and panel data. Psychometrika 85, 1–26. doi: 10.1007/s11336-020-09697-3
Epskamp, S. (2020b). Psychonetrics: Structural Equation Modeling and Confirmatory Network Analysis. R Package Version 0.7.1. Available online at: https://cran.r-project.org/web/packages/psychonetrics/index.html
Espy, K. A., McDiarmid, M. M., Cwik, M. F., Stalets, M. M., Hamby, A., and Senn, T. E. (2004). The contribution of executive functions to emergent mathematical skills in preschool children. Dev. Neuropsychol. 26, 465–486. doi: 10.1207/s15326942dn2601_6
Fadda, D., Scalas, F., Meleddu, M., and Morin, A. J. S. (2017). A bifactor-ESEM representation of the Questionnaire for Eudaimonic Wellbeing. Pers. Individ. Dif. 116, 216–222. doi: 10.1016/j.paid.2017.04.062
Fried, E. I., van Borkulo, C. D., Cramer, A. O. J., Boschloo, L., Schoevers, R. A., and Borsboom, D. (2017). Mental disorders as networks of problems: a review of recent insights. Soc. Psychiatry Psychiatr. Epidemiol. 52, 1–10. doi: 10.1007/s00127-016-1319-z
Frye, D., Baroody, A. J., Burchinal, M., Carver, S. M., Jordan, N. C., McDowell, J., et al. (2013). Teaching Math to Young Children: A Practice Guide (NCEE 2014-4005). Washington, DC: U.S. Department of Education, Institute of Education Sciences, National Center for Education Evaluation and Regional Assistance. Retrieved from: https://ies.ed.gov/ncee/wwc/Docs/PracticeGuide/early_math_pg_111313.pdf
Geary, D. C. (1995). Reflections of evolution and culture in children's cognition: implications for mathematical development and instruction. Am. Psychol. 50, 24–37. doi: 10.1037/0003-066X.50.1.24
Geary, D. C., Hoard, M. K., Byrd-Craven, J., and DeSoto, M. (2004). Strategy choices in simple and complex addition: contributions of working memory and counting knowledge for children with mathematical disability. J. Exp. Child Psychol. 88, 121–151. doi: 10.1016/j.jecp.2004.03.002
Gelman, R., and Gallistel, C. (1978). The Child's Understanding of Number. Cambridge, MA: Harvard University Press.
Gersten, R., Jordan, N. C., and Flojo, J. R. (2005). Early identification and interventions for students with mathematics difficulties. J. Learn. Disabil. 38, 293–304. doi: 10.1177/00222194050380040301
Ginsburg, H. P., and Baroody, A. J. (2003). Test of Early Mathematics Ability, 3rd Edn (TEMA-3). Austin, TX: Pro-Ed.
Ginsburg, H. P., Lee, J. S., and Boyd, J. S. (2008). Mathematics education for young children: what it is and how to promote it. Soc. Policy Rep. 22, 1–23. doi: 10.1002/j.2379-3988.2008.tb00054.x
Green, C. T., Bunge, S., Chiongbian, V. B., Barrow, M., and Ferrer, E. (2018). Fluid reasoning predicts future mathematical performance among children and adolescents. J. Exp. Child Psychol. 157, 125–143. doi: 10.1016/j.jecp.2016.12.005
Gu, H., Wen, Z., and Fan, X. (2020). Investigating the multidimensionality of the Work-Related Flow Inventory (WOLF): a bifactor exploratory structural equation modeling framework. Front. Psychol. 11:740. doi: 10.3389/fpsyg.2020.00740
Guyon, H., Falissard, B., and Kop, J.-L. (2017). Modeling psychological attributes in psychology – an epistemological discussion: network analysis vs. latent variables. Front. Psychol. 8:798. doi: 10.3389/fpsyg.2017.00798
Hirsch, S., Lambert, K., Coppens, K., and Moeller, K. (2018). Basic numerical competences in large-scale assessment data: structure and long-term relevance. J. Exp. Child Psychol. 167, 32–48. doi: 10.1016/j.jecp.2017.09.015
Hornung, C., Schiltz, C., Brunner, M., and Martin, R. (2014). Predicting first-grade mathematics achievement: the contributions of domain-general cognitive abilities, nonverbal number sense, and early number competence. Front. Psychol. 5:272. doi: 10.3389/fpsyg.2014.00272
Houdé, O., Pineau, A., Leroux, G., Poirel, N., Perchey, G., Lanoë, C., et al. (2011). Functional MRI study of Piaget's conservation-of-number task in preschool and school-age children: a neo-Piagetian approach. J. Exp. Child Psychol. 110, 332–346. doi: 10.1016/j.jecp.2011.04.008
Hu, L.-T., and Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct. Equat. Model. 6, 1–55. doi: 10.1080/10705519909540118
Jennrich, R. I., and Bentler, P. M. (2011). Exploratory bi-factor analysis. Psychometrika 76, 537–549. doi: 10.1007/s11336-011-9218-4
Jordan, N. C., Glutting, J., and Ramineni, C. (2010). The importance of number sense to mathematics achievement in first and third grades. Learn. Individ. Differ. 20, 82–88. doi: 10.1016/j.lindif.2009.07.004
Jordan, N. C., Kaplan, D., Ramineni, C., and Locuniak, M. N. (2009). Early math matters: kindergarten number competence and later mathematics outcomes. Dev. Psychol. 45, 850–867. doi: 10.1037/a0014939
Juel, C., and Minden-Cupp, C. (2000). Learning to read words: linguis tic units and instructional strategies. Read. Res. Q. 35, 128–134. doi: 10.1598/RRQ.35.4.2
Kan, K.-J., de Jonge, H., van der Maas, H. L. J., Levine, S. Z., and Epskamp, S. (2020). How to compare psychometric factor and network models. J. Intelligence 8:35. doi: 10.3390/jintelligence8040035
Kan, K.-J., van der Maas, H. L. J., and Levine, S. Z. (2019). Extending psychometric network analysis: empirical evidence against g in favor of mutualism? Intelligence 73, 52–62. doi: 10.1016/j.intell.2018.12.004
Kleemans, T., Segers, E., and Verhoeven, L. (2011). Cognitive and linguistic precursors to numeracy in kindergarten: evidence from first and second language learners. Learn. Individ. Differ. 21, 555–561. doi: 10.1016/j.lindif.2011.07.008
Krajewski, K., and Schneider, W. (2009). Exploring the impact of phonological awareness, visual–spatial working memory, and preschool quantity-number competencies on mathematics achievement in elementary school: findings from a 3-year- longitudinal study. J. Exp. Child Psychol. 103, 516–531. doi: 10.1016/j.jecp.2009.03.009
Kroesbergen, E. H., Van Luit, J. E. H., Van Lieshout, C. D. M., Van Loosbroek, E., and Van de Rijt, B. A. M. (2009). Individual differences in early numeracy. The role of executive functions and subitizing. J. Psychoeduc. Assess. 27, 226–236. doi: 10.1177/0734282908330586
Kyttälä, M., Aunio, P., Lehto, J. E., Van Luit, J. E. H., and Hautamäki, J. (2003). Visuospatial working memory and early numeracy. Educ. Child Psychol. 20, 65–76.
Le Corre, M., Van de Walle, G., Brannon, E. M., and Carey, S. (2006). Re-visiting the competence/performance debate in the acquisition of the counting principles. Cogn. Psychol. 52, 130–169. doi: 10.1016/j.cogpsych.2005.07.002
Lee, Y.-S., Lembke, E., Moore, D., Ginsburg, H. P., and Pappas, S. (2012). Item-level and construct evaluation of early numeracy curriculum-based measures. Assess. Effect. Interv. 37, 107–117. doi: 10.1177/1534508411431255
LeFevre, J. A., Fast, L., Skwarchuk, S. L., Smith-Chant, B. L., Bisanz, J., Kamawar, D., et al. (2010). Pathways to mathematics: longitudinal predictors of performance. Child Dev. 81, 1753–1767. doi: 10.1111/j.1467-8624.2010.01508.x
MacCallum, R. C., Wegener, D. T., Uchino, B. N., and Fabrigar, L. R. (1993). The problem of equivalent models in applications of covariance structure analysis. Psychol. Bull. 114, 185–199. doi: 10.1037/0033-2909.114.1.185
Marsh, H. W., Hau, K.-T., and Wen, Z. (2004). In search of golden rules: comment on hypothesis-testing approaches to cutoff values for fit indexes and dangers in overgeneralizing Hu and Bentler (1999). Struct. Equat. Model. 11, 320–341. doi: 10.1207/s15328007sem1103_2
Marsh, H. W., Muthén, B., Asparouhov, T., Lüdtke, O., Robitzsch, A., Morin, A. J. S., et al. (2009). Exploratory structural equation modeling, integrating CFA and EFA: application to students' evaluations of university teaching. Struct. Equat. Model. 16, 439–476. doi: 10.1080/10705510903008220
Mazzocco, M. M. M., and Claessens, A. (2020). Introduction to the special issue: parents supporting early mathematical thinking. Early Child. Res. Q. 50, 1–3. doi: 10.1016/j.ecresq.2019.07.007
McDonald, R. P. (1970). Theoretical foundations of principal factor analysis and alpha factor analysis. Br. J. Math. Stat. Psychol. 23, 1–21. doi: 10.1111/j.2044-8317.1970.tb00432.x
Milburn, T. F., Lonigan, C. J., DeFlorio, L., and Klein, A. (2019). Dimensionality of preschoolers' informal mathematical abilities. Early Child. Res. Q. 47, 487–495. doi: 10.1016/j.ecresq.2018.07.006
Millsap, R. E. (2011). Statistical Approaches to Measurement Invariance. New York, NY: Taylor and Francis.
Morin, A. J. S., Arens, A. K., and Marsh, H. W. (2016a). A bifactor exploratory structural equation modeling framework for the identification of distinct sources of construct-relevant psychometric multidimensionality. Struct. Equat. Model. 23, 116–139. doi: 10.1080/10705511.2014.961800
Morin, A. J. S., Arens, A. K., Tran, A., and Caci, H. (2016b). Exploring sources of construct relevant multidimensionality in psychiatric measurement: a tutorial and illustration using the Composite Scale of Morningness. Int. J. Methods Psychiatr. Res. 25, 277–288. doi: 10.1002/mpr.1485
Morin, A. J. S., Myers, N. D., and Lee, S. (2020). “Modern factor analytic techniques: bifactor models, exploratory structural equation modeling (ESEM) and bifactor-ESEM,” in Handbook of Sport Psychology, 4th Edn, 1044–1073. eds G. Tenenbaum and R. C. Eklund (London: Wiley).
Mou, Y., Zhang, B., Piazza, M., and Hyde, D. C. (2021). Comparing set-to-number and number-to-set measures of cardinal number knowledge in preschool children using latent variable modeling. Early Child. Res. Q. 45, 125–135. doi: 10.1016/j.ecresq.2020.05.016
Murray, A. L., and Johnson, W. (2013). The limitations of model fit in comparing the bi-factor versus higher-order models of human cognitive ability structure. Intelligence 41, 407–422. doi: 10.1016/j.intell.2013.06.004
Muthen, L., and Muthen, B. (2012–2019). Mplus Statistical Analysis With Latent Variables: User's Guide. Los Angeles, CA: Muthen and Muthen.
National Council of Teachers of Mathematics (2006). Curriculum Focal Points for Prekindergarten Through Grade 8 Mathematics: A Quest for Coherence. Retrieved from: https://www.nctm.org/Publications/teaching-children-mathematics/2006/Vol13/Issue3/Curriculum-Focal-Points-for-Pre-K-Grade-8-Mathematics_-A-Quest-for-Coherence/ (accessed May 28, 2021).
National Early Literacy Panel (2008). Developing Early Literacy: Report of the National Early Literacy Panel. Washington, DC: National Institute for Literacy. Retrieved from: https://lincs.ed.gov/publications/pdf/NELPReport09.pdf
National Mathematics Advisory Panel (2008). Foundations for Success: The Final Report of the National Mathematics Advisory Panel. U.S. Department of Education, Washington, DC.
National Research Council (2009). Mathematics Learning in Early Childhood: Paths Toward Excellence and Equity. Washington, DC: National Academies Press.
Perreira, T. A., Morin, A. J. S., Hebert, M., Gillet, N., Houle, S. A., and Berta, W. (2018). The short form of the Workplace Affective Commitment Multidimensional Questionnaire (WACMQ-S): a bifactor-ESEM approach among healthcare professionals. J. Vocat. Behav. 106, 62–83. doi: 10.1016/j.jvb.2017.12.004
Purpura, D. J., and Lonigan, C. J. (2013). Informal numeracy skills: the structure and relations among numbering, relations, and arithmetic operations in preschool. Am. Educ. Res. J. 50, 178–209. doi: 10.3102/0002831212465332
Raghubar, K. P., and Barnes, M. A. (2017). Early numeracy skills in preschool-aged children: a review of neurocognitive findings and implications for assessment and intervention. Clin. Neuropsychol. 31, 329–351. doi: 10.1080/13854046.2016.1259387
Reise, S. P. (2012). The rediscovery of bifactor measurement models. Multivariate Behav. Res. 47, 667–696. doi: 10.1080/00273171.2012.715555
Ryoo, J. H., Molfese, V. J., Brown, E. T., Karp, K. S., Welch, G. W., and Bovaird, J. A. (2015). Examining factor structures on the Test of Early Mathematics Ability – 3: a longitudinal approach. Learn. Individ. Differ. 41, 21–29. doi: 10.1016/j.lindif.2015.06.003
Sánchez-Oliva, D., Morin, A. J., Teixeira, P. J., Carraça, E. V., Palmeira, A. L., and Silva, M. N. (2017). A bifactor exploratory structural equation modeling representation of the structure of the basic psychological needs at work scale. J. Vocat. Behav. 98, 173–187. doi: 10.1016/j.jvb.2016.12.001
Scalise, N., Daubert, E. N., and Ramani, G. B. (2017). Narrowing the early mathematics gap: a play-based intervention to promote low-income preschoolers' number skills. J. Num. Cogn. 3, 559–581. doi: 10.5964/jnc.v3i3.72
Sellbom, M., and Tellegen, A. (2019). Factor analysis in psychological assessment research: common pitfalls and recommendations. Psychol. Assess. 31, 1428–1441. doi: 10.1037/pas0000623
Siegler, R. S., and Ramani, G. B. (2009). Playing linear number board games - but not circular ones - improves low-income preschoolers' numerical understanding. J. Educ. Psychol. 101, 545–560. doi: 10.1037/a0014239
Sowinski, C., LeFevre, J.-A., Skwarchuk, S.-L., Kamawar, D., Bisanz, J., and Smith-Chant, B. (2015). Refining the quantitative pathway of the Pathways to mathematics model. J. Exp. Child Psychol. 131, 73–93. doi: 10.1016/j.jecp.2014.11.004
Starkey, P., Klein, A., and Wakeley, A. (2004). Enhancing young children's mathematical knowledge through a pre-kindergarten mathematics intervention. Early Child. Res. Q. 19, 99–120. doi: 10.1016/j.ecresq.2004.01.002
Storch, S. A., and Whitehurst, G. J. (2002). Oral language and code-related precursors to reading: Evidence from a longitudinal structural model. Dev. Psychol. 38, 934–947. doi: 10.1037/0012-1649.38.6.934
Thomas, A., Hoareau, L., Luxembourger, C., Jarlégan, A., Hubert, B., and Tazouti, Y. (2018). Composition et Structuration des Différentes Dimensions de la Littératie et la Numératie Émergentes. Communication orale présentée au 10e colloque international RIPSYDEVE, Lille.
Toth-Kiraly, I., and Neff, K. D. (2021). Is self-compassion universal? Support for the measurement invariance of the Self-Compassion Scale across populations. Assessment 28, 169–185. doi: 10.1177/1073191120926232
Träff, U. (2007). The contribution of working memory to children's mathematical word problem solving. Appl. Cogn. Psychol 21, 1201–1216. doi: 10.1002/acp.1317
van Bork, R., Epskamp, S., Rhemtulla, M., Borsboom, D., and van der Maas, H. L. J. (2017). What is the p-factor of psychopathology? Some risks of general factor modeling. Theory Psychol. 27, 759–773. doi: 10.1177/0959354317737185
van der Linden, W. (Ed.). (2017). Handbook of Item Response Theory, Vol. 3: Applications, 1st Edn. Boca Raton, FL: Chapman and Hall/CRC.
van der Maas, H. L., Molenaar, D., Maris, G., Kievit, R. A., and Borsboom, D. (2011). Cognitive psychology meets psychometric theory: on the relation between process models for decision making and latent variable models for individual differences. Psychol. Rev. 118, 339–356. doi: 10.1037/a0022749
van der Maas, H. L. J., Dolan, C. V., Grasman, R. P. P. P., Wicherts, J. M., Huizenga, H. M., and Raijmakers, M. E. J. (2006). A dynamical model of general intelligence: the positive manifold of intelligence by mutualism. Psychol. Rev. 113, 842–861. doi: 10.1037/0033-295X.113.4.842
Van Nieuwenhoven, C., Grégoire, J., and Noël, M.-P. (2001). TEDI-MATH, Test Diagnostique des Apprentissages de Base en Mathématiques. Paris: Editions du Centre de Psychologie Appliquée.
Van Rinsveld, A., Brunner, M., Landerl, K., Schiltz, C., and Ugen, S. (2015). The relation between language and arithmetic in bilinguals: insights from different stages of language acquisition. Front. Psychol. 6:265. doi: 10.3389/fpsyg.2015.00265
Watts, T. W., Duncan, G. J., Siegler, R. S., and Davis-Kean, P. E. (2014). What's past is prologue: relations between early mathematics knowledge and high school achievement. Educ. Res. 43, 352–360. doi: 10.3102/0013189X14553660
Williams, L. J. (2012). “Equivalent models: concepts, problems, alternatives,” in Handbook of Structural Equation Modeling, ed R. H. Hoyle (New York, NY: The Guilford Press), 247–260.
Wynn, K. (1992). Children's acquisition of the number words and the counting system. Cogn. Psychol. 24, 220–251. doi: 10.1016/0010-0285(92)90008-P
Zhang, B., Sun, T., Cao, M., and Drasgow, F. (2020). Using bifactor models to examine the predictive validity of hierarchical constructs: pros, cons, and solutions. Organ. Res. Methods. 24(3): 530–571. doi: 10.1177/1094428120915522
Keywords: early numeracy, early number skills, confirmatory factor analysis, exploratory structural equation modeling, kindergarten, bifactor model, ESEM, structure
Citation: Dierendonck C, de Chambrier A-F, Fagnant A, Luxembourger C, Tinnes-Vigne M and Poncelet D (2021) Investigating the Dimensionality of Early Numeracy Using the Bifactor Exploratory Structural Equation Modeling Framework. Front. Psychol. 12:680124. doi: 10.3389/fpsyg.2021.680124
Received: 13 March 2021; Accepted: 14 May 2021;
Published: 22 June 2021.
Edited by:
Jason C. Immekus, University of Louisville, United StatesReviewed by:
Beáta Bőthe, Université de Montréal, CanadaRenzo Bianchi, Université de Neuchâtel, Switzerland
Copyright © 2021 Dierendonck, de Chambrier, Fagnant, Luxembourger, Tinnes-Vigne and Poncelet. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Christophe Dierendonck, Y2hyaXN0b3BoZS5kaWVyZW5kb25jayYjeDAwMDQwO3VuaS5sdQ==
†These authors have contributed equally to this work and share first authorship