SYSTEMATIC REVIEW article

Front. Pediatr., 24 November 2021

Sec. Children and Health

Volume 9 - 2021 | https://doi.org/10.3389/fped.2021.698549

Predictive Validity of Developmental Screening Questionnaires for Identifying Children With Later Cognitive or Educational Difficulties: A Systematic Review

  • 1. Departamento de Pediatría, Clínica Alemana, Santiago, Chile

  • 2. Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago, Chile

  • 3. Departamento de Desarrollo Académico e Investigación, Clínica Alemana, Santiago, Chile

  • 4. Departamento Medicina Interna y Centro de excelencia CIGES, Universidad de La Frontera, Temuco, Chile

Abstract

Context: Parent/caregiver completing developmental screening questionnaires (DSQs) for children before 5 years of age is currently recommended. The DSQs recommended by the American Academy of Pediatrics (AAP) are the Ages and Stages Questionnaires (ASQ), Parents' Evaluation of Developmental Status (PEDS), and the Survey of Well-being of Young Children (SWYC). Nevertheless, their predictive validity has not been well-established.

Objective: To assess in the current literature, the value of AAP-recommended DSQs (ASQ, PEDS, SWYC) administered between 0 and 5 years of age, for predicting long-term cognitive achievement and/or school performance (CA/SP), after 1 year or more of evaluation and at/or after age 5 years, in the general population.

Data Sources: Cochrane, MEDLINE PubMed, CINAHL, EMBASE, Web of Science, Scielo, and Scopus databases (until March 2021).

Study Selection: Two authors selected the studies. Forward and backward citation follow-up was done; authors of DSQ were contacted to identify additional studies.

Data Extraction: Cohorts were identified, and authors of selected studies were contacted to corroborate and complete extracted data.

Results: Thirty-two publications, corresponding to 10 cohorts, were included. All cohorts used ASQ. Only cohort using PEDS was identified but did not meet the inclusion criteria. No cohorts conducted with SWYC were identified. Associations between ASQ and CA/SP were extracted for eight cohorts. The odds ratios were >3, and the area under the curve was 0.66–0.87. A trade-off between sensitivity and specificity was observed.

Limitations: Heterogeneity in population characteristics and in DSQ adaptations.

Conclusions: A positive association between ASQ and later CA/SP was found in different social, cultural, and economic settings. Additional studies are necessary to determine the impact factors in the predictive capacity of DSQs.

Systematic Review Registration: PROSPERO, identifier: CRD42020183883.

Highlights

  • - ASQ is the most widely used developmental screening questionnaire in follow-up cohorts of young children.

  • - Positive association between ASQ and later cognitive and/or school performance was found.

  • - Trade-off between sensitivity and specificity could be explained by different scoring methods.

Introduction

It is estimated that one in six children has a developmental disability, defined by problems in cognitive, behavioral, language, learning, or physical performance, which are often more prevalent in children with biological risk factors such as prematurity (13). Considering that development is a continuum and that the first 5 years of life are recognized as a critical period for subsequent cognitive performance and school success, it is accepted that these disabilities begin in early childhood, under the definition of developmental delay (DD) (4). Early detection of DD allows for timely and effective interventions (5, 6). For this reason, early screening and referral of developmental difficulties are a critical element in the routine health supervision of children to guarantee that children have adequate conditions for optimal learning (7, 8).

Considering that the accuracy of healthcare providers for detecting DD is low when they rely on judgment or surveillance alone (9, 10), the current recommendation is to use standardized, valid, and reliable tools for screening at specific ages (7). The new guidelines from the American Academy of Pediatrics (AAP) focus on parent/caregiver-completed developmental screening questionnaires (DSQs) for children before 5 years of age (11). If screening results suggest delayed development or if parents have concerns, the child should be referred to a comprehensive developmental evaluation, which includes the application of a developmental diagnostic assessment (7). The Bayley Scale of Infant and Toddler Development is currently one of the most used tools with this purpose.

The DSQs recommended in the updated clinical report of the AAP are the Ages and Stages Questionnaires (ASQ), subsequently updated as ASQ-3 (12); Parents' Evaluation of Developmental Status (PEDS) and its complement Developmental Milestones (PEDS:DM) (13, 14); and the Survey of Well-being of Young Children (SWYC) (15). These questionnaires report values of sensitivity and specificity levels of 70–80%, thresholds recommended by the AAP statement in developmental screening tests (7, 11). The use of DSQs has increased in recent years because of their acceptable psychometric properties, versatility, cost-effectiveness, and parent empowerment (1619). These questionnaires have been validated in a range of cultural and linguistic contexts and are widely used around the world in general populations and clinical samples (2023).

In a recent study, Sheldrick et al., compared the three recommended DSQs, reporting adequate specificity and sensitivity for detecting concurrent severe DD (>70%) but low sensitivity to mild delays (24–62%) among children aged 9 months to 5 years, with no one questionnaire emerging superior (24). Despite numerous DSQ studies that analyzed concurrent validity (2528), the predictive validity of these questionnaires has not been well-established, probably due to its complexity (29).

As background information, there are systematic literature reviews that analyze the predictive validity of developmental screening tools and developmental diagnostic assessment. In an extremely premature population, Wong et al., (30) reported a global sensitivity of 55% and a specificity of 84% of developmental assessments for identifying those children who will have cognitive problems later at school age. Luttikhuizen dos Santos et al., (31) reported that the mental coefficient of the Bayley test correlated significantly with subsequent cognitive functioning, r = 0.61.

In the general population, Sim et al., (32) demonstrated robust predictive validity of later disorders of language and socioemotional functioning, particularly when parent-report tools were used. In a recent publication, Cairney et al., (33) analyzed the predictive value of preschool developmental assessment on later educational outcomes in high-income countries, showing a consistent association between relatively poor early child development and later educational difficulties. They report ASQ as having the best correlation despite including only one study using ASQ in their review (33). Although these studies suggest robust predictive ability of the DSQs, none of the published studies analyzed the DSQs as a whole. We are not aware of any other publication to date that systematically reviews studies exploring an association between DSQ and later cognitive or educational performance.

The objective of this review is to assess in the current literature the value of AAP recommended DSQs (ASQ, PEDS, SWYC) administered between 0 and 5 years of age, for predicting long-term cognitive achievement and/or school performance after 1 year or more of evaluation and at/or after age 5 years, in the general population.

Methods

Protocol and Registration

Our systematic review protocol was registered in advance with PROSPERO (International Prospective Register of Systematic Reviews) on July 5, 2020 (registration no. CRD42020183883).

Eligibility Criteria

Included studies were in English and Spanish languages from peer-reviewed articles of cohort studies, which included two or more serial developmental evaluations with at least one DSQ before 5 years of age and at least one evaluation of intelligence or academic performance during school age (at 5 years of age or later and with at least 1 year between evaluations). In the first selection, we included three types of studies: those with an early developmental assessment, based on DSQ administered before 5 years of age; studies that conducted a developmental assessment at school age with intelligence or academic performance assessments in cohorts previously assessed with any DSQ; and finally, those that described the association between DSQ and school age assessment.

We included cohorts assessed with DSQ (ASQ, PEDS, and SWYC) applied in general populations, in any condition (whether completed by parents, education professionals, with or without assistance in completing it). We accepted those cases in which adjustments of the original test have been made to local conditions (including language translations, sociocultural adaptations, and/or validation process).

We excluded studies in which the developmental screening was performed after 5 years of age; studies that included concurrent evaluations or with <1-year difference between the screening test and the learning/intelligence evaluation; studies focused on children with known conditions or disease that severely affects development and cognition, such as genetic and/or metabolic diseases. We excluded prevalence and case–control studies, because of potentially overestimating the properties of the test, and case series (34).

Data Sources

A systematic search was carried out in Cochrane, MEDLINE PubMed, CINAHL, EMBASE, Web of Science, Scielo, and Scopus databases (until March 13, 2021) to identify the literature published. For the systematic search, we used the following terms: “infant,” “child, preschool” for population identification. The index tests were identified using the terms: “surveys and questionnaires,” “developmental screening,” “Ages and Stages,” “Parents Evaluation of Developmental Status,” “Survey of Well-being of Young Children,” “parents' evaluation.” Finally, the terms used to identify the reference test were “intelligence test”, “developmental disabilities,” “intellectual disability,” “intelligence,” “academic performance,” “intellectual quotient.”

To complete the search, the authors of the DSQ were contacted to identify additional studies that met the inclusion criteria.

Study Selection

A multiple-stage process was used to identify the studies and the cohorts behind them. First, two authors screened the titles and abstracts of studies retrieved from the electronic search for possible inclusion based on the predefined inclusion criteria. Second, forward and backward citation follow-up for each of the previously identified studies was done using Google Scholar–related references. The full text of all relevant studies identified was evaluated to select studies for final inclusion.

To identify and match the cohorts in the different publications reported separately, authors, site, and characteristics of the studied populations were considered. Although each cohort could have several published studies, only those that contributed data for either early developmental assessment with DSQ and/or academic or cognitive tests were included in the review.

Data Extraction

All information included was either published or extracted from published cohorts with the help of the authors. A data extraction form was completed for each cohort. The authors of the different cohorts were contacted to verify the cohorts and to corroborate the information extracted and to request additional information necessary to complete the data: author, study design, site, population, sampling method, sample size, age at DSQ, and cognitive/academic assessment and scoring method.

When children had more than one evaluation, each DSQ assessment was considered as a separate point for the analysis. When there was more than one simultaneous assessment of academic or cognitive performance, the cognitive assessment was considered as the most objective.

Evaluation of Risk of Bias

Two reviewers independently evaluated the risk of bias in each study using the Quality of Diagnostic Accuracy Studies version 2 (QUADAS-2) checklist. Each study was given a grade of “low,” “high,” or “unclear” for risk of bias and concerns regarding their applicability (35). Any disagreement between reviewers was resolved by consensus.

Data Synthesis

A qualitative analysis of the results was performed and summarized. The population characteristics, type of reference standard, index test, and reported comparison measures were summarized for each cohort [area under the curve (AUC), sensitivity, specificity, positive and negative predictive values, odds ratio (OR), correlation coefficients]. When necessary, the findings from the comparison measures were recalculated based on the more exact information provided by the corresponding authors. Based on sensitivity, specificity, and predictive values, 2 × 2 tables were constructed, and the summary receiver operating characteristic and forest plot were calculated using RevMan 5.0. Because of significant heterogeneity, no summary measures were calculated.

Results

The literature search yielded 2,277 citations after excluding duplicates. A total of 396 studies were selected for full text review, selecting 74 studies. Of these, 32 publications met the inclusion criteria, corresponding to 10 cohort studies (Figure 1). The cohorts studied came from South America: Chile (19, 27, 36, 37) and Colombia (26, 38, 39); Europe: Catalonia, Spain (4043), the Netherlands (28, 4447); France Poitiers and Nancy (48) and France Loire (4951) and Norway (5254); Asia: South Korea (5557) and North India (5862); Oceania: Australia (63). Authors from seven of the 10 included cohorts reviewed and completed the data extraction form.

Figure 1

All the 10 cohorts included ASQ assessments. Only one prospective study using PEDS was identified but did not meet the inclusion criteria for this review because of the age of the children at first assessment (64). No cohorts conducted with SWYC were identified.

Two of the cohorts used abbreviated forms of ASQ, including only some domains (Norway and Australia) and one, an extended form of the test (Colombia). Except for the cohorts from Spain, France, and South Korea that used the official translated ASQ versions, the rest used locally translated and adapted versions. This information could not be obtained for the Australian cohort. All relevant characteristics are presented (Table 1).

Table 1

City/CountrySample characteristicsInitial evaluation N and gestational age distributionIndex test: Developmental assessment tool/Delay thresholdDSQ assessment ageN included/N evaluations in follow-upReference standard for cognitive/academic assessment/Delay thresholdAge at evaluation
Australia (63)Different SESTotal: 50
AT : 35
MLPT: 10
EP: 5
Abbreviated ASQ-2. Gross motor and fine motor domains/Threshold for delay not defined, continuous scores for Gross Motor and Fine Motor domains were considered4–48 months (11 ASQ forms)33/301WISC-IV/Threshold for delay not defined, continuous scores were considered6–12 years
Catalonia, Spain (4043)Middle-high SESTotal: 179
AT: 89
LP: 90 LP
ASQ-3 Spanish edition/Score >2 SDs below the mean in any of ASQ domains48 months133Standardized school test of the Education Department of Catalonia. Children scored low on at least in one of the competences measured by the test: communicative-linguistic and mathematics, determined by defined norms8–9 years
Chile (19, 27, 36, 37)Middle-high SESTotal: 306
AT: 119
MLPT: 124
EP: 63
ASQ-3 Chilean validation/Score > 2 SDs below the mean in any of ASQ domains8–18–30 months232/283WISC-III/A score of <85 points (equivalent to < -1.5 SD) in verbal and/or performance scales6–9 years
Colombia (26, 38, 39)Poorest SESTotal: 770
AT: 653
PT: 117
Extended ASQ (EASQ)/Threshold for delay not defined, continuous scores were considered6–42 months (16 ASQ forms)470WISC-V and school achievement.**/Threshold for delay not defined, continuous scores were considered6–8 years
France, Loire
(4951)
46.5% upper SESPT GA <35: 3197
GA Median 32
(IQR 30-33)
ASQ- 2 French edition/Overall ASQ scores. ROC curves were drawn to establish the optimal cutoff values18, 24, and 36 months1,775/4,626GSA/Children belonging to the first decile of the GSA score (<38) were considered to have severe school difficulties.5 years
France, Poitiers and Nancy (48)16.9% of the families have financial difficultiesTotal: 1,225
AT: 1,156
PT: 69
GA Median 40
(IQR 39-40)
ASQ- 2 French edition/Overall ASQ scores. ROC curves were drawn to establish the optimal cutoff values36 months939WPSSI-III/Score of <85 in verbal, performance or full-scale5–6 years
Netherlands
(28, 4447)
Population sampleTotal: 1,983
AT: 544
MLPT: 927
EP: 512
ASQ−2 translated and adapted to Dutch/Score > 2 SDs below the mean in any of ASQ domains48 months1,286 (5 years) 378 (7 years)Special education, medical childcare centers or having special educational needs in mainstream education (enrollment in special education or having special educational needs in mainstream education as criteria for developmental disability.) WISC-III NL, parental report of executive functioning, attention (TEACH-NL) and memory (AVLT), and visuomotor integration (Beery) tested/The 10th percentile, defined as a z score below 1.28, was the cutoff.5 years 7 years
North India (5862)Low and middle SES422 GA not reportedASQ-3 “home procedure” translated and culturally adapted to Hindi/ Score below the 25th percentile any of ASQ domains12–36 months (11 ASQ forms)350WISC-IVINDIA (index scores from three out of four subtests)/ index scores from three out of four subtests, defining general ability z-score6–9 years
Norway (5254)Population sample114,500 GA not reportedAbbreviated ASQ validated for Norwegian population: gross motor, fine motor and communication scales. Communication scale was modified as an extended form and/threshold for delay not defined, continuous scores were considered18, 36, and 60 months8,371Subscale on writing within the communication domain in the vineland adaptive behavior scale-II/threshold for delay not defined, continuous scores were considered8 years
South Korea (5557)Population sampleTotal: 1,475
AT: 1,395
PT: 80
GA Mean 38.7 ± 1.7
Korean ASQ/Not defined6–12–24–36–48 months697WPPSI-R/ A score of 89–80 was classified as low average, 79–70 as borderline, and 69 and below as indicating intellectual deficiency5 years

Qualitative summary of included cohorts.

SES, socioeconomic status; GA, Gestational age; AT, Born at term (38–42 weeks GA); PT, Preterm (<37 weeks GA); LP, Late preterm (34–36 weeks GA); MLPT, Moderately and late preterm (32–36 weeks GA); EP, Early Preterm (<32 weeks GA); ASQ, Ages and Stages Questionnaire; WPPSI, Wechsler Preschool and Primary Scale of Intelligence; WISC, The Wechsler Intelligence Scale for Children; GSA, The teacher-completed Global School Adaptation; IQR, Interquartile range.

**

School achievement was assessed using the arithmetic (calculations) and reading comprehension subtests in the Woodcock-Muñoz Test of Achievement (WM-III), the Spanish version of the Woodcock-Johnson; and a subset of 75 words from the Test de Vocabulario en Imágenes de Peabody (TVIP), the Spanish version of the Peabody Picture Vocabulary Test-Revised.

Comparison measures between ASQ and cognitive/academic performance assessments in school age were extracted for eight of the cohorts (Table 2). In the five cohorts that report results based on the entire ASQ, a positive association was shown. Using the extended ASQ, the Colombian cohort reported a low global correlation at 6–8 years of age, with higher correlations for the Problem Solving and Communication domains, whereas in the Chilean cohort, all domains independently were significant predictors of long-term cognitive difficulties, except for personal–social. In studies that analyzed abbreviated forms of ASQ, positive associations were found for communication trajectories in Norway, and for the gross motor trajectories but not for fine motor trajectories in Australia, no other domains were analyzed.

Table 2

City/CountryAUCOR (univariate)Correlation/other association measures
Australia (63)NRNRRegression analysis: gross motor trajectory set of predictors added 19.5% of the variance for IQ. Fine Motor not significant
Catalonia, Spain (4043)Total sample: 0.736.5 [IC 95%, 1.9–22.2]NR
Chile (19, 27, 36, 37)Total sample: 0.8*
8 months: 0.77
18 months: 0.75
30 months: 0.87
6.38* [IC95% 2.1–19.3]NR
Colombia (26, 38, 39)NRNRPearson correlations on internally standardized scores Overall score for total sample r = 0.1 6–18 months: problem solving r = 0.19, other domains not significant 19–30 months: not significant 31–42 months: problem solving and communication r = 0.31; gross Motor: r = 0.25
France, Loire (4951)18 months: 0.66 24 months: 0.72 36 months: 0.773.8* [3.0–4.8]NR
France, Poitiers, and Nancy (48)0.786.7 [IC95% 3.8–12.0]NR
Netherlands (28, 4447)NR12.9* [IC95% 6.7–25.2]NR
Norway (5254)NRCommunication domain: 3.2–9.8 depending on developmental trajectoriesNR

Comparison measures and main results of included cohorts.

AUC, Area under the curve; OR, Odds Ratio; IQ, Intellectual Quotient; NR, Not reported and not possibility of calculation with the available data.

*

Values calculated in base to the information sent by the authors.

The extracted or calculated AUC ranged between 0.66 and 0.87, and the ORs were all >3 (Table 2). In five cohorts, a 2 × 2 table was constructed, allowing the calculation of sensitivity and specificity, showing a trade-off between them (Figures 2, 3).

Figure 2

Figure 3

Risk-Of-Bias Assessment

The assessment of each of the cohort studies for each dimension of the QUADAS is detailed in Table 3. The risk of bias in patient selection for most cohorts was low. However, external validity was limited, because special inclusion criteria based on gestational age or socioeconomic status were used in some cohorts. In relation to index test interpretation and applicability, we found some issues of concern due to differences in the scoring method and adaptations of the test. Another source of bias was due to significant dropout rate or follow-up of specific subgroups.

Table 3

Risk of biasConcerns about applicability
Country/cityPatient selection*Index test**Reference standard***Flow and timing****Patient selection*Index test**Reference standard***
AustraliaLowNot clearLowLowNot clearHighLow
Catalonia, SpainLowLowLowLowHighLowLow
ChileHighLowLowLowHighLowLow
ColombiaLowNot clearLowHighHighHighLow
France, LoireLowHighLowHighHighLowLow
Francia, Poitiers, and NancyLowHighLowLowLowLowLow
NetherlandsLowLowLowLowLowLowLow
North IndiaLowLowLowLowHighLowHigh
NorwayLowNot clearLowNot clearLowHighHigh
South KoreaLowNot clearLowNot clearLowLowLow

Risk of bias and concerns about applicability.

France PN, France Poitiers and Nancy; France L, France Loire; ASQ, Ages and Stages Questionnaires; EASQ, Extended ASQ; SES, Socioeconomic Status.

*

Patient Selection, High risk of bias: Non-consecutive or non-random sampling methods. In Chile the sample was an opportunity sample. In Australia: newspaper and radio announcements, and snowballing techniques considered as low risk of bias. High concerns regarding applicability: Special inclusion criterion based on gestational age or socioeconomic status characteristics of the population. Only children from medium and high SES in Chile and Catalonia. Only children from medium-low SES in Colombia and India. Only preterm in Francia L.

**

Index Test, High risk of bias: Scoring methods and thresholds were defined after reviewing the reference standard. In France P&N and France L the cutoff points for ASQ were defined based on the ROC curve of the reference standard. High concerns regarding applicability: Special adaptations that can‘t extrapolated to other population groups. Colombia adapted ASQ- EASQ. In Norway and Australia include only some scales of ASQ.

***

Reference Standard, High risk of bias: Inappropriate test used for population under study or if assessors were not blinded to results of early developmental test. Not reported in any cohort. High concerns regarding applicability: Nonuniversal tests. In Netherlands a series of parameters including Physical conditions and school support requirements. In Norway only one domain was used. ****Flow and Timing, High risk of bias: Participants received different assessments, if all children were not included in follow-up or if dropout rate were >35%. In France L the dropout rate was 55%. In Colombia true positives were excluded.

Discussion

Our search identified 10 cohorts including children from early age who were all assessed with ASQ and followed to school age. Eight of these cohorts describe comparison measures showing adequate capacity to predict later cognitive achievement/school performance. The ORs reported were >3, and the AUC was high (0.66–0.87), showing trade-offs in sensitivity and specificity, which could be explained by the different scoring methods and thresholds used (28, 36, 42, 48, 50); the optimal cutoff point, for a screening test, is the one that yields sensitivity and specificity values >70% (7).

This review is in line with the results reported in both the reviews by Cairney et al., (33) and Sim et al., (32) that showed a consistent association between different developmental screening assessment tools and later educational performance. They reported better predictive capacity especially when using a parent-reported assessment than direct child assessment.

Our review expands these results by including studies using adapted/translated versions of ASQ, which increases the evidence supporting its widespread applicability. Some groups have adapted the form of application of the test, such as the “Home Procedure” model in India, abbreviated form in Norway and Australia, and extended ASQ in Colombia (26, 53, 58, 63). All these modifications could potentially impact the psychometric properties of the test, as shown by Velikonja et al., (25) in the analysis of ASQ concurrent validity studies. The heterogeneity regarding age at evaluation could also impact the results. In only two cohorts, a trend to improved predictive properties of the tests with assessment age was observed (37, 50). The heterogeneity among the studies did not allow conclusions in the domain analysis, as only some of the cohorts were included analysis by domain, and two cohorts used abbreviated forms of the ASQ, including only specific domains of the test (36, 52, 63).

The cohorts emerge from different socioeconomic, clinical, and cultural backgrounds. Some cohorts were population-based, whereas others corresponded to samples with specific socioeconomic or biological characteristics, which could compromise external validity of this data. It has been shown that the prevalence of DD increases with biological and psychosocial adversity (22, 65). In extremely premature infants, the predictive validity of developmental diagnostic tests has been well-established (30, 31). These variables can modify the developmental trajectories of children and, consequently, the predictive capacity of the questionnaires (44, 66).

Another factor that could alter developmental trajectories is the interventions carried out in children, data not reported by any of the cohorts. Only in the study from Catalonia was there evidence of a lack of association between the evaluation carried out at the age of 4 years and referral to support programs in development (42). It is described that in real world, referral rates for early intervention among children with positive screens ranged from 10–86% (67, 68).

Strengths and Limitations

The limitations of this review include great heterogeneity in population characteristics and in the way DSQ was used, such as thresholds considered and special adaptation of the questionnaires. Therefore, any summary result resulting from meta-analysis would be uninterpretable and will not allow any subgroup analysis. In addition, the variability of both initial and outcome assessments makes the mathematical synthesis of results difficult. In addition, several current ongoing cohorts are being studied and will need to be included in the future. There are currently no published studies of cohorts using SWYC and PEDS:DM as they are relatively new. Only one prospective study using PEDS was identified but did not meet the inclusion criteria for this review (64). Other studies analyzed the predictive validity of some DSQ for adaptive skills and behavior or social–emotional problems at school age. Although this is outside the purpose of this review, they contribute to understanding the scope of developmental screening in early stages of life (69, 70).

One of the key strengths of this review is the systematic and comprehensive literature search that is highly sensitive in capturing all available data relevant to the research question in different social, cultural, and economic settings. The presented analysis was based in cohorts and not individual studies with potentially overlapping populations with the additional advantage of having contacted a significant number of authors to corroborate and better extract data.

Conclusions

ASQ is the most widely used DSQ in follow-up cohorts. Associations between early ASQ assessment and later cognitive achievement/school performance have been established, suggesting it is a promising tool in early child assessment in different social, cultural, and economic settings. Additional studies are needed to determine the impact of different settings, prematurity, developmental interventions, age at assessment, and test adaptations in the predictive capacity of DSQ.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

LS: conceptualized and designed the study, designed the data collection instruments and search strategy, collected and reviewed the data, carried out the initial analyses, drafted the initial manuscript, and reviewed and revised the manuscript. AM: designed the study, reviewed the data collection instruments, collected and reviewed the data, reviewed, and revised the manuscript. OC: conceptualized, designed and performed the search strategy and reviewed and revised the manuscript. PS: reviewed and revised the data collection instruments and the final manuscript. All authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.

Acknowledgments

Our sincere acknowledgment to Xavier Demestre, Silvia Martinez-Nadal, and Paula Rojas, for reading the manuscript and giving their feedback. LS is a Ph.D. candidate in Methodology of Biomedical Research and Public Health program, Universitat Autònoma de Barcelona (UAB), Barcelona, Spain.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

    Abbreviations

  • DSQ

    developmental screening questionnaires

  • ASQ

    Ages and Stages Questionnaires

  • DD

    developmental delay

  • AAP

    American Academy of Pediatrics

  • PDS

    Parents&#8217

  • Evaluation of Developmental Status
  • PEDS

    Parents&#8217

  • Evaluation of Developmental Status
  • DM

    Developmental Milestones

  • SWYC

    Survey of Well-being of Young Children

  • QUADAS-2

    Quality of Diagnostic Accuracy Studies

  • AUC

    area under the curve

  • OR

    Odds Ratio.

References

  • 1.

    BoyleCABouletSSchieveLACohenRABlumbergSJYeargin-AllsoppMet al. Trends in the prevalence of developmental disabilities in US children, 1997-2008. Pediatrics. (2011) 127:103442. 10.1542/peds.2010-2989

  • 2.

    RosenbergSAZhangDRobinsonCC. Prevalence of developmental delays and participation in early intervention services for young children. Pediatrics. (2008) 121:e15039. 10.1542/peds.2007-1680

  • 3.

    AlloteyJZamoraJCheong-SeeFKalidindiMArroyo-ManzanoDAsztalosEet al. Cognitive, motor, behavioural and academic performances of children born preterm: a meta-analysis and systematic review involving 64 061 children. BJOG. (2018) 125:1625. 10.1111/1471-0528.14832

  • 4.

    ShevellM. Global developmental delay and mental retardation or intellectual disability: conceptualization, evaluation, and etiology. Pediatr Clin North Am. (2008) 55:107184. 10.1016/j.pcl.2008.07.010

  • 5.

    NordhovSMRønningJADahlLBUlvundSETunbyJKaaresenPI. Early intervention improves cognitive outcomes for preterm infants: randomized controlled trial. Pediatrics. (2010) 126:e108894. 10.1542/peds.2010-0778

  • 6.

    RaoNSunJChenEEIpP. Effectiveness of early childhood interventions in promoting cognitive development in developing countries: a systematic review and meta-analysis. HK J Paediatr. (2017) 22:1425. Available online at: http://www.hkjpaed.org/details.asp?id=1103&show=1234

  • 7.

    Council on Children With Disabilities Section on Developmental Behavioral Pediatrics Bright Futures Steering Committee Medical Home Initiatives for Children With Special Needs Project Advisory Committee. Identifying infants and young children with developmental disorders in the medical home: an algorithm for developmental surveillance and screening. Pediatrics. (2006) 118:40520. 10.1542/peds.2006-1231

  • 8.

    WilliamsPGOkamotoJLieserDDelConte BDonoghueEEarlsMet al. The pediatrician's role in optimizing school readiness. Pediatrics. (2016) 138:e20162293. 10.1542/peds.2016-2293

  • 9.

    SheldrickRCMerchantSPerrinEC. Identification of developmental-behavioral problems in primary care: a systematic review. Pediatrics. (2011) 128:35663. 10.1542/peds.2010-3261

  • 10.

    ThomasRESpraginsWMazloumGCronkhiteMMaruG. Rates of detection of developmental problems at the 18-month well-baby visit by family physicians' using four evidence-based screening tools compared to usual care: a randomized controlled trial. Child Care Health Dev. (2016) 42:38293. 10.1111/cch.12333

  • 11.

    LipkinPHMaciasMM. Promoting optimal development: identifying infants and young children with developmental disorders through developmental surveillance and screening. Pediatrics. (2020) 145:e20193449. 10.1542/peds.2019-3449

  • 12.

    SquiresJBrickerD. Ages and Stages Questionnaires User's Guide. 3rd ed. Baltimore, USA: Paul H Brookes Publishing Co (2009).

  • 13.

    GlascoeFP. The value of “parents” evaluations of developmental status' in detecting and addressing children's developmental and behavioral problems. Diagnostique. (1998) 23:185203. 10.1177/073724779802300401

  • 14.

    BrothersKBPageGlascoe FRobertshawNS. PEDS: developmental milestones - an accurate brief tool for surveillance and screening. Clin Pediatr. (2008) 47:2719. 10.1177/0009922807309419

  • 15.

    SheldrickRCPerrinEC. Evidence-based milestones for surveillance of cognitive, language, and motor development. Acad Pediatr. (2013) 13:57786. 10.1016/j.acap.2013.07.001

  • 16.

    LipkinPHMaciasMMChenBCouryDGottschlichEA. Trends in pediatricians ' developmental screening : 2002–2016. Pediatrics. (2020) 145:e20190851. 10.1542/peds.2019-0851

  • 17.

    SchonwaldAHoranKHuntingtonN. Developmental screening: is there enough time?Clin Pediatr. (2009) 48:64855. 10.1177/0009922809334350

  • 18.

    SchonwaldAHuntingtonNChanERiskoWBridgemohanC. Routine developmental screening implemented in urban primary care settings: more evidence of feasibility and effectiveness. Pediatrics. (2009) 123:6608. 10.1542/peds.2007-2798

  • 19.

    ArmijoISchonhautLCorderoM. Validation of the chilean version of the ages and stages questionnaire (ASQ-CL) in community health settings. Early Hum Dev. (2015) 91:6716. 10.1016/j.earlhumdev.2015.10.001

  • 20.

    SmallJWHix-SmallHVargas-BaronEMarksKP. Comparative use of the ages and stages questionnaires in low- and middle-income countries. Dev Med Child Neurol. (2019) 61:43143. 10.1111/dmcn.13938

  • 21.

    MarksKPMadsenSjö NWilsonP. Comparative use of the ages and stages questionnaires in the USA and Scandinavia: a systematic review. Dev Med Child Neurol. (2019) 61:41930. 10.1111/dmcn.14044

  • 22.

    WoolfendenSEapenVWilliamsKHayenASpencerNKempLet al. Systematic review of the prevalence of parental concerns measured by the parents' evaluation of developmental status (PEDS) indicating developmental risk. BMC Pediatr. (2014) 14:113. 10.1186/1471-2431-14-231

  • 23.

    RousseauMDionneCSavardRTSchonhautLLondonoM. Translation and cultural adaptation of the ages and stages questionnaires (ASQ) worldwide. J Dev Behav Pediatr. (2021) 42:490501. 10.1097/DBP.0000000000000940

  • 24.

    SheldrickRCMarakovitzSGarfinkelDCarterASPerrinEC. Comparative accuracy of developmental screening questionnaires. JAMA Pediatr. (2020) 174:36674. 10.1001/jamapediatrics.2019.6000

  • 25.

    VelikonjaTCalderonASleedMDeightonJ. The psychometric properties of the ages & stages questionnaires for ages 2-2. 5: a systematic review. Child Care Health Dev. (2017) 43:117. 10.1111/cch.12397

  • 26.

    Rubio-CodinaMAraujoMCAttanasioOMuP. Concurrent validity and feasibility of short tests currently used to measure early childhood development in large scale studies. PLoS ONE. (2016) 11:e0160962. 10.1371/journal.pone.0160962

  • 27.

    SchonhautLArmijoISchönstedtMAlvarezJCorderoM. Validity of the ages and stages questionnaires in term and preterm infants. Pediatrics. (2013) 131:e146874. 10.1542/peds.2012-3313

  • 28.

    KerstjensJMBosAF. ten Vergert EMJ, de Meer G, Butcher PR, Reijneveld SA. Support for the global feasibility of the ages and stages questionnaire as developmental screener. Early Hum Dev. (2009) 85:4437. 10.1016/j.earlhumdev.2009.03.001

  • 29.

    MarksKGlascoeFPAylwardGPShevellMILipkinPHSquiresJK. The thorny nature of predictive validity studies on screening tests for developmental-behavioral problems. Pediatrics. (2008) 122:8668. 10.1542/peds.2007-3142

  • 30.

    WongHSSanthakumaranSCowanFMModiN. Developmental assessments in preterm children: a meta-analysis. Pediatrics. (2016) 138:e20160251. 10.1542/peds.2016-0251

  • 31.

    Luttikhuizendos Santos ESdeKieviet JFKönigsMvanElburg RMOosterlaanJ. Predictive value of the bayley scales of infant development on development of very preterm/very low birth weight children: a meta-analysis. Early Hum Dev. (2013) 89:48796. 10.1016/j.earlhumdev.2013.03.008

  • 32.

    SimFThompsonLMarryatLRamparsadNWilsonP. Predictive validity of preschool screening tools for language and behavioural difficulties: a PRISMA systematic review. PLoS ONE. (2019) 14:e0211409. 10.1371/journal.pone.0211409

  • 33.

    CairneyDGKazmiADelahuntyLM. The predictive value of universal preschool developmental assessment in identifying children with later educational difficulties : a systematic review. PLoS ONE. (2021) 16:e0247299. 10.1371/journal.pone.0247299

  • 34.

    BossuytPMLeeflangMM. Chapter 6: developing criteria for including studies. In: Bossuyt P, editor. Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Version 0.4 [updated September 2008]. The Cochrane Collaboration (2008). Available online at: https://methods.cochrane.org/sdt/handbook-dta-reviews

  • 35.

    WhitingPFRutjesAWSWestwoodMEMallettSDeeksJJReitsmaJBet al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. (2011) 155:52938. 10.7326/0003-4819-155-8-201110180-00009

  • 36.

    SchonhautLPérezMArmijoIMaturanaA. Comparison between ages & stages questionanaire and bayley scales, to predict cognitive delay in school age. Early Hum Dev. (2020) 141:104933. 10.1016/j.earlhumdev.2019.104933

  • 37.

    SchonhautBLPérezRMCastillaFAMCastroMSSalinasAPArmijoRI. Predictive value of ages & stages questionnaires for cognitive performance during early child education. Rev Chil Pediatr. (2017) 88:3540. 10.1016/j.rchipe.2016.08.008

  • 38.

    Rubio-CodinaMGrantham-McGregorS. Predictive validity in middle childhood of short tests of early childhood development used in large scale studies compared to the Bayley-III, the family care indicators, height-for-age, and stunting: a longitudinal study in Bogota, Colombia. PLoS ONE. (2020) 15:e0231317. 10.1371/journal.pone.0231317

  • 39.

    Rubio-CodinaMAraujoMCAttanasioOGrantham-McGregorS. Validez concurrente y viabilidad de pruebas cortas comúnmente usadas para medir el desarrollo infantil temprano en estudios a gran escala: Metodología y resultados. Banco Interamericano del Desarrollo Division de protección social y salud (2016). Available online at: https://publications.iadb.org/bitstream/handle/11319/7823/Validez-concurrente-y-viabilidad-de-pruebas-cortas-comunmente-usadas-para-medir-el-desarrollo-infantil-temprano-en-estudios-a-gran-escala-metodologia-y-resultados.pdf (accessed April 18, 2021).

  • 40.

    DemestreXSchonhautLMorillasJMartínez-NadalSVilaCRaspallFet al. Development deficit risks in the late premature newborn: evaluation at 48 months using the ages stages questionnaires®. An Pediatr. (2016) 84:3945. 10.1016/j.anpede.2015.09.023

  • 41.

    SchonhautLMartinez-NadalSArmijoIDemestreX. Reliability and agreement of ages and stages questionnaires®: results in late preterm and term-born infants at 24 and 48 months. Early Hum Dev. (2019) 128:5561. 10.1016/j.earlhumdev.2018.11.008

  • 42.

    Martínez-NadalSSchonhautLArmijoIDemestreX. Predictive value of the ages and stages questionnaire® for school performance and school intervention in late preterm and term-born children. Child Care Health Dev. (2021) 47:103111. 10.1111/cch.12814

  • 43.

    Martínez-NadalSDemestreXSchonhautLMuñozSRSalaP. Impact of neonatal morbidity on the risk of developmental delay in late preterm infants. Early Hum Dev. (2018) 116:406. 10.1016/j.earlhumdev.2017.11.001

  • 44.

    HornmanJdeWinter AFKerstjensJMBosAF. Stability of developmental problems after school entry. J Pediatr. (2017) 187:739. 10.1016/j.jpeds.2017.05.022

  • 45.

    KerstjensJMdeWinter AFBocca-TjeertesIFtenVergert EMJReijneveldSABosAF. Developmental delay in moderately preterm-born children at school entry. J Pediatr. (2011) 159:928. 10.1016/j.jpeds.2010.12.041

  • 46.

    CserjesiRvanBraeckel KNJAButcherPRKerstjensJMReijneveldSABoumaAet al. Functioning of 7-year-old children born at 32 to 35 weeks' gestational age. Pediatrics. (2012) 130:e83846. 10.1542/peds.2011-2079

  • 47.

    DotingaBMEshuisMSBocca-TjeertesIFKerstjensJMvanBraeckel KNJAReijneveldSAet al. Longitudinal growth and neuropsychological functioning at age 7 in moderate and late preterms. Pediatrics. (2016) 138:e20153638. 10.1542/peds.2015-3638

  • 48.

    CharkalukMRousseauJCalderonJBernardJY. Ages and stages questionnaire at 3 years for predicting IQ at 5–6 years. Pediatrics. (2017) 139:e20162798. 10.1542/peds.2016-2798

  • 49.

    FlamantCBrangerBTichSNTdela Rochebrochard ESavagnerCBerlieIet al. Parent-completed developmental screening in premature children: a valid tool for follow-up programs. PLoS ONE. (2011) 6:e20004. 10.1371/journal.pone.0020004

  • 50.

    HalbwachsMMullerJBTichSNTGascoinGChauty-FrondasABrangerBet al. Predictive value of the parent-completed ASQ for school difficulties in preterm-born children <35 weeks' GA at five years of age. Neonatology. (2014) 106:3116. 10.1159/000363216

  • 51.

    Halbwachs M Muller JB The Tich SN de La Rochebrochard E Gascoin G Branger B . Usefulness of parent-completed ASQ for neurodevelopmental screening of preterm children at five years of age. PLoS ONE. (2013) 8:e71925. 10.1371/journal.pone.0071925

  • 52.

    FufenJSynnveSMariVWPatriciaERagnhildBNEspenRet al. Predicting literacy skills at 8 years from preschool language trajectories: a population-based cohort study. J Speech Lang Hear Res. (2020) 63:275262. 10.1044/2020_JSLHR-19-00286

  • 53.

    WoodMEFrazierJANordengHMELapaneKL. Longitudinal changes in neurodevelopmental outcomes between 18 and 36 months in children with prenatal triptan exposure: findings from the norwegian mother and child cohort study. BMJ Open. (2016) 6:e011971. 10.1136/bmjopen-2016-011971

  • 54.

    SchjølbergSPsycholCEadiePZachrissonHDØyenA-SPriorM. Predicting language development at age 18 months: data from the norwegian mother and child cohort study. J Dev Behav Pediatr. (2011) 32:37583. 10.1097/DBP.0b013e31821bd1dd

  • 55.

    BhangSYHaEParkHHaMHongYCKimBNet al. Maternal stress and depressive symptoms and infant development at six months: the mothers and children's environmental health (MOCEH) prospective study. Korean Med Sci. (2016) 31:84351. 10.3346/jkms.2016.31.6.843

  • 56.

    ShahSJeongKSParkHHongYCKimYKimBet al. Environmental pollutants affecting children's growth and development: collective results from the MOCEH study, a multi-centric prospective birth cohort in Korea. Environ Int. (2020) 137:105547. 10.1016/j.envint.2020.105547

  • 57.

    JeongKSParkHHaEShinJHongYCHaMet al. High maternal blood mercury level is associated with low verbal IQ in children. J Korean Med Sci. (2017) 32:1097104. 10.3346/jkms.2017.32.7.1097

  • 58.

    KvestadITanejaSKumarTBhandariNStrandTAHysingM. The assessment of developmental status using the ages and Stages questionnaire-3 in nutritional research in north Indian young children. Nutr J. (2013) 12:111. 10.1186/1475-2891-12-50

  • 59.

    KvestadIHysingMShresthaMUlakMThorne-LymanALHenjumSet al. Vitamin B-12 status in infancy is positively associated with development and cognitive functioning 5 y later in Nepalese children. Am J Clin Nutr. (2017) 105:112231. 10.3945/ajcn.116.144931

  • 60.

    KvestadITanejaSUpadhyayRPHysingMBhandariNStrandTA. Vitamin B12, folate, and cognition in 6- To 9-year-olds: a randomized controlled trial. Pediatrics. (2020) 145:e20192316. 10.1542/peds.2019-2316

  • 61.

    ChowdhuryRTanejaSKvestadIHysingMBhandariNStrandTA. Vitamin D status in early childhood is not associated with cognitive development and linear growth at 6 – 9 years of age in North Indian children : a cohort study. Nutr J. (2020) 19:19. 10.1186/s12937-020-00530-2

  • 62.

    ChowdhuryRTanejaSBhandariNKvestadIStrandTABhanMK. Vitamin-D status and neurodevelopment and growth in young north Indian children: a secondary data analysis. Nutr J. (2017) 16:18. 10.1186/s12937-017-0285-y

  • 63.

    PiekJPDawsonLSmithLMGassonN. The role of early fine and gross motor development on later motor and cognitive ability. Hum Mov Sci. (2008) 27:66881. 10.1016/j.humov.2007.11.002

  • 64.

    WakeMGernerBGallagherS. Does parents' evaluation of developmental status at school entry predict language, achievement, and quality of life 2 years later?Ambul Pediatr. (2005) 5:1439. 10.1367/A04-162R.1

  • 65.

    PotijkMRKerstjensJMBosAFReijneveldSdeWinter AF. Developmental delay in moderately preterm-born children with low socioeconomic status: risks multiply. J Pediatr. (2013) 163:128995. 10.1016/j.jpeds.2013.07.001

  • 66.

    AylwardGP. Developmental screening and assessment: what are we thinking?J Dev Behav Pediatr. (2009) 30:16973. 10.1097/DBP.0b013e31819f1c3e

  • 67.

    SheldrickRCBreuerDJHassanRChanKPolkDEBenneyanJet al. System dynamics model of clinical decision thresholds for the detection of developmental-behavioral disorders. Implement Sci. (2016) 11:156. 10.1186/s13012-016-0517-0

  • 68.

    WallisKERiveraLBDGuthrieWBennettAEMandellDSMillerJS. Provider responses to positive developmental screening: disparities in referral practices?J Dev Behav Pediatr. (2021) 42:2331. 10.1097/DBP.0000000000000855

  • 69.

    SananesRGoldbergCSNewburgerJWHuCTrachtenbergFGaynorJWet al. Six-year neurodevelopmental outcomes for children with single-ventricle physiology. Pediatrics. (2021) 147:e2020014589. 10.1542/peds.2020-014589

  • 70.

    PiekJPBarrettNCSmithLMRigoliDGassonN. Do motor skills in infancy and early childhood predict anxious and depressive symptomatology at school age?Hum Mov Sci. (2010) 29:77786. 10.1016/j.humov.2010.03.006

Summary

Keywords

screening tools, developmental screening questionnaires, cognition, educational difficulties, Ages and Stages Questionnaires (ASQ)

Citation

Schonhaut L, Maturana A, Cepeda O and Serón P (2021) Predictive Validity of Developmental Screening Questionnaires for Identifying Children With Later Cognitive or Educational Difficulties: A Systematic Review. Front. Pediatr. 9:698549. doi: 10.3389/fped.2021.698549

Received

21 April 2021

Accepted

12 October 2021

Published

24 November 2021

Volume

9 - 2021

Edited by

Marjolein Verhoeven, Utrecht University, Netherlands

Reviewed by

Achyut Kumar Pandey, Banaras Hindu University, India; Anneloes Van Baar, Utrecht University, Netherlands

Updates

Copyright

*Correspondence: Luisa Schonhaut

This article was submitted to Children and Health, a section of the journal Frontiers in Pediatrics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics