Skip to main content

BRIEF RESEARCH REPORT article

Front. Psychol., 03 August 2021
Sec. Educational Psychology

The Contribution of Word-, Sentence-, and Discourse-Level Abilities on Writing Performance: A 3-Year Longitudinal Study

\nLnia Carvalhais
Lénia Carvalhais1*Teresa LimpoTeresa Limpo2Luísa lvares Pereira,Luísa Álvares Pereira3,4
  • 1Portucalense Institute for Human Development (INPP), Department of Psychology and Education, Universidade Portucalense, Porto, Portugal
  • 2Faculty of Psychology and Education Sciences, University of Porto, Porto, Portugal
  • 3Department of Education and Psychology, University of Aveiro, Aveiro, Portugal
  • 4Centro de Linguística da Universidade Nova de Lisboa, Universidade Nova de Lisboa, Lisboa, Portugal

Writing is a foundational skill throughout school grades. This study analyzed the development of different levels of written language (word, sentence, and discourse) and explored the relationship between these levels and writing performance. About 95 Portuguese students from two cohorts—Grades 4–7 (n = 47) 6–9 (n = 48)—were asked to produce a descriptive text two times, with a 3-year interval. The produced texts were used to assess spelling, syntactic correctness and complexity, and descriptive discourse as well as text length and quality. The main results showed that there were improvements from Grades 4 to 7 and 6 to 9 in word- and sentence-level skills, along with increases in some dimensions of the descriptive discourse. Moreover, the older cohort performed better than the younger cohort in terms of spelling, syntactic complexity, and text quality, but not in terms of syntactic correctness, one dimension of the descriptive discourse, and text length. Regression analyses showed that writing performance was predicted by word and sentence levels in the younger cohort only, and by discourse-level variables in both cohorts. Overall, despite indicating a generalized growth in writing skills throughout schooling, this study also highlighted the areas that may need additional attention from teachers, mainly in terms of the descriptive features.

Introduction

Writing is a complex skill (Dockrell et al., 2014). It requires the production of legible letters following conventional spellings, to produce words that are organized into sentences and form a coherent written text, expressing the writer's ideas (Abbott et al., 2010). Given the complexity of writing, research into the development of this ability throughout schooling is particularly relevant to understand the trajectories of learning and, based on these, to provide educational guidelines to foster writing skills. Much of extant research provides cross-sectional comparisons (Lerkkanen et al., 2004), and a few studies provide a longitudinal analysis of writing development (Berninger et al., 2010; Jagaiah et al., 2020), mainly with long gaps between the measurement points. These may help to better gauge the development of writing, given its long learning curve. This was the goal of the present study in which we examined the development of different levels of written language and their contribution to writing performance in two cohorts of Portuguese students (from Grades 4 to 7 and 6 to 9). These grades were chosen as they represent critical transitions for Portuguese students, from the first cycle of basic education (Grades 1–4) to the second one (Grades 5–6), and from the second cycle to the third cycle (Grades 7–9).

Levels of language, an analytic tool, is used to understand the complexity of oral and written language (Berninger and Garvey, 1982) based on the analysis of words, sentences, and discourse. In the word level, spelling could be defined as the ability to retrieve, assemble, and select orthographic symbols (Abbott and Berninger, 1993). Large-span cross-sectional studies found that spelling errors (an indicator of spelling skill) decreased throughout schooling, for example, from Grades 2 to 5 (Alves and Limpo, 2015), Grades 2 to 6 (Llaurado and Dockrell, 2020; Magalhães et al., 2020), and Grades 1 to 9 (Bahr et al., 2012). However, a few studies examined the type of errors produced, which can inform about the spelling difficulties of students in each language. Portuguese is a romance language with a simple syllabic structure and orthographic complexities and inconsistencies, classified as an intermediate depth orthography (Seymour et al., 2003; see more details about the Portuguese spelling system in Supplementary Material, section 1). Among several error categorization systems (Treiman et al., 2019), phonological, orthographic, and morphological assessment of spelling (POMAS) seems to be particularly useful, given its specificity of analysis and theoretical support. Grounded on the triple word form theory (Bahr et al., 2012), POMAS codes misspellings into three categories: phonological, orthographic, and morphological. Findings from POMAS revealed that from Grades 1 to 9 there was a decrease in phonological errors coupled with an increase in morphological ones, with most errors across grades being orthographic (Bahr et al., 2012). Despite not assessing morphological errors, Magalhães et al. (2020) found a similar pattern in Portuguese children from Grades 2, 4, and 6. The authors also found that stress marks errors—largely underexplored in Portuguese studies—were present equally in the assessed grades.

Writing a text also requires sentence-level abilities as children need to convert their ideas into sentences. Two key sentence-level measures are syntactic complexity and correctness (Dockrell et al., 2014). One of the most frequent measures of syntactic complexity is clause length (Jagaiah et al., 2020), which is the mean number of words per clause (Berman and Slobin, 1994). Syntactic correctness can be measured through the correctness of word sequences, defined as two contiguous syntactically and semantically acceptable writing units (Videen et al., 1982). A systematic review on Grades 1–12 concluded that syntactic complexity increased throughout schooling (Jagaiah et al., 2020). Similar evidence was found for syntactic correctness. In Grades 3–5, Dockrell et al. (2014) found that younger students produced significantly less correct word sequences than older ones. Likewise, similar findings were found by Malecki and Jewell (2003) showed that several indicators of syntactic correctness consistently increased from early elementary (Grades 1–2) to elementary grades (Grades 3–5), and from elementary to middle grades (Grades 6–8).

By serving specific communicative goals and functions, writing a text requires discourse-related knowledge concerning the structural features of each genre (Berman and Nir-sagiv, 2007; Graham et al., 2013; Dockrell et al., 2014). Like word and sentence levels, discourse-level abilities seem to increase throughout schooling, with students progressively producing texts with more and more genre-specific features. Tolchinsky (2019) found that descriptiveness (i.e., degree to which descriptive texts include the representative features of this genre) increased from Grades 1 to 4. Berman and Nir-sagiv (2007) found similar increases across grades in narrative and expository writing in older samples (Grades 4, 7, and 11 and University).

In addition to improvements in word, sentence, and discourse levels throughout schooling, research has shown that mastering these levels is important for writing performance, assessed in terms of the quality and amount of writing (Berninger, 2012). Both in primary and middle grades, a few studies found that better writing performance is predicted by (a) higher spelling skills [Grades 1–3 in Graham et al. (1997); Grades 3–6 in Abbott et al. (2010); and Grades 7–8 in Limpo et al. (2017)], (b) greater sentence-level abilities [Grades 2 and 3 in Arfé et al. (2016); Grades 3–5 and 5–7 in Beers and Nagy (2011); and Grades 7–8 in Limpo et al. (2017)]; and (c) more genre-related knowledge, including descriptive texts (Beers and Nagy, 2011; Tolchinsky, 2019). Providing stronger evidence on these links, meta-analyses showed that interventions promoting the writing levels of students improved the overall writing performance [Grades 1–6 in Graham et al. (2012) and Grades 4–12 in Graham and Perin (2007)].

Present Study

As most findings surveyed above came from cross-sectional studies, it seems crucial to complement them with longitudinal findings to bring new inputs about writing development. To that end, the following research questions were addressed in a two-cohort sample of Portuguese students: Which are the developmental trajectories of word, sentence, and discourse levels of written language? Moreover, to which degree do these levels predict writing performance? Word, sentence, and discourse levels were measured through spelling, syntactic complexity/correctness, and descriptiveness, whereas writing performance was measured via text length and writing quality. The younger and older cohort were assessed at Grades 4 and 7 and 6 and 9, respectively. Based on the previous research, we expected a skill increase in all levels and an association between these and writing performance. We also hypothesized that the older cohort would show better writing skills than the younger cohort.

Method

Participants and Design

Participants were 101 students from Grades 4 to 9 and enrolled in a cluster of public schools located in urban middle-class neighborhoods from the Center of Portugal. Among these, six children were dropped from the analyses based on the following criteria: four had special education needs and two were identified as extreme outliers in one of the variables under analysis (viz., morphological misspellings per 100 words, which lied more than 3.0 times the interquartile range above the third quartile). All analyses were then based on the data from 95 students and were divided into two cohorts that were assessed twice (T1–T2), with a 3-year gap. The younger cohort was composed of 47 students in Grade 4 (51% girls) with an average age of 9.28 years at T1 (SD = 0.45). The older cohort included 48 students in Grade 6 (62% girls) with an average age of 11.35 years at T1 (SD = 0.53).

Procedure

After the formal agreement from the principal of the school cluster, permission was given to contact the teachers of Grade 4 classes (a total of five) and Grade 6 classes (a total of four). After being explained about the goals and procedures of the study, including the possibility to withdraw at any moment, all teachers and students agreed to participate. In group, students were asked to produce a descriptive text in response to the prompt “Please describe your school,” which has been successfully used in previous research (e.g., Berninger et al., 2009; Dockrell et al., 2014). The full administration procedure lasted for 50 min. This is the typical duration of writing tasks in the participating schools, also used in prior studies (e.g., Llaurado and Dockrell, 2020). The exact same procedure was followed for both cohorts and testing moments. To potentiate the engagement of students, they were told that the best texts would be posted at the school webpage.

Measures

Further details on the measures described below can be found in Supplementary Material, sections 2 and 3.

Word-Level Measures

Based on POMAS (Bahr et al., 2012), misspellings were counted separately by category: phonological errors, orthographic errors, morphological errors, stress marks, and illegible errors. However, the illegible errors were ignored as they were negligible (below 1%). This measure was re-scored by a second judge in the written products of 20% of the pupils at both T1 and T2. Reliability was good for all misspelling types, as indicated by the intraclass correlation coefficient (ICC) for single measures (>0.80).

Sentence-Level Measures

The sentence-level measures included the clause length and percentage of incorrect word sequences. Clause length was computed by averaging the number of words per clause, employing the computerized language analysis (CLAN) software (MacWhinney, 2000). The percentage of incorrect word sequences was calculated by examining the total number of incorrect sequences divided by the total number of sequences. Based on the scoring of 20% of the measures by a second judge, we concluded that reliability was high (ICC for individual measures >0.94).

Discourse-Level Measures

To measure descriptiveness (i.e., the presence of features typical of the descriptive text), we followed the taxonomy proposed by Adam (2001), including anchoring, aspectualization, relation, and subthematization categories. This taxonomy was used in previous studies, which relied on a dichotomous scale to indicate the presence or absence of the category [Grades 4–6 in Moura et al. (2015); Grade 3 in Pereira and Gonçalves (2017)]. Because our sample was older and we were concerned that this dichotomic coding would lead to ceiling effects, we added a new level indicating the presence and elaboration of information in each category. Thus, we used a three-point scale from 0 to 2, with the highest scores indicating a higher degree of discourse elaboration [for a similar coding scheme in argumentative texts, see Limpo and Alves (2013)]. Two independent judges scored these dimensions across all texts. Disagreements were solved through a discussion.

Writing Performance

Two measures were used: text length and text quality. Text length was measured through the total number of words provided by CLAN. Text quality was assessed using a holistic scale ranging from 1 (low quality) to 7 (high quality), on creativity, coherence, syntax, and vocabulary (Alves et al., 2016). To avoid transcription biases on quality assessment, texts were previously typed, and misspellings were corrected (Berninger and Swanson, 1994). Two independent judges rated the text quality of all texts produced. Inter-reliability was high, as measured by the ICC for average measures, which was 0.91 at T1 and T2.

Results

Excepting one measure, we confirmed that our study revealed no distributional problems, as the absolute values of these indexes of skewness and kurtosis did not exceed 3.0 and 10.0, respectively (Kline, 2005). We found a ceiling effect in the descriptive dimension of aspectualization in the younger cohort at T2. Thus, this variable was not included in the analyses. Descriptive statistics for variables are presented on Table 1.

TABLE 1
www.frontiersin.org

Table 1. Means and SD for all variables by cohort and time.

Cohort and Time Differences at the Word Level

To examine whether the type of misspellings varied across cohort and time, we conducted a 2 (cohort [younger, older]) ×2 (time [T1, T2]) × 4 (misspellings type [phonological, orthographic, morphological, stress mark]) ANOVA, with repeated measures on the two last factors. The results revealed a main effect of time, [F(1, 279) = 64.30, p < 0.001, ηp2 = 0.41], a main effect of misspellings type, [F(1, 279) = 37.67, p < 0.001, ηp2 = 0.39], and a main effect of cohort, [F(1, 93) = 10.76, p < 0.001, ηp2 = 0.10]. Additionally, there were significant two-way interactions of misspellings type with both cohort, [F(3, 279) = 4.86, p = 0.003, ηp2 = 0.05], and time, [F(3, 279) = 5.97, p = 0.001, ηp2 = 0.06]. These were decomposed with simple-effect analyses followed up by pairwise comparisons with Bonferroni adjustments, for misspellings type × cohort and misspellings type × time. Overall, the results showed that stress mark errors were the most frequent and that, except morphological errors, the younger cohort produced more misspellings of all types than the older one (for complete results, see Supplementary Material, section 4).

Cohort and Time Differences at the Sentence Level

To examine whether the percentage of incorrect word sequences and clause length varied across cohort and time, we conducted two 2 (cohort [younger, older]) × 2 (time [T1, T2]) ANOVAs. For incorrect word sequences, the results revealed a main effect of time, [F(1, 93) = 24.21, p < 0.001, ηp2 = 0.21], with a decrease from T1 to T2. For clause length, the results showed a main effect of time, [F(1, 93) = 9.14, p = 0.003, ηp2 = 0.09], a main effect of cohort, [F(1, 93) = 15.85, p < 0.001, ηp2 = 0.15], and an interaction between the two, [F(1, 93) = 4.56, p = 0.04, ηp2 = 0.05]. Simple-effect analyses showed that clause length increased across time only for the older cohort, [F(1, 93) = 13.45, p < 0.001, ηp2 = 0.13], and that clause length differed between the cohorts at T2 only, [F(1, 93) = 17.11, p < 0.001, ηp2 = 0.16]. The older cohort produced longer clauses than the younger one.

Cohort and Time Differences at the Discourse Level

To examine whether descriptive dimensions varied across cohort and time, we conducted a 2 (cohort [younger, older]) × 2 (time [T1, T2]) × 3 (descriptive dimensions [anchoring, sub-thematization, relationship]) ANOVA, with repeated measures on the two last factors. The results revealed a main effect of time, [F(1, 186) = 6.38, p = 0.01, ηp2 = 0.06] and a main effect of descriptive dimensions, [F(2, 186) = 293.42, p < 0.001, ηp2 = 0.76]. In addition, we found significant two-way interactions of descriptive dimensions with cohort, [F(2, 186) = 5.90, p = 0.003, ηp2 = 0.06], and with time, [F(2, 186) = 22.25, p < 0.001, ηp2 = 0.19]. These were decomposed with simple effects analyses followed up by pairwise comparisons with Bonferroni adjustments, for descriptive dimensions × cohort and descriptive dimensions × time. In general, the findings showed different performances between dimensions, with higher results in sub-thematization, which increased from T1 to T2 in both cohorts (for complete results, see Supplementary Material, section 4).

Cohort and Time Differences in Writing Performance

To examine whether text length and text quality varied across cohort and time, we conducted two 2 (cohort [younger, older]) × 2 (time [T1, T2]) ANOVAs. For text length, the results revealed a main effect of time, [F(1, 93) = 30.63, p < 0.001, ηp2 = 0.25], with pupils writing longer texts at T2 than T1. For text quality, the results showed a main effect of cohort, [F(1, 93) = 6.99, p = 0.01, ηp2 = 0.07], with the older cohort producing better texts than the younger one; and a main effect of time, [F(1, 93) = 94.80, p < 0.001, ηp2 = 0.51], with pupils writing better texts at T2 than T1.

Contribution of Written Language Levels to Writing Performance

Table 2 presents the bivariate correlations between all variables at T1 and T2 for both cohorts. In general, we found that (a) for the younger cohort, writing performance was associated with word and discourse variables at T1, but only with discourse variables at T2 and (b) for the older cohort, text length was associated with sentence-level variables and text quality with spelling-level, sentence-level, and discourse-level variables at T1, whereas at T2 only discourse level was related to writing performance.

TABLE 2
www.frontiersin.org

Table 2. Bivariate correlations for all variables at T1 and T2 for the younger cohort (below the diagonal) and the older cohort (above the diagonal).

To test the contribution of the three levels of language to writing performance, we conducted a set of stepwise regression analyses to predict text length and quality at T1 and T2, separately by cohort (final model estimates are presented on Tables 3, 4, for text length and text quality, respectively). For predicting text length and quality at T1, we progressively introduced word-, sentence-, and discourse-level variables at T1 step-by-step. For predicting text length and quality at T2, we introduced text length and quality at T1 as a first step, followed by a step-by-step inclusion of writing levels at T2.

TABLE 3
www.frontiersin.org

Table 3. Parameter estimates of the final regression models predicting text length at T1 and T2 for each cohort.

TABLE 4
www.frontiersin.org

Table 4. Parameter estimates of the final regression models predicting text quality at T1 and T2 for each cohort.

Predicting T1 Text Length

For the younger cohort, Steps 1 and 2 did not reach statistical significance, but the inclusion of discourse-level variables increased the amount of variance explained in text length for both the younger, Rchange = 0.19, [Fchange(3, 37) = 4.09, p = 0.01]. The final model with all predictors explained 43% of the variance in text length at T1, [F(9, 37) = 3.04, p = 0.01]. Significant and independent predictors were clause length (b = 0.28) and the descriptive dimensions of anchoring (b = 0.28) and sub-thematization (b = 0.30). The first steps and the final model were, however, not significant for the older cohort, R2 = 0.26, [F(9, 38) = 1.52, p = 0.18].

Predicting T2 Text Length

For the younger cohort, Step 1 with text length at T1 and Step 2 with word-level predictors did not reach statistical significance. However, the inclusion of sentence-level predictors at Step 3 increased the amount of variance explained in text length, Rchange2 = 0.26, [Fchange(2, 39) = 8.70, p = 0.001]. When Step 4 was added, there was no significant increase. Together, text length at T1 and the three levels of written language at T2 explained 47% of the text length at T2, [F(10, 36) = 3.19, p = 0.01]. Significant and independent predictors were T1 text length (b = 0.28), stress mark errors (b = 0.32), incorrect word sequences (b = 0.40), and clause length (b = 0.37). For the older cohort, there was no significant contribution of Steps 1–3, but the inclusion of descriptive dimensions significantly raised the amount of variance explained, Rchange2 = 0.26, [Fchange(3, 37) = 6.43, p = 0.001]. The final model explained 49% of the variance in T2 text length, [F(10, 37) = 3.61, p = 0.02]. Significant and independent predictors were the descriptive dimensions of anchoring (b = 0.26) and sub-thematization (b = 0.43).

Predicting T1 Text Quality

For the younger cohort, Step 1 with word-level predictors made a significant contribution to text quality, R2 = 0.23, [F(4, 42) = 3.14, p = 0.02]. The inclusion of sentence-level predictors did not increase the amount of variance explained, but the inclusion of discourse-level predictors did, Rchange2 = 0.22, [Fchange(3, 37) = 5.37, p = 0.004]. Together, the word-, sentence, and discourse-level predictors explained 50% of the variance in text quality at T1, [F(9, 37) = 4.05, p = 0.001]. The variables with significant and independent contributions were the number of stress mark errors per 100 words (b = −0.40) and the descriptive dimensions of anchoring (b = 0.33) and sub-thematization (b = 0.29). For the older cohort, none of the first steps nor the final model reached statistical significance, R2 = 0.32, [F(9, 38) = 1.99, p = 0.07].

Predicting T2 Text Quality

Step 1 of the analyses, including T1 text quality, proved significant for both the younger, R2 = 0.22, [F(1, 45) = 12.35, p = 0.001], and older cohorts, R2 = 0.08, [F(1, 46) = 5.25, p = 0.03]. Similar across the two cohorts, Steps 2 and 3 did not reach significance, but the inclusion of discourse-level predictors on Step 4 increased the amount of variance explained in text quality for both the younger, Rchange2 = 0.20, [Fchange(3, 36) = 5.74, p = 0.003], and older cohorts, Rchange2 = 0.21, [Fchange(3, 37) = 4.10, p = 0.01]. The final model for the younger cohort explained 58% of the variance in text quality, [F(10, 36) = 4.96, p < 0.001]. Significant and independent predictors were text quality at T1 (b = 0.41), clause length (b = 0.28), and the descriptive dimensions of sub-thematization (b = 0.35) and relationship (b = 0.31). The final model for the older cohort explained 37% of the variance in text quality at T2, [F(10, 37) = 2.22, p = 0.04]. Only text quality at T1 (b = 0.29) and the descriptive dimension of anchoring (b = 0.39) made a significant contribution.

Discussion

The first goal of the present study was to trace the developmental path of writing at word, sentence, and discourse levels. The overall results indicated that older students showed better performance at word and sentence levels than younger students, but mixed findings were found for the discourse level.

In line with prior studies (Alves and Limpo, 2015; Llaurado and Dockrell, 2020; Magalhães et al., 2020), we found a general decrease in misspellings from T1 to T2 and more misspellings in the younger than the older cohort. This finding is not surprising as students in higher grades had more years of formal instruction and therefore had more spelling knowledge and writing experience. Moreover, it has been suggested that in free writing older students may be better at selecting words they know how to spell correctly (Graham and Santangelo, 2014). A misspelling analysis revealed four noteworthy findings. First, the pattern of older students producing less misspellings than younger ones was not observed for morphological errors. This finding aligns well with the proposal of Bahr et al. (2012), who suggested that more morphological errors may occur in older students due to the use of more complex vocabulary. This may require advanced morphological (derivational) knowledge that takes time to growth (Nagy et al., 2006; Berninger et al., 2010). In the future, POMAS could be used to examine the development of misspellings beyond Grade 9. Second, phonological misspellings decreased from T1 to T2 in both cohorts, being the least frequent type of misspelling in the older cohort. This is in line with previous research (Bahr et al., 2012) and suggests that sound-based spellings are learned in the earliest phases of spelling development (Treiman and Bourassa, 2000). Third, stress mark errors were the most frequent misspelling in both timepoints and cohorts, also corroborating past findings with younger Portuguese students (Magalhães et al., 2020). Stress mark errors indicate poor lexical knowledge of stress and difficulties in prosodic and orthographic mapping (Defior et al., 2012). From an applied standpoint, this means that spelling instruction is not being entirely successful in fostering this kind of knowledge. Finally, after stress mark errors, orthographic misspellings were the most predominant errors in Grades 4, 6, and 7. This is a common finding in the field (Bahr et al., 2012; Magalhães et al., 2020; Mesquita et al., 2020), suggesting that the Portuguese orthographic complexities and inconsistencies take several years to be mastered. Future research seems to be needed for developing evidence-based practices for improving orthographic knowledge beyond primary grades.

In line with previous meta-analytic findings (Jagaiah et al., 2020), the sentence-level results showed longer clauses in the older than the younger cohort. Moreover, we found stronger time-related increases in the ability of students to craft complex sentences in the older cohort. These results indicate that improvements in syntactic complexity are more salient in older writers, as it has been proposed by scholars in the field (Hunt, 1970; Berninger et al., 2011). This may be related to teacher practices, with a sentence-related explicit instruction, including vocabulary and sentence expansion exercises only at later stages when students are more familiarized with complex genres (Connors, 2000). Our results showed a growth in syntactic correctness from Grades 4 to 7 and 6 to 9, with better performances in the older than the younger cohort. Similar findings have been reported in the field (Malecki and Jewell, 2003; Dockrell et al., 2014), indicating that the ability to craft syntactically correct sentences progresses throughout the primary and middle school. The measure of incorrect word sequences seems particularly sensitive to gauge that progress, including in older students (Espin et al., 2000; Weissenburger and Espin, 2005).

The analyses examining cohort and grade differences in the discourse level revealed three main findings. First, the younger cohort performed better than the older one in terms of anchoring. This unexpected finding may be related to the Portuguese curricula, which, in the initial grades, emphasize the use of titles and introductory sentences to contextualize the theme to readers (Buescu et al., 2015). Though vivid in late primary and early middle grades, these recommendations may be lost over the years. Second, we observed the anticipated increase from T1 to T2 in the ability of students to connect the prompt with other topics, with older students performing better than younger ones. Tolchinsky (2019) already suggested that older students tend to provide elaborated descriptions of topic-related aspects, whereas younger ones usually present lists of attributes, with a few efforts to articulate content. This progressive increase in ideas elaboration over time is common to other genres (Berman and Nir-sagiv, 2007; Beers and Nagy, 2009) and may be linked to progressive mastery of writing of students. Third, relationship was the most absent descriptive feature (either alone in Grade 4, or together with anchoring in Grades 6, 7, and 9). Clearly, ability of the students to relate and compare concepts was poor in all grades assessed, which is alarming, given the importance of this feature. Descriptive texts should establish links between sub-topics through comparisons or metaphors, which allow readers to form a picture in their minds (Adam, 2001). More research is needed to understand which factors underlie this poor performance and which strategies may be used to foster it.

The second goal of this study was to examine the contribution of word, sentence, and discourse levels to writing performance. However, considering the participants/predictors ratio, some caution is needed when interpreting these findings, which should be replicated in future studies with larger samples. Concerning word-level predictors, we found that more stress mark errors were associated with poorer texts at T1. This finding aligns with those of Magalhães et al. (2020) showing that in Grade 4, stress mark errors were reliable predictors of text quality. Regarding sentence-level predictors, we found the overall contribution of syntactic complexity to the amount and quality of students writing, confirming the importance of producing complex and good sentences to perform well in writing (Beers and Nagy, 2011; Limpo et al., 2017). Interestingly, the contribution of word- and sentence-level predictors to writing performance was only observed in the younger cohort, indicating that once students master a writing level, its role in writing performance diminishes (Graham, 2006). A striking finding involving both word- and sentence-level predictors at T2 in the younger cohort was that more stress mark errors, and more incorrect word sequences were associated with longer texts. Although this finding may be an artifact of the current study, it may also hint that by ignoring some aspects of writing, such as word stress and sentence correctness, students may be able to write more. Additional research is needed to replicate and explore these results.

Discourse-level variables were the most salient predictors of writing performance in both cohorts, confirming that the amount and quality of the writing of students is heavily dependent on their ability to follow genre-specific structures (Graham, 2006). This means that a powerful way to increase writing performance in a given genre is to improve student's knowledge about its underlying. Previous meta-analyses on the best methods to develop writing are in line with this conclusion (Graham and Perin, 2007; Graham et al., 2012). It should, however, be noted that our findings showed that not all descriptive categories contributed equally to writing performance. The more relevant category seems to be subthematization, that is, the ability of students to elaborate on the content. Experimental research is needed to examine the degree to which teaching of each of the descriptive categories results in better writing performance.

Implications for Applied Settings

The findings of the current study provide relevant hints for practice. Despite the general growth in writing, there seems to be room for improvement. In younger students, teachers may need to provide additional instruction in terms of syntactic complexity, whereas in older students, they may need to focus on stress mark and orthographic knowledge as well as on descriptive features, mainly, anchoring.

Conclusion

In sum, this study showed the overall growth in word and sentence levels and an increase in the discourse level only for sub-thematization. Moreover, whereas word and sentence level predicted writing performance only in the younger cohort, the discourse level was a relevant predictor in both cohorts. By helping us to understand the long-term curve of writing development, these findings provide hints for researchers to develop evidence-based practices tailored to the writing needs of students.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author Contributions

LP was responsible for initial study design and data collection. LC coordinated data coding and the manuscript preparation. TL analyzed the data. All authors contributed to the design and conceptualization of the study, literature review, discussion, and wrote and reviewed the manuscript and approved its final version.

Funding

This project was supported by National Funds through FCT—Fundação para a Ciência e Tecnologia, I. P., under the project UIDB/00914/2020. Universidade Portucalense, Infante D. Henrique funded this work by supporting its publication fees.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors would like to thank Inês Soares de Moura for her collaboration in the data collection and Filipa Reis for data coding. We would also like to thank Protextos team (University of Aveiro) for the support in the design and discussion of instruments.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.668139/full#supplementary-material

References

Abbott, R., Berninger, V., and Fayol, M. (2010). Longitudinal relationships of levels of language in writing and between writing and reading in grades 1 to 7. J. Educ. Psychol. 102, 281–298. doi: 10.1037/a0019318

CrossRef Full Text | Google Scholar

Abbott, R. D., and Berninger, V. W. (1993). Structural equation modeling of relationships among developmental skills and writing skills in primary and intermediate grade writers. J. Educ. Psychol. 85, 478–508. doi: 10.1037/0022-0663.85.3.478

CrossRef Full Text | Google Scholar

Adam, J. M. (2001). Les Textes: Types et Prototypes: Récit, Description, Argumentation, Explication et Dialogue. Paris: Nathan.

Google Scholar

Alves, R. A., and Limpo, T. (2015). Progress in written language bursts, pauses, transcription, and written composition across schooling. Sci. Stud. Read. 19, 374–391. doi: 10.1080/10888438.2015.1059838

CrossRef Full Text | Google Scholar

Alves, R. A., Limpo, T., Fidalgo, R., Carvalhais, L., Pereira, L. A., and Castro, S. L. (2016). The impact of promoting transcription on early text production: effects on bursts and pauses, levels of written language, and writing performance. J. Educ. Psychol. 108, 665–679. doi: 10.1037/edu0000089

CrossRef Full Text | Google Scholar

Arfé, B., Dockrell, J. E., and De Bernardi, B. (2016). The effect of language specific factors on early written composition: the role of spelling, oral language, and text generation skills in a shallow orthography. Read. Writ. 29, 501–527. doi: 10.1007/s11145-015-9617-5

CrossRef Full Text | Google Scholar

Bahr, R., Silliman, E., Berninger, V., and Dow, M. (2012). Linguist pattern analysis of misspellings of typically developing writers in Grades 1-9. J. Speech Lang. Hear. Res. 55,1587–1599. doi: 10.1044/1092-4388(2012/10-0335)

CrossRef Full Text | Google Scholar

Beers, S. F., and Nagy, W. E. (2009). Syntactic complexity as a predictor of adolescent writing quality: which measures? which genre? Read. Writ. Interdiscip. J. 22, 185–200. doi: 10.1007/s11145-007-9107-5

CrossRef Full Text | Google Scholar

Beers, S. F., and Nagy, W. E. (2011). Writing development in four genres from grades three to seven: syntactic complexity and genre differentiation. Read. Writ. 24, 183–202. doi: 10.1007/s11145-010-9264-9

CrossRef Full Text | Google Scholar

Berman, R. A., and Nir-sagiv, B. (2007). Comparing narrative and expository text construction across adolescence: a developmental paradox. Discourse Process. 43, 79–120. doi: 10.1080/01638530709336894

CrossRef Full Text | Google Scholar

Berman, R. A., and Slobin, D. I.,. (1994). Relating Events in Narrative: A Crosslinguistic Developmental Study. Mahwah, NJ: Erlbaum.

Google Scholar

Berninger, G., and Garvey, C. (1982). Tag constructions: structure and function in child discourse. J. Child Lang. 9, 151–168. doi: 10.1017/S0305000900003688

CrossRef Full Text | Google Scholar

Berninger, V., Abbott, R., Swanson, H. L., Lovitt, D., Trivedi, P., Lin, S., et al. (2010). Relationship of word- and sentence-level working memory to reading and writing in second, fourth, and sixth grade. Lang. Speech Hear. Serv. Sch. 41, 179–193. doi: 10.1044/0161-1461(2009/08-0002)

PubMed Abstract | CrossRef Full Text | Google Scholar

Berninger, V., and Swanson, H. L. (1994). Modifying Hayes and Flower's model of skilled writing to explain beginning and developing writing, in Children's Writing: Toward a Process Theory of Development of Skilled Writing, ed E. Butterfield (Greenwich, CT: JAI Press), 57–81.

Berninger, V. W., (2012). Past, Present, and Future Contributions of Cognitive Writing Research to Cognitive Psychology. London: Psychology Press.

Google Scholar

Berninger, V. W., Abbott, R. D., Augsburger, A., and Garcia, N. (2009). Comparison of pen and keyboard transcription modes in children with and without learning disabilities. Learn. Disabil. Q. 32, 123–141. doi: 10.2307/27740364

CrossRef Full Text | Google Scholar

Berninger, V. W., Nagy, W. E., and Beers, S. F. (2011). Child writers' construction and reconstruction of single sentences and construction of multi-sentence texts: contributions of syntax and transcription to translation. Read. Writ. Interdiscip. J. 24, 151–182. doi: 10.1007/s11145-010-9262-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Buescu, H. C., Morais, J., Rocha, M. R., and Magalhães, V. F. (2015). Programa e metas curricularesde português do ensino básico [Program and Curriculum Goals of Portuguese in Basic Education]. Retrieved from the Directorate-General for Education of the Portuguese Ministry of Education and Science website: https://www.dge.mec.pt/sites/default/files/Basico/Metas/Portugues/pmcpeb_julho_2015.pdf (accessed May 17, 2021).

Google Scholar

Connors, R. J. (2000). The erasure of the sentence. Coll. Compost. Commun. 52, 96–128. doi: 10.2307/358546

CrossRef Full Text | Google Scholar

Defior, S., Gutiérrez-Palma, N., and Cano-Marín, M. J. (2012). Prosodic awareness skills and literacy acquisition in Spanish. J. Psycholinguist. Res. 41, 285–294. doi: 10.1007/s10936-011-9192-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Dockrell, J. E., Connelly, V., Walter, K., and Critten, S. (2014). Assessing children's writing products: the role of curriculum-based measures. Br. Educ. Res. J. 41, 575–595. doi: 10.1002/berj.3162

CrossRef Full Text | Google Scholar

Espin, C., Shin, J., Deno, S. L., Skare, S., Robinson, S., and Benner, B. (2000). Identifying indicators of written expression proficiency for middle school students. J. Spec. Educ. 34, 140–153. doi: 10.1177/002246690003400303

CrossRef Full Text | Google Scholar

Graham, S. (2006). Writing, in Handjournal of Educational Psychology, eds P. Alexander and P. Winne (Mahwah, NJ: Erlbaum), 457–478.

Google Scholar

Graham, S., Berninger, V. W., Abbott, R. D., Abbott, S. P., and Whitaker, D. (1997). Role of mechanics in composing of elementary school students: a new methodological approach. J. Educ. Psychol. 89, 170–182. doi: 10.1037/0022-0663.89.1.170

CrossRef Full Text | Google Scholar

Graham, S., Gillespie, A., and McKeown, D. (2013). Writing: importance, development, and instruction. Read. Writ. 26, 1–15. doi: 10.1007/s11145-012-9395-2

CrossRef Full Text | Google Scholar

Graham, S., McKeown, D., Kiuhara, S., and Harris, K. R. (2012). A meta-analysis of writing instruction for students in the elementary grades. J. Educ. Psychol. 104, 879–896. doi: 10.1037/a0029185

CrossRef Full Text | Google Scholar

Graham, S., and Perin, D. (2007). A meta-analysis of writing instruction for adolescent students. J. Educ. Psychol. 99, 445–476. doi: 10.1037/0022-0663.99.3.445

CrossRef Full Text | Google Scholar

Graham, S., and Santangelo, T. (2014). Does spelling instruction make students better spellers, readers, and writers? a meta-analytic review. Read. Writ. Interdiscip. J. 27, 1703–1743. doi: 10.1007/s11145-014-9517-0

CrossRef Full Text | Google Scholar

Hunt, K. W. (1970). Syntactic maturity in school children and adults. Monogr. Soc. Res. Child Dev. 35:67. doi: 10.2307/1165818

CrossRef Full Text

Jagaiah, T., Olinghouse, N. G., and Kearns, D. M. (2020). Syntactic complexity measures: variation by genre, grade-level, students' writing abilities, and writing quality. Read. Writ. 33, 2577–2638. doi: 10.1007/s11145-020-10057-x

CrossRef Full Text | Google Scholar

Kline, R. B. (2005). Principles and Practice of Structural Equation Modeling, 2nd Edn. New York, NY: The Guilford Press.

Google Scholar

Lerkkanen, M., Rasku-Puttonen, H., Aunola, K., and Nurmi, J. (2004). The developmental dynamics of literacy skills during the first grade. Educ. Psychol. 24, 793–810. doi: 10.1080/0144341042000271782

CrossRef Full Text | Google Scholar

Limpo, T., and Alves, R. A. (2013). Modeling writing development: contribution of transcription and self-regulation to Portuguese students' text generation quality. J. Educ. Psychol. 105, 401–413. doi: 10.1037/a0031391

CrossRef Full Text | Google Scholar

Limpo, T., Alves, R. A., and Connelly, V. (2017). Examining the transcription-writing link: Effects of handwriting fluency and spelling accuracy on writing performance via planning and translating in middle grades. Learn. Individ. Differ. 53, 26–36. doi: 10.1016/j.lindif.2016.11.004

CrossRef Full Text | Google Scholar

Llaurado, A., and Dockrell, J. (2020). The impact of orthography on text production in three languages: Catalan, English, and Spanish. Front. Psychol. 11:878. doi: 10.3389/fpsyg.2020.00878

PubMed Abstract | CrossRef Full Text | Google Scholar

MacWhinney, B. (2000). The Childes Project: Tools for Analyzing Talk. Mahwah, NJ: Erlbaum.

Google Scholar

Magalhães, S., Mesquita, A., Filipe, M., Veloso, A., Castro, S. L., and Limpo, T. (2020). Spelling performance of portuguese children: comparison between grade level, misspelling type, and assessment task. Front. Psychol. 11:547. doi: 10.3389/fpsyg.2020.00547

PubMed Abstract | CrossRef Full Text | Google Scholar

Malecki, C., and Jewell, J. (2003). Developmental, gender, and practical considerations in scoring curriculum-based measurement writing probes. Psychol. Sch. 40, 379–390. doi: 10.1002/pits.10096

CrossRef Full Text | Google Scholar

Mesquita, A., Carvalhais, L., Limpo, T., and Castro, S. L. (2020). Portuguese spelling in primary grades: complexity, length, and lexicality effects. Read. Writ. 33, 1325–1349. doi: 10.1007/s11145-019-10012-5

CrossRef Full Text | Google Scholar

Moura, I., Pereira, L. Á., and Coimbra, R. L. (2015). Ortografia e produção textual: estudo de caso no sistema de ensino português. Indagatio Didática 7, 62–86. doi: 10.34624/id.v7i4.2893

CrossRef Full Text | Google Scholar

Nagy, W., Berninger, V. W., and Abbott, R. D. (2006). Contributions of morphology beyond phonology to literacy outcomes of upper elementary and middle-school students. J. Educ. Psychol. 98, 134–147. doi: 10.1037/0022-0663.98.1.134

CrossRef Full Text | Google Scholar

Pereira, Á., and Gonçalves, C. (2017). Aprender a escrever em contextos multilingues através da descrição de obras de arte. Signo Pensamiento 36, 84–99. Retrived from: https://revistas.javeriana.edu.co/index.php/signoypensamiento/article/view/21062 (accessed June 06, 2021).

Google Scholar

Seymour, P. H. K., Aro, M., Erskine, J. M., and the COST Action A8 network (2003). Foundation literacy acquisition in European orthographies. Br. J. Psychol. 94, 143–174. doi: 10.1348/000712603321661859

PubMed Abstract | CrossRef Full Text | Google Scholar

Tolchinsky, L. (2019). Evolving structure of descriptive texts and learner's abilities. J. Lit. Res. 51, 293–314. doi: 10.1177/1086296X19858354

CrossRef Full Text | Google Scholar

Treiman, R., and Bourassa, D. C. (2000). Children's written and oral spelling. Appl. Psycholinguist. 21, 183–204. doi: 10.1017/S0142716400002022

CrossRef Full Text | Google Scholar

Treiman, R., Caravolas, M., and Kessler, B. (2019). What methods of scoring young children's spelling best predict later spelling performance? J. Res. Read. 42, 80–96. doi: 10.1111/1467-9817.12241

PubMed Abstract | CrossRef Full Text | Google Scholar

Videen, J., Deno, S. L., and Marston, D. (1982). Correct Word Sequences: A Valid Indicator of Proficiency in Written Expression, Vol. IRLD-RR-84. Minneapolis, MN: University of Minnesota, Institute for Research on Learning Disabilities.

Google Scholar

Weissenburger, J., and Espin, C. (2005). Curriculum-based measures of writing across grade levels. J. Sch. Psychol. 43, 153–169. doi: 10.1016/j.jsp.2005.03.002

CrossRef Full Text | Google Scholar

Keywords: writing performance, levels of language, grade level, longitudinal study, spelling, syntactic complexity measures, descriptive text

Citation: Carvalhais L, Limpo T and Pereira LÁ (2021) The Contribution of Word-, Sentence-, and Discourse-Level Abilities on Writing Performance: A 3-Year Longitudinal Study. Front. Psychol. 12:668139. doi: 10.3389/fpsyg.2021.668139

Received: 15 February 2021; Accepted: 22 June 2021;
Published: 03 August 2021.

Edited by:

Kuniyoshi L. Sakai, The University of Tokyo, Japan

Reviewed by:

Paz Suárez-Coalla, University of Oviedo, Spain
Gavin Bui, Hang Seng University of Hong Kong, China
Sarah Critten, Coventry University, United Kingdom

Copyright © 2021 Carvalhais, Limpo and Pereira. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lénia Carvalhais, leniac@upt.pt

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.