The graded predictive pre-activation in Chinese sentence reading: evidence from eye movements

Chang, Min; Zhang, Kuo; Sun, Yue; Li, Sha; Wang, Jingxin

doi:10.3389/fpsyg.2023.1136488

BRIEF RESEARCH REPORT article

Front. Psychol. , 29 June 2023

Sec. Psychology of Language

Volume 14 - 2023 | https://doi.org/10.3389/fpsyg.2023.1136488

This article is part of the Research Topic Eye-tracking While Reading for Psycholinguistic and Computational Models of Language Comprehension View all 12 articles

The graded predictive pre-activation in Chinese sentence reading: evidence from eye movements

$\r\nMin Chang$ Min Chang¹

Kuo Zhang²

Yue Sun^3,4

Sha Li⁵

Jingxin Wang^3,4*

¹School of Education Science, Nantong University, Nantong, China
²Department of Social Psychology, Nankai University, Tianjin, China
³Faculty of Psychology, Tianjin Normal University, Tianjin, China
⁴Academy of Psychology and Behavior, Tianjin Normal University, Tianjin, China
⁵School of Psychology, Fujian Normal University, Fuzhou, China

Previous research has revealed that graded pre-activation rather than specific lexical prediction is more likely to be the mechanism for the word predictability effect in English. However, whether graded pre-activation underlies the predictability effect in Chinese reading is unknown. Accordingly, the present study tested the generality of the graded pre-activation account in Chinese reading. We manipulated the contextual constraint of sentences and the predictability of target words as independent variables. Readers’ eye movement behaviors were recorded via an eye tracker. We examined whether processing an unpredictable word in a solid constraining context incurs a prediction error cost when this unpredictable word has a predictable alternative. The results showed no cues of prediction error cost on the early eye movement measures, supported by the Bayes Factor analyses. The current research indicates that graded predictive pre-activation underlies the predictability effect in Chinese reading.

Introduction

Prediction is a fundamental principle of language processing (Clark, 2013). Efficient language comprehension depends on two streams of information, i.e., the top-down expectation and the bottom-up conceptual input. In speech comprehension, listeners could predict the content at the end of other speakers’ turns to make efficient turn-taking using statistical regularities information in speech (Scott et al., 2009). In reading comprehension, readers could make use of contextual predictability information to facilitate word identification and semantic integration (for a review see Staub, 2015). A word’s predictability, as measured by the word’s cloze value, i.e., the proportion of participants who give this word in a non-speeded sentence completion task (Taylor, 1953), has been shown to influence reading times and saccadic behavior in reading tasks using the eye-tracking method of English, German, and Chinese (Rayner and Well, 1996; Kliegl et al., 2004; Rayner et al., 2005; Wang et al., 2010; Staub, 2015; Liu et al., 2018; Zhao et al., 2019; Chang et al., 2020a,b). Specifically, predictable words are easier to read, receive fewer and shorter fixations, and elicit longer progressive or incoming saccade length than unpredictable words, i.e., the word predictability effect. However, the mechanisms for the predictability effect in Chinese reading have not been investigated previously. Thus, the present study aims to determine how prediction occurs, i.e., the mechanism of word predictability effect in Chinese reading.

Two competing theoretical accounts explain the mechanisms for predictability effects, each of which has different predictions for processing unexpected words (Luke and Christianson, 2016; for a review see Staub, 2015). First, the word prediction could be defined as an “all-or-none” process in which readers may maintain specific, discrete predictions of upcoming perceptual input, also termed lexical prediction (also see Delong et al., 2014). According to this lexical prediction account, strong constraining sentences support expectations for predictable words with much facilitation. Reading can be facilitative when readers encounter predictable words but slow down when readers encounter unpredictable words in a sufficient constraining context, i.e., producing the prediction error cost (Kutas et al., 2011; Luke and Christianson, 2016). For example, readers would predict the most probable word gift in the constraining sentence “Today was Annie’s birthday, her mother bought her a-.” This predictable word gift would be processed quickly as it matches readers’ expectations. On contrary, readers might be surprised when encountering an unexpected word like book, then they would spend more time reading this unexpected word (i.e., prediction error cost) as they must suppress the activated gift. While a neutral constraining sentence like “When Annie went home, her mother brought her a-” provides little contextual information to readers. Thus, processing the unpredictable words would rarely incur prediction error cost as no predictable word is pre-activated. Therefore, according to the lexical prediction account, the comparison of processing unpredictable words between the constraining context and the neutral context would cause a prediction error cost.

Second, prediction in language comprehension could also involve graded pre-activation so readers make diffuse, cost-free, and ubiquitous pre-activation of likely upcoming input (Luke and Christianson, 2016; for reviews, see Staub, 2015; Kuperberg and Jaeger, 2016). Compared to the lexical prediction account, the key prediction of this account is that processing the unpredictable word would not incur a prediction error cost when the expected word is another more possible alternative in a strong constraining sentence. Because not only the predictable word but also the unpredictable word would be pre-activated before the perceptual input is encountered. In the neutral context of the above example, readers would pre-activate a set of words that suit the context, like book, hat, skirt, and guitar. Please notice that these words mentioned above are nouns, which could be pre-activated at syntactic or semantic representation even if the word identities are not. Readers may not be able to predict gift, but they can be confident that the upcoming word will be a noun or something that could be carried. Thus, even if people do not predict specific words, they could predict some aspects of future stimuli (Pickering and Gambi, 2018). Therefore, according to the graded pre-activation account, the comparison of processing unpredictable words between the constraining context and the neutral context would not cause a prediction error cost.

The graded pre-activation account has been well-demonstrated in English reading (for a review see Kuperberg and Jaeger, 2016), as evidenced by the reliable correlation between word predictability (measured as word surprizal or cloze probability) and processing times (Monsalve et al., 2012; Smith and Levy, 2013; Goodkind and Bicknell, 2018), N400 amplitude (Delong et al., 2005; Frank et al., 2015), or neural activity (Henderson et al., 2016). Specifically, the word predictability was inversely correlated with reading times (e.g., gaze duration in Goodkind and Bicknell, 2018), N400 amplitudes of words (Delong et al., 2005), and changes in brain activation levels in the temporal, parietal, occipital, cingulate, and frontal regions (Carter et al., 2019). In addition, Luke and Christianson (2016) conducted a large-scale survey that provided cloze values for words in the Provo Corpus. Their results showed that most words had a more-expected competitor but with no misprediction error cost. Even if the word identity was rarely predicted, its semantic and morphosyntactic information was predictable. These findings support the graded prediction account but not the specific lexical prediction account. The null prediction error cost (as the key opinion of graded pre-activation account) also has been demonstrated by Frisson et al. (2017) using a controlled-experimental design with an eye-tracking method using a corpus study with high ecological validity.

Frisson et al. (2017) jointly manipulated the contextual constraint of sentences and the cloze probability of target words to explore the cognitive mechanism of predictability effects in English. They compared the processing of the same unpredictable word (e.g., chair) in the constraining context (e.g., “The young nervous paratrooper jumped out of the plane/chair when he heard the shots”) and the neutral context (e.g., “The tired movie maker was sleeping in the plane/chair when he was woken up by a scream”) to test the prediction error cost. Also, the cloze values for unpredictable words in the constraining and neutral sentences were comparable. Their results showed significant word predictability effects and contextual constraint effects, but null prediction error cost in the early or later eye movement measures. This study firstly provided evidence from the controlled experimental design for the absence of a prediction error cost and further supported that the graded pre-activation but not the lexical prediction account underlies the mechanism of word predictability effects.

Notably, the null prediction error cost in constraining sentences might be due to the priming effect from the pre-target word area. The richer information preceding the target words might facilitate automatic priming to the target words in the strong constraining sentences but not the neutral sentences (see Kuperberg and Jaeger, 2016). Although whether there is an interference from the priming effect in predictive processing is unclear, it is recommendable to control the pre-target region to investigate the predictive processing, especially in Chinese such visually denser scripts.

For Chinese reading, there have been several studies investigating how the word predictability affects eye movement behaviors or interplays with other linguistic factors (Rayner et al., 2005; Wang et al., 2010; Liu et al., 2018; Zhao et al., 2019; Chang et al., 2020a,b). However, studies of Chinese to date have yet to investigate the mechanism of word predictability effects. Whether prediction error cost exists in Chinese reading is still being determined. Chinese scripts lack morphosyntactic information, which readers use as cues for prediction. Moreover, parafoveal processing is more efficient in Chinese than English (Vasilev and Angele, 2017). Thus, readers might heavily rely on bottom-up perceptual processing in Chinese reading. Such Chinese script characteristics might make it hard to produce a specific word prediction in Chinese reading. Therefore, predictive processing might rely on graded pre-activation rather than lexical prediction. The present study aimed to provide experimental evidence for the graded pre-activation account in Chinese reading.

Accordingly, the present study was a follow-up to a previous study (Frisson et al., 2017) but further made more rigid control of the pre-target context. There is no explicit visual marker in Chinese to demarcate work boundaries (Li et al., 2015). Characters, the component of words, are created from differing numbers of strokes. These characteristics, therefore, bring about the increased visual density in this language and lead to deeper parafoveal pre-processing, as demonstrated by the well-established semantic preview effect in Chinese, which is equivocal in English (Zhou et al., 2013; Rayner et al., 2014). The different content immediately before the target words might influence the processing of target words differently (Reichle et al., 2003). Moreover, early eye-tracking studies have found that transitional probabilities (i.e., the statistical likelihood that word N will follow word N-1) between word N-1 and word N influence fixation times on word N (McDonald and Shillcock, 2003; Frisson et al., 2005; Wang et al., 2010). Hence, it is necessary to control the influence of the pre-target region across conditions.

Given the above considerations, we manipulated the contextual constraint and word predictability to address the question using a natural sentence reading task, consistent with Frisson et al. (2017). However, we went further by constructing compound sentences, with the first half-sentences controlling contextual constraint and the second half-sentences having identical content at least three characters before the target words to control the possible priming effect or pre-target influence on the target words. We obtained the contextual constraint effect, word predictability effect, and the prediction error cost by three comparisons: (1) constraining context-unpredictable (CU) vs. constraining context-predictable (CP), testing the word predictability effect; (2) neutral context–predictable (NP) vs. constraining context-predictable (CP), testing the contextual constraining effect, and (3) constraining context-unpredictable (CU) vs. neutral context-unpredictable word (NU), testing the prediction error cost. According to the lexical prediction account, unpredictable word processing in the constraining context would result in extra prediction error cost but not in the neutral context. Thus, we compared CU and NU to evaluate the prediction error cost, as Frisson et al. (2017).

We expected to find the typical word predictability effect, i.e., predictable words yielding shorter reading times than unpredictable words. We also expected the significant contextual constraint effect, i.e., the strong constraining sentences but not the neutral sentences make target words read faster. The contextual effects and the standard word predictability effects in the first-pass reading measures demonstrated that we manipulated the two factors successfully. However, the two effects mentioned above are not key evidences to our hypothesis. The prediction error cost (CU vs. NU) is the primary evidence for distinguishing the two accounts. Specifically, if readers spent longer time on reading unpredictable word in CU than in NU (i.e., significant prediction error cost), then the result supported the lexical prediction account, otherwise (null prediction error cost) supported the graded pre-activation account.

Materials and methods

Ethics approval

The study was approved by the research ethics committee at the Tianjin Normal University and conducted according to the Declaration of Helsinki principles.

Participant

Forty-four undergraduates aged 18–26 years (M = 20.5 years, 34 female) from the author’s university participated in the eye-tracking experiment for remuneration. The participant number was the same as Frisson et al. (2017). All were native Chinese readers, screened for normal acuity (more excellent than 20/40 in Snellen values) using a Tumbling E eye chart (Taylor, 1978), and naive to the purpose of the experiment. Informed consent was obtained from all individual participants in the study.

Design and stimuli

We constructed 48 sets of sentence frames, a number larger than Frisson et al. (2017). The experiment used a within-subjects design with the factors of sentence constraint (Constraining, Neutral) and word predictability (Predictable, Unpredictable) as independent variables. See Table 1, each sentence frame had a strong constraining sentence and a neutral sentence. The first half-sentence was manipulated to control the contextual constraint; predictable or unpredictable target words were inserted in the middle of the second half-sentence. At least three characters before target words were identical in the constraining and neutral conditions (excluding only five sets of sentences). As stated in the introduction, we conducted three comparisons to obtain the contextual constraint effect, word predictability effect, and prediction error cost. The most crucial comparison was the third one, i.e., constraining context-unpredictable word (CU) vs. neutral context-unpredictable word (NU), testing the prediction error cost. The significant prediction error cost indicates that an unexpected word in a constraining context with a predictable alternative will incur a processing cost, which supports the lexical prediction account.

TABLE 1

Table 1. An example stimulus.

In the cloze test, students were given the sentences truncated immediately before the target word and asked to provide the next word in the sentences. Twenty-two college students who did not participate in the experiment completed the cloze test. A predictable or unpredictable word was embedded in the constraining context (labeled CP and CU, respectively, see Table 1). The same two words were embedded in the corresponding neutral context and embedded in the constraining context. Given that the two target words, such as model/girl in the neutral context, were the same as targets in the constraining context, we labeled them as NP and NU, following Frisson et al. (2017). Please note that NP and NU were unpredictable because the neutral context did not provide strong word constraints. The mean cloze probability of the target words in the four conditions (CP, CU, NP, and NU) were 0.75 (SD = 0.16), 0.02 (SD = 0.04), 0.05 (SD = 0.06), and 0.04 (SD = 0.08), respectively. In the constraining context, t-tests showed that the cloze values for CP were significantly higher than for CU [t(94) = 30, p < 0.001]. In the neutral context, the two unpredictable targets had comparable cloze values [t(94) = 1.07, p = 0.288]. Importantly, the cloze values for the same unpredictable word (such as girl) in constraining and neutral contexts were comparable [t(94) = 1.52, p = 0.13].

The two target words in one sentence frame were matched for word frequency [ Cai and Brysbaert, 2010; Predictable: M = 64/million, SD = 80; Unpredictable: M = 44/million, SD = 104; t(94) = 1.06, p = 0.291] and the whole word complexity in strokes [Predictable: M = 17.41, SD = 5.11; Unpredictable: M = 15.88, SD = 4.97; t(94) = 1.50, p = 0.137]. Forty participants evaluated sentences naturalness (using a 7-point scale, ranging from 1 = entirely unnatural to 7 = entirely natural). The average ratings were 5.41 (SD = 0.74), 5.31 (SD = 0.71), 5.32 (SD = 0.66), and 5.20 (SD = 0.7) for each conditions, respectively. The ANOVA analysis showed that the four conditions were comparable in naturalness [F_{(3, 188)} = 0.85, p = 0.468].

We adopted a counterbalanced design in which the experimental sentences were divided into four lists, and one version of each sentence frame was in one list. Each participant read one list with equal numbers of sentences in each condition. Each list also included 40 filler sentences and began with six practice sentences. Eleven participants were randomly allocated to each list.

Apparatus and procedure

An SR Eyelink 1000 plus eye tracker tracked right-eye movements during binocular viewing at 1000 Hz. Stimuli were displayed in Song 32-point font as black-on-white text on a high-resolution (1920 × 1080 pixels) monitor with a fresh rate of 60 Hz. At 65 cm viewing distance, each character subtended 1° and so was of normal size for reading.

Participant took part individually and was instructed to read normally and for comprehension. At the start of the experiment, a 3-point horizontal calibration procedure was performed across the same line as each sentence presentation (ensuring 0.30° or better spatial accuracy for all participants). Calibration accuracy was checked before each trial and the eye-tracker recalibrated as required to maintain high spatial accuracy. At the start of each trial, a fixation square equal in size to one character was presented on the left side of the screen. Once the participant fixated on this location, the first half-sentence was presented with the first character replacing the square. Participant pressed the space key once they finished reading the first half-sentence. Then the same fixation square was presented again at the same position and disappeared once the participant fixated it, then the second half-sentence was presented. Participant pressed a response key once they finished reading the second half-sentence. This was replaced by a comprehension question requiring a yes/no button-press response on 25% of trials. The experiment lasted approximately 30 min for each participant.

Data analysis

Accuracy for answering comprehension questions was high for all participants (M = 84%, SD = 0.06, range = [73%, 95%]). We output the data of the second half-sentences and thus removed the data based on the second half-sentences. Following standard procedures, short (< 80 ms) and long (> 1200 ms) fixations were removed. Trials with head-movement, tracking-loss, or error were excluded, which affected seven trials (0.3%), as were trials for sentences receiving fewer than six fixations, which affected 99 trials (4.7%). In total, 5% of trials (106) were removed. The remaining data were analyzed by linear mixed-effects models (LMEs; Baayen et al., 2008) for continuous variables and generalized mixed-effects models for binomial variables, using the lme4 package (Version 1.1-21; Bates et al., 2015) in R (R Development Core Team, 2016). For all measures, models with the maximum random-effects structure were used (Barr et al., 2013), with the three comparisons as fixed factors and participant and stimuli as crossed random effects. If models did not converge, the random-effects structure was reduced by first trimming this for stimuli. Log-transformed fixation-time effects are reported alongside untransformed means. Following convention, t/z values > 1.96 were considered significant.

Results

We expected significant word predictability effects and contextual constraint effects on the early eye movement measures and explored whether unpredictable words in constraining sentences incur processing costs on early word identification or later semantic integration. Thus, consistent with Frisson et al. (2017), we reported four measures of first-pass reading for the target words, i.e., the word-skipping (SKIP, probability of not fixating a word during first-pass reading), first-fixation duration (FFD, duration of the first fixation on a word during first-pass reading), single-fixation duration (SFD, duration of the first fixation on a word receiving only one first pass fixation), gaze duration (GD, sum of all first pass fixations on a word). We also reported three measures concerning later semantic integration, i.e., regressions-out rate (RO, probability of first-pass regression from a word), regression path duration (RPD, the sum of all fixation durations beginning with the initial fixation on the target word and ending when the eyes exited the word to the right, including time spent rereading earlier words and time spent rereading the word itself) and total reading time (TRT, sum of all fixations on a target word). Target word means were shown in Table 2, and statistical effects were summarized in Table 3.

TABLE 2

Table 2. Means and standard errors for target word measures (M ± SE).

TABLE 3

Table 3. Summary of statistical effects (continuous variables were log-transformed).

Word predictability effect and contextual constraining effect

We observed significant word predictability effects (CP vs. CU) and contextual constraining effects (CP vs. NP) on the first pass reading measures (see Figure 1). The word predictability effects, significant on FFD, SFD, and GD, were due to longer reading times for CU than CP conditions (FFD: b = 0.06, CI = [0.02, 0.11], SE = 0.02, t = 2.63; SFD: b = 0.05, CI = [0.01, 0.1], SE = 0.02, t = 2.2; GD: b = 0.08, CI = [0.03, 0.13], SE = 0.03, t = 2.89).¹ The comparison between CP and NP revealed significant contextual constraining effects on the early skipping rate (b = −0.34, CI = [−0.63, −0.05], SE = 0.15, z = −2.3) and gaze duration (b = 0.06, CI = [0.01, 0.11], SE = 0.03, t = 2.16). Readers made more skipping and shorter first-pass fixation durations on the target word. The clear word predictability and contextual constraining effects indicated that we manipulated the two factors successfully.

FIGURE 1

Figure 1. Context-predictable (CP), CU, NP, and NU represent constraining context with predictable word, constraining context with unpredictable word, neutral context with predictable word, and neutral context with unpredictable word, respectively. The contrast between CP and NP represents the contextual constraining effect; the contrast between CP and CU represents the word predictability effect; the contrast between CU and NU represents the prediction error cost. Figure describes the gaze duration in each condition. Asterisks indicate significant effect where t > 1.96.

Prediction error cost

Most crucially, the prediction error cost was not significant on all the measures (| z/t| s < 1.3), i.e., an unexpected word did not incur processing cost in the constraining context with a predictable alternative, compared to the same target word in the neutral context.

We conducted Bayes factors analyses (Kass and Raftery, 1995) to determine the strength of the evidence for the null prediction error cost on the first-pass fixation time measures. The analyses were conducted using the lmBF function within the BayesFactor package (Version 0.9.12-4.2; Morey et al., 2015; R Development Core Team, 2016). Analyses were conducted with scaling factor for g-priors set to 0.5, using 10,000 Monte Carlo iterations. We first computed the Bayes Factor for a model with a fixed effect of prediction error cost (CU vs. NU) and random participant and item intercepts of FFD, SFD, and GD, i.e., BF₁. Then we computed Bayes Factor for a model with only random participant and item intercepts, i.e., BF₀. The critical value was the ratio of BF₁ and BF₀, i.e., BF₁₀, it is itself a Bayes Factor comparing the model with an effect of prediction error cost and participant and item intercepts, to a model with the only participant and item intercepts. According to Vandekerckhove et al. (2015), Bayes Factors (BF₁₀ < 1/3) were taken to provide moderate to strong evidence for the null model. Thus, the present results (FFD, BF₁₀ = 0.11; SFD, BF₁₀ = 0.03; GD, BF₁₀ = 0.27) provided moderate to strong evidence for the null model, i.e., the null prediction error cost.

Discussion

In the present experiment, we manipulated the contextual constraint of sentences and word predictability to investigate whether there is a prediction error cost in Chinese reading. We tested the prediction error cost by comparing the processing of unpredictable words between constraining contexts and neutral contexts (i.e., CU vs. NU). The results showed significant contextual effects and standard word predictability effects in the early stage of word processing, with shorter reading times (FFD, SFD, and GD) for more predictable words, which is in line with previous findings from Chinese studies (Rayner et al., 2005; Wang et al., 2010; Liu et al., 2018; Zhao et al., 2019; Chang et al., 2020a,b). Importantly, no significant prediction error cost was observed across a wide range of eye movements, i.e., the reading is not disruptive if the readers encounter the unpredictable word in a strong constraining sentence with a predictable alternative, supported by the Bayes factor analyses. This result resonated with findings from English studies (Frisson et al., 2005, 2017; Luke and Christianson, 2016). In particular, the findings suggested that readers make diffuse and graded pre-activation of likely upcoming input.

The current experiment adopted a similar design as Frisson et al. (2017). The key comparison between unpredictable words in the constraining and neutral sentences showed no prediction error cost on the fixation duration measures both for Frisson et al. and the present study. This is what we and Frisson et al. (2017) have found in common, indicating that the lexical prediction account would not seem able to account for the predictability effect both in English and Chinese. Notably, the present study differed from Frisson et al. (2017) on the numerical trend. They found a numerical trend in the opposite direction, i.e., the processing advantage for unpredictable words in constraining sentences compared to neutral sentences. Although this processing benefit did not reach significance on reading time measures, this trend was significant in the first pass regression rate (z = −2.03). The significant benefit of unpredictable words in constraining sentences might be due to the semantic priming effect or the transitional probability effect, i.e., the statistical likelihood that a word preceding the target might influence target word processing.

Like Frisson et al. (2017) study, the present study provided clear and strong evidence for null prediction error cost (t/z < 1.29). Unlike Frisson et al. (2017) we did not find significant benefits for unpredictable words in constraining sentences when controlling the pre-target region, providing stronger support for graded pre-activation account. The characteristics of the Chinese language could explain this. Chinese lacks overt cues (markers for number, gender, the tense of verbs, and case) to syntactic structure, which a reader utilizes to produce predictions about upcoming stimuli in English (see Kuperberg and Jaeger, 2016 for a review). Furthermore, the word predictability is lower in Chinese than in English, as shown by the comparison between cloze probability reported by Pan et al. (2021) in Beijing Sentence Corpus (BSC) and that by Luke and Christianson (2016) in Provo Corpus. The grand mean of cloze scores for the words in BSC is 0.07, far less than that reported in Luke and Christianson (M = 0.13). Thus, the sentence constraint in Chinese may be weaker than that in English. It is reasonable that we found more consistent results on the several eye movement measures.

The findings are consistent with the multi-representational hierarchical generative architecture, which views prediction as a graded and probabilistic phenomenon (Kuperberg and Jaeger, 2016). Also, this architecture suggests distinguishing between predictive pre-activation and pre-activation through priming. The present study attempted to control interference from the priming effect across conditions by constructing compound sentences in which the first half-sentences controlled the contextual constraint and the second half-sentences were identical at least three characters before the target words. Thus, the content of the pre-target region was identical in the constraining and neutral sentences. The null prediction error cost on the first pass reading measures and the later eye movement measures suggest that encountering an unexpected word in a constraining sentence does not interrupt early lexical identification and later semantic integration. Readers pre-activate not only one specific item but a range of possible words. The present study confirmed the graded pre-activation mechanism of predictive processing in Chinese reading.

Limitations and future directions

The study had one limitation. The number of participants in the cloze task might influence the cloze value of words. There is a positive correlation between the number of participants and the precision of word’s cloze value. Our present study recruited 22 participants for the cloze task. Although we successfully balanced the cloze values between CU and NU, however, the sample size might be not big enough provide a precise cloze value of a word.

Thus, future studies could recruit as many participants as possible to obtain more precise word cloze value. Besides, cross-linguistic studies are highly needed to explore how linguistic characteristics (e.g., word space, word length, and complexity) influence predictive language processing. In addition, to improve the external validity, studies about predictive language comprehension of special readers (e.g., non-native speakers, children with dyslexia, and older adults) are needed. These studies will inform us of the mechanism of reading difficulty for non-native speakers, children with dyslexia, and older adults.

Conclusion

In summary, we conducted an eye-tracking experiment to investigate whether processing an unpredictable word incurs prediction error cost when there is a predictable alternative. The null prediction error cost supports that the graded pre-activation account underlies the word predictability effect in Chinese reading.

Data availability statement

The original contributions presented in this study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by the research Ethics Committee at Tianjin Normal University and conducted according to the Declaration of Helsinki principles. The patients/participants provided their written informed consent to participate in this study.

Author contributions

MC and JW designed the experiment and wrote the manuscript. MC and YS experimented and analyzed the data. KZ and SL provided good suggestions. All authors contributed to the article and approved the submitted version.

Funding

This research was supported by grants from the National Natural Science Foundation of China to JW (81771823) and Fujian Social Science Planning Project under Grant to SL (FJ2020C071).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

^ The word predictability effect was also significant on RPD while we did not mention it in the Results and Discussion. As the results on RPD, RO, and TRT might represent a mixture of predictability effect and semantic integrative effect. We want to obtain the clear and genuine predictability effect. Following the tradition of eye movement research, however, we reported these later eye movement measures in the table which could be accessible for other researchers for meta-analysis. Thus, we did not mention and discuss these later eye movement measures in sections “Results and Discussion.”

References

Baayen, R. H., Davidson, D. J., and Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. J. Mem. Lang. 59, 390–412. doi: 10.1016/j.jml.2007.12.005

CrossRef Full Text | Google Scholar

Barr, D. J., Levy, R., Scheepers, C., and Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: keep it maximal. J. Mem. Lang. 68, 255–278. doi: 10.1016/j.jml.2012.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Bates, D., Mächler, M., Bolker, B. M., and Walker, S. C. (2015). Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. doi: 10.18637/jss.v067.i01

CrossRef Full Text | Google Scholar

Cai, Q., and Brysbaert, M. (2010). SUBTLEX-CH: Chinese word and character frequencies based on film subtitles. PLoS One 5:e10729. doi: 10.1371/journal.pone.0010729

PubMed Abstract | CrossRef Full Text | Google Scholar

Carter, B. T., Foster, B., Muncy, N. M., and Luke, S. G. (2019). Linguistic networks associated with lexical, semantic and syntactic predictability in reading: a fixation-related fMRI study. NeuroImage 189, 224–240. doi: 10.1016/j.neuroimage.2019.01.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, M., Hao, L., Zhao, S., Li, L., Paterson, K. B., and Wang, J. (2020a). Flexible parafoveal encoding of character order supports word predictability effects in Chinese reading: evidence from eye movements. Attent. Percept. Psychophys. 82, 2793–2801. doi: 10.3758/s13414-020-02050-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, M., Zhang, K., Hao, L., Zhao, S., McGowan, V. A., Warrington, K. L., et al. (2020b). Word predictability depends on parafoveal preview validity in Chinese reading. Vis. Cogn. 28, 33–40. doi: 10.1080/13506285.2020.1714825

CrossRef Full Text | Google Scholar

Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav. Brain Sci. 36, 181–204. doi: 10.1017/S0140525X12000477

PubMed Abstract | CrossRef Full Text | Google Scholar

Delong, K. A., Troyer, M., and Kutas, M. (2014). Pre-processing in sentence comprehension: sensitivity to likely upcoming meaning and structure. Lang. Linguistics Compass 8, 631–645. doi: 10.1111/lnc3.12093

PubMed Abstract | CrossRef Full Text | Google Scholar

Delong, K. A., Urbach, T. P., and Kutas, M. (2005). Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. Nat. Neurosci. 8, 1117–1121. doi: 10.1038/nn1504

PubMed Abstract | CrossRef Full Text | Google Scholar

Frank, S. L., Otten, L. J., Galli, G., and Vigliocco, G. (2015). The ERP response to the amount of information conveyed by words in sentences. Brain Lang. 140, 1–11. doi: 10.1016/j.bandl.2014.10.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Frisson, S., Harvey, D. R., and Staub, A. (2017). No prediction error cost in reading: evidence from eye movements. J. Mem. Lang. 95, 200–214. doi: 10.1016/j.jml.2017.04.007

CrossRef Full Text | Google Scholar

Frisson, S., Rayner, K., and Pickering, M. J. (2005). Effects of contextual predictability and transitional probability on eye movements during reading. J. Exp. Psychol. 31, 862–877. doi: 10.1037/0278-7393.31.5.862

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodkind, A., and Bicknell, K. (2018). “Predictive power of word surprisal for reading times is a linear function of language model quality,” in Proceedings of the 8th workshop on cognitive modeling and computational linguistics (CMCL), Salt Lake City, UT, 10–18. doi: 10.18653/v1/w18-0102

PubMed Abstract | CrossRef Full Text | Google Scholar

Henderson, J. M., Choi, W., Lowder, M. W., and Ferreira, F. (2016). Language structure in the brain: a fixation-related fMRI study of syntactic surprisal in reading. NeuroImage 132, 293–300. doi: 10.1016/j.neuroimage.2016.02.050

PubMed Abstract | CrossRef Full Text | Google Scholar

Kass, R. E., and Raftery, A. E. (1995). Bayes factors. J. Am. Stat. Assoc. 90, 773–795. doi: 10.1080/01621459.1995.10476572

CrossRef Full Text | Google Scholar

Kliegl, R., Grabner, E., Rolfs, M., and Engbert, R. (2004). Length, frequency, and predictability effects of words on eye movements in reading. Eur. J. Cogn. Psychol. 16, 262–284. doi: 10.1080/09541440340000213

CrossRef Full Text | Google Scholar

Kuperberg, G. R., and Jaeger, T. F. (2016). What do we mean by prediction in language comprehension? Lang. Cogn. Neurosci. 31, 32–59. doi: 10.1080/23273798.2015.1102299

PubMed Abstract | CrossRef Full Text | Google Scholar

Kutas, M., DeLong, K. A., and Smith, N. J. (2011). “A look around at what lies ahead: prediction and predictability in language processing,” in Predictions in the brain: using our past to generate a future, ed. M. Bar (Oxford: Oxford University Press), 190–207. doi: 10.1093/acprof:oso/9780195395518.003.0065

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., Zang, C., Liversedge, S. P., and Pollatsek, A. (2015). “The role of words in Chinese reading,” in The Oxford handbook of reading, eds A. Pollatsek and R. Treiman (New York, NY: Oxford University Press), 232–244.

Google Scholar

Liu, Y., Guo, S., Yu, L., and Reichle, E. D. (2018). Word predictability affects saccade length in Chinese reading: an evaluation of the dynamic-adjustment model. Psychon. Bull. Rev. 25, 1891–1899. doi: 10.3758/s13423-017-1357-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Luke, S. G., and Christianson, K. (2016). Limits on lexical prediction during reading. Cogn. Psychol. 88, 22–60. doi: 10.1016/j.cogpsych.2016.06.002

PubMed Abstract | CrossRef Full Text | Google Scholar

McDonald, S. A., and Shillcock, R. C. (2003). Low-level predictive inference in reading: the influence of transitional probabilities on eye movements. Vis. Res. 43, 1735–1751. doi: 10.1016/S0042-6989(03)00237-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Monsalve, I. F., Frank, S. L., and Vigliocco, G. (2012). “Lexical surprisal as a general predictor of reading time,” in Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, 398–408.

Google Scholar

Morey, R. D., Rouder, J. N., Jamil, T., and Morey, M. R. D. (2015). Package ‘bayesfactor’. Available online at: https://cran.r-project.org/web/packages/BayesFactor/ (accessed June 10, 2015).

Google Scholar

Pan, J., Yan, M., Richter, E. M., Shu, H., and Kliegl, R. (2021). The Beijing sentence corpus: a Chinese sentence corpus with eye movement data and predictability norms. Behav. Res. Methods 54, 1989–2000. doi: 10.3758/s13428-021-01730-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Pickering, M. J., and Gambi, C. (2018). Predicting while comprehending language: a theory and review. Psychol. Bull. 144, 1002–1044. doi: 10.1037/bul0000158

PubMed Abstract | CrossRef Full Text | Google Scholar

R Development Core Team (2016). R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.

Google Scholar

Rayner, K., Li, X., Juhasz, B. J., and Yan, G. (2005). The effect of word predictability on the eye movements of Chinese readers. Psychon. Bull. Rev. 12, 1089–1093. doi: 10.3758/BF03206448

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., Schotter, E. R., and Drieghe, D. (2014). Lack of semantic parafoveal preview benefit in reading revisited. Psychon. Bull. Rev. 21, 1067–1072. doi: 10.3758/s13423-014-0582-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., and Well, A. D. (1996). Effects of contextual constraint on eye movements in reading: a further examination. Psychon. Bull. Rev. 3, 504–509. doi: 10.3758/BF03214555

PubMed Abstract | CrossRef Full Text | Google Scholar

Reichle, E. D., Rayner, K., and Pollatsek, A. (2003). The E-Z reader model of eye-movement control in reading: comparisons to other models. Behav. Brain Sci. 26, 445–476. doi: 10.1017/S0140525X03000104

PubMed Abstract | CrossRef Full Text | Google Scholar

Scott, S. K., Mcgettigan, C., and Eisner, F. (2009). A little more conversation, a little less action - candidate roles for motor cortex in speech perception. Nat. Neurosci. 10, 295–302. doi: 10.1038/nrn2603

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, N. J., and Levy, R. (2013). The effect of word predictability on reading time is logarithmic. Cognition 128, 302–319. doi: 10.1016/j.cognition.2013.02.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Staub, A. (2015). The effect of lexical predictability on eye movements in reading: critical review and theoretical interpretation. Lang. Linguistics Compass 9, 311–327. doi: 10.1111/lnc3.12151

CrossRef Full Text | Google Scholar

Taylor, H. R. (1978). Applying new design principles to the construction of an illiterate E chart. Am. J. Optom. Physiol. Opt. 55, 348–351. doi: 10.1097/00006324-197805000-00008

PubMed Abstract | CrossRef Full Text | Google Scholar

Taylor, W. L. (1953). “Cloze Procedure”: a new tool for measuring readability. J. Q. 30, 415–433. doi: 10.1177/107769905303000401

CrossRef Full Text | Google Scholar

Vandekerckhove, J., Matzke, D., and Wagenmakers, E. J. (2015). “Model comparison and the principle of parsimony,” in The Oxford handbook of computational and mathematical psychology, eds J. R. Busemeyer, Z. Wang, J. T. Townsend, and A. Eidels (Oxford: Oxford University Press), 300–319.

Google Scholar

Vasilev, M. R., and Angele, B. (2017). Parafoveal preview effects from word N + 1 and word N + 2 during reading: a critical review and Bayesian meta-analysis. Psychon. Bull. Rev. 24, 666–689. doi: 10.3758/s13423-016-1147-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, H.-C., Pomplun, M., Chen, M., Ko, H., and Rayner, K. (2010). Estimating the effect of word predictability on eye movements in Chinese reading using latent semantic analysis and transitional probability. Q. J. Exp. Psychol. 63, 1374–1386. doi: 10.1080/17470210903380814

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, S., Li, L., Chang, M., Xu, Q., Zhang, K., Wang, J., et al. (2019). Older adults make greater use of word predictability in Chinese reading. Psychol. Aging 34, 780–790. doi: 10.1037/pag0000382

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, W., Kliegl, R., and Yan, M. (2013). A validation of parafoveal semantic information extraction in reading Chinese. J. Res. Read. 36(Suppl.1), S51–S63. doi: 10.1111/j.1467-9817.2013.01556.x

CrossRef Full Text | Google Scholar

Keywords: lexical predictability, contextual constraint, graded pre-activation, Chinese reading, eye movement

Citation: Chang M, Zhang K, Sun Y, Li S and Wang J (2023) The graded predictive pre-activation in Chinese sentence reading: evidence from eye movements. Front. Psychol. 14:1136488. doi: 10.3389/fpsyg.2023.1136488

Received: 03 January 2023; Accepted: 12 June 2023;
Published: 29 June 2023.

Edited by:

Marijan Palmovic, University of Zagreb, Croatia

Reviewed by:

Xiaolu Wang, Zhejiang University City College, China
Qiaoyun Liao, Shanghai International Studies University, China

Copyright © 2023 Chang, Zhang, Sun, Li and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jingxin Wang, d2p4cHN5QDEyNi5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

The graded predictive pre-activation in Chinese sentence reading: evidence from eye movements

Introduction

Materials and methods

Ethics approval

Participant

Design and stimuli

Apparatus and procedure

Data analysis

Results

Word predictability effect and contextual constraining effect

Prediction error cost

Discussion

Limitations and future directions

Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Footnotes

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good