Skip to main content

ORIGINAL RESEARCH article

Front. Psychol., 20 June 2024
Sec. Educational Psychology

The effects of EFL wordlist and proficiency on vocabulary knowledge

  • School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China

Introduction: The Gaokao Word List (GWL) in China serves as a guideline for learning L2 vocabulary, but there are few studies verifying its effect on university EFL learners’ vocabulary knowledge.

Method: This study investigated the effects of the GWL and EFL proficiency on 66 Chinese university EFL learners’ vocabulary knowledge by administering word recognition tests.

Results and discussion: The results showed that: (1) the GWL had significant effects on participants’ receptive vocabulary knowledge; (2) EFL proficiency had significant effects on participants’ word recognition, without interaction with the GWL. These findings were discussed through the lens of frequency of exposure, accounting for the overwhelming GWL effect on learners’ vocabulary knowledge. We suggest EFL proficiency be taken into consideration when the GWL is revised in the future, to smoothen the transition in vocabulary learning from high school to university, and improve vocabulary learning efficiency.

1 Introduction

Vocabulary is crucial to foreign language teaching and learning. As suggested by Meara (1996), learners with larger vocabulary are more proficient in various language skills than those with smaller vocabulary. However, words in a language are too numerous to acquire, especially for L2 learners. Considering the workings of human memory and the tendency to forget information over time, long-term retention of newly-learnt vocabulary poses great challenges for learners. The majority of forgetting occurs shortly after initial learning, with a decrease in the rate of forgetting thereafter (Schmitt, 2010b). It is recommended to increase the frequency of exposure so as to retain more words in memory. To repeatedly encounter target words, West et al. (1934) introduced the concept of “vocabulary selection,” and wordlist is the realized product of such concept. In high schools in China, the Gaokao Word List (GWL) is deemed as the framework for foreign language teaching and learning, since the Gaokao, or the National College Entrance Examination in China, is a standardized test taken by over 10 million students in their final year of high school each year to gain admission to universities. Due to the competitiveness of the life-changing exam, students undergo extensive preparation to excel in this high-stakes examination, reflecting the importance of this exam in shaping students’ future prospects. In practice, teachers usually follow the GWL in their routine vocabulary instruction to expand students’ vocabulary in preparation for the Gaokao (Gu and Li, 2009; Bao and Xu, 2022).

Although the GWL plays a guiding role in vocabulary learning, the extent of its influence is unknown. It remains to be investigated whether the vocabulary knowledge of university EFL learners, who have graduated from high schools for over 3 years, still displays the influence of the GWL. In EFL learning contexts, is it likely that advanced EFL learners are immune to the effect of the GWL, whereas lower-proficiency learners are more prone to the GWL effect? In response, this study examined the effects of the GWL and L2 proficiency on EFL learners’ vocabulary knowledge, so as to see whether the GWL effect on learners’ vocabulary knowledge is long-lasting, and whether learners of different proficiency levels react differently to the GWL.

2 Literature review

2.1 Word list

In EFL learning, vocabulary is considered to be of utmost importance. Nevertheless, a significant portion of EFL students encounter difficulties due to inadequate exposure to target language vocabulary, whereas repeated exposure to the target vocabulary is deemed essential for the effective acquisition of new words (Webb, 2007). Previous studies indicate that learners may need between 5 to 16 or more exposures to successfully learn a new word (Nation, 1990). Although some new words may be acquired incidentally, the vocabulary knowledge is often not deeply entrenched (Pigada and Schmitt, 2006). In contrast, intentional vocabulary learning involves explicit instruction and repeated focus on words by learners. It has been shown when learners consciously concentrate on acquiring vocabulary, the retention rate is higher compared to incidental learning (Schmitt, 2010b).

Many EFL learners equate language learning with intentional vocabulary learning, investing considerable time in memorizing words from wordlists (Read, 2010), in an attempt to increase exposure to target words. Wordlists, commonly used tools for explicit vocabulary learning, are typically created based on specific criteria to facilitate English learners’ vocabulary development (Nation, 2016). When it comes to developing vocabulary knowledge, the utilization of wordlists is highly beneficial to EFL learners with limited exposure to the target language (Read, 2010; Yamamoto, 2014). Depending on their design, wordlists can serve at least two heuristic functions. Firstly, a wordlist can act as a reflection of what learners have learned, acting as a representation of their knowledge. Secondly, a wordlist can function as a guide for what learners should learn (Pinchbeck et al., 2022).

English, being one of the mandatory subjects in high school, holds a pivotal role in the Gaokao. The Ministry of Education of China (2017) has established definitive guidelines for English learning and evaluation, i.e., the English Curriculum Standards for High Schools, which outlines the vocabulary to be taught, learned, and evaluated in the Gaokao. In accordance with the official Gaokao Word List, high school graduates should aim to acquire around 3,000 basic words. The GWL, compiled by language educators with expertise in EFL teaching, serves as a guide for students by focusing on common expressions related to various aspects of daily life (Bao and Xu, 2022). Despite indications that the selected words are relevant to commonly discussed topics, the specific criteria for word selection were not clearly reported in the introduction of the GWL. By calculating the coverage of the GWL words in the BNC/COCA frequency-based wordlists (i.e., a series of 1,000-word lists based on frequency level) (Nation, 2016), we found that 79.37% of the GWL words are covered in the first 3,000 words (i.e., 41.69% in the 1st 1,000 words, 27.92% in the 2nd 2000 words, 9.76% in the 3rd 1,000 words), indicating that frequency could be the basic criterion for selecting words on the GWL.

The GWL was created to assist students in preparing for the Gaokao English exam, serving as a crucial reference for curriculum design, word selection in learning materials, classroom instruction, and language assessments (Sun, 2005; Gu and Li, 2009; Bao and Xu, 2022). Language-focused education in China typically involves explicit vocabulary instruction, with the official wordlist being a valuable resource for such endeavors (Sun, 2005). High school students in China commonly employ the wordlist learning strategy (Qian, 2020), spending a large amount of time memorizing words from the GWL in preparation for the Gaokao exam. Teachers often provide explicit vocabulary instruction and frequent quizzes based on the wordlist to monitor students’ progress in vocabulary knowledge (Zhang, 2015). Additionally, the construction of the Gaokao exam heavily relies on the GWL, which serves as a primary guideline for English testing. Nation (2016) stressed the importance of empirical studies on influential wordlists. Reynolds et al. (2018) discovered that wordlists designed for high school students have a lasting impact on university students’ vocabulary knowledge, with listed words significantly outperforming off-list words. Despite the widely recognized influence of the GWL on vocabulary learning and teaching, there is a lack of empirical evidence verifying its effects.

2.2 Vocabulary knowledge assessment

Vocabulary knowledge can be categorized into receptive (passive) and productive (active) dimensions (Henriksen, 1999; Laufer and Goldstein, 2004; Milton, 2009; Schmitt, 2010a). Receptive vocabulary knowledge involves recognizing the form of a word (Laufer et al., 2004), or providing its synonym or translation in one’s first language (Webb, 2009), while productive vocabulary knowledge is often defined as the ability to recall both the form and meaning of words (Laufer et al., 2004; Webb, 2008). Nation (2013) expanded this view by considering vocabulary knowledge as multifaceted, encompassing form (e.g., pronunciation, spelling), meaning (e.g., form/meaning relationships, associations), and use (e.g., grammatical functions, collocations). However, in vocabulary research, the focus tends to be on the form and meaning of words (Laufer and Goldstein, 2004; Zhong, 2018), with many studies operationalizing vocabulary breadth knowledge as knowledge of form-meaning mapping (Laufer and Goldstein, 2004; Schmitt, 2014).

The testing of form-meaning mapping knowledge falls into four categories: passive recognition test, passive recall test, active recall test, and active recognition test (Laufer et al., 2004; Laufer and Goldstein, 2004; Schmitt, 2010a). Passive recognition test is a commonly utilized format that assesses learners’ vocabulary knowledge by requiring learners to demonstrate that they know the target language word by recognizing the form of a word or selecting the corresponding L1 translation from provided options. This test format is considered fundamental in evaluating word knowledge (Milton, 2009). On the other hand, passive recall test requires learners to supply the L1 translation of an L2 word (Laufer and Goldstein, 2004; Schmitt, 2010a). In contrast to passive tests, active recall test prompts learners to provide the L2 target word in response to its L1 translation. Another vocabulary test format is the active recognition test, where learners must choose the corresponding L2 word when given the L1 translation. These assessment formats typically involve translation equivalents (Laufer and Goldstein, 2004). However, the use of translation as a teaching or testing tool is not always liked. Communicative approaches to language teaching favor gap-fill tests where learners are instructed to fill in missing words in sentences to demonstrate their knowledge of the target words (Milton, 2009).

The assessment of vocabulary knowledge was moderated by test format (Bowles and Salthouse, 2008). For example, tasks requiring the recall of lexical meanings and forms, such as word definition tasks, show greater susceptibility to individual differences compared to word recognition tasks, which only involves recognizing the forms of the L2 words (Verhaeghen, 2003; Bowles and Salthouse, 2008). Word recognition tasks do not require retrieving the target word from memory, thereby helping to mitigate the confounding effect of individual differences. In this study, the vocabulary knowledge of EFL learners will be assessed using word recognition task.

The Yes/No vocabulary test is a passive recognition test format that has been extensively researched (Milton, 2009). This test format is designed to gauge learners’ receptive vocabulary through a simple judgment of whether a given lexical item is known or not (Meara, 1996). Within the vocabulary testing literature, Yes/No vocabulary tests have been proposed as easy alternatives to multiple-choice tests (Meara and Buxton, 1987). It has been utilized in the large-scale European Dialang project for assessing vocabulary proficiency in 14 European languages.1 Compared to other assessment methods, the Yes/No format offers the advantage of being quick and easy to construct the vocabulary test. Since participants are less likely to feel bored and lose concentration in a short time span, the test results are likely to be more reliable (Milton, 2009).

There are also criticisms regarding the limitations of the vocabulary test format. One issue is that learners’ self-report of their knowledge of a word may not be an accurate reflection of their actual vocabulary knowledge (Nation, 1990). When learners do not really recognize a word or are not sure, they might guess. Lucky guesses may lead to an overestimation of the learners’ true vocabulary knowledge (Urdaniz and Skoufaki, 2022). To address the potential guesswork, pseudowords are often included in vocabulary tests. Pseudowords are fabricated words that resemble real words but are not actually part of the language (Milton, 2009). In this study, we utilized pseudowords generated by Wuggy (Keuleers and Brysbaert, 2010) to take into consideration the possibility that certain learners might overestimate their knowledge. Another issue is that the Yes/No test format may pose challenges for individuals with dyslexia or those whose L1 and L2 are cognate languages (Beeckmans et al., 2001). Cognates have typically been described in same-script language pairs as words that share similar form and meaning (Dijkstra, 2007). Studies involving different-script languages have also used the term “cognate” to indicate phonologically similar lexical pairs (Kim and Davis, 2003; Hoshino and Kroll, 2008). There exist a few Chinese-English cognates, which are transliterated loan words, with their pronunciations as similar as possible to their English counterparts (Wen and van Heuven, 2017). To address these concerns, we excluded transliterated loanwords from our experimental stimuli and recruited Mandarin-speaking university students without dyslexia to participate in the study.

Several studies have confirmed the efficacy of the Yes/No test format. Mochida and Harrington (2006) utilized the same lexical items in both the Yes/No test and the Vocabulary Levels Test (VLT) to allow a direct comparison of test performance. They found that performance on the Yes/No test could reliably predict scores on the VLT among EFL university students. The various scoring methods for the Yes/No test all demonstrated strong predictive power for VLT performance, with correlation coefficients exceeding 0.85. Regardless of the scoring method used, the Yes/No test performance accounted for more than 75% of the variance in VLT scores. Also, Meara (1996) observed a moderately strong correlation, around 0.7, between Yes/No test performance and traditional multiple-choice tests taken by EFL learners. Further supporting the efficacy of the Yes/No test, Lemhöfer and Broersma (2012) developed the LexTALE, a validated Yes/No vocabulary measure that reliably assesses English vocabulary knowledge among medium- to high-proficient EFL learners. The relevant literature findings confirm the reliability and validity of the Yes/No test as a measure of L2 receptive vocabulary knowledge, prompting the current study to adopt the Yes/No vocabulary test format to evaluate learners’ receptive vocabulary knowledge.

2.3 L2 proficiency and vocabulary knowledge

The importance of English proficiency in vocabulary acquisition is well-established in relevant literature, with vocabulary knowledge often seen as a key indicator of overall L2 proficiency (Alderson, 2005; Mitsugi, 2018; Zeng et al., 2019). Variations in L2 proficiency can significantly influence learners’ vocabulary retention and word recall abilities (Agustin-Llach, 2022). Previous research has shown that learners’ proficiency levels play a crucial role in their vocabulary learning outcomes. For example, Teng (2022) identified English proficiency as a notable predictor of vocabulary learning performance, aligning with findings from Tekmen and Daloǧlu’s (2006) study, wherein advanced learners acquired a significantly higher number of words compared to those with lower proficiency levels. Moreover, studies have consistently demonstrated a positive relationship between L2 proficiency and vocabulary knowledge. Studies by Masrai (2023) and Zareva et al. (2005) have highlighted a strong correlation between vocabulary knowledge and overall language proficiency. Specifically, Milton et al. (2010) revealed significant positive associations between written and aural vocabulary knowledge and the performance of L2 learners in various language skills such as writing, reading, listening, and speaking.

During the initial stages of language acquisition, L2 learners are prone to be negatively affected by their first language, which can hinder the acquisition of L2 vocabulary. The similarities in structure between learners’ L1 and L2 can have an impact on the ease with which vocabulary is learned (Crystal, 1987). It is suggested that learning a second language that shares structural similarities with one’s L1 may be easier compared to learning a language that is structurally different. For example, the comprehension and production of words containing phonemes that are absent in the learner’s L1 can present difficulties (Laufer, 1990). However, as L2 proficiency improves, learners will concurrently incorporate more sophisticated lexical items in their language use. A longitudinal study has confirmed the influence of proficiency levels on vocabulary acquisition (Salsbury et al., 2011).

While extensive research exists on the relationship between vocabulary knowledge and L2 proficiency, limited attention has been given to investigating how proficiency levels influence vocabulary acquisition when learners are guided by an official wordlist. It remains largely unknown whether learners with varying proficiency levels respond similarly to the guided wordlist (i.e., the GWL). Therefore, it is imperative to examine how L2 proficiency influences vocabulary acquisition when guided by the GWL.

2.4 The current study

The GWL was compiled to assist vocabulary learning and teaching during high school. We hypothesized that the GWL exerts significant effects on EFL learners’ vocabulary knowledge, even long after they enter universities. Whether university EFL learners’ vocabulary knowledge is still shaped or constrained by the high-school GWL after over 3 years in university remains to be revealed. However, few studies have been conducted to confirm the washback effect of the GWL on the vocabulary knowledge of university students, and to provide empirical evidence for the revision or compilation of the wordlist. In classroom settings, the GWL presents all target words to learners of different proficiency levels. During vocabulary learning process, whether learners’ proficiency levels interact with the GWL influence is still unknown to us. We thus have no idea whether compiling graded wordlists tailored to learners of different proficiency levels is advisable. Nevertheless, few studies have investigated whether L2 proficiency affects wordlist-based vocabulary learning. Thus, by consulting Brysbaert et al.’s (2021) standard online lexical recognition paradigm featuring good reliability and validity in evaluating learners’ vocabulary knowledge, this study explores the effects of an official wordlist for high school learners and L2 proficiency on vocabulary knowledge of university EFL learners across China.

The research questions are as follows:

1. Does the GWL have long-lasting effects on EFL learners’ vocabulary knowledge?

2. Does L2 proficiency have any impact on EFL learners’ wordlist-based vocabulary learning?

3 Method

3.1 Participants

A total of 66 university students with Mandarin Chinese as their L1, who had taken the Gaokao, were recruited as participants via online crowdsourcing in the present study. The sampling method has been frequently adopted in previous seminal studies to access participants from diverse geographical areas, thus enhancing the sample’s representativeness (e.g., Kuperman et al., 2012; Warriner et al., 2013; Brysbaert et al., 2014). The participants come from various regions in China, including East China, Central China, South China, North China, Northeast, Northwest, and Southwest. Participants completed a set of vocabulary recognition tests, which contained 30 pseudowords to help detect participants’ noncompliance with task instructions or inattentiveness to the task. Following the practice by Brysbaert et al. (2021) to encourage faithful responses, we explicitly warned the participants that both true words and pseudowords were included and that “Yes” responses to pseudowords would result in penalty points. To ensure data quality, an automatic filter was implemented so that surveys with more than five “Yes” responses to pseudowords were excluded from the final dataset, which is a common practice in word recognition task. As a result, 24 participants were removed. Participants who gave “N” response (i.e., “I do not know the meaning of this word”) to all real and pseudo words were also excluded from analysis. Accordingly, two participants were filtered out. The final sample included 40 participants (62.5% female), with their mean age being 20.68 years (SD = 1.68). The majority of participants in the study are either junior (27.5%) or senior (50%) university students, with sophomores and freshmen making up 17.5 and 5% of the sample, respectively.

3.2 Experimental materials

The vocabulary recognition tests employed in this study contained 35 GWL listed words, 35 matched words outside the GWL (referred to as “the matched off-list words”), and 30 pseudowords (Note. the rationale for using pseudowords in word recognition test has been clarified in section 2.2 Vocabulary Knowledge Assessment). Specifically, the procedure of word set construction was as follows.

Firstly, we imported the following information for the 2,070 words on the GWL: word frequency, dominant part of speech, and word concreteness. As the GWL encompasses general English words used in everyday interactions (Bao and Xu, 2022), we opted to retrieve word frequency from the SUBTLEX-US corpus, which is derived from subtitles of movies and TV shows and contains 51 million word tokens from 8,388 subtitle files (Brysbaert and New, 2009). Traditionally, corpora were primarily sourced from books, newspapers, and magazines, which often cover themes not commonly discussed in daily life (Brysbaert and New, 2009). Conversely, subtitles from films and television offer a more realistic representation of everyday language usage (New et al., 2007). In addition, research on American English and other languages has indicated that word frequencies based on film and television subtitles are better predictors of word processing efficiency than word frequencies based on books and other written sources (e.g., Brysbaert and New, 2009; Cai and Brysbaert, 2010; Keuleers et al., 2010).

We utilized the Zipf value of word frequency, which is measured on a logarithmic scale ranging from 1 (representing low frequency) to around 7 (representing high frequency). The Zipf value is calculated using the formula log10 (frequency per billion words), where a Zipf value of 1 corresponds to a word frequency of 10 per billion words and a Zipf value of 2 corresponds to a word frequency of 100 per billion words, and so on (van Heuven et al., 2014). The Zipf value was chosen in lieu of using raw frequency data for two reasons. Firstly, the size of a corpus can skew raw frequency data. Secondly, the raw frequency effect tends to follow a logarithmic curve, thereby amplifying the differences between words with similar frequencies (Brysbaert et al., 2018).

Secondly, the dominant part-of-speech (PoS) information was sourced from the SUBTLEX-US corpus. Within the SUBTLEX-US corpus, the CLAWS algorithm, developed at Lancaster University, was employed to assign PoS tags to words. The accuracy of the CLAWS tagger has been validated through its application to other corpora (e.g., the British National Corpus and the Corpus of Contemporary American English), achieving accuracy rates between 96 and 97% (Brysbaert et al., 2012). The SUBTLEX-derived PoS tags come with information on how frequent each word form is used as a particular part of speech (Strik Lievers et al., 2021). Furthermore, the PoS tags obtained from this corpus comprise one of the most comprehensive sets of part-of-speech tags readily available for statistical analysis (Brysbaert et al., 2012).

Thirdly, word concreteness values were extracted from the largest database of word concreteness ratings available (Brysbaert et al., 2014). A total of over 4,000 fluent English-speaking adults were invited to rate 37,058 words on a scale ranging from 1 (representing high abstractness) to 5 (representing high concreteness). The correlation between the concreteness ratings provided by the MRC database (Coltheart, 1981) and those by Brysbaert et al. (2014) was notably strong (r = 0.919), highlighting the validity of the concreteness ratings.

Lastly, 219 words that do not appear in the GWL were sampled matching the GWL words on part-of-speech, word length [t (68) = 0.000, p = 1], word frequency [t (68) = 0.127, p = 0.900], number of morphemes [t (68) = 0.000, p = 1], orthographic similarity [t (68) = 0.127, p = 0.889], and concreteness [t (68) = 0.051, p = 0.960]. A series of independent samples t-tests have shown that there are no significant differences between the two sets of words in terms of the lexico-semantic variables, as shown in the above square brackets. The descriptive data for the two word subsets are displayed in Table 1. Then, we randomly selected 35 word pairs from the 219 pairs of word stimulus.

Table 1
www.frontiersin.org

Table 1. Descriptive data for the two word subsets.

Our word recognition task required the participants to indicate whether they knew each item by selecting “Yes” or “No.” This is a standard online word recognition paradigm, which has shown good reliability and validity in assessing participants’ vocabulary knowledge (Brysbaert et al., 2021). The inter-rater reliability as measured by Intraclass Correlation Coefficient for the vocabulary recognition tests was 0.869, indicating the vocabulary tests are reliable. In addition, to check the consistency of results across different participants, we tested another group of university EFL learners using the same word recognition task (N = 44, average age = 20.43 years). The Pearson correlation between the two groups’ vocabulary recognition performance was significantly strong (r = 0.892, p < 0.001), which suggests the reliability of the vocabulary test.

The English proficiency test used in this study was adapted from the Syndicate (2001). Our pilot study suggested that selecting 10 multiple-choice items and 10 cloze items (i.e., the total score is 20) was most suitable for an online survey paradigm, balancing test time and effectiveness. The internal reliability of this proficiency test, as calculated by Cronbach’s Alpha, was 0.779, indicating a high level of reliability.

3.3 Experimental procedures

The online testing was comprised of vocabulary recognition tests and English proficiency test. Participants were assigned to complete the vocabulary recognition test by selecting “Yes” or “No.” Subsequently, participants completed the English proficiency test and provided demographic information.

3.4 Statistical analysis

To answer the two research questions, we conducted linear mixed-effects (LME) modeling with the lme4 package in R because it allows including participant as random effect, which enables us to account for individual differences. We included the following variables as fixed effects in the LME model: L2 proficiency, word source (listed words vs. off-list words), and the interaction between proficiency and word source. The recognition rate was the dependent variable. Finally, effect sizes were calculated for the model using the MUMIn function. MUMIn provides R2 values for the fitted model in two forms. Marginal R2 values are associated with the fixed effects, while conditional R2 values reflect both the fixed and the random effects combined.

4 Results (the effects of the GWL and L2 proficiency on vocabulary knowledge)

Descriptive statistics showed that the average L2 proficiency of the 40 participants was 12.175 (SD = 4.408). The recognition rates for these two sets of words are shown in Table 2. The vocabulary recognition rates of the GWL listed words were higher than those of the off-list words.

Table 2
www.frontiersin.org

Table 2. Recognition rates.

The results of the linear mixed-effects model are presented in Table 3. Considering the R2 values (R2 marginal = 0.594, R2 conditional = 0.907), this model seems to explain a substantial amount of variance in learners’ word recognition performance. The LME results indicate that both word source and L2 proficiency have significant effects on participants’ receptive knowledge of general English words.

Table 3
www.frontiersin.org

Table 3. The LME model for recognition rates.

As for the first research question, the results reveal that a significant difference in word recognition is observed between the GWL listed words and the matched off-list words, (F (1, 38) = 54.306, Pr(>F) < 0.001). The result indicates that the recognition rates of the words in the GWL are significantly higher than those of the words outside the list. Our result is consistent with the findings by Reynolds et al. (2018) that the reference wordlist in high school has a significant impact on university learners’ vocabulary knowledge.

As for the second research question, further analysis using the anova() function reveals that L2 proficiency makes a positive and significant contribution to participants’ vocabulary knowledge of general English words, (F (1, 38) = 6.198, Pr(>F) = 0.017). Our results indicate that vocabulary knowledge tends to develop concurrently with the improvement of L2 proficiency. Learners of higher proficiency boast a larger vocabulary than do learners of lower proficiency. The LME results reveal that there is no interaction effect of L2 proficiency by word source on word recognition, (F (1, 38) = 0.028, Pr(>F) = 0.867).

5 Discussion

By manipulating the word source (on-list words vs. off-list words), our study revealed that both GWL and L2 proficiency have significant effects on learners’ vocabulary knowledge, without interaction. After controlling for other lexical variables, the GWL listed words were recognized more accurately than their off-list counterparts. Even after learners have studied in universities for over 3 years, the GWL effect on learners’ vocabulary knowledge is still robust. Under the guidance of the GWL, L2 proficiency still significantly influenced learners’ vocabulary knowledge. Learners of higher proficiency boast more vocabulary knowledge than do learners of lower proficiency. We discussed our study results on the basis of relevant psycholinguistic theories.

5.1 The wordlist effect on vocabulary knowledge

The GWL has a significant effect on vocabulary recognition, consistent with the findings by Reynolds et al. (2018) that processing words within wordlist is superior to that outside the list. The participants in this study were university students, most of whom were juniors or seniors. Even after 3 years or more, their knowledge of the GWL listed words learned in high school remained significantly superior to that of the off-list words. This finding suggests that not only does the wordlist influence EFL vocabulary learning in high school, but it also has a strong and long-lasting effect on university learners, in line with the findings of Reynolds et al. (2018), which examined the washback effect of an official reference wordlist compiled for high school learners to prepare for the College Entrance Examination. They revealed the significant influence of the wordlist on the vocabulary learning and retention of EFL university students, indicating the long-lasting influence of official reference wordlists.

The wordlist authorized by the national education authority plays a significant role in language education, including teaching, learning, assessment, and curriculum development (Sun, 2005; Bao and Xu, 2022). In practice, textbooks, classroom instruction, and language tests aim to incorporate as many words from the official list as possible (Nation, 2016). English teaching and learning materials developed based on the wordlist are the primary source of input for high school students in China (Gu and Li, 2009), where achieving a high score on the Gaokao exam is crucial for admission to top universities. Teaching and learning strategies in Chinese high schools are influenced by the wordlist and test-taking objectives, with a focus on preparing students for the Gaokao exam (Bao and Xu, 2022). Students dedicate most of their English learning time to materials that align with the Gaokao exam (Sun, 2005; Gu and Li, 2009). Through repeated exposure to the listed words in various contexts, learners are more likely to acquire and retain these listed words. Research has shown that encountering a word at least eight times increases the likelihood of acquisition (Gullberg et al., 2012; Elgort et al., 2018; Peters and Webb, 2018), highlighting the importance of repeated exposure in vocabulary learning (Nation, 2014). Thus, the official wordlist brings about learners’ repeated encounters with the listed words, which facilitates the acquisition of these words, as evidenced by significantly higher recognition rates of the listed words compared to words not included in the list.

The interpretation of the effect that frequency of exposure has on vocabulary acquisition is straightforward: the more often a learner is exposed to any learning material, the stronger the memory and the more likely the material will be learned and retained over time (Ellis, 2012). According to the lexical entrenchment hypothesis (Diependaele et al., 2013), the process of acquiring words is driven by exposure. The total number of exposures to words, which accumulates over time, reflects a person’s experience and usage of those words. Variation in exposure to words in a language is the main factor that determines word processing, both in L1 and L2 (Brysbaert et al., 2017). The recognition differences between the GWL listed words and the off-list words can be explained by variations in exposure to these two sets of words. Through repeated encounters with the listed words, the quality of their vocabulary knowledge is enhanced (Nation, 2013), and the links between the form and meaning representations of these listed words are stronger than those of the off-list words. After learners repeatedly encountered the GWL listed words, the representations of these listed words were entrenched in mental lexicon and were retained over time. With the mapping between lexical form and meaning reinforced, learners can accurately activate the corresponding conceptual representations in recognition tests. By contrast, the off-list words are exposed far less, which leads to less efficient word recognition. Under the guidance of the GWL, students engage in learning the same set of words from textbooks, consult the GWL or dictionaries, encounter them in their readings, and repeatedly practice them. While this intensive approach may foster a deep comprehension of the listed vocabulary, excessive focus on a limited number of words could potentially limit learners’ motivation or ability to acquire a broader range of vocabulary (Schmitt, 2014).

5.2 The L2 proficiency effect on wordlist-based vocabulary knowledge

The L2 proficiency has a significant impact on participants’ ability to recognize vocabulary, which supports previous research findings (e.g., Zeng et al., 2019; Agustin-Llach, 2022; Teng, 2022) that English proficiency serves as a strong predictor of vocabulary knowledge or word recognition performance.

On the one hand, as learners’ English proficiency improves and vocabulary exposure accumulates, their semantic network becomes richer, resulting in improved vocabulary knowledge (Schmitt, 2014; Enayat and Derakhshan, 2021). Learners with higher proficiency are able to respond accurately when words are presented, whereas those with lower proficiency may provide incorrect responses due to erroneous connections between lemma (i.e., semantic and syntactic information about words) and lexeme (i.e., morphological and formal information about words), or weak connections that fail to activate the corresponding lemma information (Jiang, 2000). On the other hand, morphological awareness (MA) varies among learners of different proficiency levels. Higher-proficiency learners possess strong MA, which allows them to process target words efficiently, particularly words with complex morphology (Zeng et al., 2019). Nagy et al. (2014) suggest that learners with strong MA are able to decode morphologically complex words, such as derived words and compound words, by utilizing both morphological and semantic channels. In contrast, learners with weak MA may have difficulty recognizing morphologically complex words due to blockage or absence of these channels.

There is no interaction between L2 proficiency and the wordlist, indicating that the wordlist has beneficial impacts on vocabulary knowledge of learners with varying proficiency levels, consistent with findings by Yamamoto (2014). Lower-proficiency learners are significantly poorer at recognizing words than their higher-proficiency counterparts, even for the GWL listed words. Vocabulary knowledge contributes to improving overall language proficiency, which in turn affects learners’ vocabulary recognition outcomes (Laufer and Nation, 1995; Nation, 2013). Constrained by language proficiency, lower-proficiency learners tend to acquire poor or incomplete vocabulary knowledge (e.g., inaccurate lexeme information, lack of lemma information, lack of links between lemma and lexeme), even when guided by the same wordlist. With higher demands for future academic development, learners with higher proficiency should expand their vocabulary knowledge beyond the current scope of the GWL. Their repertoire of words should include more advanced vocabulary appropriate for their level of proficiency and future academic growth. A single reference wordlist for learners with different proficiency levels might disregard the variations in their demands of language development, and might impede their ability to effectively expand their vocabulary knowledge. To address the issue, Nation (2016) proposed that wordlist compilers should create graded wordlists that align with learners’ language proficiency levels. Thus, we propose that future revisors of the GWL should take into account the L2 proficiency of high school students across the country and develop graded wordlists that cater to learners at different proficiency levels.

Last but not least, our results (i.e., both wordlist and L2 proficiency significantly influence EFL learners’ vocabulary knowledge), together with those of Reynolds et al. (2018) can be generalized to EFL wordlists outside China. That is, in EFL learning contexts where target language input is limited, when assisted or guided by a wordlist, EFL learners’ vocabulary knowledge will be greatly influenced or shaped by the wordlist. With learners’ repeated exposure to the listed vocabulary, the knowledge of the listed words becomes more entrenched in learners’ mental lexicon than the off-list words over a long period of time. When developing wordlists for EFL learners, wordlist compilers or revisors should take wordlist’ long-term washback effects and L2 proficiency effect into consideration.

6 Conclusion

By manipulating word source (i.e., included in the GWL or not), we conducted an online vocabulary knowledge test to examine the effects of the GWL and L2 proficiency on university EFL learners’ word recognition performance. Our study revealed that the GWL had a significant and long-lasting effect on the vocabulary knowledge of university EFL learners in China, with the recognition rates of the list words being significantly higher than those of the off-list words. Moreover, L2 proficiency had a significant effect on vocabulary recognition, and had no interaction with the GWL. Be the words on-list or off-list, the word recognition performance of higher-proficiency participants was significantly better than that of lower-proficiency participants.

Our findings verify the significant guiding role of wordlists in EFL vocabulary learning. In EFL learning environments with limited target language input, wordlists specify the target vocabulary for learners, making vocabulary teaching/learning more efficient and straightforward. In addition, the long-term washback effect of wordlist has been confirmed, with knowledge of the listed words entrenched in learners’ mental lexicon. Even after 3 years or more, learners’ vocabulary knowledge within the scope of wordlist is still significantly superior to that outside the list. To improve EFL learners’ vocabulary learning efficiency, wordlist compilers should consider users’ L2 proficiency so that graded wordlists tailored to various proficiency levels can be developed.

There are some limitations in this study. Since we adopted the online recognition paradigm to reach participants from various regions in China, we did not collect the reaction time data, which might also reflect the effects of the variables concerned on vocabulary recognition performance.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical approval was not required for the studies involving humans in accordance with the local legislation and institutional requirements. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

ZB: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. CP: Conceptualization, Formal analysis, Funding acquisition, Project administration, Resources, Software, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

References

Agustin-Llach, M. (2022). How age and L2 proficiency affect the L2 lexicon. System 104:102697. doi: 10.1016/j.system.2021.102697

Crossref Full Text | Google Scholar

Alderson, C. (2005). Diagnosing foreign language proficiency: The Interface between learning and assessment. London: Continuum.

Google Scholar

Bao, Z., and Xu, X. (2022). Evaluating word lists against word frequency, lexical age-of-acquisition and concreteness. Lingua 278:103417. doi: 10.1016/j.lingua.2022.103417

Crossref Full Text | Google Scholar

Beeckmans, R., Eyckmans, J., Janssens, V., Dufranne, M., and Van de Velde, H. (2001). Examining the yes/no vocabulary test: some methodological issues in theory and practice. Lang. Test. 18, 235–274. doi: 10.1177/026553220101800301

Crossref Full Text | Google Scholar

Bowles, R. P., and Salthouse, T. A. (2008). Vocabulary test format and differential relations to age. Psychol. Aging 23, 366–376. doi: 10.1037/0882-7974.23.2.366

PubMed Abstract | Crossref Full Text | Google Scholar

Brysbaert, M., Keuleers, E., and Mandera, P. (2021). Which words do English non-native speakers know? New supernational levels based on yes/no decision. Second. Lang. Res. 37, 207–231. doi: 10.1177/0267658320934526

Crossref Full Text | Google Scholar

Brysbaert, M., Lagrou, E., and Stevens, M. (2017). Visual word recognition in a second language: a test of the lexical entrenchment hypothesis with lexical decision times. Biling. Lang. Congn. 20, 530–548. doi: 10.1017/S1366728916000353

Crossref Full Text | Google Scholar

Brysbaert, M., Mandera, P., and Keuleers, E. (2018). The word frequency effect in word processing: an updated review. Curr. Dir. Psychol. Sci. 27, 45–50. doi: 10.1177/0963721417727521

Crossref Full Text | Google Scholar

Brysbaert, M., and New, B. (2009). Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav. Res. Methods 41, 977–990. doi: 10.3758/BRM.41.4.977

PubMed Abstract | Crossref Full Text | Google Scholar

Brysbaert, M., New, B., and Keuleers, E. (2012). Adding part-of-speech information to the SUBTLEX-US word frequencies. Behav. Res. Methods 44, 991–997. doi: 10.3758/s13428-012-0190-4

PubMed Abstract | Crossref Full Text | Google Scholar

Brysbaert, M., Warriner, A. B., and Kuperman, V. (2014). Concreteness ratings for 40 thousand generally known English word lemmas. Behav. Res. Methods 46, 904–911. doi: 10.3758/s13428-013-0403-5

PubMed Abstract | Crossref Full Text | Google Scholar

Cai, Q., and Brysbaert, M. (2010). SUBTLEX-CH: Chinese word and character frequencies based on film subtitles. PLoS One 5:e10729. doi: 10.1371/journal.pone.0010729

PubMed Abstract | Crossref Full Text | Google Scholar

Coltheart, M. (1981). The MRC psycholinguistic database. Q. J. Exp. Psychol. 33, 497–505. doi: 10.1080/14640748108400805

Crossref Full Text | Google Scholar

Crystal, D. (1987). The Cambridge encyclopedia of language. UK: Cambridge University Press.

Google Scholar

Diependaele, K., Lemhöfer, K., and Brysbaert, M. (2013). The word frequency effect in first and second language word recognition: a lexical entrenchment account. Q. J. Exp. Psychol. 66, 843–863. doi: 10.1080/17470218.2012.720994

PubMed Abstract | Crossref Full Text | Google Scholar

Dijkstra, T. (2007). “The multilingual lexicon” in Handbook of psycholinguistics. ed. M. G. Gaskell (Oxford: Oxford University Press), 251–265.

Google Scholar

Elgort, I., Brysbaert, M., Stevens, M., and Van Assche, E. (2018). Contextual word learning during reading in a second language: an eye-movement study. Stud. Second. Lang. Acquis. 40, 341–366. doi: 10.1017/S0272263117000109

Crossref Full Text | Google Scholar

Ellis, N. (2012). “What can we count in language, and what counts in language acquisition, cognition, and use?” in Frequency effects in language learning and processing. eds. S. T. Gries and D. Divjak (Berlin, Boston: De Gruyter Mouton), 7–34.

Google Scholar

Enayat, J. M., and Derakhshan, A. (2021). Vocabulary size and depth as predictors of second language speaking ability. System 99:102521. doi: 10.1016/j.system.2021.102521

Crossref Full Text | Google Scholar

Gu, X., and Li, Z. (2009). A study on word list transition from English teaching syllabus to English curriculum standard [in Chinese]. Curricul. Teach. Mater. Method 29, 46–51. doi: 10.19877/j.cnki.kcjcjf.2009.03.012

Crossref Full Text | Google Scholar

Gullberg, M., Roberts, L., and Dimroth, C. (2012). What word-level knowledge can adult learners acquire after minimal exposure to a new language? Int. Rev. Appl. Linguis. Lang. Teach. 50, 239–276. doi: 10.1515/iral-2012-0010

Crossref Full Text | Google Scholar

Henriksen, B. (1999). Three dimensions of vocabulary development. Stud. Second. Lang. Acquis. 21, 303–317. doi: 10.1017/S0272263199002089

Crossref Full Text | Google Scholar

Hoshino, N., and Kroll, J. F. (2008). Cognate effects in picture naming: does cross-language activation survive a change of script? Cognition 106, 501–511. doi: 10.1016/j.cognition.2007.02.001

PubMed Abstract | Crossref Full Text | Google Scholar

Jiang, N. (2000). Lexical representation and development in a second language. Appl. Linguis. 21, 47–77. doi: 10.1093/applin/21.1.47

Crossref Full Text | Google Scholar

Keuleers, E., and Brysbaert, M. (2010). Wuggy: a multilingual pseudoword generator. Behav. Res. Methods 42, 627–633. doi: 10.3758/BRM.42.3.627

PubMed Abstract | Crossref Full Text | Google Scholar

Keuleers, E., Brysbaert, M., and New, B. (2010). SUBTLEX-NL: a new frequency measure for Dutch words based on film subtitles. Behav. Res. Methods 42, 643–650. doi: 10.3758/BRM.42.3.643

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, J., and Davis, C. (2003). Task effects in masked cross-script translation and phonological priming. J. Mem. Lang. 49, 484–499. doi: 10.1016/S0749-596X(03)00093-7

Crossref Full Text | Google Scholar

Kuperman, V., Stadthagen-Gonzalez, H., and Brysbaert, M. (2012). Age-of-acquisition ratings for 30,000 English words. Behav. Res. Methods 44, 978–990. doi: 10.3758/s13428-012-0210-4

PubMed Abstract | Crossref Full Text | Google Scholar

Laufer, B. (1990). Ease and difficulty in vocabulary learning: some teaching implications. Foreign Lang. Ann. 23, 147–155. doi: 10.1111/j.1944-9720.1990.tb00355.x

Crossref Full Text | Google Scholar

Laufer, B., Elder, C., Hill, K., and Congdon, P. (2004). Size and strength: do we need both to measure vocabulary knowledge? Lang. Test. 21, 202–226. doi: 10.1191/0265532204lt277oa

Crossref Full Text | Google Scholar

Laufer, B., and Goldstein, Z. (2004). Testing vocabulary knowledge: size, strength, and computer Adaptiveness. Lang. Learn. 54, 399–436. doi: 10.1111/j.0023-8333.2004.00260.x

Crossref Full Text | Google Scholar

Laufer, B., and Nation, P. (1995). Vocabulary size and use: lexical richness in L2 written production. Appl. Linguis. 16, 307–322. doi: 10.1093/applin/16.3.307

Crossref Full Text | Google Scholar

Lemhöfer, K., and Broersma, M. (2012). Introducing LexTALE: a quick and valid lexical test for advanced learners of English. Behav. Res. Methods 44, 325–343. doi: 10.3758/s13428-011-0146-0

PubMed Abstract | Crossref Full Text | Google Scholar

Masrai, A. (2023). Lexical knowledge and L2 general language proficiency: collocational competence and vocabulary size as determinants of lexical knowledge. Cogn. Process. 24, 289–300. doi: 10.1007/s10339-022-01120-2

PubMed Abstract | Crossref Full Text | Google Scholar

Meara, P. (1996). “The dimension of lexical competence” in Performance and competence in second language acquisition. eds. G. Brown, K. Malmkjaer, and J. Williams (Cambridge: Cambridge University Press), 35–53.

Google Scholar

Meara, P., and Buxton, B. (1987). An alternative to multiple choice vocabulary tests. Lang. Test. 4, 142–154. doi: 10.1177/026553228700400202

Crossref Full Text | Google Scholar

Milton, J. (2009). Measuring second language vocabulary acquisition. Bristol, Blue Ridge Summit: Multilingual Matters.

Google Scholar

Milton, J., Wade, J., and Hopkins, N. (2010). “Aural word recognition and oral competence in a foreign language” in Further insights into nonnative vocabulary teaching and learning. eds. R. Chacón-Beltrán, C. Abello-Contesse, and M. Torreblanca-López (Bristol, Blue Ridge Summit: Multilingual Matters), 83–98.

Google Scholar

Ministry of Education of China (2017). English curriculum standards for high schools (in Chinese). Beijing: People’s Education Press.

Google Scholar

Mitsugi, S. (2018). Proficiency influences orthographic activations during L2 spoken-word recognition. Int. J. Biling. 22, 199–214. doi: 10.1177/1367006916666388

Crossref Full Text | Google Scholar

Mochida, K., and Harrington, M. (2006). The yes/no test as a measure of receptive vocabulary knowledge. Lang. Test. 23, 73–98. doi: 10.1191/0265532206lt321oa

Crossref Full Text | Google Scholar

Nagy, W. E., Carlisle, J. F., and Goodwin, A. P. (2014). Morphological knowledge and literacy acquisition. J. Learn. Disabil. 47, 3–12. doi: 10.1177/0022219413509967

Crossref Full Text | Google Scholar

Nation, I. S. P. (1990). Teaching and learning vocabulary. New York: Heinle and Heinle.

Google Scholar

Nation, I. S. P. (2013). Learning vocabulary in another language. New York: Cambridge University Press.

Google Scholar

Nation, I. S. P. (2014). How much input do you need to learn the most frequent 9,000 words? Read. Foreign Lang. 26, 1–16. doi: 10.26686/wgtn.12543437.v1

Crossref Full Text | Google Scholar

Nation, I. S. P. (2016). Making and using word lists for language learning and testing. Amsterdam/Philadelphia: John Benjamins Publishing Company.

Google Scholar

New, B., Brysbaert, M., Veronis, J., and Pallier, C. (2007). The use of film subtitles to estimate word frequencies. Appl. Psycholinguist. 28, 661–677. doi: 10.1017/S014271640707035X

Crossref Full Text | Google Scholar

Peters, E., and Webb, S. (2018). Incidental vocabulary acquisition through viewing L2 television and factors that affect learning. Stud. Second. Lang. Acquis. 40, 551–577. doi: 10.1017/S0272263117000407

Crossref Full Text | Google Scholar

Pigada, M., and Schmitt, N. (2006). Vocabulary acquisition from extensive reading: a case study. Read. Foreign Lang. 18, 1–28.

Google Scholar

Pinchbeck, G. G., Brown, D., Mclean, S., and Kramer, B. (2022). Validating word lists that represent learner knowledge in EFL contexts: the impact of the definition of word and the choice of source corpora. System 106:102771. doi: 10.1016/j.system.2022.102771

Crossref Full Text | Google Scholar

Qian, C. (2020). The application of word list approach in senior high school English teaching [Unpublished thesis]. Nanjing: Nanjing Normal University.

Google Scholar

Read, J. (2010). Assessing vocabulary. New York: Cambridge University Press.

Google Scholar

Reynolds, B. L., Shih, Y.-C., and Wu, W.-H. (2018). Modeling Taiwanese adolescent learners’ English vocabulary acquisition and retention: the washback effect of the college entrance examination Center’s reference word list. Engl. Specif. Purp. 52, 47–59. doi: 10.1016/j.esp.2018.08.001

Crossref Full Text | Google Scholar

Salsbury, T., Crossley, S. A., and McNamara, D. S. (2011). Psycholinguistic word information in second language oral discourse. Second. Lang. Res. 27, 343–360. doi: 10.1177/0267658310395851

Crossref Full Text | Google Scholar

Schmitt, N. (2010a). Researching vocabulary: A vocabulary research manual. London: Palgrave MacMillan.

Google Scholar

Schmitt, N. (2010b). “Key issues in teaching and learning vocabulary” in Insights into non-native vocabulary teaching and learning. eds. R. Chacón-Beltrán, C. Abello-Contesse, and M. Torreblanca-López (Bristol, Blue Ridge Summit: Multilingual Matters), 28–40.

Google Scholar

Schmitt, N. (2014). Size and depth of vocabulary knowledge: what the research shows. Lang. Learn. 64, 913–951. doi: 10.1111/lang.12077

Crossref Full Text | Google Scholar

Strik Lievers, F., Bolognesi, M., and Winter, B. (2021). The linguistic dimensions of concrete and abstract concepts: lexical category, morphological structure, countability, and etymology. Cogn. Linguis. 32, 641–670. doi: 10.1515/cog-2021-0007

Crossref Full Text | Google Scholar

Sun, W. (2005). On the compilation principles and methods of English teaching vocabulary syllabus of basic education [in Chinese]. Curricul. Teach. Mater. Method 25, 61–65.

Google Scholar

Syndicate, U. C. L. E. (2001). Quick placement test. Oxford: Oxford University Press.

Google Scholar

Tekmen, E. A. F., and Daloǧlu, A. (2006). An investigation of incidental vocabulary acquisition in relation to learner proficiency level and word frequency. Foreign Lang. Ann. 39, 220–243. doi: 10.1111/j.1944-9720.2006.tb02263.x

Crossref Full Text | Google Scholar

Teng, M. F. (2022). The roles of second-language proficiency level and working memory on vocabulary learning from word-focused exercises. RELC J. :003368822211022. doi: 10.1177/00336882221102228

Crossref Full Text | Google Scholar

Urdaniz, R. P., and Skoufaki, S. (2022). Spanish L1 EFL learners’ recognition knowledge of English academic vocabulary: the role of cognateness, word frequency and length. Appl. Linguis. Rev. 13, 661–703. doi: 10.1515/applirev-2018-0109

Crossref Full Text | Google Scholar

van Heuven, W. J., Mandera, P., Keuleers, E., and Brysbaert, M. (2014). SUBTLEX-UK: a new and improved word frequency database for British English. Q. J. Exp. Psychol. 67, 1176–1190. doi: 10.1080/17470218.2013.850521

PubMed Abstract | Crossref Full Text | Google Scholar

Verhaeghen, P. (2003). Aging and vocabulary scores: a meta-analysis. Psychol. Aging 18, 332–339. doi: 10.1037/0882-7974.18.2.332

PubMed Abstract | Crossref Full Text | Google Scholar

Warriner, A. B., Kuperman, V., and Brysbaert, M. (2013). Norms of valence, arousal, and dominance for 13,915 English lemmas. Behav. Res. Methods 45, 1191–1207. doi: 10.3758/s13428-012-0314-x

PubMed Abstract | Crossref Full Text | Google Scholar

Webb, S. (2007). The effects of repetition on vocabulary knowledge. Appl. Linguis. 28, 46–65. doi: 10.1093/applin/aml048

Crossref Full Text | Google Scholar

Webb, S. (2008). Receptive and productive vocabulary sizes of L2 learners. Stud. Second. Lang. Acquis. 30, 79–95. doi: 10.1017/S0272263108080042

Crossref Full Text | Google Scholar

Webb, S. (2009). The effects of receptive and productive learning of word pairs on vocabulary knowledge. RELC J. 40, 360–376. doi: 10.1177/0033688209343854

Crossref Full Text | Google Scholar

Wen, Y., and van Heuven, W. J. (2017). Chinese translation norms for 1,429 English words. Behav. Res. Methods 49, 1006–1019. doi: 10.3758/s13428-016-0761-x

PubMed Abstract | Crossref Full Text | Google Scholar

West, M., Faucett, L., and Maki, I. (1934). A study of English word-values determined from the latest extensive word-counts. Mod. Lang. J. 18, 357–358. doi: 10.2307/315339

Crossref Full Text | Google Scholar

Yamamoto, Y. (2014). Multidimensional vocabulary acquisition through deliberate vocabulary list learning. System 42, 232–243. doi: 10.1016/j.system.2013.12.005

Crossref Full Text | Google Scholar

Yarkoni, T., Balota, D., and Yap, M. (2008). Moving beyond Coltheart’s N: a new measure of orthographic similarity. Psychon. Bull. Rev. 15, 971–979. doi: 10.3758/pbr.15.5.971

PubMed Abstract | Crossref Full Text | Google Scholar

Zareva, A., Schwanenflugel, P., and Nikolova, Y. (2005). Relationship between lexical competence and language proficiency: variable sensitivity. Stud. Second. Lang. Acquis. 27, 567–595. doi: 10.1017/S0272263105050254

Crossref Full Text | Google Scholar

Zeng, T., Han, B., Zhai, M., and Mu, Y. (2019). The effect of language proficiency on L2 English learners’ processing of morphologically complex words: evidence from masked transposed letter priming. Neurosci. Lett. 704, 84–88. doi: 10.1016/j.neulet.2019.03.042

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, J. (2015). English vocabulary teaching in Chinese junior high schools [Unpublished thesis]. West Lafayette, Indiana: Purdue University.

Google Scholar

Zhong, H. F. (2018). The relationship between receptive and productive vocabulary knowledge: a perspective from vocabulary use in sentence writing. Lang. Learn. J. 46, 357–370. doi: 10.1080/09571736.2015.1127403

Crossref Full Text | Google Scholar

Keywords: the Gaokao Word List, L2 proficiency, vocabulary knowledge, wordlist effect, word recognition

Citation: Bao Z and Peng C (2024) The effects of EFL wordlist and proficiency on vocabulary knowledge. Front. Psychol. 15:1289106. doi: 10.3389/fpsyg.2024.1289106

Received: 20 September 2023; Accepted: 10 June 2024;
Published: 20 June 2024.

Edited by:

Douglas F. Kauffman, Consultant, Greater Boston, United States

Reviewed by:

Mojgan Rashtchi, Islamic Azad University North Tehran Branch, Iran
Sophia Skoufaki, University of Essex, United Kingdom

Copyright © 2024 Bao and Peng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cheng Peng, philipeng@sjtu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.