- 1Unidad de Educación, Escuela de Posgrado, Universidad Peruana Unión, Lima, Peru
- 2Escuela de Medicina Humana, Facultad de Ciencias de la Salud, Universidad Peruana Unión, Lima, Peru
- 3Facultad de Teología, Universidad Peruana Unión, Lima, Peru
- 4Escuela de Posgrado, Universidad Peruana Unión, Lima, Peru
- 5Escuela Profesional de Psicología, Facultad de Ciencias de la Salud, Universidad Peruana Unión, Lima, Peru
- 6Facultad de Farmacia y Bioquímica, Universidad Científica del Sur, Lima, Peru
- 7South American Center for Education and Research in Public Health, Universidad Norbert Wiener, Lima, Peru
- 8Facultad de Ciencias Químicas, Universidad Nacional San Agustín de Arequipa, Arequipa, Peru
- 9Faultad de Ciencias de la Salud, Universidad Peruana Unión, Tarapoto, Peru
Background: English is a global language used to communicate with members of the international community. Self-efficacy in learning English is an important factor that is influenced by perceived importance, interest, and beliefs about the ability to successfully perform tasks in English.
Objective: The aim is to develop and validate a measurement instrument to assess English self-efficacy.
Methods: A total of 453 students from different Peruvian universities participated, with ages ranging from 18 to 60 years (M = 23; SD = 6.18). Statistical techniques for latent variables were used and recommendations for educational and psychological tests were followed in its construction. The sample was divided into two groups to perform exploratory factor analysis (EFA) and confirmatory factor analysis (CFA).
Results: The English Self-Efficacy Scale (ESS-P) is representative and relevant in terms of its item content (Aiken’s V > 0.70). Its internal structure is organized into three first order factors and a second order factor that are consistent with the theoretical proposal and was confirmed through CFA with excellent goodness-of-fit indices (χ2 = 1184.9, gl = 626, CFI = 0.92, TLI = 0.92, RMSEA = 0.06 y SRMR = 0.04). It also has adequate internal consistency in its three factors (Reading α/ω = 0.96), Oral Communication (α/ω = 0.95), and Writing (α/ω = 0.97) and the whole scale (α/ω = 0.98), is invariant with respect to sex, and has a conceptual relationship with variables such as academic self-efficacy and exam anxiety.
Conclusion: The ESS-P is a measurement instrument with evidence of validity, factorial invariance, and good reliability of its scores. Therefore, it can be used in future studies in the academic context.
1. Introduction
English is referred to as the global language due to its use in connecting and communicating with members of the international community (Botes et al., 2020). Nonetheless, the level of English proficiency as a foreign language in Peru is moderate, ranking 51st in Education First English Proficient Index (2022). The learning of this language for non-native speakers is not an easy process, being influenced by various motivational factors: perceived importance, interest, and self-efficacy (Bai et al., 2020). The latter constitutes a topic of research in the field of English as a Foreign Language (EFL).
Positive psychology has been a rapidly growing field in the last decade, focusing on how individuals can live optimally and reach their full potential through an approach based on empirical data and scientific methods (Seligman and Csikszentmihalyi, 2000; Seligman, 2004). Within the subfield of second language learning, positive psychology has been applied in various contexts and levels of identity, ranging from a general trait to a specific state (Mercer and MacIntyre, 2014; Gabryś-Barker and Gałajda, 2016; Mercer et al., 2018). Self-efficacy refers to the set of beliefs that individuals have about their abilities to perform activities necessary to achieve desired goals (Bandura, 1977). This construct is based on the Social Cognitive Theory (SCT; Bandura, 2008). In the educational field, the beliefs that students have about their own capabilities play an important role, not only in general but also in a specific area such as English, and specific items can be used to measure self-efficacy in that area (Bai et al., 2020). Therefore, English self-efficacy is defined as the belief that a person has about the effectiveness of their abilities to perform a task in English successfully (Wang et al., 2014).
In the context of second language learning, it has been found that self-efficacy in specific areas, such as reading, speaking, and listening comprehension, is related to positive self-concept variables and language competence (Chen et al., 2020). Currently, there are two approaches to the dimensional structure of English self-efficacy. The first approach has been divided into four categories (Wang et al., 2014; Wang and Bai, 2017; Ngoc Truong and Wang, 2019), these are: (a) Reading, which refers to the ability to comprehend written text in English, taking into account the purpose of reading, recognition of key words, identification of explicit and implicit main ideas, type of text, and manifestation of a critical position (Newton et al., 2018); (b) Speaking, which encompasses the ability to communicate orally in English spontaneously, asking questions to obtain information about a specific topic, as well as answering the questions of the interlocutor, accompanied by facial expressions, gestures, or body movements (Burns, 2019); (c) Writing, which involves writing a specific type of text in English in a coherent and cohesive manner, considering the purpose, recipient, type of language to use, and writing processes (planning, first draft, revision, and final draft; Hyland, 2019); and (d) Listening, which refers to the ability to understand what the English speaker communicates, identifying the content of the message (Gilakjani and Sabouri, 2016). On the other hand, according to the other approach, they are divided into three categories (Chauvin et al., 2020), these are: (a) media comprehension, related to understanding audio, radio, and video programs; (b) receptive and productive written communication, composed of reading and writing tasks; and (c) receptive and productive oral communication, related to the listening and speaking tasks that are found in social interaction (Chauvin et al., 2020).
Several instruments have been developed to evaluate English self-efficacy based on Wang’s qualitative study, with a sample of 4 boys aged 6 to 9 years (Wang, 2004), whose results could be more applicable to these age ranges, but they could lead to the omission of items most relevant for male and female adults. One of these instruments is the Questionnaire of English Self-Efficacy (QESE), which has a theoretical structure consisting of four dimensions and was developed from a qualitative approach using interviews, observations, and verbal protocols with Chinese elementary school English students in the United States (Wang, 2004). This instrument was later adapted in Korea with a sample of university students (Wang et al., 2013). This version was also evaluated with Chinese high school students aged 15 to 19 years (Wang and Bai, 2017) and university students in China, with modifications to some of its items (Wang et al., 2014). This same model was also applied to university students in Vietnam (Ngoc Truong and Wang, 2019). On the other hand, the Professional English Self-Efficacy Questionary (PESEQ) was designed from the same approach as the QESE and was applied to a professional French population (PESEQ) with the purpose of measuring English self-efficacy at the professional level. It showed a structure of three dimensions: media comprehension, receptive and productive written communication (reading and writing), and receptive and productive oral communication (listening and speaking; Chauvin et al., 2020). Both instruments have similar theoretical approaches with 4 and 3 dimensions, respectively, but have been applied to different populations: high school students, university students, and professionals. The ESS-P, proposed in the present study, has a structure similar to the scales mentioned above since it focuses on reading and writing, as well as oral communication which includes speaking and listening intrinsically; however, this scale differs from the previous ones because it introduces a set of new items developed from a different theoretical scope, and by its characteristics serves to evaluate self-efficacy for English in undergraduate and postgraduate students in general contexts, not exclusively for the academic environment such as QESE or only for the work context such as PESEQ.
Due to the fact that having a better proficiency in the English language requires students to have higher self-efficacy in performing various activities involving the language (Anam and Stracke, 2020), some researchers have found differences between men and women in terms of self-efficacy scores related to English achievement (Kutuk et al., 2022; Xu et al., 2022). For example, higher levels of self-efficacy for writing have been recorded for males compared to females (Ramos-Villagrasa et al., 2018); however, another study showed a stronger relationship between self-efficacy in writing and writing performance for female students than for male students (Pajares et al., 2001). A more recent study, on the other hand, revealed that the relationship between writing self-efficacy and writing performance was equal for both women and men, indicating that the significant differences in self-efficacy and writing performance in favor of female students were due more to women’s confidence in the stereotype that attributes higher self-efficacy for languages to them (Sun et al., 2021). Therefore, it is important to take into account measurement invariance, as it allows verifying whether members of different groups have the same understanding of the items on a scale. In this way, it ensures that people with the same level of a trait will give the same responses to the scale, regardless of their group affiliation (Milfont and Fischer, 2010). To make these comparisons, it is necessary that measurement invariance is met, as otherwise, it cannot be guaranteed that the indicators reflect the same construct in all groups and have the same meaning (Byrne, 2008; Dimitrov, 2010). Thus, it becomes necessary to compare self-efficacy for English between men and women (Byrne and Stewart, 2009), in order to estimate latent factors and make comparisons between individuals of both sexes, without varying the relevant properties of the scale, so as not to reach incorrect conclusions (García-Álvarez et al., 2022). If the measurement instruments have not shown invariance, the conclusions derived from the studies can be erroneous and biased toward one of the groups, without reflecting the true differences in the way people respond to the items (Byrne, 2008).
To develop adequate English self-efficacy, it is necessary to select cognitive and self-regulation strategies that motivate its development (Velasco-Zárate and Meza-Cano, 2020). English self-efficacy is positively related to academic self-efficacy, which indicates the reciprocal predictive capacity of both variables (Wang et al., 2018), and since academic self-efficacy has a negative association with exam anxiety (Preiss et al., 2006), a negative association between English self-efficacy and exam anxiety would also be expected. English self-efficacy also shows significant correlations with learning strategies (Montaño-González and Cancino, 2020) and academic achievement (Kitikanan and Sasimonton, 2017). Additionally, it has been shown that self-efficacy for academic writing in English can be improved through technological strategies (Velasco-Zárate and Meza-Cano, 2020).
In this sense, due to the lack of Spanish-speaking instruments that aim to measure self-efficacy, the purpose of this study is to develop an instrument that allows measuring English self-efficacy in university students, providing evidence in terms of content, internal structure, sex invariance, and relationship with other variables.
2. Methods
2.1. Design and participants
It is an instrumental research design (Ato et al., 2013). To calculate the sample size, we evaluated the size of the effect that includes the number of visible and hidden variables in the model, the expected effect (λ = 0.10), the desired level of statistical significance (α = 0.05) and the statistical power (1 − β = 0.90), which leads us to recommend a minimum sample of 199. A total of 418 students from 19 Peruvian universities, aged between 18 and 50 years (M = 21.8; SD = 3.94), participated in the study. The majority (51.7%) were women, attending public universities (66.5%), at a basic level of English language study (59.6%), majoring in Engineering, Industry, and Construction (42.1%), residing in the Ancash region (61.0%), and attending public universities (66.5). They were selected through non-probabilistic sampling (Table 1). Additionally, the sample was divided into two groups, with 230 cases randomly selected for the exploratory factor analysis (EFA) and 188 for the confirmatory factor analysis (CFA; Lloret-Segura et al., 2014).
2.2. Instruments
English Self-Efficacy Scale (ESS-P). The English Self-Efficacy Scale (ESS-P) was developed from a literature review (Gilakjani and Sabouri, 2016; Newton et al., 2018; Burns, 2019; Hyland, 2019), and aims to assess self-efficacy in English. Initially, it focused on four areas like previous instruments developed by other authors (Wang et al., 2014), but later three dimensions based on the main skills were proposed: reading (Newton et al., 2018), writing (Hyland, 2019), and speaking/listening (Gilakjani et al., 2016; Burns, 2019); which were combined into an oral communication dimension based on the criteria of Chauvin et al. (2020). Each of the items is measured on a 5-point Likert scale: 0 = I cannot do it at all, 1 = I cannot do it, 2 = Relatively sure I can do it, 3 = I can do it, 4 = Totally sure I can do it. Although it was originally recommended to use a 0 to 100 scale (Bandura, 2006), a subsequent review of self-efficacy scales has shown the frequent use of 5-response scales with satisfactory results (Betz, 2013), so a 5-response option was also chosen in the present study.
Single-Item Academic Self-Efficacy (IUAA; Dominguez-Lara et al., 2019). It is a measure of Academic Self-Efficacy, whose single question is, “How sure are you that you will be able to efficiently perform the tasks demanded of you in your academic life?,” which was organized under a five-point scale, from Not at all sure to Very sure.
Single-Item Test Anxiety (SITA; Dominguez-Lara, 2018). This is an instrument that evaluates exam anxiety through a single item: “During exams I feel a lot of tension” with 4 response alternatives: 1 = almost never, 2 = sometimes, 3 = frequently, and 4 = almost always.
2.3. Procedure
The research protocol was approved by the Research Ethics Committee of the Peruvian university with reference number CE-EPG-0000119. Contact was established with administrators of public and private universities throughout Peru. A pilot test was conducted through a Zoom meeting to identify problems in item writing and comprehension. Then, informed consent forms were sent through Google Forms, WhatsApp groups, and emails to students. Participants were allowed to leave the study at any time if they wished. Finally, the study was carried out following the ethical standards established in the Helsinki Declaration, including the protection of personal information privacy and confidentiality and the minimization of any impact on the physical, mental, and social health of participants.
2.4. Data analysis
To evaluate the content validity of the ESS-P, five university professors with experience in teaching English at the undergraduate and graduate level were asked to rate the relevance of the items and provide suggestions for improvement (Suárez-Alvarez et al., 2018). The responses were analyzed using the Aiken coefficient for dichotomous judge ratings (V = 1, p = 0.031; Aiken, 1985). A pilot test was conducted with 20 undergraduate and postgraduate students to verify the understanding of the items and to obtain the discriminate index and a preliminary reliability analysis (Muñiz and Fonseca-Pedrero, 2019).
Before starting the Exploratory Factor Analysis (EFA), certain conditions must be met, such as linear correlation in the item matrix and verification of common factors through the Kaiser-Meyer-Olkin (KMO) coefficient. High values of KMO indicate that the correlations between the items can be explained by other variables (Kaiser, 1974). In addition, the Barlett sphericality test was used to determine if the correlation matrix is an identity matrix, which means that the correlations are zero.
The Exploratory Factor Analysis (EFA) was performed using the minimum residual extraction method with oblimin rotation, considering factor loadings greater than 0.30 as a criterion for belonging to a factor (Nunnally and Bernstein, 1994) and the number of factors was determined through a parallel analysis. Subsequently, Confirmatory Factor Analysis (CFA) was chosen and estimated using the lavaan library in the RStudio interface. A confirmatory factor analysis was performed on the unifactorial scale using the MLR estimator, which is suitable for numerical variables and is robust to normality deviations in inference (Muthén and Muthén, 2017). Fit measures were used to evaluate the factor structure, such as the Confirmatory Fit Index and Tucker-Lewis Index (CFI and TLI ≥ 0.95; Schumacker and Lomax, 2016), the Root Mean Square Error of Approximation and the Standardized Root Mean Square Residuals (RMSEA and SRMSR ≤ 0.05; Kline, 2016). Factor loadings (λ) with values greater than 0.50 were considered adequate. In addition, the average extracted variance (AVE) was calculated and a value greater than 0.50 was considered appropriate, indicating that more than 50% of the variance of the construct is explained by its indicators (Fornell and Larcker, 1981).
In addition, reliability was evaluated through internal consistency using the Cronbach’s alpha coefficient (Cronbach, 1951) and McDonald’s Omega (Hayes and Coutts, 2020), which is suitable for factor models. The relationship with other convergent variables was also analyzed through correlations based on structural equation models between the scores of the English Self-Efficacy Scale (ESS-P), the single item of Academic self-efficacy, and Test Anxiety. An effect magnitude of r ≥ 0.20 was considered the minimum recommended, while r ≥ 0.50 was moderate and r ≥ 0.80 was strong.
To evaluate the consistency of the factors based on the sex of the participants, a series of increasingly strict hierarchical variance models were used. First, the configuration invariance (reference model) was evaluated, followed by metric invariance (equality of factor loads), scalar invariance (equality of factor loads and intercepts), and finally, strict invariance (equality of factor loads, intercepts, and residuals) was evaluated. To compare the models, a formal statistical test was first used, using the difference in the Confirmatory Fit Index (ΔCFI), where values lower than 0.010 provided evidence of model consistency between groups (Chen, 2007; Finch and French, 2018).
3. Results
3.1. Content validity analysis
To evaluate the content validity of the questionnaire, five experts in teaching English were recruited as judges. The 65 items were evaluated in terms of clarity, consistency, context, and relevance to the construct. The Aiken V Coefficient was used to analyze the results, and two items (6 and 26) were removed for not meeting the standards of clarity, consistency, context, and relevance to the construct (V = 0.80, p > 0.05).
3.2. Pilot test
The pilot test with 20 undergraduate and graduate students allowed for improvement of the instrument. Based on the collected data, the discrimination index of the items was evaluated using Pearson’s product–moment correlation between the values of each item and the sum total of the other items. It was found that item 6 had the lowest value and item 15 had the highest value (Min. = 0.429, Max. = 0.943), indicating that all values were appropriate (>0.2; Rust et al., 2021). Additionally, an adequate preliminary reliability was obtained for the instrument as a whole (α = 0.984 and ω = 0.985).
3.3. Preliminary analysis
The descriptive analysis of the 50 items of the ESS-P was carried out in two groups: the EFA group with 230 participants and the CFA group with 188 participants. For each item, the mean, standard deviation, skewness index (g1), and kurtosis index (g2) were provided. The values of the skewness index (g1) for both groups were found within the range of ±2, indicating a symmetrical distribution of the data (Finney and DiStefano, 2006). Most of the values of the kurtosis index (g2) were also found within the range of ±2, indicating a relatively normal distribution of the data. In the EFA group, the item with the highest mean is item 2 with a score of 3, closely followed by item 8 with a score of 2.54. On the other hand, the item with the lowest mean is item 24 with a score of 2.03, closely followed by item 1 with a score of 2.76. In the CFA group, the item with the highest mean is item 17 with a score of 2.37, closely followed by item 18 with a score of 2.36. On the other hand, the item with the lowest mean is item 24 with a score of 1.98, closely followed by item 1 with a score of 2.72 (Table 2).
3.4. Preliminary evidence-based internal structure
The sample (n = 230) was found to be adequate for conducting the Exploratory Factor Analysis (EFA) through the Kaiser-Meyer-Olkin (KMO) measure of sample adequacy and the Bartlett’s test of sphericity. The KMO was 0.969, indicating an adequate measure of sample adequacy. Additionally, Bartlett’s test was significant (p < 0.001). The number of factors was determined through parallel analysis and the scree plot (Figure 1), which suggested the existence of 3 factors. The EFA was carried out using the minimum residuals extraction method with oblimin rotation, retaining only those items with a factor loading greater than or equal to 0.4 and not presenting factorial complexity that could make interpretation difficult. Finally, 39 items organized into a 3-factor structure were obtained (Table 3), which were decided to be named: reading (Factor 1), Oral Communication (Factor 2), and writing (Factor 3).
3.5. Internal structure validation
Three confirmatory factor analysis (CFA) models were carried out based on the three factors and 39 items identified in an exploratory factor analysis (EFA), using a sample of 188 participants (n = 188). In the first model, adequate fit indices were found: χ2 = 1231.460, df = 699, p ≤ 0.001, CFI = 0.91, TLI = 0.90, RMSEA = 0.06 (90% CI 0.06–0.07), SRMR = 0.04. However, it was decided to retain only factor loadings (λ) greater than 0.70, leading to the removal of items 2 and 5. The second model also presented adequate fit values: χ2 = 1075.910, df = 626, p = <0.001, CFI = 0.92, TLI = 0.91, RMSEA = 0.06 (90% CI 0.06–0.07), SRMR = 0.04. However, by retaining factor loadings (λ) greater than 0.70, item 1 was eliminated. In the third and final model, adequate fit indices were obtained: χ2 = 1015.890, df = 591, p < 0.001, CFI = 0.92, TLI = 0.92, RMSEA = 0.06 (90% CI 0.06–0.07), SRMR = 0.04, and all factor loadings (λ) were greater than 0.70. Moreover, the average variance extracted (AVE) value was acceptable, being higher than 0.50 for each dimension (AVE > 0.50). However, convergent validity was not adequate, as it was observed that the AVE values were smaller than the shared variance between factors (AVE > φ2) in all cases. Despite this, internal consistency was satisfactory, with Cronbach’s alpha and McDonald’s omega coefficients for Reading (α/ω = 0.96), Oral Communication (α/ω = 0.95), and Writing (α/ω = 0.97) showing adequate values.
3.6. Second-order model
Because a discriminant validity was not obtained and that the values of the correlations between the factors were greater than 0.80, which would indicate that the dimensions belong to a second-order factor, so a model was made (Mason and Perreault, 1991; Schwarz et al., 2014) with 3 first-order and 1 second-order factors, similar to several models presented in previous studies for the same construct (Wang et al., 2014; Ngoc Truong and Wang, 2019; Chauvin et al., 2020), obtaining adequate adjustment indices χ2 = 1021.61, gl = 591, p < 0.001, CFI = 0.92, TLI = 0.92, RMSEA = 0.06 [IC 90% = 0.05–0.07] y SRMR = 0.04. Regarding this model, the internal consistency coefficients Cronbach alpha and McDonald’s Omega for the dimensions Reading (α/ω = 0.96), Oral Communication (α/ω = 0.95) y Writing (α/ω = 0.97) as well as for the full scale (α/ω = 0.98), showed adequate values (Figure 2).
3.7. Factor invariance by sex
The factor invariance between sexes was evaluated. Table 4 shows evidence of strict invariance according to the Δ CFI criterion. When adding the equal means restriction, the model fit did not worsen significantly, suggesting that the latent means are similar for both sexes. Therefore, models M1, M2, M3, and M4 meet the expected criteria and confirm the factor invariance of the ESS-P. This allows for comparison of different measures in sex groups.
3.8. Validation based on relationship with other variables
Based on a literature review, we proposed a model to assess the convergence of the ESS-P with other measures. For this purpose, a structural equation modeling (SEM) approach was used to examine the latent relationships between the dimensions of the ESS-P (Reading, Oral Communication, and Writing) and the variables Academic self-efficacy and Test Anxiety. The structural model showed adequate fit indices: χ2 = 1115.750, gl = 661, p = <0.001 CFI = 0.92, TLI = 0.91, RMSEA = 0.06 (90% CI 0.05–0.07), SRMR = 0.04. Furthermore, the measurement models were appropriately represented by their items, as shown in Figure 3. The results revealed that the dimensions of the ESS-P (Reading, Oral Communication, and Writing) are positively related to academic self-efficacy and negatively related to test anxiety. These relationships suggest that as proficiency in English skills increases, academic self-efficacy also increases, and test anxiety decreases. Taking these results into account, it can be concluded that the ESS-P demonstrates strong evidence of convergent validity, indicating that this assessment tool is consistent with other relevant measures in the academic and psychological domains. This convergence supports the usefulness of the ESS-P as a valid instrument for measuring English skills in students and their relationship with academic self-efficacy and test anxiety.
4. Discussion
Self-efficacy refers to an individual’s belief in their ability to perform a task or achieve a goal. Self-efficacy in second language learning, such as English, plays an important role in student success and motivation. Thus, in Peru and other Spanish-speaking countries, appropriate instruments are needed to measure English self-efficacy, considering dimensions such as: reading, writing and oral communication. The aim of this study was to develop an English Self-Efficacy Scale (ESS-P) and provide the first evidence of validity, reliability, and invariance according to sex in Peruvian university students.
The evidence based on the content was carried out in two phases. First, a literature review was conducted to provide information about the scales developed and related ideas about the concept of English self-efficacy. Second, the representativeness and relevance of the items were verified by 5 experts, as relevance and representativeness are important when developing a scale as they provide quality and coherence of the items according to the theoretical aspects of the scale being developed (Clark and Watson, 1995; Eignor, 2013). A pilot test was also conducted to qualitatively verify the participants’ understanding of the items, obtaining adequate discrimination indices for each of them (>0.2; Rust et al., 2021).
Evidence based on the internal structure of the ESS-P was carried out in two phases. In the first phase, an Exploratory Factor Analysis (EFA) was used which showed the existence of three proposed factors (Reading, Writing, and Oral Communication). Then, a Confirmatory Factor Analysis (CFA) based on the structure of the EFA was carried out, which showed that the goodness-of-fit indices were adequate. In this way, the theoretical proposal of three dimensions to evaluate English self-efficacy was empirically tested, which are similar to previous proposals (Wang et al., 2013, 2014; Chauvin et al., 2020; Kim et al., 2021). The factors were named: (a) reading, which refers to the belief in the ability to understand written texts in English; (b) writing, refers to the confidence in the ability to produce written texts in English with meaning and coherence (Wang et al., 2013, 2014; Kim et al., 2021); and (c) oral communication which refers to the confidence in the ability to communicate effectively through spoken language (Chauvin et al., 2020). Additionally, the factor loads were adequate, greater than 0.50, indicating a robust factor structure, as the items are influenced in a homogeneous and strong manner by the latent variable. This means that each aspect of the construct of English self-efficacy is adequate and relevant to the construct being studied, and allows subjects with high and low English self-efficacy to be differentiated (Kline, 2016).
Reliability of the ESS-P was assessed in terms of each dimension using Cronbach’s Alpha and McDonald’s Omega (ω) coefficients, which indicated that the ESS-P is internally consistent with values greater than 0.70 (Raykov and Hancock, 2005). Therefore, the ESS-P is considered a reliable tool.
In terms of convergent validity, which seeks to ensure that the items of a variable reflect the corresponding factor, it is measured through the positive correlation between the indicators of the same variable. Average Variance Extracted (AVE) is a common indicator for measuring convergent validity and is considered acceptable if it is greater than 0.5 (Fornell and Bookstein, 1982). An AVE less than 0.5 indicates that the explained variance is less than the error variance (Chin, 1998). In the present three-dimensional model, the AVE exceeded the threshold with a value greater than 0.50, which is considered acceptable. Although convergent validity was not adequate, as the AVE values were lower than the shared variance between factors (AVE > φ2) in all cases, and the correlations between factors were greater than 0.80, a second-order model with 3 factors was proposed (Mason and Perreault, 1991; Schwarz et al., 2014), similar to previous studies (Wang et al., 2014; Ngoc Truong and Wang, 2019; Chauvin et al., 2020).
The factor invariance analysis in relation to the ESS-P and sex has shown the stability of the thresholds, factor loadings, intercepts, and residuals across groups. This indicates that the items measure the latent variable in the same way for both men and women (Brown, 2015). Therefore, it can be stated that differences in the scores of men and women are due to differences in the latent trait and not a bias in the measurement instrument. These results are important because they will allow for future studies on self-efficacy based on sex and provide useful information for its application.
The evidence based on the relationship with other variables showed that the ESS-P is correlated with academic variables such as self-efficacy and anxiety, showing positive correlations of the scores of English self-efficacy with academic self-efficacy and negative correlation with anxiety toward the exams, which would be supported by previous studies (Preiss et al., 2006; Wang et al., 2018). Similarly, the correlation between the ESS-P and the academic level of the participants was verified, because there is evidence of the association of self-efficacy for English and academic achievement (Kitikanan and Sasimonton, 2017).
4.1. Implications
The English Self-Efficacy Scale (ESS-P) provides a valuable resource for educators and professionals to assess students’ self-efficacy beliefs in reading, writing, and oral communication skills in English. This information can guide professionals in identifying areas where students may need additional support, designing interventions to improve their self-efficacy, and ultimately, their English skills. Furthermore, by taking into account gender differences in self-efficacy, professionals can tailor their strategies to address the specific needs of men and women in learning English. Additionally, policymakers can use the findings of these studies to inform the development of programs and educational policies that promote English learning in Spanish-speaking communities. This may include creating curricula and educational materials that cater to the specific needs of self-efficacy in reading, writing, and oral communication, as well as implementing teacher training programs that teach them how to assess and foster self-efficacy in their students. Also, empirical evidence supports the three-factor structure of the ESS-P. This can serve as a foundation for future research that delves deeper into self-efficacy and its relationship with English learning in different contexts and populations.
4.2. Limitations
The sample for this study focused on Peruvian university students, which limits the generalization of the results to other contexts and populations. Future research could expand the sample to include students from different educational levels, age groups, and cultural contexts, allowing for the evaluation of the applicability and validity of the ESS-P in various situations. Moreover, the cross-sectional design of the study does not allow for establishing causal relationships between English self-efficacy and other variables, such as academic performance or life satisfaction. Future research could employ longitudinal or experimental designs to examine the effects of English self-efficacy on learning and success over time. Additionally, a test–retest analysis was not conducted to assess the temporal stability of the ESS-P, thus including this evaluation in future studies is recommended. Although acceptable convergent validity was identified through AVE, it is suggested to analyze the items using an Item Response Theory model. While the sample was heterogeneous, with different levels of English proficiency, these differences did not significantly affect the model’s goodness of fit. However, future studies may need to examine these differences among populations, as this was not the main objective of the present study. Furthermore, researchers could explore the interaction of self-efficacy with other variables, such as motivation, learning strategies, and social support, to gain a more comprehensive understanding of the factors contributing to English learning.
5. Conclusion
The ESS-P is a valid and reliable instrument that assesses three dimensions of English self-efficacy: reading, writing, and oral communication for measuring English in Peruvian university students. Moreover, it has been demonstrated that the scale is invariant across gender, allowing for comparisons between men and women. The development of the scale contributes to the field of research in teaching and learning English as a second language in Spanish-speaking contexts. The scale can be a useful tool for researchers and practitioners seeking to understand and improve English self-efficacy and, ultimately, student performance and motivation. The implications of these findings can inform educational practices and policies, focusing on promoting English self-efficacy and addressing gender differences.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The research protocol was approved by the Research Ethics Committee of the Universidad Peruana Unión with reference number CE-EPG-0000119. The patients/participants provided their written informed consent to participate in this study.
Author contributions
MM-T, DC-A, and WM-G participated in the conceptualization of the idea. LS-S, OR-L, and SM-G were in charge of the methodology and software. WM-G, MM-T, FS-S, and DC-A were responsible for validation, formal analysis, and research. Data curation and resources were commissioned by WM-G, LS-S, SM-G, FS-S, and OR-L. The writing of the first draft, review and editing, visualization and supervision were carried out by MM-T, WM-G, DC-A, LS-S, SM-G, and OR-L. All authors contributed to the article and approved the submitted version.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1187342/full#supplementary-material
References
Aiken, L. R. (1985). Three coefficients for analyzing the reliability and validity of ratings. Educ. Psychol. Meas. 45, 131–142. doi: 10.1177/0013164485451012
Anam, S., and Stracke, E. (2020). The role of self-efficacy beliefs in learning English as a foreign language among young Indonesians. TESOL J. 11, 1–12. doi: 10.1002/tesj.440
Ato, M., López, J. J., and Benavente, A. (2013). Un sistema de clasificación de los diseños de investigación en psicología. Anales de Psicologia 29, 1038–1059. doi: 10.6018/analesps.29.3.178511
Bai, B., Nie, Y., and Lee, A. N. (2020). Academic self-efficacy, task importance and interest: relations with English language learning in an Asian context. J. Multiling. Multicult. Dev. 43, 438–451. doi: 10.1080/01434632.2020.1746317
Bandura, A. (1977). Self-efficacy: toward a unifying theory of behavioral change. Psychol. Rev. 84, 191–215. doi: 10.1037//0033-295x.84.2.191
Bandura, A. (2006). “Guide for constructing self-efficacy scales” in Self-efficacy beliefs of adolescents. eds. T. Urdan and F. Pajares (Greenwich, CT: Information Age Publishing), 5, 307–337.
Bandura, A. (2008). “Toward an agentic theory of the self” in Self-processes, learning, and enabling human potential: Dynamic new approach. eds. H. W. Marsh, R. G. Craven, and D. M. McInerey (Greenwich, CT: Information Age Publishing), 15–49.
Betz, N. E. (2013). “Assessment of self-efficacy” in APA handbook of testing and assessment in psychology, Vol. 2: Testing and assessment in clinical and counseling psychology. eds. K. F. Geisinger, B. A. Bracken, J. F. Carlson, J.-I. C. Hansen, N. R. Kuncel, and S. P. Reise (American Psychological Association), 379–391.
Botes, E., Gottschling, J., Stadler, M., and Greiff, S. (2020). A systematic narrative review of international posture: what is known and what still needs to be uncovered. System 90:102232. doi: 10.1016/j.system.2020.102232,102232
Brown, T. A. (2015). Confirmatory factor analysis for applied research. 2nd Edn. New York, NY: Guilford Press.
Burns, A. (2019). Concepts for teaching speaking in the English language Classroom1. Learn J 12, 1–11.
Byrne, B. M. (2008). Testing for multigroup equivalence of a measuring instrument: a walk through the process. Psicothema 20, 872–882.
Byrne, B. M., and Stewart, S. M. (2009). TEACHER’S CORNER: the MACS approach to testing for multigroup invariance of a second-order structure: a walk through the process. Struct. Equ. Model. Multidiscip. J. 13, 287–321. doi: 10.1207/S15328007SEM1302_7
Chauvin, R., Fenouillet, F., and Scott Brewer, S. (2020). An investigation of the structure and role of English as a foreign language self-efficacy beliefs in the workplace. System 91, 1–17. doi: 10.1016/j.system.2020.102251,102251
Chen, F. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct. Equ. Model. Multidiscip. J. 14, 464–504. doi: 10.1080/10705510701301834
Chen, X., Lake, J., and Padilla, A. M. (2020). Self-views and positive psychology constructs among second language learners in Japan, Taiwan, and the United States. Front. Psychol. 11:2176. doi: 10.3389/fpsyg.2020.02176
Chin, W. W. (1998). “The partial least squares approach for structural equation modeling” in Modern methods for business research. ed. G. A. Marcoulides (Mahwah, NJ: Lawrence Erlbaum Associates Publishers), 295–336.
Clark, L. A., and Watson, D. (1995). Constructing validity: basic issues in objective scale development. Psychol. Assess. 7, 309–319. doi: 10.1037/1040-3590.7.3.309
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika 16, 297–334. doi: 10.1007/BF02310555
Dimitrov, D. M. (2010). Testing for factorial invariance in the context of construct validation. Meas. Eval. Couns. Dev. 43, 121–149. doi: 10.1177/0748175610373459
Dominguez-Lara, S. A. (2018). Single-item of test anxiety: evidence of convergent and incremental validity in college students. Educacion Medica 19, 264–270. doi: 10.1016/j.edumed.2017.04.004
Dominguez-Lara, S. A., Navarro-Loli, J. S., and Prada-Chapoñan, R. (2019). Single item academic self-efficacy: further validity evidences with big five model in college students. Avaliacao Psicologica 18, 210–217. doi: 10.15689/AP.2019.1802.16070.12
Education First English Proficient Index. (2022). EF EPI Índice EF de nivel de inglés. Available at: https://bit.ly/3uCGQoO
Eignor, D. R. (2013). “The standards for educational and psychological testing” in APA handbook of testing and assessment in psychology, Vol. 1: Test theory and testing and assessment in industrial and organizational psychology. eds. K. F. Geisinger, B. A. Bracken, J. F. Carlson, J.-I. C. Hansen, N. R. Kuncel, and S. P. Reise (Washington, DC: American Psychological Association)
Finch, W. H., and French, B. F. (2018). A simulation investigation of the performance of invariance assessment using equivalence testing procedures. Struct. Equ. Modeling 25, 673–686. doi: 10.1080/10705511.2018.1431781
Finney, S. J., and DiStefano, C. (2006). “Non-normal and categorical data in structural equation modeling” in Structural Equation Modeling: A Second Course. eds. G. R. Hancock and R. O. Mueller (Greenwich, CT: Information Age), 269–314.
Fornell, C., and Bookstein, F. L. (1982). Two structural equation models: LISREL and PLS applied to consumer exit-voice theory. J. Market. Res. 19:440. doi: 10.2307/3151718
Fornell, C., and Larcker, D. F. (1981). Structural equation models with unobservable variables and measurement error: algebra and statistics. J. Market. Res. 18, 382–388. doi: 10.1177/002224378101800313
Gabryś-Barker, D., and Gałajda, D. (2016). Positive psychology perspectives on foreign language learning and teaching. Int J Bil Educ Bilingual. Switzerland: Springer International Publishing. doi: 10.1007/978-3-319-32954-3
García-Álvarez, D., Cobo-Rendón, R., and Hernández-Lalinde, J. (2022). Validity, reliability and factorial invariance of the general self-efficacy and academic self-efficacy scales in university students. Retos 46, 1093–1104. doi: 10.47197/retos.v46.94281
Gilakjani, A. P., and Sabouri, N. B. (2016). Learners’ listening comprehension difficulties in English language learning: a literature review. Engl. Lang. Teach. 9, 123–133. doi: 10.5539/elt.v9n6p123
Gilakjani, A. P., Sabouri, N. B., Pourhosein, A., and Branch, L. (2016). English language teaching 9, 123–133. doi: 10.5539/elt.v9n6p123,
Hayes, A. F., and Coutts, J. J. (2020). Use omega rather than Cronbach’s alpha for estimating reliability. But…. Commun. Methods Meas. 14, 1–24. doi: 10.1080/19312458.2020.1718629
Kaiser, H. F. (1974). An index of factorial simplicity. Psychometrika 39, 31–36. doi: 10.1007/BF02291575
Kim, D. H., Wang, C., and Truong, T. N. N. (2021). Psychometric properties of a self-efficacy scale for English language learners in Vietnam. Lang. Teach. Res. 1–16:136216882110278. doi: 10.1177/13621688211027852
Kitikanan, P., and Sasimonton, P. (2017). The relationship between English self-efficacy and English learning achievement of L2 Thai learners. LEARN J 10, 149–164.
Kline, R. B. (2016). Principles and practice of structural equation modeling. New York: Guilford Press.
Kutuk, G., Putwain, D. W., Kaye, L. K., and Garrett, B. (2022). Relations between gender stereotyping and foreign language attainment: the mediating role of language learners’ anxiety and self-efficacy. Br. J. Educ. Psychol. 92, 212–235. doi: 10.1111/BJEP.12446
Lloret-Segura, S., Ferreres-Traver, A., Hernández-Baeza, A., and Tomás-Marco, I. (2014). El análisis factorial exploratorio de los ítems: Una guía práctica, revisada y actualizada. Anales de Psicologia 30, 1151–1169. doi: 10.6018/analesps.30.3.199361
Mason, C. H., and Perreault, W. D. (1991). Collinearity, power, and interpretation of multiple regression analysis. J. Market. Res. 28, 268–280. doi: 10.1177/002224379102800302
Mercer, S., and MacIntyre, P. D. (2014). Introducing positive psychology to SLA. Stud Second Lang Learn Teach 4, 153–172. doi: 10.14746/ssllt.2014.4.2.2
Mercer, S., MacIntyre, P., Gregersen, T., and Talbot, K. (2018). “Positive language education: combining positive education and language education” in Theory and practice of second language acquisition, vol. 4
Milfont, L., and Fischer, R. (2010). Testing measurement invariance across groups: applications in cross-cultural research. Int J Psychol Res 3, 111–130. doi: 10.21500/20112084.857
Montaño-González, J., and Cancino, M. (2020). Exploring the Relationship between Language Learning Strategies and Self-Efficacy of Chilean University EFL Students. Mextesol Journal, 44, 1–11.
Muñiz, J., and Fonseca-Pedrero, E. (2019). Ten steps for test development. Psicothema 31, 7–16. doi: 10.7334/psicothema2018.291
Muthén, L. K., and Muthén, B. O. (2017). Mplus user’s guide: statistical analysis with latent variables. 8th Edn. Los Angeles, CA: Muthén & Muthén.
Newton, J. M., Ferris, D. R., Goh, C. C. M., Grabe, W., Stoller, F. L., and Vandergrift, L. (2018). Teaching English to second language learners in academic contexts: Reading, writing, listening, and speaking. New York, NY: Routledge.
Ngoc Truong, T. N., and Wang, C. (2019). Understanding Vietnamese college students’ self-efficacy beliefs in learning English as a foreign language. System 84, 123–132. doi: 10.1016/j.system.2019.06.007
Pajares, F., Hartley, J., and Valiante, G. (2001). Response format in writing self-efficacy assessment: greater discrimination increases prediction. Meas. Eval. Couns. Dev. 33, 214–221. doi: 10.1080/07481756.2001.12069012
Preiss, R. W., Gayle, B. M., and Allen, M. (2006). “Test anxiety, academic self-efficacy, and study skills: a Meta-analytic review” in Classroom communication and instructional processes: Advances through meta-analysis. eds. B. Mae Gayle, R. W. Preiss, N. Burrell, and M. Allen (London: Lawrence Erlbaum Associates Publishers), 99–111.
Ramos-Villagrasa, P. J., Sánchez-Iglesias, I., Grande-de-Prado, M., Oliván-Blázquez, B., Martín-Peña, J., and Cancer-Lizaga, P. (2018). Versión española de la “Self-Efficacy for Writing Scale” (SEWS). Anales de Psicologia 34, 86–91. doi: 10.6018/analesps.34.1.264931
Raykov, T., and Hancock, G. R. (2005). Examining change in maximal reliability for multiple-component measuring instruments. Br. J. Math. Stat. Psychol. 58, 65–82. doi: 10.1348/000711005X38753
Rust, J., Kosinski, M., and Stillwell, D. (2021). Modern psychometrics: the science of psychological assessment 4th Edn. New York, NY: Routledge.
Schumacker, R. E., and Lomax, R. G. (2016). A Beginner’s guide to structural equation modeling. 4th Edn. New York: Routledge.
Schwarz, C., Schwarz, A., and Black, W. C. (2014). Examining the impact of multicollinearity in discovering higher-order factor models. Commun. Assoc. Inf. Syst. 34, 1191–1208. doi: 10.17705/1cais.03463
Seligman, M. E. P. (2004). Authentic happiness: using the new positive psychology to realize your potential for lasting fulfillment. Am. J. Psychiatry 161, 936–937. doi: 10.1176/appi.ajp.161.5.936
Seligman, M. E. P., and Csikszentmihalyi, M. (2000). Positive psychology. An introduction. Am. Psychol. 55, 5–14. doi: 10.1037/0003-066X.55.1.5
Suárez-Alvarez, J., Pedrosa, I., Lozano, L. M., García-Cueto, E., Cuesta, M., and Muñiz, J. (2018). Using reversed items in likert scales: a questionable practice. Psicothema 30, 149–158. doi: 10.7334/psicothema2018.33
Sun, T., Wang, C., Lambert, R. G., and Liu, L. (2021). Relationship between second language English writing self-efficacy and achievement: a meta-regression analysis. J. Second. Lang. Writ. 53, 1–14. doi: 10.1016/j.jslw.2021.100817,100817
Velasco-Zárate, K., and Meza-Cano, J. M. (2020). Self-efficacy for the learning of academic writing in ESL mediated by webquest-wiki in a group of pre-service language teachers in Mexico. Ikala 25, 289–305. doi: 10.17533/udea.ikala.v25n02a07
Wang, C. (2004). Self-regulated learning strategies and self-efficacy beliefs of children learning English as a second language [Doctoral dissertation, Ohio State University].
Wang, C., and Bai, B. (2017). Validating the instruments to measure ESL/EFL learners’ self-efficacy beliefs and self-regulated learning strategies. TESOL Q. 51, 931–947. doi: 10.1002/tesq.355
Wang, C., Harrison, J., Cardullo, V., and Lin, X. (2018). Exploring the relationship among international students’ English self-efficacy, using English to Learn self-efficacy, and academic self-efficacy. J. Int. Stud. 8, 233–250. doi: 10.5281/zenodo.1134299
Wang, C., Kim, D. H., Bai, R., and Hu, J. (2014). Psychometric properties of a self-efficacy scale for English language learners in China. System 44, 24–33. doi: 10.1016/j.system.2014.01.015
Wang, C., Kim, D. H., Bong, M., and Ahn, H. S. (2013). Examining measurement properties of an English self-efficacy scale for English language learners in Korea. Int. J. Educ. Res. 59, 24–34. doi: 10.1016/j.ijer.2013.02.004
Keywords: self-efficacy, English, students, psychometric properties, invariance
Citation: Mendoza-Torres ME, Morales-García WC, Sairitupa-Sanchez LZ, Morales-García SB, Rivera-Lozada O, Sucapuca-Sucapuca FE and Cunza-Aranzábal DF (2023) Psychometric properties and invariance of an English self-efficacy scale for university students in Peru. Front. Psychol. 14:1187342. doi: 10.3389/fpsyg.2023.1187342
Edited by:
J. Lake, Fukuoka Jo Gakuin University, JapanReviewed by:
Jingjing Huang, Hefei Normal University, ChinaAntonio Luque, University of Almeria, Spain
Copyright © 2023 Mendoza-Torres, Morales-García, Sairitupa-Sanchez, Morales-García, Rivera-Lozada, Sucapuca-Sucapuca and Cunza-Aranzábal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Wilter C. Morales-García, wiltermorales@upeu.edu.pe