Skip to main content

ORIGINAL RESEARCH article

Front. Psychiatry , 26 February 2025

Sec. Public Mental Health

Volume 16 - 2025 | https://doi.org/10.3389/fpsyt.2025.1528509

Further validation of the Chinese short Warwick Edinburgh mental wellbeing scale in the adult population of Macau: an application of classic test theory and item response theory

  • 1Faculty of Medicine, Macau University of Science and Technology, Macao, Macao SAR, China
  • 2Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, Australia
  • 3Faculty of Health, University of Technology Sydney, Sydney, NSW, Australia
  • 4Department of Health and Biomedical Sciences, RMIT University, Melbourne, VIC, Australia

Background: This study aims to validate the Chinese version of the Short Warwick-Edinburgh Mental Well-being Scale (SWEMWBS) by employing both Classical Test Theory (CTT) and Item Response Theory (IRT) approaches.

Methods: Data were gathered through a population-based, cross-sectional health survey using an online self-reported questionnaire. The scale underwent Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA). Measurement invariance by gender was assessed using standard procedures. The Grade Response Model (GRM) of the IRT analysis was applied to the data, estimating the discrimination and difficulty parameters at different thresholds. The results were analyzed both graphically and through parameter values.

Results: Factor analyses confirmed that a single-factor model of the scale fit the data well, with an overall Eigenvalue of 4.55, explaining 65.0% of the total variance. Model fit statistics were slightly better for males than for females. Measurement invariance examinations also yielded satisfactory Goodness-of-Fit statistics (CFI = 0.940, TFI = 0.910, RMSEA < 0.001) with minimal changes in item loadings and indicator threshold patterns across groups. The IRT results demonstrated high discrimination parameters, ranging from 2.17 to 3.67, and nearly evenly distributed difficulty parameters, ranging from -2.23 to 1.77. Graphical examinations indicated good performance of the scale across the latent trait continuum.

Conclusions: The results indicated that, as a single-factor scale, the instrument exhibits good quality at both the scale and item levels. It has high discriminative power and an adequate response set for assessing a full range of the latent trait, namely mental well-being.

Introduction

Mental well-being, often synonymous with positive mental health, encompasses a multifaceted understanding of self-acceptance, personal growth, resilience, autonomy, and mastery of one’s environment (1). Rather than solely focusing on mental illness, the World Health Organization (WHO) increasingly emphasizes the significance of positive mental health for overall population well-being (2). Over the past two decades, mental well-being has garnered substantial attention. The WHO defines positive mental health as a “state of well-being in which individuals recognize their own abilities, effectively cope with life’s normal stresses, contribute productively, and engage with their community” (3). This definition underscores that mental health extends beyond the mere absence of illness (4).

Based on the initial concept of mental well-being proposed by scholars such as Jahoda, Kayes et al., Waterman et al., and others (1, 58), Tennant and colleagues proposed a two-dimensional model of mental well-being consisting of hedonic and eudaimonic aspects (9, 10). The hedonic aspect refers to the individual’s subjective feelings of happiness and life satisfaction, whereas the eudaimonic aspect relates to psychological functioning and the actualization of an individual’s potential, capacity, and positive relationship with self and others. These early models significantly influenced the development and validation of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS) (9, 10).

Since the scale’s publication in the UK, the WEMWBS has been translated into many languages and used in various countries and regions. Interested readers can visit the official website for a list of translations (https://warwick.ac.uk/fac/sci/med/research/platform/wemwbs/using/translations/). Numerous validation studies have been conducted on both the original 14-item scale (1116) and the 7-item Short Warwick-Edinburgh Mental Well-being Scale (SWEMWBS) (1728). The Chinese version of the SWEMWBS was first translated by Ng et al. in 2014 (17) and has been used in several studies since then (2123, 2933).

In terms of validating the SWEMWBS, many studies have utilized classical psychometric evaluations to examine reliability and validity, including factor analyses for structural validity and conventional techniques for convergent and divergent validity (20, 23, 25, 26, 3436). Some validation studies also included measurement invariance assessments to provide further evidence for the scale’s validity (20, 34, 35). The exploratory and confirmatory factor analysis results of most studies affirmed that a single-factor model with global goodness-of-fit indices fit best to the data (20, 23, 25, 26, 34). Reliability assessments indicated reasonable reliability for the single-factor scale, with Cronbach’s alpha values ranging from 0.77 to 0.88 (20, 26). Sarasjärvi et al. reported a reliability measure of McDonald’s omega at 0.92 (25). For convergent validity, Ng et al. found moderate correlations between the scale and the General Health Questionnaire (GHQ-12) (r = -0.52, p < 0.001), Happiness Index (r = 0.53, p < 0.001), and EQ-VAS (r = 0.40, p < 0.001) (23). Koushede et al. reported moderate to high correlations with WHO-5 (r = 0.75, p < 0.05), Self-rated Health (r = 0.48, p < 0.05), PHQ-4 (r = -0.58, p < 0.05), and PSS (r = 0.65, p < 0.05) (20). Melendez-Torres et al. found correlations with Life Satisfaction (r = 0.50, p < 0.001), Somatization (r = -0.53, p < 0.001), School Pressure (r = -0.30, p < 0.001), and Bullying Victimization (r = -0.30, p < 0.001) (35). More recently, Sarasjärvi et al. found high correlations with GHQ-12 (r = -0.758, p < 0.001), MHI-5 (r = 0.731, p < 0.001), BDI-6 (r = -0.866, p < 0.001), and EUROHIS-QOL8 (r = 0.718, p < 0.001) (25). However, low to moderate correlations were found between SWEMWBS items and Need Satisfaction (r = 0.544, p < 0.001), Need Frustration (r = -0.359, p < 0.001), and Depression (r = -0.33, p < 0.001) in Zayed et al.’s study (26). Criterion validity was assessed in some studies (20, 33). For example, Ng et al. found that participants with lower self-rated health were more likely to have lower than medium well-being (OR = 9.1, 95% CI = 8.05-11.23) (20).

The Rasch Model and Item Response Theory (IRT) have been used as alternative approaches for validating WEMWBS for a long time (37, 38). These methods have become more frequent in recent validation studies on SWEMWBS (14, 28, 39). Results from these studies suggest that all items have high discrimination ability with relatively large discrimination parameters. For difficulty parameters, the scale appears to measure and discriminate respondents better with a negative latent trait, indicating a lower level of mental well-being (28, 39). Overall, the scale showed good functioning along the latent trait continuum, particularly within the range of about -2.0 to 2.0 (28, 39).

Few studies have employed both Classical Test Theory (CTT) and IRT to examine the structural validity of the SWEMWBS using factor analyses. Smith et al. evaluated the scale with factorial analyses and also utilized the Graded Response Model (GRM) of IRT to examine age differences in the latent factor (Theta). No statistically significant differences were found between middle-aged and young adults (14).

For the Chinese SWEMWBS, of the eight studies identified in the literature (17, 2123, 2932), four validation studies employed the classical approach (17, 2123). These studies utilized samples based in a single location, namely Hong Kong, except for one conducted in China. One study involved patients with mental illness (17), another university students (21), and the China study utilized small samples of healthy participants and patients with chronic diseases (33). Sun et al. conducted the only study on the general population (22). To the author’s knowledge, no study has been conducted on the Chinese SWEMWBS in the community employing both classical psychometric evaluation and IRT. Hence, this study aims to further validate the Chinese SWEMWBS using a community sample of Macau residents and adopting both Classical Test Theory, particularly factor analysis and measurement invariance, and IRT approaches.

Methods

Sample and data collection

Data utilized in this study was collected from a population-based cross-sectional health survey using a self-reported online questionnaire. The survey was conducted between April and July 2024 among adult residents in Macau. The online questionnaire was distributed through 23 associations and societies involved in the city-wide survey as collaborating partners. These collaborating partners represented many different natures of groupings, with some being professional bodies, others being community associations, and some non-government organizations. With the support of the management of these organizations, members were encouraged to participate in the survey through public appeals and personal invitations. The total number of potential participants was more than 50,000, accounting for nearly 9% of the adult population in Macau. The sample consisted of 1460 respondents, with 1001 females (68.6%) and nearly 45% in the age group of 18-34 years (n=655, 44.9%). Ethics approval was obtained from the Faculty Ethics Research Committee of the Faculty of Medicine, Macau University of Science and Technology (MUST-FMD-200402025001).

Measure

The short form of the Warwick-Edinburgh mental well-being scale

The 14-item WEMWBS was validated and widely used in many mental well-being studies. It had good content and structural validity with a single factor demonstrated by Confirmatory factor analysis (9). It also showed high reliability, with Cronbach’s alpha scores ranging from 0.89 to 0.91. WEMWBS correlated strongly with other mental health and well-being scales and lower correlations with scales measuring overall health with a test-retest reliability of 0.83 at one week (9). The Short Warwick Edinburgh Mental Wellbeing Scale (SWEMWBS). (SWEMWBS © NHS Health Scotland, University of Warwick and University of Edinburgh, 2008, all rights reserved) was derived from the original scale with 7 items by Stewart-Brown et al. in 2009 (37). As with the full scale, the short form was also designed as a scale with a 5-point frequency Likert response set ranging from 1=rarely to 5=all the time, resulting in a minimum total score of 7 and a maximum of 35. More information on the SWEMWBS can be found on the official website. (https://www.corc.uk.net/outcome-experience-measures/short-warwick-edinburgh-mental-wellbeing-scale-swemwbs/) The authors recommended that the raw scores of the SWEMWBS should be converted to the provided corresponding matric scores before data analysis and interpretation.

Data analysis

Data were analyzed using the STATA statistical software (StataNow 18.5). For structural validity, the scale was subjected to Exploratory Factor Analysis (EFA) and Confirmatory Factor Analyses (CFA) with two separate subsets of data, the test and the validation datasets, generated randomly and evenly from the total dataset. To determine whether the 7-item scale was suitable for factor analysis, Barlette’s test for sphericity and the Kaiser-Meyer-Olkin (KMO) measure of sample adequacy were used. Upon the satisfaction of these examinations, EFA was conducted using principal component factors with orthogonal varimax rotation as the extraction method. The factor model was examined using the Eigenvalue (>1.0) and the overall model statistics with a Likelihood Ratio test. CFA was carried out on the validation dataset to further examine the factor structure of the scale. Structural Equation Modelling (SEM) was used as the analytical approach. The maximum likelihood estimation for covariance structure analysis was used to examine the factorial structure after testing for multivariate normality. Results indicated that the assumption of multivariate normality was not violated (Mardia mSkewness, Mardia mKurtosis, and Henze-Zirkler tests p>0.05). The goodness of fit of the model to the data was determined using multiple criteria, including the Chi-square value (χ2, p>0.05), the Comparative Fit Index (CFI, >0.90), the Root Mean-Square Error of Approximation (RMSEA, app. = 0.06), the Tucker–Lewis index (TLI, >0.90), the Standardized root mean squared residual (SRMR, <0.08), and the Akaike’s information criterion (AIC) and Bayesian information criterion (BIC), as suggested by Byrne (36). The factor loading of each item was also calculated and reported using the path diagram. Upon confirming the unidimensional structure of the scale, the scale was subjected to further examinations of the measurement invariance properties. The standard procedures of the measurement invariance were applied to ascertain whether there were any differences in the factor structure and the measurement between gender groups. Four different models were fitted to the data to examine the hierarchy of factors invariance, with model 1 being the configurational model, which is the less restrictive model. Model 2, the loadings model, and Model 3, the loadings and thresholds, were then fitted and followed by the additive model, including the loading, threshold, and residual as Model 4. At each step, the invariance test was conducted to compare between models to determine whether the fitted model was worse off than the previous model. As noted by Anthony et al., there was a lack of agreement in the indices of the invariance test for categorical and ordinal data; the decision to compare the model could only be based on the current recommendation (34). Adopting a similar approach, Cheung et al. recommended that a decrement in the CFI or RMSEA by a magnitude of -0.01 in the model in comparison to the previous model suggested minimal measurement variance (40).

In terms of the IRT approach, while agreeing with the view of Hanzlová et al. that the response set of the items is polytomous and ranked, thus the Grade Response Model (GRM) of the IRT analysis is an appropriate approach (28). As defined by the GRM, two main parameters were estimated, namely the discriminant and the difficulty parameters at different thresholds (refers to the latent trait and symbolized by theta θ). The discriminant parameter refers to how well the item could be used to distinguish respondents with different levels of the latent trait. In the current analysis, four difficulty parameters were estimated for each item since there were 5 responses in the response set (4 ranks). In the GRM, the difficulty parameters of different thresholds could be understood as a 50% chance that the respondent would choose a specific response or higher on the scale (e.g. choosing sometimes, often, or all the time). For determining the acceptable values of these parameters, it was suggested that a value within the range between 0 and ≥ 2 and -2 to 2 for discriminant and difficulty, respectively (41). The range of discrimination parameter values and the interpretation were listed in the Supplementary Information. For assessing whether the GRM fitted well with the data, the results were examined by the values of the parameters and graphically including the Item Characteristics Curves (ICC), Category Characteristics Curves (CCC), Item Information Functions (IIF), and the Test Information Function (TIF). Similar to the measurement invariance examination in the factor analysis, the GRM was also fitted by gender groups to investigate for any difference in the item responses between males and females.

Results

Factor analysis

Before conducting the Exploratory Factor Analysis (EFA), the sample was randomly split into two equal datasets, resulting in 730 data points for both the testing and confirmatory datasets. To assess the suitability of the testing data for factor analysis, the Bartlett test of sphericity and the Kaiser-Meyer-Olkin (KMO) Measure of Sampling Adequacy were applied, yielding results of χ² (21) = 3016.271, p < 0.0001 and KMO = 0.908, respectively. These results suggested that factor analyses were appropriate for analyzing these data. The EFA results indicated that a single-factor model fit the data well, with an Eigenvalue of 4.55, explaining 65.0% of the total variance. Separate analyses by sex resulted in slightly better model fits for males than females, with Eigenvalues of 4.74 and 4.44, and 67.7% and 63.4% of the total variance explained, respectively (Table 1). The factor structure was further examined using Confirmatory Factor Analysis (CFA) with the confirmatory dataset. As summarized in Table 1, the Goodness of Fit statistics suggested a good fit of the single-factor model to the data, with all statistics meeting the criteria except the RMSEA, which was slightly larger than the acceptable value for a good fit. The factor loadings of the items ranged from moderately high to high (0.673 to 0.843). The loadings for females were generally lower than those for males, with slight deviations in model fit statistics. The path diagram of the CFA is depicted in Figure 1.

Table 1
www.frontiersin.org

Table 1. Results of the exploratory and confirmatory factor analyses.

Figure 1
www.frontiersin.org

Figure 1. Path diagram of the confirmatory factor analysis.

Measurement invariance of the scale was examined by fitting four different models with increasing constraints by sex. The results are presented in Table 2. The configurational (baseline) model demonstrated an acceptable fit to the data with reasonably satisfactory Goodness-of-Fit statistics (CFI = 0.940, TFI = 0.910, RMSEA < 0.001), although the RMSEA was slightly inflated. This suggested that the constructs were similar between the male and female groups regarding free and fixed loadings. When more restrictive models were fitted, the Goodness-of-Fit statistics revealed minimal changes with increasing model restrictions. As shown in Table 2, the changes in CFI were less than -0.01 across models except for the additive model. Similar patterns of changes were observed for the RMSEA values. These results suggested that the item loadings and indicator thresholds exhibited similar patterns across groups.

Table 2
www.frontiersin.org

Table 2. Measurement invariance tests of the SWEMWBS by sex.

IRT analysis

The discrimination and difficulty parameters at different thresholds were estimated using the Graded Response Model (GRM) of Item Response Theory (IRT) analysis. The results for these parameters across the full dataset are summarized in Table 3. The discrimination parameters for all items were high, with values greater than 2, ranging from 2.17 for item 1 to 3.67 for item 4 (Table 3). This indicated that the items had good discriminating ability to distinguish respondents with different latent trait levels. These results were further verified by the Item Information Functions (IIFs) (Figure 2), showing item 1 at the lowest position with the lowest discrimination parameter value, containing less information as exhibited by a flat curve. In contrast, item 4 had the highest discriminating value, occupying the highest position in the graph and containing more information as reflected in the multiple wave shapes, indicating variability in the curve (Figure 2).

Table 3
www.frontiersin.org

Table 3. Discriminant and difficulty parameters obtained from the GRM for the full sample and by sex.

Figure 2
www.frontiersin.org

Figure 2. Item Information Functions (IFF) of the scale.

Regarding the difficulty parameters for different thresholds, of the 28 parameter estimates, ranging from -2.23 for item 7 to 1.77 for item 3 (Table 3), slightly more than half (16, 57%) were negative, and 12 were positive. This suggested that these items could measure and distinguish respondents with negative latent traits (mental well-being) slightly better than those with positive traits. However, the differences were not significant. Some differences in the difficulty parameters at various thresholds were observed between and within items. The difference between items reflected that individuals endorsing the same category threshold for different items would have different degrees of the latent trait. The differences in the difficulty parameters within each item also varied across all 7 items. For instance, the differences in item 1 were 0.90, 1.03, and 1.18, whereas the differences in item 7 were 0.95, 1.05, and 1.42. This suggested that item 1 provided slightly better functioning than item 7. The differences in difficulty parameters were lowest in item 2, with values of 0.80, 1.05, and 1.02. These results were supported by the Item Characteristic Curves (ICCs), which show various item characteristics in Figure 3.

Figure 3
www.frontiersin.org

Figure 3. Item Characteristics Curves (ICC) of each item.

The results depicted in the Category Characteristics Curves (CCCs) provided additional information on the scale’s functioning in terms of item responses (Figure 4). Each item’s plot exhibited distinct peaks along the latent trait spectrum without closely overlapping or superimposing on each other. This suggested that each response category had its probability distribution at all levels of the latent trait. Figure 4 displayed a satisfactory set of CCCs for these items, with items 4 and 5 having sharper peaks than the other items. For the full scale, the Test Information Function (TIF) graph indicated good performance across the entire latent trait range, particularly within the range of -2.0 to about 1.8 of the theta value, with minimal standard errors (Figure 5). The Test Characteristics Curve (TCC) also provided the predicted scores of 10.8, 23.5, and 33.5 corresponding to -1.96, 0, and 1.96 of the theta values (Supplementary Figure S1 in supplementary materials).

Figure 4
www.frontiersin.org

Figure 4. Category Characteristics Curves (CCC) of each item.

Figure 5
www.frontiersin.org

Figure 5. Test Information Function (TIF) of the scale.

Similar to the factor analysis, the IRT analysis by gender was also conducted, with results reported in Table 3 and Supplementary Figures S2, S3, S4 in the supplementary materials. As shown, there were differences in the discrimination and difficulty parameters at different thresholds between male and female respondents (Table 3). Overall, the discrimination parameter estimates for males were larger than those for females, particularly for item 4, with values of 4.23 for males and 2.60 for females. Differences in the difficulty parameters at different thresholds were also observed, with larger values for males on the negative latent trait than for females, but larger values for females on the positive latent trait than for males. This suggested differences in responses to the scale items between sexes along the latent trait continuum. These differences were also reflected in the results obtained from the TIF and TCC (Supplementary Figures S3, S4), although the CCC did not show significant differences between groups (Supplementary Figure S2).

Discussions

The growing importance of mental well-being as a global indication of mental healthiness addresses the balance between the negative focuses on mental health problems and illnesses and the positive development of mental wellness. The availability of a well-constructed and validated instrument for the measurement and assessment of the well-defined construct of mental well-being is of paramount importance for the ongoing development of research and clinical practice in the discipline of mental public health.

The SWEMWBS, as one of the few instruments designed specifically for the assessment of mental well-being, has been properly translated into different languages. With the growing development in the field of mental well-being research, there is also an increasing demand for the scale to be used as the outcome measure of research and practice globally. Hence, more substantiating evidence on the reliability and validity of different language versions of the scale will certainly enhance the confidence of researchers and practitioners in the adaptation and usage of the scale. In recent years, an emerging trend in the interest of mental well-being, particularly intervention programs in improving the mental well-being of young people, in the East Asia region has been observed (42) The availability of a theoretically-based and well-designed scale with strong substantiating evidence in the psychometric properties for assessing mental well-being in the Chinese and other East Asian languages will further motivate research in this field.

This study is the first to employ both CTT and IRT approaches for validating the Chinese SWEMWBS. The factor loadings and the measurement invariance results of the factor analyses in this study were comparable to those obtained from the studies by Anthony et al. and Melendez-Torres et al. (34, 35). In terms of the model fit statistics, particularly the greater than the expected values of the RMSEA (0.06) in the full model and the models for males and females, a possible explanation is that there would be a few larger residuals involved that might have skewed the calculation resulting in a larger value. This phenomenon is worthy of more investigation, given that all other goodness-of-fit statistics indicated a reasonably good fit of the model to the data. Another possible explanation is that some other models, such as a two-factor model, may provide a better fit to the data. Considering the overall evidence provided by the model statistics, in conjunction with the volume of evidence from the literature, the single model is still the best option. For the measurement invariance, the results of the current study echoed those obtained by the two reports. In this study, the changes in CFI and RMSEA were less than or close to -0.01 across models. These results were similar to that obtained by Anthony et al. (34). These suggested that the model fits did not change significantly from the most unrestricted configural model to more restricted invariance Metric and Scalar models demonstrating the measure invariance between groups.

The IRT GRM provides further evidence for the scale’s validity with large values of discrimination parameters demonstrating the high discriminative ability of all items, particularly item 4. The difficulty parameters at each threshold also showed good performance of all items along the latent trait continuum. The five-category response set was also shown to be appropriate and adequate, as well as performed well based on the results of the CCC. Different peaks were shown clearly in the curves for all items. For the whole test, the functioning of the scale was demonstrated to provide good information in the TIF graph. In comparison to the study by Hanzlová among adolescents in the UK, these results were comparable and consistent (39). In contrast to the few validation studies of SWEMWBS identified in the literature using the IRT approach, further examinations of the scale by GRM with different gender groups were performed. The results indicated slight differences in the discriminative power of the scale between males and females in favor of the male counterpart. Similarly, differences were also found in the difficulty parameters at each threshold with larger values for males on the negative latent trait than females but larger values for females on the positive latent trait than males. This finding suggested that males and females respond differently to the scale, although the extent of differences may not be problematic. This has not been reported in the literature and can be considered unique. These results could be due to the characteristics of the respondents in the sample or a gender difference in the interpretation of the translated wording of the scale. As such, it is worthy of further examination in future studies. In terms of the impact of gender differences on the validity of the scale, given the small extension of these differences, there may not be a very significant bearing. However, there could be an implication to the utility of the scale for assessing the mental well-being of the general adult population. The study results suggested that the scale could be slightly more sensitive in measuring the mental well-being of males than females. Hence, users of the scale should take note of the possible gender differences in the result of the assessment. Using the standardized conversion score suggested by the original authors could probably help in interpreting the test results. Future validity studies could be conducted to further examine the gender differences in the responses to the scale.

Some limitations have been identified in this study. First, is the skewed gender and age distributions of the sample. The sample consisted of nearly 70% of females with 45% of respondents in the age group 18-34 years. These reflected some gender and age imbalance in sampling. This may pose some limitations to the generalizability of the results obtained. Second, although it is not the aim of the current study to implement a full Classic Test Theory validation study, there is a lack of data using other instruments in assessing a similar construct of mental well-being, such as WHO-5 and PHQ-9, in this study. As such, the convergency validity of the scale cannot be fully examined. In future validation studies, a more gender and age-balanced sample should be used to ensure the generalizability of results. Moreover, it is prudent to include other measures of mental well-being for a more comprehensive examination of the validity of the scale.

In conclusion, the psychometric properties of the Chinese version of the SWEMWBS have been further examined using the Classic Test Theory and the Item Response Theory approaches. Results indicated that as a single-factor scale, on both the scale and item levels, the quality of the instrument is good, with high discriminative power and an adequate response set for assessing a full range of the latent trait, namely mental well-being.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Faculty Ethics Research Committee of the Faculty of Medicine, Macau University of Science and Technology (MUST-FMD-200402025001). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The survey used an online questionnaire with information on the study provided on the front page. Participants’ consented by clicking a button indicating willful participation to the survey before filling in the questionnaire.

Author contributions

LL: Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing. ML: Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

The author would like to sincerely thank Ms. Ka Ian U, and Ms. Jingyu Zhang for their dedication and effort in collecting the data.

Conflict of interest

The authors declare that in the research and production of the manuscript, there are no commercial or financial relationships that could be considered a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1528509/full#supplementary-material

References

1. Jahoda M. Current concepts of positive mental health. New York: Basic Books (1958).

Google Scholar

2. WHO. World mental health report: transforming mental health for all. Licence: CC BY-NC-SA 3.0 IGO. Geneva: World Health Organization (2022).

Google Scholar

3. WHO. Promoting mental health: concepts, emerging evidence, practice (Summary Report). Geneva: World Health Organization (2004).

Google Scholar

4. Stewart-Brown S, Samaraweera PC, Taggart F, Kandala NB, Stranges S. Socioeconomic gradients and mental health: implications for public health. Br J Psychiatry. (2015) 206(6):461–5. doi: 10.1192/bjp.bp.114.147280

PubMed Abstract | Crossref Full Text | Google Scholar

5. WHO. Wellbeing measures in primary health care/the depcare project. Copenhagen: WHO Regional Office for Europe (1998).

Google Scholar

6. Lamers SM, Westerhof GJ, Bohlmeijer ET, ten Klooster PM, Keyes CL. Evaluating the psychometric properties of the Mental Health Continuum-Short Form (MHC-SF). J Clin Psychol. (2011) 67:99–110. doi: 10.1002/jclp.20741

PubMed Abstract | Crossref Full Text | Google Scholar

7. Keyes CL. Mental illness and/or mental health? Investigating axioms of the complete state model of health. J Consult Clin Psychol. (2005) 73:539–48. doi: 10.1037/0022-006X.73.3.539

PubMed Abstract | Crossref Full Text | Google Scholar

8. Waterman AS. Two conceptions of happiness: Contrasts of personal expressiveness (eudaimonia) and hedonic enjoyment. J Pers Soc Psych. (1993) 64:678–91. doi: 10.1037/0022-3514.64.4.678

Crossref Full Text | Google Scholar

9. Tennant R, Hiller L, Fishwick R, Platt S, Joseph S, Weich S, et al. The Warwick-Edinburgh Mental Well-being Scale (WEMWBS): development and UK validation. Health Qual Life Outcomes. (2007) 5:63. doi: 10.1186/1477-7525-5-63

PubMed Abstract | Crossref Full Text | Google Scholar

10. Joshanloo M, Weijers D. A two-dimensional conceptual framework for understanding mental well-being. PloS One. (2019) 14:e0214045. doi: 10.1371/journal.pone.0214045

PubMed Abstract | Crossref Full Text | Google Scholar

11. Castellví P, Forero CG, Codony M, Vilagut G, Brugulat P, Medina A, et al. The Spanish version of the Warwick-Edinburgh mental well-being scale (WEMWBS) is valid for use in the general population. Qual Life Res. (2014) 23:857–68. doi: 10.1007/s11136-013-0513-7

PubMed Abstract | Crossref Full Text | Google Scholar

12. Trousselard M, Steiler D, Dutheil F, Claverie D, Canini F, Fenouillet F, et al. Validation of the Warwick-Edinburgh Mental Well-Being Scale (WEMWBS) in French psychiatric and general populations. Psychiatry Res. (2016) 245:282–90. doi: 10.1016/j.psychres.2016.08.050

PubMed Abstract | Crossref Full Text | Google Scholar

13. Oyebode O, Torres-Sahli M, Kapinga D, Bruno-McClung E, Willans R, Shah N, et al. Swahili translation and validation of the Warwick Edinburgh Mental Wellbeing Scale (WEMWBS) in adolescents and adults taking part in the girls’ education challenge fund project in Tanzania. Health Qual Life Outcomes. (2023) 21:43. doi: 10.1186/s12955-023-02119-9

PubMed Abstract | Crossref Full Text | Google Scholar

14. Smith ORF, Alves DE, Knapstad M, Haug E, Aarø LE. Measuring mental well-being in Norway: validation of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS). BMC Psychiatry. (2017) 17:182. doi: 10.1186/s12888-017-1343-x

PubMed Abstract | Crossref Full Text | Google Scholar

15. Mavali S, Mahmoodi H, Sarbakhsh P, Shaghaghi A. Psychometric properties of the warwick-edinburgh mental wellbeing scale (WEMWBS) in the Iranian older adults. Psychol Res Behav Manage. (2020) 13:693–700. doi: 10.2147/PRBM.S256323

PubMed Abstract | Crossref Full Text | Google Scholar

16. Perera BPR, Caldera A, Godamunne P, Stewart-Brown S, Wickremasinghe AR, Jayasuriya R. Measuring mental well-being in Sri Lanka: validation of the Warwick Edinburgh Mental Well-being Scale (WEMWBS) in a Sinhala speaking community. BMC Psychiatry. (2022) 22:569. doi: 10.1186/s12888-022-04211-8

PubMed Abstract | Crossref Full Text | Google Scholar

17. Ng SS, Lo AW, Leung TK, Chan FS, Wong AT, Lam RW, et al. Translation and validation of the Chinese version of the short Warwick-Edinburgh Mental Well-being Scale for patients with mental illness in Hong Kong. East Asian Arch Psychiatry. (2014) 24:3–9.

PubMed Abstract | Google Scholar

18. Haver A, Akerjordet K, Caputi P, Furunes T, Magee C. Measuring mental well-being: A validation of the Short Warwick-Edinburgh Mental Well-Being Scale in Norwegian and Swedish. Scand J Public Health. (2015) 43:721–7. doi: 10.1177/1403494815588862

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ringdal R, Bradley Eilertsen ME, Bjørnsen HN, Espnes GA, Moksnes UK. Validation of two versions of the Warwick-Edinburgh Mental Well-Being Scale among Norwegian adolescents. Scand J Public Health. (2018) 46:718–25. doi: 10.1177/1403494817735391

PubMed Abstract | Crossref Full Text | Google Scholar

20. Koushede V, Lasgaard M, Hinrichsen C, Meilstrup C, Nielsen L, Rayce SB, et al. Measuring mental well-being in Denmark: Validation of the original and short version of the Warwick-Edinburgh mental well-being scale (WEMWBS and SWEMWBS) and cross-cultural comparison across four European settings. Psychiatry Res. (2019) 271:502–9. doi: 10.1016/j.psychres.2018.12.003

PubMed Abstract | Crossref Full Text | Google Scholar

21. Fung SF. Psychometric evaluation of the warwick-edinburgh mental well-being scale (WEMWBS) with chinese university students. Health Qual Life Outcomes. (2019) 17:46. doi: 10.1186/s12955-019-1113-1

PubMed Abstract | Crossref Full Text | Google Scholar

22. Sun Y, Luk TT, Wang MP, Shen C, Ho SY, Viswanath K, et al. The reliability and validity of the Chinese Short Warwick-Edinburgh Mental Well-being Scale in the general population of Hong Kong. Qual Life Res. (2019) 28:2813–20. doi: 10.1007/s11136-019-02218-5

PubMed Abstract | Crossref Full Text | Google Scholar

23. Hong Y, Jiang X, Zhang T, Luo N, Yang Z. Examining the relationship between the Short Warwick-Edinburgh Mental Well-being Scale (SWEMWBS) and EQ-5D-5L and comparing their psychometric properties. Health Qual Life Outcomes. (2023) 21:25. doi: 10.1186/s12955-023-02108-y

PubMed Abstract | Crossref Full Text | Google Scholar

24. Scior K, Patel M, Goldsmith-Sumner A, Hayden N, Lee JY, Lunsky Y, et al. Development and initial psychometric properties of the Warwick-Edinburgh Mental Wellbeing Scale-Intellectual Disability version. J Intellect Disabil Res. (2023) 67:893–900. doi: 10.1111/jir.13039

PubMed Abstract | Crossref Full Text | Google Scholar

25. Sarasjärvi KK, Elovainio M, Appelqvist-Schmidlechner K, Solin P, Tamminen N, Therman S. Exploring the structure and psychometric properties of the Warwick-Edinburgh Mental Well-Being Scale (WEMWBS) in a representative adult population sample. Psychiatry Res. (2023) 328:115465. doi: 10.1016/j.psychres.2023.115465

PubMed Abstract | Crossref Full Text | Google Scholar

26. Zayed K, Omara E, Al-Shamli A, Al-Rawahi N, Haramlah AA, Al-Attiyah AA, et al. A validation study of the Arabic version of the Warwick-Edinburgh Mental Well-being scale among undergraduate students. BMC Psychol. (2023) 11:399. doi: 10.1186/s40359-023-01443-5

PubMed Abstract | Crossref Full Text | Google Scholar

27. Hauch D, Fjorback LO, Juul L. Psychometric properties of the Short Warwick-Edinburgh Mental Well-Being Scale in a sample of Danish schoolchildren. Scand J Public Health. (2023) 51:1214–21. doi: 10.1177/14034948221110002

PubMed Abstract | Crossref Full Text | Google Scholar

28. Hanzlová R, Kudrnáč A. Validation and psychometric evaluation of the Short Warwick-Edinburgh Mental Well-Being Scale (SWEMWBS) among Czech adolescents using Item Response Theory. Health Qual Life Outcomes. (2024) 2 2:66. doi: 10.1186/s12955-024-02280-9

PubMed Abstract | Crossref Full Text | Google Scholar

29. Guo N, Luk TT, Ho SY, Lee JJ, Shen C, Oliffe J, et al. Problematic smartphone use and mental health in chinese adults: A population-based study. Int J Environ Res Public Health. (2020) 17:844. doi: 10.3390/ijerph17030844

PubMed Abstract | Crossref Full Text | Google Scholar

30. Lee EK, Wong B, Chan PHS, Zhang DD, Sun W, Chan DC, et al. Effectiveness of a mindfulness intervention for older adults to improve emotional well-being and cognitive function in a Chinese population: A randomized waitlist-controlled trial. Int J Geriatr Psychiatry. (2022) 37(1). doi: 10.1002/gps.5616

PubMed Abstract | Crossref Full Text | Google Scholar

31. Ng SSW, Leung TKS, Ng PPK, Ng RKH, Wong ATY. Activity participation and perceived health status in patients with severe mental illness: a prospective study. East Asian Arch Psychiatry. (2020) 30:95–100. doi: 10.12809/eaap1970

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wong RHK, Wing Cheong Au R, Wai Lan C, Pi Fan C, Chiu FPF, Chu MMK, et al. A randomized controlled trial on the effectiveness of the abridged illness management and recovery program for people with schizophrenia spectrum disorders. Hong Kong J Occup Ther. (2023) 36:118–27. doi: 10.1177/15691861231204904

PubMed Abstract | Crossref Full Text | Google Scholar

33. Ng Fat L, Scholes S, Boniface S, Mindell J, Stewart-Brown S. Evaluating and establishing national norms for mental wellbeing using the short Warwick-Edinburgh Mental Well-being Scale (SWEMWBS): findings from the Health Survey for England. Qual Life Res. (2017) 26:1129–44. doi: 10.1007/s11136-016-1454-8

PubMed Abstract | Crossref Full Text | Google Scholar

34. Anthony R, Moore G, Page N, Hewitt G, Murphy S, Melendez-Torres GJ. Measurement invariance of the short Warwick-Edinburgh Mental Wellbeing Scale and latent mean differences (SWEMWBS) in young people by current care status. Qual Life Res. (2022) 31:205–13. doi: 10.1007/s11136-021-02896-0

PubMed Abstract | Crossref Full Text | Google Scholar

35. Melendez-Torres GJ, Hewitt G, Hallingberg B, Anthony R, Collishaw S, Hall J, et al. Measurement invariance properties and external construct validity of the short Warwick-Edinburgh mental wellbeing scale in a large national sample of secondary school students in Wales. Health Qual Life Outcomes. (2019) 17:139. doi: 10.1186/s12955-019-1204-z

PubMed Abstract | Crossref Full Text | Google Scholar

36. Byrne BM. Structural equation modeling with EQS and EQS/Windows. Thousand Oaks, CA: Sage Publications (1994).

Google Scholar

37. Stewart-Brown S, Tennant A, Tennant R, Platt S, Parkinson J, Weich S. Internal construct validity of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS): a Rasch analysis using data from the Scottish Health Education Population Survey. Health Qual Life Outcomes. (2009) 7:15. doi: 10.1186/1477-7525-7-15

PubMed Abstract | Crossref Full Text | Google Scholar

38. Bartram DJ, Sinclair JM, Baldwin DS. Further validation of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS) in the UK veterinary profession: Rasch analysis. Qual Life Res. (2013) 22:379–91. doi: 10.1007/s11136-012-0144-4

PubMed Abstract | Crossref Full Text | Google Scholar

39. Hanzlová R, Lynn P. Item response theory-based psychometric analysis of the Short Warwick-Edinburgh Mental Well-Being Scale (SWEMWBS) among adolescents in the UK. Health Qual Life Outcomes. (2023) 21:108. doi: 10.1186/s12955-023-02192-0

PubMed Abstract | Crossref Full Text | Google Scholar

40. Cheung GW, Rensvold RB. Evaluating goodness-of-fit indexes for testing measurement invariance. Struct Eq Model. (2002) 9:233–55. doi: 10.1207/S15328007SEM0902_5

Crossref Full Text | Google Scholar

41. Bichi AA, Talib R. Item response theory: an introduction to latent trait models to test and item development. Int J Eval Res Edu. (2018) 2:142–51. doi: 10.11591/ijere.v7.i2.pp142-151

Crossref Full Text | Google Scholar

42. Lam LT, Lam MK. Child and adolescent mental well-being intervention programme: A systematic review of randomised controlled trials. Front Psychiatry. (2023) 14:1106816. doi: 10.3389/fpsyt.2023.1106816

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: SWEMWBS, mental well-being, classic test theory, IRT, Chinese

Citation: Lam LT and Lam MK (2025) Further validation of the Chinese short Warwick Edinburgh mental wellbeing scale in the adult population of Macau: an application of classic test theory and item response theory. Front. Psychiatry 16:1528509. doi: 10.3389/fpsyt.2025.1528509

Received: 14 November 2024; Accepted: 06 February 2025;
Published: 26 February 2025.

Edited by:

Sai-fu Fung, City University of Hong Kong, Hong Kong SAR, China

Reviewed by:

Kashef N Zayed, Sultan Qaboos University, Oman
Chen Jiang, Ningbo Women and Children’s Hospital, China

Copyright © 2025 Lam and Lam. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lawrence T. Lam, dG1sYW1AbXVzdC5lZHUubW8=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

95% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more