- 1School of Psychology, Korea University, Seoul, South Korea
- 2Department of Psychology, Seoul National University, Seoul, South Korea
- 3Department of Psychology, Chungnam National University, Daejeon, South Korea
- 4Dr. Shin’s Neuropsychiatric Clinic, Seoul, South Korea
Background: An association between gaming disorder (GD) and the symptoms of common mental disorders is unraveled yet. In this preregistered study, we quantitatively synthesized reliability, convergent and discriminant validity of GD scales to examine association between GD and other constructs.
Methods: Five representative GD instruments (GAS-7, AICA, IGDT-10, Lemmens IGD-9, and IGDS9-SF) were chosen based on recommendations by the previous systematic review study to conduct correlation meta-analyses and reliability generalization. A systematic literature search was conducted through Pubmed, Proquest, Embase, and Google Scholar to identify studies that reported information on either reliability or correlation with related variables. 2,124 studies were full-text assessed as of October 2020, and 184 were quantitatively synthesized. Conventional Hedges two-level meta-analytic method was utilized.
Results: The result of reliability generalization reported a mean coefficient alpha of 0.86 (95% CI = 0.85–0.87) and a mean test-retest estimate of 0.86 (95% CI = 0.81–0.89). Estimated effect sizes of correlation between GD and the variables were as follows: 0.33 with depression (k = 45; number of effect sizes), 0.29 with anxiety (k = 37), 0.30 with aggression (k = 19), –0.22 with quality of life (k = 18), 0.29 with loneliness (k = 18), 0.56 with internet addiction (k = 20), and 0.40 with game playtime (k = 53), respectively. The result of moderator analyses, funnel and forest plots, and publication bias analyses were also presented.
Discussion and Conclusion: All five GD instruments have good internal consistency and test-retest reliability. Relatively few studies reported the test-retest reliability. The result of correlation meta-analysis revealed that GD scores were only moderately associated with game playtime. Common psychological problems such as depression and anxiety were found to have a slightly smaller association with GD than the gaming behavior. GD scores were strongly correlated with internet addiction. Further studies should adopt a rigorous methodological procedure to unravel the bidirectional relationship between GD and other psychopathologies.
Limitations: The current study did not include gray literature. The representativeness of the five tools included in the current study could be questioned. High heterogeneity is another limitation of the study.
Systematic Review Registration: [https://www.crd.york.ac.uk/PROSPERO/], identifier [CRD42020219781].
Introduction
Since games are one of the most popular leisure activities worldwide, they are now available almost everywhere via computers, mobile phones, and tablets. Generally, gamers enjoy gaming as a leisure activity, and the games seem to affect them positively (Jones et al., 2014). Increasing concerns, however, have been raised about excessive gaming behaviors. American Psychiatric Association (2013) has already introduced the provisional diagnostic criteria for internet gaming disorder in Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5). The World Health Organization (WHO) recently adopted gaming disorder (GD) as a diagnosis in the eleventh edition of the International Classification of Diseases (World Health Organization [WHO], 2018). Despite the few discrepancies in the diagnostic criteria for GD in ICD-11 and DSM-5, the common symptoms of GD include continuation of gaming and impaired control over gaming behavior, which result in functional impairments (Jo et al., 2019).
The official listing of GD diagnosis is debatable (Aarseth et al., 2017; Griffiths et al., 2017; Király and Demetrovics, 2017; Kuss et al., 2017; Van Den Brink, 2017; Rumpf et al., 2018; Van Rooij et al., 2018). Several high-quality studies including epidemiological studies (Lemmens et al., 2015; Pontes et al., 2016; Wittek et al., 2016; Han et al., 2018), clinical outcome studies (see King et al., 2017), neuroimaging studies (Fauth-Bühler and Mann, 2017; Han et al., 2017; Liu et al., 2018), and experimental studies (Sariyska et al., 2017; Kräplin et al., 2021) have been published in the recent years, showing improvements with regard to the quality of studies and methodological issues raised by researchers (Petry and O’Brien, 2013; Van Rooij et al., 2018). Most studies, nonetheless, have relied on self-report assessment tools rather than relying on structured clinical interviews, which is partially due to the inconsistency in definition and the different diagnostic criteria (Jeong et al., 2018). Whether the assessment tools are reliable and whether they could validly measure GD are important questions that should be answered.
Another unresolved but important issue is the association between GD and the symptoms of common mental disorders (see Billieux et al., 2017; Van Rooij et al., 2018). Pontes and Griffiths (2019) commented the importance of key risk factors related to comorbidities. Literature has reported mixed results in the association between gaming disorder and psychiatric disorders. Associations between gaming disorder and the common symptoms of mental disorders were found to be considerably weaker than between symptoms of other disorders at least in young age group (Wichstrøm et al., 2018). In contrast, some studies have reported that the underlying mental illness can be a strong predictor of problematic gaming (Kardefelt-Winther, 2014; Billieux et al., 2015), perhaps even a cause (Van Rooij et al., 2018). Authors also have different interpretations for the association. Some authors consider strong association between GD and mental disorders a natural result because clinicians seldomly assess GD without considering comorbidities (Wichstrøm et al., 2019). On the other hand, strong association is also a basis for supporting the idea that GD may be a consequence of other mental disorders (Van Rooij et al., 2018).
In the current study, we focused on construct validity among several aspects of validity since convergent and discriminant validity provide information on the association between GD and other constructs. Reliability and construct validity provide information on what GD assessment tools consistently measure. Poor construct validity of the measure limits the ability of the tools to achieve its intended purpose of measurement because it remains unclear whether the GD instruments represent the construct of the GD or other psychopathological features. If GD instruments have enough construct validity, the association between GD and gaming behavior would be expected to have stronger association compared to the associations between GD and other psychopathological variables.
To our knowledge, no study has systematically examined association between GD scales and symptoms of common psychiatric comorbidities and compared it to the association between GD and gaming behavior. The recent studies on psychological science adopted the reliability generalization and the correlation meta-analytic technique to perform a meta-analysis of a sample of studies with the purpose of estimating the population reliability and population correlation value of the respective studies (Rodriguez and Maeda, 2006; López-Pina et al., 2015; Miller et al., 2018). In the current study, we quantitatively synthesized the bivariate Pearson’s correlation coefficients between GD assessment tools and common psychological problem (e.g., depression, anxiety, aggression) scales, which refers to the statistic of construct validity, to examine the association between GD and psychological variables. We also conducted reliability generalization to examine the consistency of the scales.
Recently, King et al. (2020) reviewed 32 GD assessment tools in their qualitative review paper, recommending five GD instruments with relatively great evidential support. The five tools are 7-item Game Addiction Scale (GAS-7; Lemmens et al., 2009), 9-item Internet Gaming Disorder Scale-Short Form (IGDS9-SF; Pontes and Griffiths, 2015), 10-item Internet Gaming Disorder Test (IGDT-10; Király et al., 2017), Assessment of Internet and Computer Addiction Scale-Gaming (AICA; Müller et al., 2014), and Lemmens Internet Gaming Disorder Scale-9 (Lemmens IGD-9; Lemmens et al., 2015). Among excluded instruments, Young Internet Addiction scale (Young, 1998) is the most frequently utilized scale, and Young Diagnostic Questionnaire (Young, 1998) is the most cited instrument (King et al., 2020). However, they are relatively old scales and are more related to internet addiction rather than GD. In general, YIAT, GAS-7, and IGDS9-SF are frequently used in the field, and IGDT-10 is an instrument that is evenly used in both the West and the East (King et al., 2020). King et al. (2020) recommended the five tools in consideration of the following factors: DSM -5 and ICD-11 coverage, existence of longitudinal studies, adaptation of structured interview, validation of reliability and cut-off score, dimensionality, criterion validity, test refinement and impairment. Divergent validity, however, was not examined by King et al. (2020). Given the importance of the association between GD and other mental disorders, synthesizing and comparing the magnitude of convergent and discriminant validity can significantly contribute to the understanding of GD.
The GD studies often operationalized the convergent validity as there is a bivariate association between a gaming behavior (i.e., hours per week spent gaming) and a score on a GD tool (King et al., 2020). The given association between a score on a GD tool and a gaming behavior represents convergent validity. The associations between the GD tools and other variables can be operationalized as discriminant validity. In a recent article of theirs, Rönkkö and Cho (2020) provided a general definition of discriminant validity. A discriminant validity means that the two measures intended to measure distinct constructs have discriminant validity if the absolute value of the correlation between the measures after correcting for measurement error is low enough for the measures to be regarded as measuring distinct constructs (Rönkkö and Cho, 2020). If the associations between GD and other psychological variables are too strong, the GD tools may reveal the weaknesses in discriminant validity and present the diagnostic needs from the other psychiatric disorders. If the associations are too small, it might not properly reflect the pain and burden of problematic gaming. By quantitatively synthesizing the correlation coefficients to estimate convergent and discriminant validity coefficient, we can quantify and compare the magnitude of each association between GD and other variables.
This study’s objectives are to (1) synthesize the reliability coefficients; (2) examine the convergent and discriminant validity of the GD tools, further investigating the overall association between the GD tools and other psychological/behavioral variables; and (3) investigate how the study characteristics and potential moderator variables affect the reliability and validity estimates, wherein the potential influencing variables include the specific GD instrument used in the study, the type of the sample, study location, and gender ratio of the study participants. Demographic variables such as age, gender, and study location are variables often examined for measurement invariance in this field (see King et al., 2020), and significant moderators of the prevalence rate of GD (see Andreetta et al., 2020; Stevens et al., 2021). Since five scales which cover different domain of diagnostic criteria were included, we did not perform quantitative synthesis on factor structure in order to prevent confusion. Since there is no gold standard for GD diagnosis, and only few studies adopted rigorous clinical interview, we were unable to conduct a meta-analysis for predictive validity of GD assessment tools.
Methods
Search Strategy
The current study was conducted based on the PRISMA statement (Moher et al., 2009; Page et al., 2021) and recommendations received for the correlational meta-analyses (Quintana, 2015). PRISMA checklist (Page et al., 2021) is included in Supplementary Material 1. The protocol for the current study has been preregistered on PROSPERO (CRD42020219781). While full electronic search strategy for databases using search terms is a standard procedure for the systematic review, the search strategy in the current study was modified because too many irrelevant and unqualified studies were searched with broad search terms, whereas too many missing studies were searched when narrowing the scope. The first database search for all the published studies with GD assessment tools was executed in PubMed, Proquest, and Embase on August 18, 2020, resulting in 1,343 potentially eligible articles. However, we found too many relevant studies were missing. Great heterogeneity in articles of diagnostic criteria (e.g., DSM-5 and ICD-11 from WHO), type of gaming (e.g., mobile, computer, video-only, online, smartphone gaming), name of the disorder and key-terms (e.g., game addiction, internet gaming, online gaming, video gaming, problematic, overuse, excessive) were factors that made standard search procedure ineffective.
Therefore, we modified our search strategy by selecting a few GD scales to be included in advance. Since King et al. (2020) nicely reviewed 32 GD assessment tools in qualitative way, the five recommended tools with great evidential support were chosen to extract and synthesize the correlation data. The second database search included all the empirical studies that had employed at least one of the 5 GD assessment tools. The search was carried out via two different procedures: (1) A computer-based search of Pubmed, Proquest, and Embase using broad keywords to ensure that all studies adopted one of the five scales are included (e.g., IGDS AND (SF OR short OR 9) not to omit any empirical studies that adopted IGDS9-SF), and (2) a procedural collection of the all Google Scholar citation records for the five tools (as of October 2020). The duplicates of the identified articles were first eliminated by using the Endnote software1 version 20 followed by double-checking from the authors. Search strategy of the current study is provided in the Supplementary Material 2.
Inclusion and Exclusion Criteria
Articles were included if they (a) were peer-reviewed journal articles, (b) used one of the five tools recommended by the current systematic review paper, (c) reported the reliability coefficient or bivariate correlation coefficient via the scales of depression, anxiety, aggression, loneliness, quality of life, internet addiction and game playtime, and (d) written in English. Articles were excluded if they (a) did not include relevant information for GD, (b) non-empirical studies such as meta-analyses and systematic review papers, or (c) did not include the reliability or validity coefficient. Due to difficulties in searching, data extracting, and assessing the study quality, we decided to include the articles which were published with the peer-reviewed process.
Coding Procedure
All the preselected variables were coded. The coded variables included demographic information of the study, name of the utilized assessment tool, psychometric information, and bivariate Pearson’s correlation coefficient. The potentially eligible articles were systematically coded by three co-authors, namely, SY, YY, and ER. For the longitudinal studies that reported repetitive information using the same sample, we coded the information reported during the first wave. This is because it often contains a larger sample that that during the second or third wave. For multisite cross-sectional studies that included more than one effect size, information for the rest of effect sizes were coded separately. For studies that used various scales to measure only one psychological variable, the effect sizes were integrated into one effect size by calculating the average.
The candidate studies for data synthesis were evenly split between three raters SY, YY, and ER, and then cross-checked by the corresponding author independently. Overall, the level of agreement on the coding was 92.7%, and all the coded information was reached an agreement. A copy of the coding sheet is available in the Supplementary Material 3.
Selection Process
After the elimination of duplicates using Endnote software, 605 articles were identified via the database keyword search and 1,519 articles were identified via the Google Scholar citation records. Total of 2,124 studies were full-text screened to identify the potentially eligible studies based on the inclusion and exclusion criteria. We found and removed duplicates within and between each database. There were 135 overlapping studies within the Google scholar citation records, and 37 overlapping studies between electronic database search records and Google scholar citation records. As a result, 249 potentially eligible studies were identified. E-mails requesting additional data were sent to the corresponding authors of 49 studies. As of February 2021, 17 authors (34.7%) had responded to the request, and the information provided was finally included in the quantitative synthesis. As a result, 184 of the 249 studies were quantitatively synthesized and 65 were excluded. Among 65 excluded studies, 33 did not include any information on the variables of our interest. The rest 32 studies were excluded due to no reply to the inquiry. Figure 1 presents a flowchart of the database search, screening, and data coding process. The list of the included studies is provided in Supplementary Material 4.
Meta-Analytic Method and Statistical Analysis for Reliability Generalization
Reliability generalization is a powerful tool to characterize the mean measurement error variance across studies, and also the variabilities in score quality and the study features (Vacha-Haase, 1998). We utilized this technique to estimate the overall level of reliability of the included studies and to find differences in the level of reliability among the five instruments. Separate meta-analyses were conducted for reliability generalization and validity generalization. The current study utilized a meta-analytic technique to quantitatively synthesize the findings of various studies and examine the overall reliability of the GD assessment tools that are frequently used. Cronbach’s alpha coefficients (Cronbach, 1951) were frequently reported, allowing us to synthesize the findings. Information on test-retest reliability, however, was less frequently reported. To conduct reliability generalization of the internal consistency, we extracted the Cronbach’s alpha coefficient for just the total score of the five GD assessment tools drawn from eligible studies. Cronbach’s alpha coefficient offers information on the internal consistency of the test scale (Tavakol and Dennick, 2011). With regard to the calculation of the mean coefficient alpha, Bonett’s (2002) transformation was applied to normalize the distribution and stabilize their variance: Li = Ln (1–αi), where Ln is the natural logarithm. After synthesizing reliability with transformed values, we converted the Bonett-transformed metric back to the original metric of Cronbach’s alpha coefficient to facilitate interpretation. The test-retest reliability coefficients reported from the included studies were descriptively presented in the result. We adopted the same correlation meta-analysis technique for the quantitative synthesis of test-retest reliability coefficients since test-retest reliability is often measured with a correlation coefficient.
Meta-Analytic Method and Statistical Analysis for Validity Generalization
We coded all bivariate correlation coefficients between GD and psychological variables if the number of effect sizes is sufficient enough to conduct quantitative synthesis (j > 10). We considered the correlation between a GD scale score and the game playtime as a convergent validity variable. Depression, anxiety, impulsivity, loneliness, aggression, gambling addiction, internet addiction, alcohol addiction, and quality of life (QOL) were considered potential discriminant validity variables. Among ten variables, gambling addiction (j = 5, number of effect sizes), alcohol addiction (j = 2), and impulsivity (j = 6) were excluded due to the insufficient number of effect sizes for quantitative synthesis. As a result, we performed quantitative synthesis of correlation between GD and seven psychological variables: depression, anxiety, quality of life (QOL), aggression, loneliness, internet addiction, and game playtime.
To estimate the overall mean effect size and correlation coefficient, the current literature has dominantly adopted two approaches (Field and Gillett, 2010; Brannick et al., 2019). These two approaches were proposed by Schmidt and Hunter (1998) and Hedges (Hedges, 1992; Hunter and Schmidt, 2004; Borenstein et al., 2011). However, determining which approach is more appropriate for the correlation coefficient’s meta-analysis has been controversial (Field, 2005; Field and Gillett, 2010). In addition to the two commonly adopted techniques, Brannick et al. (2019) also introduced a novel estimator, providing better coverage and slightly better credibility values than the commonly used approaches. These meta-analytic methods are based on the random-effects model. A random-effects model allows the true effect to differ in each study, whereas a fixed effect model assumes all the studies share a common effect size (Borenstein et al., 2010). As the studies included in this meta-analysis were conducted in different regions and have differ samples, a random effects model was used to derive the effect size and confidence level.
For correcting measurement unreliability, Hunter-Schimidt estimator (Hunter and Schmidt, 2004). Morris estimator (Brannick et al., 2019) apply the individual correction technique to estimate the mean effect size. Hedges method (Borenstein et al., 2011), however, does not adopt the individual correction technique to estimate the effect size. As the current study also aims to conduct reliability generalization to examine the reliability of the GD assessment tools, we utilized the Hedges method. The current study adopted a conventional two-level meta-analytic method instead of a three-level model or robust variance estimation technique to estimate the pooled effect size of the correlation. Although a three-level model and robust variance estimation technique have several advantages over a conventional two-level meta-analytic model (Hedges et al., 2010; Assink and Wibbelink, 2016; Harrer et al., 2021), scarce information on the variance of effect size within individual studies made it difficult to apply a three-level model or robust variance estimation method. We therefore conducted the conventional two-level meta-analysis in the current study.
Heterogeneity and Moderator Analyses
As the current study synthesized the findings of studies that used five different assessment tools, a high heterogeneity was expected. To examine the heterogeneity of the quantitative synthesis, the current study reported Tau (T), Tau-squared (T2), and I2 as the measures of heterogeneity between the studies. Tau and Tau-squared are reported in the same metric as the effect size, providing information about the dispersion of true effects on the absolute scale (Borenstein et al., 2017). A guide to interpret the I2 statistic (Borenstein et al., 2017) is as follows: small heterogeneity (I2 25%), moderate heterogeneity (I2 50%), and considerable heterogeneity (I2 = 75%).
Categorical moderator analyses were conducted to identify the potential impacts of reliability and validity generalizations. One study characteristic moderator, (a) the specific GD instrument used in the study (categorized into “IGDS9-SF,” “GAS-7,” “Lemmens IGD-9,” “AICA,” and “IGDT-10”), was considered the potential impact for reliability generalization. Three study characteristics were considered as the potential categorical moderators for validity generalization, namely, (a) the specific GD instrument used in the study, (b) the type of the sample (categorized into “adolescents,” “adults,” and “both”), and (c) the study location (categorized into six continents). Categorical moderator analyses were conducted when each of the subgroups had at least 4 studies. Fu et al. (2011) suggested that each subgroup should have at least four studies for a categorical moderator analysis. Some subgroups with an insufficient number of studies (less than four studies) were excluded from the moderator analysis. To investigate whether the continuous moderator (d) gender ratio affects effect sizes, we performed a meta-regression with the ratio of male participants.
Statistical Software
The statistical analysis was conducted in R software (version 4.0.3) using metafor (Viechtbauer, 2010), meta (Schwarzer, 2007), and dmetar packages (Harrer et al., 2019). The packages provide various functions to facilitate study synthesis. These include moderator analysis, meta-regression analysis, Egger’s regression test (Egger et al., 1997), sensitivity meta-analysis for publication bias, and various types of meta-analytical plotting.
Publication Bias
Rothstein et al. (2005) suggested that publication bias, also known as file-drawer problem, could occur since studies without statistically significant results are less likely to be published. The current study examined the risk of publication bias by drawing a funnel plot and conducting Egger’s test (Egger et al., 1997). Egger’s regression test quantifies the funnel plot asymmetry and performs a statistical test. If the p-value of Egger’s test is significant, the significant asymmetry in the Funnel plot caused by the publication bias or “small study effects” is indicated (Sterne et al., 2001). Cumulative meta-analysis and sensitivity analysis were additionally conducted when Egger’s test indicated the presence of publication bias.
Results
Description of Included Studies
The current study included 184 articles that reported the results from 205 independent samples with 285,752 participants. The estimated mean age of the study samples based on the studies’ reported statistic was 22.12, and 60.7% of the participants were male. Of the studies included up to December 2020, 159 studies (86.4%) have been published since 2016 and 102 studies (55.4%) since 2019. While 94 studies were conducted in Europe, 61 studies were conducted in Asia. Regarding the targeted age group, 63 studies targeted adult samples, 56 studies targeted adolescent samples, and the remaining 65 included both adult and adolescent samples. Of the 184 studies, 49 conducted factor analysis and reported related statistics. While most of the studies (k = 42) conducted confirmatory factor analysis, two studies conducted exploratory factor analysis and five studies conducted both. IGDS9-SF was found to be the most frequently utilized tool (k = 81, 44.0%). Key characteristics of the included studies are reported in Table 1.
Reliability Generalization
Result of Reliability Generalization
Cronbach’s alpha coefficient of 193 effect sizes (from 172 studies) were quantitatively synthesized for the respective reliability generalization. The number of studies reporting the Cronbach’s alpha coefficients of GD assessment tools were as follows: 90 effect sizes from the 76 studies for IGDS9-SF, 58 effect sizes from the 53 studies for GAS-7, 20 effect sizes from the 18 studies for Lemmens IGD-9, 16 effect sizes from the 16 studies for IGDT-10 and, 9 effect sizes from the 9 studies for AICA. All the five assessment tools demonstrated an appropriate level of reliability. The estimated average reliability coefficient obtained from Bonett’s transformation was 1.97 (95% CI = 1.90–2.04). Then, to facilitate the interpretation, Bonett’s transformed reliability coefficient was transformed back into Cronbach’s alpha coefficient. The result of RG reported a mean coefficient alpha of 0.86 (95% CI = 0.85–0.87). The result of RG for each of the five GD Assessment Tools is summarized in Table 2. The forest plot of RG is included in the (Supplementary Figure 1).
A total of 8 studies reported test-retest reliability ranging from 0.78 to 0.94. The number of studies reporting the test-retest reliability of the GD assessment tools are as follows: 4 studies for IGDS9-SF (0.78–0.94), 3 studies for GAS-7 (0.80–0.83), 1 study for Lemmens IGD-9 (0.83), and none for IGDT-10 and AICA. The estimated pooled coefficient of test-rest reliability was 0.86 (95% CI = 0.81–0.89).
Heterogeneity and Moderator Analysis
The results of the heterogeneity test for reliability were significant for all the included studies (τ = 0.483, τ2 = 0.233, I2 = 99.3%). To assess the effect of the specific GD instrument used in the study on heterogeneity, a categorical moderator analysis on moderator (a) was conducted. Reliability was revealed to be significantly heterogeneous depending on the measure verified via an omnibus test of hypothesis [QM (4) = 57.56, p < 0.001]. Since IGDS9-SF showed the highest Bonett-transformed coefficient alpha, ANOVA was conducted between the measures. All ANOVA comparisons were conducted to examine whether significant difference exists between the magnitude of each coefficient. The results show that the Bonett-transformed coefficient alpha of IGDS9-SF was significantly higher than the coefficients of GAS-7, Lemmens IGD-9, and IGDT-10 (all p < 0.001) but was not higher than the coefficient of AICA (p = 0.06). The ANOVA result between AICA and Lemmens IGD-9 was also statistically significant (p < 0.05).
Publication Bias
Publication bias was assessed using funnel plots and Egger’s regression test. The result of Egger’s regression test did not indicate the presence of publication bias for IGDS9-SF (z = 1.37, p = 0.17), Lemmens IGD-9 (z = –0.76, p = 0.45), IGDT-10 (z = –0.76, p = 0.45), and AICA (z = 0.03, p = 0.97). Egger’s test of GAS-7, however, indicated the presence of publication bias (z = –2.02, p = 0.04). Funnel plots are included in the (Supplementary Figure 2).
Association and Validity Generalization
Results of Validity Generalization
A total of 210 effect sizes were extracted and synthesized for validity generalization from the 115 studies analyzed. The number of studies reporting the correlation coefficients between GD assessment tools and psychological or behavioral measurement are as follows: 45 effect sizes from the 44 studies for depression, 37 effect sizes from the 36 studies for anxiety, 19 effect sizes from the 17 studies for aggression, 18 effect sizes from the 17 studies for quality of life and loneliness, 20 effect sizes from the 18 studies for internet addiction, and 53 effect sizes from the 51 studies for game playtime. DASS-21(Depression Anxiety Stress Scales), developed by Antony et al. (1998), is the most frequently utilized psychological scale for depression (k = 8) and anxiety (k = 8). The Satisfaction with Life Scale (SWLS) for quality of life (k = 13), Buss-Perry Aggression Questionnaire (BPAQ) for aggression (k = 8), UCLA Loneliness Scale for loneliness (k = 16) and Young’s Internet Addiction Test (k = 10) for internet addiction were also frequently utilized (Russell et al., 1980; Diener et al., 1985; Buss and Perry, 1992; Young, 1998).
The results of the quantitative synthesis for the association between GD and other variables are shown in Table 3. The overall estimated mean effect sizes of the psychological variables for GD are as follows: Depression (r = 0.33), anxiety (r = 0.29), aggression (r = 0.30), QOL (r = –0.22), and loneliness (r = 0.29). The estimated effect sizes of internet addiction and game playtime are r = 0.56 and r = 0.40. The forest plots displaying the population estimate and the effect sizes of individual studies for each of the variables are presented in Figures 2–4.
Figure 2. Forest plots of correlations and 95% confidence interval (CI) for random effects meta-analysis model for (A) depression, and (B) anxiety.
Figure 3. Forest plots of correlations and 95% confidence interval (CI) for random effects meta-analysis model for (A) aggression, (B) quality of life, (C) loneliness, and (D) internet addiction.
Figure 4. Forest plot of correlations and 95% confidence interval (CI) for random effects meta- analysis model for game playtime.
Heterogeneity and Moderator Analyses
The results of the quantitative synthesis indicated high levels of heterogeneity for all the variables. The heterogeneity estimates are presented in Table 3. Categorical moderator analyses and meta regression analyses using moderators were conducted to identify the potential sources of heterogeneity. Moderator (a), the specific GD instrument used in the study, (categorized into “IGDS9-SF,” “GAS-7,” “Lemmens IGD-9,” “AICA,” and “IGDT-10”), moderator (b), the type of the sample (categorized into “adolescents,” “adults,” and “both”), and (c) the study location (categorized into six continents) were used as the moderators if each of the subgroups had sufficient number of studies (Fu et al., 2011). Moderator (a) was a significant moderator for anxiety and GD (p = 0.02), and moderator (c) was a significant moderator for aggression and GD (p < 0.01). Moderator (d), gender ratio of the participants of each study, was a significant moderator only for game playtime (p = 0.04), indicating that the studies having more male participants reported smaller correlation coefficients between GD and game playtime. The results of the categorical and continuous moderator analysis of validity generalization are presented in the (Supplementary Tables 2, 3).
Publication Bias
Publication bias for validity generalization was assessed by using funnel plots and Egger’s regression test. The funnel plots for all the variables have been visualized in Supplementary Figure 3. Since visual inspection can be subjective, Egger’s regression tests for the detection of funnel plot asymmetry were performed (Sterne et al., 2000). The results of the regression tests for game play time were statistically significant (t = 3.16, p < 0.01), suggesting the presence of evidence for publication bias. Cumulative meta-analysis and sensitivity analysis were further conducted to investigate the publication bias of studies reporting the correlation between GD and game playtime. The results of the cumulative meta-analysis and sensitivity analysis revealed that the studies conducted by, and Brunborg et al. (2014) and Bányai et al. (2019) had influenced the overall effect size estimate as two studies reported exceptionally small and large effect sizes. Omitting study by Brunborg et al. (2014) decreased the overall effect size estimate between GD and game playtime to r = 0.39 while omitting study by Bányai et al. (2019) increased the overall effect size estimate to r = 0.41. The result of sensitivity analysis for GD and game playtime is provided in Figure 5. The results of Egger’s regression test for the other variables were insignificant (for depression t = 0.98, p = 0.33; for anxiety t = 1.02, p = 0.31; for aggression t = –0.23, p = 0.82; for QOL t = –0.37, p = 0.72; for loneliness t = 0.33, p = 0.75; for internet addiction t = 0.49, p = 0.63).
Discussion
Reliability
The current study aimed to provide information on what GD scales measure, and how consistent the measure is. The current study conducted meta-analyses by quantitatively synthesizing the Cronbach’s alpha reliability coefficients and bivariate Pearson’s correlations. The result of the quantitative synthesis of alpha coefficients, reliability generalization, showed an estimated alpha coefficient of 0.86. A high value of alpha coefficient is usually desirable (Cronbach, 1951), but an alpha coefficient above 0.9 may indicate unnecessary redundancy rather than a desirable level of internal consistency (Streiner, 2003). In this regard, the estimated alpha coefficient of 0.86 can be interpreted as an indication of good internal consistency (Gliem and Gliem, 2003). With respect to the moderator analysis, each tool displayed Cronbach’s alpha coefficients ranged from 0.81 to 0.89. The 172 studies in total presented 193 effect sizes of alpha coefficients as the measures of internal consistency. Alpha coefficients of studies with IGDS9-SF were most frequently reported, and the result of ANOVA revealed that IGDS9-SF possesses the highest estimated alpha followed by AICA. The funnel plot and Egger’s test of each GD tool indicated the existence of a potential publication bias for GAS-7 (z = –2.02, p = 0.04). The funnel plots for the GD tools are provided in the (Supplementary Figure 2).
Given that the current study only included the psychometrically sound tools to synthesize the reliability coefficients, there is a possibility that the reliability estimation of the current study might be positively biased. A categorical moderator analysis with the specific GD instrument used in the study, was performed to examine whether there were differences between each GD tool. The results of the omnibus subgroup test rejected the null hypothesis, indicating that there are differences between the estimated alpha coefficients of each of the tools. ANOVA analyses between every two GD tools were further performed as the omnibus test results increase the type 1 error. The results indicated that IGDS9-SF (α = 0.89) had the highest estimated alpha, followed by AICA (α = 0.85). Lemmens IGD-9 showed the lowest estimated alpha (α = 0.81) among all the tools.
Caution should be taken in interpreting the results of the pooled Cronbach’s alpha coefficient. The high Cronbach’s alpha is not a perfect index of internal consistency as alpha by itself does not assure an excellent degree of internal consistency (Tavakol and Dennick, 2011). An alpha coefficient can be susceptible to the length of the test, undue narrowness (Streiner, 2003), and dimensionality (Green et al., 1977). The test-retest reliability coefficients can provide additional information on overall reliability when they are interpreted together with the internal consistency coefficients. An intraclass correlation coefficient or test-retest interval correlation coefficient can be referred as the stability or reproducibility of the test (Polit, 2014). The estimated average of the eight reliability coefficients was 0.86 (95% CI = 0.81–0.89) which can be interpreted as a good level (Cicchetti, 1994). More studies should examine the test-retest reliability of GD assessment tools as a very small number of studies have reported on retest reliability in comparison to the studies that have reported on internal consistency.
Validity and Association
The bivariate Pearson’s correlation between the seven variables and GD tools were coded. The estimated effect sizes of the correlation ranged between 0.22 and 0.56 in magnitude. The estimated associations between GD and psychological/behavioral variables were found. The Hedge’s estimator (Borenstein et al., 2011) for the seven variables are as follows: 0.33 for depression, 0.29 for anxiety, 0.30 for aggression, –0.22 for QOL, 0.29 for loneliness, 0.56 for internet addiction, and 0.40 for game playtime.
By synthesizing the effect size of correlation coefficients and examining the convergent and discriminant validity of GD tools, we aimed to scrutinize the association between GD and mental disorders. Unfortunately, the current study offers information only on the association, rather than on causality. The results from the current study do not suggest that the correlation effect sizes are small or large enough to help the society make clear distinction. Since the labeling of the effect size magnitude can be arbitrary (Schober et al., 2018), we suggest an interpretation of the results by comparing each of the effect sizes. For instance, GD tools have a correlation effect size of 0.40 with game playtime and 0.33 with depression, meaning that the depression was found to have a slightly smaller association with GD than the gaming behavior. Anxiety (r = 0.29), aggression (r = 0.30), and loneliness (r = 0.29) showed similar magnitudes of correlation effect sizes. QOL was the only variable negatively associated with GD (r = –0.22). Internet addiction showed the highest correlation with GD. The overlapped items between internet addiction and gaming disorder, especially the IGD criteria for DSM-5, might contribute toward a high association between internet addiction and GD.
The results of the moderator analysis show that the specific GD instrument used in the study significantly moderates the correlation between anxiety and GD. IGDS9-SF captures higher associations (r = 0.33) between anxiety and GD than GAS-7 (r = 0.23). This might be due to the different features of each scales. Study location was found to be a significant moderator for the correlation between aggression and GD. The studies conducted in Asia reported higher association (r = 0.38) between aggression and GD than the studies conducted in Europe (r = 0.24). This is consistent with the findings of previous studies. Studies reporting the role of aggression in gaming disorders have investigated the mediating role of ethnicity and cultural differences (Kim et al., 2018; Prescott et al., 2018). Anderson et al. (2010) also reported that cultural difference can moderate the association between violence, prosocial behavior, and video gaming. A continuous variable moderator analysis shows that the gender ratio of study participants was a significant moderating continuous variable. The higher the percentage of female participants, the stronger the association between game playtime and GD (b = 0.6302 for intercept; b = –0.0033 for one percent point increase in the percent of male participants). The males are known to be more vulnerable than females in developing a gaming disorder (Dong et al., 2018; Fam, 2018). The game playtime seems to have a more direct effect on females than on males.
The Egger’s test, cumulative meta-analysis, and sensitivity analysis revealed an asymmetry in the publications reporting the correlations between game playtime and GD. The studies conducted by Brunborg et al. (2014) and Bányai et al. (2019) influenced the overall effect size. Notably, Bányai et al. (2019) reported Pearson’s bivariate correlation between game playtime and GD of r = –0.01, which is in essence zero. Since the study by Bányai et al. (2019) included e-sport gamers who spent significantly more time playing games than recreational gamers, the correlation reported by the author significantly differs from that of the other studies. The findings of Bányai et al. (2019) presented the moderating role of gaming motivation in causing GD and psychiatric distress, indicating that gaming behavior itself can have even no association with the GD.
The main findings of the current study show that the magnitudes of the effect sizes of convergent and discriminant validities of GD are not significantly different. Given the association of 0.40 between game playtime and GD, common symptoms (e.g., depression, anxiety) of psychopathology also showed considerable associations with GD. As González-Bueso et al. (2018) commented, we agree to the idea that whether the problematic gaming behaviors are a consequence, or a trigger of other psychopathologies cannot be unraveled yet. Studies have reported that just as problematic gaming increases psychological distress, psychological factors such as low self-esteem and loneliness also bidirectionally affect or predict problematic gaming (Lemmens et al., 2011; Tian et al., 2017; Tras, 2019; Wartberg et al., 2019).
To identify the unraveled relationship between GD and psychopathology, and move beyond these debates, future studies must come to a consensus on the diagnostic criteria of gaming disorder. Delphi method can be helpful in developing the diagnostic criteria of GD and arriving at a consensus (Castro-Calvo et al., 2021). The tools should be improved and unified rather than continuously developed by various researchers. Importantly, the clinician interview must be adopted in this field to verify the positive cases of GD and report comorbid psychopathologies (Pontes and Griffiths, 2019). Of the 184 studies included in the current meta-analysis study, only nine studies included clinical samples and adopted structured clinician interviews in a strict sense (e.g., Müller et al., 2019; Wölfling et al., 2019; Phan et al., 2020). Longitudinal and high-quality clinical trial studies (e.g., Han et al., 2017; Li et al., 2017; Wölfling et al., 2019) are also necessary to rebut the argument that the problematic gaming behavior is a consequence of other psychopathologies. With respect to the other aspects of validity, future studies should actively examine the predictive validity using gold standard tool of the diagnosis.
Study Limitations
Some limitations should be noted. First, despite our effort to include all the relevant studies, some could not be coded owing to unreported data. To minimize this limitation, we reached out to researchers, and received relevant information from 17 researchers. Second, the current study focuses on the five GD assessment tools recommend by King et al. (2020). Since more than 40 assessment tools have been developed to assess GD, the representativeness of the five tools included in the current study could be questioned. Rather than establishing our own selection criteria, we selected the five GD assessment tools based on a rigorous review article by King et al. (2020). The third limitation might reside in the conventional two-level meta-analysis model and the high level of study heterogeneity found in both reliability and validity generalization. While efforts were made to investigate the potential reason for high heterogeneity, the categorical and continuous moderator analysis only partially adjusted the heterogeneity. We adopted the conventional two-level meta-analysis model instead of three-level model or robust variance estimation method due to scarce report of the variance of the individual effect sizes within each study. We used effect sizes from longitudinal studies (k = 17) and several effect sizes reported from the same sample (k = 3), and those effect-sizes reported from the same study were not analyzed repeatedly in the current study. If variance of the individual effect sizes within each study are accumulated in a future, a three-level meta-analysis model or robust estimation technique would be recommended to handle the dependent effect sizes and considering within- and between-study heterogeneity. The fourth limitation is that due to insufficient number of studies, we did not perform a meta-analysis for GD and attention deficit hyperactivity disorder, which is a common psychiatric comorbidity in clinical practice (Yen et al., 2017). Five studies reported Pearson’s correlation coefficients ranging from 0.16 to 0.38 between GD and impulsivity. Given the high heterogeneity, we decided that the number of studies on impulsivity was insufficient to carry out a meta-analysis. Fifth, since majority of the included studies in the current study adopted either GAS-7 and IGDS9-SF, the feature of the GAS-7 and IGDS9-SF might affect the effect size estimation. The limitation should be addressed as more studies in this field are conducted.
Conclusion
Despite its limitations, this is the first and largest systematic review study (with 184 studies and 285,752 study participants) to examine the association between GD and psychological/behavioral variables by synthesizing the reliability, and convergent and discriminant validity information of the five GD assessment tools (e.g., IGDS9-SF, GAS-7, Lemmens IGD-9, AICA, and IGDT-10). In addition to the reliability generalization of the GD assessment tools, a major strength of this study is that we applied meta-analytic techniques to investigate the magnitude of relationships between GD and common symptoms of mental disorders (e.g., depression, anxiety disorders, addictions, impulsivity, and hostility), as indicated in previous studies (Han et al., 2017; Na et al., 2017; Wang et al., 2017; González-Bueso et al., 2018; Liu et al., 2018). We also applied same meta-analytic technique to examine the magnitude of association between GD and the gaming behavior. We believe that this meta-analysis provides current status of GD. Future studies should address debatable issues in reliability and convergent/discriminant validity of the GD assessment tools, and more studies should be conducted to better understand the bidirectional relationship between GD and other psychopathologies.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.
Author Contributions
SY, W-YA, JK, S-HS, JC, and K-HC contributed to the conception and design of the study. SY, YY, and ER coded the data and wrote the first draft of the manuscript. SY, YY, ER, and K-HC double-checked the coded data. SY and YY analyzed data. K-HC supervised the overall study process. W-YA, JK, S-HS, JC, and K-HC contributed editing the draft of the manuscript. All authors have read and approved the submitted manuscript.
Funding
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2020R1A2C2099665). The study was also supported by the Investigating Scientific Evidence for Registering Gaming Disorder on Korean Standard Classification of Disease and Cause of Death project, of Ministry of Health and Welfare, Korea, and Korea Creative Content Agency.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We would like to thank Editage (www.editage.co.kr) for English language editing.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.764209/full#supplementary-material
Footnote
References
Aarseth, E., Bean, A. M., Boonen, H., Colder Carras, M., Coulson, M., Das, D., et al. (2017). Scholars’ open debate paper on the World Health Organization ICD-11 Gaming Disorder proposal. J. Behav. Addict. 6, 267–270. doi: 10.1556/2006.5.2016.088
American Psychiatric Association (2013). Diagnostic and Statistical Manual of Mental Disorders, 5th Edn. Arlington, VA: American Psychiatric Publishing. doi: 10.1176/appi.books.9780890425596
Anderson, C. A., Shibuya, A., Ihori, N., Swing, E. L., Bushman, B. J., Sakamoto, A., et al. (2010). Violent video game effects on aggression, empathy, and prosocial behavior in eastern and western countries: a meta-analytic review. Psychol. Bull. 136, 151–173. doi: 10.1037/a0018251
Andreetta, J., Teh, J., Burleigh, T. L., Gomez, R., and Stavropoulos, V. (2020). Associations between comorbid stress and internet gaming disorder symptoms: are there cultural and gender variations? Asia Pac. Psychiatry 12:e12387. doi: 10.1111/appy.12387
Antony, M. M., Bieling, P. J., Cox, B. J., Enns, M. W., and Swinson, R. P. (1998). Psychometric properties of the 42-item and 21-item versions of the depression anxiety stress scales in clinical groups and a community sample. Psychol. Assess. 10, 176–181. doi: 10.1037/1040-3590.10.2.176
Assink, M., and Wibbelink, C. J. (2016). Fitting three-level meta-analytic models in R: A step-by-step tutorial. Quant. Methods Psychol. 12, 154–174. doi: 10.20982/tqmp.12.3.p154
Bányai, F., Griffiths, M. D., Demetrovics, Z., and Király, O. (2019). The mediating effect of motivations between psychiatric distress and gaming disorder among esport gamers and recreational gamers. Compr. Psychiatry 94:152117. doi: 10.1016/j.comppsych.2019.152117
Billieux, J., King, D. L., Higuchi, S., Achab, S., Bowden-Jones, H., Hao, W., et al. (2017). Functional impairment matters in the screening and diagnosis of gaming disorder: commentary on: Scholars’ open debate paper on the World Health Organization ICD-11 Gaming Disorder proposal (Aarseth et al.). J. Behav. Addict. 6, 285–289.
Billieux, J., Schimmenti, A., Khazaal, Y., Maurage, P., and Heeren, A. (2015). Are we overpathologizing everyday life? A tenable blueprint for behavioral addiction research. J. Behav. Addict. 4, 119–123. doi: 10.1556/2006.4.2015.009
Bonett, D. G. (2002). Sample size requirements for testing and estimating coefficient alpha. J. Educ. Behav. Stat. 27, 335–340. doi: 10.3102/10769986027004335
Borenstein, M., Hedges, L. V., Higgins, J. P., and Rothstein, H. R. (2010). A basic introduction to fixed-effect and random-effects models for meta-analysis. Res. Synth. Methods. 1, 97–111. doi: 10.1002/jrsm.12
Borenstein, M., Hedges, L. V., Higgins, J. P., and Rothstein, H. R. (2011). Introduction to Meta-Analysis. Hoboken, NJ: John Wiley & Sons. doi: 10.1002/9780470743386
Borenstein, M., Higgins, J. P., Hedges, L. V., and Rothstein, H. R. (2017). Basics of meta-analysis: I2 is not an absolute measure of heterogeneity. Res. Synth. Methods. 8, 5–18. doi: 10.1002/jrsm.1230
Brannick, M. T., Potter, S. M., Benitez, B., and Morris, S. B. (2019). Bias and precision of alternate estimators in meta-analysis: benefits of blending schmidt-hunter and hedges approaches. Organ. Res. Methods. 22, 490–514. doi: 10.1177/1094428117741966
Brunborg, G. S., Mentzoni, R. A., and Frøyland, L. R. (2014). Is video gaming, or video game addiction, associated with depression, academic achievement, heavy episodic drinking, or conduct problems? J. Behav. Addict. 3, 27–32. doi: 10.1556/jba.3.2014.002
Buss, A. H., and Perry, M. (1992). The aggression questionnaire. J. Pers. Soc. Psychol. 63, 452–459. doi: 10.1037/0022-3514.63.3.452
Castro-Calvo, J., King, D. L., Stein, D. J., Brand, M., Carmi, L., Chamberlain, S. R., et al. (2021). Expert appraisal of criteria for assessing gaming disorder: an international delphi study. Addiction 116, 2463–2475. doi: 10.1111/add.15411
Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol. Assess. 6, 284–290. doi: 10.1037/1040-3590.6.4.284
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika. 16, 297–334. doi: 10.1007/BF02310555
Diener, E., Emmons, R. A., Larsen, R. J., and Griffin, S. (1985). The satisfaction with life scale. J. Pers. Assess. 49, 71–75. doi: 10.1207/s15327752jpa4901_13
Dong, G., Wang, L., Du, X., and Potenza, M. N. (2018). Gender-related differences in neural responses to gaming cues before and after gaming: implications for gender-specific vulnerabilities to Internet gaming disorder. Soc. Cogn. Affect. Neurosci. 13, 1203–1214. doi: 10.1093/scan/nsy084
Egger, M., Smith, G. D., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. Bmj 315, 629–634. doi: 10.1136/bmj.315.7109.629
Fam, J. Y. (2018). Prevalence of internet gaming disorder in adolescents: a meta-analysis across three decades. Scand. J. Psychol. 59, 524–531. doi: 10.1111/sjop.12459
Fauth-Bühler, M., and Mann, K. (2017). Neurobiological correlates of internet gaming disorder: similarities to pathological gambling. Addict. Behav. 64, 349–356. doi: 10.1016/j.addbeh.2015.11.004
Field, A. P. (2005). Is the meta-analysis of correlation coefficients accurate when population correlations vary? Psychol. Methods. 10, 444–467. doi: 10.1037/1082-989X.10.4.444
Field, A. P., and Gillett, R. (2010). How to do a meta-analysis. Br. J. Math. Stat. Psychol. 63, 665–694. doi: 10.1348/000711010X502733
Fu, R., Gartlehner, G., Grant, M., Shamliyan, T., Sedrakyan, A., Wilt, T. J., et al. (2011). Conducting quantitative synthesis when comparing medical interventions: AHRQ and the effective health care program. J. Clin. Epidemiol. 64, 1187–1197. doi: 10.1016/j.jclinepi.2010.08.0100
Gliem, J. A., and Gliem, R. R. (2003). “Calculating, interpreting, and reporting Cronbach’s alpha reliability coefficient for Likert-type scales,” in Proceedings of the Midwest Research-to-Practice Conference in Adult, Continuing, and Community Education, Columbus, OH.
González-Bueso, V., Santamaría, J. J., Fernández, D., Merino, L., Montero, E., and Ribas, J. (2018). Association between internet gaming disorder or pathological video-game use and comorbid psychopathology: a comprehensive review. Int. J. Environ. Res. Public Health 15:668. doi: 10.3390/ijerph15040668
Green, S. B., Lissitz, R. W., and Mulaik, S. A. (1977). Limitations of coefficient alpha as an index of test unidimensionality1. Educ. Psychol. Meas. 37, 827–838. doi: 10.1177/001316447703700403
Griffiths, M. D., Kuss, D. J., Lopez-Fernandez, O., and Pontes, H. M. (2017). Problematic gaming exists and is an example of disordered gaming: commentary on: Scholars’ open debate paper on the World Health Organization ICD-11 Gaming Disorder proposal (Aarseth et al.). J. Behav. Addict. 6, 296–301. doi: 10.1556/2006.6.2017.037
Han, D. H., Kim, S. M., Bae, S., Renshaw, P. F., and Anderson, J. S. (2017). Brain connectivity and psychiatric comorbidity in adolescents with internet gaming disorder. Addict. Biol. 22, 802–812. doi: 10.1111/adb.12347
Han, D. H., Yoo, M., Renshaw, P. F., and Petry, N. M. (2018). A cohort study of patients seeking internet gaming disorder treatment. J. Behav. Addict. 7, 930–938. doi: 10.1556/2006.7.2018.102
Harrer, M., Cuijpers, P., Furukawa, T. A., and Ebert, D. D. (2021). Doing Meta-Analysis With R: A Hands-On Guide. Boca Raton, FL: Chapmann & Hall/CRC Press.
Harrer, M., Cuijpers, P., Furukawa, T., and Ebert, D. (2019). Doing Meta-Analysis in R: A Hands-on Guide. Available online at: https://bookdown.org/MathiasHarrer/Doing_Meta_Analysis_in_R (accessed August 25, 2021).
Hedges, L. V., Tipton, E., and Johnson, M. C. (2010). Robust variance estimation in meta-regression with dependent effect size estimates. Res. Synth. Methods 1, 39–65. doi: 10.1002/jrsm.5
Hunter, J. E., and Schmidt, F. L. (2004). Methods of Meta-Analysis: Correcting Error and Bias in Research Findings. Thousand Oaks, CA: Sage Publishing.
Jeong, H., Yim, H. W., Lee, S.-Y., Lee, H. K., Potenza, M. N., Kwon, J.-H., et al. (2018). Discordance between self-report and clinical diagnosis of internet gaming disorder in adolescents. Sci. Rep. 8, 1–8. doi: 10.1038/s41598-018-28478-8
Jo, Y. S., Bhang, S. Y., Choi, J. S., Lee, H. K., Lee, S. Y., and Kweon, Y.-S. (2019). Clinical characteristics of diagnosis for internet gaming disorder: comparison of DSM-5 IGD and ICD-11 GD diagnosis. J. Clin. Med. 8, 945–957. doi: 10.3390/jcm8070945
Jones, C., Scholes, L., Johnson, D., Katsikitis, M., and Carras, M. C. (2014). Gaming well: links between videogames and flourishing mental health. Front. Psychol. 5:260–267. doi: 10.3389/fpsyg.2014.00260
Kardefelt-Winther, D. (2014). Problematizing excessive online gaming and its psychological predictors. Comput. Hum. Behav. 31, 118–122. doi: 10.1016/j.chb.2013.10.017
Kim, E., Yim, H. W., Jeong, H., Jo, S.-J., Lee, H. K., Son, H. J., et al. (2018). The association between aggression and risk of Internet gaming disorder in Korean adolescents: the mediation effect of father-adolescent communication style. Epidemiol. Health 40:e2018039. doi: 10.4178/epih.e2018039
King, D. L., Chamberlain, S. R., Carragher, N., Billieux, J., Stein, D., Mueller, K., et al. (2020). Screening and assessment tools for gaming disorder: a comprehensive systematic review. Clin. Psychol. Rev. 77:101831. doi: 10.1016/j.cpr.2020.101831
King, D. L., Delfabbro, P. H., Wu, A. M., Doh, Y. Y., Kuss, D. J., Pallesen, S., et al. (2017). Treatment of internet gaming disorder: an international systematic review and CONSORT evaluation. Clin. Psychol. Rev. 54, 123–133. doi: 10.1016/j.cpr.2017.04.002
Király, O., and Demetrovics, Z. (2017). Inclusion of gaming disorder in ICD has more advantages than disadvantages: commentary on: scholars’ open debate paper on the World Health Organization ICD-11 Gaming Disorder proposal (Aarseth et al.). J. Behav. Addict. 6, 280–284. doi: 10.1556/2006.6.2017.046
Király, O., Sleczka, P., Pontes, H. M., Urbán, R., Griffiths, M. D., and Demetrovics, Z. (2017). Validation of the ten-item Internet Gaming Disorder Test (IGDT-10) and evaluation of the nine DSM-5 Internet Gaming Disorder criteria. Addict. Behav. 64, 253–260. doi: 10.1016/j.addbeh.2015.11.005
Kräplin, A., Scherbaum, S., Kraft, E.-M., Rehbein, F., Bühringer, G., Goschke, T., et al. (2021). The role of inhibitory control and decision-making in the course of Internet Gaming Disorder. J. Behav. Addict. 9, 990–1001. doi: 10.1556/2006.2020.00076
Kuss, D. J., Griffiths, M. D., and Pontes, H. M. (2017). DSM-5 diagnosis of Internet Gaming Disorder: Some ways forward in overcoming issues and concerns in the gaming studies field: Response to the commentaries. J. Behav. Addict. 6, 133–141. doi: 10.1556/2006.6.2017.032
Lemmens, J. S., Valkenburg, P. M., and Gentile, D. A. (2015). The internet gaming disorder scale. Psychol. Assess. 27:567. doi: 10.1037/pas0000062
Lemmens, J. S., Valkenburg, P. M., and Peter, J. (2009). Development and validation of a game addiction scale for adolescents. Media Psychol. 12, 77–95. doi: 10.1080/15213260802669458
Lemmens, J. S., Valkenburg, P. M., and Peter, J. (2011). Psychosocial causes and consequences of pathological gaming. Comput. Hum. Behav. 27, 144–152. doi: 10.1016/j.chb.2010.07.015
Li, W., Garland, E. L., McGovern, P., O’Brien, J. E., Tronnier, C., and Howard, M. O. (2017). Mindfulness-oriented recovery enhancement for internet gaming disorder in US adults: a stage I randomized controlled trial. Addict Behav. 31, 393–402. doi: 10.1037/adb0000269
Liu, L., Yao, Y.-W., Li, C.-s. R., Zhang, J.-T., Xia, C.-C., Lan, J., et al. (2018). The comorbidity between internet gaming disorder and depression: interrelationship and neural mechanisms. Front. Psychiatry 9:154. doi: 10.3389/fpsyt.2018.00154
López-Pina, J. A., Sánchez-Meca, J., López-López, J. A., Marín-Martínez, F., Núñez-Núñez, R. M., Rosa-Alcázar, A. I., et al. (2015). The Yale–Brown obsessive compulsive scale: a reliability generalization meta-analysis. Assessment 22, 619–628. doi: 10.1177/1073191114551954
Miller, B. K., Nicols, K. M., Clark, S., Daniels, A., and Grant, W. (2018). Meta-analysis of coefficient alpha for scores on the narcissistic personality inventory. PLoS One 13:e0208331. doi: 10.1371/journal.pone.0208331
Moher, D., Liberati, A., Tetzlaff, J., and Altman, D. G. (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 6:e1000097. doi: 10.1371/journal.pmed.1000097
Müller, K. W., Beutel, M. E., and Wölfling, K. (2014). A contribution to the clinical characterization of Internet addiction in a sample of treatment seekers: validity of assessment, severity of psychopathology and type of co-morbidity. Compr. Psychiatry 55, 770–777. doi: 10.1016/j.comppsych.2014.01.010
Müller, K. W., Beutel, M. E., Dreier, M., and Wölfling, K. (2019). A clinical evaluation of the DSM-5 criteria for internet gaming disorder and a pilot study on their applicability to further Internet-related disorders. J. Behav. Addict. 8, 16–24. doi: 10.1556/2006.7.2018.140
Na, E., Lee, H., Choi, I., and Kim, D. J. (2017). Comorbidity of Internet Gaming Disorder and alcohol use disorder: a focus on clinical characteristics and gaming patterns. Am. J. Addict. 26, 326–334. doi: 10.1111/ajad.12528
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Bmj 372:71. doi: 10.1136/bmj.n71
Petry, N. M., and O’Brien, C. P. (2013). Internet gaming disorder and the DSM-5. Addiction 108, 1186–1187. doi: 10.1111/add.12162
Phan, O., Prieur, C., Bonnaire, C., and Obradovic, I. (2020). Internet gaming disorder: exploring its impact on satisfaction in life in PELLEAS adolescent Sample. Int. J. Environ. Res. Public Health. 17:3. doi: 10.3390/ijerph17010003
Polit, D. F. (2014). Getting serious about test–retest reliability: a critique of retest research and some recommendations. Qual. Life Res. 23, 1713–1720. doi: 10.1007/s11136-014-0632-9
Pontes, H. M., and Griffiths, M. D. (2015). Measuring DSM-5 internet gaming disorder: development and validation of a short psychometric scale. Comput. Hum. Behav. 45, 137–143. doi: 10.1016/j.chb.2014.12.006
Pontes, H. M., and Griffiths, M. D. (2019). A new era for gaming disorder research: Time to shift from consensus to consistency. Addict. Behav. 103:106059. doi: 10.1016/j.addbeh.2019.106059
Pontes, H. M., Macur, M., and Griffiths, M. D. (2016). Internet gaming disorder among slovenian primary schoolchildren: findings from a nationally representative sample of adolescents. J. Behav. Addict. 5, 304–310. doi: 10.1556/2006.5.2016.042
Prescott, A. T., Sargent, J. D., and Hull, J. G. (2018). Metaanalysis of the relationship between violent video game play and physical aggression over time. Proc. Natl. Acad. Sci. U.S.A. 115, 9882–9888. doi: 10.1073/pnas.1611617114
Quintana, D. S. (2015). From pre-registration to publication: a non-technical primer for conducting a meta-analysis to synthesize correlational data. Front. Psychol. 6:1549. doi: 10.3389/fpsyg.2015.01549
Rodriguez, M. C., and Maeda, Y. (2006). Meta-analysis of coefficient alpha. Psychol. Methods. 11, 306–322. doi: 10.1037/1082-989X.11.3.306
Rönkkö, M., and Cho, E. (2020). An updated guideline for assessing discriminant validity. Organ. Res. Methods 2:1094428120968614. doi: 10.1177/1094428120968614
Rothstein, H. R., Sutton, A. J., and Borenstein, M. (2005). Publication Bias in Meta-Analysis: Prevention, Assessment and Adjustments. Hoboken, NJ: John Wiley & Sons. doi: 10.1002/0470870168
Rumpf, H.-J., Achab, S., Billieux, J., Bowden-Jones, H., Carragher, N., Demetrovics, Z., et al. (2018). Including gaming disorder in the ICD-11: The need to do so from a clinical and public health perspective: Commentary on: a weak scientific basis for gaming disorder: Let us err on the side of caution (van Rooij et al., 2018). J. Behav. Addict. 7, 556–561. doi: 10.1556/2006.7.2018.59
Russell, D., Peplau, L. A., and Cutrona, C. E. (1980). The revised UCLA loneliness scale: Concurrent and discriminant validity evidence. J. Pers. Soc. Psychol. 39:472. doi: 10.1037/0022-3514.39.3.472
Sariyska, R., Lachmann, B., Markett, S., Reuter, M., and Montag, C. (2017). Individual differences in implicit learning abilities and impulsive behavior in the context of internet addiction and Internet Gaming Disorder under the consideration of gender. Addict. Behav. Rep 5, 19–28. doi: 10.1016/j.abrep.2017.02.002
Schmidt, F. L., and Hunter, J. E. (1998). The validity and utility of selection methods in personnel psychology: practical and theoretical implications of 85 years of research findings. Psychol. Bull. 124:262. doi: 10.1037/0033-2909.124.2.262
Schober, P., Boer, C., and Schwarte, L. A. (2018). Correlation coefficients: Appropriate use and interpretation. Anesth. Analg. 126, 1763–1768. doi: 10.1213/ANE.0000000000002864
Schwarzer, G. (2007). Meta: an R package for meta-analysis. R News 7, 40–45. doi: 10.1007/978-3-319-21416-0
Sterne, J. A., Egger, M., and Smith, G. D. (2001). Investigating and dealing with publication and other biases in meta-analysis. Bmj 323, 101–105. doi: 10.1136/bmj.323.7304.101
Sterne, J. A., Gavaghan, D., and Egger, M. (2000). Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J. Clin. Epidemiol. 53, 1119–1129. doi: 10.1016/S0895-4356(00)00242-0
Stevens, M. W., Dorstyn, D., Delfabbro, P. H., and King, D. L. (2021). Global prevalence of gaming disorder: a systematic review and meta-analysis. Aust. N. Z. J. Psychiatry 55, 553–568. doi: 10.1177/0004867420962851
Streiner, D. L. (2003). Starting at the beginning: an introduction to coefficient alpha and internal consistency. J. Pers. Assess. 80, 99–103. doi: 10.1207/S15327752JPA8001_18
Tavakol, M., and Dennick, R. (2011). Making sense of Cronbach’s alpha. Int. J. Med. Educ. 2, 53–55. doi: 10.5116/ijme.4dfb.8dfd
Tian, Y., Bian, Y., Han, P., Gao, F., and Wang, P. (2017). Associations between psychosocial factors and generalized pathological internet use in Chinese university students: a longitudinal cross-lagged analysis. Comput. Hum. Behav. 72, 178–188. doi: 10.1016/j.chb.2017.02.048
Tras, Z. (2019). Internet addiction and loneliness as predictors of Internet Gaming Disorder in adolescents. Educ. Res. Rev. 14, 465–473. doi: 10.5897/ERR2019.3768
Vacha-Haase, T. (1998). Reliability generalization: exploring variance in measurement error affecting score reliability across studies. Educ. Psychol. Meas. 58, 6–20. doi: 10.1177/0013164498058001002
Van Den Brink, W. (2017). ICD-11 gaming disorder: needed and just in time or dangerous and much too early? Commentary on: Scholars’ open debate paper on the World Health Organization ICD-11 Gaming Disorder proposal (Aarseth et al.). J. Behav. Addict. 6, 290–292. doi: 10.1556/2006.6.2017.040
Van Rooij, A. J., Ferguson, C. J., Colder Carras, M., Kardefelt-Winther, D., Shi, J., Aarseth, E., et al. (2018). A weak scientific basis for gaming disorder: let us err on the side of caution. J. Behav. Addict. 7, 1–9. doi: 10.1556/2006.7.2018.19
Viechtbauer, W. (2010). Conducting meta-analyses in R with the metafor package. J. Stat. Softw. 36, 1–48. doi: 10.18637/jss.v036.i03
Wang, C.-Y., Wu, Y.-C., Su, C.-H., Lin, P.-C., Ko, C.-H., and Yen, J.-Y. (2017). Association between Internet Gaming Disorder and generalized anxiety disorder. J. Behav. Addict. 6, 564–571. doi: 10.1556/2006.6.2017.088
Wartberg, L., Kriston, L., Zieglmeier, M., Lincoln, T., and Kammerl, R. (2019). A longitudinal study on psychosocial causes and consequences of Internet gaming disorder in adolescence. Psychol. Med. 49, 287–294. doi: 10.1017/S003329171800082X
Wichstrøm, L., Penelo, E., Rensvik Viddal, K., de la Osa, N., and Ezpeleta, L. (2018). Explaining the relationship between temperament and symptoms of psychiatric disorders from preschool to middle childhood: hybrid fixed and random effects models of Norwegian and Spanish children. J. Child Psychol. Psychiatry 59, 285–295. doi: 10.1111/jcpp.12772
Wichstrøm, L., Stenseng, F., Belsky, J., von Soest, T., and Hygen, B. W. (2019). Symptoms of internet gaming disorder in youth: predictors and comorbidity. J. Abnorm. Child Psychol 47, 71–83. doi: 10.1007/s10802-018-0422-x
Wittek, C. T., Finserås, T. R., Pallesen, S., Mentzoni, R. A., Hanss, D., Griffiths, M. D., et al. (2016). Prevalence and predictors of video game addiction: a study based on a national representative sample of gamers. Int. J. Ment. Health Addict. 14, 672–686. doi: 10.1007/s11469-015-9592-8
Wölfling, K., Müller, K. W., Dreier, M., Ruckes, C., Deuster, O., Batra, A., et al. (2019). Efficacy of short-term treatment of internet and computer game addiction: a randomized clinical trial. JAMA Psychiatry 76, 1018–1025. doi: 10.1001/jamapsychiatry.2019.1676
World Health Organization [WHO] (2018). The ICD-11 Classification of Mental and Behavioral Disorders: Diagnostic Criteria for Research. Available online at: https://icd.who.int/en (accessed August 25, 2021).
Yen, J.-Y., Liu, T.-L., Wang, P.-W., Chen, C.-S., Yen, C.-F., and Ko, C.-H. (2017). Association between Internet Gaming Disorder and adult attention deficit and hyperactivity disorder and their correlates: Impulsivity and hostility. Addict. Behav. 64, 308–313. doi: 10.1016/j.addbeh.2016.04.024
Keywords: gaming disorder (GD), meta-analysis, convergent validity, discriminant validity, reliability generalization meta-analysis, validity generalization, association
Citation: Yoon S, Yang Y, Ro E, Ahn W-Y, Kim J, Shin S-H, Chey J and Choi K-H (2021) Reliability, and Convergent and Discriminant Validity of Gaming Disorder Scales: A Meta-Analysis. Front. Psychol. 12:764209. doi: 10.3389/fpsyg.2021.764209
Received: 25 August 2021; Accepted: 10 November 2021;
Published: 07 December 2021.
Edited by:
Marta Martín-Carbonell, Universidad Cooperativa de Colombia, ColombiaReviewed by:
Sai-fu Fung, City University of Hong Kong, Hong Kong SAR, ChinaDanka Purić, University of Belgrade, Serbia
Copyright © 2021 Yoon, Yang, Ro, Ahn, Kim, Shin, Chey and Choi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kee-Hong Choi, a2Nob2lAa29yZWEuYWMua3I=