ORIGINAL RESEARCH article

Front. Oncol., 09 December 2022
Sec. Cancer Epidemiology and Prevention

Psychometric properties of the EQ-5D-5L compared with EQ-5D-3L in cancer patients in Iran

Nasrin Moradi1, Thomas G. Poder2,3, Hossein Safari4, Mohammad M. Mojahedian5, Hosein Ameri6*
  • 1Department of Health Management and Economics, School of Public Health, Iran University of Medical Science, Tehran, Iran
  • 2Department of Management, Evaluation and Health Policy, School of Public Health, University of Montreal, Montreal, QC, Canada
  • 3Centre de recherche de l’Institut universitaire en santé mentale de Montréal, CIUSSS de l’Est de l’île de Montréal, Montreal, QC, Canada
  • 4Health Promotion Research Centre, Iran University of Medical Sciences, Tehran, Iran
  • 5Department of Pharmacoeconomics, School of Pharmacy, Iran University of Medical Science, Tehran, Iran
  • 6Health Policy and Management Research Center, Department of Health Management and Economics, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran

Background and Objective: Psychometric evidence to support the validity and reliability of the EuroQol-5 Dimensions (EQ-5D) in cancer patients is limited. This study aimed to test the validity and reliability of the EQ-5D-5L (5L) in comparison with EQ-5D-3L (3L) in cancer patients.

Methods: Data from 650 cancer patients were collected through consecutive sampling from the three largest governmental cancer centers in Iran between June 2021 and January 2022. The data were gathered using the 3L, 5L, and the European Organization for Research and Treatment of Cancer quality of life questionnaire (QLQ-C30) instruments. The 3L and 5L were compared in terms of ceiling effect, discriminatory power, convergent and known-groups validity, relative efficiency, inconsistency, agreement, and reliability.

Results: Compared with the 3L, the ceiling effect of the 5L decreased by 27.86%. Absolute and relative informativity of discriminatory power improved by 45.93% and 22.92% in the 5L, respectively. All convergent validity coefficients with the 5L were stronger than with the 3L. Both the 3L and 5L demonstrated good known-groups validity, and the relative efficiency was higher for the 5L in 4 out of 7 patient characteristics. The two instruments showed low overall inconsistency (1.45%), and 92.57% of the differences of observations between the 3L and 5L were within the 95% limits of agreement. The intraclass correlation coefficients (ICCs) for the 3L and 5L indexes were 0.88 and 0.85, respectively, and kappa coefficients in the 3L dimensions (range = 0.66–0.92) were higher than in the 5L dimensions (range = 0.64–0.79).

Conclusions: The 5L performed better than the 3L in terms of ceiling effect, inconsistency, discriminatory power, convergent validity, and relative efficiency.

Introduction

Cost-utility analysis (CUA) is one of the economic evaluation approaches which is widely used for assessing healthcare interventions (1). CUA evaluates interventions in terms of cost per quality adjusted life-years (QALYs) gained, where QALYs combine length of life and health-related quality of life (HRQoL) into a single index ranging from 0 (death) to 1 (perfect health), with negative values indicating health states worse than death (2). HRQoL is a multidimensional construct widely used to assess the impact of health status on quality of life (3). In order to calculate QALYs, preference-based HRQoL instruments such as the EuroQol-5 Dimensions (EQ-5D) and the Short-Form 6 dimensions (SF-6D) are widely used (4). The EQ-5D is the most popular type of preference-based instrument that is developed based on general dimensions of health, and can be used in both clinical trials and health services research (5).

The EQ-5D is available in several versions: the EQ-5D-3L (3L), the EQ-5D-5L (5L), and the youth versions of the EQ-5D (EQ-5D-Y-3L and EQ-5D-Y-5L). The 3L consists of two parts: a classification system of five dimensions (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression), each with 3 response levels (no problems, some or moderate problems, and extreme problems), and a visual analogue scale (VAS). The classification system of the 3L generates 243 (i.e., 3^5) possible health states. The VAS records the current self-rated health of respondents on a vertical line ranging from 0 (the worst imaginable health) to 100 (the best imaginable health). The newer version of the EQ-5D, the 5L, includes a classification system of five dimensions with 5 response levels per dimension (no problems, slight problems, moderate problems, severe problems, and unable to/extreme problems) and a VAS. The 5L classification system defines 3125 (i.e., 5^5) possible health states. The EQ-5D-Y is used for adolescents aged 7 to 12 years; it includes 5 dimensions with 3 levels and is worded so that children can easily understand it (6).

Increasing evidence shows that the 5L has gained widespread attention in studies, because expanding the range of responses from three to five levels on each dimension has increased the instrument’s sensitivity and decreased its ceiling effect (7). This version has been translated into more than 100 languages including Persian, and its psychometric properties have been assessed in comparison with the 3L in various general and patient groups (8–11), but not in a developing country.

Given the growing number of cancers and available treatments, an increasing use of the 5L to perform CUA and to assess the HRQoL in cancer patients is expected in the years to come. To the best of our knowledge, the psychometric properties of the 5L in comparison with the 3L have never been assessed for multiple cancers simultaneously with value sets of both versions elicited from the country’s general population. A recent study addressed the psychometric properties of the Spanish versions of EQ-5D-Y-3L and EQ-5D-Y-5L in children with cancer. Overall, the study showed that the properties of EQ-5D-Y-5L were better than those of EQ-5D-Y-3L (12). In Iran, the value sets of the 3L in 2017 and 5L in 2022 were derived from the general population (13, 14). The aim of the present study was to assess the psychometric properties of the 5L in Iranian cancer patients and then compare its properties with those of the 3L in the same set of patients.

Methods

Study design and data collection

A convenience sample of 650 cancer inpatients and outpatients was selected consecutively from surgery, chemotherapy, and radiotherapy wards in three of the largest governmental cancer centers in three Iranian provinces (Tehran, Esfahan, and Fars) between June 2021 and January 2022. These provinces contain almost a third of Iran’s population, and their cancer centers admit patients from all over the country. Patients with a pathologically confirmed diagnosis of cancer and healthy cognitive status who provided informed consent were recruited in accordance with the ethical standards of the national research committee (approval no. IR. IUMS.1400.394).

Data were gathered using the 3L, 5L, European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (QLQ-C30), and a number of questions on demographic characteristics through face-to-face interviews during a meeting between patients and researchers in patient rooms. Some clinical data were also extracted from the medical records of patients (cancer diagnosis date and service type). The questionnaires were completed in a random sequence to avoid potential bias from an order effect.

EQ-5D instrument

The EQ-5D is commonly used in two versions: the 3L and 5L. In 2013, the 3L value set was derived from the general public in Iran using the time trade-off method, with a scale ranging from -0.113 (‘33333’ representing the worst health state) to 1 (‘11111’ representing the best health state) (13). In 2022, the Iranian value set of the 5L was generated from the general public using the EuroQoL Group’s Valuation Technology (EQ-VT) protocol, and its scoring function was based on composite time trade-off (cTTO) and discrete choice experiment (DCE) methods. The cTTO was conducted on 86 health states and 196 health states were selected for the DCE valuation. The scores of 5L index range from -1.19 (‘55555’ representing the worst health state) to 1 (‘11111’ representing the best health state) (14).

The QLQ-C30 is a self-reported 30-item questionnaire assessing the HRQoL of cancer patients (15). It contains a global health scale, five functional scales (physical, role, emotional, cognitive, and social), three symptom scales (fatigue, nausea/vomiting, pain), and six single items. All QLQ-C30 items have 4 response levels, except for the two items related to global health status, which have 7 response levels. The scoring procedure for the QLQ-C30 scales is performed in two stages. First, the average of the items in a scale is calculated to obtain the raw score. Second, the raw score is converted to a score ranging from 0 to 100. A high score on the functional scales and global health status represents a high level of functioning and HRQoL, whereas a higher score on the symptom scales/items represents more health problems (15). The QLQ-C30 version 3 was translated and validated in a sample of cancer patients in Iran (16).
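
The two-stage scoring described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name and arguments are hypothetical, and the linear transformation shown (subtract 1, divide by the item range, and reverse the direction for functional scales) is assumed from the standard EORTC scoring convention rather than taken from this article.

```python
def qlq_c30_scale_score(item_responses, item_range, functional):
    """Score one QLQ-C30 scale on a 0-100 range (illustrative sketch).

    item_responses: answered item values for the scale (1-4, or 1-7 for global health)
    item_range: maximum minus minimum item value (3 for 4-level items, 6 for global health)
    functional: True for functional scales; False for symptom scales/items and global health
    """
    # Stage 1: raw score = mean of the items in the scale
    raw_score = sum(item_responses) / len(item_responses)
    # Stage 2: linear rescaling of the raw score to 0..1
    fraction = (raw_score - 1) / item_range
    if functional:
        # Reverse direction so that a high score means better functioning
        fraction = 1 - fraction
    return fraction * 100


# Hypothetical responses: two pain items answered 2 and 3 on the 1-4 scale
pain_score = qlq_c30_scale_score([2, 3], item_range=3, functional=False)
```

With this convention a patient answering "not at all" (1) to every item of a functional scale scores 100, matching the interpretation that high functional scores indicate good functioning.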

Data analysis

Ceiling effect

Ceiling effect for both the 5L and 3L was calculated as the proportion of patients reporting ‘no problem’ (level 1) on each dimension and the proportion reporting ‘no problem’ on all dimensions (health state “11111”). A percentage of ‘no problem’ below 15% for all dimensions is considered an acceptable ceiling effect (17). Based on the results of most studies, we hypothesized that the ceiling effect of the 5L is lower than that of the 3L (8, 9, 11, 18–22). Since the ceiling effect was very small in some patients, in addition to the absolute reduction, a relative reduction was calculated when going from the 3L to the 5L using the following formula: (ceiling3L - ceiling5L)/ceiling3L. The differences between ceiling effects of the 3L and 5L were assessed by the McNemar test.
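
As a hedged illustration of the calculation above, the sketch below computes the relative reduction and a McNemar statistic for paired responses. The function names are hypothetical, the uncorrected chi-square form of the McNemar test is an assumption (the article does not say which variant was used), and the discordant-pair counts in the usage example are invented for demonstration only.

```python
import math


def relative_reduction(ceiling_3l, ceiling_5l):
    """(ceiling3L - ceiling5L) / ceiling3L, as defined in the text."""
    return (ceiling_3l - ceiling_5l) / ceiling_3l


def mcnemar_test(only_3l_ceiling, only_5l_ceiling):
    """Uncorrected McNemar chi-square test on the two discordant counts.

    only_3l_ceiling: patients at 'no problem' on the 3L but not on the 5L
    only_5l_ceiling: patients at 'no problem' on the 5L but not on the 3L
    Returns (statistic, p_value) for a chi-square distribution with 1 df.
    """
    b, c = only_3l_ceiling, only_5l_ceiling
    statistic = (b - c) ** 2 / (b + c)
    # For 1 degree of freedom, P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Overall ceiling effects reported in the Results: 12.07% (3L) and 9.44% (5L)
overall = relative_reduction(12.07, 9.44)   # about 0.218, i.e. 21.8%

# Hypothetical discordant counts, for illustration only
stat, p = mcnemar_test(20, 5)
```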

Inconsistency properties

Inconsistency properties were assessed based on the criteria presented by Janssen et al. (23). A response pair is considered inconsistent if the response in the 3L was followed by a response in the 5L that was at least two levels away. To quantify the size of inconsistency, the 3L responses were first recoded on the 5L scale (denoted 3L5L) as follows: 1 = 1, 2 = 3, 3 = 5. The size was then calculated as |3L5L - 5L| - 1, so that a difference of two or more levels between the recoded 3L response and the 5L response indicated inconsistency.
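
The recoding rule can be sketched in a few lines. This is an illustrative reading, assuming that a pair counts as inconsistent when the recoded 3L response lies at least two levels from the 5L response and that consistent pairs are assigned a size of 0; the names below are hypothetical.

```python
# Recoding of 3L levels onto the 5L scale (3L5L): 1 = 1, 2 = 3, 3 = 5
RECODE_3L_TO_5L = {1: 1, 2: 3, 3: 5}


def inconsistency_size(level_3l, level_5l):
    """Return 0 for a consistent pair, otherwise |3L5L - 5L| - 1."""
    gap = abs(RECODE_3L_TO_5L[level_3l] - level_5l)
    return gap - 1 if gap >= 2 else 0


def inconsistency_rate(pairs):
    """Share of (3L, 5L) response pairs that are inconsistent."""
    flagged = sum(1 for l3, l5 in pairs if inconsistency_size(l3, l5) > 0)
    return flagged / len(pairs)


# Hypothetical response pairs for one dimension
example_pairs = [(1, 1), (1, 2), (2, 3), (1, 3), (3, 5)]
rate = inconsistency_rate(example_pairs)   # only (1, 3) is inconsistent
```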

Discriminatory power

Discriminatory power of the instruments was assessed using informativity indices (the Shannon index and the Shannon Evenness index) (24). The Shannon index (H′) was calculated as follows:

$$H' = -\sum_{i=1}^{L} p_i \log_2 p_i$$

where H′ denotes the absolute amount of informativity captured, L is the number of levels in a dimension of the EQ-5D, and pi = ni/N is the proportion of observations in the ith level (i = 1,…, L), with ni the observed number of responses in category i and N the total sample size. The higher the H′, the greater the absolute information. H′ reaches its maximum when the responses are evenly distributed across all levels; this maximum is defined as H′max = log2 L. The Shannon Evenness index (J′) reflects the relative informativity of the questionnaire, regardless of the number of levels, and is calculated by the following formula: J′ = H′/H′max. A higher J′ indicates greater relative information.
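
A short sketch of both indices for a single dimension, following the definitions above. The function name and the example responses are hypothetical; levels with no responses are simply skipped, since they contribute nothing to the sum.

```python
import math
from collections import Counter


def shannon_indices(responses, n_levels):
    """Return (H', J') for one EQ-5D dimension.

    responses: list of observed levels (e.g. 1-3 for the 3L, 1-5 for the 5L)
    n_levels: number of levels L in the dimension
    """
    n = len(responses)
    counts = Counter(responses)
    # H' = -sum(p_i * log2(p_i)) over the levels that were actually used
    h = -sum((c / n) * math.log2(c / n) for c in counts.values())
    h_max = math.log2(n_levels)   # attained when responses are spread evenly
    return h, h / h_max


# Hypothetical 5L responses split evenly between levels 1 and 2
h_5l, j_5l = shannon_indices([1, 1, 2, 2], n_levels=5)   # H' = 1.0
```

Because J′ divides by log2 L, the same H′ yields a lower J′ on the 5L than on the 3L, which is why the two indices are reported side by side.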

Convergent validity

Convergent validity was assessed by considering the correlation between the EQ-5D dimensions and selected QLQ-C30 scales using Spearman’s rank correlation coefficient. It is expected that each dimension of both EQ-5D versions would correlate more strongly with QLQ-C30 scales that are theoretically very similar than with those that are theoretically dissimilar (e.g., usual activities of the EQ-5D is expected to be more correlated with physical function than with social function of the QLQ-C30). It is also expected that the correlation between the 3L and QLQ-C30 would be similar to or weaker than that between the 5L and QLQ-C30.
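
For readers who want to see the statistic itself, Spearman’s coefficient is the Pearson correlation of the ranks, with tied values sharing their average rank. The sketch below is a plain-Python illustration with hypothetical names and data; the sign of the result depends on how the two scales are oriented.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values receive the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1   # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the average ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: 5L pain/discomfort levels and QLQ-C30 pain scores
rho = spearman_rho([1, 2, 3, 4, 5], [0.0, 16.7, 33.3, 66.7, 100.0])
```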

Known-groups validity

The known-groups validity of both EQ-5D versions was assessed by testing the difference between the mean EQ-5D index scores across the categories of each patient characteristic using the independent-samples t-test or ANOVA. It is expected that the mean EQ-5D index score will be higher for patients with younger age, higher education, and for those receiving less severe treatment strategies and having shorter duration of diagnosis and no comorbidities. The relative precision of both versions was assessed using the relative efficiency (RE). The RE is calculated as the ratio of ANOVA F-statistics (F-statistic5L/F-statistic3L). An RE greater than 1 indicates that the 5L has greater discriminatory ability than the 3L, and vice versa (10).
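
The RE can be sketched as a ratio of one-way ANOVA F-statistics. The helper names and the index values below are hypothetical; for two groups, the F-statistic equals the square of the pooled-variance t-statistic, so the same function covers both tests mentioned above.

```python
def anova_f(groups):
    """One-way ANOVA F-statistic for a list of groups of index scores."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, group_means))
    ss_within = sum((v - m) ** 2
                    for g, m in zip(groups, group_means) for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def relative_efficiency(groups_5l, groups_3l):
    """RE = F(5L) / F(3L); values above 1 favour the 5L."""
    return anova_f(groups_5l) / anova_f(groups_3l)


# Hypothetical index scores for two known groups (e.g. with/without comorbidity)
re_value = relative_efficiency(
    groups_5l=[[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]],
    groups_3l=[[1.0, 2.0, 3.0], [2.0, 3.0, 4.0]],
)
```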

Reliability

The reliability of the index scores of both EQ-5D versions and of each EQ-5D dimension was assessed using the intraclass correlation coefficient (ICC) and Cohen’s weighted kappa coefficient, respectively. A sub-sample of 70 patients was surveyed again 1–2 weeks after the first survey, and only patients whose response on the general health status (QoL question) remained unchanged were included in the final analysis (i.e., to avoid an external shock on data). The ICC was computed using a one-way random-effects, single-measure model. Both ICC and kappa were interpreted as follows: “poor,” < 0.40; “fair to good,” 0.40–0.75; and “excellent,” > 0.75 (25).
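
A one-way random-effects, single-measure ICC can be computed from the between-patient and within-patient mean squares. The sketch below assumes the usual ICC(1,1) form, (MSB - MSW) / (MSB + (k - 1) MSW), with k = 2 measurements per patient for test–retest data; the function name and scores are hypothetical.

```python
def icc_oneway_single(ratings):
    """ICC(1,1) for a list of per-patient tuples, e.g. (test, retest)."""
    n = len(ratings)       # number of patients
    k = len(ratings[0])    # measurements per patient
    grand_mean = sum(sum(r) for r in ratings) / (n * k)
    subject_means = [sum(r) / k for r in ratings]
    # Between-patient mean square
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    # Within-patient mean square
    ms_within = sum((v - m) ** 2
                    for r, m in zip(ratings, subject_means)
                    for v in r) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical test and retest scores for three patients
icc = icc_oneway_single([(1.0, 2.0), (3.0, 4.0), (5.0, 6.0)])   # 7.5 / 8.5
```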

Agreement

The Bland–Altman plot is an established method for assessing agreement and systematic bias between two measures (26). The plot shows the average value of the 3L and 5L index scores (x-axis) against their difference (y-axis). The 95% limits of agreement were computed as the mean difference between the 3L and 5L index scores ±1.96 SD of the differences. The closer to zero the mean difference between the scores of the two EQ-5D versions, the better the level of agreement.
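
The quantities behind the plot can be sketched without any plotting library. The function name and index values below are hypothetical, and the difference is taken as 3L minus 5L, which is an assumption consistent with the higher mean 3L index reported in this study.

```python
import statistics


def bland_altman(index_3l, index_5l):
    """Return (mean difference, lower limit, upper limit, share within limits)."""
    diffs = [a - b for a, b in zip(index_3l, index_5l)]   # y-axis of the plot
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)                     # sample SD of the differences
    lower = mean_diff - 1.96 * sd_diff
    upper = mean_diff + 1.96 * sd_diff
    within = sum(1 for d in diffs if lower <= d <= upper) / len(diffs)
    return mean_diff, lower, upper, within


# Hypothetical paired index scores for three patients
mean_diff, lower, upper, within = bland_altman([0.8, 0.6, 0.4], [0.7, 0.4, 0.1])
```

The x-axis values of the plot are the pairwise averages, (3L + 5L) / 2, which are omitted here because they do not enter the limits of agreement.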

All statistical analyses were performed with STATA version 14.0.

Results

Patients’ demographic and clinical characteristics in both surveys are presented in Table 1. In the first survey, of the 650 patients who completed the questionnaires, 6 were excluded from the final analysis because of missing data on the QLQ-C30 scales. The mean age of the patients was 51.71 (SD ± 13.5). In the second survey, the response of 68 patients on the general health status (QoL question) remained unchanged. The demographic and clinical characteristics of patients in both surveys were very similar, except for gender. The mean age of the patients in the second survey was 52.4 (SD ± 14.3). The most common cancer among the patients was colorectal cancer (22.14%) (Table 1).

Table 1 Patients’ characteristics in the surveys.

Ceiling effect

The highest proportion of “no problems” was in the “self-care” dimension for both the 3L (68.73%) and 5L (64.86%), while the lowest proportion was in “pain/discomfort” (29.26% and 24.61%, respectively). Ceiling effects in all dimensions of the 5L were lower than those of the 3L, and the differences in ceiling effects between the two versions were statistically significant (P < 0.001). Furthermore, the proportion of patients reporting the health state ‘11111’ decreased significantly from 12.07% in the 3L to 9.44% in the 5L (P < 0.001) (Table 2).

Table 2 Proportion of “no problems” and ceiling effects difference between 3L and 5L.

Inconsistency

The inconsistent response pairs for each dimension of the two versions of the EQ-5D are presented in Table 3. The highest proportion of inconsistency was found in “anxiety/depression” (2.32%), followed by “mobility” (2.16%). Table 3 also shows that the overall proportion and average size of inconsistency were 1.45% and 1, respectively.

Table 3 Inconsistency properties of the 3L and 5L.

Discriminatory power

The values of absolute (H′) and relative (J′) informativity were higher for the 5L than for the 3L: absolute informativity in the overall classification system increased from 0.94 for the 3L to 1.38 for the 5L, and relative informativity increased from 0.48 to 0.59. The “usual activities” dimension demonstrated the highest increase in absolute (142.65%) and relative (486.07%) informativity (Table 4).

Table 4 Shannon index (H′) and Shannon Evenness index (J′) for the 3L and 5L.

Convergent validity

Spearman’s rank correlation coefficients showed that the subscales of the QLQ-C30 were significantly correlated with the 5L dimensions (from 0.37 to 0.60) and with the 3L dimensions (from 0.29 to 0.57). The correlation between “anxiety/depression” and “emotional functioning” had the highest value for both the 3L (0.57) and 5L (0.60) (Table 5). For each EQ-5D version, the Spearman coefficients showed that correlations between conceptually related dimensions were higher than those between conceptually unrelated dimensions. For instance, high correlations were found between the usual activities dimension of the EQ-5D and the physical functioning scale of the QLQ-C30, followed by the pain/discomfort dimension of the EQ-5D and the pain scale of the QLQ-C30 (Table 5).

Table 5 Convergent validity of the 3L and 5L with QLQ-C30 scales.

Known-groups validity

Table 6 shows that mean scores of both EQ-5D versions were higher for patients who were male, younger, better educated, married, those receiving less severe current treatment strategies, those having shorter duration of disease since diagnosis, and those without comorbidities. For the 3L, the differences between mean scores were significant within age, education, marital status, and treatment status (P < 0.05). For the 5L, the differences were significant within age, education, and treatment status (P < 0.05). Furthermore, the RE was greater than 1 for education (1.18), treatment status (1.02), duration of disease since diagnosis (1.14), and comorbidities (2.29).

Table 6 Known-groups validity and relative efficiency of the 3L and 5L.

Reliability

The “mobility” dimension showed the highest kappa coefficient for the 3L (0.92) and 5L (0.79), while the “anxiety/depression” dimension demonstrated the lowest for both versions (0.66 and 0.64, respectively). The agreement rate ranged from 0.76 (anxiety/depression) to 0.90 (mobility) for the 3L and from 0.73 to 0.86 for the 5L. The ICCs for the 3L and 5L indexes were 0.88 and 0.85, respectively, indicating good reproducibility for both versions (Table 7).

Table 7 Test–retest reliability of the 3L and 5L.

Agreement

The Bland–Altman plot showed that the mean difference between the two versions of the EQ-5D was 0.21. The plot also revealed that 92.57% of the differences of observations between the 3L and 5L were within the 95% limits of agreement (-0.20 to 0.62). The remaining 7.43% of the differences lay above the upper 95% limit, and none fell below the lower limit. Differences between scores of the two instruments tended to increase at lower mean values (Figure 1).

Figure 1 Bland–Altman plot of the 3L and 5L index values.

Discussion

This is the first study to use national value sets of both EQ-5D versions to assess psychometric properties of the 3L and 5L in the context of multiple cancers in terms of ceiling effects, inconsistency, informativity, convergent and known-groups validity, reliability, and agreement. The overall mean 5L index score (0.44) was found to be substantially lower than that of the 3L (0.65).

Ceiling effects were observed for both the 3L and 5L (12.07% and 9.44%, respectively), but were lower than the acceptable limit of 15% reported for instruments (17). These results were also slightly lower than what was reported in another study conducted for multiple cancers in Korea (16.8% and 9.7%, respectively). This difference may be due to the lower proportion of more severe cancers in Korea compared to this study. The largest proportion of cancer in Korea (21) was breast (32.9%), followed by colorectal (13.7%), while it was colorectal (22.1%) and lung (20.4%) in this study. Similar to previous studies (11, 21, 27–29), adding two more levels to the 3L significantly reduced the overall ceiling effect; in this study it fell from 12.07% to 9.44% (2.63 percentage points), a relative reduction of 21.79%. The highest proportion of “no problems” responses was reported in the “self-care” dimension for both versions. Similar results were found for other diseases such as psoriasis (11) and diabetes (27).

The overall proportion and average size of inconsistent responses were very low (1.45% and 1, respectively) and fell within the range reported in previous studies (0–10.6%), while the highest proportion was in “anxiety/depression” (2.32%) and the lowest in “self-care” (0.46%). The higher variability in the “anxiety/depression” dimension can be explained by its potentially more subjective nature. This was consistent with the findings of studies conducted on the general population and on patient groups (7, 11), but inconsistent with what was reported in Korea (21). The difference may be explained by differences in the proportion of cancer types in the two studies.

Discriminatory power increased in all dimensions and in the overall classification system when moving from the 3L to the 5L. The absolute informativity (H′) of the 5L exceeded that of the 3L by 0.44 (45.93%). We also found that relative informativity (J′) for the overall classification system increased by 22.92%. Similar results were reported in another study conducted on multiple cancers (21), which shows that the extra levels of severity added to the EQ-5D’s descriptive system were used efficiently. Our results for relative informativity were inconsistent with the findings from psoriatic patients and the general population (11, 30). This may be due to differences between the sample selected in our study and those of other studies, as cancer patients have more moderate/extreme conditions.

As expected, the correlation coefficients between the 5L dimensions and QLQ-C30 subscales (from 0.37 to 0.60) were higher than those for the 3L dimensions and QLQ-C30 subscales (from 0.29 to 0.57). The pattern of correlation of the 3L and 5L with QLQ-C30 scales was alike, the highest correlation being between “anxiety/depression” and “emotional functioning”, followed by “usual activities” and “physical functioning”, and the lowest was for “self-care” and “social functioning”. This shows that the degree of correlation between the dimensions of both EQ-5D versions that are theoretically very similar to scales of QLQ-C30 was higher than those of the EQ-5D dimensions and QLQ-C30 that are theoretically dissimilar. Therefore, the convergent validity of both EQ-5D versions was confirmed in this study, and supports the results of the study conducted in Korea (21) and other studies (10, 11). Furthermore, the 5L showed stronger correlations with other measures compared to 3L for all dimensions in our study and previous studies (10, 11, 31, 32).

The results of known-groups validity confirmed sufficient ability of both EQ-5D versions to discriminate between mean scores across demographic and clinical characteristics. Higher mean EQ-5D scores were associated with higher education, being younger, being married, being under less severe treatment strategies, shorter duration of disease since diagnosis, and no comorbidities. These findings were consistent with previous studies (30, 33, 34), while the known-groups result for gender was not. Known-groups analysis revealed that female patients had higher utility scores than male patients for both EQ-5D versions. The higher score for women can be linked to the higher number of less severe cancers (breast cancer) among them. Indeed, previous studies revealed that the mean EQ-5D score in breast cancer was higher than in other cancers such as digestive system cancer (35). Furthermore, the RE results demonstrated a higher discriminatory efficiency for the 5L for the education variable and all clinical variables. The higher discriminatory efficiency of the 5L in clinical variables was also reported in studies conducted on psoriatic (11) and acute myeloid leukemia (36) patients.

ICC and kappa coefficients confirmed high reproducibility for both EQ-5D versions in our study. The ICC values for the 3L (0.88) and 5L (0.85) exceeded the 0.75 threshold for an excellent level of reproducibility (25), and kappa revealed that all dimensions of both EQ-5D versions fell within the range reported for a good level of reproducibility (25). Reliability was better than what was reported in Korea (21). This can be explained by the fact that the average time interval between the two surveys in our study was shorter (7.5 versus 11.5 days), so the condition of the patients might have changed less. Similar to the study in Korea, our reliability results showed that the 5L was less reproducible than the 3L in all dimensions. This may be due to the larger number of levels in the 5L, which may affect recall bias. This result was not supported in studies conducted in the general public (37) and in patients with diabetes (27). This difference can be explained by the differences in the health status of target populations.

The Bland–Altman plot revealed that 92.57% of the differences of observations between the 3L and 5L were within the 95% limits of agreement (-0.20 to 0.62). Although a high proportion of observations fell within the limits, the differences were not distributed symmetrically. The plot showed that the agreement between the two instruments was weaker for patients in more severe health states (i.e., patients with lower utility values), where the majority of the differences in utility scores lay outside the limits of agreement. That is, the 3L overestimated the utilities for more severe health states and underestimated them for better health states. This might be due to differences between the minimum index values of the 3L and 5L (-0.113 and -1.19, respectively), and to the presence of more negative values in the Iranian value set of the 5L compared to the 3L value set. One limitation that should be noted is that we did not assess responsiveness in the study, which is an important measurement property.

Conclusion

Findings suggest that the 5L was better than the 3L in terms of ceiling effect, inconsistency, discriminatory power, convergent validity, and relative precision. Both versions of the EQ-5D demonstrated good known-groups validity and reliability, and 92.57% of the differences of observations between the 3L and 5L were within the 95% limits of agreement. It is thus recommended to use the 5L in cancer research or clinical practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. Requests to access these datasets should be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by the Iranian national research committee (approval no. IR. IUMS.1400.394). The patients/participants provided their written informed consent to participate in this study.

Author contributions

Study design and statistical analysis and interpretation of the data: HA and NM. Drafting of the manuscript: HA, TGP, and HS. Critical revision of the manuscript for important intellectual content: HA, TGP, and MM. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by Health Promotion Research Centre (grant number 20916) and Iranian National Science Foundation (grant number 98025084).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Excellence C. Methods for the development of NICE public health guidance. National institute of health and care excellence. (2012).

2. Weinstein MC, Torrance G, McGuire A. QALYs: the basics. Value in health. (2009) 12:S5–S9. doi: 10.1111/j.1524-4733.2009.00515.x

3. Guyatt GH, Feeny DH, Patrick DL. Measuring health-related quality of life. Annals of internal medicine. (1993) 118(8):622–9. doi: 10.7326/0003-4819-118-8-199304150-00009

4. Yousefi M, Nahvijou A, Sari AA, Ameri H. Mapping QLQ-C30 onto EQ-5D-5L and SF-6D-V2 in patients with colorectal and breast cancer from a developing country. Value Health Regional Issues. (2021) 24:57–66. doi: 10.1016/j.vhri.2020.06.006

5. Kennedy-Martin M, Slaap B, Herdman M, van Reenen M, Kennedy-Martin T, Greiner W, et al. Which multi-attribute utility instruments are recommended for use in cost-utility analysis? a review of national health technology assessment (HTA) guidelines. Eur J Health Economics. (2020) 21(8):1245–57. doi: 10.1007/s10198-020-01195-8

6. Craig BM, Pickard AS, Rand-Hendriksen K. Do health preferences contradict ordering of EQ-5D labels? Qual Life Res (2015) 24(7):1759–65. doi: 10.1007/s11136-014-0897-z

7. Buchholz I, Janssen MF, Kohlmann T, Feng Y-S. A systematic review of studies comparing the measurement properties of the three-level and five-level versions of the EQ-5D. Pharmacoeconomics. (2018) 36(6):645–61. doi: 10.1007/s40273-018-0642-5

8. Kim SH, Hwang JS, Kim TW, Hong YS, Jo M-W. Validity and reliability of the EQ-5D for cancer patients in Korea. Supportive Care Cancer. (2012) 20(12):3155–60. doi: 10.1007/s00520-012-1457-0

9. Lang H-C, Chuang L, Shun S-C, Hsieh C-L, Lan C-F. Validation of EQ-5D in patients with cervical cancer in Taiwan. Supportive Care cancer. (2010) 18(10):1279–86. doi: 10.1007/s00520-009-0745-9

10. Yfantopoulos JN, Chantzaras AE. Validation and comparison of the psychometric properties of the EQ-5D-3L and EQ-5D-5L instruments in Greece. Eur J Health Economics. (2017) 18(4):519–31. doi: 10.1007/s10198-016-0807-0

11. Yfantopoulos J, Chantzaras A, Kontodimas S. Assessment of the psychometric properties of the EQ-5D-3L and EQ-5D-5L instruments in psoriasis. Arch Dermatol Res (2017) 309(5):357–70. doi: 10.1007/s00403-017-1743-2

12. Perez-Sousa MA, Olivares PR, Gusi N. Psychometric properties of the Spanish versions of EQ-5D-Y-3L and EQ-5D-Y-5L in children with cancer: A comparative study. Int J Environ Res Public Health (2022) 19(18):11420. doi: 10.3390/ijerph191811420

13. Goudarzi R, Sari AA, Zeraati H, Rashidian A, Mohammad K, Amini S. Valuation of quality weights for EuroQol 5-dimensional health states with the time trade-off method in the capital of Iran. Value Health regional issues. (2019) 18:170–5. doi: 10.1016/j.vhri.2019.01.007

14. Afshari S, Goudarzi R, Mahboub–Ahari A, Yaseri M, Sari AA, Ameri H, et al. A national survey of Iranian general population to identify the EQ-5D-5L value set. Tehran: Tehran University of Medical Sciences, School of Public Health (2022).

15. Scott N, Fayers P, Aaronson N, Bottomley A, de Graeff A, Groenvold M. EORTC QLQ-C30 reference values manual. Brussels, Belgium: EORTC Quality of Life Group (2008).

16. Montazeri A, Harirchi I, Vahdani M, Khaleghi F, Jarvandi S, Ebrahimi M, et al. The European organization for research and treatment of cancer quality of life questionnaire (EORTC QLQ-C30): translation and validation study of the Iranian version. Supportive Care Cancer. (1999) 7(6):400–6. doi: 10.1007/s005200050300

17. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol (2007) 60(1):34–42. doi: 10.1016/j.jclinepi.2006.03.012

18. De Smedt D, Clays E, Doyle F, Kotseva K, Prugger C, Pająk A, et al. Validity and reliability of three commonly used quality of life measures in a large European population of coronary heart disease patients. Int J Cardiol (2013) 167(5):2294–9. doi: 10.1016/j.ijcard.2012.06.025

19. Li L, Liu C, Cai X, Yu H, Zeng X, Sui M, et al. Validity and reliability of the EQ-5D-5L in family caregivers of leukemia patients. BMC Cancer. (2019) 19(1):522. doi: 10.1186/s12885-019-5721-2

20. Kim M-H, Cho Y-S, Uhm W-S, Kim S, Bae S-C. Cross-cultural adaptation and validation of the Korean version of the EQ-5D in patients with rheumatic diseases. Qual Life Res (2005) 14(5):1401–6. doi: 10.1007/s11136-004-5681-z

21. Kim SH, Kim HJ, Lee S-i, Jo M-W. Comparing the psychometric properties of the EQ-5D-3L and EQ-5D-5L in cancer patients in Korea. Qual Life Res (2012) 21(6):1065–73. doi: 10.1007/s11136-011-0018-1

22. Sakthong P, Sonsa-ardjit N, Sukarnjanaset P, Munpan W. Psychometric properties of the EQ-5D-5L in Thai patients with chronic diseases. Qual Life Res (2015) 24(12):3015–22. doi: 10.1007/s11136-015-1038-z

23. Janssen MF, Birnie E, Haagsma JA, Bonsel GJ. Comparing the standard EQ-5D three-level system with a five-level version. Value Health (2008) 11(2):275–84.

24. Janssen MFB, Birnie E, Bonsel GJ. Evaluating the discriminatory power of EQ-5D, HUI2 and HUI3 in a US general population survey using Shannon’s indices. Qual Life Res (2007) 16(5):895–904. doi: 10.1007/s11136-006-9160-6

25. Fleiss JL, Levin B, Paik MC. Statistical methods for rates and proportions. John Wiley & Sons (2013).

26. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Int J Nurs Stud (2010) 47(8):931–6. doi: 10.1016/S0140-6736(86)90837-8

27. Pattanaphesaj J, Thavorncharoensap M. Measurement properties of the EQ-5D-5L compared to EQ-5D-3L in the Thai diabetes patients. Health Qual Life Outcomes. (2015) 13(1):1–8. doi: 10.1186/s12955-014-0203-3

28. Nahvijou A, Safari H, Ameri H. Comparing the performance of the EQ-5D-5L with two versions of the SF-6Dv2 in patients with breast cancer. Health Serv Outcomes Res Methodology. (2020) 20(2):183–94. Available at: https://link.springer.com/article/10.1007/s10742-020-00215-7

29. Yousefi M, Safari H, Akbari Sari A, Raei B, Ameri H. Assessing the performance of direct and indirect utility eliciting methods in patients with colorectal cancer: EQ-5D-5L versus c-TTO. Health Serv Outcomes Res Methodology. (2019) 19(4):259–70. Available at: https://link.springer.com/article/10.1007/s10742-019-00204-5

30. Kangwanrattanakul K, Parmontree P. Psychometric properties comparison between EQ-5D-5L and EQ-5D-3L in the general Thai population. Qual Life Res (2020) 29(12):3407–17. doi: 10.1007/s11136-020-02595-2

31. Janssen M, Pickard AS, Golicki D, Gudex C, Niewada M, Scalone L, et al. Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L across eight patient groups: a multi-country study. Qual Life Res (2013) 22(7):1717–27. doi: 10.1007/s11136-012-0322-4

32. Kim S-H, Jo M-W, Lee J-W, Lee H-J, Kim JK. Validity and reliability of EQ-5D-3L for breast cancer patients in Korea. Health Qual Life Outcomes. (2015) 13(1):203. doi: 10.1186/s12955-015-0399-x

33. Zare F, Ameri H, Madadizadeh F, Aghaei MR. Validity and reliability of the EQ-5D-3L (a generic preference-based instrument used for calculating quality-adjusted life-years) for patients with type 2 diabetes in Iran. Diabetes Metab Syndrome: Clin Res Rev (2021) 15(1):319–24. doi: 10.1016/j.dsx.2021.01.009

34. Nahvijou A, Safari H, Ameri H. Psychometric properties of the SF-6Dv2 in an Iranian breast cancer population. Breast Cancer. (2021) 28(4):937–43. doi: 10.1007/s12282-021-01230-3

35. Pickard AS, Wilke CT, Lin H-W, Lloyd A. Health utilities using the EQ-5D in studies of cancer. Pharmacoeconomics. (2007) 25(5):365–84. doi: 10.2165/00019053-200725050-00002

36. Yu H, Zeng X, Sui M, Liu R, Tan RL-Y, Yang J, et al. A head-to-head comparison of measurement properties of the EQ-5D-3L and EQ-5D-5L in acute myeloid leukemia patients. Qual Life Res (2021) 30(3):855–66. doi: 10.1007/s11136-020-02644-w

37. Kim TH, Jo M-W, Lee S-i, Kim SH, Chung SM. Psychometric properties of the EQ-5D-5L in the general population of South Korea. Qual Life Res (2013) 22(8):2245–53. doi: 10.1007/s11136-012-0331-3

Keywords: EQ-5D-5L, EQ-5D-3L, psychometric properties, cancer patients, Iran

Citation: Moradi N, Poder TG, Safari H, Mojahedian MM and Ameri H (2022) Psychometric properties of the EQ-5D-5L compared with EQ-5D-3L in cancer patients in Iran. Front. Oncol. 12:1052155. doi: 10.3389/fonc.2022.1052155

Received: 18 October 2022; Accepted: 22 November 2022;
Published: 09 December 2022.

Edited by:

Alireza Sadjadi, Tehran University of Medical Sciences, Iran

Reviewed by:

Hasan Yusefzadeh, Urmia University of Medical Sciences, Iran
Miguel Ángel Pérez-Sousa, Universidad de Córdoba, Spain

Copyright © 2022 Moradi, Poder, Safari, Mojahedian and Ameri. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hosein Ameri, hamery7@yahoo.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.