Development and psychometric testing of a pediatric chronic graft-versus-host disease symptom scale: protocol for a two-phase, mixed methods study

Mitchell, Sandra A.; Hunter, Rachael; Fry, Abigail; Pavletic, Steven Z.; Widemann, Brigitte C.; Wiener, Lori

doi:10.3389/fpsyg.2023.1243005

STUDY PROTOCOL article

Front. Psychol., 08 January 2024

Sec. Psycho-Oncology

Volume 14 - 2023 | https://doi.org/10.3389/fpsyg.2023.1243005

This article is part of the Research TopicHighlights in Psycho-Oncology: Study ProtocolsView all 8 articles

Development and psychometric testing of a pediatric chronic graft-versus-host disease symptom scale: protocol for a two-phase, mixed methods study

Sandra A. Mitchell¹^*^†

Rachael Hunter²

Abigail Fry²

Steven Z. Pavletic³

Brigitte C. Widemann²

Lori Wiener²^†

¹Outcomes Research Branch, Healthcare Delivery Research Program, National Cancer Institute, Bethesda, MD, United States
²Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
³Immune Deficiency Cellular Therapy Program, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States

Background: Chronic graft-versus-host disease (cGVHD) is a debilitating late complication of hematopoietic stem cell transplantation. It is often accompanied by extensive symptom burden. No validated cGVHD patient-reported outcome (PRO) measure exists to evaluate cGVHD symptom bother in children and adolescents younger than 18 years. This paper presents the study protocol for a multi-center, two-phase protocol to develop a psychometrically valid pediatric cGVHD Symptom Scale (PCSS) and a companion caregiver-proxy measure to capture the symptom burden experienced by children with cGVHD. In the first phase of the study, our aim is to evaluate the comprehension, clarity and ease of response of the PCSS through cognitive interviewing and to iteratively refine the measure to optimize content validity. In the second phase of the study, we will quantitatively examine the measurement properties of the PCSS in children and their caregiver-proxies.

Methods and analysis: Eligible participants are children/adolescents ages 5–17 with cGVHD who are receiving systemic immunosuppressive treatment or have recently tapered to discontinuation. In the first phase, we are enrolling 60 child and caregiver-proxy dyads in three child age strata (5–7, 8–12, and 13–17 years old). Semi-scripted cognitive debriefing interviews are conducted to assess comprehension, clarity, and ease of response of each PCSS item with the child alone, and then jointly with the caregiver-proxy to explore discordant ratings. In phase two, an age-stratified cohort of 120 child-caregiver dyads will be enrolled to evaluate test–retest reliability, construct validity, and responsiveness. Anchors for known-groups validity include the PedsQL module and clinical variables, including cGVHD clinician-rated severity scores. In participants ages 13–17, we will also compare responses on the PCSS with those from the Lee cGVHD Symptom Scale, to gauge the youngest age at which adolescent respondents can comprehend this adult measure.

Discussion: This study will yield a well-validated, counterpart measure to the Lee cGVHD Symptom Scale for use in children with cGVHD and their caregiver-proxies. This new patient-reported outcome measure can be integrated into clinical trials and care delivery for pediatric transplant survivors to improve the precision and accuracy with which their cGVHD symptom experience is captured.

Clinical trial registration: www.ClinicalTrials.gov, NCT04044365.

Introduction

Hematopoietic stem cell transplantation (HSCT) is a curative option for many children with cancer and certain genetic disorders. However, approximately 40% of stem cell transplant recipients experience chronic graft-versus-host disease (cGVHD), a late complication of treatment that causes burdensome symptoms, functional impairment and diminished quality of life (Agh et al., 2019).

Chronic GVHD can involve almost every organ system although it most commonly affects the skin, eyes, mouth, liver, intestines, lungs, and the musculoskeletal system (Haroun et al., 2023). Children living with cGVHD often face many years of a disabling and painful chronic illness compounded by the side effects of prolonged immunosuppression, particularly long-term corticosteroids, needed to treat cGVHD manifestations (Mitchell et al., 2021).

There has been a sustained effort through the NIH cGVHD Consensus Consortium to standardize the diagnosis, severity grading and criteria for defining therapeutic response for patients with cGVHD on clinical trials (Carpenter et al., 2015; Jagasia et al., 2015; Lee et al., 2015; Cuvelier et al., 2019; Kitko et al., 2021). However, since objective changes in cGVHD-related organ system manifestations may not fully reflect therapeutic response or subtle but clinically meaningful improvements, the NIH Consensus Panel has recommended evaluation of both the clinical signs of chronic GVHD, and also that the patient experience of cGVHD symptoms be captured by self-report using well-validated patient-reported outcomes (PRO; Lee et al., 2015, 2022). In addition, PROs can be used as a clinical tool to optimize symptom management, promote communication and shared decision-making with clinicians, and increase patient and family engagement in care, all ultimately leading to better clinical outcomes (Bele et al., 2020; Leahy and Steineck, 2020).

The Lee cGVHD Symptom Score (LSS) has been recognized as a core cGVHD outcome measure in the adult population (Lee et al., 2002, 2015; Merkel et al., 2016). However, the instrument has not been validated for use in children/adolescents younger than 18, and the phrasing of many of the symptom terms (such as limited joint movement and thickened skin) used in the LSS may not be well understood by younger children. There is currently no available pediatric PRO measure of cGVHD symptom bother.

To ensure the developmental appropriateness and strong measurement properties of pediatric PRO measures, the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) and the International Society for Quality of Life Research (ISOQOL) recommends the use of both qualitative and quantitative methods to develop, refine, and test new PRO measures for this patient population (Matza et al., 2013; Reeve et al., 2013). This two phase study builds upon our prior work conducting concept elicitation interviews to identify the concepts and specific phrasings that children and adolescents use to describe their cGVHD symptoms (Wiener et al., 2014).

Objectives and specific aims

The overall objective of this two-phase multi-center study is to develop a psychometrically valid pediatric cGVHD Symptom Scale (PCSS; including both a child and a caregiver-proxy version), as a counterpart to the LSS, to accurately and precisely capture cGVHD symptom bother in pediatric transplant survivors.

This objective will be addressed through four specific aims:

1. Evaluate the comprehensibility, clarity, acceptability, and ease of judgment of three age appropriate versions of the PCSS items and response choices with children across the developmental spectrum, and with their caregiver-proxies.

2. Evaluate the validity (concurrent validity, known-groups validity, and validity of the response scales), test–retest reliability, responsiveness, minimally important difference (MID), and the acceptability and time required to complete the PCSS (both pediatric and caregiver-proxy versions).

3. Determine the youngest age at which the PCSS can be comprehended.

4. Examine concordance between the PCSS and the LSS in order to establish the youngest age at which the LSS validly captures cGVHD symptom bother.

Guided by developmental science and expert recommendations for the development and testing of patient-reported outcome measures (Riley, 2004; Klassen et al., 2010; Mokkink et al., 2010; Patrick et al., 2011; Arbuckle and Abetz-Webb, 2013; Coyne et al., 2016; Tomlinson et al., 2016; Pinheiro et al., 2018; Terwee et al., 2018; Leahy and Steineck, 2020; Coombes et al., 2021, 2023; Rothmund et al., 2023), this multi-site study will achieve its aims in two phases. An overview of the two phases of the study is provided in Table 1.

TABLE 1

Table 1. Overview of the two phases of the mixed-methods study.

Methods and analyses

Phase 1: establish content validity of the candidate items through cognitive interviewing

Design, sample, and setting

A 10-domain structure (Figure 1) and an initial pool of 50 items were constructed based on our prior concept elicitation study (Wiener et al., 2014). The PCSS (both child and caregiver-proxy versions) evaluates how much the child has been bothered by each symptom in the past 1 month. The 1 month recall period and the focus on the concept of symptom bother align the PCSS with the LSS, the measure used with respondents ages 18 and older. Symptom bother incorporates symptom frequency, severity, and interference with usual activities (Gawlicki et al., 2014). Responses are provided on a three-point scale incorporating a pictogram (for respondent’s ages 5–7 years) or a five-point Likert scale (for children ages 8–17 and caregiver-proxy respondents).

FIGURE 1

Figure 1. Domain framework.

Participants are recruited at one of 13 pediatric transplant centers in the United States and Canada. We will enroll three cohorts reflecting distinct developmental age bands [children ages 5–7 and their caregiver-proxies (n = 20 dyads); children ages 8–12 and their caregiver-proxies (n = 20 dyads); and children ages 13–17 and their caregiver-proxies (n = 20 dyads); Arbuckle and Abetz-Webb, 2013; Matza et al., 2013]. Where possible, we will attempt to balance the gender ratios within each age band. Every effort will be made to enroll participants who are currently or have recently experienced diverse clinical manifestations of cGVHD. In support of that objective, all of the children participating in the study meet established criteria for a diagnosis of cGVHD (Jagasia et al., 2015) and have received cGVHD-directed systemic therapy in the past year.

Overview of data collection procedures

After parent/caregiver/guardian (hereafter termed “caregiver-proxy”) has consented to participate with their child in the study, and the child assent has been obtained, a mutually convenient time is established for the interview procedures to be completed via web-based videoconferencing (Fry et al., 2021). Approximately 48 h prior to the scheduled interview, the caregiver is sent a link to complete the caregiver-proxy PCSS electronically and provide basic demographic and developmental information about their child. Subsequently a semi-scripted digitally recorded cognitive interview is conducted remotely, with the child, and then jointly with child and caregiver-proxy, to explore the comprehension, clarity and ease of response of each of the items that comprise the PCSS.

The cognitive interview schedules are tailored to children and their caregiver-proxies stratified by age (5–7, 8–12, and 13–17 years of age) and are designed to:

• Determine the extent to which each of the three age groups comprehend the symptom terms (e.g., thickened skin, tightness, and loss of appetite) and response choices of the PCSS.

• Evaluate and refine the PCSS to be clear, easy to judge, and relevant for capturing cGVHD symptom bother.

• Determine whether only caregiver-proxy report should be recommended for children in the 5–7 age range.

Cognitive interviewing is a qualitative method designed to examine the question-response process, specifically the processes and considerations used by respondents as they form answers to survey questions, and to identify and resolve sources of error (Willis, 2005). Through the interviewing process, various types of question-response problems that would not normally be identified in a traditional survey interview, such as interpretive errors and recall accuracy, are surfaced (Miller et al., 2014). In the present study, the cognitive interviews address four critical questions: (1) What is the most appropriate wording for the respondent instructions for completing the instrument (e.g., is the target concept of symptom bother well-understood)? (2) Within each age group, do respondents interpret and comprehend the symptom terms (e.g., fatigue, cutaneous symptoms, digestive symptoms, and oral and ocular symptoms) in a comparable manner? (3) Is the five-point (or in the case of 5–7 year olds, the three points) response scale well comprehended by respondents and easy for them to use to provide their responses? (4) Are the verbal descriptors (e.g., sometimes, often) used for each of the response categories clear and easy for the respondent to judge/distinguish?

Cognitive interviews

During the interview, the child-participant is first presented each PCSS item on the computer screen. Child-participants have each PCSS item and the response choices read to them, in addition to seeing each question and the response choices on the screen. The child provides their verbal response to the item, using the three- or five-point scale. Participants ages 8–17 are also asked whether the item was difficult to understand and/or answer. The interviewer notes any verbal or behavioral indicators of comprehension difficulties (e.g., facial expressions indicating confusion, pausing, changing an answer, and looking to the caregiver for assistance). If the child-participant requests help or clarification of any item, the interviewer encourages completion of the item to the best of their ability based on the instructions and flags that item on the interview summary.

After completing the PCSS, items identified by the child as difficult to understand, confusing or unclear and items where there were behavioral indicators of comprehension difficulties (i.e., participant appeared to be hesitant, uncertain or puzzled in making their response) are retrospectively probed. This is a common strategy in cognitive interviewing because it minimizes participants’ ability to anticipate interviewers’ questions or to change their responses to later questions which may occur if probing is conducted during the initial completion of the PRO measure. To conserve participant burden, each interview schedule is assigned a specific subset of PCSS items for probing. These items may be those symptom terms that are expected to occur less frequently (e.g., cough, shortness of breath with exercise) and/or which are anticipated to present difficulties for respondents because they reflect abstract or complex constructs/domains (e.g., genitalia, body image, muscle/joint pain, and cramping). This approach also ensures that each PCSS item receives adequate attention in the cognitive interviewing irrespective of whether the child expresses difficulties with comprehension or clarity (Arbuckle and Abetz-Webb, 2013).

After the child has been debriefed on specific items, the child and their caregiver-proxy are invited to jointly explore any differences in their responses to items in the PCSS, in order to identify whether the differing reports might reflect child non-comprehension (Tomlinson et al., 2021). By comparing the congruence between symptom reports provided by child and caregiver-proxy and probing the source of that discrepancy (i.e., misunderstanding of the question phrasing versus differences in caregiver-child appraisal of the symptom experience), we will be able to describe variation in agreement by symptom, and to explore qualitatively the basis for any divergence in ratings of symptom presence/absence provided by child-participant and caregiver-proxy. The child and caregiver-proxy will also be asked about items that should be included in the PCSS and how the existing items could be improved.

To ensure consistency in interviewing methods, interviews are centralized at the coordinating center and are performed by two trained researchers experienced with conducting pediatric qualitative interviews in medical settings. The interview and other research procedures take approximately 45–60 min. All interviews are recorded. In the case of study participants who do not have access to a computer at their home, interviews are conducted in their healthcare setting during a routine clinic appointment. Alternatively, we also offer all study measures on paper, and participants can complete interviews via telephone. Enrollment and data collection procedures are summarized in Figure 2. Table 2 describes the purpose and examples of semi-scripted interview questions.

FIGURE 2

Figure 2. Enrollment and qualitative data collection procedures (Phase 1).

TABLE 2

Table 2. Purpose and examples of semi-scripted cognitive interview questions and probes.

Data analysis

Following the completion of each interview, the two interviewers summarize the results for that interview. The summaries include the following information: (1) Overall themes and behavioral observations; (2) General difficulties in completing the PCSS, along with detailed descriptions of each difficulty; (3) Difficulties in answering specific questions, along with detailed descriptions of those difficulties; and (4) Responses to specific interview topics of interest [comprehension and clarity of each item, interpretation of the concept of bother, relevance of the response scale(s) and their ease of use, and understanding of the recall period]. The interviewer also notes caregiver-proxy or child recommendations for phrasing adjustments to the PCSS to improve comprehension or clarity, and documents elements of interview quality (e.g., length of the interview, any technical difficulties, child and caregiver engagement in the interview process, and relevant developmental, clinical or environmental contextual factors). The Interviewer Case Report Form (CRF; see Supplementary Table 1) was developed to code and summarize each interview. The CRF includes a place to enter the child and caregiver-proxy responses to PCSS items, capture self-reported and behavioral indicators of comprehension difficulties, summarize item-by-item comprehension as determined jointly by the two interviewers, and document the child’s verbatim statements rephrasing the item or describing its meaning in their own words that indicate comprehension or non-comprehension. While the digital recordings are not transcribed, interviewers review the recording after the interview is concluded to confirm coding or improve the capture of verbatim statements reflecting comprehension or misunderstanding.

Data are analyzed after every five interviews, and cumulative item-by-item determinations of both comprehension and saturation, stratified by age band, are documented in a coverage and saturation matrix (Kerr et al., 2010; Tomlinson et al., 2016; Meadows, 2021). Each transcript is mapped back to each PCSS item and a cumulative summary of the findings for each item and next steps (i.e., revise and retest, comparatively test a revision, continue testing, saturation achieved, and eliminate item from item pool) is constructed for each item and organized in Microsoft Excel. This approach also ensures that all PCSS items receive a comparable extent of probing across interviews within each age band and identifies the timepoint at which no new information about an item is being generated by the interviews.

Each PCSS item is debriefed with at least five children within each age band. Cognitive interviews are normally based on intensive interviewing in relatively small samples (Willis et al., 2005; Willis, 2005, 2015). Approximately 5–8 participants per item and per age group is a general rule of thumb to achieve saturation with cognitive interviews (Kerr et al., 2010; Hennink et al., 2017) since the type of logical and structural problems that we wish to identify are relatively independent of sample size (Miller et al., 2014). Symptom terms eliciting confusion or demonstrating misinterpretation by three or more respondents within an age band are considered for revision. Interview scripts are slightly refined to accommodate these changes to PCSS items.

When an item has been revised, it remains in the item pool for debriefing until it has been determined to be acceptably comprehended. Criteria or thresholds for concluding that a PCSS item is acceptably comprehended include resolution of any observed barriers to comprehension or clarity, achievement of an item-level comprehension rate (as demonstrated by the child correctly rephrasing the item in their own words) of 80% within each age band (Miller et al., 2014). An a priori stopping rule for concluding that comprehension of the PCSS by 5–7 year old cannot be achieved was established as the point at which 30% or more of the PCSS items are misunderstood and unable to be correctly rephrased. Study reporting will be guided by the Standards for Reporting Qualitative Research (SRQR) checklist (O'Brien et al., 2014) and the Consolidated Criteria for Reporting Qualitative Research (COREQ; Tong et al., 2007; Buus and Perron, 2020). SPSS and Excel are used to support data management and analysis.

Phase 2: evaluate the measurement properties of the PCSS

Design and sample

In the second phase of this study, a cohort of 120 child and caregiver-proxy dyads, stratified by child age, will be recruited at 13 centers in the United States and Canada. Eligibility criteria are the same as those for the qualitative phase (children ages 5–17, with a diagnosis of cGVHD and currently receiving systemic immunosuppression or have had their immunosuppression tapered to discontinuation within the past 12 months). Participants in phase 1 of the study will also be approached, evaluated for eligibility, and if willing to participate, will be consented and enrolled.

Data collection procedures

After enrollment, the child participant and their caregiver-proxy will be provided with their own unique secure electronic link to independently complete the study measures [PCSS, PedsQL general (Varni et al., 2007c) and transplant modules (Lawitschka et al., 2014), and LSS; corresponding to their role (child or caregiver-proxy) and age band (5–7; 8–12, and 13–17 years); see Table 3 for a summary of the assessment schedule]. To examine test-re-test reliability, both the child and their caregiver-proxy will complete a second PCSS within 24–48 h of their initial completion of the measures (T2). To explore the responsiveness of the PCSS, study participants will complete the PCSS and PedsQL (general and transplant module) approximately 3 months (T3) after enrollment. They will also be asked to indicate whether their cGVHD symptoms have worsened, are unchanged, or have improved. Caregiver-proxy participants may assist their younger child (ages 5–7 and 8–12 as warranted) in completing this global assessment of change in cGVHD symptoms. Respondents ages 13–17, and their caregiver-proxies will also complete the LSS at all three timepoints. The measures are described in more detail in Supplementary Table 2.

TABLE 3

Table 3. Summary of patient-reported outcomes by timing of assessment and age group (Phase 2).

Child participants and their caregiver-proxies will receive phone and/or e-mail reminders to log into their unique survey interface to complete required study measures. To minimize missing data, completeness will be monitored prospectively by the study team. Back-up data collection procedures will be implemented for participants who do not complete their scheduled measures.

Statistical analysis

The primary objective of the second phase is to quantitatively evaluate the test–retest reliability, construct validity, responsiveness, and the acceptability and time required to complete the PCSS (in both pediatric and caregiver-proxy versions). STATA software will be used to support data management and analysis.

Reliability

For test–retest reliability, we will examine the level of agreement between total PCSS score at T1 and T2 using the Intraclass Correlation Coefficient (ICC) in all respondents, within 3–5 days of their initial completion. We will evaluate at T2 if by child and caregiver-proxy report the child remains in stable health and with stable symptoms of cGVHD (Qin et al., 2019). The ICC will be computed based on a two-way, mixed effects analysis of variance model (Shrout and Fleiss, 1979; Liljequist et al., 2019). ICC ≥ 0.70 will be interpreted as demonstrating adequate test–retest reliability.

Construct validity

For concurrent validity, we will examine the associations using Pearson correlation coefficients between the PCSS and the total and subscale scores of the PedsQL and PedsQL-SCTM. We will also examine associations between subscales of the PCSS and conceptually relevant items and domains of the PedsQL and the PedsQL-SCTM. We have pre-specified 25 comparisons between PCSS and the PedsQL, and PedsQL SCTM. To control for inflated Type 1 error, we will adjust for multiple comparisons using the Benjamini-Hochberg step-up method. The anticipated pooled sample size of 120 pediatric participants across the designated age cohorts (ages 5–7, 8–12, and 13–17 years) provides adequate precision for two-sided (1–0.05/25) % confidence intervals for Pearson correlation coefficients. With a sample size of 120 pediatric participants, the two-sided 99% confidence interval around a Pearson correlation of 0.8 is 0.70–0.87. Confidence interval calculations are based on a z transformation.

To evaluate known-groups validity, we will use independent samples t-tests (for two group comparisons), ANOVA (for three or more group comparisons), or Jonkheere-Terpstra tests for differences (for monotonically increasing comparison groups such as those created based on cGVHD severity scores). In evaluating known-groups validity, we will determine if mean item or mean subscale scores differ across relevant indicators of the respondent’s health status, quality of life, or clinical anchors such as NIH cGVHD severity score, NIH cGVHD organ-specific severity scores, and intensity of immunosuppression. Chi-squared tests will also be used to evaluate if the distribution of scores (range 0–4) differs between the known groups. Analysis for a clinical anchor will not be undertaken if it is observed prevalence is <20 or > 80%.

To evaluate the construct validity of the response scales, we will confirm the monotonicity of the PCSS response choices by comparing mean PedsQL total scores across subgroups with worsening PCSS item level scores using Jonckheere-Terpstra tests for differences. We will also plot mean PedsQL total scores across the response choices for each item in the PCSS. We will summarize the proportion of PCSS items where we observed monotonically decreasing mean PedsQL scores across response choices that indicate more symptom severity (higher PedsQL scores reflect better quality of life, and thus as symptom severity increases, it is expected that PedsQL scores will decline).

Given that cGVHD requiring prolonged systemic immunosuppression has an overall incidence of <20% in pediatric transplant survivors (range 5–65% incidence), we anticipate that it will prove challenging to accrue a sufficiently large sample of children to permit stratified psychometric analyses within each age cohort. Accordingly, we plan to pool data across age cohorts for the primary analysis. Sensitivity analyses will be conducted to compare results of pooled analyses with those obtained in the age-specific subgroups, thus identifying age-related trends. As described above, for the primary analysis only, p values will be adjusted for multiple comparisons using Benjamini-Hochberg step-up method (Keselman et al., 2002).

Prior to the primary analysis, histograms, descriptive statistics (mean, standard deviation, median, range), box plots, and or normal Q-Q plots will be reviewed for each item to identify drastic departures from normality. Data transformations and/or non-parametric methods (e.g., Wilcoxon rank sum tests and Spearman correlations) will be employed throughout should data appear highly non-normal.

We will also explore the congruence between symptom ratings provided by children and their caregiver-proxies, and examine the subscale structure using confirmatory factor analysis in children (pooling data across age cohorts to achieve a sufficiently large sample for the factor analysis). To evaluate child and caregiver-proxy congruence between symptom ratings, we will use weighted kappa with 95% confidence intervals to estimate the item-level agreement between the child’s severity rating and their caregiver-proxy’s severity rating for each symptom. Sensitivity analyses will compare the results of these pooled analyses with those derived from analyses within age cohorts, thus identifying age-related trends. Confirmatory factor analysis (CFA) will be used to explore the factor structure of the PCSS, based on the 10 domain constructs (skin, eyes, mouth, eating/digestion, lungs and breathing, muscles and joints, urogenital, energy, body image, and mood) hypothesized to underlie the PCSS. Fit indices [specifically root-mean-squared error of approximation (RMSEA), comparative fit index (CFI)] will be used to test the adequacy of the fit of the hypothesized 10-factor model to the observed data. Good model fit is indicated by RMSEA values <0.09 and/or CFI values greater than 0.90. The project sample size of 120 child respondents is likely too small to provide definitive results using CFA, however results will be informative toward the interpretability of subscale scores and will provide power estimates for a future definitive study powered to provide a robust evaluation of the PCSS factor structure.

To inform the comparability of the PCSS and the LSS which is currently used with persons aged 16 years or older, respondents between the ages of 13 and 17 will complete both measures. We will use weighted kappa to estimate agreement between PCSS responses and responses to corresponding items from the LSS, in children between the ages of 13 and 17 and their caregiver-proxies. While this does not substitute for a full psychometric test of the equivalence of both versions, results will provide initial empiric estimates of the lowest age at which younger adolescents can meaningfully complete the LSS, and preliminary information about the comparability of PCSS and LSS values which may be useful toward justifying pooled analyses or interpreting outcomes specifically in trials of new cGVHD therapies when both the PCSS and the LSS are used. In subgroup analysis of respondents ages 13–17, we will examine the congruence between their symptom endorsement rates and their symptom severity ratings on the items of the PCSS and corresponding items in the LSS. This will inform our knowledge of the comparability of both measures in younger and older adolescents and derive preliminary estimates of the lowest age at which young adolescents can meaningfully complete the LSS.

Responsiveness to change

To explore the responsiveness of the PCSS, study participants will be categorized into three groups based on responses to a question about their global impression of symptom change from baseline to 3 months (Coon and Cook, 2018). Within each group (no change, worsened, and improved), the total PCSS score will be compared used a paired t-test. The standardized response mean will be computed as the mean change scored divided by the standard deviation of the change scores within each change category (worsening vs. no change vs. improvement). Values greater than 0.8 will be considered large and values between 0.5 and 0.8 will be considered moderate. Subsequently the change in total score will be investigated using analysis of covariance in which the T1 score is a covariate and change in symptom severity (worsening vs. no change vs. improvement) is a factor in modeling the T2 rating. Lastly, change scores will be modeled using a generalized linear regression model. cGVHD severity scores and change in PedsQL physical function scores will be incorporated into this model in confirmatory analysis. These various approaches to examining responsiveness to change will be triangulated to preliminarily propose clinically meaningful thresholds for change at the individual and group level (Trigg and Griffiths, 2021).

Ethical, equity, and regulatory considerations

Children of both genders and from all racial and ethnic groups are eligible for this study if they meet the eligibility criteria outlined in the study protocol. Children under the age of 5 are excluded because these children are unlikely to be able to understand and report on the concepts of symptom intensity and interference (Tomlinson et al., 2021), especially for symptoms such as muscle cramping, skin thickening, itching, shortness of breath, muscle/joint pain, and body image changes all of which are common in cGVHD. Non-English speaking participants are not included because at this initial stage of measure validation, it is important to first create a reliable and valid measure in English before it is put through the rigorous translation and linguistic validation process used to create versions of the measure in other languages. A finalized English language measure will not be available until after Phase 2 is completed; as such, this work provides the foundation for future studies to develop and test the PCSS for children and caregiver-proxies who speak languages other than English. Several design features of this multi-site study including representation from different geographic regions of United States and Canada and the use of remote interviewing techniques support our efforts to recruit a diverse population with respect to gender, race and ethnicity, socioeconomic status, and severity of cGVHD (Upadhyay and Lipkovich, 2020).

Parent/caregiver consent and child participant assent is obtained from each family. Where deemed appropriate by the consenting site investigator and the child’s parent(s) or guardian, the child will also be included in all discussions about the trial, and age-appropriate language will be used to describe the procedures and tests involved in this study, along with the risks, discomforts, and benefits of participation. The assent process will take place in conjunction with consent; therefore, in person and remote assent are permitted under the same circumstances as in person and remote consent. Children under 18 will provide assent for their participation in accordance with their institution’s local requirements and the assent process will be documented per local procedures. A waiver of the documentation of informed consent for caregivers of participants who will be completing questionnaires about their children and participating in cognitive interviews was granted by the IRB at the coordinating center.

Throughout the study, participants may be asked to discuss issues that are sensitive such as body image changes, and mood, and could experience distress as a result. To address this concern, study team members explain during the consent process that should this occur, an option to follow-up with a clinical team member will be provided.

Participant information is coded with an ID number. Project data will be stored in a password protected secure computing environment to ensure that only approved members of the research team will have access to these data.

Discussion

There is currently no available pediatric PRO measure of cGVHD symptom bother. Building upon our prior concept elicitation work (Wiener et al., 2014), this multi-center two-phase study uses quantitative and qualitative methods to develop, refine, and establish the measurement properties of a new pediatric measure of cGVHD symptom bother for children and their caregiver-proxies, the Pediatric cGVHD Symptom Scale or PCSS. In phase 1, we will use cognitive interviewing to examine and refine the comprehensibility, clarity and ease of response of the PCSS. In phase 2, we will use classical and modern measurement theory to examine the test–retest reliability, construct validity, and responsiveness to change of this new measure. Knowledge derived from this study will contribute to refinement of the item phrasing across diverse age groups, provide evidence that the instrument has acceptable preliminary measurement properties to support its use in clinical trials and clinical practice, and offer preliminary estimates of clinically important change that improve its interpretability as an outcome measure.

Several design features contribute to the strengths of this study including sample recruitment at 13 geographically dispersed transplant centers in the United States and Canada, which permits enrollment of a diverse sample with respect to age bands, cGVHD manifestations, race/ethnicity, and other sociodemographic factors. Inclusion of respondents currently receiving systemic treatment for cGVHD and those who have recently tapered systemic immunosuppression to discontinuation will mitigate the possibility that the reliability and validity estimates are inflated by the absence of symptoms. Our study design also includes several different objective measures of cGVHD manifestations and their clinical severity to strengthen interpretations of known groups validity, clinical stability (for test–retest reliability), and clinically significant change over time (Paiva et al., 2014). The study adheres to international consensus standards for the development, refinement, and validation of pediatric PRO measures (Patrick et al., 2011; Arbuckle and Abetz-Webb, 2013; Matza et al., 2013; Reeve et al., 2013).

The study has some potential caveats that will need to be considered in interpreting the findings and that should be addressed in future research. First, while the sample sizes generally accord with recommended guidelines for cognitive interviewing studies to establish content validity (Willis, 2005; DeWalt et al., 2007; Patrick et al., 2011; Miller et al., 2014), recruitment of a larger sample of children with cGVHD, which is a relatively rare complication, would increase rigor yet would also be logistically challenging to achieve. In addition, while dividing the sample into narrow age bands is consistent with expert recommendations for development of a pediatric PRO and helps to assure developmentally appropriate item phrasing (Arbuckle and Abetz-Webb, 2013), it could also be the case that important findings within any one age group might be missed due to sparse data. At the same time, in both phases of our study, the interpretability of our findings is strengthened by the fact that study participants’ clinical characteristics are similar to those of the children who will complete this symptom scale in the future. Further, the sample population for this two-phase study may not ultimately be representative of the entire cGVHD patient population since only English-speaking respondents are included. We also excluded children with significant cognitive disabilities as we wanted to distinguish between children who did not understand a PCSS item due to their disability vs. a poorly worded item. As such, future studies may need to enhance recruitment of children with disease- and/or treatment-related cognitive deficits, and children and caregiver-proxies with lower English proficiency. Instrument validation is an iterative process that includes both quantitative and qualitative methods to evaluate and strengthen the measurement properties of an instrument. Continued research to confirm and extend the insights derived from this study will be required. Despite these limitations, the results that are anticipated after conducting this two phase study are expected to be useful in helping children to self-report bothersome cGVHD symptoms that can negatively affect their daily life.

Conclusion

This study constitutes an important first step toward our long-term research goal of making available a psychometrically sound pediatric PRO measure of cGVHD symptoms. This new measure will serve as a companion to the Lee cGVHD Symptom Scale used with individuals older than age 18 that can be integrated into clinical trials and care delivery for pediatric allogeneic hematopoietic stem cell transplant survivors.

Ethics statement

All study participants provided consent to participate in this study.

Author contributions

SM and LW: study conception and design. SM, LW, RH, AF, and SP: participant recruitment and data collection. SM, LW, RH, AF, SP, and BW: analysis and interpretation of data and drafting of the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research was funded by the NCI Cancer Moonshot. This work was also supported by the Intramural Research Program of the National Cancer Institute, National Institutes of Health (SP, BW, and LW).

Acknowledgments

The authors gratefully acknowledge the staff at each of our 13 participating sites, and the children and families who are sharing their valuable time with us and providing such meaningful data and insights.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1243005/full#supplementary-material

Abbreviations

cGVHD, Chronic graft-versus-host disease; CFA, Confirmatory factor analysis; CFI, Comparative fit index; HSCT, Hematopoietic stem cell transplantation; ICC, Intraclass correlation coefficient; ISOQOL, International Society for Quality of Life Research; ISPOR, International Society for Pharmacoeconomic and Outcomes Research; IQR, Interquartile range; LSS, Lee chronic graft-versus-host disease symptom scale; MID, Minimally important difference; NCI, National Cancer Institute; PCSS, Pediatric chronic graft-versus-host disease symptom scale; PedsQL, Pediatric quality of life inventory™ PedsQL-SCTM, Pediatric quality of life inventory™-stem cell transplant module; PRO, Patient-reported outcome; RMSEA, Root-mean-squared error of approximation; SD, Standard deviation; US, United States.

References

Agh, T., Csanadi, M., Voko, Z., Webb, T., Jeyakumaran, D., Trudeau, J., et al. (2019). Humanistic burden of patients with chronic graft-versus-host disease—systematic literature review of health-related quality of life and functional status. Expert. Rev. Hematol. 12, 295–309. doi: 10.1080/17474086.2019.1602036

PubMed Abstract | CrossRef Full Text | Google Scholar

Arbuckle, R., and Abetz-Webb, L. (2013). "not just little adults": qualitative methods to support the development of pediatric patient-reported outcomes. Patient 6, 143–159. doi: 10.1007/s40271-013-0022-3

CrossRef Full Text | Google Scholar

Bele, S., Chugh, A., Mohamed, B., Teela, L., Haverman, L., and Santana, M. J. (2020). Patient-reported outcome measures in routine pediatric clinical care: a systematic review. Front. Pediatr. 8:364. doi: 10.3389/fped.2020.00364

PubMed Abstract | CrossRef Full Text | Google Scholar

Buus, N., and Perron, A. (2020). The quality of quality criteria: replicating the development of the consolidated criteria for reporting qualitative research (COREQ). Int. J. Nurs. Stud. 102:103452. doi: 10.1016/j.ijnurstu.2019.103452

PubMed Abstract | CrossRef Full Text | Google Scholar

Carpenter, P. A., Kitko, C. L., Elad, S., Flowers, M. E., Gea-Banacloche, J. C., Halter, J. P., et al. (2015). National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: V. The 2014 ancillary therapy and supportive care working group report. Biol. Blood Marrow Transplant. 21, 1167–1187. doi: 10.1016/j.bbmt.2015.03.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Coombes, L., Bristowe, K., Ellis-Smith, C., Aworinde, J., Fraser, L. K., Downing, J., et al. (2021). Enhancing validity, reliability and participation in self-reported health outcome measurement for children and young people: a systematic review of recall period, response scale format, and administration modality. Qual. Life Res. 30, 1803–1832. doi: 10.1007/s11136-021-02814-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Coombes, L., Harðardóttir, D., Braybrook, D., Roach, A., Scott, H., Bristowe, K., et al. (2023). Design and Administration of Patient-Centred Outcome Measures: the perspectives of children and young people with life-limiting or life-threatening conditions and their family members. Patient 16, 473–483. doi: 10.1007/s40271-023-00627-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Coon, C. D., and Cook, K. F. (2018). Moving from significance to real-world meaning: methods for interpreting change in clinical outcome assessment scores. Qual. Life Res. 27, 33–40. doi: 10.1007/s11136-017-1616-3

CrossRef Full Text | Google Scholar

Coyne, I., Amory, A., Gibson, F., and Kiernan, G. (2016). Information-sharing between healthcare professionals, parents and children with cancer: more than a matter of information exchange. Eur. J. Cancer Care 25, 141–156. doi: 10.1111/ecc.12411

PubMed Abstract | CrossRef Full Text | Google Scholar

Cuvelier, G. D. E., Nemecek, E. R., Wahlstrom, J. T., Kitko, C. L., Lewis, V. A., Schechter, T., et al. (2019). Benefits and challenges with diagnosing chronic and late acute GVHD in children using the NIH consensus criteria. Blood 134, 304–316. doi: 10.1182/blood.2019000216

PubMed Abstract | CrossRef Full Text | Google Scholar

DeWalt, D. A., Rothrock, N., Yount, S., and Stone, A. A. (2007). Evaluation of item candidates: the PROMIS qualitative item review. Med. Care 45, S12–S21. doi: 10.1097/01.mlr.0000254567.79743.e200005650-200705001-00003[pii]

PubMed Abstract | CrossRef Full Text | Google Scholar

Fry, A., Mitchell, S. A., and Wiener, L. (2021). Considerations for conducting and reporting digitally supported cognitive interviews with children and adults. J. Patient Rep. Outcomes 5:131. doi: 10.1186/s41687-021-00371-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Gawlicki, M. C., McKown, S. M., Talbert, M. J., and Brandt, B. A. (2014). Application of bother in patient reported outcomes instruments across cultures. Health Qual. Life Outcomes 12:18. doi: 10.1186/1477-7525-12-18

PubMed Abstract | CrossRef Full Text | Google Scholar

Haroun, E., Agrawal, K., Leibovitch, J., Kassab, J., Zoghbi, M., Dutta, D., et al. (2023). Chronic graft-versus-host disease in pediatric patients: differences and challenges. Blood Rev. 60:101054. doi: 10.1016/j.blre.2023.101054

PubMed Abstract | CrossRef Full Text | Google Scholar

Hennink, M. M., Kaiser, B. N., and Marconi, V. C. (2017). Code saturation versus meaning saturation: how many interviews are enough? Qual. Health Res. 27, 591–608. doi: 10.1177/1049732316665344

PubMed Abstract | CrossRef Full Text | Google Scholar

Jagasia, M. H., Greinix, H. T., Arora, M., Williams, K. M., Wolff, D., Cowen, E. W., et al. (2015). National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: I. The 2014 diagnosis and staging working group report. Biol. Blood Marrow Transplant. 21, 389–401. doi: 10.1016/j.bbmt.2014.12.001

CrossRef Full Text | Google Scholar

Kerr, C., Nixon, A., and Wild, D. (2010). Assessing and demonstrating data saturation in qualitative inquiry supporting patient-reported outcomes research. Expert Rev. Pharmacoecon. Outcomes Res. 10, 269–281. doi: 10.1586/erp.10.30

PubMed Abstract | CrossRef Full Text | Google Scholar

Keselman, H. J., Cribbie, R., and Holland, B. (2002). Controlling the rate of type I error over a large set of statistical tests. Br. J. Math. Stat. Psychol. 55, 27–39. doi: 10.1348/000711002159680

CrossRef Full Text | Google Scholar

Kitko, C. L., Pidala, J., Schoemans, H. M., Lawitschka, A., Flowers, M. E., Cowen, E. W., et al. (2021). National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: IIa. The 2020 clinical implementation and early diagnosis working group report. Transplant Cell Ther. 27, 545–557. doi: 10.1016/j.jtct.2021.03.033

PubMed Abstract | CrossRef Full Text | Google Scholar

Klassen, A. F., Strohm, S. J., Maurice-Stam, H., and Grootenhuis, M. A. (2010). Quality of life questionnaires for children with cancer and childhood cancer survivors: a review of the development of available measures. Support Care Cancer 18, 1207–1217. doi: 10.1007/s00520-009-0751-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawitschka, A., Buehrer, S., Bauer, D., Peters, K., Silbernagl, M., Zubarovskaya, N., et al. (2020). A web-based Mobile app (INTERACCT app) for adolescents undergoing Cancer and hematopoietic stem cell transplantation aftercare to improve the quality of medical information for clinicians: observational study. JMIR Mhealth Uhealth 8:e18781. doi: 10.2196/18781

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawitschka, A., Guclu, E. D., Varni, J. W., Putz, M., Wolff, D., Pavletic, S., et al. (2014). Health-related quality of life in pediatric patients after allogeneic SCT: development of the PedsQL stem cell transplant module and results of a pilot study. Bone Marrow Transplant. 49, 1093–1097. doi: 10.1038/bmt.2014.96

PubMed Abstract | CrossRef Full Text | Google Scholar

Leahy, A. B., and Steineck, A. (2020). Patient-reported outcomes in pediatric oncology: the patient voice as a gold standard. JAMA Pediatr. 174:e202868. doi: 10.1001/jamapediatrics.2020.2868

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S., Cook, E. F., Soiffer, R., and Antin, J. H. (2002). Development and validation of a scale to measure symptoms of chronic graft-versus-host disease. Biol. Blood Marrow Transplant. 8, 444–452. doi: 10.1053/bbmt.2002.v8.pm12234170

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S. J., Cutler, C., Blazar, B. R., Tu, A., Yang, Z., and Pavletic, S. Z. (2022). Correlation of patient-reported outcomes with clinical organ responses: data from the Belumosudil chronic graft-versus-host disease studies. Transplant Cell Ther. 28, 700.e1–700.e6. doi: 10.1016/j.jtct.2022.06.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S. J., Onstad, L., Chow, E. J., Shaw, B. E., Jim, H. S. L., Syrjala, K. L., et al. (2018). Patient-reported outcomes and health status associated with chronic graft-versus-host disease. Haematologica 103, 1535–1541. doi: 10.3324/haematol.2018.192930

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S. J., Wolff, D., Kitko, C., Koreth, J., Inamoto, Y., Jagasia, M., et al. (2015). Measuring therapeutic response in chronic graft-versus-host disease. National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: IV. The 2014 response criteria working group report. Biol. Blood Marrow Transplant. 21, 984–999. doi: 10.1016/j.bbmt.2015.02.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Liljequist, D., Elfving, B., and Skavberg Roaldsen, K. (2019). Intraclass correlation—a discussion and demonstration of basic features. PLoS One 14:e0219854. doi: 10.1371/journal.pone.0219854

PubMed Abstract | CrossRef Full Text | Google Scholar

Matza, L. S., Patrick, D. L., Riley, A. W., Alexander, J. J., Rajmil, L., Pleil, A. M., et al. (2013). Pediatric patient-reported outcome instruments for research to support medical product labeling: report of the ISPOR PRO good research practices for the assessment of children and adolescents task force. Value Health 16, 461–479. doi: 10.1016/j.jval.2013.04.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Meadows, K. (2021). Cognitive interviewing methodologies. Clin. Nurs. Res. 30, 375–379. doi: 10.1177/10547738211014099

CrossRef Full Text | Google Scholar

Merkel, E. C., Mitchell, S. A., and Lee, S. J. (2016). Content validity of the Lee chronic graft-versus-host disease symptom scale as assessed by cognitive interviews. Biol. Blood Marrow Transplant. 22, 752–758. doi: 10.1016/j.bbmt.2015.12.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, K., Willson, S., Chepp, V., and Padilla, J. (eds.). (2014). Cognitive Interviewing Methodology. Hoboken, NJ: John Wiley & Sons

Google Scholar

Mitchell, S. A., Leidy, N. K., Mooney, K. H., Dudley, W. N., Beck, S. L., LaStayo, P. C., et al. (2010). Determinants of functional performance in long-term survivors of allogeneic hematopoietic stem cell transplantation with chronic graft-versus-host disease (cGVHD). Bone Marrow Transplant. 45, 762–769. doi: 10.1038/bmt.2009.238

PubMed Abstract | CrossRef Full Text | Google Scholar

Mitchell, S. A., Wiener, L., Hoag, J., Fry, A., and Bevans, M. F. (2021). “Health-related quality of life in adult and pediatric survivors” in Blood and Marrow Transplantation Long Term Management. eds. B. Savani and A. Tichelli. 2nd ed (Hoboken, NJ: Wiley-Blackwell), 355–380.

Google Scholar

Mokkink, L. B., Terwee, C. B., Patrick, D. L., Alonso, J., Stratford, P. W., Knol, D. L., et al. (2010). The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J. Clin. Epidemiol. 63, 737–745. doi: 10.1016/j.jclinepi.2010.02.006

PubMed Abstract | CrossRef Full Text | Google Scholar

O'Brien, B. C., Harris, I. B., Beckman, T. J., Reed, D. A., and Cook, D. A. (2014). Standards for reporting qualitative research: a synthesis of recommendations. Acad. Med. 89, 1245–1251. doi: 10.1097/acm.0000000000000388

CrossRef Full Text | Google Scholar

Paiva, C. E., Barroso, E. M., Carneseca, E. C., de Pádua Souza, C., Dos Santos, F. T., Mendoza López, R. V., et al. (2014). A critical analysis of test-retest reliability in instrument validation studies of cancer patients under palliative care: a systematic review. BMC Med. Res. Methodol. 14:8. doi: 10.1186/1471-2288-14-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Patrick, D. L., Burke, L. B., Gwaltney, C. J., Leidy, N. K., Martin, M. L., Molsen, E., et al. (2011). Content validity--establishing and reporting the evidence in newly developed patient-reported outcomes (PRO) instruments for medical product evaluation: ISPOR PRO good research practices task force report: part 2--assessing respondent understanding. Value Health 14, 978–988. doi: 10.1016/j.jval.2011.06.013

CrossRef Full Text | Google Scholar

Pinheiro, L. C., McFatrich, M., Lucas, N., Walker, J. S., Withycombe, J. S., Hinds, P. S., et al. (2018). Child and adolescent self-report symptom measurement in pediatric oncology research: a systematic literature review. Qual. Life Res. 27, 291–319. doi: 10.1007/s11136-017-1692-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, S., Nelson, L., McLeod, L., Eremenco, S., and Coons, S. J. (2019). Assessing test-retest reliability of patient-reported outcome measures using intraclass correlation coefficients: recommendations for selecting and documenting the analytical formula. Qual. Life Res. 28, 1029–1033. doi: 10.1007/s11136-018-2076-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Reeve, B. B., Wyrwich, K. W., Wu, A. W., Velikova, G., Terwee, C. B., Snyder, C. F., et al. (2013). ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Qual. Life Res. 22, 1889–1905. doi: 10.1007/s11136-012-0344-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Riley, A. W. (2004). Evidence that school-age children can self-report on their health. Ambul. Pediatr. 4, 371–376. doi: 10.1367/a03-178r.1

CrossRef Full Text | Google Scholar

Rothmund, M., Meryk, A., Rumpold, G., Crazzolara, R., Sodergren, S., Darlington, A. S., et al. (2023). A critical evaluation of the content validity of patient-reported outcome measures assessing health-related quality of life in children with cancer: a systematic review. J. Patient Rep. Outcomes 7:2. doi: 10.1186/s41687-023-00540-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Shrout, P., and Fleiss, J. (1979). Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86, 420–428. doi: 10.1037//0033-2909.86.2.420

CrossRef Full Text | Google Scholar

Terwee, C. B., Prinsen, C. A. C., Chiarotto, A., Westerman, M. J., Patrick, D. L., Alonso, J., et al. (2018). COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual. Life Res. 27, 1159–1170. doi: 10.1007/s11136-018-1829-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomlinson, D., De Mol Van Otterloo, S., O'Sullivan, C., Gibson, P., Johnston, D. L., Portwine, C., et al. (2016). Methodological issues identified during cognitive interviews in the development of a pediatric cancer symptom screening tool. Psychooncology 25, 349–353. doi: 10.1002/pon.3821

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomlinson, D., Plenert, E., Dadzie, G., Loves, R., Cook, S., Schechter, T., et al. (2021). Reasons for disagreement between proxy-report and self-report rating of symptoms in children receiving cancer therapies. Support Care Cancer 29, 4165–4170. doi: 10.1007/s00520-020-05930-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Tong, A., Sainsbury, P., and Craig, J. (2007). Consolidated criteria for reporting qualitative research (COREQ): a 32-item checklist for interviews and focus groups. Int. J. Qual. Health Care 19, 349–357. doi: 10.1093/intqhc/mzm042

PubMed Abstract | CrossRef Full Text | Google Scholar

Trigg, A., and Griffiths, P. (2021). Triangulation of multiple meaningful change thresholds for patient-reported outcome scores. Qual. Life Res. 30, 2755–2764. doi: 10.1007/s11136-021-02957-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Upadhyay, U. D., and Lipkovich, H. (2020). Using online technologies to improve diversity and inclusion in cognitive interviews with young people. BMC Med. Res. Methodol. 20:159. doi: 10.1186/s12874-020-01024-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Varni, J. W., Burwinkle, T. M., Katz, E. R., Meeske, K., and Dickinson, P. (2002). The PedsQL (TM) in pediatric cancer—reliability and validity of the pediatric quality of life inventory (TM) generic Core scales, multidimensional fatigue scale, and Cancer module. Cancer 94, 2090–2106. doi: 10.1002/Cncr.10427

CrossRef Full Text | Google Scholar

Varni, J. W., Burwinkle, T. M., Seid, M., and Skarr, D. (2003). The PedsQL (TM) 4.0 as a pediatric population health measure: feasibility, reliability, and validity. Ambul. Pediatr. 3, 329–341. doi: 10.1367/1539-4409(2003)003<0329:Tpaapp>2.0.Co;2

CrossRef Full Text | Google Scholar

Varni, J. W., Katz, E. R., Seid, M., Quiggins, D. J. L., and Friedman-Bender, A. (1998). The pediatric cancer quality of life inventory-32 (PCQL-32) – I. Reliability and validity. Cancer 82, 1184–1196. doi: 10.1002/(Sici)1097-0142(19980315)82:6<1184::Aid-Cncr25>3.0.Co;2-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Varni, J. W., and Limbers, C. A. (2009). The PedsQL (TM) 4.0 generic Core scales young adult version feasibility, reliability and validity in a university student population. J. Health Psychol. 14, 611–622. doi: 10.1177/1359105309103580

PubMed Abstract | CrossRef Full Text | Google Scholar

Varni, J. W., Limbers, C., and Burwinkle, T. M. (2007a). Literature review: health-related quality of life measurement in pediatric oncology: hearing the voices of the children. J. Pediatr. Psychol. 32, 1151–1163. doi: 10.1093/jpepsy/jsm008

PubMed Abstract | CrossRef Full Text | Google Scholar

Varni, J. W., Limbers, C. A., and Burwinkle, T. M. (2007b). How young can children reliably and validly self-report their health-related quality of life?: an analysis of 8,591 children across age subgroups with the PedsQL 4.0 generic Core scales. Health Qual. Life Outcomes 5:1. doi: 10.1186/1477-7525-5-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Varni, J. W., Limbers, C. A., and Burwinkle, T. M. (2007c). Parent proxy-report of their children's health-related quality of life: an analysis of 13,878 parents' reliability and validity across age subgroups using the PedsQL (TM) 4.0 generic Core scales. Health Qual. Life Outcomes 5, 1–10. doi: 10.1186/1477-7525-5-2

CrossRef Full Text | Google Scholar

Wiener, L., Baird, K., Crum, C., Powers, K., Carpenter, P., Baker, K. S., et al. (2014). Child and parent perspectives of the chronic graft-versus-host disease (cGVHD) symptom experience: a concept elicitation study. Support Care Cancer 22, 295–305. doi: 10.1007/s00520-013-1957-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Willis, G. B. (2005). Cognitive Interviewing: A Tool for Improving Questionnaire Design. Thousand Oaks, CA: SAGE Publications, Inc

Google Scholar

Willis, G. B. (2015). The practice of cross-cultural cognitive interviewing. Public Opin. Q. 79, 359–395. doi: 10.1093/poq/nfu092

CrossRef Full Text | Google Scholar

Willis, G. B., Reeve, B. B., and Barofsky, I. (2005). “The use of cognitive interviewing techniques in quality of life and patient-reported outcomes measurement” in Outcomes Assessment in Cancer: Findings and Recommendations of the Cancer Outcomes Measurement Working Group. eds. J. Lipscomb, C. C. Gotay, and C. Snyder (Cambridge, UK: Cambridge University Press), 610–622.

Google Scholar

Keywords: chronic graft-versus-host disease, pediatrics, hematopoietic stem cell transplant, symptom scale, patient-reported outcomes, symptom bother, validation

Citation: Mitchell SA, Hunter R, Fry A, Pavletic SZ, Widemann BC and Wiener L (2024) Development and psychometric testing of a pediatric chronic graft-versus-host disease symptom scale: protocol for a two-phase, mixed methods study. Front. Psychol. 14:1243005. doi: 10.3389/fpsyg.2023.1243005

Received: 20 June 2023; Accepted: 23 November 2023;
Published: 08 January 2024.

Edited by:

Simon Dunne, Dublin City University, Ireland

Reviewed by:

Miki Nishitani, Dana–Farber Cancer Institute, United States
Stephanie Lee, Fred Hutchinson Cancer Center, United States
Qasir Abbas, Government College University, Faisalabad, Pakistan

Copyright © 2024 Mitchell, Hunter, Fry, Pavletic, Widemann and Wiener. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sandra A. Mitchell, c2FuZHJhLm1pdGNoZWxsQG5paC5nb3Y=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.