Skip to main content

ORIGINAL RESEARCH article

Front. Psychiatry, 08 May 2020
Sec. Aging Psychiatry

Is Barthel Index Suitable for Assessing Activities of Daily Living in Patients With Dementia?

  • 1Department of Nursing, Zengcheng Branch, Nanfang Hospital, Southern Medical University, Guangzhou, China
  • 2Department of Nursing, School of Nursing and Rehabilitation, Konan Women’s University, Kobe, Japan

Objectives: To evaluate application of the Barthel Index (BI) in assessing basic activities of daily living (ADL) of patients with dementia using Rasch analysis.

Design: A multi-country cross-sectional study.

Setting and Participants: Nineteen long-term care facilities located in China, Japan, South Korea, and Thailand. A total of 644 patients with dementia were included.

Methods: Unidimensionality, global and item fit, local dependence, person-item targeting, threshold disordering, and differential item functioning (DIF) were examined. Negative correlations between scores for DIF items and Neuropsychiatric Inventory Nursing Home version (NPI-NH) were evaluated.

Results: Item reliability (1.0) and person reliability (.88) were acceptable. The Rasch dimension explained 72.9% of the variance (Eigenvalue = 27), while the first contrast explained 6.6% (Eigenvalue = 2.4). The “mobility” was misfitting to the Rasch model (infit mean square = 1.86). The overall difficulty of the BI exceeded patients’ ability (person location = −2.27 logits). The “stairs climbing” and “mobility” showed narrow category thresholds (< 1.4 logits). The location of “controlling bladder” and “toilet use” overlapped. Removing “stairs climbing”, collapsing categories with narrow threshold widths in “mobility”, and combining “controlling bowel” and “controlling bladder” into one item, improved unidimensionality, and item fit of the scale. Only three items (“grooming”, “dressing”, and “toilet use”) were free from DIF across countries. The scores for “feeding” were negatively related to scores for “disinhibition” (r = −0.46, P < 0.01), and scores for “controlling bowel” were negatively related to scores for “disinhibition” (r = −0.44, P < 0.01), “agitation” (r = −0.32, P < 0.05), and “aggression” (r = −0.27, P < 0.01) in Japanese samples.

Conclusions and Implications: The performance of the BI for assessing patients with dementia might be compromised by misfit items, person–item mistargeting, measurement gaps, redundant items, narrow threshold width, and item bias. Mobility ability might not be helpful for determining capability of basic ADL in the patients. Comparisons of BI scores between countries should be undertaken with caution due to item bias. Neuropsychiatric symptoms might interact with basic ADL abilities of the patients. We will not suggest using the instrument in patients with dementia, without future refining to improve its performance.

Introduction

Progressive decline in the ability to perform activities of daily living (ADL) and subsequent loss of independence are the defining features and most severe characteristics of dementia (13). Assessment of ADL is important when measuring disease severity (4), evaluating interventions, and determining care needs (5) in patients with dementia. However, there is currently no general consensus regarding instruments for ADL evaluation in patients with dementia (1, 6). Although some instruments have been specifically designed to assess ADL in patients with dementia (79), time-consuming and incomplete validation restrict their use in clinical practice or epidemiological studies (1, 10).

The Barthel Index (BI) is a widely used measure of basic ADL function (self-maintenance skills such as dressing, bathing, and grooming), because of its simplicity, communicability, and ease of scoring (11, 12). Similar to other ADL instruments, the items of the BI possess a hierarchy of difficulty and yield ordinal intervals between adjacent scores (13). Therefore, practitioners and researchers may have difficulty interpreting the clinical meaning of BI summary scores or changes in scores. Although the BI has been validated and is used globally, empirical evidence for its validity for assessing patients with dementia is scarce.

Rasch analysis, a probabilistic mathematical modeling technique based on item response theory, is based on two assumptions: 1) that an instrument should be unidimensional such that items measure a single underlying construct; and 2) that scores on a measure depend on both person ability and item difficulty. If data fit a Rasch model, for the same person ability, the probability of endorsing an easy item must be higher than the probability of endorsing a more difficult item and vice versa (14, 15). Rasch analysis can transform ordinal data into equal interval measures expressed in linear log-odds probability units and therefore may provide interpretable information of the instrument in evaluating a targeting population. Rasch analysis focuses on individual items in an instrument rather than summary scores and therefore may provide diagnostic techniques for identifying ways of improving items. To the best of our knowledge, Rasch analysis of BI data from patients with dementia has not yet been performed.

The present study investigated the application of the BI for assessing basic ADL in patients with dementia using Rasch analysis, aiming to provide diagnostic information for further revision of the instrument.

Methods

Study Design

This cross-sectional study was part of the Prevalence of SymptOms of DementIa in East-Asian Cross-cultural (ePiSODIC) study comparing the type and prevalence of neuropsychiatric symptoms in patients with dementia residing in long-term care facilities (LTCFs) in East Asia. A detailed description of The ePiSODIC has been presented elsewhere (16).

Setting

Data were collected between September 2015 and April 2016 from study sites located in four countries and one district (Table 1). Among the study sites, 19 facilities were recruited, including one dementia care unit in a nursing home in southern China, nine group homes in a major metropolitan area of Japan, two special dementia care units in a general hospital in Western Japan, two LTCFs in Central Japan, one nursing home and one group home in a metropolitan area in South Korea, one nursing home in Taiwan district, and three nursing homes in Thailand.

TABLE 1
www.frontiersin.org

Table 1 Demographic and clinical characteristics of the participants.

Participants

The eligibility criteria of participants were as follows: 1) age ≥ 60 years; 2) admitted to a facility dedicated to caring for patients with dementia; 3) diagnosed with dementia or with a Clinical Dementia Rating (CDR) score ≥ 1; 4) on a stable dose of psychotropic medication and clinically stable for at least 2 weeks prior to the study. Exclusion criteria were as follows: 1) past or present comorbidity of any other psychotic disorder; 2) complication with another end-stage disease; 3) the patient or their legal guardian refused to participate in this study.

The diagnostic criteria for dementia were based on the International Classification of Diseases, 10th Revision, Diagnostic and Statistical Manual of Mental Disorders, 4th edition, or international diagnosis guidelines (16). Demographics and clinical data were obtained from medical records.

Instruments

All patients included were assessed with the BI. This scale contains 10 items with varying weights measuring basic ADL. Two items regarding bathing and grooming are scored on a 2-point scale (0 or 1 points); six items regarding feeding, dressing, controlling one’s bladder, controlling one’s bowel, toilet use (getting onto and off the toilet), and stair climbing (ascending and descending stairs) are scored on a 3-point scale (0, 1, or 2 points). Two items regarding transferring (moving from a wheelchair to the bed and vice versa) and mobility (walking ability on a level surface) are scored on a 4-point scale (0, 1, 2, or 3 points). The BI score is the cumulative score of all 10 items, with a maximum score of 20 corresponding to complete independence, and a minimum score of 0 corresponding to total dependence. The Chinese version of the BI has been widely used in older populations (17). The BI has shown high internal consistency and inter-rater reliability, good concurrent validity, and adequate responsiveness among samples from various populations, such as stroke patients (18, 19) and patients receiving neurorehabilitation (20, 21). However, its psychometric properties in patients with dementia have not been examined yet.

Dementia severity was assessed using the CDR, where scores of 0 indicate no severity, 0.5 indicates questionable severity, 1 indicates mild dementia, 2 indicates moderate dementia, and 3 indicates severe dementia (22). The CDR is a widely used clinical scale for globally staging the level of dementia severity and is generally robust against the influence of cultural bias (23). It has established test–retest reliability, inter-rater reliability, concurrent validity, and can be administered by any trained personnel (2426).

The Mini-Mental State Examination (MMSE) was used to measure cognitive function. It has a maximum score of 30, with lower scores indicating more severe cognitive impairment (27). The MMSE has established internal consistency (Cronbach’s α > 0.7), test–retest reliability (ICC 0.95), construct validity, and criterion validity (28, 29). Using the common cut-off value of <24, a recent systematic review summarized that sensitivity (0.75–0.88) and specificity (0.73–0.94) of the MMSE for detecting all-cause dementia in hospital samples were acceptable (30).

The Neuropsychiatric Inventory, Nursing Home version (NPI-NH) was used to assess the presence, frequency, and severity of behavioral and psychological symptoms of dementia (BPSD). The NPI-NH has established internal consistency (Cronbach’s α 0.64–0.80), test–retest reliability (ICC 0.89–0.93), inter-rater reliability, structural validity, and concurrent validity (Pearson correlation coefficient 0.54–0.67) (31, 32).

Because of the differences in the educational backgrounds of the healthcare professionals, their roles, and the staff mix among the study sites, the type of health care professionals who assessed the participants using the scales differed among the study sites. In mainland China and Taiwan, psychiatrists assessed the participants using the CDR and the MMSE, and graduate student nurses interviewed the staff members who took care of the participants to complete the NPI-NH and the BI. In South Korea, an experienced graduate student nurse interviewed the staff for the CDR and MMSE assessment, and registered nurses who were in charge of the participants completed the NPI-NH and the BI. In the Japanese hospital and long-term care facilities, registered nurses who were in charge of the participants filled out the CDR and MMSE sheets. In Japanese group homes, the staff members who lived with the participants filled out the CDR and MMSE sheets. In Thailand, a registered nurse and a clinical psychologist interviewed the staff for the CDR and MMSE assessment. The NPI-NH and the BI were assessed by the same method as for the CDR in Japan and in Thai. All the raters were trained in each region. The method regarding who performed the assessment using the above-mentioned instruments has been described previously (16).

To improve internal validity of the study, a standardized procedure of data collection was used. Skype conference was conducted to discuss all the issues that arose in the data collection phase. The quality of data collected has been monitored by the coinvestigators in each study site. Moreover, the principal investigator (the fifth author of this paper, Dr. Kiyoko Makimoto) flew to each study site at least once to make an on-site inspection during the data collection phase. Ten percent of the sample data had been checked for accuracy.

Sample Size Calculations

For well-targeted tests, a sample size of 150 (n range, 108–243) was indicated to provide 99% confidence of item calibration ±0.5 logits. Conversely, for tests that are not well-targeted, a larger sample size (n = 243) is required for similar item location precision (33). Previous studies suggest that sample size greater than 300 may translate into an increasing possibility for type I errors, although both infit and outfit mean square statistics are relatively insensitive to sample size variation for polytomous data (34). Therefore, we used a random-sample of 300 out of the whole dataset to validate our results. We found that the results based on the random-sample were well consistent with those based on the whole dataset (see Supplementary File Table S1).

Statistical Analysis

To ensure the requirement of a unidimensional construct for Rasch analysis, we first assessed the dimensionality of the scale using principal component analysis. If the proportion of variance explained by the Rasch dimension was >60% and the proportion of unexpected variance that accounted for the first contrast was <5%, the results were considered to support unidimensionality (33). The eigenvalue of the first contrast in the principal component analysis on standardized residuals less than 2.0 was considered ideal to support unidimensionality (33), and less than 3.0 was considered acceptable (35).

Internal consistency is indicated by person and item reliability which are analogous to Cronbach’s alpha in Rasch analysis. Person reliability < 0.8 implies that the instrument may not be sensitive enough to distinguish between high and low performers. Item reliability <0.9 implies that the sample is not large enough to confirm the item difficulty hierarchy of the instrument (33).

A nonsignificant log-likelihood chi-square value indicates good global fit to the Rasch model (33). A standard deviation (SD) of global fit residual >1.5 indicates poor model fit (36). Item fit was detected by the inlier-pattern-sensitive fit mean square (infit MnSq) and standardized Z (ZSTD). If the infit MnSq is acceptable (≤1.5), the ZSTD can be ignored (37). Misfitting items should be corrected first, before removing them from the scale (33). A large positive item residual correlation with other items (r > +.7) suggests highly local dependence (33).

A person–item map plotting item difficulty and person ability along a logit scale was used to demonstrate targeting of the BI to patients. A value of zero was allocated to the mean of item difficulty. The mean person location should approximate zero for a well-targeted tool (38). A well targeted instrument should contain items spanning across the full range of individual person locations. Thus, the map may help to identify gaps in coverage or redundant items. Gaps of more than 0.5 logits in the distribution of item difficulties indicate that items are needed to fill these gaps. If there are items with approximately the same difficulty, the items may be duplicative (33).

Threshold disordering was evaluated by visual inspection of category functions to determine whether the response probabilities are arranged in ascending order concordant with the categories. The logit distance between two adjacent categories should be <1.4 logits to show an appropriate threshold width (33). Narrow threshold width can usually be resolved by collapsing responses (15).

Differential item functioning (DIF) examines whether or not an item shows different endorsement probabilities across person groups with equal levels of capacity. In this study, DIF was identified by using log-odds estimators in Mantel for polytomies. Bonferroni’s correction was used for multiple comparisons. Because we observed obvious DIF across countries, we have then chosen Chinese and Japanese samples for further analysis of DIF by gender, age, and CDR, respectively. The sample size in the two sites was adequate for independent Rasch analysis. Regarding DIF across age, samples were divided into two groups (< 85-year-old and ≥85-year-old). Uniform DIF means that DIF is the same for all ability levels of the groups. Nonuniform DIF means that DIF differs with groups’ ability levels. Nonuniform DIF often requires removal of the item from the scale, whereas weighting or transitioning away from biased items is a potential solution for uniform DIF (39).

Although the underlying mechanisms of DIF are currently unclear, BPSD might contribute to the potential measurement bias when assessing basic ADL in patients with dementia. For example, distracting behaviors or refusal to eat may interfere with food intake (40); spatial agnosia may result in person with dementia being unable to locate the toilet, causing agitation and incontinence (41). To examine whether BPSD (measured by the NPI-NH in this study) has a negative impact on ADL abilities in the patients, Spearman’s rank correlation was furtherly conducted to identify negative correlations between NPI-NH and DIF item scores.

Rasch analysis was performed using WINSTEPS® 4.0 (SWREG Inc., USA). Patient characteristics are presented as raw frequencies and percentages for categorical variables and as mean and SD for continuous variables. Spearman’s rank correlation was conducted with SPSS version 19.0 (IBM Corp, Armonk, NY, USA).

Ethical Considerations

This research was approved by the ethics committee for each academic institution and care facilities (e.g., in China, the study was approved by the Nanfang Hospital Ethical Committee), and written informed consent was obtained from all patients and/or their legal representatives if the patients did not have the capacity to consent. All necessary measures to safeguard participants’ anonymity and confidentiality of information were thoroughly followed.

Results

In total, 644 eligible participants were included in the analysis (Table 1).

Unidimensionality

The proportion of variances explained by the Rasch dimension was 72.9%. The proportion of unexpected variance accounted for by the first contrast (the largest secondary dimension) was 6.6%. The eigenvalues of the Rasch dimension and the first contrast were 27.0 and 2.4, respectively (see Table 2). Unidimensionality is a critical property of good measurement and a prerequisite to the summation of items within an instrument. These results indicated that the unidimensionality of the BI was not robust.

TABLE 2
www.frontiersin.org

Table 2 Unidimensionality and Reliability after modification of the items in the Barthel Index (n = 644).

The potential multidimensionality of the BI was not resolved by collapsing the misfit item “mobility” (collapsing categories with narrow threshold) or removing the “mobility” item. Considering that climbing stairs might not be necessary or possible for patients with dementia residing in LTCF, we removed the “stairs climbing” item. After removing the “stairs climbing” item, the unidimentionality for the scale was established (Table 2). However, the “mobility” item was still misfitting to the Rasch model (Table 3). After removing the “stairs climbing” item, collapsing categories with narrow threshold in the “mobility” item, and combining the “controlling bowel” and “controlling bladder” into one item, the unidimentionality (Table 2) and item fit (Table 3) of the scale became acceptable.

TABLE 3
www.frontiersin.org

Table 3 The item location and results of item fit statistics of the Barthel Index for assessing patients with dementia (n = 644).

Internal Consistency

The values of person and item reliability were 0.88 and 1.00, respectively (Table 2), implying that the internal consistency of the BI was acceptable in assessing patients with dementia.

Global Fit, Item Fit, and Local Dependence

Mean standardized fit residuals (−0.1) and corresponding standard deviations (0.94) fell within an acceptable range. The log-likelihood chi-square value was nonsignificant (χ2 = 8116, P = 0.78). The findings indicate an acceptable global fit to the Rasch model.

The item “mobility” was misfitting to the Rasch model (infit MnSq > 1.5). The results of item fit analysis are shown in Table 3. No local dependence was detected (Supplementary File Table S2).

Targeting

Targeting indicates whether the set of items is of appropriate difficulty for the target population. The mean person location was −2.27 (SD = 2.71) in the total samples and varied among patients with mild (mean = −0.82, SD = 2.43), moderate (mean = −1.68, SD = 2.22), and severe (mean = −3.23, SD = 2.39) dementia. The negative mean score of person location and the skewed person-item map (Figure 1) indicated that activities in the BI were difficult for patients to complete.

FIGURE 1
www.frontiersin.org

Figure 1 The Wright person–item map of the Barthel Index for assessing patients with mild (blue symbol), moderate (orange symbol), and severe dementia (red symbol) (n = 644). Note: each “*” = 1 person, and each “#” = 4 persons; M, mean person ability or mean item difficulty; S, one standard deviation; T, two standard deviations; CDR, Clinical Dementia Rating. The vertical line is a continuum representing the measures of person ability (left side) and item difficulty (right side), plotted in logit units. The person ability and item difficulty increase from the bottom to the top.

Figure 1 shows that “bathing” was the most difficult activity, while “transfer” was the easiest (regarding item difficulty hierarchy, see also Table 3), and 28.4% of patients’ ability fell outside the coverage of the scale. On inspection of the person–item map, measurement gaps were observed between “controlling bowels” and “feeding” (> 1.0 logits), between “stair climbing” and “dressing”, and between “mobility” and “transfer” (> 0.5 logits, < 1.0 logits). The mean item difficulty of “controlling bladder” and “toilet use” overlapped at the same location, indicating potential redundancy of the items.

Threshold Ordering

All BI items exhibited appropriately ordered thresholds, indicating that the respondents had no difficulty consistently discriminating between response categories. However, “mobility” for total samples (Figure 2A), “stair climbing” in patients with mild dementia (Figure 2B), “mobility” for patients with moderate to severe dementia (Figures 2C, D), and “transfer” for patients with severe dementia (Figure 2D) demonstrated narrow threshold widths (< 1.4 logits), indicating that collapsing the categories of the items or revision is needed.

FIGURE 2
www.frontiersin.org

Figure 2 The logit distance between category thresholds for the 10 items of the Barthel Index in all the patients (A), patients with mild dementia (B), patients with moderate dementia (C), and patients with severe dementia (D). The red asterisk indicates a threshold width of less than 1.4 logits.

Differential Item Functioning

DIF indicates the potential measurement bias of an item. In the current study, when analyzing DIF related to countries, only three items (“grooming”, “dressing”, and “toilet use”) demonstrated invariant measurement traits, indicating that comparisons of BI scores between countries may not be reliable. Therefore, we further explored DIF by gender, age, and CDR in Chinese and Japanese samples, respectively. We have chosen Chinese and Japanese samples for further analysis, because the sample size in the two sites was adequate for independent Rasch analysis. DIF was found for “feeding”, “stairs climbing”, “mobility”, and “controlling bowel” in Japanese samples and for “feeding”, “stairs climbing”, “mobility”, and “transfer” in Chinese samples (Table 4). We removed the items with nonuniform DIF one by one and found that the unidimensionality and reliability were consistent with those of the original scale (Supplementary File Table S1).

TABLE 4
www.frontiersin.org

Table 4 DIF for the items of the Barthel Index across gender, age, and CDR in Chinese and Japanese samples, respectively.

Regarding DIF items, in the Japanese samples, scores for “feeding” were negatively related to scores for “disinhibition” (r = −0.46, P < 0.01), and scores for “controlling bowel” were negatively related to scores for “disinhibition” (r = −0.44, P < 0.01), “agitation” (r = −0.32, P < 0.05), and “aggression” (r = −0.27, P < 0.01). In the Chinese samples, no negative correlations between NPI-NH and DIF item scores were observed.

Based on results of Rasch analysis, we provided suggestions for future refining of the instrument when assessing patients with dementia, including filling measurement gaps (e.g., the gap between the “feeding” and “controlling bowel” items), modifying redundant items (e.g., “controlling bladder” and “toilet use”), collapsing narrow thresholds for the “mobility”, “transfer”, and “stair climbing” items, and considering potential interactions between BPSD and ADL abilities (Supplementary File Table S3). Although modifying the instrument was not the purpose of the current study, we tried to do a limited revision of the scale based on these suggestions. For example, after removing the “stairs climbing” item, collapsing categories with narrow threshold width in the “mobility” item, and combining the “controlling bowel” and “controlling bladder” into one item, unidimentionality and item fit of the revised scale were better than those of the original scale (see Table 2 and Table 3).

Discussion

Assessment of ADL in patients with dementia is important for evaluating interventions and determining care needs. To the best of our knowledge, the BI, a widely used basic ADL scale, has not been previously examined for assessing patients with dementia. One strength of the current study is that, by using Rasch analysis, a rigorous technique focusing on individual items, we evaluated the application of the BI for assessing the patients and provided suggestions for future refining of the instrument in assessing basic ADL in patients with dementia. Another strength is that our data were obtained from a multi-country study on the prevalence of BPSDs, enabling examination of DIF across countries, and exploration of the potential contributions of interactions between BPSD and ADL to item bias.

Unidimensionality is a prerequisite to the summation of item scores within an instrument (33). There is a discrepancy relating to the unidimensional qualities of the BI in previous studies (4244). By using Rasch analysis, researchers report that the BI is not unidimensional in stroke patients (45), patients receiving neurologic rehabilitation (13), and patients in acute care settings (46). “Controlling bowel” and “controlling bladder” misfit the Rasch model based on the BI scores from neurologic rehabilitation population (13), while more than half of the items in the BI misfit to the Rasch model for older acute medical patients (46). Our findings suggest that the robustness of the unidimensionality of the BI should be established for assessing patients with dementia.

In this study, the “mobility” item exhibited misfitting to the current Rasch model, and the “mobility” and “stairs climbing” showed narrow threshold width. These problems might have compromised unidimensionality of the BI. Previous studies based on Rasch analysis also suggest that the adjacent rating categories “wheel chair independent” and “walks with help” in the “mobility” item might be ambiguous since many patients use a wheelchair while able to walk short distances (45). However, when we removed the misfit item “mobility”, the “stair climbing” item and the “transfer” item became misfit to the Rasch model. When we removed the “stairs climbing” item and collapsed categories with narrow threshold width of the “mobility” item, unidimentionality and item fit of the scale were established. These findings suggest that the mobility ability might not be helpful for constructing basic ADL capability in patients with dementia. This may have occurred because the limitations of basic ADL among patients are most often the result of cognitive decline, while physical abilities of the patients remain relatively intact until the late stages of the disease (47). The discordance between mobility ability and basic ADL capability compromises the utility of the scale in the patients, particularly when there are interactions between BPSD (e.g., nighttime behavior or wandering behavior) and mobility ability. Since 2017, BI summary scores have been used as an appraisal of eligible beneficiaries of public long-term care insurance in China (48). However, maintenance of misfit items, such as “mobility”, may lead to distorted results used for assessing ADL dependence in applicants with dementia. This finding has implications for long-term care insurance policy, driven by reported levels of basic ADL capacity.

Our findings demonstrated that the BI was not well targeted to patients with item difficulty exceeding person abilities, and patients with more severe dementia demonstrated greater person–item mismatch. Moreover, patients with mild to severe severity of dementia coexisted in the same locations, suggesting that basic ADL might not be specific for differentiating severity of dementia. The item difficulty hierarchy of the BI in the present study might help in understanding the order of loss of ADL abilities in the patients and provide useful information for observing and identifying potential functional impairment among the patients. For example, inability to perform the easiest ADLs (such as transfer) indicates severe functional dependence of the patients, whereas inability to perform only the most difficult ADLs (such as bathing) suggests mild functional dependence of the patients. The items with higher difficulty level tend to be lost at the earlier stage than those with lower difficulty level. Previous studies suggest that the order of loss of ADLs for nursing home residents is: bathing, dressing, transfer and locomotion, toileting, and finally eating (49). Our findings suggest that patient with dementia also tend to be dependent for bathing, while transfer and mobility might be the last ability lost in the patient.

Identifying DIF is a critical step for improving measurement. DIF for the BI has not been previously reported (46). In the current study, violation of measurement invariance across countries suggests that BI scores should not be compared between cultures. Although the underlying mechanisms of DIF in patients are currently unclear, BPSD, cultural factors (e.g., caregivers are expected to provide assistance as an expression of caring in Chinese culture) (50), multiple medications (e.g., prevalence of psychotropic prescriptions) (51), and interactions between patients and their caregivers (52) may contribute to DIF. Available studies have indicated an association between presence/degree of behavioral disturbance (e.g., apathy, paranoia, agitation) and physical ADL performance (5355). The correlations between scores of the BI and the NPI-NH found in the present study suggest that neuropsychiatric symptoms should be taken into consideration when assessing basic ADL abilities of patients with dementia.

Several limitations of the present study should be noted. First, since the BI and the NPI-NH are informant-based assessments, scoring could be affected by information provided by the informant and memory bias. Second, correlations between specific NPI-NH and BI items might be unstable in different samples due to different prevalence rates of BPSDs in the samples and memory bias. Observation of patients’ ADL ability and BPSD by trained research assistants may help to overcome this limitation in future studies. Third, the MMSE is a norm-based psychometric measure which compares individual functioning to a reference group, therefore, confounding factors, such as education level and floor effect, might impact this norm-based measures (30). Adjusting for confounding should be taken to improve diagnostic accuracy in our future research, although 75.6% of the subjects were at moderate to severe stage of dementia in this study. Furthermore, previous studies suggest that CDR 0.5 may describe a broader population that includes subjects with mild cognitive impairment and mild dementia (56). We therefore excluded subjects with CDR score of 0.5 from this study. Improved staging tools should be used to accurately differentiate early stage of dementia from mild cognitive impairment in future studies.

Conclusions and Implications

Using Rasch analysis, the current study revealed that the utility of the BI for assessing basic ADL in patients with dementia is compromised by person–item mistargeting, measurement gaps, redundant items, narrow threshold width, and item bias. Thus, the BI summary score may not help in differentiating ADL levels among patients. Mobility may not be helpful in determining ADL capability of the patients. Comparisons of BI scores between cultures may not be reliable due to item bias across countries. We will not suggest using the instrument in patients with dementia without future refining to improve its performance for the patients. Dementia specific instruments for ADL measurement, such as the ALD-IS (57), might help to capture transition across the spectrum of basic ADL abilities of patients with dementia. Therefore, we provided useful information to help guide future revision or development of the ADL instrument for assessing basic ADL in patients with advanced dementia based on findings from the current Rasch analysis.

Data Availability Statement

The datasets generated for this study are available on request to the corresponding author.

Ethics Statement

This research was approved by the ethics committee for each academic institution and care facilities (e.g., in China, the study was approved by the Nanfang Hospital Ethical Committee), and written informed consent was obtained from all patients and/or their legal representatives if the patients did not have capacity to consent.

Author Contributions

Study concept and design: XL and KM. Acquisition of data: LD, JW, YY, and HW. Analysis and interpretation of data: YY, LD, and XL. Drafting of the manuscript: XL, LD, and YY. All authors contributed to revising the manuscript critically for important intellectual content, and final approval of the version to be submitted. YY and LD contributed equally to the work.

Funding

XL has received support from the Science and Technology Planning Project of Guangdong Province, China (Grant Number 2017A020215041) and the National Natural Science Foundation of China (Grant Number 81400868). Dr. Makimoto has received funding from the JSPS KAKENHI (Grant Number JP26305018).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank our fellows of the ePiSODIC Study Group, including Dr. Miyae Yamakawa (Department of Health Sciences, Graduate School of Medicine, Osaka University), Dr. Younhee Kang (College of Nursing, Ewha Womans University), Dr. Huei-chuan Sung (Graduate Institute of Long-term Care, Tzu Chi University of Science and Technology), Dr. Sirirat PANUTHAI (Faculty of Nursing, Chiang Mai University), Dr. Sayuri Kobayashi (Faculty of Nursing Komazawa Women’s University), and Dr. Mizue Suzuki (Department of Nursing, School of Medicine, Hamamatsu University), for collection of the primary data. We thank Benjamin Knight, MSc., from Liwen Bianji, Edanz Editing China (www.liwenbianji.cn/ac), for editing the English text of a draft of this manuscript.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2020.00282/full#supplementary-material

References

1. Desai AK, Grossberg GT, Sheth DN. Activities of daily living in patients with dementia: clinical relevance, methods of assessment and effects of treatment. CNS Drugs (2004) 18:853–75. doi: 10.2165/00023210-200418130-00003

CrossRef Full Text | Google Scholar

2. Giebel CM, Sutcliffe C, Challis D. Activities of daily living and quality of life across different stages of dementia: a UK study. Aging Ment Health (2015) 19:63–71. doi: 10.1080/13607863.2014.915920

CrossRef Full Text | Google Scholar

3. Roehr S, Riedel-Heller SG, Kaduszkiewicz H, Wagner M, Fuchs A, van der Leeden C, et al. Is function in instrumental activities of daily living a useful feature in predicting Alzheimer’s disease dementia in subjective cognitive decline? Int J Geriatr Psychiatry (2019) 34:193–203. doi: 10.1002/gps.5010

CrossRef Full Text | Google Scholar

4. Mao HF, Chang LH, Tsai AY, Huang WW, Tang LY, Lee HJ, et al. Diagnostic accuracy of Instrumental Activities of Daily Living for dementia in community-dwelling older adults. Age Ageing (2018) 47:551–7. doi: 10.1093/ageing/afy021

CrossRef Full Text | Google Scholar

5. Lin CY, Shih PY, Ku LE. Activities of daily living function and neuropsychiatric symptoms of people with dementia and caregiver burden: The mediating role of caregiving hours. Arch Gerontol Geriatr (2019) 81:25–30. doi: 10.1016/j.archger.2018.11.009

CrossRef Full Text | Google Scholar

6. Luttenberger K, Schmiedeberg A, Grassel E. Activities of daily living in dementia: revalidation of the E-ADL Test and suggestions for further development. BMC Psychiatry (2012) 12:208. doi: 10.1186/1471-244X-12-208

CrossRef Full Text | Google Scholar

7. Ard MC, Galasko DR, Edland SD. Improved statistical power of Alzheimer clinical trials by item-response theory: proof of concept by application to the activities of daily living scale. Alzheimer Dis Assoc Disord (2013) 27:187–91. doi: 10.1097/WAD.0b013e318265bcc1

CrossRef Full Text | Google Scholar

8. Boyd PA, Wilks SE, Geiger JR. Activities of daily living assessment among nursing home residents with advanced dementia: Psychometric reevaluation of the Bristol Activities Of Daily Living Scale. Health Soc Work (2018) 43:101–8. doi: 10.1093/hsw/hly010

CrossRef Full Text | Google Scholar

9. Mack JL, Patterson MB. An empirical basis for domains in the analysis of dependency in the activities of daily living (ADL): results of a confirmatory factor analysis of the Cleveland Scale for Activities of Daily Living (CSADL). Clin Neuropsychol (2006) 20:662–77. doi: 10.1080/13854040590967081

CrossRef Full Text | Google Scholar

10. Sikkes SA, de Lange-de Klerk ES, Pijnenburg YA, Scheltens P, Uitdehaag BM. A systematic review of Instrumental Activities of Daily Living scales in dementia: room for improvement. J Neurol Neurosurg Psychiatry (2009) 80:7–12. doi: 10.1136/jnnp.2008.155838

CrossRef Full Text | Google Scholar

11. Bouwstra H, Smit EB, Wattel EM, van der Wouden JC, Hertogh C, Terluin B, et al. Measurement Properties of the Barthel Index in Geriatric Rehabilitation. J Am Med Dir Assoc (2019) 20:420–5. doi: 10.1016/j.jamda.2018.09.033

CrossRef Full Text | Google Scholar

12. Hopman-Rock M, van Hirtum H, de Vreede P, Freiberger E. Activities of daily living in older community-dwelling persons: a systematic review of psychometric properties of instruments. Aging Clin Exp Res (2018) 31:917–25. doi: 10.1007/s40520-018-1034-6

CrossRef Full Text | Google Scholar

13. Tennant A, Geddes JM, Chamberlain MA. The Barthel Index: an ordinal score or interval level measure? Clin Rehabil (1996) 10:301–8. doi: 10.1177/026921559601000407

CrossRef Full Text | Google Scholar

14. Belvedere SL, de Morton NA. Application of Rasch analysis in health care is increasing and is applied for variable reasons in mobility instruments. J Clin Epidemiol (2010) 63:1287–97. doi: 10.1016/j.jclinepi.2010.02.012

CrossRef Full Text | Google Scholar

15. Robinson M, Johnson AM, Walton DM, MacDermid JC. A comparison of the polytomous Rasch analysis output of RUMM2030 and R (ltm/eRm/TAM/lordif). BMC Med Res Methodol (2019) 19:36. doi: 10.1186/s12874-019-0680-5

CrossRef Full Text | Google Scholar

16. Makimoto K, Kang Y, Kobayashi S, Liao XY, Panuthai S, Sung HC, et al. Prevalence of behavioural and psychological symptoms of dementia in cognitively impaired elderly residents of long-term care facilities in East Asia: a cross-sectional study. Psychogeriatrics (2019) 19:171–80. doi: 10.1111/psyg.12380

CrossRef Full Text | Google Scholar

17. National Health Commission of the People’s Republic of China. Nursing classification [National Health Commission of the PRC Web site] National Health Commission of the People's Republic of China (2013) [updated November 14, 2013; cited 2018 September 10]. Available from: Available at: http://www.nhc.gov.cn/wjw/pjl/201412/941e75f5e9514b8ea1b5e2e05954d09e.shtml.

Google Scholar

18. Quinn TJ, Langhorne P, Stott DJ. Barthel index for stroke trials: development, properties, and application. Stroke (2011) 42:1146–51. doi: 10.1161/strokeaha.110.598540

CrossRef Full Text | Google Scholar

19. Hsueh IP, Lin JH, Jeng JS, Hsieh CL. Comparison of the psychometric characteristics of the functional independence measure, 5 item Barthel index, and 10 item Barthel index in patients with stroke. J Neurol Psychiatry (2002) 73:188–90. doi: 10.1136/jnnp.73.2.188

CrossRef Full Text | Google Scholar

20. Hobart JC, Lamping DL, Freeman JA, Langdon DW, McLellan DL, Greenwood RJ, et al. Evidence-based measurement: which disability scale for neurologic rehabilitation? Neurology (2001) 57:639–44. doi: 10.1212/wnl.57.4.639

CrossRef Full Text | Google Scholar

21. Houlden H, Edwards M, McNeil J, Greenwood R. Use of the Barthel Index and the Functional Independence Measure during early inpatient rehabilitation after single incident brain injury. Clin Rehabil (2006) 20:153–9. doi: 10.1191/0269215506cr917oa

CrossRef Full Text | Google Scholar

22. Morris JC. The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology (1993) 43:2412–4. doi: 10.1212/wnl.43.11.2412-a

CrossRef Full Text | Google Scholar

23. Lim WS, Chong MS, Sahadevan S. Utility of the clinical dementia rating in Asian populations. Clin Med Res (2007) 5:61–70. doi: 10.3121/cmr.2007.693

CrossRef Full Text | Google Scholar

24. Morris JC, Ernesto C, Schafer K, Coats M, Leon S, Sano M, et al. Clinical dementia rating training and reliability in multicenter studies: the Alzheimer’s Disease Cooperative Study experience. Neurology (1997) 48:1508–10. doi: 10.1212/wnl.48.6.1508

CrossRef Full Text | Google Scholar

25. Marin DB, Flynn S, Mare M, Lantz M, Hsu MA, Laurans M, et al. Reliability and validity of a chronic care facility adaptation of the Clinical Dementia Rating scale. Int J Geriatr Psychiatry (2001) 16:745–50. doi: 10.1002/gps.385

CrossRef Full Text | Google Scholar

26. Han HR, Park SY, Song H, Kim M, Kim KB, Lee HB. Feasibility and validity of dementia assessment by trained community health workers based on Clinical Dementia Rating. J Am Geriatr Soc (2013) 61:1141–5. doi: 10.1111/jgs.12309

CrossRef Full Text | Google Scholar

27. Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res (1975) 12:189–98. doi: 10.1016/0022-3956(75)90026-6

CrossRef Full Text | Google Scholar

28. Tombaugh TN, McIntyre NJ. The mini-mental state examination: a comprehensive review. J Am Geriatr Soc (1992) 40:922–35. doi: 10.1111/j.1532-5415.1992.tb01992.x

CrossRef Full Text | Google Scholar

29. Shigemori K, Ohgi S, Okuyama E, Shimura T, Schneider E. The factorial structure of the Mini-Mental State Examination (MMSE) in Japanese dementia patients. BMC Geriatr (2010) 10:36. doi: 10.1186/1471-2318-10-36

CrossRef Full Text | Google Scholar

30. Hwang AB, Boes S, Nyffeler T, Schuepfer G. Validity of screening instruments for the detection of dementia and mild cognitive impairment in hospital inpatients: A systematic review of diagnostic accuracy studies. PloS One (2019) 14:e0219569. doi: 10.1371/journal.pone.0219569

CrossRef Full Text | Google Scholar

31. Wood S, Cummings JL, Hsu MA, Barclay T, Wheatley MV, Yarema KT, et al. The use of the neuropsychiatric inventory in nursing home residents. Characterization and measurement. Am J Geriatr Psychiatry (2000) 8:75–83. doi: 10.1097/00019442-200002000-00010

CrossRef Full Text | Google Scholar

32. Chen S, Lin K, Wang H, Yamakawa M, Makimoto K, Liao X. Reliability and structural validity of the Chinese version of the Neuropsychiatric Inventory, Nursing Home version. Psychogeriatrics (2018) 18:113–22. doi: 10.1111/psyg.12292

CrossRef Full Text | Google Scholar

33. Linacre JM. (2018). Winsteps® Rasch model [computer program] Version 4.0 ed. Beaverton, OR: Winsteps.com.

Google Scholar

34. Smith AB, Rush R, Fallowfield LJ, Velikova G, Sharpe M. Rasch fit statistics and sample size considerations for polytomous data. BMC Med Res Methodol (2008) 8:33. doi: 10.1186/1471-2288-8-33

CrossRef Full Text | Google Scholar

35. Lin GH, Huang YJ, Lee YC, Lee SC, Chou CY, Hsieh CL. Development of a Computerized Adaptive Testing System for Assessing 5 Functions in Patients with Stroke: A Simulation and Validation Study. Arch Phys Med Rehabil (2019) 100:899–907. doi: 10.1016/j.apmr.2018.09.122

CrossRef Full Text | Google Scholar

36. Lutomski JE, Krabbe PF, den Elzen WP, Olde-Rikkert MG, Steyerberg EW, Muntinga ME, et al. Rasch analysis reveals comparative analyses of activities of daily living/instrumental activities of daily living summary scores from different residential settings is inappropriate. J Clin Epidemiol (2016) 74:207–17. doi: 10.1016/j.jclinepi.2015.11.006

CrossRef Full Text | Google Scholar

37. Hagquist C, Bruce M, Gustavsson JP. Using the Rasch model in nursing research: an introduction and illustrative example. Int J Nurs Stud (2009) 46:380–93. doi: 10.1016/j.ijnurstu.2008.10.007

CrossRef Full Text | Google Scholar

38. Pallant JF, Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS). Br J Clin Psychol (2007) 46:1–18. doi: 10.1348/014466506x96931

CrossRef Full Text | Google Scholar

39. Hagquist C, Andrich D. Recent advances in analysis of differential item functioning in health research using the Rasch model. Health Qual Life Outcomes (2017) 15:181. doi: 10.1186/s12955-017-0755-0

CrossRef Full Text | Google Scholar

40. Liu W, Cheon J, Thomas SA. Interventions on mealtime difficulties in older adults with dementia: a systematic review. Int J Nurs Stud (2014) 51:14–27. doi: 10.1016/j.ijnurstu.2012.12.021

CrossRef Full Text | Google Scholar

41. Kyle G. An insight into continence management in patients with dementia. Br J Community Nurs (2012) 17128:125–6130–1. doi: 10.12968/bjcn.2012.17.3.125

CrossRef Full Text | Google Scholar

42. Shah S. In praise of the biometric and psychometric qualities of the Barthel Index. Physiotherapy (1994) 80:769–71. doi: 10.1016/S0031-9406(10)60621-9

CrossRef Full Text | Google Scholar

43. Hsieh CL, Hsueh IP. A cross-validation of the comprehensive assessment of activities of daily living after stroke. Scand J Rehabil Med (1999) 31:83–8. doi: 10.1080/003655099444588

CrossRef Full Text | Google Scholar

44. Leung SO, Chan CC, Shah S. Development of a Chinese version of the Modified Barthel Index– validity and reliability. Clin Rehabil (2007) 21:912–22. doi: 10.1177/0269215507077286

CrossRef Full Text | Google Scholar

45. van Hartingsveld F, Lucas C, Kwakkel G, Lindeboom R. Improved interpretation of stroke trial results using empirical Barthel item weights. Stroke (2006) 37:162–6. doi: 10.1161/01.STR.0000195176.50830.b6

CrossRef Full Text | Google Scholar

46. de Morton NA, Keating JL, Davidson M. Rasch analysis of the barthel index in the assessment of hospitalized older patients after admission for an acute medical condition. Arch Phys Med Rehabil (2008) 89:641–7. doi: 10.1016/j.apmr.2007.10.021

CrossRef Full Text | Google Scholar

47. Johnson N, Barion A, Rademaker A, Rehkemper G, Weintraub S. The Activities of Daily Living Questionnaire: a validation study in patients with dementia. Alzheimer Dis Assoc Disord (2004) 18:223–30. doi: 10.1037/t28752-000

CrossRef Full Text | Google Scholar

48. Guangzhou Municipal Human Resources and Social Security Bureau (MHRSSB). Guangzhou pilot implementation of long-term care insurance system [MHRSSB of Guangzhou Web site]. Guangzhou Municipal Human Resources and Social Security Bureau (MHRSSB) (2017). [updated 2018 September 10; cited 2017 July 31]. Available from: http://www.gz.gov.cn/zfjgzy/gzsrlzyhshbzj/xxgk/bmdt/content/post_2981073.html.

Google Scholar

49. Liu W, Unick J, Galik E, Resnick B. Barthel Index of activities of daily living: item response theory analysis of ratings for long-term care residents. Nurs Res (2015) 64:88–99. doi: 10.1097/nnr.0000000000000072

CrossRef Full Text | Google Scholar

50. Chang CC, Roberts BL. Cultural perspectives in feeding difficulty in Taiwanese elderly with dementia. J Nurs Scholarsh (2008) 40:235–40. doi: 10.1111/j.1547-5069.2008.00231.x

CrossRef Full Text | Google Scholar

51. Higami Y, Yamakawa M, Kang Y, Kobayashi S, Liao XY, Sung HC, et al. Prevalence of incontinence among cognitively impaired older residents in long-term care facilities in East Asia: A cross-sectional study. Geriatr Gerontol Int (2019) 19:444–50. doi: 10.1111/ggi.13639

CrossRef Full Text | Google Scholar

52. Liu W, Williams K, Batchelor-Murphy M, Perkhounkova Y, Hein M. Eating performance in relation to intake of solid and liquid food in nursing home residents with dementia: A secondary behavioral analysis of mealtime videos. Int J Nurs Stud (2019) 96:18–26. doi: 10.1016/j.ijnurstu.2018.12.010

CrossRef Full Text | Google Scholar

53. Norton LE, Malloy PF, Salloway S. The impact of behavioral symptoms on activities of daily living in patients with dementia. Am J Geriatr Psychiatry (2001) 9:41–8. doi: 10.1097/00019442-200102000-00007

CrossRef Full Text | Google Scholar

54. Benke T, Delazer M, Sanin G, Schmidt H, Seiler S, Ransmayr G, et al. Cognition, gender, and functional abilities in Alzheimer’s disease: how are they related? J Alzheimers Dis (2013) 35:247–52. doi: 10.3233/jad-122383

CrossRef Full Text | Google Scholar

55. Travis Seidl JN, Massman PJ. Cognitive and Functional Correlates of NPI-Q Scores and Symptom Clusters in Mildly Demented Alzheimer Patients. Alzheimer Dis Assoc Disord (2016) 30:145–51. doi: 10.1097/wad.0000000000000104

CrossRef Full Text | Google Scholar

56. Choi SH, Lee BH, Kim S, Hahm DS, Jeong JH, Yoon SJ, et al. Interchanging scores between clinical dementia rating scale and global deterioration scale. Alzheimer Dis Assoc Disord (2003) 17:98–105. doi: 10.1097/00002093-200304000-00008

CrossRef Full Text | Google Scholar

57. Reisberg B, Finkel S, Overall J, Schmidt-Gollas N, Kanowski S, Lehfeld H, et al. The Alzheimer’s disease activities of daily living international scale (ADL-IS). Int Psychogeriatr (2001) 13:163–81. doi: 10.1017/s1041610201007566

CrossRef Full Text | Google Scholar

Keywords: activities of daily living, Barthel Index, dementia, long-term care, Rasch analysis

Citation: Yi Y, Ding L, Wen H, Wu J, Makimoto K and Liao X (2020) Is Barthel Index Suitable for Assessing Activities of Daily Living in Patients With Dementia? Front. Psychiatry 11:282. doi: 10.3389/fpsyt.2020.00282

Received: 22 December 2019; Accepted: 23 March 2020;
Published: 08 May 2020.

Edited by:

Luis Agüera-Ortiz, Complutense University of Madrid, Spain

Reviewed by:

Maria João Forjaz, Carlos III Health Institute, Spain
Ralf Ihl, University Hospital of Düsseldorf, Germany

Copyright © 2020 Yi, Ding, Wen, Wu, Makimoto and Liao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiaoyan Liao, liaoxy@smu.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.