Physical activity (PA) is reduced in persons with multiple sclerosis (MS), even though it is known to aid in symptom and fatigue management. Methods for measuring PA are diverse, and the impact of this heterogeneity on study outcomes is unclear. We aimed to clarify this impact by comparing common methods for deriving PA metrics in MS populations.
First, a rapid review of the existing literature identified methods for calculating PA in studies that used the Actigraph GT3X in populations with MS. We then compared these methods in a prospective study of 42 persons with MS [EDSS 4.5 (3.5–6)] during a voluntary course of inpatient neurorehabilitation. Mixed-effects linear regression identified methodological factors that influenced PA measurements. Non-parametric hypothesis tests, correlations, and agreement statistics assessed overall and pairwise differences between methods.
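To illustrate why correlation and agreement are assessed separately when comparing two methods, the following is a minimal sketch in Python. The abstract does not name the specific statistics used, so Spearman's rank correlation and Lin's concordance correlation coefficient are assumptions chosen for illustration, and the step counts are hypothetical values, not study data.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def concordance(x, y):
    """Lin's concordance correlation coefficient (population moments)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    return 2 * sxy / (sxx + syy + (mx - my) ** 2)


# Hypothetical daily step counts from two processing methods;
# method B reads a constant 500 steps higher than method A.
method_a = [1000, 2000, 3000, 4000]
method_b = [1500, 2500, 3500, 4500]

rho = spearman(method_a, method_b)
ccc = concordance(method_a, method_b)
print(f"Spearman rho = {rho:.3f}, concordance = {ccc:.3f}")
```

In this example the two methods rank participants identically, so the correlation is perfect, yet the constant offset lowers the agreement coefficient. Two methods can therefore correlate well while still producing different point estimates.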
In the rapid review, searches identified 421 unique records. Sixty-nine records representing 51 eligible studies exhibited substantial heterogeneity in methodology and reporting practices. In the subsequent comparative study, multiple methods for deriving six PA metrics (step count, activity counts, total time in PA, sedentary time, time in light PA, and time in moderate to vigorous PA) were identified and directly compared. All metrics were sensitive to methodological factors such as the selected preprocessing filter, data source (vertical vs. vector magnitude counts), and cutpoint. Additionally, sedentary time was sensitive to wear time definitions. Pairwise correlation and agreement between methods varied from weak (minimum correlation: 0.15, minimum agreement: 0.03) to perfect (maximum correlation: 1.00, maximum agreement: 1.00). Methodological factors biased both point estimates of PA and correlations between PA and clinical assessments.
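The dependence on data source and cutpoint can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the study's pipeline: the per-minute counts are hypothetical, and the two cutpoints (100 and 2000 counts per minute) are placeholder values rather than any published threshold. Vector magnitude is taken as the Euclidean norm of the three axis counts.

```python
import math


def vector_magnitude(vertical, horizontal, perpendicular):
    """Combine three axis counts into one vector magnitude count."""
    return math.sqrt(vertical**2 + horizontal**2 + perpendicular**2)


def minutes_by_intensity(counts_per_minute, sedentary_max, mvpa_min):
    """Count one-minute epochs falling in each intensity category.

    Epochs below sedentary_max are sedentary, epochs at or above
    mvpa_min are moderate to vigorous PA, and the rest are light PA.
    """
    totals = {"sedentary": 0, "light": 0, "mvpa": 0}
    for count in counts_per_minute:
        if count < sedentary_max:
            totals["sedentary"] += 1
        elif count < mvpa_min:
            totals["light"] += 1
        else:
            totals["mvpa"] += 1
    return totals


# Hypothetical one-minute epochs: (vertical, horizontal, perpendicular).
epochs = [(50, 120, 90), (300, 400, 0), (2500, 100, 100), (0, 0, 0)]

vertical_counts = [epoch[0] for epoch in epochs]
vm_counts = [vector_magnitude(*epoch) for epoch in epochs]

# Placeholder cutpoints, assumed for illustration only.
SEDENTARY_MAX = 100
MVPA_MIN = 2000

by_vertical = minutes_by_intensity(vertical_counts, SEDENTARY_MAX, MVPA_MIN)
by_vm = minutes_by_intensity(vm_counts, SEDENTARY_MAX, MVPA_MIN)
print("vertical axis:", by_vertical)
print("vector magnitude:", by_vm)
```

With identical cutpoints, the first epoch is classified as sedentary from the vertical axis alone but as light PA from the vector magnitude, because movement on the other two axes raises the combined count. This is one way in which the choice of data source can shift sedentary and light PA totals.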
Methodological heterogeneity in the existing literature is high, and this heterogeneity may confound studies that use the Actigraph GT3X. Step counts were highly sensitive to the filter used to process raw accelerometer data. Sedentary time was particularly sensitive to methodology, and we recommend using total time in PA instead. Several, though not all, methods for deriving light PA and moderate to vigorous PA yielded nearly identical results. PA metrics based on vertical axis counts tended to outperform those based on vector magnitude counts. Additional research is needed to establish the relative validity of existing methods.