Investigating the Impact of Environment and Data Aggregation by Walking Bout Duration on Parkinson’s Disease Classification Using Machine Learning

Rehman, Rana Zia Ur; Guan, Yu; Shi, Jian Qing; Alcock, Lisa; Yarnall, Alison J.; Rochester, Lynn; Del Din, Silvia

doi:10.3389/fnagi.2022.808518

ORIGINAL RESEARCH article

Front. Aging Neurosci., 22 March 2022

Sec. Parkinson’s Disease and Aging-related Movement Disorders

Volume 14 - 2022 | https://doi.org/10.3389/fnagi.2022.808518

This article is part of the Research TopicActivities of Daily Living and Everyday Functioning: From Normal Aging to Neurodegenerative DiseasesView all 14 articles

Investigating the Impact of Environment and Data Aggregation by Walking Bout Duration on Parkinson’s Disease Classification Using Machine Learning

Rana Zia Ur Rehman¹

Yu Guan²

Jian Qing Shi^3,4

Lisa Alcock¹

Alison J. Yarnall^1,5

Lynn Rochester^1,5

Silvia Del Din^1*

¹Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, United Kingdom
²School of Computing, Newcastle University, Newcastle upon Tyne, United Kingdom
³School of Mathematics, Statistics and Physics, Newcastle University, Newcastle upon Tyne, United Kingdom
⁴Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen, China
⁵The Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, United Kingdom

Parkinson’s disease (PD) is a common neurodegenerative disease. PD misdiagnosis can occur in early stages. Gait impairment in PD is typical and is linked with an increased fall risk and poorer quality of life. Applying machine learning (ML) models to real-world gait has the potential to be more sensitive to classify PD compared to laboratory data. Real-world gait yields multiple walking bouts (WBs), and selecting the optimal method to aggregate the data (e.g., different WB durations) is essential as this may influence classification performance. The objective of this study was to investigate the impact of environment (laboratory vs. real world) and data aggregation on ML performance for optimizing sensitivity of PD classification. Gait assessment was performed on 47 people with PD (age: 68 ± 9 years) and 52 controls [Healthy controls (HCs), age: 70 ± 7 years]. In the laboratory, participants walked at their normal pace for 2 min, while in the real world, participants were assessed over 7 days. In both environments, 14 gait characteristics were evaluated from one tri-axial accelerometer attached to the lower back. The ability of individual gait characteristics to differentiate PD from HC was evaluated using the Area Under the Curve (AUC). ML models (i.e., support vector machine, random forest, and ensemble models) applied to real-world gait showed better classification performance compared to laboratory data. Real-world gait characteristics aggregated over longer WBs (WB 30–60 s, WB > 60 s, WB > 120 s) resulted in superior discriminative performance (PD vs. HC) compared to laboratory gait characteristics (0.51 ≤ AUC ≤ 0.77). Real-world gait speed showed the highest AUC of 0.77. Overall, random forest trained on 14 gait characteristics aggregated over WBs > 60 s gave better performance (F1 score = 77.20 ± 5.51%) as compared to laboratory results (F1 Score = 68.75 ± 12.80%). Findings from this study suggest that the choice of environment and data aggregation are important to achieve maximum discrimination performance and have direct impact on ML performance for PD classification. This study highlights the importance of a harmonized approach to data analysis in order to drive future implementation and clinical use.

Clinical Trial Registration: [09/H0906/82].

Introduction

Parkinson’s disease (PD) is the second most common neurodegenerative disease after Alzheimer’s disease (Nutt and Wooten, 2005; Feigin et al., 2019). PD prevalence has doubled over the past 25 years and now affects approximately 10 million people worldwide (Dorsey et al., 2018). Due to the progressive nature of PD (Dorsey et al., 2013; Emamzadeh and Surguchov, 2018), both motor (Hobert et al., 2019) and non-motor (Przedborski et al., 2003; Jankovic, 2008) symptoms have a significant impact on quality of life and increased burden on healthcare costs (von Campenhausen et al., 2011). Currently, diagnostic criteria for PD are based on motor features assessed with clinical scales (Jankovic, 2008; Postuma et al., 2015). However, the diagnostic accuracy of PD in a clinical setting is only 74% if performed by a non-expert and 80% by a movement disorder specialist (Rizzo et al., 2016). Given the relatively low rates of accurate diagnosis, particularly in the early stages, there is a need for additional diagnostic aids (Mancini et al., 2011). The application of gait analysis may be a promising addition to the diagnostic toolkit (Buckley et al., 2019; Viceconti et al., 2020).

Previous work has shown that an objective gait assessment obtained in lab settings and the clinic can be used to classify PD using machine learning (ML) approaches (Rehman et al., 2019a,b, 2020a,b). However, assessing gait in both the lab and the clinic has some key limitations. The patient is required to attend specialist facilities, and assessments often do not represent the range of challenges associated with habitual walking (Orendurff et al., 2008). Moreover, individuals tend to perform better (walk faster) during performance tests which reflects walking capacity (“can do”) (Del Din et al., 2016a) compared to everyday life which captures the functional performance (“actually do”) of the participant (Hillel et al., 2019; Maetzler et al., 2020; Shah et al., 2020c; Warmerdam et al., 2020; Atrsaei et al., 2021). The real world may therefore provide a more sensitive and pragmatic context to identify and classify PD (Shah et al., 2020c). Increasing interest in the use of inertial measurement units (IMUs) to monitor gait in people with PD in the lab is evident (González et al., 2010; McCamley et al., 2012; Zijlstra and Zijlstra, 2013; Godfrey et al., 2015; Del Din et al., 2016c), as is monitoring gait continuously in the real world over multiple consecutive days (De Bruin et al., 2007; Weiss et al., 2013, 2014; Godfrey et al., 2014). However, several methodological challenges remain for a better understanding and analysis of real-world gait data. These include extraction of relevant gait characteristics and appropriate use of data aggregation for analysis, e.g., averaging gait characteristics using various WB durations.

Spatiotemporal gait characteristics [from the gait domains of pace, rhythm, variability, asymmetry, and postural control (Lord et al., 2013)] from lab and real-world data are significantly different in people with PD compared to healthy controls (HCs) (Maetzler et al., 2013; Del Din et al., 2016a). However, methods for analysis of data obtained in real-world settings rely on selecting the protocol for gait assessment (e.g., environment and duration) and data aggregation by walking bout (WB) duration (e.g., aggregating all WB’s or selecting an optimal bout duration) (Del Din et al., 2016a,b, 2019; Shah et al., 2020a; Warmerdam et al., 2020). All these options impact on the quantification of spatiotemporal gait characteristics and subsequent results (Del Din et al., 2016b).

Real-world gait consists of a variety of WBs of different durations, the majority of which are short (<10 s, approximately 50%) with only 3% over 60 s for both PD and HC (Del Din et al., 2016a). In contrast, lab-based gait assessments are based on standardized walking distances such as 4 or 10 m (Del Din et al., 2016a,c; Van Ancum et al., 2019) or duration (e.g., 2 min) (Rehman et al., 2019a,b; Del Din et al., 2020). Comparison of data obtained in the lab and in the real-world is therefore challenging.

In previous work, ML classifiers have been trained on data from lab-based gait assessments (Rehman et al., 2019a,b, 2020a,b). The impact of environment (lab vs. real-world) and data aggregation by WB duration on PD classification has not been thoroughly explored. Different WB durations also influence the distribution of gait characteristics. Therefore, ML models need to be able to account for multiple distributions (due to inclusion of a variety of short and long WBs) of real-world gait characteristics. To the best of the authors’ knowledge, the impact of WB durations on the classification of PD using machine learning approaches has not yet been investigated.

The aims of this study are therefore to: (i) investigate the impact of environment (gait assessment in lab vs. real world) and (ii) data aggregation by WB duration on gait characteristics and performance of ML models to accurately classify PD. Based on current available univariate analyses (Del Din et al., 2016a,2019; Shah et al., 2020c), we hypothesized that: (i) real-world gait would be more sensitive for performing the ML based classification of PD compared to lab gait assessment; (ii) associations between lab-based and real-world gait would vary depending on WB duration; and (iii) ML model performance would be influenced by WB duration.

Materials and Methods

Participants

In this cross-sectional analysis, 52 HCs and 47 people with PD were included from the 18 month time point of the “Incidence of Cognitive Impairment in Cohorts with Longitudinal Evaluation–GAIT” (ICICLE-GAIT) study (Khoo et al., 2013; Yarnall et al., 2014). ICICLE-GAIT is a study nested within ICICLE-PD which recruited participants between June 2009 and December 2011 (Khoo et al., 2013). PD participants were recruited from local movement disorders clinics (Khoo et al., 2013) and had a diagnosis of idiopathic PD according to the United Kingdom Brain Bank Criteria (Khoo et al., 2013; Yarnall et al., 2014). PD participants who have Parkinsonism disorders or an atypical form of Parkinson’s disease, with poor knowledge of working English language, or with cognitive impairment (Mini-Mental State Examination score < 24) were excluded from the study. The HC participants were recruited from the local community and included provided that they were able to walk independently and were without significant motor, mood, or cognitive impairment. ICICLE-GAIT received ethical approval from the Newcastle and North Tyneside Research Ethics Committee (REC No. 09/H0906/82). Study procedures were conducted according to the Declaration of Helsinki and all participants gave written informed consent prior to participating.

Demographics and Clinical Characteristics

Participant demographics such as sex, age, mass, height, and BMI were recorded. The Montreal Cognitive Assessment (MoCA) was used to measure global cognition (Nasreddine et al., 2005). Balance confidence was assessed using the Activities Specific Balance Confidence scale (ABC) (Powell and Myers, 1995). PD motor severity was assessed with Part III of the Movement Disorder Society Unified Parkinson’s Disease Rating Scale (MDS-UPDRS III) (Goetz et al., 2008) and disease stages were also recorded according to Hoehn and Yahr (1998). The levodopa equivalent daily dose (LEDD; mg/day) was also calculated (Tomlinson et al., 2010; Lawson et al., 2016).

Gait Assessment

Equipment and Protocol

Gait assessment was performed in the lab and real-world using a tri-axial (x: vertical, y: mediolateral, z: anteroposterior) accelerometer (Axivity, AX3, sample frequency: 100 Hz, range: ±8 g) on the lower back, as shown in Figure 1A. In the lab, a 2 min continuous walk around an oval circuit was performed (Rehman et al., 2019b). PD participants’ gait was assessed while optimally medicated (approximately one hour after medication intake). In the real-world, gait was monitored continuously for 7 days (Del Din et al., 2016a,b, 2019). This took place following the lab assessment. Participants were instructed to perform their usual activities. Further details can be found in previous work (Godfrey et al., 2014; Del Din et al., 2016a,2019).

FIGURE 1

Figure 1. Overall workflow from gait assessment to classification: (A) Gait assessment protocol, (B) WB detection and gait characterization, (C) 14 gait characteristics (Var, variability; Asy, asymmetry), (D) Classification modeling.

Data Processing and Extraction of Gait Characteristics–Lab

Data from the accelerometer was downloaded to a computer for offline processing in MATLAB (R2019a). The vertical component of the transformed acceleration signal was filtered first to 20 Hz with a 4th order Butterworth filter (Moe-Nilssen, 1998; Zijlstra and Hof, 2003; McCamley et al., 2012). To detect the events within each gait cycle (Figure 1B), the initial contact (IC, heel strike) and final contact (FC, toe-off) points were identified with the help of a Gaussian continuous wavelet transform. Additional temporal gait characteristics (step time, swing time, and stance time) were quantified based on IC and FC (McCamley et al., 2012; Godfrey et al., 2015; Del Din et al., 2016c). For the evaluation of spatial characteristics (step length) the inverted pendulum model was utilized (Zijlstra and Zijlstra, 2013) and step velocity was calculated as the ratio of step length and step time (Del Din et al., 2016c). Variability was calculated as the standard deviation from all steps and asymmetry as the absolute difference of alternative steps (left and right) (Del Din et al., 2016c). The detailed method for the evaluation of spatiotemporal gait characteristics is described in previous work (Lord et al., 2013; Godfrey et al., 2015; Del Din et al., 2016c).

Fourteen spatiotemporal gait characteristics (Figure 1C) were extracted based on ICs and FCs and mapped onto five domains: pace (step velocity, step length, swing time variability), rhythm (step time, swing time, stance time), variability (step velocity variability, step length variability, step time variability, swing time variability, stance time variability), asymmetry (step time asymmetry, swing time asymmetry, stance time asymmetry), and postural control (step length asymmetry) (Lord et al., 2013; Godfrey et al., 2015; Del Din et al., 2016c).

Data Processing, Extraction of Gait Characteristics, and Data Aggregation–Real-World

For the real-world gait assessment, data was downloaded to a computer for offline processing in MATLAB (R2019a). Accelerometry data was segmented into each calendar day and WBs were detected based on the magnitude and standard deviation of the acceleration signal (Del Din et al., 2016d; Hickey et al., 2016). A WB was defined as the continuous length of time spent during walking (Godfrey et al., 2014), with at least three steps (Del Din et al., 2016a,2019). No resting period thresholds between consecutive WBs were set so that each WB was individually considered (and not merged to other WBs) (Barry et al., 2015). Gait characteristics were firstly evaluated for each WB by combining all steps within a WB. Then, all WBs were combined for each day to provide a daily average. Finally, each day was combined to provide a 7 day average for each gait characteristic (Del Din et al., 2016a,2019). The same fourteen gait characteristics were extracted from the real-world (Lord et al., 2013; Godfrey et al., 2015; Del Din et al., 2016c) for comparison with lab-based gait (Figure 1C).

To investigate the impact of real-world data aggregation by WB duration on gait characteristics and ML models, a comprehensive approach was adopted. WB of various durations (seconds) were considered and aggregated over the 7 days (Figure 1A). In total, fourteen WB durations were chosen, and the average of all WBs was used to describe each gait characteristic. The six most optimal and distinct WB durations without having redundant information by combining the incremental WBs (i.e., WBs ≤ 10 s, 10 < WBs ≤ 30 s, 30 < WBs ≤ 60 s, 60 < WBs ≤ 120 s, WBs > 60 s, WBs > 120 s) are presented in the manuscript to reduce the data for clear message. However, the remaining WB durations (i.e., WBs ≤ 5 s, WBs ≤ 30 s, WBs ≤ 60 s, WBs ≤ 90 s, WBs ≤ 120 s, 5 < WBs ≤ 10 s, 60 < WBs ≤ 90 s, 90 < WBs ≤ 120 s) are provided in the Supplementary Material.

Statistical Analysis

Normality of the data (gait characteristics) was checked by plotting histograms and using the Shapiro Wilk test for each environmental condition. In addition, rain clouds and box plots were used to visually check the distribution for each group and shift in distribution among groups (PD vs. HC) for each gait characteristic. To evaluate differences between PD, HC, and the impact of environment, a mixed ANOVA was performed on the data aggregation by WB duration and their combined effect (interaction) on each gait characteristic. Based on the data distribution, student t-test and Mann Whitney U-test were used to evaluate differences between the PD and HC groups. Given the exploratory nature of this analysis, we used a p-value < 0.05 to guide statistical interpretation and did not make adjustments for multiple comparisons (Rothman, 1990; Perneger, 1998). This is due to the inclusion of mixed ANOVA for overall statistics and area under the receiver operating characteristics curve (AUC) analysis for each gait characteristic to investigate its discriminatory power (PD vs. HC) under different environmental conditions and aggregation by WB duration. In addition, the p-values are provided to assess the statistical significance of between group differences. The relationship between lab and real-world gait characteristics was also assessed with the Pearson’s correlation coefficient.

Machine Learning Classification of Parkinson’s Disease

Three different ML models were used: support vector machine (SVM), random forest (RF), and an ensemble of these two classifiers (Figure 1D; Rehman et al., 2019a,b). The ensemble model made the decision based on soft voting (probability) (Pedregosa et al., 2011). Due to the variety of data distributions, instead of training a separate model for each WB threshold, ML models were trained by combining all WB duration data. This allowed one single model to be developed, which could cater for the distribution of the entire dataset. Training performance of the models was evaluated using a 10-fold cross-validation technique on 70% of data, and separate testing was done on each WB duration threshold by keeping 30% of the data for testing. This rigorous training and testing process was repeated 10 times based on different random seed values. Classifier performance was evaluated in terms of accuracy, F1 score, AUC, sensitivity, and specificity (Rehman et al., 2019a, 2020a,b). In addition, influential gait characteristics were also identified based on their importance in RF and recursive feature elimination (RFE) technique with SVM-linear (Rehman et al., 2019a,b). Model hyperparameters were optimized with grid search. The standard python library SciKit learn was used for ML analysis.

Results

Demographic and clinical characteristics are summarized in Table 1. There were no significant differences between the PD and HC groups for sex, age, height, mass, and BMI. People with PD had lower cognitive scores (MoCA) and reduced Activities-Specific Balance Confidence (ABC) score compared to HC (p < 0.05). PD participants had an average disease duration of 26 months, the majority of which were Hoehn and Yahr stage II with an average MDS-UPDRS III score of 31.5 ± 9.8.

TABLE 1

Table 1. Demographics and clinical measures of the Parkinson’s disease (PD) and healthy controls (HC) group.

Impact of Environment and Data Aggregation on Gait Characteristics

Overall Statistics

The distribution of WB depending on duration is shown in Figure 2 and the distribution of step velocity is shown in Figure 3. Distributions for the remainder of gait characteristics are reported in Supplementary Figures 1, 2. Overall, mixed ANOVA statistical analysis results are presented in Table 2. Statistical differences between PD and HC are displayed with a heat map in Figure 4.

FIGURE 2

Figure 2. Distribution of all detected walking bouts (WBs) in Parkinson’s disease (PD) and healthy control (HC) groups in the real-world assessment of 7 days. WB are categorized intro 14 thresholds based on time in seconds followed by their average of 7 days.

FIGURE 3

Figure 3. Distribution of step velocity (m/s) into different WB thresholds and average of 7 days.

TABLE 2

Table 2. Mixed ANOVA results: main effects and interactions between gait assessment environmental conditions (lab vs. real-world) and groups (PD vs. HC) for each gait characteristic.

FIGURE 4

Figure 4. Comparison of effect of environment (lab vs. real-world) and WB duration (threshold) on discrimination between PD and HC participants (Dark highlighted color means lower p values).

There was a significant interaction (Table 2) between environment (lab and real-world, including data aggregation by WB duration) and group (HC, PD) for seven characteristics (step velocity, step time, swing time, stance time, step time variability, stance time variability, and swing time asymmetry). However, the main effect of within participant factor revealed that all the gait characteristics evaluated in the lab setting were significantly different from those evaluated in the real-world setting and for all data aggregations. Similarly, the main effect of between-participant factor revealed that there were significant differences between PD and HC for all the gait characteristics except step velocity variability, step length variability, and step length asymmetry.

Impact of Environment [Lab vs. Real-World (All Walking Bouts)] on Between-Group Differences (Parkinson’s Disease vs. Healthy Control)

Lab vs. Real-World (All Walking Bouts)

During the 2-min continuous walk in the lab, both groups walked faster with a shorter step time and longer step length compared to when in the real-world regardless of WB duration. Even for the longest WB (>120 s), these characteristics (step velocity and step length) were reduced compared to the 2-min (120 s) continuous walk in the lab. As an example, the distribution of step velocity for both groups and environmental conditions under different data aggregation by WB durations is shown in Figure 3. Gait variability (swing time, step time, stance time, step length, and step velocity) was reduced when measured in the lab compared to the real-world. Gait was more symmetrical when measured in the lab (step time, stance time, swing time, and step length) for both PD and HC compared to real-world gait irrespective of WB duration.

Parkinson’s Disease vs. Healthy Control

Both in the lab and real-world conditions, when combining all WB durations, PD participants walked slower, with slower step time and shorter step length compared to HC (Supplementary Figure 1). Step velocity and step length were significantly different (p < 0.05) between PD and HC in both lab and real-world conditions (Figure 4). In the real-world, PD participants had significantly lower swing time variability compared to HC (p = 0.033). None of the asymmetry-based gait characteristics measured in the lab were significantly different between PD and HC, except for step length asymmetry (Figure 4). Similarly, for asymmetry-based characteristics in the real-world, only swing time asymmetry, based on the combination of all WBs, was significantly higher for PD compared to HC (p = 0.006) (Figure 4).

Effect of Real-World Data Aggregation on Gait

Distribution of Data Aggregation by Walking Bout Durations

All PD and HC participants had WBs across all duration thresholds. The distribution of WBs are shown in Figure 2 and Supplementary Figure 3. The majority of WBs (87% for PD and 85% for HC) were of shorter duration (≤10 s), with relatively few WB per day found over 120 s (1.8% for PD and 1.9% for HC). Overall, HC had a greater number of WB (total n = 170,368 from 52 HC) over 7 days of continuous assessment compared to the PD group (total n = 134,156 from 47 PD).

Effect of Data Aggregation by Walking Bout Durations on Between Group Differences

For both PD and HC groups, the slowest speed was observed during very short WBs (≤10 s) compared to long WBs > 10 s (Figure 3 and Supplementary Figures 2, 4). The most significant (p < 0.01) group differences between PD and HC were found in longer WBs (>60 s or 120 s) as compared to shorter WBs (Figure 4). Similarly, reduced step length and shorter step time were observed in short WBs (≤10 s) as compared to long WBs. Step time was significantly slower for PD in longer WBs such as >60 s (p = 0.001) and >120 s (p < 0.001) compared to HC. Interestingly, gait variability in the longer WBs (>60 s and >120 s) resulted in significant differences between the groups for swing time variability, step length variability, step time variability, and stance time variability. All the asymmetry-based gait characteristics behaved differently in short and long WBs. For example, in the longer WBs (>120 s), asymmetry-based gait characteristics were similar or close to lab-based gait asymmetry characteristics.

To summarize, in the real world, significant group differences (PD vs. HC) were identified for WBs longer than 60 or 120 s for all gait characteristics apart from asymmetry-based gait characteristics.

Association Between Gait Characteristics Collected in the Lab vs. Real-World

The association between lab and real-world gait characteristics is shown in Figure 5. In general, lab and real-world gait characteristics showed either no correlation or weak-to-moderate association with one another. However, stronger correlations were noted for the PD group compared to HC with correlations >0.5 observed for step length, step time, and stance time. Stance time resulted in the strongest correlations, followed by step time, compared to other gait characteristics, with the strongest correlation of 0.59 observed for WBs of 10–30 s and 30–60 s. Real-world gait speed resulted in a weak correlation for both PD and HC (max 0.388) with lab-based gait speed in the longer WBs > 60 or >120 s. Short WBs (≤10 s) had weak correlation compared to longer WBs (>10 s). Variability characteristics were negatively correlated between the lab and real-world gait assessments. Results for all WB durations are presented in Supplementary Figure 5.

FIGURE 5

Figure 5. Association between lab-based gait characteristics with real-world gait. Pearson’s correlation r values for both PD and HC.

Impact of Environment and Data Aggregation on Classification of Parkinson’s Disease

Results showing the impact of environment and data aggregation by WB durations on each individual gait characteristic and ML models for classification of PD are presented in Figures 6, 7 (Supplementary Figures 6–8) and Table 3 (Supplementary Table 1).

FIGURE 6

Figure 6. Discrimination of PD from HC based on each individual gait characteristic with area under the receiver operating characteristics curve (AUC).

FIGURE 7

Figure 7. Accuracy of trained classifiers based on the lab and real-world assessment of gait grouped by WB durations (▲ means average testing accuracy across all tests on independent dataset).

TABLE 3

Table 3. Evaluation metrics (accuracy, sensitivity, and specificity) of the trained classifiers on the lab and real-world test data under various walking bout (WB) durations.

Machine Learning Performance in the Lab vs. Real-World

Based on the AUC for each gait characteristic discriminating PD from HC (Figure 6), real-world gait characteristics combining all WBs had relatively low AUC values compared to gait assessed in lab settings. However, the asymmetry-based characteristics in lab had lower AUC (0.51–0.57) as compared to real-world asymmetry-based gait characteristics (AUC = 0.61–0.68).

For the classification of PD, various gait characteristics were statistically significant between the groups (PD vs. HC) in lab and real-world settings. Therefore, the classifiers were trained on the overall 14 gait characteristics. During the training phase, performance of the classifiers was based on the 10-fold cross-validation and ranged between 72 and 95% (Supplementary Figure 7). These trained classifiers were tested separately on the 30% average lab and real-world test data. Random forest performed better under both environment conditions. When combining all WBs, real-world gait gave lower classification performance (accuracy: 60.67 ± 10.65) compared to lab-based gait (accuracy: 64.67 ± 15.02). This lower performance from real-world data (all WBs combined) was observed (Figure 7 for F1 score) in all the three models (random forest, support vector machine, and ensemble model). However, only RF performance was statistically significant from SVM under both environmental conditions.

Impact of Data Aggregation by Walking Bout Duration on Machine Learning Performance

Real-world gait characteristics had higher AUC compared to lab-based gait assessment for selected WB durations (Figure 6). The maximum AUC of 0.765 was observed for step velocity in real-world gait assessment from longer WBs (>120 s), followed by the lab-based step velocity (AUC of 0.721), 30 < WBs ≤ 60 s (AUC of 0.714), and WBs > 60 s (AUC of 0.709). All the rhythm-based gait characteristics (step time, swing time, and stance time) had an AUC around 0.7 when aggregated across longer WBs > 60 s or > 120 s. A maximum AUC of 0.703 was found in 30 < WBs ≤ 60 s for swing time asymmetry. To summarize, the maximum AUC from real-world gait characteristics were found in the longer WBs (30 < WBs ≤ 60 s, WBs > 60 s, and WBs > 120 s).

Data distribution for different thresholds of WB duration varies. Therefore, classifiers were trained on the combined data to accommodate all distributions. In addition, because there were differences in gait characteristics (discriminating PD vs. HC) for different thresholds of WB duration, classifiers were trained on all 14 gait characteristics. During the training phase, the performance of the classifier based on the 10-fold cross-validation ranged between 72 and 95% (Supplementary Figure 7). Overall, RF performed better on the new data set (30%) used for testing as compared to other classifiers (Figure 7). In addition, RF classification performance was significantly different from SVM in all WB durations except WBs < 10 s, while RF was only significantly different from ensemble model in 10 < WBs < 30 s and WBs > 120 s. Longer WBs > 60 s gave a better classification performance in discriminating PD from HC as compared to other WB durations in the real-world data. However, the classifier performance varies with different data aggregation by WB durations, which indicates that comparing gait performance from the same participants in different environments (and WB durations) can influence the classifiers. Maximum testing performance of the classifiers were obtained from 30 < WBs ≤ 60 s, WBs > 60 s, and WBs > 120 s (Table 3).

Influential gait characteristics were similar to those characteristics with a higher AUC (e.g., step velocity, step length, step time, swing time, stance time, and swing time variability; Supplementary Figure 9). Based on the importance of characteristics in the RF classifier, swing time, step velocity, stance time, swing time variability, and step length were identified as the top five characteristics. Similarly, based on the RFE with support vector machine, step velocity, step length, stance time, step time, and swing time were identified. Based on the common characteristics in the top five, step velocity, step length, swing time, and stance time were identified by both classifiers.

Discussion

This is the first study to comprehensively investigate the impact of environment and data aggregation by WB duration on ML performance for the classification of PD. Based on the results, environment and aggregation of real-world data by WB duration influenced each individual gait characteristics for both groups and subsequent performance of ML models. We found a weak to moderate association between lab and real-world gait for both PD and HC. Based on the AUC of each gait characteristic compared to the lab, real-world gait better discriminated PD from HC, with step velocity in longer WBs (>120 s) providing the highest AUC of 0.765. In terms of PD classification, ML performance using real-world data gave better results compared to lab-based gait assessment for selected WB durations (WBs > 60 s; 30 < WB ≤ 60 s; > 120 s). Our findings show that testing environment and data aggregation (by WB duration) influence accuracy of ML performance and, therefore, classification of PD.

Impact of Environment and Data Aggregation on Gait Characteristics

Lab vs. Real-World

In the present study, gait assessed in the lab appeared to give different values and results compared to the real-world gait assessed for 7 consecutive days across all gait domains (pace, rhythm, variability, asymmetry, and postural control). These findings align with previous work (Del Din et al., 2016a), where PD and HC gait was assessed in the lab (10 m walk in a straight line) and in the real world for over 7 days. A major factor explaining the differences observed between environments is that in the lab, gait is measured under controlled settings during scripted tests (reflecting capacity), whilst real-world gait is characterized by natural walking behavior executed under variable settings and conditions (reflecting performance) (Del Din et al., 2016a,b, 2019; Shah et al., 2020c). People tend to walk faster with longer steps, lower variability, and higher asymmetry in the lab compared to real-world (Del Din et al., 2016a), which is evident from the present findings and in agreement with others (Takayanagi et al., 2019). Findings from this study support previous work showing that real-world gait is more variable than lab-based gait (Del Din et al., 2016a,b, 2019).

We found that the association (correlation) between lab-based and real-world gait characteristics was weak to moderate, irrespective of WB duration, suggesting real-world and lab-based gait are measuring different aspects and constructs (i.e., performance vs. capacity) of walking (Maetzler et al., 2020). These results concur with previous work showing that walking speed during a 4 m walk had low correlation with real-world gait (Van Ancum et al., 2019). One reason for the low correlation could be the heterogeneous distribution of real-world characteristics. Moreover, other gait activities, such as turning, were not accounted for. For example, people with PD tend to turn, on average, >60 times every hour (El-Gohary et al., 2014) rather than walk in perfectly straight lines in the real-world (Galperin et al., 2019; Hillel et al., 2019), which cannot be evaluated using a tri-axial accelerometer alone.

There are many other factors influencing the complexity of real-world gait. Real-world gait is intrinsically dual task and is cognitively demanding due to complex and challenging environments in comparison to scripted gait lab tests when attention is heightened (Robles-García et al., 2015; Del Din et al., 2016a). Another important factor, especially in PD, is that medication affects gait (Ghoraani et al., 2019; Evers et al., 2020) and this is difficult to account for in the real-world where medication regimes and intake may be unknown and will impact on gait and motor fluctuations. In the present study, the lab-based gait assessment was performed one hour after medication intake in the practically defined “on” state. Therefore, we may expect to see an individual’s optimal capacity. Conversely, in the real-world, walking may take place at all points of the medication cycle, resulting in on-off fluctuations in motor function and, consequently, in gait (Ghoraani et al., 2019). This can act as a confounding factor when averaging gait characteristics across different WB durations and identifies an important area for future work to understand the effect of medication on real-world gait.

Parkinson’s Disease vs. Healthy Control

Gait characteristics extracted from the 2-min walk in the lab were statistically different for PD and HC compared to the real-world when combining all WBs together, except for asymmetry characteristics. These findings are difficult to compare with previous work where different protocols for gait assessment in the lab have been utilized [e.g., 10 m (Del Din et al., 2016a) or 7 m walk (Shah et al., 2020c)]. In real-world conditions, pace characteristics, such as step velocity and step length, were significantly different between PD and HC across all WBs > 10 s. Other gait characteristics (variability and asymmetry) behaved differently depending on WB duration, with differences in gait between PD and HC present for medium-to-long WB, but not for shorter WB. One possible reason for these discrepancies could be related to the algorithm, i.e., the performance of gait and step detection algorithms in shorter WBs may be challenged by noisy signals and presence of shuffling and weight transfer activities (Del Din et al., 2016a,c; Atrsaei et al., 2021). The other possible reason could be methodological: the choice of WB duration across which results are averaged may impact on gait differences, i.e., between-group gait differences found in medium WBs (which represent a high percentage of the total number of WBs) may drive results even when data are combined with results from longer WB durations as these represent a lower percentage and may offer reduced statistical power when making group comparisons (Supplementary Figures 1, 2, 6). Moreover, asymmetry that is quantified during shorter walking bouts in the real world may be linked to necessary gait adaptations to navigate complex environments. Results from medium duration WBs (e.g., 30–60 s) were comparable to those from longer WBs (>60 or >120 s) even though the latter represented only a small percentage [1.83% (PD) and 1.89% (HC)] of the total WBs. Gait characteristics, from every domain, were significantly different between PD and HC for medium-to-long WBs (30–60 s, >60, >120 s). These results are in line with previous work where the largest differences between the PD and HC groups were found in the longer WBs (>120 s) (Del Din et al., 2016a).

Impact of Environment and Walking Bout Duration on Parkinson’s Disease Classification

Parkinson’s disease (PD) classification was more accurate for lab-based gait assessment than the real-world when all the WBs were combined. Our findings for step velocity and step length yielded greater AUC, while previous work, which focused on other biomechanical characteristics, showed that foot strike angle resulted as the gait characteristic providing highest AUC (Shah et al., 2020a,c). This could be due to the different protocol (2-min walk vs. 7 min walk), cohort characteristics, and sample size (our study group mean MDS-UPDRS III score: 31; PD n = 47 while Shah’s group mean MDS-UPDRS III score: 35 with PD n = 29). Discrepancies between studies show how, depending on the PD cohort disease severity, stage, sample size, and different gait characteristics may lead to higher classification performances as reflecting various level of impairment and progression. In addition, novel insights from our work showed that gait was more asymmetrical in the real-world, and this domain resulted in higher AUC than lab-based asymmetry results.

Because real-world gait presents a heterogeneous distribution, combining all WBs may increase spread of the data thereby “masking” significant differences between groups (Del Din et al., 2016a,b, 2019; Shah et al., 2020a,c,b; Atrsaei et al., 2021). As indicated previously, gait assessment conditions, such as lab and real-world, directly influence gait characteristics (Del Din et al., 2016a,2019; Van Ancum et al., 2019; Shah et al., 2020c) with optimal walking capacity found under brief testing conditions (lab) compared to real-world performance (what people actually do during everyday activities) (Maetzler et al., 2020). ML models are directly influenced by gait features obtained under these different environments, which in turn impact classification accuracy. Therefore, combining all walking bouts obtained in the real-world can result in less optimal performance in the ML classifiers explaining our findings.

Data Aggregation by Walking Bout Duration

No previous study has investigated the impact of WB durations on the classification of PD using ML approaches. However, within univariate gait analysis, based on Del Din et al. (2016a), longer WBs were found to be better at discriminating PD from HC. This is in contrast to Shah et al. (2020a) where 90% of participants presented with WBs less than 53 strides. Therefore, only gait characteristics from short WBs < 12 strides (<24 steps) were found to be reliable and more sensitive when discriminating PD from HC. However, due to the small sample size in Shah et al. (2020a), the effect of longer WB was possibly dampened (e.g., WBs > 60 s long can have more than 53 strides) and had not been comprehensively investigated. Other factors, such as the algorithms, sensor location, and the protocol used and experimental set-up all influence the findings. In the present study, the sensor was attached on the lower back, and for Shah et al. (2020), sensors were attached to the ankles. The comprehensive approach taken in this study (i.e., quantifying various WB durations with reasonable sample size), highlighted that longer WBs were better for discriminating PD from HC.

Overall, the random forest classifier gave better classification performance as compared to SVM (Rehman et al., 2019a). ML models gave optimal performance from WBs > 60 s followed by 30 < WBs ≤ 60 s and WBs > 120 s compared to lab and short WBs (<30 s). As discussed, real-world walking leads to various WB durations with a variety of gait speeds (Del Din et al., 2016a,2019). In real-world conditions, both PD and HC groups performed a large number of very short WBs (e.g., <10 s) rather than prolonged WBs (e.g., >120 s). Short WBs most likely reflect habitual behaviors and moving in a constrained environment, such as a house, while longer WBs may represent walking outdoors which influence gait characteristics and ultimately the accuracy of the classifier. This is evident from the present results as shorter WBs (<10 s) demonstrated poor discriminative performance compared to longer (>60 s) WBs (accuracy of 56–59% vs. accuracy of 68–76%).

The most influential characteristics for the classification of PD were related to pace and rhythm. Particularly, step velocity, step length, swing time, stance time, swing time variability, and step time, which were identified by both random forest and SVM. The results are in line with previous work (Rehman et al., 2019a,b, 2020a) which showed that pace (step velocity and step length) are the most common and influential characteristics for not only differentiating early-stage PD in univariate fashion (e.g., t-test or with AUC), but also in ML classifiers. In addition, based on the AUC values, pace characteristics (e.g., step velocity, step length) gave optimal performance in both lab and real-world data. However, the best results were obtained in the real-world for longer WBs (30–60 s, >60 s, and >120 s). The results from this study are in line with the previous work (Shah et al., 2020a,c) where the effect of WB duration influenced the AUC, and real-world gait was found to be more sensitive for discrimination purposes.

Key Insights With Clinical Implications

• During the 2-min continuous walk in the lab, both PD and HC groups walked faster, with quicker and longer steps, lower variability, and higher asymmetry than when in the real-world, regardless of WB duration. Lab based assessment represents gait capacity, whereas real-world data reflect gait performance.

• Group differences (PD vs. HC) in gait, both in the lab and real-world conditions when combining all WBs, showed that PD participants walked slower, with shorter steps than HC. However, in the real world, significant between-group differences were influenced by WB duration (i.e., identified for WB longer than 60 or 120 s for all gait characteristics apart from those related to asymmetry). From a clinical perspective, the assessment in the clinic and outside the clinic can contain similar information. However, walking performance assessed over longer walks can offer increased sensitivity.

• Lab and real-world gait assessments assess different aspects of gait. No correlation or weak-to-moderate association was observed between the assessments. In routine clinical practice, these two streams of information can reflect different gait constructs and therefore provide complementary information to support clinical decision making.

• Individual gait characteristics measured in the real-world and averaged across all WBs (univariate analysis) had relatively low AUC values compared to gait assessed in the lab. However, specific real-world WB durations (i.e., longer 30–60 s, >60 s, >120 s) give higher AUC compared to lab-based gait assessment. This reinforces the need to consider the impact of real-world data aggregation levels for targeting specific clinical questions/aspects (e.g., classification of PD).

• With ML-based multivariate analysis, choice of environment (lab vs. real-world) and data aggregation by WB durations clearly impacted on the ML classifier performance. Our findings suggest that ML-based models should be tested on the real-world longer WBs in clinical practice as an informed pre-screening decision-making tool for PD.

• Gait assessed with wearables in the real-world paired with ML gave reasonably accurate classification performance at early stages compared to current gold standard PD clinical diagnostics. This inexpensive and objective solution motivates its adoption in clinical practice and could be a promising addition to the current clinical diagnostic toolkit and complement clinical decision-making. An accurate early diagnosis of PD is important to ensure that timely and targeted treatments (both pharmacological and non-pharmacological) can be provided.

Limitations and Recommendations for Future Work

There are limitations to this work. From the lab-based gait assessment, only 2 min of continuous walking were utilized and compared to 60 < WBs ≤ 120 s or 90 < WBs ≤ 120 s (Supplementary Figures 1, 2) of real-world gait data. However, real-world walking comprises additional complexity, with varying visual stimuli (i.e., day, night), cognitive load (single and dual task), and motor demand (i.e., uphill, downhill), which is not reflected in lab-based gait assessments. The context (e.g., indoor vs. outdoor walking) in which short and long walking bouts happen is not measured in this work. Future work is required to develop methods for characterizing contextual information. Understanding under what scenarios gait assessment could improve the classification results. In the lab, PD participants were assessed one hour after medication intake. However, in the real-world gait assessment, we could not objectively control and assess the effect of medication on gait performance. Future studies should investigate this and the effect that medication “ON” and “OFF” states could have on the results. The ML-based findings from this early PD cohort with an average disease duration of 26 months may not be generalizable to advanced PD stages. In this work, only 14 clinically relevant gait characteristics based on the heel strike and toe-off gait events were considered. In future studies, other signal-based characteristics independent of foot contact detection should be compared. Furthermore, in routine clinical practice, misdiagnosis of PD can delay subsequent intervention and treatments. Therefore, future work should look at ML classification of PD vs. atypical parkinsonian disorders (i.e., Multiple System Atrophy and Progressive Supranuclear Palsy) to identify discriminatory gait features.

Conclusion

In this study, we investigated the impact of environment and data aggregation by WB duration on gait characteristics and on the performance of ML models for the classification of PD. Real-world gait characteristics aggregated over medium to long WBs (e.g., WBs > 30 s) gave better discrimination performance (0.51 ≤ AUC ≤ 0.77) compared to lab-based gait characteristics (0.51 ≤ AUC ≤ 0.72), with real-world step velocity showing the highest AUC (0.77). Gait data aggregation by WB durations influenced ML classification performance. ML models applied to real-world gait showed better classification performance compared to lab data. Overall, RF trained on 14 gait characteristics aggregated over WBs > 60 s gave better performance (F1 score = 77.20 ± 5.51%) as compared to lab-based data. Findings from this study suggest that choice of environment and data aggregation by WB duration are important to achieve maximum discrimination performance and have a direct impact on ML performance for PD classification.

Data Availability Statement

All the relevant data is either reported in the form of table or displayed in figure. The plots of all the digital gait characteristics are presented in the Supplementary Material. Due to data privacy and sharing agreement, raw dataset is not publically available. However, it can be made available upon reasonable request from the corresponding author. Requests to access these datasets should be directed to SD, c2lsdmlhLmRlbGRpbkBuY2wuYWMudWs=.

Ethics Statement

The studies involving human participants were reviewed and approved by the Newcastle and North Tyneside Research Ethics Committee (REC No. 09/H0906/82). Study procedures were conducted according to Declaration of Helsinki. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

RR conceptualized and designed the study, and performed data analysis, statistical analysis, drafting, and critical revision of the manuscript. SD helped in conceptualization of this study, interpretation of data, and critical revision of the manuscript for important intellectual content. YG and JS provided support for statistical analysis, interpretation, and critical revision of the manuscript for important intellectual content. LA, AY, and LR were involved in interpretation of data and critical revision of the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by “Keep Control” project which is European Union’s Horizon 2020 Research and Innovation ITN program under the Marie Skłodowska-Curie grant agreement no. 721577. The ICICLE-Gait study was supported by Parkinson’s United Kingdom (J-0802, G-1301) and by the National Institute for Health Research (NIHR) Newcastle Biomedical Research Center (BRC) based at Newcastle upon Tyne Hospital NHS Foundation Trust and Newcastle University. This work was also supported by the NIHR/Wellcome Trust Clinical Research Facility (CRF) infrastructure at Newcastle upon Tyne Hospitals NHS Foundation Trust. All opinions are those of the authors and not the funders. Subsequent to this work, RR and SD were supported by the Innovative Medicines Initiative 2 Joint Undertaking (IMI2 JU) project IDEA-FAST–Grant Agreement 853981. In addition LR, AY, LA, and SD were also supported by the Innovative Medicines Initiative 2 Joint Undertaking (IMI2 JU) project Mobilise-D that has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement no. 820820. This JU received support from the European Union’s Horizon 2020 Research and Innovation Program and the European Federation of Pharmaceutical Industries and Associations (EFPIA).

Author Disclaimer

Content in this publication reflects the authors’ view and neither IMI nor the European Union, EFPIA, or any Associated Partners are responsible for any use that may be made of the information contained herein.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to thank all the participants and assessors of the ICICLE study, Rachael Lawson, and Brook Galna for their support.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2022.808518/full#supplementary-material

References

Atrsaei, A., Corrà, M. F., Dadashi, F., Vila-Chã, N., Maia, L., Mariani, B., et al. (2021). Gait speed in clinical and daily living assessments in Parkinson’s disease patients: performance versus capacity. NPJ Parkinsons Dis. 7:24. doi: 10.1038/s41531-021-00171-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Barry, G., Galna, B., Lord, S., Rochester, L., and Godfrey, A. (2015). Defining ambulatory bouts in free-living activity: impact of brief stationary periods on bout metrics. Gait Posture 42, 594–597. doi: 10.1016/j.gaitpost.2015.07.062

PubMed Abstract | CrossRef Full Text | Google Scholar

Buckley, C., Alcock, L., Mcardle, R., Rehman, R. Z. U., Del Din, S., Mazzà, C., et al. (2019). The role of movement analysis in diagnosing and monitoring neurodegenerative conditions: insights from gait and postural control. Brain Sci. 9:34. doi: 10.3390/brainsci9020034

PubMed Abstract | CrossRef Full Text | Google Scholar

De Bruin, E. D., Najafi, B., Murer, K., Uebelhart, D., and Aminian, K. (2007). Quantification of everyday motor function in a geriatric population. J. Rehabil. Res. Dev. 44, 417–428. doi: 10.1682/jrrd.2006.01.0003

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Galna, B., Godfrey, A., Bekkers, E. M., Pelosin, E., Nieuwhof, F., et al. (2019). Analysis of free-living gait in older adults with and without Parkinson’s disease and with and without a history of falls: identifying generic and disease-specific characteristics. J. Gerontol. Ser. A 74, 500–506. doi: 10.1093/gerona/glx254

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Godfrey, A., Galna, B., Lord, S., and Rochester, L. (2016a). Free-living gait characteristics in ageing and Parkinson’s disease: impact of environment and ambulatory bout length. J. Neuroeng. Rehabil. 13:46. doi: 10.1186/s12984-016-0154-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Godfrey, A., Mazzà, C., Lord, S., and Rochester, L. (2016b). Free-living monitoring of Parkinson’s disease: lessons from the field. Mov. Disord. 31, 1293–1313. doi: 10.1002/mds.26718

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Godfrey, A., and Rochester, L. (2016c). Validation of an accelerometer to quantify a comprehensive battery of gait characteristics in healthy older adults and Parkinson’s disease: toward clinical and at home use. IEEE J. Biomed. Health Inf. 20, 838–847. doi: 10.1109/JBHI.2015.2419317

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Hickey, A., Hurwitz, N., Mathers, J. C., Rochester, L., and Godfrey, A. (2016d). Measuring gait with an accelerometer-based wearable: influence of device location, testing protocol and age. Physiol. Meas. 37, 1785–1797. doi: 10.1088/0967-3334/37/10/1785

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Din, S., Yarnall, A. J., Barber, T. R., Lo, C., Crabbe, M., Rolinski, M., et al. (2020). Continuous real-world gait monitoring in idiopathic REM sleep behavior disorder. J. Parkinsons Dis. 10, 283–299. doi: 10.3233/JPD-191773

PubMed Abstract | CrossRef Full Text | Google Scholar

Dorsey, E. R., Elbaz, A., Nichols, E., Abd-Allah, F., Abdelalim, A., Adsuar, J. C., et al. (2018). Global, regional, and national burden of Parkinson’s disease, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 17, 939–953. doi: 10.1016/S1474-4422(18)30295-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Dorsey, E. R., George, B. P., Leff, B., and Willis, A. W. (2013). The coming crisis: obtaining care for the growing burden of neurodegenerative conditions. Neurology 80, 1989–1996. doi: 10.1212/WNL.0b013e318293e2ce

PubMed Abstract | CrossRef Full Text | Google Scholar

El-Gohary, M., Pearson, S., Mcnames, J., Mancini, M., Horak, F., Mellone, S., et al. (2014). Continuous monitoring of turning in patients with movement disability. Sensors 14, 356–369. doi: 10.3390/s140100356

PubMed Abstract | CrossRef Full Text | Google Scholar

Emamzadeh, F. N., and Surguchov, A. (2018). Parkinson’s disease: biomarkers, treatment, and risk factors. Front. Neurosci. 12:612. doi: 10.3389/fnins.2018.00612

PubMed Abstract | CrossRef Full Text | Google Scholar

Evers, L. J., Raykov, Y. P., Krijthe, J. H., De Lima, A. L. S., Badawy, R., Claes, K., et al. (2020). Real-life gait performance as a digital biomarker for motor fluctuations: the Parkinson@ home validation study. J. Med. Internet Res. 22:e19068. doi: 10.2196/19068

PubMed Abstract | CrossRef Full Text | Google Scholar

Feigin, V. L., Nichols, E., Alam, T., Bannick, M. S., Beghi, E., Blake, N., et al. (2019). Global, regional, and national burden of neurological disorders, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 18, 459–480. doi: 10.1016/S1474-4422(18)30499-X

PubMed Abstract | CrossRef Full Text | Google Scholar

Galperin, I., Hillel, I., Del Din, S., Bekkers, E. M., Nieuwboer, A., Abbruzzese, G., et al. (2019). Associations between daily-living physical activity and laboratory-based assessments of motor severity in patients with falls and Parkinson’s disease. Parkinsonism Relat. Disord. 62, 85–90. doi: 10.1016/j.parkreldis.2019.01.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Ghoraani, B., Hssayeni, M. D., Bruack, M. M., and Jimenez-Shahed, J. (2019). Multilevel features for sensor-based assessment of motor fluctuation in Parkinson’s disease subjects. IEEE J. Biomed. Health Inf. 24, 1284–1295. doi: 10.1109/JBHI.2019.2943866

PubMed Abstract | CrossRef Full Text | Google Scholar

Godfrey, A., Del Din, S., Barry, G., Mathers, J. C., and Rochester, L. (2015). Instrumenting gait with an accelerometer: a system and algorithm examination. Med. Eng. Phys. 37, 400–407. doi: 10.1016/j.medengphy.2015.02.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Godfrey, A., Lord, S., Galna, B., Mathers, J. C., Burn, D. J., and Rochester, L. (2014). The association between retirement and age on physical activity in older adults. Age Ageing 43, 386–393. doi: 10.1093/ageing/aft168

PubMed Abstract | CrossRef Full Text | Google Scholar

Goetz, C. G., Tilley, B. C., Shaftman, S. R., Stebbins, G. T., Fahn, S., Martinez-Martin, P., et al. (2008). Movement disorder society-sponsored revision of the unified Parkinson’s disease rating scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23, 2129–2170. doi: 10.1002/mds.22340

PubMed Abstract | CrossRef Full Text | Google Scholar

González, R. C., López, A. M., Rodriguez-Uría, J., Alvarez, D., and Alvarez, J. C. (2010). Real-time gait event detection for normal subjects from lower trunk accelerations. Gait Posture 31, 322–325. doi: 10.1016/j.gaitpost.2009.11.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Hickey, A., Del Din, S., Rochester, L., and Godfrey, A. (2016). Detecting free-living steps and walking bouts: validating an algorithm for macro gait analysis. Physiol. Meas. 38, N1–N15. doi: 10.1088/1361-6579/38/1/N1

PubMed Abstract | CrossRef Full Text | Google Scholar

Hillel, I., Gazit, E., Nieuwboer, A., Avanzino, L., Rochester, L., Cereatti, A., et al. (2019). Is every-day walking in older adults more analogous to dual-task walking or to usual walking? Elucidating the gaps between gait performance in the lab and during 24/7 monitoring. Eur. Rev. Aging Phys. Act. 16:6. doi: 10.1186/s11556-019-0214-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Hobert, M. A., Nussbaum, S., Heger, T., Berg, D., Maetzler, W., and Heinzel, S. (2019). Progressive gait deficits in Parkinson’s disease: a wearable-based biannual 5-year prospective study. Front. Aging Neurosci. 11:22. doi: 10.3389/fnagi.2019.00022

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoehn, M. M., and Yahr, M. D. (1998). Parkinsonism: onset, progression, and mortality. Neurology 50:318. doi: 10.1212/wnl.50.2.318

PubMed Abstract | CrossRef Full Text | Google Scholar

Jankovic, J. (2008). Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79, 368–376.

Google Scholar

Khoo, T. K., Yarnall, A. J., Duncan, G. W., Coleman, S., O’brien, J. T., Brooks, D. J., et al. (2013). The spectrum of nonmotor symptoms in early Parkinson disease. Neurology 80, 276–281. doi: 10.1212/WNL.0b013e31827deb74

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawson, R. A., Yarnall, A. J., Duncan, G. W., Breen, D. P., Khoo, T. K., Williams-Gray, C. H., et al. (2016). Cognitive decline and quality of life in incident Parkinson’s disease: the role of attention. Parkinsonism Relat. Disord. 27, 47–53. doi: 10.1016/j.parkreldis.2016.04.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Lord, S., Galna, B., Verghese, J., Coleman, S., Burn, D., and Rochester, L. (2013). Independent domains of gait in older adults and associated motor and nonmotor attributes: validation of a factor analysis approach. J. Gerontol. Ser. A 68, 820–827. doi: 10.1093/gerona/gls255

PubMed Abstract | CrossRef Full Text | Google Scholar

Maetzler, W., Domingos, J., Srulijes, K., Ferreira, J. J., and Bloem, B. R. (2013). Quantitative wearable sensors for objective assessment of Parkinson’s disease. Mov. Disord. 28, 1628–1637. doi: 10.1002/mds.25628

PubMed Abstract | CrossRef Full Text | Google Scholar

Maetzler, W., Rochester, L., Bhidayasiri, R., Espay, A. J., Sánchez-Ferro, A., and Van Uem, J. M. (2020). Modernizing daily function assessment in Parkinson’s disease using capacity, perception, and performance measures. Mov. Disord. 36, 76–82. doi: 10.1002/mds.28377

PubMed Abstract | CrossRef Full Text | Google Scholar

Mancini, M., King, L., Salarian, A., Holmstrom, L., Mcnames, J., and Horak, F. B. (2011). Mobility lab to assess balance and gait with synchronized body-worn sensors. J. Bioeng. Biomed. Sci. 12(Suppl. 1):7. doi: 10.4172/2155-9538.S1-007

PubMed Abstract | CrossRef Full Text | Google Scholar

McCamley, J., Donati, M., Grimpampi, E., and Mazza, C. (2012). An enhanced estimate of initial contact and final contact instants of time using lower trunk inertial sensor data. Gait Posture 36, 316–318. doi: 10.1016/j.gaitpost.2012.02.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Moe-Nilssen, R. (1998). A new method for evaluating motor control in gait under real-life environmental conditions. Part 2: gait analysis. Clin. Biomech. 13, 328–335. doi: 10.1016/s0268-0033(98)00090-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Nasreddine, Z. S., Phillips, N. A., Bédirian, V., Charbonneau, S., Whitehead, V., Collin, I., et al. (2005). The Montreal cognitive assessment, MoCA: a brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 53, 695–699. doi: 10.1111/j.1532-5415.2005.53221.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nutt, J. G., and Wooten, G. F. (2005). Diagnosis and initial management of Parkinson’s disease. N. Engl. J. Med. 353, 1021–1027.

Google Scholar

Orendurff, M. S., Schoen, J. A., Bernatz, G. C., Segal, A. D., and Klute, G. K. (2008). How humans walk: bout duration, steps per bout, and rest duration. J. Rehabil. Res. Dev. 45, 1077–1089. doi: 10.1682/jrrd.2007.11.0197

PubMed Abstract | CrossRef Full Text | Google Scholar

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830. doi: 10.1080/13696998.2019.1666854

PubMed Abstract | CrossRef Full Text | Google Scholar

Perneger, T. V. (1998). What’s wrong with Bonferroni adjustments. BMJ 316, 1236–1238. doi: 10.1136/bmj.316.7139.1236

PubMed Abstract | CrossRef Full Text | Google Scholar

Postuma, R. B., Berg, D., Stern, M., Poewe, W., Olanow, C. W., Oertel, W., et al. (2015). MDS clinical diagnostic criteria for Parkinson’s disease. Mov. Disord. 30, 1591–1601. doi: 10.1002/mds.26424

PubMed Abstract | CrossRef Full Text | Google Scholar

Powell, L. E., and Myers, A. M. (1995). The activities-specific balance confidence (ABC) scale. J. Gerontol. Ser. A Biol. Sci. Med. Sci. 50, M28–M34.

Google Scholar

Przedborski, S., Vila, M., and Jackson-Lewis, V. (2003). Series introduction: neurodegeneration: what is it and where are we? J. Clin. Invest. 111, 3–10. doi: 10.1089/brain.2021.0059

PubMed Abstract | CrossRef Full Text | Google Scholar

Rehman, R. Z. U., Del Din, S., Guan, Y., Yarnall, A. J., Shi, J. Q., and Rochester, L. (2019a). Selecting clinically relevant gait characteristics for classification of early Parkinson’s disease: a comprehensive machine learning approach. Sci. Rep. 9:17269. doi: 10.1038/s41598-019-53656-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Rehman, R. Z. U., Del Din, S., Shi, J. Q., Galna, B., Lord, S., Yarnall, A. J., et al. (2019b). Comparison of walking protocols and gait assessment systems for machine learning-based classification of Parkinson’s disease. Sensors 19:5363. doi: 10.3390/s19245363

PubMed Abstract | CrossRef Full Text | Google Scholar

Rehman, R. Z. U., Buckley, C., Micó-Amigo, M. E., Kirk, C., Dunne-Willows, M., Mazzà, C., et al. (2020a). Accelerometry-based digital gait characteristics for classification of Parkinson’s disease: what counts? IEEE Open J. Eng. Med. Biol. 1, 65–73.

Google Scholar

Rehman, R. Z. U., Klocke, P., Hryniv, S., Galna, B., Rochester, L., Del Din, S., et al. (2020b). Turning detection during gait: algorithm validation and influence of sensor location and turning characteristics in the classification of Parkinson’s disease. Sensors 20:5377. doi: 10.3390/s20185377

PubMed Abstract | CrossRef Full Text | Google Scholar

Rizzo, G., Copetti, M., Arcuti, S., Martino, D., Fontana, A., and Logroscino, G. (2016). Accuracy of clinical diagnosis of Parkinson disease. Neurology 86, 566–576. doi: 10.1212/wnl.0000000000002350

PubMed Abstract | CrossRef Full Text | Google Scholar

Robles-García, V., Corral-Bergantiños, Y., Espinosa, N., Jácome, M. A., García-Sancho, C., Cudeiro, J., et al. (2015). Spatiotemporal gait patterns during overt and covert evaluation in patients with Parkinson’s disease and healthy subjects: is there a Hawthorne effect? J. Appl. Biomech. 31, 189–194. doi: 10.1123/jab.2013-0319

PubMed Abstract | CrossRef Full Text | Google Scholar

Rothman, K. J. (1990). No adjustments are needed for multiple comparisons. Epidemiology 1, 43–46.

PubMed Abstract | Google Scholar

Shah, V. V., Mcnames, J., Harker, G., Mancini, M., Carlson-Kuhta, P., Nutt, J. G., et al. (2020a). Effect of bout length on gait measures in people with and without Parkinson’s disease during daily life. Sensors 20:5769. doi: 10.3390/s20205769

PubMed Abstract | CrossRef Full Text | Google Scholar

Shah, V. V., Mcnames, J., Mancini, M., Carlson-Kuhta, P., Nutt, J. G., El-Gohary, M., et al. (2020b). Digital biomarkers of mobility in Parkinson’s disease during daily living. J. Parkinsons Dis. 10, 1099–1111. doi: 10.3233/JPD-201914

PubMed Abstract | CrossRef Full Text | Google Scholar

Shah, V. V., Mcnames, J., Mancini, M., Carlson-Kuhta, P., Spain, R. I., Nutt, J. G., et al. (2020c). Laboratory versus daily life gait characteristics in patients with multiple sclerosis, Parkinson’s disease, and matched controls. J. Neuroeng. Rehabil. 17:159. doi: 10.1186/s12984-020-00781-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Takayanagi, N., Sudo, M., Yamashiro, Y., Lee, S., Kobayashi, Y., Niki, Y., et al. (2019). Relationship between daily and in-laboratory gait speed among healthy community-dwelling older adults. Sci. Rep. 9:3496. doi: 10.1038/s41598-019-39695-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomlinson, C. L., Stowe, R., Patel, S., Rick, C., Gray, R., and Clarke, C. E. (2010). Systematic review of levodopa dose equivalency reporting in Parkinson’s disease. Mov. Disord. 25, 2649–2653. doi: 10.1002/mds.23429

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Ancum, J. M., Van Schooten, K. S., Jonkman, N. H., Huijben, B., Van Lummel, R. C., Meskers, C. G., et al. (2019). Gait speed assessed by a 4-m walk test is not representative of daily-life gait speed in community-dwelling adults. Maturitas 121, 28–34. doi: 10.1016/j.maturitas.2018.12.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Viceconti, M., Hernandez Penna, S., Dartee, W., Mazzà, C., Caulfield, B., Becker, C., et al. (2020). Toward a regulatory qualification of real-world mobility performance biomarkers in Parkinson’s patients using digital mobility outcomes. Sensors 20:5920. doi: 10.3390/s20205920

PubMed Abstract | CrossRef Full Text | Google Scholar

von Campenhausen, S., Winter, Y., Rodrigues e Silva, A., Sampaio, C., Ruzicka, E., Barone, P., et al. (2011). Costs of illness and care in Parkinson’s disease: an evaluation in six countries. Eur. Neuropsychopharmacol. 21, 180–191. doi: 10.1016/j.euroneuro.2010.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Warmerdam, E., Hausdorff, J. M., Atrsaei, A., Zhou, Y., Mirelman, A., Aminian, K., et al. (2020). Long-term unsupervised mobility assessment in movement disorders. Lancet Neurol. 19, 462–470. doi: 10.1016/S1474-4422(19)30397-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Weiss, A., Brozgol, M., Dorfman, M., Herman, T., Shema, S., Giladi, N., et al. (2013). Does the evaluation of gait quality during daily life provide insight into fall risk? A novel approach using 3-day accelerometer recordings. Neurorehabil. Neural Repair 27, 742–752. doi: 10.1177/1545968313491004

PubMed Abstract | CrossRef Full Text | Google Scholar

Weiss, A., Herman, T., Giladi, N., and Hausdorff, J. M. (2014). Objective assessment of fall risk in Parkinson’s disease using a body-fixed sensor worn for 3 days. PLoS One 9:e96675. doi: 10.1371/journal.pone.0096675

PubMed Abstract | CrossRef Full Text | Google Scholar

Yarnall, A. J., Breen, D. P., Duncan, G. W., Khoo, T. K., Coleman, S. Y., Firbank, M. J., et al. (2014). Characterizing mild cognitive impairment in incident Parkinson disease: the ICICLE-PD study. Neurology 82, 308–316. doi: 10.1212/WNL.0000000000000066

PubMed Abstract | CrossRef Full Text | Google Scholar

Zijlstra, A., and Zijlstra, W. (2013). Trunk-acceleration based assessment of gait parameters in older persons: a comparison of reliability and validity of four inverted pendulum based estimations. Gait Posture 38, 940–944. doi: 10.1016/j.gaitpost.2013.04.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Zijlstra, W., and Hof, A. L. (2003). Assessment of spatio-temporal gait parameters from trunk accelerations during human walking. Gait Posture 18, 1–10. doi: 10.1016/s0966-6362(02)00190-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Parkinson’s disease, gait, real-world, accelerometer, machine learning, laboratory, gait aggregation, wearables

Citation: Rehman RZU, Guan Y, Shi JQ, Alcock L, Yarnall AJ, Rochester L and Del Din S (2022) Investigating the Impact of Environment and Data Aggregation by Walking Bout Duration on Parkinson’s Disease Classification Using Machine Learning. Front. Aging Neurosci. 14:808518. doi: 10.3389/fnagi.2022.808518

Received: 03 November 2021; Accepted: 14 February 2022;
Published: 22 March 2022.

Edited by:

Ondrej Bezdicek, Charles University, Czechia

Reviewed by:

Federico Villagra, Aberystwyth University, United Kingdom
Thomas Schauer, Technical University of Berlin, Germany

Copyright © 2022 Rehman, Guan, Shi, Alcock, Yarnall, Rochester and Del Din. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Silvia Del Din, c2lsdmlhLmRlbC1kaW5AbmNsLmFjLnVr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Investigating the Impact of Environment and Data Aggregation by Walking Bout Duration on Parkinson’s Disease Classification Using Machine Learning

Introduction

Materials and Methods

Participants

Demographics and Clinical Characteristics

Gait Assessment

Equipment and Protocol

Data Processing and Extraction of Gait Characteristics–Lab

Data Processing, Extraction of Gait Characteristics, and Data Aggregation–Real-World

Statistical Analysis

Machine Learning Classification of Parkinson’s Disease

Results

Impact of Environment and Data Aggregation on Gait Characteristics

Overall Statistics

Impact of Environment [Lab vs. Real-World (All Walking Bouts)] on Between-Group Differences (Parkinson’s Disease vs. Healthy Control)

Lab vs. Real-World (All Walking Bouts)

Parkinson’s Disease vs. Healthy Control

Effect of Real-World Data Aggregation on Gait

Distribution of Data Aggregation by Walking Bout Durations

Effect of Data Aggregation by Walking Bout Durations on Between Group Differences

Association Between Gait Characteristics Collected in the Lab vs. Real-World

Impact of Environment and Data Aggregation on Classification of Parkinson’s Disease

Machine Learning Performance in the Lab vs. Real-World

Impact of Data Aggregation by Walking Bout Duration on Machine Learning Performance

Discussion

Impact of Environment and Data Aggregation on Gait Characteristics

Lab vs. Real-World

Parkinson’s Disease vs. Healthy Control

Impact of Environment and Walking Bout Duration on Parkinson’s Disease Classification

Data Aggregation by Walking Bout Duration

Key Insights With Clinical Implications

Limitations and Recommendations for Future Work

Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Author Disclaimer

Conflict of Interest

Publisher’s Note

Acknowledgments

Supplementary Material

References

94% of researchers rate our articles as excellent or good