ORIGINAL RESEARCH article

Front. Neuroinform., 12 February 2025

Volume 19 - 2025 | https://doi.org/10.3389/fninf.2025.1544372

Quantitative evaluation method of stroke association based on multidimensional gait parameters by using machine learning

Cheng Wang,,Cheng Wang1,2,3Zhou Long,Zhou Long1,2Xiang-Dong Wang,Xiang-Dong Wang4,5You-Qi KongYou-Qi Kong6Li-Chun ZhouLi-Chun Zhou6Wei-Hua JiaWei-Hua Jia7Pei LiPei Li8Jing WangJing Wang8Xiao-Juan WangXiao-Juan Wang8Tian Tian
Tian Tian8*
  • 1Jinan Zhougke Ubiquitous-Intelligent Institute of Computing Technology, Jinan, China
  • 2Shandong Academy of Intelligent Computing Technology, Shandong Institutes of Industrial Technology (SDIIT), Jinan, China
  • 3Ningbo Institute of Information Technology Application CAS, Ningbo, China
  • 4Bejing Key Laboratory of Mobile Computing and Pervasive Device, Beijing, China
  • 5Institute of Computing Technology (ICT) Chinese Academy of Sciences (CAS), Beijing, China
  • 6Department of Neurology, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China
  • 7Department of Neurology, Beijing Shijingshan Hospital, Shijingshan Teaching Hospital of Capital Medical University, Beijing, China
  • 8General Practice Department, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China

Objective: NIHSS for stroke is widely used in clinical, but it is complex and subjective. The purpose of the study is to present a quantitative evaluation method of stroke association based on multi-dimensional gait parameters by using machine learning.

Methods: 39 ischemic stroke patients with hemiplegia were selected as the stroke group and 187 healthy adults from the community as the control group. Gaitboter system was used for gait analysis. Through the labeling of stroke patients by clinicians with NIHSS score, all gait parameters obtained were used to select appropriate gait parameters. By using machine learning algorithm, a discriminant model and a hierarchical model were trained.

Results: The discriminant model was used to distinguish between healthy people and stroke patients. The overall detection accuracy of the model based on KNN, SVM and Randomforest algorithms is 92.86, 92.86 and 90.00%, respectively. The hierarchical model was used to judge the severity of stroke in stroke patients. The model based on Randomforest, SVM and AdaBoost algorithm had an overall detection accuracy of 71.43, 85.71 and 85.71%, respectively.

Conclusion: The proposed stroke association quantitative evaluation method based on multi-dimensional gait parameters has the characteristics of high accuracy, objectivity, and quantification.

Graphical abstract
www.frontiersin.org

Graphical Abstract. Diagram of quantitative evaluation method of stroke association based on gait parameters.

1 Introduction

Stroke has a high incidence rate, mortality, disability, and recurrence rate, ranking among the leading causes of death among residents for many years (Wan and Dong, 2017; Wang W. et al., 2017). More than 60% of stroke patients still have different degrees of neurological dysfunction after treatment (Wang L. D. et al., 2018), which seriously affects the quality of life of patients. The main reasons for the delay before hospitalization of stroke patients were that they could not quickly identify the signs of stroke or did not give first aid at the first time. Early warning of the occurrence or recurrence of stroke and active emergency measures are of great significance for reducing the disability rate and mortality rate of stroke and improving the prognosis of patients (Nagao et al., 2020) Patients with hemiplegia need to assess the motor function of lower limbs, which is conducive to the formulation of rehabilitation plans, and the monitoring of limb function during the rehabilitation process, to adjust the treatment plan in time. NIHSS is the most widely used clinical and experimental stroke functional examination scale, which is recognized as reliable, effective, and sensitive (Nagao et al., 2020; Khalil and Lahoud, 2020). It can help clinicians accurately assess and communicate with each other about the neurological deficit of stroke patients and guide patients to make long-term rehabilitation and nursing plans (Wang Y. X. et al., 2016; Wu et al., 2019; Jauch et al., 2013). However, NIHSS contains many contents, which is complex and professional and cannot be mastered by ordinary patients outside the hospital. Therefore, for stroke evaluation, a simple, quantitative, objective, and rapid method or system is needed.

2 Related work

With the development of science and technology, gait analysis technology has been gradually applied to the gait research of stroke (Wan et al., 2014; Aoki et al., 2013; Estrada-Barranco et al., 2022; Mizuta et al., 2022; Chow and Stokic, 2021). At present, gait analysis in clinic is mostly used to reveal that there are obvious differences in gait of stroke patients in many aspects. By determining gait characteristics (gait parameters) of stroke patients, it can effectively evaluate the effect of rehabilitation quality of stroke patients (Seok et al., 2021; Hussain and Park, 2021; Titus et al., 2018; Hu et al., 2009). In addition, there are a small number of artificial intelligence studies on gait analysis of stroke patients, such as distinguishing whether it is stroke according to the symmetry, regularity, and stability of gait (Kim et al., 2022; Krawczyk et al., 2012), and there are also studies on classification and evaluation of gait of stroke patients (Li et al., 2019; Andrea et al., 2016; Kaczmarczyk et al., 2009; Cui et al., 2018). Almost no artificial intelligent studies of stroke are based on more comprehensive spatial–temporal characteristics (parameters) of gait.

In this paper, combined with the NIHSS score for stroke in clinic and gait analysis, the study is on quantitative evaluation and analysis of stroke association using machine learning method. Since the gait analysis system is wearable and easy to operate simply, the objective of the study is to warn the occurrence or recurrence of stroke outside the hospital, and to assist doctors in quantitative, objective and simple identification and grading of stroke degree in the hospital. Specifically, as it is shown in graphical abstract, through the labeling of stroke patients by clinicians with NIHSS score, all gait parameters obtained from gait analysis system were used to select appropriate gait parameters. By using machine learning algorithm, a discriminant model for distinguishing between normal healthy people and stroke patients, and a hierarchical model for judging the severity of stroke in stroke patients were trained. Finally, experiments were conducted to evaluate the effectiveness of the two trained models.

3 Materials and methods

3.1 Participants of study

39 ischemic stroke hemiplegic patients admitted to Beijing Chao-Yang Hospital, Capital Medical University from June 2018 to October 2018 were selected as the stroke group, including 24 males and 15 females, aged (63.4 ± 14.5) years, with a height of (169.3 ± 7.1) cm and a weight of (77.5 ± 7.5) kg; 187 healthy adults from the community served as the control group, including 68 males and 119 females, aged (47.4 ± 12.4) years, with a height of (165.2 ± 7.4) cm and a weight of (69.7 ± 17.6) kg. Inclusion criteria: ① patients with gait abnormality caused by acute stroke; ② the muscle strength of the affected limb is grade IV or above, and can walk independently; ③ the cognitive function is normal and can cooperate with the gait parameter acquisition process. Exclusion criteria: ① vestibular / cerebellar dysfunction and other lower limb muscle, bone and nervous system diseases that can cause walking disorders; ② Serious systemic diseases such as heart, lung, liver, and kidney dysfunction. Inclusion criteria of the control group: no nervous system, lower limb muscle, bone and other diseases, no serious cardiopulmonary diseases. The study was approved by the institutional review board of Beijing Chao-yang Hospital, Beijing China, and was conducted in accordance with the Declaration of Helsinki. All participants have signed informed consent forms. The study protocol was submitted and approved by the Research Ethics Committee of the Beijing Chao-Yang Hospital (the number is 2018-sci-153). Institutional Review Board reference date is 2018-6-11.

3.2 Test tools of gait analysis

The gait analysis equipment used in this study was the Gaitboter gait analysis system developed by the Institute of computing technology, Chinese Academy of Sciences. The system includes a wearable gait acquisition device and corresponding gait analysis software that integrates motion sensors, plantar membrane pressure and sound. Gaitboter is a sports shoe with a multi-sensor fusion gait data acquisition circuit built in. The sensors include accelerometers, gyroscopes, and plantar membrane pressure sensors (sampling rate is 80 Hz) and micro microphones (sampling rate is 4,000 Hz). The mobile phone or tablet with gait analysis software can collect and analyze the gait data obtained by the acquisition device through Bluetooth. Previous experiments show that due to the fusion of multiple sensor information, the accuracy of gait space–time parameters measured by the system used in this paper is higher than that measured by the system using only inertial sensors such as acceleration and gyroscope (Wang C. et al., 2016; Wang C. et al., 2017) at the same time, it has good sensitivity and reliability (Kong et al., 2018).

3.3 Test methods of gait analysis

① The experimental operator was an experienced doctor who explained the experimental process and precautions to the subjects before the test; ② The subjects wore comfortable and light clothes and chose wearable gait analysis equipment of appropriate size; ③ Choose a 10 m long and 3 m wide walkway with sufficient light, flat ground and no obstacles as the gait analysis walkway; ④ During the test, the subjects were required to walk in a straight line at a uniform speed with their own gait without interference. According to the actual situation of the subjects, tried to let the subjects walk more times, that is, different subjects walked different times, and each walking process covered about 2–50 steps. Gait information collected by wearable devices was transmitted to mobile phones or tablets through Bluetooth. The gait parameters of subjects were analyzed automatically. Before the formal test, the subjects needed to walk on the walkway twice to adapt to the environment.

4 Quantitative evaluation method of stroke association based on multidimensional gait parameters

The subjects in the stroke group were scored by experienced doctors according to the NIHSS to assess the severity of stroke. NIHSS consists of 11 evaluation indexes, including consciousness level, gaze, visual field, facial paralysis, upper limb movement, lower limb movement, etc. The higher the score, the worse the function. The minimum score is 0 and the maximum score is 42. In previous studies, Kongyouqi et al. (Kong et al., 2018), Gaitboter gait analysis system was used to analyze and detect the gait of stroke patients and healthy control group. It was found that compared with the control group, the stride of the stroke group was shorter, the gait frequency and speed were decreased, and the swing phase time, support phase and bipedal support time were prolonged (p < 0.01). Based on this conclusion, as it is shown in graphical abstract, this paper presents an intelligent gait analysis method for stroke: firstly, appropriate gait features were selected from all the existing gait features (gait parameters); secondly, stroke patients were labeled with NIHSS score by clinicians; finally a classification model for stroke patients (hereinafter referred to as the discrimination model) and a classification model for stroke severity (hereinafter referred to as the hierarchical model) through machine learning algorithm were trained. The establishment steps of the two models were basically divided into data set selection, data normalization processing, feature selection, classification algorithm selection and model training, which are described in detail below. Figure 1 shows the flow diagram of model construction.

Figure 1
www.frontiersin.org

Figure 1. Diagram of stroke gait analysis model establishment process.

4.1 Data set selection

Gait characteristics: A high-precision gait detection and parameter calculation method were adopted (Wang C. et al., 2016; Wang C. et al., 2017; Wang C. et al., 2018). A total of 32 gait characteristic parameters were calculated. According to the parameter categories, they were classified as follows (Table 1):

Table 1
www.frontiersin.org

Table 1. Gait characteristic parameters (Tian et al., 2021).

Feature correlation analysis: As it is shown in Figure 2, stride is positively correlated with step speed, and stride is also positively correlated with ground angle and the sagittal max angle; Step speed is positively correlated with the sagittal max angle; Left time is positively correlated with right swing phase; The cross min (max) angle is positively correlated with foot rotation; Ground angle is also positively correlated with step speed; The left (right) of cross min (max) angle is positively correlated with the left (right) of cross max (min) angle, which is consistent with the situation of normal gait walking. These gait feature parameters will be used as the features of our next model training data set, which is divided into training set and test set.

Figure 2
www.frontiersin.org

Figure 2. Relevance analysis of features.

As we know, by using all 32 feature parameters including the highly correlated ones, the machine learning models can be biased, therefore, reducing the features redundancy before using all 32 feature parameters in machine learning models is important. In addition, too many features may also lead to over-fitting and more calculation, and too few features may lead to under-fitting. In this paper, cross-validation was used to select the optimal feature combination and eliminate redundant features and aimed at evaluating the impact of different feature subsets on model performance (In the section Feature Selection, there is a detailed introduction).

Data label: For the label of the data in the data set (including training set and test set), we adopted the NIHSS score which is used to assess the severity of stroke patients. The NIHSS consists of 11 evaluation indexes, each of which has a score range of 0–4 points. A score of 0 indicates that the evaluation index has normal function, and the higher the score, the worse the function. After all the 11 evaluation indicators are evaluated, the total score of NIHSS will be generated. The minimum score is 0 and the maximum score is 42 (Hage, 2011), as it is shown in Table 2. Experienced clinicians score stroke patients in the data set through the NIHSS mentioned above, and it is data label.

Table 2
www.frontiersin.org

Table 2. Table of comparing between NIHSS scores and stroke severity.

4.2 Data normalization processing

The gait characteristic parameters described above represent different physical meanings, and the numerical magnitude is also very different. Therefore, before model training, we need to normalize all the data, otherwise, especially the features with large data values will have a great impact on model training (Duda, 2001). The normalization method is to quantitatively convert the values, that is, the values corresponding to each parameter are mapped to a small range. Here we take (−1, 1), and the normalization calculation formula is as follows in Equation 1:

x = 1 + 2 x x max x min     (1)

x is the value of the original data, x is the scaled value, x max is the maximum value and x min is the minimum value in the data.

4.3 Feature selection

Feature selection is very important for the final training model. Improper feature selection leads to poor classification performance of the trained model. Even if the feature selection is appropriate, the number (dimension) of features will also have a great impact on the classification performance of the final training model. Too many features may lead to over-fitting and more calculation, and too few features may lead to under-fitting. The feature selection methods used in this study is a kind of ensemble-learning-based methods that aim to construct a group of feature subsets from several different algorithms, and then produce an aggregated result out of the group. In this way, the instability and perturbation issues of single algorithm can be alleviated, and also, the subsequent learning (model training) tasks can be enhanced (Li et al., 2017). In this paper, we use Randomforest (Ho, 1995; Ho, 1998; Hastie et al., 2001; Mariana and Lucian, 2016; Biau and Scornet, 2015), AdaBoost (Joglekar, 2016; Kégl, 2013; Collins et al., 2002), GradientBoosting (Friedman, 2001; Mason et al., 1999; Zhu, 2016; Ayyadevara, 2018), DecisionTree (Rokach and Maimon, 2008; Freund and Mason, 2002; Fürnkranz, 2017). Although a comparison of Randomforest-based feature selection methods (Speiser et al., 2019) presented that the method VSURF (Genuer et al., 2015) & Boruta (Kursa and Rudnicki, 2010) implemented in the R package are preferable, combined to the characteristics of this study, the reason for not choosing this method is: (a) it is a feature selection method obtained by a single method (not ensemble-learning method), without considering the instability and perturbation issues of a single algorithm; (b) it also does not consider the coherence of subsequent learning (model training) tasks, such as the inconvenience of integration caused by the different environments of R language and the overall system implementation (Python); (c) our paper adopts an ensemble-learning-based method, which can improve the robustness of feature selection by integrating multiple model algorithms; (d) the method used in our paper integrates the feature importance of multiple model algorithms to fully control the process of feature selection, and it has good flexibility, controllability and interpretability. Specifically, the algorithm in our study filters the features and selects the most important features. The idea of this method is to directly use the machine learning model to select features, establish prediction models for each individual feature and response variable, and use cross-validation to select models. The specific steps and processes are shown in Figure 3.

Figure 3
www.frontiersin.org

Figure 3. Process of feature selection based on ensemble-learning-based methods.

According to Figure 3, the typical process of feature selection is to first determine the selection criteria and then determine which features to select. Here, the selection criteria is the feature numbers/dimensions, and the determination of which features should be selected is the highest ranking of feature importance according to the numbers/dimensions. Feature numbers (dimensions) are gotten as follows: Considering that there are 32 feature parameters in the data set, in order to avoid the problems of over-fitting and under-fitting caused by too much or too little feature selection, we selected the top 10 features, 20 features and 32 features (all features) from the ranking(feature sets were manually divided into small, medium, and large three different levels subsets: 10/20/32) according to their importance, and used LogisticRegression (Cox, 1958; Liu et al., 2009; Lin et al., 2007), GaussianNB (Long et al., 2012; James and Vimina, 2022), DecisionTree, Randomforest, SVM (Vapnik, 1998; Zheng et al., 2008; Tong and Chang, 2001) algorithm for cross-validation to obtain their accuracy. Then we compared the accuracy of 10 features, 20 features and 32 features in the case of each algorithm, and further comprehensively judged the number of features selected by the final classifier.

In this case, final features selection are as follows: The number of features has been determined above, so we need to determine the specific features. The ranking of feature importance was selected by the ensemble of four different models/algorithms: Randomforest, AdaBoost, Gradientboosting and Decisiontree. According to the number of previously determined features, we selected the corresponding number of features with the highest ranking according to their importance, and finally formed a feature vector.

Discriminant model: According to the above feature selection algorithm, 10 features, 20 features and 32 features were selected for cross-validation with 5 common algorithms, and their corresponding accuracy is shown in Table 3. It can be seen from the table that when 10 features are selected, the classifier has better classification effect and less calculation. Figure 4 shows the visual sorting of feature importance selected by the ensemble of four different models/algorithms: Randomforest, AdaBoost, Gradientboosting and Decisiontree algorithms (the abscissa is the relative importance of features). In this way, the 10 features we finally selected form a feature vector: right step speed, left step speed, right off ground angle, left off ground angle, right sagittal min angle, left sagittal min angle, right stride, left stride, right sagittal max angle, left sagittal max angle.

Table 3
www.frontiersin.org

Table 3. Discriminant model: table of comparing for features selection.

Figure 4
www.frontiersin.org

Figure 4. Discriminant model: order of feature’s importance.

Hierarchical model: According to the feature selection algorithm, 10 features, 20 features and 32 features are selected for cross-validation with five common algorithms, and their corresponding accuracy is shown in Table 4. It can be seen from the table that when 10 features are selected, the classifier has better classification effect and less calculation.

Table 4
www.frontiersin.org

Table 4. Hierarchical model: table of comparing for features selection.

Figure 5 shows the visual sorting of feature importance selected by the ensemble of four different models/algorithms: Randomforest, AdaBoost, Gradientboosting and Decisiontree algorithms (the abscissa is the relative importance of features). In this way, the 10 features finally selected form a feature vector: healthy side ground angle, affected side ground angle, healthy side off ground angle, affected side off ground angle, healthy side cross max angle, affected side cross max angle, healthy side stride, affected double time, healthy side sagittal max angle, affected side sagittal max angle.

Figure 5
www.frontiersin.org

Figure 5. Hierarchical model: order of feature’s importance.

4.4 Classification algorithm selection

After determining the features of model training, it is necessary to select a classification algorithm. We selected the classification algorithm of the model through cross-validation. Specifically, we selected nine common classification algorithms and calculated the performance indicators of each classification algorithm. Each algorithm performed 5-fold cross-validation calculations, calculated the prediction accuracy for five times, and obtained the prediction accuracy of the algorithm (the average value of 5 times, and the average accuracy is one of the commonly used evaluation indicators). We compared the corresponding accuracy values obtained in the case of each algorithm, and finally selected the algorithm with higher accuracy value as our model training algorithm.

Discriminant model: According to the determination method mentioned above, each algorithm’s accuracy value, standard deviation and confidence interval are shown in Table 5. Through comparison, the following three algorithms are selected for classifier modeling: KNN, SVM and Randomforest.

Table 5
www.frontiersin.org

Table 5. Discriminant model: comparing between different algorithms for algorithm’s selection.

Hierarchical model: According to the determination method mentioned above, each algorithm’s accuracy value, standard deviation and confidence interval are shown in Table 6. Through comparison, the following three algorithms are selected for classifier modeling: SVM, Randomforest and AdaBoost.

Table 6
www.frontiersin.org

Table 6. Hierarchical model: comparing between different algorithms for algorithm’s selection.

4.5 Model training

Model training is to build a classifier, that is, the process of model training and parameter tuning according to the previously selected data sets, features and algorithms, and its output is the built model. In this paper, a 5-fold cross-validation method is used for parameter tuning. Its basic idea is to group the training sets in the original data set. One part is used as the training set to train the model, and the other part is used as the test set to evaluate the model.

Discriminant model: as shown in Table 7, we selected totally 156 patients and normal people as the training set to build the discriminant model. For three different algorithms, the relevant parameters obtained are: KNN parameter n_neighbors = 6, SVM parameter c = 2.1, max_iter = 21, kernel = poly, Randomforest parameter n_ estimators = 9.

Table 7
www.frontiersin.org

Table 7. Data of experiment 1.

Hierarchical model: as shown in Table 8, we selected 32 patients with a total of 515 steps as our total sample, of which the training set is composed of 25 patients with a total of 457 steps to build the hierarchical model. For three different algorithms, the relevant parameters obtained are: Adaboost parameter n_estimators = 7, learning_ rate = 1.8, SVM parameter C = 0.1, max_iter = 7, kernel = poly, Randomforest parameter n_ estimators = 52.

Table 8
www.frontiersin.org

Table 8. Data of experiment 2.

5 Experimental results

Based on the trained discriminant model and hierarchical model for stroke gait analysis, 226 cases of data (including normal healthy people and stroke patients) were collected from Beijing Chao-Yang Hospital, Capital Medical University, and the models trained in this paper were tested to verify the effectiveness. The experiment is divided into two parts. The first part (hereinafter referred to as Experiment 1) is to verify the discriminant model, that is, to distinguish between normal healthy people and stroke patients, and the second part (hereinafter referred to as Experiment 2) is to verify the hierarchical model, that is, to judge the severity of stroke in stroke patients.

5.1 Experimental data

The details of the subjects participating in the experiment are shown in Table 7. This data is used as the data set of Experiment 1.

In Table 7, 32 of 39 stroke patients were scored and labeled by clinicians with NIHSS. The scores of the labeled patients are distributed between 1 and 5 points, that is, all patients are non-severe stroke patients, as shown in Table 9. During the test, the patients were required to walk on a flat ground along a straight line. According to the actual situation of the patients, the patients should walk as many times as possible, that is, different patients walked at different times. Each walk covered about 2–50 steps.

Table 9
www.frontiersin.org

Table 9. NIHSS score of strokes.

In this paper, we calculated the parameters of each step in each walk of each patient to form the data set of Experiment 2. The total number of steps of 32 patients was 515. In consideration of the symmetry of the sample, we classified the patients with a NIHSS score of 1 as very mild stroke, and the patients with a NIHSS score of 2–5 as mild stroke. The specific experimental data of Experiment 2 are shown in Table 8.

5.2 Experimental results

The purpose of the experiment is to verify the performance of the discriminant model based on machine learning to distinguish between normal healthy people and stroke patients, and the performance of the hierarchical model to distinguish the severity of stroke in stroke patients.

(a) Experiment 1

Test results: as shown in Table 7, we selected 70 patients and normal people as test sets to verify the performance of the model. The experimental results are shown in Table 10. From the table:

Table 10
www.frontiersin.org

Table 10. Experiment 1: test results by model comparing between different algorithms.

1. The model based on KNN, SVM and Randomforest algorithms has achieved 98.28, 98.28 and 96.55% detection accuracy for normal healthy people, respectively.

2. The detection accuracy of the model based on KNN, SVM and Randomforest algorithms for stroke patients was 66.67, 66.67 and 58.33%, respectively.

3. The overall detection accuracy of the model based on KNN, SVM and Randomforest algorithms is 92.86, 92.86 and 90.00%, respectively.

(b) Experiment 2

Test results: as shown in Table 8 we selected 32 patients with a total of 515 steps as our total sample. The test set consists of 7 patients with a total of 58 steps to verify the performance of the model. It should be noted that the judgment criterion of this experiment is: each prediction result is the prediction result of a certain step for a person. If the prediction result of each step is consistent with the labeled data, it is judged that the recognition is correct, otherwise it is a recognition error. The same judgment is made on all the steps of the person, and finally the principle of the minority obeying the majority is adopted. In other words, the total number of steps judged to be correct is more than the number of steps judged to be wrong, then the total judgment result is that the detection is correct; Otherwise, it is a detection error. The experimental results are shown in Table 11. Subject/tester 1–4 are mild stroke patients, and subject/tester 5–7 are very mild stroke patients. It can be seen from the table that the overall detection accuracy of the model based on Randomforest, SVM and AdaBoost algorithms reaches 71.43, 85.71 and 85.71%, respectively.

Table 11
www.frontiersin.org

Table 11. Experiment 2: test results by model comparing between different algorithms.

6 Discussion

In this paper, a quantitative evaluation method of stroke association based on multi-dimensional gait parameters is proposed. This method is based on machine learning method, training a model, and then analyzing gait of stroke. That is, through all existing gait features (gait parameters), select appropriate gait features (gait parameters), label stroke patients with NHISS scale by clinicians, and finally train a discriminant model for stroke patients or not and a hierarchical model for stroke severity grading evaluation through machine learning algorithm. Based on the data of 226 cases collected in Beijing Chao-Yang Hospital, Capital Medical University, experiments were conducted on the two models. The purpose of Experiment 1 was to use the discriminant model to distinguish between normal healthy people and stroke patients. The experimental results show that the three algorithms have achieved 96.55–98.28% detection accuracy for normal people, which shows that the classifier can better distinguish healthy people. The accuracy of the model based on KNN, SVM and Randomforest is 58.33–66.67% for stroke patients. Compared with healthy people, the accuracy of stroke patients’ detection is low. This is because the NIHSS scores of stroke patients in our test set are small, that is, the degree of stroke of these patients is very slight, so the classifier is difficult to distinguish under the current training of relatively small data sets. The overall detection accuracy of the models by the three algorithms has reached more than 90%, and the results of KNN and SVM detection are better, which fully shows that the model built in this paper can provide a certain degree of early warning for the onset or recurrence of stroke. The purpose of Experiment 2 was to use the hierarchical model to distinguish the severity of stroke in stroke patients. It can be seen from the result that the overall detection accuracy of the models is up to 85.71% according to the number of steps, of which the results of AdaBoost and SVM are better, which fully shows that the model built in this paper can monitor the rehabilitation status of stroke patients after stroke. However, because the overall sample size of our experiment is not too large, and the selected patients have mild stroke, it has limitations. In the future, we need to expand the sample size for further in-depth research. In addition, previous studies have shown that the presence and progression of cerebral atrophy(subtype of ischemic stroke) is a potentially relevant (although still poorly characterized) manifestation of acute cerebral small vessel disease-related to gait disturbances (Grau-Olivares et al., 2010; Smith and Arboix, 2012). Because the pathophysiology, prognosis and clinical characteristics of acute small vessel ischemic stroke are different from other types of cerebral infarction, the two models in the proposed method for fine-grained domain of ischemic stroke is also one of the future research directions. For example, it can be used to study the relationship between lacunar and non-lacunar acute stroke (Rudilosso et al., 2022).

7 Conclusion

The two experimental results showed that the stroke association quantitative evaluation method by using machine learning based on multi-dimensional gait parameters proposed in this paper has the characteristics of high accuracy, objectivity, and quantification. It is expected to be used in clinical early warning, rehabilitation monitoring and evaluation of post-stroke recurrence to assist clinicians more efficiently.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Research Ethics Committee of the Beijing Chao-Yang Hospital (the number is 2018-sci-153). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

CW: Writing – review & editing, Data curation, Funding acquisition, Writing – original draft. ZL: Data curation, Writing – review & editing, Investigation, Software. X-DW: Conceptualization, Funding acquisition, Validation, Visualization, Writing – original draft. Y-QK: Data curation, Investigation, Writing – original draft. L-CZ: Conceptualization, Methodology, Supervision, Writing – original draft. W-HJ: Resources, Visualization, Writing – original draft. PL: Investigation, Writing – original draft. JW: Formal analysis, Writing – original draft. X-JW: Formal analysis, Writing – review & editing. TT: Formal analysis, Methodology, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work is partly supported by 2025 Key Technology Innovation Program of Ningbo(2020Z014), Capital Health Development Research Project (2020-3-7091), and Henan Science and Technology Project (192102310218,212102310557). The authors are grateful for these supports.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

NIHSS, National Institutes of Health Stroke Scale; SVM, Support Vector Machine; KNN, K-Nearest Neighbor.

References

Altman, N.S. (1992). An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. American Statist. 46, 175–185.

Google Scholar

Andrea, M., Diana, T., Andrea, C., and Angelo, S. (2016). A machine learning framework for gait classification using inertial sensors: application to elderly, post-stroke and Huntington's disease patients. Sensors 16, 1–14. doi: 10.3390/s16010134

PubMed Abstract | Crossref Full Text | Google Scholar

Aoki, J., Kimura, K., Koga, M., Kario, K., Nakagawara, J., Furui, E., et al. (2013). NIHSS-time score easily predicts outcomes in rt-PA patients: the SAMURAI rt-PA registry. J. Neurol. Sci. 327, 6–11. doi: 10.1016/j.jns.2013.01.029

PubMed Abstract | Crossref Full Text | Google Scholar

Ayyadevara, V. K. (2018). Gradient Boosting Machine. Pro machine learning algorithms: A hands-on approach to implementing algorithms in python and R. doi: 10.1007/978-1-4842-3564-5_6

Crossref Full Text | Google Scholar

Biau, G., and Scornet, E. (2015). A random Forest guided tour. TEST 25, 1–31.

Google Scholar

Chow, J. W., and Stokic, D. S. (2021). Longitudinal changes in Temporospatial gait characteristics during the first year post-stroke. Brain Sci. 11:1648. doi: 10.3390/brainsci11121648

PubMed Abstract | Crossref Full Text | Google Scholar

Collins, M., Schapire, R. E., and Singer, Y. (2002). Logistic Regression, AdaBoost and Bregman distances. Mach. Learn. 48, 253–285. doi: 10.1023/A:1013912006537

PubMed Abstract | Crossref Full Text | Google Scholar

Cox, D. R. (1958). The regression analysis of binary sequences. J. R. Stat. Soc. Ser. B Methodol. 20, 215–232. doi: 10.1111/j.2517-6161.1958.tb00292.x

Crossref Full Text | Google Scholar

Cui, C., Bian, G. B., Hou, Z. G., Zhao, J., Su, G., Zhou, H., et al. (2018). Simultaneous recognition and assessment of post-stroke Hemiparetic gait by fusing kinematic, kinetic, and electrophysiological data. IEEE Trans. Neural Syst. Rehabil. Eng. 26, 856–864. doi: 10.1109/TNSRE.2018.2811415

PubMed Abstract | Crossref Full Text | Google Scholar

Duda, R. O. (2001). Pattern classification. Second Edn.

Google Scholar

Estrada-Barranco, C., Molina-Rueda, F., and Cuerda, C. (2022). Validity of the Wisconsin gait scale to predict the risk of falls in stroke patients. Rev. Neurol. 73, 50–56. doi: 10.33588/rn.7302.2020281

PubMed Abstract | Crossref Full Text | Google Scholar

Freund, Y., and Mason, L. (2002). “The alternating decision tree learning algorithm,” in Proceedings of the Sixteenth International Conference on Machine Learning (ICML ‘99). San Francisco, CA, USA: Morgan Kaufmann. 124–133.

Google Scholar

Freund, Y., and Schapire, R. E. (1999). Large margin classification using the perceptron algorithm. Mach. Learn. 37, 277–296. doi: 10.1023/A:1007662407062

Crossref Full Text | Google Scholar

Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. Ann. Stat. 29, 1189–1232. doi: 10.1214/aos/1013203451

PubMed Abstract | Crossref Full Text | Google Scholar

Fürnkranz, J. (2017). Decision Tree. Encyclopedia of Machine Learning And Data Mining. doi: 10.1007/978-1-4899-7687-1_66

Crossref Full Text | Google Scholar

Gallant, S. I. (1990). Perceptron-based learning algorithms. IEEE Trans. Neural Netw. 1, 179–191. doi: 10.1109/72.80230

PubMed Abstract | Crossref Full Text | Google Scholar

Genuer, R., Poggi, J. M., and Tuleau-Malot, C. (2015). VSURF: An R package for variable selection using random forests. R J.l 7:9. doi: 10.32614/RJ-2015-018

Crossref Full Text | Google Scholar

Grau-Olivares, M., Arboix, A., Junqué, C., Arenaza-Urquijo, E. M., Rovira, M., and Bartrés-Faz, D. (2010). Progressive gray matter atrophy in lacunar patients with vascular mild cognitive impairment. Cerebrovasc. Dis. 30, 157–166. doi: 10.1159/000316059

PubMed Abstract | Crossref Full Text | Google Scholar

Hage, V. (2011). The nih stroke scale: a window into neurological status. NurseCom Nursin Spectrum (Greater Chicago) 24, 44–49.

Google Scholar

Hastie, T. J., Tibshirani, R. J., and Friedman, J. H. (2001). The elements of statistical learning. Springer. 1.

Google Scholar

Ho, T.K. (1995). “Random decision forests.” in Document Analysis and Recognition, 1995. Proceedings of the Third International Conference on.

Google Scholar

Ho, T. K. (1998). The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis & Machine Intelligence 20, 832–844. doi: 10.1109/34.709601

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, X. Y., Yun, X. P., and Zhang, T. Application of gait analysis based on digigal video and digital image processing in stroke rehabilitation training[J]. Chinese J Cerebrovas Dis. (2009) 6:5. doi: 10.3969/j.issn.1672-5921.2009.09.003

Crossref Full Text | Google Scholar

Hussain, I., and Park, S. J. (2021). Prediction of myoelectric biomarkers in post-stroke gait. Sensors 21:5334. doi: 10.3390/s21165334

PubMed Abstract | Crossref Full Text | Google Scholar

James, D.E., and Vimina, E.R. Machine learning-based early diabetes prediction. (2022).

Google Scholar

Jauch, E. C., Saver, J. L., Adams, H. P., and Bruno, A. (2013). Guidelines for the early Management of Patients with Acute Ischemic Stroke: A guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke Dallas 44, 870–947. doi: 10.1161/STR.0b013e318284056a

Crossref Full Text | Google Scholar

Joglekar, S. (2016). Adaboost-sachin joglekar's blog. Codesachin. Wordpress.com.

Google Scholar

Kaczmarczyk, K., Wit, A., Krawczyk, M., and Zaborski, J. (2009). Gait classification in post-stroke patients using artificial neural networks. Gait Posture 30, 207–210. doi: 10.1016/j.gaitpost.2009.04.010

PubMed Abstract | Crossref Full Text | Google Scholar

Kégl, B. (2013). The return of AdaBoost.MH: Multi-class hamming trees. doi: 10.48550/arXiv.1312.6086

Crossref Full Text | Google Scholar

Khalil, H. M., and Lahoud, N. (2020). Knowledge of stroke warning signs, risk factors, and response to stroke among Lebanese older adults in Beirut. J. Stroke Cerebrovasc. Dis. 29:104716. doi: 10.1016/j.jstrokecerebrovasdis.2020.104716

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, H., Kim, Y. H., Kim, S. J., and Choi, M. T. (2022). Pathological gait clustering in post-stroke patients using motion capture data. Gait Posture 94, 210–216. doi: 10.1016/j.gaitpost.2022.03.007

PubMed Abstract | Crossref Full Text | Google Scholar

Kong, Y. Q., Liu, M. Y., Wang, C., Zhou, L. C., and Wang, X. D. (2018). Sensibility and reliability of the gait analysis sys tem based on the sound and motion sensors. Chin. J Stroke 13, 62–65.

Google Scholar

Krawczyk, M., Syczewska, M., and Szczerbik, E. (2012). Gait kinematics and clinical test changes in post-stroke patients during rehabilitation. Preliminary results of 12 patients of randomized clinical trial. Advan. Rehab. 26, 13–18. doi: 10.2478/rehab-2013-0025

PubMed Abstract | Crossref Full Text | Google Scholar

Kursa, M. B., and Rudnicki, W. R. (2010). Feature selection with Boruta package. J. Stat. Softw. 36, 1–13. doi: 10.18637/jss.v036.i11

Crossref Full Text | Google Scholar

Li, J., Cheng, K., Wang, S., Morstatter, F., Trevino, R. P., Tang, J., et al. (2017). A Data Perspective. Feature selection. Association for Computing Machinery (ACM) 50, 1–45. doi: 10.1145/3136625

PubMed Abstract | Crossref Full Text | Google Scholar

Li, M., Tian, S., Sun, L., and Chen, X. (2019). Gait analysis for post-stroke Hemiparetic patient by multi-features fusion method. Sensors (Basel). 19:1737.

PubMed Abstract | Google Scholar

Lin, C. J., Weng, R. C., and Keerthi, S. S. (2007). Trust region Newton methods for large-scale logistic regression[C]//International Conference on Machine Learning. ACM, 561–568. doi: 10.1145/1273496.1273567

Crossref Full Text | Google Scholar

Liu, J., Chen, J., and Ye, J. “Large-scale sparse logistic regression.” In proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, Paris, France, June 28–July 1, 2009. (2009).

Google Scholar

Long, J, Mo, Y, Ming, Z, Liu, X, and Zhao, T. “Target-dependent twitter sentiment classification.” In the 49th annual meeting of the Association for Computational Linguistics: human language technologies, proceedings of the conference, 19–24 June, 2011, Portland, Oregon, USA. (2012).

Google Scholar

Mariana, B., and Lucian, D. (2016). Random forest in remote sensing: a review of applications and future directions. ISPRS J. Photogrammetry & Remote Sensing.

Google Scholar

Mason, L., Baxter, J., Bartlett, P., and Frean, M. (1999). Boosting Algorithms as Gradient Descent in Function Space. Adv. Neural Inf. Proces. Syst.

Google Scholar

Mizuta, N., Hasui, N., Nishi, Y., Higa, Y., Matsunaga, A., Deguchi, J., et al. (2022). Merged swing-muscle synergies and their relation to walking characteristics in subacute post-stroke patients: an observational study. PLoS One 17:17. doi: 10.1371/journal.pone.0263613

PubMed Abstract | Crossref Full Text | Google Scholar

Nagao, Y., Nakajima, M., and Inatomi, Y. (2020). Pre-hospital delay in patients with acute ischemic stroke in a multicenter stroke registry:K-PLUS. J. Stroke Cerebrovasc. Dis. 29:105284. doi: 10.1016/j.jstrokecerebrovasdis.2020.105284

Crossref Full Text | Google Scholar

Robbins, H., and Monro, S. (1985). A Stochastic Approximation Method[M]. New York: Springer.

Google Scholar

Rokach, L., and Maimon, O.Z. Data mining with decision trees [electronic resource]: Theory and applications. (2008).

Google Scholar

Ruck, D. W. (1990). Feature selection using a multilayer perceptron. Neural Network Comput 2, 40–48.

Google Scholar

Rudilosso, S., Rodríguez-Vázquez, A., Urra, X., and Arboix, A. (2022). The potential impact of neuroimaging and translational research on the clinical Management of Lacunar Stroke. Int. J. Mol. Sci. 23:497. doi: 10.3390/ijms23031497

PubMed Abstract | Crossref Full Text | Google Scholar

Seok, L. H., Hokyoung, R., Shi-Uk, L., Jae-sung, C., Sungmin, Y., Hyeon, P. J., et al. (2021). Analysis of gait characteristics using hip-knee Cyclograms in patients with hemiplegic stroke. Sensors 21:7685. doi: 10.3390/s21227685

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, E. E., and Arboix, A. (2012). Focal cortical thinning is caused by remote subcortical infarcts: spooky action at a distance. Neurology 79, 2016–2017. doi: 10.1212/WNL.0b013e3182749f6e

PubMed Abstract | Crossref Full Text | Google Scholar

Soucy, P., and Mineau, G.W. “A simple KNN algorithm for text categorization.” in IEEE International Conference on Data Mining. (2001).

Google Scholar

Speiser, J. L., Miller, M. E., Tooze, J., and Ip, E. (2019). A comparison of random forest variable selection methods for classification prediction modeling. Expert Syst. Appl. 134, 93–101. doi: 10.1016/j.eswa.2019.05.028

PubMed Abstract | Crossref Full Text | Google Scholar

Tian, T., Wang Cheng, X., Yuan, B. Y., Jing, W., Zhou, L., Xiangdong, W., et al. (2021). A wearable gait analysis system used in type 2 diabetes mellitus patients: A case-control study, vol. 14. UK: Dove Press, 1799–1808.

Google Scholar

Titus, A. W., Hillier, S., and Louw, Q. A. I.-J. (2018). An analysis of trunk kinematics and gait parameters in people with stroke. Afr J Disabil 7:310.

Google Scholar

Tong, S., and Chang, E. “Support vector machine active learning for image retrieval.” In Proceedings of the ninth ACM international conference on multimedia, ACM. (2001).

Google Scholar

Vapnik, V. (1998). Statistical learning theory.

Google Scholar

Wan, C. X., and Dong, X. (2017). History, present situation and future of severe stroke rehabilitation. Practical Geriatrics 31, 715–717.

Google Scholar

Wan, J. P., Zhang, S., Liu, K. Q., Cai, X., Lou, Y. P., Chen, Z. C., et al. (2014). Risk factors of hemorrhagic transformation in different locations and its relation to clinical outcomes of patients with acute ischemic stroke following intravenous thrombolysis. Zhejiang Da Xue Xue Bao Yi Xue Ban 43, 36–42

PubMed Abstract | Google Scholar

Wang, W., Jiang, B., Sun, H., and Ru, X. (2017). Prevalence, incidence, and mortality of stroke in China results from a Nationwide population-based survey of 480 687 adults. Circulation: Official J. American Heart Association 135, 759–771. doi: 10.1161/CIRCULATIONAHA.116.025250

Crossref Full Text | Google Scholar

Wang, L. D., Liu, J. M., Yang, Y., Peng, B., and Wang, Y. L. (2018). Summary of China stroke prevention and treatment report 2017. Chinese J. Cerebrovascular Dis. 15, 611–616.

Google Scholar

Wang, C., Wang, X.D., Long, Z., Tian, T., and Li, J.T. “Estimation of Spatial-Temporal Gait Parameters based on the Fusion of Inertial and Film-Pressure Signals.” in 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). (2018).

Google Scholar

Wang, C., Wang, X. D., Long, Z., Yuan, J., Qian, Y. L., and Li, J. T. (2016). Estimation of temporal gait parameters using a wearable microphone-sensor-based system. Sensors 16:2167. doi: 10.3390/s16122167

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, C., Wang, X., Long, Z., Yuan, J., Qian, Y.L., and Li, J.T. “Multimodal gait analysis based on wearable inertial and microphone sensors.” in IEEE Smartworld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People & Smart City Innovation. (2017).

Google Scholar

Wang, Y. X., Yuan, J. L., and Hu, W. L. (2016). Research progress of common stroke scales. Chinese J. Stroke 11, 1072–1077.

Google Scholar

Wong, W., Cheung, D., and Kao, B. (2009). Secure kNN computation on encrypted databases[J]. ACM. doi: 10.1145/1559845.1559862

Crossref Full Text | Google Scholar

Wu, Z., Zeng, M., Li, C., Qiu, H., Feng, H., Xu, X., et al. (2019). Time-dependence of NIHSS in predicting functional outcome of patients with acute ischemic stroke treated with intravenous thrombolysis. Postgrad. Med. J. 95, 181–186. doi: 10.1136/postgradmedj-2019-136398

PubMed Abstract | Crossref Full Text | Google Scholar

Zheng, L., Zhou, H., Wang, C., and Cen, K. (2008). Combining support vector regression and ant Colony optimization to reduce NOx emissions in coal-fired utility boilers. Energy Fuel 22, 1034–1040. doi: 10.1021/ef700451v

Crossref Full Text | Google Scholar

Zhu, W. Gradient Boosting. (2016).

Google Scholar

Keywords: quantitative evaluation, gait parameters, stroke, NIHSS, machine learning

Citation: Wang C, Long Z, Wang X-D, Kong Y-Q, Zhou L-C, Jia W-H, Li P, Wang J, Wang X-J and Tian T (2025) Quantitative evaluation method of stroke association based on multidimensional gait parameters by using machine learning. Front. Neuroinform. 19:1544372. doi: 10.3389/fninf.2025.1544372

Received: 12 December 2024; Accepted: 28 January 2025;
Published: 12 February 2025.

Edited by:

Ying Xie, Kennesaw State University, United States

Reviewed by:

Lili Zhang, Hewlett Packard Enterprise, United States
Pablo Ordonez, Kennesaw State University, United States

Copyright © 2025 Wang, Long, Wang, Kong, Zhou, Jia, Li, Wang, Wang and Tian. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tian Tian, dHQ3OTEyMDFAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more