Decoding Wilson disease: a machine learning approach to predict neurological symptoms

Yang, Yulong; Wang, Gang-Ao; Fang, Shuzhen; Li, Xiang; Ding, Yufeng; Song, Yuqi; He, Wei; Rao, Zhihong; Diao, Ke; Zhu, Xiaolei; Yang, Wenming

doi:10.3389/fneur.2024.1418474

ORIGINAL RESEARCH article

Front. Neurol., 19 June 2024

Sec. Artificial Intelligence in Neurology

Volume 15 - 2024 | https://doi.org/10.3389/fneur.2024.1418474

This article is part of the Research TopicExploring the Future of Neurology: How AI is Revolutionizing Diagnoses, Treatments, and BeyondView all 10 articles

Decoding Wilson disease: a machine learning approach to predict neurological symptoms

Yulong Yang¹^†

Gang-Ao Wang²^†

Shuzhen Fang¹^†

Xiang Li¹

Yufeng Ding¹

Yuqi Song¹

Wei He¹

Zhihong Rao¹

Ke Diao¹

Xiaolei Zhu²^*

Wenming Yang^1,3,4^*

¹Department of Neurology, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei, Anhui, China
²School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China
³Center for Xin'an Medicine and Modernization of Traditional Chinese Medicine, Institute of Health and Medicine Hefei Comprehensive National Science Center, Hefei, Anhui, China
⁴Key Laboratory of Xin'An Medicine, Ministry of Education, Hefei, Anhui, China

Objectives: Wilson disease (WD) is a rare autosomal recessive disorder caused by a mutation in the ATP7B gene. Neurological symptoms are one of the most common symptoms of WD. This study aims to construct a model that can predict the occurrence of neurological symptoms by combining clinical multidimensional indicators with machine learning methods.

Methods: The study population consisted of WD patients who received treatment at the First Affiliated Hospital of Anhui University of Traditional Chinese Medicine from July 2021 to September 2023 and had a Leipzig score ≥ 4 points. Indicators such as general clinical information, imaging, blood and urine tests, and clinical scale measurements were collected from patients, and machine learning methods were employed to construct a prediction model for neurological symptoms. Additionally, the SHAP method was utilized to analyze clinical information to determine which indicators are associated with neurological symptoms.

Results: In this study, 185 patients with WD (of whom 163 had neurological symptoms) were analyzed. It was found that using the eXtreme Gradient Boosting (XGB) to predict achieved good performance, with an MCC value of 0.556, ACC value of 0.929, AUROC value of 0.835, and AUPRC value of 0.975. Brainstem damage, blood creatinine (Cr), age, indirect bilirubin (IBIL), and ceruloplasmin (CP) were the top five important predictors. Meanwhile, the presence of brainstem damage and the higher the values of Cr, Age, and IBIL, the more likely neurological symptoms were to occur, while the lower the CP value, the more likely neurological symptoms were to occur.

Conclusions: To sum up, the prediction model constructed using machine learning methods to predict WD cirrhosis has high accuracy. The most important indicators in the prediction model were brainstem damage, Cr, age, IBIL, and CP. It provides assistance for clinical decision-making.

Introduction

Wilson disease (WD), alsoknown as hepatolenticular degeneration (HLD), was described by Wilson in 1912 and is a rare autosomal recessive inherited disorder (1, 2). The disease is caused by homozygous or compound heterozygous mutations in ATP7B, which encodes the transmembrane copper-transporting ATPase 2 (commonly referred to as ATP7B). Abnormal function of ATP7B can lead to impaired copper homeostasis and copper overload in the brain, liver and other organs (3). Neurological symptoms are one of the most common types of symptoms associated with this disease. According to research reports, 18–68% of WD patients experience neurological symptoms for the first time, with an average age of 20–30 years (4, 5). However, most patients exhibit liver symptoms in the early stages, followed by severe neurological symptoms (1). Although UWDRS, GAS and other scales can assess the neurological symptoms of patients, the treatment effect is often poor when patients experience neurological symptoms (6–8). Therefore, predicting and intervening in advance on the relevant influencing factors that cause neurological symptoms in patients is crucial for their prognosis.

Machine learning is a research field dedicated to understanding how computers acquire knowledge from data (9). Machine learning technology involves computer-based statistical methods that are trained to identify common patterns and hidden correlations in large datasets. In clinical practice, machine learning methods can help clinicians classify patients with multiple diseases according to several variables (10–12). In recent years, multiple studies have been reported on the application of machine learning methods in predicting Parkinson's disease and Alzheimer's disease (13, 14). In addition, research on machine learning-based prediction of liver cirrhosis in WD has also achieved great success (15). However, so far, no model was used to predict the development of neurological symptoms in WD.

Therefore, this study combines machine learning with clinical multidimensional indicators, such as general clinical information, imaging examinations, blood and urine tests, and clinical scale measurements, to determine the most important indicators related to WD neurological symptoms and construct a predictive model for the occurrence of neurological symptoms in WD patients. Intended to provide clinical decision-making tools for clinicians, it is hoped that through early intervention of these important factors, patients can delay or even avoid the occurrence of neurological symptoms. The flowchart of this study was shown in Figure 1.

Figure 1

Figure 1. The flowchart of this study.

Materials and methods

Study design and population

The study population included 185 WD patients enrolled between July 2021 and October 2023. All patients were inpatients of the Brain Disease Center of the First Affiliated Hospital of Anhui University of Traditional Chinese Medicine and conformed to the EASL clinical practice guidelines for WD (16). The inclusion criteria were as follows: age 12–60 years old (Patients < 12 years old may not be able to cooperate in completing MRI and other examination, while patients >60 years old may exhibit symptoms that are confused with the symptoms of this disease due to age); routine copper depletion treatment at a stable dosage in the previous 4 weeks. The exclusion criteria were as follows: age < 12 years old or >60 years old; unable to cooperate in completing the scale assessment; combined with other neurological, orthopedic, or cardiovascular/respiratory diseases; combined with severe mental symptoms.

This study was conducted following the 1964 Declaration of Helsinki and approved by the Ethics Committee of the First Affiliated Hospital of Anhui University of Traditional Chinese Medicine (Approval No.: 2021AH-66). All subjects received written informed consent.

Data processing

In this study, we included data from 185 patients for prediction (please refer to Supplementary material for details). Before conducting data prediction, outliers were discarded, and for the discarded outliers and missing values, we used a imputation method. If the discarded outliers and missing values are of discrete type, we use mode imputation. If the discarded outliers and missing values are of continuous type, we use mean to randomly select imputation within the variance range. To validate the reliability of the imputed data, we compared the means, medians, and standard deviations before and after imputation, with errors within 3%. Subsequently, we conducted machine learning cross-validation, further confirming the reliability of the imputed data. Meanwhile, the R code was implemented in Empower Stats. Based on previous research results, eligible WD patients were randomly divided into a training group and a testing group at a ratio of 9:1 (17, 18). For some discrete-valued features such as the presence of cirrhosis and ascites, the plurality complement was used, and for some continuous-valued features, the mean complement method was used. Modeling was performed on the training set, followed by validation on an independent test set. The data is presented in Table 1.

Table 1

Table 1. Number of data set.

Neuropsychiatric assessment and clinical objective indicators

All subjects were assessed for neuropsychiatric symptoms using the Unified Wilson Disease Rating Scale (UWDRS) (8). The full UWDRS includes three subscales: the neurological subscale, the liver subscale, and the psychiatric subscale. Since this experiment was mainly to evaluate the neurological symptoms of patients, the subjects were classified into those with neurological symptoms (WD-N, n = 163) and those without neurological symptoms (no-WD-N, n = 22) according to whether the score on the neurological subscale was zero or not. Additionally, through the electronic medical record system, this study collected the following information as predictive factors: gender, age, blood routine examination, liver function, kidney function, liver fibrosis, blood clotting function, 24 h urinary copper, ceruloplasmin, cranial magnetic resonance lesion sites, abnormal liver ultrasound findings and the presence of K-F rings, as well as cirrhosis and ascites.

Statistical analysis

IBM SPSS 25.0 statistical software was utilized to compare the characteristics of demographics, clinical objective indicators, and scale measures of WD patients with and without neurological symptoms. Continuous variable data were expressed as mean ± SD, and differences between groups were analyzed using t-test. Meanwhile, categorical variable data were expressed as numbers and percentages, and the difference between groups was analyzed using the Chi-squared test or Fisher's exact test. Specifically, P > >0.05 indicates that the statistical results have no statistically significant difference, P < 0.05 indicates a statistically significant difference, P < 0.01 indicates a highly statistically significant difference, and P < 0.001 indicates an extremely statistically significant difference.

Machine learning methods

Five common machine learning methods, namely Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGB), Logistic regression (LR), and Back propagation (BP) neural network were used to model the data in this study (19–23).

RF: Random forest (19) is a machine learning method based on integrated learning of decision trees. It adopts the concepts of bootstrap sampling and feature randomization to construct multiple decision trees for prediction. Then, it aggregates the results of multiple decision trees through voting to improve prediction accuracy and robustness.

SVM: SVM can be used for classification tasks, serving as a supervised learning algorithm (21). For binary classification problems, the objective of SVM is to find a hyperplane that maximizes the margins between different classes.

XGB: XGB (20) is a machine learning algorithm implemented in the gradient boosting framework. It is an improved algorithm based on gradient boosting machines with the incorporation of a regularization term to effectively prevent the overfitting problem. It has been widely used in the field of machine learning owing to its high prediction accuracy and computational efficiency.

LR: LR (22) is a statistical model used for binary classification tasks, where the goal is to predict the probability that an instance belongs to a particular class. Unlike linear regression, which predicts a continuous value, logistic regression predicts the probability of the instance belonging to the default class (class 1 in binary classification) using a logistic function (specific algorithms and parameters can be found in the Supplementary material).

BP: BP neural network (23), often simply referred to as neural networks, are a type of artificial neural network that is widely used for supervised learning tasks, including classification and regression. They are composed of multiple layers of interconnected nodes, or neurons, organized in a hierarchical manner (specific algorithms and parameters can be found in the Supplementary material).

Shapley Additive Explanations: The Shapley Additive Explanations (SHAP) algorithm (24) is a unified framework for explaining predictions of black-box models. It assigns importance scores to each feature to determine the contribution of each feature to the final model prediction. Different from the XGB model which only computes feature importance, the SHAP feature selection method evaluates the impact of each feature not only by calculating its SHAP value but also by explaining the specific behavior of the feature that affects the prediction results. The SHAP algorithm has been increasingly used as a model interpretation method in fields such as biomedical research. The calculation of SHAP values is given below:

\begin{array}{c} ϕ_{i} = \sum_{S \subseteq {1, 2, \dots, p} \ {i}} \frac{| S |! (p - | S | - 1)!}{p!} [f (S \cup {i}) - f (S)] \end{array}

where ϕ_i represents the SHAP value for feature i, p denotes the number of features, S denotes a subset composed of features other than feature i, |S| denotes the size of the set S, and f(S) denotes the model prediction output for input feature set S. If the SHAP value ϕ_i is greater than zero, it is considered that the feature positively contributes to the model prediction. Conversely, if the SHAP value is less than zero, it indicates that the feature negatively affects the model prediction.

The experimental environment utilized Python 3.8.12, with RF and SVM constructed using the sklearn 1.2.1 package. The XGB package version employed was 1.5.0, and the shap package version used was 0.39.0.

Model evaluation metrics

To evaluate the performance of the model and facilitate a more intuitive comparison with other methods, this study took the area under the receiver operating characteristic (AUROC) curve and the area under the precision-recall curve (AUPRC) as the primary evaluation metrics. The receiver operating characteristic (ROC) curve is a vital measure of the model's robustness. Given the imbalanced nature of the independent test datasets used in this study, the area under the precision-recall curve (AUPRC) was also used for model evaluation. Additionally, specificity (SP), sensitivity (SN), accuracy (ACC), and Matthew's correlation coefficient (MCC) were chosen as additional evaluation parameters, and their definitions are as follows:

\begin{array}{c} \begin{matrix} S n = \frac{T P}{T P + F N} \end{matrix} \end{array}

\begin{array}{c} \begin{matrix} S p = \frac{T N}{T N + F P} \end{matrix} \end{array}

\begin{array}{c} \begin{matrix} A C C = \frac{T P + T N}{T P + T N + F P + F N} \end{matrix} \end{array}

\begin{array}{c} \begin{matrix} M C C = \frac{T P \times T N - F P \times F N}{\sqrt{(T P + F P) (T P + F N) (T N + F P) (T N + F N)}} \end{matrix} \end{array}

where TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively.

Results

Comparison of characteristics of patients with WD

There was no statistically significant difference between the WD-N group and the no-WD-N group in terms of gender, presence or absence of cirrhosis, ascites, cerebral cortex damage, liver capsule smoothing, enhanced internal echo in the liver, homogeneous echogenicity in the liver, and clear internal blood vessels in the liver (P > 0.05). Meanwhile, there was no statistical difference between the WD-N group and the no-WD-N group in terms of WBC, RBC, Hb, PLT, PT, INR, APTT, FBG, TT, ALT, AST, TBA, TBIL, DBIL, IBIL, TP, ALB, GGT, ALP, BUN, and 24-h urine copper (P > 0.05). Additionally, there was a statistically significant difference between the WD-N group and the no-WD-N group in terms of K-F ring, lenticular nucleus damage, thalamus damage, brainstem damage, cerebral ventricular system dilation, deepening of the sulci and fissures of the brain (P < 0.001), cerebral peduncle damage, and other brainstem damage (P < 0.05). Furthermore, the age (P < 0.001), Cr, psychiatric symptom scores, and liver symptom scores (P < 0.01) of the WD-N group were significantly higher than those of the no-WD-N group, while CP (P < 0.05) was significantly lower than that of the no-WD-N group. The detailed information and the units of all the variables are listed in Table 2.

Table 2

Table 2. Subtype information of WD patients at baseline and follow-up.

Selection of machine learning modeling methods

In this study, five machine learning methods, namely XGB, RF, SVM, LR, and BP neural network, were used to construct a prediction model. Firstly, the training set was used for modeling through 5-fold cross-validation (25) and grid search (26). Then, the optimal hyperparameters were obtained. Finally, the model was tested on an independent test set using a selection of this model. The results indicated that the XGB model achieved the best prediction performance among the five machine learning models, and the MCC, ACC, AUC, and AUPRC values of the XGB model on the test set were 0.556, 0.929, 0.835, and 0.975 respectively (details are provided in Figure 2), and the AUC and AUPRC curves of the XGB model were plotted (the details are provided in Figures 3A, B).

Figure 2

Figure 2. Performance comparison between SVM, RF, XGB, LR, and BP on test sets. SVM, Support Vector Machine; RF, Random Forest; XGB, eXtreme Gradient Boosting; LR, logistic regression; BP, back propagation neural network. MCC, Matthews correlation coefficient; ACC, accuracy; AUC, area under the receiver operating characteristic curve; AUPRC, area under the precision-recall curve.

Figure 3

Figure 3. The ROC and PR curves of the prediction models. (A) The ROC curve of the prediction model; (B) the PR curve of the prediction model.

As shown in the Figure 2, compared to XGB, LR, and BP neural network exhibit significant differences in test results. This is because the data we used are non-linear and discrete, with relatively small feature dimensions, making this feature type very suitable for algorithms developed based on tree models. Similarly, the results of XGB and RF based on tree models mentioned earlier also differ from SVM, as described in the text above.

Summary of predictive factors for neurological symptoms prediction models

Additionally, the SHAP model interpretation package was used to analyze the features of the XGB classifier and graph all instances so that the magnitude of the feature's influence on the prediction can be observed. The wider the regional distribution, the greater its influence (the details are illustrated in Figure 4). Then, to plot a standard bar graph representing the significance of each feature, the average absolute values of the SHAP values were calculated for each feature (the details are illustrated in Figure 5). Based on the Gini impurity, it was found that the most important predictors in the prediction model were brainstem damage, Cr, age, IBIL, and CP.

Figure 4

Figure 4. Feature analysis results. The abscissa represents the weight of the influence, instead of the specific value of the feature; the influence of feature value size on the results is represented by color (red represents a large value, blue represents a small value, and purple is adjacent to the mean). Cr, creatinine; IBIL, indirect bilirubin; CP, ceruloplasmin; DBIL, direct bilirubin; TT, thrombin time; WBC, white blood cell; BUN, blood urea nitrogen; Hb, hemoglobin; APTT, activated partial thromboplastin time; PLT, platelet count; GGT, γ-Glutamyl Transferase; PT, prothrombin time.

Figure 5

Figure 5. Magnitude of the influence of the indicator on neurological symptoms. Cr, creatinine; IBIL, indirect bilirubin; CP, ceruloplasmin; DBIL, direct bilirubin; TT, thrombin time; WBC, white blood cell; BUN, blood urea nitrogen; Hb, hemoglobin; APTT, activated partial thromboplastin time; PLT, platelet count; GGT, γ-Glutamyl Transferase; PT, prothrombin time.

Discussion

As far as we know, this is the first prospective case-control study to combine machine learning methods with multidimensional clinical data to predict the onset of neurological symptoms in WD patients. In this study, the XGB method in machine learning was employed to construct models to predict the occurrence of neurological symptoms in WD patients. The MCC, ACC, AUC, and AUPRC values on the test set were 0.556, 0.929, 0.835, and 0.975, respectively, achieving good results. The results suggested that brainstem damage (0.794), Cr (0.702), age (0.627), IBIL (0.577), and CP (0.533) were the top five important predictors of Gini impurity, and the presence of brainstem damage and the higher the values of Cr, Age, and IBIL, the more likely neurological symptoms were to occur, while the lower the CP value, the more likely neurological symptoms were to occur.

When comparing the demographic and clinical characteristics of the patients in the WD-N and no-WD-N groups, the latter were significantly older than the former, which is consistent with previous findings (27). Meanwhile, serological tests showed that patients with neurological symptoms had a significantly higher Cr value and a significantly lower CP value than those without neurological symptoms, and both were good predictors of the occurrence of neurological symptoms. Previous studies have found that Cr may have a certain correlation with neurological symptoms (28). Additionally, previous studies have found that CP plays an important role in the metabolism and development of neural tissues (29), and the deficiency or functional impairment of ceruloplasmin is a typical feature of various neurodegenerative diseases (30, 31). Therefore, it can be speculated that elevated Cr with diminished CP may be associated with the development of neurological symptoms. Moreover, it was found that the WD-N group had a higher incidence of the K-F ring than the no-WD-N group. These results are consistent with previous research findings (16, 32–34). Furthermore, the WD-N group had significantly higher psychiatric symptom scores and liver symptom scores than the no-WD-N group. This suggests that neurological patients often have psychiatric symptoms, and they tend to have more pronounced hepatic symptoms than hepatic patients. Also, it was found that the WD-N group had a higher incidence of lenticular nucleus damage, thalamus damage, brainstem damage, cerebral ventricular system dilation, deepening of the sulci, and fissures of the brain cerebral peduncle damage and other brain region damage than the no-WD-N group. This finding is also consistent with clinical practice. However, some patients have imaging manifestations but no neurological symptoms in clinical practice, and the accuracy of determining whether a patient has neurological symptoms by looking at them for imaging damage is not very high, which has drawn our attention.

Therefore, this study employed a machine learning approach to predict the onset of neurological symptoms in WD patients, in an attempt to find more sensitive predictive indicators. In this experiment, five machine learning methods, namely RF, SVM, XGB, logistic regression, and BP neural network, were used to construct a prediction model, and the results indicated that the XGB model achieved the best prediction performance among the five machine learning models. Finally, the SHAP model interpretation package was utilized to analyze the features of the XGB classifier. It was found that the most important predictors in the prediction model were brainstem damage, Cr, age, IBIL, and CP. The brainstem or truncus cerebri forms the intermediate part of the vertebrate central nervous system, and it is rostral to the diencephalon and caudal to the spinal cord (35). Brainstem damage is correlated with multiple neurological symptoms of ataxia, dysarthria, gait abnormalities, and muscle spasms (36–38). Additionally, age is also an important predictor of neurological symptoms in WD patients. Considering the previous findings that the average age of N-WD is older than that of no-N-WD (27), it can be speculated that the likelihood of neurological symptoms in WD patients increases with age. A study has found that the bilirubin/albumin ratio is an important risk factor for the occurrence of hepatic encephalopathy, and reducing free bilirubin may provide a new approach to the treatment of hepatic encephalopathy. It can be seen that indirect bilirubin is also closely related to neurological symptoms (39). Finally, it was found that CP is an important predictor of WD. Combined with our finding that CP values in the N-WD group were lower than those in the no-N-WD group, it can be speculated that the lower the CP, the more likely patients are to develop neurological symptoms.

To sum up, the presence of brainstem damage and the higher the values of Cr, Age, and IBIL, the more likely neurological symptoms were to occur, while the lower the CP value, the more likely neurological symptoms were to occur. Therefore, when the patient is older, it is important to be alert to the occurrence of neurological symptoms; When patients experience brainstem damage or low CP values, we should actively intervene to prevent neurological symptoms from occurring. During daily care, patients should be advised to reduce their intake of foods that can cause elevated Cr and IBIL levels, and actively reduce Cr and IBIL when it is high to prevent the occurrence of neurological symptoms.

However, our study still has some limitations. Firstly, the study does not include external validation, which increases the risk of overfitting. Secondly, the number of samples in this study queue does not match well, so there exists selection bias and recall bias. Finally, this prediction model can only predict the presence of neurological symptoms and cannot evaluate the severity of neurological symptoms. Therefore, our future work aims to address the issues of data scarcity and imbalance by using methods such as oversampling or data augmentation.

Conclusions

In conclusion, the prediction model for predicting neurological symptoms in WD, constructed by using the XGB method in machine learning, achieved good performance, and furthermore, we will continue to collect patient data to keep the model updated. Meanwhile, the SHAP method was used to analyze clinical indicators and their impact on neurological symptoms. The results revealed that the most important factors in the prediction model were brainstem damage, Cr, age, IBIL, and CP. And the presence of brainstem damage and the higher the values of Cr, Age, and IBIL, the more likely neurological symptoms were to occur, while the lower the CP value, the more likely neurological symptoms were to occur. It provides assistance for clinical decision-making.

Data availability statement

The training data presented in the study is included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the First Affiliated Hospital of Anhui University of Chinese Medicine (Approval number: 2021AH-66). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin.

Author contributions

YY: Writing – original draft, Writing – review & editing, Data curation, Formal analysis, Methodology, Project administration, Resources, Supervision, Validation. G-AW: Writing – original draft, Writing – review & editing, Data curation, Methodology, Supervision. SF: Writing – original draft, Writing – review & editing. XL: Data curation, Methodology, Writing – original draft, Writing – review & editing. YD: Data curation, Methodology, Supervision, Writing – original draft, Writing – review & editing. YS: Writing – original draft, Writing – review & editing, Formal analysis, Project administration. WH: Writing – original draft, Writing – review & editing, Formal analysis, Project administration, Validation. ZR: Writing – original draft, Writing – review & editing, Data curation, Methodology, Supervision. KD: Writing – original draft, Writing – review & editing, Data curation, Formal analysis, Methodology, Project administration. XZ: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft. WY: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by the Regional Innovation and Development Joint Fund of NSFC (No. U22A20366), the National Natural Science Foundation of China (No. 82305185), the Anhui Province Clinical Medical Research Transformation Special Project (2022042951070-2006), the Collaborative Innovation Project of Anhui Colleges and Universities (No. GXXT-2020-025), the project of Anhui Provincial Department of Education (2022AH050495), and the University Natural Science Research Project of Anhui Province, China under grant (2023AH050998).

Acknowledgments

We would like to thank the Imaging Center of The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine for its instrument support. At the same time, we would like to express our gratitude to the teachers from the School of Information and Artificial Intelligence at Anhui Agricultural University for their technical support in data processing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2024.1418474/full#supplementary-material

References

1. Członkowska A, Litwin T, Dusek P, Ferenci P, Lutsenko S, Medici V, et al. Wilson disease. Nat Rev Dis Prim. (2018) 4:21. doi: 10.1038/s41572-018-0018-3

PubMed Abstract | Crossref Full Text | Google Scholar

2. Nagral A, Sarma MS, Matthai J, Kukkle PL, Devarbhavi H, Sinha S, et al. Wilson's Disease: Clinical practice guidelines of the Indian national association for study of the Liver, the Indian society of pediatric gastroenterology, hepatology and nutrition, and the movement disorders society of India. J Clin Exp Hepatol. (2019) 9:74–98.

Google Scholar

3. Beyzaei Z, Mehrzadeh A, Hashemi N, Geramizadeh B. The mutation spectrum and ethnic distribution of Wilson disease, a review. Molec Genet Metabol Rep. (2024) 38:101034. doi: 10.1016/j.ymgmr.2023.101034

PubMed Abstract | Crossref Full Text | Google Scholar

4. Pfeiffer RF. Wilson's disease. Semin Neurol. (2007) 27:123–32. doi: 10.1055/s-2007-971173

PubMed Abstract | Crossref Full Text | Google Scholar

5. Guindi M. Wilson disease. Semin Diagn Pathol. (2019) 36:415–22. doi: 10.1053/j.semdp.2019.07.008

PubMed Abstract | Crossref Full Text | Google Scholar

6. Volpert HM, Pfeiffenberger J, Gröner JB, Stremmel W, Gotthardt DN, Schäfer M, et al. Comparative assessment of clinical rating scales in Wilson's disease. BMC Neurol. (2017) 17:140. doi: 10.1186/s12883-017-0921-3

PubMed Abstract | Crossref Full Text | Google Scholar

7. Zhan T, Guan Y, Sun C, Wang L, Wang Y, Assessment XLi, et al. and factors affecting quality of life among patients with Wilson's disease. Sci Rep. (2024) 14:8636. doi: 10.1038/s41598-024-59377-w

PubMed Abstract | Crossref Full Text | Google Scholar

8. Karantzoulis S, Heuer K, Sparling N, Meltzer B, Teynor M. Exploring the content validity of the unified wilson disease rating scale: insights from qualitative research. Adv Ther. (2024) 41:2070–82. doi: 10.1007/s12325-024-02833-w

PubMed Abstract | Crossref Full Text | Google Scholar

9. Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. (2020) 9:14. doi: 10.1167/tvst.9.2.14

PubMed Abstract | Crossref Full Text | Google Scholar

10. Abdollahi M, Ashouri S, Abedi M, Azadeh-Fard N, Parnianpour M, Khalaf K, et al. Using a motion sensor to categorize nonspecific low back pain patients: a machine learning approach. Sensors. (2020) 20:3600. doi: 10.3390/s20123600

PubMed Abstract | Crossref Full Text | Google Scholar

11. Swaminathan S, Qirko K, Smith T, Corcoran E, Wysham NG, Bazaz G, et al. A machine learning approach to triaging patients with chronic obstructive pulmonary disease. PLoS ONE. (2017) 12:e0188532. doi: 10.1371/journal.pone.0188532

PubMed Abstract | Crossref Full Text | Google Scholar

12. Rehman RZU, Del Din S, Guan Y, Yarnall AJ, Shi JQ, Rochester L, et al. Selecting clinically relevant gait characteristics for classification of early parkinson's disease: a comprehensive machine learning approach. Sci Rep. (2019) 9:17269. doi: 10.1038/s41598-019-53656-7

PubMed Abstract | Crossref Full Text | Google Scholar

13. Amboni M, Ricciardi C, Adamo S, Nicolai E, Volzone A, Erro R, et al. Machine learning can predict mild cognitive impairment in Parkinson's disease. Front Neurol. (2022) 13:1010147. doi: 10.3389/fneur.2022.1010147

PubMed Abstract | Crossref Full Text | Google Scholar

14. Chang CH, Lin CH, Lane HY. Machine learning and novel biomarkers for the diagnosis of Alzheimer's disease. Int J Mol Sci. (2021) 22:2761. doi: 10.3390/ijms22052761

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chen K, Wan Y, Mao J, Lai Y, Zhuo-Ma G, Hong P, et al. Liver cirrhosis prediction for patients with Wilson disease based on machine learning: a case-control study from southwest China. Eur J Gastroenterol Hepatol. (2022) 34:1067–73. doi: 10.1097/MEG.0000000000002424

PubMed Abstract | Crossref Full Text | Google Scholar

16. Liver EAfSo. EASL clinical practice guidelines: Wilson's disease. J Hepatol. (2012) 56:671–85. doi: 10.1016/j.jhep.2011.11.007

PubMed Abstract | Crossref Full Text | Google Scholar

17. Wang, L., Huang, C., Wang, M., Xue, Z., and Wang, Y. NeuroPred-PLM: an interpretable and robust model for neuropeptide prediction by protein language model. Brief Bioinform. (2023) 24:bbad077. doi: 10.1093/bib/bbad077

PubMed Abstract | Crossref Full Text | Google Scholar

18. Xu H, Zhao Z. NetBCE: an interpretable deep neural network for accurate prediction of linear B-cell epitopes. Gen Proteom Bioinform. (2022) 20:1002–12. doi: 10.1016/j.gpb.2022.11.009

PubMed Abstract | Crossref Full Text | Google Scholar

19. Yang Z, Lv H, Xu Z, Wang X. Source discrimination of mine water based on the random forest method. Sci Rep. (2022) 12:19568. doi: 10.1038/s41598-022-24037-4

PubMed Abstract | Crossref Full Text | Google Scholar

20. Yan Z, Wang J, Dong Q, Zhu L, Lin W, Jiang X, et al. XGBoost algorithm and logistic regression to predict the postoperative 5-year outcome in patients with glioma. Ann Transl Med. (2022) 10:860. doi: 10.21037/atm-22-3384

PubMed Abstract | Crossref Full Text | Google Scholar

21. Salas-Zárate R, Alor-Hernández G, Salas-Zárate MDP, Paredes-Valverde MA, Bustos-López M, Sánchez-Cervantes JL, et al. Detecting depression signs on social media: a systematic literature review. Healthcare (Basel). (2022) 10:291. doi: 10.3390/healthcare10020291

PubMed Abstract | Crossref Full Text | Google Scholar

22. Das A. Logistic regression. in Encyclopedia of Quality of Life and Well-Being Research, Cham: Springer International Publishing (2024). p. 3985–3986. doi: 10.1007/978-3-031-17299-1_1689

Crossref Full Text | Google Scholar

23. Li X, Wang J, Yang C. Risk prediction in financial management of listed companies based on optimized BP neural network under digital economy. Neural Comput Applic. (2023) 35:2045–58. doi: 10.1007/s00521-022-07377-0

Crossref Full Text | Google Scholar

24. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In: 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA. (2017).

Google Scholar

25. Fushiki T. Estimation of prediction error by using K-fold cross-validation. Stat Comput. (2011) 21:137–46. doi: 10.1007/s11222-009-9153-8

Crossref Full Text | Google Scholar

26. Liashchynskyi P, Liashchynskyi P. Grid search, random search, genetic algorithm: a big comparison for NAS. arXiv preprint arXiv:1912.06059 (2019).

Google Scholar

27. Zhang S, Yang W, Li X, Pei P, Dong T, Yang Y, et al. Clinical and genetic characterization of a large cohort of patients with Wilson's disease in China. Transl Neurodegener. (2022) 11:13. doi: 10.1186/s40035-022-00287-0

PubMed Abstract | Crossref Full Text | Google Scholar

28. Heider P, Pelisek J, Poppert H, Eckstein HH. Evaluation of serum matrix metalloproteinases as biomarkers for detection of neurological symptoms in carotid artery disease. Vasc Endovascular Surg. (2009) 43:551–60. doi: 10.1177/1538574409334826

PubMed Abstract | Crossref Full Text | Google Scholar

29. Maltais D, Desroches D, Aouffen M, Mateescu MA, Wang R, Paquin J, et al. The blue copper ceruloplasmin induces aggregation of newly differentiated neurons: a potential modulator of nervous system organization. Neuroscience. (2003) 121:73–82. doi: 10.1016/S0306-4522(03)00325-7

PubMed Abstract | Crossref Full Text | Google Scholar

30. Zhao X, Shao Z, Zhang Y, Liu F, Liu Z, Liu Z, et al. Ceruloplasmin in Parkinson's disease and the nonmotor symptoms. Brain Behav. (2018) 8:e00995. doi: 10.1002/brb3.995

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zanardi A, Alessio M. Ceruloplasmin deamidation in neurodegeneration: from loss to gain of function. Int J Mol Sci. (2021) 22:663. doi: 10.3390/ijms22020663

PubMed Abstract | Crossref Full Text | Google Scholar

32. Joshi G, Dhingra D, Tekchandani U, Kaushik S. Kayser-Fleischer ring in Wilson's disease. QJM. (2019) 112:629. doi: 10.1093/qjmed/hcz027

PubMed Abstract | Crossref Full Text | Google Scholar

33. Degirmenci C, Palamar M. Evaluation and grading of Kayser-Fleischer ring in Wilson disease by Scheimpflug camera. Eur J Ophthalmol. (2021) 31:2116–20. doi: 10.1177/1120672120931025

PubMed Abstract | Crossref Full Text | Google Scholar

34. Arora N, Bhat P, Goel R, Pannu AK, Malhotra P, Suri V, et al. Kayser-Fleischer ring. QJM. (2020) 113:361. doi: 10.1093/qjmed/hcz221

PubMed Abstract | Crossref Full Text | Google Scholar

35. Nieuwenhuys R. The structural, functional, and molecular organization of the brainstem. Front Neuroanat. (2011) 5:33. doi: 10.3389/fnana.2011.00033

PubMed Abstract | Crossref Full Text | Google Scholar

36. Lai YY, Siegel JM. Brainstem-mediated locomotion and myoclonic jerks. I Neural substrates. Brain Res. (1997) 745:257–64. doi: 10.1016/S0006-8993(96)01177-8

PubMed Abstract | Crossref Full Text | Google Scholar

37. Ferrante E, Marazzi MR, Trimboli M, Dalla Costa D, Erminio C, Nobili L, et al. Brainstem lesion causing paroxysmal ataxia, dysarthria, diplopia and hemifacial spasm (PADDHS). Epileptic Disord. (2019) 21:389–90. doi: 10.1684/epd.2019.1080

PubMed Abstract | Crossref Full Text | Google Scholar

38. Saposnik G, Caplan LR. Convulsive-like movements in brainstem stroke. Arch Neurol. (2001) 58:654–7. doi: 10.1001/archneur.58.4.654

PubMed Abstract | Crossref Full Text | Google Scholar

39. Li Y, Liu H, Chen K, Wu X, Wu J, Yang Z, et al. Pathological significance and prognostic roles of indirect bilirubin/albumin ratio in hepatic encephalopathy. Front Med. (2021) 8:706407. doi: 10.3389/fmed.2021.706407

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Wilson disease, machine learning, neurological symptom, prediction model, eXtreme Gradient Boosting (XGB)

Citation: Yang Y, Wang G-A, Fang S, Li X, Ding Y, Song Y, He W, Rao Z, Diao K, Zhu X and Yang W (2024) Decoding Wilson disease: a machine learning approach to predict neurological symptoms. Front. Neurol. 15:1418474. doi: 10.3389/fneur.2024.1418474

Received: 16 April 2024; Accepted: 28 May 2024;
Published: 19 June 2024.

Edited by:

Eishi Asano, Wayne State University, United States

Reviewed by:

Wen-Xiong Chen, Guangzhou Medical University, China
Abeer El-Sayyid El-Bashbishy, Mansoura University, Egypt

Copyright © 2024 Yang, Wang, Fang, Li, Ding, Song, He, Rao, Diao, Zhu and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenming Yang, eWFuZ3dtODgxMEAxMjYuY29t; Xiaolei Zhu, eGx6aHVfbWRsQGhvdG1haWwuY29t

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.