- 1Laboratory of Brain Disorders, Collaborative Innovation Center for Brain Disorders, Beijing Institute of Brain Disorders, Capital Medical University, Ministry of Science and Technology, Beijing, China
- 2Suzhou Guangji Hospital, The Affiliated Guangji Hospital of Soochow University, The Institute of Mental Health, Suzhou, China
- 3Beijing Key Laboratory of Mental Disorders, The National Clinical Research Center for Mental Disorders, Beijing Anding Hospital, Beijing Institute for Brain Disorders Center of Schizophrenia, Capital Medical University, Beijing, China
- 4Department of Psychiatry, First Affiliated Hospital of Henan University of Science and Technology, Luoyang, China
- 5Beijing Advanced Innovation Centre for Biomedical Engineering, Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, The School of Biological Science and Medical Engineering, Beihang University, Beijing, China
- 6Department of Psychology, Peking University, Beijing, China
- 7Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China
- 8Institute of Military Cognition and Brain Sciences, Academy of Military Medical Sciences, Beijing, China
Background: The search for a method that utilizes biomarkers to identify patients with schizophrenia from healthy individuals has occupied researchers for decades. However, no single indicator can be employed to achieve the good in clinical practice. We aim to develop a comprehensive machine learning pipeline based on neurocognitive and electrophysiological combined features for distinguishing schizophrenia patients from healthy people.
Methods: In the present study, 69 patients with schizophrenia and 50 healthy controls participated. Neurocognitive (contains seven specific domains of cognition) and electrophysiological [prepulse inhibition, electroencephalography (EEG) power spectrum, detrended fluctuation analysis, and fractal dimension (FD)] features were collected, all these features were taken together to generate the identification models of schizophrenia by applying logistics, random forest, and extreme gradient boosting algorithm. The classification capabilities of these models were also evaluated.
Results: Both the neurocognitive and electrophysiological feature sets showed a good classification effect with the highest accuracy greater than 85% and AUC greater than 90%. Specifically, the performances of the combined neurocognitive and electrophysiological feature sets achieved the highest accuracy of 93.28% and AUC of 97.91%. The extreme gradient boosting algorithm as a whole presented more stably and precisely in classification efficiency.
Conclusion: The highest classification accuracy of 93.28% by combination of neurocognitive and electrophysiological features shows that both measurements are appropriate indicators to be used in discriminating schizophrenia patients and healthy individuals. Also, among three algorithms, extreme gradient boosting had better classified performances than logistics and random forest algorithms.
Introduction
Schizophrenia is one of the most severe mental disorders, affecting 20 million people worldwide (1). Extensive studies show that cognitive deficits are one of the core features of significant neurological dysfunction associated with schizophrenia and are typically associated with a poor prognosis (2, 3). In addition, cognitive deficits are no less predictive of schizophrenia and its level than positive and negative symptoms, if not better (4, 5). Several specific areas of cognition can be assessed by a neurocognitive measure battery, which usually involves the speed of processing, attention, working memory, and verbal learning (6).
In addition, pre-pulse inhibition (PPI) is considered an indicator that reflects information processing deficits in patients with schizophrenia, which is based on electrophysiological measures (7, 8). Most previous studies show that PPI is reduced in schizophrenic patients and their unaffected first-degree relatives (9). However, in comparison to neurocognitive measurements, PPI presents only a moderate effect size (Cohen’s d < 0.8) (10). Yang et al. (11) reports a novel PPI paradigm involving attentional enhancement effects of PPI while providing a more significant effect size (Cohen’s d > 1.2). Additionally, electroencephalography (EEG) is a non-invasive electrophysiological measure widely applied to assess the neural response of the brain to external stimulation. The EEG power spectrum describes the distribution of power into each frequency band and is commonly used in schizophrenia research (12, 13). Extensive research shows that patients with chronic schizophrenia have abnormal EEG frequencies at rest compared with healthy individuals (12, 14–16). Meanwhile, more advanced EEG analytical methods have been studied in recent years, such as detrended fluctuation analysis (DFA), a fractal analytical method to quantify long-range temporal correlations (LRTCs) in power-law form. A previous study reports strongly reduced LRTCs in both alpha and beta frequency bands in patients with schizophrenia (17).
The current gold standard for schizophrenia diagnosis is built on the International Classification of Diseases, 11th Revision (ICD-11) or the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5). These diagnostic methods rely on descriptive psychopathology, which, to some extent, reflects the subjective judgment of psychiatrists. Therefore, there is an urgent need for clinicians to have an objective measure of characteristics. Nevertheless, frustratingly, due to the heterogeneity of the etiology and clinical variability, excellent biomarkers for the diagnosis of schizophrenia are still lacking. In fact, there is no single indicator that can be adopted in clinical practice. In recent years, the role of machine learning in auxiliary diagnosis has received increasing attention throughout the field of schizophrenia research. In translational medicine and clinical practices, these methods are widely involved in exploration for presymptomatic screening, prognostic prediction, and supporting treatment decisions (18). However, to date, there is a paucity of literature about building machine learning models on neurocognitive and electrophysiological biomarkers.
This study developed a comprehensive machine learning pipeline based on neurocognitive (contains seven specific areas of cognition) and electrophysiological [PPI, EEG power spectrum, detrended fluctuation analysis, and fractal dimension (FD)] features by using logistics, random forest, and extreme gradient boosting (XGBoost) algorithms and evaluated their classification capabilities separately.
Materials and Methods
Experimental Subjects
This study enrolled 69 patients with schizophrenia and 50 healthy controls. The diagnosis was established by the researchers from interviews using the Structured Clinical Interview for DSM-IV (SCID) and supplemented by clinical notes. All subjects were right-handed, and their audiometric assessments (pure tone audiometry, 1,000 Hz) were normal. The inclusion criteria for the enrollment of patients with schizophrenia are as follows: 1) all clinically stable subjects had no history of neurological disorders or head trauma, 2) no history of electroconvulsive therapy within the past 6 months, and 3) no history of alcohol/drug dependence or abuse (except tobacco). Patients were excluded because of unstable medical conditions or IQ below 70. During this study, all patients received antipsychotic treatments as usual. The healthy control group (CON) consisted of subjects matched to the schizophrenia group (SCZ) in terms of gender, age, years of education, and smoking history. The exclusion criteria for the CON include substance abuse, suicidal risk, major head trauma, and neuropsychiatric disorders. Before signing the informed consent, each subject received a detailed description of the aims and procedures for participation in the study. The independent ethics committee of Beijing Anding Hospital approved the study. The psychopathological status of the patients was also assessed by the Positive and Negative Syndrome Scale (PANSS). The demographic and clinical characteristics of SCZ and CON are summarized in Table 1.
Neurocognitive Assessments
The neurocognitive function of the subjects was assessed using the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS; Chinese version) (19). The RBANS assesses five separate cognitive domains: immediate memory (IMM), delayed memory (DEM), visuospatial and constructional (VC), attention (ATT), and language functioning (LAN). Besides RBANS, the Stroop Color-Word Test (Chinese version) (20) was also administered. Each subject was asked to complete two interference tasks, and color (INT-C) and word (INT-W) interference times were recorded. In this part of the experiment, five RBANS features (IMM, DEM, VC, ATT, LAN) and two Stroop features (INT-C, INT-W) were extracted.
Electrophysiological Assessments
Prepulse Inhibition Measures
Subjects were comfortably seated in a reclining chair with their arms fully relaxed in a natural position. Acoustic startle measured through electromyogram (EMG) signal was recorded from the right orbiculate oculi muscle. Electrode impedances were maintained at <5 kΩ. The eye-blink component of auditory startle reflexes was quantified by the human EMG startle reflexes system (EMG XEYE human startle reflex, Tian Ming Hong Yuan Instruments Company, Beijing, China). In addition, the EMG was bandpass filtered to 100–1,000 Hz and amplified 10,000 times. Acoustic startle stimuli were presented binaurally through two headphones. Acoustic signals were characterized by a sound-level meter (AUDit and System 824, Larson Davis, United States).
PPI was tested according to the same paradigm used in previous research (11, 21). In short, the precedence effect was utilized to generate two different perceived spatial relations between the prepulse and background sound: perceptual separation and perceptual colocation. A more detailed description of the PPI paradigm and related theories are available in previous studies (11, 21). Finally, two features were extracted from this test (perceived spatial colocation PPI, PSC-PPI; perceived spatial separation PPI, PSS-PPI).
Electroencephalography Recording and Processing
Electroencephalography Data Preprocessing
Subjects were comfortably seated in a reclining chair. Then, they closed their eyes and remained relaxed and quiet for 5 min. Continuous EEG was digitized at 1,000 Hz using the EGI EEG system (EGI, Electrical Geodesics, Inc., America) with 128-electrode HydroCelnet referenced to the vertex (Cz). Off-line preprocessing of EEG data was conducted by using EEGLAB (v2019.1) (22) and FieldTrip (23) toolboxes in MATLAB (MATLAB Release 2017b, MathWorks, Inc.). EEG raw data was first resampled to a 500-Hz sampling rate and bandpass filtered to 0.5–45 Hz. For each subject, artifact removal was administered using both continuous raw data and independent component analysis (ICA, algorithm: runica) within EEGLAB. ICA components were classified using an EEGLAB plugin ICLabel tool (24). Eye movement, blink, heartbeat, muscular activity, or other artifacts were distinguished from the ICA data. The EEG data were then manually inspected to verify artifact removal. The bad electrodes were replaced with interpolated data from the remaining electrodes. Finally, all electrodes were rereferenced to an average reference.
Power Spectrum Features
The power spectral density (PSD) of each electrode was evaluated using the Fast Fourier Transform (FFT, Welch method, 2s sliding window, 50% overlap, 0.5-Hz frequency step), yielding an EEG spectrum ranging from 0.5 to 45 Hz. The frequency bands were selected as follows: delta (1.0–4.0 Hz), theta (4.0–8.0 Hz), alpha (8.0–14.0 Hz), beta (14.0–30.0 Hz). D, T, A, and B denote delta, theta, alpha, and beta frequency bands, respectively. AL and AR were computed by averaging the power in the alpha band for the left (Fp1, F3, C3, P3, O1, F7, T3, T5) and right hemispheres (Fp2, F4, C4, P4, O2, F8, T4, T6). (D + T)L and (D + T)R were averaged by summing the power of delta and theta in the left and right hemispheres. AFp and AO were calculated by averaging the alpha band power for the Fp channels (Fp1, Fp2) and the O channels (O1, O2). The absolute power (Abs) and relative power (Rel) in each frequency band were computed for each electrode. A/T ratio, A/B ratio, (D + T)/(A + B) ratio, (D + T)/(A + B) ratio, (D + T)L/(D + T)R ratio, AFp/AO ratio were calculated for Abs as well as Rel. In PSD, 20 features were extracted. For detailed features information, see Table 2.
Detrended Fluctuation Analysis Features
Detrended fluctuation analysis is an analytical method based on scale-free theory for estimating long-range temporal correlations (LRTCs) in power-law form (25). That is, if a time series data has a non-random temporal structure with slowly decaying autocorrelations, DFA can quantify the rate of decay of these correlations as indexed by the DFA power-law exponent. Some evidence suggests that the DFA reflects brain maturation and may prove useful as a potential biomarker for the pathophysiology of neurodevelopmental disorders (26). DFA calculation was performed using the Neurophysiological Biomarker Toolbox (NBT).1 First, all electrodes were filtered in delta, theta, alpha, and beta oscillations, respectively. Then, the amplitude envelope was generated from each frequency band. Finally, the DFA value for each electrode was estimated per participant and stored for each frequency band separately. DFA-D, DFA-T, DFA-A, DFA-B were computed by averaging all electrodes in delta, theta, alpha, and beta frequency bands.
Fractal Dimension Features
Brain complexity can be described as the highly structured temporal structure observed in the EEG signal between pure randomness (e.g., white noise) and the absence of variability (constancy or pure periodicity). The EEGLAB plugin myFractal2 was used to calculate FD for each electrode. Finally, the FD feature was extracted by averaging the FD value of all electrodes.
Statistical Analyses
Statistics were performed in RStudio (Version 1.2.5033, RStudio, Inc., Boston, United States) with R software (Version 3.6.3). The demographic and primary clinical data include gender, age, years of education, smoking history, duration of illness, age of illness onset, chlorpromazine equivalent doses, PANSS total score, PANSS positive score, PANSS negative score, and PANSS general psychopathology score. All demographic and clinical variables except gender and smoking history were expressed as means and SDs. The independent t-test and the chi-square test were conducted to evaluate potential differences in demographic, clinical variables neurocognitive, and electrophysiological between CON and SCZ. The false discovery rate (FDR) was computed to adjust P-values for multiple testing based on the Benjamini-Hochberg method (27). P < 0.05 (two-tailed) was considered as indicative of statistical significance. PASS version 11.0 (NCSS, LLC., Kaysville, UT, United States) was used for statistical power calculation.
Classification
All analyses were carried out using R 3.6.3 software. To choose the optimal features for SCZ and CON classification, the classification ability of each feature was first evaluated. A self-compiled function was used to compute Cohen’s d-values of each feature. Receiver operating characteristic curve (ROC) analysis of each feature was created using the package of pROC in R. ROC values of every feature containing accuracy, sensitivity, specificity, and area under ROC curve (AUC) was used for analyses.
All features were then divided into two different categories: neurocognitive and electrophysiological sets. The rfe function (R caret package) was used for feature selection by the multivariate recursive feature elimination method (28). The main idea of multivariate recursive feature elimination is to build the model repeatedly and then select the best (or worst) features, put the selected features aside, and then repeat the process over the remaining features until all the features have been traversed. In this process, the order to be eliminated is the order of the features. The rfe was first fitted to all features using the bagged tree algorithm. Each feature was ranked according to its importance to the model. In each iteration of feature selection, the ranked features were retained, the model was refitted, and performance was assessed. The final selection of features in each set was based on 10-fold cross-validation.
Then, two optimal subsets (neurocognitive and electrophysiological selected feature sets) and one combination feature set (containing two subsets) were obtained. After feature selection, the logistics algorithm (R stats package), the random forest algorithm (29) (R randomForest package), and the Extreme Gradient Boosting XGBoost algorithm (30) (R xgboost package) were utilized to estimate the classification models from the three features sets. When dealing with medium-sized structured data or table data, it is generally considered that the algorithm based on decision tree is the best. Random forest is an ensemble method to build decision trees. Intuitively speaking, each decision tree is a classifier, and then, for an input sample, N trees have N classification results. The random forest integrates all the classified voting results and specifies the classification with the most votes as the final output. The classification results of several weak classifiers are voted to form a strong classifier, which is the idea of the random forest. XGBoost is another ensemble machine learning algorithm that uses a decision tree as a weak classifier and then integrates these weak classifiers into a strong classifier. In the process of integration, different weights are usually given according to classification accuracy of weak classifiers. Moreover, after adding weak classifiers, the data is usually reweighed to strengthen the classification. Shortly after the XGBoost was put forward, 17 of the 29 champions in Kaggle Data Challenge 2015 used the XGBoost method, which defeated the neural network method.
Finally, the performance of these models is validated using the 10-fold cross-validation method. The results of the validation were then averaged. The classification performance was evaluated by accuracy, sensitivity, and specificity. Besides this, the performance of each model was also evaluated using ROC curves.
Results
Demographics and Clinical Characteristics
SCZ (N = 69) and CON (N = 50) were well matched for gender, age, years of education, and smoking history. There were no significant differences between SCZ and CON in the distribution of these characteristics (Table 1).
Statistical Comparisons of All Extracted Features Between Schizophrenia Group and Control Group
The statistical analysis results of all features are presented in Table 2. The means and standard deviations of all features are shown. In total, all neurocognitive features were statistically different between SCZ and CON. It can also be observed that all PPI features differed significantly between the two groups. Among the EEG power spectrum features, only absolute theta power (Abs-T), absolute power AFp/AO ratio (Abs-AFp/AO), relative delta power (Rel-D), and relative power theta (Rel-T) showed statistically significant differences. In addition, the DFA of CON is lower than that of SCZ (DFA-T) in the theta band as well as both alpha and beta bands of CON being significantly higher than that of SCZ (DFA-A, DFA-B). None of the FDs differed significantly between SCZ and CON. Among all features, the P-values for IMM, LAN, ATT, DEM, PSS-PPI, and DFA-B were less than 0.001.
Cohen’s d and Classification Performance of Single Feature
To choose optimal features to distinguish SCZ from CON, the Cohen’s d and the ROC values were first evaluated for each feature. The ROC values included accuracy (%), sensitivity (%), specificity (%), and AUC (%). Table 3 shows these indexes from the neurocognitive and electrophysiological features. As can be seen in Table 3, a total of 14 features had d-values that exceeded the range of medium effect sizes (Cohen’s d = 0.5) (31), which is generally consistent with the statistical result. These include IMM, VC, LAN, ATT, DEM, INT-C, INT-W, PSC-PPI, PSS-PPI, Abs-T, Abs-AFp/AO, Rel-T, DFA-A, and DFA-B. Among them, the Cohen’s d-values of LAN, ATT, DEM, and PSS-PPI were all greater than 0.8, indicating a large effect size, whereas the Cohen’s d-value of IMM reached 1.42, suggesting a minimal effect size. In Table 3, it is evident that the neurocognitive features performed better than the electrophysiological features in terms of ROC values. IMM was the most suitable neurocognitive feature for classification with an accuracy of 84.03%, and its AUC reached 91.87%. Five neurocognitive features (IMM, LAN, ATT, DEM, and INT-W) had AUC values greater than 70%, demonstrating functional discrimination capacities of features (32). The PSS-PPI was the best potential electrophysiological feature with an accuracy of 80.67% and an AUC of 84.32%. All four electrophysiological features (PSC-PPI, PSS-PPI, DFA-A, and DFA-B) had AUCs greater than 70%.
Classification Performances of Combined Features
As mentioned in the methods, the REF method was used to select the neurocognitive and electrophysiological feature sets that could optimally distinguish between CON and SCZ. In addition, 10-fold cross-validation was used for feature selection to prevent overfitting. Finally, the neurocognitive selected features (NSF) subset contained immediate memory, delayed memory, attention, language functioning, color interference time, and word interference time. The electrophysiological selected features (ESF) subset contained PSC-PPI, PSS-PPI, Abs-T, Abs-A, Abs-AFp/AO, Abs-(D + T)/(A + B), Rel-D, Rel-T, Rel-A/B, DFA-A, and DFA-B. All selected features (ASF) included two subsets as described above. Then, the logistics, random forest and XGBoost algorithms were implemented to build classification models from the NSF subset, ESF subset, and ASF set. On this basis, these models were evaluated using a 10-fold cross-validation method.
Finally, the fitted values for the probability of people with schizophrenia ranged from 0 to 1. Accuracy, sensitivity, and specificity were calculated by setting cutoff points at 0.5 of the fitted values, and then the ROC curves were evaluated and AUCs were calculated based on the fitted values. These model evaluation indicators (accuracy, sensitivity, specificity, and AUC) are shown in Table 4. Figure 1 shows the ROC curves for each model to represent the differences of ROC curves more clearly. As can be seen from Table 4 and Figure 1, the model containing all selected feature sets exhibits the best classification performance regardless of the algorithm used (logistics accuracy of 87.39%, random forest and XGBoost accuracy of 93.28%). The NSF subset models and the ESF subset models showed roughly comparable levels of classification accuracy, but the NSF subset models performed better than the ESF subset in terms of AUC values. Besides this, the XGBoost algorithms performed more consistently and accurately in both accuracy and AUC among all the compared algorithms. The ASF set model with the XGBoost algorithm achieved the highest accuracy of 93.28% and an AUC of 97.91%. Statistical power was calculated by tests for one ROC curve procedure in PASS software. The power analysis showed that 50 patients and 69 healthy subjects were sufficient to have more than 90% statistical power at a two-sided alpha of 0.05 for significance level (Supplementary Table 1).
Figure 1. Receiver Operator Characteristics (ROC) curves for classification of schizophrenia patients and controls based on different combinations of features using logistics (A), random forest (B), and XGBoost algorithm (C). NSF subset, Neurocognitive Selected Features subset include IMM, LAN, ATT, DEM, INT-C, INT-W features; ESF subset, Electrophysiological Selected Features subset include PSC-PPI, PSS-PPI, Abs-T, Abs-A, Abs-AFp/AO, Abs-(D+T)/(A+B), Rel-D, Rel-T, Rel-A/B, DFA-A, DFA-B. ASF set, All Selected Features set include NSF subset and ESF subset. The red, green and blue lines show the ROC curves for the NSF subset, ESF subset and ASF set, respectively. The AUC of logistics models based on NSF subset, ESF subset, ASF set was 89.88%, 90.84%, 92.54%. The AUC of random forest models based on NSF subset, ESF subset, ASF set was 96.59 91.88%, 97.36%. The AUC of XGBoost models based on NSF subset, ESF subset, ASF set was 93.99%, 90.52%, 97.91%.
To better present the differences in the classification ability of these three algorithms, scatterplots were drawn with the fitted values of the NSF subset as the horizontal coordinates and the fitted values of the ESF subset as the vertical coordinates (Figure 2). It is apparent that the fitted values of the random forest and XGBoost algorithm are more densely distributed in this plot compared to the logistics algorithm, which also suggests that the random forest and XGBoost algorithms provide better performance in distinguishing schizophrenia patients from healthy individuals.
Figure 2. The horizontal/longitudinal coordinate axis represents the probability of people with schizophrenia (%). NSF subset, Neurocognitive Selected Features subset include IMM, LAN, ATT, DEM, INT-C, INT-W features; ESF subset, Electrophysiological Selected Features subset include PSC-PPI, PSS-PPI, Abs-T, Abs-A, Abs-AFp/AO, Abs-(D + T)/(A + B), Rel-D, Rel-T, Rel-A/B, DFA-A, DFA-B; ASF set, All Selected Features set include NSF subset and ESF subset. (A–C) Scatter plots from two features set (NSF subset and ESF subset) using logistics, random forest and XGBoost models.
Discussion
Classification Performance of Neurocognition in Schizophrenia
First, this study sought to determine which feature set was more beneficial in distinguishing between schizophrenic patients and healthy control subjects to construct the model. The second aim of the project was to identify which machine learning algorithm performed better in terms of classification ability and robustness.
The highest classification accuracy was 84.03% based on a single neurocognitive feature, achieved by immediate memory. A high level of immediate memory is considered as a predictor of improved cognitive impairments in schizophrenic patients (33). This also implies a potential opportunity for evaluating the prognostic or medical effect by using an immediate memory score. Other neurocognitive indicators performed well. For instance, the delayed memory score also received an excellent classified effect (accuracy of 83.19% and AUC of 86.58%). The classification accuracy of the combined neurocognitive feature set was slightly higher than that of any single feature classifier (accuracy of 89.08% and AUC of 93.99%).
Previous studies demonstrate the importance of neurocognitive-based machine learning techniques for diagnosing schizophrenia. Vacca et al. (34) employed several machine learning techniques (logistics regression, decision tree, random forest, k-nearest neighbor, neural network, support vector machine) to distinguish between 86 schizophrenic patients and 115 healthy subjects. The best methods turned out to be support vector machine and neural network with accuracies of 87 and 84.8%. Antonucci et al. (35) investigated the discriminatory performance of genetic, environmental, and neurocognitive classifiers by using support vector classification and repeated nested cross-validation. The cognitive classifier showed an accuracy of 88.7%, followed by environmental (65.1%) and genetic (55.5%) classifiers. The findings of the two studies are in line with the current study, revealing that neurocognition is a robust indicator in differentiating schizophrenia from healthy people.
Classification Performance of Electrophysiology in Schizophrenia
PPI deficits are involved in the biological bases of schizophrenia and proposed as a potential biomarker for genetic studies with more than 50% of PPI variance attributed to genetic factors (36). Mackeprang et al. (37) argue that PPI was not affected by antipsychotic treatment and, instead, was a stable vulnerability index for schizophrenia. Mena et al. (38) used longitudinal data to examine whether PPI deficits exhibit a temporary effect in the acute phase of schizophrenia. The result suggests that the PPI remained reduced at the 3 months post-discharge assessment, implying that the PPI was a biomarker of schizophrenia. However, in practice, the PPI did not separate schizophrenic subjects from healthy controls (39). In comparison to neurocognitive and other electrophysiological measures, the PPI showed a medium Cohen’s d-value of below 0.6 (40). The previous study establishes a novel PPI paradigm by using a precedence effect–based perceived separation, which produced a Cohen’s d-value of 1.27 and an AUC of 85.2% (11). The current study analyzes whether PSC-PPI and PSS-PPI single features could distinguish SCZ from CON. The accuracy of PSS-PPI was 80.67%, and the AUC was 84.32%.
In addition, other single markers were tested for their ability to identify schizophrenia based on the EEG power spectrum, detrended fluctuation analysis, and fractal dimension. DFA is a method to evaluate the large-scale functional neural dysconnectivity of schizophrenic patients at the temporal level. It was found that DFA was substantially attenuated in both alpha and beta frequency bands in patients. DFA-B presented the highest accuracy of 71.43% and AUC of 71.94% in a single EEG feature.
Compared with the best single electrophysiological feature (PSS-PPI), the combined electrophysiological feature increased the accuracy from 80.67 to 88.24% and the AUC from 84.42 to 91.88%. Devia et al. (41) report that the EEG signals from a free-viewing paradigm distinguished patients from healthy subjects with an overall accuracy of 71%. Thilakavathi et al. (42) analyzed the EEG power spectrum and found that the vector machine classifier produced an accuracy of 88% when features were combined together. Laton et al. (43) used a combination of oddball and mismatch event-related potentials and increased accuracy from 79.8 to 84.7% for a single feature. The above findings suggest that it is essential to select the appropriate EEG feature set to better distinguish between schizophrenic and healthy individuals.
Classification Performance of Combined Neurocognitive and Electrophysiological Features in Schizophrenia
The results show that both neurocognitive and electrophysiological feature sets had a good performance with accuracy values greater than 80% and AUC values greater than 85%. Specifically, the combined neurocognitive and electrophysiological features delivered the highest accuracy of 93.28% and AUC of 97.91%. In fact, the XGBoost algorithm as a whole presented a more stable and accurate classification efficiency in this study. As previous research indicates, XGBoost had several advantages in terms of speed and accuracy over other tree-based ensemble methods, such as Random Forests, AdaBoost, and the traditional gradient boosted trees (30).
Potential Limitations
There are several potential limitations to this study. First, this is a single-center study with limited sample size. Due to the relatively small number of subjects in the study, caution should be exercised when attempting to generalize these findings to clinical applications. Second, there is a lack of drug-naïve patient groups. Drugs may have potentially confounded the findings of the classification performances of these models and clinical status. Third, it is related to the duration of the disease. Nevertheless, no correlation was found between all features and CPZe as well as duration of illness (Table 5). The last one is model interpretability. Indeed, classic statistical regression models, such as linear regression, perform better in terms of interpretability than black-box machine learning models. Perhaps it is for this reason that the acceptance of machine learning among clinicians is lacking. However, schizophrenia is likely to be etiologically heterogeneous, resulting in poor prediction performance of a linear model. Linear regression cannot model the inherent complexity of data sets (such as feature interaction). Therefore, when choosing an appropriate machine learning model, we usually need to weigh the accuracy and interpretability of the model.
In future research, to provide a simple, robust, and reliable model for detecting schizophrenia, first-episode, drug-naïve patients with schizophrenia will be recruited. Moreover, patients with depression and bipolar disorder will also be invited to participate in a future study to assess the specificity of the model and to determine whether the severity of these features varies across psychiatric disorders. Besides this, additional EEG features may be identified in the classification. Further research may also be conducted to evaluate the efficacy of interventions and prognosis.
Conclusion
In this study, a comprehensive machine learning pipeline was provided to detect patients with schizophrenia by applying logistics, random forest, and extreme gradient boosting (XGBoost) algorithm classifiers to neurocognition, electrophysiology, and their combination. The highest classification accuracy of 93.28% was achieved by combining neurocognitive and electrophysiological features, suggesting that these measurements are appropriate indicators for discriminating schizophrenia patients from healthy individuals. Also, among these three algorithms, XGBoost has better classification performances than the other two algorithms. These results suggest that neurocognitive and electrophysiological features can be used along with machine learning for potential clinical applications.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Ethics Statement
The studies involving human participants were reviewed and approved by the Ethics Committee of Beijing Anding Hospital. The patients/participants provided their written informed consent to participate in this study.
Author Contributions
G-ZY, C-YW, and MF designed the study. LL provided the PPI paradigm for the study. QT, N-BY, YF, FD, Q-JB, and F-CZ acquired and analyzed the data. J-CZ supplied the machine learning algorithm for the study. QT wrote the article, which all authors reviewed. All authors approved the final version to be published and can certify that no other individuals not listed as authors have made substantial contributions to the manuscript.
Funding
This study was funded by the National Natural Science Foundation of China (81971250), Jiangsu Province Key Research & Development Project (BE2018662 and BE2020764), Beijing Municipal Administration of Hospitals Clinical Medicine Development of Special Funding Support (ZYLX201807), Prevention and Control Program of Suzhou Major Diseases (Gwzx201801), the Project of Invigorating Health Care through Science, Technology and Education, Suzhou Medical Youth Talent (kjxw2018048), People’s Livelihood Projects of Suzhou Science and Technology Bureau (SYSD2018168 and SYSD2018166), and Suzhou Clinical Medical Center for Mood Disorders (No. Szlcyxzx202109 to Xiao-Bin Zhang).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We are grateful to Xiao-Bin Zhang for providing funding support and manuscript writing advice.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2022.810362/full#supplementary-material
Footnotes
References
1. Vos T, Abajobir AA, Abate KH, Abbafati C, Abbas KM, Abd-Allah F, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. (2017) 390:1211–59.
2. Bowie CR, Reichenberg A, Patterson TL, Heaton RK, Harvey PD. Determinants of real-world functional performance in schizophrenia subjects: correlations with cognition, functional capacity, and symptoms. Am J Psychiatry. (2006) 163:418–25. doi: 10.1176/appi.ajp.163.3.418
3. Green MF, Kern RS, Braff DL, Mintz J. Neurocognitive deficits and functional outcome in schizophrenia: are we measuring the “right stuff”? Schizophrenia Bull. (2000) 26:119–36.
4. Galderisi S, Rossi A, Rocca P, Bertolino A, Mucci A, Bucci P, et al. The influence of illness−related variables, personal resources and context−related factors on real−life functioning of people with schizophrenia. World Psychiatry. (2014) 13:275–87.
5. Galderisi S, Rossi A, Rocca P, Bertolino A, Mucci A, Bucci P, et al. Pathways to functional outcome in subjects with schizophrenia living in the community and their unaffected first-degree relatives. Schizophr Res. (2016) 175:154–60.
6. Nuechterlein KH, Barch DM, Gold JM, Goldberg TE, Green MF, Heaton RK. Identification of separable cognitive factors in schizophrenia. Schizophr Res. (2004) 72:29–39.
7. Klamer D, Svensson L, Fejgin K, Palsson E. Prefrontal NMDA receptor antagonism reduces impairments in pre-attentive information processing. Eur Neuropsychopharmacol. (2011) 21:248–53. doi: 10.1016/j.euroneuro.2010.10.010
8. Swerdlow NR, Bhakta SG, Light GA. Room to move: plasticity in early auditory information processing and auditory learning in schizophrenia revealed by acute pharmacological challenge. Schizophr Res. (2018) 199:285–91. doi: 10.1016/j.schres.2018.03.037
9. Kumari V, Das M, Zachariah E, Ettinger U, Sharma T. Reduced prepulse inhibition in unaffected siblings of schizophrenia patients. Psychophysiology. (2005) 42:588–94. doi: 10.1111/j.1469-8986.2005.00346.x
10. Light GA, Swerdlow NR, Rissling AJ, Radant A, Sugar CA, Sprock J, et al. Characterization of neurophysiologic and neurocognitive biomarkers for use in genomic and clinical outcome studies of schizophrenia. PLoS One. (2012) 7:e39434. doi: 10.1371/journal.pone.0039434
11. Yang NB, Tian Q, Fan Y, Bo QJ, Zhang L, Li L, et al. Deficits of perceived spatial separation induced prepulse inhibition in patients with schizophrenia: relationships to symptoms and neurocognition. BMC Psychiatry. (2017) 17:135. doi: 10.1186/s12888-017-1276-4
12. Grin-Yatsenko VA, Ponomarev VA, Pronina MV, Poliakov YI, Plotnikova IV, Kropotov JD. Local and widely distributed EEG activity in schizophrenia with prevalence of negative symptoms. Clin Eeg Neurosci. (2017) 48:307–15.
13. Boutros NN, Arfken C, Galderisi S, Warrick J, Pratt G, Iacono W. The status of spectral EEG abnormality as a diagnostic test for schizophrenia. Schizophr Res. (2008) 99:225–37.
14. Koshiyama D, Miyakoshi M, Tanaka-Koshiyama K, Joshi YB, Molina JL, Sprock J, et al. Neurophysiologic characterization of resting state connectivity abnormalities in schizophrenia patients. Front Psychiatry. (2020) 11:608154. doi: 10.3389/fpsyt.2020.608154
15. Tanaka-Koshiyama K, Koshiyama D, Miyakoshi M, Joshi YB, Molina JL, Sprock J, et al. Abnormal spontaneous gamma power is associated with verbal learning and memory dysfunction in schizophrenia. Front Psychiatry. (2020) 11:832. doi: 10.3389/fpsyt.2020.00832
16. Yadav S, Haque NS, Das B, Das J, Tikka SK. Resting state quantitative electroencephalogram gamma power spectra in patients with first episode psychosis: an observational study. Asian J Psychiatr. (2021) 57:102550. doi: 10.1016/j.ajp.2021.102550
17. Nikulin VV, Jönsson EG, Brismar T. Attenuation of long-range temporal correlations in the amplitude dynamics of alpha and beta neuronal oscillations in patients with schizophrenia. Neuroimage. (2012) 61:162–9.
18. Li W, Yang Y, Liu ZH, Zhao YJ, Zhang Q, Zhang L, et al. Progression of mental health services during the COVID-19 outbreak in china. Int J Biol Sci. (2020) 16:1732–8. doi: 10.7150/ijbs.45120
19. Wang JH, Li CB, Cheng Y, Yi ZH, Long B, Wang JJ. Reliability and validity of repeatable battery for the assessment of neuropsychological status (RBANS) in schizophrenic patients: a preliminary study. Shanghai Arch Psychiatry. (2009) 21:265–8.
20. Xue W, Yanping R, Jiali H, Dongfeng Z. Evaluation of cognitive functions and its corelation with psychiatric symptoms in patients with schizophrenia. J Capital Med Univ. (2008) 4:423–7.
21. Li XB, Bo QJ, Tian Q, Yang NB, Mao Z, Zheng W, et al. Impact of childhood trauma on sensory gating in patients with first-episode schizophrenia. BMC Psychiatry. (2018) 18:258. doi: 10.1186/s12888-018-1807-7
22. Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods. (2004) 134:9–21. doi: 10.1016/j.jneumeth.2003.10.009
23. Oostenveld R, Fries P, Maris E, Schoffelen JM. FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput Intell Neurosci. (2011) 2011:156869. doi: 10.1155/2011/156869
24. Pion-Tonachini L, Kreutz-Delgado K, Makeig S. ICLabel: an automated electroencephalographic independent component classifier, dataset, and website. Neuroimage. (2019) 198:181–97. doi: 10.1016/j.neuroimage.2019.05.026
25. Hardstone R, Poil SS, Schiavone G, Jansen R, Nikulin VV, Mansvelder HD, et al. Detrended fluctuation analysis: a scale-free view on neuronal oscillations. Front Physiol. (2012) 3:450. doi: 10.3389/fphys.2012.00450
26. Smit DJ, de Geus EJ, van de Nieuwenhuijzen ME, van Beijsterveldt CE, van Baal GC, Mansvelder HD, et al. Scale-free modulation of resting-state neuronal oscillations reflects prolonged brain maturation in humans. J Neurosci. (2011) 31:13128–36. doi: 10.1523/JNEUROSCI.1678-11.2011
27. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B (Methodological). (1995) 57:289–300.
28. Guan W, Zhou M, Hampton CY, Benigno BB, Walker LD, Gray A, et al. Ovarian cancer detection from metabolomic liquid chromatography/mass spectrometry data by support vector machines. BMC Bioinformatics. (2009) 10:259. doi: 10.1186/1471-2105-10-259
30. Chen T, Guestrin C. Xgboost: a scalable tree boosting system. Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM (2016).
32. Elith J, Phillips SJ, Hastie T, Dudík M, Chee YE, Yates CJ. A statistical explanation of MaxEnt for ecologists. Divers Distrib. (2011) 17:43–57.
33. Xiu MH, Guan HY, Zhao JM, Wang KQ, Pan YF, Su XR, et al. Cognitive enhancing effect of high-frequency neuronavigated rTMS in chronic schizophrenia patients with predominant negative symptoms: a double-blind controlled 32-Week follow-up study. Schizophr Bull. (2020) 46:1219–30. doi: 10.1093/schbul/sbaa035
34. Vacca A, Longo R, Mencar C. Identification and evaluation of cognitive deficits in schizophrenia using “Machine learning”. Psychiatr Danub. (2019) 31:261–4.
35. Antonucci LA, Pergola G, Pigoni A, Dwyer D, Kambeitz-Ilankovic L, Penzel N, et al. A pattern of cognitive deficits stratified for genetic and environmental risk reliably classifies patients with schizophrenia from healthy control subjects. Biol Psychiatry. (2020) 87:697–707. doi: 10.1016/j.biopsych.2019.11.007
36. Anokhin AP, Heath AC, Myers E, Ralano A, Wood S. Genetic influences on prepulse inhibition of startle reflex in humans. Neurosci Lett. (2003) 353:45–8. doi: 10.1016/j.neulet.2003.09.014
37. Mackeprang T, Kristiansen KT, Glenthoj BY. Effects of antipsychotics on prepulse inhibition of the startle response in drug-naive schizophrenic patients. Biol Psychiatry. (2002) 52:863–73. doi: 10.1016/s0006-3223(02)01409-9
38. Mena A, Ruiz-Salas JC, Puentes A, Dorado I, Ruiz-Veguilla M, De la Casa LG. Reduced prepulse inhibition as a biomarker of schizophrenia. Front Behav Neurosci. (2016) 10:202. doi: 10.3389/fnbeh.2016.00202
39. Massa N, Owens AV, Harmon W, Bhattacharya A, Ivleva EI, Keedy S, et al. Relationship of prolonged acoustic startle latency to diagnosis and biotype in the bipolar-schizophrenia network on intermediate phenotypes (B-SNIP) cohort. Schizophr Res. (2019) 216:357–66. doi: 10.1016/j.schres.2019.11.013
40. Light GA, Swerdlow NR. Future clinical uses of neurophysiological biomarkers to predict and monitor treatment response for schizophrenia. Ann N Y Acad Sci. (2015) 1344:105–19. doi: 10.1111/nyas.12730
41. Devia C, Mayol-Troncoso R, Parrini J, Orellana G, Ruiz A, Maldonado PE, et al. EEG classification during scene free-viewing for schizophrenia detection. IEEE T Neur Sys Reh. (2019) 27:1193–9.
42. Thilakavathi B, Devi SS, Malaiappan M, Bhanu K. EEG power spectrum analysis for schizophrenia during mental activity. Australas Phys Eng S. (2019) 42:887–97.
Keywords: schizophrenia, neurocognition, electrophysiology, electroencephalography, prepulse inhibition (PPI), biomarker, machine learning, classification
Citation: Tian Q, Yang N-B, Fan Y, Dong F, Bo Q-J, Zhou F-C, Zhang J-C, Li L, Yin G-Z, Wang C-Y and Fan M (2022) Detection of Schizophrenia Cases From Healthy Controls With Combination of Neurocognitive and Electrophysiological Features. Front. Psychiatry 13:810362. doi: 10.3389/fpsyt.2022.810362
Received: 06 November 2021; Accepted: 21 February 2022;
Published: 05 April 2022.
Edited by:
Junpeng Zhang, Sichuan University, ChinaReviewed by:
Takefumi Ueno, Hizen Psychiatric Center (NHO), JapanMary V. Seeman, University of Toronto, Canada
Copyright © 2022 Tian, Yang, Fan, Dong, Bo, Zhou, Zhang, Li, Yin, Wang and Fan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Guang-Zhong Yin, yingzgjyy@163.com; Chuan-Yue Wang, wang_cy@ccmu.edu.cn; Ming Fan, fanmingchina@126.com