- 1Department of General Practice, School of Medicine, Second Affiliated Hospital, Zhejiang University, Hangzhou, China
- 2Epilepsy Center, School of Medicine, Second Affiliated Hospital, Zhejiang University, Hangzhou, China
- 3School of Computer Science, Fudan University, Shanghai, China
- 4Department of Neurology, School of Medicine, Second Affiliated Hospital, Zhejiang University, Hangzhou, China
- 5Department of Neurosurgery, School of Medicine, Second Affiliated Hospital, Zhejiang University, Hangzhou, China
- 6Shanghai Pudong New Area Mental Health Center, Tongji University School of Medicine, Shanghai, China
Purpose: We are aiming to build a supervised machine learning-based classifier, in order to preoperatively distinguish focal cortical dysplasia (FCD) from glioneuronal tumors (GNTs) in patients with epilepsy.
Methods: This retrospective study was comprised of 96 patients who underwent epilepsy surgery, with the final neuropathologic diagnosis of either an FCD or GNTs. Seven classical machine learning algorithms (i.e., Random Forest, SVM, Decision Tree, Logistic Regression, XGBoost, LightGBM, and CatBoost) were employed and trained by our dataset to get the classification model. Ten features [i.e., Gender, Past history, Age at seizure onset, Course of disease, Seizure type, Seizure frequency, Scalp EEG biomarkers, MRI features, Lesion location, Number of antiepileptic drug (AEDs)] were analyzed in our study.
Results: We enrolled 56 patients with FCD and 40 patients with GNTs, which included 29 with gangliogliomas (GGs) and 11 with dysembryoplasic neuroepithelial tumors (DNTs). Our study demonstrated that the Random Forest-based machine learning model offered the best predictive performance on distinguishing the diagnosis of FCD from GNTs, with an F1-score of 0.9180 and AUC value of 0.9340. Furthermore, the most discriminative factor between FCD and GNTs was the feature “age at seizure onset” with the Chi-square value of 1,213.0, suggesting that patients who had a younger age at seizure onset were more likely to be diagnosed as FCD.
Conclusion: The Random Forest-based machine learning classifier can accurately differentiate FCD from GNTs in patients with epilepsy before surgery. This might lead to improved clinician confidence in appropriate surgical planning and treatment outcomes.
Introduction
Focal cortical dysplasia (FCD) is a distinctive malformation of cortical development that is highly associated with refractory epilepsy. Around 12–40% of patients with FCD were submitted to surgery for refractory epilepsy (1). Neuropathological findings have also reported glioneuronal tumors (GNTs) are another important cause of refractory epilepsy in children and young adults, including gangliogliomas (GGs) and dysembryoplastic neuroepithelial tumors (DNTs) (2). Previous studies have demonstrated that patients with FCD and GNTs have different postoperative seizure outcomes. Up to 80% patients with GNTs could achieve seizure-free during the follow-up. However, only 40–50% patients with FCD experienced no seizures after surgery (3, 4). As for the surgical protocol in many patients with FCD, wide cortical resection over the MRI-delineated lesion with invasive electroencephalography is frequently recommended, due to obscure histologic boundary and poor prognosis (5). In contrast, recent studies on tumor-associated epilepsy have emphasized that total surgical resection of the tumor is sufficient and effective for seizure control in most patients with GNTs (6). Thus, it is crucial to make the differential diagnosis of FCD and GNTs preoperatively. However, their clinical manifestation and imaging findings could be similar, especially in cases of mass-like FCD (7). What' more, type III FCD was accompanied by an additional brain lesion as noted in the classification system by the International League Against Epilepsy (ILAE) (8).
Some factors have been reported to differentiate FCD from GNTs before surgery. Rácz's et al. study indicated that age at epilepsy onset was younger in patients with FCD compared to those with GNTs (9). Though 18F-FDG PET can't contribute to the differentiation of FCD and GNTs, 11C-methionine PET identified a significant difference between them (10). But, up to now, 11C-methionine PET was unavailable in most hospitals worldwide. Several surface EEG biomarkers were also revealed to be significantly correlated with an underlying cortical dysplasia (11). Despite these factors, it remained a challenge to effectively differentiate FCD from GNTs preoperatively.
Machine learning, as an important branch of artificial intelligence, has been applied to automated seizure detection (12), prediction of antiepileptic drugs (AEDs) response (13), pre-surgical planning and surgical outcome prediction (14). In the present study, we adopted supervised machine learning-based algorithms to train the classifier to differentiate FCD from GNTs, using seven representative classification algorithms, i.e., Decision Tree (15), Random Forest (16), Logistic Regression (17), Support Vector Machine (SVM) (18), XGBoost (19), Catboost (20), and LightGBM (21). In addition, we included several features to predict different pathological results, attempting to identify the most valuable feature. Based on our results, one can use the trained classifier to make a diagnosis prediction of FCD or GNTs before surgery, hence helping clinicians to make better surgical planning.
Methods
Patients and Definitions
The study was retrospectively conducted from January 2013 to December 2018 in the Second Affiliated Hospital of Zhejiang University, a tertiary referral hospital in Zhejiang, China. All the patients underwent epilepsy surgery at the Epilepsy Center. Inclusion criteria were as follows: (1) patients were diagnosed as epilepsy according to the guidelines for the Classification and Diagnosis of Epilepsy of ILAE (22). (2) the neuropathologic diagnosis of either FCD or GNTs was established by two senior neuropathologists (8, 23), discrepancies were discussed and resolved by verification from a third senior neuropathologist. (3) all the patients underwent a non-invasive pre-surgical evaluation, including long-term video-EEG monitoring, high-resolution MRI with epilepsy sequence and PET-CT for some of them; for patients whose surgical protocols were with difficulties, invasive evaluation with the stereo-electroencephalography was carried out. Among 308 patients who underwent epilepsy surgery from January 2013 to December 2018 in our center, 98 patients met inclusion criteria, 2 patients did not obtain informed consent, and 96 patients were included in the final analysis. Informed consents were obtained from all the participants, and the study was approved by the Second Affiliated hospital of Zhejiang University School of Medicine Ethics Committee.
The information of ten features (i.e., Gender, Past history, Age at seizure onset, Course of disease, Seizure type, Seizure frequency, Scalp EEG biomarkers, MRI features, Lesion location, Number of AEDs) were recorded. Past history included encephalitis, perinatal brain injury, febrile convulsion, traumatic brain injuries and other known secondary causes. Seizure type was classified according to the new operational classification by ILAE (24), and seizure frequency was grouped into four categories: every few years, once a year, once few months and several times a month (25). Video EEG (VEEG) was performed using digital VEEG systems (Nicolet, VIASYS, United States and Biologic, NATUS, United States), with scalp electrodes placed according to the international 10–20 system. All the patients were monitored for at least 24 h. For the patients with long-term monitoring, the first 24 h recordings were chosen without AEDs tapering. Two EEG experts were blind to the MRI results and underlying histopathology. EEG recordings were evaluated in both referential and bipolar montages, and positive biomarkers were considered to be present when consensus between two independent EEG experts was achieved. The positive biomarkers of FCD were defined as the presence of continuous epileptiform discharges, two types of rhythmic epileptiform discharges, polyspikes, repetitive activity and polyspikes, frequent rhythmic bursting epileptiform activity, or repetitive discharges according to Epitashvili's et al. study (11). Examples of positive EEG biomarkers were shown in (Supplementary Figure 1). As for MRI protocols, patients were conducted on a 3.0T scanner (MR750, GE Healthcare, United States) with an 8-channel brain phased array coil. High resolution coronal T2-weighted images perpendicular to the long axis of the hippocampus were acquired using spoiled gradient echo sequence with TR/TE = 5,518/176 ms, flip angle = 110°, slice thickness = 2 mm, matrix = 512 × 512. Sagittal 3D T1-weighted images were acquired using brain volume imaging (BRAVO) sequence with TR/TE = 8.2/3.2 ms, TI = 450 ms, flip angle = 12°, slice thickness = 1 mm, matrix = 256 × 256. High resolution axial T2-weighted images and fluid-attenuated inversion-recovery (FLAIR) sequence were also obtained. Contrast-enhanced images T1-and T2-weighted images were obtained, if necessary. Typical MRI characteristics of FCD type I included subtle white matter signal changes and regional reduction of the white matter volume. Typical MRI characteristics of FCD type II included focal cortical thickening, blurring of the gray-white matter interface, focally increased signal on T2-weighted imaging, and a tail of increased signal from the cortex to an underlying ventricle on T2-weighted imaging (transmantle sign) (26, 27). GGs usually presents as a cyst with an enhancing mural nodule, with minimal to no surrounding edema and no significant mass effect. Foci of calcification are frequent (40–50%) in GGs and areas of cortical dysplasia can be seen adjacent to the tumor (28, 29). On MRI, DNTs appear well-demarcated and frequently wedge shaped, hypointense on T1WI, and hyperintense on T2WI, lack of edema and mass effect. Calcifications can be seen in 20% DNTs, and 20% DNTs have nodular or ring-like enhancement (28, 29). Typical characteristics of GGs and DNTs on MRI imaging were considered as typical characteristics of GNTs (28, 29). Typical MRI characteristics of FCD or GNTs were considered when consensus between two independent neuroradiologists was achieved. Examples of MRI for FCD and GNTs were shown in (Supplementary Figure 2).
Machine Learning
Our work was aiming to build a binary classification model capable of distinguishing FCD from GNTs. The process of the supervised machine learning-based analysis included the following steps, i.e., data preprocessing, feature selection, algorithm selection, parameter tuning, and performance evaluation. The method was as the same as our previous study (13). The workflow of data preparation and machine-learning based modeling was shown in the (Supplementary Figure 3).
Data Preprocessing
In our analysis, 56 patients with FCD and 40 with GNTs were recruited. To solve the unbalanced sample problem, we over-sampled the minority type to 56 by using the SMOTE technique (30). (https://www.jair.org/index.php/jair/article/view/10302). Then we randomly split the entire dataset into a training and validation dataset and a test dataset. The training and validation dataset were used to train and validate the prediction model, while the test dataset was applied to evaluate the prediction performance of the trained model. We used 50% of patients for training and validation, the rest for test. The aim of the training and validation stage is to find an optimal set of parameters that can achieve the highest prediction performance. We further applied the 5-fold cross-validation method by randomly dividing the training and validation dataset into 5 subsets with equal sample sizes. The cross-validation process was repeated for 5 rounds. For each round, one of the 5 subsets were retained as the validation data to evaluate the model, and the remaining 4 subsets were used for training. We have made our dataset available to the public via Harvard Dataverse (https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/6F7QPP).
Algorithm Selection and Parameter Tuning
For machine learning algorithm selection, we included classical algorithms such as Random Forest, SVM, Decision Tree and Logistic Regression, as well as new algorithms, i.e., XGBoost, LightGBM, and CatBoost. For each algorithm, we should determine an optimal set of parameters. Based on the training and validation dataset, we applied grid search to go through the parameter space, which covers a finite set of parameter combinations. For each parameter combination, we evaluated the model's prediction performance using the training and validation dataset. We record the parameters leading to the highest F1-score. To train and evaluate the classification model (31), we used the scikit-learn library, a representative open source machine learning toolkit, written in the Python programming language. This library supports a number of supervised machine learning algorithms, including Decision Tree, Random Forest, Logistic Regression, Support Vector Machine (SVM), XGBoost, Catboost, and LightGBM. After selecting a specified algorithm, the scikit-learn library is able to process the training and validation dataset to obtain a classification model. Then this model can be further applied to the test dataset.
Performance Evaluation
Based on the test dataset, we used precision, recall, F1-score, and the AUC (Area Under the ROC Curve) value to evaluate the predictive performance of our trained model (32). Precision was the fraction of patients with FCD who were finally diagnosed with FCD. Recall was the fraction of patients with FCD who have been adequately identified by the model. F1-score was the harmonic mean of precision and recall, with its best value at 1 and worst value at 0. F1-score was calculated as follows:
From the perspective of clinicians, high precision means that our prediction rarely over-reports or over-represents the fraction of patients with predicted FCD who are in fact diagnosed with FCD. Meanwhile, high recall means the fraction of patients with FCD who are uncovered accurately. A higher value of F1-score indicates a better overall predictive performance of a classifier. AUC is another important metric for evaluating a classification model's performance, which denotes the probability that a machine learning algorithm will rank higher of a random positive instance than a randomly chosen negative instance. The value of AUC is between 0 and 1. For a perfect classifier, the AUC value will be 1. For a completely random classifier, the AUC value will be 0.5. If the AUC value is smaller than 0.5, we could invert all the outputs of the classifier and obtain a new AUC value larger than 0.5. An AUC value close to 1 indicates that the model is good at distinguishing FCD from GNTs.
Statistical Analysis
Statistical analysis was performed using python. Continuous variable (course of disease) with normal distribution was represented as mean ± standard deviation (SD), non-normal variable (age at seizure onset) was reported as median [interquartile range (IQR)]. Categorical variables were described in the form of frequency and percentage. Independent student's t-test were conducted to compare the means of the continuous variables with normal distribution while Welch's t-test was used if the data was not normally distributed. Chi-Squared (χ2) Statistics was used to compare the frequencies of categorical variables between FCD and GNTs Groups. And we calculated the Chi-Square (χ2) Statistics to evaluate the dependence of each selected feature on different pathological results (33). A larger χ2 value indicated a better discriminative power of the feature. A value of p < 0.05 was considered significant. All the tests were two tailed.
Results
Patient Characteristics
A total of 96 patients who underwent epilepsy surgery were analyzed in our study, including 56 patients with FCD (FCD I: n = 16; FCD II: n = 40) and 40 patients with GNTs (GG: n = 29; DNTs = 11). Ten features were reviewed and recorded; the details were shown in Table 1. The median age at seizure onset (months) in FCD group was much lower than that in GNTs group (77 vs. 155, P = 0.002, also see Figure 1); Course of disease (months) in FCD group was longer than that in GNTs group, but not statistically significant (105 vs. 69, P = 0.12, also see Figure 1). Thirty-five (62.5%) patients with FCD showed scalp EEG biomarkers of FCD, whereas only 13 (32.5%) patients with GNTs had the positive biomarkers (p = 0.04, also see Figure 1). Thirty six (64.3%) patients with FCD had typical MRI characteristics of FCD, and 29 (72.5%) patients in GNTs group had typical MRI characteristics of GNTs (p < 0.001, also see Figure 1). As for AEDs, 37 (66.0%) patients in FCD group were taking more than 3 kinds of AEDs, while only 6 (15.0%) patients in GNTs group were taking 3 or more kinds of AEDs (p < 0.001, also see Figure 1). However, there were no significant differences in gender, past history, seizure type, seizure frequency, and lesion location between two groups with FCD and GNTs.
Figure 1. The comparison of patients with FCD and GNTs in terms of different features. FCD, focal cortical dysplasia; GNTs, glioneuronal tumors; AEDs, antiepileptic drugs.
Machine Learning Algorithms Used to Distinguish FCD From GNTs
With the current dataset, we adopted supervised machine learning algorithms to preoperatively predict pathological diagnosis of patients with epilepsy. A wide variety of machine learning algorithms were selected to build classification models, including Random Forest, Catboost, SVM, XGBoost, LightGBM, Logistic Regression, and Decision Tree. As shown in Table 2, the F1-scores of these seven models (Random Forest, Catboost, SVM, XGBoost, LightGBM, Logistic Regression, and Decision Tree) were 0.9180, 0.9000, 0.8621, 0.8667, 0.8750, 0.8889, and 0.8000, and the AUC values were 0.9340, 0.9515, 0.9055, 0.9630, 0.8531, 0.9132, and 0.7873, respectively. The precision/positive predictive value and recall/sensitivity for each model were also described in Table 2. Overall, our data revealed that Random Forest and Catboost were most effective in distinguishing patients with FCDs from those with GNTs. Furthermore, Random Forest-based classifier achieved the highest F1-score of 0.9180 and an AUC value of 0.9340, providing the best discriminatory ability in the prediction of pathological diagnosis.
The Features Used to Distinguish FCD From GNTs
Next, the Chi-Square analysis was employed to identify the discriminative power of each feature to preoperatively make the diagnosis of FCD or GNTs. The top 5 ranked features that effectively contributed to distinguishing FCD from GNTs were Age at seizure onset, Course of disease, MRI features, Number of AEDs, and Scalp EEG biomarkers, with the Chi-square values of 1,213.000, 334.800, 19.969, 13.946, and 4.200, respectively (Table 3).
To visualize the difference between patients with FCD and GNTs, we analyzed in terms of Age at seizure onset, Course of disease, MRI features, Number of AEDs and Scalp EEG biomarkers as shown in Figure 1. Age at seizure onset was revealed to be the most discriminative feature to distinguish between patients with FCD and GNTs, meaning that younger age at seizure onset would increase the probability of the diagnosis of FCD.
Discussion
In the present study, we demonstrated that the Random Forest-based machine learning model provided the best predictive performance on distinguishing FCD from GNTs, with an F1-score of 0.9180 and AUC value of 0.9340. Of ten included features, “Age at seizure onset” was revealed to be the most discriminative feature. With this supervised machine learning-based approach, one would accurately differentiate FCD from GNTs in patients with epilepsy before surgery, allowing clinicians to make the surgical planning properly and individually.
For all the patients who underwent epilepsy surgery, the ultimate desired outcomes were complete seizure freedom without further AEDs. Therefore, accurate preoperative diagnosis of FCD or GNTs based upon clinical features was of great importance, when planning the extent of resection and choosing the invasive evaluation as noted above. With widespread use in image recognition, language processing, and data mining, machine learning-based techniques have received increasing attention in medical applications, including the use of epilepsy (14). One challenge is that there are a series of potential supervised ML algorithms which could be selected. To our knowledge, which algorithm is the most suitable one for our problem is unknown. Our study focused on the differential diagnosis of FCD and GNTs before surgery, indicating that two classification algorithms (Random Forest and Catboost) were quite effective to predict between FCD and GNTs. Particularly, the Random Forest-based model performed best in prediction. Logistic regression was a widely used statistical method with an F1-score of 0.8889 in our study, which was much lower compared to that of Random Forest. Consequently, our Random Forest-based model would be considered as a potential and powerful classifier to predict the preoperative pathological diagnosis for patients with epilepsy. Consistent with our result, Paldino et al. have indicated that the Random Forest classifier achieved 100% sensitivity and 95.4% specificity in predicting language impairment with DTI-based whole-brain tractography data from pediatric patients with malformations of cortical development (34). A later study conducted by Grinspan et al. has also demonstrated that the Random Forest classifier achieved AUCs of 84.1 and 73.4% at each center in predicting emergency department visit rates for the following year, using a combination of demographic characteristics, insurance, comorbidity, and medication data in medical records at two pediatric referral centers (35). In our study, the consistent rate between conventional preoperative diagnosis and postoperative pathology was 76%, while the consistent rate was 89.6% when preoperative Random Forest algorithm was used to predict postoperative pathology, showing a statistically significant difference (Supplementary Table 1, χ2 = 6.184, p = 0.013). As far as we know, this was the first study reporting that machine learning-based algorithms could be used to differentiate FCD from GNTs in patients with epilepsy. For the next step, we will use a larger sample to train our algorithm. One practical challenge is that different hospitals might host their patient databases on computers with different operating systems, including Windows, Linux and MacOSX. Our algorithm is implemented using the scikit-learn library (https://scikit-learn.org/stable/), which is an open source library written in the Python programming language. Thanks to the cross-platform nature of Python, our algorithm can be directly deployed on computers with any mainstream operating system without modification. Our algorithm could directly access a hospital's database of patient records, and read the patient information automatically to provide the predicted diagnosis of FCD or GNTs. In short, our algorithm has no special requirement for either the operating system or the computer hardware. It is convenient to be employed in clinical applications. If the diagnosis given by the classifier is FCD, wider cortical resection over the MRI-delineated lesion may be taken into consideration by neurosurgeons, in order to achieve favorable seizure outcomes. Furthermore, having a good knowledge of the potential postsurgical outcome may improve clinicians' and patients' confidence in epilepsy surgery.
As for the top 5 ranked features which contributed most to distinguishing FCD from GNTs in patients with epilepsy, the feature “Age at seizure onset” had the highest Chi-square value at 1,213.000, suggesting patients who have the younger age at seizure onset were more likely to be diagnosed as FCD finally. This result was consistent with the study from Rácz et al. which indicated that age at epilepsy onset was significantly earlier in patients with FCD than that in GNTs (9). The second feature “Course of disease” had the Chi-square value at 334.800, suggesting that epileptic patients with FCD had a longer course of disease compared to patients with GNTs. A possible explanation could be that GNTs group had a higher proportion (72.5%) of patients with typical characteristics of GNTs and consequently underwent surgical treatment earlier, which was also a reason for the number difference of AEDs between two groups. As the commonly used method to distinguish FCD from GNTs, “MRI” was the third feature with the Chi-square value at 19.969, which was however obviously lower than the former. Epitashvili et al. have demonstrated that six surface EEG biomarkers (continuous epileptiform discharges, two types of rhythmic epileptiform discharges, polyspikes, repetitive activity, and polyspikes, frequent rhythmic bursting epileptiform activity or repetitive discharges) were significantly associated with an underlying cortical dysplasia (11). However, the single feature “Scalp EEG biomarkers” was also shown with less significance in our study, meaning the requirement of machine learning-based comprehensive evaluation progressed from signal processing analyses.
The predictive performance of a model depends on the large scale of dataset, the number and quality of features, and the design of the algorithms. Our study had some limitations. First, the current dataset was collected at a local tertiary hospital, and the sample may not be representative of all the regions in China and other countries. In the future, a prospective multicenter study with a larger sample size should be required. Second, ten features were included in our study, however the weight of each feature in the final model differed, which possibly increased the risk of overfitting or bias. Finally, some features were not included in this work, such as multiple seizure types, other MRI sequences (DTI) and PET-CT finding. The diagnostic validity of machine learning-based approach was associated with comprehensive parameters, thereby more features were considered, the higher level of performance we would achieve.
Conclusion
Taken together, this study highlighted the potential of a supervised machine learning-based model to differentiate FCD from GNTs in patients with epilepsy before surgery, contributing to appropriate surgical planning. With the availability and convenience of this model, clinicians will benefit from the novel approach in clinical applications.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The studies involving human participants were reviewed and approved by the Second Affiliated hospital of Zhejiang University School of Medicine Ethics Committee. Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.
Author Contributions
YG and CHS: conceptualization and writing. YSL, LJY, and YC: methodology. WJM, ZJW, and JMZ: data collecting and confirmation. MPD and CHS: project administration and supervision. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by National Natural Science Foundation of China (Grant No. 81871010; 81701266). The Outstanding Clinical Discipline Project of Shanghai Pudong (Grant No. PWYgy2018-10).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2020.548305/full#supplementary-material
References
1. Guerrini R, Dobyns WB. Malformations of cortical development: clinical features and genetic causes. Lancet Neurol. (2014) 13:710–26. doi: 10.1016/S1474-4422(14)70040-7
2. Piao YS, Lu DH, Chen L, Liu J, Wang W, Liu L, et al. Neuropathological findings in intractable epilepsy: 435 Chinese cases. Brain Pathol. (2010) 20:902–8. doi: 10.1111/j.1750-3639.2010.00386.x
3. Kim DW, Lee SK, Chu K, Park KI, Lee SY, Lee CH, et al. Predictors of surgical outcome and pathologic considerations in focal cortical dysplasia. Neurology. (2009) 72:211–6. doi: 10.1212/01.wnl.0000327825.48731.c3
4. Englot DJ, Berger MS, Barbaro NM, Chang EF. Factors associated with seizure freedom in the surgical resection of glioneuronal tumors. Epilepsia. (2012) 53:51–7. doi: 10.1111/j.1528-1167.2011.03269.x
5. Krsek P, Maton B, Jayakar P, Dean P, Korman B, Rey G, et al. Incomplete resection of focal corticaldysplasia is the main predictor of poor postsurgical outcome. Neurology. (2009) 72:217–23. doi: 10.1212/01.wnl.0000334365.22854.d3
6. Zaatreh MM, Firlik KS, Spencer DD, Spencer SS. Temporal lobe tumoral epilepsy: characteristics and predictors of surgical outcome. Neurology. (2003) 61:636–41. doi: 10.1212/01.WNL.0000079374.78589.1B
7. Raybaud C, Shroff M, Rutka JT, Chuang SH. Imaging surgical epilepsy in children. Childs Nerv Syst. (2006) 22:786–809. doi: 10.1007/s00381-006-0132-5
8. Blumcke I, Thom M, Aronica E, Armstrong DD, Vinters HV, Palmini A, et al. The clinicopathologic spectrum of focal cortical dysplasias: a consensus classification proposed by anad hoc task force of the ILAE diagnostic methods commission. Epilepsia. (2011) 52:158–74. doi: 10.1111/j.1528-1167.2010.02777.x
9. Rácz A, Müller AM, Schwerdt J, Becker A, Vatter H, Elger CE. Age at epilepsy onset in patients with focal cortical dysplasias, gangliogliomas and dysembryoplastic neuroepithelial tumours. Seizure. (2018) 58:82–9. doi: 10.1016/j.seizure.2018.04.002
10. Phi JH, Paeng JC, Lee HS, Wang KC, Cho Bk, Lee JY, et al. Evaluation of focal cortical dysplasia and mixed neuronal and glial tumors in pediatric epilepsy patients using 18F-FDG and 11C-methionine pet. J Nucl Med. (2010) 51:728–34. doi: 10.2967/jnumed.109.070920
11. Epitashvili N, San Antonio-Arce V, Brandt A, Schulze-Bonhage A. Scalp electroencephalographic biomarkers in epilepsy patients with focal cortical dysplasia. Ann Neurol. (2018) 84:564–75. doi: 10.1002/ana.25322
12. Temko A, Thomas E, Marnane W, Lightbody G, Boylan GB. Performance assessment for EEG-based neonatal seizure detectors. Clin Neurophysiol. (2011) 122:474–82. doi: 10.1016/j.clinph.2010.06.035
13. Yao L, Cai M, Chen Y, Shen C, Shi L, Guo Y. Prediction of antiepileptic drug treatment outcomes of patients with newly diagnosed epilepsy by machine learning. Epilepsy Behav. (2019) 96:92–7. doi: 10.1016/j.yebeh.2019.04.006
14. Abbasi B, Goldenholz DM. Machine learning applications in epilepsy. Epilepsia. (2019) 60:2037–47. doi: 10.1111/epi.16333
15. Quinlan JR. C4.5: Programs for Machine Learning. Burlington, MA: Morgan Kaufmann Publishers Inc. (1993). 302p.
17. Cox DR. The regression analysis of binary sequences. J Roy Stat Soc B. (1958) 20:215–42. doi: 10.1111/j.2517-6161.1958.tb00292.x
18. Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B. Support vector machines. IEEE Intell Syst Appl. (1998) 13:18–28. doi: 10.1109/5254.708428
19. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, CA: ACM (2016). p. 785–94.
20. Prokhorenkova LO, Gusev G, Vorobev A, Dorogush AV, Gulin A. CatBoost: unbiased boosting with categorical features. In: Proceedings of the Advances in Neural Information Processing System. Montreal (2018). p. 6639–9.
21. Ke G, Meng Q, Finley T, Wang TF, Chen W, Ma WD, et al. LightGBM: a highly efficient gradient boosting decision tree. In: Proceeding of the Advances in Neural Information Processing Systems. Long Beach, CA (2017). p. 3149–57.
22. Berg AT, Berkovic SF, Brodie MJ, Buchhalter J, Cross JH, van Emde Boas, et al. Revised terminology and concepts for organization of seizures and epilepsies: report of the ILAE commission on classification and terminology, 2005-2009. Epilepsia. (2010) 51:676–85. doi: 10.1111/j.1528-1167.2010.02522.x
23. Louis DN, Ohgaki H, Wiestler OD, Cavenee WK, Burger PC, Jouvet A, et al. The 2007 WHO classification of tumours of the central nervous system. Acta Neuropathol. (2007) 114:97–109. doi: 10.1007/s00401-007-0243-4
24. Fisher RS, Cross JH, French JA, Higurashi N, Hirsch E, Jansen FE, et al. Operational classification of seizure types by the international league against epilepsy: position paper of the ILAE commission for classification and terminology. Epilepsia. (2017) 58:522–30. doi: 10.1111/epi.13670
25. Choi H, Hamberger MJ, Munger Clary H, Leob R, Onchiri FM, Baker G, et al. Seizure frequency and patient-centered outcome assessment in epilepsy. Epilepsia. (2014) 55:1205–12. doi: 10.1111/epi.12672
26. Hong SJ, Bernhardt BC, Caldairou B, Hall JA, Guiot MC, Schrader D, et al. Multimodal MRI profiling of focal cortical dysplasia type II. Neurology. (2017) 88:734–42. doi: 10.1212/WNL.0000000000003632
27. Krsek P, Pieper T, Karlmeier A, Hildebrandt M, Kolodziejczyk D, Winkler P, et al. Different presurgical characteristics and seizure outcomes in children with focal cortical dysplasia type I or II. Epilepsia. (2009) 50:125–37. doi: 10.1111/j.1528-1167.2008.01682.x
28. Demir MK, Yapicier O, Yilmaz B, Kiliç T. Magnetic resonance imaging findings of mixed neuronal-glial tumors with pathologic correlation: a review. Acta Neurol Belg. (2018) 118:379–86. doi: 10.1007/s13760-018-0981-1
29. Abdel Razek AAK, Elsebaie NA, Zamora C, Castillo M. Imaging of neuronal and mixed glioneuronal tumors. J Comput Assist Tomogr. (2020) 44:356–69. doi: 10.1097/RCT.0000000000001010
30. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intrll Res. (2002) 16:321–57. doi: 10.1613/jair.953
31. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. (2011) 12:2825–30. doi: 10.5555/1953048.2078195
32. Fawcett T. An introduction to ROC analysis. Pattern Recogn Lett. (2006) 27:861–74. doi: 10.1016/j.patrec.2005.10.010
33. Yang Y, Pedersen JO. A comparative study on feature selection in text categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning. Burlington, MA: Morgan Kaufmann Publishers Inc. (1997). p. 412–20
34. Paldino MJ, Hedges K, Zhang W. Independent contribution of individual white matter pathways to language function in pediatric epilepsy patients. Neuroimage Clin. (2014) 6:327–32. doi: 10.1016/j.nicl.2014.09.017
Keywords: machine learning, epilepsy, focal cortical dysplasia, glioneuronal tumors, distinguish
Citation: Guo Y, Liu Y, Ming W, Wang Z, Zhu J, Chen Y, Yao L, Ding M and Shen C (2020) Distinguishing Focal Cortical Dysplasia From Glioneuronal Tumors in Patients With Epilepsy by Machine Learning. Front. Neurol. 11:548305. doi: 10.3389/fneur.2020.548305
Received: 02 April 2020; Accepted: 14 October 2020;
Published: 24 November 2020.
Edited by:
Fernando Cendes, Campinas State University, BrazilReviewed by:
Stefan Rampp, University Hospital Erlangen, GermanyRobert LeMoyne, Northern Arizona University, United States
Copyright © 2020 Guo, Liu, Ming, Wang, Zhu, Chen, Yao, Ding and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chunhong Shen, 2513109@zju.edu.cn; Lijun Yao, lyao@tongji.edu.cn