A novel cascade machine learning pipeline for Alzheimer’s disease identification and prediction

Zhou, Kun; Piao, Sirong; Liu, Xiao; Luo, Xiao; Chen, Hongyi; Xiang, Rui; Geng, Daoying

doi:10.3389/fnagi.2022.1073909

ORIGINAL RESEARCH article

Front. Aging Neurosci., 16 January 2023

Sec. Alzheimer's Disease and Related Dementias

Volume 14 - 2022 | https://doi.org/10.3389/fnagi.2022.1073909

This article is part of the Research TopicNeuroimaging Biomarkers and Cognition in Alzheimer’s disease Spectrum, Volume IIView all 8 articles

A novel cascade machine learning pipeline for Alzheimer’s disease identification and prediction

Kun Zhou¹

Sirong Piao²

Xiao Liu³

Xiao Luo¹

Hongyi Chen¹

Rui Xiang¹

Daoying Geng^1,2^*

¹Academy for Engineering and Technology, Fudan University, Shanghai, China
²Department of Radiology, Huashan Hospital, Fudan University, Shanghai, China
³School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China

Introduction: Alzheimer’s disease (AD) is a progressive and irreversible brain degenerative disorder early. Among all diagnostic strategies, hippocampal atrophy is considered a promising diagnostic method. In order to proactively detect patients with early Alzheimer’s disease, we built an Alzheimer’s segmentation and classification (AL-SCF) pipeline based on machine learning.

Methods: In our study, we collected coronal T1 weighted images that include 187 patients with AD and 230 normal controls (NCs). Our pipeline began with the segmentation of the hippocampus by using a modified U2-net. Subsequently, we extracted 851 radiomics features and selected 37 features most relevant to AD by the Hierarchical clustering method and Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. At last, four classifiers were implemented to distinguish AD from NCs, and the performance of the models was evaluated by accuracy, specificity, sensitivity, and area under the curve.

Results: Our proposed pipeline showed excellent discriminative performance of classification with AD vs NC in the training set (AUC=0.97, 95% CI: (0.96-0.98)). The model was also verified in the validation set with Dice=0.93 for segmentation and accuracy=0.95 for classification.

Discussion: The AL-SCF pipeline can automate the process from segmentation to classification, which may assist doctors with AD diagnosis and develop individualized medical plans for AD in clinical practice.

Introduction

Alzheimer’s disease (AD) is a progressive, irreversible degenerative brain disease characterized by memory loss and cognitive impairment (Sevigny et al., 2016; Li et al., 2019). Detecting AD at an earlier or prodromal stage is vital for preventing disease progression. Neuroimaging biomarkers may give more predictive information and make the diagnosis more reliable (Márquez and Yassa, 2019). Magnetic resonance imaging (MRI) is a type of brain imaging biomarker that can be used to detect the structures in brain volume, such as hippocampus atrophy or ventricle enlargement, and therefore the onset of Alzheimer’s disease and distinguishing normal controls (NC) from AD (Jack Jr et al., 2011; Hosseini-Asl et al., 2016).

Nowadays, machine learning and radiomics features analysis have been employed to analyze MRI data for identifying accurate biomarkers of AD (Moradi et al., 2015; Pellegrini et al., 2018; Feng et al., 2021). The most widely used classification techniques are support vector machine (SVM), artificial neural network (ANN), and deep learning. The main difference between SVM and ANN is the optimization problem. Compared with ANN, SVM is more generalizable and can obtain global optimal solutions. Among them, feature extraction is an important step (Anami and Elemmi, 2022). It is useful to combine neural networks and intelligent agents for medical image analysis according to the research of Shi et al. (Sarvamangala and Kulkarni, 2022). However, deep learning can be trained to automatically extract features from target regions without intervention (Shen et al., 2017; Chen et al., 2022). When the number of available samples is large enough, deep learning will be highly effective (Shen et al., 2017).

In recent years, researchers provided analysis of the works done using machine learning for Alzheimer’s disease. For instance, Gray et al. (2013) proposed a multi-modality classification framework derived from random forest and evaluated the framework by application to neuroimaging and biological data from the Alzheimer’s Disease Neuroimaging Initiative. Feng et al. (2021) developed three classification models to discriminate AD, mild cognitive impairment (MCI), and NC patients by extracting and analyzing 3,360 radiological features from 3D T1-weighted magnetization-prepared rapid gradient echo images. Cao et al. (2017) proposed a multicore-based downsampling and oversampling method. The sparsity of regions of interest (ROI) was achieved using marginal Fisher analysis with ℓ2.1-multiple kernel learning based on paradigms, which enabled simultaneously select subsets of relevant brain regions and improved the accuracy of AD classification. Besides, In the study of Zhou et al. (2018), Kruthika et al. (2019), Peng et al. (2019), Lin et al. (2020), and Richhariya et al. (2020), the researchers used multistage classifier, SVM, recursive feature elimination, and TrAdaBoost methods to analyze and identify AD based on MRI images, and the classification accuracy is 0.88, 0.97, 0.89, and 0.94, respectively.

Deep learning models, including convolutional neural networks (CNN), have increasingly been used for image analysis and computer vision. In the field of medical imaging, deep learning-based medical imaging applications outperform traditional methods in complex tasks (Duraisamy et al., 2019; Singh et al., 2020). Of all brain regions associated with AD, the hippocampus is one of the earliest to undergo pathology changes (Han et al., 2014; Dhikav et al., 2017; Park et al., 2022). Helaly et al. (2022) proposed a deep learning Alzheimer’s disease hippocampus segmentation framework (DL-AHS) for hippocampal segmentation to detect and identify AD. In addition, the data was augmented using deep convolutional generative adversarial networks (DC-GAN). Sun et al. (2020) proposed a V-Net-based 3D CNN to segment the bilateral hippocampus from 3D brain MRI scans and diagnosed AD progression status. Kwak et al. (2022) constructed a deep convolutional neural network to classify stable and progressive MCI and evaluated the relative contribution of each hippocampal subfield. Buvaneswari and Gayathri (2021) used the seven morphological features like grey matter, white matter, cortex surface, gyri and sulci contour, cortex thickness, hippocampus and cerebrospinal fluid to accurately classify AD and dementia conditions.

As mentioned above, promising progress has been made in AD prediction studies based on structural radiological features and machine learning. However, most studies have focused on the 3D T1 sequence, which has high resolution with long acquisition times. However, coronal T1 weighted images are more commonly applied in clinical practice. In addition, current classification algorithms are based on semi-automatic or manual marking of ROI, and there is no automatic framework from segmentation to classification. Nevertheless, a fast and accurate identification framework is extremely important for the early diagnosis of AD in routine clinical work.

Motivated by these properties and important results, we devised a contribution to the study of AD using coronal T1 weighted images. We first migrated the U²-net to the hippocampus segmentation task and added a deep supervision mechanism module to improve the model performance. Second, we applied different feature extraction methods to extract features from the segmented hippocampal regions and analyzed which features were more relevant to AD classification. Finally, we performed a binary classification task to detect if patients are healthy or have dementia on our datasets and explored the possibility of applying our model to clinical practice.

Materials and methods

The AL-SCF pipeline workflow includes the MRI preprocessing, the hippocampus segmentation, and the AD classification model. All the specific processes of the pipeline are shown in Figure 1.

FIGURE 1

Figure 1. The structure of Alzheimer’s segmentation and classification (AL-SCF). This study is mainly divided into three parts. The first part is data collection and preprocessing, the second part is hippocampus segmentation, and the third part is Alzheimer’s disease (AD) classification.

Patients

In total, 187AD patients aged 50–75 years were retrospectively selected from the electronic medical records at Huashan Hospital, Shanghai, China from April 2016 to 2021. The patient’s demographic information is provided in Table 1.

TABLE 1

Table 1. Patient demographic information.

All the AD patients were diagnosed by a qualified neurologist using criteria for amnestic AD Fennema-Notestine et al. (2009), with mini-mental state examination (MMSE) scores between 12 and 27 (inclusive) and clinical dementia rating (CDR) scores of 1 or 2. The inclusion criteria of AD patients were as follows: (1) right-handed Han Chinese patients older than 50 years; (2) clinically diagnosed as AD confirmed by qualified neurologists; (3) with available hippocampus MRI images. The exclusion criteria of AD patients included: (1) presented structural abnormalities such as cortical infarction, tumor, or subdural hematoma; (2) low-quality MRI scanning. 230 age and sex-matched normal controls (NC) with hippocampus MRI images were also enrolled.

All participants were scanned using a standard eight-channel head coil on a 3 Tesla MR scanner (Discovery 750, GE Healthcare, United States). Foam pads and headphones were used to minimize participants’ head motion and scanner noise. In the model construction, coronal T1 weighted images were used. The parameters were as follows: repetition time (TR) = 225 ms, echo time (TE) = 24 ms, inversion time (TI) = 780 ms, slice thickness = 4 mm. The field of view was 240 mm × 240 mm, and the matrix was 512 × 512, resulting in the in-plane resolution of 0.47 mm × 0.47 mm.

Hippocampus annotation

All MRI images were first resampled to voxel size 1 mm × 1 mm × 1 mm with a resolution of 224 × 224 × 16 by linear interpolation method. Then, we performed the image grayscale normalization with the image grayscale unified adjustment to [0,255] and cropped all images to 128 × 128 × 16.

Hippocampus was annotated manually by two neuroradiologists, both with 5 years of experience using ITK-SNAP software.¹ All segmentation results were corrected by a senior neuroradiologist with 15 years of clinical experience. In this study, the dataset was randomly divided into the training set (n = 334) and test set (n = 83) with a ratio of 8:2.

Segmentation model

The segmentation model consists of an encoder and a decoder, which is a two-level nested structure, and the top level is a large U-shaped structure consisting of five stages (Qin et al., 2020). Specifically, in the decoder of our model, a deep supervision mechanism (Lee et al., 2015) was used to improve the performance of our model. It can upsample the feature maps to the same size and fuse them to generate the final segmentation results with a concatenation operation followed by a 1 × 1 convolution layer.

For training the segmentation model, the epoch was set to 250 and the batch size was 12. After applying augmentation techniques containing flipping, translation, and rotation to the training set, the number of data samples increased to 668. We used the Adam optimizer to update the gradient with an initial learning rate of 0.001 and momentum β_1 = 0.9. The experiments were deployed on NVIDIA Titan GPU with the Pytorch² framework.

Classification model

Feature extraction

Radiomics features were extracted by Pyradiomics (Van Griethuysen et al., 2017). According to guidelines from the Image Biomarker Standardization Initiative (IBSI; Zwanenburg et al., 2020), the extracted radiomics features included 140 first-order features, 175 morphological features, and 501 higher-order texture features, involving Gray Level Co-occurrence Matrix (GLCM), Gray Level Size Zone Matrix (GLSZM), Gray Level Run Length Matrix (GLRLM), Neighbouring Gray Tone Difference Matrix (NGTDM), and Gray Level Dependence Matrix (GLDM).

Feature selection and classification models

The hierarchical clustering method (Murtagh and Contreras, 2012) and the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm (Roth, 2004) were performed to reduce the dimensionality of features. First, hierarchical clustering was used by calculating Spearman’s correlation matrix of the extracted features and the highly correlated features should be removed. Subsequently, LASSO was applied to select representative features and remove irrelevant and redundant features which would degrade the performance of the further process. Moreover, Gaussian Naive Bayes (GNB; Kohavi, 1996), random forest (RF; Breiman, 2001), support vector machine (SVM; Graves, 2012), and AdaBoost (Dietterich, 2000) classifiers were implemented in the AD classification task.

Evaluation metrics

Dice coefficient is a statistical metric that measures the similarity between ground truth and segmentation results, which is defined as follows:

\begin{array}{l} Dice = \frac{2 \sum_{i = 1}^{N} p_{i} q_{i}}{\sum_{i = 1}^{N} p_{i}^{2} + \sum_{i = 1}^{N} q_{i}^{2}} & (1) \end{array}

where N is the number of total pixels on the segmentation results; $p_{i}$ and $q_{i}$ are the pixels of predicted segmentation result and the ground truth, respectively. For example, $Dice = 1$ means two samples are exactly overlapping.

The diagnostic performance of the classification model was assessed using receiver operation characteristics (ROC) curve analysis and measured by area under the ROC curve (AUC). Then, we defined AD patients as positive samples and obtained the calculation formulas of accuracy (ACC), sensitivity (SEN), and specificity (SPE) by the confusion matrix:

{\begin{matrix} A C C = \frac{T P + T N}{T P + T N + F P + F N} \\ S E N = \frac{T P}{T P + F N} \\ S P E = \frac{T N}{T N + F P} \end{matrix} (2)

Statistical analysis

In statistical tests of demographical and clinical characteristics, the Mann–Whitney U-test was used for numerical variables, and Fisher’s exact test was used for categorical variables. Statistical analyses were performed using SPSS (version 22.0, IBM). In addition, feature selection and radiomics signature construction and validation of the classification model were conducted using scikit-learn packages³ based on Python (Version 3.8.0; https://www.python.org).

Results

Segmentation results

A total of 668 MRI images along with their corresponding hippocampal binary masks were taken as the training set to fit into the segmentation model. Then, the model was tested for another 83 MRI images and the segmentation results are shown in Figure 2. Figure 2A are some of the sample ground-truth, and the corresponding segmentation results before and after data augmentation are shown in Figures 2B,C. It is observed that the segmentation results after data augmentation are much more identical to their corresponding ground-truth images from Figures 2D,E.

FIGURE 2

Figure 2. The segmentation results of U²-net (A–E). (A) ground truth (GT). (B) Segmentation results before data augmentation. (C) Segmentation results after data augmentation. (D) The difference between GT and segmentation results before data augmentation (red areas are the segmentation results). (E) The difference between GT and segmentation results after data augmentation (red areas are the segmentation results). Columns from left to right are the samples (Subject 1–8).

We also performed a quantitative analysis of the segmentation results. Table 2 shows that the predicted segmentation results achieved Dice = 0.95 ± 0.03 in the training set and Dice = 0.90 ± 0.02 in the test set before data augmentation. The final Dice = 0.93 ± 0.01 in our test set after data augmentation, which was significantly higher (p < 0.05) than when no data augmentation was applied at a cost of several seconds. Figure 3 shows the distribution of segmentation results in training and test sets. It is noted that the results are more concentrated and better for more difficult segmented images when applying data enhancement.

TABLE 2

Table 2. Dice coefficients of segmentation models in the training and test sets.

FIGURE 3

Figure 3. The distribution of Dice coefficients of segmentation results.

Classification results

In the part of feature extraction, 851 radiomics features were extracted for the region of the hippocampus. After using Spearman’s correlation matrix filter, 727 features remained. Then, we used the LASSO filter with 5-fold cross-validation to select 37 features for the AD classification task. The importance of 37 features were determined with the weights of LASSO, which was shown in Figure 4. It is illustrated that the shape_MajorAxisLength feature has the largest importance value of 0.84, followed by the shape_Maximum2DDiameterColumn feature with an importance value of 0.81.

FIGURE 4

Figure 4. The importance of 37 radiomics features generated from Least Absolute Shrinkage and Selection Operator (LASSO).

For all classification models, we used 5-fold cross-validation to make the results more reliable. Table 3 and Figure 5 show the classification results and ROC curves for AD classification in the training set. Among the four classifiers, the SVM classifier achieved the best performance in the classification task, with ACC = 0.97 (95%CI: 0.95–0.98), SEN = 0.97 (95%CI: 0.94–0.98), SPE = 0.96 (95%CI: 0.95–0.99), and AUC = 0.98 (95%CI: 0.96–1.00).

TABLE 3

Table 3. Classification results of different classifiers in the training set.

FIGURE 5

Figure 5. Receiver operation characteristics (ROC) curves for AD classification in the training set. (A) Support vector machine (SVM) classifier; (B) RF classifier; (C) GNB classifier; (D) Adaboost classifier.

Table 4 shows the classification results in the test set. For the performance of different classifiers, the SVM classifier got ACC = 0.95, SEN = 0.96, SPE = 0.94, and AUC = 0.97, which performs the best. In addition, we also use ROC and AUC values to evaluate different methods. In order to facilitate comparative analysis, we draw the ROC curves of different classification algorithms in the same coordinate graph, as shown in Figure 6. Among them, the ROC curve of SVM is closer to the upper left corner of the coordinate, which means the corresponding AUC value is the largest. Besides, the AUC value of GNB is significantly lower than the other three.

TABLE 4

Table 4. Classification results of different classifiers in the test set.

FIGURE 6

Figure 6. The receiver operation characteristics (ROC) curve of SVM in the test set.

Discussion

In this study, we proposed a novel machine learning pipeline to automatically segment the hippocampus and classify AD from NC. The AL-SCF pipeline achieved excellent performance in hippocampus segmentation and AD classification. The total time of the whole pipeline was less than 1 min in the test set. Thus, our AL-SCF pipeline would be helpful to improve diagnostic accuracy and assist radiologists in clinical practice.

Our method first cropped the coronal T1 weighted images to eliminate the negative effects of the low percentage of hippocampal regions in the brain. We also adopted data augmentation in the segmentation task, which could improve the Dice of hippocampus segmentation. According to Table 2, data augmentation methods are significantly effective to improve the model performance when the image samples are limited. Besides, U2-net was transferred in the hippocampus segmentation from the target detection tasks. The model not only retains the characteristics of traditional U-net but also adds a deep supervision mechanism to learn low-dimensional and high-dimensional features. The convergence of the loss function can be accelerated by calculating the difference between the segmentation results and the Ground Truth of each layer. In previous studies, Sohail and Anwar (2022) and Liu et al. (2020) proposed simple applications for U-net, with Dice of 0.87 and 0.89, respectively. Our segmentation model achieved Dice of 0.93, better than the former results.

In this study, the features of original images and the wavelet-transformed higher-order features (Gillies et al., 2016; Yip and Aerts, 2016) were collected in the feature extraction part. Among them, the wavelet transform could concentrate the energy of the original image on a small portion of wavelet coefficients and provide more useful information for feature extraction (Feng and Ding, 2020). The hierarchical clustering algorithm was used for feature selection, which effectively avoid overfitting, and the subsequent LASSO algorithm further optimized the combined form and weights of the selected features. In addition, we used the grid search method (λ = 0.05, 0.10.0.0.60) and nested 5-fold cross-validation to determine the optimal LASSO hyperparameters λ and evaluate the performance of the model. Our classification results confirmed the effectiveness of the methods.

The majority of current studies (Liu et al., 2020; Nadal et al., 2020; Park et al., 2022) used deep learning networks in AD classification while we took the classical classifiers in machine learning, which is more interpretable and can find radiomics features associated with AD. Radiomic analysis has been applied to some AD studies. Zhang et al. (2012) indicated that 3D texture analysis could distinguish AD patients from the NC group, with classification ACC between 0.64 and 0.96 due to different ROI selections. Feng et al. (2018) built a support vector machine model which achieves an ACC = 0.87 (SPE = 0.89 and SEN = 0.84) in predicting AD vs. NC. The proposed classification models based on hippocampus radiomic features in our study showed high performance (ACC = 0.95 and AUC = 0.97) in distinguishing AD-NC.

In our study, we selected the hippocampus as the ROIs and the results showed that the radiomics features of the hippocampus can be used for the diagnosis of AD from normal controls. Furthermore, we found the shape features of the hippocampus (lateral maximum diameter and axial length) are highly correlated with AD. These are similar to the findings of Shen et al. (2012) and Nadal et al. (2020) used the shape descriptors from statistical shape models (SSM) as features to classify AD from normal control cases and discovered the shape of the hippocampus can provide valuable information for the diagnosis of AD. Nadal et al. presented a brain T1-weighted structural magnetic resonance imaging (MRI) biomarker which combined several individual MRI biomarkers such as volumetric measurements, hippocampal shape, and hippocampal texture. Their experiments showed that both common and uncommon individual MRI biomarkers contributed to the classification of AD.

However, our study still has some inevitable limitations. First, although the sample size of our study is relatively larger than that of some machine learning studies (Feng et al., 2021; Liu et al., 2022), the sample size is still relatively limited. Further studies are needed to conduct prospective multi-center studies in conjunction with multiple centers. Second, our study only investigated the effect of radiomics features in the hippocampus as a biomarker for AD, the application of other potential biomarkers for prediction could be analyzed in future studies.

Conclusion

The AL-SCF pipeline is a non-invasive method that includes the automatic segmentation of the hippocampus and AD classification. The results suggest that micro-structural changes in the brain hippocampus region reflected by radiomic features can be a reliable method to assist radiologists in clinical practice. Furthermore, the proposed automatic segmentation and classification framework could be easily applied to other radiomics studies in the future.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by Huashan Hospital Institutional Review Board, Fudan University. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

KZ conducted the experiment, performed the data processing and analysis, and wrote and edited the manuscript. XLi, XLu, and HC collected the data, performed the data processing and analysis, and edited the manuscript. SP and RX collected the data and performed the data processing and analysis. DG supervised the whole study. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University (KCH2310094, 21618), and Medical Engineering Joint Fund of Fudan University.

Acknowledgments

The authors thank the study investigators and patients who participated in the original trial.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

1. ^http://www.itksnap.org/

2. ^https://pytorch.org/

3. ^https://scikit-learn.org/

References

Anami, B. S., and Elemmi, M. C. (2022). Comparative analysis of SVM and ANN classifiers for defective and non-defective fabric images classification. J. Text. Inst. 113, 1072–1082. doi: 10.1080/00405000.2021.1915559

CrossRef Full Text | Google Scholar

Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32. doi: 10.1023/A:1010933404324

CrossRef Full Text | Google Scholar

Buvaneswari, P. R., and Gayathri, R. (2021). Deep learning-based segmentation in classification of Alzheimer's disease. Arab. J. Sci. Eng. 46, 5373–5383. doi: 10.1007/s13369-020-05193-z

CrossRef Full Text | Google Scholar

Cao, P., Liu, X., Yang, J., Zhao, D., Huang, M., Zhang, J., et al. (2017). Nonlinearity-aware based dimensionality reduction and over-sampling for AD/MCI classification from MRI measures. Comput. Biol. Med. 91, 21–37. doi: 10.1016/j.compbiomed.2017.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Wang, X., Zhang, K., Fung, K. M., Thai, T. C., Moore, K., et al. (2022). Recent advances and clinical applications of deep learning in medical image analysis. Med. Image Anal. 79:102444. doi: 10.1016/j.media.2022.102444

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhikav, V., Duraiswamy, S., and Anand, K. S. (2017). Correlation between hippocampal volumes and medial temporal lobe atrophy in patients with Alzheimer's disease. Ann. Indian Acad. Neurol. 20, 29–35. doi: 10.4103/0972-2327.199903

PubMed Abstract | CrossRef Full Text | Google Scholar

Dietterich, T. G. (ed.) (2000). “Ensemble methods in machine learning” in International Workshop on Multiple Classifier. Berlin, Heidelberg: Springer, 1–15.

Google Scholar

Duraisamy, B., Shanmugam, J. V., and Annamalai, J. (2019). Alzheimer disease detection from structural MR images using FCM based weighted probabilistic neural network. Brain Imaging Behav. 13, 87–110. doi: 10.1007/s11682-018-9831-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, Q., and Ding, Z. (2020). MRI radiomics classification and prediction in Alzheimer’s disease and mild cognitive impairment: a review. Curr. Alzheimer Res. 17, 297–309. doi: 10.2174/1567205017666200303105016

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, Q., Niu, J., Wang, L., Pang, P., Wang, M., Liao, Z., et al. (2021). Comprehensive classification models based on amygdala radiomic features for Alzheimer's disease and mild cognitive impairment. Brain Imaging Behav. 15, 2377–2386. doi: 10.1007/s11682-020-00434-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, F., Wang, P., Zhao, K., Zhou, B., Yao, H., Meng, Q., et al. (2018). Radiomic features of hippocampal subregions in Alzheimer’s disease and amnestic mild cognitive impairment. Front. Aging Neurosci. 10:290. doi: 10.3389/fnagi.2018.00290

PubMed Abstract | CrossRef Full Text | Google Scholar

Fennema-Notestine, C., Hagler, D. J. Jr., McEvoy, L. K., Fleisher, A. S., Wu, E. H., Karow, D. S., et al. (2009). Structural MRI biomarkers for preclinical and mild Alzheimer's disease. Hum. Brain Mapp. 30, 3238–3253. doi: 10.1002/hbm.20744

PubMed Abstract | CrossRef Full Text | Google Scholar

Gillies, R. J., Kinahan, P. E., and Hricak, H. (2016). Radiomics: images are more than pictures, they are data. Radiology 278, 563–577. doi: 10.1148/radiol.2015151169

PubMed Abstract | CrossRef Full Text | Google Scholar

Graves, A. J. S. S. I. W. R. N. N. (2012). Long Short-term Memory. 37–45, Berlin, Heidelberg Springer.

Google Scholar

Gray, K. R., Aljabar, P., Heckemann, R. A., Hammers, A., Rueckert, D., and Initiative, A. S. D. N. (2013). Random forest-based similarity measures for multi-modal classification of Alzheimer's disease. NeuroImage 65, 167–175. doi: 10.1016/j.neuroimage.2012.09.065

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, S. H., Lee, M. A., An, S. S., Ahn, S. W., Youn, Y. C., and Park, K. Y. (2014). Diagnostic value of Alzheimer's disease-related individual structural volume measurements using IBASPM. J. Clin. Neurosci. 21, 2165–2169. doi: 10.1016/j.jocn.2014.03.036

PubMed Abstract | CrossRef Full Text | Google Scholar

Helaly, H. A., Badawy, M., and Haikal, A. Y. (2022). Toward deep MRI segmentation for Alzheimer’s disease detection. Neural Comput. Applic. 34, 1047–1063. doi: 10.1007/s00521-021-06430-8

CrossRef Full Text | Google Scholar

Hosseini-Asl, E., Gimel'farb, G., and El-Baz, A. (2016). Alzheimer's disease diagnostics by a deeply supervised adaptable 3D convolutional network. arXiv preprint arXiv:1607.00556 [Epub ahead of preprint], doi: 10.2741/4606

CrossRef Full Text | Google Scholar

Jack, C. R. Jr., Albert, M. S., Knopman, D. S., McKhann, G. M., Sperling, R. A., Carrillo, M. C., et al. (2011). Introduction to the recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 7, 257–262. doi: 10.1016/j.jalz.2011.03.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Kohavi, R. (1996). "Scaling up the accuracy of naive-bayes classifiers: A decision-tree hybrid", in KDD'96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 202–207.

Google Scholar

Kruthika, K., Maheshappa, H., and Initiative, A. S. D. N. (2019). Multistage classifier-based approach for Alzheimer's disease prediction and retrieval. Inf. Med. Unlocked 14, 34–42. doi: 10.1016/j.imu.2018.12.003

CrossRef Full Text | Google Scholar

Kwak, K., Niethammer, M., Giovanello, K. S., Styner, M., and Dayan, E., Alzheimer's Disease Neuroimaging I (2022). Differential role for hippocampal subfields in Alzheimer's disease progression revealed with deep learning. Cereb. Cortex 32, 467–478. doi: 10.1093/cercor/bhab223

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, C.-Y., Xie, S., Gallagher, P., Zhang, Z., and Tu, Z. (2015). "Deeply-supervised nets", in Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, (pp. 562–570).

Google Scholar

Li, H., Habes, M., Wolk, D. A., Fan, Y., and Initiative, A.s. D. N. (2019). A deep learning model for early prediction of Alzheimer's disease dementia based on hippocampal magnetic resonance imaging data. Alzheimers Dement. 15, 1059–1070. doi: 10.1016/j.jalz.2019.02.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, Y., Huang, K., Xu, H., Qiao, Z., Cai, S., Wang, Y., et al. (2020). Predicting the progression of mild cognitive impairment to Alzheimer's disease by longitudinal magnetic resonance imaging-based dictionary learning. Clin. Neurophysiol. 131, 2429–2439. doi: 10.1016/j.clinph.2020.07.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, S., Jie, C., Zheng, W., Cui, J., and Wang, Z. J. F. I. A. N. (2022). Investigation of underlying association between whole brain regions and Alzheimer’s disease: a research based on an artificial intelligence model. Front. Aging Neurosci. 14:872530. doi: 10.3389/fnagi.2022.872530

CrossRef Full Text | Google Scholar

Liu, M., Li, F., Yan, H., Wang, K., Ma, Y., Shen, L., et al. (2020). A multi-model deep convolutional neural network for automatic hippocampus segmentation and classification in Alzheimer’s disease. NeuroImage 208:116459. doi: 10.1016/j.neuroimage.2019.116459

PubMed Abstract | CrossRef Full Text | Google Scholar

Márquez, F., and Yassa, M. A. (2019). Neuroimaging biomarkers for Alzheimer’s disease. Mol. Neurodegener. 14, 1–14. doi: 10.1186/s13024-019-0325-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Moradi, E., Pepe, A., Gaser, C., Huttunen, H., Tohka, J., and Neuroimaging I, A.'s. D. (2015). Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects. NeuroImage 104, 398–412. doi: 10.1016/j.neuroimage.2014.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Murtagh, F., and Contreras, P. (2012). Algorithms for hierarchical clustering: an overview. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2, 86–97. doi: 10.1002/widm.53

CrossRef Full Text | Google Scholar

Nadal, L., Coupé, P., Helmer, C., Manjon, J. V., Amieva, H., Tison, F., et al. (2020). Differential annualized rates of hippocampal subfields atrophy in aging and future Alzheimer's clinical syndrome. Neurobiol. Aging 90, 75–83. doi: 10.1016/j.neurobiolaging.2020.01.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, H. Y., Suh, C. H., Heo, H., Shim, W. H., and Kim, S. J. (2022). Diagnostic performance of hippocampal volumetry in Alzheimer’s disease or mild cognitive impairment: a meta-analysis. Eur. Radiol. 32, 6979–6991. doi: 10.1007/s00330-022-08838-9

CrossRef Full Text | Google Scholar

Pellegrini, E., Ballerini, L., Hernandez, M., Chappell, F. M., Gonzalez-Castro, V., Anblagan, D., et al. (2018). Machine learning of neuroimaging for assisted diagnosis of cognitive impairment and dementia: a systematic review. Alzheimers Dement. (Amst.) 10, 519–535. doi: 10.1016/j.dadm.2018.07.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, J., Zhu, X., Wang, Y., An, L., and Shen, D. (2019). Structured sparsity regularized multiple kernel learning for Alzheimer's disease diagnosis. Pattern Recogn. 88, 370–382. doi: 10.1016/j.patcog.2018.11.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, X., Zhang, Z., Huang, C., Dehghan, M., Zaiane, O. R., and Jagersand, M. (2020). U2-net: going deeper with nested U-structure for salient object detection. Pattern Recogn. 106:107404. doi: 10.1016/j.patcog.2020.107404

CrossRef Full Text | Google Scholar

Richhariya, B., Tanveer, M., Rashid, A. H., and Initia, A. D. N. (2020). Diagnosis of Alzheimer's disease using universum support vector machine based recursive feature elimination (USVM-RFE). Biomed. Signal Process. Control 59:101903. doi: 10.1016/j.bspc.2020.101903

CrossRef Full Text | Google Scholar

Roth, V. (2004). The generalized LASSO. IEEE Trans. Neural Netw. 15, 16–28. doi: 10.1109/TNN.2003.809398

CrossRef Full Text | Google Scholar

Sarvamangala, D. R., and Kulkarni, R. V. (2022). Convolutional neural networks in medical image understanding: a survey. Evol. Intell. 15, 1–22. doi: 10.1007/s12065-020-00540-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Sevigny, J., Chiao, P., Bussière, T., Weinreb, P. H., Williams, L., Maier, M., et al. (2016). The antibody aducanumab reduces Aβ plaques in Alzheimer’s disease. Nature 537, 50–56. doi: 10.1038/nature19323

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, K.-K., Fripp, J., Mériaudeau, F., Chételat, G., Salvado, O., Bourgeat, P., et al. (2012). Detecting global and local hippocampal shape changes in Alzheimer's disease using statistical shape models. NeuroImage 59, 2155–2166. doi: 10.1016/j.neuroimage.2011.10.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, D., Wu, G., and Suk, H. I. (2017). Deep learning in medical image analysis. Annu. Rev. Biomed. Eng. 19, 221–248. doi: 10.1146/annurev-bioeng-071516-044442

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, S. P., Wang, L., Gupta, S., Goli, H., Padmanabhan, P., and Gulyás, B. (2020). 3D deep learning on medical images: a review. Sensors 20:5097. doi: 10.3390/s20185097

PubMed Abstract | CrossRef Full Text | Google Scholar

Sohail, N., and Anwar, S. M. (2022). A modified U-net based framework for automated segmentation of hippocampus region in brain MRI. IEEE Access. 10, 31201–31209. doi: 10.1109/Access.2022.3159618

CrossRef Full Text | Google Scholar

Sun, J., Yan, S., Song, C., and Han, B. (2020). Dual-functional neural network for bilateral hippocampi segmentation and diagnosis of Alzheimer’s disease. Int. J. Comput. Assist. Radiol. Surg. 15, 445–455. doi: 10.1007/s11548-019-02106-w

CrossRef Full Text | Google Scholar

Van Griethuysen, J. J., Fedorov, A., Parmar, C., Hosny, A., Aucoin, N., Narayan, V., et al. (2017). Computational radiomics system to decode the radiographic phenotype. Cancer Res. 77, e104–e107. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

Yip, S. S., and Aerts, H. J. (2016). Applications and limitations of radiomics. Phys. Med. Biol. 61, R150–R166. doi: 10.1088/0031-9155/61/13/R150

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Yu, C., Jiang, G., Liu, W., and Tong, L. (2012). 3D texture analysis on MRI images of Alzheimer’s disease. Brain Imaging Behav. 6, 61–69. doi: 10.1007/s11682-011-9142-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, K., He, W., Xu, Y., Xiong, G., and Cai, J. (2018). Feature selection and transfer learning for Alzheimer’s disease clinical diagnosis. Appl. Sci. 8:1372. doi: 10.3390/app8081372

CrossRef Full Text | Google Scholar

Zwanenburg, A., Vallières, M., Abdalah, M. A., Aerts, H. J., Andrearczyk, V., Apte, A., et al. (2020). The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295, 328–338. doi: 10.1148/radiol.2020191145

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Alzheimer’s disease, coronal T1 weighted images, machine learning, automatic segmentation, radiomics classification

Citation: Zhou K, Piao S, Liu X, Luo X, Chen H, Xiang R and Geng D (2023) A novel cascade machine learning pipeline for Alzheimer’s disease identification and prediction. Front. Aging Neurosci. 14:1073909. doi: 10.3389/fnagi.2022.1073909

Received: 19 October 2022; Accepted: 30 December 2022;
Published: 16 January 2023.

Edited by:

Yong Liu, Beijing University of Posts and Telecommunications (BUPT), China

Reviewed by:

Thippa Reddy Gadekallu, VIT University, India
Md. Sarwar Hosain, Pabna University of Science and Technology, Bangladesh
Celestine Iwendi, University of Bolton, United Kingdom

Copyright © 2023 Zhou, Piao, Liu, Luo, Chen, Xiang and Geng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Daoying Geng, ✉ ZGFveWluZ2dlbmdAZnVkYW4uZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.