- 1Key Laboratory for NeuroInformation of Ministry of Education, High-Field Magnetic Resonance Brain Imaging Key Laboratory of Sichuan Province, Center for Information in Medicine, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
- 2School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
- 3EEG and Cognition Laboratory, University of A Coruña, A Coruña, Spain
Optimized magnetic resonance imaging (MRI) features and abnormalities of brain network architectures may allow earlier detection and accurate prediction of the progression from mild cognitive impairment (MCI) to Alzheimer's disease (AD). In this study, we proposed a classification framework to distinguish MCI converters (MCIc) from MCI non-converters (MCInc) by using a combination of FreeSurfer-derived MRI features and nodal features derived from the thickness network. At the feature selection step, we first employed sparse linear regression with stability selection, for the selection of discriminative features in the iterative combinations of MRI and network measures. Subsequently the top K features of available combinations were selected as optimal features for classification. To obtain unbiased results, support vector machine (SVM) classifiers with nested cross validation were used for classification. The combination of 10 features including those from MRI and network measures attained accuracies of 66.04, 76.39, 74.66, and 73.91% for mixed conversion time, 6, 12, and 18 months before diagnosis of probable AD, respectively. Analysis of the diagnostic power of different time periods before diagnosis of probable AD showed that short-term prediction (6 and 12 months) achieved more stable and higher AUC scores compared with long-term prediction (18 months), with K-values from 1 to 30. The present results suggest that meaningful predictors composed of MRI and network measures may offer the possibility for early detection of progression from MCI to AD.
Introduction
Mild cognitive impairment (MCI), commonly characterized by slight cognitive deficits but largely intact activities of daily living (Petersen, 2004), is a transitional stage between the healthy aging and dementia. Several studies have suggested that individuals with MCI tend to progress to Alzheimer's disease (AD) at a rate of approximately 10–15% per year (Hänninen et al., 2002; Grundman et al., 2004), while normal controls (NC) develop dementia at a lower rate of 1–2% per year (Bischkopf et al., 2002). In these studies, conversion was considered over the course of 6 months up to a 4-year follow-up period. MCI remains challenging for diagnosis due to the mild symptoms of cognitive impairment, various etiologies and pathologies, and high rates of reversion back to normal. Thus, early detection of MCI individuals who are suffering from a high risk of conversion from MCI to AD is of increasing clinical importance in potentially delaying or preventing the transition from MCI to AD.
Magnetic resonance imaging (MRI) techniques have provided an efficient and non-invasive way to delineate brain atrophy. Recently, several studies have demonstrated that cortical thickness and subcortical volumetry/shape derived from baselines MRI scans can detect patterns of cerebral atrophy in AD (Fan et al., 2008; Lerch et al., 2008; Vemuri et al., 2008; Frisoni et al., 2010; Julkunen et al., 2010), but with that these have limited prediction accuracy of the conversion to AD in MCI patients (Risacher et al., 2009; Cuingnet et al., 2011). The limited sensitivity of MRI biomarkers in predicting the conversion of MCI subjects has prompted researchers to evaluate the combined prognostic value of different biomarkers. Recent findings (Cui et al., 2011; Gomar et al., 2011; Ewers et al., 2012; Westman et al., 2012; Liu et al., 2014) show that the combination of a range of different biomarkers have better predictive power compared with a single biomarker. However, collecting multi-modality data at the same time may not be applicable in practice.
In addition to the raw features obtained from MRI, structural brain network measures, referred to as the anatomical connection pattern between different neuronal elements (He et al., 2009; Jie et al., 2014; Li and Zhao, 2015), provide new insights into the network organization, topology, and complex dynamics of the brain, as well as further understanding of the pathogenesis of neurological disorders (Bullmore and Sporns, 2009; Zalesky et al., 2010). Abnormalities of structural networks have been observed in AD and MCI patients (Stam et al., 2007; He et al., 2008; Yao et al., 2010; Tijms et al., 2013; Zhou and Lui, 2013). Yao and colleagues used thickness cortical networks to study the aberrant brain structures in MCI and report that the nodal centrality in MCI, compared with a NC group, showed decreases in the left lingual gyrus, middle temporal gyrus (MTG), and increases in the precuneus cortex (Yao et al., 2010). Zhou and Lui (2013) also used cortical thickness to detect small-world properties alteration in MCI and reported that MCI converters (MCIc) showed the lowest local efficiency during the conversion period to AD; while the MCI non-converters (MCInc) showed the highest local and global efficiency.
These approaches which used optimized MRI features achieve encouraging accuracies (over 60%). However, few studies analyzed the co-variation of abnormalities in different regions of interest (ROIs), which can be characterized by network patterns and could contribute to reliable and sensitive classification (Dai et al., 2013). Indeed, the pattern of AD pathology is complex and evolves as disease progresses (Fan et al., 2008) and many regions share similar patterns of abnormal brain morphometric. Thus, informative network topology may be potentially useful for classification. In addition, many factors such as the heterogeneity of the MRI images (Eskildsen et al., 2013) and the imbalanced data between groups (Johnstone et al., 2012; Dubey et al., 2014) can also lead to overestimations.
The main objective of the current study was to determine whether the combined use of structural brain measures and thickness network alterations, may improve the accuracy and the sensitivity in identifying prodromal AD. To this end, we proposed a classification framework to distinguish MCIc from MCInc by using a combination of features from FreeSurfer-derived MRI features and nodal parameters derived from thickness network. To obtain predictive nodal information for each individual, we first established a weight network by using a kernel function and then thresholded it to a binary network. Finally, nodal properties were measured at a high discriminative connection cost. At the feature selection step, we first employed sparse linear regression with stability selection for robust feature selection in the iterative combination of MRI and network measures, and then top K features of available combinations were selected as optimal features for classification. To obtain unbiased results, support vector machine (SVM) classifiers with nested cross validation were used for classification. The secondary goal of this study was to measure the impact of different conversion time periods before diagnosis of probable AD, and to evaluate different predictive values between two groups. To that purpose, we homogenized the MCIc images with respect to “time to conversion.” Thus, MCIc patients were subdivided into four groups: mixed for baseline, 6, 12, and 18 months before diagnosis of probable AD. Our hypothesis was that network topological measures might be potentially useful for classification of imminent conversion, and the effective combination of brain morphometric and thickness network measures may improve the prediction of conversion from MCI to AD. Besides, more stable and higher classification accuracy could be obtained for the short-term prediction (6 and 12 months) compared with the long-term prediction (18 months).
Materials and Methods
Participants
Data used in this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial MRI, positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early Alzheimer's disease (AD).
The eligibility criteria for inclusion of subjects are described at: http://adni.loni.usc.edu/wp-content/uploads/2010/09/ADNI_GeneralProceduresManual.pdf. General criteria for MCI were as follows: (1) Mini-Mental-State-Examination (MMSE) scores between 24 and 30 (inclusive), (2) a memory complaint, objective memory loss measured by education adjusted scores on the Wechsler Memory Scale Logical Memory II, (3) a Clinical Dementia Rating (CDR) of 0.5, and (4) absence of significant levels of impairment in other cognitive domains, essentially preserved activities of daily living, and an absence of dementia.
Several studies, which rendered the MCI converters with respect to “time to conversion,” have used baseline MRI scans to predict the conversion, since the MCI patients could convert anytime over the course of 6 months to 4 years. We categorized the MCI patients into converters and non-converters as in Wolz et al. (2011), where non-converters were defined as those that did not have a change of diagnosis within 36 months and the complementary MCI patients constituted the MCIc group. To assess the diagnostic power of different time periods before diagnosis of probable AD, we selected scans at various intervals prior to diagnosis. We selected MCIc scans at 6 (MCIc_m6), 12 (MCIc_m12), and 18 months (MCIc_m18) prior to AD diagnosis. MCIc scans at 24 and 36 months prior to AD diagnosis were excluded from the analysis due to the small samples and large imbalances between the two groups. To evaluate our method in comparison with the method using baseline scans for prediction, we also selected MCIc baseline data (MCIc_mixed) for prediction. Table 1 summarizes the selected MCI patients in our study.
MRI Imaging Acquisition
All scans used in the study were T1-weighted MPRAGE images acquired in 1.5-Tesla MR imaging instruments using a standardized protocol (Jack et al., 2008). Pre-processing images were downloaded from the public ADNI site (adni.loni.usc.edu). The images were preprocessed according to a number of steps detailed in the ADNI website, which contained (1) grad warp correction of image geometry distortion due to gradient non-linearity, (2) B1 non-uniformity processing to correct the image intensity non-uniformity, and (3) N3 processing to reduce residual intensity non-uniformity.
Feature Extraction
MRI Features
The FreeSurfer 5.30 software package was utilized for cortical reconstruction and volumetric segmentation (FreeSurfer v5.30, http://surfer.nmr.mgh.harvard.edu/fswiki). In brief, the processing contains automated Talairach spaces transformation, intensity inhomogeneity correction, removal of non-brain tissue, intensity normalization, tissue segmentation (Fischl et al., 2002), automated topology correction, surface deformation to generate the gray/white matter boundary and gray matter/ Cerebrospinal Fluid (CSF) boundary, and parcellation of the cerebral cortex (Desikan et al., 2006). The quality of the raw MRI images, Talairach registration, intensity normalization, brain segmentation, and surface demarcation were assessed using a manual inspection protocol. The images that failed the stages of quality assurance were removed from subsequent analysis. The atlas used in FreeSurfer included 34 cortical ROIs per hemisphere (Table 2). For each cortical ROI, cortical thickness (CT), cortical volume (CV), and cortical surface area (CS) were calculated as three subtypes of MRI features. CT at each vertex of the cortex was calculated as the average shortest distance between white and pail surfaces. CS was calculated by computing the area of every triangle in a standardized spherical surface tessellation. CV at each vertex was computed by the product of the CS and CT at each surface vertex. This yielded a total of 204 cortical features for each subject (Figure 1A).
Figure 1. Proposed prediction framework. (A) Feature extraction: T1-weigthed images are processed and individual thickness network is constructed based on the difference in cortical thickness of a pair of ROIs. (B) Classification: SVM classifier with nested cross validation is implemented for classification.
Thickness Network Features
Similar to a prior study (Dai et al., 2013), the thickness network matrix Wij(i, j = 1, 2, …, N, here N= 68) for each individual was obtained by calculating the difference in cortical thickness between each pair of regions, and measured using the following kernel, with the weight defined as:
where CTk(i) represents the cortical thickness of i ROI of k subjects, and the kernel width α is 0.01. To simplify the statistical calculation, the thickness network matrix of each individual was thresholded into a binary matrix Bij = [bij], where the bij was 1 if the weight of the two ROIs was larger than the given threshold, and 0 otherwise. The threshold represents the network connection cost, defined as the ratio of the supra-threshold connections relative to the total possible number of connections in the network (Fornito et al., 2010). After applying each threshold, these binary matrices were then used as a basis for the network construction and graph analysis. We analyzed the full range of costs from 8 to 40%, at 1% intervals. The nodal properties were then extracted at a connection cost of 18%, at which the clustering coefficient showed the largest difference between the MCIc_mixed and MCInc groups. Finally, 136 nodal features including nodal path length (NL) and nodal degree (ND) were employed for subsequent analysis (Figure 1A). In brief, for a given node i, nodal path length and nodal degree were defined as follows:
where Lij refers to the minimum number of edges between node pairs i and j, V is the size of a graph, and bij is the connection status between the node pairs i and j. Intuitively, path length Li measures the speed of the message that passes through a given node, and the degree of an individual node ki is equal to the number of links connected to that node, thus reflecting the level of interaction in the network.
Feature Selection
In the current study, as shown in Figure 1, we evaluated 340 features from five different categories (three types of MRI features and two types of network features) for each subject. We implemented the combination in an iterative manner to avoid making an arbitrary choice of the combination. Features were combined in every possible combination. The iteration pattern was described as follows:
where i refers to the type of features, sum refers to the number of total iterative models. A total of 31 combinations were obtained for each diagnostic pair.
In each combination, we applied sparse linear regression for features selection using the L1-norm regularization (Tibshirani, 1996). Let be a n×m matrix that represents m features of n samples, be a n dimensional corresponding classification labels (yi = 1 for MCIc and yi = −1 for MCInc). The linear regression model was defined as follows:
where and denotes the regression coefficient vector and the predicted label vector. One approach is to estimate the w by minimizing the following objective function:
where λ > 0 is a regularization parameter which controls the sparsity of the model, i.e., many of the entries of w are zero, and ||w||1 is the L1-norm of w, which is defined as . In this study, the SLEP package (Liu et al., 2009) was used for solving sparse linear regression. To address the problem of proper regularization we applied the stability selection using subsampling or bootstrapping (Meinshausen and Bühlmann, 2010) for robust feature selection. For each combination, we selected the top K (K = 10) features for subsequent analysis. After feature selection of each combination, the likelihood for a feature index being selected in the combinations was calculated as follows:
where sum is the number of combinations, l is the features index and sf is a binary function determining if l is selected in a combination. is an expression of how often a feature is included among all combinations. Finally, the top K features were selected for classification.
Classification
For the selected features, the SVM classifier was implemented using the LIBSVM toolbox (Chang and Lin, 2011), with radial basis function (RBF) and an optimal value for the penalized coefficient C (a constant determining the tradeoff between training error and model flatness). The RBF kernel was defined as follows:
where x1, x2 are the two feature vectors and σ controls the width of the RBF kernel. In order to obtain an unbiased estimation and select the optimal SVM model, a nested cross validation (CV) was employed. For a training set, we selected the optimal hyperparameters (C and σ) through a grid-search and a 10-fold CV (inner CV). The outer CV that we used was the leave-one-out cross validation (LOOCV). In each fold of the outer CV, one sample was kept out for validation and the remaining were used for feature selection and training the classifier; then the performance of the training classifier was evaluated using the held-out sample. This run was repeated until all the subjects were excluded. The pipeline of our classification framework is presented in Figure 1. To evaluate the quality of the classification, we report four established measures: accuracy, sensitivity, specificity, and area under the curve (AUC). These measures were defined as follows:
where TP, TN, FP, FN denote true positive, true negative, false positive, and false negative, respectively. Following a common convention, we considered a correctly predicted MCIc as a true positive.
Results
The LOOCV results of classification and receiver operating characteristic curves (ROCs) are depicted in Table 3 and Figure 2A. For the MCInc vs. MCIc_mixed model, the proposed method achieved a classification accuracy of 66.04% (sensitivity = 55.26%, specificity = 75.90%, AUC = 0.7346). For classifying MCIc_m6 from MCInc, combining the MRI with network measures, resulted in a higher accuracy of 76.39% (sensitivity = 65.57%, specificity = 84.34%, AUC = 0.8130). Specifically, we obtained slightly lower levels of accuracies for 12 and 18 months (74.66 and 73.91%, respectively) compared to the classification of MCInc vs. MCIc_m6.
Figure 2. ROC curves for the four diagnostic pairs using (A) top 10 combined features, (B) top 10 MRI features, and (C) top 10 network features.
By using the top 10 combined features, the features most often selected by the sparse linear regression with the stability selection, we achieved AUC scores in a range between 0.7346 and 0.8130. The features selected (listed in Table 4) show roughly similar features among four diagnostic pairs and include the left inferior parietal cortex (IPC), left frontal pole, left precuneus cortex, left postcentral gyrus, left entorhinal cortex, left MTG, left banks superior temporal sulcus, right caudal middle frontal gyrus, right supramarginal gyrus, right posterior cingulate cortex, right isthmus of the cingulate cortex, and right lingual gyrus. These selected regions have been shown to be related with MCI conversion (Chételat et al., 2005; Fan et al., 2008; Misra et al., 2009; Risacher et al., 2009; Yao et al., 2010; Cai et al., 2015; Kandiah et al., 2015). Moreover, note that nearly all involved network features included the nodal degree (ND).
Table 4. Top 10 combined features selected by the sparse linear regression with stability selection in the LOOCV experiments.
To demonstrate the impact of the number of selected features, we conducted the classification using the top K combined features for K = 1, 2, …, 30. The classification performances and AUC scores are depicted in Supplementary Table 1 and Figure 3, respectively. As shown in Figure 3, the AUC stabilizes after the top 12–15 features are included and the best classification results are observed in the classification of MCInc vs. MCIc_m6 and MCInc vs. MCIc_m12.
To examine the added benefit of the network measures, we applied the sparse linear regression with the stability selection to either the MRI or the network measures. The classifier model performances and ROCs are depicted in Table 5 and Figure 2. As shown in Table 5, MRI achieved the best AUC scores (0.8002 for MCInc vs. MCIc_m6), while the network biomarkers performed slightly worse (AUC = 0.6974, 0.6006, 0.7481, 0.6140, for mixed, 6, 12, and 18 months before diagnosis of probable AD, respectively). The top 10 MRI and network features are listed in Supplementary Tables 2, 3. Note that most items in Table 4 and Supplementary Tables 2, 3 match, and that several cortical surface area (CS) features were included in the classifier, only when the signal MRI was used for prediction.
Discussion
In this study, we established an efficient MCI conversion classification framework using a combination of MRI and network measures. The increased prediction accuracies that we observed suggest that it may be possible to identify conversion from MCI to AD using the combination of MRI and network measures. Moreover, the homogenization of the MCIc sub-groups showed improved classification of the short-term prediction, yielding a more consistent pattern of cortical neurodegeneration.
Our findings show (Tables 3, 5) that the combination of MRI and thickness network measures outperforms either MRI or network measures alone, in the prediction of conversion from MCI to AD. In addition, the results showed that brain morphometric was a better predictor compared with thickness network measures, suggesting abnormalities may exist across different ROIs during the conversion period to AD. Moreover, the increased predictive power of the combined classification methodology suggests that a co-variation of the abnormalities across different regions is necessary for the detection of the early transition from MCI to AD. Without requiring new sources of information, our prediction AUCs are in line with previous studies (Cui et al., 2011; Ye et al., 2012; Eskildsen et al., 2013; Raamana et al., 2015), which used multivariate biomarkers including thickness, thickness network, CSF, and cognitive measures. Cui et al. (2011) showed that with a combination of MRI, CSF, neuropsychological and functional measures (NMs), MCInc vs. MCIc were classified with an AUC of 0.796 at baseline. However, the specificity that was achieved was under 50% (48.28%), despite adding CSF and five NMs measures that have been thought to be useful in conversion prediction. On the other hand, Ye et al. (2012) who used a spare logistic regression with stability selection and a combination of 15 features including MRI, APOE gene, and cognitive measures, achieved the best reported classification results to date with an AUC of 0.8587 (Ye et al., 2012). Our results demonstrate slightly lower accuracy levels, but we only used one source of information and a smaller number of selected features. In addition, obtaining CSF and APOE gene measures may not be applicable for some subjects, and thus make be difficult to obtain during data integration. Eskildsen et al. (2013) have also distinguished MCIc from MCInc at various intervals prior to diagnosis, with AUC scores of 0.809 and 0.762 for MCIc_m6 and MCIc_m12, respectively. Raamana et al. (2015) achieved an AUC of 0.680 using a novel approach that utilizes thickness network fusion measures for the prediction of MCI conversion. Classification results are summarized in Table 6.
Importantly, the stability selection provides a small subset of discriminative patterns (see Table 4 and Supplementary Tables 2, 3) for effective and efficient screens. Our findings showed that most of the MRI features in the top 10 combined features were cortical thickness and volume. The consistent features that were included in most pairs with a high frequency were the cortical thickness and volume of the left IPC; and the cortical volume of the left MTG and of the right supramarginal gyrus (SMG), suggesting that abnormities in these regions may be important predictors of conversion (Chételat et al., 2005; Pennanen et al., 2005; Fan et al., 2008; Karas et al., 2008; Whitwell et al., 2008; Desikan et al., 2009; Schroeter et al., 2009; Li et al., 2011; Wang et al., 2016). Additionally, we found that the features selected were predominately in the left hemisphere (Table 4). The potential asymmetry is possible related to the disease progression, since the pattern of atrophy in AD was fairly symmetric (Fan et al., 2008). Besides, the selected ROIs were functionally associated with episodic memory (MTG, IPC) and attention (posterior cingulate cortex). Other features that were included were the nodal degree of the left MTG, the right lingual gyrus (LING) and the left postcentral gyrus (PoCG). Previous studies have found that subjects with MCI have abnormal network patterns in the LING and MTG (Yao et al., 2010). In addition, He and colleagues demonstrated an abnormal correlation between bilateral PoCG in AD (He et al., 2008). Moreover, the ROIs selected showed a small overlap between MRI and thickness network, suggesting that informative co-variation of the abnormalities may provide complementary information for classification. Together, our results suggest that changes in the cortical regions may be associated with mechanisms underlying the conversion of MCI to AD, and structural network architecture can be a potential predictor for the classification of imminent conversion.
The classification performances obtained for the MCIc sub-groups showed an improvement when time-homogenization was utilized, which was in line with a previous study (Eskildsen et al., 2013). We found that short-term prediction (6 and 12 months follow up) showed slightly better performances compared with long-term prediction of 18 months (Figure 3). The likelihood for MCIc subjects to be accurately predicted increased with the reduction of conversion prior diagnosis. The small overlap in brain atrophy and network topology, we believe, is the primary reason for improving short-term predictions. Additionally, the relatively low sensitivity for MCInc vs. MCInc_m18 possibly due to the small sample size available to construct the long-term (18 months) classifier model.
On the other hand, we investigates whether the number of features selected influences the classification results. Overall, we found that the AUC scores stabilized after the top nine features were added to the classifier model for the 6 and 12 months follow up. In contrast, for the 18 months follow up, the AUC values increased when the number of selected features was increased, and a strong relationship was observed in the classification of MCInc vs. MCIc_mixed. The stable performances that were observed for the short-term predictions may be attributed to mechanisms associated with the conversion to AD, suggesting more consistent patterns of abnormalities in brain atrophy and network features. The effect of the homogenization of the MCIc patients reveals that predictions are superior when subjects display variable time periods to conversion. Specifically, compared to combined MRI and network features, the top 10 MRI features showed similar performances for short-term predictions, suggesting that the abnormal brain atrophy patterns are strong predictors for short-term prediction. For MCIC_m18 prediction, the sensitivity increased by 25% and the AUC increased by 4%, when we used combined feature sets compared with MRI measures alone, which may indicate that these classes of measures provide complementary information for diagnostic classification. Therefore, informative structural network measures could be potentially useful for classification, especially at the early stage of impairment.
This study has several limitations. One limitation is that there is no consensus regarding the time boundary for MCI converters and MCI non-converters. Another limitation related to network features, is whether the extracted network features reflect characteristics related to AD in an integral and accurate manner. Although several studies (Stam et al., 2007, 2009; He et al., 2008; Yao et al., 2010; Shu et al., 2012; Zhao et al., 2012; Tijms et al., 2014) show that AD and MCI are associated with changes in network properties, there is little agreement about the nature of these changes. Another drawback is that the accuracy of some discriminant classifiers should be interpreted with caution. Future studies are warranted where larger samples and more advanced fusion methods, using more than just node quantitative measurements, may limit overestimation and may overcome direct comparison. Moreover, further studies are needed in order to examine the diagnostic power of the relationship between structural and functional connectivity abnormalities in MCI sub-groups.
Conclusion
This study investigated the diagnostic power of the combination of MRI and thickness network measures derived from structural MRI to distinguish individuals with MCIc from MCInc. Without requiring new sources of information, our approach shows that the effective combination of MRI and thickness network measures improves the discrimination between MCIc and MCInc, compared with the use of either MRI or network measures separately. Moreover, the selected features are interpretable and are in line with previous findings, and the similar spatial patterns of brain morphometric and structural network alterations are shared among the four groups that we examined. By using longitudinal measures, we also found that short-term prediction shows more stable and better performances compared with long-term prediction. Together, our study provides a new insight into the prediction of MCI to AD conversion, and revealed that structural connectivity is a potential predictor for classification of imminent conversion.
Author Contributions
RW was in charge of the data analysis and manuscript writing. CL helped in speeding up the data analysis. LL helped in calculation and manuscript writing. NF helped was in charge of manuscript verifying. All authors reviewed the manuscript.
Funding
This research was supported by grants from NSFC (Nos. 61203363 and 61473062), Spanish Ministry of Science and Innovation (PSI2012-34212), the Ramón y Cajal national fellowship program, the 111 project (B12027), and the Fundamental Research Funds for the Central Universities.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Data collection and sharing for this project was funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
Supplementary Material
The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fnagi.2016.00076
References
Bischkopf, J., Busse, A., and Angermeyer, M. C. (2002). Mild cognitive impairment–a review of prevalence, incidence and outcome according to current approaches. Acta Psychiatr. Scand. 106, 403–414. doi: 10.1034/j.1600-0447.2002.01417.x
Bullmore, E., and Sporns, O. (2009). Complex brain networks: graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 10, 186–198. doi: 10.1038/nrn2575
Cai, S., Huang, L., Zou, J., Jing, L., Zhai, B., Ji, G., et al. (2015). Changes in thalamic connectivity in the early and late stages of amnestic mild cognitive impairment: a resting-state functional magnetic resonance study from ADNI. PLoS ONE 10:e0115573. doi: 10.1371/journal.pone.0115573
Chang, C. C., and Lin, C. J. (2011). LIBSVM: a library for support vector machines. Acm Trans. Intell. Syst. Technol. 2, 2–27. doi: 10.1145/1961189.1961199
Chételat, G., Landeau, B., Eustache, F., Mézenge, F., Viader, F., De La Sayette, V., et al. (2005). Using voxel-based morphometry to map the structural changes associated with rapid conversion in MCI: a longitudinal MRI study. Neuroimage 27, 934–946. doi: 10.1016/j.neuroimage.2005.05.015
Cui, Y., Liu, B., Luo, S., Zhen, X., Fan, M., Liu, T., et al. (2011). Identification of conversion from mild cognitive impairment to Alzheimer's disease using multivariate predictors. PLoS ONE 6:e21896. doi: 10.1371/journal.pone.0021896
Cuingnet, R., Gerardin, E., Tessieras, J., Auzias, G., Lehéricy, S., Habert, M. O., et al. (2011). Automatic classification of patients with Alzheimer's disease from structural MRI: a comparison of ten methods using the ADNI database. Neuroimage 56, 766–781. doi: 10.1016/j.neuroimage.2010.06.013
Dai, D., He, H. G., Vogelstein, J. T., and Hou, Z. G. (2013). Accurate prediction of AD patients using cortical thickness networks. Mach. Vision Appl. 24, 1445–1457. doi: 10.1007/s00138-012-0462-0
Desikan, R. S., Cabral, H. J., Fischl, B., Guttmann, C. R., Blacker, D., Hyman, B. T., et al. (2009). Temporoparietal MR imaging measures of atrophy in subjects with mild cognitive impairment that predict subsequent diagnosis of Alzheimer disease. AJNR Am. J. Neuroradiol. 30, 532–538. doi: 10.3174/ajnr.A1397
Desikan, R. S., Segonne, F., Fischl, B., Quinn, B. T., Dickerson, B. C., Blacker, D., et al. (2006). An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980. doi: 10.1016/j.neuroimage.2006.01.021
Dubey, R., Zhou, J., Wang, Y., Thompson, P. M., and Ye, J. (2014). Analysis of sampling techniques for imbalanced data: an n = 648 ADNI study. Neuroimage 87, 220–241. doi: 10.1016/j.neuroimage.2013.10.005
Eskildsen, S. F., Coupé, P., García-Lorenzo, D., Fonov, V., Pruessner, J. C., Collins, D. L., et al. (2013). Prediction of Alzheimer's disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning. Neuroimage 65, 511–521. doi: 10.1016/j.neuroimage.2012.09.058
Ewers, M., Walsh, C., Trojanowski, J. Q., Shaw, L. M., Petersen, R. C., Jack, C. R. Jr., et al. (2012). Prediction of conversion from mild cognitive impairment to Alzheimer's disease dementia based upon biomarkers and neuropsychological test performance. Neurobiol. Aging 33, 1203–1214. doi: 10.1016/j.neurobiolaging.2010.10.019
Fan, Y., Batmanghelich, N., Clark, C. M., and Davatzikos, C. (2008). Spatial patterns of brain atrophy in MCI patients, identified via high-dimensional pattern classification, predict subsequent cognitive decline. Neuroimage 39, 1731–1743. doi: 10.1016/j.neuroimage.2007.10.031
Fischl, B., Salat, D. H., Busa, E., Albert, M., Dieterich, M., Haselgrove, C., et al. (2002). Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron 33, 341–355. doi: 10.1016/S0896-6273(02)00569-X
Fornito, A., Zalesky, A., and Bullmore, E. T. (2010). Network scaling effects in graph analytic studies of human resting-state FMRI data. Front. Syst. Neurosci. 4:22. doi: 10.3389/fnsys.2010.00022
Frisoni, G. B., Fox, N. C., Jack, C. R. Jr., Scheltens, P., and Thompson, P. M. (2010). The clinical use of structural MRI in Alzheimer disease. Nat. Rev. Neurol. 6, 67–77. doi: 10.1038/nrneurol.2009.215
Gomar, J. J., Bobes-Bascaran, M. T., Conejero-Goldberg, C., Davies, P., and Goldberg, T. E. (2011). Utility of combinations of biomarkers, cognitive markers, and risk factors to predict conversion from mild cognitive impairment to Alzheimer disease in patients in the Alzheimer's disease neuroimaging initiative. Arch. Gen. Psychiatry 68, 961–969. doi: 10.1001/archgenpsychiatry.2011.96
Grundman, M., Petersen, R. C., Ferris, S. H., Thomas, R. G., Aisen, P. S., Bennett, D. A., et al. (2004). Mild cognitive impairment can be distinguished from Alzheimer disease and normal aging for clinical trials. Arch. Neurol. 61, 59–66. doi: 10.1001/archneur.61.1.59
Hänninen, T., Hallikainen, M., Tuomainen, S., Vanhanen, M., and Soininen, H. (2002). Prevalence of mild cognitive impairment: a population-based study in elderly subjects. Acta Neurol. Scand. 106, 148–154. doi: 10.1034/j.1600-0404.2002.01225.x
He, Y., Chen, Z., and Evans, A. (2008). Structural insights into aberrant topological patterns of large-scale cortical networks in Alzheimer's disease. J. Neurosci. 28, 4756–4766. doi: 10.1523/JNEUROSCI.0141-08.2008
He, Y., Chen, Z., Gong, G., and Evans, A. (2009). Neuronal networks in Alzheimer's disease. Neuroscientist 15, 333–350. doi: 10.1177/1073858409334423
Jack, C. R. Jr., Bernstein, M. A., Fox, N. C., Thompson, P., Alexander, G., Harvey, D., et al. (2008). The Alzheimer's Disease Neuroimaging Initiative (ADNI): MRI methods. J. Magn. Reson. Imaging 27, 685–691. doi: 10.1002/jmri.21049
Jie, B., Zhang, D., Wee, C. Y., and Shen, D. (2014). Topological graph kernel on multiple thresholded functional connectivity networks for mild cognitive impairment classification. Hum. Brain Mapp. 35, 2876–2897. doi: 10.1002/hbm.22353
Johnstone, D., Milward, E. A., Berretta, R., and Moscato, P. (2012). Multivariate protein signatures of pre-clinical Alzheimer's disease in the Alzheimer's disease neuroimaging initiative (ADNI) plasma proteome dataset. PLoS ONE 7:e34341. doi: 10.1371/journal.pone.0034341
Julkunen, V., Niskanen, E., Koikkalainen, J., Herukka, S. K., Pihlajamäki, M., Hallikainen, M., et al. (2010). Differences in cortical thickness in healthy controls, subjects with mild cognitive impairment, and Alzheimer's disease patients: a longitudinal study. J. Alzheimers Dis. 21, 1141–1151. doi: 10.3233/JAD-2010-100114
Kandiah, N., Chander, R. J., Ng, A., Wen, M. C., Cenina, A. R., and Assam, P. N. (2015). Association between white matter hyperintensity and medial temporal atrophy at various stages of Alzheimer's disease. Eur. J. Neurol. 22, 150–155. doi: 10.1111/ene.12546
Karas, G., Sluimer, J., Goekoop, R., Van Der Flier, W., Rombouts, S. A., Vrenken, H., et al. (2008). Amnestic mild cognitive impairment: structural MR imaging findings predictive of conversion to Alzheimer disease. AJNR Am. J. Neuroradiol. 29, 944–949. doi: 10.3174/ajnr.A0949
Lerch, J. P., Pruessner, J., Zijdenbos, A. P., Collins, D. L., Teipel, S. J., Hampel, H., et al. (2008). Automated cortical thickness measurements from MRI can accurately separate Alzheimer's patients from normal elderly controls. Neurobiol. Aging 29, 23–30. doi: 10.1016/j.neurobiolaging.2006.09.013
Li, C., Wang, J., Gui, L., Zheng, J., Liu, C., and Du, H. (2011). Alterations of whole-brain cortical area and thickness in mild cognitive impairment and Alzheimer's disease. J. Alzheimers Dis. 27, 281–290. doi: 10.3233/JAD-2011-110497
Li, L., and Zhao, D. (2015). Age-related inter-region EEG coupling changes during the control of bottom-up and top-down attention. Front. Aging Neurosci. 7:223. doi: 10.3389/fnagi.2015.00223
Liu, F., Wee, C. Y., Chen, H., and Shen, D. (2014). Inter-modality relationship constrained multi-modality multi-task feature selection for Alzheimer's Disease and mild cognitive impairment identification. Neuroimage 84, 466–475. doi: 10.1016/j.neuroimage.2013.09.015
Liu, J., Ji, S. W., and Ye, J. P. (2009). SLEP: Sparse Learning with Efficient Projections: Arizona State University [Online]. Available online at: http://www.public.asu.edu/~jye02/Software/SLEP
Meinshausen, N., and Bühlmann, P. (2010). Stability selection. J. R. Stat. Soc. Ser. B Stat. Methodol. 72, 417–473. doi: 10.1111/j.1467-9868.2010.00740.x
Misra, C., Fan, Y., and Davatzikos, C. (2009). Baseline and longitudinal patterns of brain atrophy in MCI patients, and their use in prediction of short-term conversion to AD: results from ADNI. Neuroimage 44, 1415–1422. doi: 10.1016/j.neuroimage.2008.10.031
Pennanen, C., Testa, C., Laakso, M. P., Hallikainen, M., Helkala, E. L., Hänninen, T., et al. (2005). A voxel based morphometry study on mild cognitive impairment. J. Neurol. Neurosurg. Psychiatry 76, 11–14. doi: 10.1136/jnnp.2004.035600
Petersen, R. C. (2004). Mild cognitive impairment as a diagnostic entity. J. Intern. Med. 256, 183–194. doi: 10.1111/j.1365-2796.2004.01388.x
Raamana, P. R., Weiner, M. W., Wang, L., and Beg, M. F. (2015). Thickness network features for prognostic applications in dementia. Neurobiol. Aging 36(Suppl. 1), S91–S102. doi: 10.1016/j.neurobiolaging.2014.05.040
Risacher, S. L., Saykin, A. J., West, J. D., Shen, L., Firpi, H. A., and McDonald, B. C. (2009). Baseline MRI predictors of conversion from MCI to probable AD in the ADNI cohort. Curr. Alzheimer Res. 6, 347–361. doi: 10.2174/156720509788929273
Schroeter, M. L., Stein, T., Maslowski, N., and Neumann, J. (2009). Neural correlates of Alzheimer's disease and mild cognitive impairment: a systematic and quantitative meta-analysis involving 1351 patients. Neuroimage 47, 1196–1206. doi: 10.1016/j.neuroimage.2009.05.037
Shu, N., Liang, Y., Li, H., Zhang, J., Li, X., Wang, L., et al. (2012). Disrupted topological organization in white matter structural networks in amnestic mild cognitive impairment: relationship to subtype. Radiology 265, 518–527. doi: 10.1148/radiol.12112361
Stam, C. J., De Haan, W., Daffertshofer, A., Jones, B. F., Manshanden, I., Van Cappellen Van Walsum, A. M., et al. (2009). Graph theoretical analysis of magnetoencephalographic functional connectivity in Alzheimer's disease. Brain 132, 213–224. doi: 10.1093/brain/awn262
Stam, C. J., Jones, B. F., Nolte, G., Breakspear, M., and Scheltens, P. (2007). Small-world networks and functional connectivity in Alzheimer's disease. Cereb. Cortex 17, 92–99. doi: 10.1093/cercor/bhj127
Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B Methodol. 58, 267–288.
Tijms, B. M., Wink, A. M., De Haan, W., Van Der Flier, W. M., Stam, C. J., Scheltens, P., et al. (2013). Alzheimer's disease: connecting findings from graph theoretical studies of brain networks. Neurobiol. Aging 34, 2023–2036. doi: 10.1016/j.neurobiolaging.2013.02.020
Tijms, B. M., Yeung, H. M., Sikkes, S. A., Möller, C., Smits, L. L., Stam, C. J., et al. (2014). Single-subject gray matter graph properties and their relationship with cognitive impairment in early- and late-onset Alzheimer's disease. Brain Connect. 4, 337–346. doi: 10.1089/brain.2013.0209
Vemuri, P., Gunter, J. L., Senjem, M. L., Whitwell, J. L., Kantarci, K., Knopman, D. S., et al. (2008). Alzheimer's disease diagnosis in individual subjects using structural MR images: validation studies. Neuroimage 39, 1186–1197. doi: 10.1016/j.neuroimage.2007.09.073
Wang, M., Yang, P., Zhao, Q. J., Wang, M., Jin, Z., and Li, L. (2016). Differential preparation intervals modulate repetition processes in task switching: an ERP study. Front. Hum. Neurosci. 10:57. doi: 10.3389/fnhum.2016.00057
Westman, E., Muehlboeck, J. S., and Simmons, A. (2012). Combining MRI and CSF measures for classification of Alzheimer's disease and prediction of mild cognitive impairment conversion. Neuroimage 62, 229–238. doi: 10.1016/j.neuroimage.2012.04.056
Whitwell, J. L., Shiung, M. M., Przybelski, S. A., Weigand, S. D., Knopman, D. S., Boeve, B. F., et al. (2008). MRI patterns of atrophy associated with progression to AD in amnestic mild cognitive impairment. Neurology 70, 512–520. doi: 10.1212/01.wnl.0000280575.77437.a2
Wolz, R., Julkunen, V., Koikkalainen, J., Niskanen, E., Zhang, D. P., Rueckert, D., et al. (2011). Multi-method analysis of MRI images in early diagnostics of Alzheimer's disease. PLoS ONE 6:e25446. doi: 10.1371/journal.pone.0025446
Yao, Z., Zhang, Y., Lin, L., Zhou, Y., Xu, C., Jiang, T., et al. (2010). Abnormal cortical networks in mild cognitive impairment and Alzheimer's disease. PLoS Comput. Biol. 6:e1001006. doi: 10.1371/journal.pcbi.1001006
Ye, J., Farnum, M., Yang, E., Verbeeck, R., Lobanov, V., Raghavan, N., et al. (2012). Sparse learning and stability selection for predicting MCI to AD conversion using baseline ADNI data. BMC Neurol. 12:46. doi: 10.1186/1471-2377-12-46
Zalesky, A., Fornito, A., and Bullmore, E. T. (2010). Network-based statistic: identifying differences in brain networks. Neuroimage 53, 1197–1207. doi: 10.1016/j.neuroimage.2010.06.041
Zhao, X., Liu, Y., Wang, X., Liu, B., Xi, Q., Guo, Q., et al. (2012). Disrupted small-world brain networks in moderate Alzheimer's disease: a resting-state FMRI study. PLoS ONE 7:e33540. doi: 10.1371/journal.pone.0033540
Keywords: mild cognitive impairment, MRI, structural network, prediction, early detection
Citation: Wei R, Li C, Fogelson N, Li L for the Alzheimer's Disease Neuroimaging Initiative (2016) Prediction of Conversion from Mild Cognitive Impairment to Alzheimer's Disease Using MRI and Structural Network Features. Front. Aging Neurosci. 8:76. doi: 10.3389/fnagi.2016.00076
Received: 11 January 2016; Accepted: 29 March 2016;
Published: 19 April 2016.
Edited by:
Junfeng Sun, Shanghai Jiao Tong University, ChinaReviewed by:
Ramesh Kandimalla, Emory University, USAYingying Tang, Shanghai Mental Health Center, China
Copyright © 2016 Wei, Li, Fogelson, Li for the Alzheimer's Disease Neuroimaging Initiative. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ling Li, bGlsaW5nQHVlc3RjLmVkdS5jbg==
†Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf