- 1School of Artificial Intelligence, Beijing Normal University, Beijing, China
- 2Engineering Research Center of Intelligent Technology and Educational Application, Beijing Normal University, Beijing, China
- 3State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China
- 4Beijing Key Laboratory of Brain Imaging and Connectomics, Beijing Normal University, Beijing, China
Structural magnetic resonance imaging (MRI) features have played an increasingly crucial role in discriminating patients with Alzheimer's disease (AD) and mild cognitive impairment (MCI) from normal controls (NC). However, the large number of structural MRI studies only extracted low-level neuroimaging features or simply concatenated multitudinous features while ignoring the interregional covariate information. The appropriate representation and integration of multilevel features will be preferable for the precise discrimination in the progression of AD. In this study, we proposed a novel inter-coupled feature representation method and built an integration model considering the two-level (the regions of interest (ROI) level and the network level) coupled features based on structural MRI data. For the intra-coupled interactions about the network-level features, we performed the ROI-level (intra- and inter-) coupled interaction within each network by feature expansion and coupling learning. For the inter-coupled interaction of the network-level features, we measured the coupled relationships among different networks via Canonical correlation analysis. We evaluated the classification performance using coupled feature representations on the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Results showed that the coupled integration model with hierarchical features achieved the optimal classification performance with an accuracy of 90.44% for AD and NC groups, with an accuracy of 87.72% for the MCI converter (MCI-c) and MCI non-converter (MCI-nc) groups. These findings suggested that our two-level coupled interaction representation of hierarchical features has been the effective means for the precise discrimination of MCI-c from MCI-nc groups and, therefore, helpful in the characterization of different AD courses.
Introduction
Alzheimer's disease (AD) is one of the most severe neurodegenerative dementias in the elderly, and mild cognitive impairment (MCI) is a prodromal stage with a higher risk of progression to AD in patients with MCI than normal controls (NC) (Dona et al., 2016; Arbabshirani et al., 2017; Zhang et al., 2021a). Neuroimaging techniques provide objective and effective tools to study the human brain and have been widely used in the diagnosis of AD or MCI from NC. Neuroimaging studies have shown remarkable structural and functional alterations in the human brain during the course of AD (Vemuri and Jack, 2010; Rathore et al., 2017; Leandrou et al., 2018). Structural magnetic resonance imaging (MRI) studies have extracted hierarchical features [the voxel level, the regions of interest (ROI) level, or the network level] as explicit variables to discriminate AD and MCI from NC. However, a large number of structural MRI studies only extracted low-level features, or simply concatenated multitudinous features, while ignoring the interregional covariate information among features (Anstey and Maller, 2003; Yang et al., 2011; Moradi et al., 2015; Hu et al., 2016; Rathore et al., 2017; Rondina et al., 2018), so they cannot fully exploit the latent and complex information integrated with hierarchical features. The effective feature representations help to enhance the performance of classification. Therefore, the appropriate representation and integration of multilevel features will be preferable for the precise discrimination of AD, MCI, and NC.
Based on structural MRI data, researchers extracted hierarchical imaging features, such as the gray matter (GM) density as the voxel-level features (Moradi et al., 2015; Zeifman et al., 2015), the average gray matter volume (GMV) of brain regions as the ROI-level features (Shi et al., 2014; Hu et al., 2016), or the independent components (ICs) from independent component analysis (ICA) as brain network-level features (Yang et al., 2011). Moradi et al. considered the smoothed GM density from structural MRI data as voxel-level features for AD conversion prediction in subjects with MCI (Moradi et al., 2015). In comparison with the voxel-level features with redundant information but expensive computation, ROI-level features significantly reduce the dimensionality of brain imaging data by uniting the structural adjacent voxels. The GMV from different ROIs has been applied as an independent variable to investigate the predictive power for distinguishing AD with MCI (Zhang et al., 2011) and classifying AD from NC (Rondina et al., 2018). ICA is a data-driven approach that decomposes the whole-brain voxel-vise information into a few maximally independent components based on inter-regional covariance relationships. The brain GM networks obtained from ICA have been considered as brain network-level features to differentiate individuals with AD and NCs, thus providing new avenues for the network-level features in AD classification (Yang et al., 2011; Wei et al., 2016). However, it has been noted that ROI-level features in the same network exhibited more complicated regional dependencies than those in different brain networks (Liu et al., 2017a; Rathore et al., 2017; Filippi et al., 2020; Feng et al., 2021). Nevertheless, the aforementioned studies mostly constructed classification models using the single level of features separately while neglecting the complex interaction relationships among multilevel features.
There were explicit and hidden coupled interactions, much more abundant than simple linear correlation among attributes or features of objects in many domains, like the recommender systems (Wang and Cao, 2020; Zhang et al., 2021b), outlier detection (Pang et al., 2016), and pieces of neuroscience research (Shi et al., 2014, 2015, 2020). Many coupled analysis models were proposed to analyze the explicit and hidden couplings and revealed the non-independent and identical distribution (non-IIDness) characteristics for different data types (Wang et al., 2013, 2015a,b). For numerical data, Wang et al. detailed the intra-coupled interaction to capture the correlations between a feature and its own expanded powers and the inter-coupled interaction to quantify the interactive relationships among each feature and the expanded powers of the other features (Wang et al., 2013). A few imaging studies investigated AD classification with the coupling characteristics of the ROI-level features (Shi et al., 2014, 2020). Although such studies demonstrated high accuracy for AD, MCI, and NC classifications with coupled feature analysis, they still weakened or overlooked the coupled relationship at network-level features. The ROI-level features within the same network strongly interacted with each other (Brickman et al., 2007). Different brain networks collaborated with each other and carried explicit or implicit relationships (Betzel et al., 2014; Zuo et al., 2017). Consequently, greater effort should be focused on designing an appropriate coupled interaction model to integrate the ROI-level and network-level coupling relationships.
To integrate the intrinsic coupling relationships of the ROI-level and network-level features from structural MRI data, we proposed a novel inter-coupled feature representation method for the network-level features and built a two-level (the ROI level and the network level) coupled feature integration model for AD, MCI, and NC classification. For the intra-coupled interactions about the network-level features, we performed the ROI-level (intra- and inter-) coupled interaction within each network by feature expansion and coupling learning. For the inter-coupled interaction of the network-level features, we introduced the measurement of the coupled relationships among different networks via Canonical correlation analysis (CCA). We compared the identification performances in AD, MCI, and NC classification with different feature representation models. We hypothesized that two-level (the ROI level and the network level) coupled feature integration models would achieve better or comparable AD classification performance.
Materials and Methods
Participants
This study included 121 patients with AD and 120 NC subjects, and 126 MCI converters (MCI-c) and 108 MCI non-converters (MCI-nc), with baseline structural MRI data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The up-to-date information on ADNI's general inclusion criteria is described at www.adni-info.org. Briefly, the subjects were between 55 and 90 years of age. General group inclusion/exclusion criteria were as follows: (1) NC subjects: Mini-Mental State Examination (MMSE) scores between 26 and 30, a Clinical Dementia Rating (CDR) score of 0, non-depressed, non-MCI, and non-demented; (2) AD subjects: MMSE scores <26, a CDR score of 0.5 or 1, and met the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer's Disease and Related Disorders Association (NINCDS/ADRDA) criteria for probable AD diagnosis; and (3) MCI subjects who had a CDR score of 0.5, MMSE scores between 21 and 30, and memory complaints and abnormal memory function according to the Logical Memory II subscale (Delayed Paragraph Recall but an absence of dementia. The patients with MCI who converted to AD within 3-year follow-up were classified into the MCI-c group; otherwise, they were classified into the MCI-nc group. With respect to the gender ratio and age, the AD group did not significantly differ from the NC group (p = 0.14 in the gender ratio and p = 0.68 in age), and the MCI-c did not significantly differ from the MCI-nc group (p = 0.66 in the gender ratio and p = 0.20 in age). However, the AD group exhibited significantly lower MMSE scores (p = 1.25 E − 42) than the NC group. Table 1 lists the demographics of all these subjects.
Structural MRI Data Acquisition
Structural MRI images were acquired from multiple sites and platforms with different acquisition parameters, which can be found at http://adni.loni.usc.edu/methods/documents/mriprotocols/. The T1-weighted magnetization prepared rapid gradient echo (MPRAGE) images of all these subjects were obtained from 1.5T or 3T scanners. For intensity non-uniformity and gradient nonlinearity correction, the grad warp, B1 calibration, and N3 correction were implemented on each structural MRI image. The processed NIFTI images were downloaded for this study. Details of the protocols of MRI image correction can be found at http://adni.loni.usc.edu/methods/mri-analysis/mri-pre-processing/.
Image Preprocessing
All of the spatial preprocessing of structural MRI images was performed via Statistical Parametric Mapping (SPM8) software (https://www.fil.ion.ucl.ac.uk/spm/software/spm8/) in MATLAB. The Voxel-Based Morphometry (VBM) Toolbox (http://dbm.neuro.uni-jena.de/wordpress/vbm/download/) was used for the automated segmentation and normalization of structural MRI images. First, each image was segmented into three parts: GM, white matter, and cerebrospinal fluid (CSF) (Rajapakse et al., 1997; Manjón et al., 2010). A de-noising filter and a classical Markov random field (MRF) approach were implemented to further improve the segmentation effect (Ashburner, 2007). Then, GM images were normalized by the Diffeomorphic Anatomical Registration using Exponential Lie Algebra (DARTEL) protocol and transformed into the Montreal Neurological Institute (MNI) space (Ashburner, 2007). Finally, all the subjects' GM images were smoothed with a kernel of 8-mm full width at half maximum (FWHM).
Feature Extraction
In this study, the brain network-level features were extracted via ICA using the Fusion ICA toolbox (FIT) (https://trendscenter.org/software/fit/). The GM images of the AD and NC groups were decomposed into a mixing coefficient matrix and a source matrix with the Minimum Description Length (MDL) criteria to estimate the optimal number of ICs. Each row of the source matrix represents an IC, and each column of the mixing coefficient matrix represents the contribution of each subject to the corresponding IC. A two-sample t-test was performed on the mixing coefficient of each IC, and then these IC maps with significant between-group differences were converted to a z-score brain map and reshaped to a binarization mask with a threshold Z ≥ 3. For each IC, the main brain clusters were reported based on the Anatomical Automatic Labeling (AAL) atlas. For each subject in the AD, MCI-c, MCI-nc, and NC groups, only the top 3 ROIs ranked by the cluster size were selected as the ROI-level features within each network. The average GMV of each ROI falling into the brain network template was regarded as the ROI-level original feature value. The average GMV of voxels within each binarization network template was calculated as the value of network-level original features.
Two-Level Coupled Feature Representation
We took the AD and NC groups as an example to illustrate the implementation of coupled feature representation at the ROI level and the network level. The two-level coupled feature representations of the MCI-c and MCI-nc groups were generated using the same method as the AD and NC groups.
Suppose that there are m1 samples in the AD group and m2 samples in the NC group (M = m1 + m2), we assume that there are N ROI-level original features and L brain network-level original features for each subject. For the lth brain network, if there are n ROI-level original features (n × L = N), and the numerical value of the kth ROI-level features of the ith subject is denoted as , then the ROI-level original feature vector can be represented as . The whole ROI-level original feature vector for the ith subject is . For the ith subject, the brain network-level original feature vector is , and the numerical value of the jth network-level features is denoted as vi,j. The superscript ⊤ represents a transpose operator of a vector or a matrix. In particular, we considered two levels of feature representation with the Original Features Matrix (OFM): and as follows:
Compared with the prior ROI-level coupled feature representation method (Shi et al., 2014), the present study proposed a novel two-level coupled feature representation method that attempted to investigate the complex coupling relationship of both the network-level feature matrix VOFM and the ROI-level feature matrix ZOFM for the identification of NC and AD using structural MRI data. We illustrated and schematized our framework in Figure 1 compared with the previous ROI-level coupled interaction representation method (Shi et al., 2014).
Figure 1. A scheme of the proposed framework of two-level coupled interaction representation of neuroimaging features.
The Network-Level Intra-coupled Interactions
We illustrated the method of performing the network-level intra-coupled feature representation by the ROI-level feature matrix of the lth brain network as an example.
Referring to previous study about the coupled attribute analysis on numerical data (Wang et al., 2013), the ROI-level feature vector of each brain network, , was mapped into the expanded feature space, employing a matrix expansion with E1 power as follows:
and we can represent the ROI-level Extended Features Matrix (EFM) of this brain network as follows:
Next, the Pearson's correlation coefficient, R, between each pair of the ROI-level features of ZEFM, was calculated as the network-level intra-coupled weight matrix to reflect the ROI-level (intra- and inter-) coupled interactions within each brain network from both the linear and non-linear aspects. If the p-value of R was >0.05, the correlation coefficient was revised to 0. In this way, Rintra describes the correlation between the kth ROI-level original feature and its own expanded powers, and Rinter describes the pairwise correlation between the kth ROI-level feature and all the expanded powers of the others, as follows:
where γpq is the revised Pearson's correlation coefficient between the pth and qth power of the kth ROI-level original feature, and , respectively, and is the revised Pearson's correlation coefficient between and (k ≠ τ).
For the lth brain network from the ith subject, the network-level intra-coupled feature vector can be represented as . The expanded vector of the kth ROI-level feature is , and the expanded vector of other features is
Finally, the kth ROI-level coupled feature vector of the ith subject for the lth brain network is denoted as follows:
where ,
Then, the network-level intra-coupled feature matrix (CFM) of the lth brain network for the ith subject is:
The network-level intra-coupled feature vector of the ith subject is the concatenation of all networks' coupled features vectors, as follows:
Note that the interactions on ROIs belonging to different networks are not included.
The Network-Level Inter-Coupled Interactions
For the network-level inter-coupled representation, we can represent the CFM as F ∈ ℝM × L with the first E2 coefficients from CCA. In contrast to the network-level intra-coupled feature representation used with the revised Pearson's correlation coefficient, we chose the CCA coefficients as the coupling weights matrix for the network-level coupled feature representation. CCA is a way of inferring information from cross-covariance matrices of different network-level features. The canonical correlation for the canonical variate pairs from any two network-level is as follows: , which can represent the inter-coupled interactions of different network-level features. The top E2 canonical correlations are used for inter-coupled interactions description rather than simply involving the whole wl, revised as . For the whole brain, the network-level inter-coupled feature vector of the ith subject is denoted as follows:
where and .
To obtain the coupled feature representation at two levels, the final CFMnetwork for the ith subject can be represented as follows: .
Classification With Coupled Features
Boosting is a machine learning approach based on the idea of improving the accuracy of a decision by combining many relatively weak base learners (Schapire, 2013). The AdaBoost algorithm works by updating parameters of feature distribution in weak learners over training samples after each iteration sequentially and adaptively (Freund and Schapire, 1997; Collins et al., 2002). In this study, the two-level coupled feature matrix was represented for the following classification analysis. We chose an SVM classifier with a linear kernel function as the base learner. In total, L + 1 base learners were trained, of which the L base learners were trained for different brain network-level intra-coupled features, and one was trained for the brain network-level inter-coupled matrix.
We carried out separate analyses on two tasks: AD vs. NC and MCI-c vs. MCI-nc classification. First, the boosting models were constructed on the two-level Coupled Features Matrix, denoted as CFMnetwork for AD vs. NC, and MCI-c vs. MCI-nc classification. The 10-fold cross-validation was applied to evaluate the performance, and the average results were reported. In our two-level coupled feature representation and classification scheme, several parameters need to be set, including E1 for the parameter of network-level intra-coupled expansion and E2 for network-level inter-coupled coefficient selection. Here, the optimal values of E1 and E2 were searched from a small set of {2, 3, 4} and {1, 2, 3, 4, 5}, respectively.
We also constructed three other kinds of feature matrices separately: (1) The ROI-level Original Features Matrix only, denoted as OFMROI; (2) the ROI-level Coupled Features Matrix across the whole brain without the network-level information, denoted as CFMROI; and 3) the network-level Original Features Matrix without coupling interaction information, denoted as OFMnetwork. To validate the advantage of the two-level coupled feature representation, we compared the classification performances with these three different brain features.
Results
The number of estimated ICs was 49 for the AD and NC groups with the structural MRI data, and 21 ICs showed significant between-group differences with Bonferroni correction. The results of our two-level coupled feature representation and the classification model showed that the best prediction accuracy is 90.44%, sensitivity is 88.5%, and specificity is 93.67% for AD and NC groups and the best prediction accuracy is 87.72%, sensitivity is 84.16%, and specificity is 91.64% for the MCI-c and MCI-nc groups.
Based on the two-level coupled feature representation, Tables 2, 3 show the classification results and give the detailed results of the best parameters of E1 and E2 as references for future studies. When E1 and E2 were set as 3 and 2, the two-level coupled feature representation achieved the best performance for AD vs. NC classification. The same parameter selection is applicable to the MCI-c vs. MCI-nc distinction.
Table 3. Classification results for MCI-c vs. MCI-nc with different kinds of feature representations.
The results of the comparison for four different brain feature representations for AD and NC classification are shown in Table 2. The best classification accuracies for different features are 69.21% for OFMROI, 73.51% for CFMROI, 71.29% for OFMnetwork, and 90.44% for CFMnetwork. Table 3 shows the results of the classification performances for the four kinds of feature matrices for the MCI-c and MCI-nc classification. The best classification accuracies for different features are 64.15% for OFMROI, 75.10% for CFMROI, 68.62% for OFMnetwork, and 87.72% for CFMnetwork.
Discussion
The current study proposed a novel network-level inter-coupled representation approach, integrated the intrinsic coupled relationships of both the ROI-level and the brain network-level features, and then applied them to the classification of subjects with AD, MCI-c, and MCI-nc from the normal elderly individuals based on structural MRI data. By integrating the intra- and inter-coupled interactions among the ROI-level and network-level features, we obtained the innovative coupled neuroimaging features, CFMnetwork and achieved the optimal classification accuracy for both AD vs. NC and MCI-nc vs. MCI-c classification compared with the OFMROI, CFMROI, and OFMnetwork. These results indicated the effectiveness of the coupled interaction representation among different levels of neuroimaging features. Furthermore, the best-coupled expansion parameter E1 was 3 for the network-level intra-coupled interaction, and the best-coupled coefficient selection E2 was 2 for the network-level inter-coupled interaction.
Two-Level Coupled Feature Representation for AD and NC Classification
In the current study, we explored the coupled interaction representation of two-level (the ROI -level and the network- level) neuroimaging features on structural MRI data. For AD and NC classification, the OFMnetwork representation obtained slightly better performance (accuracy = 71.29%) than the OFMROI representation (accuracy = 69.21%), and the CFMnetwork representation achieved much greater accuracy (accuracy = 90.44%) than the CFMROI representation (accuracy = 73.51%). Overall, the network-level feature representations showed preferable results to the ROI-level features, which suggested the advantages of the network-level features in the AD classification task. A number of studies built classification models to distinguish patients with AD from NCs based on the single- level features from brain neuroimaging data, such as the ROI-level features (Zhang et al., 2011; Zhan et al., 2015; Rondina et al., 2018) or the network-level features (Yang et al., 2011). For example, to identify the conversion from normal elderly cognition to AD, Zhan et al. defined 90 ROIs and computed the mean GMV as the ROI-level feature matrix and achieved an accuracy of 83.83% (Zhan et al., 2015). The ROI-level features computed by the ratio of increased GMV have also been extracted from structural MRI data, and they obtained a classification accuracy of 76.11% between AD and NC (Rondina et al., 2018). Different from them, Wang et al. considered the corresponding coefficients of ICs decomposed using the ICA algorithm as the network-level features and got 80.7% accuracy with the SVM classifier for the diagnosis of individuals with AD and HC (Yang et al., 2011). In this study, we not only extracted the ROI-level features but also obtained the network-level features and integrated them. Although the measurements or definitions of original features in our study were different from those in the prior studies, our study attempted to integrate hierarchical features from sMRI for the classification of AD and NC.
Compared with the original features (OFMROI and OFMnetwork), the coupled features (CFMROI and CFMnetwork) helped improve the classification results in this study. Among the four kinds of feature representations, the CFMnetwork obtained the best classification performance of AD and NC (accuracy = 90.44%), which demonstrated the strengths of the integration of multilevel (the ROI level and the network level) coupled interaction representation of hierarchical features. It has been demonstrated that there were strong couplings, including the relations that exist explicitly or implicitly between source and destination entities, among values, attributes, and objects for numerical data (Wang et al., 2013; Cao, 2015). Wang et al. introduced the framework to quantify and integrate the intra-coupled and inter-coupled interactions with the original information from numerical data (Wang et al., 2013). Many studies indicated that the original neuroimaging features exhibited complex regional dependencies, and the features in different brain networks changed diversely along with the progression of MCI and AD (Liu et al., 2017c; Zheng et al., 2019; Lee et al., 2020). Inspired by these pieces of research, we quantitatively measured the network-level intra-coupled relationships and proposed the network-level inter-coupled interaction feature representation. Recently, several studies focusing on the coupled interactions for ROI-level features have been reported, in which they analyzed the ROI-level coupled relationships and appealed to the coupling analysis for numerical data (Shi et al., 2014, 2020). By hypothesizing that the ROI-level features (the average GMV) were related to each other in some ways, Shi et al. introduced the coupled interaction representation for the ROI-level features and adopted the coupled boosting algorithm to analyze the pairwise coupled-diversity correlation between modalities with the best performance of 86.% for AD and NC classification (Shi et al., 2014). Our model achieved higher accuracy of 90.44%, which illustrated the advantages of our two-level coupled interactions representation.
Two-Level Coupled Interaction Representation for MCI-C and MCI-Nc Classification
MCI is an intermediate stage in the trajectory from normal cognition to AD and is important for the early diagnosis of AD (Ahmed et al., 2017; Arbabshirani et al., 2017; Thung et al., 2018). To classify MCI-c and MCI-nc, we integrated the intra-coupled and inter-coupled interactions among the ROI-level and network-level features with the best accuracy of 87.72% compared with other feature representations. Considering the GM density from structural MRI data as the voxel-level features, Wang et al. obtained an accuracy of 69.77% for MCI-c vs. MCI-nc based on informed Partial Least Square models (Wang et al., 2016). Based on 38 subcortical volumes as ROI-level features, Aleksandra et al. classified MCI vs. NC with the Random Forest model (Lebedeva et al., 2017). Apart from the slight differences in classifiers, a common practice in former studies was the straight concatenation of all ROI-level features as independent variables into the input feature matrix. However, these schemes lost sight of the complicated dependencies among ROI-level features (Guo et al., 2015) and the diversified and heterogenous changes for different structural networks (Sui et al., 2014; Liu et al., 2017b). Compared with the abovementioned studies, we believe that the proposed two-level coupled interaction integration method which was validated could be more powerful for the diagnosis of MCI conversion to AD with promising results.
Methodological Considerations
ICA is a popular data-driven method to study brain functional networks (Damoiseaux et al., 2012) and structural networks (Guo et al., 2015; Liu et al., 2017c). The network-level features extracted by ICA could effectively reduce the data dimensions and depend entirely on brain neuroimaging data themselves without prior knowledge. It has been confirmed that ROIs in the same brain network carried similar and interregional covariate information and exhibited more complicated regional dependencies than those in different brain networks (Liu et al., 2017a; Filippi et al., 2020; Wang et al., 2022). Thus, we performed ICA to identify brain structural networks from AD and NC groups and defined the representation of the network-level and ROI-level neuroimaging features.
Then, we designed the two-level coupled interaction integration of hierarchical features to evaluate the network-level intra-coupled and inter-coupled effects in AD and MCI classification. More specifically, we innovatively considered both the network-level intra-coupled interaction for every network individually, quantified by the intra-coupled and inter-coupled interactions among the ROI-level features within this network but not ROIs across the whole brain; and the network-level inter-coupled interaction among different network-level features was captured by the coupled coefficients between the ROI-level feature set of this network and the ROI-level features set of others. Besides, CCA can maximize the correlation between a linear combination of the variables in two datasets and has been applied to identify the relationship between brain networks (Sui et al., 2012; Ouyang et al., 2015; Taquet et al., 2021). In this study, CCA was performed on the ROI-level feature sets of any two brain networks and obtained the inter-coupled coefficients of network-level features to avoid reusing the ROI-level features information for network-level coupled interaction representation.
In the current study, E1 was denoted as the maximal power for the expansion of the ROI-level features in the network-level intra-coupled interaction representation and E2 as the number of the CCA coefficients selected to express the information for the network-level inter-coupled interaction representation. In this way, we integrated the two-level coupled interactions, including both the intra-coupled and inter-coupled interactions for both the network-level and the ROI-level features. We set the range of E1, from 2 to 4, and E2, from 1 to 5, respectively. When the value of E1 increases, the value of E1! will grow correspondingly so will E2!. The coupled interactions for feature values are quantified by a Taylor-like expansion, . Along with the increase of E1 and E2, the reciprocals, and , decreased accordingly and caused the corresponding weight value of the expanded items to be too small to capture the interactions among different features. Furthermore, the greater E1 or E2 may have less significant effects on the classification performance. Then, the appropriate E1 or E2 helps to fully exploit the information of coupled interactions within hierarchical features. As our results indicated, the classification performance changed with the variation of the two coupled interaction parameters. When E1 = 3 and E2 = 2, the best result was obtained in this tudy, which implied that the information of coupled interactions within hierarchical features has been fully exploited. When E1 = 1, the number of the ROI-level features was still invariant, which meant that the ROI-level coupled feature matrix was the original ROI-level feature matrix without coupled interaction analysis. When E1 increased, the number of ROI features increased with E1-fold accordingly. When E1 was equal to 3, each ROI-level feature was expanded three times in numerical space than the original feature. The inter-coupled interaction parameter for brain network-level features indicated that the first E2 pairs of canonical variables via CCA were maximally adequate to express the information among brain network-level features. When E2 was equal to 2, the top two coefficients of CCA were selected for the network-level inter-coupled interaction representation. With regard to the ROI-level and network-level coupled interactions of parameters setting, we recommend E1 = 3 and E2 = 2 for similar analysis in the future.
Limitations and Future Work
The current study focused on constructing a novel coupled relationship representation to combine the ROI-level and network-level features, and then, we only adopted the numerical features from the structural MRI data. As different neuroimaging modality features provide complementary information, the coupled interactions of different modalities are heterogeneous (Zhang et al., 2011; Rathore et al., 2017). The coupled interactions based on multi-modality features are a novel issue that needs more exploration. The representation and integration of the intra-coupled interaction and inter-coupled interaction at multilevels, including the modality level, the network level, and the ROI level, will be investigated in future studies.
Conclusion
In the current study, we proposed a network-level inter-coupled interaction representation approach with the independent components from ICA as the network-level features and the CCA weights for network-level inter-coupled characteristics. Then, we integrated the ROI-level and network-level coupled interactions based on structural MRI data to classify subjects with AD, MCI-c, MCI-nc, and NC. Our results demonstrated that the two-level coupled interaction feature representation outperformed the original feature representation and the single-level coupled representation and provided a perspective based on the coupled interaction integration of hierarchical neuroimaging features.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found at: adni.loni.usc.edu.
Ethics Statement
The ADNI study was approved by the Institutional Review Boards/Research Ethics Boards of each participating site. All participants provided their written informed consent in this study. The ethics committees/institutional review boards providing the approval for the ADNI study are: Albany Medical Center Committee on Research Involving Human Subjects Institutional Review Board, Boston University Medical Campus and Boston Medical Center Institutional Review Board, Butler Hospital Institutional Review Board, Cleveland Clinic Institutional Review Board, Columbia University Medical Center Institutional Review Board, Duke University Health System Institutional Review Board, Emory Institutional Review Board, Georgetown University Institutional Review Board, Health Sciences Institutional Review Board, Houston Methodist Institutional Review Board, Howard University Office of Regulatory Research Compliance, Icahn School of Medicine at Mount Sinai Program for the Protection of Human Subjects, Indiana University Institutional Review Board, Institutional Review Board of Baylor College of Medicine, Jewish General Hospital Research Ethics Board, Johns Hopkins Medicine Institutional Review Board, Lifespan – Rhode Island Hospital Institutional Review Board, Mayo Clinic Institutional Review Board, Mount Sinai Medical Center Institutional Review Board, Nathan Kline Institute for Psychiatric Research & Rockland Psychiatric Center Institutional Review Board, New York University Langone Medical Center School of Medicine Institutional Review Board, Northwestern University Institutional Review Board, Oregon Health and Science University Institutional Review Board, Partners Human Research Committee Research Ethics, Board Sunnybrook Health Sciences Centre, Roper St. Francis Healthcare Institutional Review Board, Rush University Medical Center Institutional Review Board, St. Joseph's Phoenix Institutional Review Board, Stanford Institutional Review Board, The Ohio State University Institutional Review Board, University Hospitals Cleveland Medical Center Institutional Review Board, University of Alabama Office of the IRB, University of British Columbia Research Ethics Board, University of California Davis Institutional Review Board Administration, University of California Los Angeles Office of the Human Research Protection Program, University of California San Diego Human Research Protections Program, University of California San Francisco Human Research Protection Program, University of Iowa Institutional Review Board, University of Kansas Medical Center Human Subjects Committee, University of Kentucky Medical Institutional Review Board, University of Michigan Medical School Institutional Review Board, University of Pennsylvania Institutional Review Board, University of Pittsburgh Institutional Review Board, University of Rochester Research Subjects Review Board, University of South Florida Institutional Review Board, University of Southern, California Institutional Review Board, UT Southwestern Institution Review Board, VA Long Beach Healthcare System Institutional Review Board, Vanderbilt University Medical Center Institutional Review Board, Wake Forest School of Medicine Institutional Review Board, Washington University School of Medicine Institutional Review Board, Western Institutional Review Board, Western University Health Sciences Research Ethics Board, and Yale University Institutional Review Board.
Author Contributions
KL, QL, LY, and XG conceived, designed the experiments, and drafted the manuscript. KL and QL analyzed the data. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the National Natural Science Foundation of China (Grant Nos. 62071051 and 61671066) and the Key Program of the National Natural Science Foundation of China (Grant No. 61731003).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
Data collection and sharing for this project were funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense Award No. W81XWH-12-2-0012). ADNI was funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd. and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research was providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study was coordinated by the Alzheimer's Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
References
Ahmed, O. B., Benois-Pineau, J., Allard, M., Catheline, G., and Amar, C. B. (2017). Recognition of Alzheimer's disease and mild cognitive impairment with multimodal image-derived biomarkers and multiple kernel learning. Neurocomputing 220, 98–110. doi: 10.1016/j.neucom.2016.08.041
Anstey, K. J., and Maller, J. J. (2003). The role of volumetric MRI in understanding mild cognitive impairment and similar classifications. Aging Ment. Health 7, 238–250. doi: 10.1080/1360786031000120732
Arbabshirani, M. R., Plis, S., Sui, J., and Calhoun, V. D. (2017). Single subject prediction of brain disorders in neuroimaging: promises and pitfalls. Neuroimage 145, 137–165. doi: 10.1016/j.neuroimage.2016.02.079
Ashburner, J. (2007). A fast diffeomorphic image registration algorithm. Neuroimage 38, 95–113. doi: 10.1016/j.neuroimage.2007.07.007
Betzel, R. F., Byrge, L., He, Y., Goñi, J., Zuo, X.-N., and Sporns, O. (2014). Changes in structural and functional connectivity among resting-state networks across the human lifespan. Neuroimage 102, 345–357. doi: 10.1016/j.neuroimage.2014.07.067
Brickman, A. M., Habeck, C., Zarahn, E., Flynn, J., and Stern, Y. (2007). Structural MRI covariance patterns associated with normal aging and neuropsychological functioning. Neurobiol. Aging 28, 284–295. doi: 10.1016/j.neurobiolaging.2005.12.016
Cao, L. (2015). Coupling learning of complex interactions. Inf. Process. Manag. 51, 167–186. doi: 10.1016/j.ipm.2014.08.007
Collins, M., Schapire, R. E., and Singer, Y. (2002). Logistic regression, AdaBoost and bregman distances. Mach. Learn. 48, 253–285. doi: 10.1023/A:1013912006537
Damoiseaux, J. S., Prater, K. E., Miller, B. L., and Greicius, M. D. (2012). Functional connectivity tracks clinical deterioration in Alzheimer's disease. Neurobiol. Aging 33, 828.e19–e30. doi: 10.1016/j.neurobiolaging.2011.06.024
Dona, O., Thompson, J., and Druchok, C. (2016). Comprehensive review on magnetic resonance imaging in Alzheimer's disease. Crit. Rev. Biomed. Eng. 44, 213–225. doi: 10.1615/CritRevBiomedEng.2016019544
Feng, J., Zhang, S.-W., Chen, L., and Xia, J. (2021). Alzheimer's disease classification using features extracted from nonsubsampled contourlet subband-based individual networks. Neurocomputing 421, 260–272. doi: 10.1016/j.neucom.2020.09.012
Filippi, M., Basaia, S., Canu, E., Imperiale, F., Magnani, G., Falautano, M., et al. (2020). Changes in functional and structural brain connectome along the Alzheimer's disease continuum. Mol. Psychiatry 25, 230–239. doi: 10.1038/s41380-018-0067-8
Freund, Y., and Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139. doi: 10.1006/jcss.1997.1504
Guo, X., Wang, Y., Guo, T., Chen, K., Zhang, J., Li, K., et al. (2015). Structural covariance networks across healthy young adults and their consistency. J. Magn. Reson. Imaging 42, 261–268. doi: 10.1002/jmri.24780
Hu, K., Wang, Y., Chen, K., Hou, L., and Zhang, X. (2016). Multi-scale features extraction from baseline structure MRI for MCI patient classification and AD early diagnosis. Neurocomputing 175, 132–145. doi: 10.1016/j.neucom.2015.10.043
Leandrou, S., Petroudi, S., Kyriacou, P. A., Reyes-Aldasoro, C. C., and Pattichis, C. S. (2018). Quantitative MRI brain studies in mild cognitive impairment and Alzheimer's disease: a methodological review. IEEE Rev. Biomed. Eng. 11, 97–111. doi: 10.1109/RBME.2018.2796598
Lebedeva, A. K., Westman, E., Borza, T., Beyer, M. K., Engedal, K., Aarsland, D., et al. (2017). MRI-based classification models in prediction of mild cognitive impairment and dementia in late-life depression. Front. Aging Neurosci. 9, 13. doi: 10.3389/fnagi.2017.00013
Lee, P.-L., Chou, K.-H., Chung, C.-P., Lai, T.-H., Zhou, J. H., Wang, P.-N., et al. (2020). Posterior cingulate cortex network predicts Alzheimer's disease progression. Front. Aging Neurosci. 12, 608667. doi: 10.3389/fnagi.2020.608667
Liu, J., Li, M., Pan, Y., Lan, W., Zheng, R., Wu, F.-X., et al. (2017a). Complex brain network analysis and its applications to brain disorders: a survey. Complexity 2017, 1–27. doi: 10.1155/2017/8362741
Liu, K., Chen, K., Yao, L., and Guo, X. (2017b). Prediction of mild cognitive impairment conversion using a combination of independent component analysis and the cox model. Front. Hum. Neurosci. 11, 33. doi: 10.3389/fnhum.2017.00033
Liu, K., Yao, S., Chen, K., Zhang, J., Yao, L., Li, K., et al. (2017c). Structural brain network changes across the adult lifespan. Front. Aging Neurosci. 9, 275. doi: 10.3389/fnagi.2017.00275
Manjón, J. V., Coupé, P., Martí-Bonmatí, L., Collins, D. L., and Robles, M. (2010). Adaptive non-local means denoising of MR images with spatially varying noise levels. J. Magn. Reson. Imaging 31, 192–203. doi: 10.1002/jmri.22003
Moradi, E., Pepe, A., Gaser, C., Huttunen, H., and Tohka, J. (2015). Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects. Neuroimage 104, 398–412. doi: 10.1016/j.neuroimage.2014.10.002
Ouyang, X., Chen, K., Yao, L., Hu, B., Wu, X., Ye, Q., et al. (2015). Simultaneous changes in gray matter volume and white matter fractional anisotropy in Alzheimer's disease revealed by multimodal CCA and joint ICA. Neuroscience 301, 553–562. doi: 10.1016/j.neuroscience.2015.06.031
Pang, G., Cao, L., Chen, L., and Liu, H. (2016). “Unsupervised feature selection for outlier detection by modelling hierarchical value-feature couplings,” in 2016 IEEE 16th International Conference on Data Mining (ICDM) (Barcelona: IEEE), 410–419. doi: 10.1109/ICDM.2016.0052
Rajapakse, J. C., Giedd, J. N., and Rapoport, J. L. (1997). Statistical approach to segmentation of single-channel cerebral MR images. IEEE Trans. Med. Imaging 16, 176–186. doi: 10.1109/42.563663
Rathore, S., Habes, M., Iftikhar, M. A., Shacklett, A., and Davatzikos, C. (2017). A review on neuroimaging-based classification studies and associated feature extraction methods for Alzheimer's disease and its prodromal stages. Neuroimage 155, 530–548. doi: 10.1016/j.neuroimage.2017.03.057
Rondina, J. M., Ferreira, L. K., de Souza Duran, F. L., Kubo, R., Ono, C. R., Leite, C. C., et al. (2018). Selecting the most relevant brain regions to discriminate Alzheimer's disease patients from healthy controls using multiple kernel learning: a comparison across functional and structural imaging modalities and atlases. NeuroImage Clin. 17, 628–641. doi: 10.1016/j.nicl.2017.10.026
Schapire, R. E. (2013). Boosting: foundations and algorithms. Kybernetes 42, 164–166. doi: 10.1108/03684921311295547
Shi, Y., Gao, Y., Liao, S., Zhang, D., Gao, Y., and Shen, D. (2015). Semi-Automatic segmentation of prostate in CT images via coupled feature representation and spatial-constrained transductive lasso. IEEE Trans. Pattern Anal. Mach. Intell. 37, 2286–2303. doi: 10.1109/TPAMI.2015.2424869
Shi, Y., Suk, H.-I., Gao, Y., Lee, S.-W., and Shen, D. (2020). Leveraging coupled interaction for multimodal Alzheimer's disease diagnosis. IEEE Trans. Neural Networks Learn. Syst. 31, 186–200. doi: 10.1109/TNNLS.2019.2900077
Shi, Y., Suk, H.-I., Gao, Y., and Shen, D. (2014). “Joint coupled-feature representation and coupled boosting for AD diagnosis,” in 2014 IEEE Conference on Computer Vision and Pattern Recognition (Columbus: IEEE), 2721–2728. doi: 10.1109/CVPR.2014.354
Sui, J., Adali, T., Yu, Q., Chen, J., and Calhoun, V. D. (2012). A review of multivariate methods for multimodal fusion of brain imaging data. J. Neurosci. Methods 204, 68–81. doi: 10.1016/j.jneumeth.2011.10.031
Sui, J., Huster, R., Yu, Q., Segall, J. M., and Calhoun, V. D. (2014). Function-structure associations of the brain: evidence from multimodal connectivity and covariance studies. Neuroimage 102, 11–23. doi: 10.1016/j.neuroimage.2013.09.044
Taquet, M., Smith, S. M., Prohl, A. K., Peters, J. M., Warfield, S. K., Scherrer, B., et al. (2021). A structural brain network of genetic vulnerability to psychiatric illness. Mol. Psychiatry 26, 2089–2100. doi: 10.1038/s41380-020-0723-7
Thung, K.-H., Yap, P.-T., Adeli, E., Lee, S.-W., and Shen, D. (2018). Conversion and time-to-conversion predictions of mild cognitive impairment using low-rank affinity pursuit denoising and matrix completion. Med. Image Anal. 45, 68–82. doi: 10.1016/j.media.2018.01.002
Vemuri, P., and Jack, C. R. (2010). Role of structural MRI in Alzheimer's disease. Alzheimers Res. Ther. 2, 23. doi: 10.1186/alzrt47
Wang, C., Chi, C. H., Zhou, W., and Wong, R. (2015a). Coupled interdependent attribute analysis on mixed data. Proc. Natl. Conf. Artif. Intell. 3, 1861–1867. Available online at: https://dl.acm.org/doi/10.5555/2886521.2886579
Wang, C., Dong, X., Zhou, F., Cao, L., and Chi, C. H. (2015b). Coupled attribute similarity learning on categorical data. IEEE Trans. Neur. Netw. Learn. Syst. 26, 781–797. doi: 10.1109/TNNLS.2014.2325872
Wang, C., She, Z., and Cao, L. (2013). “Coupled attribute analysis on numerical data,” in IJCAI International Joint Conference on Artificial Intelligence (Beijing), 1736–1742.
Wang, P., Chen, K., Yao, L., Hu, B., Wu, X., Zhang, J., et al. (2016). Multimodal classification of mild cognitive impairment based on partial least squares. J. Alzheimers Dis. 54, 359–371. doi: 10.3233/JAD-160102
Wang, Q., Zhao, S., He, Z., Zhang, S., Jiang, X., Zhang, T., et al. (2022). Modeling functional difference between gyri and sulci within intrinsic connectivity networks. Cereb. Cortex. bhac111. doi: 10.1093/cercor/bhac111
Wang, S., and Cao, L. (2020). Inferring implicit rules by learning explicit and hidden item dependency. IEEE Trans. Syst. Man Cybern. Syst. 50, 935–946. doi: 10.1109/TSMC.2017.2768547
Wei, R., Li, C., Fogelson, N., and Li, L. (2016). Prediction of conversion from mild cognitive impairment to Alzheimer's disease using MRI and structural network features. Front. Aging Neurosci. 8, 76. doi: 10.3389/fnagi.2016.00076
Yang, W., Lui, R. L. M., Gao, J.-H., Chan, T. F., Yau, S.-T., Sperling, R. A., et al. (2011). Independent component analysis-based classification of Alzheimer's disease MRI data. J. Alzheimers Dis. 24, 775–783. doi: 10.3233/JAD-2011-101371
Zeifman, L. E., Eddy, W. F., Lopez, O. L., Kuller, L. H., Raji, C., Thompson, P. M., et al. (2015). Voxel level survival analysis of grey matter volume and incident mild cognitive impairment or Alzheimer's disease. J. Alzheimers Dis. 46, 167–178. doi: 10.3233/JAD-150047
Zhan, Y., Chen, K., Wu, X., Zhang, D., Zhang, J., Yao, L., et al. (2015). Identification of conversion from normal elderly cognition to Alzheimer's disease using multimodal support vector machine. J. Alzheimers Dis. 47, 1057–1067. doi: 10.3233/JAD-142820
Zhang, B., Lin, L., and Wu, S. (2021a). A review of brain atrophy subtypes definition and analysis for Alzheimer's disease heterogeneity studies. J. Alzheimers Dis. 80, 1339–1352. doi: 10.3233/JAD-201274
Zhang, D., Wang, Y., Zhou, L., Yuan, H., and Shen, D. (2011). Multimodal classification of Alzheimer's disease and mild cognitive impairment. Neuroimage 55, 856–867. doi: 10.1016/j.neuroimage.2011.01.008
Zhang, Q., Cao, L., Shi, C., and Niu, Z. (2021b). “Neural time-aware sequential recommendation by jointly modeling preference dynamics and explicit feature couplings,” in IEEE Transactions on Neural Networks and Learning Systems, 1–13. doi: 10.1109/TNNLS.2021.3069058
Zheng, W., Yao, Z., Li, Y., Zhang, Y., Hu, B., and Wu, D. (2019). Brain connectivity based prediction of Alzheimer's disease in patients with mild cognitive impairment based on multi-modal images. Front. Hum. Neurosci. 13, 399. doi: 10.3389/fnhum.2019.00399
Keywords: coupled interaction representation, hierarchical features, classification, mild cognitive impairment, Alzheimer's disease, structural MRI
Citation: Liu K, Li Q, Yao L and Guo X (2022) The Coupled Representation of Hierarchical Features for Mild Cognitive Impairment and Alzheimer's Disease Classification. Front. Neurosci. 16:902528. doi: 10.3389/fnins.2022.902528
Received: 23 March 2022; Accepted: 25 April 2022;
Published: 03 June 2022.
Edited by:
Shijie Zhao, Northwestern Polytechnical University, ChinaReviewed by:
Xi Jiang, University of Electronic Science and Technology of China, ChinaQi Zhang, University of Technology Sydney, Australia
Copyright © 2022 Liu, Li, Yao and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xiaojuan Guo, gxj@bnu.edu.cn
*Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.