Skip to main content

REVIEW article

Front. Neurosci., 06 August 2018
Sec. Brain Imaging Methods
This article is part of the Research Topic 10 Years of Impactful, Open Neuroscience View all 47 articles

Classification and Prediction of Brain Disorders Using Functional Connectivity: Promising but Challenging

  • 1The Mind Research Network, Albuquerque, NM, United States
  • 2School of Computer & Information Technology, Shanxi University, Taiyuan, China
  • 3Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM, United States

Brain functional imaging data, especially functional magnetic resonance imaging (fMRI) data, have been employed to reflect functional integration of the brain. Alteration in brain functional connectivity (FC) is expected to provide potential biomarkers for classifying or predicting brain disorders. In this paper, we present a comprehensive review in order to provide guidance about the available brain FC measures and typical classification strategies. We survey the state-of-the-art FC analysis methods including widely used static functional connectivity (SFC) and more recently proposed dynamic functional connectivity (DFC). Temporal correlations among regions of interest (ROIs), data-driven spatial network and functional network connectivity (FNC) are often computed to reflect SFC from different angles. SFC can be extended to DFC using a sliding-window framework, and intrinsic connectivity states along the time-varying connectivity patterns are typically extracted using clustering or decomposition approaches. We also briefly summarize window-less DFC approaches. Subsequently, we highlight various strategies for feature selection including the filter, wrapper and embedded methods. In terms of model building, we include traditional classifiers as well as more recently applied deep learning methods. Moreover, we review representative applications with remarkable classification accuracy for psychosis and mood disorders, neurodevelopmental disorder, and neurological disorders using fMRI data. Schizophrenia, bipolar disorder, autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), Alzheimer's disease and mild cognitive impairment (MCI) are discussed. Finally, challenges in the field are pointed out with respect to the inaccurate diagnosis labeling, the abundant number of possible features and the difficulty in validation. Some suggestions for future work are also provided.

Introduction

Brain disorders such as schizophrenia (SZ) and bipolar disorder (BP) are considered in terms of disruptions of the normal-range operation of brain functions. While psychiatric disorders are diagnosed based on symptom scores from clinical interview, there are no existing gold standards that can be used for definitive validation. Brain functional neuroimaging techniques including functional magnetic resonance imaging (fMRI) (Lee et al., 2013; Power et al., 2014b), positron emission tomography (PET), and electroencephalography (EEG) have become important tools in investigating brain disease (Abi-Dargham and Horga, 2016). There is much hope that brain functional connectivity revealed using functional neuroimaging data can be used to characterize brain function abnormality and in turn benefit diagnosis and treatment (Deco and Kringelbach, 2014). Among diverse modalities, fMRI enables non-invasive investigation of brain function with high spatial resolution and has been widely used to detect and characterize brain networks or connectivity among functionally interconnected regions. Investigating differences in functional network (or connectivity) between disorders such as SZ and BP may provide new insights into their disease mechanisms (Birur et al., 2017). Furthermore, the identified changes in connectivity measures may be useful as biomarkers which can be employed to classify individual patients using machine learning methods (Arbabshirani et al., 2017; Stephan et al., 2017). In this paper, we restrict our review to fMRI data, but some methods are able to be easily expanded to other brain functional imaging modalities as well.

There have been a variety of methods proposed to measure functional connectivity (FC) among brain regions using fMRI data (Van Den Heuvel and Hulshoff Pol, 2010; Smith et al., 2013; Calhoun and De Lacy, 2017). While different approaches have different assumptions and advantages, a detailed review is important to help us understand the ways in which these approaches have been used. How to select features from a large amount of measures as biomarker for building model to classify or predict brain disorders is an important and challenging problem. Classification and prediction are two forms of analysis which are used for building models to separate classes and to predict future outcomes. Generally, classification is to classify categorical disease labels that have been already acquired concurrently with or prior to the scan, while prediction is to predict unknown disease labels, future progression, or continuous-valued functions. Compared with classification, prediction is harder but more promising for clinical utility. In the context of neuroimaging, although increasing studies have tended to shift their concentration to the prediction problem, the majority of previous studies on brain disorders focused on identifying neuromarkers for classifying different groups. In this paper, we primarily aim to present a comprehensive review summarizing various brain functional connectivity measures and typical classification strategies, in order to provide guidance in this field. It is worth noting that most of the measures and strategies used in the classification problem can also be applied or extended to the prediction problem. We also survey recent exciting applications that employed fMRI data to differentiate mental disorders and other brain diseases. The challenges and difficulties as well as potential solutions are pointed out in the end.

Functional Connectivity Measures from fMRI Data

Functional connectivity reflects the organization and inter-relationship of spatially separated brain regions. Methods for measuring and delineating functional connectivity play a key role, since the used measures may greatly affect the identification of biomarkers and the accuracy of individual-subject classification and prediction. Typically, functional connectivity is assumed to be stationary over the scanning time (usually several minutes), and most previous fMRI studies applied a static functional connectivity (SFC) analysis. Until recently, more emerging exciting work have proven that regarding brain functional connectivity as dynamic over time can be successful in uncovering the disruptions to the normal human brain in disease condition (Calhoun et al., 2014). Figure 1 summarizes the primary functional connectivity analysis methods and possible connectivity features used for classification/prediction problem.

FIGURE 1
www.frontiersin.org

Figure 1. The primary functional connectivity analysis methods and possible connectivity features used for classification/prediction problem.

Static Functional Connectivity Analyses

From a view of methodology, there are generally three kinds of strategies analyzing SFC (Calhoun and De Lacy, 2017). The first is a model-driven strategy which uses prior knowledge to decide sets of brain regions/voxels and then limit connectivity analysis to some specific regions/voxels. The second approach is more data-driven and maps whole brain functional networks using decomposition or clustering methods. In such case, brain voxels assigned to the same component or cluster reflect regions which are highly correlated. The third combines the idea of the above two strategies, which firstly extracts co-activated regions using a data-driven method and then estimates functional connectivity among the regions. We outline several typical methods as below.

Model-Driven Analysis for Assessing Connectivity Among Regions or Seeds

Brain functional connectivity analysis among a priori regions of interest (ROIs) or voxels (Poldrack, 2007) is the most widely applied model-driven method. Three key steps include the determination of locations and shapes of ROIs or the locations of voxels, the computation of representative time series of ROIs or voxels, and the assessment of connectivity (or coupling) among different ROIs or connectivity between each seed (ROI or voxel) and all other voxels within brain. As such, the resulting functional connectivity strengths reflect the temporal fluctuation relations among the selected voxels or regions. ROI-based functional connectivity strengths can be easily taken as features in classification and prediction problems, since the corresponding connectivity features of a new/testing subject can be directly computed between the brain regions (or voxels) selected using the training subjects. While ROIs and voxels are usually determined by subjective experience and prior knowledge, the resulting functional connectivity can be greatly sensitive to the empirical selection and show a very different pattern for small changes in the ROIs. Hence, how to decide a reasonable region including voxels with consistent brain function is a challenge. Considerable research work (Thirion et al., 2014; Glasser et al., 2016) has attempted to delineate a parcellation of brain by employing information of multiple modalities of imaging, however, inconsistencies still exists. The representative time series of voxels in one region can be calculated as the mean of all voxels' time series or the first principal component of all voxels' time series using principal component analysis (PCA). Although averaging and PCA can decrease the noise effect in the representative time series of ROIs to some extent, the obtained functional connectivity can still be related to noise. Functional connectivity between two representative time series is mostly estimated by computing correlations to measure their linear relationship, but also can be assessed by mutual information to identify non-linear relationships (Wang et al., 2014). Coherence estimates the linear relationship in the frequency domain (Sun et al., 2004), and connectivity within a specific frequency can be achieved by methods such as wavelet decomposition (Skidmore et al., 2011). It is worth noting that different measurements may reflect disparate connectivity meaning. In addition to the above computation steps, different preprocessing strategies also could affect the resulting functional connectivity strengths. Whether regressing out global mean is a controversial issue (Murphy et al., 2009; Hayasaka, 2013) and how to remove out head motion also deserve further investigation (Friston et al., 1996; Power et al., 2014a). These shortcomings should be carefully addressed while conducting analyses using the method.

Data-Driven Analysis for Estimating Spatial Functional Network Maps

In contrast to model-driven methods, data-driven approaches estimating functional networks do not require the specification of predefined brain regions or voxels. These popular approaches include spatial independent component analysis (ICA) (Calhoun et al., 2001; Calhoun and Adali, 2012; Du and Fan, 2013; Du et al., 2016; Calhoun and De Lacy, 2017), principle component analysis (PCA), and clustering methods (Van Den Heuvel et al., 2008; Du et al., 2014c). In particular, ICA is a widely used approach that has shown great promise in identifying network-based biomarkers of psychiatric disorders such as schizophrenia (SZ) (Garrity et al., 2007; Ongür et al., 2010; Calhoun et al., 2011; Khadka et al., 2013; Meda et al., 2014; Du et al., 2015b, 2018). Spatial ICA of an individual-subject's fMRI data decomposes the fMRI data matrix (time points × voxels) into a linear combination of multiple maximally spatially independent components (ICs), of which meaningful ICs can be regarded as brain functional networks. In each network, the voxels with greater Z-scores tend to have higher intra-connectivity (or co-activation) (Du et al., 2018) and can be interpreted as a weighted seed maps (Joel et al., 2011). The mixing matrix in the decomposition includes the time series of the ICs, where each time series reflects the temporal fluctuation of each IC. In addition to less prior knowledge needed in advance, other advantages of ICA relative to the ROI-based method include (1) simultaneous estimation of multiple networks from whole-brain data, (2) overlapping components, which provide a spatial filtering of artifacts (Sochat et al., 2014; Du Y. H. et al., 2016a) or potentially interesting overlapping networks (Xu et al., 2013), and (3) adaptivity of components among subjects, allowing for inter-subject variability in regions to be captured (Allen et al., 2012).

The primary shortcoming of applying ICA on fMRI data is that ICA generates ICs in an arbitrary order. To solve the problem, two strategies are typically adopted in fMRI studies with multiple subjects (Calhoun et al., 2009b) to make ICs of different subjects comparable. The first strategy is to perform ICA for each subject separately, and then establish correspondence of ICs across subjects using methods such as subjective identification (McKeown et al., 1998; Calhoun, 2001), clustering (Moritz et al., 2003; Esposito et al., 2005; De Martino et al., 2007), and automated matching based on reproducibility (Yang et al., 2008). These methods could be sensitive to different source separations in multiple ICA decompositions of different subjects. For instance, one single IC detected for a certain subject may be split into several ICs including smaller active areas with closely related time courses for other subjects (McKeown et al., 1998), making it difficult if not impossible to establish correspondence among ICs of different subjects. The second strategy, often referred to as group ICA, implements one ICA on all subjects' data and then obtains subject-specific ICs from the group-level ICs somehow, which establishes direct correspondence of ICs across different subjects. The fMRI data of multiple subjects are typically grouped in three different ways with distinct hypotheses imposed upon multi-subject fMRI data, including spatial concatenation, temporal concatenation, and tensor organization. The spatial concatenation method concatenates multi-subject fMRI data along the spatial dimension supposing that corresponding ICs of all subjects have common temporal information (Svensén et al., 2002). The more frequently applied temporal concatenation method concatenates the multi-subject fMRI data along the temporal dimension (Calhoun et al., 2001, 2009b; Beckmann et al., 2009), followed by estimation of single-subject maps and time courses using an approach called back-reconstruction which includes PCA-based methods (Calhoun et al., 2001), spatio-temporal (dual)-regression (Beckmann et al., 2009; Erhardt et al., 2011) and group information guided ICA (GIG-ICA) (Du and Fan, 2013; Du et al., 2014a, 2015b; Du Y. H. et al., 2016a). Each of these can be considered as providing a different balance between ensuring matches via a group model and allowing individual subject variability to be captured. GIG-ICA is one of the more flexible approaches and estimates the subject-specific ICs by optimizing the independence measure of multiple ICs for each subject while preserving the correspondence of ICs across different subjects. GIG-ICA has been shown to well represent individual subject maps and provides an improved approach for addressing individual subject artifacts than single-subject ICA followed by group ICA (Du and Fan, 2013; Du Y. H. et al., 2016a). The tensor probabilistic ICA method stacks the original multi-subject fMRI data along a separate third dimension with a hypothesis that different subjects have common group spatial ICs and time courses but subject specific loading parameters (Beckmann and Smith, 2005; Lee et al., 2008).

Independent vector analysis (IVA) is another method which optimizes the independence among each subject's components and the dependence among corresponding components of different subjects. Several advancements of IVA have been made for achieving reliable source separation for linearly dependent Gaussian and non-Gaussian sources (Anderson et al., 2010; Dea et al., 2011; Li et al., 2011; Adali et al., 2014; Anderson M. et al., 2014; Boukouvalas et al., 2015). Among those, IVA-GL, which is a combination of two IVA algorithms, IVA with multivariate Gaussian component vectors (IVA-G) (Anderson et al., 2012) and IVA with multivariate Laplace component vectors (IVA-L) (Lee et al., 2008), provides an attractive tradeoff in terms of complexity and performance. A direct comparison of IVA and GIG-ICA was performed in recent work (Du et al., 2017b) which emphasized the advantages of the two approaches. For sources with slight or moderate inter-subject spatial variability, GIG-ICA obtained components with higher accuracy than IVA. For datasets where all subjects had a subject-unique source with large inter-subject spatial variability, IVA showed better performance in the component/time courses (TC) accuracy of the unique source, although GIG-ICA in general still performed better for other subject-common sources compared to IVA. Therefore, a framework that leverages the strengths of IVA and GIG-ICA is expected to achieve high accuracy for both subject-common and subject-unique networks.

It is also well-acknowledged that another pitfall of data-driven approaches (Calhoun and De Lacy, 2017) is the requirement to select a certain model order (e.g., the number of components in decomposition methods or the number of clusters in clustering methods) that may greatly affect the resulting brain network maps. While employing ICA to extract functional networks, the number of components is typically estimated using information-theoretic principles, such as a modified minimum description length (MDL) criteria (Li et al., 2007). Since different estimation methods result in different numbers of components (Zuo et al., 2010), it is important to consider the impact of the model order. Moreover, it is likely that a single model order is not the best solution, rather one can consider evaluating the impact of a range of model orders which enables a hierarchical evaluation of the brain's spatial organization (Ma et al., 2011; Calhoun and De Lacy, 2017).

It is known that features are required to be comparable across subjects for the purpose of classification or prediction. In decomposition-based methods, how to propagate components (indicating functional networks) to a new subject that is not included in the original training set is an important issue. In the case of applying individual ICA on each subject separately, the obtained components from each coming subject have to be well-matched with the components from the training set using some matching rules so that their used features are consistent. In group ICA framework, there are several ways to do this, one can use spatio-temporal regression to generate spatial and temporal features from new subjects (Erhardt et al., 2011). Another approach is to use spatially constrained ICA (Lin et al., 2010; Du and Fan, 2013; Du et al., 2014b, 2015b; Du Y. H. et al., 2016a). The latter approach is more optimal as individual data sets will have results that are optimized for independence, and will also provide spatial and temporal features that are adapted to each individual subject. A classification study using this framework can be found in Du et al. (2015b).

Functional Network Connectivity Analysis

Functional network connectivity (FNC) (Jafri et al., 2008; Allen et al., 2011) analysis employs a strategy that combines model-driven and data-driven methods. The framework typically includes two steps. It first performs group ICA on fMRI data of multiple subjects, resulting in subject-specific functional networks (indicated by ICs) and their associated fluctuations (reflected by TCs). Then, the connectivity between any two networks can be obtained by computing connectivity measure such as Pearson correlation between their post-processed TCs, resulting in a connectivity matrix including connectivity strengths among all networks. Similar to the ROI-based method, FNC also reflects temporal connectivity among different brain regions. The difference between the ROI-based and FNC method is that a data-driven method is applied to fMRI data in the FNC analysis to generate brain regions that are functionally co-activated (i.e., regions in one network), while in ROI-based method brain regions are usually decided via prior knowledge (e.g., brain atlas) rather than using the in-house fMRI data. Similar to ICA, it is necessary to determine the number of components in advance in the FNC method. FNC approaches typically use, a high model order (e.g., 100 or larger) to provide a more detailed parcellation of the brain.

Other Functional Connectivity Measures

In addition to the typical approaches for assessing functional connectivity (e.g. correlation) other meaningful measurements have also been proposed. For example, the regional homogeneity (ReHo) (Zang et al., 2004) has been proposed to reflect regional functional connectivity (or co-activation) where Kendall's coefficient concordance (KCC) is used to measure the similarity of the time series of a given voxel to those of its nearest neighbors. A similar approach is Cohe-ReHo (Liu et al., 2010) computed based on coherence metrics. Regional connectivity may serve as features for differentiating patients and healthy controls. Moreover, after functional connectivity matrices are obtained from either model-driven or data-driven techniques, graph-theory derived metrics (Liu et al., 2008; Lynall et al., 2010; Yu Q. et al., 2013) such as the averaged node strength, clustering coefficient, global efficiency, and local efficiency (Rubinov and Sporns, 2010) can be calculated. These graph-based measures provide powerful features which integrate across the whole brain and can be used in classifying and predicting individual patients.

Dynamic (Time-Resolved) Functional Connectivity Analyses

All of the above mentioned analysis approaches estimate brain functional connectivity by computing an average of the full time series (e.g., computing Pearson correlation between two ROIs using BOLD signals within 5 or 10 min) and generate a static value to reflect the connection strength. In recent years, there have been much interests in computing time-resolved connectivity measures and successful applications in identifying biomarkers from dynamic connectivity (Chang and Glover, 2010; Sakoglu et al., 2010; Allen et al., 2014; Zalesky et al., 2014; Du et al., 2015a; Sadaghiani et al., 2015; Du Y. H. et al., 2016b). In such analysis, brain functional connectivity can vary within a short period (e.g., tens of seconds) rather than be considered as static over time. Such results tend to further expand the available information, and avoid the strong assumption that brain activity is static over time.

While dynamic functional connectivity (DFC) has emerged as a promising topic in the recent fMRI literature, there are also some critical comments on the theory of dynamic connectivity. Laumann et al. (2017) suggested that correlations measured by resting-state BOLD are relatively stable over short timescales and may not reflect moment-to-moment changes in cognitive content. Though this issue is still not completely settled, many new studies have shown a relationship between behavior, emotion, and cognition during rest with dynamic connectivity features, giving us confidence in its potential utility. In addition, since dynamic connectivity has shown to be a useful tool for identifying biomarkers, we introduce some typical approaches and applications in terms of dynamic connectivity.

Sliding Time-Window Based Dynamic Connectivity Analysis

There are numerous methods which can be used to estimate DFC (Calhoun et al., 2014; Chen J. E. et al., 2017; Preti et al., 2017). The sliding time-window technique (Sakoglu et al., 2010; Hutchison et al., 2013; Hindriks et al., 2016; Shakil et al., 2016) is the most widely used. By assessing functional connectivity in different time-windows, one can easily expand existing static connectivity strategies to be time-resolved. DFC can then be evaluated by measuring functional connectivity among ROIs or voxels in a sliding window yielding multiple connectivity matrices (Du Y. H. et al., 2016b; Du et al., 2017a,c), performing ICA (or IVA) on fMRI data in different windows to generate dynamic spatial network patterns (Kiviniemi et al., 2011), or segmenting time series of networks (i.e., ICs) into short time series and then computing time-varying FNC (Allen et al., 2014). The sliding-window technique has also been applied to evaluate ReHo and brain graph, yielding time-varying ReHo values (Deng et al., 2016) and time-varying graphs (Yu Q. B. et al., 2015; Du Y. H. et al., 2016b).

Dynamic connectivity analyses among brain regions and networks have attracted increasing interests. Various approaches to further investigate the time-varying connectivity patterns is a topic of ongoing work. Different connectivity states, reflecting specific configurations of connected regions, can be revealed by post-hoc analyses of dynamic connectivity (Calhoun et al., 2014; Damaraju et al., 2014; Rashid et al., 2014; Du et al., 2015a, 2017c; Yu Q. B. et al., 2015; Du Y. H. et al., 2016b). Therefore, changes in connectivity states among different clinical populations might provide unique or additional biomarkers of disorders not detectable with SFC measures. Researchers have applied clustering (Allen et al., 2014; Du Y. H. et al., 2016b), principal components analysis (PCA) (Leonardi et al., 2013), Fisher discrimination dictionary learning (FDDL) (Li et al., 2014), and spatial and temporal independent components analysis (ICA) (Yaesoubi et al., 2015b; Miller et al., 2016) to extract connectivity states. These methods typically estimate connectivity states with discrepant patterns due to their different assumptions (Calhoun et al., 2014). Clustering approaches may fail to converge when working on “noisy” data that do not necessarily have desirable distributions. A more serious shortcoming of clustering is that the method always can yield a partition with any given number of clusters, regardless if the used features show patterns indicating clusters. The above mentioned decomposition-based work (Leonardi et al., 2013; Li et al., 2014; Yaesoubi et al., 2015b; Miller et al., 2016) focuses on group-level connectivity states that are common across subjects. One can also use GIG-ICA to estimate connectivity states at both group-level and subject-level (Du et al., 2017a,c). The method first computes the group-level connectivity states by analyzing multiple subjects' dynamic connectivity, and then guided by the group-level states it correspondingly estimates the subject-specific connectivity states that are independent from each other.

There has been considerable work using DFC analyses to investigate impairments in schizophrenia-spectrum and mood disorders (Damaraju et al., 2014; Rashid et al., 2014; Du Y. H. et al., 2016b; Du et al., 2017a,c) as well as classifying individual patients based on DFC measures (Rashid et al., 2016). Damaraju et al. (2014) computed dynamic FNC matrices of healthy controls (HCs) and SZ patients, and then clustered the time-varying FNC into different states, suggesting that states exhibiting cortical-subcortical negative connectivity and strong positive connectivity between sensory networks are those that show the group differences of thalamic hyperconnectivity and sensory hypoconnectivity. Rashid et al. (2014) also analyzed dynamic DFC of SZ patients and BP patients using a clustering method, and found that SZ patients showed more changes than BP subjects, including both hyper and hypo connectivity in one common connectivity state. Du Y. H. et al. (2016b) estimated dynamic connectivity within the default mode network (DMN) of 82 HCs and 82 SZ patients using a ROI-based method, and then applied K-means to extract connectivity states. The results showed that HCs spent more time in a state that reflected stronger connectivity between anterior and posterior brain regions, while SZ patients spent more time in a disconnected state. Another study (Du et al., 2017c) extracted connectivity states from whole-brain ROI-based DFC of 238 HCs, 140 bipolar disorder with psychosis (BPP), 132 schizoaffective disorder (SAD) and 113 SZ patients using GIG-ICA. Results showed that DFC provided more informative measures than the SFC method. Diagnosis-related connectivity states were evident using DFC analysis. For the dominant state consistent across groups, 22 instances of hypoconnectivity (with decreasing trends from HC to BPP to SAD to SZ) mainly involving post-central, frontal and cerebellar cortices as well as 34 examples of hyperconnectivity (with increasing trends from HC to BPP to SAD to SZ) primarily involving thalamus and temporal cortices were found. Interestingly, hypoconnectivities/hyperconnectivities also showed negative/positive correlations, respectively, with clinical symptom scores. Regarding frontal connectivities, BPP resembled HC while SAD and SZ were more similar. Using a similar framework, whole-brain DFC from resting-state fMRI data of 70 HCs, 53 individuals at clinical high-risk (CHR) for psychosis, and 58 early illness schizophrenia (ESZ) patients were utilized to estimate the inherent connectivity states, and then group differences were identified (Du et al., 2017a). The work found widespread connectivity alterations in both CHR and ESZ groups, and ESZ patients generally showed more connectivity differences with larger changes than CHR individuals relative to controls. Inspired by these studies, we believe that changes of connections within states, temporal measures such as dwell time in different states, as well as disease-specific states in dynamic connectivity analysis are able to provide interesting features for classification of diseases in future.

Furthermore, the time-varying patterns in brain activity and their relationships with time-varying brain connectivity are also important for advancing our understanding of brain networks and the underlying mechanism of brain dynamics. A recent study (Fu et al., 2017) developed a framework based on the sliding window approach for characterizing time-varying brain activity and exploring its associations with time-varying brain connectivity. This framework was applied to a resting-state fMRI dataset including 151 SZ patients and 163 age- and gender-matched HCs, suggesting that amplitude of low frequency fluctuation (ALFF) and FNC were correlated along time and these relationships are significantly changed in SZ.

Windowless Methods for Extracting Dynamic Connectivity

The above mentioned sliding time-window methods have been extensively used and are successful to estimate dynamic connectivity. However, there is an apparent limit in lacking standards for setting the window length, although previous studies have suggested 30–60 s of window length that are feasible in capturing DFC (Zalesky and Breakspear, 2015). If the window length is too short, the time points in each window could be too few to generate robust estimation of connectivity strengths. In contrast, long window length might decrease the temporal variations of functional connectivity, consequently hindering from detecting effective connectivity states.

Several windowless-based methods have been proposed to avoid the problem in selecting the window length. The recently proposed time-frequency analysis (Yaesoubi et al., 2015a) explored the connectivity by using multiple frequencies, which can be conceptually seen as adapting the observation window to the frequency content of the original time courses. Bayesian approach (Robinson et al., 2015; Taghia et al., 2017) has also been employed to study dynamic connectivity, which regards extracting time-varying functional networks as selecting dynamic models in the Bayesian setting. More recently, a new approach (Yaesoubi et al., 2018) was proposed to estimate DFC with the main advantage of capturing connectivity with arbitrary rates of change. In the approaches based on windowing operation, observable rate of change is driven by the length of the window, but in this approach there is no requirement for a windowing operation.

Classification or Prediction Strategies

Brain disorders cause serious impairments or debilitating behavior and represent a major health and financial burden globally (Vigo et al., 2016). In the United States, brain disorders (such as the symptoms, diagnosis, and treatments) are typically defined using the Diagnostic and Statistical Manual (DSM) (American Psychiatric Association, 2013). There are also some alternatives offer standard criteria for the classification of brain disorders, such as ICD-10 Classification of Mental and Behavioral Disorders, produced by the world health organization (WHO). However, over the years new knowledge is continuously added, resulting in changes in the diagnosis and disease classification (e.g., some are not valid, some are changed, and new ones appear). In addition, many mental illnesses are diagnosed based largely on symptoms, rather than biological criteria. More recently, there has been a focus on the importance of looking across disorders and also on continuous measures of assessment in both health and disease, e.g., the research domain criterion (RDoC) (Insel and Cuthbert, 2015). In this context, there has been an increasing trend to identify biological markers. Brain functional connectivity has been of great interest in the search for markers of numerous brain disorders. In the following, we will review some commonly used feature selection and classification (or prediction) strategies in fMRI functional connectivity based brain disorder studies. Several key aspects of feature selection methods and classifiers are compared and their promise and pitfall are discussed.

Feature Selection Strategies

The properties of fMRI data make feature selection especially important in the classification and prediction (Van Schooten et al., 2014). The dimension of functional connectivity is large even if ones only evaluate connectivity between defined ROIs. If the functional connectivity is calculated between voxels, the number of features will go up (potentially millions of features). Functional connectivity relating to a specific brain disorder often focuses on a small portion of all possible connections/associations. In that case, if all functional connections are used as features in a classifier, it would cause an overfitting problem since algorithm tries to fit the classifier to every feature even the irrelevant ones. If the classifier variables are overfitted to the training samples, they might work poorly on the samples not in the training sets, resulting in unsatisfied performance in classification. Another problem is that functional connectivity might provide substantial redundant information for classification. Using all connections as features with redundant information might be detrimental to the results of classification. Considering this, it is important to incorporate good feature selection strategies to identify appropriate functional connectivity features for the classification of brain disorders. Table 1 summarize the properties of different feature selection methods.

TABLE 1
www.frontiersin.org

Table 1. Summary of the feature selection methods.

Filter Methods

A widely applied feature selection strategy is filter-based method, where feature selection is independent from classifier/model building (Guyon and Elisseeff, 2003). They use the general characteristics of dataset and assign proxy measures to features from which a number of features with top scores are selected. A good filter method is sensitive to the discretionary power so as to suppress the least interesting features. The most popular filter method is to use group-level statistical tests. Generally, functional connectivity with group difference are first identified using different statistical tests such as t-test, Welch's t-test and ranksum-test and then these functional connections are used as input features of classification approaches (Calhoun et al., 2008; Anderson et al., 2011; Bassett et al., 2012; Du et al., 2012; Arbabshirani et al., 2013; Fekete et al., 2013; Guo H. et al., 2014; Dyrba et al., 2015). A major problem with this strategy is that group difference is sometimes investigated using whole data (Arbabshirani et al., 2017). That is, the label information for testing samples is used for feature selection, which will result in biased classification results. Another issue is that features are often selected based on their p-values. However, functional connectivities which show small p-values for group comparisons do not necessarily reflect those with the largest discrimination power. One previous study in our group has shown that features can have different distributions but comparable group means for different cohorts (Arbabshirani et al., 2017). This type of features might have a large p-value of statistical tests but good classification performance. There are also other filter methods used in the classification of brain disorders. Fisher score is a univariate feature selection algorithm which has been applied to determine the discriminatory power of features between two groups with equal probability (Gu et al., 2012; Khazaee et al., 2015). Correlation-based feature selection (CFS) is a simple algorithm which ranks features based on a hypothesis that good feature subsets contain features highly correlated with the classification (Hall, 1999; Shen et al., 2010; Tang et al., 2012; Su et al., 2013; Challis et al., 2015). RELIEF based algorithms are another large family of filter methods which estimate the scores of features according to how well their values distinguish between instances (Kira and Rendell, 1992). These methods are not dependent on heuristics, run in low-order polynomial time, and are noise-tolerant and robust to feature interactions, as well as being applicable for binary or continuous data (Kira and Rendell, 1992). The minimum redundancy, maximum relevance (mRMR) algorithm has also been used for the feature selection (Lord et al., 2012). This method uses each feature's predictive power and the mutual information between features to rank the most relevant features. mRMR can achieve satisfactory results compared with an exhaustive search, without the increase in time cost for ordering the feature list. The major advantages of filter methods are their effectiveness in computation time and their robustness to overfitting (Hamon, 2013). However, filter methods also have several drawbacks. First, the features selected by filter methods are not optimized to suit any specific classifier. Secondly, some of the filter methods tend to select redundant features since they ignore the relationships between features.

Wrapper Methods and Embedded Methods

Wrapper methods, which involve optimizing classifiers as part of the feature selection, have also been used in the classification (Guyon and Elisseeff, 2003; Fan et al., 2011; Venkataraman et al., 2012; Yu Y. et al., 2013b). Generally, wrapper methods use classifiers or predictive model to rank features. This class of methods evaluates the classification performance of different combinations of features and tries to identify the optimal subset of features that can provide the largest discriminatory power. Since the number of possible feature combinations grows exponentially as the number of features increase, customizable heuristics and termination-conditions are typically employed in wrapper methods to avoid that the selection of features is beyond a computer's processing power. Various wrapper methods have been employed in the brain disorders classification studies. Recursive feature elimination (RFE) is the most popular used wrapper method which selects features by recursively considering smaller and smaller combinations of features (Castro et al., 2011, 2014; Ladha and Deepa, 2011; Colby et al., 2012; Dai D. et al., 2012; Du et al., 2015b). This algorithm trains classifiers using the initial set of features and ranks the features according to their importance. The least important features are then discarded and the procedure is recursively repeated using the remaining features until a pre-desired number of features is select. Another widely used wrapper method is the genetic algorithm (GA) family, which uses binary encoding and specific mutation for feature selections (Yang and Honavar, 1998). Initially, binary encoded subsets of predictors (a feature is either included or not in the subset) are created and their corresponding fitness values, such as classification accuracy, are calculated. The encoded subsets then undergo cross-over and are subject to random mutations. This process is repeated again and again to create better subsets of predictors. Wrapper methods tend to select better performing features than filter methods and can provide the best feature selections specific for a particular type of classifier. However, wrapper methods also have two major shortcomings. First, wrapper methods might overfit if the number of observations is not large. And secondly, wrapper methods are computationally much more expensive since they need to create classifiers recursively.

Embedded methods, which combine classification and feature selection into the decision process, have also been applied to classification (Lal et al., 2006). Embedded methods are similar to wrapper methods since both of them incorporate feature selection into the classifier construction process. However, wrapper methods use a learning machine to measure the quality of subsets of features without incorporating knowledge about the specific structure of the classification or regression function; therefore they can combine with any learning machine. In embedded methods, the learning part and the feature selection part cannot be separated. An intrinsic model building metric is used during the learning process for embedded methods in which the feature selections are specific to given learning machines. A common category of embedded methods is using a regularization penalty to enforce the sparsity of features in order to identify features with more discriminatory power. The most popular embedded method with regularization penalty is the least absolute shrinkage and selection operator (LASSO) method (Tibshirani, 1996; Jie et al., 2014; Watanabe et al., 2014; Rosa et al., 2015; Fonti and Belitser, 2017). The LASSO method builds a linear model and penalizes the regression weights using L1 penalty. Amount of weights are shrunk to zero and those features with non-zero weights are selected finally. Ridge regression is another embedded method used for the feature selection (Yu and Liu, 2003; Ng, 2004). Similar to LASSO method, ridge regression shrinks the regression weights by incorporating a penalty. However, the ridge penalty behaves differently than LASSO penalty. The ridge penalty would be more likely to select features with high correlations than the LASSO penalty and tend to provide better classification performance. The elastic net algorithm is an extension of LASSO (Zou and Hastie, 2005; Gheiratmand et al., 2017; Teipel et al., 2017). It overcomes LASSO limitations on the feature number selections and the stabilization of feature selection by using a combination of LASSO and ridge regression methods. Since embedded methods select features specific to the classifiers, they are much faster and less computationally expensive.

Classification and Prediction Models

Traditional Classifiers

A wide range of classifiers has been applied in the classification of brain disorders. Support vector machine (SVM) is so far the most popular method (Lord et al., 2012; Anderson and Cohen, 2013; Yu Y. et al., 2013b; Watanabe et al., 2014; Du et al., 2015b; Dyrba et al., 2015; Khazaee et al., 2015; Liu et al., 2015; Sacchet et al., 2015; Cabral et al., 2016). SVM is a type of supervised learning classifier with learning algorithms used for classification and regression (Cortes and Vapnik, 1995b). Standard SVM is a binary classifier which generalizes the optimally separating hyperplane to better separate different groups of data. The basic idea of SVM is to find an observation of one class which is closest to an observation from the other class. The hyperplane is drawn in a way that maximizes the distance between these observations so that the hyperplane can separate the observations into different sides. Since a “slack variable” is used in the SVM classifier, SVM allows overlaps between different groups. There is no assumption needed for the SVM classifier, making it a very flexible method. However, it is also hard to interpret the results from SVM compared with the other traditional classifiers. The original SVM classifier is a linear classifier. By incorporating the different kernel functions to maximum-margin hyperplanes, SVM can become non-linear classifiers. The kernel functions transfer the original features space to a higher-dimensional feature space so that the algorithm can fit the maximum-margin hyperplane in a new feature space. Several common kernels are widely used in SVM, such as polynomial kernel, sigmoid kernel, and Gaussian RBF kernel. The choice of kernel is crucial for building a successful SVM-based classifier. Different types of the kernel will be suitable for different studies depending on the characteristics of features. SVM with different kernels will have different hyperparameters needed to be optimized. For example, SVM with linear kernel has only one hyperparameter to be adjusted which is called soft margin. In addition, SVM approaches using non-linear kernels have one or more additional hyperparameters to be tuned. The optimization of hyperparameters is usually based on a grid search over pre-provided candidate values. It is very important in SVM as these parameters significantly influence classification performance and accuracy.

Linear discriminant analysis (LDA) is another widely used classifier (Dai Z. et al., 2012; Cetin et al., 2016; De Marco et al., 2017; Qureshi et al., 2017a; Wang et al., 2017), which projects features into a lower-dimensional space in which different groups of data can be maximally separately (Altman et al., 1994). LDA is a generalization of Fisher's linear discriminant and is based on the concept of searching for a linear combination of features that separate two groups (Mika et al., 1999). LDA explains the group labels by the values of continuous independent variables. By projecting the data into a lower-dimensional space, LDA can avoid the overfitting problem and reduce the overall computational costs. LDA is very similar to principal component analysis (PCA). PCA is used for finding the axes that maximize the variance of data while LDA is used find finding the axes that maximize the separation between multiple groups. LDA also has two major limitations. First, LDA requires the assumption of a common covariance structure in the groups of data, which is very rare in real applications. Second, although LDA can be used for multi-class classification problem, it is more suited to the two-class problem.

Deep Learning Classifiers

Deep learning methods have attracted increasing interesting in various areas and also have been applied in the classification of brain disorders (Plis et al., 2014; Iidaka, 2015; Lecun et al., 2015; Calhoun and Sui, 2016; Hu et al., 2016; Kim et al., 2016; Han et al., 2017; Jang et al., 2017; Ju et al., 2017). In contrast to traditional machine learning methods, deep learning methods are capable of learning the optimal representation directly from the raw data through using a hierarchical structure with different levels of complexity (Lecun et al., 2015; Schmidhuber, 2015; Vieira et al., 2017). Deep learning methods apply non-linear transformations to the raw data, and the transformations provide hidden features with higher levels of abstraction, which will be with more informatics to the original input data space at the lower levels. This advantage not only helps to automatically solve difficulties in the feature selections, especially when the dimension of features is too large or when there is limited prior knowledge about the data, but also can improve classification performance compared with a traditional classifier.

The artificial neural network (ANN) is popular in the classification of patients using fMRI data (Guo H. et al., 2014; Kim et al., 2016). ANN learns to do tasks from examples by constructing layers with artificial neurons and connections between them. For example, in brain disorder classification, it learns to identify individuals with brain disorder by analyzing training subjects which are labeled as healthy or disorder and using this information to classify other individuals. An auto-encoder is a type of ANN popular used for the brain disorders classifications (Kim et al., 2016; Guo X. et al., 2017; Ju et al., 2017). This method comprises two stages. The first stage is encoding, which maps the input to a hidden representation. The second stage is decoding, which maps hidden representation back to obtain the output that is as close to the input as possible. By imposing sparsity on the hidden layers during training, an auto-encoder can learn useful structures from the input data. This allows sparse representations of inputs, which are useful in pre-training for classification tasks. Deep belief network (DBN) is another class of ANN been used in the classification of brain disorders using fMRI data (Farzi et al.), which is composed of multiple layers of latent variables and the connections between them (Hinton, 2009). A DBN is somewhat unique in that it allows undirected connections between some layers, called restricted Boltzmann machines (RBM) (Hjelm et al., 2014). DBN usually trains these layers using an unsupervised learning algorithm such as the gradient descent algorithm. Therefore, instead of using deterministic functions and the reconstruction error (like the auto-encoder), DBN is pre-trained using maximum-likelihood estimation (Vieira et al., 2017).

Several critical issues challenge the using of deep learning in classification (Schmidhuber, 2015; Vieira et al., 2017). The first challenge is the amount of time and computational resources. The number of layers, nodes and the function of each node are usually manually determined, although some automated optimization strategies have been proposed. A large number of parameters needs to be estimated in the deep learning methods, which makes them cost much more computational resources. A second challenge is the potential overfitting problem when using deep learning methods. Since the feature dimension of fMRI data is usually very large while the number of samples is relatively small, deep learning methods will tend to learn features in the data which are specific or limited to the study. Although there are several approaches developed to address this problem, such as regularization strategies and pre-selection of features (i.e., reducing the dimensionality of feature input), these approaches also introduce other critical problems, such as how to induce appropriate sparsity and how to select the best subset of features. The third challenge is the interpretability of results obtained from deep learning methods. The deep learning methods are often treated as a black box, which use consecutive non-linear transformations on the raw features to map them to another space with higher levels of abstraction. Although the model information, such as the node in the hidden layers and the connection between them, has been demonstrated to be useful for distinguishing brain disorders, it is difficult to back-construct them to the original feature space, which will result in problems of interpreting the results. Because of these issues, a deep learning method might work well in the classification of a brain disorder but does not provide any information about the underlying neuroanatomical or neurofunctional alterations. That would be of limited clinical utility (Vieira et al., 2017). Although these issues remain unsolved, deep learning methods are still with a great potential to improve the diagnosis of brain disorders and could be promising tools for advancing the knowledge of disrupted brain cognitive functions in brain disorders.

A summary of the properties of different classifier models can be found in Table 2.

TABLE 2
www.frontiersin.org

Table 2. Summary of traditional classifiers and deep learning classifiers.

Binary Classification to Multi-Class Classification

In the context of the classification of brain disorders, the majority of the conventional studies have just focused on binary classification, in which only the comparison between patients and healthy controls was taken into account. However, from the clinical perspective, it would be more critical to identify and develop biomarkers to differentiate different brain disorders which share similar symptoms. It is also important to separate patients into different sub-groups according to the different stages of brain disorder progression. Therefore, the multi-class classification problem can be a more significant issue for real clinical utility. During the recent decade, increasing brain disorder studies have drawn their attention to multi-class classification. Since most of the traditional classifiers, such as SVM and LDA, were originally designed for binary classification problem (Cortes and Vapnik, 1995a; Mika et al., 1999), many strategies have been developed to make the traditional classifiers work for multi-class classification problems. The most commonly used strategy is to transform multi-class classification problem to binary classification problem. This strategy includes two different techniques, one-against-one and one-against-whole (Nasrabadi, 2007). The former builds binary classifiers for all pairs of groups and uses a voting scheme to make the final decision. The latter one trains a single classifier for each class (against other classes) and generate a real-value confidence score for the final decision. Although this strategy accompanied with traditional classifiers has been widely applied in numerous neuroimaging classification studies, such problem transformation is still controversial. Some other approaches have also been proposed (Hsu and Lin, 2002; Fei and Liu, 2006), but none of them have been applied to any multi-class brain disorder studies (Kumar and Gopal, 2011; Vieira et al., 2017). Compared with the traditional classifiers, deep learning classifiers are more suitable for multi-class comparison because the application of these classifiers on multi-class problems is more straightforward. In the output layer, deep learning classifiers use a softmax activation function, which can be derived by extending simple logistic regression, to represent a categorical distribution instead of group labels. In that case, the probabilities of each input feature belonging to a class are obtained from the output layer, providing a more intuitive index of multi-class membership those sophisticated indices generated from traditional classifiers (Vieira et al., 2017). Nowadays, there is a growing trend toward using deep learning classifiers to separate different brain disorders or brain disorder subtypes, or to diagnose the progression of brain disorder.

Applications Using Brain Functional Connectivity in the Classification of Brain Disorders

During the period from 1990 to 2017, more than 200 papers used functional connectivity features alone or multi-modality features including functional connectivity to classify or predict brain disorders. In this section, we primarily focus on studies working on classifying patients with a brain disorder from healthy controls (i.e., a binary classification problem), and also include some work distinguishing multiple different disorders (i.e., a multi-class classification problem). We mainly summarize studies relating to schizophrenia, bipolar disorder, autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), Alzheimer's disease (AD) and mild cognitive impairment (MCI), some of which share very similar symptoms and common changes in the brain that can confound diagnosis, such as SZ vs. BP, ASD vs. ADHD, and AD vs. MCI. Although other brain disorders such as depression also deserve review in the future, our primary goal here is to provide an overview on how far brain functional connectivity features have been used to classify brain disorders and how well the classification frameworks have worked. Figure 2 and Tables 36 present a summary of the existing application studies that reported their classification accuracy. Regarding the performance, the average classification accuracy is around 80% for those studies, with AD/MCI related studies showing the highest accuracy. In these applications, there are trends from using connectivity features alone (e.g., spatial maps of ICA and functional connectivity) to using complex network properties (e.g., graph-theory based measures); from using static connectivity measures to using dynamic connectivity measures; from using features from single imaging modality to using features from multiple modalities; from using traditional classifiers to using more complex deep learning classifiers; and from classifying patients from healthy controls to classifying multiple groups. In each of the following subsections, we focus on some typical works in more detail to highlight these potential trends. If there are both binary and multi-class classification works, we will describe binary classification studies first. Similarly, we try to first state studies using simple features or classifiers and then that using more complex features or classifiers.

FIGURE 2
www.frontiersin.org

Figure 2. Summary of the existing application studies (included in Tables 16). (A) Total number of papers for 2-year intervals for each disease type. The legend shows the color code for each disease type. This legend also applies to subfigure (B,D). (B) Scatter plot of the reported classification accuracy vs. the total sample size. In the subfigure (B), square shape indicates study using features from one modality, while circle shape represent study using features from multiple modalities. (C) Histogram of the sample sizes (including all patients and healthy controls) of the surveyed studies. Vertical dashed lines indicate mean (red) and median (blue) of the sample size among all studies. (D) Disorder specific boxplot plots of reported classification accuracies of the surveyed papers. For each disease type, the accuracies in different studies are shown using a boxplot. Green shape means a 95% confidence interval for the mean while orange shape means standard deviation.

TABLE 3
www.frontiersin.org

Table 3. Summary of functional connectivity based SZ/BP classification studies.

TABLE 4
www.frontiersin.org

Table 4. Summary of functional connectivity based AD/MCI classification studies.

TABLE 5
www.frontiersin.org

Table 5. Summary of functional connectivity based ASD classification studies.

TABLE 6
www.frontiersin.org

Table 6. Summary of functional connectivity-based ADHD classification studies.

Schizophrenia and Bipolar Disorder

Schizophrenia is a severe chronic brain disorder whose symptoms can include delusions, disorganized thinking, hallucinations and social withdrawal (Endicott and Spitzer, 1978; Kay et al., 1987; Calhoun et al., 2008; Fu et al., 2017). Although schizophrenia only affects about 1% of the population worldwide (Bhugra, 2005; Van Os et al., 2010), the symptoms can be very disabling. The symptoms of schizophrenia are categorized into three types: positive, negative and cognitive, and these symptoms usually start in young adulthood and last a long time (American Psychiatric Association, 2013). Bipolar disorder is a mood disorder marked by alternating episodes of mania and depression. Bipolar disorder includes four basic subtypes and all of them involve clear changes in mood, energy, and activity levels (https://www.nimh.nih.gov/health/topics/bipolar-disorder/index.shtml). The root causes of bipolar disorder are not clearly understood, although it is known that both environmental and genetic factors are involved. There is no standard clinical test for either schizophrenia or bipolar disorder. Therefore, it is important to investigate the possibility of using neuroimaging data in the automatic diagnosis of these two brain disorders.

Many studies have focused on distinguishing SZ and HC based on the fMRI functional connectivity. ICA based spatial map is one of the most popular used functional features in the classification (Demirci et al., 2008; Arribas et al., 2010; Castro et al., 2011; Du et al., 2012). For example, Du et al. used ICA to extract individual spatial maps as the initial features and then combined a two-level feature identification scheme with kernel principal component analysis (KPCA) and Fisher's linear discriminant analysis (FLD) in the classification of SZ (Du et al., 2012). By using a majority vote methods that use multiple features, they achieved a classification accuracy of 98% in the auditory oddball task and 93% in the resting-state. The connectivity between identified networks (i.e., FNC) is another important feature for the classification (Anderson and Cohen, 2013; Arbabshirani et al., 2013; Kaufmann et al., 2015). Functional connectivity between ROIs defined by different atlases (i.e., ROI-based) is also commonly used to classify SZs and HCs (Venkataraman et al., 2012; Su et al., 2013; Yu Y. et al., 2013a,b; Watanabe et al., 2014; Kim et al., 2016). Automated anatomical labeling (AAL) atlas is the most popular atlas using in the classification, although some other atlases are also used. Besides these straightforward connectivity features (component spatial maps and functional connectivity), high-level network organization has also been considered as important biomarkers. Bassett et al. (2012) used the size of connected components in graphs build from functional connectivity among time-courses for 90 AAL regions as the input features of SVM and achieved up to 75% classification accuracy and 85% sensitivity. Studies also combined functional connectivity with other features from other modalities to distinguish SZ and HC. Yang et al. proposed a hybrid machine learning method to classify SZs and HCs, using features from fMRI and single nucleotide polymorphism (SNP) data (Yang et al., 2010). They combined three models (SNPs, voxels in the fMRI map contributing to classification and network maps from ICA) into a single module using a majority voting approach to make a final decision. Through a leave-one-out cross-validation, they demonstrated that this framework can provide higher classification accuracy (Combined: 87%; SNP: 74%, voxel: 83%, ICA: 83%). In the 24th Machine Learning for Signal Processing competition (MLSP) (Silva et al., 2014), participants were asked to automatically differentiate 69 schizophrenia patients from 75 healthy controls using multimodal features, including FNC features from fMRI data and component loadings using ICA from structural MRI data. Performance was estimated using the area under the receiver operating characteristic curve (AUC). No entry was able to attain an overall AUC of 0.9 or higher, and the median AUC is near 0.75 across all 2087 entries. The winning team got an overall AUC of 0.89 by means of a Gaussian process (GP) classifier with prior distribution scaled by a probit transformation. Temporal dynamics in the functional connectivity are widely observed in numerous neuroimaging studies and are suggested to be neural origin. Cetin et al. (2016) used static FNC and dynamic FNC obtained from fMRI and MEG data to differentiate schizophrenia patients from healthy controls. They used a leave-one-out cross validation method to examine the classification accuracy. Their results showed that using the combined fMRI and MEG features from FNC improved the classification performance (in which the highest accuracy is 85.71%) compared to using fMRI and MEG FNC features separately (in which the highest accuracy is 75.82%), and using the combined fMRI and MEG features from dynamic FNC improved more (in which the highest accuracy is 90.11%). Increasing studies have demonstrated the benefits of using deep learning in the classification during recent years. Kim et al. (2016) used a L1-norm regularization for feature selection and a deep neural network (DNN) with multiple hidden layers as the classifier. Their results showed that the DNN can obtain about 86% accuracy of two-group classification which is much better than that obtained by SVM.

Functional connectivity-based features for classification of SZ and BP patients at the individual level have been studied as well (Calhoun et al., 2008; Arribas et al., 2010; Rashid et al., 2016). In a previous study (Calhoun et al., 2008), the distance to mean image for each group is constructed using ICA spatial maps of the temporal lobe and the default mode networks. This feature was used in a leave-one-out cross-validation framework, and the approach classified schizophrenia and bipolar patients at the individual level with the accuracy of around 83–95%. A supervised method for automatic classification of healthy controls, patients with bipolar disorder, and patients with schizophrenia using brain imaging data was proposed in Arribas et al. (2010). The spatial maps of independent components were used as the features and a dimension reduction stage comprising two steps is performed (1. t-test; 2. singular value decomposition). The reduced features were then used as input of a probabilistic Bayesian classifiers classifier. The experimental results showed that the average three-way correct classification rate (CCR) is in the range of 70–72%, demonstrating their proposed method to be a reliable framework on classification analyses of both schizophrenia and bipolar disorder patients. More recently, time-varying patterns in the functional connectivity have been used to distinguish SZ from BP patients. Rashid et al. proposed a framework for classification of schizophrenia, bipolar and healthy subjects based on their static and dynamic FNC (Rashid et al., 2016). The classification performance between static and dynamic connectivity features was compared through a cross-validation framework. The overall results showed that dynamic FNC (with the classification accuracy 84.28%) significantly outperforms static FNC (with the classification accuracy 59.12%) in terms of predictive accuracy, suggesting that dynamic patterns in functional connectivity might provide distinct and more information over the SFC.

SZ, SAD, and BPP have overlapping clinical symptoms (Cosgrove and Suppes, 2013; Cardno and Owen, 2014; Pearlson et al., 2016), hence it is very difficult to distinguish them in clinical diagnosis. Du et al. has identified markers from subject-specific brain networks using resting-state fMRI data via GIG-ICA, and then classified healthy controls, SZ patients, BPP patients, patients suffering from schizoaffective disorder with manic episodes (SADM) disorders, and patients suffering from schizoaffective disorder with depressive episodes exclusively (SADD) (Du et al., 2015b). Using the training set, the spatial maps of the typical functional networks were used as the features in a multi-class (five-class) SVM classifier and the RFE was employed for feature selection. For each subject of the testing set, subject-specific networks were computed under the guidance of the group-level networks obtained from the training set, and then the corresponding features were inputted to the classifier trained using the original samples. Results showed that the discriminative regions mainly included frontal, parietal, precuneus, cingulate, supplementary motor, cerebellar, insula and supramarginal cortices, and these regions can provide 68.75% classification accuracy for the new coming subjects (i.e., the independent testing set). Based on measures from functional networks, hierarchical clustering and projection approaches were performed to further investigate the relationship among those groups. Interestingly, the linkage result from the hierarchical clustering showed that using network measures, SADM group and SADD group were closest to each other; SAD group was more similar to SZ group compared to other groups; and BP group was closer to HC group than other patients groups. These results provide an interesting view on the relationship among these symptom-related diseases in addition to accurate separation. The framework and results of this study (Du et al., 2015b) are shown in Figures 3, 4, respectively.

FIGURE 3
www.frontiersin.org

Figure 3. Flowchart of one study (Du et al., 2015b) that includes classifying HCs, SZ patients, BPP patients, SADM patients, and SADD patients. The spatial network maps of the training set computed from GIG-ICA were used as the features in a multiclass (five-class) SVM classifier, that yielded 68.75% classification accuracy for the new coming subjects. The figure is reused with permission from Du et al. (2015b).

FIGURE 4
www.frontiersin.org

Figure 4. Relationship between those original subjects evaluated using network measures in the study of Du et al. (2015b). (A) Distance matrix computed using the feature vectors of 93 subjects. The x-axis and y-axis denote subject ID. Subjects with ID 1–20 are HCs, subjects with ID 21–40 are SZ patients, subjects with ID 41–60 are BP patients, subjects with ID 61–80 are SADM patients, and subjects with ID 81–93 are SADD patients. (B) The mean distance matrix obtained by averaging the values in each inter-group and intra-group related sub-block of the distance matrix. (C) The projection results of 93 subjects using t-distributed stochastic neighbor embedding (t-SNE) method. Each point denotes one subject, and different colors denote different groups. Each ellipse reflects mean (center) and standard deviation for one group. (D) The linkage results from the hierarchical clustering method. The x-axis denotes the subject ID, which is as same as that in (A). In (D), “HC” denotes that most of the subjects clustered into the related group are healthy controls. “SZ,” “BP,” “SADM,” and “SADD” have similar meanings. The figure is reused with permission from Du et al. (2015b).

Autism Spectrum Disorder and Attention Deficit Hyperactivity Disorder

ASD is a complex neurodevelopmental disorder characterized by a wide range of symptoms, skills, and levels of disability that affects how a person acts and interacts with others, communicates, and learns (American Psychiatric Association, 2013). This disorder begins early in childhood and lasts throughout one's life. It is estimated that ASD has a prevalence of 1:68 in the United States (Autism and Developmental Disabilities Monitoring Network Surveillance Year 2008 Principal Investigators; Centers for Disease Control Prevention, 2012) and the lifetime costs of treating an American with ASD has exceeded one million dollars (Greenspan, 2015). The exact cause of autism is still unknown and it might be caused by genetic, brain structure and function, developmental and environmental factors (Wing, 1996). Effective treatments and services can moderate the symptoms and improve the lives. However, ASD is a heterogeneous condition which means there is no same profile for the individuals with ASD and their specific symptoms may change with development (Lord et al., 2000). Consequently, the diagnosis and definition of ASD is still a challenging issue. It is common that children are diagnosed with ASD until ages five and six when is too late for effective treatments. ADHD is another commonly found brain disorder affecting children which share overlapping and confusing symptoms with ASD (Anckarsäter et al., 2006; Happé et al., 2006; Rommelse et al., 2010). Children with ADHD may be inattention, hyperactivity or impulsivity that interferes with school and home life. ADHD is more common in boys than in girls and is usually diagnosed during the early school years and last into adulthood. It is estimated that 3–10% of school-aged children are affected by the ADHD (Biederman, 2005; Dey et al., 2014). The cause of ADHD is still unclear and researchers demonstrate that several things, such as heredity, chemical imbalance, brain changes or injury, and poor nutrition might be involved as possible causes. Currently, a diagnosis of ADHD is mainly based on the behavioral symptoms described in DSM (American Psychiatric Association, 2013). However, DSM can be misleading since there is no valid test for ADHD and ADHD has a high rate of comorbidity, which can confuse matters. Due to the difficulty in diagnosis of ASD and ADHD, an increasing number of studies are using neuroimaging data to develop approaches to try to better characterize and predict these brain disorders. In the following, we review studies using functional connectivity features in the classification of ASD and ADHD.

Studies using functional connectivity as features to classify ASD began around 2011. Anderson et al. calculated the functional connectivity from 7266 ROI covering gray matter during the resting-state and then used these as the features in a thresholding leave-one-out classifier (Anderson et al., 2011). The classifier performed at 89% accuracy for the subjects < 20 years age and at 79% for all subjects. In another study, Murdaugh et al. used seed-based functional connectivity (seed: medial prefrontal cortex, posterior cingulate cortex and angular gyrus) as well as whole-brain functional connectivity in a logistic regression classifier for distinguishing ASD from controls and found that both whole-brain and seed-based connectivity patterns can achieve accuracy up to 96.3% (Murdaugh et al., 2012). The Autism Brain Imaging Data Exchange (ABIDE) initiative has aggregated functional and structural brain imaging data collected from laboratories around the world to accelerate the understanding of the neural bases of ASD (http://fcon_1000.projects.nitrc.org/indi/abide/)(Di Martino et al., 2014, 2017). Plitt et al. used 178 age and IQ matched cohorts from ABIDE and calculated the functional connectivity between three different ROI sets. They used RFE for feature selection in both logistic regression and SVM classifier and obtained an overall 76.7% accuracy of classification (Plitt et al., 2015). Functional connectivity is also combined with the features from other modalities in the classification of ASD. Deshpande et al. identified 18 activated regions from an experiment involving physical and intentional causality and calculated causal connectivity weights, functional connectivity from fMRI, and fractional anisotropy obtained from DTI data for each participant (Deshpande et al., 2013). These features were used in a recursive cluster elimination based SVM classifier and finally achieved a maximum classification accuracy of 95.9%. Deep learning classifiers are applied in the classification of ASD during recent years. Iidaka selected more subjects from ABIDE (312 subjects with ASD and 328 control subjects) and the resting-state functional connectivity between 90 ROIs are used as input of the probabilistic neural network (PNN) for classification. PNN obtained classification results of ~90% accuracy (Iidaka, 2015). Chen et al. constructed functional network between signals in different frequency bands using ABIDE dataset and showed that the most of the discriminative features were concentrated on the Slow-4 band (0.027–0.073 Hz) (Chen H. et al., 2016).

There has also been a fair amount of work using functional connectivity to classify ADHD and healthy controls. Zhu et al. (2008) first used ReHo from fMRI in a PCA-based Fisher discriminative analysis (PC-FDA) to build a linear classifier and the results showed a classification accuracy of 85% using a leave-one-out cross-validation. Wang et al. (2013) extracted ReHo from resting-state fMRI signals and used as input of SVM. They selected features according to a cross-validation procedure and showed that the optimized model produced a total accuracy of 80%. Graph-based measures of functional connectivity are becoming important features that distinguish ADHD from healthy controls (Fair et al., 2013; Dey et al., 2014). Fair et al. used node strength based on the functional connectivity network to successfully classify two subtypes of ADHD (Combined (ADHD-C) and Inattentive (ADHD-I)) from healthy controls with accuracy up to 82.7% (Fair et al., 2013). This graphical measure is also able to separate three groups of cohorts with an overall accuracy of 69.2% in the 3-group classification. Existing studies also use functional connectivity measures along with other fMRI features or other modal features to classify ADHD (Colby et al., 2012; Dai D. et al., 2012; Sato et al., 2012; Anderson A. et al., 2014). For example, Colby et al. combined morphological measures from structural MRI and functional features such as functional connectivity and graphical measures from fMRI as the input features of the SVM and used RFE algorithm for the feature selection. They were able to classify the diagnosis of ADHD with 55% accuracy using this SVM-RFE classifier (Colby et al., 2012). Anderson et al. used functional connectivity measures along with many other features such as curvature index, folding index, Gaussian curvature, gray matter volume, mean curvature, surface area, thickness average, and phenotypic data in a multimodal neuroimaging framework and obtained 66.8% accuracy of two-group classification in an ADHD dataset with a large number of subjects (472 healthy controls and 276 ADHD) (Anderson A. et al., 2014).

Studies have shown that ASD and ADHD have both shared and disorder-specific abnormalities in brain function (Christakou et al., 2013; Chantiluke et al., 2014). However, few studies have used functional connectivity features to distinguish ASD and ADHD and it is still a challenging issue whether functional connectivity can be a powerful biomarker for distinguishing these two brain disorders.

Alzheimer's Disease and Mild Cognitive Impairment

MCI is a syndrome which causes greater memory loss than expected by aging (Gauthier et al., 2006). It is reported that about 3–19% of adults older than 65 years suffer MCI. The symptoms of MCI are not as severe as that in AD and thus people with MCI can carry out their normal daily activities (Albert et al., 2011). There are several subtypes of MCI and one subtype called amnestic MCI which is associated with memory loss has a high risk of progression to AD (Gauthier et al., 2006). Research has shown that the brain areas of memory are impaired in both MCI and AD, while the cognitive domains are only impaired in AD (Petersen et al., 1999). Although the rates of progression varied considerably among literature and the progression is not inevitable, amnestic MCI is still considered to be a forerunner of AD. AD is the most common type of dementia causes problems with memory, thinking and behavior (Strittmatter et al., 1993). AD is increasingly prevalent in individuals over the age of 65 and the significance of AD as a public health problem became evident (Glenner, 1990). It is estimated that 60 new case of AD exists in every hour and by 2050, this number will go to double (Alzheimer's Association, 2015). Between 2000 and 2013, the death results from AD increased remarkably 71%, making AD the sixth leading cause of death in the United States (Alzheimer's Association, 2015).

Traditionally, the diagnosis of AD mainly depends on the clinical examinations and the evaluations of individuals' perception and behavior (Arbabshirani et al., 2017). Improving diagnosis of AD and MCI patients might help to identify diseases earlier in the disease's progress, which may be crucial in developing treatments for these disorders. Considering the severe health impact of AD and MCI and their overall effect on caregivers and society, there has been a large numbers of studies using neuroimaging features, especially the functional connectivity in fMRI to diagnose these brain disorders. Wang et al. proposed a discriminative model of AD based on the Pseudo-Fisher Linear Discriminative Analysis (pFLDA) (Wang et al., 2006). They used the correlation/anti-correlation coefficients of two anti-correlated networks in resting brains as the features of the classification model and obtain a CCR of 83%. Challis et al. employed Bayesian Gaussian process logistic regression (GP-LR) models with linear and non-linear covariance functions in the classification of AD and MCI (Challis et al., 2015). By using functional connectivity as features, they achieved 75% accuracy disambiguating healthy controls from individuals with MCI and 97% accuracy disambiguating individuals with MCI and individuals with AD. Not only the functional connectivity itself, but also its extended or related metrics, such as graphic metrics, have been used as features for the diagnosis of AD and MCI. Jie et al. have developed a novel framework to integrate multiple connectivity properties for improving the diagnosis of MCI (Jie et al., 2014). A multi-kernel learning (MKL) technique was adopted and two types of kernels were used to quantify the local and global connectivity properties respectively. 91.9% classification accuracy was achieved by this method, which is much better than that in previous studies using single connectivity properties. Another study combined graphic theoretical approaches with machine learning method to investigate the atypical functional brain network in patients with AD (Khazaee et al., 2015). They performed statistical analysis on connectivity which is measured by correlation coefficient to search altered connectivity patterns in patients and then calculated three graphic metrics, clustering coefficient, local efficiency, and normalized local efficiency based on the connectivity matrix. A SVM classifier was finally used to explore diagnosis ability of these graphic metrics. Their results showed that those graphic metrics can well separate patients with AD and healthy controls with 100% accuracy. Functional connectivity from fMRI is also incorporated with features from other modalities in the diagnosis of AD. Dai et al. proposed a methodological framework using features from multi-modalities to discriminate patients with AD from healthy controls (Dai Z. et al., 2012). The gray matter volume from structural MRI and three functional characteristics from fMRI were used as the features of classifiers. By using leave-one-out cross-validation, this method provided satisfactory classification accuracy of 89.47% with a sensitivity of 87.50% and a specificity of 90.91%. Schouten et al. used measures from structural MRI, diffusion MRI and resting-state fMRI as the input features of elastic net classifier to classify AD (Schouten et al., 2016). They showed the gray matter density achieved the best classification accuracy among all single modal imaging and multimodal combination can significantly improve the classification performance. These findings suggested that different MRI modalities provide complementary information for classifying AD. The human brain is a dynamic system with non-stationary neural activity and rapidly-changing neural interaction. Increasing evidence shows that functional connectivity is not static but varies significantly in time. There already exist studies using dynamic patterns in functional connectivity as features for the classification of dementia and its pre-stages. A MCI study applied a sliding window approach to estimate dynamic functional correlation tensors between white matters and DFC between gray matters and used these as features to classify MCI subjects (Chen X. et al., 2017). They found that the dynamic functional features significantly improved the classification performance, showing that the functional information in gray matter and white matter is complimentary.

Although vast majority of AD or MCI classification studies used traditional classifiers such as SVM and LDA, increasing studies have considered the advantages of deep learning classifiers over the traditional ones and started using deep learning models in the classification of AD and MCI (Suk et al., 2016; Meszlényi et al., 2017). Meszlenyi et al. described a convolutional neural network for functional connectivity classification called connectome-convolutional neural network (CCNN) (Meszlényi et al., 2017). By testing the performance of CCNN model on both simulated datasets and a public MCI dataset, they showed that the developed model is capable of distinguishing subjects of different groups. Their results also demonstrated that the CCNN model can combine different functional connectivity metrics in the classification and such combination results in better performance than other classifiers using single metric only.

Challenges and Difficulties in Identifying Biomarkers of Brain Disorders and Classification of Individual Subject

Lacking Gold Standards for Diagnoses

Analyzing fMRI data for the ultimate goal of identifying biomarkers and diagnosing brain disorders using neuroimage-based measures is promising but challenging, due to the fact that the current diagnostic categorization itself used as prior guidance could be inaccurate and need further refinement (Insel and Cuthbert, 2015). So far, there is no gold standard for the complex diagnosis. The diagnosis is determined solely by observable symptoms, and the interview and history are the main factors that influence the diagnosis. For example, in clinical diagnosis, it can be difficult to distinguish SZ, BP, and SAD that show overlapping clinical symptoms (Cosgrove and Suppes, 2013; Malaspina et al., 2013; Cardno and Owen, 2014). SZ is a psychotic disorder characterized by altered perception, loss of motivation and judgment, and impairment in social cognition. BP is a mood disorder marked by alternating episodes of mania and depression. SAD is diagnosed when the symptom criteria for SZ are met and during the same continuous period there are major depressive, manic or mixed episodes. In fact, there are also overlapping symptoms such as social withdrawal and communication impairment between ASD and SZ spectrum disorders (Fitzgerald, 2013; Chisholm et al., 2015). ASD, a neurodevelopmental disorder, is characterized by a spectrum of abnormal behaviors including persistent deficits in social communication and interaction across multiple contexts. ADHD is marked by an ongoing pattern of inattention and/or hyperactivity-impulsivity that interferes with functioning or development. Research work also shows a high rate of overlapping symptoms between ASD and ADHD (Taurines et al., 2012). Therefore, the similarities in symptoms between these brain disorders give rise to difficulties in clinical diagnosis.

Most existing fMRI studies (Calhoun et al., 2009a; Koike et al., 2013; Du et al., 2017c), which applied statistical analyses to investigate differences among multiple groups or performed supervised learning approaches to explore biomarkers for effective individual diagnosis and treatment, rely on the diagnostic labeling. The assumptions in those studies are (1) diagnostic groups are distinct from each other and (2) individuals are homogeneous within each predefined group. However, in practice patients could be incorrectly diagnosed due to the overlapping or similar symptoms of diseases, causing that subjects assigned into the same group may show biologically inconsistent alterations. Therefore, the possible bias in the diagnosis labeling will result in inaccurate biomarkers and consequently affect the discriminative power of the classifier constructed based on the provided labels.

There is a great need for the development of disease categories built on biological data and supported by objective and quantitative validation, i.e., the approach recently emphasized by the RDoC initiative (www.nimh.nih.gov/rdoc) (Insel et al., 2010; Cuthbert and Insel, 2013). Due to imperfections of the current disease nosology (especially for psychiatric disorders), how to identify markers/features from a large amount of possibly relevant measures (e.g., high-dimensional neuroimaging data) and then rebuild or refine the nosology based on the neuroimaging-features is a big challenge. One way forward is to consider identifying markers and rebuilding a nosology of disorders (or classifying individual subjects) as one combined problem. The most important and difficult issue is how to propose a “mathematical, precise resolution of what constitutes ‘sufficiently similar' patients” (Djulbegovic and Paul, 2011; Marquand et al., 2016).

Difficulties in Identifying Accurate Pathological Features as Biomarkers from High Dimensional Measures

Given that there are generally more features than samples, it is advantageous to reduce the number of possible measures to focus on a subset of particular interest. As discussed in section Feature Selection Strategies, most relevant work has extracted features in the context of group labeling (e.g., SZ or HC). Even if feature selection is performed using a supervised method, the resulting features are not necessarily able to show a clustering property within each group as expected, since there are usually abundant unrelated and redundant measures considered. In the event the diagnosis is inaccurate, selection of features so that they can show clustering (or similar) patterns within the same group and distinct patterns between different groups is more difficult. Without using group labeling, Clementz et al. (2016) constructed biotypes using a panel of cognitive and electro physiological features that were selected according to known relevance to psychosis and brain function. Promisingly, biotypes showed more reasonable neurobiological heterogeneity and coherent subgroups in psychosis than diagnosis-based category (Clementz et al., 2015; Meda et al., 2016). However, the selected features depended on subjective empirical knowledge and were not automatically extracted from available data. In contrast, some research work (Gates et al., 2014; Geisler et al., 2015; Sun et al., 2015) used all available features and did not further refine features according to prior knowledge. Such selected features working well for one dataset may not converge to a consistent grouping for a different dataset. More advanced methods which can automatically select features that have a good differentiating ability under the condition of no or less guidance of diagnosis labeling are still under way. Semi-supervised feature selection methods (Sheikhpour et al., 2017), which allow using both labeled and unlabeled samples to discover the feature relevance, may be promising and beneficial.

Challenges in Validating Biomarkers and Classification

Once the biomarkers and biologically-derived classification are obtained, validating the biomarkers and categories (or classification) is another important issue. Most related studies have classified independent subjects based on the identified biomarkers and a well-trained model, and then compared the classification outputs with the diagnosis labels. However, researchers should be aware that the diagnosis labels used as ground-truth could be inaccurate. Some work (Geisler et al., 2015; Clementz et al., 2016) evaluated derived categories using external independent measures or other features that were highly correlated with the used features of the same dataset to see if subjects in one group showed greater similarity in terms of those additional metrics. However, this kind of validation is circular to some extent. A more reasonable technique is to assess biomarker and cluster (or classification) reproducibility by adding additional independent subjects' data or re-sampling of the original data, since a rational classification of brain disorders should be able to map onto pathophysiology using different datasets.

Other Issues that Should Be Considered

There are also other issues which deserve consideration in future clinical applications. In most neuroimaging-based studies focusing on the classification/prediction problem, accuracy, sensitivity, and specificity were used to evaluate the distinguishing ability of the biomarkers identified and the model built. Unlike the screening test (Grimes and Schulz, 2002) that is to detect potential disorders or diseases in people who do not have any symptoms of disease, these assessing metrics (accuracy, sensitivity, and specificity) cannot provide a realistic measure of the positive (probability of having the disease given a positive test) and negative (probability of not having the disorder given a negative test) predictive value (Castellanos et al., 2013), since prevalence of different diseases influences positive/negative predictive value.

In addition to accurately classify the categorization of brain disorders, increasing studies focus on prediction of continuous variables such as individual cognitive scores, symptomatic scores and behavioral performance using fMRI data (Meskaldji et al., 2016; Meng et al., 2017; Shen et al., 2017; Yoo et al., 2018). These studies used different brain connectivity features as the inputs and generate predictors of these features for new coming subjects. Linear regression and partial least square (PLS) regression are the most commonly used methods to achieve the goal. PLS, in which the predictor variables are projected to a new space of components with regard to response variables, is particularly useful, since the number of features is usually much larger than the number of observations/subjects. Support vector regression (Dosenbach et al., 2010), a supervised learning algorithm, which considers all features simultaneously and generates a model that assigns different weights to different features, can also be employed. Generally, the correlation between predicted variables and real recorded variables in the testing set is used to evaluate the performance of the model.

It should be noted that brain diseases can also induce spatial changes due to atrophy for example. In the preprocessing step, inter-subject spatial alignment of fMRI data is typically achieved through registering their co-registered structural MRI images to an anatomic template or directly registering fMRI data to an echo planar imaging (EPI) template. However, these registration methods cannot guarantee fully accurate inter-subject functional consistency, although the following spatial smoothing of fMRI data can reduce the inter-subject functional variability to some extent. Therefore, functional connectivity computed between given brain regions may not accurately correspond across subjects, although the adaptive ICA-based methods are likely more robust to this than ROI or voxel based approaches. In the future, advanced normalization methods (Khullar et al., 2011; Jiang et al., 2013; Cetin et al., 2015) based on function information directly from fMRI data can help address this issue.

Summary

Mapping brain functional connectivity using fMRI data is now a major emphasis of ongoing research, frequently with a goal of identifying biomarkers and classifying different brain disorders. In this paper, we comprehensively reviewed different approaches which make efforts to accurately map the functional connectome. We included both the traditional static connectivity analysis and the more recently applied dynamic connectivity analysis. Connectivity measures that can be potentially taken as features (i.e., biomarkers) for classification and prediction were clearly summarized for each method. Furthermore, we surveyed various feature selection and classifier building strategies in order to provide guidance on how to perform the classification and predication problem in practice. After that, an updated overview on applications of classifying SZ, BP, ASD, ADHD, were shown. Finally, we discussed gaps in the research and areas that particularly deserve improvement.

Author Contributions

YD proposed the framework and wrote the paper. ZF drafted and revised the paper. VC revised the manuscript and gave final approval.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was partially supported by National Institutes of Health grants 5P20RR021938/P20GM103472 & R01EB020407 and National Science Foundation grant 1539067 (to VC), National Natural Science Foundation of China (Grant No. 61703253, to YD) and Natural Science Foundation of Shanxi Province (Grant No. 2016021077, to YD).

References

Abi-Dargham, A., and Horga, G. (2016). The search for imaging biomarkers in psychiatric disorders. Nat. Med. 22, 1248–1255. doi: 10.1038/nm.4190

PubMed Abstract | CrossRef Full Text | Google Scholar

Abraham, A., Milham, M. P., Di Martino, A., Craddock, R. C., Samaras, D., Thirion, B., et al. (2017). Deriving reproducible biomarkers from multi-site resting-state data: an autism-based example. Neuroimage 147, 736–745. doi: 10.1016/j.neuroimage.2016.10.045

PubMed Abstract | CrossRef Full Text | Google Scholar

Adali, T., Anderson, M., and Fu, G. S. (2014). Diversity in independent component and vector analyses: identifiability, algorithms, and applications in medical imaging. IEEE Signal Process. Mag. 31, 18–33. doi: 10.1109/MSP.2014.2300511

CrossRef Full Text | Google Scholar

Albert, M. S., Dekosky, S. T., Dickson, D., Dubois, B., Feldman, H. H., Fox, N. C., et al. (2011). The diagnosis of mild cognitive impairment due to Alzheimer's disease: Recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 7, 270–279. doi: 10.1016/j.jalz.2011.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Allen, E. A., Damaraju, E., Plis, S. M., Erhardt, E. B., Eichele, T., and Calhoun, V. D. (2014). Tracking whole-brain connectivity dynamics in the resting state. Cereb. Cortex 24, 663–676. doi: 10.1093/cercor/bhs352

PubMed Abstract | CrossRef Full Text | Google Scholar

Allen, E. A., Erhardt, E. B., Damaraju, E., Gruner, W., Segall, J. M., and Silva, R. F. (2011). A baseline for the multivariate comparison of resting-state networks. Front. Syst. Neurosci. 5:2. doi: 10.3389/fnsys.2011.00002

PubMed Abstract | CrossRef Full Text | Google Scholar

Allen, E. A., Erhardt, E. B., Wei, Y., Eichele, T., and Calhoun, V. D. (2012). Capturing inter-subject variability with group independent component analysis of fMRI data: a simulation study. Neuroimage 59, 4141–4159. doi: 10.1016/j.neuroimage.2011.10.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Altman, E. I., Marco, G., and Varetto, F. (1994). Corporate distress diagnosis: comparisons using linear discriminant analysis and neural networks (the Italian experience). J. Bank. Finance 18, 505–529. doi: 10.1016/0378-4266(94)90007-8

CrossRef Full Text | Google Scholar

Alzheimer's Association (2015). 2015 Alzheimer's disease facts and figures. Alzheimers Dement. 11, 332–384. doi: 10.1016/j.jalz.2015.02.003

CrossRef Full Text

American Psychiatric Association (2013). Diagnostic and Statistical Manual of Mental Disorders (DSM-5®). Washington, DC: American Psychiatric Association.

Anckarsäter, H., Stahlberg, O., Larson, T., Hakansson, C., Jutblad, S.-B., Niklasson, L., et al. (2006). The impact of ADHD and autism spectrum disorders on temperament, character, and personality development. Am. J. Psychiatry 163, 1239–1244. doi: 10.1176/ajp.2006.163.7.1239

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, A., and Cohen, M. S. (2013). Decreased small-world functional network connectivity and clustering across resting state networks in schizophrenia: an fMRI classification tutorial. Front. Hum. Neurosci. 7:520. doi: 10.3389/fnhum.2013.00520

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, A., Douglas, P. K., Kerr, W. T., Haynes, V. S., Yuille, A. L., Xie, J., et al. (2014). Non-negative matrix factorization of multimodal MRI, fMRI and phenotypic data reveals differential changes in default mode subnetworks in ADHD. Neuroimage 102, 207–219. doi: 10.1016/j.neuroimage.2013.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, J. S., Nielsen, J. A., Froehlich, A. L., Dubray, M. B., Druzgal, T. J., Cariello, A. N., et al. (2011). Functional connectivity magnetic resonance imaging classification of autism. Brain 134, 3742–3754. doi: 10.1093/brain/awr263

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, M., Adali, T., and Li, X. L. (2012). Joint blind source separation with multivariate Gaussian model: algorithms and performance analysis. IEEE Trans. Signal Process. 60, 1672–1683. doi: 10.1109/TSP.2011.2181836

CrossRef Full Text | Google Scholar

Anderson, M., Fu, G. S., Phlypo, R., and Adali, T. (2014). Independent vector analysis: identification conditions and performance bounds. IEEE Trans. Signal Process. 62, 4399–4410. doi: 10.1109/TSP.2014.2333554

CrossRef Full Text | Google Scholar

Anderson, M., Li, X. L., and Adali, T. (2010). “Nonorthogonal independent vector analysis using multivariate gaussian model,” in Latent Variable Analysis and Signal Separation, Vol. 6365, eds V. Vigneron, V. Zarzoso, E. Moreau, R. Gribonval, and E. Vincent (Berlin; Heidelberg: Springer), 354–361.

Google Scholar

Anticevic, A., Cole, M. W., Repovs, G., Murray, J. D., Brumbaugh, M. S., Winkler, A. M., et al. (2013). Characterizing thalamo-cortical disturbances in schizophrenia and bipolar illness. Cereb. Cortex 24, 3116–3130. doi: 10.1093/cercor/bht165

PubMed Abstract | CrossRef Full Text | Google Scholar

Arbabshirani, M. R., Kiehl, K. A., Pearlson, G. D., and Calhoun, V. D. (2013). Classification of schizophrenia patients based on resting-state functional network connectivity. Front. Neurosci. 7:133. doi: 10.3389/fnins.2013.00133

PubMed Abstract | CrossRef Full Text | Google Scholar

Arbabshirani, M. R., Plis, S., Sui, J., and Calhoun, V. D. (2017). Single subject prediction of brain disorders in neuroimaging: promises and pitfalls. Neuroimage 145, 137–165. doi: 10.1016/j.neuroimage.2016.02.079

PubMed Abstract | CrossRef Full Text | Google Scholar

Arribas, J. I., Calhoun, V. D., and Adali, T. (2010). Automatic Bayesian classification of healthy controls, bipolar disorder, and schizophrenia using intrinsic connectivity maps from FMRI data. IEEE Trans. Biomed. Eng. 57, 2850–2860. doi: 10.1109/TBME.2010.2080679

PubMed Abstract | CrossRef Full Text | Google Scholar

Autism and Developmental Disabilities Monitoring Network Surveillance Year 2008 Principal Investigators; Centers for Disease Control and Prevention (2012). Prevalence of Autism Spectrum Disorders: Autism and Developmental Disabilities Monitoring Network, 14 Sites, United States, 2008. MMWR Surveill. Summ. 61, 1–19.

Bassett, D. S., Nelson, B. G., Mueller, B. A., Camchong, J., and Lim, K. O. (2012). Altered resting state complexity in schizophrenia. Neuroimage 59, 2196–2207. doi: 10.1016/j.neuroimage.2011.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Beckmann, C. F., and Smith, S. M. (2005). Tensorial extensions of independent component analysis for multisubject FMRI analysis. Neuroimage 25, 294–311. doi: 10.1016/j.neuroimage.2004.10.043

PubMed Abstract | CrossRef Full Text | Google Scholar

Beckmann, C., Mackay, C., Filippini, N., and Smith, S. (2009). Group comparison of resting-state FMRI data using multi-subject ICA and dual regression. Neuroimage 47(Suppl. 1), S148. doi: 10.1016/S1053-8119(09)71511-3

CrossRef Full Text | Google Scholar

Bernas, A., Aldenkamp, A. P., and Zinger, S. (2018). Wavelet coherence-based classifier: a resting-state functional MRI study on neurodynamics in adolescents with high-functioning autism. Comp. Methods Prog. Biomed. 154, 143–151. doi: 10.1016/j.cmpb.2017.11.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhugra, D. (2005). The global prevalence of schizophrenia. PLoS Med. 2:e151. doi: 10.1371/journal.pmed.0020151

PubMed Abstract | CrossRef Full Text | Google Scholar

Biederman, J. (2005). Attention-deficit/hyperactivity disorder: a selective overview. Biol. Psychiatry 57, 1215–1220. doi: 10.1016/j.biopsych.2004.10.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Birur, B., Kraguljac, N. V., Shelton, R. C., and Lahti, A. C. (2017). Brain structure, function, and neurochemistry in schizophrenia and bipolar disorder-a systematic review of the magnetic resonance neuroimaging literature. Npj Schizophr. 3:15. doi: 10.1038/s41537-017-0013-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Bohland, J. W., Saperstein, S., Pereira, F., Rapin, J., and Grady, L. (2012). Network, anatomical, and non-imaging measures for the prediction of ADHD diagnosis in individual subjects. Front. Syst. Neurosci. 6:78. doi: 10.3389/fnsys.2012.00078

PubMed Abstract | CrossRef Full Text | Google Scholar

Boukouvalas, Z., Fu, G. S., and Adali, T. (2015). “An efficient multivariate generalized Gaussian distribution estimator: application to IVA,” in 2015 49th Annual Conference on Information Sciences and Systems (CISS) (Baltimore, MD).

Google Scholar

Cabral, C., Kambeitz-Ilankovic, L., Kambeitz, J., Calhoun, V. D., Dwyer, D. B., Von Saldern, S., et al. (2016). Classifying schizophrenia using multimodal multivariate pattern recognition analysis: evaluating the impact of individual clinical profiles on the neurodiagnostic performance. Schizophr. Bull. 42, S110–S117. doi: 10.1093/schbul/sbw053

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D. (2001). fMRI activation in a visual-perception task: network of areas detected using the general linear model and independent components analysis. Neuroimage 14, 1080–1088. doi: 10.1006/nimg.2001.0921

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Adali, T., Pearlson, G. D., and Pekar, J. J. (2001). A method for making group inferences from functional MRI data using independent component analysis. Hum. Brain Mapp. 14, 140–151. doi: 10.1002/hbm.1048

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., and Adali, T. (2012). Multisubject independent component analysis of fMRI: a decade of intrinsic networks, default mode, and neurodiagnostic discovery. IEEE Rev. Biomed. Eng. 5, 60–73. doi: 10.1109/RBME.2012.2211076

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., and De Lacy, N. (2017). Ten key observations on the analysis of resting-state functional MR imaging data using independent component analysis. Neuroimaging Clin. N. Am. 27, 561–579. doi: 10.1016/j.nic.2017.06.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., and Sui, J. (2016). Multimodal fusion of brain imaging data: a key to finding the missing link (s) in complex mental illness. Biol. Psychiatry 1, 230–244. doi: 10.1016/j.bpsc.2015.12.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Eichele, T., and Pearlson, G. (2009a). Functional brain networks in schizophrenia: a review. Front. Hum. Neurosci. 3:17. doi: 10.3389/neuro.09.017.2009

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Liu, J., and Adali, T. (2009b). A review of group ICA for fMRI data and ICA for joint inference of imaging, genetic, and ERP data. Neuroimage 45, S163–S172. doi: 10.1016/j.neuroimage.2008.10.057

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Maciejewski, P. K., Pearlson, G. D., and Kiehl, K. A. (2008). Temporal lobe and “default” hemodynamic brain modes discriminate between schizophrenia and bipolar disorder. Hum. Brain Mapp. 29, 1265–1275. doi: 10.1002/hbm.20463

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Miller, R., Pearlson, G. D., and Adali, T. (2014). The chronnectome: time-varying connectivity networks as the next frontier in fMRI data discovery. Neuro 84, 262–274. doi: 10.1016/j.neuron.2014.10.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Calhoun, V. D., Sui, J., Kiehl, K., Turner, J., Allen, E., and Pearlson, G. (2011). Exploring the psychosis functional connectome: aberrant intrinsic networks in schizophrenia and bipolar disorder. Front. Psychiatry 2:75. doi: 10.3389/fpsyt.2011.00075

PubMed Abstract | CrossRef Full Text | Google Scholar

Cardno, A. G., and Owen, M. J. (2014). Genetic relationships between schizophrenia, bipolar disorder, and schizoaffective disorder. Schizophr. Bull. 40, 504–515. doi: 10.1093/schbul/sbu016

PubMed Abstract | CrossRef Full Text | Google Scholar

Castellanos, F. X., Di Martino, A., Craddock, R. C., Mehta, A. D., and Milham, M. P. (2013). Clinical applications of the functional connectome. Neuroimage 80, 527–540. doi: 10.1016/j.neuroimage.2013.04.083

PubMed Abstract | CrossRef Full Text | Google Scholar

Castro, E., Gómez-Verdejo, V., Martínez-Ramón, M., Kiehl, K. A., and Calhoun, V. D. (2014). A multiple kernel learning approach to perform classification of groups from complex-valued fMRI data analysis: application to schizophrenia. Neuroimage 87, 1–17. doi: 10.1016/j.neuroimage.2013.10.065

PubMed Abstract | CrossRef Full Text | Google Scholar

Castro, E., Martínez-Ramón, M., Pearlson, G., Sui, J., and Calhoun, V. D. (2011). Characterization of groups using composite kernels and multi-source fMRI analysis data: application to schizophrenia. Neuroimage 58, 526–536. doi: 10.1016/j.neuroimage.2011.06.044

PubMed Abstract | CrossRef Full Text | Google Scholar

Cetin, M. S., Houck, J. M., Rashid, B., Agacoglu, O., Stephen, J. M., Sui, J., et al. (2016). Multimodal classification of schizophrenia patients with MEG and fMRI data using static and dynamic connectivity measures. Front. Neurosci. 10:466. doi: 10.3389/fnins.2016.00466

CrossRef Full Text | Google Scholar

Çetin, M. S., Khullar, S., Damaraju, E., Michael, A. M., Baum, S. A., and Calhoun, V. D. (2015). Enhanced disease characterization through multi network functional normalization in fMRI. Front. Neurosci. 9:95. doi: 10.3389/fnins.2015.00095

PubMed Abstract | CrossRef Full Text | Google Scholar

Challis, E., Hurley, P., Serra, L., Bozzali, M., Oliver, S., and Cercignani, M. (2015). Gaussian process classification of Alzheimer's disease and mild cognitive impairment from resting-state fMRI. Neuroimage 112, 232–243. doi: 10.1016/j.neuroimage.2015.02.037

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C., and Glover, G. H. (2010). Time-frequency dynamics of resting-state brain connectivity measured with fMRI. Neuroimage 50, 81–98. doi: 10.1016/j.neuroimage.2009.12.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Chantiluke, K., Christakou, A., Murphy, C. M., Giampietro, V., Daly, E. M., Ecker, C., et al. (2014). Disorder-specific functional abnormalities during temporal discounting in youth with Attention Deficit Hyperactivity Disorder (ADHD), Autism and comorbid ADHD and Autism. Psychiatry Res. Neuroimaging 223, 113–120. doi: 10.1016/j.pscychresns.2014.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, C. P., Keown, C. L., Jahedi, A., Nair, A., Pflieger, M. E., Bailey, B. A., et al. (2015). Diagnostic classification of intrinsic functional connectivity highlights somatosensory, default mode, and visual regions in autism. Neuroimage Clin. 8, 238–245. doi: 10.1016/j.nicl.2015.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, G., Ward, B. D., Xie, C., Li, W., Wu, Z., Jones, J. L., et al. (2011). Classification of Alzheimer disease, mild cognitive impairment, and normal cognitive status with large-scale network analysis based on resting-state functional MR imaging. Radiology 259, 213–221. doi: 10.1148/radiol.10100734

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, H., Duan, X., Liu, F., Lu, F., Ma, X., Zhang, Y., et al. (2016). Multivariate classification of autism spectrum disorder using frequency-specific resting-state functional connectivity—a multi-center study. Prog. Neuropsychopharmacol. Biol. Psychiatry. 64, 1–9. doi: 10.1016/j.pnpbp.2015.06.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J. E., Rubinov, M., and Chang, C. (2017). Methods and considerations for dynamic analysis of functional MR imaging data. Neuroimaging Clin. N. Am. 27, 547–560. doi: 10.1016/j.nic.2017.06.009.

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Zhang, H., Gao, Y., Wee, C. Y., Li, G., and Shen, D. (2016). High-order resting-state functional connectivity network for MCI classification. Hum. Brain Mapp. 37, 3282–3296. doi: 10.1002/hbm.23240

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Zhang, H., Zhang, L., Shen, C., Lee, S. W., and Shen, D. (2017). Extraction of dynamic functional connectivity from brain grey matter and white matter for MCI classification. Hum. Brain Mapp. 38, 5019–5034. doi: 10.1002/hbm.23711

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, H., Newman, S., Goñi, J., Kent, J. S., Howell, J., Bolbecker, A., et al. (2015). Nodal centrality of functional network in the differentiation of schizophrenia. Schizophr. Res. 168, 345–352. doi: 10.1016/j.schres.2015.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Chisholm, K., Lin, A., Abu-Akel, A., and Wood, S. J. (2015). The association between autism and schizophrenia spectrum disorders: a review of eight alternate models of co-occurrence. Neurosci. Biobehav. Rev. 55, 173–183. doi: 10.1016/j.neubiorev.2015.04.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Christakou, A., Murphy, C., Chantiluke, K., Cubillo, A., Smith, A., Giampietro, V., et al. (2013). Disorder-specific functional abnormalities during sustained attention in youth with attention deficit hyperactivity disorder (ADHD) and with autism. Mol. Psychiatry 18, 236–244. doi: 10.1038/mp.2011.185

PubMed Abstract | CrossRef Full Text | Google Scholar

Clementz, B. A., Sweeney, J. A., Hamm, J. P., Ivleva, E. I., Ethridge, L. E., Pearlson, G. D., et al. (2016). Identification of distinct psychosis biotypes using brain-based biomarkers. Am. J. Psychiatry 173, 373–384. doi: 10.1176/appi.ajp.2015.14091200

PubMed Abstract | CrossRef Full Text | Google Scholar

Clementz, B. A., Sweeney, J., Keshavan, M. S., Pearlson, G., and Tamminga, C. A. (2015). Using biomarker batteries. Biol. Psychiatry 77, 90–92. doi: 10.1016/j.biopsych.2014.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Colby, J. B., Rudie, J. D., Brown, J. A., Douglas, P. K., Cohen, M. S., and Shehzad, Z. (2012). Insights into multimodal imaging classification of ADHD. Front. Syst. Neurosci. 6:59. doi: 10.3389/fnsys.2012.00059

PubMed Abstract | CrossRef Full Text | Google Scholar

Cortes, C., and Vapnik, V. (1995a). Support-vector networks. Mach. Learn. 20, 273–297.

Google Scholar

Cortes, C., and Vapnik, V. (1995b). Support vector machine. Mach. Learn. 20, 273–297.

Google Scholar

Cosgrove, V. E., and Suppes, T. (2013). Informing DSM-5: biological boundaries between bipolar I disorder, schizoaffective disorder, and schizophrenia. BMC Med. 11:127. doi: 10.1186/1741-7015-11-127

CrossRef Full Text | Google Scholar

Cuthbert, B. N., and Insel, T. R. (2013). Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 11:126. doi: 10.1186/1741-7015-11-126

CrossRef Full Text | Google Scholar

Dai, D., Wang, J., Hua, J., and He, H. (2012). Classification of ADHD children through multimodal magnetic resonance imaging. Front. Syst. Neurosci. 6:63. doi: 10.3389/fnsys.2012.00063

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, Z., Yan, C., Wang, Z., Wang, J., Xia, M., Li, K., et al. (2012). Discriminative analysis of early Alzheimer's disease using multi-modal imaging and multi-level characterization with multi-classifier (M3). Neuroimage 59, 2187–2195. doi: 10.1016/j.neuroimage.2011.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Damaraju, E., Allen, E. A., Belger, A., Ford, J. M., McEwen, S., Mathalon, D. H., et al. (2014). Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia. Neuroimage Clin. 5, 298–308. doi: 10.1016/j.nicl.2014.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

De Marco, M., Beltrachini, L., Biancardi, A., Frangi, A. F., and Venneri, A. (2017). Machine-learning Support to Individual Diagnosis of Mild Cognitive Impairment Using Multimodal, MRI and cognitive assessments. Alzheimer Dis. Assoc. Disord. 31, 278–286. doi: 10.1097/WAD.0000000000000208

PubMed Abstract | CrossRef Full Text | Google Scholar

De Martino, F., Gentile, F., Esposito, F., Balsi, M., Di Salle, F., Goebel, R., et al. (2007). Classification of fMRI independent components using IC-fingerprints and support vector machine classifiers. Neuroimage 34, 177–194. doi: 10.1016/j.neuroimage.2006.08.041

PubMed Abstract | CrossRef Full Text | Google Scholar

De Vos, F., Koini, M., Schouten, T. M., Seiler, S., Van Der Grond, J., Lechner, A., et al. (2018). A comprehensive analysis of resting state fMRI measures to classify individual patients with Alzheimer's disease. Neuroimage 167, 62–72. doi: 10.1016/j.neuroimage.2017.11.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Dea, J. T., Anderson, M., Allen, E., Calhoun, V. D., and Adali, T. (2011). “IVA for multi-subject fMRI analysis: a comparative study using a new simulation toolbox,” in 2011 IEEE International Workshop on Machine Learning for Signal Processing (Mlsp) (Beijing).

Google Scholar

Deco, G., and Kringelbach, M. L. (2014). Great expectations: using whole-brain computational connectomics for understanding neuropsychiatric disorders. Neuron 84, 892–905. doi: 10.1016/j.neuron.2014.08.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Demirci, O., Clark, V. P., and Calhoun, V. D. (2008). A projection pursuit algorithm to classify individuals using fMRI data: application to schizophrenia. Neuroimage 39, 1774–1782. doi: 10.1016/j.neuroimage.2007.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Deng, L., Sun, J., Cheng, L., and Tong, S. (2016). Characterizing dynamic local functional connectivity in the human brain. Sci. Rep. 6:26976. doi: 10.1038/srep26976

PubMed Abstract | CrossRef Full Text | Google Scholar

Deshpande, G., Libero, L. E., Sreenivasan, K. R., Deshpande, H. D., and Kana, R. K. (2013). Identification of neural connectivity signatures of autism using machine learning. Front. Hum. Neurosci. 7:670. doi: 10.3389/fnhum.2013.00670

PubMed Abstract | CrossRef Full Text | Google Scholar

Deshpande, G., Wang, P., Rangaprakash, D., and Wilamowski, B. (2015). Fully connected cascade artificial neural network architecture for attention deficit hyperactivity disorder classification from functional magnetic resonance imaging data. IEEE Trans. Cybernet. 45, 2668–2679. doi: 10.1109/TCYB.2014.2379621

PubMed Abstract | CrossRef Full Text | Google Scholar

Dey, S., Rao, A. R., and Shah, M. (2012). Exploiting the brain's network structure in identifying ADHD subjects. Front. Syst. Neurosci. 6:75. doi: 10.3389/fnsys.2012.00075

PubMed Abstract | CrossRef Full Text | Google Scholar

Dey, S., Rao, A. R., and Shah, M. (2014). Attributed graph distance measure for automatic detection of attention deficit hyperactive disordered subjects. Front. Neural Circuits 8:64. doi: 10.3389/fncir.2014.00064

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Martino, A., O'connor, D., Chen, B., Alaerts, K., Anderson, J. S., Assaf, M., et al. (2017). Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Sci Data 4:170010. doi: 10.1038/sdata.2017.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., et al. (2014). The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry 19, 659–667. doi: 10.1038/mp.2013.78

PubMed Abstract | CrossRef Full Text | Google Scholar

Djulbegovic, B., and Paul, A. (2011). From efficacy to effectiveness in the face of uncertainty: indication creep and prevention creep. JAMA 305, 2005–2006. doi: 10.1001/jama.2011.650

PubMed Abstract | CrossRef Full Text | Google Scholar

Dos Santos Siqueira, A., Junior, B., Eduardo, C., Comfort, W. E., Rohde, L. A., and Sato, J. R. (2014). Abnormal functional resting-state networks in ADHD: graph theory and pattern recognition analysis of fMRI data. Biomed. Res. Int. 2014:380531. doi: 10.1155/2014/380531

CrossRef Full Text | Google Scholar

Dosenbach, N. U. F., Nardos, B., Cohen, A. L., Fair, D. A., Power, J. D., Church, J. A., et al. (2010). Prediction of individual brain maturity using fMRI. Science 329, 1358–1361. doi: 10.1126/science.1194144

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, J., Wang, L., Jie, B., and Zhang, D. (2016). Network-based classification of ADHD patients using discriminative subnetwork selection and graph kernel PCA. Comput. Med. Imaging Graph. 52, 82–88. doi: 10.1016/j.compmedimag.2016.04.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, W., Calhoun, V. D., Li, H., Ma, S., Eichele, T., Kiehl, K. A., et al. (2012). High classification accuracy for schizophrenia with rest and task fMRI data. Front. Hum. Neurosci. 6:145. doi: 10.3389/fnhum.2012.00145

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Allen, E. A., He, H., Sui, J., and Calhoun, V. D. (2014a). “Brain functional networks extraction based on fMRI artifact removal: single subject and group approaches,” The 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) Chicago, IL, 1026–1029.

Du, Y. H., Allen, E. A., He, H., Sui, J., Wu, L., and Calhoun, V. D. (2016a). Artifact removal in the context of group ICA: a comparison of single-subject and group approaches. Hum. Brain Mapp. 37, 1005–1025. doi: 10.1002/hbm.23086

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., and Fan, Y. (2013). Group information guided ICA for fMRI data analysis. Neuroimage 69, 157–197. doi: 10.1016/j.neuroimage.2012.11.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Fryer, S. L., Fu, Z. N., Lin, D. D., Sui, J., Chen, J. Y., et al. (2017a). Dynamic functional connectivity impairments in early schizophrenia and clinical high-risk for psychosis. Neuroimage. doi: 10.1016/j.neuroimage.2017.10.022. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Fryer, S. L., Lin, D. D., Sui, J., Yu, Q. B., Chen, J. Y., et al. (2018). Identifying functional network changing patterns in individuals at clinical high-risk for psychosis and patients with early illness schizophrenia: a group ICA study. Neuroimage Clin. 17, 335–346. doi: 10.1016/j.nicl.2017.10.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Lin, D. D., Yu, Q. B., Sui, J., Chen, J. Y., Rachakonda, S., et al. (2017b). Comparison of IVA and GIG-ICA in brain functional network estimation using fMRI data. Front. Neurosci. 11:267. doi: 10.3389/fnins.2017.00267

CrossRef Full Text | Google Scholar

Du, Y. H., Liu, J. Y., Sui, J., He, H., Pearlson, G. D., and Calhoun, V. D. (2014b). “Exploring difference and overlap between schizophrenia, schizoaffective and bipolar disorders using resting-state brain functional networks,” in The 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (Chicago, IL), 1517–1520.

Du, Y. H., Pearlson, G. D., He, H., Wu, L., and Calhoun, V.D. (2015a). “Identifying brain dynamic network states via GIG-ICA: application to schizophrenia, bipolar and schizoaffective disorders,” in IEEE 12th International Symposium on Biomedical Imaging (ISBI) (New York, NY), 478–481.

Google Scholar

Du, Y. H., Pearlson, G. D., Liu, J. Y., Sui, J., Yu, Q. B., He, H., et al. (2015b). A group ICA based framework for evaluating resting fMRI markers when disease categories are unclear: application to schizophrenia, bipolar, and schizoaffective disorders. Neuroimage 122, 272–280. doi: 10.1016/j.neuroimage.2015.07.054

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Pearlson, G. D., Yu, Q., He, H., Lin, D. D., Sui, J., et al. (2016b). Interaction among subsystems within default mode network diminished in schizophrenia patients: A dynamic connectivity approach. Schizophr. Res. 170, 55–65. doi: 10.1016/j.schres.2015.11.021.

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Pearlson, G., Lin, D. D., Sui, J., Chen, J. Y., Salman, M., et al. (2017c). Identifying dynamic functional connectivity biomarkers using GIG-ICA: application to schizophrenia, schizoaffective disorder and psychotic bipolar disorder. Hum. Brain Mapp. 38, 2683–2708. doi: 10.1002/hbm.23553

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y. H., Sui, J., Yu, Q. B., H, H., and Calhoun, V. D. (2014c). “Semi-supervised learning of brain functional networks,” in IEEE 11th International Symposium on Biomedical Imaging (ISBI) (Melbourne, VIC), 1–4.

Google Scholar

Dyrba, M., Grothe, M., Kirste, T., and Teipel, S. J. (2015). Multimodal analysis of functional and structural disconnection in Alzheimer's disease using multiple kernel SVM. Hum. Brain Mapp. 36, 2118–2131. doi: 10.1002/hbm.22759

PubMed Abstract | CrossRef Full Text | Google Scholar

Endicott, J., and Spitzer, R. L. (1978). A diagnostic interview: the schedule for affective disorders and schizophrenia. Arch. Gen. Psychiatry 35, 837–844. doi: 10.1001/archpsyc.1978.01770310043002

PubMed Abstract | CrossRef Full Text | Google Scholar

Erhardt, E. B., Rachakonda, S., Bedrick, E. J., Allen, E. A., Adali, T., and Calhoun, V. D. (2011). Comparison of multi-subject ICA methods for analysis of fMRI data. Hum. Brain Mapp. 32, 2075–2095. doi: 10.1002/hbm.21170

PubMed Abstract | CrossRef Full Text | Google Scholar

Esposito, F., Scarabino, T., Hyvarinen, A., Himberg, J., Formisano, E., Comani, S., et al. (2005). Independent component analysis of fMRI group studies by self-organizing clustering. Neuroimage 25, 193–205. doi: 10.1016/j.neuroimage.2004.10.042

PubMed Abstract | CrossRef Full Text | Google Scholar

Fair, D., Nigg, J. T., Iyer, S., Bathula, D., Mills, K. L., Dosenbach, N. U., et al. (2013). Distinct neural signatures detected for ADHD subtypes after controlling for micro-movements in resting state functional connectivity MRI data. Front. Syst. Neurosci. 6:80. doi: 10.3389/fnsys.2012.00080

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, Y., Liu, Y., Wu, H., Hao, Y., Liu, H., Liu, Z., et al. (2011). Discriminant analysis of functional connectivity patterns on Grassmann manifold. Neuroimage 56, 2058–2067. doi: 10.1016/j.neuroimage.2011.03.051

PubMed Abstract | CrossRef Full Text | Google Scholar

Farzi, S., Kianian, S., and Rastkhadive, I. (2017). “Diagnosis of attention deficit hyperactivity disorder using deep belief network based on greedy approach,” in 2017 5th International Symposium on Computational and Business Intelligence (ISCBI) (Dubai), 96–99.

Google Scholar

Fei, B., and Liu, J. (2006). Binary tree of SVM: a new fast multiclass training and classification algorithm. IEEE Trans. Neural Netw. 17, 696–704. doi: 10.1109/TNN.2006.872343

PubMed Abstract | CrossRef Full Text | Google Scholar

Fekete, T., Wilf, M., Rubin, D., Edelman, S., Malach, R., and Mujica-Parodi, L. R. (2013). Combining classification with fMRI-derived complex network measures for potential neurodiagnostics. PLoS ONE 8:e62867. doi: 10.1371/journal.pone.0062867

PubMed Abstract | CrossRef Full Text | Google Scholar

Fitzgerald, M. (2013). Overlap between schizophrenia and autism spectrum disorders. Eur. Child Adolesc. Psychiatry 22:S112. doi: 10.1108/AMHID-09-2013-0058

CrossRef Full Text | Google Scholar

Fonti, V., and Belitser, E. (2017). Feature selection using LASSO.

Friston, K. J., Williams, S., Howard, R., Frackowiak, R. S., and Turner, R. (1996). Movement-related effects in fMRI time-series. Magn. Reson. Med. 35, 346–355. doi: 10.1002/mrm.1910350312

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, Z., Tu, Y., Di, X., Du, Y., Pearlson, G., Turner, J., et al. (2017). Characterizing dynamic amplitude of low-frequency fluctuation and its relationship with dynamic functional connectivity: an application to schizophrenia. Neuroimage. doi: 10.1016/j.neuroimage.2017.09.035. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

Garrity, A. G., Pearlson, G. D., McKiernan, K., Lloyd, D., Kiehl, K. A., and Calhoun, V. D. (2007). Aberrant “default mode” functional connectivity in schizophrenia. Am J Psychiatry 164, 450–457. doi: 10.1176/ajp.2007.164.3.450

PubMed Abstract | CrossRef Full Text | Google Scholar

Gates, K. M., Molenaar, P. C. M., Iyer, S. P., Nigg, J. T., and Fair, D. A. (2014). Organizing heterogeneous samples using community detection of GIMME-derived resting state functional networks. PLoS ONE 9:e91322. doi: 10.1371/journal.pone.0091322

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauthier, S., Reisberg, B., Zaudig, M., Petersen, R. C., Ritchie, K., Broich, K., et al. (2006). Mild cognitive impairment. Lancet 367, 1262–1270. doi: 10.1016/S0140-6736(06)68542-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Geisler, D., Walton, E., Naylor, M., Roessner, V., Lim, K. O., Charles Schulz, S., et al. (2015). Brain structure and function correlates of cognitive subtypes in schizophrenia. Psychiatry Res. 234, 74–83. doi: 10.1016/j.pscychresns.2015.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Gheiratmand, M., Rish, I., Cecchi, G. A., Brown, M. R., Greiner, R., Polosecki, P. I., et al. (2017). Learning stable and predictive network-based patterns of schizophrenia and its clinical symptoms. Npj Schizophr. 3:22. doi: 10.1038/s41537-017-0022-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Glasser, M. F., Coalson, T. S., Robinson, E. C., Hacker, C. D., Harwell, J., Yacoub, E., et al. (2016). A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178. doi: 10.1038/nature18933

PubMed Abstract | CrossRef Full Text | Google Scholar

Glenner, G. G. (1990). “Alzheimer's disease,” in Biomedical Advances in Aging, ed A. L. Goldstein (Boston, MA: Springer), 51–62.

Google Scholar

Greenspan, S. (2015). Autism spectrum disorder.

Grimes, D. A., and Schulz, K. F. (2002). Uses and abuses of screening tests. Lancet 359, 881–884. doi: 10.1016/S0140-6736(02)07948-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Gu, Q., Li, Z., and Han, J. (2012). Generalized fisher score for feature selection. arXiv preprint arXiv:1202.3725.

Google Scholar

Guo, H., Cheng, C., Cao, X., Xiang, J., Chen, J., and Zhang, K. (2014). Resting-state functional connectivity abnormalities in first-onset unmedicated depression. Neural Regen. Res. 9, 153–63. doi: 10.4103/1673-5374.125344

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, H., Liu, L., Chen, J., Xu, Y., and Jie, X. (2017a). Alzheimer classification using a minimum spanning tree of high-order functional network on fMRI dataset. Front. Neurosci. 11:639. doi: 10.3389/fnins.2017.00639

CrossRef Full Text | Google Scholar

Guo, H., Zhang, F., Chen, J., Xu, Y., and Xiang, J. (2017b). Machine learning classification combining multiple features of a hyper-network of fMRI data in Alzheimer's disease. Front. Neurosci. 11:615. doi: 10.3389/fnins.2017.00615

CrossRef Full Text

Guo, S., Kendrick, K. M., Yu, R., Wang, H. L. S., and Feng, J. (2014). Key functional circuitry altered in schizophrenia involves parietal regions associated with sense of self. Hum. Brain Mapp. 35, 123–139. doi: 10.1002/hbm.22162

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, X., Dominick, K. C., Minai, A. A., Li, H., Erickson, C. A., and Lu, L. J. (2017). Diagnosing autism spectrum disorder from brain resting-state functional connectivity patterns using a deep neural network with a novel feature selection method. Front. Neurosci. 11:460. doi: 10.3389/fnins.2017.00460

CrossRef Full Text | Google Scholar

Guyon, I., and Elisseeff, A. (2003). An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182. doi: 10.1162/153244303322753616

CrossRef Full Text | Google Scholar

Hall, M. (1999). Correlation-Based Feature Selection for Machine Learning. Ph.D. thesis, Department of Computer Science, Waikato University.

Hamon, J. (2013). Optimisation Combinatoire Pour la Sélection de Variables en Régression en Grande Dimension: Application en Génétique Animale. Université des Sciences et Technologie de Lille-Lille, I.

Han, S., Huang, W., Zhang, Y., Zhao, J., and Chen, H. (2017). Recognition of early-onset schizophrenia using deep-learning method. Appl. Informatics 4:16. doi: 10.1186/s40535-017-0044-3

CrossRef Full Text | Google Scholar

Happé, F., Booth, R., Charlton, R., and Hughes, C. (2006). Executive function deficits in autism spectrum disorders and attention-deficit/hyperactivity disorder: examining profiles across domains and ages. Brain Cogn. 61, 25–39. doi: 10.1016/j.bandc.2006.03.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Hayasaka, S. (2013). Functional connectivity networks with and without global signal correction. Front. Hum. Neurosci. 7:80. doi: 10.3389/fnhum.2013.00880

PubMed Abstract | CrossRef Full Text | Google Scholar

Heinsfeld, A. S., Franco, A. R., Craddock, R. C., Buchweitz, A., and Meneguzzi, F. (2018). Identification of autism spectrum disorder using deep learning and the ABIDE dataset. Neuroimage Clin. 17, 16–23. doi: 10.1016/j.nicl.2017.08.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Hindriks, R., Adhikari, M. H., Murayama, Y., Ganzetti, M., Mantini, D., Logothetis, N. K., et al. (2016). Can sliding-window correlations reveal dynamic functional connectivity in resting-state fMRI? Neuroimage 127, 242–256. doi: 10.1016/j.neuroimage.2015.11.055

PubMed Abstract | CrossRef Full Text | Google Scholar

Hinton, G. E. (2009). Deep belief networks. Scholarpedia 4:5947. doi: 10.4249/scholarpedia.5947

CrossRef Full Text | Google Scholar

Hjelm, R. D., Calhoun, V. D., Salakhutdinov, R., Allen, E. A., Adali, T., and Plis, S. M. (2014). Restricted Boltzmann machines for neuroimaging: an application in identifying intrinsic networks. Neuroimage 96, 245–260. doi: 10.1016/j.neuroimage.2014.03.048

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsu, C.-W., and Lin, C.-J. (2002). A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13, 415–425. doi: 10.1109/72.991427

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, C. H., Ju, R. H., Shen, Y. S., Zhou, P., and Li, Q. Z. (2016). “Clinical decision support for Alzheimer's Disease based on deep learning and brain network,” in 2016 IEEE International Conference on Communications (ICC) (Kuala Lumpur).

Google Scholar

Hutchison, R. M., Womelsdorf, T., Gati, J. S., Everling, S., and Menon, R. S. (2013). Resting-state networks show dynamic functional connectivity in awake humans and anesthetized macaques. Hum. Brain Mapp. 34, 2154–2177. doi: 10.1002/hbm.22058

PubMed Abstract | CrossRef Full Text | Google Scholar

Iidaka, T. (2015). Resting state functional magnetic resonance imaging and neural network classified autism and control. Cortex 63, 55–67. doi: 10.1016/j.cortex.2014.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Insel, T. R., and Cuthbert, B. N. (2015). Medicine. Brain disorders? Precisely. Science 348, 499–500. doi: 10.1126/science.aab2358

PubMed Abstract | CrossRef Full Text | Google Scholar

Insel, T., Cuthbert, B., Garvey, M., Heinssen, R., Pine, D. S., Quinn, K., et al. (2010). Research domain criteria (RDoC): toward a new classification framework for research on mental disorders. Am. J. Psychiatry 167, 748–751. doi: 10.1176/appi.ajp.2010.09091379

PubMed Abstract | CrossRef Full Text | Google Scholar

Jafri, M. J., Pearlson, G. D., Stevens, M., and Calhoun, V. D. (2008). A method for functional network connectivity among spatially independent resting-state components in schizophrenia. Neuroimage 39, 1666–1681. doi: 10.1016/j.neuroimage.2007.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Jahedi, A., Nasamran, C. A., Faires, B., Fan, J., and Müller, R.-A. (2017). Distributed intrinsic functional connectivity patterns predict diagnostic status in large autism cohort. Brain Connect. 7, 515–525. doi: 10.1089/brain.2017.0496

PubMed Abstract | CrossRef Full Text | Google Scholar

Jang, H., Plis, S. M., Calhoun, V. D., and Lee, J. H. (2017). Task-specific feature extraction and classification of fMRI volumes using a deep neural network initialized with a deep belief network: evaluation using sensorimotor tasks. Neuroimage 145, 314–328. doi: 10.1016/j.neuroimage.2016.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, D., Du, Y., Cheng, H., Jiang, T., and Fan, Y. (2013). Groupwise spatial normalization of fMRI data based on multi-range functional connectivity patterns. Neuroimage 82, 355–372. doi: 10.1016/j.neuroimage.2013.05.093

PubMed Abstract | CrossRef Full Text | Google Scholar

Jie, B., Zhang, D., Gao, W., Wang, Q., Wee, C.-Y., and Shen, D. (2014). Integration of network topological and connectivity properties for neuroimaging classification. IEEE Trans. Biomed. Eng. 61, 576–589. doi: 10.1109/TBME.2013.2284195

PubMed Abstract | CrossRef Full Text | Google Scholar

Joel, S. E., Caffo, B. S., van Zijl, P. C. M., and Pekar, J. J. (2011). On the relationship between seed-based and ICA-Based measures of functional connectivity. Magn. Reson. Med. 66, 644–657. doi: 10.1002/mrm.22818

PubMed Abstract | CrossRef Full Text | Google Scholar

Ju, R., Hu, C., and Li, Q. (2017). “Early diagnosis of Alzheimer's disease based on resting-state brain networks and deep learning,” in IEEE/ACM Transactions on Computational Biology and Bioinformatics.

PubMed Abstract | Google Scholar

Kaufmann, T., Skåtun, K. C., Alnæs, D., Doan, N. T., Duff, E. P., Tønnesen, S., et al. (2015). Disintegration of sensorimotor brain networks in schizophrenia. Schizophr. Bull. 41, 1326–1335. doi: 10.1093/schbul/sbv060

PubMed Abstract | CrossRef Full Text | Google Scholar

Kay, S. R., Flszbein, A., and Opfer, L. A. (1987). The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr. Bull. 13:261. doi: 10.1093/schbul/13.2.261

CrossRef Full Text | Google Scholar

Khadka, S., Meda, S. A., Stevens, M. C., Glahn, D. C., Calhoun, V. D., Sweeney, J. A., et al. (2013). Is aberrant functional connectivity a psychosis endophenotype? A resting state functional magnetic resonance imaging study. Biol. Psychiatry 74, 458–466. doi: 10.1016/j.biopsych.2013.04.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Khazaee, A., Ebrahimzadeh, A., and Babajani-Feremi, A. (2015). Identifying patients with Alzheimer's disease using resting-state fMRI and graph theory. Clin. Neurophysiol. 126, 2132–2141. doi: 10.1016/j.clinph.2015.02.060

PubMed Abstract | CrossRef Full Text | Google Scholar

Khullar, S., Michael, A. M., Cahill, N. D., Kiehl, K. A., Pearlson, G., Baum, S. A., et al. (2011). ICA-fNORM: spatial normalization of fMRI data using intrinsic group-ICA networks. Front. Syst. Neurosci. 5:93. doi: 10.3389/fnsys.2011.00093

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, J., Calhoun, V. D., Shim, E., and Lee, J.-H. (2016). Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage 124, 127–146. doi: 10.1016/j.neuroimage.2015.05.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Kira, K., and Rendell, L. A. (1992). “The feature selection problem: traditional methods and a new algorithm,” in Proceedings of the 10th National Conference on Artificial Intelligence (San Jose, CA), 129–134.

Google Scholar

Kiviniemi, V., Vire, T., Remes, J., Elseoud, A. A., Starck, T., Tervonen, O., et al. (2011). A sliding time-window ICA reveals spatial variability of the default mode network in time. Brain Connect. 1, 339–347. doi: 10.1089/brain.2011.0036

PubMed Abstract | CrossRef Full Text | Google Scholar

Koike, S., Takano, Y., Iwashiro, N., Satomura, Y., Suga, M., Nagai, T., et al. (2013). A multimodal approach to investigate biomarkers for psychosis in a clinical setting: the integrative neuroimaging studies in schizophrenia targeting for early intervention and prevention (IN-STEP) project. Schizophr. Res. 143, 116–124. doi: 10.1016/j.schres.2012.11.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Ktena, S. I., Parisot, S., Ferrante, E., Rajchl, M., Lee, M., Glocker, B., et al. (2017). Metric learning with spectral graph convolutions on brain connectivity networks. Neuroimage 169, 431–442. doi: 10.1016/j.neuroimage.2017.12.052

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, M. A., and Gopal, M. (2011). Reduced one-against-all method for multiclass SVM classification. Expert Syst. Appl. 38, 14238–14248. doi: 10.1016/j.eswa.2011.04.237

CrossRef Full Text | Google Scholar

Ladha, L., and Deepa, T. (2011). Feature selection methods and algorithms. Int. J. Comp. Sci. Eng. 3, 1787–1797.

Google Scholar

Lal, T., Chapelle, O., Weston, J., and Elisseeff, A. (2006). Embedded methods. Feature Extract. 207, 137–165. doi: 10.1007/978-3-540-35488-8_6

CrossRef Full Text | Google Scholar

Laumann, T. O., Snyder, A. Z., Mitra, A., Gordon, E. M., Gratton, C., Adeyemo, B., et al. (2017). On the stability of BOLD fMRI correlations. Cereb. Cortex 27, 4719–4732. doi: 10.1093/cercor/bhw265

PubMed Abstract | CrossRef Full Text | Google Scholar

Lecun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. Nature 521, 436–444. doi: 10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, J. H., Lee, T. W., Jolesz, F. A., and Yoo, S. S. (2008). Independent vector analysis (IVA): multivariate approach for fMRI group study. Neuroimage 40, 86–109. doi: 10.1016/j.neuroimage.2007.11.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, M. H., Smyser, C. D., and Shimony, J. S. (2013). Resting-state fMRI: a review of methods and clinical applications. Am. J. Neuroradiol. 34, 1866–1872. doi: 10.3174/ajnr.A3263

PubMed Abstract | CrossRef Full Text | Google Scholar

Leonardi, N., Richiardi, J., Gschwind, M., Simioni, S., Annoni, J. M., Schluep, M., et al. (2013). Principal components of functional connectivity: a new approach to study dynamic brain connectivity during rest. Neuroimage 83, 937–950. doi: 10.1016/j.neuroimage.2013.07.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X. L., Adali, T., and Anderson, M. (2011). Joint blind source separation by generalized joint diagonalization of cumulant matrices. Signal Process. 91, 2314–2322. doi: 10.1016/j.sigpro.2011.04.016

CrossRef Full Text | Google Scholar

Li, X., Zhu, D., Jiang, X., Jin, C., Zhang, X., Guo, L., et al. (2014). Dynamic functional connectomics signatures for characterization and differentiation of PTSD patients. Hum Brain Mapp. 35, 1761–1778. doi: 10.1002/hbm.22290

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y. O., Adali, T., and Calhoun, V. D. (2007). Estimating the number of independent components for functional magnetic resonance imaging data. Hum. Brain Mapp. 28, 1251–1266. doi: 10.1002/hbm.20359

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, Q. H., Liu, J. Y., Zheng, Y. R., Liang, H. L., and Calhoun, V. D. (2010). Semiblind spatial ICA of fMRI using spatial constraints. Hum. Brain Mapp. 31, 1076–1088. doi: 10.1002/hbm.20919

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, D., Yan, C., Ren, J., Yao, L., Kiviniemi, V. J., and Zang, Y. (2010). Using coherence to measure regional homogeneity of resting-state FMRI signal. Front. Syst. Neurosci. 4:24. doi: 10.3389/fnsys.2010.00024

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, F., Guo, W., Fouche, J.-P., Wang, Y., Wang, W., Ding, J., et al. (2015). Multivariate classification of social anxiety disorder using whole brain functional connectivity. Brain Struct. Funct. 220, 101–115. doi: 10.1007/s00429-013-0641-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Li, M., Lan, W., Wu, F.-X., Pan, Y., and Wang, J. (2016). “Classification of Alzheimer's disease using whole brain hierarchical network,” in IEEE/ACM Transactions on Computational Biology and Bioinformatics. doi: 10.1109/TCBB.2016.2635144

CrossRef Full Text | Google Scholar

Liu, Y., Guo, W., Zhang, Y., Lv, L., Hu, F., Wu, R., et al. (2017). Decreased resting-state interhemispheric functional connectivity correlated with neurocognitive deficits in drug-naive first-episode adolescent-onset schizophrenia. Int. J. Neuropsychopharmacol. 21, 33–41. doi: 10.1093/ijnp/pyx095

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Liang, M., Zhou, Y., He, Y., Hao, Y., Song, M., et al. (2008). Disrupted small-world networks in schizophrenia. Brain 131, 945–961. doi: 10.1093/brain/awn018

PubMed Abstract | CrossRef Full Text | Google Scholar

Lord, A., Horn, D., Breakspear, M., and Walter, M. (2012). Changes in community structure of resting state functional connectivity in unipolar depression. PLoS ONE 7:e41282. doi: 10.1371/journal.pone.0041282

PubMed Abstract | CrossRef Full Text | Google Scholar

Lord, C., Cook, E. H., Leventhal, B. L., and Amaral, D. G. (2000). Autism spectrum disorders. Neuron 28, 355–363. doi: 10.1016/S0896-6273(00)00115-X

PubMed Abstract | CrossRef Full Text | Google Scholar

Lynall, M. E., Bassett, D. S., Kerwin, R., McKenna, P. J., Kitzbichler, M., Muller, U., et al. (2010). Functional connectivity and brain networks in schizophrenia. J. Neurosci. 30, 9477–9487. doi: 10.1523/JNEUROSCI.0333-10.2010

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, S., Correa, N. M., Li, X. L., Eichele, T., Calhoun, V. D., and Adali, T. (2011). Automatic identification of functional clusters in FMRI data using spatial dependence. IEEE Trans. Biomed. Eng. 58, 3406–3417. doi: 10.1109/TBME.2011.2167149

PubMed Abstract | CrossRef Full Text | Google Scholar

Malaspina, D., Owen, M. J., Heckers, S., Tandon, R., Bustillo, J., Schultz, S., et al. (2013). Schizoaffective disorder in the DSM-5. Schizophr. Res. 150, 21–25. doi: 10.1016/j.schres.2013.04.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Marquand, A. F., Wolfers, T., Mennes, M., Buitelaar, J., and Beckmann, C. F. (2016). Beyond lumping and splitting: a review of computational approaches for stratifying psychiatric disorders. Biol. Psychiatry Cogn. Neurosci. Neuroimaging 1, 433–447. doi: 10.1016/j.bpsc.2016.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

McKeown, M. J., Makeig, S., Brown, G. G., Jung, T. P., Kindermann, S. S., Bell, A. J., et al. (1998). Analysis of fMRI data by blind separation into independent spatial components. Hum. Brain Mapp. 6, 160–188.

PubMed Abstract | Google Scholar

Meda, S. A., Clementz, B. A., Sweeney, J. A., Keshavan, M. S., Tamminga, C. A., Ivleva, E. I., et al. (2016). Examining functional resting-state connectivity in psychosis and its subgroups in the bipolar-schizophrenia network on intermediate phenotypes cohort. Biol. Psychiatry Cogn. Neurosci. Neuroimaging 1, 488–497. doi: 10.1016/j.bpsc.2016.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Meda, S. A., Ruaño, G., Windemuth, A., O'neil, K., Berwise, C., Dunn, S. M., et al. (2014). Multivariate analysis reveals genetic associations of the resting default mode network in psychotic bipolar disorder and schizophrenia. Proc. Natl. Acad. Sci. U.S.A. 111, 6864–6864. doi: 10.1073/pnas.1313093111

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, X., Jiang, R., Lin, D., Bustillo, J., Jones, T., Chen, J., et al. (2017). Predicting individualized clinical measures by a generalized prediction framework and multimodal fusion of MRI data. Neuroimage 145, 218–229. doi: 10.1016/j.neuroimage.2016.05.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Meskaldji, D. E., Preti, M. G., Bolton, T. A., Montandon, M. L., Rodriguez, C., Morgenthaler, S., et al. (2016). Prediction of long-term memory scores in MCI based on resting-state fMRI. Neuroimage Clin. 12, 785–795. doi: 10.1016/j.nicl.2016.10.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Meszlényi, R. J., Buza, K., and Vidnyánszky, Z. (2017). Resting state fMRI functional connectivity-based classification using a convolutional neural network architecture. Front. Neuroinform. 11:61. doi: 10.3389/fninf.2017.00061

CrossRef Full Text | Google Scholar

Mika, S., Ratsch, G., Weston, J., Scholkopf, B., and Mullers, K.-R. (1999). “Fisher discriminant analysis with kernels,” in Neural Networks for Signal Processing, IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (New York, NY: IEEE), 41–48.

Google Scholar

Mikolas, P., Melicher, T., Skoch, A., Matejka, M., Slovakova, A., Bakstein, E., et al. (2016). Connectivity of the anterior insula differentiates participants with first-episode schizophrenia spectrum disorders from controls: a machine-learning study. Psychol. Med. 46, 2695–2704. doi: 10.1017/S0033291716000878

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, R. L., Yaesoubi, M., Turner, J. A., Mathalon, D., Preda, A., Pearlson, G., et al. (2016). Higher dimensional meta-state analysis reveals reduced resting fMRI connectivity dynamism in schizophrenia patients. PLoS ONE 11:e0149849. doi: 10.1371/journal.pone.0149849

PubMed Abstract | CrossRef Full Text | Google Scholar

Moritz, C. H., Rogers, B. P., and Meyerand, M. E. (2003). Power spectrum ranked independent component analysis of a periodic fMRI complex motor paradigm. Hum. Brain Mapp. 18, 111–122. doi: 10.1002/hbm.10081

PubMed Abstract | CrossRef Full Text | Google Scholar

Murdaugh, D. L., Shinkareva, S. V., Deshpande, H. R., Wang, J., Pennick, M. R., and Kana, R. K. (2012). Differential deactivation during mentalizing and classification of autism based on default mode network connectivity. PLoS ONE 7:e50064. doi: 10.1371/journal.pone.0050064

PubMed Abstract | CrossRef Full Text | Google Scholar

Murphy, K., Birn, R. M., Handwerker, D. A., Jones, T. B., and Bandettini, P. A. (2009). The impact of global signal regression on resting state correlations: are anti-correlated networks introduced? Neuroimage 44, 893–905. doi: 10.1016/j.neuroimage.2008.09.036

PubMed Abstract | CrossRef Full Text | Google Scholar

Nasrabadi, N. M. (2007). Pattern recognition and machine learning. J. Electron. Imaging 16:049901. doi: 10.1117/1.2819119

CrossRef Full Text | Google Scholar

Ng, A. Y. (2004). “Feature selection, L 1 vs. L 2 regularization, and rotational invariance,” in Proceedings of the Twenty-First International Conference on Machine learning (New York, NY: ACM), 78.

Google Scholar

Nielsen, J. A., Zielinski, B. A., Fletcher, P. T., Alexander, A. L., Lange, N., Bigler, E. D., et al. (2013). Multisite functional connectivity MRI classification of autism: ABIDE results. Front. Hum. Neurosci. 7:599. doi: 10.3389/fnhum.2013.00599

PubMed Abstract | CrossRef Full Text | Google Scholar

Ongür, D., Lundy, M., Greenhouse, I., Shinn, A. K., Menon, V., Cohen, B. M., et al. (2010). Default mode network abnormalities in bipolar disorder and schizophrenia. Psychiatry Res. 183, 59–68. doi: 10.1016/j.pscychresns.2010.04.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Onoda, K., Yada, N., Ozasa, K., Hara, S., Yamamoto, Y., Kitagaki, H., et al. (2017). Can a resting-state functional connectivity index identify patients with Alzheimer's disease and mild cognitive impairment across multiple sites? Brain Connect. 7, 391–400. doi: 10.1089/brain.2017.0507

CrossRef Full Text | Google Scholar

Park, B.-Y., Kim, M., Seo, J., Lee, J.-M., and Park, H. (2016). Connectivity analysis and feature classification in attention deficit hyperactivity disorder sub-types: a task functional magnetic resonance imaging study. Brain Topogr. 29, 429–439. doi: 10.1007/s10548-015-0463-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, J. E., Park, B., Kim, S. J., Kim, H. S., Choi, C. G., Jung, S. C., et al. (2017). Improved diagnostic accuracy of Alzheimer's Disease by combining regional cortical thickness and default mode network functional connectivity: validated in the Alzheimer's disease neuroimaging initiative set. Korean J. Radiol. 18, 983–991. doi: 10.3348/kjr.2017.18.6.983

PubMed Abstract | CrossRef Full Text | Google Scholar

Pearlson, G. D., Clementz, B. A., Sweeney, J. A., Keshavan, M. S., and Tamminga, C. A. (2016). Does biology transcend the symptom-based boundaries of psychosis? Psychiatr. Clin. N. Am. 39, 165–174. doi: 10.1016/j.psc.2016.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, R. C., Smith, G. E., Waring, S. C., Ivnik, R. J., Tangalos, E. G., and Kokmen, E. (1999). Mild cognitive impairment: clinical characterization and outcome. Arch. Neurol. 56, 303–308. doi: 10.1001/archneur.56.3.303

PubMed Abstract | CrossRef Full Text | Google Scholar

Plis, S. M., Hjelm, D. R., Salakhutdinov, R., Allen, E. A., Bockholt, H. J., Long, J. D., et al. (2014). Deep learning for neuroimaging: a validation study. Front. Neurosci. 8:229. doi: 10.3389/fnins.2014.00229

PubMed Abstract | CrossRef Full Text | Google Scholar

Plitt, M., Barnes, K. A., and Martin, A. (2015). Functional connectivity classification of autism identifies highly predictive brain features but falls short of biomarker standards. Neuroimage Clin. 7, 359–366. doi: 10.1016/j.nicl.2014.12.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Poldrack, R. A. (2007). Region of interest analysis for fMRI. Soc. Cogn. Affect Neurosci. 2, 67–70. doi: 10.1093/scan/nsm006

PubMed Abstract | CrossRef Full Text | Google Scholar

Power, J. D., Mitra, A., Laumann, T. O., Snyder, A. Z., Schlaggar, B. L., and Petersen, S. E. (2014a). Methods to detect, characterize, and remove motion artifact in resting state fMRI. Neuroimage 84, 320–341. doi: 10.1016/j.neuroimage.2013.08.048

PubMed Abstract | CrossRef Full Text | Google Scholar

Power, J. D., Schlaggar, B. L., and Petersen, S. E. (2014b). Studying brain organization via spontaneous fMRI signal. Neuron 84, 681–696. doi: 10.1016/j.neuron.2014.09.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Preti, M. G., Bolton, T. A., and Van De Ville, D. (2017). The dynamic functional connectome: state-of-the-art and perspectives. Neuroimage 160, 41–54. doi: 10.1016/j.neuroimage.2016.12.061

PubMed Abstract | CrossRef Full Text | Google Scholar

Qureshi, M. N. I., Oh, J., Cho, D., Jo, H. J., and Lee, B. (2017a). Multimodal discrimination of schizophrenia using hybrid weighted feature concatenation of brain functional connectivity and anatomical features with an extreme learning machine. Front. Neuroinform. 11:59. doi: 10.3389/fninf.2017.00059

CrossRef Full Text | Google Scholar

Qureshi, M. N. I., Oh, J., Min, B., Jo, H. J., and Lee, B. (2017b). Multi-modal, multi-measure, and multi-class discrimination of ADHD with hierarchical feature extraction and extreme learning machine using structural and functional brain MRI. Front. Hum. Neurosci. 11:157. doi: 10.3389/fnhum.2017.00157

CrossRef Full Text | Google Scholar

Rahim, M., Thirion, B., Comtat, C., and Varoquaux, G. (2016). Transmodal learning of functional networks for Alzheimer's disease prediction. IEEE J. Select. Top. Signal Process. 10, 1204–1213. doi: 10.1109/JSTSP.2016.2600400

PubMed Abstract | CrossRef Full Text | Google Scholar

Rashid, B., Arbabshirani, M. R., Damaraju, E., Cetin, M. S., Miller, R., Pearlson, G. D., et al. (2016). Classification of schizophrenia and bipolar patients using static and dynamic resting-state fMRI brain connectivity. Neuroimage 134, 645–657. doi: 10.1016/j.neuroimage.2016.04.051

PubMed Abstract | CrossRef Full Text | Google Scholar

Rashid, B., Damaraju, E., Pearlson, G. D., and Calhoun, V. D. (2014). Dynamic connectivity states estimated from resting fMRI Identify differences among Schizophrenia, bipolar disorder, and healthy control subjects. Front. Hum. Neurosci. 8:897. doi: 10.3389/fnhum.2014.00897

PubMed Abstract | CrossRef Full Text | Google Scholar

Riaz, A., Asad, M., Alonso, E., and Slabaugh, G. (2017). Fusion of fMRI and non-imaging data for ADHD classification. Comput. Med. Imaging Graph. 65, 115–128. doi: 10.1016/j.compmedimag.2017.10.002

CrossRef Full Text

Robinson, L. F., Atlas, L. Y., and Wager, T. D. (2015). Dynamic functional connectivity using state-based dynamic community structure: method and application to opioid analgesia. Neuroimage 108, 274–291. doi: 10.1016/j.neuroimage.2014.12.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Rommelse, N. N., Franke, B., Geurts, H. M., Hartman, C. A., and Buitelaar, J. K. (2010). Shared heritability of attention-deficit/hyperactivity disorder and autism spectrum disorder. Eur. Child Adolesc. Psychiatry 19, 281–295. doi: 10.1007/s00787-010-0092-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Rosa, M. J., Portugal, L., Hahn, T., Fallgatter, A. J., Garrido, M. I., Shawe-Taylor, J., et al. (2015). Sparse network-based models for patient classification using fMRI. Neuroimage 105, 493–506. doi: 10.1016/j.neuroimage.2014.11.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Rubinov, M., and Sporns, O. (2010). Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52, 1059–1069. doi: 10.1016/j.neuroimage.2009.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Sacchet, M. D., Prasad, G., Foland-Ross, L. C., Thompson, P. M., and Gotlib, I. H. (2015). Support vector machine classification of major depressive disorder using diffusion-weighted neuroimaging and graph theory. Front. Psychiatry 6:21. doi: 10.3389/fpsyt.2015.00021

PubMed Abstract | CrossRef Full Text | Google Scholar

Sadaghiani, S., Poline, J. B., Kleinschmidt, A., and D'esposito, M. (2015). Ongoing dynamics in large-scale functional connectivity predict perception. Proc. Natl. Acad. Sci. U.S.A. 112, 8463–8468. doi: 10.1073/pnas.1420687112

PubMed Abstract | CrossRef Full Text | Google Scholar

Sadeghi, M., Khosrowabadi, R., Bakouie, F., Mahdavi, H., Eslahchi, C., and Pouretemad, H. (2017). Screening of autism based on task-free fMRI using graph theoretical approach. Psychiatry Res. Neuroimaging 263, 48–56. doi: 10.1016/j.pscychresns.2017.02.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Sakoglu, U., Pearlson, G. D., Kiehl, K. A., Wang, Y. M., Michael, A. M., and Calhoun, V. D. (2010). A method for evaluating dynamic functional network connectivity and task-modulation: application to schizophrenia. MAGMA 23, 351–366. doi: 10.1007/s10334-010-0197-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sato, J. R., Hoexter, M. Q., Fujita, A., and Rohde, L. A. (2012). Evaluation of pattern recognition and feature extraction methods in ADHD prediction. Front. Syst. Neurosci. 6:68. doi: 10.3389/fnsys.2012.00068

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmidhuber, J. (2015). Deep learning in neural networks: an overview. Neural Netw. 61, 85–117. doi: 10.1016/j.neunet.2014.09.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Schouten, T. M., Koini, M., De Vos, F., Seiler, S., Van Der Grond, J., Lechner, A., et al. (2016). Combining anatomical, diffusion, and resting state functional magnetic resonance imaging for individual classification of mild and moderate Alzheimer's disease. Neuroimage Clin. 11, 46–51. doi: 10.1016/j.nicl.2016.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Shakil, S., Lee, C. H., and Keilholz, S. D. (2016). Evaluation of sliding window correlation performance for characterizing dynamic functional connectivity and brain states. Neuroimage 133, 111–128. doi: 10.1016/j.neuroimage.2016.02.074

PubMed Abstract | CrossRef Full Text | Google Scholar

Sheikhpour, R., Sarram, M. A., Gharaghani, S., and Chahooki, M. A. Z. (2017). A Survey on semi-supervised feature selection methods. Pattern Recogn. 64, 141–158. doi: 10.1016/j.patcog.2016.11.003

CrossRef Full Text | Google Scholar

Shen, H., Li, Z., Zeng, L.-L., Yuan, L., Chen, F., Liu, Z., et al. (2014). Internetwork dynamic connectivity effectively differentiates schizophrenic patients from healthy controls. Neuroreport 25, 1344–1349. doi: 10.1097/WNR.0000000000000267

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, H., Wang, L., Liu, Y., and Hu, D. (2010). Discriminative analysis of resting-state functional connectivity patterns of schizophrenia using low dimensional embedding of fMRI. Neuroimage 49, 3110–3121. doi: 10.1016/j.neuroimage.2009.11.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, X. L., Finn, E. S., Scheinost, D., Rosenberg, M. D., Chun, M. M., Papademetris, X., et al. (2017). Using connectome-based predictive modeling to predict individual behavior from brain connectivity. Nat. Protoc. 12, 506–518. doi: 10.1038/nprot.2016.178

PubMed Abstract | CrossRef Full Text | Google Scholar

Sheng, C., Xia, M., Yu, H., Huang, Y., Lu, Y., Liu, F., et al. (2017). Abnormal global functional network connectivity and its relationship to medial temporal atrophy in patients with amnestic mild cognitive impairment. PLoS ONE 12:e0179823. doi: 10.1371/journal.pone.0179823

PubMed Abstract | CrossRef Full Text | Google Scholar

Silva, R. F., Castro, E., Gupta, C. N., Arbabshirani, M., Potluru, V. K., et al. (2014). “The tenth annual MLSP competition: schizophrenia classification challenge,” in 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (Reims), 1–6.

Google Scholar

Skåtun, K. C., Kaufmann, T., Doan, N. T., Alnæs, D., Córdova-Palomera, A., Jönsson, E. G., et al. (2016). Consistent functional connectivity alterations in schizophrenia spectrum disorder: a multisite study. Schizophr. Bull. 43, 914–924. doi: 10.1093/schbul/sbw145

PubMed Abstract | CrossRef Full Text | Google Scholar

Skidmore, F., Korenkevych, D., Liu, Y., He, G., Bullmore, E., and Pardalos, P. M. (2011). Connectivity brain networks based on wavelet correlation analysis in Parkinson fMRI data. Neurosci. Lett. 499, 47–51. doi: 10.1016/j.neulet.2011.05.030

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, S. M., Vidaurre, D., Beckmann, C. F., Glasser, M. F., Jenkinson, M., Miller, K. L., et al. (2013). Functional connectomics from resting-state fMRI. Trends Cogn. Sci. 17, 666–682. doi: 10.1016/j.tics.2013.09.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Sochat, V., Supekar, K., Bustillo, J., Calhoun, V., Turner, J. A., and Rubin, D. L. (2014). A robust classifier to distinguish noise from FMRI independent components. PLoS ONE 9:e95493. doi: 10.1371/journal.pone.0095493

PubMed Abstract | CrossRef Full Text | Google Scholar

Stephan, K. E., Schlagenhauf, F., Huys, Q. J. M., Raman, S., Aponte, E. A., Brodersen, K. H., et al. (2017). Computational neuroimaging strategies for single patient predictions. Neuroimage 145, 180–199. doi: 10.1016/j.neuroimage.2016.06.038

PubMed Abstract | CrossRef Full Text | Google Scholar

Strittmatter, W. J., Saunders, A. M., Schmechel, D., Pericak-Vance, M., Enghild, J., Salvesen, G. S., et al. (1993). Apolipoprotein E: high-avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer disease. Proc. Natl. Acad. Sci. U.S.A. 90, 1977–1981. doi: 10.1073/pnas.90.5.1977

PubMed Abstract | CrossRef Full Text | Google Scholar

Su, L., Wang, L., Shen, H., Feng, G., and Hu, D. (2013). Discriminative analysis of non-linear brain connectivity in schizophrenia: an fMRI Study. Front. Hum. Neurosci. 7:702. doi: 10.3389/fnhum.2013.00702

PubMed Abstract | CrossRef Full Text | Google Scholar

Suk, H.-I., Wee, C.-Y., Lee, S.-W., and Shen, D. (2016). State-space model with deep learning for functional dynamics estimation in resting-state fMRI. Neuroimage 129, 292–307. doi: 10.1016/j.neuroimage.2016.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, F. T., Miller, L. M., and D'esposito, M. (2004). Measuring interregional functional connectivity using coherence and partial coherence analyses of fMRI data. Neuroimage 21, 647–658. doi: 10.1016/j.neuroimage.2003.09.056

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, H., Lui, S., Yao, L., Deng, W., Xiao, Y., Zhang, W., et al. (2015). Two patterns of white matter abnormalities in medication-naive patients with first-episode schizophrenia revealed by diffusion tensor imaging and cluster analysis. JAMA Psychiatry 72, 678–686. doi: 10.1001/jamapsychiatry.2015.0505

PubMed Abstract | CrossRef Full Text | Google Scholar

Svensén, M., Kruggel, F., and Benali, H. (2002). ICA of fMRI group study data. Neuroimage 16, 551–563. doi: 10.1006/nimg.2002.1122

PubMed Abstract | CrossRef Full Text | Google Scholar

Taghia, J., Ryali, S., Chen, T., Supekar, K., Cai, W., and Menon, V. (2017). Bayesian switching factor analysis for estimating time-varying functional connectivity in fMRI. Neuroimage 155, 271–290. doi: 10.1016/j.neuroimage.2017.02.083

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, Y., Wang, L., Cao, F., and Tan, L. (2012). Identify schizophrenia using resting-state functional connectivity: an exploratory research and analysis. Biomed. Eng. 11:50. doi: 10.1186/1475-925X-11-50

CrossRef Full Text | Google Scholar

Taurines, R., Schwenck, C., Westerwald, E., Sachse, M., Siniatchkin, M., and Freitag, C. (2012). ADHD and autism: differential diagnosis or overlapping traits? A selective review. Atten. Defic. Hyperact. Disord. 4, 115–139. doi: 10.1007/s12402-012-0086-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Teipel, S. J., Grothe, M. J., Metzger, C. D., Grimmer, T., Sorg, C., Ewers, M., et al. (2017). Robust detection of impaired resting state functional connectivity networks in Alzheimer's disease using elastic net regularized regression. Front. Aging Neurosci. 8:318. doi: 10.3389/fnagi.2016.00318

PubMed Abstract | CrossRef Full Text | Google Scholar

Thirion, B., Varoquaux, G., Dohmatob, E., and Poline, J. B. (2014). Which fMRI clustering gives good brain parcellations? Front. Neurosci. 8:167. doi: 10.3389/fnins.2014.00167

PubMed Abstract | CrossRef Full Text | Google Scholar

Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B 58, 267–288.

Google Scholar

Uddin, L. Q., Supekar, K., Lynch, C. J., Khouzam, A., Phillips, J., Feinstein, C., et al. (2013). Salience network–based classification and prediction of symptom severity in children with autism. JAMA Psychiatry 70, 869–879. doi: 10.1001/jamapsychiatry.2013.104

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Den Heuvel, M. P., and Hulshoff Pol, H. E. (2010). Exploring the brain network: a review on resting-state fMRI functional connectivity. Eur. Neuropsychopharmacol. 20, 519–534. doi: 10.1016/j.euroneuro.2010.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Den Heuvel, M., Mandl, R., and Hulshoff Pol, H. (2008). Normalized cut group clustering of resting-state FMRI data. PLoS ONE 3:e2001. doi: 10.1371/journal.pone.0002001

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Os, J., Kenis, G., and Rutten, B. P. (2010). The environment and schizophrenia. Nature 468, 203–212. doi: 10.1038/nature09563

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Schooten, S., Harel, R., Ercan, S., and De Groot, E. (2014). Applying Feature Selection Methods on fMRI Data. Student project report.

Venkataraman, A., Whitford, T. J., Westin, C.-F., Golland, P., and Kubicki, M. (2012). Whole brain resting state functional connectivity abnormalities in schizophrenia. Schizophr. Res. 139, 7–12. doi: 10.1016/j.schres.2012.04.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Vieira, S., Pinaya, W. H., and Mechelli, A. (2017). Using deep learning to investigate the neuroimaging correlates of psychiatric and neurological disorders: Methods and applications. Neurosci. Biobehav. Rev. 74(Pt A), 58–75. doi: 10.1016/j.neubiorev.2017.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Vigo, D., Thornicroft, G., and Atun, R. (2016). Estimating the true global burden of mental illness. Lancet Psychiatry 3, 171–178. doi: 10.1016/S2215-0366(15)00505-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, H. E., Bénar, C. G., Quilichini, P. P., Friston, K. J., Jirsa, V. K., and Bernard, C. (2014). A systematic framework for functional connectivity measures. Front. Neurosci. 8:405. doi: 10.3389/fnins.2014.00405

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, H., Chen, C., and Fushing, H. (2012). Extracting multiscale pattern information of fMRI based functional brain connectivity with application on classification of autism spectrum disorders. PLoS ONE 7:e45502. doi: 10.1371/journal.pone.0045502

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Wilson, R. C., and Hancock, E. R. (2017). “Detecting Alzheimer's disease using directed graphs,” in International Workshop on Graph-Based Representations in Pattern Recognition (Anacapri: Springer), 94–104.

Google Scholar

Wang, K., Jiang, T., Liang, M., Wang, L., Tian, L., Zhang, X., et al. (2006). “Discriminative analysis of early Alzheimer's disease based on two intrinsically anti-correlated networks with resting-state fMRI,” in International Conference on Medical Image Computing and Computer-Assisted Intervention (Copenhagen: Springer), 340–347.

Google Scholar

Wang, X., Jiao, Y., Tang, T., Wang, H., and Lu, Z. (2013). Altered regional homogeneity patterns in adults with attention-deficit hyperactivity disorder. Eur. J. Radiol. 82, 1552–1557. doi: 10.1016/j.ejrad.2013.04.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Watanabe, T., Kessler, D., Scott, C., Angstadt, M., and Sripada, C. (2014). Disease prediction based on functional connectomes using a scalable and spatially-informed support vector machine. Neuroimage 96, 183–202. doi: 10.1016/j.neuroimage.2014.03.067

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C.-Y., Yang, S., Yap, P.-T., Shen, D., and Initiative, A. S. D. N. (2016). Sparse temporally dynamic resting-state functional connectivity networks for early MCI identification. Brain Imaging Behav. 10, 342–356. doi: 10.1007/s11682-015-9408-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C.-Y., Yap, P.-T., Denny, K., Browndyke, J. N., Potter, G. G., Welsh-Bohmer, K. A., et al. (2012a). Resting-state multi-spectrum functional connectivity networks for identification of MCI patients. PLoS ONE 7:e37828. doi: 10.1371/journal.pone.0037828

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C.-Y., Yap, P.-T., Li, W., Denny, K., Browndyke, J. N., Potter, G. G., et al. (2011). Enriched white matter connectivity networks for accurate identification of MCI patients. Neuroimage 54, 1812–1822. doi: 10.1016/j.neuroimage.2010.10.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C.-Y., Yap, P.-T., Zhang, D., Denny, K., Browndyke, J. N., Potter, G. G., et al. (2012b). Identification of MCI individuals using structural and functional connectivity networks. Neuroimage 59, 2045–2056. doi: 10.1016/j.neuroimage.2011.10.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Wing, L. (1996). Autistic spectrum disorders. BMJ 312:327.

Google Scholar

Xu, J. S., Zhang, S., Calhoun, V. D., Monterosso, J., Li, C. S. R., Worhunsky, P. D., et al. (2013). Task-related concurrent but opposite modulations of overlapping functional networks as revealed by spatial IA. Neuroimage 79, 62–71. doi: 10.1016/j.neuroimage.2013.04.038

CrossRef Full Text | Google Scholar

Yaesoubi, M., Adali, T., and Calhoun, V. D. (2018). A window-less approach for capturing time-varying connectivity in fMRI data reveals the presence of states with variable rates of change. Hum. Brain. Mapp. 39, 1626–1636. doi: 10.1002/hbm.23939

PubMed Abstract | CrossRef Full Text | Google Scholar

Yaesoubi, M., Allen, E. A., Miller, R. L., and Calhoun, V. D. (2015a). Dynamic coherence analysis of resting fMRI data to jointly capture state-based phase, frequency, and time-domain information. Neuroimage 120, 133–142. doi: 10.1016/j.neuroimage.2015.07.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Yaesoubi, M., Miller, R. L., and Calhoun, V. D. (2015b). Mutually temporally independent connectivity patterns: a new framework to study the dynamics of brain connectivity at rest with application to explain group difference based on gender. Neuroimage 107, 85–94. doi: 10.1016/j.neuroimage.2014.11.054

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, H., Liu, J., Sui, J., Pearlson, G., and Calhoun, V. D. (2010). A hybrid machine learning method for fusing fMRI and genetic data: combining both improves classification of schizophrenia. Front. Hum. Neurosci. 4:192. doi: 10.3389/fnhum.2010.00192

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, J., and Honavar, V. (1998). Feature subset selection using a genetic algorithm. IEEE Intell. Syst. Appl. 13, 44–49. doi: 10.1109/5254.671091

CrossRef Full Text | Google Scholar

Yang, W., Lui, R. L., Gao, J.-H., Chan, T. F., Yau, S.-T., Sperling, R. A., et al. (2011). Independent component analysis-based classification of Alzheimer's disease MRI data. J. Alzheimers Dis. 24, 775–783. doi: 10.3233/JAD-2011-101371

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Z., Laconte, S., Weng, X., and Hu, X. (2008). Ranking and averaging independent component analysis by reproducibility (RAICAR). Hum. Brain Mapp. 29, 711–725. doi: 10.1002/hbm.20432

PubMed Abstract | CrossRef Full Text | Google Scholar

Yoo, K., Rosenberg, M. D., Hsu, W. T., Zhang, S., Li, C. R., Scheinost, D., et al. (2018). Connectome-based predictive modeling of attention: comparing different functional connectivity features and prediction methods across datasets. Neuroimage 167, 11–22. doi: 10.1016/j.neuroimage.2017.11.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, L., and Liu, H. (2003). “Feature selection for high-dimensional data: a fast correlation-based filter solution,” in Proceedings of the 20th International Conference on Machine Learning (ICML-03), 856–863.

Google Scholar

Yu, Q. B., Erhardt, E. B., Sui, J., Du, Y. H., He, H., Hjelm, D., et al. (2015). Assessing dynamic brain graphs of time-varying connectivity in fMRI data: application to healthy controls and patients with schizophrenia. Neuroimage 107, 345–355. doi: 10.1016/j.neuroimage.2014.12.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Q., Sui, J., Liu, J., Plis, S. M., Kiehl, K. A., Pearlson, G., et al. (2013). Disrupted correlation between low frequency power and connectivity strength of resting state brain networks in schizophrenia. Schizophr. Res. 143, 165–171. doi: 10.1016/j.schres.2012.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, R., Zhang, H., An, L., Chen, X., Wei, Z., and Shen, D. (2017). Connectivity strength-weighted sparse group representation-based brain network construction for MCI classification. Hum. Brain Mapp. 38, 2370–2383. doi: 10.1002/hbm.23524

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Y., Shen, H., Zeng, L.-L., Ma, Q., and Hu, D. (2013a). Convergent and divergent functional connectivity patterns in schizophrenia and depression. PLoS ONE 8:e68250. doi: 10.1371/journal.pone.0068250

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Y., Shen, H., Zhang, H., Zeng, L.-L., Xue, Z., and Hu, D. (2013b). Functional connectivity-based signatures of schizophrenia revealed by multiclass pattern analysis of resting-state fMRI from schizophrenic patients and their healthy siblings. Biomed. Eng. 12:10. doi: 10.1186/1475-925X-12-10

CrossRef Full Text | Google Scholar

Zalesky, A., and Breakspear, M. (2015). Towards a statistical test for functional connectivity dynamics. Neuroimage 114, 466–470. doi: 10.1016/j.neuroimage.2015.03.047

PubMed Abstract | CrossRef Full Text | Google Scholar

Zalesky, A., Fornito, A., Cocchi, L., Gollo, L. L., and Breakspear, M. (2014). Time-resolved resting-state brain networks. Proc. Natl. Acad. Sci. U.S.A. 111, 10341–10346. doi: 10.1073/pnas.1400181111

PubMed Abstract | CrossRef Full Text | Google Scholar

Zang, Y., Jiang, T., Lu, Y., He, Y., and Tian, L. (2004). Regional homogeneity approach to fMRI data analysis. Neuroimage 22, 394–400. doi: 10.1016/j.neuroimage.2003.12.030

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Zhou, L., Wang, L., and Li, W. (2015). Functional brain network classification with compact representation of SICE matrices. IEEE Trans. Biomed. Eng. 62, 1623–1634. doi: 10.1109/TBME.2015.2399495

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Hu, B., Ma, X., and Xu, L. (2015). Resting-state whole-brain functional connectivity networks for MCI classification using l2-regularized logistic regression. IEEE Trans. Nanobioscience 14, 237–247. doi: 10.1109/TNB.2015.2403274

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Y., Yu, F., and Duong, T. (2014). Multiparametric MRI characterization and prediction in autism spectrum disorder using graph theory and machine learning. PLoS ONE 9:e90405. doi: 10.1371/journal.pone.0090405

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, C.-Z., Zang, Y.-F., Cao, Q.-J., Yan, C.-G., He, Y., Jiang, T.-Z., et al. (2008). Fisher discriminative analysis of resting-state brain function for attention-deficit/hyperactivity disorder. Neuroimage 40, 110–120. doi: 10.1016/j.neuroimage.2007.11.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, D., Li, K., Terry, D. P., Puente, A. N., Wang, L., Shen, D., et al. (2014). Connectome-scale assessments of structural and functional connectivity in MCI. Hum. Brain Mapp. 35, 2911–2923. doi: 10.1002/hbm.22373

PubMed Abstract | CrossRef Full Text | Google Scholar

Zou, H., and Hastie, T. (2005). Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 67, 301–320. doi: 10.1111/j.1467-9868.2005.00503.x

CrossRef Full Text | Google Scholar

Zuo, X. N., Kelly, C., Adelstein, J. S., Klein, D. F., Castellanos, F. X., and Milham, M. P. (2010). Reliable intrinsic connectivity networks: test-retest evaluation using ICA and dual regression approach. Neuroimage 49, 2163–2177. doi: 10.1016/j.neuroimage.2009.10.080

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: fMRI, functional connectivity, biomarker, classification, brain disorders

Citation: Du Y, Fu Z and Calhoun VD (2018) Classification and Prediction of Brain Disorders Using Functional Connectivity: Promising but Challenging. Front. Neurosci. 12:525. doi: 10.3389/fnins.2018.00525

Received: 22 March 2018; Accepted: 12 July 2018;
Published: 06 August 2018.

Edited by:

Russell A. Poldrack, Stanford University, United States

Reviewed by:

Emily Finn, National Institute of Mental Health (NIMH), United States
Dante R. Chialvo, Center for Complex Systems & Brain Sciences (CEMSC3), Argentina

Copyright © 2018 Du, Fu and Calhoun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuhui Du, eWR1QG1ybi5vcmc=

Co-first author.

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.