Identifying and Predicting Autism Spectrum Disorder Based on Multi-Site Structural MRI With Machine Learning

Duan, YuMei; Zhao, WeiDong; Luo, Cheng; Liu, XiaoJu; Jiang, Hong; Tang, YiQian; Liu, Chang; Yao, DeZhong

doi:10.3389/fnhum.2021.765517

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 22 February 2022

Sec. Brain Imaging and Stimulation

Volume 15 - 2021 | https://doi.org/10.3389/fnhum.2021.765517

This article is part of the Research Topic AI-based computer-aided diagnosis and prognosis for psychiatric disorders View all 5 articles

Identifying and Predicting Autism Spectrum Disorder Based on Multi-Site Structural MRI With Machine Learning

$\nYuMei Duan&#x;$ YuMei Duan¹^†

WeiDong Zhao²^†

Cheng Luo³^†

XiaoJu Liu⁴

Hong Jiang⁵^†

YiQian Tang²

Chang Liu^2,3^*

DeZhong Yao³^*

¹Department of Computer and Software, Chengdu Jincheng College, Chengdu, China
²College of Computer, Chengdu University, Chengdu, China
³The Key Laboratory for Neuro Information of Ministry of Education, Center for Information in Bio Medicine, High-Field Magnetic Resonance Brain Imaging Key Laboratory of Sichuan Province, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
⁴Department of Abdominal Oncology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
⁵Department of Neurosurgery, Rui-Jin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China

Although emerging evidence has implicated structural/functional abnormalities of patients with Autism Spectrum Disorder(ASD), definitive neuroimaging markers remain obscured due to inconsistent or incompatible findings, especially for structural imaging. Furthermore, brain differences defined by statistical analysis are difficult to implement individual prediction. The present study has employed the machine learning techniques under the unified framework in neuroimaging to identify the neuroimaging markers of patients with ASD and distinguish them from typically developing controls(TDC). To enhance the interpretability of the machine learning model, the study has processed three levels of assessments including model-level assessment, feature-level assessment, and biology-level assessment. According to these three levels assessment, the study has identified neuroimaging markers of ASD including the opercular part of bilateral inferior frontal gyrus, the orbital part of right inferior frontal gyrus, right rolandic operculum, right olfactory cortex, right gyrus rectus, right insula, left inferior parietal gyrus, bilateral supramarginal gyrus, bilateral angular gyrus, bilateral superior temporal gyrus, bilateral middle temporal gyrus, and left inferior temporal gyrus. In addition, negative correlations between the communication skill score in the Autism Diagnostic Observation Schedule (ADOS_G) and regional gray matter (GM) volume in the gyrus rectus, left middle temporal gyrus, and inferior temporal gyrus have been detected. A significant negative correlation has been found between the communication skill score in ADOS_G and the orbital part of the left inferior frontal gyrus. A negative correlation between verbal skill score and right angular gyrus and a significant negative correlation between non-verbal communication skill and right angular gyrus have been found. These findings in the study have suggested the GM alteration of ASD and correlated with the clinical severity of ASD disease symptoms. The interpretable machine learning framework gives sight to the pathophysiological mechanism of ASD but can also be extended to other diseases.

1. Introduction

Autism Spectrum Disorder, known as ASD, is a complex neuro-developmental disorder and has been characterized by a series of symptoms including early-onset difficulties in social communication as well as restricted, repetitive behaviors and interests (Pagnozzi et al., 2018). The symptoms of ASD generally occur within the first 3 years of life and tend to last even one's whole life (Hazlett et al., 2017). ASD brings significant impairments on an individual's language, emotions, behavior, self-control, learning, and memory and also is accompanied by intellectual disability. Moreover, it is reported that patients with ASD are far more likely to encounter premature death than healthy controls (Hirvikoski et al., 2015). According to the Morbidity and Mortality Weekly Report (MMWR) Series published by the Centers for Disease Controls and Prevention (CDC) in the United States, the prevalence of ASD among children has increased from 1 in 150 to 1 in 54 over 16 years (from 2000 to 2016) and the incidence rate of ASD was 4.3 times higher in boys than girls (Maenner et al., 2020) in 2016. For each patient with ASD, the average lifetime social cost is approximately $3.6 million (Cakir et al., 2020).

Actually, if ASD is unable to be detected and intervened at an earlier age, the impairments are irreversible. Therefore, early and accurate identification and diagnosis are crucial to improving the life quality of ASD patients and their families. Unfortunately, it is notoriously difficult to diagnose, especially in children, since the cause of ASD is a result of combined factors, including genetics, the structure and function of the brain, as well as environmental influences (Rakić et al., 2020). Until now, there are still no effective medical treatments for ASD. For the current practice guidelines to assess, diagnose and treat ASD, it is recommended to use the behavioral observation of symptomology following the Diagnostic and Statistical Manual (Fifth Edition) (DSM-5) (American Psychiatric Association, 2013) symptom criteria and the International Classification of Mental and Behavioral Disorders (Tenth Edition) (ICD-10) (Organization, 1993). However, uniformity is lacking while using these practice guidelines, so it is probably prone to misdiagnosis (Eslami et al., 2021). Furthermore, these guidelines cannot point out the biological bases related to behavioral symptoms due to unclear neuroanatomy. Finally, these limitations have resulted in calls for more optimal diagnostic approaches for ASD.

In the last few decades, advances in non-invasive neuroimaging techniques and analysis have provided crucial knowledge to uncover patterns of brain structure and function that would be symptomatic for the autism spectrum. The vast majority of statistical methods on Structural MRI have intended to explore the common patterns between patients with ASD and healthy groups, but previous volumetric and morphological analysis on structural MRI often has derived contradicted results. For example, some research work reported decreased volumes of the amygdala for ASD (van Rooij Daan et al., 2017) while others did not find significant alterations (Maier et al., 2015). Focusing on the hippocampus volumes, some reported its reduction, others reported its enlargement or no changes (Barnea-Goraly et al., 2014; Maier et al., 2015). Xiao et al. (2014) has found that both gray and white matter (WM) has a significant increment with ASD, and Hazlett et al. (2017) has pointed out brain volume overgrowth is related to the emergence and severity of ASD. While Palmen et al. (2005) and Jou et al. (2011) have noted that there is no difference or decreased WM volume between ASD and healthy controls, and Riddle et al. (2016) conducted voxel-based morphometry analysis and found that the total brain volume and the left anterior superior temporal gyrus increased for children aged 2–4 with ASD. But these brain structural abnormalities are subtle at later ages (Riedel et al., 2014). These inconsistent findings are most likely due to different collecting approaches and limited sample size with heterogeneous characteristics of subjects (Riddle et al., 2016). Moreover, traditional statistical analysis is based on mass univariate techniques which process a single voxel independently and ignore the relationship between voxels (Bonnici et al., 2012; Samartsidis et al., 2016). Furthermore, it defines the common pattern at the level of groups and is unable to predict the unknown sample at the level of individuals (Zhutovsky et al., 2019; Hu et al., 2021).

Most recently, the rapid advance of machine learning has made it becomes possible to explore the underlying neural mechanisms and provide accurate predictions and convincing explanations for ASD from various aspects (Khodatars et al., 2020; Eslami et al., 2021). Knutson (2013) has pointed out that machine learning can detect differences in neuroimaging data that might not be detected with traditional univariate analysis. In previous studies, typical statistical machine learning and deep learning have been utilized to identify ASD from NC in terms of structural and functional alterations. Statistical machine learning requires the design of handmade features (feature extraction/feature selection) and implement the identification of patients with ASD based on these features (feature classification). Ecker et al. (2010) has applied SVM to investigate the whole-brain differences of GM and WM volume on 44 subjects and obtained significant predictive power. Additionally, it has been found that these brain differences are related to symptom severity. Ecker et al. (2010) and Wee et al. (2014) have extracted morphological features based on structural images and used SVM or multi-kernel technique to achieve satisfactory results. Furthermore, Zheng et al. (2018) have constructed a multi-feature-based network based on morphological features to explore the cortico-cortical similarities of ASD. Bilgen et al. (2020) have modeled the morphological relationship between pairs of ROIs with a cortical networks and verified the classification performance of different machine learning methods. Concerning female children, Calderoni et al. (2012) have detected the abnormality of the gray matter volume based on SVM-RFE (Leung et al., 2006; ChenZhiHong et al., 2020) and found the increased cortical volume in some brain regions involving the left superior frontal gyrus (SFG). In addition, bilateral SFG and right temporoparietal junction (TPJ) resulted in the appearance of some atypical symptoms of ASD and might be relevant to the pathophysiology of female children in ASD. These findings are helpful to reveal the important influence of the structural alterations and the relationship between the brain structure and the pathophysiology of ASD.

However, the lack of a sufficiently large sample at a single site probably leads to poor generalizability that is notably serious for neuroimaging due to limited participants and super-high dimensionality of data. Consequently, the investigation of large sample data from multi-site has attracted increasing attention. Some studies (Spera et al., 2019; Mwiza et al., 2020) have figured out the superiority of machine learning for the classification of multi-site data based on fMRI or the combining structural and MRI in the Autism Imaging Data Exchange database (Di Martino et al., 2017). Due to the excellent performance of deep learning in the field of artificial intelligence on large sample data, some researchers have begun to detect abnormalities of functional connectivity based on deep learning. Deep learning has combined dimensionality reduction and feature classification and implemented the end-to-end classification model automatically, which has achieved satisfaction performance (Eslami et al., 2019; Sherkatghanad et al., 2020). Furthermore, some attempts have been done to fuse structural and functional features with the model of deep learning to improve the classification performance (Rakić et al., 2020). But it cannot be denied that deep learning handles data with the mechanism of a black box and it is so hard to identify the abnormal brain regions and connect the classification accuracy with the underlying mechanism of ASD. Furthermore, multi-site data also has brought the issue of data heterogeneity due to different scanning parameters and participant populations. The direct way to address the heterogeneity issue is to apply dimensionality reduction to transform source data into features in the field of machine learning (Wang et al., 2020). Furthermore, these studies also utilized leave-one-site-out cross-validation to evaluate the classification performance in the expectation of reducing the impact of heterogeneity simultaneously (Rakić et al., 2020; Eslami et al., 2021). In order to make the results robust, Ashourvan et al. (2016) have further proposed intra-site cross-validation and inter-site cross-validation and achieved 65% accuracy with functional connections (FC) to identify ASD from normal control.

In fact, detecting the structural/functional brain alterations is vital to reveal the pathological mechanism of ASD. In particular, these brain regions with obvious differences can be recognized as the neuro-imaging biomarkers related to the disease. Based on this kind of neuro-imaging biomarkers (brain regions), achieving excellent classification performance even from different multi-sites would be the most desirable and helpful in the clinical diagnosis. Meanwhile, aiming at a few investigations on volumetric changes based on machine learning, this study has applied machine learning techniques followed the unified framework to implement model-level, feature-level, and biology-level assessment successively. First of all, a searchlight-based classification method has been used to detect the volumetric changes locally and some candidate brain regions have been defined based on the areas of the volumetric changes at the model-level assessment; Regarding distinguished brain regions, this study has processed the “visual lesion” analysis at the feature-level assessment. Stability based on nested cross-validation and multi-site validation of each region has been evaluated. The candidate regions with good stability performance have been preserved and considered as candidate biomarkers related to ASD. Finally, this study investigated the relationship between candidate biomarkers and symptom severity and analyzed our results with previous findings.

The main contributions of the study are discussed as follows: (1) Previous machine learning studies on ASD mainly focus on the classification performance or the important features. Furthermore, this study paid attention to the interpretability of the machine learning model to explore abnormal brain regions related to ASD and conducted model level and feature level assessment to ensure the robustness and stability of the results. (2) The correlation analysis between abnormal brain regions and clinical severity in our study has further proved the relationship between the volume changes of some specific brain regions and the clinical symptom. (3) The findings in our study are partly consistent with previous research work. The abnormal gray matter (GM) volume in the temporal lobe, Broca and Wernicke area probably provides the support for the social brain hypothesis and the broken mirror theory of ASD, which is helpful to understand the neuroanatomy of ASD.

The structure of this study is as follows: First, in section 2, we provide a brief introduction to the pre-processing procedure and statistical analysis of sMRI data. In section 3, we describe the machine learning workflow in detail. Experimental results and discussion are provided in section 4. Finally, in section 4, we conclude the study and discuss the future direction.

2. Material

2.1. Participants

All data carried in the present study came from the Autism Imaging Data Exchange (ABIDE II) (http://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html). Briefly, ABIDE with ABIDE I and ABIDE II is a public repository that provides structural MRI and resting-state fMRI acquired on ASD and matched control subjects for the purpose of data sharing and scientific research (Martino et al., 2013). The ABIDE II includes 1,114 data sets from 19 independent sites which comprise 521 participants with ASD and 593 typically developing controls (TDC) with the age from 5 to 64. All participants in ABIDE have received approval from the Institutional Review Board (IRB) of each site. In the present study, we have selected three independent datasets from Georgetown University (GU), Oregon Health and Science University (OHSU), and University of California Los Angeles (UCLA) which are collected by the same scanner (Siemens) and all participants are children with the age from 7 to 15 to reduce the variability of multi-site neuroimaging data. Since GU has the greatest participants, machine learning methods were conducted on GU with nested cross-validation. Furthermore, we also trained machine learning models for GU and tested them on OHSU and UCLA to verify their robustness. Demographics information of participants is summarized in Table 1. The scanning parameters of the three sites are listed in Table 2.

TABLE 1

Table 1. Demographics information.

TABLE 2

Table 2. The scanning parameters of structural MRI imaging in Georgetown University (GU), Oregon Health and Science University (OHSU), and University of California Los Angeles (UCLA) with Siemens.

2.2. MRI Data Pre-processing

All structural images were processed using the SPM8 package (Welcome Trust Center for Neuroimaging, London, UK, http://www.filion.ucl.ac.uk/spm/software/spm8/) and the VBM8 (Voxel-Based Morphometry) toolbox (http://dbm.neuro.uni-jena.de/vbm) running under Matlab R2014a (Mathworks). At first, all T1-weighted images were corrected for bias-field inhomogeneities and then segmented into GM, WM, and CSF (cerebrospinal fluid) based on a tissue probability map (Mazziotta et al., 1995). The segmented GM/WM image was spatially normalized to the “IXI500_MNI152” template based on the DARTEL algorithm (Ashburner, 2007). After that, non-linear warping for the effect of spatial normalization was corrected to generate GM/WM modulated normalized images. Finally, spatial smoothing (Gaussian kernel with 6 mm full-width at half-maximum) was conducted on GM/WM images to remove noise.

2.3. Statistical Analysis

In the present study, a two-sample t-test has been employed on the GU dataset with age, gender, Total Intracranial Volume (TIV) as the effect-of-no-interest covariates to identify group differences between ASD and TDC. A significance level of p < 0.001 (uncorrected) was established with an extent threshold of 50 voxels. Meanwhile, an absolute threshold mask of 0.1 was used on GM/WM volume images to avoid potential edge effects.

3. Methods

This study aimed to identify the brain abnormality and predict ASD from TDC via machine learning techniques. However, neuroimaging-based ML models like the “black-box” and unable to be understood from the prospect of neuroscience. To address this issue, Kohoutov et al. (2020) has developed a unified framework to enhance the interpretability of ML models and provide mechanistic insights into underlying neural or disease processes. The proposed framework contains a three-stage process of assessment including Model-level assessment, Feature-level assessment, and Biology-level assessment. In the first stage, the ML model has been built from observations and assessed in terms of its sensitivity, specificity, and generalizability. In the second stage, significant features have been identified from a prediction within the model. Finally, the neuroscientific plausibility of the ML model has been proved with evidence from previous literature and other studies.

However, ML models based on neuroimaging are often built on numerous features and limited participants, which makes the model is prone to overfitting and leads to poor generalization and expensive computational cost even if dimensional reduction techniques have been used. Moreover, isolated features are often insufficient to acquire satisfactory predictive performance and explain the model performance. Consequently, the study has designed a neighborhood-to-regional machine learning workflow within this unified framework to identify structural alteration and discriminant ASD from TDC. The workflow proposed in the study has been illustrated in Figure 1.

FIGURE 1

Figure 1. The machine learning workflow proposed in the study.

3.1. Model-Level Assessment

First, the study has built an ML model based on the searchlight technique (Kriegeskorte et al., 2006). A spherical window is centered at each voxel to generate a data matrix from the voxel and its neighbors. In light of the spherical window, PCA(Principal Component Analysis) has been used to reduce the dimensionality of the matrix, and SVM(Support Vector Machine) has been used to achieve the classification.

3.1.1. Principal Component Analysis

Supposed data matrix obtained from training data $x = {x_{1}, x_{2}, . . . x_{m}} \in ℝ^{m \times n}$ is obtained from a spherical window, where m is the number of subjects in the training dataset, n represents the voxel number centered a specific voxel within a spherical window, PCA (Wold et al., 1987) has been used to reduce the dimensionality of the matrix by transforming high-dimensional data into lower-dimensional features while preserving its maximum variance. To this end, data points are projected from high-dimensional space to low-dimensional space with the following linear combinations:

\begin{array}{l} y = \sum_{j = 1}^{n} a_{j} x_{j} = X a & (1) \end{array}

where $a = {a_{1}, a_{2}, . . ., a_{n}} \in ℝ^{n \times k}$ and k ≪ n, y is the low-dimensional features. Meanwhile, the variance of the low-dimensional feature is given by:

\begin{array}{l} var (X a) = a^{T} S a & (2) \end{array}

where S is the symmetric covariance matrix of data samples. Hence, the linear combination with maximum variance can be achieved by optimizing the following problem:

\begin{array}{l} max a^{T} S a s . t . a^{T} a = 1 & (3) \end{array}

In light of the Lagrangian multiplier method with the restrictions of orthogonality of different coefficient vectors, we can obtain the following equation easily:

\begin{array}{l} S a - λ a = 0 \Leftrightarrow S a = λ a & (4) \end{array}

where a is the orthonormal eigenvector of the covariance matrix S and λ refers to the corresponding eigenvalue. Thus, the maximum variance corresponds to the largest eigenvalue as follow:

\begin{array}{l} max var (X a) = max a^{T} S a = max λ a^{T} a = max λ & (5) \end{array}

As a consequence, the eigenvectors of S corresponding to the first k largest eigenvalues can be considered as the coefficient vectors a and these linear combinations Xa_k are called the principal components (PCs) of the dataset. The quality of a given PC Xa_j is measured according to the following proportion of total variance:

\begin{array}{l} π_{j} = \frac{λ_{j}}{\sum_{j = 1}^{n} λ_{j}} = \frac{λ_{j}}{tr (S)} & (6) \end{array}

where tr(S) denotes the trace of S, and λ_j is the jth eigenvalue of S. The proportion of total variance preserved by a set of S of PCs can be expressed as a percentage of total variance as follows:

\begin{array}{l} \sum_{j = 1}^{k} π_{j} = \frac{\sum_{j = 1}^{k} λ_{j}}{tr (S)} & (7) \end{array}

In practice, it is common to use some predefined percentage of the total variance to decide how many PCs should be retained, rather than setting the number of the coefficient vector k directly. In our study, 80% of total variability has been used.

3.1.2. Support Vector Machine

After that, supposed a set of feature-label pairs(f_i, y_i), $i = 1, \dots, m, f_{i} \in ℝ^{k}, y_{i} \in {- 1, + 1}$ , the classification with linear SVM (Fan et al., 2008) has been implemented according to solving the following unconstrained optimization problem:

\begin{array}{l} min_{w} \frac{1}{2} w^{T} w + C \sum_{i = 1}^{l} ξ (w; f_{i}, y_{i}) & (8) \end{array}

where C is a penalty parameter and the loss function $ξ (w; f_{i}, y_{i}) = max {(1 - y_{i} w^{T} f_{i}, 0)}^{2}$ .

When a new testing data point x arrives, it can be projected into low-dimensional space by PCA as follows:

\begin{array}{l} x^{'} = x a & (9) \end{array}

and then the low-dimensional feature x′ is predicted as positive if w^Tx > 0 and negative, otherwise. In the present study, the number of the coefficient vector k of PCA is determined when preserving the energy of PCs is 80% and the penalty parameter C = 1 of linear SVM is used in default.

The classification accuracy of a spherical window around a specific voxel has indicated how well centered voxel in the local spherical neighborhood differentiates between different groups. According to slide the spherical searchlight window on each voxel of GM/WM images, a 3D accuracy map has been obtained to explore the local spatial pattern of GM/WM volume. The ML model based on the neighborhood window is useful to relieve overfitting and computational cost problem.

In order to assess the robustness of the results, the 5-fold cross-validation has been employed. For 5-fold cross-validation, the dataset has been divided randomly into five equal subsets. One subset has been used for testing and the other subsets have been used for training machine learning models. Repeating this process five times, the average 3D accuracy map has been obtained to evaluate the local structural differences between ASD and TDC ultimately. Generally, 5-fold cross-validation also has been repeated several times to enhance the robustness of the results. The higher accuracy of the voxels have, the more significant structural changes around the voxels. Similar to the previous study (Feng et al., 2012), a rigorous threshold (70% in the present study) has been set to identify meaningful clusters (features) with a cluster size larger than 50 voxels. The brain regions involved in these clusters can be considered as candidate brain regions that are related to structural alteration.

3.2. Feature-Level Assessment

Feature-level assessment in the study has processed the ‘virtual lesion' analysis based on these candidate brain regions (Chang et al., 2015) involved in the above clusters. Originally, the “virtual lesion” analysis has been applied to investigate how individual regions or networks contribute to the prediction of ML models by removing or using each region or network at a time from the model based on a selected parcellation. Based on AAL parcellation, we have divided the clusters identified in model-level assessment into different brain regions and utilized the “virtual lesion” analysis to investigate their classification performance separately based on three different ML models including PCA+Ridge, PCA+SVM, and Bagging. For PCA+Ridge and PCA+SVM, PCA has been used to reduce the dimensionality of data, and Ridge/SVM has been used as the classifier, respectively.

3.2.1. Ridge Classifier

Ridge method has been proposed to solve the regression problem originally by imposing a penalty on the coefficient vector w on the following objective function (Rifkin et al., 2003):

\begin{array}{l} min_{w} ‖ X w - y ‖_{2}^{2} + α ‖ w ‖_{2}^{2} & (10) \end{array}

where X is the dataset, y is the data label. The penalty factor α is used to control the amount of shrinkage. The larger the value of α, the greater the amount of shrinkage. When utilized for classification problems, the Ridge classifier converts binary targets y to {−1, +1} and treats them as regression tasks, optimizing the above objective function.

3.2.2. Bagging Classifier

As a kind of ensemble algorithms, the bagging method has used a base estimator to build several instances from random subsets of the original training set and then average the predictions of these instances to drive a final prediction, which is helpful to reduce the variance of a base estimator. In this present study, the base estimator used a decision tree by default. Given training vectors $x_{i} \in ℝ^{n}, i = 1, \dots m$ and the label vector y ∈ ℝ^l, a decision tree employs a tree to model the classification problem, which partitions the feature space recursively to make samples with the same label grouped together. Supposed the data at node m be expressed by Q_m with N_m samples. For each candidate split θ = (j, t_m) consisting of a feature j and threshold t_m, partition training data into $Q_{m}^{l e f t} (θ)$ and $Q_{m}^{r i g h t} (θ)$ subsets as follows:

\begin{array}{l} Q_{m}^{l e f t} (θ) = {(x, y) ∣ x_{j} < = t_{m}} & (11) \end{array}

\begin{array}{l} Q_{m}^{right} (θ) = Q_{m} \ Q_{m}^{left} (θ) & (12) \end{array}

The “best” split has been determined according to the following objective function:

\begin{array}{l} θ^{*} = {argmin}_{θ} G (Q_{m}, θ) & (13) \end{array}

where

\begin{array}{l} G (Q_{m}, θ) = \frac{N_{m}^{l e f t}}{N_{m}} H (Q_{m}^{l e f t} (θ)) + \frac{N_{m}^{right}}{N_{m}} H (Q_{m}^{right} (θ)) & (14) \end{array}

and H(Q_m) is the impurity function using Gini index to evaluate the performance of the candidate split whether they grouped samples with the same label into the same group:

\begin{array}{l} H (Q_{m}) = \sum_{k} p_{m k} (1 - p_{m k}) & (15) \end{array}

where p_mk is the probability of picking up a data point with class label k in node m. For subsets $Q_{m}^{left} (θ^{*})$ and $Q_{m}^{right} (θ^{*})$ , the same procedure was executed recursively until the maximum depth is reached.

3.2.3. Hyperparameter Tuning Based on Optuna

Since ML models are sensible to the setting of hyper-parameters, the hyper-parameter tuning technique based on Optuna has been employed (Akiba et al., 2019) to seek the optimal parameters for these models. With Optuna, the optimal percentage of the total variance in PCA has been searched from 0.6 to 0.99 with step 0.1. The optimal penalty parameter C of SVM and the optimal penalty coefficient α of Ridge have been searched from 10⁻¹⁰ to 10¹⁰ satisfied a uniform distribution in the log domain. For Bagging, the optimal number of the estimator has been searched from 3 to 30 with step 1. The optimal percentage of samples and features to draw from dataset X to train each base estimator has been searched from 0.5 to 1 with a uniform distribution in the linear domain.

3.3. Biology-Level Assessment

To explore the association between the regional GM volume reduction and the clinical severity of ASD, the study has performed correlation analysis of regional GM volume of candidate biomarkers with the clinical scores with ADI_R (the Autism Diagnostic Interview–Revised) (Lord et al., 1994) and ADOS_G (the Autism Diagnostic Observation Schedule) (Lord, 2000). ADI_R and ADOS_G are considered as the “gold standard” assessment measures in the evaluation of ASD. ADOS_G is a semi-structured, standardized assessment of communication, social interaction and play and imaginative use of materials for individuals. However, unlike ADOS_G, ADI_R is a comprehensive parent interview to measure social interaction, communication and language, and repetitive, restricted, and stereotyped interests and behavior. Scores assessed by ADOS_G and ADI_R are able to reflect the symptom severity of ASD. Meanwhile, the study also has compared these findings with previous literature in the section of “Discussion” to explore the neurobiological meaning of the structural alteration.

4. Experiments and Results

4.1. Experiments Setting

All experiments have been implemented on Ubuntu 16.4 with Python 3.7 and sklearn package 2.4. In the stage of feature-level assessment, 5-fold nested cross-validation has been conducted on GU dataset with ten iterations to evaluate the robustness and generalizability of ML models on individual candidate brain regions. For 5-fold nested cross-validation, it consists of an outer loop and an inner loop. During the outer loop, the dataset is split randomly into five equal subsets. Among these subsets, one subset is test data and the other subsets are training data. During the inner loop, the training data further is divided into five equal subsets, one subset is validation data and the rest subsets have been used to test the performance of different hyper-parameters. Therefore, the inner loop is used to tune the hyper-parameters, and the outer loop is used to estimate the model performance with optimal hyper-parameters. Besides, a multi-site validation also has been adopted, which trains ML models on GU and tests the predictive performance of the models using OHSU and UCLA. For each ML model of an individual brain region, the stability analysis has been conducted in terms of sensitivity, specificity, and permutation test. Sensitivity and specificity are two important metrics to measure the predictive ability of the ML model. Selecting the optimal balance between sensitivity and specificity depends on the purpose for which the test is used. The study has defined a threshold of 20% to quantify the differences between sensitivity and specificity, and a good balance between sensitivity and specificity should be less than the threshold. Furthermore, the permutation test (Ojala et al., 2010) was also used to evaluate the statistical significance of the predictive performance for each brain region. For the permutation test, the class labels of training data were randomly permuted and then 5-fold cross-validation was performed on the permuted training set. The permutation was repeated 5,000 times. During the permutation test, the statistical significance p is defined as the percentage of the accuracies that was equal to or greater than the accuracies obtained from the non-permuted data. Brain regions with p < 5% (p < 0.05) were considered statistically significant. Brain regions without a good balance between sensitivity and specificity and without statistical significance in permutation tests have been excluded from candidate brain regions. The final candidate brain regions have considered the structural biomarkers related to ASD.

4.2. The Results of Statistical Analysis

The between-group differences found by the two-sample t-test on GU have been illustrated in Figure 2 and Table 3. It can be found that the atrophy of the GM volume is widespread covering the frontal lobe, parietal lobe, temporal lobe, occipital lobe, insula and the limbic system, especially nearby insula, temporal lobe, and inferior parietal lobule. The atrophy brain regions involved in brain function including visual information processing (e.g., BA18, BA19, and BA20), the language understanding, processing and representation and auditory processing (e.g., BA21, BA22, BA39, BA40, BA44, and BA47), emotion regulation (e.g., BA23), Olfactory function (e.g., BA25, BA28), and face recognition (e.g., BA37), cognitive function (e.g., BA10), visual-motor coordination (e.g., BA7), and emotional correlation (e.g., BA13). ASD patient's dysfunction to some extent probably means that the decrease of GM volume is related to ASD. In addition, there were no significant volumetric differences for WM.

FIGURE 2

Figure 2. The decreased Gray Matter (GM) volume detected by statistical analysis.

TABLE 3

Table 3. The different brain regions detected by statistical analysis on the GU dataset.

4.3. The Results of Model-Level Assessment

According to the findings in model-level assessment based on the searchlight method, the study has found structural differences of GM within twenty-four clusters, as shown in Figure 3 and Table 4. It can be seen that these clusters have covered most brain regions found by traditional statistical analysis, except FFG.R,CAL.L,INS.L, bilateral MOG, and PCG.R. For these brain regions failed to be detected in model-level assessment, the possible reason is that the differences are not obvious, and the areas of the clusters containing these brain regions are small in statistical analysis, e.g., the cluster of CAL.L only has 93 volxes, MOG.R and PCG.R only have 63 and 56 voxels, respectively. Significantly, although the peak MNI coordinates of some clusters may be different, model-level assessment and statistical analysis have detected similar brain regions, such as cluster 20 (in Table 3) and cluster 19 (in Table 4), cluster 21 (in Table 3) and cluster 14 (in Table 4), cluster 26 (in Table 3) and cluster 13 (in Table 4), cluster 10 (in Table 3) and cluster 24 (in Table 4), cluster 11 (in Table 3) and cluster 20 (in Table 4). Furthermore, we have considered abnormal clusters identified by the model-level assessment as the features and classified them on GU and multi-site data. The good classification performances have been shown in Supplementary Tables 6, 7 separately to demonstrate the effectiveness of these abnormal clusters.

FIGURE 3

Figure 3. The structural alteration by model-level analysis.

TABLE 4

Table 4. The candidate brain regions detected by model-level assessment.

4.4. The Results of Feature-Level Assessment

For the candidate brain regions detected in model-level assessment, the 'virtual lesion' analysis has been further conducted to select robust and discriminant brain regions which can be considered the neuroimaging biomarkers of ASD. The classification performances of final selected brain regions with nested cross-validation and multi-site validation have been listed in Tables 5, 6 separately. The bold values are the best performance for these brain regions. We have also illustrated the ROC curves of candidate biomarkers for different ML models on the GU dataset with the best accuracies larger than 70% in Figure 4, which include ORBinf.L,STG.L,SMG.L,SMG.R,ANG.L,ANG.R. The ROC curves of other candidate biomarkers on GU and multi-site data have been provided in the Supplementary Material.

TABLE 5

Table 5. The classification performance on GU.

TABLE 6

Table 6. The classification performance on the multi-site dataset (OHSU and UCLA).

FIGURE 4

Figure 4. The ROC curves of different methods for candidate biomarkers including ORBinf.L (left opercular part of inferior frontal gyrus) (A), STG.L (left superior temporal gyrus) (B), SMG.L (left supramarginal gyrus) (C), SMG.R (right supramarginal gyrus) (D), ANG.L (left angular gyrus) (E), and ANG.R (right angular gyrus) (F).

4.5. Neuroanatomical Correlations Between Regional GM Volume and Symptom Severity

We have processed correlation analysis to assess the relationship between the regional GM volume of candidate biomarkers in the GU dataset and ASD symptom severity. Clinical scores(ADOS_G and ADI_R) of thirty-six participants are available in GU. Results have revealed negative correlations between the communication skill scores in ADOS_G and regional GM volume in gyrus rectus (r = −0.356, p < 0.05), left middle temporal gyrus (r = −0.330, p < 0.05), and left inferior temporal gyrus (r = −0.339, p < 0.05). In particular, a significant negative correlation has been found between the communication skill scores in ADOS_G and the orbital part of the left inferior frontal gyrus (r = −0.433, p < 0.01). For ADI_R scores, we also found a negative correlation between verbal skill scores and right angular gyrus (r = −0.344, p < 0.05) and a significant negative correlation between non-verbal communication skills and right angular gyrus (r = −0.424, p < 0.01). No significant positive correlation between regional GM volume and clinical scores was found.

5. Discussion

Current VBM findings have delineated brain regions with consistently increased or reduced GM volume (Cauda et al., 2011). In this study, statistical analysis has reported a widespread reduction of GM volume in ASD with 7-13 years old. Research on brain development in ASD across the lifespan has demonstrated a complex neurodevelopmental trajectory, characterized by an early brain overgrowth followed by undergrowth in middle childhood and early adolescence (Courchesne et al., 2001). This might support the findings of statistical analysis in our study.

Our results based on machine learning have demonstrated that a widespread structural alteration of GM volume involved in bilateral superior temporal gyrus, bilateral middle temporal gyrus, left inferior temporal gyrus, right orbital SFG, bilateral opercular inferior frontal gyrus, left orbital inferior frontal gyrus, right rolandic operculum, right olfactory cortex, right gyrus rectus, right insula, right inferior parietal lobe with Supramarginal gyrus and Angular gyrus. Especially, multi-site dataset validation also has verified the robustness of the machine learning framework with three-level assessment. Since ASD is a complex neurodevelopmental disorder, involving language, reading, emotion, social interaction impairments, the quantitative meta-analysis in Geschwind and Levitt (2007) and Maximo et al. (2014) have suggested that ASD is unlikely to be associated with the abnormalities in one specific region alone but to be linked to the abnormalities of multiple, spatially distributed, neural systems. The finding may shed light on the widespread differences in GM volume found in our study.

The findings in the study have almost covered the whole temporal lobe including bilateral superior temporal gyrus, bilateral middle temporal gyrus, and left inferior temporal gyrus. Since attention has been directed to explore the neurobiological mechanism of ASD first, the abnormality of the temporal lobe has been speculated to link with the deficits in language and social behavior of patients with ASD (Hauser et al., 1975; Bachevalier, 1994; Kates et al., 2010). Ritvo et al. (1986) has examined the brains of four autistic subjects and found the localized pathological changes in the temporal lobe from autopsy-based research. It is considered that the superior temporal gyrus is a potential import biomarker of ASD (Pierce, 2011; Sophia et al., 2013). Based on the VBM-Dartel technique, Riddle et al. (2016) have revealed enlargement of the left anterior superior temporal gyrus in ASD. It is believed (Bigler et al., 2007) that the superior temporal gyrus plays a crucial role in social cognition, which participates in auditory and language processing. On the other hand, the VBM analysis has found reduced GM volume in the middle temporal gyrus (Kohoutov et al., 2020). According to analyze structural images of low functioning ASD children from 2 to 10 years old, the reduction of GM volume in the left inferior temporal gyrus has been identified, which appears to be involved in visual object perception (Riva et al., 2013). Similarly, RT et al. (2000) have revealed the fMRI (functional MRI) alterations of the inferior temporal gyrus when engaging in facial recognition tasks. Zilbovicius et al. (2000) has also found the localized dysfunction of the temporal lobe from the aspect of PET. Brothers (2002) has proposed the concept of the social brain first in 1990, which was defined as a group of interrelated neuroanatomical structures which are used to process social information, recognize other individuals and evaluate their psychological state, including intentions, dispositions, desires, and beliefs. The temporal lobe plays a very important role in the hypothesis of the social brain. The posterior superior temporal sulcus recognizes biological movements, such as eyes, hands, and other body movements and helps to interpret and predict others' behavior and intentions (Allison et al., 2000). The fMRI study of patients with ASD has shown that the differences in the activation on temporal lobe compared with their families and normal people, and the worse the social ability, the weaker the activation. Furthermore, it has been found that the degree of activation was positively correlated with the clinical manifestations of social impairment (Sugrue et al., 2010). The abnormality of the temporal lobe found in this study not only supports the hypothesis of the social brain but also suggests that the area of temporal lobe abnormalities in patients with autism may be larger than that found in previous studies. However, the hypothesis still needs to be further confirmed by quantitative and qualitative autopsy reports and animal studies.

Furthermore, the study has found GM abnormalities in the Broca area (posterior frontal lobe corresponding to BA44) and Wernicke area (superior marginal gyrus corresponding to BA39 and angular gyrus corresponding to BA40). Meanwhile, we also found negative correlations between GM volume in the right angular gyrus and verbal/nonverbal communication score in ADI_R. Language deficits are the core diagnostic characteristics in ASD and both of Broca area and Wernicke area are associated with language understanding. Adam et al. (2004) found that the Wernicke area, which is responsible for the understanding of single words, is more active than the Broca area, which is responsible for the understanding of complex sentences and has proposed the “underconnectivity theory” to explain why some patients with ASD have excellent ability to process single words, rather than complex sentences. Osbarn (2020) has found weakened functional connectivity in the area of Wernicke. Recently, researchers have established “the broken mirror theory” of autistic patients. It is supported that the dysfunction of the Human Mirror Neuron System (MNS) is the main cause of social and cognitive deficits in ASD (Vivanti and Rogers, 2014). The Broca area in humans has been considered as homologous to F5 as a part of MNS. The results in our studies also supported the MNS dysfunction in ASD individuals. However, the relevant evidence about the role of MNS in ASD still is not enough which urges us to build a more perfect MNS theory to understand the causes of social communication disorder in ASD (Southgate and Hamilton, 2008).

For other regions identified in the present study, they have also been reported in previous literature. Shijun (2021) has constructed a three-dimensional residual network based on deep learning and found the GM reduction in the orbital inferior frontal gyrus and Rolandic operculum. Riva et al. (2013) also found the reduced GM volume in the orbital part of the inferior frontal gyrus. In light of the meta-analysis based on large samples, the volume of GM in the insula and inferior parietal lobe decreased (Cauda et al., 2011). Li et al. (2019) have found that the GM volume of the bilateral gyrus rectus decreased, and the left rectus was negatively correlated with the clinical symptom score. Although it is not reported that the volume changes in the olfactory cortex of patients with ASD, the olfactory cortex is located at the anterior bottom of the limbic system and reciprocally connected with other structures, such as the amygdala, hippocampus, hypothalamus, the olfactory cortex is related to emotion and memory. As a consequence, the abnormal GM in the olfactory cortex in ASD may lead to emotional and memory problems. It is worth mentioning that some current studies on ASD have found GM differences in the cerebellum, hippocampus, and parahippocampal gyrus (Faridi and Khosrowabadi, 2017; Lotze et al., 2019). In particular, a decreased number of Purkinje cells in the cerebellar hemisphere have been observed (Ritvo et al., 1986). The study only found that the differences of the parahippocampal gyrus during statistical analysis. For model-level assessment, we have detected the alterations in these three brain regions, including the bilateral hippocampus, and parahippocampal gyrus and left crus in the cerebellum. However, they have been excluded from the candidate brain regions due to the poor performance in stability analysis. Traut et al. (2018) have compared and analyzed cerebellar volume of a large sample of ASD patients with normal subjects, and reported that the change of the cerebellar volume was significantly correlated with age, gender, and IQ rather than ASD diagnosis. Even though some studies have reported the differences of WM in ASD (Ecker et al., 2010; Xiao et al., 2014; Górriz et al., 2019), our study has not found abnormal white matter based on structural images. At present, contradictory conclusions often have been derived from a variety of ASD research due to the heterogeneity of subjects, including different subtypes, different scanning parameters from different centers, different ages and genders in ASD (Pua et al., 2017; Hiremath et al., 2021). In addition, the inconsistent findings of research work based on machine learning might be induced by different feature exaction techniques. For example, Haar et al. (2016) have achieved poor classification based on morphological features of ROI on the multi-site dataset and suggested that anatomical abnormalities may be only present in some distinct subgroups of ASD, while Zheng et al. (2018) have obtained superior classification performance with multi-feature-based networks based on morphological features.

Our results suggest that structural MRI can provide neuroimaging-based biomarkers for ASD. Such biomarkers could be used to complete and improve the diagnosis and treatment of ASD clinically (Walsh et al., 2011). On the one hand, we can utilize the classification performance of these identified biomarkers based on the machine learning model proposed in the study to improve diagnostic accuracy. On the other hand, the behavioral social malfunctioning in ASD might be modified by neural or behavioral treatments. For example, it is also reported that the behavioral training of facial expression communication behavior can help to improve the neural activities of ASD patients related to some social brain regions, such as MTG, so as to improve their capability of expression recognition (Bölte et al., 2015).

Although our results are consistent with some previous reports, several limitations of our study should be acknowledged. First, the confounding factors should be considered. ASD is a complex disease with multiple confounding factors, such as age, gender, IQ, and the inherent heterogeneity of the disorder. Our studies attempted to control age and gender in statistical analysis but failed to find an appropriate approach to remove the influence of the confounding factors in machine learning methods. On the other hand, the controlling of confounding factors is still highly controversial in the studies of ASD (Thomaidis et al., 2015). Some researchers have claimed that IQ should be strictly matched or statistically regressed out, while others have argued that the variability truly associated with ASD could be also discarded as “non-specific” when attempting to control some non-specific factors, such as IQ (Osbarn, 2020). Some studies have investigated the gender differences in ASD (Halladay et al., 2015; Prosperi et al., 2021) due to a high incidence rate of ASD in boys. Second, the current study took advantage of the relatively large sample of participants with ASD in the ABIDE II database to process multi-site validation via machine learning. However, we have to point that all three databases used in our study were collected by Siemens scanner with similar scanning parameters, which might make a less dispersion among data. In the future, the investigation of ASD about neuroanatomical alterations on larger samples from diverse clinical and demographic subgroups will significantly promote understanding neuropathology mechanism in ASD.

6. Conclusion

In this study, the VBM analysis has revealed a widespread reduction of GM volume when comparing ASD with TDC. Furthermore, our machine learning analysis followed the unified machine learning framework has revealed candidate neuroimaging biomarkers related to ASD and confirmed the relationship between regional GM volume and symptom severity. Our results have suggested that candidate neuroimaging biomarkers are useful to characterize the profile of brain anatomy in ASD and improve the diagnosis performance in clinical applications.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: http://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html.

Author Contributions

YD, WZ, and CLi conceived and designed this study. HJ, YT, and XL participated in the analysis of MRI dataset. CLu and DY helped to improve the manuscript. All authors contributed to the article, read, and approved the final manuscript.

Funding

This work is supported by the China Postdoctoral Science Foundation (No. 2016M592656), Sichuan Science and Technology Program (No. 2018JY0272), Science & Technology Bureau of Chengdu (No. 2020-YF09-00005-SN), Erasmus+ SHYFTE Project (No. 598649-EPP-1-2018-1-FR-EPPKA2-CBHE-JP), and the Key Laboratory of Pattern Recognition and Intelligent Information Processing, Institutions of Higher Education of Sichuan Province (No. MSSB-2021-12).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2021.765517/full#supplementary-material

References

Adam, J. M., Cherkassky, V. L., Keller, T. A., and Minshew, N. J. (2004). Cortical activation and synchronization during sentence comprehension in high-functioning autism: evidence of underconnectivity. Brain. 127, 1811–1821. doi: 10.1093/brain/awh199

PubMed Abstract | CrossRef Full Text | Google Scholar

Akiba, T., Sano, S., Yanase, T., Ohta, T., and Koyama, M. (2019). “Optuna: a next-generation hyperparameter optimization framework,” in Proceedings of the 25rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Anchorage, AK), 2623–2631.

Google Scholar

Allison, T., Puce, A., and Mccarthy, G. (2000). Social perception from Visual Cues: role of the STS region. Trends Cogn. Sci. 4, 267–278. doi: 10.1016/S1364-6613(00)01501-1

PubMed Abstract | CrossRef Full Text | Google Scholar

American Psychiatric Association. (2013). “Anxiety disorders,” in Diagnostic and Statistical Manual of Mental Disorders, 5th Edn (Arlington, VA). doi: 10.1176/appi.books.9780890425596.dsm05

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashburner, J. (2007). A fast diffeomorphic image registration algorithm. Neuroimage 38, 95–113. doi: 10.1016/j.neuroimage.2007.07.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashourvan, A., Gu, S., Mattar, M. G., Vettel, J. M., Bassett, D. S., Abraham, A., et al. (2016). Deriving reproducible biomarkers from multi-site resting-state data: an Autism-based example. Neuroimage 157, 1–37. doi: 10.1016/j.neuroimage.2016.10.045

PubMed Abstract | CrossRef Full Text | Google Scholar

Bachevalier, J. (1994). Medial temporal lobe structures and autism: a review of clinical and experimental findings. Neuropsychologia 32, 627–648. doi: 10.1016/0028-3932(94)90025-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Barnea-Goraly, N., Frazier, T. W., Piacenza, L., Minshew, N. J., Keshavan, M. S., Reiss, A. L., et al. (2014). A preliminary longitudinal volumetric MRI study of amygdala and hippocampal volumes in autism. Progr. Neuropsychopharmacol. Biol. Psychiatry 48, 124–128. doi: 10.1016/j.pnpbp.2013.09.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Bigler, E. D., Mortensen, S., Neeley, E. S., Ozonoff, S., Krasny, L., Johnson, M., et al. (2007). Superior temporal gyrus, language function, and autism. Dev. Neuropsychol. 31, 217–238. doi: 10.1080/87565640701190841

PubMed Abstract | CrossRef Full Text | Google Scholar

Bilgen, I., Guvercin, G., and Rekik, I. (2020). Machine learning methods for brain network classification: application to autism diagnosis using cortical morphological networks. arXiv. doi: 10.1016/j.jneumeth.2020.108799

PubMed Abstract | CrossRef Full Text | Google Scholar

Bölte, S., Ciaramidaro, A., Schlitt, S., Hainz, D., and Walter, H. (2015). Training-induced plasticity of the social brain in autism spectrum disorder. Br. J. Psychiatry 207, 149–57. doi: 10.1192/bjp.bp.113.143784

PubMed Abstract | CrossRef Full Text | Google Scholar

Bonnici, H. M., Chadwick, M. J., Dharshan, K., Demis, H., Nikolaus, W., and Maguire, E. A. (2012). Multi-voxel pattern analysis in human hippocampal subfields. Front. Hum. Neurosci. 6:290. doi: 10.3389/fnhum.2012.00290

PubMed Abstract | CrossRef Full Text | Google Scholar

Brothers, L. (2002). “The social brain: A project for integrating primate behaviour and neurophysiology in a new domain,” in Foundations in Social Neuroscience (Cambridge, MA: MIT Press), 367–389.

Cakir, J., Frye, R. E., and Walker, S. J. (2020). The lifetime social cost of autism: 1990–2029. Res. Autism.Spectr. Disord. 72:101502. doi: 10.1016/j.rasd.2019.101502

CrossRef Full Text | Google Scholar

Calderoni, S., Retico, A., Biagi, L., Tancredi, R., Muratori, F., and Tosetti, M. (2012). Female children with autism spectrum disorder: An insight from mass-univariate and pattern classification analyses. Neuroimage 59, 1013–1022. doi: 10.1016/j.neuroimage.2011.08.070

PubMed Abstract | CrossRef Full Text | Google Scholar

Cauda, F., Geda, E., Sacco, K., D'Agata, F., Duca, S., Geminiani, G., et al. (2011). Grey matter abnormality in autism spectrum disorder: An activation likelihood estimation meta-analysis study. J. Neurol. Neurosurg. Psychiatry 82, 1304–1313. doi: 10.1136/jnnp.2010.239111

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, L. J., Gianaros, P. J., Manuck, S. B., Anjali, K., Wager, T. D., and Ralph, A. (2015). A sensitive and specific neural signature for picture-induced negative affect. PLoS Biol. 13:e1002180. doi: 10.1371/journal.pbio.1002180

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Z., Yan, T., Wang, E., Jiang, H., Tang, Y., Yu, X., et al. (2020). Detecting abnormal brain regions in schizophrenia using structural MRI via machine learning. Comput. Intell. Neurosci. 2020:6405930. doi: 10.1155/2020/6405930

PubMed Abstract | CrossRef Full Text | Google Scholar

Courchesne, E., Karns, C. M., Davis, H. R., Ziccardi, R., Carper, R. A., Tigue, Z. D., et al. (2001). Unusual brain growth patterns in early life in patients with autistic disorder: an MRI study. Neurology 57, 245–254. doi: 10.1212/WNL.57.2.245

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Martino, A., O'Connor, D., Chen, B., Alaerts, K., Anderson, J. S., Assaf, M., et al. (2017). Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Scientific Data 4:170010. doi: 10.1038/sdata.2017.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Ecker, C., Rocha-Rego, V., Johnston, P., Mourao-Miranda, J., Marquand, A., Daly, E. M., et al. (2010). Investigating the predictive value of whole-brain structural MR scans in autism: a pattern classification approach. Neuroimage 49, 44–56. doi: 10.1016/j.neuroimage.2009.08.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Eslami, T., Almuqhim, F., Raiker, J. S., and Saeed, F. (2021). Machine learning methods for diagnosing autism spectrum disorder and attention- deficit/hyperactivity disorder using functional and structural MRI: a survey. Front. Neuroinform. 14:575999. doi: 10.3389/fninf.2020.575999

PubMed Abstract | CrossRef Full Text | Google Scholar

Eslami, T., Mirjalili, V., Fong, A., Laird, A. R., and Saeed, F. (2019). ASD-DiagNet: a hybrid learning approach for detection of autism spectrum disorder using fMRI data. Front. Neuroinform. 13:70. doi: 10.3389/fninf.2019.00070

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, R. E., Chang, K. W., Hsieh, C. J., Wang, X. R., and Lin, C. J. (2008). LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874. doi: 10.5555/1390681.1442794

PubMed Abstract | CrossRef Full Text | Google Scholar

Faridi, F., and Khosrowabadi, R. (2017). Behavioral, cognitive and neural markers of asperger syndrome. Basic Clin. Neurosci. 8, 349–359. doi: 10.18869/nirp.bcn.8.5.349

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, L., Guo, W., Yu, D., Gao, Q., Gao, K., Xue, Z., et al. (2012). Classification of different therapeutic responses of major depressive disorder with multivariate pattern analysis method based on structural mr scans. PLoS ONE 7:e40968. doi: 10.1371/journal.pone.0040968

PubMed Abstract | CrossRef Full Text | Google Scholar

Geschwind, D. H., and Levitt, P. (2007). Autism spectrum disorders: developmental disconnection syndromes. Curr. Opin. Neurobiol. 17, 103–111. doi: 10.1016/j.conb.2007.01.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Górriz, J. M., Ramírez, J., Segovia, F., Martínez, F. J., Lai, M. C., Lombardo, M. V., et al. (2019). A machine learning approach to reveal the neurophenotypes of autisms. Int. J. Neural. Syst. 29, 1–22. doi: 10.1142/S0129065718500582

PubMed Abstract | CrossRef Full Text | Google Scholar

Haar, S., Berman, S., Behrmann, M., and Dinstein, I. (2016). Anatomical abnormalities in autism? Cereb. Cortex 26, 1440–1452. doi: 10.1093/cercor/bhu242

PubMed Abstract | CrossRef Full Text | Google Scholar

Halladay, A. K., Bishop, S., Constantino, J. N., Daniels, A. M., Koenig, K., Palmer, K., et al. (2015). Sex and gender differences in autism spectrum disorder: summarizing evidence gaps and identifying emerging areas of priority. Mol. Autism. 6:36. doi: 10.1186/s13229-015-0019-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Hauser, S. L., Robert, D. G., and Paul, R. N. (1975). Pneumographic findings in the infantile autism syndrome. a correlation with temporal lobe disease. Brain J. Neurol. 98, 667–688. doi: 10.1093/brain/98.4.667

PubMed Abstract | CrossRef Full Text | Google Scholar

Hazlett, H. C., Gu, H., Munsell, B. C., Kim, S. H., Styner, M., Wolff, J. J., et al. (2017). Early brain development in infants at high risk for autism spectrum disorder. Nature 542, 348–351. doi: 10.1038/nature21369

PubMed Abstract | CrossRef Full Text | Google Scholar

Hiremath, C. S., Sagar, K. J. V., Yamini, B. K., Girimaji, A. S., Kumar, R., Sravanti, S. L., et al. (2021). Emerging behavioral and neuroimaging biomarkers for early and accurate characterization of autism spectrum disorders: a systematic review. Transl. Psychiatry 11:42. doi: 10.1038/s41398-020-01178-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Hirvikoski, T., Mittendorfer-Rutz, E., Boman, M., Larsson, H., and Bolte, S. (2015). Premature mortality in autism spectrum disorder. British J. Psychiatry 208, 232–238. doi: 10.1192/bjp.bp.114.160192

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, J., Chen, X., Li, M., Xu, H.-.L., Huang, Z., Chen, N., et al. (2021). Pattern of cerebellar grey matter loss associated with ataxia severity in spinocerebellar ataxias type 3: a multi-voxel pattern analysis. Brain Imaging Behav. 1, 1–12. doi: 10.1007/s11682-021-00511-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Jou, R. J., Mateljevic, N., Minshew, N. J., Keshavan, M. S., and Hardan, A. Y. (2011). Reduced central white matter volume in autism: Implications for long?range connectivity. Psychiatry Clin. Neurosci. 65, 98–101. doi: 10.1111/j.1440-1819.2010.02164.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kates, W. R., Mostofsky, S. H., Zimmerman, A. W., Mazzocco, M., and Reiss, A. L. (2010). Neuroanatomical and neurocognitive differences in a pair of monozygous twins discordant for strictly defined autism. Ann. Neurol. 43, 782–791. doi: 10.1002/ana.410430613

PubMed Abstract | CrossRef Full Text | Google Scholar

Khodatars, M., Shoeibi, A., Ghassemi, N., Jafari, M., Khadem, A., Sadeghi, D., et al. (2020). Deep learning for neuroimaging-based diagnosis and rehabilitation of autism spectrum disorder: a review. arXiv. doi: 10.1016/j.compbiomed.2021.104949

PubMed Abstract | CrossRef Full Text | Google Scholar

Knutson, B. (2013). Interpretable whole-brain prediction analysis with GraphNet. Neuroimage 72, 304–321. doi: 10.1016/j.neuroimage.2012.12.062

PubMed Abstract | CrossRef Full Text | Google Scholar

Kohoutov, L., Heo, J., Cha, S., Lee, S., and Woo, C. W. (2020). Toward a unified framework for interpreting machine-learning models in neuroimaging. Nat. Protoc. 15, 1–37. doi: 10.1038/s41596-019-0289-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Kriegeskorte, N., Goebel, R., and Bandettini, P. (2006). Information-based functional brain mapping. Proc. Natl. Acad. Sci. U.S.A. 103, 3863–3868. doi: 10.1073/pnas.0600244103

PubMed Abstract | CrossRef Full Text | Google Scholar

Leung, Y. Y., Chang, C. Q., Hung, Y. S., and Fung, P. (2006). Gene selection for b cancer classification using support vector machines. Mach. Learn. 46, 389–422. doi: 10.1023/A:1012487302797

PubMed Abstract | CrossRef Full Text

Li, G., Rossbach, K., Jiang, W., Zhao, L., and Du, Y. (2019). Reduction in grey matter volume and its correlation with clinical symptoms in Chinese boys with low functioning autism spectrum disorder. J. Intell. Disabil. Res. 63, 113–123. doi: 10.1111/jir.12552

PubMed Abstract | CrossRef Full Text | Google Scholar

Lord, C. (2000). The autism diagnostic observation schedule-generic : a standard measure of social and communication deficits associated with the spectrum of autism. J. Autism Dev. Disord. 30, 205–223. doi: 10.1023/A:1005592401947

PubMed Abstract | CrossRef Full Text | Google Scholar

Lord, C., Rutter, M., and Couteur, A. L. (1994). Autism diagnostic interview-revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J. Autism. Dev. Disord. 24, 659–685. doi: 10.1007/BF02172145

PubMed Abstract | CrossRef Full Text | Google Scholar

Lotze, M., Domin, M., Gerlach, F. H., Gaser, C., Lueders, E., Schmidt, C. O., et al. (2019). Novel findings from 2,838 adult brains on sex differences in gray matter brain volume. Sci. Rep. 9, 1671. doi: 10.1038/s41598-018-38239-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Maenner, M. J., Shaw, K. A., Baio, J., Washington, A., and Dietz, P. M. (2020). Prevalence of autism spectrum disorder among children aged 8 years autism and developmental disabilities monitoring network, 11 sites, United States, 2016. MMWR Surveill. Summ. 69, 1–12. doi: 10.15585/mmwr.ss6706a1

PubMed Abstract | CrossRef Full Text | Google Scholar

Maier, S., Tebartz van Elst, L., Beier, D., Ebert, D., Fangmeier, T., Radtke, M., et al. (2015). Increased hippocampal volumes in adults with high functioning autism spectrum disorder and an IQ>100: a manual morphometric study. Psychiatry Res. Neuroimaging 234, 152–155. doi: 10.1016/j.pscychresns.2015.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Martino, A. D., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., et al. (2013). The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry. 19, 659–667. doi: 10.1038/mp.2013.78

PubMed Abstract | CrossRef Full Text | Google Scholar

Maximo, J. O., Cadena, E. J., and Kana, R. K. (2014). The Implications of Brain Connectivity in the Neuropsychology of Autism. Neuropsychol. Rev. 24, 16–31. doi: 10.1007/s11065-014-9250-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Mazziotta, J. C., Toga, A. W., Evans, A., Fox, P., and Lancaster, J. (1995). A probabilistic atlas of the human brain: theory and rationale for its development: the international consortium for brain mapping (icbm). Neuroimage 2, 89–101. doi: 10.1006/nimg.1995.1012

PubMed Abstract | CrossRef Full Text | Google Scholar

Mwiza, K., Shuo, Z., and Gaolang Gong, H. L. (2020). Improving multi-site autism classification based on site-dependence minimisation and second-order functional connectivity. bioRxiv. 1, 1–35. doi: 10.1101/2020.02.01.930073

CrossRef Full Text | Google Scholar

Ojala, M., and Garriga, G. C. (2010). Permutation tests for studying classifier performance. Journal of Machine Learning Research. doi: 10.1109/ICDM.2009.108

CrossRef Full Text | Google Scholar

Organization W. H. (1993). The ICD-10 Classification of Mental and Behavioural Disorders: Diagnostic Criteria for Research. Geneva: Clinical Descriptions and Diagnostic Guidelines Geneva.

Google Scholar

Osbarn, S. (2020). Wernicke's Area in A ea in Autism: rsfMRI study (Ph.D. thesis).

Pagnozzi, A. M., Conti, E., Calderoni, S., Fripp, J., and Rose, S. E. (2018). A systematic review of structural MRI biomarkers in autism spectrum disorder: A machine learning perspective. Int. J. Dev. Neurosci. 71, 68–82. doi: 10.1016/j.ijdevneu.2018.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Palmen, S. J., Pol, H. H. F., Kemner, C., Schnack, H. G., Durston, S., Lahuis, B. E., et al. (2005). Increased gray-matter volume in medication-naive high-functioning children with autism spectrum disorder. Psychol. Med. 35, 561. doi: 10.1017/S0033291704003496

PubMed Abstract | CrossRef Full Text | Google Scholar

Pierce, K. (2011). Early functional brain development in autism and the promise of sleep fMRI. Brain Res. 1380, 162–174. doi: 10.1016/j.brainres.2010.09.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Prosperi, M., Turi, M., Guerrera, S., Napoli, E., Tancredi, R., Igliozzi, R., et al. (2021). Sex Differences in autism spectrum disorder: an investigation on core symptoms and psychiatric comorbidity in preschoolers. Front. Integr. Neurosci. 14:62. doi: 10.3389/fnint.2020.594082

PubMed Abstract | CrossRef Full Text | Google Scholar

Pua, E. P. K., Bowden, S. C., and Seal, M. L. (2017). Autism spectrum disorders: neuroimaging findings from systematic reviews. Res. Autism. Spectr. Disord. 34, 28–33. doi: 10.1016/j.rasd.2016.11.005

CrossRef Full Text | Google Scholar

Rakić, M., Cabezas, M., Kushibar, K., Oliver, A., and Lladó, X. (2020). Improving the detection of autism spectrum disorder by combining structural and functional MRI information. Neuroimage Clin. 25:102181. doi: 10.1016/j.nicl.2020.102181

PubMed Abstract | CrossRef Full Text | Google Scholar

Riddle, K., Cascio, C. J., and Woodward, N. D. (2016). Brain structure in autism: a voxel-based morphometry analysis of the Autism Brain Imaging Database Exchange (ABIDE). Brain Imaging Behav. 11, 1–11. doi: 10.1007/s11682-016-9534-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Riedel, A., Maier, S., Ulbrich, M., Biscaldi, M., Ebert, D., Fangmeier, T., et al. (2014). No significant brain volume decreases or increases in adults with high-functioning autism spectrum disorder and above average intelligence: A voxel-based morphometric study. Psychiatry Res. 223, 67–74. doi: 10.1016/j.pscychresns.2014.05.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Rifkin, R., Yeo, G., and Poggio, T. (2003) “Regularized Least-Squares Classification” in Advances in Learning Theory: Methods, Models Applications, NATO Science Series III: Computer & Systems Sciences, eds. Suykens, J.A.K. Horvath G. Basu S. Micchelli C. Vandewalle, J. (Amsterdam: IOS Press), 190, 131–153.

Google Scholar

Ritvo, E. R., Freeman, B., Scheibel, A. B., Duong, T., Robinson, H., Guthrie, D., et al. (1986). Lower Purkinje cell counts in the cerebella of four autistic subjects: initial findings of the UCLA-NSAC autopsy research report. Am. J. Psychiatry 143, 862. doi: 10.1176/ajp.143.7.862

PubMed Abstract | CrossRef Full Text | Google Scholar

Riva, D., Annunziata, S., Contarino, V., Erbetta, A., Aquino, D., and Bulgheroni, S. (2013). Gray matter reduction in the vermis and CRUS-II is associated with social and interaction deficits in low-functioning children with autistic spectrum disorders: a VBM-DARTEL study. Cerebellum 12, 676–685. doi: 10.1007/s12311-013-0469-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Samartsidis, P., Montagna, S., Nichols, T. E., and Johnson, T. D. (2016). The coordinate-based meta-analysis of neuroimaging data. Stat. Sci. 32, 580–599. doi: 10.1214/17-STS624

PubMed Abstract | CrossRef Full Text | Google Scholar

Schultz, R. T., Gauthier, I., Klin, A., Fulbright, R. K., Anderson, A. W., Volkmar, F., et al. (2000). Abnormal ventral temporal cortical activity during face discrimination among individuals with autism and Asperger syndrome. Arch. Gen. Psychiatry 57, 331. doi: 10.1001/archpsyc.57.4.331

PubMed Abstract | CrossRef Full Text | Google Scholar

Sherkatghanad, Z., Akhondzadeh, M., Salari, S., Zomorodi-Moghadam, M., Abdar, M., Acharya, U. R., et al. (2020). Automated detection of autism spectrum disorder using a convolutional neural network. Front. Neurosci. 13:1325. doi: 10.3389/fnins.2019.01325

PubMed Abstract | CrossRef Full Text | Google Scholar

Shijun, L. (2021). Brain differences in autism spectrum disorder (Ph.D. thesis).

Google Scholar

Sophia, M., Daniel, K., Samson, A. C., Valerie, K., Janusch, B., Michel, G., et al. (2013). Convergent findings of altered functional and structural brain connectivity in individuals with high functioning autism: a multimodal MRI study. PLoS ONE 8:e67329. doi: 10.1371/journal.pone.0067329

PubMed Abstract | CrossRef Full Text | Google Scholar

Southgate, V., and Hamilton, A. (2008). Unbroken mirrors: challenging a theory of Autism. Trends Cogn. Sci. 12, 225–229. doi: 10.1016/j.tics.2008.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Spera, G., Retico, A., Bosco, P., Ferrari, E., Palumbo, L., Oliva, P., et al. (2019). Evaluation of altered functional connections in male children with autism spectrum disorders on multiple-site data optimized with machine learning. Front. Psychiatry. 10:620. doi: 10.3389/fpsyt.2019.00620

PubMed Abstract | CrossRef Full Text | Google Scholar

Sugrue, D. D., Voos, M. C., Ventola, P. E., Klin, A., Kaiser, M. D., Velde, C. V., et al. (2010). Neural signatures of autism. Proc. Natl. Acad. Sci. U.S.A. 107, 21223–21228. doi: 10.1073/pnas.1010412107

PubMed Abstract | CrossRef Full Text | Google Scholar

Thomaidis, L., Kyprianou, M., and Choleva, A. (2015). Early screening of autism: Is age a confounding factor when screening for autism? J. Paediatr. Child Health. 51, 1046–1047. doi: 10.1111/jpc.12997

PubMed Abstract | CrossRef Full Text | Google Scholar

Traut, N., Beggiato, A., Bourgeron, T., Delorme, R., Rondi-Reig, L., Paradis, A. L., et al. (2018). Cerebellar volume in autism: literature meta-analysis and analysis of the autism brain imaging data exchange cohort. Biol. Psychiatry 83, 579–588. doi: 10.1016/j.biopsych.2017.09.029

PubMed Abstract | CrossRef Full Text | Google Scholar

van Rooij, D., aan Anagnostou, E., Arango, C., Auzias, G., Behrmann, M., Busatto, G.eraldo F., Calderoni, S., et al. (2017). Cortical and subcortical brain morphometry differences between patients with autism spectrum disorder and healthy individuals across the lifespan: results from the ENIGMA ASD working group. Am. J. Psychiatry 175, 359–369. doi: 10.1176/appi.ajp.2017.17010100

PubMed Abstract | CrossRef Full Text | Google Scholar

Vivanti, G., and Rogers, S. J. (2014). Autism and the mirror neuron system: Insights from learning and teaching. Philos. Trans. R. Soc. B Biol. Sci. 369:20130184. doi: 10.1098/rstb.2013.0184

PubMed Abstract | CrossRef Full Text | Google Scholar

Walsh, P., Elsabbagh, M., Bolton, P., and Singh, I. (2011). In search of biomarkers for autism: scientific, social and ethical challenges. Nat. Rev. Neurosci. 12, 603–612. doi: 10.1038/nrn3113

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, M., Zhang, D., Huang, J., Yap, P. T., Shen, D., and Liu, M. (2020). Identifying autism spectrum disorder with multi-site fMRI via low-rank domain adaptation. IEEE Trans. Med. Imaging 39, 644–655. doi: 10.1109/TMI.2019.2933160

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C. Y., Wang, L., Shi, F., Yap, P. T., and Shen, D. (2014). Diagnosis of autism spectrum disorders using regional and interregional morphological features. Hum. Brain Mapp. 35, 3414–3430. doi: 10.1002/hbm.22411

PubMed Abstract | CrossRef Full Text | Google Scholar

Wold, S., Esbensen, K., and Geladi, P. (1987). Principal component analysis. Chemometr. Intell. Lab. Syst. 2, 37–52. doi: 10.1016/0169-7439(87)80084-9

CrossRef Full Text | Google Scholar

Xiao, Z., Qiu, T., Ke, X., Xiao, X., Xiao, T., Liang, F., et al. (2014). Autism spectrum disorder as early neurodevelopmental disorder: evidence from the brain imaging abnormalities in 2-3 years old toddlers. J. Autism. Dev. Disord. 44, 1633–1640. doi: 10.1007/s10803-014-2033-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, W., Eilamstock, T., Wu, T., Spagna, A., Chen, C., Hu, B., et al. (2018). Multi-feature based network revealing the structural abnormalities in autism spectrum disorder. IEEE Trans. Affect. Comput. 12, 732–742. doi: 10.1109/TAFFC.2018.2890597

CrossRef Full Text | Google Scholar

Zhutovsky, P., Thomas, R. M., Olff, M., Rooij, S., Kennis, M., Wingen, G., et al. (2019). Individual prediction of psychotherapy outcome in posttraumatic stress disorder using neuroimaging data. Transl. Psychiatry 9:326. doi: 10.1038/s41398-019-0663-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Zilbovicius, M., Boddaert, N., Belin, P., Poline, J. B., and Samson, Y. (2000). Temporal lobe dysfunction in childhood autism: a PET study. Am. J. Psychiatry 157, 1988–1993. doi: 10.1176/appi.ajp.157.12.1988

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: autism spectrum disorder, structural MRI, multi-site data, machine learning, searchlight technique

Citation: Duan Y, Zhao W, Luo C, Liu X, Jiang H, Tang Y, Liu C and Yao D (2022) Identifying and Predicting Autism Spectrum Disorder Based on Multi-Site Structural MRI With Machine Learning. Front. Hum. Neurosci. 15:765517. doi: 10.3389/fnhum.2021.765517

Received: 27 August 2021; Accepted: 13 December 2021;
Published: 22 February 2022.

Edited by:

Miseon Shim, Korea University, South Korea

Reviewed by:

Xun Yang, Chongqing University, China
Weihao Zheng, Lanzhou University, China

Copyright © 2022 Duan, Zhao, Luo, Liu, Jiang, Tang, Liu and Yao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chang Liu, bGl1Y2hhbmdAY2R1LmVkdS5jbg==; DeZhong Yao, ZHlhb0B1ZXN0Yy5lZHUuY24=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.