Explaining deep learning-based representations of resting state functional connectivity data: focusing on interpreting nonlinear patterns in autism spectrum disorder

Kim, Young-geun; Ravid, Orren; Zheng, Xinyuan; Kim, Yoojean; Neria, Yuval; Lee, Seonjoo; He, Xiaofu; Zhu, Xi

doi:10.3389/fpsyt.2024.1397093

ORIGINAL RESEARCH article

Front. Psychiatry , 20 May 2024

Sec. Computational Psychiatry

Volume 15 - 2024 | https://doi.org/10.3389/fpsyt.2024.1397093

This article is part of the Research Topic Computational Research to Shed Light on the Pathogenesis of Psychiatric and Mental Health Disorders View all 3 articles

Explaining deep learning-based representations of resting state functional connectivity data: focusing on interpreting nonlinear patterns in autism spectrum disorder

Young-geun Kim^1,2,3†

Orren Ravid^2†

Xinyuan Zheng³

Yoojean Kim²

Yuval Neria^1,2

Seonjoo Lee^1,2,3

Xiaofu He^1,2*

Xi Zhu^1,2*

¹Department of Psychiatry, Columbia University Irving Medical Center, New York, NY, United States
²Department of Biostatistics, Columbia University Irving Medical Center, New York, NY, United States
³Mental Health Data Science, New York State Psychiatric Institute, New York, NY, United States

Background: Resting state Functional Magnetic Resonance Imaging fMRI (rs-fMRI) has been used extensively to study brain function in psychiatric disorders, yielding insights into brain organization. However, the high dimensionality of the rs-fMRI data presents significant challenges for data analysis. Variational autoencoders (VAEs), a type of neural network, have been instrumental in extracting low-dimensional latent representations of resting state functional connectivity (rsFC) patterns, thereby addressing the complex nonlinear structure of rs-fMRI data. Despite these advances, interpreting these latent representations remains a challenge. This paper aims to address this gap by developing explainable VAE models and testing their utility using rs-fMRI data in autism spectrum disorder (ASD).

Methods: One-thousand one hundred and fifty participants (601 healthy controls [HC] and 549 patients with ASD) were included in the analysis. RsFC correlation matrices were extracted from the preprocessed rs-fMRI data using the Power atlas, which includes 264 regions of interest (ROIs). Then VAEs were trained in an unsupervised manner. Lastly, we introduce our latent contribution scores to explain the relationship between estimated representations and the original rs-fMRI brain measures.

Results: We quantified the latent contribution scores for both the ASD and HC groups at the network level. We found that both ASD and HC groups share the top network connectivitives contributing to all estimated latent components. For example, latent 0 was driven by rsFC within ventral attention network (VAN) in both the ASD and HC. However, we found significant differences in the latent contribution scores between the ASD and HC groups within the VAN for latent 0 and the sensory/somatomotor network for latent 2.

Conclusion: This study introduced latent contribution scores to interpret nonlinear patterns identified by VAEs. These scores effectively capture changes in each observed rsFC feature as the estimated latent representation changes, enabling an explainable deep learning model that better understands the underlying neural mechanisms of ASD.

1 Introduction

The use of functional magnetic resonance imaging (fMRI) data has been pivotal in the field of psychiatry over the last few decades for studying brain function, evaluating underlying neural mechanisms of treatment interventions, and aiding in the diagnosis of a variety of mental disorders. Resting state fMRI (rs-fMRI), which measures spontaneous neural activities in the absence of any specific task, has been useful in revealing intrinsic patterns of brain functional connectivity and providing insights into brain functional organization. One of the most common methods to extract measures of brain networks of rs-fMRI data is to calculate the correlation coefficient between pairs of different regions of interest (ROIs) in the brain. Examining differences in connectivity between healthy controls (HC) and those with psychiatric disorders has helped to understand the underlying neural network deficits associated those disorders (1). In recent years, machine learning approaches have gained prominence for analyzing rs-fMRI data for the diagnosis of psychiatric disorders and the predicting treatment outcomes at the individual level (2).

Despite its utility, leveraging rs-fMRI data presents significant challenges. One of the main chanllenges lies in the high dimensionality of the data. For instance, rs-fMRI data are often partitioned into several hundred ROIs based on various brain atlases (3), e.g., 264 ROIs in the Power atlas (4) or 333 in the Gordon atlas (5). This results in connectivity matrices with more than ten thousand image features, posing substantial challenges for standard machine learning techniques such as random forest, support vector machine, and regressions. Thus, dimensionality reduction methods are often used as a preprocessing step before applying machine learning algorithms to rs-fMRI datasets. Traditional linear methods, such as principal component analysis (PCA) and independent component analysis (ICA), have been used to transform the high-dimensional connectome features into a lower-dimensional space. However, these methods lack to address the complex nonlinear structure of rs-fMRI data. For example, PCA may exhibit biases (6) and ICA can have scalability issues (7) when the data dimension exceeds the sample size. Recently, deep learning-based approaches have extracted low-dimensional latent factors (called representations) of resting state functional connectivity patterns (rsFC), showing remarkable performance with expressive nonlinear neural networks (8, 9).

As such, the advent of neural networks has provided new avenues for dimensionality reduction in neuroimaging. One prominent model architecture is the autoencoder (AE) framework (10), which aims to learn a relatively low-dimensional latent representation of the original data, which can then be decoded to recover the data through the decoding phase. Additionally, to produce more effective and interpretable latent representations, variational autoencoder (VAE) approaches have been introduced and have yielded promising results (11). Compared to AEs, VAE approaches possess prominent properties: 1) VAEs are probabilistic models that learn distributions of latent representations, allowing for better modeling of complex data structures. They are seen as an extension of nonlinear ICA (12), using nonlinear neural networks to model the data generation mechanism and creating low-dimensional latent representations consisting of statistically independent components (13). VAEs enable the delineation of each learned component’s role within the framework of nonlinear data generation models, offering a more precise understanding of complex neuroimaging data. In the implementation, the representations follow user-specified distributions called priors, and one common choice is multivariate Gaussian distributions. 2) Compared to AEs, the training for VAEs is regularized, which helps prevent overfitting and enforces the independence between components in the estimated representations (14). This makes the representations more interpretable, as each component has a distinct role from all the others (15). For example, in the analysis of hand-written digit images (e.g., Modified National Institute of Standards and Technology [MNIST] http://yann.lecun.com/exdb/mnist/), VAEs may estimate two-dimensional representations where the first component explains the size of digits and the second one explains the slant. In the context of rs-fMRI data, specific latent components may explain within-network or between-network connectivity across various networks. 3) Moreover, VAEs provide personalized inference on the latent space by approximating the distribution of latent representations given the observations. This allows for subject-level information such as uncertainty or variance of estimated representations.

Despite improvements in the performance of machine learning models using latent features, interpreting the significance of the latent features remains a challenge. At first glance, many neural network models can appear as black boxes, with complex and difficult-to-interpret representations. However, strides have been made across the deep learning community to develop tools for interpreting these models. Much of the initial work in visualizing the latent features of autoencoder models originated in the field of computer vision. The tools developed there benefited from the fact that images are relatively easy to interpret and understand naturally by humans. Thus, it was easier to see the qualitative contribution of each latent feature as we could directly examine the images produced by these tools. However, brain imaging modalities such as rs-fMRI do not offer the same level of natural interpretability.

Some initial efforts have been made to generate interpretable latent representations of rs-fMRI data using VAEs. For example, Kim et al. (2021) trained VAEs using large rs-fMRI data from Human Connectome Project. They extracted 2D grids of rs-fMRI patterns at every time as images and input them into the VAE models. The results demonstrated that estimated representations from VAEs effectively characterized individual identification (16). Another study applied VAEs to rs-fMRI data from Autism Brain Imaging Data Exchange (ABIDE) and found an autism spectrum disorder (ASD)-related latent factor (17). However, this study only used a two-dimensional representation space and lacked explanation of the complicated information in rs-fMRI related to ASD. Moreover, the latent representations were based on individual brain regions (e.g., frontal cortices and frontoparietal) rather than brain networks (e.g., executive control network [ECN], salience network [SN], and default-mode network [DMN]).

The goal of this paper is to extract the latent representations from VAE models trained on rs-fMRI data, and create explainable VAE models by visualizing and quantifying the latent representation based on the input rs-fMRI brain features. Here we test the utility of this tool using the rs-fMRI dataset.

2 Methods

2.1 Dataset

We used publicly available data from the Paris-Saclay Center for Data Science that was initially published for competition in the Imaging-Psychiatry Challenge (IMPAC; https://paris-saclay-cds.github.io/autism_challenge/). The dataset comprised rs-fMRI images from 1,150 participants, including 601 HC and 549 patients with ASD, collected from 35 sites. The demographic characteristics of these participants are detailed in Table 1. We further excluded 121 participants who failed to pass quality control procedures.

Table 1

Table 1 The descriptive information for each site from the public dataset.

2.2 Image acquisition and processing

All time-series imaging data were acquired using specific atlases and a fetcher provided by IMPAC. We extracted rsFC matrices from the preprocessed rs-fMRI data using Power atlas with 264 ROIs (4). The ROI names and the networks they belong to can be found at https://www.jonathanpower.net/2011-neuron-bigbrain.html. Pairwise correlations were calculated between each ROI for each participant and transformed into a correlation matrix whose elements are Pearson correlations ranging from –1 to 1. Finally, we vectorized correlation matrices by flattening the lower triangular part. For a connectivity matrix with $N$ ROIs, the length of the 1D vectorized correlation vector was calculated by $(N - 1) \times N / 2$ . The 1D vector was used as an input signal in VAE models. To correct site effects and adjust for age and gender covariates, we performed a Combat algorithm on the correlation vectors using neuroHarmonize (18).

2.3 Variational autoencoders

2.3.1 Model architectures

We denote high-dimensional observations and low-dimensional representations by $X \in ℝ^{m}$ and $Z \in ℝ^{n}$ , respectively. Realizations of random variables are denoted by small characters. VAEs consist of two parts: (i) encoders modeling the distribution of representations given observations, $q_{ϕ} (z | x)$ , and (ii) decoders modeling the distribution of observations given representations, $p_{θ} (x | z)$ (Figure 1), where $ϕ$ and $θ$ are neural network parameters. Both $q_{ϕ} (z | x)$ and $p_{θ} (x | z)$ are usually modeled as multivariate Gaussian distributions with diagonal variances, $N (μ_{ϕ} (z | x), Σ_{ϕ} (z | x))$ and $N (μ_{θ} (x | z), I_{m})$ , respectively, to apply reparameterization trick (11) where $I_{m}$ is the identity matrix of size $m$ . Encoders extract representations from observations, decoders reconstruct the original data with them, and they are trained by maximizing evidence lower bounds (ELBOs) (19). For example, for a given $x$ , we can first sample $z$ following $N (μ_{ϕ} (z | x), Σ_{ϕ} (z | x))$ , and then use $μ_{θ} (x | z)$ as reconstruction results.

Figure 1

Figure 1 VAEs model the data generation mechanism with low-dimensional representationsand neural networks called decoders. The encoders estimate representations with observations and decoders reconstruct the original data with representations.

Compared to AEs, VAEs are distinct in that they are generative models, i.e., they model distributions of observations. Latent representations $Z$ consist of statistically independent components and nonlinear decoders input $Z$ to model $p_{θ} (x | z)$ , e.g., $N (μ_{θ} (x | z), I_{m})$ . When we assume that the $p_{θ} (x | z)$ has zero-variance, the data generation structure of VAE reduces to nonlinear ICA (20). By estimating the nonlinear decoders with likelihood maximization and approximating their inverse mapping with encoders, VAEs can separate blind sources from high-dimensional complicated observations, e.g., functional connectivity.

2.3.2 Loss function

The loss function of VAEs is the negative ELBO, $- E_{q_{ϕ} (z | x)} \log p_{θ} (x | z) + D_{K L} (q_{ϕ} (z | x) ∥ p (z))$ , and is an upper bound of the negative data log-likelihood, $- \log p_{θ} (x)$ , where $D_{K L}$ denotes the Kullback-Leibler (KL) Divergence and $p (z)$ denotes a user-specified prior distribution of representations, e.g., multivariate standard Gaussian distributions. Minimizing the loss function of VAEs is equivalent to maximizing a lower bound of likelihoods. In the loss function, the first term is called the reconstruction error which measures how reconstruction results differ from the original observations, and it is the mean squared error when we use Gaussian distributions for decoder distributions. The second term measures the discrepancy between $q_{ϕ} (z | x)$ and $p (z)$ . Considering that the loss function of AEs is the reconstruction error, training VAEs can be viewed as training AEs with the KL-regularization term for the learning data generation mechanism.

2.4 Latent contribution scores to explain nonlinear representations

2.4.1 Latent contribution scores

We introduce our latent contribution scores to explain the relationship between estimated representations and the original rs-fMRI brain measures.

For any observation $x \in ℝ^{m}$ , encoder parameter $ϕ$ , and decoder parameter $θ$ , to measure the contribution of each component in $z \in ℝ^{n}$ , we propose a matrix $D (x)$ whose elements are latent contribution scores,

\begin{array}{l} D {(x)}_{k l} = E_{q_{ϕ} (z ❘ x)} \partial {\hat{x}}_{l} / \partial z_{k}, & (1) \end{array}

where $\hat{x}$ denotes the reconstruction result, $k = 1, \dots, n$ , and $l = 1, \dots, m$ .

The proposed latent contribution scores in Equation (1) are interpretable in two respects: (i) they are extensions of mixing weights in ICA to the nonlinear data generation mechanism, and (ii) they are input perturbation-based scores (21, 22). For (i), $\hat{x}$ is the reconstruction result with estimated sources whose gradients are mixing weights under a linear generation mechanism. Similar to the explanation on mixing weights in ICA, we can explain the contribution of estimated representations on reconstructions as follows: The increment of the $k$ -th element of estimated representations by one unit yields the increment of the $l$ -th element of reconstructed observations, e.g., the $l$ -th element of reconstructed rs-fMRI functional connectivity, by $D {(x)}_{k l}$ units on average over $q_{ϕ} (z ❘ x)$ . With these scores, we can explain how estimated representations change reconstructions in each group, e.g., ASD and HC groups. For (ii), in the interpretable machine learning literature, input perturbation-based scores are types of feature importance measures used to explain how the outputs of complicated and nonlinear networks respond to the perturbation on latent components. This provides an estimate of how important each feature is for the model’s decision-making process. Our scores are input perturbation-based scores in which they average gradients, the marginal changes of outputs by decoders with respect to input (estimated) representations.

2.4.2 Numerical approximation for latent contribution scores

Numerical approximation for the proposed latent contribution scores consists of two parts: (i) approximating gradients for a given representation and (ii) averaging gradients computed in (i) over encoder distributions. For (i), we computed average slopes with small perturbations. Let $x$ be an observation and $\hat{z}$ be an estimated representation sampled from $q_{ϕ} (z ❘ x)$ . We first compute reconstructions using $\hat{z}$ and $\hat{z} + ε$ , denoted by $\hat{x} (\hat{z})$ and $\hat{x} (\hat{z} + e_{k})$ , respectively, where $e_{k}$ denotes the $k$ -th component of standard basis of $ℝ^{n}$ , and then compute $\hat{x} (\hat{z} + e_{k}) - \hat{x} (\hat{z})$ which is a numerical approximation of partial gradients up to constant multiplication. For (ii), in computing scores for the $k$ -th latent component, we used fixed points rather than sampling to provide deterministic scores. For all axes except the $k$ -th axis, we used means of encoder distributions, and for the $k$ -th axis, we used pre-specified grid points ranging from means minus three standard deviations to plus three standard deviations. We first approximate gradients at each grid point, and then average them to compute latent contribution scores.

2.5 Explaining rs-fMRI and brain networks with latent contribution scores

2.5.1 Model architecture

In our experiments on the rs-fMRI dataset, the observations $X$ are the lower triangular part of functional connectivity matrices from the Power atlas with 264 ROIs. Each element is the correlation of the resting state activity between one of 264 brain regions and another (Figure 2). Both the encoder and decoder have one hidden layer. The sizes of the respective layers were chosen by performing a sparse grid search for each of the layers’ sizes independently and evaluating the performance of the model both with respect to the loss function (23). For hyperparameter tuning, we considered the following choices: {tanh, scaled exponential linear unit [SELU]} for activation functions (24); {20, 40, 50, 80, 100, 150, 200, 250} for the number of the hidden nodes; {2, 5, 10, 15, 20} for the latent dimension. We used the loss of VAEs for the model selection criterion. Both the encoder and decoder have one hidden layer. The sizes of the respective layers were chosen by performing a sparse grid search for each of the layers’ sizes independently and evaluating the performance of the model, both with respect to the loss function (23). The chosen number of hidden nodes and latent dimension are 80 and 5, respectively.

Figure 2

Figure 2 Diagram of VAE pipeline: The model was trained using rs-fMRI data. The samples were then split into a training+validation (70%) and independent-test (30%) data. Then 20% of the training data was set aside for validation and hyperparameter tuning. Once the training+validation was completed, the model’s performance was evaluated on the independent test data, which provides an unbiased estimate of how the model generalizes to unseen data. The resulting VAE model learned to encode patterns from the input brain features into its latent representation.

2.5.2 Training of VAEs

We trained VAEs in an unsupervised fashion without using labels about ASD and HC groups. We standardized the data with median and interquartile ranges and added Gaussian noise with a zero-mean and standard deviation of 0.1 to input data for denoising purposes to learn robust representations (25). The whole dataset was split into training+validation (70%), and test (30%) sets. The batch size, the number of epochs, and the weight decay were 128, 1000, and 0.1, respectively. We applied $L_{2}$ regularization. For the stopping criterion to evaluate convergence, we used the validation loss, negative ELBO. Figure 3 provides training and validation loss curves. There was no notable overfitting issue.

Figure 3

Figure 3 Training and validation loss curves from VAEs with the best validation parameters.

3 Results

We compared latent contribution scores of each latent variable based on the rsFC. Figure 4 provides a visualization of the top 0.05% resting-state functional connectivities with the highest latent contribution scores. The depicted brain network features change the most as the estimated representation (latent) changes.

Figure 4

Figure 4 Visualization of top 0.05% functional connectivity based on latent contribution scores. Each row shows results on each component of estimated representations. The left and right columns display results on the ASD and HC groups, respectively.

We quantified the latent contribution scores for the ASD and HC groups at the network level, as detailed in Table 2 and Figure 5. The ASD and HC groups share the top network connectivity for all estimated latent components. For example, within ventral attention network (VAN) contributes the most to latent 0 in both the ASD and HC. Similarly, latent 1 is primarily influenced by the rsFC between somatomotor (SMN)-memory retrieval networks, latent 2 is driven by the rsFC within SMN; latent 3 is driven by rsFC between memory retrieval-cerebellar networks, and latent 4 is driven by rsFC between cerebellar-dorsal attention networks in both groups.

Table 2

Table 2 Summary of top 3 network connectivity having the highest latent contribution scores.

Figure 5

Figure 5 Summary of top 15 network connectivity having the highest latent contribution scores. We averaged scores from rsFC across ROIs over the network level.

Among the rsFC network ROIs that contribute the most to each latent component (as shown in Table 2), we further compared them between the ASD and HC groups. We first conducted t-tests to filter ROIs with significantly different latent contribution scores between ASD and HC groups, using a significance level of 1%. Subsequently, we averaged the scores at the network level. We found significant differences in the latent contribution scores in the VAN network (top network driven by latent 0), and the SMN (top 2 network driven by latent 1).

We compared the mutual information (MI) between latent components from Denoising AEs (DAEs) (26) and from our method. DAEs focus solely on minimizing the reconstruction error, aiming for a closer one-to-one relation between the reduced dimensions and the original data, without learning data distributions. The smaller MI indicates weaker dependencies and, consequently, better disentanglement in interpretation. The results showed that our method exhibited a smaller test MI of 0.6253 compared to the DAEs, which had an MI of 0.6511. The DAEs were implemented by removing the KL regularization term from the ELBO.

4 Discussion

In this study, we proposed the use of latent contribution scores to explain nonlinear patterns identified by VAEs. These scores effectively capture the marginal changes in each component of the observations as the estimated representations change. With this toolkit, we were able to examine both quantitative and qualitative analyses of the differences in how a VAE-based model represents psychiatric disorders.

Specifically, we were able to quantify which brain networks most significantly contribute to each latent component that differentiated between ASD and HC. We identified five latent components, including within VAN driven latent 0; SMN-memory retrieval networks driven latent 1, memory retrieval-cerebellar networks driven latent 3, and cerebellar-dorsal attention networks driven latent 4. Among these 5 latent components, the latent contribution scores in the ventral attention network driven latent 0, and the SMN driven latent 1 are significantly different between the ASD and HC groups. The VAN and SMN are two important networks implicated in ASD. VAN plays a crucial role in processing sensory information and direct attention. Studies suggest that individuals with ASD have altered rsFC in the VAN, which could contribute to difficulties with focusing, maintaining and shifting attention and social communication (27). The SMN is involved in the processing of sensory information and controlling motor functions. Individuals with ASD often show different sensory processing compared with the HC group. Altered rsFC in the SMN could potentially contribute to these sensory processing differences. Moreover, altered rsFC in SMN may contribute to the motor coordination difficulties in individuals with ASD (28).

Our approach is generally applicable to a broader class of dimension reduction methods, including autoencoders and their derivatives, bidirectional generative adversarial networks (29, 30), and deep belief networks (31) that use probabilistic encoders and any desired imaging modality. In fact, our technique does not require the model to even be a neural network as long as it has mapping from high dimensional observations to estimated representations and vice versa, and the gradient can be numerically approximated. Some examples of models that fall in this category are VAE-based generative adversarial networks (GANs) (32) and hyperspherical VAEs (33). Another advantage of our approach lies in its visualization capability. When observations are visually perceived as in natural images, we can display our latent contribution scores. For example, when the data modality is 4D fMRI voxel-time space data, we can visualize the proposed contribution scores for each latent component in the 4D space and interpret their spatial-temporal patterns.

It is important to note that our analysis was only done on a VAE applied to ROI-to-ROI measures extracted from resting-state timeseries data. Other resting-state measurements such as amplitude of low-frequency fluctuation (ALFF) and regional homogeneity (ReHo) have not been used for this analysis and may be explored in the future works. Additionally, we used the Power atlas (4) with 264 ROIs in this study. Future studies could try leveraging rsFC matrices using different atlases as well. Moreover, as different MRI modalities contain complementary information for ASD, including task-based fMRI, T1 structural MRI and diffusion weighted imaging, fusing multiple modalities may provide additional information, and contribute to each latent component. The proposed deep learning model can potentially be used to combine different imaging modalities via stacked autoencoders, and explain the contributions of each modality to the latent components, which can help in understanding the mechanism of psychiatric disorders such as ASD. The ABIDE dataset primarily focuses on ASD, while ASD frequently coexists with attention deficit hyperactivity disorder (ADHD) and anxiety disorders (34), posing a significant challenge in differentiating the neurodevelopmental impacts of each condition. The comorbidity complicates the analysis, as the overlapping symptoms and neurobiological features may obscure the specific contributions of ASD to brain network configurations. Further research could test the generalizability of our model with respect to comorbidity with other disorders. For example, by comparing latent contribution scores across three groups - HC, ASD without comorbidities, and ASD with comorbidities - we can better dissect the interaction between these disorders and their associations with brain networks.

5 Conclusion

In conclusion, our proposed latent contribution scores enhance the interpretability of deep learning models. These models, applied to the rs-fMRI data, can be understood and interpreted by humans. Moreover, explainable VAEs offer insights into which features, from either single modality or a combination of multiple modalities, are most important for particular prediction tasks, such as the classification of ASD from HC. This is valuable for feature engineering and for understanding the underlying neural mechanisms of psychiatric disorders.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Author contributions

YouK: Writing – original draft, Writing – review & editing. OR: Writing – original draft, Writing – review & editing. XYZ: Investigation, Software, Visualization, Writing – review & editing. YooK: Investigation, Writing – review & editing. YN: Investigation, Writing – review & editing. SL: Investigation, Writing – review & editing. XH: Writing – original draft, Writing – review & editing. XZ: Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. XZ is supported by NIH K01MH122774 and by a NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation 27040; YouK is supported by NIH R01MH124106.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Monk CS, Peltier SJ, Wiggins JL, Weng SJ, Carrasco M, Risi S, et al. Abnormalities of intrinsic functional connectivity in autism spectrum disorders. Neuroimage. (2009) 47:764–72. doi: 10.1016/j.neuroimage.2009.04.069

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Khosla M, Jamison K, Ngo GH, Kuceyeski A, Sabuncu MR. Machine learning in resting-state fMRI analysis. Magn Reson Imaging. (2019) 64:101–21. doi: 10.1016/j.mri.2019.05.031

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Seitzman BA, Gratton C, Marek S, Raut RV, Dosenbach NUF, Schlaggar BL, et al. A set of functionally-defined brain regions with improved representation of the subcortex and cerebellum. Neuroimage. (2020) 206:116290. doi: 10.1016/j.neuroimage.2019.116290

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Power JD, Cohen AL, Nelson SM, Wig GS, Barnes KA, Church JA, et al. Functional network organization of the human brain. Neuron. (2011) 72:665–78. doi: 10.1016/j.neuron.2011.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Gordon EM, Laumann TO, Adeyemo B, Huckins JF, Kelley WM, Petersen SE. Generation and evaluation of a cortical area parcellation from resting-state correlations. Cereb Cortex. (2016) 26:288–303. doi: 10.1093/cercor/bhu239

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Johnstone IM, Paul D. PCA in high dimensions: an orientation. Proc IEEE Inst Electr Electron Eng. (2018) 106:1277–92. doi: 10.1109/JPROC.2018.2846730

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Le QKA, Ngiam J, Ng AY. ICA with reconstruction cost for efficient overcomplete feature learning. Adv Neural Inf Process Syst. (2011) 24.

Google Scholar

8. Heinsfeld AS, Franco AR, Craddock RC, Buchweitz A, Meneguzzi F. Identification of autism spectrum disorder using deep learning and the ABIDE dataset. NeuroImage Clin. (2018) 17:16–23. doi: 10.1016/j.nicl.2017.08.017

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Khodatars M, Shoeibi A, Sadeghi D, Ghaasemi N, Jafari M, Moridian P, et al. Deep learning for neuroimaging-based diagnosis and rehabilitation of Autism Spectrum Disorder: A review. Comput Biol Med. (2021) 139:104949. doi: 10.1016/j.compbiomed.2021.104949

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Yang YM, Wu QMJ, Wang YN. Autoencoder with invertible functions for dimension reduction and image reconstruction. IEEE T Syst Man Cy-S. (2018) 48:1065–79. doi: 10.1109/TSMC.2016.2637279

CrossRef Full Text | Google Scholar

11. Lopez R, Regier J, Jordan MI, Yosef N. Information constraints on auto-encoding variational bayes. Adv Neur In. (2018) 31.

Google Scholar

12. Hyvarinen A, Pajunen P. Nonlinear independent component analysis: Existence and uniqueness results. Neural Netw. (1999) 12:429–39. doi: 10.1016/S0893-6080(98)00140-3

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Khemakhem I, Kingma DP, Monti RP, Hyvarinen A. Variational autoencoders and nonlinear ICA: A unifying framework. Pr Mach Learn Res. (2020) 108:2207–16.

Google Scholar

14. Khemakhem IKDP, Monti RP, Hyvärinen A. Variational autoencoders and nonlinear ICA: A unifying framework. arXiv. (2020) 1907:04809.

Google Scholar

15. Christopher PBHI, Pal A, Matthey L, Watters N, Desjardins G, lerchner A. Understanding disentangling in β-VAE. arXiv. (2018) 1804:03599.

Google Scholar

16. Kim JH, Zhang YZ, Han K, Wen ZY, Choi M, Liu ZM. Representation learning of resting state fMRI with variational autoencoder. Neuroimage. (2021) 241. doi: 10.1016/j.neuroimage.2021.118423

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Choi H. Functional connectivity patterns of autism spectrum disorder identified by deep feature learning. ArXiv. (2017) 1707:07932.

Google Scholar

18. Pomponio R, Erus G, Habes M, Doshi J, Srinivasan D, Mamourian E, et al. Harmonization of large MRI datasets for the analysis of brain imaging patterns throughout the lifespan. Neuroimage. (2020) 208:116450. doi: 10.1016/j.neuroimage.2019.116450

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Hoffman MD, Blei DM, Wang C, Paisley J. Stochastic variational inference. J Mach Learn Res. (2013) 14:1303–47.

Google Scholar

20. Hyvarinen A, Pajunen P. Nonlinear independent component analysis: Existence and uniqueness results. Neural Networks. (1999) 12:429–39. doi: 10.1016/S0893-6080(98)00140-3

CrossRef Full Text | Google Scholar

21. Ribeiro MT, Singh S, Guestrin C. (2016). Why should I trust you?” Explaining the predictions of any classifier, in: Kdd’16: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, . pp. 1135–44.

Google Scholar

22. Fong RC, Vedaldi A. Interpretable explanations of black boxes by meaningful perturbation. IEEE I Conf Comp Vis. (2017), 3449–57. doi: 10.1109/ICCV.2017.371

CrossRef Full Text | Google Scholar

23. Sheela KGD SN. Review on methods to fix number of hidden neurons in neural networks. Math Problems Eng. (2013). doi: 10.1155/2013/425740

CrossRef Full Text | Google Scholar

24. Klambauer GU,T, Mayr A, Hochreiter S. Self-Normalizing neural networks. arXiv. (2017) 1706:02515.

Google Scholar

25. Im DJ, Ahn S, Memisevic R, Bengio Y. Denoising criterion for variational auto-encoding framework. Aaai Conf Artif Inte. (2017), 2059–65. doi: 10.1609/aaai.v31i1.10777

CrossRef Full Text | Google Scholar

26. Vincent PLH, Bengio Y, Manzagol P. Extracting and composing robust features with denoising autoencoders. (2008), 1096–103. doi: 10.1145/1390156

CrossRef Full Text | Google Scholar

27. Farrant K, Uddin LQ. Atypical developmental of dorsal and ventral attention networks in autism. Dev Sci. (2016) 19:550–63. doi: 10.1111/desc.12359

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Wang C, Yang L, Lin Y, Wang C, Tian P. Alteration of resting-state network dynamics in autism spectrum disorder based on leading eigenvector dynamics analysis. Front Integr Neurosci. (2022) 16:922577. doi: 10.3389/fnint.2022.922577

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Hong T, Huang P, Zhai X, Gu C, Tian B, Jin B, et al. MARS-GAN: multilevel-feature-learning attention-aware based generative adversarial network for removing surgical smoke. IEEE Trans Med Imaging. (2023) 42:2299–312. doi: 10.1109/TMI.2023.3245298

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Jeff DKP, Darrell T. Adversarial feature learning. ArXiv. (2016) 1605:09782.

Google Scholar

31. Song SH, Kim DK. Development of a stress classification model using deep belief networks for stress monitoring. Healthc Inform Res. (2017) 23:285–92. doi: 10.4258/hir.2017.23.4.285

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Razghandi Z MH, Erol-Kantarci M, Turgut D. Variational autoencoder generative adversarial network for synthetic data generation in smart home. ArXiv. (2022) 2201:07387. doi: 10.1109/ICC45855.2022.9839249

CrossRef Full Text | Google Scholar

33. Davidson TRFL, Cao ND, Kipf T, Tomczak JM. Hyperspherical variational auto-encoders. ArXiv. (2022) 1804:00891.

Google Scholar

34. Avni E, Ben-Itzchak E, Zachor DA. The presence of comorbid ADHD and anxiety symptoms in autism spectrum disorder: clinical presentation and predictors. Front Psychiatry. (2018) 9:717. doi: 10.3389/fpsyt.2018.00717

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: deep learning, variational autoencoder, resting state fMRI, functional connectivity, autism spectrum disorder

Citation: Kim Y-g, Ravid O, Zheng X, Kim Y, Neria Y, Lee S, He X and Zhu X (2024) Explaining deep learning-based representations of resting state functional connectivity data: focusing on interpreting nonlinear patterns in autism spectrum disorder. Front. Psychiatry 15:1397093. doi: 10.3389/fpsyt.2024.1397093

Received: 06 March 2024; Accepted: 26 April 2024;
Published: 20 May 2024.

Edited by:

Mingkuan Sun, Nanjing Medical University, China

Reviewed by:

Caglar Uyulan, Izmir Kâtip Çelebi University, Türkiye
Lejun Gong, Nanjing University of Posts and Telecommunications, China

Copyright © 2024 Kim, Ravid, Zheng, Kim, Neria, Lee, He and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiaofu He, eGlhb2Z1LmhlQG55c3BpLmNvbHVtYmlhLmVkdQ==; Xi Zhu, eGkuemh1QG55c3BpLmNvbHVtYmlhLmVkdQ==

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Explaining deep learning-based representations of resting state functional connectivity data: focusing on interpreting nonlinear patterns in autism spectrum disorder

1 Introduction

2 Methods

2.1 Dataset

2.2 Image acquisition and processing

2.3 Variational autoencoders

2.3.1 Model architectures

2.3.2 Loss function

2.4 Latent contribution scores to explain nonlinear representations

2.4.1 Latent contribution scores

2.4.2 Numerical approximation for latent contribution scores

2.5 Explaining rs-fMRI and brain networks with latent contribution scores

2.5.1 Model architecture

2.5.2 Training of VAEs

3 Results

4 Discussion

5 Conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good