AUTHOR=Eguchi Akihiro , Horii Takato , Nagai Takayuki , Kanai Ryota , Oizumi Masafumi TITLE=An Information Theoretic Approach to Reveal the Formation of Shared Representations JOURNAL=Frontiers in Computational Neuroscience VOLUME=14 YEAR=2020 URL=https://www.frontiersin.org/journals/computational-neuroscience/articles/10.3389/fncom.2020.00001 DOI=10.3389/fncom.2020.00001 ISSN=1662-5188 ABSTRACT=

Modality-invariant categorical representations, i.e., shared representation, is thought to play a key role in learning to categorize multi-modal information. We have investigated how a bimodal autoencoder can form a shared representation in an unsupervised manner with multi-modal data. We explored whether altering the depth of the network and mixing the multi-modal inputs at the input layer affect the development of the shared representations. Based on the activation of units in the hidden layers, we classified them into four different types: visual cells, auditory cells, inconsistent visual and auditory cells, and consistent visual and auditory cells. Our results show that the number and quality of the last type (i.e., shared representation) significantly differ depending on the depth of the network and are enhanced when the network receives mixed inputs as opposed to separate inputs for each modality, as occurs in typical two-stage frameworks. In the present work, we present a way to utilize information theory to understand the abstract representations formed in the hidden layers of the network. We believe that such an information theoretic approach could potentially provide insights into the development of more efficient and cost-effective ways to train neural networks using qualitative measures of the representations that cannot be captured by analyzing only the final outputs of the networks.