- 1Institute for High Performance Computing and Networking of the National Research Council of Italy, Naples, Italy
- 2Manchester Metropolitan University, Manchester, United Kingdom
- 3University of Cagliari, Cagliari, Italy
- 4The University of Manchester, Manchester, United Kingdom
Editorial on the Research Topic
Language Representation and Learning in Cognitive and Artificial Intelligence Systems
Introduction
In recent years, the rise of deep learning has transformed the field of Natural Language Processing (NLP), thus, producing models based on neural networks with impressive achievements in various tasks, such as language modeling (Devlin et al., 2019), syntactic parsing (Pota et al., 2019), machine translation (Artetxe et al., 2017), sentiment analysis (Fu et al., 2019), and question answering (Zhang et al., 2019). This progress has been accompanied by a myriad of new end-to-end neural network architectures able to map input text to some output prediction. On the other hand, architectures inspired by human cognition have recently appeared (Dominey, 2013; Hinaut and Dominey, 2013; Golosio et al., 2015), this is aimed at modeling language comprehension and learning by means of neural models built according to current knowledge on how verbal information is stored and processed in the human brain.
Despite the success of deep learning in different NLP tasks and the interesting attempts of cognitive systems, natural language understanding still remains an open challenge for machines.
The goal of this Research Topic is to describe novel and very interesting theoretical studies, models, and case studies in the areas of NLP as well as Cognitive and Artificial Intelligence (AI) systems, based on knowledge and expertise coming from heterogeneous but complementary disciplines (machine/deep learning, robotics, neuroscience, psychology).
Papers Included in This Research Topic
Stille et al. propose a large-scale neural model, including cognitive and lexical levels of the human neural system, with the aim of simulating the human behavior occurring in medical screenings. The large-scale neural model is biologically inspired and built by exploiting the Neural Engineering Framework and the Semantic Pointer Architecture. The authors simulate parts of both the screenings, using either the normal neural model or the neural model including neural deficits. The simulated screenings are focused on the detection of developmental problems in lexical storage and retrieval, as well as of mild cognitive impairment and early dementia.
Jacobs proposes a heuristic tool called SentiArt for realizing different sentiment analyses for text segments and figures. The tool uses vector space models together with theory-guided and empirically validated label lists to compute the valence of each word in a text by locating its position in a 2d emotion potential space spanned by the >2 million words of the vector space model. By means of two computational poetics studies, the author experimentally shows the ability of SentiArt to determine the emotion of text passages and to compute emotional figure profiles and personality figure profiles for main characters from the book series (stories, novels, plays, or ballads).
Ferrone and Zanzotto describe a survey aimed to deeply investigate the link between symbolic and distributed/distributional representations of Natural Language. In particular, the survey describes the general concept of representation, the notion of concatenative composition and the difference between local and distributed representations. Furthermore, it deeply addresses the general issue of compositionality, analyzing three different approaches: compositional distributional semantics, holographic reduced representations and recurrent neural networks.
Nakashima et al. presents a new unsupervised machine learning method for phoneme and word discovery from multiple speakers. Human infants can acquire knowledge of phonemes and words from interactions with their mother as well as with others surrounding them. Authors propose a phoneme and word discovery method that simultaneously uses non-parametric Bayesian double articulation analyzer and deep sparse autoencoder with parametric bias in a hidden layer. Their system reduces the negative effect of speaker-dependent acoustic features in an unsupervised manner by using a speaker index required to be obtained through another speaker recognition method. This can be regarded as a more natural computational model of phoneme and word discovery by humans, because it does not use transcription.
Wallbridge et al. proposes a dynamic method of communication between robots and humans in order to generate Spatial Referring Expressions describing a location. The focus of most algorithms for generation is to create a non-ambiguous description, but this is not how people naturally communicate. The authors call dynamic description how humans tend to give an underspecified description and then rely on a strategy of repair to reduce the number of possible locations or objects until the correct one is identified. The authors present a method for generating these dynamic descriptions for Human Robot Interaction, using machine learning to generate repair statements in a two-dimensional environment (game-like scenario).
Miyazawa et al. presents a unified framework, integrating a cognitive architecture in a real robot for the simultaneously comprehension of concepts, actions, and language. Their integration is based on various cognitive modules and leveraging mainly multimodal categorization by using multilayered multimodal latent Dirichlet allocation (mMLDA). The integration of reinforcement learning and mMLDA enables actions based on understanding. Furthermore, the mMLDA, in conjunction with grammar learning and based on the Bayesian hidden Markov model, allows the robot to verbalize its own actions and understand user utterances. Decision making and language understanding by using abstracted concepts are verified using a real robot.
Conclusions
Despite relevant progress having been made in the field of AI applied to NLP in the last decade, the goal of creating truly human-like intelligent systems still seems very distant. The difficulties encountered in the development of the most recent systems clearly show that the problem of human-machine interaction through natural language can no longer be addressed as a simple input-output problem. To make a qualitative leap, AI systems should become more complete multimodal systems which are able to integrate skills in areas of AI that are currently treated separately and should be capable of developing an internal representation of the external world through the combination of other information besides the verbal one. Such combination can be achieved through the integration of AI systems in robots. Embodied architectures in robots should be able to learn in a similar way to humans through interaction with humans themselves and be capable of proactively adapting operating in the environment to find the information necessary to learn and to interact more profitably with humans. From this perspective, perhaps the approaches inspired by neuroscience and cognitive models can still provide new important ideas to this field.
Author Contributions
All authors contributed equally to manuscript writing, read, and approved the final version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Artetxe, M., Labaka, G., Agirre, E., and Cho, K. (2017). Unsupervised neural machine translation. arXiv preprint arXiv:1710.11041. doi: 10.18653/v1/D18-1399
Devlin, J., Chang, M. W., Lee, K., and Toutanova, K. (2019). “BERT: Pre-training of deep bidirectional transformers for language understanding.” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1:4171–86. doi: 10.18653/v1/N19-1423
Dominey, P. F. (2013). Recurrent temporal networks and language acquisition—from corticostriatal neurophysiology to reservoir computing. Front. Psychol. 4:500. doi: 10.3389/fpsyg.2013.00500
Fu, X., Wei, Y., Xu, F., Wang, T., Lu, Y., Li, J., et al. (2019). Semi-supervised aspect-level sentiment classification model based on variational autoencoder. Knowledge Based Syst. 171, 81–92. doi: 10.1016/j.knosys.2019.02.008
Golosio, B., Cangelosi, A., Gamotina, O., and Masala, G. L. (2015). A cognitive neural architecture able to learn and communicate through natural language. PLoS ONE 10:e0140866. doi: 10.1371/journal.pone.0140866
Hinaut, X., and Dominey, P. F. (2013). Real-time parallel processing of grammatical structure in the fronto-striatal system: a recurrent network simulation study using reservoir computing. PLoS ONE 8:e52946. doi: 10.1371/journal.pone.0052946
Pota, M., Marulli, F., Esposito, M., De Pietro, G., and Fujita, H. (2019). Multilingual POS tagging by a composite deep architecture based on character-level features and on-the-fly enriched Word Embeddings. Knowledge Based Syst. 164, 309–323. doi: 10.1016/j.knosys.2018.11.003
Zhang, Z., Wu, Y., Zhou, J., Duan, S., and Zhao, H. (2019). SG-Net: Syntax-guided machine reading comprehension. The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI, 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, (New York, NY), 9636–43.
Keywords: Natural Language Processing (NLP), artificial intelligence, cognitive systems, robotics, deep learning, machine learning, language representation and language processing
Citation: Esposito M, Masala GL, Golosio B and Cangelosi A (2020) Editorial: Language Representation and Learning in Cognitive and Artificial Intelligence Systems. Front. Robot. AI 7:69. doi: 10.3389/frobt.2020.00069
Received: 13 February 2020; Accepted: 27 April 2020;
Published: 16 June 2020.
Edited and reviewed by: Mikhail Prokopenko, University of Sydney, Australia
Copyright © 2020 Esposito, Masala, Golosio and Cangelosi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Massimo Esposito, bWFzc2ltby5lc3Bvc2l0byYjeDAwMDQwO2ljYXIuY25yLml0