AUTHOR=Hiippala Tuomo
TITLE=Rethinking multimodal corpora from the perspective of Peircean semiotics
JOURNAL=Frontiers in Communication
VOLUME=9
YEAR=2024
URL=https://www.frontiersin.org/journals/communication/articles/10.3389/fcomm.2024.1337434
DOI=10.3389/fcomm.2024.1337434
ISSN=2297-900X
ABSTRACT=
This article discusses annotating and querying multimodal corpora from the perspective of Peircean semiotics. Corpora have had a significant impact on empirical research in the field of linguistics and are increasingly considered essential for multimodality research as well. I argue that Peircean semiotics can be used to gain a deeper understanding of multimodal corpora and rethink the way we work with them. I demonstrate the proposed approach in an empirical study, which uses Peircean semiotics to guide the process of querying multimodal corpora using computer vision and vector-based information retrieval. The results show that computer vision algorithms are restricted to particular domains of experience, which may be circumscribed using Peirce's theory of semiotics. However, the applicability of such algorithms may be extended using annotations, which capture aspects of meaning-making that remain beyond algorithms. Overall, the results suggest that the process of building and analysing multimodal corpora should be actively theorized in order to identify new ways of working with the information stored in them, particularly in terms of dividing the annotation tasks between humans and algorithms.