Skip to main content

ORIGINAL RESEARCH article

Front. Comput. Sci.
Sec. Computer Security
Volume 6 - 2024 | doi: 10.3389/fcomp.2024.1473457

MCOT: Multimodal Fake News Detection with Contrastive Learning and Optimal Transport

Provisionally accepted
  • University of Electronic Science and Technology of China, Chengdu, China

The final, formatted version of the article will be published soon.

    The proliferation of social media platforms has facilitated the spread of fake news, posing significant risks to public perception and societal stability. Existing methods for multimodal fake news detection have made important progress in combining textual and visual information but still face challenges in effectively aligning and merging these different types of data. These challenges often result in incomplete or inaccurate feature representations, thereby limiting overall performance. To address these limitations, we propose a novel framework named MCOT (Multimodal Fake News Detection with Contrastive Learning and Optimal Transport). MCOT integrates textual and visual information through three key components: cross-modal attention mechanism, contrastive learning, and optimal transport. Specifically, we first use cross-modal attention mechanism to enhance the interaction between text and image features. Then, we employ contrastive learning to align related embeddings while distinguishing unrelated pairs, and we apply optimal transport to refine the alignment of feature distributions across modalities. This integrated approach results in more precise and robust feature representations, thus enhancing detection accuracy. Experimental results on two public datasets demonstrate that the proposed MCOT outperforms state-of-the-art methods.

    Keywords: Fake news detection, Multimodal data, cross-modal attention, Contrastive learning, Optimal transport

    Received: 31 Jul 2024; Accepted: 28 Oct 2024.

    Copyright: © 2024 Shen, Huang, Hu, Cai and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Shimin Cai, University of Electronic Science and Technology of China, Chengdu, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.