Skip to main content

ORIGINAL RESEARCH article

Front. Big Data

Sec. Big Data Networks

Volume 8 - 2025 | doi: 10.3389/fdata.2025.1521653

This article is part of the Research Topic Interdisciplinary Approaches to Complex Systems: Highlights from FRCCS 2023/24 View all 4 articles

Lightweight and Hybrid Transformer-Based Solution for Quick and Reliable DeepFake Detection

Provisionally accepted
  • 1 Manipal University Jaipur, Jaipur, India
  • 2 Department of Computer Engineering, Modeling, Electronics and Systems, University of Calabria, Arcavacata, Calabria, Italy

The final, formatted version of the article will be published soon.

    Rapid advancements in artificial intelligence and generative artificial intelligence have enabled the creation of fake images and videos that appear highly realistic. According to a report published in 2022, approximately 71% of people rely on fake videos and become victims of blackmail.Moreover, these fake videos and images are used to tarnish the reputation of popular public figures. This has increased the demand for deepfake detection techniques. The accuracy of the techniques proposed in the literature so far varies with changes in fake content generation techniques. Additionally, these techniques are computationally intensive. The techniques discussed in the literature are based on convolutional neural networks, linformer models, or transformer models for deepfake detection, each with its advantages and disadvantages. In this manuscript, a hybrid architecture combining transformer and linformer models is proposed for deepfake detection. This architecture converts an image into patches and performs position encoding to retain spatial relationships between patches. Its encoder captures the contextual information from the input patches, and Gaussian Error Linear Unit resolves the vanishing gradient problem. The Linformer component reduces the size of the attention matrix. Thus, it reduces the execution time to half without compromising accuracy. Moreover, it utilizes the unique features of transformer and linformer models to enhance the robustness and generalization of deepfake detection techniques. The low computational requirement and high accuracy of 98.9% increase the real-time applicability of the model, preventing blackmail and other losses to the public.

    Keywords: Deep fake, Social safety, transformer, Blackmail, Computation, generative

    Received: 02 Nov 2024; Accepted: 11 Mar 2025.

    Copyright: © 2025 Rani, Kothekar, Philip, Dhaka, Zumpano and Vocaturo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Vijaypal Singh Dhaka, Manipal University Jaipur, Jaipur, India
    Eugenio Vocaturo, Department of Computer Engineering, Modeling, Electronics and Systems, University of Calabria, Arcavacata, 87036, Calabria, Italy

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more