Skip to main content

ORIGINAL RESEARCH article

Front. Comput. Sci.
Sec. Computer Vision
Volume 6 - 2024 | doi: 10.3389/fcomp.2024.1480481
This article is part of the Research Topic Foundation Models for Healthcare: Innovations in Generative AI, Computer Vision, Language Models, and Multimodal Systems View all 4 articles

Towards Improving Precision and Complexity of Transformer-based Cost-sensitive Learning Models for Plant Disease Detection

Provisionally accepted
Manh-Tuan Do Manh-Tuan Do *Manh-Hung Ha Manh-Hung Ha *Duc-Chinh Nguyen Duc-Chinh Nguyen *Oscal Tzyh-Chiang Chen Oscal Tzyh-Chiang Chen *
  • Vietnam National University, Hanoi, Hanoi, Vietnam

The final, formatted version of the article will be published soon.

    Early and accurate detection of plant diseases is crucial for making informed decisions to increase the yield and quality of crops through the decision of appropriate treatments. This study introduces an automated system for early disease detection in plants that enhanced a lightweight model based on the robust machine learning algorithm. In particular, we introduced a transformer module, a fusion of the SPP and C3TR modules, to synthesize features in various sizes and handle uneven input image sizes. The proposed model combined with transformerbased long-term dependency modeling and convolution-based visual feature extraction to improve object detection performance. To optimize a model to a lightweight version, we integrated the proposed transformer model with the Ghost module. Such an integration acted as regular convolutional layers that subsequently substituted for the original layers to cut computational costs. Furthermore, we adopted the SIoU loss function, a modified version of CIoU, applied to the YOLOv8s model, demonstrating a substantial improvement in accuracy. We implemented quantization to the YOLOv8 model using ONNX Runtime to enhance to facilitate real-time disease detection on strawberries. Through an experiment with our dataset, the proposed model demonstrated mAP@.5 characteristics of 80.30%, marking an 8% improvement compared to the original YOLOv8 model. In addition, the parameters and complexity were reduced to approximately one-third of the initial model. These findings demonstrate notable improvements in accuracy and complexity reduction, making it suitable for detecting strawberry diseases in diverse conditions.

    Keywords: DNN, transformer, Ghost Conv, SIoU loss function, Pre-trained, quantization, Android application

    Received: 14 Aug 2024; Accepted: 31 Dec 2024.

    Copyright: © 2024 Do, Ha, Nguyen and Tzyh-Chiang Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Manh-Tuan Do, Vietnam National University, Hanoi, Hanoi, Vietnam
    Manh-Hung Ha, Vietnam National University, Hanoi, Hanoi, Vietnam
    Duc-Chinh Nguyen, Vietnam National University, Hanoi, Hanoi, Vietnam
    Oscal Tzyh-Chiang Chen, Vietnam National University, Hanoi, Hanoi, Vietnam

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.