Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci.
Sec. Technical Advances in Plant Science
Volume 15 - 2024 | doi: 10.3389/fpls.2024.1492504
This article is part of the Research Topic Non-Destructive Quality Assessment and Intelligent Packaging of Agricultural Products View all 4 articles

BHC-YOLOV8:Improved YOLOv8-based BHC target detection model for tea leaf disease and defect in real-world scenarios

Provisionally accepted
Zhan BaiShao Zhan BaiShao 1*Xiong Xi Xiong Xi 1Xiaoli Li Xiaoli Li 2Luo Wei Luo Wei 1
  • 1 School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, China
  • 2 College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, Zhejiang Province, China

The final, formatted version of the article will be published soon.

    The detection efficiency of tea diseases and defects ensures the quality and yield of tea. However, in actual production, on the one hand, the tea plantation has high mountains and long roads, and the safety of inspection personnel cannot be guaranteed; On the other hand, the inspection personnel have factors such as lack of experience and fatigue, resulting in incomplete and slow testing results. Introducing visual inspection technology can avoid the above problems. Firstly, a dynamic sparse attention mechanism (Bi Former) is introduced into the model backbone.It filters out irrelevant key value pairs at the coarse region level, utilizing sparsity to save computation and memory; Jointly apply fine region token to token attention in the remaining candidate regions.Secondly, Haar wavelets are introduced to improve the down sampling module. By processing the input information flow horizontally, vertically, and diagonally, the original image is reconstructed..Finally, a new feature fusion network is designed using a multi-head attention mechanism to decompose the main network into several cascaded stages, each stage comprising a sub-backbone for parallel processing of different features. Simultaneously, skip con-nections are performed on features from the same layer, and unbounded fusion weight normalization is introduced to constrain the range of each weight value.After the above improvements, the confidence level of the current mainstream models increased by 7.1%, mAP0.5 increased by 8%, and reached 94.5%. After conducting ablation experiments and comparing with mainstream models, the feature fusion network proposed in this paper reduced computational complexity by 10.6 GFlops, increased confidence by 2.7%, and increased mAP0.5 by 3.2%.This paper developed a new network based on YOLOv8 to overcome the difficulties of tea diseases and defects such as small target, multiple occlusion and complex background.

    Keywords: Bi Former, Haar, Down sampling, Skip connections, YOLOv8, Tea

    Received: 09 Sep 2024; Accepted: 28 Oct 2024.

    Copyright: © 2024 BaiShao, Xi, Li and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Zhan BaiShao, School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.