Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Technical Advances in Plant Science

Volume 16 - 2025 | doi: 10.3389/fpls.2025.1423682

This article is part of the Research Topic Machine Vision and Machine Learning for Plant Phenotyping and Precision Agriculture, Volume II View all 7 articles

Improved RT-DETR and its application to fruit ripeness detection

Provisionally accepted
  • 1 Wuhan University of Science and Technology, Wuhan, Hubei Province, China
  • 2 National University of Defense Technology, Changsha, China

The final, formatted version of the article will be published soon.

    Crop maturity status recognition is a pivotal component of automated harvesting. Traditional manual detection methods are plagued by inefficiency and high costs. To address this challenge, we propose enhancements to the Real-Time DEtection TRansformer(RT-DETR) method for crop maturity detection. We refine the original model's Backbone structure: firstly, we enhance the HG Block ( Highway Gated Block) by substituting the conventional convolution with the Rep Block during feature extraction. The incorporation of multiple branches in this block proves beneficial for enhancing model accuracy. Subsequently, we replace the conventional convolution in the Rep Block with Partial Convolution (PConv). PConv applies convolution solely to a portion of the input channel system, thus mitigating computational redundancy. Lastly, we introduce the Efficient Multi-Scale Attention (EMA)mechanism and adopt a cross-space learning approach to ensure uniform distribution of spatial semantic features within each feature group, thereby yielding superior results in terms of model parameters. We conduct ablation experiments and juxtapose the RT-DETR model with its familial counterparts to substantiate the accuracy and efficacy of our proposed enhancements. The experimental findings affirm the efficacy of the refined algorithm in discerning crop maturity levels, resulting not only in improved detection accuracy but also in reductions in model parameters and computational complexity. Relative to the original model, the average accuracy mAP@0.5 witnessed a 2.9% enhancement, with a reduction in model size by 5.5% and computational complexity by 9.6%. In order to verify the generality of the model, we also compared the standard RT-DETR model, the YOLOv8 model, and the improved model on the plant pest detection datasets, and the experimental results show that the improved model outperforms the other models for general scenarios. This method holds significant implications for the advancement of automated crop maturity detection methodologies.

    Keywords: RT-DETR, PConv, Rep Block, EMA, Crop maturity detection

    Received: 03 Oct 2024; Accepted: 16 Jan 2025.

    Copyright: © 2025 Cao and Yun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Bai Yun, National University of Defense Technology, Changsha, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more