Skip to main content

ORIGINAL RESEARCH article

Front. Genet.
Sec. Genomics of Plants and the Phytoecosystem
Volume 15 - 2024 | doi: 10.3389/fgene.2024.1518205
This article is part of the Research Topic Responses and Adaptation of Plants to Abiotic Stress: Genetics, Evolution and Molecular Insights View all 3 articles

Classical and machine learning tools for identifying yellow-seeded Brassica napus by fusion of hyperspectral features

Provisionally accepted
Fan Liu Fan Liu 1Fang Wang Fang Wang 2*Zaiqi Zhang Zaiqi Zhang 1Liang Cao Liang Cao 1Jinran Wu Jinran Wu 3Yougan Wang Yougan Wang 4
  • 1 Hunan University of Medicine, Huaihua, Hunan Province, China
  • 2 Xiangtan University, Xiangtan, China
  • 3 School of Science, Australian Catholic University, Brisbane, Australia
  • 4 The University of Queensland, Brisbane, Queensland, Australia

The final, formatted version of the article will be published soon.

    Due to its favorable traits-such as lower lignin content, higher oil concentration, and increased protein levels-the genetic improvement of yellow-seeded rapeseed has attracted more attention than other rapeseed color variations. Traditionally, yellow-seeded rapeseed has been identified visually, but the complex variability in the seed coat color of Brassica napus has made manual identification challenging and often inaccurate. Another method, using the RGB color system, is frequently employed but is sensitive to photographic conditions, including lighting and camera settings. In this study, we present four data-driven models to identify yellow-seeded Brassica napus using hyperspectral features combined with simple yet intelligent techniques. One model employs partial least squares regression (PLSR) to predict the R, G, and B color channels, effectively distinguishing yellow-seeded varieties from others according to globally accepted yellow-seed classification protocols. Another model uses logistic regression (Logit-R) to produce a probability-based assessment of yellow-seeded status. Additionally, we implement two intelligent models-random forest and support vector classifier-to evaluate features selected through lassopenalized logistic regression. Our findings indicate significant recognition accuracies of 96.55% and 98% for the PLSR and Logit-R models, respectively, aligning closely with the accuracy of previous methods. This approach represents a meaningful advancement in identifying yellowseeded rapeseed, with high recognition accuracy demonstrating the practical applicability of these models.

    Keywords: Rapeseed (Brassica napus), yellow-seeded, hyperspectral feature, Logistic regression, partial least squares regression, machine learning

    Received: 28 Oct 2024; Accepted: 26 Dec 2024.

    Copyright: © 2024 Liu, Wang, Zhang, Cao, Wu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Fang Wang, Xiangtan University, Xiangtan, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.