Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci.
Sec. Plant Bioinformatics
Volume 16 - 2025 | doi: 10.3389/fpls.2025.1511097

Identification of maize kernel varieties based on interpretable ensemble algorithms

Provisionally accepted
春光 毕 春光 毕 1,2Xinhua Bi Xinhua Bi 2Jinjing Liu Jinjing Liu 2*Hao Xie Hao Xie 2*Shuo Zhang Shuo Zhang 3*He Chen He Chen 2*Mohan Wang Mohan Wang 4*Lei Shi Lei Shi 1,2*Shaozhong Song Shaozhong Song 5*
  • 1 Institute for the Smart Agriculture, Jilin Agricultural University, ChangChun, China
  • 2 College of Information Technology, Jilin Agricultural University, ChangChun, China
  • 3 Changchun Humanities and Sciences College, Changchun, Hebei Province, China
  • 4 Jilin Zhongnong Sunshine Data Company Limited, Changchun, Hebei Province, China
  • 5 School of Data Science and Artificial Intelligence, Jilin Engineering Normal University, Changchun, China

The final, formatted version of the article will be published soon.

    Maize kernel variety identification is extremely important for reducing maize storage losses and ensuring food security. However, traditional single models show limitations when processing largescale multimodal data. The aim of this study is to construct an interpretable ensemble learning model for maize seed grain variety identification by improving differential evolutionary algorithm through multimodal data fusion. First, morphological and hyperspectral data of maize samples were extracted and preprocessed, and three methods were used to screen features, respectively. Subsequently, morphological data and hyperspectral data are fused, the base learner of the Stacking integration model was selected by diversity index and performance index, and the parameters of the base learner are optimized using a differential evolution algorithm that introduces multiple mutation strategies, dynamically adjusts mutation factors and recombination rates. Finally, Shapley Additive exPlanation is used for interpretable ensemble learning. The experimental results show that the accuracy of HDE-Stacking identification can reach 97.78%, in which the bands of 784 nm, 910 nm, 732 nm, 962 nm, and 666 nm have a positive impact on the identification results. This research provides a scientific basis for the efficient identification of corn kernels of different varieties, which helps to improve the accuracy and traceability of germplasm resource management. It also has important practical value in agricultural production, can effectively improve the efficiency of quality management, and provides technical support to ensure food security.

    Keywords: Maize kernel, Variety identification, Stacking ensemble model, Multimodal data, differential evolutionary algorithm, SHAP value

    Received: 14 Oct 2024; Accepted: 20 Jan 2025.

    Copyright: © 2025 毕, Bi, Liu, Xie, Zhang, Chen, Wang, Shi and Song. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Jinjing Liu, College of Information Technology, Jilin Agricultural University, ChangChun, China
    Hao Xie, College of Information Technology, Jilin Agricultural University, ChangChun, China
    Shuo Zhang, Changchun Humanities and Sciences College, Changchun, 130117, Hebei Province, China
    He Chen, College of Information Technology, Jilin Agricultural University, ChangChun, China
    Mohan Wang, Jilin Zhongnong Sunshine Data Company Limited, Changchun, Hebei Province, China
    Lei Shi, Institute for the Smart Agriculture, Jilin Agricultural University, ChangChun, China
    Shaozhong Song, School of Data Science and Artificial Intelligence, Jilin Engineering Normal University, Changchun, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.