AUTHOR=Zhu Dongyu, Han Junying, Liu Chengzhong, Zhang Jianping, Qi Yanni TITLE=Modeling of flaxseed protein, oil content, linoleic acid, and lignan content prediction based on hyperspectral imaging JOURNAL=Frontiers in Plant Science VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/plant-science/articles/10.3389/fpls.2024.1344143 DOI=10.3389/fpls.2024.1344143 ISSN=1664-462X ABSTRACT=
Protein, oil content, linoleic acid, and lignan are key indicators of flaxseed quality. To improve testing methods for the nutritional quality of flaxseed and to make the screening of high-quality flax germplasm resources more efficient, we studied 30 flaxseed species widely cultivated in Northwest China. First, we collected hyperspectral data from the seeds, together with measurements of protein, oil content, linoleic acid, and lignan, and used the SPXY algorithm to partition the sample set into calibration and prediction sets. Next, the spectral data were preprocessed with seven different methods; the PLSR model performed best on spectra treated with Savitzky-Golay (SG) smoothing. Feature wavelengths were extracted using the Successive Projections Algorithm (SPA) and Competitive Adaptive Reweighted Sampling (CARS). Finally, four quantitative analysis models were established: Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), Multiple Linear Regression (MLR), and Principal Component Regression (PCR). Among all models for predicting protein content, the SG-CARS-MLR model performed best, with coefficients of determination for the calibration and prediction sets of 0.9563 and 0.9336 and a corresponding Root Mean Square Error of Calibration (RMSEC) and Root Mean Square Error of Prediction (RMSEP) of 0.4892 and 0.5616, respectively. In the optimal prediction models for oil content, linoleic acid and lignan, the