- 1College of Resource and Environment, Anhui Science and Technology University, Fengyang, China
- 2College of Environmental, Hohai University, Nanjing, China
- 3Anhui Engineering Research Center of Smart Crop Planting and Processing Technology, Fengyang, China
- 4Anhui Province Agricultural Waste Fertilizer Utilization and Cultivated Land Quality Improvement Engineering Research Center, Anhui Science and Technology University, Fengyang, China
- 5College of Hydrology and Water Resources, Hohai University, Nanjing, China
The Soil Plant Analysis Development (SPAD) is a vital index for evaluating crop nutritional status and serves as an essential parameter characterizing the reproductive growth status of winter wheat. Non-destructive and accurate monitorin3g of winter wheat SPAD plays a crucial role in guiding precise management of crop nutrition. In recent years, the spectral saturation problem occurring in the later stage of crop growth has become a major factor restricting the accuracy of SPAD estimation. Therefore, the purpose of this study is to use features selection strategy to optimize sensitive remote sensing information, combined with features fusion strategy to integrate multiple characteristic features, in order to improve the accuracy of estimating wheat SPAD. This study conducted field experiments of winter wheat with different varieties and nitrogen treatments, utilized UAV multispectral sensors to obtain canopy images of winter wheat during the heading, flowering, and late filling stages, extracted spectral features and texture features from multispectral images, and employed features selection strategy (Boruta and Recursive Feature Elimination) to prioritize sensitive remote sensing features. The features fusion strategy and the Support Vector Machine Regression algorithm are applied to construct the SPAD estimation model for winter wheat. The results showed that the spectral features of NIR band combined with other bands can fully capture the spectral differences of winter wheat SPAD during the reproductive growth stage, and texture features of the red and NIR band are more sensitive to SPAD. During the heading, flowering, and late filling stages, the stability and estimation accuracy of the SPAD model constructed using both features selection strategy and features fusion strategy are superior to models using only a single feature strategy or no strategy. The enhancement of model accuracy by this method becomes more significant, with the greatest improvement observed during the late filling stage, with R2 increasing by 0.092-0.202, root mean squared error (RMSE) decreasing by 0.076-4.916, and ratio of performance to deviation (RPD) increasing by 0.237-0.960. In conclusion, this method has excellent application potential in estimating SPAD during the later stages of crop growth, providing theoretical basis and technical support for precision nutrient management of field crops.
1 Introduction
As a principal food crop worldwide, wheat provides vital carbohydrate sustenance to millions of people, holding an essential position in addressing global food security (Asseng et al., 2011). The Soil Plant Analysis Development (SPAD) indicator, which represents the relative chlorophyll content in leaves (Netto et al., 2005), is intimately connected with crop nutritional status, serving as a significant index of crop growth and development (Gitelson et al., 2005; Zhang et al., 2019; Liu et al., 2020). Consequently, the accurate and efficient acquisition of SPAD information for winter wheat is of paramount importance for field management decisions and crop nutrition monitoring. The conventional approaches for obtaining crop chlorophyll content primarily rely on destructive field sampling and laborious chemical analysis in the laboratory. These approaches are not only time-consuming and labor-intensive but also incur significant costs (Sampson et al., 2003). Existing research has demonstrated a close correlation between data obtained from the handheld SPAD-502 chlorophyll meter and the chlorophyll content analyzed chemically in a laboratory setting (Rigon et al., 2016). Although the SPAD acquisition method offers more convenience than traditional chlorophyll measurement techniques and allows for non-destructive sampling (Uddling et al., 2007), its limitations in measurement points entail certain labor costs, resulting in small operational scales, low efficiency, and difficulty in meeting the demands for large-scale acquisition of crop chlorophyll content information (Wang J et al., 2021; Wu et al., 2023). Therefore, there is a pressing need for a method that can rapidly, non-destructively, and accurately obtain crop SPAD information.
High-throughput phenotyping (HTP) technology has made this work possible, especially the unmanned aerial vehicle (UAV) phenotyping platform equipped with high-performance image sensors (Guo et al., 2021). Its advantages of high spatio-temporal resolution, flexibility, and easy operation have been widely applied to crop growth monitoring (Liu J et al., 2022; Yin et al., 2022; Fan et al., 2023). With the development of sensor technology, RGB sensors (Vélez et al., 2022), multispectral sensors (Impollonia et al., 2022), hyperspectral sensors (Zhou X et al., 2023), and lidar (Jimenez-Berni et al., 2018) have provided substantial data support for crop phenotyping, intelligent breeding, and more (Jin et al., 2015). Although RGB sensors are cost-effective, the spectral information they can acquire is limited, and the limited number of spectral bands often struggle to accurately capture the complete physiological and biochemical information of crops (Feng et al., 2022). Hyperspectral sensors and lidar offer high data accuracy, but due to their high cost, they are not suitable for widespread application in field crop monitoring (Madec et al., 2017; Fan et al., 2022). Therefore, the use of a lightweight, cost-effective UAV equipped with a multispectral sensor to capture remote sensing information of the winter wheat canopy for SPAD estimation becomes particularly essential.
Currently, spectral features of UAV imagery (vegetation indices, bands reflectance) have been extensively utilized in the field of precision agriculture for estimating crop physiological parameters (Luo et al., 2022), such as Leaf Area Index (LAI) (Qiao et al., 2022; Zhou et al., 2022), Above-Ground Biomass (AGB) (Han et al., 2022; Yue et al., 2023), SPAD (Wang J et al., 2021; Yang et al., 2021), and nitrogen content (Yang et al., 2019; Xu et al., 2023). However, under high canopy cover in the later stages of crop growth, vegetation indices can be affected by spectral saturation effects (Mutanga and Skidmore, 2004; Yue et al., 2019; Mutanga et al., 2023). Moreover, during the reproductive growth stage of crops, the emergence of crop spikes further increases the spectral mixing effect, reducing the sensitivity of vegetation indices (Jay et al., 2019; Wang W et al., 2021). It is challenging to establish a reliable SPAD estimation model using only spectral features during the reproductive growth stage. Texture features, as remote sensing information reflecting image grayscale attributes, color, and spatial structure, can not only characterize differences in canopy image spatial structure arrangement caused by crop growth and development (Liu et al., 2022a) but also reflect changes in crop leaf color due to different treatments such as varieties, fertilizers, and density (Li R. et al., 2022). Texture features have the potential to be combined with spectral features (Fu et al., 2021), and features fusion can reduce the impact of spectral saturation effects to enhance the accuracy of estimation models (Li et al., 2020). However, with the increase of features, while improving model accuracy, there may be information redundancy (Zheng et al., 2020), increasing data complexity. Therefore, to eliminate this redundancy, selecting appropriate features is extremely necessary for optimizing model structure and accuracy.
Features selection plays a pivotal role in machine learning, as it directly determines the input features of the prediction model, influencing its performance and effectiveness. RFE method is widely used in features selection nowadays (Zhu et al., 2020; Shao et al., 2022; Yang et al., 2022). This method is based on multiple rounds of training using a specific algorithm model. It constructs new feature subsets by removing particular variables and conducts the next round of training based on the new feature set (Chen et al., 2021; Rasel et al., 2021). This process is repeated until the model performs optimally. In recent years, the Boruta features selection method has quickly gained popularity due to its low operational cost and speed (Shu et al., 2021; Tatsumi et al., 2021; Shu et al., 2023). The goal of Boruta is to filter out all feature sets related to the target parameter. It allows a more comprehensive understanding of the factors influencing the target parameter, thereby effectively performing features selection (Kursa and Rudnicki, 2010).
Considering the impact of spectral saturation in the later stages of crop growth, features selection and features fusion have the potential to enhance the performance of prediction models. Identifying the remote sensing features that influence the SPAD of winter wheat in the later stages of growth and jointly constructing a high-accuracy SPAD model will be a key focus of future research. Therefore, this study utilizes a UAV equipped with a multispectral sensor to capture high-resolution images of the winter wheat canopy, and employs image processing technology to extract the spectral and texture features of the wheat, and utilizes two features selection methods (Boruta and RFE) to explore the potential of spectral and texture features in predicting SPAD during the later stages of winter wheat growth. We also develop a winter wheat SPAD monitoring model that combines features selection and features fusion strategy. The objectives of this study are: (1) to identify the spectral and texture features sensitive to SPAD during the reproductive growth stage of winter wheat; (2) to evaluate the performance of the winter wheat SPAD prediction model under features selection strategy; and (3) to explore the potential of combining features selection and features fusion strategy to estimate SPAD in the later stages of winter wheat growth.
2 Materials and methods
2.1 Field experimental design
The winter wheat experiment took place in Chuzhou City, located in Anhui Province, China (32°48′52″N,117°46′7″E) from 2020 to 2021 (Figure 1A). This region, positioned in the mid-lower reaches of the Yangtze River, experiences a subtropical monsoon climate. It is characterized by a moist weather pattern, distinct four seasons, an average annual temperature of 15.4°C, and an annual rainfall between 1000-1100mm. The region records an average of 144 rainy days per year and approximately 210 frost-free days annually.
Figure 1 Study area (A). The experiment field was in Chuzhou, Anhui province, China. (B), the unmanned aerial vehicle (UAV) for acquiring low-altitude remote sensing data and plates that correct 5%, 10%, 20% and 40% reflectivity of calibrate differences and (C) field experimental design.
The field experiment consisted of 36 plots, each measuring 16m² (2×8 m). It involved three winter wheat varieties with stable high-yield potential (V1: Huaimai 44, V2: Yannong 999, and V3: Ningmai 13) and four levels of nitrogen fertilizer application (N0-N3: 0, 100, 200, 300 kg/ha), with each treatment replicated three times (Figure 1C). The fertilization scheme involved the application of phosphorus (P=90kg/ha) and potassium (K=135 kg/ha) as base fertilizers before sowing, with a 6:4 ratio of nitrogen fertilizer applied pre-sowing and at the jointing stage. The sowing date was November 7, 2020, with the winter wheat manually strip-sown at a row spacing of 30 cm. The harvest took place on June 3, 2021, and other field management measures were based on local high-yield cultivation practices. Throughout the entire growth stage of the winter wheat, there were no occurrences of diseases, pests, or weed hazards, and the field environment was good, with no instances of drought or waterlogging.
2.2 UAV-based data collection and pre-processing
The experiment utilized a Phantom 4 Multispectral (P4M) RTK (DJI Technology Co., Shenzhen, China) UAV to collect multispectral (MS) data during the wheat heading stage on April 18, 2021, the flowering stage on April 29, 2021, and the late filling stage on May 24, 2021. The UAV is equipped with five monochrome sensors, each with 2.08 million pixels: Blue, Green, Red, Red edge, and NIR, with center wavelengths of 450 ± 16nm, 560 ± 16nm, 650 ± 16nm, 730 ± 16nm, and 840 ± 26nm, and bandwidths of 20nm, 20nm, 10nm, 10nm, 40nm, respectively. It is also equipped with a real-time kinematic (RTK) system, which allows for vertical positioning accuracy of ±1.5 cm and horizontal positioning accuracy of ±1cm, enabling the P4M to obtain high-precision spectral and texture information (Oktay and Coban, 2017). Flights were conducted between 11:00 and 13:00 on clear, windless days with stable solar radiation intensity. Four radiometric calibration panels with known reflectance were deployed on the ground for radiometric calibration during subsequent data processing (Figure 1B). The DJI GS PRO software (https://www.dji.com/cn/ground-station-pro/) was used to pre-plan the flight route, relying on the UAV’s autopilot system to execute the predefined flight plan (Şahin et al., 2022). Each flight lasted 20 minutes, with a flight altitude of 30 m, a flight speed of 2m/s, forward and side overlap of 90% and 85% respectively, and an image resolution of 1600×1300 pixels.
The UAV MS images collected at each growth stage were imported into the PIX4Dmapper software (version 4.4.12, Pix4D SA, Prilly, Switzerland) for image stitching. The initial step involved aligning the images using the feature point matching algorithm. Subsequently, a dense point cloud and texture mesh were generated by utilizing both the UAV image data and location information. To improve data quality, this study carried out radiometric correction on the MS images using the empirical line method (ELM) and the image information of ground radiometric calibration panels with known reflectance (Di Gennaro et al., 2022). A shapefile was generated using ArcGIS 10.2 (Environmental Systems Research Institute, Inc, RedLands, CA, USA), and plot areas were delineated based on the orthoimage of the experimental area.
2.3 SPAD data collection
Field measurements at each stage were conducted prior to the UAV flights on the same day, with chlorophyll values based on SPAD determined for winter wheat across the 36 plots. Winter wheat SPAD was measured using a SPAD-502 chlorophyll meter (Konica Minolta Optics Inc., Osaka, Japan) (Jiang et al., 2022). For each plot, three average-growth wheat plants were selected, and SPAD readings were taken at the 1/6, 3/6, and 5/6 length of the flag leaves. The average of the 9 SPAD values in the sample plot was used as the actual SPAD measurement in the sample plot. A total of 324 times were collected in each growth stage, and the measured SPAD data of 36 sample plots were calculated.
2.4 Extraction of features from UAV imagery
2.4.1 Spectral features extraction
This study used band reflectance and vegetation indices as spectral features (SF). Vegetation indices are obtained by calculations or combinations of characteristic bands, providing robust vegetation information factors and enhancing, to a certain extent, the expressive ability of remote sensing data (Bai et al., 2022). The reflectance of the five original bands from the wheat canopy during the heading, flowering, and late filling stages was obtained from the MS images for vegetation index calculation. This study selected a total of 30 widely-used spectral features for monitoring crop growth and evaluating parameters These features include five original bands (I), seven with only the visible bands (II), five with the RE band but without NIR band (III), eight with the NIR band but without RE band (IV), and five with both the NIR and RE bands (V) (Table 1).
2.4.2 Texture features extraction
As a common means of characterizing image information, texture reflects important information such as surface structure and spatial arrangement in the image without relying on brightness. It generally uses high-energy narrow peaks in the spectrum to detect the periodicity of the image (Bharati et al., 2004; Pu and Cheng, 2015). This study uses the Gray Level Co-occurrence Matrices (GLCM) method to extract the texture information of B, G, R, RE and NIR bands in winter wheat canopy MS images. GLCM requires user-defined parameters such as window size and orientation. The parameters for extracting GLCM textures in this study are as follows: a window size of 3 pixels by 3 pixels, an extraction direction set at 45°, and grayscale quantization levels defaulted to 64. The texture information includes Mean (Me), Variance (Va), Homogeneity (Ho), Contrast (Cn), Dissimilarity (Di), Entropy (En), Second moment (Se) and Correlation (Cr), a total of eight indicators (Table 2). In order to obtain more texture information for feature selection, the maximum value (MAX), minimum value (MIN), mean value (MEAN) and standard deviation (SD) of the GLCM index in each sample area were calculated, resulting in a total of 5(bands) × 8(GLCM indices) × 4(statistical metrics) = 160 texture metrics.
2.5 Features selection strategy and features fusion strategy
UAV multispectral remote sensing data was analyzed in order to identify the most effective approach to maximize improvements in SPAD estimates during later stages of growth (Figure 2). The features selection strategy is used to select SF and TF that are sensitive to SPAD. SPAD prediction model is further constructed to check the difference between implementing and not implementing the features selection strategy, different types of feature subsets are fused in the features fusion strategy to construct SPAD estimation model together.
Figure 2 Flowchart of features selection strategy and features fusion strategy applied in this experimental study.
2.5.1 Features selection
In this study, two features selection methods were selected as features selection strategy: Recursive feature elimination (RFE) and Boruta. The RFE algorithm is a greedy search algorithm that incrementally selects the optimal features based on the model’s performance. Initially, RFE requires the selection of a specific machine learning model to serve as the estimator, such as linear regression, support vector machines, and random forests (RF). The RFE algorithm first uses all features to train the model, calculates the importance of each feature and sorts it, and uses each feature subset to train the model, compares the model results obtained from each subset, and then recalculates the model based on specific features. Train the model and repeat this process until the optimal feature combination is selected to maximize model performance (Kuhn, 2008). Ultimately, RFE yields a subset of features that the algorithm deems the most representative and significant. By employing this recursive approach, RFE systematically eliminates irrelevant or redundant features while retaining the most pertinent ones, thereby enhancing the model’s generalizability and efficacy. The essence of the RFE algorithm lies in iteratively training the model and selecting features to identify an optimal subset, thereby optimizing the model’s performance and interpretability.
The Boruta algorithm is a wrapper based on the RF algorithm (Kursa and Rudnicki, 2010). Therefore, Boruta has the advantages of the random forest algorithm, has low running time cost, and can run results without relying on parameter adjustment. It is an ensemble method where multiple independent decision trees vote for classification, classify all trees based on a given attribute and calculate the importance of all trees, i.e. the Z score, which reflects the fluctuations in accuracy between trees in the forest. When Boruta is running, it will create “shadow” attributes obtained by reshuffling the original attributes, and randomly disrupt the order of feature parameters. When calculating feature importance, the feature parameters are divided into three categories, namely Z score and significant features with a Z score higher than the “Shadow” attribute are called “Confirmed” (important), features with a Z score close to the “Shadow” attribute are called “Tentative” (potentially important), and features with a Z score significantly lower than the “Shadow” attribute are called “Rejected” (not important). In previous studies (Bai et al., 2022; Li Z et al., 2022), “Confirmed+Tentative” was used as the final result of feature selection, and no actual research was conducted to explore which feature parameter set performs better in building a model. Therefore, this study divides the Boruta algorithm screening results into two categories: “Confirmed” and “Confirmed+Tentative”, analyzes the impact of the two feature parameter sets on model construction and compares their accuracy.
In this study, the features screened by the RFE method are prefixed with R-. The feature parameter sets screened by the Boruta method are “Confirmed” and “Confirmed+Tentative”, represented by C- and CT- respectively. For example, the spectral features screened by the RFE algorithm are prefixed with R-SF means.
2.5.2 Features fusion
Features fusion is a method of building a model by fusing different types of remote sensing features together (Liu et al., 2022b; Liu et al., 2023). In this study, the features fusion strategy is mainly divided into two parts: 1. First, fuse SF and TF, implement the features selection method to optimize, then build SPAD estimation models, and compare it with the model built by the fused features that did not participate in feature selection. 2. Based on the features selection strategy, the SPAD estimation model is constructed by fusing the selected feature subsets of different categories, to identify a SPAD estimation model that delivers optimal performance during the later stages of wheat growth.
2.6 Model development and accuracy evaluation
Based on R language version 4.1.3 (R Foundation, Vienna, Austria), the support vector machine regression (SVR) algorithm was used to predict winter wheat SPAD. The idea of SVR originated from the support vector machine. It transforms the input low-dimensional sample set into a high-dimensional space and finds a hyperplane that can be closest to all feature sample sets to implement regression, minimize structural risks, and improve sample discreteness (Smola and Schölkopf, 2004). In this study, the SPAD data set of winter wheat in each growth stage was divided into a calibration set and a validation set. The dataset was randomly sampled, 2/3 was used for model training (Calibration), and 1/3 was used to evaluate the performances (Validation) (Table 3). Three statistical indicators are used to test the machine learning model: the coefficient of determination (R2), root mean square error (RMSE), and the ratio of performance to deviation (RPD) (Bruning et al., 2019). Their calculation formulas are presented as Equations 1–3.
In the formula, and are the observed values and measured values of SPAD, is the average of SPAD observed values, n is the sample size, and SD is the standard deviation of reference values.
3 Results
3.1 Analysis of SPAD variation
Figure 3 illustrates the dynamic changes in SPAD for wheat under different nitrogen treatments, varieties, and growth stages. As observed in the Figure 3, SPAD exhibits noticeable differences across various nitrogen gradient treatments, generally trending upward with increasing nitrogen application, peaking at the N3 level. The performance of SPAD varies across growth stages, initially showing a slight increase followed by a sharp decrease. Notably, the SPAD values are highest during the Flowering stage and lowest in the Late filling stage. Furthermore, no significant differences in SPAD were found among different wheat varieties, with the trends being relatively consistent across various periods and nitrogen treatments.
3.2 The optical features with SF and TF
In this study, 30 spectral features and 160 texture features were extracted from UAV multispectral images. Not all features are helpful for wheat SPAD estimation, and redundant features may affect model accuracy. Optimizing remote sensing features sensitive to SPAD using features selection strategy (Figure 4). As can be seen from Figures 5A–C, during the heading stage, three identical feature sets were selected, encompassing all five types of spectral features. In the flowering stage, spectral feature types IV and V performed the most effectively, with C-SF and CT-SF selecting all features from types IV and V, while R-SF predominantly chose type IV as the optimal variable set. In the late filling stage, types IV and V continued to excel, and the three feature sets showed a high degree of consistency in favoring spectral feature types IV and V. The commonality between spectral feature types IV and V is their composition, which includes participation from the near-infrared band.
Figure 4 The optimal number of variables selected by Boruta and RFE features selection methods for different feature sets during different growth stages.
Figure 5 The radar chart of features selection for the three growth stages. (A–C) spectral features and (D–F) texture features.
The results of texture features band type selection for the three stages are shown in Figures 5D–F. During the heading stage, the R, RE, and NIR bands showed better texture performance, far exceeding the B and G bands. During the flowering stage, the trend of texture feature band types selected by the three methods was consistent, with the Red and NIR bands showing better performance. During the late filling stage, the texture feature band performances were consistent across the three selection methods, with no significant differences among the five texture feature band types. The texture performance of the R and NIR bands was relatively stable, and the number of retained features after selection was significantly higher than that of other bands, except during the late filling stage where there was no significant difference among the five bands.
3.3 Construction and validation of the SPAD prediction model were carried out using different strategy
3.3.1 Estimation of SPAD during the late growth stage of winter wheat using features selection strategy
In this study, SPAD estimation models were developed for the heading stage, flowering stage, and late filling stage based on spectral features and texture features extracted from UAV multispectral images. For each stage, the original feature set was included in the model construction process. A total of 24 (4*2*3) SPAD regression prediction models were built using four feature sets, two types of features (spectral features and texture features), and three growth stages (Figure 6).
From the perspective of model accuracy before and after feature selection, the accuracy of the models constructed using the selected dataset is generally higher compared to that of the initial dataset, but there are also some exceptions. For example, during the flowering stage, the SF dataset (Validation: R2 = 0.805, RMSE=3.211, RPD=1.998) has higher accuracy than the R-SF, C-SF, and CT-SF datasets for constructing SPAD estimation models. The accuracy among the three datasets composed after features selection varies, and their performance differs in different growth stages and feature types. For example, during the heading stage, the R-TF dataset has lower accuracy (Validation: R2 = 0.800, RMSE=3.732, RPD=2.080) compared to the C-TF and CT-TF datasets, while during the flowering stage, the R-TF dataset has higher accuracy (Validation: R2 = 0.799, RMSE=2.941, RPD=2.181) than the C-TF and CT-TF datasets. The accuracy of the C- and CT- datasets selected by the Boruta algorithm is not consistently higher or lower.
3.3.2 Estimation of SPAD during the late growth stage of winter wheat using features fusion strategy
SPAD estimation models were established for three growth stages based on spectral and texture features extracted from UAV multispectral images using a features fusion strategy, followed by feature selection. The original feature set was included in model construction at each stage, and a total of 12 (4*3) SPAD regression prediction models were constructed using four feature sets, one feature type (spectral and texture feature fusion), and three growth stages.
As shown in Figure 7, under this feature strategy, the SPAD estimation model for winter wheat canopy at the heading stage (R-SFTF) constructed using the RFE features selection method combined with the SFTF dataset achieved the highest level of accuracy. The performance indicators of the model were as follows: Validation: R2 = 0.861, RMSE=3.604, RPD=2.154. The SPAD estimation model for winter wheat at the flowering stage (C-SFTF) constructed using the Boruta features selection method combined with the SFTF dataset had the best accuracy, with specific performance indicators of Validation: R2 = 0.740, RMSE=3.256, RPD=1.971. The SPAD estimation model for winter wheat at the late filling stage (R-SFTF) constructed using the RFE features selection method combined with the SFTF dataset had the best performance, with Validation: R2 = 0.761, RMSE=6.250, RPD=1.983.
On the other hand, the features fusion strategy was executed to establish SPAD estimation models for the heading stage, flowering stage, and late filling stage. For each stage, 27 SPAD prediction models (9*3) were constructed using 9 (3*3) feature sets, 1 feature type (features fusion set), and 3 growth stages. Additionally, 30 regression models were created by adding the unfiltered SFTF feature set as a comparison at each stage (Figure 8).
Figure 8 The SVR regression models constructed using the fusion dataset of selected spectral and texture features. C-, CT- and R- represent the feature set selected by the Boruta algorithm “Confirmed”, Boruta algorithm “Confirmed+Tentative” and RFE algorithm. SF represents the spectral features, TF represents the texture features. A represents the first five fusion treatments, which include SFTF, C-SF--C-TF, C-SF--CT-TF, C-SF--R-TF, and CT-SF--C-TF. B encompasses the latter five fusion treatments, consisting of CT-SF--CT-TF, CT-SF--R-TF, R-SF--C-TF, R-SF--CT-TF, and R-SF--R-TF.
As shown in Figure 8, under this strategy, during the heading stage, the R-SF, C-SF, and CT-SF feature sets were the same. Therefore, there were three optimal estimation models that fused the R-TF data with these feature sets: R-SF–R-TF, C-SF–R-TF, and CT-SF–R-TF. The performance indicators of these models are as follows: Validation: R2 = 0.857, RMSE=3.134, RPD=2.477. During the flowering stage, the best accuracy for canopy SPAD estimation in winter wheat was achieved by the fusion model (R-SF–R-TF) constructed using the RFE features selection method combined with the SF and TF datasets. The specific performance indicators are as follows: Validation: R2 = 0.807, RMSE=2.850, RPD=2.251. For the late filling stage, the best performance was observed in the fusion model (C-SF–CT-TF) constructed by combining the SF and TF datasets using the Boruta features selection method. The performance indicators for this model are as follows: Validation: R2 = 0.809, RMSE=5.878, RPD=2.108.
To further analyze the accuracy of the SPAD estimation models, From the Figure 9, it can be observed that the optimal monitoring models for the later growth stages of winter wheat were all SPAD estimation models constructed using a dual-strategy. At each stage, the data points are clustered closely around the 1:1 line, demonstrating strong agreement between the model’s predicted values and the field-measured values, with minimal errors. These results indicate the models’ capability to accurately estimate the SPAD in winter wheat.
4 Discussion
4.1 The main spectral and texture features that affect the SPAD during the later growth stages of winter wheat
Wheat exhibits significant SPAD variation during the heading, flowering, and late filling stages, with a close relationship between SPAD and nitrogen treatment. Specifically, wheat SPAD exhibits a gradual upward trend with increased field nitrogen gradient treatments. This is because nitrogen is one of the essential nutrients for plant growth and development (Wang W et al., 2021) and a crucial component of chloroplasts in leaves. SPAD, as a measure of relative chlorophyll content in leaves, is inevitably closely related to nitrogen. During the heading and flowering stages, wheat plants absorb nitrogen from the soil through their roots. This nitrogen is then translocated to various parts of the plant, including the leaves (El-Hendawy et al., 2017). In contrast to the heading and flowering stages, SPAD in wheat drops sharply during the late filling stage. This may be due to nitrogen translocation within the plant at this time. During the filling stage, nitrogen from the leaves gradually moves into the grains, a process accompanied by a decline in photosynthetic capacity and senescence of the leaves (Bowman et al., 2015). Leaves begin to turn yellow, and the chlorophyll content decreases sharply, leading to lower SPAD values.
Spectral features have been verified to be closely associated with crop agronomic traits and are considered important indicators for estimating crop phenotypic information (Duan et al., 2019). It is well known that the sensitivity of spectral data to vegetation decreases during the later growth stages of crops (Zhu et al., 2023). This can be attributed to the canopy closure phenomenon and the complex spectral mixing and canopy structure effects that affect the sensitivity of spectral data to vegetation (Li R et al., 2022). Additionally, the canopy information generated by the spike emergence further exacerbates this effect (Wang et al., 2022). In this study, it was found that the NIR spectral bands combined with other spectral bands during the later growth stages of crops were the most frequently selected spectral features after optimal treatment (Figures 5, 10). This finding is consistent with the conclusion reached by Yin et al. (2023), that the sensitive spectral features for estimating SPAD in winter wheat are combinations of near-infrared and other bands using three features selection methods. This may be related to the sensitivity of the NIR bands to the growth status of crops. As the growth stage progresses, nitrogen in winter wheat leaves gradually transfers to the spike organs, the leaves wilt, and the appearance of wheat plants changes, leading to a sharp change in reflectance in the near-infrared bands. Zheng et al. (2018) found that the near-infrared spectral information was most important when estimating leaf nitrogen content (LNC) using mean squared error (MSE) to evaluate variable importance. LNC is strongly correlated with SPAD, and the results of this study are similar to their conclusion, indicating the great potential of near-infrared spectral data for estimating SPAD during the later growth stages of winter wheat.
Figure 10 The importance of spectral features selected by different features selection methods. The Boruta algorithm displays the top 8 spectral features in terms of importance, while the RFE algorithm retains fewer features and only shows the top 4. The * symbol indicates that the spectral features involves the NIR band.
In this study, the texture features of the red and near-infrared bands maintained a high sensitivity to SPAD information during the reproductive growth stage of wheat (Figures 5, 11). This is consistent with the conclusion reached by Bai et al. (2022), that the texture features of the red band can better predict soybean yield compared to the blue and green bands, and with the conclusion reached by Lu et al. (2021), that extracted texture features from the near-infrared band obtained by multispectral sensors are more effective in predicting potassium content in rice plants than other bands. This may be because the texture features of the red and near-infrared bands can better describe the spatial distribution changes of the crop canopy, providing valuable information for estimating crop phenotypes. It also demonstrates the feasibility of using UAV equipped with RGB sensors (including the red band) or multispectral sensors (including the red and near-infrared bands) to monitor crop phenotypic information using texture features. Furthermore, previous studies that used texture features to construct crop phenotyping models mostly considered the relationship between the mean (MEAN) of texture feature indices and crop phenotypes (Fu et al., 2021; Liu et al., 2022a; Luo et al., 2022; Zhou et al., 2022), while rarely exploring other statistical indices. In this study, further analysis of statistical indices of texture features (Figures 11, 12) extracted four different statistical indices based on GLCM features for each sample area: MEAN, SD, MAX, and MIN. The commonly used MEAN feature consistently maintained a leading position among the four statistical indices (Figure 12), but its importance gradually decreased as the growth stage progressed (Figure 11). The changing trend of importance for other indices was opposite to MEAN. From the heading stage to the late filling stage, their importance gradually increased and reached a peak in the late filling stage, with SD showing the best trend. This indicates that other statistical indices also have the potential to estimate SPAD in wheat, especially SD. This is consistent with the conclusion reached by Bai et al. (2022), that using texture features incorporating SD information achieved good accuracy and precision in predicting early soybean yield. The results of this study indicate that the SD information of texture features also has the potential to estimate crop SPAD.
Figure 11 The importance of texture features selected by different features selection methods. The Boruta and RFE algorithms display the top 8 texture features in terms of importance. The * symbol indicates that the texture features belongs to the red and NIR bands.
Figure 12 The statistical indices of the retained texture features at the three growth stages of wheat.
4.2 The impact of features selection strategy on SPAD estimation
Features selection methods play a significant role in machine learning and can directly impact the predictive model (Kursa and Rudnicki, 2010). When estimating crop phenotypes using remote sensing features, selecting appropriate remote sensing features through features selection strategy can improve model performance and eliminate interference (Lin et al., 2021; Xu et al., 2023). In this study, two features selection methods, Boruta and RFE, were used to select optimal spectral and texture features and build models. Additionally, a comparison of model accuracy before and after features selection was conducted to further analyze the mechanism of features selection methods and select suitable methods for crop phenotype studies. From Figure 13, it can be observed that the accuracy of the SPAD estimation model constructed using the subset of variables after features selection is generally better than that of the original variable set. However, there are also some exceptions, as the Boruta method shows a slight decrease in model accuracy (Figure 13, Late filling, TF and C-TF). This may be related to the principle of the Boruta method. Boruta cannot identify and explore features with low individual value but high combined value, leading to the removal of low-value features and reducing or eliminating the high value generated by feature combinations, thus resulting in decreased model accuracy. Furthermore, Boruta focuses on selecting all features related to SPAD (Kursa and Rudnicki, 2010), overlooking the relationship between features, which may lead to multicollinearity among features and subsequently reduce model performance.
Figure 13 The comparison of R2 values for the validation accuracy of winter wheat SPAD estimation models constructed using features selection methods to select suitable remote sensing features at three growth stages.
Therefore, features selection techniques should not only consider the relationship between features and SPAD but also take into account the relationship between features. RFE does this well by continuously combining subsets of variables to build models in order to find the optimal variable set with the minimum RMSE. For example, Wang J et al. (2021) used two features selection methods, RF and r, to construct RF and SVR models for estimating winter wheat SPAD, and compared them with the RFE features selection method. They found that the RFE features selection method combined with RF and SVR models had higher predictive accuracy. Lin et al. (2021) compared three features selection methods, Boruta, Sequential Forward Selection (SFS), and Permutation Importance-Recursive Feature Elimination (PI-RFE), and found that the PI-RFE method had the best dimensionality reduction effect on multidimensional feature sets, greatly improving the accuracy of FVC inversion. In this study, whether it was at the heading stage or flowering stage, the RFE features selection method combined with the SVR algorithm provided the best SPAD prediction model accuracy. In the late filling stage, the RFE-SVR combination still showed high SPAD estimation ability (Figures 7, 8). Overall, compared with the Boruta method, the RFE combined with the SVR algorithm can achieve higher model accuracy and stronger model performance. RFE takes into account the interrelationships between features and between features and SPAD, thereby improving the performance of prediction model.
4.3 The impact of features fusion strategy on SPAD estimation
Features fusion strategy involve merging different types of features into a more optimal feature set, combining features with different features to construct models, and have the potential to mitigate the impact of spectral saturation effects, enhance model performance and increase the interpretability of prediction models (Zhou et al., 2022; Sun et al., 2023). This explains why the optimal SPAD estimation models in the late growth stage of winter wheat are constructed using features fusion strategy. This study comprehensively reports the essence of the impact of features fusion strategy on model performance through two features fusion approaches. When only implementing features fusion strategy (Table 4), it was found that not all feature fusions lead to improved accuracy. Blindly combining features together to build models may have a counterproductive effect, causing data redundancy and reducing model performance (Son et al., 2015; Li et al., 2020). This conclusion is in line with the discoveries of Zhou L et al. (2023), who observed a decrease in model accuracy when agronomic practice information (API) was fused with spectral and texture features to construct a rice yield estimation model. The essence of the impact of features fusion strategy on model accuracy lies in the contribution of multi-characteristic features to the model and the interactions between features. This also explains why there are differences in predictive model results when different variables are fused (Figure 8). Similar to the findings of Liu J et al. (2022), who estimated rice nitrogen use efficiency (NUE) using different numbers of spectral features and concluded that increasing the number of variables does not necessarily improve model accuracy, this study also discovered that there is no essential correlation between model accuracy and the quantity of features (Table 4; Figures 4, 6–8, 12, 13). Adding variables does not necessarily make the model more robust, and features selection strategy are powerful means to address these issues.
After combining features fusion strategy with features selection strategy, the stability, accuracy, and robustness of the SPAD estimation models for winter wheat are at the forefront (Figure 9). As the growth stage progresses, the improvement in model accuracy gradually increases. The highest increase in model accuracy is observed in the late filling stage, where compared to models using only initial spectral or texture features, R2Val increased by 0.092 to 0.202, RMSEVal decreased by 0.076 to 4.916, and RPDVal increased by 0.237 to 0.960. Sun et al. (2023) developed maize LAI estimation model using SVM, RF, BPNN, and PLSR algorithms by selecting five spectral features and three texture indices through correlation analysis. The results showed that the SVM algorithm with multi-variable fusion achieved the highest accuracy. Compared to SVM models using only spectral or texture features as input variables, R2Val increased by 0.023 to 0.192, RMSEVal decreased by 0.015 to 0.036, and RPDVal increased by 0.074 to 0.746. Ma et al. (2022) used four features selection methods, including correlation coefficient, MIC, RF, and RFE, to select three spectral features and three texture features to construct a cotton yield estimation model. The results showed that the RFE_ELM model based on the fusion of spectral and texture features achieved the highest accuracy, with an increase in R2Val of 0.073 to 0.187 compared to the non-fusion model. The findings of this study surpass those of prior research, indicating that the combination of features fusion strategy and features selection strategy is an effective method to mitigate the impact of spectral saturation effects and improve the accuracy of crop phenotype estimation, fully tapping into the potential of multi-characteristic features in estimating crop phenotypes.
4.4 Limitations and future research perspectives
This study conducted field experiments on winter wheat using three locally stable and high-yielding varieties (Huaimai 44, Yannong 999, and Ningmai 13) and four nitrogen fertilizer application levels (N0-N3: 0, 100, 200, 300 kg/ha). In the future, we will systematically supplement the dataset and conduct field experiments on more varieties and nitrogen fertilizer treatments. At the same time, because of the swift advancement in sensor technology, there are still limitations in building SPAD prediction models relying solely on single sensor data. In the future, we plan to incorporate RGB and hyperspectral data into the SPAD prediction model. The machine learning algorithms employed in this study are relatively single, and in the future, we will introduce multiple algorithms for comparison and select the most suitable algorithm for SPAD estimation. Furthermore, the use of multi-source remote sensing information and background effect removal can also improve the accuracy of crop phenotype estimation based on UAV. In the future, we will combine them to further improve the accuracy of SPAD estimation for winter wheat. Although the combination of features fusion strategy and features selection strategy performs well in SPAD estimation during the later growth stage of wheat, the results of this study only come from small-scale experiments. In the future, we will conduct large-scale integrated winter wheat SPAD remote sensing monitoring research using satellite and UAV platforms.
5 Conclusion
This study demonstrates that spectral and texture features have good potential for monitoring the SPAD status of winter wheat in the reproductive stage. Spectral features in the near-infrared band can fully capture the spectral differences of wheat SPAD in the reproductive growth stage, while texture features in the red and near-infrared bands are more sensitive to wheat SPAD. Among the SPAD estimation models under different strategy, the SVR model combining features selection and features fusion strategy had the highest accuracy, and compared with the SVR model that only used initial spectral or texture features as input, the stability of the model was improved. As the growth stage progresses, the enhancement of model accuracy by this method becomes more significant, with the greatest improvement observed during the late filling stage. R2Val increased from 0.092 to 0.202, RMSEVal decreased from 0.076 to 4.916, and RPDVal increased from 0.237 to 0.960. This study shows that UAV remote sensing technology combined with features selection and fusion strategy has good prospects for monitoring winter wheat SPAD in the reproductive growth stage, and can accomplish precise monitoring of winter wheat growth under different varieties and nitrogen treatments, providing scientific guidance and theoretical support for fine management of field crops nutrition.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.
Author contributions
XS: Conceptualization, Data curation, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing, Software. YN: Data curation, Formal analysis, Writing – review & editing. HS: Writing – review & editing. AH: Data curation, Methodology, Software, Writing – review & editing. HY: Data curation, Formal analysis, Writing – review & editing. YZ: Conceptualization, Data curation, Methodology, Writing – review & editing. JLi: Data curation, Methodology, Software, Writing – review & editing. WW: Data curation, Methodology, Writing – review & editing. HW: Supervision, Writing – review & editing. QM: Writing – review & editing, Formal analysis, Methodology. JLiu: Methodology, Writing – review & editing, Funding acquisition, Software, Writing – original draft. XL: Methodology, Software, Writing – original draft, Writing – review & editing, Funding acquisition, Supervision, Validation. YA: Supervision, Validation, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was funded by Scientific research projects in higher education institutions of Anhui Province (no. 2023AH051855; 2022AH051623); Anhui Province Crop Intelligent Planting and Processing Technology Engineering Research Center Open Research Project (no. ZHKF03); Natural Science Foundation of Hebei Province (no. C2023408010); Scientific research projects in higher education institutions of Hebei Province (no. QN2024158).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Asseng, S., Foster, I., Turner, N. C. (2011). The impact of temperature variability on wheat yields. Glob. Change Biol. 17, 997–1012. doi: 10.1111/j.1365-2486.2010.02262.x
Bai, D., Li, D., Zhao, C., Wang, Z., Shao, M., Guo, B., et al. (2022). Estimation of soybean yield parameters under lodging conditions using rgb information from unmanned aerial vehicles. Front. Plant Sci. 13, 1012293. doi: 10.3389/fpls.2022.1012293
Bannari, A., Morin, D., Bonn, F., Huete, A. (1995). A review of vegetation indices. Remote Sens. Rev. 13, 95–120. doi: 10.1080/02757259509532298
Bharati, M. H., Liu, J. J., MacGregor, J. F. (2004). Image texture analysis: methods and comparisons. Chemometrics Intell. Lab. Syst. 72, 57–71. doi: 10.1016/j.chemolab.2004.02.005
Bowman, B. C., Chen, J., Zhang, J., Wheeler, J., Wang, Y., Zhao, W., et al. (2015). Evaluating grain yield in spring wheat with canopy spectral reflectance. Crop Sci. 55, 1881–1890. doi: 10.2135/cropsci2014.08.0533
Bruning, B., Liu, H., Brien, C., Berger, B., Lewis, M., Garnett, T. (2019). The development of hyperspectral distribution maps to predict the content and distribution of nitrogen and water in wheat (Triticum aestivum). Front. Plant Sci. 10. doi: 10.3389/fpls.2019.01380
Chen, L., Xing, M., He, B., Wang, J., Shang, J., Huang, X., et al. (2021). Estimating soil moisture over winter wheat fields during growing season using machine-learning methods. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 14, 3706–3718. doi: 10.1109/JSTARS.2021.3067890
Datt, B. (1999). A new reflectance index for remote sensing of chlorophyll content in higher plants: tests using Eucalyptus leaves. J. Plant Physiol. 154, 30–36. doi: 10.1016/S0176-1617(99)80314-9
Datt, B., McVicar, T. R., Van Niel, T. G., Jupp, D. L., Pearlman, J. S. (2003). Preprocessing EO-1 Hyperion hyperspectral data to support the application of agricultural indexes. IEEE Trans. Geosci. Remote Sens. 41, 1246–1259. doi: 10.1109/TGRS.2003.813206
de Castro, A. I., Jurado-Expósito, M., Peña-Barragán, J. M., López-Granados, F. (2012). Airborne multi-spectral imagery for mapping cruciferous weeds in cereal and legume crops. Precis. Agric. 13, 302–321. doi: 10.1007/s11119-011-9247-0
Di Gennaro, S. F., Toscano, P., Gatti, M., Poni, S., Berton, A., Matese, A. (2022). Spectral comparison of UAV-Based hyper and multispectral cameras for precision viticulture. Remote Sens. 14, 449. doi: 10.3390/rs14030449
Duan, B., Liu, Y., Gong, Y., Peng, Y., Wu, X., Zhu, R., et al. (2019). Remote estimation of rice LAI based on Fourier spectrum texture from UAV image. Plant Methods 15, 1–12. doi: 10.1186/s13007-019-0507-8
El-Hendawy, S. E., Hassan, W. M., Al-Suhaibani, N. A., Schmidhalter, U. (2017). Spectral assessment of drought tolerance indices and grain yield in advanced spring wheat lines grown under full and limited water irrigation. Agric. Water Manage. 182, 1–12. doi: 10.1016/j.agwat.2016.12.003
Fan, Y., Feng, H., Yue, J., Liu, Y., Jin, X., Xu, X., et al. (2023). Comparison of different dimensional spectral indices for estimating nitrogen content of potato plants over multiple growth periods. Remote Sens. 15, 602. doi: 10.3390/rs15030602
Fan, K., Li, F., Chen, X., Li, Z., Mulla, D. J. (2022). Nitrogen balance index prediction of winter wheat by canopy hyperspectral transformation and machine learning. Remote Sens. 14, 3504. doi: 10.3390/rs14143504
Feng, H., Tao, H., Li, Z., Yang, G., Zhao, C. (2022). Comparison of UAV RGB imagery and hyperspectral remote-sensing data for monitoring winter wheat growth. Remote Sens. 14, 3811. doi: 10.3390/rs14153811
Fu, Y., Yang, G., Song, X., Li, Z., Xu, X., Feng, H., et al. (2021). Improved estimation of winter wheat aboveground biomass using multiscale textures extracted from UAV-based digital images and hyperspectral feature analysis. Remote Sens. 13, 581. doi: 10.3390/rs13040581
Gitelson, A. A., Merzlyak, M. N. (1996). Signature analysis of leaf reflectance spectra: algorithm development for remote sensing of chlorophyll. J. Plant Physiol. 148, 494–500. doi: 10.1016/S0176-1617(96)80284-7
Gitelson, A. A., Merzlyak, M. N., Chivkunova, O. B. (2001). Optical properties and nondestructive estimation of anthocyanin content in plant leaves. Photochem. Photobiol. 74, 38–45. doi: 10.1562/0031-8655(2001)074<0038:OPANEO>2.0.CO;2
Gitelson, A. A., Viña, A., Arkebauer, T. J., Rundquist, D. C., Keydan, G., Leavitt, B. (2003). Remote estimation of leaf area index and green leaf biomass in maize canopies. Geophys. Res. Lett. 30, 1248. doi: 10.1029/2002GL016450
Gitelson, A. A., Viña, A., Ciganda, V., Rundquist, D. C., Arkebauer, T. J. (2005). Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 32, L08403. doi: 10.1029/2005GL022688
Guo, W., Carroll, M. E., Singh, A., Swetnam, T. L., Merchant, N., Sarkar, S., et al. (2021). UAS-based plant phenotyping for research and breeding applications. Plant Phenomics 2021. doi: 10.34133/2021/9840192
Han, Y., Tang, R., Liao, Z., Zhai, B., Fan, J. (2022). A novel hybrid GOA-XGB model for estimating wheat aboveground biomass using UAV-based multispectral vegetation indices. Remote Sens. 14, 3506. doi: 10.3390/rs14143506
Hassan, M. A., Yang, M., Rasheed, A., Jin, X., Xia, X., Xiao, Y., et al. (2018). Time-series multispectral indices from unmanned aerial vehicle imagery reveal senescence rate in bread wheat. Remote Sens. 10, 809. doi: 10.3390/rs10060809
Hunt, E. R., Cavigelli, M., Daughtry, C. S., Mcmurtrey, J. E., Walthall, C. L. (2005). Evaluation of digital photography from model aircraft for remote sensing of crop biomass and nitrogen status. Precis. Agric. 6, 359–378. doi: 10.1007/s11119-005-2324-5
Hunt, E. R., Jr., Daughtry, C., Eitel, J. U., Long, D. S. (2011). Remote sensing leaf chlorophyll content using a visible band index. Agron. J. 103, 1090–1099. doi: 10.2134/agronj2010.0395
Impollonia, G., Croci, M., Blandinières, H., Marcone, A., Amaducci, S. (2022). Comparison of PROSAIL model inversion methods for estimating leaf chlorophyll content and LAI using UAV imagery for hemp phenotyping. Remote Sens. 14, 5801. doi: 10.3390/rs14225801
Jay, S., Baret, F., Dutartre, D., Malatesta, G., Héno, S., Comar, A., et al. (2019). Exploiting the centimeter resolution of UAV multispectral imagery to improve remote-sensing estimates of canopy structure and biochemistry in sugar beet crops. Remote Sens. Environ. 231, 110898. doi: 10.1016/j.rse.2018.09.011
Jay, S., Gorretta, N., Morel, J., Maupas, F., Bendoula, R., Rabatel, G., et al. (2017). Estimating leaf chlorophyll content in sugar beet canopies using millimeter-to centimeter-scale reflectance imagery. Remote Sens. Environ. 198, 173–186. doi: 10.1016/j.rse.2017.06.008
Jiang, J., Johansen, K., Stanschewski, C. S., Wellman, G., Mousa, M. A., Fiene, G. M., et al. (2022). Phenotyping a diversity panel of quinoa using UAV-retrieved leaf area index, SPAD-based chlorophyll and a random forest approach. Precis. Agric. 23, 961–983. doi: 10.1007/s11119-021-09870-3
Jimenez-Berni, J. A., Deery, D. M., Rozas-Larraondo, P., Condon, A. T. G., Rebetzke, G. J., James, R. A., et al. (2018). High throughput determination of plant height, ground cover, and above-ground biomass in wheat with LiDAR. Front. Plant Sci. 9, 237. doi: 10.3389/fpls.2018.00237
Jin, X., Yang, G., Xu, X., Yang, H., Feng, H., Li, Z., et al. (2015). Combined multi-temporal optical and radar parameters for estimating LAI and biomass in winter wheat using HJ and RADARSAR-2 data. Remote Sens. 7, 13251–13272. doi: 10.3390/rs71013251
Jurgens, C. (1997). The modified normalized difference vegetation index (mNDVI) a new index to determine frost damages in agriculture based on Landsat TM data. Int. J. Remote Sens. 18, 3583–3594. doi: 10.1080/014311697216810
Kuhn, M. (2008). Building predictive models in R using the caret package. J. Stat. Software 28, 1–26. doi: 10.18637/jss.v028.i05
Kumar, S., Gautam, G., Saha, S. K. (2015). Hyperspectral remote sensing data derived spectral indices in characterizing salt-affected soils: a case study of Indo-Gangetic plains of India. Environ. Earth Sci. 73, 3299–3308. doi: 10.1007/s12665-014-3613-y
Kursa, M. B., Rudnicki, W. R. (2010). Feature selection with the Boruta package. J. Stat. Software 36, 1–13. doi: 10.18637/jss.v036.i11
Le Maire, G., François, C., Soudani, K., Berveiller, D., Pontailler, J., Bréda, N., et al. (2008). Calibration and validation of hyperspectral indices for the estimation of broadleaved forest leaf chlorophyll content, leaf mass per area, leaf area index and leaf canopy biomass. Remote Sens. Environ. 112, 3846–3864. doi: 10.1016/j.rse.2008.06.005
Li, Z., Chen, Z., Cheng, Q., Duan, F., Sui, R., Huang, X., et al. (2022). UAV-based hyperspectral and ensemble machine learning for predicting yield in winter wheat. Agronomy 12, 202. doi: 10.3390/agronomy12010202
Li, R., Wang, D., Zhu, B., Liu, T., Sun, C., Zhang, Z. (2022). Estimation of nitrogen content in wheat using indices derived from RGB and thermal infrared imaging. Field Crop Res. 289, 108735. doi: 10.1016/j.fcr.2022.108735
Li, B., Xu, X., Zhang, L., Han, J., Bian, C., Li, G., et al. (2020). Above-ground biomass estimation and yield prediction in potato by using UAV-based RGB and hyperspectral imaging. Isprs-J. Photogramm. Remote Sens. 162, 161–172. doi: 10.1016/j.isprsjprs.2020.02.013
Lichtenthaler, H. K., Lang, M., Sowinska, M., Heisel, F., Miehe, J. A. (1996). Detection of vegetation stress via a new high resolution fluorescence imaging system. J. Plant Physiol. 148, 599–612. doi: 10.1016/S0176-1617(96)80081-2
Lin, X., Chen, J., Lou, P., Yi, S., Qin, Y., You, H., et al. (2021). Improving the estimation of alpine grassland fractional vegetation cover using optimized algorithms and multi-dimensional features. Plant Methods 17, 1–18. doi: 10.1186/s13007-021-00796-5
Liu, Y., Feng, H., Yue, J., Fan, Y., Bian, M., Ma, Y., et al. (2023). Estimating potato above-ground biomass by using integrated unmanned aerial system-based optical, structural, and textural canopy measurements. Comput. Electron. Agric. 213, 108229. doi: 10.1016/j.compag.2023.108229
Liu, Y., Feng, H., Yue, J., Jin, X., Li, Z., Yang, G. (2022a). Estimation of potato above-ground biomass based on unmanned aerial vehicle red-green-blue images with different texture features and crop height. Front. Plant Sci. 13, 938216. doi: 10.3389/fpls.2022.938216
Liu, Y., Feng, H., Yue, J., Li, Z., Yang, G., Song, X., et al. (2022b). Remote-sensing estimation of potato above-ground biomass based on spectral and spatial features extracted from high-definition digital camera images. Comput. Electron. Agric. 198, 107089. doi: 10.1016/j.compag.2022.107089
Liu, Y., Su, L., Wang, Q., Zhang, J., Shan, Y., Deng, M. (2020). Comprehensive and quantitative analysis of growth characteristics of winter wheat in China based on growing degree days. Adv. Agron. 159, 237–273. doi: 10.1016/bs.agron.2019.07.007
Liu, J., Zhu, Y., Tao, X., Chen, X., Li, X. (2022). Rapid prediction of winter wheat yield and nitrogen use efficiency using consumer-grade unmanned aerial vehicles multispectral imagery. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.1032170
Lu, J., Eitel, J. U., Engels, M., Zhu, J., Ma, Y., Liao, F., et al. (2021). Improving Unmanned Aerial Vehicle (UAV) remote sensing of rice plant potassium accumulation by fusing spectral and textural information. Int. J. Appl. Earth Obs. Geoinf. 104, 102592. doi: 10.1016/j.jag.2021.102592
Luo, S., Jiang, X., He, Y., Li, J., Jiao, W., Zhang, S., et al. (2022). Multi-dimensional variables and feature parameter selection for aboveground biomass estimation of potato based on UAV multispectral imagery. Front. Plant Sci. 13, 948249. doi: 10.3389/fpls.2022.948249
Ma, Y., Ma, L., Zhang, Q., Huang, C., Yi, X., Chen, X., et al. (2022). Cotton yield estimation based on vegetation indices and texture features derived from RGB image. Front. Plant Sci. 13, 925986. doi: 10.3389/fpls.2022.925986
Madec, S., Baret, F., De Solan, B., Thomas, S., Dutartre, D., Jezequel, S., et al. (2017). High-throughput phenotyping of plant height: comparing unmanned aerial vehicles and ground LiDAR estimates. Front. Plant Sci. 8, 2002. doi: 10.3389/fpls.2017.02002
Mao, W., Wang, Y., Wang, Y. (2003). “Real-time detection of between-row weeds using machine vision,” in 2003 ASAE Annual Meeting:American Society of Agricultural and Biological Engineers, Vol. 1. doi: 10.13031/2013.15381
Mutanga, O., Masenyama, A., Sibanda, M. (2023). Spectral saturation in the remote sensing of high-density vegetation traits: A systematic review of progress, challenges, and prospects. Isprs-J. Photogramm. Remote Sens. 198, 297–309. doi: 10.1016/j.isprsjprs.2023.03.010
Mutanga, O., Skidmore, A. K. (2004). Narrow band vegetation indices overcome the saturation problem in biomass estimation. Int. J. Remote Sens. 25, 3999–4014. doi: 10.1080/01431160310001654923
Netto, A. T., Campostrini, E., de Oliveira, J. G., Bressan-Smith, R. E. (2005). Photosynthetic pigments, nitrogen, chlorophyll a fluorescence and SPAD-502 readings in coffee leaves. Sci. Hortic. 104, 199–209. doi: 10.1016/j.scienta.2004.08.013
Oktay, T., Coban, S. (2017). Simultaneous longitudinal and lateral flight control systems design for both passive and active morphing TUAVs. Elektron. Elektrotech 23, 15–20. doi: 10.5755/j01.eie.23.5.19238
Peña Barragán, J. M., López Granados, F., Jurado Expósito, M., García Torres, L. (2007). Mapping Ridolfia segetum patches in sunflower crop using remote sensing. Weed Res. 47, 164–172. doi: 10.1111/j.1365-3180.2007.00553.x
Peñuelas, J., Gamon, J. A., Fredeen, A. L., Merino, J., Field, C. B. (1994). Reflectance indices associated with physiological changes in nitrogen-and water-limited sunflower leaves. Remote Sens. Environ. 48, 135–146. doi: 10.1016/0034-4257(94)90136-8
Poblete, T., Camino, C., Beck, P., Hornero, A., Kattenborn, T., Saponari, M., et al. (2020). Detection of Xylella fastidiosa infection symptoms with airborne multispectral and thermal imagery: Assessing bandset reduction performance from hyperspectral analysis. Isprs-J. Photogramm. Remote Sens. 162, 27–40. doi: 10.1016/j.isprsjprs.2020.02.010
Pu, R., Cheng, J. (2015). Mapping forest leaf area index using reflectance and textural information derived from WorldView-2 imagery in a mixed natural forest area in Florida, US. Int. J. Appl. Earth Obs. Geoinf. 42, 11–23. doi: 10.1016/j.jag.2015.05.004
Qiao, L., Gao, D., Zhao, R., Tang, W., An, L., Li, M., et al. (2022). Improving estimation of LAI dynamic by fusion of morphological and vegetation indices based on UAV imagery. Comput. Electron. Agric. 192, 106603. doi: 10.1016/j.compag.2021.106603
Rasel, S. M., Chang, H., Ralph, T. J., Saintilan, N., Diti, I. J. (2021). Application of feature selection methods and machine learning algorithms for saltmarsh biomass estimation using Worldview-2 imagery. Geocarto Int. 36, 1075–1099. doi: 10.1080/10106049.2019.1624988
Ren, S., Chen, X., An, S. (2017). Assessing plant senescence reflectance index-retrieved vegetation phenology and its spatiotemporal response to climate change in the Inner Mongolian Grassland. Int. J. Biometeorol. 61, 601–612. doi: 10.1007/s00484-016-1236-6
Richardson, A. J., Everitt, J. H. (1992). Using spectral vegetation indices to estimate rangeland productivity. Geocarto Int. 7, 63–69. doi: 10.1080/10106049209354353
Rigon, J., Capuani, S., Fernandes, D. M., Guimarães, T. M. (2016). A novel method for the estimation of soybean chlorophyll content using a smartphone and image analysis. Photosynthetica 54, 559–566. doi: 10.1007/s11099-016-0214-x
Şahin, H., Kose, O., Oktay, T. (2022). Simultaneous autonomous system and powerplant design for morphing quadrotors. Aircr. Eng. Aerosp. Technol. 94, 1228–1241. doi: 10.1108/AEAT-06-2021-0180
Sampson, P. H., Zarco-Tejada, P. J., Mohammed, G. H., Miller, J. R., Noland, T. L. (2003). Hyperspectral remote sensing of forest condition: Estimating chlorophyll content in tolerant hardwoods. For. Sci. 49, 381–391. doi: 10.1093/forestscience/49.3.381
Sellaro, R., Crepy, M., Trupkin, S. A., Karayekov, E., Buchovsky, A. S., Rossi, C., et al. (2010). Cryptochrome as a sensor of the blue/green ratio of natural radiation in Arabidopsis. Plant Physiol. 154, 401–409. doi: 10.1104/pp.110.160820
Shao, G., Han, W., Zhang, H., Wang, Y., Zhang, L., Niu, Y., et al. (2022). Estimation of transpiration coefficient and aboveground biomass in maize using time-series UAV multispectral imagery. Crop J. 10, 1376–1385. doi: 10.1016/j.cj.2022.08.001
Shu, M., Bai, K., Meng, L., Yang, X., Li, B., Ma, Y. (2023). Assessing maize lodging severity using multitemporal UAV-based digital images. Eur. J. Agron. 144, 126754. doi: 10.1016/j.eja.2023.126754
Shu, M., Zuo, J., Shen, M., Yin, P., Wang, M., Yang, X., et al. (2021). Improving the estimation accuracy of SPAD values for maize leaves by removing UAV hyperspectral image backgrounds. Int. J. Remote Sens. 42, 5862–5881. doi: 10.1080/01431161.2021.1931539
Smola, A. J., Schölkopf, B. (2004). A tutorial on support vector regression. Stat. Comput. 14, 199–222. doi: 10.1023/B:STCO.0000035301.49549.88
Son, H., Kim, C., Kim, C., Kang, Y. (2015). Prediction of government-owned building energy consumption based on an RReliefF and support vector machine model. J. Civ. Eng. Manage. 21, 748–760. doi: 10.3846/13923730.2014.893908
Sun, X., Yang, Z., Su, P., Wei, K., Wang, Z., Yang, C., et al. (2023). Non-destructive monitoring of maize LAI by fusing UAV spectral and textural features. Front. Plant Sci. 14, 1158837. doi: 10.3389/fpls.2023.1158837
Tatsumi, K., Igarashi, N., Mengxue, X. (2021). Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery. Plant Methods 17, 1–17. doi: 10.1186/s13007-021-00761-2
Uddling, J., Gelang-Alfredsson, J., Piikki, K., Pleijel, H. (2007). Evaluating the relationship between leaf chlorophyll concentration and SPAD-502 chlorophyll meter readings. Photosynth. Res. 91, 37–46. doi: 10.1007/s11120-006-9077-5
Vélez, S., Vacas, R., Martín, H., Ruano-Rosa, D., Álvarez, S. (2022). A novel technique using planar area and ground shadows calculated from UAV RGB imagery to estimate pistachio tree (Pistacia vera L.) canopy volume. Remote Sens. 14, 6006. doi: 10.3390/rs14236006
Wang, W., Wu, Y., Zhang, Q., Zheng, H., Yao, X., Zhu, Y., et al. (2021). AAVI: A novel approach to estimating leaf nitrogen concentration in rice from unmanned aerial vehicle multispectral imagery at early and middle growth stages. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 14, 6716–6728. doi: 10.1109/JSTARS.2021.3086580
Wang, W., Zheng, H., Wu, Y., Yao, X., Zhu, Y., Cao, W., et al. (2022). An assessment of background removal approaches for improved estimation of rice leaf nitrogen concentration with unmanned aerial vehicle multispectral imagery at various observation times. Field Crop Res. 283, 108543. doi: 10.1016/j.fcr.2022.108543
Wang, J., Zhou, Q., Shang, J., Liu, C., Zhuang, T., Ding, J., et al. (2021). UAV-and machine learning-based retrieval of wheat SPAD values at the overwintering stage for variety screening. Remote Sens. 13, 5166. doi: 10.3390/rs13245166
Woebbecke, D. M., Meyer, G. E., Von Bargen, K., Mortensen, D. A. (1995). Color indices for weed identification under various soil, residue, and lighting conditions. Trans. Asae 38, 259–269. doi: 10.13031/2013.27838
Wu, Q., Zhang, Y., Zhao, Z., Xie, M., Hou, D. (2023). Estimation of relative chlorophyll content in spring wheat based on multi-temporal UAV remote sensing. Agronomy 13, 211. doi: 10.3390/agronomy13010211
Xu, S., Xu, X., Blacker, C., Gaulton, R., Zhu, Q., Yang, M., et al. (2023). Estimation of leaf nitrogen content in rice using vegetation indices and feature variable optimization with information fusion of multiple-sensor images from UAV. Remote Sens. 15, 854. doi: 10.3390/rs15030854
Yang, H., Hu, Y., Zheng, Z., Qiao, Y., Zhang, K., Guo, T., et al. (2022). Estimation of potato chlorophyll content from UAV multispectral images with stacking ensemble algorithm. Agronomy 12, 2318. doi: 10.3390/agronomy12102318
Yang, B., Wang, M., Sha, Z., Wang, B., Chen, J., Yao, X., et al. (2019). Evaluation of aboveground nitrogen content of winter wheat using digital imagery of unmanned aerial vehicles. Sensors 19, 4416. doi: 10.3390/s19204416
Yang, X., Yang, R., Ye, Y., Yuan, Z., Wang, D., Hua, K. (2021). Winter wheat SPAD estimation from UAV hyperspectral data using cluster-regression methods. Int. J. Appl. Earth Obs. Geoinf. 105, 102618. doi: 10.1016/j.jag.2021.102618
Yin, C., Lv, X., Zhang, L., Ma, L., Wang, H., Zhang, L., et al. (2022). Hyperspectral UAV images at different altitudes for monitoring the leaf nitrogen content in cotton crops. Remote Sens. 14, 2576. doi: 10.3390/rs14112576
Yin, Q., Zhang, Y., Li, W., Wang, J., Wang, W., Ahmad, I., et al. (2023). Estimation of winter wheat SPAD values based on UAV multispectral remote sensing. Remote Sens. 15, 3595. doi: 10.3390/rs15143595
Yue, J., Yang, G., Tian, Q., Feng, H., Xu, K., Zhou, C. (2019). Estimate of winter-wheat above-ground biomass based on UAV ultrahigh-ground-resolution image textures and vegetation indices. Isprs-J. Photogramm. Remote Sens. 150, 226–244. doi: 10.1016/j.isprsjprs.2019.02.022
Yue, J., Yang, H., Yang, G., Fu, Y., Wang, H., Zhou, C. (2023). Estimating vertically growing crop above-ground biomass based on UAV remote sensing. Comput. Electron. Agric. 205, 107627. doi: 10.1016/j.compag.2023.107627
Zhang, S., Zhao, G., Lang, K., Su, B., Chen, X., Xi, X., et al. (2019). Integrated satellite, unmanned aerial vehicle (UAV) and ground inversion of the SPAD of winter wheat in the reviving stage. Sensors 19, 1485. doi: 10.3390/s19071485
Zheng, H., Li, W., Jiang, J., Liu, Y., Cheng, T., Tian, Y., et al. (2018). A comparative assessment of different modeling algorithms for estimating leaf nitrogen content in winter wheat using multispectral images from an unmanned aerial vehicle. Remote Sens. 10, 2026. doi: 10.3390/rs10122026
Zheng, H., Ma, J., Zhou, M., Li, D., Yao, X., Cao, W., et al. (2020). Enhancing the nitrogen signals of rice canopies across critical growth stages through the integration of textural and spectral information from unmanned aerial vehicle (UAV) multispectral imagery. Remote Sens. 12, 957. doi: 10.3390/rs12060957
Zhou, C., Gong, Y., Fang, S., Yang, K., Peng, Y., Wu, X., et al. (2022). Combining spectral and wavelet texture features for unmanned aerial vehicles remote estimation of rice leaf area index. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.957870
Zhou, L., Meng, R., Yu, X., Liao, Y., Huang, Z., Lü, Z., et al. (2023). Improved yield prediction of ratoon rice using unmanned aerial vehicle-based multi-temporal feature method. Rice Sci. 30, 247–256. doi: 10.1016/j.rsci.2023.03.008
Zhou, X., Yang, M., Chen, X., Ma, L., Yin, C., Qin, S., et al. (2023). Estimation of cotton nitrogen content based on multi-angle hyperspectral data and machine learning models. Remote Sens. 15, 955. doi: 10.3390/rs15040955
Zhu, Y., Liu, J., Tao, X., Su, X., Li, W., Zha, H., et al. (2023). A three-dimensional conceptual model for estimating the above-ground biomass of winter wheat using digital and multispectral unmanned aerial vehicle images at various growth stages. Remote Sens. 15, 3332. doi: 10.3390/rs15133332
Keywords: UAV, winter wheat, SPAD, features selection strategy, features fusion strategy
Citation: Su X, Nian Y, Shaghaleh H, Hamad A, Yue H, Zhu Y, Li J, Wang W, Wang H, Ma Q, Liu J, Li X and Alhaj Hamoud Y (2024) Combining features selection strategy and features fusion strategy for SPAD estimation of winter wheat based on UAV multispectral imagery. Front. Plant Sci. 15:1404238. doi: 10.3389/fpls.2024.1404238
Received: 20 March 2024; Accepted: 17 April 2024;
Published: 10 May 2024.
Edited by:
Liujun Xiao, Nanjing Agricultural University, ChinaCopyright © 2024 Su, Nian, Shaghaleh, Hamad, Yue, Zhu, Li, Wang, Wang, Ma, Liu, Li and Alhaj Hamoud. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xinwei Li, bGl4d0BhaHN0dS5lZHUuY24=; Yousef Alhaj Hamoud, WW91c2VmLWhhbW91ZDExQGhodS5lZHUuY24=