Enhancing yield prediction in maize breeding using UAV-derived RGB imagery: a novel classification-integrated regression approach

Ge, Haixiao; Zhang, Qi; Shen, Min; Qin, Yang; Wang, Lin; Yuan, Cansheng

doi:10.3389/fpls.2025.1511871

ORIGINAL RESEARCH article

Front. Plant Sci., 20 March 2025

Sec. Technical Advances in Plant Science

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1511871

This article is part of the Research Topic Machine Vision and Machine Learning for Plant Phenotyping and Precision Agriculture, Volume II

Haixiao Ge¹

Qi Zhang²

Min Shen¹

Yang Qin¹

Lin Wang¹

Cansheng Yuan¹*

¹College of Rural Revitalization, Jiangsu Open University, Nanjing, China
²Institute of Agricultural Resources and Environment, Jiangsu Academy of Agricultural Sciences, Nanjing, China

Accurate grain yield prediction is crucial for optimizing agricultural practices and ensuring food security. This study introduces a novel classification-integrated regression approach to improve maize yield prediction using UAV-derived RGB imagery. We compared three classifiers—Support Vector Machine (SVM), Decision Tree (DT), and Random Forest (RF)—to categorize yield data into low, medium, and high classes. Among these, SVM achieved the highest classification accuracy and was selected for classifying data prior to regression. Two methodologies were evaluated: Method 1 (direct RF regression on the full dataset) and Method 2 (SVM classification followed by class-specific RF regression). Multi-temporal vegetation indices (VIs) were analyzed across key growth stages, with the early vegetative phase yielding the lowest prediction errors. Method 2 significantly outperformed Method 1, reducing RMSE by 45.1% in calibration (0.28 t/ha vs. 0.51 t/ha) and 3.3% in validation (0.89 t/ha vs. 0.92 t/ha). This integrated framework demonstrates the advantage of combining classification and regression for precise yield estimation, providing a scalable tool for maize breeding programs. The results highlight the potential of UAV-based phenotyping to enhance agricultural productivity and support global food systems.

1 Introduction

Accurate crop yield prediction is essential for optimizing agricultural decisions, including harvest planning, crop insurance, and storage management. Reliable yield forecasts are crucial for farmers, agronomists, and agricultural policymakers to ensure efficient resource allocation and enhance productivity. Traditional yield estimation methods typically rely on field sampling, which is labor-intensive, destructive, and prone to inaccuracies (Liang et al., 2024). These methods usually involve collecting samples from the field and analyzing them to estimate the overall yield. However, this process is not only time-consuming but also disruptive to the growing crop. Additionally, the experiential knowledge of farmers and agricultural technicians is often used to predict crop yield, but this approach remains subjective and can be prone to errors, especially in large-scale or diverse farming systems (Zhang et al., 2020). As a result, yield estimates based on such knowledge can vary significantly, lacking consistency and often leading to inaccurate predictions.

To address these challenges, remote sensing technologies have emerged as powerful tools in modern agriculture. Over the past decade, the development of high-throughput phenotyping (HTP) systems—based on both ground-based mobile platforms and aerial systems—has revolutionized how we monitor crop growth and predict yield. These systems provide high spatial and temporal resolution data, which can be directly related to grain yield or crop responses to both biotic and abiotic stresses (Feng et al., 2020). Among these technologies, the application of unmanned aerial vehicles (UAVs) with high spatial resolution imagery has shown considerable success in estimating crop yield, particularly through the use of vegetation indices (VIs), such as the Normalized Difference Vegetation Index (NDVI) (Ballesteros et al., 2014; Candiago et al., 2015). These indices have been demonstrated to correlate strongly with crop yield, making them an effective tool for monitoring crop health and predicting yield (Hassan et al., 2019). Numerous statistical methods have been employed to estimate agricultural variables from UAV-derived VI data. Linear regression models are commonly used to calibrate the relationship between UAV-based VIs and measured agricultural variables, such as crop height and yield. For instance, Geipel et al. (2014) utilized UAV-RGB imagery to predict corn grain yield by calculating crop height and VIs. Similarly, Vega et al. (2015) found a strong linear relationship between NDVI and yield when extracting NDVI data from multi-temporal UAV imagery.

While these methods are useful, they also have limitations. One major drawback is their inability to account for the complex relationships between multiple variables involved in crop growth and yield. Traditional regression models may oversimplify these relationships, leading to lower accuracy in yield prediction (Duan et al., 2021). In contrast, machine learning techniques have gained popularity due to their ability to model complex, non-linear relationships between numerous variables without relying on explicit equations. For instance, machine learning models such as support vector machines (SVM) and random forests (RF) have been successfully applied to estimate crop yield by integrating various input features like weather conditions, soil properties, and remote sensing data (Cai et al., 2019). These models can handle large and high-dimensional datasets, making them well-suited for real-time yield prediction in precision agriculture. By capturing intricate interactions between environmental factors, crop physiology, and management practices, machine learning models are able to provide more robust predictions than traditional methods. Moreover, machine learning algorithms can be trained to adapt to new data, improving their accuracy over time as more information becomes available. This adaptability makes machine learning particularly valuable in agricultural settings, where conditions and inputs vary widely across regions and seasons.

Maize (Zea mays L.) stands as a global staple crop with triple significance in food security, bioenergy production, and livestock nutrition (Herrmann et al., 2020). In breeding programs, the annual yield testing protocol for new cultivars demands particularly accurate prediction methodologies, as even marginal improvements in estimation accuracy can substantially accelerate cultivar development cycles. Traditional machine learning regression approaches for yield prediction, however, often face limitations when dealing with the inherent heterogeneity across diverse maize cultivars and environmental interactions. Recent advances in two-stage analytical frameworks combining classification with regression have demonstrated remarkable success in spectral analysis applications. Notably, Wang et al. (2014) achieved enhanced coal property predictions through initial spectral classification, while Wang et al. (2019) improved glucose content estimation via categorical preprocessing of spectral data. These successes suggest that a classification-before-regression approach could effectively address the spectral complexity and cultivar variability challenges in maize yield prediction. By stratifying populations into homogeneous subgroups before applying subgroup-specific regression models, this methodology minimizes inter-group interference while maximizing intra-group pattern recognition - a critical advantage for precision agriculture applications.

This study introduces three significant advancements to maize yield prediction research. First, we establish a novel two-stage framework involving initial yield potential classification using UAV-derived RGB imagery followed by subgroup-optimized regression modeling. Second, we systematically investigate the temporal dynamics of VIs across critical growth stages and their cultivar-specific relationships with final yield, an aspect previously undercharacterized in maize phenomics. Third, through comprehensive comparison with conventional regression techniques, we demonstrate the superior performance of our stratified approach in handling cultivar diversity.

The manuscript is organized as follows: Section 1 provides a comprehensive review of the relevant literature on crop yield estimation, focusing on UAVs and machine learning techniques. Section 2 outlines the methodology used in this study, including data collection, feature extraction, and the application of classification and regression procedures. Section 3 presents the results and performance evaluation of the proposed framework. Section 4 discusses the implications of the findings, addresses the limitations of the current study, and offers recommendations for future research directions. Finally, Section 5 presents the conclusion, summarizing the key findings and emphasizing the potential impact of the proposed classification-before-regression technique on improving yield estimation accuracy in maize breeding programs.

2 Materials and methods

2.1 Experimental location and plant materials

This study was conducted at a maize breeding base in Nong’an County, Changchun City, Jilin Province, China (125°8’28’’ E, 44°22’25’’ N), situated within the fertile black soil (Chernozem) region of the Songnen Plain, a key maize-producing area in Northeast China (Figure 1). The experimental site featured flat terrain and uniform soil properties optimized for high-yield maize cultivation. A total of 72 plots (5.0 m × 6.0 m each) were arranged in a randomized complete block design, with eight rows per plot, 60 cm row spacing, and 25 cm plant spacing. Forty-two maize genotypes were selected from the core germplasms of the Jilin Academy of Agricultural Sciences (JAAS), representing elite inbred lines, widely adopted hybrids, and locally adapted landraces. These genotypes were sown on May 2, 2021 (Table 1), aligning with the optimal planting window for maize in Jilin Province. They were chosen for their adaptability to Jilin’s temperate climate—including tolerance to early-season cold stress and resistance to prevalent diseases (e.g., northern leaf blight, stalk rot)—genetic diversity, and alignment with regional breeding objectives such as yield stability, abiotic stress resilience, and dual-purpose (grain/silage) utility. The planting density was standardized at 75,000 plants per hectare, consistent with local high-yield practices. Field management followed regional protocols: a base application of 200 kg/ha compound fertilizer (15:15:15 NPK) at sowing, 150 kg/ha urea topdressing at the V6 stage, supplemental irrigation during critical growth phases, and integrated pest management to minimize biotic stressors.

Figure 1. Location of the study area and overview of the experimental field.

Table 1. Details of the experiment and UAV flights in 2021.

2.2 Data collection

2.2.1 Yield collection

At the end of the maize growing season (late September to early October), all maize plots under investigation were harvested manually. In each plot, all the maize plants were harvested, and the total grain yield was measured. To minimize the effects of plot boundaries, the entire area of each 5.0 m × 6.0 m plot was included in the harvest, ensuring that data collected from the entire plot represented the full yield potential of the genotype being evaluated.

The harvested cobs were threshed, and the grains were dried to a moisture content of approximately 12%. The dried grains were then weighed using an electronic scale with an accuracy of ±0.1 g. The final yield was converted to kilograms per hectare (kg/ha) based on the plot area; yields are reported in t/ha in Section 3. Harvesting the entire plot was intended to reduce potential bias from edge effects and to reflect the genotype’s performance across the whole plot. The grain yield was then used as the target variable for training and testing the yield prediction models.
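The unit conversion described above can be sketched as follows. This is a minimal illustration only: the function names and the example grain mass are hypothetical and are not taken from the study; only the 5.0 m × 6.0 m plot size comes from Section 2.1.

```python
# Plot dimensions from Section 2.1 (5.0 m x 6.0 m).
PLOT_AREA_M2 = 5.0 * 6.0


def plot_yield_kg_per_ha(grain_mass_kg: float, plot_area_m2: float = PLOT_AREA_M2) -> float:
    """Scale the dried grain mass of one plot to kilograms per hectare."""
    return grain_mass_kg / plot_area_m2 * 10_000.0  # 1 ha = 10,000 m^2


def kg_per_ha_to_t_per_ha(yield_kg_per_ha: float) -> float:
    """Convert kg/ha to t/ha, the unit used when reporting RMSE."""
    return yield_kg_per_ha / 1000.0


# Hypothetical example value, not a measurement from the study.
example_mass_kg = 27.0
example_kg_ha = plot_yield_kg_per_ha(example_mass_kg)
example_t_ha = kg_per_ha_to_t_per_ha(example_kg_ha)
print(f"{example_kg_ha:.0f} kg/ha = {example_t_ha:.2f} t/ha")
```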

2.2.2 UAV image acquisition

The acquisition dates of UAV-based images are provided in Table 1. In this study, the UAV-based remote sensing system comprised a consumer-grade RGB camera and a UAV platform (Phantom 4 Pro, DJI, Shenzhen, China). Under optimal conditions, the UAV system could hover for up to 30 minutes. The RGB camera was equipped with a one-inch complementary metal-oxide-semiconductor (CMOS) sensor capable of capturing still images of approximately 20 megapixels. To ensure image quality, the RGB camera was pointed vertically downward during each flight. The flight altitude was set to 50 m, resulting in a ground sampling distance (GSD) of 1.36 cm/pixel. The UAV control app (Pix4Dcapture, Pix4D Corporation, Lausanne, Switzerland) was used to design, control, and monitor UAV flights. Before each flight, waypoints were predefined to achieve a minimum of 70% side and forward overlap. All flights were conducted under stable ambient light conditions. After each flight, geo-information was acquired from the onboard GPS equipment integrated into the UAV system, and images were subsequently downloaded from an SD card for further processing.
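As a rough check on the reported GSD, the standard pinhole relation between flight altitude, sensor width, focal length, and image width can be evaluated as below. The camera parameters (13.2 mm sensor width, 8.8 mm focal length, 5472-pixel image width) are assumptions based on typical published specifications for this camera class and are not stated in the text; with them, the sketch gives a value close to the reported 1.36 cm/pixel.

```python
def ground_sampling_distance_cm(
    altitude_m: float,
    sensor_width_mm: float,
    focal_length_mm: float,
    image_width_px: int,
) -> float:
    """Nadir GSD in cm/pixel from the pinhole camera relation."""
    # sensor_width / focal_length is dimensionless; altitude is converted to cm.
    return (sensor_width_mm * altitude_m * 100.0) / (focal_length_mm * image_width_px)


# Assumed camera parameters (not given in the text).
gsd_cm = ground_sampling_distance_cm(
    altitude_m=50.0,
    sensor_width_mm=13.2,
    focal_length_mm=8.8,
    image_width_px=5472,
)
print(f"GSD at 50 m: {gsd_cm:.2f} cm/pixel")
```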

2.3 Image processing

2.3.1 Image mosaicking

Image mosaicking was performed using Pix4Dmapper software. The specific operation process was as follows: (1) import all images from the same UAV flight date into Pix4Dmapper; (2) select the coordinate system and processing options template; (3) align the raw images using their altitude and spatial position information; and (4) export the orthophoto map in TIFF format. Images collected during the eight other periods of maize growth were pre-processed following the aforementioned steps.

Additionally, the radiance reaching the lens had a linear correlation with the digital number (DN) for each band. Consequently, an empirical linear equation (Equation 1) described by Yu et al. (2016) was adopted to maintain radiometric consistency in multi-temporal images. The equation is defined as:

$$DN_{normalized} = a \times DN_{raw} + b \tag{1}$$

where a and b are normalization coefficients derived from the reference image (July 9). These coefficients were calculated by minimizing the radiometric differences between subsequent-date images and the reference.
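One way the coefficients in Equation 1 could be obtained is sketched below. The text states only that a and b minimize the radiometric differences to the reference image; fitting them by ordinary least squares on matched pixel samples, and clipping to an 8-bit range, are assumptions made here for illustration. The function names and the synthetic arrays are hypothetical.

```python
import numpy as np


def fit_linear_normalization(dn_raw: np.ndarray, dn_ref: np.ndarray) -> tuple[float, float]:
    """Least-squares a, b such that a * dn_raw + b approximates dn_ref."""
    # np.polyfit returns coefficients from the highest degree down: [a, b].
    a, b = np.polyfit(dn_raw.ravel().astype(float), dn_ref.ravel().astype(float), deg=1)
    return float(a), float(b)


def apply_normalization(band: np.ndarray, a: float, b: float) -> np.ndarray:
    """Apply Equation 1 and keep the result in the 8-bit DN range (assumed)."""
    return np.clip(a * band.astype(float) + b, 0.0, 255.0)


# Synthetic matched samples (not study data): reference = 1.1 * later + 12.
rng = np.random.default_rng(0)
reference = rng.uniform(40.0, 200.0, size=500)
later = (reference - 12.0) / 1.1

a, b = fit_linear_normalization(later, reference)
normalized = apply_normalization(later, a, b)
print(f"a = {a:.3f}, b = {b:.3f}")
```

In practice the matched samples would presumably come from surfaces whose reflectance is stable across dates; how those samples were chosen is not described in the text.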

2.3.2 VIs calculation

VIs were calculated from UAV-based remotely sensed orthomosaics. A total of 14 VIs, widely applied in crop research, were selected (Ge et al., 2021; Kim et al., 2018). The corresponding formulations of these VIs are provided in Table 2. The calculation of VIs for each plot involved three steps: (1) regions of interest (ROIs) were generated using ArcGIS v.10.8 software (ESRI, Redlands, CA, USA) to manually delineate the plots from the orthomosaics (Figure 1); (2) a Python script was used to calculate VIs based on the R, G, and B bands of the orthomosaics; (3) the mean VI value of each ROI was calculated using the “ZonalStatisticsAsTable” module in ArcGIS v.10.8 software.

Table 2. Formulations of the selected VIs in this study.
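The per-plot VI calculation can be illustrated with a short sketch. Only three indices are shown, using commonly cited definitions (INT as the mean of the three bands, G/B as a band ratio, and ExR as 1.4r − g on normalized chromatic coordinates); these definitions are assumptions that should be checked against Table 2. The NumPy mask stands in for the ArcGIS zonal statistics step, and all names and pixel values are hypothetical.

```python
import numpy as np


def rgb_indices(red, green, blue) -> dict[str, np.ndarray]:
    """Compute a few RGB-based indices per pixel (definitions assumed, see lead-in)."""
    red, green, blue = (np.asarray(band, dtype=float) for band in (red, green, blue))
    total = red + green + blue
    total_safe = np.where(total == 0, np.nan, total)  # avoid division by zero
    blue_safe = np.where(blue == 0, np.nan, blue)
    r = red / total_safe    # normalized red chromatic coordinate
    g = green / total_safe  # normalized green chromatic coordinate
    return {
        "INT": total / 3.0,
        "G/B": green / blue_safe,
        "ExR": 1.4 * r - g,
    }


def plot_mean(index_map: np.ndarray, plot_mask: np.ndarray) -> float:
    """Mean index value inside one plot ROI, ignoring undefined pixels."""
    return float(np.nanmean(index_map[plot_mask]))


# Tiny synthetic 2 x 2 orthomosaic; the top row plays the role of one plot ROI.
red = np.array([[100, 100], [50, 50]])
green = np.array([[150, 150], [100, 100]])
blue = np.array([[50, 50], [50, 50]])
roi = np.array([[True, True], [False, False]])

indices = rgb_indices(red, green, blue)
plot_values = {name: plot_mean(index_map, roi) for name, index_map in indices.items()}
print(plot_values)
```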

2.4 Model building

2.4.1 Classification method

In this study, classification was performed using three commonly applied methods: SVM, Decision Tree (DT), and RF. Hyperparameters for each model were tuned as detailed in Table 3, and the models were tested and validated using 5-fold cross-validation. The key hyperparameters were selected based on their influence on classification accuracy: (1) SVM: both the ‘linear’ and ‘rbf’ kernels were tested, and the better-performing kernel was selected; (2) DT: “min_samples_leaf” and “min_samples_split” were optimized for each growth stage; (3) RF: “max_depth” and “n_estimators” were optimized for each growth stage. Classification performance was evaluated using overall accuracy and F1-score, and the classifier with the highest accuracy on the validation set was selected as the final method for further analysis.

Table 3. Details of the user-defined parameters of the classifier models tuned with 5-fold cross-validation during calibration.
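The classifier comparison described in Section 2.4.1 could be set up along the following lines with scikit-learn. This is a sketch under stated assumptions, not the study's script: the data are synthetic, the grid values are placeholders rather than the entries of Table 3, and the feature scaling before the SVM, the stratified split, and the weighted F1 average are choices made here that the text does not specify.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (NOT the study's data): 72 plots x 14 VIs.
rng = np.random.default_rng(42)
X = rng.normal(size=(72, 14))
score = X[:, 0] + 0.5 * rng.normal(size=72)
y = np.digitize(score, np.quantile(score, [0.30, 0.70]))  # 0=low, 1=medium, 2=high

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, random_state=0, stratify=y
)

# Placeholder grids over the hyperparameters named in the text.
candidates = {
    "SVM": (
        make_pipeline(StandardScaler(), SVC()),
        {"svc__kernel": ["linear", "rbf"]},
    ),
    "DT": (
        DecisionTreeClassifier(random_state=0),
        {"min_samples_leaf": [1, 2, 4], "min_samples_split": [2, 4, 8]},
    ),
    "RF": (
        RandomForestClassifier(random_state=0),
        {"max_depth": [3, 5, None], "n_estimators": [50, 100]},
    ),
}

results = {}
for name, (estimator, grid) in candidates.items():
    search = GridSearchCV(estimator, grid, cv=5, scoring="accuracy")
    search.fit(X_train, y_train)
    y_pred = search.predict(X_test)
    results[name] = {
        "best_params": search.best_params_,
        "accuracy": accuracy_score(y_test, y_pred),
        "f1": f1_score(y_test, y_pred, average="weighted"),
    }

# Keep the classifier with the highest held-out accuracy.
best_name = max(results, key=lambda name: results[name]["accuracy"])
print(best_name, results[best_name])
```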

2.4.2 Regression method

The RF model was used to establish regression models for yield estimation. RF is an ensemble learning algorithm that aggregates the results of individual decision trees using bagging and random feature selection. The final prediction is made by averaging the outputs of all trees in the forest (Jordan and Mitchell, 2015). RF is relatively insensitive to collinearity between variables, and averaging over many decorrelated trees reduces the risk of overfitting, which helps it handle complex datasets with high prediction accuracy. In this study, regression models were developed using UAV-based VIs and measured yield data. The RF algorithm was implemented using the “RandomForestRegressor” function in the “scikit-learn” Python package (https://scikit-learn.org/stable/). During calibration, three hyperparameters (“max_depth”, “min_samples_split”, and “min_samples_leaf”) were tuned using a grid search with 5-fold cross-validation.
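A minimal sketch of this tuning step with `RandomForestRegressor` and `GridSearchCV` is given below. The data are synthetic, and the grid values, the number of trees, and the RMSE-based scoring are assumptions chosen for illustration; the text names the three tuned hyperparameters but not the values searched.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in data (NOT the study's data): 72 plots x 14 VIs, yield in t/ha.
rng = np.random.default_rng(0)
X = rng.normal(size=(72, 14))
y = 9.5 + 0.8 * X[:, 0] + 0.3 * rng.normal(size=72)

# Placeholder grid over the three hyperparameters named in the text.
param_grid = {
    "max_depth": [3, 5, None],
    "min_samples_split": [2, 4],
    "min_samples_leaf": [1, 2],
}

search = GridSearchCV(
    RandomForestRegressor(n_estimators=100, random_state=0),
    param_grid,
    cv=5,
    scoring="neg_root_mean_squared_error",  # higher (less negative) is better
)
search.fit(X, y)

best_rmse = -search.best_score_
print(search.best_params_, f"cross-validated RMSE = {best_rmse:.2f} t/ha")
```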

2.4.3 Calibration methods

Two different procedures were implemented to predict maize yield using UAV-based VIs (Figure 2). The detailed descriptions of these two strategies are as follows:

Figure 2. The framework for predicting maize yield in this study.

Method 1 (Regression models using the full sample set): The complete set of VIs from Table 2 was used as input for the RF regression model. This approach aimed to predict maize yield directly from all available VI data.

Method 2 (Regression models using grouped sample sets after classification): In this method, three yield levels were first defined from the measured yield: a low level (30%), corresponding to plots with the lowest yield values; a medium level (40%), corresponding to plots with intermediate yield values; and a high level (30%), corresponding to plots with the highest yield values. The optimal classifier identified in Section 2.4.1 was then used to assign the samples to these yield levels. Following classification, a separate RF regression model was fitted to each yield group.
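The two-stage logic of Method 2 can be sketched as follows. This is an illustrative reading rather than the study's implementation: the data are synthetic, the 30%/70% quantile cut points are computed on the training yields, the class-specific regressors are trained on samples grouped by their true yield level, and at prediction time each sample is routed by the SVM's predicted level. Each of these choices is an assumption where the text is not explicit, and the helper names are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def fit_two_stage(X_train: np.ndarray, y_train: np.ndarray):
    """Fit an SVM yield-level classifier and one RF regressor per yield level."""
    cuts = np.quantile(y_train, [0.30, 0.70])  # low 30% / medium 40% / high 30%
    levels = np.digitize(y_train, cuts)        # 0=low, 1=medium, 2=high
    classifier = make_pipeline(StandardScaler(), SVC(kernel="rbf")).fit(X_train, levels)
    regressors = {}
    for level in np.unique(levels):
        members = levels == level
        model = RandomForestRegressor(n_estimators=100, random_state=0)
        regressors[int(level)] = model.fit(X_train[members], y_train[members])
    return classifier, regressors


def predict_two_stage(classifier, regressors, X: np.ndarray):
    """Route each sample to the regressor of its predicted yield level."""
    predicted_levels = classifier.predict(X)
    y_pred = np.empty(len(X), dtype=float)
    for level, model in regressors.items():
        members = predicted_levels == level
        if members.any():
            y_pred[members] = model.predict(X[members])
    return y_pred, predicted_levels


# Synthetic stand-in data (NOT the study's data): 72 plots x 14 VIs, yield in t/ha.
rng = np.random.default_rng(1)
X = rng.normal(size=(72, 14))
y = 9.5 + 0.8 * X[:, 0] + 0.3 * rng.normal(size=72)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30, random_state=0)

classifier, regressors = fit_two_stage(X_train, y_train)
y_pred, predicted_levels = predict_two_stage(classifier, regressors, X_test)
print(np.round(y_pred[:5], 2), predicted_levels[:5])
```

Because each regressor only sees one yield range, a misclassified sample is predicted within the wrong range, so the quality of the final estimate depends directly on the classifier's accuracy.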

To optimize data collection and prediction accuracy, maize growth cycles were systematically categorized into four distinct phenological phases: vegetative stage, panicle formation stage, ripening stage, and whole growth cycle. This classification framework was specifically applied to synchronize with multi-temporal UAV observation schedules, with detailed stage-specific data acquisition timelines presented in Table 4.

Table 4. The division of different growth stages for predicting maize yield.

2.4.4 Statistical analysis

In this study, the Pearson correlation coefficient (r) was used to analyze the relationships between UAV-based VIs and grain yield at different growth stages. The selected VIs and the yield data were used to create the raw dataset matrix with VIs as the independent variables (X) and grain yield as the dependent variable (Y). Before classification and regression, the dataset matrix was randomly split into training (70%) and testing (30%) sets for each of the four growth stages.
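The correlation screening and the 70/30 split can be sketched as below. The VI names and data are hypothetical placeholders, and the fixed random seed is an assumption added for reproducibility.

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (NOT the study's data): 72 plots x 3 placeholder VIs.
rng = np.random.default_rng(0)
vi_names = ["VI_a", "VI_b", "VI_c"]  # hypothetical names
X = rng.normal(size=(72, len(vi_names)))
y = 9.0 + 0.5 * X[:, 0] + 0.8 * rng.normal(size=72)

# Pearson r and its p-value between each VI and grain yield.
correlations = {}
for column, name in enumerate(vi_names):
    r, p_value = pearsonr(X[:, column], y)
    correlations[name] = (float(r), float(p_value))

# Random 70/30 split into training and testing sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, random_state=0
)
print(correlations)
print(len(X_train), "training plots,", len(X_test), "testing plots")
```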

Additionally, two metrics, overall accuracy and F1-score, were selected to assess the accuracies of the different classifiers. Finally, the predictive performance of RF regression models was quantitatively evaluated using the coefficient of determination (R²) and root mean square error (RMSE). These statistical metrics were calculated in Equations 2, 3:

$$R^{2} = 1 - \frac{\sum_{i = 1}^{n} (y_{i} - \hat{y}_{i})^{2}}{\sum_{i = 1}^{n} (y_{i} - \bar{y})^{2}} \tag{2}$$

$$RMSE = \sqrt{\frac{\sum_{i = 1}^{n} (y_{i} - \hat{y}_{i})^{2}}{n}} \tag{3}$$

where $y_{i}$ is the measured yield, $\hat{y}_{i}$ is the predicted yield, $\bar{y}$ is the mean of all measured yields, and $n$ is the number of samples.
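Equations 2 and 3 translate directly into a few lines of NumPy. The example yields below are invented to make the arithmetic easy to follow and are not study data.

```python
import numpy as np


def r_squared(y_true, y_pred) -> float:
    """Coefficient of determination as defined in Equation 2."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


def rmse(y_true, y_pred) -> float:
    """Root mean square error as defined in Equation 3."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


# Invented yields in t/ha.
measured = [8.0, 9.0, 10.0, 11.0]
predicted = [8.5, 9.0, 9.5, 11.0]

r2_value = r_squared(measured, predicted)   # SS_res = 0.5, SS_tot = 5.0
rmse_value = rmse(measured, predicted)      # sqrt(0.5 / 4)
print(f"R2 = {r2_value:.2f}, RMSE = {rmse_value:.3f} t/ha")
```

Note that R² computed this way can be negative on a validation set when the model predicts worse than the mean of the measured yields.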

3 Results

3.1 Correlation analysis of VIs and maize yield

Pearson correlation analysis was conducted to explore the relationship between maize yield and various VIs across different growth periods (Figure 3). The analysis revealed significant variations in the strength and significance of these correlations at different growth stages. At the early growth stages (DAS=68, 80, and 86), many VIs exhibited significant correlations with maize yield, particularly at DAS=68. Notably, ExR and NGI indices demonstrated the strongest correlations with yield at this stage, with r values of 0.39 and -0.38, respectively (Supplementary Table 1). This suggests that these indices are effective predictors of yield potential during the early stages of crop development. During the middle growth stages (DAS=95, 104, and 115), the correlation coefficients between VIs and yield varied more significantly across different periods. For instance, ExR exhibited the strongest correlation at DAS=95 with an r value of 0.54, but its correlation strength decreased at DAS=104 and DAS=115. Indices such as R, G, G/B, and INT showed good correlations at some individual stages but exhibited random variations overall, indicating some degree of inconsistency in their correlations with yield. In the late growth stages (DAS=125, 139, and 149), the correlations between most VIs and maize yield were generally weak and not significant. All VIs had absolute correlation coefficients (|r|) below 0.26, suggesting that these indices are less suitable for predicting yield during the post-heading stage of crop development.

Figure 3. Pearson correlations between UAV-based VIs and yield in maize breeding across different growth periods. n.s., *, and ** represent ‘not significant’, p<0.05, and p<0.01, respectively.

3.2 Maize yield prediction based on method 1

The yields predicted by Method 1 at each growth stage are presented in Figure 4. In the calibration datasets, the RF models at the vegetative stage and the combined stages (whole growth period) achieved slightly better results than those at the panicle-formation and ripening stages, with R² values ranging from 0.92 to 0.93 and RMSE values ranging from 0.51 t/ha to 0.52 t/ha. At all growth stages, yield was overestimated when the measured yield was less than 8.4 t/ha and underestimated when the measured yield exceeded 10.9 t/ha. Compared with the calibration set, the RF models at each growth stage showed lower R² values and higher RMSE values in the validation set, with the scatter points deviating further from the 1:1 line. Although the best validation results were obtained at the vegetative and combined stages, the prediction accuracies remained poor, with R² values of 0.14 and 0.10 and RMSE values of 0.92 t/ha and 0.93 t/ha, respectively.

Figure 4. Relationship between measured and predicted yields based on Method 1 at each growth stage: (A) Vegetative stage, (B) Panicle-formation stage, (C) Ripening stage, and (D) Whole growth period. The scatterplots include results from both the calibration set and the validation set.

3.3 Maize yield prediction based on method 2

The classification performance of three machine learning methods (SVM, DT, and RF) was evaluated for yield-level classification across different growth stages, based on the validation set results presented in Table 5. SVM generally outperformed DT and RF, achieving the highest F1-score of 0.71 and an overall accuracy of 68% at the vegetative stage. At the panicle-formation stage, SVM also performed better, with an F1-score of 0.69 and an overall accuracy of 64%, compared to DT’s F1-score of 0.60 and RF’s F1-score of 0.62. At the ripening stage, SVM maintained an F1-score of 0.56, but its overall accuracy of 50% was lower than RF’s 57%. Across the whole growth period, SVM achieved an F1-score of 0.61 and an overall accuracy of 59%, which was higher than DT’s 57%. These results indicate that SVM is more effective for yield-level classification in maize breeding, particularly at the vegetative stage, which is crucial for early yield prediction.

Table 5. The accuracy of yield-level classification by using the three machine learning methods at each growth stage.

After SVM was selected for its superior classification performance, all yield samples were classified into three levels. For each level, an RF regression model was fitted at each growth stage to predict yield. Figure 5 shows the prediction results of Method 2. In the calibration set, measured and predicted yields agreed well (R² = 0.91-0.96 and RMSE = 0.28-0.44 t/ha), and the scatter points at each growth stage were generally close to the 1:1 line. In the validation set, the agreement between measured and predicted yield at the vegetative stage was better than at other stages, with R² and RMSE values of 0.42 and 0.89 t/ha, respectively, indicating that yield was predicted most accurately from imagery acquired at the early growth stage. However, at every growth stage, Method 2 was less accurate in the validation set than in the calibration set. Additionally, the regression models at each growth stage tended to underestimate yield at low measured yield levels.

Figure 5. Relationship between measured and predicted yields based on Method 2 at each growth stage: (A) Vegetative stage, (B) Panicle-formation stage, (C) Ripening stage, and (D) Whole growth period. The scatterplots include results from both the calibration set and the validation set.

3.4 Method 1 vs. Method 2

Figure 6 compares the prediction results of Method 1 and Method 2 across growth stages. At the vegetative stage, Method 2 achieved higher R² values and lower RMSE values than Method 1. In the calibration set, Method 2 achieved R² values of 0.91-0.96 and RMSE values of 0.28-0.44 t/ha across growth stages, whereas the best Method 1 models reached RMSE values of 0.51-0.52 t/ha. In the validation set at the vegetative stage, Method 2 reached an R² of 0.42 and an RMSE of 0.89 t/ha, compared with an R² of 0.14 and an RMSE of 0.92 t/ha for Method 1. These results highlight the effectiveness of the classification-before-regression strategy in enhancing yield prediction accuracy, particularly during the early growth stages, although the gain in validation RMSE was modest.
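The relative RMSE reductions quoted in the abstract follow from the RMSE values of the two methods. Writing $RMSE_{M1}$ and $RMSE_{M2}$ for Method 1 and Method 2 (notation introduced here for convenience), the reduction is

$$\Delta_{RMSE} = \frac{RMSE_{M1} - RMSE_{M2}}{RMSE_{M1}} \times 100\%$$

so that, for the calibration values given in the abstract, $(0.51 - 0.28)/0.51 \times 100\% \approx 45.1\%$, and for the validation values, $(0.92 - 0.89)/0.92 \times 100\% \approx 3.3\%$.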

Figure 6. The comparison of the yield prediction accuracy by using Method 1 and Method 2 at each growth stage.

4 Discussion

4.1 The optimal growth stage for predicting maize yield

One of the primary objectives of this study was to investigate the optimal growth stage for collecting UAV-based VIs suitable for predicting yield in maize breeding. The UAV-based VI data were collected at the vegetative (DAS = 68, 80, and 86), panicle-formation (DAS = 95, 104, and 115), and ripening (DAS = 125, 139, and 149) stages. These growth stages are known as critical periods in maize development and growth since various stresses during these times can significantly impact yield. As shown in Supplementary Table 1 and Figure 3, many UAV-based VIs were significantly correlated with yield during the vegetative period across different maize cultivars. For example, color indices such as ExR and NGI showed the strongest correlations with yield at DAS=68, highlighting their potential as predictors of yield potential during this period.

The vegetative stage is considered the critical phase for maize development as it is during this period that the plant’s biomass accumulates rapidly, laying the foundation for subsequent growth and yield formation. During this stage, the plant is highly sensitive to environmental stresses, and any adverse conditions can significantly impact its growth and ultimately the yield. Therefore, early prediction of yield during the vegetative stage can provide valuable information for farmers to make timely decisions regarding field management practices, such as fertilization and irrigation, to optimize yield. The strong correlations between UAV-based VIs and yield during the vegetative stage suggest that this period is an optimal time window for predicting maize yield. This finding is consistent with previous studies that have shown the potential of UAV-based VIs for yield prediction during the early growth stages of crops. For instance, a study by Yang et al. (2022) found that the optimal phenological phase for maize yield prediction using high-frequency UAV remote sensing was during the vegetative stage.

In addition, the relatively weak correlation between UAV-based VIs and yield during the post-heading stage (Figure 3) is consistent with the findings of Zheng et al. (2019). This is mainly because the emergence of maize panicles alters canopy structure, significantly influencing the relationship between UAV-based VIs and yield. During the post-heading stage, the maize canopy consists of stems, leaves, and panicles, with both leaves and panicles contributing to canopy reflectance. Thus, VIs calculated from maize canopy reflectance exhibited varying correlations with yield. Furthermore, panicle traits (such as number, length, and weight) vary across cultivars and influence canopy structure differently. Consequently, the correlation between VIs and yield becomes more complex across cultivars in the post-heading stage. By contrast, as mentioned above, UAV-based VIs at the vegetative stage correlated better with yield. Therefore, it is more informative to predict yield in maize breeding during the early growth stage.

However, the relationship between single-stage VIs and yield is still affected by differences among maize cultivars. Although all maize cultivars were sown simultaneously, the maize in each plot was not at the same phenological stage on the imaging and field sampling days. Such phenological variation increases spatial variability, so cultivar differences can significantly affect maize grain yield prediction using UAV-based data. Previous studies on rice (Duan et al., 2021) and wheat (Dong et al., 2020) have investigated the influence of cultivars on crop yield prediction. The varying morphology of crop cultivars makes grain yield prediction less accurate and more complicated. Therefore, the phenological influence of different cultivars should be considered in further analysis.

4.2 Improved RF regression based on SVM classification

In this study, the performance of the two quantification methods (Method 1 and Method 2) was evaluated using R² and RMSE, with the detailed results presented in Figure 6. As discussed, Method 2 outperformed Method 1, particularly during the vegetative and panicle-formation stages. This improvement can be attributed to the higher classification accuracy achieved for each yield class in maize breeding during the pre-heading period (Table 5). The categorization into yield classes reduced the uncertainty in yield predictions by narrowing the yield range within each class, thereby enhancing the model’s performance (Figure 7).

Figure 7. Yield maps of the maize breeding plots: (A) measured maize yield; (B, C) yield predicted by Method 1 and Method 2 at the vegetative stage in the calibration and validation sets, respectively.

Considering the large number of maize genotypes in breeding programs, which may number in the hundreds, it is crucial to explore how classification strategies affect yield prediction models. In Method 2, the 72 maize yield samples were categorized into three yield classes using the best-performing classifier (SVM), with the detailed classification results presented in Table 5. Among the four growth periods, the vegetative stage provided the best classification results, with an overall accuracy of 68% for the validation set. Scatter plot analysis showed that the improvement in prediction accuracy with Method 2 was particularly pronounced for medium and high yield levels. These findings align with those of Wang et al. (2014), who demonstrated that models built on classified sample sets performed better than those built on the full sample set. This suggests that first sorting maize genotypes into distinct yield levels with SVM can improve the accuracy of the subsequent RF regression.
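
The classification-then-regression workflow described above can be sketched as follows. This is an illustrative outline using scikit-learn, not the authors' implementation. The synthetic features, the tercile-based class boundaries, the RBF kernel, and all hyperparameters are assumptions made for the example, and `predict_yield` is a hypothetical helper name.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(42)

# Synthetic stand-in for 72 plots x 5 vegetation indices (not the study's data).
n_plots, n_vis = 72, 5
X = rng.normal(size=(n_plots, n_vis))
y = 8.0 + 1.5 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.3, size=n_plots)

# Assumed class definition: yield terciles -> 0 (low), 1 (medium), 2 (high).
cuts = np.quantile(y, [1 / 3, 2 / 3])
classes = np.digitize(y, cuts)

# Step 1: an SVM classifier predicts the yield class from the VIs.
classifier = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
classifier.fit(X, classes)

# Step 2: one RF regressor per yield class, trained on that class only.
regressors = {}
for k in np.unique(classes):
    mask = classes == k
    rf = RandomForestRegressor(n_estimators=200, random_state=0)
    regressors[int(k)] = rf.fit(X[mask], y[mask])


def predict_yield(X_new):
    """Route each sample to the regressor of its predicted yield class."""
    X_new = np.asarray(X_new)
    predicted_class = classifier.predict(X_new)
    y_hat = np.empty(len(X_new))
    for k, rf in regressors.items():
        mask = predicted_class == k
        if mask.any():
            y_hat[mask] = rf.predict(X_new[mask])
    return y_hat


# In-sample (calibration-style) error; held-out data are needed for validation.
y_hat = predict_yield(X)
rmse = float(np.sqrt(np.mean((y - y_hat) ** 2)))
print(f"calibration RMSE: {rmse:.2f} t/ha")
```

In this design a misclassified sample is passed to the wrong regressor, so the final error depends on both the classifier's accuracy and the quality of the class-specific regressions. This is consistent with the observation that the stage with the best classification results also gave the best predictions.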

However, the lower validation accuracies compared to calibration accuracies point to several challenges. First, the use of a single year's data resulted in a limited sample size, which may have led to overfitting during calibration. Overfitting occurs when a machine learning model fits the training data too closely, capturing noise and random fluctuations rather than the underlying patterns, which degrades its performance on unseen data, as observed in the validation set. Second, the significant phenotypic variation among maize cultivars meant that plots were at different phenological stages on the imaging and field sampling days. This increased spatial variability complicated the relationship between VIs and yield, making it more difficult to characterize yield accurately using color features. Furthermore, while Method 2 shows potential for improving yield predictions in maize breeding, it may encounter limitations in large-scale applications because of variability in environmental conditions and genetic traits. For instance, UAV-based data collection across diverse ecological regions may provide more reliable results and enhance model generalizability. Thus, expanding the sample size and incorporating data from various ecological regions will be critical for improving the robustness and accuracy of the model. Future research should focus on these aspects, incorporating larger, more diverse datasets and addressing overfitting risks through techniques such as cross-validation, explainable artificial intelligence (XAI), or data augmentation (Naga Srinivasu et al., 2024).
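
As one illustration of how cross-validation can expose the calibration and validation gap discussed above, the sketch below compares the RMSE of an RF model on its own training data with a 5-fold cross-validated RMSE. The data are synthetic, and the fold count and hyperparameters are assumptions chosen for the example, not settings reported in this study.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import KFold, cross_val_score

rng = np.random.default_rng(1)

# Synthetic stand-in for 72 plots x 5 vegetation indices (not the study's data).
X = rng.normal(size=(72, 5))
y = 8.0 + 1.0 * X[:, 0] + rng.normal(scale=0.8, size=72)

model = RandomForestRegressor(n_estimators=200, random_state=0)

# RMSE on the same data the model was fitted to (an optimistic estimate).
model.fit(X, y)
train_rmse = float(np.sqrt(np.mean((y - model.predict(X)) ** 2)))

# 5-fold cross-validated RMSE: each fold is predicted by a model
# that never saw it during fitting.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
neg_mse = cross_val_score(model, X, y, cv=cv, scoring="neg_mean_squared_error")
cv_rmse = float(np.sqrt(-neg_mse.mean()))

print(f"training RMSE: {train_rmse:.2f} t/ha")
print(f"5-fold CV RMSE: {cv_rmse:.2f} t/ha")
```

A large difference between the two values signals that the training error understates the error to be expected on new plots. With small samples such as a single season of breeding plots, the cross-validated figure is the more realistic guide.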

In summary, Method 2 holds promise for enhancing yield prediction in maize breeding programs by classifying samples before regression and thereby reducing uncertainty in predictions. However, for broader applicability, its effectiveness needs to be validated through further research with larger sample sizes, diverse environmental conditions, and comprehensive phenological data. The integration of phenological and environmental variables will be essential for improving the robustness and generalizability of the models, as also suggested by Guo et al. (2021) and Guo et al. (2023), who highlight the need for multidimensional data integration in predictive modeling for agriculture.

5 Conclusions

This study demonstrates the potential of UAV-based imagery in predicting grain yield for maize breeding. Multi-temporal UAV images were collected from a field experiment, and various VIs were calculated from these images to predict maize yield at four critical growth stages. The results show that the accuracy of yield prediction is higher when using UAV-based images from the early growth stages. A significant improvement in prediction accuracy was achieved by applying a classification-before-regression strategy, where the raw dataset was first grouped into three yield classes using the SVM method. RF regression models were then applied to predict yield for each class separately. This approach reduced prediction errors, with RMSE values of 0.28 t/ha in the calibration set and 0.89 t/ha in the validation set. The classification-before-regression strategy outperformed traditional regression models, demonstrating the potential of machine learning techniques in precision agriculture.
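
The relative error reductions follow directly from the reported RMSE values. The short calculation below uses the Method 2 values given above together with the Method 1 values stated in the abstract (0.51 t/ha in calibration and 0.92 t/ha in validation). The function name `relative_reduction` is introduced here only for illustration.

```python
def relative_reduction(baseline, improved):
    """Percentage by which `improved` is lower than `baseline`."""
    return 100.0 * (baseline - improved) / baseline


# RMSE in t/ha: Method 1 (baseline) versus Method 2 (improved).
calibration = relative_reduction(0.51, 0.28)
validation = relative_reduction(0.92, 0.89)

print(f"calibration: {calibration:.1f}%")  # 45.1%
print(f"validation: {validation:.1f}%")    # 3.3%
```

The contrast between the two percentages shows that most of the gain from classification was realized in calibration, while the improvement on held-out data was modest.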

In summary, this work highlights the effectiveness of UAV-based imaging systems as a tool for gathering field-scale phenotypic data in maize breeding programs. By integrating RF regression models with SVM classification, this study offers a promising approach to predicting within-field yield variations. These findings contribute to the growing body of research in precision agriculture by showing that machine learning techniques can significantly improve yield prediction accuracy. In future research, the proposed methodology should be tested across different climatic zones to assess its robustness and generalizability. Future studies could also explore the use of UAV-based remote sensing data for other crops, as well as the time-dynamic information provided by multi-temporal VIs, which could further enhance prediction accuracy.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

HG: Investigation, Writing – original draft, Writing – review & editing. QZ: Data curation, Formal Analysis, Funding acquisition, Investigation, Writing – review & editing. MS: Data curation, Resources, Writing – review & editing. YQ: Resources, Writing – review & editing. LW: Writing – review & editing. CY: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant No. 23KJB210003).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1511871/full#supplementary-material

References

Ahmad, I. S., Reid, J. F. (1996). Evaluation of color representations for maize images. J. Agric. Eng. Res. 63, 185–195. doi: 10.1006/jaer.1996.0020

Ballesteros, R., Ortega, J., Hernández, D., Moreno, M. (2014). Applications of georeferenced high-resolution images obtained with unmanned aerial vehicles. Part I: Description of image acquisition and processing. Precis. Agric. 15, 579–592. doi: 10.1007/s11119-014-9355-8

Cai, Y. P., Guan, K. Y., Lobell, D., Potgieter, A. B., Wang, S. W., Peng, J., et al. (2019). Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches. Agric. For. Meteorol. 274, 144–159. doi: 10.1016/j.agrformet.2019.03.010

Candiago, S., Remondino, F., De Giglio, M., Dubbini, M., Gattelli, M. (2015). Evaluating multispectral images and vegetation indices for precision farming applications from UAV images. Remote Sens. 7, 4026–4047. doi: 10.3390/rs70404026

Dong, J., Lu, H., Wang, Y., Ye, T., Yuan, W. (2020). Estimating winter wheat yield based on a light use efficiency model and wheat variety data. ISPRS J. Photogramm. Remote Sens. 160, 18–32. doi: 10.1016/j.isprsjprs.2019.12.005

Duan, B., Fang, S., Gong, Y., Peng, Y., Wu, X., Zhu, R. J. (2021). Remote estimation of grain yield based on UAV data in different rice cultivars under contrasting climatic zone. Field Crops Res. 267, 108148. doi: 10.1016/j.fcr.2021.108148

Feng, A. J., Zhou, J. F., Vories, E. D., Sudduth, K. A., Zhang, M. N. (2020). Yield estimation in cotton using UAV-based multi-sensor imagery. Biosyst. Eng. 193, 101–114. doi: 10.1016/j.biosystemseng.2020.02.014

Ge, H., Ma, F., Li, Z., Tan, Z., Du, C. (2021). Improved accuracy of phenological detection in rice breeding by using ensemble models of machine learning based on UAV-RGB imagery. Remote Sens. 13, 2678. doi: 10.3390/rs13142678

Geipel, J., Link, J., Claupein, W. (2014). Combined spectral and spatial modeling of corn yield based on aerial images and crop surface models acquired with an unmanned aircraft system. Remote Sens. 6, 10335–10355. doi: 10.3390/rs61110335

Guo, Y., Fu, Y., Hao, F., Zhang, X., Wu, W., Jin, X., et al. (2021). Integrated phenology and climate in rice yields prediction using machine learning methods. Ecol. Indic. 120, 106935. doi: 10.1016/j.ecolind.2020.106935

Guo, Y., Xiao, Y., Hao, F., Zhang, X., Chen, J., de Beurs, K., et al. (2023). Comparison of different machine learning algorithms for predicting maize grain yield using UAV-based hyperspectral images. Int. J. Appl. Earth Obs. Geoinf. 124, 103528. doi: 10.1016/j.jag.2023.103528

Hassan, M. A., Yang, M. J., Rasheed, A., Yang, G. J., Reynolds, M., Xia, X. C., et al. (2019). A rapid monitoring of NDVI across the wheat growth cycle for grain yield prediction using a multi-spectral UAV platform. Plant Sci. 282, 95–103. doi: 10.1016/j.plantsci.2018.10.022

Herrmann, I., Bdolach, E., Montekyo, Y., Rachmilevitch, S., Townsend, P. A., Karnieli, A. (2020). Assessment of maize yield and phenology by drone-mounted superspectral camera. Precis. Agric. 21, 51–76. doi: 10.1007/s11119-019-09659-5

Hunt, E. R., Cavigelli, M., Daughtry, C. S., Mcmurtrey, J. E., Walthall, C. L. (2005). Evaluation of digital photography from model aircraft for remote sensing of crop biomass and nitrogen status. Precis. Agric. 6, 359–378. doi: 10.1007/s11119-005-2324-5

Jordan, M. I., Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science 349, 255–260. doi: 10.1126/science.aaa8415

Kim, D.-W., Yun, H. S., Jeong, S.-J., Kwon, Y.-S., Kim, S.-G., Lee, W. S., et al. (2018). Modeling and testing of growth status for Chinese cabbage and white radish with UAV-based RGB imagery. Remote Sens. 10, 563. doi: 10.3390/rs10040563

Liang, Y., Li, H., Wu, H., Zhao, Y., Liu, Z., Liu, D., et al. (2024). A rotated rice spike detection model and a crop yield estimation application based on UAV images. Comput. Electron. Agric. 224, 109188. doi: 10.1016/j.compag.2024.109188

Liu, K. L., Li, Y. Z., Han, T. F., Yu, X. C., Ye, H. C., Hu, H. W., et al. (2019). Evaluation of grain yield based on digital images of rice canopy. Plant Methods 15, 1–11. doi: 10.1186/s13007-019-0416-x

Maimaitijiang, M., Sagan, V., Sidike, P., Maimaitiyiming, M., Hartling, S., Peterson, K. T., et al. (2019). Vegetation index weighted canopy volume model (CVMVI) for soybean biomass estimation from unmanned aerial system-based RGB imagery. ISPRS J. Photogramm. Remote Sens. 151, 27–41. doi: 10.1016/j.isprsjprs.2019.03.003

Naga Srinivasu, P., Ijaz, M. F., Woźniak, M. (2024). XAI-driven model for crop recommender system for use in precision agriculture. Comput. Intell. 40, e12629. doi: 10.1111/coin.12629

Vega, F. A., Ramirez, F. C., Saiz, M. P., Rosua, F. O. (2015). Multi-temporal imaging using an unmanned aerial vehicle for monitoring a sunflower crop. Biosyst. Eng. 132, 19–27. doi: 10.1016/j.biosystemseng.2015.01.008

Wang, Y., Wang, D., Zhang, G., Wang, J. (2013). Estimating nitrogen status of rice using the image segmentation of GR thresholding method. Field Crops Res. 149, 33–39. doi: 10.1016/j.fcr.2013.04.007

Wang, Y., Yang, M., Wei, G., Hu, R., Luo, Z., Li, G. (2014). Improved PLS regression based on SVM classification for rapid analysis of coal properties by near-infrared reflectance spectroscopy. Sens. Actuators B Chem. 193, 723–729. doi: 10.1016/j.snb.2013.12.028

Wang, S., Zhao, Y., Hu, R., Zhang, Y., Han, X. (2019). Analysis of near-infrared spectra of coal using deep synergy adaptive moving window partial least square method based on genetic algorithm. Chin. J. Anal. Chem. 47, e19034–e19044. doi: 10.1016/S1872-2040(19)61150-3

Woebbecke, D. M., Meyer, G. E., Von Bargen, K., Mortensen, D. A. (1995). Color indices for weed identification under various soil, residue, and lighting conditions. Trans. ASAE. 38, 259–269. doi: 10.13031/2013.27838

Yang, B., Zhu, W. X., Rezaei, E. E., Li, J., Sun, Z. G., Zhang, J. Q. (2022). The optimal phenological phase of maize for yield prediction with high-frequency UAV remote sensing. Remote Sens. 14, 1559. doi: 10.3390/rs14071559

Yu, N., Li, L., Schmitz, N., Tian, L. F., Greenberg, J. A., Diers, B. W. (2016). Development of methods to improve soybean yield estimation and predict plant maturity with an unmanned aerial vehicle based platform. Remote Sens. Environ. 187, 91–101. doi: 10.1016/j.rse.2016.10.005

Zhang, M., Zhou, J., Sudduth, K. A., Kitchen, N. R. (2020). Estimation of maize yield and effects of variable-rate nitrogen application using UAV-based RGB imagery. Biosyst. Eng. 189, 24–35. doi: 10.1016/j.biosystemseng.2019.11.001

Zheng, H., Cheng, T., Zhou, M., Li, D., Yao, X., Tian, Y., et al. (2019). Improved estimation of rice aboveground biomass combining textural and spectral analysis of UAV imagery. Precis. Agric. 20, 611–629. doi: 10.1007/s11119-018-9600-7

Keywords: maize, yield prediction, UAV-based imagery, random forest, pre-regression classification

Citation: Ge H, Zhang Q, Shen M, Qin Y, Wang L and Yuan C (2025) Enhancing yield prediction in maize breeding using UAV-derived RGB imagery: a novel classification-integrated regression approach. Front. Plant Sci. 16:1511871. doi: 10.3389/fpls.2025.1511871

Received: 15 October 2024; Accepted: 04 March 2025;
Published: 20 March 2025.

Edited by:

Fernando Auat, Heriot-Watt University, United Kingdom

Reviewed by:

Yahui Guo, Central China Normal University, China
Parvathaneni Naga Srinivasu, Amrita Vishwa Vidyapeetham University, India
Ghulam Mustafa, Hohai University, China

Copyright © 2025 Ge, Zhang, Shen, Qin, Wang and Yuan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cansheng Yuan, cansheng_yuan@163.com
